Текст
                    Encyclopedic
Dictionary
of
Mathematics


Second Edition


by the
Mathematical Society
of Japan


edited by
Kiyosi Ito


Volume I
A-N


The MIT Press
Cambridge, Massachusetts,
and London, England





Encyclopedic Dictionary of Mathematics Second Edition by the Mathematical Society of Japan edited by Kiyosi Ito Volume II O-Z Appendices and Indexes The MIT Press Cambridge, Massachusetts, and London, England 
First MIT Press paperback edition, 1993 Originally published in Japanese in 1954 by Iwanami Shoten, Publishers, Tokyo, under the title /wanami Siigaku Ziten Copynght 'G 1954, revised and augmented edition (Q 1960, second edition  1968, third edition (Q 1985 by Nihon Sugakkai (Mathematical Society of Japan). English translation of the third edition (( 1987 by The Massachusetts Institute of Technology All rights reserved. No part of this book may be reproduced in any form or by any means, electronic or mechanical, including photocopying, recording, or by any information storage and retrieval system, without permission in writing from The MIT Press This book was set in Monolaser Times Roman by Asco Trade Typesetting Ltd , Hong Kong, and printed and bound by Arcata Graphics, Kingsport in the United States of America. Library of Congress Cataloging-in-Publication Data Iwanami siigaku jiten (ziten). English. Encyclopedic dictionary of mathematics Translation of Iwanami siigaku jiten. Includes bibliographies and indexes. 1. Mathematics-Dictionaries 1. Ito, Kiyosi, 1915- II Nihon Siigakkai III. Title. QA5.I8313 1986 510'.3'21 86-21092 ISBN 0-262-09026-0 (HB), 0-262-59020-4 (PB) 
Contents Forewords vii Prefaces ix Introduction XIX Encyclopedic Dictionary of Mathematics Appendix A. Tables of Formulas Appendix B. Numerical Tables Statistical Tables for Reference Numerical Tables for Reference Journals 1840 Publishers 1848 Special Notation 1850 Systematic List of Articles Alphabetical List of Articles Contributors 1867 Translators 1873 Name Index 1875 Subject Index 1917 1421 1820 1837 1838 1857 1863 
Foreword to the Second English Edition The Mathematical Society of Japan takes pleasure in presenting this second edition of our Encyclopedic Dictionary of Mathematics to every researcher and user of mathematics, It is intended to be a compact, up-to-date source of information comprising, as com- pletely as possible, all significant results in all fields of our Science, pure and applied, from the elementary to the advanced level The success of the first edition owed much to the kind assistance given by the American Mathe- matical Society, As described in the preface, the members of our Society have taken re- sponsibility for compiling this new edition. We hope that it will be as useful to the mathe- maticians of today as the first edition was to the mathematicians of yesterday, We also hope that this edition will be followed in years to come by subsequent ones incorporat- ing the future development of our Science. Hikosaburo Komatsu President 1985-1987 Mathematical Society of Japan Foreword to the First English Edition The American Mathematical Society welcomes the publication of the Encyclopedic Dictionary oj Mathematics. For many years we have been fascinated by the publication in Japanese, Iwanami Sugaku Ziten, because we saw that this was an encyclopedia that contained effec- tive and penetrating information about all the fields of advanced mathematical research We were also frustrated because we could not read Japanese and so we could not really reach out to this expert and effective source of information, We now welcome the fact that the second Japanese edition has been trans- lated into English and we look forward to the fascination which we can now have in getting at this rich mine of information. Saunders MacLane President 1973-1974 American Mathematical Society 
Preface to the Second English Edition This second edition of the Encyclopedic Dic- tionary of Mathematics is in substance an English version of the third edition of Iwanami Sugaku Ziten (in Japanese), We shall explain how these two versions are related to each other and how the present edition differs from the first English edition. For the sake of sim- plicity we abbreviate Encyclopedic Dictionary of Mathematics and /wanami Sugaku Ziten to EDM and ISZ, respectively, and indicate the numbers of editions beyond the first by Arabic numerals. The prefaces of the previous editions will clarify how ISZ, its augumented edition, ISZ2, ISZ3, and EDM came into existence in 1954, 1960,1968,1985, and 1977, respectively. EDM, published nine years later than ISZ2, consisted of its English translation and some new mate- rials added to update its contents. In retro- spect, it was felt that this procedure was not adequate to cope with the recent rapid pro- gress of mathematics, and a suggestion was raised in the Mathematical Society of Japan that ISZ3 and EDM2 be produced simulta- neously. The favorable reaction of the mathe- matical public to EDM encouraged us greatly. In 1978 an agreement was made among the Society, Iwanami Shoten, Publishers, and The MIT Press for the publication of ISZ3 and EDM2. An editorial committee was estab- lished in the Society with the members named in the preface to ISZ3. Manuscripts were pre- pared simultaneously in Japanese and English by each contributor, with Yuji Ito acting as linguistic consultant. I should mention that we benefited greatly from the kind comments on EDM by the following mathematicians: J. F. Adams, M, Atiyah, A. Borel, H. Cartan, K. Chandrase- kharan, S. S. Chern, J. Dieudonne, E. Hewitt, F. Hirzebruch, O. Lehto, J. L. Lions, L. Markus, and J. P. Serre. In particular, we are deeply grateful to J. Dieudonne for his many detailed comments. Compared with ISZ3 and EDM, EDM2 has the following characteristics: (1) EDM2 contains many more new mathe- matical results than EDM, For the details we refer the reader to the main points of revision mentioned in the preface to ISZ3, because EDM2 has the same mathematical content as ISZ3 and the additions made to ISZ2 and subsequently in EDM are of relatively minor importance. (2) The Japanese textbooks listed in the article references of ISZ3 are replaced in EDM2 by references to standard textbooks written in English, Such replacement was not done in compiling EDM, (3) Years of birth and death are included in the Name Index of EDM2 as far as possible. This information was given in ISZ2 but was removed in compiling EDM. (4) The Subject Index of EDM2 is so de- signed that every concept consisting of two or more words can be traced from each of these component words. (This principle was adopted in ISZ2 and EDM but was not rigidly fol- lowed in ISZ3 due to the lack of space.) (5) While all editions of ISZ were in one volume and EDM was in two volumes, EDM2 is in four volumes: 1. Forewords, prefaces, introduction, text A-E; II. Text F-N; III, Text O-Z; IV, Appendices, indexes and other backmatter. The Systematic List of Articles appears in volume IV, page 1857, While EDM2 is more voluminous than ISZ3, we hope that, being written in English and printed with generous margins, it can easily provide readers with information on every significant result of today's mathematics and so will be useful to a large number of mathematicians. I am responsible for any shortcomings that may exist despite all our efforts, and I would appreciate any remarks from the readers, I hope to have occasion to remedy any such shortcomings in the future. Finally I would like to repeat my thanks to all the collaborators named in my preface to ISZ3. Also I express my gratitude to Yuji Ito for his tremendous effort to polish the English of all manuscripts, to Shigeru litaka for his laborious assistance in the final stage of the work, to Mrs. M. Nawata for her excellent secretarial work, and to the staff of the De- partment of Mathematics at Gakushuin U ni- versity for providing me with an office for compilation for eight years. Kiyosi Ito December 1986 
Preface to the Third Japanese Edition The first edition of Iwanami Siigaku Ziten was published in 1954, and a revised and aug- mented edition appeared in 1960, Extensive revision of the work was carried out sub- sequently, and the result was published in 1968 as the second edition, which has retained its valuable and useful life for the past 17 years, The English translation of this second edition published by The MIT Press has achieved international recognition. In the meantime, remarkable progress has been made in mathe- matical sciences: recognition of the interrela- tionships amongst various branches within mathematics has increased significantly, and the formula tion of mathematics as a synthetic entity is in the making, Furthermore, advanced mathematical theories have been utilized fre- quently in the physical, biological, and social sciences, and expectations for mathematics to be the basis of all the sciences have been increasing. In order to cope with these devel- opments, it was decided that the second edition of Siigaku Ziten should be further revised and an updated version be published as the third edition, The English translation of this third edition will also be published by The MIT Press. The main points of the revision are as follows: (I) On the Size and Scope of the Encyclo- pedia: The total number of articles has in- creased only slightly from that of the second edition, from 436 to 450. However, in view of the rapid and extensive development in mathematics in recent years, many old articles have been reorganized and unified, and much new material has been added. Consequently, the mathematical content of the encyclopedia has increased a great deal over that of the second edition, and this, together with the authors' efforts to render the articles more readable than before, necessitated a 50% in- crease in the number of pages devoted to the main text. The account of computer science has been taken out of the subject area of Numerical Analysis, and a new subject area has been created for it; thus mathematics is now classified into 21 subject areas altogether instead of 20, (2) On the Arrangement of the Articles: In the second edition, the title of each article was spelled out in Romanized Japanese, and articles were arranged alphabetically, In the third edition, however, the titles of the articles are arranged in the order of the Japanese syl- labary (katakana) and each article is numbered accordingly. To each article a parenthetical number is also attached, giving the subject area to which the article belongs and its place in that list. (3) On the Text: Sections within each arti- cle are headed by letters A, B, C, . , . ; the ini- tial section of each article gives an outline and is devoted to an introductory and general account of the topic with which the article is concerned. In the actual exposition of the articles, every effort has been made to pay close attention to the interrelationships amongst various fields of ma thematics. Even when there was no need to revise the original article, changes were made in the bibliography to list items that would be more accessible to the readers. (4) On the Appendices: The appendices have been revised and augmented along with the text. Numerical tables were deleted when- ever it was felt that scientific calculators or microcomputers could easily reproduce their contents. (5) On the Indexes: The listing of mathe- matical terms in Japanese has been changed, as in the case of the arrangement of articles, from the alphabetical order to the order of the Japanese syllabary. However, terms in Japanese that start with a European word ha ve been separated out, and, in order to facilitate finding, have been listed indepen- dently in an Index for Mathematical Terms in Japanese with European Headings, The loca- tion of a term listed in the indexes is indicated by the article number and the letter heading of the section in the article where the term appears. We here outline briefly how the process of compilation of this edition was organized, In the summer of 1978 the Mathematical Society of Japan decided, following a proposal by Professors Sh6kichi Iyanaga and Yukiyosi Kawada, the chief editors of the previous editions of Siigaku Ziten, to undertake the compilation of the third edition, Following a resolution by the members of the Soci- ety, Kunihiko Kodaira, Sigeru Mizohata, Jtiro Tamura, Nagayoshi Iwahori, Tosihusa Kimura, and myself have joined Iyanaga and Kawada to form an editorial committee and have formulated basic guidelines for the com- pilation of the third edition. The work of editing and compiling articles in each subject area was delegated to the editors designated for the areas, and I was asked to undertake the responsibility of putting the entire volume in order, The subject areas and their editors are as follows: 
xi Logic and Foundations: Shoji Maehara Sets, General Topology, and Cate- gories: Shoji Maehara, Hikosaburo Komatsu, Masayoshi Nagata Algebra: Masayoshi Nagata Group Theory: Nagayoshi Iwahori Number Theory: Takayoshi Mitsui Euclidean and Projective Geometry: Itiro Tamura Differential Geometry: Morio Obata, Shigeru Iitaka, !tiro Tamura Algebraic Geometry: Shigeru Iitaka Topology: Itiro Tamura Analysis: Sigeru Mizohata, Kiyosi Ita Complex Analysis: Yukio Kusunoki, Shigeru Iitaka Functional Analysis: Hikosaburo Komatsu XIII Differential, Integral, and Functional Equations: Tosihusa Kimura, Sigeru Mizohata Special Functions: Sin Hitotumatu Numerical Analysis: Masaya Yamaguti Computer Science and Combinatorics: Sin Hitotumatu Probability Theory: Kiyosi Ita Statistics: Kei Takeuchi Mathematical Programming and Operations Research: Shigeru Furuya Mechanics and Theoretical Physics: Huzihiro Araki XXI History of Mathematics: Shakichi Iyanaga Appendices: Sin Hitotumatu Before the actual process of compilation started, the editorial committee had met sev- eral times, and two meetings of all the subject- area editors had been called. Furthermore, editors for each area consulted with other experts in their fields as they selected the titles of articles to be included. In areas where remarkable development had taken place, such as differential geometry, functional analysis, probability theory, and applied mathemat- ics, extensive revision and insertion of many new arttcles were proposed by the respective editors. For this reason, the original articles were thoroughly reorganized and systematized. We resolved to keep the whole work in one volume, even though a number of new articles have had to be added. The selection of articles and of their re- spective authors was finally completed in the spring of 1980, and 197 colleagues were asked to write the articles. The names of these con- tributors and those of the previous editions are listed elsewhere in this volume. To all of the authors we express our sincere gratitude. Editing of the entire manuscript has been carried out since the summer of 1982, and we are deeply indebted to the following colleagues for their painstaking efforts in checking cross I II III IV V VI VII VIII IX X XI XII XIV XV XVI XVII XVIII XIX XX Preface to the Third Japanese Edition references and in proofreading: Nobuyosi Moto- hashi (Foundations, Set Theory), Takeo Y oko- numa (Algebra, Group Theory), Takayoshi Mitsui (Number Theory), Tetsuro Kawasaki (Geometry, Topology), Shigeru Iitaka, Isao Wakabayashi (Algebraic Geometry), Morio Obata, Koichi Ogiue, Osamu Kobayashi (Dif- ferential Geometry). Seizo Ito. Hisashi Oka- moto (Analysis), Hiroaki Aikawa, Makoto Ohtsuka (Computer Analysis), Hikosaburo Komatsu, Akihiko Miyachi (Functional Analysis), Kazuo Okamoto, Daisuke Fujiwara (Differential Equations), Sin Hitotumatu (Special Functions), Teruo Ushljlma (Nu- merical Analysis), Hideo Wad a (Computer Science), Yasunori Okabe (Probability Theory), Mituaki Huzii, Yoshihiro Yajima (Statistics), Shigeru Furuya (Mathematical Programming), Koichi Nakamura (Theoretical Physics), ShGichi Okamoto (History of Mathematics), Kosaku Okutsu (History of Mathematics, Number Theory). On those occasions when it became neces- sary to rapidly revise manuscripts in order to unify the presentation, we were forced to go through the revision without consulting the authors of the man uscripts. I am responsible for all such revisions and hereby express my apologies to the authors concerned. As for the indexes, we received assistance from Takeo Yokonuma, Koichi Yano, Hiroaki Aikawa, and Hisashi Okamoto in aII aspects of the work, which lasted for two years, through the final proofreading. Preliminary work on the Name Index was carried out by staff members of the University of Tokyo, and Seiza Ito supervised its final compilation. We also asked many other colleagues, and in par- ticular Nobuyuki Ikeda and Huzihiro Araki, to participate in the preparation of the Name Index, using resources available at different universities, To all of these people goes our sincere gratitude for their assistance, Last August, when the completion of the work was drawing near, it became necessary for me to leave Japan for some time, and this made it imperative that I entrust Shigeru litaka with responsibility for supervising the work at the final stage of editing, I am most grateful to Iitaka for having agreed to take over this responsibility and for having brought the work to completion under the pressure of time. From the beginning of this project, we have received an unlimited amount of assistance from the members of the Dictionary Depart- ment of Iwanami Shoten, Publishers. Messrs, Ikutaro Sasaki, Hirotomo Ushida, Kazuhiko Uetake, and Nagao Sato in particular, have made supreme efforts and have come up fre- 
Preface to the Third Japanese Edition quently with innovative ideas to make this dictionary polished and perfect. To them and also to those at Dai-Nippon Printing Co., who typeset with the use of computers the entire text, including the complicated mathematical formulas, and who have cooperated with us fully in the process of editing the indexes, goes our deep gratitude. Klyosi It6 October 1985 xii Preface to the First English Edition The first and second editions of Iwanami Sugaku Ziten (in Japanese) were published, respectively, in April 1954 and June 1968 by Iwanami Shoten, Publishers, Tokyo. Begin- ning in the late 1950s, a number of unsuccess- ful attempts were made to arrange for translat- ing the Sugaku Ziten into European languages. Finally an agreement for an English transla- tion was made between The MIT Press and the Mathematical Society of Japan in July 1968. The discussions were carried on first by Professor Kosaku Y osida, then presi- dent of the Mathematical Society of Japan, and later by Yukiyosi Kawada, who suc- ceeded him in April 1968. Throughout these initial negotiations, which lasted from 1966 to 1968, we received the kindest assistance from Dr. Gordon Walker, Executive Director of the American Mathematical Society, and from Professors W. T. Martin and Shizuo Kakutani. The agreement for the project was shortly followed by the establishment of a commit- tee for the English edition of Siigaku Ziten within the Mathematical Society of Japan, with the following membership: Professors Yasuo Akizuki, Shigeru Furuya, Sin Hito- tumatu, Masuo Hukuhara, Isao Imai, Sh6- kichi Iyanaga, Yukiyosi Kawada, Kunihiko Kodaira, Atuo Komatu, Hirokichi Kudo, Shoji Maehara, Yukio Mimura, Kiyoshi Noshiro, Shigeo Sasaki, Shoji Ura, Nobuo Yoneda, and Kosaku Yosida, This committee requested the original authors of the articles and other mcmbers of the Society to translate the work. A list of translators will be found at the end of this work. In November 1968, an advisory committee for the project was formed with the following membership: Professor Edwin Hewitt (chair- man), Dr. Sydney H. Gould, Professor Shizuo Kakutani, Professor Kenneth O. May, and Professor Isaac Namioka. As the translating began, we were immedi- ately faced with problems concerning unifica- tion of terminology and style, some of which were inherent in the differences between the structures of our two languages-for example, the fact that the Japanese language makes no distinction between singular and plural forms of nouns. In August 1969, Professor Hewitt kindly arranged a meeting at the University of Wash- ington, Seattle, that included the members of the Japanese and American committees and a representative of The MIT Press. It was agreed during this meeting that the transla- 
xiii tion should be faithful, with only a minimum number of changes, such as the correction of mathematical errors; whereas the references to each article might be augmented consider- ably for the convenience of Western readers. Professor Kenneth 0, May volunteered to review the entire translated manuscript, and Professors Isaac N amioka and Shizuo Kak u tani, who are proficient in both Japanese and English, proposed to read through some of the manuscript of the translated articles. It was also agreed that the Systematic List of Arti- cles should appear in French, German, and Russian, as well as in English. We owe very much to the American com- mittee: Professor Hewitt organized the whole work, and Professor May revised the entire manuscript and gave us important advice con- cerning the appendices, according to which we deleted some of the numerical tables which may be easily found in readily accessible West- ern books, Professor Namioka reviewed a great part of the man uscript, transmitting his views to Professor May, who forwarded them to us with his comments. All of this assistance helped us greatly in making our final decisions. Professor Kakutani gave us very detailed and important advice on the choice of reference works. We were also assisted concerning English terminology and reference books by the fol- lowing Japanese mathematicians working in American universities: Tadatoshi Akiba, Pro- fessors Kiyosi Ito, Tatsuji Kambayashi, Tosio Kato, Teruhisa Matsusaka, Katsumi Nomizu, Ichiro Satake, Michio Suzuki, and Gaisi Takeuti. The Mathematical Society of Japan estab- lished the following double reviewing system: group A with its twenty subgroups, each headed by the members listed below, reviewed their respective subjects; while group B re- viewed the whole manuscript, mainly from the linguistic standpoint. Group A I, Foundations of mathematics: Shoji Maehara 2, Set theory: Atuo Komatu and Shoji Maehara 3. Algebra: Akira Hattori, Masayoshi Nagata, and Hideyuki Matsumura 4, Group theory: Shingo Murakami, Mitsuo Sugiura, and Reiji Takahashi 5. Number theory: Yukiyosi Kawada and Tomio Kubota 6, Geometry, 7. Differential geometry: Sho- kichi Iyanaga, Shigeo Sasaki, and Kentaro Yano 8. Algebraic geometry: Yasuo Akizuki and Kunihiko Kodaira 9. Topology: Atuo Komatu 10. Real analysis: Sin Hitotumatu, Shunji Kametani, and ShigekJ Yano Preface to the First English Edition II. Complex analysis: Kiyoshi Noshiro 12. Functional analysis: Yukio Mimura and Kosaku Yosida 13. Differential equations: Masuo Hukuhara and Sigeru Mizohata 14. Special functions: Sin Hitotumatu 15, Numerical analysis, 16. Probability theory: Kiyosi Ito 17. Statistics: Hirokichi Kudo 18. Information theory: Tosio Kitagawa and Hirofumi Uzawa 19. Theoretical physics: Isao Imai and Kazuhiko Nishijima 20. History of mathematics: Tamotsu Murata Group B Kenichi Iyanaga and Mitsuyo Iyanaga Professor Sin Hitotumatu also assisted us in translating the titles of Japanese books given in the references and the explanations attached to the lists of formulas and numerical tables in the appendices. We are also grateful for the generous cooperation offered to us by our colleagues in the Department of Mathemat- ics, Faculty of Science, University of Tokyo: Hiroshi FUjita, Shigeru Furuya, Akio Hat- tori, Seizo Ito, Nagayoshi Iwahori, Tosihusa Kimura, Kunihiko Kodaira, Hikosaburo Komatsu, Akihiro Nozaki, and Itiro Tamura. We are indebted as well to Professors Walter L. Bailey of the University of Chicago and Yuji Ito of Brown University for many valu- able consultations concerning both mathe- matical and linguistic questions. In translating the Systematic List of Articles into French, German, and Russian, we were assisted by Professor Hideya Matsumoto in Paris, Professor Emanuel Sperner in Hamburg, Professor Katsuhiro Chiba in Tokyo, and Professor Arkadii Maltsev in Moscow. We began to send the manuscript to The MIT Press in March 1970 and finished send- ing it in July 1972, The manuscript was edited there, then sent to Professors May, Namioka, and Kakutani, and finally was sent back to us with their comments and questions. All of the references were carefully checked by Laura Platt. Iwanami Shoten, Publishers, have always been cooperative with us. In the office of the Mathematical Society of Japan, Yoko Endo, Reiko Nagase, and Chieko Sagawa helped us with their efficient secretarial work. The fruition of this project was made pos- sible only by the gracious assistance offered to us by many people, including those already mentioned. We should like to express our most sincere gratitude to all those who have helped us so kindly. Shokichi Iyanaga, Yukiyosi Kawada August 1973 
Preface to the First English Edition Addition made in January 1976 After the procedures described above, the whole manuscript of this Encyclopedia was sent to The MIT Press in August 1973. It was, however, toward the end of November 1975 that the final decision was made by The MIT Press to send the manuscript to composition in early 1976 in order that the work be pub- lished in 1977. At the same time, we were asked to review and update the manuscript up to the end of February 1976. We are now making our best effort to this effect with the kind help, especially from the linguistic viewpoint, of Dr. E. J. Brody. In so doing, we have noticed that perhaps too much emphasis has been given to results obtained by Japanese mathematicians and that there are still many things in this book which should be improved. We hope that ongoing revisions will be carried out in subsequent editions, xiv Preface to the Second Japanese Edition Seven and a half years have passed since the revised and augmented edition of Iwanami Sugaku Ziten was published. The nature and purpose of this book remain the same as de- scribed in the preface to the first edition: It is an encyclopedic dictionary with articles of medium length aimed at presenting the whole of mathematics in a I ucid system, giving exact definitions of important terms in both pure and applied mathematics, and describing the present state of research in each field, together with historical background and some per- spectives for the future, However, mathemat- ical science is in rapid motion, and the "pre- sent state of research" changes constantly. The present updated second edition has been published to remedy this situation as far as possible. The main points of revision are as follows: (I) On the Articles and the Size of the Ency- clopedia: From the articles of the last edition, we have removed those whose Importance has diminished recently (e.g., Geometry of Tri- angles), while we have added new articles in domains of growing importance (e.g., Cate- gories and Functors; K - Theory). Many articles concerning applied mathematics in the first edition were short; in this edition, they have been combined into articles of medium length to save space and to systematize the presenta- tion, The number of articles, 593 in the first edition, has thus diminished to 436. We have made every effort to keep the size of the ency- clopedia as it was, but the substantial aug- mentation of content has necessarily brought about an enlargement of about 30%. (2) On the Text: When the title of an article has remained the same as in the first edition, we have reviewed the whole text and revised whenever necessary. Especially for the fun- damental ideas, we have endeavored to give thorough explanations. In the first edition we gave English, French, and German transla- tions of article titles; in the present edition, we have also added Russian. The bibliographies at the ends of articles have been updated. (3) On Terminology: In previous editions we endeavored to unify the terminology of the whole encyclopedia so that the reader would have no difficulties with cross-references, Here wc have done this once again with the hope of attaining results more perfect than before, (4) On the Appendices: The appendices were designed to supplement the text efficiently, Some overlapping of the appendices with the text found in previous editions has been re- moved. Also deleted in this edition are elemen- 
xv tary formulas in analytic geometry and tables of statistical distributions, which can be easily found in other books, However, we have added some formulas in topology, the theory of probability, and statistics, as well as tables of characters of finite groups, etc. (5) On the Indexes: Important terms are listed multiply in the indexes to facilitate find- ing, e.g., the term transcendental singularity appears under both transcendental and singu- larity. Both names in the text and those in the references are included in the Name Index of this edition. The numbers of items in the Index of Mathematical Terms in Japanese and in European Languages and in the Name Index in this edition are 17740, 10124, and 2438, com- pared with 8254, 8070, and 1279, respectively, in the previous edition. The compilation of this edition was organized as follows. In the spring of 1964 we began to select the titles of articles with the aid of the following colleagues, in set theory and foun- datIOns of mathematics, Shoji Maehara; in algebra and number theory, Yasuo Akizuki, Yukiyosi Kawada; in differential geometry, theory of Lie groups, and topology, Y ozo Matsushima, Atuo Komatu; in analysis, Masuo Hukuhara, Kosaku Yo sid a, Shunji Kametani, Sin Hitotumatu; in probability theory, statistics, and mathematics for pro- gramming, Kiyosi Ita, Hirokichi Kudo, and Shigeru Furuya; in theoretical physics, Isao Imai; and for the appendices, Sin Hitotumatu. I have participated in compiling the articles on geometry and the history of mathematics. Kawada and Hitotumatu undertook the re- sponsibility of putting the volume in order, The work of selecting titles was completed in the summer of 1964. We then asked 173 colleagues to contribute articles. The names of these contributors and those of the previous editions are listed elsewhere. To all of them goes our most sincere gratitude. In editing the manuscript, we were assisted by the following colleagues in addition to those mentioned already: in set theory and foundations of mathematics, Setsuya Seki and Tsurane Iwamura; in algebra and number theory, Masayosi Nagata, Akira Hattori, Hideyuki Matsumura, Ichiro Satake, Tikao Tatuzawa; in geometry, theory of Lie groups, and topology, Singo Murakami, Hideki Ozeki, Noboru Tanaka, Kiiti Morita, Hirosi Toda, Minoru Nakaoka, Masahiro Sugawara, Shara Araki; in analysis, Kiyoshi Noshiro, Yusaku Komatu, Seizo \to, Hiroshi Fujita, Shige- Toshi Kuroda, Sigeru Mizohata, Masaya Yamaguti, Tosiya Saito, Tosihusa Kimura, Masahiro Iwano; in probability theory, sta- tistics, and mathematics of programming, Preface to the Second Japanese Edition Nobuyuki Ikeda, Tadashi Ueno, Masashi Okamoto, Haruki Morimoto, Kei Takeuchi, Goro Ishii, Tokitake Kusama, Hukukane Nikaido, Toshio Kitagawa; and in theoretical physics, Ryogo Kubo, Hironari Miyazawa, Y oshihide Kozai. After the summer of 1965, we entered into the period of finer technical editing, in which we were assisted by the following colleagues: in algebra, Keijiro Yamazaki, Shin-ichiro Ihara, Takeshi Kondo; in geometry, Tadashi Nagano, Mitsuo Sugiura, Ichiro Tamura, Kiyoshi Katase; in analysis, Nobuyuki Suita, Kotaro Oikawa, Kenkiti Kasahara, Tosinobu Muramatsu, Hikosaburo Komatsu, Setuzo Yosida, Hiroshi Tanaka; and in history, Tamotsu Murata. We were also assisted by Katsuhiro Chiba in the Russian-language translation of the article titles, and by Osamu Kota and Kiyoshi Katase in the indexing. Proof sheets began appeanng in the spring of 1966. I n proofreading, Kaoru Sekino, Osamu Kota, Kiyoshi Katase, and Teruo Ushijima helped us, as well as Mrs. Rieko Fujisaki. Miss Yoko Endo worked with all of us throughout the entire revision at the office of the Mathematical Society of Japan. She helped us especially in looking for and check- ing references and preparing the Name Index, Yukiyosi Kawada supervised the whole work, succeeding me in the role I had played in the compilation of the first and augmented editions, Sin Hitotumatu collaborated with him throughout, especially on the appendices. The second and the third proof sheets of the text were read by Kawada; the fourth proof by H itotumatu; the proof sheets of appendices by both Kawada and Hitotumatu. The Editorial Committee of the Mathe- matical Society of Japan asked me to write this preface. Having edited the previous edi- tions and realizing full well the difficulty of the task, I would like to express my particular gratitude to Kawada. In the Dictionary De- partment of Iwanami Shoten, Publishers, Messrs, Hiroshi Horie, Tetsuo Misaka, Shi- geki Kobayashi, and Toshio Kouda were very cooperative in their collaboration with us. To them and also to those who typeset and printed this book at Dai-Nippon Printing Co. and Shaken Co. goes our gratitude S Iyanaga March 1968 
Preface to the Revised and Augmented Japanese Edition Six years have passed since the publication of the first edition of this encyclopedia. This re- vised and augmented edition incorporates the achievement of these years. It contains, to- gether with the correction of errors found in the first edition, some new articles such as Abelian Varieties, Automata, Sheaves, Homo- logical Algebra, Information Theory, and also supplements to articles in the first edition such as Complex Multiplication, Computers, and Manifolds. These additional items render the new edition 93 pages longer than the previous one. Each article has been thoroughly re- vised, and the indexes have been completely rewritten. We were assisted by the following colleagues in selecting articles, writing, and proofreading: In set theory and foundations of mathematics, Sigekatu Kuroda, Setsuya Seki; in algebra and number theory, Tadao Tannaka, Tsuneo Tamagawa: in real analysis, Shunji Kametani, K6saku Y osida; in function theory, Kiyoshi Noshiro, Sin Hitotumatu; in theory of dif- ferential and functional equations, Masuo Hukuhara, Masahiro Iwano, Ken Yamanaka; in functional analysis, K6saku Yosida, Seizo It6; in geometry, Shigeo Sasaki, Nagayoshi Iwahori; in topology, Atuo Komatu, Jtiro Tamura, Nobuo Yoneda; in theory of proba- bility, Kiyosi It6, Seiza Ita; in statistics, Toshio Kitagawa, Sigciti Moriguti, Tatsuo Kawata; in applied mathematics, Sigeiti Moriguti; and in mechanics and theoretical physics, Taka- hiko Yamanouchi, Isao Imai. The revision and augmentation of the articles concerning the history of mathematics was done by myself. Miles. Y6ko Tao, Eiko Miyagawa, and Mutsuko Nogami worked in the office of the Society. The names of authors who contributed to the completion of this edition have been added to the original list of contributors, The project of editing this edition started in the summer of 1958. We acknowledge our deep gratitude to all those who have col- laborated with us since that time. S. Iyanaga August 1960 Preface to the First Japanese Edition This encyclopedia, Iwanami Sugaku Ziten, was compiled by the Mathematical Society of Japan at the request of the Iwanami Shoten, Publishers, who have hitherto published a series of scientific dictionaries such as Iwanami Rikagaku-Ziten (Iwanami Encyclopedia of Physics and Chemistry) and Iwanami Tetugaku- Ziten (Iwanami Encyclopedia of Philosophy). As mentioned in the prefaces to these volumes, the importance of such encyclopedias in clari- fying the present state of each science is obvious if we observe the rapid pace of contemporary research. Mathematics is also in rapid motion. As a fundamental part of exact science, it serves as a basis of all science and technology. It also retains its close contact with philo- sophy, Therefore, the significance of having an encyclopedia of mathematics cannot be overem phasized. Mathematics have made remarkable pro- gress in the 20th century. As for the situation toward the end of the 19th century, we quote the following passage from the article Mathe- matics in the 19th Century of this encyclo- pedia: "Toward the end of [the 19thJ century, the subjects of mathematical research became highly differentiated, Branches split into more specialized areas of study, while unexpected relations were found between previously unconnected fields. The situation became so complicated that it was difficult to view mathematics as a whole. It was in these cir- cumstances that in 1898, at the suggestion of Franz Meyer and under the sponsorship of the Academies of Gottingen, Berlin, and Vienna, a project was initiated to compile an encyclo- pedia of the mathematical sciences. Entitled the Enzyklopiidie der mathematischen Wissen- schaften, it was completed in 20 years.... " One of the characteristics of 20th-century mathematics is the conscious utilization of the axiomatic method and of general concepts such as sets and mappings, which serve as foundations of different theories. Indeed, mathematics is being reorganized on the basIs of topology and algebra. One such example of reorganization is found in Bourbaki's Elements de mathematique; some fifteen volumes of this series have been published since 1939, and more are coming. This encyclopedia, with its limited size, cannot contain proofs for theorems. However, we intend to present a lucid view of the totality of mathematics, including its historical background and future possibilities. Each article of this encyclopedia is of me- dium length-sufficiently short to permit 
xvii the reader to find exact definitions of notions, and sufficiently long to contain explanations clarifying how important concepts in the same field are related to each other. The problem of choosing adequate titles required some de- liberation. The Systematic List of Articles, classified according to specific fields, shows those we have chosen. The Index of Terms contains detailed references for each notion. The appendices, including formulas and tables, supplement the text, and will be particularly useful for applied mathematicians, The project of compiling this encyclopedia was proposed in the spring of 1947 by the Steering Committee of the Mathematical Society of Japan. It was promptly adopted, and the selection of articles in specific fields was started by the sectional committees of the Society. After seven years, our encyclopedia is finaIly appearing. We shaIl not give a de- tailed description of how our work proceeded through all these years. We list simply the names of those who assisted us greatly and to whom we should like to express our deep gratitude. The president of the Society at the start of this project was the late Professor Tadahiko Kubota; but our work has been supported also by Professors Teiji Takagi, Zyoiti Suetuna, and Masatsugu Tsuji as weIl as by other leading members of the Society. At the stage of selecting articles, we were assisted by the foIlowing colleagues: in history and the foundations of mathematics, Sige- katu Kuroda, Motokiti Kondo; in algebra and number theory, Kenjiro Shoda, Tadasi Nakayama, Masao Sugawara, Yukiyosi Kawada, Kenkichi Iwasawa; in geometry, Kentaro Yano, Asajiro Ichida; in function theory, Kiyoshi Noshiro, Yusaku Komatu; in the theory of differential and functional equa- tions, Masuo Hukuhara, Shigeru Furuya; in topology, Atuo Komatu, Ryoji Shizuma; in functional analysis, Yukio Mimura, Shizuo Kakutani, Kasaku Y osida; in the theory of probability and statistics, Tatsuo Kawata, Toshio Kitagawa, Junjiro Ogawa; and in applied mathematics, Ayao Amemiya, Isao Imai, Kunihiko Kodaira, Shigeiti Moriguti. We asked 190 colleagues to contribute arti- cles, which were coIlected in 1949. Since then we have spent an unexpectedly long time edit- ing them. Terminology had to be unified throughout the encyclopedia so that the reader would have no trouble with cross-references. Repetitions had to be eliminated and gaps fiIled. Part of the manuscript thus had to be rewritten a number of times, We have made out utmost effort in this editing work, but we are not completely without apprehension that our result has still left something to be desired. Preface to the First Japanese Edition For any shortcomings in the work, I take complete responsibility, as I have acted as the editor-in-chief. Also, since we have rewritten the manuscript, as already mentioned, we have refrained from printing the name of the original author of each article; for this, I must request the understanding of the contributors, In the stage of editing and proofreading, we were assisted by the following colleagues: Yukio Mimura, Yukiyosi Kawada, Kazuo Matsuzaka, Sin Hitotumatu, Setsuo Fuku- tomi, Setsuya Seki, Shoji Irie, Shigeo Sasaki, Tatsuo Kawata, Sigekatu Kuroda, Yusaku Ko- matu, Ayao Amemiya, Isao Iami, Tosio Kato, Tsurane Iwamura, Morikuni Goto, Kosaku Yosida, Jira Tamura, Yasuo Akizuki, Kiyoshi Noshiro, Motosaburo Masuyama, Sigeiti Moriguti, Osamu Kata, Nobuo Yoneda, Tsuneo Tamagawa, Jun-ichi Hano; and more particularly, in the foundations of mathematics, Sigekatu Kuroda, Tsurane Iwamura; in alge- bra and number theory, Kazuo Matsuzaka, Yukiyosi Kawada; in algebraic geometry, Yakuo Akizuki; in real analysis, Tatsuo Ka- wata; in complex analysis, Yusaku Komatu, Sin Hitotumatu, Jira Tamura; in functional analysis, Kasaku Y osida; in topology, Setsuo Fukutomi, Nobuo Yoneda; in the theory of probability and statistics, Motosaburo Masu- yama, Sigeiti Moriguti; and in applied mathe- matics, Ayao Amemiya, Isao Imai, Tosio Kato, Sigeiti Moriguti. The portraits of Abel and Riemann were kindly loaned to us by Torataro Shim omura. The formulas in the appendices were com- piled by Isao Imai, Sin Hitotumatu, and Sigeiti Moriguti; the Subject Index (in Japanese and European languages) by Osamu Kata and Mrs. Hiroko Ide; the Name Index and Com- ments on Journals and Serials by Setsuo Fukutomi. Setsuo Fukutomi has taken an active part in Our work ever since 1948 and given much effort to collecting and rewriting the manuscript and to unifying terminology. The editorial staff of Iwanami Shoten, Pub- lishers, has always been cooperative. Without their generous support, this encyclopedia could never have been published. I should like to express my sincere gratitude to all those who have coIlaborated with us directly or indirectly. S.Iyanaga March 1954 
Introduction The Cyrillic alphabet is transliterated as follows: The text of this Encyclopedic Dictionary con- Cyrillic sists of 450 articles arranged alphabetically, beginning with 1 Abel and ending with 450 Alphabet Transliteration Zeta Functions, Most of these articles are A a (a) a divided into sections, indicated by A, B, C, "', I> 6 (be) b AA, BB, ." . Cross-references to articles, e.g., B B (ve) v to the second article, are of the form: ( 2 r r (ge) g Abelian Groups) or ( 2 Abelian Groups A), A JJ (de) d according as the whole article or a specific E e (ye) e section is being referred to. CitatiOns in the E e (yo) e indexes are also given in terms of article and )l{ )!( (zhe) zh section numbers. 3 3 (ze) z Key term.l accompanied by their definitions VI II (i) in the text are printed in boldface, All of these H ii (i kratkoe) terms are found in the Subject Index at the K K (ka) k end of the volume, J1 J1 (el') I The sign t means that the term preceded by M M (em) m it can be found in the index. A list of special H H (en) n notations used throughout the work (with 0 0 (0) 0 explanations of their meanings) appears after TI n (pe) p the appendices, P p (er) r A Systematic List of Articles, showing the C c (es) s general structure of the work, will be found on T T (te) p, 1857 (vol. IV). The number in parentheses Y y (u) u after each article title refers to this systematic <D rP (ef) f classification into subject areas; e.g., "Abelian X x (kha) kh Varieties (VIII.5)" means that the article on U L\ (tse) ts Abelian varieties is the fifth article in Section 4 'I (che) ch V111 of the systematic list. III ill (sha) sh Books and articles in journals are cited in the III 1l.\ (shcha) shch text by numbers in brackets: [1], [2J,.... At D b (tvcrdyi znak) the end of each article there is a section of bI bl (yery) y references in which, for books, the name of the (b) b (myagkii znak) author or authors, title, name of the pu blisher, '3 ') (e) e year of publication, and the number of the IO 10 (yu) yu edition are given; for journals, the name of the 51 51 (ya) ya author, title of the article, name of the Journal, and the volume numbers and inclusive page numbers are given in this order, (The names of journals and publihers are abbreviated as indicated in the lists at the end of the work.) 
1 Abel, Niels Henrik 1 (XXI.12) Abel, Niels Henrik Niels Henrik Abel (August 5, 1802-ApriI6, 1829) was born a son of a poor pastor in the hamlet of Findo in Norway. In 1822, he en- tered the University of Christiania; however, he studied mathematics almost entirely on his own, He was recognized as a promising stu- dent by his senior, Holmboe, and after gradu- ation he studied abroad in Berlin and Paris. In Berlin he met and was aided by A. Crelle, the founder of the Journalfur die Reine und Angewandte Mathematik, and participated in the founding of this journal. Although he did brilliant work in Paris, he did not gain the fame he deserved. He returned to Norway in May 1827, but, unable to find ajob, he was obliged to fight poverty while continuing his research. He died at twenty-six of tuberculosis. His best-known works are the result that algebraic equations of order five or above cannot generally be solved algebraically, the result that tAbelian equations can be solved algebraically, the theory of tbinomial series and of series in general, the theory of tellip- tic functions and more generally of talge- braic functions, the introduction oft Abelian integrals, and the establishment of t Abel's theorem. His work in both algebra and analy- sis, written in a style conducive to easy com- prehension, reached the highest level of attain- ment of his time. References [1] N. H. Abel, Oeuvres completes I, II, L. Sylow and S. Lie (eds.), Grondahl & Son, new edition, 1881 (Johnson, n.d.). [2] C. A. Bjerknes, Niels Henrik Abel, Gauthier- VilIars, 1885. [3J F. Klein, Vorlesungen iiber die Entwick- lung der Mathematik im 19, Iahrhundert I, Springer, 1926 (Chelsea, 1956). 2 (IV.2) Abelian Groups A. General Remarks A tgroup G is called an Abelian group (or commutative group) if G satisfies the commuta- tive law ab=ba for all a, hEG. In this article, G always denotes an Abelian group. Every tsub- group of G is a tnormal subgroup, and all the elements of finite order in G form a subgroup 2 T, for which the tfactor group G/T has no elements of finite order except the identity e. T is called the (maximal) torsion subgroup of G, If G = T, then the Abelian group G is called a torsion group (or periodic group). On the other hand, if T = {e}, then G is called torsion-free; if T # G, T # {e}, then G is called mixed. If the order of every element of a torsion group G is a power of a fixed prime number p, then Gis called an Abelian p-group (or primary Abelian group). An Abelian torsIOn group IS the tdirect sum of primary Abelian groups. Thus the study of torsion groups is reduced to that of primary Abelian groups. B. Finite Abelian Groups The following fundamental theorem on finite Abelian groups was cstablishcd by L. Kro- necker, G. Frobenius, and L. Stickelberger in the 1870s. An Abelian group G of order pn, where p is a prime number, is a direct product oftcyclic subgroups Zl, ...,Z,:G=ZI x... x Z,. If Zi is of order pn" then n = n I + ." + n" and we can assume that n;  ni+l' A direct product decomposition of G, as above, is not unique, but nl"'" n, are determined uniquely by G, The system {pn1, ""pn,} or {nl' ".,n,} is called the system of invariants (or type) of G, and a system of tgenerators {z l' '" ,z,} of Zl, ,.., Z, is called a basis of G. An Abelian group of type (p, p, .,., p) is calIed an elemen- tary Abelian group. The decomposition of a finite Abelian group into a tdirect sum of subsets (not necessarily of subgroups) was considered by G. Haj6s (1942) and applied successfully to a problem of number theory ( 151 Finite Groups), C. Finitely Generated Abelian Groups The theory of tfinite1y generated Abelian groups, i,e., Abelian groups generated by a finite number of elements, is as old as that of finite Abelian groups, The direct product of tin finite cyclic groups is called a free Abelian group. A finitely generated Abelian group G is the direct product of a finite Abelian group and a free Abelian group. The finite factor is the torsion subgroup of G. The free factor of the group G is not necessarily unique; however, the number of infinite cyclic factors of the free factor is uniquely determined and is calIed the rank of G. Two finitely generated Abelian groups are isomorphic if they have isomorphic maximal torsion subgroups and the same rank. This theory can be extended to the theory of tmodules over a tprincipal ideal domain ( 67 Commutative Rings K). 
3 D. Torsion Groups The structure of Abelian p-groups is relatively well known, compared with other infinitely generated Abelian torsion groups, In the 1920s, H. Priifer made the first important contribution to the study of Abelian p-groups, and H. Ulm and L. Zippin completed the theory for countable groups in the 1930s. The uncountable case was first treated by L. Kuli- kov in the 1940s, but the study of this case is still in progress. An Abelian p-group G # {e} is called divis- ible (or complete) if for any a E G there is an element x E G satisfying x P = a. A divisible group is a tdirect sum of Abelian groups of type p'" (Priifer), Here a group of type prO is isomorphic to the tmultiplicative group of all the pnth roots of unity (n = 1,2, .,. ) in the complex number field. Let G be any Abelian p-group, The maximal divisible subgroup V of G is a direct product factor of G: G = V x R, where R has no divisible subgroups. An Abelian p-group without a divisible sub- group is caIled a reduced Abelian group. An element x of an Abelian p-group G is said to have infinite height iffor any n there is an element YnE G satisfying x = yn. The ele- ments of infinite height form a subgroup G 1 of G. If G I = {e} and G is countable, then G is decomposed uniquely into the direct sum of cyclic groups. This assertion fails if the hy- pothesis of countability is dropped. By ttrans- finite induction we can define GP as follows. If fJ is an tisolated ordinal number, then G P = (GP-l)l; if fJ is a tlimit ordinal number, then GP == n,<p G'. For the least ordinal number r such that G' = G Hl , G' is the maximal divis- ible subgroup of G. If G is reduced, then G' = {e}. We call r the type of an Abelian p-group G. For ex < r, G' = G'/Ga+l is called the Ulm factor of G, and the sequence GO, ..., G', .., (ex < r) is called the sequence of Ulm factors of G. Each Ulm factor G' has no element of infinite height, and if ex < r - 1, then G' has an element of arbitrarily large order. Let r be a countable ordinal number; assume that for any ordinal number ex <r there is given a countable Abelian p-group A. such that A, has no element of infinite height, and that for ex # r - 1 A, has an element of arbitrarily large order. Then there is a reduced coun- table Abelian p-group which is of type r with a sequence of Ulm factors isomorphic to Ao, A"... ,A.,... (ex<r) (Zippin). Two reduced countable Abelian p-groups A and B are iso- morphic if they have the same type r; and for any ex < r the Ulm factors A' and B' are iso- morphic. The assertion fails if the hypothesis of countability is dropped. 2E Abelian Groups E. Torsion-Free Groups In Abelian groups, the group operation is often written a + b, using the additive notation; an additively written group, called an additive group, is generally assumed to be Abelian. In the rest of this article we consider exclusively additive Abelian groups, of which the additive group Z of rational integers is the most primi- tive example. In such a group the identity element is called the zero element and is de- noted by 0; the inverse of a is denoted by - a, and we write a+( -b)=a-b. The direct sum of additive groups A, (AEA) is called a free additive group if each A, is isomorphic to Z. An additive group G is regarded as a tmodule over the tring Z, to which the notion of linear independence is applicable ( 277 Modules). Elements aI, .,., a, of G are linearly dependent if there are integers n 1, ." , n, not all of which are zero such that n 1 a 1 + ." + n,a, = O. Those that are not linearly dependent are termed linearly independent. An infinite set of elements of G is called linearly independent if the ele- ments of any finite subset are linearly inde- pendent. If there are N elements of G that are linearly independent, but if any N + 1 elements of G are linearly dependent, then N is called the rank of G. Such a system of N linearly independent elements is called a maximal independent system. A torsion-free additive group G is not necessarily free if G is not finitely generated, The first important work on torsion-free additive groups was done by F. W. Levi (1917). A, G. Kurosh (1937) completed the theory in the case of finite rank. In the general case little is known, and I. Kaplansky, J. Rotman, and others are continuing the investigation. The additive group Q of rational numbers is of rank 1, and conversely any additive group of rank 1 is isomorphic to some subgroup of Q. An additive group G is called divisible (or complete) if for any a E G and for any integer n there is an element X n E G such that nX n = a, A divisible torsion-free additive group is isomor- phic to a direct sum of some copies of Q. For any torsion-free additive group G there is a divisible torsion-free additive group containing G. A minimal additive group F among these groups is uniquely determined up to isomor- phism and has the rational number field Q as an toperator domain. Let Q(p) = {a/b I (a, b) = I, p(b} be the ring of p-integers in Q, and let G p be the smaIlest Q(P)-subgroup of F contain- ing G. Let Qp be the tp-adic number field and Zp the ring of tp-adic integers. Extending the operator domain Q to Qp we obtain naturally a Qp-module Fp from F. Let G p be the natural closure of G p in Fp. Then G p has Zp as an 
2F Abelian Groups operator domain and thus becomes a Zp- module. A Zp-module of rank N is isomorphic to the direct sum of K p copies of Qp and N - K p copies of Zp: GpLnQpVn+ LmZpW m (n = 1,... ,Kp;m= 1,..., N -K p )' Here Kp is caIled the p-rank of G. As the invariants of G, Kurosh gives the rank, the p-ranks for all primes p, and a certain equivalence class of the sequence of the matrices 9Jl p . Here p ranges over all primes, and 9Jl p is the matrix of coefficients when the elements of a maximal independent system of F are written as linear combinations of(v n , w m ). F. General Abelian Groups An Abelian group is, in general, an textension of a torsion group by a torsion-free group. A torsion group T is called bounded if there is an integer n such that t n = 1 for all tE T. Suppose there is a torsion group T. Then T is a direct summand of an Abelian group G which con- tains T as its maximal torsion subgroup if and only if T is the direct product of a divisible group and a bounded group (R. Baer and S. Fomin). G. Characters A character X of an Abelian group G is a func- tion which assigns to each a E G a complex number x(a) of absolute value 1 and satisfies x(ab) == x(a)x(b) for all a, bE G. The product X = Xl X2 of two characters X 1 and X2 is defined by x(a) = Xl (a)x2(a), and X is also a character of G. Thus all the characters of G form an Abelian group C(G), which is called the char- acter group of G. The identity element of the character group is the identity character (or principal character) X such that x(a) = I for all a E G. If G is finite, then G  C( G). This implies the duality G=C[C(G)]. This fact was ex- tended by L. S, Pontryagin to tlocaIly compact ttopological Abelian groups ( 422 Topo- logical Abelian Groups B-D). For additive Abelian groups with operator domains  277 Modules. References [1] L. Fuchs, Abelian groups, Pergamon, 1960. [2] I. Kaplansky, Infinite Abelian groups, Univ. of Michigan, 1954. [3] A. G. Kurosh, Theory of groups I, II, Chelsea, 1960. (Original in Russian, 1953.) 4 3 (VIII.5) Abelian Varieties A. History Except for C. F. Gauss, whose work on this subject saw the light many years after his death, N. H. Abel was the first to consider talgebraic functions as functions of complex variables and to discovcr doublc periods of teIliptic functions such as x = x(u), which is the inverse function of an eIliptic integral f x dx u= Jf4(X) , where f4(X) is a polynomial in x of degree 4. C. G. Jacobi expressed elliptic functions explicitly as ratios of theta series. As a natural generali- zation of elliptic functions, which are t Abelian functions of genus 1, Jacobi and his successors studied the inverse function of the thyperellip- tic integral, or more generaIly, the inverse function of the Abelian integral. By investigat- ing hypereIliptic integrals of the first kind of genus 2, f X 1 dx Jf6(d ' f X 2 x2dx2 J f6(Xzj' where f6(X) is a polynomial of degree 6 in x, Jacobi obtained multiple-valued functions with quadruple periods, He then discovered the remarkable fact that if we consider sums of two integrals f X 1 dXI f X 2 dX2 U l = JUx l ) + JU X 2) ' f X 1 xldxj f X 2 x2dx2 U2= Jf6(xd + Jf6(X 2 ) ' the elementary symmetric functions Sl = x I +X2 and S2 =X j X 2 of Xl and X2 are single- valued functions of U l and U 2 with quadruple periods. He also conjectured that these func- tions Sl and S2 might be expressed explicitly in terms of theta series of U j and U2; this conjec- ture was later confirmed by J. G. Rosenhain and A. Gape\. In the latter half of the 19th century, the general theory of Abelian functions was es- tablished. The central subject was Jacobi's inverse problem ( Section L), which is a natural generalization of the above results, B. Riemann solved this problem by expressing elementary Abelian functions ( Section M) as rational functions of theta functions. The theory of functions with multiple periods was developed further by H. Poincare, G. Frobenius, and E. Picard. In the 20th cen- 
5 tury, the importance of the theories of Abelian functions and Abelian varieties has become more obvious with the development of the theory offunctions of several complex vari- ables and algebraic geometry. In particular, problems intimately related to number theory have given rise to the development of a purely algebraic theory of Abelian varieties. Valuable contributions have been made by S. Lefschetz, C. L. Siegel (tSiegel modular forms), A. Weil (algebraic theory of Abelian varieties, syste- matization of the theory of theta functions), and by D. Mumford (tmoduli theory, algebraic theory of theta functions). B. Algebraic Theory When a tgroup variety is tcomplete as a variety, the group law is commutative; such a group variety is caIled an Abelian variety ( 12 Algebraic Varieties H.). Let B be a sub- variety of an Abelian variety A, and assume that B is a subgroup of A as an abstract group. Then B has the structure of an Abelian variety whose law of composition is induced by that of A, and B is caIled an Abelian subvariety of A. More generaIly, when an algebraic subset !B is a subgroup of A, then the component B of!B containing the identity element is an Abelian subvariety, and !B is a union of B and a finite number of cosets of B. When A is defined over a field k, then any Abelian subvariety of A is defined over a finite tseparable extension of k (W. L. Chow's theorem). An Abelian variety A is called simple if A has no Abelian sub- varieties other than A itself and O. Every trational mapping of an algebraic variety V into an Abelian variety is defined at each simple point of V. This implies that an Abelian variety is tabsolutely minimal. C. Homomorphisms A rational mapping of an Abelian variety A into an Abelian variety B is called a rational homomorphism (or simply homomorphism) if f is a group homomorphism. Let F be a rational mapping of A into B; then F can be uniquely expressed as foIlows: F(x)= Fo(x) + F(O) (xEA), where Fo is a homomorphism and F(O) is the image of the unit element 0 of A. Hence the structure of an Abelian variety (as a group variety) is uniquely determined by the underly- ing algebraic-variety structure. When a rational homomorphism f is bira- tional, f is caIled a birational isomorphism (or simply isomorphism). It is clear that a rational isomorphism is an abstract isomorphism, but the converse is not necessarily true. Let A, B 3C Abelian Varieties be two Abelian varieties. We say that A is isogenous to B if the dimension of A is equal to that of B and there exists a surjective homo- morphism of A onto B, or equivalently, if there exists a surjective homomorphism of A onto B whose kernel is finite, The relation of isogeny is an equivalence relation. For an Abelian variety A and an Abelian subvariety X of A there is an Abelian subvariety Y of A such that the natural homomorphism X x Y--> A :(x,y)-->x+ y is an isogeny (Poincare's com- plete reducibility theorem), In particular, every Abelian variety is isogenous to a product of simple Abelian varieties that are determined uniquely up to isogeny and order. Let A, B be two Abelian varieties; we denote by Hom(A, B) the additive group of rational homomorphisms of A into B. When a rational homomorphism ..1. is surjective, then the degree V(A) of A is defined by A(A) = v(A)B as talgebraic cycles. If ), is an isogeny, then v(A) # 0, and the order of the kernel {t I tEA, At = O} is at most v(A) and equal to v(A) if and only if ..1. is tsepa- rable. The additive group Hom(A, B) is shown to be free of rank .s;4dim A 'dim B by the /-adic representation ( Section E). If A = B, then Hom(A, A) has a ring structure; it is called the ring of endomorphisms (or endomorphism ring) of A and is denoted by (A). The tensor prod- uct o(A) = (A) @ Q, where Q is the field of rational numbers, is an tassociative alge- bra over Q. If A is simple, then o(A) is a tdivision algebra. More generaIly, o(A) is iso- morphic to a direct product of some ttotal matrix algebras over division algebras; thus o(A) is tsemisimple. In particular, if A is 1- dimensional (in other words, if A is an telliptic curve), the types of o(A) are weIl known; when the characteristic p = 0, then o(A) is either the field of rational numbers or an timaginary quadratic field. When p > 0, aside from these two fields, we have a tquaternion algebra over Q as a possible type of o(A). Let k be a finite field with q elements, An algebraic integer is caIled a Weil number for q if every conjugate of it has absolute value jq. If A is an Abelian variety defined and simple over k, the qth power endomorphism of A: x-->x q determines a conjugacy class ofWeil numbers for q, as Weil showed ( 450 Zeta Functions). Moreover, we have the foIlowing classification theorem (1. Tate, T. Honda): There is a one-to-one correspondence between the set of all k-isogeny classes of k-simple Abelian varieties over k and the set of all con- jugacy classes of Weil numbers for q. Tate also determined the structure of the division alge- bra o(A) over Q, which is described in terms of the decomposition of the qth power endo- morphism into prime ideals. 
3D Abelian Varieties D. Divisors Let ffi be the additive group of tdivisors on an Abelian variety A and ffia be the subgroup of divisors that are talgebraically equivalent to O. Then the factor group ffi/ffia has no ttorsion part; this implies that for an Abelian variety tnumerical equivalence coincides with talge- braic equivalence. We denote this relation by =. Given an element a of A, the translation 7;,:A3x-->x+aEA gives a tbirational trans- formation, which is everywhere tbiregular, on the underlying variety of A; we denote by Xa the image of a divisor X on A. Then X = 0 if and only if Xa is t)inearly equivalent to X for each point a of A. The t Albanese variety of an Abelian variety A is A itself, and the tPicard variety A of A is isogenous to A, In particular, for the tJacobian variety J of an algebraic curve, J is isomorphic to J itself. The Picard variety A of A is isomorphic to A (duality theorem). Let X be a divisor on A; the map- ping a-->the tlinear equivalence class of the divisor Xa - X, a E A, is a rational homomor- phism of A into A, and we denote it by <Px. If <Px=O, then X=O, and vice versa; hence we have an additive map of ffi/ffia into Hom(A, A). If <Px is surjective, we say that X is nondegener- ate. A tpositive divisor X is nondegenerate if and only if X is tampl e , and then nX is tvery ample for n  3. There always exist positive nondegenerate divisors on an Abelian variety; therefore an Abelian variety is a tprojective variety. For a given divisor X on A, we can find n suitable points U!, ..., Un, where n is the dimension of A, so that the tintersection prod- uct Xu, . '" . Xu, is defined. We denote by (x(n») the tdegree of the zero cycle Xu, . ." . Xu" If X is positive nondegenerate, then the dimension I(X) of the tdefining module of the tcomplete linear system determined by X is equal to (x(n»)/n! (Poincare's theorem), Furthermore, the degree v(<Px) of <Px, where X is any divisor on A, is given by the formula v(<px)=«x(n»)/n,)2 (Frobenius's theorem). For a nondegenerate divisor X on A there is a unique integer i=i(X), O.s;i.s;dimA, called the index of X, such that HP(A, @(X))=O for p#i and Hi(A,@(X))#O. We have i(-X)= dim A - i(X) and i(X) = 0 if and only if X is ample. E. I-adic Representations Let A be an Abelian variety of dimension n. For a given prime number I, let ffil(A) denote the group of points on A whose order is a power of I. If I is different from the character- istic of the base field of A, then the group ffi[(A) is isomorphic to the direct product of 2n 6 factor groups Qi/Z/> where Q[ is the field of 1- adic numbers and Z[ is the group of I-adic integers (439 Valuations). We call such an isomorphism the I-adic coordinate system of ffi/A). Now let ..1. be a rational homomorphism of A into an Abelian variety B of dimension m. Then we can see that A induces a homomor- phism of ffi[(A) into ffi[(B). This shows that by placing I-adic wordinate systems on ffi[(A) and ffi[(B) respectively, we get a matrix representa- tion M[(A) of ..1. with 2m rows and 2n columns. The representation ).--> M[(A) is faithful, and M[(A) is called the I-adic representation of ..1.. When A = B, then A-->M[().) is a faithful repre- sentation of the ring of endomorphisms (A). This representation can be naturaIly extended to the representation of the algebra o(A); the characteristic polynomial of the I-adic representation M[(A) (where ). is an element of o(A)) is a polynomial with coefficients in Q. Moreover, the polynomial does not depend on the choice of the prime number I. When AE(A), then v().) is equal to detMP). The trace of M,(A) is usually written as 0"(..1.). Let ..1. be a rational homomorphism of A into Band Y be a divisor on B. Then by the correspondence cl( Y)-->cl(A -I( Y)), we obtain a rational homomorphism of fj into A, called the transpose of ). and denoted by tA, where cl means the linear equivalence class ( 16 Alge- braic Varictics M), If X is a nondegcnerate divisor on A, hen the composition map fJ: A--> A of'<Px:A-->A and the canonical isomorphism A-->A satisfy the equality fJo<Px=v(<Px)t5 (t5 = the identity mapping of A). We denote by <pi l the element (ljv(<px)) fJ in Hom(A, A) @ Q. The correspondence ex-->ex', ex' = <pi! 0 'ex 0 <Px (c(Eo(A)) is an tinvolution ofo(A) and is of order 1 or 2. If ex 7" 0, then O"(ex' 0 ex) > 0 (Castelnuovo's lemma). A. Weil was the first to recognize the importance of this theorem in connection with tRiemann's hypothesis on tcongruence zeta functions. F. Differential Forms A tdifferential form w on an Abelian variety A of dimension n is called invariant if Tw = w for every point aE A, where 7;, is the trans- lation by a ( Section D). The differential form tof the first kind is invariant, and conversely, every invariant differential form is of the first kind. Let K be the tuniversal domain and K(A) be the tfunction field of A. The set of the linear differential forms of the first kind on A is a linear space over K of dimension n, and its basis becomes a basis over K(A) of the linear space consisting of all linear differential forms on A, An invariant derivation on A is a deriva- tion D in K(A) satisfying (Pf)o 7;, = D(.fo 7;,) 
7 for any element f of K(A) and every point a of A. For a linear differential form w='L/;du i and a derivation D, we put (w, D) == 'L/;Du j . Then (w, D) is a bilinear form in wand D. A derivation D is invariant if and only if (w, D) is a constant function for every invariant linear differential form w. Similarly, a linear differen- tial form w is invariant if and only if (w, D) is a constant function for every invariant deriva- tion D. The linear space of invariant linear differential forms and that of invariant deriva- tions are dual to each other with respect to the bilinear form (w, D). Now consider the case when the character- istic p of the universal domain is positive. The automorphism a-->a P , aE K, of the universal domain K induces a group isomorphism of A; we denote by AP the image of A and by x P the image of a point x of A, The image AP is an Abelian variety, and the group isomorphism n:x-->x P , xEA, is an isogeny of A onto AP, Let B be another Abelian variety and let ..1. be an isogeny of A onto B. If there is an isogeny J1: B-->AP such that n = J10 ..1., then we say that ..1. is of height 1, The function field K (B) of B can be considered as a subfield of the function field K (A) of A by the mappi ng ..1.. If ..1. is of height 1 and v(A) = p, there exists an invariant deriva- tion D of K(A) with the constant field K(B), uniquely determined up to constants. More- over, we can choose D so that DP = D or DP = O. In the first case ..1. is said to be of type (i l ); in the second case it is said to be of type (i2)' An isogeny whose degree is a prime different from the characteristic p is said to be of type (sd, and a tseparable isogeny whose degree is p is said to be of type (S2)' Any isogeny can be written as a product of isogenies of these four types, G. Polarized Abelian Varieties Let X be a divisor on an Abelian variety A; we denote by I the class of divisors X' such that mX = m' X' for suitably chosen positive inte- gers m, m'. When the class I contains positive nondegenerate divisors, we say that I deter- mines a polarization on A, and the couple (A, I) is called a polarized Abelian variety. In particular, if A is a tJacobian variety whose polarization I is determined by a theta divi- sor, we call (A, I) the canonically polarized Jacobian variety. If an endomorphism (X of A keeps the polarization invariant, i.e" if the class determined by (X-I(X) coincides with the class I, then (X is called an endomorphism of the polarized Abelian variety (A, I). In partic- ular, if (X is an automorphism of A, then we say that (X is an automorphism of (A, I). The group of all automorphisms of a polarized 3H Abelian Varieties Abelian variety is finite. In particular, the group of automorphisms of a canonicalIy polarized Jacobian variety is finite. Hence follows the famous theorem concerning the finiteness of the group of automorphisms of an talgebraic curve of genus not less than 2. On the other hand, the algebraic equiva- lence class of a nondegenerate divisor is called an inhomogeneous polarization. (The above polarization is then sometimes called a homo- geneous polarization.) An inhomogeneous polarization X determines an isogeny <Px: A-->A uniquely. If <Px is an isomorphism, the polarization is called principal. An endo- morphism of an inhomogeneously polarized Abelian variety can be defined similarly, H. Analytic Theory For the rest of this article we take the complex number field C as the universal domain, and in this case we can utilize analytic and topolog- ical methods. Let cn be an n-dimensional vectOr space over C. In a natural way, the space C" be- comes a 2n-dimensional vector space R 2 n over the real field R, and the mapping J: z--> J=1 Z, ZEC n , is an R-linear automorphism of R 2 n such that j2 = -1. Conversely, if for an even-dimensional R-vector space R 2 n such a mapping J is given, then by putting (a+ J=1 b)x=ax+bJx (xER 2 n;a,bER), we can introduce an n-dimensional complex linear structure into R 2n . We then say that J deter- mines a complex structure on R 2 n; we denote by C"=(R 2 n,J) the space having the complex structUre determined by J. Let WI' ... ,W2n be 2n R linearly independent points on C" = (R 2 n, J). Then the tIattice D generated by these points is discrete, and the factor group Tn=C"/D is a complex torus of dimension n, We fix a basis of C" and introduce a complex coordinate system on C". Utilizing the basis WI, ..., W2n of R 2n , we also introduce a real coordinate system on R 2 n. We then obtain an n x 2n matrix o = (Wi), where the (w\i"'" w n ;) are the complex coordinates of Wi; the ma- trix 0 is called the period matrix of Tn. Let (z l' ... ,zn) be the complex coordinates of a point ZEC n and (XI' ...,x 2n ) be the real coordinates. Then we have '(ZI' ""zn)= O'(x l , ...,x 2n ).lfwe let the same symbol J stand for the representation matrix of the linear transformation J with respect to the basis WI, ,., , W2n, then we have J=1 0 = OJ. The underlying real Lie group of Tn is a 2n- dimensional (real) Horus group (R/Z)2n, Hence the tPoincar{: polynomial of Tn is (1 +x)2n, Any tHermitian metric on C" (as a vector space) induces a tf1at tKiihler metric; hence the 
3 I Abelian Varieties Hodge numbers ( Kahler Manifolds C) are h p,q == (;)(;). I. Theta Functions A holomorphic function f(z) on C" = (Rn, J) is called a theta function if for every dE D we have f(z+ d)= f(z)exp(lAz) + Cd)' where IAz) is a linear form on C n which, as for Cd, depends on d. The set of zeros of a theta function I, which we write as (/), determines an analytic divisor on Tn. Conversely, for every effective analytic divisor X on Tn, there exists a theta function f such that (.f) = X. With respect to the real coordinate system x I' ..., X 2n deter- mined by the basis WI' ,., , W 2n , we can find 2n x 2n matrices A, Ao and a 1 x 2n matrix b, with elements in C, such that the transforma- tion formula f(x + a) = f(x) exp(2nj=t ('aAx +!taAoa+ 'ba)) (where A == Ao (modZ), 'Ao = Ao) holds for every 1 x 2n matrix a whose elements are rational integers. Moreover, if we put E = A - 'A, then E is an talternating matrix whose elements are rational integers, and S = EJ is a tpositive semidefinite sym- metric matrix. Conversely, if there exists such an alternating matrix E we can find a theta function. (There does exist, however, a com- plex torus on which no theta function exists other than trivial ones, i.e., ones of the form exp(<p(z)), where <p(z) is a polynomial of degree at most 2.) A theta function f is called nondegenerate if it cannot be a function of n - 1 complex vari- ables, and f is nondegenerate if and only if the matrix S = EJ is tpositive definite. A complex torus has the structure of an Abelian variety if and only if there exists a nondegenerate theta function, i.e., if and only if there exists an alternating matrix E whose elements are ra- tional integers such that EJ is a positive de- finite symmetric matrix. The latter condition is satisfied if and only if there exists an alter- nating matrix E whose elements are rational integers such that O'E-I,O=O, j=t O'E- 1t Q >0 (positive definite Hermitian matrix). In particular, a period matrix 0 satisfying these conditions is called a Riemann matrix, and the rational matrix 'E- I is called the principal matrix belonging to O. Determining a polarization on an Abelian variety can be reduced to designating a class of principal matrices obtained from a principal matrix by multiplying by positive integers. Let X be a positive divisor on an Abelian variety Tn = C" /D, and let f be a theta function such that (/) = X. Then the divisor X is nondegen- erate if and only if the theta function f is non- degenerate, and the latter statement holds if and only if the alternating matrix E obtained 8 from f is tinvertible. For a given principal matrix 'E -I, we can choose suitable coordinate systems of C" and D so that 0 = (In, F) and E = ( 0  ) h . . . d A' , were In IS a umt matnx an illS - 0 a diagonal matrix whose elements are te1emen- tary divisors of E. In these situations, F is symmetric and its imaginary part positive definite, i.e., F is a point of the tSiegel upper half-space 6n. Thus to the polarized Abelian variety (where the polarization is determined by 'E- I ) there corresponds a point of 6 n , This gives a one-to-one correspondence between the isomorphism classes of Abelian varieties inhomogeneously polarized by a principal matrix with given elementary divisors and the points of the factor space 6n/r n()' where rn() is a subgroup that is tcommensurable to the Siegel tmodular group of degree nand operates on 6n discontinuously. The case of principal polarization corresponds to the case when  = In, and then r n() is the Siegel modular group itself. 6n/rn(M is the coarse tmoduli space of Abelian varieties, polarized as above; the projective embedding of 6n/rn() is given by means of tSiegel modular forms. J. Abelian Functions Meromorphic periodic functions on C" with periods WI' .., ,w2n (i.e., meromorphic func- tions on T = cn /D, D = ZW I EB ... EB Zw 2n ) are called Abelian functions with periods D (or on T). The quotient of two theta functions having the same periods and the same transforma- tion formulas is an Abelian function, and con- versely any Abelian function can be written as such. All Abelian functions on T form a field C(T) called the Abelian function field. If T has the structure of an Abelian variety, then C(T) coincides with the field of rational functions on T. In general, for any complex torus T there are an Abelian variety T' (possibly of dimO) and a surjective homomorphism ..1.: T --> T' which induces an isomorphism ..1.*: C(T') C(T). Such a pair (T', ..1.: T --> T') is unique up to isomorphism, and is called the talgebraic red uction of T. K. Homomorphisms Let C"' = (R2n',Ji) (i = 1, 2) be complex linear spaces; an R-linear mapping f: R2n 1 -->R 2 n 2 is C-linear if and only jf the relation f 0 J I == J 2 0 f holds. Let D i be tlattice groups of C"' for i== 1, 2.lfa C-linear mapping ;\:C"1-->C"2 satisfies ;\(D I ) C D2' then ;\ induces a com- plex analytic homomorphism of T 1= C"I/DI to T 2 =C n 2/D 2 . Conversely, any complex analytic homomorphism ofT I to T 2 is ob- 
9 tained in this way. Let Tt and T 2 be Abelian varieties, and let 0 1 =(wP), ...,w») and O 2 = (wI 2 ), ..., w») be their Riemann matrices. Then for a homomorphism A:T l -->T 2 , we can find a representation matrix W(A) with complex coefficients; and with respect to the real coordinate systems (w\t),..., w\) and (wI 2 ), "', w), we can find a representation matrix M (A) with coefficients in Z such that W(A)OI =02M(A). Conversely, if for a complex matrix W there is a matrix M, with coefficients in Z, satisfying the relation given above, then W gives a homomorphism ofT I to T 2 . The above equation is called Hurwitz's relation. The notion of l-adic coordinate system, which is valid for a general characteristic, corre- sponds to that of the lattice group, and the 1- adic representation Ml(A) of A is the abstrac- tion of the integral representation M (},). L. Abelian Integrals Let  be a compact tRiemann surface of genus 9  1 ( 11 Algebraic Functions) and let W be a sum of t Abelian differentials of the first kind or of the second kind. Then the tperiod of W along a cycle y depends only on the thomology class of y. The set of all differentials of the first kind forms a complex linear space of dimen- sion g; we denote it by :Do' Let P be a point and Po a fixed point of R; then we denote by u(P) the vector integral (Jo w t , "', Jo w g ), where (WI, ..., w q ) is a basis of :Do and the path from Po to P is common to every integral. The correspondence P-->u(P) is not a single-valued mapping; the totality of differences of values of u(P) coincides with the group D consisting of periods (J y W t , .., , J y w g ), where y varies over all cycles. Let a set of cycles {y I, ... , 1'29} be a basis of the homology group, with coefficients in Z; then 2g column vectors of the 9 x 2g matrix 0 = (Wij), Wi} Ljw i , are linearly inde- pendent over R. Since the group D coincides with the set of linear combinations, with co- efficients in Z, of column vectors of 0, this D is a tlattice of rank 2g, and the matrix 0 is a period matrix of the complex torus Tg = c g jD of dimension g. For a basis of the homology group with coefficients in Z, we take tnormal sections :XI, ..., ex g , exg+l, ..., ex 2g of 9i, and let the same symbol 0 stand for the period matrix (Wi), w.. = J (1).' then we have lJ C1j" OE'O = 0, j=1 OEtQ > 0 (positive definite Hermitian matrix), where ( 0 Iq ) . .' ) E = . (with Ig the umt matnx . -Ig 0 This implies that E is a principal matrix be- 3L Abelian Varieties longing to 0; we call the equality and inequal- ity just given Riemann's period relation and Riemann's period inequality, respectively. Fur- thermore, we can choose a suitable basis of :Do so that the period matrix 0 is of the form (Ig, F) with FE 6g ( Section I). We consider the function 9(u), U =(u l , ..., u g ), defined by an infinite series 9(u) = L exp(2nj=1 (m'u +tmF'm)), m where the sum is taken over all row vectors m = (ml, ..., mg) with coefficients in Z. If u is in a bounded region, then the series for 9(u) is uniformly and absolutely convergent. Hence 9(u) is a holomorphic function of u. This is a theta function corresponding to the princi- pal matrix E = ( 0 I g ) and is called the -Ig 0 Riemann theta function. As 9(u) is nondegen- erate, the complex torus P=CgjD has the structure of an Abelian variety. If we regard the Riemann surface  as an algebraic curve, this Abelian variety P is precisely the Jaco- bian variety of the curve . The collection of zeros of 9(u), a divisor on Tg, defines the ca- nonical polarization on the Jacobian variety ( Section G). The correspondence P-->u(P), pE induces a well-defined mapping <p of  into P = CqjD. Moreover, if we set <p(A)=Ljt <p()- Ljt <p(Q) for any divisor A=P I ... P,jQI ...Q, of degree 0, then <p(A) is a point on Tq = c g jD, which is represented by the vector (LjdW" ..., Lj1 Jd,w q ) of C g , and the mapping A--> <p(A) is a homomorphism of the group <f>o of divisors of degree 0 onto Tg = c g jD. The kernel of this homomorphism coincides with the group G5 l of tprincipal divisors (Abel's theo- rem). Hence a divisor A == PI ... P,jQ I '" Q, of degree 0 is a divisor of some function if and only if we have LjdWi=O (mod D) (i= 1, .."g), or equivalently, the left-hand side is o for a suitable path. Given 9 fixed points PI' ... , Pg on  and given (u l , ..., u g ) as any vector of C g , the prob- lem of finding 9 points QI, ,.., Qg satisfying relations L1d;Wi= U i (mod D), i= 1,... ,g, is called Jacobi's inverse problem. To solve the problem, we take a divisor A of degree 0 such that the class <piA) (mod D) is represented by (u I, ... , u g ); then, by virtue of the tRiemann- Roch theorem, there exists a divisor Q t '" Qg satisfying A=QI'" QgjPt... Pq (modO)l)' Abel's theorem implies that the set of points {Q h ... , Qg} is a solution of Jacobi's problem. Moreover, for general (u t , ..., u g ), the solution is unique; i.e., there exists a subvariety X of dimension 9 - 2 on C g such that the solution is 
3M Abelian Varieties unique up to order if and only if (U I , "', u q ) does not lie on X. In particular, if every point Pi coincides with the fixed point Po that ap- peared in the definition of u(P), the subvariety on P=Cg/D determined by X is obtained in the following way: Let WI + ". + W 2g - 2 be a canonical divisor on , and put c = <p( WI) + ... + <P(W 2g - 2 ). Then the locus of points c-<p(Rd -,.. -<p(R g - 2 ) (where 9 -2 points R I "." Rg-2 are taken independently over all points of ) is the desired subvariety. In terms of the theory of complex manifolds the above result can be rewritten as follows: Let sg() be the symmetric product of  and I the subvariety ofTg=Cg/D induced from X, Then the holomorphic map sg( <p): sg()--> Tg induced from <p:  --> Tq is bimeromorphic and isomor- phic outside that I whose elements corre- spond to effective divisor classes of degree 9 contained in canonical divisors on . M. Elementary Abelian Functions Let z be a nonconstant meromorphic func- tion on . Then for any u=(u I ,.." Uq)EC g that does not lie on X, there exist g points Q I, '" , Qq (uniquely determined up to order) as the solution of Jacobi's problem. Therefore the elementary symmetric functions 9 SI(U;Z)= L z(Qj)' j1 9 S2(U; z)= L z(Q)z(Q),.., ,Sg(u; z)= TI z(Qj) i<j j=l are well defined if U lies outside the variety X of dimension 9 - 2. Each function Si(U; z), re- garded as a function of u, can be extended uniquely to an Abelian function in the whole space C g ; the function is denoted by the same symbol s;(u; z). The Abelian functions SI (u; z), .., ,Sg(u; z) are called the elementary Abelian functions obtained from z. Now let K be the field of Abelian functions on C" /D and k the field of meromorphic func- tions on ; then the dimension of Kover C is g, and [K: C(sdu; z),..., Sg(u; z))J =r g , where r is the degree of the function z that is given by [k: C(z)]. Moreover, if we take any function w such that k = C(z, w), then we have K =C(SI (u; z), ..., Sq(u; z); sdu; w), ..., Sq(u; w)), where SI (u; w), ..., Sg(u; w) are the elementary Abelian functions obtained from w, We can write any elementary Abelian func- tion as a rational function of Riemann theta functions; therefore any Abelian function can be written as a rational function of Riemann theta functions, Furthermore, if U and v are variable vectors, then s;(u + v; z) can be repre- 10 sented as an algebraic function of 'I (u; z), ... , Sg(u; z), sdv; z), ..., Sg(u; z); i.e., we can choose a suitable polynomial H;(Z; Xl, ..., X q; Y I , ..., 1;,) with coefficients in C so that Hi(s;(u + v;z); SI(U;Z), ...,Sg(u;z); SI(V;Z), ...,Sg(v;z))=O. This algebraic addition formula with respect to the elementary Abelian functions Si(U; z), i = 1, .., , g, is a function-theoretic interpretation of the fact that the addition map X x X --> X :(x,y)-->x + y is a tmorphism of algebraic varieties. As the study oftAbelian integrals of the first kind led us to the theory of Jacobian varieties, t Abelian integrals of the second and the third kind give rise to the theory of tgeneralized Jacobian varieties ( 9 Algebraic Curves). The theory of Abelian varieties has signifi- cant applications to number theory, as shown by the following examples: the theory of tun- ramified Abelian extensions with respect to a function field of several variables defined over a finite field (S. Lang), the theory of heights of points on an Abelian variety (Weil, A. Neron, J, Tate), and the theory of complex multipli- cation ( 73 Complex Multiplication) in the case of higher dimensions (Y. Taniyama, G. Shimura). N. Some Recent Results (1) Level Structure, Moduli of Abelian Varieties. Let A be an Abelian variety over k of dimension 9 and n a positive integer which the characteristic of k does not divide. A level n structure on A is defined to be a set of 2g points (J I' . .. , (J 2g on A which form a basis for the group of points of order n on A. Let A(g, d, n; k) be the set of triples: (i) an Abelian variety A over k of dimension g, (ii) an inhomogeneous polarization X on A with v(<Px)=d 2 , and (iii) a level n structure (J I, ... , (J2g of A, all up to isomorphisms, Simi- larly we can define A(g, d, n; S) for Abelian schemes over a scheme S. The correspondence S -->A(g, d, n; S) defines a functor cW'(g, d, n). D. Mumford has shown that there exists the tcoarse moduli scheme A(g, d, n) quasi- projective over Spec(Z[I/nJ), and that it is even fine if n  3 [6]. He used the technique of tHil- bert schemes and tstable points ( 16 Alge- braic Varieties). One of the key steps of his proof is to show that for an em bedding Ij!: A --> pm of degree r (i.e., the degree of Ij!(A) in pm) over an algebraically closed field k and a posi- tive integer n such that char(k) does not divide nand n> J(m+ l)r , the point (Ij!(X;))i1., ,n2, in (pm)"" is stable with respect to the action of PGL(m), where the Xi are the points of order n on A (with an arbitrary order). Mumford 
11 later showed another method of constructing the moduli of polarized Abelian varieties by using algebraic theta constants [7]. (2) Neron Minimal Models, Good and Stable Reduction. Let R be a discrete valuation ring with residue field k and quotient field K. For an Abelian variety A over K, there exists a smooth group scheme ,r;1 of finite type over S = Spec(R), called the Neron minimal model of A, such that for every scheme S' smooth over S there is a canonical isomorphism Homs(S', d)  HomK(S, A), where S is the pullback of S' by Spec(K)--> Spec(R) (A. Neron, M. Raynaud). In particu- lar, we have d K  A. Denote by Ao the fiber of ,7/ over the closed point of S. If ,7/ is proper over S, we say that A has a good reduction at R. If the connected compo- nent Ag of Ao containing 0 has no unipotent radical (or equivalently, Ag is an extension of an Abelian variety by an talgebraic torus over a finite algebraic extension of k), we say that A has a stable reduction. If there is a finite sepa- rable extension K' of K with a prolongation R' of R to K' such that A x K K' has good (stable) reduction, we say that A has a potential good (stable) reduction at R. Let Ks be a separable closure of K and R a prolongation of R to K,. For a prime number l#char(k), we have a canonical homomorphism p:Gal(K)K)--> Aut(t\J1(A)), calIed a monodromy. Then A has potential good reduction if and only if the image of the tinertia group I (R) by p is a finite group (J.-P. Serre and Tate). Every Abelian variety A over K has potential stable reduc- tion at R (stable reduction theorem, A. Gro- thendieck [16]). (3) Graded Ring of Theta Functions. If f is a theta function with a period system D,fn is also a theta function with the same period system for every positive n. We denote by Sn the vector space of the theta functions with the period system D subject to the same trans- formation law as 1". If we denote by X the effective divisor on T=Cq/D defined by f, then Sn can be naturalIy considered as the tdefining module of the tcomplete linear system of nX, and the dimension of Sn (= l(nX)) is equal to the product of the nonzero diagonal elements of n ( Section I) (Frobenius). If gES m and hES n , then ghES m + n ; hence S= E8n;;>oSn is a tgraded ring, which is normal and finitely generated, For m2 and n3, the product map Sm x Sn-->Sm+n is surjective (D. Mumford, S, Koizumi). If the elementary divisors of E can be divided by an integer 4, the kernel of the natural graded mappmg S(Sj)-->S (where 3 Ref. Abelian Varieties S(Sj) denotes the tsymmetric algebra over Sd is generated by the quadratic relations (i,e., the part of degree 2) for sufficiently large degrees (Mumford). Geometrically this means that if X =(/),!ES j , is nondegenerate, then, with the prOjective embedding i:T -->p N defined by the complete linear system of X, i(T) is an inter- section of quadrics in p N containing i(T). Mumford has developed a theory of alge- braic theta functions that also works for the positive characteristic case [7J and has proved the above results in general. References [IJ Sem. H. Cartan, vo\. 2, Fonctions analy- tiques de plusieurs variables I-V, 1951-1952, Benjamin, 1967. [2J F. Conforto, Abelsche Funktionen und algebraische Geometrie, Springer, 1956. [3J A. Krazer and W. Wirtinger, Abelsche Funktionen und allgemeine Thetafunktionen, Enzykl. Math. Wiss., II, 2, B7 (1920), 604-873. [4] J. Igusa, Theta functions, Springer, 1972. [5J S. Lang, Abelian varieties, Interscience, 1959. [6] D. Mumford, Geometric invariant theory, Erg. Math., Springer, 1965; second edition, 1982. [7J D. Mumford, On the equations defining Abelian varieties I-III, Inventiones Math., I (1966),287-354; 3 (1967), 75-135, 215-244. [8] D, Mumford, Varieties defined by quadra- tic equations, Questioni sulle variete alge- braiche, Corsi dal C. 1. M. E., Cremona, 1969, III Cicio, Cremonese, 1970, 31-94, [9] D, Mumford, Abelian varieties, Tata Inst. studies in math" Oxford U niv, Press, 1974, [10] A. Neron, Modeles minimaux des varietes abeliennes sur les corps locaux et globaux, Pub\. Math. Inst. HES, 21 (1964),5-128. [11J W. F. Osgood, Lehrbuch der Funktion- entheorie II 2, Teubner, 1932. [12] M. Raynaud, Modeles de Neron, c. R. Acad. Sci. Paris, 262 (1966),345-347. [13] B. Riemann, Theorie der Abelschen Funktionen,1. Reine Angew. Math., 54 (1857), 115-155. [14J J.-P. Serre, Quelques proprietes des varietes abeliennes en caracteristique p, Amer. J. Math., 80 (1958), 715-739. [15J 1.-P. Serre and J. Tate, Good reduction of Abelian varieties, Ann. Math., (2) 88 (1968), 492-517. [16J Seminaire de geometrie algebriques (SGA) 7, Groupes de monodromie en geo- metrie algebrique, Lecture notes in math. 288, 340, Springer, 1972, 1973. 
4A Additive Number Theory [17] G. Shimura, Moduli and fibre systems of Abelian varieties, Ann. Math., (2) 83 (1966), 294-338. [18] C. L. Siegel, Analytic functions of several complex variables, Lecture notes, Institute for Advanced Study, Princeton, 1948-1949, [19] J. Tate, Classes d'isogenie des varietes abeliennes sur un corps fini (d'apres T. Honda), Sem, Bourbaki, 1968-1969, no. 352; Lecture notes in math. 179, Springer, 1971. [20] A. Weil, Theoremes fondamentaux de 1a theorie des fonctions theta, Seminaire Bour- baki, no. 16, 1949, Benjamin, 1966. [21] A. Weil, Varietes abeliennes et courbes algebriques, Actualites Sci. Ind., Hermann, 1948. [22] A. Weil, Varietes kahleriennes, Actualites Sci, Ind., Hermann, 1958. [23] A. Weil, Generalisation des fonctions abeliennes, J. Math. Pures Appl., (9) 17 (1938), 47-87. [24J D. Mumford, Tata lecture notes on theta functions I, II, Birkauser, 1983. 4 (V.5) Additive Number Theory A. General Remarks Let N be the set of natural numbers, and let A, B,... be subsets ofN. The sum C=A + B is defined as the set {clcEA, cEB, or c=a+b, aEA, bEB}. A finite sum of subsets ofN is defined similarly. If N is the sum A + ... + A (r times), then we say that A is a basis of order r in N. Let A(x) denote the number of integers in A that do not exceed x. The density of A is defined as infA(x)/x. If A(n)(Xn, B(n) fJn (0 < (x, fJ < 1) for all nE N, then we have (A + B)(n)  (min(J, (X + fJ))n. This result was stated by E. Landau (1937) without proof; A. J. Khinchin had given a proof (1932) for the case (X = fJ, and H, B. Mann (1942) and E. Artin and P. Scherk (1943) succeeded in proving the statement for the general case. Suppose that the densities of A and B are, respectively, (X and 0, If B is a basis of finite order in N, then the density of A + B is greater than that of A (Khinchin and P. Erdos, 1936). Let P be the set of all prime numbers. Though the density of Pis 0, the density of P + P is positive (L. G. Shnirel'man, 1930). Hence P is a basis of finite order in N; III other words, there exists a posi- tive integer r such that every natural number can be expressed as a sum of at most r primes. Though the density of the set Q of the kth powers of natural numbers is 0, there exists a positive integer s(k) such that the sum Q + ... 12 + Q (s(k) times) is a set of positive density. L. K. Hua (1956) gave a simple proof of this fact, based on Yu, V. Linnik's idea. It follows, therefore, that any natural number n can be expressed as a sum n = a + ... + a;, where aiEN, ts(k). This result had already been shown by D. Hilbert (1909). An ancient method of finding prime num- bcrs is tEratosthenes' sieve. V. Brun (1920) devised a new sieve method to express an arbitrary integer n as the sum of two integers n = a + b, where the number of prime factors of a and b is as small as possible. This method was improved by H. A. Rademacher (1924), Landau (1931), A. Bustab (1937), and others. Among these the method found by A. Selberg (J 952) is notable ( 123 Distribution of Prime Numbers E). B. Farey Sequences Let r be a positive integer. We arrange in increasing order the set of all positive irreduc- ible fractions lying between 0 and 1 whose denominators do not exceed r. This sequence is called the Farey sequence of order r. For example, the Farey sequence of order 5 con- sists of 01 12132341 - - - - - - - - I' 5' 4' 3' 5' 2' 5' 3' 4' 5' I' A necessary and sufficient condition that a fraction a/b be directly followed by a fraction c/d in the Farey sequence of order n is b + d  n + 1, bc - ad = 1. In this case the fraction (a+c)/(b+d) is called the mediant of a/b and c/d. Interpolating the Farey sequence of order n with such mediants (a+ c)/(b + d) satisfying b+d=n+ 1, we obtain the Farey sequence of order n+ 1. Let a/q be a fraction in the Farey sequence of order r, and a'/q', a"/q" be adjacent mem- bers of a/q in the sequence such that a'/q' < a/q < a"/q". The interval [(a' +a)/(q' +q), (a+a")/(q+q")J is known as the Farey arc surrounding a/q. In particular, if a/q=O/I, then we set [ - l/n, l/nJ to be the Farey arc surrounding 0/1, where n = [rJ + 1 ([ ] is the tGauss symbol). We can thus decompose the interval [ -l/n, 1- l/nJ into a disjoint union of Farey arcs. If (X is contained in the Farey arc surrounding a/q, then I(X-a/ql < l/qr. Therefore, for a given r  1 and a real (x, we can prove the existence of a/q such that (lal,q)= 1, O<q.s;r, !(X-a/ql< l/qr. Let f(O == Lo an," be a power series which is convergent in the disk 1(1.s; 1. Then, by Cauchy's integral formula, 
13 an= r fn( d(= f lf(e2ni')e-2nin'dex. 2nIJ_I( 0 To estimate such an integral, we sometimes utilize the decomposition of the interval [0, IJ into a disjoint union of Farey arcs as men- tioned above. This method is called the circle method, and the subdivision of the interval is known as the Farey dissection. Given a positive number c, a Farey arc around a/q is usually called a major arc (or basic interval) if q does not exceed the given bound c; otherwise, it is called a minor arc (or supplementary interval). Usually, the principal part of the previously mentioned integral is derived from the integral over the major arcs, and the residual part is provided by the in- tegral over the minor arcs. C. Goldbach's Problem Goldbach's problem is found in letters (1742) he exchanged with L. Euler. In them he stated that every positive integer can be expressed as the sum of primes. More precisely, he conjec- tured that any even integer not smaller than 6 can be expressed as the sum of two odd primes and that any odd integer not smaller than 9 can be expressed as the sum of three odd primes. I. M. Vinogradov (1937) proved that every sufficiently large odd integer can be expressed as the sum of three primes. Let N be a suffi- ciently large odd integer. If we write J1(q) ( .a ) A(q,N)= L exp -2m-N , <p (q) I"q q (a,q)' then the series S(N) = L;I A(q, N) is ab- solutely convergent and is equal to 1](1 + (P 1)3 )D(I- P2_P+3 )' It is known that S(N»6/n 2 for all N. If we denote by r(N) the number of solutions of N = PI + P2 + P3, then r(N)(N2/2(1ogN)3). S(N). To prove this, Vinogradov used the cir- cle method. He employed the tprime number theorem for arithmetic progressions to esti- mate the integrals over the major arcs and devised an ingenious method to estimate the series LpNexp(2niexp) in the computation of the integrals on the minor arcs. A finite or infinite sum of exponential functions such as this is called a trigonometric sum. More gener- ally, we consider trigonometric sums of several variables. Vinogradov provided detailed re- marks and calculations [6]. In the case of even integers, the problem is still unsolved, although J. G. van der Corput, T. Estermann, and N. G. Chudanov proved 4D Additive Number Theory simultaneously (1938) that almost all even integers (i.e., except a set of density 0) can be expressed as the sum of two primes. For these problems, Linnik (1946) and Chudanov (1947) introduced function-theoretic methods. They obtained the density theorem concerning the zeros of L-series, and A. Zulauf (1952) con- tinued along the same lines. These methods had been suggested by G. H. Hardy and J. E. Littlewood (c. 1919), although they had as- sumed that the textended Riemann hypothesis held. We denote by  a number having at mostj prime factors counted with their multiplicity, Using his sieve method, Brun (1919) proved that every sufficiently large even integer 2n can be expressed in the form 2n = Pg + Pg. Radema- cher (1924) improved this result and obtained 2n = P 7 + P 7 . Applying his new sieve method to this problem, Selberg (1950) proved that 2n = P 2 + P 3 . On the other hand, using the tlarge sieve, A. Renyi proved that 2n = PI + P k for some k. Afterwards, combining Richert's sieve with.a large sieve, Chen ling-Run (1973) proved that 2n = PI + P 2 for large 2n [11]. D. polygonal Numbers Let m be an integer greater than 3, and let a l = 1, a n + 1 -an =(m- 2)n + 1 (n= 1,2,...). The sequence {an} forms the system of polyg- onal numbers of order m. The general term of {an} is given by n+1(m-2)(n 2 -n) (n= 1,2, ...), Such an are said to be triangular numbers if m = 3, square numbers if m = 4, and pentagonal numbers if m = 5. P. Fermat (1636) stated that every natural number can be expressed as the sum of m polygonal numbers of order m. This conjecture was proved by A. M. Legendre (1798) for m = 3, by J. L. Lagrange (1772) for m = 4, and by A. L. Cauchy (1813) for the general case. With regard to Lagrange's result, Legendre noticed that in order that a positive integer n be ex- pressed as the sum of three squares, it is neces- sary and sufficient that n not be of the form 4 a (8m+7). Given a positive integer n, the number of integral solutions of the equation xi + x + ... + x; == n is denoted by rAn). For example, r2(5) =8. The tgenerating function LI r2(n)n-S can be expressed as L';;, _ 00(m 2 + n 2 ) -s, where the term corresponding to m = n = 0 is omitted; the function is equal to 4(K(S), where (K(S) is the tDedekind zeta function of the Gaussian number field K = Q(p). The equation 4(K(S) ==4(s)L(s, X) (where x(n) ==( - 4/n)) leads to r 2 (n)=4 L'( _1)(m-I)/2, ml" 
4E Additive Number Theory where L' means the sum over all odd factors m of 11. This result was obtained by C. G. J. Jacobi (1829). He also obtained the following formula: r4(n)=8 L' m, IIIln4/111 where L' means the sum over all divisors m of n not divisible by 4 (Hardy and E. M. Wright, C. L. Siegel, 1964). Let q =exp(2nir) (1m r > 0), and 00 f(q)= 1 + I r,(n)qn n=l =(1 +2q+2q4+2q9 +...)'. Hardy (1920) considered the variation of f'(q) for q=aexp(2nih/k) (with O<a< 1) as a-->l; he obtained ( s ) ' ( 1 ) -'/2 f(q)n'/2 ,k log  ' where Sh.k = Ll exp(2ni(h/k)/). Furthermore, he constructed the singular series n'/2 n (s/2)-1 0:. Ps(n)=---- I Ak> Al = 1, r(s/2) kl Ak==k- S I (sh.k)'exP ( -2ni k n ) , llIk (h.k)1 and showed that rs(n) = ps(n) for s = 5,6, 7, and 8. P. T. Bateman (1951) proved the same equa- tion for s = 3 and 4. If s  9 then r,(n) = p,(n) + O(n'f 4 ) (Hardy, Littlewood, and S. Rama- nujan, c. 1919). The detailed exposition of this result is in the notes of A, Z. Walfisz (1952) and Rademacher [4]. H. D. Kloostermann (1926) and Estermann (1962) studied the equa- tion axi + bx + cx + dx = n, which led to a new field of study concerning the tKlooster- mann sum. For instance, estimates such as ! P-I ( ex + d/X ) ! XI exp 2ni .s;2JP are obtained by using the theory of zeta func- tions in algebraic function fields in one vari- able (A. Weil, 1948) ( 450 Zeta Functions). E. Waring's Problem The first formulation of Waring's problem is found in E. Waring, Mediluliunes ulyebruieue (1770), in which he discusses the problem of expressing an arbitrary positive integer as the sum of at most nine cubes or as the sum of at most nineteen biquadratics. Hilbert proved ( Section A) that there exists a positive integer s(k) such that, for any integer N, the equation X +X + ... +X= N 14 has a nonnegative integral solution if s  s(k). We denote by g(k) the least value of s(k), and by G(k) the least value of s(k) for which the equation is solvable with at most finitely many exceptions of N. Research concerning g(k) and G(k) received its initial impetus from the circle method considered by Hardy and Littlewood, and it underwent considerable development in the works of H. Wey1 and Vinogradov. Let rs(N) be the number of solutions of the above equation. We then have r,(N)= r 1 ( L eX P (2nic>:x k ) ) ' J 0 \Nllk x exp( -2niNex)dex, If we make the Farey dissection, translating the interval [0,1] slightly, then the main term of rs(N) is provided by the integrals over major arcs, and the residual term is derived from the integrals over minor arcs. Let p be a prime, and let M(N,pl) denote the number of solu- tions of the congruence equation x + x +... + x:=N(modpl). Then limlw M(N,pl)/pl(s-11 = Xp(N) is not zero, and the infinite product I1 p X p (N)=S(N) con- verges for s4k, where S(N) is larger than a positive constant which is determined inde- pendently of the choice of N. On the other hand, let q-l ( a ) S(a,q)= L exp 2ni-x k , xo q A(q,N)=q-' L s(a, qy ex P ( -2nt N ) . laq q (a,ql1 Then LI A(q, N) is absolutely convergent, and the sum is equal to S(N). According to Hua (1959) we have r ( 1 + l / k ) S r. ( N ) S ( N ) N,/k-l , r(s/k)' provided that s2k2(2logk+loglogk+c). Next, if we denote by V(N, b) the volume of the closed region satisfying N .s; x + x + '" + x.s; N + r5 in the s-dimensional Euclidean space, then limbo V(N, b)/b = Xw(N) exists and is equal to (r(1 + l/k)'/r(s/k))N s / H . Hence we can show that the principal part of r,(N) is equal to the infinite product I1 p X p (N), where p runs over all finite and infinite prime spots in Q, This is a generalization of the singular series studied by Hardy. With regard to g(k), there are studies by L. E, Dickson (1936) and others. It is easy to see that g(k)  2 k + (3/2)k - 2 and that G(k)  k + 1. It has been shown that g(3)=9, G(3).s;7 (G, L. Watson, Linnik, 1947) and that g(4).s;37, G(4)= 16 (H. Davenport, 1939). More gener- ally, Vinogradov (1959) proved that 
15 G(k).s; 2klogk +4k loglogk + 2klogloglogk+ck, To prove this, Vinogradov (1934) introduced the following integral, which is closely related to the tprime number theorem: I(P)= 1 1... 1 1 1 L exp(2ni(ex l x+ex 2 X Z + ... o 0 xp 1 2S +exkX k )) dex t ...dex k . Hua (1949) improved Vinogradov's result and proved that if sik(k+ 1)+lk, then I (P).s; (5S)5s1(log P)21 p 2s - k (k+l )/2 +0, where b=tk(k+ 1)(1-ljk)H. Concerning I(P), another notable approach was made by A. A. Karacuba and N. M. Korobov (1963). Further investigation proved that I(P)= C 1 C 2 p Z '-k(k+I)/2 +O(P 2S - k (k+l)/2) if S  cP logk (Vinogradov, Hua, 1959), The result is called the Vinogradov mean value theorem. There are many variations and generaliza- tions of this theorem. Vinogradov and Hua (1944) studied the problem of representing an arbitrary N as N = P + P + ... + P; (with Pi prime). Hua (1937) and others also considered the problem of representing N as N = f(xd + f(x 2 ) +... +.f(x s ), where fix) is a given poly- nomial. Also, let n n n C(xt,x z ,..' ,X n )== L L L CijkXiXjXk i=l j=l k=l be a homogeneous polynomial of degree 3 with integral coefficients. Davenport (1963) proved that if n  16, then the equation C(x t , X 2 , ..., x n ) = 0 has at least one nontrivial integral solution, There are further develop- ments of the theory of representations of in- tegers by forms in many variables by V. A. Tartakovskii, H. Davenport, B. J. Birch, and D. J. Lewis. F. Additive Problems in an Algebraic Number Field Siegel (1922) considered the generalization of Hardy's square sum problem to the case of an algebraic number field. He later (1945) studied the generalized Waring's problem in an alge- braic number field K of finite degree: Let I be the principal order of K, and let J k be the subring of I generated by the kth powers of integers in K. It is easily seen that the index (I: Jd is finite. Hence our concern regarding s(k) must be restricted to integers contained in J k . Another question is how to extend the Farey dissection to an algebraic number field. Siegel succeeded in solving these difficulties. 4 Ref. Additive Number Theory His ingenuity is seen in his way of dealing with the minor arcs, which provided a stimulus to the research of T. Mitsui (1960). A generalization of Goldbach's problem to the case of an algebraic number field was obtained by Mitsui (1960) and O. Korner (1961). As another extension of the Vinogradov three primes theorem, Mitsui (1971) proved the following theorem: Let K be an algebraic number field of degree n. Let C be the prin- cipal ideal class generated by a totally positive number in K, and P be the set of prime ideals of degree 1 contained in C. Let N be a positive integer and Is(N) be the number of representa- tions of N as the sum of the norms of s prime ideals belonging to P, I.(N) = L 1, PiEP (l.s;i.s;s). N==Np\+.. +Nps If N is sufficiently large and s  3, we have the asymptotic formula Ns- l ( 10 10 N ) I N - A S N - 0 N s - 1 g g s( )- s ( )(IogN)S+ (logN)'+1 ' where As is a positive constant depending on s and K independent of N, and SIN) denotes the singular series. If s = N (mod 2D), where D is the discriminant of K, then SIN)  c > 0, where c is a constant. Later this problem was ex- tended by Mitsui and T. Tatuzawa (1981). References [1] R. G. Ayoub, An introduction to the ana- lytic theory of numbers, Amer. Math. Soc. Math. Surveys, 1963. [2] G. H. Hardy and E. M, Wright, An intro- duction to the theory of numbers, Clarendon Press, fourth edition, 1963. [3J H. H. Ostmann, Additive Zahlentheorie I, II, Erg. Math., Springer, 1956. [4] H. A. Rademacher, Lectures on analytic number theory, Lecture notes, Tata Inst., 1954-1955, [5J E. G. H. Landau, Dber einige neuere Fort- schritte der additiven Zahlentheorie, Cam- bridge Univ. Press, 1937. [6] L. K. Hua, Additive Primzahltheorie, Teubner, 1959. [7] L. K. Hua, Die Abschiitzung von Ex- ponentialsummen und ihre Anwendung in der Zahlentheorie, Enzykl. Math., Bd. I, 2, Heft 13, Teil I, 1959. [8J 1. M. Vinogradov, Selected works (in Rus- sian), Akad. Nauk SSSR, 1952. [9] I. M. Vinogradov, Upper boundary of G(n) (in Russian), Izv. Akad. Nauk SSSR, 23 (1959), 637-642, [10] T. Mitsui, On the Goldbach problem in 
5A Additive Processes an algebraic number field I, II, J. Math. Soc. Japan, 12 (1960), 290-324, 325-372. [IIJ J.-R. Chen, On the representation of a large even integer as the sum of a prime and the product of at most two primes, Sci. Sinica, 16 (1973),157-176. [12J T. Mitsui and T. Tatuzawa, Sur un pro- bleme en theorie additive des nombres, Mono- graphies Scientifiques de la Maison Franco- Japonaise, fasc. 3,1981. 5 (XVII.g) Additive Processes A. General Remarks The study of sums of tindependent random variables has been one of the main topics in modern probability theory ( 250 Limit Theorems in Probability Theory). Combin- ing the ideas of this study with the considera- tion of tstochastic processes with continuous time parameter, we get the notion of additive processes. B. Definitions and Fundamental Properties A real-valued tstochastic process {Xt}o,,;,<O'' denoted by X(t) (O.s;t< 00) for the rest of this article, where for simplicity we assume that X (0) = 0, is called an additive process (or pro- cess with independent increments) if for any to < t l <... < tn, X(t i )- X(ti-d (i = 1,2,..., n) are tindependent. An additive process is essentially the same as a tspatially homogeneous tMar- kov process (i.e., a Markov process on R 1 that is invariant under translations). When, for any h>O and t>s, X(t+h)-X(s+h) and X(t)- X (s) have the same law, i.e., the distribution law of X(t)-X(s) depends only on t-s, we call the additive process X (t) temporally homo- geneous. This is essentially the same notion as a temporally and spatially homogeneous Mar- kov process. Let X(t) be a given additive process. If f(t) is a function oft only, then clearly y(t)=X(t)- f(t) is also an additive process, and we can choose f(t) in such a way that for every t >0 and for every sequence Sn it (sntt), Y(sn) con- verges with probability one. Here, lim Y(sn) is independent of a particular choice of Sn, and we denote it by Y(t -) (Y(t + )). We call such Y(t) a central process and also say that Y(t) is obtained from X(t) by centering, This f(t) is given, for example, by the condition E(arctan(X(t)- f(t)))O. 16 Let Y(t) be a centered additive process. Then Y(t -) = Y(t) = Y(t +) for all t > 0, except on an at most countable t-set S, and tES is called a fixed point of discontinuity of Y(t). Then Y l (t) = limno: Un(t) exists with proba- bility 1, where Un(t)= L (Y(Si+)- Y(Si-)) Oi::;;;n Si<t + Y(t)- Y(t-)-q, q being a constant determined by E(arctan Un(t)) =0, and S={sJ (j= 1,2, ...). Y 2 (t)= Y(t) - Ydt) is a centered additive process without any fixed point of discontinuity. Fur- thermore, Y1(t) and Y 2 (t) are independent. Thus we have a decomposition of Y(t): Y(t)= Ydt)+ Y 2 (t), where Y1(t) and Y 2 (t) are mutu- ally independent additive processes. The struc- ture of Y l (t) is simple. and it is not worthwhile to study its behavior in more detail. On the other hand, since Y 2 (t) is a centered additive process without any fixed point of discontinu- ity, it is an additive process that is tcontinuous in probability. Let Y2(t) be a tseparable modi- fication of Y 2 (t). Then the discontinuities of almost all sample functions of Y2(t) are of the first kind. If we set Y 2 *(t) = Y 2 (t +), Y 2 *(t) is a tmodification of Y 2 (t). and almost all sample functions of Y 2 *(t) are right continuous and have a left-hand limit at every t. In the study of the process Y 2 (t), it is always convenient to take such a modification. Thus we give the following general definition: An additive pro- cess is called a Levy process if it is continuous in probability and almost all sample functions are right continuous and have a left-hand limit at every t E [0,00) [3,9]. The notions of additive processes and Levy processes can also be considered fOr RN-valued processes. C. Additive Processes and Infinitely Divisible Distributions Let X(t) be a Levy process and <1>" (s<t) be the tdistribution of X(t)-X(s). Then <1>" is an tinfinitely divisible distribution ( 341 Proba- bility Measures G). Conversely, for a given infinitely divisible distribution <I> we can con- struct an essentially unique temporally homo- geneous Levy process X(t) such that <I> coin- cides with the distribution of X(I). If X(t) is temporally homogeneous, the tcharacteristic function <p,,(z) = E(eiz(X(')-X(,») of the distri- bution <l>s' is given in the form <p,,=exp«t- s)1/1(z)); hence the law of the process X(t) is completely determined by the function 1/1 (z), By the tLevy-Khinchin canonical form, 1/1(z) is 
17 written in the form f  ( ' ) V 2 . lZU ljJ(z)=imz--z + e,zu_I_ n(du), 2 -ac I +u (I) where m, vER, vO, and n(du) is a nonnegative f XJ u 2 measure on R- {O} such that n(du) _ 1 +u < 00. These m, v, and n(du) are uniquely deter- mined by ljJ(z). D. Basic Additive Processes Wiener Process. When almost all sample func- tions of a Levy process X(t) are continuous, the distribution of X (t) - X(s) is a tnormal distribution. If, further, X(t) is temporalIy homogeneous, ljJ(z) has the form ljJ(z) = imz- 1VZ2 In particular, if m = 0 and v = I, then X(t) is called a Wiener process or Brownian motion. This stochastic process was introduced by N. Wiener (1923) as a mathematical model for the random movement of colloidal par- ticles first observed by a British botanist, R. Brown. This is one of the most fundamental and important stochastic processes in modern probability theory ( 45 Brownian Motion). Poisson Processes. When almost all sample functions of a Levy process are increasing step functions with only jumps of size 1, the distri- bution of X (t) - X (s) is a tPoisson distribution. If, further, X(t) is temporally homogeneous, ljJ(z) in (I) has the form ljJ(z)=).(e iZ -l) (..1.>0), and X (t) is calIed a Poisson process, Let X (t) be a Poisson process, and let To, To + T I , To + TI + T 2 ,... be successive jumping times of a sample function X (t), Then To, T I , T 2 ,... is a sequence of mutually independent ran- dom variables with the common exponential distribution P(TEdt)=Ae-Mdt. Conversely, given such a sequence P;,}, if we define X(t)= inf{nl To+ TI + ...+ 7;,>t}, then X(t) is a Poisson process. Thus, for example, the num- ber of telephone calIs at a switchboard is a Poisson process when the intervals between successive calls can be regarded as inde- pendent and having a common exponential distribution. E. The Structure of the General Levy Process [8,9] In this section we restrict ourselves for sim- plicity to temporalIy homogeneous Levy pro- cesses. As we noted, the probability law of the process X(t) is determined by the function ljJ(z), The Levy-Khinchin formula (1) in a 5E Additive Processes certain sense shows that IjJ is a combination of a Wiener process and Poisson processes. This fact can be seen more clearly from the Levy-WI theorem, which states that the sample function of X(t) itself can be expressed as a composite of those of a Wiener process and Poisson processes. The Levy-Ita theorem actualIy implies formula (1) and, moreover, clarifies its probabilistic meaning. The Levy-Ita theorem can be summarized as follows: Let U be a Borel subset of R which has a positive distance from the origin, and let N(t, U) be the number of s such that X(s)- X(S-)E U, s.s; t. Then N(t, U) is a Poisson process. The expectation E(N(t, U) can be written in the form tn(U), where n(U) defines a nonnegative norel measure on R - {O}; fur- thermore, it satisfies f ro u 2 n(du)< 00. -00 1+ u Next, we set S,(t)= r uN(t,du) JIUI>' L (X(s)-X(s-)). st IX(,)-X(,-)I;;<' GeneralIy, S,(t) diverges as dO. However, with probability one, a centered process S,(t)=S,(t)-t r n(du) JI"I>' 1 +u converges uniformly in t on every finite inter- val as f.O, Furthermore, X (t) -lim,to S,(t) is continuous with probability one. However, a Levy process X(t) of which almost all sample functions are continuous has the form mt +.j; B(t), where m, vO are constants and B(t) is a Wiener process. Hence we have X(t)=mt+.j; B(t) +lim r ( UN(t,dU)-tn(dU) ) . (2) dOJIUI>' l+u Furthermore, we can show that if U I , U 2 ,..., Un are disjoint, then B(t), N(t, Ud, N(t, U 2 ), ...,N(t, Un) are mutualIy independent Levy processes. In particular, in (2) the terms are mutualIy inde- pendent. The m, v, and n(du) in (2) correspond, of course, to those in (1). Conversely, given m, v, and n(du), we can construct B(t) and N (t, U) with the properties above, and then (2) defines a Levy process which corresponds to ljJ(z) given by (I). The measure n(du) is calIed the Levy measure of X(t). 
5F Additive Processes F. Examples of Levy Processes Compound Poisson Processes. A temporally homogeneous U:vy process is calJed a com- pound Poisson process if almost all sample functions are step functions, namely, if ljJ(z) in (1) is given by ljJ(z) = Loooo (e iZU -l)n(du), A= f:oo n(du) < 00. If we set <D(du) = (I/A)n(du), then <D(du) is a probability distribution on R. <D is the distri- bution of the size of jumps when they occur. A compound Poisson process is constructed in the following way: Let To, T J , T2""; UI> U 2 ,... be mutually independent random vari- ables such that P(T" Edt) = Ae-;" dt, P(UnEdu)=<D(du), t>O, and let N(t)= U I + U 2 + '" + UN(I)' where n=inf{nl To+TJ +... + T,,>t}. Then X(t) is a compound Poisson process. Thus the number of jumps of X(t) follows a Poisson process, and the size of each jump obeys the distribution <D. Stable Processes. A temporally homogeneous Levy process X(t) is called a stable process if for every a> 0 we can find b > 0 and creal such that the processes Xdt)=X(at) and X 2 (t) =hX(t)+ct are equivalent in law, It is called a strictly stable process if in the above c can be chosen to be 0, X(t) is a stable process (resp. a strictly stable process) if and only if the corre- sponding infinitely divisible distribution is a tquasistable distribution (resp. tstable distri- bution). The texponent ex (0 < ex.s; 2) of the quasistable distribution is called the exponent (or index) of the stable process. ljJ(z) in (1) cor- responding to a stable process is given as folJows: f ro. du ljJ(z)=imz+C+ (e'ZU-l)J+; o u f o. du +c ( e IZU _l ) _ -00 lul1+" O<ex< 1, f ro ( . izu ) dU ljJ(z)=imz+C+ elZu_I__  o 1 + u 2 u 2 f o ( . izu ) du +C -w e lZu _1_ 1 +u2 ' ex=l, 18 f ro. du ljJ(z)=imz+C+ 0 (e IZU -l-izu) u1+, f o. du +c ( e IZU -l-izu ) - - -00 lul1+" 1 <ex<2, v ljJ(z)=imz- 2 z 2 , ex=2. Here m, C +, C _ , and v are real constants such that C+ O, C_ O with C+ +C_ >0 and v>O. ljJ(z) corresponds to a strictly stable process if and only ifm=O when ex# 1 and C+ =C_ when ex = 1. The above ljJ(z) is also expressed as follows: (i) if 0 < ex.s; 2, ex # 1, then ( nex Z ) ljJ(z)=imz-colzl' l-ifJtan- , 2 Izi where v c O =2 if ex=2 and nex C o = -(C+ +C_)q -ex)cos-, 2 C+ -C_ fJ if ex#2; C+ +C_ (ii) if a == I, then ljJ(z) = iyz-colzi ( 1 + ifJloglzl ) , n Izi where y=m+a(C++C_) ( a= roo ( sinu_ ) dU ) , J 0 1 + u 2 u 2 n C+ -C c O =- 2 (C+-C), fJ - . C+ +C_ When ex = 2, it is thus essentially a Wiener process: X(t)=mt+JV B(t), where B(t) is a Wiener process. When ex= 1, it is called a Cauchy process, a symmetric Cauchy process if m=O and C+ =C_ or equivalently y=fJ=O, or an asymmetric Cauchy process if fJ # O. Gener- ally it is called a symmetric stable process if m=O and C+ =C_ or ex=2 and m=O. In par- ticular, for the symmetric Cauchy process cor- responding to ljJ(z) = -lzl, we have 1 f x t P(X(t)<x)=- -22:dy. n -rot +y Next, when O<ex< 1 and m=C_ =0, almost all sample functions of X(t) are purely dis- continuous increasing functions (i.e., sums of positive jumps). In this case, X(t) is calJed a one-sided stable process of the exponent ex (or subordinator of the exponent ex). Now, if X (t) is 
19 a symmetric stable process of the exponent fJ (0< fJ.s; 2) and Y(t) is a subordinator of the exponent ex which is independent of X(t), then Z(t)=X(Y(t)) is a symmetric stable process of the exponent exfJ. This operation is called a subordination and is closely related to the theory of tfractional powers of tinfinitesimal generators of semigroups ( 261 Markov Processes) [2]. A stable process X(t) is defined in a simi- lar way when X(t) takes values in an N- dimensional space R N . In particular, if X(t) is a symmetric stable process of the exponent ex (0 < ex.s; 2) given by E(ei(Z, X('))) = e -'lzl", ZE R N , then for every bounded measurable function f(x) with compact support, we have E(I<X' f(X+X(t»dt) r«N -ex)/2) r x- ,-N d -2'n N / Z r(ex/2)JR) yl f(y) y. The right-hand side is the tRiesz potential of order ex ( 338 Potential Theory). This fact is a generalization of a well-known rela- tion between Brownian motion and Newton- ian potential (45 Brownian Motion), and through this relation we can study several properties of sample functions and also com- pute various quantities related to stable pro- cesses [I]. G. Sample Path Properties of Levy Processes Let X(t) be a temporally homogeneous U:vy process. For a Borel set B in R, the hitting time (JB is defined by (JB = inf{t > 0 I X (t)E B}. Recurrence. X (t) is called recurrent if 0" B < 00 a.s. for every nonempty open set B. Otherwise it is called transient. X (t) is recurrent if and f l 1 only if Re-dz = 00 (Port and Stone, -, -1/1(z) Ornstein). If, in particular, E(X (1)) exists, it is recurrent if and only if E(X (1)) = O. When X (t) is a stable process, it is recurrent if and only if ex> 1, or ex= 1 and fJ=O. Hitting Probabilities for Single Points. If B = {a}, (JB is denoted by O"a' OER is said to be regularfor X(t) if 0" 0 = 0 a,s. Set C = {xER I P«(Jx < 00) > O}. The following result is due to Kesten [10]: (i) If v # 0, then C = Rand 0 is regular; (ii) If v=O and Slluln(du)= 00, then either C = Rand 0 is regular, or C = 0 and 0 is not 5H Additive Processes regular according to whether I Re ..1. _ (Z) dz < 00 for all }, > 0 or = 00 for some ). > 0; (iii) If v =0 and .flluln(du) < 00, then the following four cases are possible, where a=m - SR u(1 + u Z )-' n(du) and S(n) is the sup- port of the Levy measure n: (a) C=0 when a=O, (b) C=(O, 00) when a>O and S(n)c(O, 00), (c) C=( -00,0) when a<O and S(n)c( -00,0), and (d) C = R in the remaining case. For further properties of sample functions  [4]. H. Generalization of Additive Processes A temporally homogeneous Levy process is, as we have seen, essentially a temporally homo- geneous Markov process on R which is homo- geneous in space (i.e., invariant under transla- tions of the space). Thus, on a homogeneous space when homogeneity in space makes sense, we can generalize the notion of additive pro- cesses. Let M be a thomogeneous space with transformation group G. A temporally homo- geneous Markov process X (t) is called an invariant Markov process, (or homogeneous Markov process) if its system of ttransition probabilities {P(t, x, E)} satisfies P(t, x, E) = P(t, gx, gE) for all gE G. Thus an additive pro- cess is exactly an invariant Markov process on R N when G is the group of translations. G. A, Hunt determined all invariant Markov processes when M is a tLie group or a factor space of a Lie group [6]. Let G be a Lie group and A=A(G) be the (left-invariant) tLie algebra of G. Let G c = G U {ill} be a tone-point compactification of G, and C be the set of all continuous functions on Gc- We can define Yf(YEAJEC) as usual by Y(f) = lim R n (,/ - f , tjo t l)(t)=exptY, Ra{(r) = {(w), when the limit exists uniformly. Let C z = {lECI Y(Xf) exists for every X, YEA}. Let XI' Xz, ..., X d be a basis of A(G), and let x I' x 2 , ..., X d be functions in C 2 such that x,(e) = o and Xi (x) (e) = r5ij (i,j = 1,2, ..., d; e is the unit element of G), Take a neighborhood of e, and define a function <p(g) = Lf1 xf(g) for 9 contained in the neighborhood, and extend this function to G c in such a way that <p E C 2 and <p  k > 0 (k = constant) outside of the neighborhood of e. Then gE G defines a trans- formation of G c by rg(J = g(J, rgill = ill, and in this way G c is supplied with the structure of a thomogeneous space with the transformation 
5 Ref. Additive Processes group G. Now let X(t) be an invariant Markov process on G c which is tcontinuous in proba- bility. Then the tsemigroup 1; (which is a t s trong1y continuous semigroup on C) of the process X(t) is characterized as follows: The domain of the infinitesimal generator A of 1; contains C 2 , and for fEC 2 d d Af(r) = L ai(X;/)(r) + L aijXi(Xjf)(r) i=l i.j:l + f { f(W)-f(r) Gc-{e} -i x;/(r).x;(O")}n(da), where ai, aij are real numbers (i,j = 1,2, ..., d) such that (a i ) is a symmetric nonnegative definite matrix, and n(dO") is a nonnegative measure on G c - {e} such that k- {e) <p(a). n(da) < 00. Conversely, given such ai' aij and n(da), there exists one and only one invariant Markov process on G c whose infinitesimal generator is given as above. A similar result is obtained when M is a factor space of a Lie group by its compact subgroup. Furthermore, for more concrete homogeneous spaces such as spheres or Loba- chevskii spaces (more generally, tsymmetric Riemannian spaces) the canonical form of the invariant Markov processes and infinitely divisible laws is obtained by making use of harmonic analysis [5]. References [1] R. M. Blumenthal, R. K. Getoor, and D. B. Ray, On the distribution of the first hits for the symmetric stable processes, Trans. Amer. Math. Soc., 99 (1961), 540-554. [2J S. Bochner, Harmonic analysis and the theory of probability, Univ. of California Press, 1955. [3J 1. L. Doob, Stochastic processes, Wiley, 1953. [4J B. Fristedt, Sample functions of stochastic processes with stationary, independent incre- ments, Advances in Probability and Related Topics III, P. Ney and S. Port (eds.), Dekker, 1974,241-396. [5J R. Gangolli, Isotropic infinitely divisible measures on symmetric spaces, Acta Math., 111 (1964),2\3-246. [6J G, A. Hunt, Semi-groups of measures on Lie groups, Trans. Amer. Math. Soc., 81 (1956),264-293. [7J K. Ito. On stochastic processes I, Japan. 1. Math., 18 (1942), 261-301. [8J K. Ito, On stochastic processes, Lecture notes, Tata Inst., 1960. 20 [9] K. Ito, Stochastic processes, Lecture Notes Series, no. 16, Aarhus Univ., 1969. [IOJ H. Kesten, Hitting probabilities of single points for processes with stationary indepen- dent increments, Mem. Amer. Math. Soc., 93 (1969). [IIJ P. Levy, Theorie de l'addition des vari- ables aleatoires, Gauthier-Villars, 1937. [12] P. Levy, Processus stochastiques et mouvement brownien, Gauthier- ViJlars, 1948. 6 (111.20) Adeles and Ideles A. Introduction The concept of idele was first introduced by C. Chevalley (J, Math. Pures Appl" (9) 15 (1936); Ann. Math" (2) 41 (1940)), for ta1gebraic num- ber fields. Later on, this concept and the alIied concept of adele were defined for tsimple alge- bras and also for talgebraic groups over algebraic number fields, and the two concepts became important in the arithmetical theory of these objects. We shall first explain the general concept of restricted direct product, by means of which adeles and ideles wilI be defined. B. Restricted Direct Product Let I be an index set. Suppose we are given, for each pEl, a tJocally compact group G p , and for each p except for a given finite set, say p" P2, ..., p" a compact open subgroup Up of G p . Let G be the subgroup of the direct prod- uct I1 pE1 G p consisting of elements (gp) whose Gp-components gp lie in Up, except for a finite number of p. Put U == I1=1 G pi x np#pi Up' Then U is a locally compact group with re- spect to the tproduct topology. The group G can be supplied naturally with a topology with respect to which G is a locally compact group and the quotient space GjU is discrete. The group G together with this topology is called the restricted direct product of {G p } with re- spect to {Up}. C. Adeles and Ideles Let k be an talgebraic number field of finite degree and I be the totality of finite and in- finite tprime divisors of k, For each pEl we denote by kp and kp x the tcomp1etion of k with respect to p and the multiplicative group of nonzero elements of kp, respectively. Further- more, for each finite prime divisor p, we de- note by op and up the ring of tp-adic integers 
21 of k and the multiplicative group of tun its of 0p, respectively. (1) Since op is a compact open subgroup of kp as an additive group, we can construct the restricted direct product Ak of {kp} with re- spect to {op}. Then Ak is a locally compact ring with respect to the componentwise ring oper- ations. We call Ak the adele ring (or ring of valuation vectors) of k, and an element of Ak an adele (or valuation vector) of k. The element of the direct product I1 p kp whose p-component is a fixed element of k for all p is an adele. We call such an ade1e a principal adele. Since up is a compact open subgroup of kp x for each finite prime p, we can construct the restricted direct product J k of {kp X} with respect to {up}. We call J k the idele group of k and an element of J k an ide Ie of k. The element of the direct product II kp x whose p-component is a fixed element of k for aJl p is an idele. We call such an ide1e a principal idele. Each element b of J k induces an automorphism j;, of the additive group Ak defined by j;,( 0) = b. 0 (0 E Ad. Thus J k can be regarded as a subgroup of the automorphism group Aut(Ad of the additive group Ak' The topology of J k coincides with the relative topology of J k as a subgroup of Aut(Ad. We note, however, that the topology of J k is dif- ferent from the relative topology of J k as a su bspace of Ak, and the former is stronger than the latter. Finally, for a tfunction field in one variable over a tfinite field, the ade1e ring and the ide1e group can be defined similarly. (2) Let 9i be a tnorma1 simple algebra over k and D be a tmaxima1 order of 9i. For each pEl put 9i p = 9i @ kkp, and for each finite prime divisor p put Dp=op'D. Then Dp is a compact open additive subgroup of 9i p . By the ade1e ring A9j of 9i we mean the restricted direct product of {9i p } with respect to {Dp}' Let 9i p x and Up be the multiplicative group of nonzero divisors of 9i p and the multiplicative group of the units of Dp, respectively. (Up can be defined only if p is a finite prime divisor.) By the ide1e group J9j of 9i we mean the re- stricted direct product of {9i p X} with respect to {Up}. The notion of principal ade1e (or ide1e) of 9i can be defined similarly, as in (1). The struc- tures, as topological groups, of A9j and J9j do not depend on the choice of a maximal order D. The adele ring Ak and the idele group J k described in (1) are special cases of A9j and J9j, respecti vel y. (3) Let G be a linear t a 1gebraic group de- fined over k, and let G p be the set of kp- trationa1 points of the group for each pEl. For each finite prime divisor p, let Up be the set of elements (X of G p such that the coordinates of both (X and (X -1 are p-adic integers. We can then construct the restricted direct product of 6D Adeles and Ideles {G p } with respect to { Up}, which is called the idele group (or adele group) of G, In the following section we focus on describ- ing the fundamental properties of adeles and ide1es of an algebraic number field k. We shall start, however, by observing more generally those ade1es and ide1es of a normal simple algebra 9i over k, (For the properties of the ade1e group of algebraic groups  [7J; 13 Algebraic Groups.) D. The Structures of the Adele Ring and Idele Group Let 9i be a normal simple algebra over an algebraic number field of finite degree k. We identify the totality of principal ade1es of 9i (principal ideles of 9i) with 9i (9i X), and denote it by the same letter 9i (9i X). Then 9i (9i X) is a discrete subgroup of A9j (J9j). The quotient group A9j/9i is compact. Denoting by l(Xplp «(Xp E k p ) and N p( (Xp) ((Xp E 9i p ) the tnorma1ized valuation of kp and the treduced norm from 9i p to kp, respectively, we define, for OEJ9j, a posi- tive number: V(o)=I1 p €IIN p «(Xp)l p , where 0= «(Xp)' We call V(o) the volume of o. If 0 is a prin- cipal ide1e, we have V(o) = 1 by the tproduct formula on valuations. Denote by J.the set ofideles 0 with V(o) = 1 and put C=J/9ix, Then C has finite volume with respect to the tHaar measure of J9!. Furthermore, C is compact if and only if 9i is a tdivision algebra. In particular, q is compact. Let Q be the field of rational numbers. For each rational prime p, we define a character Ap of the completion Qp of Q with respect to the p-adic topology (by a character of Qp, we mean a continuous homomorphism from Qp to the I-dimensional torus R/Z): If p = p 00 is the infinite prime of Q, then we put ).Poo(x) = -xmodZ (XEQp)' If pis finite, then we let Ap be the composite of the following three canonical homomorphisms, namely, the one from Qp to Qp/Zp, the one from QvJZp to Q/Z, and the one from Q/Z to R/Z. We define a character Ap of 9i p as follows: AP=}'potr(9i p /Qp), where p is the rational prime divisible by p and tr(9i p /Qp) denotes the treduced trace from 9i p to Qp' For X,YE9i p , put (x, Y)p = exp(2niA p (xy)). Then the additive group 9i p is self-dual relative to (x, Y)p' Fur- thermore, if we put < 0, b) = I1 p (a p , bp)p for 0== (a p ) and b == (b p ) E A9j then A 91 is self-dual relative to <0, b). The tannihi1ator of the group of principal ade1es with respect to < 0, b) is 9i. Hence it follows from tPontryagin's duality theorem that A9j/9i is compact. Hence- forth let 9i=k. We call the quotient group C k = Jdk x (an element of Cd the idele class group of k (an idele class). If a character X of J k satisfies the condition x((X) = 1 for all (xEk (i.e., 
6E Adeles and Ideles if X is a character of C k ), we call such a charac- ter a Grossencharakter (or Hecke character). Grossencharakters were introduced by E. Hecke as characters of a certain type of the tideal group of k (Math. Z., 1 (1918),5 (1920)), but they are essentialIy the same as the ones defined above [1]. Let Dk be the connected component of the identity element of C k . Then CklD k is totally disconnected and compact. Hence a Grossencharakter X is of finite order if and only if X(D k ) = 1. We can prove by tclass field theory that CklD k is canonicalIy isomor- phic to the Galois group over k of the maxi- mal Abelian extension of k ( 59 Class Field Theory). For the structure of D k , the following fact is known: Let r l and r2 be the number of trea1 infinite prime divisors and timaginary infinite prime divisors of k, respectively. Then the dual group of Dk is isomorphic to R x Q'I +,,-' X Z", where R is the additive group of real numbers with the usual topology and Q (Z) is the additive group of rational numbers (rational integers) with the discrete topology. Let F be a function field in one variable over a finite field F o. The properties of the adele ring and idele group of F are similar to the prop- erties of Ak and J k , while the group C F has a simpler structure than C k . To explain the structure of C F , let F be the maximal Abelian extension of F, e be the Galois group of F/F, and G F be the subgroup of e consisting of the elements 0" such that O"(ex)=ex qn for all exEl?o (= the talgebraic closure of F 0)' where q is the number of elements of the finite field Fo and n is a given rational integer. Also, let G be the subgroup of G F consisting of the elements inducing the identity map on Yo' G is a com- pact group with respect to the tKrull topology. G F can be naturally supplied with a topology such that the group G F is a localIy compact group and the quotient group GFIG is dis- crete. Then class field theory implies that G F is isomorphic to C F as a topological group. The following characterization of the adele ring of a number field or function field in one variable over a finite field is the work of K. Iwasawa (Ann. Math., (2) 57 (1953)), Let A be a tsemisimple commutative and loca1\y compact topological ring with unity 1. Assume that A is neither discrete nor compact, and moreover that A contains a discrete subfield k31 and A/k is compact. Then k is an algebraic number field or a function field in one variable over a finite field, and A is isomorphic to the adele ring of k as a topological ring. E. Ideles and Cohomology Let K be a Galois extension of finite degree of an algebraic number field k, and (fj be the 22 Galois group of the extension K/k. (fj operates naturally on the idele group J K and the idele class group C K of K. The structures of the tcohomology groups of (fj with the coefficient groups J K and C K were investigated by G. Hochschild, T. Nakayama, E. Artin, J. Tate, and others. In particular, we have H'(ffi,C K ) = {O} and H 2 (ffi, CK) Z/nZ (cyclic group of order n), where n = [K: k]. These facts play an important role in one of the proofs of class field theory ( [3]; 59 Class Field Theory). Furthermore, A. Weil introduced the so-ca1\ed Weil group, which is a tgroup extension of a certain type of C K by (fj, He defined the most general L-functions, which include both tArtin L-functions and tHecke L-functions with Grossencharakters ( [2]; 450 Zeta Functions). F. Fourier Analysis on the Adele Group tDedekind zeta functions and Hecke L- functions are tmeromorphic on the whole complex plane and satisfy functional equa- tions of certain types. This can be proved by methods of Fourier analysis on the adele group Ak (Artin, Iwasawa, Tate [1,8,9]). For a continuous complex-valued function <p(a) on Ak satisfying suitable conditions, we define the Fourier transform of <p(a) as follows: cP(a)= f <p(a)<a,b)db, Ak where db denotes the Haar measure on Ak' By normalizing db suitably and applying tPoisson's summation formula, we get, for each idele a of k, L <p(aex) = V(a)-' L cP(a-' ex). aek aek This is called the E)-formula. Consider the following integral on J k : ((s) = f V(a)'X(a)<p(a)d*a, J k where d*a denotes the Haar measure on J k , s is a complex number, and X is a Grossen- charakter of k, namely, a character of C k . This in tegra1 con verges if s > 1, and by using the E)-formula one can show that (s) is meromor- phic on the whole complex plane and satis- fies a functional equation of a certain type. When the function <p is of special type, then the above integral can be explicitly expressed as the product of L-functions, r-functions, and exponential functions. This method of express- ing L-functions by integrals on J k and apply- ing the E)-formula can be applied to investi- gate tHey zeta functions and L-functions of various types defined for a simple algebra ( 
23 450 Zeta Functions) (G. Fujisaki [6J; T. Tama- gawa, Ann. Math., 77 (1963)). References [IJ 1. Tate, Fourier analysis in number fields and Heeke's zeta-functions, Thesis, Princeton Univ., 1950. (Algebraic number theory, edited by 1. W. S. Cassels and A. Frohlich, Academic Press, 1967, ch. 15.) [2] A. Weil, Sur la theorie du corps de classes, 1. Math. Soc. Japan, 3 (1951),1-35. [3J E. Artin and 1. Tate, Class field theory, Lecture notes, Harvard Univ., 1961 (Benjamin, 1967). [4] E. Artin, Representatives of the connected component of the idele class group, Proc. Intern. Symp. Alg, Number Theory, Tokyo and Nikko, 1955,51-54. [5J G. Shimura and Y. Taniyama, Modern number theory (in Japanese), Kyoritu, 1957, [6] G. Fujisaki, On the zeta-function of the simple algebra over the field of rational num- bers, J. Fac, Sci. Univ. Tokyo, 7 (1958), 567- 604. [7] A. Weil, Adeles and algebraic groups, Lecture notes, Institute for Advanced Study, Princeton, 1961. [8J S. Lang, Algebraic numbers, Addison- Wesley, 1964. [9J A. Weil, Basic number theory, Springer, 1967. 7 (VI.15) Affine Geometry A. Construction of Affine Spaces An affine space A is constructed as follows: Let V be a tvector space over a tfield K, and let A be a nonempty set. For any vector a E V and any element P of A, suppose that an addition P + a E A is defined so as to satisfy the following three conditions: (i) P + 0 = P (0 being a zero vector); (ii) (p+a)+ b= p+(a+ b) (a, bE V); and (iii) for any qEA there exists a unique vector a E V such that q = p + a. (Condition (i) follows from (ii) and (iii).) Then we call A an affine space, V the standard vector space of A, and K the coefficient field of A. Each element of A is called a point. If we fix an arbitrary point oEA, there is a one-to-one correspondence between A and V gi ven by the mapping sending pEA to a E V such that p = 0 + a. Such an element a of V is called a position vector of p with the initial point 0 and is denoted by op. We say that r+ 1 7B Affine Geometry points p, (O.s; ex.s; r) of A are independent if r vectors a i = POPi (1 .s; i.s; r) are linearly inde- pendent in V; otherwise, they are said to be dependent. This definition of dependence of points p, is independent of the choice of the initial point among them. If V is of dimension n, we say that A is of dimension n, dim A = n; in this case, we sometimes write An instead of A and v n instead of V. The affine space A is of dimension n if and only if the maximum num- ber of independent points in A is n + 1. Next, for any vector subspace V k of v n and an arbitrary point pEA n , we put A= {qEAnl q = p + x, x E V k } and call it a subspace of An. It is an affine space of dimension k. Conversely, every subset of An that is an affine space under the affine space structure of A can be expressed in this form. A I, A 2, and A n-I in An are called a line, plane, and hyperplane, respectively. A set that consists of only one point is also consid- ered as a subspace A O . For subspaces A' and A S of An, we denote by A' n AS the intersection (i.e., the set-theoretic intersection) of A' and A', and by A'U AS the join of A' and AS (i.e., the intersection of all subspaces that contain both A' and AS). Then A' n A' is the affine space of highest dimension contained in A' and AS, and A' U AS is the affine space oflowest dimension that contains A' and AS. If r + 1 points are given in An, there always exists a subspace A' that contains all these points. In particular, if the points are independent, then such an A' is unique. Moreover, if A'nA s #0 (0 is the empty set), then we have r+s=dim(A'UA S )+ dim (A' n A S). This is called the dimension theorem (or intersection theorem) of affine geometry. Next suppose that r + I points p, (O.s; ex.s; r) in An are independent, and put P,+I == Po. Let q, be an arbitrary point on p, U PHI that differs from p, and P,+1' If A' are elements of K such that ..1.'. p;q;, = q;p;;;, then qo, ..., q, are de- pendent if and only if ..1. 0 ..1. 1 ... ..1.' = ( _1)r+1. And if r)o 2 and O",=q,U Pa+2 U... U Pa+, (Pi+'= Pi-I if i)o 1), then 0"0' ". , 0", have a point in common if and only if ..1. 0 ),1 ... ..1.' = 1. The former is called Menelaus's theorem, and the latter Ceva's theorem. The set L(A) of all subspaces (including 0 considered as an affine space of dimension -1) constitutes a tlattice with respect to the inclu- sion relation, B. Parallelism in Affine Spaces Let A' and AS be subspaces of An. We say that A r and AS are parallel in the wider sense if either of the following conditions holds: (i) A'=:> AS or AS=:> A'; or (ii) A'n A S = 0 and dim(A'UA').s;r+s. Next, let A' and 
7C Affine Geometry B' be subspaces of An of the same dimension. If A' and B' coincide, or A' n B' = 0 and dim(A'UB')=r+ 1, then they are said to be parallel in the narrower sense (or simply paral- lel), and we denote the relation by A'II B'. If r = s = 1, the definitions of parallelism in the narrower sense and wider sense are equivalent. If r > 1, parallelism in the narrower sense im- plies paralIelism in the wider sense. For two sets (a,) and (b,) (O.s;IX.s;r) ofr+ 1 independent points, let V' and W' be vector spaces with bases a;;ai and bob i (1.s; i.s; r), respectively. Then A' =aoU... Ua, and B'=boU.,. U b, are paraIle1 if and only if V' = W'; and for an arbitrary point p, there exists a unique r- dimensional subspace that is paraIlel to A' and passes through point p. If A' and B' are paral- lel in the wider sense, then there exist sub- spaces At and B' (t» 1) of A' and B S that are paraIlel to each other. Moreover, if neither A' nor B' is contained in the other and if t is the largest integer with the property just given, then we have t=r+s+ I-dim(A'U B S ). Parallelism between subspaces of An is an equivalence relation. Specifically, the equiva- lence class of a I-dimensional subspace A I is caned a point at infinity and is denoted by A. Given a subspace A' of An, the set of points at infinity A represented by lines A I con- tained in A' is denoted by A';l; we have A'II B' if and only if A';; 1 = B';;l. A';I is called a space at infinity. In particular, the set A-I is called the hyperplane at infinity. The set-theoretic sum AnUA-1 =A n is supplied with the structure of a tprojective space; the "points" in An are elements of An, and the "lines" in An are A I U A and A. C. Coordinates of Affine Spaces If we fix a point 0 in An and a basis {e l ,..., en} of the standard vector space V n , then any point p in An is uniquely expressed as n p=o+ L xi.e i , XiEK. i=l The system 3' = (0; e I' , .. ,en) is called an affine frame (simply the frame) of An; the point 0 is called its origin, and e i is called the ith unit vector. The mapping sending p to (Xl, .. . , x n ) gives a tbijection of An to K n ; we call (Xl, ..., x n ) affine coordinates of p with respect to 3', and Xi the ith affine coordinate. In partic- ular, if K is a topological field (e.g" the real number field R or the complex number field C), this bijection An-->K n induces a topology of An, which can be shown to be independent of the choice of 3'. For the remainder of this article, by "coordinates" we mean affine co- ordinates unless otherwise stated. Putting 24 (1) a i ==o+e;(l.s;i.s;n), we sometimes call (o;a l , '" ,an) an affine frame. Further, putting Ii = 0 U ai, ni=oUa l U... Ua i - l Ua i + l U... Ua n , we call ai, I;, and n i the ith unit point, the ith coordi- nate axis, and the ith coordinate hyperplane, respectively. Assume that subspaces A' and AS (r,s>O,r+ s = n) are not parallel in the wider sense. For a point p of An, denote by A'(p) the subspace that passes through p and is parallel to A', and put q = A'(p) n AS. A mapping cp: An-->A s de- fined by <p(p) = q is called a parallel projection on AS with respect to A'. In particular, if A' = n i and AS=l i (r=n-1,s= 1), we write <p(p)=p;- Then the ith coordinate Xi of p is an element of K such that Ofii==xiOii;. Hence such coordi- nates are also caned parallel coordinates (or Cartesian coordinates). Suppose that we are given r+ 1 points b o , ...,b, of An and r+ 1 elements ..1. 0 , ...,..1.' of K such that L:=o ..1.' = 1. We fix a point 0 of An. If a point p in An satisfies op = L:=o A' ob :, then bop = LI Ai bob i ; hence p is contained in the subspace b o U .., U b" Conversely, if a point p is contained in the latter subspace, then there exists a system (..1. 0 , ... , ..1.') such that op = Lo A'ob, and Lo): = 1. The system (..1. 0 , . .. , ..1.') has a geometric mean- ing since we also have o'P =LoA' o'b, if we replace the point 0 by any other point 0' of An. The elements ..1. 0 , .,. , ..1.' are called barycentric coordinates of p with respect to {b o , ...,b,}. In particular, if {b o , "', b,} are independent, then the barycentric coordinates (..1. 0, '" ,..1.') are uniquely determined by the point p on b o U ., . U b,. Furthermore, let (y 1 , .. . , yn) be affine coordinates of p with respect to an affine frame 3', and let (x, ... ,x) be affine coordi- nates of b, (IX = 0, "', r). Then p belongs to the subspace A' = b o U... U b, if and only if i = Lo A'x (i = 1, ..., n). In this case we say that the system of the linear equations yi = LA a x (LA' = 1) gives a parametric represen- tation of the subspace A' (by parameters A'). Specifically, ifr = n -1, the solvability of the system of equations i=L::::A'x (i= 1, ...,n), 1 = L A' implies the equation L7=1 yi J1i = J10 for some nontrivial constants J10, '" , J1n' Hence the latter equation represents the hyperplane n = An-I. If a point p has barycentric coordi- nates },0=..1.' =... =A'=(r+ 1)-1 with respect to {b o , ... , b,}, it is called the barycenter of b o , b l , ..., b,. The barycenter is uniquely deter- mined by the set {b o ,..., b,} and is denoted by g(b o , ..., b,). Specifically, the barycenter of two points b o and b l is called the midpoint (or middle point) of b o and b l . If we divide B = {b o , ... , b,} into two sets of points and if gl and g2 are barycenters of these two sets of points, respectively, then gl U g2 passes 
25 through the barycenter of B. More generally, a point with barycentric coordinates (..1.0, .,., A') with respect to {b o , ... , b,} is called a barycen- ter of b o , ..., b, with weights ..1. 0 , ..., A'. D. Affine Spaces over Ordered Fields Suppose the coefficient field K is an tordered field (e.g., the real number field R). Given a hyperplane n of An, we take an affine frame for which n is the nth coordinate hyperplane. If we denote coordinates of points with respect to this frame by (Xl, "', x n ), then the equation of n is given by xn=O. Let A"+- and A"- be Sets of points whose nth coordinates are positive and negative, respectively. They are called half- spaces of An divided by n. The union of nand a half-space is called a closed half-space. A half-space of a subspace A' of An (divided by some A,-I on A') is called a half-space of di- mension r. For a point P of An that does not lie on n, the half-space containing P is called the side of P with respect to n. In particular, when n = 1, let P and q be two points on a line I. The closed side of q with respect to P is called the (closed) half-line (or ray) from P to q. The inter- section of the closed half-lines emanating from P to q and from q to P is called the segment joining P and q and is denoted by pq. Clearly pq = q P, A subset C of An is called a convex set if the segment joining two arbitrary points of C is contained in C. Each half-space of each dimension is convex. For any family C, of convex sets, n, C, is also convex. Therefore, for any subset D in An there exists a minimal convex set that contains D. It is called the convex closure (or convex hull) of D. The con- vex closure C(P) of a finite set of points P = {Po,... ,Pk} in An is called a convex cell, and dim (Po U ... U Pk) is caned the dimension of the convex cell. In particular, when Po, ..., Pk are independent, C(P) is called a k-dimensional simplex with vertices Po, ..., Pk' The I-dimen- sional simplex having two distinct points P and q as vertices is the segment pq , and the vertices P and q are called ends of the segment. A point is regarded as a O-dimensional sim- plex. Each 2-dimensional or 3-dimensional simplex is called a triangle or tetrahedron, respectively. A k-dimensional simplex S with vertices Po, "', Pk is a set of points whose barycentric coordinates )! (O.s; ex.s; k, LA' = I) with respect to the vertices satisfy ..1.')0 O. On the other hand, if we put A k = Po U . " U Pk and n, = Po U ... U P,_I U PHI U... U Pb and denote by A the side of p, in A k and by A the closed side of p, in A k with respect to n" then the . I S . , b n k A -k d n k A k' simp ex IS given y FO ., an FO, IS called an open simplex. An has the structure of a ttopological space in which the set of open n- 7E Affine Geometry dimensional simplexes forms a tbase of topen sets. In particular, if K is R, the topology of An thus defined is compatible with the one that is naturally induced by the topology of R. With respect to this topology, An is a tHausdorff space. The terms open and closed used before for n-dimensional simplexes agree with the corresponding notions with respect to this topology. A subset of An is said to be bounded if it is contained in some simplex. A bounded set obtained through a finite process of construct- ing intersections and unions from a finite number of closed half-spaces is called a poly- hedron. The points of a convex polyhedron are characterized by several linear inequalities satisfied by their coordinates. A set of points whose coordinates (Xl, ..., x n ) satisfy hi.s; xi.s; k i for k i , hi E K is caned a parallelotope; it is a polyhedron whose tinterior is called an open parallelotope. A simplex is a polyhedron, and polyhedra admit tsimplicial decompositions. A polyhedron can also be defined as the set- theoretic union of a finite number of simplexes. Let P be a finite set of points, and let its convex closure C(P) be a convex cell of dimen- sion m. Then C(P) is contained in an m- dimensional subs pace Am. The tboundary of C(P) in the topological space A m is called the boundary of a convex cell C(P). We can take a subset Q of P so that dimC(Q)=m-l and C(Q) is the intersection of the boundary of C(P) and an (m - 1 )-dimensional subspace. Such a C(Q) is called a face of C(P), and we denote this relation by C(P)>-C(Q). If C(P)>- C(P I )>-... >-C(P s ), then C(Ps) is called an (m-s)-dimensional face of C(P). A 0- dimensional face is caned a vertex, and a 1- dimensional face is called an edge. Suppose that C(P)>-C(Q) for P={Po, "',Pk} and Q = {Pio' "', Pik-l}' Then F = Pio U ... U Pik-l is a hyperplane of E = Po U... U Pk, and C(P) is contained in a closed side of E divided by F. Therefore, if C( P) has d (m -I)-dimensional faces, then C(P) is expressed as the intersection of d m-dimensional closed half-spaces. This shows that any convex cell is a polyhedron. E. Affine Transformations A mapping <p:An-->A m is an affine mapping if there is a linear mapping ip: V n --> V m of the standard vector spaces of An and Am such that <p(p+x)=<p(p)+cp(x) holds for any pEA n and any XE V n . An affine mapping of An into itself is called an affine transformation (or affinity) of An. Specifically, a bijective affine transforma- tion is called a regular (or proper) affine trans- formation. An affine transformation <p of An is characterized by each one of the following 
7F Affine Geometry properties: (i) Let oEA n be a fixed point. Then qJ is a mapping of A" onto itself that can be expressed as rp(o + x) = 0 + a + f(x), where a is a fixed vector of V" and f is a linear transformation of V n , (ii) T he mappi ng <p: An->A n is a mapping such that <p(a)cp(b)= /. rp (p)cp(q) if =A' pq UEK). Moreover, if the tcharacteristic of K is not equal to 2, an affine transformation is also characterized as follows: (iii) <p is a mapping that sends lines into lines and preserves the ratio of each pair of parallel segments. The set (An) of all regular affine transfor- mations of A" constitutes a group that we call the group of affine transformations. If the linear mapping f associated with a regular affine transformation !fI is the identity map- ping, then (jJ is called a translation. The set %( A") of all translations is a normal subgroup of 9(A n ) and is called the group of translations. The group of translations is isomorphic to V regarded as an additive group. The vector group !B(A n ) (i.e., an additive group of a linear space) acts tsimply transitively on A". We see that I2£(A n )j!B(A n )::::::GL(n, K), where GL(n, K) denotes the tgenerallinear group. The set of all regular affine transformations that leave a point 0 of A" invariant constitutes a subgroup (I)(A n ) of !(An); it is called an tisotropy group at 0 and is isomorphic to GL(n, K). Let !Y = (0; e, , , e,) be an affine frame of A" with origin 0, and let !fI be a regular affine trans- formation of A" given by (2); put x = L x'e i , ip(o +x) = 0 + L xie i , a =  aie i , and f(eJ = L a{e j . Then ip is expressed with respect to  by the following equation: n .xi==ait I axk, det(aj) #0, I .s;i.s;n. (3) k1 Conversely, a transformation that is given by (3) is a regular affine transformation. Elements of !B(A n ) and ffi(A n ) are expressed with respect to  by Xi=Xi ta i , 1 in, and n Xl = L axk, k=l respectively. Hence \lI(A n ) is represented as a tsemidirect product group of !B(A n ) and (F>(A"), In particular, a regular affine transformation that is represented by Xi = ax i (1 .s; i.s; n) for some a E K (a # 0) is called a similarity with the origin 0 as center. According to F. Klein, the objects we deal with in affine geometry are the properties (parallelism, barycenters, etc.) that are invar- iant under regular affine transformations. det(a;) ¥' 0, l.s;i.s;n, 26 (2) Subsets S, and 52 of A" are called affinely congruent if there exists a regular affine trans- formation rp sending Sl onto S,. For a fixed k, two k-dimensional simplexes are affinely congruent. Now we fix an affine frame  in A" and denote by x the coordinates of n t I points p, (0 :::; a  n) in A ". Then the quantity 1 X6 Xl Xl V(Po, ...,Pn)=- I n (6) n! x n x n x n 0 I n is called the volume with respect to 3' of the II- dimensional simplex with vertices Po, . P.. If (p is a regular affine transformation given by (3), we have V = det(aJ) V. Hence the ratio of volumes of two n-dimensional simplexes is independent of the choice of coordinate sys- tems, and is invariant under regular affine transformations. A regular affine transformation given by (3) satisfying det(aj) = I is called an equivalent affnity. The set of all equivalent affinities constitutes a subgroup of I2£(A n ). The geometry belonging to this group is called affine geome- try in the narrower sense. For instance, the concept of volume is an invariant in affine geometry in the narrower sense. F. Relation to Projective Geometry (4) Let pn be a projectIve space over a coefficient field K (343 Projective Geometry). If we fix a hyperplane 1rC( in pn, then the set of projec- tive transformations that leave n r invariant constitutes a subgroup of the group of tprojec- tive transformations of pn; this subgroup is isomorphic to a group of regular affine trans- formations. Actually, if we use a tprojective frame [a o , a" ,a" u] such that a" , a, are points on 1r00 ' then each projective transfor- mation leaving n,x invariant is expressed by equations of the same form as (3) with respect to the tinhomogeneous projective coordinates. The point set A" complementary to 11:C1J in P" is an affine space, and ITx, coincides with the hyperplane at infinity. Moreover, two distinct lines in pn are parallel in A" if they meet on the hyperplane at infinity. Hence, denoting by (0, 1', . , In) the +homogeneous projective co- ordinates of the intersection of a line I in A" and ITC(, we call (II, , In) the direction ratio of 1. A projective transformation leaving each point of ncyj invariant induces a translation. The tprinciple of duality that holds in projec- tive geometry does not hold in affine geome- try. The tpole of the hyperplane at infinity, with respect to a quadric hypersurface, is called the center of the quadric hypersurface. A regular quadric hypersurface is called central (5) 
27 or non central according as its center belongs to An or is a point at inl1nity. Quadric hypersur- faces in an affine space are c1assilled in several ways, by taking account of their relations with the hyperplane at infinity ( 78 Conic Sec- tions, 350 Quadric Surfaces). References [ I J O. Schreier and E. Sperner, Einflihrung in die analytische Geometrie und Algebra I, II, Teubner, 1931, 1935; English translation, Introduction to modern algebra and matrix theory, Chelsea, second edition, 1961. [2] E. Sperner, Einflihrung in die analytische Geometrie und Algebra I, II, Vandenhoeck & Ruprecht, 1948, 1951. [3] E. Artin, Geometric algebra, Interscience, 1957. [4J H. Weyl, Raum, Zeit, Materie, Springer, fifth edition, 1923; English translation, Space, time, matter, Dover, 1952. [5] S. Iyanaga and K. Matsuzaka, Affine geometry and projective geometry, J. Fac. Sci. Univ. Tokyo, 14 (1967), 171-196. 8 (111.1) Algebra Though the word "algebra" usually refers to a field of mathematics as will be explained below, the word may also denote specific mathematical structures such as tassociative algebras, tJordan algebras, tClifford algebras, etc. The first concepts concerning "unknowns" in algebra originated in India, whence came also our decimal positional system of numer- ation. These ideas were introduced to Europe through Arabia in the Renaissance period. F. tViete systematized them into a symbolic method, called algebra, representing numbers by letters. The first problem of algebra was solving equations. Before Viete, G. Cardano and L. Ferrari had solved algebraic equations of degrees 3 and 4; the solution of equations of lower degree had been known from antiquity. The effort to solve equations of higher degree remained unresolved until the middle of the 19th Century, when N. H. tAbel and E. tGalois proved the nonexistence of algebraic solutions of such equations. They considered not only individual roots of these equations but also any rational transforms of their roots at the same time, and thus were led to the concept of tfields. They also noticed that the problem of algebraic solution could be characterized by properties of permutation groups of the roots. 8 Algebra After the discovery of the Galois group, group theory and group-theoretical considerations maintained the central position in algebra for some time ( 172 Galois Theory). These de- veloped into the "abstract algebra" of this Century in the general atmosphere of arithme- tization and ofaxiomatization of mathematics. At the turn of the century the monumental textbook in three volumes by H. Weber [1J was considered the standard work on algebra. Then there appeared in 1910 an epoch-making paper [2] by E. Steinitz on the abstract theory of l1elds. The main objects of algebra today are talge- braic systems of various kinds, such as tgroups, trings, tfields, and tmodules. Another fun- damental concept of algebra is that of tiso- morphism or of thomomorphism. The col- lection of algebraic systems of a given kind, together with the homomorphisms among them, gives rise to the notion of tcategory; a functor is a sort of homomorphism between categories ( 52 Categories and Functors). These notions were first used in thomological algebra, created in the 1940s by methods transferred from topology to algebra; now they are of basic significance to the whole of mathematics. An important branch of algebra with wide applications is the theory of tvector spaces, or more generally that of tmodules over a ring. This branch is called linear algebra. Homo- morphisms between finitely generated modules can be represented by tmatrices. Another branch of algebra, called trepresentation theory, is concerned with representations of groups or rings by matrices. The methods of modem algebra provide useful and powerful tools for the whole of mathematics, in partic- ular for the theory of numbers and algebraic geometry. The present development of algebra owes much to the activity of the German school in the late 1920s represented by E. Noether, E. Artin, W. KrulI, and B. L. van der Waerden. The book by van der Waerden [3] has had a great impact on mathematics. N. Bourbaki [4] has been influenced by van der Waerden but gives accounts of more recent developments, particularly in linear algebra. In Japan, M. Sono, who worked at about the same period as E. Noether, was a forerunner in this field; after him, algebraists of the Kyoto School, Y. Akizuki, M. Nagata, and their followers, did notable research, especially in algebraic geom- etry. On the other hand, K. Shoda studied with E. Noether toward 1930 in Germany; his school includes such algebraists as T. Naka- yama, K. Asano, and G. Azumaya. Finally K. Morita and his disciples have made signif- icant contributions to homological algebra. 
8 Ref. Algebra References [IJ H. M. Weber, Lehrbuch der Algebra, Vieweg, I, 1894; II, 1896; III, 1891. [2J E. Steinitz, Algebraische Theorie der Kor- per, J. Reine Angew. Math., 137 (1910), 167- 309, [3J B. L. van der Waerden, Modeme Algebra, Springer, first edition, I, 1930; II, 1931; English translation, Algebra I, II, Ungar, 1970. [4J N. Bourbaki, Elements de mathematique II. Algebre, ch. 1-9, Actualites Sci. Ind., 1144b, 1236b,1044, 1 to2b, 1179a, 1261a, 1272a, Her- mann, 1958-1964. [5J G. Birkhoff and S. MacLane, A survey of modem algebra, Macmillan, third edition, 1965, 9 (VII1.2) Algebraic Curves A. General Remarks An talgebraic variety of dimension 1 is called an algebraic curve (for analytic theory...... 11 Algebraic Functions). The theory of algebraic curves has two aspects, the geometry of 1- dimensional complex manifolds in projective spaces and the theory of function fields of transcendence degree 1 ( 3 Abelian Varieties, 16 Algebraic Varieties). The number-theoretic study of algebraic function fields concerns the latter theory (- 73 Complex Multiplication, 450 Zeta Functions). In this article, the geo- metric aspect of the theory is emphasized. We denote the t un iversa1 domain by K. B. Classical Results on Plane Algebraic Curves Let f(X, Y) be a polynomial of degree m in two variables X and Y. A point set in an affine two-space defined by J(X, Y) = 0 is called the plane algebraic curve of degree m If we set F(Xo, XI' X 2 ) = X;;'f(XdX o , X 2 /X O ), the homogeneous polynomial F defines an alge- braic curve of degree m in a projective plane p 2 . The curve is called irreducible if the poly- nomial f(X, Y) is irreducible. A curve of degree 1 is said to be a line. Some results of this sec- tion are valid only in the case where the char- acteristic of K is zero. Let C be a plane curve defined by the equa- tion f(X, Y) = O. A pointP = (a, b) on C is called an r-p1e point if f(X + a, Y + b) has no term of degree < r in X and Y. At an r-ple point there are r tangent straight lines (count- ing ill ultiplicity), An r-ple point with r> 1 is 28 called a multiple point (or singular point). If these tangents are distinct, P is called ordi- nary. An ordinary double point is called a node; e.g" the origin for X 3 + y3 -3X Y = O. An algebraic curve can be transformed into a plane curve that has only ordinary multiple points by a finite number of plane tCremona transformations (tquadratic transformations of the projective plane into itself). Let C be an'irreducible plane curve of de- gree m> 1 in a projective plane. The set of all tangent lines at nonsingular points of C deter- mines a set of points in the dual projective plane, and its closure is an algebraic curve C, that is called the dual curve of C. The dual curve of C becomes the original C. The degree m' of C is said to be the class of C, which is equal to the number of tangent lines to C drawn from a general point. A nonsingular point P is called a point of inflection (or a flex) if the tangent line at P has contact of order > 2. If C is defined by an irreducible homoge- neous polynomial F(X o , XI, X,), the curve defined by det(i3 2 Fji3X j oX j )=0 is said to be the Hessian of C. A nonsingular point P is a point of inflection if and only if P is contained in the Hessian. A singular point P is said to be a cusp if C is detlned by an equation y2 == X 3 + higher terms, in terms of suitable affine coordinates X and Y where P = (0,0), Whenever the singular points of C are only v nodes and y cusps, the effective genus g is given by the formula g = (m - 1) (m- 2)/2 - v -yo In addition, suppose that the dual curve has only v' nodes and y' cusps as singularities. Then C has y' points of inflection, and m' and y' are given by m' == m.(m- I) - 2v - 3)1 and )I' = 3m(m - 2) - 6v - 8)1. Moreover, ==ni(ni- 1)  2v' - 3)1' and y = 3m'(m' - 2) 6v' 8)1' hold. These formulas are called Plucker formulas. For example, a nonsingular plane curve of degree 3 is an elliptic curve, i.e., g = 1, of class 6, which has 9 points of inflection. A non- singular plane curve of degree 4 has, in gen- eral, 24 points of inflection and 28 bitangents. This results from the Plucker formulas. C. Fundamental Notions In what follows, by a curve we mean an alge- braic variety of dimension 1. Let r be a non- singular complete irreducible curve. An ele- ment of the free Abelian group generated by points of r is called a divisor. A divisor is written in the form a = L niP i , with n j E Z. The integer n = L ni is called the degree of a and is denoted by deg a. The expression for a divisor a is said to be reduced if P; if- If for i # j. A divisor whose reduced expression has only positive coefficients is called a positive divisor (or effec- 
29 tive divisor, or integral divisor), and this is denoted by a;..O. The group of divisors on f is denoted by G(f), and the subgroup consisting of divisors of degree 0 is denoted by Go(f). Let P be a point of f, The subset of the function field K(f) of r consisting of functions regular at P forms a valuation ring R p for a tdiscrete valuation of K(T). A prime element t of R p is called a local parameter at P. Let V p be the tnormalized valuation of K(f) defined by R p ; the integer vp(f) is called the order off at P. The point P is a zero off if vp(f) > 0; it is a pole off if vp(f) < O. There are only a finite number of poles and zeros of a given function f The divisor L vp(f) P is called the divisor of the function f and is denoted by (f). The set of divisors of functions forms a subgroup G, of G,. Any divisor a in G, is called a principal divisor (we also say that a is tlinearlyequiva- lent to zero and write a  0). Let a be an arbitrary divisor. The set of all positive divisors that are linearly equivalent to a forms a complete linear system lal deter- mined by a. We set L(a) = {fE K(r) (f) + a;..O} U {O}, Then L(a) is a finite-dimensional vector space over K, and I-dimensional sub- spaces of L(a) correspond bijectively to the ,elements of ai, We set J(a) = dim, L(a) and dimlal=l(a)- 1. Then dimlal is called the dimension of al. For any divisor a, the integer deg a - diml a is nonnegative and bounded. The supremum 9 of such integers is called the genus of r, The nonnegative integer i(a) = g - deg a + dim a is called the speciality index of a, Let w be a tdifferential form on r, P be a point of f, and t be a local parameter at P. Then w can be written in the form w = fdt. We now set yp(w) = v p (/) and (w) = L Yp(w)P, Then (w) is a well-defined divisor, and the class of (w) in GjG, is independent of the choice of w. This divisor class is called the canonical class; any divisor in this class is called a canonical divisor (or differential divisor) and is denoted by f. We have J(f) = y, deg f = 2g - 2. Given a divisor a, the index i(a) is equal to the number of linearly independent differentials ill such that (ill);" a, i(a)=l(f-a). The equality J(a)= deg a - 9 + I + i(a) is called the Riemann-Roch theorem. For any irreducible algebraic curve f, there exists a birationally equivalent nonsingular curve f that is unique up to isomorphism. The genus of f is called the effective genus of r. A curve whose effective genus is zero is called a rational (or unicursal) curve. An elliptic curve is a curve whose effective genus is 1. An irreduc- ible curve r with genus ) 2 is called a hyper- elliptic curve if K(f) is a quadratic extension of a field K(t) for some t. A positive divisor a on a nonsingular com- 9D Algehraic Curves plete irreducible curve f is called a special divisor if i(a) > 0, In this case, 2 (J(a) - I) <S; deg a, where equality holds if and only if a = 0 or aN I, or r is a hyperelliptic curve (Clifford's theorem). Let f be a nonsingular complete irreducible curve and k be a sublj'eld of K such that r is defined over k Denoting by k the algebraic closure of k, we call a divisor p = LniP i on f a prime rational divisor over kif p satisfies the following three conditions: (i) p is invariant under any automorphism (J of kjk; (ii) for any j, there exists an automorphism (]j of klk such that IJ = Pji; (iii) n l = . = n, = [k(Pd:kl. An element in the subgroup of G(f) generated by prime rational k-divisors is called a k-rationa1 divisor. Let k(f) be the subset of K(f) consisting of functions f defined over k Then kif) is a subfield of K(f), and the quotient field of k(f)@ kK = K(T). kif) is called the function field of f over k Let p be a prime rational k-divisor, and let P be a point of jJ, Then Rp n kif) is a valuation ring of kif) uniquely determined by p and independent of the choice of the point P in p. We caIl this valuation ring the valuation ring determined by!). D. Algehraic Function Fields Let k be a field, and let K be a finite separable extension of a purely transcendental extension l(x) of k such that k is maximally algebraic in K. Then K is called an a1gehraic function field over k of dimension I (or of transcendence degree 1). The eq uivalence class of texponen- tial valuations of K that are trivial over k is called a prime divisor of K/k, An element of the free Abelian group generated by prime divisors is called a divisor of Kjk. The group opera- tion in the divisor group of Kjk is usually denoted multiplicatively. Let R p be the valuation ring of the prime divisor P, and let M p be the maximal ideal of R,. The de- gree deg P of the prime divisor P is defined by [(Rp/M p ): k]. If we replace the terms: curve r by function field K/k: K(f) by K; K by k; and points on r by prime divisors of K/k, we can develop the theory of the function field K/k, which is similar to the theory of nonsingular curves f (- Sections B, C). Thus we define the genus of the function field K/k. Suppose we are given an algebraic function field Kjk of dimension 1. An algebraic curve r defined over k is called a model of K/k if kif) and K are k-isomorphic. For any function field of dimension 1, there always exist two ele- ments x and y in K such that K= k(x,y). Let fiX, Y) be an irreducible polynomial such that fix, y) = O. Then the plane curve defined by the equation fiX, Y) = 0 is a model of K/k. Among 
9E Algebraic Curves the models of K/k there exists a tnormal model fo over k that is unique up to isomorphism (and the uniqueness of the normal model of the function field within the birational equiva- lence class of varieties holds only for curves). In particular, if k is the complex number tleld, the normal model fo is the tRiemann surface of the function l1eld K/k, If fo has no singular point, the theory of the curve fo and the theory of the function tleld K /k are essentially identical. (This occurs, for example, when k is tperfect.) In that case the genus of fo is equal to that of K/k. In general, the genus of the function field is not less than the genus of the normal model fo, and it is greater than the latter if fo has a singular point. If the genus of K/k is zero, we can take a plane quadratic curve as a model of K/k. Moreover, K/k has a prime divisor of degree 1 if and only if K is a purely transcendental extension of k. A func- tion l1eld K/k of genus 1 is called an elliptic function field. If K has a prime divisor of de- gree 1, an elliptic function field K has a model of a plane cubic curve. Moreover, if the char- acteristic of the l1eld k is different from 2, we can take as the model fo the curve defined by anequationoftheform y2 =4X 3 - g2X - 93' This is called Weierstrass's canonical form. The numberj=(g23-27g/flg23 (#O)is a birational invariant of fo. E. Jacobian Varieties Let f be a nonsingular curve. A tgroup variety J is called the Jacobian variety of r if it has the following four properties (we fix an algebrai- cally closed tfield k of definition for f and J): (i) There exists an isomorphism <I> (of.abstract groups) of Go(f)/G,(f) into J. (ii) 11> is contin- uous in the following sense: Let ii, b be elements of Go(f)/G,(f) represented by a, b. Ifb is a specialization of a over a field K( => k), then <I>(b) is also a specialization of <1>(0) over K (iii) If there exists a K-rational divisor in the class a, then the point 11>(11) is also K-rational. (iv) For any EJ, there exists a k()-rational divi- sor a in Go such that l1>(a mod G,) = . A group variety J satisfying these conditions is neces- sarily a complete variety, hence an tAbelian variety, and is determined uniquely up to isomorphism. The construction of Jacobian varieties over a l1eld of arbitrary characteristic is due to A. Weil [27J (for analytic construc- tion -" 11 Algebraic Functions C). Let P be a tgeneric point of f over k, and let Po be a fixed k-rational point. Then <p(P)= <I>(P- Po) defines a rational mapping of f into J, and <p, which is an isomorphism of f and its image <p(f), is determined uniquely by 11> up to translation on J, This mapping !fI is called the 30 canonical function on f. The dimension of J is equal to the genus g off. If Ph' . . , Pg are independent generic points of f over k, then k( PI' . .. , Pg)s is the function field of J over k, where k(P I , . , Pq)s is the subfield invariant under the group of g! automorphisms (P, , . . , P g )-->(P'I' ' P, ). The Jacobian variety of f is also the +Picafd variety of f, and it is equal to the tAlbanese variety of f (- 16 Algebraic Varieties P). Hence for any function f on r with values in an Abelian variety A, there exists a unique homomorphism ..1. of J into A such that f == ..1. 0 IfJ + const. This ). is called the linear extension of f Let e be the set of points on J that can be written as <p(Pd + + <p(Pg-d. Then e is an irreducible subvariety of codimension 1. The divisor e is called the canonical divisor of J. The Jacobian variety that is polarized by the divisor e is called the canonically polarized Jacobian variety (-" 3 Abelian Varieties G). If two curves f and f' are birationally equiva- lent, the canonically polarized Jacobians of f and f' are isomorphic. Conversely, if the canonically polarized Jacobian varieties J of f and J' of f' are isomorphic, then f and f' are birationally equivalent (Torelli's tbeorem). Let r be any integer such that 1 .s; r .s; g, and let vv, be the set of points that are written in the form <p(Pd+... + <p(P,) (WI = cp(f), _I = e,  = J). Then we have 0(') = r! _, (tnumerically equivalent) and (0(g» = y!, where 0(r) is the class of intersec- tions of r copies of 0. The existence of a divi- sor 0 is characteristic for Jacobian varieties. Actually, if A is an Abelian variety of dimen- sion IJ that has an irreducible subvariety X n - 1 of codimension 1 and a positive I-cycle C such that (x(n») = n! and x(n-l)= (n 1)! C, then C is a nonsingular irreducible curve, A is the Jacobian variety of C, and X is the canonical divisor. The canonical divisor 0 is defined by a ttheta function in the classical case. For a divisor a =  niP;, we define <p(a) to be Lni<P(Pi)EJ. For I1xed numbers 1 r.s;g, O<d, welet W,d denote {<p(a)EJI/(a))od+ 1, dega= y}. Then W; C;; Iv, and is a tsubscheme of J. Wecall the number p=g-(dt I)(g-r+d) the Brill-Noether number. Then dim W;)o p. Moreover, if r is a general curve, then dim W; = p. In particular, if p < 0, then W; == 0. This result has been verified recently by S. Kleiman, D. Laksov, P. Griffiths, and J. Harris [9,14]. Let f be a nonsingular curve, and let w be a differential form on r. If the divisor (w) is >-0, the w is called a regular I-form or differential form of tbe first kind. Let Q be the tsheaf of germs of regular differential forms. A differen- tial form of the first kind is an element of 
31 HO(f, Q), and vice versa. Let f be a canonical divisor. Then we have a natural isomorphism HO(f, Q)  L(f), and the number of linearly independent differential forms of the first kind is equal to the genus g of r, The tresidue of a differential can be defined as in the classical case. A differential that has nonzero residues is called a differential of the third kind. The re- sidue theorem  Respw = 0 holds for any dif- ferential w. The form (j) is called a differential form of the second kind if for any P E f there exists a rational function fp such that W - dfp is regular at P. The set of differential forms of the second kind forms a linear space G 2 over the universal domain and contains the subspace G, consisting of the differential forms of the first kind. The quotient space G 2 /dk(f) has dimension 29 or y according as the character- istic of the universal domain is 0 or not. When the characteristic p of the universal domain is positive, we have what is called the Cartier operator. Let f be a curve defined over a perfect tleld k, let L = k(f), and let t be an element of L that is transcendental over k and such that L/k(t) is separable. Then any dif- ferential w of L/k is written uniquely as (j) =(/6' + fft +... + fJ-l t P - I )dt, where /;E L. Then the Cartier operator C given by Cw = !P_I dt is well defined and independent of the choice of t and leaves G I invariant. Hence given a basis WI' , w g of G l , we obtain a matrix (a,) with coefficient in L by Co, = L QijW j ( 1 ::; i ::; y). This g x g matrix A is called the Hasse-Witt matrix of f. The class of A modulo the transformations of the form S-P AS is a birational invariant of f and plays an important role in the theory of unramified cyclic p-extensions of the algebraic function field. F. Generalized Jacobian Varieties The notion of linear equivalence of divisors on a nonsingular curve can be extended to a more general situation. Such attempts have been made by M. Noether, F. Severi, and M. Rosen- licht, who succeeded in obtaining such a gen- eralization [22]. Let f be an algebraic curve, and let PI' , Pt be singular points of r. Let J: p . be the tlocal ring of . We set D = n:l .oi and f' = r - {PI' ' P,). An element of the free Abelian group G(f) generated by points of f' is called a f-divisor. Let a be a f-divisor and set L(a)={lE.o I (/)+a>-O} U {O}. Then L(a) is a finite-dimensional linear space (over the universal domain). The dimension of L(a) is denoted by I(a), and we setdimlal=T(a)-1. The upper bound IT of deg(a) - dim l al is a nonnegative integer and is called the .I:-genus 9F Algeb raic Curves of f. We call t(a)=n-dega+ dim lal the .0- speciality index of the divisor a. Let C be a nonsingular curve birationally equivalent to f, and letQI"'" Q, be points ofC that correspond to singular points of f. An .0- differential w is a differential form on C (of K(f) = K(C» such that Lf1 ResQJw = 0 for any fED. Then i(a) is equal to the number of linearly independent .o-differentials w such that (w)>-a in f', The equality l(a)=dega- IT + 1 + i( a) is called the generalized Riemann- Roch theorem. An 'c)-differential (j) is called an D-differen tial of the first kind if (j) is regular everywhere on f'. The number of linearly independent .o-differentials of the first kind is equal to the D-genus 11. Let 9 be the effective genus of f, i.e , the genus of C. Then we have the equality n -g = dimK(OjD) = b, where D IS the integral closure of 8 in K(T). The set of .0- differentials forms an D-module that is in general not of rank 1. Hence in this case, we do not have the "canonical divisor." Let c be the conductor of DID. Then c determines a divisor in a natural way. If we denote the de- gree of this divisor by d, we have the inequal- ity b+ l.s;d.s;2b. We have d=2b if and only if the set of D-differentials forms an D-module of rank 1. This case occurs, for example, if f is a curve on a nonsingular surface or a complete intersection. Two f-divisors a and b are said to be .a-linearly equivalent if there exists a unit .f of D such that a-b=(f). The set of f- divisors that are .o-linearly equivalent to zero forms a subgroup G/(r) of G(T). There exists a group variety J!C, unique up to isomorphism, that satisfies the four conditions required for Jacobian varieties (- Section E) with respect to the class group Go(f)jG/(f). The variety J o is called the generalized Jacobian variety. The generalized Jacobian variety is not complete, in general. If J is the Jacobian variety of C, then J D is an extension of Jby a connected tjinear algebraic group I D . Any Abelian exten- sion of the function tleld of f can be obtained by the tisogenies of the generalized Jacobian variety of f [22]. This fact plays an important role in class field theory OVer algebraic func- tion fields ( 59 Class Field Theory G). The theory for nonsingular curves is considered as the special case in which D = K(r). Suppose that r is situated in a projective space of dimension n. Let p be the prime ideal in k[X o , XI' , XnJ deilning f and X(P, m) be the number of linearly independent homoge- neous polynomials of degree m modulo p. Then X(P, m) is a polynomial in m for large m, This polynomial is called the Hilbert polynomial of p (or f). Let c be the constant term of the Hilbert polynomial. The number Pa(f) = 1 -c is called the arithmetic genus off and is equal to the D-genus of r. Let r be a nonsingular 
9G Algebraic Curves irreducible curve in p3 of degree d If r is contained in a plane, then Pa(r) = (d 1). (d-2). Otherwise, Pa(r).s;d 2 /4-d+ 1 when <lis even, and Pa(r).s;(d 2 - 1)/4-d+ 1 when d is odd [11]. G. Sheaf Theory Let r be an irreducible curve and (!)p be the local ring of p. Then (Dr = U (!)p is an talgebraic coherent sheaf, which is called the structure sheaf of r, and dim, HI(r, (Dr) is equal to the arithmetic genus n of f. Let a be a f-divisor, and let @r(o) be the tsheaf of germs of rational functions f such that (j)+ a>-O and fED Q for every singular point Q of r (- 383 Sheaves D). Then dim, Hl(f, O,(a» is equal to the speciality index 1(0), and dim, RO(r, @r(o)) is equal to I(a). When r has no singular point, the Riemann-Roch theorem is deduced naturally from tSerre's duality theorem: HI (f, iPr(a)):'+HO(f, ((Jr(f - 0)), H. Algebraic Correspondence Let r be a nonsingular curve. A divisor of the product variety r x r is called an alge- braic correspondence of f [26, 27]. Let Do be the subgroup consisting of divisors that are linearly equivalent to degenerate divisors a x r + r x b. Then the class group 'ti(r) = G(f x r)/ Do is called the group of classes of algebraic correspondences, We write X sO if X is an element of Do, Let X be an algebraic correspondence, k a field of delInition for r over which X is rational, and P a generic point of rover k Then X(P)=pr 2 [X(P x r)] is rational over l(P). The composite X 1 0 X 2 of two correspondences X, and X 2 is defined by (Xl 0 X 2 )(P) = XI (X 2 (P» whenever they have meaning. The composite X, 0 X 2 deter- mines an element of (f) that depends only on the classes of X, and X,. This multiplication supplies the group 'ti(f) with the structure of an associative ring. This ring is called the correspondeuce ring of f. The correspondence ring 'ti(r) and the ring d of endomorphisms of the Jacobian variety J are isomorphic, and the isomorphism is given by the following rule: Let  be an element of 'ti(f), and let X be a divisor in . Let P be a generic point of f with re- ference to k over which X is rational. Let Po be a k-rational point of r, Then the class of X(P) - X(Po) modulo G/(r) is independent of the choice of a divisor X in the given class. We set IJI(P)=<D(X(P)-X(P o )) andlet )" be the linear extension of Y. The correspondence .;:-->}, is an anti-isomorphism of (r) and d, Now we set ,r;l/o = ,w @ Q. Then .91 0 contains an automorphism I of order 2 called an in- 32 voIution. Let 1 be a rational prime different from the characteristic p, Then A has a faithful representation by 2g x 2g matrices with co- efficients in l-adic integers. The ttrace (J of this representation has the property that 0"( # 0 #') > 0 if Ii f. 0 (Castelnuovo's lemma). do is an algebra of finite tank over Q, and d is a finitely generated Abelian group. Based on these results A. Weil proved the Riemann hypothesis for congruent (-functions on a non- singular curve (- 450 Zeta Functions P). I. Coverings Let rand C be nonsingular curves such that there exists a regular mapping n: r --> C. Then there is an injection of the function field K(C) into K(T). If K(f) is separably algebraic over K(C), then r is called a covering (curve) of C. The integer [K(I): K(C)] = n is called the de- gree of covering. Let P be a point of r and let Q=n(P). Let t, s be local parameters at P on r and at Q on C, respectively. The nonnegative integer vp(ds/dt) is called the differential index at P and is denoted by mp. The index mp is zero except for a finite number of points. The divisor L mpP is called the branch divisor. The covering f is called an unramified covering if the branch divisor is zero. If we denote the branch divisor by a, we have the formula 2g(r)-2=n(2g(C)-2)+dego, where g(f) and g(C) are genera of f and C, respectively. This is called the Riemann-Hurwitz formula. This formula yields at once that a rational curve has no nontrivial unramified covering and that r can be an unramified covering of itself if and only if r is an elliptic curve. J. Theory of Moduli Let ./ltg(k) be the set of isomorphy classes of complete nonsingular irreducible curves (here simply called "curves") of genus 9 defined over a field k We can endow Atg(k) with a structure of an algebraic variety over k with the prop- erty that for any smooth family (over R) n:--> S of curves of genus y the map T: S-->Atq(k) sending s to the isomorphy class of the fiber 1[-I(S) is a tmorphism. This algebraic variety is called the (coarse) moduli space of curves of genus g. Furthermore, the moduli space over Spec(Z) exists (D. Mumford [16J). Atg(k) is tnormal, irreducible and tquasiprojective (but not complete for 9 > 0) of dimension 0 (g = 0), 1 (g == 1),39 -3 (g 2) (== 3g - 3 + dim Aut(C)) with only quotient singularity [5]. Since when g = 1 any elliptic curve Cover an algebraically closed lIeld k is isomorphic to a plane curve with the Weierstrass canonical 
33 form y2 =4x 3 - g2x - g3 (if char(k) # 2; the case of char(k) = 2 needs a slight modification), the correspondence C-->j(g2, g3) ( Section D) defines an isomorphism .111  A (affine line). In the case of g ? 2 and k = C we have another function-theoretic construction of jig due to Teichmiiller (- 11 Algebraic Functions F). Let .sl g be the coarse moduli space (over Z) of principally polarized Abelian varieties of dimension g ( 3 Abelian Varieties 1). For a curve C we denote the Jacobian variety of C by J(C) (- Section E). The correspondence C-->J(C) defines a morphism i:,/!'tg-->,w g which is injective (Theorem of Torelli). It is even an timmersion (F. Oort, J. Steenbrinck). If g= 1, 2, 3, then i is an open immersion whose image we can describe. For k =C we can express the above map i by using periods of curves. Namely, let IX", IXg, PI" fig be a (canonical) basis of HI (C, Z) detlned by tnormal sections of C (considered as a real surface). Let WI' . . , w q be a basis of tdifferential forms of the first kind on C with Jpi w j = (jij' Then the matrix Q = (Sa i w) is symmetric and has a positive definite imaginary part, i.e., is an element of the tSie- gel upper half-space 6q of degree y. With the identification $g  6 g /Sp(g, Z) (- 3 Abelian Varieties 1) the map i:.$Ig-->M g is nothing but the one sending the isomorphy class of C to Rmod Sp(g, Z). For g ==4 the closure of i(.i1 4 ) is a principal divisor detlned explicitly with theta constants (Schottky, J.-1. Igusa). K. Stable Curves A reduced connected complete curve Cover k is called a stable curve of genus g ( > (0) if(i) C has only ordinary double points as possible singularities; (ii) when r is a smooth rational irreducible component of C, then r intersects the other components in more than 2 points; (iii) dim, III (C, (]:cI = g. There exists a tcoarse moduli space Yg of stable curves of genus g which contains -.#t q as a Zariski open subset. The space . detlned over Spec(Z) is tcomplete and even tpro- jective; hence 2'q gives a compactification of .$Ig. The completeness of Yg follows from the stable reduction theorem as follows: let R be a discrete valuation ring with quotient field K and C be a smooth connected curve over K. Then there exists a finite separable algebraic extension L of K such that the curve C x K L extends to a flat family of stable curves OVer Spec(R L ), where R L denotes the integral clo- sure of R in L. In this case we say that the curve C x K L has a stable reduction in R,. With the above notation a curve Cover K has a stable reduction in R if and only if its 9 Ref. Algebraic Curves Jacobian variety J(C) has a tstable reduction in R (- 3 Abelian Varieties N). Over C, $q has the tSatake compactification <w g , which is a disjoint union of $g" 0 g' g, as a set. The injection i:.$Ig-->M g (- Section J) extends to a morphism j:. -+ $q that sends the isomorphy class of a stable curve C = C l U U C k (C i irreducible) to that of J(C l ) x x J(Ck)U#g" g'=Ligenus(C;), where C j is the tnormalization of C i . In particular the closure of the image of i in $; is the set of products of Jacobian varieties. References [IJ S. Arakelov, Families of algebraic curves with fixed degeneracies, Math. USSR-I-V., 5 (1971),127771302. [2] H. F. Baker, Analytical principles of the theory of curves, Principles of Geometry 5, Cambridge Univ. Press, 1933. [3J A. Brill and M. Noether, Uber die alge- braischen Functionen und ihre Anwendungen in der Geometrie, Math. Ann., 7 (1874),269- 310. [4] G. Castelnuovo, Numero delle involutione rationali giancenti sopra uno cur va di dato genere, Rend. Acad. Lincei, 4, 5 (1889). [5] P. Deligne and D. Mumford, The irreduci- bility of the space of curves of given genus, Pub\. Math. Inst. RES, 36 (1969),75- 109. [6J F. Enriques and O. Chisini, Teoria geo- metrica delle equazioni dele funzioni alge- briche, Zanichelli, Bolgna, 1934. [7J T. Fujita, Detlning equations for certain types of polarized varieties, Complex Analysis and Algebraic Geometry, Iwanami, Tokyo, 1977,165-173. [8] W. Fulton, Algebraic curves, Benjamin, 1969. [9] P. Griffiths.and J. Harris, Principles of algebraic geometry, Wiley, 1978. [ IOJ P. Griffiths and J. Harris, On the variety of special linear systems on a general algebraic curve, Duke Math. J., 47 (1980), 233-272. [1IJ R. Hartshorne, Algebraic geometry, Graduate text 52, Springer, 1977. [ 12J R. Hartshorne, On the classification of algebraic space curves, Vector Bundles and Differential Equations, Birkhauser, 1980. [13] G. Kempf, Schubert methods with an application to algebraic curves, Publications of Mathematisch Centrum, Amsterdam, 1971. [ 14J S. Kleiman and D. Laksov, On the exis- tence of special divisors, Amer. J. Math., 94 (1972),43 1-436. [ 15J S. Kleiman, r-Special subschemes and an argument of Severi's, Advances in Math., 22 (1976), 1-31. 
10 A Algebraic Equations [ 16J D. Mumford, Geometric invariant theory, Erg. Math., Springer, 1965. [17] D. Mumford, Curves and their Jacobians, Univ. of Michigan Press, 1975. [18J M. Namba, Families of meromorphic functions on compact Riemann surfaces, Lec- ture notes in math. 767, Springer, 1979. [ 19J M. Noether, Zur Grundlegung der Theorie der algebraischen Raumcurven, J. Reine Angew. Math., 93 (1882),271-318 (Berlin, 1883), [20J B. Saint-Donat, On Petri's analysis of the linear system of quadrics through a canonical curve, Math. Ann., 206 (1973),157-175. [21J A. Seidenberg, Elements of the theory of algebraic curves, Addison-Wesley, 1968. [22J J.-P. Serre, Groupes algebriques et corps de classes, Actualites Sci. Ind., Hermann, 1959. [23] F. Severi, VorIesungen iiber algebraische Geometrie, Teubner, 1921. [24J J. M. Wahl, Deformations of plane curves with nodes and cusps, Am. J. Math., 96 (4) (1974), 529-577. [25J R. J. Walker, Algebraic curves, Princeton Univ, Press, 19.50, [26J A. Weil, Sur les courbes algebriques et les varietes qui s'en deduisent, Actualites Sci. Ind., Hermann, 1948. [27J A. Weil, Varietes abeliennes et courbes algebriques, Actualites Sci. Ind., Hermann, 1948, [28J A. Weil, Zum Beweis des Torellischen Satzes, Nachr. Akad. Wiss. Gottingen (1957), 33-53. [29J H. Weyl, Die Idee der Riemannschen Flache, Teubner, third edition, 1955. [30J G. B. Winters, On the existence of certain families of curves, Amer. J. Math., 96 (1974), 215-228, 10 (111.5) Algebraic Equations A. General Remarks Let FI(XI,,,,,X,),, F,(X l ,. ,X m ) be r tpolynomials in m variables Xl' . , X m over a tfield k. Then the equations Fl = 0, . ..) F, = ° are called algebraic equations in m unknowns. When we consider these equations simulta- neously, where r? 2, we call them a system of r equations or simultaneous equations. (For r = 1, a system of one equation means the single equation F 1 = 0.) Coefficients of FI , , F, are called coefficients of the system, and the great- 34 est of the degrees of Fl , . . , F, is called the degree of the system, To solve a system of equations (henceforth in this article we shall omit the word "alge- braic") means to find the common tzero points (in an talgebraicalIy closed field containing k) of elements FI" . , F, of the tpolynomial ring k[X I , , X,]. If there exist no common zero points, the system is said to be inconsistent; if there exists a finite number of such points, it is said to be regular; and if there are an infinite number of such points, it is called indetermin- ate. The telimination method a\1ows us to reduce the problem of solving a system of r equations to the case r = 1. In particular, any regular system of equations can be reduced to the case m=r= 1. B. Equations in One Unknown For the above reason, it is important to con- sider an equation of the fOfmf(X) = 0, where f(X) = aoX n + a 1 xn-I +... + an' a o #0, (1) This gives the general form of an algebraic equation in one unknown. According asf(X) is reducible or not in the tpolynomial ring k[XJ, the equationf(X)=O is called reducible or irreducible (- 337 Poly- nomials). In some talgebraic extension field K of k, f(X) can be factored: f(X)=ao(X -ctd(X -ct 2 )... (X -ctn)' (2) ex l ,. , an are called the roots of the equation f (X) =0. Hence, any algebraic equation of degree n has exactly n roots (Kronecker's theorem). Now, ( -1 )ia)ao is equal to the tele- mentary symmetric function of degree i of  l' ..., ct.. Some of the roots ct l \"', exn may be identical. If ct appears p times in ct l ' . , exn, we say that r/, is a p-tuple root, and p is called the multiplicity of the root r/" When p = 1, ct is called a simple root, and when p ? 2, ex is called a multiple root. Let p" ' , , , Pv be all the distinct roots among ex l , . , an, and let Pi be the multi- plicityof P,(i= 1, , v). Then [(X)=ao(X - Pd 1 " ,(X - Pv)Pv, (2') PI + . + Pv = n. If PI' . , Pv are not divisible by the tcharacter- istic of k, the greatest common divisor g off and f' =naoX n - 1 +(n -l)a l X n - 2 + ," +a n - I is (X - fJd 1 -I", (X - fJ,)Pv-l, Thus we can reduce the multiplicity of every root to 1 by dividing f by g. Any irreducible equation over a field of characteristic 0 has no multiple roots. Equation (1) has multiple roots if and only if 
35 its tdiscriminant D is equal to 0 (- 149 Fields; 172 Galois Theory), C. Equations of Special Type In Sections C and D we assume that the char- acteristic of k is zero. Binomial Equations. An equation of the type X m _ a = 0 is called a binomial equation. It is solved by root extraction. Let  (mtb root of u) be one of the roots (if a is a positive real number,  usually denotes a positive real root). Then  multiplied by 1, (, (2" (m-I are the roots of X m - a == 0, where ( is a tprim- itive mth root of unity. Reciprocal Equations. An equation a, X n + a, X n - 1 +... + an =0 is called a reciprocal equation if ao == an' a, = a n - j , a 2 = a n - 2 , '.'. A reciprocal equation of an odd degree n = 2 m + 1 has a root X = - I, and dividing the left side by X + 1 we get a reciprocal equation of degree 2m. A reciprocal equation of degree n = 2m is reduced to an equation of degree m in Y = X + X -I and the quadratic equation X 2 -XY+ 1 =0. D. Equations of Lower Degree (- Appendix A, Table 1) (1) A linear equation ao X + a, = 0 has a single root -a I! a o ' (2) the roots of a quadratic equa- tion a o X 2 +ajX +a 2 =0 arc givcn by (-a, ::!:: J a-4aOa2)/2ao' (3) To solve a cubic equation a o X 3 + II, X 2 + 11 2 X + a 3 =0, we set A, =9aOa,a2-2ai-27a6a3' A2=a-3aoa2' and solve the quadratic equation T 2 A, T + A  = O. Let t 1 and t 2 be the roots of this quadratic equation, and let w be any cube root of I. Then (-a, +(J) +(J)2)/3ao is a root of the original cubic equation. (Cardano's formula). If we apply this method to a cubic equation aX 3 + hX 2 + eX + d = 0 with real coefficients, we need to use complex cube roots even if the roots of the equation are real. In fact, it has been proved that it is not pos- sible to solve this equation within the real numbers in this case; i.e" if the cubic equation is irreducible over the extension Q(a, b, c, d) of the rational number field Q, and if all of its roots are real, it is impossible to find the roots only by rational operations and with real radicals, This is called the casus irreducibilis. (4) A quartic equation a o X 4 + a, X 3 + a 2 X 2 + a 3 X + a 4 = 0 can be solved by means of reduction to a cubic equation (L. Ferrari) ( Appendix A, Table 1). Generally, the proce- dure of solving an algebraic equation, i,e., 10 E Algebraic Equations finding the roots of a given equation from its coefficients by means of a finite number of rational operations and extractions of radicals, is called a solution by radicals (or algebraic solution). The tgeneral algebraic equation whose degree is  5 cannot be solved by radicals (N. H. Abel) ( 172 Galois Theory). E. Analytic Tbeory In this section, k denotes the field R of real numbers or the field C of complex numbers. These cases have been studied for a long time, for practical reasons. Concerning the case k = C, the field C is talgebraically closed; i,e" every equation with coefficients from C has a root in C (Gauss's tbeorem, called the fundamental tbeorem of algebra). Accordingly, in the field C, we always have equations (2) and (2'). Let ()( l' , :1" be the roots of equation (1). Then each ex i is a continuous function of coeffi- cients a" a" , a,. Concerning the location of roots ofj(X)=O andf'(X)==O on the complex plane, we have the following theorems: (1) Any convex polygon on the complex plane containing the roots of f(X) = 0 also contains the roots off (X) = 0 (Gauss). (2) Let C be a rectifiable tJordan curve not passing through a root off(X) = O. Then the number (CJ) of the roots off(X)==O lying in the region enclosed by C is equal to ( 1/21fi) SdJ'(z)l.f(x))dz, where the multiplicity of the roots is taken into account. (3) Let C be a Jordan curve on the complex plane. If I f(z)l > Ig(z)1 at every point z on C, then the equations j == 0 and! + q = 0 have the same number of roots (counting multiplicity) within the region enclosed by C (Roucbe's theorem). (4) The absolute value of a root of equation (1) is less than M =max(laj/aol,..., lan/aol) + 1. (5) Let D be the tdiscriminant off, and as- sume that lex;!  M (i = 1,... ,n). Then IX i - exp  D/(2M)n(n-' )-2 = E. Since the value of IDI is known fromf'and one value of M is given by theorem (4), we have one value of E. If we draw a circle on the complex plane with center at the origin and with radius M, and if we cover it with a net whose meshes have diameters less than jE/2, then the interior of each mesh contains at most one root of r = O. When kR,i.e.JER[XJ, letfJ"", ./iv denote the distinct roots off= 0, and recall equation (2'). Suppose that fJl' , f3A E Rand the others R, Then V-A is an even integer 2K, and we can renumber /i;.+1, ,Pv so that f3;.+1 ==fl;,+K+l' ...,lJA+K = fJv (/i denotes the con- 
10 Ref. Algebraic Equations jugate of P) and P!.+l = P!'+K+l' ..., P!.+K = Pv' In this case, PI> ' , P 1 are the real roots of equation (2), and the other p's are called the imaginary roots. (6) UfER[XJ and ao>a l > , , >an>O, then the absolute value of any root of equation (1) is less than 1 (Kakeya-Enestrom theorem). Concerning the real roots of an equationf = 0, wherefER[X], we have the following theorems: Let N(a, b)(a, bE R) denote the number of real roots in the interval (a, b). Furthermore, let V(c 1 , C 2 " , c p ) denote the number of changes of sign in the sequence C I' C 2, , c p of real numbers, which is defined as follows: Suppose that we have the sequence C v",.., C Vq after deleting the terms c i = 0 from the sequence c l , C2" cpo Then 1 q-I V(cl,c2,,,,,cp)=- 2 L(I-sgncvcv ). j=l J )+1 (7) N(O, 00) = V(a o , a" , an) (mod 2) and N (; V(Descartes's theorem). (8) Let V(c}= V(J(c),f'(c), , , , , f"'(c», Then N(a, b) == Via) V(b) (mod 2) and N <s: V(a)- V(b) (Fourier's theorem). (9) We may assume thatf =0 has no multi- ple roots, Construct a finite seriesfo == f,fl = {', ,(, of polynomials over R such that 1;-1 =};qi- LI fori== 1, 2" , , ,1- 1 and;;E R, by successive application of the tdivision al- gorithm, Let V(c)= V(JO(C),fI(C), ,;;(c»). Then N(a, b) = V(a) - V(b) (Sturm's theorem). By means of this theorem we can determine the location of real roots as precisely as we wish, (10) In order that every root ex i of an equa- tionf= 0 with a, > 0 lies on the left side of the imaginary axis, i,e., Re ex, < 0, it is necessary and sufficient that in the following matrix the tprincipal minors composed of the first r rows and first r columns be positive for all r = 1,2, , n (Hurwitz's theorem): a, a 3 as a, a, a2 a 4 a 6 0 a l a 3 as 0 a, a 2 a 4 o o a n - 2 a, Also, for f E C [X J, various results have been obtained about under what conditions all the roots off= 0 lie on one side of a given straight line or inside a given circle (e,g" the unit circle) (- 301 Numerical Solution of Algebraic Equations), References [1] L. E, Dickson, Elementary theory of equ- ations, Wiley, 1914, [2J ], Dieudonne, La theorie analytique des 36 polyn6mes, Memor. Sci. Math" Gauthier- Villars, 1938, [3J M, Marden, Geometry of polynomials, Amer. Math, Soc, Math, Surveys, second edition, 1966, [4J W, Specht, Algebraische Gleichungen mit reelen oder komplexen Koeffizienten, Teubner, 1958, 11 (X1.10) Algebraic Functions A. Definition An algebraic function is a multiple-valued tanalytic function w = w(z) defined by an tirre- ducible algebraic equation P(z, w) =0 with complex coefficients, B. History and Methods The theory of algebraic functions evolved from the works of C, F. Gauss, N, H. Abel, and C, G, ], Jacobi on telliptic functions in the early 19th Century, Stimulated by their works, B, Riemann and K. Weierstrass established the foundations of the theory of complex functions and developed the important theory of alge- braic functions, The equation P(z, w) = 0 defines a curve in the 2-dimensional complex tprojective space with inhomogeneous coordinates z, w, Inves- tigations from this point of view were initiated by Riemann, A, Clebsch, and p, Gordan, This approach was followed by A, Brill, M, Noether, and the Italian school (F. Severi, C, Segre, etc,) and has developed into contempo- rary algebraic geometry ( 9 Algebraic Curves, 12 Algebraic Geometry), The set of tfunction elements w(z) satisfying P(z, w) = 0 is a tcomplex manifold iR, a closed (= compact) tRiemann surface, on which z and w are tmeromorphic functions. The field K'R consisting of the meromorphic functions on m is an talgebraic function field C(z, w), Con- versely, for any closed Riemann surface !It, the field K'R is an talgebraic function field in one variable over C, and any pair of functions z and w with K:Ji = C(z, w) has the property that iR is tconformally equivalent to the Riemann surface determined in the above fashion by the irreducible algebraic equation P(z, w) = 0 satis- fied by z and w, Two Riemann surfaces iR I, iR 2 determined by the equations PI = 0, P 2 = 0 are conformally equivalent if and only if the fields K'R 1 and K 912 are C-isomorphic, This condition is equivalent to the existence of a tbirational 
37 transformation between the algebraic curves PI = 0, P2 = O. The "analytic method" (the method of studying algebraic functions as functions on Riemann surfaces) is the creation of Riemann. It was extended by F. Klein and D. Hilbert, and later by H. Weyl, who es- tablished in his monograph [6J a rigorous foundation of the analytic method for the theory of algebraic functions. Given an arbitrary algebraic function field K in One variable over C, the set 91 of its tprime divisors with a suitable topology and analytic structure is a closed Riemann surface whose function field K91 coincides with K The "algebraic method" (the method of studying algebraic functions as elements of an algebraic function ti'eld) was founded by J. W. Dedekind and H. Weber at the end of the 19th Century. In the 20th Century, the algebraic method has made remarkable progress, owing to the devel- opment of abstract algebra. It covers the case of an arbitrary ground li'eld as well as that of more than one variable. The theory of algebraic functions has had considerable in- fluence on the development of number theory because of a basic analogy between the two subjects, The tuniversal covering spaces (surface) 9{ of a closed Riemann surface !It can be regarded, by conformal mapping, as the Riemann sphere, the plane, or the unit disk (or, equiva- lently, to the upper half-plane) if the tgenus 9 of !It is 0, 1, or  2, respectively. Then the tcovering transformation group G, consisting of tlinear fractional transformations without fixed points in 9"\, is tproperly discontinuous and has a compact tfundamental domain. Conversely, if D is one of the three domains just mentioned and if G is the group just de- scribed, then 91 = D/G is a closed Riemann surface such that D and G are its universal covering space and covering transformation group. A meromorphic function on !It is repre- sented as an tautomorphic function on 9'i with respect to G, If Y = 0, then G = { I }, 9'i = 91, and K91 is the field of rational functions. If 9 = 1, then K'.J1 is the field of telliptic functions. The study of algebraic functions as automorphic functions was initiated by H. Poincare and Klein. Recently, C. L. Siegel made a remark- able contribution to the investigation of the case of several variables. The theory of auto- morphic functions is also related to number theory. Works of E. Hecke, M. Eichler, and G. Shimura on this domain are noteworthy (- 32 Automorphic Functions, 73 Complex Multiplication). Another important topic concerning alge- braic functions (closed Riemann surfaces) is the problem of moduli. Riemann stated, with- out rigorous proof, that the set of conformal lIc Algebraic Functions equivalence classes of closed Riemann surfaces of genus 9 (? 2) depends on 3g - 3 complex parameters, called moduli. This has led to the modem theory of Teichmiiller spaces, which is developing into an extensive new field (- 234 Kleinian Groups, 416 Teichmiiller Spaces). In the rest of this article, we deal mainly with the analytic method (for the case of two variables --' [IJ). C. Abelian Differentials An Abelian differential on a closed Riemann surface 91 is, by definition, a complex tdif- ferential form W = a(z)dz, where a(z) is a mero- morphic function of a local parameter z. Such a differential is said to be of the first kind if a(z) is holomorphic, of the second kind if the residue vanishes everywhere, and of the tbird kind otherwise, The indeli'nite integral W(p) = S:o w of an Abelian differential m, where Po is assumed not to be a pole of w, is called an Abelian integral. It is said to be of the first, second, or tbird kind if the same holds for w, If y is a 1- tcycle on 9l, the quantity Lw is referred to as the period of w along y. An elliptic integral is defined to be an Abelian integral on a closed Riemann sur- face of genus 1. For example, this is the case if the equation P(z, w) = 0 defining the surface is of degree 2 with respect to wand of degree 3 or 4 with respect to z. More generally, a closed Riemann surface is called byperelliptic if P(z, w) is of degree 2 with respect to w, or, equiva- lently, if !It carries a meromorphic function with exactly two poles, An Abelian integral on such a surface is called a hyperelliptic integral. On a closed surface 9\, let I-;, be the linear space Over C of the Abelian differentials of the first kind. Given a I-cycle (1 of 9l, there exists a unique W,E v., such that Re Lm, is equal to the tintersection number ((1, y) for every I -cycle y, This differential is also characterized by the property ((w. w,h ==) S'.J1W /\ *w,== - 2J=l J,w for every WEv;,. If {ex l ,. . ..a.,} form a basis of the I-dimensional thomology group with integral coefficients, then Re w. (i = 1, . , 2g) form a basis of the linear space  over R of the tharmonic differentials on !It as well as that of the space { Re W WE v;,}. Accordingly, dime I-;, = g, dim, v" = 2g. These identities show a close relationship between the topological structure of 9l and the space of the Abelian differentials on 91 (--, 194 Harmonic Integrals). One can choose a I-dimensional homology basis {exi,iXg+i}Yl so that (exi,exj)=(exg+;,exg+)= 0, (exi,exg+i)= 1, and (ex"(1g+)==O U#j),(i,j== 1,2" g), Such a basis is called a canonical 
llD Algebraic Functions homology basis. If wand (J are Abelian dif- ferentials of the first kind, then we have (w,a)m= -yC1 t (i W i (J }-1 a.) (1.g+j - i W i (J ) =O, a.g+j a.j (W,W)91=yC1 t (i W i w }-1 a) ag+j - i W i W ) O. (lg+j aj Together, these are caIled the (Riemann) period relation. The second formula implies that w vanishes identicaIly ifthe periods of w along alllX j (j = 1, ..., g) are zero. Let lXi' .., ,1X 2 be a I-dimensional homology basis, and let w:, ... , w g form a basis of V. over C. The g x 2g matrix 0 with J'JWi as its (i,j)- component is caIled a period matrix. Corre- sponding to the change of bases (IX) and (w), it is subject to transformation into the form AOM, where A is a g x g invertible complex matrix and M is a 2g)( 2g integral square matrix with determinant :!:: 1. Conversely, two Riemann surfaces are conformaIly equivalent if they possess period matrices transformable to each other in this manner (Torelli's theorem). We can choose Wi' ..., W g so that the cor- responding period matrix with respect to a canonical homology basis becomes (I g , T) with the g x g unit matrix Ig. Then from the period relation, T is symmetric and 1m T is positive definite. On the complex linear space C g , consider the subgroup generated by the 2g column vectors of a period matrix 0 (the subgroup is also denoted by 0). Since it is of rank 2g and properly discontinuous, a group mani- . fold Cg/O is obtained. It is determined by 9l uniquely up to analytic isomorphism and is caIled the Jacobian variety of 9l. The general- ized Jacobian variety is introduced in a simi- lar fashion by means of Abelian integrals of the second and third kinds ( 9 Algebraic Curves). D. The Riemann-Roch Theorem In the present context, a 0- tchain with inte- gral coefficients on a Riemann surface 9l is referred to as a divisor. A divisor d == L niPi (niEZ,piE9l) is an integral divisor (or positive divisor) if n j  0 in the reduced expression; d is a prime divisor if it consists of a single point Pi and nl == 1. A divisor of a meromorphic func- tion f or an Abelian differential W is defined by taking the Pi as the zeros (poles) of for w and n i ( - n i ) as the multiplicity of the zero 38 (pole) at Pi' The divisors on 9l constitute an Abelian group :D in which principal divisors, i.e., divisors of meromorphic functions, consti- tute a subgroup . The factor group :D/ is caIled the divisor class group; an element of it is caIled a divisor class. The divisors of Abelian differential constitute a single divisor class, which is referred to as the canonical divisor class (or differential divisor class). The degree and the dimension of a divisor class Dare defined as foIlows, independent of the choice of the representative d=LniPiED: degD=Ln i , dim D == dimdflf is meromorphic, (divisor of f)+d is a positive divisor}. For example, the degree of the principal divisor class is zero, In terms of these concepts, the Riemann- Roch theorem is stated as foIlows: For a divi- sor class D on a closed Riemann surface 9l of genus g and for an integer n, we have dim(D+nW)-dim(-D-(I- n) W) =degD+(2n-l)(g-I), where W is the canonical divisor class ( 9 Algebraic Curves). This theorem implies the foIlowing prop- erties of 9l: (i) deg W == 2g - 2. (ii) The holo- morphic invariant forms <pdz 2 (i.e., analytic tensors of order 2), referred to as quadratic differentials, constitute a linear space over C of dimension 0 (if g=O), 1 (if g= 1), or of dimen- sion 3g - 3 (if g  2). The quadratic differentials have close connection with textremal quasi- conformal mappings and play an important role in the theory of TeichmiiIler spaces ( 352 Quasiconformal Mappings, 416 Teich- muller Spaces). (iii) For a point PE9l, a posi- tive integer m is caIled a gap value if 9l carries no meromorphic function having a pole only at P with multiplicity m. Then if g=O, no point has gap values; and if g  1, every point P has exactly g gap values; in this case, P also has a nongap value m.s;g+ 1. A point pis caIled an ordinary point if the gap values at pare 1,2, ..., g; otherwise P is caIled a Weierstrass point. If g  2, then the total number N of Weierstrass points is not less than 2g + 2 and not greater than (g-l)g(g+ 1) (A. Hurwitz). Moreover, the case N = 2g + 2 occurs if and only if 9l is hyperelliptic, and then the gap  lues at Weierstrass points are 1, 3, ..., 2g-1. his implies that every closed Riemann surface f genus 2 is hyperelliptic. (iv) Suppose the enus g of 9l is  2. A conformal mapping f of onto itself with the property that every 1- cycle y is always homologous to f(y) is neces- sarily the identity transformation. Also, 9l is known to admit only a finite number of con- formal mappings onto itself (H. Schwarz); the total number does not exceed 84(g-l) (Hurwitz). 
39 E. Abel's Tbeorem Abel's tbeorem is stated as follows: A divisor d of degree zero is a principal divisor if and only if it is expressed as d = 0'1 by means of a 1- chain y that has the property that S y ill = 0 for every ill EVa. Given a divisor class D of degree zero, con- sider a I-chain y with DYED. For every I-cycle I'J. there corresponds the quantity x,(D)=ex P (2nj=1Re 1 ill} independent of the choice of y. Thus D deter- mines a tcharacter on the I-dimensional ho- mology group, called the integral character. Conversely, every character on the homology group is shown to be the integral character of some D. In terms of this notion, Abel's theorem can be stated as follows: D is the principal divisor class if and only if X,(D) = 1 for every 11. This result shows that the 1- dimensional homology group with integral coefficients and the group of the divisor classes of degree zero (with compact topology) are, with respect to integral characters, mutually dual (in the sense of Pontryagin) topological Abelian groups ( 422 Topological Abelian Groups). For the relationship between Abelian integrals and Jacobian varieties, in particular the tJacobi inverse problem, tAbelian func- tions, and tRiemann theta functions,  3 Abelian Varieties L. Also  references to 234 Kleinian Groups, 352 Quasiconformal Map- pings, 367 Riemann Surfaces, 416 Teichmiiller Spaces, References [IJ B. Riemann, Gesammelte mathematische Werke und wissenschaftlicher Nachlass, Teub- ner, 1876 (Collected works of B. Riemann, Dover, 1953), [2J P. E. Appel1 and E. Goursat, Theorie des fonctions algebriques et de leurs integrales, Gauthier-Villars, second edition, 1929. [3J R. Dedekind and H. Weber, Theorie der algebraischen Funktionen einer Veranderli- chen, J. Reine Angew. Math., 92 (1882), 181- 290. (Gesammelte math. Werke, F. Vieweg, 1930, voL 1, p, 2388350,) [4J K. Hensel and G. Landsberg, Theorie der algebraischen Funktionen einer Variablen und ihre Anwendung auf algebraische Kurven und Abelsche Integrale, Teubner, 1902 (Chelsea, 1965), [5J W. F. Osgood, Lehrbuch der Funktionen- theorie II, Teubner, 1932. [6J H. Weyl, Die Idee der Riemannschen Flache, Teubner, 1913, third edition, 1955; 12 A Algebraic Geometry English translation, The concept of a Riemann surface, Addison-Wesley, 1964. [7J M. Eichler, Einfiihrung in die Theorie der algebraischen Zahlen und Funktionen, Birk- hiiuser, 1963, [8J C. Chevalley, Introduction to the theory of algebraic functions of one variable, Amer. Math. Soc. Math. Surveys, 1951. [9J C. Siegel, Topics in complex function theory II, Wiley-Interscience, 1971. For history and further references, [ 10J A. Brill and M. Noether, Die Entwick- lung der Theorie der algebraischen Funk- tionen, Jber. Deutsch. Math. Verein., 3 (1894), 107-566. For the classical theory of algebraic functions of two variables, [IIJ E. Picard and G. Simart, Theorie des fonctions algebriques de deux variables inde- pendentes, Gauthier-Villars, I, 1897; II, 1906. 12 (VII1.1) Algebraic Geometry A. Introduction Algebraic geometry is the branch of mathe- matics that deals with talgebraic varieties, that is, point sets detlned by several algebraic equa- tions in a space of any dimension or those derived from these sets by means of certain constructions (- 16 Algebraic Varieties). It may also be considered to be a theory of the tfield of algebraic functions in several variables in geometric language, and it is closely related to the theories of complex analytic manifolds, commutative algebra, and homological alge- bra. It also has an important connection with number theory through the theories of automorphic functions, Diophantine equa- tions, and zeta functions. To investigate local properties of algebraic varieties we consider varieties embedded in an taffine space; to study global properties we usually consider varieties contained in tprojec- tive spaces. A quantity (or property) that is invariant under tprojective transformations, isomorphisms, i,e., tbiregular and tbirational transformations, or birational transformations is called a tprojective invariant, a relative invariant, or an absolute invariant (birational invariant), respectively, The study of projective invariants is a part of projective geometry, whose methods are important in algebraic geometry. The notions of relative invariant and absolute invariant are used, for example, in the classification of algebraic varieties. 
12B Algebraic Geometry We usually assume that the coordinates of each point of the variety belong to a certain fixed tfield K. In the classical case, namely, when the field K is the field C of complex numbers, the algebraic varieties are considered as complex spaces and are studied by apply- ing the theories of partial differential equa- tions, tdifferential geometry, etc. Topological methods may also be applied. Algebraic geom- etry originated from such studies, but, for the study of properties such as trational mappings or talgebraic systems, it became necessary to consider as well the case where the ground field K is not talgebraically closed. Further- more, to apply this to number theory, it is necessary to establish the theory over the field of any tcharacteristic p. For this purpose it is necessary to establish a theory for varieties having ground domains as general as possible. B, History Analytic geometry began with the study of lines and quadratic curves (surfaces) and later came to include the study of cubic and quartic curves (surfaces), and so on. These subjects originally belonged exclusively to analytic (or projective) geometry. At that time, the study could not have been described by so specific a title as algebraic geometry. The study of such theories as the construc- tion of an algebraic plane curve by families of curves of lower degree or the talgebraic m-n correspondence on a straight line prob- ably began with research such as that by M. Chasles, The most outstanding event in the history of algebraic geometry was the intro- duction and development of the theory of algebraic functions (- 11 Algebraic F unc- tions) by B. Riemann (1857). Before that time the degree of an algebraic curve (surface) was the only quantity known to be a projective invariant of the curve (surface). With the theory of algebraic functions, Riemann gathered into one family all the curves that can be transformed onto each other by birational transformations. As the basis for his study, Riemann examined bira- tional transformations in place of projective transformations. This idea led to the notion of the so-called tRiemann surface. The tgenus of the surface was obtained as the characteristic number of the family of curVes, The concept of genus was the first absolute invariant to appear in the history of algebraic geometry. Riemann based his theory on tAbelian inte- grals using tDirichlet's principle, under the assumption that any algebraic curve reduces to one without tsingularities. After Riemann many mathematicians tried 40 to reconstruct the theory more precisely with- out using transcendental methods. M. Noether attempted this reconstruction by using geo- metric methods. Using the tCremona trans- formation, he confirmed Riemann's assump- tion for curves: that any algebraic curve on a plane can be transformed by a birational transformation to a plane curve without sin- gularities except for simple tnodes. He also contributed in making more precise the basis conditions for the tRiemann-Roch theorem, which is considered to be one of the most important theorems in the field. His results on space curves and surfaces are also note- worthy. J. Plucker detlned the concept of genus in geometric terms and introduced the tPlucker coordinates. A. Cayley and A. Brill worked along similar lines, Cayley's idea was developed later by B. L. van der Waerden and W. L. Chow, who introduced the tassociated form of an algebraic variety and its tChow coordinates. Around 1890 the Italian school of algebraic geometry appeared. Following the tradition established by Noether, they employed alge- brogeometric methods and uncovered many new facts conc=ing algebraic surfaces. Among those who belonged to this school were G. Castelnuovo, F. Enriques, and F. Severi. In France, H. Poincare and E. Picard ini- tiated their study of algebraic functions of two complex variables. After them S. Lefschetz investigated the theory of complex algebraic surfaces [11, 123. The results attained by the Italian and French schools were very sugges- tive but lacked rigorous foundations. On the o'ther hand, rigorous number- theoretic theories of algebraic curves appeared in Germany. R. Dedekind and H. Weber de- veloped the theory of algebraic function fields parallel to that of talgebraic number fields. K. Hensel introduced the concept of tp-adic num- bers in analogy to tpower series expansions of analytic functions. E. Noether constructed an abstract theory of tpolynomial ideals from a formal theory by E. Lasker and F. S. Macau- ley. Under her influence there appeared the arithmetic algebraic geometry (of curves) over an abstract tleld as developed by F. K. Schmidt and others. In the higer-dimensional case, van der Waerden attempted to create a more rigorous foundation for algebraic geometry under the influence of Noether's abstract ideal theory (c. 1930) [14]. He introduced the concept of tgeneric points and tspecialization, and specifi- cally defined the tmultiplicity of intersections of two varieties in a projective space. He succeeded in getting a rigorous proof of Be- zout's theorem: In n-dimensional projective 
41 space, the number of intersections of an r- dimensional algebraic subvariety of degree with an (n - r)-dimensional subvariety of degree m is always 1m if they intersect in only a finite number of points. The problem of intersections was taken up by C. Chevalley and A. Weil in the 1940s. Cheval ley developed the ideal theory of tlocal rings (studied initially by W. Krull); he intro- duced topological concepts and applied them to the problem of intersections. The theory in this direction was later extended further by P. Samuel, M. Nagata, and J.-P. Serre. Weil gave foundations of algebraic geometry over an abstract field and reconstructed the theory by introducing geometric language to designate objects of abstract algebra [15]. He thus gave quite a new aspect to the theory and extended H. Hasse's arithmetization of the theory of algebraic functions in one variable to the case of several variables. Reconstructing Severi's theory of algebraic correspondence over abstract fields, he succeeded in proving an analogy of the tRiemann hypothesis on tcongruent zeta functions (- 450 Zeta Func- tions N). He also constructed, purely algebrai- cally, the entire theory of tAbelian varieties independent of characteristic. Around 1930, O. Zariski gave another foundation to algebraic geometry by applying the generalized +Valuation theory that had been introduced by Krull. Zariski clarified especially the properties of birational trans- formations by using valuation theory. Zariski's main theorem states that if a birational map- ping is not tregular at a tnormal point P (- 16 Algebraic Varieties 1), each component of the image of P by the mapping is of dimension 1. Zariski also solved the problem of treso- lution of singularities in the affirmative in the case of characteristic 0 for varieties of dimen- sion  3. The affirmative resolution of this problem (which Riemann assumed) says that any algebraic variety in a projective space Can be transformed birationally to a projective algebraic variety without singularities. In 1964, H. Hironaka gave an affirmative answer for any dimension in the case of char- acteristic O. Along with the achievements in algebraic methods, great development took place in analytic methods. Unification of the concepts of Riemann surfaces and tRiemannian mani- folds led to the concept of komplex analytic manifolds. Furthermore, G. de Rham's theo- rem on the duality of topologically defined homology and cohomology based on differen- tial forms was proved; also, W. V. D. Hodge's theory of tharmonic integrals was developed. In the case of the complex dimension 1, any 12 B Algebraic Geometry compact Riemann surface is derived from a certain projective algebraic curve, However, the situation is not so simple in the case of higher dimensions. Weil's concept of tabs tract complete algebraic varieties can be consid- ered as an analog of compact complex mani- folds. If a compact complex analytic variety is projective, then it must be an algebraic variety (tChow's theorem). K. Kodaira proved that a necessary and sufficient condition for a com- pact complex analytic manifold to be biholo- morphically equivalent to a projective com- plex analytic manifold is that the manifold is a tHodge variety. Using results on harmonic integrals, J. Igusa and Weil established the theory of +Picard and t Albanese varieties associated with algebraic manifolds of arbitrary dimensions as a gen- eralization of the theory of tJacobian varieties associated with algebraic curves (- 9 Alge- braic Curves E). Thus many ambiguities in the theory as developed by the Italian school were clarified. Later the theory was gen- eralized to the case of characteristic p by T. Matsusaka, Chow, and S. Lang. The duality theorem in this case was later proved by M. Nishi and P. Cartier (- 3 Abelian Varieties D). The concept of tsheaves (- 383 Sheaves) had already been used in Kodaira's theory. Serre defined an abstract algebraic variety as a tringed space by using an analogy to the con- cept of complex analytic spaces; he considered it as a topological space with respect to the tZariski topology. By introducing tcoherent algebraic sheaves, Serre c1aritied the idea that classical invariants (such as tarithmetic genus) may be considered cohomological quantities ( 16 Algebraic Varieties E). A. Grothendieck invented the concept of a tscheme, which is far more general than that of an algebraic variety, by admitting the existence of tnilpotent elements in structure sheaves and taking as a coordinate ring a general commu- tative ring with unity element. By the device of taking into al:l:ount nilpotent elements, an analog of the method of successive approxima- tion in analysis is now applicable. By master- ful use of cohomological techniques, Grothen- dieck derived many results, including Zariski's important theorems. In the classical case, F. Hirzebruch gen- eralized the Riemann-Roch theorem to higher- dimensional manifolds. He made use of the language of sheaves and some topological results of A. Borel and R. Thorn [S]. Later Grothendieck generalized the theorem for the abstract case as well. His idea in this work is recognized as the origin of t K-theory. Every nonsingular complete curve of genus o is isomorphic to the projective line, and any 
12 Ref. Algebraic Geometry nonsingular complete curve of genus 1 is iso- morphic to a projective curve del1ned by the equation xixo=xl(X I -X,)(X, -AX o ) for some A '# 0, 1. On the other hand, the set of all isomorphism classes of nonsingular complete curves of genus g> 1 is parametrized by a normal quasiprojective variety of dimension 3g - 3. Such facts were first discussed by Rie- mann as the problem of moduli. Concerning the moduli of manifolds of higher dimensions in the classical case, Kodaira and D. C. Spencer developed their theory of tdeformations of complex structures (--" 72 Complex Manifolds G). The meaning of number of tmoduli is clarified by deformation theory. Deformation theory has been extended in various ways, and deformation is considered as one of the fundamental concepts in alge- braic geometry. To investigate the global structure of the moduli varieties, D. Mumford introduced geometric invariant theory (- 16 Algebraic Varieties W). Etale and crystalline cohomologies initiated by Grothendieck and others are useful for the study of algebraic varieties of positive charac- teristic. In particular, the conjecture made by Weil concerning tcongruent zeta functions has been solved affirmatively by P. Deligne with the help of etale cohomology [2J. Many important questions have been answered by means of the geometric theory. For example, (1) every tvector bundle on Ak is trivial (- 16 Algebraic Varieties Z); (2) there exist tunirationa1 but nonrational fields over C (- 16 Algebraic Varieties J); (3) the funda- mental groups of the complements of node- curves on p are commutative (- 16 Alge- braic Varieties I); (4) the cancellation theorem holds for A (--" 15 Algebraic Surfaces H); (5) whenever the tangent vector bundles are am- ple, the varieties are P k (--" 16 Algebraic Var- ieties R), References [IJ G. Castelnuovo, Sulle funzioni Abeliane, Memorie scelte, Zanichelli, 1937, 5299549. [2J P. Deligne, La conjecture de Weil 1, Pub\. Math. Inst. RES, 43 (1974),2733307. [3J J. Dieudonne, Cours de Geometrie Alge- brique, I. Apen;u Historique sur Ie Devel- oppement de la Geometrie Algebrique, Presses Univ. France, Collection Sup" 1974. [4] F. Enriques, Memorie scelte di geometria, Zanichelli, 1,1956; II, 1959; III, 1966. [5J A. Grothendieck, The cohomology theory of abstract algebraic varieties, Proc, Intern. Congr. Math., Edinburgh, 1958, Cambridge Univ. Press, 1958, 103-118. 42 [6J A. Grothendieck, Elements de geometrie algebrique, Pub\. Math. Inst. RES, 4 (1960); 8 (1961); 11 (1961); 17 (1963); 20 (1964); 24 (1965); 28 (1966); 32 (1967), [7J H. Hironaka, Resolution of singularities of an algebraic variety over a field of character- istic zero, Ann. Math., (2) 79 (I964), 1099326. [8J F. Hirzebruch, Topological methods in algebraic geometry, Erg. Math., Springer, third edition, 1966, [9J K. Kodaira, Collected Works I, II, III, Iwanami, 1975. [1 OJ S. Lefschetz, L'analysis situs et la geo- metrie algebrique, Gauthier-Villars, 1924. [ 11 J E. Picard and G. Simart, Theorie des fonctions algebriques de deux variables inde- pendantes, Gauthier-Villars, I, 1897; II, 1906. [ 12J D. Quillen, Projective modules over polynomial rings, Inventiones Math., 36 (1976), 1677171. [ 13J J. G. Semple and L. Roth, Introduction to algebraic geometry, Clarendon Press, 1949. [ 14J B. L. van der Waerden, Einflihrung in die algebraischen Geometrie, Springer, 1939. [ 15J A. Weil, Foundations of algebraic geome- try, Amer. Math. Soc. Colloq. Pub\., 1946, 1962, [16J A. Weil, Oeuvres Scientifiques, Collected Papers I, II, III. Springer, 1980. [ 17J O. Zariski, Collected papers, MIT Press, I, 1972; II, 1973; III, 1978; IV, 1979. 13 (IV.1 1) Algebraic Groups A. Definitions and General Remarks [1,2, 8J Let k be a field and Q a tuniversal domain containing it. An affine algebraic group G defined over k is, by definition, a group G which has the structure of a (not necessarily irreducible) talgebraic variety defined over kin an affine space QN such that the group oper- ation (x, y) f-> X -1 Y on G is an everywhere tregular trational mapping defined over k For such a group G, the set G k of all k-trational points on G is an abstract group. The tirreduc- ible component Go of G (viewed as an alge- braic set) containing the identity element e is unique and is a normal subgroup defined over k with finite index in G; the decomposition of G into (absolute) irreducible components coincides with that into the co sets of G by G,. When G = Go, the group G is called connected, It should be noted that for an algebraic group G defined over k, a coset gG o is not necessarily del1ned over k,' and if it is, a representative g cannot necessarily be taken to be k-rationa\. 
43 However, if k is an infinite field and G is con- nected, and if moreover k is tperfect or G is treductive, then G k is tlariski dense in G ([9J, A. Borel and T. A. Springer [30J); hence G is uniquely determined by G k , (In this case, G k is sometimes referred to as a k-group.) When k is a ttopological tleld, G k is a +topological group with respect to the natural topology de- fined by that of k, which is generally stronger than the Zariski topology on G k . For instance, when k = R, (GO)R is a tLle group with finitely many connected components. A subgroup H of an affine algebraic group G, which is Zariski closed, is an affine alge- braic group with respect to its natural induced structure and is called an algebraic subgroup of G. If H is defined over k, then His k -closed; the converse is also true when k is tperfect. (An affine algebraic set A is called k-c1osed if A is a commOn zero of a set of polynomial equations with coefficients in k The set A is k-c1osed if and only if all irreducible com- ponents of A are detlned over the algebraic closure k of k, and for every tGalois auto- morphism (J of k/k, A a = A.) The notions of homomorphism and isomorphism for alge- braic groups can be detlned in a natural manner. For instance, for affine algebraic groups G and G' defined over k, a rational bomomorpbism ({J : G -! G' defined over k, or a k-morpbism for short, is a homomorphism of G into G' that is at the same time an (every- where regular) rational mapping defined over k For a k-morphism cp of G into G', the (set- theoretic) image <p(G) is a closed subgroup of G' defined over k, the kernel rp -I (e') is a k- closed subgroup of G, and dim cp(G) = dim G- dim <p-t (e'). In particular, when dim G = dimcp(G)=dim G' (or equivalently, when <p(G o )= G and <p -l(e') is finite), <{J is called an isogeny. (Two groups G and G' are called isogenous if there exist a third group G" and isogenies Goo -->G, Goo -->G',) When a k-morphism rp is bijective and rp -I is also an (everywhere regular) rational mapping defined over k, cp is called a birational isomorphism defined over k, or a k-isomorphism for short. It should be noted that a rational homomorphism which is an isomorphism of abstract groups is not necessarily an isomorphism of algebraic groups (e.g., a Frobenius homomorphism); a similar statement holds for an injective con- tinuous homomorphism of topological groups. Given a connected affine algebraic group G and a closed subgroup H, both defined over k, the quotient space G/ H has the uniquely deter- mined structure of an algebraic variety defined over k such that the canonical mapping G--> G/H is separable. The tfunction field of G/H is then identified with the subfield of the func- tion field of G formed by all H-invariant e1e- 13e Algebraic Groups ments. In particular, if H is a closed normal subgroup, then G/H has a natural structure of an affine algebraic group (defined over the same ground field k) [8, 10, 11]. As an example, we have the group GL(n) of all n x Il nonsingular matrices (xij) GL(n) may be viewed as an algebraic set in on 2 +1, defined by a single equation det(x i ) y = 1, and as such is a connected algebraic group defined over the tprime field. In general, an algebraic group reahzed as a closed subgroup of GL(n) is called a linear algebraic group. Since an affine alge- braic group is always isomorphic to a linear algebraic group, these two terminologies are essentially synonymous [1,2]. B. Generalization of the Definition Replacing the term affine algebraic set (or affine cariety) in the definition of an affine algebraic group by a more general term t alge- braic variety, we obtain the notion of an algebraic group (algebraic group variety, or simply group variety). On this subject, the following facts are fundamental. A tcomplete connected algebraic group is an Abelian variety ( 3 Abelian Varieties). More gener- ally, given a connected algebraic group G defined over k, there always exists a (k-c1osed) largest linear connected closed normal sub- group L, and the factor group G / L is an Abelian variety. Furthermore, for a closed normal subgroup H of G, the factor group G/H is complete if and only ifH:::>L(Che- valley's theorem [8J), In particular, if a con- nected algebraic group G is comp1ete and linear at the same time, then G reduces to the identity group. In view of these theorems, the study of algebraic groups can be reduced, in a sense, to the study of Abelian varieties and linear algebraic groups. For this reason, we henceforth restrict ourselves to linear algebraic groups, which are simply called algebraic groups. (The notion of tgeneralized Jacobian variety, introduced by M. Rosenlicht [12J, is an example of an algebraic group in a general sense;  9 Algebraic Curves.) The notion of algebraic groups has been generahzed further to that of tgroup schemes by A. Grothendieck [3,4]. C. Lie Algebras Since an algebraic group G defined over k has no singularities, the ttangent space 9 to G at the identity element e is detlned and has the same dimension as G : dim 9 = dim G. The space 9 can be identified in a natural manner with the space of a]] 1eft-invariant tderivations of the function fie1d of Go and thus has the struc- 
13D Algebraic Groups ture of a Lie algebra defined over k (  248 Lie Algebras). We call !'! (the Lie algebra gi over k of all k-rational points in g) the Lie algebra of G (of the k-group Gd. If G is a linear algebraic group contained in GL(n), then gk is a Lie subalgebra of gl(n, k) with the Lie product defined by [x, y] = xy - yx; a linear Lie algebra corresponding to a linear algebraic group is called an algebraic Lie algebra. When the characteristic of k is zero, conditions for a linear Lie algebra to be algebraic Can be given in terms of the replica [5]. Also, in the case of characteristic zero, for x E gl(n, k), x E gk if and only if exp(tx)E G, where t is a variable OVer k and exp(tx) is understood as a tformal power series in t (contained in Q), From this, we can prove, exactly as in the theory of Lie groups ( 249 Lie Croups), a one-to-one correspon- dence between k-c\osed subgroups H of G and algebraic Lie subalgebras k of gb establish- ing a complete parallelism of the theories of algebraic groups and Lie algebras [5]. This parallelism breaks down when k has positive characteristic [I, 2]. On the other hand, over a field of characteristic p > 0, we have formal groups, an analog of local Lie groups intro- duced by J. Dieudonne [ 13], and also thyper- algebras, which play the role of Lie algebras in characteristic O. D. Tori [1,2J The group G m = GL(1), the multiplicative group of nonzero elements in Q, is a 1- dimensional connected algebraic group defined over the prime field. In general, an algebraic group G that is isomorphic to the direct prod- uct (G m )" is called an (algebraic) torus. When a torus G defined over k is isomorphic to (G m ) over an extension K of k, G is called K-trivia1 (or K-spJit), and the field K is called a splitting field for G. A torus G defined over k always has a splitting field K which is a finite separable extension of k In general, a rational homomorphism X of an algebraic group G into G m is called a char- acter of G,If we define the sum of two charac- ters Xl and X2 ofG by (Xl +X2)(g)=Xl(g)' X2(g) (g E G), the totality of characters of G is an additive group, called the character module of G and denoted by X(G). Let G be a torus defined over k and X =X(G) its character module, and let K be a splitting field for G that is a finite tGalois extension of k If a K- isomorphism G(Gm)" is given by the corre- spondence G3g-->(Xl(g), ,Xn(g), then the Xi are characters of G, and X is a tfree module of rank /J generated by XI' '" , Xn' Furthermore, if r denotes the Galois group of Klk, then for O"E rand XEX, the conjugate X G is also a char- 44 acter of G; under this action of r, X becomes a right f-module. We have complete duahty between a torus G and its character module X in the following sense. There exists a one-to- one correspondence between the closed sub- groups G l (defined over k) of G and a (f- invariant) submodule X I of X for which XIX, has no p- ttorsion (where p is the character- istic of k). This correspondence is determined by the relation of the annihilators X I = Gt, G 1 = X t, and under the correspondence the character modules of G l and of GIG I are canonically identiiled with XIX, and X I, respectively. Furthermore, let G' be another torus (defined over k and split over K) with the character module X', and suppose that we have a (k-) homomorphism ((i: G--.G', Then we can define a (f-) homomorphism '((i:X'-->X, called the dual homomorphism of ((i, by the relation \p(x') = x' 0 I.p for x' EX'; conversely, any (f-) homomorphism of X' into X is ob- tained uniquely in this manner. In particular, 'P is a (k-) isomorphism if and only if its dual t((i is a (f-) isomorphism. Since for any free (f-) module X of finite rank there always exists a torus G (defined over k and split over K) such that X(G)  X (as a special case of the existence theorem of k-forms), the tcategories of all tori (defined over k and split over K) and that of all free (f-) modules of finite rank are mutually dual. E. Semisimp1e Elements and Unipotent Elements A matrix a is called tsemisimple if it is diago- nalizable, i.e" if the tminimal polynomial of a has only simple roots. A matrix a is called tunipotent if a 1 is nilpotent, i,e., if all charac- teristic roots of a are equal to 1, (When the characteristic of the ground field is zero, the unipotent elements u in G L(n, k) and the nilpo- tent elements x in gl(n, k) are in one-to-one correspondence by the relation u = expx.) Any nonsingular matrix a can be written uniquely as a product of a nonsingular semisimple ma- trix a' and a unipotent matrix a" of the same size which are mutually commutative: a = u'a" = a" a' (tmuItiplicative Jordan decompo- sition); a' (a") is called the semisimp1e (unipo- tent) part of a and is denoted by a, (a,); a, can be expressed as a polynomial of the matrix a with scalar coefficients. For an element a of a (linear) algebraic group G, the semi simplicity (unipotency) of a does not depend on the matrix representation of G, Moreover, these properties are preserved by homomorphisms of algebraic groups. Also, if aE G, then a" auE G, For an algebraic group G, we denote the 
45 totality of semisimple (unipotent) elements contained in G by G s (G u ) and call it the semi- simple (unipotent) part of G, (Note that G, and G u are not necessarily subgroups.) A torus G is then characterized by the property that G = Go = G,. On the other hand, an algebraic group G such that G = G u is called unipotent. For instance, the additive group of the universal domain, Ga{C )IXEQ}, is a 1 -dimensional connected unipotent alge- braic group. F. Solvable Groups and Nilpotent Groups [1,2,7,9J For two closed normal subgroups HI' H 2 (deilned over k) of an algebraic group G, the tcommutator group [HI, H 2 J (in the sense of abstract group theory) is also a closed normal subgroup (defined over k) of G. In view of this fact, an algebraic group G is called solvable (nilpotent), when it is tsolvable (tnilpotent) as an abstract group. For example, the totality T(n) of n x n nonsingular upper unipoteni ma- trices, i.e., matrices of the form * "' , ( o' ), is a connected solvable algebraic group. A unipotent algebraic group is always nilpotent. For any connected solvable algebraic group G c GL(n), there always exists an element a in GL(n) such that a-I Ga c T(n) (Lie-Kolcbin tbeorem [ I, 2J), A connected solvable algebraic group G has a tcomposition series G = Go :J G j  ".  G, = {e} such that each G i is a connected closed normal subgroup of G and Gi-dG i is isomorphic to either G m or G,. If G is de- fined over k, the subgroup G u is a connected k- closed normal subgroup of G, and for any maximal torus T in G, we have a decompo- sition into a tsemidirect prod uct G = T. G u (in the sense of algebraic groups, i.e., the natural map T x Gu-->G is birational). It is known that for any algebraic group G defined over k there exists a maximal torus defined over k ([4J, A. Borel and T. A. Springer [30J), G is nilpotent if and only if G has a unique maxi- mal torus T; when that is so, T= G, and Tis contained in the tcenter of G. For a connected solvable algebraic group G defined over k, we can take a E GL(n, k) such that a-I Ga C T(n) (see the Lie-Kolchin theorem) if and only if all characters X EX (G) are defined over k; when this condition is satisfied, G is called k-so1vab1e. G u is then defined over k, and GIG u is a k- trivial torus. 13G Algebraic Groups When the characteristic of k is zero, any commutative unipotent algebraic group (de- fined over R) is (k.) isomorphic to the direct product (Ga)n, When k is an algebraically closed field of characteristic p > 0, any con- nected commutative unipotent algebraic group defined over k is k-isogenous to a direct prod- uct of a certain number of the groups W m of tWitt vectors (of length m) (Chevalley-Chow theorem [ 12J), A 1 -dimensional connected unipotent algebraic group detlned over a per- fect field k is k-isomorphic to G a [1,2]. G. Borel's Theory Let G be an algebraic group and V an alge- braic variety (both defined over k), We say that Vis a transformation space of G (defined over k), or simply G acts on V, if there is given an everywhere regular rational mapping G x V3(g, V)-->gvE V (defined over k) such that gdg2V)=(gjg2)V, ev=v (gl,g2EG,VE V), When the action of G on V is ttransitive, V is called a thomogeneous space of G. For a closed sub- group H of a connected algebraic group G (both defined over k), the quotient space GI H has the natural structure of a homogeneous space of G (deilned over k), A. Borel [ 1, 2J proved the following theorems: (1) If G is a connected solvable algebraic group and Va complete transformation space of G, then G has at least one fixed point in V, More precisely, in order that a connected algebraic group G defined over k be k-solvable, it is necessary and sufficient that for any com- plete transformation space V of G defined over k for which V k i= 0, G have at least one k- rational fixed point in V [9]. (2) Let G be a connected algebraic group. A maximal connected solvable closed subgroup of G is called a Borel subgroup of G. Then (i) all pairs (T, B) formed by a maximal torus Tin G and a Borel subgroup B containing it are con- jugate to each other with respect to inner auto- morphisms of G. (ii) For a closed subgroup H of G, the quotient space G/ H is complete if and only if H contains a Borel subgroup ofG; and, when that is so, G/H is actually a tprojective algebraic variety. (For instance, ifG= GL(n),B = T(n), thenG/B is a so-called ttlag manifold.) (iii) The conjugates of B(T) cover the whole group G (G s ). A closed sub- group of G is called parabolic if it contains a Borel subgroup of G. A parabolic subgroup H coincides with its own tnormalizer N(H); in particular, H is always connected. Parabolic subgroups are significant in the theory of automorphic functions. (For the parabolic subgroups associated with BN-pairs  Sec- tions Q, R.) 
13H Algebraic Croups When G is a connected algebraic group detlned over a perfect field k, proposition (i) can be sharpened: The pairs (A, H) formed by a maximal k-trivial torus A in G and a maxi- mal connected k-solvable subgroup H con- taining it are conjugate to each other with respect to the inner automorphisms detlned by elements in G k . The normalizer N(ll) of such a k-solvable subgroup H is a minimal k-c1osed parabolic subgroup of G. When the maximal connected k-solvable subgroups of G are re- duced to the identity group, G is called k- compact or k-anisotropic. (OthelWise, G is called k-isotropic.) For instance, the tortho- gonal group G = SO(n, f) of a tquadratic form f of n variables is k-compact if and only if the form f is anisotropic, i.e" the homogeneous equation f= 0 has no solution other than zero in k Similar facts hold for other classi- cal groups. When k is a tlocal field, G is k- compact if and only if G k is compact as a topological group. In general, a k-compact group is treductive, H. The Wey1 Group Let G be a connected algebraic group and Q an arbitrary torus in G. The tcentralizer Z(Q) of Q is then connected and coincides with the connected component of the normalizer N(Q). Hence the factor group W= N(QJjZ(Q) is finite and can be identified with a subgroup of the automorphism group of Q (or of its character module X(Q» in a natural manner. The group W is called the Wey1 group of G relative to Q. In particular, when Q = T (a maximal torus), the order of W is equal to the number of Borel subgroups containing T In this case, the cen- tralizer C = Z(T) is called a Cartan subgroup of G; it is characterized by the property that C is a (maximal) connected nilpotent closed sub- group of G which coincides with the connected component of its own normalizer N(C). The notions of Borel subgroups, Cartan subgroups, and maximal tori are preserved under rational homomorphisms of algebraic groups. 1. Semisimp1e Groups and Reductive Groups In an algebraic group G defined over k, there exists a largest connected solvable closed normal subgroup R, called the radical of G. The unipotent part Ru of R is called the unipo- tent radical of G, When R = { e}, G is called semisimp1e. WhenR is a torus, namely, Ru = {e], G is called reductive. Semisimplicity and reductiveness are preserved under forming a direct product and taking the image (or in- verse image) of an isogeny. For a reductive group G, the +commutator subgroup D(G) is 46 . semisimple, and G = D( G)' R, D(G) n R = finite; in other words, G is isogenous to the direct product of a connected semisimple algebraic group and a torus. In general, if R is the rad- ical of a connected algebraic group G and Ru is the unipotent radical of R, then the factor groups GjR, GjRu are semisimple and reduc- tive, respectively. Furthermore, if the charac- teristic of the field k is zero, there exists a reductive closed subgroup H of G such that G decomposes into a semidirect product G = H. Ru (Chevaliey decomposition [5J), (In this case, Rand Ru are k-c1osed, and H can be taken to be k-rational; such an H is unique up to inner automorphisms defined by elements in G k .) Also in the case of characteristic zero, reductive algebraic groups are characterized by the property that all rational represent a - tions are completely reducible. But when k has the characteristic p > 0, this property char- acterizes tori (M. Nagata). J. Root Systems [1,2, 14J Let G be a connected semisimple algebraic group, T a maximal torus, and X =X(T) its character module. A character :J. E X is called a root of G relative to T if there exists an iso- morphism x, of G a onto its image in G such that t- l x,()t=x,(ex(t)) for a1l EGa' tE T. For a root iX, such an isomorphism Xx is uniquely determined up to a scalar multipli- cation in G a ; hence we put P, = x,(G a ), If we denote by r the totality of roots (rela- tive to T), r satisfies the following axioms, where E = X (8) Q and E* is the tdual space of E with respect to the inner product ( ): (i) For each rJ. E r, there corresponds rJ.* E E* such that <cx*,cx)=2 and <ex*,{J)EZ for all {JEt, (ii) If we define a reflection W x of E by w,x==x-<cx*,x)ex for xEE, then w,{J E t for all f3 E r. (In particular, W,iX = -exEr.) (iii) If a, fJEr are linearly dependent, then p = ::!::ex, (iv) If dim E= r, r contains r linearly independent elements. In general, a finite subset r in a finite- dimensional vector space E over Q satisfying the axioms (i)-(iv) is called a root system in E, (This root system is sometimes said to be reduced, to distinguish it from the root system defined in Section Q, which does not satisfy axiom (iii).) For a root system r, the elements cx* of E" corresponding to rJ. E r are uniquely determined by these conditions, and the set r* = {ex*} is a root system in E" (ei" Er* is called a coroot). Also, the group W of linear transformations of E (E*) generated by w, 
47 (W,*) with (,(0 is finite and is called the Wey1 group of the root system t, If we identify E* with E by means of any W-invariant (positive definite) metric on E, then a* ==(2/<(,(, (,(»(,(. When r is a root system of a semisimple alge- braic group, <(X*,X)EZ for all (XEt, XEX, so that X is W-mvanant, and the Weyl group of the root system r can be identified with the group N(T)jZ(T) of Section H. (In general, a maximal torus in a connected reductive alge- braic group coincides with its own centralizer, so that the Weyl group W can be identified with N( T)/T.) When a tlinear ordering (compatible with the addition) is given in E, we denote by t+ the set of all positive roots in r. An element (X E t + is called a simple root if it cannot be written as a = (x' + a" with (X', (X" E t +. If  = { (X I, . .. , a,} is the totality of (distinct) simple roots in t +, the elements (XI" (x, are linearly independent, and any root (,(0 can be written uniquely in the form a = i: L;=I mi(Xi' with miE Z, m i  O. In general, a subset A of r having this property is called a fundamental system; a fundamental system is always obtained in the manner ex- plained from a linear ordering on E. For a fundamental system A, the cone A, in E*, defined as the set of x in E* satisfying the inequalities <lXi' x) > 0 (1  i  r), is called a Wey1 chamber. If we denote by L, the hyper- plane defined by the linear equation (oc, x) = 0 for a root (x. then E* - U,Et L, = U8A8, and W acts tsimply transitively on the set of all Weyl chambers {A 8 }. The Weyl group W is gen- erated by r reflections w" (1 :;;; i:;;; r). In a semisimple algebraic group G, Borel subgroups B containing a (fixed) maximal torus Tare in one-to-one correspondence with the fundamental systems A (or r+) relative to T by the relation Bu = II w + P" where P, = x,(G a ), (More precisely, every element in Bu Can be written uniquely as a product of the elements in Pa> where the ordering of the P, is taken arbitrarily. ) K. Bruhat Decomposition If we take a representative Sw of WE Win N(T), there is a decomposition G== UWEW BSwB (disjoint union). Furthermore, if for w E W we put N - II n. P and in particular N = N = B - andEdnrte b W o the unique element in' W sch that w o L1 == -A, then the element in Bs B can be written uniquely as a product of el:ments in N wwo ' Sw T, N. Hence we have G = U Nwwo'swT'N, WEW which is called a Bruhat decomposition of G, In 13 M Algebraic Croups (I) particular, if we put N' = sW o -I Ns wo (which is the unipotent part of the Borel subgroup cor- responding to -A), then N'TN is a Zariski open set in G, and the natural map N' x T x N --> G is birational. This implies that the function field of G is rational (i.e., tpurely transcendental over Q). 1. Structure of Semisimp1e Groups A subset r I of a root system r is called a closed subsystem if r I I n r = r I , where r I z denotes the submodule of X generated by r I A closed subsystem satisfies conditions (i), (ii), and (iii) of a root system. For a closed subsystem r l of a root system r of a semisimple algebraic group G, the subgroup G(r I) of G generated by the Pa ((,( E r d is a semisimple closed subgroup with a maximal torus TI =(G(rl)n T)o, of which the rOot system relative to  coincides with the restriction of r I on TI and the coroot system can be identified with rf = {(X* I (X E r I}' The subgroup G(rd is normal if and only if r - r 1 is also a closed subsystem; when this is so, G == G(r l )' G(r -rd, G(rdn G(r- rd=finite, All connected closed normal subgroups of G are obtained in this manner. In order that G be simple (sometimes called absolutely simple or almost simple) as an algebraic group (i.e., without proper connected normal subgroups), it is necessary and sufftcient that t be irreduc- ible (i,e" r cannot be decomposed into a dis- joint union of two proper closed subsystems). In general, a root system r can be decomposed uniquely into the disjoint union r = r l U U r, of irreducible closed subsystems r i such that r U U r ( 1:< i :< S ) are also closed subsystems; 1 ! -....;: "'""' correspondingly, G is isogenous to the direct product G I x x G s of (absolutely) simple algebraic groups G j = G(rJ (G is actually a direct product if it is simply connected or an adjoint group.) The subgroups G i are deter- mined uniquely and only by G M. k-Forms [15J Let K be an extension of k and G 1 an alge- braic group defined over K. An algebraic group G defmed over k is called a k-form of G if there is a K-isomorphism f of G onto 1 G 1 , Suppose further that Kjk is finite sepa- rable, and for every Galois automorphism 0" of kjk, put <Pa = r 0 f -1. Then <Pa is an iso- morphism of G I onto Gf, and the lfJa satisfy the relation <P 0 <p, = lfJa,' Conversely, given a collection of isomorphisms { <Pa} satisfying these conditions, there always exists a k-form G (with a K-isomorphism f onto G 1 such that CPa =.f" a r I), which is unique up to k- isomorphism (Weil). In particular, if K/k is a 
13 N Algebraic Groups finite Galois extension with Galois group r, then \ <Pa} is a (continuous) tl-cocycle of r in AutK(Gd (the group of all K-automorphisms of G,), and by the above correspondence the k- isomorphism classes of k-forms G are in one- to-one correspondence with the (continuous) 1- tcohomology classes of the cocycle {<Pa} (in the cohomology set Hl(i,AutdGd)) (- 172 Galois Theory J) To a given finite separable extension Klk of degree d and an algebraic group G] defined over K of dimension n, we can associate a certain algebraic group 91 K / k ( G I ) defined over k of dimension 00, which is obtained from G l by restricting the ground field [ 19]. A more pre- cise definition is as follows. Let {O"I , (12' . , O"d) (0"1 = 1) be a set of automorphisms of Ilk such that 0"; K( 1 ,,:; i ,,:; d) are all distinct. Then one can find a k-form G of G l = nd=1 Gi with an isomorphism f: G-->G l such th'at CPa = Ja oj-I is given by CPa((X i ) = (Xia), where i" is defined by the relation (O'iO') K = O"i" K If we denote by PI the canonical projection of C l Onto its first component G l and put p = PI of then the pair (G,p) is uniquely characterized (up to k- isomorphism) by the following universality property: If G' is any algebraic group defined over k and (p is a K-morphism of G' into G I , then there exists a (uniquely determined) k- morphism ;p of G' into G such that t.p = p 0 ip, The group G (together with p) is denoted by 1tKik(Gd. For the group of rational points, G k = G, K' When the algebraic group G l has some additional structure (such as that of tvector space, talgebra, etc.), then 9'lK/k( Gd automatically has the same kind of additional structure. N. Chevalley's Fundamental Theorems Let G, G' be connected semisimple algebraic groups, and let T( T') be a maximal torus in G(G'), X(X') its character module, r (r') a root system of G (G') relative to T (T'), etc, If we have an tisogeny <P of G onto G' such that <p(T) = T', then there is a bijection ex-->ex' ofr onto r' such that ljJ(ex') = q,rx, where if; is the dual homomorphism of t.p T and q. is a posi- tive integer, which equals 1 if the characteristic is zero and is a power of p if the characteristic is p > O. Conversely, any injective homomor- phism if;: X'-->X satisfying this condition (with respect to a certain bijection rr' and q,) comes from an isogeny <p: G --> G' in the mannel already stated. In particular, t.p is an isomor- phism if and only if IjJ is an isomorphism such that ljJ(r')=r (i.e" q,= 1 for all exEr) [7]. The isomorphism class of G is thus completely determined by the pair (X, r), so that we some- times write G = G(X, r). A connected semi- 48 simple algebraic group G defined over k is called of Chevalley type over k(or k-spJit) if there exists a k-trivial maximal torus T in G. If, in the above theorem, G and G' are of Cheval- ley type over k and T and T' are k-trivial, then the theorem remains true if we replace isogeny by k-isogeny. In particular, the k-isomorphism class of a connected semisimple algebraic group of Chevalley type over k is completely determined by (X, r). Chevalley also showed that, for any pair (X, r) satisfying condition (1) above, there exists a connected semisimple algebraic group G(X, r) of Chevalley type defined over the prime field. Therefore, since the classification of semisimple algebraic groups of Chevalley type is reduced essentially to that of root systems (X, r), it turns out that, over any ground field k, there exist as many connected simple algebraic groups of Cheval- ley type as connected simple complex Lie groups ( 249 Lie Groups; Appendix A, Table 5.1). For a given semisimple algebraic group G = G(X, r) delj'ned over k, put Xo = r, (= the submodule of X generated by r), X O = {XE F I <ex*, x) EZ for all exEI}. Then we have natural isogenies G(X o , r)-->G(X, r)-->G(X o , r) (with q, = I). all of which can be taken to be defined over k The group G(X o , r) (G(Xo, r» is called the simply connected group (the adjoint group) isogenous to G. When the characteristic of k is zero, these isogenies (which are already known in the classical theory of complex Lie groups) are essentially the only possible iso- genies among the semisimple algebraic groups. But when the characteristic is p > 0, there are, in addition to these, the Frobenius homo- morphism (with q, = p) and the following "singular" isogenies (for which q, = 1 or p de- pending on (X): BnpCn, F4 --> F4. (p = 2), G 2 -->G 2 (p = 3). In particular, when k is a finite field, taking the set of fixed points of the singular k- isogenies, we obtain the simple finite groups of M. Suzuki and R. Ree (- 151 Finite Groups). O. Classification Theory A connected semisimple algebraic group G defined over k is called k- (almost) simple if there is no proper connected closed normal subgroup of G defined over k (When G is k- simple and k-split, the factor group D(Gk)1 center is an abstract simple group except for a few special cases [ 17]. For more general results  Section Q.) For a k-simple algebraic group G, let G, be anyone of its absolutely simple components, and let k, be the smallest field of definition for G] containing k Then kllk is a finite separable extension, and Gis k- isogenous to 9'lk,!k(Gd. Hence the problem of 
49 classifying all k-simple groups (up to isogeny) is equivalent to that of tlnding all k,-forms of simple groups of Chevalley type. This latter problem can be reduced, in principle, to the classification of compact k,-forms and that of certain diagrams (i.e., tDynkin diagrams along with an action of the Galois group) [ 16J, J. Tits [30J (- Appendix A, Table 5.1). For instance, when k is a I1nite field (or, more gen- erally, a field of dimension ,,:; 1 [ 15J), there is no compact k-simple group; hence, using a simple classification theory of the diagrams, we can show that the only absolutely simple algebraic groups G deilned over k are either of Chevalley type or of the types introduced by R. Steinberg (denoted by 2An, 2Dn, 3D 4 , 6D 4 , 2E 6 ). Connected semisimple algebraic groups composed of the groups of these types are characterized by the property that they have a Borel subgroup defined over k Such groups are said to be of Steinberg type over k (or k- quasi-split). Absolutely simple algebraic groups over a tp-adic l1eld have been classified by M. Kneser and J. Tits [30,31]. When the charac- teristic of k is not equal to 2, the classification of simple groups of classical type (except for the type D 4 ) is known to be equivalent to that of semisimple tassociative algebras with tinvo- lution [ 18]. A similar relation also holds be- tween some of the exceptional simple groups and tCayley algebras or tJordan algebras (H. Hijikata, T. A. Springer, J. Tits). The following is a list of absolutely simple algebraic groups of classical type. 1. k-fonns of SL(n) (n ) 2), 1.1. G 1k =SL(m,5) = {gE Mm(H) I N(g)= 1}, where  is a tcentral division algebra over k with (:k)=r2, n=mr, and N denotes the treduced norm in Mm()' 1.2. G lk =SU(m, sU)= {!JESL(m,)1 f(gx,gy)=f(x,y) for x, YESt m }, where  is a central division algebra over a quadratic extension Ii of k with an in- volution I of the second kind (which means that {Ek'i ' == 0 = k), (K: k')= r 2 , n = mr, and f is a (nondegenerate) tHermitian form of m variables Over st with respect to the involution i. II, k-forms of SO(n) (n) 3, n #4). Sp(n) (n even, n ) 21, Glk=SU(n,SJ), where R is a central division algebra Over k with an involu- tion I of the first kind (i.e" such that {Ekl (' = (} = k), (: k)= r 2 , n = mr. and I is a nondegenerate c-Hermitian form of m variables over 5t with re- spect to the involution l. In this case, dimgEK I '=} =r(rHo)/2 with <'0= }: I, and G j is a k-form of SO or Sp according as <'<'0 = 1 or - 1. (50(8) may have other k-forms coming from the so- 13 P Algebraic Groups called triality.) The case where the characteristic of k is 2 can also be dis- cussed by a method given by J. Tits (Inventiones Math 5, 1968). When k is a local field or an algebraic num- ber field, the only central division algebra with an involution of the first kind is a tquaternion algebra (and, if I is the "canonical involution," then<.o= 1). P. Algebraic Groups over an Algebraic Number Field Let G be a connected algebraic group detlned over an algebraic number field k of finite de- gree. Let [v} be the totality of tprime divisors (i.e., eq ui valence classes of valuations) of k Taking the trestricted direct product of a family of locally compact topological groups ( GkJ, we obtain a locally compact topological group G A' called the ade1e group of G [19J (- 6 Adeles and Ideles). In particular, when G = G m , the adele group 1= (Gm)A is exactly the tidele group introduced by Chevalley in class field theory. If we identify x E G k with an adele whose components are all equal to x, G k becomes a discrete subgroup of G 4 . Concerning the tlniteness property of GA/Gk> the following results have been obtained [21,22]: A character XEXk(G)( = the module of all k-rational characters of G) gives rise to a (continuous) homomorphism XA: G A --> I = (Gm)A- Put G={gEGAllxA(g)l= I forall XE X,(G)}, where II is the standard norm in [. Then G is tulllmodular, and the quotient space G/Gk is of finite volume with respect to the (unique) invariant measure on it. G/Gk (GA/Gd is compact if and only if the semi- simple part G/R (the reductive part G/RJ of G is k-compact. From the arithmetic point of view, it is important to determine explicitly the volume of G/Gk with respect to the invariant measure normalized in a certain manner; such a volume is called the Tamagawa number of G and is usually denoted by 7(G) [19,23]. For instance, Siegel's formulas on the volume of the fundamental domain of the unit group of a quadratic form j over k are essentially equiva- lent to a theorem on the Tamagawa number stating that r(SOU)) == 2. Let 0 be the tring of integers in k and L an o-Iattice in the vector space on which G is acting. We can define in a natural manner an action of G A on the set of all o-Iattices; then the orbit GAL (GkL) of L with respect to G A (G k ) is called the genus (class) of L. The tstability subgroup G A . L of L in G A is open, and the double coset space G A L \GA/G k is finite (fi- niteness of the class number). Moreover, let {v I' , I',} be the totality of tmfinite prime 
13Q Algebraic Croups divisors of k, and put G", = r1;=1 G ke ' Then G", is a Lie group, and the canonical pojection G L on G", of GA,L C G A is a discrete subgroup of finite type. (In general, (discrete) subgroups of G, which are tcommensurable with G L are called arithmetic subgroups.) As in the adele case, XEXk(G) gives rise to a (continuous) homomorphism Xoo : G", -->(R x )', and if G, = {gEG", IX",(g)l= I}, then the quotient space G/GL is of finite volume. Moreover, G/GL (G, IGd is compact if and only if G/Gk (GAIG k ) is compact. In addition to these, the tapproximation theorem and the tHasse principle are also extended to (classical, or general) algebraic groups (M. Eichler, M. Kneser, G. Shimura, Hijikata, Springer;  [30J), Q. Structure of Reductive Groups [24J Let G be a connected treductive group defined over a field k, Then G has a tmaximal torus defined over k and G is tsplit over a finite tseparable extension K of k The structure of the group G K can be discussed as in Sections T, J, K, and L. Here, we discuss the structure of Gk> i.e., a tk-form of G,. Let S be a maximal k-split torus of G, i.e" a k-subtorus of G that is k-split and maximal with respect to these properties. Any two such tori are conjugate over k, i.e., by an element of G k . Their dimension is called the k-rank of G. To say that G is of k-rank zero is equivalent to saying that G is tk-anisotroplc. The centralizer Z = Z,(S) of S in G is a reductive k-group, and its derived group is k-anisotropic. Let N = N,(S) be the normalizer of S in G. The tWeyl group k W = N /Z relative to S is called the k-Weyl group of G, A k-root of G with respect to S is a nontrivial character of S that appears when one diagonahzes the repreSen- tation of S in the +Lie algebra  of G, S operat- ing via adjoint representation. Denote by kr the set of all k-roots of G with respect to S. There is a decomposition of the Lie algebra  of G: 9 = 90 + I,okr9, where fl,= {X Efll Ad(s)X = :x(s)X for all SES). Then 90 is the Lie alge- bra of Z, and there is a unique unipotent k- subgroup p" of G normalized by S such that its Lie algebra is fI,. The set kr is a troot system in a suitable Euclidean space whose Weyl group is isomorphic to k W; if G is k-split, kf is the ordinary root system, and the P, are as in Section J. In general, kr need not be reduced (i.e., axiom (iii) in Section J need not be satis- fied), nor should dim "= dim P, always be 1. A closed subgroup of G defined over k which is minimal among the parabolic sub- groups of G is called a minimal paraholic k- subgroup of G, (If G is k-split, a minimal para- 50 bolic k-subgroup is a tBorel subgroup of G.) Any two minimal parabohc k-subgroups are conjugate to each other over k, If P is one such subgroup, then there exists a maximal k-split torus S such that P is the semidirect product of the reductive k-subgroup Z = Z,(S) and the tunipotent radical U = R,(P) of P. The ex- pression P = Z V is called a Levi-decomposition of P, and Z is called a Levi-suhgroup of P. Any two Levi-subgroups of P are conjugate by an element in Uk' There is an ordering of kr such that P is generated by Z and P, with li > O. The minimal parabolic k-subgroups containing a given maximal k-split torus S correspond to the tWeyl chambers of kr. They are permuted simply transitively by the Weyl group k W. Fix an ordering of kr, and let kL1 be the tfun- damental system of kr with respect to the given order. For any subset () of kL1, denote by P 9 the subgroup generated by Z = Z,(S) and Pa. where a is a linear combination of the roots of kr in which all roots not in 0 occur with a coefficient  O. Then Pe contains P and, in particular, P", = P. Pe is called a standard para- bolic k-subgroup of G containing P. Any para- bolic k-subgroup of G is conjugate over k to a unique Pe. If S9 is the identity component of n'Ee(kerex), then 5e is a k-split torus of G and Pe = Z(Se)Ru(Pe). This shows that any parabohc k-subgroup of G has a Levi-decomposition and its Levi-subgroups are conjugate to each other over k Let P be a minimal parabolic k- subgroup of G containing a maximal k-split torus S, Put V = R,(P), Z = Z,(S), and N = N,(S). Then, N = NkZ, so G k is the disjoint union over kW of double cosets VknwP k (WE k W), where nw is a representative in N k of WEkW. More precisely, if WEkW, there exist two k-subgroups U and V: such that V = V x V: (product of k-varieties), the map U x P--> VnwP sending (x, y) onto xnwy is an isomorphism, and (Gln=GdP k = U n«V)d, WEk W where n is the projection G-->GIP. This is called a relative Bruhat decomposition. If G is k-split, this gives an ordinary Bruhat decom- position ( Section K). If 0 is a subset of kL1, let We be the subgroup of k W generated by reflections defined by the :x's in O. If 0, 0' are subsets of kL1, then there is a bijection of dou- ble cosets (Pe). \ Gk/(Pw). = We \k W/W w , (Note that all these properties follow from the fact that (P k , N k ) is a BN-pair in G k ;  Section R) If G is k-isotropic, let G + be the subgroup of G, generated by all Vb where U runs over the unipotent radicals of the minimal parabolic k- 
51 subgroups of G. Kneser and Tits conjectured that when G is semisimple and simply con- nected and k arbitrary, G k = G +. Platonov showed that this conjecture fails for some k- forms of SL n (n  2), but it is true in many cases, e.g., when G is tk-split or tk-quasl-split (1. Tits, Sem. Bourhaki 2ge. no. 505, 1976677). In this case, if G is simple as an algebraic group and the cardinality of k is ? 4, then Gk/Z( G k ) is simple as an abstract group. The connected semisimple algebraic k-groups over a finite field k are k -split or k -quasi -split (  Section 0). 1. Tits has given a reduction pro- cess for classifying the groups over an arbi- trary field k. He defined the index of the k- group (to a certain extent a generalization of Witt's theorem characterizing a quadratic form by means of its index and anisotropic kernel) and gave all possible indices of the k- groups and also a complete list in the case of local Held (1. Tits [30, 3IJ). R. Buildings and BN-pairs [25J The origin of the notations of buildings and BN-pairs lies in an attempt to give a systema- tic procedure for the geometric interpretation of the semisimple Lie groups and, in partic- ular, the exceptional groups. The theory has various applications to the groups of Lie types. To describe this precisely, we must introduce a number of deHnitions. A complex A is a set with an order relation c, read "is a face of' or "is contained in" such that for a given element A, the ordered subset SlAY of all faces of A is isomorphic to the set of all subsets of a set. The subset SlAY of A is called a simplex in A. A complex has a smallest element, which we denote by O. For an element A, the number rk A of minimal nonzero faces of A is called the rank of A. Define rk A = sup(rk A I A EI'1). A morpbism (X:I'1-->' (where A and A' are complexes) is a mapping of the underlying sets such that for every A E 1'1, S( A)  S (ex( A)). A subcomplex of A is a com- plex whose underlying set is a subset of A such that the inclusion is a morphism. If A E 1'1, the star St(A) of A is the set of all elements of A containing A. With the order relation induced from A, St(A) is a complex. If BESt(A), the rank of Bin St(A) is called the codimension of A in B and is denoted by codim B A, A complex A is called a cbamber complex if every element is contained in a maximal ele- ment, which is called a cbamber, and if given two chambers C, C', there exists a sequence of chambers C = Co, C, , , C m = C' such that c odil('i_l (C i - 1 n C;)= codim ci (C i - I n C;):;;; 1 for all i= 1,2, , m. A chamber complex is called tbick (resp. tbin) if every element of 13R Algebraic Croups codimension 1 is contained in at least three (resp. exactly two) chambers. An endomor- phism <p of a thin chamber complex A is called a folding if <p2 = i.p and if any chamber on <p() is the image of exactly two chambers in A by 'p. For any folding tp, <p(I'1) is called a root. A thin chamber complex is called a Coxeter complex if for any pair of adjacent chambers C, C', i.e., chambers such that codim c ( C n C') = 1, there is a root containing C and not C'. Let I be a Coxeter complex. For any root (p(I) of I, there is only one root <p'(I), called the opposite of <p(I), such that tp(I) n <p'(l') does not contain any chamber and <p(I) U tp'(l') = L'; there is also an involutive automorphism called the reflection associated with <p that transforms <p(L') onto <p'(E). The group W(C) generated by all reflections of I is called the Wey1 group of E. which turns out to be a Coxeter group, i.e., there exist sets I and {m;j}, (i, i) E 1 x I, where mij are integers or XJ and m ii = 1, such that the group is presented by the generators {riLEI and the fundamental relations (rirj)m i ' = 1, i, j E I, mij # CJJ. A Coxeter complex is called irreducible if it is not the join of two nonempty Coxeter subcomplexes. A building is a thick chamber complex A with a system III of Coxeter subcomplexes, called the apartments of A such that (i) every two simplexes of A belong to an apartment. (ii) If I, I' E 121, there exists an isomorphism of I onto E that fixes I n L' (elementwise). Since the apartments of a building are isomorphic to each other, we can define the Weyl group, rank, and irreducibility of the building to be those of its apartment. A building with finitely many chambers is said to be of spberica1 type. If a building can be realized as a simplicial decomposition of a Euclidean space, then one says it is of Eucildean type. Examples of buildings are provided by BN- pairs. A BN-pair or Tits system in a group G is a system (B, N) consisting of two subgroups of G such that (BNO) B and N generate G; (BNl) BnN=H<JN;and (BN2) The group W= N/H has a generating set R such that for any r E R and any WE W, (BN2') rBw c BwB n BrwB, (BN2") rBr tB. The group W is called the Wey1 group of the BN-pair. For any subset S of R, let v, denote the subgroup of W generated by S. Set Ps = B W s B. Then Ps is a subgroup of G and the mapping Sf->P s is a bijection of the lattice of all subsets of R onto the lattice of all sub- groups of G containing B. A subgroup of G is called parabolic if it is conjugate to some P s ' Let A be the set of all left cosets of all sub- groups P s , S c R, ordered by the opposite of the inclusion relation. Let G operate on A on 
13 Ref. Algebraic Groups the left. Let}; be the subset {nPs n E N, S c R} of A, and  be the set of G-translates of };, Then (A, ) is a building whose Weyl group is W, Let G be a reductive algebraic group defined over a field k and P be a minimal parabolic k- subgroup of G containing a maximal k-split torus S and set N = N,(S). Then (P b N k ) is a BN-pair in G k , Therefore there is a building of spherical type associated with the group G. Conversely, the buildings of rank  3 and irreducible spherical type (roughly speaking) all turn out to be associated with simple alge- braic or classical groups (J. Tits [25J), This result gives a complete and unified description of structures that were discovered previously in certain cases. For example, the building of type A, gives a tprojective space (E. Abe, T. Tsuzuku), that of type C n gives a polar space (Veldkamp), and that of a k-form of type E6 gives a Cayley space (J. Tits). As an applica- tion, one can show that a finite building of rank  3 and irreducible spherical type is isomorphic to the building of an tabsolutely simple algebraic group over a finite field. When k is local (i.e., endowed with a com- plete discrete valuation whose residue field is Perfect), the reductive group defined over k has another BN-pair such that the associated building is Euclidean. This theory was ini- tiated by N. Iwahori and H. Matsumoto [26]. who considered split semisimple groups. Later, quasi-split and classical groups were studied by H. Hijikata, and the theory for the general case was given by F. Bruhat and J. Tits [27,28]. To distinguish from the usual BN- pair structure, the subgroups conjugate to B in this case are called Iwabori subgroups, and parabolic subgroups are called parabolic sub- groups. The Euclidean buildings are the "ultra- metric analogs" of the tRiemannian symmetric spaces. In other words, in the study of p-adic simple groups, they playa role similar to that of the symmetric spaces in the theory of simple groups. References [IJ A. Borel, Linear algebraic groups, Ben- jamin, 1969. [2J J. E. Humphreys, Linear algebraic groups, Springer, 1975. [3J M. Demazure and P. Gabriel, Groupes algebriques 1, North-Holland, 1970. [4J M. Demazure and A. Grothendieck, Schemas en groupes, Seminaire de geometrie algebrique, 196331964, Pub\. Math. Inst. RES, 1964 (Lecture notes in math. 153, Springer, 1970), [5J C. Chevalley, Theorie des groupes de Lie, 52 Groupes algebriques, Theoremes generaux sur les algebre de Lie, Actualites Sci. Ind., 1152 and 1226, Hermann, 1968. [6J C. Chevalley, Sur certains groupes simples, T6hoku Math. J., (2) 7 (1955),14-66. [7J C. Cheval ley, Classification des groupes de Lie algebriques, Seminaire C. Chevalley, Inst. H. Poincare, Univ. Paris, 195661958. [8J M. Rosenlicht, Some basic theorems on algebraic groups, Amer. J. Math., 78 (1956), 401-443. [9J M. Rosenlicht, Some rationality questions on algebraic groups, Ann. Mat. Pura App\., 43 (1957),25-50. [ 10J A. Weil, On algebraic groups of trans- formations, Amer. J. Math., 77 (1955),355- 391. [ l1J A. Weil, On algebraic groups and homo- geneous spaces, Amer. J. Math., 77 (1955), 4933512, [ 12J J.-P. Serre, Gr:oupes algebriques et corps de classes, Actualites Sci. Ind., Hermann, 1959. [ 13J J. Dieudonne, Introduction to formal groups, Dekker, 1973. [ 14J N. Bourbaki, Groupes et algebres de Lie, Hermann, 1968, chs. 446. [ 15J J.-P. Serre, Cohomologie galoisienne, Lecture notes in math. 5, Springer, 1964. [ 16J I. Satake, Classification theory of semi- simple algebraic groups, Dekker, 1971. [ 17J J. Tits, Algebraic and abstract simple groups, Ann. Math., (2) 80 (1964),313-329. [ 18J A. Weil, Algebras with involutions and the classical groups, J. Indian Math. Soc., 24 (1960),5899623. [ 19J A. Weil, Adeles and algebraic groups, Lecture notes, Institute for Advanced Study, Princeton, 1961. [20J A. Borel, Introduction aux groupes arith- metiques, Hermann, 1969. [21 J A. Borel and Harish-Chandra, Arithmetic subgroups of algebraic groups, Ann. Math., (2) 75 (1962),4855535. [22J G. D. Mostow and T. Tamagawa, On the compactness of arithmetically defined homo- geneous spaces, Ann. Math., (2) 76 (1962),446- 463, [23J T. Ono, On the Tamagawa number of algebraic tori, Ann. Math., (2) 78 (1963), 44- 73, [24J A. Borel and J. Tits, Groupes reductifs, Pub\. Math. Inst. RES, 27 (1965), 555151. Complements, Pub\. Math. Inst RES, 41 (1972),2533276. [25J J. Tits, Buildings of spherical type and finite BN-pairs, Lecture notes in math. 386, Springer, 1974. [26J N. Iwahori and H. Matsumoto, On some Bruhat decompositions and the structure of p- adic Chevalley groups, Pub\. Math. lnst. RES. 25 (1965),5-48. 
53 [27J H. Hijikata, On the structure of semi- simple algebraic groups over valuation fields I, Japan. J. Math. 1 (1975),2255300. [28] F. Bruhat and J. Tits, Groupes reductifs sur un corps local I. Donnees radicielles valuees, Pub!. Math. Inst. RES, 41 (1972), 55252. II. Schemas en groupes. Existence d'une donnee radicielle valuee, Pub\. Math. Inst. RES, 60 (1984), 5-184. [29J Colloque sur la theorie des groupes alge- briques, C. B, R, M" Brussels, 1962, [30J Algebraic groups and discontinuous subgroups, Amer. Math. Soc. Proc. Symposia in Pure Math., 9, 1966. [31] Automorphic forms, representations and L-functions, Amer. Math. Soc. Proc, Symposia in Pure Math., 33, 1979. [32] T. A. Springer, Linear algebraic groups, Birkhiiuser, 198 1. [33J G. P. Hochschild, Basic theory of alge- braic groups and Lie algebras, Springer, 198 1. 14(V.13) Algebraic Number Fields A. Introduction A complex number that satisfies an algebraic equation with rational integral coefficients is said to be an algebraic number. If the coeffi- cient of the term of highest degree of the equa- tion is 1, this algebraic number is said to be an algebraic integer. The set A of all algebraic numbers is a field which is the talgebraic clo- sure of the rational number Held Q in the complex number Held C. The set 1 of all alge- braic integers is an tintegral domain which contains the integral domain Z of all the ra- tional integers. The tfield of quotients of I is A. B. Principal Order An extension field k of Q of finite degree (which we shall always suppose to be con- tained in C) is said to be an algebraic number field of finite degree, and k is a subfield of A The intersection 0 = k n [ is an integral domain whose field of quotients is k; 0 is called the principal order of k (More generally, a subring R of 0 containing 1 is said to be an order of k if the field of quotients of R is k The set f of all elements y of 0 such that 'rO c R is an ideal of 0; in addition f is called the conductor of R,) Let n be the degree of k over Q. Then the additive group of the principal order 0 of k is a tfree Abelian group of trank n, A tbasis (WI' , w n ) 14 D Algebraic Number Fields of 0 as a free Abelian group (or Z-module) is said to be a minimal basis of 0 (or of k), Let wji) (i = 1, , n) be tconjugate elements of Wj over Q, and let A = IwY)1 be the determinant whose (i,j) entry is wl i ), Then Dk = A 2 is a rational integer that is independent of the choice of a minimal basis of o. Dk is called the discrimi- nant of k If k #Q, then IDkl> 1 (Minkowski's theorem, 1891). For any given rational integer m there are only a finite number of algebraic number fields whose discriminants are equal to m (C. Hermite and H. Minkowski, 1896). The proof of these theorems depends on the methods of geometry of numbers (- 182 Geometry of Numbers). C. Ideals of tbe Principal Order An tideal a of the principal order 0 is said to be an integral ideal of k, In particular, a prime ideal ( # 0) of 0 is called simply a prime ideal of k The domain 0 is not necessarily a tprincipal ideal ring but is always a tDedekind domain. That is, every ideal a of 0 is uniquely ex- pressed (up to the order of the factors) as a finite product of powers of prime ideals of o. This theorem is called the fundamental theo- rem of tbe principal order o. The quotient ring o/a of 0 by an ideal a (#0) of 0 is a finite ring. The number of elements of o/a is called the absolute norm of a and is denoted by N(a). We have N(ab) = N(a)N(b). Every prime ideal p ( # 0) of 0 is a tmaximal ideal of 0, and o/p is a Imite field. Let the tcharacteristic of o/p be p, where p is a prime number. Then o/p is a Imite extension of the tprime field Z/pZ, Let the degree of o/p over Z/pZ be f Then N(p) = pI, and.f is said to be the degree of the prime ideal p. Let s be a complex variable. The (complex- valued) function (k(S)= L I/(N(a))'= n 1/(1-(N(p)(') . P of sEe is called the Dedekind zeta function of k (R. Dedekind, 1871). Here the summation extends over all ideals a of 0, and the product extends over all prime ideals p of o. This series converges absolutely for Re s> 1, and the function (ds) has a single-valued tanalytic continuation to a +meromorphic function on the whole complex plane (- 450 Zeta Functions), D. Units An algebraic integer f of k is said to be a unit of k if f, -I is also an algebraic integer. Hence E is a unit of k if and only if the +Principal ideal 
14 E Algebraic Number Fields (E) is o. The set E. of all units of k forms an Abelian group under multiplication, which is called the unit group of k, The set of all ele- ments of Ek of finite order coincides with the set of all the roots of unity contained in k and forms a cyclic group of a finite order w. Let n be the degree of k over Q. Then for each ele- ment IXE k there are n conjugate elements IX(i) over Q. Let c/') (i = 1, , r 1 ) be real for any IXE k, and let ex('I+j) and ex('I+'2+j) U = 1, . , r 2 ) be pairs of complex conjugates for any exE k, Then we have n = r 1 + 2r 2 . The unit group Ek of k is the direct product of a cyclic group of order w and the free Abelian multiplicative group of trank r = r 1 + r 2 - 1. This theorem is called Dirichlet's unit theorem (1846), A basis (81' , 8,) of this free group is called a system of fundamental units of k. Let [(i) ex = 10glex(i)1 (i = 1,." ,rt), [U)ex == 210glex U )1 (j=r 1 + 1, ' , " rt +r2) for IXEk, For r elements tft', 1/, of Eb I [(1)tft [(1)tf2 [(1 )tf, I R[llI,...,tf,J= [tZ)tfl [tZ)tf2 [(2)tf, [(r)tfl [(')tf2 [(')tf, is called the regulator of (I]t" 1],) (Dedekind). In order for tfl, ..., IJ, to be multiplicatively independent, it is necessary and sufficient that R [lIt, , 1],] # O. The absolute value of R [I]" . , 1],] takes the minimum positive value R for fundamental units (8 1 , . . , 8,). R = R [81> ' E,] is independent of the choice of fundamental units (8 1 , . , 8,) of k, R is called the regulator ofk, In general, R[l]t" tf,JI/R is equal to the index [E k : H] of the group H generated by the roots of unity in k and '11' , Yf,. H. W. Leopoldt conjectured that units in k, which are multiplicatively indepen- dent over Z, remain multiplicatively indepen- dent over Zp (the ring of p-adic integers) when they are considered as elements of the tensor product k (8) Qp over Q. This conjecture was affirmatively proved in some special cases by J. Ax (Illinois J. Math., 9 (1965» and others. If k/Q is a Galois extension, there exists a unit E of k such that the conjugates of E over Q contain r multiplicatively independent units (Minkowski's theorem). E. Ideal Classes An o-module contained in k (i,e., 0(1 c a) such that IXa C 0 holds for some element IX ( # 0) of k is said to be a fractional ideal of k, For two fractional ideals a, b of k the "product" ab defined by {LexifJj (finite sum)lcxjEa, fJjEb} is also a fractional ideal. Thus the set of the 54 fractional ideals of k forms a multiplicative commutative semigroup. For a fractional ideal a the set a -) = {,x E k IXa co} is also a fractional ideal of k, and we have aa -1 = o. Thus the set of all nonzero fractional ideals of k forms an Abelian group 3k under multiplication with o as identity. Each fractional ideal a (#0) is uniquely expressed as a finite product of powers of prime ideals, if we admit negative powers. Namely, 3k is a free Abelian multi- plicative group with the set of all prime ideals as basis. Given fractional ideals a and b, we say that a is divisible by b if a c b; in this case, we call b a divisor of a and a a multiple ofb. Also, a c b if and only if there exists an integral ideal ( such that a = bc. Given fractional ideals a= IJpfi and b=I1qfj (ej#OJi#O), we say that a and b are relatively prime if {p;} and {qj} are disjoint. Usually a fractional ideal of k is simply called an ideal of k, For an element iX (# 0) of k, (cx) = cw is a (fractional) ideal of k, and (ex) is said to be a principal ideal of k. The set P k of all principal ideals (x) (iXEk,iX#O) is a subgroup of 3k' Since (ex) = 0 is equivalent to exE E k , we have P k :;; k* / Eb where k* is the multiplicative group of all nonzero elements of k, Each coset of 3k modulo P k is called an ideal class of k, and the group [k = 3k/P. is called the ideal class group of k, Each ideal class contains an integral ideal a with N(a) 0:( JIQJ (more precisely, with N(a).s;(4/n)'2(n!/nn) JjDJ). From this it follows that <£:k is a finite Abelian group. The order h of [. is called the class number of k, For the calculation of the class number the tresidue at the pole s = 1 of the Dedekind zeta function is used. Namely, Jim (s- l)(ds)=gh, s---tl +0 9 == 2'1 +'2n'2 Rk/WkJjDJ. where Rk is the regulator of k and w. is the number of roots of unity in k (Dedekind, 1877). This formula is used, in particular, for the computation of the class numbers of tquadratic fields and tcyclotomic fields (- 347 Quadratic Fields). The class numbers of cubic and quartic (real) tcyclic fields over Q were computed by H. Hasse in the case where the tconductor of k/Q is less than 100 (Abh, Deutsch, Akad, Wiss, Berlin, 2 (1948». Hasse has also given a detailed computation of the class number of cyclotomic fields [ 15). In general, let the degree n = [k: Q] be fixed and let I Dd-->oo. Then lim(log(h k Rk)/IogJjDJ)= 1. (This formula was proved for n = 2 by C. L. Siegel, 1935, and for general n by R. Brauer, Amer, J. Math" 69 (1947),) 
55 F. Valuations All the tArchimedean and tnon-Archimedean valuations of an algebraic number field k can be obtained as follows (- 439 Valuations): Archimedean Valuations. Let n = [k: Q], and let n conjugates of c(E k be ex(1)" ex(n) such that !X'i) (i = 1, , 1'1) is real, and ex('I+j) and C(('I+,,+j) (j = 1, , r 2 ) are pairs of complex conjugates. W e writrrlexli=lexU)1 (j==I,r l +r,);these are Archimedean valuations of k that are not mutually equivalent. The equivalence classes of these valuations are denoted by p:i), ,..,P\;',I+'2), respectively, and are called the tinfinite prime divisors of k, The first rt infinite prime divisors are called treal and the remain- ing 1'2 are called timaginary (or complex). The valuations of k defined by lexlp,;,=lexlj, j= 1"", r 1 , ==Iexlj, j==rl + 1, .." r 1 +r2' are called tnormal valuations of k, Here Ip' (j=rt + I, "., r l + r 2 ) are valuations in the wider sense. If r t = n we call k a totally real field, and if r, = 0 we call k a totally imaginary field. Non-Archimedean Valuations. Let p be a prime ideal of k and C( an element of k, Let (ex)=p"b, where p and b are relatively prime. Put v,(a) = a. Then for any constant p (0 < p < I), lexlp=p V ,(,) is a non-Archimedean valuation of k, This valuation of k is called the tp-adic valuation of k; p-adic valuations for different prime ideals are mutually inequivalent. The valuation lexlp with p=(N(p))-1 is called a tnormal valuation of k, The equivalence class of valuations con- taining I p is denoted by the same letter p and is called a tfinite prime divisor of k, A formal finite product of powers of finite or infinite prime divisors m* = II pfi is called a divisor of k, If all e i ;? 0, then m* is called an integral divisor of k. Given divisors m* = II pr i and n* = II p{i, we write m* n* if e i  J; (i = 1,2,), Any valuation of k is equivalent to one of the valuations defined previously (A. Ost- rowski, 1918; E. Artin, 1932). For any element C( (#0) of k the tproduct formula I1vliXl, = 1 holds, where p runs over all finite and infinite prime divisors of k and I, are the normal valuations of k, Conversely, let k be a field, and let V = { Iv} be a set of inequivalent valuations of k such that (i) for any a E k (a # 0) I exlp -# 1 holds only for a finite number of p in V; (ii) the product formula IIpliXl p = 1 (!XEk, 14 H Algebraic Number Fields IX #0) holds; and (iii) there is at least one Archimedean valuation in V, Then k is an algebraic number field and Vis the set of all the prime divisors of k (Artin and G. Whaples, Bull, Amer, Math, &c" 5 1 (1945». G. Ideal Classes in the Narrow Sense For a, P E k, the expression a == {3 (mod p\;D means ex U ) [3(i) > 0 for a real infinite prime divi- sor p and exU){J(i) # 0 for an imaginary infi- nite prime divisor p, We call an element IX E k totally positive if all real conjugates !XU) (i == 1,. , r l ) are positive. In the notation just given, this means a == 1 (mod p) (i = 1, , r l ). The set of an principal ideals (a) generated by totally positive elements a E k is a multipli- cative subgroup P: of P k , Each coset of Jk modulo P: is called an ideal class of k in the narrow sense. Let E( be the group of all totally positive units of k, Then we have (3k: P:J = h2'1/(Ek: E:), H. Multiplicative Congruence Let m be an integral ideal of k, and let k*(m) be the multiplicative group of all elements a in k such that (a) is relatively prime to m. Any element c(E k*(m) can be expressed in the form  = PlY such that (J, y E 0 and (fJ), (y) are rela- tively prime to m. Consider an integral divisor m* = m II p\:! which is a formal product of m and intlnite prime divisors p\2 of k, We call In the finite part of m*. Given an element c(E k*(m) and elements P, y E k*(m) n 0 such that a = Ply, we set a == 1 (mod' nt*) if P = y (mod nr), and a == 1 (mod p). The set of all a in k*(m) such that a == 1 (mod" m*) forms a multiplicative group. We write a == P (mod" m*) for a, P E k if exl (J E k*(m) and !X/ {J == 1 (mod x m*), This con- gruence is called the multiplicative congruence. In the following discussion we shall write mod m* for mod" m*. We denote by Jk(m) the group of all ideals of k that are relatively prime to an integral ideal m, and by S(m*) the group of all prin- cipal ideals (a) such that a E k*(m), a == 1 (mod m*); S(m*) is known as the ray modulo m*. Any subgroup H of 3dm) which contains S(m*) is called an ideal group modulo m*, and the factor group 3k(m)/H is called a group of congruence classes of ideals modulo m*, If n* m* for integral divisors m* and n* of k, then 3dm) c 3dn) and S(m*) c S(n*). If H is an ideal group modulo n*, then Q(H) = H n Jdm) is an ideal group modulo m*, and we have Jk(n)/H  s,(m)/@(H). For any given ideal group Ho modulo m* there is a smallest 
141 Algebraic Number Fields integral divisor f* such that f* 111*, and there exists an ideal group H modulo f* with (»(H) = Ho (i.e., if there is an ideal group H'modulo n* with <D(H') = Ho, then f* 11*), We call f* the conductor of the ideal group H The notion of multiplicative congruence is used in tclass field theory and in the theory of tnorm-residue symbols, I. Ideal Tbeory for Relative Extensions If an algebraic number Ileld K has a subfield k, we say that K/k is a relative algebraic number field. Let .0 be the principal order of K. For a (fractional) ideal a of k, Da is an ideal of K. We write Da=E(a) and call E(a) the exten- sion of a to K For ideals a, b of k, we have E(ab)=E(a)E(b) and E(a)nk==a. Let 'Pi: K  C be k-isomorphisms (i = 1,..., n), where n = [K: k]. We write K(i) = Il'.(K) and A(i)=Y;(A)for AEK. For an ideal  of K, l(i) = {A(i) A E} is an ideal of K(I), and Ill(i) is called the conjugate ideal of I in K('). Let L be the composite field of K(1 " , K(n), Then the ideal generated by \1[< 1 1 . " (nl in L is the extension of an ideal a of k We write a = NK/k() and call a the rela- tive norm oflliover k WehaveN K / k (IlI!B) = NK/k()NK/k(!B) and NKidE(a)) = a" (for an ideal a of k). In particular, for k = Q, N KIQ(IlI) = (N (Ill». Let p be a prime ideal of k Then E(p) = 1 2 g in .0, where  1" q are prime ideals of K Let ,t; be the degree of the finite field D/i over o/p Then NK/k(J = pIi, /, is called the relative degree ofi over k, and e i is called the (relative) ramification index of j over k We have the relation n = Lfl eJi between these numbers. If e, = = e g = 1, the prime ideal p is said to be unramified for K/k. Otherwise, p is said to be ramified for K/k, If every prime ideal of k is unramified for K/k, we call K/k an unramified extension. (For an in- finite prime divisor p f of k we write Poc = rU1 ei if the Archimedean valuation I" of k can be extended to g Archimedean valutions 1'I(i) (i = 1, . . . , g) of K, where ej == 2 if <2 is imagiary and Per is real, e i = 1 otherwise,) J. Relative Differents and Relative Discriminants Let K/k be a relative algebraic number field and 0, 8 be the principal orders of k, K, re- spectively, Put 9')1 = {A E K TrK/k( AD) co}, where Tr K / k is the ttrace (- 149 Fields J). Then 9')1 is a (fractional) ideal of K and JJ1-1 = 1\ Kk is an integral ideal of K; 'D K/k is called the rela- 56 tive different of Kover k. When k = Q 1\ is , KiQ simply called the different of K. For L:::J K:::J k, we have the chain theorem: 'Dl.lk = !J L1K !JKlk' Let the conjugates of A E Kover k be A(!), , A"", and assume that A(l)==A. Put DK/k(A) = I172(A _A(i),)) for AE K. If A ED, thenaK/dA)E:DKik :DvK/k is generated by {DKidA)1 AED}. The integral ideal (f(i) gen- erated by {A - A(i) A ED} in the field L = K(!) K(2) K(nl was called an element by D. Hilbert. We also have 1\K/k = (£(21(£(3), li:(n l , The integral ideal b K / k = NK/dT: K/k) of k is called the relative discriminant of K/k, If k = Q, b K1Q = (D K ), For the relative different !J Klk to be divisible by a prime ideal  of K, it is necessary and sufficient that E(p) = el.13' q with e > 1, where p =  n k (Dedekind's discriminant theorem, 1882). Hence a prime ideal p of k is ramilled for K/k if and only if p divides the relative discriminant b K / k ; there are thus only a Ilnite number of prime ideals of k which ramify for K/k, In particular, K/k is unramilled if and only if b K/k = 0, K. Arithmetic of Galois Extensions Let K/k be a relative algebraic number field such that K is a tGalois extension of k of degree n, and let G be the tGalois group of K/k, Let 0, 'D be the principal order of k, K, respectively. The conjugate ideals of an ideal ( of K are given by IlF = {A al A EIlI}(O'E GI, If NK1k(lll) = a, then E(a) = I1 aEG IlIa. For a prime ideal p of K, E(p)=(, 2'" g)", where NK/dJ == p r (i = 1, . . . . g), n = erg, and \1." . . . , \ are mutually conjugate prime ideals of Kover k Hilbert (1894) developed the decomposition theory of a prime ideal p of k for a Galois extension K/k in terms of the Galois group G as follows: Let \ be a prime ideal of D. Then Z=={O'EGI\a==} is a subgroup of the Galois group G of K/k, Z is called the decomposition group of\ over k Let G = Ui Zr j be the left coset decomposition of G. Then \1.\ = \1.r, (i = 1, . , g) are all the conjugate ideals of  over k The subgroup T= {O"EZI An=: A(mod ), A ED} of the decomposition group Z is normal, and T is called the inertia group of  over k The quotient group ZIT is a cyclic group of order .r (the relative degree of \.13), There exists an element (J of Z such that An=A'I(P)(mod). AeD, and (J is uniquely determined mod T; O"T gen- 
57 erates the cyclic group ZIT. This (J is called the Frobenius substitution (or Frobenius automor- phism) of  over k. For m = 1,2, ' v(m)= {lTEZ I A a = A(mod '!3 m + 1 ), A ED} are normal subgroups of Z; the group Vim) is called the mth ramification group of  over k Let V(O)= =VIVI)VIVI+II=_ = VIV z )*...  V(I',-I+l)=... = V(IJ,+I) V(v,+I)=] Let V p = V(vo+l) (p =0, I,..., r), where V o = -1. In particular, V o = VIO) = T The integers VI' u 2 , .. are called the ramification numbers of. The group TIll; is isomorphic to a subgroup of the multiplicative group of the finite field D/'.J3, Hence TIV I is a cyclic group whose order eo is a divisor of N() - 1. The group V m fVm+l (m  1) is isomorphic to a subgroup of the additive group of the finite field D/'tJ. Hence V m lv m + 1 is an Abelian group of type (p, p, , p) whose order divides N(\j3), From e::: TI = (T: VI)IVI it follows that e==e o p!1. (eo,p)= 1. Here GI denotes the order of a finite group G. Hence the decomposition group of  is a tsolvable group. The relation between the ramification numbers for Klk and those for an intermediate Galois extension Flk was com- pletely determined by J. Herbrand (J. Math. Pures Appl" ]0 (1931» [II]. Let J be the \-V-component of the relative different  Klk of a Galois extension Klk. Then r-I l'r d= L (U p +l -vp)(Iv"I-I)== L: (I Vli)I-I). po ]=0 In particular, d = 0 if T= 1, and d = e 1 if VII)=1. Let k z , k T , and kv(m) be the intermediate fields which correspond to the subgroups Z, T, and v(ml, respectively, in the sense of tGalois theory; the fields k z , k 1 , and kv(m) are called the decomposition field, the inertia field, and the mth ramification field of >.p, respectively. Let '.J3 be a prime ideal of K containing P, and let pz and PT be prime ideals in k L and k l such that pz =  n k z and PI == ,t n k T . Then we have E(p)==pzpk 2 ) p) for kzlk; E(pz)=pl' and NkT/kZ(P rJ = p{ for kdk z ; and E(PT)== >.pc for Klk l . If a prime ideal p of k is unramified for a Galois extension K!k then we have E(p) = 1.13 ] 1J32 '!3q, i = '!3'i (i == 1, I g), NN/k( ;) = pI, and IJ = fq. The Frobenius automor- phism O"i: A ai =' A NIP) (mod '!3;) (A ED) for the prime ideal i is uniquely determined, and its order is f. Since 'tJi== \).1\ we have 0", = ri-llT l 'i' Hence (JI ' , (Jq belong to the same tcon- jugate class of G. In particular, if G is an Abelian group, then a, = .. = (Jq and 14 L Algebraic Number Fields Aa'",ANIPI(modp), AED. We then write ilk ) (Jl = \;- ( E G and call this symbol the Artin symbol for p for the Abelian extension Klk. For an ideal a = II p' of k that is relatively prime to the relative discriminant of Klk, we define ( K:k )=IT( K:k } (EG) Evidently, we have ( :k ) = ( K:k ) ( Kt ). The arithmetic of quadratic fields (347 Quadratic Fields) and the arithmetic of cyclo- tomic fields ( Section L) have been developed since the 19th Century. L. Arithmetic of Cyclotomic Fields A complex number ( whose mth power is I but whose m'th power is not 1 for m' <m is called an mth primitive root of unity. There are <p(m) primitive roots of unity: exp(2nirlm) ((r, m) = 1), where ({J is tEuler's function, These q(m) primi- tive roots of unity are the zeros of an irreduc- ible polynomial over Q of degree <p(m}: Fm(X) = IT (xmld Jj"(a!, dim where II is the tMobius function. The coeffi- cient of the highest term of F,(X) is 1, and the other coefficients are all rational integers. F,(X} is called a cyclotomic polynomial. An example is F I2 (X)=(X I2  I)(X 2 -1)/(X 6  1)(X 4 -1) =X 4 _X 2 + 1. The algebraic number field Km = Q((m) (Sm = exp(2nilm)) obtained by adjoining an mth primitive root of unity to Q is a Galois extension over Q of degree rp(m) whose Galois group G is isomorphic to the multiplicative Abelian group of treduced residue classes of Z modulo m:G={lT,I(;;;'=(:, (r,m)= I), Km IS called the mth cyclotomic field. Cyclotomic fields are tAbelian extensions of Q. Conversely, every Abelian extension of Q is a subfie!d of a cyclotomic field (Kronecker's theorem, 18.53, 1877). We can choose (1, Sm, (" , (;;:Im)-I) as a minimal basis of K,. Let In = l' 12 I;t be the decompositions of In in powers of prime numbers 11' ,It, Put K(i)= K l :,. Then Km is the composite field Km= K(I)K(2).., K(i!, The different of K». is given by :DKm'Q = 
14 L Algebraic Number Fields :D K ("/Q!)KI21;Q" . 1\(t)/Q' and the discriminant of Km is given by DKm = D: 1 (11 Dill' (n, = cp(m)/cp(IFi», If m = I h , the discriminant of Km is DKm=Ela (a=IH(hl-h-I»), where /;= -1 if /h=4 or 1= 3 (mod4), and E= 1 other- wise. Hence the discriminant of K is D K = (,;=1 m/IIplmpl/(P-l»)",(m). Suppos; that m m of- 2 (mod 4). Then a prime number p ramifies for Km/Q if and only if p divides m. In par- ticular, if m= I h (m> 2), then only 1 ramifies for Km/Q and (1)== Iq>(ml, N(I) = I (I is explicitly given by 1== (I-i",». For I I- 2 the ramification numbers for I are Vi = Ii 1 (i = 1,2,), and the ramitlcation fields are K" K/2, For 1== 2 the ramification numbers are I, 3, 7, and the ramification fields are Q, K 4 , K 8' In Km/Q a prime number p (p.tm) is decom- posed as (P)==PIP g , N(pJ==pI(i== 1,.... y) and fq = qn(m), Here the degree f of Pi is de- termined as the minimal positive integer f such that pI == 1 (mod mY, Hence the decom- position law of a prime number pin Km/Q is derermined by its residue class modulo m. This is a prototype form of class tleld theory (- 59 Class Field Theory). The class number of the cyclotomic tleld Km can be calculated by Dedekind's formula (- Section E; see also Hilbert [4J). Here we shall give the result for m = I (a prime number). Let r be a +Primitive root modulo /. For ( = exp(2ni/I), we put /;=r.«()= C ; =: r2 Then E is a unit in K,. Define an element (J of the Galois group of K1/Q by (" = (', and put Ei=E at (i=O, 1,...), Then eo, e 1'"'' Gp-I (p = (1- 3)/2) are multiplicatively independent units. That is, the regulator R [eo, E l " ep-lJ = E 1-0, The units /;0, ':1, , " , E p - l are called circular units. The class number 11 of K, is the product of two factors, h = hi h,. Here h 2 is the class number of the real subfield K; = Q(( + C l ). E. E. Kummer called /; 1 the first factor and h 2 the second factor of the class number h. Let X j, Xz, , XI-l be the multiplicative char- acters of the treduced residue classes of Z modulo 1, and let X;(i = 1, , P + 1) be the characters among them such that Xi( - 1) = - 1. Then (_I)P+1 P + 1 ( '-1 ) lEI hi =- (21) P IT L ax;(a) , h z =- R II a1 0 (Kummer, 1850). Here Ro is the regulator of K;. Since circular units belong to K;, the class number h z of K; is equal to the index of the subgroup generated by ::!:: 1, Eo" Ep-l in the group of units of K;, The class number h of K, is equal to 1 for 1< 19 and it has been con- jectured that there exist no more fields K, 58 with h= 1. H. L. Montgomery and K. Uchida solved this conjecture by proving that the first factor hi> 1 for I > 19 (Tohoku Math. J,,23 (1971». J. M. Masley and Montgomery proved that there are precisely 29 distinct Km with class number 1 (J. Reine Angew, Math., 286/287 (1976», According to Kummer, I divides h if and only if I divides h" Since hi can be computed explicitly, we can readily determine whether II h or not (- 145 Fermat's Problem; Ap- pendix B, Table 4.111). A prime I is called regu- lar if Ith, otherwise it is called irregular. Thus an odd prime 1';3 5 is irregular if and only if there exists an even integer j with 2 <j  1- 3 such that I divides the numerator of the jth Bernoulli number Bj' This criterion of Kum- mer can be strengthened as follows. Let S denote the I-Sylow subgroup of the ideal class group <£:K/ of the prime cyclotomic tleld KI = Q((,), Because the Galois group G of Q((,)/Q operates on S, the Abelian group 5 becomes naturally a Z,[G]-module. Choosing the canonical character O:G->Zr defined by (, = ala) for any O"E G, we have a direct decomposition 5 = II::6 5(i), where S(i) = {s E S I sa = sO/(a), 0" E G}. Then it is necessary and sufficient for II (the numerator of B j ) that SU-j) #0 (K. A. Ribet, Inventiones Math., 34 (1976». Moreover, because G is naturally isomorphic to the multiplicative group of the reduced residue classes of Z/IZ, each fJi becomes a tDirichlet character. Let 1 / - 1 . BI,O-i=[ o fJ-/(a)a; then this number, regarded as an element in Qh is equal to a factor of the product (up to :1: 1 and 2) appearing in the class number formula for h" Let m i be the I-exponent of HI ,o-i for each odd i (2 < i < /- 3). Then it is conjectured that the order of S(i) is precisely equal to Imi for each odd i with 2 <i < 1- 3. In particular, if 5' = II j even S(}) is trivial, then this is known to be true. In general, it is also conjectured that Ith z holds for any prime I (Vandiver's conjecture). If the group S(i) is cychc, then S(i l is of order 1 m ; (A. Wiles, I nven- tiones Math., 58 (1980», When a Galois extension K over a finite algebraic number field k has the Galois group isomorphic to the additive group of the I-adic integer ring Zit the extension K/k is called a Z,-extension or r-extension. Then for each integer 11 ';3 0 there exists a unique subfield k n of K with degree In over k. Let k = K, = Q((,) (I> 2), and let K be the union of alll n + 1 th cyclotomic fields K,n+ I (n;? 0). Then K/k is a typical ex- ample of a Z,-extension with subfields k n = Kl"+ '. Let K/k be a Z,-extension with sub- 
59 fields k n , and let Ie. denote the I-component of the class number of k.. Then there exist integers ..1.  0, J.I.  0, v independently of n such that en = }.n + I1l n + V for all sufftciently large numbers n (K. Iwasawa, Bull. Amer. Math. Soc., 65 (1959». These numbers A. {I, v are called the Iwasawa invariants for the Z,-extension Kjk, There exists a unique Zl-extension KU over the rationals Q, An ex- tension Kok over k obtained by taking the composite of Ko and k is called the basic Z,- extension of k or the cyclotomic Zl-extension of k We denote its Iwasawa invariants by A1(k), J.l.l(k), v,(k). It is known that there are Z,- extensions with arbitrarily large J.I.. But it is conjectured that 111(k) = 0 holds for the basic Z,-extension of any k In particular, in the case k = Kl = Q((l) it has been computed that 111(k) = o for 1< 125,000 (for I < 4001 by K. Iwasawa and C. C. Sims, J. Math. Soc. Japan, 18 (1966); for 1< 30,000 by W. Johnson, Math Comp., 29 (1975); for 1< 125,000 by S, Wagstaff, Math Comp., 32 (1978». By applying the theory of uniform distributions to I-adic situations, B. Ferrero and L. C. Washington (Ann. Math, 109 (1978» proved that }ll(k) = 0 when k/Q is Abelian. In particular, when k = K, and k n = R;n+l, en> 0 if and only if the class number of Kl is divisible by I (P. Furtwangler, 1911). Since any quadratic field is a subfield of a cyclotomic field (by a tGaussian sum formula we have Q(j;) c Q«(ldl)' where d is the dis- criminant of Q(j;), the computation of the class number of quadratic fields and the proof of the law of reciprocity for the tLegendre symbol follow from the arithmetic of cyclo- tomic fields. M. Aritbmetic of Kummer Extensions Assume that an algebraic number field k con- tains an nth primitive root of unity. Then a tKummer extension K = k(:fP.) (J.l.E k) is a tcyc1ic extension of k Assume that [K- kI = n. In order that a prime ideal p of k ramify for K/k, it is necessary that p be a divisor appear- ing in (n) or (}l). If pf(n) and vp(}l)O (modn), then p ramifies for K/k. A prime ideal p which is relatively prime to (}l) has the decomposition E(p) =  I n with distinct primes i in Kif and only if the equation II == (n (mod p") is satisfied by some (E 0 for any positive integer m In particular, ifp+(n) and vp(}l) = 0, we have E(p) =  1 n if and only if J.I. == (n (mod p) is solvable in o. If for an element Ii of 0 J.I.:=n(modp) is solvable by some  EO, J.I. is said to be a 14 0 Algebraic Number Fields residue of tbe ntb power modulo p, Assume that p)(n) and v p (J1) = O. Let f be the minimal positive integer such that }If is a residue of the nth power modulo p. Then p is decomposed in K as E(p)=1}31 q, and NK/k(;)=pf (i= 1,... ,g). N. Power-Residue Symbol Let ,. = exp(2nijn) E k, and let p be relatively prime to (n) and () (aE k). Then for some r we have ex(N(p)-1 )/n = G (mod p), and we write G = ( ). This symbol is called the nth power-residue symbol (Kummer). Generalizing this definition, we can define the nth power-residue symbol (ex/b)n for an ideal b of k which is relatively prime to the relative discriminant of k(y0.)/k by using the tArtin symbol « Kjk)jb): (y0.r == (). y0., 0" = ( Kt k ). This symbol satisfies ( blbJ n =( :J ( :J : ( lbex2 ). = ( I ). ( ex b2 ). if all the symbols are well del1ned. In partic- ular, a is a residue of the nth power modulo p if and only if (/P)n = 1. This symbol coincides with the tquadratic residue symbol for /J = 2, k=Q, and p#2. O. Law of Reciprocity for tbe Power-Residue Symbol Several formulas concerning the power-residue symbol are known which are similar to that for the quadratic residue symbol (F. G. M. Eisenstein, Kummer, Furtwangler, Takagi, Artin, Hasse). These can be proved by means of Artin's tgenerallaw ofreciprocity in class field theory ( 59 Class Field Theory). There are many formulas concerning the reciprocity of the power-residue symbol. One of them is as follows: Let n = I be a prime number. Let ex. p E k and assume that (i) rJ. is totally positive; (ii) v,(a) == 0 (mod I) if vp(P)  0 (modI), and vp(P)==O (modi) if vp(C(hO (modi), for any prime ideal p; and (iii) a:= 1 (mod l) and p := 1 (mod( 1 - (,)), Then 
14 P Algebraic Number Fields (} (rl =(, a= TrkQ ( (X 1 [3-1 ) 1-(1 (general law of reciprocity, Hasse, 1924), This result is a generalization of the formula of Eisenstein (1850). If (X(Ek) is totally positive and lJ. == 1 (mod I), then ( (I ) b ( (X-I ) -;, 1=(1, b= Tr k / Q I . If lJ. == 1 (mod I( 1 - (I))' then G} =(, c= Trk/Q C:=IJ ( (X-I ) d=-TrkIQ 1(1-(1) C(I }=(t, (complementary law of reciprocity, Hasse, 1924), P. Norm Residue Let m be an integral divisor of k such that m = TI i pii TI j P (e i > 0) with finite prime divi- sors {p;} and infinite prime divisors {p)}. and let fJ be an element of k that is relatively prime to m. For a relative algebraic num- ber ileld K/k and an element B of K, we set P == N K/k( B) (mod m) if the following two con- ditions are satisfied: (i) fJ == NK/k(B) (mod pii) for every finite Pi and (ii) [3U) > 0 for every infinite prime pljl such that M is real and its extension to K is imaginary. fJE k is then said to be a norm residue modulo nt for K/k if there exists a number B of K such that fJ = NK/k(B) (modm). Let p be a finite prime divisor of k. If fJ is a norm residue modulo pC for a sufficiently large c, then P is a norm residue modulo pe for any e > c. Let c be the smallest such integer (c  0). Then the ideal fp = pC is said to be the p- conductor of norm residue for K/k, If p is un- ramified for K/k, then c = 0; i,e" every fJ E k which is relatively prime to p is a norm residue modulo pe for any e > O. For a ramified p put f p = p'. For a Galois extension K/k, c is not greater than '-1 I Vpl v, V(i)1 L -(Vpl-Vp)= I _ I ' p 0 I V o I i"O V o In particular, for an Abelian extension K/k this value is an integer and is equal to c (Hasse, J. Fac. Sci. Univ. Tokyo, 1934). For example, the I-conductor of the cyclotomic field Klh/Q is I h , We define the pro-conductor for K/k for an infinite prime divisor Px, of k by f Pro == Pn if P XJ is real and its extension to K is imaginary, and f = 1 otherwise (- 257 Local Fields F). Px, 60 Q. Norm Residue Symbol For an Abelian extension K/k, the positive divisor f=rIfp p (where p runs over all finite and infinite prime divisors of k) is caned the conductor of K/k ( 59 Class Field Theory). For a E k (a # 0) take (xo such that (X/(Xo == 1 (mod fp) and iXo == 1 (mod ffpI), and put (cx o ) = pab with b relatively prime to 13, Then b is relatively prime to the relative discriminant b K / k . We define a new symbol by ( (x, :/k ) = ( Kt ) (EG), where ((K/k)/b) is the Artin symbol. This value is independent of the choice of the auxiliary element (Xo. The new symbol is called the norm-residue symbol (Hasse, J. Reine Angew, Math., 162 (1930». In particular, for an infinite real prime divisor pljl of k whose extension lQ for K is imaginary, we have ((X'p:k )== 1 or =0" according to whether the conjugate (XU) is positive or negative, where (J is the automor- phism of K/k induced from the complex con- jugation of the completion C of K with respect to , The norm-residue symbol has the following properties: (1) e(X"p K / k ) = e,:/k ) e,,;/k } (2) if P is unramified for K/k, then ( (X' :/k ) = ( K:k ) -V p ('); (3) in order that rJ. be a norm residue modulo fp for K/k, it is necessary and sufficient that ( ' :/k }- 1:, (4) the product formula for the norm-residue symbol (Hasse) is n ( (X' K/k ) = 1, p P where p runs over all finite and infinite prime divisors of k; and (5) if the domain of the variable (X is the whole k (#0), or the set of all :y such that ((X) is relatively prime to p, or the set of all (X such that (X == 1 (mod p") (v p + 1  m;( v p + I ), then the range of«a, K/k)/p) is the decomposition group Z of p, the inertia group T of p, or the ramification group v" 
61 of p, respectively (Hasse, S. Iyanaga, 1933) (- 257 Local Fields F). R. Hilbert Norm-Residue Symbol The symbol first introduced by Hilbert for quadratic fields can be defined in a general algebraic number field k containing an nth primitive root (n of unity. Let ex, p E k (ex # 0, P f 0), and let p be a prime divisor. Then the following nth root of unity ((a, m/P)n is defined by using the norm-residue symbol: ( / }:I# =(:1#)", O"= C,k(f)/k } This symbol «a, {i)/p). is called the Hilbert norm- residue symbol. It is also called the Hilbert-Hasse norm-residue symbol. For ex. ', P E k, we have 0) ( ex/J } = ( ex/ } ( ex'/ ).: (2) the law of symmetry: ( / } = ejex )n-l; and (3) the product formula for the Hilbert norm-residue symbol: rr ( ex, (3 ) = 1 P P n (Hilbert, Furtwangler, Takagi, Artin, Hasse). For detailed properties concerning the norm- residue symbol, power-residue symbol, and Hilbert norm-residue symbol and for refer- ences for them see Hasse [6]. In general, the problem of obtaining various laws of reciprocity for the power-residue sym- bol is reduced to the one of computing the Hilbert norm-residue symbol explicitly. De- tailed formulas for these symbols are called explicit reciprocity laws; they are treated as a topic in the number theory of local fields (- 257 Local Fields H). S. Density Tbeorem Let M be a set of prime ideals of k If lim 5' -/log 'I+OPEM(N(p)) ( s-1 exists, its value is said to be the density of M. The density of the set of all prime ideals of k is I. Let H be an ideal group modulo an integral divisor m. Then the density of the set of all prime ideals contained in each coset of 3(111) 14 u Algebraic Number Fields modulo H is I /(3( 111): H). In particular, let H be the ray S(m). Then this result implies that each coset of 3(111) modulo S(m) contains intlnitely many prime ideals (a generalization to algebraic number fields of the tprime num- ber theorem for arithmetic progression). Let K/k be a Galois extension, C be a conju- gate class of the Galois group G of K/k, and M(C) be the set of all prime ideals p of k such that the tfrobenius automorphism of each prime factor \.\3i of p in K belongs to C. Then the density of M(C) is I CI/I G I (Cbebotarev's density tbeorem, Math. Ann, 95 (1926». Each element (J of the Galois group G of K/k can be expressed by the permutation z of the conjugate fields K(1\, Kin) of Kover k Let z be expressed as the product of y tcycles of length II, ,fr. Hence n = 11 + + fr. Let C(!I' . ,I,) be the set of all such (J in G, and let M(J" ,I,) be the set of all prime ideals p of k such that p is decomposed in K/k as the product of r prime ideals of K with rela- tive degree II, . .j,. Then the density of M(Jl, ,[,,) is C(Jl,' . . ,fr)I/IGI (Artin, Math, Ann, 89 (1923»). T. Relation to the Arithmetic of Local Fields It is quite useful to investigate the relation between the arithmetic of algebraic number fields and that of local fields. For example, let a prime ideal p of an algebraic number field k be decomposed as E(p)= \11e l 1 meg N ( In ) 'P . 'Pg, K(k 'Pi = pIi (i = 1, ..., g) in an extension K of k, Let K\ and kp be the completion of K and k with respect to \.\3-adic and p-adic valuations, re- spectively. Then we have [K\i: k,J = e j t; and K @K kp  K'1 + + K'l\ (direct sum). The relative different DK/k is gexpressed as (the p-component of DK/d = IIf=1 :DK'j1 (k . For a Galois extension K!k the p-conductr I p = pC for the norm-residue and the conductor pC of local extension Kl/k" have the same exponent c. For a local field K jkp' each norm-residue modulo I" is a norm of an element of K,. Hence precise results concerning the norm- residue in local fields Can be applied Imme- diately to a global field K/k (- 257 Local Fields). We can also apply the method of the idele group of an algebraic number field k, and therefrom we can prove results concerning the ideal class group, unit group, and zeta function of k (-> 6 Adeles and and Ideles). U. History of the Arithmetic of Algebraic Number Fields C. F. Gauss (1832) was the first to generalize the notion of integers to algebraic number 
14 Ref. Algebraic Number Fields fields in considering the elements of Z[J=1J, now called Gaussian integers (Z[J=1J is the principal order of Q(J=1». After investiga- tions by G. L. Dirichlet and Kummer, the notion of ideals was introduced by Dedekind (1871) [2]. L. Kronecker gave another founda- tion for the arithmetic of algebraic number fields (1882) [3]. Dirichlet proved the unit theorem and, introducing the analytic method to num- ber theory, gave the class number formula of quadratic fields ( 347 Quadratic Fields). H. Minkowski first applied the theory of lattice points to number theory (- 182 Geometry of Numbers), and K. Hensel introduced the p- adic method (- 257 Local Fields). Hilbert (1897) [4J and Hasse (\926,1927,1930) [6J summarized the main results on the arithmetic of algebraic number fields known at that time. In particular, Hilbert's report centered around the arithmetic of Galois extensions, and Hasse's around the class field theory obtained by T. Takagi, E. Artin, and H. Hasse (- 59 Class Field Theory). Since c. 1950, when the notions of ideles and adeles were introduced, cohomology-theoretic methods have been successfully applied to number theory ( 6 Adeles and Ideles). Recently various local methods, for example, the Iwasawa theory of Zcextensions, tformal groups, and tp-adic L- functions (T. Kubota and H. W. Leopoldt, J. Reine Angew. Math., 214/215 (1964» have been frequently applied to research in algebraic number fields. References [I J K. F. Gauss, Theoria residuorum bi- quadraticorum 1 (1825), II (1831), Werke 2, Gottingen, 1863,65-148. [2J R. Dedekind, Dirichlet's Vorlesungen iiber Zahlentheorie, fourth edition, 1894, Supple- ment XI (Gesammelte mathematische Werke 3, Braunschweig, 1932) (Chelsea, 1969). [3J L. Kronecker, Grundziige einer arithmeti- schen Theorie der algebraischen Grosse, J. Reine Angew. Math., 92 (1882), 1-122 (Werke 2, Teubner, 1897,237-387) (Chelsea, 1968). [4J D. Hilbert, Die Theorie der algebraischen Zahlkorper, Jber. Deutsch. Math. Verein. 4, (1897), 175-546 (Gesammelte Abhandlungen 1, Springer, 1932, 63-363) (Chelsea, 1967). [5J E. Hecke, V orlesungen iiber die Theorie der algebraischen Zahlen, Akademische Ver- lag, 1923 (Chelsea, 1970). [6J H. Hasse, Bericht iiber neuere Untersu- chungen und Probleme aus der Theorie der algebraischen Zahlkorper, Jber. Deutsch. Math. Verein., 1, 35 (1926), I-55; la, 36 (1927), 233-311; II (1930) (Physica Verlag, 1965). 62 [7J H. Weyl, Algebraic theory of numbers, Ann. Math. Studies, Princeton Univ. Press, 1940, [8J H. Hasse, Zahlentheorie, Akademie- Verlag, third edition, 1969; English translation, Number theory, Springer, 1980. [9J E. Weiss, Algebraic number theory, McGraw-Hill, 1963, [1OJ S. Lang, Algebraic number theory, Addison-Wesley, 1970, [ 1 I J E. Artin, Algebraic numbers and alge- braic functions, Lecture notes, Princeton Univ., 1950-1951 (Gordon & Breach, 1967). [12] S. Iyanaga (ed.), Theory of numbers, North-Holland, 1975. (Original in Japanese, 1969,) [ 13J A. Weil, Basic number theory, Springer, second edition, 1973. [14] J. W. S. Cassels and A. Frohlich (eds.), Algebraic number theory, Academic Press, 1967, [ 15J H. Hasse, Uber die Klassenzahl abelscher Zahlkorper, Akademie-Verlag, 1952. [ 16J Z. 1. Borevich and 1. R. Shafarevich, Number theory, Academic Press, 1966. (Original in Russian, 1964.) [ 17J S. Lang, Cyclotomic fields, Springer, 1978, II, 1980, 15 (VIII.3) Algebraic Surtaces A. Definition An algebraic variety of dimension 2 is called an algebraic surface. In this article, by a sur- face we mean a complete irreducible algebraic surface defined over an algebraically closed field K. B. History The history of algebraic surfaces originated with the study of algebraic functions of two variables. In the case of algebraic functions of one variable, the introduction of tRiemann surfaces attached to such functions played an essential role in the development of the theory. The study of algebraic functions of two vari- ables led naturally to the consideration of the surfaces defined by a suitable polynomial equation. H. Poincare and E. Picard are among those who studied the homological structure of the surface defined by the equation P(x, y, z) = O. The theory of+Abelian integrals (Picard integrals) is one of the consequences of 
63 such topological investigations. S. Lefschetz obtained further results in this direction. M. Noether and geometers of the Italian school, such as F. Enriques, G. Castelnuovo, and F. Severi, studied algebrogeometric prop- erties of algebraic surfaces. In particular, the Italian school geometers recognized the im- portance of irregularity and thoroughly inves- tigated its geometric meaning. In the early 20th Century they succeeded in constructing the great edifice of the theory of algebraic surfaces. Though some of their results lack rigorous proof, efforts to build a foundation for those results have led to the recent devel- opments in algebraic geometry. A signifi- cant contribution to the modernization of the theory was made by O. Zariski and K. Kodaira. The resolution of singularities of an alge- braic surface is one of the most fundamental problems in the field. When the tuniversal domain is the complex number field, function- theoretic methods were used by Italian-school geometers and R. J. Walker (Ann. Math., 36 (1935». Zariski introduced the tvaluation- theoretic method to deal with the problem when the characteristic of the universal do- main is zero. S. Abhyankar (1966) succeeded in resolving the case of positive characteristics. C. Divisors and Linear Systems In what folIows, let S denote a nonsingular surface. S can be embedded into some projec- tive space, Let I be a linear system of divisors on Sand 10' ,j be a basis of the detlning module for I over K. Associated with I: we have a rational mapping It> 1 : 5 --> P" defined by It> j P) = Uo (P), . . . In (P)) for general points P on S. Pullbacks of hyperplanes by It> l' are called variable components of I. Any member of I is a sum of a variable component and a fixed component of 1:. Let S' denote the clo- sure of the image of S by <1>1' If dim S' = 2, a general variable component is irreducible. If dim S' = I. then a general variable component is composed of an algebraic system of dimen- sion 1, which is called an algebraic pencil. These result from Bertini's theorems. For any divisors D and D' on S, the intersection number (or the tKronecker index) I (D' D') is defmed; this number is a symmetric bilinear form such that I(D'D)=I(Dt'D'd for any divisors Dl and D'tlinearly equivalent to D and D', respec- tively. If C is a nonsingular curve on S, I(e- D) coincides with the degree of the restriction e- D of D to C. I(D' D)=(D 2 ) issaidtobethe self-intersection number of O. If D is an ample divisor on S, then (D 2 ) > 0 andI(D' C) > 0 for any irreducible curve C on S. These properties 15D Algebraic Surfaces characterize an ample divisor (Y. Nakai;  16 Algebraic Varieties E). Let I be an irreducible linear system of dimension r(  1), and let C be a generic com- ponent of I:. Let C' be a member of I: different from C. Then the set of C-divisors C' C' forms a linear system of dimension Y - 1 on C. This is called the trace of I: on C and is denoted by TrcI. The trace is, in general, not com- plete. The integer dim ITrcI:l- dim TrcI: = t5(I:) is called the deficiency of 1:. The defi- ciency of the complete linear system IDI is denoted by S (D) . Let x be a point of S, and let (SIX be the local ring of x. Then (!)s = U",€s @x is an talgebraic coherent sheaf, called the tstructure sheaf of S. Let D be a divisor on S. The sheaf of germs of rational functions 1 such that U) + D >-0 is denotedbyG', (0). ThenHU(S,(S's(D)) isadefin- ing module for the complete linear system D I D>O if and only if (9s( - D) is a sheaf of {!;s- ideals. The quotient sheaf (Ds/(I)s( - D) will be denoted by 0,. If D is a tprime divisor, then (l)D is the structure sheaf of the algebraic curve D ( 9 Algebraic Curves). Let 17 be a sheaf on S. We set xiS, g;) = L=o( -l)q dim Hq(S, 17). xiS, (!;s) will be denoted simply by xiS), We call p,(S) = x(S) 1 the aritbmetic genus of the algebraic surface S. Sometimes xiS) is re- ferred to as the arithmetic genus of S. We set Xs(D) = xiS) xiS. @s( -D», The integer p, (0) = 1 - Xs( D) is, by definition, the arithmetic genus of the divisor D. If D is a prime divi- sor, then p, (0) coincides with the arithmetic genus of the algebraic curve, i.e., p, (0) dim HI (D. (!;D)' D. Riemann-Roch Theorem Let S be a nonsingular surface and let K de- note a canonical divisor on S, i.e., K = (w) for some nonzero rational 2-form W on S. If C is a nonsingular irreducible curve on S, (K + C), C becomes a canonical divisor on C; hence, deg((K + C). C) = 2Pa(C) - 2 byacorollaryto the Riemann-Roch theorem on C (9 Alge- braic Curves C). Since I((K + C). C) = deg((K + C). C), it follows that Pa(C) = I((K + C)' C)/2+ 1. Moreover, the formula Pa(D)==I((K + D)' D)/2+ 1 holds for an arbitrary curve D on S; this is called the adjunction formula. For any divisor, wedefinep, (0) to be I (0 '(D +K))/2+1.Then X(S,(i"'(D)= X(5)- Xs( - D)== X(5)+ Par - D)- 1 =I(D'(D- K)/2+ xiS) 
15 E Algeb raic Surfaces This formula is called the Riemann-Rocb theorem on S ( 366 Riemann-Roch Theo- rems C). Applying Serre's duality theorem to D ( 16 Algebraic Varieties E) we have dim Hi(S, (!)(D))=dimH 2 - i (S,(!J(K-D)) for i==O, 1, 2. In particular, dim H 2 (S,(!)(D))=I(K-D), which is called the index of speciality of D; dlln HI(S, (!)(D))=dimHI(S, (()(K -D)) is called the superabundance of D, denoted by hi (D). The inequality I(D)+I(K-D) ) x(S, (()(D)) d(D' (D - K»/2+X(S) is called the Riemann-Roch inequality, where equality holds if and only if h I (D) = O. If D is a curve with s connected components, then I(D+K)=hl( -D)+Pa(D)-s+ x(S). In addition, if hi (0) = 0, then hi ( D) = 0 and thus I(D+K)==Pa(D)-s+ X(S). This is called the Riemann-Roch theorem for the adjoint system. Note that D + KI is called the adjoint system of D. The Noether formula, (K 2 ) + c 2 (S)== 12x(S), is a special case of Hirzebruch's theorem of Riemann-Roch type (- 366 Riemann-Roch Theorems B). Here, C2(S) denotes the second Chern number of S, which coincides with the Euler number of S if K is the field of complex numbers. Let Div(S) denote the group of all divi- sors on S; by linearity we can define the bi- linear form I(D' D') on Div(S)o = Div(S) Q9 zQ. J={DEDiv(S)o I(D' D') =OforallD'} is a subgroup and X = Div(S)o/l is a finite- dimensional vector space over Q, on which the nondegenerate bilinear form I is induced. I has a unique positive eigenvalue; thus the other eigenvalues are all negative. This is called the index theorem of Hodge; it is derived from the Riemann-Roch theorem on S. From this, we inferthatif D 2 = I (D' D) > 0, then I (D' D')2) D 2 . D,2 for any D'. X is said to be the Neron- Severi group of S and dim X is said to be the Picard number of S ( 16 Algebraic Vari- eties P), To study l(mD) as a function of m » 0, 0, Zariski writes an arbitrary effective divisor D as a sum of D(+) and D(-)E Div(S)o with non- negative rational coefficients such that (1) D(+) is arithmeticaliy effective (or, numerically semipositive), i.e., I (D (+). C) > 0 for any curve C on S; (2) D(-)=O or the intersection matrix of the support of D(-) is negative deti'nite; and (3) [(D(+1. D(-)) = O. Such a decomposition is unique and is called the Zariski decomposition of D [7]. If mDI+) is a divisor for some m > 0, then l(mD)=I(mD(+»). 64 E. Invariants of Algebraic Surfaces There are many invariants besides the arith- metic genus discussed earlier. We set h q , p == dim, HP(S, Qq). Then h 2 ,o is equal to the number of linearly independent holomorphic 2-forms; it is called the geometric genus and is usually denoted by P q , Since h O , I gives the maximum among the deficiencies 15(1:) of linear systems on S, it is called the maximal defi- ciency of S. For a divisor C such that hi (C) = 0, we have c5(C)=h o , I, The number h O ,1 was formerly called the irregularity of S, because hO, 1 was considered to be a correction term in the equality ParS) = Pg - h O , 1, The study of higher-dimensional varieties showed, however, that it was unnatural to regard hO, I as a cor- rection term, At present, by irregularity we mean the dimension of the Picard variety of S ( 16 Algebraic Varieties P), and we denote this number by q, When S is defined over the complex number field. we have hp,q = hq,p, In particular, q ;;;;; h O , 1 = hl,o, This number is equal to the num- ber of linearly independent tAbeIian simple integrals of the first kind; it is also equal to one-half the first Betti number of S. In cases with positive characteristic, these equalities do not hold in general. J.-P. Serre gave an example of an algebraic surface S such that h O , I #hl,o, and J. Igusa gave an example such that q <h o . I =hl,o. Let K be a canonical divi- sor of S. The number Pi = l(iK) is called the i- genus, and Pi (i = 2, 3, . ) are generally called p1urigenera. If Pn = 0 and d n, then P d is also zero. The numbers p.(S) = h 2 ,o - h O 1, pg(S) = h 2 ,o,h 1 ,o,hO,1,p;ti=2,3,,,,)(Pl=P q ) are tabsolute invariants of S'; i,e., they take the same values for any nonsingular surface S' that is birationally equivalent to S. However, h1.1 is not an absolute invariant. For a pro- jective plane, all plurigenera p. vanish and the irregularity q = 0, Thus if S is a rational sur- face (- 16 Algebraic Varieties J), i.e., a sur- face which is birationally equivalent to p2, all p. = q = 0, Conversely, any surface with q = P 2 = 0 is a rational surface. This is called Castelnuovo's criterion. A ruled surface is de- fined to be a surface that is birationally eq uiv- alent to a product of the projective line and a curve. All p. of a ruled surface equal ° and any surface with P 4 = P 6 = 0 is a ruled sur- face, This is called the criterion of ruled sur- faces (Enriques), F. Characteristic Linear Systems of Algebraic Families One of the central problems considered by the Italian school was to prove that the irregular- 
65 ity q is equal to the maximal deficiency h O . 1 For that purpose, Severi introduced the notion of characteristic linear systems of algebraic families. Let I: be an irreducible algebraic family of positive divisors on S such that a generic member C of I: is an irreducible non- singular curve, and let r be the dimension of I:, Let ..[1 be a I-dimensional subfamily of I: containing C as a simple member, and let C be a generic member of ..[1' Then the speciali- zation of C' 'C over the specialization C'...... C is a well-defined C-divisor of degree n = I(C C'). The set of C-divisors thus obtained is called the characteristic set. The characteristic set forms an (Y - 1 )-dimensionallinear system and contains Tr, CI as a subfamily. This linear system is called the characteristic linear system of L. For any algebraic family of dimension r, we have r (dim CI + q, In particular, there exists an algebraic family ..[ that contains ooq linear systems and such that for a generic curve C we have hi (C) = O. For such an alge- braic family, we have the equality r = dim CI + q; hence the inequality q ( h O , I follows. Moreover, if the characteristic linear system is complete, we have q = h O , I, The proof of the completeness of characteristic linear systems given by Severi is valid only in some special cases (e,g" the case Pg = 0). For a complex algebraic manifold, a rigorous proof was given later. When the characteristic is positive, the completeness does not hold in general (Igusa); however, for the surface with Pg = 0, the com- pleteness holds (Y. Nakai). The completeness holds if and only if the Picard scheme of S is reduced [ 14 J. G. Birationa1 Transformations of Algebraic Surfaces Let Sand S' denote nonsingular surfaces. If there exists a birational morphism T: S-->S', we say that S dominates S', and we write S  S', In addition, if T is not an isomorphism, we write S > S'. In case there does not exist an S' with S > S', S is said to be relatively minimal. On the other hand, if we have S' > S for any S' which is birationally equivalent to S, S is said to be minimal. Any minimal surface is, by definition, relatively minimal. If a minimal (resp. rela- tively minimal) surface S is birationallyequiva- lent to S', we say that S is a minimal (resp. relatively minimal) model of S' or of the field K(S). A necessary and sufficient condition for S to have a minimal model is that S not be a ruled surface (Castelnuovo and Enriques). Let S be a nonsingular surface and P be a point on S. Replacing P by a projective line p" we have a nonsingular surface S' and a birational morphism T: S'.....S such that T- '(P) 15 H Algebraic Surfaces =E:::;pl andS'-E:::;S-{P} by T T:S'--tS (or T- I in some references) is said to be a locally quadratic transformation. Any bira- tional morphism between nonsingular sur- faces is a composition of locally quadratic transformations and an isomorphism. Given a birational mapping T: S......S' and P on S', T- 1 {P} is called an exceptional curve when- ever it is not a point. Moreover, it is called an exceptional curve of the first kind if T is regu- lar along T- I {P}. Otherwise, it is of the second kind. Exceptional curves consist of irreducible rational curveS. An irreducible curve E is an exceptional curve of the first kind if and only if (£2) = _ 1 and E :;::: Pl. S is relatively minimal if and only if S has no exceptional curves of the first kind; S is minimal if and only if S has no exceptional curves at all, A relatively mini- mal surface that is not minimal is a ruled surface. Such a surface is either p2 or a pl- bundle over a curve. In particular, a relatively minimal rational surface is either p2 or a pl- bundle over P'. Any surface of the latter type is occasionally called a Hirzebruch surface and its (K 2 ) is 8, where K denotes a canonical divisor. However, (K 2 ) of p2 is 9. Define the linear genus of a rational surface to be 10. If S is not a rational surface, taking a relatively minimal surface S', we del1ne the linear genus pO) ofS to be(K 2 )+ 1 ofS'. H. Examples of A1gehraic Surfaces Let Sm denote a nonsingular surface in p3 defined by a homogeneous polynomial of degree m. Let H denote a divisor on Sin in- duced from a (hyper)plane on p3. Then the canonical divisor K is linearly equivalent to (m-4)H, i.e. K-(m-4)H Hence, pg=(m-l) (m 2) (m 3)/6. Moreover, q = 0; if K is the field of complex numbers, Sin is simply con- nected. S2 is isomorphic to the product of two copies of PI, i.e. S2:::; pi X pI; hence, it is a ra- tional surface. S3 is also rational. There exist 27 lines on S3' Contracting 6 mutually disjoint lines among these 27 lines, we obtain a pro- jective plane. Conversely, given 6 points on p2 in general position, by performing locally quadratic transformations with these points as centers we get a cubic surface S3 and a birational morphism T' S3 -->p2, The inverse images by T of the 6 given points, the proper transforms of 15 lines connecting every pair of points chosen from the 6, and 6 conics passing every 5 of the 6 points by T- I , exhaust 27 lines on S,. If m = 4, then K N O. In general, a non- singular surface with K - 0 and q = 0 is said to be a K3 surface (- 72 Complex Manifolds J). K3 surfaces have certain properties similar to Abelian surfaces that are defined to be 2- 
15 Ref 66 Algebraic Surfaces dimensional Abelian varieties ( 3 Abelian Varieties). For K = C, the period mapping defined by integrating regular 2-forms has been studied extensively in connection with moduli theory. In general, surfaces with the bigenus P2 = Pg = 1 and q =0 are birationally equivalent to K3 surfaces; surfaces with P4 = Pq = 1 and q = 2 are birationally equivalent to Abelian surfaces. Every Abelian surface has the involution i defined by i(iX) = -a and the quotient surface by i is a singular surface with 16 ordinary double points. By performing 16 locally qua- dratic transformations successively with these singular points as centers we obtain a K3 surface. Such a K3 surface is called a Kummer surface The original singular surface is also called a Kummer surface. A quartic surface in p3 with 16 double points is an example of a Kummer surface. The theory of birational classification of surfaces was developed by Castelnuovo, En- riques, and others of the Ttalian school. This theory has been extensively enriched and gen- eralized in various ways. Kodaira's theory of analytic surfaces includes classification of algebraic surfaces (72 Complex Manifolds T, J), and classification of surfaces in the poitive- characteristic case has been recently studied in detail [4,5]. When the field K(S) is a sub field of a purely transcendental extension K(X, Y), S is said to be a unirational surface. If the exten- sion K(X, Y)/K(S) is separable, S is a rational surface. However, if it is inseparable, S may become nonrational, a K3 surface, an elliptic surface, or a surface of general type [5,12]. Even for noncomplete surfaces, we have a classification theory [ 10J similar to the previ- ous ones. The following result is one of the applications: whenever S x A'  A 3 , S is iso- morphic to A 2 . Here A" means the affine n- space (M. Miyanishi, T. Sugie, T. Fujita; [7,9,13]). Let I be the ring of integers of a real qua- dratic field of discriminant d The Hilbert modular group G = SL(2, I)/{ ::!: I} acts on the product jf2, jf being the complex upper half- plane. The normal complex space £2/G can be compactified by adding a finite number of points and thus a compact norfhal surface is obtained. Resolving these singularities in the canonical minimal way, we have a nonsingular surface Y( d) over C, which is called the Hilbert modular surface with discriminant d Y( d) is simply connected; hence, q( Y(dJ) = O. If d = 5, 8, 12, 13, 17,21,24,28,33, 60, then Y(d) is a rational surface. If d = 29,37,40,41,44,56,57, 69, 105, then Y(d) is birationally equivalent to a K3 surface, If d = 53, 61, 65, 73, 76, 77, 85, 88, 92,93, 120, 140, 165, then Y(d) is an elliptic surface with K( Y(d)) = 1. Otherwise, it is a surface of general type ( [ 19,201 in 72 Com- plex Manifolds). For a-cycles on a surface -" 16 Algebraic Varieties J. References [IJ M. Artin, Some numerical criteria for contractibility of curves on algebraic surfaces, Amer. J. Math., 84 (1962),4855496. [2] H. F. Baker, Introduction to the theory of algebraic surfaces and higher loci, Principles of Geometry 6, Cambridge Univ. Press, 1933. [3] A. Beauville, Surfaces algebriques com- plexes, C.I.M.E. (1977), Asterique 54 (1978). [4J E. Bombieri and D. Husemoller, Classifi- cation and embeddings of surfaces, Algebraic Geometry, Arcata, 1974, Amer. Math. Soc. Proc. Symp. Pure Math., 29 (1975),3299420. [5J E. Bombieri and D. Mumford, Enriques classification of surfaces in characteristic p: II, Complex Analysis and Algebraic Geometry (In Honour of Kodaira), Iwanami Shoten and Cambridge Univ. Press (1977), 23342, III, Invent Math., 35 (1976),1977232. [6J F. Enriques, Le superficie algebriche, Zani- chelli, 1949, [7J T. Fujita, On Zariski problem, Proc. Japan Acad., 55 (1979), 106-110. [8J H. W. E. Jung, Algebraische Fhichen, Helwingsche, 1925. [9J T. Kambayashi, On Fujita's strong can- cellation theorem for the affine plane, J. Fac. Sci. Univ. of Tokyo, 27 (1980), 5355548. [10] Y. Kawamata, On the classification of non-complete algebraic surfaces, Proc. Alge- braic Geometry, Copenhagen, 1978, Lecture notes in math. 732, Springer, 1979,215-232. [IIJ K. Kodaira, Collected Works III, Iwa- nami. 1975. [ 12J M. Miyanishi, Unirational quasi-elliptic surfaces, Japan. J. Math., 3 (1977),395-416. [13] M. Miyanishi and T. Sugie, Affine sur- faces containing cylinderlike open sets, J. Math. Kyoto, 20 (1980),11-42. [ 14J D. Mumford, Lectures on curves on an algebraic surface, Ann. Math. Studies, Prin- ceton Univ. Press, 1966. [ 15J D. Mumford, Enriques' classification of surfaces in char p: I, Global Analysis (In Honor of Kodaira), Univ. of Tokyo Press and Princeton Univ. Press, 1969. [16J M. Nagata, On rational surfaces T, 11, Mem. Coli. Sci. Univ. Kyoto, (A) 32 (1960), 351-370; 33 (1961),271-293. [ 17J E. Pascal, Repertorium der h6heren Mathematik, II" Teubner, second edition, 1922, [ 1 8] E. Picard and G. Simart, Theorie des 
67 fonctions algebriques de deux variables inde- pendentes 1, II, Gauthier-Villars, 1897, 1906. [ 19J A. N. Rudakov and I. R. Safarevich, Inseparable morphisms of algebraic surfaces, Math. USSR-I-V., 10 (1976), 1205-1237. [20J 1. R. Safarevich, Algebraic surfaces, Proc. Steklov Inst. Math., Amer. Math. Soc., 1967. [21J O. Zariski, Collected Papers II, III, MIT Press, 1973, 1978, [22J O. Zariski, Algebraic surfaces, Erg. Math., Springer, second edition, 1971. 16 (VII1.4) Algebraic Varieties A. Affine Algebraic Varieties and Projective Algebraic Varieties Fix a field k. A subset of the n-dimensional taffme space k" over k is called an affine alge- braic variety (or simply affine variety) if it can be expressed as the set of the common zeros of a (finite or intlnite) set of polynomials Fi(X I . , X,) with coefficients in k. Similarly, a subset of the n-dimensional tprojective space P"(k) over k that can be expressed as the set of the common zeros of a set of homogeneous polynomials G j ( Y" . , Y,,) is called a projective algebraic variety (or simply projective variety). In this section, variety means either an affine or projective variety. A variety which is a subset of another variety is called a subvariety. These varieties are the forerunners of the modem, more general versions of algebraic varieties, which we will discuss later. When V is an affine variety in k", the set of the polynomials in k[XJ = k[X I , ,XnJ that vanish at every point of V form an ideal I(V) of k[X]. The residue class ring A, = k[XJ/I( V) is called the coordinate ring (or affine ring) of V. We can regard A, as the ring of k-valued functions on V that can be expressed as poly- nomials of the coordinates of k". When Vis a projective variety, the thomogeneous ideal generated by the homogeneous polynomials in k[ YJ = k[ Y" . , y"J that vanish at every point of V is denoted by I(V), and the ring A, = k [ y]/ I ( V) is called the bomogeneous coordi- nate ring of V. A variety Vis said to be reducible or irreduc- ible according as it is the union of two proper subvarieties or not. A maximal irreducible subvariety of V is called an irreducible compo- nent of V. Any variety can be written uniquely as the union of a finIte number of irreducible components. A variety V is irreducible if and only if I( V) is a tprime ideal. When that is the case, the field of quotients of A, (when Vis affine) or the su bfield of the field of quotients 16 A Algebraic Varieties of A, consisting of the homogeneous elements of degree 0 (when V is projective) is called the function field of V and is denoted by k(V). Elements of k( V) are called rational functions (or simply functions) on the variety V. The field k(V) is tfinitely generated over k. The tran- scendence degree of k(V) over k is called the dimension of V. When V is reducible, the max- imum of the dimensions of its irreducible components is called its dimension. If W is a subvariety of an irreducible variety V. then dim V - dim W is called the codimension of W on V. A subvariety of pure codimension 1 of an affine or projective space can be defined by a single equation and is called a hypersurface. If the ideal I(V) of a variety V of dimension r in a projective space P"(k) is generated by n-r homogeneous polynomials, then Vis called a complete intersection. Compared with general varieties, complete intersections have some simpler properties. On the other hand, many important varieties are not complete intersec- tions, e.g., t Abelian varieties of dimension  2. The intersections and finite unions of sub- varieties on a variety V are also subvarieties. Thus the subvarieties can be taken as the tsystem of closed sets of a topology on V (- 426 Topology), which is called the Zariski topology of the variety V. When k is the field of complex numbers, V can be viewed as an tanalytic space, and the topology of Vas such (the "usual" topology) is much stronger than the Zariski topology. For the rest of this arti- cle, varieties will be considered as having Zar- iski topologies unless stated otherwise. Terms such as Zariski open, Zariski closed, and Zar- iski dense are used to mean open, closed, or dense in a Zariski topology. Suppose a con- dition (P) concerning the points of an irreduc- ible variety V (concerning the elements of a set M parametrized by the points of V) is satisfied in a nonempty Zariski open set of V. Then we say that the condition (P) holds at almost all points of the variety Vor at general points of V (almost all elements of the set M). Let U and V be affine varieties in k" and k m , respectively. Then the product set U x Vis an affine variety in kn+m and is called the product algebraic variety (or simply the product) of U and V. Note that the Zariski topology on U x Vis stronger than the product of the topologies of U and V. When k is talgebrai- cally closed. U x Vis irreducible if U and V are irreducible. Suppose that k is algebraically closed. Let  be a tprime ideal of k[XJ = k[X 1 , ,X.J, and let V be the affine variety in k n detlned as the zero points of \.13, Then I(V) = \.13 (tHilbert zero point theorem) (- 369 Rings of Polynomials D). Therefore there exists a one-to-one COrre- spondence between the set of prime ideals of 
16 B A1geb raie Varieties k[XJ and the set of irreducible varieties in k". In particular, the tmaximal ideals correspond to the points of k". Similarly, there exists a one-to-one correspondence between the set of homogeneous prime ideals of k[ Y] other than L=o Y;k[ Y] and the set of irreducible sub- varieties in P"(k). When we deal with nonlinear algebraic equations, we cannot expect a-simple, c1earcut theory without assuming that k is algebraically closed. Hence we take an algebraically closed field K containing k and regard a variety V in k" as a subset of the variety V K in K n defined by the same equations. From now on, we suppose that k is algebraically closed. If the ideal I(V) of k[XJ or k[ Y] determined by a variety Vis generated by polynomials with coefficients in a subfield k' of k, we say that V is defined over k' or that k' is a field of defi- nition for V. Any variety has the smallest field of definition, which is finitely generated over the prime field. In the theory of A. Weil [92J, we fix an algebraically closed field K that has an infinite transcendence degree over the prime field. This K is called the universal domain. A point of Vis called a k'-rational point of V if all of its coordinates belong to a subfield k' of K. Let Kt> K 2 be two extension fields of a field L, and let (x)EK, (Y)EK. We say (y) is a specialization of the point (x) over L (notation: (x)7(Y» if all polynomials j(X)EL[X1,....X,] satisfying f(x) = 0 also satisfy f(y) = 0; in other words, if there exists a homomorphism of L-algebras L [x l' ,xnJ ...... L [y l' . , y,,] mapping Xi to y;. Let K be the universal domain, Van irreducible variety in K n , and k' ( c K) a field of definition for V having a finite transcendence degree over the prime field. Then there exists a point (x) of V such that all points of V are specializations of (x) over k'. Such a point (x) (in general not uniquely determined) is called a generic point of V over k'. The ring k'[x J is isomorphic to k,[XJII( V) n k,[XJ over k'. (Some authors use the term generic point to mean almost all points as defined earlier.) B. Local Rings Let V be an affine variety and let W be an irreducible subvariety of V. Let '13w be the subset of A, consisting of the elements that vanish identically on W. Then '13w is a prime ideal of A,. The ring of tquotients of A, with respect to '13w is denoted by Dv, w or by :Dw and is called the local ring of W on V (or of V at W). Suppose for simplicity that Vis irre- ducible. Then 'Dw is the subring {JIg If, g E Av, g \.j3v} of k(V), and the tresidue field of 'Dw modulo the maximal ideal \.j3w:Dw can be 68 identilled with k( W). When a function rp on V (qJ E k( V)) belongs to :Dw, it is said to be regular at W. For a given function <p E k( V), the set of the points of V where rp is regular is Zariski open. In the case of a projective variety, the local ring Dv, w is defined as the subring of the ring of quotients of A, with respect to '13w, consisting of the homogeneous elements of degree O. A mapping from an open set U of a variety V to k that is regular at every point of U is called a regular function on V. The ring of regular functions on V is denoted by A,. By assigning A, to each open set V, we can define a tsheaf of rings (Dv on V, of which the tstalk (Cv,x at a point XE V is the local ring Dv,x. The sheaf C7 v is called the sheaf of germs of regular functions on V (or the structure sheaf of V) (- 383 Sheaves H). C. General Definition Consider a pair (V, 0) of a topological space V and a sheaf (!! of germs of mappings from V to k. If V has a finite open covering (Vi) such that each (Vi> C71 VJ is isomorphic to an af- fine variety V, (in the sense that there exists a homeomorphism from Vi to V, that transforms CDI Vi to the structure sheaf of v,), the pair (V, @) is called a prea1gebraic variety over k, and (f) is called its structure sheaf. Usually (V, @) is denoted simply by V. A regular mapping between prealgebraic varieties is detlned as a continuous mapping g: V--> V' satisfying <pogE(()v,x for any XE V and qJE (riv'.g(X)' Furthermore, if g is a homeo- morphism and g-l is also regular, then 9 is called an isomorphism or a hiregular map- ping. The Cartesian product X x Y of pre- algebraic varieties X and Y is locally a prod- uct of affine varieties. Therefore X x Y has the structure of a prealgebraic variety. A pre- algebraic variety X is called an algebraic va- riety if the image of the diagonal mapping X--> X x X is closed in the Zariski topology of the product variety X x X ("separation con- dition"). (This definition is due to J.-P. Serre [81].) The separation condition corresponds to tHausdorff's separation axiom. If W is a locally closed subset (i.e, the intersection of an open set and a closed set) of an algebraic variety V, then it becomes an algebraic variety in a natural manner (the germs of regular functions at PE Ware taken to be the germs of functions induced on W by the functions in (!!v.p). Locally closed subvarieties of k n or P"(k) are called quasi-affine or quasiprojective alge- braic varieties, respectively, Definitions of irreducibility and local rings for general alge- G:'aic varieties are given in the same manner 
69 as before. In this article, algebraic varieties will often be referred to simply as varieties. The notion of an irreducible algebraic variety was developed from that of abstract algebraic variety (or simply abstract variety) defined by Weil. D. Scbemes The set of prime ideals ( =1= (1» of a commuta- tive ring A with unity element 1 is denoted by Spec(A) and is called the spectrum of A. For any subset a of A, we denote by V(a) the set of the prime ideals containing a. We define a topology on Spec(A) in which the closed sets are V(a). This, again, is called the Zariski topology ofSpec(A), For an elementf of A, the open set D(f)==Spec(A)- V(f) is called an elementary open set. The elementary open sets form a base of open sets in the Zariski topol- ogy of Spec(A), The set of closed points is nothing but the set of maximal ideals of A Assigning to each point  of Spec (A) the tring of quotients A" we obtain a sheaf of rings A on Spec(A). We have the equality f(D(f), A)= Af, where Ar is the tring of quotients by the multiplicative system {In n  OJ. In particular, we have f(Spec(A), A) = A. Regarded as a t1ocal-ringed space with A as the structure sheaf, Spec(A) is called an affine scbeme. A local-ringed space X which is locally isomorphic to an affine scheme is called a scbeme. A morphism of schemes is, by defi- nition, a tmorphism between them as local- ringed spaces, Thus we obtain a tcategory whose objects are schemes. We denote it by (Sch). Giving a morphism f:X ......Spec(A) is equivalent to giving a ring homomorphism fer): A...... f(X, 0,). Hence the category of affine schemes (which is a tfull subcategory of (Sch» is contravariantly equivalent to the category of commutative rings. If there is given a morphism of schemes f: X -->S, X is said to be an S-scheme or a scheme over S, and f is called the structure morphism and S the base scheme. For two S-schemes f: X -->S, g: Y -->S, a morphism of S-schemes is defined to be a morphism of schemes h: X...... Y with f = go h. Thus we obtain the category of S-schemes denoted by (SchjS). Spec(Z) is the unique tfinal object in (Sch), hence (Sch) is nothing but (Sch/Spec(Z». The tfiber product always exists in (Sch). In fact in the case of affine S-schemes X = Spec(B) and Y= Spec(C) with S = Spec(A) we have X x ,Y = Spec(B Q9A C), and in the general case we construct X x s Y by patching together fiber products of affine schemes. A morphism f: X -->S is called separated if the image of the diagonal morphism XIS : 16 D Algebraic Varieties X......X x sX is closed. We also say that X is separated over S or X is a separated S-scheme. A scheme X is said to be separated if it is separated over Spec(Z), All affine schemes are separated. When K is a field, Spec(K) is a space having only one point and equipped with K as the stalk of the structure sheaf. For a point x of a scheme X, denote by k(x) the residue field of (Yx,x' For f E (()x,x we caB the residue class off in k(x) the value off at x, denoted by f(x). We have a natural morphism i,: Spec(k(x))......X whose image is {x}. More generally, we call a morphism i of a spectrum Spec(K) of a field K to X a point of X with values in K Such a point is determined by a point x in X and an embedding of k(x) in K A point of X with values in an algebraically closed field is called a geometric point. For a morphism f: X ...... S and a point s in S, the fiber product X x s Spec(k(s» is called the fiber off over sand denoted by r'(s). For a geometric point Spec(K)-->S, X x sSpec(K) is called a geo- metric tiber. A scheme X is called reduced if the local ring at each point of X has no tnilpotent elements. A scheme is said to be irreducible if its underly- ing topological space is not a union of two proper closed subsets. A scheme is called in- tegral if it is reduced and irreducible. Every local ring of an integral scheme is an tintegral domain. If a scheme X has an affine open covering { U i = Spec(AJ} such that every Ai is a tNoetherian ring, X is said to be locally Noetherian. A locally Noetherian scheme is called Noetherian if its underlying topological space is +( quasi- )Compact. A morphism f: X...... Y = Spec(A) is said to be locally of finite type (of finite type) if X has an open affine covering (a finite open affme cover- ing) { U i = Spec(A;)} such that each Ai is a finitely generated A-algebra. A general mor- phism f: X ...... Y is said to be locally of finite type (of finite type) if there is an open affine covering {Vi} of Y such that every restriction of f:FI(J;;)...... J;;is locally of finite type (of finite type), If f: X --> Y is (locally) of finite type we say that X is (locally) of finite type over Y. A scheme of finite type over a field K (i.e" over Spec( K)) is called an algebraic scheme over K There is a tnatural equivalence of categories between the category of reduced separated algebraic K-schemes (as a full sub- category of (SchjK)) and the category of alge- braic varieties over K (defined in Section C) equipped with regular mappings as mor- phisms. Hence we identify these categories from now on. Occasionally, algebraic vari- ety means irreducible variety. Nonalgebraic schemes are also important as tools for the 
16 E Algeb raic Varieties study of algebraic varieties. For example, for a point x in a scheme X there is a canonical monomorphismj,: Spec(CDx.xJ......X by which the unique closed point of Spec«(f)x,.J is map- ped to x. If for two algebraic K-schemes X, Y and for two points XEX, YE Y there is a K- isomorphism (Dx,x  (9y,y, then suitable neigh- borhoods of x and y are isomorphic over K. Many concepts concerning varieties, e.g., dimension, generic points, specialization, can be naturally extended to the case of schemes by virtue of commutative ring theory. A morphism of schemes f: X --> Y is called proper if it satisfies the following two con- ditions: (1) f is separated and of finite type, (2) for every scheme T and for every morphism T -+ Y, the morphism X x y T --> T obtained from f by the "change of base" is a closed mapping. We also say that X is a proper Y-scheme or X is proper over Y. A proper algebraic K-scheme is called complete. A projective variety is com- plete, while an affine variety over K is com- plete only when it is of dimension zero. Every algebraic variety can be embedded in a com- plete variety (M. Nagata [63J). A morphism of schemes f: X...... Y is called affine if every inverse image by f of an open affine subset of Y is again an affine scheme. A morphism of schemes f: X...... Y is called finite if it is of finite type and there is an affine open covering { Vi = Spec(A;)} of Y such that f -l( UJ = Spec(B;), where B i is tintegral over Ai' For a locally Noetherian scheme Y and a morphism of schemes f: X...... Y the following three conditions are equivalent: (i) f is finite; (ii) f is affine and proper; (iii) f is proper and every fiber off is a finite set. For a finite sur- jective morphism of Noetherian schemes f: X --> Y, X is an affine scheme if and only if Y is an affine scheme. A morphism of schemesf: X...... Y is said to be flat if for each point x EX, CDx,x is a tflat ((Iy,j(x)-module. If, moreover, fis surjective, then f is called faithfully flat. Assume that g: Y' + Y is a faithfully flat morphism of finite type of locally Noetherian schemes and f: X ...... Y is a morphism of schemes. Then for many important properties of morphisms, f has these properties if and only if the pull-back if' : X x !' Y' + Y' has the same properties (theory of descent [29, 30J). E. Cohomology Theory Let (X, 0) be a ringed space. An (D-Module (i.e., a sheaf of a-modules) F is said to be quasicoherent if for each point x of X there exist a neighborhood U of x and an texact sequence M......N -->F!u......O, where M and N are free (DIu-Modules, An ((i-Module F is said to 70 be of finite type if F is locally generated by a finite number of sections over ((I; F is of finite presentation if, locally, there exists an exact se- quence (9P-->(Dq......F -->0 where p and q are posi- tive integers (they need not be globally con- stant); F is coherent if(i) F is of finite type and (ii) the kernel of any homomorphism 01';;...... Flu (where n is an arbitrary positive integer, and V is an open set) is of finite type. Obviously, if F is coherent, then F is of finite presentation, which implies that F is quasicoherenet and of finite type. In the category of U -Modules, the full subcategory of coherent sheaves is closed under almost all operations of sheaves. If f!J itself is coherent as an a-Module, (!) is said to be a coherent sheaf of rings. In this case every CD-Module of finite presentation is coherent. The structure sheaf of a locally Noetherian scheme is a coherent sheaf of rings. On a lo- cally Noetherian scheme X, every quasicoher- ent sub-@,-Module or quotient @,-Module of a coherent (Dx-Module is coherent. A coherent (lJv-Module on an algebraic variety V is called a coherent algebraic sheaf. Let X = Spec(A) be an affine scheme. Then every quasicoherent (Dx-Module F on X is generated by its global sections. The corre- spondence F ......f(X, F) defines an equivalence beeween the category of quasicoherent sheaves on X and the category of A-modules; if A is Noetherian, then the coherent sheaves and the finite A-modules correspond to each other under this equivalence, Let X be a separated scheme, and U = {U A } an affine open covering of X. For each quasi- coherent @,-Module F, the cohomology group Hq(X, F) is canonically isomorphic to the tCech cohomology Hq(U, F) (- 383 Sheaves F). If X is of dimension d, then Hq(X, F) =0 for every sheaf F of Abelian groups on X and q>d. For a scheme X we define the cohomo10gica1 dimension cd (X) to be the largest integer q such that Hq(X, F) #0 for a quasicoherent CD x - Module F on X [35]. The cohomological dimension cd (X) does not exceed the dimen- sion of X. If X is an affine scheme, then cd (X) = O. The converse is true under the assump- tion that X is Noetherian (Serre's criterion [29, III]). For an algebraic scheme X of dimen- sion n, cd(X) = n if and only if X is complete (S. L. Kleiman). Let f: X ...... Y be a proper morphism of Noetherian schemes. Then for every coherent CDx-Module F and for every q  0, R'1*(F) (->. 200 Homological Algebra) is also coherent. In the special case ofY = Spec(k) with a field k this means that for an algebraic coherent sheaf F on a complete variety X the cohomology group Hq(X, F) is a finite-dimensional vector space over k. 
71 Let X be a scheme over k and let F be a locally free ('\-Module of rank n (i.e., an @x- Module which is locally isomorphic to @;). If we take an open covering {U i } of X and iso- morphisms <Pi: F!u,--:+0;lu" then <Pi 0 <pj-l defines a morphism gij: uin Uj-->GL(n, k), which is called the coordinate transformation of F. If we construct a tvector bundle B on X by the same coordinate transformations gij' then F can be regarded as the sheaf B(B) of germs of sections of B. By means of the canonical homomor- phism GL(n, k)...... PGL(n -1, k) we Can con- struct a projective bundle P(F) on X (which is said to be associated with F). (Note that in [29J, P(F) is defined to be a projective bundle with coordinate transformations 'gijl, i.e., as- sociated with the dual of F in our sense.) This procedure of associating P(F) with a locally free l1\-Module F can be generalized for any quasicoherent Os-Module of finite type on an arbitrary scheme S. A closed (locally closed) S-subscheme f: X...... S of p: P(F)......S is called a projective scheme (quasiprojective scheme) over S, or f is said to be a projective morphism (quasiprojective mor- phism). A projective morphism is proper. A reduced projective scheme over a field k is nothing but a projective variety over k. We can develop the theory of projective schemes by means of tgraded rings in a way similar to affine schemes. A locally free l1\-Module ofrank 1 is called an invertiWe sheaf. Invertible sheaves corre- spond to line bundles up to isomorphisms. Let p = PN(k) be a projective space, (y" y" . . , YN) a system of homogeneous coordinates of P, and Vi the open subset of P defined by Yi f O. Denote by (D(n) the invertible sheaf on P de- fined by the coordinate transformation gij= (yjYi)". More generally, let p: P = P(F)......S be the projective bundle associated with a locally free Us-Module F ofrank N + 1 on a scheme S. Then there is an invertible sheaf O(n) = 0(1 )lNn on P with the properties: (i) for each SES its restriction to the fiber p-l(S) = pN(k(s)) is (D(n) detlned above; (ii) p*«(I)(n)) = sn(F) for n > 0 where S" (F) denotes the nth symmetric product of F The invertible sheaf @(1) is called the tautological line hundle on P. (Note that (!I( 1) in the sense of [29J is (D( -1) in our sense, but since the definition of P(E) is also differ- ent, the above property (ii) holds without modification. ) For a quasiprojective S-scheme f: X...... P(E) -->S, the restriction of (II( 1) to X is denoted by @x( I) (or simply lD( I», An invertible sheaf L on X is called very ample over S if there exist a locally free (Ds-Module of finite type Eon S and an S-immersion i: X -->P(F) such that (Dx( 1) = L; L is called relatively ample over S or S-ample if L On is very ample over S for a cer- 16 E A1geb raic Varieties tain n > 0 (- Section N). When S is an affine scheme, an ample (very ample) sheaf over S is simply called ample (very ample). There is the following cohomological criterion of ample- ness (generalized Serre's theorem). Let Y be a Noetherian scheme, f: X --> Y a proper morphism, and L an invertible (Dx- Module. Then the following four conditions are equivalent: (i) L is f-ample; (ii) for each coherent @,-Module F there is an integer N such that R'1*(F @ LINn) = 0 for all n N and q> 0; (iii) for each coherent sheaf of ideals .f of (Dx there is an integer N such that Rlf*(J @ L0 n )=0 forallnN They imply the condition (iv): for each coherent {()x- Module F there is an integer N such that the canonical homomorphisms f*f*(F @ L 0 n )--> F @ LOn are surjective for all n N Let X be a scheme proper over a field k (such an X is called a k-comp1ete scheme), and let F be a coherent Q-module. Then all Hq(X, F) are finite-dimensional k-vector spaces (finiteness theorem), and X(F) = L( -l)q dim W(X, F) is a finite number, called the Euler-Poincare characteristic of F over X. For every invertible sheaf L, X(F @ L 0 m ) is a polynomial in m, which is said to be the Snapper polynomial [48]. Supp F, de- fined to be the set {x EX Fx f OJ, is a closed subset of X and is called the support of F The degree of the polynomial X(F @ L om) is at most y = dim supp F, and we have X(F @ L om) = e' m' /r! + lower terms in m for some e E Z. e is said to be the intersection number of L' with F [48]. When W is a closed sub scheme of X defined by an @x-ideal I, (L" W) is defined to be the intersection number of L' with @x/J, where r = dim W. If L is ample, then (L' ' W) > o for any W. In particular, when W= X, it follows that (L") > 0, where n = dim X. The converse of this fact is the Nakai-Moishezon criterion, saying that if(L' W) > 0 for any closed subvariety W of X where r = dim W, then L is ample [56,64]. When L is ample, Hq(X, F @ L 0 m ) = 0 for any q > 0 and for sufficiently large m; hence X(F @ L0 m ) = dim HO(X, F @ L0 m ) and the Snapper polynomial X(F @ L0 m ) is also called the Hilbert polynomial or the Hilbert charac- teristic function of F.1f L is the invertible sheaf detlned by a hyperplane section of X in pt, i.e., L = (Dx(1), then X(L0 m ) = dim HO(X, L0 m )= dim Rm for sufficiently large m where Ef)Rm denotes the homogeneous coordinate ring of Xin Pt'. In general, for a complete irreducible variety V of dimension Y, we put , X(V)=X((Dv)= L (-I)qh O ,q q=O (h o . q = dim Hq( V, (]iv)) and call it the arithmetic 
16 F Algebraic Varieties genus ofV. Classically, the number p,(V) de- fined by Pa(V)  (-1 nX(V) - 1)  h O ,' - hO,'-1 +. . I h O , I was called the arithmetic genus of V, instead of X( V). When V is a nonsingular irreducible curve, p,(V) is the usual tgenus. If V is a projective variety, the constant term of the Hilbert polynomial is x( V) and the coeffi- cient of its highest term is (deg V)/rL Let V be a tnormal variety, and D a tdivisor on V. If, for each point x E V, we denote by (D(D)x the set of the functions f E k( V) that satisfy (f) + D  0 on some neighborhood of x, we obtain a coherent algebraic sheaf (!!(D), and we have dim HO( V, O(D)) = I(D). If, moreover, V is complete, we put X v(D) = x( V) - x( (9( - D)) and cal1 it the virtual arithmetic genus of D. Classically, the number p,(D) =( -lr l (xv(D) - 1) was called by that name. When D is effec- tive and has no multiple components, Xv(D) coincides with the arithmetic genus of D re- garded as a variety. In general, Xv(D) stays in- variant if D is replaced by a divisor that is algebraically equivalent to D (-" Section N). If D is a Cartier divisor, O(D) is an inver- tible sheaf. For two Cartier divisors Db D2' (D(D I + D 2 ) = CJ(D I ) @ (D(D 2 ), and Dl and D 2 are linearly equivalent if and only if (!!(Dd (!!(D 2 ). Let V be an irreducible nonsingular variety and let QP denote the sheaf of germs of regular differential forms of degree p (Qo = (Dv). If V is complete, then we denote dim Hq( V, QP) by hp,q. Serre's duality theorem: Let V be a nonsin- gular complete irreducible variety of dimen- sion r, B an algebraic vector bundle over V, and B* the dual vector bundle of B. Denote by !1J and !1J* the sheaves of germs of sections of Band B*, respectively. Then (i) W( V, Q') is canonically isomorphic to k, and (ii) Hq( V, !1J) and w-q( V, !1J* Q9 Q') are dual to each other as linear spaces by means of the tcup product of the above spaces with W( V, ft)  k In partic- ular, Hq( V, QP) is dual to H,-q( V, Q'-P); hence we have hp,q = h,p,r-q, This theorem was extensively generalized by Grothendieck in the category of schemes [1,331. When the field k is of characteristic 0, we furthermore have hp,q = hq,p by complex con- jugation (-" 232 Kahler Manifolds C); but in characteristic p, there are examples for which this symmetry does not hold [61]. In general, hp,q is a trelative invariant but not an tab- solute invariant; however, as hP'o is the dimen- sion of the hnear space of the differential forms of the first kind with degree p, hP'o is an ab- solute invariant. Hence hO,p is also an absolute invariant in the case of characteristic O. When the characteristic is positive, the absolute invariance of hO,p has not yet been proved. 72 F. Simple Points and Singular Points Let V be a variety over an algebraically closed field k We say that a point P of Vis simple or that Vis nonsingu1ar or smooth at P if the local ring .c p is a tregular local ring. Since the problem is local, we may assume that Vis an affine variety in k". Then the simplicity of P on Vis equivalent to the following condition: P is contained in only one irreducible component of V, and if that component has dimension r there exist n-r polynomials F;(X) in I(V) such that rank (JF;/JX)(xJ=P = n-r. A point of V that is not simple is called a singular point or a multiple point. The set of singular points on V (called the singular locus of V) is a proper closed subset of V. A variety with no multiple points is called smooth or nonsingular. This notion can be made relative. A mor- phism f: X -+ Y of a locally Noetherian scheme is called smooth if f is flat and locally of finite type and all the geometric fibers off are non- singular. In the case of an affine morphism f: X = Spec(R[X l ,... ,Xr+J/UI'''' ,fm))...... Y == Spec(R) of relative dimension r (by which we mean the dimension of the general fiber) with a Noetherian ring R, the smoothness off amounts to a condition that rank «i3/;/i3X)(x)) = S at each point x of X. When for a point P of a variety V the local ring D p is tnormal, P is called a normal point. A simple point is normal. The set of normal points is a nonempty open subset of V. An irreducible variety whose points are all normal is called a normal algebraic variety (or simply normal variety). The singular locus of a normal variety has codimension  2. For an irreduc- ible variety V, there exists a pair (V', f) of a normal variety V' and a birational finite mor- phism f: V' --> V; V' is unique up to isomor- phisms and is called the derived normal model or normalization of V Simplicity and normality for V at a sub- variety W are deti'ned in the same way as at a point by using the local ring Dv, w. For a morphism f: X -+ Y of locally Noe- therian schemes, 10cal1y of finite type, the fol- lowing three conditions are equivalent: (i) f is smooth and every fiber off is a discrete set; (ii) fis flat and every geometric fiber over Spec(K) off is a union of spectra of fields isomorphic to K; (iii) f is flat and every fiber off over y E Y is a union of spectra of fields that are finite tseparable extensions of k(y). These conditions are local with respect to X. If a morphism f satisfies these equivalent con- ditions, we say that f is eta1e or X is etale over Y. A morphism f: X = Spec(R[X I , , XnJ/(.fI, . ,j,,) ...... Y= Spec(R) 
73 is etale if and only if det«DfilDX) (x)) toO for all XE X, Hence etale morphisms correspond to local isomorphisms in the analytic category. For a surjective etale morphism f: X ---> Y many important geometric properties (reduced, integral, normal, nonsingular, etc.) hold on X if and only if they hold on Y (theory of descent). G. Dimension Theorems Let V be an irreducible variety, and let U and W be irreducible subvarieties of V. Then any irreducible component of un W that is simple on Vhas dimension  dim U + dim W - dim V. When the equality holds, the component is called a proper component of the intersection un w. If every component of U n W that is simple on VIS proper, we say that U and W properly intersect on V. Any two subvarieties U and W of P"(k) with dim U + dim W  n intersect each other. When VIS an irreducible r-dimensional variety in pn(k), the number of points of intersection V n L of V with an (n- r)-dimensional linear variety L is independent of the choice of L as long as L is in a "gen- eral position." This number is called the degree of Vand is denoted by deg( V). Letting (9( 1) be the fundamental sheaf of P"(k), we have deg V =((1;(1)" V) (- Section E). H. Group Varieties An algebraic variety G is called an talgebraic group if it has a group structure and if the mapping G x G......G sending (x, y) to xy-l is a morphism. Every algebraic group is quasi- projective (Chow). If G is irreducible, then it is also called a group variety; a complete group variety is called an tAbelian variety (- 3 Abelian Varieties B, 13 Algebraic Groups B). A scheme G over another scheme S equipped with morphisms over S:G x sG......G, G-->G, and S -->G, called multiplication, inverse, and unit section, respectively, which satisfy the relations corresponding to the usual axioms of group, is called a group scheme (over S). As a point set, G is not a group, while, for any scheme T over S the set G(T) = Hom s ( T. G) of the morphisms from T to G is a group (- 52 Categories and Functors M). Consider an algebraic group scheme Gover S = Spec(k). If the characteristic of k is zero, then G is necessarily reduced, so an algebraic group scheme over k is essen- tially the same as an algebraic group; if k has characteristic p, there exist algebraic group schemes over k that are not reduced. 16 I Algebraic Varieties I. Rational Mappings Let f: V-I V' be a morphism of varieties. If V is not complete, the image f(V) is not always closed; the closure off(V) (in V') is called the closed image of V. The image f(V) contains an open dense subset of the closed image. Let V and W be irreducible varieties. A closed subset T of Vx W is called an algebraic correspondence of V and W. We say that points P E Vand Q E W correspond to each other by T if (P, Q)E T. If T is irreducible and the closed image of the projection T --> V coincides with V, then the function field k(V) can be identified with a subfield of k(T); if we have k(V) = k(T) with this identification, then T is called a rational mapping from V to W. Moreover, if the same conditions are satisfied for W, then T is called a birational mapping (of birational correspondence or birational transformation), and in this case we have k(V) = k( W). A mor- phism can be considered a special kind of rational mapping by taking the graph. If T is a rational mapping from V to W and WI is the closed image of T in W, then k( Wd can be regarded as a subfield of k(T) = k( V). If k(V) is tseparably generated (tpurely inseparable) over k( WI), then T is said to be separable (purely inseparable). Let T be a rational mapping from V to W, and let V' and W' be irreducible subvarieties of V and W, respectively. If there exists an irreducible subvariety T' of T whose projec- tions have the closed images V' and W', then we say V' and W' correspond to each other by T. The union of irreducible subvarieties of W that correspond to V' by T is a closed subset of W; it is called the proper transform of V' by T and is denoted by T[ V']. Note that V':::J V" does not imply T[V'] :::J T[ V"]. The set of points of W that correspond to the points of V' is called the total transform of V' by T and is denoted by T{ V'j. Identifying k(T) with k(V), we have Dr, r' :::t Dv. v' in general; if the equality holds, we say that T is regular (or defined) along V'. In that case, W' is the unique irreducible subvariety of W corresponding to V' by T. If V' is simple and of co dimension 1 in V, then T is always regular along V'. The set U of points of Vat which T is regular is a nonempty open subset, and the restriction of T to U defines a morphism from U to W. A rational mapping can be defined as the closure of the graph of a morphism defined on an open subset of V. The study of a rational mapping can be reduced to that of the bira- tional morphism T --> V. Zariski's main theo- rem: Let S : X ---> Y be a birational mapping, and assume that the inverse S-l: Y-X is regular, and that X is normal along an irre- ducible subvariety X'. If there exists an irre- 
16J Algebraic Varieties ducible component Y' of S[X'J with dimX' dim Y', then S is regular along X'. It follows from the above main theorem that, if T: V--> W is a rational mapping and if P is a normal point of V such that T[ PJ contains an iso- lated point, then T is regular at P. For a birational mapping T: V...... W between complete irreducible varieties, a subvariety V of V is said to be fundamental when dim T[ V'J > dim V'. When V' is a point (curve) we say V' is a fundamental point (fundamental curve) with respect to T. The most classical example of a birational correspondence with fundamental points is the quadratic transformation T of a projective plane onto itself given by (xo: x, : x 2 ) ......(XtX2:X2XO:XOXt) with a suitable coordinate system. In this case, T 2 = identity and the points Pi defined by x j = 0 for j * i correspond to the lines Xi = 0 by T Let Cr, denote the group of birational transformations of P"(k) into itself. These transformations are called Cremona transformations when n  2, Cr, is generated by linear transformations and qua- dratic transformations (M. Noether). Recently, Cr, has been studied in detail by M. Dema- zure [21J and H. Umemura [91]. Let V be a complete nonsingular irreducible variety over k It is called relatively minimal if every bira- tional morphism from V to a complete non- singular variety V' is an isomorphism. It is called minimal if every birational mapping from a nonsingular variety V' to Vis a mor- phism. Replacing birational mapping by ra- tional mapping, we can define strong (abso- lute) minimality. Abelian varieties and com- plete nonsingular curves with positive genera are strongly minimal. In general, minimality implies relative minimality. Zariski's main theorem is closely related to the general connectedness theorem due to W. Fulton and J. Hansen stated as follows: Let P be the product of r copies of P;' and A be the diagonal subvariety of P. If X is an irreducible projective variety and f: X --> P is a morphism with dimf(X»(r-l)m, then rl() is con- nected [22]. The following results are derived from this theorem: (1) if X is singular with only normal crossings (- Section L), then X can- not be imbedded in p 2n - t , where n = dim X. (2) Let X be a nonsingular subvariety of pN which is not contained in any hyperplane. Suppose that by the projection n x : p N --> pN-l with cen- ter x, x being a general point of p N , X is iso- morphic to njX), Then 3 . dim X < 2(N - 2) (Zak's theorem). (3) Let X be a nonsingular subvariety of P'" and H be an arbitrary hyper- plane section of X. Then dim(Sing H) < codimX (1. Roberts). By a similar connectedness theorem, it was proved that if C is a (reducible) curve with normal crossings on p2 over C, then the fun- 74 damental group of p2 -C is commutative, which was conjectured by O. Zariski [20]. J. Rational Varieties An irreducible algebraic variety V over k whose function field is purely transcendental over k is called a rational variety. A complete smooth surface S over an algebraically closed field is rational if and only if P 2 (S) = q(S) = 0 (the Castelnuovo-Zariski criterion, -' 15 Alge- braic Surfaces E). If the function field of V has a finite alge- braic extension which is purely transcendental over k, then Vis called unirationaJ. A unira- tional curve is in fact rational. More generally, if the function field k( C) of a curve Cover k is contained in a field finitely generated and purely transcendental over k, then C is rational (Liiroth's theorem). A unirational surface over an algebraically closed field of characteristic zero is rational by virtue of the above criterion, but in the case of positive characteristic there are unirational surfaces which are not rational (Zariski). There are nonrational unirational threefolds even of characteristic zero; for example, all smooth cubic hypersurfaces in p4 (C. H. Clemens and P. A. Griffiths [ 1 6J), and some smooth quartic hypersurfaces in p4 (V. A. Iskovski'i and Yu. 1. Manin [45J). See [6]. K. Monoida1 Transformations Let V be an irreducible variety and ,j be a sheaf of ideals of (1;v. For any affine open set V of V, ,j I V is determined by an ideal a of the coordinate ring A of U, Let a" a" , a, be a system of generators of a, and let U' be the graph of a rational mapping from V to pm(k) such that the points PEU and (ao(P):adP): : am(P))E pm(k) correspond to each other by U' Then V' is uniquely determined (up to isomorphisms) by V and a only, Suppose V has a covering by affine open sets Vi' We obtain V; over Vi as before. By patching them together, we get a birational morphism T: V'...... V, which is unique up to isomorphisms. This T with V' is called the monoida1 trans- formation or blowing up of Vby the ideal sheaf .f. Note that in some references, the inverse transformation T- t is said to be the monoidal transformation. The inverse image ideal T- l (.J") is an invertible sheaf, that is relatively ample over V. If W is the support of (!v/ and W# V, then T-'(W) has codimen- SIan 1 and T gives rise to the isomorphism from V' - T-I(W) onto V- W. Thus one can say that V' is obtained from V by replacing W with T- '(W), which is locally defined by prin- 
75 cipal ideals. If every point of W is a nonsin- gular point of V and W is itself nonsingular, T-l(W) is the projective bundle over W whose fiber is the projective r-space where r = dim V - dim W - 1. In general, if W is a subvariety defined by a sheaf of ideals J, then T is said to be the monoidal transformation with center W. In particular, if W is a point, T is called the (locally) quadratic transformation. 1. Resolution of Singu1arities Given an arbitrary irreducible variety V, we have the problem of finding out a nonsingular projective variety V' birationally equivalent to V. This is called the problem of resolution of singu1arities. In the case of characteristic zero, this problem was solved by Zariski (1944) for dimension < 3 and by H. Hironaka (1964) [37J for any dimension. In the case of characteristic p, S. Abhyankar solved the 2- dimensional case (1956) and the 3-dimensional case (1966). Hironaka's theorem of resolution of singularities is stated as follows: Let V be a variety over a field of characteristic zero. Then there exists a finite sequence of morphisms of varieties: v,...... V,-I ......, Vi...... V o = V such that (1) V, has no singular points, (2) each Vi +1 -> Vi is a monoidal transformation of Vi with center D i , (3) each D i is a nonsingular subvariety, (4) each Vi is normally flat along D i . Here, Vis said to be normally flat along a sub scheme D of V defined by the sheaf of ideals I, if the quotient modules o1! / o1!+1 are flat (9D,x- modules for all p and all points x of D. Let V be a nonsingular variety and D an effective divisor on V. D is said to be a divisor with (only) normal crossings at x E V, if D is defined by fl' Is such that (/1' ,f,) is a part of a tsystem of local coordinates around x. D is said to be a divisor with only normal cross- ings, if it is so everywhere. For any subvariety Won V, there exist h: V...... V that is a composi- tion of monoidal transformations with non- singular centers, such that h-I(W) has only normal crossings. This results from Hironaka's main theorem II [37]. The normal crossing divisor is also defined for complex manifolds, and similar results hold. M. Cycles and Divisors Let V be an irreducible variety. We denote by 23, the set of r-dimensional irreducible sub- varieties of V that are simple on V (i.e., are not contained in the singular locus of V), and by 3,(V) the free Abelian group with basis 23,. Elements of 3,( V) are called cycles of dimen- sion Y (or r-cycles) on V. Let A and B be r- 16N Algehraic Varieties cycles; A = L n i Ai' B = L mi Ai (Ai E 23" Ai i- Aj if i i- j). If n i ? mi for all i, then we write A ? B If A ? 0, then A is said to be a positive cycle. For a O-cycle A = L niP i the integer deg(A) = L n i is called the degree of A A cycle of codimension 1 is called a divisor. A divisor? 0 is usually called effective in- stead of positive. If Vis of dimension d and if W. 23,-, , the local ring (!Jv w is a tdiscrete valuation ring. The tnormalized valuation defined by it is denoted by v w ( ), For a func- tion J E k{ V), we say that W is a zero of order n if vw(f) = n> 0, and that W is a pole of order - n if vw(f) = n < o. Any function f E k{ V), other than the constant 0, has at most a fmite number of zeros and poles. We denote by (no the sum L vw(f) W extended over all the zeros Woff, and put (f-l)O=(!)oo and (f)o- (f),=(f). We call (f)o, (f)oo' and (f) the zero divisor, the pole divisor, and the divisor off; respectively. The divisor (f) is equal to L vwcn w, where the summation is taken over WE23 d _ l , and we have (fg)==(f)+(g). When V is complete and the singular locus of V has codimension > 1, then fis constant if and only if(f), =0 (or (/)0=0). Let DI and D 2 be divi- sors; if there exists a function f( '# O)E k{V) such that DI -D 2 = (f), then DI and D 2 are called linearly equivalent to each other and we write DI  D 2 . The linear equivalence class containing a divisor D is denoted by c1(D). A divisor which is linearly equivalent to 0 on a neighborhood of each point of V is called a Cartier divisor (some authors call a Cartier divisor simply a divisor). If V is smooth, then any divisor is a Cartier divisor. If a divisor D can be written as D=(f) on an open set U, then the function fis called a local equation of D on U. Let T: V' --> V be a rational mapping from a normal variety V' to a complete variety V, let D be a Cartier divisor on V, and assume that the closed image of T is not contained in D. Since T is regular on some open set U such that codim( V' U) > 1, we have a morphism <p = T IU and the pullback qJ*(D) defined by composing the local equations of D with qJ. Taking the closure of this divisor in V', we obtain a divisor on V', which is denoted by T*(D). N. Divisors and Linear Systems Let V be a complete irreducible variety, fo, JI" , "f" be elements of the function field k{V) of V, and D be a divisor on V satisfying (fi) + D ? 0 for each i. Then the set 1: of the divisors of the form CE a i ,!;) + D, where the a i are elements of k and not all zero, is called a linear system. The linear space kfo + kJl +. + kfn is called a defming module of 1:. The divisors in 
16 0 Algebraic Varieties I: are positive and are linearly equivalent to each other; if every positive divisor that is linearly equivalent to a member of 1: belongs to I:, then I: is said to be a complete linear system. For any linear system I, there exists a unique complete linear system containing it, which is denoted by 11:1. The maximal positive divisor Do that is contained in all divisors of 1: is called the fixed component of 1:, and for each DEC we caIl D - Do the variable compo- nent of D (or of l). A point P of V is called a base point of a linear system I: if P is on each variable component of I:. A linear system J; is called irreducible if its generic member is irreducible; otherwise it is called reducible. The dimension of a defining module of a linear system I: is denoted by 1(C); we call I(C)- 1 the dimension of I: and denote it by dim 1:. A linear system of dimension 1 is called a linear pencil. A defining module of a linear system 1: is determined uniquely up to k-isomorphisms. Let L be a defming module, and let io'!" . ,f" be a linearly independent basis of Lover k If we associate to each point P of V the point Q = Uo(P):fdP):... :fn(P)) of the n- dimensional projective space, then we obtain a rational mapping <1>E from V to another variety V'. Outside the base points of I:, the rational mapping <1>1: is regular; and the base points are the fundamental points of 11>1:' We say that <1>E is the rational mapping defined by the linear system 1:. When Chas no t'ixed components and 11>.1; is a closed immersion, I: is said to be very ample (or ample). For a divisor D, the set of positive divisors that are linearly equivalent to D is a linear system, which is called the complete linear system and is denoted by IDI. We usually write I(D) in- stead of l( IDI). If IDI is very ample, we say that D is very ample. We say that D is ample (or nondegenerate) if mD is very ample for some m>O. O. Differential Forms Let V be an n-dimensional irreducible variety, and let A = k( V) be its function field. We de- note by D* the set of derivations of  over k, i.e., the k-Iinear mappings D: --> satisfy- ing DUg) = DU)g + fD(g). Then !J* is an n- dimensional linear space over R Let D denote the tdual space of !J* over R. For each f E, let df be an element of !J defined by <df, D) = D(f) (DE!J*). Let XI' Xl,' . . , x n be a separat- ing transcendence basis of  over k, in the sense that x I' ,x n are algebraically inde- pendent over k and  is a tseparable algebraic extension over k(x " ,x n ). (Such a basis exists under the weaker hypothesis that k is tperfect.) 76 Then dx" . . . . dx, form a basis of!J over R. The homogeneous elements of degree r of the tGrassmann algebra of D over  are called differential forms of degree r on V (or rational r-forms). The set of the differential forms of degree n is a I-dimensional linear space over  spanned by dx, A dx, A. A dx,. A set of n functions fl, In in  is called a system of local coordinates on an open set U of V if fl - fdP), . ,in - fn(P) is a tregular system of parameters of the local ring D p for each P E U. In that case, fl' , f" is also a separat- ing transcendence basis of R If P is a simple point of V. then there exists a system of local coordinates on a suitable neighborhood of P Let w be a differential form of degree r on V, and write (J) = Li, < '" <i,<fJ(;)d/;1 A A dJ.., where Ul' . . ,in) is a system of local coordi- nates around P If the coefficients <P(i) are regular at p, then w is said to be regular at P When Vis a complete variety without sin- gular points, a differential form that is every- where regular on V is called a differential form of tbe first kind (or a regular form); the dif- ferential forms of the first kind are determined by the function field A and are independent of the choice of the nonsingular model V. The number of linearly independent differential forms of the first kind, of degree n, is denoted by Pg and is called the geometric genus of V Let V be a complete variety, Wan irreduc- ible subvariety of V of codimension 1, and P a point of W that is simple on V. Choose a system of local coordinates (fi). Given a dif- ferential form won V, we write it as a "poly- nomial" in the dJ;, and denote by vw(w) the minimum of the values of the coefficients for the tvaluation v w ( ), The number vw(w) is determined by w and W, and it is independent of the choice of P and of the local coordinates. Then w defines a divisor (w) = Lw vw(w) Won v, which is called the divisor of a differential form (1), The divisor of a differential form of degree n (= dim V) is called a canonical divisor and is usually denoted by K The canonical divisors form a linear equivalence class of divisors, P. Albanese Variety, Picard Variety, Neron- Severi Group Let V be an irreducible variety. Then we can construct a couple (AJ) consisting of an t Abelian variety called the Albanese variety of V and a rational mapping f: V+A (called an Albanese mapping) such that: (i) the image of f generates A, i.e., the sum off with itself n times, F: V" --> A, is generically surjective for suft'iciently large n; (ii) for every rational map- ping g: V --> B of V into an Abelian variety B, 
77 there exist a homomorphism h: A -7 B and a point bE B such that 9 == h. f + b. The Albanese variety is uniquely determined up to isomor- phisms and f is determined up to translations, In the case of k = C, if V is a complete non- singular variety and if q is the dimension of the linear space of differential I-forms of the first kind on V, then the first tBetti number i?, is equal to 2q, Let WI, , W q be a basis of the linear space and let YI, , Y2q be a basis of the first homology group modulo torsion, Put rx ji = L J Wi and ex j = (rx j1 , , Ci. jq ). Then the period vectors rx j (1 <J < 29) are linearly independent over R in C q . If r denotes the discrete sub- group of C q generated by the Ci. j , then the quo- tient group cq/r is the Albanese variety of V. The Albanese mapping is given by the map- ping P-7(JWI' ' ,JWq) (modr), where P is a variable point on V and Q is a fixed point on V (  232 Kahler Manifolds C), Replacing the term "rational mapping" by "morphism" in the definition, we can define the strict Albanese variety of V and prove its existence, It is a quotient Abelian variety of the Albanese variety of V. If V is nonsingular, both coincide by virtue of the tstrong mini- mality of an Abelian variety (- Section I), Let V be a complete normal variety, U the set of the simple points of V, and D a divisor on V. Then D is said to be algebraicaJly equiva- lent to 0 if there exist a nonsingular curve C, a divisor r on U x C, and two points P and Q on C such that D can be written as D = qJ(f) - qJQ(r), where <pp and <PQ are the mar- phisms qJp: V -->V X P-->V X C and qJQ: V...... U x Q-U x C. We denote by <»(V), <»a(V), and (fj /(V) the set of all divisors on V, the set of divisors that are algebraically equivalent to 0, and the set of divisors that are linearly equiva- lent to 0, respectively, We can introduce a canonical structure of an Abelian variety into <»a(V)/<»/(V), which is called the Picard variety of V. The dimension q of the Picard variety is called the number of irregularity of V; if q = 0 we say that Vis regular. The Albanese variety and the Picard variety of V are tisogeneous to each other, and each one is the Picard variety of the other. If Vis a curve, they are isomorphic and called the Jacobian variety ( 9 Algebraic Curves E), Using Cartier divisors instead of divisors, we get an analogous theory to construct another kind of Picard variety which turns out to be isomorphic to the Picard variety of the strict Albanese variety of V The group of the linear equivalence classes of Cartier divisors can be identified with HI( V, (9:), where (9: is the sheaf of multiplicative groups of the invertible ele- ments in 0" From this point of view, we can generalize the theory of Picard variety to the case of schemes also, The theory thus obtained 16R Algebraic Varieties is called the theory of Picard schemes [60]. The quotient group NS(V) = ffi(V)/ffi a ( V) is finitely generated [66J and is called the Neron- Severi group of V. We call the rank of NS(V) the Picard number of V and denote it by p(V), In the case of a nonsingular projective vari- ety over k = C we have an inequality p(V)  h 1, 1 (V) ( = dim, HI (\I, Q)) and the Lefschetz number B 2 ( V) - p( V) is a birational invariant (where B,(V) is the second Betti number of V) [39]. For the positive characteristic case, however, the above inequality does not hold in general [61]. The torsion part of NS(V) is <»n( V)/ffi a ( V), where (\)n( V) denotes the group of divisors numerically equivalent to zero (- Section Q) (T. Matsusaka), The last fact cannot be gen- eralized for higher codimensional cycles [25]. Q. General Intersection Theory Let V be an irreducible variety of dimension n, and let A and B be irreducible subvarieties of V of dimension rand s, respectively, If C is a proper component of An B, we can define the intersection multiplicity i(A B, C; V) of A and B along C on V, which has properties consis- tent with our geometric intuitions, In partic- ular, it is invariant under biregular mappings, If A and B intersect properly on V and if C" , C N are the proper components of A n B, we define an (r+ s-n)-cycle A. B by A. B== Lv itA , B, Cv; V) C V and cal1 it the intersection product of A and B. If each component X, of an r-cycle X = L, noXa and each component yp of an s-cycle Y = L{J mp Y, intersect properly, we define x, Y=L,Lpn,mpX,' Y p . Then we have the associative law (X Y)' Z = X, (y. Z) for cycles X, Y, and Z whenever both sides are defined, Two r-cycles XI and X 2 are said to be numerically equivalent if for every (n - r)-cycle Y that intersects them pro- perly we have deg(X, Y) = deg(X 2 Y), The theory of intersection is one of the most basis theories in algebraic geometry, for the other theories can be constructed from it [92]. R. Chow Rings Let U and T be nonsingular irreducible vari- eties, If Z is a cycle on U x T such that Z . (U x t) is defined for every point t of T. we put Z' (V x t)=X(t) x t, and we obtain a family {X(t)} of cycles on U parametrized by the points of T Such a family is called an algebraic family of cycles, Two cycles X I and X 2 whose difference is equal to the difference of two cycles in an algebraic family are said to 
168 Algebraic Varieties be algebraically equivalent. In particular, if Xl - X 2 can be expressed as the difference of two cycles of an algebraic family parametrized by the points of the affine line, then X, and X 2 are said to be rationally equivalent. The set of the cycles that are algebraically (rationally) equivalent to 0 is a subgroup 3.( U)(3,., ( U» of J(U). For divisors, rational equivalence coin- cides with linear equivalence. Let f: V --> U be a morphism between non- singular irreducible varieties. For an irre- ducible subvariety W of V, let W' denote the closure of f( W), and put f J ( W) = 0 if dim W> dim W' and h(W) = m W' if dim W = dim W', where m= [k(W): k( W')J is the degree of the morphism W--- W'. Extending f.1 by linearity, we obtain a module homomorphism t,: 3( V) --> 3(U). If f is proper, then tJ induces a module homomorphism f* from 3(V)/3r.,(V) =A(V) to A(U). Let U and V be nonsingular irreducible projective varieties, and f: V + U be a mor- phism with graph r. If Y E 3( U) is such that r. (V x Y) is defined, we denote by f.f( Y) the image of r. (Vx Y) under the induced isomor- phism r --> V. Each class of the rational equiva- lence class group A(U) = 3(U)/3rat( U) contains a cycle for which ti is defined. Hence we can define f*: A(U)-->A(V). Let : U --> U x U be the diagonal morphism, and define x' y = *(x x y) for x, YE A(U). Then A(U) is a ring with respect to this product; moreover, it is a tgraded ring with the grading by codimension. This graded ring A(U) is called the Chow ring of U, and the mappingf*: A(U)+ A( V) is a ring homomorphism. If f is proper, we have j(y' f*(x)) = f*(y), x for xEA(U), YEA(V). Zero Cycles. Let V be a nonsingular irreduc- ible projective variety over an uncountable algebraically closed field of characteristic zero (say C). Denote by A,,(V) the group of classes of a-cycles of degree 0 on V modulo rational equivalence. We say A,(V) is finite- dimensional if the mapping V n x V"+A,(V) sending (a I'"'' an,b 1'"'' b n ) to La i - Lb i is surjective for a certain n. If A,(V) is finite-dimensional, then the canonical mapping A,( V)-->Alb( V) induced from V ___ Alb( V) is bijective (A. A. Rojtman). In general, the torsion part of A,(V) is isomor- phic to that of Alb( V) (Rojtman [75J). If h P ' o ( V) > 0 for some p> 1, then A,(V) is not finite-dimensional (Mumford, Rojtman [74J). If Vis a surface which is not of general type and with h 2 ,0( V) = 0, then Ao(V) is isomorphic to Alb( V) (S. Bloch, A. S. Kas, D. Lieberman [13J). There exist surfaces of general type with 11 2 ,0 =0 such that A,(V) =0. For instance, Godeaux surfaces are such surfaces (H. Inose [43J). 78 One Cycles. Let V be a nonsingular projective irreducible variety and N(V) denote {numer- ical equivalence classes of I-cycles on V} zR. Via the intersection pairing, N(V) is dual to the Neron-Severi group tensored by R (- Section P); and let NE(V) denote the smallest COne in N(V) containing all effective I-cycles. S. Mori studied the structure of NE(V) in detail. One of his results is stated as follows: If the anti canonical divisor is ample, then there exist rational irreducible curves II ' , I, such that (1) -(I;, Kv)<n+ 1 and (2) NE(V)= R+ [1 1 J + ... + R+ [I,J, where n=dim V, Kv is the canonical divisor on V, and [IJ denotes the class represented by 1,. Moreover, Mori has proved that (1) if Kv is not numerically effective, e.g., if - Kv is ample, then V con- tains a rational curve. and (2) if the tangent bundle of V is tample, then Vis isomorphic to the projective space [59]. (This is called the Hartshorne conjecture.) S. Chow Coordinates, Hilbert Schemes Consider an irreducible algebraic correspon- dence T between irreducible varieties V and W. Let V' and W' be the closed images of the projections of T to V and W, and let a and c be their dimensions, respectively. Take generic points P, Q of V', W', and consider the total transforms T{P] of P to Wand T{Q} of Q to V. Denoting the dimensions of T{ P} and T{ Q} by b and d, respectively, we have a + h = c + d, where both sides are equal to the dimension of T. This property is called the principle of counting constants. This simple principle has wide application. For instance, let V be an r-dimensional variety in P"(k), and let L j UijX j = 0 (0  i  r) be the equations of r + 1 hyperplanes Hi' The con- dition vn (Ho n n Hr) =1= 0 defines an irreduc- ible algebraic correspondence T between V and the multiprojective space W = F' "(k) x x P"(k) with uij as coordinates: T={(x,u)lxEV, L UijX j = O}. In this case, we have a = r, b = (n l)(r + I), and d =0 in the notation intro- duced previously, so that c = n(r + 1)- 1. This implies that W' is of codimension 1 in W; hence W' is defined by a single equation F(uiJ = O. This form F is the associated form of Vof B. L. van der Waerden and W. L. Chow. It is a homogeneous form of degree d (d = deg( V)) in each (u iO , , u in ) and is symmetric in the indices i. More generally, for a positive cycle X = L n" Va of dimension r and of degree d ( = L n, deg( V x )) in P"(k), the product II F;" of the associated forms F, of V, is called the as- sociated form of X. The coefficients of F, arranged in a fixed order and regarded as the homogeneous coordinates of a point of a pro- 
79 jective space, are called the Chow coordinates of X. This is a natural generalization of the tPli.icker coordinates. Given r, d and a projec- tive variety U ( c P"(k», the set of Chow coor- dinates of the positive cycles that are con- tained in U, whose dimension is r and whose degree is d, is a projective variety called a Chow variety. In scheme theory, the Hilbert scheme is introduced in the following way [60]. Let Pz be the n-dimensional projective space over Spec Z. For a locally Noetherian scheme S, and for a closed subscheme Z of Pz x S, we have the composition of the immersion of Z and the projection Pz x S to S; this is denoted by fz: Z -->S, For any point 5 of S, fZ-l(S) is a closed sub scheme of Pk(s) and the restric- tion of (1)(1) to JZ-I(S) is denoted by L,. Put ..It(S) = {Z c Pz X S} Ifz is flat}. Then M be- comes a contravariant functor to the cate- gory of sets, which is representable, i.e., there exists a locally Noetherian scheme M such that uIt is naturally isomorphic to the functor Hom( -, M). M is written as Hilb(Pz) and is called the Hilbert scheme. There exists a closed sub scheme W of Pi x Hilb(Pz) with flat fw such that for any ZEM(S), there exists a unique <p: S-->M = Hilb(Pz) in such a way that Z = W X MS, In particular, if X is a closed sub- scheme of Pi over a field k, there exist SE Hilb(Pz) and a field extension k(k(s) such that X = f;I(S) Q9 k('lk. Thus Hilb(Pi) parametrizes all closed subschemes of p;;. Fix a polynomial P and define AP(S) to be {Z EM(S)I X(Lm) = P(m) for all s}. A P is also a contravariant functor, represented by a scheme Hi1b P (PZ) which is projective over SpecZ. The direct sum of all Hilb P (Pz) is just Hilb(Pz). T. A1gehraic Geometry and Complex Analytic Geometry When k = C, an algebraic variety is called a complex a1gehraic variety, and it has the struc- ture of a tcomplex analytic manifold or (if it has singular points) of an tanalytic space. If we denote by (I) x the ring of holomorphic fUnc- tions at a point X of V, then @v,x C (I).x, and their tcompletions coincide. If x is a simple point of V, then (I).x is the tring of convergent power series, and its completion is the tring of formal power series. The prime ideals of (I).x remain prime under completion (M. Nagata). Because of this, the analytic behavior of Vin a neighborhood of x can be investigated alge- braically through the completion of @v,x' If Vis complete, then the analytic coher- ent sheaves on V and the algebraic coherent sheaves on V correspond to each other bijec- tively; consequently, for propositions that can 16U Algebraic Varieties be stated in terms of coherent sheaves, the results in the analytic sense remain valid in the algebraic sense also, and vice versa (J.-P. Serre [82]) (- 72 Complex Manifolds E). U. Topology of Algebraic Varieties Every algebraic variety defined Over R (or C) can be triangulated by real analytic cells [46]. Let V be a nonsingular connected algebraic variety defined over C. For an algebraic auto- morphism (J of C we can define va by letting (J operate on the coefficients of the defming equations of open affine coverings of V. V and Va are not necessarily homeomorphic or even of the same homotopy type [83]. Grothen- dieck has shown that there is a tspectra1 se- quence Ej'P= HP( V; Q)W+q( V, C), called a Hodge spectral sequence, This spectral se- quence can be defined in a purely algebraic way and Hp+q( V, C) can be considered as the thypercohomology of the de Rham complex { f( V; Qv), d} of V. If V is projective, V carries a Kiihler metric and by the theory of tharmonic integrals the Hodge spectral sequence degener- ates and Ef'q is C-isomorphic to the complex conjugate of Ej'P (- 232 Kahler Manifolds B). This is also the case if V is complete, The topology of a nonsingular projective surface was studied by Lefschetz using the method of Lefschetz pencils, For a projective nonsingular irreducible variety V of dimension IJ del'ined over C, a Lefschetz pencil { W,}'EP1 of V is, by definition, a linear pencil consisting of hyperplane sections W, of V such that: (i) for all t E U = P - {I \, 1 2 , , I d }, w, is nonsingular; (ii) each w" has only one singular point that is an ordinary double point; and (iii) W o n Woo is nonsingular, where we assume 0, (f) E U. Embed V into p N by a high multiple of a hyperplane of V and take a generic linear pencil {H,} of hyperplanes in p N . Then { W, = Ht n V} is a Lefschetz pencil of V. By blowing up V along W o n Woo we obtain a smooth variety V and a surjective morphism n: V -->p l . Let W=n-I(O), nu=nln-I(U). RPnu.Q is a local system attached to the monodromy representation <pp: n l (U, O)-->GL(HP(W, Q)). f{Jp is trivial if pin - 1. For each point Ii there corresponds a cocycle 6 i of H"-I (W, Q) called a vanishing cocyc1e such that if y, is a loop based at 0 going once (counterclockwise) around Ii> we have, for each XE W- l ( W, Q), <pp(Y;) (x) = x::!: (x, 6 i )b i , where ( ) is the intersection pairing of W- l ( W, Q). <PP(Yi) is called a Picard- Lefschetz transformation. The main results due to Lefschetz are restated as follows. (1) (Weak Lefschetz theorem). The natural homomor- phism Hi( V; Q)->Hi( W, Q) is an isomor- phism for 0  i  n - 2 and is an injection for 
16 V A1geb raic Varieties i = n - 1, or equivalently H;( V, w, Q) = 0 for o  i  n - 1. (2) (Strong Lefschetz theorem). Let  be the cohomology class of H2( V, Q) corre- sponding to the hyperplane section W, and let L:H*(V, Q)-->H*T2(V, Q) be the homomor- phism det'ined by the cup product with . Then foreachjn,L"-I:Hi(V,Q)-->H2n-i( V, Q) is an isomorphism. The weak Lefschetz theorem is true for a cohomology with integral coeffi- cients. In fact, V - W has the homotopy type of a real n-dimensional finite CW complex, and IT,( V, W) = 0 for Y < n The strong Lefschetz theorem is equivalent to the statement that H n - 1 ( W, Q) is the direct sum of the vector space spanned by the vanishing cocycles 6 i and the vector space spanned by the invar- iant cocyles (i.e., <Pn-l (yJx == x, i = 1, , d). Lefschetz's original proof of this statement is incomplete, and no direct topological proof is known. The transcendental proof of (2) is given by the theory of harmonic integrals. A version of Lefschetz pencils is a proper mor- phism f:X -->D = {zllzl <s} of a complex mani- fold X onto a disk D such that f* =flf-I(D*), D* = D - {O}, is of maximal rank at every point off-I(D*). Fix a pointsED*. ndD*,s) oper- ates on Hj( W, Z), W =fI(S), and we have a representation qJj: n l (D*, s)-->GL(Hj(W, Z». For a loop y based at s and going once around o the Picard-Lefschetz transformation qJ)y) is quasiunipotent (i.e., for a certain integer m, <p/y)m is unipotent). For a nonsingular projective variety det'ined over a field k with characteristic p> 0, the above Lefschetz theorems hold for an /-adic cohomology (lIp) [19,30 (SGA 7)] (- Sec- tion AA). Using the theory of finite etale cover- ings of an algebraic variety det'ined over a field k, we can define the algebraic fundamental group and the algebraic homotopy groups, which are profinite completions of the topo- logical fundamental group and the topological homotopy groups, respectively, where k C [5, 30 (SGA I)]. Let (X,x) be a germ of a complex space with isolated singular point x. (X,x) is always algebraizable, i.e., the com- pletion {!Jx,x of the analytic local ring (!)x.x is isomorphic to the completion of the local ring of a closed point of an algebraic variety de- fined over C [3]. For topological type of al- gebraic surfaces -" Moishezon [57]. V. Hodge Theory Let H R be a finite-dimensional real vector space containing a lattice Hz, and let H = H R Q9 R C be its complexification. A Hodge structure of weight m on H (or H R) is, by de- finition, a direct sum decomposion H = ffip+q=m HP . q , Hp,q Hq,p, whereHP,q is a 80 ;omplex vector subspace and the overbar de- notes complex conjugation. If H and H' carry Hodges structures of weight In and m', respec- tively, then H @ H', HomdH, H'), I\PH, and H * carry Hodge structures of weight In + m', m' - m, pm, and -m, respectively, For a Hodge structure H of weight m, FPH = ffik;;,pHk.m-k, p = 0, 1 , . , m, induces a decreasing filtration. Let H be a Hodge structure of weight m, and let Q be a bilinear form on H. If the fo1\ow- ing three conditions are satisfied, the Hodge structure H is said to be polarized by Q, (i) Q is det'ined over Q and is symmetric (skew- symmetric) if In is even (odd), (ii) Q (Hp,q, HP ,q) = 0 unless p = p', q = q' (iii) (j=1y-qQ(v, v) > o for nonzero VE H p, q. Let V be a compact Kiihler manifold, Then H = Hn( V, C) carries the Hodge structure induced by the type (p, q)- decomposition (232 Kiihler manifolds B). This is also the case if V is a compact complex manifold which is the image of a holomorphic mapping from a compact Kahler manifold of the same dimension, Moreover, if Vis projec- tive, the Hodge-Riemann bilinear relations define a natural polarization on the subspace P of H consisting of all primitive cohomology classes, Each algebraic cycle W of codimension s determines a cohomology class [W] E H ZS ( V, Q), which belongs to W' '( V). (Such a class is called an algebraic cycle.) The con- verse of this fact is called the Hodge conjecture, which says that H2'( V. Q)n HS,S(V) is spanned by algebraic cycles. The case S = 1 has been verified by Hodge., Lefschetz, and Kodaira. Let V (W) be a smooth irreducible algebraic variety deUned over C (complex manifold), and let rp: V --> W be a projective smooth morphism with connected fibers, Then H = RmqJ* C is a flat vector bundle on W with the flat connec- tion V. V is often called the Gauss-Manin connection, and it can be deUned algebraically if W is also algebraic. For each fiber V. = <P -I (s), SE W, the filtra- tion fPHm(V" C)= ffik;;,pHk,m-k( V,) induces a complex subbundle FP and the connection V has the property V(@(FP)) c (!)(W- I Q9 T*), where T is the tangent bundle of W [26]. Moreover, if W is algebraic, V is a differential equation with regular singular points on W where W IS a smooth compactification of W, such that W - W is a divisor with normal crossings ( Section L). If we consider the sub bundle P of H consisting of all primitive cohomology classes, the polarization on each fiber induces a Hermitian pseudometric on P. Curvatures of bundles P n FP have been studied by Griffiths [26]. There exists a classi- fying space D for polarized Hodge structures and there exists a holomorphic mapping of the universal covering Hi of W into D, usualIy 
81 called a period mapping. D may not be a bounded symmetric domain but has several interesting properties [24,27]. In some cases D/f with a suitable discrete subgroup r is the moduli space of polarized algebraic varieties (e.g" curves, Abelian varieties). P. Deligne [ 1 8J has generalized Hodge theory to arbitrary algebraic varieties (more generally schemes of finite type over C). The simplest case is the Hodge theory of a smooth noncomplete irreducible variety X. By Na- gata's embedding theorem [63J there exists a complete algebraic variety X such that Y = x-x is a subvariety. By virtue of Hironaka's resolution theorem we can assume that X is nonsingular and that Y is a divisor with normal crossings. Let 0l(1og Y) be a sheaf of germs of meromorphic I-forms with loga- rithmic pole along Y, i.e., locally written as Ll a;(x)(dxjx;) + Lj=k+1 aj(x)dxJ' where (XI' , x n ) is a system of local coordinates with center pE Y in X such that x x k = 0 is a local equation of Y and ai(x), aj(x) are holomor- phic at p. Using the complex {QWog Y)= NQl(log Y), d}, with a suitable filtration, DeIigne has shown that H = Hm(X, C) carries a mixed Hodge structure and this structure is independent of the choice of X. The mixed Hodge structure on H consists of two finite filtrations, i.e.,Oc.. c w,,-l C w" c. c H, the weight filtration which is detlned Over Q, and 0 c. cFc F+' C. c H, the Hodge filtration such that F induces on w,,/w,,-I a Hodge structure of weight n, As a corollary he has shown that a meromorphic p-form on X with logarithmic pole along Y (i.e., a section of QHlog Y) is d-c\osed on X, and w = 0 if and only if UJ lx is zero in HP(X, C). An important application of the theory of this mixed Hodge structure on Hm(X, C) is the following. Let V and W be smooth irreducible varieties, and let <p: V --> W be a smooth projective mor- phism. If V is a smooth compactification of V, the canonical homomorphism Hm(V, Q)...... H O ( W, Rm<p*Q) is surjective Fix a point SE W. n 1 (W, s) operates on Hm(, Q). Then this action is semisimple [ 18]. W. Deformations, Moduli, Algebraic Spaces In this section for simplicity the field k is as- sumed to be algebraically closed. Let X be an algebraic scheme over k A (flat) deformation of X over a connected scheme S over k with base point So consists of the following data: (1) A morphism p: I -->S that is flat and of finite type. If X is complete, p is also proper. (2) A closed point So E S such that the fiber :E x s k(so) is isomorphic to X. For any closed point S E S, the Iiber X,=X Xs 1(8) is called a flat deforma- 16 W Algebraic Varieties tion of X. If X is smooth and complete, we assume further that p is smooth. Similarly, we can define a deformation of a polarized alge- braic manifold, an embedding deformation of X in an algebraic scheme Y over k, a de- formation of an affine scheme with isolated singular points, and a deformation of vector bundles on a fixed algebraic scheme over k, etc. The theory has two aspects: local theory and global theory. Let (R, m) be a complete Noetherian local ring such that Rim = k Set Rn = Rim"' A for- mal deformation X R of X is a sequence {X n } such that (i) X n is a deformation of X over Spec(R n ) and (ii) there is a compatible se- quence of isomorphisms X n Q9 R"Rn-l Xn-I for any n. Let (FLAlk) be the category of finite-dimensional commutative local k- algebras. The local theory of deformation is the study of the covariant functor F of (FLA/k) to (Set), where, for A E (FLA/k), FrA) is the set of isomorphism classes of deformations of X over Spec(A). The functor is in general neither representable nor prorepresentable (i.e., there exists a formal deformation X R of X such that FrA) = Homk_alg(R, A)). But, under reasonably mild conditions on F, F has the hull R [78]. That is, there is a formal deformation X R of X and a natural transformation j: G-F, where G(A) = Homk_alg(R, A), such that j is for- mally smooth (i.e., for any surjection A'+ A in (FLAlk), G(A')......G(A) XF(A)F(A') is surjective) and G(k[[.J)-->F(k[cJ) is bijective for the ring of dual numbers k [[.]. The formal deformation X R is called a versal deformation of X. The hull R is unique up to noncanonical isomor- phism. The deformation functor F has the hull R if X is (i) a complete algebraic scheme over k, (ii) an affine scheme with isolated singular points, (iii) a polarized algebraic variety over k, or (iv) a vector bundle on a complete alge- braic scheme over k, etc. If there exists a de- formation n::E...... S of X over a scheme S with base point So over k such that R = 0 s ,so' X n  3 Q9 Rn. the formal deformation {nn : X n -->Spec(R n )} is called algebraizable. Alge- braizability of the versal deformation has been studied by M. Artin Since the assumption that S is a scheme is rather restrictive, Artin has introduced the notion of algebraic spaces and has considered algebraizability in the category of algebraic spaces [2-4]. For a complete algebraic variety, the versal deformation is not necessarily algebraizable and we need to con- sider deformations of polarized algebraic varieties. The versa 1 deformation of an affine variety with an isolated singularity is alge- braizable in the category of algebraic spaces For the global theory of deformations, we need the projectivity assumption, and the theory is essentially reduced to the theory of 
16 W Algebraic Varieties Hilbert schemes (or Chow varieties) ( Sec- tion S). The problem of moduli is considered as the study of the set A4 of all isomorphism classes of deformations of X. Usually we con- sider the moduli of polarized varieties. Let (Sch) be the category of Noetherian schemes and ..4t a contravariant functor from (Sch) to (Set) del1ned by .it(X)== {isomor- phism classes of families of polarized varieties parametrized by X EOb(Sch), e,g" families of polarized Abelian varieties with(out) addi- tional conditions}. The functor .it is called a moduli functor. If it is represented by a scheme M, M is called a fine moduli scheme [60J In this case there is the universal family n".y -->M. In many cases the moduli functor is not representable. A coarse moduli scheme for a given moduli problem is a scheme A4 together with a natural transformation rjJ: ..4t -->Hom( -, M) such that (1) rjJ(Spec(k)): M(Spec(k))--> Hom(Spec(k), M) is bijective for any algebrically closed field k, and (2) for any scheme N and any natural transformation i/1 : .it --> Ham ( -, N), there is a unique natural transformation A. Hom( -, M)-->Hom(, N) with i/1 = A' , A coarse moduli scheme is called a moduli space or a moduli scheme. In many cases, the moduli space can be obtained as the quotient space of a certain (locally closed) subset H of a Hilbert scheme by the following equivalence relation: s-s' EH if and only if XS.4X" as polarized varieties, where n: 1--t H, n -1(S) = X, (T. Matsusaka [53J), This equivalence relation is often in- duced by an action of a treductive algebraic group G. Suppose that a reductive algebraic group G operates on an algebraic k-scheme Z. A G-invariant morphism.f: Z--> Y (i.e., for the trivial action of G on Y, j is a G-equivariant morphism) is called a geometric quotient if (1) f is a surjectIve affine morphism and f*( (Dz)G = (Dy, (2) if X is a G-stable closed subset of Z, then .f (X) is closed in Y, and (3) for x I' X 2 E Z, f(xJ = f(x 2 ) if and only if the G-orbits of Xl and Xl are the same. Let G be a reductive algebraic group, X, G......Aut( V) a rational rep- resentation on a finite-dimensional vector space V over k, and V o #0 a G-invariant point. Then there exists a G-invariant homogeneous polynomial F of degree  I on V such that F(v o ) # 0 (W. J. Haboush [32J). This implies that if a reductive group G operates on an algebraic k-affine scheme Spec(A), then the invariant ring A G is a finitely generated k- algebra and, moreover, if any G-orbit in Spec(A) is closed, the natural morphism Spec(A)-->Spec(A G ) is a geometric quotient. For a quasiprojective scheme Z over k with an action of a reductive group G we need a notion of stable points [60,62]. The subset of stable points of Z consists of all geometric 82 points of a G-stable open sub scheme Z' of Z, and there exists a geometric quotient f: ZS--> Y where Y is quasiprojective (Mumford [60J, C. S. Seshadri [79J, Haboush [32J), In this way Mumford has shown the existence of coarse moduli schemes of nonsingular complete irreducible algebraic curves and polarized Abelian varieties. But, in general, analysis of stable points is very difficult and it is desirable to extend the category of schemes so that it becomes easier to obtain a quotient. Matsu- saka has introduced the notion of a Q-variety [52]. M. Artin has introduced the notion of an algebraic space (D. Knutson [49J). An algebraic space X of finite type consists of an affine scheme U and a closed subscheme R c U x U such that (I) R is an tequivalence relation, and (2) the projections Pi: R+U (i = 1,2) are etale. (These are often written as R  U --> x.) A morphism g: V --> X of an affine scheme V to an algebraic space X consists of a closed subscheme We U x V such that (1) the projection W --> V is etale and surjective, and (2) the two closed sub schemes R x V W, W x v W of U x U x V are equal. Let S V...... Y be an algebraic space. Then Hom( Y, X) is de- fined as the kernel ofHom(V,X)Hom(S, X). If Y is an affine scheme, this definition of Hom( Y, X) is equivalent to the previous defi- nition by virtue of the etaie descent. Thus algebraic spaces form a category which con- tains the category of schemes. We can define the structure sheaf of an algebraic space and construct a cohomology theory. Many impor- tant notions and theorems for schemes can be generalized to those for algebraic spaces, Every algebraic space has a dense open subset that is an affine scheme. A group algebraic space is a group scheme (J. P. Murre). Suppose k = C. If an algebraic group G operates on an algebraic k-scheme properly with a finite stabi- lizer group, the quotient space exists as an algebraic space. In this way H. Popp has shown the existence of moduli spaces of alge- braic surfaces of general type as algebraic spaces [70,7IJ (- 15 Algebraic Surfaces; also [54J). Moreover, every separated algebraic space X of finite type over C carries a natural structure x",' of an analytic space. If there exists a proper modification morphism f X",' --> Y of a separated algebraic space X onto an analytic space Y, then Y carries a structure of an algebraic space and f becomes a morphism of algebraic spaces (Artin [2J). For any alge- braically closed field k, Artin has introduced the notion of formal algebraic space and formal contraction and has obtained results similar to those for algebraic spaces [49]. An irreducible compact complex space X whose algebraic dimension ( 72 Complex Manifolds F) is equal to dim X is called a 
83 Moishezon space. As a corollary of the above theorem, any Moishezon space X carries a structure M of a compact algebraic space such that X  M'''' X. Formal Schemes Let A be a ring which we assume to be N oe- therian, for simplicity, and I an ideal of A. Taking { l"}n>o as a fundamental system of neighborhoods of 0, we can introduce a struc- ture of a topological ring into A called I-adic topology. The tcompletion of A with I-adic to- pology is isomorphic to the projective limit A = limn>oA/I" (here All" are regarded as dis- <- crete topological rings) and called the comple- tion of A along I. If A is Noetherian, then A. is again Noetherian. There is a canonical con- tinuous homomorphism i' A...... A whose kernel comprises the zero divisors a with a-I E I (intersection theorem of Krull;  284 Noeth- erian Rings B). If i is an isomorphism, we say A is complete with respect to I. The topology of A is the l-adic topology where [ = i(l)A and A is complete with respect to I Take a Noeth- erian ring A complete with respect to I which we consider as an I-adic topological ring by identifying its completion along I with A. On I= V(I)c: Spec(A) we can define a sheaf of topological rings &} by f(9(f), 0,) = limn>o Af/I" A f for 9(f)= D(f)n1 with /E A. We call (X, @}) the format spectrum of A and write Spf(A). l is called a defining ideal of Spf(A). A Qocally Noetherian) formal scheme is by definition a topological local ringed space which is locally isomorphic to a formal spec- trum (of a Noetherian ring). If we define mor- phisms between two formal schemes by those in the category of topological local ringed spaces, the formal schemes form a category. For two formal spectra Spf(A) and Spf(B) with dei'ining ideals I and J, respectively, the direct product Spf(A) x Spf(B) in the category of formal schemes is the formal spectrum of the completion of A Q9 B along the image of I Q9 B + A Q9 J. Similarly, we can construct a fiber product of formal schemes. A formal scheme 1 is called separated if the image of the diagonal morphism .1}: 1-->1 x 1 is closed ( Section D). For a Noetherian ring A with an ideal I, the formal spectrum Spf(A.) (with a defining ideal 1) is called the completion of Spec(A) along P'(I). Similarly for a Noetherian scheme X and a closed subscheme X' we can define the com- pletion XIx' of X along X'. Every completion of a separated scheme is separated. For a coher- ent sheaf F on X one can define its completion FIx' along X', which is again coherent (under the assumption that X is locally Noetherian). 16 Y A1gehraic Varieties Thus we can develop a theory of formal schemes in a way similar to that of schemes, which we call "formal geometry" (for the more general definitions and further discussions see [2; 29, I, III; 303). Roughly speaking, a function on XIX' is a formal Taylor series with respect to the direction normal to X' whose coeffi- cients are regular functions on X'. The method of formal completion enables us to introduce "analytic" or "intlnitesimal" methods in alge- braic geometry. Among many important theorems, we state here the following two theorems. (1) The fundamental theorem of proper mapping: Let f: X --> Y be a proper mor- phism of locally Noetherian schemes, Y' a closed subscheme of Y, and X' = X x y Y' the inverse image of Y'. Denote the respective completions of X and Y along X' and Y' by X and Y, respectively. We have the induced proper morphism of formal schemes J: g ...... Y. Then we have canonical isomorphisms, (Wf*(F))IY'  Rn!*(FIX')' n  0, for every coher- ent, (Dx-Module F on X. This theorem can be applied to prove Zariski's connectedness theorem: for a proper morphism f: X...... Y of locally Noetherian schemes with f*«Dx) = (Dy, every fiber f-1 (y) off is connected and non- empty for yE Y. (2) We use the same nota- tion as in (1) and assume, moreover, that Y = Spec(A) for a Noetherian ring A, complete with respect to an ideal I, and Y' = V(I). Then the correspondence F ......F 1x , gives an equiva- lence between the category of coherent (!Jx- Modules with proper support over Y and the category of coherent (l!x-Modules with proper support over Y. This theorem plays an impor- tant role in the theory of tdeformations of algebraic varieties. Y. Algebraic Vector Bundles In this section a vector bundle is a locally free sheaf of finite constant rank (- Section E). A quotient sheaf of a vector bundle is called a quotient bundle if it is a vector bundle. A sub- sheaf F of a vector bundle E is a sub bundle when both F and E/ F are vector bundles. A vector bundle is said to be indecomposable unless it is a direct sum of proper subbundles. Every vector bundle E on pi is a direct sum of line bundles, that is, E  EB (fI(a;) (Grothen- dieck). This property characterizes pi in the category of nonsingular projective varieties. In fact, if X is a nonsingular projective variety with dim X > 0 and X *- pi and if r is an in- teger with r  dim X, there are stable (see below), a fortiori, indecomposable vector bundles on X of rank r (1. Simonis and M. Maruyama). Vector bundles are closely related to sub schemes of the base variety. Let E be a 
16Z Algebraic Varieties vector bundle of rank 2 on a nonsingular quasiprojective variety X and (()( 1) an tample invertible sheaf on X. For n »0 and general sEHO(X,E(n», E(n)js(!)xL <8J I with L a line bundle and 1 an ideal in (!)x which defines a smooth subscheme Y= (s)o of codimension 2. When X is, for example, P" (n  3) or affine, the converse holds. Let Y be a sub scheme of X purely of co dimension 2 and locally of com- plete intersection. If cay  A4 <8J (!)y for an inver- tible sheaf M on X, there is a vector bundle E ofrank 2 and sE HO(X, E) with (s)o = y. Moreover, E is decomposable if and only if Y is globally a complete intersection. An Abelian surface can be embedded in P4(C), and hence we have an indecomposable vector bundle of rank 2 on P4(C) [40]. For n  5, we have no examples of indecomposable vector bundles of rank 2 on P"(C) [36]. Every vector bundle on A" = k" is trivial (D. Quillen [72J, A. Suslin), From this and the fact stated above several results can be deduced; every nonsingular curve in A 3 is set-theoretically a complete intersection (L. Szpiro). For a vector bundle E on a complete scheme, the following are equivalent: (i) for every coherent (!)x-module F, F <8J S"(E) is generated by its global sections for n» 0, (ii) for every coherent sheaf F on X, Hi(X, F <8J S"(E» = 0 for all i > 0 and n» 0, (iii) the "tauto- logical line bundle (i7( 1) on P(E) is ample. A vector bundle having these properties is said to be ample. This is a generalization of the notion of ampleness of invertible sheaves (- Section E). The set of ample vector bundles is closed under several operations [34]. No good criterion such as Nakai-Moishezon's for the ampleness of invertible sheaves ( Section E) is yet known. The tangent bundle of a nonsingular complete variety X is ample if and only if X  pn (  Section R; S. Mori [59J). Let Y be a nonsingular projective variety over an algebraically closed field and (!)y( 1) an ample invertible sheaf on Y. A coherent sheaf E on Y is said to be stable (or, semistab1e) with respect to (!)y( 1) if Eis torsion-free and iffor every coherent sub sheaf Fof Ewith 0 i=F i=E, x(F(m»jr(F)«or .s;)x(E(m))jr(E) for all n'l»O, where r(*) denotes the rank. If E is stable (or semistable) and locally free, it is called a stable (resp. semistab1e) vector bundle. Let f: X...... S be a tsmooth, projective, geometrically integral morphism and (!)x( 1) an f-ample invertible sheaf on X. For a numerical polynomial H and an S-scheme T, set L,s( T) = {E E is a coherent sheaf on X r with the properties (a) and (b)} / " ; (a) E is T - flat, (b) for every geo- metric fiber X"E,;;;;; E x, is stable with re- spect to (1)x( 1) x, and x(E,(m)) = H(m), and if 84 EI  £2 @f!(L) with L an invertible sheaf on T, EI  E 2 . Lf/s is a tcontravariant functor of the category of tIocally Noetherian S-schemes to the category of sets. If S is of tfinite type over a tuniversally Japanese ring A, 'L/s has a coarse moduli scheme M x/s(H) ( Section W) and it is tlocally of finite type over S (D. Mumford, Seshadri, D. Gieseker, Maruyama [23, 51J), The set of the classes of semis table sheaves under a suitable equivalence relation (S-equivalence) on geometric libers of X over S also has a coarse moduli scheme Mx/s(H) and M",(H) is its open sub scheme. Moreover, it is known that M",(H) is projective over S in some cases, for example, when A is a field of characteristic zero. When dim XjS = 1, X p2 or p3, the structures of M",(H) have been extensively studied [ 11,69]. Theories of vector bundles are used in theoretical physics [9]. Z. Torus Embeddings Over an algebraically closed field k, a torus embedding, or a toric variety, is a normal scheme X locally of finite type over k on which an algebraic torus T acts with a dense open orbit isomorphic to T. Such X's, as well as many of their algebrogeometric properties, can be described very simply in terms of cones in real affine spaces, as Demazure [21 J first saw in the nonsingular case in connection with algebraic subgroups of the Cremona trans- formation group, and then as D. Mumford et al. [4 7J as well as K. Miyake and T. Oda [6 7J saw in the general case immediately after H. Sumihiro [SS] proved a basic theorem on linear algebraic group actions. The group N, written additively, of One- parameter subgroups of an r-dimensional algebraic torus T is a free Abelian group of rank r. A COnvex rational polyhedral cone (J in N R = R@z N with 0 as the vertex is the set of nonnegative linear combinations of a finite number of elements of N such that (J n (-(J) = {O}. A subset'[ of (J is called a face, and is denoted by "C < (J, if there exists a linear func- tional m on N R having nonnegative values on (J and r={YE(Jlm(y)=O}, A fan  in N R is a collection of such (J's satisfying the conditions (i) 3(J, (J>'[.13,[ and (ii) .13(J, (J'O"> (J nO"' < (J'. Each (JE.1 gives rise to an affine torus em- bedding Va as follows: Let M be the Z-module dual to N; hence A4 is the group oftcharacters of T, For (JE.1, the set Mni'J= {mEMlm(y);;:'O for all Y E (J} is seen to be a IInitely generated additive subsemigroup of M containing 0 and generating M as a group. Va is then the spectrum of the semigroup algebra k[M n it]. 
85 Thanks to the condition (ii) above, the Va's can be naturally pasted together to produce a torus embedding X = UaE Va. Every torus embedding is obtained in this way. Equivariant dominant morphisms be- tween torus embeddings can be described in terms of Z-Iinear maps between N's which send a fan to another. Many algebrogeometric properties of X can be described in terms of A, e.g., X is nonsin- gular if and only if every (J E A can be spanned by a part of a Z-basis of N. X is complete (proper over k) if and only if the union of (J'S in A coincides with N,. X has at worst cyclic quotient singularities if and only if every (J E A is simplicial, i.e., if it is spanned by R-Iinearly independent elements of N. An equivariant resolution of singularities of X exists and is obtained by a suitable subdivision of A (Mum- ford et al. [4 7J). X has only rational singular- ities, hence is Cohen-Macaulay (M. Hochster [38J and Mumford et al. [47J). The set of T- orbits in X is in one-to-one correspondence with (JE, A reduced T-invariant sub scheme Y of X is the union of T-orbits, and hence corre- sponds to a subset}; of A. M.-N. Ishida [44] determined when Y is Cohen-Macaulay or Gorenstein in terms of the combinatorics of E. A T-invariant Cartier divisor D on X cor- responds to a support function h, which is a continuous R-valued function on the union UacA (J which is (i) positively homogeneous, i.e., h(y)==).h(y) for ,bO and YE UaE a, (ii) Z-valued on N n ( UOEA (J) and (iii) linear when restricted to each a E , Various prop- erties of the invertible sheaf (l)x(D) ( Section E) can be described in terms of h, For in- stance, when X is complete, CVx(D) is t am ple if and only if h is upper convex, i.e., h(y) + h(y') ,,; h(y + y'), and moreover, A is the coarest fan with the property (iii) above. The cohomology of (Dx(D) can be calculated by means of h (Demazure [21J and V. Danilov [17]). A support function h, on the other hand, gives rise to a convex polyhedron in M R with vertices in M. In this way, certain aspects of the geometry of Con vex sets can be thought of as a part of the theory of projective varieties (R. Stanley [87], B. Teissier [89]). Mumford et al. [47] introduced a more gen- eral concept of toroidal embedding: A normal algebraic variety Y and a nonsingular Zariski open subset U such that Y ::J U is formally isomorphic at each point to a torus embedding X ::J T. This concept has been used veryeffec- tively to prove important theorems: (1) Sys- tematic nice compactitlcations of arithmetic quotients of bounded symmetric domains (D. Mumford et al. [7], 1. Satake [77]). Y. Nami- kawa [65J worked out the details in the case 16 AA A1geb raie Varieties of the Siegel upper planes, improving their earlier Satake compactifications studied by I Satake, W. Baily, 1.-1. Igusa, and A. Borel [12]. (2) Semistab1e reduction theorem (Mumford et al. [47J). Let f: V -->C be a flat morphism from a nonsingular variety V to a nonsingular curve C in characteristic O. After a finite base extension C'...... C and a modification V'--> VX c C', we can get f ': V' --> C' whose singular fibers are reduced with only nonsingular com- ponents crossing normally with each other (- Section L). Without the reducedness requirement, the existence is derived from a result of Hironaka [37]. AA. Etale Topology Let /J' be a category. We say that a Gro- thendieck topology on Y is given if, for each SE Ob(S), families ofmorphisms (covering families of S) are given and satisfy the follow- ing conditions: (1) If cp: T -->S is an isomor- phism, {cp : T ......S} is a covering family, (2) if {<Pi: Ri-->S};EI is a covering family, for any morphism cp': S' -->S, the fiber product R: = R i x ,S' exists and the induced family \ c:p; : R: --> S'} is a covering family of S', (3) if { <Pi: Ri-->S};EI is a covering family and if for each iEI, {Si,a:Si,a......R;}aEA, is a covering family, then {<p?si'a:Si,a......S};EI,aEAi is a covering family. In general, it is more convenient to use a notion of sieves to define a Grothen- dieck topology [30 (SGA 4)].) A category with Grothendieck topology is called a site. Let X be a scheme and EtjX the category whose objects consists of schemes etale over X. If we choose a family of morphisms {IA: 1;--> Y}iEI, with UiEI ;( 1;) = Y, as a covering family of Y E Ob(EtjX), this defines the Male topology on X and the etale site Xii' Similarly, one can define the Zariski site X Zar (resp. the flat site X fl), using open immersions (resp. flat mor- phisms 10calIy of fimte type). A presheaf on a site Y is a contravariant functor from Y to (Set). A presheaf F is a sheaf if, for any COver- ing family {i:Ri-->R}iEI' F(R)......IlEIF(R;)::::t Il,jEIF(Ri x RR) is exact. If Y has a final ob- ject X, the functor F f-> F(X) is left exact on the category of Abelian sheaves on /J' and the cohomology groups H. (X, F) are de- fined as the right derived functors. Using covering families, one can define the tech cohomology i1 . (X, F) as usual. In the follow- ing, a sheaf means an Abelian sheaf. For a tgeometric point i: X --> X of a scheme, an etale neighborhood V of x consists of an etale mor- phism .f: U-->X and a morphismj: x--> U such that f oJ = i For a sheaf F on X ep the stalk Fx at a geometric point x is defined by Fx = 
16 Ref. Algebraic Varieties lim r( U, F), U being the etale neighborhoods ...... of x. On X"' the sheaf G m is defined by G,(S)= f(S, @s)* for each S E Ob(Etj X), The kernel of the nth power homomorphism G G is m m J denoted by fJn' The sequence l-->l1n-->G m -'> G n --> 1 is exact in X"' if p is prime to resId- ual characteristics of X, but not necessarily exact in X Zar' There are canonical isomor- phisms Pic(X)= H 1 (X Zan (Sif)'4H I (X e " G m ).:-; H 1 (Xf[, G,). For X = Spec(k), k a field, the etale cohomology theory of sheaves on X"t is equivalent to the tGalois cohomology theory over k; hence Hi(Xet> F)':-;Hi(G, Fd, where G = Gal(kjk), k the tseparable closure of k and Fj; is the stalk of F at the geometric point (8h of X. For a morphism f: X -+ Y of schemes and a sheaf F on X e" the direct image sheaf j F of F is defined by f* F(S) = F(X x yS), S E Ob(EtjY), and higher direct-image sheaves Rif* F are deti'ned by the right-derived functor. Let X be a separated scheme of l1nite type over a field k. By a theorem of Nagata [63J, there exists a scheme X proper over k and an open immersion j: X -->x. For a torsion sheaf F on X", let j,F be a sheaf on X"' ex- tended by 0 outside X. The cohomology with compact support H(X"" F) is defined by Hi(X "t, F) = Hq(X",j, F). This is inde- pendent of compactifications. Similarly, for a separated morphism f: X -->5 of l1nite type of schemes and a torsion sheaf F on X "I' one can define higher direct image sheaf with compact support R'If, F. For the etale topology, torsion sheaves are important. All torsion sheaves on X e , on a Noetherian scheme X are inductive limits of constructible sheaves. A sheaf F on X '" is called locally constructible (constant) if F is represented by an etale covering of X. A sheaf F on X e, is called constructible if there exists a lj'nite surjective family of sub scheme Xi of X such that the restriction of F to Xi is locally constructible (constant). This is equiva- lent to saying that every irreducible closed subscheme Z of X contains a nonempty open subscheme U such that the restriction F to U is locally constant and has finite stalks, i,e" there is a covering family {i/Ji: U i --> U} such that F Vi is constant and the stalk Fx is a finite Abelian group for all geometric points x of U, The cohomology of a torsion sheaf or a con- structible sheaf has properties similar to those of the classical cohomology. Let f: X...... S be a proper morphism and F a torsion sheaf on X", Then the stalk Rqf*F at a geometric point s of 5 is isomorphic to Hq(X",t> F), where X,=X x s Spec k(s). If f is a separated morphism of lj'nite type of Noetherian schemes, a similar fact holds for the cohomology with compact support. Moreover, if F is a con- structible sheaf, Rqj.F is also constructible (finiteness theorem). For an affine scheme X of 86 finite type over a separably closed field and F a torsion sheaf on Xel' one has Hq(X"" F) =0 for q > dim X. Let f: X...... S be a separated morphism of schemes of l1nite type on C, Fa torsion sheaf on X e " One has a canonical isomorphism (Rqf,F)an.:-;RqfnFan. In partic- ular, if X is proper over C, one has Hq(X "" Zj(n».:-;Hq(x an , Z/(n». On the other hand, for a nonsingular complete curve C over an algebraically closed field, one always has H 1 (C", Z) = O. For a geometric point s of S the strict localization of S at s is the ring (9s.s = l f( U, (9v), U being etale neighbor- hoods of s. A geometric point t of S is called a generalization of s if tis deti'ned by an alge- braic closure of the residue field of a point of Spec (9s,s' In this case, S is called a special- ization of t. If f: X -+ S is a proper smooth morphism the sheaf Rqf*Z/(n) is locally con- structible (constant), and ifs is a specializa- tion of t one can define a cospecialization of Hq(X"", Z/(n» to Hq(Xs,et> 'li(n» which is bijective. Let X be a scheme of finite type over a field k. If 1 #ch(k) is prime, one can define the /-adic cohomology Hq(X, Ql) = limn Hq(X, Zj (1")) Q9Z,Ql' The /-adic cohomology has as many good properties as the classical co- homology ( Section U). References [1 J A. Altman and S. L. Kleiman, Introduc- tion to Grothendieck duality theory, Lecture notes in math. 146, Springer, 1970. [2J M. Artin, Algebraization of formal moduli; I, Global analysis, Univ. Tokyo Press and Princeton Univ. Press, 1969, 21-71; II, Exis- tence of Modifications, Ann. Math., (2) 91 (1970), 88-135. [3J M. Artin, Algebraic approximation of structures Over complete local rings, Pub!. Math. Inst. HES, 36 (1969),23358. [4J M. Artin, Theoremes de representabilite pour les espaces algebriques, Les Presses de l'Universite de Montreal, 44, 1973. [5J M. Artin and B. Mazur, Etale homotopy, Lecture notes in math. 100, Springer, 1969. [6J M. Artin and D. Mumford, Some elemen- tary examples of unirational varieties which are not rational, Proc, London Math. Soc., (3) 25 (1972),75595. [7J A. Ash, D. Mumford, M. Rapoport, and Y. Tai, Smooth compactitlcation of locally sym- metric varieties, Math.-Sci. Press, 1975. [8J M. F. Atiyah, Vector bundles over an elliptic curve, Proc. London Math. Soc., (3) 7 (1957),414-452. [9J M. F. Atiyah and R. S. Ward, Instantons and algebraic geometry, Comm. Math. Phys., 55 (1977),117-124. 
87 [ 10J P. Berthelot, Cohomologie cristalline des schemas de caracteristique p > 0, Lecture notes in math. 407, Springer, 1974. [ IIJ W. Barth, Some properties of stable rank-2 vector bundles on Pn' Math. Ann., 226 (1977), 1255150. [ 12J W. L. Baily and A. Borel, Compactifica- tion of arithmetic quotients of bounded sym- metric domains, Ann. Math., (2) 84 (1966), 442-528. [13] S. Bloch, D. Lieberman, and A. Kas, Zero cycles on surfaces with Pg = 0, Compositio Math., 33 (1976), 135145. [ 143 E. Brieskorn, Rationale Singularitaten Komplexer Fliichen, Inventiones Math., 4 (1968), 336-358. [ 153 Seminaire c. Cheval ley, Anneaux de Chow et applications, Ecole Norm. Sup., 1958. [ 16J C. H. Clemens and P. A. Griffiths, The intermediate Jacobian of the cubic threefold, Ann. Math., (2) 95 (1972),281356, [ 17J V. 1. Danilov, The geometry of toric varieties, Russian Math. Surveys, 33 : 2, (1978), 977154. [ 1 8J P. Deligne, Theorie de Hodge: I, Actes Congr. Intern. Math., 1970, Nice, Gauthier- Villars, 1971, vol. I, 425430; II, Pub!. Math. Inst. HES, 40 (1971), 557; III, 44 (1974), 5577. [19J P. Deligne, La conjecture de Weil L Pub!. Math. Inst. HES, 43 (1943),273307, [20J P. Deligne, Le groupe fondamental du complement d'une courbe plane n'ayant que des points doubles ordinaires est abelien (d'apres W. Fulton), Seminaire Bourbaki, expo 543, Nov. 1979, Lecture notes in math. 842, Springer, 1980. [21 J M. Demazure, Sous-groupes algebriques de rang maximum du groupe de Cremona, Ann. Sci. Ecole Norm. Sup"3 (1970),507-588. [22J W. Fulton and J. Hansen, A connected- ness theorem for projective varieties, with applications to intersections and singularities of mappings, Ann. Math., (2) 110 (1979),159- 166. [23J D. Gieseker, On the moduli of vector bundles on an algebraic surface, Ann. Math., (2) 106 (1977), 4560, [24J P. A. Griffiths, Periods of integrals on algebraic manifolds; summary of main results and discussion of open problems, Bull. Amer. Math. Soc., 76 (1970),228-296. [25J P. A. Griftlths, On the periods of certain rational integrals I, II, Ann. Math., (2) 90 (1969), 460495, 496541. [26J P. A. Griffiths, Periods of integrals on algebraic manifolds III, Pub!. Math. Inst. RES, 38 (1970),1255180. [27J P. A. Griffiths and W. Schmid, Recent developments in Hodge theory, Discrete Sub- groups of Lie Groups and Application to Moduli, Oxford Univ. Press, 1975, 3 1-127. 16 Ref. Algebraic Varieties [28J P. A. Griftlths and J. Harris, Principles of algebraic geometry, Wiley, 1978. [29J A. Grothendieck, Elements de geometrie algebriques, Pub!. Math. Inst. HES: 1,4 (1960) (Springer, 1971); 11,8 (1961); III, 11 (1961),17 (1963); IV, 20 (1964), 24 (1965), 28 (1966), 32 (1967). [30J Cited as SGA. A. Grothendieck and others, Seminaire de geometrie algebrique: SGA 1, Revetements etales et groupe fonda- mental, Lecture notes in math. 224, Springer, 1971; SGA 2, Cohomologie locale des fais- ceaux et theoremes de Lefschetz locaux et globaux, North-Holland, 1962; SGA 3 (with M. Demazure), Schemas en groupes I, II, III, Lecture notes in math. 15 1,152, 153, Springer, 1970; SGA 4 (with M. Artin and J. L. Verdier), Theorie des topos et cohomologie etale des schemas, Lecture notes in math. 269, 270, 305, Springer, 1972-1973; SGA 5, Cohomologie 1- adique et fonctions L; SGA 6, Theorie des intersections et theoreme de Riemann-Roch, Lecture notes in math. 225, Springer, 1971; SGA 7, Groupes de monodromie en geome- trie algebrique, pt. I, Lecture notes in math. 288; pt. II (by P. Deligne and N. Katz), 340, Springer, 197221973. [31J B. Griinbaum, Polytopes, graphs and complexes, Bull. Amer. Math. Soc., 76 (1970), 1131  1201. [32J W. J. Haboush, Reductive groups are geometrically reductive, Ann. Math., (2) 102 (1975),67783. [33J R. Hartshorne, Residues and duality, Lecture notes in math. 20: Springer, 1966. [34J R. Hartshorne, Ample subvarieties of algebraic varieties, Lecture notes in math. 156, Springer, 1970. [35J R. Harshorne, Algebraic geometry, Graduate texts in math. 52, Springer, 1977. [36J R. Hartshorne, Algebraic vector bundles on projective spaces: A problem list, Topology, 18 (1979), 117-128. [37J H. Hironaka, Resolution of singularities of an algebraic variety over a field of charac- teristic zero, Ann. Math., (2) 79 (1964), 109- 326. [38J M. Hochster, Rings of invariants of tori, Cohen-Macaulay rings generated by mono- mials and polytopes, Ann. Math., (2) 96, (1972), 3188337. [39J W. V. D. Hodge and M. F. Atiyah, In- tegrals of the second kind on an algebraic variety, Ann. Math., (2) 62 (1955),56-91. [40J G. Horrocks and D. Mumford, A rank-2 vector bundle on p4 with 15,000 symmetries, Topology, 12 (1973), 63-81. [41J J. Igusa, Arithmetic variety of moduli for genus two, Ann. Math., (2) 72 (1960),612- 649. [42J J. Igusa, Betti and Picard numbers of 
16 Ref 88 Algebraic Varieties abstract algebraic surfaces, Proc. Nat. Acad. Sci. US, 46 (1960),7244726. [43J H. Inose and M. Mizukami, Rational equivalence of a-cycles on some surfaces of general type with Pg = 0, Math. Ann., 244 (1979),205-217. [44J M.-N. Ishida, Torus embeddings and dualizing complexes, T6hoku Math. J., 32, (1980),111-146. [45J V. A. Iskovski'i and Yu. 1. Manin, Three- dimensional quartics and counterexamples to the Lliroth problem, Math. USSR-Sb., 15 (1971), 141-166. (Original in Russian, 1971.) [46] S. Lojasiewicz, Triangulation of semiana- lytic sets, Ann. Scuola Norm. Sup. Pisa, 18 (1964),449-474. [47] G. Kempf, F. Knudsen, D. Mumford, and B. Saint-Donat, Toroidal embeddings I, Lec- ture notes in math. 339, Springer, 1973. [48J S. L. Kleiman, Toward a numerical theory of ampleness, Ann. Math., 84 (1966), 293-344. [49J D. Knutson, Algebraic spaces, Lecture notes in math. 203, Springer, 197 1. [SO] K. Kodaira, Collected works I-III, Iwanami, 1975. [51J M. Maruyama, Moduli of stable sheaves, I, II, J. Math. Kyoto Univ., 17 (1977),91-126 and 18 (1978), 5577614. [52] T. Matsusaka, Theorey of Q-varieties, Pub\. Math. Soc. Japan, no. 8, Tokyo, 1965. [53J T. Matsusaka, Polarized varieties with a given Hilbert polynomial, Amer. J. Math., 94 (1972),102771077. [54] T. Matsusaka and D. Mumford, Two fundamental theorems on deformations of polarized varieties, Amer. J. Math., 86 (1964), 668-684. [55] M. Miyanishi, On curves on rational and unirational surfaces, Tata lecture notes 60, Springer, 1978. [56J B. G. Moishezon, A criterion for proJec- tivity of complete algebraic abstract varieties, Amer. Math. Soc. Transl., 63 (1967),1-50, [57J B. G. Moishezon, Complex surfaces and connected sums of complex projective planes, Lecture notes in math. 603, Springer, 1977. [SS] S. Mori, On a generalization of complete intersections, J. Math. Kyoto Univ., 15 (1975), 619-646. [59J S. Mori, Projective manifolds with ample tangent bundles, Ann. Math., (2) 110 (1979), 593-606. [60J D. Mumford, Geometric invariant theory, Springer, 1965; second edition, 1982. [61J D. Mumford, Pathologies of modular algebraic surfaces, Amer. J. Math., 83 (1961), 3399342; Further pathologies in algebraic geometry, ibid., 84 (1962), 642-648; Pathol- ogies III, ibid., 89 (1967), 944104. [62J D. Mumford, Stability of projective varieties, Enseignement Math., 23 (1977), 39- 110. [63] M. Nagata, Imbedding of an abstract variety in a complete variety, J. Math. Kyoto Univ.,2 (1962),1-10. [64J Y. Nakai, A criterion of an ample sheaf on a projective scheme, Amer. J. Math., 85 (1963), 14-26. [65J Y. Namikawa, A new compactillcation of the Siegel space and the degeneration of Abelian varieties, I, II, Math. Ann., 221 (1976), 97-141 and 201-241. [66] A. Neron, Problemes arithmetiques et geometriques attaches a la notion du rang d'une courbe algebrique, Bull. Soc. Math. France, 80 (1952),101-166. [67] T. ada, On torus embeddings and appli- cations (based on joint work with K. Miyake), Tata lecture notes 57, Springer, 1978. [68J T. Oda and C. S. Seshadri, Compactifi- cations of the generalized Jacobian variety, Trans. Amer. Math. Soc., 253 (1979),1-90, [69] C. Okonek, M. Schneider, and H. Spind- ler, Vector bundles on complex projective spaces, Progress in math. 3, Birkhiiuser, 1980. [70] H. Popp, On moduli of algebraic varieties II, Compositio Math. 28 (1974), 51-81; III, Fine moduli spaces, ibid., 31 (1975),237-258. [71] H. Popp, Moduli theory and classifica- tion theory of algebraic varieties, Lecture notes in math. 620, Springer, 1977. [72] D. Quillen, Projective modules over polynomial rings, Inventiones Math., 36 (1976),167-171. [73J M. A. Reid, Canonical 3-folds, Journees de Geometrie Algebrique, July 1979, 273-310, Sijthoff and Noordhoff, 1980. [74] A. A. Rojtman, Rational equivalence of zero cycles, Math. USSR-Sb., 18 (1972),571- 588. (Original in Russian, 1972.) [75J A. A. Rojtman, The torsion of the group of a-cycles modulo rational equivalence, Ann. Math., (2) 111 (1980),553-569. [76J L. Roth, Algebraic threefolds, Springer, 1955. [77] I. Satake, On the arithmetic of tube domains, Bull. Amer. Math. Soc., 79 (1973), 1076- 1094. [78J M. Schlessinger, Functors of Artin rings, Trans. Amer. Math. Soc., 130 (1968),208-222. [79] C. S. Seshadri, Quotient spaces modulo reductive algebraic groups, Ann. Math., (2) 95 (1972),511-556, [SO] J.-P. Serre, Un theoreme de dualite, Comm. Math. Helv., 29 (1955),9-26. [81J J.-P. Serre, Faisceaux algebriques co her- ents, Ann. Math., (2) 61 (1955),197-278. [82] J.-P. Serre, Geometrie algebrique et geo- metrie analytique, Ann. Inst. Fourier, 6 (1956), 1-42. [83J J.-P. Serre, Exemple de variete projectives 
89 conjuguees non homeomorphes, C. R. Acad. Sci. Paris, 258 (1964),419444196. [84J I. R. Shafarevich, Basic algebraic geome- try, Springer, 1974. [SS] G. Shimura, Moduli and fibre systems of Abelian varieties, Ann. Math., (2) 83 (1966), 294-338. [86J G. Shimura, Algebraic varieties without deformation and the Chow variety, J. Math. Soc. Japan, 20 (1968),336-341. [87J R. Stanlay, The number of faces of a simplicial convex polytope, Advances in math., 35 (1980),2366238. [88] H. Sumihiro, Equivariant completion I, J. Math. Kyoto Univ. 14 (1974),1-28; II, 15 (1975),573-605, [89] B. Teissier, Varietes toriques et polytopes, Sem. Bourbaki, 1980/81, expo 565, Lecture notes in math. 901, Springer, 1981. [90J K. Ueno, Classification theory of alge- braic varieties and compact complex spaces, Lecture notes in math. 439, Springer, 1975. [91J H. Umemura, Sur les sous-groupes alge- briques primitifs du groupe de Cremona a trois variables, Nagoya Math. J., 79 (1980), 47767, [92J A. Weil, Foundations of algebraic geome- try, Amer. Math. Soc. Colloq. Pub\., second edition, 1962. [93] O. Zariski, Collected papers, I, II, III, N, MIT Press, 1972, 1973, 1978, 1979, 17 (X1.11) Aigebroidal Functions A. General Remarks If an analytic function I satisfies an tirreduc- ible algebraic equation Ao(z)fk+Adz)fk-1 +... +Ak(z)=O (I) with single-valued tmeromorphic functions Aiz) in a domain G in the complex z-plane, then I is called a k-valued algebroidal function in G. With no loss of generality, we can as- sume that there is no commOn zero among the Aiz) and that all the Aj(z) are tholomorphic in G. When k = 1, the solution of (1) is a single- valued meromorphic function in G. If all the Aiz) are polynomials, then f is an talgebraic function, Thus algebroidal functions can be regarded as extensions of single-valued to multiple-valued functions and also as exten- sions of algebraic to ttranscendental functions. Since (1) is irreducible, its discriminant D(z) does not vanish identically. For all the points a satisfying D(a)#O, Ao(a)#O, aEG, equation (1) determines k holomorphic function ele- 17 B Algebroidal Functions ments II (z), Ik(Z) in a suitable neighborhood of a that determine the analytic function .r They can be tprolonged analytically in G in the wider sense. At any point satisfying A,(z) = 0, at least One element has a pole; and at any point satisfying D(z) = 0, there may appear ramified elements. Therefore an algebroidal function can be defined as a finitely multiple- valued analytic function in G with the excep- tion of poles and taigebraic branch points. Every algebroidal function f(z) determines a tRiemann surface, which may be considered a tcovering surface 3 of G. This surface 3 is a k- sheeted tcovering surface over G with no sin- gular point except for algebraic branch points. Also, f reduces to a single-valued meromor- phic function on 3, and all the function ele- ments over a point z are different. A k-valued algebroidal function can also be characterized by this property, These two (equivalent) de\i'nitions of alge- broidal functions give rise to two distinct methods of studying these functions, In adopt- ing the first definition, we can make use of results in the single-valued case, as did G. Remoundos and G. Valiron. When the lat- ter definition is adopted, we can use several methods that are also applicable in the single- valued case, as did H. Selberg and E. Ullrich, Research on algebroidal functions has been carned out mainly for the case where G is the finite plane Izi < 00 or the unit disk Izl < 1. Almost all the known results for algebroidal functions are extensions of those on single- valued functions, but several results particu- larly relevant to algebroidal functions have been discovered. The existence of branch points makes it difficult to investigate alge- broidal functions in some cases. B. Absolute Value Among several results on the absolute values of algebroidal functIons, the tmaximum prin- ciple, one of the basIc principles, holds on the Riemann surface :l The following relation holds among the Aj(z) and 1/,(z)l, Assume that (1) has the form fk + Al (Z)fk-I +.., + Ak(z)=O, (1') Then log( 1 + A(z))/log( 1 + F(z» is bounded, where A(z)=maxIAAz)1 and F(z)=maxl/v(z)l, An algebroidal function that has no pole in z I < 00 is called an entire algebroidal func- tion. The successive derivatives of an entire algebroidal function may have poles at every branch point, which is a departure from the case of single-valued integral functions. An algebroidal function defined by (1) or (1') with entire functions Aj(z) and zero-free ,4,,(z) is 
17C Algebroidal Functions entire. Dividing both sides of (1) by Ao(z), we obtain equation (1'), and all its coefficients are entire. In equation (1'), the torder, type, and class of an entire algebroidal function coincide with those of the largest Aj(z) ( 429 Tran- scendental Entire Functions). For an entire algebroidal function of order less than 1/2, F(z) tends to infinity uniformly along a sequence of concentric circles Izl =r n (r n --> 00). However, it can be shown that not all the branches of f(z) necessarily tend to in- finity on the Riemann surface 3; in this sense, tWiman's theorem does not remain true. But, using minlzl=,maxvlfv(z)l, one can obtain some extensions of the generalized Wiman theorem. C. Picard's Theorem and Its Extension Remoundos first extended tPicard's theorem and tBorel's theorem to an algebroidal func- tion. Every k-valued transcendental alge- broidal function in the finite plane takes on every value infinitely often with at most 2k exceptional values. There are examples where 2k values are actually omitted. Hence the theorem is the best one possible in this sense. T. Varopoulos (Bull. Soc. Math. France, 53 (1925),23-34) introduced the degeneracy index A=dim{(c o , c i ,..., Ck)E Ck+ilcoAo +C i Ai +... +CkAk=O} (O.s;A.s; k-l) into equation (1) and showed that the number of Picard's exceptional values of f is at most k + ..1. + 1. There is no single- valued meromorphic function with ..1. > 0, so this result is relevant only for algebroidal functions. Extending this idea further, J. Du- fresnoy and others obtained more precise results. The situation is the same for the Borel exceptional values. That is, the tconvergence exponent of f(z)-w =0 coincides with the tord.er of f except for at most 2k values, for which the convergence exponents of fare less than the order of f (E. Borel). There are at most 2k polynomials P(z) for which f(z)- P(z)=O has at most a finite number of roots (Borel), and furthermore, using the degeneracy index, we can give more precise results. Selberg was the first to extend the tNevan- linna theory of meromorphic functions to algebroidal functions ( 272 Meromorphic Functions). Almost simultaneously, Valiron obtained the same results starting from the coefficients of (1); then Ullrich improved the results by considering the effect of branch points of 3. Let 3, be the part of 3 over Izl <r, let n(r, w) be the number of roots of f(z) - w = 0 in 3" and let n(r, 3) be the number of branch points 90 of 3,. With these notations we write 1 i ' dt n(O, w) N(r, w)=- (n(t, w)-n(O, w»-+- k logr, k 0 t m(r, w) 1 i + 1 =- log , d<p, 2kn Izl=' If(re"")-wl T(r, w)=m(r, w)+N(r, w). Let T(rJ) be the tlogarithmic integral of the spherical area of the image of 3, under w = f(z), w#oo, 1 i 'dt If 1f'(tei",W T r =- - tdtd (,I) k 0 t 3,(1 +If(te i '''W)2 <p, and let N(r, 3) be the logarithmic integral of n(r, 3). Then we have T(r, w)= T(rJ)+O(I) and the ramification theorem: N(r, 3)«2k-2)T(rJ)+ 0(1). Also, T(rJ) = O(logr) holds if and only if f is algebraic. Let A(z) be the maximum of I Aj(z)l, and let 1 (2" J1(r) = 2kn Jo 10gA(rei"')d<p. Then T(rJ)== J1(r) + 0(1). As the second funda- mental theorem we have q L N(r, w v » (q-2)T(rJ)-N(r, 3) \1=1 q + L Ni(r, wv)+O(logrT) v=1 > (q-2k)T(rJ) q + L Ni(r, wv)+O(logrT), ,,=1 where N i (r, w) is the logarithmic integral of n i (r, w) which is the sum of the tmultiplicity minus one of all the roots of f(x)-w=O in 3,. Furthermore, the deficiency, ramification index of f(z), and ramification index of the surface 3 are defined by Ci(w) = l-limsupN(r, w)/T(rJ) == lim infm(r, w)/T(rJ), :D(w) = liminfN i (r, w)/T(rJ),  ==liminfN(r,3)/T(rJ). With these notations, we have LCi(w v )+ L :D(wv).s;2+ .s;2k. These results contain the Picard theorem and the Borel theorem. Furthermore, by con- sidering the effect of branch points, the Ahlfors theory of covering surfaces can be extended to algebroidal functions (Y. Tumura). By using this result, tBloch's theorem can be obtained very simply. 
91 But these results do not contain the Varo- poulos result. Taking into consideration the degeneracy index A, H. Cartan proved the following inequality when ). == 0: q (q - k -I) T(r, f) < L Nd r , w v ) + S(r), v=l and for general }, he conjectured that the number q-k - 1 may be changed to the num- ber q - k - A-I. This conjecture is still un- solved except for some special cases where for example, q = k + ,; + 2 (N. Toda, Nagoya Math. 1.,91 (1983),37-47), Interesting results follow upon a study of this degeneracy index. For example, when f is an entire algebroidal function, if L;l 6(w v ) > 2k 2 (w, * 00), there are at least k- 1 tPicard's exceptional values in {w v } (Niino, Ozawa, and Toda). With respect to the relation between the number of exceptional values and the order of fin Izi < 00, there are some results similar to those for single-valued functions. For example, if f has k + 1 Picards exceptional values, the order off must be a positive integer or 00. D. Asymptotic Values and Other Results In the single-valued case, VaJiron, L. Ahlfors, W. H. J. Fuchs, A. Edrei, W. K. Hayman, and others studied the tBorel direction, the number of tasymptotic values, the relation between the deficiency values and the asymptotic values, etc. However, almost none of the correspond- ing results holds for the algebroidal case as shown by several counterexamples. There is no relationship between the order of an entire algebroidal function and the num- ber of its finite tasymptotlc values, which is quite different from the single-valued case. Furthermore, it is possible to have an infinite number of asymptotic values even if the order is equal to zero. If an algebroidal function f satisfies Jim infT(r, f)/(log r)2 < + 00, then it has at most k asymptotic values (Val iron- Tumura) The Ahlfors theorem, which is con- cerned with the number of tdirect transcen- dental singular points of the inverse function and the order of a meromorphic function in the single-valued case, was extended to the algebroidal function by Lii Yinian (Scientia Sinica, 23 (1980». The tJulia direction or the Borel direction for an algebroidal function is defined not on 3 but on I zl < cc because of the appearance of branch points. With respect to the Julia direc- tion, there are results similar to those for the single-valued case, but it is unknown in gen- eral whether the Borel direction exists. A. Rauch proved that if Joc T(rJ)/rp+ldr= cP, 18B Almost Periodic Functions then there is a sector with an angle of at least n/ p in which L(q»== Joo log+ F(rei'P)Jr P + 1 dr diverges. Apart from the theory of distribution of values, Selberg obtained some conditions under which the inverse functions of tAbelian integrals of a special kind reduce to alge- broidal functions. References [1 J G. Remoundos, Extensions aux fonctions algebro'ides multi formes du theoreme de M. Picard et de ses generalisations, Gauthier- Villars, 1927. [2J H. Selberg, Algebroide Funktionen und Umkehrfunktionen Abelscher Integrale, Avhandl. Norske Videnskaps-Akad., Oslo, 8 (1934), 1-72. [3J N. Toda, Surrounding algebroidal func- tions (in Japanese), Bull. Math. Soc. Japan, 24 (1972),2006209. [4J H. Cartan, Sur les zeros des combinaisons Jineaires de p fonctions holomorphes donnees, Mathematica, 7 (1933), 5-31. 18 (X.25) Almost Periodic Functions A. History The theory of almost periodic functions was originated by H. Bohr in 1924 as a result of his study of tDirichlet series, The theory provides a method of studying a wide class of trig- onometric series ( 159 Fourier Series) of general type. Further generalizations were made by N. Wiener, V. V. Stepanov, A. S. Besikovich, S. Bochner, and others. H. Weyl, J. von Neumann, and others clarified the rela- tions between this theory and trepresentation of groups, specifically, the relations between almost periodic functions in a ttopological group and representation theory of a tcompact group. B. Almost Periodic Functions in the Sense of Bohr Let f(x) be a complex-valued continuous function defined for all real values of x. A number r is called a translation numher off(x) belonging to e > 0 if sup If(x+c)- f(x)I.s;£. - oo<x<cr; If for any E> 0 there exists a number 1(£) > 0 
18 c Almost Periodic Functions such that any interval of length 1(0) contains a translation number off belonging to E, then f(x) is called almost periodic in the sense of Bohr. We denote by B the set of all almost periodic functions in the sense of Bohr. If f(x) is tperiodic with a period p, then f E B, because each number 1  p plays the role of 1(0) for any B > o. Any f E B is bounded and uniformly continuous. A necessary and suffi- cient condition for a bounded continuous function on (- 00, 00) to belong to B is that for any given sequence {h n } of real numbers, there exist a subsequence {hnJ such that the se- quence of functions {f(x + h )} is uniformly convergent in (-00, 00); i.e., ;he set {f(x+ h) hE( -00, oo)} is ttotally bounded with re- spect to the uniform norm II f II == sup f(x)1 in the space of bounded continuous functions in (-oo,co). If fix) E B, then f( - x), fix), exf (x) (where (J, is a complex number), and f(x + h) (where h is a real number)EB. If f(x), g(x)EB, then f(x):tg(x) and f(X)g(X)EB. If fn(X)EB and {fn(x)} converges uniformly to f(x), then f(x) EB. For any real number A, expiAx (where i is the timaginary unit) is continuous and periodic. Hence the polynomial function L;llXn exp iAnx E B. Moreover, if the latter function converges uniformly to L exnexp iAnX as m tends to ro, then the limit function is also an element of B. The polynomial L; I !X n exp iJenx and the series LI !X n exp iA.nX are called a generalized trigonometric poly- nomial and a generalized trigonometric series, respectively. For any f EB, its mean exists: 1 i a + r M[fJ = lim - T +'(x) dx. T-+oo I a The convergence of the right-hand formula is uniform in aE( -00, 00), and the limit is independent of the choice of a. Thus M [fJ is a tlinear functional defined on B. Since M[expih] = 1 for ..1. =0 and =0 for A#O, the family {exp i},x -00 < A < oo} is an tortho- normal system with respect to the tinner prod- uct (1, g) = M[f(x)g(x)] defined on B. Let ex(A) = M [t(x)exp( ih)] forany fE B; then there exist countably many values of ..1. for which ex(A) differs from zero. Denote these values of A by A" A2' . and write ex(An) = Cl. n . We call the numbers !Xl' ex 2 " !X n , the Fourier coefficients of f(x). The formal series Ll !X n exp iAnx is called the Fourier series of f(x). Moreover, the Parseva1 equality M[lf(xWJ =L11!XnI2 is valid for any fEB. For every periodic function, these definitions coincide with the ordinary Fourier coefficients and the Fourier series ( 159 Fourier Series). Any almost periodic function in the sense of Bohr is uniquely determined by its Fourier 92 coefficients; i.e., if two almost periodic func- tions have the same Fourier series, then they are identical. For any f E B, its Fourier series does not always converge uniformly, but f(x) E B can be approximated uniformly by a sequence of trigonometric polynomials. Hence the almost periodic functions in the sense of Bohr are also called uniformly almost periodic functions. C. Generalizations of Almost Periodic Functions Let C( -00,00) be the space (- 168 Func- tion Spaces) of all bounded continuous func- tions on (-co, 00) with distance p(.f, g) = SuP-oo<x<oo If(x)-g(x)l. Then a uniformly almost periodic function is the limit of a se- quence of trigonometric polynomials with respect to this distance. Generally, let p be a tdistance function introduced in a function space (whose elements are not necessarily continuous in ( - 00, 00)). Then the limit of a sequence of generalized trigonometric poly- nomials with respect to the distance p is called an almost periodic function with respect to p. For example, for p  1, we set {f a+l } liP Dsp[j, 81 = _:P<Q a If(x)-g(x)IPdx I Dwp[{, 81 = { I f a+/ } 1!P lim sup - If(x)-g(x)IPdx l->oo-oo<a<oo I  These are distance functions. The properties of the corresponding almost periodic functions and their relations to other classes of almost periodic functions have been studied by Besi- kovich [ I]. D. Analytic Almost Periodic Functions Let D be a strip domain, a < Re z < b, defined in a complex plane. For any tholomorphic function f(z) in D and <. > 0, a real number T is called a translation number off belonging to 8>0 if SUpzEDlf(z + iT)- f(z)1 o Iffor any 0 > o there exists a number 1(0) > 0 such that any interval of length 1(<.) contains a translation number off belonging to 8, then f(x) is called an analytic almost periodic function in D We denote the set of all analytic almost periodic functions in D = fa < Rez < b} by A(a, b). If we fix an x in a < x < b, then g(y) = f (x + iy) for any f(z)EA(a, b) belongs to B. For any fEA(a, b) there corresponds a tDirichlet series Ll !X n exp AnZ such that two analytic almost periodic functions are identi- cal if the corresponding Dirichlet series are 
93 identical. Here the coefficients CJ. n = My [f(x + iy) exp( - ii, y)] are determined independently of x (a <x <b), and Parseval's identity 00 My[lf(x+iyWJ = L IIXnl2exp2Anx n:::-l holds (Bohr [3J). If the series I CJ. n exp AnX exp iAnY at x = a and x = b represent the Fourier series of fa(y) and fb(Y) E B, respectively, then there exists f E A(a, b) such that f(z) is continuous on i5 and f(a+iy)== f(y), f(b+iy)= fb(Y)' The behavior of fEA(a, b) at the boundary or exterior points of the domain D = {a < Re z < b} has also been investigated by Besikovich [1 ]. E. Almost Periodic Functions on Groups Yon Neumann defined almost periodic func- tions on any group, generalizing the charac- terization of uniformly almost periodic fUnc- tions on (-00, oc:). Let B(G) be the set of all complex-valued bounded functions on a group G. Then B(G) is a metric space with the tdis- lance p(j,g)=suPxEGlf(x)-g(x)l. If for any fEB(G) the set A,= {fa,b(x)=f(axb)la,bEG} is totally bounded in the metric space B(G), we call f an almost periodic function on the group G. This condition is equivalent to the total boundedness of BJ = {!a(x) = f(xa) a E G} or C J = {'f(x)== f(ax)laEG}. We denote the set of almost periodic functions on G by d( G), For f(x), g(x)u1(G), the linear combina- tions a-f(x!+ b'g(x) (a, bEC) and the prod- uct f(x)g(x) are both contained in d(G). If 1. E d(G) and { in} converges to .f uniformly on G, then f ESi(G). If fE,w'(G), then fa,bJa,afE d(G) also, Hence d(G) is a closed subalgebra invariant under two-sided translation in the tBanach algebra i?(G). For any fEs1(G) there exists only one number M[.fJ in the closure (with respect to the distance p in B(G» of Af={ cJ(aixbJlci>O,LC i = 1, ai, biEG} (= the least closed tconvex set including A,). We call M[fJ the mean off on G. The map- ping f -->M[.fJ is a linear functional on s1(G), and we have M [fJ  0 if f O. F. Relation to Bounded Representation Suppose that we are given a finite-dimensional matrix representation D(x) = (d,(x» of a group 18 H Almost Periodic Functions G. Then the following three conditions are equivalent: (i) All the diix) are bounded on G. (ii) All the d,(x) are almost periodic in G. (iii) The representation D is tequivalent to a repre- sentation by unitary matrices. The inner prod- uct (1, g) = M[ f(x)g(x)J provides the algebra s1(G) with the structure of a tpre-Hilbert space. Let H(G) be the Hilbert space that is the completion of ,w'( G). If we select D"(x) = (di)(x)) from each L, where L is an equIValence class of bounded irreducible representations of G, and if n A is the order of D A , then {(l/)dj(x) 11  i, j  n A' A E L} is a tcomplete orthonormal sys- tem in the Hilbert space H(G). Any f(x) E s1(G) can be approximated uniformly in G by a finite linear combination of the dij(x). G. Almost Periodic Functions on Topological Groups When G is a tseparated topological group, we denote the set of all continuous functions on G Contained in d(G) by d *(G). The statements of the theorems in the previous section con- c=ing d(G) and the representation D remain valid if we replace s1(G) by d,(G) and replace D by a continuous representation of G. In particular, if G is the additive group of real numbers R, then d,(R) is exactly B. H. Relation to Compact Groups Every continuous function on a compact group G is almost periodic; i.e" s1 *(G) = C(G). The mean value M [fJ of fEd *( G) is identical to SGf(x)dx, where the tHaar measure dx is normalized so that S G dx = 1. In this case, the theory of bounded representations discussed above is the Peter-Weyl theory (- 69 Com- pact Groups). In general, let G be a separated topological group. There exists a continuous homomor- phism cp of G onto a compact group K = K(G) with the following two properties: (i) For any compact group K' and a continuous homo- morphism <p': G--> K', there exists a continuous homomorphism IjJ : K...... K' such that cp' == IjJ 0 <p, (ii) Such a pair K ==(K, cp) is unique up to isomorphism. K is called the Bohr compactifi- cation of G, and cp is called the canonical map- ping. In particular, suppose that G is a locally compact Abelian group and G* is its tcharac- ter group. We denote by G' the group G* with discrete topology. Let K be the character group of G', and let cp* be the identity map- ping G' ......G* and cp be its conjugate mapping G-->K, which is a continuous homomorphism. Then K is the Bohr compactification of G with the canonical mapping <p, A necessary and sufficient condition for f on G to be con- 
181 Almost Periodic Functions tinuous almost periodic is that a continuous function f' on K such that f = f '0 rp must exist. If this condition is satisfied, then the mean M [f] is identical to SKf'(X) dx. For any finite-dimensional continuous unitary representation D' of K, D = D'o qJ is a finite- dimensional continuous unitary represen- tation of G, and vice versa. Hence there exists a canonical isomorphism (determined by D = D'o <p) between the equivalence classes of finite-dimensional unitary representations of a separated topological group G and the equivalence classes of finite-dimensional uni- tary representations of its Bohr compactifica- tion K The tkernel of the canonical mapping rp: G --> K is identical to the intersection of all the kernels of finite-dimensional continuous unitary representations of G. 1. Maximally Almost Periodic Croups Let G be a topological group. If for each pair a, b of distinct elements of G there exists a continuous almost periodic function f on G such that f(a) #f(b), then G is called a maxi- mally almost periodic group. This is the case if and only if G has suf11ciently many finite- dimensional unitary representations. For a connected locally compact group G, the fol- lowing six conditions are equivalent: (1) G is a maximally almost periodic group. (2) There is a one-to-one continuous homomorphism from G into a compact group. (3) G is the direct product of a compact group and a vector group R". (4) G is the tprojective limit of +Lie groups that are locally isomorphic to compact groups. (5) The quotient group G/Z is com- pact, where Z is the center of G. (6) The system of all neighborhoods that are invariant under the tinner automorphisms constitutes a basis for the neighborhood system of the unit [7]. Moreover, any discrete free group is maxi- mally almost periodic. If there is no continu- ous almost periodic function eXl.:ept constant functions, the topological group is called minimally almost periodic. Any noncompact connected tsimple Lie group is minimally almost periodic. References [IJ A. S. Besicovich (Besikovitch), Almost periodic functions, Cambridge Univ. Press, 1932. [2J S. Bochner, Bertrager zur Theorie der fast- periodischen Funktionen I, II, Math. Ann., 96 (1926),119-147,383-409 [3J H. Bohr, Fastperiodischen Funktionen, Erg. Math., Springer, 1932. 94 [4J V. Stepanov, tiber einige Verallgemei- nerungen der fastperiodischen Funktionen, Math. Ann., 95 (1926) 473-498. [5J N. Wiener, On the representation of func- tions by trigonometrical integrals, Math. Z., 24 (1926),5755616. [6J J. von Neumann, Almost periodic func- tions in a group I, Trans. Amer. Math. Soc., 36 (1934),4455492. [7J J. Dixmier, Les C*-algebres et leurs repre- sentations, Gauthier-Villars, 1964. [8J W. Maak, Fastperiodische Funktionen, Springer, 1950. 19 (XV.1 0) Analog Computation A. History The term analog computation is a generic term describing various techniques of computa- tion employing diagrams, or physical systems whose equations are similar to the mathemat- ical problems in question. The history of ana- log computation is probably as old as that of digital computation; for example, the ancient Greeks tried to solve cubic equations using diagrams, and the astrolabe widely used by astronomers through the medieval period is also a kind of analog computer. Soon after the discovery of logarithms, the slide rule was invented. In the 18th Century, the planimeter, used to measure plane areas, was introduced, and in the 19th Century, the nomogram ap- peared (- Section D). In the first half of 20th Century, a large electronic analog computer was developed, thus predating the I1rst prac- tical digital computer. However, analog computation has an essen- tial defect, namely, the limitation of accuracy. Today it is useful for simple calculation, but is rapidly becoming less important as the devel- opment of digital computers, including pocket calculators, advances, B. Graphical Calculation Graphical calculation is a method of compu- tation by means of geometric constructions using common drawing tools. Some typical examples of practical graphical calculation are the following: evaluation of linear functions of several variables (J. Massau, 1887), of systems of linear equations (F. J. van den Berg, ! 888), of polynomials (J. A. Segner, 1761), of alge- braic equations (LiII, 1867), graphical integra- tion, graphical differentiation, and the solu- 
95 tion of ordinary differential equations of the first order (1. Massau, 1878), of linear differ- ential equations (Czuber), and of ordinary differential equations of the second order (Lord Kelvin, 1892). Using bisquare-root graph paper, sold under the names of binomial probability paper or stochastic paper; we can handle t F- distributions or other probability distributions reducible to F-distributions, such as binomial or normal distributions, to a fairly good de- gree of approximation. This grapbica1 method of statistical inference is, even now, a powerful method of statistical quality control. C. Graphical Mechanics Graphical mechanics is the graphical treatment of mechanical problems, especially problems of equilibrium. In this method, the fundamental constructions are the composition and reso- lution of forces by means of force polygons (Fig. 1). This method is also applicable to problems in dynamics when they can be re- duced to problems of equilibrium by means of d'A1embert's principle, (4)0 ray 3 I 0 1 2 , 3m 0 1  kg I , , , I. I IF,): lengths forces i (a) (b) Fig. 1 An example of the graphical method of constructing the composite and the line of action of three given forces F I , Fz, and F3 In the force polygon in (b), the vector m represents the composite R. On the link polygon in (a), the point d is a point on the line of action of the force. If the fourth force F4 has the same magnitude as R with opposite direction, both polygons are dosed, and the four forces F I , F 2 , F 3 , and F4 are in equilibrium. Graphical mechanics is most convenient when applied to problems reducible to 2- dimensional structural mechanics. But we can also use it for 3-dimensional problems if we work on projections, e,g" on the plan and the elevation, of the original body. In recent years, however, the amount of work involved in geometric construction has been deemed non- negligible, and the technique of computer 19 D Analog Computation graphics is now considered to be much more convenient. Here the computation is done by a digital computer, and the result is displayed in a convenient graphical form. D. Nomograms Nomograms are charts in which we can easily read off the corresponding value Un from given values U", u n - l when there is a relation F(u " '" , un) = 0 among n real variables u, ' , '" un' The construction of nomograms has been thoroughly investigated by M. d'Ocagne [5]. A function of two variables F(u l , U 2 ) = 0 can be represented by a two-sided scale or by func- tional paper. The most useful nomograms are those for functions of three variables, F(u l , U 2 , U 3 ) = O. Changing this into h(x, y, uJ = 0 (i = 1,2,3) and drawing the curves J;(x. y, u i ) = 0 for fixed values of u i , we have an intersection chart, where three curves for the correspond- ing values u" u 2 , u 3 meet at a point. In prac- tice, however, it is much more convenient to use the alignment chart, which is a dual of the intersection chart. It is especially useful when the relation F(u l , u 2 , u 3 ) = 0 can be decom- posed into an equation of the form fl (u I ) gl (utJ hI (utJ f2(U 2 ) g2(U 2 ) h 2 (U 2 ) =0 .f3(U3) g3(U 3 ) h 3 (U 3 ) Putting Xi = J;jh i , Yi = gjh i (i = 1,2,3) and drawing the curves (Xi' yJ with the parameter U i scaled on it (ui-scaled), the corresponding values u I , u 2 , and u 3 satisfying relation (1) lie on a straight line (Fig. 2). Using this property we can easily read off the corresponding values (by laying down a ruler). In the strict sense, the term "nomogram" usually refers specifically to an alignment chart. (I) u, u, U3 5 -2 4 -1 3 0 2 ] 0 Fig. 2 For four or more variables, we can apply similar techniques if the function is separable into functions of three variables. If it is not separable, we Can sometimes apply such tech- niques such as two-functional scales with functional networks, cocircular charts, co- planar charts, or moving charts. Using func- 
19 E Analog Computation tional approximations, an approximate nom- ogram has recently been constructed with errors considered to be admissible in practical use, Historically, the origin of Hilbert's 13th problem, which asks if it is possible to rep- resent a function of many variables as the composite of functions of two variables, comes from a study of nomograms with many variables. E. Analog Computers The analog computer is a special purpose machine designed for a specified analog com- putation. In a wider sense it includes special devices that perform tanalog simulation. The following mechanical devices are well known: the pantograph for copying plans, the harmonic analyzer for obtaining the Fourier expansion of a periodic function, and V. Bush's differen- tial analyzer. Since about 1940 large electronic analog computers for solving differential equa- tions have been extensively developed. Analog computers have up to recently had the advantages of ease of construction, sim- plicity, and inexpensive operation, and they were also considered to be fast enough for use in real-time computation, To compensate for the limitations of analog computers in com- parison to digital ones, several hyhrid com- puters have also been used. However, with the rapid progress of digital computers and digital-analog converters, the analog com- puters are now considered less important than in the pas t. F. Curve Fitting Curve fitting is a method of finding a sim- ple curve y = f(x) supplying the best possible approximation to the values y" y" for discrete values XI, x 2 , of the independent variable. A polynomial passing through all the given points is constructed by means of tinter- polation. For experimental data we usually construct a curve by using methods, such as the "method of least squares, that take the errors due to observation into account. A function constructed to best fit the ob- served values of the function y = f(x) express- ing some physical law is called an empirical formula, in contrast to the theoretical formula. Usually, we first assume that the function contains several empirical constants, which are then determined to fit the experimental data. The most cOmmon case is a linear approx- imation (using a linear function) with a suit- able change of variables. Semilogarithmic 96 paper, logarithmic paper, or probability paper are used to facilitate transformations in terms of logarithms or normal distributions. We also frequently use polynomials or trig- onometric polynomials of lower degree. To determine the empirical constants the folIow- ing methods are used: the graphing method, the selection of certain points, the mean value method, or the method of least squares. G. Orthogonal Polynomials Let there be given the values YO,YI,' Yn-I for discrete values x = 0, 1, . , n - 1 with equal differences, A polynomial f(x) of degree k (.s;n) that minimizes the sum of the squares G,(n) = L(Ym- [(m))2 is given by k f(x)= L avqv(n,x), v=O n-l a v = L Ymqv(n, m)jSv(n), m=O n-l Sv(n)= L (qv(n,m))2, m=O and we have Gk(n)==L-;}oY-L=oa;SAn). Here the function qv(n, x), called a Chebyshev q-function, is defined by v ( v+m )( v-n )( x ) q,(n,x)= L m=O m v-m m In practical applications, it is better to replace qv(n, x) by the function q:(n, x)=qv(n,x)jrVv!Mv(n), where M,(n) is the greatest COmmon divisor of (v:m)C:m) (m==O, 1, "', v). The values q:(n,m) (m =0,1,.", n-l) are mu- tualIy coprime integers. The function q(n, m) is called the simplest Chebyshev q-function or simplest orthogonal polynomial, and is some- times denoted by X",(x), .m(x), or 4'n,m(x), When two functions satisfy the condition n (f,g)n= L f(m)g(m)=O, m=O they are calIed orthogonal for a finite sum. If from 1, x, x 2 , , , , , x v we construct a system orthogonal with respect to this definition, then we have the polynomial Pv,n(x) = mt (_It(:)(v:m) (;)/(:). which has the following connection with qv: qv(n, x) ==(( -1)'(n- 1)!j2 V (n-v-I)!)Pv,n_l(x). 
97 The polynomial P"n(x) is called a Chebyshev orthogonal polynomial or sometimes simply an orthogonal polynomial. The polynomials Pv..(x) are orthogonal for the finite sum. If n 00 and O.s;X::;; 1, the function p,.n(x) tends to P,(1- 2x), where P, is the tLegendre polynomial. These functions may be conveniently used in least-squares curve fitting. References [IJ G. W. Smith and R. C. Wood, Principles of analog computation, McGraw-Hill, 1959. For graphical calculation, [2] C. Runge and F. A. WIllers, Numberische und graphische Quadratur und Integration gewohnlicher und partieller Differential- gleichungen, Enzykl. der Math., Bd. II, Teil 3, Heft 2 (1915), 477176. For graphical mechanics, [3J C. B. Biezeno and R. Grammel, Technische Dynamik, Springer, 1939. [4J H. F. B. Miiller-Breslau, Die graphische Statik der Baukonstruktionen, Kroner, I, sixth edition, 1927; II, pt. 1, fifth edition, 1922; II, pt. 2, second edition, 1925. For nomograms, [5] M. d'Ocagne, Traite de nomographie, Gauthier-Villars, second editIOn, 1921. [6J R. Soreau, Nomographie, E. Chiron, sec- ond edition, 1921. [7J H. Schwerdt, Lehrbuch der Nomographie, Springer, 1924. [S] D. P. Adams, An index of nomograms, MIT Press, 1950, For analog computers, [9J G. A. Kom and T. M. Korn, Electronic analog computers, McGraw-Hill, 1952; second edition, 1956, For curve t'itting and empirical formulas, [ IOJ T. R. Running, Empirical formulas, Wiley, 1917, [IIJ P. G. Guest, Numerical methods of curve fitting, Cambridge Univ. Press, 1961. For tables of Chebyshev q functions, [ 12] R. A. Fisher and F. Yates, Statistical tables for biological, agricultural, and medical research, Oliver & Boyd, 1938 [n ,,:; 75, v::;; 5]. [ 13] R. L. Anderson and E. E. Houseman, Tables of orthogonal polynomial values ex- tended to /1 = 104, Agricultural Exp. Station, Iowa State College Stat. Sec. Res. Bull., 297 (1942), 595-672 [n.s; 104, v.s; 5]. [ 14J J. Yamanouchi, On the Chebyshev q functions. I. Simplest Chebyshev q functions and their tables (in Japanese), Report of the Electrotechnical Laboratory, 454 (1942) [n.s; 15, v.s;n -I; 16.s;n.s;21, v.s;9; 22 .s;n.s; 31, v.s;7; 32.s;n.s;51,v.s;5]. 20 Analysis 20 (X.1) Analysis The origin of analysis can be traced back to the time when Eudoxus (4th century B,C,) and Archimedes (3rd century B.C.) devised the 80- called method of exhaustion for calculating the area of a plane figure and the volume of a solid. Their objects of investigation were re- stncted, however, to particular types of figures or solids. In the 16th and 17th centuries, F. Viete, J. Kepler, and B. Cavalieri again took up this problem. In the 17th Century, the prob- lem of drawing a tangent to a given curve was studied by R. Descartes, P. de Fermat, B. Pascal, and J. Wallis. Fermat, in particular, applied the result to find the maxima and minima of certain functions. It is worth noting that a certain type of mathematics powerful enough to produce similar results was inde- pendently developed in Japan around that time. In 1684, G. Leibniz in Germany intro- duced the symbols dx and dy in treating the same problem. He proved that dyjdx repre- sents the slope of the tangent to the curve at a given point and discovered a new operation to calculate it. In 1686, he established the "in- verse tangent method," which is what we now call integral calculus. He also introduced the notation J. On the other hand, I. Newton in England developed his "method of fluxions," corresponding to our differential and integral calculus, from the viewpoint of mechanics. But neither Leibniz nor Newton formulated the fundamental concepts rigorously, and there- fore they were criticized severely by many contemporary scholars. The new calculus gained ground in Great Britain rather slowly. B. Taylor in England and C. MacLaurIn in Scotland demonstrated its usefulness in 1715 and in 1745, respectively. On the continent, however, Leibniz's symbolic calculus was taken up by mathematicians of the Bernoulli family, G. F. A. de I'Hopital, G. Fagnano, and many others; with it, they solved many scien- tific problems which until then had remained intractable, This motivated subsequent re- searchers to pose new problems in the form of differential equations. One of these problems, treated by J. Ie Rond d' Alembert in relation to the vibration of a chord, concerns the tpartial differential equation fP yjat 2 == a 2 a 2 yjax 2 (1 ) for Y = y(t, x) with the boundary conditions y = 0 for x = 0 and x = l. He obtained the solu- tion y= f(at+x)- f(at-x), where f is an arbitrary function of period 21. In 1753, D. 
20 Analysis Bernoulli showed that solutions of equation (1) are given by functions of the form a o C1J ( knx h TY ) Y=T+ kl akcosT+bksinT These two kinds of solutions gave rise to the question of whether an arbitrary function can be expressed by a ttrigonometric series. This problem was studied by A. C. Clairaut, J. L. Lagrange, and L. Euler. In 1807, J. Fourier in France, in treating a problem on the conduc- tion of heat, claimed that an arbitrary function of period 2n can be expressed as a 00 Y= 2 0 + kl (akcoskx+bksinkx), where the coefficients a k and b k are given by I f n a k =- f(x)coskxdx, 1! -n I f " b k =- f(x)sinkxdx. n _, This series is now called the tFourier series, but Fourier never veri lied the fact that the series (2) with coefficients (3) converges and represents f(x). It was only as late as 1820 that A. L. Cauchy in France first noted that to treat a series properly, one must examine its convergence. In the 19th Century, the concept of tfunc- tions, which had been taken in the sense of "analytic expressions," came to be detlned by the correspondence relation. Cauchy cJarified the ideas of tlimit and tcontinuity, tdifferentia- bility and tintegrability. He showed that a function that is continuous in a bounded closed interval is integrable in that interval. But his proof was not rigorous, as he lacked the notion of tuniform continuity. In his 1854 paper on the trigonometric series, B. Riemann in Germany considered the integrability of functions that might be discontinuous and introduced the concept of what we now call the tRiemann integral. The theory of tsets, initiated by G. Cantor in Germany in his paper of 1874, revolutionized analysis. R. Baire, E. Borel, and H. Lebesgue in France contributed to the establishment of analysis on the basis of set theory. Baire made a classification of discontinuous functions. Generalizing his results, Lebesgue gave a definition of analytic expressions, thus clarify- ing the term that had been used vaguely since the time of Euler. Lebesgue also tried to define the concepts of the integral of a function, the length of a curve, and the area of a surface from the most general viewpoint. In generaliz- ing the notion of tmeasure introduced by Borel, he established in 1902 the theory of tLebesgue measure with which he laid the 98 (2) foundations of the theory of tLebesgue inte- grals. The introduction of this theory gave to the theory of Fourier series a new turn in the direction of functional analysis. Measure theory was also employed by A. N. Kolmo- gorov of the Soviet Union to lay a solid foun- dation for tprobability theory in 1933. The study of functions of a complex variable was originated by Cauchy in the first half of the 19th Century. He began his research by introducing the notion of "monogenic func- tions"; a function which is monogenic at every point of a domain is what we now call a tholo- morphic function. He established tCauchy's integral theorem and integral formulas for these functions, and deduced from these theo- rems the tresidue theorem for functions with tpoles. Making use of the integral formula, Cauchy proved that a function that is holomorphic at a point a Can be expanded in a power series of the form Lo ak(z - a)" in a neighborhood of this point. Riemann considered a complex variable w as a function of another complex variable z when dwjdz is independent of the value of the differential dz, This amounts to the same thing as a "monogenic function" of Cauchy. Riemann's mapping theorem became a model for subsequent developments. Riemann intro- duced the concept of tRiemann surfaces in order to tuniformize multivalued functions. This important idea was basic to the progress of analysis and ttopology in the 20th Century. K. Weierstrass, who was a contemporary of Riemann, developed the theory of functions of a complex variable from a purely analytic viewpoint. He del'ined an element of a function to be a power series Lo ak(z 4 of z -a, representing a holomorphic function in the interior of its tcircle of convergence, and de- fined an analytic function to be an aggregate of such elements that are derived from one of them by meanS of tanalytic continuations, along all curves having the point a as the initial point. Riemann and Weierstrass constructed their theory to complete the theory of telliptic func- tions and tAbelian functions that was initiated by N. H. Abel and C. G. J. Jacobi. Their re- sults were a high point of 19th-century c1ass- ica] mathematics. H. Poincare in France built upon their work. Another high point in the theory of functions of a complex variable was reached when J. Hadamard and C. de La Vallee Poussin in France made use of it to prove the tprime number theorem in 1896. Weierstrass also initiated the study of func- tions of several complex variables and was succeeded by Poincare, P. Cousin, and E. Picard in France. They tried to extend the theory of functions of one complex variable to (3) 
99 that of many variables. F. Hartogs, however, discovered a phenomenon quite different from the case of one variable. Cousin posed in 1895 the problem of constructing a function of several complex variables that has assigned poles. This problem was pursued by A. Weil in France and solved by K. Oka in Japan in 1936. Also, the problem of characterizing tdomains of holomorphy was investigated by E. E. Levi in Italy, H. Cartan in France, and P. Thullen and K. Stein in Germany and was finally solved by Oka in 1953. H. Cartan and J.-P. Serre reformulated these results in terms of tcohomology with coefficients in tsheaves. This considerably inflUenced the formulations of mathematics thereafter. These results were further generalized to tanalytic space by H. Grauert, R. Remmert, and Stein in Germany. The theory of tcomplex manifolds, which are generalizations of Riemann surfaces to several variables, was initiated by W. V. D. Hodge in England and K. Kodaira in Japan and con- tinued by F. Hirzebruch in Germany, M. F. Atiyah in England, and I. M. Singer in the United States. Differential calculus gives a general method of finding extreme values of a given function. Likewise, in order to find a function that pro- duces an extremal of the given tfunctlonal, the calculus of variations was created in the 18th Century. For example, Euler considered the problem of finding a particular function y(x) that renders the functional S: F(x, y, y') dx an extreme value among all those functions y(X) for which the plane curve y = y(x) passes through two given points (a, A) and (b, B) of the plane; he showed that this y(x) must satisfy the differential equation dFy.fdx - Fy = 0 (1744). Lagrange, W. R. Hamilton, and others devel- oped this result into a general tvariational principle that governs not only classical tme- chanics but also tquantum mechanics. From research on the continuity or differen- tiability of the functional with respect to y emerged the idea of considering a function as a "point" in a tfunction space. This gave rise to tfunctional analysis, a branch of analysis that treats functions as elements of certain spaces and utilizes the methods of algebra and topol- ogy. The first result in this regard was the theory of tintegral equations of V. Volterra in Italy and E. I. Fredholm in Sweden in the beginning of the 20th Century. Fredholm's work was motivated by his desire to solve the tDirichlet problem, the solution of which had been used by Riemann in the proof of his mapping theorem, etc. However, Riemann's own proof of existence of the solution, called the tDirichlet principle, was not rigorous, and attempts to save the proof provided one of the central problems of analysis for some time. 20 Ref. Analysis Fredholm's solutions were different from Rie- mann% but D. Hilbert in Germany was able to justify Riemann's original proof. Hilbert's proof was later simplitled and generalized b) R. Courant, H. Weyl, and others. Hilbert also introduced the tfunction spaces 1 2 and L 2 to study the teigenvalue problem of integral equations with tsymmetric kernels. Later J. von Neumann in Germany established tspec- tral theory in abstract tHilbert spaces and applied it to the mathematical foundations of quantum mechanics (1929). S. Banach in Poland created the theory of tlinear operators in Banach spaces (1932). This theory was further generalized to that of tIinear topolog- ical spaces and was applied to the theory of distributions. In their study of partial differential equa- tions and Fourier analysis, Hadamard, J. Leray, S. L. Sobolev, T. Carleman, and many others had to extend the notion of functions; they also enlarged the notion of derivatives. L. Schwartz in France introduced tdistributions and defined derivatives in the sense of distribu- tions to unify these generalizations (1945). M. Sato in Japan defined more general generalized functions, called thyperfunctions (1958). It has become evident that both distributions and hyperfunctions have provided the most power- ful tools in recent research in the general theory of tpartial differential equations, to which L. Hormander in Sweden has made outstanding contributions. References [1] M. B. Cantor, Vorlesungen iiber Gesch- ichte der Mathematik, Teubner, 189441908. Also ...... references to 265 Mathematics in the 17th Century, 266 Mathematics in the 18th Century, and 267 Mathematics in the 19th Century, [2J N. Bourbaki, Elements de mathematique, Fonctions d'une variable reele, Integration, Espace vectoriels topologiques, Theorie spec- trales, Actualites Sci. Ind., Hermann, 1955- 1969. [3J C. Caratheodory, Vorlesungen iiber reelle Funktionen, second edition, Teubner, 1927 (Chelsea, 1968), [4J H. Carlan, 1. Calcul differentiel, 2. Formes differentielles, Hermann, 1967. [5J G. Choquet, Lectures on analysis I, II, III, Benjamin, 1969. [6J R. Courant and D. Hilbert, Methods of mathematical physics I, II, Interscience, 1953, 1962, [7] R. Courant, Differential- und Integral- rechnung I, II, Springer, third edition, 1955; English translation, Differential and integral calculus, Nordemann, 1938. 
21 A Analytic Functions, Several Complex Variables [S] C. J. de La Vallee-Poussin, Cours d'ana- lyse infinitesimale I, II, Gauthier-Villars, seventh edition, 1930. [9] J. Dieudonne, Treatise on analysis I-VII, Academic Press, 1960- 1978. [ 10] J. Dieudonne, Elements d'analyse I-VIII, Hermann, 1971-1978. [11J E. Goursat, Cours d'analyse mathema- tique I, II, III, Gauthier-Villars, fifth edition 1933-1956. ' [12J J. Hadamard, Cours d'analyse I, II, Her- mann, 1927,1930. [ 133 G. H. Hardy, A course of pure mathemat- ics, Cambridge Univ. Press, seventh edition, 1938. [ 14J D. Hilbert, Grundztige einer allgemeinen Theorie der Iinearen Integralgleichungen, Teubner, second edition, 1924. [ 15J C. Jordan, Cours d'analyse de I'ecole poly technique: I. Calcul differentiel; II. Calcul integral; III. Ca\cul integral (equations dif- ferentielles), Gauthier-Villars, third edition, I, 1909; II, 1913; ill, 1915. [ 16J H. L. Lebesgue, Lec;ons sur I'integra- tion et la recherche des fonctions primitives, Gauthier-Villars, second edition, 1928. [ 17J E. Picard, Traite d'analyse, Gauthier- Villars, I, 1922; II, 1926; III, 1928. [ 1 8J G. P61ya and G. Szego, Aufgaben und Lehrsatze aus der Analysis I, 11, Springer, second edition, 1954. [ 19] W. Rudin, Real and complex analysis, McGraw-Hili, 1966. [20] L. Schwartz, Cours d'analyse mathema- tique I, II, Hermann, 1967. [20A J L. Schwartz, Analyse- Topologie gen- erale et analyse fonctionelle, Hermann, 1971. [21] L. Schwartz, Theorie des distributions, Actualites Sci. Ind., Hermann, 1950, second edition, 1966. [22J V. I. Smimov, A course of higher mathe- matics: 1. Elementary calculus; II. Advanced calculus; III-I. Linear algebra; 111-2. Complex variables, special functions; N. Integral equa- tions and partial differential equations; V. Integration and functional analysis, Addison- Wesley, 1964. (Original in Russian, 1961.) Also  references to 106 Differential Calculus, 107 Differential Equations, 162 Functional Analysis, and 198 Holomorphic Functions. 21 (X1.20) Analytic Functions of Several Complex Variables A. Ho10morphic Functions As in the case of tholomorphic functions of one complex variable, the definition of holo- 100 morphic functions can be given in two ways: the first definition utihzes differentiability, following the approach of B. Riemann; and the second method utilizes the notion of power series expansion as developed by K. Weier- strass. In this article, N = {O, 1,2" }. B. Power Series Let z be an n-tuple of complex variables z l' ..., Zn' and c ==(c J ,.." cn) a point of cn. An infinite sum P of monomials ak(z d = a k ,. ,dzJ -cdk',,,(zn-cn)k, (k=(kj, ...,kn)E Nn), where a k E C, is called a power series with center c and coefficients a k . If, for a bi- jection cp of N onto N n , the simple series PENlaq>(p)(z - c)q>(P)\s convergent at z = zo, we say that P is absolutely convergent at zo, Its sum at zO, denoted by L ak(zO -c)k, is de- fined as the sum L aq>(p)(zo - c)q>(p), which is independent of the choice of cp If P is uni- formly bounded at zO, then P is absolutely convergent at every point of the open polydisk S={zllzj-cjl<lzJ-cjl,j=l,. .,n}. Further- more, in this case P converges absolutely and uniformly on every compact set in S (N. H. Abel). The convergence domain of a power series P is the set D of points ZO such that P is ab- solutely convergent at every point in a neigh- borhood of zO, The interior of the set B of points at which the infinite sum P is uniformly bounded is equal to D A (complete) Reinhardt domain with center c is a domain D in C" such that whenever D contains Zo, the domain D also contains the torus {z IZ j - c) = IzJ Cjl,} = 1, ,n} (the closed polydisk {zllzj-cjl lzJ- cjl,j= 1,., n}), If the convergence domain D of the power series P is not empty, it is a com- plete Reinhardt domain and is also logarithmi- cally convex; that is, the set D - U j {z I Zj = cJ is mapped onto a convex domain in Rn by the mapping Zj-->logIZj-cjl G= 1,.. .,n). The set fj of points at which P is absolutely convergent is, in general, greater than D, and it is possible that fj contains exterior points of D, A thorn of D is the set of exterior points of D contained in jj that are located on the planes {z Zj = c j } G = 1. . , n). An n-tuple r E R"'- is called a set of associated convergence radii if P is absolutely convergent at every point of {z II z) - cjl < rj,J == 1 , ... , n} but not of{ Z II Zj - cjl > r j ,} == 1, ' n}. An n-tuple of associated conver- gence radii may not be uniquely determined, but it satistles Iimsup(l a kl rk )J/'k l = 1, Ikl==k l +...+kn Ikl + 00 (E. Lemaire). Let f be a complex-valued function defined in a neighborhood of ZO E C". If there exists a 
101 convergent power series P with center Zo such that at every point of a neighborhood of ZO the value off and the sum of P coincide, then f is called analytic at zO in the sense of Weierstrass, and P is the Taylor expansion of I at zO, C. Differentiability Let I be a complex-valued function det'ined in a neighborhood of Zo E C". If in a neighbor- hood of Zo we have f(z)- f(zO)=D:dzl -z)+... +lXn(zn-z)H, with ()(I," .,exnEC and lim ;;/(lZl -z1 +... +lzn-zI)=O, z-zo then we say that f is (totally) differentiable at zO, The function I is then continuous at zO, and the partial derivatives af/azj G = 1, , n) exist. Furthermore, the Cauchy-Riemann differential equations afNzj=o G= 1, , n) hold, where aIlazj=(1/2)(aIlaxj-iaJlay) and aflozj==(1/2)(aIlaxj+ iaIloy) with Zj= x j + iYj We say that I is ho10morphic at ZO in the sense of Riemann if f is differentiable at every point in a neighborhood of zoo Analyticity in the sense of Weierstrass is equivalent to holomor- phy in the sense of Riemann. Furthermore, if the partial derivatives af/oz) (j = 1, . . . ,n) exist at every point in a neighborhood of zo, then I is, without assuming continuity, proved to be holomorphic. Thus the holomorphy off in each variable Zj implies the holomorphy off in z == (z I' ' ,,) z,) (Hartogs's theorem of ho10mor- phy, 1906), A complex-valued function in a domain G c en is called ho10morphic or analytic in G if it is holomorphic at every point of G. Let H(G) be the ring of holomorphic functions in G. For f = u + iv E N(G), u and V satisfy in G the dif- ferential equations a 2 u(z, z)/OZjOZk = 0; that is, a 2 u (J2u -+-=0 (2) axjax k ayjOYk ' (J2U (J2u ---=0, j,k=l,...,n. OXjaYk OXkOYj A tdistribution TED' (G) is called plurihar- monic in G if it satisfies (2) in G, Then T is harmonic and hence a real analytic function by tWeyl's lemma. Let G j be a domain in the z{plane with piecewise smooth boundary C j . If IE H(G) (G = nj1 G) is continuous on G, then (2i)n L x f(O XCJ(I -zd... «(n-zn) { f(Z),ZEG, d(1 /\... /\d(n= - O,zrf=G 21 E Analytic Functions, Several Complex Variables (1) (Cauchy's integral representation). Thus if n :) 2, then I is determined by its values only on the proper subset C = Ci x x C n of oG, which is called the skeleton (or determining set) of G, For (pluriharmonic) functions of several com- plex variables, the boundary value problem, not necessarily solvable in its classical form, is not so effective as the Dirichlet problem in one complex variable. As in the case of one variable, the tLaurent expansion is valid for every holomorphic function in a domain of the form G = nj1 G), where the G) are circular annuli c C. Suppose that we are given II E H(G I ) and (2 E H(G 2 ), where G I , G 2 are domains in C" such that G l n G 2 is nonempty and connected. If II = J on {zllzj-zJi < r j , y= l,j= 1,..., n}, where zO = X O + iyo E G l n G 2 , then there exists a unique IEH(G I U G 2 ) such that II G I = II and fl G 2 = I2 (tbeorem of identity). Thus tanalytic continu- ation proceeds as in the case of one variable. Similarly, some fundamental theorems in one variable, such as tLiouville's theorem on ten- tire functions and the tmaximum principle, hold also for several variables. However, there are some properties that reveal the differences between the cases of one and several variables. For instance, the set of zeros of a holomorphic function ( 23 Analytic Spaces B) has no isolated point for n ? 2. The investigation of these remarkable differences is one of the purposes of the theory of analytic functions of several complex variables. D. Shilov Boundaries While the maximum principle holds for a holomorphic function in a domain G, the set of points where the maximum is attained may be a proper subset S of 6G. For instance, if Gis the product of annuli as before, then the skele- ton of G can be taken as S. In connection with the theory of tnormed rings, G. E. Shilov proved that there exists a unique smallest member So (called the Shilov boundary of G) in the family of closed subsets S such that sup{ II(z) I 1 z E S} == sup{ II(z)11 ZE G} for every f E H(G) continuous on G. The structure of So is investigated in detail together with the pseudoconvexity of G connected with it. Applying tPerron's method for the Dirichlet problem to tplurisubharmonic functions and the Shilov boundary, H. J. Bremermann solved one type of boundary problem. E. Local Theory (3) Let I and g be two functions defined in neigh- borhoods of a set &: en. If I == g in a neigh- borhood of S, then I and 9 are called equiva- 
21F Analytic Functions, Several Complex Variables lent with respect to S. The germ off on S, denoted by fs, is the equivalence class off: A germ of a bo10morpbic function on S is the germ on S of a holomorphic function defined in a neighborhood of S, and H(S) denotes the ring of germs of holomorphic functions on S. Given a point 0 in C", H(O) = H( {O}) is iso- morphic to the ring H. of convergent power series at 0, i.e., the power series that are ab- solutely convergent in some neighborhoods of O. For every nonzero function fE H(O), there exists a system of coordinates (z I' , z,,) cen- tered at 0 such that .f(0, , 0, zn) f- 0 for every Zn f- 0 in a neighborhood of z. = O. In a neigh- borhood of 0, .f is then equal to the product of an invertible element of Hn and a distinguished pseudopo1ynomial P(z.) = z + a l (z I, . . . . Zn-I)Z%-I + + ap(z I' ... , zn-I) E Hn-I [znJ, with al(O, ...,0)=... =ap(O, ...,0)==0, and P(zn) is uniquely determined by .f and the coordi- nates z " , z. (Weierstrass's preparation theorem). It follows from this that Hn is an /1- dimensional tregular local ring. Considering H(O) as the tinductive limit of tlocally convex rings H(U), where U ranges over a base for a neighborhood system of 0, H. Cartan proved the preparation theorem in a more precise form in which the association .f -->a j is con- tinuous with respect to the supremum norm. Based on a deep consideration of this situ- ation, K. Oka proved a theorem of funda- mental importance: The tsheaf (!;c defined by (Yc n , , = H(z) (z E C") is tcoherent. F. Domains of Ho10morphy Given a domain G c G. for n:; it may be that there exists a domain G' strictly greater than G such that all the functions that are holomorphic in G extend to holomorphic functions in G'. For instance, let S = S' x 0", where S' and a are open polydisks in (ZI,' , zn_d-space and zn-space, respectively, and let T c C" be an open set. If there exists an open set V (# 0) e S' such that (V x a) U (S' x oa) c T and if S n T is connected, then all the functions that are holomorphic in T extend uniquely to holomorphic functions in SU T (Hartogs's continuation theorem). In particular, if A is an tanalytic set in a domain G c C" with dim A .s; n 2, then all the functions that are holomorphic in G-A extend uniquely to holomorphic functions in G. Furthermore, if A is an analytic set in G with A #G, then every J E H(G - A) that is locally bounded at the points of A extends uniquely to a holomorphic function in G (Riemann% continuation theorem 102 for n ) 2). The domain G f of holomorphy for .f is defined to be the maximal domain to which .f may be continued analytically. A domain G is called a domain of ho10morphy if G = G f for some f E H(G). However, G f is, in general, not a subdomain of C". G f is, generally, a mani- fold spread over C"; i,e" G J is a connected /1- dimensional tcomplex analytic manifold with a holomorphic mapping <p: Gf-->C n of maximum Jacobian rank (<p is then an open mapping). The same is true for the common existence domain of functions in a subfamily of H(G). The common existence domain G of all the functions in H(G) is called the envelope of ho10morphy. A ho10morphically complete domain is a domain G such that G = (j, These notions carryover to the case where G is a manifold spread over C". The (general) Levi problem of determining the conditions for a given domain to be holomorphically complete is fundamental to the theory of analytic func- tions of several complex variables ( Section I), In connection with this problem, various notions of pseudoconvexity of holomorphi- cally complete domains are defined. G. Pseudo convexity An upper semicontinuous real-valued function U (-00 .s;u< +00) in a domain Gee" is said to be p1urisubharmonic if for every zO E G and every a E C" the function u(ZO + ta) of t is 'sub- harmonic (including the constant - 00) in all the connected components of {t ZO + ta E G}. A domain G is said to be pseudoconvex (or d. pseudoconvex) if U = -Iogd, is plurisubhar- monic in G, where d,(z) is the distance from z E G to aG with respect to any norm in C". Every connected component of the interior of the intersection of a family of pseudo convex domains is pseudoconvex, and the union of an increasing sequence of pseudoconvex domains is pseudoconvex. Suppose that we are given a domain G and a function u of class C 2 in a neighborhood of G such that G = {z I u(z) < O} and, for some £>0, jk(02u/ozjozda}rk)f.laI2 for every aEC n . Then the domain G is said to be strongly pseudoconvex. Strong pseudo- convexity implies pseudo convexity. Every pseudoconvex domain is exhausted by an increasing sequence of strongly pseudo convex domains, An open set Pc C" is called an ana- lytic polyhedron if P = {z II X,(z) I < 1, ex == I, , N}, X,E H(P) (a= 1, . . . N). Then every connected component of P is pseudoconvex. A Weil domain is a connected and bounded analytic polyhedron P defined by X,(ex = 1, , N) with N  /I, such that for every k (1:( k:( n) the intersection of the hypersurfaces I X,,(Z) 1 = 1 (1 :( i:( k) is of dimension,;;, 2n - k. 
103 H. Ho10morphic Convexity A domain G c cn is called ho10morphical1y convex if for every compact set KeG, K = nfEH(G) {z II(z)1  supwEK II(w)l} (the holomor- phic hull of K) is a compact set contained in G. (For a domain G contained in an tanalytic set we can similarly define holomorphic con- vexity of G.) Every connected component of the intersection of a family of holomorphically convex domains is holomorphically convex, and a holomorphically convex domain is exhausted by an increasing sequence of Weil domains. Holomorphic completeness implies holomorphic convexity. The converse is true for domains in en. If G is holomorphically convex, then for every point ( of aG there exists an 1 E H(G) such that 1 is not locally bounded at ( (H. Cartan and P. Thullen, 1932). Hence a holomorphically convex domain is a domain of holomorphy. Thus a domain in C is holomorphically convex if and only if it is a domain of holomorphy. (The same is true for unramified covering domains over cn (Oka, 1953).) The union of an increasing sequence of domains of holomorphy is a domain of holo- morphy (H. Behnke and K. Stein [2], 1939). Suppose that we are given a domain G and domains S., 1;, (a = 1,2,) such that S, U 7" c G and SUPT, III = sUPSaUT, If I for every f EH(G). Suppose also that So = lim S, is bounded. We say that the continuity principle holds in G if To c G (To = Jim 7;,) implies So c G. The con- tinuity principle holds in a domain of holo- morphy (Hartogs's theorem of continuity). This implies that if G is a bounded domain c en (n ) 2) with connected boundary oG, then every function holomorphic in a neigh- borhood of oG extends to a holomorphic function in G (Hartogs-Osgood theorem). In particular, for n ) 2, the set of singular points of a holomorphic function has no isolated point. A domain is pseudoconvex if the con- tinuity principle holds there. Hence a domain of holomorphy is pseudoconvex. I. The Levi Problem Let G be a domain in C" and zOEOG, If there exists an open neighborhood U of ZO such that every connected component of Gnu is a domain of holomorphy, then G is called Car- tan pseudoconvex at zoo On the other hand, if every I-dimensional analytic set that has zO as an ordinary point contains points not belong- ing to G U {zO} in the neighborhoods of zO, then G is called Levi pseudoconvex at ZO Fur- thermore, G is called locally Cartan (Levi) pseudoconvex if G is Cartan (Levi) pseudo- convex at every point of uG, Every domain of 21 J Analytic Functions, Several Complex Variables holomorphy is locally Cartan pseudoconvex. If G is pseudoconvex and there exists a neighbor- hood U ofzo such that Gn U={zl<p(z)<O}, where rp E C l ( U), then G is Levi pseudoconvex at zoo The (proper) Levi problem of whether every pseudoconvex domain is a domain of holo- morphy was proposed by E. E. Levi (1911). After unsuccessful efforts by various mathe- maticians to solve the problem, it was affirma- tively solved by Oka (1942 for n = 2 and 1953 for manifolds spread over C" for n ) 2), H. Bremermann, and F. Norguet. The problem was solved also by H. Grauert [14] (1958) in a more general form (- Section L) using results on linear topological spaces and by L. Hor- mander [ 1 8J (1965) using methods of the theory of partial differential equations. A fundamental step in Oka's solution is his gluing theorem: Let G be a bounded domain c en. If every connected component of G I = {zlx 1 >a}nG and G 2 ={zlx 1 <b}nG (a <b) is a domain of holomorphy, then G is a do- main of holomorphy. Indeed, by virtue of the Behnke-Stein theorem and the fact that every pseudoconvex domain is the union of an increasing sequence of bounded locally Cartan pseudoconvex domains, it suffices to solve the Levi problem in the case of a bounded locally Cartan pseudoconvex domain. The Levi prob- lem in this case is solved by the gluing theo- rem. Various integral representations of holo- morphic functions are known besides the Cauchy representation. The Bergmann-Weil integral representation in a Weil domain was used as an important means of solving the Levi problem. J. Ho10morphic Mappings Holomorphic functions with values in a tquasi- complete tJocally convex complex vector space E have also been investigated. The classical theory described above has been generalized, to some extent, to this case. In this way, many applications of the theory have been dis- covered. An E-valued function in a domain G c C" is holomorphic if and only if the map- ping uof: G-->C is holomorphic for every con- tinuous linear form u on E. By this theorem, most problems concerning E-valued holo- morphic functions can be reduced to those of ordinary holomorphic functions. Note that the vector space H(G) of ordinary holomorphic functions in G is a tFrechet space, The spaces CP and complex tBanach spaces belong to the above category of E. A. CP-valued holomor- phic function in a domain G c cn is called a ho10morphic (or analytic) mapping of G into CPo An isomorphism in the category of do- 
21 K Analytic Functions, Several Complex Variables mains G c en and holomorphic mappings is called an analytic isomorphism (or biholomor- phic mapping). An automorphism in the cate- gory is called an analytic (or ho10morphic) automorphism. With every domain G c C is associated the sheaf (!)G of germs of holo- morphic functions over G. Thus we have the notion of a tringed space (G, 0,). A complex analytic manifold can be defined as a (Haus- dorff) ringed space that is locally isomorphic to some (G, 0,). A meromorphic function in G is a function that is locally the quotient of two holomor- phic functions with denominator #0, It may be defined more rigorously as a meromor- phic mapping of G into P,(C) (- 23 Analytic Spaces D). K. The Cousin Problems The Cousin problems are those of constructing meromorphic functions with given zeros or poles, In terms of sheaves the problems are stated as follows: Let %G be the sheaf of germs of meromorphic functions over a domain G c C". The first Cousin problem asks whether the mapping f(G'%G)-->f(G'&G) induced by the exact sequence O-->(9G-->%G-->&G-->O (&G = %G/(9G) is surjective, where r(G, $') is the module of tsections of  over G (- 383 Sheaves C). Let %G * be the sheaf of multiplica- tive groups of germs of meromorphic functions not identically 0 and (!)G* be the subsheaf of %G * formed by germs of nonzero holomorphic functions, The second Cousin problem asks whether the mapping f( G, %6*)--> P( G, 9 G ) (9 6 = %G * /(9G *) is surjective. P. Cousin (1895) solved the first problem for G = C n or TIj=1 G j and the second problem for G ==C". aka (1935) proved that the first problem is solvable in every domain of holomorphy. In solving the second problem in a domain of holomorphy, aka established the notion of tfiber bundles and proved that the problem for any domain is reduced to holomorphic triviality of a holomorphic principal fiber bundle over the domain and that holomorphic triviality is equivalent to topological triviality when the domain is of holomorphy (Oka's principle), Using the solutions of the Cousin problems, aka proved his gluing theorem, described in Section 1. L. Stein Manifolds Abstracting certain important properties of a domain of holomorphy, Stein [33] introduced the following category of complex analytic manifolds (X, (!)x): (1) X is paracompact (i.e., 104 each connected component of X has a count- able open base). (2) Functions in f(X, (!)x) separate the points of X. (3) For every point x E X there exists a system of local coordi- nates around x that is formed by functions in f(X, (!)r)' (4) X is holomorphically convex, (X, 0,) is then called a Stein manifold. It was later discovered by Grauert [ 12J that con- ditions (2) and (4) imply (1) and (3), Applying the theory of tcohomology with coefficients in sheaves, H. Cartan and J.-P. Serre obtained for an tanalytic coherent sheaf ff on a Stein manifold X, the following funda- mental theorems of Stein manifolds. Theorem A: HO(X, J1') generates the stalk :Y'x (as an (!)x- module) at every point x of X. Theorem B: Hq(X, J1') =0 for all q> 0 [7,30]. Conversely, for a complex analytic manifold X, if for every analytic coherent sheaf ..f of ideals defined by a O-dimensional analytic set in X (i.e" a dis- crete subset of X), H I (X,..f) = 0, then X is a Stein manifold. Furthermore, if f(X, (9x) = f( Y, (9r) for a Stein manifold Y (as in the case where X c C"), then the fundamental theorem A for every coherent sheaf of ideals implies that X is a Stein manifold (I. Wakabayashi). Due to the fundamental theorems, most results on domains of holomorphy hold un- changed for Stein manifolds. For instance, the first Cousin problem is always solvable. The second Cousin problem is solvable if and only if H 2 (X, Z)=O. An n-dimensional Stein mani- fold can be realized as a (ramiti'ed) covering domain of holomorphy over C". Furthermore, some theorems on differentiable manifolds have analogs on Stein manifolds. For instance, the cohomology groups of the complex of holomorphic differential forms over a Stein manifold X are isomorphic to the cohomol- ogy groups H*(X, C) (analog of tde Rham's theorem). Every n-dimensional Stein manifold X is realized as a closed complex analytic submanifold in C 2n + l ; that is, there exists an injective tproper holomorphic mapping f: X -->C 2n +1 with df #0, Consider all the holo- morphic tprinciple ti'ber bundles over a Stein manifold X whose fibers are isomorphic to a complex Lie group G. The analytic isomor- phism classes of the bundles and the elements in HI (X, G") (where GO is the sheaf of germs of holomorphic mappings of X into G) are in one-to-one correspondence. The same is true for the topological isomorphism classes of the bundles and the elements in HI(X, GC) (where G' is the sheaf of germs of continuous map- pings of X into G), The mapping HI (X, GO)--> H I (X, G ' ) induced by the canonical injection Ga  G' is bijective (Grauert [ 13]). Every rela- tively compact domain in a complex analytic manifold is holomorphically convex if it is strongly pseudoconvex (Grauert [14]). Hence 
105 such a domain is a Stein manifold. It follows from this that every real analytic manifold with countable base for open sets is realized as a closed real analytic submanifold of some R". The notion of Stein manifolds is generalized to that of weakly I-complete manifolds: a complex manifold X is called a weakly 1- complete manifold if there exists a plurisubhar- monic function u of class Coo on X, such that for any cER, Xc == {XE X I u(x) < c} is relatively compact. The family of such manifolds in- cludes compact complex manifolds too. For a weakly I-complete manifold also, vanishing theorems of cohomology have been estab- lished by H. Hironaka, S. Nakano [26], and H. Kazama [2J (- 232 Kahler Manifolds D). The theory of entire functions of two vari- ables has been developed from a new view- point established by T. Nishino. An entire function f of two variables defines a family of Riemann surfaces {f= c C E C} on C 2 , and investigations of the structure of such a family play an important role. For instance the fol- lowing is proved in this way: If every irreduc- ible component off= c (CE C) is biholomor- phic to C I, then there exists an entire function g such that (f y): C2 -->C 2 is a biholomorphic mapping (Nishino [28J). In this theorem, the analyticity of g is obtained from the fact that C 2 is a Stein manifold. M. Continuation of Analytic Sets The application of the theory of cohomology with coefficients in sheaves is not restricted to problems concerning Stein manifolds. Given G o = {z Ilzjl < 1, l.s;j.s;n} (n3), G, = {z Ilz,1 < f., Iz) < 1, 2.s;j  n}, and G(m) = G, U (G o -{zlz2=...=zm=0})(3.s;m.s;n), we have W(G(m), (OG(ml) = 0 (1  p .s;m - 2) (Scheja's theorem [32J). Let :F be a coherent analytic sheaf over a domain G c en, If, for every point z of an analytic set A  G, = {Of or p.s; n - dim, A - 2 hd, ff (where hd, :Ji is the thomological dimension of the i'Do,z-module .), then it follows from Scheja's theorem that the mapping HP(G, ff)-->HP(G - A, :Ji) induced by the canonical injection G - A -+ G is bijec- tive. This generalizes Hartogs's continuation theorem for holomorphic functions, which corresponds to the case p = O. Besides the continuation of holomorphic functions, we can consider the continuation of analytic sets. Let A be an analytic set in a domain G c en and S an analytic set in G A. A point z EGis said to be regular (essentially singular) with respect to S if the closure S of S in G is (is not) analytic at z. If dim A < dim, S for every point z E S, then S is analytic in G. If S is purely d-dimensional and S is analytic at 21 N Analytic Functions, Several Complex Variables a point of a d-dimensional irreducible com- ponent A' of A, then S is analytic at every point of A' that is not located in any other ir- reducible component of A. Furthermore, if dim A  dim S and S is purely d-dimensional, then the following hold: (1) The set E of essen- tial singularities of S is, if not empty, a purely d-dimensional analytic set in G, formed by irreducible components of A. (2) If every irreducible component of A contains points of E not located in any other irreducible com- ponent of A, then AcE, and A is, if not empty, purely d-dimensional. (3) If every d- dimensional irreducible component of A con- tains points that are regular with respect to S, then S IS a purely d-dimensional analytic set in G (Thullen, Remmert, and Stein). By these results it is possible to give a proof for tChow's theorem that every analytic set in Pn(C) is algebraic. The continuation of holomorphic functions is related to the continuation of their graphs. W. Rothstein investigated the continuation of analytic sets to obtain the following analog to Hartogs's theorem of continuity: If G = G, UG 2 , G, ={zllz,I<lj2,Lj2Izl < I}, G 2 = {zllz,1 < 1, Ij2<Lj=2Izl< I}, and G= {z Ilz,1 < 1, Lj2lzjl2 < I} (the envelope of holomorphy of G) with n;::: 3, then every purely (n I)-dimensional analytic set A in G extends to an analytic set in G; that is, there exists a purely (n - I)-dimensional analytic set A in G such that A = An G. K. Kasahara and H. Fujimoto generalized this theorem to the case of analytic spaces. N. Nevanlinna Theory for Several Complex Variables The Nevanlinna theory investigates holomor- phic mappings between complex manifolds. In function theory of one variable, for a holo- morphic mappingf:C-+p 1 (C) from C into the I-dimensional tcomplex projective space, the famous +Picard theorem states that if f omits three values, then f must be constant. R. Nevanlinna developed the quantitative theory of value distributions off: L. Ahlfors [IJ, introduced the geometric approach, and enunciated the principle that the negative curvature of the image manifold restricts a holomorphic mapping. In higher-dimensional situations, this principle has been realized in many cases. The Nevanlinna theory for a holomorphic mapping f: c-->pn(C) with re- spect to thyperplanes was established by Ahl- fors and H. Weyl and J. Weyl [34]. Holomor- phic mappings into projective spaces have been studied in detail by W. Stoll, H. Fujimoto, and M. Green. Following the work of S. S. 
21 0 Analytic Functions, Several Complex Variables Chern [9] of around 1970, the equidimen- sional case was investigated (P. A. Grifllths, K. Kodaira, S. Kobayashi and T. Ochiai), J. A. Carlson and Grifllths [6J succeeded in obtain- ing a tdefect relation for equidimensional holomorphic mappings and thypersurfaces of the image manifold. This yields the following Picard-type theorem (after the formulation by F. Sakai, lnventiones Math., 16 (1974»: Let X be a tprojective algebraic manifold of dimen- sion nand f: C" --> X a holomorphic mapping. Let D be a hypersurface of X. We take a tdesingularization n: X*+X so that the in- verse image D* = n- 1 (D) has at most normal crossings as singularities. We denote by K* a tcanonical divisor of X*. We assume that . dim HO(X*, (9(m(K* +D*») hm sup n >0, m'"""'oo 1n If f omits D, then the Jacobian off vanishes everywhere. As a corollary, it follows that the tuniversal covering manifold of X -D cannot be cn, When X = P"(C) and D is a nonsingular hypersurface of degree d, the above assump- tion is satisfied if d) n + 2. In the nonequidi- mensional case, Ochiai [29J (with supple- mentary works by M. Green, Y. Kawamata, and P. Wong) verified the following asser- tion, which was first stated by A. Bloch with rough arguments: Let X be a projective alge- braic manifold of dimension n, and suppose dim HI(X, (9x) > n; then the image of a holo- morphic mapping f: C-->X is contained in a proper subvariety of X ( 124 Distributions of Values of Functions of a Complex Variable; 272 Meromorphic Functions). O. Hyperbolic Manifolds Every complex analytic space X has two in- variant tpseudodistances: the Caratheodory pseudodistance c and the Kobayashi pseudo- distance d x , both of which generalize the Poin- care distance p of the unit disk D = { Izl < I}. These pseudodistances can be defined by the property that d x is the largest pseudodistance among all pseudodistances (jx on X for which all holomorphic mappings (X, bx)-->(D,p) are distance-decreasing, while c x is the smallest among all pseudo distances c5 x for which all holomorphic mappings (D, p)-->(X,6,) are distance-decreasing. Then C x .s; d,. If Y is an- other complex analytic space, every holomor- phic mapping f: X --> Y is distance-decreasing with respect to either its Caratheodory or Kobayashi pseudodistance. This may be con- sidered to be a generalization of the Schwarz- Pick lemma. (The tSchwarz lemma is often 106 referred to by this name, since G. Pick also investigated the distance-decreasing property.) The Kobayashi pseudo distance d x can also be obtained by integrating an infinitesimal differential pseudometric called the Koba- yashi pseudometric in the same manner as the Riemannian distance is obtained from the Riemannian metric (H. L. Royden). If d x is a (complete) distance on X, then X is said to be (complete) hyperbolic. A Riemann surface is hyperbolic in this sense if and only if its universal covering is biholomorphic to the unit disk. If X is open in Y and if, for every pair of sequences of points {Pn} and {qn} in X converging to distinct points of X, Iim dx(Pn, qn) is positive, then X is said to be hyperbolically embedded in y, The Riemann sphere CD { oo} minus three points is not only complete hyperbolic but also hyperboli- cally embedded in the Riemann sphere. Every holomorphic mapping of C into a hyperbolic complex analytic space is constant, while every holomorphic mapping of the punctured disk D* = {O< Izl< 1} into a hyperbolically embed- ded space Xc Y extends to a holomorphic mapping of D into Y. Thus the classical little Picard theorem reduces to the statement that C - {O, I} is hyperbolic, while the great Picard theorem reduces to showing that C - {O, I} is hyperbolically embedded in the Riemann sphere. These classical theorems can be gen- eralized in two ways. If M is a tsymmetric bounded domain and r is a discrete arithmetic group acting freely on M, then M/f is not only complete hyperbolic but also hyperboli- cally embedded in its Satake compactification (Kobayashi and Ochiai). If X is the comple- ment of 2n + 1 hyperplanes in general position in the complex projective space P"(C), then X is complete hyperbolic and hyperbolically embedded in P"(C) (a restatement of a re- sult going back to E. Borel, A. Bloch, and H. Cartan). Although there are some noncompact non- hyperbolic complex manifolds X for which every holomorphic mapping f: C-->X is con- stant, a compact complex manifold X is hyper- bolic if and only if every holomorphic map- ping f: C --> X is constant (R. Brody). For the tTeichmiiller space X = Tg of com- pact Riemann surfaces of genus g, the Koba- yashi distance d x agrees with the Teichmiiller distance which had been introduced before the complex structure of Tg was defined (H. L. Royden), In the study of pseudo convex domains, the t Bergman metric, the Caratheodory distance, and the Kobayashi distance serve as useful tools. Their behavior at the boundary has been studied extensively. 
107 Given an n-dimensional complex mani- fold X, an invariant measure 'I' x can be de- fined as the largest measure such that every holomorphic mapping f from the polydisk D" with invariant measure into X is measure- decreasing. For an algebraic manifold of gen- eral type, this measure is everywhere positive. p, Bounded Domains in C" For any bounded domain in C" there is natu- rally assigned a tKiihler metric called the Bergman metric. Using the invariance of this metric, E. Cart an proved that all the Her- mitian symmetric spaces of noncompact type are realized as bounded domains, In view of the fact that some important period matrix domains (e.g., the 19-dimensional bounded symmetric domain of type N, the Siegel upper space, etc.) are of this type, it is obviously of great significance in algebraic geometry to study discontinuous subgroups of the auto- morphism groups of such domains (Pyatetskii- Shapiro). On the other hand, it is almost impossible for general bounded domains to determine the explicit form of their Bergman metrics. But Cartan also investigated strongly pseudo- convex real hypersurfaces in C 2 , and he solved completely the (local) equivalence problem for them, introducing a definite type of tCartan connections over them in a functorial way. Thereafter N. Tanaka (and for hypersurfaces also Chern and J. Moser) generalized this result to all pseudoconvex real submanifolds in higher-dimensional complex manifolds. One can further apply this result to the equi- valence problem of bounded domams with smooth strongly pseudoconvex boundaries, for C. Fefferman [ 10J proved by analyzing the boundary behavior of the Bergman metric that every biholomorphic mapping between two such domains is extended smoothly up to their boundaries (I. Naruki [27J gave an alternative proof). Q, History In connection with t Abelian functions, ana- lytic functions of several complex variables have been studied sporadically since the time of Riemann and Weierstrass (H. Poincare, Cousin). A series of investigations by Hartogs ([ 17J (J 906), etc.) that revealed the distinctive properties of several complex variables ini- tiated a new epoch in complex analysis. Levi (191O 1911) generalized Hartogs' s results to the case of meromorphic functions, introduced the notion of pseudoconvexity, and proposed 21 Q Analytic Functions, Several Complex Variables the so-called Levi problem. After a lapse of time, many contributions to this new area of complex analysis have been made since 1920. The study by K. Reinhardt (1921) of analytic automorphisms was further developed by C. Caratheodory and Behnke. The tkernel func- tion introduced by S. Bochner and S. Berg- mann (1922) produced many remarkable results. In contrast with tPicard's theorem in one variable, P. Fatou found a holomor- phic mapping f: C 2 -->C 2 with nonvanishing Jacobian such that the image f(C 2 ) has an exterior point. The theory of analytic functions of several complex variables has flourished since 1926. Behnke and Thullen in Miinster, together with G. Julia and H. Cart an in Paris, were the most active investigators. The results on tnormal families of analytic functions of several com- plex variables (Julia [20], 1926), the uniqueness theorem of holomorphic mappings (H. Cartan, 1930), and a characterization of a domain of holomorphy by holomorphic convexity (Car- tan and Thullen, 1932) are their most remark- able achievements. Behnke and Thullen [3] systematized the results obtained since the discovery of the theory by providing a com- plete bibliography of articles up to 1934. The three major unsolved problems at that time-those of Cousin, Levi, and the approx- imation of holomorphic functions-were intensively studied by Oka from 1936, who has given complete solutions [30]. The investiga- tion of ideals of holomorphic functions by H. Cartan (1944), together with that of ideals with undetermined domains by Oka, has developed into the theory of coherent analytic sheaves. The notion of analytic spaces, first introduced by Behnke and Stein (195 I), extended the field of investigation in the theory of analytic func- tions of several complex variables. The theory of cohomology with coefficients in sheaves has been effectively applied by H. Cart an and Serre (1951-1952), The introduction of the notion of Stein manifolds (195 1) came at the same time. Grauert's deep investigations since 1955, together with those of Stein and Rem- mert, have contributed greatly to the develop- ment of the theory of analytic spaces. In the 1960s, active investigations took place also in the United States [16]. The theory of auto- morphic functions of several complex variables has been developed by C. L. Siegel, I. Satake, and others in connection with the theory of numbers. Entire functions of two variables have been investigated from a new point of view by Nishino and others since 1968 (last paragraph of Section N). The notion of a hyperbolic manifold introduced by Kobayashi (1967) enables us to obtain many results on 
21 Ref. Analytic Functions, Several Complex Variables complex manifolds by means of the methods of differential geometry. The theory of automor- phisms of bounded pseudo convex domains was developed extensively in the 1970s by Fefferman, Naruki, and others. The theory of an envelope of holomorphy has also been successfully applied to the theory of elemen- tary particles in physics. References [1 J L. Ahlfors, An extension of Schwarz's lemma, Trans. Amer. Math. Soc., 43 (1938), 359- 364, [2J H. Benhke and K. Stein, Konvergenten- folgen von Regularitiitsbereichen und der Meromorphiekonvexitiit, Math. Ann., 116 (1939),204-216. [3J H. Behnke and P. Thullen, Theorie der Funktionen mehrerer komplexer Veriinderli- chen, Springer, 1934; second edition, 1970. [4J L. Bers, Introduction to several complex variables, Courant Institute, 1964. [5J S, Bochner and W. T. Martin, Several complex variables, Princeton Univ. Press, 1948, [6J J. A. Carlson and P. A. Griffiths, A defect relation for equidimensional holomorphic mappings between algebraic varieties, Ann. Math., (2) 95 (1972),557-584. [7J Sem. H. Cartan, Ecole Norm. Sup" 1951- 1952, [8J H. Cartan, Oeuvres, Springer, 1979. [9J S. S. Chern, On holomorphic mappings of Hermitian manifolds of the same dimension, Amer. Math. Soc. Proc. Symposia in Pure Math., 11 (1968),157-170, [ 10J C. Fefferman, The Bergman kernel and biholomorphic mappings of pseudoconvex domains, Inventiones Math., 26 (1974), 1-65, [ 11 J B. A. Fuks, Special chapters in the theory of analytic functions of several complex vari- ables, Amer. Math. Soc. Trans\. of Math. Monographs, 1965. [ 123 H. Grauert, Charakterisierung der holomorph-vollstiindiger komplexen Riiume, Math. Ann., 129 (1955),233-259. [ 13J H. Grauert, Holomorphe Funktionen mit Werten in komplexen Lieschen Gruppen, Math. Ann., 133 (1957), 450-472. [ 14J H. Grauert, On Levi's problem and the imbedding of real-analytic manifolds, Ann. Math., (2) 68 (1958),460-472. [ 15J H. Grauert and R. Remmert, Theory of Stein spaces, Springer, 1979. [ 16J R. C. Gunning and H. Rossi, Analytic functions of several complex variables, Prentice-Hall, 1965. [ 17J F. Hartogs, Zur Theorie der analytischen 108 Funktionen mehrerer unabhangiger Ver- iinderlichen, insbesondere iiber die Darstellung derselben durch Reihen, welche nach Potenzen einer Veriinderlichen fortschreiten, Math. Ann., 62 (1906),1-88. [\ 8J L. Hormander, U estimates and exis- tence theorems for the a operator, Acta Math., 113 (1965),89-152. [ 193 L. Hormander, An introduction to com- plex analysis in several complex variables, Van Nostrand, 1966. [20J G. Julia, Sur les families de fonctions analytiques de plusieurs variables, Acta Math., 47 (1926), 53-115. [21J H. Kazama, Approximation theorem and application to Nakano's vanishing theorem for weakly I-complete manifolds, Mem. Fac. Sci. Kyushu Univ., 27 (1973),221-240. [22J S, Kobayashi, Hyperbolic manifolds and holomorphic mappings, Dekker, 1970. [23J S. Kobayashi, Intrinsic distances, mea- sures and geometric function theory, Bull. Amer. Math. Soc., 82 (1976),357-416, [24J P. Lelong, Fonctions plurisoushar- moniques et formes differentielles positives, Cordon & Breach, 1968. [25J L. Nachbin, Holomorphic functions, domains of holomorphy and local properties, North-Holland, 1970. [26J S. Nakano, Vanishing theorems for weakly I-complete manifolds, II, Pub\. Res. Inst. Math. Sci., 10 (1974), 101-1 10. [27J 1. Naruki, On extendibility of isomor- phisms of Cartan connections and biholomor- phic mappings of bounded domains, T6hoku Math. J., 28 (1976),117-122. [28J T. Nishino, Nouvelles recherches sur les fonctions entieres de plusieurs variables com- plexes (II), Fonctions entieres qui se reduisent a celles d'une variables, J. Math. Kyoto Univ., 9 (1969), 221-274. [29J T. Ochiai, On holomorphic curves in algebraic varieties with ample irregularity, Inventiones Math., 43 (1977),83-96, [30J K. Oka, Sur les fonctions analytiques de plusieurs variables (colIected papers), Iwanami, 1961. [31J W. F. Osgood, Lehrbuch der Funk- tionentheorie II, Teubner, 1924; revised edi- tion, 1929 (Chelsea, 1965), [32J G. Scheja, Riemannsche Hebbarkeitssatze fiir Cohomologieklassen, Math. Ann., 144 (1961), 345-360, [33J K. Stein, Analytische Functionen meh- rerer komplexer Veriinderlichen zu vorgege- benen Periodizitiitsmoduln und das zweite Cousinsche Problem, Math. Ann., 123 (1951), 201-222, [34J H. Weyl and J. Weyl, Meromorphic functions and analytic curves, Ann. Math. 
109 Studies 12, Princeton Univ. Press, 1943. Also ...... references to 23 Analytic Spaces. 22 (1.12) Analytic Sets A. General Remarks The notion of analytic sets was first defined by N. N. Luzin and M. Va. Suslin in 1916, and it was extended to that of projective sets by operations such as complementation and pro- jection (Luzin, 1924). Most mathematicians, including Luzin and W. Sierpinski, who worked in this field, were in agreement with tFrench empiricism (or tsemi-intuitionism), which defended the standpoint of R. Baire, E. Borel, H. Lebesgue, and others. An object is said to be effectively given if it can be uniquely, individually, and unambiguously determined in finite terms so that anyone can reach the same object by following the defining pro- cedure. Semi-intuitionists claim that only effectively given objects have mathematical existence, and they do not recognize as a mathematical object something that needs the axiom of choice for its definition, From this point of view, tBorel sets were "well-defined" sets to which classical analysis had to be re- stricted. Thus the question was raised whether it is possible to extend the class of Borel sets to a wider class of sets with the same certainty. Lebesgue (J. Math. Pures Appl., 1 (1905» de- fined a function not belonging to any class of Baire functions by using the totality of tordinals of the second class. (Later, this method was systematically developed as the ttheory of sieves by Luzin.) However, it did not satisfy Borel as being effective. Can we, then, extend the Borel sets without any use of ordinals of the second class? The discovery of analytic sets gave an affirmative answer. In this article (except in Section 1), we treat a space (denoted by X, Y, . ) that is thomeo- morphic to a tcomplete tseparable tmetric space and its subspace. Denote by 91 the space of irrational numbers (a metric space consis- ting of the irrational numbers E R with the metric Ix - yl of x and y). The following prop- erties of a subset S of a space X are equivalent: (i) S is a continuous image of 91; (ii) S is a con- tinuous image of a Borel set in X; (iii) S is the projection of a closed set in a product space X x n; (iv) S is the projection of a Borel set in X x Y. We calI a set satisfying one of these properties an analytic set, an A set, or a 1: i set (in X). The complement of an analytic set is 22 c Analytic Sets called a complementary analytic set (or simply coanalytic set), a CA set, or ani set. B. The Operation A and Sieves When to each (n l , , nd of finite sequences of natural numbers there corresponds a unique element E(n" , n k ) of a family F of sets, this correspondence {E(n" . . , n,)} is called a schema of Suslin (or system of Suslin) cOnsist- ing of sets in F. Denoting an infinite sequence of natural numbers by {n;}, the set given by U:n j } nkE(nl, ,nd is called the kernel of a system of Suslin, and the operation of taking the kernel is caned the operation A (analytic operation). Let Q be the set of all rational numbers between 0 and 1 and F be a family of sets. Take a family { C,},EQ of sets belonging to F with the index set Q (or more geometrically, a subset C= U'EQ C, x (r) of X x Q when F is a family of subsets of a space X), and calI it a sieve consisting of sets in F. Denoting by {r;} a (strictly) monotone decreasing sequence of elements of Q, we call the set Uh} nk C'k (namely, the set of aII x such that C(X) == {r (x, rYE C} is not well-ordered by the order:::; of rational numbers) the set obtained by a sieve C or the sieved set obtained by C. If the family F is closed with respect to countable intersection, then the family of all sets ob- tained by sieves consisting of sets in F is ident- ical to the family of all sets obtained by apply- ing the operation A to F. When F consists of all the closed sets in a given space, this is the family of all analytic sets. In particular, it is sufficient to take the family of closed intervals as F when X is the space of real numbers. Note that we can define sieves and sieved sets more generally by using the space of real num- bers R instead of the set Q of rationals. C. Properties of Analytic Sets It is evident from the definition that every +Borel set is analytic. If a Borel set is uncount- able, then it is the union of a countable set and a one-to-one continuous image of m. The analyticity of sets is invariant under countable unions, intersections, and Cartesian products and the operation A and tBorel-measurable transformations. An uncountable analytic set contains a tperfect subset (Suslin). Therefore the possible cardinality of an analytic set is at most countable or is that of the continuum. Every analytic set enjoys the tBaire property, and in Euclidean space every analytic set is tLebesgue measurable (Luzin, Sierpinski). If a set E in the Euclidean plane is analytic (coana- 
22 D Analytic Sets lytic), then I(E) is also analytic (coanalytic), where I(E) is the set of all x such that the section E(x) of E that is parallel to the y-axis has a positive measure (M. Kondo and T. Tugue). Concerning the Baire property, the similar result for a set E of X x Y is obtained by replacing "to have a positive measure" by "to be the tsecond category (nonmeager) set" (A. S. Kechris). The Lebesgue measure of an analytic set is effectively calculable (in the sense that the measure of a Ei (or Ei(ex)) set is a Ei (resp. Ei(ex)) real number (- Section G and 356 Recursive Functions) (H. Tanaka). Every analytic (coanalytic) set E can be de- composed into  I Borel sets. This decompo- sition is called a decomposition of E into constituents. An analytic (coanalytic) set is a Borel set if and only if it is decomposable into a countable number of constituents (Luzin, Sierpinski). In a space with the cardinality of the continuum, there exist analytic sets that are not Borelian. For example, in the space C( [0, I J) of continuous functions on the inter- val [0, 1] ( 168 Function Spaces) the set of all differentiable functions is co analytic but not Borelian (S. Mazurkiewicz). The following theorems are especially im- portant in analytic set theory. Luzin's first principle (the first separation theorem): For every pair of disjoint analytic sets A" A" there exists a Borel set B such that A, c Band B II A, = 0. An immediate corollary of Luzin's first principle is Suslin's theorem: If both A and X - A are analytic,' then A is a Borel set. Luzin's second principle (the second separation theorem): For every pair of analytic sets A and B, there exist complementary analy- tic sets C and D such that A - B c C, B-Ac D, and C n D = 0. A one-to-one continu- ous image of a Borel set is Borelian (Suslin). More generally, for a given B-measurable function f defined on a Borel set B, the set A ( c f( B)) of all points y whose inverse images /-1 (y) are singletons is a complementary analytic set (Luzin's unicity theorem). In this theorem, we can replace "a singleton" by "a to"-compact set" (V. Ya. Arsenin, K. Kunugui). Therefore, if a set is the image of a Borel set by a continuous function such that the inverse image of each point is a o-compact set, then it is a Borel set. D. Generalization to Projective Sets A projective set of class n is inductively defined as follows: (i) the Borel sets are the projective sets of class 0; (ii) the projective sets of class 2n + 1 are the continuous images of the sets of class 2n; (iii) the projective sets of class 2n are the complements of the sets of class 2n - 1. 110 The projective sets of class 1 are exactly the analytic sets, and those of class 2 are the com- plementary analytic sets. The following are fundamental properties of projective sets. Denote by Ln the family of the projective sets of class n. Then (1) L 2n C L 2n +k and L 2n +1 C L 2n + 2 +k (k= 1,2, . ..). (2) the property of being a set of class n is invariant under count- able unions, intersections, and Cartesian products and homeomorphisms; (3) a contin- uous image of a projective set of class 2n + 1 is of the same class; (4) the projection on X of a set of class 2n + I in X x Y is a set of class 2n + 1 in X; (5) the family of the projective sets of class 2n + 1 in a space X is the family of the projections of all sets of class 2n in X x X (or X x \H); (6) the kernel of a system of Sus\in consisting of sets of class n is a projective set of the same class, where II #0,2. We frequently call a projective set of class 2n 1 a Pn set or a En 1 set, and that of class 2n a C n set or a Il set. A Bn set or a A set is a set that is both Pn and Cn' The respective families of these sets are also denoted by Pn (En, c (Il), and Bn (A), respectively. In general, for a family  of sets in a space X, we denote by C3' the family of the complements X-E of all sets E in , We write SePlm) and SePII() for the propositions obtained by substituting "set in 3'," "set in 3' and in C3'," and "set of C3''' for "analytic set," "Borel set," and "coanalytic set," respectively, in Luzin's first principle and Luzin's second principle, respectively. Fur- thermore, we say that the reduction principle holds for , and denote it by Redm), when for any sets A, BE  there exist A" Bl E  such that AI c A, BI c B, Al UBI = A U B, and A, n B I = 0, C. Kuratowski introduced the latter principle Redm) which implies Sepl(Cts') and SePII(C), and proved Red(Ili), Classi- cally, reduction or separation principles for the projective sets of higher classes were not settled except for Sepl(Ili), Sepu(Ili) (P. S. Novikov), and Red(Ei) (Kuratowski). Now- adays, it is known that these principles for {E,Il}n"3 are undecidable from the axiom- atic set theory ZFC. If we assume tthe axiom of constructibility V = L, then Red(E) holds for n  3 (1. W. Addison). On the other hand, under the assumption of tprojective deter- minacy PD ( Section H), it follows that Rcd(Il) (Rcd(En) holds when n is odd (even) (A. Martin, Addison and Y. N. Moschovakis). E. Universal Sets A set U in m x X is called the universal set for the projective sets of class n in X if for any projective set P of class II in X, there exists zoE91 such that P={xl(Zo,X)EU}. ConCern. 
111 ing universal sets, we have the following result: for every n > 0, there exists a universal set for the projective sets of class n in X that is of the same class in 91 x X. Hence in a space with the cardinality of the continuum, there exists a projective set of class n + 1 which is not of class n. In general, if a class 3' of sets is closed under taking a universal set and continuous preimages, then 3' and C3' cannot both have the reduction property. Therefore Red(l1n or Red( L') fails for each n ? 1. Any two universal sets of analytic sets are tBorel isomorphic. On the other hand, an analytic set E is not Borel isomorphic to any universal analytic set if the complement of E is an uncountable set without a perfect subset (A. Maitra and C. Ryll-Nardzewski), Hence, there are at least two equivalence classes of uncount- able analytic sets with respect to the Borel isomorphism  under the assumption of V= L. since the latter assumption implies the existence of such a set E (- Section H). F. Uniformization Principle The uniformization problem arose during investigations of imphcit functions, For a set E in a space X x Y, uniformization of E is the finding of a subset V of E such that VX(3Y((X,Y)E E)=:Jly({x, Y)E V), where 3!y is the tquantifier which means "there exists exactly one y." A Borel set can be uniformized by choosing a suitable coanalytic set (Luzin), Kondo's uniformization theorem (Japan. J. Math., 15( 1938» is the most impor- tant result in descriptive set theory: every co analytic set is uniformizable by a co analytic set. As a corollary to this, any L' set is uni- formizable by a L' set, and every L' set is obtained as a one-to-one continuous image of a co analytic set. Kondo's original proof was very difficult to understand. Since teffective descriptive set theory was introduced by Kleene and Ad- dison, the proof has been simpliti'ed by Ad- dison, and a more elegant one has been given by J. R. Shoenfield [7, p. 188]. Nowadays, the theorem is also called the Novikov-Kond6- Addison theorem, and is one of the most powerful and fundamental theorems not only in descriptive set theory, but also in the tfoun- dations of mathematics. The uniformization of an analytic set is, in general, not to be found among analytic or co analytic sets. There was a conjecture that any analytic set is uniformizable by specifying an A, set (difference of two analytic sets). Recently, this conjecture was negatively settled (J. Steel, Martin). Assuming that V= L, the 22G Analytic Sets uniformization of a L' (n ? 3) set is determined by specifying a L'; set, and that of a 11 (n ? 2) set by specifying a l1p set. On the other hand, if an axiom system of set theory (e,g., ZF; -" 33 Axiomatic Set Theory) is consistent, then it is still consistent even if we add to it the follow- ing proposition: There exists a 11 set whose uniformization is impossible by any choice of a definable set in the system (P. J. Cohen, A. Levy). However, if we assume the existence of a tmeasurable cardinal (Me), it is known that every l1i set is uniformizable by a l1i set (Martin and R. M. Solovay). On the other hand, PD implies "every m (L') set is uni- formizable by a 11 (L',) set for each odd (resp. even) n" (Moschovakis). There are sufficient conditions on sections E(X) of a Borel set E in the Cartesian product X x Y for E to be uniformizable by the Borel set as follows: all sections E(X) are (i) countable (Luzin, Novikov), (ii) a-compact (Arsenin, Kunugui); or, as "large section property," (iii) in the second category (S. K. Thomason, P. G. Hinman), or (iv) of positive measure (Tanaka, G. E. Sacks), For applications of descriptive set theory to analysis, an important uniformi- zation result is von Neumann's selection theorem: for any A set Ec X x Y, there exists a tBaire measurable and tabsolutely measur- able function r X --> Y (- 270 Measure Theory L (vi» such that Vx(3y((x, Y)E E)=(X,f(X)E E). Concerning implicit functions, any Borel set E c X x Y such that all sections E(x) are at most countable is expressed by a union of intersections of E with graphs of some Baire functions in: X --> Y, n EN (Luzin). Recently, it was shown known as a generalization of this that any Borel set E c X x Y such that all sections are a-compact is a union of countably many Borel sets En for each of which all sec- tions EX) are compact (J. Saint-Raymond). G. Kleene's Hierarchy and Effectiveness First, projective set theory in any space is reducible to the theory in the space of irra- tional numbers. Second, if we introduce a tweak topology in the set NN = "'w of tnumber- theoretic functions :x with one argument, the resulting topological space NN is homeomor- phic to the tBaire zero-dimensional space of irrational numbers. Third, any subset B of NN is Open and closed in this topology if and only if there exist a function (ENN) and a predicate A(ex) that is tgeneral recursive in  such that txEB= A{(ex). Fourth, logical opera- tions such as I, v , A, 3x (where x is a vari- able ranging over the natural numbers), and 3ex 
22 H 112 Analytic Sets exactly correspond to the operations (on sets) complementation, union, intersection, count- able union, and projection, respectively. On the basis of these facts, projective set theory is regarded as the theory of the NN tanalytic hier- archy of Kleene, Here, the following example is remarkable: We can construct a E\ set which is universal for the analytic sets (namely, a E: [N"] set) in the space of irrational nUm- bers ( 356 Recursive Functions H). The connection between projective set theory and logic has been discussed by C. Kuratowski and A. Tarski. From their point of view, semi-intuitionists such as Borel regard the set of natural numbers to be precisely clear in itself and also the continuum to be immedi- ately recognizable by our geometric intuition. In their argument rational numbers do not play such an important role. They take, a priori, the set of irrational numbers as the fundamental domain, and intervals with ra- tional extremities as the simplest sets of points among the subsets of the fundamental domain as the starting point of their argument. Here, the fundamental domain or each interval is not conceived as a totality of its elements, but recognized as a "uniform extent." In contrast to this, singletons and individual irrational numbers are not so simple. For this reason Borel introduced the notion of calculable numbers to study definable real numbers. Following Luzin, we say that a calculable number is a constructible real number in the sense that we can give it by an arithmetical approximation as precisely as we want. Now, this notion is nearly identical to the notion of an effectively calculable real number given by A. Church or A. M. Turing. In the mathematics of the semi-intuitionists, the word "effective" has played an especially important role, Although these mathema- ticians have always agreed not to accept the taxiom of choice, the exact meaning of "effec- tive" has differed slightly among different members of their group or in different stages of the development of the theory. Such dif- ferences mainly arose in connection with the question: How can we tell whether given en- tities are finitary or individual? One way to guess the original intention held by Borel and others when they used the term "effective" is to replace the term by "recursive." Nowadays, the concept of "effectiveness" is used in this sense ( 356 Recursive Functions C), and all clas- sical results in descriptive set theory essentially have effective versions (or refinements via relativization). For example, the Novikov- Kondo-Addison theorem is described as fol- lows: Any nl (nl ((X) for an exE"'W) set is uni- formizable by a n: (resp. nl (ex)) set. Similarly, Suslin's theorem is that every analytic and coanalytic (i.e.,,1: (a)) setA is a Borel set of class v(x) which is effective relative to the "definability ex" of given A, where v(a) is the +Constructive ordinal relative to !J. (Kleene, Tugue and Tanaka, A. Louveau, etc.). H Further Results in Axiomatic Set Theory and Strong Axioms The recent development of axiomatic set theory has yielded the following propositions. Under the assumption that V= L (K. Godel, Novikov, Addison): (1) there exists an uncount- able n: set that does not contain any perfect subset; (2) there exists a nonmeasurable ,;j  set; (3) there exists a ,11 set that does not have the Baire property. On the other hand, if the axioms of set theory ZFC plus "there exists an tinaccessible cardinal number" are consistent, then so is "every projective set is Lebesgue measurable, has the Baire property, and has a perfect subset when it is uncountable" with them (Solovay). Concerning these properties, tMartin's axiom MA and "2"°> I" (MA + I CH is consistent with ZFC; ...... 33 Axiom- atic Set Theory E) implies that every .r is measurable, and has the Baire property (Martin and Solovay). The possible cardinality of a E1 set is at most I or that of the continu- um, which is implied from "every E1 set is a union of l Borel sets" (Sierpinski). It is known that one cannot prove or refute from ZFC the converse of the latter statement. If one assumes MC (the existence of a measur- able cardinal), then every Ej set is a union of 2 Borel sets. Therefore MC implies that the possible cardinality of a Ej set is at most 2 or that of the continuum. With each set A c '"co we associate the following infinite game of perfect information G(A), played by two players I and II. First player I chooses no E w, then player II chooses 111 E w, then I chooses 11 2 E co, and so on. The game ends after co steps. Let r(k) = Ilk; if XE A, then I wins G(A); otherwise II wins. A strategy (for 1 or II) is a rule that tells the player what move to make, depending on the previous moves of both players. A winning strategy is one such that the player who follows it always wins. The game G(A) is determined if one of the players has a winning strategy. The axiom of projective determinacy PD is the assertion that for every projective set A( c '"co) the game G(A) is determined. PD is widely used to solve problems on projective sets that are not decid- able from ZFC, and is a plausible hypothesis that does not seem to contradict the axiom of choice at present; and it has pleasing conse- quences in descriptive set theory. For n 3 3, 
113 reductions and uniformization results of L' (n ) sets mentioned earlier are such examples. Furthermore, PD implies that every projective set is Lebesgue measurable, has the Baire property, and contains a perfect subset if it is uncountable. Now, write Determinacy (L') (Determinacy (A)), or simply, Det(L') (Det(A)), when G(A) is determined for every L' (A) set A. Using the fact that the open game is determined, namely Det(Ep) (D. Gale and F. M. Stewart), D. Blackwell has given a short proof of Sep[(L'f). The extremely difficult problem was to show Det(D in ZF (indeed, one cannot prove Det(AD only from the Zermelo axioms z; -' 33 Axiomatic Set Theory B). In 1975, Martin solved this problem. Concerning E!, Det(L'!) cannot be proved from only the ZF axioms, but MC implies Det(Ef) (Martin). However, it is known that Det(L'i) cannot be proved even if we assume MC. By a pre-well-ordering, we mean a relation  having all the properties of a well-ordering except for tantisymmetry. Denote by I) the least ordinal that cannot be the length of a preewell-ordering on "'w belonging to A, It is a classical result that I)) =W 1 (= K,). The following results are mainly due to Martin: I) :(2' MC =* () :( 3' and also, PD =* I)l :(4' Let K be an infinite cardinal. A set A c X is called K-Suslin when there exists a closed set CcXx wI( ("'K={tlf:w-->K}) such that X E A = 3f[f E ,oK /\ (x,f) E C]. A is o-S uslin iff A E L': If A is a L' set, then A is  l-Suslin (Shoenfield), Under the assump- tions of MC, if A is a L' set, then it is r Suslin (Martin). Furthermore, Det(An) implies the following facts (Kechris, Moschovakis): every L'n+2 set is K-Suslin, where K is the cardinal of /)n+1 : every Eint! set in K-Suslin for some K < /)n+l The axiom of determinacy AD is the asser- tion that the game G(A) is determined for every set A. AD implies that every set is measurable (1. Mycielski and S. Swierczkow- ski), has the Baire property, and has a perfect subset if it is uncountable (Morton Davis). Though AD contradicts the axiom of choice, some consequences of AD in the area of pro- jective sets are more desirable than the conse- quences of the axiom of choice, The following are examples of consequences of AD. (1) A is a L'l set iff A is a union Ofl Borel sets (Mos- chovakis). (2) For each n, I) is a cardinal (Moschovakis). (3) Generalization of Suslin's theorem holds as follows: A = lBo' (= the smallest tBoolean algebra containing the open sets and closed under complementation and unions of length <I)) (Martin, Moschovakis). 221 Analytic Sets I. Polish Spaces, Luzin Spaces, and Suslin Spaces A topological space homeomorphic to a com- plete separable metric space is called a Polish space. A subspace E of a Polish space X is Polish if and only if it is a Grsubset of X, i.e., a countable intersection of open subsets of X (Aleksandrov and Uryson). A Hausdorff top- ological space X is called a Luzin space (resp. Suslin space) if we can find a Polish space S and a continuous bijective (resp. surjective) mapping f: S --> X. Every Polish space is a Luzin space and every Luzin space is a Suslin space, Let X be a Hausdorff topological space, A subset E of X is called a standard set (resp. analytic set) if the set E with the relative top- ology is a Luzin (resp. Suslm) space. The analytic subsets are closed under analytic operations. Every analytic set is obtained from closed sets by applying the analytic operation. Every analytic set is tuniversally measurable. Every standard set is a Borel set. Every count- able union or intersection of standard sets is standard. A subset E of a Suslin space is a Borel set if and only if both E and E' are analytic. A subset E of a Luzin space is a Borel set if and only if E is standard. Let f be a Borel measurable mapping from a Suslin space X into another Suslin space Y. Then the image f(X) is an analytic subset of Y. Furthermore, if f is injective, then fgives a tBorel isomorphism between X and f(X), If both X and Y are Luzin spaces and if f is injective, then f(X) is a Borel subset of Y. Every Suslin space is tBorel isomorphic to an analytic subset of R, and every Luzin space is Borel isomorphic to one of the following spaces: (1) N n = {l, 2".. ,n}, (2) N = {I, 2, ...}, (3) R. The selection theorem due to von Neumann (- Section F) holds when X and Y are Suslin spaces. This fact and its ramifications [ 15,161 are useful in nonlinear functional analysis and control theory. Practically all useful spaces appearing in functional analysis and probability theory are Polish or Luzin spaces, Examples: (i) Every locally compact Hausdorff space with a count- able open base is Polish. (ii) Every separable Banach space is Polish. (iii) The set C of all continuous functions on [0, IJ with the Hop- ology of uniform convergence is a Polish space. (iv) The set D of all right continuous functions on [0, I J with left limits is a Polish space when it is endowed with the Skorokhod topology (- 250 Limit Theorems in Proba- bility Theory E). (v) The space!Y' of distri- butions and the space /f" of tempered distri- 
22 Ref. Analytic Sets butions (- 125 Distributions and Hyperfunc- tions) are Luzin spaces. References [IJ e. Kuratowski, Topologie I, Warsaw, revised edition, 1948. [2J A. A. Lyapunov (Liapunov), E. A. Shche- gol'kov (Stschegolkow), and V. J. Arsenin, Arbeiten zur deskriptiven Mengenlehre, Deut- scher Verlag der Wissenschaften, 1955. [3J N. N. Luzin, Sur les ensembles analytiques, Fund. Math., 10 (1927).1-95, [4J N. N. Luzin, Lec;ons sur les ensembles analytiques et leurs applications, Gauthier- Villars, 1930. [5J W. Sierpiilski, Les ensembles projectifs et analytiques, Memor. Sci. Math., Gauthier- Villars, 1950. [6J H. Rogers, Jr., Theory of recursive func- tions and effective computability, McGraw- Hill, 1967, ch. 16. [7J J. R. Shoenfield, Mathematical logic, Addison-Wesley, 1967, ch. 7. [S] D. A. Martin, Descriptive set theory: Pro- jective sets, Handbook of Mathematical Logic, North-Holland, 1977, ch. e.8. [9J T. Jech, Set theory, Academic Press, 1978, pI. IV. [ 10J P. G. Hinman, Recursion-theoretic hierarchies, Springer, 1978, pt. B. [ 1 I J Y. N. Moschovakis, Descriptive set theory, North-Holland, 1980. [12J e. A. Rogers, J. E. Jayne, e. Dellacherie, F. Topsoe, J. Hoffman-Jorgensen, D. A. Mar- tin, A. S. Kechris, and A. H. Stone, Analytic sets Academic Press, 1980. [ 133 N. Bourbaki, Elements de mathematique, Topologie generale, Hermann, 1974, ch. X, sec. 6, [ 14J J. Hoffmann-Jorgensen, The theory of analytic spaces, Various publication series 10, Math. Institute, Aarhus Univ., 1970. [ 15J M.-F. Sainte-Beuve, On the extension of von Neumann-Aumann's theorem, J. Func- tional Anal., 17 (1974), 112-129. [ 16J A. F. Filippov, On certain questions in the theory of optimal control, SIAM J. Con- trol, 1 (1962), 76-84. 23 (X1.21) Analytic Spaces A. General Remarks An tanalytic function of a complex variable has as its natural domain of detlnition a +Rie- 114 mann surface, i.e., a 1 -dimensional complex tanalytic manifold. In the case of several com- plex variables, the set of zeros of an analytic function, the quotient space of a domain by a tproperly discontinuous group of analytic automorphisms, the existence domain of an talgebroidal function, etc., are, strictly speak- ing, not necessarily complex analytic mani- folds, It is necessary to consider a more gen- eral category of complex analytic manifolds with singularities, and the notion of analytic spaces is drawn from these examples. Many of the properties of complex analytic manifolds are extended to analytic spaces; on the other hand, theories specific to analytic spaces have also been developed. B. Analytic Sets We say that a subset A of a complex analytic manifold G is an analytic set in G if it is a closed subset and each point of A has a neigh- borhood U such that UnA is the set of com- mon zeros of a finite number of holomorphic functions in U. Specifically, if A is locally the set of zeros of a single holomorphic function that does not vanish identically, then A is called principal. Two subsets 51 and S2 of G are called equivalent at ZO E G if there exists a neighborhood U of ZO such that Sl n U = S2 n U, By this equivalence relation, every subsets S of G defines its germ SzD at zO, A germ of an analytic set at Zo is the germ at ZO of an analytic set in a neighborhood of zo, Each germ A, of an analytic set at Zo = OE Gis associated with an ideal I(A o ) = {II jEH(O), [I A, =O} in the ring H(O) of tgerms of holo- morphic functions at O. We call A, reducible if A, is the union of two germs of analytic sets A,' and A," with Ao' # A" Ao" # A,; other- wise, A, is called irreducible. An analytic set A is called irreducible at 0 if the germ at 0 of A is irreducible. Properties of A, and I(Ao) cor- respond to each other. Thus A, is irreducible if and only if I( A,) is prime. As the ring H(O) is tNoetherian, in a neigh- borhood of every point ZO an analytic set A is represented as a union of a finite number of analytic sets A. that are irreducible at zO, These A. are essetially unique. If an analytic , . set A is irreducible at zO, then there eXists a system of local coordinates (z" . , z,) centered at ZO and a pair of natural numbers d .s; n and k such that, in a neighborhood of zO, A is a k- sheeted ramified covering space with covering mapping rp: (z 1> . . . . zn)-->(z" , Zd); i.e.. for an analytic set R in a neighborhood of 0 E Cd, (p : A - rp -I (R)--> Cd - R is, in a neighborhood of ZO a k-sheeted covering mapping, where A rp '-1 (R) is a connected d-dimensional com- 
115 plex analytic manifold in the neighborhood. The coordinates of the k points of A - <p -I (R) over each point (Zl , , Zd) are holomorphic in these variables. The number d is the (local) dimension of A at ZO and is denoted by dimzo A. From this local representation, we obtain Riickert's zero-point theorem: Every prime ideal  in H(O) is equal to /(Ao), with A, an irreducible germ of an analytic set at O. In this case the dimension at 0 of an analytic set A that defines A, is equal to the tKrull dimen- sion of the t\ocal ring H(O)/. The theory of local rings is very important in the study of germs of analytic sets. The dimension of a general analytic set A at ZO is deti'ned by dimzoA ==suPidimzoAi' where A = UiAi in a neighborhood of zU, with the A, irreducible at ZO If dimzo Ai is equal to d for all i, then A is called purely d-dimensional at zoo The (global) dimension of A is defined by dim A = SUpzEA dim, A. A purely d-dimensional ana- lytic set is defined to be an analytic set that is purely d-dimensional at everyone of its points. A point ZO of an analytic set A is called ordinary (regular or simple) if A has the struc- ture of a complex analytic submanifold in a neighborhood of ZO The set A' of ordinary points of A is dense and open in A. The set A * = A -A' of singular (not ordinary) points is an analytic set in G. If A is purely d-dimensional, then A' is a d-dimensional complex analytic manifold and A * is an analytic set of dimen- sion:::;d- 1. Let A be an analytic set of dimension d in G, and B a purely d'-dimensional analytic set in G -A with d' > d. Then the closure if of B in G is a purely d' -dimensional analytic set (Remmert- Stein continuation theorem [1 7]). For every analytic set A in G, the tanalytic sheaf f(A) of germs of holomorphic functions over G that vanish on A is tcoherent (H. Caf- tan). We call .!(A) the sheaf of ideals defined by an analytic set A. Let {!}G be the sheaf of germs of holomorphic functions over G. Then (I'A = ({!}dJ(A» I A is a coherent sheaf of rings over A. (]JA is called the sheaf of germs of holo- morphic functions on an analytic set A. C. Analytic Spaces A tringed space (X, 0,) with Hausdorff base space X is called an analytic space if for every point x EX, there exists an open neighborhood U of x such that the ringed space (U, (l)x \ U) is isomorphic to a ringed space (A, (l)A), where A is an analytic set in an open set G of some C". The structure sheaf {!)x is then called a sheaf of germs of ho10morphic functions. The notion of tholomorphic mapping from one open set in en into another is generalized to the case of 23 D Analytic Spaces mappings from one analytic set into another. An analytic set Y in an analytic space X and the sheaf (l)y of germs of holomorphic functions on Y are deti'ned as in the case where X is a complex manifold. The ringed space (Y, 0,) is an analytic space and is called an analytic suhspace of X. For an analytic space X, the notions of dim, X, dim X, irreducibility, and pure dimensionality are deti'ned as for an analytic set A c G c C". Every analytic space X is the union of a locally finite family of irreduc- ible analytic subspaces X, called the irreduc- ible components of X Let ({J : X --> Y be a holomorphic mapping of an analytic space X into another, Y. Its rank at XE X is deti'ned by r",(x) = dim, X- dimx<p-l(<p(x)). The number r",=suPXEXr",(X) is called the rank of q;, The set of degeneracy E", of q; is the set of points X E X such that r'l'IX'(x) < r'l'lx' for an irreducible component X' of X through x. The mapping If' is 000- degenerate if E", = 0. For any kEN, (x E X r",(x)  k} is an analytic set (R. Remmert [ 16J). In particular, E", is analytic. For a holomor- phic mapping ({J :X --; Y, the inverse image of an analytic set in Y is an analytic set in X. However, the image of an analytic set is not necessarily analytic. If cp is tproper, then the image <p(X') of an analytic set X' in X is an analytic set in Y of dimension rq>jX' and is irreducible if X' is irreducible (Remmert's theorem [ 16J). D. Modifications and Resolution of Singu1arities Let M be a subset of an analytic space X. If, for every point x E X, there exists an open neighborhood U of x and and an analytic set M* in U, containing un M such that U-M* is dense in U, then 1'v1 is called analytically thin. Let ({J: X --; Y be a holomorphic map- ping. Suppose that there exist two analyti- cally thin sets A4 c X, N c Y such that rp induces an isomorphism between X -M and Y - N. Then X is called a ho10morphic modifi- cation of Y. If furthermore ({J is proper, then X is called a proper modification of Y. A mon- oidal transformation of an analytic space X with respect to a coherent sheaf of ideals.! is deti'ned as in the case of a complex manifold or an algebraic variety ( 16 Algebraic Varieties L; 72 Complex Manifolds H). It is a proper modification f: x* --> X such that the inverse image ideal sheaf f-lf. {!}x* is inyertible, and I is universal among all proper holomorphic mappings h: Z --; X with the property that h -If. 0z* is invertible, where f -I, is the tinverse image of , If ,/ is the sheaf of ideals deti'ned by an analytic set Y in X, the mon- 
23E Analytic Spaces oidal transformation of X with respect to ,f is often called the blowing-up of X with center Y. H. Hironaka [14J proved that, if X is an analytic space which is countable at infinity (i.e., a countable union of compact sets), then there is a proper modification n: X' --> X with X' smooth (i.e., free from singular points). Such a modification is called a desingu1arization (or resolution of singu1arities) of X Moreover, over any relatively compact open set U of X, n is the product of a finite seq uence of blowing- ups n i : X;-->Xi_1 (Xb =X), with smooth centers 1';'-1 along which Xi-l is tnormally fiat. This deep result enables one to derive properties of analytic spaces from those of complex manifolds. Let X and Y be two analytic spaces and G an analytic set in X x Y. If the canonical pro- jection n:G-->X is a holomorphic (or proper) modification, then we say that a meromorphic mapping (a proper meromorphic mapping) 11 of X into Y is defined. The set G is then called the graph of ,.. A holomorphic mapping i{J: X --; Y can be viewed as a proper meromorphic mapping. Let f.!: X  Y be a proper meromor- phic mapping. Then II(X) (the projection of n- 1 (x) into Y) is a nonempty analytic set in Y for every point x E X. Moreover, there exists an analytic set N, with X-N dense in X, such that J1 maps X-N into Y holomorphically. The smallest set N with this property is called the set of points of indeterminacy or the sin- gularity set of /.I. A meromorphic mapping f:X -->pl(C) is called a meromorphic function on X if none of the irreducible components of X is mapped to \ oo} by f The set.f -I (0) = n(X x {O} ) n G) is called the set of zero points of X, and the set f-l (00) is called the set of poles. These are analytic sets in X. Let II> . , h be meromorphic functions on X. Then, by a suitable proper modification of X, one can eliminate the points of indeterminacy of the meromorphic mapping f: X -->(PI(C))k defined by X-->(fl (x), . ,Mx»), i.e., one can modify f to be holomorphic. The ring of meromorphic functions on X is invariant under proper modifications of X. If X is irreducible and compact, then the field of meromorphic func- tions on X is a simple algebraic extension of the tfield of rational functions of k (:::; dim X) variables. Let (X, 0,) be an analytic space. A point x E X is called normal for X if (i)x,x is a tnormal local ring. The set of nonnormal points for X is an analytically thin analytic set in X (K. Oka). Every ordinary point of X is normal. We call X normal if everyone of its points is normal. Every nondegenerate holomorphic mapping of an irreducible X into an irreduc- ible and normal Y is an open mapping if its rank is equal to the dimension of Y (Remmert). 116 For every analytic space X, there exists a proper modification v:X -->X with X normal such that v is nondegenerate and v X -V-I(S) is an isomorphism, where S is the set of non- normal points. Such a proper modification of X is unique up to isomorphisms. We call X a normalization of X with normalizing map- ping v, Let ({J : X --; Y be a holomorphic modifi- cation. Suppose that Y is normal at <p(x o ) (XOEX). If the set <p-I(<p(X O )) contains an iso- lated point, then <p-I(<p(XO))=x o , and ({J is an isomorphism in a neighborhood of X O (an analog of tZariski's main theorem). In partic- ular, if ({J : X --> Y is a holomorphic modifica- tion and Y is normal, then <p maps X - E", isomorphically onto the dense open set <p(X E",). Furthermore, if a holomorphic mapping <p : X --; Y is injective and X and Y are irreduc- ible, normal, and n-dimensional, then <p(X) is an open set in Y and ({J-I: <p(X)......X is holomorphic. E. Analytic Spaces in the Sense of Behnke and Stein Let ({J : G--> G be a proper continuous mapping of a connected tlocally compact space G onto a domain G c en. The triple (fj = (G, <p, G) is an analytic covering space over G if the following conditions are satisfied: (i) <p -I (zo) is a finite set for every point ZO E G. (ii) There exists an ana- lytic set A c G of dimension::::; n - 1 such that qJ G - <p -I(A) is a local homeomorphism and every point of <p -I (A) has a fundamental sys- tem of neighborhoods U such that both U and U - <p -I (A) i- 0 are connected, As (Ij is unramitled over G-A, the number of points in rp -I (zo) is constant for ZO E G-A and is called the number of sheets of ffi. A point Z EGis called a ramification point of(fj if the restric- tion of ({J to any neighborhood of Z is not a homeomorphism. Denote by B the set of ramification points of ffi. Then <p(B) c A is an analytic set of dimension n - 1. Let f be a continuous complex-valued function in an open set D in G, We calI I ho10morphic in D if for every point ZO ED B and for every open neighborhood V of ZO = <p(zo) over which <p is a homeomorphism, 10 <p -I is holomorphic in V Denote by (!)f; the sheaf of germs ofholo- morphic functions over G. Then (G, (DG) is a ringed space, An analytic space in the sense of Behnke and Stein is a Hausdorff ringed space (X, 0,) that is locally isomorphic to a ringed space of the form (G, ((,G) [3]. tRiemann's theorem on removable singularities holds for such spaces. Every normal analytic space is an analytic space in the sense of Behnke and Stein. An analytic covering space (Ij = (G, <p, G) 
117 is a C-covering space (covering space in the sense of Cartan) if for every point ZO E G there exist an open neighborhood V of z" and a holomorphic function 9 in U = ({J -I (V) which can be defined by an irreducible polynomial of degree k such that its coefficients are holo- morphic functions on V and the coefficient of its highest term is I, where k is the number of sheets of l\). A C-ana1ytic space is an analytic space in the sense of Behnke and Stein that is locally isomorphic to a C-covering space. The category of C-analytic spaces coincides with that of normal analytic spaces. According to H. Grauert and Remmert [ 10J, every analytic covering space is a C-analytic covering space. Therefore every analytic space in the sense of Behnke and Stein is a normal analytic space. Let R be an equivalence relation in an ana- lytic space X. Given a subset A of X, denote by R[A] the set of points of X which are R- equivalent to points of A. We call R proper if R[ K] is compact for every compact set K in X. Let rp be a proper holomorphic mapping of an analytic space X into another Y. For x, X'E X, let x =x' (R) be defined by <p(x) = <p(x'). The equivalence relation R is then proper. We consider the quotient space X / R and the canonical projection p: X ...... X / R. With each open set U in the quotient space X/ R we can associate the ring of holomorphic functions in p-I(U) that are constant on p-I(X) for every x E U. This leads to a ringed space (X/R,0 x /R), which is proved to be an analytic space by Grauert's theorem [9]: All the tdirect images of a coherent analytic sheaf over X by a proper holomorphic mapping (p: X --> Y are coherent. For every proper equivalence relation R in X, the ringed space (X/R, (()x/R) is an analytic space if and only if for every point X EX/ R there exists an open neighborhood V of x such that functions in f( V, flx / R) separate the points of V (H. Cartan). F. Stein Spaces For an analytic space (X, (Ox) let us consider the following conditions: (i) Functions in I(X, (Ox) separate the points of X. (ii) X is K- complete; i,e" for every point XEX there exist a finite number of J;E I(X, (()x) (i = 1, , k) such that the holomorphic mapping f == ( f;): X -->C k is nondegenerate at x. (iii) Every compact analytic set in X is a finite set. Condition (i) implies (ii), and (ii) implies (iii). If an irreducible analytic space X is K-complete, then X is a countable union of compact sets (Grauert [SD. In fact, if n == dim X, there exist functions .t;E r(X, (Ox) (i == 1, . . " n) such that the holo- morphic mapping f =(J;): X -->C" is nondegen- 23G Analytic Spaces erate. The notion of holomorphic convexity (21 Analytic Functions of Several Com- plex Variables H) is carried over to analytic spaces. For a holomorphically convex analytic space, conditions (i), (ii), and (iii) are equivalent (Grauer [8J). A Stein space (or holomorphi- cally complete space) is a holomorphically convex analytic space that satisfies one of the conditions (i), (ii), or (iii). In a holomorphically convex analytic space (X, (!)x), let R be the equivalence relation defined by I(X, 6,); i.e., for x, x' EO X, X:= x'(R) if and only if f(x) == f(x') for every f E r(X,f x). Then R is proper. The analytic space (X/R. (()x/R) is a Stein space. A Stein space is a generalization of the notion of a Stein manifold. Fundamental theorems A and B on Stein manifolds hold Verbatim for Stein spaces ( 21 Analytic Functions of Several Complex Variables L). Therefore the main properties of Stein manifolds are in- herited by Stein spaces. Let <p: X -4 Y be a holomorphic mapping of an analytic space X into another, Y. If for every X E X all the COn- nected components of the fibers <p -I (<p(x)) are compact, then the equivalence relation R' defined by those components (i.e" for x and x' in X, x == x'(R') if and only if x and x' belong to the same component of <p -I (<p(x))) is proper, and the ringed space (X/R', flx/R') is an ana- lytic space. In particular, if X is a holomor- phically convex irreducible analytic space and R is the equivalence relation defined by f(X, (I:'x), then all the fibers of the canonical projection p: X -'t X / R are connected. G. Further Topics The notion of analytic space can be gen- eralized as follows (Grauert [9J), A ringed space (X, (l::x) is a general analytic space if it is locally isomorphic to a ringed space (A, ), where A is an analytic set in a domain G S C", and y{ = ((!)G/ f) I A for some coherent analytic sub sheaf j of .Jf(A) such that Supp«(f!G/J') (= {z E G «(l::G//)z i- O}) = A. An analytic sub- space of (X, fl'x) is a ringed space (Y, (Oy) where Y = Supp((()x/J) and (l)y = (ilx/,f for some co- herent sheaf of ideals .f of (I:'x. (Y, (I:,y) is also a general analytic space. A. Douady [6J showed that for any general analytic space (X, (C'x) there exists a natural structure of a general analytic space on the totality of all the com- pact analytic subspaces of (X, (()x)' The result- ing analytic space is called the Douady space of (X, 0x). For the proof, the notion of a Banach analytic space is used, which is 0 b- tained by first defining analytic subspaces in an open subset of a complex tBanach space and then patching them. A Douady space can 
23 Ref. Analytic Spaces be used, for example, to show that the identity component of the analytic automorphism group of a compact Kiihler manifold is a tcomplex Lie group which is naturally an extension of a tcomplex torus by a tlinear algebraic group (A. Fujiki, D. Lieberman; 232 Kahler Manifolds C). References [1 J S. S. Abhyankar, Local analytic geometry, Academic Press, 1964. [2J C. Banica and O. Stanasila, Algebraic methods in the global theory of complex spaces, Wiley, 1976. [3J H. Behnke and K. Stein, Modillkation komplexer Mannigfaltigkeiten und Riemann- scher Gebiete, Math. Ann., 124 (1951),1-1 6. [4] L. Bers, Introduction to several complex variables, Courant Institute, 1963. [5J Sem. H. Cartan, Ecole Norm. Sup., 1953- 1954, 1957-195 1960-1961. [6J A. Douady, Le probleme de modules pour les sous-espaces analytiques complexes d'un espace analytique donne, Ann. Inst. Fourier, 16 (1966),1-95, [7J G. Fischer, Complex analytic geometry, Springer, 1976. [8J H. Grauert, Charakterisierung der holomorph-vollstandiger komplexen Raume, Math. Ann., 129 (1955),233-259. [9J H. Grauert, Ein Theorem der analytischen Garbentheorie und die Modulraume kom- plexer Strukuren, Pub!. Math. Inst. RES, 5 (1960). [ 10J H. Grauert and R. Remmert, Komplexe Raume, Math. Ann., 136 (1958),245-318. [ 1 IJ H. Grauert and R. Remmert, Analytische Stellenalgebren, Springer, 1971. [ 12J R. C. Gunning and H. Rossi, Analytic functions of several complex variables, Prentice-Hall, 1965. [ 13J M. Herve, Several complex variables, local theory, Tata Inst. Studies in Math., Oxford Univ. Press, 1963. [ 14J H. Hironaka, Desingularization of complex-analytic varieties, Act. Congr. Intern. Math., Nice, 1972, Gauthier-Villars, vol. 2, 6277632. [15J R. Narashimhan, Introduction to the theory of analytic spaces, Lecture notes in math. 25, Springer, 1966. [ 16J R. Remmert, Holomorphe und meromOr- phe Abbildungen komplexer Rimme, Math. Ann., 133 (1957), 3288370. [ 17J R. Remmert and K. Stein, Uber die we- sentlichen Singularitaten analytischer Mengen, Math. Ann., 126 (1953),2633306. Also  references to 21 Analytic Functions of Several Complex Variables. 118 24 _(Xx1.1) Ancient Mathematics A. General Remarks To determine the beginning of the history of mathematics, one must detlne the term "math- ematics." Only speculation based on the ob- servation of primitive peoples today Can be made regarding the development of the number concept among prehistoric peoples. The pre- historic period ended in Egypt and Meso- potamia c. 3000 B.C., and a little later in the valleys of the large rivers in India and China. Since the basis of the civilizations in the river valleys of the ancient world was agricul- ture, the administrators first had to control watering systems through inigation, drainage, pumping, and canalization; second, they had to measure land and harvests for tax collec- tion; and third, they had to establish a calen- dar by observation of the heavenly bodies. All these tasks demanded some knowledge of mathematics. Additional knowledge of mathematics was certainly needed for construction of the pal- aces and tombs. We know something of the development of mathematical knowledge during these ages from some recovered arti- facts, but there remains the possibility of new finds that will bring about a basic change in our knowledge of the history of mathematics during this period. B. Mathematics in Egypt The main sources for our understanding of the history of mathematics in Egypt are the Mos- cow papyrus and the more important Rhind papyrus, both discovered in the 19th Century. The Greeks place the origin of their mathe- matics in Egypt, but it seems that Egyptian mathematics was limited to practical mathe- matics. The Egyptians had a decimal numera- tion system, but the place value was not clear; they used fractions, which they always decom- posed into the sums of unit fractions (i.e., frac- tions with 1 as numerator); they solved the problems of everyday arithmetic that were reducible to linear equations; they computed approximate areas and volumes of some fig- ures for the purpose of measurement of farm- land or granaries and for construction work; they had exact formulas for the computation of areas of triangles and of trapezoids; and they used (16/9)2 = 3.1605 as the value of n; but no trace has been found to prove the existence of demonstrative mathematics in ancient Egypt. 
119 C. Mathematics in Mesopotamia Sources abound for the study of Mesopota- mian mathematics, and these sources may very well increase in the future. The Meso- potamians kept exact records of astronomical observations for long periods of time. Their more advanced mathematics was not limited to practical use, as was that of the Egyptians. They used a sexagesimal system of numeration with place value, and also used sexagesimal fractions; however, they lacked a cipher to denote zero until the 4th Century H.C, and they did not have a symbol corresponding to our decimal point, so that exact place value had to be determined from the context of each ex- pression. They had a multiplication table and tables of inverses, squares, and cubes of num- bers, and they used these tables to solve equa- tions, even some simple equations of the third degree, as well as simultaneous equations of the second degree for two unknowns. They had accurate solutions for quadratic equations (expressed in words); they discarded negative roots, but they admitted both positive roots when two existed. They studied integral solu- tions of a 2 + b 2 = c 2 (the largest of their solu- tions were 12,709, 13,500, and 18,541) and approximate computation of quadratic roots. which suggests some relation to Greek mathe- matics. We have evidence that some of the geometric algebra in Euclid's Elements can be traced to Mesopotamian algebra. Some his- torians also affirm that the concept of dem- onstration in Greek mathematics originated with the Mesopotamians, but this theory lacks suffcient proof. By the 7th Century, the Mayas in Central America also possessed a numeration system, with the base 20. As far as we know, the Meso- potamians and the Mayas were the earliest people to possess numeration systems with place value (- 57 Chinese Mathematics, 187 Greek Mathematics, 209 Indian Mathematics). References [IJ O. Neugebauer, Vorlesungen iiber Ge- schichte der antiken mathematischen Wissen- schaften I, Springer, 1934. [2J O. Neugebauer, Mathematische Keilschrift-Texte, Springer, 1935. [3J A. B. Chace, The Rhind mathematical papyrus II, Mathematical Association of America, 1929. [4J B. L. van der Waerden, Science awakening, Noordhoff, 1954. [5J O. Neugebauer, The exact sciences in antiquity, Brown Univ. Press, second edition, 1957. 25 B Approximation Methods in Physics [6J K. Vogel, Vorgriechische Mathematik I, Vorgeschichte und Agypten II. Die Mathe- matik der Babylonier, Schroedel, 1958-1959. 25 (XX.35) Approximation Methods in Physics A. Introduction It is often possible to solve a differential equa- tion of mathematical physics analytically by expanding the solution in a series with respect to a small ( or large) parameter involved in the equation. In general, such an approach is called a perturbation method (- Section B). For an equation which is difficult to treat analytically, it is possible to get a numerical solution either by replacing the derivatives by difference quotients (tdifference method), or by expanding the solution in terms of suitable functions and determining the expansion coef- ficients numerically from the equation re- written in a weak form (tvariational method, tmethod of weighted residuals, tGalerkin's method, tfinite-element method, etc.). For such numerical methods  46 Calculus of Vari- ations, 301 Numerical Solution of Algebraic Equations, 302 Numerical Solution of Linear Equations, 303 Numerical Solution of Ordi- nary Differential Equations, 304 Numerical Solution of Partial Differential Equations, 441 Variational Principles, In Section C we describe a method for deriv- ing an asymptotic expression for a function written as an integral in a complex plane. B. Perturbation Method (1) Regular Perturbation. (a) Initial-value prob- lem. Consider an initial-value problem of an ordinary differential equation involving a small parameter F.: du dt = f(t, u; 6), (1) u=b(e) at t==O. (2) We assume that f is sufficiently smooth as a function of t and is regular as a function of u and £, and that b is a regular function of c;, Substituting the power series in E for j, u, and b, f{t, U; E) = f!°)(t, u) + Efl l)(t, u) +, u(t;F.)==u(O)(t) HU(I)(t)+ ...) b(f.) = b(O)+ "b(l)+, 
25 B 120 Approximation Methods in Physics into equations (1) and (2) and equating terms of equal power, we get du(O) ---;[t = P O )( t, u(O)), U(OI(O) = bioi, du'" ----;It = fu(O)(t, dO»u(t) + f(l)(t, u(O», u(1)(O) = b(l), dU(2) I _= r(O) ( t U(O» ) U(2)+_ r(o) ( t U(0» ) U(1)2 dt Ju, 2 Juu , + j,}l)(t, U(O»)u(l) +P2)(t, U(oI), d 2 )(0) = b(2), the solutions of which determine the perturba- tion series for U, The equation for u(O) (the unperturbed equation) is nonlinear in general, but those for u(1), U (2 ), are inhomogeneous linear equations with identical principal parts. The system of differential equations duo i=/;(I,U"""Un;F.) (i=1,2,.."n), ui=b;(F.) at t==O. or a differential equation of higher order, dnu ( dn-I U ) dt" == f t, u, ..., dt n - I ; 0 , d"-I U u=b(O)(f.) , .." dt n - l =b(n-I)(c:) at t=O, can be treated in a similar way. (b) Boundary-value problem. Given a linear differential operator H = Ho + f. V involving a small parameter f. linearly (H o is a linear differential operator and Vis a function of (X I ,X2'''''X,», consider the boundary-value problem (H O +f. V)u =.f(XI' ..., x n ) (x]> ..., Xn)EQ, (3) Bu=O (xI'"'' Xn)EoQ, (4) where B is a linear and homogeneous operator. Substitution of the Taylor expansion u=UO+W I + into equations (3) and (4) gives Houo=f, Bu, =0, Hou l + Vuo=O, Bu, =0, of which the solutions are UO== Kf u 1 = - K VKf, u2==KVKVKf,.... Here K '= H o - I is an integral operator such that, for any function U = v(x l ' X2, , x,), we have (Kv)(x], ...,x n )= G(x l' ,,,,xnl,, ""n) s s . X V(I . , n)d1 dn' where G(x" ..., xnl I , "" n) is the Green's function for the unperturbed problem. The perturbation series u=Kf -f.KVKf+f.2KVKVKf -... converges if VK is bounded and 101 < III1 VKII, (c) Eigenvalue problem. The solution of an eigenvalue problem (HO+f.V)u==}u (5) with the boundary condition Bu = 0 for the operators considered in (b) can be obtained in power series in /;: }.=A O +c.}(l)+f. 2 }Pl+..., u= U o +w(t) +f.2 U (2) +..., where u o and A.o are the eigenfunction and nondegenerate eigenvalue, respectively, of the unperturbed operator H o , which is assumed to be self-adjoint. Substitution of the series into (5) gives (Ho-)'O)u(l) = - Vuo+},(l)u o , (Ho -}'O)u(2) = Vu(1)+A(2)u o +A(l)u(t) (Ho -}'0)U(3) = - VU(2)+A(3)U o + A(2)d l ) +)'p)U(2), (6) If we normalize U o and U by the conditions (u o , u o ) = 1 and (u, u o ) = 1, we get from equation (6) A(') =(Vu o , Uo)=Cl, Let S be the inverse of Ho - AoI in the sub- space perpendicular to u o (we put Su, = 0), then each term of the perturbation series for U and }, is obtained in the following way from the system of the equations given above: u(l)== SVuo, A (2 ) = - (VS vu" uo), d 2 ) == S(V -ex)SVu o , },i3) = (VS( V - ex)SVu o , u o ), u(3)= -S(V-ex)S(V-ex)SVuo +(VSVu o , u o )S2 Vu o , (2) Singular Perturbation Formal application of the procedure described in (1) often fails when the term including the highest-order derivative is multiplied by a small parameter (or a lower-order term is multiplied by a large parameter), i,e" when we deal with singular 
121 perturbation problems. Typical methods for treating such cases are described below. (a) WKB method. The method for getting an asymptotic solution of the second-order dif- ferential equation d 2 u -----z + k 2 P(x)u = 0 dx for large values of k was developed for prob- lems arising in classical wave motion (Jeffreys, 1924 [I J) and for problems of quantum mechanics (Wentzel, Kramers, Brillouin, 1926), and is called the Jeffreys method, WKB method, or WKBJ method. If the function Pin (7) does not vary rapidly, the solution is expected to have the form u cx:exp {ik<p(x)}, with <p(x) not very different from a linear func- tion of x, since the exact solution for P = Po = const is exp( I ikjP; x). Substitution of this expression for u in equa- tion (7) gives ik<p" _F<p,2 +FP=O. Neglecting the first term, we get <p,2=p, l.e" fP(x)== r pl/2dx. This result suggests a transformation of the form W= p1I4 U , z= IX p1/2 dx, which gives d 2 w 2 dz 2 +(k -Q)w=O, where d 2 d 2 Q=p-1/4_(PI/4)= _p-3/4_(P-I/4 ) dz 2 dx 2 ' Equation (8) can be transformed further into a Volterra's integral equation of the second kind: w(z; k) = Ae ikz + Be -ikz 1 f z +k sin {k(z - m Q«()w((; k)d(. Zo Therefore, if we expand w as I w(z; k)= wo(z)+- WI (z) +.... k we have wo(z) = Ae ikz + Be -ikz, wn(z)== J z sin{k(z-mQ(OWn-l(Ods Zo (n == 1, 2, .. . ). 25 B Approximation Methods in Physics (7) If P is positive for the interval zo:::;; z.s; z I in question and has continuous derivatives up to second order, the foregoing series converges to the solution of (8). On the other hand, when kz takes complex values, the series (9) does not converge in general. The series expansion appropriate for this case is 1, 1 w(z;k)=exp{ljt(z;k)}, 1/1= iikz+ f (ik)" I/1n(z), and we get the asymptotic expansion for w: { i f I w-exp i:ikz+ 2k Qd(+ 4k 2 Q (Z) i: 83 [ Q'(z)- r Q2 d (] + IL4 {Q"(z)-2Q(N} +} (8) Now, in many cases, P takes both positive and negative values within the interval in question, e,g., in the problem of transonic flow of a gas, in the problem of determining the reflectivity and transmissivity of a wave propa- gating in a nonuniform medium, or in the quantum-mechanical problem of a material wave passing through a potential wall. The point at which P vanishes is called a turning point. In this case there arises the problem of analytic continuation of the solution obtained in one side into the other through the turning point. For the special case when Prxx n (the turning point is chosen as x = 0), it can be shown that the exact solution of equation (7) is explicitly given by wcx: Z,(kz), (9) where v = 1/(n + 2), and Zv is a cylindrical function of order v. Based on this fact, the continuation formula for the case when P is approximated by x n near x = 0 has been obtained. (b) Lighthill's method. With nonlinear equa- tions, formal application of a perturbation procedure sometimes leads to the difficulty that the solution of the unperturbed equation has a singularity not exhibited by the exact solution of the original equation and whose order increases in higher approximations, so that the perturbation series ceases to have any meaning near this point. Such a difficulty can often be overcome by so-called coordinate straining. This method has been applied to a number of fluid-dynamical problems, such as those arising in theories of aircraft wings, boundary layers, and shock waves. Before describing the method, it is conve- nient to sketch Poincare% method of getting the perturbation series for the period of a nonlinear oscillation. In order to solve the 
25B Approximation Methods in Physics nonlinear equation du -=f(u;To,r.), dt where To is the period of oscillation when the small parameter t is taken to be zero, Poincare wrote series expansions for both u and the period T: u = uo(t) +sUI (t) +..., T == To + r. TI + .. . , and determined T" T2' . . . in such a manner as to avoid resonance at each step in the deter- mination of u,(t), u,(t), . . Based on this idea, Lighthill(1949) [2] developed coordinate straining in order to overcome the difficulty stated above. We illus- trate the method here by means of the follow- ing simple example. Consider a boundary-value problem of a nonlinear differential equation du (x+w)-+u==O (O<x< 1): dx u(1)= 1. Application of the usual perturbation method would give the series solution u=+ ( - ) - ( - ) + .... x2xx 2x x As is seen, however, the order of an apparent singularity occun1ng at x = 0 increases as we proceed to higher approximations so that it is not possible to see the true behavior of the solution near x = O. In Lighthill's method, we assume that series expansion is valid not only for U but also for the independent variable x, i.e., u==uo()+8Ud)+..., x =  +r.xd)+.... Substitution of these series into equations (10) and (11) gives u+uo=O, uo(lJ= 1, U'I +u I = -(uox', +UXI +uou), udl)=O, XI(1)=O, From the first equation we have U o = l/. Before solving the second equation for U 1 ' we determine Xl () so as to avoid the increase of the order of singularity in uoW at ( =0. This requirement is satisl1ed by taking Xl = (1/2) ( 1/), and this gives u 1 == 0; thus, at this stage of approximation, we have 1 u=uO()+Wl ()=, r. ( 1 ) x=+r.xl()=+2 - 122 The guiding principle is the same for higher approximations. (The foregoing expression happens to be the exact solution, but this a fortuitous result of our choice of this particular example. ) The convergence of the series obtained by Lighthill's method has been proved by Wasow (1955) [3J for the more general problem du (x+w) dx +q(x)u==r(x), u(a) = b, (10) where r., a, and b are positive constants and q(x) and r(x) are functions regular in Ixl.s;a. Lighthill's method can also be applied to partial differential equations. Sometimes it is called the PLK method (Poincan\-Lighthill-Kuo method), after its successful application to the problem of the boundary layer of a thin flat plate by Kuo. (c) Method of matched asymptotic expan- sions. In some cases it is possible to get a uni- formly valid series solution if we divide the domain into two or more subdomains, solve the equation by use of a suitable independent variable for each subdomain, and then deter- mine the coefficients in each solution by the process of matching neighboring solutions on their common boundary. Such a perturba- tion technique was developed in treating the boundary-layer equation in fluid dynamics and has been systematized into the method of matched asymptotic expansions. The idea of the method can be shown by the following simple example. Consider the boundary-value problem d 2 u d u f,+-=a (O<x< 1), dx dx (11 ) u(O) = 0, u(l)= 1, (12) where t is a small positive parameter and a is a constant such that 0 < a < 1. In order to solve this problem, we first take x itself to be an independent variable (the outer variable) in the domain x » £ and expand the solution as u== Uo(x) HUdx)+ Because the highest derivative is multiplied by a small parameter r. in equation (12), the per- turbation equations to be solved are all of first order, of which the first one is dUo -=Q, dx U o (l)==1. Solving this, we get an approximation for the outer solution: Uo(x)== l-a(l-x) On the other hand, we introduce a new independent variable (the inner variable) by 
123 putting x = 8 for x ;;S 8. This transforms equa- tion (12) into d 2 V dv de + d ==8Q, v()=u(x). If we assume the expansion v == Vo()+ 8 VI ()+ .", we have the equation for V o : d 2 V O dV o de +df=O' Vo(O) = O. This gives an approximation for the inner solution: Vo()==c(l-e-). Finally, in order to match the inner and the outer solutions, we equate their values at some point within the interval 8« x« 1 (--> 00 for the inner solution, x-->O for the outer solution): V o ( 00) = Uo(O) = 1 -a, from which the coefficient in V o is determined to be c = 1 -a. Therefore we have an approxi- mate solution valid in the whole interval Oxl: u Uolx) + Vo()-(1 -a) =ax+(1-a)(I-e- x /,). C. Method of Steepest Descent If an analytic function f(z) of a complex vari- able z is expressed in terms of an analytic function g(t) by an integral I(z) = Ie exp{zg(t)}dt, where C is a curve on the complex t-plane, then an asymptotic expression for f(z) for large values of Izl can be derived by the method of steepest descent (the saddle-point method). The idea of the method may be traced back to Riemann, and various asymp- totic expressions for cylindrical functions were obtained by Debye (1909) [5]. The point at which the first derivative of g(t) in the integral (13) vanishes is called a saddle point. The function g(t) is expanded in a power series near this point as 1 g(t) = g(t o ) + 2 g "(to)(t- t o )2 +.. We have an inequality zg"(to)(t - t 0)2.s; 0 along the line n 1 L: arg(t-to)=---arg{zg"(to)}' 2 2 25 Ref. Approximation Methods in Physics Therefore the absolute value of the integrand exp{zg(t)} reaches a maximum at to on the line L and decreases along it more rapidly than along any other direction. Hence if we deform the integration path in such a way that it passes through the saddle point to in the direction of steepest descent, the value of the integrand is practically zero on the new path except very near to to when Izi is large, whereas the value of the integral remains the same. Therefore we can get an asymptotic expression of f(z) for large Izi by truncating the Taylor series for g(t) up to the second term and taking the line L as the integration path: f(z)exp{zg(to)} I exp HZg"(t o )(t-t o )2} dt 2n exp{ zg(t o )}. - zg"(to) As an example, Stirling's formula n! n"e- n for a large positive integer is derived if we start with the formula n!= Lac e-'tndt= Lac e-nS(ns)"d(ns) r x =nn+l JtJ exp{n(\ogs-s)}ds and make use of the method of steepest descent. References (13) [1] H. Jeffreys, On certain approximate solu- tions of linear differential equations of the second order, London Math. Soc., (2) 23 (1924), 428-436. [2J M. J. Lighthill, A technique for rendering approximate solutions to physical problems uniformly valid, Phil. Mag., (7) 40 (1949), 117991201. [3J W. Wasow, On the convergence of an approximation method of M. J. Lighthill, J. Rational Mech. Anal., 4 (1955), 751-767. [4J H. S. Tsien, The Poincare-Lighthill-Kuo method, Advances in Applied Mechanics, Academic Press, 1956, vol. 4, 281-349. [5] P. Debye, Naherungsformeln fUr die Zy- linderfunktionen flir grosse Werte des Argu- ments und unbeschrankt veranderliche Werte des Index, Math. Ann., 67 (1909),5355558. [6J P. M. Morse and H. Feshback, Methods of theoretical physics, McGraw-Hill, 1953. [7J H. Jeffreys and B. S. Jeffreys, Methods of mathematical physics, Cambridge Univ. Press, third edition, 1962. [8J J. D. Cole, Perturbation methods in ap- plied mathematics, Blaisdell, 1968. (14) 
26 Arab Mathematics [9J A. H. Nayfeh, Perturbation methods, Wiley, 1973. [ IOJ A. Erdelyi, Asymptotic expansions, Dover, 1956. 26 (Xx1.4) Arab Mathematics The role of the Arabs in cultural history has been partly that of cultural transmitter. Between the 7th and 13th centuries, they established a religious empire that extended from India to Spain; later it was divided into the eastern and western empires. The caliphs of these empires encouraged research in the sciences, so the capitals Baghdad and Cordova became centers of culture where scholars from different coun- tries gathered. Arabic scholarship is sometimes called the stepfather of European culture. During the 13th Century, Alphonso X (1252- 1284) invited Islamic and Hebrew scholars to the Spanish court to translate their writings on algebra, medicine, and astronomy into Spanish. This accomplishment earned him the title of Al- phonso the Wise. The first contact between Greek and Indian mathematics took place in Baghdad under Caliph AI-Mansur (754-775); Euclid's Ele- ments was introduced by way of the Byzantine Empire, while Brahmagupta's Brahmasphuta- siddhdnta came directly from India, Many mathematical texts found in the Eastern Ro- man Empire and Syria, including some Greek works, were translated into Arabic. Though it is difficult to discern essential scientific ad- vances in Arabian works, the diffusion of these translations was instrumental in the development of European mathematics. The Arabs did not use written numerals until Mohammed's time (570-632). Signs representing numbers had been introduced into Arabia when its influence encompassed Egypt and Greece. Indian numerals were imported with Brahmagupta's book and became our present Arabic numerals after a series of modifications. Among all the branches of Arab mathe- matics, algebra was the most advanced. It started with Alkwarizmi's (820) AI Gebr W'al M uquabala, the origin of the word "algebra." It was the first mathematical book written in Arabic. Its content was essentially a variety of methods of solving algebraic equations. AI gebr means "transposition of negative terms on one side of the equation to the other side and changing their signs." and al muqua- bala means "simplification of the equation by 124 gathering similar terms." For example, qua- dratic equations can be brought by these methods into one of the following three types: x 2 = px +q; x 2 +q = px; x 2 + px =q, where p, q are positive numbers. The Arabs expressed the rule of solving these equations verbally. They apparently knew that quadratic equations have two roots, but they adopted only posi- tive roots; when the equation had two posi- tive roots they adopted the smaller root. The proof was given geometrically. It is possible that they learned geometric proofs from the Greeks. For the Arabs, geometry was secondary to algebra. They did not appreciate proof as seen in Euclid's Elements. The book on conic sec- tions by Apollonius was also translated into Arabic, but no essential progress was made in this area. The only remarkable contribution was that of Omar Khayyam, author of The Rubdiyat, who applied conic sections to the solution of the cubic equation x 3 + bx = a. In trigonometry, AI Battani (c. 858-929) left a notable contribution. He studied The Almag- est, the Arabic translation of Ptolemy's astro- nomical work. He added nothing outstand- ing to plane trigonometry, but obtained such formulas as ms a = ms b CDS c + sin b sin c ms rJ. for spherical triangles, which were not men- tioned in The Almagest. References [1 J M. Cantor, Vorlesungen iiber Geschichte der Mathematik 1, Teubner, third edition, 1907. [2J G. Sarton, Introduction to the history of science 1. From Homer to Omar Khayyam, Carnegie Institution of Washington, 1927. 27 (V.18) Arithmetic of Associative Algebras A. General Theory Let 9 be a tDedekind domain (i.e., an integral domain in which every ideal is uniquely de- composed into a product of prime ideals), let F be the field of quotients of g, and let A be a tseparable algebra of finite degree over F. A g- lattice a of A is a g-submodule of A that is I1nitely generated Over 9 and satisfies A = Fa. If a subring 0 of A is a g-Iattice containing g, then 0 is called an order. A maximal order is an order that is not contained in any other order. A maximal order always exists although 
125 it may not be unique; in particular, if A is commutative, it has only one maximal order. Ifweput o,={xEAlxaca} (o,={xEAlaxc a}) for a g-Iattice a of A, then 0, (0,) is an order of A and is called the left (right) order of a. If the left order of a is maximal, then so is the right order; the converse is also true. If 0, and 0, are maximal, we call a a normai g. lattice. To describe the same situation we say that a is a left o[-ideal or a right o,-ideal. If 0, = 0, = 0, then we say that a is a two-sided 0- ideal. A lattice a with a co, (or equivalently with a co,) is called an integral g-lattice or integral left (right) ideal. The product ab of two normal g-Iattices a, b is called a proper product if the right order of a coincides with the left order of b. The proper product defines the structure of a tgroupoid on the set of all normal g-Iattices of A. In particular, the in- verse a -[ of a normal g-Iattice a satisfying aa- l =oz, a-la=o, is given by a-I ={xEAlxa co,} = {x EA ax c or}. For a fixed maximal order 0, a maximal integral two-sided o-idea1 p different from 0 is called a prime ideal of o. If P is prime, then o/p is the matrix algebra of degree K over a division algebra, and K is called the capacity of the prime ideal p. The set of all two-sided o-ideals forms a multiplicative group, of which prime ideals are independent generators. B. Maximal Orders of a Simple Ring In the rest of this article, A is a tsimple ring, F is the tcenter of A, and 0 is a maximal order of A. The prime ideals q of 0 and the prime ideals p of  are in one-to-one correspondence by the relation q n 9 = p; o/q is a simple algebra over !'1/p, and q' = po for some natural num- ber e. The different b of 0 is defined by b- I = {XE A I Tr(xo) c g}. (Tr is the treduced trace from A to F; b is an integral two-sided o-ideal, and is divisible by q e-1 if q' == po.) For q to divide b, it is necessary and sufficient that either e > 1 or o/q not be separable over g/p. In particular, if A is a ttotal matrix algebra over F, then b = 0, q = po. The ideal of 9 gener- ated by the treduced norms (to F) of the ele- ments in a normal g-Iattice a of A is denoted by N",(a). If ab is a proper product, then we have NA/F(ab) = NA/F(a)NAIF(b), where NA/F(b) does not depend on the choice of 0; this is called the discriminant of A. If [A : F] = n 2 and qe = po, then NA/F(q) = pi, ef= n. C. Simple Rings over a Local Field Let F be a field that is complete with respect to a tdiscrete valuation whose field of residue classes is finite, Let 9 be the tvaluation ring of 27 D Arithmetic of Associative Algebras F, p be the maximal ideal of g, and A be a simple algebra with F as its center. If A is a tdivision algebra and [A : FJ = n 2 , then A has only one maximal order 0, and 0 has only one prime ideal q such that q" = po; o/q is an exten- sion of degree n of !'1/p. If A is not necessarily a division algebra, the relation OCI = 0' with an element  of A holds between two maximal orders 0 and 0' of A. Furthermore, for any left (right) o-ideal a, there exists an element rJ. such that a = OIX (a = c.:o). By using the notation of tcyclic algebra, we can express A in the form (Kn, 0", n'). Here Kn is the unramified extension of degree n over F, (J is the tFrobenius substi- tution of Kn/F, 7! is a prime element of F, and o (Y < n. The element of Q/Z determined by r/n (mod Z) is denoted by fA) and is called the tHasse invariant of the algebra class contain- ing A. The mapping A -{A} gives an isomor- phism of the tBrauer group of F onto the additive group Q/Z. If M is an extension of F of finite degree, then {A"'} = [M: FJ {A} holds for the algebra A M obtained from A by scalar extension (- 29 Associative Algebras). D. Simple Rings over an Algebraic Number Field Let F be an algebraic number field of finite degree, and let A be a simple algebra with center F. Then A is a cyclic algebra and is isomorphic to a total matrix algebra over a division algebra D. The order n of the algebra class of A over F is determined by n 2 = [D: FJ (H. Hasse, R. Brauer, and E. Noether). Denote by Fp the completion of F with respect to a tprime divisor p of F, and let A, be the algebra obtained from A by the scalar extension Fp over F. For a finite prime divisor p, the meaning of {A,,} is as before; for an infinite prime divisor p, put {A,} = 0 or 1/2 (mod Z) according as A, is a total matrix alge- bra over Fp or not. Furthermore, define the subgroup J p of Q/Z by J p = Q/Z, p a finite prime divisor, = {O, 1/2 (mod Z)} p a real infinite prime divisor, = {O}, P a complex infinite prime divisor. Now let J be the subgroup of the direct prod- uct IIp J p consisting of all elements of the form (a,) (ex p E J,) such that ex p = 0 except for a finite number of prime divisors and l:plXp = O. Then A+( { A,j) gives rise to an isomorphism of the Brauer group over F onto J (Hasse). Each fA,) is called the tp-invariant of A. In particular, A is a total matrix algebra over F if 
27E Arithmetic of Associative Algebras and only if A, is a total matrix algebra over Fp for all p (- 29 Associative Algebras). These theorems are closely related to tc1ass field theory. If 0 is a maximal order of A and a, b are left o-ideals, then o = b with an element  of A defines an equivalence relation between a and b. The number of equivalence classes of left ideals with respect to this equivalence is called the class number of A; it is independent of the choice of 0 and is equal to the class number defined by using right ideals. The product of all real infinite prime divisors p with {Ap} = 1/2 is denoted by Pro' If Pro is the product of all inllnite prime divisors and [A : F] = 4, then A is called a totally definite quaternion algebra. If 0 is a maximal order of A and A is not a totally detlnite quaternion algebra, then 0--> N",(a) gives a one-to-one correspondence between the classes of left o-ideals and the congruence classes of ideals of F modulo P :JJ (- 14 Algebraic Number Fields II) (Eichler's theorem). In particular, if A is a total matrix algebra over F, then the class number of A is equal to the class number of F. The class number of a totally detlnite quaternion algebra was deter- mined by M. Eichler by using the zeta function (- Section F) of A [4]. Let 0 be a maximal order of A, a be an integral two-sided o-ideal, b be an integer of F, and  be an element of o. Furthermore, assume b == 1 (mod Pro), NA/F()= b (mod a n F) (tmulti- plicative congruence). Then there exists an element e of 0 such that NA/F(fJ) = b, # ==  (mod 0) (N A / F is the reduced norm), provided that A is not a totally definite quaternion algebra [5]. This theorem, which is called EicWer's approximation theorem, is widely applicable; e.g., it yields the previous theorem on the class number and can be generalized to the case of semisimple talgebraic groups ( 13 Algebraic Groups). E. Algebras over a Function Field The Hasse-Brauer-Noether and Hasse theo- rems also hold for tnormal simple algebras over a tfield of algebraic functions of one vari- able over a finite field. On the other hand, a normal simple algebra over a tleld K of alge- braic functions of one variable over an alge- braically closed field is a total matrix algebra over K (Tsen's theorem). F. Other Notions Adeles and ideles for a simple algebra A over an algebraic number field of finite degree can 126 be introduced as in the case of number fields ( 6 Adeles and Ideles). If N(a) stands for the number of elements in 0/0, where 0 is a maxi- mal order of A and a is an integral left o-ideal, then the zeta function of the simple algebra A, called the Hey zeta function, is detlned by (A(S) = N(o)-S (the sum over all integral left 0- ideals). This function has properties similar to those of Dedekind zeta functions (- 450 Zeta Functions L). Let p be an infinite prime divisor of the center F of A, and let G p be the group of elements in A, with the reduced norm 1. Put G = D p G p (the product over all infinite prime divisors of F). Then the group r of units with the reduced norm 1 in 0 is naturally regarded as a subgroup of G. To be more precise, f is a discrete subgroup of G, the volume of G/r is finite with respect to an invariant measure, and G/f is compact if and only if A is a divi- sion algebra ( 122 Discontinuous Groups). This result can be viewed as a special case of more general facts about semisimple algebraic groups (- 13 Algebraic Groups). If K is a tmaximal compact subgroup of G, then r gives rise to a tdiscontinuous group operating on the homogeneous space GIK, and we obtain taut om orphic forms with respect to f. The case where A is a tquaternion algebra has been studied extensively (- 32 Automorphic Functions). References [lJ C. Chevalley, L'arithmetique dans les algebres de matrices, Actualites Sci. Ind., Her- mann, 1936. [2] M. Deuring, Algebren, Erg. Math., Springer, second edition, 1968. [3J M. Eichler, Uber die Idealklassenzahl hyperkomplexer Systeme, Math. Z., 43 (1938), 481-494, [4J M. Eichler, Uber die Idealklassenzahl total definiter Quaternionen Algebren, Math. Z., 43 (1938), 102-109. [5] M. Eichler, Allgemeine Kongruenzklas- seneinteilungen der Ideale einfacher Algebren iiber algebraischen Zahlkorpern und ihre L- Reihen, 1. Reine Angew. Math., 179 (1938), 2277251. [6] N. Jacobson, The theory of rings, Amer. Math. Soc. Math. Surveys, 1943. [7J C. L. Siegel, Discontinuous groups, Ann. Math., (2) 44 (1943),674-689. (Gesammelte Abhandlungen, Springer, 1966, vol. 2, 390- 405.) [8] A. Weil, Adeles and algebraic groups, Lecture notes, Institute for Advanced Study, Princeton, 1961. 
127 28 (XXI.1 3) Artin, Emil Emil Artin (March 3, 1898-December 20, 1962) was born in Vienna. After he studied at the Universities of Vienna, Leipzig, and Got- tingen, he taught at the University of Ham- burg from 1923 to 1937. In 1937 he left Ger- many under the Nazi regime for America, where he taught at Notre Dame, Indiana, and Princeton universities. He returned to Ger- many in 1948 and taught again at the Univer- sity of Hamburg until his fatal heart attack. In his thesis (1923), he proved affirmatively the tRiemann hypothesis in function fields over finite constant fields in the hyperelliptic case, and conjectured that this should be valid in general. This was eventually verified by A. Weil in 1941. Also in 1923, he introduced the tL-function for Galois extension and was led to the tgeneral law of reciprocity, which he stated as a conjecture and proved four years later, thus bringing tclass field theory to com- pletion. Around the same period, he estab- lished in collaboration with 0. Schreier the theory of tformally real fields and solved the 17th tproblem of Hilbert. He also initiated the theory of tbraids, which he later developed with F. Bohnenblust in the 1940s. In view of these and other ingenious works as well as his inspiring teaching in all areas of mathe- matics, Artin is considered to have been one of the most influential mathematicians of this Century. References [1] S. Lang and J. T. Tate (eds.), The co11ected papers of Emil Artin, Addison-Wesley, 1965. [2J H. Cartan, Emil Artin, Hamb. Abh., 29 (1965). 29 (111.1 0) Associative Algebras A. Fundamental Concepts Let K be a commutative ring with unity ele- ment 1 (- 368 Rings A), and let A be a ring which is a tunitary K-module (- 277 Mod- ules). Such a ring A is called an associative algebra over K (or simply algebra over K) if it satisfies the condition A(ab) = (Aa)b = a(Ah) (h K; a, bE A). An (associative) algebra A over 29 A Associative Algeb ras K is often written AI K, and K is called the coefficient ring (or ground ring) of the algebra A= A/K. In particular, if K is a tfield, then it is called the coefficient field (or ground field) of A. Algebras over fields have been studied in de- tail. Notions such as zero algebra, unitary algebra, commutative algebra, (semi) simple algebra, and division algebra are replicas of the respective ones for rings (- 368 Rings). Con- sidering both structures as rings and as K- modules, homomorphisms and isomorphisms are defined in a natural manner, and are called algebra homomorphism and algebra isomor- phism, respectively. In this connection, sub- algebra, quotient algebra (or residue class algebra), and direct product of algebras are also defined as in the case of rings. An tideal of an algebra A is defined as an ideal of the ring A, which is at the same time a submodule of the module A over the coeffi- cient ring. The tradical of an algebra A con- sidered as a ring is then an ideal of A in this sense. In fact, the existence of a unity in an algebra A implies that any ideal of A consid- ered as a ring is necessarily an ideal of the algebra A. In the rest of this article, we assume that all rings have a unity element and that all homo- morphisms are unitary. Hence, when we con- sider subalgebras of an algebra A, we require that they share the unity element with A. If e is the unity element of an algebra A over K, then the mapping A-->Ae = X (A.E K) is a homomor- phism K --> A whose image Ke is contained in the tcenter of A, and the scalar multiplication Aa is equal to the ring multiplication A'a (AE' K, GE A). Conversely, given a homomorphism of K into a ring A whose image is contained in the center of A, we can regard A as an algebra over K in an obvious way. Hence we are given an algebra A over K if and only if there exists a pair (A, p) of a ring A and a homomorphism p : K --I A whose image is contained in the center of A. There exists a uniquely determined (unitary) homomorphism of the ring Z of rational integers into any ring; hence any ring can be regarded as an algebra over Z. If the coefficient ring K of a nonzero algebra A is a field, then K can be regarded as a subfie1d contained in the center of A, and the unity element 1 of K coincides with the unityele- ment of A. Let A and B be algebras over K. Then the ttensor product A Q9K B of K-modules is an algebra over K under the multiplication (a 0 b)(a' 0 b') =aa' 0 bb' (a, a' E A, b, h' E B). This algebra is called the tensor product of algebras A and B. Moreover, the mapping a-->a 01 (resp. b-->1 0 b) (aE'A, bE'B) gives an algebra homomorphism A-l A 0 K B (resp. B--> 
29 B 128 Associative Algebras A Q9K B, which is called the canonical homo- morphism. In particular, if A is a commutative algebra, then A Q9K B can be regarded as an algebra over A by this canonical homomor- phism; and in this case A Q9K B is called the algebra obtained by extension of the coeffi- cient ring of B to A (or a scalar extension of B by A), and is often denoted by B A . The algebra K Q9K B is canonically isomorphic to B. Fur- thermore, (A Q9K B) Q9K C and A 0 K (B Q9K C) are canonically isomorphic and are written A Q9 B @ C. Let A and B be commutative algebras over K. Then, for any commutative algebra C and homomorphisms a : A+ C, fi: B -->C, there exists one and only one homomor- phism y:A Q9KB-->C such that lX(a)=y(a@ I), f3(b) = y( 1 0 b) (aE A, b E B). This property char- acterizes the tensor product A 0 K B of com- mutative algebras A and B. In this sense, A @K B is sometimes called the coproduct of A and B (- 52 Categories and Functors E). B. Examples of Associative Algebras As we mentioned, any ring can be regarded as an algebra over the ring Z of rational integers. But it is often useful to deal with algebras over "large? or more "efficient" coefficient rings. For instance, we have many rings which are algebras over a commutative ring K, such as the tring of polynomials, the tring of forma 1 power series in n variables with coefficients in K, the tendomorphism ring of a K-module, and the tfull matrix ring of degree n over K (- 368 Rings C). There are other important classes of algebras, such as (semi) group alge- bras, Hecke algebras, and crossed-product algebras, which will be explained later. These algebras are defined by a canonical basis con- nected directly with a (semi) group structure. On the other hand, the ttensor algebra and the texterior algebra of linear spaces and the tClifford algebra associated with a given quadratic form are also important (- 61 Clifford Algebras, 256 Linear Spaces). The most frequently used example of a division algebra is the quaternion field H (often called Hamilton's quaternion algebra, W. R. Hamilton, 1858). This is a 4-dimensional linear space over the real number field R with basis ( 1, i,j, k), with the following laws of multiplication: 1 is a unity element, e = / "" k 2 = -1, U"" -ji=k,jk= -kj""i, and ki= -ik=j, An element of H is called a quaternion. The only finite-dimensional division algebras over the real number field R are the real number field R, the complex number field C, and the quatemion field H. C. Group Algebras and Hecke Algebras Let K be a commutative ring and K(G) be the direct sum LSEG Ks of modules K" where each K, is isomorphic to K with a group G as the index set (- 277 Modules F). The elements of K(G) are the families (A')SEG of elements of K whose components are all zero except for a finite number of them. Let {U'}SEG be the canonical basis of K(G), namely, Us is an ele- ment of K(G) of which the sth component is 1 and the others are O. The module K(GI has the structure of an algebra over K, where the law of multiplication is determined by U,U, = us,(s. t E C). This algebra is called the group algebra of Gover K. The product A * J1 of elements A = (Je,) and 11 = (11,) of K(G) is then given by (hJ1)s= L A,J11' SEC. s=rl (1 ) Each basis element Us is often identified with the group element s, and in this manner, the group G is regarded as a basis of K(G), which is usually written KG or K[Gj. In this definition, the group G can be re- placed by a semi group C, and then the algebra K(G) is called a semigroup algebra. As an example, let N be the additive semigroup of all nonnegative rational integers and Nn be the direct product of n copies of N. If we denote the elements (iI' , i,) of N" by X1 Xn and use multiplication instead of addition, then the semi group algebra of N n over K is exactly the tring of polynomials K [XI" , XnlOn the other hand, if G is a semigroup, then even when it is infinite, it may occur that for any SE C, there exists only a finite number of pairs (y, I) of elements of G such that s = rl. In this case, formula (1) also defines the law of multi- plication on the tCartesian product KG. This algebra is called a large semigroup algebra and contains K(G) as a subalgebra. In par- ticular, the large semigroup algebra of N n is exactly the tring of forma 1 power series K [[XI'"'' XnJJ. Let H be a subgroup of a group C, and assume that the index of H n sHs- 1 in H is fmite for any S E G. This assumption is equiva- lent to the condition that any double coset of G by H is a union of a finite number of left as well as right cosets. Let H\G, GIH, and H\G/H be the set of all tright cosets, tleft cosets, and tdouble cosets, respectively. Then each element of the direct sum K(H\G) K(G/H) or K(H\G/H) can be regarded as a func;ion ' defined on G taking a constant value on each right, left, or double coset, respectively. Con- versely, any function defined on G can be regarded as an element of K(H\GI, K(G/H), or 
129 K(H GjH) if it takes a constant value on each left, right, or double coset, respectively, and if it vanishes everywhere except on a finite num- ber of cosets of respective type. Under such identification, let As. AI> A" and At denote the values of a function A:G-->K on SEG, IE H\G, r EG/H, and t EH\G/H, respectively. For any I" fi E K1H\G/H) we define a function A * fl by (A*fl),= L A,fll' .lEG, sErl where the right-hand side is the sum taken over all pairs (r, I) such that sErl, rEG/H, IE ffiG. It can be shown that this sum is a Unite sum and that A * flE K(H\GjH} Hence the module KIH\G/H) has the structure of an alge- bra over K, which is called the Hecke alge- bra of (G, H) over K and often denoted by £dG, H). If H = tel, the Yf'K(G, H) is exactly the group algebra of G. In general, ,1f K (G, H) can be regarded as an algebra obtained by extending the coefficient ring of ffz(G, H) to K. Furthermore, K 1H1G ) (K 1G / H ») can be re- garded as a right (left) module over the group algebra K(G), and the endomorphism ring r%K(G)(K(HIG) (resp. cffK(G)(K(GIH))) is canonically isomorphic (anti-isomorphic) to the Hecke algebra K1H.G'H). D. General Crossed Product Let G be a group that operates on a commuta- tive ring L, and denote the operation by (s, A)--> s(A) (SEG,).EL); thus for any .lEG the map- ping ),->s(A) (}.E L) is an automorphism of L satisfying s(t(),)) = st(A) (s, t E G). For any A. fiE LiG). we define the product). * J1E UG) by (A * l1)s = I ).,r(J1/)f(r.I), SE G, s=rl where {f(r,I)}',IEG is a given family of elements of L If this family satisfies the equations frs, r)f(sr, I) = s(f(r, I)j (.I, rl), s, r, lEG, then LiG) forms a ring. In terms of the canon- ical basis {US}SEG, this ring structure is defined by the formulas u,u/ = f(r, I)url> u,A == S(A)U s (J. E L). If K is the subring of L consisting of all h Lsuch that s().) = A(SE G), then the ring VG) is an algebra over K, called the crossed product of Land G with respect to the given operation and the given factor set (of G. In a narrower sense of the term, we consider only the case when L is a field, G is a finite group, fIr, I) # 0, and G operates on L faithfully, In this case, G can be identified with the tGalois group of a Unite tGalois extension L/K. and so the crossed product is written (L/K, f). This is a tcentral simple algebra over K (- Sections E, F). If the operation of G in the above- mentioned general crossed product is trivial, 29E Associative Algebras (2) namely, L = K, the crossed product of K and G is called an algebra extension of Gover K with respect to .f Usually, we assume that K is a field and fIr, I) #0. If fIr, I) = 1, the algebra extension is the group algebra. If G is a finite group whose order is prime to the tcharacter- istic of K, then any algebra extension of Gover K (in particular, the group algebra) is always tsemisimple and tseparable. If G is a finite Abelian group and fis a factor set of G, then a bihomomorphism <p: G x G-->K* (a mapping which is a homomorphism in each variable) is deUned by <p(s,t)==I(s, t). {(t,srl.An alge- bra extension of Gover K with respect to f is a central simple algebra if and only if IfJ is nondegenerate. In particular, if G is a direct product of IJ copies of a group of order 2, then by choosing n elements SI, S2, . , sn of G suitably, any element of G can be uniquely ex- pressed as Sa Sb . Sz> where {a, b. . , z} is a subset of { 1,. , 11} in the natural order. Hence, if we take ula, ,z) = UaU b U z as a basis of an algebra extension, then any factor set f is determined by I(Si, s) = Aij, where A ij = 1 (i <j), )'ii== :t 1 (i >j), and I.U are arbitrary. When Aij = -1 (i > j), the corresponding algebra exten- sion is a tClifford algebra. Furthermore, if A ii = 0, then it is a tGrassmann algebra. If Au f- 0 and IJ is even, then it is a central simple alge- bra whenever the characteristic of K is not 2 (- 61 Clifford Algebras). Now let K be the real number field Rand A, be the Clifford algebra with i'ii = 1. Then A, is the quatemion field. Elements of A, are called sedenions, and are important in tspinor theory and tDirac's equation. In gen- eral, let K be an arbitrary field whose charac- teristic is not 2 and put n = 2, A 1 2 = 1, AZI = -1, All =), #0, and ,1,22 = fl #0, in the pre- vious notation. Then the corresponding cen- tral simple algebra Q is called a (general- ized) quaternion algebra. Thus Q has a basis { 1, u, v, w} satisfying the following laws: 1 is the unity element, w = uv = -vu, u 2 = A, and v 2 = 11 (..1., II E K). Any central simple algebra of dimension 4 is isomorphic to a certain quater- nion algebra. (In particular, if K = R and A = fi = -1, then Q coincides with the quaternion field H.) For any element X = 11 + pu + yv + bw of Q, the element X == 11- pu - tV - bw is said to be conjugate to x, and N(x) = xx E K is called the norm of x. An element x of Q is invertible in Q if and only ifN(x) f- O. (3) E. Finite-Dimensional Associative Algebras over a Field For the rest of this article, we assume that the algebras considered are unitary and finite 
29F 130 Associative Algebras dimensional over a field K. Then by general properties of left (right) t Artinian rings, any associative algebra A has the following struc- ture: the tradical N of A is the greatest tnilpo- tent ideal, and the quotient algebra A/N = A is tsemisimple and decomposed into a direct sum of ideals which are simple algebras: A =AI +...+An. Each simple component Ai is a full matrix ring of degree r i over a certain division algebra D i , and Ai is decomposed into a direct sum of r i minimal left ideals which are mutually A - isomorphic: Aj=AeP)+... + Aef'i), 1 in, where eP),.... e!'i) are orthogonal tidempotent elements of Ai whose sum is equal to the unity element of .4;. On the other hand, Ai==ep) A + ... +e!'i) A gives a decomposition of A i into the direct sum of minimal right ideals that are mutually A-isomorphic. Moreover, we can choose an idempotent element eS) of A from each residue class e!') such that {e!')} forms a system of orthogonal idempotent elements, whose sum is equal to the unity element 1 of A, and A= t I AeS)= t f eS)A i""l s=l i=l s=1 gives a decomposition of A into a direct sum of tdirect indecomposable left (right) ideals. Here, AeS) and AeY) (e!S)A and eY)A) are A- isomorphic if and only if i = j. Conversely, every decomposition of A into indecom- posable ideals is obtained as above. The A- submodule Nd') of AeS) is the unique maxi- mal proper submodule, and Ae!S) / N e!S) and AeY)jNeY J are A-isomorphic if and only if "  ]. Any simple A-module is A-isomorphic to a certain Ae!')/NeS) (- Sections H, I). Any simple algebra A over K is isomorphic to a full matrix ring M,(D) over a certain divi- sion algebra D, This is called Wedderburn's tbeorem. Here n is determined uniquely by A, and D is also determined uniquely by A up to isomorphism. Moreover, the center of A is isomorphic to the center of D If the center of A coincides with K, then A is called a central simple algebra (or normal simple algebra) over K. In this case, any isomorphism of two simple sub algebras of A can be extended to an tinner automorphism of A. Let V(B) denote the tcom- mutor of a simple subalgebra B of A. Then V(B) is also a simple subalgebra and V(I'(B» = B, dim A = dim B . dim V (B). In particular, if B is central over K, then there is a canonical isomorphism A  B @ K V(B). If D is a central division algebra, then any maximal commuta- tive subalgebra L of D is a field and satistles (dim L)2 = dim D. Moreover, there are t sepa - rable extensions over K among such L. In general, the dimension of a central simple algebra A is a square number r 2 , where r is called the degree of A. Two central simple algebras are said to be similar if they are isomorphic to full matrix rings over the same division algebra. This is an equivalence relation, and each equivalence class is caIled an algebra class. On the other hand, if A is a central simple algebra and B is a simple algebra, then A 0 K B is a simple alge- bra. Moreover, if B is central, then A 0 K B is also a central simple algebra. If A and B are similar to A' and B', respectively, then A 0 K B is similar to A' @K B'. Hence, the tensor prod- uct 0 defines multiplication in the set 8&(K) of the algebra classes over K. If A is a central simple algebra and A 0 is an algebra anti- isomorphic to A, then A 0 is also central simple and A 0 K AO is isomorphic to a full matrix ring over K. This shows that @(K) forms a group, which is called the Brauer group, after R. Brauer, who introduced this concept, or the algebra class group over K. For a central simple algebra A, the degree of a division algebra similar to it is called the Schur index of A (or of the algebra class of A), and the order of the algebra class of A in the Brauer group is called the exponent of A. The Schur index is divisible by the exponent; and conversely, the exponent is divisible by every prime divisor of the Schur index. F. Extensions of Coefficient Fields Let L be an textension tleld over a field K. Then for any algebra A over K, L @K A can be regarded as an algebra over L. This algebra is denoted by A L and is called an algebra ob- tained by extending the coefficient field to L Let us denote the radical of a ring A by 9i(A). An algebra A over K is called a separable algebra if it satisfies \R(A L ) = {O} for any exten- sion field Lover K. In the special case when A is an talgebraic extension field over K, A is a separable algebra if and only if every element of A (or every element of a subset which gener- ates A) is tseparable over K (- 149 Fields). A (finite-dimensional) algebra A over K is sepa- rable if and only if A is semisimple and the center of every simple component of A is a separable extension over K. If the quotient algebra A/\R(A) of an algebra A is separable, then there exists a subalgebra S such that A =S + \R(A) and sn \R(A) = {O}, and S is uniquely determined up to inner automor- phisms (Wedderburn-Mal'tsev tbeorem). In order that an algebra A over K be central simple, it is necessary and sufficient that A L be 
131 simple for any extension tleld Lover K. This latter statement holds if and only if A L is iso- morphic to a full matrix algebra over L for a certain extension tleld Lover K. Such an exten- sion field is a tsplitting field of A (- 362 Re- presentations F). For a central simple algebra A, a (finite) extension field L of degree rover K is a splitting field of A if and only if there exists a central simple algebra B of degree r which is similar to A having a subfield which is K- isomorphic to L. In this case, r is divisible by the Schur index of A. Furthermore, A has a separable splitting field whose extension de- gree is equal to the Schur index of A. This shows that A has a splitting tleld which is a finite Galois extension tleld over K. Let L be a finite tGalois extension over a tleld K and G be its tGalois group. If f is a factor set with respect to the operation of G on L *, then the crossed product (L/K,f) is a cen- tral simple algebra over K (- Section D). The productfq of two factor sets f and 9 is also a factor set, and the set of all factor sets forms an tAbelian group. Thus (L/K,fg) is similar to (L/K,f) 0 K (L/K, g). On the other hand, factor sets f and 9 are said to be associated with each other if there exists a family {ASLEG of elements of L such that f(r, I) = g(r, l)r(A 1 ))"i 1 An r, lEG. Hence, f and 9 are associated with each other if and only if (L/K, f) and (L/K, g) are similar. Therefore, the mapping f -->(L/ K,f) gives a monomorphism of the group H 2 (G, L*) of all associated classes of factor sets (which can be identilled with the 2-dimensional tcohomology group of G with coefficients in L*) into the Brauer group @(K) over K. Its image coin- cides with the subgroup of all algebra classes which have L as a splitting field. In particular, any algebra class is similar to the crossed product of a certain finite Galois extension L and its Galois group G (R. Brauer, E. Noether, A. Albert, K. Shoda, and others). G. Cyclic Algebras Let Z be a tcyclic extension field of degree n over a field K. Then the crossed product of Z and its Galois group G is called a cyclic algebra over K. For a fixed generator S of G and any element if. 1= 0 of K, a factor set {(Si, sj) (0  i,.i < n) can be defined by f(Si, sj) = 1 (i+.i<n) and j(Si,sj)=(X (i+i?n). Let (Z, s, ex) denote the corresponding crossed product. Then (Z, s, a) and (Z, s, PJ are similar if and only if (X/I! is a norm of a certain element of Z into K. On the other hand, any crossed product of Z and G is similar to a certain (l, s, cc), and the correspondence (X -->(Z, s, a) 29 G Associative Algebras gives an isomorphism of K*/Nz/K(Z*) (the norm class group of l/K) to the group con- sisting of the algebra classes over K which have Z as a splitting field. If K is a tp-adic field or a finite talgebraic number tleld, then any central simple algebra over K is isomorphic to a certain cyclic algebra. In this section, we describe this situation in detail. Let K be a p-adic field and q the number of residue classes modulo p. If A is a central simple algebra over K, then its Schur index coincides with the exponent, which is called simply the index. A finite extension tleld Lover K is a splitting field for A if and only if the degree of L is a multiple of the index of A. If n is the degree of A, then the field W= K(w) ob- tained by adjoining to K a tprimitive (q" 1)st root of unity w is a cyclic (and tun ramified) extension of degree n over K, and its Galois group is generated by the automorphism (J of Wdetermined by w G == w q , (J K = iden- tity. Moreover, we have A  (W, a, (X) for a certain if.E K*. Let v be the texponential p- adic valuation v((X) of if., Then v/n (mod Z) is uniquely determined by the algebra class of A, which is called the Hasse invariant of A (or of the algebra class of A). By assigning to each of the algebra classes its Hasse invariant, we get an isomorphism of the Brauer group B( K) over K to the group Q/Z, the additive group of the rational numbers mod Z (H. Hasse, 1931). Let K be a finite algebraic number tleld and A be a central simple algebra over K. Let p be a (finite or intlnite) tprime divisor of K and Kp the tp-adic extension field over K. The algebra A, which is obtained from A by extending the coefficient tleld to Kp is a central simple alge- bra over K,. Except for a finite number of p, A, is isomorphic to a full matrix ring over Kp, and A itself is isomorphic to a fuII matrix ring over K if and only if A, is isomorphic to a full matrix ring over Kp for all p. The index mp of A, is called the p-index of A, and the Hasse invariant of A, is called the p-invariant of A, which is denoted by (A/p). If p is an tin finite prime divisor, then mp is equal to 1 or 2, and in each case, we define the p-invariant by setting (A/p) = 0 or 1/2 (mod Z) correspondingly. The Schur index of A is the L.C.M. of the p-indices mp for all p and coincides with the exponent of A. This is called simply the index of A. On the other hand, the p-invariants satisfy (A/p) == 0 (mod Z) except for a finite number of p, and L(A/P)=O (modZ). p Conversely, given a rational number P p for each p such that (i) Pp == 0 (mod Z) except for a finite number of p; (ii)pp == 0 (mod Z) if P is infinite and imaginary, P p == 0 or 1/2 (mod Z) if 
29H Associative Algebras P is infinite and real; (iii) LpPp =0 (mod Z), then there is a uniquely determined algebra class of central simple algebras A over K such that (Alp) == P p (mod Z) for each p. In this way the structure of the Brauer group over a finite algebraic number Held is completely deter- mined (Hasse, 1933). H. Frobenius Algebras Let A be an algebra over a field K. A is called a Frobenius algebra if its tregular representa- tion and tcoregular representation ( 362 Representations E) are similar. Thus A is a Frobenius algebra if the left A-module A and the dual module A * of the right A-module A are isomorphic as left A-modules. Let A = it s Aes) = it s eS) A be direct decompositions of A into indecom- posable left (right) ideals ( Section E). We denote ell) by e i . Then A is a Frobenius alge- bra if and only if there exists a permutation IT on I. . , n such that (i) Ae,  (en(i) A)*; (ii) r i = r x(i)' When there exists a permutation satisfy- ing only condition (i), A is called a quasi- Frobenius algebra. For a subset S of A, the ideals I(S) = {aEAlaS=O} and r(S)={aEAISa=O} are called the left annihilator and right annihilator of S, respectively. Then A is a quasi-Frobenius algebra if and only if (ii) l(r(I») = I and r(l(r») = r for any left ideal I and any right ideal r. In general, if we are given a left (right) A-module M, we denote the right (left) A-module HomA(M, A) by fi. Then if A is a quasi- Frobenius algebra, there is a canonical iso- morphism M  M, and the annihilator relation gives a one-to-one and dual correspondence between the set of the submodules of M and the set of submodules of M (M. Hall). If a quasi-Frobenius algebra A satisfies (iv) dim r + dim I(r) = dim I + dim r(l) = dim A for any left ideal] and any right ideal r, then A is a Fro- benius algebra; the converse is also true. A criterion for an algebra A to be a Frobe- nius algebra is that there is a linear form x--> X(x) on A such that if A(xa) == 0 for all XE A, then a = O. Moreover, if ), satishes A(XY) = ic( yx) (x, YE A), then A is called a symmetric algebra. For example, semisimple algebras and group algebras are symmetric algebras. If A is a symmetric algebra, then for any left (right) A-module M the right (left) A-modules M* = HomdM, K) and 1\1. = HomA(M, A) are canonically A-isomorphic. If A is a Frobenius algebra, the radical N of A satisfies l(N) = r(N), and the annihilator of N is a principal left and principal right ideal; 132 the converse is also true. For a two-sided ideal z of a Frobenius algebra A, the quotient alge- bra Alz is a Frobenius algebra if and only if I(z) and r(z) are a principal left and a principal right ideal, respectively. If A is a symmetric algebra, then any two-sided ideal z satistles I(z) = r(z), and A/z is also a symmetric algebra if and only if I(z) = r( z) is a principal ideal generated by an element in the center. Furthermore, for any extension L of the coefficient Held K, A L is a quasi-Frobenius algebra (resp. Frobenius algebra, symmetric algebra) if and only if A is (T. Nakayama, 1939,1941). The concept of Frobenius algebras has been extended to algebras B over a ring A (F. Kasch, 1954). I. Uniseria1 Algebras In the notation of the preceding section, if each indecomposable left ideal Ae i (right ideal eiA) of an algebra A over K has a unique compo- sition series, then A is called a generalized uniseria1 algebra. If an algebra A is decom- posed into a direct sum of ideals which are primary rings, then it is called a uniseria1 alge- bra. Any left module over a generalized uni- serial algebra A is decomposed into a direct sum of submodules which are A-homomorphic images of Aei: An algebra whose radical N is a principal left and principal right ideal is a generalized uniserial algebra. For an algebra A to be uniserial, it is necessary and sufficient that every two-sided ideal of A be a principal left and principal right ideal. Hence A is uni- serial if and only if every quotient algebra of A is a Frobenius algebra. If A L is uniserial for an extension field L of K, then A itself is uniserial. The converse, however, is not always true. If A L is uniserial for any extension Held L of K, then A is called an absolutely uniserial algehra. For A to be absolutely uniserial it is necessary and sufficient that its radical N be a principal ideal generated by an element in the center Z and Z be decomposed into a direct sum of simple extensions of K (i.e" ideals of the form K [a]) (K. Asano, G. Kothe, Nakayama, G. Azumaya). J. Algebraic Algebras Here, we consider general (not necessarily finite-dimensional) algebras A over a field K. We say that A is an algebraic algebra if every element of A is algebraic over K, i.e., every element of A is a root of a certain polynomial with coefficients in K. We say that A satistles a polynomial identity p( X" . . , X,,) = 0 or that A is a PI-algebra with an identity p(X I " X,) = 
133 o if there exists a nonzero (noncommutative) polynomial p(X I , ,X n ) in XI" X n with coefficients in K such that p(a I" a,) = 0 for all a, EA. A PI-algebra satisfies an identity which is homogeneous linear in each variable, and also an identity of the form [Xl' , xnr= o (where [ J is the sum L :t Xi . . ,x i taken over all permutations of 1,2, ,1 n and + is the sign of the permutation). An algebra is said to be locally finite if any finite number of ele- ments of A generate a finite-dimensional subalgebra. For PI-algebras, an affirmative answer was found for Kurosh's problem, which asks whether an algebraic algebra is locally finite if the degree of any element rx of A (i.e., dim K [aJ) is bounded. K. Brauer Group of a Ring Let R be a commutative ring. An R-algebra is called a separable algebra if A is tprojective as a two-sided A-module (- 200 Homological Algebra F). When the base ring is a field, this agrees with the classical notion of separability; and A is separable over R if and only if AjmA is separable over the residue field Rjm for every maximal ideal m of R. A central separa- ble algebra is also called an Azumaya algebra. If P is a finitely generated faithful projective R-module (briefly: R-progenerator), the endo- morphism ring EndR(P) is an Azumaya R- algebra. Azumaya algebras A, and A, are said to be in the same class (similar) if there exist R-progenerators Pi and P 2 such that A, 0 EndR(P I );?: A, 0 End R (P 2 ). The set of similarity classes forms an Abelian group with respect to 0. This is called the Brauer group B(R) of R (Auslander and Goldman [10J), Every element of B(R) is of finite order [12,13]. B(R) is a covariant functor from commuta- tive rings to Abelian groups. If R is a field, B(R) coincides with the classically defined one (- Section E). If R is a tHenselian local ring with the residue field k, the mapping B(R)-+ B(k) is an isomorphism [9]. If R is a tregular ring with the quotient field K, B(R)--> B(K) is injective. If further dim R (2, we have B(R) = npB(R p ), p running over all primes of height 1 of R, where Rp is the localization of R at p and B(R) and B(R p ) are considered as em- bedded in B(K) [10]. As an example, put- ting these facts together with the structure of B( K) of the algebraic number field K ( Sec- tion G), we have the structure of B(R) of the ring of integers of K: B( R) = 0 if K is totally imaginary, and ;?: (Zj2Z)r-l if K has r( > 0) real infinite places. A commutative R-algebra S is called a split- ting ring of A if the S-algebra S 0 A is isomor- 29 Ref. Associative Algebras phic to Ends(P) for some S-progenerator P. We denote by B(S/R) the subgroup of B(R) consisting of all algebra classes split by S. Since an Azumaya algebra over a ring need not have a Galois extension as splitting ring, the description of the Brauer groups by meanS of Galois cohomology ceases to have full generality. Instead we have the following exact sequence of t Amitsur cohomology, assuming S is an R-progenerator (Chase and Rosenberg [IIJ): O-->H I(SjR, U)--> Pic(R)-->Ho(SjR, Pic) -->H 2 (SjR, U)-->B(S/R)-->HI(S/R, Pic)--> H 3(SjR, U)--> , where we denote by U( 7') and Pic(T) of a commutative ring T the unit group and the +Picard group of rank 1 projec- tive modules of T, respectively. The full Brauer group B(R) is mapped monomorphically into H2(R, U)=limH 2 (SjR, U), the limit over the --> tfaithfully flat R-algebras S. Grothendieck and others studied the Brauer groups in a more general geometrical context [12]. References [IJ A. A. Albert, Structure of algebras, Amer. Math. Soc. Colloq. Pub\., 1939. [2J E. Artin, C. J. Nesbitt, and R. Thrall, Rings with minimum condition, Univ. of Michigan Press, 1944. [3J M. Deuring, Algebren, Erg. Math., Springer, second edition, 1968 (Chelsea, 1948). [4J N. Jacobson, The theory of rings, Amer. Math. Soc. Math. Surveys, 1943. [5J N. Jacobson, Structure of rings, Amer. Math. Soc. Colloq. Pub\., 1956. [6J B. L. van der Waerden, Algebra II, Springer, 1967. [7J C. W. Curtis and 1. Reiner, Representation theory of finite groups and associative alge- bras, Interscience, 1962. [8J J.-P. Serre, Corps locaux, Actualites Sci. Ind., Hermann, 1962. [9J G. Azumaya, On maximally central alge- bras, Nagoya Math. J., 2 (1951),119-150. [ 10J M. Auslander and O. Goldman, the Brauer group of a commutative ring, Trans. Amer. Math. Soc., 97 (1960),3677409. [ 1 IJ S. Chase and A. Rosenberg, Amitsur cohomology and the Brauer group, Mem. Amer. Math. Soc., 52 (1965), 34-79. [12J A. Grothendieck, Le groupe de Brauer, 1, II, III, Dix exposes sur la cohomologie des schemas, North - Holland, 1968, 46-188, [13J M.-A. Knus and M. Ojanguren, Theorie de la descente et algebres d' Azuma ya, Lecture notes in math. 389, Springer, 1974. [ 14J M. Orzech and C. Small, the Brauer group of commutative rings, Lecture notes in pure and app\. math. 11, Dekker, 1975. 
30 A Asymptotic Series 30 (X.19) Asymptotic Series A. Asymptotic Series and Asymptotic Expansions Let <Pn(x), n=O, 1, 2, be functions del1ned in R+, The sequence of functions {<Pn(x) }o is called an asymptotic sequence for x--> CIJ if <Pn+1 (x) =o(lpn(x)) as x--> 00 for each n. When {<Pn(x)};:;"o is an asymptotic sequence, the formal series L anlpn(x) is called an asymptotic series. In most cases, the func- tions Ipn(x) have the form .p(x)<p(x)". When <Pn(x)==x-n an asymptotic series is called an asymptotic power series, A function f(x) de- fined on R I is said to have the asymptotic expansion (or be asymptotically developable in the form) f(x) ao<po(x)+ a l <PI (x)+ ... + an <Pn(x) +... (2) as X  00 if f(x) satisfies f(x) - ao<po(x) - a, <PI (x) -... - an<Pn(x) = O(lpn+I(X» for any integer nO as X--> 00. The coefficients a, (n = 0, 1, ) appearing in (2) are uniquely determined. This fact immediately follows from the formulas a o = lim f(x)/<po(x),. ", xoo a n = \im (/(x)-ao<po(x)-... xoo -an-I <Pn-dx»/<Pn(x). For example, the tHankel function of the I1rst kind HI)(X) has an asymptotic expansion HI)(X) 1 l ( nv IT )] 00 (v,m) -exp i x---- L -  2 4 m0(-2ixt' (4v 2 -1)(4v 2 -3 2 )... [4v 2 -(2m-l)2J (v m)== 2 2m , rn. If v +t is not a positive integer, L:'=o (v, m) (- 2ixr m is not convergent, and (4) means that for each n, HI)(x)-exp l i ( x- vn - )J nf (V,) m  2 4 m0(-2IX) =O(X- n - I / 2 ) for X""" 00, Assume that f(x) and g(x) admit asymptotic expansions in power series for x-t 00. Then all functions del1ned by f(x)+g(x), f(x)- g(x), f(x)g(x), f(x)/g(x) (limxoo g(x) to) admit 134 asymptotic expansions in power series for X 00. Furthermore, if f(x) has an asymptotic expansion for x --> 00, its expansion is obtained by termwise differentiation of the f(x)'s. For any asymptotic power series Loanx-n there always exists a smooth function f(x) defined in R+ such that r(x)ao+alx-I +a 2 x- 2 +... (1) for x...... CXJ, and all the derivatives of f(x) are asymptotically developable in power series. Indeed, a function del1ned by 00 fIx) = L anO(x/tn)x- n n==O satisfies the required properties if O(t) is a smooth function satisfying { O. O(t)= 1, t.s; 1, t?2, and the sequence {t n } o satisl1es 1  to.s; t I  ... and (3) In(n+ 1) ...(2n-1)2na n t;II.s;2- n for all n. Until now only asymptotic series for X--> 00 have been considered, but the notion of asymptotic series has been adapted in vari- ous forms to the spaces of functions under consideration. (a) Asymptotic series in a complex domain. Let (X be a boundary point of an open con- nected domain D in the complex z-plane. When the asymptotic behavior for Z-->(X of holomor- phic functions in D is considered, we adopt as an asymptotic sequence {<Pn(z) };:;"o of ho10- morphic functions in D satisfying for each n <Pn+1 (z) = o(<Pn(z» (4) as z tends to (X through D. Let D be an angular domain with vertex at rJ.. For any asymptotic power series Lo an(z - (X)n there always exists a function f(z) that is holomorphic in D and that admits an asymptotic expansion f(z)ao+adz-(X)+... + an(z-C()" + ... (5) for z......(X. Concerning the uniform convergence of asymptotic expansions, Carleman's tbeorem is well known [2]: If the asymptotic expansion (5) is valid as z tends to (X through an arbi- trary angular domain in the Riemann sur- face oflog(z - a) having C( as its vertex, then the asymptotic expansion (5) is uniformly convergent. (b) Asymptotic expansions by a large para- meter. In the theory of differential equations, functions with a large parameter are often considered. Let Q be a domain in R" and k a parameter? 1, and let Ipn(x, k), n == 0, 1, 2,. , be functions defined for (x, k) E Q x [1,00). If 
135 they satisfy for each xEQ and n <Pn+l (x, k) = o({Pn(x, k)) for k...... 00, the sequence \ rpn(x, k)}no is called an asymptotic sequence, and the formal series Lo an(x)rpn(x, R) is called an asymptotic series. A function f(x, k) is said to be asymptotically developable in the form f(x,k) ao(x)cpo(x, k) + al (X)cPl (x, k) +... + an(x)CPn(X, k) + . for k-> 00 if for every xEQ and every n fix, k)-ao(x)cpo(x, k)-,., -an-! (X)CPn-l (x, k) = O(CPn(x, k)) (7) holds as k--> 00. Especially, if the order rela- tions (6) and (7) hold for each n uniformly in xED, we say that fix, k) is uniformly asymp- totically developable. In most cases, the rpn(x, k) are of the form .p(x, k)k -no B. Methods of Asymptotic Expansion There are several general methods of obtaining asymptotic expansions of functions represented by integrals with a large parameter. These methods are useful in applications because most tspecial functions are represented by integrals (- 39 Bessel Functions, 389 Special Functions). (a) Laplace's method. Let h(s) be a smooth real-valued function in [a-b, a + bJ (3 > 0) satisfying h(a)<h(s) for all sE[a-b,a+15J-{a} and li'(a) > 0, and let g(s)ECOO([a-b,a+bJ). Then f atO I(x) = e-XIi(S)g(s)ds o-s has an asymptotic expansion j2;. { I I(x)e-Xh(") g(a)  +CIX-l +.., V x v' h"(a) +Cnx- n +... } for X--> 00, where the C n are determined by derivatives of h(s) at S = aof order:;:; 2n + 2 and those of g(s) at S = a of order:;:; 2n. As an application of this method, con- sider the [-function fix) = SgJ e -ttx-l dt. Bya change of variable t = xs, we have fix) = XX J e -xls-logs)S -) ds. Since for any b > 0 \L-O e- x (,-logs)s-l d1 'If:o e -x(s-logS)S-1 ds I ;;:; Ce -x(1 +') (ex> 0), 30 c Asymptotic Series (6) the asymptotic expansion by the Laplace method f l+0 e-x('-Iog')s-ldse-x f27r (1 +cjx- l + ...) 1-0 V implies tStirling's formula nx)=JhxX-l/2e-X(1 +O(x- j )). (b) Stationary phase method. Let h{s} be a real-valued smooth function deti'ned in [a D, a + b] such that li(u) = 0, h"(u) f= 0 and h'(s) f= 0 for s #a, and let g(s) be a function in Cg'(a - b, a + 6). Then f. " +0 I(x) = eiXhls)g(s)ds o-s has an asymptotic expansion . h( ) . 4 1 { g(a) I(x)Ne 'X "e ,a "/ - +cjx- 1 +.. fi +CnX- n + } where (1= h"(a)/Ih"(u)) and the C n are deter- mined by derivatives of his) at s = a of order :0;; 2n + 2 and those of g(s) at s = a of order  2n. Further, tintegration by parts and the tmethod of steepest decent are often used to derive asymptotic expansions of integrals with a large parameter. C. Application to Differential Equations For a system of linear ordinary differential equations with an tirregular singular point at Z =0, even when there exists a formal power series solution, the power series is generally divergent. Taking the hint given by Stirling's formula for the [-function, Poincare intro- duced the notion of asymptotic expansions and succeeded in giving an analytic meaning to formal solutions of divergent type. Actual solutions for tdifference equations, tdifference- differential equations, and tordinary differen- tial equations of canonical form (including second-order linear tdifferential equations of confluent type) always have asymptotic expansions. In the study of certain ordinary differential equations containing a large parameter k, an asymptotic solution is often obtained in the form U(X,k) eik"'(X)(ao(x)+adx)k-l + + an(x)k -n +... (tWKB method). Also, in the study of linear 
30 Ref, Asymptotic Series partial differential equations, an asymptotic solution of the above form plays an important role. For example, consider the Schrodinger equation 1 au 1 L [ u J =----u-v ( x ) u=O. A iA "at (iA)2 An asymptotic series u(x, t; ),) N eiAq>(x,,) ( ao(x, t) + at (x, t) +... lA +an(x, t) CJ n +.) (8) satisfies L;Ju J -0 for ..1.--> 00 if o<p - (V<p)2 v(x)qJ = 0 (eikonal equation), ot oan -- 2V<p' Van-<pan =an-I' n==O, 1, 2, at (transport equation), where we set a-I == O. For each asymptotic solution of the form (8) there exists an actual solution of L;.[f(x, t; J.)J = 0 which is asymp- totically developable in the form 00 f(x,t;A) L eiAq>(X,t)a.(x,t)(iAr n n=O Then asymptotic solutions of the form (8) can describe the fundamental properties of the phenomenon governed by the equation LA[UJ =0, tGeometric optics can also be illustrated by asymptotic solutions of the form (8) ( 325 Partial Differential Equations of Hyperbolic Type L). The ideas of asymptotic series, asymp- totic expansions, and asymptotic solutions are essential to the theory of tpseudodifferen- tial operators and tFourier integral operators ( 345 Pseudodifferential Operators). References [IJ A. Erdelyi, Asymptotic expansions, Dover, 1956. [2J T. Carleman, Les fonctions quasi- analytiques, Gauthier-Villars, 1926. [3J H. Poincare, Sur les integrales irregulieres des equations Iineaires, Acta Math., 8 (1886), 295-344. [4J W. R. Wasow, Asymptotic expansions for ordinary differential equations, Interscience, 1965. [5J W. B. Ford, The asymptotic developments of functions defined by Maclaurin series, Scientitlc Series, Univ. of Michigan, 1936 (Chelsea, 1960). [6J M. Hukuhara, T. Kimura, and T. Matuda, Equations differentielles ordinaires du premier 136 ordre dans Ie champ complex, Pub\. Math. Soc. Japan, 1961. [7J F. W. J. Olver, Asymptotic and special functions, Academic Press, 1974. [8J G. D. Birkhoff, Collected mathematical papers, Dover, 1968. [9J V. P. Maslov, Theorie des perturbations et methodes asymptotiques, Dunord, Gauthier- Villars, 1972. (Original in Russian, 1965.) 31 (XVI.1) Automata A. General Remarks Automaton is a generic term for the math- ematical models for automatic machines hav- ing memory devices and capable of perform- ing definite acts within each unit of time. There are several types of automata, namely, Turing machines for the theory of computa- bility [6], linear bounded automata, push- down automata, and finite automata for math- ematical language theory [10-1 3]. These models are also used in the theory of tcomplex- ity of computation of algorithms [ 14]. B. Turing Machines Since the tarithmetization of metamathematics was used in the proof of tGMel's incomplete- ness theorem, mathematical formulations of the notion of teffectively computable functions have been attempted by many mathematicians, including S. C. Kleene, A. Church, and others, as well as by GOdel himself ( 356 Recursive Functions), With this end in mind, A. M. Tur- ing and E. L. Post independently introduced a sort of ideal computer [ 1, 2J, called a Turing machine. Such a machine is capable of being in one of a finite number' of internal states (possible contents of a memory device) qo, ql' ' , qn fixed for the machine; and the machine is supplied with a tape divided into squares. The tape is infinite in both directions. Each square can be blank or can have printed on it anyone of the previously specified symbols so, si, . , sm, where So stands for the blank. In any situa- tion the number of nonblank squares is finite. Let Q={qo,ql,...,qn}' M=={so,s"""sm},and consider the mapping t : Q x M-Q x M x {R, L, O}, A Turing machine T is specified by (Q, M, t, qo, F), where F is a subset of Q called a set of final states and qo is the specified ini- tial state. If t(q, s) = (q', s', x), then the 5-tuple (q, s, q', s', x) is called an instruction for T. The 
137 machine acts as follows: (1) At the start, the machine is in the state qo. Some sequence of symbols in M are written on the tape as an input, and the machine is on the square next to the leftmost nonblank symbol. (2) With respect to the present state q and the scanned symbol, the machine selects an instruction (q, s, q', s', x); then the next state becomes q', s is replaced by s', and the machine moves one square right or left or stands still according as x is R, L, or O. (3) The machine stops when the state becomes an element of F, and then the sequence of symbols on the tape is the result of the computation and considered to be the output. A Turing machine is called deterministic if there is always at most one instruction associated with a given pair (q, s); otherwise it is called nondeterministic. We represent a natural number y on the tape by y + I squares, each of which has the symbol SI printed on it. A pair (YI,Y2) is repre- sented on the tape by the following sequence of symbols of length y, + Y2 + 5: one So, the representation of y,; one 50, the representation of Y2; and one So. Similarly, in general we can represent a k-tuple (y" . , Yk) of natural num- bers on the tape. Let <D be a partial function of k variables, k) O. We say that a Turing ma- chine T computes <D if T starts with the tape in which the representation of (Xj , , x k ) is printed and stops with the tape having the representation of (XI' , xbx) when <D(XI' . ..> xd is defined and equal to x. A partial function is computable (in the sense of Turing) if there exists a Turing machine that com- putes the function. It is known that a partial function is computable if and only if it is par- tial recursive. The equivalence of comput- ability (in the sense of Turing) and recursive- ness for number-theoretic functions is strong evidence for the validity of Church's thesis (- 356 Recursive Functions), We can apply a Turing machine not only to the computation of number-theoretic functions but also to the transformation of a finite se- quence of letters in any language with a finite list of letters. A finite nonempty list of distinct symbols is called an alphabet, and a finite se- quence of symbols in the alphabet is called a word in that alphabet. All the words in an alphabet constitute a tfree monoid if we add the empty sequence and think of the concate- nation of two words as its multiplication. We call a finite nonempty list (A l' B I ), , (A" Bn) of pairs of words a set of production rules. By an application of a rule (A;, B i ) to a word W, we mean that an occurrence of A, exists in W, and by the application, Ai is replaced by B i . We say that two words Rand S are equivalent if they are transformable into each other by a finite number of applications of the production 31 D Automata ru1es. The following problem is called Tbue's (general) problem: Is there an algorithm for deciding, for any given alphabet and a set of production rules, whether or not any given two words are equivalent? E. L. Post and A. A. Markov proved by applying Turing machines that this decision problem is unsolvable [4,3] (a negative solution for the so-called word problem for semigroups). C. Universal Turing Machine and Generalizations Turing showed that it is possible to construct a machine U with the following property [ 1]: If we supply U with a tape on which a suitably cOded list of instructions of any given machine T is printed then U successively prints on it the behavior of T which results from the exe- cution of the instructions of T. In other words, U simulates the operations performed by any machine. Such a machine U is called a univer- sal Turing machine. We can construct a univer- sal Turing machine with only two internal states and also one having only two symbols, including the blank SO (C. E. Shannon and J. McCarthy [7J). Let j , . , k be given functions that are all defined everywhere on the domain considered. Then we can relativize ( 356 Recursive Func- tions) the notion of computability of functions (in the sense of Turing) to  I ' , 'h, Indeed, we can consider a machine acting as follows: When the machine is in a state qi J (j = 1, ." , k and we assume that these states are specified correspondingly to  I ' . , k and are not in F), then the machine prints the representation of .phi" Y,,) to the right of (YI" y,,) and scans the resulting (n j + I)-tuple (y" . , Ynj' j(YI' , y,,». In this process, the symbol a that is written to the right of (y" . . , y,,) is moved one step to the right so that a is pre- served on the right of the resulting (n + 1)- tuple. Extending the preceding, Kleene defined a machine that computes a functional of finite types [S]. A functional of variables of any finite types is computable by such a general- ized Turing machine if and only if it is partial recursive. D. Language and Automata Let M and N be mutually disjoint alphabets and V=MUN. Then G ==(N,M,P, S) is called a Chomsky grammar if P is a set of production rules consisting of finite number of pairs of elements of V* and S is specified element of N, where V* means the free monoid generated by V. For a word U E V*, if the result of a finite 
31 E Automata number of applications of rules in P is v, then we write U-->V, and v is said to be derived from u by G. The language generated by G is L(G) = {u S......u and uEM*}. N. Chomsky classi- fied the grammars into three families with respect to the form of production rules: (1) Context-sensitive grammar; For each (u, V)E P, the length of u is less than or equal to the length of v, (2) Context-free grammar; For each (u, V)EP, u is an element ofN and v is not a empty word. (3) Regular grammar; For each (u, v) E P, U E N and v is either an element of M or of the form ax, where aE M and XE N. Let F 1 (M), F 2 (M), and F,(M) be the families of languages generated by the grammars in the corresponding families classified as above. Then F1(M)i'J2(M)7F3(M) holds and this is called a Chomsky hierarchy [ 10]. Formal language theory studies various structures of these families of languages. Corresponding to the three families of lan- guages, we have the following families of auto- mata: (I) A deterministic (nondeterministic) linear bounded automaton is a Turing machine with input tape, namely, T= (Q, MU N, t, qo, FI, where Q is the set of states, M an input alpha- bet, N a tape alphabet, t is a mapping from Q x(MU{e}) x(NU{e}) into Q x N x {L,R} (the power set of Q x N x {L, RJ), qo E Q is the initial state, and F is a subset of Q. A word wE M* is placed on the input tape, and T starts its computation from the initial state with blank tape and by reading one symbol from the input tape, and with respect to the mapping t, we decide the configuration of the next step. If at a certain stage, T reaches a state in F when T reads up wand if the length of the tape used is a constant multiple of the length of the input tape, then w is said to be accepted by T. (The special symbol e makes it possible to change the configuration without moving the input tape.) (2) A push-down auto- maton is a Turing machine with an input tape. The working tape is of infinite length and moves in only one direction, say to the right. The computation with input word wE M* starts with the position at the left end of the work tape. It uses the tape according to a first-in-first-out principle; namely, the ma- chine may write a symbol to the blank square, but if it is on the rightmost nonblank square, it may not go left without erasing the non- blank symbol. If it reaches a state in F, then w is said to be accepted by T. (3) A finite auto- maton is a machine without its infinite tape; a word w is accepted if T reaches a state in F (without using the symbol e). Let L(T) be the set of words over M ac- cepted by T; then T) is called the language accepted by T. Corresponding to the three families of automata, DFJM) and NF,(M), i= 138 1,2,3, denote the family of languages ac- cepted by deterministic or nondeterministic automata. It is known that DFi(M) = NF,(M) = Fi(M) for i = 2 and 3, and N F j (M) = F j (M); however, whether D F j (M) = NF,(M) holds or not is still open, and this problem is called the LBA problem. These automata are used to study the closure properties under various operations within the families Fi(M). In par- ticular, the LBA problem is equivalent to the problem of whether F,(M) is closed under complementation. E. Applications of Automata Theory FinIte automata are formulated as models for digital circuits with memory. Therefore there exist structural studies of finite automata; for example, methods to decompose a given finite automaton to smaller automata correspond- ing to a circuit and its subcircuits are studied using tsemigroup theory [14, 17]. Context-free grammars are useful in describ- ing the syntax of programming languages. For a context-free grammar G, a derivation tree of x E L(G) is a tree whose terminal nodes are labeled by each symbol of x, its root is labeled by S, and other nodes are labeled by some elements of N in such a way that if for a node with A EN the nodes connected by an out- going edge from the former have labels Xl, ... ,xk> then (A, XI" Xk) is a production rule of G. Parsing is an algorithm to construct a derivation tree of each XE L(G). Various parsing algorithms have been studied within subfamilies of context-free grammars [ 16J and they are used in the construction of compilers. References [IJ A. M. Turing, On computable numbers, with an application to the Entscheidungs- problem, Proc. London Math. Soc., (2) 42 (1936), 230-265; 43 (1937), 544-546. [2J E. L. Post, Finite combinatory processes- formulation 1, J. Symbolic Logic, 1 (1936), 103-105. [3J A. A. Markov, On the impossibility of certain algorithms in the theory of associative systems I (in Russian), Dok\. Akad. Nauk SSSR, 55 (1947), 583-586; II, Dok\. Akad. Nauk SSSR, 58 (1947),3533356. [4J E. L. Post, Recursive unsolvability of a problem of Thue, J. Symbolic Logic, 12 (1947), 1-11. [5J S. C. Kleene, Introduction to metamath- ematics, Van Nostrand, 1952. [6J M. Davis, Computability and unsolva- bility, McGraw-Hili, 1958. 
139 [7J C. E. Shannon and J. McCarthy, Auto- mata studies, Ann. Math. Studies, Princeton Univ. Press, 1956. [8J S. C. Kleene, Turing-machine computable functionals of Unite types 1. Logic, metho- dology and philosophy of science, Proc, 1960 Intern. Congr. Logic, Standford Univ. Press, 1962, 38845: II, Proc. London Math. Soc., (3) 12 (1962),245-258. [9J H. Hermes, Enumerability, decidability, computability, Springer, 1965. [ IOJ N. Chomsky, Three models for the de- scription of language, IRE Trans. on Infor- mation Theory, 2 (1956), 1133124. [llJ J. E. Hopcroft and J. D. Ullman, Intro- duction to automata theory, languages, and computation, Addison-Wesley, 1979. [12J S. Eilenberg, Automata, languages and machines, 2 vols., Academic Press, 1974, 1976. [ 13J J. Berstel, Transductions and context-free languages, Teubner, 1979. [14J A. V. Aho, J. E. Hopcroft, and J. D. Ull- man, The design and analysis of computer algorithms, Addison-Wesley, 1974. [15J A. V. Aho and J. D. UIIman, Principles of compiler design, Addison-Wesley, 1977. [16J K. Krohn and J. L. Rhodes, Algebraic theory of machines I, Prime decomposition theorem for finite semi-groups and machines, Trans. Amer. Math. Soc., 116 (1965),450-464. [17J A. Nozaki, Practical decomposition of automata, Information and Control, 36 (1978), 275-291. 32 (X1.22) Automorphic Functions A. General Remarks [1 J Let X be a (complex) tanalytic manifold and r a tdiscontinuous group of (complex) analytic automorphisms of X. A set of tholomorphic functions (without zero) on X, {j,(z» (YE r), is called a factor of automorphy if it satisfies the condition j".(z) = jy(y'(z))jy'(z) for all y, / E r, ZEX. A tmeromorphic function 1 on X is called a (multiplicative) automorpWc func- tion with respect to the factor of automorphy {j,(z)} if it satisfies the condition I(y(z)) == f(z)jy(z) for all YE f, ZE X. When all j,(z) are identically equal to I, i.e., when f(z) is r- invariant, 1 is simply called an automorphic function with respect to r. When all jy are con- stant (and hence y-j, is a "quasicharacter" of r), [ is called a multiplicative function. If we denote by J,(z) the functional determinant of the transformation y, then for an integer m, an automorphic function 1 with respect to 32 B Automorphic Functions a factor of automorphy of the formj,,(z) = J,(z»" is called an automorphic form of weight tn, (In this case, it is customary to assume that fis holomorphic. Also, in this and most other cases, we add a suitable condition on the behavior off at the "points at infinity," when r\X is not compact (- 234 Kleinian Groups).) B. The Case of One Variable [2-6J Except for the cases where r is a Unite group or its extension by a free Abelian group of rank  2 ( 134 Elliptic Functions E), it is essentially enough to consider the case where X is the upper half-plane i) = {ZE C 1m z > O}, In this case, r may be obtained as a tdiscrete subgroup of the tspeciallinear group G = 5L(2, R). In the following, we restrict our- selves to the case where f is a tFuchsian group of the first kind (i.e., the case where the tHaar measure p(f\5) of the quotient space r\5 is finite; -" 122 Discontinuous Groups). An automorphic function (or Fuchsian function) 1 with respect to r is, by definition, a meromor- phic function 1 on i) satisfying the following conditions: (i) [ is f -invariant, i.e., [(yz) == [(z) for all YE f; (ii) 1 is also meromorphic around any tcusp Xo of r, i.e., if <Po is a real linear fractional transformation mapping Xo to 00 (e.g., Ipo(z) = -1/(2 -x o )) and if the trans- formation z  Z + h (h > 0) is a generator of 4?0 r x <Po I (where r x is the tstabilizer of x o ), then °f«(PO-l(Z)) can be expanded into a tLau- rent series of qh = exp(2ni/h)z) (which has only finitely many terms with negative exponent) in a neighborhood 1m z > y, of ex; (qh is called the local parameter around the cusp x o )' If we denote by 9i r the compact tRiemann surface obtained from the quotient space f\i) by adjoining a certain (Unite) number of "points at infinity" corresponding to the equivalence classes of cusps of f, then conditions (i) and (ii) amount to saying that [ gives rise in a natural manner to a meromorphic function on \H r , Thus the field 5\r of all automorphic func- tions with respect to r can be identified with the talgebraic function field belonging to the Riemann surface 9i r ( 11 Algebraic Func- tions A). (While any (nonconstant) automor- phic function with respect to a Fuchsian group of the first kind r has the real axis as its tnat- ural boundary, an automorphic function with respect to a Fuchsian group of the second kind can always be analytically extended to the lower half-plane through a neighborhood of any "ordinary point" on the real axis ( 234 Kleinian Groups).) Let k be an even integer. A (holomorphic) automorphic form (or Fuchsian form) .r of 
32C Automorphic Functions weight k or of dimension -k with respect to f is, by definition, a holomorphic function j on ,\S satisfying the following conditions: (i) For ( a b\ every 0' = Er, j(O'z)=j(z)'(cztd)k. (In cd) other words, J a (Z)-k/2 = (dO"(z)jdzr k / 2 = (cz + d)k is the factor of automorphy.) (ii) j is also holomorphic around any cusp xo. i.e" in the above notation, we have an integral "Fou- rier expansion" f(<pOI(Z))(d<pOl(Z)jdz)k/2 = Lo avq. Note that this definition of weight is different from that given in Section A (- Sections F, G). If, moreover, the constant term a, in this Fourier expansion vanishes at all cusps of f, j is called a cusp form. In partic- ular, fis a cusp form of weight 2 if and only if j(z)dz gives rise in a natural manner to a tdifferential form of the first kind on the Riemann surface 9i r . (By a slight modifica- tion of condition (ii), we can also define an automorphic form of odd weight k. When ( -1 0 ) E r, condition (i) (for k odd) im- o -1 plies that j is identically equal to zero. So as- ( -I 0\" sume that 0 -1 f f, Then cusps Xo are classified into two categories, according as ( -1 h ) o -1 E <Po fxo <p;; 1 or not, and in the first case we should replace the power series in qh in condition (ii) by q/2 X (power series in qh)') (Note that the above definition of weight is slightly different from the general one given in Section A. See also the Siegel and Hilbert modular cases (- Sections F, G).) We denote by Wlk(f) (resp. 6 k (r)) the linear space of all automorphic forms (cusp forms) of weight k with respect to f, Since clearly the relations Wlkmk' cWl k +k" Wlk6k' C 6 k + k , hold, the direct sum Wl(r) = LkWlk(f) is a (commuta- tive) tgraded algebra, of which 6(r) = Lk 6 k (f) is an ideal. It is known that Wlk(f) is of finite dimension, and d k = dim Wlk(f), dp = dim 6 k (f) can be determined (from the tRiemann-Roch theorem) as follows: dk=O for k<O; o f for It= >o, d o = 1, d o = 0 for {g for I = 0, d=g; d 2 = for t> 0, g+t-I dk=(k-I)(g-I)+ t (  ( 1- )) +t, i1 2 e i 2 dP=dk-t for k even, 4, where s is the number of the equivalence classes of telliptic fixed points of r, e i (1 :( i :( s) is the order of the stabilizers of these elliptic points, t is the number of the equivalence 140 classes of cusps of f, and g is the tgenus of the ( -1 0 ) Riemann surface 9i r . (When 0 -1 rj;r, we can obtain a similar formula for k odd except for d l , d?, which, in general, cannot be determined from the Riemann-Roch theorem.) One method of constructing automorphic forms is provided by Poincare series as fol- lows: Let 0"0 be a tlinear fractional transfor- mation that maps the unit disk to the upper half-plane (e.g., the tCayley transformation), and put r' = 0"0- 1 rO"o; let <p be a function holomorphic on the unit disk including its boundary (e.g., a polynomial). Then for k 4, the series P,(z)= L <p(0"'(z)(dO"'(z)jdz)k/2 u'er' is tuniformly convergent in the wide sense (i.e., uniformly convergent on every compact set) in the unit disk and expresses an automorphic cusp form of weight k with respect to r', i.e., we have Pq>(aol(z))(dO"ol(z)jdz)k/2E6dr). Conversely, every element in 6 k (f) (k4) can be expressed in this form. A series of this type is called a theta-Fuchsian series of Poincare (or simply Poincare series). For k ? 3, we can define a (positive definite Hermitian) inner product on 6 k (f) as follows: (f,g)= Lf(Z) 9(z) yk-2 d X d Y , where z = x + iyand F is a tfundamental region off in,\S. This is called the Petersson metric. Since this integral converges also for f, gE Wlk(f) if one of them belongs t{) 6 k (r). we can define the orthogonal complement k(f) of 6 k (f) in Wlk(f), As we shall see, k(n is generated by tEisenstein series. Any automorphic function j with respect to r can be expressed as a quotient of two auto- morphic forms II '/2 E \JJlk(f) for a sufficiently large k. If we put w = I(z), the inverse function z = j -I (w) can be expressed as the quotient of two linearly independent solutions of a tIinear differential equation of the form d 2 zjdw 2 = <p(w)z, where <p(w) is an talgebraic function belonging to the Riemann surface 9i r . C. Modular Functions and Modular Forms [4,7,8J The (elliptic) tmodular group f==[(l) ==SL(2, Z) = {G )la,b,c,dEz,ad-bc= I} is a Fuchsian group of the first kind acting on 
141 the upper half-plane . The quotient space r\ can be compactified to get a compact Riemann surface 9\r = r\ U { 00 } of genus zero by joining one point at infinity corre- sponding to the cusp 00, around which we take q = e hiz as a local parameter. The dimen- sion d k = dim WZk(r) (k even,  2) is given as follows: d = J[k/12] for k=2 (mod 12), k [k/12J+ 1 for h!_2 (mod 12), where [ ] is the tGauss symbol. More generally, an automorphic function (automorphic form) with respect to the prin- cipal congruence subgroup r(N)={(; )=C }modN)} is called a modular function (modular form) of level N. (For the number of cusps of r(N), denoted by t(N), and the genus of 9\r(N) -->. 122 Discontinuous Groups.) At any cusp of T(N), the local parameter qh in (ii) is given by qN = exp(2niz/ N). For N, k  3, the dimensions d k , df are given by the general formula of the preceding paragraph (including odd k). For k ) 3, we define the (extended) Eisenstein series by I G k (Z;C I ,c 2 ,N)==  ( k' mi=ci(modN) m l +rn2z) where c 1 , C 2 are integers such that(c p c 2 , N) = 1 and the symbol L' denotes the summa- tion excepting the pair (rn l , rn 2 ) = (0,0). Then Gk(z; C p c 2 , N) depends only on c 1 , C 2 (mod N), and if we take a set {(c p c 2 )} such that {(CI,C 2 ), ( - c I' - c 2)} forms a complete set of repre- sentatives of the (primitive) residue classes (mod N), then the corresponding set of Eisen- stein series forms a basis of the space <1\ (r(N) (k  3), whence we obtain dim (fk = d k - df = t(N) (k) 3) (for details, including the case k= I, 2, see E. Hecke, Abh Math &m Univ. Harn- rorg, 5 (1927». The Fourier coefficient a, of the Eisenstein series can be calculated easily, and we get an estimate a, = O(n k -1+') for k) 2, where B is an arbitrary positive number. On the other hand, for cusp forms, Hecke (Abh. Math. Sern. Univ. Hamburg, 5 (1927» gave an = O(n k / 2 ). In an attempt to improve this estimate, H. D. Kloos- terman was led to consider the sum ( 2ni ( V )) K(u,v,q)= I exp - ux+- x(modq) q X (x,q)=1 (u, V,4 E Z), called the Kloosterman sum, which is also related to the arithmetic of quadratic forms. Using A. Weil's estimateIK(u, v,p)1 2JP 32 C Automorphic Functions (where p is an odd prime, (u, p) = (v, p) = I), based on the analog of the Riemann hypoth- esis, we obtain a, = O(n k / 2 - 1 / 4 +,). The estimate an = O(n kj2 -1/2+') is a weak form of what is known as the Ramanujan-Petersson conjec- ture ( Section D). In the case N = I, we ob- tain the classical Eisenstein series Gk(z) = Gdz; 0,0, 1) (k even, ) 4), from which we define the modular forms g2(Z)== 60G 4 (z), g3(Z) = 140 G 6 (z), and A(z) = gi - 27g of weight 4, 6, and 12, respectively. If we denote by f.J = gJ(u; 1, z) the Weierstrass tgJ-function with the fundamental period (1, z) we have the relation &0'2 = 4gJ 3 -g2 gJ -g3' and A(z) is the tdiscriminant of the cubic polynomial appear- ing in this relation. It is known that every modular form can be expressed uniquely as a polynomial in g2 and 93' or in other words, we have \JJl(r(I))C[g2,g3]. The polynomial in g2' g3 expressing a modular form is, moreover, isobaric, i.e., consists of terms of the form cg/93 v , CEC, where 211+ 3v is a constant called the weight of the isobaric polynomial. Also, A(z) is a cusp form of the smallest weight, and the ideal of all cusp forms, 6(r( 1)), is a principal ideal in WZ(r( 1» generated by A(z). The Fourier expansion of the Eisenstein series Gk(z) is given as follows: 1 ( 00 v ) G k (z)=(2n)k, B k +( -1)k/22k 2: VH I k. v=1 1-q where q = e2iz and Bk is the tBernoulli num- ber. The discriminant A(z) is expressed as an infinite product as follows: 0) (z)=(2n)12q fl (1 _q')24. vl If we putJ(z)=g/, the function J(z) is a modular function and gives an tanalytic iso- morphism of the Riemann surface 9\r(1) = r( I )\ U { 00 } onto the Riemann sphere C U {oo} (which maps (3' (4' r:JJ to 0, 1, 00, respectively). Hence the field of modular func- tions S{ql) is a rational function field C(J) generated by the function J. The analytic isomorphism class of komplex tori of dimen- sion 1 (= telliptic curves) E("".",,) = C!(ZWI+ Z( 2 ) with the fundamental period (WI' (2) is uniquely determined by the r( I)-equivalence class of the modulus r = W 2 /W I (E ), and hence by the value J(t) of the function J. This is the historical origin of the name of tmodular function [4]. As an example of a modular function of level 2, we have the ).-function: 3O((t + z)/2) - 30 (z/2) A(Z) = g'J (1/2) - gJ (z/2) I which gives an analytic isomorphism of the Riemann surface 9\[(2) = r(2)\ U (three 
32 D 142 Automorphic Functions points) to the Riemann sphere CD { oo} (map- ping 0, 1,00 to 1,00,0, respectively; this prop- erty is used in a proof of the tPicard theo- rem). Hence we again have stf(2) = C(A), The following relation holds between J and A: 4(1-..1.+..1. 2 )3 J( z)- -i 7 ,P(I-A)2 In general, the field .Rr(N) of modular func- tions of level N is generated by (g2jg3)' g'J((al +a2z)jN; l,z)(a 1 ,a 2 EZ). D. The Heeke Ring and Its Representation [S-I13 In general, let f be a group, r a subgroup of f, and suppose that, for every 0" E f, or a-I and r are tcommensurable, which amounts to saying that every double coset r 0" r is a union of a finite number of left or right cosets of r Then in the free module generated by all double cosets rO"r (O"E f'), we can define a bilinear associative product as follows: rO"r-rrr =') m(O", 7;p)rpr, Hr where, writing faf = Ui rO"j,r,f = Uj rr j , we denote by m(O", 7; p) the number of pairs (i,j) such that r O"irj = f p. The associative ring thus obtained is called the Heeke ring (or Heeke algehra) and is denoted by %(f, f). (If f is a ttopological group and r is an open compact subgroup, the Hecke ring %(f', r) can also be interpreted as the ring of all Z-valued continu- ous functions on f with compact support provided with a tconvolution product; -" 29 Associative Algebras C.) As an example, setting r = GL +(2, Q), f = SL(2, Z), we obtain a Hecke ring %(f, r). In virtue of the theory of telementary divisors, a complete set of representatives of r\f' jr is given by {(l Jlal,a2EQ,aj,a2>0,alla2}' . ( a 1 0 ) Wewnte T(a 1 ,a 2 )=r r. From the o a 2 relation T(a l a 2 , a l a2)T(al, a;l)== T(a p a 2 ) follows the commutativity of %(f, f). Fur- thermore, if we put, for a positive integer n, 1;.== T(n) I T(aj, a 2)' al,a2 EZ QIQ2=n then the following multiplication formula holds: T(n)' 'frm) = L dT(d, d). T(nm/d 2 ). d!n,m We obtain a representation of %(f, r) in Wlk(f) as follows. For T= rar = u rO"jE %(f, f) and jE9Jl k (r) we put UI T)(z) = Lf(O";(z))(ciz+d;)-kdet(O"Y, i ( a. b. ) where 0", = I I and I is a fixed integer. This c; d i representation leaves 6 k (r) and 6\(r) invar- iant. In particular, the representation of 1;. (with 1= k 1), called the Heeke operator (Hecke, Math. Ann., 114 (1937», is tHermitian with respect to the Peters son metric. Following Hecke, we associate with every modular form f(z) = L:'=o anqn (q = e 2niz ) a tDirichlet series <p(s) = I ann-'. Since a, = O(n k -1+e) for fE9Jl k (k;::2), <p(s) is absolutely convergent in the half-plane Res > k The conditions for f to be a modular form of weight k are equivalent to the following con- ditions for <p: (s-k) (jJ (s) can be extended to an Tentire function of fmite tgenus (actually, of genus 1) and, if we put R(s) = (2n)-sr(s)<p(s), R(s) satisfies a functional equation of the formR(k- s) = (-It/ 2 R(s). A correspondence between f and rp that satisl1es these con- ditions is one-to-one; in fact we have a, = ( -1 )k/2 Ressk R(s), and the function g(x) = f(ix) - ao(x > 0) and R(s) are related by the tMe11in transform as follows: R(s)= 1 00 g(x)x s - I dx, g(X)=.f- r R(s)x-"ds. 1[1 J Res=ao This correspondence between Dirichlet series and automorphic forms can be generalized to the case of a certain discrete subgroup of SL(2, R), which is not necessarily of the first kind (Heeke, Math. Ann, 112 (1936); [ 10J). It is also known that if the functional equations for LI x(n)ann- S hold for sufficiently many char- acters X, then f == LI anqn is a cusp form with a character for some congruence subgroup (Weil, Math, Ann, 168 (1967); [12J). Now, suppose that a linear subspace Wl of \JJldr) is invariant under a11 1;. (n = 1, 2, . . . ), and let Ul , . . ,JK) (K == dim Wl) be a basis of Wl, If we denote by A(n) the K x K matrix represent- ing 1',. in this basis, it can be shown that there exist K x K (complex) matrices B(i) (1  i  K) and A(O) such that we have 00 K F(z) = n qn A(n) =  /;(z)B(i) Similarly we associate with F(z) a matrix- valued Dirichlet series 00 <1>(s) = L n -s A(n). n=1 Then from the multiplicative property of the A(n), we obtain the following tEuler product 
143 expression of $(s): $(s) = f1 (IK - A(p)p -s + pK-t-2SIK)-I, p where IK denotes the unit matrix of degree K. In particular, when dim \JJl = 1, i,e" when Wl = Cf, where I is an teigenfunction of all Hecke operators T", the corresponding Dirichlet series <p(s) has an Euler product of the follow- ing form: <p(s) == Il (1- ApP -s + p k - I - 2 T I , p For instance, the Dirichlet series correspond- ing to the Eisenstein series G k is (2(2ni)k (k- l)!)((s)((s- k+ 1) (where' is the tRiemann zeta function), which has a well-known Euler product. Since 6 12 = CA, the Dirichlet series corresponding to 1\(z) also has an Euler prod- uct. (This is part of the Ramanujan conjecture. Ramanujan also,conjectured that the quadra- tic polynomial 1 - ApX + pi I X 2 appearing in this Euler product has imaginary roots, i.e., IApl < 2 p l 112.) Generalizing the Ramanujan conjecture, Petersson conjectured that all eigenvalues of the Hecke operator Tp in 6k have absolute value  2p(k-t)!2 (Ramanujan- Petersson conjecture). P. Deligne proved this conjecture for k  2, and his method extends to the case when r is a congruence subgroup (Publ. Math Inst. HES, 53 (1974». For k= 1, see Deligne and J.-P. Serre (Ann. Sci. &de Nann. Sup., 7 (1974». It may be noted that this conjecture was proved earlier for k = 2 (and for almost aJlp) by M. Eichler (1954) and by G. Shimura (1958) ( 450 Zeta Functions M). E. The Case of Many Variable; [1, 15,161 Let X be a bounded domain in eN and r a tdiscontinuous group of analytic automor- phisms of X. The Poincare series of weight (  Section A) m 2, defined similarly to the case of the unit disk, converges normally in X and expresses an automorphic form of weight m If f\X is compact, let UI' JK) be a basis of the space of Poincare series of weight m Then for a sufficiently large m (which is a multiple of the order of r x for all x E X), the map X '3 z...... (f,(z), ,f.(z» defines in a natural manner a projective embedding of the quotient space f\X into P K - t (C), which actually gives an analytic isomorphism of r\X onto a tnormal projective algebraic variety. It follows that the field of automorphic functions with respect to f is an talgebraic function field of dimension N, and that every automorphic function can be written as the quotient of two Poincare series of the same weight. In the case where f\X is not compact, we first discuss some examples. 32F Automorphic Functions F. Siegel Modular Functions [15,17, 18J The Siegel upper half-space bn of degree n (or Siegel space of degree n) is, by definition, the space of all n x n complex symmetric matrices Z = X + iY with the imaginary part Y > 0 (posi- tive definite). (bn is analytically equivalent to a tsymmetric bounded domain.) The group of all (complex) analytic automorphisms of fin is given by the real (projective) tsymplectic group Sp(n, R)/{ :t f 2n }, which acts transitively on fin by Z -->O"(Z) = (AZ + B)/(CZ + D) for 0"= ( A B\ C D J Sp(n, R). The subgroup f n = Sp(n, Z) consisting of integral matrices, or the corre- sponding group of linear fractional transfor- mations, is called the Siegel modular group of degree n. f n is a tdiscontinuous group of the first kind acting on bn. A Siegel modular form I of weight k or of dimension -k is, by definition, a holomorphic function on bn satisfying the following con- ditions: (i) for ()= ( %)E fn,f(O"(Z)) == f(Z)det(CZ + Dt: (ii) I has an integral Fourier expansion of the form f(Z) = LT;'O aTe2nitr(TZ), where T runs over all half-integral tpositive semidefinite symmetric matrices of degree n. (Note that the Jacobian determinant J,(Z) of ( A B ) . . (J= c D IS gIVen by Ja(Z) = det(CZ + D)-k-l) For n2, condition (ii) is superfluous (M. Koecher, Math. Z., 59 (1954». We denote by !Ilrt) = \JJldrn) the space of all Siegel modu- lar forms of degree n and weight k When we write bn 3 Z = (1 3z) with ZI E tin-I' 3E cn-I, zEtI" and \JJlt n )3f(Z) = f a n (ZI' 3)e2ninz, n=O it turns out that ao(ZI' 3) depends only on ZI and, writing it as II (ZI)' we have 11 E \JJlt n - I ). The mapping $: I -+ II thus defined is a linear mapping from !JJltn) into \JJlt n - I }, which is sur- jective if k is even and> 2n (H. Maass, Math. Ann., 123 (1951». We denote the kernel of <I> by ISt n ) and call IE ISt n ) a cusp form (viewing b'-I as a cusp of bn)' For IEWln), the fol- lowing three conditions are equivalent: (a) IE 1St); (b) f(Z) = LT>OaTe2nitr(TZ); (c) I/(Z)det( Y)k/2 is bounded. We have Wln) = {O} if k < 0 or if both k and n are odd, and Wlb') = C. In general, dim Wln) is finite and = O(k nln + I )/2) (k-t co). The graded algebra \JJl(r n ) = Lo Wlln) is finitely generated, and its tKruIJ dimension is n(n + 1)/2 + 1. In par- ticular, for n= 2 the structure of the graded algebra Wl(r 2 ) is determined explicitly. (The even part of this algebra is isomorphic to the 
32 F Automorphic Functions polynomial ring of four variables.) Also, dim 9R2) and the singularities of the quotient variety f 2 \2 are known explicitly (1. 1. Igusa, Amer. J Math, 84 (1962), 86 (1964); U. Christian). As an example of Siegel modular forms, we have the Eisenstein-Poincare series, deljned as follows: Es,k(Z) == L e 2ni 'r(Sa(z» det( CZ + D) -\ <Tern(s)\r ll where S is an n x n rational symmetric matrix ;:: 0, and we put rn(s) = { ( ,-I)Efnl'USU =S, e 2nitr (ST) = det( U)k} This series is convergent for k> n + rank S + 1 if rank S <n and for k> 2n if S >0, and the totality of Es.k (those series with S > 0) spans 9Rkn) (6k n ) (Maass, Math. Ann., 123 (1951». tTheta series del1ned by integral quadratic forms are examples of Siegel modular forms (with a certain level) and are of great signifi- cance in the arithmetic of quadratic forms (C. 1. Siegel, Ann Math, (2) 36 (1935». The quotient space f n \n can be compacti- fied as follows (I. Satake and W. Baily [15J): There exists a positive integer ko such that, for any multiple k of ko, any basis of 9Rn) defines in a natural manner a one-to-one biholomorphic projective embedding of fn\n' of which the image is a Zariski open set of a normal projective algebraic variety. Since the structure of this projective variety is indepen- dent of k, we denote it simply by f n \5. . In other words, f n \tln is the projective variety Proj \JJl(f n ) associated with the graded algebra 9R(f.). Then we have fn\n = U;=o f,\tI,. When n  2 for r < n, f, \r is of codimension ?< 2 in f n \n' so that (by virtue of tHartogs's continuation theorem) the conditions at the points at infinity in the del1nitions of modular functions and forms become superfluous. In fact, for n ;:: 2, if we define a Siegel modular function of degree n simply as a fn-invariant meromorphic function on tin, then it can auto- matically be extended to a meromorphic func- tion on the compactillcation f n \tln, and hence be expressed as the quotient of two modular forms of the same weight. It also follows that the field of all Siegel modular functions of degree n is an algebraic function l1eld of di- mension n(n t 1)/2. (These results can also be obtained from the tpseudoconcavity of f n \tln without using compactillcation (A. Andreotti and H. Grauert, Nachr. Akud Wiss. Gottin- gen).) For n = 2, the field of Siegel modular 144 functions is a rational function l1eld. On the other hand, for PI == 1 mod 8, n > 9, the field of Siegel modular functions is not rational (E. Freitag, Abh Math Sern Univ. Hamburg (1978». Let r be a subgroup of f n of finite index. Then f\n can be compactified in the same way as above [23]. However the singularities of r\n are extraordinarily complicated (Igusa [24J). When r is the principal congruence su bgroup fn( N) of level N > 2, the blowing-up of fn (N) \ n along the cusps is nonsingular for n = 2, 3 (Igusa, Math Ann, 168 (1967». F 01- lowing Igusa's idea, it is possible to construct an explicit nonsingular compactification of f\D, in which D is a bounded symmetric domain and f is a suitable arithmetic sub- group of Aut(D) ([19J; see also Satake, Bull. Amer. Math Soc., 79 (1973». For D = f) x tI and r the Hilbert modular group ( Section G), F. Hirzebruch has given explicit nonsin- gular compactillcations of f\D (Enseignement Math., 19 (1973». The Hecke theory (...... Section D) can also be extended to the Siegel modular case. For any integer m> 0, let 8m = S:;:) be the set of 2n x 2n integral matrices M such that 'M J M = mJ, where J= ( 0 - In ) and In is the identity In ° matrix of degree n. We define the Hecke operator 'frm) on \JJli n ) by T(m)f =m nk - n (n+I)/2 L det(CZ ( g)MEV(m) + D) -kf(MZ), where fE 9Rn), 8m = U M€V(m) fnM, and V(m) is a complete system of representatives of fn\8m. Then for any prime number p, the formal operator series D,(X)= Lo T(p.)X n is form- ally equal to a rational function in X: D X ) - P".(X) p( - Qp(X) , in which P,(X) and Q,(X) are polynomials in X of degree 2" _ 2 and 2 n , respectively, with coefficients in the ring of Hecke operators (Satake, Pub I. Math Inst. HES, 18 (1964». For n = 2 we have (Shimura, Proc. Nat.Acud &i. US, 49 (1963» D p (X)=(I- p 2k- 4 X 2 ){1- T(p)X +(T(pf _ T(p2)_ p2k-4)X 2 _ p2k-3 T(p)X 3 +p4k6X4} -I. Let f(Z) E \JJlL 2 ) be a Siegel modular form of degree 2 such that T(m)f == A(m)f for all m?< 1. We define Lf(s) by 00 A(m) L f (s)=((2s-2kt4) L -;--, m1 m 
145 in which ((s) is the Riemann zeta function. It is known that L f(s) can be meromorphically continued to the entire plane and satisfies the functional equation 'l'f(2k - 2 - s) = '¥f(S)' where Y,(s) = (2n)-2sr(s)r(s - k+2)L f (s) and the function (s - k)(s - k + 2)'I'f(s) is entire (A. Andrianov, Trudy Math I nst. Stekloo., 112 (1971». G. Hilbert Modular Functions Following a suggestion of Hilbert, O. Blu- menthal studied a generalization of modular functions of the following type: Let K be a totally real talgebraic number field of Imite degree, and let K(l), , K(n) be the conjugates of K, where n = [K : Q]. ("Totally real" means that all the K(i) are real.) For IJ. E K, we denote by rx(i) E K(i) the ith conjugate of a. Let !i be the +Principal order of K, and consider the group r'D = SL(2, D) = {O"=(; ) I ex, p, y, (5E!i, ex(5 - py = I}. We define the action of ['D on the n-fold prod- uct of the upper half-plane 5 n = {z = (ZI ' . , z.) ZiE5} by ( rx(1)z +p(1) ex(n)z +p(n» ) a(z)= I n y(1) Z 1+ (5(l) , ..., y(n) Zn + (5(n) for 0" == ( :). then ['D becomes a discon tin u- ous group of the first kind (f'D can also be considered as an tirreducible discrete subgroup of SL(2, R)"). The group r 1\ is called the Hil- bert modular group of K (in the strict sense). (The defmition is sometimes modilled by re- placing the condition afj - fJy = 1 by, say. ex(5 py =(totally positive tunit).) If the tclass num- ber of K is h, the quotient space ['D \5 n can be compactified by adjoining h points at in- finity. Therefore, if n ;<: 2, a Hilbert modular function can be defined as a meromorphic function on 5 n invariant under [1\; and simi- larly a Hilbert modular form / of weigbt k or of dimension -k as a holomorphic function / on 5 n such that UI 0 )(z) = f(lT(z))rr71 (y(i) z, + (5(i»-k = f(z) for all (JE r!,. (In the latter case, fis holomorphic at all cusps, i.e., for every O"E SL(2, K), /10" has an integral Fourier ex- pansion.) For Hilbert modular functions and forms, results quite similar to those for the case n = 1 or the case of Siegel modular groups have been obtained (Kloosterman, Maass, K. B. Gundlach, H. Klingen). 32 Ref. Automorpbic Functions H. Furtber Generalizations As further generalizations of the notion of modular function, we have Hilbert-Siegel modular functions (1.1. Pyatetskil-Shapiro, Baily, Christian [20J, where we can tlnd some 150 references), Hermitian modular functions (H. Braun, Klingen), etc. For the most general case, i.e., for an arithmetically defined discon- tinuous group acting on a tsymmetric bounded domain, a unified theory of automorphic func- tions has been established in the works of Pyatetskil-Shapiro [21, 22J and Baily and A. Borel [23,24]. On the other hand, exactly as in the classical theory, where the elliptic modular function gave an invariant of I-dimensional complex tori, generalized modular functions can be viewed as giving an analytic invariant of a certain family of (polarized) t Abelian varieties. From this point of view, a deep number- theoretic (and algebrogeometric) study of automorphic functions (initiated by Hecke and Eichler) has been substantially carried forward by the work of Shimura (see the series of his papers in Ann. Math starting from vol. 70 (1959); see also [9,24,25]). From the analytic point of view, the theory of automorphic functions is closely connected with the unitary representation of G in the space L 2 ([\ G) (or its adelic analog) [26,27J (for adelic analogs -->. [ 12,131). In this respect, the ttrace formula of A. Selberg [28J, gen- eralizing the +Poisson summation formula, is of fundamental importance; and actually it can be used effectively for calculations of the dimension of the space of automorphic forms and of the trace of Hecke operators (R. P. Langlands, Amer. J. Math, 85 (1963); H. Shimizu, Ann. Math, (2) 77 (1963), J. Fm:. &i. Univ. Tok;yo, 10 (1963); also [24]). When X = GIK is a symmetric domain, we can define, for any representation p of K, a (matrix- valued) canonical automorphy factor, by which we defme (vector-valued) automorphic forms with respect to a discrete subgroup f of G, and under a further condition (say, r free, r\X compact, and the thigh est weight of p sufftciently large) we obtain a formula for the dimension of the space of such automorphic forms in terms of the tarithmetic genus of r\X and certain numbers related to the "dual" Xu = Gj K and the representation p [ 15,29,30]. References [1J Sem. H. Cartan 6, Theorie des fonctions de plusieurs variables; Fonctions automorphes et espaces analytiques, Ecole Norm. Sup., 1953- 1954. 
33 A Axiomatic Set Theory [2J M. Eichler, Einflihrung in die Theorie der algebraischen Zahlen und Funktionen, Birk- hauser, 1963; English translation, Introduction to the theory of algebraic numbers and func- tions, Academic Press, 1966. [3] R. Fricke and F. Klein, Vorlesungen iiber die Theorie der automorphen Funktionen, Teubner, 1, 1897; II, 1901; second edition, 1926. [4J F. Klein and R. Fricke, Vorlesungen iiber die Theorie der elliptischen Modulfunktionen, Teubner, I, 1890; II, 1892. [5J J. Lehner, Discontinuous groups and automorphic functions, Amer. Math. Soc. Math. Surveys, 1964. [6J H. Poincare, Memoire sur les fonctions fuchsiennes, Acta Math., 1 (1882), 193-294 (Oeuvres, Gauthier-Villars, 1916, vol. 2, 167- 257). [7J A. P. Ogg, Modular forms and Dirichlet series, Benjamin, 1969. [8J G. Shimura, Introduction to the arithmetic theory of automorphic functions, Publ. Math. Soc. Japan, Iwanami Shoten and Princeton Univ. Press, 1971. [9J M. Eichler, Quadratische Formen und Modulfunktionen, Acta Arith., 4 (1958),217- 239. [ IOJ E. Hecke, Mathematische Werke, Van- denhoeck & Ruprecht, 1959. [ 1 IJ E. Hecke, Dirichlet series, modular forms and quadratic forms, Lecture notes, Institute for Advanced Study, Princeton, 1938. [ 12J H. Jacquet and R. P. Langlands, Auto- morphic forms on GL(2), Lecture notes in math. 114, Springer, 1970. [13J Automorphic forms, representations and L-functions, Amer. Math. Soc. Proc. Symposia in Pure Math., 33 (1979). [14J Modular functions of one variable I-VI, Lecture notes in math., Springer, 320 (1973); 349 (1973); 350 (1973); 476 (1975); 601 (1977); 627 (1977). [ 15J Sem. H. Cartan 10, Fonctions automor- phes, Ecole Norm. Sup., 1957-1958. [ 16J C. L. Siegel, Analytic functions of several complex variables, Lecture notes, Institute for Advanced Study, Princeton, 1950. [ 17J H. Maass, Siegel's modular forms and Dirichlet series, Lecture notes in math. 216, Springer, 1971. [18J C. L. Siegel, Einflihrung in die Theorie der Modulfunktionen n-ten Grades, Math. Ann., 116 (1939),617-657 (Gesammelte Ab- handlungen, Springer, 1966, vol. 2, 97-137). [19J D. Mumford, et aI., Smooth compactifica- tion of locally symmetric varieties, Math. Sci. Press, 1975. [20J U. Christian, Zur Theorie der Hilbert- Siegelschen Modulfunktionen, Math. Ann., 152 (1963),275-341. 146 [21J 1.1. Pyatetskii-Shapiro (Pjateckii-Sapiro), Automorphic functions and the geometry of classical domains. Gordon & Breach, 1969. (Original in Russian, 1961.) [22J 1. 1. Pyatetskil-Shapiro (Pjateckii-Sapiro), Arithmetic groups in complex domains, Rus- sian Math. Surveys, 19 (1964),83-109. (Orig- inal in Russian, 1964.) [23J W. L. Baily and A. Borel, Compactifica- tion of arithmetic quotients of bounded sym- metric domains, Ann. Math., (2) 84 (1966), 442-528. [24J Algebraic groups and discontinuous sub- groups, Amer. Math. Soc. Proc. Symposia in Pure Math., 9 (1966). [25J G. Shimura, Automorphic functions and number theory, Lecture notes in math. 54, Springer, 1968. [26J I, M. Gel'fand, M. I. Graev, and I. I. Pyatetskil-Shapiro (Pjateckii-Sapiro), Gen- eralized functions 6, Representation theory and automorphic functions, Saunders, 1969. (Original in Russian, 1966.) [27J Harish-Chandra, Automorphic forms on semisimple Lie groups, Lecture notes in math. 62, Springer, 1968. [28J A. Selberg, Harmonic analysis and dis- continuous groups in weakly symmetric Rie- mannian spaces with applications to Dirichlet series, J. Indian Math. Soc., 20 (1956),47-87. [29J Y. Matsushima and S. Murakami, On vector bundle valued harmonic forms and automorphic forms on symmetric Riemannian manifolds, Ann. Math., (2) 78 (1963),365-416, [30J Y. Matsushima and S. Murakami, On certain cohomology groups attached to Her- mitian symmetric spaces, Osaka J. Math., 2 (1965), 1-35, [31J R. Tsushima, A formula for the dimen- sions of spaces of Siegel cusp forms of degree three, Amer. J. Math., 102 (1980),933-977. 33 (1.5) Axiomatic Set Theory A. General Remarks Axiomatic set theory pursues the goal of rees- tablishing the essentials of G. Cantor's rather intuitive tset theory by axiomatic construc- tions consistent with modern theories of the foundations of mathematics. A system of axioms for set theory was first given by E. Zermelo [4 7J, and was completed by A. Fraenkel [8]. J. von Neumann [30J expressed it in tsymbolic logic, gave a formal generalization, and eliminated ambiguous concepts. P. Bernays and K. G6del [2, IIJ 
147 further refined and simplified von Neumann's formulation. The theories based on the sys- tems before and after the formal generaliza- tion are called Zerme1o-Fraenke1 set theory (ZF) and Bernays-Godel set theory (BG), respectively. B. Zerme1o-Fraenke1 Set Theory ZF is a formal system expressed in tfirst-order predicate logic with the predicate symbol = (equality) and based on the following axioms 1-9 (- 411 Symbolic Logic). These axioms do not contain any predicate symbol other than E, where x E y is read "x is an element of y." Any formula containing only E as a predicate symbol is called a set-theoretic for- mula. Following the usual convention, we omit the outermost universal quantifier, and use restricted quantifiers such as 3x E a, Vx E a. Axiom 1 (axiom of extensionality): VX(XEa= X Eb)-->a = b. This asserts that sets formed by the same elements are equal. The formula x E a(x E b) is denoted by a c b. This means "a is a subset of b." Then Axiom 1 can be expressed by ac b /\ b c a-->a== b. Axiom 2 (axiom of the unordered pair): 3XVY(YEX =y=a v y= b). This asserts the existence, for any sets a and b, of a set x having a and b as its only elements. This x is called the unordered pair of a and b and is denoted by {a, b}. The notion of ordered pair is characterized by (a, b)=(c, d)-=a =c /\ b =d, An example of such a set is (a, b) = { {a, a}, {a, b}}. Axiom 3 (axiom of the sum set): 3xVy(yEX= 3zEa(YEz». This asserts the existence for any set a of the sum (or union) x of an the sets that are ele- ments of a. This x is denoted by Ua or S(a). We write a U b for U {a, b} and a' for aU{a,a}. Axiom 4 (axiom of the power set): Vx 3y( YEX = VZE y(z Ea)). This asserts the existence for any set a of the power set x consisting of all the subsets of a. This x is denoted by P(a). We have S(P(a)) :::: a, so S is a left inverse of P, and the products SP and PS are idempotent. Axiom 5 (axiom of the empty set): :JXVY(IYEX). 33 B Axiomatic Set Theory This asserts the existence of the empty set. The empty set is denoted by 0 or O. Axiom 6 (axiom of infinity): :JX(OEX v VYEX(y' EX)). This asserts the existence of a set containing all the natural numbers, where 0, 1 = 0' = {O}, 2 = l' = {O, l}, 3 = 2' = {O, 1, 2}, T his defini- tion of natural number is due to von Neumann. Axiom 7 (axiom of separation): :JXVy(yEX =YEa/\ A(y)). This asserts that the existence for any set a and a formula A( y) of a set x consisting of all elements of a satisfying A( y). This x is denoted by i YEa A( y)}. This is rather a schema for an intlnite number of axioms, for there are an infinite number of A( y). This axiom, also called the axiom of comprehension or axiom of subsets, was introduced by Zermelo. For example, the set of all natural numbers is introduced by {xEaIVy(OEYA VZEY(Z'Ey)  XEY)}, where a is a set satisfying Axiom 6. The set of all natural numbers is denoted by (J) or N. Axiom 8 (axiom of replacement): 3xVYEa(3zA(y, z)--> 3ZE xA(y, z)). This asserts the existence for any set a of a set x such that for any y of a, if there exists a z satisfying A( y, z) then such z exists in x. If the relation A( y, z) determines a function, then the image of a set by the relation is included in a set, so by Axiom 7, the image is a set. This axiom was introduced by Fraenkel. Axiom 9 (axiom of regularity): :JxA(x)-->:Jx(A(x) /\ VYE x( I A(y))). This asserts that if there exists a set satisfying A(x), then there is a set x satisfying A(x) but every element y of x does not satisfy the prop- erty ,4(y). This axiom is also called the axiom of foundation. Its dual expression, called E- induction, is of course equivalent to Axiom 9. Axiom 9' (axiom of E-induction): VX(VyE xA (y)--> A(x»--> V xA(x). Using this we can show that IXEX, l (x E yAy E x), etc. If we assume the axiom of choice stated below, then this is equivalent to the nonexistence of an infinite descending seq uence Xn E ." EX2 EX l , If a model of set theory satisfies the axiom of regularity and has an infinite descending se- quence that is not in the model, then such a model is called a nonstandard model. 
33 c Axiomatic Set Theory Axiom 10 (axiom of choice): VXEa:!yA(x, y)......:!yVxE aA(x, y(x)), This asserts that if for any element x of a there is a set y such that A (x, y), then there is a choice function y for the formula, i.e., A(x, y(x» for all x in a. Usually a function is represented by its graph. A setfis called a function detlned on a if 'v'x'v'y«x,y)Ef -->xEa), 'v'xEa3y«x,y)Ef), 'v'xEa'v'y'v'z«X,y)E f A(X, Z)E f -->y=z). This formula is denoted by Fnc(f); then the formula A(x,f(x)) is an abbreviation of Fnc(f) 1\ :!y((x, Y)E f 1\ A(x, y)). The axiom of choice is equivalent to many properties, such as the well-ordering theorem, Zorn's lemma, and Tikhonov's theorem on the product of compact spaces, and it is used widely and essentially in mathematics. The system of axioms 1 to 9 is called Zerme1o-Fraenke1 set theory and is denoted by ZF; the system ZF minus the axiom of replace- ment is called Zerme10 set theory, denoted by Z; and the system ZF plus the axiom of choice is denoted by ZFC. The system Z is weaker than ZF. Indeed, the existence of the set W of all the natural num- bers and of P(ro), P(P(w)), can be proved in Z, but the existence of the set {w, p( w), P(P(w)), ) cannot be proved in Z. However, we can prove its existence in ZF. The theory ZF minus the axiom of intlnity is called general set theory. Its consistency can be reduced to the consistency of the theory of natural numbers as follows. Let n be any natural number and n = 2"' + 2 n2 + '" + 2"k(n 1 < n 2 < < n k ) be its binary expan- sion. We can define a model of general set theory by identifying a set with the natural number n and defining mEn by "m appears as one of the n i ." In general set theory we can show the existence of 0, prO), P(P(O)), but the existence of the set {O, P(O), P(P(O)),.) cannot be proved. However, we can prove its existence in Z. The set consisting of all heredi- tary finite sets is the smallest model of general set theory. C. Bernays-GOdel Set Theory The existence of {u I A(u)} for an arbitrary set- theoretic formula .4(u) cannot be deduced from the axioms of ZF. We call this object a class to distinguish it from sets. We introduce a generalized logical system of first-order predicate logic by adding to the logical system used in ZF class variables, class constants, and 148 inference rules with respect to quantifiers for classes. For any set-theoretic formula A(u) in which no tbound class variable occurs, we adopt .3XVu(uEXA(u)) as an axiom, where capital letters X are class variables. The set theory thus obtained is equivalent to Glidel set theory [11]. Yon Neumann axiomatized set theory by making use of the notion of functions instead of that of classes [30]. In refining this theory and intro- ducing the notion of classes, Bernays and Godel [2, IIJ initiated Bernays-Godel set theory, BG. ZF and BG are related as follows: Any formula provable in ZF is provable in BG, and any set-theoretic formula provable in BG and having neither class variable nor class constant is provable in ZF. In this sense, the systems can be regarded as essentially equivalent, but as BG has class variables and class constants, it is more convenient for expressing set- theoretic notions. Yon Neumann detlned the following func- tion R by ttransfinite induction: R(O) = 0, R(C1:)= U 'lJ(R(P), p<, where :J. and # are ordinal numbers and U p<, 'lJ(R(P)) denotes the set sum of'lJ(R(O)), 'lJ(R(I)), .... (R(P)), ... (p<C1:), The function R can be detlned by a set-theoretic formula, so it exists as a class, Now consider the model M =M(ex) for a fixed ordinal number i'J., We define sets of the model M as elements of R(i'J.), and classes of the model as subsets of R(i'J.). We denote the E relation of the model by EM' For classes X and Yof the model, we write XE, YpX E Y Then a necessary and sufficient condition for R(ex) to be a model of BG is that  is an tinaccessible ordinal number ( 3 12 Ordinal Numbers). The existence of an in- accessible ordinal number cannot be deduced from the axioms of ZF. There is a series of studies ofaxiomatization of set theory in which any number of inaccessible ordinal numbers is assumed to exist [23]. When R(ex) is a model ofBG (ZF), it is called a natural model of BG (ZF). Furthermore, if H is U, R(C1:) then H satisfies all the axioms of BG (ZF). As we do not need the axiom of regularity for defining the class H, we see that BG (ZF) is consistent as long as BG (ZF) without the axiom of regularity is consistent. D. Independence of the Continuum Hypothesis and the Axiom of Choice These axiomatizations of set theory motivated a series of studies from the standpoint of the 
149 tfoundations of mathematics on problems that remained unsolved after the appearance of Cantor's primitive set theory. Among these, the problem of the relation between the con- tinuum hypothesis and the axiom of choice was central. Consistency of the Axiom of Choice and the Continuum Hypothesis. G6del [IIJ proved that if ZF without the axiom of choice is con- sistent, then the system obtained by adding to ZF the axiom of choice and the continuum hypothesis is also consistent. To show this, he constructed a model of ZF satisfying the axiom of choice and the generalized con- tinuum hypothesis as follows. Assume first that M is an arbitrary domain of objects among which the E relation is detlned. By a formula on M, we understand a formula having con- stants of M as its only constants, having E as its sole predicate symbol, and further having exclusively variables whose ranges are re- stricted to M. Let us denote by M' the totality of subsets of M detlned by a formula A(x) on M. Now we put Mo = {0}, M>+l = M, Mp = U.<p M" if # is a tlimit ordinal number. We call x constructible if x E M, for some ordinal number ex, assumed to be less than the first inaccessible ordinal number, if any. We denote the totality of constructible sets by L, and the totality of sets of ZF by V. We call the asser- tion V= L, that is, every set is constructible, the axiom of constructibility. If we add this axiom to ZF, the axiom of choice and the generalized continuum hypothesis become provable. On the other hand, if we regard elements of L as sets of the model and the original E relation as the E relation of the model, we have a model of ZF in which the axiom of constructibility holds. Independence of the Axiom of Choice and tbe Continuum Hypothesis. Since the result of G6del, attempts have been made to prove that the axiom of choice is independent of the other axioms. Fraenkel constructed a model of set theory without satisfying the axiom of choice, starting from a countable number of objects that are not sets. A. Mostowski constructed in ZF a model of set theory having objects that are not sets, and he proved that the model satisfies the axiom: Every set can be t)inearIy ordered, but does not satisfy the axiom of choice [27]. E. Mendelson constructed a model of set theory that does not satisfy the axiom of choice by making use of an infinite descending chain a, '3 a 2 3 Q) 3.. [25]. These models, however, do not satisfy all the axioms of ZF or of Zermelo set theory minus the axiom of choice, even though they satisfy most 33E Axiomatic Set Theory of the axioms. Consequently, they are not sufficient for proving independence. P. J. Cohen [3,4] proved the following results in connection with the independence of the axiom of choice, the continuum hypo- thesis, and V == L: If ZF is consistent, each of the following conditions has a model. (1) The axiom of choice as well as the generalized continuum hypothesis holds, but there exists an a satisfying a rt L and a c w, (2) Ip(w) is not well-ordered. (3) The axiom of choice holds, but the continuum hypothesis does not hold. (4) 1p(Ip(w») cannot be linearly ordered. By (1), we see that V= L is independent of the axiom of choice and the generalized con- tinuum hypothesis: (2) shows the independence of the axiom of choice; and (3) shows that the continuum hypothesis is independent of the axiom of choice. In (4), I1J (Ip(w)) corresponds to the set F of all real-valued functions detlned on the interval [0, IJ, and (4) shows that the proposition "F can be linearly ordered" is not deducible in ZF without the axiom of choice. E. Some Recent Results Boo1ean- Valued Set Theory. D. Scott and R. M. Solovay defined models of set theory in which the set-theoretic formula has values in a complete Boolean algebra. This viewpoint is motivated by P. J. Cohen's original notions of the forcing relation and the generic filter. According to the relation between sets and their representing functions, a function f: A+ B corresponds to a subset of A when B is a complete Boolean algebra. S:J by transtlnite induction we put VB)= U p(B)(qB»), p<, where P(B)(A) is the Boolean-valued power set of A defined by uEP(B)(A)=u:A......B. Let V(B) be the union of all VB)'S; then the truth values of the formulas [u E v] and [u = v] are detlned by transtlnite induction as [UEV]= L v(x)'[x=u], xedom(v) [U == v] = IT u(x)=> [XEV]- TI v(x)=> [XEU], xedom(u) xEdom(v) where a  b is an abbreviation for the element -a+b inB. In V(B) the following properties are satistled: (i) The truth value of any formula provable from ZFC is 1, the largest element of B. (ii) Any complete homomorphism h: B, ......B 2 
33F Axiomatic Set Theory can be extended to h: V(B 1 )--> V(B,), so that the commutative relation h([ A(u l , ..., un)]) == [A(h(utJ, ..., h(u n ))] is satisfied for every restricted formula; further, if h is an epimorphism then it is satisfied for every formula. (iii) (maximum principle) For any formula A(x) there is an element u of V(B) such that [3xA(x)] = [A(u)]. Moreover, if [3xR(x)] = 1 then there an ele- ment u of V(B\ such that [R(u)] = 1 and [3x(R(x)A A(x)] = [A(u)]. Property (i) means that if [A] < 1, then A is not provable from ZFC. Many consistency results in set theory are obtained in this way by constructing special partial order struc- tures, topological spaces, or complete Boolean algebras. Axiom of Strong Infinity. The axiom of infin- ity, Axiom 6, of ZF asserts that there exists a set including the set (J) of all natural numbers. The set (J) is the smallest infinite cardinal num- ber. In general, the axiom of strong intlnity is stated in a form such that certain special prop- erties of w are satistled for an uncountable cardinal number. Though such an axiom asserts the existence of large cardinal numbers, their properties sometimes reflect, for example, those of the real numbers, or of the set of real numbers, etc. We state here some typical examples: (i) Weakly inacce;sib1e cardinal number. This is a regular limit cardinal number, that is, cf«(J),)=w., V{J<ex«(J)P+1 <w.). (ii) Strongly inacce;sib1e cardinal number. cf((J).) = W" v fJ < ex(2"'p < w,), (iii) Weakly compact cardinal number. w. is uncountable and the space X = 2"', with < (J).- topology is w,-compact. (iv) Measurable cardinal number. (J). is un- countable and there is a nonprincipal < (J),- additive 2-valued measure J1: P«(J),)-->{O, 1). (v) Strongly compact cardinal number. w. is uncountable and any product of 2 = j 0, 1 ) with < w,-topology is w,-compact. The strength of these properties increases in the order presented here. Properties (i) to (iii) are compatible with the axiom of constructi- bility V = L. And (iv) implies V# L while it is compatible with V == L(I1). But (v) implies V f- L(a) for any set a. There are many other stronger axioms of 150 infinity, and the consistency proofs of these "axioms" within ZF or ZFC would involve essential difficulties. Inconsistency proofs of these properties, if they exist, would be very interesting. F. Examples of Results (1) Cardinality and Cofinality ( 3 12 Ordinal Numbers). (i) (W. B. Easton [6J) Let WI be a model of ZFC (the Zermelo-Frankel axioms plus the axiom of choice) in which the GCH (generahzed continuum hypothesis) is valid, i.e., Vrx(2"" = Wa+l), and let g be a function from ordinals to ordinals in Wl such that Va, [3(cx<fJ = (J)g(,).s;(J)g(P») and Vex«(J),<cf((J)q("»))' Then there is a Boolean model 91 of ZFC, 91 =:> \JJ1, having the same co finality and satisfying 2"', =w 9 (') for every regular cardinal. (This means that Konig's condition (cf(2"',) > (J),) is the only restriction on the cardinality of powers of regular (J),.) (ii) (1. H. Silver [39J) Suppose W < cf(rx) = cf((J),) < W,. Then for any). < cf(rx) Vv < rx(2"'v.s; w v + ,,J--> 2"".s; wa+.<' However, the validity of the implication Vn < (J)(2"'n = w n + 1 )-->2"'''' = (J)"'+I still remains an open question. And it is known that if there is a model for Vn < (J)(2"'n = (J)n+1), 2"''''> (J)",+1> then there is a model in which there are many measurable cardinal numbers. (iii) (K. Prikry [31]) Let w, be a measurable cardinal number in a model M of ZFC. Then there is a Boolean extension N of M in which the notion of cardinality is not changed but the notion of cofinality is changed, that is cf((J).) = w. in M but cf((J),) = W in N. (iv) (Solovay) Let (J). be a strongly compact cardinal number. Then for any strong-limit singular cardinal w p > w, the continuum hypo- thesis holds at w p ; that is, the implication cf«(J)p) < w p , Vy < fJ(2"'Y < (J)p)-->2"'p = (J)P+l is provable. (2) Lebe;gue Measurability and the Baire Prop- erty. As is well known, every J (Borel) set (and consequently every It' (analytic) set) of real numbers is Lebesgue measurable and has the Baire property. (i) (Godel) V= L implies the existence of a D. set of real numbers that is neither Lebesgue measurable nor has the Baire property. 
151 Let DC denote the principle of depending choice: \lxEa3YEaP(x,y) ......3f: w......a\ln EWP(f(n),j(n + I». DC is adequate for the development of the classical notions of measure theory, such as Jordan decomposition, the Radon-Nikodym derivate, etc. Let I denote the hypothesis 3o:(cf(w,) = W, A \I{3 < a(2"'p < W,» (strongly inacce;sib1e ). (ii) (Solovay [4OJ) The consistency of ZFC and 1 implies that of ZF and either of the following two axioms: (a) DC plus the hypothesis that every set of real numbers is Lebesgue measurable and has the Baire property. (b) The axiom of choice plus the hypothesis that every set of real numbers definable by an (J) sequence of ordinals is Lebesgue measurable and has the Baire property. Axiom (a) means that DC is not strong enough to construct a (Lebesgue) nonmeasur- able set, while Axiom (b) implies that every projective set is Lebesgue measurable and has the Baire property, and hence implies that the set of all constructible real numbers, as a Ii set, has Lebesgue measure 0, and is of the first category. (3) Martin's Axiom. Let B be a Boolean algebra. We say that B satisfies w. c.c. (chain condition) if the cardinality of every disjoint family of positive elements of B is at most w,. Let B* denote the topological space consisting of all homomorphisms h: B-->2 = {O, I} with the open base U(a) = {h I h(a)= 1 }(aEB); then B* is a tBaire space. Then Martin's axiom (MA) is: Let B be an (J) c.c. Boolean algebra and a < 2". Then the intersection of a dense open sets is dense in B*. Since {h Ivh(av) = h(b)} is dense and open in B* if L'va, = b, MA means the existence of an h E B* preserv- ing any given set of ex ( < 2") equations in B, If 2" = w]' MA merely reduces to the Baire prop- erty of B*. However, if 2" > w], then the W c.c. hypothesis is essential, for there exists a B satisfying (J)I c.c. such that B* contains w] dense open sets with empty intersection. (i) (Solovay and S. Tennenbaum [43J) The consistency of ZF implies that of ZFC, MA, and 2"'>w]. (ii) (D. A. Martin and Solovay [24J) ZFC, MA, and 2" > WI imply the propositions: (a) \lw, < 2"'(2"" = 2<J), hence 2" = 2"'1; (b) the totality of the first category sets of Lebesgue measure zero sets is a-additive for any a < 2"; (c) every L't set of real numbers is Lebesgue measurable and has the Baire property. 33 F Axiomatic Set Theory (4) Suslin's Hypothesis (SH) is: Every dense, linear, order complete set without end points, having at most w disjoint intervals, is order isomorphic to the continuum of real numbers. (i) (T. J. Jech, Tennenbaum) The consistency of ZF implies that of (a) ZFC, l SH, and GCH, as well as (b) ZFC, l SH, and 2" > WI (ii) (R. Jensen [15J) The consistency of ZF implies that of ZFC, SH, and 2" = Wi' (iii) (Solovay and Tennenbaum [43J) ZFC, MA, and 2" > w] imply SH. (iv) (Jensen [15J) V=L implies ISH. (5) Measurah1e and Real-Valued Measurah1e Cardinals. A cardinal K > (J) is said to be measurah1e if there is a measure J1: (K)--> {O, I} with (a) J1(K) = 1, (b) VV<K(J1({V})=O) and (c) I1(Lv<aA,)=Lv<aJ1(Av)and K is said to be real-valued measurah1e if there exists a p:(K) ---; [0, IJ that satifies (a), (b) and (c), and is not measurable. Let MC (RMC) denote the existence of a measurable cardinal (real-valued measurable cardinal). (i) (S. Ulam [45J) The existence of an (J)- additive measure J1: (A)......[O, IJ, with I1(A) == 1 and \lxEA(J1({x})==O), implies RMC orMC; RMC implies the existence of an extension of Lebesgue measure defined on ( [0, IJ); every real-valued measurable cardinal is  2'" and weakly inaccessible. (ii) Every measurable cardinal is strongly (hyper) inaccessible, and (a) 3aF(a)---;3a < w] F(a) for any IJ formula F(a) on the ordinal numbers, (b) 3aF(a)......3a <J11 F(a) for any II! formula F(a) and the smallest measurable J1I' (Many results have been obtained concerning the ordinal magnitude of w] and the measur- able cardinals.) (iii) (Solovay [41J) The consistency of ZFC and MC is equivalent to that of ZFC and RMC. (iv) (Martin and Solovay [24J) ZFC, RMC, and MA are not consistent. (v) (Levy and Solovay [21 J) The consistency of ZFC and MC implies that of ZFC, MC, MA, and 2'" > w2' (vi) (Solovay) ZFC and MC imply that every Ii set of real numbers is Lebesgue measurable. (vii) (Martin, Solovay) ZFC, MC, MA, and 2" > w 2 imply that every I set of real num- bers is Lebesgue measurable and has the Baire property. (viii) (Silver [38J) The consistency of ZFC and MC implies that of ZFC and MC as well as the existence of (Lebesgue) non-measurable  sets of real numbers. (6) Axiom of Determinateness. We consider the following inl1nite game. Let A be a set consist- 
33 Ref. Axiomatic Set Theory ing of functions w to w. The players I and II choose alternatively natural numbers n j and m i , and, for the resulting inl1nite sequence, if (n 1 ,m l .n Z , ..., nk,m k , ...)EA, then 1 wins, otherwise II wins. A functi o n 11' U W n --> W is called a . new strategy. If II plays according to the sequence r11=(m 1 ,m Z ,.... mk, .) and I plays according to the strategy 11, then the resulting sequence is (]*I!! ==((J(0), ml, (J(md, ..., mk, (J(mj, ..., md, ...). The strategy (J is called a winning strategy for I of the game defined above, if for any play r11 of II, the resulting play (J*r11 is in the setA, and similarly for the winning strategy of II. The axiom of determinateness AD is the assertion that for any A c w W , the player I or II has a winning strategy. (i) (J. Mycielski [28J) Assume ZF and AD; then we have: (a) axiom of choice for countable sequence of sets of real numbers; (b) every set of real numbers is Lebesgue measurable and has the Baire property, so the axiom of choice does not hold; (c) every uncountable set of real numbers contains a perfect set. (ii) (Solovay) Assume ZF and AD; then we have: (a) cardinalities of W 1 and 2" are not comparable; (b) 0)1 and W 2 are measurable cardinals, but W 3 is a singular cardinal such that cf(W3) = (1)2' (iii) (Martin) In ZFC, every Borel, namely ;. game is detennined. (iv) (Martin, Solovay) In ZFC: (a) if there exists a measurable cardinal, then every analytic, namely IIi, game is detennined; (b) if every i-game is determined, then there is a model of ZFC with many measurable cardinal numbers. Since the axiom of determinateness AD is very strong, there is some suspicion that it might contradict ZF. But at present there is no such evidence References [IJ J. Bell, Boolean-valued models and inde- pendence proofs in set theory, Clarendon Press, 1977. [2J P. Bernays, A system of axiomatic set theory I, J. Symbolic Logic, 2 (1937), 65577. [3J P. J. Cohen, The independence of the continuum hypothesis I, II, Proc. Nat. Acad. Sci. US, 50(1963), 1143-1148; 51 (1964),105- 110. 152 [4J P. J. Cohen, Set theory and the continuum hypothesis, Benjamin, 1966. [5J F. R. Drake, Set theory, North-Holland, 1974. [6J W. B. Easton, Power of regular cardinals, Ann. Math. Logic, 1 (1970), 1399178. [7J G. Foder, On stationary sets and regres- sive functions, Acta Sci. Math., 27 (1966), 1055 110. [S] A. Fraenkel, Zu den Grundlagen der Cantor-Zermeloschen Mengenlehre, Math. Ann., (1922), 230-237. [9J H. Gaifman, Uniform extension operators for models and their applications, Sets, Models and Recursion Theory, North-Holland, 1967, 1222155. [ IOJ F. Galvin and A. Haynal, Inequalities for cardinal powers, Ann. Math., (2) 101 (1975), 491-498. [ 11 J K. Godel, The consistency of the axiom of choice and of the generalized continuum hypothesis with the axioms of set theory, Ann. Math. Studies, Princeton Univ. Press, 1940. [12J K. G6del, What is Cantor's continuum problem?, Amer. Math. Monthly, 54 (1947), 5155525. [ 133 T. Jech, Non-provability of Suslin's hypo- thesis, Comm. Math. Univ. Carolinae, 8 (1972), 2299308. [ 14J T. Jech, Set theory, Academic Press, 1978. [ 15J R. B. Jensen, The fine structure of the constructible hierarchy, Ann. Math. Logic, 4 (1972),2299308. [ 16J A. Kanamori and M. Magidor, The evo- lution of large cardinal axioms in set theory, Lecture notes in math. 669, Springer, 1978, 99-275. [ 17J H. J. Keisler and A. Tarski, From acces- sible to inaccessible cardinals, Fund. Math., 53 (1965),2255308; 57 (1965),119. [ 18J K. Kunen, Inaccessibility properties of cardinals, Thesis, Stanford Univ. [19J K. Kunen and J. B. Paris, Boolean exten- sions and measurable cardinals, Ann. Math. Logic, 2 (1971), 3599378. [20J A. Levy, Basic set theory, Perspectives in mathematical logic, Springer, 1979. [21J A. Levy, and R. M. Solovay, Measurable cardinals and the continuum hypothesis, Israel J. Math., 5 (1967),2344248. [22J M. Magidor, How large is the first strongly compact cardinal?, Ann. Math. Logic, 10 (1976),33-57. [23J P. Mahlo, Zur Theorie und Anwendung der po-Zahlen, Ber. Verh. K6niglish Siich. Ges. Wiss. Leipzig. Math. Phys. KI.,63 (1912), 108- m; 65 (1913), 2688282. [24J D. A. Martin and R. M. Solovay, Internal Cohen extensions, Ann. Math. Logic, 2 (1970), 143-178. [25J E. Mendelson, The independence of weak 
153 axiom of choice, J. Symbolic Logic, 21 (1956), 350-366. [26J Y. N. Moschovakis, Determinacy and prewellordering of the continuum, Mathe- maticallogic and foundations of set theory, North-Holland, 1970, 24462. [27] A. Mostowski, aber die Unabhangigkeit des Wohlordnungssatzes vom Ordnungsprin- zip, Fund. Math., 32 (1939),201-252. [28] J. Mycielski, On the axiom of determi- nateness, Fund. Math., 53 (1964),205-224, [29] K. Namba, An axiom of strong infinity and analytic hierarchy of ordinal numbers, Proc. Symp. Pure Math., 13 (1971),279-319. [30J J. v. Neumann, Die Axiomtisierung der Mengenlehre, Math. Z., 27 (1928),6699752. [31J K. L. Prikry, Changing measurable into accessible cardinals, Diss. Math., 68 (1970),5- 52. [32] J. B. Rosser, Boolean-valued models for set theory, Academic Press, 1969. [33] G. E. Sacks, Forcing with perfect closed sets, Proc. Symp. Pure Math., 13 (1971),331- 355. [34J D. S. Scott, Measurable cardinals and constructible sets, Bull. Acad. Polon. Sci., Ser. Math. Astr. Phys., 7 (1961),145-149. [35J D. S. Scott and R. M. Solovay, Boolean- valued models for set theory, Lecture notes, Summer Institute on Axiomatic Set Theory, UCLA, 1967. [36] S. Shelah, A compactness theorem for singular cardinals, free algebras, Whitehead problem and transversals, Israel J. Math., 21 (1975), 319-349. [37J J. H. Silver, Some applications of model theory in set theory, Ann. Math. Logic, 3 (1971),45-110. [38J J. H. Silver, The consistency of the GCH with the existence of a measurable cardinal, Proc. Symp. Pure Math., 13 (1971),391-396. [39J J. H. Silver, On the singular cardinal problem, Proc. Int. Congr. Math., Vancouver, 1 (1974), 2655286. [40]R. M. Solovay, A model of set theory in which every set of reals is Lebesgue measur- able, Ann. Math., (2) 92 (1970), I-56. [41J R. M. Solovay, Real-valued measurable cardinals, Proc. Symp. Pure Math., 13 (1971), 4977428. [42J R. M. Solovay, Strongly compact car- dinal and the G.C.H., Proc. Symp. Pure Math., 25 (1974),3655372. [43J R. M. Solovay and S. Tennenbaum, Iterated Cohen extensions and Suslin's prob- lem, Ann. Math., (2) 94 (1971),201-245. [44] G. Takeuti and W. B. Zaring, Axiomatic set theory, GTM 8, Springer, 1973. [45J S. U1am, Zur Masstheorie in der al- gemainen Mengenlehre, Fund. Math., 16 (1930), 140-150. 34 B Axiom of Choice and Equivalents [46J P. Vopenka and K. Hrbacek, On strongly measurable cardinals, Bull. Acad. Polon. Sci., Ser. Sci. Math. Astr. Phys., 14 (1966), 587-591. [47J E. Zermelo, Untersuchungen iiber die Grundlagen der Mengenlehre I, Math. Ann., 65 (1908),261-281. 34 (11.5) Axiom of Choice and Equivalents A. The Axiom of Choice In set theory, the following axiom is known as the axiom of choice: For any nonempty family '!( of nonempty subsets of a set X, there exists a single-valued function f, called the choice function, whose domain is 21, such that for every element A of '!( the value f(A) is a mem- ber of A. This axiom is equivalent to each of the following three propositions: (1) If {A,}", is a family of sets not containing the empty set and with an index set A, then the Cartesian product I1;, A, is not the empty set. (2) If a set A is a disjoint union U 1 A, of a family of subsets {A,} '" which does not contain the empty set, there exists a subset B, called the choice set, of A such that every intersection of B and A;.(AEA) contains one and only one element. (3) For every mapping f from a setA onto a set B, there is a mapping from B to A such that fo g = 1, (identity mapping). Also equivalent to the axiom of choice are the well- ordering theorem and Zorn's lemma, which are discussed in the following sections. B. The Well-Ordering Theorem In 1904, E. Zermelo [2] first stated the axiom of choice and used it for his proof of the well- ordering theorem, which says that every set can be twell-ordered by an appropriate tordering. Conversely, the well-ordering theorem implies the axiom of choice. Many important results in set theory can be obtained by using the axiom of choice, for example, that tcardinal numbers are comparable, or that various definitions of the tlniteness or infiniteness of sets are equivalent. Various important theorems outside of set theory, e.g., the exis- tence of bases in a tlinear space, tcompactness of the direct product of compact ttopological spaces (tTikhonov's theorem), the existence of a subset which is not tLebesgue measurable in Euclidean space, etc., are proved using the axiom of choice. But for those proofs the well- ordering theorem or Zorn's lemma (stated 
34 c Axiom of Choice and Equivalents below) are used more often than the axiom of choice. Using the well-ordering theorem, it can be proved in the following manner that for every field k, any linear space X over k has a basis. Let {XV}'EA be an enumeration of X. By ttrans- finite induction, we can define the function f from X to B = {O, I} such that (i) if Xo = 0, f(x o ) = 0; if Xo of- 0, f(x o ) = 1; (ii) for v > 0, if Xv is expressed as a linear combination of ele- ments of {xIf1<vJ(x)= 1},.f(x,)=O; other- wise f(x,) = 1. Then U={xEXlf(x)= 1} is a basis of X. C. Zorn's Lemma An ordered set X is called an inductively ordered set if every ttotally ordered subset of X has an tupper bound. A condition C for sets is called a condition of finite character if a set X satisfies C if and only if every finite subset of X satisfies C. A condition C for functions is called a condition of finite character if C is a condition of finite character for the graph of the function. Zorn's lemma [4J can be stated in anyone of the following ways, which are all equivalent to the axiom of choice. It is often more convenient to use than the axiom of choice or the well-ordering theorem. (1) Every inductively ordered set has at least one maximal element. (2) If every well-ordered subset of an ordered set M has an upper bound, then there is at least one maximal element in M. (3) Every ordered set A4 has a well-ordered subset W such that every upper bound of M belongs to W. (4) For a condition C of finite character for sets, every set X has a maximal (for the rela- tion of the inclusion) subset of X that satis- fies C. (5) Let C be a condition of finite character for functions from X to y. Ther., in the set of functions that satisfy C, there is a function whose domain is maximal (for the relation of the inclusion). Using Zorn's lemma (1), we can prove again in the following way any linear space X over a field k has a basis. Let IJ( be the set of all non- empty subsets A of X such that arbitrary finite subsets of A are linearly independent over k. IJ( is not empty. If we order IJ( by the relation of inclusion, then IJ( is an inductively ordered set. By Zorn's lemma (1), there is a maximal ele- ment U of 21. Since U is maximal, U is a basis of X. The same theorem is proved as follows using Zorn's lemma (4). Condition C for the subset A of X, that arbitrary finite subsets of A are linearly independent over k, is a condition 154 of finite character. Hence there is a maximal subset U that satistles C and is a basis of X. Concerning recent developments regarding the axiomatic basis for the axiom of choice -J, 33 Axiomatic Set Theory D. References [1] G. Cantor, Uber unendliche, lineare Punkt- mannigfaltigkeiten, Math. Ann., 21 (1883), 545-591. (Gesammelte Abhandlungen, Sprin- ger, 1932.) [2J E. Zermelo, Beweis, dass jede Menge wohl- geordnet werden kann, Math. Ann., 59 (1904), 5144516. [3J E. Zermelo, Neuer Beweis fUr die Moglich- keit einer Wohlordnung, Math. Ann., 65 (1908),1077128. [4] M. Zorn, A remark on method in trans- finite algebra, Bull. Amer. Math. Soc., 41 (1935), 6677670. [5] J. W. Tukey, Convergence and uniformity in topology, Ann. Math. Studies, Princeton Univ. Press, 1940. [6] N. Bourbaki, Elements de mathematique I, Theorie des ensembles, ch. 3, Actualites Sci. Ind., 1243, Hermann, second edition, 1966; English translation, Theory of sets, Addison- Wesley, 1968. 35 (1.2) Axiom Systems A. History A mathematical theory is based on a specific system of axioms, i.e., a system of hypotheses from which the whole theory is deduced with- out reliance on other assumptions. One of the first deductive methods of math- ematical reasoning was utilized by Thales, who returned to Greece from Egypt with the knowl- edge of surveying methods, and who deduced additional results from that empirical knowl- edge. His method gave impetus to the devel- opment of Greek geometry, which flowered with the Pythagorean school and research by members of Plato's Academy. In the course of this development, the deductive method led to the idea of constructing the whole theory upon a system of "absolutely obvious" statements from which the whole theory could be de- duced. Euclid systematized Greek geometry in his Elements utilizing this idea. His work became the basis of geometry after the Re- naissance, and Greek geometry came to be called tEuclidean geometry. In the Elements 
155 Euclid called the basic obvious statements common notions when they were of general nature, and postulates when they were specifi- cally geometric. Both were later called axioms (or postu1ate;). Among the axioms stated by Euclid, the "Ilfth postulate" concerning parallels was longer and more complicated than the other axioms. Many efforts were made to deduce this particular axiom from the other axioms. The failure of these attempts suggested the possibility of establishing a tnon-Euclidean geometry, which was actually done by N. 1. Lobachevskii and J. Bolyai, who replaced the fifth postulate by its negation and showed that the new system of "axioms" was as valid as the classical one. This development naturally led to a new evaluation of the idea of axioms, and eventually the traditional concept of recogniz- ing the axioms as obvious truths was replaced by the understanding that they are hypotheses for a theory. D. Hilbert [ I] established the latter idea as axiomatization and claimed that the whole science of mathematics should be built upon a system of axioms. His idea be- came the foundation of present-day mathe- matics. Hilbert reorganized classical geometry based upon his idea and published his result in Foundations of Geometry (1899). B. Systems of Axioms The system of axioms of a theory, i,e., the system of basic hypotheses from which we hope to deduce the whole theory, is written in undefined terms (or in tenns of undefined con- cepts) by means of which all other terms are defined. On the other hand, a given theory is axiomatized by specifying such a s ys tern of axioms upon which the theory may be re- organized. It should also be noted that a sys- tem of axioms determines a tstructure ( 409 Structures ). A system of axioms is considered to be mathematically valid if and only if it is consist- ent. It is also desirable that the axioms in such a system be mutually independent (i.e,; the negation of anyone of the axioms is still con- sistent with the others). When such a system is not independent, it can be simplified by delet- ing redundant axioms from it. When any two models of a system of axioms are isomorphic to each other, we call the sys- tem complete or categorical (  409 Struc- tures). For example, the system of axioms (1)- (V) postulated by Hilbert as the foundation for Euclidean geometry is complete ( 155 Foundations of Geometry), whereas the sys- tems of axioms for the theories of tgroups, trings, or tfields are not complete since there 35 Ref. Axiom Systems are nonisomorphic groups, etc. Although it is desirable that the systems of axioms postu- lated for a given theory (e.g., the ttheory of real numbers, or Euclidean geometry) be complete, the study of partial systems that may not be complete is also important ( 156 Founda- tions of Mathematics). References [lJ D. Hilbert, Axiomatisches Denken, Math. Ann., 78 (1918),405-415, (Gesammelte Ab- handlungen III, Springer, 1935, 1466156.) [2] N. Bourbaki, Elements de mathematique, Theorie des ensembles, ch. 1-4 and fascicule de resultats, Actualites Sci. Ind., 1212c,1243b, 1258a, 114Id, Hermann, 1960, 1967, 1966, 1964. 
36 A Banach A1gehras 36 (XII.1 7) Banach Algebras A. Definition [1-4J A real or complex tBanach space R is called a Banach algebra (or normed ring) if a multiplica- tion law between elements of R is introduced that makes R an tassociative algebra and if Ilxyll  Ilxll'll YII is satisfied, The complex case is generally easier to handle, and a real Banach algebra can always be embedded in a complex one isomorphically and isometrically. When a Banach algebra contains a unity element e with respect to the multiplication, i.e., it is unital, we can suppose Ilell = 1. If it is not uni- tal, we can adjoin a unity element to it. B. Examples 1. Let C o (9R) be the set of complex - (real-) valued continuous functions x() on a locally compact space 9R that vanish at intlnity. Endowed with the pointwise operations and the tsupremum norm (- 168 Function Spaces), it forms a complex (real) Banach algebra. In the complex case, it has the invo- lution x*(()=x@. 2. Let X be a Banach space. The set (X) of tbounded linear operators on X forms a Banach algebra if we define addition, multi- plication by scalars, multiplication between elements, and the norm in the usual fashion. 3. Let G be a locally compact Hausdorff group (- 423 Topological Groups) and f.l be its tleft-invariant Haar measure. The Banach space L,(G) (with respect to /1) can be made into a Banach algebra by defining xy(g) = J x(h)y(h- I g)df1(h) (- Section L). 4. Every subalgebra of a Banach algebra R, which is dosed in the norm topology, is a Banach algebra with respect to the original algebraic operations and norm and is called a closed sub algebra of R Throughout this article, all Banach algebras are supposed to be complex Banach algebras unless otherwise specified. C. Spectrum of an Element We define a new operation X' Y in R by setting x' y = x + Y - xy. If x' y = y' x = 0, Y is called a quasi-inverse of x. When a unity element e exists, y is a quasi-inverse of x if and only if e-y is an inverse of e x. For a complex number A such that j),1 > II x I, we see that 158 ic -1 X possesses a quasi-inverse y given by the strongly convergent series y= - L'%I ), -n xn, The set of all complex numbers ), such that ..1. -I X does not have a quasi-inverse is called the spectrum of x and is denoted by Sp(x). (When x itself does not have an inverse, in particular, when R does not contain a unity element, we include 0 in this set.) Sp(x) is a bounded closed set in the complex plane, and sup{ llel AESp(x)} =limnOJ Ilxnll l / n . In Exam- ple 2, this spectrum is the spectrum of the operator x, and the problem of determining the behavior of this spectrum constitutes one of the central problems in the theories of Banach and Hilbert spaces. D. Representations of a General Banach Algebra [ 1,2J We understand by a representation of a Banach algebra R on a Banach space X any algebraic homomorphism IT of R into the alge- bra (X) (see Example 2) satisfying Iln(x)ll.s; Ilxll for all xER. We call X the representation space. A Banach algebra always possesses an isomorphic and isometric representation. Especially important are the irreducible rep- resentations. A vector subspace (closed or not) Y of X is an invariant subspace if n(x) Y c Y for any XE R. A representation is called alge- braically irreducible if the invariant subspaces are trivial, i.e" they are only {O} or X. A repre- sentation is said to be topologically irreduc- ible if closed invariant subspaces are trivial. The tkernel of an algebraically irreducible representation is called a primitive ideal, which can alternatively be detlned in the following way: A left ideal J ( of- {O}, R) is regular, by definition, if R contains an element u, a unity element modulo J, such that x - xu E J for any XE R. Such an ideal is always contained in a maximal left ideal, which in turn is necessarily regular and closed. A two-sided ideal I is primitive if it is the set of elements a in R for which aR c J, where J is some fixed regular maximal left ideal. If R is commutative, a primitive ideal is a regular maximal ideal, and conversely. The intersection of all primitive ideals is called the radical of R, and when it is {O}, R is called semisimp1e. The set of primitive ideals .J is known as the structure space of R, in which the hull-kernel topology (or Jacobson topology) is introduced. The tclosure of a set III in .J is, under this topology, the set of primitive ideals containing the intersection of the ideals in 21. This topol- ogy is rather intractable; even in commutative cases, it does not coincide with the Gel'fand topology ( Section E), in general. 
159 E. The Gel'fand Representation of a Commutative Banach Algebra A complex Banach algebra is a field if and only if it coincides with the field C of complex numbers (Gel'fand-Mazur theorem). This is the most fundamental fact in the study of com- mutative Banach algebras. Now let R be a commutative Banach algebra. Then every regular maximal ideal J of R is closed and the tquotient algebra R/J is isomorphic to C, so that the quotient mapping R......R/J is viewed as a complex-valued homomorphism of R. Conversely, if If! is a nonzero complex-valued homomorphism of R, it has norm  1 as a linear functional on R and its kernel, say R"" is a regular maximal ideal of R such that rp is exactly the quotient mapping R ---; R/ R",. The set %II(R) of nonzero complex-valued homo- morphisms (or regular maximal ideals) of R endowed with the tweak* topology of the Banach space tdual R' of R is called the maxi- mal ideal space of R. W/(R) is a locally com- pact Hausdorff space, and its topology is called the Gel'fand topology. For each XE R we define a function x on Wl(R) by setting x(tp) = tp(x), Then the mapping x......x is a homo- morphism of R into the algebra Co(Wl(R)) of a\1 continuous complex-valued functions on \JJl(R) vanishing at infinity. This is the Gel'fand representation of R whose image, the Gel'fand transform of R, is denoted by R. Concerning this, we have (1) Wl(R) is compact if R has a unity element; (2) Sp(x) equals the closure of the range x(W/(R)) of x; (3) the representation x-->x is norm-decreasing if Co(Wl(R» is en- dowed with the tsupremum norm; (4) Ilxllx. (=sup{ Ix(<p)11 q1E\JJl(R)}) equals limnoo Ilxnll l / n . The tkernel of the Gel'fand representation of R is the radical of R, which consists of gen- eralized nilpotent elements of R. F. Banach Star Algebras An involution in a Banach algebra R is an operation x-->x* that satisfies (1) (x + y)* =x* ty*; (2) (AX)* == AX*; (3) (xy)* == y*x*; (4) (x*)* = x, A Banach algebra with an involution is called a Banach *-algebra. A *-homomorphism <I> between two Banach *-algebras is an alge- braic homomorphism which preserves invo- lutions, i.e., cI>(x*) = cI>(x)*. To represent a Banach *-algebra we prefer a *-representation, i.e., a representation x-t Tx on a Hilbert space such that Tx' is equal to the adjoint Tx* of Tx for any xER, G. C*-A1gebras [5-10J A Banach *-algebra A satisfying Ilx*xll == IIxl1 2 for all x E A is called a C*-algebra. Every C*- 36H Banach Algebras llgebra is *-isomorphic and -isometric to a anach algebra of operators on a Hilbert pace (see Example 2) that contains, along ",ith an operator, its adjoint (the Gel'fand- Va'imark theorem). A C* -algebra is semi- :imple, and a commutative C*-algebra A is ,-isomorphic to Co(\JJl(A)) under the Gel'fand 'epresentation. A topologically irreducible *- 'epresentation of a C*-algebra is also algebrai- :ally irreducible (R. V. Kadison), and the set of unitary equivalence classes of these irreduc- ble *-representations is called the dual space. It becomes a topological space if we introduce .he hu\1-kernel topology inherited from the structure space, but other topologies are also mtroduced. Moreover, for the study of separ- lble C*-algebras, Borel structure in the sense )f G. W. Mackey is a very powerful tool. C*- algebras also have intimate connections with the theory of tunitary representations of a topological group (see below) and with quan- tum physics. Many works have been published on *-representations, dual spaces, etc. A linear functional (1) on a C*-algebra A is said to be positive if w(x* x)  0 for any x E A. For any nonzero positive linear functional w of A, there exist a *-representation (11"" H",) of A on a Hilbert space If, and a vector '" E H", such that w(a) =(11",(a)"" "') for all aE A and that the subspace {11",( a)", I a E A} is dense in HW' We call (11"" Hw) a cyclic representation of A induced by uJ. A *-representation (n, H) of A is called universal if for any *-representation (p, K) of A there exists a tIT-weakly continuous *-homomorphism p of n(A)" onto p(A)" such that p(a) = (j) 0 11) (a) for all a E A, where n(A)" is the double tcommutant of 11(A) and so is a tvon Neumann algebra ( 308 Operator Algebras). Yon Neumann algebras 11(A)" for universal representations 11 of A are mutually *-isomorphic, so that they determine the envel- oping von Neumann algebra of A. Especially, the direct sum 11 of all cyclic representations 1rw is a universal representation and there is a unique isometric isomorphism of n(A)" onto the second dual A ** of A that is bicontinu- ous in the o-weak topology of n(A)" and the a(A**, A*)-topology of A **. So A ** is identi- fied with the enveloping von Neumann algebra of A, H. Some Classes of C*-A1gebras Let A be a C*-algebra. If is a CCR (liminal C*-) algebra if it is mapped to the algebra of compact operators under any irreducible *- representation. It is a GCR (post1iminal C*-) a1gehra if every nonzero quotient C*-algebra of A has a nonzero CCR closed two-sided ideal. These classes of C*-algebras have inter- 
361 Banach Algebras esting properties. For example, the dual space of a separable C*-algebra is a tT o space (tT I space) if and only if the algebra is GCR (CCR) (J. Glimm). A is called an AF (approximately finite) algebra if it is the uniform closure of an in- creasing sequence of finite-dimensional C*- algebras (0. Bratteli). AF algebras are classi- fied by so-called dimension groups. To see this, we call an ordered Abelian group G (written additively) a Riesz group if (a) for any integer n>O and any gEG, ngO implies gO and (b) for any g;, h j (I .s;i.s;m, l.s;j«n) in G with gi  h j for all i and j, there exists kEG with gi  k  h j for all i, j. Then the isomorphism classes of AF algebras correspond bijectively to the isomorphism classes, as local semi- groups, of generating upward-directed heredi- tary subsets of the positive cones of countable Riesz groups, and a dimension group, defined otherwise, is exactly a countable Riesz group (G. Elliot, E. G. Effi'os, D. E. Handelman and C. L. Shen) ( [II]). Now consider two C*-algebras A and B. The algebraic ttensor product A (8) B over C becomes a *-algebra in the natural way. A norm fJ on A (2) B is a C*-crossnorm ifllxYllp  Ilxll p Ilyil p and Ilx*xll p = Ilxll for x, yEA (8) B and Ila0bllp Ilallllbll for aEA, bEB. The completion of A (2) B under such a norm p is a C*-algebra, which is denoted by A 0 p B. There are two special C*-crossnorms on A (2) i?: 11'llmin and 11'llmax. The former is called the spatial (minimal, injective) C*-crossnorm and is defined by Ilxll min = sup"",!, 11(<p 01jJ)(x)ll, where ({! and IjJ run over all *-representations of A and B, respectively. The algebra A (8)min B is called the spatial tensor product of A and B. The latter is called the greatest C*-crossnorm and is defined by Ilxllmax=SUPT IIT(x)ll, where T runs over all *-representations of A (2) B. And the algebra A (2) max B is called the projec- tive C*-tensor product of A and B. Any C*- crossnorm fJ on A (8) B lies in between these two norms, i.e., Ilxllmin « Ilxllp.s; Ilxll max for xEA (8) B. A C*-algebra A is called nuclear if A (8) B has a unique C*-crossnorm for any C*- algebra B. It is known that any OCR algebra is nuclear and that an inductive limit of nu- clear C*-algebras (e.g., any AF algebra) is nuclear. Given a linear mapping <p between C*-algebras A and B, we define for any inte- ger 11 ) 1 a linear mapping !fln :A (8) Mn-->B (8) M n by setting <Pn(x (8) eij) = <p(x) (2) eij' where {eij} are the matrix units for the C*-algebra M n of n x n complex matrices. !fl is said to be completely positive if <Pn is positive for any n ) 1. A C*-algebra A is then called injective if, for any C*-algebras B, C with B  C and any completely positive contraction <p: B--> A, there 160 exists a completely positive contraction ip: C+A that extends <po A C*-algebra A is nuclear if and only if its enveloping von Neu; mann algebra A ** is injective. Every injective C*-algebra B is not necessarily a von Neu- mann algebra but every A W*-algebra, i,e., every maximal commutative *-subalgebra 1\1 of B, is monotone complete (i.e., every increas- ing net of self-adjoint elements has a least upper bound in M) and the lattice of projec- tions in B is conditionally complete. I. Crossed Product Let A be a C*-algebra, G a locally compact Hausdorff group with left-invariant mea- sure jl, and i'J. a continuous action of G on A as *-automorphisms ex g , 9 E G. Let K(G, A) be the linear space of continuous functions from G to A with compact support, which is a *-algebra under the multiplication y * z and the involution y* given by (y * z)(g) = SGy(h)i'J. h (z(h- l g»dJ1(h) and y*(g)=(g-I). ex g (y(y-l)*) for y, zEK(G,A), where A is the tmodular function of G. By completing K(G, A) under the norm ]lylll = JG II yIIdJ1(g), we get a Banach *-algebra LI (G, A). LI (G,A) is then made into a C*-algebra by furnishing the norm Ilxli ==suPn Iln(x)ll, where n runs over all *-representations of LI (G, A). This C*-algebra, denoted by A 0, G, is the crossed product of A by G relative to the action iX, Crossed prod- ucts are useful in the structure theory of C*- algebras. J. Extensions by C*-A1gebras (BDF Tbeory) [12J Let H be a separable infinite-dimensional Hilbert space and 5L'(6'(H) the ideal of :J6(H) consisting of all compact operators. The quo- tient C* -algebra :J6( H)! 5L'CC( H) is called the Calkin algebra. We denote it by (H) and the quotient mapping by n::J6(H)......(H), By an extension of :t'CC(H) by a separable unital C*-algebra A we mean a unital (preserving unity elements) *-isomorphism, of A into f2(H). We call two extensions 'I, '2 :A -->f2(H) equivalent if, for some unitary operator UE :J6(H), n(u)r l (a)n(u)* = rz(a) for any aE A, and denote by Ext(A) the set of all equivalence classes [t] of extensions r by A. Ext(A) forms a commutative semigroup with respect to the addition [rJ = [rlJ + [r 2 J, where ,(a) = 'I (a) EB '2(a)d(H) EB(H)c(H (fjH). CalI an ex- tension,: A -->(H) trivial if there is a unital *-isomorphism (J of A into 8B(H) with r = na. Then (1) all trivial extensions are equivalent, 
161 and we define the unity element of the semi- group Ext(A) (D. Voiculescu); (2) Ext(A) is a group if A is commutative (L. G. Brown, R. G. Douglas, and P. A. Fillmore) or, more gener- ally, if A is nuclear (M. D. Choi and E. G. Effros); (3) Ext(A) is not always a group (J. Anderson). For further information -1. 390 Spectral Analysis of Operators J. K. Derivations in C*-Algebras A linear operator 15 in a C*-algebra A is a derivation in A if its domain D(6) is a dense subalgebra of A and b(xy) = b(x)y + xb(y) for X, yE D(6). b is a *-derivation if, moreover, x E D(6) implies x* E D(S) and b(x*) = b(x)*, b is called bounded (unbounded, closed) if it is bounded (unbounded, closed) as a linear operator. Every bounded derivation b is expressed as b(x) = ax - xa with some a in the envelop- ing von Neumann algebra A ** of A (R. V. Kadison and S. Sakai). The element a can be taken from the multiplier algebra M(A) = {bEA**lbA+Ab£;A} of A if A is simple (Sakai) or if A has continuous trace (C. A. Akemann, Elliott, G. K. Pedersen, and J. Tomiyama). If A is separable, we see that all bounded derivations in A are given by ele- ments in M(A) if and only if A is the C*-direct sum of a family of simple C*-algebras and a C*-algebra with continuous trace (Elliott, Akemann, and Pedersen) [13J and that every bounded derivation in the quotient C*-algebra All, I being any closed two-sided ideal, can be lifted to a derivation in A (Pedersen). Next we consider unbounded derivation. By a *-automorphism group on A we mean a tone- parameter group p" t E R, of *-automorphisms of A such that, for each XE A, p,x is contin- uous in t E R. A C*-dynamical system is a pair consisting of a C*-algebra A and a *- automorphism group p, on A. The fact that the time evolution of a physical system is often represented by such a dynamical system has made the study of unbounded derivations quite active. We have to see if a given deriva- tion b is tclosable and if, in case of a closable b, the closure () generates a *-automorphism group. Sample results: (1) If a *-derivation b is well-behaved in the sense that for every self- adjoint x in D(6) there exists a tstate ([1 with I tp(x)1 == IIxll and tp (b (x)) = 0, then b is closable and its closure is well-behaved (A. Kishimoto and Sakai); (2) if a *-derivation b is closable, its closure J generates a *-automorphism group if and only if b is well-behaved and (1 ::!: b)D(b) is dense in A (R. T. Powers and S. Sakai; O. Bratteli and D. W. Robinson). For further results -1. [ 14]. 36L Banach Algebras L. Applications to the Theory of Topological Groups [2,3,6, 15J Banach algebras have many applications in different branches of mathematics and quan- tum physics. Here we mention some that con- cern topological groups. Let G be a locally compact Hausdorff group and J1 be its left- invariant measure. We make the complex algebra L,(G) of Example 3 into a Banach *-algebra by detlning x*(g) = X(g-I ) (g-I), where A is the tmodular function of G. This is called the LI-algebra (or group algebra). It is not C* but is semisimple. Considering a uni- tary representation of G is equivalent to con- sidering a *-representation of the LI-algebra. Replace the norm of xELdG) by suplln(x)ll, where the supremum is taken over all the *- representations, The new norm satistles the C* condition, and the completion of L,(G) with respect to this norm is a C*-algebra, which we call the C*-group algebra of G. The dual of the C*-group algebra thus detlned is called the dual of the group G, and this notion plays an important role in the study of topological groups. Unitary representations of a group G, *-representations of the L I-algebra of G, and *-representations of the C*-group algebra of G are all characterized by positive definite fUnc- tions on G. A function p(g) is positive detbite, by definition, if it is measurable on G and f f p(g-I h)x(g)x(h)df1(g)df1(h);?-O for any continuous function x(g) with compact support. The Abelian case. When G is an Abelian group (- 422 Topological Abelian Groups), a regular maximal ideal M of the LI-algebra R of G and a character y of G are in a one-to-one correspondence by the relation x(M) = f x(g)y(gdJ1(g) (the left-hand side is the value of x at M under the Gel'fand representation). Moreover, the set of regular maximal ideals of R provided with the Gel'fand topology and the set G of charac- ters of G provided with the Pontryagin topol- ogy (the tcharacter group of G) are homeo- morphic by this correspondence, Therefore the GeI'fand transform of an element x of the L 1- algebra R is seen to be a function x(y) on G defined by the integral on the right-hand side in the above expression, which is properly called the Fourier transform of x. Of course, the Fourier transform Can be detlned for other classes of functions (e,g., for the L 2 -space over G. the Fourier transform in the sense of Plan- 
36M Banacb Algebras cherel), and classical theories of Fourier series, Fourier integrals, and harmonic analysis (- 192 Harmonic Analysis) are studied from a more extensive point of view. Thus the state- ment that the Fourier transform of an element x of the LI-algebra of G is a continuous func- tion vanishing at inl1nity ( Section E), for example, is a version of the classical tRiemann- Lebesgue theorem. Bochner's tbeorem in classical tFourier analysis is restated thus: A continuous tpositive definite function on an Abelian group can be put in the form p(g)= fY(9)dP(y), where p is a uniquely determined bounded positive tRadon measure on the character group G, Developing these theories further, we obtain an alternative proof of the tPontryagin duality theorem (H. Cart an and R. Godement). A closed ideal I in the LI -algebra R deter- mines a set Z(I) in G as the set of common zeros of the Fourier transforms of elements of I. We ask whether, conversely, I is character- ized by Z(I). This question is the problem of spectral syntbesis, and many important results have been obtained. The statement that I must coincide with R when Z(I) is empty is a for- mulation of the generalized Tauberian tbeorem of N. Wiener. A considerable simplification of the proof was accomplished by using the theory of Banach algebras, which was the first application of the theory (I. M. Gel'fand). M. Ho10morpbic Functional Calculus [16,171 Let R be a unital commutative Banach algebra with maximal ideal space Wl = Wl(R), We de- fine the joint spectrum Sp(X) of a finite n-tuple X = {XI,'''' xn} in R by {(Xl (cp)" xn(CP)) cpEWl}, a nonempty compact subset of cn, Then the bo10morpbic functional calculus says the following: For each X = {Xl" , x,} c R there exists a unique algebra homomorphism <l>x of the algebra H(Sp(X») of tgerms of holo- morphic functions on Sp(X) into R with the following properties: (1) <l>x(1) = 1, the unity element of R, and <I>x(z;) = Xi' 1 .s;i.s; n; (2) if n<m, X'=={Xl'.... x,} andF(zl'.... zm)= F(zl" .. . zn) with FEH(Sp(X», then <I>x,(F)= <I>x(F); (3) if {Fk k = 1,2, } are holomorphic in a fixed neighborhood U of Sp(X) and Fk--> F uniformly on U. then <l>x(Fk)--><I>x(F) (L. Waelbroeck). From this follows the implicit function tbeorem: Let X I' . ,x n E R, fEe (Wl), and let F(w, z , , . . , zn) be holomorphic in a neighborhood of the set Sp(f, Xl" . . ,X n ) = {(/(<p),X t (cp)" Xn(cp)) I cpE9Jl}. Suppose that 162 F(f,Xl, '" ,xn)==O but 6Fj6w#0 on Sp(f,Xl> ""x,). Then there exists a unique YE R such that y= f and F(y,x l ,. ,xn)=O (R. Arens and A. Calderon). As an application we obtain Shilov's idempotent tbeorem: If K is a closed- open subset of Wl, there exists x E R with x = Ion K and X=O on Wl-K. Set Q(R)= {XE Rle 2 . ix = 1). Then Q(R) is an additive sub- group of Rand Shilov's theorem says that the Gel'fand representation gives an isomor- phism of Q(R) onto Q(c(W» (-::::;HO(IJJl, Z». Another theorem of this sort is the Arens- Royden tbeorem: R -I jexp(R)  C(1JJl)-1 j exp(C(Wl))  Hl(lJJl, Z), where R -I is the set of invertible elements of R and exp(R) = {e I XE R}. Further extensions related to K-theory have been given by J. L. Taylor and others. (For functional calculus for one variable --" 25 1 Linear Operators) References [IJ C. E. Rickart, General theory of Banach algebras, Van Nostrand, 1960. [2J M. A. NaImark, Normed algebras, Wolters-Noordhoff, 1972. (Original in Russian, 1968.) [3J L. H. Loomis, An introduction to abstract harmonic analysis, Van Nostrand, 1953. [4J K. Yosida, Functional analysis, Springer, third edition, 1971. [5J W. Arveson, An invitation to C*-algebra, Springer, 1976. [6J J. Dixmier, Les C*-algebres et leurs repre- sentations, Gauthier-Villars, 1964. [7J G. K. Pedersen, C*-algebras and their automorphism groups, Academic Press, 1979. [S] S. Sakai, C*-algebras and W*-algebras, Springer, 1971. [9J M. Takesaki, Theory of operator algebras 1, Springer, 1979. [ 10J o. Bratteli and D. W. Robinson, Opera- tor algebras and quantum statistical mechan- ics, Springer, 1979. [ I1J G. A. Elliott, On the classification of inductive limits of sequences of semisimple finite-dimensional algebras, J. Algebra, 38 (1976), 29-44. [12] R. G. Douglas, Extensions of C*-algebras and algebraic topology, Proc. Internat. Con- gress Math. Helsinki 1978, vol. 2, 1980, 6999 704. [13J C. A. Akemann and G. K. Pedersen, Central sequences and inner derivations of separable C*-algebras, Amer. J. Math., 101 (1979), 1047-1061. [14] S. Sakai, Recent developments in the theory of unbounded derivations in C*- 
163 algebras, Proc. Internat. Congress Math. Helsinki 1978, vol. 2, 1980,709-714, [ 15J W. Rudin, Fourier analysis on groups, Interscience, 1962. [16J T. W. Gamelin, Uniform algebras, Prentice-Hall, 1969. [17J J. L. Taylor, Banach algebras and topol- ogy, Algebra in Analysis, J. H. Williamson (ed.), Academic Press, 1975, 118186. 37 (X11.3) Banach Spaces A. General Remarks The notion of Banach space was introduced in 1922 by S. Banach and N. Wiener, working independently. The idea was to apply topo- logical and algebraic methods to fundamental problems of analysis, such as mapping prob- lems in infinite-dimensional function spaces (- 168 Function Spaces, 197 Hilbert Spaces, 25 1 Linear Operators). B. Definition of Banach Spaces We associate to each element x of a tlinear space X over the real (or complex) number field a real number Ilxll satisfying the following conditions: (i) Ilxll ",,0 for all x, and Ilxll =0 is equivalent to X == 0; (ii) IllXx II == IIXI ' II xii for any real (complex) number IX; (iii) I!x+ yll  Ilxll + II y!l, Then !Ix II is called the norm of the vector x, and X is called a normed linear space. The norm is thus an extension of the notion of the length of a vector in a tEuc1idean space. A normed linear space X is a tmetric space under the distance p(x, y) = Ilx- yll, We write s- limnoo X n = x or simply x n -+ X when limnoo II x n - xii = 0 and say that X n converges strongly to x. If this metric space X is tcomplete, then X is called a Banach space. In normed linear spaces the addition and the multiplication by scalars are continuous. A closed linear subspace M of a Banach space X is again a Banach space, and the tquotient space XjM becomes a Banach space if the norm of a tcoset is defined by Ilx + M II = inf mEM Ilx + mil, The subset {x II x II  I} of a normed linear space X is called the (closed) unit ball (or unit sphere) of X. Examples. tFunction spaces C, Lp (l.s; p < 00), M, W;, H6, tsequence spaces c, I, m, and BVare all Banach spaces (- 168 Function Spaces). 37 D Banach Spaces C. Linear Operators and Linear Functionals Suppose that a linear subspace D(T) of a linear space X is the (definition) domain of a mapping T with values in a linear space XI T is called a linear operator ifT(lXx + f3y) = IX Tx + f3Ty for any scalars a, f3 and x, y E D(T). R(T)={TxEXllxED(T)} is called the range of T. In the special case where XI is the real or complex number l1eld, T is called a linear functional. If X and XI are both normed linear spaces, then T is continuous if and only if s-lim", Tx, = Tx whenever s. limn«) X,=X. This is equivalent to the condi- tion SUPxED(T), Ilxl: I II Txll < 00. In particular, if D (T) = X, the linear operator T is continuous if and only if the set {Tx Ilxll  I} is bounded. In this case, T is called a bounded linear oper- ator, and II TII = sUPllXI III Tx II is called the norm of the operator T. In particular, a linear operator T satisfying II T II  1 is called a con- traction. (Sometimes T is called a contraction only when II T II < 1. In that case, an operator with II T I  1 is called a nonexpansive opera- tor.) The scalar multiple, sum, and product of linear operators are defined by (IXT)x = a( Tx), (T + S)x = Tx + Sx, and (ST)x = S( Tx), respec- tively. The identity operator I in X is defined by I. x = x for all x EX. If the inverse mapping T- I of x-+ Tx exists, then it is called the in- verse operator of T. A normed linear space X is said to be isomorphic to a normed linear space Y if there exists a bounded linear opera- tor T from X onto Y with bounded inverse. If T can be chosen to be isometric (i.e., II Txll II x II for all x E X), then X is said to be iso- metrically isomorphic to Y. D. The Dual Space and the Dual Operator The totality of continuous linear functionals f defined on a normed linear space X is a Banach space X' under the previously de- fined linear operations and the norm IIJII = sUPlixiiO If(x)l, This X' is called the dual (or conjugate) space of X. In view of useful prop- erties of the tinner product in tHilbert spaces, it is sometimes convenient to write <xJ) for f(x). Let X and Y be normed linear spaces, and let T be a linear operator with the dense domain D (T) in X and the range R (T) in Y. If (f, g) is a pair with fEY' and g EX' satisfy- ing the equation (IX, f) = <x, g) for any XE D(T), then g is determined uniquely by f The operator T' defined by T'f = g is linear and is called the dual operator (or conjugate or adjoint operator) of T. This is an extension of the notion of the ttranspose of a matrix in matrix theory. If T is a bounded linear operator then 
37E Banach Spaces T' is also a bounded linear operator such that IIT'I = IITII. E. The Weak Topology and the Strong Topology Let X be a normed linear space and X' its dual space. Take a finite number of elements x;, x, , x from X', and consider the subset of x:{xEXlsuPI';;i,;;nl<x,x;)I.s;s}, s>O. If we take the totality of such subsets of X as a tfundamental system of neighborhoods of 0 of X, then X is a tlocally convex topological linear space, denoted sometimes by X w ' This topology is called the weak topology of X. If a sequence {xn} C X converges to x E X with respect to the weak topology of X, then it is said to converge weakly. This is equivalent to the convergence (x" f )--><x, f) for any f E X'. The original topology of X determined by the norm is then called the strong topology of X, and to stress the strong topology we sometimes write Xs in place of the original X. Take a finite number of elements x I' X 2, , " , X n from X, and consider the subset of X': {x' Ex'i SUPI ';;i,;;nl<xi,x') I.s;s}, s>O, If we take the totality of such subsets of X' as a fundamental system of neighborhoods of 0 of X', then X' is a t10cally convex topological linear space. We write this space as X. and call the topology the weak* topology of X' . The topology of X' detlned by the norm II III is called the strong topology of X', and to stress the strong topology we write X;. The terms "weak" and "weakly" are used in reference to the weak topology, for instance, weak c1osed- ness and weakly closed. Similar conventions are used for the weak* and strong topologies. The unit ball of the dual space of a normed linear space X is weak*-compact (Banach- A1aog1u theorem). Then by the tKrein-Milman theorem ( 424 Topological Linear Spaces) the unit ball of X' is the weak* closure of the tconvex hull of its textreme points. If X is a Banach space, a convex subset K of X' is weak*-c1osed if and only if the intersection of K and each weak*-compact subset is weak*- closed (Krein-Shmul'yan theorem). F. The Hahn-Banach Extension Theorem Let M be a linear subspace of a real linear space X and p(x) a real-valued functional detlned on X such that p(x + Y)  p(xl + p(y) and p(h)=Ap(x) for all x, YEX and AO. If a linear functional It defined on M satisfies II (x) <p(x) for all x EM, then there is a linear functional f on X which extends II and satis- fies f(x) .s;p(x) for all XEX (the Hahn-Banach 164 (extension) theorem). The Hahn-Banach exten- sion theorem has many applications. (1) Let M be a linear subspace of a normed linear space X. Then for any 11 EM', we can construct an fEX' such that f(x) = fdx) for all xEA1 and Ilfll = Ilflll. (2) For any Xo #0 of a normed linear space X, we can construct an 10 E X' such that fo(xo) = Ilxoll and 111011= 1. (3) For any closed linear subspace M of a normed linear space X and a point xor/; M we can construct foEX' such that I foil =1, fo(x o ) = inf mEM Ilxo-mll and fo(x)=O for all xEM. A proposition more general than (3) is Mazur's theorem, which is useful in applications: (4) Let a closed subset M of a normed linear space X be tconvex, Then for any Xo r/; M we can con- struct an fo E X' such that fo(x o ) > fo(x) for all XE M, By (4) we can prove, e.g., that a convex set of a normed linear space is weakly closed if it is strongly closed. This proposition has the following corollaries: (5) A convex set A containing 0 is closed if and only if (A 0)0 = A, where A 0 is the tpolar of A (the bipolar theorem). (6) For two closed convex sets A and B, containing 0, the polar of A n B coincides with the weak* closure of the convex hull of AO U BO, For complex linear spaces, most of the propositions in this section are valid with Re f(x) instead of f(x), The Hahn-Banach theorem can also be em- ployed to prove the existence of the general- ized limit (or Banach limit) Limnoo (n defined for all bounded real sequences {(n} such that lim infnoo (n.s; Limnoo (n.s; lim sup"oo (n and Lim"oo(Cl';" + 13l1n) =  Lim"oo n + 13 Lim"oo I1n' G. Duality in Normed Linear Spaces An element Xo of a normed linear space X gives rise to an element x of (X;)' determined by <xo, x,> == <x', x> for all x' E X;, If we WrIte x = Jx o , then J is a linear operator satisfying IIJxol1 = Ilxoll by (2), and so the space X is isometrically isomorphic to a linear subspace of (X;), If Xs coincides with (X;); under this isomorphism, we call X a reflexive (or regular) Banach space. A necessary and sufficient con- dition for the normed linear space X to be reflexive is that the unit ball of X be weakly compact. A convenient criterion for the reflex- ivity of X is that any bounded sequence {x n } of X contains a subsequence weakly convergent to a point of X (Eherlein-Shmul'yan theorem). In this connection, a Banach space X is reflex- ive if and only if each X'E X' attains its norm, i.e., there is an Xo EX such that Ilx o II = 1 and I (xo,x')1 == Ilx'll (James's theorem). A normed linear space is said to be uni- formly convex iffor any f, > 0 there exists a b > o such that Ilxll (1, II YII.s; 1 and Ilx-yl s 
165 implies II x + y II  2 - b, A normed linear space is said to be uniformly smooth if for any £ > 0 there is a b > 0 such that II X I = 1 and II yll  b imply Ilx+YII + Ilx-YII 2Hllyll, A Banach space X is uniformly convex (resp. uniformly smooth) if and only if the dual X' is uniformly smooth (resp. uniformly convex). If a Banach space X is uniformly convex or smooth, it is isomorphic to a Banach space that is uniformly convex and uniformly smooth (P. Enflo). The space Lp with 1 < P < (fJ is uniformly convex and uniformly smooth. Any uniformly convex or uniformly smooth Banach space is reflexive (Milman's theorem). H. The Resonance Theorem Let {T,} be a sequence of bounded linear operators from a Banach space X into a normed linear space Y. The uniform bounded- ness theorem (resonance theorem or Banach- Steinhaus theorem) states that sUPn?1 II T.II < (fJ if sUPn?llI1;,xll < 00 for every XEX. As a corol- lary, we have sUP9111xnli < 00 for any weakly convergent sequence of X. Another corollary states that the set {XEX limsuPnoo II T.xll < (fJ} either coincides with X or is a subset of X of the tfirst category. This implies the so-called principle of condensation of singularities, which gives a general existence theorem for func- tions exhibiting various kinds of singularities, for example, a continuous function whose tF ourier expansion diverges at every point of a tperfect set of points having the cardinal num- ber of the continuum. I. The Closed Graph Theorem Let T be a bounded linear operator from a Banach space X into a normed linear space Y. If the image of the unit ball of X under T is dense in the unit ball of Y, then for any G > 0 and any Yo E Y the equation Tx = y, has a solution x with Ilx I':::; (1 + E) //y, ,I (Banach's theorem). By using the tBaire category theo- rem, we can derive from this the open map- ping theorem: Every bounded linear operator from a Banach space X onto a Banach space Y maps each open set in X onto an open set in Y. As an application of the open mapping theorem, we can prove the closed graph theo- rem: A linear operator T from the whole of a Banach space into a Banach space is con- tinuous if and only if T is a closed operator, i.e., s-Iimnooxn =x and s-Iim", Tx, = y imply Tx = y. This theorem plays an important role in modern treatments of linear partial differen- tial equations. 37 K Banach Spaces J. The Closed Range Theorem Let X and Y be Banach spaces and Ta linear closed operator with domain D(T) dense in X and with range R(T) in Y. Under these con- ditions, the following four propositions are mutually equivalent. (1) R(T) is a closed set in Y. (2) R(T) is a closed set in X'. (3) R(T) = {YE YI<y,y*)=O for alI Y*ED(T') such that T'y* =O}. (4) R(T')= {X*E X' I <x, x*) =0 for all x E D( T) such that Tx= OJ, These four propositions, as a whole, are called the closed range theorem. This theorem implies (5) R(T) = Y if and only if T has a continuous inverse; and (6) R( T) = X' if and only if T has a con- tinuous inverse. The following theorem is of similar nature: the following three propositions on two closed linear subspaces M and N of a Banach space are mutually equivalent. (7) M + N is closed, (8) Me + N° is strongly closed. (9) MO + N° is weak*-closed, The Hahn-Banach theorem, the resonance theorem, the open mapping theorem, the closed graph theorem, and the closed range theorem can be extended to various classes of tlocally convex topological linear spaces. By virtue of this extension, we are able not only to treat various fundamental problems of analysis from a unified viewpoint but also to develop the theory of functional analysis itself in a new direction ( 424 Topological Linear Spaces; concerning linear operators on a Banach space  68 Compact and Nuclear Operators, 251 Linear Operators, 390 Spectral Analysis of Operators). K. Differential and Integral Calculus of Functions with Values in Banach Spaces Calculus involving functions from a set to a Banach space is also an effective tool in vari- ous problems. A function x(t) det'ined on an interval [a, b] with values in a Banach space X is said to be strongly (weakly) continuous if x(t) converges strongly (weakly) to x(to) as t-->t o ' For a strongly (weakly) continuous function x(t), the Riemann integral can be defined in a standard way, using strong (weak) convergence of the Riemann sum r x(t)dt=limIx(t()(t i + 1 -tJ The tfundamental theorem of calculus, i.e., strong (weak) differentiability of the indefinite integral, remains true. Various defmitions of integrals of a Banach space-valued function on a tmeasure space are discussed elsewhere (- 443 Vector-Valued Integrals). Now let x(ic) be defined on a domain Q in the complex plane with values in a complex 
37 L 166 Banach Spaces Banach space X. x(,1,) is said to be bolomorpbic if f(x(,1,» is tholomorphic in Q for every fE X'. If x(,1,) is holomorphic, then there exists an X- valued function y(,1,) on Q such that IIC I (x(,1,+O-x(,1,))-y(,1,))II-->O a , --> o. Therefore there is no difference between "strong" and "weak" in analyticity. tCauchy's integral theorem remains true for a holo- morphic function x(,1,) with values in X, and the tLaurent expansion 00 x(,1,) = L an(A - Ao)n, n= - 00 an= f x(A)(A-Ao)-n-ldA, 2mJc is valid with the integral taken in the Rieman- nian sense. Banach space-valued holomorphic functions on complex (or real) tanalytic mani- folds of higher dimension can be defined in a natural way by means of power series expan- sion. A function 0(A) defined on a domain in the complex plane with values in the Banach space B(X, Y) of bounded linear operators from X to Y becomes holomorphic if f(0(A)X) is holomorphic for every X E X and fEY' . An operator-valued holomorphic function is often called an analytic operator function. L. The Approximation Property A Banach space X is said to have the approxi- mation property if there is a family {S,} of bounded linear operators of tfinite rank in X such that inf;. I (S;. - I) T il = 0 for all tcompact linear operators T in X. There is a Banach space that fails to have the approximation property (Enflo [9J), More surprisingly, the space of all bounded linear operators on an infinite-dimensional Hilbert space fails to have the approximation property (A. Szankowski). The approximation property plays a deci- sive role in the theory of ttensor products of Banach spaces. A Banach space X is said to have the hounded approximation property if a family {S,} in the definition of the approxi- mation property can be taken bounded, i.e., sup;. IIS;,II < 00. The bounded approximation property does not follow from the approxi- mation property. The bounded approximation property is closely related to the existence of a basis. A sequence {e,} in a Banach space X is called a Schauder basis or simply a basis (or base) for X if to each XE X there corresponds a unique sequence of numbers {cc,} such that Iimnoo Ilx-L=1 (J.kek II =0, Most separable Banach spaces appearing in analysis have bases. A separable Banach space has the bounded approximation property if and only if it is isomorphic to a complemented linear subspace ( Section N) of a Banach space with a basis (A. Pelczynski), M. Injective and Projective Banach Spaces Banach spaces of the type tC(Q) with compact Q and of the type HI (Q) on a set Q playa spe- cial role in the theory of Banach space. This is already seen in the fact that every Banach space is isometrically isomorphic to a subspace of a space C(nd as well as to a quotient space of a space II (0,). A Banach space X is said to be injective if for any Banach space Y and its linear subspace M, each bounded linear operator T from M to X can be extended to a bounded linear operator f from Y to X, that is, Tx = Tx for all XE M, A Banach space is m. jective if and only if it is isomorphic to a com. plemented linear subspace of the space C(R) for a compact Hausdorff space Q with the property that the closure of every open set is open. Such a topological space is called tex- tremally disconnected. The tmaxima1 ideal space of the tBanach algebra Loo(Q) is extrem- ally disconnected. Hence the Banach space Loo(Q) is injective, Whether every injective Banach space is isomorphic to a space C(O) is still an Open problem. However, a Banach space is isometrically isomorphic to a space C(Q) with Q extremally disconnected if and only if it is injective with the property of norm- preserving extension, i,e" II TII = II TI is always possible (Nachbin-Goodner-Kelley theorem). In this connection, the following propositions on a Banach space X are mutually equivalent. (1) X' is isometrically isomorphic to a space t L I (fl), (2) For any Banach space Y, its linear subspace M, and E> 0, each compact linear operator T from M to X can be extended to a compact linear operator f from Y to X with II TII::::;; (1 + 8)11 TII (1. Lindenstrauss). A Banach space X is said to be projective if for any Banach space Y and its closed linear subspace M, each bounded linear operator S from X to the quotient space YI M is lifted to a bounded linear operator S from X to Y, i.e., q;(8x) = Sx for all x EX, where IfJ is the quotient mapping from Y to YIM. Projectivity of X is character- ized by its being isomorphic to the space II (Q) on a set Q, A Banach space is isometrically isomorphic to a space II (R) if and only if it is projective with the property of norm- preserving lifting, i,e., 11811 = IISII is always possible (- [8J). N. Complemented Subspaces Problems A linear subspace M of a Banach space X is complemented, i.e., there is a closed linear 
167 subspace N such that M n N = {o} and M + N = X if and only if M is the range of a bounded projection P, i,e" p2 = P and R(P) = M, Each nonzero closed hnear subspace of a Hilbert space is complemented, or more pre- cisely, it is the range of a projection of norm one (- 197 Hilbert Spaces). This property distinguishes Hilbert spaces from general Banach spaces: (1) A Banach space of more than 3 dimensions is isometrically isomorphic to a Hilbert space if each nonzero closed linear subspace is the range of a projection of norm one (Kakutani's theorem). (2) A Banach space is isomorphic to a Hilbert space if each closed linear subspace is complemented (J. Linden- strauss and L. Tzafriri [ lOJ). O. Quasi-Banach Spaces and Fn\chet Spaces Let X be a linear space over the real (or com- plex) number field, Suppose that a real-valued function I!xll on X satisfies (i) and (ii) of Sec- tion B and (iii') Ilx+ yll k(llxll + II yll) with a constant k :;:: 1 independent of x and y. Then Ilxll is called the quasinorm ofx, and X equip- ped with a quasinorm is called a quasinormed linear space. Let 0 < p  1 be the root of the equation k = 2(1/p)-I, Then there is a distance d(x, y) = d(x -y) depending only on x -y such that d(x-y) Ilx-yIIP2d(x-y). Hence a quasinormed linear space is a metric space in which a sequence x n converges to x if and only if Ilx n - X 11-->0. If a quasinormed linear space X is complete under this metric, then X is called a quasi-Banach space. The tfunction space Lp is a quasi-Banach space for 0 < P < 1. If we denote by Ilxll the distance d(x-O) of a quasinormed hnear space, then it satisfies (i) and (iii) of Section B and (ii') II-xii = Ilxll and limnoo Ilanxn-axll =0 whenever ana and limn...")) Ilxn-xll =0. A functional II xii satisfying (i), (ii'), and (iii) is called a pseudonorm. If a hnear space X equipped with a pseudo norm is complete, then X is called a Fn\chet space (in the sense of Banach). The tfunction space S(Q) is a Frechet space. Quasinormed linear spaces and Frechet spaces are ttopological linear spaces, but they need not be tlocally convex. Hence it is possible that there is no continuous linear functional except for zero. However, the open mapping theorem and the closed graph theorem hold for linear operators from a Frechet space into a Frechet space. References [IJ S. Banach, Theorie des operations lin- eaires, Warsaw, 1932 (Chelsea, 1963). [2J N. Dunford and J. T. Schwartz, Linear operators I, Interscience, 1958. 38 Bernoulli Family [3J K. Yosida, Functional analysis, Springer, 1978. [4J M. M. Day, Normed linear spaces, Springer, 19.58. [5J H. E. Lacey, The isometric theory of class- ical Banach spaces, Springer, 1974. [6J J. Lindenstrauss and L. Tzafriri, Classical Banach spaces I and II, Springer, 1977, 1979. [7J J. Diestel, Geometry of Banach space, Springer, 1975. [8J A. Pelczynski, Some aspects of the present theory of Banach spaces, Oeuvres de Stefan Banach II, Warsaw, 1979. [9J P. Enflo, A counterexample to the approx- imation problem in Banach spaces, Acta Math., 130 (1973), 3099317. [ 10J J. Lindenstrauss and L. Tzafriri, On complemented subs paces problem, Israel J. Math., 9 (1971),2633269. 38 (XXI.1 Bernoulli 4) Family The Bemoullis, Protestants who came ongl- naIly from Holland and settled in Switzerland, were a signitlcant family to the mathematics of the 17th Century. In a single Century, the family produced eight brilliant mathemati- cians, alI of whom played important roles in the development of calculus. The brothers James (1654-1705) and John (1667- 1748) and Daniel (1700-1782), John's second son, were especially outstanding. James and John were close friends of G. W. tLeibniz, with whom they exchanged the correspon- dence through which it might be said that calculus developed. James studied problems related to the ttautochrone and tbrachisto- chrone, as well as problems in geometry, dy- namics, and other fields, including the tisoperi- metric problem. He was the first to change the name calculus SU/nff11ltoris to calculus iniegralis (1690). His Ars conjectandi was published after his death in 1713; in it is found the tlawof large numbers, which made his name promi- nent in the theory of tprobability. James had little guidance, leaming mathematics on his own. He was a professor of experimental physics at the University of Basel and later became a professor of mathematics. He taught mathematics to his brother John, who suc- ceeded him as professor at the University. John's many achievements appeared in such publications of the time as Acta eruditorum and JJUrool dRs savanis. In 1701, the begin- nings of the tcalculus of variations were seen in his solution to the isoperimetric problem. He 
38 Ref. Bernoulli Family was the first to use the term functio, the root of the present term function (1714). Despite discord between the brothers and also between fathers and sons, the Bernoullis were ardent teachers and brilliant researchers, who instructed not only their sons but also such mathematicians as tEuler. Their achieve- ments were numerous in consolidating the content and form of calculus and also in ex- panding its application. Daniel was especially outstanding in the theory of probability; he also made contributions to the field of thydro- dynamics and to the tkinetic theory of gases. The eldest John's eldest son Nicolas (1695- 1726) achieved distinction as a professor of mathematics in St. Petersburg. Daniel's youngest brother, John (1710-1790), suc- ceeded his father, John Sr., as a professor at the University of Basel. The son of John, Jr., also named John (1744-1807), was the chair- man of mathematics at the Academy of Berlin. His brother, another James (1759-1789), was a professor of experimental physics at the University of Basel. Nicolas (1687-1759), a grandson of the founder Nicolas (162331708) and son of Nicolas the painter (1662-1716), held Galileo's old chair of mathematics at Padua from 1716 to 1719. Nicolas I James 1654-1705 1623-1708 I Nicolas (painter) 1662-1716 I I John 1667-1748  John 1687-1759 169551726 1700-82 1710-90  James I Nicolas I Nicolas I Daniel I John 1744-1807 1759-89 In this article, the first names have been given in English. The German names corre- sponding to James, John, and Nicolas are Jakob, Johann, and Nikolaus, respectively. References [ 1] M. B. Cantor, Vorlesungen iiber Ge- schichte der Mathematik III, Teubner, 1898. [2J Jacobi Bernoulli, Opera 1, II, Cramer, 1744. [3J Jakob Bernoulli, Die Werke von Jakob Bernoulli, Birkhauser, 1969. [4] Jacobi Bernoulli, Ars conjectandi, Bale, 1713. [5J Johannis Bernoulli, Opera omnia I-N, M. M. Bousquet, 1742. 168 39 (XIV.8) Bessel Functions A. General Remarks Bessel functions were first introduced in order to solve tKepler's equation concerning plane- tary motions and were systematically investi- gated by F. W. Bessel in 1824. Since then they have appeared in various problems and have become important. B. Bessel Functions Separating variables for the Helmholtz eqUa- tion ,1\)1 + P'P = 0 in terms of cylindrical coordinates, we obtain Bessel's differential equation d 2 w I d w ( V2 ) -+--+ 1-- w=O dz 2 Z dz z2 (1 ) for the component of the radius vector. The following two linearly independent solutions of(1): HI)(Z)= f e-izsin(+iv(d, n JL 1 H2)(Z)= f e-izsin(+iv(d(, n JL 2 (2) are called the Hankel functions of the first and second kind, respectively, where the contour L 1 of the first integration is a curve from ( - n + 0) +ico to -O-ioo, and L 2 is a curve from +0 - ioo to (n - 0)+ ioo, If both z and v are real, we have Hv (z) = Ht 2 )(z), H2)(Z) = HI)(Z), (3) where z is the complex conjugate of z. Hence JAz) ==(HI)(Z) + He)(z»)/2, Nv(z) == Y,(z) =(Hl)(z) - H2)(Z))/2i (4) are real functions, If both z and v are complex, the functions J,(z) and Nv(z) defined in (4) are also called Bessel functions and Neumann functions, respectively. The other names for J,(z), Nv(z), and H,(z) are Bessel functions of the first, second, and third kind, respectively. Each of them satisfies the following recurrence formulas: 2 dC ,(z) _ - C V - I (z)- C,+1 (z), (2vlz)C v (z) = C V - I (z) + C V + I (z). (5) In general, functions satisfying the simulta- neous tdifferential-difference equations (5) are called cylindrical functions. Every cylindrical 
169 function C,(z) is represented in the form C,(z) == a l (V)HI)(Z)+ a2(v)H2)(z), where a,(v) and a 2 (v) are arbitrary periodic functions of period 1 with respect to v. If v = n (an integer) we have J -n(z) = (-1)" In(z), N_n(z)=( -1)" Nn(z), which show the linear dependency of Ln and I n , and N_ n and N n , respectively. If v 1= n (an integer), as the fundamental solutions of (1) we can take a pair J, and J-v> or N v and N- v ' In (2) if we take as a contour of integration a curve from (-n+O)+;oo to (n-O)+ioo, we obtain an integral representation for .1,(z), which yields the relations Jv(ze im ,) = e imvn Jv(z), J _v(ze imn ) == e- imvn J -v(z), If v = n (an integer) and Re z > 0, we obtain 1 I n In(z)=- eiZ,jn(+in(d( 2n -n 1 I n =- cos(zsin(-'1()d(, IT  which is called Bessel's integral. These repre- sentations imply the following expansions by means of tgenerating functions: CD ei2sin' = L In(z)e in ,, n""" - 00 00 cos(z sin() = Jo(z) + 2 L J 2 n(z)cos2n(, nl 00 sin(zsin()=2 L J 2n +1(z)sin(2n+ 1)(. n=O Making a change of variable u = exp( (2), we obtain iOin J,(Z)= f exp (  ( u- )) u-'-l du, 2m L 2 u where L is a contour starting at the point at infinity with the argument - n, encircling the origin in the positive direction, and tending to the point at infinity with the argument n. From (12) we obtain a power series expansion z v 00 ( -1 t ;m Jv(z) = 02 mo m!r(V+m+ 10 2 I (13) obtained also from (1) by a power series ex- pansion at z = 0, which is a tregular singular point of (1). Substituting (13) into Nv(z) = (ms vzJv(z) - J_v(z»)/sin vn, (14) we obtain a power series expansion for Nv(z). A power series expansion for N,(z) for an inte- ger n is obtained by taking the limit v-->n ( Appendix A, Table 19). In particular, if v = 39 c Bessel Functions n + 1/2 (n is an integer), we have (2z)n+1/2 d n ( sinz ) I n +1/2 (z) = (-1)" fi d(Z2)" -----;- I n=0,1,2"", (6) which is represented by elementary functions and is sometimes called simply the half Bessel function. Bessel functions for half-integers have appeared also as radius vector components when the variables in the Helmholtz equation are separated by spherical coordinates. The function jn(z) = Jnl2z I n + l / 2 (Z) is called the spherical Bessel function. We have the following addition theorem: (7) 00 H)(kp)einlJ! = L In(kr2)H!!'m(krl)eimqJ, m;;;;-oo f.J.=1,2. where (8) p = J rI + r - 2r I r 2 cos <p , pcoso/ ==r l - r 2 cos q;, p sin 0/ = r 2 sin <po (9) C. Zero Points of the Function .1,(z) (10) From the differential equations satisfied by J,(exz), we have (a 2 - f32) I I zJ v (az)J v (f3z)dz '0 = f3J,(a)J(f3) aJ(a)J'(f3). (15) (11) By letting f3-->a in (15), we have IOI z(J,(az))2 dz =K(1 - ::) (J,(a)f+(J(a)j2) (16) (12) If r/, and fi are distinct roots of .1,(z) = 0, we have from (15) I I zJ,(az)Jv(f3z)dz=O, Rev> -1. (17) '0 The integral formulas (15), (16), and (17) are called Lommel's integrals. As for the zero points of .1,(z), the following facts are well known: .I(O) = 0 if v > O. .I(z) has no multiple zero points other than z = O. J v ( - a) = 0 if Jv(a) = O. Every zero point of .1,(z) is real if v > -1. Between two adjacent zero points that are positive, there exists one and only one zero point of J v - I (z) and J V + I (z), respectively. .1,(z) has a countably inIlnite set of zero points on the real axis. When v :? 0 is rational, every zero point of .1,(z) except z = 0 is a ttranscen- dental number. The transcendentality of IT is a special case of this result for v = 1/2. 
39 D Bessel Functions D. Expansion by means of Bessel Functions Let f(r, q;) be defined for 0 < r < 1, - n < q; < n, and (1..,1, (1.n,2, ,,,,an,s' ,,,(O<exn.s<ex n ""'-.,, s= 1, 2, for every n) be zero points of In(x) (n = 0, 1,2, . . . ), Then we have an expansion ao ao f(r, q;) = I L (an,scosnq; + bn,ssin nq;)J.((1.n,sr), "=0 s1 (18) which is called the Fourier-Bessel series. The coefficients an., and bn,s are determined by the properties oftFourier series and (16) and (17) as follows: an,, } b n . s [;n n(1n+1 «(1.n,.))2 x l' f n f(r, Jo -n cos I!<p <p)Jn(a n sr) r d<p dr; , sm nq; [;0=1, [;1=[;2=...=2. The integral transformation g(y) = Lao xf(x)Jn{xy)dx is called Ihe Fourier-Bessel transform. If f(x) is sufficiently smooth and tends rapidly to zero as X --> 00, the following inversion formula holds: f(x) = {" yg(y)Jn(xy)dy. There are other types of series expansions in terms of Bessel functions as follows: Dini's series 00 L amJ,(}'m x ) m==l (Am is the mth positive root of xJ(x) + HJv(x) = 0, where H is a real constant); Kapteyn's series 00 L amJv+m«v+m)x); m=! Schlomilch's series a 00 + L amJO(mx); 2 mO and Ihe generalized SchlOmilch series + f: amJv(mx)+bmHv(mx) 2r(v+l) m! (mx/2)' , where H,(mx) is the Struve function (- Section F). E. Asymptotic Expansion If zlor v is sufftciently large, the asymptotic representation for Bessel functions is obtained 170 by applying the tmethod of the steepest de- scent for (2). If Izl > I vi, we have HI)(Z) !f exPi(z-V-), - n < argz < 2n, H2)(Z) If ex p ( -i(Z-V-)), - 2n< argz < IT, Jv(z) If cos(z-v-} -n<argz<n, (19) Nv(z) If Sin(z-v-). Hence HI)(Z) tends to zero as Izl->cx:: in the upper half-plane, and becomes large exponen- tially as Izl-> 00 in the lower half-plane. The results for H2) are obtained by interchanging "upper half-plane" with "lower half-plane" in this statement. If both Izi and Ivl are sufficiently large, we have the Debye asymptotic representations. For example, if z = v sec fJ (v > 0, fJ> 0), we have Ht l . 2)(vsec fJ)  (nvtan fJ/2f1/2 x exp( :t i(v(tan fJ - fJ) - n/4». (20) Ifz=vsecha (v> 0, (1. > 0), we have J, (v sech a) (2nv tanh a)-1/2 expv(tanh a -a), N v (v sech a) (nv lanh a/2)-1/2 exp via - lanh a), If Ivl  Izl, we have tan fJ H(I.2)(vsecfJ)-  j3 xexp( :ti (+ v( tanfJ - tan3 fJ - fJ))) x H\)/)(v/3)tan 3 fJ)+O(v- I ), which is called Watson's formula. F. The Wagner Function As an application of Bessel functions to the theory of nonstationary aircraft wings, T. Theodorsen introduced the function C(z) = H(2)(Z)/(H&2)(Z) + H(2)(Z)) [6J, and H. Wagner considered the function 1 1 2C( -iw) k I (s) = -----: e ws dw 2m Dr W 
171 [5J, where Hb 2 )(z), H\2)(Z) are Hankel func- tions, and SBr means a Bromwich integral giving the inverse Laplace transform. Then C(z) and k,(s) are called the Theodorsen func- tion and Wagner function, respectively. The function k,(s) is equal to the lift coefficient when a 2-dimensional flat wing suddenly proceeds forward a distance s at an angle of incidence l/n, G. Functions Related to Bessel Functions The following functions are closely related to Bessel functions (- Appendix A, Table 19.IV), (I) Modified Bessel functions. I,(z) == e- i ,"/2 lv(e in / 2 z), Kv(z) = L,(z)- Iv(z) 2 sm V7! (2) Kelvin functions. ber ,(z) 1: i beivlz) = l v (e:t3ni/4 z ), her v(z)::!:: i hei,(z) = ml)(e:t 3ni/4 z ), ker v(z) = - (n/2)heiv(z), keiv(z) == (7!/2)her v(z), (3) Struve function. 00 (_ i)m(z/2)"+2m+l Hv(z)= L mO r[m+ (3/2)] r[v+ m + (3/2)J (4) Anger function. 1 I n Jv(z)=- cos(v8 -2 sin 8)d8, n 0 When v is an integer n, we have J,(z)= J,,(z). (5) H. F. Weber function. 1 I n Ev(z)= sin(v8-zsinO)dO, n  The last three functions satisfy certain in- homogeneous Bessel differential equations. Many other functions, such as tAiry's integral, can be represented by Bessel functions. References [IJ G. N. Watson, A treatise on the theory of Bessel functions, Cambridge Univ. Press, 1922. [2J A. Gray and G. B. Mathews, A treatise on Bessel functions and their application to physics, Macmillan, second edition, 1922. [3J R. Weyrich, Die Zyhnderfunktionen und ihre Anwendungen, Teubner, 1937. [4J F. Bowman, Introduction to Bessel func- tions, Longmans-Green, 1938. [5J H. Wagner, Dber die Entstehung des dynamische Auftriebes von Tragfliigeln, Z. Angew. Math. Mech.,5 (1925), 17-35. 40 B Biometrics [6J T. Theodorsen, General theory of aero- dynamic instability and the mechanism of flut- ter, NASA Tech. Rep. 496 (1935). Also........ references to 167 Functions of Conflu- ent Type, 389 Special Functions. 40 (XVIII.1 6) Biometrics A. General Remarks Biometrics is the branch of science that ap- plies mathematical and statistical methods to biological problems, and deals with all the phenomena that affect the physical, social, and psychological well-being of human beings. These phenomena involve the relationships of groups of human beings to other human beings, to animals, microbes, and plants, and to physical and chemical elements in the en- vironment. In dealing with these problems the biometrician encounters such theoretical tasks as analyzing autocorrelated data in time series, and such practical undertakings as cost-vs- benefit evaluations of health programs. Biometrics began in the middle of the 17th century when Sir William Petty and John Graunt developed a new method of analyzing the London Bills of Mortahty. Petty and Graunt essentially invented the field of vital statistics by studying the reports of christen- ings and causes of death and proposed a method called "political arithmetic." Some fields to which biometries is relevant are given below. B. Statistical Genetics After early development in vital statistics, statistical genetics was founded on the new ideas emerging in statistics. Major contri- butions were made by Charles Darwin (1809- 1882), Francis Galton (1822-1911), Karl Pearson (1857- 1936), and Ronald A. Fisher ( 1890-1962). Galton was the first to use the term "regres- sion" in statistics (........ 403 Statistical Models D), when he observed that sons regressed linearly on their fathers with respect to stature. He called the phenomenon a "regression to mediocrity" because the deviations of the stature of sons were less than those of fathers. This gave rise to the measurement of cor- relation in the bivariate normal distribution by means of the coefficient of correlation (Pearson, 1897). Pearson is credited with the 
40 c Biometrics creation of the discipline of biometry (bio- metrics), and he established the journal Bio- metrika to promote studies in the tleld. Fisher's major contributions were to genetics and statistical theories. His General theory of nat- ural selection appeared in 1930. This landmark book, along with earlier and later publica- tions, represented Fisher's attempts to give quantitative form to Darwin's views and to frame a statistical theory of evolution. C. Bioassay Bioassay is a set of techniques for evaluating the effectiveness of dosages of drugs by moni- toring biological responses. It entails the use of special transformations, such as pro bits and logits, as well as the application of regression to the estimation of dosages that are p percent effective within stated confidence limits. Prob- lems to be solved include measuring relative potency, slope-ratio assays, and quanta 1 re- sponses vis-a-vis tolerance distributions. D. Demography Demography, which includes traditional vital statistics, rates and ratios, life tables, com- peting risks, actuarial statistics, and census enumeration techniques, is a part of bio- metrics. In this category, many tabulations of data consist of time series of events or rates c1assitled by age. For the analysis of such data, the cohort analysis techniques described in Hastings and Berry [3J are employed. E. Epidemiology The quantitative description of an epidemic should state the sensitivity and specificity of any diagnostic tests, as well as the true in- cidence or prevalence of the epidemic from survey results. Within an epidemiological theory, a disease is studied by the use of deter- ministic and stochastic models, wherein the theory of Markov chains can be applied (- 260 Markov Chains). Differential equations involving probability-generating functions or tmoment-generating functions can be solved to yield the tprobability distribution, tmean, and tvariance of the number of infected individuals as functions of time [4, 5]. Fundamental to this whole field of application is a clear under- standing of causality and association. When clinical trials are possible, two groups of persons, "treated" and "untreated," are monitored over a period of time with regard to the incidence or recovery from the disease 172 under study. The techniques in this procedure include compiling reports on persons to be observed, using double-blind techniques, and combining multiple-response variables by use of multivariate analysis (- 280 Multivariate Analysis). If clinical intervention is forbidden on ethical grounds, e.g., studying congenital mal- formation by infecting pregnant women with German measles, the relative risk of expo- sure is estimated from retrospective studies. In practice, almost all statistical studies in epidemiology are retrospective with the sole exception of clinical trials. The research is ex post facto because investigators are mostly confined to describing and analyzing sudden and/or obvious events in the etiology of the disease. F. Clinical Trials In clinical trials, many problems have arisen for which biometricians have had to develop special techniques. One such technique takes account of unexpected adverse effects of drugs, and the consequent early termination of a trial. Moreover, when data demonstrate a trend earlier than expected, investigators will desire to end the accession of patients and to stop further treatment with what may be an inferior regimen. This means that the bio- metrician must be familiar with the problems of multiple examinations of data, multiple comparisons, and other adjustment procedures required by the ex post facto dredging of data. G. Future Trends There are two areaS in which the biometrician has played a leading role recently. These are pertinent in many different applications and problems, and considerable methodological research has been devoted to the two areas. The areaS are "mathematical modeling" and "effects of hazardous substances." Mathematical Modeling. The relationship between a set of independent variables and the dependent or response variable(s) is usually referred to as a mathematical model. The model may take the form of a standard multi- ple regression analysis with a single response variable or with multiple response variables as in multivariate analysis (- 280 Multivariate Analysis). It is generally assumed that specialists with substantive knowledge of the specitlc field of applications (epidemiology, toxicology, pharmacology, radiology, genetics, etc.) playa 
173 crucial role in detennining a model or rela- tionship between a set of independent vari- ables and response variable(s). However, it is mainly the biometrician who finally selects the specific model establishing the functional relationship and who attempts to measure the strength or influence of the independent vari- ables therein. The biometrician is also ex- pected to contribute substantially to the deci- sion as to whether the relationship is causal, or merely one of association or correlation. For example, in the measurement of carcinogenic- ity of a food additive or drug, questions arise as to whether a substance can be judged harm- ful if it "accelerates" the appearance of a tumor even though it does not increase the incidence of the abnormal growth. In general, the answer to this question is in the affirmative when an unequivocal dosage-response relationship is indicated between the substance and the tumor. Effects of Hazardous Substances. With the successful conq uest of most of the infectious diseases that have plagued mankind through- out history, health authorities have recently been concentrating on two chronic diseases whose etiology is yet to be determined: cardio- vascular disease and cancer. In both cases there is no disagreement with the thesis that heredity exercises a determining influence, but the role of the environment in causing many cases is also unquestioned. Measurement of the risk due to potentially toxic substances in the environment, principally with respect to these two diseases, represents the greatest chal- lenge to the biometrician today. The social benefits of success make this a tantalizing area of research, though exceptional complexities are involved. For related topics ->. 263 Mathematical Models in Biology. References [IJ D. J. Finney, Statistical method in biolog- ical assay, Griffin, second edition, 1964. [2] D. R. Cox, The analysis of binary data, Methuen, 1970. [3] D. W. Hastings and L. G. Berry (eds.), Cohort analysis: A collection of interdiscipli- nary readings, Scripps Foundation for Re- search in Population Problems, Oxford, Ohio, 1979. [4J N. T. J. Bailey, The mathematical theory of epidemics, Griffin, 1957. [5] M. S. Bartlett, Stochastic population mod- els in ecology and epidemiology, Methuen, 1960. 418 Boltzmann Equation 41 (XX.20) Boltzmann Equation A. Introduction The Boltzmann equation is an equation of motion of a rarefied gas given by L. Boltzmann in 1872 [1]. He used it successfully in his pioneering work on the kinetic theory of gases and on the more general statistical mechan- ics (- 402 Statistical Mechanics), but the equation was held to be inconsistent with the tc1assical mechanics used in its derivation; objections were raised by, e,g" J. Loschmidt (1895) and E. Zermelo (1896). Through the controversies it became widely recognized that the Boltzman equation should be justiiled by means of the ttheory of probability, and many such justifications have been proposed. Also, this equation has been studied extensively as a significant nonlinear partial differential equation. B. Boltzmann Equation Let .r = fit, x, ) be the density of gas molecules having position x E R 3 and velocity ( E R 3 at time t, The Boltzmann equation is a conser- vation law for .r of the form [1- 3J f, = - 'Vxf -a(t, x). Vd + Q[f]. (1) Here a(t, x) denotes the external force and Q is a quadratic nonlinear integral operator in - space describing binary collisions of the mole- cules. The integral kernel of Q, called the colli- sion cross section, depends on the intermole- cular forces. The two classical examples are the hard ball model (a gas of rigid spheres) and the inverse power law potential (one proportional to r- s , where r is the intermolecular distance and s> 1). The latter gives rise to singularities in the kernel, and a cutoff is customarily em- ployed to avoid this difficulty, Grad's hard (s  5) and soft (s < 5) cutoff potentials may be effectively used in the study of (1) [3]. If the gas is contained in a vessel (domain), then f must also satisfy boundary conditions determined by the assumed law of reflection at the walls (boundaries) (specular reflection, random reflection, etc.). For spatially homo- geneous gases with no external forces, (1) re- duces to f,=Q[f], (2) with f = f(t, ). The Maxwellians, those solu- tions satisfying the tMaxwell-Boltzmann dis- tribution law, are the only stationary solutions of (2); they describe equilibrium states. 
41 c B oHzmann Eq uation C. Justification by tbe Tbeory of Probability Consider (2). The collision process is a tMar- kov process, and the tmaster equation ob- tained from the tChapman-Kolmogorov equal- ity is equivalent to (2) under the assump- tion of "propagation of molecular chaos" (G. Uhlenbeck (1942». This assumption becomes valid in the limit as N (total number of mole- cules) -->00 (M. Kac (19551, H. McKean (1967), F. Griinbaum (1971» [4]. Also, non- linear Markov processes defined by (2) (with probability density fiN) have been studied (McKean (1967), H. Tanaka [5J), No such results have yet been established for (1). D. Existence of Solutions T. Carleman [2J gave the first solutions of the Boltzmann equation, solving globally in time the tCauchy problem for (2) for the hard ball model. His result has been extended to a wide class of potentials. The spatially inhomogeneous case (1) is also known to have global solutions if a(t, x) = 0 and if the initial data for f are nearly Max- wellian. This was first proved by S. Ukai [6J assuming periodicity in x, and then proved for the Cauchy problem by Ukai [7J and T. Nishida and K. Jmai [8J, and for the initial boundary problem by J. P. Guiraud [9J (ran- dom reflection) and Y. Shizuta and K. Asano [ IOJ (specular reflection) for bounded domains, and by Ukai and Asano [ 1 1 J for exterior domains, all assuming Grad's hard cutoff potentials. The case of soft cutoff potentials was also solved (Ukai and Asano, R. Caflisch (1980». All the solutions are unique and tend to Maxwellians as t--> 00, with certain decay rates. There are some results also on I-dimensional shock wave solutions (B. Nico- laenko (1975» and stationary solutions (J: P. Guiraud (1972) (bounded domains), Ukai and Asano (1980) (exterior domains». For initial data far from the Maxwellian, (1) remains unsolved even locally in time for hard cutoff potentials. As for noncutoff potentials, the existence theorem in [5J is the only result known so far for either (1) or (2). E. Hilbert Expansion Put f '= fiN with E = I/N. If f' has a power series expansion in 8, then (1) gives an infinite system of equations for the coefficients. D. Hilbert (19 12) proposed a method of solving this system as an application of his theory of integral equations, initiating the attempt to 174 solve the Boltzmann equation. No proofs of convergence exist, but his expansion and the improved one by S. Chapman (1917) and D. Enskog (19 17) made possible the applications of (1) to hydrodynamics. The zeroth-order approximation to the expansion gives rise to the compressible Euler equation, and the first- order approximation yields the compressible Navier-Stokes equation. The solutions f' of the Cauchy problem for (1) with nearly Max- wellian initial data converge to those of the compressible Euler equation locally in t as 8-->0 (Nishida [12J, Caflisch (1980), and to those of the compressible Navier-Stokes equation asymptotically as t --+ 00 when E> 0 is fixed [13]. References [1J L. Boltzmann, Weitere Studien iiber das Wiirmegleichgewicht unter Gasmolekiilen, Wien. Ber., 66 (1872),275-370, [2J T. Carlemann, Problemes mathematiques dans la theorie cinetique des gaz, Almqvist and Wiksell, 1957. [3J H. Grad, Asymptotic theory of the Boltz- mann equation, Rarefied Gas Dynamics, J. A. Laurmann (ed.), Academic Press, 1963, vol. 1. [4J M. Kac, Some probabilistic aspects of the Boltzmann equation, Acta Physica Austriaca, suppl. x (1973),379-400, [5J H. Tanaka, On Markov process corre- sponding to Boltzmann's equation of Maxwel- lian gas, Japan-USSR Symp. on Probability Theory, 2nd, Kyoto, 1972, G. Maruyama and Yu. V. Prokhorov (eds.): Lecture notes in math. 330, Springer, 1973. [6J S. Ukai, On the existence of global solu- tions of mixed problem for the Boltzmann equation, Proc, Japan Acad., 50 (1974),179- 184. [7] S. Ukai, Les solutions globales de I'equa- tion de Boltzmann dans I'espace tout entier, C. R. Acad. Sci. Paris, 282A (1976),317-320, [8J T. Nishida and K. Jmai, Global solutions to the initial value problem for the nonlinear Boltzmann equation, Publ. Res. Inst. Math. Sci., 12 (1976),229-239. [9J J. P. Guiraud, An H theorem for a gas of rigid spheres in a bounded domain, Theories Cinetiques Classiques et Relativistes, Colloq. CNRS, 1975. [IOJ Y. Shizuta and K. Asano, Global solu- tions of the Boltzmann equation in a bounded domain, Proc. Japan Acad., 53A (1977), 3-5. [ 1 1 J S. Ukai and K. Asano, On the initial boundary value problem of the linearized Boltzmann equation in an exterior domain, Proc, Japan Acad., 56A (1980), 12-17. 
175 [ 12J T. Nishida, Fluid dynamical limit of the Boltzmann equation to the level of the com- pressible Euler equation, Comm. Math. Phys., 61 (1978), 119-148. [ 13J S. Kawashima, A. Matsumura, and T. Nishida, On the fluid dynamical approxima- tion to the Boltzmann equation at the level of the Navier-Stokes equation, Comm. Math. Phys., 70 (1979), 97-124. 42 (11.15) Boolean Algebras A. Boolean Algebras Boolean algebra was introduced by G. Boole to study logical operations (- 411 Symbolic Logic). It is now included within the more general concept of tlattice or lattice-ordered set (- 243 Lattices) and appears not only in logic but also very often in analysis in the form of a particular lattice of sets, e.g" the lattice of tmeasurable sets. Let L be a given set and suppose that to any pair of its elements x, y there correspond two elements x U y, x n y of L (called join and meet of x and y, respectively) such that the follow- inglaws arevalid: (1) xUy=yUx, xny=ynx (commutative law); (2) x U (y U z) = (x U y) U z, xn(ynz)=,(xny)nz (associative law); (3) xU (y n x) = (x U y) n x = X (absorption law); (4) x U (ynz)=(xU y)n(xUz), xn(yUz)=(xny)U (x n z) (distributive law); (x. y, ZE L). From (1), (2), and (3) it follows further that x U x = x n x = x (idempotent law). Ifx  y is defined to mean x U y = y, L becomes an tordered set with respect to the ordering .s; Now suppose, moreover, that the following law holds: (5) there exist a least element 0 and a greatest element !, and for any element x there exists an element x' satisfying x U x' = I, x n x' = 0 (law of complementation), Then L is called a Boolean algebra (or tBoolean lattice). In this case x' is uniquely determined by x and is called the complement of x. The binary operations (x, y)-x U y, x n y together with the operation X.... x' are called Boolean operations. These operations obey de Morgan's law (x U y)' == x' ny', (xny)' =x' u y'. B. Generalized Boolean Algebras Suppose that a  b holds for two given ele- ments a, b of an ordered set. Then the set of all elements x satisfying a  X  b is denoted by [a, bi and is called an interval. An interval of a 42D Boolean Algebras Boolean algebra is also a Boolean algebra with respect to the induced operations U and n, where the least and greatest elements are a and b, respectively, and the complement of x in [a, bi is equal to a U (x' n b) =(a U x') n b, More generally, if a set L with two operations U, n satisfying (1)-(4) above has a least element 0 and if each interval of L satistles (5) (i,e., is a Boolean algebra), then L is called a generalized Boolean algebra. C. Boolean Rings A ring L satisfying the condition xx = x for all x E L (i.e., all of its elements are tidempotent) is called a generalized Boolean ring, and if it has a unity element then it is called a Boolean ring. A generalized Boolean ring L satisfies x + x = 0 for all x ELand is necessarily a commutative ring. A (generalized) Boolean algebra L be- comes a (generalized) Boolean ring if for any elements x, y of L the sum x + y is defmed to be the complement of x n y in the interval [O,x U y], and the product xy is del1ned to be x n y. A nonempty subset J of a (generalized) Boolean algebra L is an ideal with respect to the corresponding structure of the ring if and only if xU YEJ for x, YEJ and XnYEJ for xEJ, YEL. More generally, in any lattice, a nonempty subset that satisfies these conditions is sometimes called an ideal of the lattice. D. Representation of a Boolean Algebra Any Boolean algebra L is isomorphic to a Boolean lattice of subsets in a set X. If L is of finite theight, then L is isomorphic to the Boolean lattice (X) of all subsets of X. In general X can be taken to be the set of all maximal ideals of L. Let a ELand let O(a)= {mlmEX, atm}. The isomorphism is obtained by the mapping a-->O(a). If we define a topol- ogy in X such that (O(a) aEL} is the topen base, then X is a compact, totally discon- nected TI space and O(a) is characterized as a compact open set in X. Such a space X is called a Boolean space (M. H. Stone [3,4J), In any complete Boolean algebra L, the complete distributive laws hold: (sup/x;) n y = sup/(x j n y) and its dual. These are equivalent to the stronger relations: (sup/xi)n(suPJY)= suP/Ax i n y) and its dual. In order for a Boolean algebra L to be isomorphic to the Boolean algebra (X) of all subsets of X, it is necessary and sufficient that the following strongest complete distributive laws hold: inf/(suPJ(i)x i ) = sUPF(inf/xi,q>(i») (where F is the set of all functions rp assigning to each i E I a value cp(i)E1(i)) and its dual. 
42 Ref. Boolean Algebras References [IJ P. R. Halmos, Lectures on Boolean alge- bras, Van Nostrand, 1963. [2J R. Sikorski, Boolean algebras, Erg. Math., Springer, second edition, 1964. [3J M. H. Stone, The theory of representations for Boolean algebras, Trans. Amer. Math. Soc., 40 (1936), 37-111. [4J M. H. Stone, Applications of the theory of Boolean rings to general topology, Trans. Amer. Math. Soc., 41 (1937),375-481. [5J G. Boole, CoIlected logical works 1, II, Open Court, 1916. 43 (X1.4) Bounded Functions A. General Remarks A complex-valued function defined on a subset E of the complex z-plane is called a bounded function defined on E if its range f(E) is bounded, that is, if there exists a positive constant M such that If(z)1 (M on E. How- ever, when studying the theory of bounded functions, we usually restrict ourselves to the consideration of tanalytic or tharmonic func- tions. On the other hand, the classes of fUnc- tions f(z) satisfying conditions such as Ref(z) >0 or ex <argf(z) <[3 rather than If(z)IM are studied by a method similar to that ap- plied to the study of bounded functions. tSchwarz's lemma, tLiouvilIe's theorem, and tRiemann's theorem on the removability of singularities (which will be explained later) are among the classical theorems in the theory of bounded functions. B. Maximum Principle When a function f(z) is holomorphic and not constant in a domain D of the complex plane, If(z)1 never attains its maximum in the interior of D, In particular, when f(z) is continuous on the bounded closed domain 15 = D u aD, the maximum of If(z)1 on D is taken on its bound- ary aD. This fact is called the maximum (mod- ulus) principle. As a direct application of the maximum principle, we can deduce Schwarz's lemma: If a holomorphic function f(z) in I z I < R satisfies If(z)IM and f(O) = 0, we have If(z)I(M' Izl/R (lzl<R). The equality at zo, O<lzol<R, occurs only for the functions f(z) = ei)'Mz/R (where A is a real constant). 176 C. LindelOf's Principle E. Lindelof extended Schwarz's lemma and obtained various extensions of the maximum principle, from which he, together with E. Phragmen, deduced several useful theorems on the behavior of a function that is single-valued and holomorphic in a neighborhood of the boundary. We mention some representative theorems: Let z = 4'(() and w = t/!(() both be tmero- morphic and tunivalent functions in 1'1 < 1 that map 1'1 < 1 onto Dz and Dw, respectively. Set 4'(O)==zo and tj;(O)=wo. Let D,(p) and D,(p) denote the images of I (I (p (0 < p < 1) under the mappings 4' and 1/1, respectively. Under these circumstances, if a function f(z) that is holomorphic in Dz satisfies f(Dz) C Dw and f(zo) = w o , then f(Dz!p» C D,(p), Furthermore, unless f(z) maps Dz onto Dw univalently, f(Dz!p») is contained in the interior of D,(p) (an extension of Schwarz's lemma). Let f(z) be analytic in a bounded domain D but not necessarily single-valued. Suppose that If(z)1 is single-valued, Suppose, furthermore, that there is a positive constant M such that, for each boundary point ( oi D, except for a finite number of boundary points and for each e > 0, the inequality If(z)1 < M+E holds on the intersection of D with a suitable neighborhood of (, and suppose also that each of the excep- tional points has a neighborhood such that f(z) is bounded on the intersection of D with this neighborhood. Under these assumptions we have If(z)1 ( M, Moreover, if If(zo)1 = Mat a point Zo of D, then f(z) is a constant (an extension of the maximum principle). Let f(z) be holomorphic in an angular domain W: arg zl < IJ.n/2, Suppose that there is a constant M such that, for each c > 0, each finite boundary point has a neighborhood such that If(z)1 < M + £ on the intersection of D with this neighborhood, and that for some positive number P > rJ. and for sufftciently large Izi the inequality If(z)1 <explzll/ P holds. Under these assumptions we have If(z) (M in D (Phragmen- LindelOf theorem). Let f(z) be a function that is holomorphic and bounded in a closed angular domain w.. rJ. ( arg z.s; /3 except for the point at infinity. Suppose that f(z)-->a as z--> rx; along a side of the angle and that f(z)-->b as z-->oo along the other side of it. Then we have a = b and j(z)-->a uniformly as 2400 in W (LindelOf's asympto- tic value theorem). D. Bounded Functions in a Disk If f(z) is a bounded holomorphic function in the unit disk Izi < 1, it has a limit at every 
177 point Zo on the circle C: Izi = 1, except for a set of 1 -dimensional measure zero, as z tends to Zo from within an angle with vertex at Zo and con- tained in Izi < 1 (or along a tStolz's path at zo) (Fatou's theorem). Under the same assump- tion, if the boundary value function f(e ie ) = limrl_of(rei8), which exists by Fatou's theo- rem, is equal to a constant a for a set of posi- tive measure on the circle C, then f(z) == a in Izl < 1 (F. and M. Riesz theorem). These theo- rems are valid for some kinds of meromorphic functions in Izi < 1 (- 272 Meromorphic Functions D). E. Three-Circle Theorem and Related Theorems Let f(z) be a function that is single-valued, holomorphic, and not identically zero in an annulus P< Izi <R. Set M(r)==maxlzlrlf(z)1 (p < r < R) for f(z). Then log M(r) is a .con- vex function of log r in log p < log, < log R (Hadamard's three-circle theorem). The same assertion holds for a function fez) that is not necessarily single-valued, as long as If(z)1 is single-valued. When f(z) is single-valued, a stronger assertion can be obtained (0. Teich- muller). The following theorems are regarded as the analog of Hadamard's three-circle theorem for the respective basic regions: Set L(O')=suP-oo<t<oo If(H it) I «(X <0' <fJ) for a function that is bounded and regular in a strip a < Re z < fJ. Then log L(O") is a convex function of (J in a < (J < /3 (Doetsch's three-line theorem). Set 1(0') = lim SUPHOO If(O' + it)1 (a < Ii < /3) for a function that is holomorphic and bounded in a half-strip r/, < Re z < /3, Im z > O. Then log I(a) is a convex function of 0' in (X < 0' < /3 (Hardy- Littlewood theorem). Set 1 i 2n Ip(r) =- 2 If(rei8Wd8 n 10 for a holomorphic function in a disk Izi <R, Then for every p > 0, log 1,(') IS an increasing convex function of logr for -00 < logr < log R (Hardy's theorem). F. Hardy Class Hardy's theorem motivates us to introduce a class of functions. An analytic function f in the unit disk is said to belong to the Hardy class W (0<17< 00) if I p (r,f)=(2n)-IWlf(re ie Wd8 remains bounded as r--> 1. For the case p= 00, IE Hoo if loo(rJ)=maxlzl,lf(z)1 is bounded. 43F Bounded Functions An analytic function fis said to be of the class N if JJnlog+ If(reie)l dO is bounded for r< 1. f EN if and only if f = q;/tjJ, where q;,tjJEHd.!, fEW if and only if Ifl P has a tharmonic majo- rant. If 1  P  00, HP is a tBanach space with the norm Ilfllp=(sup'<1 Ip(rJ))I/P for p< 00 and Ilflloo == SUP'<I loo(rJ) for p = 00, and if 0 < p < 1, HP is a complete metric space with the metric SUP'<I Ip(rJ -g), For O<p<q< 00 the inclusion relation N ::J HP ::J Hq ::J H oo holds, so that Fatou's theorem and that of F. and M. Riesz are valid for functions of HP (p> 0). For a nonnegative integer p and a sequence of nonzero complex numbers a" lanl < 1, the infinite product B(z) == zPII1 (iXn(a n - z)/Ianl (1 iXnz)) converges locally uniformly in the unit disk if and only ifLI(I-lanl)<oo. Then B(z) is called a Blaschke product and {a,} a Blaschke sequence. If the sequence {a,} U {O} is the set of zeros of an analytic function f (an m- tuple zero appears m times in {an}, and z = 0 is a p-tuple zero), B(z) is said to be generated by the zeros of f If fE N, then l:dl -1(Xnl)< 00 for the set {a,} of zeros off: An analytic func- tion f in the unit disk is called an inner func- tion if its modulus is less than 1 and its non- tangential limit on the unit circle is of modulus 1 almost everywhere. A Blaschke product is an inner function. An inner function f without zeros is said to be singular if f(O) > O. A sin- gular inner function f can be represented by fez) == exp{ - W(e ie + z)/(e ie - z)dJl(O)} with a positive measure J1 tsingular with respect to Lebesgue measure. An outer function F(z) for HP (p > 0) is an analytic function that can be represented by F(z) = eirexr {  f 2n P::+7 10gtjJ(O)dO } , 2n Jo e -z where r is a real number and tjJ(O) is a non- negative function such that tjJ(O)E LV and logljJ(8)E L'. An HP-function f (p>O) is fac- torized uniquely as f(z) = B(z)S(z)F(z), where B(z) is a Blaschke product (generated by the zeros off), S(z) a singular inner function, and F(z) is an outer function for Hp (with tjJ(O)= If(e i8 )1) (factorization theorem). Conversely, such a product belongs to HP. The inter- polation proh1em raised by R. C. Buck asks whether or not there exists a sequence of dis- tinct points {zn IZnl< I} such that, given any bounded sequence of complex numbers {w n }, there is a function f E H oo for which f(zn) = W n . The sequence {z,} in the problem is called an interpolating sequence. L. Carleson proved that a necessary and sufficient condition for {z,} to be an interpolating sequence is that there be a (»0 such that IT n #l(zn- z d/(I-z k z n )I;:3<> (k = 1,2,. . . ). An interpolating sequence is a Blaschke sequence, and any sequence {z, II zn -+ I} contains an interpolating subsequence 
43G Bounded Functions tending to the unit circle. An analogous theo- rem is obtained for a function of HP (p:? 1). G. Corona Problem The Hardy space Hoo on the unit disk D is furthermore a commutative tBanach algebra with identity. Let M denote the trnaximal ideal space of H oo . It is a compact Hausdorff space with respect to the tGel'fand topology. To each point ZE D, there correspond a homomor- phism Ipz: q;zCf) =f(z) for f E H oo , and a maxi- mal ideal M z = { f E H oo If(z) = O} as its kernel. This correspondence gives a continuous injec- tion of D into the maximal ideal space M of H OO . By identifying z and M z , D is regarded as an open subset of M, and the corona problem then asks: Are there points of M outside the closure of D? The negative answer was given by L. Carleson, that is, D is dense in A4 (corona theorem). The corona theorem is obtained as a direct consequence of the following theorem and is equivalent to it. For functions /;E H oo , i = 1,2, .' , /1, satisfying L 1j;(z)1 > bin D with some b > 0, there exist giE H oo , i = 1, ..., n, such that I.fi(z)gi(Z) = 1 in D. As a trivial conse- quence of the corona theorem, we have the following cluster value theorem: Let C(f, () denote the tcluster set of f E H oo at ( with 1(1 = 1; then C(f, O=/(M,), where! is the Gel'fand transform off and M, is the tfiber of Mover (. M is decomposed into pairwise disjoint tGleason parts. A Gleason part is a connected open set or a singleton according as each point of it is captured in the closure of an interpolating sequence or not. Each point of the tShiiov boundary r of M forms by itself a trivial Gleason part, and r is tadherent to a sequence S in D with the property that the set of its nontangential limit points covers the unit circle (Brown-Shield-Zeller theorem). Let D be, in turn, a general bounded domain. The corona theorem is proved for some classes of such domains, A Shilov boundary r lies only over the set of points on iJD which are not tremovable for bounded analytic functions. It is adherent to a sequence S in D on which sups I f(z) I = Ilfll 00' For any fE [[00 and' E iJD, the boundary of!(M,) is contained in/(r() if r( = r n M( is not empty. This, together with the cluster value theorem, has an important im- plication in the theory of cluster sets. The ex- tremal problem of maximizing f'(zo)1, ZoE D, among all f E H 00 satisfying II f II 00  1 has the unique normalized solution G such that G'(zo) = max II'(zo)l. G(z) is called the Ah1fors func- tion. The Ahlfors function has unit modulus on the Shilov boundary (Fisher's theorem). The theory of H m is also applied to the problem of 178 trational approximation of bounded analytic functions. H. Applications of the Maximum Principle Theorems of the following type are useful for some problems of holomorphic functions: Let D be a domain, Suppose that there exists an arc of angular measure lI. that is on a circle of radius R centered at a point Zo of D and not contained in D. Let C denote the intersection of the boundary of D with the disk Iz - zo\ < R. If f(z) is a single-valued holorn- orphic function that satisfies f(z)1  M, and if Iim sUPz, f(z)l.s; M for everY(E C, then the inequality f(zo)l.s; MI-I/nml/n holds for every positive integer n satisfying 2n/n  (j, (LindelOf's theorem). Let D be a domain bounded by two seg- ments OA, OB both starting from 0 and mak- ing an angle nex, and a Jordan arc iB, and let R be the maximal distance between 0 and the points on AB. Suppose that f(z) is holo- morphic in D and that lim sup I f(z) I as z--> (E 3D with zED is not greater than Mfor (E OA U OB and m for (EAR Then we have If(z)I.s;M I -A m A (where A=(lzI/R)I/') at every point on the bisector of the angle L AOB in D (Carlemann's theorem). I. Holomorphic Functions with Positive Real Parts Holomorphic functions with positive real parts are intimately connected with bounded func- tions. Concerning these functions we have the following classical result, which is equivalent to Schwarz's lemma: If f(z) is holomorphic in z 1 < R, Re f (z) :? 0 in the same domain, and f(O)== 1, then (R-lzl)/(R + Izl).s; Ref(z).s;(R + Iz\)/(R -lzl) (izi < R). The right or left inequality becomes equality for some zo, 0 < Izol < R, only if f(z)=(Rz o fZoz)/(R :tzoz), respectively. In order to prove various results for the class of functions with positive real part, Herg10tz's integral representation, which is based on Poisson's integral representation and unique to this class, can be used effectively. It is given by f 2" e i <P+ z f (z = dp(q;), Izl<R, , 0 e'l'-z where p(q;) is monotone increasing (real- valued) with total variation 1 and is deter- mined uniquely up to an additive constant by f(z). An analogous integral representation is introduced for a holomorphic function in an annulus. Similar to the case of holomorphic functions is the notion of Hardy classes to harmonic 
179 functions, that is, a harmonic function u in the unit disk is said to be of the class h P (0 < p  00) if Ip(r, u) is bounded for r < 1. u E h P if and only if u + iVE HP, where v is conjugate to u, hi is equal to the vector space of functions which are representable as a difference of two non- negative harmonic functions. The Herglotz theorem stated above is then a corollary of the theorem of integral representation for hl_ functions. J. Coefficient Problems There are many classical results for partial sums and coefficients of the Taylor expan- sion of bounded functions in a disk. Let f(z) = Lo cnz" be the Taylor expansion of a bounded function in Izi < 1. Set its partial sum s"(z)=L=ocvzV(n=O, 1,. ..). and let t,(z)= (l/(n+ I))LoSv(z) (n=O, 1,...), which is the sequence of the arithmetic means of the par- tial sums (the Fejer sums). Then If(z) I  1 in Izi < 1 if and only if Itn(z)1 ,;; 1 for Izi = 1 (n = 0, 1, ) (L. Fejer). Thus the sequences {t,(z)} for bounded functions f(z) are uniformly bounded, whereas the sequences {s,,(z)} are no! uniformly bounded. Indeed the maximum value of ISn(l)1 over the set of function f satis- fying II(z)l,;; 1 for Izl < 1 is n ( _1/2 ) 2 G n = 1+ L . j=1 ) =I ( 1.3:.....(2:1) ) 2 ]=0 2 4 ... 2) (Gnn-llognas n-->oo). The following result is decisive for coeffi- cient problems: Set h, = L'tOCrjC,-j (/1';; v), h, = h, for f(z) = L:=o CnZn, let m be the maximal eigenvalue (a nonnegative real num- ber) of the Hermitian matrix ( - hv):, FO' and let m =lim m n ( ? 0). Let H be a Hermitian form in inl1nitely many variables given by 00 00 H_2,\,- '\' h - -m L. xvxv- L. VXXV v=O ,v=O 00 00 I 00 1 2 =m2,0 I X vI 2 - o vo C,Xh Then a necessary and sufficient condition for If(z)1 ( 1 when Izi < 1 is that Hbe positive semidetlnite, i,e., the sequence of the tprincipal minor determinants of H, ,1 ( 1 2 ... m ) _ 1 2 ' m-l,2 ,..., ... m is all positive or positive for an initial fmite number of them and zero for the remainder (I, Schur). 43K Bounded Functions A corresponding result for functions with positive real part in a disk can be stated in a ;impler form: A holomorphic function f(z) = 1/2+ LI cnz" in Izl < 1 satisfies Ref(z);?:O if llld only if [ 1 e I . . . en 'i 1 c n -! ;?: 0, n = 1,2, ". C n cn-I .. 1 (C. Caratheodory). Furthermore, for n = 1,2, , when we regard (e" . , c,) as a point of complex n-dimensional Euclidean space, we can determine the domain of existence of points satisfying this criterion by Caratheo- dory. This result is generalized for coefficients of the Laurent expansion of a function that is holomorphic and single-valued in an annulus. Next, there are some results for a function that omits two values: If f(z) = a, + a, z + ... (a l # 0) is holomorphic in Izl < Rand f(z) #0, 1 in the same domain Izl <R, then there exists a constant L(ao, a,), depending only on a, and a" such that R,;; L(a o , ad (Laudau's tbeorem). Under these circum- stances, If(z)IS(ao,lJ) in IzlIJR forO<IJ< 1, where S(a" 0) is a constant depending only on a, and e (Scbottoky's tbeorem). These theo- rems have applications in value distribution theory. On the other hand, coefficient prob- lems have been investigated as extremal prob- lems in HP-spaces under general conditions. K. Angular Derivative Let f(z) be holomorphic in Izl < 1. If f(z)-->w o uniformly as z-->zo along Stolz paths with end point at Zo and if the limit limzzo«(f(z) - wo)/(z - zo» =oD exists, we call D the angular deriva- tive of j"(z) at zoo In the case of the half-plane Re z > 0, the angular derivative at the point Zo on the imaginary axis is similarly defined. It should be noted that f(z) - W o is replaced by l/I(z) for Wo = 00 and l/(z - zo) by z for Zo = 00. In the latter case, a Stolz path is a path con- tained in an angular domain 1 argzl ,;; Cl ( < n12) and tending to 00, The study of angular de- rivatives was initiated by G. Julia (1920) and J. Wolff (1926) and was further advanced by Caratheodory (1929) and E. Landau and G. Varilon (1929). A fundamental theorem for angular deriva- tives can be stated as follows: If a holomorphic function f(z) in Re z > 0 satisfies Re f(z) ;?: 0, there exists a constant c (0 ( c < +00) such that f(z)/z-->c and .f'(z)-->c uniformly as z......oo along every Stolz path. Moreover, the pth derivative of f(z) for an arbitrary positive p, denoted by DPf(z), has the property that ZP-I DPf(z)......c/f(2 - p) uniformly. Further- 
43 Ref. Bounded Functions more, the inequality Re f(z)  c Re z holds everywhere in Re z > O. An analogous theorem is valid for the unit disk. An important problem in the theory of conformal mapping is to find some condition for a mapping w = l(z) of the unit disk (or half- plane) G onto a simply connected domain B to have a nonzero and finite angular derivative at a boundary point zo, that is, the condition for conformality at the boundary point. Cara- theodory showed that a sufftcient condition is the existence of two circles that are mutually inscribed and circumscribed at the boundary point W o '= 1(zo) 1= 00 of B and that lie inside and outside of B, respectively. Ahlfors later established a necessary and sufftcient condi- tion for the existence of the angular derivative by making use of his tdistortion theorem for a strip domain, The angular derivative was used by Wolff in his research on the iteration of conformal mappings. References [IJ L. Bieberbach, Lehrbuch der Funktionen- theorie II, Teubner, 1931 (Johnson Reprint Co., 1969). [2J C. Caratheodory, Funktionentheorie 1, II, Birkhiiuser, Basel, 1950; English translation, Theory of functions, Chelsea, I, 1958; II, 1960. [3J E. G. H. Landau, Darstellung und Be- griindung einiger neuerer Ergebnisse der Funktionentheorie, Springer, 1929. [4J J. E. Littlewood, Lectures on the theory of functions I, Clarendon Press, 1944. [5J P. L. Duren, Theory of HP-spaces, Aca- demic Press, 1970. [6J R. Nevanlinna, Eindeutige Analytische Funktionen, Springer, second edition, 1953; English translation, Analytic functions, Springer, 1970. [7J L. Carleson, Interpolations by bounded analytic functions and the corona problem, Ann. Math., (2) 76 (1962). [S] H. S. Shapiro and A. L. Shield, On some interpolation problem for analytic functions, Amer. J. Math., 83 (1961). [9J K. Hoffman, Banach spaces of analytic functions, Prentice-Hall, 1962. [IOJ K. Hoffman, Bounded analytic functions and Gleason parts, Ann. Math., (2) 86 (1967). [11J M. L. Weiss, Cluster sets of bounded analytic functions from a Banach algebraic viewpoint, Ann. Acad. Sci. Fenn., 367 (1965). [12J T. W. Gamelin, Lectures on H"(D), Univ. Nacional de la Plata, 1972. [13J T. W. Gamelin, The algebra of bounded analytic functions, Bull. Amer. Math. Soc., 79 (1973). 180 44 (XVII.1 0) Branching Processes A. General Remarks A brancbing process is a mathematical model for random motion of a family of particles each of which is in an isolated process of multi- plication and death. Examples of such ran- dom motions are population growth, miosis of genes, growth of the numbers of neutrons in an atomic chain reaction, and cascade showers of cosmic rays. The simplest and most fundamen- tal branching process is discussed in Section B. For a historical introduction to the study of branching processes -->. Kendall [2J and Harris [3]. B. Galton-Watson Processes Although there is a similar process with COn- tinuous parameter t E R, we consider here only the case of a discrete time parameter (i = 0, 1,2,. ), Suppose that we are given a family of particles of the same kind. Each member of the family splits into several par- ticles according to a given probability law independently of the other members and its own past history. Let Zn be the number of particles of the family at a moment (or gener- ation) n; then {Zn} gives rise to a tMarkov chain, This is called the Galton-Watson pro- cess. A precise mathematical description of the process is as follows: A Galton-Watson pro- cess is a Markov chain{Zn n = 0,1,. ..} on the nonnegative integers with ttransition proba- bility defined in terms of a given tprobability distribution {Pk k = 0, 1, }, Pk ;?: 0, r.o Pk '= 1, by . . { pt if i 1, jO, Pij""P[Zn+l "";IZn=IJ= '" . f '",, 0 '>' 0 U') I I ,J '" , where the probability distribution {pi k = 0, 1, . } denotes the i-fold tconvolution of the probability distribution {Pk k""O, 1, . . . } and Di} is the tKronecker delta. Then Pk is interpreted as the probability that an object existing in the nth generation has k children in the (n + I)th generation. We assume that Zo = 1 a,s. for the rest of this section. An impor- tant tool in the analysis of the process is the tgenerating function f(s) = Lj";O PjS}, Isl ::; 1. Denote the generating function Lo P(Zn = k)Sk (s  1) of Zn by fn(s)- Then we have fo(s) = s, II (s) = f(s), .h+j(s) = /;(Jj(s)) (i,j = 0, 1,2,.), and the texpectation of Zn is given by E(Zn) = mn, where m= 1'( 1) = £(ZI) is the expecta- tion of Z,. A Galton-Watson process is said to be sub critical, critical, or supercritical if m< 1, = 1, or> 1, respectively. Ifwe have Zn = 
181 o for some n, then Zn+1 = Zn+2 = . . = O. The probability q = P (lim, In = 0) is called the extinction probability. The case fds) == s is excluded in the following. The extinction pro- bability q of the process {Zn} is the smallest nonnegative solution of the equation s == f(s), It is 1 ifm 1 and < I ifm> 1. Moreover, P(limnZ n = 00)== 1 -q when m> 1, and hence the process {Zn} is ttransient. If we put fv" = Zn/mn when m < 00, { W n } gives rise to a tmartingale, and the limit W= Iimnoo fv" exists with probability 1. If m> 1, then the tmoment-generating function <p(s) == E(exp( -sW)) of W satisfies the Konigs- Schroder equation <p(ms) == f( <p(S)), Re s  O. If m (I, then P(Zn = 0 for some n) = 1. The fol- lowing theorem on the conditional distribu- tion of Zn, given that Zn t 0, was first proved by Yaglom [6J under moment restrictions. If m < 1. there exist  = Jim, P(Zn = k Zn of- 0), k = 1,2 ,.." Then {b k I k = 1,2 ,... }isaprobability distribution and its generating function g(s)= LI bks k , Isl ( 1, is the unique solution of the equation g(f(s))==mg(s) + 1 -m, Isl (1. among generating functions vanishing at O. Furthermore, when m < 1, Lj";l jb j < 00 if and only if L)I pijlogj) < 00 (i.e" E[ZllogZI] < 00). If m = 1 and a 2 = variance of Z 1< 00, then limnP[Zn/n> ul Zn to] =exp[ -2uj0"2J, uO. If 0"2 = 00, then this result is still correct with the interpretation that the limit in the left- hand side equals 1. For further details regarding the theory of Galton-Watson processes -' [3, S]. C. Multitype Galton-Watson Processes These are generalizations of the Galton- Watson process, involving k types of particles (k  2), say, T I , T 2 " 1;,. Let pl(rl' '2, ., r k ) be the probability that a particle of type Ii splits into 'm particles of type Tm (m = 1,2,. . , k), and set fl(SI' S2,' , Sk) = L,pl(r l , r 2 ,. . , rk)sls;2 . . s? The number of particles Zn = (Z , Z; , ..' ,Z;) at a moment n gives rise to a tMarkov chain over Z, and its transition probability is determined by pl(r!, r 2 , , ,d as for Galton- Watson processes. Here Z\ denotes the space of all k-dimensional tlattice points whose components are nonnegative integers. We now define a mathematical model of the process mentioned above. A Markov chain {Zn I n =0,1,.., } on Zk+ is called a multi (k)- type Galton-Watson process if its transition probability is given by P( j) = P[Zn+! = j Zn = i] = coefficient of k k IT h' n (f l ( ) i, Sl III SI,S2,...,Sk II 1=1 44D Branching Processes for i =(il, i 2 ,.... i k ), j==(jl,j2, .... jk)EZ k +, n= 0,1,... and S=(SI,S2' ""Sk)ED k , where D k = {S==(SI,S2' ,..,sk)ERkllsll 1,1= 1,2, ...,k}, When m ij =al(1" l)jJs j < XJ (i,j== 1,2, . , k), the tconditional expectation of Zn+m, given Zm, is given as E[Zm+n ZmJ = ZmMn, where M is the matrix (m ij ), From now on we assume that m ij < 00 (i. j = 1,2, . , k). A multi- type Galton-Watson process {Zn n = 0, 1, . } is said to be singular if the generating func- tions F(s!, s2,...,sk), 1=1,2 ,.." k,arealllinear in s1> S2" Sk with no constant terms. We assume non singularity throughout. If there is a positive integer IV such that every compo- nent of M N is positive, the process In is said to be positively regular. In this case, M has a positive teigenvalue ..1. that is simple, and ..1.> 1J11 for all other eigenvalues J1. The eigenvalue Ie plays the role of m in the case of the Galton- Watson process. For every i, 1';; i';; k, set qi = P[ln=O for some nIZo=eJ, where ei=(el,ef, "" en, e{ = ij' Then qi is called the extinction probability of the process {Zn n = 0, 1, . } starting with a single particle of type 1;; let q =(ql,q2,.. ..qk). Suppose that the process { Zn n = 0, I, . . . } is positively regular and not singular. Then q==(l, I, . . . . I) if A I, and 0';; qi < 1 (i = 1,2, ..., k) if ..1. > 1, and q satisfies the equation q = (fl (q)J2(q), . Jk(q)). Further- more, the process {Zn n = 0, I,. . . } is ttran- sient, i,e" P[Zn = j inllnitely often] = 0 for any j i- 0, j E Z\ Moreover, if ). > 1, then fv" = Zn/An converges with probability 1, and limZn/An=v Was., n....ao where v denotes the positive left teigenvector v = (VI' . , v k ) for A of the matrix M, and W is a nonnegative trandom variable. P[ W> OJ > o holds if and only if E [ Z(i! lo g Z(i! J < 00 forall l "ci j '"ck ( 1) l;!; -....:::::, -....::::: . Here Z denotes the number of type 1) par- ticles in the first generation for a process with Zo = e;, 1 ,;; i (k. If condition (1) holds, the tmoment-generating functions of W, <Pi(Cl) = E[exp[ -ClW] I Zo =e;], i= 1,2,.., ,k, ex >0, satisfy <Pi(Cl) = fi(<pdCl/A), <P2(Cl/A),. , <Pk(Cl/A», l.s; i.s;k, When A (1, a theorem analogous to Ya- glom's holds. For further information -' Joffe and Spitzer (J, Math. Anal. Appl. 19 (1967» and Athreya and Ney [S]. D. Markov Branching Processes The branching processes dealt with in Sections Band C are limited in the sense that gener- ation times are Ilxed. We now formulate a continuous-time version of branching pro- cesses. The treatment in this section is limited 
44E Branching Processes to Markov processes that are extensions to continuous time of the Galton-Watson pro- cess. A Markov process {X t , P;} on Z  is called a Markov branching process if the tran- sition probability P(t, i,j)== Pi[X,==jJ (i,jEZ, t  0) satisl1es Jo P(t, i,j)sJ =Co Ft, l,j)SJ} for all iEZ and Isl  1. Then there exist a positive constant a and a probability distribution { Pk; k = 0,2,3,. } such that lim'jo( I-P(t,i,i))jt=iaifiEZand Iimtjo P(t, i,j)jt = iapj_i+! if j i-I, j # i. The +Kolmogorov equations are d dt P(t, i,j) = j+! -jaP(t,i,j)+a )' Ipj_I+IP(t,i,l) l=l:t;'j (t[orward equation) and d dt P(t, i,j)= - iaP(t, i,j) 00 + ia L PH+! P(t, l,j) l=i-l,l#i (tbackward equation). Then a and {Pk k = 0,2,3, . } are interpreted as follows. An object existing at t has a proba- bility adt of dying in the time interval [t, t + dt) of length di. If it dies at any time t, the proba- bilities are Po, P2, P3' . that it is replaced by 0,2,3,. objects. When Lk'=o P(t, i, k) < 1, the number of particles attains + 00 in a finite time interval with positive probability. In order that Lo P(t, i, k) = 1 for all i E Z, it is necessary and suft'icient that for each 1 > F. > 0, Ii -,(f(u) - u) -I du =XJ, where f(s)= Ljo.H I pJsj for Isl.s; 1. Most of the theory of the Galton-Watson process carries Over to the continuous case. In particular, concerning the tlimit distribution of X" more precise results have been obtained. (For detailed discussion of Markov branching processes  [3,8].) Some work has also been done on the asymptotic behavior for temporally inhomo- geneous cases of multi type Markov branching processes, E. Branching Markov Processes We now give a systematic treatment of gen- eral branching processes in which an object is characterized by a parameter x in a tcompact tHausdorff space S with a countable open tbase, The results are formulated in terms of strong Markov processes. Set SO = {a}, where 182 a is an extra point. For every positive integer n, let sn = S X S x ,,' x Sj ,where  isthe  n tequivalence relation given by the tpermu- tations of coordinates. S" is compact with respect to the tquotient topology, and its ttopological direct sum S = Lo sn is tlocally compact. Let S = s U {A} be the one-point tcompactification of S. A point XES is de- noted by x=[X I ,X 2 , ...,xnJ if X is the tequiv- ale nee class containing (XI' X 2 ,. . , XnES"). X = [x] is denoted simply by x. Set B(S) = {lif a bounded Borel measurable function on S}, B*(S)== (fEB(S) I Ilfll <1},C(S)={flfa bounded continuous function on S}, and C*(S) = C(S) n B*(S). B(S), C(S), . are defined similarly. For fEB*(S), define!EB(S) by !(X) = 1 ifx=o, =I1j=d(x) ifx=[x l ,x 2 , ""XnJE sn, and = 0 if x =A. A tstrong Markov pro- cess X = {X" P,} on S is called a branching Markov process if its semigroup {T,} (- 261 Markov Processes) satisfies .  A TJ(x)==(TJ)ls(X) for every XES and fEB*(S), where for 9 E B(S), gls is the restriction of g on S [10-12]. We set(i) Z,==n ifX,ES n , n=O, 1, ..., 00, where soo = {A}, and (ii) T= inf{ t Z, #Zo}' Z, and r are called the number of particles and the first splitting time, respec- tively. Let X = {x" px} be a strong Markov process on S. We assume that the semigroup {H,] of X is a strongly continuous semigroup on C(S). Let <p, be a nonnegative continuous tadditive functional of X. Let p,(x), n = 0,2, 3,. be a sequence of nonnegative functions in B(S) such that LO,ni" I p,(x) = 1. Con- sider a sequence of stochastic kernels 7!n(x, dy), n = 0,2,3,. . . . on S X sn, i.e., for \'ixed A E !B(sn), 7!n(x, A) is a measurable function of x, and for \'ixed x it is a probability on (sn, !B(sn), where !B(sn) is the ttopological O'-field on S". Set n(x, D) = LO,n;' I Pn(x)7!n(x, D n sn) for DE!B(S), XES, where S(S) is the topological (J- field on S. Then there exists a unique branch- ing Markov process X = {X" P,} on S satisfy- ing the following: (i) The Markov process {X" t < r, Px} on S obtained from X by shorten- ing its tjifetime is equivalent to the Markov process obtained by tkilling X at a rate d<p, (- 261 Markov Processes). (ii) For any DE S(S) and ic>O, Ex[exp[ -ArJ; X,EDIX,_J= Ex[exp[ -ArJIX,_Jn(X,_,D) a,s, on {r< oo} for every XES. (iii) With probability one, X, = A for all t  lim n 100 Cn' where r n denotes the nth splitting time [ 10]. This process X = {X" P,} is referred to as the {Xt, <PI' n}-branching Markov process, Example 1. Let S = {a"a 2 ,... ,a k }; then Scan 
183 be identified with Z"+ and S with 'if; = Zk+ U ( oo}. Therefore a branching Markov process X on S is a Markov process on i! such that its + transition probability P(t, i, j), i, j E Zk+ satisfies L P(t, i,(jl ,j2" ,jk))S]ts1'. s1' jl,h, ,hEZ\ k = Il L P(t, e l . (jl ,J2, ,jk») l=l (jl.h ,)kEZ ) il X S{l S2 ... si k , P(t, 00,00)= 1 fori=(i!,i 2 ,.... ik)EZ k +, S=(SI,S2' ""Sk)ED k , Then the process X is called a multi (Q-type Markov brancbing process. Example 2. Let S = [0, ooJ, k(x) be a non- negative locally integrable function on [0, (0), and p,(x), n = 0, I, , be a sequence of non- negative measurable functions on [0, 00) such that LoPn(x)= 1 and PI(X)=O. We extend k(x) and p,(x) (n #2) as functions on [0, 00 J by setting them to 0 at 00, and set P2( 00) = 1. Define a stochastic kemeln(x, dy) on S x S by Into Pn(X)b uy } n(x, D) = l L8{[oo, CD]} (DnS 2 ) (Dns n ) ifx E [0, 00), if X = 00. Let X = {X" Px} be the uniform motion on S with the semigroup {HI} such that for jEC(S), H,J(x) = f(x +t) if X E [0, 00) and = f( 00) if x = 00. Set rp, = fh k(xJ ds. Then we have a {XI' <p" n}-bran"ching Markov process, and we can it an age-dependent brancbing process. We now discuss some fundamental prop- erties of {x" rp" n}-branching Markov pro- cesses. For the sake of simplicity we assume the following: (i) rp, = H k(x,)ds, kEC +(S) = {fEC(S)lfo}, (ii) Pn(X)EC+(S) and nn(x, D) = bUx,x, ..xJ}(D) "-v--/ for DE!B(sn). Then {T,} is a strongly con- tinuous semigroup on C,(S) = {f E C(S) limxd(x)=O}. Let A be the tinfinitesimal generator of {H,} (- 261 Markov Processes) and £0(A) be the domain of A. If fE2&(A)n C*(S), then u(t,x)=(T'!)IS(x)E£0(A), and it satisfies au/at = Au + k (F('; u) u), u(O +, x) = f(x), where F(x; g)= Jsg(y)n(x, dy) [10]. Let At be the space of all nonnegative tRadon measures on Sand At o be the sub- space of ..It consisting of all probability Radon measures on S endowed with the topology of tweak convergence. Set At = AtU {8}, where b is an extra point and Ji = [0, 00 J x At o. Define 44 Ref. Brancbing Processes a mapping p:,i13(J, Ao)f->p(X, Ao)EAt by p(J, ..1. 0 ) = rico if .!c < oc and = i5 if ), = 00, and define the topology of ,It as the strongest of all the topologies rendering p continuous. Set C++(S)={fEC(S)lf>O}, and for fE'C++(S), det'ine a function <Dj(A) on Jil by <Dr(A) = exp[ -(.!c,f)J if AEAt and =0 if A=b, where (A, f) = Ssf(X)A(dx), Consider a mapping : S- A defined by t/J(x) == 0 if x = a, = L=I b{xd if x=[X l ,X 2 "",X n JES n , and =6 ifx=A. Let At p be the subspace of ,jj consisting of all non- negative integer-valued Radon measures on S, i.e., At p = {tP(x) I x ES}. We denote by '!iJ the space of an right-continuous mappings v: [0, oo)-->At p whose discontinuities are at most of the first kind and such that v, == ,15 for t  s if V s = 6. Consider a branching Markov process {X" Px}, and let Px be the tprob- ability law on Cj) of the stochastic process {V t = t/J(X t ); 0  t < 00 } with X,=x, where J1x = (x). Then the branching property can be rewritten as follows: For every tO, fECH(S) and 111' 112EAtp,EI+,[exp[ -(v,J)]J= E.Eexp[ -(vt>f)JJE,[exp[ -(vt,f)]] ( Silverstein, Z. Wahrscheinlichkeitstheorie und Verw. Gebiete, 9 (1968); for a treatment of branching processes in the framework of tmartingale problems  Holley and Stroock, Publ. Res. Inst. Math. Sci., 14 (1978». There are analogous processes which pre- serve the basic features of the branching property. A Markov process [JI." P} on .it with b as a ttrap is called a continuous-state brancbing process if it satistJes the property that for every tO,fEC++(S), and Jl.l' 112EM, E1 +,[exp[ -(l1t,f)JJ = E.[exp[ -(I1,,f)]] E,[exp[ -(I1,,f)]]. This concept was intro- duced by Jiiina [ 15J for some special cases. For further information ->. Jifina (Third Prague Con!, 1964), Lamperti (Bull. Amer. Math. Soc., 73 (1967)), Watanabe (J, Math. Kyoto Univ., 8 (1968»), and Fujimagari and Motoo (K6dai Math. Sem. Rep., 23 (1971». References [ lJ F. Galton and H. W. Watson, On the prob- ability of the extinction of families, J. Roy. Anthropol. Inst., 4 (1874), 138-144. [2J D. Kendall, Branching processes since 1873, J. London Math. Soc., 41 (1966),385- 406, [3J T. E. Harris, The theory of branching processes, Springer, 1963. [4J M. S. Bartlett, An introduction to stochas- tic processes, Cambridge Univ. Press, 1966. [5J A. N. Kolmogorov and N. A. Dmitriev, Branching random processes (in Russian), Dok\. Akad. Nauk SSSR, (N.S.) 56 (1947),5-8. 
45 A Brownian Motion [6J A. M. Yaglom, Some limiting theorems of branching random processes (in Russian), Dok\. Akad. Nauk SSSR, (N.S.) 56 (1947), 795- 798, [7J H. Kesten and B. P. Stigum, A limit theo- rem for multidimensional Galton-Watson pro- cesses, Ann. Math. Statist., 37 (1966),1211- 1223, [8J K. B. Athreya and P. E. Ney, Branching processes, Springer, 1972. [9J B. A. Sevast'yanov, Branching processes (in Russian), Nauka, 1971. [10J N. Ikeda, M. Nagasawa, and S. Wata- nabe, Branching Markov processes I, II, III, J. Math. Kyoto Univ., 8 (1968),233-278, 365- 410; 9 (1969),95-160; 11 (1971), 195-196. [ IIJ J. E. Moyal, The general theory of sto- chastic population processes, Acta Math., 108 (1962),1-31. [ 12J A. V. Skorokhod, Branching diffusion processes, Theory of Prob. Appl., 9 (1964), 445-449, (Original in Russian, 1964.) [ 13J T. W. Mullikin, Limiting distributions for critical multitype branching processes with discrete time, Trans. Amer. Math. Soc., 106 (1963), 469-494. [ 14J S. Watanabe, Limit theorem for a class of branching processes, Markov Processes and Potential Theory, J. Chover (ed.), Wiley, 1967, 205-232, [ 15J M. Jiiina, Stochastic branching processes with continuous state space, Czech. Math. J., 8 (1958),292-313. 45 (XVII. 7) Brownian Motion A. General Remarks R. Brown, an English botanist, observed in 1827 that the minute particles comprising the pollen of plants, when suspended in water, exhibit peculiarly erratic movements [ 1]. The physical explanation of this phenomenon is that haphazard impulses are given to the sus- pended particles by collisions with molecules of the fluid. Let X(t) be the x-coordinate of a particle at time t. Then X(t) is treated as a trandom variable, and the distribution of X(t) - X( s) is a tnormal distribution N (0, D I t - s I), with t mean 0 and tvariance D t - S I, where D is a positive constant. To be more exact, such a family of random variables {X(t)} is now considered as the family of random variables determining a tstochastic process. Various aspects of the theory were analyzed by A. Einstein [2J, L. Bachelier, N. Wiener, P. Levy, and others. 184 B. Wiener Processes Let T be the real line R I or a subinterval. A tstochastic process {X(t)} '" defined on a tprobability space (Q, , P) is called a Wiener process on R d if it satisfies the following three conditions: (1) X(t,w)ERd(tE T,wEQ), (2) The Rd-valued random variables X(t l ), X(tj)- X(t j _,), ] = 2,3, , n, are tindependent for any t 1 < t 2 < ... < tn' tjE T (j = 1,2, ... ,n), where n is an arbitrary positive integer. (3) If Xi(t) is the ith component of the vector X(t), then the {Xi(t) }tET (1 :( i:( d) are independent as sto- chastic processes and every increment Xi(t) - Xi(s) is normally distributed with mean 0 and variance t - sl. A Wiener process is also called a Brownian motion. A Wiener process is a ttemporary homogeneous additive pro- cess. A tseparable Wiener process has continu- ous paths with probability 1. Conversely, if {X(t)}'€1 is a temporary homogeneous addi- tive process on R I whose tsample function is continuous with probability 1 and the incre- ment X(t) - X(s) has mean 0 and variance t - sl, then it is a Wiener process (- 407 Sto- chastic Processes, 406 Stochastic Differential Equations). Let {.?;} t  0 be an tincreasing family of a-subfields of 23. We assume that {g;} is right continuous. Ad-dimensional continuous process X=(X(t»"", is called a d-dimensional {g;;}-Brownianmotion if it is t { S<;}-adapted and satisfies E[exp[i<, X(t)- X(s» J I9;J =exp[ -(t-s)I12/2J a.s. for every  E R d , O.s; s < t, where I  1 denotes the tnorm of  E R d [9]. Then X satisfies conditions (1)-(3) mentioned above, and hence it is a Brownian motion on R d (- 406 Stochastic Differential Equations). Let {Xk(w)}(k=O, 1,...) be a sequence of independent random variables defined on a probability space (Q, , P) such that each X k has the normal distribution N(O, 1) with mean o and variance 1. Then the series t /; 00 sin nt C Xo(w)+ - L Xn(w), tE[O,nJ=T, v n n n1 n converges uniformly in t E T with probability 1, and its limit, denoted by X(t, w), is a Wiener process [3, 10]. Let W d = C( [0, 00)--> R d ) be the space of all continuous functions w: [0, 00)--> R d endowed with the topology of uniform convergence on finite intervals and !B( wdj be the ttopological a-field. Let X=(X(t»"", be a Brownian motion on R d and 11 be the probability law of X(O). The probability law pX of the Brownian motion X on ( W d , ( W d )) is called the d- 
185 dimensional Wiener measure with the initial distribution 11 [9, 11, 12]. For a probability J1 on (R d , !B(R d )) the d-dimensional Wiener mea- sure P with the initial distribution J.1. is a prob- ability on ( W d , ( W d )) characterized by the property that for every 0 = to < t I < < t n and AjE(Rd),j = 1, 2,.., ,n, P[w(tdEA" w(t2)EA 2 , ..., w(tn)EAnJ i f f n ( 1 ) d = ". TI R d AI A.j=1 J 2n(t j -t j -d [ IXj-Xj_112 ] xexp - _ J1(dxo)dxl...dxn, 2(t j tj-d where (Rd) denotes the topological O'-field on Rd C. Brownian Motion as a Diffusion Process In terms of the general framework of tMarkov processes, Brownian motion is a typical exam- ple of a diffusion process ( 115 Diffusion Pro- cesses). Let X = {X,(w), R d , Px} be a continuous Markov process on R d ( 26 1 Markov Pro- cesses) with the ttransition probability P(t. x, B) == L (2ntf'J2 e x  -dl x - yl) dy, t > 0, xER d , BE!B(R d ). For each XE R d , the process {X,(w), px} is a Wiener process in the sense mentioned above. The Markov process X, which is a collection of Wiener processes {X" Px} starting at x, is said to be a d-dimensional Brownian motion. A d-dimensional Brownian motion possesses the tstrong Markov property. Let (fj be the tgen- erator of the tsemigroup T, corresponding to X. A bounded uniformly continuous function f defined on R d belongs to the domain of ffi if its partial derivatives af/ax i and o2f/oXiOXj, i,j = 1,2,. d, are bounded and uniformly con- tinuous. For such a function f we have (fjf(x) = (1/2)N(x), where A is the tLaplacian [8,12]. D. Brownian Motions and Potentials For ex> 0, the function G.(x) detlned by G,(X)=Loo e-"(2nt)-d/2 exp ( -IXI2)dt is said to be the cc-order Green's function. Since Brownian motion is tnonrecurrent for d  3, the limit Go+(x) = Iim'jO G,(x), XE R d , exists for  3, and G,+(x) is equal to K,(x) = (f(d/2 _1)/4n d / 2 )lx -d+2, which is the kernel for the tNewtonian potentials. Brownian motion is trecurrent when d  2 and G,+(x) = +00, xER d In this case, K,(x) is defined by 45E Brownian Motion Ko(x) = Iim'jo( G,(x) - G,(x o )), and Ko(x) = (l/2n)10g(l/lxl), when d=2 and IXol= 1. This is the kernel for the tlogarithmic poten- tials. When d = 1 and Xo = 0, K,(x) = -lxl/2. Using this relationship, we can express many concepts of classical potential theory in an elegant form in probability language. Let X = {X(t), PJ be the d-dimensional Brown- ian motion. Given a setA, set ()A = inf{ t t> 0, X(t)EA), where the infimum over the empty set is understood to be +00. Then O"A is called the thitting time of X for the set A (  261 Markov Processes). For a Green domain (i.e., a domain which is a tGreen space) Din R d (d 2), set 2gD(t, x,y)d;y = PAX(t) Edy, O"CD> t), x, YE D. Then the right-hand side of this equation is the ttransition probability of the Brownian motion on D with the tabsorbing barrier oD, Then GD(x, y) == S gD(t, x, y)dt is tGreen's function of D. If B is a compact sub- set of a Green domain D or an open subset with compact closure jj cD, then the hitting probability p(x) = Px(O"B < O'w) is the equilib- rium potential of B relative to D. Suppose A is an tanalytic subset of R d tBlumenthal's 0-1 law implies that PAO"A = 0) = 1 or O. The point x is said to be regular for A if this probability is 1 and irregular for A otherwise. Let B be a compact subset in R d (d  2). Then tWiener's test ( 120 Dirichlet Problem) states that x is regular or irregular for B according as the following series diverge or converge: 00 I 2 k (d-2)C(B k ), d  3, k=1 00 L kC(B k ), d=2, k=1 where C(Bd is the tNewtonian capacity (the logarithmic capacity relative to a bounded domain when d = 2) of the set Bk = {y 2 -(k+I).s; Iy-xi <r k } nB [12]. Suppose D is a bounded domain in R d (d 2). A point x E aD is regular or irregular for R d - D according as x is regu- lar or irregular in the sense of the tDirichlet problem for D. Given a continuous boundary function f on aD, u(x) = Ex(f(X(O'aD» is the solution of the generalized Dirichlet problem for D. Given xED, the distribution h(x, B)= PAX(O'CD)EB) (BE!B(aD)) used in the solution u(x) = JCDf(y)h(x, dy) of the generalized Diri- chlet problem is the tharmonic measure of aD as viewed from x. E. I-Dimensional Brownian Motion In his monograph [6], P. Levy gave a pro- found description of the fine structure of the individual I-dimensional Brownian path 
45 F Brownian Motion {X(t,w)},;,o' Let us set (Ja = the hitting time to the point a in R', m(t,w)= min X(s,w), ao::;;;:st M(t,w)= max X(s,w), nost Yo(t,w)= IX(t,w)l, { X(t,W), Y I (t, w) = M(t,w)-X(t,w), t < 0"0 (w), t? 0"0 (w), and Y( ) _ { X(t,W), 1 <0'0 (w), 2 t, w - X( ( ( t, w)-m t,W), 1 ?Q'O w). Then we have (1) Po(M(t»a)= 2P o(M(t» a, X(t»a)= 2Po(M(t) >a,X(t) < a)= Po(IX(t)1 >a) (reflection principle of D.Andre), (2) The stochastic process {O"a, O.s; a< 00, Po} is a tone-sided stable process with exponent 1/2, that is, it is additive and homogeneous with the law PO(O"b - O"a.s; t) = Po( O"b-a:<;;; t) I b-a -e -(b-a)2/2" ds =Jo J2ns 3 ' O.s;a<b, tO. (3) Let <p-I(t,W) be the right continuous in- verse function of <p(t,w) = L X[O,oo)(X(s,w))ds, where X[O. 00)( .) is the tindicator function of the interval [0, 00), Set Y 3 (t,w)=X(<p-I(t,W),w). Then the four processes {1';(t, w), O.s; t < 00, Px} (x E [0, 00), 0:<;;; i.s; 3) on [0, 00) have the same probability law. Each of them is a diffusion process with transition probability P(t, x, B) = I  (e-IX-YI2/2'+e-IX.YI2/2')dY, BE!B([O,oo)), t>O, XE[O, 00), and is said to be a Brownian motion on [0, 00) with a treflecting barrier at the origin. (4) As a consequence of (3), if X(O, w) = 0 a,s., then, for fxed t, M(t), -m(t), and 1';(t) (O.s; i.s; 3) have a common distribution. For example, Po(M(t) a)= Po(O"a.s; t)= 2Po(X(t) > a) f2 J r == '>l;t e- x2j2 'dx a>O, Po(X(t) Eda, M(t)Edb) =::  (2b - a)e-(2b-a)2/ 2 ' dadb, I;t3 O.s;a<b. 186 (5) The diffusion process {X(t, w), 0:( t< 0"0' Px} (x E (0, 00)) obtained from a I-dimensional Brownian motion by shortening its tlifetime is called a Brownian motion on (0, 00) with an tabsorbing barrier at the origin, and its tran- sition probability is given by P(t,x,B) = f (e-\x-y\2/2' _e-\x+ y \2/ 2 ')dy, 8JhU BE([O,oo)), 1>0, XE(O,OO). The arcsin law is valid for many functionals of I-dimensional Brownian motion. For example, Po(f: X[O,OO)(X(s,w))ds.s;O )=arcsin t, O<O.s;t, Po(r,(w) .s;s) = (2/n)arc sinvfs7t, 0 < s';;; t, where r,(w)=sup{sIX(s,w)=O, O.s;s:<;;;t}. The visiting set (w); {tIX(t,w)=O} of a Brownian path is a hotally disconnected set. Its tLebesgue measure is 0, and the tHausdorff-Besikovich dimension number of (w) is 1/2. Consider a i-dimensiOnal Wiener process {X (t) }o,.;;t<oo starting at the origin. Let us set B\(t)={X(I/t) t = 0, t>O, and B2(t)==cX(t/C 2 ), tO, (c>O). Then the stochastic processes {BI (t) }o,.;;,<oo and {B 2 (t)}0<:;t<00 are likewise I-dimensional Wiener processes starting at the origin. Hence the properties of the Wiener process starting at the origin in a neighborhood of t = 0 (t = 00) can be obtained from those in a neighborhood of t == 00 (t == 0). F. d-Dimensiona1 Brownian Motion Almost all paths of d-dimensional Brown- ian motion are continuous but are not of tbounded variation on any finite interval. Accordingly, they cannot have lengths. A positive, continuous, increasing function <p defined on [to, 00) with to> 0 is said to belong to tbe upper (lower) class witb respect to local continuity if Po((inf{ t IIX(t, w)1 > Ji <p(l/t), t> OJ) > 0) = 1 (0), Ko1mogorov's test states that <p belongs to the upper class or to the lower class with respect to local continuity according as J: t- I (<p(t))d e -",2(')/2dt 
187 converges or diverges. For example, <p(t)=(210g(21 t +(d+ 2)log(3) t + 210g(4)t + . + 2log(n_l) t + (2+ <5)log(n) t)I/2 belongs to the upper or lower class with re- spect to local continuity according as J > 0 or b.s;O, where 10g(n)t=loglog(n_1)t and 10g(l)t= log t. The law of the iterated logarithm for 1- dimensional Brownian motion, ( . IX(t,w)-X(s,w)1 _ 1) -1 P. Imsup - -, o 1->S J 2It-slloglogl/lt-sl is a special case of this example. For the case d = 1, consider the space-time Brownian motion {(-t,X(t)),P o }. Set D",={(s,x)I-I/to";s";O,  <p( -l/s)";x <oo}. Then 0=(0,0) is a regular point of D", or an irregular point of D", for the space-time Brownian motion accord- ing as cp belongs to the lower or upper class with respect to local continuity. (- 261 Mar- kov Processes). Thus Kolmogorov's test is the Wiener test for space-time Brownian motion. Let t/J be a positive, continuous decreasing function defined on [to, 00) with to > O. Then P o ( (inf{ t IIX(t, w)1 < Jt t/J(1/t), t >O}) > 0) =0 or 1. We have 1 "L -(t/J(t))d-2 dt = (f) or < OC , I o when d  3, and f oc dt - 00 or < Cf) "n tllogt/J(t)1 To describe uniform continuity of a path on the interval [0, IJ, take a positive, continuous increasing function t/J defined on [to, 00) with to> 0 and seUp(t) = Jt 1jJ( 1ft), Then IjJ is said to he10ng to the upper class with respect to uniform continuity if almost all paths X (t, w) (0"; t  1) satisfy the tLipschitz condition rela- tive to <p, that is, for almost all w there exists an E(W) > 0 such that 0 < t - sl <£(w) implies IX(t, w)- X(s, w)1 < rp(lt-sl). And t/J is said to belong to the lower class with respect to uni- form continuity if almost all paths X(t, w) (0  t"; 1) do not satisfy the Lipschitz condition relative to <po Then IjJ belongs to the upper or lower class with respect to uniform continuity according as when d = 2. f ro t/J(t)d+2e-",2(t)/2 dt '0 converges or diverges [ 16]. For example, ljJ(t)=(2logt +(d +4)log(2) t + 2log(3)t + . + 210g(n_l) t +(2 + b)log(n) t)I/2 45 G Brownian Motion belongs to the upper or lower class with re- spect to uniform continuity according as b > 0 or b .s; O. The following theorem on the uni- form continuity of I-dimensional Brownian motion is a special case of this criterion: I \ Po ( limsu p IX(t,w)-X(s,w)1 _- 1 ) = 1. J-,I;:OI ) 21 t -sllog 1/lt - sl \ ' Now we state some other properties of Brownian paths. Let A be a set of zero touter capacity in R d (d 2). Then PAX(t)E A for some t > 0) = 0 for any x E Rd. Let A be a plane set with positive tinner capacity, Then PAX(t) E A for intlnitely many t larger than any given S > 0) = 1 for anyx E R 2 . With probability 1, the Lebesgue measure of the set {X(t, (iI) 10  t < oo} in R d is zero for d 2. With proba- bility 1 this set is everywhere dense in R 2 when d = 2 and nowhere dense in R d when d  3 [ 13, 15, 17]. Almost all 2-dimensional Brownian paths have k-fold multiple points for any integer k  2. In the 3-dimensional case, almost all paths have tdouble points but Cannot have any triple point. In the d{  4)- dimensional case, almost all paths have no double point [18- 20]. G. Ito's Formula and Brownian Local Time The following formula of Ita [21J is of funda- mental importance in the theory of stochastic processes, Ito's formula. Let X={X(t) =(X,(t), X 2 (t), ..., Xd(t)), Px} be a d-dimensional Brownian motion and f(x) = f(xI' x 2 ,. . , x d ) be a C 2 _ function del1ned on R d Let us set <p(t) = f(X(t))- f(X(O)). Then {rp(t)},,,o is a continu- ous tadditive functional of the Brownian mo- tion X (- 261 Markov Processes), and we have f(X(t))- f(X(O)) = i I ( a/ ) (X(s))dXi(s) 1 l ' +- (M)(X(s))ds, tO, 2 0 where the first term in the right-hand side is a tstochastic integral with respect to the Brown- ian motion X [22J (- 406 Stochastic Differ- ential Equations). In particular, if f is a C 3 _ function defined on R d , Ito's formula can be rewritten as f(X(t)) - f(X(O»  it r ( a/)< X(S)OdXi(S)' tO, where the right-hand side is a stochastic inte- gral of the tStratonovich type with respect to 
45H Brownian Motion the Brownian motion X [9J (- 406 Stochastic Differential Equations). Consider a I-dimensional Brownian motion X = (X(t), Px}. By the local time or the sojourn time density of X we mean a family of non- negative random variables {<p(t, x)} (t E [0, 00), xER I ) such that, with probability 1, the fol- lowing holds: (a) the mapping [0, 00) x R I 3(t,X)f-><p(t,X)ER I is continuous, (b) for every Borel subset A ofR! and t  0, f XA(X(S»ds==21 cp(t,x)dx, The notion of the local time of Brownian motion was first introduced by Levy [6]. The family ofrandom variables {cp(t, x)} (t  0, XE R!) defmed by <p(t,x)=(X(t)-x)+ -(X(O)-x)+ _ I I X(x, oo)(X(s))dX(s) ,0 satistles properties (a) and (b) mentioned above and hence it is a local time [9,22]. Here a+ is the bigger of a and O. It is clear that the local time cp( t, x) of X is given by 1 i ' <p(t, x) = Jim- 4 X(x-£ XH)(X(s)ds dO Eo' for every x E R I, t)o O. Furthermore, we have f) ( I . Irp(t,b)-<p(t,a)1 '0 1m sup h-a=o)O,a<h j 15 log 1/15 .s;2 j max <p(t, a» ) = 1, aeRl and Po ( Jim sup I <p(t, 8) - <p(t, 0)1 .s; 2 j <p(t, 0) ) = 1. 0)0 j bloglogl/8 Levy [6] studied the fine structure of the local time. Define the visiting sets !!l+ = {tl Yo(t,w) =O} and X- ={ti Ydt,w)=O}, where Yo(t,w) and Y, (t, w) are the stochastic processes de- fined in Section E above. Then we have Po ( Jim rm x [the number of flat stretches of £)0...)2 M(s,w) (O.s;s.s;t) of length E] =M(t,w), t?O)=1. Since the two diffusion processes X + = {Yo(t),O:;;t< 00, px} and X- =={YI(t),O:;;t< 00, PJ define the same probability law on W( [0, 00», there exists a functional <p+(t, w) of X + corresponding to M(t, w) of X -. Here, 188 W( [0, 00) denotes the space of all continuous functions w: [0, 00 )......[0,00), The flat stretches of the graph of <p +(t, w) are the open intervals X n (n)o 1) such that Ln031 X n == [0, 00) - X+ Then we have ( . fi E Po 11m - x [the number of intervals dO 2 ,q'n C [0, t) of length )0 EJ == <p +(t, w), t)oO ) = 1. Furthermore, we have Po ( lim f!i x [the total length of the inter- dO ...) 2i. vals :!t n C [0, t) of length < E] = I{J +(t, w), t?O) = 1 and Po (IfnI (2E)-1 I X[O,£)(Yo(s,w))ds ==cp+(t,W),t)oO) = 1. Let dolt, w) be the number of times that the reflecting Brownian path Yo(s, w) crosses down from E> 0 to 0 before time t, Then Po ( Jim Ed£(t, w) = <p +(t, w), t ?o ) = 1. dO H. Flows and Random Distributions The flow derived from the I-dimensional Wiener process {X,} -00<1< 00 is tKolmogorov's flow. It has tmixing properties of all orders and is tergodic (- 136 Ergodic Theory, 39.5 Stationary Processes). The tstationary process with independent values at every point corre- sponding to the tcharacteristic functional ex p ( -f:oo <p(t)2 d t ) on the tSchwartz space Y defines the same probability law with the stationary process obtained by differentiation of the Wiener process in the tdistribution sense [23J (- 395 Stationary Processes, 407 Stochastic Processes), I. Generalizations of Brownian Motion In addition to the Brownian motion described above, there are several stochastic processes that are also called Brownian motion. A Gauss- ian system {X (a) LERN defined on a proba- bility space (Q, !B, P) is said to be a Brownian motion witb an N-dimensional time parameter 
189 if(i) E(X(a))=O, (ii) E(X(a)X(b»==1(lal+ Ibl-Ia-bl), (iii) P(X(O)=O)==1. Leta*bethe tspherical inversion of a ERN with respect to the unit sphere. Set X*(a)=laIX(a*) (a#O) and X*(O) = O. Then {X*(a)}aERNdefines the same probability law with a Brownian mo- tion with an N-dimensional time parameter. Almost all paths of a Brownian motion with an N-dimensional time parameter are con- tinuous. A positive, continuous, increasing function r.p defined on [to, 00) with to >0 is said to belong to the upper (lower) class with re- spect to local continuity if the probability that the closure of the set {a IX(a, w)1 > M r.p( Ilia!), lal >O} contains the origin 0 is equal to 0 (1). Then r.p belongs to the upper or lower class with respect to local continuity according as the integral f a 1 _(<p(t))2N-t e-<{J2(')/2 dt , t o converges or diverges. For example, <p(t)=(2log(2)t +(2N + 1)log(3)t + 2log(4) t + + 2log(n_l) t +(2+ c5)log(n) t)I/2 belongs to the upper or lower class with re- spect to local continuity according as b > 0 or c5 :s; O. As a special case, we have P ( Iimsup IX(a, w)1 - I ) == I "o J 21alloglog l/ial . Take a positive, continuous increasing fUnc- tion r.p defined on [to, 00) with to> 0, and set l/1(t) = jt r.p( I/t), If almost all paths X(a, w) (lal:S; 1) satisfy the Lipschitz condition relative to l/1(t), then r.p is said to belong to the upper class with respect to uniform continuity. It is said to belong to the lower class with respect to uniform continuity if, with probability 1, these paths do not satisfy the Lipschitz con- dition relative to l/1(t), Then r.p belongs to the upper or lower class with respect to uniform continuity according as f. oo tN-I(<p(t))4N-te-<{J2(')/2dt (0 converges or diverges (T. Sirao, Nagoya Math J., 17 (1960». For example, (r(t) = (2N1og t + (4N + 1 )log(2) 1 + 2log(31 t +... +21og(n-l)t +(2+ c5)log(n)t)I/2 belongs to the upper or lower class with re- spect to uniform continuity according as b > 0 or b:s; o. As a special case, ( . IX(a.m)-X(h,m)1 P hmsup - la-blO,lal.lbl(1 J 2Nla- bllog Ilia -bl =1)=1 45 Ref. Brownian Motion (For general information about Brownian motion with a multidimensional time para- meter -' P. Levy [6J and H. P. McKean, Theory ofProb. Appl, 8 (1963).) Let X(t) be a Wiener process. L. S. Ornstein and G. E. Uhlenbeck based their investigation of the irregular movements of small particles immersed in a liquid on Langevin's equation dU(t) == - U(t)dt + f3dX (t), where U(t) is the velocity of a particle. The first term on the right-hand side of this tsto- chastic differential equation (- 406 Stochastic Differential Equations) is due to frictional resistance or its analog, which is thought to be proportional to the velocity. The second term represents random external force. The station- ary solution of this equation is given by V(t) = foofJe-'(/-U)dX(U), The stochastic process {U(t)}-oo<,<oo is a stationary tGaussian Markov process with covariance function I(t) = (f32 12ex)e -'Itl. This process is called Ornstein-UWenheck Brownian motion [24J ( 176 Gaussian Processes). Brownian motion has been defined on state spaces that are tRiemannian spaces or +Lie groups, and its properties are being investi- gated ( 5 Additive Processes, 115 Diffusion Processes), References [1] R. Brown, A brief account of microscopical observations made in the months of June, July, and August, 1827, on the particles contained in the pollen of plants; On the general existence of active molecules in organic and inorganic bodies, Philos. Mag., Ann. Philos. New Series, 4 (1828), [2J A. Einstein, Investigations on the theory of the Brownian movement, Methuen, 1926. [3J R. E. A. C. Paley and N. Wiener, Fourier transforms in the complex domain, Amer. Math. Soc. Colloq. Pub\., 1934. [4J N. Wiener, Differential space, J. Math. and Phys., 2 (1923),131-174. [5J P. Levy, Le mouvement brownien plan, Amer. J. Math., 62 (1940),487-550. [6J P. Levy, Processus stochastiques et mouvement brownien, Gauthier-Villars, 1948. [7J J. L. Doob, Stochastic processes, Wiley, 1953, [8J K. Ito, Lectures on stochastic processes, Tata Inst, 1960. [9J N. Ikeda and S. Watanabe, Stochastic differential equations and diffusion processes, North-Holland and Kodansha, 198 1. 
Rcl 100 Brownian Motion [10] K. Ita and M. Nisio, On the convergence of sums of independent Banach space-valued random variables, Osaka J. Math., 5 (1968), 35-48. [ 11] P. Billingsley, Convergence of probability measure, Wiley, 1963. [ 12] K. Ita and H. P. McKean, Jr., Diffusion processes and their sample paths, Springer, 196.5, [ 13] J. L. Doob, Semi-martingales and sub- harmonic functions, Trans. Amer. Math. Soc., 77 (1954), 86-121. [ 14] A. Va. Khinchin (Khintchin) Asymptot- ische Gesetze der WahrscheinIichkeitsrech- nung, Erg. Math., Springer, 1933 (Chelsea, 1948), [ 15] S. Kakutani, Two-dimensional Brownian motion and harmonic functions, Proc. Imp. Acad. Tokyo, 20 (1944),7066714. [16] K. L. Chung, P. Erdos, and T. Sirao, On the Lipschitz's condition for Brownian motion, J. Math. Soc. Japan, 11 (1959), 263-274. [ 17] S. Kakutani, On Brownian motions in n- space, Proc. Imp. Acad. Tokyo, 20 (1944),648- 652, [ 1 8] A. Dvoretsky, P. Erdos, and S. Kakutani, Double points of Brownian motion in n-space, Acta Sci. Math. Szeged., 12 (1950),75-81. [19] A. Dvoretsky, P. Erdos, and S. Kakutani, Multiple points of paths of Brownian motion in the plane, Bull. Res. Council Israel, 3 (1954), 3644371. [20] A. Dvoretsky, P. Erdos, S. Kakutani, and S. J. Taylor, Triple points of Brownian paths in 3-space, Proc. Cambridge Philos. Soc., 53 (1957), 8566862. [21 J K. Ita, On a formula concerning stochas- tic differentials, Nagoya Math. J., 3 (1951),55- 65, [22] H. P. McKean, Stochastic integrals, Academic Press, 1972. [23] T. Hida, Brownian motion, Springer, 1980, [24J G. E. Uhlenbeck and L. S. Ornstein, On the theory of Brownian motion, Phys. Rev., 36 (1930),823-841. 
46 A Calculus of Variations 46 (X.32) Calculus of Variations A. General Remarks One of the first objects of differential calculus was to systematize the theory of the extrema of functions of a I1nite number of indepen- dent variables. In the calculus of variations we consider functiona1s (i.e., real-valued or complex-valued functions del1ned on a func- tion space {u} consisting of functions del1ned on a certain domain B), originally so named by J. Hadamard. Such a functional is denoted by J[u BJ (or simply J[uJ). A function u, which is considered an independent variable of the functional, is called an argument function (or admissible function). Concrete examples of functionals are the length of a curve y = f(x) and the area of a surface z = z(x, y), which are expressed by f Xl L[yJ= J l+(y'(x)fdx Xo and S[zJ = f t J l +z;+z;dxdy, respectively. Furthermore, consider a curve y = y(x) connecting two given points (x" Yo) and (XI, y,) with y, >y,. The time in which a parti- cle slides down without friction from (x" y,,) to (x" Yd along this curve under constant gravity acting in the direction of the positive y-axis is expressed by the functional l[y J = k f XI J (I + y'2)/(y - Yo) dx, -"0 k constant. (1) Let F( . . . ) be a known real-valued function that depends on a certain number of indepen- dent variables and on argument functions of these variables, together with derivatives of these functions up to a certain order. Then a typical problem in the calculus of variations is formulated as an extremal problem of a func- tional that is expressed by the integral with F( . . . ) as the integrand, for example, f Xl J[uJ = F(x, u(x), u'(x), . . , u(m)(x))dx, Xo leu, v] = F(x, y, u(x, y), u x , u y ," I SSIJ v(x, y), v x , v y ,... ) dxdy. For instance, a curve minimizing (1) is the curve of steepest descent. In extremal problems, suitable boundary conditions can be assigned to argument func- 192 tions. On the other hand, there are so-called conditional problems in tbe calculus of vari- ations. A typical example of this sort is the tisoperimetric problem, j,e" the determination of the curve that bounds a domain with maxi- mal area among all curves on a plane with given length. In general, an extremal problem of a functional under a subsidiary condition that the value of another given functional remain I1xed is called a generalized isoperi- metric problem. In addition to these, there are Lagrange's problem, in which a finiteness condition is imposed, and Hilbert's problem, in which a condition consisting of differential equations is imposed. The birth of the calculus of variations was almost simultaneous with that of differential and integral calculus. Johann Bernoulli, Jakob Bernoulli, L. Euler, and others had dealt with several Concrete problems of the calculus of variations when in 1760, J. L. Lagrange intro- duced a general method of dealing with vari- ational problems connected with mechanics. Then an equation bearing the name of Euler or Lagrange was introduced. B. Euler's Equation As an example, consider the simplest vari- ational problem f X 1 J[yJ = F(x, y(x), y'(x))dx Xo min, (2) Let the boundary condition y(xo) = y" y(x l ) =y, be assigned to the argument function y(x). Consider a family of admissible functions Y(x; £) = y(x) + £17 (x), where 17(x) is any I1xed function vanishing at both endpoints and € is a parameter. If y(x) gives the minimum of l[yJ, then the function of B, l[ Y], must attain a minimum for € = O. The condition (al[ YJ /&)", = 0 is written in the form f X 1 0= ( F y 17 + Fy'17')dx Xo = [117(Fy_ :X Fy.)dX by taking into account the boundary con- dition. By making use of the arbitrariness of '1(x), we conclude that d 0= Fy -- Fy' = Fy - Fy'x - y' Fy'y - y" Fy'y' dx (3) holds, in view of the following lemma: Let rp(x) be continuous in [x o , xlJ and 17(X) be a func- tion of class CP that satisfies '1(x o ) = '1'(xo)== . . = 17(q)(xo) = 0, 17(xd=17'(xd =... =17(Q)(xd==O (O.s;q.s;p). If J;IJ(x)<p(x)dx=O holds for any such 17(X), then <p(x) = 0, (Here class CP can be 
193 replaced by class C'" when q is supposed finite. If I1E C"', then q = 00 is admitted.) This is called the fundamental lemma in the calculus of vari- ations. We call (3) the Euler-Lagrange differ- ential equation (or Euler's equation for the extremal problem). Since this equation is of the second order, y(x) can be determined by means of the boundary condition. d The quantity [FJy=Fy- dx Fy' contained in equation (3) is called the variational derivative of F with respect to y. Furthermore, Dy = IJde and <5l = (ol [Y]/&)", de are called the first variations of the argument function y and of the functional 1 [y J, respectively. If l1(x) is not subject to the condition that it must vanish at the endpoints, then the first variation of J becomes f X 1 bJ = [F]y<5y dx + [Fy,<5y ]. Xo In comparison with an ordinary extremal problem f(xI' ,x n ) = min in differential calculus, [FJy and Dl correspond to gradfand df, respectively. In general, a solution of the Euler-Lagrange differential equation [FJy =0 is called a stationary function for the V ari- ational problem, and its graph is called a stationary curve. For a variational problem involving several argument functions, we have only to write the system of Euler-Lagrange differential equa- tions corresponding to them. For a problem I x! J[yJ= F(x,y,y',...,y(m»)dx=min ""0 whose integrand involves derivatives of higher orders of an argument function, the Euler- Lagrange differential equation is m d [FJy== L (-1)"- d Fy(")=O, #=0 x For a problem involving a double integral l[uJ = F(x,y, u, u x , yy)dx dy = min, SSB the equation is a a O=[FJu=Fu-- a Fu -- a Fu' x x Y y For a generalized isoperimetric problem, for example l[yJ = min, K [y] = c, the Euler- Lagrange equation becomes [F +AGJy=O with the so-called Lagrange multiplier A, where F and G denote the integrands of J and 1(, respectively. Two integration constants con- tained in a general solution of this differential equation of the second order and an undeter- 46C Calculus of Variations mined constant A can be determined, for in- stance, by the boundary condition y(x o ) = Yo, y(Xt) = y, and a subsidiary condition K [yI = c. As an example, for the classical proper isoperi- metric problem F = y, G = J1+72, the equa- tion is 1 - A(y' / J1+72)' == 0, which after integration leads to (x - ex)2 + (y _ fJ)2 = ..1. 2 ( 228 Isoperimetric Problems). Besides the case of fixed endpoints, there is a boundary condition, for instance, that an endpoint (XI' y,) of the argument function y = y(x) must lie on a given curve T(x, y) = O. For the case of such a movable endpoint, the extremal function is subject to the condition of transversality , (F - y'Fy') Ty-F/T" =0, X=XI' C. Sufficient Conditions A. M. Legendre introduced the notion of the second variation, corresponding to differential quotients of the second order in differential calculus, in order to discuss sufficient con- ditions. Concerning the simplest problem (2), the inequality Fy'y'(x, Yo (x), y(x»  0 is neces- sary in order for Yo(x) to give the minimum. Conversely, the inequality Fy'y' > 0 and Jacobi's condition (which is stated below) imply that y = Yo(x) gives a weak minimum. Here "weak minimum" means the minimum when a family of admissible functions { y - y, < E, y' - y 1< e} is considered a neighborhood of y,. Jacobi's condition: Let u be a solution of a linear ordinary differential equation of the second order, d (F y . y , : ) - (F yy - d F yy .) u == 0, u(xo) = 0; then the smallest zero of u that is greater than Xo (i.e" the conjugate point of xo) is greater than the right endpoint Xi' K. Weierstrass derived suflIcient conditions for a strong minimum by extending the range of admissible functions to { y - Yo < e}, Re- sults that were obtained until about that time constitute the content of what is usually called the classical theory of the calculns of variations. If for the variations problem (2) there exists a unique curve through every point in a do- main on the xy-plane that belongs to a one- parameter family of stationary curves of the functional I X! J[yJ = F(x, y, y')dx, Xo then the domain is called a field of stationary curves. Let the parameter value of the curve through a point (x, y) in such a family of 
46D Calculus of Variations stationary curves y = <p(x; a) be denoted by rx = ex(x, \) The slope p(x, y) = [rp'(x; ex)J,,(X,y) is called the slope of the Held at the point (x, y) or, by regarding x, y as variables, the slope function of the field. The value of a curvilinear integral Ic==I[yJ = Ie (F(x,y,p)-(y'-p)Fy,(x,y,p))dx is then determined and depends only on the two endpoints of the curve C. We call Ie Hilbert's invariant integral. In view of the property mentioned above, we can denote the value of the functional J for a function y = y(x) representing a curve C by J e . Then anyadmis- sible curve C that passes through a Held em- bedding a stationary curve Co satisfies O.s;M=Jc-J co = Ie 6"(x,y;p,y')dx, Here 6"(x,y;p, Y') = F(x, y, y')- F(x, y,p)-(y' - p)Fy'(x, y,p) is the 6"-function introduced by Weierstrass. For Co to give the minimum of J [y], it suffices that g  0 hold for every point (x, y) in. the field and every value y' (279 Morse Theory). D. Optimal Control Let a system of differential equations dx;/dt = /;(x l ,. . ,x n ; U", Uk)' (U I"") Uk)E!1; X;(tO)=X i , i== 1,...) n, be given, where U l . . . , Uk are parameters. In general, a problem of optimal control is to determine u j = uj(t) (to  t  t l ) such that the value of a functional f 'l J[u] = F(x l ,.. ..X n ; u 1 ,.... uk)dt '0 assumes a minimum, where x,(t) are the solu- tions of (4) and are considered functions in UI, . . . , uk> and t. Such a problem is a kind of conditional variational problem. But since the existence region of U is restricted, certain con- ditions in the form of inequalities are imposed, and furthermore u is not necessarily continu- ous, and in many cases the problem cannot be treated within the classical theory of the calculus of variations ( 86 Control Theory). E. The Direct Method in the Calculus of Variations In mathematical physics tvariational principles are derived from discussions of formal corre- 194 spondence between a functional J[uJ to be minimized and Euler's equation for J[u]. This is certainly one of the important methods in the calculus of variations, but it is also possible to investigate a stationary function u o on the basis of its stationary character and indepen- dently of Euler's equation. This is called the direct method in the calculus of variations. It plays an important role in the theoretical treatment of the existence and uniqueness of solutions, and it is also significant as a tech- nique for approximate or numerical solutions. When a differential equation is given inde- pendently of the calculus of variations, it is possible to apply the direct method if a func- tional whose Euler's equation is the given differential equation can be constructed. Let D be a bounded domain in m- dimensional space and fEL 2 (D) be a real- valued function. Consider the variational problem of minimizing the functional J[u]= L Igradul 2 dx-2 Lf udX . Here we suppose the set of admissible func- tions, denoted by Ai> to be the Hilbert space obtained by completing the function space C(D) with respect to the norm N(u) =(LlgraduI2dx+ L u 2 dx r 2 (4) Utilizing F. Riesz's representation theorem in Hilbert spaces, it can be shown that there exists a minimum value I in A, which is uniquely realized by certain U o E AJ' Since the function U o belongs to A" it can be shown that the boundary condition ulcD=O (5) is satisfied in a generalized sense. Furthermore, in view of J[uoJ.s; J[u o + <PJ being valid for any <pEC(D), it can be verified that the equation - u = f (6) is satisti'ed in D in the sense of differentiation of tdistributions. In other words, the station- ary function U o is a solution in the wide sense (a tweak solution) of the classical boundary value problem for tPoisson's equation for- mulated by (6) and (5). If a function space A, with AJ:J AJ:J C(D) is taken as the set of admissible functions, the value 1== J[uoJ be- comes the greatest lower bound of J in AJ' In this case, if {Un}1 is any minimizing se- quence from Ai> that is, if un E A" n = 1,2,. . ; J[U n ] ....../ (n...... 00), then it converges to U o in the sense that N(un-uo)......O (n......oo). (7) In other words, the solution in the wide sense 
195 U o of the boundary value problem can be constructed as the limit of a minimizing se- quence that consists of sufficiently smooth functions vanishing on the boundary. In the proof of the fact that a solution in the wide sense U o coincides with the classical solution of the boundary value problem under an assump- tion of suitable smoothness for f and aD, a standard argument has been established for proving the regularity of a solution in the wide sense, The technique of obtaining the solution of the boundary value problem as the limit of a minimizing sequence was proposed by B. Riemann concerning the classical tDirichlet problem and was completed by D. Hilbert. This pioneering work led to the recent treat- ment of boundary value problems by utilizing Hilbert spaces. For +Self-adjoint boundary value problems, the method stated for the above example has been generalized almost directly to the cases of differential operators of higher order and with variable coefficients (- 323 Partial Differential Equations of Elliptic Type). By making use of some auxiliary argu- ments, this technique can be applied exten- sively to the construction of several kinds of mapping functions in the theory of functions of a complex variable, to the solution of tintegral equations of the second kind, and also in other fields [3,4]. The eigenvalue problem, which is formu- lated by Hu = AU, U#O, with a +Self-adjoint operator H in a Hilbert space, can also be transformed into a vari- ational problem for the tRayleigh quotient R[uJ =(Hu, u)/lluI1 2 ( 298 Numerical Computation of Eigenvalues). F. Solution of Differential Equations by the Direct Method In view of the convergence shown in (7), a minimizing sequence can be regarded as an approximating sequence for a solution of the boundary value problem or a stationary func- tion uo. Let a function Un = un(x; c) = un{x; C 1 ' ' , , c,) involving an n-vector c = (c l , . . . , c,) as a para- meter be admissible for any c. If J [u n {.; c)] = F(c), obtained by substituting un into J, is minimized at c = co, then u n ( '; CO) is considered as a function that approximates U o most pre- cisely within the family u n ( .; c). This vector CO is obtained, in general, by solving the simulta- 46 Ref. Calculus of Variations neous equations oJ[ u n (.; C)]joc j ==0, j == 1, ..., n. (11) The function Un appearing in (10) is often taken to be a so-called linear admissible function. For instance, in the example above, let {<pdk"=1 be a system of independent functions complete in AJ, and set Un=C I <PI +... +cn<Pn' (12) The method that constructs a minimizing sequence u I , u 2 ,. by determining the value of the ck in (12) by (11) is called Ritz's method, and <fJk is called a coordinate function in this method. As for the rate of convergence in approximation by Ritz's method, as well as for the estimation of errors, there are several results by the Soviet school in addition to those of E. Trefftz [3,5]. (Other methods of constructing minimizing sequences are stated in detail in [3J; conc=ing a connection with tGalerkin's method  304 Numerical Solution of Partial Differential Equations.) Ritz's method applied to eigenvalue prob- lems is called the Rayleigh-Ritz method. Since a stationary value of the Rayleigh quotient is itself an eigenvalue in this case, the precision of approximation is far better for eigenvalues than for eigenfunctions, so that it is a convenient method for the approximate computation of eigenvalues. (8) References (9) [1] R. Courant and D. Hilbert, Methods of mathematical physics, Interscience, 1, 1953; II, 1962, [2J P. Funk, Variationsrechnung und ihre Anwendungen in Physik und Technik, Springer, 1962. [3J L.. V. Kantrovich and V. 1. Krylov, Ap- proximation methods of higher analysis (in Russian), Gostekhizdat, 1952. [4J S. G. Mikhlin, Variational methods in mathematical physics, Macmillan, 1964. (Original in Russian, 1957.) [5J S. G. Mikhlin, The problem of the mini- mum of a quadratic functional, Holden-Day, 1965. (Original in Russian, 1952.) [6J S. L. Sobolev, Applications of functional analysis in mathematical physics, Amer. Math. Soc. Trans\. of Math. Monographs, 1963. (Original in Russian, 1950.) [7J E. Goursat, Cours d'analyses mathema- tique, III, Gauthier-Villars, 1927. [8J C. Caratheodory, Calculus of variations and partial differential equations of the first order, Holden-Day, 1967. [9J O. Bolza, Vorlesungen iiber Variations- rechnung, Teubner, 1909. (10) 
47 Ref. Cantor, Georg [lOJ I. M. Gel'fand and S. V. Fomin, Calculus of variations, Prentice-Hall, 1963. (Original in Russian, 1961.) [llJ C. B. Morrey, Multiple integrals in the calculus of variation, Springer, 1966. 47 (XXI.1 5) Cantor, Georg George Cantor (March 3, 1845-January 6, 1918), the founder of set theory, was born in St. Petersburg into a Jewish merchant family that settled in Germany in 1856. He studied mathematics, physics, and philosophy in Zu- rich and at the University of Berlin. After re- ceiving his degree in 1867 in Berlin, he became a lecturer at the University of Halle and served as a professor at that university from 1879 to 1905. In 1884, under the strain of opposition to his ideas and his efforts to prove the tcon- tinuum hypothesis, he suffered the IIrst of many attacks of depression which continued to hospitalize him from time to time until his death. The thesis he wrote for his degree concerned the theory of numbers; however, he arrived at set theory from his research concerning the uniqueness of ttrigonometric series. In 1874, he introduced for the IIrst time the concept of +Cardinal numbers, with which he proved that there were "more" ttranscendental numbers than talgebraic numbers. This result caused a sensation in the mathematical world and became the subject of a great deal of con- troversy. Cantor was troubled by the oppo- sition of L. tKronecker, but he was supported by J. W. R. tDedekind and G. Mittag-Lerner. In his note on point-set theory, he wrote, in connection regard with his concept of infinity, "The essence of mathematics lies in its free- dom!" In addition to his work on cardinal numbers, he laid the basis for the concepts of torder types, ttransfinite ordinals, and the theory of real numbers by means of tfunda- mental sequences. He also studied general point sets in Euclidean space and defined the concepts of taccumulation point, tclosed set, and topen set. He was a pioneer of the point- set theory that led to the development of general ttopology. References [IJ G. Cantor, Gesammelte Abhandlungen, E. Zermelo and A. Fraenkel (eds.), Springer, 1932 (Glms, 1962), [2J A. Schoenflies, Die Krisis in Cantors math- ematischem Schaffen, Acta Math., 50 (1927), 1-23. 196 48(X.31) Capacity A. General Remarks The electric capacity of a conductor in the 3- dimensional Euclidean space R 3 is defined as the ratio of a given positive charge on the conductor to the value of the potential on the surface. This definition of capacity is indepen- dent of the given charge. The capacity of a set as a mathematical notion was defined IIrst by N. Wiener (1924) and was developed by O. Frostman, C. J. de La Vallee Poussin, and several other French mathematicians in con- nection with tpotential theory. B. Energy Let Q be a tlocalIy compact Hausdorff space and <I>(x, y) be a t\ower semicontinuous func- tion on Q x Q such that -00 <<I>:;; YJ. A mea- sure f1 will mean a nonnegative tRadon mea- sure with compact tsupport S", Denote by <I>(x, f1) the tpotential J <II(x, y)df1(Y) of a mea- sure f1 with kernel <I> and by (f1, f1J the tenergy S S <II df1 dJ1 of f1. Let X be a set in Q, and denote by ilIf x the class of normalized measures f1 (i.e., of measures J1 satisfying f1(Q) = 1) with S c X. Let K be a nonempty compact set in Q. Set W(K)=inf(f1,f1) for f1EilIfK' and W(0)=00 for the empty set 0. For <I>(x, y) = l/Ix - yl in Q = R 3 , the general solution u(x) of the tDirich- let problem (texterior problem) for the bound. ary function 1 in the unbounded component of R 3 - K is equal to the potential of an tequilib- rium mass-distribution. Therefore, if S is a smooth surface surrounding K and normals are drawn outward to S, the integral -( 1/4 n)' Ss(3u/3n)d<J of the normal derivative is equal to l/W(K). This is the capacity of K delIned by N. Wiener (1, Math. Phys., MIT, 3 (1924» when K is a closed region. Vallee-Poussin (Ann, Inst. H Poincare, 2 (1932» called the supremum of f1(R3) the Newtonian capacity of a bounded tBorel set E, where J.l runs through the class of measures f1 with S c E whose tNewtonian potentials are not greater than 1 in R 3 . If E is compact, the Newtonian capacity coincides with Wiener's capacity. For the tlogarithmic potential in R 2 , e-W(K) is called the logarithmic capacity. When tGreen's func- tion g(z, 00) with the pole at the point at infin- ity exists in the unbounded component of R 2 -K, limHoo(g(z, oo)-loglzIJ is called Rohin's constant and can be shown to be equal to W(.l(). (For the relation between Robin's con- stant and treduced extremal distance --" 143 Extremal Length.) In the case of a general 
197 kernel it is difficult to define capacity as above by means of W(K), and hence the value of W(K) itself instead of the capacity of K is often used. When W(.l()= 00, we can say that K is of capacity zero. The minimum value of the tGauss integral (J1, fl.) - 2 S S dfl. is a generaliza- tion of W(.l(), where J1Etl/i K and f is an upper semi continuous function bounded above on K. C. Minimax Value Suppose that we are given the kernel  as above. For a set X c: Q and a measure 11, set U(fl.; X) = SUPXEX <I>(x, fl.) and V(fl.; X)= inf xEx <I> (x, fl.), Next, for Y c Q, set U(Y) = inf U(J1; S), V(Y) = sup V(J1; S), U(X, Y) = inf U( J1.; X), and V(X, Y) = sup V( fl.; X), where fl.Etl/i y . If <l>(x, y) = @(y, x) is taken as a kernel instead of <I>(x, y), then the notations u J(.l(), 0(/1; X), 17(/1; X), are used correspondingly. For any compact set K the following relations hold: W(.l()= W(K).s; CI(.l()= arK) < { U(K,K)= V(K, K) } 1 !l(Q,.l()_= V(K,Q) " O(K K)= V(K K)  U(Q, K)-:. V(K,Q) , , V(.l()= V(K). Examples show that all the inequalities can be strict. The tminimax theorem in the ttheory of games plays an important role in the proof of these inequalities [7]. Even if the kernel is symmetric, the inequalities can be strict except for the equality W(K)= U(K). When the kernel is positive, we can define the quantities which correspond to U(Y), V(Y), U(X, Y), V(X, Y) by considering the class of fl. with S c Y and (/1, /l) = 1 instead of qf y, D. Transfinite Diameter As k--> 00, Dk(K)=k-l(k-I)-1 inf. L <I>(xi,x) Xl' ,XkEKi¥-j decreases and the limit D(.l() is equal to W(.l(). For the logarithmic kernel in R 2 , M. Fekete defined D(.l() and called e - DIK) the transfinite diameter of K (1923). F. Leja and his school in Poland studied relations between transfinite diameter and tconformal mapping. Next, set k kRk(X, Y) = sup inf L (x, xJ Xl' ,xkEYxeXi=l Then R(X, Y) = Iim Rk(X, 1? exists as k...... 00, and we have R(K, 1?= V(K,1? Fekete introduced R(.l()=R(K,.l() in R 2 (1923). G. Polya and G. Szego computed D(.l() and R(.l() for special K and cc-kernel r -a (if.  0) in R 2 and R 3 [12]. The equality D(K)==R(K) holds for the logarithmic kernel in R 2 and the 48G Capacity Newtonian kernel in R 3 The maximum of the absolute value on K of tChebyshev's poly- nomial ( 336 Polynomial Approximation) of order k with respect to K in R 2 is equal to exp( - kRdK». E. Evans's Theorem In order that K be of Newtonian capacity zero, it is necessary and sufficient that there exist a measure /l on K such that the New- tonian potential of J1 is equal to 00 at every point of K. This result was proved by G. C. Evans and H. Selberg independently (1935) and is called Evans's theorem (or the Evans- Selberg theorem). The corresponding theorem in R Z is often applied in the theory of functions [ 15,16]. A similar potential exists in case of a general kernel if and only if R(K,.l()= 00, F. Nonadditivity of Capacity Many kinds of capacity satisfy the inequality cap( Un X,)  Ln cap X n , where a capacity is denoted by cap. Even the Newtonian capacity C is not necessarily additive, but it satisfies C(K I UK 2 )+C(K 1 nK 2 ).s;C(K l )+ C(K 2 )(G. Choquet [2J), Choquet [3] proved that X can be divided into mutually disjoint sets X, and Xl such that C;(X) = Ci(X l ) = C i (X 2 ), where C;(X) is the Newtonian inner (or interior) ca- pacity delIned to be SUPK C X C(K) if X#-0 and 0 if X == 0, G. Relation to Hausdorff Measure There are many studies of relations between capacity and tHausdorff measure [ 1]. Frost- man [6] introduced the notion of capacitary dimension and observed that it coincides with the Hausdorff dimension. The capacity of product sets has been evaluated from above and below [ 11]. For compact sets KeRn, K' c Rm, their dimensions a, {3, and the dimension y of K xK', we have the relation a + {3.s; y :( min(m + a, n + {3), where the equalities are attained by general tCantor sets. There are also works on the evaluation of capacities of general Cantor sets [10,15]. If K is a con- tinuum of logarithmic capacity 1 in a plane, then its diameter d satisfies 2  d  4, and its area A satisfies A<n [S]. Consider the sum K = {z 1 +... +Zn I ZkEKb I .s; k.s; n} of continua K I , . . . , Kn in a plane. The logarithmic capacity of K is strictly greater than the sum of the logarithmic capacities of K 1" Kn except when all Kk are convex and mutualIy similar [14]. By various tsymmetrizations the log- arithmic capacity decreases in general ( 228 Isoperimetric Problems; also [ 133). 
48H Capacity H. Capacitability The Newtonian outer (or exterior) capacity C,(X) is defined by inf Ci(G), where Granges over an open set containing X. The inequality C;(X).s; C,(X) holds, in general. When the equality holds, X is called capacitab1e. Cho- quet (1955) [2J proved that all tanalytic sets and hence tBorel sets are capacitable but there exists an analytic set whose complement is not capacitable. He himself generalized his result on capacitability in the following way [4]: Let Q be an abstract space, <p a nondecreasing function defined on the family of all subsets of Q, and ff some family of subsets of Q that is closed under the formation of finite unions and countable intersections. Assume that <p(HnH<p(H) as Hn in ff decreases to H and that rp(Xn)t<p(X) as XnlX. When <p(X) is equal to sup{<p(H) I HEff, HeX}, X is called (rp, ff)-capacitable. Choquet deti'ned ff- Suslin sets and showed that they are (<p, ff)- capacitable. M. Kishi [9J, Choquet [5J, and B. Fuglede [7J investigated capacitability with respect to several kinds of capacity more gen- eral than Newtonian capacity. We Can discuss capacitability with respect to quantities de. fined in connection with the tGauss varia- tional problem. References [IJ L. Carleson, Selected problems on excep- tional sets, Van Nostrand, 1967. [2J G. Choquet, Theory of capacities, Ann. Inst. Fourier, 5 (1955), 131-295. [3J G. Choquet, Potentiels sur un ensemble de capacite nulle, Suites de potentiels, C. R. Acad. Sci. Paris, 244 (1957),1707-1710. [4J G. Choquet, Forme abstraite du theoreme de capacitabilite, Ann. Inst. Fourier, 9 (1959), 83 -89. [5J G. Choquet, Diametre transfini et com- paraison de diverses capacites, Sem. Theorie du Potentiel, 3 (1958-1959), no. 4. [6J O. Frostman, Potentiel d'equilibre et capacite des ensembles avec quelques applica- tions a la theorie des fonctions, Medd. Lunds Univ, Mat. Sem" 3 (1935), [7J B. Fuglede, Une application du theoreme du minimax a la theorie du potentiel, Colloque Intemat. sur la Theorie du Potentiel, Paris, Orsay, 1964, C. N. R. S. expose no. 8. [S] G. M. Goluzin, Geometrische Funktionen- theorie, Deutscher Verlag der Wiss., 1957. [9J M. Kishi, Capacitability of analytic sets, Nagoya Math. J., 16 (1960), 91-109. [10J M. Ohtsuka, Capacite d'ensembles de Cantor generalises, Nagoya Math. J., 11 (1957),151-160, 198 [ 1 IJ M. Ohtsuka, Capacite des ensembles produits, Nagoya Math. J., 12 (1957),95-130. [12J G. P6lya and G. Szego, Uber den trans- finiten Durchmesser (Kapazitatskonstante) von ebenen und raumlichen Punktmengen, J. Reine Angew. Math., 165 (1931),4-49. [ 13J G. P61ya and G. Szego, Isoperimetric inequalities in mathematical physics, Prince- ton Univ. Press, 1951. [ 143 C. Pomerenke, Zwei Bemerkungen zur Kapazitat ebener Kontinuen, Arch, Math., 12 (1961),122-128. [15J M. Tsuji, Potential theory in modem function theory, Maruzen, 1959. [ 16J L. Sario and K. Oikawa, Capacity func- tions, Springer, 1969. 49 (11.6) Cardinal Numbers A. Definition The general concept of cardinal number is an extension of that of natural number (Cantor [ lJ). When there exists a tone-to-one corre- spondence whose tdomain is a set A and whose trange is a set B, this set B is said to be equipotent (or equipollent) to A, and this rela- tion is denoted by A-B. The relation  is an tequivalence relation, and each equivalence class under this relation is said to be a cardinal number. The class of all sets equipotent to a set A is denoted by A (or A I) and is said to be the cardinal number (power, cardinality, or potency) of tbe set A. When A is a finite set, A IS said to be finite, and when A is an infinite set, A is said to be infinite (or transfinite). When the cardinal number of a set A is m, A is also said to consist of m members (or m elements). In this sense, 0 and the natural numbers are considered to express finik.card ina l numbers. For example, 0= 0,1 = {O}, 2 = {O, I}, etc. Examples of infinite cardinal numbers: A set A which is equipotent to the set N of all natural numbers is said to be countab1y infinite, and the cardinal number of the set N is denoted by a. A set A which is finite or countably infinite is said to be countable. The cardinal number of the set of all real numbers is denoted by c and is called the cardinal number of tbe continuum. Moreover, the cardinal number of the set of all real-valued functions whose domain is the interval [0, lJ is denoted by f, These three cardinal numbers are known to be distinct. Henceforth in this article, lower-case German letters denote cardinal numbers. For a defi- nition of cardinal numbers using the concept of ordinal numbers -" 312 Ordinal Numbers. 
199 B. Ordering of Cardinal Numbers m? n or n.s; m will mean that there exist sets A and B such that m = if, n = if, and A=:> B. A # B does not necessarily imply m # n. For example, the cardinal number of the set of all positive even numbers is also a, m? nand n? m imply m = n (Bernshtein's theorem). Since the treflexive and ttransitive laws for the relation .s; between cardinal numbers are obvious, the relation is an tordering relation. The twell-ordering theorem imples that ? is a ttotal ordering (comparability theorem for cardinal numbers). m > n means that m? n and m # n. When B  1t1, B is said to be at most m. C. Sum, Product, and Power of Cardinal Numbers For cardinal numbers m and n, choose sets A and B so th at m= A, n=if, and An R= 0, and put s = A U B . Then s is uniquely deter- mined by m and n. The s is said to be the sum of the cardinal numbers 111 and n and is de- noted by m + n. If the sets A, B are chosen as descnbed above, the cardllla1 numbers of the tCartesian product A x B and of the set of functions A B are called the product of m and n and the nth power of 111, denoted by mn and m", respectively. These operations are also determined by m and n. For these three oper- ations, the following laws are valid: commuta- tive laws m + n = n + m, mn = nm; associa- tive laws (m+n)+ p=m+(n+ pI, (mn)p = m(np); distributive law p(m + n) == pm + pn; exponential laws m"+P=m"111P,m"P=(111")P, (mn)P = mPn P . In particular, if A = m, then 2'" is the cardinal number of the tpower set IJ.\(A) of A. Addition and multiplication of more than two cardinal numhers can he defined as fol- lows. Let ;\ be any set, and suppose that to any element j, of;\ there corresponds a unique car dinal number 111 A Let M A be a set such that M A =m A , and MAn M = 0 for A#X. Then the cardinal number of the tdisjoint union LA M A is said to be the sum of all m A and is denoted by L;. m A , The cardinal number of the Cartesian product IL M A is said to be the product of al1 mA(h;\) and is denoted hy III m A . The axiom of choice can be stated as follows: If m) # 0 for all A E;\, then I1 A 111 A # 0, D. The Continuum Hypothesis For a, c, and f defined as before, f = 2' > c = 2" > a. In general, 2'" > m holds for any cardinal 49 F Cardinal Numbers number m (Cantor). The hypothesis which asserts that for any 111, there does not exist an n such that 2 m > n> m is called the gener- alized continuum hypothesis. In particular, this hypothesis restricted to the case where m = a is called the continuum hypothesis. After Can- tor stated this hypothesis (J, Reine Angew. Math., 84 (1878)), it remained an open ques- tion for many years, In particular, Cantor himself repeatedly tried to prove it, and W, Sierpinski pursued various related hypotheses. Finally, the continuum hypothesis and the generalized continuum hypothesis were proved to be independent of the axioms of set theory by K. Godel (1940) [3J and P. J. Cohen (1963) [4J ( 33 Axiomatic Set Theory). E. Cardinality of Ordinal Numbers Lower-case Greek letters will stand here for tordinal numbers. The cardinal number of g I  < Gf} will be denoted by (j, which is called the cardinality of the ordinal number (X or the cardinal number corresponding to (x. When a cardinal number 111 corresponds to some ordinal number, the minimum among ordinal numbers (X with (X = m is called the initial or- dinal number corresponding to m. An initial ordinal number corresponding to an infinite cardinal number is called a transfmite initial ordinal number. There exists a unique corre- spondence fJ -->w p from the class of ordinal numbers onto the class of all transfinite initial ordinal numbers such that fJ>y implies wp> w y ' In particular, W o = w, and an ordinal number  such that « WI is called a count- able ordinal number. w p is called the fJth transfinite initial ordinal number. The cardi- nality of w p is denoted by p ( is the Hebrew letter aleph). In particular, a is denoted by o (aleph zero). We have p?y if and only if fJ?y, and in this case p+y==p, py= p' The axiom of choice implies that every infinite cardinal number is an p' Hence. in this case, the continuum hypothesis can be formulated as 2"0 =  I, and the generalized continuum hypothesis can be formulated as 2"1 1 = P+l for every ordinal number fJ. F. Finiteness and Infiniteness Oedekind [5J defined a set A to be infinite if A is equipotent to a proper subset of itself, and to be finite otherwise. It is also possible to define finiteness and infiniteness of sets as follows: A Set A is finite if there exists a twell- ordering of A such that its tdual ordering is also a well-ordering, and A is infinite other- 
49 Ref. Cardinal Numbers wise. If a set is finite in the latter sense, then it is also finite in the sense of Dedekind. Under the axiom of choice, these two definitions Can be shown to be equivalent. References [1] G. Cantor, Beitdige zur Begrundung der transfinite Mengenlehre I, Math, Ann., 46 (1895),471-512. GesammeIte Abhandlungen, Springer, 1932; English translation, Contri- butions to the founding of the theory of trans- finite numbers, Open Court, 1915. [2J W. Sierpinski, Hypothese du continu, Warsaw, 1934. [3J K. Gode1, The consistency of the con- tinuum hypothesis, Ann. Math. Studies, Prin- ceton Univ. Press, 1940. [4J p, J. Cohen, Set theory and the continuum hypothesis, Benjamin, 1966. [5J R. Dedekind, Was sind und was sollen die Zah1en? Vieweg, 1888. GesammeIte mathema- tische Werke 3, Vieweg, 1932; English trans- lation, Essays on the theory of numbers, Open Court, 1901. Also  references to 381 Sets. 50 (XXI.16) Cartan,Elie Elie Cartan (April 9, 1869-May 6, 1951) was born at Dolomieu in the French province of Isere, He entered the Ecole Norma1e Supe- rieure in Paris in 1888 and graduated in 1891, having at the same time qualified in the agrege examination. Beginning his research immedi- ately, he completed his thesis on the structure of continuous transformation groups [2J in 1894 at the age of 25. Cartan was a professor first at the Univer- sity of Montpellier, later at the University of Lyon, then the University of Nancy, and fi- nally in 1912 at the University of Paris. He freely used the tmoving coordinate system introduced by 1. G. Darboux, and contri- buted to many areas, such as the theory of tLie groups, the theory of tPfaffian forms, the theory of tjnvariant integrals, ttopo10gy, tdif- ferential geometry (especially the geometry of tconnections), and theoretical physics. His doctoral thesis is still an object of interest among young researchers today, and the con- cept of connection introduced by him is funda- mental in the field of differential geometry. Henri Cartan (1904-) is his eldest son. 200 References [I] E. Cartan, Oeuvres completes I-III, Gauthier-Villars, 1952-1955. [2J E. Cartan, Sur la structure des groupes de transformations finis et continus, Thesis, 1894. (Oeuvres completes, pt. I, vol. 1, 137-287,) [3J S. S. Chern and C. Chevalley, Elie Cartan and his mathematical work, Bull. Amer. Math. Soc" 58 (1952),217-250. 51 (IX.24) Catastrophe Theory A. General Remarks Catastrophe theory was originally proposed by R. Th om [1, 2J in late 1960, This theory provides certain mathematical models for the evolution of forms in nature, in particular in biology and the natura11anguages. In recent years, E. C. Zeeman and others have devel- oped and applied the theory to various fields, including the physical sciences, medicine, economics, and sociology; a detailed bibli- ography may be found in [3,9,10]. B. Static Models The basic concept of the theory is the static model, This is a family of potential functions fu: X --> R, where X is a subset of R n , containing a neighborhood of the origin, and the para- meter u lies in a neighborhood U of the origin in R'. We regard R" as the internal space or state space, which is parametrized by the various variables that are relevant to the process under study, and R' is the external space or control space, which can be either the physical space- time continuum in which the process under consideration takes place or a space of control parameters that govern the process. As a rule, we assume that the dimension r.s; 4, although the dimension n can be arbitrary large. The static model is local in nature, and any one of the local minima of L called a local regime at UE U, is a candidate for the state of the model corresponding to the control point u. Mathematically, a static model is a germ of COO -functions f: R n x R' --> R at 0 that is an unfolding (r-dimensional extension family) of a germ of COO-functions IJ = fl Rn x {O}: Rn-->R at 0; further details are given in Section D. 
201 C. Classification of Singularities Let rS'(n, m) be the vector space of tgerms of C"-functions f:Rn......R m at 0, and let 88(n)c rS'(n,n) be the subset of invertible germs R n ...... Rn mapping 0 to O. The germs 1],  E rS'(n, m) are called right equivalent if there exists an hE:?4(n) such that I]oh=. Let rS'(n)=0'(n, I). Then it follows in the usual manner that 0'(n) is a local algebra with the unique maximal ideal uIt = uIt(n) = {I]ErS'(n)II](O)=O}. A germ I]ErS'(n) such that 1](0) =DI](O) =0 is called a singularity. For any singularity 1], we define the codimension of I] by codim I] =dimR(uIt /<iJI]/iJx;) 6(n»)' where <iJI]/OX i )6(n) is the ideal of rS'(n) gen- erated by ol]/ox I , ..., Ol]/ox n and x =(x I' ..., x n ) denotes the coordinate system of Rn. The following result was proved by J. N. Mather [4]: Up to the addition of a nondegen- erate quadratic form in other variables, and up to multiplication by :i I, a singularity of codi- mension .s; 4 and  1 is right equivalent to one of the I] appearing in Thorn's list of the seven elementary catastrophes below. D. Unfoldings Let I] be a singularity. An r-unfolding of I] is a germ f E .,({(n + r) such that fl Rn x {O} = 1]; this unfolding is denoted by (rJ). Let (rJ) and (s, g) be unfoldings of 1]. A morphism (<p, <1>, 8'): (rJ)......(s, g) consists of (i) a germ <p E rS'(n + r, n + s) such that <p I R n x {O} = identity, (ii) a germ <1> E rS'(r, s) such that ns 0 <p = <1> 0 n" (iii) a germ EE.(r) such thatf=goip+Eon" where n,: R n x R' ......R' is the projection. In this case we say that the unfolding (rJ) is induced by (<p, <1>, E) from (8, g). A morphism is an isomor- phism if <p and <1> are diffeomorphic germs. The addition of unfoldings (rJ) and (s, g) of I] is defined by (rJ)+(s,g)=(r+sJ +g-I]), where the last term on the right is given by (/ + 9 - I])(x, U, v) = f(x, u) + g(x, v) - I](x). Thus if the constant unfolding (8, 1]) of I] is 51 E Catastrophe Theory defined by I](x, u) = I] (x), we have (rJ)+(s, 1])= (r + sJ). An unfolding (rJ) of I] is said to be versal if any unfolding of I] is induced by (rJ) and a suitable morphism. A versal unfolding (rJ) with minimal r is said to be universal. The following facts have also been proved by Mather [4]: A singularity I]E.,{{(n) has a versal unfolding if and only if codim I] is finite. Any two r-versal unfoldings of I] are isomorphic. Every versal unfolding is isomorphic to (rJ) + constant, where r = codim I] and (rJ) is a universal unfolding defined as follows: If {b" ...,b,}c uIt(n) is a system of representatives for a basis of .,({(n)/<ol]/iJx;) 6(n)' then the unfolding f of I] is defined by f(x,u)=I](x)+bdx)uI + ..,+ b,(x)u,. E. The Seven Elementary Catastrophes Let f and 9 be germs in rS'(n+ r). We say that f and 9 are equivalent as r-unfoldings if there exist hE88(r), a family of HuE 88(n), where UE U c R', and an E E uIt(r) such that f(x, y) = g(Hu(x), h(U)+E(U). We say that a static model (rJ) is stable if any small perturbation (r, g) of (rJ) in rS'(n + r) (with the Whitney COO-topology) is equivalent to (rJ). The following is the main result ob- served by Thorn and proved by Mather and others (for references see the bibliography in [3J). Suppose that r .s;4. Then the set of stable static models (rJ) is an open dense subset of rS'(n + r), and up to the addition of a nondegen- erate quadratic form and multiplication by :iI, any stable static model (rJ) is equivalent to one of the models with the standard poten- tials F, which are the universal unfoldings of singularities I] in Table I (x, y denote inter- nal variables, and u, v, w, t denote external variables). The static models with these standard po- tentials are known as the elementary catas- trophes and can serve as qualitative models of various natural processes. Table 1. Thom's List of the Seven Elementary Catastrophes r Singularity 1'/ Standard Potential F Name 1 x 3 x 3 + ux Fold 2 x 4 x 4 + ux 2 + vx Cusp (Riemann-Hugo-Niot) 3 x 5 x 5 + ux 3 + vx 2 + wx Swallowtail (Dovetail) 3 x3 + y3 x 3 + y3 + uxy + vx + wy Hyperbolic umbilic 3 x 3 - xy2 x 3 - xy2+ u(x 2 + y2)+ vx+ wy Elliptic umbilic 4 x 6 x 6 + tx 4 + ux 3 + vx 2 +wx Butterfly 4 x2 y+ y4 x 2 y+ y4+ ux 2 + vy2+ wx+ ty Parabolic umbilic 
51 F Catastrophe Theory F. Catastrophe Sets A process for the static model fE$(n+r) is a subset s of X x U, where X and U are neigh- borhoods of the origin in R n and R', respec- tively. If s is a process and UE U, we define Su= sn(X x {u}). We say that UE U is a regular point for s if there exists a neighborhood V of U in U and a homeomorphism h: X x V --> X x V such that n,oh=n, on X x V and h(sn (X x V))=su x V. A catastrophe point is a non- regular point in U. The set of all catastrophe points is called the catastrophe set. There are several conventions with regard to the definition of a process for the unfolding (r, I). One of these is the Maxwell convention, which requires that Su be a least local regime at U E U. The catastrophe set of the model (r, f) according to the Maxwell convention consists of those points U of the control space R' where the least minimum of fu is attained for at least two points or where this minimum is attained at a unique point but is not stable. Another is the perfect delay convention, which assigns to each path r in U a mapping (possibly discontinuous) m,:r-->X x r such that n,(m,(u)) = u, (m,(u), u) is a local regime, and m, remains continuous for a maximum interval on the path r. Consider the set  = {(x, U)E Rn x R'I d; f(x, u) is degenerate} and its image B = n,(), called the bifurcation set, under the projection n,: Rn x R' -->R'. Then the points of the bifurcation set B are impor- tant candidates for catastrophe points of the static model with respect to the perfect delay convention. For geometrical studies of the elementary catastrophes  [6-8]. The static models have been generalized to metabolic models, the structure of which largely remains an open question [2]. In this connection, the bibli- ographies of [3,9, 10J appear to be fairly complete. References [IJ R. Thom, Topological models in biology, Topology, 8 (1969), 313-335. [2J R. Thom, Stabilite structuelle et mor- phogenese, Benjamin, 1972; English transla- tion, structural stability and morphogenesis, Benjamin, 1975. [3J A. Manning (ed.), Dynamical systems- Warwick 1974, Lecture notes in math. 468, Springer, 1975. [4J J. N. Mather, Right equivalence, Warwick preprint, 1969. [5J G. Wassermann, Stability of unfoldings, Lecture notes in math. 393, Springer, 1974. 202 [6J T. Brocker, Differentiable germs and catastrophes, London Math. Soc. Lecture Notes Series 17, Cambridge Univ. Press, 1975. [7J A. N. Godwin, Three-dimensional pictures for Thom's parabolic umbilic, Publ. Math Inst. HES, 40 (1971), 117-138. [8J A. E. R. Woodcock and T. Poston, A geo- metrical study of the elementary catastrophes, Lecture notes in math. 373, Springer, 1974. [9J T. Poston and I. Stewart, Catastrophe theory and its applications, Pitman, 1978. [IOJ E. C. Zeeman, Catastrophe theory, Select- ed papers 1972-1977, Addison-Wesley, 1977. 52 (11.24) Categories and Functors A. Categories Consider the family of all tgroups. Given two groups X and Y, denote the set of all thomo- morphisms of X to Y by Hom(X, Y). If X, Y, and Z are groups and if f: X --> Y and g: Y--> Z are homomorphisms, we can compose them to get a homomorphism go f: X -->Z. In general, suppose that we are given, as in this example, (1) a family \JJl of mathematical objects, and (2) for every pair (X, Y) of objects in \JJl, a set Hom(X, Y) whose elements are called morphisms; and let f E Hom(X, Y) and gEHom(Y, Z) determine a morphism gofEHom(X,Z), which is called their com- posite. A morphism fEHom(X, Y) is also written f: X --> Y. Suppose further that these morphisms satisfy the following axioms: (1) if f:X --> Y, g: Y -->Z, and h:Z--> Ware mor- phisms, then (hog)of =ho(g of); (2) for each object X E\JJl there exists a morphism 1 x: X -->X such that for any f:X --> Yand g:Z-->X we have fo lx=f and l x og=g; (3) Hom(X, Y) and Hom(X', Y') are disjoint unless X = X' and Y = Y'. Then we call the whole system (i.e., the family of objects \JJl, the morphisms, and the composition of mor- phisms) a category. The elements in \JJl are called the objects of the category. By axioms (1) and (2), the set Hom(X,X) is a semigroup (with respect to the composition of morphisms) which has 1 x as the identity element. Hence Ix is determined uniquely by X. On the other hand, axiom (3) implies that a morphism f determines the objects X and Y such that f E Hom(X, Y). From these facts we can give an alternative definition of cate- gory using only the morphisms and their composition. The totality of the objects (or morphisms) in a category Cf! is denoted by Ob(Cf!) (or FI(Cf!); 
203 the notation FI comes from the French word flee he). The relation x E Ob('&') is often ab- breviated to XE'&', while Hom(X, Y) is written Hom,(X, Y) if necessary. A subcategory of a category (e is a category (e' with Ob('&") c Ob«(e), such that for X, YE'&" we have Hom,(X, Y)cHom,(X, Y) and the compo- sition in '&" is the restriction of'&' to ,&,'. If Hom",(X, Y)=Hom,(X, Y) for all X, YE'&", we say that (e' is a full subcategory of (e. We define the product category (e l x '&'2 of two categories in the canonical way, using the pairs of objects and the pairs of morphisms. B. Examples of Categories (1) Taking all sets as the objects, all mappings as the morphisms, and the composition of mappings as the composition, we obtain a category called the category of sets, denoted by (Sets) (or (Ens) from the french ensemble). For the empty set 0 we make the convention that Hom(0, Y) contains just one element for any Y and that Hom(Y, 0) is empty if.¥ # 0. (2) As we have seen, taking all groups as the objects and the homomorphisms as the mor- ph isms, we get the category of groups, written (Gr). If we limit the objects to t Abelian groups, we get the category of Abelian groups (Ab) as a full subcategory of(Gr). (3) Fix a ring R. The left R-modules and their R-Iinear mappings define the category of left R-modules, which we denote by Rull. The category of right R-modules, 'R, is defined similarly. When R is tunitary, we usually limit the objects of Rull and uII R to tunitary modules. If R is commutative we can identify Rull with uII R . When R=Z (the ring ofra- tiona1 integers), R.t{ can be identified with (Ab). When R is a field, Rull is also called the category of linear spaces over R. (4) Taking rings as objects and homomor- phisms of rings as morphisms, we obtain the category of rings. The subcategory consisting of unitary commutative rings and unitary homomorphisms is called the category of commutative rings and is denoted by (Rings). (5) If we take tdifferentiable manifolds as objects and differentiable mappings as mor- phisms, we obtain the category of differentiable manifolds. Similarly, for tanalytic manifolds and analytic mappings we obtain the category of analytic manifolds. (6) Taking topological spaces as objects and continuous mappings as morphisms, we get a category called the category of topological spaces and denoted by (Top). On the other hand, if we take the thomotopy classes of continuous mappings as morphisms and define their composition in the natural way, 52 D Categories and Functors we obtain another category, which is called the homotopy category of topological spaces. (7) Fix a tpreordered set I. Taking the ele- ments of I as the objects and the pairs (x, y) of elements of I with x.s;y as the (unique) mor- phism from x to y, we get a category, in which we define the composite of the morphisms (x, y) and (y, z) to be (x, z). In examples (1) through (6), the totality of the objects Ob('&') is not a set, but a tclass ( 381 Sets G; for the logical foundation of category theory  [3, 9J). C. Diagrams If a set of arrows {A,} and a set of points {Bp} are given in such a way that each arrow A, has a unique initial point and a unique endpoint, then we say that {A., Bp} is a diagram. (Usu- alIy, we consider the case where each point Bp is the initial point or the endpoint of at least one A, (Fig. 1).) Let'&' be a category and {A., Bp} a diagram. If we associate a mor- phism f, in '&' with each arrow A, and an object ZpE(e with each point Bp so that f,E Hom(Zp, Zy) whenever A, has the initial point Bp and the endpoint By, then we say that {I., Zp} is a diagram in the category'&' (Fig. 2). Suppose, furthermore, that the following con- dition is satisfied: For any pair of points Bp and By, and for any sequence of adjacent arrows A, , ..., A, starting at Bp and ending at By (i.e., he initi;l point of A'I is Bp, the endpoint of A'i is the initial point of A'i+1 and the endpoint of A'm is By), the composite jm 0 f. m - I 0...01. 1 (E Hom(B p , By)) depends only on Bp and By. Then the diagram in (e is said to be a commutative diagram. For exam- ple, commutativity of Fig. 2 is equivalent to 13 0 fl =f4 0 f2 = fs. B) Al Bz A 3 ! %fl A6 f 1 X t -X 2 f2!k, X'T X 4 Bo . B4 Fig. 1 Fig. 2 D. Miscellaneous Definitions A morphism f: X --> Y in a category'&' is called an isomorphism (or equivalence) if there exists a morphism g: Y-->X such that fog= l y , gof = I x. In this case, 9 is determined uniquely by f and is itself an isomorphism. We call 9 the inverse morphism of f Then the inverse of 9 is f An isomorphism is sometimes written f: X'4 Y. Two objects X and Yare said to be isomorphic if there is an isomorphism X --> Y, and then we write X  Y. The composite of 
52 E Categories and Functors isomorphisms is again an isomorphism. In particular, an isomorphism X -->X is an inver- tible element of the semigroup Hom(X, X), and is called an automorphism of X. Examples of isomorphisms are bijections in (Sets), group isomorphisms in (Gr), the R-isomorphisms in RJIt, ring isomorphisms in the category of rings, tdiffeomorphisms in the category of differentiable manifolds, and homeomorphisms in (Top). A morphism f: X --> Y is called a monomor- phism (or injection) if for any object Z and for any morphisms u, v:Z-->X(u#v) we have f 0 u # f 0 v. Dually, f: X --> Y is called an epi- morphism (or surjection) if for any u, v: Y--> Z(u # v) we have u 0 f # v 0 f In the category of sets the monomorphisms and the epimor- phisms coincide, respectively, with the injec- tions and the surjections as mappings ( 381 Sets). A monomorphism which is at the same time an epimorphism is called a bijection. An isomorphism is always a bijection, but the converse is false in some categories. Two monomorphisms fl :Xl-->X and f2: X 2 --> X (with the same X) are said to be eq uivalent if there exist 9 I: X I --> X 2 and g2:X2-->Xl such thatfl =f20g1 andf2= [10 g2 (Fig. 3). An equivalence class with respect to this equivalence relation is called a subobject of X. Similarly, we define a quotient object of X as an equivalence class of epimorphisms X --> Xi' Xl g,f!g, X X2 Fig. 3 An object e of a category  is called a final object of  if for every 0 bject Y of, Hom(Y, e) contains one and only one element. Dually, an object e' is called an initial object (or cofinal object) if Hom(e', Y) contains one and only one element for every YE. If el and e 2 are final objects, then there is a unique iso- morphism el e2' and similarly for initial objects. A set with only one element is the final object in (Sets), and a space with only one point is the final object in (Top). In the category (Gr) (resp. (Ab)), the trivial group {I} (or {O}) is the final object and the initial object at the same time. In the category of commuta- tive rings, the zero ring {O} is the final object and the ring of rational integers Z is the initial object. E. Product and Coproduct Let X I and X 2 be objects of a category . We say that a triple (P, PI, P2) consisting of an 204 object P and morphisms Pi: P --> X;(i = 1,2) is the product (or direct product) of X 1 and X 2 if for any pair of morphisms .f;: X --> Xi (i = 1,2), there exists a unique morphism f:X -->P with Pi 0 [= f; (i = 1,2) (Fig. 4). If (P', p'" P2) is an- other product of XI and X 2 , then by virtue of this definition there is a unique morphism f: P-P' such that p;of = Pi (i = 1, 2), and f is an isomorphism, The product is unique in this sense. The product (or anyone of the prod- ucts) of XI and X 2 is denoted by XI x X 2 or by Xl IT X 2 . X .f Xl P; P, X 2 Fig. 4 The product in the categories of sets, of groups, of rings, and of topological spaces coincides with the notion of tdirect product in the respective systems. In a general category, the product does not always exist. Suppose the product X x X exists for an object X; then there is a unique morphism x:X -->X X X such that 1 x = P 1 0 x = P2 0 x, which is called the diagonal morphism of X. Let /;: X --> X; (i = 1,2) be morphisms and assume that the products (X I X X 2 ,PI ,P2), (X'l x X 2 ,P'I' P2) exist. Then there is a unique morphism f: X I xX 2--> X; X X; satisfying P; 0 [= f; 0 p;(i = 1,2). This f is denoted by fl x f2' On the other hand, if gi: X -->Xi(i = 1,2) are given, the unique mor- phism g:X -->X l X X 2 with Piog=gi (i== 1, 2) is denoted by (gl,g2)' We have (gl,g2)=(gl x g2)0X if X x X exists. The dual notion of product is coproduct. We say that a triple (S,j"J2) of an object Sand morphismsji:Xi-->S (i= 1, 2) is the coproduct (or direct sum) of X I and X 2 if for any mor- ph isms /;: Xi-->X (i = 1,2) there exists a unique morphism [:S-->X with fOji= /; (i= 1, 2) (Fig. 5). The coproduct, like the product, is uniquely determined up to canonical iso- morphisms. It is denoted by Xl +X 2 or by Xl U X 2' The coproduct in (Gr) is the tfree product. In (Ab), or more generally in w$!, the product of two objects can be identified with the coproduct (= direct sum) ( 277 Modules F). The coproduct in the category of com- mutative rings is the ttensor product over Z. X Jl Xl]' ], X 2 Fig. 5 Product and coproduct can also be defined for a family {X;}iEI of objects. Namely, the 
205 product of {X;}iEI is an object P together with a family of morphisms Pi: P -->X i (i E 1) having the property that for any family of morphisms /;:X -->X i (iEI), there exists a unique morphism f: X --> P such that p, 0 f = /; (i E I). The product is unique up to canonical isomorphisms, and similarly for the coproduct ( Sections F, L). F. Dual Category In the theory of categories we often encounter the dual treatment of notions and propo- sitions. To be precise, we may define the no- tion of the dual category Cfjo of a category Cfj as follows: The objects of Cfjo are those of Cfj, i.e., Ob(CfjO) = Ob(Cfj); for any objects X and Y we put Hom"c(X, Y)= Hom,(Y, X); if f:X --> Y and g: Y -->2 in Cfjo (i.e., f: Y -->X and g: 2--> Y in Cfj), then the composite go fin CfjO is de- fined to be fog in Cfj. It is clear that Cfjo then satisfies the axioms of a category. Quite gener- ally, given a proposition concerning objects and morphisms we can construct another proposition by reversing the directions of the morphisms, and we call the latter the dual proposition ofthe former. The dual propo- sition of a proposition in Cfj coincides with a proposition in Cfjo For instance, a monomor- phism (epimorphism) in Cfj is an epimorphism (monomorphism) in Cfjo, and the final (initial) object in Cfj is the initial (final) object in Cfjo. The product (coproduct) in Cfj is the coproduct (product) in (f: 0 . Although the notion of the dual category is defined quite formally, it is useful in describing relations between specific categories. The dual category of (Ab), for instance, is equivalent to the category of commutative compact topological groups (tPontryagin's duality theorem). G. Categories over an Object Fix a category Cfj and an object S ECfj. A pair (XJ) of an object X E (f: and a morphism f: X -->S is called an object over S or an S-object, and f is called its structure morphism. We often omit f and simply say "an S-object X" if there is no danger of misunderstanding. If (X, f) and (Y, g) are S-objects, a morphism h: X --> Y such that f = 9 0 h is called an S- morphism from (X,f) to (Y,g). The category whose objects are the S-objects and whose morphisms are the S-morphisms is called the category of S-objects in (f:, and is denoted by CfjIS. It has (S, Is) as the final object. The prod- uct of two S-objects X and Y, taken in CfjjS, is called the fiber product of X and Yover S (in Cfj), and is denoted by X x s Y or XITs Y. The dual notion of the fiber product is called fiber sum (or amalgamated sum). Thus for two 52 I Categories and Functors morphisms f: S --> X and g: S --> Y, the fiber product of X and Yover S in Cfjo IS is the fiber sum of X and Y (with respect to S); it is de- noted by X Us y. Let Cfj be the category of commutative rings and K ECfj. Then the family of K-objects in (f:O is precisely the family of commutative K- algebras. The fiber product A xKB in Cfjo, i.e., A UK Bin Cfj, is the tensor product A @KB of algebras. H. Functors Let Cfj and Cfj' be categories. A covariant functor F from C(ij to Cfj' is a rule which associates (1) with each object X in Cfj, an object F(X) in Cfj', and (2) with each morphism f:X --> Y in (f:, a morphism F( f): F(X)-->F( Y) such that F(g 0 f) = F(g) 0 F(/), F(1 x) = 1 F(X)' A con- travariant functor is defined dually, by mod- ifying this definition to F(/):F(Y)-->F(X), F(g 0 f) = F(/) 0 F(g). Thus a contravariant functor from '{j to Cfj' is the same as a covariant functor from the dual category C(ij0 to Cfj' (or from Cfj to (f:'O). Functor is a general term for both covariant functors and contravariant functors, but some authors use the word ex- clusively in the sense of a covariant functor. A functor in several variables is defined to be a functor from the product category of the categories in which the variables take their values. A covariant functor F:Cfj-->Cfj' is said to be faithful (fully faithful) if for any X, Y E Cfj, the mapping Hom(X, Y)-->Hom(F(X), F(Y)) in- duced by F is injective (bijective), and simi- larly for contravariant functors. A faithful covariant functor F: Cfj -->(f:' which maps dis- tinct objects of '{f to distinct objects of Cfj' is called an embedding, and In this case '{f can be identified with a subcategory of (f:' by F. A fully faithful covariant functor F: Cfj -->Cfj' is called an equivalence (between the categories) if it satisfies the condition that for any object X' of (f:', there exists an object X of (f: such that F(X)X'. In this case we can consider the two categories essentially the same. A con- travariant functor from Cfj to (f:' which defines an equivalence from (f:O to Cfj' is called an antiequivalence. I. Examples of Functors (1) Let ((j be the category of groups (or rings). For any X E'ff let F(X) be the underlying set of X (i,e., the set obtained from X by "forgetting" its structure as a group or ring), and for any homomorphism f put F( f) = f Then we get a faithful covariant functor (often called the forgetful functor) F: (f: --> (Sets). 
52J Categories and Functors (2) Let (f: be any category and fix an object X of C{j, Then we get a covariant functor hx: C{j -->(Sets) as follows: With each YEC{j we associ- ate the set Hom(X, Y), and with each mor- phism f: Y --> Y' in C{j the mapping f 0 (where fo:Hom(X, Y)-->Hom(X, Y') is defined by (/0 )(g) = fog), Similarly we define a con- travariant functor h x: C{j --> (Sets) by h x (Y) = Hom(Y,X) and hX(j)=of (3) Let p: A --> B be a homomorphism of rings. With each left A-module M associate the tscalar extension p*(M) = B @A M, and with each A-homomorphism f associate the B-homomorphism p*(/) = 1 B @ f Then we get a covariant functor p*: Aull --> Bull. (4) Let R be a ring. With each left R-module M associate its dual module M* == HomR(M, R), and to each R-Iinear mapping f associate its tdual mapping 'f = 0 f Then we get a con- travariant functor Rull -->uII R , and similarly for 'R-->RuII. (5) For each differentiable manifold X let F(X) denote the commutative ring of the dif- ferentiable functions on X, and for each dif- ferentiable mapping f: X --> Y let F(/) be the ring homomorphism 0 f: F(Y)--> F(X). Then F is a faithful contravariant functor. (6) Fix an Abelian group A. By associating with each topological space X the cohomology group H(X, A) and with each continuous map- ping f:X --> Y the homomorphism H(Y, A)--> H(X, A) induced by f, we obtain a contrava- riant functor from (Top) to (Ab). (7) Fix a topological space X, and let T(X) be the set of the open sets in X. Then T(X) is ordered by inclusion, so it is a category ( Section B, nO. 7). The contravariant functors from T(X) to (Ab) are precisely the tpre- sheaves of Abelian groups over X. We can use any category instead of (Ab) to define a pre- sheaf over X ( 383 Sheaves). J. Natural Transformations Let C{j and C{j' be categories, and denote by Hom(C{j, C{j') the collection of all covariant functors C{j -->C{j'. Let F, G E Hom(C{j/f:'), A nat- ural transformation (or functorial morphism) from F to G is a function which assigns to each object X of (f: a morphism <p(X): F(X)--> G(X) in C{j' such that for any morphism f: X--> Y in C{j, the equation G(/) 0 <p(X) = <p(Y)o F(/) holds; in other words, the accompanying dia- gram is commutative: X it Y <p(X) F(X)  G(X) F(j) t t G(j) <p(y) F(Y)  G(Y) 206 A natural transformation between contrava- riant functors is defined similarly. For in- stance, let A and B be Abelian groups and let Hi(', A) and Hj(., B) be the contravariant functors of tcohomology viewed as functors (Top)-->(Sets). Then the natural transform- ations between them are the tcohomology operations. Let <p: F --> G be a natural transformation, and suppose that <p(X): F(X)-->G(X) is an isomorphism for every X EC{j. Then the inverse transformation G --> F of <p exists, and <p is called a natural equivalence (functorial isomor- phism or isomorphism) and is written <p: FG, Suppose that Ob(C{j) is a set. Then the collec- tion Hom(F, G) of all natural transformations F ->G is also a set, and hence we can consider Hom(C{j, C{j') a category in which the objects are the covariant functors C{j -->C{j', the morphisms are the natural transformations, and the com- position of morphisms is the natural one. Then Hom(C{jo, C{j') is the category of contravariant functors from C{j to C{j'. In particular, the cate- gory Hom(C{jo,(Sets) is sometimes denoted by. Given a category (f:, a covariant (resp. con- travariant) functor F: C{j -->(Sets), and an object X E C{j, we can define a canonical bijection <1J x : Hom(h x , F)::::; F(X) (resp. Hom(h x , F)::::; F(X)) by <1J x (<p) = <p(X)l x ' (The functors hx and h X were defined in Section I.) The inverse mapping of <1J x assigns to  E F( X) the natural transformation <p: hx --> F defined by <p( Y)u = F(U)(YEC{j). In particular, if we take F== hy(h Y), we obtain a canonical bijection Hom(h x ,hypHom(Y,X) (Hom(hX,h Y )::::; Hom(X, Y». It follows that there is a fully faithful contravariant (covariant) functor C{j--> Hom(C{j, (Sets)) (C{j --> Hom (C{jo, (Sets» =) which associates hx(hX) with X EC{j. K. Adjoint Functors Let F: C{j -->C{j' and F': C{j' -->C{j be covariant functors. Suppose that there is a rule which assigns to each pair of objects ME C{j and M' EC{j' a bijective mapping 0M,M': Hom.,(M, F'(M'))Hom..,(F(M), M') such that for any pair of morphisms N --> M in C{j and M' --> N' in C{j', the following diagram induced by the morphisms is commutative: R" " Home(M,F'(M'))  Hom(3'(F(M),M') 1 1 Home(N,F'(N'»  Hom(3'(F(N),N') Then we say that F is a left adjoint functor of F' and that F' is a right adjoint functor of F. We can regard Hom.,,(F(M), M') as a functor from C{j x C{j' (contravariant in the variable 
207 M EC{!, covariant in the variable M' EC{!') to the category of sets, and similarly for Hom'6(M, F'(M'). This commutativity of the diagram means that these two functors are isomorphic ( Section J). For instance, let A and B be rings and L a fixed B-A-tbimodule. Let F:AuII-->BuII and F': Bull --> Aull be the functors defined by F(M)=L0 A M, F'(M')=HomB(L,M'), where the assignment for morphisms is defined in the natural way. Then F is the left adjoint of F' and F' is the right adjoint of F. In parti- cular, let p: A --> B be a homomorphism and considcr the casc L = B. Then F is the functor p* : A-4{ --> Bull and F' is the functor p * : Bull --> AuII, so that p* is the left adjoint of p* (and p* is the right adjoint of p*) ( 277 Modules, K, L; for more examples of adjoint functors  [IIJ). L. Representation of Functors We begin by discussing an example. Let T be a set, and consider the following problem: Is it possible to find a group X and a mapping (: T -->X such that, for any group Y and for any mapping 1]: T --> Y, there exists a unique homomorphism u: X --> Y with u 0  == I]? The answer is yes; it is enough to take the tfree group X generated by T and the canonical injection, : T --> X (Fig. 6). On the other hand, let F( Y) be the set of all mappings T --> Y, and for each group homomorphism f: Y --> Y' define the mapping F(f):F(Y)-->F(Y') by F(/)I] = f 0 I] (I] E F( Y). Then we get a co- variant functor from the category C{! of groups to the category of sets, F:c{!-->(Sets). We can now reformulate the condition on X EC{! and ,; E F(X) as follows: For any yee and for any I] E F( Y), there exists a unique morphism u:X --> Y such that F(u)=I]. T E X--;,--y Fig. 6 Proceeding to the general case, let C{! be an arbitrary category and let F:c{!-->(Sets) be a functor. If there exist an object X ofc{! and an element, of F(X) satisfying the condition just stated (with the modification u: Y --> X in the contravariant case), then we say that the pair (X, ) represents the functor F, or less specifi- cally, that X represents F, and we call ,; the canonical element of F(X), We also say that F is representable, The condition stated above is a formulation of the so-called universal map- 52M Categories and Functors ping property. If (X', ') also represents F, the unique morphism u:X -->X' (or X' -->X) with F(ug =,;' is necessarily an isomorphism. When (X,) represents F, the natural trans- formation <p: hx-->F (h x -->F in the contrava- riant case) which corresponds to  by the canonical bijection <1J x : Hom(h x , F)F(X) is an isomorphism. Conversely, if there is a func- torial isomorphism <p: hx -->F (or h X -->F) for some X EC{!, then the object X represents F, with the canonical element of F(X) the element which corresponds to <p by the canonical bijec- tion <1J x , i.e.,  = <p(X) 1 x. We have already seen the example of a free group; here we list a few more examples. (1) Let {X;} iEI be a family of objects in a category C{!. For each YEC{! we put F(Y)=IL1Hom(Y, X,), and for each morphism f: Y --> Y' we define the mapping F(/):F(Y')-->F(Y) by F(/)(/;) = (/; 0 f). Then we get a contravariant functor F:c{! -->(Sets). A pair (X,';) which represents F (where EF(X)=I1iEIHom(X, Xi)) is the product of {X;}. Thus representability of F is equivalent to the existence of the product of {X;}, and similarly for the coproduct. (2) Let R be a ring, M a right R-module, and N a left R-module. For each Abelian group Y let F( Y) denote the set of the R-balanced map- pings M x N --> Y ( 277 Modules J). Since a homomorphism f: Y --> Y' induces a natural mapping F(f):F(Y)-->F(Y') by composition, we obtain a covariant functor F:(Ab)-->(Sets). This functor is representable: the pair consist- ing of the tensor product M @R N and the canonical mapping M x N --> M @R N repre- sent it. (3) Let R be a commutative ring and S a subset of R. For each commutative ring Y, let F( Y) denote the set of homomorphisms R --> Y that map the elements of S to invertible ele- ments of Y. As in the preceding example, we obtain a covariant functor F:(Rings)-->(Sets). This functor is represented by the tring of quotients S-I R and the canonical homomor- phism R -->S -I R. M. Groups in a Category Let C{! be a category with a final object e, and assume that a finite product always exists in (e. If an object G E C{! and morphisms \i: G x G--> G, [3:G-->G, s:e-->G are given such that the the diagrams of Fig. 7 are commutative, then (G, (x, p, c) is called a group in C{! (group object in C{! or C{! -group). If C{! is the category of sets, then (X defines a law of composition in the set G, and the image of e by s is the identity element and fJ(x) is the inverse of x, so that G is an ordinary group. If C{! is the category of topological spaces (analy- 
52N Categories and Functors tic manifolds, algebraic varieties, tschemes) then G is a ttopological group (tLie group, talgebraic group, tgroup scheme). We can also define the '6'-group by lifting the group concept in (Sets) to the category '6' by means of the functor h X . Namely, let G be an object of '6', and suppose that for each YE'6' the set hG(y) = Hom(Y, G) is equipped with a group structure and that for each morphism f: Y --> Y' the induced mapping hG(Y')-->hG(y) is a group homomorphism. In other words, suppose that h G is a contravariant functor from '6' to the category of groups. Then the object G with the additional structure on h G is called a '6'-group. This definition is equivalent to the one given above. GXGXGGXG lexa! !a GxG ' G GXeGXG PI! G GGXG ! !a e----.-.G Fig. 7 N. Additive Categories A category '6' is called an additive category if for each pair X, Y E'6', the set of morphisms Hom(X, Y) has the structure of an additive group such that (1) the composition ofmor- phisms is distributive in both ways: h 0 (/ + g) =hof +hog, (f +g)oh=foh+goh; (2) there exists an object 0' with Hom(O', 0') = {O}; (3) the product (or the coproduct) of any two objects exists. Then the object 0' in (2) is a final and initial object, and is called the zero object. Both the product and the coproduct of any two objects exist and can be identified. The dual category of an additive category is also an additive category. A functor F from an additive category to another is called an addi- tive functor if F(f +g)=F(f)+F(g) holds for morphisms. In an additive category '6', Hom(X, Y) is an additive functor from '6' to (Ab) in each variable. For any ring R, the category ofleft (or right) R-modules is an additive category. The follow- ing definitions are generalizations of the corre- sponding concepts in the theory of modules. The kernel of a morphism f: A --> B is a pair consisting of an object A' and a monomor- phism i:A'-->A withfoi=O, such that any morphism u:X -->A with fou=O is divisible by i (that is, u=iov for some v:X -->A'). Dual- ly, the cokernel of f is a pair consisting of an object B' and an epimorphismj:B-->B' with j 0 f = 0 which divides any morphism u:B-->X with uof=O. We write A' = Kerf, B' 208 = Coker f. The kernel of j: B -->Coker f is called the image of f and is denoted by Imf; the cokerne1 of i: Ker f --> A is called the coimage of f and is denoted by Coim f. If all these exist, it follows from the definitions that there is a unique morphism Coimf-->Imf such that the composite of A -->Coimf --> Imf --> B is equala to f. An additive category '6' is called an Abelian category if it satisfies the following conditions: (1) every morphism has a kernel and a coker- nel, (2) for every morphism f, the morphism Coimf --> Imf just mentioned is an isomor- phism. The dual category of an Abelian cate- gory is also Abelian. The categories of Abel- ian groups, of R-modules, and of sheaves of ((.I-modules on a tringed space (X, ((.I) are important examples_ of Abelian categories. Many propositions which are valaid in (Ab) remain valid in any Abelian category. In parti- cular, the notion of an texact sequence is de- fined in an Abelian category in the same way as in (Ab), and the fiber product and fiber sum of a finite number of objects always exist in an Abelian category. A functor between Abelian categories which carries exact sequences into exact sequences is called an exact functor; (such a functor is automatically additive). If '6' is a category of which Ob('6') is a set and if '6" is an Abelian category, then Hom('6', '6") is an Abelian category. Given an Abelian category '6' and a subcategory '6" which satisfies certain conditions, (:me can construct an Abelian category '6'1'6" which is called the quotient category (Serre's theory of classes of Abelian groups;  [8J). If '6' is an Abelian category of which Ob('6') is a set, there is an embedding of '6' into the category RA of modules over some ring R by a fully faithful flat exact covariant functor (full embedding theorem, B. Mitchell, Amer. J. Math., 86 (1964)). This remarkable theorem enables us to extend results obtained for modules to the case of Abelian categories. The notions of category and functor were introduced in [7J and were applied first in topology and then in homological algebra and algebraic geometry ( 200 Homological Algebra). References [IJ S. Eilenberg and N. Steenrod, Foundations of algebraic topology, Princeton Univ. Press, 1952. [2J H. Cartan and S. Eilenberg, Homological ajgebra, Princeton Univ. Press, 1956. [3J S. MacLane, Homology, Springer, 1963. 
209 [4J P. Freyd, Abelian categories, Harper & Row, 1964. [5J B. Mitchell, The theory of categories, Academic Press, 1965. [6J I. Bucur and A. Deleanu, Introduction to the theory of categories and functors, Wiley- Interscience, 1968. [7J S. Eilenberg and S. MacLane, General theory of natural equivalences, Trans. Amer. Math. Soc., 58 (1945), 231- 294. [8J A. Grothendieck, Sur quelques points d'algebre homologique, Tohoku Math. J" (2) 9 (1957), 119- 221. [9J p, Gabriel, Des categories abeliennes, Thesis, Paris, 1962. [IOJ N. Y oneda, On universality I, II (in Japa- nese), Sugaku, 13 (1961-1962), 109-112; 14 (1962-1963), 39-43. [IIJ S, MacLane, Categorical algebra, Bull. Amer. Math. Soc., 7I (1965),40-106. 53 (XXI.17) Cauchy, Augustin Louis The French mathematician Augustin Louis Cauchy (August 21, 1789-May 25,1857) graduated from the Ecole Poly technique in 1807 and from the Ecole des Ponts et Chaus- sees in 1810, to become a civil engineer. In 1816, his mathematical works were recognized, and he was appointed a member of the Aca- demie des Sciences while a professor at the Ecole Poly technique. After the July revolution in 1830, he refused to pledge loyalty to Louis- Philippe and fled to Turin; he later moved to Prague. He returned to France after the revo- lution of 1848 and became a professor at the University of Paris, where he remained until his death. He was a Catholic and a Royalist all his life. His scientific contributions were numer- ous and covered many fields. In algebra, he did pioneering work in tdeterminants and in the theory of tgroups. He also made notable achievements in theoretical physics, optics, and the theory of elasticity. His main field was analysis. He was interested in making analysis rigorous by giving calculus a solid foundation in such works as Cours d'analyse de l'Ecole Poly technique (1821). In his paper "Memoire sur les integrales definies prises entre les Iimites imaginaires" (1825), he proved the main theorem of the theory of functions of a complex variable. Another important work is his proof of the existence theorem for the solutions of tdifferential equations in the cases of real and complex variables 54 Cayley Algebras References [IJ A. Cauchy, Oeuvres completes 1.1-12; 11.1-14, Gauthier-ViIlars, 1882-1958. [2J F. Klein, Vorlesungen iiber die Entwick- lung der Mathematik im 19. Jahrhundert I, Springer, 1926 (Chelsea, 1956). 54 (111.21) Cayley Algebras Let Q be a tquaternion algebra over a field K of characteristic zero. A general Cayley algebra Cf! is a 2-dimensional Q-tmodule Q + Qe with the multiplication (q+re)(s+te)=(qs+yTr)+ (tq + rs)e, where q, r, s, t E Q, y is a given ele- ment in K, and T, s are the tconjugate quater- nions of t, s, respectively. The elements of ((j are called Cayley numbers; Cf! is a nonassocia- tive, talternative algebra of dimension 8 over K ( 231 Jordan Algebras). The map a = q+re-->a=q -re is an tantiautomorphism of'ili'. Define two maps Cf! --> K by N (a) = aa = aa (norm of a) and T(a)=a+a (trace of a). Then every a in Cf! satisfies the equation x 2 - T(a)x + N(a)=O. Furthermore, N(ab)=N(a)N(b) for a, b in Cf!. The tquadratic form N(x)= T(xx)j2 characterizes Cf!, In particular, any two (non- associative) general Cayley algebras over the same field K which are not talternative fields are isomorphic. I n order for Cf! to be an alternative field, either of the following two conditions is neces- sary and sufficient: (i) N(a)=O implies a=O; (ii) Q is a noncommutative division algebra and y cannot be expressed in the form 0"2_ A.e - J1'1 2 + }'J1(Z (O",, '1, (E K). (For the mean- ing of A., J1 with respect to Q  29 Associative Algebras D.) Every alternative field of finite dimension is a general Cayley algebra. In particular, when Q is the tquaternion field over the real number field with }, = J1 = -1, the general Cayley algebra over Q with y = -1 is called the Cayley algebra. When K is an talgebraic number field of finite degree, there are only a finite num ber of nonisomor- phic general Cayley algebras over K. The Lie algebra 'D(Cf!) of all tderivations of a general Cayley algebra Cf! is a tsimple Lie algebra of type G 2 . If K is the real number field, the identity component of the group of all tautomorphisms of the Cayley algebra Cf! is a compact simply connected tsimple Lie group of type G 2 . The Cayley algebra Cf! is the unique alternative field over the real number field K. This last fact is important because of the fol- lowing proposition: In the theory of tnon- 
54 Ref. Cayley Algebras Desarguesian projective planes, the field which gives rise to the coordinates is an alternative field. Let 5! P 2 be the set of all 3 x 3 tHermitian matrices A over the Cayley algebra Cfj such that tr A = 1, A 2 = A. Then we can define a structure of a projective plane on 5!P 2 , which with this structure is called the Cayley projec- tive plane. Furthermore, let 3 be the set of all 3 x 3 Hermitian matrices over Cfj, with a multi- plication in 3 defined by A' B=(\j2)(BA + AB). The identity component G of the group of all automorphisms of 3 is a compact simply connected simple Lie group of type F4' This group G acts on 5!P 2 transitively, and 5!P 2 = GjSpin(9) ( 249 Lie Groups; Appendix A, Table 5.111). References [1] L. E. Dickson, Algebren und ihre Zahlen- theorie, Orell Fiissli, 1927. [2] M. Zorn, Alternativkorper und quadra- tische Systeme, Abh. Math. Sem. Univ. Ham- burg, 9 (1933), 395-402. [3] N. Jacobson, Cayley numbers and normal simple Lie algebras of type G, Duke Math. J., 5 (1939), 775-783. 55 (XX.6) Celestial Mechanics A. General Remarks The motions of planets, comets, the moon, and satellites in our solar system are the main topics in celestial mechanics ( 392 Spherical Astronomy). However, studies in this subject can also include motions of fixed and binary stars in our galaxy, equilibrium figures of celestial bodies, and rotational motions of the earth and the moon. Celestial mechanics is usually based on tNewtonian mechanics, with the effects of tgeneral relativity sometimes taken into ac- count to determine corrections to the orbits of celestial bodies. Therefore the main task of celestial mechanics is to solve differential equations of motion based on Newtonian mechanics. However, since the equations for the problem of n > 2 bodies cannot be solved rigorously ( 420 Three-Body Problem), appropriate methods have been devised where- by we may obtain approximate solutions of the equations with accuracy comparable to that of observations, The two-body problem, which concerns the behavior of two celestial bodies regarded as 210 points exerting mutual interactions, can be reduced to a one-body problem with reference to a central force, since integrals of motion of the center of gravity for the system exist. The tHamilton-Jacobi equation for the one-body problem is of tseparable type and can be solved completely. The orbit for the two-body problem is a tconic with one of its tfoci at the center of gravity. The majority of celestial bodies in the solar system actually perform elliptic motions. tKepler's orbital elements for elliptic motion are functions of the integration constants in the solution of the Hamilton- Jacobi equation and are determined by the initial conditions. B. Perturbations In studying the tn-body problem, we first solve certain two-body problems and then apply the method of tperturbations, i.e., the method of variation of constants, in order to obtain solu- tions developed as tpower series of small parameters. The parameters are ratios of the masses of planets to that of the sun for plane- tary motions and the ratio of the geocentric lunar distance to the solar distance for lunar motion. Electronic computers have made it possible to compute planetary coordinates for long intervals of time by solving numerically dif- ferential equations of motion including all possible interactions. However, in discuss- ing the stability of the solar system, analytic methods are more effective, particularly the method of obtaining secular perturbations by eliminating short-periodic terms by canonical transformations. This is one of the averaging methods of solving differential equations. However, as the solutions obtained by means of perturbation methods are not always con- vergent, most important problems related to the stability of motion have not yet been solved rigorously. Secular perturbations for planetary motions can be derived by solving differential equations that are linearized by neglecting cubic powers of orbital eccentric- ities and inclinations to the ecliptic ( 309 Orbit Determination), which are small quan- tities. The teigenvalues of these linear differen- tial equations correspond to mean angular velocities of the perihelion and the ascending node. The equations for the eigenvalues are called secular equations. C. Artificial Satellites To discuss motions of artificial satellites close to the earth, the latter cannot be regarded as a 
211 point or as a sphere but must be assumed to be an oblate spheroid, i.e., an tellipsoid of revolution. The effects of oblateness on the motion of satellites can be derived as pertur- bations of the theoretical elliptic motions ob- tained as the solutions of this two-body prob- lem under the assumption that the earth is spherical. Also, by utilizing a special potential very close to the geopotential, we can find a Hamilton-Jacobi equation of separable type which is solvable. This special potential ap- pears in the problem of two fixed centers with equal masses situated on an imaginary axis. When the geopotential is assumed to be axi. ally symmetric, the eq uations of motion for the satellites have two degrees of freedom; there- fore there appear two fundamental frequencies related to the special potential. When these two frequencies are equal, the problem is called a critical inclination problem and is important from the mathematical point of view. Theories for satellites can be applied to the motions of fixed stars in the galaxy. D. Equilibrium Figures There is a large literature concerning equilib- rium figures and the stabilities of celestial bodies assumed to consist of spinning fluids. The two-body problem with tidal interactions is particularly important; problems concerning the evolution of the earth-moon system are special cases of such a problem. The theory of rotation of the earth as it is affected by tprecession, tnutation, and latitude variations is also a part of celestial mechanics; for this theory, elastic theory and geophysics are applied. For the n-body problem  420 Three-Body Problem. References [IJ F, F, Tisserand, Traite de mechanique celeste 1-4, Gauthier-Villars, 1889-1896. [2J H. Poincare, Les methodes nouvelles de la mechanique celeste I, II, III, Gauthier-Villars, 1892-1899. [3] F. L. Moulton, An introduction to celestial mechanics, Macmillan, 1914. [4J A. Wintner, The analytical foundation of celestial mechanics, Princeton Univ. Press, 1941. [5J W. M. Smart, Celestial mechanics, Long- mans-Green, 1953. [6] D. Brouwer and G. M. Clemence, Methods of celestial mechanics, Academic Press, 1961. [7J R. Deutsch, Orbital dynamics of space vehicles, Prentice-Hall, 1963. 56B Characteristic Classes [8J Y. Hagihara, Celestial mechanics. I, Dy- namical principles and transformation theory, MIT Press, 1970; II, Perturbation theory, pts. 1,2, MIT, 1972; III, Differential equations in celestial mechanics, pts. 1,2, Japan Society for the promotion of Science, 1974; IV, Periodic and quadi-periodic solutions, pts. 1,2, Japan Society, 1975; V, Topology of the three-body problem, pts. 1,2, Japan Society, 1976. [9J C. L. Siegel and 1. K. Moser, Lectures on celestial mechanics, Springer, 1971. [lOJ E. L. Stiefel and G. Scheifele, Linear and regular celestial mechanics, Springer, 1971. 56 (IX.14) Characteristic Classes A. General Remarks The theory of characteristic classes arose from the problem of whether or not there exists a ttangent r-frame field on a tdifferentiable manifold (E. Stiefel [5J) ( 105 Differentiable Manifolds). The importance of the character- istic class as a fundamental invariant oftvec- tor bundle structure is now fully recognized ( 114 Differential Topology, 147 Fiber Bundles, 427 Topology of Lie Groups and Homogeneous Spaces). B. Stiefel-Whitney Classes Let (= (E, B, R") be an n-dimensional real tvector bundle (called an Rn-bundle) with tparacompact Hausdorff tbase space B, fiber Rn, and torthogonal group O(n) as the tstruc- ture group. Then the element w i (,) of the i- dimensional tcohomology group H'(B; Z2) (i = 1,2, .. . , n) of the base space B with coeffi- cient in Z2 = Zj2Z, called the i-dimensional (or ith) Stiefel-Whitney class, and the element w() = 1 + W 1 «() +'" + w n «() of the tcohomology ring H*(B; Z2), called the total Stiefel-Whitney class, are defined as follows. First, we deal with the case n= 1. We call the infinite-dimensional real projective space the tinductive limit POO(R) = lim pn(R) of the finite-dimensional real tpro- --> jective space pn(R). The nontrivial t1ine bundle Yl over the infinite-dimensional real projective space POO(R) is a universal RI-bundle (tuniver- sal bundle for the orthogonal group 0(1)); therefore any RI-bundle  =(E, B, RI) is equiv- alent to an tinduced bundle from the universal bundle Yl by a tclassifying mapping k B--> POO(R):  = ftYl' We define the I-dimensional universal Stiefel-Whitney class W dYd to be the generator of HI (POO(R); Z2) and set WI () = 
56C Characteristic Classes f{*w l (Yd. For general n, we consider the prin- cipal O(n)-bundle (P, B, O(n)) tassociated with the given Rn-bundle (=(E, B, R n ). Let Qn be the subgroup of O(n) consisting of all diag- onal matrices. Then the orbit space Y = PIQn is the base space of the tprincipal Qn-bundle I] =(P, PIQn' Qn)' Let p: Y -->B = PIO(n) be the natural projection. Then the Rn-bundle p* over Y induced by p is associated with I] and is equivalent to the tWhitney sum of n line bundles (F. Hirzebruch [4J), P*(=(I EB.., EB n' Moreover, the homomorphism p*:H*(B; Z2)-->H*(Y,Z2) is injective (A. Borel [lJ). Therefore we can uniquely define the total Stiefel- Whitney class w«() of the Rn-bundle by the relation p*w() =w( d... w(n)' The Stiefel- Whitney classes defined above are compatible with bundle mappings f: w(/*) =.f*w() ( 147 Fiber Bundles M). The Stiefel-Whitney class W i (Yn)EH i (B o (n);Z2) (l.s;i.s;n) of the universal Rn-bundle Yn over the tclassifying space Bo(n) is called the i-dimensional univer- sal Stiefel-Whitney class. For the Whitney sum of vector bundles, we have w( EB 1])=w()w(I]). In order for an Rn-bundle  to be torient- able, namely, for  to have an SO(n)-structure, it is necessary and sufficient that WI()= O. For an oriented Rn-bundle , the Euler- Poincare class Xn() is defined to be the tpri- mary obstruction Xn()E Hn(B; Z) for con- structing a tcross section of the tassociated (n - 1 )-sphere bundle. In particular, the Euler- Poincare class of the universal bundle for SO(n) is called the universal Euler-Poincare class. Xn() mod2 is equal to wn()' Ifn is odd, we have 2Xn()=0. C. Chern Classes We consider an n-dimensional complex vector bundle w=(E, B, C") (calJed C"-bundle in the following) with a paracompact Hausdorff base space B, fiber C", and unitary group U (n) as the structure group. The cohomology class C;(W)E H 2i (B; Z) (i = 1, 2,..., n), calJed the 2i- dimensional (or ith) Chern class, and the total Chern class c(w)= 1 +cdw)+... +Cn(W)E H*(B; Z) are defined as follows. In the case n= 1, let C'" =limC" (the inductive limit of --> the tcomplex Euclidean spaces cn), S'" be its unit sphere, and POO(C) be the infinite- dimensional complex projective space consist- ing of all complex lines through the orgin 0 of COO. Then the natural mapping S'" -->pOO(C) defines a universal principal U(I)-bundle (SOO, POO(C), U(I)). Let YI be its associated universal Cl-bundle. Then we define the 1- dimensional universal Chern class C I (YI)E H 2 (POO(C); Z) to be the cohomology class that takes the value -Ion the cycle S2(::::; pI (C) C 212 POO(C)) with the natural orientation. Since a general CI-bundle (=(E, B, C I ) is induced from YI by a classifying mapping f( B--> POC(C), we set cd)= ftCl(YI)' When n> 1, let (P, B, U(n)) be the principal U(n)-bundle asso- ciated with the given C"-bundle (=(E, B, C"). Let T" be the subgroup of U(n) consisting of all diagonal matrices (which is a tmaximal torus of U(n)). Then the quotient space Y == PIT" is the base space of the principal T,,- bundle I] =(P, PIT", T,,). Let p: Y --> B= P IU(n) be the natural projection. Then the C"-bundle p* over Y is associated with I] and is equiva- lent to the tWhitney sum of n complex line bundles: p*  =  I EB ... EB (n. Moreover, p*: H*(B; Z)-->H*(Y; Z) is a monomorphism (Borel [IJ, Hirzebruch [4J). Therefore we can uniquely define the total Chern class c( () of the C"-bundle , by the relation p*c() =c«( d... c(n)' The Chern classes, as defined above, are compatible with bundle mappings f (i.e., c(f*)= f*c@) ( 147 Fiber Bundles N). The Chern class C i (Yn)EH 2i (B v (n);Z) (l.s;i.s;n) of the universal Cn-bundle Yn over BU(n) is called the 2i-dimensional universal Chern class. Let , I] be complex vector bundles over B. Then we have c( EB 1]) =c()c(I]). By the nat- ural inclusion U(n)cSO(2n), we can identify a C n -bund1e W with an oriented R 2 n-bundle wR' Then we have c i (w)mod2=w 2 ;(w R ), W2i+dWR)=0 (i=0, 1,..., n), cn(w) =X 2n (WR)' Examples. Let (S2n+\ pn(C), U(I)) be the tHopf bundle, and let Y be the associated complex line bundle. Then the classifying mapping of Y is the natural inclusion pn(C)-->BU(I)=POO(C), and c(y) = 1 -gn' where gnE H 2 (p n (C); Z) is the cohomology class dual to the homology class represented by the hyperplane pn-l (C). On the other hand, for the complex line bundle  = {pn-l(C)} determined by the tdivisor ( 72 Complex Manifolds) pn-I (C) C pn(C), we have c(7) = 1 + gn, and , Y are dual to each other.  is called the canonical line bundle over the complex projective space. Moreover, the Whit- ney sum r EB £1 of the tangent bundle r(pn(C)) of the n-dimensional complex projective space pn(C) and the trivial CI-bundle £1 is equivalent to the Whitney sum of(n+ 1) copies of 7. Therefore we have c(r(pn(C))) =c( EB 7 EB ...EBW=(1+gnt+ l . D. Pontryagin Classes Utilizing the inclusion mapping O(n)c U(n) of structure groups, we can associate a Cn-bundle c= EBj=!  with an Rn-bundle. We define the 4i-dimensional (or ith) Pontryagin class of the Rn-bundle  by p;() =( -1)ic2i(d (E H 4i (B; Z)) (i = 1, 2,..., [nI2J) (Hirzebruch 
213 [4J). In particular, the Pontryagin classes of the universal bundle for O(n) are called the universal Pontryagin classes. We also define the total Pontryagin classes p() = 1 + pd) + ... +P[n/2](()' We have 2C2i+1(d=0. For Rn_ bundles (and 1], p( ffi 1])=p(()p(l]) modulo 2- torsion elements. We have Pi(() (mod2)= (W2i(»)2 and Pn()=(X2n()2 for the oriented R 2 n-bund1e . Moreover, for a complex vector bundle w, we have (Wu [8J) 2k (-I)k p d w R)= L (-I)iCi(W)C2k_i(W), i=O where we put co(w) = 1. All such classes as defined in Sections B- D are called characteristic classes. E. Other Definitions of Characteristic Classes Axiomatic Definition. (1) For a C"-bundle ( over a paracompact Hausdorff base space B, Chern classes Ci(()E H 2i (B; Z) (i;, 0) are de- fined, and we have c o (()= 1, C;(() =0 (i>n). (2) For the total Chern class c«()=Loc;(), we have c(f*()= f*c() for each bundle mapping f (3) For the Whitney sum, we have c( ffi 1])= c(). c(I]). (4) Normalization condition: For the canonical line bundle (, we have c«() = 1 + gn ( Section C). We can verify the existence and uniqueness of Ci() satisfying these four conditions, so Chern classes can be defined axiomatically by these conditions (Hirzebruch [4J). We can similarly defined Stiefel-Whitney classes axiomatically. Definition in Terms of Obstruction Classes. When the base space is a tCW complex, we can define the Chern class of a C"-bundle ( over B as follows: Let v..,n-q+1 (C) = U(n)/I n _ q +1 x U(q-l) be the tcomplex Stiefel manifold of aU orthonormal (n - q + 1 )-frames in C" with Hermitian metric, Then v...n-q+l (C) is t(2q - 2)- connected, and its (2q -I)-dimensional thomotopy group n 2q - I (v",n-q+I(C»=Z, Let (' be the tassociated bundle of  with fiber v",n-q+1 (C). Then the tprimary obstruction (E H 2 q(B; Z») to constructing a tcross section of ' coincides with the Chern class Cq()' Anal- ogously, we can interpret Wq() for an R n _ bundle  as an obstruction class ( 147 Fiber Bundles). Definition in Terms of Schubert Varieties. We denote by C k the subspace defined by Zk+l =Zk+2 =... =Zn+N = 0 of the space cn+N = {(Zl' ""zn+N)lziEC,i= 1, ...,n+N}, and fix the sequence of subs paces C I c C 2 c... C cn+N. The set of all complex n-planes X through the origin 0 in C n + N forms the tcomplex Grassmann manifold Mn+N,n(C)' We denote 56E Characteristic Classes by E(y:) the set of all pairs (X, v), where XEMn+N.n(C) and v is a vector in X. Then we can define a 2N-universal complex n- dimensional vector bundle YnN with base space Mn+N.n(C), with total space E(Yn N ) and projec- tion (X,v)-->X. Let w=(w(I), ""w(n)) be a sequence of integers satisfying the condition O.s; w(I).s; ....s; w(n).s; N. Then the set e", of aU n-planes Xc cn+ N through the origin 0 satis- fying dim(X n Ci+",(i)) = i, dim(X n Ci+",(i)-I) = i-I, i== 1,2, ...,n, forms a real (2L7=1 w(i»- dimensional open tcell. The set of all these open cells e", gives a tcellular subdivision of Mn+N.n(C) as a CW complex. The closure e", of e", is a cellular subcomplex of Mn+N.n(C), called a Schubert variety. This is a tpseudo- manifold with canonical orientation and represents a (2 L7=1 w(i))-dimensional inte- gral cycle, called a Schubert cycle. We denote eO) by (w(I),... ,w(n». AU these homology classes form the basis of the homology group H*(Mn+N,n(C); Z). The cocycle dual to the cycle (0, ... , 0 1, ... , 1) represents the Chern class '-v----' '--v--' n-q q Cq(Y:)E H 2 q(M n + N ,n(C); Z), For the real Grass- mann manifold Mn+N,n(R), we can analogously define the universal Stiefel-Whitney classes. Thom's Definition. Let ( be an Rn-bundle over B, B be its tThom space, and U E Hn(Bf.; Z2) be the tfundamenta1 class of B{. Letj:B-->Bf. be the inclusion induced from the zero cross section and <p:Hk(B;Z2)Hk+n(B;Z2) be the tThom-Gysin isomorphism. Then we havej*U ==wn()' <p-I(SqiU)==Wi(() (O.s;i.s;n), where Sqi is the tSteenrod square (R. Thom [6J). Definition in Terms of Differential Forms. Let B be a tdifferentiable manifold and  = (p{, B, U(n») be a differentiable principal U(n)- bundle over B. Let 0=(0,) be the tcurvature form corresponding to the tconnection form w=(w ij ), i,j= 1,..., n, on P. Then Oij is a complex-valued 2-form, and Qii== -Oji' For the matrix 0, we consider the differential form 1jJ= L IjJq==detlI + (2nj=1r I0 1, q where I is the unit matrix, the multiplication in the determinant is the texterior product, and IjJq is the part of degree 2q in 1jJ. Then IjJ is defined as a real form independent of the con- nection w. We have dljJq=O, and the coho- mology class of( -l)qljJq in H 2 q(B;R) is the Chern class c q «() with real coefficients (Borel and Hirzebruch [2J, S. S. Chern [3J). Definition in Terms of Symmetric Polynomials. ( 427 Topology of Lie Groups and Homoge- neous Spaces.) 
56 F Characteristic Classes F. Characteristic Classes of Manifolds For a differentiable (complex or almost com- plex) manifold M, the characteristic classes of its tangent bundle are calIed characteristic classes of the manifold M. We shaII denote Stiefel-Whitney classes, Pontryagin classes, Euler-Poincare classes, and Chern classes of M by w;(M), p;(M), Xn(M), and c;(M), respec- tively. These are invariants of differentiable structures, orientations, or (almost) complex structures of a manifold M if M is a differ- entiable, oriented differential, or (almost) com- plex manifold. By the Stiefel-Whitney num- bers of an n-dimensional manifold M, we mean the values of n-dimensional monomials of Stiefel- Whitney classes of M on the fun- damental homology class «wdM)'IW2(M)'l... wn(M)'n) [MJ EZ 2 , where r l + 2r 2 +... + nr n = n, ri;'O. We can define integer-valued Pon- tryagin numbers and Chern numbers similarly. These numbers are called generally character- istic numbers of the given manifold. In partic- ular, Xn(M) [MJ = X(M) is the tEuler-Poincare characteristic. In the case of topological manifolds, we can define characteristic classes in the folIow- ing sense. Let M be a closed n-dimensional topological manifold and x n the genera- tor of H"(M; Z2)' By defining Xi(yn-i)= (x i yn-i)[MJEZ 2 for XiE Hi(M; Z2), yn-i E Hn-i(M; Z2), we have an isomorphism Hi(M;Z2)Hom(Hn-i(M,Z2); Z2)' The ele- ment UiE Hi(M; Z2), corresponding to the homomorphism yn-i-->Sqiyn-i[MJ under this isomorphism is called the Wu class of M, where Sqi is the tSteenrod square. Moreover, we calI Wj= L{o Sqh UiE Hj(M; Z2) the Stiefel- Whitney class of the topological manifold M. Then for any tdifferentiable structure £0, we have wiM,£0)=Wj' Therefore Stiefel-Whitney classes of differentiable manifolds are topolog- ical invariants (more precisely homotopy type invariants) (Thom [6J, W. T. WU [8J). J. W. Milnor [9J proved that Pontryagin classes of differentiable manifolds are not topological invariants. The image of Pi(M) by the homo- morphism H 4i (M; Z)-->H 4i (M; Q) induced by the inclusion ZcQ (the rational number field) is called the rational Pontryagin class. In 1966, S. P. Novikov [10] proved the topological invariance of the rational Pontryagin class ( Section H). G. Index Theorem for Differentiable Manifolds Let M be an oriented closed manifold of di- mension 4k. Putting f(x, y) = x. y[ MJ for elements x, y of the 2k-dimensional real coho- 214 mology group H 2k (M; R), we obtain a bilinear form on H 2k (M; R). The tsignature of the quadratic form f(x,x) (namely, (number of positive terms) -(number of negative terms) in its canonical form) is a topological invariant (homotopy type invariant) of the manifold M. We call it the index or the signature of the manifold M and denote it by r(M). If the di- mension of M is not divisible by 4, we define r(M) = O. For the product of manifolds we have r(M x N)=r(M)'r(N). Also r(M) is an invariant of the tcobordism class of M (Thom [6J). The index r of a differentiable manifold gives a homomorphism of the tcobordism ring Q into the ring Z of integers. Hirzebruch inves- tigated the multiplicative property of rand gave its expression by means of Pontryagin numbers. Let Pi be the ith telementary sym- metric function of indeterminates [31, .., , [3n. Then a homogeneous part of the formal power senes fI fl; i1 tanhfl; of PI' .,. , [3n is a symmetric polynomial of PI' ..., [3n, and therefore a polynomial of Pi with rational coefficients. For k.s; n, we de- note the homogeneous part of degree k by Lk(PI"'" P k ). SpecificalIy, if Pi are the Pontrya- gin classes Pi(M 4k ) of a 4k-dimensional closed differentiable manifold M 4k , then Ld PI' '" , P k ) is a 4k-dimensional cohomology class of M 4k . Then we have the formula r(M4k) = LdPl' ..., Pd[M 4k J, calIed the index theorem of differentiable mani- folds (or Hirzebruch index theorem). For exam- ple, LI = (1/3)P" L 2 = (1/45)(7 P 2 - PI2), and L 3 =(1/945)(62P 3 -13P 2 P I +2ph... ( 114 Differential Topology). Later this index theo- rem was generalized to the tAtiyah-Singer index theorem ( 237 K-Theory). H. Combinatorial Pontryagin Classes Let K be an oriented n-dimensional compact thomology manifold, and let E' be the bound- ary of an oriented (r+ I)-simplex, namely, the combinatorial r-sphere. Let f: K -->En-4i be a tpiecewise linear mapping. Then for almost all points y of E n - 4i , f- I (y) is an oriented 4i- dimensional compact homology manifold, and its index r(/-I(y)) is independent of y. We denote this by r(/). The r(j) is an invariant of the homotopy class of f Let 0" be the funda- mental class of Hn-4i(E n - 4i ; Z). Then for n;' 8i + 2, there exists a unique cohomology class li= li(K)EH 4i (K; Q) such that for any piece- wise linear mapping f: K -->E n - 4i , we have 
215 (li'f*O")[KJ=r(/), We can remove the restric- tion n;' 8i + 2 if we take K x r m for K and define li(K) to be li(K x r m ) for sufficiently large m. If K is a tCI-triangulation of a dif- ferentiable manifold M, I;(K) coincides with the class Li(PI' ..., P;) defined by Hirzebruch (R. Thom [7]; V. Rokhlin and A. Shvarts), where P; = pj(M) is the Pontryagin class of M, Since the variable Pi can be expressed as a polynomial with rational coefficients of L)Pl' ..., P),j.s; i, we define the combina- torial Pontryagin class p;(K) of a homology manifold K as the polynomial of lj(K) with rational coefficients. Therefore, if K is a C I - triangulation of a differentiable manifold M, we have Pi(K)= Pi(M) (modulo torsion ele- ments), The class I;(K) and consequently Pi(K) are important combinatorial invariants of K. These classes are topological invariants [10]. References [IJ A. Borel, Sur la cohomologie des espaces fibres principaux et des espaces homogenes de groupes de Lie compacts, Ann, Math., (2) 57 (1953),115-207. [2J A. Borel and F. Hirzebruch, Characteristic classes and homogeneous spaces I, II, III, Amer. J. Math., 80 (1958), 459-538; 81 (1959), 315-382; 82 (1960), 491-504. [3J S. S. Chern, Characteristic classes of Her- mitian manifolds, Ann. Math., (2) 47 (1946), 58-121. [4J F. Hirzebruch, Topological methods in algebraic geometry, Springer, third edition, 1966. [5J E. Stiefel, Richtungsfelder und Fernparal- lelismus in n-dimensionalen Mannigfaltig- keiten, Comment. Math, Helv., 8 (1936), 3-51. [6J R. Thom, Espaces fibres en spheres et carres de Steenrod, Ann. Sci. Ecole Norm. Sup., (3) 69 (1952),109-182. [7J R. Thom, Les classes caracteristiques de Pontrjagin des varietes triangulees, Sym- posium Internacional de Topologia Alge- braica, Mexico, 1958. [8J W. T. Wu, Sur les classes caracteristiques des structures fibres spheriques, Actualites Sci. Ind., Hermann, 1952. [9J J. W. Milnor, Microboundles I, Topology, 3 (1964), Supp\. I., 53-80. [IOJ S. P. Novikov, Manifolds with free Abelian fundamental groups and their applica- tions (Pontrjagin classes, smoothness, multi- dimensional knots), Amer. Math. Soc. Trans\., 71 (1968), 1-42. (Original in Russian, 1966.) [l1J N. Steenrod, The topology of fibre bun- dles, Princeton Univ. Press, 1951. [12J D. Husemoller, Fibre bundles, McGraw- Hill, 1966. 57 A Chinese Mathematics [13J 1. Milnor and 1. Stasheff, Characteristic classes, Princeton Univ. Press, 1974. 57 (XXI.6) Chinese Mathematics A. Mathematics in the Chao, Han, and Tang Dynasties (3rd Century B.c.-10th Century A.D.) In ancient China, the art of divination, caned yi, was used in government administration. This was a kind of calculation that used pieces called tse. The book embodying it, called the I-Ching, is still popularly used. It shows that "numbers" or mathematics was seriously uti- lized in China at that time. The multiplica- tion table for numbers up to nine (called the Pythagorean table in the West) was known in China from the legendary period. However, mathematics in the Greek sense, that is, math- ematics as a logically systematized science, was unknown in ancient China. Suanching-Shihshu, or the Ten Books on Arithmetic-namely, Choupi-Suanching, Chiuchang-Suanshu, Haitao-Suanching (edited by Liu Hui), Suntzu-Suanching, Wutsao- Suanching, H siahouyang-Suanching, Changchiu-Suanching, Wuching-Suanshu, Chiku-Suanching (edited by Wang Hsiao- Tong), and Shushu-Chiyi (edited by Hsu Vue) -came into being between the 2nd century R.C. and the 6th century A.D., from the Chao to the Han eras, with the exception of the Chiku-Suanching compiled in the Tang era. These are the only mathematical texts from this early period whose authors and times of publication are known. They were used in the civil service examination for selecting adminis- tratOrs up to the beginning of the Sung era (960 A.D.). The most important among them is Chinchang-Suanshu, or the Book of Arithmetic, which contains nine chapters. It treats positive and negative fractions with laws of operations on signed numbers, equations, and the elemen- tary mathematical knowledge of daily life, The Chiku-Suanching contains a number of prob- lems reducible to equations of the 3rd and 4th degrees. There were also two works called San Tung Shu (edited by Tong Chuan) and Chui Shu (edited by Tsu Chung-Chih), but nO copies of them are extant. Later works, one from the Sui era (published in 636) and another from the Tang era, tell us that the latter contained the result 3.1415927>n>3.1415926 and the ap- proximate values 355/113 and 22/7 for n. In the 1st century A.D. Buddhism was intro- duced from India, and paper was invented. 
57 B Chinese Mathematics However, despite the communication with India, neither the Indian numeration system, written calculation, nor the abacus was at that time widely used in China. The extraction of square or cube roots was done with calculat- ing rods. B. Mathematics in the Sung and Yuan Dynasties (10th-14th Centuries) In the Sung and Yuan periods contact was made with the Arab world. In the 13th cen- tury, a mechanical algebra utilizing calculating rods made remarkable progress; this can be attributed to Arab influence. Toward the end of the Sung era appeared the Shushu- Chiuchang by Ch'in Chiu-Shao and the Yiku- Yentan by Li Chih. The former gives a method like Horner's for approximate solution of equations, and the latter gives the principle of tienyuan-shu, i.e. the mechanical algebra of this period. The principle of tienyuan-shu was further expounded in the Suanhsueh Chimeng (J 295) and the Suyuan Yuchien (1303) by Shih Shih-Chieh, the Yanghui Suanfa by Yang Hui, and other works, These were introduced into Japan and they influenced the wasan (Japanese mathematics) of early times. Until recently, no further original mathematical ideas appeared in China. C. Mathematics after the Ming Era (15th Century) In this epoch, European renaissance civiliza- tion began to influence the Orient. In 1607, Matteo Ricci (J 552-1610) translated Books 1- IV of Euclid's Elements into Chinese with the aid ofHsu Kuang-Chi. In 1592 Suanfa Tang- tsung by Ch'eng Ta-Wei appeared, which dealt with the use of the abacus. This book had great influence upon wasan. No development was seen in the indigenous mathematics of the Ching era, that is, after the 17th century, but science and technology were imported by Christian missionaries. This brought about calendar reform from the lunar to the solar method. On the other hand, new editions of classical works such as the Ten Books on Arithmetic began to appear in this period. Emperor Kang Hsi- Ti (1655-1722), who was in correspondence with Leibniz, asked Ferdinand Verbiest (renamed Nan Huai Jen in Chinese) to compile Shuli-Chingwen (Advanced mathematics), whose 53 chapters were completed in 1723. This book dealt with European-style algebra and trigonometry. In the latter half of the 19th century, Alexander Wylie translated a number of Western mathe- 216 matical books into Chinese, including Books VII-XIII of Euclid's Elements and some works on calculus. Many current Chinese and Japa- nese mathematical terms originated with this translation. References [1] Y. Li, History of mathematics in China (J apanese translation), Seikatusya, 1940. [2] Y. Mikami, The development of mathe- matics in China and Japan, Teubner, 1913 (Chelsea, 1961). 58 (X.g) Coo-Functions and Quasi- Analytic Functions A. General Remarks An example of a tCOO-function is a treal ana- lytic function, which is defined to be a function that can be expressed as a power series that converges in a neighborhood of each point of the domain where the function is defined. Many examples, however, show that real analytic functions form a rather small subset of the COO-functions. Sometimes COO-functions (not real analytic functions) play essential roles in the development of theories of analysis ( 105 Differentiable Manifolds S). On the other hand, there is a subfamily of COO-functions having some remarkable properties in com- mon with the family of real analytic func- tions. This family is called the family of quasi- analytic functions. It has been an important object of study since the beginning of the 20th century. On the other hand, the Gevrey class functions, which are no longer quasi-analytic in general, also constitute an important subset of COO-functions. The first part of this article deals with COO-functions, the second part with quasi-analytic functions, and the third part with Gevrey class functions. B. COO-Functions Let 0 be an open set of the n-dimensional real Euclidean space R n . A real-valued function f(x I' '" , x n ) defined on 0 is called a function of class Coo on 0 (or Coo-function on 0) if f(x l , ...,x n ) is continuously differentiable up to any order. The totality of Coo -functions defined on 0 is denoted by COO (0). It is an tassociative algebra over the real number field R. A contin- uous function f defined on some closed set F 
217 of Rn is called a Coo.function on F if there exist an open neighborhood U of F and a function gE COO(U) such that f = g I F. This definition is equivalent to the following (H. Whitney [6]): For any multi-index (X =«(XI'''.' (Xn)' we can find a continuous function j"(x I, '" ,x n ) on F such that (i) fO(x l ,..., x n ) = f(x l , ..., x n ) and (ii) for any positive integer r and for every multi- index (X with [(XI.s;r, i LIY_II'-I'1 k'(X) - L j"+P(X) (X-,y)P IJ =o, 1'+PI<r fJ. where II II denotes the Euclidean norm of Rn. C. Local Theory of Coo -Functions We shaIl now introduce an equivalence rela- tion  in COO(R n ), defined as follows: f  g <0> fl U =gl U for some open neighborhood U of the origin. Let $n denote the quotient set COO (Rn)/ , which naturally inherits the struc- ture of an associative algebra from COO(R n ). An element of $n is called a germ of a Coo-function at the origin. We denote the germ of fEcoo(Rn) by J To ]E6n, we assign the formal Taylor expansion L(D'f(O)!(X!)x' around 0, where D'f means (8'1 + . +'"f)/(8'1 X I ...8'" x n ) for (X = (r l ,..., r.). This assignment induces a homo- morphism r from $n to the tring of formal power series R [ [x I, .. . , xnJ J of n variables. The homomorphism r is surjective but not injective. Put An =r-I(O) c $n. A function f whose germ 1 belongs to An is called a flat function, The function <p(x) defined by <p(x) == exp( -1/x2) when x#O and <p(0)=0 is an example of a flat function on R j. A close study of the relationship between $n and R [[x I, ". , xnJ J leads to the preparation theorem for COO-functions, which can be stated as follows: Let F(xj, ...,Xn)E6n satisfy F(O, ".,O,X n )= x g(x n ) (g E $" g(O) # 0). Then any] E 6n can be expressed as] = FQ + R, where Q E 6n and R = Lf:Jri(XI,,,,, xn-dx with riE6n-1 (B. Malgrange [3J). Let f(x I, ... , x n ) be a symmetric function in (x I' ..., x n ) of class Coo. Then there exists a germ iJE $n such that l(x" "', x n ) = g(O"I, "', O"n), where 0"1' ..., O"n denote elementary sym- metric functions with respect to x I, .,. , X n (G. Glaeser, Malgrange). Let lE61 satisfy f(X) = f( -x). Then there exists a germ gE61 such that ](x)=g(x 2 ) (H. Whitney). D. Global Results Case of n Variables. COO(Q) becomes a tFrechet space when it is endowed with the topology of 58 E Coo-Functions and Quasi-Analytic Functions uniform convergence on compact sets for all partial derivatives. Let 1 and 1 1 be two closed tideals of COO (Q). Then we have 1 =1 1 if and only ifrAJ) = rAJd for each xEf.!, where r x is the mapping from COO(Q) to the ring of the formal power series that assigns the formal Taylor series of f around x (Whitney). Case of One Variable. In the case of one vari- able, further information can be obtained from various points of view. In the following, f denotes a Coo.function defined on the unit interval 1= [0,1]. If j satisfies j (l) = 1, 1'(0) =... = f(,-I)(O) =0, then we have 1 1 1 -+ ,e- +"'+ ,C <k, m 1 '.1m2 m,. where mi=sup{IPi)(x)llxEI} and where k is sOme constant independent of the choice of f and r (E. Borel). Similar kinds of in equal- ities were obtained by A. N. Kolmogorov, A. Gorny, and H. Cartan. Let A be an arbi- trary countable set of real numbers. If for any x E I we can find an integer r(x) such that f('(X))(X)E A, then such a function f is neces- sarily a polynomial. The interval I can be divided into three disjoint subsets: sl1), Sf), and S!j). These are characterized as fol1ows: For x E SV) the formal Taylor series rxU) of f around x converges to f in some neighbor- hood of x. For XE Sf{), rAf) diverges. And for x E S!j), rAf) converges in some neighbor- hood of x but does not tend to f Then S1I) is an open set and S'j) is a Go-set, while S!j) is an Fa-set of the tfirst category. Conversely, let 1== SI + S2 + S3 be any partition of I into an open set SI, a Go-set S2' and an Fa-set S3 of the first category. Then there is SOme fEcoo(I) with Si = SIf) (i= 1, 2, 3) [5]. E. Relations between COO-Functions and Real Analytic Functions Let C"'(!) be the set of real analytic functions on I. Then C"'(I) is a subalgebra of COO (I). Applying the above result in the case of SI = 0, we find a function fE COO (I) that admits nO real analytic function coinciding with f in a subinterval of I. Actually, functions with such a property are distributed densely in coo(I). A necessary and sufficient condition for a function f E COO (I) to belong to C"'(I) is that for suitable constants A and k, Ipn)(x)l.s; Aknn!, XE I, n =0, 1,2, ..., be valid (Pring- sheim's theorem). If f(n)(x);' 0 for all x E I and n =0,1,2, ..., then f E C"'(!) (S. N. Bernshteln). For any open set Q (c R n ), the set C"'(Q) of real analytic functions on Q is dense in COO(Q) (polynomial approximation theorem). This re- sult is true even when the topology of COO(Q) 
58 F COO-Functions and Quasi-Analytic Functions is replaced by a stronger one (Whitney [6J). Let f ECOO(O) and <pECOO(O). Then we can find g E Coo (0) satisfying f = g<p if and only if for any XEO, rAf) is divisible by rA<p) in the ring of formal power series (S. Lojasiewicz, Malgrange [3J). F. Quasi-Analytic Functions The investigation of quasi-analytic functions began with the attempt to obtain an intrinsic characterization of analytic functions. Borel defined monogenic functions as functions differentiable on their domains of definition, which can be any subset of the complex plane, not necessarily assumed to be open ( 198 Holomorphic Functions Q). Similar to com- plex analytic functions, monogenic functions are uniquely determined by their values on any curve. While quasi-analyticity can be defined by such properties, it is customary to approach quasi-analytic functions from an- other aspect, that is, the behavior of higher derivatives of Coo -functions, Generally, a subset B of COO (I) is called a set of quasi-analytic functions if the mapping r x : B-->R[[xJJ defined in Section D is injective at each point x E I. The functions belonging to B are called quasi-analytic (with respect to B). Here, an important problem is to characterize a set of quasi-analytic functions B by specializ- ing the image of rAB). Now let {M n } be a sequence of positive numbers. Let C(M n ) be the subset of COO (I) consisting of f such that If(n)(x)I.s;AknM n , xEI, n=O, 1,2, ''', where A = A (f) and k = k(f) are constant. Then Pringsheim's theorem simply asserts that C(n!)=C"'(I). In 1912, J. Hadamard raised the problem of determining the condition that the sequence {M n } should satisfy so that C(M n ) becomes a set of quasi-analytic functions [9]. A. Denjoy showed that if Mn=(nlog l nlog 2 n.. .logPn)", where logl n = logn, logP n = log(logP-1 n), p=2,3, ..., then C(M n ) is a set of quasi-analytic functions [10]. Later he derived an improved result that the condition L M n - I / n == 00 is sufficient. T. Carleman first gave a necessary and sufficient condition for C(M n ) to be a set of quasi- analytic functions, and later A, Ostrowski and T. Bang gave another version of the same condition [11-13]. The condition states essen- 218 tially the foIlowing: A necessary and sufficient condition for C(M n ) to be a family of quasi- analytic functions in the interval (a, b) is given by either (i) L [3n- 1 = +00, where [3n = infk?on Ml/k (Carleman), Or (ii) Joo(log T(r))/(r 2 )dr = 00, where T(r)=suPn?o I (rn/Mn) (Ostrowski, Bang). S. Mandelbrojt and T. Bang also gave another condition [13,14]. (The simplest proof of these results is found in [13J or [15J, where the proof follows Bang's idea.) Related to the above theorem, we also have the following: Let {M n } be a sequence of posi- tive numbers with L(Mn/Mn+i) < 00. For (X> 0 we can find f E C(M n ) defined on ( -00,00) such that f(O»O, f(n)( :t(X)=0. Moreover, for 0< (X < [3 there exists f E C(M n ) such that f(O) > 0, fen) =0 «(X.s;x .s;[3, n = 0,1,2,...) [16]. Suppose that we are given an interval I and increasing sequences {v n } and {M n } of positive numbers. Then we have the problem of finding suitable conditions on {v n } and {M n } under which the mapping f --> {f(V,)(x o )} gives an injective mapping from C(M n ) to the sequences above. When {v n } and {M n } satisfy the above conditions, then a function belonging to C(M n ) is called quasi-analytic (v n ) in the generalized sense. The study of the inclusion relation be- tween two families C(M n ) and C(M) also deserves attention. In [15J the relation be- tween C(M n ) and C(n!)=C"'(I) is discussed in detail. There are many open problems con- cerning the relationship between C(M n ) and C(M) in general. Quasi-analytic functions are closely related to problems in various branches of analysis, in particular the theories of complex analytic functions, Fourier series, Fourier integrals, Dirichlet series, and asymptotic expansions [8,15,17]. G. GeVfey Class Functions This class of functions has its origin in the study oftparabolic equations. A Coo-function f(x) defined in a domain of R n is caIled of GeVfey class s (1 .s; s < 00) iffor every com- pact set K in that domain there exist positive constants A K and C K such that max ID'f(x)I.s;AKCI(X!s K for all multi-indices (X (I(XI=(XI + '''+(Xn, (X! = (XI! iX 2 ! ... (Xn!). A typical example is ( { exP( -1/x) for x> 0, f x)== o for x.s;O, which is of Gevrey class 2. Let us denote the function space of a Gevrey class s by yeS), Evi- dntly yeS) c yes') if s < s'. Let f(X)EY(') and F(Y)EY(s); then xf->F(/(x))EY(S). In particular, 
219 the sum and the product of two functions in y(S) belong also to y(S). Furthermore, the tim- plicit function theorem holds in this class. Unlike the class of analytic functions, we can use this class of functions of class s (> 1) for a tpartition of unity, and several problems in partial differential equations can be treated within this function space. Gevrey class func- tions are discussed in a complete form in [20J ( also [19J). References For functions of class Coo, [1 J R. P. Boas, A primer of real functions, Carus Math. Monographs, Wiley, 1960. [2J G. Glaeser, Fonctions composees differen- tiables, Ann, Math" (2) 77 (1963),193-209, [3J B. Malgrange, Le theoreme de preparation en geometrie differentiables, Sem. H. Cartan, 1962-1963, Exp. 11, Inst. H. Poincare, Univ. Paris, 1964. [4J B. Malgrange, Ideals of differentiable functions, Oxford Univ. Press, 1966. [5J H. Salzmann and K. Zeller, Singularitaten unendlich oft differenzierbarer Funktionen, Math. Z., 62 (1955),354-367. [6J H. Whitney, Analytic extensions of dif- ferentiable functions defined in closed sets, Trans. Amer. Math. Soc., 36 (1934), 63-89. [7J H. Whitney, Differentiable even functions, Duke Math. J., 10 (1943),159-160. [8J H. Whitney, On ideals of differentiable functions, Amer. J. Math., 70 (1948),635-658. For quasi-analytic functions, [9J J. Hadamard, Sur la generalisation de la notion de fonction analytique, Bull. Soc. Math. France, 40 (1912),28-29. [IOJ A. Denjoy, Sur les fonctions quasi- analytiques de variable reelIe, C. R. Acad. Sci. Paris, 173 (1921),1329-1331. [IIJ T. Carleman, Les fonctions quasi- analytiques, Gauthier-Villars, 1926. [12J A. Ostrowski, Dber quasianalytische Funktionen und Bestimmtheit asymptotischer Entwickelungen, Acta Math., 53 (1929), 181- 266. [J3J T. S. V. Bang, Om quasi-analytiske Funk- tioner, Thesis, Univ. of Copenhagen, 1946. [J4J S. Mandelbrojt, Analytic functions and classes of infinitely differentiable functions, Rice Inst. Pamphlet 29, no. I, 1942. [15J S. Mandelbrojt, Series adherentes, regu- larisation des suites, applications, Gauthier- VilIars, 1952. [16J S. Mandelbrojt, Some theorems con- nected with the theory of infinitely differenti- able functions, Duke Math. J., 11 (1949),341- 349. 59 A Class Field Theory [17J S. Mandelbrojt, Series de Fourier et classes quasi-analytiques de fonctions, Gauthier- Vi\1ars, 1935. [18J R. E. A. C. Paley and N. Wiener, Fourier transformations in the complex domain, Amer. Math. Soc. Co\1oq. Publ., 1934. For Gevrey class functions, [19J J. Hadamard, Les fonctions de classe superieure dans l'equation de Volterra, J. Analyse Math., 1 (1951), 1-10. [20J M. Gevrey, Sur la nature des solutions des equations aux derivees partielIes, Ann. Sci, Ecole Norm. Sup., ser. 3, 35 (1918), 243-304, 59 (V .14) Class Field Theory A. History The notion of a class field was first introduced by D. Hilbert (1898). Let k be an talgebraic number field and K a tGalois extension of k. Hilbert calIed such a field K a class field over k (or K/k was ca\1ed a class field) when the fol- lowing property was satisfied: A tprime ideal p of k of absolute degree I (i.e., a prime ideal whose tabsolute norm is a prime number) is decomposed in K as the product of prime ideals of K of absolute degree I if and only if p is a tprincipal ideal. (Such a field K is now said to be an absolute class field over k in order to distinguish it from a class field defined later more generally by T. Takagi as explained below,) Hilbert conjectured the fo\1owing theo- rems (1 )-(4) together with the principal ideal theorem ( Section D), and proved them in some special cases. (I) For any algebraic num- ber field k there exists one and only one class field K/k. (2) A class field K/k is an tAbelian extension whose tGalois group is isomorphic to the tideal class group of k. Hence the degree n = [K: kJ is equal to the tclass number h of k, (3) The trelative different of a class field K/k is the principal order; thus K/k is an tunramified extension. (4) Let p be a prime ideal of k, and let f be the smallest positive integer such that pf is a principal ideal. Then p is decomposed in the class field K/k as P=\131 \132'" \13g, N K / k (\13;) = pf,jg =n. Hilbert was led to these conjectures by the analog to the theory of talgebraic functions in one variable. Theorems (1)-(4) were proved by p, Furtwangler (Math, Ann" 63 (1907)), but these results were subsumed under the class field theory of Takagi, who generalized the notion of class field and proved that every Abelian extension of k is a class field over k (J. Call. Sci. Imp. Univ. Tokyo, (9) 41 (1920)), Since 
59B Class Field Theory then, the arithmetic of Abelian extensions of k has developed through this theory. In Takagi's paper, L. Kronecker's problem concerning Abelian extensions of an imaginary quadratic field ( 73 Complex Multiplication) was solved simultaneously; this had been an open problem since the 19th century. Later, E. Artin proved the genera11aw of reciprocity (Abh. Math. Sem. Univ. Hamburg,S (1927)) which put class field theory into its complete form. The original proof by Takagi was rather com- plicated, and H. Hasse, Artin, J. Herbrand, C. Chevalley, and others tried to simplify it. In particular, Chevalley introduced the notion of tide1es and gave a purely arithmetic proof. On the other hand, attempts are also being made to generalize this theory to non-Abelian extensions. We mention here the results of G. Shimura [15J and Y. Ihara [16]. Today, class field theory is considered one of the most beautiful theories in mathematics. B. Definition of a Class Field Let k be an algebraic number field. For the definition of a general class field over k, we need a generalization of the ideal class group of k ( 14 Algebraic Number Fields H). Let m be an tintegra1 divisor of k, and let 3(m) be the multiplicative group of all tfractiona1 ideals of k which are trelatively prime to m. For the rest of this article, we mean by an ideal of k a fractional ideal of k. Denote by S(m) the tray modulo m. Let H(m) be an tideal group modulo m, that is, a subgroup of 3(m) con- taining S(m). A Galois extension K of k is said to be a class field over k for the ideal group H (m) if the following property is satisfied: A prime ideal p of k of absolute degree 1 which is relatively prime to m is decomposed in K as the product of prime ideals of K of absolute degree 1 if and only if p belongs to H(m). The absolute class field of Hilbert is the case where m =(1) and H(m) is the group of all principal ideals of k. A class field K/k for an ideal group H is uniquely determined by H (uniqueness theo- rem). The tconductor f of H is said to be the conductor of the class field for H. The ideal group H corresponding to the class field K/k is determined by K as follows: H(f)/S(f) is the union of all co sets C of 3(f) modulo S(f) such that C contains a trelative norm NK/k() of some ideal  of K which is relatively prime to f. In general, let K/k be a Galois extension and m be an integral divisor of k. Let H(m) be the union of all cosets C of 3(m) modulo S(m) such that C contains a relative norm NK/k() of some ideal\]{ of K which is relatively prime to 220 m. Then H(m) is a multiplicative subgroup of 3(m), and the index h =(3(m): H(m)) is not greater than the degree n = [K : k ]. We have h = n if and only if K/k is the class field for H. Hence a class field K/k can be defined as a Galois extension of k such that h = n for a suitable integral divisor m of k. C. Fundamental Theorems in Class Field Theory (1) Main theorem: Any Abelian extension K/k is a class field over k for a suitable ideal group H. (2) Existence theorem: For any ideal group H(m) there exists one and only one class field for H(m). (3) Composition theorem: Let K I and K 2 be class fields for HI and H2' respectively. Then the composite field K I K 2 is the class field over k for HI n H2' Consequently, K I=:> K 2 if and only if HI cH 2 . (4) Isomorphism theorem: The Galois group of a class field K/k for H(m) is isomorphic to 3(m)/H(m). In particular, every class field K/k is an Abelian extension of k. (5) Decomposition theorem: Let f be the conductor of the class field for H. If p is a prime ideal of k relatively prime to f and f is the smallest positive integer with pI E H, then p is decomposed in K/k as p = 1.1311.132'" g, NK/k(\) = pf, fg = n. (6) Conductor-ramification theorem: Let f be the conductor of a class field K/k. Then f is not divisible by any prime divisor that is unrami- fied for K/k, and f is divisible by every prime divisor that ramifies for K/k. Let f = I1f p , fp = pc. Then fp coincides with the tp-conduc- tor of K/k, and the exponent c can be explicitly expressed by the order of the tramification groups and the tramification numbers of p for K/k ( 14 Algebraic Number Fields P). (7) Let p be a prime ideal of k that rami- fies for K/k. Let Hp be the ideal group of k such that (i) the conductor of Hp is relatively prime to p and (ii) Hp is the minimal ideal group of k containing H with property (i). Let n=[K:k], e=(Hp:H), and pf EHp, where pd(d<f)rf;Hp. Then p is decomposed in K/k as P=(11.132'" g)", N K / k (I.13J=pf, n=efg. (8) Translation theorem: Let K/k be the class field for an ideal group H(m), and let Q be an arbitrary finite extension of k. Then KQ/Q is the class field for H*, where H* is the ideal group of Q consisting of all ideals b of Q with Nn/k(b)E H(m). In particular, the conductor of KQ/Q is a divisor of the conductor of K/k. (9) Artin's general law of reciprocity: Let K/k be the class field for an idea group H with the 
221 conductor f. We denote the t Artin symbol of an ideal a of k that is relatively prime to f by ( K/k ) (K/a)=  ' Let a mapping <I> from 3(f) to the Galois group G of K/k be defined by <I>(a)=(K/a) for aE .](f). Then <I> induces the isomorphism .](f)/H(f)-::;;G. Namely, the isomorphism men- tioned in (4) is explicitly given by the Artin symbol. Also, the ideal group H(f) is character- ized as the set of all ideals a such that a E 3(f) and (K/a) = 1. From this theorem we can prove all the known laws of reciprocity for power- residue and norm-residue symbols ( 14 Algebraic Number Fields 0, Q, R). From the general results of class field theory we can systematically derive all the known theorems concerning the arithmetic of qua- dratic fields, cyclotomic fields, and Kummer extensions. D. Principal Ideal Theorem Let K/k be an absolute class field, Then the extension of any ideal of k to K is a principal ideal of K. This theorem is called the principal ideal theorem. It was conjectured by Hilbert, formulated by Artin as a theorem of group theory, and proved by Furtwiingler (Abh. Math. Sem. Univ, Hamburg, 7 (1930)). Later a simple proof was given by S. Iyanaga (Abh. Math. Sem. Univ. Hamburg, 10 (1934)). This theorem was also generalized to the following general principal ideal theorem (Iyanaga, Japan. J. Math., 7 (1930)): Let K/k be the class field for the ray S(f), and let f = 3':D, where :D is the relative different of K/k, Then the exten- sion to K of any ideal of k that is relatively prime to f belongs to S(). Put 3' = IT v. Then v is equal to the ramification number v,+ 1 ( 14 Algebraic Number Fields K). For an absolute class field K/k, let the extension of an ideal a to K be (0(a)), Then we can choose 0(a)EK such that 0(a)0(w(a)0(ab)-lEk, where O"(a)=(K/a) is the Artin symbol for a (T. Tannaka, Ann. Math., 67 (1958)). This result can also be generalized for the class field for S(f). E. Theory of Genera Let K/k be a Galois extension and let H(m) be an ideal group of k. The set of all ideals III of K relatively prime to m such that NK/k() be- longs to H(m) forms an ideal group of K. This ideal group is said to be the principal genus for H. Each coset of :j(m) modulo the principal genus for H is said to be a genus for H. In 59 F Class Field Theory particular, let K/k be a cyclic extension with the conductor f, and let H(f) be the ideal group of k generated by NK/k(A) (A E K) and S(f). Then the principal genus for H(f) is the ideal group formed by the ideal classes of K of the form Cl-a, where (J is a generator of the Galois group of K/k ( 347 Quadratic Fields F). In general, let K/k be an Abelian extension, and let f be the conductor of K/k, Then for the ideal 3' = II v of K defined in Section D, N K / k (S(m3')) = S(mf) for an arbitrary integral ideal m of k. In particular, let K/k be a cyclic extension, and let H = S(mf), Then the prin- cipal genus for H is the ideal group consist- ing of all cosets of 3(m) modulo S(m3') of the form B l - a (Herbrand; Iyanaga, J. Reine Angew. Math., 171 (1934)). F. Class Field Tower Problem and Construction Problem Furtwiingler considered the following prob- lem: Let k be a given algebraic number field, k = ko C k 1 C k 2 ... be the sequence of fields such that k i is the absolute class field over k i - l , and K rye be the union of all the k i . Is K 00 a finite extension of k? The answer is yes if and only if k n is of class number 1 for some n. This problem is called the class field tower problem, Artin remarked that if for every algebraic number field F of degree n we have the inequal- ity I Drl > (n/4)2'2(nn/n!)2 > (ne 2 /4)"j(2nne l / 6n ) for the tdiscriminant D F , then K oo/k is always finite [1, p. 46]. E, S, Golod and I. R, Shafa- revich (1964) solved the class field tower prob- lem negatively; they proved that K oo/k is infinite if k i (i = 1, 2, ...) is the maximal unrami- fied Abelian p-extension of k i - 1 for a fixed prime number p and if the inequality y;, 3 + 2JP+2 holds, where y is the minimal num- ber of generators of the p-component of the ideal class group of k and p is the rank of the unit group of k, (We call an extension K/k a p- extension if the degree [K : k J is a power of a prime number p,) For example, the class field tower K oo/k is actually infinite if k is an imagi- nary quadratic field ( p = 1) and y;' 7 fOr p = 2, for example, k=Q( J -3.5'7,11, 13,17,19). Construction problem. Let k be a given alge- braic number field and G a finite group. The construction problem asks us whether there exists a Galois extension K/k such that its Galois group Gal(K/k) is isomorphic to G. If G is Abelian the problem can be solved affirma- tively by using class field theory. This problem was also solved affirmatively for p-groups by A. Scholtz and H. Reinhardt in 1937, and for general solvable groups by Shafarevich in 1954 (Izv. Akad. Nauk SSSR, ser. mat. 18). 
59 G Class Field Theory G. Class Field Theory for Algebraic Function Fields and Local Class Field Theory F. K. Schmidt developed an analog of class field theory for Abelian extensions over an algebraic function field in one variable with finite coefficient field (1930;  [8J), An arith- metic proof was given by M. Moriya (1938). An analog of class field theory for local fields with finite residue-class fields, called local class field theory ( 257 Local Fields) was first developed by Hasse, and later Chevalley gave an algebraic derivation (1933). H. Cohomology of Groups and Class Field Theory For the purpose of simplifying the proof of the main theorems in class field theory, the theory of tGalois cohomology was developed by T. Nakayama, G. Hochschild, A. Weil, Artin, J. Tate, and others, In particular, Artin and Tate [9] constructed class field theory on the basis of the cohomology theory of finite groups as fol- lows: Let G be a finite group, A a tG-module (or a multiplicative commutative group with the operator domain G), and fJn(G, A) the nth tcohomology group (n= 0, :1:: 1, :1::2, ...) of G with coefficients in A ( 200 Homolog- ical Algebra N). Then we have fJo(G,A)=;;; A G/ NG(A), where A G is the set of all G-invariant elements in A and NG(A) is the set of all ele- ments of the form NG(a)== LaEG O"a (aEA). We can consider Z a G- mod ule by defining O"n = n (nEZ,O"EG). Let A, B, C be G-modules such that a G-bilinear mapping (A,B)-->C is de- fined. Then we can define the tcup product (ex,fJ)-->!Y..fJ (!Y..EH'(G,A), fJEHS(G,B),!Y.. fJE H'+'(G, C) for r, SEZ with the usual prop- erties, Let A be a G-module and H a sub- group of G. Then the trestriction homomor- phism R G1H : fJn(G, A)-->fJn(H, A) and the tinjection homomorphism InjHIG: fJn(H, A)--> fJn(G, A) are defined for nEZ. If H is a normal subgroup of G, then the tinfIation homomorphism Inf(G/H)/G :fJn(G/H, A H)--> fJn(G,A) can be defined for n? 1 ( 200 Homological Algebra M), Let k be an algebraic number field, and let K be a Galois extension of k of degree n with the Galois group G= G(K/k), The multiplica- tive group K x == K - {O}, the tidele grOl'p J K of K, and the idele class group C K of K are multi- plicative commutative groups with G as their operator domain. The fundamental formulas in Galois cohomology for class field theory are Rl(G,CK)=O, fJ2(G, C K ) =;;; Z/nZ. 222 It is possible to realize the isomorphism of (2) by the invariant inv K/k: fJ2(G, CK){ (r/n) (modZ) I r=O, 1, ..., n-l} in such a way that the following properties hold, where the canon- ical cohomology class for K/k is the element (K/k of fJ 2 (G,C K ) such that inv K / k (K/k=(1/n) (modZ): (i) for kclcK, G=G(K/k), H= G(K/I), the relation Res G / H (K/k = (K/I holds; (ii) if I/k is also a Galois extension with F = G(l/k), then we have Inf p1G (I/k = (;/k (m= [K: I]); (iii) for a cyclic extension K/k we have inv K / k (K/k = Lpinvp«(K/k)p (mod Z), where p runs over all prime divisors of k and inv p is the invariant in the local theory ( 257 Local Fields E). By these properties, the canonical cohomology class K/k is uniquely determined. After these preliminaries we can state Tate's theorem, from which the fundamental theo- rems in class field theory follow, Tate's theorem. Let K/k be a Galois exten- sion with the Galois group G. Then we have the isomorphism <1>n: fJn-2( G, Z) =;;; fJn(G, C K ) (n = 0, :1:: 1, :1:: 2, ... ) that is given explicitly by <1>n(!Y..)=(K/k!Y..' where (K/kEfJ2(G,CK) is the canonical cohomology class for K/k (Ann. Math" (2) 56 (1952)), Corollary 1. Since fJ -2(G, Z) =;;; G/[G, GJ and fJo(G, C K )=;;; Ck/NK/dCK), we have the isomor- phism <1>0: G/[G, GJ =;;; C k / NK/k(C K ). Let f(r, 0") (r, 0" E G) be a 2- tcocycle belonging to (K/k' Then by the explicit expression for the cup product we obtain the isomorphism <1>0: 0"( mod[G, GJ)--> IT f(r, O"fl(mod NK/dCK) ) ' rEG (1 ) (2) This is an analog of the result in local theory that was proved earlier by T. Nakayama and Y. Akizuki (Math. Ann., 112 (1936»). Corollary 2. For an Abelian extension K/k we have the isomorphism <1>0: G =;;; C k / NK/dCK)' <1>0 1 has the property of being the norm- residue symbol for Ck> and from this iso- morphism we can prove immediately Artin's law of reciprocity. Thus we can prove the main theorems in class field theory by cohomology- theoretic methods [9]. We can also see, by generalizing this iso- morphism to infinite Abelian extensions, that the Galois group of the maximal Abelian extension of k over the ground field k with tKrull topology is algebraically and topologi- cally isomorphic to Ck/Dk> where Dk is the connected component of the unit element in C k . The structures of Dk and Ck/D k were ex- plicitly determined by Artin and T. Kubota, respectively ( 6 Adeles and Ideles D). If we assume the fundamental formulas (1) and (2) stated above and several other simple assumptions as axioms for an infinite ex- 
223 tension of a fixed ground field, we can de- velop the results stated in this section purely cohomology-theoretically. Such a system is called acIass formation (Artin;  [9J). In ad- dition to the cases of algebraic number fields, algebraic function fields in one variable with finite coefficient fields, and local fields with finite residue-class fields, which we have men- tioned already, we also know several other cases for which analogies of class field theory are valid. These analogies can be explained systematically by using class formation the- ory (Y. Kawada, Duke Math. J., 22 (1955)), Examples are (1) the theory ofunramified Abelian extensions of an algebraic function field in one variable with algebraically closed constant field of characteristic 0 (Tate and Kawada, Amer, J. Math., 77 (1955)); (2) the theory of Kummer extensions over a field k such that (i) the characteristic of k is 0, (ii) k contains all the roots of unity, and (iii) for any Galois extension K/k, NK/k(K)=k; (3) the theory of Abelian p-extensions of a field of characteristic p (E. Witt, J, Reine Angew. Math., 176 (1963); I. Satake and Kawada, J. Fac, Sci. Univ, Tokyo, 7 (1955)); (4) the theory of unramified Abelian p-extensions of an algebraic function field in one variable with algebraically closed constant field of char- acteristic p (Hasse and Witt, Monatsh. Math., 43 (1936), H. L. Schmid, I. R. Shafarevich, Kawada, T. Tamagawa); and (5) the theory of Abelian extensions of a local field with algebraically closed residue-class fields (1.-P, Serre, Bull. Soc. Math. France, 89 (1961)). An analogy of class field theory for infinite Abelian extensions was considered by Her- brand, Moriya, M, Mori, and Kawada. References [1] H. Hasse, Bericht iiber neuere Untersu- chungen und Probleme aus der Theorie der algebraischen Zahlkorper, Jber. Deutsch, Math. Verein, I, 35 (1926),1-55; la, 36 (1927), 231-311; II, 1930 (Physica Verlag, 1965). [2] H. Hasse, Vorlesungen iiber Klassenkor- pertheorie, Lecture notes, Univ. Marburg, 1932 (Physica Verlag, 1967). [3] C. Chevalley, Sur la theorie du corps des classes dans les corps finis et les corps locaux, J, Fac. Sci. Univ. Tokyo, 2 (1933),365-476. [4] J. Herbrand, Le developement moderne de la theorie des corps algebriques, Gauthier- Villars, 1936. [5J E. Artin, Algebraic numbers and algebraic functions, Lecture notes, Princeton Univ" 1950-1951 (Gordon & Breach, 1967). [6] C. Chevalley, La theorie du corps de classes, Ann. Math., (2) 41 (1940), 394-418. 60B Classical Groups [7] A. Weil, Sur la theorie du corps de classes, 1. Mafh. Soc. Japan, 3 (1951),1-35. [8] F. K. Schmidt, Die Theorie des Klassen- korper iiber einem Korper algebraischer Funktionen in einer Unbestimmten und mit endlichem Konstantenbereich, S.-B. Phy.-Med. Soz, Erlangen, 62 (1931), 267-284. [9] E. Artin and J. Tate, Class field theory, Princeton Univ. Press, 1951 (Benjamin, 1967). [IOJ S, Iyanaga, Class field theory notes, Univ. of Chicago, 1961. [IIJ 1.-P. Serre, Groupes algebriques et corps de classes, Actualites Sci. Ind., Hermann, 1959, [12] 1.-P. Serre, Corps locaux, Actualites Sci, Ind., Hermann, 1962; English translation, Local fields, Springer, 1979, [13J A, Weil, Basic number theory, Springer, 1967. [14] 1. W. S. Cassels and A. Frohlich (eds.), Algebraic number theory, Academic Press, 1967, [15] G. Shimura, A reciprocity law in nonsolv- able extensions, 1. Reine Angew. Math., 221 (1966),209-220. [16J Y. Ihara, Non-Abelian class fields over function fields in special cases, Actes Congr. Intern. Math., 1970, Nice, Gauthier-Villars, 381-389. [17] Y. Ihara, Some fundamental groups in the arithmetic of algebraic curves over finite fields, Proc. Nat. Acad, Sci. US, 72 (1975), 3281-3284. [18] Y. Kawada, Class formations, 1969 Num- ber Theory Institute, Amer. Math. Soc. Proc. Symposia in Pure Math., 20 (1971), 96-114. [19] S. Iyanaga (ed.), Theory of numbers, North-Holland, 1975. (Original in Japanese, 1969.) 60 (IV.5) Classical Groups A. Introduction The general linear groups, unitary groups, orthogonal groups, symplectic groups, etc., that are described below are all called classical groups ( 13 Algebraic Groups, 151 Finite Groups, 248 Lie Algebras, 249 Lie Groups). B. General Linear Groups Let V be a tlinear space of dimension n over a tfield K, and let GL( V) denote the set of all tlinear mappings of V onto V (hence they are 
60 C Classical Groups all bijections). Then G L( V) is a group under the composition of mappings. This group is called the general linear group (or full linear group) on V. Let e" ... , en be a basis of Vover K, and let (ex)) be the matrix associated with an element A ofGL(V):Aei=Ljexlej. Then the mapping A -->(ex)) is an isomorphism of GL(V) onto the multiplicative group GL(n, K) of all n x n tinvertible matrices over K. We can thus identify the group GL(V) with GL(n, K). GL(n, K) is called the general linear group of degree n over K. Consider the homomorphism A-->IAI (I A I is the determinant of A) ofGL(V) onto the multiplicative group K* = K - {O}. Its kernel S L( V) is a normal subgroup of G L( V) and is caIled the special linear group (or uni- modular group) on V. The subgroup SL(n, K) = {A I A EGL(n, K), IAI = I} of GL(n, K) corre- sponds to SL(V) under the above isomorphism GL(V)GL(n,K). SL(n,K) is called the special linear group of degree n over K. Unless n = 2 and K is the tfinite field F 2 = GF(2), SL(n, K) is the tcommutator subgroup of GL(n, K). The tcenter 3 of GL(n, K) coincides with the set of all scalar matrices exI (exEK*), and the center 30 of SL(n, K) is a finite group given by 3n SL(n, K)= {exII exEK, exn = I}. Now let P(V) be the tprojective space of dimension n - 1 obtained from a linear space V of dimension n. Namely, P(V) is the set of all linear subspaces of dimension 1. Then there exists a natural homomorphism <p of GL(V) into the group of all projective trans formations of P(V), and the tkernel of <p coincides with the center 3 of GL( V), Hence <p(GL(V))GL(V)/3. This group is written as PG L( V) and is called the projective general linear group on P( V). Similarly, PGL(n, K) = G L( n, K)/3 is called the projective general linear group of degree n over K. The quotient group SL(n, K)/30 of SL(n, K) by the center 30 is called the projective special linear group and is written as PSL(n, K) or LF(n, K) (linear fractional group). The groups GL(n, K), SL(n, K), etc" are also written as GLn(K), SLn(K), etc. In particular, when K is the tfinite field Fq, these groups are denoted by GL(n, q), SL(n; q), PGL(n, q), PSL(n,q), LF(n,q). Simplicity of PSL(n,k). When n=2 and K = F 2 , PSL(2,2)53 (the tsymmetric group of degree 3). When n=2 and K =F3' PSL(2,3) 4 (the alternating group of degree 4). Except for these cases, the group PSL(n, K)(n? 2) is a noncommutative tsimple group ( 151 Finite Groups I). Suppose that K is the finite field Fq, and let ex(n, q), [3(n, q), y(n, q), b(n, q) denote the orders of GL(n,q), SL(n, q), PGL(n, q), PSL(n, q), res- 224 pectively, Then we have ex(n, q) =(qn _ 1 )(qn _ q).. ,(qn _ qn-I), {j(n, q)= y(n, q) = ex(n, q)/(q - 1), b(n, q) = y(n, q)/d, where d = (n, q - 1) (the greatest common divi- sor of nand q - 1), C. Properties as Lie Groups If the ground field K is the field R of real num- bers (the field C of complex numbers), the above groups are all tLie groups (tcomplex Lie groups), In particular, SL(n, C) is a tsimply connected, tsimple, and tsemisimple complex Lie group of type An-I> and PSL(n, C) is the tadjoint group of the complex simple Lie algebra of type An-I' D. Determination of the Rational Representations of G L( V) In Sections D and E, the field K is assumed to be of characteristic O. Let p be a homomor- phism of GL(V)=GL(n, K) into GL(m, K) (p: A = (ex))--> B = ([3:». Then if each [3: is a rational function (or polynomial or analytic function) in (exL exL .", ex;) over K, p is called a rational representation (or polynomial or analytic repre- sentation) of degree m of G L( V). (We suppose that K is R or C when we consider analytic functions,) For example, every rational repre- sentation of degree 1 can be expressed as A--> IAle (e is an integer). In particular, if K is the field C of complex numbers, every analytic representation of GL(n, C) is a rational repre- sentation. Since GL(n, C) is the tcomplexifica- tion of the tunitary group Urn), there exists a one-to-one correspondence between the com- plex analytic representations of GL(n, C) and the continuous representations of U(n); this correspondence preserves equivalence, irre- ducibility, ttensor product, and direct sum of the representations ( 249 Lie Groups). Hence, determining the rational representa- tions of GL(n, C) is equivalent to determining the continuous representations of Urn). In the general case, the rational representations of GL(V)= GL(n, K) are all completely reducible. F or any rational representation p of G L( V), there exists a natural number e such that the representation p': A-->IA Ie p(A) is a polynomial representation. Hence in order to determine the rational representations of GL(V), it is sufficient to determine the irreducible poly- nomial representations of GL(V), which, as described below, can be obtained by decom- posing the representations on the ttensor space 
225 V m = V@ ,., @ V of degree m (m copies of V) (m= 1,2,,,.), For AEGL(V), define Dm(A)E GL( V m ) as the tensor product Dm(A)=A @ ,,, @A (m copies of A), Namely, for VI> '" 'V m E V, we have Dm(A)(v l @." @ Vm)=AVI @ ", @ Av m , The mapping GL(V)3A-->D m (A)EGL(V m ) is a polynomial representation of degree n m of GL(V), Now let i!(V m ) be the tassociative algebra of aU1inear mappings of v m into v m (ttota1 matrix algebra), and let  be the sub- algebra of (vm) generated by {Dm(A)IAE GL(V)}, Next, for an element (J of the symmet- ric group 5m of degree m, define BaEGL(Vm) by BAv l @ ", @ v m )= V a -l(l) @ ,,, @ va-'(m)' Then the mapping (J-->Ba is a representation of 5m on vm, Thus we obtain a representation r of the tgroup ring K[5 m J of 5m over K on vm:K[5mJ-->(Vm). Set r(K[5mJ)=!B, Then  and !B are tcommutors of each other in i!( V m ), i,e.,  = {X E i!( vm) I X B = BX (for aU BE!B)}, !B = {X Ei!(vm)1 AX = XA (for all AE)}, Now for a right ideal r of!B, let r( V m ) be the subspace of vm composed of alJ the finite sums of the form LBx (BEr,xE V m ), Then the fol- lowing statements hold: (1) r( V m ) is invariant under ; hence it is a subspace of v m invariant under GL(V), Con- versely, for any subspace V of vm invariant under GL(V), there exists a unique right ideal r of!B such that V = r( vm), (2) Let r I, r 2 be right ideals of !B, and put VI =rd vm ), V 2 =r 2 (Vm), Then r l r2 (as right 5m-modules) if and only if VI  V 2 (as repre- sentation spaces of GL( V)). (3) The mapping r-->r(V m ) is a lattice iso- morphism of the t1attice of right ideals of !B onto the lattice of G L( V)-invariant subspaces of vm, Hence if r = r 1+ r 2 (direct sum), then V = VI + V 2 (direct sum). Also, r(V m ) gives an irreducible representation of GL(V) if and only if r is a minimal right ideal of !B, Since the algebra K [6 m J is a tsemisimp1e algebra, !B can be considered as a two-sided ideal of K [5 m J. Hence a minimal right ideal r of!B is also a minimal right ideal of K [5m], and the tidempotent element c; which generates r is a tprimitive idempotent of K[6m]. From the theory of symmetric groups ( 362 Repre- sentations H) the primitive idempotents of K[6 m J are all given (up to isomorphism) by tY oung's diagrams T(/I, ,., ,fd U  f2  '"  h> 0, m = fl + ", + h), In this setting, we have (4) Let c; = C;(fl, ." ,h) be the primitive idempotent determined by Young's diagram T(/I' " ,Jk)' Then c;K [6 m J c !B if and only if k.s; n, In this case, put c;K [6 m J = r, r( V m ) = 60E Classical Groups c;(V m )= Vm(T(/"", ,h)) and denote the irreducible representation of G L( V) on vm(T(/"", ,h») by A -->D(A;fI'''''h). We call (/" .., ,h) the signature of this irreducible representation, (5) The representation D(A;f1> .." fd is an irreducible polynomial representation of GL(V), Furthermore, for any irreducible poly- nomial representation p of GL( V), there exists a unique D(A;f"", ,h) equivalent to p, For ex- ample, if k = 1, then fl =m, c;=(m!)-I L aE 6m(J, and Vm(T(m)) is the space of tsymmetric ten- sors of degree m, Iffl = ,., = h = 1, then k=m, c; = (m!) -I, L aE6 Jsgn (J)(J, and V m (T(I,,,., 1)) is the space of talternating tensors of degree m, (6) Let x( A; fl, . " , fd be the tcharacter of the irreducible representation D(A; f" '" , h). Then X (A;fl> ,..,h) c;i 1 c;i' C;l, I C1 C;12 C;l, n n n-I n-2 C;I C;I C;I n-I n-2 C;n en f. n where e l , "', f. n are the teigenva1ues of A and 1 1 = fl +(n-l), 1 2 = f2 +(n-2),.., ,In = fn (set fk+ I = " , = fn = 0), Hence the degree d of D(A; f"." ,h) is expressed as d=D(lI, ".,In)/D(n-l, "', 1,0), where D(xl> ..., x n ) = I1 i <j(X i - xJ (7) In particular, denote the character of D(A; m) by Pm = Pm (A), Then they satisfy II -zAI- I =PO+PIZ+P2Z2+". and x(A;f1> ", Jd Pf, Pf,+l Pf1 +(n-I) Ph-I Ph Ph+(n-2) Pf,-(n-I) P f,-(n- 2) Pf. where we put fk+1 = h+2 =". = fn=O, P-I = P _ 2 = ... = 0, This matrix is simply written as IP1-(n-I), ,., , pll, with the convention that in each row, we set II = fl +(n-l),,,,, In-I = fn-I + 1, In = fn. E. Determination of the Rational Represent- ations of SL(V) The rational representations of S L( V) are completely reducible, By restricting any irreducible representation D(A;fI' ''', fn) (/1  f2  '"  fn O) of GL(V)to SL(V), we get an irreducible rational representation i5(A;fI' ..., fn) of SL( V), Furthermore, any irreducible rational representation of SL(V) 
60F Classical Groups can be obtained in this way. D(A; fl' '" , /.,) and D(A;f;, .., ,I:) are equivalent representa- tions of SL( V) if and only if /; - /;+1 = /;' - /;I (i = I, . " , n - 1). F. Unitary Groups The set U (n) of all n x n tunitary matrices with complex elements is a group under multipli- cation ( 269 Matrices). This group U(n) is called the unitary group (or unitary transfor- mation group) of degree n. The subset of U(n) consisting of all matrices of determinant 1 is a normal subgroup of U(n). This group is called the special unitary group and is denoted by SU(n). U(n) and SU(n) are subgroups of GL(n, C) and SL(n, C), respectively, and can be obtained from these groups through the tunitary restric- tion. Hence they are both compact, connected Lie groups; in particular, SU(1) is composed only of the identity and U (1) is the multiplica- tive group of all complex numbers of absolute value 1. The center 3 of U (n) is the set of all diagonal matrices AI(AEC,IAI= 1), and we have 3U(J), 3' SU (n)=U(n), U(n)/SU(n) U(I). Moreover, for n?2, SU(n) is a simple, semi- simple, and simply connected Lie group, which gives one of four infinite series of simple com- pact Lie groups. U(n)/3 is denoted by PU(n) and is called the projective unitary group. We have the relations PU(n)2ifSU(n)/3n SU(n), 3nSU(n)Z/nZ. Hence PU(n) is locally isomorphic to SU(n). G. Irreducible Representations of U(n) Restricting the irreducible representation D(A;fI'''' Jd of GL(n, C) on SU(n), we obtain a continuous irreducible representation of SU(n), and conversely, all continuous irreduc- ible representations of SU (n) are obtained in this manner. Similarly, any continuous irre- ducible representations of U (n) are given by A -->IAl e D(A;fI' "., h), where e is an integer. Since both U(n) and SU(n) are compact, any continuous representation of these groups can be decomposed into a direct sum of the irreducible representations mentioned above ( 69 Compact Groups). The representation theory of U(n) and SU(n) is important as the most typical and concrete example of the representation theory of gen- eral compact Lie groups ( 69 Compact Groups, 248 Lie Algebras, 249 Lie Groups). 226 H. Unitary Groups over General Fields A unitary matrix and the unitary group can also be defined over some fields other than the field C of complex numbers. Namely, let P be a field and K a quadratic extension field of P; for an element  of K, let  be the tconjugate of  over P. Then a matrix of degree n with entries in K is called a unitary matrix of K (relative to P) if it leaves invariant the tHer- mitian form I I + 2 2 + ", + nn' The multi- plicative group consisting of all unitary mat- rices is called the unitary group over K (relative to P) and is denoted by U (n, K, P); its sub- group consisting of all unitary matrices of determinant 1 is called the special unitary group over K and is denoted by SU(n, K, P). The quotient group of SU(n, K, P) by its subgroup consisting of all AI (An = 1,1..1.1 = 1) is called the projective special unitary group over K and is denoted by PSU(n, K, P). In partic- ular, when K and P are the finite fields Fq2 and Fq (q = pm), U(n, K, P), SU (n, K, P), PSU (n, K, P) are written simply as U(n, q), SU(n,q), PSU(n,q). Then for n?3, each PSU(n, q) is a noncommutative simple group, except for PSU(3, 2) ( 151 Finite Groups I). I. Orthogonal Groups The set of all torthogonal matrices of degree n (with real entries) forms a group under multi- plication. This group O(n) is called the ortho- gonal group (or orthogonal transformation group) of degree n. The subset of O(n) consist- ing of all orthogonal matrices of determinant 1 forms a normal subgroup of O(n) of index 2. This group SO(n) (also denoted by On+) is called the rotation group (special orthogonal group or proper orthogonal group) of degree n. Geometrically, O(n) is the set of all orthogonal transformations leaving a point in Euclidean space of dimension n fixed, and SO(n) is com- posed of all rotations around the point. Both O(n) and SO(n) are compact Lie groups, and SO(n) coincides with the con- nected component of O(n) which contains the identity, For n = 3 or n? 5, each SO(n) is a simple and semisimple Lie group. Following the theory of Lie algebras, we divide the set of all SO(n) (n? 3 but n #4) into two classes according as n is even or odd, and we thus get two of the four infinite series of simple and semisimple compact Lie groups (for SO(4), for example, see [I J). Although SO(n) (n? 3) is a connected Lie group, it is not simply connected. The simply connected Lie group which is locally isomor- phic to SO(n) is called the spinor group and is denoted by Spin(n). SO(n) is isomorphic to the 
227 quotient group of Spin(n) by a normal sub- group of order 2, Let 3 be the center of Spin(n). Then 3  Z/2Z for odd n, 3  Z/4Z for n = 2 (mod 4), and 3(Z/2Z) EB (Z/2Z) for n=O (mod 4) (61 Clifford Algebras). The group O(n, C) of all tcomplex ortho- gonal matrices is called the complex ortho- gonal group, and the group SO(n, C) of all matrices in O(n, C) of determinant 1 is called the complex special orthogonal group. SO(n, C) (n;;d,n#4) is a simple and semisimple com- plex Lie group. J. Irreducible Representations of Orthogonal Groups In the same way as for GL(n, K), the irreduc- ible representations of O(n) can be obtained by decomposing the tensor product Dm(A) = A @ ... @ A of m copies of an orthogonal matrix A using Young's diagram. Namely, con- sider the Young's diagram T(/IJ2, ..,Jd such that the sum of the lengths of the first column and of the second column is not greater than n, and call it an O(n) diagram. Then to any O(n) diagram T= T(/"... Jk)' there corresponds an absolutely irreducible representation DO(A;f1 ,12' ... Jd, and the representations DO(A;f1 ,12, .., Jd correspond- ing to two distinct O(n) diagrams are mutually inequivalent. Dm(A) can be decomposed into the direct sum of those representations DO(A; fl,f2,'" ,fd such that f = fl +... + h takes the values m, m-2, m-4,..., Furthermore, any continuous irreducible representation of O(n) is equivalent to a D O (A;fIJ2'''' Jd obtained from some O(n) diagram T= T(/IJ2,'" Jk)' In general, two O(n) diagrams T and T' are called mutually associated diagrams if the sum of the lengths of their first columns is equal to n and if the lengths of each column other than the first one coincide. In particular, if T= T(/I ,12, . .. Jd and 2k = n, then T is said to be self-associated. The set of all O(n) diagrams can be divided into pairs of mutually asso- ciated T, T' (and self-associated T= T'). Suppose that we are given mutually asso- ciated diagrams T and T' and that the length k of the first column of T== T(/t ,12, .., Jd is not greater than n/2, Then the character X1 (A) of D O (A;fIJ2' ... ,h) corresponding to T and the character xdA) of the irreducible repre- sentation corresponding to T' are given by XT(A) = IpI-(v-l) - PI-(v+l)' PI-(v-2) - PI-(v+2), ... ,PI- PI-2vl, Xr(A)=IAlxT(A), v=[n/2J, where Pi and IpI-(v-l) - PI-(v+I),...1 have the same meaning as in the formula for the char- 60K Classical Groups acters of irreducible representations of GL(n,K). The irreducible representations of SO(n) can be obtained immediately from those of O(n). Namely, if T= T(jlJ2,'" Jd is not self- associated, DO(A;f1 ,12' ... ,h) is irreducible as a representation of SO(n), and the represen- tations of SO(n) derived from T and the asso- ciated T' coincide. If T is self-associated, DO(A;f1 ,12, ... Jk) can be decomposed into two irreducible representations of SO(n) of the same degree over the field of complex num- bers. Furthermore, the irreducible represen- tations of SO(n) obtained in this way from different pairs of associated diagrams are mutually inequivalent, while any continuous irreducible representation of SO(n) is equiva- lent to one of these representations. For the representations of SO(3) (the rotation group of degree 3)  353 Racah Algebra. Since SO(n) is isomorphic to the quotient group of Spin(n) by a normal subgroup N of order 2, a continuous representation of Spin(n) which is not the identity representation on N can be considered as a double-valued repre- sentation of SO(n). This representation is called the spin representation and is important in the field of applied mathematics. The orthogonal group O(n) consists of all n x n real matrices which leave invariant the quadratic form i + '" + , while the group of all n x n real matrices which leave invariant the quadratic form i + . .. + e - ;+1 - . .. - ( of tsignature (r, n - r) is called the Lorentz group of signature (r, n - r). The case for n == 4 and r == 3 i used in special relativity ( 359 Relativ- ity). Let Go be the connected component of the identity of the Lorentz group of signature (3, I). Then Go is called the proper Lorentz group, For 0"=(9i)EG, we have 10"1 =::J:: 1 and 944? 1 or 944.s; -1. Moreover, we have Go = {O"IIO"I = I, 944? I}, G/Go(Z/2Z) EB (Z/2Z) (tfour group), and Go  SL(2, C)/{ ::J::I}. K. Orthogonal Groups over General Fields Orthogonal groups can also be defined over other general fields than the field ofreal num- bers as follows: Fix a tquadratic form Q(,) = L?j1 exijij (Iexijl #0) over a field K. Then a linear transformation of Ui == 1,2, ..., n) over K which leaves Q invariant is called an ortho- gonal transformation with respect to Q. The set of all orthogonal transformations forms a group. This group is denoted by O(n, K, Q) or simply O(Q) and is called the orthogonal (trans- formation) group over K with respect to Q. In particular, the normal subgroup of all trans- formations in O(n, K, Q) of determinant 1 is called the special orthogonal group over K with 
60L Classical Groups respect to Q and is denoted by SO(n, K, Q) (or simply SO(Q)), O(n) and SO(n) are special cases of O(n, K, Q) and SO(n, K, Q), where K is the field R of real numbers and Q(,) is the unit quadratic form i +  + ,.. +;, Let Q(n, K, Q) be the tcommutator subgroup of O(n, K, Q). Then this subgroup coincides with the commutator subgroup of SO(n, K, Q). If K is of characteristic #2, and if n? 5 and the tindex v of Q? 1, then Q(n, K, Q)/3 (3 is the center of Q(n, K, Q)) is a simple group, where 3 = {I} or 3 = { :tI} (L. Dickson, J. Dieudonne). Suppose that K is a finite field Fq (of character istic #2). Then we have v = m if n = 2m + 1, and v = m or m - 1 if n == 2m, Hence v? 2 if n? 5. If v = 0 and K = R, we have Q(n, R, Q) = SO(n) and, as mentioned before, SO(n)/3 is simple for n? 5, The same proposition also holds when K is an talgebraic number field (M. Kneser, 1956), If K is of characteristic 2, then O(n, K, Q) = SO(n, K, Q), 3 = {I}, and Q(n, K, Q) is a sim- ple group in many cases (Dieudonne [5J). For the case where K is a finite field (Dickson)  151 Finite Groups I. L. Symplectic Groups Let (I, (2, ... , (2n and IJ" 1J2, ... , 1J2n be two sets of variables, and suppose that the same linear transformation A over a field K acts on them (from the left), If A leaves the tbilinear form L71«(2i-IIJ2i-2iIJ2i-d invariant, this linear transformation (or the corresponding matrix) A is called a symplectic transformation (sym- plectic matrix) of degree 2n, The set of all sym- plectic transformations (or matrices) of degree 2n over K forms a group denoted by Sp(n, K) and calIed the symplectic group (symplectic transformation group, complex group, or Abelian linear group) over K. Any matrix in Sp(n, K) is always of determi- nant 1, and the center 3 of Sp(n, K) consists of I and - I. The quotient group PSp(n, K) of Sp(n, K) by 3 is calIed the projective symplectic group over K, Except for the three cases n = 1, K =F 2 ; n= 1, K =F 3 ; and n=2, K =F 2 , the group PSp(n, K) (n? I) is always simple. Properties of Symplectic Groups as Lie Groups. When K is the field C of complex numbers or the field R of real n um bers, S p( n, K) is a Lie group. The intersection of the complex sym- plectic group Sp(n, C) and the unitary group U(2n), namely, the unitary restriction of Sp(n, C), is denoted by Sp(n) and is called the unitary symplectic group (or simply symplectic group). Sp(n, C) is a simple and semisimple complex Lie group, and both Sp(n, R) and Sp(n) are simple and semisimp1e Lie groups. Moreover, Sp(n) is compact and simply con- 228 nected and gives one of the four series of simple, semisimple and compact Lie groups ( 249 Lie Groups). Let H" be the linear space of dimension n over the tquaternion field H. Define the inner prod uct of two elements x = (x I> .. . , x n ) and Y =(YI>"" Yn) in Hn by (x, Y)=Xd'l +... +x.:Yn CVi is the tconjugate quaternion of y;), and consider the group of alIlinear transforma- tions which leave this inner product invar- iant. Then this group is isomorphic to Sp(n), Sp(n) is thus compared with the orthogonal group O(n), which leaves invariant the inner product of a linear space over the field R of real numbers and with the unitary group U(n), which has the same property over the field C of complex numbers (c. ChevalIey [4, ch. IJ). M. Irreducible Representations of Symplectic Groups In the same way as for GL(n, K), the represen- tation Dm(A) == A @ ... @ A (tensor product of m copies of A) of Sp(n, C) can be decomposed into irreducible components using Young's diagram. Namely, for any Young's diagram T= T(/IJ2' ...Jd (k.s;n) such that the num- ber k of rows is not greater than n, an irreduci- ble representation D'(A;fl' ... Jk) of Sp(n, C) is determined. These DS(A;fI' ... Jk) are mutu- ally inequivalent, and Dm(A) can be decom- posed into the direct sum of representations D'(A;f1 ,12, ... Jd such that f = fl + ... + fk is equal to any of the values 111, m-2, m-4,.... The character of DS(A;fl' f2, ... Jk) is given by h(A)= \PI-n+1, PI-n+2 + PI-"' ..., PI + PI-2n+21, where Pi and IPI-n+I> PI-n+2 + Pl-n, ...1 have the same meaning as in the formula for the char- acters of the irreducible representations of GL(n,K). For the matrices A in Sp(n), D'(A;fIJ2' ..., fk) gives rise to a continuous irreducible representation of Sp(n). Furthermore, any con- tinuous irreducible representation of Sp(n) is equivalent to a representation D'(A;flJ2, ..., fk) corresponding to some diagram T. N. Relations among Various Classical Groups There are some isomorphisms (homomor- phisms) among the classical groups mentioned above. For general fields K  [1,5]. For finite fields K  151 Finite Groups I. For K = R or C, the following isomorphisms hold: SO(3)  SU(2)j{::J::I}, SU(2)Sp(I), SO(5)Sp(2)j {:t I}, SO(6)  SO(4)j{:tI} ( 248 Lie Alge- bras, 249 Lie Groups), 
229 O. Classical Groups over Noncommutative Fields Let V be a right linear space over a non- commutative field K. Then the set of alIlinear transformations of V forms a group under the multiplication defined by the composition of mappings. This group GL(V) is calIed the general linear group on V. It is isomorphic to the multiplicative group of alIn x n invertible matrices with entries in K. The commutator subgroups SL(V) and SL(n, K) of GL(V) and GL(n, K), respectively, are caIled the special linear group of degree n on Vand over K, respectively. Now, suppose that an element A of G L( V) leaves each element of a subs pace U of dimension n - 1 of V fixed. Choose an ele- ment x of V which does not belong to U, and set Ax = xex (mod U); ex E K depends not only on A but also on the choice of x, However, the conjugate class eX = {Aex}. -IIAE K*} of ex in the multiplicative group K* of K is determined only by A. In particular, if eX = {I} and A # I, then A is called a transvection. For a tmatrix unit E ij , BiAex) = 1+ exE ij is a transvection if i # j and ex # O. S L( V) coincides with the su b- group of GL(V) generated by all transvections, This fact also holds when K is a commutative field, except when n = 2 and K = F2' In this case, transvections generate the whole GL(2, 2), which is isomorphic to the symmetric group 6 3 of degree 3 and does not coincide with the commutator subgroup, The center 3 of GL(n, K) consists of all scalar matrices corre- sponding to nonzero elements in the center of K. Let C be the commutator subgroup of the multiplicative group K* of K. Then for n? 2, GL(n; K)jSL(n, K) is isomorphic to K* jC. This isomorphism can be obtained by appropriately defining, for A EGL(n, K), an element detA of K* jC which is called the determinant of A [6,9]. The centef30 of SL(n, K) is {exI I exnE C}. The quotient group PSL(n, K)=SL(n, K)j30 is caIled the projective special linear group of degree n over K. If K is a noncommutative field, then PSL(n,K) (n?2) is always a simple group [5,8]. Next, let K be any field (commutative or noncommutative), and let V be a right linear space of dimension n over K. Consider a Her. mitian form fIx, y) ( 256 Linear Spaces) on V relative to an tinvolution J of K. If for a fixed element f. in the center of K we have fIx, y) == f.f(y, x), then f is called an f.-Hermitian form. For the rest of this article, f is assumed to be an f.-Hermitian form on V. Let W be a subs pace of V. If fIx, y) =0 for any x, yE W, then W is calIed a totally isotropic subspace. The largest dimension m of the totalIy iso- tropic subspaces of V is caIled the index of f We always have 2m.s;n, If f(Ax, Ay)= f(x,y) 600 Classical Groups for any x, yE V, then A is called a unitary trans- formation relative to f The set Urn, K,f) of all unitary transforma- tions relative to f forms a subgroup of G L( V). This group is calIed the unitary group relative to f Also, the group SU(n,KJ)= U(n,KJ)n SL(n, K) is called the special unitary group. When J = 1 and f. = 1, a unitary transformation (unitary group) is caIled an orthogonal trans- formation (orthogonal group), and U (n, KJ) is written as O(n, KJ). Also, when J = 1 and f.= -1, a unitary transformation (unitary group) is caIled a symplectic transformation (symplectic group), and Urn, KJ) is written as SpIn, K). In fact, in these cases, for arbitrary choice of f, the corresponding groups are mutuaIly isomorphic. An f.-Hermitian form f is called an f.-trace form if for any x E V, there exists an ex E K which satisfiesf(x,x)=ex+f.ex J . If J = 1, f.=-1 (hence K is commutative) or f.= 1 and K is of characteristic #2, then any f.-Hermitian form is an f.-trace form. Iff is an f.-trace form, a linear mapping B of any subspace W of V into V such that for any x, YE W, f(Bx, By) = fIx, y) can be extended to an element A of the unitary group U (n, K, f) relative to f (Witt's theorem). In particular, Urn, K,f) acts transitively on the maximal totally isotropic subs paces, and their dimensions are equal to the index m of f Now, let P be a Pythagorean ordered field (an or- dered field which contains square roots of any positive element), If K = P and J = 1, or if K = P(), or if the noncommutative field K is a tquaternion algebra over P and J is the operation oftconjugation of K, then for two Hermitian forms f, /" their unitary groups Urn, K,f) and Urn', KJ') are isomorphic if and only if n = n', and the indices of f and /' are equal. In this case, U (n, KJ) can be writ- ten as U (n, m, K), where m is the index of f If the field K is a tquaternion algebra over P and f is an tanti-Hermitian form, there exists an orthogonal basis (e;) of V such that f(ej, e i ) = j (quaternion unit), 1.s; i.s; n, Hence, in this case, the,unitary group U (n, K, f) relative to f is determined only by nand K. Suppose that we are given an f.-trace form f over a general field K whose index m is not equal to O. We exclude the case where J = 1 and f.= 1. Then the unitary group U(n, KJ) contains transvections, Let T(n, KJ) denote the subgroup of U (n, K,f) generated by trans- vections which are unitary transformations, If m? 2, then T(n, KJ) is the commutator sub- group of Urn, K,f). The center Tv" of T(n, KJ) coincides with the intersection of T(n, K,f), and the center 3 of G L( n, K). If n? 3 and K contains more than 25 elements, then the quo- tient group T(n, KJ)jTv" is a simple group [6]. Also, if K is commutative and n? 2, m? 1, 
60 Ref. Classical Groups J # 1, then T(n, KJ)= SU(n, KJ), except for the case where n = 3, K = F4' If K is the field R of real numbers, the field C of complex numbers, or the quaternion field H, then GL(n, K), SL(n, K), and Urn, KJ) are all Lie groups. In particular, SL(n, K) and Urn, K,f) are simple Lie groups except in the following three cases: (1) n = 1, K = R or C; (2) n=2, K =R, J = 1, £= 1; (3) n==4, K =R or C, J= 1, £= 1, m=2. In cases (1) and (2) they are commutative groups, and in case (3) they are locally direct sums of two noncommutative simple groups. Suppose that K = H. Since H contains C as a subfield, a vector space V of dimension n over H has the structure of a vector space of dimension 2n over C. From this fact, GL(n, H) can be considered as a subgroup of GL(2n, C) in a natural way. Each of the complex classical simple groups G = SL(n, C), SO(n, C), Sp(n, C) has the struc- ture of an talgebraic group defined over R ( 13 Algebraic Groups). The real forms of G, i.e., the algebraic subgroups of G whose scalar extension to C is G, can be realized as SL(n, K), Urn, KJ) corresponding to K =R, C, H. Namely, a real form of a complex classical group G is conjugate in G to one of the follow- ing groups: (i) The real forms of SL(n, C): SL(n, R) (type AI); SL(k, H) only for n = 2k (type All); and the special unitary group SU(n, m, C), O.s;m.s; [n/2J, relative to a Her- mitian form f of index m (type AlII). (ii) The real forms of SO(2n + 1, C): the proper ortho- gonal group SO(2n+ l,m,R), O.s;m.s; n, relative to a quadratic form of index m on a space of dimension 2n+ 1 (type BI and BII). (iii) The real forms of SO(2n, C): SO(2n, m, R), O.s; m.s; n (type DI and DII); and Urn, HJ) relative to an anti-Hermitian form f on H (type DIll). (iv) The real forms of Sp(n, C): Sp(n, R) (type CI); the unitary group U(2n, m, H), O.s; m.s; n, rela- tive to a Hermitian form f of index m on H (type CII); and Sp(n) corresponds to the special case m == 0, The quotient groups of these real forms by their centers can all be realized as the groups of automorphisms of semisimple alge- bras with involutions J which commute with J (A. Weil [lOJ). References [IJ B. L. van der Waerden, Gruppen von Iinearen Transformationen, Erg. Math., Springer, 1935 (Chelsea, 1948). [2J H. Weyl, The classical groups, Princeton Univ. Press, 1939, revised edition, 1946. [3J N. Bourbaki, Elements de mathematique, Algebre, ch, 9, Actualites Sci. Ind" 1272a, Hermann, 1959. 230 [4J C. Chevalley, Theory of Lie groups I, Princeton Univ. Press, 1946. [5J J. Dieudonne, Sur les groupes classiques, Actualites Sci, Ind., Hermann, 1948. [6J J. Dieudonne, La geometrie des groupes classiques, Springer, 1955. [7J C. Chevalley, The algebraic theory of spinors, Columbia Univ. Press, 1954. [8J M. Eichler, Quadratische Formen und orthogonale Gruppen, Springer, 1952. [9J E. Artin, Geometric algebra, Interscience, 1957. [lOJ A. Weil, Algebras with involutions and the classical groups, 1. Indian Math. Soc., 24 (1960),589-623. 61 (111.16) Clifford Algebras A. Definitions and Basic Properties Let V be an n-dimensional tlinear space over a field K, and let Q be a tquadratic form on V. Denote the ttensor algebra over V by T(V), the tensor multiplication by @. Let I(Q) be the two-sided ideal of T(V) generated by the ele- ments x @ x - Q(x)' 1 (x E V). The resulting tquotient associative algebra T(V)/I(Q) is then denoted by C(Q) and is called the Clifford algebra of the quadratic form Q, The elements of C(Q) are called Clifford numbers. The composite of two canonical mappings r: V-->T(V), 0": T(V)-->c(Q) is a linear injection 0" 0 r: V --> C( Q). Hence we can regard V as a linear subs pace ofc(Q) via O"or. Then C(Q) is an associative algebra over K generated by 1 and V, Furthermore, x 2 = Q(x)' 1 for every x in V, Indeed, C(Q) is the universal associative algebra with these properties. That is, let A be any associative algebra with a unity element, and let f: V --> A be a linear mapping such that f(X)2 = Q(X)' 1 for every x in V. Then f can be extended uniquely to an algebra homomor- phism J: C(Q)-->A with J(l)= 1. Furthermore, let <I> be the tsymmetric bilinear form associ- ated with Q:<I>(x,y)=Q(x+ y)-Q(x)-Q(y), x,yE V. Then xy+ yx=<I>(x,y)'1 for every x, y in V. C(Q) is of dimension 2 n over K. If e" ''', en is a basis of V, then 1, e i , eie j (i <j), ..., e l e 2 ... en form a basis of C(Q). In particular, if {e;} is an orthogonal basis relative to Q, we have eie j = -eje i , ef=Q(eJ 1; i,j = 1, .. . , n, i # j. (1 ) 
231 In this case, C(Q) can be defined as an as- sociative algebra (with a unity element) gen- erated by the {e i } together with the defining relations (I). In particular, for Q = 0, C(Q) is the texterior algebra (tGrassmann algebra) over V. B. The Principal Automorphism and the Principal Antiautomorphism of C(Q) There exists a unique automorphism ex of the algebra C( Q) such that ex (x) = - x for every x in V. This automorphism ex is called the principal automorphism of C( Q), and we have ex 2 = 1. Also, there exists a unique antiautomorphism fJ of the algebra C(Q) such that fJ(x) =x for every x in V. This antiautomorphism fJ is caned the principal antiautomorphism of c(Q), and we have {32 = 1. F Or the rest of this article we assume that the tdiscriminant of Q is #0. We also assume for the sake of simplicity that the characteristic of K is #2. Let C+ =C+(Q)=K'1 + y2+ y4 + ''', and C- =C-(Q)= y+ y3+ y5 +".. Then C( Q) is the direct sum of the linear subspaces C+(Q) and C- (Q). Furthermore, C+C+ cC+, C+C- cC-, C-C+ cC-, and C- C C C+. Thus C(Q)= C+ + C- has the structure of a tgraded algebra with the index group {::f:: I}, and C+ is a subalgebra of C(Q). The elements of C+ (Q), C- (Q) are called even elements and odd elements, respectively. We have dim C+(Q)=dimC (Q)=2 n - 1 . C. The Structure of C(Q) and C + (Q) C(Q) and C + (Q) are both tseparable, tsemi- simple associative algebras over K. Suppose n is even: n = 2r. Then C( Q) is a tsimp1e algebra with K as its center; the center Z of C+ (Q) is 2- dimensional over K. Let e I' ... ,en be an ortho- gonal basis of V. Then 1 and z = 2' el ... en form a basis of Z, and we have Z2 = 22'( -1),Q(ed... Q(e n ) = (-I)' D, where D is the tdiscriminant of <I> relative to the basis {e;}. Thus if (-1)' D has a square root in K, Z:::;;K EB K (direct sum), and so C+(Q) is decomposed into the direct sum of two simple algebras. If (-I)'D does not have a square root in K, then Z is a field and C+ (Q) is a simple algebra. In particular, if the tindex of Q (i.e., the dimension of a maximal ttotally sin- gular subspace of Y ( 348 Quadratic Forms)) is r, C(Q) is isomorphic to the ttota1 matrix algebra of degree 2' over K, and C+(Q) is isomorphic to the direct sum of two copies of the total matrix algebra of degree 2,-1 over K. N ow suppose that n is odd: n = 2r + 1. Then 61 E Clifford Algebras C+(Q) is a simple algebra with K as its center. (In particular, if Q is of index r, then C + (Q) is isomorphic to the total matrix algebra of degree 2' over K.) The center Z of C(Q) is 2- dimensional over K, and we have C(Q):::;; Z @K C+ (Q). If e 1 , ..., en is an orthogonal basis of V, then 1 and z = e 1 ... en form a basis of Z. Putting Z'=2'+IZ, we have z'2=2( -1)'D, where D is the discriminant of <I> relative to {eJ Thus if 2( -1)'D has a square root in K, C(Q) is the direct sum of two 2 2 '-dimensiona1 simple algebras. If 2( -1)'D has no square root in K, then C(Q) is a simple algebra. D. The Clifford Group Let G be the set of an invertible elements s in C(Q) such that sVs- 1 = V. Then G forms a group relative to the multiplication of C(Q). This group G is called the Clifford group of the quadratic form Q. The subgroup G + =Gnc+(Q) is called the special Clifford group. The linear transformation <p(s): X--> sxs- 1 of V induced by SEG belongs to the torthogonal group O(Q) of V relative to Q. Moreover, the mapping s--><p(s) is a homomor- phism from G into O(Q). Thus <p is a trepre- sentation of G on V, This representation <p is called the vector representation of G. The tker- ne1 of <p consists of invertible elements in the center Z of C(Q). If xEGn V, then Q(x)#O and - <p(x) is the reflection mapping of V relative to the hyperplane orthogonal to x, Ifn=dim V is odd, <p(G)=<p(G+)=SO(Q). Ifn is even, <p(G) =O(Q), <p(G+)=SO(Q). Exploiting the principal antiautomorphism fJ of c(Q), we obtain a homomorphism N: G+ --> K* (the multiplicative group of K) defined by N(s) = fJ(s)s (SE G +), and N(s) is caned the spinorial norm of SE G +. The normal subgroup of G+ defined as the kernel of N is denoted by Gt and is called the reduced Clifford group (of Q). The subgroup <p(Gt) of SO(Q) is denoted by ot (Q) and is caned the reduced orthogonal group. In particular, when the ground field K is the real number field R, 0t (Q) coincides with the tidentity component of the tLorentz group O(Q). Furthermore, if Q is definite, 0t (Q):::;; SO(n), so that the identity component Spin(n) of Gt is a tsimply connected tcovering group of SO(n) via the covering homomorphism <p (with each point in SO(n) covered twice), The group Spin(n) is caned the spinor group of degree n. E. Spin Representations In this section we assume that the ground field K is the complex number field C and that n 
61 Ref. Clifford Algebras =dim V? 3. Then we have 0; (Q) SO(n, C), so G; is a simply connected covering group of SO(n, C) via the covering homomorphism <po In this section we denote Gt by Spin(n, C) and call it the complex spinor group of degree n. Spin(n, C) is the tcomplexification ( 249 Lie Groups) of the compact Lie group Spin(n) and is a complex analytic subgroup of the tcom- plex Lie group C(Q)* consisting of all inver- tible elements of C(Q). With the bracket operation [x, y J = xy- yx, C(Q) becomes the tLie algebra of c(Q)*. Furthermore, the Lie subalgebra of C(Q) associated with the com- plex analytic subgroup Spin(n, C) is given by Li<j Ceiej, where e 1, ... ,en is an orthogonal basis of V. The spin representations of the group Spin(n, C) are defined as follows: (1) When n is odd: n=2r+ 1. Since C+ (Q) is isomorphic to a total matrix algebra of degree 2' over C, C+ (Q) has a unique (up to equiva- lence) tirreducible representation p, which is of degree 2'. The restriction of p on Spin(n, C) (on Spin(n)) defines an irreducible representation p of degree 2' of Spin(n, C) (of Spin(n)); p is called the spin representation of the group Spin(n, C) (of Spin(n)). The elements in the representation' space of p are called spinors. Thus we can say that a spinor is a quantity with 2 r components which obey the transformation law according to the spin representation ( 258 Lorentz Group). This representation p defines a rep- resentation of the Lie algebra so(n, C) of Spin(n, C) (note that so(n, C) is a tcomplex simple Lie algebra of type B,). This represen- tation of so(n, C) is also called the spin repre- sentation of so(n, C). Note that p is not well defined on SO(n, C) or on SO(n); p is of valence 2 on SO(n, C) or on SO(n). (2) When n is even: n == 2r. Since C(Q) is isomorphic to a total matrix algebra of degree 2' over C, C(Q) has a unique (up to equiva- lence) irreducible representation p, which is of degree 2'. The restriction of p on Spin(n, C) (on Spin(n)) defines a representation p of degree 2' of Spin(n, C) (of Spin(n)); p is called the spin representation of the group Spin(n, C) (of Spin(n»). This representation p is, however, not irreducible; p is decomposed into the direct sum of two irreducible representations p+ and p_. They are not equivalent to each other, and both are of degree 2'-1. By taking a suitable minimal left ideal L of C(Q) as the representa- tion space of the representation p, we obtain the representation spaces L +, L - of p+, p_, respectively, by putting L + = L n C+ (Q) and L - ==LnC-(Q). The representation p+ (or p-) is called the half-spin representation of the group Spin(n, C) or of the group Spin(n). The elements in the representation space of p + (or p -) are called half-spinors. Again, p + and p - are not well defined on SO(n, C) or on SO(n). 232 They are of valence 2 on these groups. The representations of the Lie algebra of the Lie group SO(n, C) (note that this Lie algebra is a tcomplex simple Lie algebra of type Dr) asso- ciated with p + , P - are also called half-spin representations of this Lie algebra. References [lJ N. Bourbaki, Elements de mathematique, Algebre, ch. 9, Actualites Sci. Ind., 1272a, Hermann, 1959. [2J C. Chevalley, The algebraic theory of spinors, Columbia Univ. Press, 1954. [3J M. Eichler, Quadratische Formen und orthogonale Gruppen, Springer, 1952. 62 (Xl.g) Cluster Sets A. Cluster Sets of Functions Meromorphic in an Arbitrary Domain Let D be an arbitrary tdomain in the complex z-plane, r its boundary, and E a ttotally dis- connected closed set contained in r. Let w = f(z) be a single-valued tmeromorphic func- tion defined in D. Then for each point Zo in r, we can define the following sets related to the mapping w= f(z) in the complex w-plane (or on the complex w-sphere C). The Cluster Set. A value ex is called a cluster value of f(z) at Zo if there exists a sequence of points {ZH} such that ZnED, Zn-->ZO' f(zn)-->ex. The totality C D (/, zo) of all the cluster values of f(z) at Zo is called the cluster set of f(z) at Zo or, more precisely, the interior cluster set. It is a nonempty, closed, but not necessarily con- nected set. The Boundary Cluster Set. The set of all values ex such that there exist a sequence of points gn} of r - {zo} (resp. r - {zo} - E) and a se- quence of points {w n } in the complex w-plane satisfying (n-->zo, WnECD(f,(n), wn-->ex is called the boundary cluster set of f(z) at Zo and is denoted by Cdf, zo) (resp. Cr-df, zo)). These are closed sets, and Cr-E(f, zo) c Cr(f, zo) c CD(f, zo). If Zo E r - E or Zo is an isolated point of E, then Cr-df, zo) = CrCf, zo). Furthermore, Cdf, zo) 
233 (resp, Cr-E(f, zo» is empty if and only if Zo is an isolated boundary point (resp. Zo is an exterior point of r - E). Range of Values. The set of values (X such that Zn ED , zn-->zO, f(Zn) = (X is called the range of values of f(z) at Zo and is denoted by RD(f, zo). In other words, R D (/, zo) is the set of values (X assumed by f(z) infinitely often in any neighborhood of Zo and is a tG o - set. The Asymptotic Set. Let Zo be an taccessible boundary point of D. If f(z) converges to a value (X as z tends to Zo along a simple arc in D terminating at Zo, then (X is called an asymp- totic value of fat Zo' The totality AD(f, zo) of asymptotic values of fat Zo is called the asymptotic set of fat zo0 If Zo is an inacces- sible boundary point, we let AD(f, zo) be the empty set. B. Iversen-BeurIing-Kunugui Theorems Suppose now that E is empty, and put Q= CD(1, zo)- Cd 1, zo). tIversen's theorems in the case where D is the unit disk Izi < I and Zo is a point on Izl = I are generalized as follows. (1) O-RD(1,zo)cAD(1,zo) (K. Noshiro, 1936). (2) First Beurling-Kunugui theorem: If Zo is not an isolated boundary point, then Q is an open set. (3) Second Beurling-Kunugui theorem: Sup- pose that the open set 0 is not empty. Then f(z) assumes every value belonging to each component Qn of Q infinitely often, with two possible exceptions belonging to Qn, that is, On n (C: - RD(1, zo)) consists of at most two values (an extension oftPicard's theorem on an isolated essential singularity). Next suppose that E is not empty and of t10garithmic capacity zero, and put 0 = CD(f,zo)-Cr-E(f,zo).If(xEO-RD(1,ZO)' then either (X is an asymptotic value of f(z) at Zo or there exists a sequence of points (nE E (n = 1,2,...) converging to Zo such that (X is an asymptotic value of f(z) at each (n (Noshiro, 1937). Furthermore, if Zo is contained in the closure of r - E, then Q is an open set (which may be empty), and Q- RD(1, zo) is at most of logarithmic capacity zero (M. Tsuji, 1943). If E is contained in a single component ro of the boundary r, Zo is contained in the closure of r - E, and Q is nonempty, then w== f(z) assumes every value belonging to each component Qn of Q infinitely often, with two possible excep- tions belonging to Qn (Noshiro, 1950). In par- 62D Cluster Sets ticular, if f is bounded in a neighborhood of Zo, the number of such exceptional values is at most 1. This is stiIl true if each point of E is contained in a boundary component that is a tcontinuum containing at least two points and Q is not empty (M. Herve, 1955). However, this conclusion does not hold if we remove the hypothesis on the set E (K. Matsumoto, 1960). C. Cluster Sets of Functions Meromorphic in the Unit Disk Let D be the unit disk {Izl < I}, Zo =e iOo be a fixed point on the unit circumference r, A be an open arc of r containing Zo, and E be a set of t1inear measure zero such that Zo E E c A. With every e iO E A - E we associate an arbi- trary simple arc Ao in D terminating at eie and the curvilinear cluster set CA,(f, e iO ), defined as the set of all values (X such that ZnEAO' zn-->e iO , f(zn)-->(X' We put Cf-d1,zo)= n M" ,>0 where M, denotes the closure of the union CA,(1, e iO ) for all e iO in the intersection of A - E with Iz-zol<r. By using the above cluster set Cf-d1, zo) instead of the boundary cluster set C r - df, zo), we obtain results similar to those in the pre- ceding section (M. Ohtsuka, 1950; Noshiro, 1955). Many interesting results have been obtained by F, Bagemihl and W, Seidel, E, F, Collingwood, O. Lehto and K. I. Virtanen, and others concerning the cluster sets of func- tions meromorphic in the unit disk. They studied functions of Seidel's class U, normal meromorphic functions, and other functions where the class U is the totality of regular bounded functions in the unit disk possessing almost everywhere on Izi = 1 radial limits of the constant modulus I; a nonconstant mero- morphic function f(z) in I zl < 1 is normal if the family {I(T(z))} is tnormal in the sense of Montel, where T(z) is an arbitrary con- formal mapping of Izl < 1 onto itself. D. The More General Case The definitions of cluster sets are also available for arbitrary functions for which neither ana- lyticity nor continuity is assumed. If there exist two simple arcs Al and A 2 in the unit disk D terminating at a point z = e iO such that C A , (1, e iO ) n CA,(1, e ie )= 0, then z == e iO is called an ambiguous point of f Bagemihl proved the following: The set of am biguous points of an arbitrary complex- valued function defined in the unit disk D is at 
62 E Cluster Sets most countable [5]. Under the same hypoth- esis, the set of points e i8 such that C n ( f, e i8 ) # C r (/, ei8) is at most countable (Collingwood, 1960). This result shows the importance of introducing the cluster set C1-E(f, 2 0 ) that was previously mentioned. E. History The theory of cluster sets originated from the tvalue distribution theory of analytic functions in the neighborhood of their essential singular- ities. The first systematic results were those of F. Iversen and W. Gross, obtained about 1920. Subsequent significant contributions were made by Seidel, J. L. Doob, M. L. Cart- wright, A. Beurling, and others. Since 1940, some important results have been obtained by K. Kunugui, S. Irie, Y. T6ki, Y. Tumura, S. Kametani, Tsuji, Noshiro, and other Japa- nese mathematicians. Many results have been extended to tpseudoanalytic functions. As can be seen from the Bagemihl ambiguous point theorem, some properties of cluster sets are not intrinsic to analytic mappings [2,5]. On the other hand, it seems to be an interesting problem to extend the theory of cluster sets to the case of analytic mappings between open Riemann surfaces. References [1] K. Noshiro, Cluster sets, Erg. Math., Springer, 1960. [2] E. F. Collingwood and A. J. Lohwater, The theory of cluster sets, Cambridge Univ. Press, 1966. [3J K. Kunugui, Sur un theoreme de MM. Seidel-BeurJing, Proc. Japan Acad., 15 (1939), 27-32; Sur un probleme de M. A. Beurling, Proc. Japan Acad., 16 (1940), 361-366. [4] M. Ohtsuka, On the cluster sets of analytic functions in a Jordan domain, 1. Math. Soc. Japan, 2 (1950), 1-15. [5J F. Bagemihl, Curvilinear cluster sets of arbitrary functions, Proc. Nat. Acad. Sci. US, 41 (1955), 379-382. 63 (XVI.3) Coding Theory A. General Notions of Coding Theory When we wish to store, to search for, or to send information in the presence of noise efficiently and with the least error, we can 234 apply various bounds to the efficiency ( 213 Information theory). Sophisticated coding operations are required in order to achieve efficiencies as close as possible to the bounds. In the abstract sense, the information is gener- ally understood to be a choice of an element from a finite set X. For implementation, we take a set K of q elements called alphabets. Each element of K is called a letter. We con- sider the direct product K n , i.e., the set of all sequences of n letters. An injection IjJ from X into K n is called encoding. The sequence ljJ(x) for x E X is called a code word, and the image IjJ(X) (a collection of code words) is called a code. Such a code is also called a block code. The noise is represented by a mapping w: Kn -->Kn, but usually it is restricted to a certain subset Q of the set of such mappings with special properties, For example, we usually assume that the sequences x E K n and w(x) are different only at less than d letters, where d is a preassigned constant. The inverse mapping <p, i,e" a mapping <p: K n --> X satisfying <p 0 ljJ(x) = x for all XEX, is called the decoding ofljJ. A code IjJ satisfying the property woljJ(x)<f=IjJ(X) for an XEX and for all WEQ is called error-detecting with respect to the noise Q. If IjJ has the decod- ing <p satisfying <p 0 W 0 ljJ(x) = x for aJl x EX and for all WEQ, IjJ is calJed error-correcting with respect to the noise Q. To discuss such properties, we can assume X = IjJ(X) ( c K n ) without restricting the generality, so hereafter we assume this condition, Also, we take q = 2 unless explicitly stated otherwise, Because in many communication systems q = 2 is commonly adopted, and the generalization for other prime powers q can be obtained naturally. B. Bounds for the Size of Codes Let x=(x I , ...,x n ), y=(y" ...,Yn)EK n . The Hamming distance d(x, y) == dH(x, y) between the elements x and y is the total number of unequal bits (Xi#yJ We put d min = min{dH(x,y)lx,YEX,x# y}, When e= max{dH(x,w(x»)lwEQ,xEX} is less than d min , the errors due to Q can be detected, and if d min ? 2e + 1, the errors can be corrected. The maximal integer t satisfying d min ? 2t + 1 is often called the error-correcting capability. Several important relations are known among ddim, t, n (the length of the code), and IXI (the size of the code word) as follows. Hamming bound: IXI.s; 2n/L:d7). A code satisfying the equality here is called a perfect code [6]. Plot- kin bound: d min .s;nIXI/(2(IXI-l)). On the other hand, if the V arsharmov-GiIbert-Sacks bound 2 n - k > L1,:,\}-2(nil) is satisfied, there exists an (n, k)-linear code (Section C). 
235 C. Linear Codes Let K = GF(2) = {O, I}, K n be an n-dimensional vector space over K, and X be a k-dimensional linear subspace. Then X is called a group code or (n, k)-linear code. In the present case, we can take a suitable basis of K for which there exists a k x (n - k) matrix P over K such that every vector x = (x I' '" ,x n ) representing a code word is expressed as x=(zP,z) by a suitable vector z, and conversely, every vector of this form is a code word, i.e., a vector XEX is a code word if and only if (Xl, ..., xn-d = (X n -k+l, ..., xn)P= o. (*) Introducing the matrix H = [In-k> - pTJ, where In-k is the unit matrix of order n - k and T indicates the transpose, the condition (*) is equivalent to XHT =0. Therefore, H is called the parity check matrix, X n -k+ I' " . , X n are called the information bits, and x" "', X n - k are called the check bits. If x is deformed to y = wx by a noise w, i.e., the original signal x is transmitted as signal y, we call s = y H T = (y - x)H T the syndrome of the transmitted signal y. If we have an algorithm to determine the error vector e = y - x from the syndrome s, we have a decoding with error-correcting property. For the linear code, it is evident that d min = min {the number of nonzero elements of xix (#O)EX}. The Hamming code is given by n = 2 m - I (m being an integer? 2), k = n - m, where the (i,j)- component h ij (i= I,..., m;j = 1,..., n) of the parity check matrix H is given by the ith bit of the number j -- 1 expressed in the binary (2- adic) number system. The check bits are x j (j = 2 0 , 2 1 , ... , 2 m - I ), and all other bits are the information bits. This code has the error-cor- recting property with respect to the noise w for which the nonzerO component of e = wx- x is at most 1. In fact, from the syndrome s = (SI'"'' sm), we compute ]=SI + 2s 2 +... + 2 m - I Sm and put ej= I, ej=O for allj#J; irj=O, we can put e=O. The Hamming code is a per- fect code. D. Cyclic Codes The cyclic code is a special case of a linear code, This is the object of one of the most important applications of the theory of finite fields. By using this theory, we can actually construct the codes with high for error- correcting capability in which the encoding and the decoding operations are performed algebraically. An (n, k)-linear code X over GF(2) is called a cyclic code if x =(x" X 2 , ..., Xn)E X implies (x n , x t ,x 2 , ... ,xn-d, (xn-1,xn,x 1 , ... ,x n - z ), ... ,(Xl, 63 Ref. Coding Theory "', X n , XdEX. To each XEX, we can associate a polynomial of one variable X()=XI + X2 + ... + xnn-I over GF(2). Let gx() be the poly- nomial of the least degree n - k among the polynomials corresponding to XEX. gx() is called the generator of X, because the property XEX and the property that x() is divisible by gx() are equivalent. gx@ also divides n_1. Let the quotient be hx()=(C-1)/gx()' Then XEX is equivalent to x()hx()=O (mod(n- I)). hx() is called the parity check polynomial. The BCH code (Bose-Chaudhuri-Hocquenghem code) defined below is a typical example of a cyclic code. Let (X be an element of order n in GF(2 m ) (the finite field with 2 m elements), and let gi() be the polynomial of the lowest degree in GF(2) [J for which (Xi is a root (i = 1, 3, "', 2t-I). The BCH code is given by the least common multiple g() of the polynomials 9 1 (), g3()' ..., g2'-1 (). The BCH code has at least n - mt information bits and satisfies d min ? 2t + 1, and algebraic decoding methods are known [3,5]. The Hamming code is a BCH code when t= 1, n=2 m - \. E. Other Codes Other important types of codes include con- volutional codes (not block codes) for correct- ing errors in consecutive digits (burst errors) and Goppa codes (an extension of the BCH class). For correcting burst errors, special cyclic codes or their mixtures are also used. This field of research is closely connected to information theory, algebra, and various ap- plications of combinatorial analysis, such as texperimental design [3]. References [IJ J. Wolfowitz, Coding theorems of informa- tion theory, Springer, 1961. [2J R. G. Gallager, Low-denity parity-check codes, MIT Press, 1963. [3J E. R. Berlekamp, Algebraic coding theory, McGraw-Hili, 1968, [4J S. Un, An introduction to error-correcting codes, Prentice-Hall, 1970. [5J W. W. Peterson and E. J, Weldon, Jr., Error-correcting codes, MIT Press, second edition, 1972. [6J A. Tietavainen, On the non-existence of perfect codes over finite fields, SIAM J. Appl. Math., 24 (1973),88-96. [7J E. R. Berlekamp, Goppa codes, IEEE Trans. on Information Theory, IT -19 (1973), 590-592. 
64A Cohomology Operations 64 (IX.8) Cohomology Operations A. General Remarks The notion of cohomology operations was introduced by L. S. Pontryagin and N. E. Steenrod in order to solve homotopy classifi- cation problems ( 202 Homotopy Theory). Since then, numerous works have proved the importance of cohomology operations as applied to thomotopy theory, tdifferential topology, and other branches of topology. In fact, the use of cohomology operations is indispensable in studying problems related to thomotopy groups, tcharacteristic classes of manifolds, etc. We denote by the symbol H*(X;A)= L H"(X; A) the tsingular cohomology ring of a topological space X with coefficients in an t Abelian group A. B. Primary Cohomology Operations A (primary) cohomology operation (or simply an operation) <p is a tnatural transformation <p:ITH 1 ,( ;A;,)-->ITH m -( ;B) A  between the cohomology functors defined on the tcategory of topological spaces and con- tinuous mappings. That is, <P is a family of mappings satisfying the following conditions: (1) For each space X, <p defines a mapping <p: IT H1,(X; AA)--> IT Hm.(x; B) A  that is not necessarily additive. (2) For each mapping f: X --> Y, the commu- tativity f* 0 <p = <p 0 f* holds in the diagram IT H1!(X; A)!.IT Hm_(X; B) A if'  if' IT H1,(y; A)!. IT Hm_(y; B). ;.  We list here two trivial examples, (I) Addition of cohomology groups deter- mines an operation <p: H'(X; A) x H1(X; A)--> H1(X;A). (II) The tcup product determines an operation <p:H 1 1(X;Ad x H I 2(X;A 2 )-->H 1 l +12(X; Al @ A 2 ) denoted by <p(ex, fJ) = ex '-' fJ. The composite of two cohomology opera- tions is defined in the obvious way. Among cohomology operations the most important ones are operations of one variable. A co- 236 homology operation of type (A, I; B, m) is a natural transformation <p:H 1 ( ;A)-->H m ( ;B). D(A, I; B, m) denotes the Abelian group consist- ing of all such operations. Denote by Hm(A, I; B) the mth tcohomology group of an tEilenberg-MacLane space K(A, I), and let U E Hl(A, I; A) be the tfundamental class of K(A, I). If X is a CW complex, by assigning f*u to each f: X --> K (A, I), we obtain a one- to-one correspondence between the set of the homotopy classes n(X, K(A, I)) and the co- homology group Hl(X; A) ( 70 Complexes F). Hence by utilizing condition (2), it can be shown that the value of <p on H1(X; A) is uniquely determined by its operation on HI(A, I; A). Thus the assignment <p--><pu defines the isomorphism D(A, I; B, m)::::;: Hm(A, I; B). Here we should remark that the isomorphism n(K(A, I), K(B,m))::::;:Hm(A, I; B) defines a one- to-one correspondence D(A, I; B, m)-->n(K(A, I), K(B, m)). In some cases Hm(A, I; B) vanishes, for example, when O<m<I; 1= 1 <m, A=Z; 1=2, m=2m' + 1, A=Z; 1=21' <m=2m' + 1, A == Z ' B = Q; 1= 21' + 1 < m, A = Z, B = Q; A is finite, B = Q; etc. The following four types of operations to- gether with the two above are caIled elemen- tary operations: (III) Homomorphisms induced by a coeffi- cient homomorphism: There are homomor- phisms 1]*: H1(X; A)--> H1(X; B) induced by a homomorphism 1]: A --> B. (IV) Bockstein (cohomology) operation: This operation is given by the tconnecting homo- morphisms (j* :H1(X; C)-->Hl+I(X; A) asso- ciated with a short exact sequence O-->A-->B--> C -->0 of coefficient groups ( 200 Homo- logical Algebra). For example, the coefficient sequence O-->Z-'!. Z--> Zn-->O (Zn = ZjnZ) defines a Bockstein operation (or Bockstein homomor- phism), which is usuaIly denoted by (ljn)6 or n' (V) Steenrod (or reduced) square operations Sqi (i? 0): Sqi are sequences of operations defined by the following five axioms [2,5]: (VI) For each pair of integers i?O and I? 0, Sql: H1(X; Z2)-->Hl+I(X; Z2) is a natural transformation of functors that is a homomorphism. (V2) SqO = 1. (V3) If degx = i, then Sqi x =x '-' X (cup product). (V4) Ifdegx<i, then SqiX=O. (V5) (Cartan formula) I Sqi(X '-' y) = L Sqj x '-' Sqi-j y. jO 
237 These five axioms imply the following two formulas: (V 6) Sq I is the Bockstein operation [32 of the coefficient sequence 0-->Z2-->Z4 -->Z2-->0, (V7) (Adem relations) If 0 < i < 2j, then . . [ i/2 J ( j_l_k ) . Sq'oSqJ= L. Sq'+J-kosl. kO 1-2k The binomial coefficient is taken mod 2. We can extend the definition of the Sqi so that they operate on the relative cohomology groups. Now (VI), (V2), and (V5) imply: (V8) If 15: H1(y; Z2)-->H 1 + I (X, Y; Z2) is the tcoboundary homomorphism, then 15 0 Sqi = Sqiob. (V') Steenrod pth power operations ;JJi (i  0): Let p be an odd prime. Then ;JJi is a sequence of operations defined by the following five axioms [5]. (V' 1) For each pair of integers i  0 and I  0, ;JJi: H1(X; Zp)--> H 1 + 2i (p-1)(X; Zp) is a natural transformation that is a homo- morphism. (V'2) ;JJ0 = 1. (V'3) If deg x = 2k, then ;JJk x = xp. (V'4) If degx < 2k, then ;JJkx = O. (V'S) (Cartan's formula) i g>i(X '-' y) = L ;JJj x '-' g>i- j y. jO These axioms imply the Adem relations for !?I'i and the Bockstein homomorphism [3p asso- ciated with the coefficient sequence O-->Zp--> Zp2-->Zp-->0 ( Appendix A, Table 6.11). We can extend the definition of;JJi to the relative cohomology groups too, and we ob- tain 15 0 qJJi =;JJi 0 b. (VI) Pontryagin pth power operations p' Let p be a prime. p is a system of operations satisfying the following five conditions [3]: (VI I) For each pair of integers I  0 and hl, p: Hl(X; Zph)-> Hpl(X; Zph+1) is a natural transformation. (VI2) IfI]:Zph+I-->Zph is a homomorphism defined by 1](1)= I, then we have l]*opx=xP, (VB) If P:Zph-->Zph+1 is a homomorphism defined by p(l) == p, then we have p(x + y) = px + Py+ :t: ( G) I P )P*(X i '-' yp-i), (VI4) p(x '-' y) == px '-' py. (VIS) If p > 2 and degx = 2k + 1 (odd), then px=O. Let A, B be finitely generated Abelian groups. Then the computation of H*(A, I) 64 B Cohomology Operations shows that each element of D(A, I; B, m) can be written as the composite of a finite number of the operations of the types (I)-(VI) (H. Cartan). Here we show some examples. Let u be the fundamental class of K(Z2' 2). Then the ele- ments of H 2 (Z2' 2; Z4) Z2, H 3 (Z2' 2; Z4) Z2, H4(Z2,2;Z4)Z4' and H5(Z2,2;Z4) Z4 + Z2 correspond to the cohomology opera- tions ..1.2*, A.I]* 0(15/2), J12, and J11]* 0 (15/4) 0 2 +..1.2*oSq 2 0Sql, respectively, where ..1., J1 denote integers satisfying O.s; A..s; I, O.s; J1.s; 3 and 1]: Z-->Z4' 2: Z2 -->Z4 stand for the homomorphisms defined by 1](1)= 1,2(1)=2, respectively. Suppose that we are given sets of integers Ii and mj' We define the stable (primary) coho- mology operation <p with respect to these sets of integers Ii and m j as a system of natural transformations <p: IT H"+l;( ; AA)--> IT W+ m .( ; Bp) A P satisfying the following condition, for all integers nO; (3) Let S:H1+I(SX;A)-->H1(X;A) denote the tsuspension. Then the commutativity So <p = <p 0 S holds in the diagram IT W+1,(X; A) !.IT W+m.(X; B) A is p is IT H"+12+ I (SX; A)!.IT H"+m.+1(Sx; B). A p This condition is equivalent to the commuta- tivity with the coboundary homomorphisms. For example, the cohomology operations (_I)n [3p: W(X; Zp)--> W+ I (X; Zp) define a stable cohomology operation [3. Sq\ i are also examples of stable cohomology operations. A stable cohomology operation <p of type (A, B) and of degree q is a sequence of coho- mology operations of type (A, n; B, n + q) de- fined for all integers n  O. (A, B)q denotes the Abelian group consisting of all stable coho- mology operations of type (A, B) and of degree q. When A=B, (A)=L;;'o(A,A)q is a tgraded algebra, where multiplication of two operations is given by their composition. Let p be a prime. Then (Zp) is called the Steenrod algebra modp and is denoted by (p). (2) is the augmented graded algebra over Z2 gen- erated by Sqi subject to the Adem relations. Suppose that we are given a sequence of non- negative integers 1= (i I' i 2 , ..., id. We call I an admissible sequence if is-I  2is holds for 2.s;s.s;k. We write SqI =Sqi 1 oSqi 20 ... oSqi,. If I is an admissible sequence, we say that SqI is an admissible monomial. The admissible mono- mials form an additive basis for (2), which 
64C Cohomology Operations has the structure of a tHopf algebra whose tcomultiplication t/J: (2)--> (2) @ (2) is given by t/JSqi = Ljo Sqj @ Sqi- j . The dual space (2)* = Homz)(2), Z2) gives a Hopf algebra that is the polynomial algebra gen- erated by i of degree 2 i -1, where i is the dual element of Sq2 i - 1 Sq2'-Z ... Sql with respect to the additive basis. The comultiplication <p* :(2)*-->(2)* @ (2)* is given by <P*i =L\ofi@ j' (p) has properties similar to (2) (p is an odd prime). In particular, (p) is a tHopf algebra generated by ;JJi and fJ subject to the Adem relations with comultiplication t/J given by t/JfJ = fJ @ 1 + 1 @ fJ and t/J;JJi = Ljo;JJj @ ;JJi-j. An additive basis of ll1(p) is given by {fJ'o;JJi1fJ'1 ...;JJikfJ'k}, where 8/=0 or I, i m ? pi m +1 + 8m (m? I) and i k ? 1. Such a sequence 1= (80, iI, 8 1 , ..., ik> 8d is also calIed an admis- sible sequence. Denote the dual element of ;JJpi-1 t J}'p'-Z ...;JJI and ;JJpi-1;JJpi-Z ...;JJI fJ by i and r i , respectively; then the dual algebra (p)* is isomorphic to the tensor product of the polynomial ring over Zp generated by I' 2' ... and the exterior algebra over Zp gen- erated by r l , r2, "'. The comultiplication <p* is given by <P*(i)=Ljofj@ j and <p*(r;) = r i @ 1 + Lo fj @ r j [5J ( Appendix A, Table 6.II1), C. Secondary Cohomology Operations Here we restrict our attention to a special type of operation treated by j, F, Adams [4J that has been proved to be powerful in applications. For a specified prime p, we write H +(X)= Lf;;1 Hi(X; Zp)' We can regard H +(X) as a tgraded left module over (p). Now let C s (s = 0, 1) be a pair of tleft free modules over (p), and let d: C I -->C o be a graded homomor- phism over (p), Suppose that we are given an element ZE C I such that dz = 0, For future pur- poses, Co and C I are assumed to have bases {CO,A} and {c 1 ,#} with degco'A =IA and degc l ,# = m#, respectively, in terms of which d has the representation dc l ,# = LA a#,AcO,;,' Then z is expressed in the form z=L#b#cI,W We say that <1> is a stable secondary coho- mology operation associated with the pair (d, z) if it satisfies the following four axioms: (I) Let Dn(d, X) be the module consisting of homomorphisms 8:C o -->H+(X) over (p) of degree n such that 8d==0. Putting 8(C O ,A)=X A , we can assume that Dn(d, X) = {8= I1 A x A E I1 A H n +/'(X)1 LAa#,AxA =O}. Let Qn(z, X) be the submodule of H +(X) consisting of ele- ments of the form (z), where  is an (p)- homomorphism: C I -->H+(X) of degree n-1. 238 In other words, Qn(z,X)==L#b#H"+m.-I(x). Then for each n ? 0 and each X, <1> is a mapping <1>: Dn(d, X)-->H+(X)/Qn(z, X). (2) For each mapping f:X --> Y, the commu- tativity f* 0 <1> = <1> 0 f* holds in the diagram Dn(d, X).!.H+(X)/Qn(z, X) if * i (* <I> Dn(d, Y)-->H+(Y)/Qn(z, Y). (3) Let S: H"+I (SX)-->Hn(X) denote the suspension. Then the commutativity So <1> = <1> 0 S holds in the diagram Dn(d, X).!.H+(X)/Qn(z, X) is is Dn+l (d, SX).!.H+(SX)/Qn+l (z, SX). (4) Let i: Y-->X be an injection such that i* 08=0, That is, i*x A = O. We can then find homomorphisms (over (p))1'/: Co-->H+(X, Y) of degree nand (: C I -->H+(Y) of degree n-l such that the following diagram commutes: H+( y)t- H+(X)t H+(X, Y)!':- H + (Y)t- H +(X) 1 il'/ i( Co........L- C I Then for any such pair (1'/, () we have i* 0 <1>8 = (z mod i*Qn(z, X), where b* denotes the homo- morphism defined by b*y=( _1) dimY by. For each pair (d, z), there is at least one associated operation <1>. Existence is proved by means of a tPostnikov system. Let <1> and <1>' be two operations associated with the same (d, z). Then they differ by a stable primary operation in the sense that there is an element <pECo/dC I such that <1>'8 = <1>8 + <p8 mod Q(z, X), where <p=LAaAcO,A moddC I means <p8=L A a A x A , For example, define the action of (2) on Z2 by the rule a' v = 0 if dega > 0 and I . v = v for each VEZ 2 . We consider a minimal projec- tive resolution 0<-Z2COtcCI f2c 2 <- '" ( 200 Homological Algebra). First we take Co=(2) and define 8(1)= 1. Next we take C I to be free over (2) with generators Cj, where O.s; i and d I c i == Sq2 i . Furthermore, we take C 2 to be free over (2) with generators Ci,j with O.s;i.s;j,j#i+ 1 and d2(Ci)=Sq2iCj+ L bi,j,kCk Ok<j with bi,j,kE(2), Here the d l Ci = Sq2i form a minimal set of generators of (2), and the equations 0== d l d 2 c i ,j== Sq2i 0 Sq2 i + LO""k<jb/, j,kSq2k form a minimal set of gen- erators over (2) of the Adem relations. Let C 1 (j) be the submodule over (2) generated by C k with O.s; k.s;j in C 1 , and let d 1 (j) be the restriction of d l on C 1 (j). We write zi.j=d 2 c i ,j. 
239 Let <IIi,j denote an operation associated with (dl(j),ZiJ Then <IIi,j is defined on the sub- module DAX) consisting of elements XE H"(X) such that Sq2'X=0 for O.s;k.s;j and takes values in H n + 2 '+2 j -1 (X) modulo the submodule Q7jX) = Sq2'H n + 2J-l (X) + '\' b.. H n + 2 '-1 ( X ) ,  t,j,k o k<j For example, <11 0 ,0: Ker [32-->Cok [32 is the generalized Bockstein homomorphism defined by b/2 2 and <Ill, 1 is the operation discovered by J. Adem that appears as the ttertiary ob- struction of sn for n? 2. If k? 3, we have the relation 2k+1 _ '\' .... Sq = L.., ai,j,k""i,j Oijk j"fi+1 ( mod L a i ,j,kQ7jX) ) . Oijk j#i+l These formulas can be applied to prove the nonexistence of an element with tHopf- invariant 1 in n 2n - 1 (sn) unless n = 1,2,4,8 (Adams [4J), Analogous formulas also hold for QI(p) with p > 2. We have no satisfactory theories concerning cohomology operations of orders higher than 2. For cohomology oper- ations in generalized cohomology theory  237 K-Theory. References [IJ N. E. Steenrod, Products of co cycles and extension of mappings, Ann. Math., (2) 48 (1947),290-320. [2] J.-P. Serre, Cohomologie modulo 2 des complexes d'Eilenberg-MacLane, Comment. Math. Helv., 27 (1953),198-232. [3J N. E. Steenrod and P. E. Thomas, Coho- mology operations derived from cyclic groups, Comment. Math, Helv., 32 (1957), 129-152, [4] J. F. Adams, On the non-existence of ele- ments of Hopf invariant one, Ann. Math., (2) 72 (1960),20-104. [5J N. E. Steenrod, Cohomology operations, Ann. Math. Studies, Princeton Univ. Press, 1962. [6] J. F. Adams, On the structure and applica- tions of the Steenrod algebra, Comment. Math. Helv., 32 (1958),180-214. [7] R. E. Mosher and M. C. Tangora, Coho- mology operations and applications in homo- topy theory, Harper, 1968. [8] E. H. Spanier, Algebraic topology, McGraw-Hili, 1966. [9J G. W. Whitehead, Elements of homotopy theory, Springer, 1978. 65 A Combinatorial Manifolds 65 (IX.17) Combinatorial Manifolds A. Introduction First of all, we explain some notions in geo- metric combinatorial topology, also called PL (piecewise linear) topology, in order to distin- guish between the usual topological view- point and the combinatorial (or PL) one. By an n-dimensional simplicial complex K we mean an n-dimensional, tlocally finite and trectilinear simplicial complex in the Euclidean N -space R N , i.e., an n-dimensional tEuclidean simplicial complex ( 70 Complexes). The subspace P=IKI ofR N covered by all sim- plexes of K is called an n-dimensional poly- hedron, and K is referred to as a simplicial division or simply a division of P. A poly- hedron covered by a tsubcomplex of a division of P is called a subpolyhedron of P. A poly- hedron contained in a polyhedron P is a sub- polyhedron of P if and only if it is a closed subset of P. An open subset of a polyhedron P is always a polyhedron but not necessarily a subpolyhedron of P [13]. For arbitrary divi- sions KI and K 2 of a polyhedron there is a common tsubdivision of KI and K 2 . Thus a property of a simplicial complex K which is invariant under subdivision is called a com- binatorial property of K or of a polyhedron IKI. Two polyhedra P and Q are said to be combinatorially equivalent if they have tisomor- phic divisions K and L, respectively. Then a homeomorphism f: P-->Q which is induced from an isomorphism f: K --> L is called a combinatorial equivalence. Since a combina- torial equivalence f: P-->Q is an isomorphism in the category of polyhedra and PL (tpiece- wise linear) mappings, (PL maps) which is called the PL category, it is also called a PL homeomorphism or a PL isomorphism, and P and Q are said to be PL homeomorphic or PL isomorphic. A ttopological space X which is homeomorphic with an n-dimensional polyhedron P is said to be trianguIable and is referred to as an n-dimensional topological polyhedron. Then a division K of P, or a homeomorphism t:P-->X, is called a triangula- tion of X. A division K of P is a triangulation of P. However, there is a compact polyhedron of dimension n (? 5) which has a triangulation Knot combinatorially equivalent to its divi- sion (  70 Complexes; for triangulation prob- lems on manifolds  Section C below). The notion of a simplicial complex or cell complex had been introduced as a tool to describe topological invariants of triangulable mani- folds (H. Poincare [IJ), The notion of an n- dimensional combinatorial manifold (i.e., a 
65 B Combinatorial Manifolds polyhedron localIy combinatorialIy equivalent to an n-dimensional simplex) was established as an important geometric object in com- binatorial topology by means of j, H, C. Whitehead's theory of regular neighborhoods [2, 1939, 1940]. The 1960s saw remarkable results in the study of combinatorial manifolds together with the topological study of dif- ferentiable manifolds, which gave rise to con- temporary combinatorial or PL topology [3,4]. The fundamental conjecture (Haupt- vermutung) for combinatorial manifolds and the combinatorial triangulation problem of manifolds were negatively solved by R. C. Kirby and L. C. Siebenmann [5,1969]. In the 1970s, important results about general trian- gulations were obtained by T. A. Chapman [6] R, D. Edwards, and 1. W. Cannon [7]. B. Pseudomanifolds and Homology Manifolds In this section we explain some topological properties of triangulable manifolds which can be defined by means of incidence relations among their triangulations. A tpolyhedron M = I K I is called an n-dimensional pseudomani- fold if K satisfies the following three con- ditions: (i) Every simplex of K is either an n- simplex or a face of an n-simplex; (ii) each (n - 1 )-simplex is a face of at most two n- simplexes; (iii) for any two n-simplexes 0", r of K, there exists a finite sequence of n-simplexes 0"=0"0,0"1' ''',O"s=r such that O"i and O"i+1 have an (n-l)-face in common, Consider the set S of (n - 1 )-simplexes of K, each of which is a face of only one n-simplex. Then the set of all 0" E S together with their faces forms a subcomplex L of K. The poly- hedron 1 L I of L is calIed the boundary of the pseudo manifold M = I K I and is denoted by aM as well as by aK = L. The boundary of a pseudomanifold is not necessarily a pseudomanifold. Let IKI be an n-dimensional polyhedron (or tceII complex). A point P in IKI is calIed a regular point of 1 K I if it has a neighborhood in IKI homeomorphic to an n-dimensional sim- plex; otherwise, P is called a singular point. A pseudomanifold without singular points is a topological manifold, called a triangulated manifold. An n-dimensional polyhedron IKI is a pseudomanifold if and only if the set of all regular points in I K I is dense and connected and the set of alI the singular points is of di- mension less than n -1. Thus a connected triangulated manifold is a pseudomanifold whose boundary coincides with the boundary as a topological manifold: Let r" r 2 be oriented n-simplexes of an n- dimensional pseudomanifold M = I K I. Suppose 240 that they have an (n-I)-face 0" in common. If the tincidence numbers of these simplexes with 0" satisfy the relation [r I, 0" J == - [r 2 , 0" J, then r 1 and r 2 are said to be coherently oriented. We call M = I K I with all its n-simplexes oriented an oriented pseudomanifold if any two n- simplexes r 1 and r2 are coherently oriented whenever they have an (n -I)-face in common. For an oriented pseudomanifold, the formal sum of all its oriented n-simplexes forms an integral cycle of the pair (M, aM), which is called the fundamental cycle of (M, aM). Its homology class, or fundamental class, gener- ates the homology group Hn(M,aM;Z)=Z. When M = I K I is nonorientable, the funda- mental class with coefficient Z2 (Z2 = Z/2Z) is similarly defined. A pseudomanifold is orient- able if and only if the topological manifold consisting of aII its regular points is orientable. If for each point p of a polyhedron M=IKI the local homology groups are Hi(p) = 0 (i # n) and Hn(p)=Z or 0, we call M = IKI an n- dimensional homology manifold, where Hi(p) = Hi(M, M - p; Z). If M is connected, it is also a pseudomanifold. The set aM of points p of M = I K I with Hn(P) = 0 is called the boundary of M and is an (n - I )-dimensional homology manifold without boundary. Let M = I K I be an n-dimensional homology manifold. Given simplexes 0", r of K, 0" < r means that 0" is a face ofr and 0" # r. By defi- nition, a vertex of the tbarycentric subdivision Sd K of K is the barycenter of a simplex of K, and a k-simplex r of Sd K is defined to be 18 0 8 1 ", 8 k l, where 0"0,0"1'''', O"k are simplexes of K such that 0"0 < 0" 1 < .., < O"k and 8 i is the barycenter of O"i' In the following we denote Sd K by K'. For a q-simplex 0" of K, we de- note by K(O") the union of alI (n-q)-simplexes 188 1 ", 8n- q l of K' such that 0" < 0"1 < .., < CJ n - q . Then K(O") is an (n-q)-dimensional homology manifold and pseudomanifold contained in M. K(O") is called the (n-q)-dual cell or simply the dual of 0". Let Ka be the subcomplex of K' consisting of simplexes of K' contained in K(O"). Thus we have IKffl = IK(O")I. We denote by (K(O")j' the union of simplexes of Ka which do not intersect with 0". The set of dual celIs K*={K(O")IO"EK} of simplexes of K is called the dual complex of K; and K* is also called the dual subdivision of I K I. K* satisfies the following conditions: (i) UaEKK(O")=IKI. Each K(O") is the tjoin 8 * (K(O")j' of the barycenter 8 and (K(O"))', (ii) 0" <r = (K(O")j' =:> K(r) (0", rE K), (K(CJ)j' = Ua<'.KK(r). (iii) If K (0") n K (r) # 0, then there exist sim- plexes of K with 0" and r as faces, and for the least simplex p of K with 0" and r as faces, it holds that K(O")nK(r)=K(p). The boundary a(K(O")) of the (n-q)- 
241 dimensional homology manifold K(O") satisfies 8(K (0")) = (K(O"))' U «8K)(0")), where (8K) (0") denotes the dual cell of 0" in 8K, and we under- stand that(8K)(0") = 0 if O"rf 8K. K* U (8K)* is called the dual complex of (K, 8 K) and is de- noted by (K,8K)*. For an r-simplex 0" (O.s;r.s; n - 1), it holds that Hj«K(O"))', (8K(0"))'; Z) =0, i #n - r -1, H n -'-1 «K(O"))', (8K(0"))'; Z) Z. Let I K* I q be the union of i-d ual cells of (K,8K)*=K*U(8K)* such that i.s;q. Let Cq(K*)= Hq(IK*l q IIK*lq-l), and let 8q: Cq(K*) -->Cq_I(K*) be the tconnecting homomorphism 8*: Hq(1 K*lqll K*lq-I)-->Hq_l (I K*lq-l II K*l q - 2 ) in the treduced homology exact sequence of (IK*lq, IK*lq-l, IK*lq-2) ( 201 Homol- ogy Theory F). Then Cq(K*) is a free Abe- lian group generated by the q-dual cells of (K, 8K)*, and C(K*) = {Cq(K*), 8q} is a tchain complex. It follows from the property of H*«K(O")j', (8K(0")j'; Z) as above that Hq(C(K*)) = Hq(IK I) as in the case of a tCw complex. Now suppose that K is oriented, For a q- simplex 0" = la o a 1 ... aql we give an torientation [a o , aI' ... , a q ]. Furthermore, we choose a q + 1 , a q + 2 , ..., an so that [a o , aI, ..., anJ is the orien- tation of laoa l ... ani induced from the orien- tation of K, and we give K(O") the orientation determined by [6,61' ...,8n-qJ, where O"i= laoa l ... aq+il. We define 0"' K(O") = 1 for 0" and K(O") thus oriented, and r' K (0") =0 if r #0". Let c=LAjO"iECq(K) and c*=LJ.l.jK(O"). Then we define C' c* EZ by c. c* == Li,j AiJ.l.j(O"i' K (0")). The integer c. c* is called the intersection number of c and C*. For homology classes [zJ EHq(K) and [z*J EHn_q(K*), the intersec- tion number z' z* is independent of the choice of representing cycles, and thus we define the intersection number [zJ' [z*J = z' z*. When X is compact, the isomorphism of the Poincare-Lefschetz duality theorem Hq(X,8X;R)H"-q(X;R) is given in terms of intersection numbers. C. Combinatorial Manifolds and PL Manifolds A polyhedron which is PL homeomorphic with a k-dimensional simplex or its boundary is called a PL k-ball or a PL (k-l )-sphere and is denoted by B\ Sk-I, respectively. A simplicial complex K or a polyhedron M = I K I is called an n-dimensional combinatorial manifold if the tstar of each vertex in K is a PL k-ball. If M is an n-dimensional combinatorial mani- fold, then so is an open subset of M. An n- dimensional PL manifold is a topological manifold X with a distinguished maximal coordinate system n = {( V;, h;)}, called a PL 65C Combinatorial Manifolds structure on X, which consists of homeomor- phisms hi: V i --> V. from open subsets Vi onto open subsets V. of an n-dimensional simplex such that h j oh i - 1 :hi(Vin )-->h/Vin V j ) are PL homeomorphisms, For a PL manifold (X,n) there is a triangulation t:M-->X from a combinatorial manifold M such that for each (Vj,h;)En, h i otlt- 1 (V;) is a PL homeomor- phism from an open subset t- 1 (V,) of M onto h i ( V,). Thus the notions of combinatorial mani- fold and PL manifold are essentially the same. For a tsmooth manifold (X, 0"), there is a unique PL homeomorphism class of a com- binatorial manifold by means of a tsmooth triangulation, In this sense, smooth manifolds are combinatorial manifolds; but the converse of this is not true ( 114 Differential T opol- ogy q, A triangulation of a topological mani- fold which is a combinatorial manifold is called a combinatorial triangulation. The trian- gulation problem for n-dimensional topolog- ical manifolds, especially the existence and uniqueness problem of their combinatorial triangulations, a long-standing problem, is stated as follows, CT n (Combinatorial Triangu- lation Problem): Is every n-dimensional topo- logical manifold homeomorphic with a com- binatorial manifold? CH n (Hauptvermutung for combinatorial manifolds): Homeomorphic n- dimensional combinatorial manifolds should be PL homeomorphic. When n.s; 3, CT n and CH n hold; this was shown for n = 2 by T. Rad6 [15J and for n == 3 by E. E. Moise [16]. More- over, in this case, topological manifolds admit unique smooth structures, and homology manifolds are combinatorial manifolds. For n? 5, tsurgery theory was used and developed as an obstruction theory for "homotoping" a homeomorphism h: M -->M' between n- dimensional combinatorial manifolds to a PL homeomorphism by D. Sullivan (1967). By the discovery of the torus-unfurling method for "isotoping" h: M -->M' to a PL homeomor- phism, Kirby reduced CT n and CH n for n? 5 to CH n for some specific manifolds, such as thickened tori modulo the boundary, for which surgery methods had been sufficiently developed (for isotopy  Section D). The solutions (1)-(9) can be stated as follows. (1) Classification of combinatorial trian- gulations. Assume n? 5. Existence: An n- dimensional closed topological manifold X admits a combinatorial triangulation if and only if an obstruction (X) E H 4 (X; Z2) van- ishes. Uniqueness: A homeomorphism h: M--> M' between n-dimensional closed combina- torial manifolds is isotopic ( Section D) to a PL homeomorphism if and only if an obstruc- tion (h)EH3(X;Z2) vanishes (Kirby and Siebenmann [5J). In fact, for each n? 5, there is an n- 
65C Combinatorial Manifolds dimensional combinatorial manifold which is homeomorphic to a torus Sl x ... X Sl but is not PL homeomorphic, and there is an n- dimensional closed manifold which is not homeomorphic to a combinatorial manifold (Siebenmann [18]). As a special case, the following conjecture is proved for n;3 5, (2) The n-dimensional annulus conjecture. The closed region bounded by two mutually disjoint tlocally flat (n - 1 )-dimensional topo- logical spheres SI and S2 in an n-dimensional sphere sn is homeomorphic with an n- dimensional annulus sn-l X [0, 1] (Kirby [19]). Regarding the uniqueness problem for (not necessarily combinatorial) triangulations of topological manifolds, we mention the follow- ing deep result. (3) Double suspension theorem for homology spheres. The double suspensions of homology n-spheres are homeomorphic with sn+2, where a homology n-sphere is a closed topological manifold with the same homology groups as those of sn (Cannon [7]). For each n;3 3, there is a homology n-sphere pn which is a combinatorial manifold with the nontrivial tfundamenta1 group (for n = 3, see Poincare [1,1904] and for n;34, see M. Newman, 1948). The suspension of pn is an example of an (n + I )-dimensional homology manifold which is not a topological manifold. The double suspension of P" is not a com- bmatorial manifold but is homeomorphic with sn+2. The problem of whether every n- dimensional topological manifold is homeo- morphic to a polyhedron is still open for n;3 4. This problem is reduced to a problem in homology 3-spheres (T. Matumoto; D. E. Galewski and R. J. Stern). In the following we list some results on combinatorial manifolds focusing on White- head's theory of regular neighborhoods [2]. Let M = IKI be an n-dimensional combina- torial manifold without boundary, The dual complex K* consists of PL balls. Thus the division K of M gives rise to a decomposition into PL thandles H(K) of M in such a way that for each k-dimensiona1 simplex O"E K, the star StK',(6-) of its barycenter 6- in the sec- ond barycentric subdivision K" of K is a PL handle of index k. If L is a subcomplex of K or K*, then the star StK"(ILI) of ILl in K" is a subhandlebody of H(K) or of its dual han- dlebody H*(K)= H(K*), denoted by HK(L) or HdL), respectively. In particular, if M is closed, orientable, and of dimension 3, then U = HK(K(I») and V = HK*(K(l)) are PL homeomorphic with a PL boundary- connected sum of some copies of a solid torus Sl x B 2 , and M = UU V, un V=cu =av, where K(k) and K) stand for the k-tskeletons 242 of K and K*, respectively. This decomposition (U, V; au =av) of M is called a Heegaard decomposition (or splitting) of M [12]. We say that a simplicial complex L collapses elemen- tarily to a subcomplex Ll of L, referred to as an elementary collapse L " L I' if L - L I = {O", r}, where 0" is a k-dimensional simplex of L and r is a (k -I)-dimensional face of 0" which is not a face of any simplex in L - {O", r}. We say that a polyhedron P collapses to a subpoly- hedron Q of P or Q expands to P, referred to as a collapse P " Q or expansion Q 7' P, if there is a division L of P containing a subcomp1ex L, dividing Q so that there is a seq uence of elementary collapsings L "Ll " ... "L,. For a subpolyhedron P of a combinatorial manifold M, a subpolyhedron U of M is called a regular neighborhood of Pin M if (1) U is a combina- torial manifold which is a closed neighbor- hood of P in M and (2) U" P. In general, for a subcomplex L of a simplicial complex K, the tstar StK"(ILI) of ILl in the second barycentric subdivision K" of K is called a second barycen- tric derived neighborhood or simply a derived neighborhood of ILl in IKI (E. C. Zeeman [4J), (4) Regular neighborhood theorem. Let P be a compact subpolyhedron of a combinatorial manifold M, Existence: Every derived neigh- borhood of pin M is a regular neighborhood. Uniqueness: Any two regular neighborhoods of P in Mare PL homeomorphic by a map- ping keeping P pointwise fixed [2, 1939]. On one hand, this theorem is regarded as a com- binatorial counterpart to the Hubular neigh- borhood theorem of differential topology. For a topological manifold X with boundary ax, there are a neighborhood U of ax in X and a homeomorphism h: ax x [0, 1] --> U, called a collar of ax in X, such that h(x,O)=x (XEaX). In particular, for a combinatorial manifold M = X we can take U as a derived neighbor- hood of aM in M and h as a PL homeomor- phism, called a PL collar of aM in M. (For more about normal bundle theory  147 Fiber Bundles Q). On the other hand, it is also regarded as a combinatorial deforma- tion theorem of a hand1ebody decomposition HK,,(L) of a derived neighborhood StK,,(ILI). Two polyhedra P and Q are said to be simple homotopy equivalent if Q is obtained from P by a finite sequence of collapsings and expansions. A composite mapping f: P -->Q of the inclusion mappings and deformation re- tractions associated naturally to the expan- sions and the collapsings from P to Q is called a simple homotopy equivalence. Moreover, if P and Q are subpolyhedra of a combinatorial manifold M and those collapsings and expan- sions take place in M, then their regular neigh- borhoods are PL homeomorphic [2, 1940]. (5) A simply homotopy equivalence is clear- 
243 ly a homotopy equivalence. Conversely, we have the simple homotopy theorem: For the homotopy class of a homotopy equivalence f: P-->Q between compact connected poly- hedra P and Q, there is a well-defined element W(f), called the Whitehead torsion off, in the tWhitehead group Wh( G) of the fundamental group G = n l (P) such that f is homotopic to a simple homotopy equivalence if and only if W(/)=O, ([2, 1950J;  also J, W. Milnor [20J ). Let W be a compact combinatorial mani- fold of dimension n + 1. If 8W consists of two connected components M and N such that the inclusion mappings M c. Wand N c. Ware simple homotopy equivalences, then (W; M, N) is called an s-cobordism of dimension n + 1, The following theorem of D. Barden, B. Mazur, and J, Stallings is the nonsimply connected PL version of Smale's th-cobordism theorem. (6) s-Cobordism theorem, For n;3 5, an s- cobordism (W; M, N) of dimension n + 1 is PL homeomorphic with (M x [0, 1J; M x 0, M xl) (B. Mazur [21J; M. Kervaire [22J; C. Weber [23J; J, F, p, Hudson [8J). (7) Topological invariance of simple homo- topy types, Two homeomorphic compact polyhedra are simple homotopy equivalent (T. A, Chapman [6J). The famous Poincare conjecture that a sim- ply connected closed manifold of dimension 3 is homeomorphic to a 3-sphere is still unsolved despite much effort by many mathematicians. This conjecture is generalized for dimension n as follows. (8) Generalized Poincare conjecture in dimen- sion n, Any homotopy n-sphere In is homeo- morphic to an n-sphere, where a homotopy n-sphere is defined to be an n-dimensional topological manifold homotopy equivalent to an n-sphere. This has been affirmatively answered for n;3 5, The first proof was given by S. Smale ( 114 Differential Topology F, K). From the viewpoints of collapsing and general position ( Section D), 1. Stallings [3, I 960J and E. C. Zeeman [11, 198-204J proved the following. (9) Engulfing lemma. For a compact poly- hedron P of dimension m in the interior M = M - aM of an n-dimensional combinatorial manifold M, there is a PL n-ball in M contain- ing P in its interior (i,e" engulfing P), provided that M is m-connected (i.e" ni(M)=O (i.s;m)) and n - m;3 3. (For the topological version  M. H. A. Newman [24].) If a combinatorial manifold M = I K I is a homotopy n-sphere (n;3 5), then by the engulfing lemma, there are PL n-balls Bl and B 2 engulfing HK"(K(n-3») and H K ,,(K(2)) so that 8 1 U 8 2 = M, By the generalized Schoenflies theorem ( Section G), B 2 - 8 1 = M - 8 1 is homeomorphic to 65D Combinatorial Manifolds an n-balL It follows that by the Alexander trick ( Section D) M is homeomorphic to an n-sphere. Since CH n holds for sn (n;3 5), the homotopy PL n-sphere M is actually PL homeomorphic with sn. D. Embeddings of Combinatorial Manifolds An injective PL mapping f: P-->Q between polyhedra is called a PL embedding if f(P) is a subpolyhedron of Q ( Section A). Namely, a PL embeddingf:P-->Q is a PL mapping which is induced from a simplicial mJection f: K --> L, called a division of I, for some divi- sions K and L of P and Q, respectively. In the following we explain PL embeddings from an m-dimensional combinatorial manifold Minto an n-dimensional combinatorial manifold N, For a PL embeddingf:M -->N, c=n-m is called the codimension of f If f-I(8N)=8M, f is said to be proper. Since an injective PL mapping f: M --> N is a PL embedding if and only if it is proper as a mapping between topo- logical spaces M and N (i,e" the preimages of compact sets are compact), this term "proper" for PL embeddings should not be confused with the term "proper" for mappings. Two PL embed dings f: M --> N and 1': M' --> N' are said to be (PL) equivalent if there is a PL homeo- morphism h: N -->N', called an equivalence from f to 1', such that h 0 f(M) = I'(M'). When M = M', f and I' are said to be (PL) isomor- phic if there is an equivalence h: N --> N' from f to I' such that h 0 f = f'. A submanifold of N is defined as a subpolyhedron of N which is a combinatorial manifold. Thus the equiv- alence class of a PL embedding f: M --> N is nothing but the PL homeomorphism class of a pair (NJ(M) of N and its submanifold f(M). For a submanifold of N, the same terms as those for the inclusion mapping are used, For every division f: K --> L of a (proper) PL em- bedding f: M --> N and for x E M we have a (proper) PL embedding [I StK(x) from a PL m- ball StK(x) into a PL n-ball StL(/(x)) which represents the tgerm of fat XEM. A PL em- bedding of a PL m-sphere into a PL n-sphere is called a PL (n, m)-knot or (n, m)-sphere pair and a proper PL embedding of a PL m-ball into a PL n-ball is called a PL (n, m)-ball knot or (n, m)-ball pair. A PL (n, m)-knot (a PL (n, m)-ball knot) is said to be trivial or un- knotted if it is equivalent to the inclusion mapping 8r+ l x Oc. 8r+1(r c. In), where 1= [ -1, 1]. Then we have the following state- ments (1)-(5). (1) Zeeman's unknotting theorem. Every PL (n, m)-knot and every PL (n, m)-ball knot are unknotted, provided that n - m;3 3. In general, an injection g: X --> Y between 
65 E Combinatorial Manifolds topological manifolds is said to be locally flat if for each point XEX, there is a neighborhood U of x in X so that the local germ of f at x is equivalent to the inclusion mapping U x 0 c. U x I C . Then f is necessarily proper, i,e.,f-I (DY) =DX. A PL embeddingf:M-->N is said to be locally flat if it is locally flat in the PL cate- gory. Namely, fl StK(x) is equivalent to r x o c.1". Conversely, Zeeman's unknotting theo- rem implies the following, (2) Every proper PL embedding of codimen- sion ;3 3 is locally flat. Since every locally flat PL embedding f: M -->N admits a normal tblock bundle, the classi- fication of F: M --> U up to isomorphism is reduced to the classification of block bundles ( 147 Fiber Bundles Q), where U is a derived neighborhood of f(M) in N. (For the general classification of locally flat PL embeddings by surgical methods  e. T. e. Wall [25, Corol- lary 11.3.1 J). In the codimension 2 case, there exist non- trivial knots (the classical knots) ( 235 Knot Theory) and there is a proper PL submanifold which is not locally flat. For example, a com- plex projective curve V defined by x n + yn-l Z = 0 can be regarded as a PL 2-sphere in the complex projective plane Cp2, and for the inclusion mapping i: Vc.Cp2, at 0=(0:0: 1), i I Stv(O) is PL equivalent to a cone of a torus knot k:S I c.S 3 of type (n,n-l), For further results in higher-dimensional knot theory ( 235 Knot Theory) we mention the works of M. Kato and Y. Matsumoto [26] and S. Capell and 1. Shaneson [27]. Here we consider only proper PL embed- dings of a fixed m-dimensional combinato- rial manifold M into a fixed connected n- dimensional combinatorial manifold N, so that a codimension 0 PL embedding is a PL homeomorphism, Two PL embeddings fo, fl : M --> N are said to be pseudoisotopic if there is a proper PL embedding F: M x [0, 1] --> N x [0, 1], called a pseudoisotopy from fo to fl> such that F(x, 0) = (/0 (x), 0) and F(x,I)= (/1 (x), 1) (xEM). Moreover, if F satisfies the level- preserving condition F( M x { t} ) c: N x {t} (O.s; t.s; 1), then fo and fl are said to be isotopic, and F or a homotopy {f,} defined by F(x, t) = (f,(x), t) (x, t)EM x [0, IJ) is called an isotopy from fo to fl' When M = N, a PL homeomor- phism which is isotopic to the identity, or the isotopy of the identity itself, is called an am- bient isotopy of N. Two PL embeddings from M to N are said to be ambient isotopic if they are isomorphic by an ambient isotopy of N. Since I k (I = [ -1, 1]) is a k-dimensional tconvex cell and is a cone 0 * Dl k of Dl k from the center 0, each point x of I k - {O} can be written uniquely as x = t. u for 0 < t.s; 1 and UEt7I k , Thus a PL embedding f:ar-->al" can 244 be extended to a PL embedding F: Im-->I n , called a cone extension or simply a cone of I, by setting F(O) = 0 and F(t, u) = t. f(u) for t. u El m - {O}. This method of cone extension is often referred to as Alexander's trick. By using Alexander's trick and the uniqueness of regu- lar neighborhoods, one can show without much difficulty that (i) the isotopy classes of PL homeomorphisms of PL spheres or PL balls onto themselves are classified by their tdegrees (= i: 1) (V. K. A. M. Gugenheim, 1953), and (ii) every PL (n, m)-knot or PL (n, m)-ball knot is equivalent to the standard one aIm+! x 0 c. al"+1 or r x 0 c. I" up to iso- topy, provided that n - m;3 1. This is why we define the notion of ambient isotopy in a dif- ferent way from the differentiable category, where the smooth isotopy is always covered by the ambient isotopy. However, for codimen- sion ;3 3, because of Zeeman's unknotting theorem, the following holds, (3) The pseudoisotopy class, the isotopy class, and the ambient isotopy class of a PL embedding of codimension ;3 3 are the same (J. F. P. Hudson and E. e. Zeeman, 1964). In order to construct a PL embedding from a continuous mapping within its homotopy class, the notion of general position is useful. A PL mapping <p: M --> N is said to be in gen- eral position if dim <p -I (y).s; 0 (y EN) and dimS(<p).s;2m-n, where S(<p) is the closure of {xEMI<p-I(<p(x))#{x}} in M (1. F. P. Hudson [8]), (4) General position theorem. Every con- tinuous mapping 1jJ: M --> N can be approxi- mated by a PL mapping <p : M --> N which is in general position. The following theorem can be proved by sharpening the general position theorem and the engulfing lemma. (5) Irwin's embedding theorem. Let IjJ:M-->N be a continuous mapping from a compact combinatorial manifold of dimension m into a combinatorial manifold of dimension n such that IjJIDM is a PL embedding from aM into aN. Assume that (i) M is d-connected, (ii) N is (d + 1 )-connected, and (iii) n - m;3 3, where d = 2m - n. Then 1jJ: M --> N is homotopic to a proper PL embedding f: M -->N relative to aM. Moreover, if M and N are closed and if (i*) M is (d+ I)-connected and (ii*) N is (d +2)- connected, then two homotopic PL embed- dings from Minto N are ambient isotopic (M. e. Irwin, 1965). E. 3-ManifoIds There has been a great deal of research done on 3-manifolds, and we have, among others, the following results (1 )-(4). 
245 (1) Sphere theorem, Let M be an orientable 3-manifold with n 2 (M) # O. Then there exists a PL embedding f of a 2-sphere into M such that f is not homotopic to 0 in M (c. Papa- kyriakopoulos [28J). (2) Dehn's lemma. Let D be a 2-cell with self-intersections in a 3-manifold M, with its boundary a simple polygon C. If some neigh- borhood U of C in D has no singularities, then there exists a 2-ceII in M without singularities whose boundary is C. In 1910, M. Dehn as- serted this lemma, but his proof was incom- plete. In 1957, Papakyriakopoulos and T. Homma proved the lemma independently [28,29]. (3) Loop theorem. Let N be the boundary of a 3-manifold M and U be an open set in a component of N. If there is a closed curve in U which is homotopic to 0 in M but not in N, then there exists a simple closed curve in U homotopic to 0 in M but not in N (Papakyriakopoulos [30], J, Stallings [31]). (4) Unique decomposition theorem for a 3- manifold (J. W. Milnor [32], H. Kneser), We assume that all manifolds are connected, oriented, and triangulated 3-manifolds without boundaries and that all homeomorphisms are piecewise linear. Two manifolds M, M' are said to be isomorphic (M  M') if there exists an orientation-preserving homeomorphism of M onto M'. Removing an open 3-ceII from each of two 3-manifolds M, M' and identifying the boundaries of these removed celIs, we obtain a 3-manifold M # M', called the con- nected sum of M and M', A manifold that is not isomorphic to the 3-sphere S3 is calIed nontrivial. A nontrivial manifold P is called prime if P cannot be decomposed as P = M l # M 2 with M l and M 2 both nontrivial. A manifold M is calIed irreducible if every 2-sphere in M bounds a 3-cell. Then from the sphere theorem the following results can be deduced: If a compact 3-manifold Mis nontrivial, then M is isomorphic to a con- nected sum PI # P 2 # '" # P k of prime mani- folds Pi (i= 1,2,..., k), where PI' P 2 ,..., P k are determined uniquely up to order. Every irreducible 3-manifold M has n2(M)=0. Con- versely, if a 3-manifold M has n2(M) =0, then M is irreducible, provided that the Poincare conjecture is correct. (For further results on 3-manifolds  F. Waldhausen [10] and 1. Hempel [12].) Irreducible sufficiently large 3-manifolds [10] can further be decomposed into Seifert fibered spaces and "simple manifolds" which contain no essential tori (W. Jaco and P. B. Shalen, K. Johannson). Existence of hyperbolic structures on "simple manifolds" has been extensively studied by W. Thurston. 65G Combinatorial Manifolds F. Wild Spaces Nonclosed 3-manifolds behave very differently from closed 3-manifolds. For example, there exists an open 3-manifold U that has the same thomotopy type as an open 3-cell but is not homeomorphic to it (M. H. A. Newman and J. H. C. Whitehead, 1937). The construction of this and similar examples generaIly involves infinite processes. Such examples, which do not exhibit the properties we expect, are com- monly termed pathological (or wild) spaces. For the manifold U above, the product U x R 1 is found to be homeomorphic to the Euclidean 4-space R 4 . R. Moore proved that the quotient space of the Euclidean 2-space R 2 by any upper semicontinuous decomposition consist- ing of continua not separating R 2 is homeo- morphic to R 2 . On the other hand, R. H. Bing (1959) gave an example of a similar upper semi- continuous decomposition ofR3 for which the quotient space B is not homeomorphic to R 3 and is not even a manifold. This quotient space has the property B x R 1  R4. Bing (1957) also proved that at most countably many copies of a wild closed surface can simultaneously be embedded in R 3 . On the other hand, Stallings (1960) proved that there exists a wild disk of which a set of copies with the power of the continuum can simulta- neously be embedded in R 3 [11]. The study of the quotient space of n- dimensional manifolds by cell-like decompo- sitions has been fully generalized by Edwards and Cannon [7] for n;3 5 to prove the double suspension theorem of homology spheres ( Section C (3)). G. The Schoenflies Theorem In 1906, A. M. Schoenflies proved that every tsimple closed curve in the plane is the bound- ary of a 2-cell. This result is sharper than the tJordan curve theorem and is called the Schoenflies theorem. However there was a gap in the proof, and the complete proof was given by L. E. J. Brouwer in 1910. The folIowing higher-dimensional analog of the above theorem is called the Schoenflies problem: Is every (n - 1 )-sphere embedded in the n-dimensional Euclidean space R n (n;3; 3) the boundary of an embedded n-disk? For the case where the embedding is topo- logical, L. Antoine constructed a counter- example, called Antoine's necklace, for n = 3 in 1921 and solved this problem negatively. Also, 1. W. Alexander constructed another counter- example, calIed Alexander's horned sphere, in 1924 (Fig. 1). A similar example was given by R. H. Fox and E, Artin in 1948 by making use 
65 Ref. Combinatorial Manifolds ofknot theory (Fig. 2). The study of wild em- beddings began with these examples, and many interesting results were obtained (Bing). For the case where the embedding is PL or smooth, the Schoenflies problem has been solved affirmatively for n = 3 (Alexander). M. Brown (1960) and B. Mazur (1959) proved the foIlowing: Let h:S n - 1 x [-1, IJ-->sn be a topological embedding; then sn - h(sn-t x {O}) consists of connected open sets D+ and D_ of sn, and the closures of D+ and D_ are both homeomorphic to the n-disk. This result solves the Schoenflies problem affirmatively for n;3 5 when the em bedding is PL and lo- cally flat or smooth. " Fig. 1 Fig. 2 References [IJ H. Poincare, Analysis situs, J. Ecole Poly- tech., 1 (1895), 1-121; Cinquieme complement a I'analysis situs, Rend. Circ. Mat. Palermo, 18 (1904),45-110. [2J 1. H. C. Whitehead, Simplicial spaces, nuclei and m-groups, Proc. London Math. Soc., 45 (1939), 243-327; On the homotopy type of manifolds, Ann. Math" (2) 41 (1940), 825-832; Simple homotopy types, Amer. J. Math., 72 (1950),1-57. [3J 1. Stallings, Polyhedral homotopy spheres, Bull. Amer. Math. Soc., 66 (1960), 485-488; Notes on polyhedral topology, Tata Inst., 1967. 246 [4J E. C. Zeeman, Unknotting combinatorial balls, Ann. Math., (2) 78 (1963), 501-526; Seminars on combinatorial topology, Publ. Math. Inst. HES and Univ, of Warwick (Con- ventry), 1963-1966. [5J R. C. Kirby and L. C. Siebenmann, On the triangulation of manifolds and the Haupt- vermutung, Bull. Amer. Math. Soc., 75 (1969), 742-749; Foundational essays on topological manifolds, smoothings and triangulations, Ann. Math. Studies 88, Princeton Univ. Press, 1977. [6J T. A. Chapman, Topological in variance of Whitehead torsion, Amer. J, Math., 96 (1974), 488-497. [7J 1. W, Cannon, Shrinking cell-like decom- positions of manifolds. Codimension three, Ann. Math., (2) 110 (1979),83-112. [8J 1. F. P. Hudson, Piecewise linear topology, Benjamin, 1969. [9J C. P. Rourke and B. J, Sanderson, Intro- duction to piecewise linear topology, Erg. Math., 69 (1972). [IOJ F. Waldhausen, On irreducible 3-mani- folds which are sufficiently large, Ann. Math., (2) 87 (1968), 56-88, Amer. Math. Soc. Proc. Symposia in Pure Math., vol. 32, part 2, 1978. [11] M. K. Fort, Jr. (ed.), Topology of 3-mani- folds and related topics, Prentice-HaIl, 1962. [12] J. Hempel, 3-manifolds, Ann. Math. Study 86, Princeton Univ. Press, 1976. [13J P. Alexandroff and H. Hopf, Topologie I, Springer, 1935. [14J H. Seifert and W. Threlfall, Lehrbuch der Topologie, Teubner, 1934, [15J T. Rado, Ueber den Begriff der Riemann- schen Fliiche, Acta Litt. Sci. Szeged, 2 (1925), 101-121. [16J E. E. Moise, Affine structures on 3-mani- folds, Ann. Math., (2) 56 (1952), 96-114. [17J D. Sullivan, On the Hauptvermutung for manifolds, Bull. Amer. Math. Soc., 73 (1967), 698- 700. [18J L. C. Siebenmann, Topological mani- folds, Proc. Int. Congress Math. (Nice, 1970), vol. 2, 133-163. [19J R. C. Kirby, Stable homeomorphisms and the annulus conjecture, Ann. Math., (2) 89 (1969), 575-582. [20J 1. W. Milnor, Whitehead torsion, Bull. Amer. Math, Soc., 72 (1966), 358-426. [21] B. Mazur, Differential topology from the viewpoint of simple homotopy theory, Pub!. Math, Inst. HES, 15 (1963), 5-93, [22J M. Kervaire, Le Theorem de Barden- Mazur-Stallings, Comment. Math. Helv., 40 (1965),31-42, [23J C. Weber, L'elimination des points dou- bles dans Ie cas combinatoire, Comment. Math, Helv., 41 (1966), 179-182. 
247 [24J M. H. A. Newman, The engulfing theo- rem for topological manifolds, Ann. Math., (2) 84 (1966), 555-571. [25J C. T. C. Wall, Surgery on compact mani- folds, Academic Press, 1970. [26J M. Kato and Y. Matsumoto, Simply connected surgery of submanifolds in codi- mension two, J. Math. Soc. Japan, 24 (1972), 586-608. [27] S. Cappell and 1. Shaneson, The codi- mension two placement problem and ho- mology equivalent manifolds, Ann. Math., (2) 99 (1974), 277-348. [28J C. Papakyriakopoulos, On Dehn's lemma and the asphericity of knots, Ann. Math., (2) 66 (1957), 1-26. [29J T. Homma, On Dehn's lemma for S3, Yokohama Math. 1., 5 (1957), 223-244. [30] C. Papakyriakopoulos, On solid tori, Proc. London Math. Soc., 7 (1957), 281-299. [31 J 1. Stallings, On the loop theorem, Ann. Math., (2) 72 (1960),12-19. [32J 1. W. Milnor, A unique factorization theorem for 3-manifolds, Amer. J. Math., 84 (1962), 1-7. [33J W. Thurston, The geometry and topology of three-manifolds, Princeton Univ. Press, 1982. 66 (XVI.1 0) Combinatorics A. General Review of Combinatorics Combinatorics is concerned mainly with prob- lems of discrete sets, such as the enumeration of subsets satisfying certain conditions, or their actual construction and the selection of an optimal subset with respect to a suitable crite- rion. Generally speaking, both the theoretical analysis and the construction of discrete sets are much more difficult than those problems in analysis concerning infinite sets with suitable topologies, The main emphasis and the name of this field have changed from time to time and from person to person, Other names such as com- binatorial analysis, combinatorial theory, or combinatorial mathematics are commonly used and mean almost the same thing. Recently, the expression "discrete mathematics" has also been used to describe the same field. In many cases a problem originally conceived as number-theoretic, algebraic, analytic, or geo- metric has eventually turned out to be com- binatorial ( 102 Design of Experiments, 241 Latin Squares). 66C Combinatorics tGraph theory, originally a combinatorial study of I-dimensional topological tcomplexes, has developed into an important branch of mathematics. In this article we discuss various topics not yet considered to be established branches or organized fields of mathematics. The topics treated under the title "combinatorics" are at present not unified, and consist of a mix- ture of isolated techniques. However, recent progress with electronic computers is play- ing an important role in the solution of various combinatorial problems arising from large systems, and the study of combinatorics is now quite active both in theoretical develop- ment and application. We now have special journals in this field, such as the Journal of Combinatorial Theory (Academic Press), Com- binatorica (Elsevier), the European Journal of Combinatorics (Academic Press), Discrete Mathematics (North-Holland), and Discrete Applied Mathematics (North-Holland). B. Three Fundamental Principles The most fundamental among the various principles in combinatorics are the following three principles for enumeration problems. Let Q be a finite set and let IAI denote the number of elements in a subset A of Q. (1) Rule of sums: If A n B= 0, then IA + BI = IAI+IBI. (2) Rule of products: I A x BI = I A I x I BI, where A x B is the tdirect product of the two sets A and B. (3) Principle of inclusion and exclusion: IA I UA 2 U...UA n l =LlAd-LIAinAjl+ L IAinAjnAkl l i<j i<j<k -... +( -l)n-IIA! n A 2 n... n Ani. These principles can be generalized as state- ments concerning various measures. C. Mobius Inversion Formula Let P be an ordered set whose arbitrary inter- val is a finite set. Let [P x PJ = {(x, Y)E P X P I x.s; y} be the set of intervals of P. A func- tion satisfying the following two conditions is determined uniquely and is called the Mobius function over P: (i) j.t(x,x)= 1 for all XEP; (ii) Lz x<;z,;;y j.t(x, z) = 0 for all x, YEP with x < y and x# y. When K is a field, for a mapping f: [P x PJ --> K, we can define another mapping g: [P x 
66D Combinatorics PJ-->K by g(x, y) = L f(x, z). ZE[X.Y] Then we have the Mobius inversion formula f(x,y)= L g(x,z)J.l(z,y). ZE[X,y] This formula implies, as its special cases, many important formulas, such as the tMobius in- version formula in number theory, the prin- ciple of inclusion and exclusion, and the in- verse relation of difference and summation. D. Some Special Sequences In the enumeration problem, many important sequences {an} are given by the tgenerating function f(x) = LanX n . Typical examples are as follows: (1) tBinomial coefficients: (1 +xr= Ln(':)X n . (2) Multinomial coefficients: (x I + .., +X,t=L(n 1 m nJxI...x;'. (3) Multiple combi- nation an = mHn = (m+-I): (1 - xfm = Ln(mHn)xn. (4) Bell numbers: exp(e X -1) = L Bnxnln! (5) The Stirling number of the second kind S;:' is the total number of bijections from N n to N m divided by m!, where N n ={1,2, ".,n}. This is equivalent to the total number of partitions of N n into m nonempty blocks: (e x -l) m lm! = LmS;:'xnln!. In general, when the generating function f(x) is an analytic function of one or several variables as above, we can obtain recurrence formulas for the sequence from the functional equations or the differential equations which f(x) satisfies; or we can get asymptotic esti- mates by using the tsaddle point method. E. P6lya's Enumeration Theorem Let V be a finite set, V be a (finite or infinite) set, G be a transformation group operating on V, and w be a function assigning to each ele- ment of Va "weight." A weight is an element of a ring and is usually defined by a suitable generating function. The weight of a function f: V --> V is defined by w(f) = ITuEV w(/(u)). Two functions fl and f2 are considered iden- tical (/1 = f2) ifthere exists a n E G such that fl = f2 0 n. Put F = VV 1(3), the set of all equiv- alence classes of the relation 3. w(]) is well- defined for JEF, because fl =f2 implies w(f0 =W(/2)' Then we have the equality LfEFW(/) = PG(LVEV w(v), LVEV W(V)2, .." LVEV w(v)m). Here, PG(YI' Y2' ..., Ym) is the cycle index of G, which is given by n- I Ln€GY{I(n)Y2(n) ...ym(n), where n = IGI, m= I VI, andjk(n) is the number of cyclic permutations of length k contained in n. This is called P6lya's enumeration theorem. When G = Sm (the tsymmetric group), psJy I' ..., Ym) = LAk"O,EkAkm(.A'1 ! 2A2A 2! ... mAmAm!fl Yt, 248 ...ym, When G=Am (the ta1ternating group), PAm(y"... ,Ym)= LAk"0,kAkm,EAkeven2(AI !2 A2 A 2 ! ... mAmAm!fl Yt 1 ... ym. When G = C m (the tcy- clic group), PCJYI'''' ,Ym)=m- l Lklm<p(k)y;:'/k, where <p(k) is the tEu1er function, i,e., the num- ber of integers less than k and coprime to k. This theorem is widely applicable to various problems with suitable choices of weights, Several generalizations have also been studied (see, e.g., [3]), F. The Notion of Polymatroids The theory of matroids was originally intro- duced as an abstraction of the notion of tlinear dependence in a vector space or of some prop- erties of tgraphs or of the notion of talge- braic dependence in the theory of fields (see, e.g., [4,5]), But recently, as many important applications were discovered, it has become an important branch of combinatorics (see [6-8]). Let J.l be a function from a tlattice L to a totally ordered module R. J.l is called sub- modular if J.l(x) + J.l(y)  J.l(x v y) + J.l(x /\ y) for every pair of elements x, Y E L. The set of a E L such that J.l(a)==minxELJ.l(x) constitutes a sub- lattice in L. Let L be the family of all subsets of a finite set E, The sublattices K in L = 2 E and the pseudo-orders over E correspond in a one-to-one manner as follows (here pseudo- order means a binary relation taken as an tordering without assuming the tantisym- metric law): (i) The class of a partition P of E corresponds to the difference of two subsets which are the immediate predecessor and immediate successor in K, or the complement with respect to E of the maximum element in K, or the complement of the minimum ele- ment. (ii) The complement of the maximum element in K is maximum in the order of P, and the minimum element in K is minimum in the order of P. (iii) Two classes  I, 2 in P, given as the difference of two elements in K, satisfy the relation  I   2 with respect to the order of P if and only if all elements of K such that x=:> 1 in E satisfy x=:> 2' Now let E be a finite set, L = 2 E , and let R be the real number field with the usual addition and order relations. If a submodular function J.l: 2 E --> R satisfies the following conditions, then P=(E,J.l) is called a polymatroid over E, and J.l is called the rank function over P. (i) J.l(0) = 0, where 0 is the empty set. (ii) xc Y (c E) implies J1(x).s; J1(Y). A function u: E --> R is automatically extended to u: 2 E --> R by u(x) = Le€x u( e) (x c E), which is also denoted by the same symbol u. If a function u satisfies O.s;u(x).s;J.l(x) for every xEE, then u is called an independent vector over P. An independent 
249 vector u with u(E)=I1(E) is called a base ofP. A function u: E --> R is identified with an ele- ment in Rn, where n = I EI, Then a set of inde- pendent vectors in P forms a tconvex poly- hedron in R n , If we denote the polyhedron by P, then such a P is characterized by the fonow- ing two conditions: (P 1) If u E P then each v satisfying O.s;v(e).s;u(e) for every eEE also belongs to P. (P2) If two U, VE P satisfy u(E) < v(E), then there exists WE P, W # u, satisfying u(e).s; w(e).s; max[u(e), v(e)] for every eE E. If there are two polymatroids PI = (E, J1d and P 2 = (E, 112) over the same finite set E, then we have max{u(E)1 UEP I np 2 } = min{l1dx)+ 112(E -x)lx c E}, According to tinformation theory, an example of a poly matroid (E, 11) is given by the fonowing: l1(x), x E E, is the simul- taneous tent ropy of a subset x of the infor- mation source E with correlation. G. Matroids A poly matroid P = (E, 11) is caned a matroid if the values of 11 are always integers and 11 (x) .s; Ixl = the cardinality of the set x. A set xc E is caned an independent set if 11 (x) = lxi, and a dependent set otherwise. An independent set with maximal cardinality is caned a base. A minimal dependent set is caned a circuit. The set Y with max {IYII Y =:> x, l1(x) = I1(Y)} is uniquely determined by x, which is caned the closure of x and denoted by cI x. Several axioms stated in terms of these notions exist, and can be used to characterize matroids. Some of them are: (I) The axiom for the family of independent sets J: (i) 0c; (ii) XEJ and ycx implies yEl; (iii) If x, YEJ and Ixl < IYI, then there exists an eE E in Y - x such that xU {e} EJ. (2) The axiom for the family of bases fJa: (i) fJd is not empty; (ii) If b, b' E fJa and b # b', then for every eE b - b', there exists an e' such that e' E b' - band (b- {e})U {e'} EfJa. (3) The axiom for the family of circuits '6': (i) 0 E'6'; (ii) For every pair x, YE'6', y is not a proper subset of x; (iii) If x, y E '6' and x # y, then for every eExny there exists a ZE'6' such that zcxU y - {e}, (4) The axiom for the closure function c1: 2E-->2E: (i) xcclx=cI(clx) for every xEE; (ii) xcy implies c1xccly; (iii) If eEcI(xU {e'})- clx, then e'EcI(xU{e})-clx. The fonowing are typical examples of ma- troids: (1) Let E be the family of finite set of vectors in a vector space V and , be the family of sets of linearly independent vectors. (2) Let E be the set of edges of a tgraph and '6' be the set of edges constituting a fundamen- tal circuit. (3) Let E be a finite set and S be a family of subsets of E. x = {e" "', e,} c E is 66H Combinatorics defined to be independent if there exist differ- ent elements SI, ..., s, in S such that eiE Si for every i = I, ... , r. Let a weight function w: E --> R be defined for every element eE E in a matroid (E, 11). To obtain a base b of the matroid with minimal weight w(b)= LeEb w(e), we can apply the fol- lowing procedure, caned the greedy algorithm: (1) Arrange the elements e j , ''', en of E in such a manner that w(e d.s; ... .s; w(e n ). (2) Put b(O):= 0, and i:=O. (3) If x=b(i)U {e i +1} EJ, then b(i+I):= x; otherwise, b(i+l):= b(i). (4) Repeat process (3) for i = 0, 1,..., n - 1. Then the final b(n) is the base of minimal weight. H. Operations on Matroids For a matroid M = (E, 11) with the family fJa of its base, fJa* = {E - x I x E fJa} also satisfies the axioms of the base. The matroid M* = (E, 11*) with /16* as its base is called the dual of M. Here we define 11*(x)= Ixl + I1(E - x)-I1(E). When FeE, M I F = (F, I1F) is called the reduc- tion of M =(E, 11) to F, where I1F is the restric- tion of 11 on 2 F . A matroid M x F = (M* I F)* is caned the contraction of M to F. A matroid given by the form (M I F) x F' is called a minor of M. From two matroids M I =(E,l1d and M 2 = (E, 112) over the same set E, we can construct a matroid M = M I V M 2 =(E, 11) whose rank function l1(x) is defined by the minimum of {l1dy)+ 112(y) + Ix - YII Y ex}. This is called the union of M I and M 2 . Let M == (E, 11) be a matroid and (X be a posi- tive constant. The function j1(x)=I1(x)-(Xlxl is submodular. The set L of x which minimizes j1(x) is a sublattice of 2 E . The partition of E and the order relation of its classes defined by L are caned the principal partition of M with respect to the parameter (x, When the value of the parameter (X is not specified, (X is taken to be 2. As was seen in example (1) at the end of the preceding section, the family of vectOrs in a vector space V over a field K is a matroid M. A matroid isomorphic to such an M is caned linearly representable over K. If K = GF(p), the finite field with p elements, then it is caned a p-ary matroid, A matroid linearly representable over any field is caned regular. If [GJ is a matroid isomorphic to the matroid defined by the family of edges of a graph as shown in example (2), it is called graphic. If the dual M* is graphic, M is caIled cographic, Graphic and cographic matroids are always regular. There exist matroids not linearly representable over a field K, or linearly repre- sentable over a particular field K but not regular, or regular but neither graphic nor cographic. If [GJ is a matroid isomorphic to 
66 Ref. Combinatorics the matroid of the type in example (3), it is called transversal Such a matroid is linearly representable when the cardinality of the field K is sufficiently large. References [1] G, C. Rota, On the foundations of com- binatorial theory I. Theory of Mobius func- tions, Z. Wahrscheinlichkeitstheorie, 2 (1964), 340-368. [2] M. Hall, Jr. Combinatorial theory, Blais- dell, 1967. [3] C. L. Liu, Introduction to combinatorial mathematics, McGraw-HiIl, 1968. [4] H. Whitney, On the abstract properties of linear dependence, Amer. J. Math., 57 (1935), 509-533. [5J B. L. van der Waerden, Algebra, Ungar, 1970, (Original in German, 1937.) [6] D. J. A, Welsh, Matroid theory, Academic Press, 1976. [7] E. L. Lawler, Combinatorial optimization - Networks and matroids, Holt, Rmehart and Winston, 1976, [8] M, Iri and S. Fujishige, Use of matroid theory in operations research, circuits and system theory, Int. J. System Sci., 12 (1981), 27-54. [9J 1. Riordan, An introduction to combina- torial analysis, Wiley, 1958. [!OJ H. J. Ryser, Combinatorial mathematics, Wiley, 1963. 67 (111.11) Commutative Rings A. General Remarks A ring R ( 368 Rings) whose multiplication is commutative is called a commutative ring. Throughout this article, we mean by a ring a commutative ring with unity element. B. Ideals Since our rings are <.:ommutative, we need not distinguish right or left ideals from tideals. A subset a of a ring R is an ideal of R if and only if a is an R-submodule of R ( 277 Modules), if and only if a is the tkernel of a ring homo- morphism from R into some ring (except for the case a = R), Given an ideal a of a ring R, the set of elements which are tnilpotent modulo a, i.e.. {xERlxnEa (3n)}, is called the radical of a and is often denoted by . The radical of the zero ideal is called the radical 250 of R, or more precisely, the nilradical of R ( 368 Rings H) For a subset S of a ring R, the smallest ideal a containing S is called the ideal generated by S, and S is caIled a basis for a; if S is a finite set, then S is called a finite basis. When a A (AE;\) are ideals of R, the sum of these ideals, denoted by L a A , is defined to be the ideal generated by the union of the ideals a A (AE ;\). If ;\ is a finite set, say {1, 2, ... , n}, then the sum is denoted also by a 1 + ... + an, and in this case, the sum is also called a finite sum. Note that a I + ... + an = {a I + ... + an I a i E aJ The product a l ... an of a finite number of ideals ai' ..., an is defined to be the ideal generated by the set {a l ... an I aiE aJ The intersection of an arbi- trary number of ideals is an ideal. When a is an ideal of a ring Rand S is a subset of R, the quotient a: S is defined to be the ideal {XE R IxS c a}. If a, b, c, b A (AE;\) are ideals, we have (a: b): c== a :bc, a:L b A == nA(a :b A ). C. Prime Ideals An ideal p of a ring R is caIled a prime ideal if Rip is an tintegral domain; p is a prime ideal if and only if p # R and also abEp (a, bER) im- plies aEp or bEp (some literature includes the ring R itself in the set of prime ideals). Let S be a multiplicatively closed subset of a ring R, i.e., a nonempty tsubsemigroup of R with respect to multiplication, A maximal member among the set of ideals which do not meet S is called a maximal ideal with respect to S. Such a mem- ber is necessarily a prime ideal; when S = {I }, it is called a maximal ideal of R. An ideal m is a maximal ideal of R if and only if Rim is a field. D. Jacobson Radical The intersection J of all maximal ideals of a ring R is called the Jacobson radical of R; in some cases, this intersection J is called the radical of R ( 368 Rings H). Let N be an R- submodule of a finite R-module M. If M J + N = M, then M = N (Krull-Azumaya lemma or Nakayama lemma). E. Krull Dimension For a prime ideal p of a ring R, the maximum of the lengths n of t descending chains of prime ideals P=PO;:;PI ... Pn which begin with p (or 00 if the maximum does not exist) is called the height or rank of the prime ideal p, For an ideal a, the minimum of the heights of prime ideals containing a is called the height of the ideal a. The maximum of heights of prime 
251 ideals of R is called the Krull dimension (or altitude) of the ring R. For an ideal a of R, the Krull dimension of Ria is called the Krull dimension (or depth) of the ideal a. (The mean- ings of the terms rank, dimension, and depth now depend on the writings in which they are found; standardization of definition of these terms is becoming more and more of a necessi ty.) F. Primary Ideals For an ideal q of R, if q #R and every zero divi- sor of Rlq is nilpotent, then q is called a pri- mary ideal. (If R is included in the set of prime ideals, then R is regarded as a primary ideal.) I n this case, p = jq is a prime ideal, and q is then said to belong to p or to be a p-primary ideal. The intersection of a finite number of pri- mary ideals belonging to the same prime ideal p is p-primary. Assume that an ideal a is ex- pressed as the intersection a = q In,.. n qn of a finite number of primary ideals q" ..., qn' If this intersection is irredundant, that is, if none of the qi is superfluous in the expression of a, then the set of  (i = 1,..., n) is uniquely determined by a. The J qi are called prime divisors (or associated prime ideals) of the ideal a; a minimal one among these is called a mini- mal (or isolated) prime divisor of the ideal a, and those which are not minimal are called embedded prime divisors of the ideal a. A max- imal one among the prime divisors of a is called a maximal prime divisor of the ideal a. (For definitions of these concepts in the case where a is an arbitrary ideal, see [4].) If, in the expression of a as above, n is the smallest occurring in similar expressions, then the expression is called the shortest representation of the ideal a by primary ideals. In this case, each qi is calIed a primary component of the ideal a; if;;t is isolated, then qi is called an isolated primary component of the ideal a, and otherwise qi is calIed an embedded primary component of the ideal a. Isolated primary components are uniquely determined by a, but embedded primary components are not. G. Rings of Quotients Let R be a ring. Then the set U of elements of R which are not zero divisors is multiplica- tively closed. In the set R x U = {(r, u) I rE R, u E U}, we define a relation = by (r, u) = (r',u')=ru'=r'u. Then = is an equivalence relation, and the equivalence class of (r, u) is denoted by rlu. In the set Q of these rlu, addi- tion and multiplication are defined by rlu + r'lu' = (ru' + r'u)luu', (rlu)(r'lu') = rr'luu'. Then 67H Commutative Rings Q becomes a ring, and rll can be identified with r, Thus Q is a ring containing R, gen- erated by R and inverses of elements of U. This property characterizes Q, which is called the ring of total quotients of the ring R. If R is an integral domain, then Q is a field, called the field of quotients of the integral domain R. Let S be a multiplicatively closed subset of R such that orts, and let 0 be {xERlxs=O (3SES)} and <p be the natural homomorphism R --> RID. Then none of the elements of <p(S) is a zero divisor. The subring of the ring of total quo- tients of RID, generated by RID and inverses of elements of <p(S), is called the ring of quotients of the ring R with respect to S, and is denoted by Rs (R[S-IJ or RS- I ). When M is an R- module, M @RRs is called the module of quo- tients of the R-module M with respect to S. It is significant that Rs is t R-flat. There is a one- to-one correspondence between the set of pri- mary ideals q of R which do not meet Sand the set of primary ideals .Q of Rs such that q corresponds to .Q if and only if.Q = qRs (q = .Q n R). When p is a prime ideal of R, the com- plement R - P is multiplicatively closed, and RR-p is called the local ring of p or the ring of quotients of the ring R with respect to the prime ideal p, and is denoted by Rp ( 284 Noe- therian Rings C, D). A ring of quotients is also calIed a ring of fractions. H. Divisibility In a ring R, if a = bc (a, b, CE R), then we say that b is a divisor (or factor) of a, and that a is a multiple of b, or a is divisible by b. We denote this by b I a. This relation between a, bE R is called divisibility relation in R. If, in this situ- ation, c has its inverse in R, we say that a is an associate of b. A factor b of a is called a proper factor if b is neither an associate of a nor tin- vertible. An element which has no proper fac- tor is calIed an irreducible element. A nonzero element which generates a prime ideal is called a prime element. If in an integral domain R every nonzero element is a product of prime elements (up to invertible factors), then we say that the unique factorization theorem holds in R, and that R is a unique factorization domain (or simply u.f.d.). Let A = {ai, ..., an} be a set of nonzero ele- ments of a ring R. A common divisor of A is an element which is a factor of a i . A common multiple of A is defined similarly. The greatest common divisor (G.C.D.) of A is a common divisor which is a multiple of any common divisor; the least common multiple (L.C.M.) of A is a common multiple m which is a factor of any common multiple. Thus, the G.C.D. and L.C.M. exist if R is a u.f.d. 
67 I Commutative Rings I. Integral Dependence Let R be a subring of a ring R" sharing a unity element with R". An element aER" is said to be integral (or integrally dependent) over R if there are a natural number n and elements C i of R such that an + C 1 an-I +... + C n =0, If every element of a subset S of R" is integral over R, we say that S is integral over R. (When R has no unity element, a similar definition is given under an additional condition that CiERi. An important special case is where R is an ideal. See D. G. Northcott and D. Rees, Proc. Cam- bridge Phi/os. Soc., 50 (1954); M. Nagata, Mem. Call. Sci. Univ. Kyoto, 30 (1956).) The set R of elements of R" which are integral over R is a ring and is called the integral closure of R in R". If R = R, then R is said to be integrally closed in R". If R is integrally closed in its ring of total quotients, we say that R is integrally closed. An integrally closed integral domain is called a normal ring. (In some literature, an integrally closed ring is called a normal ring.) An element a E R" is called almost integral over R if there is an element b of R such that b is not a zero divisor and an bE R for every natural number n. If an element a of the ring of total quotients of R is integral over R, then a is almost integral over R. R is said to be com- pletely integrally closed if its ring of total q uo- tients contains no elements which are almost integral over R except the elements of R itself. J. Group Theorem Let Q be the ring of total quotients of a ring R. An R-submodule a of Q is called a fractional ideal of R if there is a non-zero-divisor c of R such that ca c R. The product of fractional ideals is defined similarly as in the case of products of ideals. The inverse a -I of the fractional ideal a is defined to be {x E Q I xa c R}. If a contains an element which is not a zero divisor, then a -I is also a fractional idea\. When R is completely integrally closed, we define fractional ideals a and b to be equiva- lent if a -I = b -I. This gives rise to an equiva- lence relation between fractional ideals. The set of equivalence classes of fractional ideals which contain non-zero-divisors forms a group. This result is called the group theorem. An integral domain R is called a Krull ring if (i) for every prime ideal p of height 1, the ring Rp is a tdiscrete valuation ring; (ii) R is the intersection of all the valuation rings Rp and (iii) every nonzero element a of R is contained in only a finite number of prime ideals of height 1. In a Krull ring R, for an arbitrary nonzero fractional ideal a, there is a uniquely determined product of powers of prime ideals 252 of height 1 which is equivalent to a in the sense stated above (439 Valuations). K. Dedekind Domains and Principal Ideal Domains A ring R is called a Dedekind domain if (i) R is a tNoetherian integral domain, (ii) R is a normal ring, and (iii) the Krull dimension of R is 1. For an integral domain R which is not a field, R is a Dedekind domain if and only if the set of all nonzero fractional ideals is a group, if and only if every nonzero ideal of R is ex- pressed as the product of a finite number of prime ideals and such.expression is unique up to the order of prime factors, An important example of a Dedekind domain is the ring of all talgebraic integers, i.e., the tprincipal order of an talgebraic number field of finite degree. In general, if R is a Dedekind domain with field of quotients K and L is a finite algebraic extension of K, then the integral closure of R in L and any ring R' such that R c R'  K are Dedekind domains. An ideal generated by an element is called a principal ideal; a fractional ideal generated by an element is called a principal fractional ideal (or simply principal ideal). In a Dedekind domain, the set P of nonzero principal frac- tional ideals is a subgroup of the group I of nonzero fractional ideals; I I P is called the ideal class group of R, and a member of it is called an ideal class. The torder of liP is called the class number of R ( 14 Algebraic Number Fields). There are many Dedekind domains whose class numbers are infinite. A ring R is called a principal ideal ring if every ideal is principal; furthermore, if R is an integral domain, then R is called a principal ideal domain. A principal ideal ring is the direct sum of a finite number of rings of which each direct summand is either a principal ideal domain or a t10cal ring whose maximal ideal is a principal nilpotent ideal. A principal ideal domain which is not a field is a Dedekind domain and a u.f.d. We consider, for an arbi- trary natural number n and a principal ideal ring R, the set M(n, R) of all n x n matrices over R. Given an element A of M(n, R), there exist elements X, Y in M (n, R) such that (i) X-I, y- I are in M(n,R), and (ii) denoting by bij the (i,j)-entry of XA Y, we have b ll R =:> b 22 R =:>.,. =:>bnnR and bij=O if i #j. The nonzero members of the set {b ll ,b z2 , ...,b nn } are called the elementary divisors of the matrix A. Apply- ing this to a finite module M over the principal ideal ring R, we see that M is the direct sum of mlR, ...,mrR (miEM) such that, with a i = {xER Imix=O}, we have a l ca 2 c '" ca, ( 2 Abelian Groups B; 269 Matrices E). 
253 L. Euclid Rings A ring R is called a Euclid ring if there is a map <p of R - {O} into a twell-ordered set W (the set of natural numbers is a special case) satisfying the condition that if a, bE R, a # 0, then there are r, q E R such that b = aq + rand either r=O or <p(r) < <p(a). Every Euclid ring is a principal ideal ring. Besides the ring Z of rational integers, there are several familiar examples of Euclid rings, such as Z[j=!J, Z[J=2J, Z[wJ (w 3 = 1, w # 1), and the polynomial ring of one variable over a field. References [IJ W. Krull, Idealtheorie, Erg. Math., Springer, second edition, 1968. [2J B. L. van der Waerden, Algebra I, II, Springer, 1966, 1967. [3J O. Zariski and P. Samuel, Commutative algebra I, II, Van Nostrand, 1958, 1960. [4J M. Nagata, Local rings, Interscience, 1962. [5J N. Bourbaki, Elements de mathematique, Algebre, ch. 1,8, Actualites Sci. Ind., Hermann, 1144b, 1964; 1261a, 1958. [6J N. Bourbaki, Elements de mathematique, Algebre commutative, ch. 1-7, Actualites Sci. Ind., Hermann, ch. 1,2, 1290a, 1961; ch. 3,4, 1293a, 1967; ch. 5,6,1308,1964; ch. 7,1314, 1965. [7] A. Grothendieck, Elements de geometrie algebrique, Publ. Math. Inst. HES, 4 (1960), [8J D. G. Northcott, Lessons on rings, mod- ules and multiplicities, Cambridge Univ, Press, 1968, 68 (X11.1 0) Compact and Nuclear Operators A. General Remarks Let X be a finite-dimensional linear space. Then a linear operator in X is surjective if and only if it is injective. If X is infinite- dimensional, this is no longer the case in general. For an operator of the form 1 + K with an integral operator K of continuous kernel, I. Fredholm [IJ developed the the- ory of determinants and retrieved the above equivalence ( 217 Integral Equations). Later, F. Riesz [3] simplified the proof and showed that the equivalence holds if K is a compact operator in a Banach space, I. Ts. Gokhberg and M, G, Krein [4J reformulated the result as 68C Compact and Nuclear Operators the stability of indices of operators. The con- cept of compact operators itself was introduced by D. Hilbert (who used the terminology, "com- pletely continuous operators") The Hilbert- Schmidt class is also due to him. These classes of operators are employed in the spectral theory of integral operators [2]. The trace class of operators in Hilbert spaces is defined as the class of operators for which the trace has meaning. A. Grothendieck [5J introduced nuclear operators, extending the definition of trace class operators to general Banach spaces. Nuclear operators and related classes of operators, such as integral operators and absolutely summing operators, play important roles in the theory of topological tensor prod- ucts [5J, determinants [6J, vector measures, and measures in linear spaces [7]. B. Compact Operators A tlinear operator T from a tBanach space X to a Banach space Y is said to be compact (or completely continuous) if T maps any tbounded set of X to a trelative1y compact set of Y. In other words, T is compact if, for any bounded sequence {xn} in X, the sequence {Tx n } in Y contains a tstrongly convergent subsequence. A compact operator is necessarily tbounded and hence continuous. A compact operator from X to Y maps any tweakly convergent sequence in X to a strongly convergent se- quence in Y. If X is treflexive, the converse is also true. In this article the set of all bounded (resp. compact) linear operators from X to Yare denoted by B(X, Y) (resp. B(C)(X, Y)). C. Examples of Compact Operators (1) Degenerate operators. An operator TEB(X, Y) is said to be degenerate or of finite rank if the trange R(T) of T is finite- dimensional. A degenerate operator is neces- sarily compact. The tidentity operator in X is compact (Y=X) if and only if X is finite- dimensional. (2) tIntegral operators with con- tinuous kernel. Let E and F be bounded closed regions in the Euclidean spaces Rm and Rn, respectively, and let k=k(t,s), tEF, sEE, be a continuous function defined on F x E. Then the integral operator T with tkernel k, (Tx) (t) = I k(t, s)x(s) ds, t E F, (1) is a compact operator from the Banach space C(E) to C(F). (For the notation for various function spaces  168 Function Spaces,) (3) Integral operators of Hilbert-Schmidt type. In example (2) let E and F be tLebesgue mea- 
68D Compact and Nuclear Operators surable sets, and let kEL 2 (F x E). Then the integral operator T determines a compact operator from the tHilbert space L 2 (E) to the Hilbert space L 2 (F). (4) Let Q be a bounded open set in Rn, S < t two real numbers, and H'(Q) the tSobolev space of order S ( 168 Function Spaces). Then the natural injection from H'(Q) into W(Q) is compact (F. Rellich's lemma). D. Properties of Compact Operators Any linear combination of compact opera- tors is again compact. If a seq uence {Tn} of compact operators converges in the norm (i.e., in the tuniform operator topology), then the limit T is compact. Thus 8<c)(X, Y) is a closed subspace of the Banach space B(X, Y) with the toperator norm. Any product of a compact operator and a bounded opera- tor is compact. Namely, A E B(C)(X, Y), BE B(Y,Z), and CEB(Z,X) imply BAEB(c)(X,Z) and AC E 8<c\Z, Y). In particular, 8<C)(X) = B(C)(X, X) forms a closed ttwo-sided ideal of B(X)=B(X, X). An operator TEB(X, Y) is compact if and only if its tdual operator T' E B( Y', X') is compact. The range of a com- pact operator is always tseparable. Let X and Y be Hilbert spaces, Then for any TEB(c)(X, Y) there exist torthonormal sets {<Pn} in X and {i/1n} in Yand a sequence {c n } of nonnegative numbers with limcn=O such that Tu= LC n (U,<Pn)i/1n, UEX. Consequently, any compact operator between Hilbert spaces X, Y can be approximated by a seq uence of degenerate operators in the operator norm. However, there are Banach spaces X and Y for which the statement is no longer true. In fact, a Banach space X (resp. the dual Y' of a Banach space Y) has the tap- proximation property ( 37 Banach Spaces L) if and only if every TE B(C)(y, X) is the limit in norm of a sequence of degenerate operators for any Banach space Y (resp. X). E. The Riesz-Schauder Theorem Let TE B(C)(X) and consider a pair of linear equations u-Tu=f, fEX, <p-T'<p=g, gEX', where T' E B(X') is the dual operator of Tin the tdual space X' of X. Put uH={uEXlu= Tu} and uH' = { <P E X' I <P = T' <P }. Then one and only one of the following two cases (i) and (ii) occurs. (i) uH = {O}, uH' = {O}; for any f EX 254 equation (2) has a unique solution; and for any gEX' equation (3) has a unique solution. (ii) dimuH =dimuH' =m, I.s;m< 00; (2) has a solution if and only if f is orthogonal to uH' (i.e., <p(/)=O for any <pE.4t'); and (3) has a solution if and only if 9 is orthogonal to uH (i.e., g(u)=O for any uEuH), This is called the Riesz-Schauder theorem. In particular, when T is an integral operator in a suitable function space, this theorem is also called Fredholm's alternative theorem for integral equation (2). F. Fredholm Operators and Their Indices (2) (3) Let T be a tclosed linear operator from a Ba- nach space X to a Banach space Y. T is calIed a Fredholm operator or an operator with index if both Ker T ( = null space N (T)) and Coker T (= YjR(T» are finite-dimensional. Then the integer ind T = dim Ker T - dim Coker T is called the index of T. If a closed linear opera- tor T has Coker T of finite dimension, then the range R(T) is closed. Moreover, if R(T) is closed and the domain D(T) is dense, then dim Coker T= dim Ker T', where T' is the dual of T, and hence ind T=dimKer T -dimKer T" A linear operator K from X to Y is said to be T-compact if the domain D(K) contains D(T) and if K from D(T) with the tgraph norm into Y is compact, i.e" for any bounded sequence {xn} in D(T) with sup II Txnll < 00, the se- quence {Kxn} contains a strongly convergent subseq uence. The following are basic properties of Fred- holm operators [4, I I]: (1) If T is Fredholm and K is T-compact, then T + K is Fredholm and ind(T +K)=ind T. (2) If T is Fredholm and another linear operator S is sufficiently close to T in (graph) norm, then S is also Fred- holm and ind S = ind T. (3) If T is a Fredholm operator from X into Yand S is a Fredholm operator with dense domain in Y into Z, then ST is Fredholm and indST=indS +ind T. If D(T) is dense, then D(ST) is also dense. (4) Let T be a closed linear operator with dense domain; then T is Fredholm if and only if the dual T' is Fredholm, and then ind T' = -ind T. Fredholm operators and their indices were first studied by Russian mathematicians in connection with boundary value problems of differential equations and singular integral equations. M. F. Atiyah and I. M. Singer have proved that an telliptic linear differential operator P of order m on a compact differenti- able manifold M is a Fredholm operator from the Sobolev space W(M) into W-m(M) for any s and that its index is computed from the tsymbol of P and the tcharacteristic classes of M ( 237 K - Theory). Similarly, the indices of linear ordinary differential operators (and 
255 more generally of maximally overdetermined systems oflinear partial differential operators) are computed in various spaces of functions and generalized functions (H. Komatsu, B. Malgrange, J.-P, Ramis, M. Kashiwara). G. Spectra of Compact Operators The following structure of the tspectrum of compact operators TEB(C)(X) is derived from the Riesz-Schauder theorem. The spectrum O"(T) of T consists of at most countably many points and has no accumulation points except possibly for 0, Any nonzero point of O"(T) is an teigenvalue of T. When X is infinite- dimensional, 0 always belongs to O"(T) but is not necessarily an eigenvalue of T, Each non- zero eigenvalue of T has finite (algebraic) tmultiplicity, and hence the teigenspace .4t A (T) = {u I Tu = AY}, ..1. # 0, is finite-dimensional. If }, is an eigenvalue of T, then A is an eigenvalue of the tadjoint operator T* of T with the same (either algebraic or geometric) multiplicity as that of A. H. Spectral Representations of Compact Normal Operators Let T be a compact tnormal operator in a Hilbert space H. Then we can find a tcomplete orthonormal set consisting solely of eigen- vectors of T. Namely, for each nonzero eigen- value Aj of T, take an orthonormal basis {<pf))} of the eigenspace associated with Aj. Rearrange all the <pfj) into a sequence {<Pn} and add to it, if 0 is an eigenvalue, a complete orthonormal set of the eigenspace associated with 0, Then we obtain a desired complete orthonormal set of H. Let Iln be the eigenvalue associated with <Pn' Then the sequence {Iln} is precisely an enumeration of nonzero eigenvalues of T with repetitions according to multiplicity. In terms of {<Pn} and {Iln}, a tspectral repre- sentation of T is given as 00 Tu= L Iln(u, <Pn) <Pn , uEH. n=l The eigenvalue problem of a compact non- negative tself-adjoint operator T can be solved by means of the following Rayleigh prin- ciple. Consider Rayleigh's quotient R(x) = (Tx,x)/llxI1 2 , x#O. The largest eigenvalue III is obtained as III = max XEX R(x) and any vector XI which attains this maximum is an eigen- vector. When the largest n - 1 eigenvalues Ill'''' ,Iln-I and eigenvectors Xl, ''''X n - l are determined, the nth eigenvalue I1n is the maxi- mum of R(x) on the subspace orthogonal to Xl' "', X n - l and any vector which attains the 68 I Compact and Nuclear Operators maximum is an eigenvector. A more direct characterization of Iln, involving no previous eigenvectors, is given by Iln = min ( max R(X» ) . fl' .In-l EX .J.x.f)= fl. ,fn-l #0 )-1, ,n 1 This formula is referred to as the minimax principle, I. Classification of Compact Operators in Hilbert Spaces Let Hand K be Hilbert spaces. Then a TE B(H, K) is compact if and only if (TenJn)-->O for any orthonormal sequences en in Hand J" in K. J. von Neumann and R. Schatten (Ann. Math., 49 (1948)) classified operators TE B(C)(H, K) in the following way, The opera- tor A =(T*T)I/2 is a compact nonnegative self-adjoint operator. Let ex l ex2;3... be the enumeration in decreasing order of the posi- tive eigenvalues of A, with each repeated ac- cording to its multiplicity, The exn = exn(T) are sometimes called the characteristic numbers of T. For any p>O the set of all TEB(C)(H) such that ( 00 ) l/P IITll p = nl ex < +00 is denoted by Bp(H) (or simply Bp). Among these classes, BI and B 2 are most important. BI is called the trace class and B 2 the Hilbert- Schmidt class, Correspondingly, II TIll and liT 112 are called the trace norm and Hilbert- Schmidt norm of T, respectively. BI is also called the nuclear class and any operator TE BI a tnuclear operator. The norm II T lip can also be defined more directly as follows. (i) If 1 < p < 00, II TII p = sup II(Ten,J,,)lll p ; (ii) If 0 < p.s; 2, II Tll p = infll(11 Te,ll)lIl p ; (iii) If 2 .s;p< 00, II Tll p = sup 11(11 Tevll)lll , where en and fn range over the orthonoral sequences in Hand K, and e v over the orthonormal bases in H. The class Bp is a two-sided * ideal in the Banach algebra B. Moreover precisely, TE Bp and REB imply IIRTllp.s; IIRIIIITllp, IITRllp.s; IIRlllITilp and II T* II p = II Tll p ' When I .s;p< 00, the class Bp becomes a Banach space with the norm liT lip' It is always a tquasi-Banach space, in which the set of all degenerate operators is dense, The norm II T II p (T # 0) is a decreasing func- tion of p. Hence Bp c Bq if p.s; q, Also, TE Bp and SEB q imply TSEB" where I/r= l/p+ l/q. Let TEBp, and let {IlJ be an enumeration of eigenvalues of T with repetitions according to (either geometric or algebraic) tmultiplicity, Then DIlX.s;IITII, j 
68J Compact and Nuclear Operators Hilbert-Schmidt class. For an arbitrary complete orthonormal set {ud, we have II TII = L II Tukll z . Thus Bz can be defined as the set of all T E B such that the sum on the right-hand side is finite for a certain complete orthonormal set {ud. A linear operator T: L 2 (E)-->L 2 (F) is of Hilbert-Schmidt class if and only if it is an integral operator of Hilbert-Schmidt type (example (3) in Section C). The space Bz with the norm II T112 becomes a Hilbert space with the inner product defined by (T,S)z = L(Tuk> Sud, where {ud is as above. Trace class. For an operator TE B l , the trace tr(T) of T is defined as tr(T) = L (TUb ud. Here the right-hand side converges absolutely and does not depend on the complete ortho- normal set {ud. The trace is a bounded linear functional on the Banach space BI' The prod- uct of two operators of Hilbert-Schmidt type belongs to the trace class, and the converse is also true. If 1 < P, q < 00 satisfy l/p + l/q = 1, then the inner product < T, S) of TE Bp(H, K) and SE Bq(K, H) is defined by <T, S) =tr(ST). Under this inner product the dual of the Banach space Bp(H, K) is identified with Bq(K, H). Similarly, the dual of 8<C)(H, K) is isomorphic to BI (K, H), and the dual of Bl (K, H) to B(H, K) (1. Dixmier, Schatten). J. Volterra Operators A compact operator T in a Hilbert space is said to be a Volterra operator if it is tquasi- nilpotent, i.e., its tspectral radius is O. The integral operator (Tx)(t) = r k(t,s)x(s)ds, XELz(a,b), appearing in the integral equation of Volterra type is a Volterra operator. Conversely, a Volterra operator satisfying a suitable con- dition is unitarily equivalent to such an inte- gral operator (M. S, Livshits, Gokhberg and Krein [13]). A Volterra operator admits an abstract triangular representation in a manner similar to Jordan's canonical form ( 390 Spectral Analysis of Operators H), K. Nuclear Operators Extending the definition of trace class to operators in Banach spaces, A. Grothendieck [5J defined a nuclear operator (or Fredholm 256 operator according to [6J) from a Banach space X into a Banach space Y to be a linear operator T: X --> Y that is represented as Tx = L}'j<x,aj)b j with a sequence AjO in II, a bounded sequence a j in X', and a bounded sequence b j in Y. In other words, a linear operator T: X --> Y is nuclear if and only if it is decomposed as the product X Ioc Il!!. Y, (4) where A and B are bounded linear operators and;\ is multiplication by Ai in 11' The in- fimum of IIAIIIIAjll,1IlBII is called the nuclear norm of T and is denoted by II T III' When X and Yare Hilbert spaces, this coincides with the trace norm, The integral opera- tor Tdefined by (I) is nuclear with II Till = J sup,lk(t, s)1 ds, The totality BI (X, Y) of nuclear operators T: X --> Y forms a Banach space under the nuclear norm. If T is nuclear and A is bounded, then liT AliI .s; II T III II A II and II A TIll .s; II A II Fill . If T is nuclear, then the dual T' is nuclear, and II T' III .s; II Till' Suppose that X and Yare Banach spaces satisfying one of the following conditions: (i) Y is the dual of a Banach space; (ii) X' has the approximation property; (iii) Y" has the approximation prop- erty. Then conversely a TEB(X, Y) is nuclear if the dual T' is nuclear, and II TIll = liT' 111' However, this is not necessarily the case in general (T. Figiel and W. B. Johnson). Replacing II by I p , we obtain the definition of operators T: X --> Y of summable pth power. Grothendieck [5] considered the case 0 < p.s; 1 and showed that if Tl> ..., 1'. are of summable PI' .. . , Pn th power, then the product Tl ... 1'. is of summable rth power, where r is ,given by l/r=(L I/Pi)-(n+ 1)/2. L. Traces and Determinants of Operators Let T be a linear operator of tfinite rank in a linear space X. There are a finite number of elements aiEX' and biEX such that n Tx= L <x,ai)b i . i=l (5) Then tr(T)=L<b;,a) and det(J - T)= det(bi,j- <bi' a j ») are independent of the repre- sentation (5). Let X be a Banach space. If TEBl(X) is represented as TX=LAj<x,aj)b j , then it seems reasonable that the trace tr(T) be defined by L Aj<b j , a), but the sum may depend on the representation. It is known that a Banach space X has the approximation property if and only if the sum does not de- pend on the representation for any nuclear operator T. However, if T is of summable (2/3)rd power, the trace tr(T) is always de- fined uniquely. If X or T satisfies the afore- 
257 mentioned condition, then the determinant det(1- T) is also defined uniquely as the limit of det(1- 7;,), where 7;, is the nth partial sum, det(1 - zT) is an entire function of z and its zeros are exactly the reciprocals of the nonzero eigenvalues of T. If det(J - zT) #0, then the resolvent (l-zT)-1 is given by Fredholm's formula extending tCramer's rule, Eigen- vectors are also computed explicitly [6]. Let H be a Hilbert space, and let TEBdH), Then det(l-zT)=I1(1-zA j ), where Aj are the nonzero eigenvalues of T, and is an entire function of genus O. Moreover, let p;3 2 be an integer. Then the modified determinant detp(l- zT)= I) [(1- ZAj) exp Ct: k- I Zk Af) ] is defined for TEBp(H) and is an entire func- tion of genus p -1. This type of determinant was introduced by T. Carleman (Math, Z" 9 (1921)) for the Hilbert-Schmidt class of oper- ators ( also Hilbert [2J), It is utilized to prove the completeness of the troot vectors of T and many other facts [9,12,13]. M. Weakly Compact Operators A linear operator from a Banach space X to a Banach space Y is said to be weakly com- pact if T maps any bounded set of X to a rela- tively weakly compact set. Weakly compact operators are bounded. They have proper- ties similar to those of compact operators, The following are equivalent conditions for a bounded linear operator T: X --> y, (1) Tis weakly compact; (2) T': Y' -->X' is weakly compact; (3) T": X" --> Y" maps X" into Y; (4) There is a reflexive Banach space Z and bounded linear operators S: X -->Z and R: Z --> Y such that T = RS, The last characteri- zation is due to W, J, Davis, Figiel, Johnson, and A, Pelczyilski, Any linear combination of weakly compact operators is weakly compact. The uniform limit of a sequence of weakly compact operators is weakly compact. The product of a weakly compact operator and a bounded operator is weakly compact. Dunford-Pettis theorem ([9]; R, S, Phillips, Trans, Amer, Math, Sac" 48 (1940), Let (Q,J.l) be a O"-finite measure space and X a Banach space, If T: Lt (Q)--> X is a weakly compact linear operator, then there exists a bounded strongly measurable function 9 on Q with values in X such that Tf = f f(t)g(t)dJ.l(t), Then T maps weakly convergent sequences in LI (Q) into strongly convergent sequences in X. In particular, the product of two weakly com- 68N Compact and Nuclear Operators pact linear operators in LI (Q) is compact. Similarly, Grothendieck and R, G. Bartle, N, Dunford, and 1. Schwartz ( [9J) have proved the folIo wing: Let Q be a compact Hausdorff space and X a Banach space, If T: C(Q)--> X is a weakly compact linear mapping, then there is a tvector measure J.l defined on the tBorel sets in Q with values in X such that <J.l,x') is a tRadon measure for any x' E X' and Tf = f f(t)dJ.l(t), T maps weakly convergent sequences in C(Q) into strongly convergent sequences in X, The product of two weakly compact linear opera- tors in C(Q) is also compact. N. Absolutely Summing Operators A linear operator T from a Banach space X to a Banach space Y is called an integral operator ifthere is a positive tRadon measure J.l on the product A x B of the unit ball A in X' and the unit ball Bin Y" both equipped with the weak* topology such that <Tx, y') = f <x, x') (y', y") dJ.l(x', y"), or equivalently if T: X --> Y" is decomposeri as (4) with 100 and II replaced by Loo(M) and LI (M) for a suitable compact space M with a Radon measure, Nuclear operators are clearly integral. Integral operators are weakly com- pact, but they are not necessarily compact or nuclear. However, the product of an integral operator and a weakly compact operator is nuclear, and hence every integral operator in a reflexive Banach space is nuclear. If a compact (resp. weakly compact) opera- tor T from a Banach space X to a Banach space Y maps a closed linear subspace XI of X into a closed linear subspace Y I of Y, then the restriction T I : XI --> Y I and the induced operator T I :XjX t --> YjY I are also compact (resp, weakly compact), The corresponding result does not hold for nuclear operators and integral operators, The following conditions are equivalent for a bounded linear operator T: X --> Y: (i) There is a Banach space Y =:> Y such that T: X --> Y is integral; (ii) there is a positive Radon measure J.l on the unit ball A of X' such that II Txll.s; fl<x, x') IdJ.l(x'); (iii) there is a constant C such that L IITx;lI.s;Csup{D<xi,x')llllx'lI.s; I} holds for any finite set {x t, '" , x n } in X; (iv) if L Xi is an tunconditionally convergent series 
68 Ref. Compact and Nuclear Operators in X, then L II Txill converges. Grothendieck called such an operator a right semi-integral in view of property (i), and A. Pietsch called it absolutely summing in view of(iv). Similarly, a bounded linear operator T: X --> Y is called a left semi-integral if there is a Banach space X such that X = X/Xl for a closed subspace Xl, and the product f:x --> Y is integral. The product of two right (resp. left) semi-integral operators is nuclear. A bounded linear opera- tor in a Hilbert space is right (or left) semi- integral if and only if it is of Hilbert-Schmidt class. A bounded linear operator T from a Banach space X into a Banach space Y is integral if and only if the dual T' is integral. It is a right (resp. left) semi-integral if and only if T' is a left (resp. right) semi-integral. For other related classes of operators  Grothendieck's paper in Boletin Soc. Mat. sao Paulo, 8 (1953). References [IJ I. Fredholm, Sur une classe d'equations fonctionnelles, Acta Math., 27 (1903),365-390. [2J D. Hilbert, Grundziige einer allgemeinen Theorie der linearen Integralgleichungen, Teubner, 1912. [3J F. Riesz, Dber lineare Funktionalgleichun- gen, Acta Math., 41 (1918),71-98. [4J I. Ts. Gokhberg (Gohberg) and M. G. Krein, The basic propositions on defect num- bers, root numbers and indices of linear opera- tors, Amer. Math. Soc. Transl., ser. 2, 13 (1960), 185-264. (Original in Russian, 1957.) [5J A. Grothendieck, Produits tensoriels topo- logiques et espaces nucleaires, Mem. Amer. Math. Soc., 16 (1955). [6J A. Grothendieck, Theorie de Fredholm, Bull. Soc, Math. France, 84 (1956),319-384. [7J L. Schwartz, Geometry and probability in Banach spaces, Lecture notes in math. 852, Springer, 1980. [8J F. Riesz and B. Sz.-Nagy, Levon d'analyse fonctionnelle, Akad. Kiad6, Budapest, 1952. [9J N. Dunford and J. T. Schwartz, Linear operators I, II, Interscience, 1958, 1963. [lOJ K. Yosida, Functional analysis, Springer, 1965. [IIJ T. Kato, Perturbation theory for linear operators, Springer, 1966. [12J I. Ts. Gokhberg (Gohberg) and M. G. Krein, Introduction to the theory of linear nonselfadjoint operators, Amer. Math. Soc., 1969. (Original in Russian, 1965.) [13J I. Ts. Gokhberg (Gohberg) and M. G. Krein, Theory and applications of Volterra operators in Hilbert space, Amer. Math. Soc" 1970. (Original in Russian, 1967.) 258 [14J A. Pietsch, Nuclear locally convex spaces, Springer, 1972. 69 (IV.8) Compact Groups A. Compact Groups A ttopological group G is called a compact group if the underlying topological space of G is a tcompact Hausdorff space. The ttorus group T" = R n /zn (n = 1,2, ... ) (commutative group), the torthogonal group O(n), the tuni- tary group U(n), the tsymplectic group Sp(n), and the additive group Zp of tp-adic integers are compact groups ( 60 Classical Groups; for other compact Lie groups  249 Lie Groups, 248 Lie Algebras). Let C(G) be the tlinear space formed by all the complex-valued continuous functions f, g, h, ... defined on a compact group G; C(G) is a tBanach space with the norm Ilfll =suplf(x)l. XEG Since a compact group G is tlocally com- pact, there exists a right-invariant tHaar mea- sure on G. Because of the compactness of G, the total measure of G is finite, and the mea- sure is also left-invariant. By the condition that the total measure is 1, such a measure is uniquely determined. The integral off in C(G) relative to this measure is called the mean value of f Since for I, gE C(G),f(xy-l)g(y) is continuous in two variables x, y, the tconvo- lution f x g(x) = J f(xy-I )g(y)dy also belongs to C(G). C(G) constitutes a ring under the multiplication defined by the convolution. This ring can be considered as an extension of the notion of a tgroup ring for finite groups; it is called the group ring of the compact group G. The function x--> f(x- l ) will be denoted by f*, and the inner product in the tfunction space L 2 (G) will be written as (I, g). B. Representations of Compact Groups Let G(E) be the group of units of tbounded linear operators on a Banach space E, and suppose that we have a homomorphism U of a topological group G into G(E). The homomor- phism U is called a strongly (weakly) contin- uous representation on E of G if, for any a E E, the map x--> U(x)a of G into E is continuous with respect to the tstrong (tweak) topology on E. When E is a tHilbert space, a strongly con- tinuous representation U such that every 
259 U(a) is a unitary operator is called a tuni- tary representation. Let U be any strongly continuous representation of a compact group G on a Hilbert space E; for a, b in E, let <a, b) be the mean value of the tinner product (U(x)a, U(x)b) on E. Then U is a unitary repre- sentation on the Hilbert space E with the new inner product <a,b), A representation U of G on a Banach space E is said to be irreducible if E contains no closed subspace other than {O} and E, which is invariant under every U (x) (x E G). If a weakly continuous representation of a compact group on a Banach space E is irreducible, then E is finite-dimensional. Moreover, any unitary representation U of a compact group on a Hilbert space E can be decomposed into a discrete tdirect sum of irreducible representa- tions. Namely, there exists a family {E,} 'EA of irreducible (hence finite-dimensional) invariant subspaces E, of E which are orthogonal to each other such that E = L'EA E.. In particular, any continuous representation of a compact group on a finite-dimensional space is tcom- pletely reducible. In L 2 (G) with respect to the tHaar measure on a tlocall y compact group G, the representation U defined by (U(x)f)(y)= f(yx) (X,yEG,jEL 2 (G)) is a unitary repre- sentation of G. This representation U is called the (right) regular representation of G. Decom- position of the regular representation of a compact group G into irreducible represen- tations is given by the Peter-Weyl theory, descri bed later. In the rest of this article, the representations under consideration will be (continuous) repre- sentations by matrices of finite degree, Let DI(x)=(d\})(x)) and D 2 (x)=(d!J>(x)) be irredu- cible unitary representations which are not mutually tequivalent. Then from tSchur's lemma follow the orthogonality relations (d?,dH»=O and (dU), d)=bimbjn (where n j is the degree of the representation Dd. From each tclass D, of irreducible repre- sentations of G, choose a unitary representa- tive D,(x) = (d(j(x)), and let n, denote its de- gree. Then from the orthogonality relations it follows that the Ad(j(x) form an tortho- normal system of L 2 (G). Let hE C( G) and consider the map H: f --> h x fof C(G) to C(G). Then H is a tcompact operator in C(G). Since C(G) is contained in L 2 (G), we can define the inner product in C(G). By (h x f,g) = ( f, h* x g), h=h* implies (HI, g) = (f, H g); that is, H is a tHermitian operator. For a given f in C(G), there exists h( = h*) in C(G) such that h x fis uniformly arbitrarily near f From the theory of compact Hermitian operators, Hfcan be uniformly approximated by linear combinations of the teigenfunctions of H. Since the teigenspace of H is finite- 69B Compact Groups dimensional and invariant under U(a), the eigenfunctions of H are linear combinations of a finite number of Adi)x). Hence any func- tion fin C( G) can be uniformly approximated by linear combinations of a finite number of A dij(x). This fact, like the similar result in tFourier series, is called the approximation theorem. From this, it follows that the or- thogonal system {Adij(x)} is tcomplete; i.e., if an element of C(G) is orthogonal to each element in this system, then it is O. Since C(G) is dense in L 2 (G), {A di)x)} is a tcomplete orthonormal system of the Hil- bert space L 2 (G). Hence for any {EL 2 (G), its "Fourier series" 'L.,'L.ijcijA dij(x) (where cij = (f, j;;: dij)) converges to f in the mean of order 2 (i.e., with respect to the tmetric of L 2 (G)). In particular, if G is a compact tLie group and f is sufficiently many times dif- ferentiable, then this series converges uni- formly to f The space Vi' of dimension n, spanned by the elements d;j(x) (1 .s;j.s; n,) in the ith row of the matrix D,(x) is invariant under the right regular representation U. The representation on Vi' given by U can only be D,. Then the fact that {A d(j(x)} is an orthonormal system of L 2 (G) means that the regular representation U of a compact group G is decomposable into a discrete direct sum of finite-dimensional irreducible representations, Each irreducible representation D, is contained in U with multi- plicity equal to its degree n.. If a function <p(x) in C(G) satisfies <p(y-l xy) = <p(x) for any x, y, then it is called a class function. The set K(G) of all the continuous class functions coincides with the tcenter of the group ring C(G). The tcharacter of an irredu- cible representation of G is a class function, and the set {X,(x)} of all characters plays the same role as the orthogonal system {A dij(x)} in C(G). Namely, {X,(x)} is a complete orthonormal system in (the tcom- pletion of) K(G), and any class function can be uniformly approximated by linear combi- nations of a finite number of these characters. The preceding paragraphs give a brief de- scription of the Peter- Weyl theory. If G is the I-dimensional ttorus group T 1 = R/Z, namely, the compact group of real numbers mod 1, then this is actually the theory of Fourier series concerning periodic functions on the line. (For concrete irreducible representations of O(n), U (n), Sp(n), and formulas for charac- ters  60 Classical Groups. For represen- tations of compact Lie groups  249 Lie Groups, 248 Lie Algebras.) The theory of compact groups was completed by F. Peter and H. Weyl (Math, Ann., 97 (1927», and J. von Neumann's theory concerning almost periodic functions in a group (1934) united the 
69C Compact Groups theory of compact groups with H. Bohr's theory of almost periodic functions ( 18 Almost Periodic Functions). C. Structure of Compact Groups Let a be an element of a compact group G different from e. Since the underlying space of a topological group is a tcompletely regular space, there exists a function in C(G) such that f(a) # f(e). Hence there exists a representation D(x) of G such that D(a) is not equal to the unit matrix, This means that any compact group G can be expressed as a tprojective limit group of compact Lie groups. Beginning with this fact, von Neumann (1933) showed that a tlocally Euclidean compact group is a Lie group ( 423 Topological Groups N). D. Set of Representations The set G' = {D} of representations of G by matrices admits the following operations: (i) DI @ D 2 (ttensor product representation); (ii) ( DI 0 ) DI EB D 2 = (tdirect sum representa- o D 2 tion); (iii) p- I DP (equivalent representation); and (iv) D (complex conjugate representation). Let M be a subset of G' such that DEM when- ever DEM and the irreducible components of DI @ D 2 are in M whenever DI' D 2 E M, Then M is called a module of representations of G. There is a one-to-one correspondence between closed normal subgroups H of G and modules M formed by all the representations of G/H. A representation of G' is a correspondence which assigns to each D a matrix A( D) of the same degree as that of D and preserves the operations ofG': A(DI @D 2 )=A(Dd@A(D 2 ), A(DI EBD2)=A(DdEBA(D2)' A(P-IDP)= P-IA(D)P, and A(D)== A(D) , Let Goo be the set of all the representations of G', Define the product of AI, A 2 E Goo by Al A 2 (D)== AdD)A2(D) and a topology on Goo by the tweak topology of the functions A( D) of D. Namely, a typical neighborhood of Ao is of the form U(Ao; DI'"'' Ds; e)= {A III A(D;)-Ao(D;) II <e)= I,..., s}. Goo is a topological group under this multiplication and topology. Then the Tannaka duality theorem states that Goo  G holds (T. Tannaka, Tohoku Math. J., 45 (1939)). Let R(G) be the talgebra over the complex number field C formed by the set of all the linear combinations of a finite number of drj{x), and let Aut R(G) be the automor- phism group of this algebra R(G). Let G* be the set of all the elements 0" in Aut R(G) which commute with every left translation L(x) «L(x)f)(y) = f(xy)) and which satisfy 260 O"(J ) = 0"(/) . Then G * is a topological group with respect to the weak topology, and the Tannaka duality theorem implies that the correspondence which assigns to each x E G the right translation R(x) (the restriction of U(x) to R(G)) is an isomorphism of G onto G* as topological groups. For the case where G is a compact Lie group, C. Chevalley re- stated the Tannaka duality theorem as one giving a relation between compact Lie groups and complex algebraic groups ( 249 Lie Groups U), References See references to 18 Almost Periodic Func- tions and 423 Topological Groups. For compact Lie groups  references to 249 Lie Groups. 70 (IX.5) Complexes A. General Remarks The notion of complexes was introduced by H. Poincare to study the topology of tmanifolds by combinatorial methods ( 65 Combina- torial Manifolds). Various kinds of complexes have been introduced in the course of the development of topology. These will be dealt with individually in the sections that follow, B. Euclidean Simplicial Complexes Let RN be the N-dimensional tEuclidean space, and let a o , a l ,.,., am be points of RN with coordinates a i = (al), a2), ..., aN)) for i = 0, I, "', m. For real numbers ..1. 0 , ..1.1' ..., Am, we denote by Aoa o + Al a l +... + Am am the point (Lo Aial), Lo Aia2), ..., Lo AiaN)) of R N . A set ofn+ I points a o , a p .."an ofR N is said to be independent or in general position if vectors a,;a;, a;;a;, ... ,Gc;a;; are linearly inde- pendent. For given independent points a o , ai' ""an' let laoa l ...anl denote the subset of R N given by laoal...anl={Aoao+Alal +... +AnanlAo +..1. 1 +... +An= I, A i ;3 O}. laoa l '" ani is called an n-simplex with vertices a o , ai' "', an' Simplexes are denoted by , ', 0", r, etc, For example, a O-simplex laol is a point a o , a I-simplex laoall is a segment aOal , and a 2-simplex laoa l a 2 1 is a triangle aoal a 2 . 
261 We attach to Aoa o + ..1.1 a l + ... + An an the co- ordinates (..1. 0 , AI> ..., An), called the barycentric coordinates. If laoa l .., ani is an n-simplex, every subset {a io ' ail' ..., a i ,} of {a o , al> ..., an} is independent. A q-simplex I a io ai, ... ai,l is called a q-face of laoa l ... ani. A set 5t of simplexes in a Euclidean space R N is called a Euclidean (simplicial) complex in RN if 5t satisfies the following three conditions: (i) Every face of a simplex belonging to 5t is also an element of R (ii) The intersection of two simplexes belonging to 5t is either empty or a face of each of them. (iii) Each point of a sim- plex belonging to 5t has a neighborhood in RN that intersects only a finite number of sim- plexes belonging to R A Euclidean complex is also called a geometric complex (or rectilinear complex). Each O-simplex in 5t is called a ver- tex in R We define the dimension of 5t to be n if 5t contains an n-simplex but no (n + 1)- simplex, and 00 if 5t contains n-simplexes for all n;30. Bya subcomplex of a Euclidean complex 5t we mean a Euclidean complex that is a subset of R For a Euclidean complex 5t, its r-section (or r-skeleton) is defined to be the subcomplex of 5t consisting of all n-simplexes (n.s; r) in R If 5t is a Euclidean complex in RN, we de- note by 15t1 the set of points in R N belonging to simplexes in R This set 15t1 is calIed the Eucli- dean polyhedron of R By subdivision 5t' of a Euclidean complex 5t we mean a Euclidean complex such that 15t'1 = 15t1 and each simplex in 5t' is contained in a simplex in R Specifically, we can construct a subdivision 5t' of 5t utilizing tbarycenters of simplexes in 5t; namely, we let 5t' be the set of all r-simplexes whose vertices consist of barycenters of the series 110 c 11 1 C ... c l1, of sim pIe xes in R Then 5t' is a subdivision of , caIled the barycentric subdivision of 5t and denoted by Sd R Given a Euclidean complex 5t and a subset A of 15t1, we define the star of A in 5t to be the subcomplex of 5t that consists of simplexes {11} and their faces such that 11 n A # 0. Further- more, we define the open star of A in 5t as the union oft open simplexes (the interiors of simplexes) of 5t whose closures intersect A. We denote by St,\\(A) the star of A in 5t and by OJ\(A) the open star of A in 5t; then OJ\(A) is an open set whose closure is ISt,\\(A)I. The notion of Euclidean simplicial com- plexes can be generalized to that of Euclidean cell complexes; this is done by replacing the term simplex by tconvex cell in the definition of Euclidean simplicial complex, For a Euclidean cell complex 5t, the notions of vertex, dimen- sion, subcomplex, r-section, and subdivision are defined similarly as in the case of Eucli- dean simplicial complexes. 70C Complexes C. Simplicial Complexes Given a Euclidean simplicial complex 5t, let K denote the set of all the vertices in 5t, and let E denote the set consisting of those subsets {v o , .'. , v,} of K for which there exist simplexes 11 in 5t such that {v o , "', v r } coincide with the set of vertices of 11. Then we have (1) if sEE and S::J s', s' # 0, then s' E E; (2) every set con- sisting of a single element in K is in E, and the empty set is not in E, In general, if a pair (K, E) of a set K and a set E consisting of finite subsets of K satisfy (1) and (2), then the pair (or the set K) is called an abstract simplicial complex (or simply sim- plicial complex). If K is a simplicial complex, each element of the set K is called a vertex in K, and each set of E is called a simplex in K. A simplex consisting of n + 1 vertices is called an n-simplex, We say that a simplicial complex K is finite if it consists of a finite number of ver- tices; it is locally finite if every vertex of K belongs to only finitely many simplexes in K. We define similarly countable simplicial com- plexes and locally countable simplicial com- plexes. The dimension and r-section of a sim- plicial complex are defined as in the case of Euclidean complexes. By a subcomplex of a simplicial complex K we mean a simplicial complex Ko such that each simplex of Ko is a simplex in K. Let K and L be simplicial complexes. A mapping <p: K --> L is caIled a simplicial map- ping (simplicial map) of K to L if the following condition is satisfied: Ifvo, VI, ''',V n are ver- tices of a simplex of K, then <p(v o ), <P(VI), "" <p(v n ) are vertices of some simplex of L. Two simplicial complexes K and L are said to be isomorphic if there exist simplicial mappings <p: K -->L, 1jJ: L-->K such that IjJ a <p and <p a IjJ are the identity mappings. Given a simplicial complex K, let IKI denote the set of all functions x from the set of ver- tices of K to the closed interval 1= [0,1] satis- fying the folIo wing conditions: (i) The set {vEKlx(v)#O} is a simplex of K. (ii) LVEKX(V) = 1. The value x(v) is calIed the barycentric coordinate of the point xEIKI with respect to the vertex v. Each vertex V of K is identified with the point of IKI whose barycentric co- ordinate with respect to the vertex V is 1 and is called a vertex in IKI. For a simplex s = {vo, V" ..., vn} in K, we define Isl = {XE IKII x(v)=O (vlfS)}, which is called a simplex in IKI. We call {XE Isil X(Vi) >0 (i =0,1, "', n)} an open simplex of IKI or the interior of Isl. We remark here that for an arbitrary simplex s in K a point XE Isl can be written in the form x = LVE'X(V)'v. We can define a metric d on IKI by d(X,Y)==(LvEK(X(V)- Y(V))2)1/2. However, IKI is usually supplied with a tstronger topology 
70C Complexes defined as follows: (1) Each simplex Isl in IKI has the topology given by the metric d. (2) A subset U of IKI is open if and only if un Is I is an open subset of Isl for each simplex s of K. Henceforward, by the topology of I K I we mean the topology just defined unless other- wise stated. The set IKI with such a topology is called the polyhedron of K. The topology of I K I coincides with the above metric topol- ogy if and only if K is loca\1y finite. If sim- plicial complexes K and L are isomorphic, then IKI and ILl are homeomorphic. If K is the simplicial complex defined by a Euclidean simplicial complex R, then IKI and IRI are homeomorphic. When K is finite, there exists an Euclidean simplicial complex R whose sim- plicial complex is isomorphic to K. Accord- ingly I K I is homeomorphic to the Euclidean polyhedron IRI. If K and L are simplicial complexes, a map- ping j: I KI-->ILI satisfying the fo\1owing con- dition is said to be linear: If s = {vo, VI' ..., Vn} is a simplex in K and x = Aovo +... + }'nvn (..1. 0 + ... + 2n= 1, AiO), then f(vo),... J(v n ) belong to a simplex in Land f(x) = Aof( vo)+ .. . + Anf(v n ). The linear mapping determined by a simplicial mapping <p : K --> L is denoted by I <p I: I K 1--> I L I and is also called a simplicial mapping and denoted by the same letter <po Let K and K' be simplicial complexes. If there exists a linear mapping I: I K' 1--> I K I, which is a homeomorphism, then we identify IKI and IK'I by I and call K' a subdivision of K. The barycentric subdivision Sd K of a sim- plicial complex K is defined as in the case of Euclidean complexes. We also have notions of star StK(A) and open star OK (A) for a simpli- cial complex K and a subset A of I K I. If K and L are simplicial complexes, a mapping f: I K 1--> I L I is called a piecewise linear map- ping if there exist subdivisions K' and L' of K and L, respectively, such thatf:IK'I-->IL'1 is linear. Given an open tcovering \JJl = {MvLEK of a topological space X, the index set K becomes a simplicial complex if we consider each finite nonempty subset s of K such that nVES Mv # o (empty) to be a simplex. The resulting sim- plicial complex K is called the nerve of the open covering \JJl. Furthermore, if \JJl = {MvLEK and 91= {NW}WEL are open coverings of a set X, 91 is a trefinement of the covering \JJl, and L is the nerve of iJI, then a simplicial mapping <p: L-->K is defined by sending each vertex w in L to a vertex V in K such that NwcMv. Given two disjoint simplicial complexes K and L, a simplicial complex K * L, called the join of K and L, is defined by the following: (1) the vertices of K * L are the vertices of K and the vertices of L. (2) A nonempty subset of vertices is a simplex of K * L if and only if its 262 subsets in K and L are empty or simplexes there. In particular, the join of a simplicial complex K and a single point is ca\1ed the cone of K. A simplicial complex K is said to be ordered if a tpartial ordering is given in the set of ver- tices in K such that the set of vertices of each simplex is ttotally ordered. Given ordered simplicial complexes K and L, an ordered simplicial complex K x L, called the Cartesian product of K and L, is defined by the fo\1owing: (1) The vertices in K x L are pairs (v, w), where v and ware any vertices in K and L, respec- tively. (2) A set of vertices (v o , w o ), .", (v n , w n ) such that vo.s; ... .s; V n and wo.s; ... .s; w n is a simplex in K x L if (vo, ,." v n ) and (wo, "" w n ) are simplexes in K and L, respectively; all simplexes in K x L are obtained in this man- ner. (3) (Vb W I ).s;(v 2 , W 2 ) if and only if VI .s;v 2 andw l .s;w 2 . Assume that either K or L is locally finite or that both K and L are localIy countable. Let X and Y be topological spaces and I be the closed interval [0, I]. We define an equivalence realtion  in the topological space X U (X x Y x I)U Y by x(x,y,O) and y(x,y, 1), where XEX, yE Y. The quotient space of XU(X x Y x I) U Y by this relation is called the join of X and Yand is denoted by X * Y. Then the poly- hedron IK x LI is homeomorphic to the prod- uct space IKI x ILl of the topological spaces IKI and ILl. And the polyhedron IK * LI is homeomorphic to IKI * ILl. By a triangulation T of a topological space X we mean a pair (K, t) consisting of a sim- plicial complex K and a homeomorphism t: I K 1--> X. A triangulation is also called a simplicial decomposition. If T=(K, t) is a trian- gulation of X, the various concepts defined for K can be transferred to X by means of the mapping t. For example, by a simplex of the triangulation T we mean the image of a sim- plex of I K I under t. We say that a triangula- tion T=(K, t) is finite if K is a finite simplicial complex. If T = (K, t) is a triangulation and (K', I) is a subdivision of K, then T' =(K', to I) is called a subdivision of T. If TI =(KI' t l ), T 2 = (Kz, t 2 ) are triangulations of topological spaces XI, X 2 , respectively, a mapping f: XI -->X 2 is called a simplicial mapping relative to T 1 and T 2 if t 2 1 of ot l : IK 1 1-->IK 2 1 is a sim- plicial mapping, The fo\1owing two problems on triangulations are famous: (1) Under what topological conditions is it possible for a given topological space to be supplied with a trian- gulation? (2) Given two triangulations TI' T 2 of a space X, are there subdivisions T; = (K;, t'd, T; =(K;,t) of Tl and T2' respec- tively, such that K; and K; are isomorphic? Concerning the second problem, the conjec- ture asserting the existence of subdivisions T; 
263 and Tz as above is known as the fundamental conjecture (Hauptvermutung) in topology. Every 3-dimensional manifold is triangulable, and any two of its triangulations admit sub- divisions satisfying the condition in (2) (E. E. Moise [7J). In 1961, J. Milnor showed that the funda- mental conjecture is not true for polyhedra [8]. The triangulation problem and the funda- mental conjecture for topological manifolds were negatively solved by R. Kirby and L. Siebenmann [9J ( 65 Combinatorial Mani- folds). Any tdifferentiable manifold is triangu- lab Ie, and the fundamental conjecture holds for its tC'-triangulations (r 1) [6J ( 114 Differential Topology). Let T l , T 2 be triangulations of topological spaces X" X 2 , respectively, and let f: X, -->X 2 be a continuous mapping. Then a simplicial mapping <p:X,-->X 2 relative to Tl and T 2 is called a simplicial approximation to f if, for each XEX, the image <p(x) lies on the simplex of T 2 whose interior contains f(x). The follow- ing existence theorem is called the simplicial approximation theorem: For every continuous mapping f:X l -->X 2 , there exist a subdivision T; of Tl and a simplicial mapping <p: X, -> X 2 relative to T; and Tz that is a simplicial ap- proximation of f If the triangulation Tl is finite, then for a sufficiently large n we can choose Sd n T, as the T; above (where Sd o T= T and Sd n T=Sd(Sdn-t T) (n 1)). If <p:X,--> X 2 is a simplicial approximation to a continu- ous mapping I, then f and <p are thomotopic. D. Cell Complexes Let V n be the tunit n-disk, sn-, be the tunit (n -1 )-sphere, and X be a Hausdorff space. For a subset e of X, let e be the closure of e in X, and let e = e - e, A subset e of the space X is called an n-cell, or open n-cell in X if there is a relative homeomorphism <p: (V n , sn-l )-->(e, e), i.e., a continuous mapping <p: v n -->e such that <p(sn-l) C e and <p: v n - sn-, -->e -e is a homeo- morphism. For example, sn - {p} (p E sn) is an n-cell. A set {e AIAE;\} of cells in the Haus- dorff space X is called a cellular decomposition of X if the following three conditions are satis- fied: (i) e;.ne is empty if A#J1; (ii) X= UAEAe A ; (iii) If the dimension of e A is n + 1, then e A C X n , where X n is the union of all the cells e (J1EA) whose dimensions are not greater than n. For example, the n-sphere sn has a cellular decomposition consisting of a single O-cell and a single n-cell. A Hausdorff space X together with its cellu- lar decomposition {e A } is called a cell com- plex, and each e A is called a cell in the cell 70D Complexes complex X. For a cell complex, the notions of vertex, n-section, and dimension are defined in the same way as the corresponding notions in Euclidean complexes. Let X be a cell complex and A a topological subs pace of X such that the closure of each cell of X intersecting A is contained in A. Then the set of cells e such that en A # 0 forms a cellular decomposition of A. The set A together with this cellular decomposition is called a subcomplex of the cell complex X. A cell complex X with its cells {e;,} is said to be finite if the number of e A is finite. If each point in a cell complex X is an interior point of some finite subcomplex of X, then X is said to be locally finite. We define similarly a countable cell complex and a locally countable cell complex. If the closure of each n- cell of a cell complex X is homeomorphic to v n (n = 0, 1, ... ), X is said to be regular. If X and Yare cell complexes, a continuous map- pingf:X --> Y such that f(Xn)c y n (n =0,1,.,,) is called a cellular mapping. If X and Yare cell complexes, the set oft topological products e, x e 2 , where e" e2 run over all cells of X, Y, respectively, is a cellular decomposition of the product space X x y, The resulting cell com- plex X x Y is called the product complex of the cell complexes X and Y. A cell complex X is said to be closure finite if each cell in X is contained in a finite sub- complex of X; and X is said to have the weak topology if a subset U c X is open if and only if U ne is relatively open in e for each cell e of X. We call a cell complex a CW complex if it is closure finite and has the weak topology. The cellular decomposition of a CW complex is called a CW decomposition. A locally finite cell complex is a CW complex. Fundamental properties ofCW complexes are as follows: (i) A CW complex is a tpara- compact (hence tnorma1) space and is tlocally contractible, (ii) A subcomplex A of a CW complex X is a closed subspace of X, and A itself is a CW complex. (iii) A mapping f: X...... Y of a CW complex X to a topological space Y is continuous if and only if the restriction fl e is continuous for each cell e of X. (iv) If X and Yare CW complexes and f: X --> Y is any continuous mapping then there exists a cellular mapping of X to Y that is homotopic to f (cellular approximation theorem). (v) A pair (X, A) consisting of a CW complex X and its subcomplex A has the thomotopy extension property for any topological space. (vi) A CW complex has the tcovering homotopy property for any tfiber space. (vii) The product complex X x Y of two CW complexes X and Y is not necessarily a CW complex, but it is thomo- topy equivalent to a CW complex. (viii) If either X or Y is locally finite, or if both X and Yare locally countable, then the product 
70E Complexes complex X x Y is a CW complex. (ix) For CW complexes X and Y, the tmapping space Yx is homotopy equivalent to a CW complex. (x) The tcovering space of a CW complex has a CW decomposition. If K is a simplicial complex, the polyhedron IKI is a regular CW complex whose cells are all open simplexes in IKI. A simplicial complex K is (locally) finite if and only if the CW com- plex I K I is (locally) finite. In particular, the Euclidean polyhedron of a Euclidean sim- plicial (or cell) complex is a locally finite CW complex. A polyhedron IKI generally admits a CW decomposition whose cells are far smaller in number than the simplexes constituting a simplicial decomposition of K. For any CW complex X, there exists a polyhedron IKI that is homotopy equivalent to X. In partic- ular, if X is an n-dimensional finite (coun- table) CW complex, we can choose as K an n-dimensional finite (locally finite and coun- table) simplicial complex. E. Semisimplicial Complexes By an ordered simplex in a simplicial complex K we mean a finite sequence (v o , V"..., Vn) (nO) of vertices in K, contained in the set of vertices of a simplex in K. Let O(K)n be the set of all ordered simplexes of K of length n + 1, and define mappings ai:O(K)n-->O(K)n-' and Sj: O(K)n-->O(K)n+l for i =0, 1, ..., n by aj(v o , ".,vn)=(V O , "', Vi-I, V i +l' ...,v n ) and Si(V O ' ..., v n ) =(v o , ..., Vi-I' Vi' Vi, Vi+l, ..., v n ). Then the following relations hold: ajOaj=aj-t oa i (i<j), Sj 0 Sj = Sj+l 0 Sj (i .s;j), aioSj=Sj_l0ai (i<j), aiosj=SjOaj-j (i>j+ 1), ajOSj=ai+,osi=identity. Let n be the n-dimensional simplex in R n with vertices eo =(0, 0, ...,0), e, =(1,0, ...,0), . " , en = (0, .., , 1). By a singular n-simplex in a topological space X we mean a continuous mapping T:n-->x. Let S(X)n be the set of all singular n-simplexes in X, and define map- pings a i :S(X)n-->S(X)n-l and Sj:S(X)n-->S(X)n+l for i=O, 1, "', n by a j T(Ao,"', An) = T(Ao,'''' Ai-I' 0, Aj, ..., An) and Sj T(A o , ..., A n +,) = T(Ao, .." Ai-I, Ai + Ai+l' Ai+1' "', A n +1)' where (..1. 0 , ." ,An) is the point L70 Aje i , Aj;::' 0, L7=0 Ai = 1. Then relation (1) holds between a j and Si' Because of the importance of relation (1), which is basic in defining thomology of sim- plicial complexes and topological spaces ( 201 Homology Theory), S, Eilenberg and 1. A. Zilber gave the following definition: A semi- 264 (1 ) simplicial complex K consists of a sequence of sets Kn (n =0,1,...) together with mappings a i :K n -->K n - 1 , Si:Kn-->Kn+l (i=0, 1, ...,n) satis- fying relations (1). An element of Kn is called an n-simplex in K, and aj, Si are called the ith face operator and the ith degeneracy operator, respectively, A simplex is said to be degenerate if it is the image of a simplex under some Si' A semisimplicial complex is abbreviated as s.s. complex. The s.s. complexes O(K) = {O(K)n' a;, Si} and S(X) = {S(X)n, ai, sJ are called the ordered complex of K and the singular complex of X, respectively. Let K be an s.s. complex, and let Ln be a subset of Kn for n =0, 1, .... If ai(Ln)c Ln-, and si(L n ) c L n +1 for each i, then L = {Ln' a i I Ln' Si I Ln} is an s.s, complex, and L is ca\1ed a subcomplex of the S.s. complex K, If A is a subspace of a topological space X, S(A) is a subcomplex of S(X). If K is an ordered sim- plicial complex, a subcomplex O'(K) of O(K) is obtained by considering the set of all ordered simplexes (vo, v" "" Vn) such that Vo.s; VI .s; ... .s; V n (n = 0, 1, ...). If K and L are s.s, com- plexes, a sequence f = {In} of mappings fn: Kn --> Ln defined for each n is called an s.s. map- ping if aiofn= fn-' 0 a i and SiO fn = fn+, OSi (O.s;i.s;n). If f:X --> Y is a continuous mapping of topological spaces, then f determines an s,s. mapping S(/): S(X)-->S( Y) by S(f)(T) = f 0 T, Two s.s. complexes K and L are said to be isomorphic if there is a bijective s.s, mapping of K to L. For two s.s. complexes K and L, we define the Cartesian product K x L to be the s.s. complex given by (K x L)n = Kn x Ln, ai«J', r) = (ai(J', air), s;(O", r) = (Si(J', Sir) (J'EKn, rEL n ). If K and L are ordered simplicial complexes, the s.s, complexes O'(K) x O'(L) and O'(K x L) are isomorphic. If X and Yare topological spaces, the s,s. complexes S(X) x S( Y) and S(X x Y) are isomorphic. Given an s.s. complex K, we construct a topological space IKI as follows: First, we provide Kn with the tdiscrete topology and consider the topological space K = Un;,o Kn x n. Next we consider simplicial mappings si:n-l-->n and '7j:n+,-->n defined by Si(pj) = Pj (j < i), sj(Pj) = Pj+' (j;::' i) and '7 i (p) = Pj (j.s; i), '7 i (p) = Pj-l (j> i), where Po, ..., Pn are the vertices of n. The topological space IKI is defined to be the tquotient space of K with respect to an equivalence relation  that is defined by the following: (ai(J', y)  (0", Si(y)) (0" E Kn' Y En-l), (siO"' y)  (0", '7 i (y» (0" E Kn' YEn+l), where i=O, 1, ...,n. The space IKI is called the (geometric) realization of the s.s. complex K. Given an s.s. mappingf:K-->L, we obtain a continuous mapping If I : IKI-->ILI defined by Ifl(IO',yl)==lf(O'),yl, where 100,yl is the point in IKI represented by (O",y)EK. We call If I the realization of the s.s. mapping f 
265 The realization I K I of an s,s. complex K is a CW complex whose cells are in one-to-one correspondence with the nondegenerate sim- plexes in K. For a topological space X, a tweak homotopy equivalence p:IS(X)I-->X is defined by p(l T, yl) = T(y)(TE S(X)n, YEn). This mapping gives rise to a homotopy equiv- alence when X is a CW complex. The singular complex S(X) of a topological space X has the following property: Given simplexes 0"0, ... , O"k-l> 0"k+1, ... , O"n+1 E Kn with ai(Jj= a j _ 1 O"i (i <j, i,j # k), there exists a simplex O"EK n +1 with aiO" = O"i (i#k), An s,s, complex K with this property is called a Kan complex. If, in addition, aiO" = aiO"' (0",0"' E Kn, i # k) imply akO"=akO"', we call K a minimal complex, For every Kan complex K, there are minimal sub- complexes M of K that are isomorphic to each other. Moreover, IMI is a tdeformation retract of I K I. For a Kan complex K, the thomotopy group can be defined combinatorially. Two CW complexes X and Yare homotopy equivalent if and only if the minimal sub- complexes of S(X) and S(Y) are isomorphic. F. Eilenberg-MacLane Complex Given an integer n;::' 1 and a group n (Abelian if n;::' 2), there exists an tarcwise connected topological space X for which the thomotopy groups ni(X) are trivial for i # nand nn(X)  n. Such a space is called an Eilenberg-MacLane space of type (n,n). Let Q(X;X, *) be the tpath space over X, and let Po:Q(X;X, *)-->X be the natural projection. If X is an Eilenberg- MacLane space of type (n, n), then (Q(X; X, *), Po, X) gives rise to a standard tcontractible tfiber space whose fiber is an Eilenberg- MacLane space of type (n, n -1). Assume that X is an Eilenberg Mac Lane space of type (n, n) with Abelian group n. Then X is (n -1)- connected; hence the tHurewicz theorem can be utilized to show the existence of an isomor- phism h: nn(X)  Hn(X), while the universal coefficient theorem can be utilized to show that H"(X; n)Hom(Hn(X), n). Since in this case we have nn(X)n, the element h- I E Hom(Hn(X), nn(X)) can be regarded as an element of Hom(Hn(X), n). Now the funda- mental class of X is defined to be the coho- mology class U E Hn(X; n) corresponding to h -I . Let Y be a tCw complex and n( Y; X) be the set of thomotopy classes of continuous mappings from Y to X. Then there exists a one-to-one correspondence n( Y; X)-> Hn( Y; n) given by the assignment [fJ --> f*u ( 305 Obstructions). Let S(X) be the singular com- plex of X, and let M(X) be a minimal complex of S(X), If X is an Eilenberg-MacLane space 70F Complexes of type (n, n), M(X) is isomorphic to a certain complex determined uniquely by nand n. This complex is called the Eilenberg-MacLane complex of type (n, n) and is denoted by K(n, n). The notation K(n, n) is also used to mean the space X itself. Let (q) be a simplicial complex whose sim- plexes are all nonvacuous subsets of {O, 1, ''', q}, Let t.i :(q-l )->(q) be the simplicial mapping defined by t.i(j) = j (O.s;j.s; i-I), t.j(j) =j+ 1 (i.s;j.s;q-l), and let lJi:(q+ l)-->(q) be the mapping defined by lJi(j) = j (O.s;j.s; i), lJ;(j)=j-1 (i + l.s;j.s; q+ 1). Now K(n, n) is a Kan complex defined by K(n, n)q == zn((q); n), aiO"=O"Of. i , Sio"=o"0lJi> where ZO((q);n)=n, Zl ((q); n) is the set of n-valued functions defined on the set of pairs (i,j) such that O.s; i <j.s; q and satisfying the equality O"(j, k). O"(i, kr l . 0" (i,j) = 1 for O.s; i<j < k.s; q, and zn((q); n) (n;::' 2) is the group of toriented cocycles of the simplicial complex (q). If n is Abelian, the structure of the Abelian group zn((q); n) gives K(n, n) the structure of an Abelian group in the S.S. category. This struc- ture yields a one-to-one correspondence K(n, n)q-->K(n, n-l)q-I x '" x K(n, n -1)0 for n;::' 1 and leads to the expression ofrE K(n, n)q in the form (O"q-l> ...,0"0) with O"iEK(n,n-l)i' The W-construction of K (n, n - 1) for n;::' 1 is a Kan complex W(n, n -1) defined by W(n, n -l)q == K(n, n -l)q x K(n, n)q and ao«Jq x r q ) == (aoO"q)' O"q_1 X a o r q , aj(O"q x r q ) = aiO"q x air q for l.s; i.s; q, Si(O"q x r q ) = SiO"q x Sirq' where O"q E K(n, n-l)q and rq=(O"q_l> "',O"o)EK(n,n)q' Let p: W(n, n - I )-->K(n, n) be a natural projection. Then (W(n, n-l),p, K(n, n)) plays the role of the tuniversal bundle for K(n, n -1) in the s.s. category in the following sense: Let L be an S.s. complex, and let f: L-->K(n, n) be an S.S. mapping. We define f# W(n, n - 1) to be the subcomplex of W(n, n -1) x L generated by simplexes (O"q x r q ) x pq such that rq = f(pq), where O"q x rqE W(n, n -l)q and pqE Lq' Let p:f# W(n, n -l)-->L be the natural projection, Then (/# W(n, n - 1), p, L) is called the prin- cipal fiber bundle induced from W(n, n -1) by f Any principal bundle over L with group K (n, n - 1) can be ex pressed as an ind uced bundle. This property means that (W(n, n-l), p, K(n, n)) is universal. On the other hand, we have an algebraic analog of the univer- sal bundle for the chain group of K(n, n-l), called the bar construction [12]. Both these concepts were defined by Eilenberg and Mac- Lane in order to determine the structure of the (co)homology of K(n,n), which is denoted by (H*(n, n))H*(n, n). This object was later achieved by H. Cartan, who introduced an improved notion called Cartan construction ( Appendix A, Table 6.II1). Let n(L,K(n,n)) be the set of s,s, homotopy classes of s,s, map- 
70G Complexes pings from L to K(n, n). If n is Abelian, there exists a one-to-one correspondence n(L, K(n, n))-->Hn(L; n) given by the assignment [fJ--> f*u, where UE Hn(n, n; n) is the fundamental class of K(n,n). By virtue of this correspon- dence, k=f*UEHn(L;n) determines an in- duced bundle f# W(n, n -1) uniquely up to equivalence, which is denoted by K(n, n- 1) xkL. Let X be an arcwise connected topological space. Let x n be the Cartesian product of n copies of X. Clearly, the symmetric group 6n of degree n operates on X n . The n-fold sym- metric product spn X of X is defined to be the quotient space of x n under the action of 6n. If we specify a reference point of X, we have a natural inclusion spn-I X C SP"X and can consider the inductive limit space U l";n spn X, denoted by sp oo X. Then the Dold- Thom theorem [15] shows that M(SpOOX)c;;; I11 K(H;(X), i). In particular, we have M(SPooSn);;;K(Z,n) for n 1. This result can be applied to obtain a direct relationship be- tween the axiomatic definition of tcohomology operations using K(n,n) due to Eilenberg and Serre and the constructive definition using the symmetric groups due to Steenrod (A. Dold [16J, T. Nakamura [17J). For a detailed study of the (co)homology of spnx  [18]. G. Postnikov Complex Let X be an arcwise connected topological space. For the sake of simplicity, throughout this section we assume that X is tsimple ( 202 Homotopy Theory). Then the tPostnikov system of X can be defined as an inverse sys- tem (X n , Pn) (n = 0, 1,2, ...) consisting of topo- logical spaces X n , continuous mappings Pn: Xn--> X n - b and a system (X, qn) (n = 0, 1,2, ... ) consisting of continuous mappings qn: X --> X n such that Pn 0 qn = qn-l and satisfying the fol- lowing three properties: (1) Xo is one point. (2) (X n , Pn, X n - l ) is a tfiber space induced from a standard contractible fiber space over an Eilenberg-MacLane space of type (nn(X),n+ 1) by a mapping corresponding to a cohomology class kn+l E H n +1 (X n - l ; nn(X)), (3) qn*: ni(X)--> ni(X n ) gives an isomorphism for O.s; i .s;n. These cohomology classes kn+1 are called Eilenberg-Postnikov invariants (or simply k- invariants), Corresponding to the above facts, the minimal complex M(X) can be obtained as the inverse limit of a certain inverse sys- tem (K(n),p(n)) consisting of Kan com- plexes K(n) and s.s. mappings p(n): K(n)--> K(n -I) defined by K(O)== K(O, 0) and K(n) == K(nn' n) x kd1K(n -1) for n  1 with natural 266 projections p(n), where nn = nn(X) and k n + l E Hn+l (K(n -1); nn)' This system is determined uniquely up to S.s. homotopy equivalence by its limit, called the Postnikov complex and denoted by K(nl' k 3 , n 2 ,..., kn+" nn,"')' As yet we are ignorant of an effective method of computing the cohomology of a Postnikov complex from nn and k"+I. References [IJ H. Cartan, Sur la theorie de Kan, Sem. H. Cartan, Ecole Norm. Sup., 1956-1957. [2J S. Eilenberg and N. E. Steenrod, Founda- tions of algebraic topology, Princeton Univ. Press, 1952. [3J P. 1. Hilton and S. Wylie, Homology theory, Cambridge Univ. Press, 1960. [4] E. H. Spanier, Algebraic topology, McGraw-Hill, 1966. [5J R. E. Mosher and M. C. Tangora, Coho- mology operations and applications in homo- topy theory, Harper & Row, 1968. [6] 1. R. Munkres, Elementary differential topology, Princeton Univ. Press, second edi- tion, 1966. [7J E. E. Moise, Geometric topology in dimen- sions 2 and 3, Springer, 1977. [8J 1. Milnor, Two complexes that are homeo- morphic but combinatorially distinct, Ann. Math., (2) 74 (1961),575-590. [9] R. C. Kirby and L. C. Siebenmann, Foun- dational essays on topological manifolds, smoothings and triangulations, Princeton Univ. Press, 1977. [lOJ J. P. May, Simplicial objects in algebraic topology, Van Nostrand, 1967. [IIJ E. Curtis, Simplicial homotopy theory, Lecture notes, Aarhus Univ., 1967. [12J S, Eilenberg and S. MacLane, On the groups H(n, n) I, II, III, Ann. Math., (2) 58 (1953),55- 106; 60 (1954),49-139; 60 (1954), 513-557. [13J J.-P. Serre, Cohomologie modulo 2 des complexes d'Eilenberg-MacLane, Comment. Math, Helv., 27 (1953), 198-232. [14J Sem. H. Cartan, Ecole Norm. Sup., 1954- 1955 [15J A. Dold and R. Thom, Quasifaserungen und unendliche symmetrische Produkte, Ann. Math., (2) 67 (1958), 239-281. [16J A. Dold, Dber die Steenrodschen Ko- homologieoperationen, Ann. Math., (2) 73 (1961),258-294. [17] T. Nakamura, On cohomology opera- tions, Japan. 1. Math., 33 (1963), 93-145. [18J M. Nakaoka, Decomposition theorem for homology groups of symmetric groups, Ann. Math., (2) 71 (1960), 16-42. 
267 [19J G. W. Whitehead, Elements of homotopy theory, Springer, 1978. 71 (XVI.g) Complexity of Computations A. Measures for Complexity of Computation Intuitively, complexity of computation means the amount of computing efforts measured on some suitable scale. When a problem can be solved by means of anyone of several algo- rithms, it is highly desirable to compare the complexities of those algorithms. For example, the power x 10 can be evaluated with a pocket calculator by using either of the algorithms xlO=xx X X X xx x xx x x x x x x xx x, (A) and u = x x x, v = u x u, w = v x v, xlO=u X w. The complexities of these algorithms can be measured by various quantities: (Tl) time in seconds, (T2) the number of times keys and buttons are touched, (T3) the number of basic operations (here, multiplications), and (S) the number of values to be stored in the calcu- lator. The quantities (Tl), (T2), and (T3) are calIed time complexities, and (S) is called the space complexity. Obviously, algorithm (B) is preferable with respect to time complexity, although (A) is better with respect to space complexity. (If the calculator has only one memory register, then only (A) can be executed without recording numbers by hand.) For many calculators, the foIlowing algorithm is best with respect to both time and space complexities: X 10 =((X 2 )2 X X)2. In the general theory of complexity, the num- ber of basic operations (T3) is often taken as the basis of time complexity, since it represents the intrinsic complexity of the algorithm, rather than of extrinsic factors such as human skiII or mechanical performance. For the evaluation of a power x n in general, the complexity depends on the value of n: such a parameter dominating the complexity is caIled the size of the problem, Let Tx(n) be the number of multiplications required for evaluating x n by an algorithm X, Let :IF be the set of all algorithms for evaluating x n . The time complexity T(n) for evaluation of the power is then defined by T(n)= MinxES" Tx(n), 71B Complexity of Computations This is satisfied when [6J logz n.s; T(n).s; 210gz n and Jim T(n) = 1. noo log2 n In a problem concerning a tgraph, the num- ber m of nodes in the graph, the number n of its edges, or their sum m + n are chosen as the size of the graph. However, the complex- ity is not determined by such a size, because there are many different instances (particular graphs) of the same size, So the complexity of this problem is defined in the foIlowing man- ner. Let T(I) be the complexity of solving an instance I of the problem and L(n) be the set of alI possible instances of size n, Then W(n) = Max1EL(n) T(I) A(n)= LP(I)T(I), (B) where p(I) denotes the relative frequency of the instance I, The value W(n) is called the worst-case complexity, and A(n) is called the average complexity of the problem, In some cases the complexity of a circuit is measured by the number of its building blocks [IJ, and the complexity of a program is mea- sured by its length [2, 3]. B. Complexity of a Decision Problem (C) A decision problem and its complexity are rigorously formulated in terms of tTuring machines, Let L be a finite set of symbols and E+ be the whole set of nonempty strings of symbols in L. Every instance of the problem is assumed to be represented by a string in E+. A decision problem is a triple (E, L, P) of the set E of symbols, a subset L of E+, and a mapping P from L to {O (false), 1 (true)}. The set L repre- sents the set of all possible instances of the problem. The problem is said to be solvable if and only if there exists a Turing machine M satisfying the foIlowing condition for every string (X in the set L: When a tape containing the string (X is given to the machine M whose head is initially put on the leftmost symbol of (x, the machine writes the value of P((X) on the tape and stops after a finite number of steps. Such a Turing machine is said to compute the function P. The number TM((X) of steps is the time complexity of solving the instance (X by the machine M, and the length SM((X) of the used area of the tape is the space complexity, The worst-case complexity TM(n) is defined by TM(n) = Max'EL,I'I=n TM«(X), 
71 C Complexity of Computations The space complexity SM(n) for the size n is defined in a similar way. The complexities T(n) and S(n) of the problem are not defined, because improvement by a constant factor is always possible for any Turing machine [5]. A problem is said to be solved in linear or polynomial time if its time complexity TM(n) is bounded by a linear or polynomial function of the size 11. C. Objectives of Complexity Theory The objectives of complexity theory are as follows: (i) Analyze an algorithm X for a problem P and evaluate its complexity. When exact evalu- ation is hard, the order of magnitude O(Tx(n» is investigated. (ii) Construct better algorithms for the prob- lem P. This gives a better upper bound to the complexity T(n) of the problem. (iii) Make clear the limitation of improving algorithms for the problem P. This is done by establishing a lower bound to the complexity T(n) of the problem. A common approach to constructing better algorithms is to partition the problem into smaller parts, apply algorithms to the parts, and then combine the solutions for the parts into a solution for the whole (divide and con- quer, [5J). This approach often yields an effi- cient algorithm for the problem, especially when the partitioning can be repeated recur- sively. For demonstrating the limitations of algorithms of a certain class, diagonalization [5J, determination of information-theoretic lower bounds [6J, or the oracle method [6J are often utilized. D. Elementary Results on Time Complexity (1) Number of arithmetic operations. A poly- nomial of degree n with one variable can be evaluated in about 3n/2 arithmetic operations if preconditioning on coefficients is allowed. Otherwise, 2n - 1 operations are necessary and sufficient for evaluating a polynomial of degree n. For computing the prod uct of two sq uare matrices of degree n, 0(n 2 ) operations are necessary and 0(n 2 . 5 ) operations are sufficient. The inverse matrix and the value of the deter- minant of a square matrix of degree n are computed with the same order of operations as the product. Discrete Fourier transforma- tion ofn points is executed in O(nlogn) arith- metic operations (fast Fourier transformation). (2) Number of comparisons and data trans- fers. Rearrangement of n items in increasing order is realized in O(l1logn) comparisons and 268 data transfers. Selection of the kth largest item among n items is done in O(n) operations. The position of an item in a list containing n items is found in O(n) comparisons by linear search, in O(logl1) comparisons by binary chopping, and in 0(1) comparisons on average by the thashing method. Many other results can be found in [5J-[7]. E. NP-Completeness A problem is easily solved if its complexity is O(n) or less. The problem is very hard if its complexity is 0(2 n ), unless the size n is small. Between these types of problems there is a class of problems, each of which can be solved in polynomial time by a Turing machine, A tnondeterministic Turing machine is said to solve a decision problem (E, L, P) if it can detect the case when P((X) = 1 by exercising good choices; more precisely, if it can stop after a finite number of steps and write the value I of P((X) on the tape, starting from an initial state with the head on the leftmost sym- bol of a string (X on the tape, provided that P((X) = 1. If P((X)=O, then the machine may never stop. For a string (X such that P«(X) = 1, NTM«(X) and NSM((X) represent the minimum number of steps and the minimum length of tape used in computing P((X). Nondeterminis- tic complexities NTM(n) and NSM(n) for the size n are defined in a similar way as before. A problem is said to be solvable in NP-time or NP-space if it can be solved by a nondeter- ministic Turing machine M whose time com- plexity NTM(n) or space complexity NSM(n) is bounded by a polynomial function of size 11. The class of all problems solvable in NP- time is denoted by NP, while that of the prob- lems solvable in polynomial time by ordinary (deterministic) Turing machines is denoted by P. Obviously, the class NP contains the class P. Nevertheless, whether NP = P or not re- mains one of the biggest unsolved problems. A decision problem (E, L, P) is polynomially transformable to a problem (2, L', P') if there exists a mapping h from E+ to 2+ satis- fying the foIl owing conditions: (1) h(L)S L'; (2) for every string (X in L, P((X) = P'(h((X»; (3) the mapping h is computed by a Turing machine in polynomial time. If a problem C is poly- nomially transformable to another problem C' in P or NP, then the problem C is also in P or NP. A problem C is said to be NP-hard if every problem in NP is polynomially transfor- mable to C. It is called NP-complete if it is in NP and NP-hard. Many problems that have been known empirically to be very hard have recently been shown to be NP-complete [8]. For instance, integer linear programming is 
269 NP-complete. Linear programming has re- cently been shown to be in p, The satisfiability problem of a conjunctive normal form was proven to be NP-complete in 1971 [9]. The complement of a decision problem (1:', L, P) is the problem (1:', L, P'), where P'(ex) = 1- P(ex). If a problem C is in P, then its com- plement C' is also in P. When C is in NP, it is not guaranteed that C' is in NP. The class of all problems whose complements are in NP is denoted by co-NP. The intersection of NP and co-NP contains the class P, but it is an open problem whether this containment is proper or not. The problem PRIME, which asks whether a number N is prime or not, the size n being the number of digits for represent- ing the number N, belongs to both NP and co- NP [10]. However, it is not known whether it is in P. F. Other Topics The notion of completeness is introduced in many other classes of problems solvable in linear space, polynomial space, exponential time, etc. Comparisons among these classes have been done, although many problems still remain open. The fundamental conjecture in this field is the nonequality NP # P, the prov- ing of which seems to be extremely difficult. This conjecture can be relativized in ways such that it can be either true or false, so that it is not provable in some formal system. Some tricks may be used for hard problems. For decision problems, some algorithms have been proposed for guessing the correct answer with high probability [11]. Some algorithms give nearly optimal solutions efficiently, in- stead of constructing optimal ones over too long a time [7]. A general theory of computational com- plexity would include investigations on rela- tions among various complexity measures, the complexity hierarchy, and an axiomatic approach independent of any machine models [12]. References [1] 1. E. Savage, The complexity of computing, Wiley, 1971. [2] A. N. Kolmogorov, Three approaches to the quantitative definition of information, Problems of Information Transmission, 1 (1965),1-7. [3] P. Martin-Lof, The definition of random sequences, Information and Control, 9 (1966), 602-619. [4] M. J. Fischer and M. O. Rabin, Super- exponential complexity of Presburger arith- 72A Complex Manifolds metic, Complexity of Computation, R. M. Karp (ed.), Proc. SIAM-AMS Symposium in Appl. Math., 1974. [5J A. V. Aho, J. E. Hopcroft, and 1. D. Ull- man, The design and analysis of computer algorithms, Addison-Wesley, 1974. [6J D. E. Knuth, The art of computer pro- gramming, Addison-Wesley, 1, 1968; 2,1969; 3, 1973. [7] E. Horowitz and S. Sahni, Fundamentals of computer algorithms, Computer Science Press, 1978. [8] M. R. Garey and D. S. Johnson, Com- puters and intractability, a guide to the theory of NP-completeness, Freeman, 1979. [9] S. A. Cook, The complexity of theorem- proving procedures, Proc. 3rd Ann. ACM Symposium on Theory of Computing, 1971. [lOJ G. L. Miller, Riemann's hypothesis and test for primarity, Proc. 7th Ann. ACM Sym- posium on Theory of Computing, 1975. [l1J 1. F. Traub (ed.), Algorithms and com- plexity: New directions and recent results, Academic Press, 1976. [12] M. Blum, A machine-independent theory of the complexity of recursive functions, 1. Assoc. Com put. Mach., 14 (1967),322-336. 72 (VII.g) Complex Manifolds A. Definitions A Hausdorff topological space X is called a complex manifold (or complex analytic mani- fold) of complex dimension n if there are given an open covering {Vi};EI and a family {<Pi}iEI of homeomorphisms of Vi onto open sets in the n-dimensional complex affine space cn such that in case vin V j # 0, the mapping <Pi 0 <p;I: <Pj(V i n VJ--><Pi(V i n V j ) is biholomorphic (i.e., <PiO<P;' and its inverse are both tholomorphic functions when expressed in terms of coordi- nate functions in C"). We call X the underlying topological space of this complex manifold, and we say that an open covering { V;} iEI and a family {<p;} iEI define a complex analytic structure (or simply complex structure) on X, A complex-valued function f defined on an open set V in X is called a holomorphic func- tion on V if for any i the function f 0 <pj-l on <pj(V n V;) is holomorphic, When we express the mapping <Pi as <Pi(P) = (Zl (p), ..., zn(p)) on Vi in terms of the coordinates in cn, each z' is a holomorphic function on Vi' We caIl (z I, .. . , zn) a holomorphic local coordinate sys- tem on Vi' Given two complex manifolds Y, X, a mapping <p: Y -->X is said to be holo- 
72B Complex Manifolds morphic if for any open set V in X and any holomorphic function f on V, f 0 <P is holo- morphic on <p-I(V)c Y. When a mapping <p: Y-->X is bijective and both <p and <p-I are holomorphic, we say Y and X are isomorphic by <p as complex manifolds. As in the case of tdifferentiable manifolds of class Coo, we can define concepts such as com- plex analytic submanifolds, holomorphic tan- gent vectors, holomorphic vector fields, and holomorphic differential forms of degree k (or simply holomorphic k-forms). Meromorphic functions on complex manifolds can also be defined as in the theory of analytic functions of several complex variables ( 23 Analytic Spaces D). Let X be a complex manifold and p a point of X. Take a holomorphic local coordinate system (z" ...,zn) with center p (i.e., z'(p)=O for all ex). A holomorphic function defined on a neighborhood of p can be expressed as a holomorphic function in (Zl, "', zn), hence as a power series in (Zl, ..., zn) absolutely conver- gent in a neighborhood of p. If we denote by (rJ = (rJx the tsheaf of germs of holomorphic functions on X, the tstalk (rJp of (rJ at p is iso- morphic to the tlocal ring of convergent power series in n variables Zl, "', zn. At a point p, (a/azl)p, ...,(a/azn)p form a basis of the holo- morphic tangent vector space at p. A holomor- phic k-form w defined on a neighborhood of p can be expressed as w= Li 1 <...<i,/;"..i,dz i1 A ... A dz i " where /;1'" i, is a holomorphic func- tion for each (i I, ... , id. B. Almost Complex Structures Let X be a complex manifold, and let {Vi, <p;} iEI be its complex analytic structure, i.e., a covering of X by holomorphic local coordi- nate systems with <Pi = (zl , ..., zn. Express zr in the form z = x + I=t Y  where x and Y  l l V-I l' l l are the real and imaginary parts of zf, respec- tively. Then xr and yf real-valued functions on the open set Vi of X, and the mapping i/1i: V ->R 2 n defined by i/1i(P) = (x;'(p),yi(p), ..., x;'(p), y;'(p) is a homeomorphism of Vi onto an open set of R 2 n. This {Vi, i/1;}iEI defines on X a tdifferentiable structure of class Coo (in fact, a treal analytic structure). Thus a complex manifold of complex dimension n admits canonically a COO-structure of real dimension 2n. For every point p of X there is a real co- ordinate system on a neighborhood of p, such as (Xl,y" ...,xn,yn), where (Zl, ...,zn) (z'=x' +P y', ex= 1,..., n) forms a holomorphic coordinate system in X, The real tangent vector space at a point p of X has {(a/axl)p, (a/ay')p, ..., (a/axn)p, (a/ayn)p} as its basis. Define a linear endomorphism J p by (a/ax')p 270 -->(a/ay')p, (a/ay')p--> -(a/ax')p (ex = 1, ..., n); then f}; = -1, and J p does not depend on the choice of holomorphic coordinate system at p. Considering J p as a tensor of type (1, 1), we thus obtain a tensor field J of type (1, 1) of class Coo on X, which is called the tensor field of almost complex structure induced by the complex structure of X. More generally, when a real differentiable manifold X is provided with a tensor field J of type (1, 1) of class Coo such that J2 = -1 (con- sidering J as a linear transformation of vector fields), we say that X admits an almost com- plex structure or that X is an almost complex manifold. In this case, for contravariant vector fields x and y on X, we define a tensor field S of type (1,2) by S(x,y)= -[x,yJ+[J(x),J(y)] -J([J(x),yJ)-J([x,J(y)J). S is called a Nijen- huis tensor. An almost complex structure J is induced by a complex analytic structure if and only if its Nijenhuis tensor S vanishes identically [49]. A differentiable manifold X of dimension 2n admits an almost complex structure if and only if the structure group GL(2n, R) of the bundle of ttangent 2n-frames of X can be treduced to GL(n, C). Almost complex manifolds are torientable. C. Types of Differential Forms Let X be a complex manifold, and let (z" ... , zn) be a holomorphic local coordinate system in a neighborhood of a point p with z' =x'+P y' (ex = 1,... ,n). On the complexi- fied real tangent vector space Tp(X) @ C at p, we define a/az', a/az' by (a/az')p = (1/2){(a/ax')p -p (a/ay')p}, (a/az')p= (1/2) {(a/ax')p + P (a/ay')p}' It is easy to see that the operation a/az' on holomorphic functions coincides with that of the holomorphic tangent vector a/az' defined in Section A. A function f is holomorphic at p if and only if(a/az')f = 0 (ex = 1,2, "', n). Tp(X) 0 C is the direct sum of the subspace spanned by {a/az l , ..., a/az n } and the subs pace spanned by {a/az l , ..., a/az n }. Moreover, this decomposition is independent of the choice of holomorphic coordinate system. Elements of the two subspaces are respectively called tan- gent vectors of type (1,0) and tangent vectors of type (0, 1). Similarly, the complexified space of real differentiable I-forms can be decomposed into the direct sum of two subspaces spanned by {dzl,...,dz n } and {dzl,...,dz n }, wheredz' =dx'+P dy' and dZ'=dx'-P dy'. We say that the elements of the former sub- space are of type (1,0) and those of the latter are of type (0, 1). Thus the space of differential forms of arbitrary degree can be written as 
271 the direct sum of subspaces of type (r, s), Here the subspace of differential forms of type (r, s) has as a basis {dZ'1 A '" A dz" A crzP1 A ... A dZ p , (l.s;ex l < ...<ex,.s;n, 1.s;/31 < ...</3s.s;n)}. This decomposition is independent of the choice of a local coordinate system, hence the concept of type can be defined globalIy on X. D. -Cohomology In the rest of this article we consider only complex differential forms. For every differen- tial form w of type (r, s) on X, its texterior derivative dw decomposes into a sum of dif- ferential forms of types (r+ 1, s) and (r, s + 1), which we denote by aw (or d'w) and Ow (or d"w), respectively, We have d=a+o, (d')2==0, (0)2 = 0, and ao + oa = O. In terms of a local coordinate system, we write af a w=" -dz Y A d Z'1 A ';'az Y ... Adz" Aaz P1 A.,. Aaz P " a - _ ( _ 1) '" af d 'I w- L.. a y z A... Y Z A dz" A dZ Y A dZ P 1 A ." A dz P " forw=fdz'l A... Adz"Adz P1 A... Adz p ,. A differential k-form w is holomorphic if and only if w is of type (k, 0) and Ow = O. For the operator 0 Dolbeault's lemma holds: Let w be a differential form on a neighborhood U of a point p. If ow = 0, there is a neigh bor- hood V of p contained in U and a differen- tial form a on V such that w = oa on V. Let A(r.s) and QP be the tsheaf of germs of differential forms of type (r, s) and the sheaf of germs ofholomorphic p-forms on X, respec- tively, and let r(X, A(r.,)) be the set of tsections of A("S) on X. r(X, A(r.s)) is the set of differen- tial forms of type (r,s) on X, and Lir(X,A(P.i») forms a tcochain complex with respect to 8. This complex is called the o-complex or the Dolbeault complex, and its tcohomology groups are called the o-cohomology groups or the Dolbeault cohomology groups. The qth cohomology group is denoted by Hp,q(A, 0). It follows easily from Dolbeault's lemma that 0-> QP->A(P.O)A(P.1)-->... is an texact sequence of sheaves. From this we get Dolbeault's theorem: Hq(X, QP) Hp,q(A, 0), where the left-hand side is the cohomology group with coefficient sheaf QP. More generally, for any tcomplex analytic (holomorphic) vector bundle E on X, we can define the 0 cohomology groups of the dif- ferential forms on X with values in E, and they can be shown to be isomorphic to the coho- 72E Complex Manifolds mology groups with coefficient sheaves of germs of holomorphic forms with values in E ( 194 Harmonic Integrals E). E. Analytic Coherent Sheaves The structure of a complex manifold X is determined by the tsheaf (rJ of germs of holo- morphic functions on X, and (rJ is a tcoherent sheaf (of rings) (Oka's theorem), Sheaves of (rJ- modules are calIed analytic sheaves, and coher- ent sheaves of (rJ-modules are called coherent analytic sheaves. Many properties of X can be expressed in terms of coherent analytic sheaves and their cohomology groups; some exam- ples appear later in this article (also  366 Riemann-Roch Theorems B). It is important to know whether an analytic sheaf on a complex manifold is coherent. To this question, not only does Ok a's theorem apply but so does Cartan's theorem: The tsheaf of ideals defined by an analytic subset of a complex manifold is coherent. (We say that a subset Y of X is an analytic subset if it is a closed subset and each point of Y has a neigh- borhood U such that un Y is the set of com- mon zeros of a finite number ofholomorphic functions on U.) Also relevant is Grauert's theorem: If n: X --> Y is a tproper holomorphic mapping of complex manifolds (i.e., the inverse image of any compact subset of Y for a holo- morphic mapping n is also compact), then for any coherent analytic sheaf F on X its tdirect images Rqn*(F)(q =0,1,2,...) are also coher- ent [16]. (In fact, this theorem holds for ana- lytic spaces;  23 Analytic Spaces.) For an analytic coherent sheaf F on a tStein manifold X, we have the following fundamen- tal theorems of the Stein manifold. Theorem A: HO(X, F) generates the stalk Fx (as an (rJx- module) at every point x of X, Theorem B: Hq(X, F)=O for all q>O. Conversely, a Stein manifold X is characterized by the following property: For any coherent analytic sheaf F of ideals of (rJ, HI(X, F) =0. If a complex manifold X is compact and F is a coherent analytic sheaf on X, then Hq(X, F) is a complex vector space of finite dimension. If X is an open sub- manifold of another complex manifold and its closure is compact, then Hq(X, F) is finite- dimensional for some q that depends on vari- ous properties (convexity or concavity) of the boundary of X [3]. Let E be a tcomplex analytic (ho1omor- phic) vector bundle on a complex manifold X of dimension n, and let E* be the tdual vector bundle of E. Then Hq(X, (rJP(E)) and H-q(X, (rJn-p(E*) (where H* denotes the cohomology group with compact support) are dual as topological vector spaces under suit- 
72F Complex Manifolds able conditions. The duality is given by the integration on X of the exterior products of the differential forms representing the re- spective elements of the cohomology groups, The duality holds, for example, when dim Hq(X, (DP(E)) < 00. If X is compact, we need not distinguish H* from H (Serre's duality theorem) [50]. F. Compact Complex Manifolds On a compact and connected complex mani- fold X, there are no holomorphic functions except constants (by the tmaximum principle of holomorphic functions). The field K (X) of meromorphic functions on X is finitely gen- erated over the complex number field, and its ttranscendence degree d does not exceed the complex dimension n of X. d is said to be the algebraic dimension of X and is denoted by a(X), For elements of K(X), functional inde- pendence and algebraic independence are equivalent [52]. When n== 1, X is a compact tRiemann surface and the classical theory of algebraic functions shows that K(X) is an talgebraic function field of one variable and X is a tprojective algebraic variety. When n == 2, a(X)=2, 1, and 0 can all occur. If a(X)=2, X is a projective talgebraic surface (Chow- Kodaira theorem). If a(X) = 1, there exist an talgebraic curve 11 and a surjective holo- morphic mapping <p: X ->l1 such that K(l1) is isomorphic to K(X) under <p*, and <p-I(X) is an telliptic curve for all but a finite number of XEl1. K. Kodaira investigated the structure of compact complex surfaces in detail [34, III]. On a compact complex manifold X, the free Abelian group generated by the set of irre- ducible analytic subsets of codimension 1 is called the divisor group of X, and an element of it is called a divisor of X. For an analytic sub- set Y of codimension I, the sheaf of ideals 3( Y) defined by Y is a sheaf of locally principal ideals of (D. For a divisor D == L a, y., the sheaf of locally principal fractional ideals 3(D) = II, 3( y')a, is called the sheaf of ideals of D. The set of nonzero coherent sheaves of locally principal fractional ideals corresponds bijec- tively to the set of divisors. An element r # 0 of K (X) generates a sheaf of principal fractional ideals and therefore defines a divisor, which is denoted by (f). The divisor group has an ordering defined by D = L a, y.>-O if and only if all a,;;" 0, under which it becomes an tordered group. For a divisor D, let L(D)= {lEK(X)lf#O, (/)+D>-O} U {O}. Then L(D) is a C-module of finite dimension, This submodule of K(X) is easy to handle and exhibits various analytic properties of D. The 272 tRiemann-Roch theorem is used to calculate the dimension of L(D) in terms of other factors ( 366 Riemann- Roch Theorems C). Here is an example of how L(D) exhibits a property of D: We call two divisors D and D' linearly equivalent ifthere is a 0# FEK(X) such that (F)=D-D'. This is an equivalence relation finer than thomological equivalence. If D and D' are linearly equivalent, then L(D) and L(D') are isomorphic by the mapping L(D)3f--> fF E L(D'); therefore dim L(D) = dim L(D'). (The latter equation does not follow from the homological equivalence.) A holomorphic vector bundle with fiber C and structure group C* is called a complex line bundle. For the sheaf of ideals 3(D) of a divisor D, we can take a suitable open covering {V j } of X such that for each V j there is an RjEr(,3(D)) which generates 3(D)x for any XE Vi' Also gjk = RjRk is a holomorphic function nowhere vanishing on n Vk' With {gjd as the system oftcoordi- nate transformations, we define the complex line bundle determined by D and denote it by [D]. It is easy to see that [DJ is independent of the choice of {} or {RJ. Moreover, [DJ is determined only by the linear equivalence class of D. If we denote by (D( [DJ) the sheaf of germs of holomorphic sections of [DJ, the mapping HO(X, (D([DJ))3<p= {<pJ--> f = <pj Rj = <Pk/ Rk E L(D) is an isomorphism of these modules. On an algebraic variety in a projec- tive space, any complex line bundle comes from a divisor (i.e., it can be expressed in the form [DJ for some divisor D) [35, 51J, but this is not necessarily true on general compact complex manifolds. However, the importance of complex line bundles in the theory of com- plex manifolds lies in the relation L(D)  HO(X, (D([DJ)), which replaces "things with poles" with "things holomorphic." Any analytic subvariety X of the projec- tive space pn is an algebraic variety (Chow's theorem) [51]. Suppose that X is an analytic submanifold of pN, and let Y be a general thyperplane section of X. Then Y is a divisor on X, and the tChern class of the complex line bundle [YJ corresponds to the canonical Hodge metric on X ( 232 Kiihler Manifolds D). When [y] is represented by the system of coordinate transformations {gjd with respect to an open covering {V;}, we can associate with a coherent analytic sheaf F sheaves F(n) (n = 0, :t 1, ::!: 2, ... ) as follows. Denoting by Fj the restriction of F to V j , we glue Fj and Fk together on  n V k with the relation f j  fk <=> f j =g(:Jk (wherefjEFjc FJkEFkcF) and obtain a sheaf (denoted by F(n)) that is locally iso- morphic to F. The following theorems for F(n) hold, For each coherent analytic sheaf F there exists an integer no such that for any n;;" no the following fundamental theorems A, B of projec- 
273 tive algebraic varieties hold [17]. Theorem A: qx, F(n)) generates Fx (as an (!ix-module) for every XEX. Theorem B: Hq(X, F(n))=O for all q>O. This means that if we permit "poles" on Y of sufficiently high order, then F has suffi- ciently many sections and the higher coho- mology groups vanish. On a (nonsingular) algebraic variety X in pN we have the sheaf (!i of germs of holomor- phic functions (the tstructure sheaf as a com- plex manifold) and the sheaf (!ialg of germs of holomorphic rational functions (the structure sheaf as an algebraic variety). Therefore we have two kinds of coherent sheaves, coher- ent analytic sheaves and coherent algebraic sheaves, In fact, the cohomology theories de- rived from them are isomorphic. More pre- cisely, for any coherent algebraic sheaf F, F = F @l!J"g (!i is a coherent analytic sheaf. The correspondence F -->F gives an equivalence between the tcategory of coherent algebraic sheaves and that of coherent analytic sheaves, thus giving an isomorphism of their cohomol- ogy groups. In other words, as far as the prop- erties that can be expressed by cohomology- theoretic terms of coherent sheaves are con- cerned, there is no difference between the analytic and algebraic theories of projective algebraic varieties [51]. G. Deformations of Complex Structures The deformation theory of complex structures was initiated by Kodaira and Spencer [36] in order to explain various phenomena in the theory of (compact) complex manifolds. A triple (X, n, S) is called a family of compact complex manifolds if X and S are connected tanalytic spaces and n is a tproper holomor- phic mapping of X onto S such that (i) it is smooth, i.e., is locally identified with the pro- jection S' xU -->S', where S' and U are open in S and en, respectively, and (ii) every fiber Ji.= n-I(s) of n is connected, Ji. is then a compact complex manifold. We sometimes write {Ji.LES instead of (X, n, S), S is called the parameter space of the family. Take a point OES and fix it. We say that Ji. is a deformation of v" for any SES, There are a neighborhood S' of 0 in Sand a diffeomorphism n-I(S')->S' x V o (Kuranishi [39J). Thus every Ji., SES', determines a tcom- plex structure on a fixed differentiable mani- fold. We say that the complex structure of Ji., S E S', is a deformation of that of V o . Now (X, n, S) = {Ji.}.ES is said to be complete at 0 if for any family (Y, J1, T) = {W;}'ET with a point 0' E T and a holomorphic isomorphism I: w;,.--> v", there is a neighborhood T' of 0' in T and holomorpl;1ic mappings f: T' -->S and h; J1- I (T')-->X such that (i) fJ1 = nh, (ii) f(o') = 0, 72G Complex Manifolds and (iii) h = 1 on w;". In this case, if T' is suffi- ciently small, then h induces a holomorphic isomorphism h,: W;--> Vf(I)' for any tE T'. Hence {Ji.}.ES contains all small deformations of Yo. On the other hand, {Ji.}.eS is said to be effec- tively parametrized at 0 if the Kodaira-Spencer mapping (Kodaira-Spencer map) Po: T"S--> HI (v", e) is injective, where T"S is the Zar- iski tangent space to S at 0 and e is the sheaf of germs of holomorphic vector fields on Yo, Here, the linear mapping Po is defined by Po (%s) = {(ogjdos)(z"O)}' where %sEToS and gjk are the coordinate transformations (Zj, s) = (gjk(Zk> s), s), using the smoothness of n. Po(%s) is called the infinitesimal deformation to the direction %s. Kuranishi's fundamental theorem [39,40J states; For any compact complex manifold V, there exists a family {Ji.}SES with a point OES such that (i) it is complete at every point of S, (ii) is effectively parametrized at 0, and (iii) v" = V. The parameter space S in the theorem is called the Kuranishi space or the local moduli space of V. It is given as the zeros of a holo- morphic mappingf: U --> H 2 (V, e) with f(O) =0, where U is a neighborhood of 0 in HI (V, e), Hence (1) if H2( V, e) = 0, then S = U is nonsin- gular (Kodaira, Nirenberg, and Spencer [37]; (2) if HI(V,e)=o, then S={O}, i.e., one point (e.g., V==pn(C), the complex projective space). Kuranishi's theorem was generalized to a compact analytic space V by Grauert [18J and A. Douady [8]. There are Kuranishi-type theorems in the deformation theory of other objects: (1) com- pact analytic subvarieties of an analytic space (Douady [9J;  23 Analytic Spaces G), (2) holomorphic mappings (Y. Miyajima [41]), (3) germs of analytic spaces with tisolated singularities [7J, etc. So far, only the local theory of deformations has been developed. The global theory is not yet in a satisfactory state. Its final purpose is to construct moduli spaces and to understand them, For a compact differentiable manifold V, we denote by M(V) the set of all isomorphism classes of complex structures on V. It is diffi- cult in general to determine the set M(V). As yet unsolved problems are: (1) Is M(S6) non- empty? (2) Does M(pn(c)) (n  3) consist of one point? (M(p2(C) is known to consist of one point (S. T. Yau;  232 Kahler Manifolds C). If M(V) has a reasonable structure (e.g" an analytic space structure) and a universal prop- erty, then we call it the moduli space ( 11 Algebraic Functions F, 16 Algebraic Varieties W), Kodaira [34, IIIJ constructed the moduli space of Hopf surfaces. Only a few examples of moduli spaces are known. It is to be noted that moduli spaces cannot in general exist. 
72H Complex Manifolds One of the reasons for this is that there may exist jumpings of structures (Kodaira and Spencer [36J). As for talgebraic manifolds (varieties), there are two known methods for the moduli prob- lem: (1) Griffiths' period mapping ( 16 Alge- braic Varieties V) and (2) Mumford's geomet- ric invariant theory ( 16 Algebraic Varieties V,W). H. Monoidal Transformations Let Y be an analytic subspace of a complex manifold X, which is defined by a nonzero coherent sheaf of ideals 3. Then X has an open covering such that for each member U of the covering, there are elements <PI' ... , <Pm E qu, 3) that generate the stalks 3x at all XE U. Let W' be the graph of the holomorphic map- ping Xf->(<PI (x): <P2(X):",: <Pm(x) from U - un Y to pm-I, and denote by W the closure of W' in U X pm-I. Then W is an analytic space, pos- sibly with singularities, which is independent of the choice of the generators {<pJ and is deter- mined uniquely by U and 3. Therefore all the W's can be glued together to form an analytic space X and to determine a holomorphic map- ping p: X -->X. We call p the monoidal transfor- mation (or blowing-up) of X with center 3 or with center Y. When Y is a point, p is also called a locally quadratic transformation or a (J'-process. When Y is an analytic submanifold, X also is a manifold, Y=p-I(y) is a nonsin- gular divisor on X, and p-I(y)pk-I for all YE Y, where k is the codimension of Y. More- over, the line bundle [YJ restricted to each fiber p-I(y) is isomorphic to the Hopfbundle L on pkl, i.e., the line bundle associated to the natural C* bundle C k --> pk-I. Conversely, suppose that we are given YX and a holo- morphic mapping Po: Y --> Y with the property as above so that, in particular, p1 (y)  pk-l and [YJ  L on p1 (y) for all YE Y. Then there exists a complex manifold containing Yas a submanifold such that X is obtained by the monoidal transformation of X with center Y as above (S. Nakano [47J). Let f: X --> Y be a proper modification of complex manifolds (or analytic spaces). Then there exists a proper holomorphic mapping h: Y' --> Y such that f- l a h: Y' ->X is holomorphic, where over any relatively compact subdomain of Y, h is obtained by a finite succession of monoidal transformations with nonsingular centers (Hironaka). The result is called Chow's lemma. I. Fiber Spaces A triple of compact complex manifolds V, W, and a surjective holomorphic mapping f: 274 V --> W is called a fiber space if general fibers f- I (w) are irreducible. In addition, if both V and Ware algebraic, it is called an algebraic fiber space. For a compact complex manifold V, letting Kv denote the canonical line bundle of V, define a subfield 5t of K(V) to be {wdw 2 , where W l ,W 2 EH O (V,Q) for some m>O}. The transcendental degree of 5t over C is denoted by K(V), which we call the Kodaira dimension ofV. If 5t=0, then define K(V)= -00, If K(V)== dim V then V is said to be of general type, and there exist a projective manifold X and a bimeromorphic holomorphic mapping J1: X --> V. If K(V) O, there exists a fiber space f: V*--> W such that (1) V* is bimeromorphically equivalent to V, (2) dim W = K(V), and (3) some general fibers f- I (w) satisfy K(Fl(w» =0. In general, for a fiber space f: V --> W, we have K(V).s;K(FI(w))+dim W, where rl(w) is a general fiber. Moreover, if it is algebraic, an inequality of the form K(V)K(FI(W)+K(W) is called the conjecture en, n being dim V. e has been verified by K. Veno, E. Viehweg, T Fujita, and Y. Kawamata [J 3,14,32,33,54- 57J in the following cases: (1) n.s; 3, (2) when general fibers are curves, (3) dim w = 1, K(W) = 1, (4) K(V) O, and W is of general types, etc. By using the results of case (4), Kawamata proves that an algebraic compact complex manifold V is birationally equivalent to an Abelian variety if and only if K(V)==O and the irregularity of V equals the dimension of V [32]. J. Analytic Surfaces In what follows, an analytic surface means a 2- dimensional compact complex manifold. For an analytic surface S, an exceptional curve on S and a (relatively) minimal model, etc., are defined with respect to bimeromorphic map- pings in analogy with the corresponding con- cepts for an algebraic surface ( 15 Algebraic Surfaces). Let e c S be an irreducible curve on S, Then there exists a holomorphic mapping <P from S onto another surface S' such that <p(C) is a point and such that <P induces the isomor- phism S - es' - <p(C) if and only if e 2 = -1 and e is a nonsingular rational curve (Grauert [17J). S has a minimal model if and only if S is not a ruled surface (Kodaira [34, IIIJ). The irregularity q = hO,I, the geometric genus Pg, i- genus Pi' etc., are also defined in the same way as in the case of algebraic surfaces. Note that, in general, h O ,1 #hl,o. The Riemann-Roch theorem and M, Noether's formula are valid also for an analytic surface (Atiyah and Singer;  366 Riemann- Roch Theorems C). 
275 K. Classification of Surfaces The classification of analytic surfaces by the aid of their numerical invariants was com- pleted by Kodaira and includes as a special case Enriques's classification of algebraic surfaces [34, III]. By an eIliptic surface we mean a surface from which there exists a sur- jective holomorphic mapping <p onto an alge- braic curve 11, such that for a.general point P on 11, <p -1 (p) is an irreducible nonsingular elliptic curve. If a(S) = 1 or K(S) = 1, S has a unique structure as an elliptic surface. The image by <p of the points on S at which <p is not of maximal rank is a finite subset {at, ... , a,} of 11. Let t i be a local coordinate on 11 around a; with ti(ai)=O. We call a singular fiber of <p the divisor on S defined by {tiO <p = O}, The structure and the construction of singular fibers of elliptic surfaces have been completely determined by Kodaira. By an elliptic surface of general type we mean a surface with Kodaira dimension 1. If K(S) = 2, S is projective algebraic and is called a surface of general type. If a( S) = 0, then there exists only a finite number of irreducible curves on S, By a Hopf surface we mean a surface whose universal covering is C 2 - (0, 0), If an analytic surface S is homeomorphic to Sl x S3, S is a Hopf surface. Let bv(S) be the vth Betti num- ber of S. If a(S)=O, bdS)= 1, b 2 (S)=0, and S contains a curve, then S is a Hopf surface (Kodaira). By a surface of class VII o we mean a minimal surface S with bl(S)= 1. M. Inoue [26] constructed three families of surfaces of class VII o with b 2 = 0 which contain no curves. These are sit) (M ESL(3, Z)) and S,q",t and Sr,q" (NESL(2,Z), p, q, rEZ, tEC)which have H x C as their universal covering sur- faces, H being the complex upper half-plane. These have a line bundle L such that HO(O,I @ (!)(L» =0. This property characterizes these Inoue surfaces among VIIo surfaces with b 2 = o which contain no curves. A couple of new surfaces of class VII o with b 2 > 0 were con- structed and studied by Inoue, Ma, Kato, T. Oda, I. Nakamura, and I. Enoki [10, 27-29, 46]. Some of these have close connections with cusp singularities of Hilbert modular sur- faces, torus embeddings, and global spherical shells. Enoki's surface is denoted by Sn.", (n>O,O< lexl < 1, tEC") and has the following properties: (1) class VII o and b 2 = n, (2) there exists a connected curve D with D 2 = 0, (3) Sn"" - D is an affine bundle over an elliptic curve. Properties (1) and (2) characterize Sn",t (Enoki). Let Ks denote the canonical line bundle of an analytic surface S. An analytic surface S is said to be a K3 surface, if Ks is trivial, i.e., if there exists a nonvanishing holomorphic 2- 72K Complex Manifolds form wand if q = O. K3 surfaces are simply connected and are deformations of a nonsin- gular quartic surface in p3 (Kodaira), We shall define the period mapping of K3 surfaces. Let L be a free Abelian group of rank 22 with the pairing < ) which is the direct sum of the two copies of - E8 and three copies of U = Ze 1 + Ze 2 with <e"e I )==<e 2 ,e 2 )==0, <e"e 2 )= <e 2 , e I ) = 1, where - E8 denotes the lattice with the pairing corresponding to the Dynkin diagram E8 with the opposite signs. H 2 (S, Z) with the intersection form is such a pair, con- sisting of Sand < ). A marked K3 surface is defined to be a pair (S, 1/1), where S is a K3 surface and 1/1: H 2 (S, Z)-->L is an isomorphism preserving < ). The class [wJ of w is a base of H 2 ,0 and satisfies <[w], [wJ) =0 and <[wi, [wJ) >0. Let P(Ld denote the 21-dimensional projective space associated with Le = L@zC. = {(a)EP(Ldl <a,a) =0, <a,a) >O} is an open set of the quadric c= {(a)E P(LeJl <a,a) = O}.  is the set of all Hodge structures on Le. The period of a marked K3 surface (S, 1/1) is defined to be (l/1e[wJ). Every point of  is the period of some marked Kiihler K3 surface. If the periods of two marked Kiihler K3 surfaces (S, 1/1) and (S', 1/1'), coincide then S is isomorphic to S'. An analytic surface with Pg=q=O and Kf> 2  0 is called an Enriques surface, which has a K3 surface as its universal covering. An Enriques surface is an algebraic elliptic surface. An analytic surface with q == 1 and Kf> 12  0 is said to be a hyperelIiptic surface, which has an Abelian surface as an unramified covering. It is an algebraic surface and is an elliptic bundle over an elliptic curve. The classification of minimal surfaces is given in Table 1. The fol- lowing relations hold among these invariants: Let b+ (b-) be the number of positive (nega- tive) eigenvalues (counted with multiplicities) of the intersection matrix on H 2 (S, R) and Ci be the ith Chern class of S. Then (1) b + - b - = i( d - 2c 2) (Hirzebruch signa- ture theorem); (2) if b i is even, q=hI,o=tb i and b+ = 2pg+ 1; (3) if b i is odd, q= hI,o + 1 =t(b i + 1) and b+ =2pq. Let ci and C2 denote the Chern numbers of an analytic surface S, i.e., ci == (K) and C2 is the Euler number of S. Then 3c 2 ;;o.ci [43J, and if equality holds, then Ks is ample (Y. Miyaoka) ( 232 Kiihler Manifolds C). If S is a surface of general type which is minimal, then d> 0, the bigenus P2;;o. 2 and the m-genus Pm is m(m-l)ci;2+ l-q+P g for m;;o.2. For each m;;o. 5, the mth canonical mapping of S is a bi- rational holomorphic mapping onto its image (Kodaira, E. Bombieri [5]). In general, ci ;;o.2p g -4, and if the canonical mapping, i.e., the 
72 Ref. Complex Manifolds rational mapping associated with Ks, is bira- tional, then CT;;" 3p g -7. For certain kinds of surfaces of general type, E. Horikawa suc- ceeded in determining completely the structure of surfaces obtained as deformations of a given surface. Among others, every minimal surface with Pg = 4, q = 0, and d = 5 is a deformation of a nonsingular quintic surface [22,23]. Table 1. Classification of Minimal Surfaces K Pg P12 q b l Structure 2 >0 algebraic surface of general type I elliptic surface of general type I I 2 4 complex torus elliptic surface I I 2 3 with a trivial canonical bundle 0 0 I I 2 hyperelliptic surface elliptic surface 0 I I I belonging to class VIla I I 0 0 K3 surface 0 I 0 0 Enriques surface 0 0 rational surface 0 0 > 1 2q ruled surface -00 of genus q I I surface of class VIla For a, b > 0, let Eab be the set of isomor- phism classes of all minimal surfaces with ci = a and l-q+ Pg =b. D. Gieseker [15] proved the existence of N and constructed the mapping h:Eab-->pN so that the following hold: (1) h is injective; (2) if f: X --> Y is a smooth holomor- phic mapping such that all h -1 (y) represent some classesEE ab , then the set-theoretic map- pingf: Y -->pN induced from h is a morphism of schemes; (3) h(E ab ) is a locally closed sub- variety of pN, Hence, h(E ab ) is a moduli variety of surfaces of general type with d = a and 1- q+pg=b. For each m with 1 .s; m.s; 9, there exists a minimal surface of general type with Pg = q = 0 and ci = m. If there exist a compact analytic surface S and a curve C on S such that S - C is biholo- morphic to a complex manifold M, then we say that S is a compactification of M with boundary C. Every compactification of C 2 is a rational surface (Kodaira, 1. Morrow [45]), Every compactification of C x (C - {O}) is also rational (T. Ueda [53]). However, all com pac- 276 tifications of (C - {0})2 are rational surfaces, certain kinds of Hopf surfaces, or pI-bundles over elliptic curves as constructed by Serre (Ueda). References [1] T. Akahori, Complex analytic construction of the Kuranishi family on a normal strongly pseudo-complex manifold. Pub!. Res. Inst. Math, Sci., 14 (1978),789-847, [2] I. Amemiya, K. Masuda, and K. Shiga, Lie algebras of differential operators, Osaka J. Math., 12 (1975),139-172. [3J A. Andreotti and H. Grauert, Theoremes de finitude pour la cohomologie des espaces complexes, Bul!. Soc. Math. France, 90 (1962), 193-259. [4J A. Blanchard, Sur les varietes analytiques complexes, Ann. Sci. Ecole Norm. Sup., 73 (1956),157-202. [5J E. Bombieri, Canonical models of surfaces of general type, Pub!. Math. Inst. HES, 42 (1973), 171-219. [6] M. Commichau, Deformation kompakter komplexer Mannigfaltigkeiten, Math. Ann., 213 (1975), 43-96. [7] I. F. Donin, Complete families of deforma- tions of germs of complex spaces (in Russian), Mat. USSR Sb" 18 (1972), 397-406, [8] A. Douady, Le probleme des modules pour les varietes analytiques complexes, Sem. Bourbaki, 1964/65, no. 277. [9] A. Douady, Le probleme des modules locaux pour les espaces C-analytiques com- pacts, Ann. Sci. Ecole Norm. Sup., ser. IV, 4 (1974),569-602. [IOJ I. Enoki, On surfaces of class Vll o with curves, Proc. Japan Acad., 56 (1980), 275-279, [11] O. Forster and K. Knorr, Konstruction verseller Familien kompakter komplexer Riiume, Lecture notes in math. 705, Springer, 1979. [12] A. Fujiki, On the blowing down of ana- lytic spaces, Pub!. Math. Res. Inst. Math. Sci., 10 (1975), 473-507. [13] T. Fujita, On Kiihler fiber spaces over curves, J. Math. Soc. Japan, 30 (1978), 779- 794. [14] T. Fujita, The sheaf ofrelative canonical forms of a Kiihler fiber space over a curve, Proc. Japan Acad., 54 (1978), 183-184. [15J D. Gieseker, Global moduli for surfaces of general type, Inventiones Math., 43 (1977), 233-282. [16] H. Grauert, Ein Theorem der analyti- schen Garbentheorie und die Modulriiume komplexer Struckturen, Pub!. Math. Inst. HES, 5 (1960). [17] H. Grauert, Ober Modifikationen und 
277 exzeptioneIle analytische Mengen, Math. Ann., 146 (1962),331-368. [18] H. Grauert, Der Satz von Kuranishi fUr kompakte komplexe Raume, Inventiones Math., 25 (1974), 107-142. [19J F. Hirzebruch, Hilbert modular surfaces, Enseignement Math., 19 (1973), 183-281. [20J F. Hirzebruch and D. Zagier, Classifica- tion of Hilbert modular surfaces, Complex Analysis and Algebraic Geometry in honor of Kodaira, Iwanami Shoten (1977),137-150. [21] E. Horikawa, On deformations of holo- morphic maps. I, II, J. Math. Soc. Japan, 25 (1973), 372-396; 26 (1974), 647-666. III, Math. Ann" 222 (1976), 275-282. [22J E. Horikawa, On deformations of quintic surfaces, Inventiones Math., 31 (1975),43-85. [23] E. Horikawa, Algebraic surfaces of gen- eral type with smaIl Cf. I, Ann, Math., (2) 104 (1976),357-387. II, III, IV, Inventiones Math., 37 (1976),121-155; 47 (1978), 209-248; 50 (1979), 103-128. [24] E. Horikawa, Surjectivity of the period map of K3 surfaces of degree 2, Math. Ann" 228 (1977),113-146. [25] S. Iitaka, On D-dimensions of algebraic varieties, J. Math. Soc. Japan, 23 (1971), 356- 373. [26] M. Inoue, On surfaces of class VII o , Inventiones Math., 24 (1974),269-310. [27] M. Inoue, New surfaces with no mero- morphic functions, Proc. Int. Congr. Math., Vancouver, 1974, vol. 1,423-426. [28J M. Inoue, New surfaces with no mero- morphic functions, Complex Analysis and Algebraic Geometry II, Iwanami Shoten and Cambridge Univ. Press, 1977,91-106. [29J M. Kato, Complex manifolds containing global spherical shells, Proc. Int. Symp. Alge- braic Geom. Kyoto, 1977,45-84. [30] S. Kawai, On compact complex analytic manifolds of complex dimension 3. I, II, J. Math. Soc. Jap., 17 (1965), 438-442; 21 (1969), 604-616. [31] Y. Kawamata, On deformations of com- pactifiable complex manifolds, Math. Ann., 235 (1978), 247-265. [32] Y. Kawamata, Characterization of abelian varieties, Compositio Math., 43 (1981), 253-276. [33] Y. Kawamata, Kodaira dimension of cer- tain algebraic fiber spaces, J. Fac. Sci. Univ. Tokyo, sec. lA, 30 (1983),1-24. [34] K. Kodaira, CoIlected works I-III, Iwanami, 1975. [35] K. Kodaira and D. C. Spencer, Groups of complex line bundles over compact Kahler varieties, Divisor class groups on algebraic varieties, Proc. Nat. Acad. Sci. US, 39 (1953), 868-877. [36J K. Kodaira and D. C. Spencer, On de- 72 Ref. Complex Manifolds formations of complex analytic structures: I, II, Ann. Math., (2) 67 (1958), 328-466; III, ibid., 71 (1960), 43- 7 6. [37] K. Kodaira, L. Nirenberg, and D. C. Spencer, On the existence of deformations of complex analytic structures, Ann. Math., (2) 68 (1958),450-459. [38] R. S. Kulkarni, On the principle of un i- formization, J. Differential Geom., 13 (1978), 109-138. [39J M. Kuranishi, On the locally complete families of complex analytic structures, Ann, Math., (2) 75 (1962), 536-577. [40J M. Kuranishi, New prooffor the exis- tence of locally complete families of complex structures, Proc. Con£. Complex Analysis, Minneapolis, 1964, Springer, 1965, 142-154. [41] K. Miyajima, On the existence of Kura- nishi family for deformations of holomorphic maps, Sci. Rep. Kagoshima Univ., 27 (1978), 43-76. [42] Y. Miyaoka, Kahler metrics on eIliptic surfaces, Proc. Japan Acad., 50 (1974), 533- 536. [43J Y. Miyaoka, On the Chern numbers of surfaces of general type, Inventiones Math., 42 (1977),225-237. [44J B. G, Moishezon, The algebraic analog of compact complex spaces with a sufficiently large field of meromorphic functions I, II, III, Math. USSR-Izv., 3 (1969), 167-226; 305343; 447-513. [45] J. Morrow, Minimal normal compacti- fication ofC 2 , Rice Univ. Studies, 59, no. 1 (1973),97-112, [46J I. Nakamura, Inoue-Hirzebruch surfaces and a duality of hyperbolic unimodular sin- gularities, I, Math. Ann., 252 (1980), 221-225. [47J S. Nakano, On the inverse monoidal transformation, Pub!. Res. Inst. Kyoto Univ., 6 (1971), 483-502. [48] M_ Namba, Moduli of open holomorphic maps of compact complex manifolds, Math. Ann., 220 (1976), 65-76, [49] A. Newlander and L. Nirenberg, Com- plex analytic coordinates in almost complex manifolds, Ann. Math., (2) 65 (1957), 391-404, [50] J.-P, Serre, Un theoreme de dualite, Com- mcnt. Math. Helv., 29 (1955), 9-26. [51] J.-P. Serre, Geometrie algebrique et geo- metrie analytique, Ann. Inst. Fourier, 6 (1955- 1956), 1-42. [52] C. L. Siegel, Meromorphe Funktionen auf kompacten analytischen Mannigfaltig- keiten, Nachr. Akad. Wiss. Gottingen, 1955, 71-77. [53] T. Ueda, Compactification of C x C* and (C)*2, Tohoku Math. J., 31 (1979), 81-90. [54J K. Veno, Classification theory of alge- braic varieties and compact complex spaces, Lecture notes in math. 439, Springer, 1975. 
73A Complex Multiplication [55J K. Veno, Classifacation of algebraic varieties II. Algebraic threefolds of parabolic type, Int. Symp. Algebraic Geometry, Kyoto, 1977, 693- 708. [56] K. Veno, On algebraic fiber spaces of Abelian varieties, Math. Ann., 237 (1978), 1- 22. [57J E. Viehweg, Canonical divisors and the additivity of the Kodaira dimension for mor. phisms of relative dimension one, Compositio Math., 35 (1977), 197-223. [58J E. Viehweg, Klassifikationstheorie alge- braischer Varietiiten der Dimension Drei, Compositio Math., 41 (1980), 361-400. [59] K. Yano, Differential geometry on com- plex and almost complex spaces, Macmillan, 1965. 73 (V.15) Complex Multiplication A. Classical Theory If the ratio W l /W 2 of two periods WI' W 2 of an telliptic function f belongs to an imaginary tquadratic field K, then there exists an alge- braic relation between f(z) and ((Az) for any ic in K, and such an f is said to have complex multiplication. This phenomenon for the tsn function with modulus j=! was discovered by C. F. Gauss and was applied to the prob- lem of dividing a tlemniscate into five arcs of equal length. More generally, N. H. Abel showed that the special dividing equation of an sn function with complex multiplication is algebraically solvable. From a number- theoretic point of view, L. Kronecker conjec- tured that every tAbelian extension of an imaginary quadratic number field K is deter- mined by a transform equation of an elliptic function with complex multiplication by a number of K (1880). This is an analog of the fact, announced by Kronecker and proved by H. Weber, that every Abelian extension of the rational number field is a subfield of a tcy- clotomic field. Kronecker's work was con- tinued by Weber [2], and his conjecture was proved by T. Takagi (1903) for K = Q( j=!), by T. Takenouchi (1916) for K =Q(e 2 . i / 3 ), and by Takagi (1920) for the general case using tclass field theory. H. Hasse [5] and M. Suga- wara simplified the theory of complex multi- plication, and Hasse noticed a relationship between complex multiplication and tcon- gruence zeta functions. Working from Hasse's idea, M. Deuring constructed the theory of complex multiplication purely algebraically 278 and determined tHasse's zeta function of an elliptic curve with complex multiplication. F or the rest of this article, K always stands for an imaginary quadratic field. Let L be a tlattice group on the complex plane C gen- erated by WI' W2, and let Z be a complex vari- able. Define functions fJ, g2' g3 as follows: fJ(z, L)= &;1(Z;WI,W 2 )=Z-2 + L«Z _W)-2_ w- 2 ), Y2(L)=g2(W"w 2 )=60LW- 4 , g3(L)= g3(W I ,W 2 )= 140LW- 6 , where the sum L is over the elements W of L except O. Let &;1' be the derivative of &;1; then z-->(1, &;1 (z), &;1'(z)) is a one-to-one correspondence between the points on the complex torus C/L and those on the telliptic curve E; XoX =4X -g2XXI - g3X in the projective plane. If the quotient wdw 2 generates K, then the ring of analytic endomorphisms of C/ L (i.e., the ring of endo- morphisms of E) is isomorphic to a subring of the tprincipal order 0 of K. In particular, if the lattice group L is an tideal of K (for K c C;  347 Quadratic Fields), then the ring of endo- morphisms coincides with o. The function J(r)= J(E) = J(L) = 2 6 3 3 g2(L)3 /!l(L) (!l(L) = g2(L)3 -27g 3 (L)2) ofr =WdW2' 1m r>O, is a tmodular function of levell, and J (E) is called the invariant of the elliptic curve E. If E has a complex multiplication, then J(E) is an algebraic integer. Then the three main theorems of the classical theory of complex mult,iplication can be stated, Theorem 1. Let h be the tclass number of K, and let 0 1 ' ..., ah be a set of representatives of ideal classes of K. Then J (a I), ... , J (a h ) are exactly the conjugates of J(ad over K, and K(J(ad) is the maximal unramified Abelian extension (the tabsolute class field) of K ( 59 Class Field Theory). Next we define the function f by [(z; L)=g2g3!l-I. &;1(z; L), K #Q(j=!), Q(e 2 . i / 3 ), =g!l-I,tJ(z;Lf, K=Q(j=!), = g3!l-1 . tJ(z; L)3, K = Q(e 2 . i / 3 ). Theorem 2. Let 0 be the tprincipal order of K, m be an integral ideal of K, and a be an arbi- trary ideal of K. Choose a number  of K such that m = {).EO IAEa}. Then K(J(a), [(; a)) is the tclass field for the tray modulo m. The number  in this theorem is obtained by a-Im()=b, where b is an integral ideal be- longing to the ideal class of a -I m. Thus f(; a) is a "special value" of an elliptic function as well as of a modular function. The tgenerallaw of reciprocity, the tprin- cipal ideal theorem, and the ramification in the class field in theorem 2 can also be described 
279 in terms of elliptic functions or elliptic curves. In general, the ring of endomorphisms of an elliptic curve defined over a field k of charac- teristic 0 is either Z or an order of an imagi- nary quadratic field. On the other hand, if the characteristic of k is not zero, then the ring of endomorphisms may be an order in a definite tquaternion algebra. B. Complex Multiplication of an Abelian Variety Following Kronecker's idea, D. Hilbert posed in his lecture at Paris (1900) the so-called 12th problem: to find an analytic function whose special values generate Abelian extensions over a given algebraic number field ( 196 Hilbert). E. Hecke constructed unramified Abelian extensions of an imaginary biquadra- tic field using tHilbert modular functions [10]. After this, there was no notable devel- opment concerning the problem until the theory of complex multiplication was gen- eralized to the case oft Abelian varieties, which was made possible by progress in algebraic geometry, in particular A. Weil's geometric theory of Abelian varieties (G. Shimura and y. Taniyama [11]). The following results have been obtained: Consider a triple (A, X, t), consisting of an Abelian variety A defined over C, a tpolarization x of A, and a point t of A. Call two such triples (A, x, t) and (A', x', t') isomorphic if an isomorphism of A onto A' maps x onto x' and t onto t'. Then there exists one and only one subfield ko of C with the following property: In order for (A, x, t) and (A a, x a , t a ) to be isomorphic for an automor- phism (J of C, it is necessary and sufficient that (J fix all elements of ko. We call ko the field of moduli of (A, x, t). If A is an elliptic curve E and if t = 0, then ko = Q(J (E)). In the higher- dimensional case, the field of moduli is gen- erated by special values of a tSiegel modular function, If F is a totally imaginary number field that is a quadratic extension of a totally real field Fo of degree n, then there exists a set {<PI' ... , <Pn} of n different isomorphisms of F into C such that ?Pi = <Pj never occurs for i # j, where the bar denotes complex conjugation. We call (F; {<pJ) a CM-type. If a is an ideal of F, then L= {(<PI (ex), ..., <Pn(ex)) E C" I ex E a} is a lattice group in the n- dimensional complex linear space cn, and C"IL is analytically isomorphic to an Abelian variety A of dimension n in a complex pro- jective space. The tendomorphism ring (A) of A contains a ring that is isomorphic to the principal order 0 of F. Conversely, every 73 Ref. Complex Multiplication Abelian variety of dimension n such that (A) contains a ring isomorphic to 0 can be con- structed in this way. Let M be a normal extension of Q contain- ing F. Denote the Galois group of M by G and the subgroup of G corresponding to F by H, and put S = U;, <p;,H, where <P;, stands for a prolongation of the previous <PA to G. Then the following three conditions are equiv- alent: (i) (A) 0; (ii) A is simple; (iii) H = {YEGISy=S}. If H*={bEGlbS=S}, then we can choose i/1nEG such that S= U"H*i/1w If F* is the subfield of M corresponding to H*, then (F*; {i/1,,}) is also a CM-type, and F* = Q(LA<PA(ex)lexEF). Moreover, for an ideal r of F*, II" i/1,,(r) is an ideal of F. Now let Hm be the group of ideals r of F* that are relatively prime to the norm N(m) of an integral ideal m of F* and for which there exists a number  of F with ni/1,,(r)=(), N(r)=112, =I (modXm). " Then Hm is an tideal group modulo m in F*. Theorem 3. Assume that (A)o, and let x be an arbitrary polarization of A. Denote a point of A with m={AEoIAt=O} by t, and let k m be the field of moduli of (A, x, t). Then k,J* is the tclass field over F* corresponding to Hm' The point t in theorem 3 always exists. In particular, if m = 0, then t = O. When A is an elliptic curve E, F is an imaginary quadratic field and we have F* = K, and theorem 3 coin- cides essentially with the content of theorems 1 and 2.lfn> 1, F=F* holds only in special cases. The theory of complex multiplication of A is closely related to the tHasse zeta function of A ( 450 Zeta Functions). References [IJ L. Kronecker, Zur Theorie der elliptischen Funktionen, Monatber. Koniglich Preuss. Akad. Wiss. Berlin, (1881), 1165-1\ 72. (Werke IV, Teubner, 1929, 309-318; Chelsea, 1969.) [2] H. Weber, Lehrbuch der Algebra III, Vieweg, second edition, 1908 (Chelsea, 1961). [3J T. Takagi, Dber eine Theorie des relativ- Abelschen Zahlkorpers, J. CoIl. Sci. Imp. Univ. Tokyo, 41 (1920), 1-132. [4] R. Fueter, Vorlesungen uber die singularen Moduln und die komplexe Multiplikation der elliptischen Funktionen, Teubner, I, 1924; II, 1927. [5] H. Hasse, Neue Begrundung der kom- plexen Multiplikation, J. Reine Angew. Math., 1,157 (1926),115-139; 11,165 (1931), 64-88, 
74A Complex Numbers [6J R. Fricke, Lehrbuch der Algebra III, Vieweg, 1928. [7] M. Deuring, Die Typen der Multiplika- torenringe elliptischer Funktionenkorper, Abh. Math. Sem. Univ. Hamburg, 14 (1941), 197-272. [8J M. Deuring, Die Struktur der elliptischen Funktionenkorper und die Klassenkorper der imaginiiren quadratischen Zahlkorper, Math. Ann., 124 (1952),393-426. [9J M. Deuring, Die Klassenkorper der kom- plexen Multiplikation, Enzykl. Math., Bd. 1 2 , Heft 10, Teil 11,23, 1958, [10] E. Hecke, Hohere Modulfunktionen und ihre anwendung auf die Zahlentheorie, Math. Ann., 71 (1912), 1-37; Dber die Konstruktion relativ-Abelscher Zahlkorper durch Modul- funktionen von zwei Variabeln, Math. Ann., 74 (1913),465-510. (Mathematische Werke, Van- denhoeck & Ruprecht, 1959,21-58,69-114,) [IIJ G. Shimura and Y. Taniyama, Complex multiplication of Abelian varieties and its applications to number theory, Publ. Math. Soc. Japan, 1961. [12] G. Shimura, Construction of class fields and zeta functions of algebraic curves, Ann. Math., (2) 85 (1967), 58-159. [13J A. Borel et aI., Seminar on complex multiplication, Lecture notes in math. 21, Springer, 1966, [14J G. Shimura, Introduction to the arith- metic theory of automorphic functions, Publ. Math. Soc. Japan and Princeton Univ. Press, 1971. 74(11.11) Complex Numbers A. Algebraic Properties of Complex Numbers A complex number is an expression of the form a + ib with arbitrary real numbers a and band the imaginary unit i. Writing (X==a+ib, [3== c + id, we define (X = [3 if and only if a = c and b == d. As regards algebraic operations with complex numbers, we define (X+[3=(a+c)+ i(b+d), (X- [3=(a-c)+ i(b-d), (X[3=(ac- bd) + i(ad +bc), and for [3 #0, i.e., c 2 +d 2 #0, (X/[3 = (ac + bd)/(c 2 +d 2 ) + i«(bc -ad)/(c 2 + d 2 »). Then the addition and multiplication thus defined obey commutative, associative, and distributive laws, and complex numbers form a tcommutative field with 0 = 0 + iO and 1 = 1 + iO as its zero element of addition and iden- tity element of multiplication, respectively. The set of all complex numbers is usually denoted by C. 280 By assigning to each real number a a com- plex number a + iO, algebraic operations on real numbers are carried into those of the corresponding complex numbers. That is to say, the field R of all real numbers is mapped isomorphically into the field C of all complex numbers. By identifying a with a + iO, we are taking R as a subfield of C. Also, 0 + i 1 will be denoted simply by i. From the previous definition of algebraic operations, it follows that i 2 = -I. Furthermore, since (X = a + ib =(a+iO)+(b+iO)(O+il), a+ib is not a mere symbolic expression but can also be regarded as the outcome of algebraic operations on a, b, i in C. The real and imaginary parts of a complex number (X = a + ib are, by definition, a and b, denoted by Re (X and 1m (x, respec- tively. A complex number that is not a real number is sometimes called an imaginary number; in particular, a complex number (X with Re (X = 0 is called a purely imaginary number, For each complex number (X=a+ ib, we define its conjugate complex number as a - ib, and denote it by Cl. We then have (X + [3 =Cl+fJ and (X[3 =ClfJ. The mapping (X-->Cl is an tautomorphism of C which leaves each ele- ment of R invariant. Also, the following rel- ations hold: Re (X = «(X + Cl)/2 and Im(X=«(X- rj)/(2i). Regarded as an overfield of R, C is an textension field of R of degree 2 obtained by the adjunction of i, which is a root of an irre- ducible equation x 2 + 1 =0. The important algebraic property of C is that it is talgebra- ically closed. Namely, for any polynomial f(x) with coefficients in C, the equation f(x) = 0 possesses at least one root in C (tGauss's fundamental theorem of algebra). B. Topology of C The absolute value (or modulus) of a complex number (X=a+ib, denoted by I(XI, is by defi- nition 1(XI= Ja 2 +b 2 =j@. If (X is real, then the absolute value of (X in the sense of complex numbers is identical to the one in the sense of real numbers. It always holds that 1(XI;;o.O and I(XI ==O=(X==O. It follows further that I(X+ [31.s; I(XI + 1[31, I (X[3 I = 1(X11[31, and I(XI = Irjl. For each pair of complex numbers (X and [3, define p( (x, m = I (X - [31. Then with p( (x, [3) as the tdistance function, C satisfies the axioms for a tmetric space, and in particular, Iimnoo (Xn = (Xo=limp«(Xn, (Xo)=O=liml(Xn- (Xol =0= (lim an = a o and lim b n = b o ) (where (Xn = an + ib n , a o = a o + ib o ). From this it is easily seen, as in the case of the set R of all real numbers, that C also becomes a tlocally compact and tcom- plete metric space. With respect to this topology, the four oper- 
281 ations (except for division by zero) are con- tinuous: (Xn -->(Xo and fJn--> f10 imply (Xn + fJn--> (Xo + /10; (Xn - /1n......(Xo - /10; (XnfJn-->(XofJo; and (Xn/fi n -->(Xo//1o (where in the last case we assume fJn #0 and fJo #0). Thus C becomes a topo- logical field. Furthermore, the assignment (X --> Ci gives a continuous mapping C --> C, a homeomorphic automorphism of C. C. The Complex Plane If in a plane to which is assigned rectangular coordinate axes a complex number (X=a+ib is represented by a point (a, b), then the plane is called the complex (number) plane (Gauss- Argand plane or Gaussian plane) (Fig. 1), and the point representing (X is called simply the point (x. The abscissa and ordinate axes are called the real and imaginary axes, respec- tively. A point (X = a + ib can be represented by tpol ar coordinates r, 0 with the origin and the real axis as the pole and the generating line, respectively, where r= Ja 2 +b 2 is the absolute value I(XI of (X and () is the argument (or ampli- tude), denoted by arg(X, of (x. The argument of (X is uniquely determined mod2n if a #0 and is an arbitrary real number if (X = o. imaginary axis a x real axis o -- - - a- Fig. 1 The absolute value I(XI of a complex number (x, regarded as a vector from the origin to the point (x, is the length of this vector. For com- plex numbers (X and /1, to the sum of the vec- tors (X and fJ corresponds the sum (X + fJ of the complex numbers. A complex number (x, in terms of its absolute value r and argument 0, is expressed as (X = r(cos 8 + i sin 8), which is called the polar form of (x. For polar forms the following hold: Ci = r(cos 0 - i sin 0) = r(cos( -8) + isin( -8)); (X-I =Ci/I(X1 2 = r-l(cos( -0)+ isin( - 0)) «(X #0); and (Xl (X2 = rlr 2 (cos(OI +(2)+isin(OI +( 2 )), where l(Xjl=rj and arg (Xj = OJ for j = 1, 2. This last relation when r l = r 2 = 1 is called De Moivre's formula. The nth roots of unity in C are given by Pj= cos 2nj/n + i sin2nj/n (j = 0,1, ..., n- I) (Fig. 2). In the complex plane, the mapping (X --> Ci corresponds to the trellection of the plane in the real axis, (X --> (X + fJ to the parallel ttrans- lation along a vector fJ, (X --> (XfJ (fJ #0) to the trotation through the angle argfJ followed by 74D Complex Numbers a thomothetic transformation with center o and constant ratio IfJl, and (X --> Ci- I to the tinversion with respect to the unit circle {(XII(XI= I}. The distance p( (x, fJ) = I (X - fJ I between (X and fJ in C coincides with their Euclidean distance, provided that (X and fJ are regarded as points in the Euclidean plane, so that the complex plane is tisometric, and accordingly homeo- morphic, to the Euclidean plane. P2 po=l  Fig. 2 D. The Complex Sphere In the rest of this article, P denotes the com- plex plane and E denotes the sphere of radius 1, with 0, the origin of P, as its center. The points N(O, 0,1) and S(O,O, -I) of E will be called the north and south pole, respectively (Fig. 3), where the 1st and 2nd coordinate axes are the real and imaginary axes of P, respec- tively, and the 3rd coordinate axis is ortho- gonal to P. A straight line from N through a point z (a complex number) in P intersects E at a point Z =(x I , X 2 , x 3 ) different from N, where z=(x i +ix 2 )/(I-X 3 ), XI =(Z+2)/ (1 +lzI 2 ),x 2 =(z-Z)/(i(1 +lzI 2 )), and X 3 = (lzl 2 - I )/(lzI 2 + I). The mapping z --> Z is called a stereographic projection from N, by means of which P and E - {N} become tconformally equivalent to each other. Consequently z can be represented by a point Z of E - {N}, and E thus used is called a complex sphere (or Riemann sphere). Let us adjoin to the complex plane P a new element, denoted by 00, called the point at infinity, which corresponds to the only exceptional point N of E. The topology of the complex plane with 00 can be intro- duced by the corresponding topology of the Riemann sphere. Indeed, the family of all the sets {zllzl > M} U {oo} for M > 0 forms a tlo c a1 base around 00. By introducing local complex coordinates (= I/z into the neigh- borhoods of 00, each element of this local base is represented as W 1(1 < M -I}, in which the 
74 E Complex Numbers convention (=0 is adopted for z = 00. The complex sphere thus defined can be regarded as a tRiemann surface (i.e" a I-dimensional tcomplex manifold). The complex plane (complex sphere) whose points are represented by a variable z or w is called a z-plane or a w-plane (a z-sphere or a w- sphere), N' 1\ 1 \ ---:-\  --...., S{-- - -- / 1 -/ xu/ z y p Fig. 3 E. Linear Fractional Functions Given complex numbers a, b, c, and d with ad - bc # 0, we define a linear fractional func- tion (or simply linear function) by az+b W=-. cz+d As a mapping from the z-sphere into the w- sphere, this linear function is called a Mobius transformation (linear fractional Or simply linear transformation). The usual linear trans- formation, i.e., the one with c == 0 in the pre- sent case, is sometimes distinguished as an entire linear transformation. Since (1) depends only on the proportion a:b:c:d, we can assume ad -bc= 1 without loss of generality. The transformation (1) is tho1omorphic and tunivalent on the whole z-sphere with only One exceptional tpo1e at -dlc (00 if c=O) of order 1, and the inverse of (1) is also a linear fractional function The set of alllinear trans- formations forms a tgroup with composition of transformations as the group operation. One of its subgroups is the tmodular group. Linear transformations carry any circle of the complex plane (or of the Riemann sphere) into a circle of the same plane (or of the same sphere) if we adopt the convention that straight lines are a special kind of circle. (I n the case of a Riemann sphere, no such conven- tion is necessary.) Given on a plane a circle with center 0 and radius r and two points p and p' on a half-line issuing from 0 satisfying op'op' =r2, the points p and p' are called symmetric points (or reflection points) with respect to the circle. The transformation p-->p' is called the inversion with respect to this cir- cle. In the complex plane, let z and z' be sym- metric points with respect to a circle C. Sup- pose that by a linear transformation, z, z' and 282 C are carried to points w, w' and a circle D, respectively; then wand w' become symmctric with respect to the circle D (principle of reflec- tion). Thus symmetricity is invariant under linear transformations, Also the tanharmonic ratio of any four points Z\, Z2, Z3' and Z4' (z" Z2; Z3, Z4) == (z 1- z3)/(z 1 - Z4):(Z2 - z3)1 (z 2 - Z4), is invariant under a linear transfor- mation; i.e., (z" Z2; Z3,Z4) =(w\, w 2 ; w 3 , w 4 ) holds, where w j is the image of Zj under a linear transformation (j = 1,2,3,4). F. Normal Forms of Linear Transformations (1) Thcre cxist fixcd points of the transformation (J) on the Riemann sphere, i.e., points satisfy- ing z =(az + b)/(cz + d). The number of fixed points is 2 or 1, except when w=z. If the trans- formation has two fixed points, they will be denoted here by p and q. The natural conven- tion p = q is adopted if the transformation has one fixed point. If c = 0, then p or q is 00, and furthermore if c = a - d = 0, then p and q are both 00. For unequal finite p and q, (I) can be re- written in the following normal form: w-p z-p -=ex- w-p z-q' a-cp ex=-#I, a-cq in which, according as arg ex = 0, 1111 = I, or otherwise, (1) is called a hyperbolic (Fig. 4), elliptic (Fig. 5), or loxodromic transformation, respectively, This classification can be applied also for finite p and infinite q, i.e., to the trans- formation w - p = l1(z - p). Furthermore, for p = q # 00, (J) is rewritten in the following form: I 1 -=-+#, w-p z-p c {3=- a-cp Fig. 4 Hyperbolic transformation. Fig. 5 Elliptic transformation. 
283 In this case (1) is called a parabolic transfor- mation (Fig. 6). For p=q = 00, i.e., if w=z + [3, (I) is also called parabolic. We can easily de- termine to which class (I) belongs from the discriminant D=(a+d)2-4 of the quadratic equation cz 2 -(a-d)z-b =0 obtained from z=(az+b)/(cz+d) with ad-bc= I by multi- plying both sides by cz + d. If a + d is real, then according as D> 0, < 0, or = 0, (1) is hyperbolic, elliptic, or parabolic, respectively, and if a + d is not real, then the transforma- tion is loxodromic. Fig. 6 Parabolic transformation, Let D and D' be two arbitrary circular disks. Then there always exists a linear transforma- tion which gives a one-to-one tconformal mapping from D onto D'. Conversely, any mapping with this property is given only by linear transformations (provided that the half- plane having a straight line together with the point at infinity as its boundary is regarded as a closed disk), which are uniquely determined by giving three points a, b, c from the bound- ary of D and as their corresponding points, three arbitrary points a', b', c' from the bound- ary of D'. G. The Poincare Metric Since conformal mappings from the domain Izl < 1 onto Iwl < 1 are given by the trans- formations W=I;(Z-Zo)/(I- 20Z) (II;I = 1, Izol < 1) ( Appendix A, Table 13), for corre- sponding z and w it holds that Idwl Idzl 1-lw1 2 l-lzI 2 ' Idzl/(1-lzI 2 ) is called Poincare's differential invariant. With a metric having ds= Idzl/(I- Iz12) as its tjine element, the unit disk Izi < 1 becomes a tnon-Euclidean space in the sense of Lobachevskii, and the metric is called the Poincare metric, Furthermore, since the trans- formations (2) leave the length of curves invar- iant, they can be regarded as tmotions in this space, where the tgeodesic through two points z 1 and z 2 is the circular arc orthogonal to the unit circle. If we denote the intersections of the arc with the unit circle by Z3 and Z4, then the tnon-Euc1idean distance between two points 75A Computers Z 1 and z 2 along the geodesic is given by (1/2)log(zl,z2;z3,z4)' provided that the points Z4, z 1> Z 2' Z 3 are arranged on the arc in this order ( 285 Non-Euclidean Geometry). References [1] C. F. Gauss, Werke I-XII, Gottingen, 1863-1931. [2] N. Bourbaki, Elements de mathematique III. Topologie generale, ch. 8. Nombres com- plexes, Actualites Sci. Ind., 1235b, Hermann, third edition, 1963; English translation, Gen- eral topology, pt. 2, Addison-Wesley, 1966, [3] L. V. Ahlfors, Complex analysis, McGraw- Hill, second edition, 1966. [4] C. Caratheodory, Funktionentheorie I, Birkhauser, 1950; English translation, Theory of functions of a complex variable I, Chelsea, 1954. 75 (XVI.2) Computers A. History (2) Since the beginning of civilization, people have utilized tools for aiding computation. In Japan, bamboo computing rods were used in the 7th century; before the close of the 16th century, the abacus was imported from China ( 230 Japanese Mathematics (Wasan)). The first calculator capable of performing the four arithmetic operations automatically was de- signed by W. Schickard of Tiibingen Univer- sity (1623). After that, B. Pascal independently made his famous adding machine; this was improved by G. W. Leibniz so that it was able to execute multiplications and divisions, Most calculators manufactored today are electronic; however, many of these machines are not "programmable" and hence require at every step of computation a manual operation for specifying the machine operations to be executed. On the other hand, modern auto- matic computers can execute automatically a sequence of operations according to a given program without any manual intervention. The automatic computer was conceived by the English mathematician C. Babbage in the 19th century, but mechanical engineering at that time was not advanced enough to allow the construction of such a computer. His idea was first realized by the relay computers Z3 of K. Zuse (1941) and Mark I of Harvard Uni- versity (1944), In 1947, the first electronic computer, ENIAC, appeared, in which vacuum 
75 B Computers tubes were utilized instead of mechanical com- ponents. Since then, the capabilities of com- puters have increased rapidly and it can be said that we now live in the "computer age," B. Principles of Modern Computers Information processing in a computer is based on communication among its constituents by electric signals. In digital computers, informa- tion is encoded as a sequence of binary num- bers 0, 1, whereas continuous values are al- lowed in tanalog computers, The minimum quantity of information in digital computers is therefore a binary digit, called a bit. Since continuous values fluctuate on account of electric noise in the circuits, digital computers have the advantage of maintaining high preci- sion during computation. The binary numbers 0, 1 are represented in practice by two distinct electric signals: two distinct voltages, two distinct phases of alter- nating current, the existence or nonexistence of a pulse, etc. A system of circuits is called synchronous when it contains a clock, a gener- ator of periodic pulses, which synchronizes the transmission and transformation of in- formation. In an asynchronous system, circuits execute each step of information processing independently and advance to the next step after verifying the termination of the preceding step, In ENIAC, a program was inserted into the computer by carefully connecting many con- trollines on plugboards. Since this caused much trouble in inserting the different pro- grams needed for different jobs, 1. von Neu- mann proposed encoding programs in se- quences of binary digits and storing these programs in the memory unit of the computer. His principle, the stored program principle, was realized by EDSAC (1949) and has since been widely used. An automatic computer in general con- sists of the following five units: the arithme- tic unit, memory, control, input, and output (Fig. 1). These units are interconnected by wires that exchange information in the course of computation, =;:> Passage of information - Passage of control signals Fig. 1 Construction of electronic computers. 284 The arithmetic unit consists mainly of sev- eral memory registers and operational circuits associated with them. Each register stores a binary number of n bits (usually 16.s; n.s; 64). An important building block of the opera- tional circuit is the basic adder of i-bit num- bers, A parallel adder of n-bit numbers can be obtained by connecting n copies of the basic adder. Alternatively, a sequential adder can be composed of a single basic adder which is utilized repeatedly to sum up bit by bit two binary numbers from the lowest bit, Subtrac- tion is usually carried out by adding the com- plement of the subtrahend. Multiplication is realized by shifting the multiplicand to the left and adding it to the intermediate sum; this addition may be omitted, depending on the relevent bit of the multiplier. In division, we shift the dividend to the left and if possible subtract the divisor from it. By recording at each step whether or not the subtraction is possible, we obtain the quotient. The memory unit stores the instructions and given data as well as the necessary data obtained in the course of computation. It is divided into many registers, each of which contains a number and is referred to by means of a serial number called an address. At present, high-speed memory units are usually made up of magnetic cores or integrated circuits. As auxiliary large-scale memory, there are also magnetic drums, disks, and magnetic tapes, etc. Other elements under investigation are extreme-low-temperature elements, chemical elements, optical elements, etc. . The control unit repeats the following oper- ations consecutively: (1) takes an instruction from the memory location indicated by the sequential control counter, (2) gets a data word from the memory according to the ad- dress part of the instruction, (3) decodes the function part of the instruction and sends control signals to appropriate circuits, and (4) increases the content of the sequential control counter by 1 and, after receiving end signals from the arithmetic unit, returns to step (1). For these purposes the control unit contains a counter, a decoder, an encoder to send control signals, and a register to store the instruction to be executed. The arithmetic, control, and memory units form the central processor of the computer. In contrast to the central processor, the input and output units are called peripheral devices. An input device receives the necessary information (a program and data); a card reader reads punched cards, and a teletypewriter sends signals directly into the computer. An output device presents the results obtained by the computer. The results are usuaIly printed by a teletypewriter or line printer. Magnetic tape 
285 units can be considered as auxiliary input- output devices, since tape reels are removable. The tape units can accept prepared input data as well as rccord the results, the output. These devices are operated according to given input-output instructions under the control of the central processor. However, since every input-output device contains me- chanical components and is extremely slow in comparison with the central processor, a large- scale computer is often accompanied by sat- ellite computers which undertake the control of input-output devices. C. Instructions and Programming In stored-program operation, a computer performs a sequence of calculations according to given instructions, There are various types of instructions, but those most frequently used are single-address instructions, each of which contains a single-address part designating an operand. A program is a finite sequence of instruc- tions arranged suitably for the required com- putation. Programming, or making a program, is therefore the task of decomposing the re- quired computation into elementary steps each of which corresponds to an instruction. Every instruction is represented in a com- puter by a number, a numeric code, which is determined in a definite way by the construc- tion of the control unit. Before starting com- putation, instructions thus encoded are stored in the memory. In this sense, a program is a sequence of numbers. This seq uence is caIled a machine-language program. A program is usuaIly divided into several blocks, called subprograms or subroutines. Some subroutines are made in advance, espe- cially those for frequently required jobs such as evaluating elementary functions and manipulat- ing input-output devices, etc. The system of these ready-made programs is called software, in contrast to the hardware (i.e., mechanico- electronic equipment) of the computer. Quite often basic routines are microprogrammed, i.e., written in a simple code and stored in a fast read-only memory (ROM). In this case the set of built-in programs is called firmware. The handliness of a computer depends mainly on close matching of the software and hardware. Programs are usuaIly written in certain forms called external languages which are easy to master. A problem-oriented language is an advanced external language in which ordinary arithmetic expressions are available with slight modifications. Programs written in these lan- guages are translated into machine languages by program input routines. The translator for 75D Computers a problem-oriented language is called the compiler. A compiler accepts several macroinstruc- tions. Moreover, it is equipped with the fol- lowing facilities: (i) the ability to translate arithmetic expressions into machine language; (ii) the ability to generate linkages to various ready-made subroutines according to certain simple indications; (iii) the ability to auto- matically allocate programs, subroutines, and data in the memory; (iv) the ability to check automatically the syntactic correctness of programs. Thus it acccpts an external form such as if xO then printreal (SQRT(x)) else printstring ('negative'). Another important translator is the as- sembler, which translates mnemonic codes of instructions (add for addition, etc.) into their numeric codes according to a given table. It allows us to utilize symbols for specifying addresses. It also converts decimal numbers into binary and generates certain segments of the program from rather simple indications. Compilers and assemblers are important con- stituents of software. The large-scale high-speed computers that have recently appeared have made software systems inevitably more complex. There is now software, so-called monitors or operating systems, that supervises the uninterrupted processing of many programs. Some operating systems coordinate several assemblers and compilers so that several languages can be mixed in writing a program, Examples of other important software are mathematical (numerical) software and database manage- ment systems. D. Mathematical Models of Computers An operational circuit in the arithmetic unit is usually constructed from basic elements (log- ical gates), each of which performs a certain operation on the binary signals. Thus the construction of a circuit from basic elements is represented by a composition of a logical function defined over the set {O, I} from a given set of basic functions. Post [3J consid- ered functional composition without feed- back loops and established a general crite- rion for a given set of logical functions to be complete, i.e., to be capable of generating the whole set oflogical functions. Strictly, how- ever, a logical gate takes a definite time delay to perform its operation. Therefore Kudryav- tsev [4J proposed as a model of a logical gate a pair (f, d) of a logical function f and a non- 
75 E Computers negative integer d: f represents the operation of the gate and d represents its delay. He de- fined the feedback-free composition of such functions with delays and gave a completeness criterion for a set of logical functions with delays. His study has been extended in various directions by Loomis [5], Nozaki [6], and Rosenberg [7]. When a circuit contains memory elements or feedback loops, its function is suitably represented by an tautomaton, Hence the design and analysis of circuits with memory elements, state-minimization, and equivalence checking [8], decomposition into simpler components [9,10], verification of complete- ness [11], etc., can be studied in terms of auto- mata ( 31 Automata), A computer itself can be considered to be a finite automaton, since it has a finite number of memory elements and its behavior is completely determined by the content of the memory (its internal state) and the inputs. However, since a modern computer has an enormous memory and contains re- placeable parts, such as magnetic tapes, it is more adequately represented by an infinite model, such as a tTuring machine. Turing machines are capable of simulating many intellectual activities governed by for- mal rules. It is believed that any welI-defined finite algorithm can be simulated by a Turing machine (Church's thesis.) Thus any compu- ter can be simulated by a Turing machine, and hence it is unable to resolve those deci- sion problems which are unsolvable for TUr- ing machines. For instance, no program can decide in finite steps whether a given pro- gram written in a computer language, say FORTRAN or PASCAL, eventually stops or not. On the other hand, a modern computer can simulate a universal Turing machine pro- vided that its memory can be extended un- boundedly by supplying magnetic tapes. Hence any we II-defined finite algorithm can be simulated by a computer unless the limi- tation of memory capacity hampers its accomplishment. E. Mathematical Theory of Programming Languages Along with the development of software, pro- gramming techniques have gradualIy accumu- lated. For instance, we now have an almost satisfactory method of translating arithmetic expressions into machine language. However, we stiIIlack a general theory to cover effec- tively a wide array of programming problems. The design of an adequate metalanguage 286 is stilI an important problem, if the word "adequate" implies complete description of syntax and semantics of problem-oriented languages. An interesting problem is to define and suitably classify grammars with respect to their capabilities of forming languages. Such research is an important branch of mathema- tical linguistics. N. Chomsky [12J has inves- tigated this problem, starting from research on natural languages, and has given formal defini- tions of the grammars ( 31 Automata D). tContext-free grammars (type-2Ianguages) play an important role in the theory of soft- ware as well as in Chomsky's syntax, since they are powerful enough to describe the pa- renthesis structure and simple and easy to manipulate, Some variants of context-free grammar have been proposed for attaining high efficiency in the automatic appraisal of programs [13,14] or for increasing the pro- grams' generative power [15]. F. Branches of Information Science Many fields related to computers now make up the information sciences or informatiques. Important mathematical theories born or developed in the information sciences are: (1) Design and analysis of hardware devices: tBoolean algebra [16], switching theory [17], and theory oftautomata [18, 19]. (2) Design and analysis of programming lan- guage; theory of formal languages [14,21]. (3) Design and analysis of algorithms; tnumer- ical analysis, theory of tcomplexity of compu- tation, theory oft data processing. (4) Mathematical foundation of programming; logical verification of correctness and equiva- lence of programs [22J, trecursive function theory, tdecision problems. Applications of the information sciences are found in diversified fields, such as statistics, operations research, mathematical psychology, econometrics, jurimetrics, and behaviorimetrics. References [IJ H. H. Goldstine, The computer from Pas- cal to von Neumann, Princeton University Press, 1972. [2J W. Eicken and E. H. Zuse, Elektronische Datenverarbeitung, 2 (1962), 7475. [3] E. L. Post, Introduction to a general the- ory of elementary propositions, Amer. J. Math., 43 (1921),163-185. [4] V. B. Kudryavtsev, A completeness theo- rem for a class of feedback-free automata (in Russian), Problemy Kibernet., 8 (1962), 91- 115. 
287 [5] H. H. Loomis, Jr., Completeness of sets of delayed-logic devices, IEEE Trans., EC-14 (1965),157-172. [6] T. Hikita and A, Nozaki, A completeness criterion for spectra, SIAM 1. Comput., 6 (1977),285-297. [7] I. G. Rosenberg, Completeness properties of multiple-valued logic algebras, Computer Science and Multiple-Valued Logic, Theory and Applications, D. C. Rine (ed.), North- Holland, 1977, 142-186. [8J S. Eilenberg, Automata, languages and machines, Academic Press, vol. A, 1974; vol. B, 1976. [9] 1. Hartmanis and R. E. Stearns, Algebraic structure theory of sequential machines, Pren- tice- Hall, 1966. [10] A. Nozaki, Practical decomposition of automata, Information and Control, 36 (1978), 275-291. [11J M. A. Arbib, Theories of abstract auto- mata, Prentice-HaIl, 1969. [12] N. Chomsky, On certain formal prop- erties of grammars, Information and Control, 2 (1959),137-167. [13] J. E. Hopcroft and J. D. Ullman, Formal languages and their relation to automata, Addison-Wesley, 1969. [14J A. V. Aho and J. D. UIIman, The theory of parsing, translation and compiling, Prentice HaIl, vol. 1, 1972; vol. 2, 1973. [15J A, Salomaa, Formal languages, Academic Press, 1973, [16] J. Kuntzmann, Algebra de Boo1e, Dunod, 1968. [17J S. Muroga, Logic design and switching theory, Wiley, 1979. [18] M. L. Minsky, Computation: Finite and infinite machines, Prentice-Hall, 1967. [19J J. E. Hopcroft and J. D. Ullman, Intro- duction to automata theory, languages and computation, Addison-Wesley, 1979. [20J Z. Manna, Mathematical theory of com- putation, McGraw-Hill, 1974. [21J W, A. Wulf, M. Shaw, P. N. Hilfinger, and L. Flon, Fundamental structures of com- puter science, Addison-Wesley, 1981. 76 (VI.17) Conformal Geometry A. Mobius Geometry We represent an n-dimensional sphere sn as the tquadric hypersurface sn: xi + x + .,. + xi; -2xoxoo =0 in an (n + I)-dimensional real tprojective space pn+!, where the (x,) are 76A Conformal Geometry thomogeneous coordinates in pn+!. We denote by M(n) the group of all tprojective transfor- mations of pn+! that leave sn invariant. Then the transformation group M(n) acts on sn. The pair (sn, M(n» is called the conformal geom- etry or Mobius geometry. We calI sn an n- dimensional conformal space, a transformation belonging to M(n) a Mobius transformation, and M(n) the Mobius transformation group. Every point of the projective space pn+1 corresponds to a hypersphere on sn. For example, if a point A lies outside of sn, then the intersection of S" and the tpolar hyperplane of A with respect to sn is an (n-l)-dimensional sphere sn-I, and the point A corresponds to this real hyper- sphere sn-l. Similarly, if a point A lies on sn, it corresponds to a point hypersphere, and if a point A lies inside of sn, then A corresponds to an imaginary hypersphere. We sometimes identify the point A with the corresponding hypersphere. For any two points A = (a,) and B=(b,), we put AB=a l b l +a 2 b 2 +... +anb n -(aob oo +aoob o ) and can it the inner product of the two hyperspheres A and B. The angle () between two intersecting real hyperspheres A and B is defined by cos () == AB/(jA2 . JB2). This angle is invariant under the Mobius transformation. In the projective space pn+l, we take a tframe (A o , AI' .., ,An, Aoo) that satisfies the conditions AJ =AoAj=AiAoo == A =0, AoAoo = -1, A i A j =9ij, i,j= 1,2, ...,n, where (9i) is a positive definite matrix. We see that Ao and Aoo are points on sn and each Ai is a real hypersphere passing through these two points. Every hypersphere X of sn can be written as a linear combination of the Ai:X = uOAo+ulAI +... +unAn+uooAoo. That is, X is represented by tprojective coordinates (u,) with respect to the above frame. We call these homogeneous coordinates (u,) (n + 2)- hyperspherical coordinates of the hypersphere X. If we use these coordinates, the inner prod- uct of two hyperspheres X = (u,) and Y = (v,) is given by XY=L?jlgijUiVj-(U01Joo +uoov o ). The Mobius transformation group M(n) is a topological group with two tconnected com- ponents. If we denote by Mo(n) the maximal connected subgroup of M(n) and by H the subgroup of M(n) that leaves invariant a real . hypersphere of So, then the set E of all real hyperspheres of sn can be identified with the thomogeneous space Mo(n)/H. The group H also consists of two connected components. If we denote by Ho the maximal connected subgroup of H, the homogeneous space E= Mo(n)/ Ho is a two-fold tcovering space of E. 
76 B Conformal Geometry For each real hypersphere AEE, an element A E E over A is called an oriented real hyper- sphere. The Mobius transformation group con- tains as its subgroups ones that are isomor- phic to the group of tcongruent transforma- tions of a Euclidean space and ones that are isomorphic to the group of congruent trans- formations of a non-Euclidean space. That is, the subgroup of M(n) that leaves a point hypersphere invariant is isomorphic to the group generated by congruent transformations and thomotheties of the Euclidean space En. The subgroup of tindex 2 (the factor group by a tcyclic subgroup of torder 2) of the subgroup of M(n) that leaves invariant a real (imaginary) hypersphere is isomorphic to the group of congruent transformation of n-dimensional hyperbolic (elliptic) tnon-Euclidean space. In the n-dimensional Euclidean space En, consider a hypersphere of radius r with center O. For each point P of En, mark a point Q on the tray OP such that OP . OQ =r 2 . We call the point transformation that sends P to Q an inversion with respect to the hypersphere. A tsymmetry with respect to a hyperplane, con- sidered as an extreme case of inversions, is also called an inversion. We adjoin a point at infinity to the space En to construct an n- dimensional sphere sn. Each inversion can be extended to a transformation of sn, which we also call an inversion. Then each Mobius transformation is generated by a finite number of inversions. By a Mobius transformation of sn, each hypersphere is transformed to a hypersphere. Any angle between two curves that intersect at a point of En is invariant under Mobius transformations. Conversely, if n:;:, 3, each local transformation of En that leaves invariant the angle of each pair of inter- secting curves is a trestriction of a Mobius transformation, However, for n == 2 this is not true in general; any transformation that leaves angles invariant is called a tconformal map- ping. Any Mobius transformation z-->w on the tcomplex sphere S2 = C U { oo} can be ex- pressed by an equation of the form w=(exz+P)/ (l'z+£5) or w=(exz + [3)/(yz+ £5), where 11, p, 1', and £5 are complex numbers such that ex£5 - py # o and z denotes the tcomplex conjugate of z. B. Laguerre Geometry Let r be an oriented smooth curve in a Euclid- ean plane E 2 . The tangent line at a point P of r is supplied with an orientation that is induced by the orientation of the curve r in an obvious manner. The oriented line 1 thus obtained is called the oriented tangent line of r at p. 288 Let S be the set of oriented lines in E 2 and T be a given set of oriented smooth curves in E 2 . Consider a bijection y of the direct product S x T to itself satisfying the following condition: If 1 is an oriented tangent line of a curve r belonging to T and y sends (I, r) to (I', r'), then I' is an oriented tangent of the curve P. The set of such bijections forms a group G. Sup- pose that we have IES for which there exist two curves r 1 and r 2 in T such that 1 is a common tangent line of r l and r 2 at PI and P2, respectively. Further suppose that the element y E G sends (I, r i ) to (I', r;) (i = 1,2). Then the element y of G is called an equilong transformation if the following conditions (i) and (ii) are satisfied: (i) I' is tangent to r( at points pi (i = 1,2), (ii) the distance between PI and P2 is equal to the distance between P'I and p. The set of equilong transformations forms a subgroup H of G. In particular, if T is the set of oriented circles (including point circles), the elements of H are called Laguerre transfor- mations. In an obvious manner, we can divide the set of oriented circles into two classes, those with "positive" and those with "nega- tive" orientations. With an oriented circle r we associate the pair <p(r)=(P,r), where P is the origin of the circle and r is a real number whose absolute value is equal to the radius of the circle and whose signature coincides with that of the orientation of r. The mapping <p: T -->E 2 x E I thus defined is called Lie's minimal projection. An example of a Laguerre transformation, called a dilatation, is given by a bijection y of S x T to itself satisfying the following condition: Let y(l, r) = (I', r'); then I' is parallel to I, the distance between 1 and l' is a given number, and <p(r') = (P, r + c), where <p(r) ==(P, r) and c is a given constant. We note here that the action of a Laguerre transfor- mation y :(1, r)-->(I', r') is determined by its action on S (y acts on S by y(l) = 1'). Another example of a Laguerre transformation y, called a Laguerre inversion, is determined by means of a given oriented circle 0 and a line P that is not tangent to 0; given an oriented line I, its image l' under the action of y is determined as follows (here we describe the case where 1 is not parallel to p). There exists a uniquely determined oriented tangent line 9 of the circle o parallel to I. Now we have a uniquely deter- mined oriented tangent line g' of 0 that passes through the point of intersection of the lines 9 and P such that g' #g, The image I' is the line parallel to g' and passing through the point of intersection of the lines 1 and P (Fig. I). Each Laguerre transformation can be written as a product of a finite number of Laguerre inver- sions, We denote the group of Laguerre trans- formations by L. The pair (L, S) is, by defini- 
289 tion, a model of Laguerre geometry. Notions such as Laguerre inversions, dilatations, and transformations can be generalized to cases of higher dimension by utilizing oriented hyper- spheres and oriented hyperplanes. p Fig. 1 C. Sphere Geometry Let S be the set of oriented circles (including point circles and oriented lines) in the Eucli- dean plane E 2 . Two oriented circles C l and C 2 are said to be in contact with each other if they have a point p and an oriented tan- gent line passing through p in common. (An oriented circle C and an oriented line I are in cOntact with each other if and only if I is an oriented tangent line of C.) In this case, we call the pair (C" C 2 ) a contact pair. A bijection y of S to itself is called a Lie transformation if it sends any cOntact pair to another. (The Lie transformation is a special case of the tcontact transformations.) An inversion with respect to a circle determines in an obvious manner a Lie transformation, which is also ca1\ed an inver- sion. We denote the group of Lie transforma- tions by G, Any element y of G can be written as the product of a finite number of inversions and Laguerre inversions, and G contains the group of Mobius transformations and the group of Laguerre transformations as sub- groups. The pair (G,S) is called a model of circle geometry. The notion of circle geometry can be generalized to that of hypersphere geom- etry for the case of higher dimensions. Speci- ficaIly, when we replace E 2 by E 3 and circles by spheres, we have sphere geometry. Let V be a complex 3-dimensional space, and let M, N be the sets of oriented lines and oriented spheres in V, respectively. Then M, N have the structure of 4-dimensional complex manifolds that are homeomorphic to each other. The homeomorphism is given by the Lie line-sphere transformation that induces a bijection from the set of pairs of intersecting oriented lines onto the set of pairs of oriented spheres that are in contact with each other. 76D Conformal Geometry D. Group-Theoretic Considerations Here we discuss the preceding three kinds of geometries from the group-theoretic point of view ( 137 Erlangen Program). Let us con- sider the quadratic form Q defined by Q(x) = - X6 + xi + x + X5 - xi in a real projective space p4, where x=(X O ,XI,X2,X3'X 4 ) are homogeneous coordinates. We denote by G the set of all projective transformations of p4 that leave Q invariant. The group G consists of the set of matrices A of order 5 such that det A = 1 and Q(Ax) = Q(x) holds for all x in p4; we denote by L 3 the set of all points x in p4 that satisfy Q(x)=O. Then G acts transitively on L 3 . Hence if we denote by Ha the set of all ele- ments of G that fix a point a in L 3, for example a == ( -1, 1,0,0,0), we may assume that L 3 = GIRa. The circle geometry that belongs to the group of Lie transformations of circles is exactly the geometry of the homogeneous space GIHa. The group G b ofa1\ transforma- tions of G that leave the hyperplane X 4 == 0 invariant acts transitively on L 3 n {X4 =O} (G b is isomorphic to M (2)). The geometry of the homogeneous space GblRa n G b is plane con- formal geometry. Next, the group G a of all transformations of G that leave invariant Xo + x 1= 0 coincides with Ha and acts transitively on L 3 n{xo+xl =O}. The geometry of the homogeneous space L 3 n {xo+ XI =O} (on which G a acts transitively) is plane Laguerre geometry. In this sense, the circle geometry that belongs to the group of Lie transforma- tions contains the other circle geometries as subgeometries, We now describe how plane Laguerre geom- etry can be realized as the geometry of the space L 3 n {xo +x 1 =O}. Let E 2 be a plane in a Euclidean space E 3 . We fix a Cartesian co- ordinate system (YO,YI,Yz) in E 3 so that E 2 is given by Yo = O. To each point y of E 3 we can associate an oriented circle in E 2 with center (0,Yl'Y2), radius IYol, and positive (negative) orientation if Yo is positive (negative). If Y lies on E 2 , the corresponding circle is the point circle Y itself. Now let us consider the group G of all affine transformations of E 3 whose rota- tion parts leave the quadratic form Q'(y) = - Y6 + yi + Y invariant. (G is an isometry with respect to the tmetric defined by Q'.) Each element of G induces a transformation of the set of oriented circles in E 2 (including the oriented lines and point circles) onto itself. The mappings of E 3 into L 3 defined by Xo = (1 +Q'(Y))/2, Xl =(I-Q'(Y)/2, X2= Yl, X3= Y2, X 4 = Yo is a one-to-one correspondence of E 3 onto the subset of L 3 such that xo + x I # 0, This correspondence induces an isomorphism of G onto Ga. In Laguerre geometry, there are 
76 Ref. Conformal Geometry no essential distinctions between points and oriented circles on E2; and the group G a acts on a 3-dimensional space of oriented circles (including point circles). Reference [IJ W. Blaschke, Vorlesungen iiber Differen- tialgeometrie III, Springer, 1929. 77 (XI.14) Conformal Mappings A. General Remarks Let a function w= f(z) that maps a tdomain D on the tcomplex z-sphere homeomorphically onto a domain A on the complex w-sphere satisfy the following two conditions: (I) Every curve C z : z(t) (O.s; t.s; 1) that starts at any point Zo in D and possesses a tangent there has an image curve Cw:w=w(t)=f(z(t)) (O.s;t.s; 1) that also possesses a tangent at the image point Wo = f(zo)' (2) The angle between any two curves Cl) and C2) possessing tangents at Zo is equal to the angle between their image curves Owl) and Ow 2 ), where the direction of the angle is also taken into account. Then the mapping from D onto A is said to be conformal, and A is called conformally equivalent to D. It has been proved that w = f(z) is necessarily a func- tion tanalytic in D (D. Men'shov, 1931). Consequently, the theory of conformal mapping is a branch of the theory of analytic functions. That a function w = f(z) maps a domain D conformally onto a domain A means that it is a tmeromorphic function tunivalent in D and its range is A. Then I'(z) # o holds at every (finite) point in D, and the ratio of the lengths of the segments between two points w 0 = w(O), w( t) on C w and between two points Zo = z(O), z(t) on C z tends to a fixed nonvanishing limit II'(zo)1 as t -->0 indepen- dently of the choice of C z . Hence if Zl and Z2 lie on Cl) and q2), respectively, and WI and W 2 are their image points lying on the image curves C} and C), respectively, then the two triangles L.z I ZOZ2 and L.w I W O W2 are nearly similar in the positive sense, provided that z I and Z2 are near enough to zo0 This justifies the word "conformal," which means "of the same form" ( Appendix A, Table 13). B. Conformal Mapping onto the Unit Disk A fundamental theorem in the theory of con- formal mapping is Riemann's mapping theo- 290 rem, which states that any tsimply connected domain D with at least two boundary points can be mapped conformally onto the interior A of the unit circle. This theorem is equivalent to the assertion of the existence of tGreen's function of D and can be proved in various ways. B. Riemann (1851) gave a proof, based on an idea of C. F. Gauss, by assuming the existence of a solution for a variational prob- lem minimizing the tDirichlet integral. The logical incompleteness implied by this as- sumption was later removed by D. Hilbert and others. The proof that is now regarded as simplest is due to L. Fejer and F. Riesz's method (T. Rad6, 1922, 1923), which applies tnormal family theory. On the other hand the osculating process due to P. Koebe (19'12) is a purely constructive method of proving existence that is also applicable to the case of tmuItiply connected domains. The map- ping function w = f(z) in Riemann's mapping theorem is uniquely determined under the normalization condition f(zo) = 0, argl'(zo) = eo at a point Zo in D, where eo is a given angle. The other types of simply connected do- mains are the Riemann sphere less one point and the sphere itself. In both cases every con- formal mapping of these domains is a linear transformation. A simply connected domain is called hyperbolic, parabolic, or elliptic if it is conformally equivalent to the unit disk the complex plane, or the Riemann sphere,' respectively. Let w= f(z) map a simply connected domain D On the z-sphere conformally onto a simply connected domain A on the w-sphere. If a sequence {z,} in D tends to a boundary point ( of D, then the corresponding sequence {w,} (w v = f(zv» has no taccumulation point in A and does not necessarily tend to a boundary point of A. If for any sequence {zv} tending to (, {I(z,)} tends to a unique point w on the boundary of A, it is said that .f(z) possesses a boundary value w at (. To investigate the behavior of {w v } on a hyperbolic domain D, we can assume, by using a suitable mapping if necessary, that D is bounded. Then the problem is reduced, in view of Riemann's mapping theorem, to the case where D is a bounded simply connected domain and A is the unit disk Iwl < 1. C. Correspondence between Boundaries Concerning the correspondence between boundaries under the conformal mapping w= [(z) of a bounded simply connected domain D onto the unit disk I wi < 1, we have the fol- lowing three theorems: 
291 (1) To any taccessible boundary point Zc of D there corresponds a unique point on Iwl = 1, and to any distinct accessible boundary points zC 1 and zC 2 of D there correspond distinct points On the unit circumference. Furthermore, the set of all points on Iwl = 1 that correspond to accessible boundary points of D has tangu- lar measure equal to 2n. (2) Thcre is a onc-to-one correspondence between tboundary elements of D and points on I wi = 1 (c. Caratheodory). (3) Let w= f(z) map the interior D of a tJor- dan curve C conformally onto the unit disk A: I wi < 1. Then it possesses a boundary value, say f(O, at every point' on C that satisfies If(OI = 1. Hence f(z) is continuous On the closed domain i5 = D U C and maps i5 bijec- tively onto the closed disk A: Iwl.s; 1. Similarly, the inverse function z = <p( w) has an analogous property and maps A bijective1y and continu- ously onto i5; that is, a conformal mapping of the interior D of a Jordan curve onto the unit disk A can be extended into a homeomor- phism of the closure i5 to A (Caratheodory). In this case, if the Jordan curve C contains a tregular analytic arc r, then the mapping function w== f(z) can be prolonged analytically beyond r (except at the endpoints of r). Hence the mapping w= f(z) is conformal at interior points of r. Problems on the correspondence of angles at the boundary are closely related to tangular derivatives. These problems have been at- tacked by Caratheodory, S. Warschawski, 1. Wolff, and others. D. Schwarz-Christoffel Transformation The problem of determining the form of an analytic function that maps the interior of a circle or a half-plane conform ally Onto the interior of a polygon was first dealt with by H. A. Schwarz and E. B. Christoffel (1869) ( Appendix A, Table 13). Let P be a polygon in the complex w-p1ane with vertices b ll (J1 = 1, "', m) and interior angles IX. Il n at b w Then a function w = f(z) that maps a circular disk or a half-plane in the z- plane conformally onto the interior of P is given by f(z)=C r D (all-z)a.-l dz+C'. Here b ll = f(a ll ), and C, C' are constants de- pending on the position and magnitude of the polygon P. If instead of a disk we consider a half-plane in the z-plane, and if one of the all is the point at infinity, then, modifying this for- mula by deleting the factor of the integrand corresponding to all = 00, we obtain a formula 77E Conformal Mappings for w= f(z). This representation is called the Schwarz-Christoffel transformation formula. This formula was originally derived by Chris- toffel to solve a problem of 2-dimensional distribution of stationary temperature. It then found extensive application to several problems of conformal mapping concerning polygonal domains and to problems of deter- mining force lines or stream lines and equi- potential lines in 2-dimensional electrostatics or hydromechanics. An analogous formula is also derived for a function that maps the interior of a circle or a half-plane conform ally onto the exterior of a polygon. In connection with these formulas, a third-order differential equation that is satis- fied by a function mapping the interior of a circle onto a domain bounded by a circular polygon is found useful in the theory of auto- morphic functions. Moreover, a representation analogous to the Schwarz-Christoffel trans- formation formula is obtained for a function that maps the interior of a circle onto a curvi- linear polygonal domain bounded by arcs of logarithmic spirals with the origin as asymp- totic point. E. Conformal Mapping of Multiply Connected Domains It is also important to consider problems concerning one-to-one conformal mapping of a multiply connected domain on the z-sphere onto a suitable multiply connected domain :D on the w-sphere. The two domains D and 1) are then homeomorphic, but the converse is not true; i.e., there does not necessarily exist a one-tO-One conformal mapping between D and 1) even when they are homeomorphic. Now let D and 1) be multiply connected domains on the z- and w-planes, respectively, both possess- ing at least three boundary points. The tuni- versal covering surfaces fj and :D of D and 1) are hyperbolic and they can be mapped onto the unit disks. The groups of their tcovering transformations form the tFuchsian groups G and <D. Then in order for D to be mapped one-to-one and conformally onto 1), it is neces- sary and sufficient that the group <D be trans- formed into the group G by a suitable linear transformation. To a domain of finite tconnectivity having only continua for its boundary components, we can associate conformal invariants (namely, moduli) expressed by one real parameter in the doubly connected case and by 3n - 6 real parameters in the n( > 2)-connected case. A one-to-one conformal mapping is possible only within a class of domains having the same invariants ( 416 Teichmiiller Spaces). 
77F Conformal Mappings While a circular disk is taken as a canonical domain in the simply connected case, an tan- nul us is often taken as a canonical domain in the doubly connected case. In the latter case, the logarithm of the ratio (> 1) of the radii of two concentric boundary circles is usually called the modulus. There are various types of n(? 2)-connected canonical domains, for in- stance, the whole plane, a circular disk or annulus slit along concentric circular arcs or radial segments, a parallel slit plane, etc. The possibility of a one-to-one conformal mapping of a given domain onto a canonical domain of such a type was proved by Hilbert, Koebe, and others in a tpotential-theoretic way and by E. Rengel, R. de Possel, H. Grunsky, and others in a purely function-theoretic way. Such ca- nonical domains are characterized by some extremal properties. For example, the horizon- tal parallel slit mapping function Po(z, zo) = (Z-ZO)-l +a[poJ(z-zo)+..., zoED, has the extremal property that it is the unique function maximizing Rea[f] within the family of univalent functions f(z) =(z - ZO)-l + a[f] (z- zo)+... in D. For a general domain D, Po(z, zo), called the extremal horizontal slit mapping, is defined as the limit function of the seq uence of horizontal slit mappings Pbn)(z, zo) of Dn, zoEDn' Here {Dn};;='=1 is a canonical ex- haustion of D. It has the same extremal prop- erty as in the case of finite connectivity. The boundary components of the image domain of D under Po(z, zo) consist of a horizontal slit and a point. The parallel slit mapping Po(z, zo) in the direction of 0 is defined similarly to the one that maximizes Ree- 2iO a[f] [8]. For a domain of infinite connectivity, by accumulation of boundary components, a pointlike boundary component can be mapped onto a continuum, A boundary component r of a domain D is called weak if its image {(I') is a point under every conformal mapping f on D. I' is called strong if f(y) always consists of more than One point. A boundary component y that is neither weak nOr strong is called unstable (L. Sario, J. Analyse Math., 5 (1956)). Criteria of weakness, etc., by means of ex- tremallength are given in [8]. Moreover, in terms of extremal length, a generalization of a boundary element is given by Oikawa and Suita [9]. On the other hand, the existence of one-to-one and conformal mapping of a domain bounded by a finite number of curves onto the whole plane with mutually disjoint circular disks removed was proved by Koebe and later derived by J. Douglas and R. Cou- rant as a particular case of the existence of a solution of tPlateau's problem [10]. M. Schif- fer showed that the mapping function is a solution of an extremal problem involving 292 Fredholm eigenvalues [11]. L. Bieberbach and H. Grunsky showed the possibility of mapping an n-connected domain onto an n-sheeted disk, Concerning doubly connected domains, detailed investigations were made by 0, Teich- mUller, Y. Komatu, and others. For a domain D bounded by n analytic Jordan curves, as an extension of the Schwarz lemma, Ahlfors showed that there exists a unique extremal function fo(z) maximizing Re I'(zo), Zo E D, within the family of analytic functions f(z) satisfying If(z)1 < 1 in D and that fo(z) maps D onto the n-sheeted unit disk (1947). fo(z) is called the Ahlfors function at zo0 For a general domain the uniqueness of the extremal func- tion in nontrivial cases was established first by S. Havinson (1961) and later by L. Carleson (1967) and S. Fisher (1969). Most tkernel functions of a plane domain D have connections with conformal mappings. For example, the Bergman kernel U(z,O of exact differentials is equal to (2n) -1(p(Z, 0 - P/2 (z, m. The adjoint kernel V(z,O is de- fined by (2n)-I(p(z,()+P/2(Z,m. M. Schiffer showed that the integral of V(z, (), S(z, 0 = St V(z, Odz, is univalent and maps D onto a domain bounded by n analytic convex curves if D is bounded by n Jordan curves [13]. For a general domain D, this is true in the sense that each boundary component of the image domain under S(z, 0 is a convex set (K. Oikawa and N. Suita, Kodai Math. Sem. Rep., 16 (1964)). It is known that S(z, () maximizes the area of the complementary set of the image domain under f(z) within the family of univa- lent functions f(z)=(n(z- m- l + bdz-O+.... The maximum value multiplied by 4n is called the span of the domain D; it is equal to a[poJ - a [Pn/2]. For the tSzegb kernel function k(z,O of a domain D bounded by n analytic curves, the adjoint kernell(z, () is defined, and the Ahlfors function fo(z) at ( is expressed by k(z, O/I(z, () (P. Garabedian, Trans. Amer. Math. Soc., 67 (1949)). F. Universal Constants Among various universal constants appearing in the theory of conformal mapping, Bloch's constant is especially famous. A. Bloch (1924) showed that a covering surface over the w- plane obtained from a mapping w = F(z) = z + ... that is one-to-one, conformal, and holomorphic in Izl < 1 always contains a tuni- valent (schlicht) disk whose radius B is a posi- tive number independent of the function F (Bloch's theorem). The supremum !B of such constants B is called Bloch's constant. The true value of!B is yet unknown, but estimates have 
293 been given by Ahlfors (Trans. Amer. Math. Soc., 43 (1938)) and by Ahlfors and Grunsky (Math. Z., 42 (1937)) in the form J3 C 1 / 4 r (I/3) ( r(11/12) ) 1/2 -.s;!B.s;yn 2 - 4 r(1/4) r(1/12) =0.4719... . M. Heins showed that the equality sign can be deleted from the left-hand side (Nagoya Math. J., 21 (1962)) and C. Pommerenke gave a simpler proof for the same fact (J. London Math. Soc., 2 (1970)). It is conjectured that the correct value of!B is equal to the upper bound of Ahlfors and Grunsky. Landau's constant £! corresponding to the case where the image disks are not necessarily univalent satisfies 0.5.s; £! < 0.55. The lower and upper bounds are due to Ahlfors (cited above) and Landau (Math. Z" 30 (1929)). Pommerenke showed 0.5 < £! (1970, cited above). For the family of univalent functions these constants coincide, and the value is called the schlicht Bloch constant, denoted by . Its lower and upper bounds are known to be 0.5705 (1. Jenkins, J. Math. Mech., 10 (1961)) and 0,6565 (R. Robin- son, Bull. Amer, Math. Soc., 41 (1935)). (For distortion theorems and coefficient problems  438 Univalent and Multivalent Functions.) References [IJ L. V. Ahlfors, Complex analysis, McGraw- Hill, second edition, 1979. [2J Z. Nehari, Conformal mappings, McGraw- Hill, 1952. [3J D. Gaier, Konstruktive Methoden der konformen Abbildung, Springer, 1964. [4J C. Caratheodory, Conformal representa- tion, Cambridge Univ. Press, 1932, second edition, 1952. [5J N. Ohtsuka, Dirichlet problem, extremal length and prime ends, Van Nostrand, 1970, [6J H. Grunsky, Lectures on theory of func- tions in multiply connected domains, Vanden- hoeck & Ruprecht, 1978. [7J G. M. Golusin, Geometric theory of func- tions of a complex variable, Amer. Math. Soc. Transl., 1969. (Original in Russian, 1952.) [8J L. Sario and K. Oikawa, Capacity func- tions, Springer, 1969. [9J K. Oikawa and N. Suita, On conformal mappings onto incised radial slit discs, K6dai Math. Sem. Rep., 22 (1970), 45-72. [lOJ R. Courant, Dirichlet's principle, con- formal mapping, and minimal surfaces, with an appendix by M. Schiffer, Interscience, 1950. [11J M. Schiffer, Fredholm eigenvalues of multiply connected domains, Pacific J. Math., 9 (1959), 211-269. 78B Conic Sections [12J S. Bergman, The kernel function and conformal mapping, Amer. Math. Soc. Math. Surveys, 5 (1970). [13J M. Schiffer, The span of multiply con- nected domains, Duke Math. J., 10 (1943), 209-216. [14J L. V. Ahlfors, Conformal invariants, McGraw-Hili, 1973. 78 (Vl.g) Conic Sections A. General Remarks Suppose that we are given two straight lines I and m intersecting at V (but not orthogonally) in the 3-dimensional Euclidean space E 3 . By rotating the line m around I, we obtain a sur- face 3'. We call this surface 3' a circular cone with vertex V and axis I; a straight line on the surface passing through V is called a generat- ing line of 3'. A section C of 3' by a plane n not passing through V (C = n n 3') is called a conic section (or simply a conic). This C is a plane curve on the plane n. The point set 3' - V consists of two tconnected components 3'1 and 3'2' Let n i (i = 1,2,3) be planes not passing through V. If the conic section C I = n l n 3' is tbounded, then C, is contained either in 3'1 or in !Y2 and is tconnected. We call such a C I an ellipse. When C z = n 2 n 3' is not bounded but is connected, then n 2 is parallel to one of the generating lines of 3', and C 2 is contained either in 3'1 or in 3'2' We call such a C 2 a parahola. When n 3 intersects both of 3'1 and 3'2, then C 3 =n 3 nF has two connected components and is not bounded, We call such a C 3 a hyperhoIa. These three types exhaust all possible types of conic sections. In particular, if the plane n is per- pendicular to the axis I, then C = n n 3' be- comes a circle. Thus a circle is a special kind of ellipse. B. Foci and Directrices Let C = n n 3' I be an ellipse, The Euclidean space E 3 is divided by n into two thalf-spaces Ei, E1 (two "sides" of n). If we put 3' 1 n Ef = 3'11,3'1 n E1 = 3'12, we can construct a sphere S that is contained in Ef, tangent to 3' II along a circle K, and tangent to n at a point F. Similarly, we can construct a sphere S' that is in E1, tangent to 3' 12 along a circle K', and tangent to n at a point F'. We call F, F' the foci of the ellipse (Fig. 1). 
78C Conic Sections Let K, K' be the planes containing K, K'. Straight lines d = K n n, d' = K' n n are called directrices of C. Unless C is a circle, we have F # F', and K, n (and K', n) actually intersect; hence d, d' exist. When C = n n !'S' is a parabola or a hyperbola (Figs. 2 and 3), we can similarly define foci (a parabola has only one focus, F, while a hyperbola has two foci, F, F') and directrices (a parabola has only one directrix, d, while a hyperbola has two directrices, d, d'). v Fig. I v Fig. 2 Fig. 3 Let X be a point on the plane n, let DAX) be the distance between the point X and a focus F, and let Dd(X) be the distance between X and a directrix d. Then the Curve C is the locus of the points X satisfying the condition DF(X)=e' Dd(X), where e is a constant. We call 294 e the eccentricity of the conic section C. Ac- cording as C is an ellipse, a parabola, or a hyperbola, we have e< 1, e= 1, or e> 1. A circle is an ellipse whose eccentricity is zero. An ellipse is also characterized as the locus of X such that FX +F'X =2a; a hyperbola IS the locus of X such that IFX -F'XI=2a, where a is a positive cOnstant. When there are two foci, the straight line FF' is perpendicular to direc- trices d and d'. C. Canonical Forms of Equations When C is a hyperbola or an ellipse that is not a circle, C has two foci, F and F'. In this case, the midpoint 0 of the segment F F' is the cen- ter of symmetry of C (when C is a circle, its center 0 is, of course, the center of symmetry of C). We call 0 the center of C; an elIipse Or a hyperbola is called a central conic. If we choose a rectangular coordinate system (x, y) ha ving 0 as the origin and F F' as x-axis, then the equation of C can be expressed in the form x 2 /a 2 :ty2/b 2 =1, a,b>O. According as C is an ellipse or a hyperbola, we take the + or - of the double sign. If C is an ellipse, we have a> b. Furthermore, e= J a 2 -b 2 /a if C is an ellipse and e== J a 2 + b 2 /a if C is a hyperbola. We also have F = (ae, 0) and F' =( -ae, 0); the equations of directrices are x == :t a/ e, On the other hand, if C is a parabola, the straight line that is perpendicular to the direc- trix d and passes through F becomes the axis of symmetry of C. We call this straight line the axis of C; the intersection 0 of the axis and C is called the vertex of C. If we choose a rectan- gular coordinate system (x, y) having 0 as the origin and having the axis of C as the x-axis, the equation of C can be expressed in the form (1 ) y2=4ax, a>O. (2) We call (1) and (2) the canonical (or stan- dard) forms of the equation of C. We call the associated coordinate system the canonical coordinate system. Suppose that C is an ellipse (hence a> b). Let A, A' be points of intersec- tion of the x-axis and the ellipse and B, B' be the points of intersection of the y-axis and the ellipse. We call AA' the major axis of C and BB' the minor axis of C. If C is a hyperbola and (x, y) is the canonical coordinate system, we call the x-axis the transverse axis and the y- axis the conjugate axis. If C is a central conic, the x- and y-axes of the canonical coordinate system are called the principal axes; if C is a parabola, the x-axis is sometimes called the principal axis. 
295 D. Properties of Ellipses An ellipse may be considered the image of a circle under a tparallel projection. Conse- quently, the s<:ction of a circular cylinder by a plane is an ellipse. Also, if we are given a circle C and a fixed diameter D of C, an ellipse is obtained as the locus of the points X lying on lines PM, which are perpendicular to D, with P E C, ME D, satisfying the condition that the ratio PM:XM is constant. Suppose that we are given two concentric circles having the center at the origin 0 and with radii a, b. Let P, Q be points of intersec- tion of moving half-lines through 0 and the two circles, Then the locus of points X of intersection of the ordinates (lines parallel to the y-axis) passing through P and the ab- scissae (lines parallel to the x-axis) passing through Q is an ellipse (Fig. 4). Suppose that the equation of an ellipse is given by x 2 /a 2 + y2/b 2 = 1, with a> b. Then the lengths of its major axis and minor axis are 2a and 2b, respectively. Fig. 4 Given an ellipse and its center 0, the circle with center 0 and diameter equal to the major axis of the ellipse is called the auxiliary circle of the ellipse. Given an ellipse C and its focus F, the auxiliary circle of C is the locus of the points X satisfying the condition that the line FX is perpendicular to a tangent line to C passing through X. Suppose that X is a point on an ellipse with foci F, F'. Let TT' be the line tangent to the ellipse at X (X lies between T and T'). Then the angle L TXF' is equal to L T'XF (Fig. 5). Consequently, the rays start- ing from one focus of an ellipse and "reflected" by the ellipse converge on the other focus of the ellipse. Also, the product of the distances from two foci of an ellipse to an arbitrary tan- gent is constant and is equal to b 2 . Fig. 5 78E Conic Sections The ellipse C: x 2 /a 2 + y2/b 2 = 1 is expressed parametrically in the form x==acosO, y==bsinO. (3) We call the parameter 0 the eccentric angle of a point (x,y) on C. Consequently, C is a tJor- dan curve and divides the plane into two parts, the inside and the outside. The inside is the set of points (x, y) satisfying x 2 /a 2 + y2 /b 2 < 1, and the outside is the set of points (x, y) satisfying x 2 /a 2 + l/b 2 > 1. The inside is a tconvex set. From a point Q outside C, two tangents to C can be drawn. The locus of points Q such that these two tangents are orthogonal is the circle x 2 + y2 = a 2 + b 2 , We call this circle the direct circle. The area of the "sector" OAX formed by two points A(a,O), X(acos e, bsinO) (0)0) and the origin 0 is abO/2=(ab/2) Arccos(x/a); the length of the arc Ax of the ellipse is repre- sented by the valu e of the telliptic integral a f: J I-e 2 cos 2 0de=aE(n/2-0,e). In particular, the area inside an ellipse is equal to nab, and the whole length of the ellipse is 4aE(0, e). With respect to a polar coordinate system (r, e) having the focus F(ae,O) as the origin and the ray directed positively along the x-axis as the initial line, the equation of the elJipse C is I r - 1 + I cos <p , b 2 [ - -- a (4) Here I is equal to half of the length of the chord that is perpendicular to the major axis and passes through the focus. (This chord is called the latus rectum of the ellipse.) Suppose that F is a fixed point and that X is a moving particle attracted toward F by a tcentral force inversely proportional to the square of the length of FX. Suppose further that X begins with an initial velocity whose direction is tangent to the ellipse C with focus F. Then X always moves on C, and the areal velocity described by the radius FX is constant (Kepler's second law). E. Properties of Hyperbolas Two straight lines x 2 /a 2 - y2/b 2 = 0, that is, y/x = ::I:: bfa, are tasymptotes of the hyperbola C: x 2 /a 2 - y2 /b 2 = 1. The hyperbola C': x 2 /a 2 _ y2/b 2 = -1 is called the conjugate hyperbola of C. When a = b, the asymptotes are ortho- gonal to each other, and C, C' are congruent. In this case, we call C a rectangular hyperbola (or equilateral hyperbola). When we draw parallels to asymptotes from a point X on C, the area of the parallelogram formed by these 
78F Conic Sections lines and two asymptotes is constant. (In par- ticular, when C is a rectangular hyperbola, the equation of C becomes xy=k 2 /2 if we take two asymptotes as coordinate axes. The seg- ment cut off by the asymptotes on the tangent to C at X is divided equally at X. In the case of the hyperbola as well, the product of the distances from two foci to an arbitrary tangent is constant and is equal to b 2 , and the angle between two straight lines joining two foci to a point X on C is divided equally by the tangent at X. A hyperbola C is represented parametrically by x=asec8, y=btan8. In this case also, we call 8 the eccentric angle of (x, y). If we use the thyperbolic functions and the parameter u, the equation of a hyper- bola can be written as x =acosh u, y=bsinh u instead of (3'). The area of a "sector" OAX formed by two points A(a,O), X(x,y) on the hyperbola and the origin 0 is, in this case, abu ab x ab x+ Jx2-a 2 -=-Arccosh-=-Iog . 2 2 a 2 a The length of the arc AX of the hyperbola is given by the elliptic integral f: e 2 x 2 _a 2 2 2 dx. x -a With respect to a polar coordinate system (r, <p) having the focus F(ae, 0) as origin and the ray directed positively along the x-axis as the initial line, the equation of the hyperbola C becomes I r - 1 - e cos <p , b 2 [=-, a where I is equal to half the length of the chord passing through the focus and perpendicular to the principal axis. (This chord, too, is called the latus rectum.) F. Properties of Parabolas The curve described by a particle attracted by "gravitation" in a fixed direction and affected by no other force is a parabola (G. Gali1ei). The tangent at a point X on a parabola makes equal angles with the straight line joining F and X and the direction of the principal axis (Fig. 6). Consequently, if the rays starting from the focus of a parabola are "reflected" by the parabola, they all become rays parallel to the principal axis. Let X' be 296 Fig. 6 (3') the point of intersection of the tangent at X(xo,yo) and the x-axis, X" the point of inter- section of the normal at X and the x-axis, and Xo the foot of the perpendicular from X to the x-axis. Then FX = FX', 6FXX' is an isosceles triangle, and XX' is divided equally by the y- axis. Consequently, the locus of the foot of a perpendicular from F to a tangent is the y- axis. Also, the length of subtangent X' Xo = 2xo, and the length of subnormal X" Xo = 2a = 20F. Conversely, a curve whose length of subnormal is constant is a parabola, The locus of the midpoints of parallel chords of a para- bola is a straight line parallel to the principal aXIs. With respect to a polar coordinate system having the focus as origin and the ray directed positively along the x-axis as the initial line, the equation of a parabola is (3") I r= l-cos<p , The area bounded by a chord BC and an arc BC of a parabola (Fig. 7) is equal to 4/3 the area of 6ABC, where A is the point of contact on the tangent of the parabola parallel to BC (Archimedes). Also, the length of the arc OX of parabola (2) is 1=2a. (4") (4')  Yo  1 YO+v'yo+a -v' yo+a +a og , 4a 2a X having the coordinates (x o , Yo). Fig. 7 G. Conjugate Diameters The diameter is a straight line passing through the center of a central conic, The locus of the midpoints of the chords parallel to a diameter d is another diameter d', called conjugate to d. 
297 Then the diameter conjugate to d' is d. The x- axis and y-axis of a canonical coordinate sys- tem form a set of conjugate diameters. Let 2a', 2b' be the lengths of the segments (some- times called conjugate diameters) cut off from d, d' by the curve or by the curve and the con- jugate one for hyperbolas and by w, the angle between d and d'. Then the following relations hold (as to the double signs :t, we take + in the case of an ellipse and - in the case of a hyperbola): a,2 :tb'2 =a 2 :tb 2 , a'b' sinw =ab. The product of the slopes of d, d' is eq ual to :t b 2 /a 2 . With respect to an oblique coordinate system having d and d' as axes, the equation of the curve is x 2 /a'2:t y2 jb'2 == 1. H. Confocal Conic Sections The set of ellipses and hyperbolas having two fixed points F, F' as foci is called the family of confocal central conics with foci F and F' (Fig. 8), The family of confocal central conics con- taining the ellipse x 2 /a 2 + y2/b 2 = 1 is repre- sented parametricalIy by x2 y2 -+-=1 a 2 + ..1. b 2 + ), . There exist only one ellipse and only one hyperbola that pass through a point inside each quadrant (for example, a point (xo,Yo), Xo > 0, Yo> 0, inside the first quadrant) and be- long to the family of curves. The ellipses and hyperbolas of the same family cut each other orthogonally. Thus parameters corresponding to ellipses and hyperbolas belonging to a family of confocal central conics define an orthogonal curvilinear coordinate system, calIed an telIiptic coordinate system ( 90 Coordinates). Fig. 8 The set of parabolas having a fixed point F as focus and a straight line passing through F as axis is called a family of confocal parabolas (Fig. 9). The family of confocal parabolas containing y2 = 4ax is the set of curves y2 = 4(a + ..1.) (x + ..1.). Such families also give rise to orthogonal curvilinear coordinate systems. 78J Conic Sections Fig. 9 I. Curves of the Second Order With respect to a rectangular coordinate sys- tem, a curve represented by an equation with real coefficients of the second degree with two variables x, y, ax 2 + 2hxy+by2 + 2gx + 2fy+c=0, (5) where (a, h, b) # (0, 0, 0), is called a curve of the second order. A curve of the second order is either an empty set, one point, one or two straight lines, or a conic section. For equation (5), we put Do=l I, a h 9 D== h b f 9 f c (6) and call D the discriminant of the curve of the second order. If Do # 0, D # 0, and the curve is not an empty set, then the curve is a central conic. If Do > 0, then the curve is an ellipse or an em pty set. If Do < 0, then the curve is a hyperbola. If Do=O, D#O, then the curve is a parabola. If D = 0, Do> 0, then the curve con- sists of one point. If D = 0, Do < 0, then the curve is two intersecting straight lines. If D == Do = 0, then the curve is an em pty set, one straight line, or two parallel straight lines. J. Poles and Polars Let F(x,y)=ax 2 + 2hxy+ by2 + 2gx +2fy+c =0 be the equation of a conic C and (xo, Yo) the coordinates of a point P on the plane, A straight line p* having the equation axox + h(xoY+xyo)+ byoy+g(x + xo) + f(y+ Yo)+c=O is calIed the polar of P with respect to C (Fig. 10). When the polar of a point P is I, we call P the pole of I and denote it by 1*. In general, 1* is uniquely determined by I, and P** = P, 1** = I. IfQEP*, then PEQ*. If P'EI, then I*EP'*. When a straight line passing through P inter- sects C at X, Y and intersects p* at P', then P, P' are tharmonic conjugate with respect to X, Y. In particular, if PEP*, then PE C, and p* becomes the tangent of C at P. Given a 
78K Conic Sections triangle 6PQR on the plane of C, we caIl the triangle with sides P*, Q*, R* the polar triangle of 6PQR. Let Q*nR*=p', R*np* ==Q', and p*nQ*=R', Then the three straight lines PUP', Q U Q', R U R' meet at a point (M. Chasles). When the polar triangle of 6PQR coincides with itself, then 6PQR is caIled a self-polar triangle, The polar of a focus is a directrix. Fig. 10 K. Curves of the Second Class When the coefficients u, v, w of a straight line ux+vy+w=O satisfy an equation with real coefficients of the second order, Au 2 + 2Huv+ Bv 2 + 2Guw+2Fvw+ Cw 2 =0, (5') where (A, H, B) #(0, 0, 0), the curve enveloped by these straight lines is caIled a cnrve of the second class. Let A be the discriminant defined analogously to Din (6) by using A, H, B,... instead of a, h, b,... . A curve of the second class (with A #0) is essentially the same as a curve of the second order with D#O. In order for the curve (5') with A # 0 to coincide with the curve (5) with D#O, it is necessary and sufficient that A, B, C, F, G, H be proportional to the tco- factors of a, b, c, f, g, h in the determinant D given by (6). If A==O, then (5') represents either the empty set, or a point (regarded as the set of straight lines passing through the point), or two points. From a projective point of view, a curve of the second order is defined as a locus of the point of intersection 1 n /' = X of corre- sponding lines 1 and I' when two tpencils of lines A(l, m,.,.), A'(/', m'",,) passing through two different centers A and A' are in corre- spondence under a tprojective mapping f (1. Steiner) (Fig. 11). From this it can be proved that three points of intersection of three pairs of opposite sides (AB, DE), (BC, EF), (CD, FA) of a hexagon inscribed in a curve of the second order are on the same straight line (Pascal's theorem, Fig. 12). In particular, if the curve of the second order in this theorem consists of two straight lines, the theorem coincides with Pappus's theorem (Fig. 13). We call this 298 straight line 1 the Pascal line of ABCDEF. Given a set of six points A, B, C, D, E, F on a curve of the second order, by considering all possible combinations of the points, we get 60 Pascal lines. A configuration consisting of these 60 lines is called Pascal's configuration, and has been studied by Steiner, Kirkman, and others. As a tdual to Pascal's theorem, Brianchon's theorem holds: Three diagonals of a hexagon with a curve of the second class inscribed meet at a point (Fig. 14).  .4' I' , m' m X y Fig. 11 Fig. 12  Fig. I3 Fig. 14 References [IJ G. Salmon, A treatise on conic sections, Longmans, Green, sixth edition, 1879 (Chelsea 1962). [2] H. F. Baker, Principles of geometry II. Plane geometry, Cambridge Univ. Press, 1922. [3] M. Protter and C. Morrey, Jr., Calculus with analytic geometry, Addison-Wesley, 1962. 79 (11.20) Connectedness A. General Remarks A ttopological space X is said to be connected if there are no proper closed subsets A and B of X such that A nB= 0 and AUB=X (c. Jordan, Cours d'Analyse 1,1983). A subset S of X is connected if S considered as a tsubspace of X is connected. If a subset S of X is connected, then the tclosure S is also connected. Let {A,} be a family of connected subsets of X such that either A,nAp# 0 or A,nAp# 0 for any pair A, and Ap. Then the union U,A, is con- nected. The continuous image of a connected set is connected The tproduct space IT, X, of a family of connected spaces {X,} is also con- 
299 nected. Let {A,} be a family of connected subsets of X and Ao be a connected subset of X. If Ao n A, # 0 for every A., then the union Ao U U, A, is also connected. For a point p of a topological space X, the union of all connec- ted subsets containing p is connected and is called the connected component of p (F. Haus- dorff, 1927). The n-dimensional Euclidean space En (n  0) and the n-dimensional unit sphere sn (n 1) are connected, whereas So, consisting of two points, is not connected, En\{O} is connected for n2, whereas EI\{O} is not connected. This fact implies that E I is not homeomorphic to En (n  2). A connected open subset of a topological space X is called a domain (or region) in X. A topological space X is said to be locally connected at a point p if for every open set U containing p, there is an open set V containing p and contained in the connected component of p in U. X is said to be locally connected if it is locally connected at each point of X. A space X is locaIly connected if and only if every connected component of every open subset is open in X. There are connected spaces that are not locally connected. For example, the comb space {(x, y)E R 2 1 x = l/n and 0 .s;y.s; 1 (n = 1,2, 3, ... ) or O.s; x .s; 1 and y = O} is not locally connected at (0, 1) (see Fig, 1), Illil i I I' , I II I I I i I I I I  Fig. 1 Comb space, B. Arcwise Connectedness Two points a and b of a topological space X are said to be joined by an arc in X if there is a continuous map l(t) of the closed interval 1= [0,1] into X such that f(O) =a and f(l) = b. A topological space X is said to be arcwise connected (or path-connected) if every two points of X are joined by an arc in X. For a point p of X, the union of all arcwise- connected subsets in X containing p is arcwise connected and is called the arcwise-connected component (or path-component) of p. A topo- logical space X is said to be locally arcwise connected at a point p if for every open subset U containing p, there is an open subset V 79C Connectedness containing p such that every two points in V are joined by an arc in U. X is said to be lo- cally arc wise connected if it is locaIly arcwise connected at each point of X. An arcwise-connected space is connected, but the converse is not true. The tsinusoid {(x,y)ER2Iy==sin l/x and O<x.s; lor x=O and - l.s;y.s; I} is connected but not arcwise connected. A complete metric space is arc- wise connected if it is connected and locally connected. C. Simple Connectedness and n-Connectedness Denote by sn the n-dimensional unit sphere and by D n + 1 the (n+ I)-dimensional unit disk. A topological space X is said to be n-connected if every continuous map f from sm to X is extendable over Dm+l (m = 0,1,2, "', n), 0- Connectedness is equivalent to arcwise con- nectedness. A I-connected space is also called simply connected. (For simple connectedness and n-ply connectedness of plane domains  333 Plane Domains) A topological space X is said to be locally n-connected at a point p if for every open subset U containing p, there is an open subset V containing p and contained in U such that any continuous map f: sm--> V is extendable to a continuous map J: D m +!--> U (m == 0, 1,2, ... ,n). X is said to be locally n-connected if it is locally n-connected at each point of X. A space X is said to be w- connected (or locally w-connected) if X is n- connected (or locally n-connected) for every n. sn is (n - 1 )-connected but not n-connected. Similarly for En+! \ {O}, If X is n-connected (n I), then the tsuspension SX is (n+ 1)- connected and the tloop space QX is (n - 1)- connected. The Hawaiian earring { (x, y) E R21(x_l/n)2 + y2== l/n 2 (n= 1, 2, 3,...)} is arcwise connected but not locally simply con- nected. The tcone over the Hawaiian earring is simply connected, but not locally simply connected (Fig. 2). \ J Fig. 2 Hawaiian earring and its cone, A topological space X is said to be contrac- tible if the identity map 1 x is thomotopic to a constant map c Xo to a point Xo of X (K. Bor- suk, Fund. Math., 24, 1935). A topological space X is said to be locally contractible at a 
79D Connectedness point p of X, if for every open subset U con- taining p, there is an open subset V containing p and contained in U such that the inclusion i: V c U is homotopic to a constant map to a point in U. X is said to be locally contractible if it is locally contractible at each point of X. A contractible or locally contractible space is w-connected or locally w-connected, res- pectively. The n-dimensional simplex, the n- dimensional disk D n , and the n-dimensional Euclidean space En are contractible. More generally, a tconvex set in En is contractible. tTopological manifolds and tpolyhedra are locally contractible. D. Continua and Discontinua A topological space is said to be totally dis- connected if each connected component con- sists of one point. Divide the closed interval I = [0, 1 J into three eq ual parts, and let 111 and 1 12 be the closed intervals obtained from I by removing the middle open interval. As the next step, divide 1 11 and 1 12 into three equal parts, respectively, and remove the middle open intervals. Inductively we obtain 2 n +1 closed intervals In+l,; (i= 1, ",,2n+l) from 2 n closed intervals In,j by removing open intervals lying in their middles. Let Dn) = Ul1 In, i and C = n1 Dn). Then C is called the Cantor dis- continuum or simply the Cantor set or the ternary set (G. Cantor, Math. Ann., 21 (1883) (Fig. 3). " I ;) " '-....../21/ " 1 23 / '--/22/ Fig. 3 Cantor discontinuum. C is a subset of I consisting of points with coordinates t = (nd3) + (n 2 /3 2 ) + ... + (n;/3 i ) + "', where n i = 0 or 2. As a topological space C is homeomorphic to the Cartesian product of countably many copies of the discrete space D = {O, I}. C has the power of the continuum c and is a compact, totally disconnected, tperfect set. The Cartesian product of infinitely many (countably or not) copies of D is called the general Cantor set. The continuous image of a general Cantor set is called a dyadic com- pactum, Compact metric spaces and compact groups are examples of dyadic compacta. A continuum is by definition a connected compact metric space consisting of more than one point. A metric space X is said to be well- chained if for every two points a, band f. > 0 there are points Xl> X2,"" X n - I such that d(x i ,x i +1)<6 (xo=a,xn=b). A well-chained compact metric space is a continum, Let K be a continuum containing two 300 points a and b. Then K is said to be irreducible between a and b if there is no proper subcon- tinuum of K containing a and b (L. Zoretti, Ann. Sci. Ecole Norm. Sup., 26 (1909») ( 93 Curves). A continuum K is said to be indecomposable if there are no proper subcontinua K I, K 2 such that K = K I UK 2 (L. E. J. Brouwer, Math. Ann., 66 (1910). Simple examples of inde- composable continua have been given by A. Denjoy, C. R. Acad. Sci. Paris, 151 (1910); K. Yoneyama, Tohoku Math. J., 12 (1917); and B. Knaster, Fund. Math., 3 (1922). C. Kuratowski conjectured the following: If a plane continuum K is homogeneous (that is, for any two points a, bEK there exists a homeomorphism h: K --> K with h(a) = b) then K is homeomorphic to the circle. A counter- example for this conjecture has been found by R. H. Bing and E. E. Moise. It is an indecom- posable, homogeneous, plane continuum and is called the pseudo-arc [8]. References [IJ K. Kuratowski, Topologie II, Monograf. Mat. (1950). [2J S. Lefschetz, Topics in topology, Ann. Math. Studies, Princeton Univ. Press, 1942. [3J R. L. Wilder, Topology of manifolds, Amer. Math. Soc. Colloq. Publ., 1949. [4J G. T. Whyburn, Analytic topology, Amer. Math. Soc. Colloq. Pub!.. 1942. [5J S. T. Hu, Elements of general topology, Holden-Day, 1964. [6J O. Hanner, Retraction and extension of mappings of metric and non-metric spaces, Ark. Mat., 2 (1952), 315360. [7J K. Borsuk, Theory of retracts, Monograf. Mat. (1967). [8J 1. G. Hocking and G. S. Young, Topology, Addison-Wesley, 1961. Also  references to 425 Topological Spaces, 80 (VII.4) Connections A. History The geometric notion of connections orig- inated with T. Levi-Civita's parallelism (Rend. Cire. Mat. Palermo, 42 (1917)) and was later generalized to the notion of connections of differentiable fiber bundles. Notions such as affine connections, Riemannian connections, projective connections, and conformal connec- tions can be described in terms of bundles 
301 constructed from the tangent bundles of dif- ferentiable manifolds. They are also standard examples of the Cartan connections formu- lated by E. Cartan and C. Ehresmann. B. Connections in Principal Bundles Let P = (P, n, M, G) be a differentiable tprin- cipal fiber bundle. (For the sake of conve- nience, we assume that differentiability always means that of class COO,) The total space P and the base space Mare tdifferentiable manifolds, and the projection n is a differentiable map- ping, The tstructure group G is a tLie group and acts on P from the right as a transforma- tion group. On each fiber, G acts transitively without fixed points. For elements a, x in G, P, we write Ra(x)==xa. The mappings induced on ttangent vector spaces by Ra and n will be denoted by the same letters, namely Ra: TAP) --> Txa(P), n: Tx(P)--> T,,(x}(M). The tangent vec- tor space (P) at each point x of P is mapped by the projection n onto the tangent vector space Tp(M) at the point p= n(x) of M. The kernel of this mapping is denoted by VAP), and each vector in VAP) is said to be vertical. The kernel Vx(P) is the totality of elements of TAP) that are tangent to the fiber. C. Connections We say that a connection is given in P if for each point x E P, a subspace Qx of the tangent space TAP) is given in such a way that the following three conditions are satisfied: (i) (P)= VAP)+Qx (direct sum); (ii) Ra(QxJ= Qxa(Q is invariant under G); and (iii) the map- ping x-->Qx is differentiable. A vector in Qx is said to be horizontal. Now suppose that X is an arbitrary tvector field on P. By condition (i), the value Xx of X at each point x of P can be expressed uniquely as X x = +Zx, where E VAP) and ZxEQx' The vector fields Y and Z defined by  and Zx (x E P) are called the vertical and horizontal components of X, respectively. Condition (iii) implies that if X is a differentiable vector field, then its horizontal and vertical components are also differentiable vector fields. Let X be a vector field on the base space M. Since n de- fines an isomorphism of Qx and Tp(M) (p = n(x», we have a unique vector field X* on P such that (a) n(X*)=X and (b) X:EQx' We call X* the lift of X, and it is invariant under G by condition (ii). Suppose that a connection is given in p, If C is a piecewise differentiable curve in the base space M, we can define a mapping <p that maps the fiber over the initial point p of C 80E Connections onto the fiber over the endpoint q of C as follows: Take an arbitrary point x on the fiber at p. Then we have a unique curve C: in P starting at x such that (a) n( C:) = C, and (b) each tangent vector to C: is horizontal. (C: is called a lift of C that starts at x.) The endpoint y of the curve C: belongs to the fiber over q. We set <p(x) = y. Because C: a = Ra(Cn the mapping <p commutes with transformations of G. We call this mapping <p the parallel dis- placement or parallel translation along the curve C. D. Holonomy Groups Fix a point p in the base space. If C is a closed curve in M starting from p, the parallel dis- placement along C maps the fiber over ponto itself. So if we fix a point x on the fiber over p, x is transformed by the parallel displacement to a point xa (a E G). Thus each closed curve C starting from p determines an element a(x, C) of G. If C varies over the set of closed curves that start from p, the totality of such elements of G forms a subgroup of G. This subgroup is called the holonomy group of the connection defined over P with the reference point x, If M is connected, holonomy groups with different reference points are conjugate. In the above, if we choose as the closed curves C starting from p only those curves that are null-homotopic, the elements a(x, C) form a subgroup of the holonomy group. This is called the restricted holonomy group. The holonomy group is a tLie subgroup of the structure group, and its con- nected component containing the identity coincides with the restricted holonomy group. Holonomy groups are useful in the study of the behavior of connections. E. Connection Forms Let 9 be the Lie algebra ( 249 Lie Groups) of the structure group of G of a principal fiber bundle P =(P, n, M, G). For each A in g, the 1- parameter subgroup exptA (-00 < t < 00) of G defines a tone-parameter group RexP'A of trans- formations on P, and it determines a vector field A * on P ( 105 Differentiable Mani- folds). Each element of the vector field A * is vertical at each point x on P, and the A * (A E g) at x generate Vx(P). Moreover, for each element a of G we have Ra(A*)=(ad(a-I)A)*. For a connection in P, we define the connec- tion form w on P with values in 9 by the fol- lowing: (i) wx(A;) = A (AEg), and (ii) wx(X)=O (X E QJ, The connection form w thus defined satisfies (iii) R:(w)=ad(a-I)w (UEG), where R:(w) is the tdifferential form induced by the 
80 F Connections transformation Ra from the differential form w. Conversely, given a I-form w with values in 9 that satisfies conditions (i) and (ii), we can define a connection in P by defining vectors X such that w(X)=O as the horizontal, and its connection form coincides with w. Thus giving a connection in P is equivalent to giving a connection form in P. In particular, when a principal fiber bundle P is trivial, i.e., when P=M x G, we can iden- tify the tangent vector space TAP) at a point x = (p, g) of P with the direct sum of Tp(M) and Tg( G). If we set Qx = T p ( M), then Q defines a connection in P= M x G. Such a connection is called flat. When a connection can always be expressed as above locally, it is called locally flat. Since each principal fiber bundle is locally a product fiber bundle, we see that locally there exists a connection. If the base space M is tparacompact, we can show the existence of connections on P. F. Extension and Restriction of Connections When a principal fiber bundle P=(P, n, M, G) has a treduced fiber bundle P', we shall con- sider the relation between the connections of P and of P'. Let G' be a Lie subgroup of G and g' its Lie algebra. We shall denote by j both the injection of G' into G and also the injection of g' into g. If there exist a differentiable principal fiber bundle P' = (P', n', M, G') and a differenti- able embedding f of P' into P such that no f = n' and foRa = Rj(a) 0 f(aE G') are satisfied, then (P'J) is said to be a reduced fiber bundle of P. Then we have f*(A:) = j(A)j(x) for each AEg' and XEP'. Suppose that a connection is given in P'; we denote the horizontal space at the point x of P' by Q. At the point f(x) of P, we take f*(Q) as the horizontal space and transform it by right translations of G. Thus we obtain a con- nection on P. Let w' and w be the correspond- ing connection forms. Then we have j 0 w' = f*(w) on P'. Conversely, suppose that we are given a connection in P with the connec- tion form w. If the induced form f*(w) on P' has values always inj(g'), we can write {*(w)=jow', and w' defines a connection in P'. In this case the connection in P is called an extension of the connection in P', and the connection in P' is called the restriction of the connection in P. G. Curvature Forms Suppose that a principal fiber bundle P= (P, n, M, G) has a connection. Let F be a finite- dimensional vector space and (X be a differen- tial form of degree k on P with values in F. We 302 define the covariant differential D(X of (X by (D(X)(X I ,..., Xk+ d=(d(X)(hX I , "" hXk+ d, where the Xi are vector fields on P and h de- notes the projection to the horizontal compo- nent. D(X is a differential form of degree k + 1 on P with values in F. Let p: G-->GL(F) be a trepresentation of a Lie group G on F. A differential form (X on P with values in F is called a pseudotensorial form of type p if (X satisfies R:( (X) = p( a -I)(X (aEG). In particular, if a pseudotensorial form (X satisfies t(A *)(X = 0 for any A E 9 ( 105 Dif- ferentiable Manifolds Q), it is called a tensorial form of type p. For each representation p of G, we can construct an associated vector bundle E over M with fiber F. A tensorial form of type p is identified with a differential form on M with values in E. If (X is a pseudotensorial form of type p, then D(X is a tensorial form of type p. For a connection form w on P, the co- variant differential Dw = 0. of w is called the curvature form of the connection. Since w is a pseudotensorial form of type ad, 0. is a ten- sorial form of type ad. For the connection form we have the structure equation dw= - [w, wJ +0. [4,6]. Let X and Y be vector fields on M, and let X* and y* be their lifts, respectively. Then we have w([X*, Y*J)= Q(X*, Y*), which shows that the curvature form 0. for X*, y* gives the vertical compo- nent of [X*, Y*]. It is known that the following three con- ditions for a connection are equivalent: (i) The connection is locally flat. (ii) The curvature form vanishes. (iii) The restricted holonomy group is trivial (i.e., the identity group). The following two theorems are fundamental: (1) Suppose that a connection is given in a principal fiber bundle P=(P, n, M, G). Then the structure group of P can be reduced to the holonomy group [4,6]. In fact, for XE P, let P(x) be the set of points y in P that can be connected to x by a piecewise horizontal curve in P. Then P(x) gives a reduced fiber bundle of P, and the connection in P is an extension of a connection in P(x) [4, 6J. (2) The Lie algebra of the holonomy group with a reference point x in P coincides with the vector subspace of 9 spanned by {Qy(X, Y)I YEP(X),X, YE1;'(P)} [4,6]. The curvature form 0. is used to express the tcharacteristic classes of the bundle P [1,2] ( 56 Characteristic Classes). In some cases, a connection in the principal fiber bundle induces a connection in an tas- sociated fiber bundle. In particular, when G is GL(n, R) or GL(n, q, we can define a connec- tion in any associated vector bundle. The notion of connections in vector bundles can be 
303 defined more algebraicalIy (M. F. Atiyah, Trans. Amer. Math. Soc., 85 (1957)) and can also be defined as a kind of differential oper- ator on vector bundles [8]. H. Affine Connections Let M be a differentiable manifold of dimen- sion nand P be the tbundle of tangent n- frames over M, Then P has the structure group GL(n, R), and it is the principal bundle associated with the tangent vector bundle of M, which consists of aII tangent vectors of M. A connection in the bundle of tangent n- frames is calIed an affine connection (or linear connection) on M. An affine connection on M defines (as well as the curvature form Q) a new form e called the torsion form on P, which is given as folIows: Let F be an n-dimensional vector space with a fixed basis (I '2' . ., 'n)' Since the bundle of tangent n-frames Pis the set of aII bases (i.e., n-frames) (e l , ..., en) in Tp(M) at each point p of M, every point x=(eJ,'" ,en) of P is given as a mapping x of F onto I;,(M) (p=n(x» defined by x:i-->ei' We define differential form 8 of degree 1 with values in F on P by 8AX)=r l (n x (X) (X E T.(P». 8 is calIed a canonical I-form of the bundle of tangent n-frames of the manifold M and has the following property: Any diffeo- morphism <p of M onto itself induces a bundle automorphism ip of P onto itself, and ip pre- serves 8, that is, ip*(8)=8. Conversely, we can show that any bundle automorphism of P that preserves 8 is induced by a diffeomorphism of the base space M. For an affine connection on M, we define the torsion form e by e = D8. e is a differen- tial form of degree 2 on P with values in F and satisfies R a e=a- l .e (aEGL(n,R)). Further- more, we have the structure equation for e, d8= [w, 8] + e [2,4,6]. For each element  in F, there exists a unique horizontal vector field B@ on P such that 8(B@)=. B() is ca\1ed the basic vector field corresponding to. At each point XEP, B(I)x,"', B(n)x form a basis of Qx' Let {AJ, ..., Am} (m = n 2 ) be a basis of 9 = gI(n, R). Then at each point x E P, {(Af)x, ... , (A::',)x, B(dx,"', B(n)x} is a basis of the tangent vector space Tx! P). Thus the bundle P of frames is a tparalIelizable manifold. The pro- jection to M of any tintegral curve of a basic vector field is a geodesic, which is defined in Section I [4]. An affine connection on M gives a parallel displacement of the tangent vector space of M as folIows: Let C = p, (O.s; t.s; 1) be a curve in M and C* =x, be a lift of C to P. The paralIel displacement of the tangent n-frame Xo at Po 80 I Connections along the curve C is x" and the mapping X,O Xc:;I: Tpo(M)--> Tp,(M) is ca\1ed the parallel dis- placement of the tangent space Tpo(M) onto Tp,(M) along C. It is easily seen that the map- ping is independent of the choice of lifts. I. Covariant Differentials Let C = {p,} (O.s; t.s; 1) be a differentiable curve in M. If we have a vector Y, in Tp,(M) for each t and the correspondence t --> Y, is differenti- able, then {Y,} is ca\1ed a vector field along the curve C. For {Y,} we set Y,'=lim(1/t)(<p,-(Y,+h)- Y,), hO . where <P"h is the para\1el displacement of I;,,(M) onto Tp,+h(M) along the curve C. The vector field { Y,'} along C thus obtained is called the covariant derivative of { Y,}. {Y,} is para\1el along C; that is, Y, == <Po, ,( Yo) if and only if Y,' == O. In particular, if the tangent vectors to a curve Care para\1el along C itself, then C is said to be a geodesic. Let X and Y be vector fields on a manifold M with an affine connection. The covariant derivative V x Y of the vector field Y in the direction of the vector field X is defined as follows: Let Po be a point in M, C = {p,} ( - E.s; t.s; E) be an integral curve of X through Po, and {<p,} be the parallel displacement along C. We set (V x Y) p =lim (1/t)(<p,-1 (Y p )- Y p ). o t-JoQ I a Then V x Y is also a vector field on M. The mapping (X, Y)--> V x Y satisfies the folIowing three conditions: (i) V x Y is linear with respect to X and Y; (ii) V fX Y = f' V x Y; and (iii) Vx(/Y) = f' V x Y + (Xi). Y, where f is a differentiable function on M. Conversely, if a mapping satisfying conditions (i)-(iii) above is given, then there exists a unique affine connec- tion on M whose covariant derivative coin- cides with the given mapping [4,6]. Fix a vector field Y. Then the mapping X -+ V x Y defines a ttensor field of type (1, 1). This tensor field is called the covariant differential of Yand is denoted by VY Now fix a vector field X. Then the mapping Y --> V x Y can be naturally extended to tensor fields of arbitrary type, and it commutes with the tcontraction of the tensors. For a tensor field K this is denoted by K -->VxK. Furthermore, the mapping X--> VxK is ca\1ed the covariant differential of K and is denoted by VK. We call VxK the co- variant derivative of K in the direction of X. A tensor field K is invariant under para\1el dis- placements if and only if VK =0 ( 417 Ten- sor Calculus). 
80J Connections J. Curvature Tensors and Torsion Tensors For an affine connection on M, the curvature tensor R and the torsion tensor T are defined by R(X, Y)(Z) = Vx(VyZ) - Vy(VxZ) - V[X, Y](Z), T(X' y)=V x Y -VxX -[X, Y], where X, Y, and Z are vector fields on M, and Rand T are tensors of types (1, 3) and (1,2), respectively. Also, in terms of the curvature form Q and the torsion form 8 on the bundle of tangent n-frames P over M, they can be defined by Rp(X, Y)(Z)=rl.QAx*, Y*)'x(Z), Tp(X, Y)=r l (8AX*, Y*), where n(x) = p, X, Y E I;,(M), and X*, y* are lifts of X, Y, respectively. The curvature tensor and the torsion tensor satisfy the relations R(X, Y)= -R(Y,X), T(X, Y)= -T(Y,X). Moreover, Bianchi's identities hold: 6(R(X, Y)(Z)) = 6(T(T(X, Y), Z) + (V x T)(Y, Z) and 6«(V x R)(Y,Z)+R(T(X, Y),Z»=O, where 6 denotes the sum of terms that are obtained by cyclic permutations of X, Y, Z [4]. For instance, in the case of a Riemannian connection (Section K), we have T=O, and Bianchi's identities reduce to R(X, Y)(Z) + R(Y, Z)(X) +R(Z,X)(Y)=O, (VxR)(Y, Z)+(VyR)(Z,X) + (VzR)(X, Y)=O. We now consider a system of coordinates (Xl, ..., x n ) in an n-dimensional linear space M = Rn. The vector fields (Xl,'" ,X n ) (Xi=8/8x i ) form a basis for vector fields on M. If we set VX,Xj=O, we get an affine connection Qn R". For such a connection we have R=O, T=O, and any straight line in Rn is a geodesic with respect to this connection. The connection is called the canonical affine connection on Rn. An affine connection on a manifold M satisfies R =0 and T=O if and only if the connection on M is locally isomorphic to the canonical affine connection of Rn. Let <p be a diffeomorphism of a manifold M with an affine connection onto itself. We calI <p an affine transformation of M if the induced automorphism cp on the bundle of tangent n- frames preserves the connection. In terms of covariant differentials, this condition is equiva- lent to the condition V,,(X)<p(Y) = <p(V x Y) for any vector fields X, Y. An affine transforma- tion of the canonical affine connection on Rn 304 is an ordinary affine transformation. For an affine connection on a manifold M, the set of all affine transformations forms a Lie group and acts on M as a Lie transformation group [4,5]. Let M be a manifold with an affine connec- ti on. M is caIled an affine locally symmetric space if V R = 0 and VT = 0 are satisfied. These conditions are satisfied if and only if at each point p of M, there exist a neighborhood U of p and an affine transformation <p of U such that <p2 = 1 and p is the isolated fixed point of <po If for each point p of M there exists an affine transformation of M such that <p2 = 1 and p is an isolated fixed point, then M is called an affine symmetric space. A symmetric Riemannian space is a special case of this type ( 413 Symmetric Spaces). At each point p in a manifold M with an affine connection, we can choose local coordi- nates (Xl, ..., x n ) such that Xi(p) = 0 and the curve xi=ait (-J < t <J) is a geodesic for each (ai, ..., an) with L(aif = 1. Such local coordi- nates are caIled geodesic coordinates at p [4]. With respect to geodesic coordinates, we have (Vx,(X))p=O (8/8x i =X;j. K. Riemannian Connections When a Riemannian metric g ( 364 Rieman- nian Manifolds) is given on a manifold M, it defines a metric on the tangent space Tp(M) at each point p of M, and we can take ortho- normal bases in Tp(M). The set P' of all ortho- normal bases of tangent spaces is a subset of the bundle P of tangent n-frames of M and forms a subbundle of P; its structure group is the torthogonal group O(n), and P' gives a reduction of the bundle of tangent n-frames. Conversely, when a reduction of the bundle of frames to O(n) is given, we can define a Riemannian metric on M such that the re- duced bundle consists of all orthonormal frames. For a Riemannian metric 9 on M, there exists a unique affine connection on M such that (i) Vg=O, and (ii) the torsion tensor T vanishes [4]. This connection is called the Riemannian connection corresponding to g. The first condition is equivalent to the invar- iance of the Riemannian metric 9 under paral- lel displacement. Thus the affine connection transforms orthonormal bases on M to ortho- normal bases and induces a connection in the bundle P'. It is known that the restricted holo- nomy group of any Riemannian connection is a closed subgroup of O(n) [4]. An affine con- nection on a manifold M is caIled a metric 
305 connection if it preserves a Riemannian metric g on M, i.e., if it satisfies the condition (i). L. Representations in Local Coordinates (1) Let (x I , ... ,xn) be a local coordinate system in a manifold M and consider vector fields Xi=olox i . For an affine connection on M, the covariant derivative can be expressed as V'x(X k )= LrjkXi' J . , The rjk are ca\1ed coefficients of the affine connection with respect to the 10cal coordinate system (xl, ... , x n ). We denote by rjk the coeffi- cients of connection with respect to another loca1 coordinate system (y', "', yn). Then on the intersection of their coordinate neighbor- hoods, we have _. oyi ( OXPOX Y 02X' ) r;k==.y aX' ayjayk rpY+ ayjayk . Conversely, if the rjk are given in each local coordinate system of M and satisfy this rela- tion on each intersection of their coordinate neighborhoods, then there exists a unique affine connection such that the coefficients of the connection are given by rjk' (2) The coefficients of the Riemannian con- nection corresponding to a Riemannian metric g=Lgijdxidx j on a manifold M are given by r i = " il { Ogjl Og'k _ 09jk } jk 2"j-g ox k + ox j ox ' and are ca\1ed the Christoffel symbols. (3) With respect to each local coordinate system (Xl, .." x n ), we express the torsion ten- sor T and the curvature tensor R of an affine connection by " T i d j d k T=L... jk x &; X &;X i , ijk R= L R;k,dxj@dxk@dx'@Xi' ijkl The components  and R;kl are given by =rjk-nj, R;kl = (0r&lox k - orjlox') + L (rlj rm - r:j rim). m (4) Let K =(K:..'j) be a tensor field of type (r, s). Then the covariant differential V' K = (K i 1 ..I. ) is given by Jt...Js,k K i 1' i, ==OK i 1 i  / axk ltu.Js;k It...Js +  ( " i"Ki1...1 "' ) L...J L. kl It .Js <%=1 I s ( ) _ r m K i 1... i , I L kjp j1 m...j, . p1 m 80N Connections (5) A curve Xi=Xi(t) is a geodesic if and only if d 2 x i . dxjdx k dt 2 +r;kdtdt=O' i=I,2,...,n ( 178 Geodesics, 417 Tensor Calculus). M. Cartan Connections Let M be a differentiable manifold of dimen- sion n. Consider a homogeneous space F = GIG' of the same dimension n, where G is a Lie group and G' is a closed subgroup of G ( 199 Homogeneous Spaces). Let B = (B, M, F, G) be a fiber bundle over M with fiber F and struc- ture group G, and P = (P, M, G) be the prin- cipal fiber bundle associated with B. Suppose that there exists a cross section f over M to B. Then the structure group of P can be reduced to G'. We denote this reduced fiber bundle by P' =(P', M, G') and the injection of P' into P by j ( 147 Fiber Bundles). Suppose that a connection is given in P. Its connection form w is a differential form of degree 1 on P with values in g, and the in- duced form w' = j*(w) is also a differential form of degree 1 on P' with values in g. We call the connection in P a Cartan connection on M with the fiber F = GIG' if at each point x of p', w gives an isomorphism of Tx(P') onto 9 as linear spaces. Such a connection in P is equivalently defined as a I-form w' on P' with values in g satisfying the fo\1owing three con- ditions: (i) w'(A *) = A (A E g' (Lie algebra of G')); (ii) R:(w')=ad(a-1)w' (aEG'); and (iii) w gives an isomorphism of TAP') onto g at each point XEP'. For such w', we can take a con- nection form w in P such that w' = j*(w); w defines a Cart an connection. N. Soudures A cross section f over M to B gives a vector bundle T'(B) on M defined as follows: For each point p of M, the projection B-->M de- fines a mapping Tf(p)(B)--> 'Tp(M). The kernel of this mapping is denoted by Vf(p)(B). Then T' (B) = UP Vf(P)( B) forms a vector bundle over M, and the dimension of its fibers is equa1 to n=dimF. A Cartan connection in P gives a bundle isomorphism between T'(B) and the tangent vector bundle T(M) of Mas fo\1ows: Let x be an arbitrary point in P', and put p = n(x). The projection n: P' -->M induces an isomorphism of 'T.:(P')/VAP') onto Tp(M). On the other hand, w gives an isomorphism of Tx(P')jVAP') onto gig'. As a point in P', x gives a mapping of F 
800 Connections = GIG' onto the fiber in B over p and sends the point {G'} in F to f(p). By this map- ping, To(F)=/g' is mapped isomorphically onto Vf(p)(B). Combining these isomorphisms, we get an isomorphism between Tp(M) and Vf(p)(B) that is independent of the choice of x E P' over p. The set of such isomorphisms for pEM defines a bundle isomorphism of T(M) and T'(B). If a fiber bundle B over M has an isomorphism such as above through a cross section, then B is said to have a soudure [3]. Conversely, if a fiber bundle B over M has a soudure with respect to a cross section f, then there exists a Cart an connection in P such that the soudure given by the connection coincides with the original one [3]. There are many Cartan connections in P with the given sou- dure. However, when F = GIG' is a symmetric space of compact type such that G is noncom- pact and contains thc identity component of the group of isometries, we can determine uniquely the so-called normal Cartan connec- tion among the Cartan connections which gives rise to the given soudure [10]. For the tangent vector bundle T(M) of M, the fiber F is an n-dimensional linear space and can be expressed as F=GIG', where G is the taffine transformation group of F and G' = GL(n, R). Then T(M) has the O-section over M, and there exists a natural soudure. Fur- thermore, an affine connection on M canoni- cally induces a Cartan connection on M with the fiber F = GIG' [3]. For a Cartan connection on M, we can introduce the notion of development of a curve in M into the fiber and also the notion of completeness [3]. O. Projective Connections Let F 1 = G 1 /G; and F 2 = G 2 /G; be homoge- neous spaces with dim FI = dim F 2 = n. Sup- pose that G I is a Lie subgroup of G 2 and G; is contained in G; by the injection. Then we have a canonical injection F 1 -->F 2 (Fl is an open subset of F 2 by the assumption). Suppose that a fiber bundle BI with fiber Fl over M has a cross section fl' Using fl' we can construct a bundle B 2 with fiber F 2 over M which also has a cross section f2' The prin- cipal bundle of B 2 is given by extending the structure groups from the principal bundle of Bl' We can show that if B 1 has a soudure with respect to the cross section fl' then B 2 also has a soudure with respect to the cross sec- tion f2' A Cartan connection in the principal fiber bundle PI associated with B 1 that is com- patible with a given soudure on BI induces a Cart an connection in the principal fiber bundle P 2 associated with B2' which then 306 induces a soudure on B 2 . The latter is called a Cartan connection induced from the former. Let F 1 be an n-dimensional linear space and F 2 be the real tprojective space of dimension n, Then the affine transformation group of FI can be embedded into the projective transfor- mation group of F 2 . Thus the tangent vector bundle of a manifold M induces a fiber bundle over M with the n-dimensional projective space as its fiber. A Cartan connection in this fiber bundle is called a projective connection on M. By the argument in this section, we see that every affine connection on M induces a projec- tive connection on M. Given two affine connections on M, we denote by V and V' their corresponding covar- iant differentials. The two affine connections on M induce the same projective connection on M if and only if there exists a differential form <p of degree 1 on M such that Vi Y - V x Y = <p(X) Y + <p(Y)X for any vector fields X, Y[7]. If a diffeomorphism <p of M preserves the projective connection induced by an affine connection in M, then <p maps geodesics of M into geodesics. P. Conformal Connections Let F 1 be an n-dimensional Euclidean space and F 2 an n-dimensional sphere (a tconforma1 space). We can embed the group oftisometries of Fl canonicaIly into the group of tconformal transformations of F 2 . A Riemannian metric of the tangent vector bundle of a manifold M of dimension n gives a fiber bundle over M with fiber F 2 . A Cartan connection in this fiber bundle is called a conformal connection on M; a Riemannian connection on M induces a conformal connection on M. Two Riemannian metrics 9 I' g2 on M in- duce the same conformal connection on M if and only if there exists a positive function f on M such that g2 = fgl' Thus for a Rieman- nian manifold M with metric tensor 9 I' a dif- feomorphism <p of M such that <p*(g)= fgl leaves invariant the conformal connection induced by gl' Such a <p is called a conformal transformation of M with respect to the given Riemannian metric gl' For a Riemannian manifold M with metric tensor g, we define Weyl's conformal cnrvature tensor W by . . 1 . . . . W;/ = Rjk/ + -(R jk 6,' - Rjl 6 + qjk R; - gj,RD n-2 R .. (n-l)(n-2) (gjkJj-gj1Jk), where the RjL and Rjk are components of the curvature tensor and Ricci tensor, respectively, 
307 and R is the scalar curvature ( 364 Rieman- nian Manifolds, 417 Tensor Calculus). When dim M;:, 3, the conformal connection induced by 9 on M is locally flat if and only if the conformal curvature tensor vanishes [7]. Q. Yang-Mills G-Connection Let P = (P, n, M, G) be a differentiable principal fiber bundle over a compact oriented Rieman- nian manifold M with group G. Then fiber bundles Gp=P x cG and gp= P x Ad 9 asso- ciated with P are induced naturally from the group conjugation c: G-->Aut(G) and the ad- joint representation Ad: G--> Aut(g), respec- tively. A (local) section of G p is called a (local) gauge transformation of P. The set of all global gauge transformations, which is denoted by '1f p , has a group structure. A locally faithful representation p of G to an n-dimensional complex vector space F with a fixed basis (I' ... , n) defines a differenti- able complex vector bundle E=P x pF asso- ciated with P. Every point x of P is identified with a linear mapping x of F onto the fiber E.(x) defined by x: i-->ei' where e; denotes the equivalence class of {x,;} EP X F, 1.s; i.s; n. In a manner similar to the case of an affine connection, a connection in P with connection form w gives a notion of parallel displacement of E as follows: Let c = P, (0 .s; t.s; 1) be a curve in M and c* = x, be a lift of c to P. The map- ping x,O XOI : Epo --> Ep, is called the parallel displacement of Epo onto Ep, along c. Let X be a vector field on M and <p be a differentiable section of E. The covariant de- rivative Vx<P of <p in the direction of X is de- fined as follows: Let Po be a point of M, c = P, ( - 8.s; t.s; - 8) be an integral curve of X through Po and c* = x, be a lift of c to P. We set 1 ( Vx<p) p = lim-(x,oxol(<p p )-<P p )' o t-+O t I a Then V x is also a differentiable section of E. The mapping (X, <p)--> Vx<p satisfies the fol- lowing conditions: (i) Vx<P is linear with respect to X and <p; (ii) Vjx<P= 1Vx<p; and (iii) Vx(f<p) = (Xf)<p + fVx<p, where f is a differentiable func- tion on M, From these conditions it is seen that the mapping X --> Vx<P for a fixed section <p of E defines a differential form of degree one with values in E, denoted by V <po V is called a G-connection on E (induced from the connection in P). A linear operator d V : r(AP 0 E)-->r(Ap+l 0 E), defined by dV(ex 0 <p)=dex 0 <p + ( -1)p ex A V <p for a differential form ex of degree P and a differentiable section <p of E is called a covariant exterior differentiation. Here r(AP 0 E) denotes the set of all differ- ential forms of degree P with values in E. 80Q Connections The curvature form R V of V is defined by RV(X, Y)<p= Vx(Vy<p)- Vy(Vx<p)- V[X,Yj<P, where X and Yare vector fields on M and <p is a differentiable section of E. In terms of the curvature form Q of the connection in P, it can be defined also by R(X, Y)<p=xoQAX*, Y*)'rl(<p), where X and Y are in TpM and xEn-t(p) and X*, y* are lifts of X, Yto P, respectively, and <p is an element of Ep. Since Q is a tcnsorial form of type ad and the differential p* of p induces a faithful representation of 9 to gI(F), R V can be regarded as a differential form of degree two with values in the bundle gp = P X Ad g. The curvature form R V satisfies Bianchi's identity: dVR v =0, where d V is a covariant exterior differentiation with respect to the G- connection on gp canonically induced from the connection in P. We denote by {(j the set of all G-connections on E. Now let G be a compact semisimple Lie group. A functional Y':{(j-->R defined by V -->Y'(V) = -HMtr(R v A *R v ) is called the action integral (Yang-Mills functional) of V, where * is Hodge's star operator, given by the fixed orientation of M. The group '1f p acts on {(j by V --> f- 1 0 V 0 f for f 0 '1f p and then the curvature form is trans- formed by this action as R V -->Ad(/-I )R v . Thus .'I' is '1fp-invariant. A connection V is called a Yang-Mills G- connection if V is a critical point of .'1'. The Euler-Lagrange equation of .'I' is given by d V * R V = 0 by the aid of the formal adjoint operator d V * of d V . This equation, called the Yang-Mills equation, is a system of non- linear second-order elliptic partial differential equations. When M is a 4-dimensional vector space R4 with Minkowskian metric and G is the Abelian group U(I), the Yang-Mills equation coincides with Maxwell's equations for an electromag- netic field. Thus the Yang-Mills equation for a non-Abelian group G is a natural extension of Maxwell's equations. In fact, the theory of Yang-Mills connections has its origin in the field theory of physics [11]. In the case of dim M = 4, special Yang- Mills G-connections occur. A G-connection V satisfying the condition * R V = R V (resp. * R V = - R V ) as a differential form of degree two is called a self-dual (resp. anti-self-dual) G-connection. From Bianchi's identity and the expression of d V * : d V * = - * 0 d V 0 *, it fol- lows that every self-dual (anti-self-dual) G- connection gives a solution to the Yang-Mills equation. Since the first Pontryagin number pdt:) is given by Pl(E)= -(1/4n 2 jJ M tr(R v A R V ) by virtue of the Chern- Weil theorem, the 
80 Ref. Connections action integral satisfies Y'(V');:' 2n 2 1PI (E)I for every V' in {(j and the equality holds if and only ifV' is self-dual or anti-self-dual. Explicit forms have been obtained for (anti-) self-dual connections over the 4-sphere S4 by many interesting methods [12-15]. And it has been shown that moduli space of self- dual G-connections (i.e., the set of all solu- tions to the Yang-Mills equation modulo p) over S4 has the structure of a Hausdorff mani- fold of dimension PI (gp) - dim G for every principal bundle P with group G [16,17]. It is not yet known whether there exists a Yang- Mills G-connection over S4 whose holonomy group is an open subgroup of G and which is neither self-dual nor anti-self-dual [18]. The following is one of the few known facts concerning the properties of Y ang- Mills G- connections: If a Yang-Mills G-connection, G=SU(2), SU(3), or 0(3), over S4 is weakly stable, i.e., if the second variation of .'I' is posi- tive semidefinite, then it is self-dual or anti- self-dual [19]. References [1] H. Cartan, Notions d'algebre differentieIle; application aux groupes de Lie et aux varietes ou opere un groupe de Lie, Colloque de Topo- logie, Brussels, 1950, 15-27. [2] S. S. Chern, Topics in differential geome- try, Lecture notes, Institute for Advanced Study, Princeton, 1951. [3] C. Ehresmann, Les connexions infinite- simales dans un espace fibre differentiable, Colloque de Topologie, Brussels, 1950,29-55. [4J S. Kobayashi and K. Nomizu, Founda- tions of differential geometry, Interscience, I, 1963; II, 1969. [5] A. Lichnerowicz, Theorie globale des connexions et des groupes d'holonomie, Cre- mona, 1955. [6] K. Nomizu, Lie groups and differential geometry, Publ. Math. Soc. Japan, 1956. [7] K. Yano and S. Bochner, Curvature and Betti numbers, Ann. Math, Studies, Princeton Univ. Press, 1953. [8J R. S. Palais, Seminar on the Atiyah-Singer index theorem, Ann. Math, Studies, Princeton Univ. Press, 1965. [9] S. S. Chern, Differentiable manifolds, Lec- ture notes, Univ. of Chicago, 1959. [!OJ N Tanaka, On the equivalence problem associated with a certain class of homogeneous spaces, J. Math. Soc. Japan, 17 (1965),103- 139. [11] C. N. Yang and R. L. Mills, Conservation of isotopic spin and isotopic gauge invariance, Phys. Rev., 96 (1954),191-195. [12] A. A, Belavin, A. M. Polyakov, A. S, 308 Schwartz, and Y u. Tyupkin, Pseudo-particle solutions of the Yang-Mills equations, Phys, Lett., 59B, 1 (1975),85-87. [J3J A. A. Belavin and V. E. Zakharov, Yang- Mills equations as inverse scattering problem, Phys. Lett. 73B, (1978), 53-57. [14J R. Jackiw, C. Nohl, and C. Rebbi, Con- formal properties of pseudo particle con- figurations, Phys, Rev, D15, 6 (1977), 1642- 1646. [15J V. G. Manin and Yu. I. Manin, A de- scription of instantons, Comm. Math. Phys., 63 (1978),177-192. [16] M. F. Atiyah, N. J. Hitchin, and I. M. Singer, Self-duality in four-dimensional Rie- mannian geometry, Proc. Royal Soc. London A, 362 (1978), 425-461. [17] M. Itoh, Moduli of anti-self-dual connec- tions on Kahler manifolds, Proc. Japan Acad., 57A (1981),176-180. [18] M. F. Atiyah and J. D. S. Jones, Topolog- ical aspects of Yang-Mills theory, Comm. Math. Phys., 61 (1978) 97-118. [19] J. p, Bourguignon and H. B. Lawson, Stability and isolation phenomena for Yang- Mills fields, Comm. Math, Phys" 79 (1981), 189-230. 81(1.11) Constructive Ordinal Numbers A. General Remarks To extend the theory of trecursive functions to transfinite ordinal numbers, A. Church and S. C. Kleene [IJ considered the set of effectively accessible ordinal numbers and defined the concept of constructive ordinal numbers as explained later in this article. Their work became the basis of fruitful research by Kleene, W. Markwald, C. Spector, and others [2-5]. A constructive ordinal number was origin- ally introduced as an "expression" in a tformal system utilizing the A-notation. Since such a system is "effective," we can arithmetize it utilizing tGodel numbers and assume from the outset that each ordinal number is represent- able by a natural number. The notations, terminology, and theorems mentioned in this article are mainly those for constructive ordinal numbers of the tsecond number class. B. Definition and Fundamental Properties We call a set of natural numbers satisfying conditions (I) and (II) a system of notations for ordinal numbers, and an ordinal number a 
309 constructive ordinal number when it is represent- able by a natural number belonging to such a system of notations: (I) No natural number represents two distinct ordinal numbers. (II) There are three tpartial recursive functions K(x), P(x), and Q(x, n) defined as follows: (i) for any natural number x representing X, K(x) takes the value 0, 1, or 2 according as X is zero, an tisolated ordinal number, or a t1imit ordinal number, respectively; (ii) when X is the ordinal number timmediately after an ordinal number Y, P(x) represents Y for any natural number x representing X; (iii) when X is a limit ordinal number, for any natural number x representing X there exists an increasing sequence {Y,,} of ordinal numbers such that X == limn Y" and Q(x, n) represents Y" for each natural number n. The system called S3 by Kleene is the most useful and convenient among systems of nota- tions for ordinal numbers. Let no be a tprimi- tive recursive function of the variable n defined by 0 0 = 1, (n+ 1)0=2 n 0. The fundamental notion aEO and relation a<ob of the system S3 are introduced by the following inductive definition: (1) 1 EO; (2) if YE 0, then 2 Y EO and Y < 02 Y ; (3) if a sequence {Yn} of natural num- bers has the property that for each n, YnEO and Yn<oYn+!, and if Y is a Godel number that defines Yn recursively as a function of no (i.e., Yn {y} (no) ( 356 Recursive Functions»), then 3, 5 Y E 0, and for each n, Yn<03'Y; (4) if x, Y, ZEO, x<oY, and Y < oZ, then x < oZ; (5) aE 0, a < ob hold only when they follow from (1)-(4). Now, for brevity, we write a .s; ob for (a<ob)v(a=b). The following propositions hold for S3: (1) If a<ob, then b# 1; (2) If a < ob, then a, bE 0; (3) If a < 02 Y , then a.s; oY; (4) If a < 03' Y, then there is a natural number n such that a.s; oYn, where Yn = {y} (no); (5) If aEO, then l.s;oa; (6) If aEO, then for any tnumber-theoretic function ex such that ex(O) = a and Vn (ex (n) # l-->ex(n+ 1) < oex(n)), there is a k such that ex(k) = I; (7) For each a, 'I (a < oa); (8) If CEO, a.s; oC, and b.s; oC, then a<ob or a=b or b<oa. Each member a of 0 represents an ordinal number lal as foUows: 111==0; 12 Y I=lyl+ 1 for YEO; 13'YI=lim n IYnl for 3'YEO, where Yn= {y}(no). Let b be a member of O. Then lal < Ibl when a<ob; and conversely, for each ex< I b I, there is a number a such that I a I = ex and a<ob. Hence the set {ala<ob} is a twell-ordered set with respect to <0' and its torder type is Ibl. The least number  greater than lal for every member a of 0 is the least ordinal number that is not constructive. It is denoted by wi K (Church and Kleene denoted it by wd. There is a subsystem of S3 that is well ordered with respect to .s; 0 and contains a 81 D Constructive Ordinal Numbers unique notation for each constructive ordinal number ex. For such a subsystem we can take a III set K such that K is trecursive in 0 (S. Feferman and Spector, R. O. Gandy) ( 356 Recursive Functions H). C. Constructive Ordinals and the Kleene Hierarchy Let R(x, y) be a tpredicate on natural numbers. We write x.s; RY for any natural numbers x, Y for which R(x,y) holds. We consider only the case where .s;R is a tlinear ordering on the set DR = {x 13y(R(x, y) v R(y, x))}. If DR is a well- ordered set with respect to .s;R' we denote its torder type by IRI. (1) For each (constructive) ordinal number ex < wi K , there is a tgeneral recursive (more strictly, tprimitive recursive) predicate R such that I R I = ex (Markwald, Spector, Kleene [3J). (2) Conversely, if R is a thyperarithmetic predicate (e.g., R is general recursive), then I R 1< wi K (Markwald, Spec- tor). The following theorems are the most fruitful ones in the theory of constructive ordinal numbers, and they fully support the validity of Kleene's idea of tanalytic hierarchy. (3) The set 0 is III ( 356 Recursive Func- tions H), and so is the predicate a < ob. Name- ly, for 0, there is a primitive recursive predi- cate R(a, x, ex) such that aEO=V0!3xR(a, x, O!) (Kleene [3J). (4) For each ordinal number ex < wi K , the set {a I ex;:' lal} is a hyperarithmetic set (Spector). (5) 0 is a tcomplete set for Ill. That is, for any III set E, there is a primitive recursive function <p such that aEE=<p(a)EO (Kleene [3J). Accordingly, 0 is not a 1:l set ( 356 Recursive Functions H). D. Relativization and Extension Given a (number-theoretic) predicate Q of one variable (or a set of natural numbers), we can trelativize to Q the notion of constructive ordinal numbers. The least ordinal that is not constructive relative to Q is denoted by w. The relativization to Q of the fundamental notion aEO and relation a<ob of the system S3 of notations are denoted by aEO Q and a< oQb, respectively. Then we can relativize the results of the preceding paragraphs to Q. For example, as the relativization of (3), we have the following: There is a predicate RQ(a, x, ex) which is primitive recursive tuniformly in Q such that aEO Q =Vex3xRQ(a, x, ex). When Q is hyperarithmetic, we have no gen- eralization of the constructive ordinal numbers 
81 E Constructive Ordinal Numbers by relativizing them to Q, that is, w = wi K holds (Spector) Now by relativizing to 0 the concept of constructive ordinal numbers, we obtain a proper extension of it (WiK<W), and then, performing such extensions successively, we have a (transfinite) sequence 0, 0°, 0°°, .... On the other hand, we can extend the constructive ordinal numbers to those corre- sponding to any like higher number class, beyond the second, in which partial recursive functions are used at limit levels to provide an "accessibility" mapping from previously de- fined number classes. There are several exten- sions done by Church and Kleene, H. C. Wang, D. L. Kreider and H. Rogers, Jr., H. Putnam, W. Richter, A. Kino and G. Takeuti, and others. Richter [9J has shown that these two ways of extending the constructive ordinals are equivalent, provided the sets of notations for the higher number classes satisfy certain natural conditions. Specifically, the ordinals of the Addison and Kleene [6J con- structive third number class are exactly the ordinals less than w? and the set 0 20 of nota- tions for those is recursively isomorphic to 0° (Richter). E. Constructive Ordinals in Higher Number Classes Putnum [8J has defined a system C of nota- tions for constructive ordinals of the Cantor higher number classes, improving that of Kreider and Rogers [7J, as follows: N, is the set of notations for the ordinal number ex; if for some ex, XEN., let Ixl=J1[xEN,;J, which is called a hyperconstructive ordinal; and let C,= U{Nf,1 « ex}. Define i to be a G6del number of the tidentity function, and n to be an index in C, if 3" . 5' E C, for some t. There are four cases: Case 1. ex = O. Then N, = { 1 }. Case 2. ex = fJ + 1, where N p is already defined. Then N,={2XlxENp}. Case 3. ex is a limit ordinal such that Nf, is already defined for all « ex, and there exists an ordinal fJ < ex such that 3 a . 5 i E N p for some a, and a partial recursive function f giving an order-preserving tcofinal mapping (o.p.c.m.) from C p into Ca. Then N, is taken to be the set of all numbers 3 a .5 n such that 3 a '5 i EN p (where p is any ordinal with the above property) and {n} is an o.p.c.m. from C p into Ca. Case 4. ex is a limit ordinal such that Nf, is already defined for all  < ex, there is no fJ as in Case 3, but there is a number a E C, which is not an index in Ca. Let fJo be the least fJ for which such a's belong to N p . Then N,== {3a'5nlaENpo and {n} is an o.p.c.m. from C, into C,}, 310 We have that xEN, only as required by Cases 1-4. Finally, let mc be the least ordinal num- ber which is not hyperconstructive, and let C = C"'c' Extending Kleene's Hy for YEO ( 356 Recursive Functions H), Putnum has defined a hierarchy Hx for XEC, and shown that hyper- constructive ordinal numbers are uniqueness ordinals, i.e., for X,YEC, ifixi = IYI, then Hx and Hy are of the same tdegree of (recursive) un solvability. E \ is the type-2 object ( 356 Recursive Functions F) introduced by T. Tugue such that Et(ex)=O ifVfJ:ix[ex(fJ(x))=OJ; otherwise, Et (ex) = 1. It is known that Wc == (()E1, where (()E1 is the least ordinal which is not the order type of any well-ordering recur- sive in EI (Richter [lOJ). References [IJ A. Church and S. C. Kleene, Formal defi- nitions in the theory of ordinal numbers, Fund. Math., 28 (1937),11-21. [2] S. C. Kleene, On notation for ordinal numbers, J. Symbolic Logic, 3 (1938),150-155, [3] S. C. Kleene, On the forms of the predi- cates in the theory of constructive ordinals II, Amer. J. Math., 77 (1955), 405-428. [4J W. Markwald, Zur Theorie der konstruk- tiven Wohlordnungen, Math. Ann., 127 (1954), 135-149, [5J C. Spector, Recursive well-orderings, 1. Symbolic Logic, 20 (1955), 151-163. [6J J, W. Addison and S. C. Kleene, A note on function quantification, Proc. Amer. Math. Soc., 8 (1957), 1002-1006. [7J D. L. Kreider and H. Rogers, Jr., Con- structive versions of ordinal number classes, Trans, Amer. Math. Soc., 100 (1961), 325-369. [8J H. Putnum, Uniqueness ordinals in higher constructive number classes, Essays on the Foundations of Mathematics, Magnes Press, Jerusalem, 1961, and North-Holland, 1962, 190-206. [9J W. Richter, Extensions of the constructive ordinals, J. SymhoJic Logic, 30 (1965),193- 211. [!OJ W, Richter, Constructively accessible ordinal numbers, 1. Symbolic Logic, 33 (1968), 43-55. 82 (XII1.18) Contact Transformations A. General Remarks A transformation of 2n + 1 variables z, x j ' Pj (j= 1,2, ...,n), 
311 Z =Z(Z,X 1 ,... ,Xn,PI ,P2,'" ,Pn), X j =Xi Z ,X 1 ,X 2 ,... ,Xn,PI ,P2,'" ,Pn), j= 1,2,..., n, =PiZ,XI,X2' ,,,,X n ,Pl,P2, ...,Pn), j= 1,2, ...,n, (1) is called a contact transformation in the (n+ 1)- dimensional space R n +! with the coordinate system (z, XI' '" ,x n ) if the ttotal differential equation dZ-Pldxl-P2dx2-...-Pndxn==0 (2) is invariant under the transformation, i.e., if the eq uality dZ-P I dX l -P 2 dX 2 - ...-PndXn = p(dz- PI dX I - P2 dX2 -... - Pndxn) (3) holds identically for a suitable nonzero func- tion p of z, Xj' Pj' Here we assume that (1) has an inverse transformation. Using Lagrange's bracket [j,gJ=i ( Jf ( d 9 ) _ J 9 ( df )) , j=1 apj dX j JPj dX j (:J == : +Pj  ' we see that (1) is a contact transformation if and only if [Xj, XkJ = [Xj, ZJ = [, PkJ =0, [, XkJ = P(jjk, [, ZJ = pP j , where (jjk is tKro- necker's delta. From this fact it follows that the composite of two contact transformations and the inverse transformation of a contact transformation are also contact transformations. Since the iden- tity transformation Z = Z, X j = Xj' Pj = Pj is a contact transformation, the set of all contact transformations forms an infinite-dimensional ttopological group. Given a set of scalars PI' ..., Pn' a pair consisting of a point (z, xJ and an n-dimensional hyperplane z* - z = LJ=l pixt - x) in an (n + 1 )-dimensional space is called a hypersurface element, and the set of hypersurface elements satisfying (2) is called a union of hypersurface elements. Using these concepts, we can state that a transformation (1) of coordinates Z,Xj,Pj (j= 1,2, ...,n) is a contact transformation if it transforms each union of hypersurface elements into another one. Consequently, if two n-dimensional hypersurfaces are tangent at a point (z, x) in the (n + I)-dimensional space, their images under a contact transformation, which are again two n-dimensional hypersurfaces, are tangent at the image point of (z, x). The name "contact transformation" is derived from this fact. For instance, the tcorrelation with respect to a hypersurface of the second order gives a 82B Contact Transformations contact transformation. In fact, from the re- lation between tpoles and tpolar lines with respect to the parabola X2 + 2y=0 in a plane, we have Legendre's transformation X = - P, Y=xp- y, P= -x (p= -x). In general, an invertible transformation defined by the three relations Q(x, y, X, Y)= 0, aQ/ax +paQ/ay=o, aQ/ax+paQ/ay=o derived from a function Q(x, y, X, Y) is a con- tact transformation. The function Q is called the generating function of this transformation. In this transformation, to each point (x o , Yo) there corresponds a curve Q(xo,yo,X, Y)== O. These results are valid also in the case of several variables. For instance, in an (n + 1)- dimensional space, a transformation Z = z- XlPI -... -XvPv; XI = PI"'" Xv= P" Xv+! = X v + 1 , ...,Xn=X n ; Pi = -X b ...,Pv= -xv, P v + I = Pv + I' . .. , Pn = Pn represents a contact transformation. Here v is an integer between 1 and n. In the case n = 2, v = 2, this transfor- mation reduces to a Legendre transforma- tion; and in the case n = 2, v = 1, it is called Ampere's transformation ( Appendix A, Table 15.IV). B. Canonical Transformations A transformation of 2n variables Xj' Pj (j = 1,2, ...,n) Xj=XiXI ,X2' ,.., X n ,PI,P2, .., ,Pn), = (XI' x 2 ,..., Xn,Pl, P2' ..., Pn), j= 1,2, ...,n, (4) is called a canonical transformation if the tdif- ferential form Lj1 (dXj- Pjdx j ) is texact in Xj' Pj' i.e., if there exists a function U of x, P such that n L (dXj-Pjdx)=dU. jl (5) Let (l) denote a contact transformation, and let ..1. denote a new variable which is different from zero. Set xo=z, Po=A, xj=x j , Pj= -APj (j = 1,2, ... , n) and define a transformation of (2n + 2) variables by Xo(x, P) = Z(z, x, p), P)=P)= (x,p)= -Po(z,x,p)/p(z,x,p), j=I,2,...,n, (6) where x, p, X, and P denote points (X l ,X2' ...,x n ), (PI,P2, ...,Pn), (XO,xI,...,x n ), and (Po, PI' ..., Pn), respectively. Then the identity (3) when multiplied by ..1. on both sides becomes PodXo+ PI dX I + ... + p"dX n =Podxo + PI dX I + ... + Pndxn. (7) 
82 B Contact Transformations Thus the transformation (6) represents a canonical transformation (U = constant). Moreover, from the definition the relation (6) is homogeneous, i.e., for all J1#0, Xl", J1p) = Xlx, p), (x, J1p) = J1(x, p), j=O,I,...,n. Conversely, let a transformation of (2n + 2) variables Xj=XAx,p), =(x,p), j=O,I,...,n, (8) be a homogeneous canonical transformation. Define a transformation of (2n + 1) variables z, x j , P j U=I,2, ...,n) by Z(z,x,p)=XO(z,x l , ...,xn,A, -AP" ..., -APn), 1jp(z,x,p) 1 _ =-;-Po(z,x" ...,xn,A, -AP" ..., -APn), Ie X/Z,X, p)=Xj(z, Xl'"'' X n , ..1., -Ap"..., -APn), (Z, X, p) p - = --;;-(Z,X"... ,Xn,A, -API'"'' -APn), Ie j= 1,2, ...,n. (9) Since the necessary and sufficient condition for (8) to be a homogeneous canonical transfor- mation is that (7) hold, it follows that ..1. -(dZ - PI dX l -... - PndXn) p =A(dz-pldxl-...-Pndxn)' Thus the transformation (9) represents a con- tact transformation. Therefore the most general contact transformation of (2n + 1) variables and the most general homogeneous canonical transformation are identical concepts, differing only in the choice of notation. The necessary and sufficient condition for (4) to be a canonical transformation is ex- pressed by the relations (Xj,Xk)=O, (Xj,Pd=Jjk> (,Pk)=O, j,k=1,2,...,n, (10) where ( ., .) is Poisson's bracket, i.e., for a pair of functions f(x, p), g(x, p), (f,g)= t ( a f ag _ af  ) . jl aXj apj aX j apj Denote Poisson's bracket in the coordinate system Xl' X2"'" X n , P" P 2 ,..., Pn by (', .)', and for f(x, p) and g(x, p) denote their trans- formations by f'(X, P) and g'(X, P). Then the relation (10) is equivalent to (/,g)=(/',g')' for all f and g. 312 From this fact it foIlows that the set of all canonical transformations forms an infinite- dimensional topological group. Suppose that we can take XI' X 2 ,... ,X n , XI' Xz, ..., X n as 2n independent variables. Denote by Ql (x, X) the function U of (5) represented by x, X. By means of this Ql' 2n functions which give a canonical transformation (4) are derived from the relations aQI Pj= - ax. ' J aQI p.=-, j=I,2,...,n. J aX j ( a2Q ) Conversely, for Q(x, X) such that det _ a aX j X k # 0, a transformation determined by the relations aQ Pj= - ax. ' J aQ =-, j=I,2,...,n, aX j represents a canonical transformation. For a general canonical transformation, Xl' X 2 , ...,X n , Xl' X 2 , ""X n are not necessarily 2n independent variables. But for a canonical transformation there exists n new variables XiI' X i2 '''' ,Xi" "..., "_, such that Xl' X 2 ,...,x n , XiI' X i2 , ...,X ik , IJl' 2' ""n-k are 2n independent variables, where (ii, i 2 , ..., i k ), U I ,jz,... ,jn-k) is a partitio.n of the set (1,2, ... , n) to two disjoint subsets. Denote by Q2 the function U - L7t Xj;Pj,- Then the transformation (4) is derived from relations aQ2 Pj=- ax. ' j=1,2,...,n, J aQ2 Xj;=- ap. ' 1==1,2,...,n-k, ;, aQ2 Pil= ax. ' 1=1,2,...,k. II These functions Q I' Qz are caIled the generat- ing functions of a canonical transformation. Consider a transformation of 2n variables depending smoothly on a parameter t: X j ==XAt,X l ,X 2 , ..., X n ,PbP2"" ,P.), = (t, Xl' X 2 , ..., X n , PI' P2, ..., Pn)' j= 1,2, ...,n. (11) If (11) represents a canonical transformation of 2n variables for each t, then X j and , j = 1,2, ..., n, satisfy. tHamilton's canonical equations dX j aH d=(t, X, P), t a d aH dt'= - ax/ t,x,P), j= 1,2, ...,n, (12) for some tHamiltonian function H(t, x, p). Conversely, for any solution X j ,  of Hamil- ton's canonical equations (12) depending on 
313 parameters Xl' X2,"" x n , PI' P2, ..., Pn' if the transformation (11) is a canonical transforma- tion for t=O, (11) represents a canonical trans- formation for each t. By a canonical trans- formation (4), the solutions of Hamilton's canonical equations dX j aH di.= apj (t,x,p), dpj aH -= --(t,x,p), dt aX j j=I,2,...,n, (13) are transformed to the solutions of dX j aK --;1=-(t, X, P), t a d aK -= --(t X P) dt oX j '" j=I,2,...,n, (14) where the new Hamiltonian function is given by K(t,X,P)=H(t,x,p). C. Applications to the Integration of Differential Equations Contact transformations have applications to the integration of differential equations since they transform each union of surface elements into another one. As an example, we sha\1 describe an outline of their application to a partial differential equation of the first order F(x, y, z, [J, q) = 0; p=az/Dx, q-=az/oy. For this purpose, we regard (15) as an equation defining unions of surface elements and trans- form it into a simpler equation by means of a contact transformation. If the transformed equation can be solved, then the solution of the original equation can be obtained by means of the inverse transformation. Now, let z=w(x,y,a,b) be a tcomplete solution of(15). Then (15) is reduced to Z - c = 0 by the trans- formation generated by the function Q-=Z-z+w(x,y,X, Y)-c=O, where c is a constant. In this equation the solution X ==a, Y =b, Z =c, exP + fJQ=O (a, b, c, ex, fJ are constants) plays an important role, and this line element will be called a charac- teristic line element. The characteristic line element satisfies equations that can be trans- formed by means of the inverse transformation into Charpit's subsidiary equations for (J 5): dx dy dz -- of/op iJF/oq poF/op+qoF/oq -dp -dg aF(ax+poF(az aF(ay+qaF/az Consequently, if we have p = p(x, y, z; a), q = 82D Contact Transformations q(x, y, z; a) from the solution G(x, y, z, p, q) = a of(16) and F=O, then the total differential equation dz= pdx+ qdy is tcompletely inte- grable, and the tgeneral solution of this equa- tion is a complete solution of (15). Also, if we know two independent solutions G(x, y, z, p, q) =a, H(x,y,z,p,q)=b of(16) such that [G,HJ =0, we can obtain a complete integral of (15) by eliminating p, q among the three equa- tions F = 0, G = a, H = b. This mcthod is called the tLagrange-Charpit method, which is ap- plicable also to the equation F(z, xl> ..., x n ; PI' ..., Pn) = 0 ( 322 Partial Differential Equa- tions (Methods ofIntegration). D. Applications to Analytical Dynamics Consider Hamilton's canonical equations (13). An example of system (13) is the equation of motion for a dynamical system which is de- rived from (j J L( x j , x) dt = O. Here the x j denote generalized coordinates, Pj= oL/ox j and II = Lj':1 PjX j - L. Lct S(X l 'X 2 , ...,x n , X l 'X 2 ' ...,X n ) be a solution depending on parameters X I 'X 2 , "', X. of the tHamilton-Jacobi equation H ( t'Xl'X2' ...,xn' -, -, ..., - ) oX I ax z aX n =K(X I ,X 2 , ...,X n ) (17) (15) ( 02S ) such that det axjax k #0. Then by a canon- ical transformation of a generating function S, i.e., by a transformation determined by the relations as Pj= - ox. ' } oS = ox. ' j=I,2,...,n, } the system (J 3) is transformed to (14). Here K(t,X,P)=H(t,x, -  )=K(X). Then (14) becomes dX. =O dt ' d aK -= --(X), j== 1,2, ...,n, dt aX j and the solutions are (16) (t)=(O)-t oK I oX j XX(O) In particular, a transformation that makes K == 0 is ca\1ed a transformation to an equilib- rium system. Thus solving Hamilton's canon- il.:al equations reduces to finding a tgeneral sol ution of the Hamilton-Jacobi equation (17). When the variables of the Hamilton-Jacobi equation (17) are tota\1y separable we can obtain a general solution of(17) by tquadra- Xj(t) = Xj(O), 
82 Ref. Contact Transformations tures. Then one can integrate by quadratures the system of equations (13). For many im- portant problems exact solutions are obtained by this method ( 271 Mechanics). Furthermore, every mapping in an optical system is a canonical transformation; in optics, the quantity corresponding to S is called an eikonal ( 180 Geometric Optics). References [1] R. Courant and D. Hilbert, Methods of mathematical physics II, Interscience, 1962. [2J C. Caratheodory, Calculus of variations and partial differential equations of the first order, Holden-Day, 1965. [3J V. Arnold, Les methodes mathematiques de la mecanique classique, Mir, 1976. [4J I. M. Gel'fand and S. V. Fomin, Calculus of variations, Prentice- Hall, 1963. 83 (V.3) Continued Fractions A. The Notion of Continued Fractions Let {b n } (n=O, ..., m) and {c n } (n= 1,..., m) be finite sequences of elements in a tfield F. A fraction of the form b o + b l + b 2 + ... C I C 2 C m - t ...+ C b m - I + b m m is called a finite continued fraction. It expresses an element in the field F unless de vision by 0 occurs in the process of reduction. Symboli- cally, it is also written in the forms b + <i c m - I C m o b l +b 2 +... +b m - I +b m ' b c 1 1 c 2 1 cm-II cml o+- I b +- I b +... +- I b +- I b' 1 2 m-l m b CI,C2' 'Cm-I'C m 0+-+-+"'+-+-' b l b 2 b m - l b m [ CI C 2 C m - 1 Cm J b o , , , ..., , , b l b 2 b m - l b m etc" or more briefly, [ c J m b o + --"- . b n n=1 314 If {b n } (n = 0, 1, ... ) and {c n } (n = 1,2, ... ) are infinite sequences, the expression b o + b l + b 2 + ... CI C 2 C n ...+- b n + ... is called an infinite continued fraction. By analogy with the finite case, it is expressed by b C I C 2 C n 0+- - - b l + b 2 + " . + b n + " . or by b o + I Sc J oo . L b n n1 For an infinite continued fraction, the quantity C I C n kn=b o +- - bl+...+b n is called its nth convergent, b o is called the initial term, and C n and b n (n? 1) are called the partial numerator and partial denominator, respectively. If F is a ttopological field (e.g., the real or complex number field) and the se- quence {k n } of its elements converges, then the infinite continued fraction is said to converge, and the limit is called its value. A finite or infinite continued fraction in which c n (n? 1) are all equal to 1, b o is a ra- tional integer, and b n (n? 1) are all positive rational integers is called a simple continued fraction. It is expressed by [b o , b" ...]. b n is often called the nth partial quotient. In the following paragraphs we shall mostly discuss simple continued fractions. For a real number x we mean by [xJ the greatest integer not exceeding x. [ J is called the Gauss symbol. Let W be any given real number, and put ko == b o 1 w=b o +-, bo=[wJ; WI 1 wn=b n +-, bn=[wnJ, n= 1,2, .... (1)n+l Then an expansion of w into a simple con- tinued fraction 1 1 w=b o +- - b l + ... + b n + ... is obtained. If w is irrational, this expansion is determined uniquely. Conversely, any infinite simple continued fraction converges to an irrational number, If w is rational, the process is interrupted at a finite step (w m = b m ), result- mgm 1 OJ = b o +- b l + .. . + b m 
315 An alternative representation of a rational number by a simple continued fraction is given by replacing b m above by (b m -l)+ 1/1. Examples of infinite simple continued frac- tions are e 2 / p + 1 1 1 =p+- e 2 / p -l 3p+... +(2n+ l)p+...' where p is a natural number (J. H. Lambert), and 1 1 1 1 1 1 e=2+- - - - - - 1+2+1+".+1+2n+l+... (L. Euler). B. Convergents Let the nth convergent of a simple continued fraction be expressed in the form of an irreduc- ible fraction Pn 1 1 -=b o +-  n?O, Qn b t +... +b; and for convenience put P- 2 =0, P-t = 1, Q -2 = 1, Q -I = O. Then we have the recurrence relations Pn==bnPn-1 +P n - 2 , Qn==bnQn-t +Qn-2, n?O, whence folIows PnQn-1 -Pn-I Qn=( _1)n+1, n?-1. Any simple continued fraction represents a real number w which satisfies w=(w n +1 Pn+ P n-d/(w n +1 Qn+ Qn-d in terms of the notation defined in this and the previous section. In particular, let (J) be an irrational number. Then each of the fractions Pk) P n - 2 + kP n - t Q (k) Q kQ ' k=I,2,...,b n -l, n n2+ n-l which are inserted between two conver- gents P n - 2 /Qn-2 and Pn/Qn = (Pn-2 + bnPn-d/ (Qn-2 + bnQn-d, is calIed an intermediate con- vergent, while the original convergent Pn/Qn is called a principal convergent. If a fraction P/Q approximating an irra- tional number w satisfies Iw- P/QI <Iw- p/ql for any other fraction p/q with q.s; Q, then it is said to give the best approximation. The frac- tion giving the best approximation of w is always a principal or intermediate convergent pk) /Qk) of w with k> b n /2 or k = b n /2, Qn> Qn-t OJn- The convergents satisfy the relation Pn/Qn- Pn-t/Qn-I = ( _1)n-1 QnQn-l; hence the se- 83C Continned Fractions quence {P 2n /Q2n} (resp. {P 2n +1/Q2n+I}) is monotonically increasing (decreasing). Ap- proximation by convergents is shown in the relations I Pn I 1 w- Qn .s; Qn+IQn ' I Pn I l Pn-I I w- Qn < w- Qn-I ' p 00 (-1)" w==lim==bo+ L-. "--*00 Qn n=O Qn+l Qn In particular, Iw- Pn/Qnl < I/Q;. On the other hand, if P, Q are relatively prime positive integers with Iw-P/QI < 1/2Q2, then P/Q is a convergent to w (A. M. Legendre). There are several results concerning the measure of approximation by convergents. Any w satisfies Iw-Pn/Qnl < 1/(j5Q;) for infinitely many n, while there exists an w which satisfies Iw- Pn/Qnl < 1/(AQ;) for only a finite number of n provided that A>j5 (e.g., w=j5, A. Hur- witz); at least one of two adjacent convergents satisfies the inequality Iw- Pn/Qnl < 1/(2Q;) (K. T. Vahlen); at least one of three consecutive convergents satisfies the inequality Iw-Pn/Qnl < 1/(j5Q;) (E. Borel); Iw-P/QI < 1/J8Q2 has infinitely many rational solutions P/Q whenever w is not tequivalent to (j5 + 1 )/2 (Hurwitz). One calls an irrational number (J) badly approximable if there is a constant c = c(w»O such that Iw- P/QI >C/Q2 for every rational P/Q. w is badly approximable pre- cisely if the partial quotients of the continued fraction expansion of (J) are bounded. In partic- ular, real quadratic irrationals have bounded partial quotients ( Section C; 182 Geometry of Numbers F). C. Quadratic Irrationals If an infinite simple continued fraction [b o , b l , ... J satisfies bm+k+V = bm+v (v = 0, 1,2, ...), it is called a periodic continued fraction an d is denoted b y the symbol [b o , b l , ..., b m , ..., bm+k-I]. According as m = 0 or m> 1, the periodic continued fraction is said to be pure or mixed, and the sequence b m , b m +1' ..., bm+k-t is calIed a period. In order for the continued fraction of w to be periodic, it is necessary and sufficient that w be a quadratic irrational number, i.e., a root ofax 2 +bx+c = 0 with rational integral coefficients a, b, c and nonsquare discriminant b 2 - 4ac (1. L. Lagrange). In order for w to be represented by a purely periodic continued fraction, it is neces- sary and sufficient that w be an irreducible quadratic irrational number, i.e., that w satisfy w> 1 and O>w' > -1, where w' is the conju- gate root of w (E. Galois). In order for OJ to be equal to the square root of a nonsquare ra- tional number, it is necessary and sufficient 
83D Continued Fractions that its continued fra ction be of the form [b o , bj,..., b k - I , 2b o J (Legendre). D. Application to PeWs Equation Let ax-by= 1 «a,b)= 1) be a tDiophan- tine equation of the first degree, and a/b = [b o , b" ..., bmJ = Pm/Qm. Since PmQm-1 - P m - I Qm =( _l)m-l, a solution of the equation is given by Xo =( _l)m-IQm, Yo =( _l)m-1 Pm. The general solution is then represented in the form Xo + bt, Yo + at (tE Z). This method of obtaining a solution is essentially the same as the method which uses the tEuclidean algorithm. Pell's equation X2 - D y 2 = 1 (D is a non- square integer> 1) was solved by Lagrange in terms of continued fractions. If the length of the period of D is k, all positive solutions of Pell's equation are given by X=P 2vk - l , y= Q2vk-1 if k is odd, and by x= P vk - I , y=Qvk-1 if k is even (v= 1, 2,...), where Pn/Qn denotes the nth convergent of the continued fraction expansion of fl. Incidentally, x = P(2v-l)k-l, y= Q(2v-I)k-l (v= 1,2, ...) are the positive solutions of x 2 - Dl = -1 provided that k is odd. There are no solutions of x 2 - Dl = ::!: 1 other than x"yv(v = 1, 2, ...) given by (XI +fl YI)'=xv+fl y" where Xl' YI is the least positive solution. For instance, the least positive solution of x 2 - 211 y2 = 1 is X = 278,354,373,650, y= 19,162,705,353. Lagrange made further use of continued fractions in order to obtain approximate values of roots of algebraic equations. The method is especially useful for precise compu- tation of neighboring roots. The theory of continued fractions may be invcstigated geometrically making use of lat- tices ([2, 3];  182 Geometry of Numbers) (F. Klein, G. Humbert). For instance, a measure of approximation of Pn/Qn to w in Diophantine approximation is represented by the closeness of a lattice point (Pn, Qn) to the straight line Y = wx on the plane. E. Continued Fractions with Variable Terms There are few results on continued fractions with variable terms. It is noteworthy that from the expansion of tanz into a continued fraction Z _Z2 _Z2 _Z2 tanz=- 1+ 3 + 5 + 7 +... (Lambert), the irrationality of n and of tan z for rational z (#0) can be deduced (A. Pringsheim). Among continued fractions with variable 316 terms, those of the form 00 a o alz anz [a O ,a n zJl =- - 1 + I +...+ I +... are called normal continued fractions. Let the convergent of such a continued fraction be Pn(z)/Qn(z), and for convenience put P -I (z) = 0, Q_I (z) = 1. Then we have the recurrence formulas Pn(z) == Pn-I (z)+ a n zP n _ 2 (z), Qn(z)=Qn-l(z)+a n zQn-2(Z), n;;:' 1. There are the further relations n Pn(z)Qn-dz) - Pn-dz)Qn(z) = (_I)n z n fl a" v=Q 00 (-I)"zn n [ao,anzJ;"'= L fla" "=0 Qn-I (z)Qn(z) v=O where the latter is formal. Let the tpower series expansion of the nth convergen t of [a o , an zJ f be Pn(z) ==  b v  nv Z ' Qn(z) FO n;;:' 1. Then bmv=b n , (Ov.s;m.s;n). If [ao,QnzJf has a power series expansion about the origin, then 00 [ao,anzJ= L bnnz n . n=O If the supremum g of {Ianl};'" is finite, then [a o , anzJ;'" converges uniformly for I zl.s;(1/4) g, and hence it represents an tanalytic function which is holomorphic in Izi «1/4)g. References [1] G. H. Hardy and E. M. Wright, An intro- duction to the theory of numbers, Oxford Univ. Press, 1938. [2] H. M. Stark, An introduction to number theory, Markham, 1970. [3] W. M. Schmidt, Diophantine approxi- mation, Lecture notes in math. 785, Springer, 1980. [4] O. Perron, Die Lehre von den Ketten- briichen, Teubner, third edition, 1954 (Chel- sea, 1950). For functions defined by continued fractions and applications to moment problems, [5J H. S. Wall, Analytic theory of continued fractions, Chelsea, 1967. [6] A. Ya. Khinchin (Hincin), Continued frac- tions, Noordhoff, 1963. (Original in Russian, 1949.) [7] A. N. Khovanskil (Hovanskii), The ap- plication of continued fractions and their generalizations to problems in approximation theory, Noordhoff, 1963. (Original in Russian, 1956.) 
317 84 (X.2) Continuous Functions A. General Remarks The notion of continuity is defined for a map- ping or a function f:X --> Y from a topolog- ical space X to a topological space Y ( 425 Topological Spaces G). In the present article, however, we are concerned mainly with the case where both X and Yare tmetric spaces with the distances Px and py, respectively. The most usual case is X = Rn (Euclidean space), Y=R (real numbers). A function f: X --> Y is said to be continuous at a point Xo E X if for every positive number E, we can select a suitable positive number D (depending on E and also on xo) such that Px(x, xo) < D implies py(/(x)J(x o )) < E. This is equivalent to the condition that x-->x o implies f(x)--> f(x o ) ( 87 Convergence). We call f continnous (on X) if it is continuous at every point Xo of X. If for every positive number E, we can select a suitable positive number D independent of x and y such that Px(x, y) < D implies py(/(x)J(y» < E for all x, YE X, we call f uniformly continuous on X. The tsupremum W(D) of py(/(x)J(y) for x, yE X satisfying Px(x, y) < D is called the modulus of continuity of the function f in X. Uniform continuity means that W(D)-->O for D-->O. If W(D).s; MD' for suitable constants M, ex > 0, that is, if the inequality py(/(x)J(y».s; M(pX<x,y»)" holds for x, YEX, then f is said to satisfy the Holder condition of order ex, also known as the Lipschitz condition of order ex. If ex= 1, this condition is called simply the Lipschitz condition. A function satisfying one of these conditions is uniformly continuous. The family of functions satisfying the Lip- schitz condition of order ex is sometimes de- noted by Lipex. In general, the composite function gof: X -->Z is continuous if both functions f:X --> Y and g: Y -->Z are continuous. If the ranges of 1, 9 are both the real field R (or the complex field C, or more generally a ttopological field), then f ::I:g, fg are continuous if f and 9 are contin- uous; and fig is continuous provided that g(x) #0. If R is the range of both f and g, then min(1, g) and max(f, g) are continuous when f and 9 are continuous. If X is tconnected (for example, an interval I in R) and if f is contin- uous, the image /(X) is also connected. B. Continuity from One Side In this section, we always assume that the domain X is an interval I in Rand f is a func- 84 C Continuous Functions tion from I to a metric space Y. A point Xo of X is called a discontinuity (point) of the first kind of f if both limits limxixJ(x) and Iim x )xJ(x) exist in Yand are different. Then we say also that f has a jump (or gap) at xo. If these two limits exist and have the same value, then f is continuous at xo. We say that f has a discontinuity of at most the first kind at Xo if f is continuous at Xo or if Xo is a discontinuity of the first kind for f (Sometimes the phrase "discontinuity of the first kind" is used to mean "discontinuity of at most the first kind.") A discontinuity point of f (i.e., a point at which f is not continuous) that is not of the first kind is called a discontinu- ity point of the second kind. When limx)xJ(x) = f(x o ), we call f right continuous (or con- tinuous from the right) at xo' In this case, Iim x 1 x J(x) need not exist. Replacing xlx o by xix o , we can similarly define the concept of being left continuous (or continuous from the left). If a function f has a finite number of discontinuity points of the first kind in the interval [a, bJ and is continuous at all other points, we call f a piecewise continnous func- tion in [a, b]. C. Semicontinuous Functions In this section, we assume that the domain of the functions is a subset E of a metric space X, and that the range is the set of real numbers extended to include ::1:00. Let x be a point in the closure of E. We denote by M (x, D) and m(x, D), respectively, the supremum and the infimum of the values of a given function f in the D-neighborhood of x. We put M(x)=lim M(x, D), m(x)=lim m(x, D), oo oo and call them the upper limit function and lower limit function, respectively. We have -00 .s;m(x).s;M(x).s; +00. If M(xo)= f(x o ) at Xo E E, then f is called upper semicontinuous at xo' If m(xo) = f(x o ) at Xo E E (i.e., if - f is upper semicontinuous at x o ), then f is called lower semicontinnous at xo' The function with one of these two properties is said to be semicon- tinuous at xo' Either of the following two conditions is necessary and sufficient for the function f to be upper semicontinuous at xoEE: (1) f(x o ) = + 00, or for every constantA such that f(x o ) < }" there exists a D-neighborhood such that M(xo,D)<A. (2) For every sequence x n of E converging to Xo we have lim sUPnoo f(x n ) = f(x o )' A function f is called upper (lower) semi- continnous in E if it is upper (lower) semicon- tinuous at every point x E E. A necessary and sufficient condition for the upper semicontinu- 
84D Continuous Functions ity of the function f(x) in E is that {x If(x) < ex} be a trelatively open set in E for every real number ex. We can define semicontinuity for functions on a topological space by using this latter property. A real-valued function f(x) is continuous at Xo E E if and only if it is upper and lower semi- continuous at Xo and f(x o ) is finite. A function f(x) is continuous on E if and only if it takes finite real values on E and both {xlf(x)<ex} and {xlf(x»ex} are relatively open in E for any real number ex. When E is tcompact, an upper (lower) semicontinuous function on E attains its (supremum) infimum at a point in E. In particular, a continuous function on a compact set E is bounded and assumes its maximum and minimum on E (Weierstrass's theorem). Furthermore, if E is connected (e.g., the interval I in R), it follows from the con nectedness of the image f(E) that if ex, [3E f(E) and y lies between ex and [3, then YEf(E) (intermediate-value theorem). A real-valued function f(x) on a set E of R satisfies the Lipschitz condition if it is tdiffer- entiable and the derivative is bounded. Such a function is also tabsolutely continuous, con- tinuous, and of tbounded variation. (For the polynomial approximation of real continuous functions  336 Polynomial Approximation.) The limit function f(x) of a monotone de- creasing sequence of upper semicontinuous functions fn(x) is also upper semicontinuous. The limit function f(x) of a uniformly converg- ing sequence of continuous functions is con- tinuous. (Regarding the tequicontinuous family of functions  435 Uniform Convergence.) D. Baire Functions The limit function of a pointwise converging sequence of continuous functions defined on a metric space X is not necessarily continuous. R. Baire (Ann. Mat. Pura Appl., (1899)) intro- duced the notion of Baire functions as follows: He named continuous functions the functions of class O. Then he called a function that is a pointwise limit of a sequence of continuous functions a function of at most class 1. A func- tion is said to be of class 1 if it is of at most class 1 and is not of class O. He similarly de- fined the notion of class n for arbitrary natural number n. Further, a function is said to be of at most class w if it is a pointwise limit of a sequence of functions of class nv for a sequence of natural numbers nv' A function is said to be of class w ifit is of at most class wand is not of class n for any finite number n. In general, using ttransfinite induction, we can define the notion 318 of functions of class ( for an arbitrary tordinal number  [4]. All these functions are caIled Baire functions. Actually, there is no function of class ( for an uncountable ordinal number . If X is a tper- feet set in Euclidean space, then there is actu- ally a function defined on X of class  for an arbitrary countable ordinal number . Here- after, we shall be concerned with this case only. If X has the cardinality of the tcontinuum, then the set of all Baire functions on X has the cardinality of the continuum. On the other hand, the cardinality of all functions on X is actually greater than that of the continuum. Hence there exist functions that are not Baire functions on X. A function is a Baire function if and only if it is tBorel measurable (H. Le- besgue). Therefore a necessary and sufficient condition for a function f to be a Baire func- tion is that the set (xlf(x»ex,xEX} be a Borel set for any real number ex ( 270 Measure Theory J). The limit of a countable sequence of Baire functions is also a Baire function. If f(x) and g(x) are of at most ciass ex on X, then the following functions are also of at most class ex: If(x)l, f(x) :tg(x), f(x). g(x), and f(x)jg(x) (provided that g(x) # 0 on X). The condition that a function f is of at most class 1 on X is equivalent to either of the following two conditions: (1) For any closed subset F of X, the restriction f* of f to F has a continuity point in F. (2) For every real number ex, the set {xlf(x)<ex,xEX} is an tF a set (Baire). In a tcomplete metric space, a necessary and sufficient condition for a func- tion f to be of at most class 1 is that the set of continuity points be dense in X. For example, the Dirichlet function, which takes the value 1 at rational points and 0 at irrational points, is expressed as lim ( lim (cosv! nx)2k ) , v-cc k-oc which is of class 2. A function f(x, y) of two real variables that is continuous in each vari- able x and y separately is a function of at most class 1. References [1] S. Saks, Theory of the integral, Warsaw, 1937. [2J N. Bourbaki, Elements de mathematique, Topologie generale, ch. 10, Actualites Sci. Ind., 1084b, Hermann, second edition, 1967. For Baire functions, [3J R. Baire, Le90ns sur les fonctions discon- tinues, Gauthier-Villars, 1905. [4J C. J. de La Vallee-Poussin, Integrales de Lebesgue, fonctions d'ensemble, classes de Baire, Gauthier-Villars, second edition, 1934. 
319 [5] W. Rudin, Principles of mathematical analysis, McGraw-Hili, second edition, 1964. 85 (VI.19) Continuous Geometry A. General Remarks The structure of a tprojective geometry is determined by the tlattice (lattice-ordered set) of subspaces of the projective space. For this reason, this lattice itself is sometimes called a projective geometry. The concept of con- tinuous geometry was introduced by J. von Neumann as an abstraction of lattice-theoretic properties from a special class of tlattices (lattice-ordered sets) which he encountered in his research on toperator rings in Hilbert spaces [1]. Continuous geometry contains projective geometry as a special case when the dimension is discrete; but more usually the lattices are of continuous dimension. A continuous geometry is a tcomplete and tcomplemented tmodular lattice L ( 243 Lattices F) that satisfies the following prop- erty and its dual (both called properties of continuity): For any element a of L and any subset W of L which is twell ordered with respect to the ordering in L, we have a n sup w = sup(a n w) (w E W). The tcenter Z of the lattice L is called the center of the continuous geometry L, which is said to be irredncible when Z has no elements other than the tleast element 0 and the tgreatest element I; other- wise L is said to be reducible. A reducible continuous geometry is isomorphic to a sub- lattice of a tdirect product of irreducible con- tinuous geometries. On any continuous geometry L, there can be defined a function d(x) whose values belong to a complete lattice-ordered linear space M and which satisfies the following four con- ditions: (1) d(x)?O; (2) d(x)=d(y) implies the existence of a common complement of x and y; (3) d(xU y)+d(xn y)=d(x)+d(y); (4) supd(w)= d(sup w) (w E W) for any subset W of L which is well ordered with respect to the ordering in L. Such a function d(x) is called a dimension function on L. Irreducibility of L is equivalent to the property that a real-valued dimension function can be introduced; in this case, if d(w) takes only a finite number of values, then Lis a finite-dimensional projective geometry; on the other hand, if d(w) takes every number in the closed interval [0, 1J, then L is called a continuous geometry in the strict sense. An example of the latter can be constructed as a limit of a sequence of projective geometries of 86A Control Theory increasing dimensions; another example is the lattice of projection operators of the tvon Neumann algebra of type III' Ifa group G of tautomorphisms of L is given, there can be introduced a generalized dimension function which is invariant under G satisfying slightly weaker conditions (T. Iwamura [3J). B. Representation of Continuous Geometry A ring R is called a regular ring if it has a unity element and if, for any element a of R, there exists an element x in R such that axa = a. A continuous geometry L is isomorphic to the lattice (with =:> as its ordering) oftprincipalleft ideals of a regular ring R provided that d(I) = n' d(x) (n? 4) for some natural number n and some element x of L. The decomposition of R into a tdirect sum of ideals corresponds to the decomposition of L into a direct prod- uct oflattices. The condition that L is irreduc- ible and finite-dimensional is equivalent to the condition that R is a matrix ring over a tskew field; when these conditions hold and L is considered as a projective geometry, then the coordinates are given by this skew field. In continuous geometries, join and meet are often denoted by the symbols for sum and product, respectively, and sometimes a direct product of continuous geometries is called their direct sum. Sometimes, the requirement that a con- tinuous geometry be complete is weakened to the requirement that it be tconditionally 0"- complete. References [IJ J. von Neumann, Continuous geometry, Princeton Univ. Press, 1960 (Lecture notes, 1935-1937). [2] F. Maeda, Continuous geometry (in Japa- nese), Iwanami, 1952; German translation, Kontinuierliche Geometrien, Springer, 1958. [3J T. Iwamura, On continuous geometries I, II, Japan. J. Math., 19 (1944),57-71; 1. Math. Soc. Japan, 2 (1950),148-164. [4J L. A. Skornyakov, Complemented modu- lar lattices and regular rings, Oliver & Boyd, 1964. (Original in Russian, 1961.) 86 (XIX.11 ) Control Theory A. General Remarks The classical theory of automatic control mostly deals with linear feedback control 
86B Control Theory systems with single input and single output. Mathematical structures of such systems must be, in principle, described in terms of ordinary linear differential equations with constant coefficients. Hence control engineers use block diagrams to describe systems, and operational calculus based on tLaplace transforms to ob- tain response characteristics ( 240 Laplace Transform). Thus the input/output relation of a system is described in terms of transfer func- tions. The main objectives of control theory are to ensure system stability, to maximize closed-loop response characteristics by choos- ing the best feedback, and to obtain a desir- able transient response to an impulse or step input. One of the remarkable contributions to classical control theory is Nyquist's criterion for stability testing of linear feedback systems [I]. The test consists of plotting the Nyquist diagram of a transfer function in the frequency domain (complex plane), and differs essentially from the tRouth-Hurwitz stability test for linear differential equations with constant coefficient. Classical control theory was almost complete by the end of W orld War II. Revolutionary technological innovations in electronics and computers and the invention of new control instruments and systems devel- oped after World War II have opened the way to modern control theory. Around 1960, three remarkable contributions were made con- currently; they are dynamic programming (proposed by R. E. Bellman [2J), Pontryagin's maximum principle (L. S. Pontryagin et al. [3]), and linear system tbeory (R. E. Kalman [4J). The first two give rise to mathematical tools to solve optimal control problems and to design optimal controllers or regulators. In contrast to the classical theory of control, optimal control problems are formulated in terms of the system of linear or nonlinear multivariable differential equations with multi- input forcing terms called control variables. This leads to the state-space approach, which has become ubiquitous in modern control theory. Linear system theory derives from the concepts of controllability and observability. The former was introduced by Pontryagin et al. [3J and later modified by Kalman [5J, and the latter was introduced by Kalman [5]. Controllability and observability are con- cerned with the interrelation between internal states of a system and its inputs and outputs. Hence linear system theory, profoundly related to the state-space approach, remains a prin- cipal theme of modern control theory. More advanced control theories, such as adaptive control or learning control, are mostly de- scribed and formulated in the framework of linear system theory. Modern control theory has stimulated 320 the development of tcybernetics, the science of control and communication, and con- trol theory is important in the information sciences. In fact, current control theory has many features that might be considered to belong as well to other areas, such as mathe- matical programming ( 264 Mathematical Programming), operations research ( 307 Operations Research), game theory ( 173 Game Theory), prediction and filtering theory, digital signal processing, circuit theory, and computer or microprocessor technology. Current control theory embodied in modern instrumentation and computer technology has a vast range of applications. We do not con- sider these applications here; we confine our- selves to some important theories that con- tinue to be major topics in control theory. B. Linear Dynamical System Let the state of a physical process to be con- trolled be represented by a real n-dimensional column vector x(t) =(x 1 (t), ..., Xn(tW ERn, where T denotes the transpose of a vector or matrix. If the state is determined by a differen- tial system with input u(t)=(ul(t),..., u,(tW E R' and output y(t)=(YI (t),..., Ym(tW ERm, d dix(t)= A(t)x(t) + B(t)u(t), y(t)= C(t)x(t), (1) then the process is called a linear dynamical system or simply a linear system. The dimen- sions of the coefficient matrices are given by A(t)ER nxn , B(t)ERnx" C(t)ERmxn. If A(t), B(t), and C(t) are constant matrices, the system described by d x==-x=Ax+Bu dt ' y=Cx (2) is called a linear time-invariant system. Corre- spondingly, the system described by equation (1) is called a linear time-varying system. Given an initial state x(to) and input func- tion u( t) for t? to, the state of system (1) is represented by x(t) = <1>(t, to)x(to) + f ' <1>(t, r)B(r)u(r)dr, '0 where <1>(t, r) is a tfundamental solution to the matrix differential equation a -<1>(t, r) = A(t)<1>(t, r), at <1>(r, r) = I (n x n identity matrix). 
321 Hence the output of the system is described by y(t) = C(t)<{)(t, to)x(to) + f ' C(t)<{)(t, r)B(r)u(r)dr. '0 It is seen from this equation that the input- output relation of the system is governed by W(t, r) == C(t)<{)(t, r)B(r)E R m x " which is calIed a weighting matrix. As a spe- cial case, the weighting matrix for the time- invariant system (2) becomes W(t, r)= W(t-r)=CeA(I-')B, where e A ' is a matrix exponential function defined by t 2 t 3 eA'=I +tA+-A 2 +-A 3 + 2! 3! .... The Laplace transform of the weighting matrix is G(s) = {OO e-S'W(t)dt= C(sI -A)-I B, which is called a transfer function matrix of the system (2). First we note that the weighting matrix W(t, r) for the system (1) is invariant under any linear transformation x(t) = P(t)x(t) of the state, and also that the transfer func- tion matrix G(s) for the system (2) is invariant under x==Px. For example, the transformation x = Px yields i= PAP-I x+ PBu, y = Cp- I x. (3) In fact, the transfer function matrix for the system (3) becomes equivalent to that for the system (2), that is, G(s)= CP-I(sI - PAp-I)-1 PB= C(sI - A)-I B =G(s). As a converse to this, there arises the problem of whether any linear time-invariant systems with the same transfer function matrix are similar to each other. To solve this problem, it is necessary to introduce the concept of con- trollability and observability for the system (1) or (2). C. Controllability and Observability The concept of controllability and observa- bility plays a fundamental role in linear system theory. Roughly, controllability implies the possibility of steering the state from the input. Definition of controllability: The linear sys- tem (1) is said to be controllable at time to if for any state x(to)ER n and Xl ERn there exists a control input u(t), t E [to, t IJ, that transfers the 86 C Control Theory state x(to) to the state Xl at a finite time t I > to. Otherwise, the system (1) is said to be un- controllable at time to. The controllability theorem states: The system (1) is controllable at time to if and only if there exists a finite t I (> to) such that the matrix f 'l D(to, t l ) = <{)(t o , t)B(t)BT(t)<{)T (to, t)dt '0 (4) is nonsingular. Similarly, the linear time- invariant system (2) is controllable if and only if the n x (nr) matrix D = [B, AB,..., An-I BJ (5) has rank n. The matrix D is called the con- trolIability matrix of the system (2). The concept of observability is dual to that of controllability. It implies the possibility of estimating the state from the output. Definition of observability: The linear system (1) is said to be observable at to iffor any state X(to)E Rn there exists a finite t I (> to) such that the knowledge of the input u(t) and the output y(t) over the time interval [to, t I] suffices to determine the state x(to). Otherwise, the sys- tem (1) is said to be unobservable at to. The observability theorcm states: The sys- tem (1) is observable at time to if and only if there exists a finite t I (> to) such that the matrix M(tO,td= f '! <{)T(t,tO)CT(t)C(t)<{)(t,to)dt (6) '0 is nonsingular. Similarly, the linear time- invariant system (2) is observable if and only if the n x (nm) matrix M = [C T , ATC T , ...,(AT)n-ICTJ has rank n. The matrix M is called the ob- servability matrix of the system (2). We deal only with linear time-invariant systems for simplicity, although some of the main results to be stated here can be extended to linear time-varying systems. If the controlla- bility matrix D has rank n, « n), there exists a linear transformation .x = Px with nonsingular matrix P which transforms the system (2) into (7) [ c J = [ Ac A12 J[ Xc J + [ Bc J u, xc' 0 Ac' Xc' 0 (8) y=[C c Cc,J [ Xc J Xc' so that the nc-dimensional subsystem Xc = A,x, + Bcu, (9) y = Ccxc is controllable. Clearly this subsystem has the same transfer function matrix as the system (2). 
86D Control Theory If the observability matrix M has rank n Q ( < n), there exists a linear transformation x = Qx that transforms the system (2) into [ Xo J _ [ Ao 0 J[ Xo ] [ Bo ] x o ' - A 21 Ao' X o ' + Bo' U, y=[C Q OJ [ Xo J ' X o ' so that the no-dimensional subsystem Xo = Aoxo + Bou, y= CoXo is observable. This subsystem also has the same transfer function matrix as the system (2). Combination of these two properties yields the canonical decomposition theorem (found by Kalman [6]), which states: There is a nonsin- gular transformation x = Px by which the system (2) can be transformed into the canon- ical form xw 1 [co 0 A13 o J[xro 1 cO' = A 21 A 22 A 23 A 24 Xcv' Xc'o 0 0 A33 o xc'o XC'Q' 0 0 A43 A44 Xc'o' n + f u, (12) y=[C cv 0 C 3 OJ l Xco J , X co ' Xc'o Xc'o' where the subvector Xco of x is controllable and observable, Xco' is controllable but not observable, Xc'o is not controllable but obser- vable, and Xc'o' is neither controllable nor observable. Furthermore, the transfer function matrix of the system (2) is equal to that of the system (12), which is described by G(s) = Cco(sI - Aco) -I Bco, that is, the transfer function matrix of the system (2) depends solely on the controllable and observable part of the state. A linear system (2) is said to be reducible if and only if there exists a linear time-invariant system of smaller dimension that has the same transfer function matrix. Otherwise, the system is said to be irreducible. The canonical decom- position theorem implies that a linear time- invariant system is irreducible if and only if it is controllable and observable. D. Realization Theory Realization theory is concerned with determin- ing (1) a linear time-invariant dynamical sys- 322 (10) tem that has a prescribed rational function matrix or (2) a linear time-varying system that has a prescribed weighting matrix. Here we consider only the former, although there are many interesting results for the latter as well. The rational function matrix G(s) is said to be strictly proper if and only if G( 00) = O. A linear time-invariant system with a prescribed G(s) as a transfer function matrix is called a realization of G(s). A realization of G(s) with the least possible dimension is said to be mini- mal. A minimal realization should be irreduc- ible by definition. The fundamental theorem on minimal realization, which answers the question raised at the end of Section B states' For a given strictly proper rational ma'trix . G(s), any two minimal realizations of G(s) are mutually similar in the sense that there exists a linear transformation x = Px that transforms one of the systems into the other. A minimal realization of a scalar transfer function (11 ) b n-I b - ( s ) = IS +...+ n-Is+b n 9 n n-I S +als +...+an-1s+a n is given by (13) X== l" f[} } 1 o o 1 o o -an-I -a n - 2 (14) y = [b n b n - I b n - 2 bl]x. It is easy to see that this system is controllable. Hence system (14) is called a controllable canonical realization. Kalman [4] showed that this realization is observable and therefore minimal if and only if there is no common factor between the denominator and the nu- merator of g(s). An observable canonical re- alization of g(s) can be similarly written down as a dual of the system (14). For a given strictly proper matrix G(s), the degree of the least common denominator of all entries of G(s) is called the degree of G(s), provided that there is no common factor be- tween the denominator and the numerator of each entry. It was shown by Kalman [7], together with a realization procedure, that the dimension of minimal realization of G(s) is equal to the degree of G(s). Other impor- tant realization procedures for rational func- tion matrices were proposed by B, L. Ho and Kalman [8], W. A. Wolovich and P. L. Falb [9,10], J. Rissanen and T. Kailath [11], and 
323 B. Dickinson et al. [12]. Procedures for time- varying system realization for prescribed weighting matrices were presented by Kalman [4J, D. C. Youla [15], and L. M. Silverman [16]. Realization problems for a given impulse response matrix were first discussed by C. A. Desoer and P. P. Varaiya [17J, and a fast algorithm for discrete-time system realiza- tion was proposed by C. T. Mullis and R. A. Roberts [18]. E. State Estimation When a linear system is observable, it is pos- sible to construct a device, called a state esti- mator, that approximates the state vector. For the given system (2), define a linear dynamical system i=Dz+Ey+Gu, .x=L 1 z+L 2 y with imputs YER m and uER', state zERq, and output x ERn. If the system (2) is observable, rank (C)=m, and qn-m, there exist matrices D, E, G, L" L 2 , and M that satisfy MA-DM==EC, LI M + L 2 C = In (n X n identity matrix), G=MB. Then it follows that x(t)-x(t)= LI eD'[z(O)- Mx(O)]. Furthermore, it is possible to choose a matrix D which is asymptotically stable, that is, every eigenvalue of D has negative real part. Thus i(t)-->x(t) as t-->oo, and system (15) yields a state estimator. This result was first shown by D. G. Luenberger [19,20], and thus the system (15) is called a Luenberger observer. The Kalman filter proposed by Kalman [21J for discrete-time processes and by Kalman and R. S. Bucy [22J for continuous-time processes is an optimal state estimator for a stochastic system. Let x(t) = A(t)x(t) + B(t)u(t), y(t) = C(t)x(t) + v(t), in which u(t) is a white Gaussian noise (tBrown- ian motion) ( 45 Brownian Motion) such that Eu(t) = 0, Eu(t)uT(s) = U(t)t5(t - s), and v(t) is also a white Gaussian noise such that Ev(t) = 0, Ev(t)v T (s) = V(t)t5(t - s), where E and T denote the texpectation and the transpose, respectively, In addition, it is as- 86F Control Theory sumed that Ex(to) = 0, Ex(to)x T (to) = Xo, and that x(to), u(t), and v(s) are uncorrelated. Under these conditions, the process x(t) be- comes a wide-sense Gauss-Markov process (1. L. Doob [23]). The Kalman-Bucy filter is a linear dynamical system described by .i(t)= [A(t)- L(t)C(t)Jx(t)+ L(t)y(t), where L(t) = P(t)CT(t)V-I (t) and P(t) is a solution to the matrix Riccati differential equation (15) p(t) = A(t)P(t) + P(t)A T (t) + B(t) U (t)B T (t) - P(t)CT(t) V -I (t)C(t)P(t), P(to)=Xo. The Wiener filter proposed in 1949 [24J is a special case of the Kalman filter in which the signal process is stationary and of single input and single output. Extensions to non- linear filtering were investigated first by H. J. Kushner [25J and subsequently by W, M. Wonham [26J, R. S. Liptzer and A. N. Shiryaev [27J, and M. Fuzisaki et al. [28]. F. Optimal Control Consider a nonlinear dynamical system de- scribed by a system of differential equations x==f(x,u), (17) where xERn and UER'. For a given set U in Rr, a piecewise continuous function u(t) de- fined over [to,tIJ is said to be an admissible control function if u(t) E U for every t E [to, t I]. When a functional i ll J = '0 fo(x(t), u(t))dt (16) is given as a performance index of control and two points X O and Xl are given in Rn, the optimal control problem consists of finding an admissible function u(t) that minimizes J and simultaneously transforms the state from x(to) =x o to x(tl)=X I . To solve this, a necessary condition for optimality was developed by L. S. Pontryagin and his colleagues [3J and termed the maximum principle. It states: In order for an admissible control u(t) and a corresponding solution trajectory of equation (17) to be optimal, it is necessary that there exist a nonzero vector ljJ(t) = (ljJd l ), ..., IjJn(lW such that (i) ljJ(t) satisfies 8H(IjJ(t), x(t), u(t)) ox(t) (t)= 
86G Control Theory where H(t/J, x, u)= - fo(x, u)+ t/J Tf(x, u), (ii) the function H(t/J(t), x(t), u) of the variable UE V attains its maximum at the point u= u(t) almost everywhere in [to, t IJ, and (iii) at the terminal time t 1 it holds that H(t/J(t I), x(t I), u(t d) == O. When f(x, u) is linear in x and u, fo(x, u)= 1, and t l is free, we have the time-optimal control problem investigated by Bellman et al. [29J and 1. P. LaSaIle [30]. When fo(x, u) is instead a quadratic function of x and u, we have the optimal regulator problem solved by Kalman [31]. The relation between the maximum principle and the tcalculus of variations ( 46 Calculus of Variations) has been pointed out by L. D. Berkovitz [32]. Generalizations of the maximum principle and other results on optimal control problems are found in [33-35]. G. Miscellany There are many other areas of control theory. For details, we refer the reader to [36J for adaptive control, [37J for stability theory, and [38J for system identification. References [IJ H. Nyquist, Regeneration theory, Bell Syst. Tech. 1.,11 (1932),126-147. [2J R. E. BeIlman, Dynamic programming, Princeton Univ. Press, 1957. [3J L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze, and E. F. Mishchenko, The mathematical theory of optimal processes, Interscience, 1962. [4J R. E. Kalman, Mathematical description of linear dynamical systems, SIAM 1. Control, 1 (1963), 152-192. [5] R. E. Kalman, On the general theory of control systems, Proc. First Int. Congress Automatic Control, Butterworth, 1960,481- 493. [6J R. E. Kalman, Canonical structure of linear dynamical systems, Proc. Nat. Acad. Sci. US, 48 (1962), 596-600. [7J R. E. Kalman, Irreducible realization and the degree of a rational matrix, SIAM J. Appl. Math., 13 (1965),520-544. [8J B. L. Ho and R. E. Kalman, Effective construction of linear state variable models from input/output data, Regelungstechnik, 12 (1966),545-548. [9J W. A. W olovich and P. L. Falb, On the structure of multivariable systems, SIAM J. Control, 7 (1969), 437-451. 324 [lOJ W. A. Wolovich, The determination of state-space representations for linear multi- variable systems, Automatica, 9 (1973), 97- 106. [11J 1. Rissanen and T. Kailath, Partial realiza- tion of random systems, Automatica, 8 (1972), 389-396. [12J B. Dickinson, M. Morf, and T. Kailath, A minimal realization algorithm for matrix se- quences, IEEE Trans. Automatic Control, AC- 19 (1974), 31-38. [13J B. D. O. Anderson, A system theory criterion for positive real matrices, SIAM J. Control, 5 (1967), 171-182, [14J B. D. O. Anderson, An algebraic solution to the spectral factorization problems, IEEE Trans. Automatic Control, AC-12 (1967), 410- 414. [15J D. C. Y oula, The synthesis oflinear dy- namical systems from prescribed weighting patterns, SIAM J. Appl. Math., 14 (1966), 527- 549. [16J L. M. Silverman, Synthesis of impulse response matrices by internally stable and passive realizations, IEEE Trans. Circuit Theory, CT-15 (1968), 238-245. [17J c. A. Desoer and P. P. Varaiya, The minimal realization of a nonanticipative im- pulse response matrix, SIAM 1. Appl. Math., 15 (1967), 754-764. [18J C. T. MuIlis and R. A. Roberts, The use of second-order information in the approxi- mation of discrete-time linear systems, IEEE Trans. Acoust. Speech Signal Process., ASSP- 24 (1976),226-238. [19J D. G. Luenberger, Observing the state of a linear system, IEEE Trans. Mil. Electron., MIL-8 (1964), 74-80. [20J D. G. Luenberger, Observers for multi- variable systems, IEEE Trans. Automatic Control, A C-Il (1966), 190-197. [21J R. E. Kalman, A new approach to linear filtering and prediction problems, Trans. ASME, 1. Basic Eng., 82 (1960), 35-45. [22J R. E. Kalman and R. S. Bucy, New re- sults in linear filtering and prediction theory, Trans. ASME, J. Basic Eng., 83 (1961), 95-108. [23J J. L. Doob, Stochastic processes, Wiley, 1953. [24J N, Wiener, Extrapolation, interpolation and smoothing of stationary time series, with engineering applications, Wiley, 1949, [25J H. 1. Kushner, On the dynamical equa- tions of the conditional density functions with applications to optimal stochastic control theory, 1. Math. Appl., 8 (1964), 332-344. [26J W. M. Wonham, Some applications of stochastic differential equations to optimal nonlinear filtering, SIAM J. Control, 2 (1965), 347-369. [27J R. S. Liptzer and A. N. Shiryaev, Non- 
325 linear filtering of Markov diffusion processes (in Russian), Trudy Mat. Inst. Steklov., 104 (1968), 135-180. [28J M. Fuzisaki, G. Kallianpur, and H. Kunita, Stochastic differential equations for the nonlinear filtering problem, Osaka J. Math., 9 (1972), 19-40. [29J R. Bellman, I. Glicksberg, and O. Gross, On the "bang-bang" control problem, Quart. Appl. Math., 14 (1956),11-18. [30J J. P. LaSalle, The time optimal control problem, Contributions to the Theory of Non- linear Oscillations V, Ann. Math. Studies, Princeton Univ. Press (1960), 1-24. [31J R. E. Kalman. Contributions to the theory of optimal control, Bol. Soc. Mat. Mexicana, second ser., 5 (1960),102-119. [32J L. D. Berkovitz, Variational methods in problems of control and programming, J. Math. Anal. Appl., 3 (1961),145-169. [33J E. B. Lee and L. Markus, Foundations of optimal control theory, Wiley, 1967. [34J A. G. Butkovskii, Theory of optimal control of distributed parameter systems, American Elsevier, 1969. [35J J. L. Lions, Optimal control of systems governed by partial differential equations, Springer, 1971. [36J Y. D. Landau, Adaptive control, Dekker, 1979. [37J N. P. Bhatia and G. P. Szego, Stability theory of dynamical systems, Springer, 1970. [38J R. K. Mehra and D. G. Lainiotis (eds.), System identification: Advances and case studies, Academic Press, 1976. 87 (11.19) Convergence A. Introduction The notion of convergence was first intro- duced in the real number system to deal with sequences of numbers, functions, series, or definite integrals ( 379 Series; 216 Integral Calculus). The notion was then extended to the case of generalized sequences where the index moves over a directed set, and the terms are in a topological space. B. Convergence of Sequences of Numbers A sequence {an} of numbers is said to be con- vergent to a number a or to converge to a, written 1imn00 an = a or an-->a as n--> 00, if for any positive number £ we can choose a (suffi- 87C Convergence ciently large) natural number no such that for every n larger than no the inequality I an - al <£ holds. Then a is called the limit (or limit point) of the sequence {an}' If {an} has a limit, it is unique. A sequence which is not conver- gent is said to be divergent or to diverge. A set A of real numbers is said to be bounded from above if there is a real number b such that a.s; b for all a E A, bounded from below if there is a real number c such that a  c for all a E A, and bounded if it is bounded from above and below. A sequence {an} of real numbers is said to be monotonically increasing (monotonically decreasing), written an i (an!), if a 1 .s;a 2 .s;... .s;a n .s;a n +1.s;... (ala2 ... an  an +1  ... ). A monotonically increasing or decreasing sequence is called a monotone sequence. C. Criteria for the Convergence of Sequences of Numbers Every bounded monotone sequence of real numbers is convergent; its limit is sup {an} (inf{ an}) ( 355 Real Numbers B) if it is monotonically increasing (decreasing). For any bounded sequence {an} of real num- bers, setting iX n ==inf{a n ,a n +1''''} and fJn== sup{ an' a n +1' ...}, we have exn j, fJnL and exn.s; an.s; {3n. Hence limnoo exn = ex ( = sup{ ex n }) and Iimnoo fJn = fJ (= inf{fJn}) exist. ex is called the inferior limit (or limit inferior) of {an}, written Iim infnoo an or lim n'" an' while fJ is called the superior limit (or limit supe- rior), written IimsuPnoo an or lim noo an' If IimsuPnoo an = liminfnoo an = iX, then limnoo an exists and equals ex. The limit of a conver- gent subsequence of a sequence {an} of num- bers is called an accumulation point of the sequence, Here we should distinguish an ac- cumulation point of a sequcnce {an} from an accumulation point of {an} viewed as a set ( 425 Topological Spaces 0); for ex- ample, if an = 1 for every n, then the former is 1 and the latter does not exist. For any bounded sequence of real numbers, its supe- rior (inferior) limit is the maximum (minimum) of its accumulation points. Moreover, if {3 is the superior limit of a sequence {an}, then for any positive number £, there exist only a finite number of n's for which an is greater than fJ +£, while there may exist an infinite number of n's for which an is less than fJ - £. The inferior limit of the sequence has a similar property. Suppose that we are given a sequence {an} of real numbers and that there exist two se- quences {un} and {V n } such that un.s; an.s; Vn, lim(u n - V n } = 0, {Un} is monotonicaIly increas- ing, and {v n } is monotonically decreasing. Then lim an exists and is equal to Iim Un = 
87D Convergence lim V n (principle of nested intervals). In partic- ular, if lim sup an = lim inf an, then lim an exists. The converse also holds. If {an} is convergent, then lan-aml-->O as n, m--> 00, and vice versa; that is, {an} is conver- gent if and only if for any positive number 8 there exists a positive integer no such that I an - am 1< 8 for all n, m  no (Cauchy's criterion). D. Infinity F or a set A of real numbers, the expression sup A = +00 means that A is not bounded from above; inf A = - 00 means that A is not bounded from below. For a sequence {an} of real numbers, lim an = +00 means that for any real number b there exists a positive integer no such that an> b for all n  no; the notation lim an = - 00 has a similar meaning. The sym- bols +00 and -00 are called positive (or plus) infinity and negative (or minus) infinity, res- pectively. We say that the limit of {an} is +00 ( -00) if lim an = +00 (-00). In these cases, we customarily say that {an} diverges (or is divergent) to + 00 ( - 00), or that an becomes positively (negatively) infinite as n--> 00. We also define lim sup an = + 00 (Jim inf an = - 00 ) to mean sup{ an} = +00 (inf{ an} = -00). A se- quence {an} is said to oscillate if lim sup an > lim inf an' We now have the following propositions concerning sequences of numbers: If lim an = a and lim b n = b, then lim( eta n + {3b n ) = aa + {3b, lim(anb n ) = ab, and lim(an/b n ) = a/b (provided that b n # 0, b # 0). For sequences of real num- bers, these formulas also hold when a or b is infinity. In those cases we set a'(::I:oo)=::I:oo (iX>O), iX'(::I:oo)= +00 (iX<O), iX::I:oo = ::1:00, a/(::I:oo) = 0 for a real number a. The cases O' (::1:00), + 00 + (-00), ::1:00/(::1:00) are excluded. E. Convergence of Sequences of Points in a Topological Space A sequence {an} of points in a topological space ( 425 Topological Spaces) is said to converge to a point a if for any tneighborhood U of a there exists a positive integer no such that an E V for all n  no' The point a is called a limit (or limit point) of {an} and we write limnoo an = a or an-->a as n--> 00. A sequence is said to diverge if it does not converge to any point. In particular, the set R of all real numbers is a topological space in which the set of inter- vals (a - e, a + e) for some e > 0 is a tbase for the neighborhood system of a point a, so that the notion of limit in R explained previously is 326 a special case of the same notion in a topolog- ical space. By adding the symbols +00 and -ex, to R, we obtain the topological space R, in which any set containing {XIX>iX,XER} ({xl x< iX, xER}) for some iXER is a neighbor- hood of +00 (-00), where the ordering is defined as -00 <a< +00 (iXER). Then lima n = +00 (-00) is interpreted as convergence in the topological space R. The elements of Rare called extended real numbers. In the case where the topological space is a tmetric space ( 273 Metric Spaces) with metric p, an-->a is equivalent to p(a n , a)-->O. F or convergence of sequences of points in a topological space, the following properties (S) hold: (S) (1). If an = a for all n, then lim an = a; (2) if an-->a, then an -->a for any subsequence k {an }; (3) if there is a point a such that any k subsequence {an} of {an} has a suitable sub- k sequence converging to a, then an-->a. In a tHausdorff space (e.g., a metric space), the additional property (S*) holds: If a sequence {an} has a limit, this limit is unique, and can be denoted lima n . F. Limits of Functions Let a real-valued function f(x) of a real vari- able x be defined for x # a belonging to a neighborhood of the point a. We say that the limit of f(x) is b as x tends to a, and write limxaf(x) = b or f(x)-->b as x-->a, if for any positive number 8 there exists a positive num- ber,) such that O#lx-al <,) implies If(x)- bl <8. Replacing O#lx-al <,) by a<x< a+,) (a-,)<x<a), we define f(x)-->b as x-->a+O (x-->a-O) and say that b is the limit on the right (left) of f(x) as x tends to a. We define f(x)--> +00 or f(x)--> -00 as x-->a analogously to the case of sequences. The expression f(x)-->b as x--> +00 means that for any positive number 8 there is a real number k such that If(x)-bl <8 for any x>k. There are similar definitions for x--> -00 and b = ::1:00. When f(x)-->::I:oo as x-->a, we often say that r diverges definitely at a. In general, for a mapping f from a subset D of a topological space X into a topological space Y, with a point a in the closure of D and a point b in Y, limxaf(x) = b or f(x)-->b as x-->a means that any neighborhood V of b contains f(U n D- {a}) for some neighbor- hood U of a. If Y is a Hausdorff space, b is unique (if it exists) for given f and a. This point b is called the limit (or limit value) of f(x) as x-->a. It is easy to see that this definition of limxaf(x) = b is a generalization of the cases where the topological spaces are R or R. Let N be the set of all natural numbers, and let N = 
327 NU{ +oo} be supplied with the trelative topology as a subspace of R. A sequence {an} of real numbers or points can be identified with a mapping f from N into R (N -->R) defined by f(n) =a n . It converges to a if and only iflimnoof(n)=a. Suppose, in particular, that f is a mapping from a metric space (X, p) into a metric space (Y,O"). Then,f(x)-->b as x-->a means that for any e > 0 there exists a t5 > 0 such that O"(/(x),b)<e for all xED such that 0< p(x, a) < t5. Thus f(x)-->b as x-->a if and only if f(x n ) -->b for any {x n } in D with xn-->a. If we set P(ZI,Z2)= IZI -z21 for complex numbers z" Z2, the function p supplies the set of all com- plex numbers C with a metric and C becomes isometric to the plane R 2 ( 74 Complex Numbers). Thus the cases X = Cor Y = Care particular cases of the above generalization. Furthermore, we introduce the tRiemann sphere C = C U { oo} by adding the tpoint at infinity 00 to C. We can define a topology on C such that any set containing { oo} U {z Ilzl > r} for some positive number r is a neighborhood of 00. Thus we can define the notions f(z)-->b as z-->oo, f(x)--> 00 as x-->a, etc., for a complex-valued function f by con- sidering f as a mapping from the topological space C into itself. Then f(x)--> 00 is equiva- lent to llf(x)-->O. G. Orders of Infinities and Infinitesimals Let f be a complex-valued function defined on a topological space X and a a point of X. Then f is called an infinity (at a) or an in- finitesimal (at a) if f(x)--> 00 as x-->a or f(x)--> o as x-->a, respectively. Suppose that f and 9 are infinities and fig is an infinitesimal. Then f is said to be of lower order than g, and 9 is said to be of higher order than f If both fig and gl fare bounded, then f is said to be of the same order as g. This last relation, written f  g, is an equivalence relation. An infinity f is said to be of the nth order with 9 if f  gn. For two infinitesima1s f and g, f is called of higher order than 9 and 9 oflower order than f if fig is an infinitesimal. For infimtesimals, the terms of the same order and of the nth order are defined similarly as above. In particular, when X =C and a= 00, we usually omit the phrase "at 00." Also, for such a function f, we cus- tomarily say that the order of an infinity (in- finitesimal) is n if f  zn (z -n). To describe the order of an infinity or an infinitesimal simply, the following notions, due to E. Landau [10J, are in common use. Let f and 9 be two functions. H [f(x)/g(x) I is bounded as x-->a, then f is called at most of the order of gas x-->a, and we write f(x) = 87H Convergence O(g(x)) as x-->a. Second, if f(x)/g(x) is an in- finitesimal at a, then f is said to be of lower order than 9 as x-->a, and we write f(x) = o(g(x)) as x-->a. The symbols 0, 0, indicat- ing the word "order," are called Landau's symbols. The notation f(x) = h(x) + O(g(x)) means f(x) - h(x) = O(g(x»), When we use the symbols 0, 0 we should indicate clearly the phrase "as x-->a," which is sometimes omitted when no confusion is to be feared (e.g., for the case of a complex variable with a = 00). These symbols are employed for sequences as well, to describe their behavior as n-->oo. H. Convergence of Nets (Moore-Smith Convergence) [7,8J Let  be a (preordered) tdirected set. A family of points in a set X with index set  (namely, a mapping from  to X) is called a net in X. A net is denoted by {X'}'EI ({X'}I or {x,}). A net {y p}2J in X is called a subnet of {X'}l if there exists a mapping <p:!B -->  such that (I) y p = x",(fi) and (2) for any aoE there exists a fJoE!B such that fJ  fJo implies <p(fJ)  iXo. In particu- lar, if!B is a cofinal directed subset of, {xp}2J is called a cofina1 subnet of {X'}l' A net is called a universal net if either {a I X,E Y} or {a I X,E X - Y} is residual in  for any subset Y of X. For any net there is a universal subnet. For a net {x,}., in a topological space X, {x,} is said to converge to a point x in X if for any neighborhood U of x there is an iXo such that {x,1 a  a o } c U. Then a is called a limit of the net {x,}. We then write X,-->X (aE) (or simply x,-->x). The convergence of se- quences of points is the special case where  = N. The notion of convergence using nets was introduced by E. H. Moore and H. E. Smith. Concerning this convergence, we have the following propositions (D): (D) (1) If X,=X for all iX, then x,-->x. (2) If X,-->X and {Yp} is a subnet of {x}, then Yp-->x. (3) If for a net {x,} there is a point x such that any subnet {yp} of {x,} has a suitable subnet converging to x, then X,-->X. (4) Suppose that X,-->X (iXE) and y,p-->x, (fJE!B,) for each iX. Then let <£:=  x II!B, be the direct product of directed sets with projections p:<£:--> and p,:<£:-->!B., and for Y E<£: define Zy = Yap, where a = p(y) and fJ=p,(y). Then Zy-->x (YE<£:). Furthermore, the space_ X is a tHausdorff space if and only if we have the condition (D*): Any net in X has at most one limit. A limit of {x,} is denoted by lim x, or Iim'Eillx,. Then X,-->X if and only if x is con- tained in the closure of any sub net {YplfJE!B} of {x,}, (We may consider this to be a defini- tion of x, --> x.) 
87 I Convergence I. Convergence of Filters [9J Let X be a set. A set 11> of subsets of X is caIled a filter if the foIlowing conditions are satisfied: (i) 0 tl1> (0 is the empty set); (ii) A c Be X and AEI1> imply BE 11>; (iii) A, BE 11> imply An BEI1>. Let !B be a set of subsets of X and 11> be the coIlection of subsets of X such that each element A of 11> contains a subset belong- ing to !B. If 11> is a filter, then !B is caIled a filter base which generates 11>. !B is a filter base if and only if (i) 0t!B; (ii) A, BE!B implies that there is aCE!B with An B =:> C. A filter 11> is caIled an ultrafilter (or maximal filter) if there exists no filter which contains 11> properly. For any filter there exists an ultrafilter containing it. If {11> A} A is a family of filters, then the intersection n 11> A is a filter. If 3' is a filter base in the index set A, then UMEi! (n AEM I1>A) is also a filter. We denote by U(x) the tneighborhood sys- tem of a point x in a topological space X. A filter 11> in X is said to converge to a point a, written l1>-->a, if U(x) c 11>. A filter base !B is said to converge to a if the filter generated by !B converges to a. The convergence of filters just defined has the foIlowing fundamental properties (L): (1) for a point a in X the filter l1>a = {A I aE A c X} converges to a; (2) for two filters 11> and '1', l1>-->a and I1>c'¥ imply '¥-->a; (3) ifl1>A--> a for all members in a family {I1>A} of filters, then n I1>A ==11>-->a; (4) suppose that filters l1>y--> yare assigned for all points y in a subset Y of X, and that we are given a filter 'I' in X which converges to a, generated by a filter base !B in Y; then UBE'IJ (n yEB I1»-->a. Fur- thermore, the space X is Hausdorff if and only if we have the condition (L *): Each filter in X has at most one limit. J. Relations among Various Definitions of Convergence Convergence of sequences of points is a special case of that of nets. Properties (1), (2), and (3) of (D) imply (1), (2), and (3) of (S), respectively, and (D*) implies (S*). Consider a net {x'}2I in X. Then the set {{x,liXE,iXiXo} liXoE} of subsets of X is a filter base in X which gener- ates a filter 11>, and l1>-->x if and only if X,-->X. In this situation, (L) implies (D), and (L*) implies (D*). Suppose that we are given a function f: X --> Y with the domain D and a point aEX, Let U(a) be the neighborhood system of a and assume that, for any U EU(a), UnD-{a}# 0. Then the set {I(UnD- {a} ) I U E U(a)} is  filter base. Let 11> be the filter generated by it. Then f(x)-->b as x-->a if and only if l1>-->b. Consequently, the various types of convergence described previously can 328 be expressed by means of convergence of filters. K. Convergence and Topology In a topological space X, the concept of con- vergence of nets and that of filters can be defined. Conversely, convergence of nets in X defines a topology of X. In fact, let us assume that we are given a set X and a definition of convergence of filters which satisfies the prop- erties (L). Then convergence of nets that satis- fies (D) can be introduced as above. If A is defined as the set of limits of all nets contained in a subset A of X, then A satisfies the axiom of closures ( 425 Topological Spaces), and a topology can be defined on X. Then we have the foIlowing propositions: (i) a E A if and only ifthere is a net {x,} with x, E A converging to a; (ii) U is a neighborhood of a if and only if x,-->a implies that there exists an iXo such that X,E U for all iXexo' Thus, if X is a topological space, it carries a "new" topology defined by way of convergence of nets. But this "new" topology coincides with the original one, Similarly, starting from convergence of filters (or nets), we can obtain a "new" definition of convergence of filters (or nets), which coincides with the initial one. In conclusion, defining a topology on a space X is the same thing as defining convergence of filters in X or of nets inX. We shaH describe here a few notions on topological spaces in terms of convergence. The fact that a topological space X is compact is equivalent to the fact that every universal net in X converges, and to the fact that every ultrafilter in X converges. Also equivalent is the fact that every net has a convergent subnet. A mapping f from a topological space X into a topological space Y is continuous at a if and only if one of the foIlowing conditions is satis- fied: (1) for any net {x,} in X converging to aE X, we have f(x,)--> f(a) in Y; (2) for any filter 11> in X converging to aEX, we have f(I1»== {J(M) 1 M EI1>} -+ f(a) in Y; (3) f(x)...... f(a) in Yas x-->a in X (in the sense of the limit of a func- tion at a). M. Frechet [6J gave a definition of a topol- ogy on a space using the notion of conver- gence as a foundation. A set is caIled an L- space (or Frechet L-space) if convergence of sequences of points in it is defined so as to satisfy conditions (1) and (2) of (S) and (S*) (1906). Such convergence is caIled star conver- gence if it also satisfies (3) of (S), and in that case the space is caIled an L*-space. For any subset A of an L-space X, define If as the set of all points a such that X n --> a for some Se- quence {x n } contained in A. Then the axioms 
329 A =:>A, AUB =AUB, and 0 =0 are satis- fied, so that X is a -,?generalized topological space (the axiom A  A is not nccessarily satisfied). For a Hausdorff space X with the tfirst countabiIity axiom and convergence of sequences defined by means of its topology, the closure operation defined above gives the same topology as the initial one. L. (0 )-Convergence A sequence {an} of elements of an ordered set S is said to be (o)-convergent to an element a of S if there exist two sequences {un} and {V n } such that un.s; an.s; Dn, un.s; U n + l , and V n  D n + l , and a=supun=infv n . When we write this an-->a, properties (I) and (2) of(S) and (S*) con- cerning the convergence of sequences hold. Next, a sequence {an} is said to be (o)-star convergent to a if any subsequence of {an} has a suitable subsequence which converges to a. Then (o)-star convergence satisfies the prop- erties (S) and (S*). For any set X the set I13(X) of all subsets of X is an ordered set under the inclusion rela- tion. The fact that a sequence {An} of su bsets is (o)-convergent to a subset A is equivalent to: 00 r::f) r::f) 00 A = n U An = U n An. m=l n==m m=;ln;m The set A is also equal to Iim An' which is the tlimit of a sequence {An} of subsets. References [1] W. Rudin, Principles of mathematical analysis, second edition, McGraw-Hili, 1964. [2J 1. L. Kelley, General topology, Van Nos- trand, 1955. [3] G. Birkhoff, Lattice theory, Amer. Math. Soc. Colloq. Publ., revised edition, 1948. [4] N. Bourbaki, Elements de mathematique III, Topologie generale, ch. 1, Actualites Sci. Ind., 1142d, Hermann, fourth edition, 1965; English translation, General topology, Addison-Wesley, 1966. [5J 1. W. Tukey, Convergence and uniformity in topology, Ann. Math. Studies, Princeton Univ. Press, 1940. [6] M. Frechet, Sur quelques points du ca\cul fonctionnel, Rend. Circ. Mat. Palermo, 22 (1906),1-74. [7] E. H. Moore and H. L. Smith, A general theory of limits, Amer. 1. Math., 44 (1922), 102-121. [8J G. Birkhoff, Moore-Smith convergence in general topology, Ann. Math., (2) 38 (1937), 39-56. 88A Convex Analysis [9J H. Cartan, Theorie des filtres, C. R. Acad. Sci. Paris, 205 (1937),595-598; Filtres et ultra- fiItres, ibid., 777-779. For Landau's symbol, [IOJ E. G. H. Landau, Handbuch der Lehre von der Verteilung der Primzahlen, Teubner, 1909 (Chelsea, 1953). 88 (X.4) Convex Analysis A. Convex Functions A real-valued function f(x) defined on a tcon- vex set D in a linear space over R is called a convex function if for every x, YE D and O.s; A.s;1 we have f(h+(I-A)y).s;Af(x)+(I-A)f(Y). (1) The function f(x) is called a strictly convex function if the sign .s; in (1) is replaced by < for x# y except when ..1. is either 0 or 1. If -t{J(x) is convex (strictly convex), the function t{J(x) is called a concave function (strictly con- cave function). The notion of convex function was introduced by 1. L. W. V, Jensen [IJ for the case where the domain D is an interval on the real line R. Sometimes the condition for a convex func- tion is weakened in such a way that (1) is as- sumed only for ..1.= 1/2. However, if D is a ttopologicallinear space and f is continu- ous, then the weakened condition implies the original one. Hereafter, we mainly consider the case where D is an interval on the real line. In this case, a convex function f(x) (in the weaker sense) is continuous in the interior of the inter- val if f(x) is tmeasurable or bounded from above on a set of positive measure (the latter was proved by A. Ostrowski [2]), In particu- lar, suppose that f(x) is defined in the interval I and is bounded from below. Then either f(x) is continuous or its graph is dense in the set {(x,Y)lxEI,yg(x)}, where g(x) is a suitable convex continuous function (Hukuhara [3]). We note here that the original definition of a convex function f(x) implies the continuity of f(x) in the interior of the interval. In such a case, f(x) always has tright and tleft deriva- tives and satisfies f'-(x) .s;f(x).s; f'- (y).s; f (y) for x <yo Hence it is differentiable ex- cept for at most countably many points. A function f(x) is a continuous convex function in a.s; x.s; b if and only if it is expre- ssible in the form f(x) = f(a) + r <p(t)dt, 
88 B Convex Analysis where <p(t) is a monotone increasing function. If f(x) is twice differentiable, then f"(x) O (a <x <b) is a necessary and sufficient condi- tion for f(x) to be convex in (a, b). B. Convex Functions and Inequalities If f(x) is convex (in the original sense), we have, for a,>O, f(Lavxv/Lav).s;LaJ(Xv)/Lav. (2) Similarly, we have, for <p > 0, f(f <PljJdx/ f <Pdx).s;f <Pf(ljJ)dx/ f <pdx. (3) The functions x" (a> lor a<O), -x" (O<a< 1), -Iogx, xlogx are strictly convex for x>O, and the functions x 2n (n  1), exp x, log(1 + eX), Ja 2 +x2 (a#O) are strictly convex in -00<:: x < +00. Applying the inequalities (2) or (3) to these functions, we 0 btain various inequalities, including the inequalities on means ( 211 Inequalities). A continuous convex function f(x) over a topological linear space satisfying the relation f(rxx) = rxf(x) for an arbitrary positive number rx is calIed a suhadditive functional and is often utilized in functional analysis. C. M. Riesz's Convexity Theorem Let X=(I' ""n) be an n-tuple of com- plex numbers, and let vO. We put Nv(x) = (Lj'II)I/T for v>O and No(x)=suplJ Let (exi) be an m x n complex matrix, x= (I, "', n), Z = «(I' ''', (m), v O, and J1 O. We put 1 m n I M(v,J1)= sup L L rxijCj . Nv(x)l,N.u(z)l i=l j=l Then logM(v,J1) is a convex function of (v, J1) in the folIowing sense: Let 0 < vi.s; 1, 0 < J1i.s; 1, and Vi + J1i  1 (i = 1,2). Then log M«I- t)v I + tv 2 , (1- t)J11 + tJ12) is a convex function with respect to t for 0 .s; t .s; 1 [4, 5]. These results are calIed M. Riesz's convexity theorem, Famous inequalities such as the tH61der inequality or the tMinkowski inequality follow from this theorem. For example, let T be an tadditive operator from the tfunction space Lp(Q) into Lp(Q) for alI 1 .s; p.s; 00. If T is a continuous operator for p== 1 and p== 00, and the norm of Tis .s;Cforp=1 andp=oo, then Tiscon- tinuous for alI p (1 < P < 00), and its norm is always .s; C. 330 D. Subdifferentials of Convex Functions Currently, convex analysis is playing an im- portant role in the study of nonlinear evolu- tion equations. This analysis treats convex functions on infinite-dimensional spaces [9]. In convex analysis, it is sometimes conve- nient to consider a proper convex function <p on a Hilbert space X. A mapping <p from X into (-00, +00 ] is calIed a proper convex function if <p is not identicalIy equal to +00 and if <p(h+(I-A)Y).s; A<p(x)+(I-A)<p(y) for every x, Y E X and O.s; ..1. .s; 1. The convex set D(<p) = {XEX I <p(x) < oo} eX is calIed the effective domain of <po The subdif- ferential a<p of <p is defined to be a muItivalued function which assigns to each x in D(<p) alI elements YEX atisfying <p() <p(x) + <y, - x> for any EX. Then a<p is monotone in X in the folIowing sense: If YI Ea<p(xd and Y2 E a<p(X2)' then <YI-Y2,XI-X2>0. Under some conditions on <p, a<p becomes a single-valued function, as is seen in the folIow- ing example: Let A be a nonnegative self- adjoint operator in a Hilbert space X. For the t[ractional power JA of A, define <p by {  IIJAxI12, xED(JA), <p(x)= 2 +00, otherwise. Then D(<p)=D(JA) and a<p=A. E. Convex Functions and Nonlinear Semigroups Let <p be a lower semicontinuous proper con- vex function on a Hilbert space X. Since A = -a<p is a maximal tdissipative operator in X, it generates a nonlinear semigroup {1; I t  O} on the closed convex set D(A) : 1;x=\im(I -rA)-['/'J x dO ( 286 Nonlinear Functional Analysis, 378 Semigroups of Operators and Evolution Equa- tions). It can be shown that for any aE D(A) , 1;aED(A) whenever t>O. Thus the tabs tract Cauchy problem du dt +a<p(U)30 (t>O), u( +O)=a, is considered to be "parabolic." Furthermore, 
331 we have II fr T,a II lla- T,all (t>O) for all aE D(A) (H. Brezis [IOJ). References [IJ 1. L. W. V. Jensen, Sur les fonctions con- vexes et 1es inegalites entre les valeurs moy- ennes, Acta Math., 30 (1906), 175-193. [2J A. Ostrowski, Mathematische Miszellen XIV, Uber die Funktionalgleichung der Ex- ponentialfunktion und verwandte Funktion- alg1eichungen, Jber. Deutsch. Math. Verein., 38 (1929), 54-62. [3J M. Hukuhata, Sur la fonction convexe, Proc. Japan Acad., 30 (1954), 683-685. [4] M. Riesz, Sur 1es maxima des formes bilineaires et sur les fonctionelles lineaires, Acta Math., 49 (1927), 465-497. [5J G. O. Thorin, An extention of a convexity theorem due to M. Riesz, Medd. Lunds Univ. Mat. Sem., 4 (1939),1-5. [6J G. H. Hardy, 1. E. Littlewood, and G. P61ya, 1nequahues, ch. 3, Cambridge Univ. Press, 1934, revised edition, 1952. [7J N. Bourbaki, Elements de mathematique, Espaces vectoriels topologiques, ch. 2, Ac- tua1ite Sci. Ind., 1189a, Hermann, second edition, 1966. [8J F. A. Valentine, Convex sets, McGraw- Hill, 1964. [9J V. Barbu, Nonlinear semigroups and dif- ferential equations in Banach spaces, Noord- hoff, 1976. [10J H. Brezis, Operateurs maximaux mono- tones et semi-groupes de contractions dans 1es espaces de Hilbert, Math. Studies 5, North- HolIand, 1973. 89 (VI.11) Convex Sets A. General Remarks A non empty subset X of the n-dimensional Euclidean space Rn is called a convex set if for any elements x, y in X and any number a such that Oa.s; 1, the element ax+(I-a)y ofR n is also contained in X. The tinterior and the tclosure of a convex set are also con- vex. A point x of a convex set X is called an extreme point of X if x cannot be expressed as (x 1+ x2)/2 in terms of a pair of distinct points x" X2 in X. A bounded closed convex set is calIed a convex body if it has tinterior points. 89B Convex Sets Given an arbitrary nonempty subset X ofR n , the minimum convex set containing X exists, called the convex hull of X and denoted by [X]. Each point x of [XJ can be expressed as x = L?:l aix;, where Xi belongs to X and the a i are nonnegative numbers such that L?:l a i = 1. When X is a finite set, [XJ is calIed a convex polyhedron. If X denotes the set of extreme points (also calIed vertices) of a convex poly- hedron X, then X = [X]. For elements x, y of Rn, denote the inner product by (x,y). Given a nonzero element v of Rn and a fixed number a, the thyperplane H = {xl(v,x)=a} divides the space R n into two thalf-spaces {xl(v, x).s; a} and {x I(v, x);;;. a}, each of which is a closed convex set. If a con- vex set X is contained in one of the half-spaces S determined by the hyperplane H and the boundaries of X and H intersect, then we say that H is a supporting hyperplane of X and S is a supporting half-space of X. A closed convex set X is the intersection of its supporting half- spaces. A boundary point of a convex set X is contained in a supporting hyperplane of X. Given mutually disjoint convex sets X and Y, the separation theorems (I) and (2) hold. (1) If X has inner points, then there exist a nonzero element v of Rn and a number a such that X is contained in the set {x I (x, v);;;. a} and Y is contained in the set {x I (x, v).s; a}. (2) If X and Yare closed and X is bounded, then we can replace the signs .s; and;;;' in (1) by < and>, respectively (when the separa- tion of convex sets X and Y is described by strict inequalities, we say that X and Yare strongly separated). As an immediate consequence of the separa- tion theorems, we obtain the following pro- position: Suppose that A is an m x n matrix with real entries. For an element z in a Eu- clidean space, we write z;;;' 0 (> 0) if each component of z is ;;;'0 (>0) Now if'Ay>O never holds for an m-dimensional vector y > 0, then there exists a nonzero n-dimensional vector x;;;' 0 such that Ax.s;O ( 173 Game Theory). The definitions given previously for subsets of R n can be naturally extended to the case of trea1 topo1ogica11inear spaces ( Section G). Also, in the theory of analytic functions of several complex variables, various notions of convexity of the subsets of en are considered ( 21 Analytic Functions of Several Complex Variables). B. Helly's Theorem Suppose that we are given an index set ;\ of cardinality greater than n + 1, and bounded closed convex sets C A (AE;\) in Rn. If any n + 1 
89 C Convex Sets sets of the C A have nonempty intersection, then the intersection of all the C A is nonempty (Helly's theorem). This theorem has a wide range of applica- tions. For example, we have propositions (1)- (4). (1) If a convex set X of Rn is covered by a finite number of half-spaces, then X can be covered by no more than n + 1 half-spaces among them. (2) Let X and Y be finite subsets of Rn. X and Yare strongly separated by a hyperplane if for an arbitrarily chosen subset S of X U Y consisting of at most n + 2 points, the sets S n X and S n Yare strongly separated by a hyperplane. (3) If the tdiameter of a subset X of Rn is not greater than 2, then X is contained in a tball of radius (2n/(n + 1)) 1/2. (4) Let X be a convex body in Rn. There exists a point x in X such that Ilx-ull/llv-ull.s;n/(n+ 1), where u, v are points of intersection of an arbi- trary straight line passing through x with the boundary of X, and IIxll denotes the length (x, X)I/2 of x. Helly's theorem can also be applied to problems of approximation of functions. C. Ovals The boundaries of convex bodies in R 2 and R 3 are calIed ovals and ovaloids, respectively. An oval E is a tJordan curve C(t) which admits at every point Po = C(to) left and right "tangents" 1;0 and 1;0' where lio is a straight line expressed as the set of points P(A), AER such that P(A) - Po= Aa I , with a I =limhi:o(C(t)- Po)flC(t) - Pol). There may exist exceptional points P on E for which left and right tangents do not coincide, but the set of such points is at most countable. Each tangent lio shares a point or a segment with E. A line satisfying this condition is called a supporting line of the oval. It is also a supporting hyperplane in R 2 of the convex body [EJ in the sense of Section A. If we fix an interior point 0 of a convex body X and take an arbitrary point P different from 0, then the boundary E of X admits one and only one supporting line I(P) which is perpendicular to the line OP and meets the half-line OP. Take a rectangular coordinate system with the origin 0, and let (x, y) denote the coordinates of P. Then the points (, rT) on I(P) satisfy the equa- tion x+r]y=H(x,y), where H(x,y) is a func- tion determined for all (x, y) in R 2 and satisfy- ing the folIowing conditions: (i) H (0,0) = 0; (ii) H(tx, ty)= tH(x,y), for tO; (iii) H(x i +X 2 'YI + Yz).s;H(x"Yd+H(X2'Y2)' The function H (x, y) is called the supporting line function of E. The magnitude and shape of E are deter- mined by H, and any function satisfying con- 332 ditions (i)-(iii) is a supporting line function of an oval. An oval E has a finite length L = L(E), and the convex body [EJ has a finite area F = F(E). IfOP' denotes the half-line with direc- tion opposite to that of OP and /' denotes the supporting line I(P'), the distance between the parallel lines I and /' is called the breadth of E in the direction PP'. Let D=D(E) and =(E) be the maximum and minimum of the breadth of E, respectively. D is the tdiameter of E (or [EJ), and  is called the thickness of E (or [EJ). In particular, if D =, then the oval E is called a curve of constant breadth. In the fol- lowing inequalities, equality holds only when E is one of the figures mentioned in paren- theses: (1) U  4nF (circles, 1. Steiner (1838) ( 228 Isoperimetric Problems); (2) nD24F (circles, L. Bieberbach (1915»; (3) L.s;nD (curves of constant breadth, W. Blaschke (1916)); (4) F2/.j3 (regular triangles, 1. Pal (1921». See T. Kubota, T8hoku Sci. Bull., I, 12, 13; T8hoku Math. J., 24, 49. D. Linear Combinations of Ovals Let HI and H 2 be supporting line functions of ovals EI and E 2 , and let t l and t 2 be positive numbers. Since the function tlH I +t 2 H 2 satis- fies conditions (i)-(iii), given before, it is a supporting line function of an oval E(t I' t 2)' In this case, we can also write E(t I ,(2) = tiEl + t 2 E 2 and calI it a linear combination of EI and E 2 . In particular, the oval (EI + E 2 )/2 is called the mean oval of EI and E 2 . In general, there exists a quantity M, called the mixed area of EI and E2' such that F(E(t"t 2 ))= F(Edti+2Mt l t 2 +F(E 2 )ti. M does not depend on the choice of t I and t 2, and M 2  F(EdF(E2). Here, the equality holds if and only if EI and E 2 are homo the tic and situated in a position of homothety. Furthermore, if O.s; t.s; 1, then the square root of F(E(t, 1- t) is a tconvex function of t (H. Minkowski). E. Specific Ovals Suppose that we are given an equilateral triangle ABC. Draw three circles C" C 2 , and C 3 with centers A, B, and C and radii equal to the length of the sides of ABC. The minor arcs AB, BC, and CA of the circles form an oval which is called a Reuleaux triangle. This oval is of constant breadth. Furthermore, given a fixed breadth D, the area F(E) of an oval E of constant breadth D attains its minimum when E is a Reuleaux triangle. A Reuleaux triangle obtained from a triangle ABC revolves freely within the square of side AB and touches each side. In general, an oval which revolves touch- 
333 ing the sides of a convex polygon from the inside is called an in revolvable oval. Any such oval revolves inside some regular polygon (M. Fujiwara, S. Kakeya). Various properties of an oval already de- scribed can be generalized to the case of a boundary of a convex body in Rn. For the volumes of subsets A, Bin Rn and A + B = {x + YlxEA,YEB} in R n , the Brunn-Minkowski inequality [vol(A + B)] I/n  [vo1(A)] I/n + [vol(B)] I/n holds. F. Convex Cones A non empty subset X of Rn is called a convex cone if for any elements x, Y of X and a non- negative number a, ax and x + yare contained in X. A convex cone is a convex set. Given any nonempty subset X of Rn, the minimum con- vex cone K(X) containing X exists. Given two convex cones X and Y, a convex cone X + Y, called the sum of X and Y, is defined as the set of elements x + y, where x, yare elements of X, Y. The intersection of convex cones X and Y is also a convex cone. Given a convex cone X, the subset of Rn consisting of the elements y such that (x,y).s;O for any element x in X is a convex cone which is called the dual convex cone (or conjugate convex cone) of X, denoted by X*. If X is a finite set, K(X) is caIled a convex polyhedral cone. For example, if v is a nonzero vector, then the half-line (v) = {x I x =av, a O} or the half-space (v)* = {xl(v,x).s;O} is a convex polyhedral cone. A convex polyhedral cone is closed. A convex cone X is a convex polyhedral cone if and only if X is the sum of a finite number of half-lines. Given convex cones X and Y, we have pro- positions (1H3): (1) If XI cX 2 , then Xi c xt; (2) (XI +X 2 )*=xt nxi'; (3) xt+Xi'c (XI nX 2 )*. If Xl and X 2 are convex poly- hedral cones, then xt+Xi'=(X I nX 2 )*. Gen- erally, Xc (X*)* = X** for a convex cone X. If X is a closed convex cone, then X =X**. Namely, the duality principle holds for closed convex cones. A linear subspace of R n is a convex polyhedral cone. Also, if A is an m x n real matrix, the subsets {x I Ax = 0, x  O} and {x I Ax  O} are convex poly hedra1 cones. Since the duality principle holds for convex poly- hedral cones, we obtain the tMinkowski- Farkas theorem (i.e., if A is an m x n real matrix and v is an element of Rm, then the equation Ay=v has a solution yO in Rn if and only if(v,x)O for all xER m such that tAxO). (For linear inequalities  255 Linear Programming. ) 89 Ref. Convex Sets G. Convex Sets in Function Spaces The definitions of convex sets and convex cones in Rn can be naturaIly extended to the case of any real linear space. Some of their properties can be generalized and applied to the case of function spaces. (1) Let E be a tJocally convex real topolog- ica11inear space satisfying Hausdorff's tsepara- tion axiom. Let A and B be convex sets in E, and assume that B has interior points and An B is empty. Then A and B are separated by a hyperplane. Namely, there exists a nonzero tcontinuous linear functional f on E such that supf(A) .s;inff(B), (2) Let E be as in (1), and let C be a convex set in E. If a boundary point x of C admits a nonzero continuous linear functional f such that f(x)==supf(c), we call such a point x a supporting point of C, and f a supporting func- tional of C. If C has interior points, then any boundary point x of C is a supporting point of C. (3) Let C be a closed convex set of a tBanach space E. The set of supporting points of C is dense in its boundary. A convex set C contained in the dual space E* of a real topological linear space E is caIled a regularly convex set if for any fo in E* not contained in C, there exists a point Xo in E such that sup{l(xo)lfEC} <fo(x o )' Let E be a real topological linear space satisfying Hausdorff's separation axiom, and let C be a closed convex cone having 0 as its extreme point. Furthermore, assume that C n (- C)== {O}.lfwe set x.s;y when y-XEC, a partial ordering .s; is defined in E. For example, if E is R n , then the positive orthant C = {x = (x,) I Xi O, i = 1, ..., n} satisfies these requirements, and the partial ordering x.s;y defined by means of C is equivalent to the relation Xi .s;Yi for all i. Some of the properties of matrices of posi- tive entries or tintegra1 operators whose tker- nel functions are positive-valued can be gen- eralized to properties of mappings f: E --> E such that f(C) c C ( 255 Linear Program- ming; for the tKrein-Milman theorem  424 Topological Linear Spaces). References [IJ W. Blaschke, Kreis und Kugel, Veit, 1916 (Chelsea, 1949). [2J T. Bonnesen and W. Fenchel, Theorie der konvexen Korper, Erg. Math., Springer, 1934 (Chelsea, 1948). [3] W. Fenchel, Convex cones, sets and func- tions, Lecture notes, Princeton Univ. Press, 1953. 
90A Coordinates [4J Convexity, Amer. Math. Soc. Proc. Symp. in Pure Math. VII (1963). [5J M, G. Krein and M. A. Rutman, Linear operators leaving invariant a cone in a Banach space, Amer. Math. Soc. Transl., 26 (1950). (Original in Russian, 1948.) [6J H. Nikaido, Convex structures and economic theory, Academic Press, 1968. [7J B. Griinbaum, Convex polytopes, Inter- science, 1967. [8J I. M. Yag10m (Jag1om) and V. G. Bol- tyanskii, Convex figures, Holt, Rinehart and Winston, 1961. (Original in Russian, 1951.) [9J G. Choquet, Lectures on analysis, vols, I, II, Benjamin, 1969. [IOJ R. T. Rockafellar, Convex analysis, Princeton Univ. Press, 1970. 90 (VI.13) Coordinates A. General Remarks Suppose that we are given a Euclidean plane E 2 and two lines X'X and Y'Y in E 2 per- pendicular to each other. Let 0 be the point of intersection of X'X and Y'Y. We identify each of the straight lines X'X and Y'Y with the set of real numbers R; the point 0 on each line is identified with zero, Let P be an arbitrary point in E 2 . We draw lines PQ, PR parallel to Y'Y, X'X, where Q, R are on X'X, Y'Y, respec- tively. Let x and y be the real numbers corre- sponding to Q and R. Thus we obtain a map- ping sending the point P to the ordered pair (x, y) of real numbers. This mapping gives a one-to-one correspondence between the points P of E 2 and the ordered pairs (x, y) of real numbers in R 2 . The numbers x and yare called the coordinates of P. In general, given a set of mathematical objects, if we have a mechanism that assigns quantities to each element of the set, then such a mechanism is called a coordinate system on the set, and the quantities corresponding to each element are called its coordinates. In the previous example, the mechanism is called a rectangular coordinate system. Coordinate systems are also useful in expressing quantita- tive concepts by geometric ones which are intuitively easier to grasp, e.g., diagrams of train schedules and tnomograms. tMap projec- tion, tgraphical calculation, tdescriptive geom- etry, etc., may be viewed as applications of the concept of coordinate systems. In many cases, when we introduce a coor- dinate system in a space, it is determined uniquely by fixing a basic figure in the space. 334 In the case of a rectangular coordinate system on a plane E 2 , the basic figure consists of X'X and Y'Y, which are called coordinate axes (the point 0 is called the origin). Sometimes it is convenient to consider real-valued functions f and 9 on R and a coordinate system on the plane E 2 determined by the function that sends an arbitrary point P to (/(x), g(y)), where (x, y) are the coordinates of P in the rectangular coordinate system. Logarithmic papers, tprobability papers, and tstochastic papers (binomial probability papers), etc., are constructed in this way to fit their respective purposes. In various branches of mathematics there are many varieties of coordinate systems. In this article we deal with frames and coordi- nates, curvilinear coordinates, and local coordinates. B. Frames and Coordinates Suppose that we are given a space M and a ttransformation group G acting on M. It is desirable to introduce a coordinate system that best represents the geometric structure of M. Let G* be a set of figures in M such that G acts tsimp1y transitively on G *. Each element of G* is called a frame. Utilizing each frame as basic figure, we introduce a coordinate system that is "G-invariant" in the following sense: Let REG*, XEM, and CR(X) be the coordi- nates of X with R as basic figure. Then the coordinate system is G-invariant if CR(X) = CgR(gX) for any element 9 in G.lfwe have such a coordinate system for each REG*, then the expressions of geometric properties of M in terms of the coordinates are independent of the choice of frames. (1) Projective Coordinates. Let M be an n- dimensional tprojective space pn over a field K, and let G be the tprojective transformation group of pn. As a frame we can take the system of n+ 1 points (Ao, AI>"" An) in general po- sition. The thomogeneous coordinates of an arbitrary point X Epn are given by the (n+ 1)- tuple (xo, XI' ..., x n ) satisfying the equation X = LjoxjAj, xjEK. They are called tprojec- tive coordinates. In fact, if (x o , Xl, .., ,X n ) # (0,0, ...,0), then (xo,x l , ...,x n ) and (ho,h" ..., h n ) (..1.#0) represent the same point in pn. A thyperp1ane n of pn is expressed as the set of points whose coordinates (x o , x I, ... ,x n ) satisfy a linear homogeneous equation LjOXjUj=O, ujEK. Therefore, the hyperplane n is represented by the homogeneous coor- dinates (u o , U 1 , ..., un), called thyperplane coordinates of n. 
335 (2) Affine Coordinates. Let M be an n- dimensional taffine space En, and let G be the tgroup of affine transformations of En. As a frame we can take the system (0; e l , e 2 , ... ,en), where 0, caned the origin, is a point in En, and the set of vectors {e;} is a basis of the tstan- dard vector space of En. Then tinhomogeneous coordinates of an arbitrary point X E En are given by the n-tuple (XI' X2, ..., x n ), where X = o + Li'1 xie i . They are also calIed taffine coordinates of X in En. Sometimes we replace G above by the group of tequivalent affinities and consider the frames (0; e I' e 2 , . ., ,en) such that the volume of [e l , ..., en] = 1. Furthermore, if En has the structure of a tEuclidean space, we sometimes replace G by the tgroup of motions and consider a sys- tem of rectangular coordinates determined by an torthogonal frame, that is, a frame (0; e l , e 2 , "', en) such that the inner product (ei,ej)=,)ij, where,) is the tKronecker delta. By contrast, the general affine coordinate system of a Euclidean space is caned a system of oblique coordinates. In this case the inner prod- ucts (e i , e j ) = gij are invariants of Euclidean geometry, and the distance p between two points (Xi) and (y;) is given by p = (Li',j=1 giiYi -x;)(Yj-X))1/2. We sometimes consider an oblique coordinate system satisfying (ei> e i ) = 1 (i == 1,..., n). In such cases the angle (Jij between two basis vectors e i and e j is determined by gij = cos (Jij' (3) Barycentric Coordinates. In an n- dimensional affine space En, we take n + 1 linearly independent points Ao, AI, .,., An and denote the position vectors from a point 0 to these points by a o , ai, ..., an, respectively. Then for any point X E En there exists a unique set of numbers (..1. 0 , ..1.1' "', An) such that X = 0 + LJ=o Aja j , LJ=o Aj = 1. We can these numbers barycentric coordinates of X in En. They are independent of the choice of the point O. (4) Pliicker Coordinates. Let V(n,m) be the set of all m-dimensional subspaces in an n- dimensional projective space pn. Then V(n,m) has the structure of a tGrassmann manifold. In order to introduce a coordinate system on V(n, m), we fix a projective coordinate system on pn. An m-dimensional subs pace nE V(n, m) in pn is spanned by m + 1 independent points Bo, BI'"'' BmEpn. We denote projective co- ordinates of these points by (b oj ), (bij), ..., (b mj ) and construct the determinants bOjoboj1 ... b Ojm Pjoj 1 jm = . . . . . . . . ., 0 .s;j 0, .. . ,j m .s; n. bmjobmj1 ... b mjm Then the subspace n can be represented by homogeneous coordinates ( ... , PjOj1 .jm"")' 90B Coordinates These coordinates are independent of the choice of m + 1 points that span n and are called Pliicker coordinates (or Grassmann coordinates) of n in V(n, m). In these coordi- nates, the Pjoj, jm are alternating and satisfy the Pliicker relations m+1 L (-I)kpi1i2 imj,Pjo' ,J, jm+' =0, k-O where.h means that jk is removed. In partic- ular, when n == 3 and m == 1, we have only one Plucker relation Q :POIP23 - P02PI3 + P03P12 = 0, which is a homogeneous equation of the second degree. In other words, the set V(3, 1) of alIlines in a 3-dimensional projective space p3 is realized as a quadric surface Q in a 5- dimensional projective space p5 that has (POI,P02,P03,P12,PI3,P23) as projective co- ordinates. Moreover, when p3 is a complex projective space, we put POI =0+i3' P02=1 +i4' P03=2+i5' P23=0-i3, P13= -I +i4' P12 =2-i5 and obtain a relation 5 + a + i +  +; +  = 0 corresponding to the Plucker relation. Thus every line in p3 can be represented by homo- geneous coordinates (O, l'..., 5)' which we call Klein's line coordinates. (5) (n + 2)-Hyperspherical Coordinates. Let (x o , X I' ... ,X n , Xoo) be projective coordinates in an (n + 1 )-dimensional real projective space pn+! . An n-dimensional tconformal space sn is realized as a quadric hypersurface sn in pn+!: Li',j=1 gijxiX j - 2xoxoo = 0, where (gij) is a positive definite symmetric matrix. A general point in pn+1 represents a thypersphere of sn, That is, a hypersphere represented by a point X Epn+1 is realized as the intersection of sn with the tpolar hyperplane of X with respect to sn; according as X lies outside of sn, on sn, or inside of Sn, it represents a real hypersphere, a point hypersphere, or an imaginary hyper- sphere. Therefore any hypersphere of sn in pn+1 is expressed by homogeneous coordinates (x o , XI' "', X n , x oo ), calIed (n + 2)-hyperspherical coordinates. When n = 2, they are calIed tetra- cyclic coordinates, and when n = 3, pentaspher- ical coordinates. Therefore, if we restrict (n + 2)- hyperspherical coordinates for points on Sn, then they satisfy the quadratic relation stated before. In the frame (A o , A" "', An' Aoo) of pn+1 which defines the (n + 2)-hyperspherical co- ordinates, Ao and Aoo are points on sn, and the other Ai are real hyperspheres passing through the points Ao and Aoo. It is possible to choose a frame (Ao, AI' ." ,An, Aoo) such that the equa- tion for sn becomes Li'1 Xf - 2xoxoo = 0 (i.e., 
90C Coordinates gij = ,)i). Among hypersurfaces in sn, one that is expressed by a homogeneous equation of the second degree with respect to (n + 2)- hyper spherical coordinates is called a cyclide. I t is an algebraic surface of the fourth order and is an enveloping surface of the family of hyperspheres that are tangent to n fixed hyperspheres. (6) Moving Coordinates. When we study the differential geometry of an m-dimensional surface W in a space M on which a transfor- mation group G acts, it is often preferable to take a frame or frames at each point of Wand consider a tconnection among them. These frames are called moving frames, and the set of coordinate systems with respect to moving frames is called a moving coordinate system ( 111 Differential Geometry of Curves and Surfaces). C. Curvilinear Coordinates Let (XI, X2, ..., x n ) be a rectangular coordinate system on an n-dimensional Euclidean space En. If Xi = xi(u" U 2 , ..., Un), i = 1, ..., n, are func- tions of n variables (u I, U2, ... , un) of class C' (r  1) and the tfunctional determinant D(XI' ...,xn)/D(u" ...,u n ) is not equal to zero in some open domain, then (u l , U2, "', un) are considered local coordinates in En. We call them curvilinear coordinates of En. A hypersur- face ui=constant (obtained by fixing the value of one of the variables u i ) is called a coordinate hypersurface, and a curve Uj = constant (j # i) is called a coordinate curve. The line element ds of a Euclidean space En is given by n n ds 2 = " dX k 2 = " g ..du.du, L.. L.. l) ! l' k=l i,j=l g..= I aXkaXk '1 k=1 aUi aUj' Thus En is equipped with a tRiemannian metric. However, as En is tflat, its tcurva- ture tensor satisfies Rjkl = O. If tire metric is diagonal, namely, if ds 2 == L7=1 gidu;, the co- ordinates are called orthogonal curvilinear coordinates. Moreover, if 9 I = ... = gn, the coordinates are called isothermal coordinates. The metric is diagonal if and only if coordi- nate curves are mutually perpendicular at the points of intersection. Actually, curvilinear coordinate systems that are often used practi- cally are diagonal. The concept of curvilinear coordinates has been generalized to the case of tdifferentiable manifolds and is utilized to determine their local coordinates. On any 2-dimensional Riemannian manifold there always exist isothermal coordinates in a neigh borhood of any point [7]. 336 (1) Curvilinear Coordinates on Planes or Spaces ( Appendix A, Table 3.V). Let (x,y) be rect- angular coordinates of a point in a Euclidean plane E 2 . We have the following coordinate systems on E2: Polar coordinates (r, e), where x=rcose, y=rsine. Elliptic coordinates (..1., J1), where X2 =(..1. + a 2 )(J1 + a 2 )/(a 2 - b 2 ), y2 = (), + b2)(J1 + b 2 )/(b 2 _ a2), a>b>O, A> -b 2 >J1> _a 2 . Parabolic coordinates (rx, [3), where x= -(ex+{3), y= J -4ex{3 , ex>O>{3. Equilateral (or rectangular) hyperbolic co- ordinates (u, v), where x=uv, y=(U 2 -v 2 )/2. Bipolar coordinates (, 1/), where X = a sin /(cosh  +cos 1/), y = a sin I//(cosh  + cos 1/), -oo<<oo, 0.s;1/.s;2n. Next we consider the case of a 3- dimensional Euclidean space E 3 and let (x,y,z) be rectangular coordinates on E 3 . The follow- ing systems of coordinates on E3 are some- times useful. Cylindrical coordinates (r, e, z), where x=rcose, y=rsine, z=z. Spherical coordinates (r, e, <p), where X = rsin ecos <p, y= rsinesin<p, z = rcos e. Ellipsoidal coordinates (..1., J1, v), where X2 = (..1. + a 2 )(J1 + a 2 )(v + a 2 )/(a 2 _b 2 )(a 2 _c 2 ), y2 = (J, + b 2 )(J1 + b 2 )(v + b 2 )/(b 2 _c 2 )(b 2 - a 2 ), Z2 = (J, + c 2 )(J1 + c 2 )(v + C 2 )/(C 2 - a 2 )(c 2 - b 2 ), a>b >c> 0, A> _c 2 > J1> _b 2 > v> _a 2 . These coordinate systems are all systems of orthogonal curvilinear coordinates. Suppose that we are given two rectangular coordinate systems (, 1/, () and (x, y, z) sharing the same origin. The correlation of the two is given by Euler's angles (e, <p, t/J), where e, <p, and t/J are the angles between the z-axis and (-axis, zx- plane and z(-plane, and (-plane and (z-plane, respectively. The Euler angles e, <p, and t/J are subject to the inequalities O.s; e.s; nand O.s; <p, t/J < 2n. They are often utilized in the dynamics of rigid bodies. (2) Multipolar Coordinates. Let PI' P 2 , ..., Pm be m points in general position in an n- dimensional Euclidean space En, m.s; n. If we 
337 denote by Pi (O) the distance between a point X of En and Pi' then (Pl,P2, ...Pm) can be re- garded as coordinates of a point X contained in a suitable domain of En. They are called multipolar coordinates. In particular, if m = 2 they are called bipolar coordinates, and if m = 3, tripolar coordinates. When m > n, these COor- dinates satisfy m-n relations. Next let aI, IX 2 , ..., IXm be m hyperplanes in general position in En, m.s;n. For an arbitrary point X of En, we denote by (i the directed distance of X from each hyperplane IXi' The m-tup1e (" 2' ..., m) provides coordinates of X that are called multiplanar coordinates in En. When m> n, these coordinates satisfy m - n relations. In particular, when n = 2 and m = 3, they are called trilinear coordinates. In this case, if we denote by S the area of the triangle defined by three lines aI' a2, a 3 , and by ai' a 2 , a3 the lengths of the three sides of the triangle, then the trilinear coordinates (I' 2' (3) satisfy a linear relation a l I + a 2 2 +a33 = 2S. (3) Tangential Polar Coordinates. In a Eu- clidean plane E 2 , we take a directed line 10 passing through a point 0, For an arbitrary directed line g, let p be the directed distance between 0 and g, and let 0 be the angle be- tween 10 and g. Then (p, 0) are called tangential polar coordinates (Fig. 1). They are useful for representing tangent lines to curves in E 2 . Let C be an toval in E 2 . A line is called a tsupport- ing line of C if its intersection with C consists of a point or a line segment. In this case we take the origin 0 inside C and consider the coordinates (p, 0) of the supporting lines of C. Then the equation of C can be represented as p= p(O), where prO) is a periodic function of period 2n. The coordinates (p, 0) are especially useful when the function p(O) can be expanded in a tFourier series. In the case of Euclidean space E 3 , the notion of tangential polar co- ordinate system can also be defined by using tangent planes. Lo Fig. 1 Tangential polar coordinates (4) Normal Coordinates. Let M be an n- dimensional tRiemannian manifold, and let TA(M) be the tangent space to M at a point A. For each tangent vector VE TA(M), we draw a 90D Coordinates tgeodesic through A with the initial direction v and take a point P on the geodesic such that the distance from A to P is equal to the length of v. Then the correspondence that sends v to P is a tdiffeomorphism of a neighborhood of the zero vector 0 of TA(M) with a neighbor- hood of A in M. Therefore the components (VI, v 2 , ... , vn) of v with respect to a basis of T A (M) give the coordinates of the points P contained in a suitable neighborhood of A. We call them normal coordinates about the point A of M. In these coordinates, each geodesic passing through A is given by equations Vi = air (i = 1,2, ..., n), where the (IX i ) are compo- nents of the unit vector in the direction of v and r is the parameter that represents the arc length from A to the point (VI".., vn). In partic- ular, ...hen n=2, we fix a tangent vector Vo at A and denote the angle between v and V o by 0, Then (r,O) are coordinates of P called geodesic polar coordinates, The notion of normal co- ordinates can also be defined for tLie groups or differentiable manifolds with taffine connections. D: Local Coordinates Suppose that we have a space M that has a covering by a family of open neighborhoods with coordinate systems. If, for each pair of neighborhoods with non empty intersection, the coordinate transformation in the intersec- tion satisfies certain specified conditions, then a mathematical structure on M can be defined. Let E be a ttopological space. Suppose that 11> is a family of open sets in E such that the union of any number of open sets in 11> and the intersection of any finite number of open sets in 11> also belong to 11>. A set r of thomeomor- phisms is calIed a pseudogroup of transforma- tions on E if r satisfies the following three con- ditions: (i) Any homeomorphism fEr is de- fined on an open set V EI1>, and f(V)EI1>. (ii) When an open set V EI1> is expressed as the union of a family {V;} of open sets V,EI1>, a homeomorphism f defined on V belongs to r if and only if its restriction to each Vi belongs to r. (iii) For any open set V EI1>, the identity mapping on V belongs to r, and if f, gEr, then the inverse f- l and the composition go f, if it exists, belong to r. Let E and M be topological spaces. A homeomorphism <p: V --> V of an open set V in E to an open set V in M is called a local co- ordinate system of M with respect to E. For two local coordinate systems <PI: V I --> VI and <P2: V 2 --> V 2 , the homeomorphism <p;1 0 <PI: <PII (VI n V 2 )--><p;I(V I n V 2 ) is calIed a transformation of local coordinates. Let r be a pseudo group of transformations 
90 Ref. Coordinates on E. A set I: of local coordinate systems of M with respect to E is said to define a r -structure on M if I: satisfies the following two con- ditions: (1) the totality of the images of local coordinate systems belonging to I: covers M; (2) if two local coordinate systems <PI and <P2 of I: have a transformation of local coordinates, it belongs to r. Now suppose that two sets I: and 1:' ofloca1 coordinate systems define r- structures on M. If the union of I: and 1:' defines a r-structure, then we say that the first two r-structures are equivalent. Let r be a pseudo group of tdiffeomorphisms, each de- fined from an open subset of the n-dimensional space Rn onto another open set. If a r- structure is defined on a space M, then M is an n-dimensional tdifferentiable manifold. On the other hand, let r be a pseudo group of complex analytic homeomorphisms in an n- dimensional complex number space cn. If a r- structure is defined on a space M, then M is an n-dimensional tcomplex analytic manifold. Locally homogeneous spaces, tfoliated mani- folds, the tfiber bundles are all equipped with local coordinate systems with suitable r- structures. References [IJ E. Cartan, La methode du repere mobile, la theorie des groupes continus et 1es espaces generalises, Actualites Sci. Ind., Hermann, 1935. [2J C. Ehresmann, Sur 1a theorie des espaces fibres, Colloques Intern. Topo10gie algebrique, Paris, 1949, C. N. R. S. XII, 3-15. [3J F. Klein, Vorlesungen iiber hohere Geome- trie, Springer, third edition, 1926 (Chelsea, 1957), [4J B. L. van der Waerden, Einfiihrung in die algebraische Geometrie, Springer, 1939. [5J O. Veb1en and J. H. C. Whitehead, The foundations of differential geometry, Cam- bridge Univ. Press, 1932. [6J WU Wen-Tstin and G. Reeb, Sur les es- paces fibres et les varietes feuilletees, Actualites Sci. Ind., Hermann, 1952. [7] S. S. Chern, An elementary proof of the existence of isothermal parameters on a sur- face, Proc. Amer. Math. Soc., 6 (1955), 771- 782. 91 (IX.3) Covering Spaces A. General Remarks A continuous mapping p: Y --> Y of an tarcwise connected topological space Y onto a con- 338 nected topological space Y is called a covering mapping (covering map) if the following con- dition (C) is satisfied: (C) Each point of Y has an open neighborhood V such that every tconnected component of p -I (V) is mapped homeomorphically onto V by p. Here we note that Y is in fact arcwise connected. If there is a covering mapping p: Y --> Y, we call Ya covering space of Y and (Y, p, Y) a covering. In particular, for a tdifferentiab1e manifold Y, if Y is also a differentiable mani- fold and p is differentiable, then Y is called a covering (differentiable) manifold of Y. (In the theory of tRiemann surfaces, a tcovering sur- face may have some tbranch points violating condition (C). Upon removing such points, we obtain a covering space as defined above.) F or each tpath w: 1--> Y (l = [0, 1 J) of Y, a path w: 1--> Y with pow = w is uniquely deter- mined by the point W(I)Ep-l(w(I)), and a bijection w#: p-t (w(1 ))-->p-I(W(O)) is deter- mined by w#(w(I))=w(O). Thus there exists a one-to-one correspondence between p -I (y) and p -I (y') for every pair of points y, y' of Y, and (Y,p, Y,p-I(yo)) is a tJocally trivial fiber space with discrete fiber p-I(yo). When the cardinal number of p -I (y) is a finite number n, we call (Y, p, Y) an n-fold covering. In this case, for a tloop w(l, j)-->(y,yo) with base point Yo, w#:p -I (Yo)-->P -I(yo) is a permutation of the n elements in p-I(yo), and we obtain a homo- morphism of the tfundamental group nl(Y)= n l (Y, Yo) of Y into the tsymmetric group G n , given by the correspondence w-->w#' The permutation group \JJl, which is the image of this homomorphism, is called the monodromy group of the n-fold covering. Two coverings (Y;,Pi' Y) (i= 1,2) are said to be equivalent if there is a homeomorphism <p: Y I --> Y 2 with P2 0 <P = Pt; such a <P is called an equivalence. In particular, a self-equivalence <p: Y --> Y of a covering (Y, p, Y) is called a cover- ing transformation. The set n of all covering transformations forms a group by the compo- sition of mappings, which is called the covering transformation group of Y. We call (Y,p, Y) a regular covering iffor each YE Y and h, Y 2 Ep-l(y), there exists a unique covering transformation that maps Yl to Y2' In this case, the torbit space Yln is homeomorphic to Y, (Y, p, Y, n) is a tprincipal bundle, and the monodromy group \JJl is isomorphic to n. F or a covering (Y, p, Y), we call Ya covering group of Y if Yand Yare topological groups and p is a homomorphism. Then (Y, p, Y) is a regular covering, and its covering transfor- mation group is isomorphic to p-I(e) (e is the identity element of Y), which is a discrete subgroup lying in the center of Y ( 423 Topo- logical Groups 0). 
339 B. Universal Covering Spaces F or  covering (Y, p, Y), we have the following relatIOns of the thomotopy groups: p*:n,(Y)--> n,(Y) is isomorphic (monomorphic) for i2 (i= 1), and ndY)/p*(ndY)) is in one-to-one correspondence with p -1 (Yo) (Yo E Y). If Y is tsimply connected, Y is called a universal covering space of Y; if in addition Y is a cover- ing group, Y is called a universal covering group of Y. For a t10cally arcwise connected space Y, a coveng (Y, p, Y) is regular if and only if p*(ndY)) is a normal subgroup ofndY), and a universal covering space of Y is a covering space of any covering space of Y. Moreover, if Y is a topological group, any covering space Y of Y can be given a unique topological group structure with which Y is a covering group of Y. Let Y be an arcwise connected, locally arc- wise connected, and t\ocally simply connected space. Then the following classification theo- rem of coverings holds: The set of equivalence classes of coverings of Y is in one-to-one cor- respondence with the set of conjugate classes of subgroups of the fundamental group nl (Y); in particular, the equivalence class of a cover- ing (Y,p, Y) corresponds to the conjugate class of the subgroup p*(ndY)). Also, there is a unique universal covering space Y of Y up to homeomorphism. If in addition Y is a topolog- ical group, then Y is a unique universal cover- ing group of Y up to isomorphism of topolog- ical groups. Such a space Yis obtained as follows: Consider the tpath space Q(Y;yo, Y) of all paths in Y starting from a fixed point YoE Y, and define two paths w o , wl:(I,O)--> (Y,Yo) such that wo(1)=wd1) to be equiva- lent if and only if there is a thomotopy W,: (I,O)->(Y,yo) with w,(l)==wo(l) (O.s;t.s; 1). Then we obtain the tidentification space Yof Q(Y; Yo, Y) by this equivalence relation and the mapping p: Y --> Y by p{w} = w(l); this Y is the universal covering space of Y. Let (Y,p, Y) be a regular covering with the covering transformation group n. Then there is a tlocally trivial fiber space (Y',q,B, Y) such that the total space Y' has the same t(co) ho- mology groups as Y, and the base space B is an tEilenberg-MacLane space K(n, 1). The (co) homology tspectral sequence of this fiber space is_called that of the given regular covering (Y, p, Y), Eoo is a bigraded module tassociated with a certain filtration of the tsingular (co) homology module H(Y), and E 2 is the (co) homology module H(n; H(Y)) of the group n, where n operates on the coefficient module H(Y) via the induced homomorphisms of covering transformations ( 148 Fiber Spaces). 91 Ref. Covering Spaces For any group n, there is a regular covering (En, p, Bn) with the covering transformation group n such that En is tcontractible; Bn is an Eilenberg-MacLane space K(n, 1). We can take Sl (1-sphere) as Bz of the infinite cyclic group Z, and the following infinite lens space as Bz of the finite cyclic group Zk ( 70 Complexe). C. Lens Spaces Let k be a positi ve integer and I" ... , In be integers prime to k. Let s2n+1 =={(zo, ""Zn)E en+lllzol2 +... + IZnl2 = I} be the unit sphere in the (n+ I)-dimensional complex linear space en+!, and define the rotation I' by y(zo, Z" ..., zn) = (zo exp 2ni/k, Z I exp2nl l i/k,..., znexp2nlni/k). Then the torbit space s2n+I/Z k = L(k; I" "', In), where Zk = Z/kZ is interpreted as the cyclic group generated by 1', is called a lens space. It is art orientable (2n + 1)- dimensional tdifferentiable manifold. Also, the infinite lens space L OO(k) = L(k; 1, ...,1,...) is defined by taking n = 00; the infinite sphere SOO is a k-fold covering space of L 00 (k), and Loo(k)=B z , = K(Zk' 1). Its tcohomology ring is given as follows. (1) For integral coefficients, H 2i +!(L oo(k)) = 0, H 2i (LOO(k»)=Zk (i>0), and the tcup prod- uct of generators of degree 2i and 2j is a gen- erator of degree 2(i + j). (2) Let k == pk' (p is a prime). If p # 2, or p== 2 and k' is even, H*(L 00 (k); Zp)= ;\(e l ) @ Zp[e2]. If p= 2 and k' is odd, H*(LOO(k); Z2)= Z2[e l J (ei is an element of degree i). Here 1\ indicates the texterior algebra over Zp, and Zp[ J the tpolynomial ring over Zp- Two lens spaces L(k; II' ''', In) and L(k';/'l' .. . , I) are of the same homotopy type if and only if k = k' and there is an integer m prime to k with II" .I n = ::!: mn+!1'1 .. .I(modk) [8]. Furthermore, the condition k = k', 1= ::!: l'icl(mod k) holds if and only if L(k; I) and L(k'; /') are homeomorphic. (Sufficiency is shown in [IJ; necessity follows from the fact that the tHauptvermutung is valid for com- binatorial 3-manifolds and that the condition holds if the polyhedra L(k; I) and L(k; /') have isomorphic subdivisions [6].) Also  [7,11J and 65 Combinatorial Manifolds. References [IJ H. Seifert and W. Threlfall, Lehrbuch der Topologie, Teubner, 1934 (Chelsea, 1965). [2J N. Steenrod, The topology of fiber bun- dles, Princeton Univ. Press, 195\. [3J S.-T. Hu, Homotopy theory, Academic Press, 1959. 
92 A Crystallographic Groups [4J S. Eilenberg and S. MacLane, Homology of spaces with operators II, Trans. Amer. Math. Soc., 65 (1949), 49-99. [5J J. H. C. Whitehead, On incidence matrices, nuclei and homotopy types, Ann. Math., (2) 42 (1941),1197-1239. [6J K. Reidemeister, Homotopieringe und Linsenraume, Abh. Math. Sem. Univ. Ham- burg, 11 (1935),102-109. [7J W. Franz, Uber die Torsion einer Uber- deckung,1. Reine Angew. Math., 173 (1935), 245-254. [8J P. Olum, Mappings of manifolds and the notion of degree, Ann. Math., (2) 58 (1953), 458-480. [9J W. S. Massey, Algebraic topology; An introduction, Harcourt, Brace & World, 1967. [10] T. M. Singer and 1. A. Thorpe, Lecture notes on elementary topology and geometry, Scott, Foresman, 1967. [11] M. M. Cohen, A course in simple homo- topy theory, Springer, 1972. 92 (IV.15) Crystallographic Groups A. Space Groups and Point Groups Let G be a discrete subgroup of the group of tmotions ffi in the real n-dimensional Eu- clidean space V. If G contains n linearly inde- pendent translations, then G is called an n- dimensional crystallographic space group, a space group, or a crystallographic group. The following assertions are known to be equiva- lent to each other: (i) A discrete subgroup G of ffi is a space group; (ii) ffijG is compact; (iii) VjG is compact; (iv) There is a compact subset P of V such that V=GP= UgEGgP [1]. The subgroup T consisting of all ttranslations in G is called the lattice group of G. T is a normal subgroup of G and is generated by n linearly independent translations, say, t l , t 2 ,..., tn' Take a point x of V. Then the T-orbit of the point x is called a lattice of G (or T). t I' ..., t n can be identified with a basis of V. Let K be the quotient group G jT and k = gT an element of K. Then k gives rise to a linear transforma- tion k on T via theformula k(t)= gtg- I (the product in G), tE T, and K can be regarded as a subgroup of the torthogonal group of V via the above identification of T with a lattice in V. That is, the set of orthogonal transformations stabilizing x, {gl (g(y) == x + g(y)-g(X),YE V),gEG} forms a group isomor- phic to K. The quotient group K is a finite group and is called the point group of G. A group isomorphic to a point group of an n- dimensional space group is sometimes called 340 an n-dimensional crystallographic group. An algebraic characterization of space groups among abstract groups is given by "A group G is isomorphic to an n-dimensional space group =- G has a normal tfree Abelian subgroup of trank n, which is maximal Abelian and has finite index" [1]. By representing the action of K on Tin terms of a basis of T, K can be regarded as a subgroup of GL(n, Z). Hence we obtain the following crystallographic restriction: If K contains an element of order m, then n  <p(m). Here <p is the tEuler function. For example, if n = 2, 3, then m = 1, 2, 3, 4 or 6; and if n = 4,5, then m= 1,2,3,4,5,6,8, 10, or 12. The number of nonconjugate finite subgroups of GL(n, Z) is finite (Jordan-Zassenhaus theorem;  [2,3,21J). Two space groups G I and G 2 are cal\ed equivalent if they are conjugate via an taffine transformation f of V, i.e., G 2 = fG d -I. Let T; and K i be the lattice group and the point group of G i (i = 1,2), respectively. Then T 2 = h TI and K 2 = hK I h -I for the linear trans- formation h induced by f Moreover, there is an orthogonal transformation k such that K2=kKtk-l; this k can naturally be consid- ered as an element of GL(n, Z). G 1 and G 2 are equivalent =- G 1 and G 2 are isomorphic (as abstract groups) [1]. In applications it is often required that in the definition of equivalence f be an torientation-preserving affine transforma- tion. Under this definition, there can be a pair G I , G 2 such that G l and G 2 are mutually iso- morphic but not equivalent, since they are conjugate only by means of an torientation- reversing affine transformation. In this case, they are cal\ed enantiomorphous to each other, or the pair is called an enantiomorphic pair (Table 1 below). For a given dimension n, there is only a finite number of equivalence classes of space groups [1]. We also conclude from this that point groups have finitely many equivalence classes up to conjugacy in the orthogonal group O( V) or in GL(n, Z). B. Crystal Classes, Bravais Types Let T be an n-dimensional lattice in n- dimensional Euclidean space V, and K a finite subgroup of the orthogonal group O(V). De- note by a pair (T, K) a tfaithfullinear trepre- sentation of K on T, i.e., a tmonomorphism of groups K -->Aut(T). A space group G deter- mines a pair (T, GjT). Conversely, for any pair (T, K) the fol\owing holds: "Any group texten- sion of K over the tkernel T is isomorphic to a space group" ( the algebraic characteriza- tions in Sections A and C). Thus to each pair (T, K) there corresponds a set of space groups. 
341 Two pairs (TI' Kd and (Tz, Kz) are ca\1ed arithmetically equivalent (this is denoted by (TI' K 1)(Tz, Kz) if there exists an invertible linear transformation gE GL(V) such that Tz =gT" Kz ==gKlg- l . These pairs are called geometrically equivalent or simply equivalent (this is denoted by (T I , Kd -(T z , K 2 )) ifthere exists a gE GL(V) such that Kz =gKlg- l . The relation (-) is an equivalence relation, and equivalence classes are called arithmetic (geo- metric) crystal classes. The set of geometric crystal classes is in one-to-one correspondence with the set of equivalence classes of point groups up to conjugacy, and is sometimes called the set of crystal classes. The geometric crystal class of (T, K) is usually denoted by K. Now let T be a lattice. Then the group of all orthogonal transformations that leave T in- variant is called the Bravais group of T and is denoted by B(T), i.e., B(T)={gEO(V)lgT= T}. The group B(T) is finite and determines a pair (T, B(T)). Two lattices TI and Tz are called arithmetically equivalent (this is denoted by TI  Tz) if(T I , B(Td) and (Tz,B(Tz)) are arithmetically equivalent. An equivalence class is called a Bravais type, and an arithmetic crystal class determined by (T, B(T)) is called a Bravais class. For each Bravais type, a repre- sentative is said to be its Bravais lattice (Fig. 3 below). Define an torder relation on the set of lat- tices belonging to an arithmetic crystal class as follows. When (TI' Kd(Tz, Kz), define Tz.s; TI if there exists agE G L( V) such that Tz = 9 T I , Kz =gKlg- I , and B(T 2 )s;gB(Tdg- l . Note that   T 2 =- TI .s; T 2 and T 2 .s; TI' In each arith- metic crystal class, there is a pair (T, K) with a tminimal T with respect to this order relation. In this case the Bravais type of T is referred to as the Bravais type of the class of (T, K). This lattice T is, intuitively, of the most general type appearing in the class. Now define a relation  - f T 2 if there are pairs (7;, K;), i = 1, 2, such that (TI' Kd-(T 2 , K 2 ) and 7; is minimal in the class of (7;, K;). The equivalence class of the equivalence relation generated by the relation - f is called a crystal family. Summarizing, we have defined the following. Let Y, $, 'fJ, ffJ, reg; be the set of equivalence classes of space groups, arithmetic crystal classes, geometric crystal classes, Bravais types, and crystal families, respectively. Then we have the relations shown in Fig. 1. (All arrows in Fig. 1 are surjective.) reg; is nothing but the tcoproduct of'fJ and ffJ over $, reg;  'fJ IL ffJ. Y--'$---'!J ! 1 ffJ ---. reg; yt c- 'fJ l/! rey--.reg; Fig. 1 Fig. 2 92C Crystallographic Groups Moreover, there are injective mappings ffJc."i7I and $ c. Y such that the composites ffJ c.,$ --> ffJ and ,i7I c.,Y -->$ are identities. The first is the mapping sending a Bravais type T to the arithmetic crystal class defined by (T, B( T)). The second one sends a class of (T, K) to a tsemidirect product of T and K ( 190 Groups N). A space group belonging to a class of the image of $ c.Y is said to be symmorphic or symmorphous. A class which belongs to the image yt of the composite ffJ c. $ --> 'fJ is called a holohedral or holosymmetric class, or a holo- hedry. Define a mapping M of'fJ to the power set (ffJ) by M = (ffJ+-$)' ($ -->'fJ) -I. For two elements C I and C 2 of 'fJ, define C I - S C 2 if M(Cd=M(C z ). Each equivalence class is ca\1ed a crystal system, and the set of crystal systems is denoted by re.5l'. Two elements C and C' of'fJ belong to the same crystal family if and only if there exists a sequence of ele- ments of 'fJ, C I , C 2 , ... , C k such that C I = C and Ck=C' and M(C;)nM(C i +d#0 for i= 1, ... , k - 1. Therefore there is a surjective map- ping re Y -->re g; (Fig. 2). When n.s; 4, the com- posite yt c.'fJ-->re Y is bijective. It should be remarked that when n  3, there is no map- ping ffJ-->rey such that ,i7I-->'fJ-->rey and $--> ffJ -->re Y coincide. The numbers of elements of these sets are shown in Table 1. Table 1 n = I 2 4 'l/:F, crystal families ({/Y; crystal systems :!IJ, Bravais types 'fi, point groups (isomorphic classes)" ,rr/, arithmetic crystal classes if, space groups 4 6 23 4 5 7 14 33 64 2 (2) 10 (9) 32 (18) 227 (118) 2 13 73 710 2 17 219' 4783' "When n '" 4, the number of elements of.ff coincides with that of 'l/!/'. "The number in parentheses denotes the number of equiva- lence classes under algebraic isomorphism. c,d The number of equivalence classes under orientation- preserving transformations is 230 and 4895, respectively C. Construction of Space Groups Take an element (T, K) of an arithmetic crystal class. Since K is finite and Tis tfinitely gen- erated, the tcohomology groups Hi(K, T) are finite groups ( 200 Homological Algebra G). Let a be a tsecond co cycle representing an element [aJ of H2(K, T). Let (T, K), be the textension of K with kernel T correspond- ing to [IXJ ( 190 Groups N). Then the set of elements of (T, K), is given by {<t, k) I 
92D Crystallographic Groups t E T, k E K}, and the product is defined by <t" k 1 ) <t 2 , k 2 ) = <t 1 + k 1 t 2 + rx(k 1 , k 2 ), k 1 k 2 ). Since Hi(K, V)=O, i 1, there is a tfirst co- chain fJE C 1 (K, V) such that rx(k 1 , k 2 ) = fJ(k 1 ) + k 1 fJ(k 2 )-fJ(k,k 2 ), and the tfirst cocycle If on VjT defined by fJ induces an isomorphism HI(K, VjT):::::c.H 2 (K, T), [,B]f->[rx]. Define an action of <t, k) on V by <t,k)(x)=kx+t+fJ(k), XEV. Then (7; K), acts on Vas a space group. Con- cerning their equivalence, this result holds: 'The equivalence classes of (T, K)" [rxJ E H2(K, T) are in one-to-one correspondence with the orbits of N(K), the normalizer of K in Aut(T) acting on H2(K, T)" [4]. The action of hEN(K) is given by hrx(k) =hrx(h- 1 kh), rxE C 2 (K, T). For a pair (T, K) the weight lattice or the weight group T* of (T, K) is defined by {v E V I V-kVET, VkEK}. Let R be an irreducible re- duced troot system, Q(R) its lattice, and W(R) its tWeyl group. Then for a pair (Q(R), W(R)), Q(R)* is nothing but the usual weight lattice P(R). ( 248 Lie Algebras). Suppose that K of (T, K) does not contain the central inversion - In: V f-> - V, V E V. Denote the group K U ( - In)K by ::!: K. Then there is an isomorphism Hl(::!:K, VjT):::::c.[H1(K, VjT)J2 x (T* jT)j2( T* jT). Here [. ]2 denotes the subgroup of elements of order not greater than 2 [5]. This isomor- phism provides powerful means for the con- struction of space groups. In constructing a pair (T, K), it is useful to take a basis of T in a special form. Let {tj, t 2 , ..., tn} be a basis of a lattice T, and set aij = (t i , t), where (,) is the inner product in V. The n x n matrix A = (aij) is tsymmetric and tposi- tive definite, and the numbers aij are called the lattice constants of T. A basis {t l' .. . ,t n } is called a reduced basis if the tquadratic form (Ax, x) = L aijxix j is reduced ( 348 Qua- dratic Forms). A lattice admits at least one re- duced basis, and in this case::!: 2a ij .s; aii.s; a jj , i<j [6, Table 5.1; 7]. The totality of an n- dimensional reduced basis forms in a nat- ural manner a semialgebraic set of dimension (n + 2)(n -1 )j2 in R(n+l)n/2. Its closure, say Qn, is compact. On the other hand, the set Ln = (R+ x O(Rn))\GL(n, R)jGL(n, Z) is identi- fied with the set of equivalence classes of n- dimensional lattices under the relation of "same shape." There is a natural inclusion L 1 U... U Ln <=+Qn, and Ln is dense in Qn. More- over, the inclusion gives a bijective homeo- morphism when n.s;4 [8]. A finite subgroup K of the orthogonal group O(V) is said to be fully transitive if there 342 is a set S = {e l' ... , e,} that spans V on which K acts ttransitively and if K has no tinvariant subspace in V. In this case one can choose S as either of the following: (i) the primitive hyper- cubic type, S = {e l , ..., en}, (ei, e) =D ij (tKro- necker delta); (ii) the primitive hyperbolic type, S={lj, ...In+'}' (/;,/;)=1, i= 1, ...,n+ 1, (/;,fj)= -ljn, i,j = 1, ..., n + 1, i # j (especially, L7l /; =0) [10]. D. Color Symmetry Groups, Twinning A color symmetry group, or a colored symme- try group, associated with a space group G is a pair (G, G') such that G' is a subgroup of G with finite index. The index r = [G: G'J is called the number of colors. We call (G, G') a white group when r = 1, a black and white group or a magnetic group when r = 2, and a polychroma- tic group when r  3. G' is also a space group with the lattice group T' = G' nT, where Tis the lattice group of G. Two color symmetry groups (G p G;) and (G 2 , G;) are called equiva- lent if there exists an taffine transformation f such that G 2 = fGd-1 and G; = fG;r 1 . Then (G 1 , G;) and (G 2 , G;) are equivalent if and only if there is an isomorphism 0': G 1 -->G 2 such that O'(G;) = G;. A color symmetry group (G, G') is called lattice equivalent if T' = T, K' c K, and is called class equivalent if T' c T, K' = K. Take a pair (G, G'), and let G" be the tin verse image of K' under the canonical tepimorphism G-->K. Then (G, G") is lattice equivalent and (G", G') is class equivalent. Fix the color number r. Then (i) the equivalence classes of lattice equivalent color symmetry groups are in one-to-one correspondence with the conjugacy classes of subgroups K 1 of index r, and (ii) the number of equivalence classes of class equivalent color symmetry groups is finite. When r is prime to the order of K, the classes are in one-to-one correspondence with K -invariant sublattices T, of index r in T. Therefore, when the number of colors is given, the set of equivalence classes of n-dimensional color symmetry groups is fini te. In case (i), a pair (K, KIJ is called a color point group, and in case (ii), (T, Td is called a color lattice. In particular, when r = 2, they are called a black and white point group and a black and white lattice, respectively. Their equivalence is defined in a similar way. Take a color lattice (T, T l ) of color number r. Let m be the least natural number such that mT c Tj, then m is a divisor of r. Let m = p1 ... pr', Pi # Pj (i # j) be the decomposition of m into its prime factors. Then there is a sequence of K-invariant sublattices of T, 7;, i = 2, ..., k -1, such that (i) T 1 c T 2 ... c 1k C 1k+l =T; (ii) Pi7;+1 C 7;, i= 1, ...,k; (iii) the 
343 submodules T;/pF'T;+1 of i = 1, ..., k, zn @ Z/p7'Z are uniquely determined independently of the choice of such a sequence {T;}. Thus the existence of color lattices is related to the tmodular representation of K ( 362 Repre- sentations G). For example, if r is a prime number, then the representation of K on T/rT  zn @ Z/rZ is the reduction modulo r of that of K on T. A mathematical treatment of twinning is given as follows. Let V be an n-dimensional Euclidean space and Tan n-dimensionallat- tice in V. The volume of T, vol(T), is defined by vol(T) =(Idet A 1)1/2, where A = (a ij ) is the matrix of lattice constants, This definition is independent of the choice of a basis of T. If T' is a sublattice of index r in T, vol(T')=r' vol(T) follows. Let K be a finite subgroup of O(V), TI and Tz K-invariant lattices, and U a hyperplane in V. Then the quadruple (T" Tz, K, U) is called a twinning structure if it satisfies the following: (i) vol(Td = vol(T 2 ); (ii) the lattice T; is minimal in the arithmetic crystal class of (T;, K), i = 1, 2; and (iii) 7J. n U = T 2 n U = To, and To is an (n -1 )-dimensional lattice in U. Two arithmetic crystal classes are said to be twinable if there exist represent a - tives (TI' K), (T z , K), and a hyperplane U such that (T l , T 2 , K, U) is a twinning structure. Let (Tl' K) and (T 2 , K) be two arithmetic crystal classes. If there exists a K -invariant hyper- surface U such that To = Tl n U = Tz n U is an (n-I)-dimensionallattice in U, then two classes (TI' K) and (1;, K) are twinable. E. Three- and Two-Dimensional Crystallography Three-dimensional crystal classes are listed in Appendix B, Table 5.IV. To name these crystal classes, both Schoenflies' notation and the international notation are used. Fix an torthogonal tright-handed system (x, y, z) in V. For each crystal system, define three axes as follows, The first axis is the z- axis. The second is the line x = y = z for the cubic system and the x-axis for the others. The third is the y-axis for orthorhombic, trigonal, and hexagonal systems, and the line x = y, z = 0 for the others. Then a crystal class is expressed by its generators with respect to these axes. The symbol k (k = 1,2,3,4,6) denotes the rota- tion by the angle 2n/k around the axis; k is a composition of the rotation k and the central inversion (Vf-> -v, VE V), and k/m (k=2,4, 6) means the composition of k and the reflection about the hyperplane perpendicular to the above axis. The symbol m denotes 2. Usually, 1 is omitted. This is the full international nota- 92E Crystallographic Groups tion and a set of generators can be read from it. The short international notation is also used. Schoenflies' notation consists of the letters Cn' Dn' Sn' T, 0, and the subscripts h, v, d. The letters denote the group and order: C n is the tcyclic group of order n; Dn is the dihedral group of order 2n; T is the ttetrahedral group isomorphic to the fourth talternating group; 0 is the toctahedral group isomorphic to the fourth tsymmetric group; and Sz == t, S4 = 4, S6 =3. The subscripts h, v, d mean that the group is generated by reflections in the hyperplane which is horizontal, vertical, or perpendicular to a diagonal, respectively. The orientation is appropriately given. For example, in C nh and Dnh there is an nth order rotation around the z-axis and the horizontal plane is the xy-plane. The above notations are also applied to noncrystaIlographic finite subgroups of the 3- dimensional orthogonal group [13]. The symbol for Bravais type is specified by the name of its crystal system and one of the letters P, A, B, C, F, I. Six kinds of simple Bravais lattices are given as foIlows. A simple lattice is generated by {a, b, c} and satisfies the foIlowing condition. Denote the lengths of vectors by a= Iiall, b= Ilbll, c= Ilcll, and let ex= L(b,c), fJ = L(c, a), y= L(a, b) be the angles between the indicated pairs of vectors. For triclinic P, a#b#c#a, ex #fJ #y # ex; for mono- clinic P, a#b#c#a, ex=y=90° # fJ; for or- thorhombic P, a#b#c#a, ex=fJ=y=900; for tetragonal P, a = b # c, ex = fJ = y = 90°; for hexagonal P or trigonal P, a = b # c, ex = fJ = 90°, y= 120°; for cubic P, a=b=c, ex= fJ=y=90°. Let T be one of the above lattices generated by {a, b, c}. Then the lattices generated by {a, b, (b + c)/2}, {(c + a)/2, b, c}, {a, (a + b)/2, c} {(a + b+c)/2,b,c}, {(a+b)/2, (b+c)/2, (c+a)/2} are called A, B, C, I, F lattices determined by T. The original lattice T is called the primitive lattice, The trigonal R lattice is generated by {a', b', c'} satisfying a' =b' = c', ex' = fJ' = y' < 120°, #90°. Set a=a' -b', b=b' -c', c=a' + b' +c'. Then the hexagonal P lattice gener- ated by {a, b, c} is defined to be the primitive lattice of the trigonal R lattice. The fourteen Bravais lattices are iIlustrated in Fig. 3. An arithmetic crystal class (T, K) is denoted by the symbol of the Bravais lattice T and the symbol of K with respect to the action of K on T. Let the primitive lattice of T be generated by {a, b, c}. Put the vector a on the x-axis, and b on the xy-plane. The action of K is then determined by the international notation for K, except when K =42m, 32, 3m, 3m, 6m2, and T is the primitive or I lattice. In these cases two kinds of actions interchanging the second and the third symbols are not equiva- lent. If such is the case, 1 is inserted into the third position for trigonal systems. Let (K) be 
92 F Crystallographic Groups the symbol determined as above and (T) be the symbol of T. Then the action (T, K) is expressed by «T), (K)). For example, when K = 32, there are three arithmetic crystal classes (P, 321), (P, 312) (R, 32).  p Triclinic 4=?1 tf¥l  1G17 p c Monoclinic A;(1 lb1 pel Orthorhombic t]WJ F ' 1 '"-''' I 1 I I I C I I I I . -4 . a a P Hexagonal, Trigonal P I Tetragonal R Trigonal    a -:.;i a a / \ p I Cubic F Fig. 3 The conventional unit cell for each of the 14 Bravais 1a ttices. The notation of space groups determined by (T, K) is given by writing both the symbol (T), and the symbols of actions of elements on V which appear in the symbol (K) defined above. The symbol (T)(K) denotes the tsemidirect product of T and K by the action (T, K), and is the symmorphic space group determined by (T, K). Other nonsymmorphic groups are designated by replacing symbols in (K). If k is replaced by k j (j= 1, "', k-l), then k j is a k- fold screw glide with pitchj/k, that is, the com- posite of the rotation k and the translation jf/k, where f is the vector of minimum length of T along the axis of k. If m is replaced by a, IJ, c, n, d, then it is a glide reflection, that is, the composite of the reflection m and the translation in the reflecting hyperplane with the direction of a, b, c, a face diagonal and a diamond, respectively. See [6J for the pre- cise meaning of the notation discussed in this paragraph. The notations for 2-dimensional point groups and space groups can be adopted from the ones for 3-dimensional ones by setting the reference plane as the xy-plane and the z-axis perpendicular to that plane. In this setting, the notation of a plane point group is given by the 344 corresponding point group in 3-space, and similarly for plane space groups. There is only one glide operation, denoted by g. The motion 9 is the composite of the line reflection about an axis and half the translation along the axis with the vector of minimum length of T. In [13J, the precise group-theoretic description of space groups can be found. It should be noted that the notations for the two groups p3m1 and p31 m are frequently interchanged in the literature (including earlier editions of [13J). F. History The first mathematical study of the structure of crystals was done by the mineralogist J. F. C. Hessel (1830). He enumerated the finite subgroups of the 3-dimensional orthogonal group. Afterward, his result was rediscovered many times. A. Bravais (1850) showed that in 3-dimensional Euclidean space there are only 14 different lattices. All space groups consist- ing of orientation-preserving transformations were determined by C. Jordan (1868, 1869) and L. Sohnke (1879). Finally, almost all the space groups were determined independently by E. S. von Fedorov (1885, 1889) and A. Schoenflies (1887,1889). Probably as a result of com- parison of each other's lists, they established in 1891 the existence of the 230 space groups. Then A, Barlow (1894, 1896) derived these space groups by adding reflection operations to Sohnke's 65 groups. In 1900 in his eighteenth problem D. Hilbert raised the question of whether the number of equivalence classes of space groups of a given dimension is finite. This was answered affirmatively by L. Bieberbach (1910 [IJ). An algorithm for dctcrmining space groups was given by H. Zassenhaus (1948 [4J). Their results, including [2,3], and work by C. Her- mann (1949 [9], 1952 [tOJ) gave a solid foundation for n-dimensional crystallography. Then followed a concrete treatment of 4- dimensional crystallography. A. C. Hurley (1951), using an earlier work of M. E. Goursat (1889) on finite subgroups of the 4-dimensional special orthogonal group, determined 221 of the 4-dimensional point groups. In the 1960s A. L. Mackay and G. S. Pawley (1963) and others gave 56 of the 4-dimensional Bravais types. After E. C. Dade (1965) determined 9 maximal finite subgroups of GL(4, Z), R. Bulow (1967) and H. Brown, J. Neubuser, and H. Zassenhaus (1968) independently deter- mined 710 arithmetic crystal classes. They reconfirmed the result with H. Wondratschek, and as a result, 64 Bravais types and 227 geometric crystal classes wcrc cstablishcd (1971 [15J). They ran Brown's computer 
345 program for the determination of space groups based on [4J and obtained 4,783 space groups. Their work culminated in a book (1978 [16J). There have been several attempts to establish a unified treatment of n-dimensional crystal- lography [5,8,14,17] and crystallography in complex Euclidean spaces. Two-dimensional space groups were em- pirically known in ancient times, as demon- strated by artistic decorations, All the 17 2-dimensional space groups appear in the tile patterns of the Alhambra in Granada. A discrete subgroup of the plane motion group whose translation subgroup is only rank one is called a frieze group. The notion of black and white groups was introduced by H. Heesch (1930), H. J. Woods (1935), and A. V. Shubnikov (1951). Only with the introduction of the use of neutron diffrac- tion techniques did it become apparent that these groups could be used in the description of magnetically ordered structures [12]. Color symmetry groups were defined by B. L. van der Waerden and J. J. Bruckhardt (1961 [11J) for an arbitrary number of colors. A mathe- matical treatment of the twinning structure is found in papers by T. Ito (1938) and R. Sadanaga (1959). References [IJ L. Bieberbach, Dber die Bewegungsgrup- pen der Euklidischen Raume, Nachr. Akad. Wiss. G6ttingen (1910),75-84; Math. Ann. 70 (1910),297-336 and 72 (1912), 400-412. [2J L. Bieberbach, Dber die Minkowskische Reduktion der positiven quadratischen For- men und die endlichen Gruppen linear ganz- zahliger Substitutionen, Nachr. Akad. Wiss. G6ttingen (1912), 207-216, [3J H. Zassenhaus, Neuer Beweis der Endlich- keit der Klassenzahl bei unimidularer Aquiva- lenz endlicher ganzzahliger Substitutions- gruppen, Abh. Math. Sem Univ. Hamburg, 12 (1938),278-288. [4J H. Zassenhaus, Dber einen Algorithmus zur Bestimmung der Raumgruppen, Comment. Math. Helv., 21 (1948), 117-141. [5] G. Maxwell (and N. Broderick), The cry- stallography of Coxeter groups, J. Algebra, 35 (1975),159-177 and 44 (1977),290-318. [6J N, F, M, Henry and K. Lonsdale, Interna- tional tables for X-ray crystallography, vo\. 1, Symmetry groups, Kynoch, Birmingham, 1952, 1965,1969. [7J B. L. van der Waerden, Die Reduktions- theorie der positiven quadratischen Formen, Acta Math., 96 (1956),265-293. [8J R. L. E. Schwarzenberger, Classification of crystal lattices, Proc. Cambridge Philos. Soc., 72 (1972), 325-349. 93A Curves [9] C. Hermann, Kristallographie in Raumen beliebiger Dimensionzahl I, Die Symmetrie- operationen, Acta Cryst., 2 (1949),139-145. [lOJ C. Hermann, Translationsgruppen in n- Dimensionen, Zur Struktur und Materie der Festkorper, Springer, 1952. [11] B. L. van der Waerden and J. J. Burck- hardt, Farbgruppen, Z. Krist., 115 (1961), 231-234. [12] A. V. Shubnikov and N, V. Belov, Col- ored symmetry, Pergamon, 1964. [13] H. S. M. Coxeter and W, O. Moser, Gen- erators and relations for discrete groups, Springer, 1957, 1964, 1972, 1979. [14J E. Ascher and A. Janner, Algebraic as- pects of crystallography, Helv. Phys. Acta, 38 (1965),551-571, and Comm, Math. Phys., 11 (1968), 138-167. [15J R. Bulow, J. Neubuser, and H. Wondrat- schek, On crystallography in higher dimen- sions, Acta Cryst., A27 (1971),517-535. [16J H. Brown, R. Bulow, J. Neubuser, H. Wondratschek, and H. Zassenhaus, Crystallo- graphic groups of 4-dimensional space, Wiley, 1978. [17] R. L. E. Schwarzenberger, N-dimensional crystallography, Research notes in math., Pitman, 1980. [18] A. Kelly and G. W. Groves, Crystal- lography and crystal defects, Longman, 1970. [19J M. J. Buerger, Introduction to crystal geometry, McGraw-Hili, 1971. [20J G. J. Bradley and A. P. Cracknell, The mathematical theory of symmetry in solids, Clarendon Press, 1972. [21J C. W. Curtis and I. Reiner, Representa- tion theory of finite groups and associative algebras, Wiley, 1962, 93 (VI.20) Curves A. Introduction In the beginning of his Elements, Euclid gave definitions such as: A line is a length having no width; an end of a line is a point. However, he left notions such as width and length unde- fined. Thus his definitions were far from satis- factory. Actually, it was only during the latter half of the 19th century that efforts were made to obtain exact definitions of lines and curves. Euclid, among others, distinguished two kinds of curves: straight lines and curves. Nowadays, however, lines in the sense of Euclid are called curves, and a straight line is considered a curve. A first effort to give an exact definition 
938 Curves of a curve using analytic methods was made by C. Jordan in his Cours d'analyse I (1893). B. Jordan Arcs and Jordan Curves Following Jordan, we define a continuous plane curve C to be the image of a tcontinuous mapping sending the interval [0, IJ into the Euclidean plane E 2 . Namely, C is the set of points (x, y) in E 2 such that x= f(t), y=g(t), O.s;t.s; 1, with continuous functions f, 9 defined on [0, 1 J. A continuous curve is also called a continuous arc We call (/(O),g(O)) and (/(1), g(1)) the ends of the arc. Given continuous functions f, 9 defined on (0, 1), the set {(x, y) I x = f(t), y = g(t), 0 < t < I} is called an open arc, More generally, the image of a continuous mapping of [0, 1], (0, 1), [0, 1), or (0, 1 J is called an arc (or curve). Suppose that C is an arc that is the image of an interval I and P = (x, y) is. a point on C to which there correspond two elements t" t 2 (t 1 <t2) of I such that P is the image of both t" t 2 . In this case, the point Pis called a multiple point on C. An arc having no multiple point is called a simple arc or Jordan arc. An arc with one and only one multiple point P=(/(O),g(O))=(/(I),g(1)) is called a Jordan curve or simple closed curve ( Section K). A Jordan curve can be regarded as a topological image in a plane of a circle. Let C be a curve that is the image <p(I) of an interval. Then C is said to be of class C k (analytic) if the mapping <p is of tclass C k (tanalytic). In general, if S is a topological space, then the image <p(I) in S of an interval is called a curve in S. In particular, if S has the structure of a differentiable (ana- lytic) manifold, we can define the notion of curve of class C k (analytic curve) in S. C. Ordinary Curves A tconnected subset of E 2 that is the union of a finite number of simple arcs meeting at a finite number of points is called an ordinary curve. An ordinary curve is called a tree if it does not contain a subset that is homeomor- phic to a Jordan curve. Let p be a point on an ordinary curve C. The tboundary of a suffi- ciently small tneighborhood of p meets C at a finite number of points, and this number is independent of the choice of the small neighborhood. We call it the order of pin C. A point of order 1 is calIed an endpoint of C, a point of order 2 an ordinary point, and a point of order;) 3 a branch point. If we can represent an ordinary curve C as a continuous curve tracing each simple arc of C just once, 346 we say that Cis unicursal (Fig. 1). A necessary and sufficient condition for C to be unicursal is that the number of points of odd orders in C be less than or equal to 2 (L. Euler). @$ Fig. I D. Further Consideration of Definitions Although the set of ordinary curves as defined above contains most familiar curves, it does not contain a point set defined by y=sin l/x in O<x.s; 1 and -1.s;x <0, and by -1 .s;y.s; 1 at x == 0 (Fig. 2). This point set is called a sinusoid, and it is desirable to obtain a definition of curves wide enough to contain a sinusoid. On the other hand, the notion of continuous curves is, in a sense, too wide, because a curve such as a Peano curve ( Section J), which covers a whose square, is among such curves. The notion of simple arcs is too narrow, because even a circle is not a simple arc. As a point set in E 2 , a continuous arc is character- ized as a tlocally connected tcontinuum and is sometimes called a Pea no continuum (H. Hahn, S. Mazurkiewicz). On the other hand, A. Schoenflies. inspired by the statement of the Jordan curve theorem ( Section K), consid- ered a closed set that divides the plane into two parts, forming the common boundary of both domains, and called it a closed curve. According to this definition, however, a simple curve is not a closed curve. Thus as a general definition of curves it is not appropriate. * Fig. 2 To give a general notion of curves on a plane (containing sinusoids), we may define a curve as a continuum that is tnowhere dense in E 2 (i.e., a continuum that is a boundary of open sets on the plane) (0. Cantor). Further- more, to deal with the curves on a topological space, P. S. Uryson and K. Menger defined a general curve to be a I-dimensional continuum (Menger, 1921  1922 [lJ). In £2, the latter notion coincides with the notion of curves defined by Cantor, while in E 3 a general curve 
347 is a continuum that does not divide any tdomain ( 79 Connectedness). E. Universal Curve Consider a 3-dimensional cube f3 (I = [0, IJ). Draw two planes paralIel to each face so that the two planes paralIel to a face trisect the edges of the cube meeting the planes. Thus 1 3 is divided into 3 3 cubes. Let M I be the closure of the subset of f3 that is obtained from f3 by deleting the cube If (II =[1/3,2/3J) and the 6 cu bes having common faces with If. Then M I consists of 20 cubes (Fig. 3). We apply to each of the 20 cubes forming M 1 the same operation that we applied to 1 3 and denote by M 2 the union of point sets thus obtained (consisting of 20 2 cubes, the length of whose edges equals 1/32). Repeating this process, we obtain a point set M n consisting of 20 n cubes, the length of whose edges equals 1/3 n . Thus we obtain the sequence M I =:> M 2 =:>M 3 =:> .... The set U = n1 M n is a general curve in the sense of Uryson and Menger. Moreover, we can prove that an arbitrary general curve is homeomor- phic to a subset of U. Hence we calI U the universal curve. Fig. 3 F. Length of a Curve In this section, by a curve we mean a continu- ous curve in a Euclidean space En. Let C be a curve in En defined by Xi = /;(t) (i = 1,2, ..., n; a.s; t .s;b; the /; are real-valued continuous functions defined in [a, b J ). (We sometimes write this simply as X = f(t), where X = (XI' ...,x n ),) We divide [a,bJ arbitrarily and denote the dividing points by a = to < t I < t 2 < ... <t,==b. Let Xk=f(t k ), k=O, ...,r, and let X k - I X k be the length of the straight line seg- mentjoining X k - I and X k . If the length I = L_I X k - I X k of the broken line (XOXI ... X,) (Fig. 4) is bounded for any subdivision of [a, b J, C is calIed a rectifiable curve, and the upper limit of I with respect to the subdivisions is called the length of C. For C to be rectifi- able, it is necessary and sufficient that the /; (i = 1,2, ", , n) be of tbounded variation (Jordan). Thus if C is a rectifiable curve, then each /;(t) (i = 1,2, ". , n) is almost everywhere differentiable (H. Lebesgue), In particular, if C 93G Curves is of class C I , then C is rectifiable, and its length can be represented by f b ( n ( d/; ) 2 ) 1/2 S = L - dt a i=1 dt ( 246 Length and Area). Fig. 4 G. Shapes of Curves In this section by a curve we mean the image <p(I) in a Euclidean space En of an interval I (bounded or unbounded), where <p is a continu- ous mapping, When a curve C of class C k is given in a Euclidean space En, it often becomes necessary to examine the shape of C globalIy. The determination of the global shape of C from the equation of the curve is calIed the curve tracing of C. The problem has been thoroughly studied in the particular case in which n = 2 and the equation of C is given by f(x, y) =0 in a rectangular coordinate system (by F(r, 0) = 0 in a tpolar coordinate system), where f (or F) is an analytic function. The problems in the case of a rectangular coordi- nate system are as follows: Let <p be a single-valued analytic function and I a (bounded or unbounded) interval on the x-axis. If a curve Co represented by Y = <p(x) (x E 1) is a subset of C, then Co is calIed a branch of C. According as I is bounded or unbounded, Co is said to be a finite branch or infinite branch of C. When a curve represented by x = ljJ(y) (YEJ) (1jJ is a single-valued analytic function and J is an interval on the y-axis) is a su bset of C, it is also calIed a branch of C. If it is necessary to distinguish these two branches, we calI the former the x-branch and the latter the y-branch. C consists of an at most de- numerable set of branches, If P(Xo,Yo)EC and fy=af/ay #0 at P, then there exists an x- branch containing P; if fx = af/ax #0, there exists a y-branch of C. If af/ax = 0, af/ay =0 at P, then P is calIed a singular point of C. Points on C that are not singular points are calIed ordinary points of C. When P is an ordinary point of C, a branch Co of C containing P is determined, and the tangent line and normal line to C at Pare the same as those to Co and are uniquely determined. The equations of these are (x- xo)fAxo,Yo)+(Y- Yo)J;,(x o , Yo) ==0 and (x- xo)J;,(xo, Yo) -(y - Yo)fAxo, Yo) =0, respec- 
93H Curves tively. If we choose a coordinate system with P as origin and the tangent line and normal line as (-axis and 'I-axis, respectively, then the equation of C with respect to this coordinate system is of the form '1 =C22 + C3C +'" in the neighborhood of P. If we denote by p the tcurvature of Cat P, then p==2c 2 = -(/xJ/- 2fxJJy +.fyyf/)/(/} + f/)3/2. When C 2 =p= 0, P is a tstationary point. A stationary point of a curve of class C 2 on C is also called a point of inflection, When P is not a point of inflection and «(, '1) are points on C in a neigh- borhood V of P, the sign of '1 is definite if V is small enough (Fig. 5). However, if P is a point of inflection and C3 #0, then C is of the shape shown in Fig. 6. At a point of inflection, if C3 = ... =Cv-I =0, C, #0, and v is even, C is of the shape shown in Fig. 5, and if v is odd, C is of the shape shown in Fig. 6. \ ITI J, I T1 ) l' ;rf Fig. 5 Fig. 6 In a neighborhood of a singular point, C takes various shapes. For example, consider a curve represented by y2=x 2 (x+a), and let P be the origin (0,0). If a > 0, then there are two branches of C passing through P, and they have different tangents at P (Fig. 7). As in this case, if there are a finite number of different branches passing through P with different tangents, P is called a node of C. If a < 0, then PE C, but there is no other point of C in the neighborhood of P (Fig. 8). Such a point is called an isolated point of C. If a=O, then there are two branches of C starting from P, and the tangents to these at P are the same (Fig, 9). Such a point is called a cusp of C. When C is Fig. 7 Fig. 8 Fig. 9 348 an talgebraic curve, we can examine the shape of a curve in a neighborhood of a singular point using the tPuiseux series. When C has an infinite branch Co (for example, when Co:y=<p(x) (XEI) is an x- branch of C and I = [a, 00 )), if the tangent to Co at P(x o , Yo) (x o E I) has a limiting line for Xo --> 00, then the limiting line I is called an asymptote of Co' In this case, the distance from a point P(x o , Yo) of Co to I converges to zero whcn Xo--> 00. An asymptote of an infinite branch of C is also called an asymptote of C. H. Special Plane Curves The following are the well-known curves (for ellipses, parabolas, and hyperbolas  78 Conic Sections). Among curves of the third order, those having an equation of the form y2 = f(x)/(x - a) (1) (/(x) is a rational expression of at most the third order in x) are symmetric with respect to the x-axis and have x = a as an asymptote. In particular, if a>O, f(x) = _x 3 in (1), then the curve is as shown in Fig. 10 and the origin is a cusp. Let a half-line starting from the origin meet the curve, the circle with diameter [0, a], and the straight line x == a at points X, Y, and A, respectively. Then we have OX = Y A. This curve is called a cissoid of Diodes. If a=O, f(x)=c 2 (c-x) (c>O) in (1), then the curve takes the shape shown in Fig. 11. Let A, C be the points whose coordinates are (a, 0), (c,O) (O<a<c), respectively, and let X, Y be the points in the first quadrant at which the straight line parallel to the y-axis and passing through A meets the curve and the circle with diameter OC, respectively (Fig. 11). Then we have AX: A Y = OC: OA. This curve is called a witch of Agnesi. c Fig. 10 Fig. 11 If a < 0, f(x) = - x 2 (x/3 + a) in (1), then the curve takes the shape shown in Fig. 12a. If we rotate it by 71:/4 and put it in the position shown in Fig, 12b, then the equation of the curve takes the form x 3 + y3 = 3cxy (c = - fi 
349 a). If we take as parameter t = y/x, then we get the parametric representation x = 3ct/(1 + t 3 ), Y = 3ct 2 /( 1 + t 3 ). This curve is called a folium cartesii (or folium of Descartes). A curve that has a parametric representation of the form x = <p(t), y = ljJ(t), where <p, IjJ are rational func- tions, is called an (algebraic) unicursal curve (or rational curve). Such a curve is an algebraic curve of tgenus 0, Fig. 12 Let r== fdO), r = f2(0), "', r = fk(O) be equa- tions of curves C I , C 2 , ... , C k with respect to a polar coordinate system with origin 0, A curve C having equation r=AddO)+... + Adk(O) (the Ai are constants, usually +1 or -1) in the same coordinate system is called a cissoidal curve with respect to O. (Fig, 13: r= - fd O ) + f2(0»). In Fig. 10, let C I be the circumference of the circle with the diameter [0, aJ, let C 2 be the straight line x =a, and put ..1.1 = -1, ..1. 2 = 1. Then we have a cissoid of Diocles. We can regard the folium cartesii as a cissoidal curve obtained from a straight line and an ellipse. When C I is a circle with center at 0, we call C a conchoidal curve of C 2 with respect to O. In particular, when C 2 is a straight line and 0 is not on C 2 , the conchoidal curve is called a conchoid of Nicomedes. Fig. 13 As shown in Fig. 14, when C 2 is perpendic- ular to the initial line of the polar coordinate system, the equation of the conchoid is r = asecO::f: b (b is the radius of C I ), and the Car- tesian equation of the curve is (x - a)2(x 2 + y2)=b 2 x 2 . According as a>b, a=b, or a<b, the curve has a node, cusp, or isolated point, respectively. When C 2 is a circle and 0 is on C 2 , the conchoidal curve of C 2 with respect to o is called a limaon (or limaon of Pascal) (Fig, 15), The equation of a lima<;on C with respect to a polar coordinate system having 93H Curves the diameter of a circle passing through 0 as its initial line is r=acosO::f:b, while the equa- tion of C with respect to a Cartesian coordi- nate system is (x 2 + y2 -ax)2 = b 2 (x 2 + y2). In this case, if a > b, 0 is a node of the curve; if a=b, 0 is a cusp. When a=b, the curve is called a cardioid (the curve shown by a dashed line in Fig. 15; see also Fig. 26 below). c[ Fig, 14 Fig. 15 The locus of a point X having a constant product of its distances from two fixed points A, B is called Cassini's oval (Fig. 16). The equation of this curve with respect to the Cartesian coordinate system whose origin 0 is the midpoint of the segment AB and whose x-axis is the straight line AB is (x 2 + y2f- 2a 2 (x 2 - y2) =k 4 _a 4 (where AB=2a, k 2 = AX, BX). In particular, if a 2 = e, then 0 is a nodal point of the curve. In this case, the curve (shown by the dashed line in Fig. 16) is called a lemniscate (or Bernoulli's lemniscate) (Jakob Bernoulli). x Fig. 16 The locus of the foot of the perpendicular drawn from a fixed point 0 to the tangent of a fixed curve C at each point of the curve is called the pedal curve of C with respect to 0, The pedal curve of a rectangular hyperbola with respect to its center is a lemniscate (Fig. 17), and the pedal curve of a circle with respect to a point is a Iima<;on. 
93H Curves Fig. 17 When a curve C' rolls on a fixed curve C without slipping and is always tangent to C, the locus r of a point X kept fixed with re- spect to the curve C' is called a roulette whose base is C, rolling curve is C', and pole is X. In particular, when C is a straight line, C' is a circle, and X is on C', r is called a cycloid (Fig. 18). When X is not on C', r is called a trochoid (Fig. 19). A trochoid is represented parametri- cally by the equations x=ae-bsine, y=a- bcos e, where the parameter () is the angle of rotation of C'. When a = b, the equation repre- sents a cycloid. The tevolute and tinvolute of a cycloid are also cycloids (Fig. 20).  r e' X f} A e x Fig. 18 y x Fig. 19 yv Fig. 20 Suppose that we are in a gravitational field with a given path represented as a cycloid, as is shown in Fig. 21. Assuming that there is no friction, the time necessary for a particle to slide down the path from a point X on the curve to the lowest point C of the curve is independent of the initial position X (c. Huy- gens). Because of this property, the cycloid is also called a tautochrone. Suppose that a par- ticle starts from a point A in the space and slides down to a lower point B along a curve r (without friction) under the effect of a gravita- tional force. To minimize the elapsed time, we simply take r as a cycloid that lies in the vertical plane containing AB and has a hori- zontalline through A as the base (Fig. 22). Because of this property, the cycloid is called 350 also a brachistochrone, i,e., the line of swiftest descent (Johann Bernoulli and others).  e Fig. 21  Fig. 22 When the base curve C and the rolling curve C' are both circles and X is on C', we call r an epicycloid if C, C' are externally tangent (Fig, 23), and a hypocycloid if C, C' are internally tangent (Fig. 24). When X is not on C', cor- responding to these two cases, we have an epitrochoid and a hypo trochoid, respectively. Let a, b be radii of C, C', respectively, c the distance from the center of C' to X, and e the angle ofrotation of C', Then the para- metric equations of these curves are x =(a::!: b)cos e + ccos«a::!: b)/b)(), y =(a::!: b) sine- c sin( (a::!: b)/b)O. (Take the upper signs when the curve is an epicycloid and the lower signs when the curve is a hypotrochoid. When b=c the equations are equations of epicycloids and hypocycloids.) When the ratio a:b is a rational number p/q (p, q are mutually prime), then C' returns to its initial position after rotating q times around C; in this case each r becomes Fig. 23 A Fig. 24 
351 an algebraic curve. In particular, when a = 4b = 4c, the hypocycloid is called an astroid (Fig, 25). Its equation (with respect to a Car- tesian coordinate system) is X 2 / 3 + y2/3 = a 2 / 3 . The envelope of line segments of length a whose endpoints are on the x-axis and y-axis, respectively, is an astroid (Fig, 25). When a = b=c an epicycloid is a cardioid (Fig. 26). Fig. 25 Fig. 26 When the base C is a straight line, the roIl- ing curve C' is an ellipse or a hyperbola, and the pole is a focus of C', then the roulette is called a Delaunay curve (Fig. 27). c'  Fig. 27 When C is a straight line, C' is a parabola, and X is the focus of C', the roulette is caned a catenary (Fig. 28). When we hold two ends of a string of homogeneous density in the gravita- tional field, the string takes the form of this curve. The equation of the catenary with re- spect to a Cartesian coordinate system is y = a cosh x/a == a(e x / a + e -x/a)/2. The involute starting at the point A (0, a) of this curve is called a tractrix (Fig. 29). Let Q be the point of intersection of the tangent at P to the tractrix and the x-axis; then the length of PQ is con- stant and is equal to a. Consequent1y, when we drag a weight at A by a string of length OA along the x-axis, the curve described by the weight is the tractrix. The parametric equa- tions of the tractrix are x=a(logtant/2+ cost), y=asint. Fig. 28 Fig. 29 Suppose that a point Q moves with constant velocity on the x-axis and another point P also moves with constant velocity always toward Q. 93H Curves The locus of the point P is called a curve of pursuit (Fig. 30). When the velocity of Q is ex times that of P, the equation of the curve of pursuit is 2(x -a)= y'-'/c(1 - ex)-cy1+'/(1 + ex) if ex # 1 and 2(x -a) =(l/c)logy- cy2/2 if ex = 1. We can consider similar problems when Q moves on a general curve instead of on the x- axis. y A x Fig. 30 Many plane curves that are called spirals can be expressed by r= f(O) (/ monotonic) in polar coordinates (r,O). An Archimedes spiral is a curve having the equation r=aO (Fig, 31). Archimedes found that the area bounded by two straight lines 0=0" 0=0 2 (0, <0 2 ) and the curve is a 2 Wi -0)/6. A logarithmic spiral (equiangular spiral or Bernoulli spiral) is a curve having the equation r = ke a8 (Fig. 32). The angle between thc straight line 0 = con- stant and the tangent to the curve is constant. Johann Bernoulli found that the involute and evolute of this curve are congruent to the orig- inal curve. A curve having the equation r= a/O is caned a hyperbolic spiral (or reciprocal spiral), and the one having the equation r 2 0=a is called a Iituus. These two spirals are shown in Figs. 33 and 34, respectively, Let p = <p(s) be the tnatural equation of a curve. Properties of the curves for which the functions <p(s) are simple havc been investigated. Specifically, a curve having the natural equation p == ks (k is a constant) is a logarithmic spiral. A curve hav- ing the equation p = a 2 /s 'is caned a Cornu spiral (or cIothoid;  167 Functions of Con- Fig. 31 Fig. 32 C  Fig. 33 Fig. 34 
93 I Curves fluent Type). Its parametric representation is x = aj;. r' cos nt 2 dt, Jo 2 y = aj;. f' sin nt 2 dt Jo 2 (tFresne1 integral). M. A. Cornu used this curve in the representation of diffraction in physical optics, There are also curves that appear as graphs oftelementary functions. For example, a curve having the equation y = sin x is called the sine curve, and the graphs of equations y = eX and y = log x are called the exponential curve and the logarithmic curve, respectively, although they are congruent. In contrast to algebraic curves, these analytic curves that are not algebraic are called transcendental curves. (Regarding the differential geometric properties of plane and spaces curves  111 Differential Geometry of Curves and Surfaces; for plane algebraic curves  9 Algebraic Curves,) I. Envelopes Let f(s, t) be a function of class C I of real vari- ables s, t, If we fix t = to' then r = f(s, to) is the equation of a curve C'o with a parameter s. If s(t) is a function of t, then f(s(to), to) represents a point P'o on C'o' Let E be the locus of P'o when to moves, If s(t) is a function of class CI, then E is a curve of class C 1. If E and C'o are always tangent at each point P'o' we call E the envelope of the family of curves C,. When f(s, t) is given, to find E we need only determine the function s(t). We note that af/as=Aaf/at is a condition that must be satisfied by the func- tion s(t). When n = 2 and the equation of C'o is given in the form f(x, y, to) = 0, the point of intersection of C'o and J,(x, y, to)=O (J, = af/at) is P'o' The equation R(x,y)=O obtained by eliminating t from f(x, y, t) = 0 and J,(x, y, t) = 0 is called the discriminant of f(x, y, t) = O. The set of points (x, y) satisfying the discriminant R = 0 is the union of E and the locus of the singular points of C'o' J. Peano Curve A tcontinuous curve in the Euclidean plane E 2 (i.e., the image f(I) of a segment 1= [0,1] under a continuous mapping f: 1--> E2) may cover a square. We caIl such a curve a Peano curve after G. Peano, who gave the first exam- ple. D. Hilbert simplified the example and constructed a Peano curve as follows (Math. Ann., 36 (1890), 38 (1891)). Divide a square and a segment into four 352 equal parts (Fig. 35) and let each square D i correspond to the segment 7; (i = 0, 1,2, 3). Then divide each D i into four equal parts (Fig. 36) and let D ij correspond to 7;j (j = 0, 1,2,3), and continue this process (Fig, 37). A sequence of squares D, =:> Dij=:> DUk =:> ... has the unique common point Pijk. .' and we let this point correspond to the unique common point t ijk of the sequence of segments 7; =:> 7;j=:> 7;jk =:>.,.. The correspondence t ijk .. -->Pijk , is a continu- ous mapping of the segment [0, 1] onto the square D, and this continuous curve has double points, triple points, and quadruple points. The set of multiple points has the car- dinal number of the continuum and is a tdense set in the square. We can improve this method so that there are no multiple points other than double and triple points, but it is impossible to eliminate triple points alto get her. B C D j D, Do lJJ A D Tu T j T 2 T3 Fig. 35 I." I Fig. 36 Fig. 37 We can also construct Peano curves as shown in Figs. 38, 39, and 40. That is, we "bisect" a triangle and a segment [0,1] suc- cessively and build a correspondence as fol- lows: Let a point represented by a binary number t = O.ijk... (i,j, k, ... = 0,1) correspond to the unique common point of the sequence 
353 Ai =:> Aij =:> AUk::J ... and obtain a continuous mapping from the segment [0, 1] onto the triangle. b 0.0 I 0.1 I 1.0 I To T, Fig. 38 0.01 0.11 I I T()() TOI TIO Tll Fig. 39  0.011 I I I I I I I I I I TOIl Fig. 40 K. The Jordan Curve Theorem The Jordan curve theorem states: A Jordan curve J in the plane R 2 separates R 2 into inner and outer tregions (c. Jordan, Cours d'analyse, 2nd ed., 1893). More precisely, R 2 -J is the disjoint union G j U G 2 of two regions G l and G 2 whose common tboundary is J. Let p be a point of J. Then there is a Jordan arc with p as an endpoint such that all points of the Jordan arc are contained in G i (i = 1 or 2) except for p (A. Schonflies), that is, J is accessible from G i . Conversely, let J be a compact subset ofR 2 such that R 2 -J = G I U G 2 and G 1 n G 2 = 0, where the G i are regions such that J is acces- sible from both G j and G 2 . Then J is a Jordan curve (Schonflies, 1908). A homeomorphism between a Jordan curve J and the circle C extends to a homeomorphism (more precisely to a tconformal mapping) between a plane containing J and a plane containing the circle C ( 65 Combinatorial Manifolds G; 77 Con- formal Mappings). References [1] K. Menger, Kurventheorie, Teubner, 1932 (Chelsea, 1967). 94B Curvilinear Integrals and Surface Integrals [2J R. L. Wilder, Topology of manifolds, Amer. Math. Soc. Colloq. Publ., 1949. [3J G. T. Whyburn, Analytic topology, Amer. Math. Soc. Colloq. Publ., 1942. [4J G. Loria, Spezielle algebraische und trans- zendente ebene Kurven. Theorie und Ge- schichte I, II, Teubner, 1910, 1911. [5] F. G. Teixeira, Traite des courbes speciales remarquables planes et gauches I-III, Co'imbre,1908-1915. [6] P. S. Alexandrov and H. Hopf, Topologie I, Springer, 1935. [7] S. Lefschetz, Introduction to topology, Princeton Univ. Press, 1949. [8] M. H. A. Newman, Elements of the topol- ogy of plane sets of points, Cambridge Univ. Press, second edition, 1951. [9J M. Greenberg, Lectures on algebraic topol- ogy, Benjamin, 1966. [IOJ E. E. Moise, Geometric topology in di- mensions 2 and 3, Springer, 1977. 94 (X.11) Curvilinear Integrals and Surface Integrals A. General Remarks The integral of a function (or more precisely, a tdifferential form) along a tcurve (tsurface) is called a curvilinear integral (surface integral). Because a curvilinear integral is a special case of the Stieltjes integral, we shall first explain this notion, formulated by T. J. StieItjes (1894) as a generalization of the tRiemann integral. The notion was introduced in connection with StieItjes's study of tcontinued fractions, and led to the idea of integrals with respect to general measures. B. The Riemann-Stieltjes Integral Suppose that f(x), ex(x) are real-valued bounded functions defined on [a, b]. Take a partition of the interval a = Xo < X I < X 2 <... <x n - I <xn=b ( 216 Integral Calculus) and consider the Riemann sum with respect to ex(x): n-I L f(i)(ex(Xi+d -rx(x i )), iE [Xi, Xi+l]. i=O Suppose that the Riemann sum tends to a fixed number as max(x i +1 -x;) tends to zero. Then the limit is called the Riemann-Stieltjes integral (or simply Stieltjes integral) of f(x) with respect to ex(x) and is denoted by 
94C Curvilinear Integrals and Surface Integrals Jif(x)dex(x). The Riemann integral of f(x) is a special case, where ex (x) = x. The Riemann-Stieltjes integral has the ele- mentary properties, such as linearity, of the usual Riemann integral. We also have the following theorem; The integral Jf(x)dex(x) exists for every continuous funtion f(x) if and only if ex(x) is of tbounded variation, Hence when we consider the Stieltjes integral of f(x) with respect to ex(x), we usually assume that f(x) is continuous and ex(x) is of bounded variation. However, the Stieltjes integral can be defined if f(x) is of bounded variation (not necessarily continuous) and ex(x) is continu- ous (not necessarily of bounded variation). If a sequence f,,(n) (n= 1,2, ...) of uniformly bounded continuous functions defined on the interval [a, bJ converges to a continuous func- tionf(x) on the interval [a,bJ, we have ! r f,,(x)dex(x) = r f(x)dex(x), where ex(x) is a function of bounded variation. Furthermore, if ex(x) and exn(x) (n = 1, 2,...) are functions of bounded variation whose ttotal variations are uniformly bounded and limnoo exn(x) = ex(x) at every point of continuity of ex(x), then we have ! r f(x)dexn(x) = r f(x)dex(x) for every continuous function f(x) on [a, bJ (HelIy's theorem). Let ex(x) be a tstrictly monotone increasing continuous function, and let [3(y) be its inverse function. Then we have f b f '(b) f(x)dex(x) = f([3(y))dy, a Il(a} where the right-hand side is the usual Rie- mann integral. A function ex(x) of bounded variation is represented as the difference of two strictly monotone increasing functions ex l (x) and ex 2 (x). If we denote by [3i(y) the inverse function of exi(x) (i = 1,2), we have f b i 'l(b) f(x)dex(x) = f([3dy)) dy a Il t (iX) i '2(b) - f([32(y))dy. a: 2 (a) It ex'(x) exists and is continuous, we have r f(x)dex(x) = r f(x)ex'(x)dx, C. The Lebesgue-Stieltjes Integral Suppose that ex(x) is a monotone increasing and right continuous function and I = (x" x 2 ]. 354 We define an tinter val function U(I)=ex(X2) ex(xd. It is nonnegative and countably addi- tive. Hence by utilizing U(I) we can con- struct the outer measure and also a completely additive measure ( 270 Measure Theory; 380 Set Functions). The Lebesgue integral with respect to this measure is called the Lebesgue- Stieltjes integral (or Lebesgue-Radon integral) and is denoted by Jif(x)dex(x). If ex (x) is a strictly monotone increasing continuous func- tion and [3(y) its inverse function, then formula (1) is true if the left-hand side is a Lebesgue- Stieltjes integral and the right-hand side is a Lebesgue integral. If ex(x) is a function of bounded variation, decomposing ex(x) into the difference of two strictly monotone increas- ing functions, we also have formula (2). If ex(x) is tabsolutely continuous, formula (3) is valid, where the right-hand side is a Lebesgue integral. The Stieltjes integral has the following two properties. Integration by parts: In the interval [a, b J, we have r UdV+ r VdU=U(b)V(b)-U(a)V(a) if one of U(x), V(x) is continuous and the other is of bounded variation. Second mean value theorem: If U(x) is mono- tone increasing and V(x) is continuous, then there exists a  in [a,bJ such that r UdV = U(a)(V@- V(a» + U(b)(V(b)- V(». (1) D. The Curvilinear Integral A continuous mapping from an interval a.s; t.s; bin R I into R n : <p(t) =(<PI (t), ..., <Pn(t» is an oriented curve. Suppose that a function f(x I' ." ,x n ) is defined in a neighborhood U of the image C of the mapping <p(t) or merely on C. The Stie1tjes integral r f(<PI(t),.." <Pn(t))d<Pi(t), i= 1,... ,n, (4) (2) is called the curvilinear integral of the function f(x I' .,. , x n ) along the curve C with respect to Xi and is denoted by Jcfdx i . The curve Cis ca\1ed the contour (or path) of the integration, <p(a) is called the initial point (or lower end), and <p(b) is called the terminal point (or upper end) of the integration. Let C be a rectifiable curve, and denote by s(t) the arc length of C from the initial point to the point <p(t). Then the Stieltjes integral r f(<p(t»ds(t) (3) 
355 or simply Icfds is calIed the curvilinear inte- gral with respect to the line element. Here, th e line element, denote d by ds, means J (<p;(t))2+... +(<p(tnzdt when <p(t) is of class C I . If the integrand in (4) is of bounded vari- ation as a function of t, the curvilinear integral is welI defined. If C is a trectifiable curve, the curvilinear integral is defined for an arbitrary continuous function. In the usual case, we are concerned mainly with this sort of situation. F or a differential form w = fl dx I + ... + f,. dXn defined on U, the curvilinear integral Sew is defined by L7=1 Ic/;dx i . The curvilinear integral is linear with re- spect to its integrand, If the terminal point of C I is the initial point of C 2 , we can construct the joint curve C = C I + C 2 , and we have additivity for the contours r fdx i = r fdx i + r fdxi Jc JC t JC 2 (a similar formula holds if we replace dX i by ds). Monotonicity, which asserts that Sefdxi.s; Ie 9 dX i whenever f.s; g, holds if <Pi(t) is mono- tone increasing, and monotonicity also holds for the curvilinear integral with respect to the line element. If n = 2, R 2 can be identified with the tcom- plex plane C={z=x+iy}, and we define Icf(z)dz by {t u(x)dx- t V(z)d Y } +i{L v(z)dx+ L U(Z)d Y } , where f(z) = u(z) + iv(z). The integral is then said to be an integral in the complex domain. (F or the application of integrals in the com- plex domain to complex analysis  198 Holomorphic Functions.) E. The Surface Integral By an m-dimensional smooth surface S we mean the image S of a tregular mapping of class C I from a domain Gin Rm into R n (m < n), (u) = (I (u l , ..., u m ), ..., n(u",,,, u m ), Given a continuous function f(x l , .,., x n ) defined in a neighborhood U of S in R n , the multiple integral J... t f(du),..., n(u» D(x i ,''',X i ) d x 1 m ul.,.du m , D(u"..., U m ) {i"...,i m }c{I,.."n}, (5) is calIed the surface integral of f along S with respect to Xi" "', x im and is denoted by 94F Curvilinear Integrals and Surface Integrals Isf dx i 1 '" dXi m or I... Isf dx i 1 ...dXim' This definition depends on the choice of the para- meters (u I , ..., u m ) in the folIowing way. Let (U'I"'" u) be another parametrization of S. Since D(X i1 , ...,x i ) _ D(X i1 , .., ,Xi) D(U'I, ...,u) D(u l , ..., U m ) D(u'I' ..., U) D(u l' "', U m ) and , , I D(U'l,...,U) 1 dU I ... dUm = dU I ... dUm, D(UI""'U m ) thus according as D(u l ,..., U m ) , , zO, D(ul"..,u m ) the foregoing expression remains the same or changes its sign. UsualIy we assign to the parameters (u I , .., , u m ) positive or negative orientation, in which case S is calIed oriented. If we replace the Jacobian D(X i1 ,..., x i )/ D(u l , ''', u m ) in (5) by the quantity ( (D( ) ) 2 ) 1/2 2: Xi' ...,X i i1<...<i m D(u:,...,uj , which corresponds to the surface element of S, the integral is calIed a surface integral with respect to the surface element and is denoted by IsfdS or IsfdlT. The surface integral of a differential form of degree m in Rn is similarly defined. In the case m = 1, the surface integral reduces to a curvilinear integral. Just as the Stieltjes integral is a generalization of the curvilinear integral, there are several ways to generalize the notion of surface integral without assuming that the mapping (u) is of class C I . F. The Stokes Formula Let S be an m-dimensional smooth surface in R n (m.s; n) and as be the (m -1)-dimensional surface corresponding to the boundary of S. Let w be a differential form of class CI of degree (m - 1) and dw be its texterior deriva- tive. Then we have Iasw= Isdw, which is calIed the Stokes formula (or the Green-Stokes for- mula). (For the Stokes formula on a general differentiable manifold  105 Differentiable Manifolds,) As special cases of the Stokes formula, we have the folIowing three classical theorems: (1) The case of a plane domain: Let D be a bounded domain on the xy-p1ane bounded by a finite number of smooth curves C with posi- tive directions. If w = P dx + Q dy is a differen- tial form of class C I on 15, we have r P dx + Q dy = If ( aQ - 8P ) dXd Y , Jc D ax ay (6) 
94 Ref. Curvilinear Integrals and Surface Integrals since dw=(iJQ/iJx- iJPjiJy)dx /\ dy. This is called Green's formula (or Green's formula on the plane). Equality (6) is true if P, Q are totally differentiable and the integrand on the right- hand side is continuous (even if the functions iJQ/iJx, iJP/iJy themselves are not continuous) (E. Goursat). This formula remains true under the following weaker assumptions: (i) C is rectifiable; (ii) P and Q are continuous in 15 and iJP/iJy and iJQ/iJx are continuous and summable (in the sense of Lebesgue) in D. (2) The case of a domain in a 3-dimensional space: Let D be a bounded domain in xyz- space surrounded by a finite number of smooth surfaces S. For a tvector field V = <P, Q, R) of class CIon 15, we put OJ = P dy /\ dz+ Qdz /\ dx+ Rdx /\ dy_ Then since dw = (iJPjiJx + iJQ/iJy+ iJRjiJz)dx /\ dy /\ dz, we have f I Pdydz+Qdzdx+Rdxdy = f f t divVdxdydz = ff r ( P + iJQ + iJ R ) dXdYdZ. Jv ox iJy iJz Equality (7) is called the Gauss formula (Ost- rogradskii's formula or the divergence theo- rem). The left-hand side of (7) is equal to the surface integral f fs(V, n) dO", which is the tvec- tor flux through S (where n means the outer unit normal vector of the surfacc S). This formula remains true under the following weaker assumptions: (i) S is piecewise of class Cl; (ii) P, Q, R are continuous in 15 and iJP/iJx, iJQjiJy, and iJR/fJz are continuous and sum- mable in D. (3) The case of a bordered surface in a 3- dimensional space: Let S: x=x(u, v), y= y(u, v), z=x(u, v) «u, V)EG) be a smooth surface in xyz-space, and suppose that the boundary r of the domain of the parameters G consists of a finite number of smooth curves with positive direction. The boundary C of the surface S is the image curve of r. Now let V = <P, Q, R) be a vector field of class CIon S, n be the unit normal vector of S (its direction being canoni- cally assigned by the parameter (u, v), and t be the unit tangent vector, and set w=Pdx+Qdy +Rdz. Then since dw=(oR/iJy-iJQjoz)dy/\ dz +(iJPjiJz-iJR/iJx)dz /\dx + (iJQ/iJx- iJPjiJy)' dx /\dy, we have f L (rotV,n)du= f f/ (  - ; )dYdZ + ( iJP _ iJ R ) dZdx+ ( iJ Q _ iJ P ) dXd Y ) az ax ax iJy = L (Pdx+Qdy+Rdz)= L (V,t)ds. (8) 356 Equation (8) is called the Stokes formula ( Appendix A, Table 3.III). References (7) [IJ H. Lebesgue, Le<;ons sur I'integration et la recherche des fonctions primitives, Gauthier- Villars, second edition. 1928, ch. II. [2J S. Saks, Theory of the integral, Warsaw, 1937. [3] D, V. Widder, The Laplace transform, Princeton Univ Press, 1941,ch_ T [4] H. Flanders, Differential forms, with ap- plications to the physical sciences, Academic Press, 1963. [5J H. K. Nickerson, D. C. Spencer, and N. E. Steenrod, Advanced calculus, Van Nostrand, 1959. [6J H. Okamura, On the surface integral and Gauss-Green's theorem, Mem. ColI. Sci. Univ. Kyoto, 26 (1950),5-14. [7] L. Schwartz, Cours d'analyse, Hermann, 1967. [8] H. Cartan, Formes differentielles, Her- mann, 1967. 95 (XVI.4) Cybernetics The term cybernetics was invented in 1947 by Norbert Wiener [1] to denote a field of science that treats the system of control and communi- cation in animals and machine. The term was derived from a Greek word Kv[3F.pVr'1r;, also the source of the word "governor." The stimu- lus for establishing such a new area of science came from studies of automatic computation, automatic control, and information process- ing. These fields had given rise to technological innovations such as the high-speed electronic computer and automatic control instruments. Such machines have had a profound influence On the information sciences, as weII as On theoretical investigations in biology. However, even now, we can hardly say that cybernetics has been formly established as a systematic branch of science or of applied mathematics. Nevertheless, it has had far-reaching influence On both biology and machine engineering as a methodology and as a philosophy. In the Soviet Union and in European countrics, the word corresponding to cybernetics is still widely used for the "Grenzgebiete" between biology and machine engineering in the wider sense. Nonetheless, it is difficult to say that cybernetics as a whole has undergone system- atic development. Systematic theories have 
357 been established and developed for separate parts of the field, and these have become in- dependent disciplines; they are, for example, tcontrol theory, tinformation theory, the theory of artificial intelligence, the theory of tautomata, mathematical biology, genetics, and ecology, The contributions of Wiener to cybernetics include not only the invention of the concept itself, but also prediction theory or the theory of the Wiener filter [2]. The latter was sub- sequently formalized by R, E. Kalman from a different point of view as an estimation problem within a linear system, working from finite observation data, and including the non- stationary case ( 405 Stochastic Control and Stochastic Filtering G). Another contribution by Wiener is the input-output identification of a nonlinear system using statistical time series analysis of the outputs when the inputs are white noises [4]. This is achieved by expan- sion of the output function in terms of the convolution of input functions, which corre- sponds to the expansion of functions in terms of tHermite interpolation polynomials. The kernel of the expansion is called a Wiener kernel. Recently, this method has been applied to input-output identification for nervous systems; it also finds application in nonlinear system theory [5]. Wiener also studied brain waves [6,9]. Cybernetics in the wider sense, although not necessarily called by this name, may include investigations of the following "Grenzgebiete": information processing of the nervous sys- tem [7]; self-organizing systems in the non- equilibrium thermodynamics of Prigogine and Haken; the self-reproducing machines of von Neumann; and the theory of pattern formation in biological or chemical systems [8]. F or related topics  176 Gaussian Process 1,395 Stationary Processes D. References [lJ N. Wiener, Cybernetics; or, control and communication in the animal and the ma- chine, Wiley, 1947; second edition, 1961. [2J N. Wiener, Extrapolation, interpolation and smoothing of stationary time series, with engineering applications, Wiley, 1949. [3J R. E. Kalman and R. S. Bucy, New results in linear filtering and prediction theory, Trans, ASME, J. Basic Eng., 83 (1961), 95-107. [4J N. Wiener, Nonlinear problems in random theory, MIT Press, 1958, [5J Y. W. Lee and S. Schetzen, Measurement of the Wiener kernels of a non-linear system by cross correlation, lnt. J. Control, 2 (1965), 237254. 95 Ref. Cybernetics [6] N. Wiener and A. Rosenblueth, The math- ematical formulation of the problem of con- duction of impulses in a network of connected excitable elements, specifically in cardiac mus- cle, Arch. Inst. Cardiology, Mexico, 16 (1946), 205-265. [7] M. A. Arbib, Metaphorical brain, Inter- science, 1972. [8J P. C. Fife, Mathematical aspects of react- ing and diffusing systems, Lecture notes in math. 28, Springer, 1979. [9] N. Wiener and J. P. Schade, Cybernetics of the nervous system, Progress in brain research XVII, Elsevier, 1965. [10] K. Steinbuch and S. W. Wagner, Neuere Ergebnisse der Kybernetik, R. Oldenbourg, 1964. [11 J S. Beer, Decision and control, Wiley, 1966. 
96A Data Processing 96 (XVI. 7) Data Processing A. General Remarks With the development of electronic computers, effective systems for data transmission and processing have been created on a large scale, and there is now a large literature concerning data-processing methods and techniques for such systems. Research on data processing encompasses both specific techniques and entire systems of processing, for example, the system of pro- gramming techniques, as well as mathematical problems originating from them, such as the complexity of computations ( 71 Complexity of Computations). As domains of application, we have, for example, information retrieval, stock management, program evaluation, and review techniques. We denote the set of data by D and assume that D is finite. Depending on the properties of D, we have various suit- able representations and processing problems. B. The Notion of Data In recent applications of computers, the main task has been arranging and searching for items or attributes in storage, rather than numerical computation. Individual informa- tion is called a record. Records arranged and stored in the memories of computers are usu- a1\y called data. A collection of such data is ca1\ed a file. A complex, large-scale data co1\ec- tion is often ca1\ed a data base. In an abstract sense, the record consists of a string of letters, but it is usua1\y convenient to view it as consisting of its identifying mark followed by a finite sequence of items. The contents can be classified according to their properties, such as topological relations, order relations, or items representing numerical values. For each case, there may be different suitable representations and operations, We often store information after a suitable process of information compression. In some cases the operations are reversible, and com- plete recovery is possible. In other cases the operations are not completely reversible, and we must throwaway part of the information in order to compress the rest. An example of recoverable information is the replacement of a run of l's (or O's) in a binary code by its length. An example of lost information often occurs in the graphical manipulation of num- bers by means of hashing, In this process we 360 select a suitable function s = f(a l , ..., an), where the vector (a l ,..., an) corresponds to the repre- sentation of the record in such a way that the value s can be monomorphic for the universe of the records as often as possible, and the data (ai, "', an) is kept in the storage corre- sponding to the value s. Genera1\y, the vari- able representing the location of data is called a pointer. In the abstract sense, data bases are simply sets of items, but in most cases a hase will also exhibit some mathematical structure, such as order relations. In order to handle a data base efficiently, we must take its particular structure into account, and suitably represent that struc- ture. Such structures and their representations are ca1\ed data structures. C. Linear Structures The component of a vector or the stations along a single railway line form linearly ordered sets. Their mOst essential feature is the notion of immediate predecessor or immediate successor. Such a data structure is ca1\ed a linear structure, and the aligned sequence is ca1\ed an array. Multidimensional arrays, such as the elements of a matrix, are stored in the form of I-dimensional arrays in the memory of a computer. When the set of data D is a linearly ordered set, it is usually represented as a suitable array. Here, the problems of ordering and table look- up are fundamental. Ordering in this sense means putting the given elements of D into the order defined for D. Since, historically, sort- ing machines were used to put punched cards in order, this process is also called sorting. The process of arranging several individually sorted data packs into one sorted pack is ca1\ed merging. A fundamental data-sorting operation is "comparison" with respect to the order for D. There have been many investigations of the estimation of the lower or upper bounds for the number of comparisons and of devel- oping efficient algorithms. Asymptotically, O(nlogn) is the theoretical lower bound for n elements, and some algorithms are known to achieve this bound ( 71 Complexity of Computations). If there is given a univalent correspon- dence f:D-->D' and the correspondence table is stored in the memory in a suitable form, the problem of table look-up arises, which requires finding f(d) for a given dE D. In this process, a fundamental operation is a comparison of dED with some XED in the table. There have been many investigations of efficient arrange- 
361 ments and of algorithms with minimal num- bers of comparisons. D. List Representation If the given finite set D has an order satisfying the reflexive and transitive laws, addressing is often used as a medium for representing the order in the memory. Let d l , ..., d n be all the immediate successors to an element dED. We can represent them by the following sequence of triplets: If n = 0, we take (d, 0, 0); Ifn= 1, we take (d,dj,O); If n = 2, we take (d, dj, di); Ifn=3, we take (d,dj,ej),(el,d!,dj); Ifn;"4, we take (d,dj,ej), (el,d!,e!), (e 2 , dj, ej), ... ,(e n - 2 , d_I' d), where d'f means the pointer to d i , ei is an ele- ment introduced for convenience, and e'f is the pointer to ei' This is called a list representa- tion. If we denote by d-->dJ, the fact that d l is an immediate successor of d, then the total set can be regarded as a tdirect graph. Usually, this graph is a ttree, and then the graph is called a tree structure and its representation a tree representation. The advantage of this representation is that addition or deletion is quite easy. The set of logical formulas is partially ordered, with the order given by the rules of inference. The set consisting of a series of inferences forms an ordered subset. Thus, if a tree representation can be automatically treated, so can the process of inference. In dealing with linguistic data (words or sentences, for example), it is often natural to consider a noncommutative tfree semigroup D generated by a finite number of generators (the alphabet or vocabulary). In this case, if there exists a natural order for the generators, it determines in D a lexicographic partial order- ing. Then the tree representation can be used for representing a dictionary whose entires are elements of D. This method is not efficient with respect to speed of table look-up and economy of memory, but it sometimes has the advan- tage of simplifying the treatment of compli- cated data. E. Memory Devices for Processing In dealing with algebraic formulas or lan- guages with parentheses, data maintenance methods such as tree representation or push- down storage are often convenient as auxiliary memory-controning methods. The character- 96F Data Processing istic of the push-down storage method is that it returns the data in reverse order with respect to the time of acceptance and remittance. It is sometimes caned a stack or a first-in-Iast- out memory. Contrary to a stack, there is a storage that returns the data in the same order as the acceptance. It is called a queue or a first-in-first-out memory. F. Information Retrieval A request to obtain all the records qualified by some property in a data base Q is caned a query. The part of the collection of records Q qualified for a query, therefore, can be identi- fied by its characteristic subset of the space of all possible records. A procedure that deter- mines the subset Qq of Q composed of an the records relevant to a query q is caned the information retrieval of the query. A system which provides such a procedure for every query qEQ is called an information retrieval system organized for Q with respect to Q. It is essential to design the system in such a way that records can be retrieved quickly for queries in a certain class. Such a selected set of queries can be composed of an the first-order queries specifying an item and asking for aU the records containing the attribute value or the key characterizing the item, It can, in some cases, include second- or higher-order queries specifying a number of items and asking for an the records in which attribute values or keys characterizing the item occur simultaneously. In an information retrieval system, the mas- ter file of a data base Q is usually organized by way of auxiliary memories on data structures such as sequential files, indexed sequential files, virtual storage files, or direct access files, using the magnitude of the accession number or primary key of each record. Various man- agement systems, called SAM (Sequential Access Method), ISAM (Index Sequential Access Method), VSAM (Virtual Storage Access Method), or DAM (Direct Access Method), exist. In addition to the organization of such a master file, various directory files or indexes are organized in order to retrieve each query qEQ quickly, because there are many relevant keys or combination of keys other than the primary key. An inverted filing scheme (IFS) is a typical scheme for organizing such indexes or directory files. A bucket B, or an address- able set of secondary memories is provided for each canonical query q,EQ in a one-to-one way. An index or a list of accession numbers of pertinent records is organized in each of the buckets contiguously so as to make it possible 
96 Ref. Data Processing to retrieve all accession numbers of relevant records quickly, The essentials of an IFS is to define a one-to-one QAT (Query to Ad- dress Transformation) from the inverted set of queries Q= {q,} to the set of buckets B= {B,}. An IFS has efficient retrieval performance with respect to the inverted canonical queries. The scheme, however, may in some cases require a large number of buckets. Moreover, the scheme requires quite redundant storage of accession numbers by a number of buc- kets because a record may be pertinent to a number of canonical queries simultaneously. Although higher retrieval performance can be expected by including higher-order queries in the inverted set, the space and machine time needed for such organization is prohibitive. This is one of the reasons why an IFS for first- order queries is preferable in almost all prac- tical cases. Another reason is that if the first- order queries are inverted, every retrieval can logically be performed by certain Boolean operations executed among a certain number of retrieved sets of accession numbers, A trade- off of space and time needed for the organiza- tion of a scheme and its retrieval performance might be the determining factor for the selec- tion of the set of canonical queries to be inverted. An attempt to overcome the limitations inherent in the inverted scheme can be found in the work on the balanced file organization scheme due to Abraham et al. [7]. By extract- ing the essentials, Yamamoto et al. [9,lOJ defined a BFS (Balanced File-organization Scheme) in a wider sense as follows: (i) Buckets are organized in such a manner that every bucket is associated with more than one query. (ii) Every canonical query is associated with a unique bucket. (iii) The accession number of a record with some additional information is stored in a bucket once if and only if it is pertinent to at least one of the associated queries. The essentials of a BFS is to define a many- to-one transformation from the set of canon- ical queries Q to the set of bucket addresses B, or an MQAT (Multiple Queries to Address Transformation). An MQAT defines a parti- tion ofQ into mutually disjoint subsets. It is a generalization of a QAT which defines an IFS. If c is the number of queries to be asso- ciated simultaneously with a unique bucket in a BFS, then the number of buckets to be prepared is l/c, which is a drastic reduction from that for an IFS. This reduction makes it possible to extend the feasible range of canonical queries, Actually, in the system HUNDRED (Hiroshima University New Documents REtrieval and Dissemination), a 362 51-to-1 MQAT is used for the organization of indexes. Although some space overload may occur by storing some additional information in order to tell which record is pertinent to which query in a bucket, reduction of redundancy can be expected, because a record may be pertinent to more than one query simulta- neously. The reduction in the number of buc- kets and redundancy would contribute much in saving time and space needed for the or- ganization of indexes. Among those BFS's defined by MQAT's with a given data base Q, Q, and the number c of queries associated with the same bucket, one BFS can be called the best if its redun- dancy is the least under some reasonable as- sumptions imposed on the distribution of keys. Several combinatorial problems have been raised and solved in this connection [10]. References [IJ A. T. Berztiss, Data structure-Theory and practice, Academic Press, 1971. [2J H. S. Stone, Introduction to computer organization and data structures, McGraw- HiIl, 1972. [3J A. K. Scidmore and B, L. Weinberg, Storage and seach properties of a tree- organized memory system, Comm. ACM, 6 (1963),28-31. [4J R. D, Luce, R. R, Bush, and E, Galanter, Handbook of mathematical psychology II, Wiley, 1963. [5J Y. Bar-Hillel, Language and information, Addison-Wesley, 1964. [6] J. A. Fodor and J. J. Katz, The structure of language: Readings in the philosophy of lan- guage, Prentice-HalI, 1964. [7J C. T. Abraham, S, P. Ghosh, and D. K. Ray-Chaudhuri, File organization schemes based on finite geometries, Information and Control, 12 (1968), 143-163, [8J S. P. Ghosh, Data base organization for data management, Academic Press, 1977, [9J S. Yamamoto, H. Ikeda, S. Shige-eda, K. Ushio, and N, Hamada, Design of a new bal- anced file organization scheme with the least redudance, Information and Control, 21 (1975), 7291. [lOJ S, Yamamoto, Design of optimal bal- anced filing schemes, Proc. IBM Working Conference on Database Engineering, in Data- base Engineering, Springer 1981, vol. II, 23- 35. [IIJ D. E. Knuth, The art of computer pro- gramming, Addison-Wesley, vol. 1, revised edition, 1976; vol. 2, revised edition, 1980; vol. 3, 1973. 
363 97 (1.10) Decision Problem Suppose that we are given a set S and a propo- sition P(x I, X 2 , ... ,x n ) for elements Xl of S. Then we have the problem of universal validity of P, which is the problem of finding a general algorithm (i.e., a finitary procedure) by which we can discern whether P(x I, ,., ,x n ) is true for all n-tuples (Xl"'" x n ). The problem of finding an algorithm by which we can discern the validity of P(x I, .,., x n ) for some specifically chosen n-tuples (x I' ... ,x n ) is called the prob- lem of satisfiability of P. These two problems are customarily called decision problems. The problems are such that affirmative solution of one of them implies negative solution of the other. To give a precise definition of decision prob- lems, let us note that a tfree semigroup with countable generators can be identified with a subset of the set N of natural numbers (by virtue oftG6del numbering;  185 G6del numbers). On the other hand, if 6 is a given tformal system with countably many symbols, the set of all tformulas in 6 is a subset of the free semigroup generated by the symbols in 6. Thus the set of all formulas in 6 is identified with a subset ofN. A subset M ofN (or N x N x ... x N) is (general) recursive if its tre- presenting function is general recursive ( 356 Recursive Functions). By using the concept of recursive function, a precise definition of the decision problem can be given as follows: The decision problem of M is solved affirmatively if and only if we can obtain effectively a proce- dure defining the representing function of M, and the function is general recursive. The decision problem of M is solved negatively if and only if we can obtain a proof that M is not recursive. For a set A of formulas in 6, we let g(A) be the set of all G6del numbers corresponding to the elements of A. Let A' be the set of formulas in A that are deducible in 6, and let r(A) = g(A'). The decision problem of the set A of formulas is said to be solved affirmatively (negatively) if the decision problem of r(A) is solved affirmatively (negatively). By refining this concept we arrive at the notion of the degree of (recursive) unsolvability. Let A and B be subsets ofN. The relation "A is recursive in Band B is recursive in A" ( 356 Recursive Functions) is reflexive, symmetric, and transi- tive. Hence this relation decomposes the class of all subsets in N into disjoint nonempty equivalence classes. A and B are defined to have the same degree of unsolvability if they belong to the same equivalence class. Thus the degrees of unsolvability can be identified with 97 Decision Problem the equivalence classes. The degree of recursive sets is O. The relation a.s; b is defined between the degrees a of A and b of B to mean "A is recur- sive in B." Clearly, for any degree a, we have O.s; a. The partially ordered system of degrees constitutes an tupper semilattice. Research on the decision problem has been done mostly in areas related to the tfirst-order predicate calculus L I and the formal systems on it. We now list some important results. (I) Results concerning L I. The decision problem has been solved negatively for the sets of formulas of the following forms. (Here it is assumed that no function symbols appear and that  represents a formula involving no occurrence of If, 3, or free variables.) (1) All formulas in L' (A. Church, A. M. Turning), (2) 3x 13x2 ... 3x m \fy I \fyz ... \fYn  (T. Skolem), (3) 3X13x23x3\fYl \fY2'" \fYn3z  (K. G6del), (4) 3x 1 3x 2 \fYl \fY2'" \fYn3z  (L. Kalmar), (5) 3x 13x2 \fy3z 13z2 ... 3z n  (J. Pepis), (6) \fx3y\fz3u I 3u 2 ... 3u n  (W. Ackermann), (7) 3x 13x2 3X3 \fy  or 3x I 3x 2 \fy3z  (J. Suninyi), (8) 3X\fYI\fY23z13z2  or \fx3y\fdu I 3u 2  (Suninyi). The decision problem has been solved affirmatively for the sets of formulas of the following forms, where it is assumed again that no function symbols appear and that  is as above. (1) AII formulas involving variables only on predicates with one argument. (L. L6wen- heim, Skolem, H. Behmann), (2) \fx I \fx2 .., \fx m  (P. Bernays, M. Sch6nfin- kel, Ackermann), (3) \fx I \fx 2 ... \fx m 3y 1 3Y2 ... 3Yn  (Bernays, Sch6nfinkel, Ackermann), (4) \fx I \fX2'" \fxm3YI3Yz\fZl \fz2'" \fz n  (G6del, Kalmar, K. Schutte). (II) Results concerning forma! systems on L 1. Throughout the rest of this article we assume that no tfunction variables appear. Predicate constants, function constants, and object constants may appear. By the decision problem for a formal system i! we mean the decision problem for all tclosed formulas in i!. Most of the results obtained so far concerning the decision problem for formal systems have been negative. Such results include those for formal systems formalizing natural number theory, the theory of rational integers, the elementary theory of tgroups, trings, tfields, t\attices, and the like, and axiomatic set theory (A. Tarski et al.). The word problem for groups was solved negatively by P. S. Novikov ( 161 Free Groups B). In connection with this deci- sion problem, there are some investigations 
97 Ref. Decision Problem by W. W. Boone, G. Higman, and others [14-17]. The decision problem for a formal system formalizing the elementary theory of t Abelian groups has been solved affirmatively (W. Szmielew). Little is known about the decision problem concerning partial systems offor- mulas of given formal systems except the fol- lowing: (I) the decision problem for the set of formulas of the form 'ix i 'ix 2 ... 'ix m  in a formal system ( 161 Free Groups); (2) the Hilbert-type problem, which is the decision problem for the set of formulas of the form 3x I 3x 2 .., 3x m (t = s) in a formal system. In particular, the Hilbert-type problem in a formal system formalizing natural number theory is called Hilbert's tenth problem ( 196 Hilbert). The latter is the problem of find- ing an algorithm for deciding whether a tDio- phantine equation has an integral solution. This decision problem was studied by M. Davis, H. Putnam, J. Robinson, and others, and finally Yu. V. Matiyasevich solved it neg- atively by showing that every recursively enumerable relation is Diophantine [9]. (A relation R(m l , ..., mJ is called Diophantine if there is a polynomial P(x l , ..., x j , YI, ..., Yk) with integer coefficients such that R(m l , ..., mJ holds if and only if P(m l , ..., m j , YI' ..., Yk) = 0 has a solution for Y I' ... , Yk in natural num- bers.) In addition, some investigations have been made about the tsecond-order predicate calculus U, tintuitionistic logic, etc. [1,2]. References [IJ W. Ackermann, Solvable cases of the deci- sion problem, North-HoIland, 1954. [2J S. C. Kleene, Introduction to metamath- ematics, Van Nostrand, 1952. [3J A. Tarski, Undecidable theories, North- Holland, 1953. [4J A. Church, An unsolvable problem of elementary number theory, Amer. J. Math., 58 (1936), 345-363. [5J S. C. Kleene and E. L. Post, The upper semi-lattice of degrees of recursive unsolva- bility, Ann. Math., (2) 59 (1954),379-407. [6J G. E. Sacks, Degrees of unsolvability, Ann. Math. Studies, Princeton Univ. Press, 1963. [7J M, Davis, H, Putnam, and J. Robinson, The decision problem for exponential dio- phantine equations, Ann. Math., (2) 74 (1961), 425-436. [8J M, Davis and H. Putnam, Reductions of Hilbert's tenth problem, J. Symbolic Logic, 23 (1958), 183-187. [9J Yu. V. Matiyasevich, Diophantine enumer- able sets (in Russian), Dok\. Akad. Nauk SSSR, 191 (1970),279-282. 364 [lOJ H. Hermes, Enumerability, decidability, computability, Springer, 1965. [l1J H. Rogers, Theory of recursive functions and effective computability, McGraw-Hili, 1967. [12J J. R. Shoenfield, Mathematical logic, Addison- Wesley, 1967. [13J P. S. Novikov, On the algorithmic un- solvability of the word problem in groups (in Russian), Trudy Mat. Inst. Steklov., 44 (1955). [14J G. Higman, Subgroups of finitely pre- sented groups, Proc. Roy. Soc. London, (A) 262 (1961), 455-475. [15J W. W. Boone, The word problem, Ann. Math., 77 (1963), 1632. [16J C. F. MilIer, On group-theoretic decision problems and their classification, Ann. Math. Studies, Princeton Univ, Press, 1971. [17J W. W. Boone, F. B. Cannonito, and R. C. Lyndon (eds.), Word problems, North- Holland, 1973. 98 (XXI.18) Dedekind, Julius Wilhelm Richard Julius Wilhelm Richard Dedekind (October 6, 1831-February 12, 1916) was born in the city of Braunschweig in central Germany and studied at the University of Gottingen under C. F, tGauss, who was then in his later years. He received his doctorate at Gottingen with a thesis on the tEuler integral. He was professor of mathematics from 1858 to 1862 at Zurich and from 1863 to 1894 at the Technische Hochschule in Braunschweig. During his early twenties, he wrote papers concerning analysis and the theory of probability, but in 1857 he began publishing papers on the theory of numbers, He edited tDirichlet's lectures on number theory (Vorlesungen iiber Zahlen- theorie, first edition 1863, fourth edition 1899) and concentrated on research in arithmetic and algebra. The theory of tideals, which he founded, was originally set out in a supple- ment (1863) to Dirichlet's Vorlesungen. Dedekind treated subjects ranging from the axiomatic foundations of the theory of ideals to t\attices and tgroups as algebraic systems. He was a pioneer of the abstract algebra of the 20th century, Among his notable achievements are the tDedekind zeta functions oftalgebraic number fields, tDedekind cuts in the theory of real numbers, the algebraic theory of talge- braic functions (of which he was a coauthor with H. Weber), and the theory of natural numbers. He was one of the first to support 
365 tCantor's set theory. His theory of natural numbers was founded on the concept of sets and included the idea of trecursive functions. References [IJ R. Dedekind, Gesammelte mathematische Werke I-III, Vieweg, 1930-1932 (Chelsea, 1969). [2] M. Cantor and R. Dedekind, Briefwechsel, Cantor-Dedekind, Actualites Sci. Ind., Her- mann, 1937. 99 (lX.4) Degree of Mapping A. Degree of Mapping Let M n and Nn be n-dimensional tclosed toriented tC' manifolds (or tcombinatorial manifolds). For example, M n = N n = sn (the tn- sphere). Their nth thomology groups with integral coefficients Hn(M"; Z) and Hn(N n ; Z) are infinite cyclic groups generated by the fundamental homology classes [MnJ and [NnJ, respectively ( 201 Homology Theory). A continuous mapping f: M n --> Nn induces a homomorphism f*: Hn(Mn; Z)-->Hn(Nn; Z), and there exists an integer d f such that f*([MnJ) = df[N n ]. This integer d f is called the degree of mapping (or the mapping degree) of f When M n = Nn, d f does not depend on the orienta- tion of M n . If a continuous mapping g: Mn --> Nn is thomotopic to f (/zg), then we have df=d g , If f is homotopic to a tconstant mapping (/zO), then df=O, while if f is a homeomor- phism, then d f = :t 1. When Mn = N n , a homeo- morphism f: Mn --> Mn is called an orientation- preserving mapping if d f = 1 and an orientation- reversing mapping if d f = -1. Suppose that M n and N n are closed oriented n-dimensional combinatorial manifolds and that f:Mn-->Nn is a tsimplicial mapping. Let Li 0";" and Ljsj (0";", sj are n-simplexes of M n , N n , respectively) represent [MnJ, [NnJ, and let Pj (resp. q) be the number of n-simplexes 0";" such that f(O";") is equal to Sjn (resp. - sn. Then Pj - qj is independent of the index j and equal to d f . Suppose that f, g:sn-->s. are continuous mappings (n  I). Then f  9 if and only if d f = d g (Brouwer mapping theorem), This implies that n.(sn)  Z ( 202 Homotopy Theory). 99 C Degree of Mapping B. Local Degree of Mapping Suppose that M" and N" are n-dimensional oriented C' (or combinatorial) manifolds and f: Mn...... N n is a continuous mapping. Suppose further that a point P of M n has a neighbor- hood V such that f(p) # f(q) for any point q contained in U - {p}. Then f induces a homo- morphism f*:Hn(U, U -{p})-->Hn(N, N- {f(p)}) of n-dimensional t\ocal homology groups with integral coefficients that are both isomorphic to Z. If u and v are generators of the groups Hn(U, V - {p}) and Hn(N, N- {I(p)} ) corresponding to orientations, re- spectively, then there exists an integer k such that f*(u) = kv. We call this integer k the local degree of mapping f at p. If Mn and N n are closed oriented C' manifolds (r  1) and f: Mn -->Nn is a C' mapping, then there exists a point r of Nn such that the set f-I(r) is a discrete subset {PI' ...,PI} of Mn, and each Pi has a neighborhood Vi satisfying the foregoing con- dition (tSard's theorem). If k i is the local de- gree of r at Pi> then d f = L k i . C. Linking Numbers Given two mutually disjoint smooth closed curves C I and C 2 in Euclidean 3-space, a quan- tity Lk(C I , C 2 ) indicating how closely they are interlinked with each other was given by Gauss as follows: Let C i be expressed by the parameters Xi = Xi(t i ) (i = 1, 2), where Xi(t i ) are tcontinuously differentiable, Then the quantity Lk(C U C 2 )= - 4  r r I  1 3 nJc 1 JC 2 X 2 XI ( dXI dX 2 ) xdet X2-X,,-,- dt 2 dt l dt I dt 2 is an integer calIed the linking number of C I and C 2 . More generally, let M n be an n-dimensional oriented tcombinatorial manifold (or C' mani- fold (r  1)) and K and K* its tcellular decom- positions such that K* is dual to K. Let z and zi (r + s = n - 1) be tboundaries belonging to the complex K and K*. Suppose that C+ I is any tchain of K whose boundary is z. Then the tintersection number [C'+1 J. [zD does not depend on the choice of such a chain C+ I . We set Lk(z, zi) = [C'+1 J . [z2J and call it the link- ing number of z and z2' The linking number Lk(z, Z2) of tsingular boundaries z, Z2 (r + s = n-l) of Mn is similarly defined by consider- ing the approximations z, z2 of z;, Z2 belong- ing to a suitable cellular decomposition K and its dual K*. The number Lk(z, z) is bilinear with respect to z;, Z2, and we have Lk(z,zD= (_I)'s+1 Lk(zz, z). In the example in 3-dimen- 
99D Degree of Mapping sional Euclidean space R 3 shown in the left half of Fig. 1, we have Lk(z:' z)= 1, while Lk(z:, zi) = 2 for the example shown in the right half of the same figure. In particular, if z; is homologous to 0 in Mn -I z; I, then we have Lk(z;, zD = 0 (Fig. 2). Generally, if z; and z'{ are homologous in Mn-lz21, then Lk(z;,zz) = Lk(z'{, z2)' "C€)" " c@ " Fig. 1 D. Order of a Point with Respect to a Cycle Let M n be an n-dimensional oriented com- binatorial manifold (or C' manifold (r 1)) with the nth Betti number b n = 0, zn-l an (n - I )-dimensional singular boundary of M n , and 0 a point of M n that is not contained in I z n-11. We set ord(z.-I, 0) = Lk(zn-I, 0) and call it the order of the point 0 with respect to zn-1 For example, when Mn = R 2 and ZI = {f(t) I O.s; t.s; 1,1(0)= f(I)}, where f is a continuous function, the order ord(zl, 0) is equal to the rotation number around 0 of a moving vector of(t) as t varies from 0 to 1. This ord(zl, 0) stays invariant as the point 0 moves in a tcon- nected component of the complement R 2 -lzI\ (Fig. 3). On the other hand, if z;' = {/;(t) I O.s; t.s; 1, /;(0) = /;(1)} (i == 0, 1) are closed curves in R 2 and the distance p(/o(t), fl(t)) is smaller than p(f(t),o) for all t in the interval [0, IJ, then we have ord(z6, 0) = ord(z!, 0) (Rouche's theorem). i Fig. 2 Fig. 3 References [1] P. S. Aleksandrov and H. Hopf, Topologie I, Springer, 1935 (Chelsea, 1965). [2J H. Seifert and W. Threlfall, Lehrbuch der Topologie, Teubner, 1934 (Chelsea, 1968); English translation, A textbook of topology, Academic Press, 1980. [3] J. W. Milnor, Topology from the differen- tiable viewpoint, Univ. Press of Virginia, 1965. 366 100 (X.17) Denjoy Integrals A. History For a real-valued function f(x) of a real vari- able to be tLebesgue integrable, it is necessary and sufficient that there exist an tabsolutely continuous function F(x) such that F'(x) = f(x) at talmost all points x ( 221 Integration Theory D). In general, the derivative of a func- tion is not necessarily Lebesgue integrable. A function f(x) is Lebesgue integrable if and only if If(x)1 is integrable. Hence a function which has an improper Riemann integral is not neces- sarily Lebesgue integrable ( 221 Integration Theory A). For this reason, it is desirable to extend the concept of Lebesgue integrals. In 1912, A. Denjoy constructively defined a new concept of integrals (Denjoy integral in the restricted sense;  Section D), which is an extension of both Lebesgue and Riemann integrals. Later, N. N. Luzin provided the descriptive theory of this integral. Indepen- dently, and nearly simultaneously, A. J. Khin- chin and Denjoy defined a more general in- tegration (Denjoy integral in the wide sense (1916);  Section D). In 1914, O. Perron, independently of Den- joy, defined a concept of integrals (Perron integrals) that is equivalent to that of Denjoy integrals in the restricted sense. To establish this concept, Perron considered the differen- tial equation y' = f(x) and utilized a method similar to the one used in the proof of the existence theorem for the solution of the dif- ferential equation y' = f(x. y). However, the concept of Denjoy integrals is inadequate to treat unbounded functions. Thus to extend the concepts of Riemann and Lebesgue inte- grals, various ideas have been introduced; for example, Denjoy (1921), J, C. Burkill (1951), and R. D. James (1950) introduced new con- cepts as byproducts of investigations concern- ing the coefticients of trigonometric series [2,3]. The A-integral concept devised by A. N. Ko1mogorov was meant to deal with the pro- blem of the conjugate function of Fourier series [4]. As a certain completion of the space of functionals of step functions, K. Kunugui defined the notion of E. R. integrals (1956), which coincides with that of A-integrals in a special case [5,6]. What has been stated so far deals only with functions of a real variable. Concerning the extension of Denjoy integrals to the case of several variables, research has been done by M. Loomis, S. Kempisty, S. Nakanishi (nee Enomoto), and others [7,8]. 
367 B. Approximate Derivative Ifwe have limho,hom{En( -h,  + k) }/(h+ k)= 1 at a point  of a tmeasurable subset E of the real line, where m is the I-dimensional Lebesgue measure, the point  is called a point of density for E. Almost all the points of E are points of density for E (Lebesgue's density theorem). Let E be a measurable set having Xo as a point of density, and let F(x) be a measur- able function on E, If there exists a number I such that for each 8> 0, Xo is a point of density for the set {xll-8 .s;(F(x)- F(xo))/(x - xo).s; 1+ 8, x E E}, then I is called the approximate derivative of F(x) at Xo and is denoted by AD F(xo), If AD F(x o ) exists, F(x) is said to be approximately derivable at Xo. If F(x) is ap- proximately derivable at each point of E, then F(x) is said to be approximately derivable in E. If F'(x) exists at a point x, then AD F(x) exists at x, and we have AD F(x) = F'(x). How- ever, there exists a continuous function F(x) that is approximately derivable at almost all points of an interval and yet not differentiable at any point of a set of positive measure. C. Generalized Absolute Continnity Let E be a set in R, and let F(x) be a real- valued function whose domain contains E. If for each 8> 0, there is a 15 > 0 such that for every sequence {[an, bnJ} of nonoverlapping intervals whose endpoints belong to E the inequality L(bn-an)<b implies LIF(b n )- F(an)1 <8, then the function F(x) is said to be absolntely continnons on E. We denote by AC the set of all functions that are absolutely continuous on E. If F(x) is continuous on E and E is the union of a countable sequence of sets En on each of which FEAC, then F(x) is calIed a generalized absolntely continuons fnnction, and we write FEGAC. If FEGAC, AD F(x) exists almost everywhere. If, for each 8 > 0, there is a 15 > 0 such that for every sequence {[an' bnJ} of nonoverlapping intervals whose endpoints belong to E the in- equality L(b n -a n )<8 implies LnO{F; [an,bnJ} < 8 (O{ F; [an, bnJ} denotes the oscillation of the function F(x) in [an, bnJ, i.e., the difference between the least upper bound and the great- est lower bound of the values assumed by F(x) on [an, bnJ), then F(x) is said to be absolntely continnons in the restricted sense (or abso- lutely continuous ( * )) on E; and we write FE AC( *). Just as we defined the notions of gen- eralized absolute continuity and absolute con- tinuity, so we define the notions of generalized absolute continuity in the restricted sense and generalized absolnte continnity ( * ). Thus FE GAC( *) means that F(x) is a generalized ab- lOOE Denjoy Integrals solute continuous ( * ) function on E, If FE GAC( *), then F'(x) exists almost everywhere. D. Definitions of Denjoy Integrals Let f(x) be a real-valued function defined on I = [a, b]. If for f(x) there exists a function F(x) that belongs to GAC on I and for which ADF(x)= f(x) holds almost everywhere, then f(x) is said to be Denjoy integrable in the wide sense (or D-integrable) on 1. We call F(b)- F(a) the definite D-integral of f(x) over I, and denote the value by (D)S:f(x)dx. The function F(x) is called an indefinite D-integral of f(x) on I. Similarly, we obtain the definition of Denjoy integral in the restricted sense (or D( * )-integral) by replacing GAC by GAC( *) and AD F(x) by F'(x) in the definition of the D-integral. If a continuous function F(x) satis- fies the equality ADF(x)= f(x) # :too (F'(x) = f(x) # :too) for all except countably many points in I, then F(x) is an indefinite D- integral (D( * )-integral) of f(x). A Lebesgue- integrable function is D( * )-integrable, a D( * )- integrable function is D-integrable, and a D- integrable function that is almost everywhere nonnegative is Lebesgue integrable. E. Constructive Definition of Integrals Let S be a functional whose domain UIK(S; I) consists of the union of sets K(S;1) of real- valued functions defined on closed intervals 1= [a,b]. If r belongs to K(S;1), we denote the value S(/) by S(/; I). Such a functional S is called an integral operator if the following three conditions are satisfied: (1) If f E K (S; 1 0 ) and I is an arbitrary interval contained in 1 0 , then the trestriction h to I of f also belongs to K(S; I). Also, S(/; I) is a tcontinuous addi- tive function of the interval Ie 10' (2) Let 1 1 = [a,bJ, 1 2 = [b,cJ, and I=[a,cJ (a<b<c), If for a function r defined on I, fl E K (S; II) and f2 E K(S; 1 2 ), where fl = h 1 and f2 = h" then f E K (S;1). (3) If f is identically 0 on I, then fEK(S;1) and S(/;1)=O. For two integral operators SI and S2, we say that S2 includes SI (or Sl is weaker than S2) if K(SI;I)cK(S2;I) for every I and SI (/; I) = S2(/; I) for every f E K(SI; I). The D-integral (D( * )-integral) is the weakest integral operator containing the Lebesgue integral and satisfying the following two conditions, (C) and (H) (resp. H( *)): (C) Canchy's condition. If, for every function f defined on 1 0 , we have fIEK(S;1) for any 1= [a +15, b -8J  10 = [a, b J, and also if the finite limit limoo,HO S(/; 1) exists, then f E K(S; 1 0 ) and S(/; 1 0 ) coincides with the foregoing limit value. (H) Harnack's condition. Let E be a closed subset of 1 0 , {Id be a sequence of inter- 
roOF Denjoy Integrals vals contiguous to the set consisting of the points of E and the endpoints of 1 0 , and f be a function on 10 satisfying the following three conditions: (i) fEEK(S;Io), where fE(X) = f(x) whenever x E E and fE = 0 otherwise; (ii) ft. = fI, E K(S;Id for each k; and (iii) LkISUk; Idl < +00 and limkoo O(S;fk; I k ) = 0 when the se- quence {Ik} is infinite. Then it follows that fEK(S;Io) and S(/; 1 0 ) = S(/E; IoJ+LkS(/k; Id. (Here O(S; ft.; I k ) denotes the variation of S (/d on I k> that is, the least upper bound of the numbers IS(/J;J)I, where J denotes any sub- interval of Id We obtain condition (H( *)) by replacing condition (iii) in (H) with a more restrictive condition: Lk o (S;fk;Id < +00. The constructive definition of the Denjoy integral in the wide sense (the Denjoy integral in the restricted sense) is obtained by ttransfinite induction starting with the Lebesgue integral and using two methods, (C) and (H) (resp. (H( * )), of extensions. F. Perron Integrals Given a function f(x) defined on an interval [a,bJ, suppose that F(x) is a function defined on the same interval such that (1) E(x) f(x); (2) f(x) # -OC; (resp. (1') F(x) .s;f(x); (2') F(x) # +00) at every point x, where E(x) (resp, F(x)) denotes the t)ower (upper) derivative of F(x). In this case, F(x) is callcd a major (minor) function of f(x). If for any I: > 0 there is a major function t/J(x) and a minor function cp(x) of f(x) such that t/J(b) - cp(b) < 1:, then f(x) is said to be Perron integrable, We denote by (P)Sf(x)dx the value inftp{t/J(b)-t/J(a)}= sup", {cp(b) - cp(a)}, G. Properties ofIntegrals If {f,.} is a nondecreasing sequence of func- tions that are D-integrable on an interval [a, b J and whose D-integrals over [a, b J consti- tute a sequence bounded from above, then the function f(x) = Iimnoofn(x) is D-integrable on [a, b J, and we have (D) r f(x)dx= ! (D) r fn(x)dx. If F(x) is a function of tbounded variation and g(x) is a D-integrable function on an interval [a,bJ, then F(x)g(x) is D-integrable on [a, bJ; moreover, denoting by G(x) the inde- finite D-integral of g(x), the following formula is valid: (D) r F(x)g(x)dx =G(b)F(b)-G(a)F(a)- f G(x)dF(x), 368 where the last term is the tStieItjes integral (integration by parts). If F(x) is a nondecreasing function and g(x) is D-integrable on [a,bJ, there is a point  in [a, bJ for which the following formula is valid: (D) r g(x) F(x) dx =F(a)'(D) f9(X)dX+F(b)'(D) f g(x)dx (the second mean value theorem). The foregoing theorems remain valid if D is replaced by D( * ) in the hypotheses and conclusions. References [1] S. Saks, Theory of the integral, Warsaw, 1937. [2J R. D, James, Integrals and summable trigonometric series, Bull. Amer. Math. Soc., 61 (1955), 1-15. [3] R. Henstock, Theory of integration, But- terworth, 1963. [4] Yu. S. Ochan, Generalized integrals (in Russian), Mat. Sb" 28 (70), no. 2 (1951),293- 336. [5J K. Kunugui, Sur une generalisation de l'integrale, Fundamental and Applied Aspects of Math., 1 (1959), 1-30, [6] H. Okano, Sur une generalisation de l'integral (E. R.) et un theoreme general de l'integration par parties, J. Math. Soc. Japan, 14 (1962), 430-442. [7] S. Kempisty, Sur les fonctions absolument continue d'intervalle, Fund. Math., 27 (1936), 10-37. [8J S, Enomoto, Sur une totaIisation dans les espaces de plusieurs dimensions I, II, Osaka Math. J., 7 (1955), 69-102, 157-187. 101 (XXI.19) Descartes, Rene Rene Descartes, (March 31, 1596-February 11, 1650), philosopher, mathematician, and natural scientist, was born in the province of Touraine in France. He became dissatisfied with his studies of scholastic philosophy in the Jesuit Academy in La Fleche, and later, in 1619, while stationed in VIm during a tour of duty in the army, he underwent a philosoph- ical conversion. He had an idea of methodo- logicalIy unifying the various fields of interest to him using mathematics as a model. He returned to Paris in 1621, but moved to HoI- land in 1628 to concentrate on his work. Swe- 
369 den's Queen Christina invited him in 1649 to that country, where he died the next year, evidently from a combination of cold and overwork. Descartes, often considered the founder of modern philosophy, discarded early the tra- ditional theological view of the world and stated that all knowledge should be recognized as logical only after it has been submitted to rational criticism. This ushered in the modern view of the world based on mathematics and physics. In 1637, he published Geometrie as an appendix to his Discours de la methode, which also contained his works on optics and meteo- rology, In it, he promoted F. tViete's symbolic algebra, which he applied to geometric prob- lems. His idea that algebra could be used as a general method for geometry established him as the founder oft analytic geometry. References [IJ R. Descartes, Oeuvres I-XII, C. Adam, P. Tannery, and Leopold Cerf (eds.), 1897-1910. [2J H, Lefebvre, Descartes, Paris, 1947. 102 (XVIII.12) Design of Experiments A. General Remarks The design of experiments is a part of the statistical planning required to collect the data appropriate to the purpose of statistical in- ference ( 401 Statistical Inference) in various fields of scientific research and application. The main purposes of the design of experi- ments are (1) to analyze a given statistical linear model ( 403 Statistical Models) and (2) to devise a "good" statistical linear model. Sometimes this term also refers to a statistical method including the tanalysis of variances. Thus the purpose of designing an experiment is to provide the most efficient and economical methods of reaching valid and relevant conclu- sions from that experiment ( 403 Statistical Models). R. A. Fisher, whose contributions to statis- tical theory were remarkable and far-ranging, propounded three required principles to con- trol the experimental field in order to guaran- tee the validity of statistical methods and to increase the sensitivity of experiments: (i) re- plication, or the repetition of the set of all the phenomena to be compared in the experiment, for the evaluation of experimental error var- iance; (ii) randomization, or the procedure 102 A Design of Experiments allotting various experimental treatments at random, in order to change systematic errors into random errors; and (iii) local control, or the procedure which makes the variation within each experimental block as small as possible, in order to minimize or remove sys- tematic errors. These are called Fisher's three principles. A design satisfying principles (i) and (ii) is called a completely randomized design; it enables us to attach a probability statement to estimated treatment differences by obtaining valid estimate of experimental error variance. Let an n-dimensional trandom variable Y = (Y I , ..., 1;,)' be represented by a linear model Y=X.;+W, (1) where X is a given n x s real matrix, .; = (l' ,.,'s)' is an s-vector, and W=(W 1 , ..., J.v,,)' is a random vector with the texpectation E(W)= O. Then Y is called the observation vector, W the error vector, .; the effect vector of Y, and X the design matrix. According to the properties of the effect vector';, the linear model (1) is separated into three classes: (i) The class of fixed-effects models for which'; is a fixed unknown para- meter. In this case, the component i of'; is called a fixed effect, and a linear function n = F'.; of';, with a given coefficient vector F is called a linear parameter or parametric fnnc- tion, (ii) The class of random-effects models for which the components E i of'; are random variables. In this case, each component E i is called a random effect, and'; is denoted by E. (iii) The class of mixed models for which there are both fixed effects i and random effects  in .;. In this case, the model (1) becomes Y=X I .;1+X 2 E 2 +W, (2) where'; 1 = (I' .", ,)' is a fixed-effect vector and E 2 =(E'+I' ..., E,)' is a random-effect vec- tor. The conditions frequently assumed for the tdistribution law of Yare: (a) The errors J.t; (i = 1, ..., n) are uncorrelated and E(WtJ = ... = E(J.v,,) = O. (b) The errors J.t; (i = 1, ..., n) have a common unknown tvariance 0"2. (c) The errors J.t; (i= 1,..., n) have the tnormal distri- bution, (d) The random effects  are uncor- related and independent of the error vector W. (e) The random effects  have a common unknown variance O"t. (f) The random effects  have the normal distribution. Let L(X) be a tlinear subspace of Rn span- ned by the column vectors of X. The linear model Y=X.;+W (3) is called a hypothesis on the linear model (l) if L(X)cL(X). The main issues of the theory of design of experiments are concerned with (I) statis- 
102 B Design of Experiments tical inferences, such as estimation or testing hypotheses, under models (1), (2), (3), (i), (ii), (iii); (II) determination of the design matrix X satisfying certain optimal conditions; (III) construction of a theoretical foundation that can explain the validity of the statistical treat- ment of the observed data by means of the above models. B. Block Design The design of experiments is described here in terms of the so-called block design. There are n expenmental units ex = I, .., , n called plots, and an observation Y, is assigned to each plot ex. A block is constructed with several plots under Fisher's principle (iii), and the number of plots in a block is called the block size, the jth one being denoted by k j , j = I, ". , b, with Lj k j = n, One of v operations, called treatments or varieties, is applied to each plot. It is assumed that the observation Y, at the plot ex in the jth block under the ith treatment has the structure Y.=i+l}j+ I<Y.. The i' i= I,..., v, are called treatment effects, and the I}j,j, ... , h, block effects. It is also as- sumed that Li (i = O. In this case, Y is repre- sented in matrix notation as Y =<1>'; + 'P1f + W, (4) where <I> = (CP'i), ex = 1,..., n, i= I,..., v, with _ { I when the ith treatment is applied CP,i - to the plot ex, o otherwise, and 'I' = (1ft,}), ex = I, ..., n, j = 1, ... , b, with { 1 when the plot ex belongs to Ift,j = the j th block, o otherwise, Here it is ass umed that Li <P,i = 1, L, <P,i = ri 1, Liri=n, Ljlft,j= I, and L,Ift,j=kj 1. We call r i the number of replications of the ith treatment. We set N = (niJ = <1>''1'. Then nij is the number of observations in the jth block to which the ith treatment is applied. The matrix N is called the incidence matrix of the block design. In any experiment, each plot has its own effect. The blocks are constructed so that this plot effect in each block becomes as homoge- neous as possible, although it is impossible to eliminate the effect completely, For this purpose, randomization is adopted. Suppose that we are given k plots in a block and v treatments (k.s;v). Then randomization is 370 utilized to select a treatment out of v treat- ments to be allocated to each plot so that the selection is "at random." Then the plot effects are random, and the error term in model (4) can be considered to be the sum of a plot effect and an original error. A block design satisfying Fisher's three principles is called a randomized block design. Blocks that can accommodate all the treat- ments to be studied are called complete blocks. Those that do not contain all the treatments are called incomplete blocks. Blocking can be considered to be an extension of pairing. In the terminology of block design, we can say that many experiments are block experiments, or even (tautologically, if we allow those with only one block) that all experiments are block experiments. In a block design N ==(n i ), treatments io and i 1 are said to be connected if there exists a chain ioj I i l j2 ... ii-I j1i 1 of integers such that l.s;i p .s;v (p=O, 1,2, ...,1), 1 .s;jq.s;b (q = 1,2,..., I), and n ioj1 >0, ni]j] >0, n i j > 0, " . , n i j > 0, n i j > O. If all pairs of trtments ar-utuall; ]connected, then the design is said to be connected. In this case, the rank of the matrix C defined in Section C is v-I. If the design is disconnected, then the incidence matrix N can be partitioned into two or more connected portions, e.g., N=(I J Thus without loss of generality we can restrict ourselves to the connected case. C. Estimation nnder the Fixed-Effects Model Conditions (a) and (b) of Section A are as- sumed here. The tnormal equation which gives the tleast square estimates e and ij of'; and 'I, respectively, is (:}<I>, 'P)()=(:',)Y. (5) Set <1>'<1> = diag(r I' ... , rv) = D" '1"'1' = diag(k j ,..., kb)=Dk> C=D,- ND k - 1 N', Q= (<I>' - ND k - 1 'P')Y, where diag(...) means a diagonal matrix with the diagonal elements .... Then (5) reduces to Ce = Q, ij==Dk-I('I"Y - N'e). (6) Let L be an orthogonal matrix that transforms the matrix C to a diagonal form; that is, L'CL =diag(PI,'" ,Pv-I,O)=A, Pi>O for all i, Set A * = diag(Pil, "', p';-_\, 0), C* = LA * L'. Then e 
371 = C*Q is a particular solution of (6) and Li i =0, A parametric function n=F'.; with coeffi- cient vector F=(FI,F2' ...,Fv)' is caIled a treat- ment contrast if the sum LiFi of coefficients vanishes. A treatment contrast n = F'.; is called a normalized contrast if F'F = 1. It is caIled the elementary contrast if F has only two nonzero elements 1 and -1. Elementary contrasts of treatment effects show the comparison of treatments involved in them. When a design is connected, any contrast n is testimable, and the tbest linear unbiased estimate of n is it = F' f Furthermore, if f i is the eigenvector of the matrix C with unit length corresponding to an eigenvalue Pi and F=L;aif" then the var- iance of the estimate it is given by (J2 Lia f/Pi' The following properties are equivalent: (i) A design is connected. (ii) Any treatment con- trast is estimable. (iii) The rank of the matrix C is v-I. (iv) The minimum eigenvalue 0 of D,-1/2CD,-1/2 is simple and other eigenvalues 8 1 satisfy 0 < 8 1 .s; 1. (v) The maximum eigenvalue 1 of D,-1/2 ND;1 N'D,-1/2 is simple and other eigenvalues 8 2 satisfy O.s; O 2 < 1. (vi) There exists a positive integer p such that each ele- ment of (D,-1/2 N D k - I N'D,-112y is positive. It holds that O 2 = I - 8 1 , D. Test of a Hypothesis H: I = .,. = v in the Fixed-Effects Model Conditions (a), (b), and (c) of Section A are assumed here. The hypothesis H :  1 = . .. = (v is represented by y=\}',,+w. Consider a direct sum decomposition Rn = L(rJ+ Lt('¥) + L(<I>, '1') + L,'P of R n , where L(A) and L stand for the torthocomple- ments of L(B) with respect to L(A) and Rn, respectively, and r =(1,1, ...,1)' ERn. The tprojection operator matrices for the decom- posed subspaces L(r), Lt(\}'), L(<I>, \}'), and L,'P are denoted by PI' P 2 , P 3 , and P 4 , respec- tively. Then we have PI ==n-IE nn , P 2 = \}'D k - l \}" -n-IE nn , P 3 = (In - \}' D;l \}")<I>C*<I>'(In - \}' D k - l \}"), P4=In-Pl-P2-P3, where Eab is an a x b matrix whose entries are all unity and In is the n x n identity matrix. The analysis of variance for the hypothesis (7) in model (4) is given by Y'Y ==Y'P I Y + Y'P 2 Y + Y'P 3 Y + Y'P 4 Y. This is called an intrablock analysis. A usual test for the hypothesis H is given by a critical 102 E Design of Experiments region with n-v-b+ 1 Y'P3 Y F > constant v-I Y'P 4 Y ( 403 Statistical Models). E. Optimal Block Design A block design is said to be optimal when it minimizes the variance of the estimate it of a normalized contrast n. Suppose that the num- ber v of treatments, the number b of blocks, and each block size k j , j = 1, .,. , b, are given. Under each of the foJ1owing criteria the corre- sponding block design is optimal in the sense indicated: For positive eigenvalues, PI' Pz, ..., Pv-l' of the matrix C in Section C, (I) IIil Pi is maximal (D-optimality); (II) minPi is maximal (E-optimality); (III) Lil Pi- l is mini- mal (or the average variance of the estimates of all normalized contrasts of the parameters (i is minimal) (A-optimality). If PI =... =PV-l =(n-b)/(v- 1) (= P, say) and nij is either 1 or 0, then the design is opti- mal for each of the optimality criteria (I), (II), and (III). In this case, we have C=P(Iv-v-1E vv )' (7) Such a design is called a variance-balanced block design in which every normalized con- trast is estimable with the same variance. If D,-I ND k - 1 N' -n- l Evlr' =J1(Iv-n-1 Evlr') (or C =(1- J1)(D,- n- 1 rr'», every normalized con- trast is estimable with the same efficiency I - J1, where r = D,Evl and 0 < J1 < 1. Such a design is called an efficiency-balanced block design. For a block design, any two of the following properties imply the third: (i) The design is variance-balanced; (ii) the design is efficiency- balanced; (iii) the design is equireplicated. When all block sizes k j equal some number k independent ofj, all numbers r i of replica- tions equal some number r independent of i, and A ii , = Ljnijni'j (number of times that both treatments i and i' are applied to the same block) equals Some number A independent of i and i', then the design is combinatorially balanced. The design is usually called the balanced incomplete block design (BIBD) if these three conditions are fulfilled and k < v. A BIBD is both variance-balanced and efficiency-balanced. If all the treatments are replicated the same number of times and the blocks are of the same size in a block design, then the only variance-balanced design is a BIRD, provided such a design exists. A BIRD is often denoted by BIBD(v, b, r, k, ..1.) and we have the relations vr = bk, ),( v-I) = r( k - 1) and v.s;b, among the parameters; the last 
102 F Design of Experiments relation is called Fisher's inequality. The design is said to be symmetric when v = b. Further- more, if v = b is even, then r - ..1. must be a perfect square. If v = b is odd, then the equa- tion x 2 =(r_).)y2 +( _1)(v-l)/2 },Z2 must have a solution in integers x, y, z not all zero. This is called the Bruck-Ryser-Chowla theorem. Neces- sary conditions for the existence of a BIRD have been obtained. One of these conditions is stated in terms of the Hasse-Minkowski p- invariant Cp(A)=( -I, -1)pII7dDi' -Di-d p , where (m, m')p is the tHilbert norm-residue symbol, Do = 1, and D i is the principal minor of the n x n matrix A. Another is described in terms of the embedding of a quasi residual design into the corresponding symmetric BIBD. No effective necessary and sufficient condition for the existence of a BIBD has been obtained. In general, it is conjectured that for a positive integer k, with finitely many excep- tions, BIBD(v, b, r, k, A)'s exist for all pairs v, Ie of positive integers satisfying v> k, Ie(v - I) = 0 (mod k-I) and Av(v-l)=O (mod k(k-l)). The known methods of constructing block designs are of two main types, direct and re- cursive. Recursive methods are a way of con- structing designs from smaller ones. Direct methods yield easier constructions, but are applicable only for special values of the para- meters. A direct method of constructing a BIRD designates the treatments and blocks, respectively, as the points and subs paces of the tprojective space and the taffine space over a tfinite field. To explain another method of constructing a BIBD, we let G be an tadditive group of order n and x), ..., x) be m treat- ments corresponding to each element x(i) of the group (i = 1,2, "., n), The treatment x) is said to belong to the cdh class (IX = 1,2, ..., m), and a pair (x), xV») of treatments is called a dif- ference of type (IX, [3, x(P») if x(i) - x(j) == x(p) ( # 0). We can form t blocks of size k B - { (it> (i.! } 1- X 111 ' ...'X ak ' B = {X Ut> X U.! } t PI ,..., Pk ' such that each block B i contains exactly r treatments belonging to the IXth class (IX = 1, .., , m) and among all pairs of treatments in the same block there are }, differences of each type (IX, [3, x(P»). Such a set of t blocks is called a difference set. The t blocks in a dif- ference set are called initial blocks. Given such a difference set, we can obtain nt blocks by joining elements of G to the elements of each B, (s = 1,2, ... ,t). These nt blocks form a BIBD(v = mn, b = nt, r, k = rm/t, Ie) ( 66 Combinatorics). As a generalization of Fisher's inequality, for an unequal-replicated block design with unequal block sizes, b:;:, v - (j holds, where (j is 372 the multiplicity of the maximum eigenvalue I of the matrix D,-1/2CD,-If2. The equality sign holds if and only if the projection operator cor- responding to zero eigenvalue of D,-1/2CD,-1/2 is a zero matrix. F. Estimation in a Mixed Model Consider a block design (4), where every block has the same size k and every treatment has the same number r of replications. Let'; be a fixed effect and 1J a random effect denoted by H, and assume that W satisfies conditions (a) and (b) of Section A and that H satisfies con- ditions (d), (e), and (f) (where the  are re- placed by the coordinates Hj of H). If E(H) = y, j = 1, ..., b, then, changing the notation H- EVI Y to H, we can rewrite (4) as Y = ry +<1>'; + \}IH + W, (8) with E(H) = 0, The normal equation that gives the least square estimate of  is (C + 0'2(0'2 + kO')-1 Cd = Q +0'2(0'2 + kO' r l Ql' where C and Q are the same as in (6) and C l = ND k - l N' -rv- l Evv, Ql = (ND k - l \}I'- V-I EVI r') Y. Equation (9) cannot be solved unless the ratio 0'2: O' is given. When 0'2 : O' is not known, substituting in 0'2 + kO' its tun- biased estimate given by analysis of variance, one obtains a solution of (9) which tends to a tconsistent estimate of'; as the number of blocks tends to infinity. (9) G. Estimation in a Random-Effects Model Let'; (denoted by E) and H in the model (8) be random effects. Suppose that E, H, and Ware mutually independent, and that the distribu- tions ofE, H, and Ware N(O,O'il v ), N(O,O'Ib)' and N(O, 0'2 In), respectively. The distribution of Y in (8) contains four parameters y, 0'2, O', 0'5:. When k j < v, the tminimal sufficient statistic is generally incomplete, and therefore the op- timal estimate of y, 0'2, d, and O'i cannot be determined, As an example, the minimal suffi- cient statistic for the random-effects model of a BIBD(v, b, r, k, ..1.) is (Li 1';, Y' P 21 Y, Y' P 22 Y, Y' P 3 Y, Y' P 4 Y, Y' Ps V), where P 21 = k-l(r-lerl BTB- kr(r- Ar l n- l Enn' P 22 =k«k-l)r+Ar l r- l x (T-k- I BT)(T-k- l TB), P 3 == k- l B- k-l(r- Ie)-l BTB + v).(r- ).)-1 n -I Enn, P4=In-k-lB-P22' Ps=r-lT-n-IEnn, 
373 with T= <1><1>' and B = '1''1''. In this case, E(Y'P 4 Y)=(n-v-b+ 1)0'2, E(Y'P 3 Y)=(b- v) (0'2 + kO'D, From these equations unbiased estimates of 0'2 and O'f can be derived, but their optimality is not guaranteed ( 396 Statistic), H. Factorial Experiments Suppose that there are h factors Fl' '" , Fh which affect Y, and each factor Fl has Si levels (i= 1, .."h), It is assumed that V=SI x S2 X ,.. X Sh treatment combinations are derived by all the combinations of the levels of h factors, and that v treatment effects are represented by the sum of subeffects called main effects and inter- actions, or factorial effects covering both. Such an experiment is called a factorial experi- ment or factorial design; it allows us to ex- amine the effects of two or more factors, each factor being applied at two or more levels, by testing all possible treatment combinations formed from the factors under study, Specifi- cally, suppose that we have the case h = 2, called a two-way layout. Let main effects be denoted by'; 1 = (( L '''' u'.; 2 == (f, .." s22)', and interaction by .; 12 =( if,  ii, "" ''.2)'' where Lia =0, Ljl=O, Lii}2=Lji}2=0, When there is no restriction on the number of observations, each of v = Sl X S2 treatment com- binations is replicated t times, Components Yijk of the observation vector Yare represented by a linear model 1';jk=y+l +J+(i2+ W,jk, i=I,,,,,sl; j=I,,,,,s2; k= 1, ..,,t, or, in vector notation, y=ry+xI.;1 +X 2 e +X 12 .;12+W, The analysis of variance in this case is given by Y'Y = '\' Y' PY L., " where Pi is the projection operator matrix on the subspace derived by a direct sum decomposition Rn = L(r) + Lf(X l ) + Lf(X 2 ) + Lt 1 .x,(Xd +Lt 12 , where n=sls2t, Denoting by 1;j the arith- metic mean of 1';jk over the subscript k and using similar notation 1; ., Y j ' and y, ,we haveY'P l Y=ny 2 , Y'P 2 Y=S2 tL (1; - Y )2, Y'P 3 Y =Sl tL(Y j - y. )2, Y'P 4 Y = tL(1;j. + Y -Y; -Y j )2,Y'P s Y=L(1';jk-1';Y,The tanalysis of variance table is given in Table 1. For h  3 similar models can also be con- sidered for dealing with interactions up to h- factor. The factorial experiment is said to be symmetric if S I = S2 = ." = Sh = S and is called the Sh factorial experiment. Otherwise it is 102 I Design of Experiments Table 1 Analysis of Variance Sum of Source Squares Degrees of Freedom Fl main effects F 2 main effects FI F 2 interactions Error Y'P 2 Y Y'P3 Y Y' P 4 Y Y'PsY Sl -1 S2 -1 (Sl -1)(s2 -1) sls2(t-l) said to be asymmetric and is called the S I x S2 X ", X Sh factorial experiment. I. Fractional Factorial Designs Factorial designs require at least an experi- ment for all the combinations of levels of factors under consideration, When there are a large number of treatment combinations re- sulting from a large number of factors to be tested, it will often be beyond the resources of the investigator to experiment with all of them, For such cases Finney (Ann, Eugen., 12 (1945)) proposed a method in which only a fraction of the treatment combinations are experimented with, A design of this type is called a fractional factorial design, One reason for the usefulness of fractional factorial designs in preference to factorial designs is that they involve a smaller number of treatment combinations, since, in most scientific experiments, it is usually found that a large number of the higher-order inter- actions are negligible, The crucial part of the specification of fractional factorial designs is the suitable choice of the defining or identity relationship, Equating the nonestimable fac- torial effects for the selected fraction of treat- ment combinations with I gives the identity relation, Each factorial effect is not estimable after selecting a fraction of treatment com- binations, and any contrast of the selected treatment combinations represents more than one factorial effect. All factorial effects repre- sented by the same treatment combinations are called aliases, In aliases, by assuming that other interactions are negligible when com- pared to the one of interest, estimation can be made by means of the corresponding con- trast of the selected treatment combinations, When all factorial effects of order higher than 1 are assumed to be zero, a fractional factorial design is said to be of resolution 21 + 1 if it satisfies the condition that under the usual model all factorial effects up to order 1 are estimable, whereas a fractional factorial design is said to be of resolution 21 if it satisfies the condition that under the usual model all fac- torial effects up to order 1- 1 are estimable, In the beginning, the theory was developed 
102 J Design of Experiments for an orthogonal fractional factorial design in which the estimates of various effects of inter- est are all uncorrelated. However, these are available only for special values of the number n of treatment combinations and are in general uneconomical in the sense that they require a large value of n in comparison with the num- ber of unknown effects, As a generalization of orthogonal fractional factorial designs, I. M, Chakravarti (Sankhya, 17 (1956)) introduced the concept of balanced fractional factorial designs, In these designs, the estimates of effects are not always uncorre1ated. However, the tcovariance matrix of the estimates is invariant under any permutation of factors, Balanced fractional factorial designs are flex- ible in the number of treatment combinations, with the result that more experimental situ- ations can be handled. These two kinds of fractional factorial designs can be constructed by using orthogonal arrays and balanced arrays, defined in Section L. AIl alias relations can be derived from the identity relation in orthogonal designs, In the model (I) for a design, the variance of the estimates of estimable linear functions of .; depends on the matrix XX which is called the information matrix. For eigenvalues of X'X, the D-, E-, and A -optima1ities of fractional factorial designs can be defined similarly to (I), (II), and (III) given in Section E, An ortho- gonal fractional 2 h factorial design is D-, E- and A-optimal. A fractional factorial design is said to be saturated if the number of treatment combinations is equal to that of parameters in the model to be estimated, However, since saturated designs do not provide an estimate of experimental error, their use should, in general, be confined to those situations in which a prior estimate of experimental error is available. The existence of a symmetric BIBD(4t -1,4t-l, 2t -1, 2t -1, t -1) implies that of an orthogonal saturated fractional 2 4 '-1 factorial design of resolution III, J. Application of Algebras In the theory of design of experiments, the ideas of association algebra and relationship algebra play an important role, Let Ai be a v x v symmetric matrix with entries 0 or 1 (i = 0,1, "" m), If a set {Ai I i = 0,1, .", m} satisfies the conditions m Ao=Iv, I Ai=Evv, j:::::O and there is a nonnegative integer Plk for every i, j, k such that m AjAk= L pjkAi, i=O 374 then the Ai are called association matrices, In this case, there exists a natural number n, for every i such that AiEvv = niEvv holds, If an (IX, fJ) component ati of Ai is unity, then the treat- ments IX and fJ are said to be the ith associates. The talgebra sf over the real number field generated by the matrices Ao, A I , " . , Am is called the association algebra. sf is commuta- tive and tcompletely reducible. Ak -->Y'k = (pjk) is the tregular representation of sf, There is a nonsingular matrix U = (uij) that transforms an  into diagonal matrices simultaneously: UU-I =diag(zoi' ""zm;)' i=O,I,.."m, ( m ) -1 m At = L UifZij .L uijA j , fO JO i=O,I,...,m, are mutually orthogonal idempotent elements of sf, For example, consider the case where Ao = I S1 @ I", Al = (E S ,SI - Is) @ IS2' A 2 == I S1 @ (E S2S2 - Is), A3 =(E S1S1 - Is) @ (E S ,S2 - Is,) with m = 3, where @ means the tKronecker product. The algebra sf generated by Ao, AI' A 2 , A3 is called an F 2 type association algebra, The association matrices Ao, AI, A2' and A3 correspond to the relationships between the treatments in a two-way layout. The mutu- ally orthogonal idempotent elements in this case are A =sjl E S1S1 @ SZI E S2S2 ' Af =(1S1- sjl ES1S) @ SZI Es,s" At =sjl E'I" @ (Is,- szIEs,s,), and At =(1,,-sl IE s 1 s)@(I s ,- SZI Es,s,J. For the factorial experiment with h factors (h  3), association matrices can be constructed in a similar way, If h = 3, the num- ber m of association matrices is 7, Many types of association schemes are known, and some of them for m = 2 are classified as group divis- ible, triangular, Latin square, cyclic type, and so on, For a group divisible type of v = Sl S2, Ao=I" Al =Is @ Es s -A o , A 2 =Evv-Ao- AI; At =v- I Ev, Af'='v- I {(Sl -1) (Ao+AI)- A 2 }, At =SZI {(s2 -1)Ao-Ad, An experimental design consists of a set of n experimental units called plots, Define a relationship R between the plots as a set of ordered pairs (i,j) of plots, A relationship R among a set of n plots can be expressed as a symmetric n x n matrix (r ij ) of O's and I's: { I if the ith plot is related to the jth rij = plot by the relationship R, o otherwise, and this matrix is also denoted by R. If there are k types of relationships R I' " , , Rk among n plots, the a1gebra.'% over the real number field generated by the matrices R I' " , , Rk is calIed the relationship algebra of the design,  is a tsemisimple algebra, Example (1), The relationship algebra f3£ of the factorial experiment with h = 2 and t repli- cations is generated by the following matrices 
375 over the real number field: B] = Ao @ I" B 2 = Ao@(E,,-I,), B 3 =A l @ E", B 4 =A 2 @E", Bs = A3 @ E", where Ai, i = 0, 1, 2, 3, are the association matrices of F 2 type. The mutually orthogonal idempotents that correspond to the ttwo-sided ideal decomposition of  are Bf =At @ t- l E", Bf =Af @ t- l E", B't = Af @ t- 1 E", Bf =A't @ t-] E", and Bt = Ao @ (I, - t -1 E,,), These are the same as the projection Pi in the two-way layout design given in Section H. Example (2), Consider a block design with v treatments, each having the same number r of replications, and with b blocks, each having the same size k ( < v), Suppose that association matrices Ai, i = 0, 1, ,., , m, are given, by which the associations among the treatments are defined. Let Ap be the number of blocks to which the ith associate treatments (J, and fJ are applied. The design is called the partially bal- anced incomplete block design (PBIBD) if Ap = ).i  0 independently of IX and fJ, When m = 1, the design is a BIBD, Let the observation vector be represented by (4), The relationship algebra , of a PBIBD is generated by n x n matrices In' Enn, B='1"I", T;=<1>A i <1>', i= I, 2, ...,m, where NN' =LoAiAi=LoPiAt, Pi = Lj'o AjZij (O.s; Pi.s; rk; i = 0,1, .." m), and Po ==rk=Lon)i' Write T;# == <1>A i # <1>'. Accord- ing as pi=rk, O<pi<rk, or Pi=O, L(T;#) is said to be confounded with the blocks, partially confounded with the blocks, or orthogonal to the blocks,  is noncommutative, completely reducible, and isomorphic to the algebra of matrices of the type shown in Fig, 1. c'" ::.J Fig. 1 The analysis of variance of a PBIBD is given by a decomposition of In into mutually orthogonal idempotent elements of .. For a PBIBD, the matrix C in (6) is of the form C = LiriAi#, where r i = r- k- I Pi' For a connected PBIBD, all At.;, i = 1, 2, ", , m, are estimable, If there exists a group divisible PBIBD(m=2) with }'2 =}, 1 + I, then it is E- and A -optimal. K. Design for Two-Way Elimination of Heterogeneity Consider a design with v treatments in a u x w rectangular block, The row effect and the column effect of this block are denoted by " and v, respectively, Thus the observation vec- tor Y is of the form Y = ry +<1> + \}',,+ ITv+ W, 102 L Design of Experiments where the definitions of r, <1>, \}' and IT are similar to those for block designs, Set L = <1>'IT, M = <1>'\}', D, =<1>'<1>, F=D,-w- l LL' _u- l M M' +(UWfl LEuuL', The matrix F plays a role similar to that of the matrix C in (6). When the rank of F equals v-I, the design is said to be connected. If F = r(Iv - v -I E vv ), (11) then the design satisfies the optimum criteria (I), (II), and (III) given in Section E. When u = w = v and (11) holds, the design is called the Latin square, When u = v and (11) holds, the design is called the Y ouden square, When u> v and (I I) holds, the design is called the Shrik- hande square, In a Y ouden square design treat- ments are grouped into replications in two different ways, i,e" rows and columns, where rows constitute a BIBD, whereas columns are complete blocks, The existence of a Y ouden square design is equivalent to that of a sym- metric BIRD, If the associations among v treatments are defined in terms of association matrices, the partially balanced design for two-way elimi- nation of heterogeneity can be defined in a way similar to the PBIBD, In this case, the equa-' tion F=LiriAt holds, and if d={Iv,Evv-IvL then (11) holds, hence the optimum criteria are fulfilled. The definition of the relationship algebra . of a partially balanced design for two-way elimination of heterogeneity is similar to that used for the PBIBD.  is isomorphic to the algebra of matrices shown in Fig, 2. '. 0 . **. ... ... ... ... **. .. .. .. .. . o Fig. 2 The analysis of variance of this design is given by a decomposition of In into mutually orthogonal idempotent elements of, L. Balanced Array and Orthogonal Array (10) Suppose that T is an n x h matrix with entries from a set A of s (2) distinct elements, Con- sider the s' 1 x t matrices X = (x I, X2, ,,' , x,) that can be formed by giving different values to the x;(EA), i=l, 2, ""t, Suppose that asso- ciated with each 1 x t matrix X there is a non- negative integer A(X l , X2, "', x,) which is invar- iant under any permutation of a given set {X 1 ,X2' ""x,}, If, for every t-columned sub- matrix of T, the s' 1 x t matrices X occur as 
102M Design of Experiments rows )"(X I' X2, .", x,) times, then the matrix T is called the balanced array (BA) of size n, h constraints, S levels, and strength t having the index set {A(x" "', x,)}, Such an array is de- noted by BA(n, h, s, t); n is also called the nnm- ber of treatment combinations, In particular, when A(x 1> ", , x,) = A for every x 1> ... , x" such an array is called an orthogonal array (OA) of size n, h constraints, s levels, strength t, and index ..1., which is denoted by OA(n, h, s, t), We have n == AS', Balanced arrays have the advantage that they can be constructed with fewer treatment combinations than the orthogonal arrays for given h, s, t para- meters, The transpose of the incidence ma- trix of a BIBD(v, b, r, k, A) is, for A = {O, I}, a BA(b, v, 2, 2) with ..1.(0,0) = b - 2r + ..1., ..1.(1,0) = ..1.(0,1)= r - )", and ..1.(1,1) =..1.. There is a close relation between the existence of an OA and that of a BIBD. For example, the existence of an OA(S2, h, S, 2) is equivalent to that of h - 2 mutually torthogonal Latin squares of order s that is for h = s + 1 equivalent to the existence of a BIBD(s2, s(s + 1), s + 1, s, 1), The existence of an OA(4t,4t-I,2,2) is equivalent to that of a symmetric BIBD( 4t - 1, 4t - 1, 2t -1, 2t - 1, t - 1) which is also equivalent to the existence of an OA(8t, 4t, 2, 3), Balanced arrays and orthogonal arrays play a vital role in the construction of symmetric and asymmetric confounded factorial experi- ments and fractional factorial designs, In an OA(n, h, s, t), regarding columns and entries as h factors and levels of a factor of the column, respectively, each row corresponds to a treat- ment combination, In this case, this ortho- gonal array gives an orthogonal fractional S' factorial design of resolution t + 1 with n treat- ment combinations, A necessary and sufficient condition for a fractional Sh factorial design of resol ution 21 + 1 to be orthogonal (resp, balanced) is that the design be an orthogonal (resp, balanced) array of strength 21, provided that the information matrix of the design is nonsingular. M. Response Snrface If all the factors represent quantitative vari- ables, such as time, temperature, amount of ingredients, etc" it is natural to think of the yield or response Y of the experimental results as a continuous function of the levels of these factors, We can write a functional relationship Y, = f(xI., X 2 ."", x h ,) + It;" IX= 1,2,,,,, n, where Yo represents the IXth observation in the designed experiment and Xi' represents the level of the ith factor in the IXth observation, The function f or the surface defined in (h + 1)- 376 dimensional tEuclidean space by f is called the response surface, The residual It;, measures the experimental error of the IXth observation. A knowledge of the function f gives a complete summary of experimental results and also enables us to predict the response for values of the Xi' that were not tested in the experiment, or to determine such combinations of the values of variables Xi' that give the max- imum (or minimum) value of f When the mathematical form of f is not known, the function can sometimes be approximated satisfactorily within the experimental region, which is relatively small, by flexible graduating functions, such as polynomials of degrees 1 and 2 in the variables Xi,: h Yo = [30 + L [3i x i, + It;" i=l h h h Y, = [30 + L [3i X i, + L [3ii X ;, + L [3i j X i ,Xj, + W" i=l i=l i<j The coefficients [30, [31"" are parameters to be estimated from the data. Thus, (i) when the form of the true f is assumed known, the ob- ject is to estimate the parameters; (ii) when the form of the true f is unknown, the object is to approximate the f by some graduating func- tion, Designs appropriate for (i) and (ii) are called designs for estimating parameters and designs for exploring a response snrface, respec- tively, Some experimental designs that have been developed for fitting polynomials of the first and second degrees are called first-order designs and sccond-order designs, respectively, The problem here is to increase the precision of fitting response surfaces by appropriately choosing n points in a given experimental region S of variable x. That is, how do we find the experimental region that interests us in the sense of finding optimum conditions and, having found it, how do we design experiments to map f over the region? As optimum criteria of allocations, corresponding to (I) and (II) given in Section E, consider the following: (IV) the tgeneralized variance of the estimates of coefficients is minimal; (V) the supremum of the variance of estimates of the expectation E(Y) of response Y is minimal for all variables XES, For example, when an observation Y is given by a polynomial regression of a variable X as Yo = [30 + [31 X, + [32 X; + '" + [3h X ; + It;" if for the tLegendre polynomial Ph(x) we allocate an experimen t replicated at X = i: 1 and h - 1 roots of P(x)=O the same number of times, then it is an optimum design in the sense of (IV) and (V), As a design for fitting first -order response planes, we can use a 2 h factorial design (or its fraction) of resolution III, since it is sufficient to consider only linear effects for each of h 
377 factors, In comparing first-order designs, we take as an optimum criterion the minimum of the average variance of the estimates of the coefficients, corresponding to (III) of Section E. For a design for fitting second-order re- sponse surfaces, we need at least three levels of each factor for estimating all effects up to quadratic ones; then we can use a 3 h factorial design (or its fraction) of resolution V for esti- mating the linear x linear component of inter- actions of two factors, In this case, treatment combinations are too large in number, so we can adopt central composite designs, which are constructed by adding further treatment com- binations to those obtained from a 2 h factorial design (or its fraction). If the coded levels of each factor are -1 and + 1 in the 2 h factorial design, the (2h + 1) additional combinations are (0,0, ",,0), (:t d, 0, ", , 0), (0, :t d, 0, ".,0), "., (0, ",,0, :t d), The total number of treat- ment combinations to be tested is 2 h + 2h + 1 ( < 3 h for h  3), The value of d can be chosen to make the coefficients in the quadratic poly- nomials as orthogonal as possible to each other or to minimize the bias that is created if the true form of the response surface is not quadratic or to give the design with the prop- erty of being rotatable, If Yo is the estimated response at (XIO' X20, "', XhO)E S, the response surface design is said to be rotatable if and only if the variance of Yo is a function only of the distance p = (xi 0 + xo + ' , , + XO) 1/2 of (x IO , X 20 ,.", x hO ) from the origin, so that the variance contours in the experimental region of the variables are circles, spheres, or hyper- spheres centered at the origin. Now, when Y = [30 + [31 XI + [32 X 2 + [311 xi + [322X + 2[312 X I X 2 + Wwith Ixil.s; 1 for i= I, 2, an optimum experi- ment in the sense of (V) is given by allocating 0,0960 at the origin, 0,0802 at experimental points (1,0), (-1,0), (0, I), (0, -1), and 0.1458 at experimental points (1, 1), (1, -1), ( -1, 1), ( -1, -1), respectively, However, this design is not rotatable, Instead of an optimum criterion (V), we can consider a region S which is not always equal to the original experimental region S and a criterion: (V') The supremum of the variance of estimates of the expectation E(Y) of response Y is minimal for all variables S - W ' h ' 2 2 2 xE, It lllX I +X 2 .s;C ,anoptimumalloca- tion depends on the value of c, and it is shown that a rotatable design is obtained only if c is in a certain range, There are many topics in the theory of ex- perimental design besides the ones mentioned in this article ( [4J for multiple comparison, [9J for confounding designs that are factorial experiments in which the block size is reduced and in which for each block a fractional of all the treatment combinations are tested, [!OJ for split-plot designs in which certain main effects 103 A Determinants are confounded with blocks, [15J for weighing designs), References [IJ "Experimental design" (in Japanese), Gen- dai tokeigaku daiziten (Encyclopedia of mod- ern statistics), Toyokeizai simposya, 1962, [2J R. A, Fisher, The design of experiments, Oliver & Boyd, seventh edition, 1960, [3J p, W, M. John, Statistical design and analysis of experiments, Macmillan, 1971. [4J H, Scheffe, The analysis of variance, Wiley, 1959, [5J J, Kiefer, Optimum experimental designs, 1. Roy, Statist. Soc" ser. B, 21 (1959),273-319, [6J V, V. Fedorov, Theory of optimal experi- ments, Academic Press, 1972, [7J D. Silvey, Optimal design, Chapman, 1980. [8J D, Raghavarao, Constructions and com- binatorial problems in design of experiments, Wiley, 1971. [9J 0, Kempthorne, The design and analysis of experiments, Wiley, 1952, [!OJ W, G, Cochran and G. M. Cox, Experi- mental designs, Wiley, 1950, [IIJ A, T. James, The relationship algebra of an experimental design, Ann. Math, Statist., 28 (1957),993-1002, [12J G, E. p, Box and N, R. Draper, Evolu- tionary operation, Wiley, 1968, [13J M, Kendall and A. Stuart, The advanced theory of statistics, voL 3, Design and analysis, and time-series, Griffin, third edition, 1976. [14J G. E. p, Box and K. B. Wilson, On the experimental attainment of optimum con- ditions, J, Roy, Statist. Soc., ser. B, 13 (1952), 1-45, [15J K. S, Banerjee, Weighing designs for chemistry, medicine, economics, operation research, statistics, Dekker, 1975, 103 (111.3) Determinants A. Definition The determinant of an n x n tmatrix A = (aid in a tcommutative ring R is defined to be the following element of R: L(sgn P)a IP1 a 2p , .., a npn , where p= ( 1 PI 2 '" n ) '" Pn P2 is a permutation of the numbers 1,2, '" ,n, 
103 B Determinants sgn P denotes the sign of the permutation P (that is, sgn P = 1 if P is an even permutation and sgn P = -1 if P is an odd permutation), and the summation extends over all n! per- mutations of 1,2, '" ,n. The determinant of A is denoted by all a l2 a ln a 21 a 22 a 2n ani a n2 ann It is written laikl or IAI and is also denoted by det A. Usually we suppose that R is the field R of real numbers or the field C of complex numbers, but the foIl owing theorems are also valid for cases in which R is any commutative ring, unless otherwise stated, B. Relation to Exterior Algebras Consider an texterior algebra (tGrassmann algebra) of a linear space (tfree module) of dimension n over R with a basis (el ,e 2 , ..., en)' Set e= ail e 1 + u i2 e 2 +... -r uine n , where aijE R. Then we have e'l/\ e2 /\," /\ e = laikl e l /\e2/\" ./\e n . Conversely, we can define the determinant laikl by this relation, The properties of determinants can be easily deduced from those of exterior algebras, C. Fundamental Properties of Determinants (1) The determinant of the ttransposed matrix 'A of a matrix A is equal to the determinant of A, Hence the theorems stated for rows are also valid for columns, (2) If the elements of one row (column) of a matrix are multiplied by a factor c, the deter- minant of the matrix is also multiplied by c. If the elements of one row (column) of a matrix are zero, its determinant is equal to zero, (3) If from a matrix A = (a ik ), we obtain two matrices A' and A" by replacing one row, for instance the ith row, by a;I;"" a;" and by ail + a;I'"'' a in + a;n, respectively, then IA"I = IAI +IA'I, This relation is equaIly valid for a column, (4) If we obtain AQ by a permutation Q on the rows of a matrix A, then IAQI == (sgn Q)IA I, In particular, if two rows (columns) of a ma- trix are interchanged, then the determinant changes sign, (5) The determinant of a matrix is zero if two rows (or columns) are identical. (6) The determinant of a matrix is not changed if the elements of any row (column), each multiplied by the same factor, are added 378 to the corresponding elements of another row (column), (7) Suppose that R has unity element. Let X ik (i, k = 1, "', n) be n 2 variables in R, and denote a function (having its values in R) of these variables by <p(X), X = (X ik ), Assume that cp(X) has the foIlowing properties: (i) if the elements of one row of X are multiplied by a factor ..1., the value of <p is also multiplied by},; (ii) if we obtain two matrices X' and X" by replacing one row of X, for instance the ith row, by X;l, .." x;n and by XiI + X;I' "', X in + x;n, respec- tively, then <p(X") = <p(X) + <p(X'); and (iii) if two rows of X are equal, <p(X)=O, Then <p(X) =clXI for some constant c (in R), (8) Suppose now that R is a field K. Assume that a function <p(X) (in K) has the foIlowing properties: (i) if the elements of one row in X are multiplied by}" the value of <p is also multiplied by}.; and (ii) the value of cp is not changed if the elements of any row are added to the corresponding elements of another row, Then <p(X)=cIXI for some constant c (in K), D. The Laplace Expansion Theorem Let A = (a ik ) be an n x n matrix, Take r-tuples (i" .." i,) and (k l , "', k,), where ia and kb be- long to {I, ''', n} and i l <... < i" k l <.., < k" Let (i'+J' .." in) and (k'+I' "', k n ) be (n - r)- tuples such that ir+1 < ". < in' k,+J < '" < k n and {it, ...,ini r + b ...,in}={k h ...,k p k r + 1 , ...,k n } ={1, .."n}, Let a U1 . _ ,i,)(k 1 . ,k,) be the deter- minant of an r x r matrix whose (p, q)- component is the (ip, kq)-component of A for each p and q, We caIl this determinant a minor of degree r of the matrix A, (The corresponding submatrix of A is sometimes also called a minor of A.) In particular, if (iI' .." i,) = (k l , .." k,), then it is caIled a princi- pal minor, Furthermore, we define the cofactor of the minor a U1 . ,i,)(k 1 , .,k r ) of A to be Qu]. .ir)(k j , ,kr)=(-1)A+/.!u(, r + 1 .. .In)(k r + 1 , .k n ), where A=il +...+i, and J1=k l +.., +k" In the particular case r = 1, the cofactor of a ik is iiik =( _1)i+k ik> where ik is the determinant of the (n 1) x (n-1) matrix obtained from A by eliminating its ith row and kth column. For simplicity, we abbreviate (iI' "" i,), (k 1 , "', k,), and UI'''' ,j,) as (i), (k), and U), respectively, Then we have _ { IAI La(i)(j)a(k)(j)= 0 (j) if (i) = (k), if (i) # (k), if (i) = (k), if(i)#(k), _ { ,AI L a(j)(k)a(j)(i) = 0 (i) where L(j) means that the sum is taken Over aU combinations U). This is called the Laplace expansion theorem, If a matrix A has the form 
379 A = (  ) or A = (  ). and Band C are square matrices, then by this theorem we have IAI=IBIICI, If we number the combinations (I) = (I" ... , i,) and (k) = (k I' "" k,) appropriately (for instance, in lexi- cographical order) and regard the numbers assigned to them as row numbers and column numbers, respectively, to form a matrix (a(i)(k»)' then the Laplace theorem can be expressed as (a(i)(k»)(a(k)(i») = (a(k)(i»)(a(i)(k») [ ] In the particular case r= 1, we have n _ { IAI Laija kj = 0 JI n _ { IAI .L ajka ji = 0 Fl ifi=k, ifi#k, if i== k, ifi#k, E. Product of Determinants Let A = (a ik ) and B=(b ik ) be two n x n matrices, For the product AB=C=(cid, where C ik = Ljl aijb jk (i, k = 1, "', n), we have IABI = IAIIBI, The tinverse matrix A -I exists for an n x n matrix A =(aid if and only if IAI ,to, and then A -I = (bid with elements bik=ak;/IAI, Moreover, we have IA -11=IAI- l , (In the case where the elements a ik are in the commutative ring R with unity element, A -I exists if and only if I A I is a tregular element of R.) F. Theorems on Determinants (1) Let a ik be the cofactor of a ik in the deter- minant of an n x n matrix A = (a ik ), Then the determinant liiiki is equal to IA In-I, In general, la(i1' ,i,)(k, , .k,)I=IAIG=D, la(i" ,i,)(k 1 , 'k)=IArI), (2) The determinant of a submatrix of the matrix (aid, composed of the i l th, "" i,th rows and k l th, "" k,th columns of (aid is equal to IAlr- 1 a(i 1 . .ir}(k 1 , ,k r )' (3) Let  ( i" "" i, ) be the determinant k" .."k, of the (n - r) x (n - r) matrix obtained from an n x n matrix A by eliminating its i 1 th, '" , i,th rows and k l th,.,., k,th columns, Then IAIG )=G)C)-G)G} i<j, k</, 103G Determinants (4) Sylvester's theorem, Let b ik (i, k = 1, ". , n-r) denote the minor a(1, ",r+i)(I, ""H) of an n x n matrix A = (ajJ, Then I a1l Ibikl=IAI art I n-'-l ai, a" (5) Let A be an n x m matrix and B an m x n matrix. Then A B is an n x n matrix, If n > m, then IABI =0, If n .s;m, let(i)=(ll'"'' in) (II <... < in) be a combination of I, 2, .., , m, taken n at a time, Let A(i) be the n x n matrix composed of the i l th, '''' inth columns of A, and B(i) the n x n matrix composed of the i I th, '" , inth rows of B, Then IABI=L(i)IA(i)IIB(i)I, where the summation extends over all possible combi- nations (i). (6) Determinant of a tKronecker product. If A is an m x m matrix and B is an n x n matrix, then IA @ BI =IAlnIBlm, (7) Let H be an n x n tHermitian matrix, and let Hk denote the matrix composed of its first k rows and columns, Then H is positive definite if and only if IHkl>O for all k= 1, ""n. G. Special Determinants (1) A determinant of the form Xl X 2 X n xf X2 X2 2 n n -, XZ-' n-l Xl X n is called a Vandermonde determinant. It is equal to the tsimplest alternating function II i >k(x i -xd. (2) A cyclic determinant is one of the form Xo Xl X 2 X n - l X n - 1 Xo Xl X n -2 Xl X 2 X 3 Xo n-I - fl( + yi + Y2i + + Y(n-l)i ) - Xo S Xl S X 2 ... S X n - l , i""-O where ( is a tprimitive nth root of unity. (3) Consider the vectors lX i = (ail' a i2 , "" a in ) (i = 1,2, ... ,n), and let (ex i , IX) denote the tinner product of lX i and IX j . Then the following deter- minant is called the Gramian of these vectors: (exl,lXd (ex" 1X 2 ) (IX" IXn) (:X 2 , lX d (:x 2 , 1X 2 ) (1X2, IXn) (:Xn,:Xd (IXn' 1X 2 ) (IXn' IXn) all a l2 alII a 21 an a ZII ani a n2 amI 
103 Ref. Determinants (4) For an talternating matrix (namely, a square matrix X such that 'X = -X), we have the iden tity 0 x 12 Xl3 Xl n -X12 0 X23 x2n -X 1 3 -X 2 3 0 X3n -Xl n -X 2n -x3n 0 ={n( .."Xi}, ",)2 if n is even, if n is odd, where P n ("" Xij' ,.,) is a polynomial of vari- ables Xij' which (equipped with appropriate sign) is called the Pfaffian of these variables, References [1] I. Satake, Linear algebra, Dekker, 1975. [2] G. W, H. Kowalewski, Einfiihrung in die Determinantentheorie, de Gruyter, second revised edition, 1925 (Chelsea, 1948), [3] N, Bourbaki, Elements de mathematique, Algebrc, ch, 3, Actualites Sci. Ind" 1044, Her- mann, new edition, 1958, [4J F, R. Gantmakher, The theory of matrices, Chelsea, I, II, 1959, (Original in Russian, 1953.) [5] A, G, Kurash, Lectures on general algebra, Chelsea, 1970. (Original in Russian, 1955.) [6J R. Godement, Cours d'algebre, Hermann, 1963. [7] S, MacLane and G, Birkhoff, Algebra, Macmillan, 1967, 104 (XIII.15) Difference Equations A. General Remarks Let y be a function of a real variable x defined on an interval I and let x be a fixed quantity. When two points x and x + x are in I, we define the difference y(x) of y at x by y(x)= y(x+x) - y(x) and the difference quotient by y(x)/x; x is caned the difference of x, Without loss of generality, we can take x = 1, for otherwise there is a constant b such that x' = I for the new independent vari- able bx=x', If x== 1, the second difference 2y(X)=(Y(x») is given by 2y(X)=Y(x+ 1)-y(x) = y(x+2)-2y(x+ 1)+ y(x). Similarly, the difference of the nth order is defined by ny(x) = (H-I y(x)), and n ( 11 ) ny(x)= L (_1)n-k y(x+k), kO k 380 Conversely, y(x + n) is expressed by differences as y(x+n)= i ( n ) ky(X) kO k ( 223 Interpolation), B. Summation Given a function g(x) and x, a function y(x) that satisfies y(x)/ x = g(x) is called a sum of g(x), Summation of g(x) is to find a sum of g(x), Given a sum y(x) of g(x), an indefinite sum of g(x), written as Sg(x)x, is given by Sg(x)x = y(x) + c(x), where c(x) is an arbitrary periodic function of period x, In many cases, c(x), which corresponds to an arbitrary con- stant in an indefinite integral, is omitted, For example, a sum of g(x) = nx n - I for x = 1 is the nth-order tBernoulli polynomial Bn(x) for n#O; a sum of X-I is 1/f(x), given by 1/f(x) = dlogr(x)/dx ( 174 Gamma Function), When the series -xLoy(x+kx) or x Ll g(x - kx) converges, both can be sums of g(x), Since the requirement of conver- gence for these series was found to be too strict, the following requirement was given by N, E, Norlund instead: Let x be a real variable and g(x) be continuous for x?- b, Define ).(x) by ic(x) = xP(logx)q (p?-I, q ?-O), Then if for a positive YJ F(x,x,IJ)= Loo g(z)e-nA(Z)dz 00 -x L g(x+kx)e-n;.(X+kdX) kO is convergent for a> h. F satisfies F(x, x, I])/x==g(x)exp( -1J}'(x», Accordingly, if F(x,x,l]) approaches a limit F(x) as 1]-->0, F(x) is a solution of F(x)/x=g(x), F(x) is called the principal solution of F(x)/x = g(x), C. Difference Equations Let x = 1. An equation F(x, y(x), y(x), "" ny(x») = 0 in X and differences of an un- known function y(x) is called a difference equation. If the substitution y=(p(x) satisfies the equation for x in some interval, <p(x) is a solution of the equation, Because of the rela- tion between y(x), y(x + 1), '" , y(x + n) and the differences of y at x, we can transform the given difference equation in the form G(x, y(x), y(x + 1), "., y(x + n»=O, This form appears more often in applications and is called the standard form of a difference equation, 
381 If the equation is linear in y(x), y(x + 1), ...,y(x+n), namely, if it is given by n L p;(x)y(x+i)=q(x), i=O the difference equation is said to be linear. When q(x)=O, it is homogeneous; otherwise, it is inhomogeneous (or nonhomogeneous). D. Linear Difference Equations Assume that Po(x),..., Pn(x) are single-valued analytic functions without poles and common zeros in some domain, Consider the linear difference equation n L Pi(x)y(x+i)=O. iO If <PI (x), <P2(X), ",,<Pm(x) are solutions of (1), then a linear combination adx)<pI(x)+ a 2 (x)<p2(X) + ,., + am(x)<Pm(x) with arbitrary periodic functions a l (x), a 2 (x), ..., am(x) of period 1 is also a solution of (1), Let [31' [32' .., be singular points of PI (x), P2(X), ...,Pn(x), lXI' 1X2,'" be the zeros of Po (x), and YI' Y2, ... be the zeros of Pn(x + n). Then the set of singular points of the Ii near difference equation (1) is the set { lX i,[3i, yJ A function <Pm(x) is said to be linearly de- pendent on thefunctions <pdx), <P2(X), ..., <Pm-I (x) with respect to the difference equa- tion (1) if <Pm(x) = adx)<pd x ) + a2(x)<p2(X) + ... + am-dx)<Pm-dx), where adx), a2(x), "', am-dx) are functions of period 1, everyone of which takes a nonzero finite value at least at one point not congruent (mod Z) to any of the singular points, where Z is the additive group of integers. A set of m functions is called linearly inde- pendent if none of the functions is dependent on the other m - 1 functions. When a set of n solutions of equation (1) is linearly indepen- dent, it is a fundamental system for (1). Any solution of (1) can be expressed as a linear com bination of n solutions of a fundamental system. The determinant <PI (x) <pdx+ 1) <P2(X) <P2(X+ 1) <Pn(x) <Pn(x+ 1) <pI(x+n-1) <P2(x+n-l) <Pn(x+n-l) formed from n functions <P I (x), <P2(X),..., <Pn(x) is called Casorati's determinant and is denoted by D(<pd x ), <P2(X), "',<Pn(x»). A necessary and sufficient condition for a given set of n func- tions to be independent is that Casorati's determinant be nonzero at every point except those which are congruent to singular points of (1). Casorati's determinant is used to deter- 104 E Difference Equations mine whether a given set of solutions is fundamental. Let ljJ(x) be a solution of a nonhomogeneous linear difference equation n PAY) = L p;(x)y(x+i)=q(x). (2) i=O If <pdx), <P2(X),,,,, <Pn(x) are n linearly inde- pendent solutions of (1), then an arbitrary solution of (2) is given by y=adx)<pj (x)+ a 2 (x)<p2(x) +... + an(x)<Pn(x) + ljJ(x), (1) where a l (x),..., an(x) are arbitrary periodic functions of period 1. Then the expression for y is called a general solution of (2). If we ab- breviate Casorati's determinant of a funda- mental system of solutions <PI (x), <P2(X),..., <Pn(x) of (1) by D(x) and write J1i(X) as the quo- tient of the cofactor of <p;(x+n) of D(x+ 1) by D(x+ 1), we have n ljJ(x) = L <Pi(X)Sq(Z)J1i(Z)Z, i=l assuming that the summation S on the right- hand side is known. This is the analog of Lagrange's tmethod of variation of constants in the theory of linear ordinary differential equations. E. Linear Difference Equations with Constant Coefficients If all the coefficien ts in n L piy(x+i)=O, Po#O, Pn#O, i=O (3) are constants, n linearly independent solutions are obtained easily. Indeed, if ..1. is a root of the algebraic equation L?o p)i = 0, AXis a solu- tion of (3). This algebraic equation is called the characteristic equation of (3). If it has n distinct roots ..1.1' ..1. 2 , .... An' then A, ..1. 2 , ..., A are n linearly independent solutions. In gen- eral, if ic is an m-tuple root of the characteristic equation, then AX, xAx, ...,xm-IA x are solu- tions of (3), Accordingly, if Aj is a root of multi- plicity mj (Lj1 m j = n,j = 1,2, ..., s), then Aj, x).j, '" ,x mj - I Aj (j = 1, '" ,s) constitute a set of n linearly independent solutions, Even if all the Pi are real, the characteristic equation may have complex roots, In such a case real solutions are obtained as follows: When ..1. = J1 + iv is a root of multiplicity m, I = J1- iv is also a root of the same multiplicity. If we write p = j J12 + v 2 , tan <P = v/J1, then x x . x x . p cos <px, P Sill <px, xp cos <px, xp Sill <px, ..., x m - I pX cos ({ix, x m - I pX sin <pX are 2m indepen- dent real solutions, Nonhomogeneous equations with con- 
104 F Difference Equations stant coefficients can be generally solved by Lagrange's method with these solutions. How- ever, when the nonhomogeneous term has a special form such as n L piy(x+i)=p(x)A X , i=O where p(x) is a polynomial in x and ..1. is a root of multiplicity m of the characteristic equation, we can use the method of undetermined coeffi- cients. In this particular case, the substitution (Ao +Alx+... + AkXk)XmA x with undetermined coefficients Ao, AI, ,., , Ak gives solutions, k being the degree of p(x). F. Difference and Differential Equations The differential operator djdx acts on the family of functions {xm I m = 0, ::I: 1, ... } accord- ing to dx m jdx = mx m - I , just as the difference operator  acts on the family {x(m) = r(x + I)jr(x-m+ 1)lm=O, ::I:l,...} according to Llx(m) = mx(m-l), Hence by using the factorial series LamX(m) and its similarity to the power series LamX m , we may obtain some analogies with the theory of differential equations. For example, the tFrobenius method in the theory of tregular singular points can be applied to the system of difference equations n (z-I)_I Wdz) = L ajk(z)wj(z), j1 k=I,2,...,n. However, there are certain essential differences between functions defined as solutions of differential and difference equations. For ex- ample, Holder's theorem states that no solu- tion of the simple difference equation y(x+ 1) - y(x)=x- I satisfies any talgebraic differential equation, Consequently, the gamma function, which is related to a solution of the equation ljJ(x)=dlogr(x)jdx, cannot be a solution of any algebraic differential equation. For the numerical solution of ordinary differential equations by difference equation approxima- tion  303 Numerical Solution of Ordinary Differential Equations. G. Geometric Difference Equations For an arbitrary complex number q, an equa- tion of the form y(qx) = f(x, y(x)) is called a geometric difference equation. For example, the ordinary difference equation (1) can be trans- formed into n L Pk(z)U(zl)=B(z) k=O by the change of variable z = qX. Although it is 382 possible to transform an equation of the type (1') into that of the type (1), there are theories developed specifically for the type (1'), since the coefficients of the equation may become more complicated by such a transformation. References [1] p, M. Batchelder, An introduction to linear difference equations, Dover, 1967. [2J G. Boole, A treatise on the calculus of finite differences, Dover, 1960. [3] R. M. Cohn, Difference algebra, Inter- science, 1965, [4J T. Fort, Finite differences and difference equations in the real domain, Oxford Univ. Press, 1948. [5J C. Jordan, Calculus of finite differences, Chelsea, 1933, [6] K. S. Miller, Linear difference equations, Benjamin, 1968. [7] L. M. Milne-Thomson, The calculus of finite differences, Chelsea, 1980. [8J N. E. N6rlund, Vorlesungen tiber Differen- zenrechnung, Springer, 1924. [9] N. E, N6r1und, Lec,;ons sur les equations Iineaires aux differences finies, Gauthier- Villars, 1929. [10] C. H. Richardson, An introduction to calculus of finite differences, Van Nostrand, 1954. 105 (VII.2) Differentiable Manifolds A. General Remarks (1') The rudimentary concept of n-dimensional manifolds can already be seen in J. Lagrange's dynamics. In the middle of the 19th century n- dimensional tEuclidean space was known as a continuum of n real parameters (A. Cayley; H. Grassmann, 1844, 1861; L. Schliifli, 1852). The notion of general n-dimensional manifolds was introduced by B. Riemann as a result of his differential geometric observations (1854). He considered an n-dimensional manifold to be a set formed by a I-parameter family of (n - 1)- dimensional manifolds, just as a surface is formed by the motion of a curve. Analytical studies of topological structures of manifolds and their local properties were initiated and developed by Riemann, E. Betti, H. Poincare, and others. To avoid the difficulties and dis- advantages of analytical methods, Poincare restricted his consideration to those topo- logical spaces X that are tconnected, ttri- angulable, and such that each point of X 
383 has a neighborhood homeomorphic to an n- dimensional Euclidean space, We often refer to such spaces as Poincare manifolds; Poincare called them n-dimensional manifolds. In 1936, H. Whitney published a monumental paper [14J on differentiable manifolds in which the various fundamental concepts on differentiable manifolds were established, This and sub- sequent papers written by Whitney during nearly twenty years greatly influenced the rapid advance of the theory of differentiable manifolds since 1950. B. Topological Manifolds An n-dimensional topological manifold M is by definition a tHausdorff space in which each point p has a neighborhood U(p) homeo- morphic to an open set of R n . Let M' be a Hausdorff space in which each point p of M' has a neighborhood U(p) homeomorphic to an open set of H n , where H n is the half-space {(XI' Xl' ..., Xn)E R n I X n O}. Let 8M' denote the set consisting of points p of M' such that p corresponds to a point of H3= {(XI> ... ,xn)EHn I xn=O} c:Hn under the homeomorphism from U(p) to an open set of Hn, M' is called an n-dimensional topological manifold with boundary if 8M' # 0, and 8M' is called the boundary of M'. On the other hand, M defined as above or M' with 8M' = 0 is called an n-dimensional topological manifold without boundary. The interior of M' is the complement Me, = M' - 8M' of the boundary. The boundary of an n-dimensional topological manifold is an (n -1 )-dimensional topological manifold. A topological manifold without boundary is called closed or open according as it is compact or has no connected component which is compact. There exist connected topo- logical manifolds that are not tparacompact; among them, the I-dimensional ones are called long lines. A connected paracompact topolog- ical manifold M has a tcountable open base and is tmetrizable, C. Local Coordinates Let M be an n-dimensional topological mani- fold. A pair (U, if;) consisting of an open set U of M and a homeomorphism if; of U onto an open set of Rn is called a coordinate neighbor- hood of M. If we denote by (XI(p), ...,xn(p)) (pE U) the coordinates of the point if;(p) ofRn, then Xl, X2, ...,x n are real-valued continuous functions defined on U. We call these n func- tions the local coordinate system in the coordi- nate neighborhood (U,if;) and the n real num- bers Xl (p), ... , xn(p) the local coordinates of the point pE U (with respect to (U, if;)). lOSE Differentiable Manifolds A set S = {(U., if;')}'EA of coordinate neigh- borhoods is called an atlas of M if {U'}.EA forms an topen covering of M. D. Differentiable Manifolds Let S == {(U" if;,)} 'EA be an atlas of an n- dimensional topological manifold M. For each pair of coordinate neighborhoods (U., if;,) and (Up, if;p) in S such that U, n Up # 0, if;p 0 if;,-I is a homeomorphism of the open set if;,(U,n Up) of R n onto the open set if;p(U, n Up) of Rn. Let x=(x"... ,xn)Eif;,(U,n Up). Then we can write (if;poif;,-I)(X)==(fP(x), ...,fP"a.(x)). If the n real-valued functions f/., ... ,I;, defined in if;,(U, n Up) are oftclass C' (1.s; r.s; 00) (resp. treal analytic) for any ex, fJ in A such that U, n Up # 0, then we call S an atlas of class C' (resp. C"') of M. When an n-dimensional topo- logical manifold M has an atlas S of class C' (1.s; r.s; w), we call the pair (M, S) an n- dimensional differentiable manifold of class C' (or C'-manifold). A Coo-manifold is also called a smooth manifold, while a C"'-manifold is called a real analytic manifold. We call M the underlying topological space of (M, S), and we say that S defines a differentiable structure of class C' (or C'-structure) in M. In particular, a C"'-structure is called a real analytic structure. A C'-manifold whose under- lying topological space is compact (tparacom- pact) is called a compact (paracompact) C'- manifold, A coordinate neighborhood (U, ifI) of M is called a coordinate neighborhood of class C' of (M, S) if the union S U {(U, if;)} is also an atlas of class C' of M. In particular, each co- ordinate neighborhood of M belonging to S is of class C'. The set S of all coordinate neigh- borhoods of class C' of (M, S) is an atlas of M containing S, and we call S the maximal atlas containing S. Let Sand S' be two atlases of class C' of M, If S = S', then we say that Sand S' define the same differentiable structure of class C' on M and that the differentiable mani- folds (M, S) and (M, S') of class C' are equiva- lent. In particular, (M, S) and (M, S) are equiv- alent C'-manifolds, Let Sand S' be atlases of class C' and class CS, respectively, where 1 .s; r < s.s; w. Since s> r, we can consider S' an atlas of class C'. If Sand S' define the same C'-structure in M, then we say that the CS- structure defined by S' is subordinate to the C'-structure defined by S. If Mis paracom- pact, then there exists a C"'-structure subordi- nate to a C'-structure of M (Whitney [14J), E. Differentiable Manifolds with Boundaries Let U and U' be open sets in the half-space Hn, and let qJ: U --> U' be a continuous mapping. If 
105F Differentiable Manifolds there exist open sets Wand W' in Rn contain- ing U and U', respectively, and a mapping if;: W --> W' of class C' that extends qJ, we call qJ a mapping of U into U' of class C. Let M be a Hausdorff topological space. A structure of a C-manifold on M is defined by a set S = {( U" if;,)} ,eA, where {U,} 'EA is an open covering of M, and, for each ex, if;, is a homeomorphism of U, onto an open set of H n such that for any ex, fJEA with u,n U p #0, if;poif;,-I is a mapping of class C' from if; ,( u, n Up) onto if;p(U, n Up). Let 8M denote the set consisting of points p of M such that pE U, and if;,(p)E H(; = {(XI' "', Xn)E H n I X n = O} for some exE A. If 8M # 0, the pair (M, S) is called an n-dimen- sional differentiable manifold with boundary of class C (or C-manifold with boundary), and 8M is called the boundary of M. 8M forms an (n-l)-dimensional C-manifold. If we put U = u, n 8M and denote the restriction of if;, to U by if;, then S' = {(U, if;)}'eA is an atlas of class C of 8M. If 8M is empty, then (M, S) is a C'-manifold. In this sense a C-manifold is sometimes called a C'-manifold without boundary. F. Orientation of a Manifold Let S = {(U., if;,)},eA be an atlas of class C'in M, and for each ex let {x, ..., x:} be the local coordinate system in a coordinate neighbor- hood (U., if;,). If U, and Up intersect, then there exist n real-valued functions F i (i = 1, ''', n) defined on if;,(U,n Up) such that x1(p)= Fi(X(p), ...,x:(p)) for pE u,n Up and i= 1, ".,n. The t1acobian D,p=D(F I , ...,F n )/ D(x, ... , x:) is different from zero at each point (x, ..., x:) of if;,(U, n Up), If we can choose an atlas S of M so that, for any ex, fJ such that U, n Up is nonempty, the Jacobian D,p is always positive, then we say that the C'-manifold M is orientabIe, and we call San oriented atlas. Let S={(U"if;,)},eA and S'= {(VA'<PA)}AeA be two oriented atlases of a connected C- manifold M. If M is connected, then the sign of the Jacobian D'A(P) of the transformation of local coordinates is independent of the choice . ofexEA, AEA, and pEU,n VA' We say that S and S' define the same (opposite) orientation if D'A is always positive (negative). Hence if Mis connected, the set of all oriented atlases of class C' is composed of two subsets such that atlases belonging to one of them have the same orientation, while two atlases belonging to different ones have the opposite orientation. Each of these subsets is called an orientation of the connected C'-manifold M, When we assign to M one of two possible orientations, M is called an oriented manifold; the assigned 384 orientation is called its positive orientation and the other its negative orientation. If S = {(U.,if;,)},eA belongs to the positive orien- tation, Sand (U" if;,) are called an atlas and local coordinate system, respectively, compa- tible with the positive orientation. G. Differentiable Functions Let f be a real-valued function defined in a neighborhood of a point p of a Coo-manifold M. Let (U, if;) be a coordinate neighborhood of class Coo such that p E U. If the function f 0 if;-I is of class C (1 .s; r.s; 00) in a neighborhood of the point if;(p) in Rn, then the function f is called a function of class C at p. This defi- nition is independent of the choice of a coor- dinate neighborhood of class Coo. If we de- note the local coordinate system in (U, if;) by (x I, ... ,x n ), there exists a function f(x I, ... , x n ) of n variables defined in a neighborhood of if;(p) in R n such that f(q) = f(xl(q), ..., xn(q)) for each point q in the neighborhood of p. Here we use the same symbol f for the function f defined in a neighborhood of p in M and for the function f 0 if; -I defined in the image of the neighborhood by if; in Rn. The function f is of class C' at p if and only if f(x I, ... , x n ) is of class C' in a neighborhood of the point (Xl (p), ..., xn(p») of R n . A function of class C (or C'-function) in M is a real-valued function in M that is of class C' at every point of M. H. Tangent Vectors Let M be a Coo-manifold, and let 3'(M) be the real vector space consisting of all Coo-functions in M. (For the sake of simplicity, we denote a manifold (M, S) by M.) A tangent vector L at a point p of M is a linear mapping L: 3'(M)-->R such that L(/g) = L(/)g(p) + f(p)L(g) for any f and 9 in 3'(M). For any two tangent vectors LI' L 2 and any pair of real numbers ..1.1' ..1. 2 we define AILI +A2L2 by (AILI +A2L2)(f)== ..1.1 LI (/) + ..1. 2 L 2 (/), f E 3'(M). Thus tangent vectors at p form a real vector space Tp, which we call the tangent vector space (or simply the tangent space) of Mat the point p. The dimension of the tangent vector space Tp equals the dimension of M. The set of all tangent vectors of M forms a tvector bundle over the base space M, called the tangent vector bundle (or tangent bundle) ofM. By a tangent r-frame (r.s; n) at p we mean an ordered set of r linearly independent tangent vectors at p. The set of all tangent r-frames also forms a fiber bundle over M called the tangent r-frame bundle (or bundle of tangent r-frames) ( 147 Fiber Bundles F). 
385 I. Differentials of Functions F or a Coo -function f in M and a point p of M we can define a linear mapping dfp: Tp-->R by dfp(L) = L(/) for all LE Tp, and we calI dfp the differential of fat p. The totality of differen- tials at p of Coo-functions in M forms the tdual vector space of the tangent vector space Tp. J. Differentiable Mappings Let <p be a continuous mapping of a Coo_ manifold M into a Coo-manifold M'. We ca\1 <p a differentiable mapping of class C' (or simply a C' -mapping) (1 .s; r.s; 00 ) if the function f 0 <p is of class C' for any C' -function f on M'. If <p is a homeomorphism of M onto M' and <p and <p -I are both of class C', then we ca\1 qJ a diffeomorphism of class C'. If there exists a diffeomorphism of class Coo of a Coo-manifold M onto a Coo-manifold M', then M and M' are said to be diffeomorphic. Let M and M' be Coo-manifolds and <p be a Coo-mapping of Minto M'. For a tangent vector L of M at p, a tangent vector L' of M' at <p(p) is defined by L'(g) = L(g 0 <p), gE 3'(M'), The mapping L-->L' defines a linear mapping (d<p)p of the tangent vector space Tp of Mat p into the tangent vector space Tq>(p) of M' at <p(p). The linear mapping (d<p)p is ca\1ed the differential of the differentiable mapping <p at p. If (d<p)p is surjective, p is ca\1ed a regular point of <po A point on M which is not a regular point is ca\1ed a critical point of <po A point q on M' which is an image of a critical point is calIed a critical valne of qJ, and a point on M' which is not a critical value is ca\1ed a regular value. In Rn, the diffeomorphic image of a set of Lebesgue measure zero has Lebesgue mea- sure zero, So the set of Lebesgue measure zero is weIl defined on a (paracompact) Coo_ manifold. Then Sard's theorem states: Let <p:M-->M' be a Coo-mapping; then the set of critical values of <p has Lebesgue measure zero inM'. K. Immersions and Embeddings Let M and M' be Coo-manifolds and <p be a Coo-mapping of Minto M'. If (dqJ)p is injective at every point p of M, then <p is calIed an im- mersion of Minto M'. If <p is an immersion, then for some neighborhood Up of any point p of M the restriction qJ I Up gives rise to a homeomorphism from Up into M', If an im- mersion qJ is injective, then <p is caned an embedding (or an imbedding) of Minto M'. An alternative definition is often used, which says that <p is an embedding if, in addition to the 105L Differentiable Manifolds above conditions, <p gives a homeomorphism from M onto <p(M), where <p(M) has the rela- tive topology of M'. If the former definition is adopted as embedding, then a mapping <p satisfying the conditions of the alternative definition is sometimes referred to as a regular embedding. If M is compact, the two defini- tions coincide. In Sections Land M, embed- ding always means regular embeddings. The theory of embeddings and immersions is mainly concerned with ways to embed and immerse a given manifold M into a manifold M' of a particular type with lowest possible dimension. M' is usua\1y the Euclidean space Rn, the projective space pNR or a certain stan- dard manifold. The theory was initiated by Whitney (1936), He proved by "general posi- tion" argument that an n-dimensional C oo _ manifold M with countable basis can always be immersed in the 2n-dimensional Euclidean space and can always be embedded in the (2n + I)-dimensional Euclidean space as a closed set (Whitney's theorem). L. Submanifolds A Coo-manifold M is said to be a submanifold of a Coo-manifold M' if M is a subset of M' and the identity mapping of Minto M' is an immersion, If the identity mapping of Minto M' is an embedding, then M is called a regular submanifold of M'. A regular submanifold M of M' is ca\1ed a closed submanifold if M is a closed subset of M'. Let <p be a Coo-mapping from Minto M' and M" be a submanifold of M'. Then for each qEM" the tangent space T;' of M" at q is a linear subs pace of the tangent space T; of M' at q. Denote by nq the projection of quotient vector space onto T; IT;'. A Coo -mapping <p is cal\ed transverse to M" if for each pE <p -I (M") the composite nq>(p)odqJp: Tp-+T(p/T;p) is surjective. If <p is transverse to M" then <p-l(M") is a submanifold of M, For any Coo_ mapping qJ:M-->M' and any submanifold M" eM', we can find a Coo-mapping <p':M -->M' which is transverse to M" and arbitrarily close to <p (transversality theorem). Let M l and M 2 be submanifolds of M', Then we say M l in- tersects transversely to M 2 if the inclusion M 1 c.M' is transverse to M 2 . A Coo-mapping is ca\1ed a submersion if it has no critical point. Let <p be a submersion from Minto M'. Then for each point qEM', <p -I (q) is a regular submanifold of M, and M is covered by a mutually disjoint family of sub- manifolds: M = UqEM'<P -I (q). Let M be a submanifold of an n-dimensional Euclidean space Rm. We can identify the tan- gent vector space Tp of M at p with the geom- 
105M Differentiable Manifolds etric tangent space of M at p in the Eucli- dean space Rm. A vector in Rm that is ortho- gonal to the tangent vector space Tp of M at p is called a normal vector to M at p. The set of all vectors normal to M forms a vector bundle over M, which we call the normal vector bundle (or normal bundle) of M. If M is compact, then the totality of vectors normal to M whose length is .s; £ (where £ is a sufficiently small positive real number) forms a neighborhood N(M) of M in R m which we call a tubular neighborhood of M. Int N (M) is called an open tubular neighborhood. M. Vector Fields Let N be a subset of a Coo-manifold M. By a vector field on N we mean a mapping X that assigns to each point p of N a tangent vector X p of Mat p. We can consider X as a tcross section over N of the tangent vector bundle of M. Let X be a vector field in M, and let f be a Coo-function in M, Then we can define a func- tion Xfin M by (Xf)(p) = Xpf We call X a vector field of class C' if the function X f is of class C' for any Coo -function fin M. Let (Xl, '" ,xn) be the local coordinate system in a coordinate neighborhood (U, if;), and let (8/8x i )pf = (8f/8x i )(p) for pE U and f E 3'(M). Then 8/8x i (i = 1, "', n) are vector fields in U, and the (8/8x\ form a basis of Tp at every point pE U, A vector field X in U is written uniquely as X p = Lii(p)(8/8xi)p at each point p E U. Then  I, ... ,(n are real-valued func- tions defined in U, called the components of X with respect to the local coordinate system (Xl, ..., x n ). A vector field X in M is of class C' if and only if its components (i with respect to each coordinate system are functions of class C' (O.s; r.s; 00). Let (Xl, ... , x n ) be another local coordinate system in a neighborhood U of p, and let (I, ... , n) be the components of X with respect to (Xl, ..., x n ). Then we have i(q)= Lj(8:x i /8x j )(q) (j(q) at each point qE U. For the rest of this article we mean by a vector field in M a vector field of class Coo, and we denote by 3:(M) the set of all vector fields in M. Then 3:(M) is an 3'(M)-tmodule, where 3'(M) denotes the algebra of all Coo-functions in M. In fact, for f, g E 3'(M) and X, Y E 3:(M), we can define a vector field {X + 9 Y by (IX + gY)p= f(p)Xp+g(p) Y p ' and this defines an 3'(M)-module structure in 3:(M). In a coordinate neighborhood (U, if;), we can write X = Lii(8/axi). The right-hand side of this equation is sometimes called the symbol of the vector field X. A vector field X can also be interpreted as a linear differential operator that acts on 3'(M). Let X and Y be vector fields in M. Then 386 there exists a unique vector field Z in M such that Zf =X(Yf)- Y(Xf) for any Coo-function f in M, We denote Z by [X, Y] and call it the Poisson bracket (or simply bracket) of X and Y. If (i and 1]i denote the components of X and Y, respectively, in a coordinate neighbor- hood (U, if;), then the components (i of [X, YJ are given by (i = Ld k(81]i/8xk)- '1k(8i/8xk)}. The bracket of vector fields has the following properties: (i) [X, YJf =X(Yf)- Y(Xf), (ii) [fX,gYJ= fg[X, YJ+ f(Xg)Y-g(Yf)X, (iii) [X + Y,ZJ=[X,ZJ+[Y,ZJ, (iv) [X, Y]= - [Y, X], and (v) [[X, Y], ZJ + [[Y, ZJ, X] + [[Z, X], Y] == 0 (Jacobi identity). These iden- tities show that 3:(M) is a Lie algebra ( 248 Lie Algebras) over R. If <p is a diffeomorphism of M onto M', then for any vector field X in M we can define a vector field <p*X in M' by the condition (<p*X)p=d<pq(X q ), p=<p(q). Then <p* is an iso- morphism of the Lie algebra 3:(M) onto the Lie algebra 3:(M'), N. Vector Fields and One-Parameter Groups of Transformations A one-parameter group of transformations of M is a family <p, (t'E R) of diffeomorphisms satisfying the following two conditions: (i) the mapping ofR x Minto M defined by (t,p)--> qJ,(p) is of class Coo; and (ii) <Ps 0 <p, = <Ps+, for s,tER. Let <p, be a one-parameter group of trans- formations of M. Then we can define a vector field X by Xpf =lim,o(l(<p,(p))-f(p))/t, where p E M and f E 3'( M). The vector field X thus defined is called the infinitesimal trans- formation of <p,. We also say that <p, is gen- erated by X, and sometimes we denote <p, by the symbol exp tX. In this case, if (Xl, "', x n ) is a local coordinate system, then at each point p of the coordinate neighborhood, we have X p = Li(dx i ( <p,(p) )/dt),=o( 8/8x i )p' If M is compact, then every vector field in M is the infinitesimal transformation of a one- parameter group of transformations; that is, every vector field generates a one-parameter group of transformations. For M noncompact, this is not always true. Nevertheless, for each vector field X we have the following result concerning local properties of X: For each point p of M, there exist a neighborhood U of p, a positive real number £, and a family <p,(ltl < £) of mappings of U into M satisfying the following three conditions, (1) The mapping of( -£,£) x U into M defined by (t,q)--><p,(q) is of class Coo, and for each fixed t, qJ, is a dif- feomorphism of U onto an open set <p,(U) of M. (2) If Isl, Itl, and Is+tl are all smaller than £ and q and <p,(q) both belong to U, then 
387 <Ps(<p,(q» = <Ps+,(q). (3) X(q)f = Iimr-o(f(<p,(q)) - f(q))/t for qE U andfE3'(M). We call <p, the local one-parameter group of local transfor- mations around p generated by X. Let X and Y be vector fields in M, and let qJ, be the local one-parameter group of\ocal transformations around p generated by X. Then [X, YJ p = Iim,o(Yp - «<p,)* Y)p)/t, where (<p,)* Y is a vector field defined as folIows: Let U be a neighborhood of the point p where <p, (It I < e) is defined. Then (<p,)* Y is the vector field on <p,(U) that is the image of Y under the diffeomorphism <p" In particular, if X gener- ates a one-parameter group of transforma- tions of M, then we have [X, YJ=lim,o(Y- (<p,)* Y)jt for any vector field Y in M. O. Tensor Fields Let T,'(p) be the vector space consisting of all r-times contravariant and s-times covariant tensors over the tangent vector space Tp of a Coo-manifold M, that is, T,'(p) = ( 0 T p)@(@Tp*} where T p * denotes the dual linear space of Tp ( 256 Linear Spaces), A tensor field (more precisely, contravariant of order r and covariant of order s, or simply a tensor field of type (r, s)) on a subset N of M is a mapping K that as- signs to each point p of N an element K p of the vector space T,'(p). In particular, ifr=s=O, K is a real-valued function on N, and we call K a scalar field. If r = 1 and s = 0, K is a vector field, called a contravariant vector field. When r = 0 and s = 1, we call K a covariant vector field (or differential form of degree 1). If r # 0, s = 0 or r = 0, s # 0, we call K a contravariant tensor field of order r or a covariant tensor field of order s, respectively. A contravar- iant or covariant tensor field K is said to be symmetric (alternating) if Kp is a symmetric (alternating) tensor at every point p of M. Let (Xl, ..., x n ) be the local coordinate system in a coordinate neighborhood (U, 1/1). Then at each point p of U, the Wjaxi)p (i = 1, ..., n) form a basis of the tangent vector space Tp' the differentials (dxi)p (i = 1, ..., n) form a basis of the dual space T p *, and these bases are dual to each other. A tensor field K of type (r, s) de- fined on M is written at any point p of U in the following form: K p = L KJ: ::J,(p)(ajax i1 )p @ ... @ (ajaxi,)p @ (dxh)p @ .., @ (dxj,)p' The functions K i 1 ' i! defined in U are called h...Js the components of the tensor field K of type (r, s) with respect to the local coordinate sys- 1050 Differentiable Manifolds tem (Xl, ..., x n ) . If K J i 1 i! are the com p onents l' Js of K with respect to the local coordinate sys- tem (Xl, ... , x n ) in another coordinate neigh- borhood (U', 1/1') such that un U' # 0, then for each qE un U', the following relations hold: KJ:"l(q)= L k,Mxi1jaxk1)q. ..(axi'jaxk')q x (ax 11 jax jl )q" .(axl'jaxj')qK.'''i:'. A tensor field K in M is called a tensor field of class C' (O.s; t.s; 00) if the components are func- tions of class C' for any coordinate neighbor- hood of M, The sum K + L and the tensor product K @ L of two tensor fields K and L are defined by the rules (K +L)p=Kp+Lp and (K@ L)p = K p @ Lp, respectively. The contraction of two tensor fields is also defined by taking the contraction pointwise. Let <p be a diffeomorphism of Minto M'. Then the differential (d<p)q is an isomorphism of Tq onto Tp (P=qJ(q)) for each qEM and hence induces an isomorphism ipq of the vector space T,'(q) onto the vector space T,'(p) ( 256 Linear Spaces). For any tensor field K in M we can define a tensor field ipK in M' by (ipK)p = ipq(Kq), p=<p(q), qEM. Then ip(K +L)=ipK + ipL, ip(K @ L) == ipK @ ipL, and the mapping ip commutes with contraction. Let K be a tensor field and X be a vector field (both of class COO) in M. We define a tensor field LxK by (LxK)p=lim,o(Kp- (ip,K)p)jt, where qJ, denotes the local one- parameter group of local transformations around p generated by X. We calI LxK the Lie derivative of K with respect to the vector field X. The operator Lx:K-->LxK has the folIowing six properties: (i) Lx(K + K') = LxK+LxK'; (ii) Lx(K@K')=(LxK)@K'+ K @ LxK'; (iii) the operator Lx commutes with contraction; (iv) Lxf = Xf for a scalar field f and Lx Y = [X, YJ for a vector field Y; (v) L[x,YJ==LxLy - LyLx, that is, L[x,YJK = Lx(LyK)-Ly(LxK); and (vi) K is invariant under qJ" i.e., ip,K == K for all t, if and only if LxK=O. Let K be a covariant tensor field of order r in M. We always assume that K is of class Coo. The value K p of Kat pE M is an element of the vector space Tp* @ , .. @ T p * (r times tensor product of T p *); hence we may consider Kp an r-Iinear mapping of Tp into R ( 256 Linear Spaces). If X"..., X, are vector fields in M, we define a Coo-function K(X I ,... ,X,) by K(X 1 , ... , Xr)(P) = Kp(Xd p , ... ,(X,)p)' Then the mapping that assigns to each r-tup1e (Xl' "', X,) of vector fields the Coo-function K(Xl> ..., X,) is an r-linear mapping on the 3'(M)-modu1e X(M) consisting of all vector fields of class Coo in Minto 3'(M); that is, K(Xl'''' JXi+gY;,.., ,X,)= fK(X l , ...,X i , 
105P Differentiable Manifolds ... , X r ) + gK (XI' ... , 1';, ... , X,) (i = 1, .., ,r) for f, gE 3'(M). Conversely any r-linear map- ping of the 3'(M)-module x(M) into 3'(M) can be interpreted as a covariant tensor field of order r in M. If the tensor field K is sym- metric (alternating), the corresponding r- linear mapping K(XI'"'' X,) is symmetric (alternating) with respect to Xl> ..., X,. For the Lie derivative LxK of a covariant tensor field of order r of K, we have the foIl owing formula: (LxK)(Xl'''' ,X,)=X(K(X I ,... ,X,)) -L;=IK(XI, ..., [X, X;], ...,X,). P. Riemannian Metrics A symmetric covariant tensor field g of order 2 and of class Coo in M is caIled a pseudo- Riemannian metric if the symmetric bilinear form gp on the tangent vector space Tp is non- degenerate at each point pEM; and g is caIled a Riemannian metric if gp is positive definite for all p, If 9 is a Riemannian metric, the length IILII of a tangent vector LETp is defined by II LI12 = gp(L, L). On a paracompact Coo_ manifold there always exists a Riemannian metric. A pair consisting of a differentiable manifold and a Riemannian metric on it is called a Riemannian manifold ( 364 Riemannian Manifolds). Q. Differential Forms An alternating covariant tensor field in M of order r and of class C' (O.s; t .s; 00) is also caIled a differential form (or exterior differential form) of degree r. A differential form of degree 1 is sometimes caIled a Pfaffian form, Let w be a differential form of degree r. Since each alter- nating covariant tensor of order r at a point p is an element of A'T p *, the r-fold texterior product of T p *, the form w is a mapping that sends each point p of M to an element w p of A'T p *' We can also regard w as an alternating r-linear mapping of x(M) into 3'(M). Let (Xl, ... ,xn) be the local coordinate system in a local coordinate neighborhood (U, 1jJ). Since (dx\ (i = 1, .,., n) is a basis of T p * at each point p of U, we can express w p (pE U) uniquely in the form w p = L ail i,(p)(dx i1 )p/\'" /\(dxi,)p, i 1 < <i r where the sum extends over all ordered r- tuples (i I, ... , i,) of indices such that 1.s; i I < i 2 <... < i,.s; n. For an ordered r-tuple (i I' "', i,) of indices with repeated indices we put ail" i, = 0, and for (i I' ... , i,) with r distinct indices we put ail ,i, =(sgnO")aj, .j,' where (jl' '" ,jr) Ul <". <j,) is a permutation of (i I' ". ,i r ) and sgn 0" denotes the sign of the permutation 0": ik-->jk 388 (k = 1, "', r). Then we can write 1  . . wp=;I. L_ a i ,..i,(p)(dx'l)p/\.../\(dx")p. '1 1 , ,lr- 1 The functions ail' i, are the components of the tensor field w, and w is of class C' if these components are of class C' for any coordinate neighborhood. By the support (or carrier) of a differential form w we mean the closure of the subset of M consisting of all p such that wp#O. In the rest of this article, differential forms are always of class Coo, and we denote by :D'(M) the real vector space consisting of all differential forms of degree r and of class Coo. In particular, :D°(M) = 3'(M) and :D'(M) = {O} for r>n, n=dimM. For differential forms we have the foIlowing five important operations. (1) Exterior product. Let wand 0 be dif- ferential forms of degree rand s, respectively. The exterior product w /\ 0 of wand 0 is the differential form of degree r + s defined by (w /\ 8)p= w p /\ 8 p , pE M. Let XI'"'' X,+s be r+s vector fields in M. Then we have (w /\ 8)(X 1 , ..., Xr+s) = Lsgn(i;j)w(X i1 , ... ,X i )8(X j1 , ... ,X j ), where the summation runs over all possible partitions of (1, 2, ..., r+ s) such that i l < i 2 < ... < i, and j I <jz < ... <j" and sgn(i;j) means the sign of the permutation (1,2, ...,r+s)--> (ii, ...,i"jl' ...,js)' In particular, if WI' ...,W, are differential forms of degree 1, then we have (WI /\ ... /\ w,)(X I , ..., X,) = det(wi(X j )). (2) Exterior differentiation, Let w be a dif- ferential form of degree r, and let w = (l/r!) L a i 1 i, dx i 1 /\ ... /\ dx i, in a coordinate neigh- borhood (U, 1jJ). Then we can define a differen- tial form dw of degree r + 1 by the condition dw=(I/r!)Lda i1 i,/\dx i1 /\... /\dx i , in U. The differential form dw is caIled the exterior de- rivative (or exterior differential) of w. A dif- ferential form w satisfying the condition dw == 0 is caIled a closed differential form, and a dif- ferential form 1] that can be expressed as '1 = dw for some w is called an exact differential form. If a, bE Rand w, w' E :D' (M), then we have d(aw+bw')=adw+bdw'. Therefore the set <£:'(M) of all closed differential forms of degree r and the set (f'(M) of all exact differ- ential forms of degree r are linear subs paces of :D'(M). For the exterior derivative dw, we have the following formula: (dw)(Xl''''' X,) ,+1 _  i+1 ..... - L (-1) X i (W(X 1 ,...,X;,...,X,+d) i=l + I (-Ij'+jW([XiXD,X I , i<j ...,X i , ... ,X j , ...,Xr+d, 
389 where the variables under the sign A are to be omitted. (3) Interior product with vector field. Let w be a differential form of degree r and X a vector field. When r): I, we can define a dif- ferential form l(X)W of degree r-l by the formula (l(X)W)(X I , ""X,_I)==W(X,X" ..., X'I) for any r- 1 vector fields X" "', X'I; if r=O, we put l(X)W==O, The differential form l(X)W is called the interior product of w with X. (4) Lie derivative. The Lie derivative Lxw of a differential form of degree r with respect to a vector field X is a differential form of the same degree. For any r vector fields XI' ..., X, we have, by definition, (Lxw) (X"..., X,)== X(w(X I , ...,X,))-Li=l w(X 1 , ..., [X, X;], ... ,X,). (5) Let <p be a Coo-mapping of Minto M', and let <P: : T",p) --> * be the ttranspose (or tdual) of the linear mapping (d<p) p: Tp --> T",(p) for pEM, i.e., the mapping defined by the condition ((d<p)pL, ex)=(L, qJ;ex) for each LE Tp, exE T",p)' We denote the linear mapping of II T",p) into II T p * induced by qJ; by <p; also. Let w be a differential form of degree r in M. Then a differential form <p*w in M, the pullback by <p of w, is defined by (<p*w)p == <p;w",(p)' pE M. The operations defined previously satisfy the following six important relations: (i) d 2 = 0, that is, d(dw)=O; (ii) d(w /\ 8)=dw/\ 8+ ( -I)' w /\ d8, where w is of degree r; (iii) <p*(w /\ 8) = <p*w /\ <p*8, <p*(dw) =d(<p*w); (iv) Lx(w /\ 8)=(Lxw) /\ 8+ w /\ (Lx 8); (v) Lx= l(X)'d +d 'l(X), Lx(dw) = d(Lxw); and (vi) L[x, Y] = Lx' L y - Ly' Lx, l([X, Y]) = Lx 'l(Y)- l(Y)'Lx. R. De Rham Cohomology Let :D(M)=L=o:D'(M), where n=dimM. Then :D(M) is a tcochain complex with tcoboundary operator d. We denote by H'(:D) the r-dimensional cohomology group of this cochain complex, and we call it the r- dimensional de Rham cohomology group of the differentiable manifold M. If we denote by <£:'(M) and (f'(M) the subspaces of :D'(M) con- sisting of closed differential forms and exact differential forms, respectively, then H'(:D) = <£:'(M)j(f'(M) (O.s;r.s;n) by definition. If WE<£:i(M) and 8E<£:'(M), then w /\ OE<£:i+'(M), and if WE<£:i(M) and 8E(f'(M) (or wE(fi(M) and 8E<£:'(M»), then w /\ 8E(fiH(M). So if we put H(:D)=L=1 H'(:D)(direct sum), we can define a product in H(D) by [wJ' [8J = [w /\ 8J for each [w] E Hi(:D), [8] E H'(:D), With respect to this product, H(:D) forms an algebra over R called the de Rham cohomology ring of M. lOST Differentiable Manifolds S. Partitions of Unity Let M be a paracompact Coo-manifold, and let {V;};El be a tlocally finite open covering of M such that the closure of V; is compact for each index i. Then there exists a Coo-function /; in M for each i satisfying the following three conditions: (i) O.s;/;.s; 1, (ii) the support of /; is contained in Vi, and (iii) LiElh(X) = 1 for every XE M. The family of Coo-functions /;, iE I, is called a partition of unity of class Coo subordi- nate to the open covering {V;};El' T. Integrals of Differential Forms Integrals over an Oriented Manifold. Let M be an n-dimensional paracompact oriented COO_ manifold and w a differential form of degree n in M with compact support. We can choose a positively oriented atlas S = {(U., if;,)} 'EA such that {U,} 'EA is a locally finite covering; U, is compact for each ex. Suppose first that the support of w is contained in U, for some index ex. Then (if;,-l )*w, is a differential form of de- gree n in if;,(U,), and we can express (if;,-I)*W, in the form adxl /\ ... /\ dx n , where (Xl, ..., x n ) are the coordinates in Rn such that x = Xi 0 if;, (i = 1, ... , n) give a local coordinate system compatible with the orientation of M and a is a Coo-function with compact support. Then we define the integral of w over M by f w= f adxl...dxn, M IJ.!a(U,,) For the general case let {J.}'EA be a partition of unity of class Coo subordinate to {U'}'EA' Then the support of J.w is contained in U., and except for a finite number of the indices ex, J.w vanishes identically. Therefore we may define the integral of w over M by f w = L ( J.w, M (l JM and we can show that this definition of the integral is independent of the choice of or- iented atlas S and of a partition of unity sub- ordinate to S. Integrals over a Singular Chain. We fix rectan- gular coordinates in R'. Let do be the origin and d i be the unit point on the ith coordinate axis. Let S' denote the oriented r-simplex (do, d I' ... , d,) with vertices do, d I, ... ,d,. When we regard S' as a point set, we denote it by IS'I. An oriented singular r-simplex of class Coo in M is, by definition, a pair (S', qJ) consisting of S' and a Coo-mapping <p of an open neigh- borhood of IS'I into M, An element of the free Z-module generated by singular r-simplexes of class Coo is called an integral singular r-chain 
105U Differentiable Manifolds of class Coo in M. We define a real singular r- chain of class Coo analogously. Let w be a differential form of degree rand (S', <p) be an oriented singular r-simp1ex of class Coo in M. Then <p*w is a differential form of degree r in a neighborhood of IS'I, and we can express qJ*w in the form qJ*w=adx l /\dx 2 /\... /\dx r . We define the integral of w over (S', qJ) by ( w= ( adxl...dx', Js"qJ) J IS'I and the integral of (J) over a singular r-chain C of class Coo by f w=Lm. f w c i 1 (sr,!pi) , where C = Li mJ S', <Pi), m i E Z (or m i E R), When r=O, then w is a function in M, and SO is a point o. In this case we put Sew = Limiw(<Pi(O)). Let C,(S,Z) (C,(S,R)) be the Z-modu1e (vec- tor space over R) of integral (real) singular r-chains of class Coo in M, and let w(C) be the value of the integral of w over a chain C. Then w is a linear function in the vector space C,(S, R), and hence we can consider w a sin- gular r-cochain of class en. U. Stokes's Formulas (1) Let D be a tdomain in an n-dimensional Coo-manifold M, and let aD and D be the boundary and the closure of D, respectively. Let S = {(V., 1/1,)} 'EA be an atlas of class Coo of M, V = V, n D, 1/1 be the restriction of 1/1, to V, and T= {(V, 1/1) }'EA' If the pair(D, T) is a Coo-manifold with boundary under a suitable choice of S, then the domain D is called a domain with regular (or smooth) boundary. The boundary aD of(D, T) is then an (n-I)- dimensional closed submanifo1d of M, and if Mis orientable, aD is also orient able. Now let M be a paracompact and oriented manifold and D be a domain with regular boundary, Let C be a characteristic function of D in M, i.e" a function defined by the condition C(p) = 1 for pED and C(p) ==0 for piD. Let a be a differential form of degree n in M with com- pact support. We define the integral of a over D by Lo= L CO. Let w be a differential form of degree n - I in M with compact support. We then have Stokes's formula: f dw= f i*w, D cD where i denotes the identity mapping of the 390 submanifold aD into M with aD having the orientation induced naturally from that of M. (2) Let C be a singular r-chain of class Coo in M, and let ac be the boundary of C. Then for any differential form w of degree r-l, we have ( dw= f w. Jc ac This formula is also called Stokes's formula, V. De Rham's Theorem Let M be a connected paracompact Coo_ manifold. If we consider w as a singular co- chain, we have (dw)(C) = w(aC); by Stokes's formula, this means that the exterior differen- tial dw of w is equal to the coboundary of the singular cochain w. Let wand C be a closed differential form of degree r and a singular r- cycle of class Coo, respectively, and let [w J and [C] be the de Rham cohomology class and the singular homology class represented by wand C. Using Stokes's formula, we can define the inner product ([wJ, [CJ) by ([wJ, [CJ)= L w. Through this inner product, it follows that the de Rham cohomology group HrCD) is isomorphic to the rth singular cohomology group H'(M,R), the dual space of the rth homology group of the complex of real sin- gular chains of class Coo. Moreover, the de Rham cohomology ring H(':!) is isomorphic to the singular cohomology ring H*(M, R) (de Rham's theorem). W. Divergence of a Vector Field Let M be an n-dimensional oriented CX)_ manifold, and let S be an oriented atlas. Let (J) be a differential form of degree n, and let (Xl, ...,x n ) be the local coordinate system in a coordinate neighborhood in S. Then we can express w in the coordinate neighborhood uniquely in the form w = adx l /\ ... /\ dxn, If the function a is positive for any coordinate neigh- borhood in S, we call w a volume element of M. In a paracompact oriented manifold, there always exists a volume element. (We remark that an n-dimensional differentiable manifold M is orientab1e if and only if there exists an everywhere non vanishing differential form of degree n.) Let f be a Coo-function in M with compact support. Then f' UJ is a differential form of degree n with compact support, and so the integral Lr'w 
391 is defined. We calI this integral the integral of the function f with respect to the volume ele- ment OJ. Let 9 be a Riemannian metric in M and gij the components of 9 with respect to the local coordinate system (x\ ..., x n ) as before. Then we can define a volume element OJ in M by putting OJ=JGdx l /\ ." /\ dxn, G =det(gij) in each coordinate neighborhood. The volume element thus defined is calIed the volume ele- ment associated with the Riemannian metric g, Let OJ be a volume element and X a vector field in M. Then the Lie derivative LxOJ is also a differential form of degree n, and we can express LxOJ in the form LxOJ = fx' OJ, where fx is a scalar field, i.e., a function in M. We call fx the divergence of the vector field X with re- spect to the volume element OJ and denote it by div X. If OJ is associated with a Riemannian metric, then div X is calIed the divergence of X with respect to the Riemannian metric. If M is compact, we have fM divX,OJ=O for any vector field X. This result is known as Green's theorem. X. Jets Let M and N be Coo-manifolds. We define an equivalence relation in the set of alI Coo_ mappings of Minto N. Let f and 9 be such mappings and p be a point of M. Choosing local coordinate systems, we write f(p) = (/1 (x), ... In(x)), g(p) = (g I (x), ..., gn(x)), X = (XI"" ,x m ). We say f and 9 are equivalent at p if f(p), g(p), and the values at p of alI the partial derivatives of fi and gi up to the order r (r an integer, rO) are equal (i= 1, ...,n). An equivalence class with respect to this equiva- lence relation is calIed a jet of order r at p. A jet of order r at p represented by a function f is denoted by j;f, and the points p and f(p) are calIed the source and the target of the jet j;f, respectively. We denote by J;(M, N) the set of alI jets of order r with source at p and target in N and let Jr(M,N)= UPEMJ;(M,N). For any jet j, let ns(j) and n,(j) denote the source and the target ofj, respectively. We can introduce the structure of a Coo-manifold in J'(M, N) in a natural way such that the projections ns:Jr(M, N)-->M and n,:J'(M, N)-->N are both of class Coo and Jr(M, N) is a fiber bundle over M(N) with projection ns(n,). As examples, we have: (1) JJ (R, N) is identified with the tangent vector bundle of N, (2) The set J'() of alI jets of order r deter- lOSY Differentiable Manifolds mined by the sections of a vector bundle  of class Coo is also a vector bundle of class Coo. Let f:M -->N and g: N -->L be Coo-mappings. We define a composition of jets by ff(p)g' j;f = j;(g 0 f), A jet j;f E J'(M, N) is inver- tible if there exists a mapping g: N --> M such thatjf(p)g'j;f=j;(I M ), where 1 M denotes the identity mapping of M onto itself. We de- note by l'(M, N) the set of alI invertible jets in J'(M, N), and put I;(M, N) = l'(M, N) n J;(M,N). (3) Let Gr(n) be the set of alI invertible jets in l'(Rn, Rn) whose source and target are the origin of Rn, Then with respect to the compo- sition of jets, G'(n) forms a Lie group that is an textension ofGI(n)==GL (n,R) by a simply connected nilpotent Lie group. The projec- tion G'(n)-->GI(n) is a special example of the natural projection Jr(M, N) -->J'(M, N) (r  s), which is defined in general. (4) We can identify IJ(Rm,M) (m=dimM) with the tangent m-frame bundle over M. More generalIy, I(j(Rm,M) is a G'(m)-bundle over M, Y. Pseudogroup Structure Let X be a topological space, and let r be a set consisting of homeomorphisms f: U f --> Vf' where U f , V f are open subsets of X. We calI r a pseudogroup of topological transformations if r satisfies the folIowing four conditions: (i) r contains the identity mapping of X onto X; (ii) if fEr, then the restriction of f onto any open subset U of U f is also contained in r; (iii) if f and 9 are in rand V f c U g , then go f is con- tained in r; and (iv) if fEr, then f -1: V f --> U f is also in r. FolIowing the definition of differentiable manifolds we define a pseudogroup structure of M (or, more precisely, a r-structure of M) as a set A of bijections, with each member ex defined on a subset U, of M onto an open set V, of X, satisfying the folIowing three conditions: (i) U, U, = M; (ii) if ex, [3E A, then ex 0 [3-1 E r, where the domain of definition of ex 0 [3-1 is [3(U, n Up); and (iii) A is the maximal set of bijections that satisfies conditions (i) and (ii). We introduce in M the weakest (coarsest) topology such that every bijection ex is a homeomorphism. If r' is a pseudogroup of X containing r and A' is a r'-structure of M such that A c A', then we say that A' is sub- ordinate to A.1f X =Rn (or H n , a half-space of Rn), r is the totality of diffeomorphisms of class C' of open sets of X onto open sets of X, and M is a space with Hausdorff topol- ogy, then the r -structure is the C' -structure with or without boundary which we have already defined. We give three examples of r- 
105Z Differentiable Manifolds structures subordinate to P, where P is the totality of local transformations of class C' (r 1) in Rn. (1) When n is even, we identify R n with e /2 and denote the totality of holomorphic trans- formations of connected open domains by r. The r -structure in this case is called a complex structure, (2) When n is odd, we define r as the totality of transformations of connected open domains in R n that leave invariant a Pfaffian form LI Xi dxm+i + dx 2m +1 (n = 2m + 1) up to scalar factors, The r -structure in this case is called a contact structure. (3) We consider Rn=RP x Rn-p and define r as the set of all diffeomorphisms U --> V (where U, V are open in Rn) such that each set of the form un (RP x {y}) is mapped onto a set of the form Vn(RP x {y'}), The r -structure in this case is called a foliated structure. The problem of determining whether there exists a r-structure for given rand M involves widely ranging problems of topology and analysis. The classification of r with reaSon- able conditions is another important open problem. Haefliger has constructed the classifying space Br for r-structures. Z. Infinite-Dimensional Manifolds Let Band B' be Banach spaces and <p be a mapping from an open set of B to B', Then, using the notion of Frechet derivatives, we can define <p to be smooth, For a smooth mapping cp, the Jacobian J(<p) at each point x is a linear mapping from B to B', for which the inverse function theorem holds true as a straightfor- ward extension of the corresponding one in the finite-dimensional case. Similar extension also holds for the existence and the unique- ness theorems of solution of ordinary differen- tial equations with value in B, These facts permit us to generalize the notion of differ- entiable manifolds to infinite-dimensional ones. Actually, infinite-dimensional manifolds can be defined in a way similar to the finite- dimensional case, taking open sets of a certain Banach space as local coordinate neighbor- hoods. Such a manifold is called a Banach manifold. Various formal definitions of dif- ferentiable manifolds can also be stated for Banach manifolds in extended form. However, while differentiable manifolds are locally com- pact and admit a partition of unity by smooth functions subordinate to a locally finite open covering, Banach manifolds generally lack these properties, Actually, local compactness gives a criterion for whether a manifold is finite-dimensional or not. Banach manifolds 392 often provide a basic functional-analytic view to nonlinear analysis and global analysis. The most important category of Banach manifolds is provided by the Hilbert manifold, that is, a Banach manifold whose local coordi- nates are modeled on a Hilbert space. For a Hilbert manifold, a partition of unity subordi- nate to a locally finite open covering can be taken from smooth functions. In the follow- ing, we refer to a separable Hilbert manifold simply as a Hilbert manifold. An infinite- dimensional Hilbert manifold M can be smoothly embedded as an open set of a Hil- bert space and thus covered by a single coordi- nate neighborhood. Hence, in particular, the tangent bundle of M is trivial. Historically, this fact was the first instance showing that the distinguishing properties are shared by infinite-dimensional manifolds; this was first recognized as a conseq uence of the theorem stating that the unitary group of an infinite- dimensional Hilbert space is contractible. Two infinite-dimensional Hilbert manifolds are diffeomorphic if and only if they are homo- topically equivalent. A typical example of a Hilbert manifold is provided by the space of L 2 -loops on a compact Riemannian manifold. Morse theory can be extended in a suitable way to a Riemannian Hilbert manifold under the Palais-Smale condition, which makes it possible for the integral curve of gradfto tend to a critical point, where f is a Morse function. AA. Gel'fand-Fuks Cohomology The space 3:(M) consisting of all the smooth vector fields on a smooth manifold M has the structure of a Lie algebra under the bracket operation [X, YJ = X Y - YX, where the vector fields X and Yare regarded as derivations on the algebra COO(M) of smooth functions on M. 3:(M) is a topological Lie algebra when en- dowed with the topology defined by uniform convergence of the components of vector fields and all their partial derivatives on each com- pact set of M. When 3:(M) acts continuously on a topological vector space V, the continu- ous cohomology H*(3:(M), V) of 3:(M) with coefficients in V is the cohomology of the cochain complex EB {CP = CP(3:(M), V), d}. Here CO = V, CV (p  1) is the space of all the alternating p-linear continuous mapping <p of 3:(M) x". x 3:(M)(p-times) into V, and d: CV -->CV+ I is defined for <pECV and X i E3:(M) by d<p(X I ) = Xl <p (p=O) and d<p(X I , ..., Xp+d= Li<j( -1)i+j<p([Xi' XjJ; Xl'"'' Xi'"'' Xj"'" X p + l )+ L;( -li+ l Xi cp (Xl , ... ,Xi'"'' Xp+d (p 1), When V is a topological algebra and the elements of 3:(M) act on Vas derivations, the exterior multiplication of cochains induces 
393 a graded algebra structure in H*(I(M), V). When V = R is the trivial I(M)-module, then H*(I(M»=H*(I(M),R) is called the Gel'fand-Fuks cohomology of M, Gel'fand and Fuks proved that, for any compact oriented manifold M, we have dim HP(I(M)) < +00 for all p and HP(I(M»=O for l.s;p.s;n (n= dimM). For example, if M is the circle SI, then the algebra H*(I(SI» is generated by two generators exEH 2 , fJEH 3 , which are expli- citly described as cochains in the following way: exV :t ' 9 :J = fJ: ::Idt, ( d d d ) f f f' f" fJ f dt' 9 dt ' h dt = SI h' h: g" dt. h" Localization of the concept of Gel'fand- Fuks cohomology naturally yields the co- homology of formal vector fields. Here a formal vector field means the expression Lfp(x I ,..., x n )8/8x p ,/p being formal power series in XI' ..., Xn' The set of all the formal vector fields forms a Lie algebra an and the continuous cohomology of an with respect to the Krull topology is denoted by H*(a n ). Let Bu be the universal tclassifying space of the unitary group U(n), let (Bub be its 2n- skeleton, and let P 2n be the canonical principal U(n)-bundle restricted to (Bub. Then there is an algebra isomorphism H*(a n )  H*(P 2n ; R). This cohomology and its variants play im- portant roles in the theory of foliations. An important subcomplex EB {q, d} of EB {CP, d}, the diagonal complex, is defined as C= {<pECPI<p(X I , ...,Xp)=O ifsupp XI n ... nsupp X p = 0}, Here, supp Xi deno tes the support of Xi' that is, {xIXi(x)#O}. Let PM be the principal U(n)-bundle associated with the complexified tangent bundle of M. U(n) acts freely on the product PM X P 2n and the quotient space EM=P M X P 2n /U(n) is a fiber bundle over M with fiber P 2n . Then, if M is a compact oriented manifold, the co- homology HI(I(M)) of the diagonal com- plex is completely determined by the isomor- phisms H[(I(M))  HP+n(E M ; R) for all p. In particular, if all the pontryagin classes of M vanish, then H[(I(M»)=Li+j:p+nHi(M;R)@ Hj(a n ). The Gel'fand-Fuks cohomology has a topo- logical interpretation: H*(I(M))H*(r(EM)' R) as graded algebras, where r(E M ) denotes the space of all the continuous cross-sections of EM-->M with the compact open topology. Moreover the differential graded algebra C*(I(M») has a model in the sense of Sullivan constructed purely algebraically from a model of the de Rham algebra of M and the Pon- 105 Ref. Differentiable Manifolds tryagin classes of M. This model shows in particular that H*(I(M)) is not necessarily finitely generated as a graded algebra, The cohomology theory of I(M) in the case where the representation is nontrivial has also been investigated. The natural representa- tion on COO(M) is a typical example. There is also a topological interpretation: H*(I(M), COO(M))H*(YM,R), where Y M is the fiber product of the evaluation mapping M x r(E M) -->EM and the inclusion PMc.E M correspond- ing to a fiber inclusion U(n)c.P2n. References [1] L. Auslander, Differential geometry, Har- per, 1967. [2] R. L. Bishop and S. I. Goldberg, Tensor analysis on manifolds, Macmillan, 1968. [3] N. Bourbaki, Varietes differentielles et analytiques, Hermann, 1967. [4J E. Cartan, Les systemes differentiels ex- terieurs et leurs applications geometriques, Actualites Sci. Ind., Hermann, 1945, [5] H. Cart an, Formes differentielles, Her- mann, 1967. [6] C. Chevalley, Theory of Lie groups I, Princeton Univ. Press, 1946. [7] G. de Rham, Varietes differentiables, Ac- tualites Sci. Ind., Hermann, 1955. [8] S. Kobayashi and K. Nomizu, Founda- tions of differential geometry, Interscience, I, 1963; II, 1969, [9] S. Lang, Introduction to differentiable manifolds, Interscience, 1962. [lOJ Y. Matsushima, Differentiable manifolds, Dekker, 1972. [11] J. R. Munkres, Elementary differential topology, Ann. Math, Studies 54, Princeton Univ. Press, 1963, [12] K. Nomizu, Lie groups and differential geometry, Publ. Math, Soc. Japan, 1956. [13] S. Sternberg, Lectures on differential geometry, Prentice-Hall, 1964, [14J H. Whitney, Differentiable manifolds, Ann. Math., (2) 37 (1936), 645-680. [15J H. Whitney, Geometric integration theory, Princeton Univ. Press, 1957. [16J J. Eells, A setting for global analysis, Bull. Amer. Math. Soc., 72 (1966),751-807, [17J D. Burghelea and N. Kuiper, Hilbert manifolds, Ann. Math., (2) 90 (1969),335-352. [18] N. Kuiper, The homotopy type of the unitary group of Hilbert space, Topology, 3 (1965), 19-30. [19] R. Pa1ais and S, Smale, A generalized Morse theory, Bull. Amer. Math. Soc., 70 (1964), 156-172. [20J I. M. Gel'fand, The cohomology of infinite-dimensional Lie algebras; some ques- 
106 A Differential Calculus tions of integral geometry, Actes Congr. Int. Math., Nice, 1970, Gauthier-Villars, vol. 1, 95 111. [21J I. M. Gel'fand and D, B. Fuks, Coho- mologies of the Lie algebra of tangent vector fields of a smooth manifold I, II, Functional Anal. Appl., 3 (1969), 194-210; 4 (1970), 110- 116. (Original in Russian, 1969, 1970.) [22J I. M. Gel'fand and D. B. Fuks, The coho- mology of the Lie algebra of formal vector fields, Math. USSR-Izv. (1970), 322-337. (Original in Russian, 1970.) [23J V. W. Guillemin, Cohomology of vector fields on a manifold, Advances in Math., 10 (1973), 192-220. [24J M, V. Losik, On the cohomologies of infinite-dimensional Lie algebras of vector fields, Functional Anal. Appl., 4 (1970), 127- 135. (Original in Russian, 1970.) [25J R, Bott and G. Segal, The cohomology of the vector fields on a manifold, Topology, 16 (1977), 285298. [26J A. Haefliger, Sur la cohomologie de l'a1- gebre de Lie des champs de vecteurs, Ann. Sci. Ecole Norm. Sup., 9 (1976),503-532. 106 (X.6) Differential Calculus A. First-Order Derivatives Let y = f(x) be a real-valued function of x defined on an interval I of the rea11ine R. If for a fixed Xo E I, the limit I . f(xo + h) - f(x o ) 1m hO h xo+hEI exists and is finite, then f is called differenti- able at the point xo, and the limit is the deriva- tive (differential coefficient or differential quotient) of f at the point Xo. Iff is differenti- able at every point of a set A c: I, then f is said to be differentiable on A. The function that assigns the derivative of f at x to XE A is called the derivative (or derived function) of f(x), which is denoted by dyjdx, y', y, df(x)jdx, (djdx)f(x),f'(x), or DJ(x). The process of determining I' is known as the differentia- tion of f The derivative of f at the point Xo is written f'(x o ), (dfjdx)(x o ), DJ(xo), [dyjdxJx=x o ' etc. We say that f is right (left) differentiable or differentiable on the right (left) if the limit on the right, limh + o (/(xo + h) - f(xo) )jh (the limit on the left, Iimh+o(/(xo-h)- f(xo))jh), exists and is finite. This limit is called the right (left) derivative or derivative on the right (left) and is 394 denoted by Dx+ f(x o ) orf(xo)(D; f(x o ) or f'- (xo)). For instance, if f is defined on 1= [a, b), then DJ(a) is identical to Dx+ {(a). B. Differentials In the definitions given above, neither dx nor dy in dyjdx has a meaning by itself. In the following, however, we give a definition of dx and dy, using the concept of increment, so that dy= f'(x)dx. Let y denote the increment f(x + x) - f(x) of f corresponding to the in- crement x of x. Suppose that f(x) is differen- tiable at x. We set yjx= f'(x)+F.. Then we have Iim,\x_ o F.=O. This can be written utiliz- ing tLandau's notation as y= f'(x)x+ o(lxl) (x-->O); in other words, y is the sum of two terms, of which the first, f'(x)x, is proportional to x and the second is an tin- finitesima1 of an order higher than x. Here the principal part f'(x)x of y is called the differential of y = f(x) and is denoted by dy. The differential dy thus defined is a function of two independent variables x and x. In partic- ular, if f(x) = x, from the definition we get dx = 1 . x = x. Hence, in general, we have dy = f'(x)dx and f'(x) = dyjdx. With respect to the rectangular coordi- nates (x, y), the straight line with slope f'(xo) through a point (xoJ(xo)) on the graph of y = f(x) is the ttangent line of the graph at the point (xo,/(x o )). A function is continuous at a point where the function is differentiable, but the converse of this proposition does not hold, In fact, Weierstrass showed that the function defined by the infinite series Lo an cos bnnx, where 0 < a < 1 and b is an odd integer with ab > (3j2)n + 1, is continuous everywhere and nowhere differentiable on (-00,00) [3]. C. Differentiation For two differentiable functions f and 9 de- fined on the interval I, the following formulas hold: (exf + fJg)' = exf' + fJg', where ex and fJ are constants; (/g)' = f'g+ fg'; and (ffg)' =(/'g- fg')jg2 (at every point where 9 #0). Let y = f(x) be a function of x defined on the interval (a, b) and x = <p(t) a function of t defined on (ex, fJ). If <p(t) E(a, b) whenever t E(ex, fJ), then the composite function y= F(t) = f(cp(t)) = (/0 cp)(t) is well defined, Assume further that f and <p are differentiable on (a, b) and (ex, fJ), respectively. Then the composite function F(t)=(/o<p)(t) is differentiable on (ex, fJ), and we have the chain rule, F'(t) = f'(x) <p'(t) (x == <p(t)), or dyjdt = (dyjdx)(dxjdt), Assume that a function y = f(x) is tstrictly increasing or decreasing and differentiable at Xo, If furthermore we have 
395 f'(x) #0, then the inverse function x = f- I (y) is also differentiable at Yo (= f(x o )) and satisfies (dxjdY)yyo(dyjdx)x=xo = I. However, if f'(x o ) = 0, then even though f-I(y) is not differenti- able at Yo, lim,\yo(f-I (Yo + y) - r l (Yo))jy exists and is +00 or -00. D. Higher-Order Derivatives If the derivative f'(x) of a function y = f(x) is again differentiable on I, then (f'(x))' = f"(x) is well defined as a function of x on I. In general, if j<n-O(x) is differentiable on I, then {(x) is called n-times differentiable on I, and the nth derivative (or nth derived function) j<n)(x) of f(x) is defined by j<n)(x) = (.f(n-1)(x))' and is also denoted by d n yjdx" or D(n) y. The nth derivative for n  2 is called a higher-order derivative, Concerning the nth derivative of the product of two functions, Leibniz's formula holds: (/g)(n) = j<")g+G)j<n-l)g' +... +G)j<nk)g(k)+... + fg(n). Analogous to dy= y'x, which is a function of x and x, we can define d 2 y in the notation d 2 yjdx 2 by d2y=d(dy)=d(y'x)=(y'x)'x = Y"X2. Since x=dx, it follows from the above that d 2 y= y" dx 2 , Similarly, d n y== y(n) dx n and is called the nth differential (or differential of nth order) of.f(x). E. The Mean Value Theorem Let f(x) be a continuous function defined on [a, b J, and suppose that for every point Xo on (a, b) there exists a limit limho(/(xo + h)- f(xo))jh, which may be infinite. (These condi- tions are satisfied if f(x) is differentiable on [a,b].) Then there exists a point  such that f(b)-f(a) =f'(), a<<b. b-a This proposition is called the mean value theorem. A special case of the theorem under the further condition that f(a) == {(b) is called Rolle's theorem, If we put b - a = h, (= a + 8h, then the conclusion of the theorem may be written asf(a+h)=f(a)+h' f'(a+8h) (0< 0<1). This theorem implies the following: Let f(x) be a function as in the hypothesis of the mean value theorem, and assume further that A .s;f'(x).s; B holds for all x with a < x < b. Then A.s; (/(b) - f(a))j(b - a).s; B. (French mathematicians sometimes call this the "theo- 106F Differential Calculus reme des accroissements finis.") Using the mean value theorem, the following theorem can be proved: If f(x) is continuous on [a, bJ and f'(x) exists and is positive on (a, b), then f(b) > f(a), Accordingly, if f'(x) > 0 at every point x of an interval I, then f(x) is tstrictly increasing on that interval. (If f'(x) < 0 on I, then f is strictly decreasing,) The converse of the previous statement does not always hold (f'(x)=x 3 is a counterexample, since f'(O)=O). Furthermore, from the mean value theorem it follows that if f'(x) = 0 everywhere in an interval, then f(x) is constant on that inter- val. Consequently, two functions with the same derivative on an interval differ only by a constant. Suppose that f(x) is n-times differentiable on an open interval I, For a fixed aEI and an arbitrary xEI, we put f'(a) f(x)=f(a)+-(x-a)+ ... I! f (n-I) ( a ) + ( x_a ) n-l R. (n-l)! + n Then Rn=j<n)@(x-a)"jn! for some  be- tween a and x. This is called Taylor's formula, where Rn is the remainder of the nth order given by Lagrange. We also have several other forms for Rn ( Appendix A, Table 9). If j<n)(x) is continuous at x=a, then -->a as x-->a, and accordingly, j<n)()--> j<n)(a). Hence f(x) = LkO(j<k)(a)jk!)(x- a)k + o«x -a)"). If f(n)(x) is continuous at x=a, then, by Tay- lor's formula, the value of the polynomial Lk=O(/(k)(a)jk!)(x-a)k can be considered an approximate value of {(x) for x near a, This approximation is called the nth approximation of f(x), and its error is given by IRn+ll. By applying this formula, it is sometimes possible to calculate a limit such as A = limxaf(x)jg(x), where f(x)-->O and g(x)-->O as x-->a. For in- stance, if f'(x) and g'(x) are both continuous at x=a and g'(a) #0, then by taking the first approximations of f(x) and g(x) it is easily seen that A = f'(a)jg'(a), A limit of this type is often called a limit of an indeterminate form OjO. Similarly, we can calculate limits of such indeterminate forms as O' 00 or 000 (for limits of indeterminate forms  [6J). F. Partial Derivatives Let w = f(x, y,..., z) be a real-valued function of n independent real variables x, y, ... , z defined on a domain G contained in n-dimensional Euclidean space R n . We obtain a function of a single variable from f by keeping n - 1 vari- ables (say, (x, y, ..., z), i.e" all except y) fixed. If 
106G Differential Calculus such a function <p(y) =f(x o , y, "', zo) is dif- ferentiable, that is, if <p'(Yo) = lim <p(Yo+Y)-<P(Yo) 8Y0  Y = lim .f(x o , Yo + y, ..., zo) - f(x o , Yo, ..., zo) 8Y0 y exists and is finite, then [ is called partially differentiable with respect to y at (xo, Yo, '" , zo), and the derivative is called the partial derivative (or partial differential coefficient) of [(x,y, ...,z) with respect to y at (xo,Yo, ... ,zo)' It is denoted by [ow/iJyJx=x o , ',ZZo' (iJ/iJy)f(x o , Yo, ..., zo),fy(xo, Yo, ..., zo), or Dyf(x o , Yo, "., zo), etc. We usually assume that the point (x, y, ..., z) where partial derivatives are considered is an tinterior point of the tdomain of the function. Since in a space of dimension higher than I, the tboundary of a domain may be complicated, partial deriva- tives at boundary points are usually not con- sidered. If a function [ possesses a partial de- rivative with respect to x at every point of an open set G, then fx is a function on G and is called a partial derivative of f with respect to x, The process of determining partial deriva- tives of f is called the partial differentiation of f G. Total Differential Let w = f (x, y, ... , z) be a function defined on a domain G, and let P=(x,y, ,."z) be an interior point of the domain G of a function w= f(x,y, ...,z), Put w= j(xo+x,yo+ y, ..., Zo + z) - [(x o , Yo, ..., zo). If there exist constants ex, [3, ..., y such that w = exx + [3y+... +yz+ o(p ) (p-->O), where p = J X2 + y2 + ,., +Z2, then f is called totally differentiable (or differentiable in the sense of Stolz) at P. In this case, f is partially differentiable at P with respect to each of the variables x, y, "', z, and ex =.fAxo, Yo, ..., zo), [3 = J;,(xo, Yo, ..., zo), "', y = fz(x o , Yo, ..., zo). The principal part of w as p-->O is exx+[3y+... + yz, which is called the total differential of w at P. If f is totally differentiable at every point of G, then [is said to be totally differentiable on G. The total differential of w is denoted by dw. Since the total differentials of x, y, ,,' , z are dx =x, dy=y,..., dz=z, respectively, we can write dw= [xdx+ fydy+... + fzdz; and dw is a function of independent variables x, y, .., , z, dx, dy, ... , dz. The total differentiability of f implies the continuity of f, whereas the partial differentiability of f with respect to each variable does not imply that f is continu- ous. (Example: Define f(x, y) = xy/(x 2 + l) for (x, y) #(0, 0), and [(0,0) = 0; then the func- 396 tion f is not continuous at (0,0), even though both .fx and fy exist at (0,0).) The function f is totally differentiable on G if all fx, fy, ... ,.fz exist and are continuous on G, or, more weak- ly, if all Ix, f y , .." [z exist and, with possibly one exception, are continuous. Suppose that w = f(x, y) is totally differentiable at (x, y), and let x=pcos8, y=psin8, As p-->O, for a fixed 8 there exists the limit limpo(w/p)== fx(x,y)cosO+ fy(x,y)sinO. This limit is called the directional derivative in the direction 0 at (x, y). The partial derivatives fx and 1;. are special cases of the directional derivative for 0=0 and n/2, respectively. Suppose that we are given a curve lying in the tinterior of the domain of f and that the curve passes through the point (x, y), where the curve is differenti- able. Then the partial derivative of w = f(x, y) in the direction of the normal to the curve at (x, y) is called the normal derivative of w at the point (x, y) on the curve and is denoted by cw/on. Analogous definitions and notations have been introduced for functions of more than two variables, To see the geometric significance of the total differentiability of w = f(x, y), we consider the graph of the function w = [(x, y) and a point (a, bJ(a, b)) on the graph, Then the plane represented by w- f(a, b) = ex (x -a)+ [3(y-b) is the ttangent plane to the surface at (a,bJ(a, b)) if and only if ex= fAa, b) and [3 =.fy(a, b). The existence of fx and fy depends on the choice of coordinate axes, while the total differentia- bility of f does not. H. Higher-Order Partial Derivatives Suppose that a partial derivative of a function w= f(x, y,.", z) defined on an open set G again admits partial differentiation. The latter partial derivative is called a second-order partial derivative of f We can similarly define the nth order partial derivatives. Higher-order partial derivatives are denoted as follows: c ( cw ) 02W 0- ;) = o 2 =fxAx,y,...,z), ex ex x iJ ( OW ) 02W oy ox = cxcy =.fXY(x,y,...,z), O ( iJ 2 w ) iJ3w '1  o = o =[xyAx,y"."z)"", ex ox y cxoy x In general, fxy and .fyx are not equal. (Peano's example: Let f(x, y) = xy(x 2 - y2)/(X 2 + y2) for (x, y) eft (0,0) and f(O, 0) = (} Then fxy == -I, J;x = I at (0,0),) However, if both fxy and hx are continuous on an open set G', then they coincide in G', Furthermore if ! I' and ! , x'Jy' xy exist in a neighborhood U of a point P belong- ing to the domain of j and [Xy is continuous 
397 at P, then hx exists at P and fxy = hx (H. A, Schwarz). If fx and f y exist in U and are totalIy differentiable at P, then fxy = hx at P (W. H, Young). Similarly, if the partial derivatives of order;:: 3 f xy... and f yx . are all continuous, then f..xy... = f yx . Hence we can change the order of differentiation if all the derivatives concerned are continuous. I. Composite Functions of Several Varia bles Let w be a function of x, y, . ,. , z, and let each x, y, ..., z be a function of t. Suppose that the range of(x(t),y(t), ...,z(t)) is contained in the domain of w. Then w is a function of t. If fur- ther w is totalIy differentiable and x, y, ... ,z are alI differentiable, then w as a function of t is differentiable, and we have dw 8w dx 8wdy 8wdz -=--+-+ ...+-. dt 8x dt 8ydt 8zdt If partial derivatives of order;:: 2 are totally differentiable, then d 2 w/dt 2 , d 3 w/dt 3 ,... are obtained by repeating the above procedure. A similar consideration is valid when x, y, ... ,z are functions of several variables. J. Taylor's Formula for Functions of Several Varia bles Suppose that f(x, y) is defined on an open domain G, f(x, y) has continuous partial de- rivatives of orders up to n, and the line seg- ment (a+(x-a)t,b+(y-b)t) (O.s;t.s; 1) is contained in the domain G. Then there exists a number 0 (0<0< 1) such that f(x,y) = f(a,b)+(X-a) :x +(y-b) :y )f(a,b) 1 ( 8 8 ) 2 + 2! (x-a) 8x +(y-b) 8y f(a,b)+... 1 ( 8 8 ) n-I +- (x-a)-+(y-b)- f(a,b) (n-l)! 8x 8y 1 ( 8 8 ) n +- (x-a)-+(y-b)- n! 8x 8y xf(a+(x-a)O, b+(y-b)O), where, for instance, the third term «x -a)' (8/8x) + (y- b)(8/8y))2f(a, b) means (x -a)2(82f/8x 2 )(a, b) + 2(x -a)(y - b)' Wf/8x8y)(a, b) + (y - W(8 2 f/8l)(a, b), with Wf/8x 2 )(a, b), Wf/8x8y) (a, b), and Wf/8 y 2)(a, b) denoting the values of (82f18x 2 ), (8 2 f/8x8y), and Wf/8 y 2) at (a, b), respectively. The displayed formula is called Taylor's for- mula for a function of two variables, A similar 106L Differential Calculus formula is valid for a function of n variables (n;:: 3). As in the case of functions of one vari- able, we can derive approximation formulas for f from Taylor's formula. K. Classes of Functions If alI the partial derivatives of order n of f(P) are continuous on an open set G, then f is said to be a function of class cn (or n-times contin- uously differentiable) on G, The set of all n- times continuously differentiable functions is denoted by cn (n = 1,2, ,..). A continuous function is of class Co. A function of class C 1 is also called a smooth function. It is obvious that CO =:> C 1 =:> C 2 =:> .... A partial derivative of order r.s; s of a function belonging to class C' does not depend on the order of the differenti- ation. A function belonging to Coo = n1 C' is said to be of class Coo or infinitely differenti- able. We sometimes say that a function has a certain "nice" property or is "well behaved" if it belongs to some C' (r;:: 1). Let w = f(x, y, ..., z) be a function defined on an open set G in Rn and P=(a, b,..., C)EG. If f(x,y, ...,z)= f(a,b, ...,c) 00 00 + L ... L ex'l" ,.(x-a)'l(y-b)" ...(z-c)'. r1=1 r"=1 holds in some open neighborhood U of P, where the right-hand side of the equality is an absolutely convergent series, then f is said to be real analytic at P. In this case, f is r-times differentiable at P for any r, and we have r l !r2! ...r n ! ex - '1',,,,. (rl+r2+...+rn)! aT, +'2+ .+r"f X 8 (a, b, ... , c). 8X'1 Y"... 8z'. If f is real analytic at every point P of the domain G, then f is called a real analytic func- tion on G. Sometimes, a real analytic function is called a function of class C"'. A real analytic function belongs to Coo, but the converse is not true ( 58 Coo-Functions and Quasi-Analytic Functions E). L. Extrema Let f be a real-valued function defined on a domain G in an n-dimensional Euclidean space Rn that has the point fo in its interor. If there exists a neighborhood U of Po such that for every point P (#P o ) of U we have f(P);:: f(po), then we say that f has a relative minimum at Po, and f(Po) is a relative mini- mum of f Replacing ;:: by .s;, we obtain the definition of a relative maximum. f(Po) is 
106 Ref. Differential Calculus called a relative extremum if it is either a rela- tive maximum or a relative minimum. To find relative extrema, the following facts concerning the sign of the derivative are useful. Suppose that a function of a single variable is differentiable on an interval I. Then we have the following: (I) If f has a relative extremum at an interior point Xo of I, then f'(xo) = O. (2) If f'(xo) = 0 and f'(x) changes its sign at Xo from positive (negative) to negative (positive), then f has a relative maximum (minimum) at xo. (3) If f'(xo) = 0 and f is twice differentiable on some neighborhood of xo, then f has a relative maximum or minimum according as f"(xo) < 0 or > O. If f"(xo) = 0, then nothing definite can be concluded about a relative extremum of fat Xo. In general, if there exists a neighborhood of Xo in which f is r-times differentiable (r is even) and j<r) is continu- ous, and if f'(Xo) = f"(Xo) = ... = j<,-I)(xo)=O, f(')(x o ) > 0 (or <0), then f has a relative mini- mum (maximum) at x o ' On the other hand, if this condition holds with odd r, then f(x o ) is not a relative extremum. If f'(xo) == 0, then f(x o ) is called a stationary value of f If a function f on n variables x, Y, ... ,2 has a relative extremum at (x o , Yo, ...,2 0 ), then we have fAx o , Yo, ..., 2 0 )= 0, fy(x o , Yo, ...,2 0 )= 0, ...JAxo,yo, ...,zo)=O, provided that the partial derivatives of f exist. Assume that for a function f of class C 2 of two variables x and y, we have fAxo, Yo) = 0 and fy(x o , Yo) = 0, and let b = fxx!xo, Yo)J;,y(xo, Yo) - fx;(x o , Yo). Then we have the following: (1) If b > 0, then according as fxx!xo,yo)<O or >0, f has a relative maxi- mum or minimum at (xo, Yo)' (2) If b < 0, then f does not have a relative extremum at (x o , Yo). (3) If (j = 0, then without further information nothing definite can be said about a relative extremum of f at the point. Let x I, ... ,X n be independent variables. If a function f of variables XI, ,.. ,X n has a relative extremum at a point Po = (x, ". ,x), then /; = fx,(Po) = 0 (i = 1, ,.. , n), provided that all the partial derivatives of f exist. In general, a point Po where f is totally differentiable and these conditions are satisfied is called a critical point of f The value f(Po) at a critical point is called a stationary value. If further f is of class C 2 , then consider a tquadratic form of n variables Q=Q(X I , ...,Xn)=Li,k/;kXiXk' where /;k =.fx,Xk(P O )' Suppose that I,t;kl # 0, Then according to whether Q is tpositive definite, tnegative definite, or tindefinite, f has a relative minimum, relative maximum, or no relative extremum at Po. If I/;kl = 0, then nothing can be said in general. A critical point P of f is said to be nondegenerate if If,kl #0 and degenerate if lfik I = 0, We can also apply the method of differenti- 398 ation of timplicit functions to find relative extrema of functions defined implicitly. Given functions qJ I' ..., <Pm (m < n), the problem of finding a relative extremum of f(x I' .., ,x n ) under the condition that <P I (x I' ... , x n ) = 0, ... , <Pm(x I , "', x n ) = 0 is called the problem of finding a conditional relative extremum. This problem can be reduced to the problem of finding a relative extremum of an implicit function. Actually, if the functions 1, <P I' ... , qJm are of class C I and the tJ acobian 8(<pI' ..., <Pm)/8(x n - m +1> ..., x n ) does not vanish in the domain considered, then y I = X n - m + I , ... , Ym = X n can be regarded as implicit func- tions of XI' ..., Xl (I = n - m). Hence we can set f(XI'''' ,xl> YI,"" Ym)= f*(x l ,... ,Xl)' Then f has a relative extremum at (x, ,.. ,x) under the condition <P I = ... = qJm = 0 if and only if f* has a relative extremum at Po = (x?, .. " x?). The latter condition implies that all 8f* /8x j (j = 1, ..., I) vanish at Po, which holds if and only if for arbitrary constants ..1.1' ... , Am the function F(x l , ...,x n )= f +..1. 1 <PI +... + Am<Pm satisfies 8F/8x i == 0 (i = 1, .., , n), and further <PI = 0, ..., <Pm =0 at (x, ..., x). From this system of equations we can often find the values of x, ... , x, This method of finding conditional relative extrema is called Lagrange's method of indeterminate coefficients or the method of Lagrange multipliers ( 208 Implicit Func- tions; 216 Integral Calculus H; 379 Series H). References [IJ T. M. Apostol, Mathematical analysis, Addison-Wesley, 1957. [2J N. Bourbaki, Elements de mathematique, Fonctions d'une variable reelle, Actualites Sci. Ind., 1074b, 1132a, Hermann, second edition, 1958,1961. [3J R. C. Buck, Advanced calculus, McGraw- HilI, second edition, 1965, [4J R, Courant, Differential and integral cal- culus I, II, Nordemann, 1938. [5J G. H. Hardy, A course of pure mathemat- ics, Cambridge Univ. Press, seventh edition, 1938. [6J E. HiJle, Analysis, Blaisdell, I, 1964; II, 1966. [7J W. Kaplan, Advanced calculus, Addison- Wesley, 1952, [8J E. G, H. Landau, Einfuhrung in die Dif- ferentialrechnung und Integralrechnung, Noordhoff, 1934; English translation, Dif- ferential and integral calculus, Chelsea, 1965. [9J J. M. H. Olmsted, Advanced calculus, Appleton-Century-Crofts, 1961. [IOJ A. Ostrowski, Vorlesungen uber 
399 Differential-und Integralrechnung I, II, III, Birkhiiuser, second edition, 1960-1961. [IIJ M. H. Protter and C. B. Morrey, Modern mathematical analysis, Addison-Wesley, 1964. [12J W, Rudin, Principles of mathematical analysis, McGraw-HilI, second edition, 1964. [13] V. I. Smirnov, A course of higher mathe- matics. I, Elementary calculus; II, Advanced calculus, Addison-Wesley, 1964. [14] A. E. Taylor, Advanced calculus, Ginn, 1955. 107 (XIII. 1 ) Differential Equations A. Ordinary Differential Equations It was Galileo who found that the accelera- tion of a faIling body is a constant and thence derived his law of a faIling body x(t) = gt 2 /2 as what we would now view as a solution of the differential equation x"(t)=g, where x(t) de- notes the distance the body has falIen during the time interval t and 9 is the constant gravi- tational acceleration. This pioneering work may be regarded as the first example of solu- tion of a differential equation. Also, the tequa- tions of motion, proposed by I. tNewton as the mathematical formulation of the law of motion, including Galileo's law as a special case, are differential equations of the second order. Thus differential equations appeared, simultaneously with differential and integral calculus, as an indispensable tool for the uni- fied and concise expression of the laws of nature. Such laws are generalIy calIed dif- ferential laws. Newton completely solved the equations of the ttwo-body problem proposed by himself; G. W. tLeibniz also succeeded in solving many simple differential equations. In the 18th century, many mathematicians, such as the tBernoulIis, A. C. Clairaut, J, F, Riccati, L. tEuler, and J. L. tLagrange, at- tacked and solved differential equations of various types independently. In that period, the emphasis was on solution by quadrature, that is, applying to telementary functions a finite number of algebraic operations, trans- formations of variables, and indefinite inte- grations. It was toward the end of the 18th century that new methods, such as integration by infinite series, came to be discussed. A method of variation of constants for the solu- tion of linear ordinary differential equations was invented by Lagrange in 1775. At the beginning of the 19th century, C. F. tGauss 107 A Differential Equations initiated the study of differential equations satisfied by thypergeometric series. The problem of existence of solutions, which supplies a foundation of modem differential equation theory, was first treated by A. L. tCauchy. His proof of the existence theorem was later improved by R. L. Lipschitz (1869). Pioneers in the function-theoretic treatment of differential equations were C. A. A. Briot and J. C. Bouquet, who investigated the singular points of a function defined by an analytic differential equation. Also, B. tRiemann pro- posed a new viewpoint which influenced L. Fuchs in his development of the theory of linear ordinary differential equations in the complex domain (1865). Works of A. M. Le- gendre on telIiptic functions and of H. tPoin- care on tautomorphic functions should also be mentioned in this connection. After the Cauchy-Lipschitz existence the- orem for the equation y' = f(x, y) was known, efforts were directed toward weakening the conditions imposed on f(x, y). G. Peano first succeeded in giving a proof under the continu- ity assumption only (1890), and his results were sharpened by 0, Perron (1915), Regarding the uniqueness of solutions of tinitial value problems, there are various re- sults by W. F. Osgood (1898), Perron (1925), and many Japanese mathematicians, In the course of this work the necessary and sufficient condition for uniqueness was successfully formulated in a concise form ( 316 Ordi- nary Differential Equations (Initial Value Problems». For linear differential equations with peri- odic coefficients, investigations were carried out by C. Hermite (1877), E. Picard (1881), G. Floquet (1883), G, W. Hill (1886), and others. For instance, solutions satisfying y(x + w) = AY(x) were found to exist, where w is the period of the coefficients. Analogous re- sults folIowed in the case of doubly periodic coefficients. Techniques of factorization of linear dif- ferential equations developed by G. Frobenius (1873) and E. Landau (1920) should also be noted. Picard (1883), J. Drach (1898), and E. Vessiot (1903, 1904) established a remark- able result on the solvability (in the sense of solution by quadrature) of linear differential equations, successfulIy extending the tGalois theory in this new direction. The concept of tasymptotic series, which in a sense approximate the solution of differential equations, was introduced by Poincare (1886) and extended by M. A. Lyapunov (1892), J. C. C. Kneser (1896),1. Horn (1897), C. E. Love (1914), and others. Poincare was also the founder of topological methods in differential 
107B Differential Equations equation theory, and his ideas were developed extensively by I. Bendixson (1900), Perron (1922,1923), G. D. Birkhoff, and others ( 126 Dynamical Systems). In 1890, Picard invented an ingenious tech- nique of tsuccessive approximation for the proof of existence theorems, and his technique is now widely used in every application of functional equations. The technique of reduc- ing linear differential equations to linear tintegral equations of Volterra type was also developed. On the tboundary value problems and teigenvalue problems that appear in many areas of physics, there was extensive research by mathematicians such as J. C. F. Sturm (1836), J. Liouville, L. Tonelli, Picard, M. BOcher (1898, 192 I), Birkhoff (1901, 1911), and others. In this connection the problem arises of expanding a given function by an torthogonal system of functions obtained as teigenfunctions of a given boundary value problem. Those problems were brought into unified form by D. tHilbert (1904) in his theory of tintegral equations. Subsequently boundary value problems of ordinary and partial dif- ferential equations came to be discussed in this framework. Finally, it should be mentioned that the tcalculus of variations created by Euler and Lagrange gave rise to the study of a certain class of differential equations bearjng the name of Euler ( 46 Calculus of Variations). B. Partial Differential Equations The origin of partial differential equations can be traced back to the study of hydrodynamic problems by J. d'Alembert (1744) and Euler. However, perhaps Lagrange and P. S. tLa- place were the first to investigate the general theory. Subsequently, during the 18th and 19th centuries, it was developed by G. Monge, A.-M. Ampere, J. F, Pfaff, C. G. tJacobi, Cauchy, S, tLie, and many other mathema- ticians. The fundamental existence theorem for the initial value problem, now called the Cauchy-Kovalevskaya theorem, was proved by S. Kovalevskaya in 1875 ( 321 Partial Differential Equations (Initial Value Problems) B). Because of their close connection with prob- lems of physics, linear equations of the second order have been a chief object of research. Up to the 19th century, classification into telliptic, thyperbolic, and tparabolic types and the study of boundary and initial value problems for each of these types constituted the main part of the theory. In the 20th century, more complicated 400 problems-tnonlinear problems appearing in the study of viscous or compressible fluids, or the study of equations of tmixed type in con- nection with supersonic flow-have emerged as important topics; and the newly developed techniques of functional analysis have brought about remarkable changes. Especially in the study of the tSchrodinger equations of quan- tum mechanics and of more general tevolution equations, this method has proved to be a powerful tool. Finally, we should not fail to mention that the development of electronic computers has made it possible to obtain numerical solu- tions and to discover many important facts. tNumerical analysis is now becoming an indis- pensable part of the theory ( 304 Numerical Solution of Partial Differential Equations). References [IJ E. L. Ince, Ordinary differential equations, Dover, 1956. [2J L. S. Pontryagin, Ordinary differential equations, Addison-Wesley, 1962. (Original in Russian, 1961.) [3J E, A. Coddington and N. Levinson, Theory of ordinary differential equations, McGraw-Hill, 1955. [4J P. Hartman, Ordinary differential equa- tions, Wiley, 1964. [5J E. Hille, Lectures on ordinary differential equations, Addison-Wesley, 1969, [6J L. Bieberbach, Theorie der gewohnlichen Differentialgleichungen auf funktionen the ore- tischen Grundlage dargestellt, Springer, 1953. [7J E. Hille, Ordinary differential equations in the complex domain, Wiley, 1976. [8J W. Wasow, Asymptotic expansions for ordinary differential equations, Interscience, 1965, [9J I. Kaplansky, An introduction to differen- tial algebra, Hermann, 1957, [!OJ M. A. Naimark, Linear differential oper- ators, I, II, Ungar, 1967, 1968. (Original in Russian, 1954,) [l1J R. Courant and D. Hilbert, Methods of mathematical physics I, II, Interscience, 1953, 1962. [12J P. R. Garabedian, Partial differential equations, Wiley, 1964. [13J I. G, Petrovskii, Lectures on partial differential equations, Interscience, 1954. (Original in Russian, 1950.) [14J L. Hormander, Linear partial differential operators, Springer, 1963. [15J K. Y os ida, Lectures on differential and integral equations, Wiley, 1960, (Original in Japanese, 1950.) 
401 108 (XIX.1 0) Differential Games A. Introduction The study of differential games arose from the study of pursuit and evasion problems and various tactical problems. The first work was done by R. P. Isaacs [1] in a series of RAND Corporation memoranda that appeared in 1954. He applied to many illustrative examples the method of Hamilton-J acobi differential equations ( eq. (4) below). The heuristic results ofIsaacs were made rigorous by W. H. Fleming [2J, L. D, Berkovitz [3J, A. Friedman [4, 5J, and others, B. Zero-Sum Two-Person Games Suppose that there are two antagonists, each exerting partial control over the state of a system, One wishes to maximize a given payoff that is a functional of the state and the control exerted, while the other wishes to minimize this payoff, Let the state of a differential game at time t be represented by an n-dimensional vector x(t) ERn. In a zero-sum differential game between two players I and II, we are given a system of n differential equations dx/dt = f(t, x, u, v) with an initial condition x(r) =  ERn, where UERP is chosen at each instant of time by player I and vERq is chosen by player II, We assume that the function f(t, x, u, v) is Con- tinuous in t and continuously differentiable on the entire (x, u, v)-space. It is usually assumed that both players know the present state of the game and that they know how the game proceeds; that is, they know the system (1). Each player can take the state of the game into account in making his choice. Thus player I can let his choice of u be governed by a vector function u(t, x) defined on D, where Dc [0,00) x Rn is a fixed region of the (t, x)-space, Similarly, player II can let his choice of v be governed by a vector function v(t, x) defined on D. A finite collection of subregions DI' "', Dr of a region D is said to constitute a decomposi- tion of D whenever the following conditions hold: (i) Each D i (i = 1, ... , r) is connected and has a piecewise smooth boundary; (ii) D i n Dj = <p if i # j. A function defined on 15 is said to be piecewise C I in x on 15 ifthere is a decom- position of D such that on each D i the function and aU its derivatives with respect to x are continuous in (t,x) on 15 i . Let U and Vbe fixed closed subsets of RP and Rq, respectively, 108B Differential Games Let Su denote the class of functions u(t, x) that are piecewise C 1 in x on 15 and have their range in U. Similarly, let Sv denote the class of functions v(t, x) that are piecewise C I in x on 15 and have their range in V. Let UES u and VES v , and consider the dif- ferential equation dx/dt = f(t, x, u(t, x), v(t, x)), (2) subject to the initial condition x(r)=. (3) We say that a pair (u, V)E Su X Sv is playable if for every (r,) in D every solution of (2) satisfying (3) stays in D and reaches a terminal manifold F in finite time, where F is a smooth manifold contained in 15. Let ilucSu, ilucS v be the maximal pair of subclasses such that each pair (u, v)Eilu x ilv is playable. We call the functions in ilu and ilv the strategies for the players. For each strategy pair (u, v) we can define a functional J(r,,u,v) f 'l = g(t I, x(t d) + h(t, x(t), u(t, x(t)), v(t, x(t))) dt, '0 (1) where x(t) is a solution of (2) and (3) and t I is the first time that (t, x(t)) reaches the terminal manifold F. The functional J is called the payoff, Let (u*, v*)Eil u x ilv be a strategy pair. Suppose that for any uEil u and vEilv' the inequalities J(r, (, u, v*).s; J(r,, u*, v*) .s;J(r, (, u*, v) hold for all (r,)ED. We say that(u*,v*) is a saddle point relative to the classes ilu and il v ' The function W(t,x)=J(t,x, u*, v*) defined on D is called the value function of the game. Berkovitz [3J proved that the value function W(t, x) is continuous on D and continuously differentiable on each D i and satisfies max [h(t, x, u, v*) + WAt, x)f(t, x, u, v*)J UED = min [h(t, x, U*, v) + WAt, x)f(t, x, u*, v)] VEV = h(t, x, u*, v*) + WAt, x)f(t, x, u*, v*) = - Jv,(t,x). (4) Equation (4) is called the Hamilton-Jacobi equation. Let x*(t; r,) be the optimal trajectory corre- sponding to the saddle-point strategies (u*, v*) and resulting from an initial point (r, ()E D. Then there exists an n-dimensional continuous 
108 C Differential Games vector function ).(t; r, () such that the following hold [3]: (i) The functions x* and ..1. satisfy the system of differential equations dx/dt = f(t, x, u*(t, x), v*(t, x)), dA/dt= - HAt, x, ..1., u*(t, x), v*(t, x)), where H(t, x, ..1., u, v) = h(t, x, u, v) + Af(t, x, u, v), (ii) If x = x*(t; r, ), t? r, then WAt,x)= A(t; r, (). (iii) At t = t h the transversality condition aT ogoT og ax ,oX H-+--+---A-=O 00" at 00" ax 00" 00" holds, where the terminal manifold F is given parametrically by the relations t= T(O"), x = X (0"), 0" ranging over a cube in some finite- dimensional Euclidean space. (iv) For all r:(t:(t l , max min H (t, x*(t), },(t), u, v) ueU VEV =minmax H(t,x*(t), A(t), u, v) veV ueU = H(t, x*(t), A(t), u*(t, x*(t)), v*(t, x*(t))). P. Varaiya and J. Lin [6J and Friedman [4, 5] have defined certain special classes of differential games, and have shown that under their definitions the games have nonzero value functions, C. N-Person Differential Games In a differential game between many players, the state vector x(t)ER n is governed by dx/dt=f(t,x,u I , ...,u N ), x(r)=(, where u i E RP' is chosen at each instant of time by player i. Each u, is constrained to lie in a fixed closed subset U i of RP,. Let Si be the class offunctions ui(t, x) that are piecewise C' in x on i5 and have their range in U i . We say that an element u=(u I , ...,u N ) of SI x ... X SN is playable if, for every (r, )E D, every solution of the differential equation dx/dt = f(t, x, U l (t, x), ..., uN(t, x)), x(r)=, (5) stays in D and reaches a terminal manifold F in finite time. Let QiC Si (i= 1, "', N) be the maximal subclasses such that each ele- ment u=(u I , ...,UN)EQI x... xQN is playable, We call the functions UiEQi (i= 1, ...,N) the strategies, For each strategy N-tuple we define 402 a payoff Ji(r, (, U I ,..., UN)=gi(t l , x(t d) + f 'l h;(t, x(t), udt, x(t)), ..., uN(t, x(t)))dt, '0 where x(t) is a solution of (5) and t I is the first time that (t, x(t)) reaches the terminal manifold F. Each player i is to choose his strategy UjEQi so as to maximize his own payoff J i . There are many definitions of "solution" for games involving more than two players. A strategy N-tuple u* = (u l *, ..., UN *) is called an equilibrium point for the game if the inequalities Ji(U l *,..., u i - t *, u i , Ui+l *,...) :(Ji(U i *, ""Ui-I*,Ui*,Ui+I*, ...) (i= I, ...,N) hold for any u1EQI, ...,UNEQN' J. H, Case [9] has shown that the conclusions drawn for zero-sum two-person games also hold for N- person differential games. References [IJ R. P. Isaacs, Differential games, Wiley, 1965. [2J W. H. Fleming, The convergence problem for differential games II, Advances in game theory, Ann. Math. Studies 52, Princeton Univ. Press, 1964, 195-210. [3J L. D. Berkovitz, Necessary conditions for optimal strategies in a class of differential games and control problems, SIAM J. Con- trol, 5 (1967),1-24, [4J A. Friedman, On the definition of dif- ferential games and the existence of value and of saddle points, J. Differential Equations, 7 (1970), 69-91. [5J A. Friedman, Differential games, Wiley, 1971. [6] P. Varaiya and J. Lin, Existence of saddle points in differential games, SIAM J. Control, 7 (1969),142-157. [7J H. W. Kuhn and G. P. Szego (eds.), Dif- ferential games and related topics, North- Holland, 1971. [8J L. S. Pontryagin, On the theory of dif- ferential games, Russian Math. Surveys, 21 (1966),193-246. [9J J. H. Case, Toward a theory of many player differential games, SIAM J. Control, 7 (1969), 179-197. 109 (VII.1) Differential Geometry In differential geometry in the classical sense, we use differential calculus to study the prop- 
403 erties of figures such as curves and surfaces in Euclidean planes or spaces. Owing to his studies of how to draw tangents to smooth plane curves, P. tFermat is regarded as a pioneer in this field. Since his time, differential geometry of plane curves, dealing with cur- vature, tcircles of curvature, tevolutes, ten- velopes, etc., has been developed as a part of calculus. Also, the field has been expanded to analogous studies of space curves and surfaces, especiaIly of tasymptotic curves, tlines of cur- vature, tcurvatures and tgeodesics on surfaces, and truled surfaces. C. F, tGauss founded the theory of surfaces by introducing concepts of the tgeometry on surfaces (Disquisitiones circa superficie curvas, 1827). Gauss recognized the importance of the intrinsic geometry of surfaces, and it is generally agreed that differ- ential geometry as it is known today was ini- tiated by him. Thus differential geometry came to occupy a firm position as a branch of mathematics. The influence that differential- geometric investigations of curves and surfaces have exerted upon branches of mathematics, physics, and engineering has been profound. For example, E. Beltrami discovered an in- timate relation between the geometry on a tpseudo-sphere and tnon- Euclidean geometry, The study of tgeodesics is a fertile topic deeply related to dynamics, the calculus of variations, and topology, on which there is excellent work by J. Hadamard, H. tPoincare, P. Funk, G. D. Birkhoff, M. Morse, R. Bott, W. Klin- genberg, and M. Berger, among others, The theory of minimal surfaces initiated by J, L. Lagrange was an application of the calculus of variations. At the early stages of develop- ment, G. Monge, J. B, M. C. Meusnier, A. M. Legendre, O. Bonnet, B. Riemann, K. Weier- strass, H. A. Schwarz, Beltrami, and S. Lie contributed to the theory. Weierstrass and Schwarz established its relationship with the theory of functions. 1. A. Plateau showed experimentally that tminimal surfaces can be realized as soap films by dipping wire in the form of a closed space curve into a soap solu- tion (1873), The Plateau problem, i.e., the prob- lem of proving mathematically the existence of a minimal surface with prescribed bound- ary curve, was solved by Tibor Rad6 in 1930 and independently by J. Douglas in 1931. Although the relationship to function theory is lost for higher-dimensional minimal sub- manifolds, their study is intimately related to the calculus of variations and topology. Euclidean geometry is a geometry belonging to F, Klein's Erlangen program ( 137 Er1an- gen Program). For other geometries in the sense of F, Klein we may also consider the cor- responding differential geometries. For instance, in tprojective differential geometry we study 109 Differential Geometry by means of differential calculus the properties of curves and surfaces that are invariant under projective transformations. This subject was studied by E, J, Wilczynski, G. Fubini, and others; taffine differential geometry and tcon- formal differential geometry were studied by W, Blaschke and others ( 110 Differential Geometry in Specific Spaces). Influenced by Gauss's geometry on surfaces, in his inaugural address at Gottingen in 1854 Riemann advocated an intrinsic differential geometry completely independent of embed- dings (Ober die Hypothesen, welche der Geo- metrie zugrund Iiegen; Werke, 2nd ed. 1892, 272-287) ( 364 Riemannian Manifolds). Removing the restriction to two dimensions and considering abstract manifolds of dimen- sion n, he introduced what is now known as the Riemannian metric; actually he considered the more general metrics that had formed the subject matter of the dissertation of P. Finsler in 1918 ( 152 Finsler Spaces). Riemann- ian geometry includes Euclidean and non- Euclidean geometry as special cases, and is important for the great influence it exerted on geometric ideas of the 20th century. Under the influence of the algebraic theory of invar- iants, Riemannian geometry was then studied as a theory of invariants of quadratic tcovar- iant tensors by E. B, Christoffel, C. G, Ricci, and others. Riemannian geometry attracted wide attention after A. Einstein applied it to the tgeneral theory of relativity in 1916. In the same year, T. Levi-Civita introduced the notion oftLevi-Civita parallelism, which contributed greatly to the clarification of geo- metric properties of Riemannian spaces. Ob- serving parallelism to be an affine-geometric concept, H. Weyl and A. S. Eddington devel- oped a theory of Riemannian spaces "affinely" based on the notion of parallelism without using metrical methods. Such a geometry is called a geometry of an affine connection ( 80 Connections). Every straight line in a Euclidean space has the property that all tangents to the line are parallel. In a space with an affine connection, we may define a family of curves called tpaths as an analog of straight lines. Such curves are solutions of a system of ordinary differential equations of the second order of a certain type. Coefficients of such differential equations determine a parallelism and hence an affine connection. H. Weyl discovered transforma- tions of coefficients that leave the family of paths invariant as a whole, namely, projective transformations of an affine connection. A geometry that aims to study properties of paths or affine connections that are invariant under these transformations is called a projec- tive geometry of paths. Such geometry was 
109 Differential Geometry studied by L. P. Eisenhart, O. Veblen, and others. The concept of projective connections was an outcome of such studies. Similarly, the concept of conformal connections was devel- oped from the consideration of tconformal transformations of Riemannian spaces. These geometries cannot in general be re- garded as geometries in the sense of Klein. Actually, anyone of these geometries generally has no transformations that correspond to tcongruent transformations of geometries in the sense of Klein; even if it has such trans- formations, they do not act transitively on the space. Thus geometries are naturally divided into two categories, one consisting of geome- tries in the sense of Klein (based on the group concept) and the other of geometries based on Riemann's idea. Under such circumstances, E. Cartan unified the thoughts of Klein and Riemann from a higher standpoint and con- structed his theory of connections in a series of papers published between 1923 and 1925. He developed the theory of affine, projective, and conformal connections from a viewpoint con- sistent with that of Klein. Just as each tangent space of a Riemannian manifold is viewed as a Euclidean space, an affine connection regards the tangent space at each point as an affine space and develops it onto the tangent space at an infinitesimally nearby point. In discuss- ing projective connections Cart an attached a projective space to each point of a manifold as an infinitesimal approximation, and similarly for conformal connections, More generalIy, he attached to each point of a manifold a fixed Klein space, i.e., a homogeneous space of a Lie group, caUed the structure group. Thus Cartan introduced the concept of fiber bundle ( 147 Fiber Bundles). Then he defined a connection as a development of the fiber, i.e., the gen- eralized tangent space, at each point onto the fiber at an infinitesimalIy nearby point ( 80 Connections B). If G is the group of congruent transformations in Euclidean space, a mani- fold with connection having G as its structural group is called a manifold with Euclidean con- nection. Among manifolds with Euclidean connection, Riemannian manifolds are charac- terized as those without ttorsion. If we take the group of congruent transformations of projective (conformal) geometry as G, we have manifolds with projective (conformal) connec- tion in the sense of Cartan. Among these, there are remarkable ones caUed manifolds with normal projective (or conformal) connections, which are essentially the same as the ones studied by Veblen and others. Cart an's idea had a profound influence on modern differen- tial geometry. The method of moving frames, created by G. Darboux and extensively used by Cartan in his theory of connections, was a 404 forerunner of the theory of fiber bundles, Com- bining tGrassmann algebra with differential calculus, Cartan developed a powerful com- putational tool known as calculus of differen- tial forms ( 105 Differentiable Manifolds Q). Differential forms have become indispensable in topology, algebraic geometry, and in studies of functions of several complex variables, as weU as in differential geometry. The work of Lie on transformation groups also had a profound influence on Cartan, The latter's work on tLie groups, particularly on simple Lie groups, and on differential geome- try culminated in 1926 in his discovery of tRiemannian symmetric spaces. These spaces are natural generalizations of the spherical surface and the unit disk in the complex plane with Poincare metric, and play essential roles in unitary representation theory and in other areas of mathematics. A tangent line to a Curve C at a point P of C is the limit line of the line PQ, where Q is a point on C aporoaching P; hence we can de- fine it locally. A concept (or property), such as this, that can be defined in an arbitrary small neighborhood of a point of a given figure or a space is called a local concept (or local prop- erty) or a concept (or property) in the small. In the early stages of the development of dif- ferential geometry, differential calculus was the main tool of study, so most of the results were local. On the other hand, a concept (or property) that is defined in connection with a whole figure or a whole space is caIled global or in the large. In modern differential geome- try, the study of relations between local and global properties has attracted the interest of mathematicians. This view was emphasized by Blaschke, who worked on the differential geometry oftovals and tovaloids. The study of trigidity of ovaloids by S. Cohn-Vossen be- longs in this category, and many works on geodesics and minimal surfaces were done from this standpoint. From the viewpoint of modern mathemat- ics, the basic concepts on which we construct Riemannian geometry and geometries of con- nections are global concepts of tdifferentiable manifolds. However, in Riemann's time the theory oft Lie groups and topology were not yet developed; consequently, Riemannian geometry remained a local theory. In 1925, H. Hopf began to study the relations between local differential-geometric structures and the topological structures of Riemannian spaces, However, except for the work of Cartan, Hopf, and a few others, differential geometry in the 1920s was stiIllargely concerned with surfaces in the 3-dimensional Euclidean space or local properties of Riemannian manifolds, and with affine, projective, and conformal connections. 
405 Gradually the concept of differentiable mani- folds was clarified, the global theory of Lie groups made progress, and topology devel- oped; and the trend toward global differential geometry began slowly in the early 1930s. The dissertation ofG.-W. de Rham published in 1931 showed that the cohomology of a mani- fold can be computed in terms of differential forms ( 105 Differentiable Manifolds R). His theorem provides the theoretical foundation for expressing cohomological invariants of a manifold in terms of differential geometric invariants. In a series of papers immediately following de Rham's, W. V. D. Hodge es- tablished that, on a compact Riemannian manifold, every r-dimensional cohomology class can be uniquely represented by a har- monic form of degree r ( 194 Harmonic Integrals). An important class of complex manifolds with compatible Riemannian metric was dis- covered by 1. A. Schouten, D. van Dantzig, and E. Kahler around 1929-1932. This class of manifolds, called Kahler manifolds today, comprises the projective algebraic manifolds. Hodge's theory of harmonic integrals is most effective when applied to compact Kahler manifolds, ( 232 Kahler Manifolds). The most celebrated global theorem in classical differential geometry of surfaces is the Gauss-Bonnet formula (1848) ( 364 Rie- mannian Manifolds D). The formula was gen- eralized to closed hypersurfaces of Euclidean space by Hopf in 1925, to closed submanifolds of Euclidean space by C. B. Allendoerfer and W. Fenchel in 1940, and finally to arbitrary closed Riemannian manifolds by Allendoerfer and A. Weil in 1943. But the simple proof given by S. S. Chern in 1944 contained the notion of transgression, which has become essential in the theory of characteristic classes ( 56 Characteristic Classes), The discovery of Pontryagin classes for Riemannian manifolds (1944) and Chern classes for Hermitian mani- folds (1946) culminated in the tindex theorem and the tRiemann-Roch theorem of F. E. P. Hirzebruch, and finally in the tAtiyah-Singer index theorem. A simple but fruitful idea of S, Bochner, relating harmonic forms to curvature, es- tablished tvanishing theorems for harmonic forms of Riemannian manifolds and for holo- morphic forms of Kahler manifolds under suitable positivity conditions for curvature. His idea has led to the vanishing theorems of K. Kodaira and others ( 232 Kahler Mani- folds D). The work of C. Ehresmann in 1950 on con- nections in principal fiber bundles established a solid foundation to Cartan's theory of con- nections. Gauge theory in physics is largely 109 Ref. Differential Geometry based on the theory of connections in prin- cipal fiber bundle. R. H. Nevanlinna's value distribution theory and its subsequent generalization by Chern and others can be best described in differential- geometric terms. Differentiable manifolds are currently ob- jects of research in both differen tial geometry and differential topology. While topology studies manifolds per se, differential geometry may be considered as the study of differenti- able manifolds equipped with geometric struc- tures, such as metric tensors, connections, (almost) complex structures, and various other tensors. Through these geometric structures, differential geometry enjoys close contact with many branches of mathematics, From its early days, differential geometry has had close ties to topology (as exemplified by the Gauss-Bonnet formula) and to partial dif- ferential equations and analytic functions (through, e.g., the study of minimal surfaces). The bonds with topology were strengthened by Morse theory and, more recently, by the theory of characteristic classes. In the most recent proof of the Atiyah-Singer index theo- rem, differential geometry is an important intermediary between topology and analysis, Differential geometry and algebraic geometry have enriched each other through Kahler manifolds. The theory of functions of several complex variables also has points of contact with differential geometry, such as value distri- bution theory and Cauchy-Riemann struc- tures. Contact and symplectic structures are basic to mechanics. Lorentz manifolds and connections in principal bundles are essential mathematical tools in the general theory of relativity and in gauge theory. Topics such as minimal submanifolds, manifolds of positive curvature, and closed geodesics are active and important areas of research belonging to Rie- mannian geometry proper; at the same time, differential geometry provides a language and methods that are important in wider areas of mathematics. References [IJ L. P. Eisenhart, A treatise on the differen- tial geometry of curves and surfaces, Ginn, 1909. [2] G. Darboux, Le<;ons sur la theorie gen- erale des surfaces, Gauthier-Villars, I, second edition, 1914; II, second edition, 1915; III, 1894; IV, 1896. [3] W. Blaschke, Vorlesungen iiber Differen- tialgeometrie, Springer, I, third edition, 1930; II, 1923; III, 1929 (Chelsea, 1967), [4] L. P. Eisenhart, Riemannian geometry, Princeton Univ. Press, 1926. 
BOA Differential Geometry in Specific Spaces [5J E. Cartan, Le<;ons sur la geometrie des espaces de Riemann, Gauthier-Villars, second edition, 1946; English translation, Geometry of Riemannian spaces, Math. Sci. Press, 1983. [6J J. A. Schouten, Ricci-calculus, Springer, second edition, 1954. [7J O. Veblen and 1. H. C. Whitehead, The foundations of differential geometry, Cam- bridge Tracts, 1932. [8J S. S, Chern, Topics in differential geome- try, Lecture notes, Inst. for Adv. Study, Princeton, 1951. [9] W, V. D. Hodge, The theory and applica- tions of harmonic integrals, Cambridge Univ. Press, second edition, 1952. [1OJ S. Kobayashi and K. Nomizu, Founda- tions of differential geometry, Interscience, I, 1953; II, 1969. [11J E. Cartan, Oeuvres completes I-III, Gauthier- Villars, 1952-1955; second edition, CNRS, 1984. [12] A. L. Besse, Manifolds all of whose geo- desics are closed, Springer, 1978. [13J M. Berger, P. Gauduchon, and E. Mazet, Le spectre d'une variete Riemanninne, Lecture notes in math. 194, Springer, 1971. [14J D. Gromoll, W. Klingenberg, and W, Meyer, Riemannsche Geometrie im Groosen, Lecture notes in math. 55, Springer, 1968. [15J S. Helgason, Differential geometry, Lie groups, and symmetric spaces, Academic Press, 1978. [16J S. Kobayashi, Transformation groups in differential geometry, Springer, 1972. [17J T. Levi-Civita, The absolute differential calculus, Blackie, 1926. [18J J. W. Milnor, Lectures on Morse theory, Ann. Math. Studies 51, Princeton Univ. Press, 1963. [19J R, Osserman, A survey of minimal sur- faces, Van Nostrand, 1969. [20J G. de Rham, Varietes differentiables, Hermann, 1955. [21] A. Wei1, Introduction a I'etude des vari- etes kiih1eriennes, Hermann, 1958. [22J K. Yano and S, Bochner, Curvature and Betti numbers, Ann, Math, Studies 32, Princeton Univ. Press, 1953. [23] Y. Matsushima, Differentiable manifolds, Dekker, 1972. 110 (VII.17) Differential Geometry in Specific Spaces A. The Method of Moving Frames The main theme of this article is the theory of surfaces (i.e., submanifo1ds) in a differentiable 406 manifold V on which a tLie transformation group G acts, If a tLie group G of dimension r acts ttransi- tive1y on a space G* and the tstability group for any point of G* consists of the identity element only, then G* is called the group mani- fold of G, and an element of G* is called a frame. If fo is a fixed frame, then the mapping a-->afo E G* (a E G) gives a tdiffeomorphism of G to G*. Let I(G) be the set of all tdifferen- tia1 forms W of degree 1 on G* that are invar- iant under transformations of G. Then I(G) is a linear space of dimension r and is the tdual space of the Lie algebra 9 of G. A basis {wAil Ar} of I(G) is called a set of reI a- tive components of G. The structure equations hold: 1 r dw A =- L CAVW/\W" 2,v=1 where CAV are tstructure constants of the Lie algebra g, Let G be a Lie transformation group of a space E. Then a G-invariant submanifo1d of E on which G acts transitively is called an orbit. Each point YEE determines an orbit contain- ing y. When there exist parameters k j (1 j  t) such that any G-invariant on E is a function of k 1 , "', k" then these parameters are called the fundamental invariants of E. Let H ( c G) be the stability group at a point Yo on an orbit M; then M is identified with the thomogeneous space GjH by the diffeomorphism <p:M-->GjH, <p(ayo)=aH (aEG). Furthermore, a tprincipal fiber bundle (G*, M, H, r) is determined by the projection r: G*-->M, r(afo)=ayo (aE G). The tfiber Hy on a point yEM is a group manifold of H. Hy is called the family of frames on y, and an element of Hy is a frame on y. Local coordinates e (1  J1  s) of the group Hare called the secondary parameters and are used to indicate frames in Hy. When H is not con- nected, let H O be the connected component of the identity of Hand M be the tcovering manifold GjH O of M. An element YEM over YEM is called an oriented element. Now as- sume that the group H is connect'ed, Then the family of frames Hy on each yEM is given as an tintegra1 manifold of a tcomplete1y inte- grable system of total differential equations n i =0 (1 ir-s, niEI(G)) on the group mani- fold G*. Here the ni are linearly independent and are called the horizontal components of M. The n, are linear combinations of the relative components W A of G, and their coefficients are generalIy functions of the fundamental invar- iants k j . For simplicity, we assume that the relative components {w A } are chosen such that the horizontal components ni and components w, (r - s < ex  r) are linearly independent. Then the W p (1  [3  s) are called the secondary com- 
407 ponents. The differentials d8 p of the secondary parameters are linear combinations of the w p . Furthermore, let {xa} (1  0"  n) be local co- ordinates of E; then the differentials dXa are linear combinations of the differentials dk j and the horizontal components n i . Let G be a Lie transformation group of a space V. We regard two m-dimensional sur- faces WI and W 2 passing through xE V as equivalent if they have a tcontact of order p at x, Then an equivalence class of submanifolds is called a contact element of order p at the point x. Let Ep be the set of all contact ele- ments of order p at x, where x runs over all the points in V. A contact element of order p naturally determines a contact element of order p-1, and we denote this correspon- dence by 1/1: Ep-->Ep_I' Thus we obtain the series of correspondences V = Eot- EI <-.., <-EP-l t- Ep<-..., where a contact element of order 0 is identified with a point of V. Since a transformation on the space V induces a transformation on Ep, G is also a Lie transformation group of Ep, and this transformation commutes with the mapping 1/1. The fundamental invariants k j of Ep are said to be of order p. We use similar terminology (such as frames of order p, etc.) throughout this article. The fundamental in- variants k j of order p (1 j  t p ) can be chosen such that they contain the fundamental invar- iants k i (1  i  tp-d of order p -1. The ad- ditional t p - t p - 1 invariants k, (tp-l < ex  t p ) are called the invariants of order p. The family H: (YEE p ) of frames of order p can be chosen such that H: is contained in the family Hf-I (z = I/1YE Ep-d of frames of order p -1. If neces- sary, the family H: of frames of order p can be made connected by defining an orientation of contact elements of order p. Furthermore, the horizontal components n j (1 j  r - sp) of order p can be chosen such that they contain the horizontal components n i (1  i  r - sp-d. The additional Sp-I - sp components n, (r -Sp-I <exr-sp) are called the principal components of order p. Let W be an m-dimensional surface of a space V. The contact element of order p (;;:' 0) is determined at every point of Wand ex- pressed by the family of frames of order p and the values of invariants of orders less than or equal to p. Let {u;} (1  i  m) be local coordi- nates on W. Then the differentials dU i are given as linear combinations of linearly independent differential forms n i (1  i  m), where the n;, called the basic components of W, are certain linear combinations of the differentials of the fundamental invariants of V and of the horizon- tal components of orbits of V. Let P(W) be the set of all families of frames of order p; then 110 A Differential Geometry in Specific Spaces FP(W) depends on m parameters U i and sp secondary parameters 8 p of order p. On the space P(W), the differentials of invariants of orders less than or equal to p - 1 and the prin- cipal components of orders less than or equal to p - 1 are linear combinations of the basic components, whose coefficients are functions of the invariants of orders less than or equal to p. The differentials of invariants of order p and the principal components of order pare linear combinations of the basic components: dk,=h'lnl +... +h,mnm, t p _ 1 <extp, n,=b,lnl +... +b,mnm, r-sp_1 <ex.s;r-s p , where the coefficients h'i' b'i are functions of the invariants of orders less than or equal to p and, in general, the secondary parameters 8 p of order p. These coefficients are caIled the coeffi- cients of order p. Let rp be a subgroup of G preserving a family of frames of order p and Dp be a space whose coordinates are coefficients (h,;, b'i) of order p. Then rp acts on Dp as a transformation group. Knowledge of the prop- erties of contact elements of order less than or equal to p can be utilized to obtain infor- mation about the invariants of order p + 1, etc. In fact, if we can choose in the rp-space Dp a subspace C p that intersects each orbit in Dp at one and only one point, then in general the secondary parameters of order p associated with the points in C p correspond to the frames of order p, and the parameters associated with the points in C p are the invariants of order p + 1. The restrictions of the coefficients of order p to C p are functions of the invariants of orders less than or equal to p + 1; they are indepen- dent of the secondary parameters of order p. Thus the frames of order p + 1 and the in- variants of orders less than or equal to p + 1 determine the contact elements of order p of W and their differentials; generally, the latter can be utilized to determine the contact elements of order p + 1. This process of obtaining information of "order p+ 1" utilizing a suitable subspace C p of Dp is the so-called general method of moving frames. However, the surface W may contain points for which the general method does not apply. ActuaIly, there are surfaces W for which the method does not apply for any point in W. Thus various methods of moving frames are necessary to cope with different kinds of sur- faces. In the actual application of the method of moving frames, we use certain devices that help to simplify the calculations. In fact, an infinitesimal transformation (c'5h,;, c'5b,;) of the group rp acting on the space Dp is expressed as a linear combination of the secondary compo- nents of order p; this expression is easily ob- tained by means of the structure equations of 
HOB Differential Geometry in Specific Spaces G. The group r p +1 is a subgroup of rp fixing every point of the subspace C p , and its in- finitesimal transformation is such that (5h'i = 0, (5b'i = 0. The secondary components of order p + 1 are immediately obtained from the equations for dk, and n,. Furthermore, when m;3 2, the condition for the principal compo- nents of every order to satisfy the structure equations of G is essential to the problem of the existence of (m-dimensional) surfaces. As we apply the method of moving frames consecutively to a surface W, we eventually arrive at the order q having the following properties: The families of frames of order q + 1 coincide with those of order q, and the invariants kp of order q + 1 are expressed as functions <pp(k,) of invariants k, of order less than or equal to q. In this case, the families of frames of orders q + j (j;3 1) are all equal, and the invariants of orders q + j are partial (j -1)- derivatives offunctions <pp(k,), The family of frames of order q is called the Frenet frame, The differential invariants on a surface are defined to be differential forms generated by the basic components and the invariants of each order. Specifically, assume that the group G is an analytic transformation group of V, and the m-dimensional surfaces WI, W 2 are analytic. Then there exists an element 9 of G such that 9 WI = W 2 if and only if W 1 and W 2 are of the same kind and have the same relations among the invariants of orders less than or equal to q + 1. These relations are called the natural equations of the surface. The theory of sur- faces based on the analysis of the natural equations of surfaces is called natural geome- try. The reduction formula can be obtained by utilizing the Frenet frame; it gives the equation of the surface in the form of power series con- taining the invariants of each order. Various results are known concerning the theory of surfaces of the spaces VI' V 2 sharing the same transformation group G. We also have a theory of special surfaces whose in- variants satisfy specific functional relations, Furthermore, we have problems concerning the deformation of a surface (preserving some differential invariants). Actually, the theory of surfaces of dimension m other than curves and hypersurfaces is in general quite difficult. The methods of tensor calculus can be applied to the study of surfaces. The theory of tconnec- tions can be considered to be an outgrowth of the study of surfaces by means of the method of moving frames and tensor calculus. B. Projective Differential Geometry The rudiments of differential geometry sub- ordinated to the tprojective transformation 408 group, or projective differential geometry, can be found in the Theory of surfaces by J, G. Darboux. The subject has been systematically studied by H. G, H. Halphen, E. J. Wilczynski, and G. Fubini. The Fubini theory was en- riched substantially by E. Cartan, E. tech, E. Bompiani, and J. Kanitani. In this section we consider a surface S in a 3-dimensional projective space. Let A(u\ u 2 ) (u l , u 2 are parameters on S) be a point of S, and associate with A all the frames [A, AI' A 2 ,A 3 J (IA,Al,A2,A31= 1), where AI' A2' A3 are points of the tangent plane to S at A. A family of such frames is calIed the family of frames of order 1, and we express its differen- tial by 3 dA,= L wtAp, ex=0,1,2,3, Ao=A. po The wt are tPfaffian forms that depend on two principal parameters determining the origin A and ten secondary parameters determining the frame. We have w8+wi +w +w=O, w=O. Furthermore, w 1 == w6, w 2 == w;i are indepen- dent of each other and depend on the principal parameters only. Let Zl, Z2, Z3 be tnonhomo- geneous coordinates with respect to a frame of order 1. Then in a neighborhood of the origin, S is expressed by Z3 = L2f,., where the f,. are homogeneous functions of degree r with re- spect to z I, Z2. If we write f2 =(a o (zl)2 + 2a 1 Zl Z2 +a 2 (z2)2)/ 2, then it follows from the structure equations of the projective transformation group that wf=aow l +a l w 2 , w =alw l +a 2 w 2 . If we put <p2=ao(wl)2+2alwlw2+a2(w2)2, then a curve on S defined by <P2 = 0 is calIed the asymptotic curve and its tangent the asymp- totic tangent. At any point of this curve, the plane tangent to S is in contact of order 2 with this curve, and there are in general two asymptotic curves through any point of S. Equations of the asymptotic tangent at A are given by Z3 = 0, f2 = 0. A point of Sat which the asymptotic tangents coincide is called a parabolic point. If every point of S is parabolic, then S is a tdevelopable surface, and the general theory is not applicable to such a surface. Among the family of frames of order 1, a frame satisfying a o = a 2 = 0, a 1= 1 is called the frame of order 2. For this frame, the straight lines AA1 ' AA 2 are asymptotic tangents. With respect to this frame, if f3 = -(b o (Zl)3 + 3bdz1)2 Z2 + 3b 2 z l (z2)2 + b 3 (z2)3)/3, then wi = b o w 1 +b l w 2 , -w8+wi+w-w=2(blWl+ b 2 w 2 ), w=b2wl +b 3 w 2 , and the quadric surface Z3 == Zl Z2 - z3(b l Zl + b 2 z 2 + pZ3) (with p arbitrary) is called Darboux's quadric at A, an especially interesting one among contact quad- rics of S. Darboux's curve is a curve on S such 
409 that Darboux's quadric is in contact of order 3 at any point of it. Its tangent is called Dar- boux's tangent and is given by Z3=0, b o (ZI)3+ b 3 (Z2)3 = 0, We have b o b 3 # 0, except in the case of truled surfaces. We take special frames of order 2 determined by b l = b 2 =0, b o = b 3 == 1 and call them the frames of order 3. If a frame of order 3 satisfiesf4= _(CO(ZI)4+4cdzl)3Z2 + 6(C2 - 1 )(Zl Z2)2 + 4C 3 Z 1 (Z2)3 + c 4 (z2)4)/12, then wg-2w: +W=COWI +c I w 2 , w-w? =CIW I +C 2 W 2 , w -W=C2WI +C3W2, wg+w: -2W=C3WI +C 4 W 2 , With respect to this family of frames of order 3, «(W I )3 + (W 2 )3)/ 2w l w 2 is an invariant associated with two neighboring points of S, called the projective line element. Also with respect to this frame, two straight lines AA 3 , Al A 2 are polar with respect to Darboux's quadric. Among the families of frames of order 3, a frame satisfying C I = C 2 = C 3 = 0 is called a frame of order 4. With respect to this frame, there exist ..1., J1, v, P such that w? = AWl + J1W 2 , W=VWI +pw 2 , W=pWI + Aw 2 . Hence if we put Co = - 3a, C4 = - 3b, it follows that ( rg (wt)= w b W 2 w w ) I , W o -'0 Wi r l I w 2 W O 2 w 2 Wi -rl W O I rg= -(3/2)(aw l + bw 2 ), rl = (1/2)(aw l + bw 2 ). Thus the frame of order 4 is the Frenet frame and is attached to every point of S. This frame is called the normal frame, and the invariants a, b, ..1., J1, v, P are called the fundamental differ- ential invariants. The straight lines AA3 and Al A 2 associated with the normal frame are caIled directrices of Wilczynski of the first and second kind, respectively. With respect to the normal frame, S is expressed by Z3 = Zl Z2 _ ((ZI)3 + (Z2)3)/3 +(a(zl)4 + b(z2)4)/4+(ZI Z2)2 /2 +.... A necessary and sufficient condition for two surfaces S, S to be projectively equivalent is that there be normal frames having the same Wi, w 2 and the same six fundamental differen- tial invariants. For six quantities a, b, ..1., J1, v, P to be fundamental differential invariants of a surface, they must satisfy a certain condition of existence [6]. A frame of order 1 such that AA3 and Al A 2 are polar with respect to Darboux's quadric is called Darboux's frame. With respect to this frame also, a theory of surfaces has been established. Consider a pointwise correspondence be- tween two surfaces S, S, and denote by A E S the point corresponding to A E S, If there exists a projective transformation <p that transforms HOC Differential Geometry in Specific Spaces A into A and the image <p(S) is in contact of order 2 with S at rf, then the pointwise corre- spondence is called a projective deformation. A necessary and sufficient condition for the existence of a projective deformation between two surfaces is that these surfaces have the same projective line element [6]. A ruled sur- face is projectively deformable only to a ruled surface. Given an arbitrary surface S, it is generally impossible to find a surface that is different from S and projectively deformable to S; some conditions must be satisfied [6,8]. Let pOI, p02, p03, p12, p13, p23 be tPliicker coordinate of a straight line in a 3-dimensional projective space p3. Then we have pOI p23_ p02 pl3 + p03 pl2 = 0, and there is a one-to- one correspondence between the ratios of {pij} and straight lines ( 90 Coordinates B). If the pi j are regarded as homogeneous coordinates of a 5-dimenslOnal projective space p5, then the previous equation defines a hyperquadric Q in p5. Thus there is a one-to-one correspon- dence between points of Q and straight lines in p3. A curve on Q corresponds to a set of one- parameter families of straight lines, or a ruled surface. Sets of 2-parameter or 3-parameter families of straight lines corresponding to surfaces of 2 or 3 dimensions on Q in p5 are called congruences of lines or complexes of lines, respectively. Thus by using a theory of surfaces in p5, it is possible to establish the theory of congruences and complexes [2, 6, 8J, which is an important part of projective dif- ferential geometry. Specifically, if the surface is either a curve or a hypersurface, there are numerous interesting results [2,4, 6]. C. Affine Differential Geometry The theme of general affine differential geome- try is the study of differential-geometric prop- erties of a point or set of points in a space that are invariant under the action of the taffine transformation group. Affine differen- tial geometry is the study of the properties invariant under the action of the tequivalent affine transformation group, i.e., a subgroup of the affine transformation group formed by elements sending (Xi) to (Xi) such that n xi=ai+LaijXj, i=I,...,n, det(a i ) = 1. j=1 The latter transformation leaves invariant the volume surrounded by an oriented closed hypersurface. The method of moving frames is effective in affine differential geometry. Let C be a plane curve, and associate with any point A = A(t) of C a family offrames [A, e l , e 2 J, where the area of the parallelogram 
1100 Differential Geometry in Specific Spaces determined by the two vectors e l , e 2 is equal to 1. This frame is called the frame of order O. Its differential is expressed by 2 dA = L wSe" ::::1 2 de, = L w:e" s=1 r= 1,2, w +w=O. The frames of order 1, 2, and 3 are character- ized by w 2 =0; w 2 =0, wI=w 1 ; and w 2 =0, wI=w" w =0, respectively. Then a frame of order 3 can be associated with each point of C and coincides with the Frenet frame. We call Wi = d (J the affine arc element; the affine curva- ture K is defined by w1 = - K dO", Then the Frenet formula is given by dA=dO"e l , del =dO"e 2 , de 2 = -KdO"e l . With respect to this frame, C is expressed as y= x 2 /2 + Kx 4 /8 + (dK/dO")x 5 /40+.... Further, dO" and K are given analytically by dO" = IdA,d 2 AI I / 3 , K= W A/d0"2, d 3 A/dO" 3 I, where 1M, NI = det(M, N). M, N are column vectors with two entries. We call 0" the affine arc length. The straight line on which e z is situated is called the affine normal, the dia- meter of the parabola osculating C at A. If K is constant, then C is a conic section. Further- more, C is an ellipse, hyperbola, or parabola according as the constant K is positive, nega- tive, or zero. In affine geometry, parabolas play a role similar to that played by straight lines in Euclidean geometry. There are numerous results concerning the theory of skew curves and surfaces [1]. Con- cerning the theory of skew curves, results on affine length, affine curvature, affine torsion, affine principal normals, and affine binormals are similar to those in Euclidean geometry. The affine transformation group is situated between the projective transformation group and the congruent transformation group and hence has properties analogous to theirs. The theory of surfaces has a character similar to that of projective differential geometry [1]. We may also consider the variation of the affine area of a surface surrounded by a closed skew curve C. We call the extremal surface the affine minimal surface, W. Blaschke and others obtained many results on the global properties of such surfaces. D. Conformal Differential Geometry Let sn be a tconformal space of dimension n, and associate with each point Ao E sn a frame 9i[Ao,AI' ...,An,A",J of the t(n+2)- hyperspherical coordinates with origin Ao ( 76 Conformal Geometry). Then denoting 410 by A . B the tinner product of hyper spheres A, B, we obtain A,'Ap=g,p, ex,[3=O,I,...,n,oo, where (g,p)= (  -1 0 -I) gij o , 0 0 gij=gji' i,j== 1, ...,n, Let z' be homogeneous coordinates with respect to 9i. Then the tMobius transforma- tion z -->z of sn is characterized by z' == cpz P , where gapc:cf = g"" Ic!:1 # O. The differential of the family of the frames is defined by dA,= "f wt A p , po, I (1) where ( wg (wt)= L9W wb w j , L9W ) ' -wg Woo DgikWf + gjkwf) = dg ij , There are (n+ l)(n+2)/2linearly independent forms among w, and this is the number of parameters of the Mobius transformation group. The structure equations of this group are dw P = " w a /\ w P rx  a (1. (2) The theme of conformal differential geometry is the properties of Pfaffian forms wt satisfying (1) and (2). Consider a transformation c: L z' A,--> Lz'(A,+dA,). (1) If all w vanish except wg, then all the circles through Ao, Aoo are in- variant, and any point P is transformed to a neighboring point P on the circle, such that the cross ratio (P, P; Ao, Aoo) is constant. This transformation is called the homothety with centers Ao, Aoo. (2) If all w vanish except wb, w'(' = L gikW, then all the circles tangent to a fixed direction at A", are transformed into themselves, and any hypersphere through Aoo and orthogonal to those circles is transformed into a hypersphere having the same property. This transformation is called the elation with center Aoo' (3) If all (J) vanish except w? = Lg'kWk w j then the transformation is an ela;ionOOith"center Ao. (4) If all w vanish except w{, then the transformation is an in- finitesimal rotation with center Ao, with Aoo regarded as a point at infinity. Thus any in- finitesimal Mobius transformation is de- composed into the previous four types of transformation, To study the theory of curves and hypersur- faces in Sn, we again utilize the Frenet frame chosen from a family of frames associated with 
411 Ao. For example, the Frenet formula of a curve in S3 is given by ( 0 KdO" (wl) -f o 0 ) o dO" rdO" 0 , o 0 o 0 dO" o o o KdO" o o o - rdO" -dO" We call dO", K, and r the conformal arc element, conformal curvature, and conformal torsion, respectively. There are many results on con- formal deformation [5]. Concerning Laguerre differential geometry, we have results dual to those in conformal differential geometry (a point is replaced by a straight line and an angle by a distance be- tween the points of contact of the common tangents of two oriented circles). E. Contact Manifolds Consider a (2n + 1 )-dimensional differentiable manifold M2n+1 with a I-form I] such that I] /\ (dl])" # 0, where dl] is the exterior derivative of I] and /\ denotes exterior multiplication. (Note that this is true for the I-form in the left- hand side of eq. (2) in 82 Contact Transforma- tions A.) Such a manifold is called a contact manifold with contact form 1]. The structure group of the tangent bundle of a contact mani- fold M 2n + 1 reduces to U(n) x 1, where U(n) is the unitary group; hence every contact mani- fold is orientable. Simple but typical examples are given by the unit sphere s2n+1 in Euclidean space E 2n + 2 and the tangent sphere bundle of an (n+ I)-dimensional Riemannian manifold M n + l , both with natural contact forms (S, S, Chern [9]). Every 3-dimensional compact orientable differentiable manifold is a contact manifold (1. Martinet [12J). Now a differentiable manifold M 2n + 1 is said to be an almost contact manifold if it admits a tensor field <p of type (1, 1), a vector field ,;, and a I-form I] such that <p2X=-X+I](X),;, 1](,;)=1, where X is an arbitrary vector field on M2n+l; and the triple (<p,,;, 1]) is then called an almost contact structure. (3) implies that <p,; = 0 and 1](<pX) = 0 (S. Sasaki [14, IJ). The structure group of the tangent bundle of an almost contact manifold M 2n + 1 reduces to U(n) xl. Indeed, J. W. Gray [10J took this property as his definition of almost contact structure, For any pair of vector fields X and Yon M 2n + l , let N(X, Y)=[X, YJ+<p[<pX, YJ +<p[X, <p YJ - [<pX, <p YJ - {X 'I](Y)- Y'I](X)},;, 110 Ref. Differential Geometry in Specific Spaces (3) where [ , J is the Poisson bracket; then N is a tensor field of type (1, 2) over M 2n +1, which we call the torsion tensor of the almost contact structure (<p,,;, 1]). When N vanishes identically on M 2n +1, we say that the almost contact structure is normal. An almost contact structure (<p,,;, 1]) on M 2n +1 induces naturally an almost complex structure Jon M2n+1 X R (resp. M2n+1 X SI), which reduces to a complex structure if and only if(<p,,;, 1]) is normal. A similar statement is also valid for the product space of two almost contact manifolds (A. Morimoto [13J). If M2n+l is an almost contact manifold with structure tensor (<p,,;, 1]), we can find a positive definite Riemannian metric 9 so that g(<pX, <p Y) = g(X, Y) -I](X)I]( Y) for any pair of vector fields X and Y, and the set (<p, ,;, 1], g) is then said to be an almost contact metric structure. When M2n+1 is a contact manifold with contact form 1], there exists a unique vector field'; which satisfies dl](X, ,;) = 0, 1](';) = 1 for any vector field X. We can then find a tensor field <p of type (1, 1) and a positive definite metric tensor 9 so that (i) dl](X, Y)= g(X, <p Y) is satisfied for any pair of vector fields X and y, and (ii) (<p,,;, 1], g) is an almost contact metric structure. The almost contact metric structure determined in this way by a contact form I] is called a contact metric structure. A differenti- able manifold with normal contact metric structure is called a normal contact Rieman- nian manifold or a Sasaki an manifold. Bries- korn manifolds are examples of such mani- folds. They include, besides the standard sphere s2n+ I, all exotic (2n + 1 )-spheres that bound compact oriented parallelizable mani- folds. An almost contact manifold is said to be regular or nonregular according as the integral curve of'; is regular or not as a submani- fold, A compact regular contact manifold is a principal circle bundle over a symplectic mani- fold, and it admits a normal contact metric structure if and only if the base manifold is a Hodge manifold (Boothby and Wang [8J, Hatakeyama [11]). Many research papers on the topology and differential geometry of manifolds with the structures defined above have been pub- lished by S. Tanno, S. Tachibana, D. E. Blair, M. Okumura, K. Ogiue, S. I. Goldberg, and others. References [IJ W. Blaschke, Vorlesungen tiber Differen- tialgeometrie, Springer, II, 1923; III, 1929 (Chelsea, 1967), 
111 A Differential Geometry of Curves and Surfaces [2J G, Bol, Projektive Differentialgeometrie I, II, Vandenhoeck & Ruprecht, 1950, [3] E. Cartan, La theorie des groupes finis et continus, Gauthier- ViIlars, 1951. [4] E. Cartan, Le<;ons sur la theorie des es- paces a connexion projective, Gauthier-ViIlars, 1937. [5] p, C. Delens, Methodes et problemes des geometries differentieIles euclidienne et con- forme, Gauthier-Villars, 1927. [6] G. Fubini and E. Cech, Introduction a la geometrie projective differentieIle des surfaces, Gauthier- ViIlars, 1931. [7] J, Kanitani, Geometrie differentieIle pro- jective des hypersurfaces, Mem, Ryojun CoIL of Eng., 1931. [8] W. M. Boothby and H, C. Wang, On contact manifolds, Ann, Math., (2) 68 (1958), 721-734, [9] S. S, Chern, Pseudo-groupes continus infinis, CoIl. int. du CNRS, Geom. Diff., Stras- bourg (1953),119-135. [10J J. W, Gray, Some global properties of contact structure, Ann. Math., (2) 69 (1959), 421-450. [11] Y. Hatakeyama, Some notes on differenti- able manifolds with almost contact structure, T6hoku Math. J" (2) 15 (1963),176-181. [12] J, Martinet, Formes de contact sur les varietes de dimension 3, Proc, Liverpool Sin- gularities Symposium II (1971),142-163. [13] A. Morimoto, On normal almost contact structures, J, Math, Soc. Japan, 15 (1963), 420- 436. [14J S. Sasaki, On differentiable manifolds with certain structures which are closely re- lated to almost contact structure. I, T6hoku Math. J" (2) 12 (1960), 459-476. II (with Y. Hatakeyama), ibid" 13 (1961), 281-294. [15] K, Yano and S. Ishihara, Tangent and cotangent bundles, Dekker, 1973. [16J D. E. Blair, Contact manifolds in Rie- mannian geometry, Lecture notes in math. 509, Springer, 1976. 111 (VII.12) Differential Geometry of Curves and Surfaces A. General Remarks Let r be an timmersion of an m-dimensional tdifferentiable manifold M of class C' into an n-dimensional Euclidean space En. More pre- cisely, f is a differentiable mapping of class C' such that the tdifferential dfp is injective at every point p of M. The pair (MJ) is called 412 an immersed submanifold (or a surface) of En. When m= 1, we call it a curve of En, and when m = n - 1, a hypersurface in En. The cases of n = 2 and n = 3 have been the main objects of study in differential geometry of curves and surfaces, The differential-geometric properties for the general case of immersion are discussed in 365 Riemannian Submanifolds. B. Frames in En Every tEuclidean motion in En can be ex- pressed as the product of a parallel translation and an torthogonal transformation that keeps the origin of En fixed, The set of all parallel translations is a commutative group that can be identified with R n . It is a norma! subgroup of the group of motions I(P) of En, So we see that I(En) is a tsemidirect product ofR n and the torthogonal group O(n). The Lie algebra of I(E n ) is the direct sum of R n and the Lie algebra o(n) of the orthogonal group, where both are regarded as additive groups. Corre- sponding to this decomposition, we can write the tMaurer-Cartan differential form over I(E n ) as w+Q, where w belongs to R n and Q to o(n). The tstructural equation d(w+Q)= -(1/2)(w+Q) 1\ (w+Q) can be divided into the following two parts: dw=Q 1\ w; dQ= -(1/2)Q 1\ Q, These are known as the structure equations of En. By an orthogonal frame in En we mean an ordered set (x, e l , ..., en) con- sisting of a point x and a set of torthonormal vectors e l , e 2 , ...,en' We denote by @(n) the set of all orthogonal frames in En. If we denote the translation identified with x E Rn by Tx, then there is a one-to-one correspondence <p: I(En)-->G(n) given by <p(TxA)= (x, Ael' ..., Ae n ) (A EO(n)). We can make @(n) into a differ- entiable manifold so that <p is a tdiffeomor- phism. We denote the differential forms over @(n), which are images of wand Q under the tdual mapping of <p -I, by the same letters w and Q, respectively. For @(n) as a tprincipa! fiber bundle over R n with the projection n: n(x, ej, ..., en) = x and n vector-valued func- tions <Pi: <Pi(X, e j , "', e n )= e i over @(n), we have OJ = L wie i , i Q= L QiJE,}, i<j wi=(dn,<p;), Qi} =(d<Pi' <Pi)' (1) dol' = LQi} AW}, } dQiJ= LQik A Qk}, k where {EiJ is a basis of o(n) defined by Eije} = e i , E..e.= -e., E " J .e k =O (k#i,j) and ( , ) is the l} l J scalar product of vector-valued forms induced from the scalar product of En, Any diffeomor- phism of @(n) onto itself preserving wand Q must be a Euclidean motion. 
413 C. Theory of Curves Let (MJ) be an immersion of a I-dimensional differentiable manifold M into En. We identify the tangent space of En at each point with En itself. Then dfx maps the origin of the tangent space Mx to f(x), and the image dfX<M x ) of Mx by dfx is a straight line passing through f(x) in En, called the tangent line of f(M) at f(x). By (!!f(M) we mean the set of all ordered sets (x, e l , "', en), where x E M and {e i } is an ortho- normal basis of En such that e l EdfX<M x )' Then (!!f(M) can be naturally immersed in (!!(n) by the mapping J:l(x, e l , ..., en) = (/(x), e"..., en)' We can pulI back the diffe:..ential forms ill, n, w', nil over (!!(n) to (!!f(M) f* and denote them by 0, E>, Oi, and E>ij, respectively; then we have o i = 0 (i> 1). Let fl and f2 be two immersions of M into En. Then in order for there to exist a Euclidean motion ex of En such that fl = ex 0 f2, it is necessary and sufficient that there exist a diffeomorphism <P of (!!f1 (M) onto (!!J,(M) such that 0f1 = <P*(0J,), E>f 1 = <p*(E>J,)' Let n f be the projection of the fiber bundle (!!f(M), and let <Pi be naturally defined vector-valued functions over (!!f(M), Then we have d(/onf)=OI<p" d<Pi=L'j1 E>il<Pl' If we put ds 2 (X)= IldfX<X)f (XEM x ), then we have ({Jl)2 ==nJ(ds 2 ). For each point x E M there are two possibilities for the choice of e l corresponding to two orientations of the curve. But since (d<Pl, d<p!)= nJ(p2 ds 2 ), p2 depends only on the point x of M, We call p (;;:'0) the absolute curvature. We now choose an orientation of the curve and then e l in accordance with the orientation. Thus we get a submanifold of (!!f(M), which we again express by the notation (!!f(M), If we define the form ds by ds(X)=(df(X),e l ), we have 0 1 =nJ(ds), and ds is called the line element. Any tlocal cross section R:nfoR= 1 of the bundle (!!f(M) is called a moving frame, Putting R*(OI)=ds, R*(E>il)=pilds, we see that the following equation holds over M: de.= " p ildse. t  j" 1 For two immersions (MJl) and (M,f2)' we have fl =exOf2 (ex is a Euclidean motion) if and only if they have the same ds and pil for some moving frames, D. Frenet's Formulas In order to study local properties of curves it is sufficient to consider them on tJordan arcs of class cr. With respect to orthogonal coordi- nates (Xl, .." x n ) in En, such a curve is repre- sented parametrically by Xi = P(t) (tE [a, b J, L(dxijdtf > 0) or by a vector representation 111 E Differential Geometry of Curves and Surfaces x = x(t). If <P is a diffeomorphism of a closed interval [a', b'J onto [a, b J, then f 0 <P and f are representations of the same arc in En, and <P is called a transformation of the parameter. Any curve of class C l is trectifiable, and its arc length is given by s = J(Li=1 (dxijdtf) 1/2 dt. We may choose the arc length s measured from a point on the arc as a parameter, called the canonical parameter of the arc. Consider an arc C of class C" given by the vector repre- sentation x=x(s), sE[a,b]. We assume that its tWronskian Ix'(s), ..., x(n)(s)! is not identically zero (we denote by , the derivative with respect to the canonical parameter), which means that the arc C is not contained in a hyper- plane in En. A point at which the Wronskian vanishes is called a stationary point, and we assume that there exists no stationary point on C. By the tGram-Schmidt orthonormaliz- ing process we obtain an orthonormal basis e" "', en (Ie!, .., ,en I > 0) from n vectors x'(s), "', x(n)(s) at each point of C, We call the frame thus determined the Frenet frame. With re- spect to the Frenet frame, (2) is rewritten as e;(s)= - Ki-l (s)e i - l (s) + Ki(s)ei+l (s), i= 1, ...,n; KO(S) = Kn(S) = 0; K](S) > 0, (3) j=I,...,n-2. (2) These are called Frenet's formulas (or the Frenet-Serret formulas). We call K" K2' ..., K n -2 the first, second, ..., (n-2)nd curvature, respectively, while we call Kn-l the torsion for n;;:,3. For a curve in a lower-dimensional sub- space Em c En, we set Ki = 0 (i > m). The curva- tures and the torsion of a straight line are zero. To get Frenet's formulas in these special cases, we fix e i (i> m) in the subspace ortho- gonally complementary to Em in En and pro- ceed as in the general case. Suppose that C l , C 2 are arcs such that both of their Frenet frames are of class C l . If there exists a diffeo- morphism of C l to C 2 tht preserves arc length and the K i (i = 1, "', n-l) are equal at corre- sponding points, C l and C 2 are mapped onto each other by a motion of En. This is the fun- damental theorem of the theory of curves. Given n - 1 functions of class C I K I (s);;:' 0, .. . , K n -2(S);;:'0 (we assume that the equality signs occur at most at a finite number of points) and K n - l (s) for 0:( s:( L, there exists an arc that has KI, ..., K n -2, K n - l as its first, "" (n - 2)nd curvatures and its torsion, respec- tively. The equations K i = Ki(S) are called the natural equations of the curve. E. Plane Curves Let x = x(s) be a curve of class C 2 in E 2 , and (x(s), e l , e 2 ) its Frenet frame. The tangent and 
111 F Differential Geometry of Curves and Surfaces the normal of this curve at x(s) have para- metric representations x(s) + tel> x(s) + te 2 , respectively (with parameter t). Frenet's for- mulas are written as x' = e l , x" = e'l = Ke 2 , and p is called the curvature of the curve C. The natural equation is given by K= K(S). If K(S)= constant #0 along C, C must be a portion of a circle. Another way of defining the curvature is as folIows: We take a fixed direction (for example, the positive direction of the x-axis on E2) and denote by O(s) the angle made by the tangent T, of the curve C at x(s) with the direc- tion. Then we have K(s)=dOlds. If n=2, the curvature can take both positive and nega- tive values. Figs. 1 and 2 suggest a geometric meaning of K>O and K<O, respectively. The circle with center x=x(s)+(I/K)e 2 and radius IlK has a contact of higher order than any other circle in E 2 . We call this circle the oscu- lating circle (or circle of curvature) at the point x = x(s), its center the center of curvature, and IlK the radius of curvature. The locus C' of the center of curvature of a curve C is called an evolute of C. Conversely, C is called an involute of C', the tenvelope of the family of normal lines of C. When a curve is given in terms of its canonical parameter s, the curvature is given by Ix'(s), x"(s)l; when the curve is given by another parameter t as x = x(t), the curvature is given by K(t)= Ix'(t), x"(t)l/lx'(tW, where ' means dldt. Fig. 1 K>O, Fig. 2 K<O, The facts we have just stated concern local properties of plane curves. We shall now dis- cuss the global theory of curves, which deals with properties of each Curve as a whole. Let C:x=x(s), a:(s:(b, be a closed curve. Let O(s), 0:( O(s) < 2n, be the angle that e(s) makes with the x axis. Put e = J: O'(s)ds. Intuitively, e measures the total rotation of e l (s) as we run along the curve C from a to b, Since Cis closed, e is an integer multiple I of 2n. The integer I is called the rotation number of C, and is equal to (1/2n)JiK(s)ds. Let i5 be a closed domain consisting of points in the inte- rior and on the boundary of a simple closed curve C. C is called a closed convex CUrve or an oval if i5 is tconvex in E 2 . Among alI ovals of given length, the circle has the maximum area. Various generalizations of this theorem have been obtained, and the colIection of 414 problems of this kind is called the isoperimet- ric problem. This problem has intimate con- nections with fields such as integral geometry. The oval has a convenient parameter other than the arc length parameter s. Given a num- ber t, 0:( t:( 2n, there exists a unique point x(t) in the oval such that e 2 = (cos t, sin t) at x(t). When we describe the oval in terms of the parameter t, the tangent vector at x(t) is parallel to that at x(t + n), and we can define the width W(t) at x(t), W(t) is called the width of the oval. A curve is calIed a curve of con- stant width if the curve is an oval whose width W(t) does not depend on t. The circle is a typical example of a curve of constant width. Reuleaux's triangle is another well-known example of a curve of constant width (Fig. 3). F or a curve of constant width of width W and length L, we have L = n W. Fig. 3 There are also some results concerning the relations between local properties (for exam- ple, curvature) and properties of the whole figure. An example is given by the four-vertex theorem. A vertex on a curve C is by defini- tion a point where dKlds = O. Then there are at least four vertices on an oval of class C 3 . A simple closed curve with K;;' 0 ( :( 0) must be convex ( 89 Convex Sets). F. Space Curves Let x = x(s) (SE [a, bJ) be a curve C of class C 3 in E 3 defined in terms of the canonical para- meter s. Let (x(s), e l , e 2 , e 3 ) be Frenet frames along C. Then we have the Frenet formulas e'I==K l e 2 , e=-Klel+K2e3' e=-K2e2' We call IlK!> l/K2 the radius of curvature and the radius of torsion, respectively. The line x == x(so) + tel is the tangent of C at x(so)' The two straight lines through the point x(so) defined by x = x(so) + te 2 and x = x(so) + te 3 are called the principal normal and the binormal of C at x(so), respectively. The three planes through x(so) defined by x = x(so) + te 2 + te 3 , x=x(so)+te 3 +te l , and x=x(so)+ tel + tel are calIed the normal plane, the rectifying plane, and the osculating plane, respectively. 
415 At a point x(so) of a curve x = x(s) of class CO>, we take edso), e 2 (so), and e 3 (so) as unit vectors of the coordinate axes. Substituting the Frenet formulas into the Taylor expansion of x(s), we see that the new coordinate xds), x 2 (s), x 3 (s) of C are given by XI = (s - so) -(Kdso)/6)(s - SO)3 +..., X2 = (K I (so)/2)(s - SO)2 + (K'dso)/6)(s - so)3 + '" , X3 = (K I (So)K 2 (So)/6)(s - so)3 +.... These are called Bouquet's formulas. Utilizing these formulas we can see the nature of the curve with given KI and K2' A curve and its osculating plane at a point on it have contact of order higher than any other plane through that point. The family of osculating planes of C tenvelops a tdevelopable surface Sand coincides with the locus of tangent lines to C. We call S the tangent surface of C, and C the line of regression of S. The family of rectifying planes of C also envelops a developable sur- face called the rectifying surface, and C is a tgeodesic on this surface. The family of normal planes of C envelops either a cone or a tangent surface of another curve C. When the natural equation of a space curve has a special form. the shape of the curve is simple. For example, Kds) = constant, K2(S) = constant represent a curve, called an ordinary helix, on a cylinder which cuts all the generators of the cylinder at a constant angle. More generally, it is known that if KdK2 = constant, the tangent at each point of the curve makes a constant angle with a fixed direction. Such a curve is called a gen- eralized helix or a curve of constant inclination. Each curve satisfying aK 1 +bK 2 =c (ab#O) is called a Bertrand curve. For a Bertrand curve there exists another curve E and a corre- spondence of C onto E such that they have a common principal normal at corresponding points. Conversely, this property is also a sufficient condition for C to be a Bertrand curve. A Mannheim curve is defined analo- gously as a curve having a correspondence with another curve E such that the principal normal of C and the binormal of E coincide at corresponding points, When a correspondence of C and E has the property that tangents at corresponding points are parallel, then the correspondence is called a correspondence of Combescure. We have stated mainly local properties of space curves. There are also several results about global properties of curves in E 3 analo- gous to the case of plane curves. For a sim- ple closed curve C of length L, we call K = SK1(S)ds the total curvature of C. Generally we have K 2n, while K =2n if and only if C is a closed convex curve lying in a plane (W, Fenchel) [5,6]. The total curvature is deeply lliG Differential Geometry of Curves and Surfaces related to the properties of tknots. If a simple closed curve in E 3 is knotted, then the total curvature is at least 4n [7,8]. We fix an origin o in E 3 and draw a unit tangent vector with initial point 0 parallel to the unit tangent vector at each point of a space curve C; then the endpoint of this vector traces a curve E on the unit sphere with the center O. We call E the spherical indicatrix of C and the corre- spondence of C to C a spherical representation. The total curvature K of a curve C is equal to the length of E. Consequently, we have K = cdO, where 0 is the angular deflection of the tangent line along the closed curve C, G. Theory of Hypersurfaces Let (MJ) be an immersion of an (n -1)- dimensional differentiable manifold M of class C' into En, Then we can define on the hyper- surface M a positive definite differential form 9 of degree 2 induced from the inner product of En:gAX, X) = (dfAX), dfx(X)), XEMx' Then M becomes a tRiemannian manifold with tRiemannian metric g. We call 9 the first fun- damental form of (MJ), The Riemannian geometry on a surface with its first funda- mental form as Riemannian metric is called geometry on a surface ( 364 Riemannian Manifolds). By (!)f(M) we mean the set of all the ordered sets (x, e 1 , .., , en) E (!)(n), where x E M and {e i } (i = 1,2",., n) is an orthonormal system of En such that eiEdfAMx)(i= 1, "', n-1), Then (1Jf(M) with natural projection n f and natural differentiable structure is a tprincipal fiber bundle over M and has a natural immersion l:l(x, e b ... ,en) = (/(x), e b .", en) in the prin- cipal fiber bundle (!)(n), We can tpull back the forms on (!)(n) to (!)f(M) by 1* and put 0= l*(w), 8 = 1*(0.); then the structural equa- tions of En are transformed to dO = 8 /\ 0, d8( -1/2)8 /\. Furthermore, if we put Oi = f*(w i ), 8 ii = f*(o.ii), then on = 0 and Oi, 8 ii (i,j < n) depend only on the first funda- mental form of (MJ). Let f1 and f2 be two immersions of M into En. Then in order that there exist a Euclidean motion ex of En such that f1 = ex 0 f2, it is necessary and sufficient that there exist a diffeomorphism <p of (!)f 1 (M) onto (!)J,(M) such that Of 1 = <P*(Of,) and 8 f1 = <p*(8 f J Suppose that M is torientab1e and oriented. Then the unit vector field normal to dfx(M x ) at every point XEM in En defines a mapping of M into the unit sphere in En called the spherical representation of M or the Gauss mapping (Gauss map). Regarding the unit normal vector field  of M as a vector-valued function over M, we can define a symmetric product of df and d( by - (df, d()(X, Y) = 
111 H Differential Geometry of Curves and Surfaces (1/2) [(df(X), d(Y))+ (d[(Y), d(X))], called the second fundamental form of (MJ), Two immersions fl and f2 of M that induce the same first and second fundamental forms have a Euclidean motion ex such that fl = exOf2; and the converse is also true. This fact is called the fundamental theorem of the theory of surfaces, H. Theory of Surfaces in E 3 ( 365 Riemann- ian Submanifo1ds; Appendix A, Table 4.1) A surface in En is locally expressed by para- metric equations Xi = xi(u,) (i = 1, ... , n; ex = 1, ..., m) or by a single vector equation x = x(u,). We are mainly concerned with the case n= 3, m == 2, and we express the surface by a vector representation x = x(u, v). The first and second fundamental forms are written as Edu 2 + 2F dudv+ Gdv 2 , Ldu 2 +2M dudv+N dv 2 . If we use the usual notation of ttensor analy- sis, then the first and second fundamental forms are also denoted by gap du' du P and H,pdu'duP, respectively, where u' (ex = 1,2) are parameters (with L omitted by tEinstein's convention). We call {gap}, {H,p} the first and second fundamental quantities, respectively ( 365 Riemannian Submanifo1ds). At the point Po = x(u o , v o ) on a surface S that corresponds to parameter values (u o , vo), the curves ex- pressed by v = V o and u = U o are called a u- curve and a v-curve through Po, respectively, Let xu, Xv denote the tangent vectors ax/au, ax/av at Po to the u-curve and v-curve, respec- tively, through the given point Po and  de- note the unit vector orthogonal to Xu and xv' Then  is called the normal vector of S at Po and (xu, xv, 0 the Gaussian frame of S at Po' Although a Gaussian frame is not in general an orthogonal frame, it is intimately related to local parameters. The plane that passes through the point Po and is spanned by xu, Xv is called the tangent plane to S at Po. The coefficients of the second fundamental form L(u, v), M(u, v), N(u, v) are expressed by the inner products L = ( - xu, u), M = ( - xu, v), N = (-xv, v) (u = a/au, v = a/av). Let (X, Y) be the coordinates of a point on the tangent space at Po with respect to the Gaussian frame. We call the curve of the sec- ond order defined by LX 2 +2MXY+Ny 2 =F. (F. is a suitable constant) the Dupin indicatrix. The point Po is called an elliptic point or a hyperbolic point on S according as the Dupin indicatrix at the point is an ellipse or a hyper- bola, If Po is an elliptic point, then points near Po on the surface lie on one side of the tangent plane at Po, whereas if Po is a hyperbolic point, 416 points near Po on the surface lie on both sides of the tangent plane at Po (Figs. 4, 5). A hyper- bolic point is also called a saddle point, since in a neighborhood of the point the surface looks like a saddle, A point that is neither elliptic nor hyperbolic is called a parabolic point; at a parabolic point we have LN - M 2 = O. If at least one of L, M, N does not vanish at Po, then there is a neighborhood of Po of the sur- face that lies on one side of the tangent plane at Po (Fig. 6). If a vector (X, Y) on the tangent plane of the surface at Po satisfies the equation LX 2 + 2M X Y + N y 2 = 0, then the direction of the vector is called an asymptotic direction. If the point Po is elliptic, such a direction does not exist; if Po is hyperbolic, the direction is an tasymptotic direction of the Dupin indicatrix on the tangent plane at Po. A curve C on a surface such that the tangent line at each point of the curve coincides with an asymptotic direction of the surface at the point is called an asymptotic curve. ;; .........----.......... Fig. 4 Elliptic point. ....----.......... Fig. 5 Hyperbolic point. , -r-------------- ;,/ I ' // I / ,-------------_/ Fig. 6 Parabolic point. Let C: x = x(u(t), v(t)) be a curve through Po on the surface x = x(u, v), Then the curvature K of C as a space curve is given by Ldu 2 +2M dudv+N dv 2 KCOSO - , Edu 2 +2F dudv+ Gdv 2 
417 where dudv is the direction of C on the surface at Po and () is the angle between the normal of the surface at Po and the principal normal of C at Po. The center of curvature at a point Po of a curve C of class C 2 on a surface of class C 2 is the projection on its osculating plane of the center of curvature of the section C* (of the surface) cut by the plane determined by the tangent to the curve at Po and the normal of the surface at the point (Meusnier's theorem). The curvature of the curve C* at Po is called the normal curvature of the surface at the point for the tangent direction, Since the normal curvature for a direction at a point is a con- tinuous function of this direction that can be represented as a point on a unit circle, there exist two directions that realize the maximum and minimum of the normal curvature. These directions are given by the equation I Edu+PdV Pdu+Gdv 1 -0 L du + M dv M du + N dv - , When this quadratic equation in du/dv has nonzero discriminant, it determines two direc- tions defined by its two roots. These directions are called principal directions at the point. A curve C on a surface such that the tangent line at each point of the curve coincides with a principal direction at the point is caIled a line of curvature. When all lines of curvature of a surface are circles, the surface is called a cyclide of Dupin. The two normal curvatures corresponding to two principal directions are given by I/R, satisfying the following second- order equation: (y EN+GL-2PM 1 LR-M 2 + - 0 EG-p 2 R EG-p 2 ' They are called principal curvatures, and each of their inverses is called a radius of principal curvature. The mean value H=(K I +K 2 )/2 of two principal curvatures K i = I/Ri (i = 1, 2) is called the mean curvature (or Germain's curva- ture), and the product K = Kl K2 is called the total curvature (or Gaussian curvature), These are given by 1 EN + GL-2PM H=- 2 EG-p 2 ' LN-M 2 K- - EG-p 2 . A point on a surface is elliptic, hyperbolic, or parabolic according as K > 0, K < 0, or K = ° at the point. A point where the second fun- damental form is proportional to the first fundamental form is called an umbilical point, and a point where the second fundamental form vanishes is called a flat point or a geo- desic point. If a surface consists of umbilical points only, the ratio (L(du)2 + 2M du dv + N (dV)2)/(E(du)2 + 2P du dv + G(dU)2) is a con- stant, and the surface is either a sphere or a portion of it. If a surface consists of flat points 11tH Differential Geometry of Curves and Surfaces only, the surface must be either a plane or a portion of it. The mean curvature and the Gaussian curvature of a sphere are constant, and those of a plane are both equal to zero. If we use the spherical representation of a surface stated in Section G, we can give to the Gauss- ian curvature the following geometric mean- ing: Let A be the area of the domain enclosed by a closed curve C around a point Po on a surface, and let A * be the area of the domain on the unit sphere enclosed by the curve that is the image of C under the spherical represen- tation of the surface, Then the limit of A * / A as the closed curve C tends to the point Po is equal to K at Po' Let us denote by (g'p) the inverse matrix of the matrix (g,p) whose elements are coefficients of the first fundamental form g,pdu'du p . We easily see that gll =G/(EG-F 2 ), g12=g21 = -P/(EG-p2), g22=E/(EG_P2), We intro- duce the symbols [[3 ex J =! ( ag,p + ag y , _ a g py ) 1', 2 au Y au p au' ' {;J=g'O[[3y,b], which are called the Christoffel symbols of the first and second kinds, respectively, Suppose that a surface is given by the vector represen- tation x = x(u l , u 2 ), and put X,= ax/au', x,p = ax,/au p . Then for the derivatives of the Gaus- sian frame, we obtain x,p = {:[3 }XY + H,p, ,= -gyPHp,xy- We call the former Gauss's formula and the latter Weingarten's formula. The integrability conditions of these partial differential equa- tions are R' pyo=HopHy"- HypH:, Hy"=g,aH ya , aH,p aH,y { O" } { O" } ---+ H - H =0 au Y au p ex[3 ay exy ap , where R"pyo== ay {b}- ao {y}+ {bO"[3} {y:} -t}{b:} are components of the curvature tensor. The former are called the Gauss equations, and the latter the Codazzi-Mainardi equations. In connection with these equations, Bonnet's fundamental theorem states the following: Suppose that a positive definite symmetric matrix (g,p) and a symmetric matrix (H,p) are given that are functions of class C 2 and CI, respectively, defined over a tsimply connected domain D in R 2 . If they satisfy the Gauss 
111 I Differential Geometry of Curves and Surfaces equations and the Codazzi-Mainardi equa- tions, then there exists a surface x = x(u l , u 2 ) with the given (g,p) and (H,p) as coefficients of its first and second fundamental forms, respec- tively. Such a surface is determined uniquely if, for an arbitrary fixed point (u?, u) of D, we assign an arbitrary point Po and a frame (x?, x, o) at Po so that x?, x are orthogonal to the unit vector o and (x, x) = g,p(u?, u) as the Gaussian frame at Po. On the other hand, we can investigate surfaces as Riemannian manifolds defined by the first fundamental form. A tdiffeomorphism between two surfaces preserving arc length is called an isometric mapping. The condition of preserving arc length is equivalent to the condition that the first fundamental quantities of the surfaces coincide at each pair of corresponding points, provided that we have introduced parameters on the two surfaces so that corresponding points have the same parameter values. In such a case two surfaces are said to be iso- metric. From the Gauss equation we can see that the total curvature depends only on the first fundamental quantities, So K is a quantity that is preserved under isometric mappings (Gauss's theorema egregium). A vector field A'(t)x, defined along a curve u' = u'(t) on a surface is said to be parallel in the sense of Levi-Civita along the curve if its tcovariant derivative along the curve vanishes, i.e" if (jA'jdt = dA'jdt + {;y} A P duYjdt= O. The length of a vector belonging to a vector field that is parallel along a curve C is constant along C. The angle of two vectors both be- longing to vector fields that are parallel along C is also constant along C. Choose two vector fields A(a) parallel along a curve C on a surface that satisfy g,pA(a)Afb) = (jab' Then the tangent vector to C is expressed by du'jdt= A(a)va(t), Take a 2-plane and fix an orthogonal coordi- nate system on it; then the integral curve C of a set of ordinary differential equations dx'jdt = C)va(t) (C: = A(a)(PO)) is called the develop- ment of C ( 80 Connections). We denote by K the curvature of a curve C of class C 2 on a surface S of class C 2 and by 0" the angle be- tween the binormal of C and the normal of S at the same point. Then the quantity Kg= KCOSO", belonging to the geometry on the sur- face, is called the geodesic curvature of the curve at the point. A curve with vanishing geodesic curvature is called a geodesic, It satis- fies the differential equations d 2 u' + { ex } dU P duY = O. ds 2 [3y ds ds 418 The development of a geodesic is a straight line ( 364 Riemannian Manifolds). Let us consider a simply connected, orient- able bounded domain D on a surface such that the boundary of D is a simple closed curve C that consists of a finite number of arcs of class C 2 . If we denote by ex i (i == 1, 2, ..., m) the external angles at vertices of the curvilinear polygon C (Fig. 7), we have r Kgds+.f ex i + If Kd(J=2n. Jc II D This is called the Gauss-Bonnet formula. In particular, if all the arcs of C are geodesics, we have .t ex i + If KdO"=2n. l-l D This formula implies as special cases the following well-known theorems in Euclidean geometry and spherical trigonometry: (i) The sum of interior angles of a triangle is equal to n. (ii) The area of a spherical triangle is pro- portional to its spherical excess. The formula also implies the following theorem: On any closed orientable surface we have J J K dO" = 2nx, where X is the tEuler characteristic of the surface. We caIl J J K dO" the integral curvature (or total Gaussian curvature). Fig. 7 I. Special Surfaces in E 3 A surface is called a surface of revolution if it is generated by a curve C on a plane n when n is rotated around a straight line 1 in n. Then 1 and Care caIled an axis of rotation and a generating curve, respectively. A surface of revolution having the x 3 -axis as the axis of rotation is given by the equations XI = rcos 0, x 2 = r sin 0, X 3 = <p(r); its first funda- mental form is (1 + <p'2)dr 2 + r 2 df.J2. The sec- tion of a surface of revolution by a half-plane through its axis of rotation is called a meri- dian. According as the meridian is a straight line parallel to the axis of rotation or a straight line intersecting the axis nonorthogonally, the surface of revolution is called a circular cylinder or a circular cone, respectively. If the 
419 meridian is a circle that does not intersect the axis of rotation, it is called a torus. A surface of class C 2 whose mean curvature H vanishes everywhere is caBed a minimal surface ( 275 Minimal Submanifolds). A surface of class C l realizing a relative mini- mum of areas among all surfaces of class C l with a given closed curve as their boundaries is an tanalytic surface such that H = O. Con- versely, a surface of class C 2 with vanishing mean curvature is an analytic surface. The equation of a surface of revolution with a cate- nary as its generating line is given by xi + x == a(ex,/a + e -x,/a)/2. This surface is caBed a cate- noid and is a minimal surface. Conversely, a minimal surface of revolution is necessarily a catenoid. For a surface obtained by rotating a tDelaunay curve around its base line, the mean curvature H is equal to a constant ( # 0). Con- versely, a surface of revolution with nonzero constant mean curvature must be such a sur- face. A surface with constant Gaussian curva- ture is caBed a surface of constant curvature, and is a 2-dimensional Riemannian space of constant curvature ( 364 Riemannian Mani- folds). A non-Euclidean plane can be repre- sented locaBy as a surface of constant curva- ture ( 285 Non-Euclidean Geometry), Two surfaces of the same constant curvature are locally isometric to each other. Surfaces of revolution of constant curvature are classified, The simplest surface of constant negative curvature is a pseudosphere, which is a surface of revolution obtained by rotating a ttractrix Xl =acos<p, X3=alogtan(<p/2)+ (n/4») - a sin <p ( - n/2 < <p < n/2) around the x3-axis. A surface generated by a I-parameter family of straight lines is caBed a ruled surface; a hyperboloid of one sheet, a hyperbolic para- boloid, a circular cylinder, and a circular cone are examples. The first two can be regarded as ruled surfaces in two ways. Each of the straight lines that generate a ruled surface is called a generating line. A surface consisting of straight lines paraBel to a fixed line and pass- ing through each point of a space curve Cis caBed a cylindrical surface with the director curve C. A surface generated by a straight line that connects a certain point 0 with each point of a curve C is called a conical surface. Both a cylindrical surface and a conical surface are ruled surfaces such that K = 0 everywhere. For ruled surfaces we have K .s;0. In particular, a surface such that H # 0 and K = 0 everywhere is cal1ed a developable surface, A developable surface must be either a cylindrical surface, a conical surface, or a tangent surface of a space curve, There exist ruled surfaces that are not developable, for example, hyperboloids of one sheet and hyperbolic paraboloids. A nondeve1- opable ruled surface is caBed a skew surface, A 1111 Differential Geometry of Curves and Surfaces ruled surface generated by a straight line that moves under a certain rule intersecting a fixed straight line I orthogonaBy is called a right conoid. If we take I as the x 3 -axis, the surface is given by the equations Xl =ucosv, X 2 = usin v, X 3 = f(v). A surface generated by a curve C (C may be chosen as a plane curve) that moves in the direction of a fixed line I with constant velocity and turns around I with certain constant angular velocity is cal1ed a helicoidal surface, If we take I as the x 3 -axis, the surface is given by the equations Xl = ucosv, x 2 =usinv, x 3 = f(u)+kv, where k is a constant and X 3 = f(x l ) is the equation of C. In particular, if C is a straight line that intersects I orthogonaBy, then f(u) = 0, and the surface is called a right helicoid (or ordinary helicoid). A right conoid is both a ruled surface and a minimal surface. Conversely, a ruled surface that is also a minimal surface is necessarily a right conoid. A helicoidal surface with a trac- trix as the curve C is caBed a Dini surface and is a surface of constant negative curvature. On the normal of a surface S two points qi (i = 1,2) are centers of principal curvature at p, The locus of each of these points is a surface called a center surface of S. When S is a sphere, two center surfaces degenerate to a point; if S is a surface of revolution, one of the center surfaces degenerates to the axis of revolution and the other is a certain surface of revolution, If S is general, each of the center surfaces is the locus of an edge of regression of the developable surface generated by normals of S along a line of curvature. When a I-parameter family of surfaces S, is given by the equation F(x I ,x 2 ,x3' t)=O, a surface E that does not belong to this family is caBed an enveloping surface of the family of surfaces {S,} if E is tangent to some S, at each point of E, that is, if E and S, have the same tangent plane. The equation of E is obtained by eliminating t from F(X l ,X 2 ,X 3 ,t)=0 and (8F/8t)(X l ,X 2 ,X3,t)=0. In general, if we de- note by <p(x l , X2, x 3 )= 0 the equation ob- tained by eliminating t from F == 0 and 8F /8t = 0, then the surface defined by <p = 0 is either the enveloping surface of {S,} or the locus of singular points of S,. The intersection C'o of the enveloping surface E of {S,} and S'o is a curve defined by F(x l , X 2 , X 3 , to) =0, (8F /8t)(x" X 2 , X 3 , to)=O, We caB C'o a charac- teristic curve of {S,}. Since {C,} is a family of curves on the enveloping surface E, there may exist an envelope F on E. In such a case, F is caBed the line ofregression of {S,}. The equa- tion of F is obtained by eliminating t from F = 0, 8F/8t=0, and 8 2 F/8t 2 =0, In particular, the enveloping surface of a family of planes is a developable surface, and their characteristic curves are straight lines. Moreover, the line of 
III J Differential Geometry of Curves and Surfaces regression coincides with the line of regression of the tangent surface. If there exists a diffeomorphism between two surfaces such that first fundamental forms at each pair of corresponding points are pro- portional, then the surfaces are said to be in a conformal correspondence. In particular, when the proportionality factor is a constant, they are said to be in a similar (or homothetic) correspondence. There exists a local conformal correspondence between any analytic surface and a plane. Namely, if we choose suitable parameters, we can reduce the first funda- mental form of any analytic surface to the form A(, I])(d2 +dl]2). Such parameters are called isothermal parameters. From the exis- tence of isothermal parameters we can see that there exists a local conformal correspon- dence between any two analytic surfaces. The assumption of analyticity in these theorems is not necessary [10]. If there exists a diffeomor- phism between two surfaces under which geo- desics are mapped to geodesics, then the sur- faces are said to be in geodesic correspondence. A surface has a locally geodesic correspon- dence with a plane if and only if it is a surface of constant curvature. If two surfaces are in geodesic correspondence, then with respect to parameters with the same values at corre- sponding points, we have the relation {;y }={;y }+bpAy+b; AfJ for coefficients of connections of the two surfaces. By the tAlexander-Pontryagin duality theo- rem, a submanifold M in E 3 that is homeo- morphic to S2 divides E 3 into two domains, and two points belonging to different domains cannot be connected by a broken segment unless the segment meets the surface. Such a manifold M is called a closed surface. One of the two domains consists of those points with bounded distance from a point belonging to the domain. Such a domain is called the inte- rior of the closed surface M. If the set M* con- sisting of M and its interior is convex in E 3 , the surface M is called a closed convex surface (or ovaIoid). The Gaussian curvature of an ovaloid can- not be negative at any point. A closed surface with K > 0 must be an ovaloid. Moreover, it is known that on any closed surface there exists at least one point where K>O (J. Hadamard). If there exists no umbilical point and K is strictly positive in a domain on a surface, then the two principal curvatures regarded as con- tinuous functions on the domain cannot take their local maximum and local minimum values at the same point (D, Hilbert). A com- 420 pact, connected surface of class C 4 with con- stant Gaussian curvature is a sphere. A closed surface with K>O and H = constant is a sphere (H. Liebmann). A problem proposed by H. Hopf asks whether an orient able compact surface with constant mean curvature is a sphere. In con- nection with this problem, Hopf showed that a closed orient able surface of class C 3 of genus zero with constant mean curvature is a sphere. If there is a certain relation W(k l , k 2 )== 0 be- tween the two principal curvatures k l , k 2 (k l  k 2 ) of a surface, the surface is called a Weingarten surface (or W-surface). There are many interesting results for W-surfaces. As an extension of the convex surface, tight immer- sions have been studied ( 365 Riemannian Submanifolds). The Gaussian curvature K is invariant under isometries. Hence a sphere is trans- formed to a sphere by each isometry. This fact is sometimes described as the rigidity of a sphere. More generally, if two ovaloids are iso- metric, then they are congruent (Cohn-Vossen's theorem). It is known that if we remove a small circular disk from a sphere, then the remaining portion of the sphere is isometrically deform- able. On the existence of closed geodesics on ovaloids, G. D. Birkhoffproved the follow- ing theorem: There exist at least three closed geodesics on any ovaloid of class C 3 . It is also known that there exist surfaces of revolution that are not spheres but whose geodesics are all closed ( 178 Geodesics). On a hyperbolic non-Euclidean compact tspace form of genus P (p  2) there exists a geodesic whose points are everywhere dense in it (E. Hopf) (for the ergodicity of flows along geodesics on this surface  136 Ergodic Theory; also 126 Dynamical Systems). J. Singular Points of a Surface Suppose that a neighborhood of a point Po of a surface S in E 3 is given by a certain vector- valued function f of class C' as t = f(u, v). Then a point Po where two vectors (iJf/iJu)po' (iJf/iJv)Po are linearly independent is cal1ed a regular point. A point on S that is not regular is called a singular point. If for suitable parameters we have (iJf/iJu)po =0 but (iJf/iJv)Po' (iJ2f/iJu2)po' (0 2 f/ iJu iJv)po are linearly independent, then such a singular point is called a semiregular point. In general, shapes of neighborhoods of singular points are extremely complicated. However, we note the following: (i) by a small deformation of the function f (and its deriva- tives of orders at most r) we can reduce Po to a regular or semi regular point of the deformed surface; (ii) if Po is semiregular, we can choose 
421 suitable parameters and curvilinear coordi- nates of class C' in E 3 near Po so that the surface S in the neighborhood of the origin Po is expressed by the equations XI = u 2 , X 2 = V, X 3 = uv (H. Whitney's theorem [15]). (The higher-dimensional case has also been consid- ered (Whitney [16J).) References [1] W. Blaschke, Vorlesungen iiber Differen- tialgeometrie I, Springer, 1924 (Chelsea, 1967). [2J A. Duschek and W, Mayer, Lehrebuch der Differentialgeometrie I, Teubner, 1930. [3J L. P. Eisenhart, An introduction to dif- ferential geometry, Princeton Univ. Press, 1940; revised edition, 1947. [4J W. Klingenberg, Eine vorlesung iiber differentialgeometrie, Springer, 1973. [5] W. Fenchel, On the differential geometry of closed space curves, Bull. Amer. Math. Soc., 57 (1951), 44-54. [6J S. S. Chern, Curves and surfaces in Eucli- dean space, Studies in global geometry and analysis, Studies in math. 4, Math. Assoc. Amer. [7J I. Fary, Sur la courbure totale d'une courbe gauche fisant un noeud, Bull. Soc. Math. France, 77 (1949), 128-138. [8J J. W. Milnor, On the total curvature of knots, Ann. Math., (2) 52 (1950), 248-257. [9] E. Cartan, La theorie des groupes finis et continus et la geom6trie differentieIle traitees par la methode du repere mobile, Gauthier- ViIlars, 1937. [lOJ L. Bers, Riemann surfaces, Courant Inst. Math. Sci., 1957-1958. [l1J S. Sternberg, Lectures on differential geometry, Prentice-Hall, 1964. [12J A. D. Aleksandrov, Intrinsic geometry of convex surfaces (in Russian), Gostekhizdat, 1948; German translation, Die innere Geo- metrie der konvexen Flachen, Akademie- Verlag, 1955. [13J S. S. Chern, Topics in differential geome- try, Lecture notes, Institute for Advanced Study, Princeton, 1951. [14J H. Hopf, Zur Differential Geometrie geschlossener Flachen in Euklidischen Raum, Convegne Internazionale di Geometria Dif- ferenziale, Italy, 1953,45-54 (Cremona, 1954). [15J H. Whitney, The singularities of a smooth n-manifold in (2n-l)-space, Ann. Math., (2) 45 (1944),247-293. [16J H. Whitney, Singularities of mappings of Euclidean spaces, Symposium Internacional de Topologia Algebraica, Universidad Nacional Autonoma de Mexico and UNESCO, 1958, 285-301. 112 A Differential Operators [17J M. P. do Carmo, Differential geometry of curves and surfaces, Prentice-Hall, 1976. [18] T. 1. Willmore, An introduction to dif- ferential geometry, Clarendon Press, 1959. [19] N. J. Hicks, Notes on differential geome- try, Van Nostrand, 1964. [20J D, Laugwits, Differential and Rieman- nian geometry, Academic Press, 1965. [21J B. O'NeiIl, Elementary differential geome- try, Academic Press, 1966. [22J J. J. Stoker, Differential geometry, Wiley, 1969. 112 (XII.15) Differential Operators A. Definition A mapping (or an operator) A of a function space F I to a function space F 2 is said to be a differential operator if the value f(x) of the image f = Au (uEF lJEF 2) at each point X is determined by the values at X of u and a finite number of its derivatives. If u and fare tdistri- butions, the definition applies with the deriva- tive interpreted in the sense of distributions ( 125 Distributions and Hyperfunctions). In this article we restrict ourselves to the case of linear differential operators and consider only those of the form P(x, D) = L a,(x)D', l'l<;m (1 ) where ex denotes n-tuples (ex I , ex 2 , ..., ::t n ) of nonnegative integers, caIled multi-indices; lexl the length of ex: lexl =ex l + ex 2 +... + exn; and D' the differential operator D' = Df1 D2 , .. D', with D j =( -i)8/8x j . The coefficient (-i) is sometimes omitted. The coefficients a,(x) are functions defined on an open set Q in n- dimensional space. We call P(x, D) an ordinary differential operator if the dimension n of Q is 1 and a partial differential operator if n;:' 2. Ordinary differential operators and partial differential operators behave quite differently in many respects. We set P(x,) = L a,(x)', '= f1 ... " 1,I<;m where  = (I, 2' ..., n) E R n or en. The order of P(x, D) is the greatest integer lexl for which a,(x)¥O. In expression (1) m is assumed to be equal to the order, and in that case Pm(x,D)= L a,(x)D' l'lm is caIled the principal part of P(x, D), and the corresponding polynomial Pm(x,) the charac- teristic polynomial. 
1128 Differential Operators Differential operators have been investi- gated for a long time in connection with the linear differential equations P(x,D)u(x)= f(x), XEQ. Except for ordinary differential operators, however, it is rather recently that the prop- erties of such operators have been studied from the general viewpoint. We denote a differential operator with constant coefficients by P(D). In general, P(x, D) is assumed to be a linear differen- tial operator with coefficients that are C oo _ functions. However, many of the results for Coo-coefficients also hold when the coefficients are sufficiently differentiable. (For tfunction spaces '@(Q), .@'(Q), tC(Q), tC'(Q), C(Q), COO(Q), CO'(Q), Lp(Q), d(Q), (Q), etc"  125 Distri- butions and Hyperfunctions, 168 Function Spaces). Differential operators are classified accord- ing to their properties. The most important are the elliptic, hyperbolic, and parabolic types. A differential operator P(x, D) is said to be an elliptic operator if the characteristic poly- nomial Pm(x, ) has no real zero except for  =0 for each xEQ. Typical examples are the Laplacian  = - (Di + .,. + D;) and the Cauchy-Riemann operator a/az = (1/2)(a/ax + ia/ay), A differential operator is said to be hyper- bolic if the associated Cauchy problem is well posed ( 325 Partial Differential Equations of Hyperbolic Type). The d'Alembertian Di- (Di +... + D;) is an example. A differential operator of the form a/at + P(t, x, Dx) is called parabolic if P(t, x, Dx) is strongly elliptic ( Section G) in x. The heat operator iV n +! -  is typical. These three types of operators appear most often in applications, and if n = 2, then any operator of order 2 with real coefficients in (a/ax)' belongs to one of them at a generic point. In other cases, however, there are dif- ferential operators that do not belong to any of them. 8. Fundamental Solutions If a differential operator P(x, D) with '@(Q) as its domain has a left inverse F that is expressed as an tintegral operator with tkernel distri- bution (in .@' x,y;  125 Distributions and Hyperfunctions F), then the kernel is said to be a fundamental solution (or elementary solution). F is usually a right inverse of the weak exten- sion ( Section F) of P(x, D) and maps '@(Q) into tC(Q). The image is mapped to the original function by P(x, D), Nevertheless, F is not a genuine right inverse, and hence the funda- mental solution is not unique if it exists. 422 When P(D) is a differential operator with constant coefficients, we call a distribution E(x) a fundamental solution if it satisfies (2) P(D)E(x) = 15 (x), (3) where 15(x) is tDirac's distribution (15 function). If E(x) is a fundamental solution in this sense, then F(x, y) = E(x - y) is the kernel of a left inverse of P(D) and is a fundamental solution in the sense of the preceding paragraph. Every differential operator P(D) with con- stant coefficients has a fundamental solu- tion in the sense of OJ (Ehrenpreis-Malgrange theorem; see L. Hormander [4J for a proof). General operators P(x, D) with variable coefficients do not necessarily have funda- mental solutions. However, if P(x, D) belongs to one of the classical types of operators (ellip- tic, hyperbolic, or parabolic), then it has a fundamental solution at least locally. (See F. John [8J for elliptic operators, J. Leray [11J for strongly hyperbolic operators, and S. Mizohata [12J and S, D. Eidel'man [9J for parabolic operators.) Leray has generalized John's method to strongly hyperbolic opera- tors in an enormous work [13]. C. Ranges of Differential Operators Let P(D) be a differential operator with constant coefficients. Then it follows from the Ehrenpreis-Malgrange theorem that P(D).@'(Q)=:>'@(Q) holds for any open set Q. However, there are differential operators P(x, D) with variable coefficients such that for any Q, P(x,D).@'(Q):;b'@(Q), H, Lewy first devised such an example: P(x,D)= -iD I +D 2 -2(x l +ix 2 )D 3 . Let C 2m - 1 (x, D) be the homogeneous part of order 2m -1 of the commutator P(x, D) P(x, D) - P(x, D) P(x, D). Then in order that P(x, D).@'(Q)=:>'@(Q), it is necessary that Pm(x, )=O imply Czm-dx, )=O for all xEQ, ERn (Hormander's theorem [4J). When P(x, D) is a differential operator that does not satisfy this condition (e.g., Lewy's operator), choose an f(X)E'@(Q) that is not in P(x, D)'@'(Q). Then the differential equation (2) has no distribution solutions at all. P. Scha- pira extended this result to the case of thyper- functions (also  274 Microlocal Analysis). Concerning the ranges of differential oper- ators P(D) with constant coefficients, we have the following detailed results due to L. Ehren- preis [28J, B. Malgrange [14J, and Hormander [4]. 
423 An open set Q is said to be P-convex for a differential operator P(D) if for each compact set K cQ there exists a compact set K' cQ such that <pE CO'(Q) and suppP( -D)<p c K imply supp<p c K'. Convex sets are P-convex for any P(D). All open sets are P-convex if and only if P(D) is an elliptic operator. Theorem: The following conditions are equivalent: (i) Q is P-convex; (ii) P(D)'(Q)=:> tC(Q); (iii) P(D)tC(Q) = tC(Q). Property (iii), the Mittag-Leffler theorem, and the solvability of tCousin's first problem for the solutions of P(D)u =0 are equivalent. An open set Q is said to be strongly P- convex if for each compact set K c Q there exists a compact set K' such that J1EtC'(Q) and suppP( -D)J1cK imply suppJ1cK'; and J1EtC'(Q) and sing suppP( -D)J1 cK imply sing supp J1 c K'; where the singular support of J1 is the closure of the set of all points at which J1 is not a COO-function. Convex sets are strongly P- convex, and strongly P-convex sets are P- convex. Theorem: Q is strongly P-convex if and only if P(D)'(Q)='(Q). R. Harvey has shown that every domain Q is P-convex in the sense of hyperfunctions, i.e., the equation P(D)u == f always has a hyper- function solution u on Q for any hyperfunction f on Q. For the treal analytic functions d(Q), however, P(D)d(Q) = d(Q) does not hold for convex open set Q in general. Hormander [15J gave a necessary and sufficient condition for P(D) and Q in order that this hold. D. Hypoellipticity A differential operator P(x, D) is called hypo- elliptic in Q if for any distribution u(x) E '(Q), Pu E Cro(Q) implies U E COO(Q). Further, a differ- ential operator with real analytic coefficients P(x, D) is called analytically hypoelliptic in Q if Pis hypoelliptic and if for any distribution U(X)E'(Q), PUEd(Q) implies u(x)Ed(Q), There are two fundamental facts about such operators, Let P have constant coeffi- cients; then P(D) is hypoelliptic if and only if P(+ir/)=O and 1+i'1l-->oo imply that 1'11-->00 (Hormander's theorem [4]). Furthermore, P(D) is analytic hypoelliptic if and only if P is ellip- tic (Petrovskii's theorem [16J). The heat opera- tor is not elliptic, but hypoelliptic. If P(D) is elliptic, then actually any hyperfunction U(X)E (Q) such that PUEd(Q) is real analytic (R. Harvey, G. Bengel). On the other hand, if P(D) is not elliptic, there is a hyperfunction solution U of Pu = 0 that is not a distribution. Strictly speaking, the notion of the hypo- ellipticity for general differential operators was first formulated explicitly by L. Schwartz 112E Differential Operators [27]. Before that time, D. Hilbert, E. E. Levi and K. O. Friedrichs, and others investigated this problem for some elliptic operators, and the hypoellipticity was called Weyl's lemma ( 323 Partial Differential Equations of Ellip- tic Type). Similarly to the constant coefficient case the following two theorems are fundamental: Elliptic and parabolic operators P(x, D) with Coo coefficients are hypoelliptic (Schwartz [27], Mizohata [12J). Elliptic operators P(x, D) with real analytic coefficients are analytic hypoelliptic (I. G. Petrovskii [16J, C. B. Mor- rey and L. Nirenberg). The latter holds also for hyperfunction solutions (M. Sato and Schapira). Moreover, the following result is known: If for each compact set K in an open set Q, there exists a constant C such that Ilp k uIIK.s;C k +1(mk)!, then uEd(Q), where II'IIK denotes the Lp-norm on K (H. Komatsu, T. Kotake, and M, S. Narasimhan). However, as seen by the example Dl + ix D 2 in R 2 (k is even), the analytic hypoellipticity also holds for nonelliptic operators. Such an operator is called a subelliptic operator; subel- Iiptic operators have been investigated by Hormander, Yu. V. Egorov, F. Treves, and others [19]. Hormander has obtained a fairly complete result on the hypoellipticity of the operators of the form , L= L Xl+Xo+c(x), i=l where Xo, . .. , X, are homogeneous first -order differential operators with real coefficients ([17J; O. A. Oleinik and E, V. Radkevich [18J). Hypoellipticity was investigated exten- sively after the introduction of pseudodifferen- tial operators and Fourier integral operators ( 345 Pseudodifferential Operators). E. Differential Operators in Banach Spaces We consider differential operators P(x, D) de- fined on a domain Q as operators in the func- tion spaces C(Q) or Lp(Q). Differential oper- ators of order m;:: 1 are always unbounded operators in the Banach space X = C(Q) or Lp(Q). Moreover, their domains of definition as operators in X are not generally determined uniquely by the expressions P(x, D) as differen- tial operators. P(x, D) is a linear operator that maps CO' (Q) into X. This operator has a closed extension. The minimal closed extension Po is called the minimal operator of P(x, D) in X. We have UE (Po) and Pou== f if and only if there exists a sequence <PnECO'(Q) such that <Pn-->U' P(x, D)<Pn--> f On the other hand, the 
112F Differential Operators closed linear operator PI whose domain is the set of all UEX such that P(X,D)UEX in the sense of distribution is called the maxi- mal operator (or weak extension) of P(x, D). We have uE(Pd and Plu= f if and only if <u, 'P(x, D)<p > = <f, <p> for any <p E Co (Q), where tP(x, D) is the transposed operator 'P(x, D)<p(x) = I (- D)"(a,(x)<p(x)). l'l<:;m Integration by parts shows that PI is an exten- sion of Po and that when X is the tdual space of a space Y, the weak extension PI in X is the tdual of the minimal operator of 'P(x, D) in Y. Let X = Lp(Q) (1 <p< 00), Q be a bounded open set with smooth boundary, and P(D) have constant coefficients. Then PI coincides with the smallest closed extension of the oper- ator P(D) having as its domain the set of all UECOO(Q)nX such that P(D)UEX. The latter closed extension is called the strong extension. The difference between the weak and the strong extension is not obvious in the vari- able coefficient case. Po coincides with PI when Q is the entire space and P(x, D) is an elliptic operator whose coefficients are constants or close to constants (J. Peetre, Medd. Lunds Univ. Mat. Sem., 16 (1959); T. Ikebe and T. Kato, Arch. Rational Mech. Anal., 9 (1962)). In general, we have Po # PI' Let P(x, D) be an ordinary differential operator with bounded coefficients such that lam(x)I;::15 >0 and Q be the bounded interval (a, b). Then the domain of PI coincides with the set of all (m - 1 )-times continuously differenti- able functions U such that the (m - 1 )st deriva- tive is absolutely continuous and P(x, D)UE X, while the domain of Po is the set of all func- tions U which satisfy in addition the boundary conditions uta) = u'(a) =... =u(m-I)(a) = u(b) =... = utm-I)(b) = 0. (Moreover, U(m)(a) = u(m)(b) =0 when X = C(a,b).) Let G(Po), G(P I )( C X X X) be the tgraphs of Po, PI' Then the quotient space fJB = G(Pd/ G(Po) is calIed the boundary space, and an element of the dual fJB' of fJB, i.e., a continuous linear functional on G(Pd which vanishes on G(po), is called a boundary value relative to P(x, D). For the ordinary differential opera- tors discussed above, the boundary space is the set of all linear combinations of u(i)(a) and u(j)(b). When P(x,D) is an ordinary differen- tial operator, we can explicitly determine the boundary values also in the case where the interval (a, b) is infinite, the coefficients a,(x) are not bounded. or am(x)-.O (x-.a, b); and we can show that fJB is finite-dimensional. When P(x, D) is a partial differential operator, fJB is 424 generalIy of infinite dimension, and the con- crete forms of the elements of fJB and fJB' are not known. However, we have some informa- tion by M. I. Vishik (Amer. Math. Soc. Transl., (2) 24 (1963)) about the boundary values of eIliptic operators of the second order. Combin- ing this with the results by J.-L. Lions and E. Magenes (J. Anal. Math., 11 (1963)), we can obtain information for elliptic operators of higher order. F. Differential Operators with Boundary Conditions A closed operator between the minimal opera- tor Po and the maximal operator PI is deter- mined by designating a closed subspace B of the boundary space fJB. This operator is called the operator with the boundary condition B. Particularly important are boundary condi- tions expressed in the form Qi(x,D)u(x) =0, xEaQ, i=I,...,k, (4) with differential operators Qi(X, D) (i = 1, ..., k) defined on the boundary aQ of Q. When P(x, D) is an ordinary differential operator defined on a finite interval and the orders of Qi are at most m - 1 (or m), (4) always has a definite meaning. However, for partial differential operators, we need an interpreta- tion of (4), i.e., (4) does not necessarily deter- mine the subs pace B of fJB uniquely. Let Ps be the smalIest closed extension of P(x,D) with {uECOO(Q)nXI Q;(x,D)u(x)=O, xEaQ; P(X,D)UEX} as its domain. Ps is called the strong extension of the differential operator P(x,D) with boundary condition (4). On the other hand, when Q, P, and Qi satisfy suitable conditions, we can define the transposed differential operator 'P(x, D) with the transposed boundary operators Rj(x, D) (j = 1, "', I). Namely, there are differential operators Rix, D) on the boundary such that a necessary and sufficient condition for UE COO(Q) to satisfy (4) and Pu(x)= f(x) is Lf(x)V(X)dX= L u(x)'Pv(x)dx (5) for all v(x) E COO (Q) with the boundary con- ditions Rjv=O, xEaQ. Then the operator Pw defined by Pwu(x) = f(x) for the pairs u(x), f(x) E X satisfying (5) is called the weak exten- sion of the differential operator P(x, D) with boundary condition (4). As in the case of operators without boundary conditions, the weak extension is an extension of the strong extension, and generally is the dual of the strong extension in the dual space X' of the transposed differential operator with the trans- posed boundary condition. 
425 Regularity up to the boundary, The funda- mental problems for the differential operator P(x, D) with the homogeneous boundary condition (4) are to determine, for both the strong and the weak extensions, the spaces of solutions of the homogeneous equations Pu = 0 and their ranges. The problems mostly reduce to determining when the strong and the weak extensions coincide and, including this, also to the problem of regularity on the closed domain 0 containing the boundary of the solutions u of the equation Pwu=f This problem was solved by Nirenberg for strongly elliptic operators in U(Q) with the Dirichlet boundary condition aj-Iu(x)/an j - 1 =0, j= 1,2, ...,m/2, and generalized later by F. Browder, M. Schechter [20], S. Agmon, Lions, and others for elliptic operators in Lp(Q) with a kind of coercive boundary condition ( Section H). Consequently, when Q is bounded and smooth, Pw is equal to Ps for those operators. Write P for Pw' Then the space N(P) of the solutions of Pu = 0 is a subs pace of finite di- mension, and the range R(P) is a closed sub- space of finite co dimension. In particular, it follows that the tindex of P, dimN(P)- codim R(P), is finite. G. Strongly Elliptic Operators A differential operator P(x, D) is said to be strongly elliptic if its characteristic polynomial satisfies RePm(x,);;.qlm>O, #O. Many of the elliptic operators, such as the Laplacian, that appear in applications are strongly elIiptic. L. Garding treated the boundary value problem with the Dirichlet condition (in the generalized sense) for strong- ly eIliptic operators. His work initiated the general study of differential operators ( 323 Partial Differential Equations of Elliptic Type). His theory is based on the following inequality, caIled Girding's inequality: IIV m / 2 ulli,.s; c( Re f Pu' udx+ Ilulli,), uE Cg'(Q). H. Coercive Boundary Conditions The boundary condition (4) is said to be coer- cive if IIVmul\.s; c(llpull + Ilull) holds for any u E Coo (0) that satisfies (4). In 112 I Differential Operators order that a differential operator P have a coercive boundary condition, it is necessary that it be a special type of elIiptic operator. In this case, Agmon, N. Aronszajn, Schechter, and others found conditions under which (4) is coercive. Agmon, A. Douglis, and Nirenberg show that the inequality (6) holds in Lp(Q) and in the normed spaces of Holder continuous functions under a suitable condition. The classical boundary conditions au+bau/an=O (a;;'O, b ;;'0, a+ b = 1) are coercive for eIliptic operators of the second order ( 323 Partial Differential Equations of Elliptic Type). How- ever, problems remain when the coefficients of Qj(x, D) are discontinuous. In order to have coincidence of the strong and weak exten- sions or regularity up to the boundary, it is necessary neither that P be elliptic nor that the boundary condition be coercive. But it is not known to what extent these conditions can be weakened. At present, major contributions are Hormander's work (Acta Math., 99 (1958» dealing with operators with constant coeffi- cients and flat boundaries, and works by J. J, Kohn [21J, Nirenberg, and Hormander con- cerning noncoercive boundary conditions. The latter works are connected with the theory of several complex variables, and have at- tracted much attention. I. Self-Adjoint Extension One of the fundamental problems in the case X = L 2 (Q) is whether the minimal operator Po has a self-adjoint extension. Po is symmetric if and on ly if P(x , D) is formally self-adjoint: P(x, D) ='P(x, D). Under this condition the boundary space fJB turns out to be the direct sum of two subs paces fJB et = {(x, PI x) I x E f0(Pd, Plx= i:ix} +G(Po). The numbers net =dimfJB:t are called the deficiency indices of Po, and Po has a self-adjoint extension if and only if n+ =n_. H. Weyl gave a method for computing n:t for the Sturm-Liouville operators: P(x,D)= - C: P(X) d: .)+q(X), xE(a,b). (6) We say that a (resp, b) is of limit circle type if the solutions u(x) of P(x, D)u(x)+ lu(x) = 0 (IEC) always belong to L 2 in a neighborhood of a (resp, b) and of limit point type if a solu- tion does not belong to L 2 . This classification does not depend on the choice of IE C. (1) If both a and b are of limit point type, then n+ = n_ = 0 and hence Po is self-adjoint. (2) If a is of limit circle type and b is of limit point type, then n+ = n_ = 1, and the self-adjoint extensions of Po are the operators P, that are obtained from PI by assigning the boundary 
112 J Differential Operators condition p(a)u'(a)cosCl: + u(a) sin CI: = O. The same is true when a and b are inter- changed. (3) If both a and b are of limit circle type, then n+ = n_ = 2, and we can impose two boundary conditions to obtain the self-adjoint extensions. These results have been extended by K. Kodaira [23J and N. Dunford and J. Schwartz [3J to the case of ordinary differential opera- tors of order m. There are formally self-adjoint operators that have no self-adjoint extensions. For example, the operator -id/dx in L 2 (0, 00) has deficiency indices n+ = 1 # IL = O. For partial differential operators, it is dif- ficult to determine explicitly all self-adjoint extensions of a given formally self-adjoint operator because the boundary space is com- plicated. If the boundary condition (4) is for- mally self-adjoint and coercive, then it follows from the results of Schechter and others that the differential operator with boundary condi- tion (4) is self-adjoint. Furthermore, condi- tions under which Po is self-adjoint or has a self-adjoint extension are known. The follow- ing theorem is often used as a condition of the latter type. If a tsymmetric operator defined on a dense subspace of a Hilbert space X is posi- tive definite: (Tx, x) ;;>0, x E f0(T), then there is a positive definite self-adjoint extension f (Friedrichs's theorem). The self- adjoint extension obtained by this theorem is called the Friedrichs extension. J. Generators of Semigronps From the point of view of probability theory, W. Feller investigated the extensions of the Laplacian £ /dx 2 and similar operators that are the generators of order-preserving semi- groups. Recently various attempts have been made to generalize his results to the multi- dimensional case ( 115 Diffusion Processes; 378 Semigroups of Operators and Evolution Equations). P. D. Lax and A. N. Milgram proved that if P(x, D) is a strongly elliptic operator, then -P(x, D) with the Dirichlet condition in L 2 (Q) is the generator of a semigroup [1]. K. Boundary Value Problems There are two methods of solving the inhomo- geneous boundary value problem P(x,D)u(x)=f(x), XEQ, Qi(x,D)u(x)=gi(X), xEaQ, i= 1,... ,k. 426 In the first method, we take a function v(x) that satisfies Qi(X, D)v(x) = gi(X) and reduce the problem to the homogeneous one for Uo = u - v. In the second method, we consider the pair P(x, D) and Qj(x, D) as an operator that maps a function u to the pair of functions (Pu, Qju) and investigate it directly. The latter method was adopted by Peetre and Hor- mander [4]. L. Estimates in Weighted Spaces J. F. Treves, Hormander [4J, and H. Kumano- go obtained estimates similar to (6) in Lp- spaces relative to the weighted measure w,(x)dx instead of the usual Lp-spaces, and applied them to the proof of the uniqueness of Cauchy problems for differential equations with variable coefficients ( 321 Partial Dif- ferential Equations (Initial Value Problems)). Hormander [22J applied similar estimates to the proof of the tfundamental theorems of Stein manifolds. M. Eigenfunction Expansions When a self-adjoint operator P in the Hilbert space L 2 (Q) is a self-adjoint extension of a differential operator P(x, D), the tspectral decomposition of P is concretely expressed by the expansion of functions uEL 2 (Q) into eigen- functions of P(x, D), If Q is bounded, P(x, D) is an elliptic oper- ator defined on a neighborhood of Q, and the boundary condition is coercive, then the tspectrum of P is composed solely of eigen- values, and the eigenvectors of Pare eigen- functions of P(x, D) in the classical sense and are of class Coo up to the boundary. N. Asymptotic Distribution of Eigenvalues If P = -, the number v(A) of eigenvalues less thanA satisfies the asymptotic relation A n / 2 A V ( A )  as ..1.-->00 2 n - 1 nn/2 n r(n/2) , regardless of the shape of the domain and the boundary condition, where n is the dimension and A is the volume of Q [2]. This was first proved by Weyl and extended by T. Carleman, Garding, and others to the case of operators of higher order with variable coefficients (  323 Partial Differential Equations of Elliptic Type). 
427 O. Weyl-Stone- Titchmarsh-Kodaira Theory When Q is unbounded or the coefficients of P(x, D) have singularities near the boundary of Q, the spectrum of P may have a continuous part. Let P(x, D) be an ordinary differential oper- ator on an interval ( a, b). Then for each ..1. E C the equation (P(x, D)-A)<p(x)=O has m linearly independent solutions <Pk(X,..1.) (k = 1, "" m), and any solution is represented as a linear combination of them. Weyl and M. H. Stone obtained the spectral decomposition of P (of the second order) in the form u(x) = j.l Loooo <p/x, A)dpjk(A) x r <Pk(y,..1.) u(y)dy, Pu(x) = j'l f:oo ).<pj(x, A)dpjk(A) x r <pk(y,..1.) u(y)dy, where the Pjk(A) are functions of bounded variation and their variations represent the spectral measure. This formula shows that linear combinations of the <p/x, ..1.) form gen- eralized eigenfunctions even when ..1. belongs to the continuous spectrum. Later, E, C. Titch- marsh and Kodaira gave a formula to obtain the density matrix Pjk(A) and completed the theory (Titchmarsh [6J, Kodaira [23J, Dun- ford and Schwartz [3J). This expansion theo- rem makes it possible to deduce in a unified manner expansion theorems for classical special functions, such as the tFourier series expansion theorem, the expansions by tHer- mite polynomials and tLaguerre polynomials, the tFourier integral theorem, and various expansions in terms oftBessel functions [6,7]. The relation between the coefficients of the differential operator and the spectral distri- bution of P is important in applications and is the subject of many papers [3,5,6]. For example, let P(x,D)= -d 2 jdx 2 +q(x) and Q=( -00,00). If q(x)--> 00 as Ixl-->oo, then P(x, D) is essentially self-adjoint, the spectrum is entirely composed of the point spectrum, and a detailed estimate of the jth eigenvalue Aj is also known [6]. If q(x) converges rapidly to o as Ixl-->oo, then P(x, D) is essentially self- adjoint, and there is only a continuous spec- trum for ..1.>0 and eigenvalues for ..1.<0 with at most 0 as accumulation point. If q(x) is a periodic function of x, then P(x, D) is again essentially self-adjoint, and the spectrum con- sists of a continuous spectrum decomposed in a sequence of nonoverlapping intervals. The converse problem of determining q(x) when 112Q Differential Operators the spectral measure is given has been studied by I. M. Gel'fand and B. M. Levitan (Amer. Math. Soc. Transl" (2) 1 (1955); original in Russian, 1951). P. Eigenfunction Expansion for Partial Differential Operators The theory of eigenfunction expansion for partial differential operators with continuous spectra is not complete, as it is for ordinary differential operators, What causes difficulties is that the solutions of (P(x, D) - A)u = 0, which should be the generalized eigenfunctions, form an infinite-dimensional space, and except for special cases it is impossible to introduce convenient parameters in it. Many proofs are known for a general result, saying that any self-adjoint elliptic operator has an eigenfunc- tion expansion into generalized eigenfunctions of the form f ro k(A) r u(x) == -00 j <Pj(X, A)dp/A) In <Pj(y,..1.) u(y)dy, [10]. There are few operators, however, for which we know how to construct <p)x, ..1.) and the measure dpAA). The Fourier transform gives the expansion for operators with con- stant coefficients defined on the whole space. By means of a generalized form of the Fourier transform, Ikebe gave an eigenfunction expan- sion for the Schrodinger operator -+q(x) in R 3 under the condition that q(x)EL2 and O(lxl- 2 -') as Ixl-->oo [24]. Y. Shizuta, Mizo- hata, Lax and R. S. Phillips, N. A. Shenk, Ikebe, D. K. Fadeev, and others proved the same results for similar operators defined on an exterior domain with a bounded set deleted from a higher-dimensional Euclidean space. These theories are closely related to the tscattering theory of the Schrodinger eq ua- tion (-idjdt - P)u = 0 or the wave equation (d 2 jdt 2 +P)u=0 associated with those opera- tors. Lax and Phillips have developed the latter scattering theory [26J ( 375 Scattering Theory). Many of the problems in tquantum mechan- ics reduce to finding the spectral distribution of self-adjoint partial differential operators. Q. Expansion Theorems for Non-Self-Adjoint Operators A kind of eigenfunction expansion theorem may hold for non-self-adjoint differential operators or for non-Hilbert spaces. (See papers by the Russian school for ordinary differential operators and those by Browder, 
112R Differential Operators Agmon, and others for partial differential operators.) R. Systems of Differential Operators We have so far dealt with single differential operators that map functions u to functions f A linear differential operator that maps a p-tuple (u l , "', up) of functions to a q-tuple (/t> ... ,J;J of functions can be written as p /;(x) = L ix,D)uix), i=I,...,q, j=l where P(x, D) = (Pij(x, D)) is a matrix of single differential operators. Such a matrix is called a system of differential operators. A system P(x, D) is said to be underdetermined if p > q, determined if p = q, and overdetermined if p < q. Many propositions that hold for single operators hold for determined systems under appropriate conditions. However, there is a fundamental difference between overdetermined (underdetermined) systems and determined systems, as is seen from the theory of several complex variables, which is the theory of a typical overdeter- mined system 8jiJi = (8jiJiJ The theory is much more difficult for overdetermined (under- determined) systems. The general theory of overdetermined and underdetermined systems with constant coefficients has been constructed by Ehrenpreis [28], Malgrange, V. Palamodov, and Hormander [22J for COO-functions and distributions. It has been extended by H. Ko- matsu to the case of hyperfunctions. T. Miwa has discussed the same problem for real ana- lytic functions. S. Symmetric Systems of the First Order Determined systems of the first order are important in applications. Many problems in mathematical physics are formulated in terms of them. Also, single equations of higher order can be reduced to determined systems of the first order by regarding the derivatives as unknown functions. In some cases determined systems of the first order are easier to handle than single operators of higher order. In partic- ular, a system of differential operators P(x,D)= LAJx)8j8x;+B(x) is said to be a symmetric positive system if the matrices Ai(x) and B(x) satisfy the following conditions: Af =A i ; B+B* + L aA;/8x i is positive semi definite. Symmetric positive sys- tems have been studied in detail by Friedrichs [25J, Phillips, C. S. Morawetz, Lax, and others. 428 References [IJ K. Y osida, Functional analysis, Springer, 1965. [2J R. Courant and D. Hilbert, Methods of mathematical physics, Interscience, I, 1953; II, 1962. [3] N. Dunford and 1. T. Schwartz, Linear operators II, Interscience, 1963. [4J L. Hormander, Linear partial differential operators, Springer, 1963, [5] M. A. Naimark, Linear differential opera- tors I, II, Ungar, 1967. (Original in Russian, 1954.) [6] E. C. Titchmarsh, Eigenfunction expan- sions associated with second-order differential equations, Clare don Press, I, revised edition, 1962; II, 1958. [7] K. Y osida, Lectures on differential and integral equations, Interscience, 1955. (Orig- inal in Japanese, 1950.) [8J F. John, Plane waves and spherical means applied to partial differential equations, Inter- science, 1955. [9] S. D. Eidel'man, Parabolic systems, North- Holland, 1969. (Original in Russian, 1964.) [10] I. G. Gel'fand and N. Ya. Vilenkin, Gen- eralized functions IV, Applications of har- monic analysis, Academic Press, 1964. [11] 1. Leray, Hyperbolic differential equa- tions, Lecture notes, Institute for Advanced Study, Princeton Univ. Press, 1953. [12] S. Mizohata, Hypoellipticite des equa- tions paraboliques, Bull. Soc. Math. France, 85 (1957),15-50. [13] 1. Leray, Probltme de Cauchy I-V, Bull. Soc. Math. France, 85 (1957), 389-429; 86 (1958), 75-96; 87 (1959), 81-180; 90 (1962), 39- 156; 92 (1964),263-361. [14] B. Malgrange, Existence et approxima- tion des solutions des equations aux derivees partielles et des equations de convolution, Ann. Inst. Fourier, 6 (1955-1956),271-355. [15] L. Hormander, On the existence of real analytic solutions of partial differential equa- tions with constant coefficients, Inventiones Math., 21 (1973), 151-182. [16] I. G. Petrovskii, Sur I'analyticite des solutions des systemes d'equations differen- tie1les, Mat. Sb., 5 (47) (1939), 3-70. [17J L. Hormander, Hypoelliptic second-order differential equations, Acta Math., 119 (1967), 147-171. [18] O. A. Oleinik and E. V. Radkevich, Second-order equations with nonnegative characteristic form, Amer. Math. Soc., 1973. (Original in Russian, 1971.) [19J L. Hormander, Seminar on singularities of solutions of linear partial differential equa- tions, Ann. Math. Studies 91, Princeton Univ. Press, 1979. 
429 [20] M. Schechter, On U estimates and regu- larity I, II, Amer. 1. Math" 85 (1963), 1-13; Math. Scand., 13 (1963),47-69. [21J J. 1. Kohn, A priori estimates in several complex variables, Bull. Amer. Math. Soc., 70 (1964), 739-745. [22J L. Hormander, An introduction to com- plex analysis in several variables, Van Nos- trand, 1966. [23J K. Kodaira, On ordinary differential equations of any even order and the corre- sponding eigenfunction expansions, Amer. J. Math., 72 (1950), 502-544. [24] T. Ikebe, Eigenfunction expansions as- sociated with the Schrodinger operators and their applications to scattering theory, Arch. Rational Mech. Anal., 5 (1960), 1-34, [25] K. O. Friedrichs, Symmetric positive linear differential equations, Comm. Pure Appl. Math., 11 (1958),333-418. [26J P. D. Lax and R. S. PhilIips, Scattering theory, Academic Press, 1967. [27] L. Schwartz, Theorie des distributions, Hermann, 1966. [28J L. Ehrenpreis, Fourier analysis in several complex variables, Wiley-Interscience, 1970, [29J J.-E. Bjork, Rings of differential operators, North-Holland, 1979. [30J L. Hormander, The analysis of linear partial differential operators, I-IV, Springer, 1983-1985. 113 (111.17) Differential Rings Let R be a tcommutative ring with a unity element 1. If a map 15 of R into R is such that for any pair x, y of elements of R, (i) D(x + y) =Dx+Dy, and (ii) D(xy)=Dx' y+x' Dy, then 15 is called a derivation (or differentiation) in R. A ring R provided with a finite num ber of mutu- ally commutative differentiations in R is called a differential ring. In this article we consider only the case where R contains a subfield that has the unity element in common with R. In particular, if R is a field, we call it a differential field. In the above definition of differential ring, it is not necessary to mention the tcharacteristic of the subfield contained in R. However, to make it more effectively applicable in the case of nonzero characteristics, we may define differential rings using higher differentiation in place of the differentiation defined above. If a sequence D={D,} of maps Do, Db 15 2 ,,,, of R into R satisfies the following conditions (i)-(iv) for any pair x, y of elements of R and any pair )., J1 of nonnegative integers, then 15 is calIed a 113 Differential Rings higher differentiation in R: (i) 0A(X + y) = DAx + DAY; (ii) DA(xy) = L D,x' Dpy (the addition is performed over aII pairs ex, f3 of nonnegative integers that satisfy ex+f3=A); (iii) DA(Dx)= e: J1) 15 HX; (iv) Dox = x. Two higher dif- ferentiations 15 = {b v } and 15' = {D} are said to be commutative if and only if D A and D com- mute for all pairs ..1., J1 of nonnegative integers. Higher differentiations were introduced by H. Hasse (J 935) for the study of the field oftalge- braic functions of one variable in the case of nonzero characteristics. These two definitions of differential rings coincide if the characteristic of R is zero. For simplicity, we shall restrict ourselves to that case. Let 15 1 , .. . , 15m be the differentiations of the differential ring R. If x is an element of R, 15;1 D2 ... b:"mx(s I' S2, ..., Sm are nonnegative integers) is calIed a derivative of x, We calI x constant if and only if 15 1 x = ... = Dmx = O. An tideal a of R with Diaca (i= 1,2, ...,m) is called a differential ideal of R. If it is a tprime ideal (semiprime ideal (i,e., an ideal containing aII those elements x that satisfy x g E a for some natural number g)), then a is called a prime differential ideal (semi prime differential ideal). A subring S of R with DiS c S can be regarded as a differential ring with respect to the differ- entiations 15 1 , .,., 15m. We calI S a differential subring of Rand R a differential extension ring of S. Let Xl' '" , X n be elements of a differential extension field of a differential field K with the differentiations 15 1 , '" ,15m, and let 15;' D2 .., D':fr'X i (Sl ;;'0, ..., Sm ;;.0, 1.s; i .s;n) be talgebra- icalIy independent over K. The totality of their polynomials over K, which forms a differential ring, is calIed the ring of differential polyno- mials of the differential variables X I, .., , X n over K, and is denoted by K {Xl' "', X n }. Its elements are calIed differential polynomials. For this ring of differential polynomials we have an analog of tHilbert's basis theorem in the ring of ordinary polynomials, Ritt's basis theorem: If we are given any set \JJl of differen- tial polynomials of Xl' ..., X n over K, we can choose a finite number of differential polyno- mials PI' '" , Pr from \lI( such that each element Q of\JJl has an integral power Qg equal to a linear combination of PI, '" ,P, and their de- rivatives, where the coefficients of the linear combination are elements of K{X I , ...,X n }. This theorem implies that in the ring of differ- ential polynomials, every semiprime differential ideal can be expressed as the intersection of a finite number of prime differential ideals; if this expression is tirredundant, it is unique ( 67 Commutative Rings). The equation obtained by equating a dif- 
113 Ref. Differential Rings ferential polynomial to zero is called an alge- braic differential equation. Concerning these equations, we are able to use methods similar to those used in talgebraic geometry in study- ing the usual algebraic equations. J. Ritt made interesting studies on solutions of algebraic differential equations by such methods, prin- cipally in the case when the ground field K consists of tmeromorphic functions. Since that time, basic study of differential rings and fields has been fairly well organized and has developed into theories such as the following two: (I) Picard-Vessiot theory. This is a classical theory of t)inear homogeneous differential equations originated by E. Picard and E. Vessiot; it resembles the tGalois theory con- cerning algebraic equations. The Galois group in this case is a linear group, and its structure characterizes the solution of the differential equation, E. Kolchin introduced the genera] concept of the Picard- V essiot extension field of a differential field and studied in detail the group of differential automorphisms (i.e., the group of all those automorphisms that com- mute with the differentiations and fix elements of the ground field), thus making the classical theory more precise and more general. (2) Galois theory of differential fields. Gen- eralizing the concept of the Picard- V essiot extension, Kolchin introduced the notion of the strongly normal extension field and es- tablished the Galois theory for such exten- sions. In this theory, we see that the Galois group is an talgebraic group relative to a tuniversal domain over the field of constants of the ground field. Conversely, every algebraic group is the Galois group of a strongly normal extension field. We also see that a strongly normal extension can be decomposed, in a certain sense, into a Picard-Vessiot extension and an Abelian extension (i.e., an extension whose Galois group is an tAbelian variety) ( Kolchin [2-5J, Okugawa [6J). References [lJ I. Kaplansky, An introduction to differen- tial algebra, ActualiH:s Sci. Ind., Hermann, 1957. [2] E. R. Kolchin, Algebraic matric groups and the Picard- Vessiot theory of homogeneous ordinary linear differential equations, Ann. Math., (2) 49 (1948),1-42. [3J E. R. Kolchin, Galois theory of differential fields, Amer. J. Math., 75 (1953), 753-824. [4J E. R. Kolchin, On the Galois theory of differential fields, Amer. J. Math., 77 (1955), 868-894. 430 [5J E. R. Kolchin, Abelian extensions of dif- ferential fields, Amer. J. Math" 82 (1960), 779- 790. [6J K. Okugawa, Basic properties of differen- tial fields of an arbitrary characteristic and the Picard-Vessiot theory, J. Math. Kyoto Univ., 2 (1963),295-322. [7J J. F. Ritt, Differential algebra, Amer. Math. Soc. Colloq. Pub!., 1950. 114 (IX.18) Differential Topology A. General Remarks Differential topology can be defined as the study of those properties of tdifferentiable manifolds that are invariant under tdiffeo- morphisms. The basic objects studied in this field are the topological, combinatorial, and differentiable structures of manifolds and the relationships among them. Some of the re- markable contributions, such as H. Whitney's embedding theorem [1], the triangulation theo- rems of J. H. C. Whitehead and S. S. Cairnes [2, 3J, and Morse theory [4J, were made in the 1930s, In the late 1950s, outstanding re- sults were obtained by R. Thom, J. W. Milnor, S, Smale, M. Kervaire, and F. Hirzebruch, among others. Differential topology thus be- came a new, fascinating branch of mathematics. B. Differentiable Structures We assume that all manifolds are tparacom- pact. Let M be a ttopological manifold. A C' -equivalence class of tatlases of class C' (1 .s; r.s; 00) on M is called a C' -structure on M. Any C'-structure on M contains an atlas of class Coo on M, and its COO-equivalence class is uniquely determined (Whitney [1]). This C oo _ structure is called a differentiable structure on M compatible with the given C'-structure. Moreover, any COO-structure admits a heal analytic structure compatible with it in this sense (Whitney [IJ), A Coo-manifold is also called a smooth manifold, and a differentiable structure a smooth structure, Let Do, Dl be differentiable structures on M. If two Coo_ manifolds (M, Do) and (M, DtJ are not tdif- feomorphic, we say that the differentiable structures Do, Dl are distinct ( 105 Differen- tiable Manifolds). Milnor defined a certain invariant of differentiable structure using the Hirzebruch index theorem ( 56 Character- istic Classes) and proved that there are several distinct differentiable structures on the 7- dimensional sphere S7 (Milnor [5J). Milnor's 
431 example of such structures was: Let fk: S3 --> SO(4) be the mapping defined by fdO")r= O"hrO"j, where k is an odd integer, h,j are inte- gers determined by h + j == 1, h - j == k, and 0", Tare tquaternions of norm 1. Let MZ be the total space of the S3-bundle over S4 cor- responding to the mapping k This is an oriented closed manifold with the naturally defined differentiable structure, Moreover, for each k, MZ is homeomorphic to the 7- dimensional sphere S7. But if k, / are odd and k 2 =f=./2 (mod 7), MZ is not diffeomorphic to M?, Differentiable manifolds (such as MZ) that are homeomorphic, but not diffeomorphic, to the natural sphere are called exotic spheres. After the discovery made by Milnor, many topological manifolds other than the 7- dimensional sphere and possessing several distinct differentiable structures have been found (N. Shimada, I. Tamura). Moreover, topological manifolds admitting no differen- tiable structures have been constructed (M. Kervaire [6J, Smale, Tamura [7J, J. Eells and N. Kuiper, C. T. C. Wall [8J). On the other hand, manifolds of dimension .s; 3 admit unique differentiable structures, We can introduce a tRiemannian metric on a Coo-manifold, Let M and M z be CW_ manifolds (k-dimensional and n-dimensional, respectively) and f: M --> M 2 be an timmer- sion. We fix a Riemannian metric on M}. For each point p of M, let N p (/) be the linear space of all tangent vectOrs of M 2 at f(p) that are orthogonal to the ttangent space of f( U(p», where U(p) is a small open neighborhood of p in Mi, Then {Np(f)lpEMn forms an (n-k)- dimensional tvector bundle vf over M, called the normal bundle of the immersion f The equivalence class of vf is independent of the choice of Riemannian metric on M 2 . Whenf is an tembedding, we denote the submanifold f(Mn by L k . Let N,(L k) be the set of all points whose distances (with respect to the Riemann- ian metric) from L k are .s;E. If Lk is compact and E is sufficiently small, N,(L k ) is an n- dimensional submanifold of M z with bound- ary and is uniquely determined independently of the choice of I'; up to diffeomorphism, This submanifold is called a tubular neighborhood of L k in M z and is denoted by N(L k ). The interior of N(L k ) is called an open tubular neighbor- hood. The total space E(vf) of the normal bundle vf of the embedding f is diffeomorphic to the open tubular neighborhood of U in M2' C. C'-Triangulation and the Smoothing Problem Let (KJ) be a ttriangulation of the C'- manifold M n (1 .s; r.s; 00) (/: I K \...... M n ) satisfying 114D Differential Topology the following conditions: (i) For any closed n- simplex 0" of K, fl 0": 0"--> M n is a Cr-mapping; (ii) for any point p of 0", the tJacobian matrix of f I 0" has rank n at p. Then we say that (K,f) is a C' -triangulation of the C' -manifold M n , and the Cr-structure of M n is compatible with the triangulation (KJ), Concerning C'- triangulations of C' -manifolds, we have the following results (S. Cairns [2J, 1. H. C. White- head [3J): (i) any C'-manifold (l.s;r.s; 00) has a C'-triangulation, and any C'-triangulation of the boundary can be extended to the whole manifold; (ii) for a C' -triangulation (KJ) of a C'-manifold M n , the triangulated space (K,f; M n ) is a tcombinatorial manifold; (iii) for two C'-triangulations (KlJd and (K 2 J2) of a C' -manifold Mn, the triangulated spaces (K1Jl; M n ) and (K 2 J2; M n ) are combinatori- alIyequivalent. Conversely, for a combinatorial manifold (KJ; M n ), a differentiable structure on M that is compatible with the triangulation (KJ) is called a smoothing of M. The question of whether a smoothing of a combinatorial mani- fold exists is called the smoothing problem. We have some criteria for its solvability. One is expressed in terms of transverse fields, and another in terms oftmicrobundles ( 147 Fiber Bundles P), We also have the following method using the theory of tobstruction (1. Munkres [9J; M. W. Hirsch and B. Mazur [lOJ), Let r k be the group of oriented differen- tiable structures on the tcombinatorial k- sphere (an exact definition of this group is given in Section I). Let K i be the i-tskeleton of a combinatorial n-manifold (KJ; M n ) and ex a smoothing of a neighborhood U of f(Ki) in M n . Then there exists an obstruction cocycle CaE Zi+l (K, rJ satisfying the following con- ditions: (i) if C,(O"i+I) =0 for an (i + I)-simplex O"i+1 of K, the smoothing ex can be extended over a neighborhood of f(Ki U O"i+l), and vice versa; (ii) if there exists another smoothing ex' of a neighborhood of f(K i ) that coincides with the smoothing ex of a neighborhood of f(KH), then C a , - C, is a tcoboundary; conversely, for any (i + 1 )-coboundary b with coefficients in r j , there exists a smoothing IJ.' of a neighbor- hood of f(Ki) such that b= C" - Ca. Using this method we can prove that for n.s; 7 all combinatorial manifolds of dimension n are smoothable. D. Embedding and Immersion Theorems In the following, M n and x m are Coo-manifolds of dimensions nand m, respectively. Two timmersions fo, fl : M n --> X m are said to be regularly homotopic if there exists a thomotopy it: M n -->X m , O.s; t.s; 1, such that r; is an immer- 
114E Differential Topology sion for each t and the induced mapping (tdif- ferential of J,) dJ,: T(M n ) --> T(X m ) on the tan- gent spaces naturalIy gives rise to a continuous mapping over T(Mn) x I. Two tembeddings are isotopic if they are regularly homotopic and the homotopy J, is an embedding for each t, Given M n and X m , a fundamental problem of embedding (immersion) theory is to classify the embeddings (immersions) of M n in x m accord- ing to their isotopy (regular homotopy) classes. Whitney proved that a continuous mapping f:Mn-->x m can be approximated by an immer- sion ifm2n and by an embedding ifm2n+ 1 [1]. The folIowing results are also due to Whitney [1]: Any two immersions of M" in X m that are homotopic are regularly homotopic if m2n+2, and any two embeddings of M n in x m that are homotopic are isotopic if m  2n +3. The range m2n+3 is called the stable range of embeddings. In [11, 12J (1944) Whitney improved his classical theorems and showed that M n can always be immersed in R 2 n-1 for n > 1 and M n can always be embedded in R 2n . The methods used in his proof have played an important role in the subsequent development. Classifi- cation of immersions of the n-sphere sn in Rm was determined by Smale [13]. Let p: T(Mn)--> M n and p': T(xm)-->x m be the projections of the tangent bundles of M n and X m , respec- tively. A mapping <p: T(M n )--> T(X m ) is a linear fiber mapping (linear fiber map) if, for each xEM n , <p(p-I(X)) is contained in a fiber of T(xm) and <p I p-1(X) is a linear mapping of rank n. A linear homotopy <p,: T(M n )--> T(xm) (O.s; t.s; 1) is a homotopy such that each <p, is a linear fiber mapping. The folIowing theorem of Hirsch [14] is fundamental to immersion theory: Assume n < m. If <p: T(M n )--> T(X m ) is a linear fiber mapping, the mapping ip: M n --> x m induced by <p can be approximated by an immersion f: M n --> X m such that df and <p are linearly homotopic, Two immersions f, g: Mn-->x m are regularly homotopic if and only if df and dg are linearly homotopic. If M n is im- mersible in Rm+" where m> n with r linearly independent fields of tnormal vectors, then M n is immersible in R m . In particular, if M n is a n- manifold ( Section I), M n is immersible in Rn+l. Let 8(M n ) be the set of isomorphism classes of real tvector bundles over M n , and consider 0: 8(M n )-->KO(M n ) ( 237 K-Theory). An element EKO(Mn) is said to be positive if  is in the image of e. If oE KO(M n ), the geo- metric dimension of o' written g(o), is the least integer k such that o + k is positive. Then Hirsch's theorem [14J can be expressed as folIows: Mn is immersible in Rn+k (k>O) if and only if g(n - r(M")).s; k, where r(Mn) is the tangent bundle of M n . 432 A. Haefliger [15] obtained the folIowing important result: Let M n be compact and t(k - 1 )-connected, and let x m be k-connected. Then any continuous mapping of M n in x m is homotopic to an embedding if 2k < n, m  2n- k+ 1, and any two homotopic embeddings of M n in x m are isotopic if 2k< n+ 1, m2n- k+2. Thus ifm>3(n+ 1)/2, any two em- beddings of sn in R m are isotopic. The range m>3(n+ 1)/2 is calIed the metastable range. Haefliger further classified the embeddings of s4n-1 in R 6 n and showed the existence of em- beddings of S4 n -1 in R 6 n that are not isotopic to the natural one. More complete results for the classification of embeddings of thomotopy n-spheres in sm were obtained by J, Levine [16]. Furthermore, Levine proved the folIow- ing unknotting theorem in higher-dimensional knot theory [16]: Let ,f:s n - 2 -->sn be a COO_ embedding for n  6; then f(S.-2) is unknotted if and only if sn - f(sn-2) is homotopy equiva- lent to Sl ( 235 Knot Theory G; also 65 Combinatorial Manifolds D). We list some results about embeddings and immersions. If M n is noncompact, M n can always be embedded in R 2n - 1 ; if M n is a non- compact n-manifold, M n can always be im- mersed in Rn; if M n is compact and orientable and n > 4, M n can always be embedded in R 2 n-1. E. Nonembedding and Nonimmersion Theorems We denote the ttotal Stiefel-Whitney class of M n by w(M n ) and the ttotal Pontryagin class of M n by p(Mn) ( 56 Characteristic Classes). Then (w(M n )) -1 (E H*(M n ; Z2)) and (p(M n ))-l (EH*(Mn;Z)) can be written as W(Mn)=LWi(M n ) (W i EH i (M n ;Z2)) and p(Mn) = LP;(M n ) (PiEH4i(Mn; Z)). Then the property of characteristic classes for the tWhitney sum implies the folIowing theorem: If M n can be immersed in Rn+k, then wi(Mn)=O for i>k and Pi(M n ) = ° for i> [k/2]. Furthermore, if M n can be embedded in Rn+k, then "HMn)=O, As an application, these results yield the nonembed- ding (nonimmersion) theorem for projective spaces ( Appendix A, Table 6.VII). Sharper theorems were obtained subsequently. In particular, Atiyah proved the following: Let Ai (i =0,1,...) be texterior power operations (237 K-Theory), and let yi be the operations defined by the formal power series Lo yit i = (Lohi)(l-t)-l. Then yi(n-r(Mn))==o for i> k (i  k) if M n can be immersed (embedded) in Rn+k. Furthermore, we have an interesting result for the differentiable case, For any posi- tive integer q, there exists a differentiable 
433 manifold M n such that M n is immersible in R k but not embeddable in Rk+q. F. Handlebodies Let W be a CXJ-manifold with boundary aw. Then the boundary aw has a neighborhood U that is homeomorphic to awx R+, where R+ = [0, 00). Let WI> W 2 be Coo -manifolds with boundaries and f: aWl -->aw 2 be a diffeomor- phism. Then the quotient space M of WI U W 2 obtained by identifying points corresponding under f has a natural differentiable structure. This construction of the Coo -manifold M from the Coo-manifolds WI' W 2 is caUed pasting together the boundaries. Now we consider the ttopological product WI x W 2 of manifolds WI' W 2 with boundaries aw" aw 2 . WI x W 2 -aWl x aW 2 has a natural atlas of class Coo. By introducing a suitable atlas of class Coo in a neighborhood of aWl x aW 2 in WI x W 2 , we obtain a Coo-manifold with boundary homeomorphic to WI x W 2 . More generally, let M be an n-manifold with boundary, let N be a finite union of submani- folds of dimension .s; n - 2 in aM, and suppose that M has a corner along N. Extension of the C""-structure of M -N over M as in this para- graph is calIed straightening the angle, Let D n be the oriented unit tn-disk in the n-dimensional Euclidean space R n , M and M be oriented compact Coo-manifolds, and /;:Dn-->M? (i= 1,2) be tCOO-embeddings,f1 torientation-preserving and f2 torientation- reversing. Then, pasting together the bound- aries of M-lntfl(D") and M-lntf2(Dn) by the mapping f2 0 fl- l , we obtain an oriented Coo-manifold, calIed the connected sum of M and M and denoted by M # M. The con- nected sum Mf # M has the orientation in- duced from those of M and M, and its dif- ferentiable structure is uniquely determined independently of the mappings /;. Let sn be the natural n-dimensional sphere. Then we have M n # sn;:::; M n (here;:::; means diffeomor- phic), (M I #M 2 )#M 3 ;:::;M I #(M 2 #M,), HI # M 2 ;:::;M 2 #M j . Let M n be a manifold with boundary and f:(aD') x Dn-s-->aM" be a Coo-embedding. Then we calI the quotient space X(M";f;s) of MnU(D s x D n - s ) obtained by the identification of corresponding points under f the manifold with a handle attached by f Also, we calI the construction of X(Mn;f; s) from M n attaching a handle and call D S x Dn-s an s-handle. After straightening the angle, X(Mn;f; s) is consid- ered naturally a Coo-manifold with boundary. Let/;:aDt x D;-'-->aM n (i= 1, ...,k) be em- beddings whose images are mutually disjoint. Then similarly, using embeddings fl> ..., h, we can define a Coo-manifold with handles X(M n ; 114F Differential Topology fl' ... ,h; s). In particular, X(X(... (X(X(D n ; fl; Sl); f2; S2)''');; s)) is caUed a handlebody. Let M n - l be an oriented (n-l)-mani- fold and f:3D s x D n - s -->(M"-1_3M"-I) an orientation-preserving Coo -embedding. Then, by straightening the angle, the quotient space of (M"-I - Intf(aD S x Dn-s))U(D S x aD n - s ) obtained by the identification of points cor- responding under flaD' x aD n - s is a C OO _ manifold x(Mn-I;f). We say this manifold is obtained by a spherical modification (or sur- gery) of type (s,n -s) from M n - l . The manifold x(Mn-l;f) has a natural orientation, and ax(Mn-I;f)= aM n - l . The process of spherical modification has the folIowing relation to that of attaching a handle: Let W be an n- dimensional manifold and f:aD s x Dn-s-->aW an embedding. Then aX(W;f; s) = x(aW;f). When W=M n - 1 x [0, 1J and f:aD s x D n - s --> M x {1}, aX(W;f; s)= X(M x {I };f)U M x {O}, and therefore M is cobordant ( Section H) to X(M ;f). Conversely, let M I be cobordant to M 2 . Then we can obtain M 2 from M I by a finite sequence of spherical modifications (A, Wallace [17J, Milnor). Let M n be an n- dimensional Coo-manifold and f: Mn-->R I be a Coo-function. If f satisfies the folIowing con- ditions, then it is calIed a nice function: (i) AIl tcritical points of fare tnondegenerate; (ii) for any critical point p of 1, the tindex ( 279 Morse Theory) of fat p is equal to f(p). We have the folIowing theorems. 1. Let M be a compact Coo-manifold. Then there exists a nice function on M (M. Morse, Smale [18J, WalIace [17J), 2. Let M be a compact Coo-manifold and f: M --> R I a Coo-function alI of whose critical points are nondegenerate. Suppose that the number of critical points on f-I [ -8, 8J is k and that they are alI contained in f-I(O). Sup- pose further that the indices of these critical points are all equal to s. Then f- I (-00, 8J is diffeomorphic to the manifold with handles X(r l ( -00, -8J;fI,'" ,.fk; s) (Morse, Thom, Smale [18J). 3. tGeneralized Poincare conjecture. Let M n be an n-dimensional thomotopy sphere of class Coo (n;:: 5), Then M n can be obtained by past- ing together the boundaries of two n-disks. Consequently, Mn is homeomorphic to the n- dimensional sphere sn (Smale [18J, H. Yama- suge [19J). 4. Let M n be a tcontractible compact n- dimensional manifold (n> 5), with aM con- nected and simply connected. Then Mn is dif- feomorphic to the n-disk D n (Smale [18J). 5. Let Mf, M be oriented, compact, simply connected n-dimensional manifolds (n >4). If M l is h-cobordant ( Section I) to M 2 , then M I is diffeomorphic to M 2 (h-cobordism theo- rem; Smale [20J). 
114G Differential Topology 6. If Mr x R k is diffeomorphic to M x R k , we say that Mr is k-equivalent to M, Let M I , M 2 be compact n-dimensional manifolds of the same thomotopy type. Then M I is k-equivalent to M 2 if and only if for a thomotopy equiva- lence f:M I -->M 2 , r(M l ) EB f. k is equivalent to f*r(M 2 ) EB €k> where r(M;) is the ttangent bundle of M i and €k the ttrivial vector bundle of dimension k (Mazur), 7. In some special cases the classification of manifolds by diffeomorphism is completely determined. (i) The classification of simply connected 5-dimensional manifolds M with vanishing second tStiefel- Whitney class w 2 (M) is determined by H2(M). (ii) A 2-connected compact 6-dimensional manifold is dif- feomorphic to either S6 or the connected sum of a finite number of copies of S3 x S3 (Smale [20]). Besides these, the classifications of (n - 1 )-connected 2n-dimensional manifolds (Wall [8]) and (n-1)-connected (2n+ 1)- dimensional manifolds (Tamura [21], Wall [22]) have been obtained, G. Thom Complexes Let  be a real n-dimensional vector bundle over a paracompact space X, A be the total space of the associated bundle of  with fiber the closed n-disk D n , and A be the total space of the associated bundle of  with fiber the (n-1)-sphere aDn. Then the quotient space X{=AjA, obtained from A{ by contracting A to a point, is called the Thom space of the vector bundle . If X is a tCW-complex, then the Thom space X of  has the homotopy type of a CW-complex and is calIed a Thom complex. The Thom space X has the canon- ical base point p corresponding to A. For a coefficient group G, denote by G the t\ocal system of coefficient groups with stalk G associated with the torientation sheaf of an Rn_ bundle . Then we have the Tbom-Gysin isomorphism: H"(X; Gd  H"+q(X, p{; G), Hq(X; G)  Hn+q(X, p; G). Let G be a closed subgroup of the ortho- gonal group O(n) and BG be the base space of the universal n-dimensional vector bundle YG with structure group G. Then we can take a connected CW-complex as BG. We denote the Thom complex of the vector bundle YG by MG and calI it the Thom complex associated with (G, n). The Thom complex associated with (G, n) is (n-1)-tconnected. If G is connected, then we have nn(MG)Z, nn(MO(n))Z2' H"(MG; Z)  Z, and H"(MO(n); Z2)  Z2 (Z2 stands for the quotient group Zj2Z), The generator U of H n (M G; Z) is calIed the funda- 434 mental (cohomology) class ofthe Thorn com- plex MG. For a general Thom space X, we can also define the fundamental class of X{ using the Thom-Gysin isomorphism. A Coo-submanifold WP of a compact mani- fold v n is a tsupport of a tsingular cycle and represents a homology class Z E H p( V n ; G) (G = Z2 or Z). In this case we say that the homology class z of V n is realizable by a sub- manifold. A homology class Z E H n - k ( V n ; G) is realizable by a submanifold if and only if there exists a mapping f: Vn-->MO(k) (or MSO(k)) such that f*(U) is the dual cohomology class of Z (Thorn [23]). The homotopy group nn+k(MO(n)) (resp. nn+k(MSO(n)) (n> k) is determined only by k for any n up to isomorphism. It is calIed the k-dimensional stable homotopy group of the Thorn spectrum MO= {MO(n)} (MSO = {MSO(n)}) and is denoted by nk(MO) (nk(MSO)). For the tunitary group U(n) and the tsymplectic group Sp(n), we define Thom complexes MU(n) and MSp(n) as the Thom complexes associated with (U(n), 2n) and (Sp(n),4n) by the canonical inclusions U(n) c O(2n) and Sp(n) c O(4n), respectively, The k-dimensional stable homotopy groups nk(MU) = Jim n2n+k(MU(n)), nk(MSp) = Jim n 4n +k(MSp(n)) of Thom spectra MU = { ..., MU(n), SMU(n), MU(n+ 1), ",}, MSp= { "', MSp(n), SMSp(n), S2 MSp(n), S3 MSp(n), MSp(n+ I),...} can be defined similarly to the case of nk(MO), where SMU(n) denotes the treduced suspension of MU(n). The stable homotopy groups of Thom spectra are cal- culated in connection with the cohomology groups utilizing the Thom-Gysin isomorphism ( 202 Homotopy Theory T) (Thom [23], Milnor [24]). H. Cobordism Cobordism theory is a theory of classification of differentiable manifolds initiated by L. S. Pontryagin and V. A. RokhJin, who called it intrinsic homology. The theory was brought to maturity by Thom [23]. Its fundamental prob- lem is to dtermine whether a given compact manifold is the boundary of another manifold. Corresponding cobordism theories for com- binatorial and topological manifolds are being developed (WalI, R. Williamson). We consider only compact Coo-manifolds that are not necessarily connected. Let :D be the set of all diffeomorphism classes of Coo-manifolds, and let :Do be the set of all orientation-preserving diffeomorphism classes of oriented Coo-manifolds. For an oriented manifold V, we write - V for the manifold with reversed orientation. 
435 For two compact k-manifolds V, WE :Do, we say that V is cobordant to W if there exists a compact (k+ I)-manifold XE:Do such that ax = VU ( - W), In this definition, considering :D instead of :Do, we say that V is cobordant to W mod2. The equivalence class of V k with respect to the cobordism relation (the cobordism relation mod 2) is called an oriented (un- oriented) cobordism class and is denoted by [VkJ ([V k J2)' The set of all oriented (un- oriented) cobordism classes [VkJ ([V k J2) of k- manifolds forms an Abelian group Qk (91 k ) by the natural addition [VkJ + [WkJ = [V k U WkJ ([V k J2 + [W k J2 = [V k U W k J2)' This is called the k-dimensional oriented (unoriented) cobor- dism group. Define the product [VkJ x [W'] = [V k X W'] ([V k J2 X [W 1 J2 =[V k x W l J2)' Then the direct sum Q = L Qk (91 = L 91 k ) forms an anticommutative (commutative) tgraded algebra, which is called the cobordism ring or Thom algebra. We have the following theorems. 1. Qb 91k are isomorphic to the stable homo- topy groups nk(MSO), nk(MO) of the Thom spectra MSO, MO, respectively (Thom's funda- mental theorem) [23]. 2, For a natural number knot of the form 2i-1, there exists a compact manifold P(k), and 91 is a polynomial ring over Z2 with gen- erators {[P(k)]21 k#2i-l}. Q@ Q (Q is the field of rational numbers) is a polynomial ring over Q with generators {[PC 2 mJ 1m  I}, where PC 2 m is the complex 2m-dimensional projec- tive space (Thom [23J). Moreover, Milnor [24] proved that the p-component of Qk is zero for an odd prime p, Wall proved that the 2-component ofQk contains no element of order 4, using a certain exact sequence which contains the natural homomorphism Qk --> 91k [25]. 3. Let T be the ideal of Q consisting of all torsion elements in Q. Then QjT is a poly- nomial ring over Z with generators {[Y 4k J I k I}, where we can take for Y 4k a complex 2k-dimensional nonsingular algebraic variety (Milnor). tCharacteristic numbers are invariants of cobordism classes, Combining the results stated in 2, we have the following theorem: 4. Let V, W be manifolds. V is cobordant to W mod2 if and only if they have the same corresponding tStiefel- Whitney numbers. Let V, W be oriented manifolds. V is cobordant to W if and only if they have the same corre- sponding Stiefel-Whitney numbers and tPon- tryagin numbers. The tindex of an oriented 4k- dimensional compact manifold is a cobordism class invariant and can be expressed as a linear combination with rational coefficients of tPon- tryagin numbers ( 56 Characteristic Classes). A manifold whose tstable tangent bundle 114 I Differential Topology has a complex structure is called a stably (or weakly) almost complex manifold, Let V, W be 2n-dimensional compact stably almost com- plex manifolds (n  1). We say that V is C- equivalent to W if they have the same tChern numbers. The set of all C-equivalence classes [VJc forms the complex cobordism group Un as in the (real) case of cobordism groups (the existence of the inverse of an element is not trivial). Introducing multiplication into the direct sum U* = L Un by the prod uct of mani- folds, we obtain the complex cobordism ring. We have Unn2n(MU) and U* is a poly- nomial ring over Z with generators {[Y;kJcl k  I}, where we can take for Y;k a complex k-dimensional nonsingular algebraic variety (Milnor [24]). I. h-Cobordism Groups of Homotopy Spheres A manifold M is said to be parallelizable if its tangent bundle r(M) is trivial, and almost parallelizable if there exists a finite number of points Xi in M such that M - U {x;} is paral- lelizable, A manifold M is called stably paral- lelizable (or s-parallelizable) if the tWhitney sum r EEl £1 of its tangent bundle r(M) and the trivial line bundle £1 is trivial. A manifold M n is called a n-manifold if M n has a trivial nor- mal bundle when it is embedded into a Eucli- dean space R N (N > 2n). A manifold M n is a n- manifold if and only if M n is s-parallelizable. The concepts defined in this paragraph are related by inclusions as follows: parallelizable  s-parallelizable  almost parallelizable. For a connected manifold with boundary, these three concepts are equivalent. tGroup manifolds are parallelizable. An n-dimensional manifold homeomorphic to the n-dimensional sphere is parallelizable if and only if n = 1, 3, 7 (Milnor [26]), Suppose that we are given an (n -I)-dimensional sphere Sn-I (n even). We can consider the problem of determining the maximal number r such that there exists a tangent r-frame field over sn-l. J. F. Adams solved this problem using tK-theory ( 237 K-Theory) as follows: Let n=(2a+ 1)2 b , b= c + 4d, where a, b, c, d are integers and O.s; c.s; 3. Put p(n) = 2 c + 8d. Then r = p(n) -1. On the other hand, homotopy spheres are n-manifolds [27]. Let V" V 2 be oriented compact manifolds. Then we say that VI is h-cobordant to V 2 if there exists an oriented manifold W with boundary a W= VI U (- V 2 ) such that Vi (i = 1,2) is a deformation retract of W. The set of all h- cobordism classes of oriented homotopy n- spheres forms an Abelian group en with con- nected sum as addition. This is called the (h- cobordism) group of homotopy n-spheres. We 
114 J Differential Topology denote by en(an) the subgroup of homotopy n- spheres that are boundaries of n-manifolds. Kervaire and Milnor gave certain exact se- quences that contain the groups en, the stable homotopy groups G n of spheres, and the stable homotopy groups nn(SO). They clarified rela- tions among these groups and proved that en(an)=O for n even, en(an) = finite group for n odd # 3, On/On ( an) c Coker I n , etc. [27], where I n : nn(SO)--> G n is the tJ-homomorphism. From these results it follows that the On (n # 3) are finite Abelian groups, On = 0 for n < 7, # 3, e 7  Z2 8, etc, ( Appendix A, Table 6.1). By pasting together the boundaries of two n-disks D'j and D2' we obtain an oriented manifold which is considered as a smoothing of the combinatorial n-sphere. The set of all orientation-preserving diffeomorphism classes of smoothings of the combinatorial n-sphere forms an Abelian group r n with connected sum as addition, This is called the group of oriented differentiable structures on the com- binatorial sphere. By the generalized Poincare conjecture and the h-cobordism theorem, we obtain rn=e n (n#3,4). Furthermore, r 3 =0 and r 4 = 0 (J. Cerf [28]). The group r n can also be defined as follows: Let Diff+ D n , Diff+ sn-1 be the groups of orientation-preserving diffeo- morphisms of Dn, sn-1, respectively, where multiplication is defined by composition. Let r:Diff+ D+ -->Diff+ sn-1 be the homomor- phism induced by the restriction D n -->sn-1. Then the image of r is a normal subgroup, and r n = Diff + sn-1 /r (Diff + Dn). J. Surgery Theory A process of modifying a manifold into an- other by a sequence of spherical modifications is called surgery on the manifold ( Section F). The technique of surgery proved to be a powerful tool for the development of differen- tial topology in the 1960s. Kervaire and MiI- nor exploited this technique in their study of homotopy spheres [27]. W. Browder [29] and S. P. Novikov [30] used surgery to construct differentiable manifolds with the same homo- topy type as that of a given Poincare complex in dimension greater than 4. To explain the main points of surgery the- ory, we introduce some terminology. A pair of finite tCW complexes (X, Y) is called a Poincare pair of formal dimension n if there exists a class J1E Hn(X, Y; Z) called the funda- mental class such that the tcap product J1'--'; Hq(X; Z)-->Hn-q(X, Y; Z) is an isomorphism for each q. When Y = 0, X is called a Poin- care complex. Let M be an n-dimensional smooth manifold. Consider an embedding g of M into a Euclidean space Rn+k, where k is 436 large enough (i.e., k  n + 2). Then the isomor- phism class of the tnormal bundle of 9 is inde- pendent of the choice of embedding, and it depends only on M. Any representative of the isomorphism class is called the normal k-vector bundle of M and is denoted by vt. Let (X, Y) be a Poincare pair of formal dimension n, and let  be a real k-vector bundle over X. A normal mapping (normal map) (f, b) consists of a degree 1 mapping f:(M, aM)-->(X, Y) and a tbundle mapping b: vt --> which covers f A normal mapping (f, b): (M, aM)-->(X, Y) is normally cobordant to a normal mapping (f',b'):(M',aM')-->(X, Y) if aM=aM' and there exist a smooth (n+ I)-manifold Wand a mapping F: W-->X such that aW=MU ( - M') U aM x I, F is covered by a bundle mapping B:vfv-->, and (F, B)I(M,aM)=(fb), (F, B) I (M', aM')=(f', b'). The fundamental theorem of surgery theory is formulated as follows: Let (X, Y) be a Poin- care pair of formal dimension n. Suppose that X is simply connected and n5. Let (f,b): (M,aM)-->(X, Y) be a normal mapping that restricts to a homotopy equivalence on the boundary f I aM: aM --> Y. Then (f, b) is nor- mally cobordant to a normal mapping (I', b'): (M', aM')-->(X, Y) with f':M' -->X a homotopy equivalence if and only if a well-defined ob- struction O"(f, b) vanishes. O"(f, b) is called the surgery obstruction. When n is odd, O"(f, b) always vanishes. When n = 0 mod 4, O"(f, b) is an integer. If Y= 0, it is given by (I(M)- I(X))/8, where I( ) denotes the tindex of the manifold or of the Poincare complex. If n == 2 mod4, O"(f, b) is an integer mod 2, called the Arf-Kervaire invariant. For a thorough devel- opment of simply connected surgery  [31]. In the PL or even in the topological cate- gories, surgery theory can be developed simi- larly (Browder and Hirsch, R. C. Kirby and L. C. Siebenmann [32]). In his study of Hauptvermutung for simply connected manifolds, D. Sullivan reformu- lated surgery in terms of the "surgery exact se- quences" involving the classifying spaces G/PL or G/O. (These spaces are "homotopy theoretic fibers" of BPL-->BG or BO-->BG respectively.) In the special case where X is a closed simply connected PL n-manifold, the surgery exact sequence can be formulated as follows (n  5): j z n e O-->hT(X)--> [X, G/PLJ--> :2 (n=O mod4), (n=2 mod4), (n odd). The set hT(X) is the totality of equivalence classes of pairs (MJ), where M is a closed PL n-manifold and f is a homotopy equivalence M -->x. Two such pairs (M'J') and (M"J") are defined to be equivalent if there exists a PL 
437 homeomorphism h: M' --> M" such that f" 0 h is homotopic to 1'. The set of homotopy classes [X, G/PLJ of mappings X -->G/PL is appro- priately identified with the set of all normal cobordism classes of normal mappings with target X or with the set of an PL reductions of the Spivak normal fiber space of X as a Poincare complex [31]. The mapping 17 can be defined naturally, and the mapping e cor- responds to the assignment of the surgery obstruction. The image 17(MJ)E [X, G/PLJ is sometimes caned the normal invariant of (MJ), Sunivan reduced the classification problem of manifolds within a given homo- topy type to a homotopy theoretic problem of the classifying spaces G/PL, G/G [33J (also  [31,34J). In the latter half of the 1960s, surgery theory was extended by Wall to cover all compact non-simply connected manifolds which are not necessarily orientable. He introduced a certain Abelian group Ln(n; w), now called the Wall group, which functorially depends on the fundamental group n =ndX), with the orienta- tion character w:nt!X)-->Z2, and which is of period 4 with respect to the formal dimension n of X. In Wall's theory, the surgery obstruc- tion O"(f, b) takes its value in this group. The group structure of Wall groups has been cal- culated in many cases. Sullivan's exact se- quences are extended to the non-simply con- nected cases [34]. Making use of the extended exact sequences, Wall and W. C. Hsiang and J. L. Shaneson c1assifield homotopy tori. The result played an important role in the work of Kirby and Siebenmann on the tannulus con- jecture and stable homeomorphisms which led to the solution of tHauptvermutung and trian- gulation problems on topological manifolds in 1969 ( 65 Combinatorial Manifolds). Sur- gery theory has many applications to other geometric problems. Among them are find- ing missing boundaries for open manifolds, equivariant surgery, homology surgery [35J, and surgery on codimension two submanifolds [36,37]. K. 4-Dimensional Manifolds The results in differential topology discussed above are mainly concerned with manifolds of dimension 5. For 4-manifolds, however, because of their peculiar nature which is not observed in other dimensions, many funda- mental problems had remained unsolved until M. H. Freedman's epoch-making paper [54J appeared in 1982, His paper, together with S. K. Donaldson's theorem [55J which was published a little later, was a breakthrough in the theory of 4-manifolds. 114K Differential Topology It was Rokhlin who first discovered a strange property of 4-manifolds [38]. Ro- khIin's theorem states: Let M be a closed oriented smooth 4-manifold. If M is almost parallelizable (or equivalently, if M is a spin 4- manifold), then the index of M is divisible by 16. Milnor and Kervaire gave an alternative proof of this theorem from the differential- topological point of view [39]. Freedman and Kirby and y, Matsumoto gave geometric or elementary proofs. Until 1981, Rokh1in's theorem had been a constant source of many curious phenomena in 4-dimensional topology. The list contains, for example: (1) the class (4m+2, 4n+2)E H 2 (S2 x S2)  Z EEl Z cannot be represented by a smoothly embedded 2-sphere in S2 x S2 (Ker- vaire and Mi1nor). This result was improved by W.-c. Hsiang and R. H. Szczarba, A. G. Tristram, and Rokhlin, (2) The h-cobordism theorem fails to hold in 4 or 3 dimensions (T. Matumoto, Siebenmann [40J). (3) There exists a closed smooth 4-manifold M that is homotopy equivalent to the real projective space Rp4, but M *- Rp4 (Cappell and Shane- son). (4) There exists an open 4-manifo1d W 4 properly homotopy equivalent to S3 x R but distinct (Freedman), Closed, connected, simply connected 4- manifolds M and N are homotopy equivalent if and only if the intersection forms defined on the 2-dimensional (co)homo1ogy groups are equivalent as inner product spaces over Z [41]. Wan [42J proved that if closed simply connected smooth 4-manifo1ds M and N are homotopy equivalent, then they are h- cobordant. Moreover, if M and N are h- cobordant, then there exists an integer k  0 such that M # k(S2 X S2) is diffeomorphic to N # k(S2 X S2), About 1973, using a certain infinite repeti- tion process, A, Casson constructed a family of noncom pact smooth 4-manifolds which are properly homotopy equivalent to the open 2- handle D 2 x R 2 . A 4-manifold belonging to the family is called a Casson handle. He observed that if an the Casson handles are diffeomor- phic to D 2 x R 2 , then theories analogous to surgery and the h-cobordism theorem in higher dimensions can also be developed in dimension 4. In his 1982 paper [54J, Freedman proved that each Casson handle is homeomorphic to D 2 x R 2 . (It was proved later that Casson handles are not, in general, diffeomorphic to D 2 x R 2 ,) This result and the proper h- cobordism theorem, also due to Freedman, proved many fundamental results on the topo- logical structure of 4-manifo1ds: (1) If closed simply connected smooth 4-manifo1ds M, N are h-cobordant, they are homeomorphic, In 
114L Differential Topology particular, a 4-dimensional homotopy 4-sphere is homeomorphic to the 4-sphere S4. (Proof of the 4-dimensional Poincare conjecture.) (2) A topological 4-manifold properly homotopy equivalent to R 4 is homeomorphic to R 4 (3) Given a nonsingular symmetric bilinear form w over Z, there exists a closed, connected, simply connected topological 4-manifold whose intersection form is equivalent to w. (Therefore Rokhlin's theorem cannot be ex- tended to topological 4-manifolds. From this, it follows that there exist simply connected topological4-manifolds which are nonsmooth- able Or even nontriangulable.) (4) The homeo- morphism class of a closed connected simply connected 4-manifold is determined by the intersection form and the Kirby-Siebenmann class. (This statement is an improved one by F. Quinn [56].) Donaldson [55J revealed a sharp contrast existing between smooth and topological 4- manifolds. Donaldson's theorem states: If the intersection form of a closed, connected, sim- ply connected smooth 4-manifold is positive definite, then the form is equivalent to the standard one < 1 > EB < 1 > EB . .. EB < 1 >. This theorem, together with Casson's theory and Freedman's result (2) stated above, implies that there exists an exotic differential strUc- ture on a 4-dimensional Euclidean space R 4 . Moreover, as an application of Donaldson's theorem, the problem of representing a 2- dimensional homology class of S2 x S2 by a smoothly embedded 2-sphere was completely solved (K. Kuga [57J): the class (p,q)EH 2 (S2 x S2) is represented by a smoothly embedded S2 if and only if Ipl.s; 1 Or Iql.s; 1. Many problems concerning differential structUres on 4-manifolds remain unsolved. It is not known whether an exotic smooth 4- sphere exists. Among other results, the proof of the 4- dimensional annulus conjecture by Quinn [56J is remarkable. L. Miscellaneous Results The Browder-Livesay invariant. Let En be a homotopy n-sphere, T an involution on En, that is, a smooth mapping T:En-->En with To T= idE"' Assume that T is free from fixed points. The involution T is said to desuspend if there exists a homotopy (n-1)-sphere I:' smoothly embedded in I:" which is invariant under the action of T. Browder and G. R. Livesay defined an obstruction O"(T) in the group O(n: even), Z (n = 3 mod4), Z2 (n = 1 mod4) such that O"(T)=O if and only if T desuspends (provided that n  6) [43]. The invariant O"(T) is now called the Browder- 438 Livesay invariant. The technique used there was equivariant surgery under the action of T. De Rham homotopy theory. A new approach to topology of manifolds was invented by Sullivan [44], who considered the exterior algebra of differential forms (with polynomial coefficients) on a simplicial complex. Through the construction of the minimal model for the algebra of differential forms, he recovers the rational homotopy type of the complex under some reasonable condition on the fundamental group. Following upon the classical de Rham theorem which recovers the (real) cohomology algebra from the algebra of differential forms, this approach is called the de Rham homotopy theory. This method has proved to be useful for the explicit calculation of the algebraic topological invariants of tKiihler manifolds, tloop spaces, cross-section spaces, talgebraic varieties, etc. Kirby calculus. Kirby [45J initiated a link- theoretic approach to 3- and 4-manifolds. Let L be a link in S3. Suppose that each compo- nent of L is given a framing, which means a trivialization of a tubular neighborhood. Such a link is called a framed link. A framing of a component is determined by the tlinking number between the component and its re- placement along the framing. By attaching 2-handles to the 4-disk D 4 along a framed link in S3 = cD 4 , we obtain a handlebody; here we denote it by W L . It is known that each closed connected oriented 3-manifold is obtained as the boundary of such a handlebody (W. B. R. Lickorish, Wallace). Kirby proved that for framed links Land L', the boundaries aWL and aWL' belong to the same orientation- preserving diffeomorphism class if and only if L is transformed into L' by a sequence of the folIowing two kinds of elementary operations: (i) adding or deleting a trivial knot (separated from L) with framing ::1::1, (ii) band summation of components corresponding to "handle sliding." Kirby's approach is called Kirby calculus on framed links. For applications  [45,53]. References [IJ H. Whitney, Differentiable manifolds, Ann. Math., (2) 37 (1936), 645-680. [2J S. S. Cairns, Triangulation of the manifold of class one, Bull. Amer. Math. Soc., 41 (1935), 549-552. [3J J. H. C. Whitehead, On C1-complexes, Ann. Math., (2) 41 (1940),809-824. [4J M. Morse, The calculus of variations in the large, Amer. Math, Soc. Colloq, Pub!., 1934. [5J J. W. Milnor, On manifolds homeomor- 
439 phic to the 7-sphere, Ann. Math., (2) 64 (1956), 399405. [6J M. A. Kervaire, A manifold which does not admit any differentiable structure, Com- ment. Math. Helv., 34 (1960),257-270. [7J I. Tamura, 8-manifolds admitting no dif- ferentiable structure, J. Math. Soc. Japan, 13 (1961),377-382. [8J C. T. C. Wall, Classification of (n -1)- connected 2n-manifolds, Ann. Math., (2) 75 (1962),163-189. [9J J. Munkres, Obstructions to the smooth- ing of piecewise differentiable homeomor- phisms, Ann. Math., (2) 72 (1960), 521-554. [IOJ M. W. Hirsch and B. Mazur, Smoothings of piecewise linear manifolds, Ann. Math. Studies, Princeton Univ. Press, 1974. [11] H. Whitney, The self-intersections of a smooth n-manifold in 2n-space, Ann. Math., (2) 45 (1944),220-246. [12J H. Whitney, The singularities of a smooth n-manifold in (2n - 1 )-space, Ann. Math., (2) 45 (1944),247-293. [13J S. Smale, The classification of immersions of spheres in Euclidean spaces, Ann. Math., (2) 69 (1959), 327-344. [14J M. W. Hirsch, Immersions of manifolds, Trans. Amer. Math. Soc., 93 (1959), 242-276. [15] A. Haefliger, Plongements differentiables de varietes dans vari6tes, Comment. Math. Helv., 36 (1961), 47-82. [16J J. Levine, A classification of differentiable knots, Ann. Math., (2) 82 (1965), 15-50. [17J A. H. Wallace, Modifications and co- bounding manifolds, Canad. J. Math., 12 (1960),503-528. [18J S. Smale, Generalized Poincare's conjec- ture in dimensions greater than four, Ann. Math., (2) 74 (1961), 391-406. [l9J H. Yamasuge, On the Poincare conjec- ture for M 5 , J. Math. Osaka City Univ., 12 (1961),1-17. [20J S. Smale, On the structure of manifolds, Amer. J. Math., 84 (1962),387-399. [21J I. Tamura, Classification des varietes differentiables, (n - 1 )-connexes, sans torsion, de dimension 2n + 1, Sem. H. Cartan 16-19, 1962-1963, Inst. H. Poincare, Univ. Paris, 1964. [22J C. T. C. Wall, Classification problems in differential topology, I, II, Topology, 2 (1963), 253-261,263-272. [23J R. Thom, Quelques proprietes globales des varietes differentiables, Comment. Math. Helv., 28 (1954),17-86. [24J J. W. Milnor, On the cobordism ring Q* and a complex analogue I, Amer. J. Math., 82 (1960),505-521. [25J c. T. c. Wall, Determination of the cobordism ring, Ann. Math., (2) 72 (1960), 292-311. 114 Ref. Differential Topology [26J J. W. Milnor, Some consequences of a theorem of Bott, Ann. Math., (2) 68 (1958), . 444-449. [27] M. A. Kervaire and J. W. Milnor, Groups of homotopy spheres I, Ann. Math., (2) 77 (1963),504-537. [28] J. Cerf, Sur les diffeomorphismes de la sphere de dimension trois (r 4 = 0), Lecture notes in math. 53, Springer, 1968. [29J W. Browder, Homotopy type of differen- tiable manifolds, Colloquium on Algebraic Topology, Aarhus, 1962,42-46. [30J S. P. Novikov, Diffeomorphisms of sim- ply connected manifolds (in Russian), Dok\. Akad. Nauk. SSSR, 143 (1962),1046-1049. [31J W. Browder, Surgery on simply con- nected manifolds, Erg. Math., 65 (1972). [32J R. C. Kirby and L. C. Siebenmann, Foun- dational essays on topological manifolds, smoothings, and triangulations, Ann. Math. Studies 88, Princeton Univ. Press, 1977. [33J D. Sullivan, Triangulating and smooth- ing homotopy equivalences and homeomor- phisms, Geometric Topology Seminar Notes, Princeton Univ. Press, 1967. [34J C. T. C. WaIl, Surgery on compact mani- folds, Academic Press, 1970. [35J S. E. Cappell and J. L. Shaneson, The codimension two placement problem and homology equivalent manifolds, Ann. Math., (2) 99 (1974), 277-348. [36J Y. Matsumoto, Knot cobordism groups and surgery in codimension two, J. Fac. Sci. Univ. Tokyo, 20 (1973), 253-317. [37J M. H. Freedman, Surgery on codimen- sion 2 submanifolds, Mem. Amer. Math. Soc., vol. 12, no. 191,1977. [38J V. A. Rokhlin (Rohlin), New results in the theory of 4-dimensional manifolds (in Russian), Dok\. Akad. Nauk SSSR, 84 (1952), 221-224. [39J J. W. Milnor and M. A. Kervaire, Ber- noulli numbers, homotopy groups, and a theorem of Rohlin, Proc. Int. Congress Math., 1958, Cambridge Univ. Press, 1960,454-458. [40J L. C. Siebenmann, Disruption of low- dimensional handle body theory by Rohlin's theorem, Topology of Manifolds, Markham, 1970,57-76. [41J J. W, Milnor, On simply connected 4- manifolds, Symp. Int. de Top. Alg. (Mexico 1956), Mexico, 1958, 122-128. [42] C. T. C. Wall, On simply connected 4- manifolds, J. London Math. Soc., 39 (1964), 141-149. [43J S. Lopez de Medrano, Involutions on manifolds, Erg. Math., 59, 1971. [44J D. Sullivan, Infinitesimal computation in topology, Publ. Math. Inst. HES, 47 (1977), 269- 331. [45J R. C. Kirby, A calculus for framed links in S3, Inventiones Math" 45 (1978), 35-56. 
11SA Diffusion Processes [46J J. Munkres, Elementary differential top- ology, Ann. Math. Studies, Princeton Univ. Press, 1961. [47J S. Smale, A survey of some recent devel- opments in differential topology, Bull. Amer. Math. Soc., 69 (1963),131-145. [48J C. T. C. Wall, Topology of smooth mani- folds, J. London Math. Soc., 40 (1965),1-20. [49J J. W. Milnor, Lectures on the h-cobordism theorem, Princeton Univ. Press, 1965. [50J R. Stong, Notes on cobordism theory, Princeton Univ. Press, 1968. [51J J. W. Milnor, Topology from the differen- tiable viewpoint, Univ. Press of Virginia, 1969, [52J M. W. Hirsch, Differential topology, Springer, 1976. [53J R. Mandelbaum, Four-dimensional top- ology: An introduction, Bull. Amer. Math. Soc., (New Series) 2 (1980),1-159. [54J M, H. Freedman, The topology of four- dimensional manifolds, J. Differential Geome- try, 17 (1982), 357-453. [55J S. K. Donaldson, An application of gauge theory to four-dimensional topology, J. Dif- ferential Geometry, 18 (1983), 279-315. [56J F. Quinn, Ends of maps III: dimension 4 and 5, J. Differential Geometry, 17 (1982), 503-521. [57J K. Kuga, Representing homology classes of S2 x S2, Topology, 23 (1984),133-137. 115 (XVII.8) Diffusion Processes A. General Remarks Let (Q,!B, P) be a tprobability space. A tMar- kov process {X,}o<:;t<oo on a ttopological space S with tcontinuous time parameter t is called a diffusion process if the tsample function X,(w) is continuous in t with probability 1 until a random time ((OJ), called the tterminal time, After the terminal time, X,(w) stays at the terminal point a. Such a process is said to be I-dimensional or multidimensional according as S is an interval or a manifold (possibly with boundary) with dimension;", 2. Brownian motion is a typical diffusion process ( 45 Brownian Motion), To give conditions for a Markov process to be a diffusion process, let us assume that S is a tcomplete metric space with metric p. Let {X,} be a Markov process on S with time para- meter t ranging over a finite interval [t" t 2 J and satisfying supP(s,x,t,S-U,(x))=O(h) (h!O) (1) for every B >0, where U,(x) is the B-neighbor- 440 hood of x and sup is taken over all XES, and s, tE[t l ,t2J such that O<t-s<h. Then {X,} is a diffusion process if and only if 1 " P(p(X" X'+h»B)dt=o(h) '1 (h!O) for every B>O [5]. This includes the folJowing result due to E. B. Dynkin (1952) and J. R. Kinney (1953) as a special case: If in (1) we can replace O(h) with o(h), then {X,} will be a diffusion process. Specifically, when {X,} is a I-dimensional tstrong Markov process, the latter condition is also necessary for the pro- cess to be a diffusion process. Diffusion processes are intimately related to a certain class of tpartial differential equations. Let S be the real Jine. Assume that the ttran- sition probability P(s, x, t, E) (s < t) of the pro- cess {Xt}o<:;t<oo satisfies 1-P(s,x,s+h,(x-B,xH)=o(h) (h!O) (2a) for every B > 0 and that the following limits exist: I I X +' Jim - (y-x)2p(s,x,s+h,dy) hO+ h x-, =2a(s,x»0, I l x +, Jim - (y-x)P(s,x,s+h,dy)=b(s,x), hO+ h x-, (2b) (2c) 1 lim -(P(s,x,s+h,S)-I)=c(s,x).s;O. hO+ h (2d) Assume further that the transition probability is tabsolutely continuous with respect to Le- besgue measure: P(s, x, t, dy) = p(s, x, t, y)dy. Then under some suitable additional con- ditions, p(s, x, t, y) satisfies ap a 2 p ap -= -a(s, x)----z-b(s, x)- a -c(s,x)p, as ax x p(t-O,x, t,y)=b(X - y), (3) and ap a 2 a - = ----z(a(t, y)p) --(b(t, y)p) + crt, y)p, at ay ay p(s,x, s+O,y)= b(y-x), (4) where b is the tDirac delta function. The co- efficient c vanishes if {X,} is tconservative. Equations (3) and (4) are called Kolmogorov's backward equation and forward equation, re- spectively. They are also called the Fokker- Planck partial differential equations. A. N. Kolmogorov [1] derived equations (3) and (4) in 1931, and W. Feller proved that (3) (or (4)) has a unique solution under certain regularity conditions on the coefficients a, b, and c, and that the solution p(s, x, t, y) is non- 
441 negative, has an integral with respect to y that does not exceed 1, and satisfies the tChapman- Kolmogorovequation f p(s, x, t, y)p(t, y, u, z)dy= p(s, x, u, z) for every 0 < s < t < u. Hence p(s, x, t, y) deter- mines a tMarkov process analytically, How- ever, rigorous proof establishing the sufficiency of condition (2) for the Markov process {X,} to be a diffusion process did not appear until the 1950s. In the ttemporally homogeneous case, p(s, x, s + h, y) does not depend on s and can be written as p(h,x,y). Then a(s, x), b(s,x), and c(s, x) are also independent of s, and p(t, x, y) satisfies ap a 2 p ap - = a(x) + b(x)- + c(x)p, at ax ax p(O +, x, y) = b(x - y). Feller made an intensive study of this case and completely solved the problem of existence and uniqueness of the solution of (5) assuming that p(t, x, y) is nonnegative and that its inte- gral with respect to y does not exceed 1 [7]. In particular, when t varies in [0, +00) and S is an interval [r" r 2 J, Feller used the HiIle- Y osida theory of tsemigroups of operators to determine the conditions that should be satis- fied by rt and r 2 in order that the differential equation (5) (with the initial condition and his additional assumptions) yield one and only one solution. Feller also introduced the notion of generalized differential operators, which expresses the differential operator in the right- hand side of (5) in the most general form [8]. The probabilistic meaning of his results was clarified by Dynkin, H. P. McKean, K. Ito, D. B. Ray, and others, and alii-dimensional diffusion processes with the tstrong Mar- kov property have now been completely determined. Since not much research on temporally nonhomogeneous diffusion processes has been done so far, we restrict our explanation to temporally homogeneous ones. Let \JJl = (X" W,PxlxES) be a Markov process, where S is a tstate space, W is the tpath space con- sisting of all paths w: [0, + 00 J --> S U {a} which are continuous in t for O.s; t < (w) (w(t) = a for t(w) while W(t)ES for O.s;t«(w)), and Px is a probability measure on W under the con- dition that the process starts from x at t = 0 ( 261 Markov Processes). We can actually identify W with the tbasic space Q and set X,(w)=w(t) for wEQ. Assume that \JJl has the tstrong Markov property, It folIows from tDynkin's formula for tinfinitesimal generators ( 261 Markov Processes) that the infini- 115B Diffusion Processes tesimal generator ffi of a diffusion process has the local property stating that if u and v be- long to the domain of ffi and coincide in a neighborhood of xo, then ffiu(x o ) = ffiv(x o )' B. I-Dimensional Diffusion Processes (5) Let S be a straight line, A point XES is called a right singular point if X,(w)x for alI tE [0, (w)) with Px-probability 1. A left singu- lar point is defined analogously, with  re- placed by .s;. A right and left singular point is calIed a trap, while a right singular point which is not left singular is called a right shunt (a left shunt is defined analogously). A point is called a regular point if it is neither right nor left singular. The set of all regular points is open. Let (rl' r 2 ) be a connected component of this open set. One of the most important results con- cerning this situation is the proof of the exis- tence of a strictly increasing function s(x) de- fined on (rl, r2) and two measures m and k on (rt, r 2 ) such that the infinitesimal generator ffi of 9Jl is represented as ffiu(x) _ u +(dx) - u(x)k(dx) m(dx) , (6) where u+(dx) is the measure du+(x) induced by the tright derivative u+(x) ofu(x) with respect to s(x) (i.e., u+(x)=\im8X+0{U(x+x)- u(x)}/{s(x+x)-s(x)}). Equation (6) gives a generalization of second-order tdifferential operator au" +bu' +cu (a>O,c.s;O) [12]. Here m is positive for nonempty open sets, both m and k are finite for compact sets in (rl, r2)' and s, m, and k are unique in the folIo wing sense: If there are two sets of values of Si' m i , and k i (i = 1,2), then S2(X)=CS I (x)+constant, m 2 (dx) = c- I mddx), and k 2 (dx)=c- 1 kddx) for some positive constant c. We calI s, m, and k, respec- tively, the canonical scale, canonical measure (or speed measure), and killing measure for \JJl. They determine the behavior of X,(w) belong- ing to \JJl inside the interval (rl, r2)' Conversely, given any such set of s, m, and k, we can find a I-dimensional diffusion process \JJl such that s, m, and k are, respectively, the canonical scale, canonical measure, and killing measure of\JJl. If X,(w) is non vanishing in (rl, r 2 ) with proba- bility 1, the killing measure k is identically zero, and the canonical scale s satisfies the equation ( s(x)-s(xd P 0" <0" ) - x X2 XI ( ) ( ) sX 2 -sx t for x I < X < x 2 , where O"y is the thitting time of the point y, The motion X,(w) belonging to the process WI and contained in (rl, r 2 ) can be constructed 
USB Diffusion Processes from the Brownian motion by means of a topological transformation of the state space (interval) based on s, a ttime change based on m, and a tkilling based on k. More precisely, we first transform the interval (r 1 , r 2 ) by x--> s(x) into the interval (s(r l + 0), s(r2 - 0)) so that the diffusion process on this new interval has a canonical scale coincident with x. The speed and killing measures are transformed accordingly. We can, therefore, assume that the canonical scale is x. Let us consider the case (r 1 , r2) = (-00, +00) for simplicity. Let t(t, x) be the tlocaJ time of Brownian motion at x ( 45 Brownian Motion). Next, we apply the ttime change to the Brownian motion by means of the tadditive functional f +oo <p(t) == _ 00 t(t, x)m(dx), and finally tkill the latter process by means of the tmuItiplicative functional ex(t) ==ex p ( - r+: t(<p -I (t), X)k(dX). Thus we obtain the process \JJl in (-00,00). In particular, if m(dx) = a(x) -I dx and k(dx) = Ic(x)ldx, we have <p(t) = t a(X,)-l dr, ex(t)=exp(t C(X\>-l(,»)dr} and ffiu = au" + cu. At a shunt the infinitesimal generator ffi has a form that is a generalization of the first- order differential operator bu' + cu (c.s; 0), with b > 0 or b < 0 according as it is a right shunt or a left shunt. At a trap we have ffiu(x) = -u(x)jEAO. When S is an interval with endpoints r l and r 2 and all interior points are regular, the left endpoint r l is classified into the following 4 types, according to the bd'avior of \JJl near r l : Take'an arbitrary fixed point rE(r l ,r2)' and set n(dx) = m(dx) + k(dx) and ex= r (s(r)-s(x))n(dx), J (r 1 ,r) fJ= r n«x,r))s(dx). J (r 1 ,r) Then r 1 is a regular boundary if ex < 00, fJ < 00; an entrance boundary if ex < 00, fJ = 00; an exit boundary if ex == 00, fJ < 00; and a natural boundary if ex == 00, fJ == 00. This classification is independent of the choice of r. A similar classification of r 2 can be established. X,(w) approaches rl in finite time with positive or null probability according as fJ is finite or infi- nite. If ex = 00, it never happens that X,(w) 442 starts from r l and reaches the interior of the interval Seven ifr l ES, whereas ifex< 00 we can construct (adjoining r l to S if necessary) a diffusion process that enters the interior from r 1 and whose motion in the interior coincides with that of X,(w). If r 1 is a regular boundary for 9)1 and r 1 E S, then there are various possibilities for the behavior of X,(w) at r l . They are expressed by the boundary conditions satisfied by the func- tions u belonging to the domain of the in- finitesimal generator ffi. The condition is in general of the form }'u(r l ) + bffiu(rd + J1u +(rd == 0, where )', b, and J1 are constants, }', b.s; 0, J1  0, and I bl + J1 > O. If}' = b = 0, then r 1 is said to be a reflecting barrier. If r 1 is regular for \JJl and does not belong to S, then X,(w) vanishes exactly as X,(w) reaches rl> and r l is called an absorbing barrier. This case corresponds to the boundary condition u(r l ) = O. Whatever the boundary condition may be, \JJl is constructed from the Brownian motion with reflecting barrier by topological transformation of the state space, time change, and kiIling. Here if }' # 0, then killing may occur at r I; if b # 0, the set of visiting times of r l has positive Lebesgue measure; and if J1 # 0, the trace of the motion may go beyond the point r l and reach the interior points of S [2,7-9]. If we weaken the assumption of continuity of paths and admit jumps from r 1 , the general boundary condition becomes }'u(rd + bffiu(r l ) + J1u+(rl) + r (u(x)-u(rl))v(dx) =0, J(Yt'Y2] where v is a measure with respect to which min(1,s(x)-s(r 1 +0)) is integrable. When S == (r l , r 2 ), the transition probability is absolutely continuous with respect to the canonical measure, the density p(t, x, y) has an teigenfunction expansion p(t,x,y)== f:oo eA'e(dA;x,y), and p(t, x, y) is positive, jointly continuous in 3 variables, and symmetric in x and y. A similar result is also known when S is half-open or closed [9]. If \JJl is trecurrent, i.e., PAO"y < +00) = 1 for every x and y in S, then there exists a unique (up to a multiplicative constant) tin variant measure for S that is finite for all closed inter- vals in the interior of S. If 9)1 is conservative and all the interior points of S are regular, then the canonical measure is an invariant measure, provided that the endpoints are either entrance, natural, or regular reflecting. 
443 C. Multidimensional Diffusion Processes Let the state space S be a domain or the closure of a domain in the n-dimensional Euclidean space Rn. Consider a temporally homogeneous diffusion process {X,}o.;t<oo on S. Under suitable regularity conditions the infinitesimal generator <fJ coincides, for suffi- ciently smooth functions u in its domain, with the foIlowing telliptic partial differential oper- ator A: n a 2 n a A= L aij(x)+ L bi(x)--.:;--;+c(x), i,j1 ax ox i=1 ox c.s;o; (7) where S has a boundary, u satisfies a boundary condition of the form  .. a 2 u(x) n1 . au(x) L, ex'}(x)+ L, [3'(x) i,j1 ax ax} i1 ax au(x) + y(x)u(x) + b(x)Au(x) + J1(X)=O, (8) where, for simplicity, we assume that S is the closed half-space defined by xno, (exij) is a symmetric nonnegative definite matrix, y.s; 0, b .s;0, J1  0, and a/an is the inward-directed tconormal derivative associated with a ij . This boundary condition was discovered by A. D. Venttsel' [13]. Conversely, given an operator A such as (7) and a boundary condition (8) (if S has a boundary), the existence and uniqueness of the corresponding diffusion process are known for several special cases. If S = Rn and A has continuous coefficients, there exists at least one diffusion process corresponding to A [14-16]. A probabilistic approach to constructing diffusion processes corresponding to the oper- ator A with c == 0 is given by Ita's method of tstochastic differential equations ( 406 Stochastic Differential Equations E). A some- what different approach was introduced by D. W. Stroock and S. R. S. Varadhan [20] under the name of martingale problems ( 261 Mar- kov Processes C). Let w n = C([O, 00)--> Rn) be the space of all continuous functions w: [0, oo)-->Rn endowed with compact uniform topology and !B(W n ) be the topological 0"- field. Given XE Rn, a solution to the martingale problem for the operator A with c = 0 start- ing from x is a probability measure Px on (W n , !B(W n )) satisfying Px(w(O) = x) == 1 such that f(w(t))- HAf(w(s))ds is a Px-martingale for all f E C(Rn), where C(Rn) denotes the set of all COO-functions on Rn having compact support. If a = (a ij ) is uniformly positive de- finite, bounded, and continuous and if b == (b i ) is bounded and Borel measurable, the martin- gale problem for the operator A with c == 0 is well posed, i.e., for each xER n , there is exactly 1l5C Diffusion Processes one solution starting from x. (For further information related to the theory of martingale problems  [15,20].) Suppose next that S is the closure of a bounded domain with a sufficiently smooth boundary and A is given with sufficiently smooth coefficients. If the boundary condition is yu + J1au/an == 0, J1 # 0, then there exists a unique diffusion process on S corresponding to this situation. Moreover, if y = 0, then the process is said to have a reflecting barrier. S. Watanabe [22] gave a probabilistic condition which characterizes the reflecting diffusion processes in the normal direction among all reflecting diffusion processes in oblique direc- tions. Suppose that a general boundary con- dition (8) is given. We write the left-hand side of(8) as Lu. Write u=Hffor the solution of Au == 0 with boundary value f Under some natural additional conditions, T. Ueno proved that if LH tgenerates a Markov process on the boundary, then there exists a diffusion process for A with boundary condition (8) [21]. If {X,} has a reflecting barrier, the Mar- kov process on the boundary is, conversely, obtained from {X,} through time change by a nonnegative continuous tadditive functional which increases only when the value of X, is on the boundary. Stochastic differential equa- tions are also used in constructing diffusion processes with boundary condition (8) [19] ( 406 Stochastic Differential Equations). An elegant method for constructing diffusion processes with boundary condition (8) has been introduced by Watanabe [22]. It consists of piecing together excursions from the bound- ary to the boundary. In this construction, the stochastic integrals for tPoisson point processes play an important role. There are various other results on multidimensional diffusion processes with general boundary conditions which cannot be covered by the method mentioned above ( [23] and M. Motoo, Proc. Intern. Symp. Stochastic Dif- ferential Equations, Kyoto, 1976). When we have a diffusion process on S with infinitesimal generator of the form A, we can obtain a probabilistic expression for the solutions of various partial differential equa- tions involving A. Let 0" be the hitting time of the boundary of S, Then Hf(x) can be ex- pressed as EAf(Xa-o)). The solution of Au= - f with boundary value 0 is given by u(x) = EASgf(X,)dt), while the solution u(t,x) of au/at == Au with boundary value 0 and initial value u(O, x) = f is EAf(X,); t < 0"). The first case gives the solution of a tDirichlet problem; and in this case, the condition for a boundary point to be tregular relative to the Dirichlet problem can also be expressed probabilisti- caIly ( 45 Brownian Motion, 261 Markov 
115 D Diffusion Processes Processes). Furthermore, if f(X,) is replaced by f(X,)exp(Hk(X,)ds) in the expressions for u(x) and u(t,x), A is replaced by A +k (M. Kac [24J). When k.s; 0, this replacement gives rise to a killing of the process ( 261 Markov Processes E). By using the theory oftDirichlet forms we can investigate a general class of tsymmetric multidimensional diffusion processes which are not in the framework of the classical diffusion processes, i.e., diffusion processes whose in- finitesimal generators are not necessarily dif- ferential operators (M. Fukushima, Dirichlet Forms and Markov Processes, 1980)). D. Diffusion Processes on Manifolds Let M be a tconnected toriented to"-compact COO tmanifold of dimension n. Let AI be M or M U {a} (= the one-point tcompactification of M) according as M is tcompact or non- compact. Suppose that we are given a system ofCOO-tvector fields Ao,At> ...,A, on M. We consider the following stochastic differen- tial equation on M: , dX,= L Ak(X,)odwk(t)+Ao(X,)dt (9) k=1 ( 406 Stochastic Differential Equations), where w(t) = (WI (t), w 2 (t), ,." w'(t)) denotes an r-dimensional Brownian motion and the first term of the right-hand side is understood in the sense of the tStratonovich stochastic dif- ferential. Let X (t, x, w) be the solution of (9) with Xo = x E M defined on the r-dimensional Wiener space (W o , p W ) ( 406 Stochastic Differential Equations) and Px be the proba- bility law on W(M) of(X(t,x, w)),,, 0, where W(M) is the space of all continuous map- pings w: [0, 00)--> AI with a as a trap. Then {P x I x EM} defines a diffusion process \JJl on M which is generated by the second-order differential operator LI AU2 + Ao, i.e., a diffusion process \JJl with the infinitesimal generator ffi such that 1 ' (ljf(x)=- L AU(x) + Aof(x) for fEC!{'(M), 2k1 where CorM) denotes the space of all C oo _ functions on M with compact support. By appealing to the analytical theory of partial differential equations we can discuss regularity properties of the transition probability of WI ([15J; L. Hormander, Acta Math., 119 (1967)). Recently, P. Malliavin [25J also suggested a probabilistic method for proving elliptic regu- larity results (see also [19]). We now assume that all linear sums of Ao, AI , ... , A, are tcomplete. Then the terminal 444 time of \JJl is infinite a.s. Let W (M) be the space of all continuous mappings w: [0, 00)--> M endowed with compact uniform topology, and set Wx(M)=={wlwEW(M),w(O)==x}. We de- scribe the ttopological support .'I'(PxJ of the probability Px on WAM), i.e., the smallest closed subset of WAM) that carries the mea- sure Px. Let .'I' be the set of all piecewise con- stant mappings u: [0, oo)-->R'. For a given u = (u l (t), u 2 (t), ..., u'(t)) of .'1', we consider a system of ordinary differential equations d ' -<p(t)=Ao(<p(t))+ L Ak(<p(t))uk(t), (10) dt kl i.e., for every COO-function f on M with com- pact support, d ' dt f( <p(t)) = Ao.f(<p(t)) + k1 (Ad)( <p(t) )uk(t). Then for every UE.'I' and XE M, we obtain a curve <p==<p(x,u)=(<p,(x,u)) on M by solving (10) with <p(0) = x, Set .'I'x = {<p(x, u)luE.'I'}. Then we have .'I'(Px) = .'I'x for every xEM. Let !i' be the tLie algebra generated by AI' A 2 , ..., A" and set !i'(x) = {Vxl VEi1}. If dim !i'(x) = n for every xEM, then Y'(PxJ= WAM) for every XE M. For further information  Stroock and Varadhan (Proc. 6th Berkeley Symp. Math. Statist. Prob. III, 1972) and H. Kunita (Proc. Int. Symp. Stochastic Differential Equations, Kyoto, 1976). Let A be a smooth tnondegenerate second- order telliptic differential operator on M which is expressed in local coordinates as I .. a 2  _. a A =- L.. alJ(x)+ L.. b'(X)----r, 2i,j1 ax ax] i1 ax where (aij(x)) is symmetric and tstrictly posi- tive definite. Then there exist a tRiemannian metric 9 and a COO - tvector field b on M such that 1 A =2M+b, (11) where M is the tLaplace-Beltrami operator on the tRiemannian manifold (M,g). We now construct the diffusion process generated by the operator A, introducing a stochastic dif- ferential equation on the tbundle O( M) of the torthonormal frames. There exists an taffine connection V tcompatible with the Riemann- ian metric 9 such that for every Coo-functionf onM, 1 n ( I ) - L L(/on)(r)= -M+h (/)(n(r)), 2k1 2 rEO(M), 
445 where (L 1 , L 2 , ..., Ln) is the system of tcanon- ical horizontal vector fields (tbasic vector fields) on O(M) corresponding to the affine connection V and n:O(M)-->M is the natural projection [19]. We now consider the follow- ing stochastic differential equation on O(M): n dr, == L Lk(r,) 0 dwk(t). k=1 Let r(t, r, w) be the solution of (12) with ro == rEO(M) defined on the n-dimensional Wiener space (WO', p W ). Now a stochastic curve X (t, r, w) on M is defined by X (t, r, w) == n(r(t,r, w)). Set X(t,r, w)=8 for t?;(, where' is the texplosion time of r(t, r, w). Then the probability law of (X(t, r, w)),;" 0 on W(M) depends only on x == n(r), and it defines a diffu- sion process \JR on M which is generated by the operator A of (11), (For details  [19,26].) When A =M/2, i.e., b=O in (11), the diffusion process \JJl is called the tBrownian motion on the Riemannian manifold M ( 45 Brownian Motion). Next consider the case when (L"L2' ...,Ln) in (12) is a system of canonical horizontal vector fields corresponding to the tRiemann- ian connection, and let b be a Coovector field on M. Let b be the scalalization of b, i.e., b=(b1,hZ,... ,bn), bi(r)== L;I bj(x)j/, and U/) = (ej) -I for r =(x, el' e 2 , ..., en)E O(M), where b= L?I b i 8/8xi and ei= L;I e!8/8x j in local coordinates. Suppose that bi(r) is bounded on O(M) for i= 1,2,..., n. Then M(t)=exp[t r bi(r,)dwi(s) -i r (b i (r,))2dsJ is an texponential martingale. The diffusion process generated by the operator  M /2 + b is obtained from the Brownian motion on M constructed above by the ttransformation of drift, i.e., the transformation by the tmultiplica- tive functional M(t) ( 261 Markov Processes). (0. Maruyama (Nat. Sci. Rep. Ochanomizu Univ., 15 (1954)) studied this for I-dimensional processes, I. V. Girsanov [27J and M. Motoo (Ann. Inst. Statist. Math., 12 (19601961)) for multidimensional cases.) Consider a diffusion process \JJl = {P x I XE M} on M generated by the operator A of (11). Let W b be the tdifferentiall-form defined from the vector field b by Wb( V)x == gAb x , V x ) for every XEM and every Coo-vector field V on M. Then \JJl is symmetrizable if and only if W b is texact, i.e., if there exists a function F on M such that W b = dF. The tin variant measures 115D Diffusion Processes are then of the form const. x exp [2F(x)] m(dx), where m(dx) is the tRiemannian volume (E. Nelson, Duke Math. J., 25 (1958); Kolmogorov, Math. Ann., 113 (1937). The diffusion process \JJl is said to be locally symmetrizable if for every tsimply connected domain Dc M there exists a Borel measure vD(dx) on D such that (12) t T,vf(x)g(x)vD(dx)== t f (X)7;D g (X)V D (dX) for all bounded continuous functions f and g, where T,D f(x) = r, f(w(t))I{wl'D(w»t) PAdw), JW(M) rD(w) == inf{ t I w(t) i D}. Then \JJl is locally symmetrizable if and only if W b is tc1osed, i.e., dwb=O. R. Z. Khas'minskiI [28] proved a pair of useful tests for explo- sions of diffusion processes on M == Rn, similar to Feller's test for n = 1 mentioned in Section B ( [18J). In case of a general manifold we can investigate the possibility of explosions for 9Jl by appealing to the comparison theorems for tcurvatures in the theory of differential geome- try [19,29]; S. T. Yau, J. Math. Pures Appl., 57 (1978». The diffusion process \JJl on M is said to be trecurrent if Px[X,E U for some t>OJ = 1 for any open subset U of M; otherwise it is called ttransient. It is well known that an n- dimensional Brownian motion on Rn is recur- rent if n.s; 2 and transient if n?; 3 [9]. There also are some results for the criterion which determines whether \JJl is recurrent or transient ([19,28]; A. Friedman, Stochastic Differential Equations and Applications I, II, 1975, and K. Ichihara, Pub I. Res. Inst. Math. Sci., 14 (1978)), Explicit formulas for the transition probability of the Brownian motion on a hyperbolic space are given in [29]. For information related to the asymptotic behavior of diffusion processes as t10 we refer the reader to Varadhan (Comm. Pure Appl, Math., 20 (1967)) and S. A. Mol- chanov (Russian Math. Surveys, 30 (1975)). Diffusion approximations for suitably nor- malized random sequences have been studied extensively beginning with the work of A. J. Khinchin [30]. The theory of tlimiting distri- butions of sums of tin dependent (or weakly dependent) random variables is among the best-known examples ( 250 Limit Theorems in Probability Theory). For information re- lated to the theory of diffusion approxima- tions, see Yu. V. Prokhorov (Theory of Prob. Appl., 1 (1956)), A. V. Skorokhod [16J, and Stroock, Varadhan, and G. Papanicolaou (Proc. 1976 Duke Turbulence Conference). Recently, many interesting examples of multi- dimensional diffusion processes have also been 
115 Ref. Diffusion Processes introduced to describe probabilistic models in physics, biology, etc. (e.g., R. L. Stratonovich, Topics in the Theory of Random Noise I, II, 1963; J. F. Crow and M. Kimura, An Introduc- tion to Population Genetics Theory, 1970; and K. Sato, Proc. Int. Symp. Stochastic Differential Equations, Kyoto, 1976). In general, if a diffusion process {X,} is given on a noncompact space S and X, has no limit points in S as tiC then some natural compactification should be induced by {X,}. The notion of a tMartin boundary for Markov processes is introduced in this connection. References [1] A. N. Kolmogorov, Uber die analytischen Methoden in der Wahrscheinlichkeitsrech- nung, Math. Ann., 104 (1931), 451-458. [2] E. B. Dynkin, Markov processes I, II, Springer, 1965. (Original in Russian, 1963.) [3] J. L. Doob, Stochastic processes, Wiley, 1953. [4] K, Ita, Lectures on stochastic processes, Tata Inst., 1960. [5] L. V. Seregin, Continuity conditions for stochastic processes, Theory of Prob. Appl., 6 (1961),1-26. (Original in Russian, 1961.) [6J W, Feller, Zur Theorie der Stochastischen Prozesse (Existenz- und Eindeutigkeits-siitz), Math. Ann., 113 (1936), 113-160. [7] W. Feller, The parabolic differential equa- tions and the associated semi-groups of trans- formations, Ann. Math., (2) 55 (1952),468-519. [8] W. Feller, On second order differential operators, Ann. Math., (2), 61 (1955),90-105. [9] K. Ita and H. P. McKean, Jr., Diffusion processes and their sample paths, Springer, 1965. [10] P. Levy, Processus stochastiques et mouvement brownien, Gauthier-Villars, 1948. [11] E. B. Dynkin, One-dimensional continu- ous strong Markov processes, Theory of Prob. Appl., 4 (1959),1-52. (Original in Russian, 1959.) [12J 1. S, Kats (Kac) and M. G. Krein, On the spectral functions of the string, Amer. Math. Soc. Transl., (2) 103 (1974). (Original in Rus- sian, supplement of the Russian translation of the book by F. V. Atkinson, Discrete and continuous boundary problems.) [13] A. D. Venttsel' (Ventcel'), On boundary conditions for multidimensional diffusion processes, Theory of Prob. Appl., 4 (1959), 164-177. (Original in Russian, 1959.) [14] N. V. Krylov, On the selection of a Mar- kov process from a system of processes and the construction of quasidiffusion processes, Math. USSR-Izv., 7 (1973), 691-709. (Original in Russian, 1973.) 446 [15] D. W. Stroock and S. R. S, Varadhan, Multidimensional diffusion processes, Springer, 1979. [16] A. V. Skorokhod, Studies in the theory of random processes, Addison- Wesley, 1965. (Original in Russian, 1961.) [17] K. Ita, On stochastic differential equa- tions, Mem. Amer. Math. Soc., 1951. [18] H. P. McKean, Jr., Stochastic integrals, Academic Press, 1969. [19] N. Ikeda and S. Watanabe, Stochastic differential equations and diffusion processes, North-Holland and Kodansha, 1981. [20] D. W. Stroock and S. R. S. Varadhan, Diffusion processes with continuous coeffi- cients I, II, Comm. Pure. Appl. Math., 22 (1969),345-400,479-530. [21] K. Sato and T. Ueno, Multidimensional diffusions and the Markov process on the boundary, J. Math. Kyoto Univ., 4 (1965), 529-605. [22] S. Watanabe, Poisson point process of Brownian excursions and its applications to diffusion processes, Amer. Math. Soc. Proc. Symp. Pure Math., 31 (1977), 153-164. [23] M. Motoo, Application of additive func- tionals to the boundary problem of Markov processes, Proc. 5th Berkeley Symp. Math. Stat. Prob. II, pt. 2, Univ. of California Press, 1967,75-110. [24] M. Kac, On some connections between probability theory and differential and integral equations, Proc. 2nd Berkeley Symp. Math. Stat. Prob., Univ. of California Press, 1951, 189-215. [25] P. Malliavin, Ck-hypoelliptic with degen- eracy, Stochastic Analysis, A. Friedman and M. Pinsky (eds.), Academic Press, 1978, 199- 214,327-340. [26] P. Malliavin, Formule de la moyenne, calcul de perturbations et theoremes d'annulation pour les formes harmoniques, 1. Functional Anal., 17 (1974),274-291. [27] 1. V. Girsanov, On transforming a certain class of stochastic processes by absolutely continuous substitution of measures, Theory of Prob, Appl., 5 (1960), 285-301. (Original in Russian, 1960.) [28] R. Z. Khas'minskii, Ergodic properties of recurrent diffusion processes and stabili- zation of the solution of the Cauchy problem for parabolic equations, Theory of Prob. Appl., (1960), 179-196. (Original in Russian, 1960.) [29] A. Debiard, B. Gaveau and E. Mazet, Theoremes de comparison en geometrie riemannienne, Publ. Res. Inst. Math. Sci., 12 (1976),391-425. [30] A, Ya, Khinchin, Asymptotische Gesetze der Wahrscheinlichkeitsrechnung, Springer, 1933. 
447 116 (XX.2) Dimensional Analysis The system of units for physical quantities is derived from a certain set of fundamental units. If the fundamental units are denoted by 0, <p, i/J, etc., any other unit ex (caIled a de- rived unit) can always be expressed in the form ex = c01<pmi/Jn ... (c, I, m, n, ". are constants), by definition or by physica11aws. The exponents I, m, n, ... are caIled the dimensions of ex, and the content of the previous statement is expressed as [exJ==[OI<pmi/Jn ...J, which is caIled the di- mensional formula. The usual practice is to take as fundamental units length, time, mass, temperature, and energy, which are denoted by L, T, M, 0, and H, respectively. Dimensional analysis investigates the relation between physical quantities by use of the n theorem and the law of similitude given below. A. The n Theorem If a relationship f(ex, [3, '" ) == 0 holds among n physical quantities ex, [3, ... independently of the choice of fundamental units, the equation f( ex, [3, ... ) = 0 can always be transformed into F(n h n 2 , ...)== 0, where the n i are n -m dimen- sionless quantities (m is the number of funda- mental units) of the form ni = ex a , [3b, ... . If we choose the ni so that n l =ex[3-'ly- Z 1... and n 2 , n3, etc. do not contain :1(, then f = 0 implies ex=fJ'lyZ1...<II(n2,n3'''')' which clearly shows the manner in which the quantity ex is related to other quantities [3, y,.... B. The Law of Similitude In general, if two physical systems of the same kind have the same values of the n i , then the physical states of the systems are similar. If we are given a family of mutuaIly similar sys- tems, it is sufficient to observe a particular one among them (a "model") in order to estimate physical values attached to anyone of the given systems. Consider, for example, the case of the drag D acting on geometricaIly similar bodies placed in the flow of a viscous incompressible fluid. If v is the velocity, I the representative length of the body, p the density of the fluid, and J1 the viscosity (which has the dimensional formula ML -IT-I), then the n theorem gives D/pv 2 f = f(pvl/J1). Hence the drag coefficient as given by the left-hand side can be obtained by the experiments performed on a geometri- caIly similar model. The dimensionless quan- tity R = vl/v (v == J1/ p) is caIled the Reynolds number. If the wave resistance due to gravity 117 A Dimension Theory as weIl as the effect of compressibility are taken into account, gravitational acceleration 9 and sound velocity a must be included, so that we have D/pv 2 f == f(vl/v. v 2 /lg, via, C l , C z ,"'), where C 1 , C 2 ,... are other dimension- less quantities depending on the physical properties of the fluid. Fr = V Z /Ig is ca1\ed the Froude number, and M = via the Mach number. Next, consider the case of heat transfer between a solid surface and a flowing fluid. Let the area of the solid surface be denoted by S, the heat transferred per unit time by Q (HT-!), the thermal conductivity of the fluid by k (HL -I T-lO- l ), the specific heat by C (HM-lO- l ), the two representative tempera- tures by To and Tl' and the representative length by I, where expressions in parentheses represent dimensional formulas. Then we have, as dimensionless quantities, the Nusselt num- ber Nu = Q/(kS(T I - To)/I), the Prandtl number Pr=v/K (K==k/pc), the Grashoffnumber Gr= d 3 g(TI - To)/v 2 To, and R, so that from the n theorem we have the relation Nu = f(R, Pr, Gr, C!, C 2 ,...). Furthermore, Pe= vl/K=Pr R is caIled the Peclet number. References [1] A. W. Porter, The method of dimensions, Methuen, third edition, 1946. [2] D. C. Ipsen, Units, dimensions, and dimen- sionless numbers, McGraw-HiIl, 1960, [3] H. L. Langhaar, Dimensional analysis and theory of models, Wiley, 1951. [4] H. E. Huntley, Dimensional analysis, Macdonald, 1952. [5] C. M. Focken, Dimensional methods and their applications, Arnold, 1953. [6] R. Kurth, Dimensional analysis and group theory in astrophysics, Pergamon, 1972. [7] L. I. Sedov, Similarity and dimensional methods in mechanics, Academic Press, 1959. (Original in Russian, fourth edition, 1957.) Also  414 Systems of Units. 117 (11.21) Dimension Theory A. Introduction Toward the end of the 19th century, G. Cantor discovered that there exists a one-to-one corre- spondence between the set of points on a line segment and the set of points on a square; and also, G. Peano discovered the existence of a tcontinuous mapping from the segment onto the square. Soon, the progress of the theory of 
117B Dimension Theory point-set topology led to the consideration of sets which are more complicated than familiar sets, such as polygons and polyhedra. Thus it became necessary to give a precise definition to dimension, a concept which had previously been used only vaguely. In 1913, L. E. J. Brouwer [9J gave a definition of dimension based on an idea of H. Poincare. In 1922, the foundations of dimension theory for separable metric spaces were established by K. Menger [IIJ and P. Uryson [10]. Subsequently, P. S. Aleksandrov and W. Hurewicz contributed much to the development of the theory. The foundations of dimension theory for general metric spaces were established independently by M. Katetov amd K. Morita. More general theory for tnormal spaces has also been inves- tigated; the same results as in metric spaces, however, do not always hold. B. Definition of Dimension Let X be a normal space. If any finite open tcovering of X has an open covering oft order .s; n + 1 as its refinement ( 425 Topological Spaces R) (i.e., iffor any open sets G i (i = 1, ... ,s) such that X == G I U ... U G" there exist open sets Hi (i = 1, ... , s) such that Hi c G i , X = HI U ... U H, and any n + 2 of the Hi have no point in common), then we write dim X .s;n. If dim X .s; n but dim X .s; n - 1 does not hold, then we define X to be n-dimensional and write dim X =n. We call dim X the cover- ing dimension, (or Lebesgue dimension) of X. The idea behind this definition is due to H. Lebesgue. There are other definitions of dimension that are given inductively. Let us define Ind X = - 1 if X is empty. If for any pair consisting of a closed set F and an open set G with Fe G in X there exists an open set V such that Fe V c G and Ind(V- V).s; n-l, then we define Ind X.s; n. Next, we define ind 0 == -1. For any point p of X and any neighborhood G of p, suppose that there exists an open neighborhood V of p such that V c G and ind(V - V) = n - 1. Then we define ind X .s; n. As before, we set Ind X = n (ind X = n) ifIndX.s; n (indX.s; n) but IndX.s; n - 1 (ind X .s; n - 1) does not hold. (The defi- nition of indX is due to Menger.) We call Ind X (ind X) the large inductive dimension of X (the small inductive dimension of X). If dim X .s; n does not hold for any n, then X is ca\1ed infinite-dimensional, written dim X = 00; we define IndX == 00 and indX = 00 similarly. These dimensions are invariant under thomeomorphisms, The set of irrational points in a Euclidean 448 space, the tCantor discontinuum, and tBaire zero-dimensional spaces are all O-dimensional. The set of rational points in a separable tHilbert space is I-dimensional. C. Dimension of Metric Spaces The following theorems hold for the dimension of metric spaces (M. Katetov, Czechoslovak Math. J., 2 (1952); K, Morita, Math. Ann., 128 (1954)). Let X and Y be metric spaces. The equality dim X = Ind X holds. If Y c X, then dim Y.s; dimX. If X is a union of a countable number of closed sets F i (i == 1,2, ...), then dim X = max( dim FJ (sum theorem for dimen- sion). The inequality dim(XU Y).s;dimX + dim Y + 1 holds. If dim X == n, then X is a union of n + 1 O-dimensional subsets (decom- position theorem for dimension). We have dim(X x Y).s; dim X + dim Y, where X # 0 (product theorem for dimension). Each of the following is a necessary and sufficient condition for dim X .s; n: (i) There exists a subspace A of a Baire zero- dimensional space B (r) and a continuous closed mapping f of A onto X such that f-I(X) consists of at most n + 1 points for each point x of X (K. Morita, Sci, Rep. Tokyo Kyoiku Daigaku, 5 (1955)); (ii) there exists a metric of X which gives the same topology on X such that for any positive number £, any point x of X, and any n+2 points Xi (i== I, ...,n+2) at a distance less than £ from the (£/2)- neighborhood of x, there are at least two points Xi and x j (i # j) with distance < £ (J. Nagata, Fund. Math., 45 (1958»). Hurewicz's problem asked whether the equality dimX = n +m (m>O) implies the existence of an m-dimensional space A and a mapping f of A onto X having property (i). It was solved affirmatively for separable metric spaces by J. H. Roberts and for general metric spaces by K. Nagami (Japan. J. Math., 30 (1960)). If X is the union of a countable number of closed tstrongly paracompact subspaces, in particular if X is separable, then IndX = indX [1,2]. However, it was shown by P. Roy (Bull. Amer. Math. Soc., 68 (1962)) that this equality does not hold in general. D. Euclidean Spaces and Dimension The n-dimensional tEuclidean space Rn is exactly n-dimensional in the sense mentioned above; thus this concept of dimension agrees with our intuition. The proof of dim R n  n comes from Lebesgue's theorem: If each mem- ber of a finite closed covering of an n-cube has 
449 sufficiently small diameter, then the order of the covering is not less than n + ]. (The proof of dim Rn.s; n is easy.) Let X be a subset of Rn and f a homeomorphism from X onto a sub- set f(X) of Rn. If x is an interior point of X, then f(x) is an interior point of f(X). Also, if an open set A of Rn is homeomorphic to a subset B of Rn, then B is open in Rn (Brouwer's theorem on the invariance of domain [8J). This theorem holds for any manifold but not for general separable metric spaces. By the theorem of invariance of domain it can be shown that Rm and Rn, m # n, are not homeo- morphic (theorem on invariance of dimension of Euclidean spaces). Any n-dimensional separable metric space is embedded in a Euclidean space R2n+l, or, more precisely, in the subset of R 2 n+l consisting of all points x of which at most n coordinates are rational (Menger-Nobeling embedding theorem, G. N6beling, Math. Ann., 104 (1930)). Thus, from the topological point of view, any finite- dimensional separable metric space can be identified with a subset of a Euclidean space. Moreover, it is known that any n-dimensional separable metric space is homeomorphic to a subset of some n-dimensional compact metric space. If F is a bounded closed subset of Rm, then dim F.s; n if and only if for any positive num- ber 1';, there exists a continuous mapping f from F into an n-dimensional polyhedron in Rm such that the distance between x and f(x) is less than I'; for each point x of F. E. Dimension of Normal Spaces Let X be a normal space. Then IndXdimX and IndXindX, but the equalities do not necessarily hold here. The following theorems were obtained by E. Cech, Aleksandrov, C. H. Dowker, E. Hemmingsen, and Morita [1]. If dim X .s; n, then any locally finite open cover- ing of X has an open covering of order .s; n + 1 as its refinement; if A is an t Fa subset of X or A is strongly paracompact, then dim A.s; dim X; if X has a to"-locally finite closed cover- ing {F,}, then dim X ==max(dim F,). In order that dim X .s; n, it is necessary and sufficient that any continuous mapping from a closed subset of X into an n-sphere sn can be extended continuously to X. If X and Yare tparacompact and X is t10cally compact, or if X x Y is strongly paracompact, then dim(X x Y).s;dimX +dim Y, where X #0; if X is a tCW complex, then the equality holds [14]. M. L. Wage (Proc. Nat. Acad. Sci. US, 75 (1978)) proved under the continuum hypo- thesis (CH) that dim(X x Y).s;dimX +dim Y II7H Dimension Theory does not hold in general even if X x Y is local- ly compact and normal, and dim X = dim Y = 0; T. Przymusinski (Proc. Amer. Math. Soc., 76 (1979)) noted that CH can be avoided by a modification of Wage's construction. Katetov (Casopis Pest. Mat. Fys., 75 (1950») proved that dim X is determined by the ring C*(X) of bounded real-valued continuous functions onX. F. Homological Dimension Aleksandrov contributed much to the develop- ment of dimension theory in introducing the concept of homological dimension (Math. Ann., 106 (1932)). The homological dimension of a compact Hausdorff space X with respect to an Abelian group G is the largest integer n such that the n-dimensional tCech homology group fin(X, A; G) is nonzero for some closed subset A of X. The cohomological dimension D(X; G) is defined similarly by using the tCech cohomology group fin(X, A; G). If dim X < 00, then dim X = D(X; Z) (Z is the additive group of integers). The cohomological dimension of X with respect to an arbitrary Abelian group is determined by the co homological dimension with respect to some specified groups, and the cohomological dimension of the product space X x Y is expressed in terms of those of X and Y (M. F. Bokshteln, [5J). A compact Hausdorff space X has the property that dim(X x Y)= dim X + dim Y for any compact Hausdorff space Y if and on]y if dim X = D(X; Q(p)), where Q(p) is the additive group of rationals mod 1 of the form m/ps for any prime number p (V. BoltyanskiI, [5J); this result holds also when X is paracompact (Y. Kodama, J. Math. Soc. Japan, 18 (1966)). G. Dimension and Measure Let X be a separable metric space. Then dim X .s; n if and only if X is homeomorphic to a subset of a Euc]idean space R 2 n+1 whose (n + I)-dimensional tHausdorff measure is 0 (E. Szpilrajn. Fund. Math., 28 (1937); also  [1, 2J). The infimum of ex  0 such that the Hausdorff measure A,(X) of dimension ex vanishes is called the Hausdorff dimension of X. H. Dimension Type (Frechet's Definition) In analogy to the theory of tcardinal numbers in set theory, M. Frechet (1909) defined the dimension type of topological spaces as fo]- 
1171 Dimension Theory lows. Two spaces X and Yare said to have the same dimension type if X is homeomorphic to a subset of Yand Y is homeomorphic to a subset of X. I. Infinite-Dimensional Spaces If X is a metric space with 0 < dim X < 00, then for each positive integer m with m.s; dim X, X contains a (closed) subset S with dim S = m. Tumarkin asked the following question: For an infinite-dimensional com- pact metric space X and for each positive integer m, does X contain a closed subset S with dim S = m? D. Henderson (Amer. J. Math., 89 (1967)) answered this question in the negative. Furthermore, J. Walsh (Bull. Amer. Math. Soc., 84 (1978)) constructed an infinite- dimensional compact metric space X such that if S is an arbitrary subset of X with dimS>O then dimS= 00. References [IJ K. Morita, Dimension theory (in Japa- nese), Iwanami, 1950. [2J W. Hurewicz and H. Wallman, Dimension theory, Princeton Univ. Press, 1941. [3J J. Nagata, Modern dimension theory, Noordhoff, second edition, 1965. [4J K. Nagami, Dimension theory, Academic Press, 1970. [5J P. S. Aleksandrov, The present status of the theory of dimension, Amer. Math. Soc. Transl., (2) 1 (1955), 1-26. (Original in Russian, 1951.) [6J L. E. J. Brouwer, Beweis der Invarianz der Dimensionenzahl, Math, Ann., 70 (1911),161- 165. [7J H. Lebesgue, Sur la non-applicabilite de deux domaines appartenant respectivement a des espaces a n et n + p dimensions, Math. Ann., 70(1911),166-168. [8J L. E. 1. Brouwer, Beweis der Invarianz des n-dimensionalen Gebiets, Math. Ann., 71 (1912),305-313. [9J L. E. J. Brouwer, Uber den naturlichen Dimensionsbegriff,1. Reine Angew. Math., 142 (1913), 146-152. [lOJ P. Uryson, Les multiplicites cantoriennes, C. R. Acad. Sci. Paris, 175 (1922),440-442. [IIJ K. Menger, Dimensionstheorie, Teubner, 1928. [12J A. Pears, Dimension theory for general spaces, Cambridge Univ. Press, 1975. [13J R. Engelking, Dimension theory, North- Holland, 1978. [14J K. Morita, Dimension of general topo- logical spaces, Surveys in General Topology, G. M. Reed (ed.), Academic Press, 1980. 450 118 (V.9) Diophantine Equations A. General Remarks A Diophantine equation is an talgebraic equa- tion whose coefficients lie in the ring Z of rational integers and whose solutions are sought in that ring. The name comes from Diophantus, an Alexandrian mathematician of the third century A.D., who proposed many Diophantine problems; but such equations have a very long history, extending back to ancient Egypt, Babylonia, and Greece, As early as the sixth century B.C., Pythagoras is said to have partially solved the equation X2 + y2=Z2 by x=2n+ 1, y=2n 2 +2n, z=y+ 1. A general solution is given by the Pythagorean numbers x=m 2 _n 2 , y= 2mn, z=m 2 +n 2 . tFermat's problem also concerns a Diophan- tine equation. Systematic studies of Diophantine equations over Z have been made for the linear equation L71 aixi==a (ai,aEZ) and for the quadratic equation ax 2 +bxy+ cy 2=k (a,b,c,kEZ) in two unknowns. The latter forms a principal topic of C. F. Gauss's Disquisitiones arithme- ticae and can be regarded as a starting point of modern algebraic number theory. The special quadratic equation t 2 - Du 2 = :+:: 4 (D E Z) is called Pell's equation. If D < 0, then Pell's equation has only a finite number of solutions. If D >0, then all solutions tn' Un of Pell's equa- tion are given by :+::«t l + uIJD)/2)" = (tn + u n JD)/2, provided that the pair t I- U 1 is a solution with the smallest t 1+ u I JD> 1 [15]. U sing continued fractions ( 83 Continued Fractions), we can determine t I' U I explicitly, A general quadratic Diophantine equation ax 2 + bxy + cy2 = k with two unknowns can be solved completely if we use solutions of Pell's equation; this is an application of the arithmetic of quadratic fields ( 347 Qua- dratic Fields) [1]. On quadratic Diophantine equations of several unknowns, there are deep studies by C. L. Siegel ( 348 Quadratic Forms). Diophantine problems consist of giving criteria for the existence of solutions of alge- braic equations in rings and fields and even- tually determining the number of such solu- tions. The fundamental ring of interest is Z and the fundamental field of interest is Q. One discovers rapidly, however, that to have all the technical maneuverability necessary for handling general problems, one must consider rings and fields of finite type over Z and Q. Furthermore, one is led to consider finite fields and local fields when one deals with a locali- zation of the problems under consideration. 
451 Techniques from various fields of mathematics, e.g., algebraic number theory, algebraic geom- etry, analysis, Diophantine approximation, etc., have been successfully applied to solve Diophantine problems. However, much re- mains unsolved today. Yu. V. Matiyasevich (1970) showed that Hilbert's tenth problem is unsolvable; there is no general method of telling whether a Diophantine equation has a solution. This theorem in a sense indicates the complexity of Diophantine problems. For many centuries, no other topic has engaged the attentions of so many mathematicians, both professional and amateur, or resulted in so many published papers. For these mis- ce\1aneous results  Dickson [1] and Mordell [2]. B. Equations over Finite Fields Let k be a finite field of characteristic p con- sisting of q ( = pi) elements. Chevalley's theo- rem: Let f be a tform of degree d in n vari- ables with coefficients in k such that d < n. Then f =0 has a nontrivial solution in k. A generalization is Warning's theorem: Let fl' '" ,J, be polynomials with coefficients in k in n variables of degrees d I' ..., d" respectively, and suppose that d = d l + ... + d, < n. Then the number N of common zeros of fl, '" ,J, satis- fies N:= 0 (mod p). Warning's second theorem asserts that if N > 0 then N  qn-d. Warning's theorem was also improved by J, Ax (1964) to the effect that N:=O (modl) for any integer b < n/d [3]. For equations over finite fields, counting the number of solutions is important. Let f(x, y) be an tabsolutely irreducible poly- nomial in x and y over k. Let N denote the number of zeros in k of f(x, y). A. Weil proved IN -ql.s;2g)q+ c(d), where 9 is the genus of the curve f(x, y) = 0 and c(d) is a constant depending on d. Weil's proof requires the use of deep results from algebraic geometry. This theorem is equivalent to the tRiemann hypothesis for algebraic curves over finite fields [4]. Later, using Stepanov's method, W, M. Schmidt and E. Bombieri (1973) independently gave new proofs which do not depend on algebraic geometry [3]. P. Deligne [5J proved a far-reaching generalization of Weil's theorem to tnonsingular absolutely irreducible equa- tions in n variables. He showed IN _ qn-II = O(q(n-I)/2). This is a part of the tWeil conjec- ture for zeta functions of algebraic varieties over finite fields ( Section E; 450 Zeta Func- tions Q). Schmidt obtained in an elementary manner a weaker estimate IN - qn-!I = O(qn-3/2) but without the assumption of non singularity ( Section F). 118C Diophantine Equations C. Equations over Local Fields A method of solving problems in number theory by use of embeddings of the ground field into its tcompletions is called a local method. Such methods have important conse- quences when applied to Diophantine equa- tions. Let f be a polynomial in n variables with rational integer coefficients. The con- gruence f = 0 (mod pk) is solvable for all k  1 if and only if f ==0 is solvable in tp-adic integers. This is an easy consequence of the com- pactness of the ring of p-adic integers. We can solve f = 0 in p-adic integers provided that we can solve an infinite sequence of con- gruences. It is generally difficult to tell when we may limit our consideration to only a finite number of these. In this respect, the following lemma is most useful. Hensel's lemma: Let f(x!, ". , x n ) be a polynomial whose coeffi- cients are p-adic integers. Let Y!' .., , Yn be p- adic integers such that for some i (1.s; i.s; n) and an integer (j:2 0, we have f(YI,"', Yo) = 0 (mod p2iH1) and af/ax;!y!,..., Yn) = 0 (mod pO), =1= 0 (mod pHI). Then there exist p-adic integers e l' ..., en, such that f(e p ..., en) = 0 and e i = Yi (mod pHI ) for i= 1, "', n. The case 15 =0 is often useful; it implies that a nonsingular solution mod p can be extended to a p-adic solution. Generalization to simultaneous equa- tions is also known [6]. Skolem's method is sometimes useful when we investigate certain types of equations over t\ocal fields. This method is based on some simple properties of local analytic manifolds over local fields [7]. If a quadratic form has zeros in each local field, then it has a rational zero (tMinkowski-Hasse theorem). When a theorem of this type holds, we say that the tHasse principle holds ( 348 Quadratic Forms). For forms of higher degree, the Hasse principle no longer holds even if the forms are absolutely irreducible and nonsin- gular. Counterexamples were first found for cubic (E. S. Selmer, 1951) and quintic (M. Fujiwara, 1972) forms. Asymptotic formulas in Waring's problem ( 4 Additive Number Theory E) can be regarded as an analytic form of the Hasse principle. As to the quantitative formulation of the Hasse principle, there are deep results of Siegel for quadratic forms and their generalization by T. Tamagawa, R. Brauer (1945) showed that forms in sufficiently many variables represent zero in all p-adic fields. Forms of odd degree represent zero in the field of rational numbers if the number of variables is sufficiently large compared with the degree (B. J. Birch, 1957). Let.f be a poly- nomial with p-adic integer coefficients and C m (m O) be the number of solutions to the con- gruence f=O (mod pm). The series <p(t)= L;::=ocmt m is called the Poincare series of f 
118 D Diophantine Equations J. Igusa (1975) proved, by using his theory of asymptotic expansions together with Hiro- naka's tresolution theorem, that <p(t) is a rational function of t [8]. D. Integral Solutions of Some Diophantine Equations In this section we are concerned with those equations for which some "theory" exists. For isolated results  [1,2]. (1) Binary Forms. Thue's theorem ( 1908): If f(x) = L7oavxV(a,EZ, n > 2) has distinct roots, then the number of rational integral solutions ofLoavx'yn-v=a (Zow#O) is finite. This theorem is a direct consequence of Thue's theorem on Diophantine approximation, which says that there are only a finite number of rational numbers p/q (p,qEZ,q>O) with IC(- p/ql < l/q(n/2)+1 for a given algebraic num- ber ex of degree n (n > 2) [9, p. 122]. K. F. Roth proved that (n/2) + 1 in this formula can be replaced by 2 + r. (r. is an arbitrary positive number independent of n) (Mathematika, 2 (1955), 1-20). Roth's theorem was generalized to some cases of number fields and function fields ( 182 Geometry of Numbers) and is applied to Diophantine equations [9, 1OJ. A. Baker (1968), using a completely different method, has given explicit upper bounds for the solutions of Thue's equations, thus enabl- ing one to compute effectively all the solu- tions. More precisely, if {in Thue's equation is irreducible over the rationals, then every integer solution (x, y) of the equation satisfies max(lxl,lyl) <exp«nH)oOn)5 +(loga)2n+2), where H is the theight of f The proof of this remarkable theorem is based on his deep result concerning the lower bound for the linear forms in the logarithm of algebraic numbers (Mathematica, 13 (1966); 14 (1967)). Baker's method has been applied to elliptic, hyperellip- tic, and other curves (Baker, H. Stark, J. Coates, V. G. Sprindzhuk, etc.). (2) Higher-Degree Forms. A natural generali- zation of binary forms is a norm form, Let K be an algebraic number field of degree t  3 and C( I, '" ,ex n be elements of K. Then the norm N(exIX I +... + C(nxn) = II:=1 (C(V)x l +... +exi)xn)' where ex(i) denotes a conjugate of ex, is a form of degree t with rational coefficients, It is easy to see that every form which has rational coeffi- cients and is irreducible over Q but which is a product of linear forms with algebraic coeffi- cients is a constant multiple of a norm form [7]. A module M in an algebraic number field K is called degenerate if M has a submodule N such that, for some exEK, exN is a full module 452 in some subfield K' of K, where K' is neither Q nor an imaginary quadratic field. The most basic in the theory of norm forms is (W. M.) Schmidt's theorem: Let ex I, .. . , C(n be linearly independent over Q and suppose that the module generated by C( I, ... , C(n is nonde- generate; then N(C(IX I +... +C(nxn)=c (CEQ) has only finitely many solutions in integers Xl, ..., x.. (Math. Ann., 191 (1971)). The proof IS based on his remarkable result on tsimulta- neous approximation which generalizes Roth's theorem ( 182 Geometry of Numbers G). There are investigations on special norm forms (T. Skolem, N. I. Fel'dman, K. Ramanathan, K. Gyory, M. Fujiwara, etc.). For general forms of higher degree, not much is yet known except for the additive forms ( 4 Additive Number Theory E). H. Davenport [11J proved that if ((x) is a cubic form with rational in- teger coefficients in n variables. then f'(x) == 0 has a nontrivial integral solution, provided that n  16. This theorem was proved by means of an exquisite application of the tcircle method together with some geometry of num- bers. A well-known conjecture is that n  10 instead of n  16. It is known that over local fields, any cubic form in 10 variables has a non- trivial zero, There are various results of this type for simultaneous additive, quadratic, and cubic forms (Davenport, D. Lewis, R. Cook, Schmidt, etc.) [17]. A satisfactory theory of forms of higher degree, like that of quadratic forms, is not yet known but is quite desirable. In this vein, Igusa has obtained some new results of considerable interest, e.g., a tPoisson summation formula for higher-degree forms, using his theory of asymptotic expansions [8]. (3) Algebraic Curves. The fundamental re- sult is Siegel's theorem (1929): Assume that the equations/;(X p ...,Xn)=O (1.s;i.s;m) determine an algebraic curve with a positive tgenus in an taffine space of dimension n. Then the number of rational integral solutions ofJ;(X I' '.. , X n ) = 0 (1.s; i.s; m) is finite. This theorem was generalized by S. Lang in the fol- lowing form: Let K be a finitely generated field over Q and I a subring of K that is finitely generated over Z. Furthermore, let C be a nonsingular projective algebraic curve with a positive genus defined over K, and let <p be a rational function on C defined over K. Then there are only a finite number of points P on C with <p(P)E I [10]. The proof of this theorem is based on a generalization of Roth's theorem in the above sense and on the weak Mordell-Weil theorem ( Section C). A. Robinson and P. Roquette gave another approach to Siegel's theorem from the standpoint of nonstandard arithmetic (J. Number Theory 7 (1975)). On the other hand, a necessary condition for the exis- 
453 tence of infinitely many solutions of f(X, Y) = o with rational integral coefficients was given by C. Runge (J. Reine Angew. Math., 100 (1887». (4) Elliptic Curves. An elIiptic curve E is an tAbelian variety of dimension 1, or what is the same, an irreducible nonsingular tprojective algebraic curve of tgenus 1 furnished with a point 0 as origin. The tRiemann-Roch theorem defines a group law on the set of tdivisor classes of E. ActualIy, if P, P' are points of E, then there exists a unique point P" such that (P)+(P')(P")+(O), where  means linear equivalence, i.e., the left-hand side minus the right-hand side is the divisor of a rational function on the curve. The group law on E is then defined by P + P' = P". If the characteristic # 2 or 3, using the Riemann- Roch theorem one finds that the curve E can be defined by a Weierstrass equation i = x 3 + ax + b with a, b in the ground field over which the curve is defined, Conversely, any homo- geneous nonsingular cubic equation has genus 1 and defines an elIiptic curve in the projective plane once the origin has been selected. If both the curve and the origin are defined over a field k, then the group law is also defined over k, and it becomes a I-dimensional Abelian variety defined over k. If the ground field k is the field of complex numbers, the group law is the same as that given by the taddition theorem of the tWeierstrass !O-function with invariants g2 = -4a and g3 == -4b through the parametrization x = !O(u), Y =1!O'(U). Much of the Diophantine theorems on eIliptic curves are generalized to Abelian varieties. Here we shaIl deal mainly with elliptic curves defined by Weierstrass equations over Q. Extension to algebraic number fields usualIy causes no trou- ble. For more general eIliptic Curves  [19]. The Lutz- Ma ttuck theorem ( Section E) obviously implies that the points of finite order in Ek (k==Qp) form a finite group, This torsion group is computable. In case a and b are in Z then any point of finite order in EQ has coordi- nates (x,y) in Z and, if y#O, i 14a 3 +27b 2 (Lutz-NagelI). The WC group (Weil-Chatelet group) of E over k is the birational class of principal homogeneous spaces over k. The extent of validity of the Hasse principle for elIiptic curves can be measured by the Tate- Shafarevich group, which is defined as the set of elements of the WC group that are every- where locally trivial. This group is conjectured to be a finite group. For other results and interesting conjectures  [12-14]. The num- ber of integral points on eIliptic curves is finite according to Siegel's theorem on algebraic curves. Explicit bounds for the sile of these points have been given for several types of llSE Diophantine Equations elliptic curves by using Baker's method. For example, if f(x, y) is an absolutely irreducible polynomial with coefficients in Z such that the curvef==O has genus 1, then max(ixl,iyl)< expexpexp«2H)m), where m== JOd 10 , d = degf, and H is the height of f (Baker and Coates, 1970. The method of proof was to reduce it to the Weierstrass equation case, which had been treated earlier by Baker, with a better bound.) By the tMordeIl-Weil theorem (Section E), AQ  Z' x finite torsion group. Here r is calIed the rank of E over Q, There is a rather doubt- ful conjecture to the effect that the rank r is bounded. The rank r is conjectured to be equal to the order of the zero of L(s, E) at s = 1 (Birch- Swinnerton-Dyer conjecture). Much numerical and theoretical evidence supports this famous conjecture [13]. E. Rational Points of Algebraic Varieties Let V be an tabstract algebraic variety defined over a field k, and let P be a point of V. Then P is caIled a rational point over k of V if the coordinates of the trepresentative P, contained in an taffine open set T<. of V are in k ( 16 Algebraic Varieties D). This definition is inde- pendent of the choice of the representative P,. In particular, if V is a tprojective variety, the point P given by the thomogeneous coordi- nates (x o , XI' '" ,x n ) is rational if and only if xJxpEk (O.s;i.s;n, xp#O). In the folIo wing we state main results concerning rational points of algebraic varieties, especialIy results con- cerning t Abelian varieties, restricting k to be either an talgebraic number field of finite degree, a tp-adic number field, or a tfinite field. Mordell-Weil Theorem. Let A be an Abelian variety of dimension n defined over an alge- braic number field k of finite degree, Then the group Ak of all k-rational points on A is finitely generated. This theorem was proved by L. J. Mordell (1922) for the case of n == 1 and by Weil (1928) for the general case [10]. The assertion that AdmAk is a finite group for any rational integer m is called the weak Mordell-Weil theorem; this theorem is basic in the proof of the MordelI- Weil theorem and is used in the proof of Siegel's theorem, too. A generalization of the MordeIl- Weil theorem is obtained when k is a field (of arbitrary charac- teristic) finitely generated over the tprime field [10]. If A is defined over a finite algebraic number field k, we have the following conjectures of Birch, Swinnerton-Dyer, and Tate on the rank of Ak' Let p be a prime ideal of k at which A has a good treduction, and denote by A" the reduced variety. Let n,l>,..., n,2nl be the eigen- 
118F Diophantine Equations values of the N(p)th power endomorphism of AI' with respect to an /-adic representation, where N(p) denotes the norm of p ( 3 Abelian Varieties E, N), and put Ll5, A) == II?:j (I - ni) N(prT j . The L-function of A defined by L(s, A) = II' Lp(s, A), where the product ranges over all good primes, is the principal part of the zeta function of A ( 450 Zeta Functions S). Birch and Swinnerton-Dyer conjectured that if k = Q and A is of dimension 1, then there exists a constant C # 0 such that L(s, A)  C(s -1)9 as s--> 1. Tate generalized this con- jecture to any A and k. Moreover, the Constant C, appropriately modified by factors corre- sponding to the bad primes and the infinite primes, is thought to be expressible in terms of certain arithmetic invariants of A [13]. Lutz-Mattuck Theorem. The group of rational points of an Abelian variety A of dimension n over a tp-adic number field k contains a sub- group of finite index isomorphic to the direct sum of n copies of the tring 0 of p-adic integers in k (E. Lutz, J. Reine A ngew. Math., 177 (1937); A. Mattuck, Ann. Math, 62 (1955)). MordeIl's Conjecture. In his 1922 paper, in which the above theorem on the set of rational fields on I-dimensional Abelian varieties (i.e., on elliptic curves) was established, Mordell stated the conjecture: Any algebraic curve of genus g"J;2 defined over Q has only a finite number of rational points. The same can be conjectured for such curves defined over any algebraic number field k of finite degree. This had remained as a conjecture until 1983. In 1961 I. R. Shafarevich conjectured: Let k be any algebraic number field of finite degree, S a finite set of finite prime spots of k, and 9 any natural number "J; 2. Then there are, up to k-isomorphism, only a finite number of non- singular algebraic curves of genus 9 defined over k having good reduction at every finite prime spot outside S. In 1973, A. N. Parshin showed that Mor- dell's conjecture followed from this conjec- ture, which was finally proved in 1983 by G. Failings [7]. Mordell's conjecture was thus settled in the affirmative. Analogs of these conjectures on algebraic function fields over finite fields are easier than the original ones and had been proved in the I 960s for Mordell's conjecture by Yu. I. Manin, H. Grauert, and M. Miwa, and for Shafarevich's conjecture by S. Alakelov, A. N. Parshin, and L. Szpiro. F. Ci-Fields Let F be a field, and let i"J; 0, d "J; 1 be in- tegers. Let r be a homogeneous polynomial of n variables of degree d with coefficients 454 in F. If the equation r = 0 has a solution (x I' ... , x n ) # (0, ... ,0) in F for any f with n > d i , then F is called a C;(d )-field. If F is a C;(d )-field for any d"J; 1, then F is called a C;-field. In order for F to be a Co-field, it is necessary and sufficient that F be an talgebraically closed field. A CI-field is sometimes called a quasi- algebraically closed field. There exists no non- commutative algebra over a Cj-field F. A finite field is C I (c. Chevalley (1936)). If Fo is algebraically closed, then F=Fo(X) (rational function field of one variable) is a CI-field (Tsen's theorem), A homogeneous polynomial r of n = d i variables of degree d with coeffi- cients in F such that f = 0 has no solution in F except (0, ,.. , 0) is called a normic form of order i in F. If a Ci-field Fo has at least one normic form of order i, then (i) Fo(X I' ... , Xd is a Ci+k-field; and (ii) an extension of Fo of finite degree is a Ci-field, A complete field F with respect to an texponential valuation is a Cj-field whenever its residue field Fo is alge- braically closed. The field F of power series of one variable over a finite field Fo is a C 2 -field (Lang). E. Artin conjectured that a p-adic field Qp is a C 2 -field. It was proved by H. Hasse (1923) that Qp is a C 2 (2)-field and by D. Lewis (1952) that Qp is a C 2 (3)-field. However, G. Terjanian (1966) [17] gave a counterexample to Artin's conjecture; that is, he gave a quartic form of 18 variables with coefficients in Q2 having only trivial zero in Q2' Ax and S. Kochen (1965) [18] proved that for any inte- ger d"J; 1 there exists an integer Po(d) such that Qp is a C 2 (d)-field for p > Po(d) ( 276 Model Theory E). References [1] L. Dickson, History of the theory of num- bers I-III, Chelsea, 1952. [2] L. J, Mordell, Diophantine equations, Academic Press, 1969. [3] W. M. Schmidt, Equations over finite fields, Lecture notes in math. 536, Springer, 1976. [4] A. Weil, Sur les courbes algebriques et les varietes qui s'en deduisent, Actualites Sci. Ind., 1041 (1948). [5] G. Faltings, Endlichkeitssatze fUr abelische Varietaten iiber Zahlkorpern, Inventiones Math., 73 (1983), 349-366. [6] M. Greenberg, Lectures on forms in many variables, Benjamin, 1969. [7] Z. I. Borevich and I. R. Shafarevich, Num- ber theory, Academic Press, 1966. (Original in Russian, 1964.) [8] J. Igusa, Lectures on forms of higher de- gree, Tata Inst., 1978, [9] W. J. LeVeque, Topics in number theory, Addison-Wesley, 1956. 
455 [10] S. Lang, Diophantine geometry, Inter- science, 1962. [11] H. Davenport, Analytic methods for Diophantine equations and Diophantine inequalities, Univ. of Michigan, 1962. [12] J. W. S. Cassels, Diophantine equations with special reference to elliptic curves, J, London Math. Soc., 41 (1966), 193-291. [13J H. p, F. Swinnerton-Dyer, The conjec- tures of Birch and Swinnerton- Dyer and of Tate, Proc, Con£. on Local Fields, Driebergen, Springer, 1967. [14J S. Lang, Elliptic curves: Diophantine analysis, Springer, 1978. [15J I. Niven and H. Zuckerman, An introduc- tion to the theory of numbers, Wiley, 1960. [16J J. Joly, Equations et varietes algebriques sur un corps fini, Enseignement Math., (2) 19 (1973),1-117. [17J D. Lewis, Diophantine equations: p-adic method, Studies in Math., vol. 6, Math. Assoc. Amer., 1969. [18J J. Ax and S. Kochen, Diophantine prob- lems over local fields I. II, Amer. J. Math., 87 (1965),605-630,631-648. [19J 1. Tate, The arithmetic of elliptic curves, Inventiones Math., 23 (1974), 170-206. 119 (XXI.20) Dirichlet, Peter Gustav Lejeune Peter Gustav Lejeune Dirichlet (February 13, 1805- May 5, 1859) was born of a French family in Diiren, Germany. From 1822 to 1827 he was in Paris, where he became a friend of 1.-B. tFourier. In 1827, he was appointed lecturer at the University of Breslau; in 1829, lecturer at the University of Berlin; and in 1839, professor at the University of Berlin. In 1855, he was invited to succeed C. F. tGauss at the University of Gottingen, where he spent his last four years as a professor. His works cover many aspects of mathemat- ics; however, those on number theory, analy- sis, and potential theory are the most famous. He greatly admired Gauss and is said to have kept Gauss's Disquisitiones arithmeticae at his side even when traveling. In number theory, he created the tDirichlet series and proved that a sequence in arithmetic progression contains infinitely many prime numbers, provided that the first term and the common difference are relatively prime. Also, using his "drawer principle," which states that if there are n + 1 objects in n drawers then at least one drawer contains at least 2 objects, he clarified the structure of tunit groups of talge- 120A Dirichlet Problem braic number fields. In tpotential theory he dealt with the tDirichlet problem concerning the existence of tharmonic functions. He also gave tDirichlet's condition for the convergence of trigonometric series. References [IJ p, G. L. Dirichlet's Werke I, II, Reimer, 1889, 1897 (Chelsea, 1969). [2J P. G. L. Dirichlet, Vorlesungen iiber Zah- lentheorie, with supplements by R, Dedekind, Vieweg, fourth edition, 1894 (Chelsea, 1968). [3J F. Klein, Vorlesungen iiber die Entwick- lung der Mathematik im 19. Jahrhundert I, Springer, 1926 (Chelsea, 1956). 120 (X.30) Dirichlet Problem A. The Classical Dirichlet Problem Let D be a bounded or unbounded tdomain in Rn (n  2) with compact boundary S. The class- ical Dirichlet problem is the problem of finding a tharmonic function in D that assumes the values of a prescribed continuous function on S. This problem is also called the tfirst boundary value problem ( 193 Harmonic Functions and Subharmonic Functions). In this article f always stands for a boundary function given on S. The problem is called an interior problem If D is bounded and an exterior problem if D is unbounded. In an exterior problem, it is further required that when an tin version with center at an exterior point Po of D is per- formed on D and a tKe1vin transformation is performed on the solution in D (when the solution exists), the function thus obtained on the inverted image of D be harmonic at Po (n3). When n=2, the solution in D, which is already regarded as a function on the in- verted image of D, is required to be harmonic at Po. Thus an interior problem can be trans- formed to an exterior problem, and vice versa. We now explain the history of the classical problem. Let D be a bounded domain with boundary Sin R 3 . G. Green (1828) asserted that if S is sufficiently smooth, u(P)= - f f(Q) 8G(P,Q) d(J(Q) 4n s 8n Q IS the solution for the Dirichlet problem, where f is prescribed on S, G(P, Q) is tGreen's func- tion with the pole at Q in D, n Q is the outward- drawn normal to S at Q, and d(J is the ts ur - (1 ) 
120B Dirichlet Problem face element on S. He took for granted the existence of Green's function from physical consideration of the problem. Thus his dis- cussion was not quite rigorous. This defect was corrected by A. M. Lyapunov (1898) under a certain condition on S. Denote by U m the New- tonian potential of a measure with density mO on S. Assume that a continuous function f on S and a positive constant a are given. In 1840, C. F. Gauss investigated the existence of a density mfO on S of total mass a which satisfies Ss(u mf - 2f)m f dO" = minmSs(u m - 2f)mdO", where the total mass of m is equal to a. He asserted also that u mf - f is equal to a constant b on S. If f=O, then u mo must be equal to a positive constant c on S, and hence u mf -bc-I u mo must be a solution of the ex- terior problem for the boundary function f on S. However, his discussion was incomplete because we cannot always ensure the existence of a density that gives a measure minimizing the integral. Moreover, even in the case where D is a balI, there exists a continuous function fO on S such that there is no Newtonian potential that is equal to f on S up to a Con- stant (M. Ohtsuka, 1961). Gauss (1840), W. Thomson (Lord Kelvin) (1847), and G. L. Dirichlet solved the Dirichlet problem by making use of the so-calIed Dirichlet principle, which is explained in detail in Section F. After K. Weierstrass (1870) pointed out that there is a case where no minimizing function exists, D. Hilbert (1899) gave a rigorous proof of the Dirichlet principle under a certain condition, Meanwhile, C. G. Neumann (1870) solved the Dirichlet problem rigorously for the first time, although he assumed that D is a con- vex domain with a smooth boundary. First, he considered the potential WI = (1 j2n)' Ssf(ar- I jan) dO" of a double layer in D; then he formed the potential W 2 =(lj2n)' Ssfl (ar- I jan) dO" of a double layer with the values fl of WI on S and defined W 3 , W 4 ,... similarly. The series WI - W 2 + W 3 - W 4 + ... plus a suitable Constant gives a solution to the exterior Dirichlet problem for the bound- ary function fIn 1887, H. Poincare also used (1) to solve the Dirichlet problem. He ob- tained Green's function with the pole at 0 in a bounded domain D in the folIowing man- ner: Let D' be the image of D by an inversion with center 0, So be a spherical surface sur- rounding the boundary aD' of D', and J1 be a uniform measure on So such that the potential of J1 is equal to 1 inside So. By tsweeping out J1 to rD', the solution in D' of the exterior prob- lem for the boundary function 1 is obtained. A tKelvin transformation of this solution yields the solution h in D of the interior prob- lem for the boundary function 1 jOP . Now I jOP -h(P) is Green's function in D. In 1899 456 Poincare used another method (without utiliz- ing (1)) to solve the Dirichlet problem [8]. He observed that it is sufficient to consider the case where f is equal to the restriction to S of a polynomial 9 and that 9 is expressed in D as the sum of the Newtonian potential of a signed measure r with density - gj(4n) and a func- tion that is harmonic in D and continuous on D U S. If it is possible to sweep out r to aD, then the solution is obtained, He showed that this is in fact the case if at every point P of S there exists a cone that is disjoint from D and has its vertex at P. This condition is called Poincare's condition. In 1900, I. Fredholm discussed the Dirichlet problem by reducing it to a problem of tintegral equations. A domain D is calIed a Dirichlet domain if the (classical) Dirichlet problem is always solvable in D. H. Lebesgue (1912) showed that a solution is obtained by the method of iterative averaging in every Dirichlet domain. B. The Dirichlet Problem in a General Domain It has been believed that the classical Dirichlet problem is always solvable in every domain until S. Zarem ba observed in 1909 that the problem is not always solvable for a punctured ball. In 1913, Lebesgue gave a decisive exam- ple in which the domain is homeomorphic to a balI and bounded by a surface sufficiently smooth except at one point. Thus the central interest shifted to finding a harmonic function in D that depends only on a continuous func- tion f given on aD and coincides with the classical solution when D is a Dirichlet do- main. Extend f to a continuous function in the whole space, and denote it by fo. Approxi- mate D by an increasing sequence {Dn} of Dirichlet domains, and denote by Un the solu- tion in Dn of the Dirichlet problem for the boundary function fo. N. Wiener proved in 1924 that Un co verges to a harmonic function that is independent of the choice of the exten- sion of f and {Dn}. The pro blem of deciding where on aD the general solution assumes the given boundary values is treated in Section D. O. D. Kellogg (1928) found a general method that includes Poincare's method of sweeping out, Schwarz's alternating method, and the result of Wiener. Both Poincare's method of sweeping out and Lebesgue's method of itera- tive averaging yield Wiener's general solution. C. Perron's Method We explain O. Perron's method (1923) by considering the improved method of M, Brelot [1]. For simplicity we assume that the domain 
457 D is bounded in R 3 , Let U be the family of subharmonic functions u bounded above and satisfying limsuppMu(P).s;f(M) for any boundary point M. Define !if(P) as supuu(P), where u runs through U, if this family is not empty; otherwise, set !if = -00, Call !if a hypofunction. Define Hf by -!i- f and call it a hyperfunction. If !if = Hf, the common func- tion is denoted by H f ; if Hf(P) < 00, then Hf is harmonic. This is ca\1ed a Perron-Brelot solu- tion (Perron-Wiener-Brelot solution or simply PWB solution). The method of defining Hf is called Perron's method (the Perron-Brelot method or the Perron-Wiener-Brelot method). Wiener showed in 1923 that the tDaniell- Stone integral can be regarded as a general solution if a tDanie\1-Stone integrable function f is given on the boundary of a Dirichlet domain; in 1925, he showed that the same is true for a general domain (not necessarily Dirichlet). He showed also that his solution coincides with the Perron-Brelot solution Hf if f is continuous. Unfortunately, however, from a wrong example he concluded that !if # Hf can hold even for a simple discontinuous f, and so he lost interest in Perron's method. Brelot (1939) corrected Wiener's erroneous conclusion and proved that the Danie\1 upper and lower integrals are equal to Hf and !if' respectively. To any continuous f there corre- sponds an Hf, and there exists a tRadon mea- sure J1p satisfying Hf(P) = S fdJ1p. This measure is ca\1ed a harmonic measure or harmonic measure function. Brelot showed that !if = Hf if and only if f is J1rintegrable for one (or every) p, In particular, If D is a Dirichlet domain and E is a closed set on the boundary aD, then the harmonic measure function J1p(E) takes the value 1 at an inner point (in the space aD) of E and vanishes on aD - E. We note that J1p is equal to the measure obtained by sweeping out the unit mass at P to aD. D. Regular Boundary Points If Hf(P)-->f(M) as p-->MEaD for any con- tinuous function f on aD, then M is called regular. The regularity of a point is a local property. A boundary point that is not regu- lar is ca\1ed irregular. The regularity of M is equivalent to the convergence of J1p as P-->M to the unit mass at M with respect to the tvague topology. There are many sufficient conditions and necessary conditions for a boundary point to be regular. The existence of a barrier is a qualitative condition that is necessary and sufficient for a boundary point to be regular. It was used by Poincare and so named and used effectively by Lebesgue. A barrier is a continuous superharmonic func- 120 F Dirichlet Problem tion in D that assumes the boundary value 0 at M and has a positive lower bound outside every ball with center at M. A positive super- harmonic function defined in the intersection of D and a neighborhood of M and taking the boundary value 0 can be used as a barrier. A necessary and sufficient condition for a bound- ary point M to be regular is the existence of a Green's function in D assuming the value 0 at M. This condition was given by G, Bouligand (1925), and it fo\1ows from the existence of a barrier. Another necessary and sufficient con- dition of a quantitative nature was obtained by Wiener. It is equivalent to the requirement that the complement of D is not tthin at M ( 338 Potential Theory G). Kellogg conjec- tured that the set of irregular boundary points is of capacity zero and verified this in R 2 (1928). The conjecture was proved first by G. C. Evans (1933) in R 3 , and different proofs were given by F. Vasilesco (1935) and O. Frostman (1935). The conjecture is also true in Rn for n4. E. The More General Dirichlet Problem So far, we have been concerned with Rn. More genera\1y, Brelot and G. Choquet [3J obtained the fo\1owing result in a Green space rff ( 193 Harmonic Functions and Subharmonic Func- tions): Consider a metric space that contains rff and in which rff is everywhere dense, and de- note by  the complement of rff with respect to the space. Let {F} be a family of tfilters on rff such that each F converges to a certain point of,1. Suppose that u.s;O whenever u is sub- harmonic and bounded above on rff and limsupu.s;O along every F. Assume the exis- tence of a barrier v in a neighborhood in rff of the limit point Q of every F; that is, v is to be positive superharmonic, to tend to 0 along F, and to have a positive lower bound outside every neighborhood of Q. Under these as- sumptions, we obtain the PWB solution on rff as in R 3 . There are various examples of,1 and F that satisfy these conditions. In particular, L. Nalm [6] investigated in detail the case where ,1 is the tMartin boundary. More genera\1y, it is possible to treat the Dirichlet problem axiomatically ( 193 Harmonic Functions and Subharmonic Functions). F. The Dirichlet Principle Let D be a bounded domain with a sufficiently smooth boundary in Rn and f be a piecewise Cl-function on D with finite Dirichlet integral IIfl1 2 = SDIgradfl2 dr, where de is the volume element. Suppose that f has a continuous boundary value <p on aD. The classical Dirich- 
120 Ref. Dirichlet Problem let principle asserts that the solution of the Dirichlet problem for <p has the smalIest Dir- ichlet integral among the functions that are piecewise of class C 1 in D and assume the boundary value q>. In a general domain, H", minimizes llu- fll among harmonic functions u in D. Brelot [2J discussed the principle for a family of competing functions that are defined in a domain in ,g and whose boundary values cannot be defined in the classical manner. References [IJ M. Brelot, Families de Perron et probleme de Dirichlet, Acta Sci. Math. Szeged., 9 (1939), 133-153. [2] M. Bre10t, Etude et extensions du principe de Dirichlet, Ann. Inst. Fourier, 5 (1955), 371- 419. [3] M. Brelot and G. Choquet, Espaces et Iignes de Green, Ann. Inst. Fourier, 3 (1952), 199-263. [4J R. Courant, Dirichlet's principle, con- formal mapping, and minimal surfaces, Inter- science, 1950. [5J O. D. Kellogg, Foundations of potential theory, Springer, 1929, [6J L. Nai'm, Sur Ie role de la frontiere de R. S. Martin dans la theorie du potentiel, Ann. Inst. Fourier, 7 (1957),183-281. [7J M. Ohtsuka, Dirichlet problem, extremal length and prime ends, Lecture notes, Wash- ington Univ., St. Louis, 1962-1963. [8J H. Poincare, Theorie du potentiel New- tonien, Le<;:ons professees i la Sorbonne, Gauthier-Villars, 1899, [9J L. L. Helms, Introduction to potential theory, Interscience, 1969. 121 (XI.3) Dirichlet Series A. Dirichlet Series For z ==x + iy, An>O, and A;i + 00, the series of the form 00 f(z) == L an exp( - Anz) n""l is called a Dirichlet series (more precisely, a Dirichlet series of the type {An}). If An = n, then (1) is a power series with respect to e- z , If An == logn, the series (1) becomes 00 L anln z , n=1 which is caIled an ordinary Dirichlet series. If an = 1, then (2) is the tRiemann zeta function. 458 Series of the form (2) were introduced by P. G. L. Dirichlet in 1839 and utilized in an investigation of the problems of tanalytic number theory. Later J. Jensen (1884) and E. Cohen (1894) extended the variable z to complex numbers, The Dirichlet series is not only a useful tool in analytic number theory, but is also investigated as a generalization of power series. The tLaplace transform is the generalization of the Dirichlet series to the integral, and similar formulas often hold for both cases. B. Convergence Regions If the series (1) converges at Z=Zo, then it converges in the half-plane Rez>Rez o ' There- fore there is a uniquely determined real num- ber S such that (1) converges in Rez>S and diverges in Rez<S. If(I) always converges (diverges), we put S = -00 (+00), We call S the abscissa of convergence (or abscissa of simple convergence). Similarly, there is a uniquely determined real number A such that (1) Con- verges absolutely in Re z > A and is not ab- solutely convergent in Rez<A. We call A the abscissa of absolute convergence. Furthermore, there is a uniquely determined real number U such that (1) converges uniformly in Rez> U' for every U' > U and does not converge uni- formly in Rez> U" for every U" < U. The number U is calIed the abscissa of uniform convergence. Among these abscissas we always have the relations -00 .s;S.s; U.s;A +00, logn A-S.s;limsup-. An The latter was proved by Cohen (1894), The numbers S, A, U are determined from an' An by means of the formulas s=limsuPlog l L an i . x-+oo x [X];'n<X (3) A=limsUPIOg ( L l anl ) , x-+oo x [X];'n<X (4) 1 U == lim sup -log Tx, X-+DJ X (1) Tx= sup I L aneXP(-iAny) l , (5) -ro<y<+oo [X];'n<X where [ J is the tGauss symbol. Formulas (3) and (4) were proved by T. Kojima (1914) and (5) by M. Kunieda (1916), In particular, when limnoo(logn)/An=O, we have loglanl S=U=A=limsup- n_oo An (2) (6) (0. Szisz, 1922; G. Valiron, 1924). 
459 The series (1) converges uniformly in the angular domain {zllarg(z-zo)1 < ex <n/2}, where the vertex Zo lies on the line Re Zo == S. Hence it represents a holomorphic function in the domain Re Z > S, but it is possible that there is no singularity on the line Rez=S. For example, if a n =( _1)n, then the series (2) has S=O, but the sum is an tentire function (2 1 - Z -1) ((z). Taking the analytic continuation f(z) of the series (I), the infimum R of p such that f(z) is holomorphic in Rez> pis caJled the abscissa of regularity. It is stiJl possible that there is no singularity on the line Re z = R. We always have R.s;S, and R is given by R= sup Jim (loglog+I<p(x+iy)1 +x), -oo<y<oo x--+-oo  anexp(-}'n z ) <p(z) = L.., , n=1 r(1 + An) where log+ a ==max(loga, 0) (C. Tanaka, 1951). The infimum B of p such that f(z) is bounded in Re z > p is calIed the abscissa of boundedness. We always have R.s;B.s;A. H. Bohr proved the following three theorems concerning these values: (1) If {An} are linearly independent over the ring of integers, then A = B (1911). (2) If (}'n+I-An)-1 ==O(expe A ,,) for every s>O, then U == B (1913). (3) If lim sup(logn)/An ==0, then S = U = A = B (1913). In the final case, the values are given by (6). C. Properties of Functions Given by Dirichlet Series The coefficients an in (1) are given in terms of the function f(z) by n 1 f C+iOO eWZ L a v =----; f(z)-dz, v=1 2m c-ioo Z where c>max(S,O), An<w<A n + l , and the integration contour does not pass through Pn}. If W == An' then the term an in the sum of the left-hand side of (8) is replaced by a n /2 (0. Perron, 1908). Furthermore, if S < x, then we have 1 f '0+1 a n = lim - f(x+iy)exp[An(x+iy)]dy (9) 1 oo T '0 (1. Hadamard, 1908; C. Tanaka, 1952). If x == Rez> S, then f(z) == o(lyl) (Iyl-->oo). In order to investigate its behavior more pre- cisely, Bohr introduced . loglf(x +iy)1 J1(x) =\!m sup IYI+oo loglyl in his thesis (1910) and called it the order over Re z = x. The function J1(x) is nonnegative, monotone decreasing, convex, and continuous with respect to x. Bohr later found that there is 121D Dirichlet Series (7) a kind of periodicity for the values of /(z) over Re z = x; this was the origin of the theory of talmost periodic functions. As for the zeros of the function f(z), the following theorems are known: If f(z) is not identically zero, it has only a finite number of zeros in x  S + s, e -Mx.s; y.s; e Mx for arbitrary positive numbers s, M (Perron, 1908). If we denote by N(T) the number of zeros in x>S +s, T <y< T +2i5logT, then limsuPToo N(T)/ (log T)2 .s;i5/s (E. Landau, 1927). There have been many investigations into the connection between the singularities of f(z) and the coefficients an' If the an are real and positive, the point z = S is always a singular- ity of f(z). Moreover, if S=O, ReanO, and Iimnoo(cos(argan))I/A,= 1, then z=o is a sin- gularity of {(z) (c. Biggeri, 1939), Furthermore, if An/n--> 00, lim infnoo(An+1 - An) > 0, then the line Re z = S is the tnatural boundary of f(z) (F. Carleson and Landau, 1921; A. Ostrowski, 1923). If S=O, liminfnoo(An+l -An)==q>O, then there always exist singularities on every inter- val on the imaginary axis with the length 2n/q (G. P6lya, 1923). S. Mandelbrojt (1954, 1963) gave some interesting results concerning the relations between the singularities of (1) and the Fourier transform of an entire function. If U = -00, the function f(z) is an entire function. Its torder (in the sense of entire func- tion) p is given by . log+ log+ M(x) p=hmsup , XrOO Ixl M(x)= sup If(x+iy)l. - 00 <y< 00 (8) There have been many investigations into the tJulia direction of f(z) and related topics by Mandelbrojt, Valiron, and Tanaka. D. Tauberian Theorems As in the case of power series, if the series L an converges to s, then f( + 0) = s (Abel's continuity theorem). The converse is not neces- sarily true. The converse theorems, with ad- ditional conditions on an and An' are called Tauberian theorems, as in the case of power series. Many theorems are known about this field. The most famous additional conditions are limnoo Anan/().n - An- d = 0 (Landau, 1926) and an=O«An-An-I)/An) (K. Ananda-Rau, 1928). F or the summation of Dirichlet series (es- pecially tRiesz's method of summation)  379 Series R. For Tauberian theorems (especially the Wiener-Ikehara-Landau theorem) of the ordinary Dirichlet series  123 Distribution of Prime Numbers B. 
121 E Dirichlet Series E. Series Related to Dirichlet Series A series of the form  n!a n L... , z#O, -1, -2,... n=1 z(z+ l)(z+2)..,(z+n) is called a factorial series with the coefficients {an}' It converges or diverges simultaneously with the ordinary Dirichlet series L anln Z except at z = 0 and negative integers, The series JI (Z-I)(Z-n) ...(z-n) a n = n ant 1) is called a binomial coefficient series. It con- verges or diverges simultaneously with the ordinary Dirichlet series L( _1)n anln z except at z == 0 and positive integers, References [1] V. Bernshtein, Lec,;ons sur les progn':s re- cents de la theorie des series de Dirichlet, Gauthier-Villars, 1933. [2] K. Chandrasekharan and S. Minakshi- sundaram, Typical means, Oxford Univ. Press, 1952. [3] G. H. Hardy and M. Riesz, The general theory of Dirichlet's series, Cambridge, 1915. [4] S. Mandelbrojt, Series lacunaires, Ac- tualites Sci. Ind., Hermann, 1936. [5] S. Mandelbrojt, Series adherentes, regu- larisation des suites, applications, Gauthier- Villars, 1952. [6J S. Mandelbrojt, Dirichlet series, Reidel, 1972. [7] G. Val iron, Theorie genera Ie des series de Dirichlet, Mem. Sci, Math., Gauthier- Villars, 1926. 122 (IV.14) Discontinuous Groups A. Definitions [1-4] Suppose that a group r acts continuously on a tHausdorff space X, that is, for every Y E rand XEX, an element yx of X is assigned in such a way that the mapping x-->yx is a homeo- morphism of X onto itself and that we have YI (Y2 X ) =(YI Y2)X, Ix = x, where 1 is the identity element of r. Two points x, yEX are said to be r -equivalent if there exists aYE r such that y = yx. (r-equivalence for subsets of X is defined similarly.) We consider the following conditions of discontinuity of r. (i) For every x E X and any 460 infinite seq uence {y;} consisting of distinct elements of r, the sequence {yiX} has no tclus- ter point in X. (ii) For every XE X, there exists a neighborhood U x such that yU x nux = 0 for all but finitely many YE r. (ii') If x, YEX are not r-equivalent, there exist neighborhoods U x , U y of x, y, respectively, such that yU x n U y = 0 for all Y E r. (iii) For any compact subset M of X, yM n M = 0 for all but finitely many YEr. It is easy to see that (ii) => (i), (ii) + (ii') => (iii); if, moreover, X is tlocall y compact, we also have (iii) => (ii), (ii'). When (i) holds, r is called a discontinuous transformation group of X, and when (ii) holds, r is called a properly discontinuous transformation group. In partic- ular, when X can be identified with a thomoge- neous space GIK of a locally compact group G by a compact subgroup K, the conditions (i), (ii), and (iii) for a subgroup r of G are all equivalent, and they are also equivalent to the condition that r is a tdiscrete subgroup of G. For a discontinuous group r acting on X, the tstabilizer r x= {y E q yx X x} of XE X is always a finite subgroup. When r x = {I} for all x EX, r is said to be free (or to act freely on X). If r x = nXEX r x == {I}, r is said to act teffectively on X. A point XEX is called a fixed point of r if r x # r x. In the following, we assume for simplicity that r acts effectively on X, unless otherwise specified. Since r-equivalence is clearly an tequiva- lence relation, we can decompose X into r- equivalence classes, or r-torbits. The space of all r-orbits, called the quotient space of X by r, is denoted by r\x. When r satisfies the conditions (ii) and (ii'), the space r\X becomes a tHausdorff space with respect to the topol- ogy of the quotient space. If, moreover, r is free, X is an (unramified) tcovering space of r\X with the tcovering transformation group r. (Conversely, a covering transformation group is always a free, properly discontinuous transformation group.) In general, X may be viewed as a covering space of r\X with rami- fications, and the ramifying points (in X) are nothing but the fixed points of r. B. Fundamental Regions A complete set of representatives F of r\X in X (that is, a subset F of X such that rF= X, yFn F== 0 for YE r, y # 1) is called a fun- damental region of r in X if it further satisfies suitable topological or geometrical require- ments. Here we assume that F, the closure of F, is the closure of its interior F i , (In this case, F or F i is sometimes called the funda- mental region of r instead of F itself.) Such a fundamental region exists if r satisfies the 
461 conditions (ii), (ii'), and the set of fixed points is tnowhere dense in X (R. Baer, F. W Levi, 1931). A fundamental region F is caIled normal if the set {y F} (y En is 10caIly finite, that is, if, for every x E X, there exists a neighborhood U x such that yF nux = 0 for all but a finite number of y E r. If X is tconnected and F is normal, then r is generated by the set of Y E r such that yF n F # 0. Thus it is useful to have a fundamental region in order to find a set of generators of r and a set of tfundamental relations for them When X has a r-invariant tBorel measure J1 and r is countable, then the measure J1(F) of F is independent of the choice of F. Hence it is legitimate to put J1(r\X) = J1(F); r is caIled a discontinuous group of the first kind (c. L. Siegel [2J) if r is a discontinu- ous transformation group which has a normal fundamental region F such that {y I y F n F == 0} is finite and J1(F) < 00. For instance, if X is 10caIly compact and F is compact (= r\x: compact), then r is of the first kind. When we are concerned only with the quali- tative properties of r, it is sometimes conve- nient to relax the conditions for a fundamental region, replacmg It by a fundamental (open) set a of r, that is, an (open) subset a of X such that ro = X and ')'0 no = 0 for all but a finite number of')'Er [5-9]. C. The Case of a Riemann Surface Let r be a discontinuous group of analytic automorphisms of a tRiemann surface X. In virtue of tuniformization theory, it is enough, in principle, 10 consider the case where X is tsimply connected. Thus we have the foIl owing three cases: (1) X =CU {oo} (tRiemann sphere). r is a finite group. Since r can also be considered as a group of motions of the sphere, it is either a cyclic, tdihedral, or tregular polyhedral group [10]. (2) X = C (complex plane). r is contained in the group of motions of the plane. The sub- group consisting of all paraIle1 translations contained in r is a tfree Abelian group of rank v.s; 2. If v = 0, then r is a finite cyclic group. When v> 0, r consists of the transformations of the following form: When v= I, Z-->£k z + mw (k,mEZ), When v=2, z-->£kz+mlwl +m 2 w 2 (k,m;EZ), where w, WI, w 2 are nonzero complex numbers with Im(w 2 /w 1 »0, and £==:il in general, except in special cases when v = 2 and W2/WI = 4 (resp. (3 or (6' where (I = exp(2ni/l)), in 122 C Discontinuous Groups which cases we may put B = (4 (resp. 3 or (6)' For the fundamental regions corresponding to these values of £, see Fig. I. In the cases v = I and 2, the tautomorphic functions with respect to rare essentiaIly given by exponential func- tions and eIliptic functions, respectively ( 134 Elliptic Functions). L ;;/ / / 0 1 (a) (b) (c) (d) Fig. 1 (a) v=2, £=1. (b) WI =1, w 2 =i, £=i. (c) WI = 1, W'=(3' 6=(3' (d) WI =1, W,=(3' £=(6' (3) X = {Izl < 1} (unit disk) [3, 10, 11]. By a tCayley transformation, the unit disk can be transformed to the upper half-plane f) = {z == x+ iyIY>O}. Any analytic automorphism of f) is given by a real t)inear fractional transforma- tion (Mobius transformation) z-->(az + b)(cz + dr l (a, b, c, d EO R, ad - bc == 1). The totality of real linear fractional transformations acts tran- sitively on f). Hence f) can be identified with the thomogeneous space G/ K of G = SL(2, R) by K = SO(2) (which is the stabilizer of the point j=1). Hence discontinuous groups r of analytic automorphisms of f> are obtained as discrete subgroups of G. ActuaIly, every element of G defines an analytic automor- phism of the whole Riemann sphere, which leaves the real axis R U {oo} invariant. For any z E C U { oo} and a seq uence {Yi} consisting of distinct elements of r, a cluster point of the sequence {YiZ} in C U {oo} is caIled a limit point of r. When only one or two limit points exist, r can easily be transformed to one of the groups given in (2). Otherwise, the set i! of all limit points of r is infinite, and either i! = R U { 00 } or i! is a tperfect, tnowhere dense subset of RU {oo}. When i! is infinite, r is caIled a Fuchsoid group. Since f) has a G-invariant tRiemannian metric ds 2 = y-2(dx 2 + dy2) (caIled the tPoin- care metric), by which f) becomes a hyperbolic plane (tnon-Euclidean plane with negative curvature), we can construct a fundamental region F of r which is a normal polygon bounded by geodesics, that is, the arcs of cir- 
122 D Discontinuous Groups cles orthogonal to the real axis, A set of gen- erators of r and the fundamental relations for them are easily obtained by observing the correspondence of the equivalent sides of the fundamental polygon. (Conversely, starting from a normal polygon satisfying a suitable condition, one Can Construct a discontinuous group r having F as a fundamental region. In this manner, we generally obtain a (nontrivial) continuous family of discrete subgroups of G.) A Fuchsoid group 1 is finitely generated if and only if the fundamental polygon F has a finite number of sides, and in that case 1 is called a Fuchsian group. (More generally, a finitely generated discontinuous group of linear fractional transformations acting on a domain in the complex plane is called a Kleinian group.) A Fuchsian group 1 is of the first kind if and onl y if i! = R U { 00 }; otherwise, it is of the second kind. It is also known that a discontinuous group 1 is a Fuchsian group of the first kind if and only if J1(r\f» < 00 [12]. For a real point XE R U {oo}, we also denote by 1 < the stabilizer of x (in 1). The point x is called a (parabolic) cusp of 1 if Ix is a free cyclic group generated by a tparabolic trans- formation (# :tl). Cusps of 1 are represented by vertices of the fundamental polygon on the real axis. On the other hand, if a fixed point z of 1 lies in f>, then the stabilizer r z is always a finite cyclic group generated by an telliptic transformation. Hence such a point z is also called an elliptic point of 1. For a Fuchsian group 1 of the first kind, let {z l' ... , z,} be a complete set of representati ves of the 1- equivalence classes of the elliptic points of 1 (which can also be chosen from among the vertices of the fundamental polygon), and let e i be the order of C i ; furthermore, let t be the number of the r-equivalence classes of para- bolic cusps of 1. Then the quotient space 1\f) can be compactified by adjoining t points at infinity, and the resulting space becomes a compact Riemann surface ffi r if we define an analytic structure on it in a suitable manner. The area J1(ffi r ) measured by the Poincare metric is given by the tGauss-Bonnet formula: f dxdy J1 (ffi r ) = -----z F y =2n l 2g2+t ( I- ) +t l , 1=1 e i where 9 is the tgenus of the Riemann sur- face ffi r . It is known that there exists a lower bound (=n/21) for J1(ffirJ [12]. Automorphic functions (or Fuchsian functions) with respect to a Fuchsian group 1, which are essentially the same thing as algebraic functions on the Riemann surface ffi r , have been objects of extensive study since Poincare (1882). 462 D. Modular Groups [1O,13J The group 1 = SL(2, Z) ={ G :)la,b,c,dEz,ad-bC= I} (or the corresponding group of linear frac- tional transformations) is called the (elliptic) modular group, The modular group 1 is a Fuchsian group of the first kind acting on f>, and its fundamental region together with the correspondence of the equivalent sides is shown in Fig. 2. Fig. 3 illustrates the trans- formations under 1 of the fundamental tri- angle, where 1 is regarded as acting on the unit disk. From Fig. 2 we obtain the gen- erators of 1 (mod{:t Iz}) (I2 =G > and the fundamental relations: a l =( :). a 2 =G -} a1=(a2 a l)3= -1 2 , There are two I-equivalence classes of elliptic points of 1, which are represented by '4 = i and '3, with [1;:{ :t 1 2 }] = 2, [r,,: {Hz} J = 3; and only one r -equivalence class of parabolic cusps, which coincides with Q U {oo}. The corresponding Riemann surface ffi r is analyti- cally equivalent to the Riemann sphere. Fig. 2 Fig. 3 For a positive integer N, the totality f'(N) of elements in 1 satisfying the condition G )=G ) (modN) forms a normal subgroup of r, called a principal congruence subgroup of level N. (For the case N = 2, see Fig 4.) In general, a subgroup l' of 1 contain- ing f'(N) for some N is called a congruence subgroup of r. (It is known that there actually exists a subgroup l' of 1 with a finite index, which is not a congruence subgroup.) For N  3, - 1 2 fj:T(N), so that f'(N) is effective. (For N== I, 2, we have f'(N)!) = { :tlz}.) If N  2, f'(N) has no elliptic point. The number t(N) of the equivalence classes of cusps of f'(N) and the genus g(N) of the corresponding Rie- 
463 mann surface ffi nN ) are given as follows: t(l)= 1, t(2)=3, t(N)=(1/2N) [r: r(N)J (N ;d), g(I)=g(2)=O, g(N)= 1 +(N -6)/24N)[r: r(N)J (N3), where [r: r(N)J = N 3 II pIN (1-1/p2). Auto- morphic functions with respect to r(N) are called tmodular functions of level N.  t/;. .. f (. t/ {:/ ::( r r:. / r V r . ( . 1/ .. t t:/ {/ f /;  . ( v/. r. -1 1 -2 o 1 2 Fig. 4 A fundamental region of [(2) that consists of six fundamental regions of [(1). E. The Case of Many Variables Up to the present time, discontinuous groups r and the corresponding automorphic func- tions have been studied only in the following cases: (2') X = en, r  z2n (the free Abelian group of rank 2n, consisting of parallel trans- lations) [14J ( 3 Abelian Varieties); (3') X is a bounded domain in cn and r is a discon- tinuous group of analytic automorphisms of X. (In this case, conditions (i), (ii), (iii) are equivalent.) In the case (3'), the group d(X) of all (com- plex) analytic automorphisms of X, endowed with its natural (tcompact-open) topology, becomes a tLie group, which has r as a discrete subgroup. When r\X is compact, it is known by the theory of automorphic functions (or by a theorem of Kodaira) that r\X becomes a tprojective variety, which is a tminimal model [3,14]. In particular, when X is a tsymmetric bounded domain, i.e., when X becomes a tsymmetric Riemannian space with respect to its tBergman metric, the connected component G of the identity element of G' = ,W'(X) (which incidentally coincides with that of the group I(X) of all tisometries of X) is a tsemisimple Lie group of noncom pact type (i.e., without compact simple factors), and X can be identified with the homogeneous space of G by a maximal compact subgroup K of G, The theory of discontinuous groups of this type, initiated by Siegel (especially in the case 122 F Discontinuous Groups where X = f>n = Sp(n, R)/ K, tSiegel's upper half- space; f' = Sp(n, Z), tSiegel's modular group of degree n), O. Blumenthal, H. Braun, and L.-K. Hua, and continued by those in the German school such as M. Koecher, H. Maass, and others, has undergone substantial develop- ment in recent years under the influence of the theory of algebraic groups [5,8, 14-17J ( 32 Automorphic Functions). On the other hand, for a symmetric Rieman- nian space X of negative curvature, the group of isometries G = I (X) is a tsemisimple Lie group of noncompact type with a finite num- ber of connected components and with a finite center, and X can be identified with the homo- geneous space of G by a maximal compact subgroup. Therefore the study of discontinu- ous groups of isometries of X can be reduced to that of discrete subgroups of a Lie group G of this type. A typical example is the case where X is the space of all real tpositive def- inite symmetric matrices of degree n with determinant 1; this space can be identified with the quotient space SL(n, R)/SO(n) (A ESL(n, R) acts on X by X3S-->tASA). The unimodular group r = SL(n, Z) is a discontinuous group of the first kind acting on this space X, and a method of constructing a fundamental region of r in X is provided by the Minkowski reduc- tion theory [6,7]. F. Discrete Subgroups of a SemisimpIe Lie Group Two subgroups r, r' of a group G are called commensurable if r n r' is of finite index in both rand 1'. For a real tlinear algebraic group G c GL(n, R) defined over Q, a subgroup r commensurable with G z = G n GL(n, Z) is called an arithmetic subgroup (in the original sense) of G (examples: SL(n, Z), Sp(n, Z)). An arithmetic subgroup r is always discrete, and when Gis semisimple, the quotient space r\G is of finite volume (J1(f'\ G) < 00) with respect to an invariant measure J1. Moreover, f'\G is compact if and only if Gis tQ-compact (or tQ- anisotropic), that is, if G Q or G z consists of only tsemisimple elements (A. Borel, Harish- Chandra, G. D. Mostow, and T. Tamagawa [6, 7J); the same results remain true ifG is tZariski connected and has no tcharacter defined over Q. The proofs of these facts (and the compactification of the quotient space r\X for the noncompact case) depend on a construction of fundamental open sets that generalizes the reduction theory of Minkowski and Siegel [5,8,15-18]. For a connected semisimple Lie group G of noncompact type and a discrete subgroup r with J1(r\ G) < 00, the following density 
122 G Discontinuous Groups theorem holds (Borel, Ann Math., (2) 72 (1960)): (i) For any linear representation p of G, the linear closure of p(r) coincides with that of p(G); (ii) if G is algebraic, r is tZariski dense in G. Furthermore, suppose that G is a direct product of simple groups G i and the center of G is finite; r is called irreducible if its projec- tion on any (proper) partial product of {GJ is not dicrete. For instance, if G is a tQ-simple algebraic group then r = G z is irreducible. In general, there exists a partition of the set of indices {i} such that r is commensurable with a direct product of irreduci ble discrete sub- groups of the partial products corresponding to this partition, and these irreducible compo- nents are unique up to commensurability. The method of constructing a discrete sub- group r of G = SL(2, R) in a geometric manner using the upper half-plane can be generalized to some extent to the construction of discrete subgroups of certain groups using hyperbolic spaces oflow dimensions (E. B Vinberg). Except for these few cases, today it is known that a discrete subgroup r of a semisimple Lie group G (of R-rank  2) with J1(r\ G) < 00 is arithmetic in a certain sense ( Section G). This implies there are only very few discrete subgroups for a semisimple Lie group of higher rank. Actually a number of facts sug- gesting this result were already known in the 1960s. First, the only subgroups of SL(n, Z) (n  3), Sp(n, Z) (n  2) with finite index are congruence subgroups (H. Bass, M. Lazard, and J.-P. Serre; this result has been generalized to the case of an arbitrary tChevalley group over an algebraic number field by C. C. Moore and H. Matsumoto). Second, G z = SL(n, Z), Sp(n, Z) are maximal in G [8]. Finally, it is known (rigidity theorem) that if a connected semisimple Lie group G with a finite center does not contain a simple factor which is locally isomorphic to SL(2, R), then any dis- crete subgroup r of G with compact quotient r\G has no nontrivial tdeformation (i.e., all deformations are obtained from inner auto- morphisms of G) (A. Selberg, E. Calabi and E. Vesentini, and A. Weil [19]). This last result amounts to the vanishing of the cohomology group Hl(r,X,ad), and in this connection an extensive study has been made by Y. Mat- sushima, S. Murakami, G. Shimura, and K. G. Raguanathan to determine the tBetti numbers of r\X, and more generally the cohomology groups of the type H(r, X, p) with an arbi- trary representation p of G. These cohomol- ogy groups are closely related to automor- phic forms with respect to r [8,20].) For a tnilpotent or tsolvable Lie group, a general method of constructing discrete subgroups is known; see, for example, M. Saito, Amer. J. Math., 83 (1961).) 464 G. Rigidity and Arithmeticity A discrete subgroup r of a Lie group G with J1(r\ G) < ex; is usually called a lattice of G. A lattice r of G is said to be uniform if r\ G is compact. By a theorem of Borel and Harish- Chandra [7J, an arithmetic subgroup of a real linear group (defined over Q) is a lattice. Using this result, Borel showed further, by a con- structive method, that any semisimple Lie group has a lattice, especially a uniform one (Borel, Topology, 2 (I 963)). For a long time there were no known ex- amples of nonarithmetic irreducible lattices in semisimple Lie groups other than those locally isomorphic to SLz(R). This naturally led to Selberg's conjecture that any irreducible (nonuniform) lattice in a semisimple Lie group G not locally isomorphic to SL 2 (R) is arith- metic (Selberg, International Colloquium on Function Theory, Bombay, 1960). The con- jecture seemed to be well-grounded by the rigidity theorem of Weil and Selberg [19]. However, in 1966-1967, V. S. Makarov and Vinberg constructed nonarithmetic non- uniform lattices in SO(n, I) (n == 3, 4,5) by a geometric method; the lattices are generated by treflections [21]. Thus rigidity and arith- meticity do not necessarily coincide, and the conjecture should be considered under stronger conditions. As for rigidity of uniform lattices, Mostow established in 1970 the following strong rigid- ity theorem [22]: If G, G' are semisimple Lie groups with trivial center and without com- pact factors, and are not locally isomorphic to SL 2 (R), and if r, r' are irreducible uni- form lattices, then any isomorphism 0: r--> r' extends to an analytic isomorphism 8: G--> G' (namely, 81 r = 8). The previous rigidity theorem is now implied by Mostow's. If X is a simply connected symmetric space, then a tlocally symmetric space M covered by X is expressed as a quotient of X by a fixed-point- free properly discontinuous group r in the group I(X) of total isometries (when the Lie algebra of I(X) does not have a compact fac- tor, this condition for r is equivalent to saying that It is a torsion-free discrete subgroup of the semisimple Lie group I(X)), and the funda- mental group n l (M) of M = r\X is isomorphic to r. The strong rigidity theorem implies that compact localIy symmetric spaces M 1 and M 2 (of higher dimensions) are isomorphic as Riemannian spaces if and only if n l (M 1 ) and n 1 (M 2 ) are isomorphic as abstract groups. On the other hand, in 1973, G. A Margulis proved the arithmeticity of irreducible non- uniform lattices in semisimple real linear groups of R-rank greater than I (Russian Math. Surveys, 29 (1974) (original in Russian, 
465 1974); Functional Anal. Appl. 9 (1975) (original in Russian, 1975). M. S. Raghunathan also proved independently the same fact under a slightly stronger condition. Their results, to- gether with the rigidity of nonuniform lattices in (higher-dimensional) semisimple Lie groups of R-rank 1 established by H Garland, Rag- hunathan, and G. Prasad (Inventiones Math., 21 (1973)), imply that the strong rigidity theo- rem holds similarly for nonuniform lattices. The results of Margulis and Raghunathan show that the Selberg conjecture in the orig- inal sense is affirmative for the case of R-rank greater than 1. But neither argument is appli- cable to uniform lattices, for they depend deeply on the fact (proved by D A. Kazdan and Margulis) that a nonuniform lattice contains a nonidentity unipotent element. Previously, in a lecture at the international congress of mathe- maticians at Moscow, 1966, I. I. Pyatetskii- Shapiro generalized the definition of arith- meticity and suggested that arithmeticity of lattices should be investigated without the distinction of whether they are uniform or nonuniform. His definition is equivalent to the following [9,24]: For a connected semisimple algebraic group G defined over R, a subgroup reG = G R is an arithmetic subgroup (of G) if there is an algebraic group H defined over Q and a surjective homomorphism <p: H--> AdG defined over R such that the Lie group (Ker<p)R is compact and that <p(Hz) and Adr are commensurable. The uniform lattice in G that is constructed by the method of Borel is arithmetic in this sense. In 1974, Margulis finally established the following arithmeticity theorem [23,24]: If the R-rank of G is not less than 2, an irreducible lattice r in G is an arithmetic subgroup of G (even if it is uni- form). In the same lecture, Pyatetskii-Shapiro also extended the Selberg conjecture to such "semisimple Lie groups" as those con taming p-adic Lie groups as factors. Margulis proved this Pyatetskii-Shapiro conjecture affirmatively by showing that an analog of the strong rigid- ity theorem holds for such groups. As for semisimple Lie groups of R-rank 1, besides the lattices constructed by Makarov and Vinberg there are only the few examples of nonarithmetic lattices in S U (2, 1) presented by Mostow (Proc. Nat. Acad. Sci. US, 75 (1978); Pacific J. Math., 86 (1980)). The prob- lem of arithmeticity still remains open for groups locally isomorphic to SO(n, 1) (n6), SU(n, 1) (n 3), Spin, 1), or F4' H. Geometric Discontinuous Groups [1,25J The study of discontinuous groups r acting on a Euclidean or projective space X as a trans- formation group of a given structure is a c1ass- 122 Ref. Discontinuous Groups ical problem. AII possibilities for such r have been enumerated in low-dimensional cases. For instance, there are 230 kinds of discontinu- ous groups of tEuclidean motions acting on 3-dimensional Euclidean space without fixed subspaces, which are classified into 32 tcrystal classes (A. Schonflies and E. S. Fedorov, 1891- 1892;  92 Crystallographic Groups). All discontinuous groups of a Euclidean space generated by treflections have also been enu- merated (H. S. M. Coxeter, 1934 [25. 26J). I. Kleinian Groups The last decade has seen considerable research on (finitely generated) Kleinian groups. This research is closely related to the theory of tquasiconformal mappings and the tmoduli of Riemann surfaces. Making use of Eichler cohomology and tpotentials, L. V. Ahlfors established his finite- ness theorem and L. Bers his area theorem, Bers and B. Maskit investigated the bound- aries of tTeichmuller spaces and discovered Kleinian groups with the property that the complement of the set i! of limit points is connected and simply connected. Numerous mathematicians have subse- q uently discussed the classification, defor- mation, and stability properties of the set i!, uniformization and deformation of Riemann surfaces with or without nodes, and other geometric properties, In their discussions, the theory of quasiconformal mappings has played an important role. The discontinuous groups of motions of hyperbolic 3-space have also been studied. References [IJ B. L. van der Waerden, Gruppen von linearen Transformationen, Erg. Math., Springer, 1935 (Chelsea, 1948). [2J C. L. Siegel, Discontinuous groups, Ann. Math., (2) 44 (1943), 674-689 (Gesammelte Abhandlungen, Springer 1966, vol. 2, 390- 405). [3J Sem. H. Cartan 6, Fonctions automorphes et espaces analytiques, Ecole Norm. Sup., 1953-1954. [4J G. Shimura, Introduction to the arithmetic theory of automorphic functions, Pub!. Math. Soc. Japan, 11 (1971). [5] Sem. H. Cartan 10, Fonctions auto- morphes, Ecole Norm. Sup., 1957-1958. [6J A. Weil, Discontinuous subgroups of class- ical groups, Lecture notes, Univ. of Chicago, 1958. [7J A, Borel and Harish-Chandra, Arithmetic subgroups of algebraic groups, Ann. Math., (2) 75 (1962),485-535, 
123 A Distribution of Prime Numbers [8] Algebraic groups and discontinuous sub- groups, Amer. Math. Soc. Proc. Symp. Pure Math., 1966, vol. 9 [9] M. S. Raghunathan, Discrete subgroups of Lie groups, Erg. Math., Springer, 1972. [IOJ R. Fricke and F. Klein, Vorlesungen iiber die Theorie der automorphen Funktionen I, Teubner, 1897, second edition, 1926. [IIJ H. Poincare, Theorie des groupes fuchs- iens, Acta Math., 1 (J 882), 1-62 (Oeuvres, Gauthier-Villars, 1916, vol. 2, 108-168). [12J C. L. Siegel, Some remarks on discontinu- ous groups, Ann Math., (2) 46 (1945), 708-718 (Gesammelte Abhandlungen, Springer, 1966, vol. 3,67-77). [13] F. Klein and R. Fricke, Vorlesungen iiber die Theorie der elliptischen Modulfunktionen I, Teubner, 1890. [14] C. L. Siegel, Analytic functions of several complex variables, Lecture notes, Institute for Advanced Study, Princeton, 1948-1949. [15J C. L. Siegel, Symplectic geometry, Amer. J. Math., 65 (1943),1-86 (Gesammelte Ab- handlungen, Springer, 1966, vol. 2, 274- 359; Academic Press, 1964). [16] I. Satake, On compactifications of the quotient spaces for arithmetically defined discontinuous groups, Ann. Math., (2) 72 (1960), 555-580. [17] W. L. Baily and A. Borel, Compactifi- cations of arithmetic quotients of bounded symmetric domains, Ann. Math., (2) 84 (1966), 442-528. [18] H. Minkowski, DiskontinuiUHsbereich fUr arithmetische Aquivalenz, J. Reine Angew. Math., 129 (1905),220-274 (Gesammelte Ab- handlungen, Teubner, 1911, vol. 2, 53-100; Chelsea, 1967). [19J A. Weil, On discrete subgroups of Lie groups, Ann. Math., (2) 72 (1960),369-384; II, ibid., 75 (1962), 578-602. [20] Y. Matsushima and S. Murakami, On vector bundle valued harmonic forms and automorphic forms on symmetric Riemannian manifolds, Ann. Math., (2) 78 (1963), 365-416. [21J E. B. Vinberg, Discrete groups generated by reflections in Lobacevskii spaces, Math. USSR-Sb., 1(1967),429-444. (Original in Russian, 1967.) [22J G. 0 Mostow, Strong rigidity oflocally symmetric spaces, Ann. Math. Studies 78, Princeton Univ. Press, 1973. [23] G. A. Margulis, Discrete groups of mo- tions of manifolds of nonpositive curvature, Amer. Math. Soc. Transl., (2) 109 (1977),33- 45. (Original in Russian, 1974.) [24J G. A. Margulis, Arithmetic irreducible lattices in semisimple groups of rank greater than 1 (in Russian), Mir, 1977. (Appencix to the Russian translation of [9].) [25] H. S, M. Coxeter and W. 0 J. Moser, 466 Generators and relations for discrete groups, Erg. Math., Springer, 1957. [26] N. Bourbaki, Elements de mathematique, Groupes et algebres de Lie, ch. 4, 5, 6, Ac- tuahtes Sci. Ind., 1377, Hermann, 1968. [27J L. Bers and I. Kra, A crash course on Kleinian groups, Lecture notes in math. 400, Springer, 1974. 123 (V.7) Distribution of Prime Numbers A. General Remarks Given a real number x, we denote by n(x) the number of primes not exceeding x. A. M. Legendre (1808) obtained empirically the for- mula n(x)=:x/(Iogx-B) for some constant B, and C F. Gauss (1849) obtained the formula I x du n(x) =: _ I ' 2 ogu assuming the a verage density of primes to be 1 flog x. The Bertrand conjecture, which asserts the existence of at least one prime between x and 2x, was proved by P. L. Chebyshev (1848), who introduced the functions 00 O(x)= I Ilogp m= 1 p x and ljJ(x)= I logp pm:S.;x =O(x)+O()+O()+ .... (In this section, p represents a prime number.) He thereby proved Ax + O() < O(x) < ljJ(x) «6/5)Ax + O(), where A=log(21/231/351/530-1/30). G. F. B. Riemann (1858) considered the function ((s) (where s = (J + it is a complex variable), ex- pressed by the tDirichlet series LI n-', which is convergent for 0" > I. He found relations between the zeros of ((s) ( 450 Zeta Func- tions) and n(x). F. Mertens (J 874) obtained the formulas logp I -=logx+O(I), p';;, P I  =IOgIOgX+B+O (  ) , p,;;xp logx ( 1 ) e-' ( ( 1 )) n 1-- =- 1+0 - p';;x p logx logx' where c is the Euler constant and B is some constant. 
467 B. Prime Number Theorem The prime number theorem n(x)logx x 1im  1, or n(x)- l ' xoo x ogx was proved almost simultaneously (1896) by J. Hadamard and C. 1. de La Vallee-Poussin. Without using the theory of tentire functions, E. Landau (1908) established the formula n(x)= Lix+ O(xe- cJkV ), where (f 1-0 f x ) du Lix= lim + _ o+o 0 1+0 logu is the tlogarithmic integral. It can be shown by integration by parts that x l!x Lix=-++... log x log x + (k-l)!x +o (  ) . logk x 10gk+1 x For example, by taking x= 10 7 , we get n(x)= 664,579, Lix,=:664,918, and x/log x '=: 620,417. If the Dirichlet series f(s) = L1 ann- s satisfies the condition Ln.;x an  cx, then its abscissa of convergence is 1, and we have lim'1+o(s-l)f(s)=c. The converse is known as the tTauberian theorem. If F(s) = L1 ann- s (an  0) converges absolutely for 0" > 1 and F(s) -c/(s-l) is analytic for O" 1, then we obtain L a cx (Wiener-Ikehara-Landau theorem, 1932), Specifically, if we put -('(s)l((s)= L:'1 A(n)n-', then the conditions of the theo- rem are satisfied, and we obtain Ln.; xA(n) = tj;(x)x. It is easily seen that the prime number theorem is equivalent to tj;(x)x or O(x)x. The tnumber-theoretic function A(n) (Man- goldt's function) introduced above satisfies Ldln A(d) = logn. It follows from the tMobius inversion formula that A(n)=LdlnJ1(d)log(n/d). Hence A(n) = logp (n = pm) and = 0 other- wise. Thus we obtain tj;(x)=Ln.;,A(n)= O(x), When f(x)= O(x) or tj;(x), it is easy to show that Wf(t)/t 2 )dt= logx+ 0(1) and jjm infx f(x)/x';; 1.;; jjm supxoo f(x)/x. How- ever, it is not easy to prove f(x) x. To do so introduce the number-theoretic function M'(n), which satisfies LdlnM(d)=log2n. As before, we have M(n)==LdlnJ1(d)log2(njd); hence { (21-1)IOgp M(n)= IOgPlOgq (n=pl, l 1), (n=plqm, l 1, m 1), ( otherwise). Thus we obtain Ln.;xM(n)=2xlogx+0(x). This leads to A. Selberg's well-known formula 123 B Distribution of Prime Numbers (1949): O(x)logx+ L O(xjp)logp=2xlogx+0(x), px which enabled him to prove O(x)  x Thus he obtained for the first time a proof of the prime number theorem that does not use com- plex analytic methods. The simple formulas L;;';'IJ1(n)/n=O and L n .;xJ1(n)=o(x), obtained by H. von Mangoldt (1897), were revealed by Landau to have a deep meaning concerning the prime number theorem. Let n,(x) denote the number of integers not exceeding x that can be expressed as the product of r distinct primes In generalizing the prime number theorem, Landau (1911) proved that 1 x(loglogxy-1 n,(x) (r_1)! logx Let us write 9(X)=L:'=1 e- nxn2 . Riemann proved that rG)n-'/2(S) =+ f ro 9(X)(xs/2-1 +X- l / 2 - s / 2 )dx s(s-I) I and obtained the well-known functional equa- tion for the zeta function ( 450 Zeta Func- tions B) n- s / 2 qs/2)(s) = n- l /2+s/2 q1/2-s/2)((1 -s). This enables us to extend (s) as a mer om or- phic function to the whole complex plane. Utilizing this extended (s) and the following result of O. Perron on Dirichlet series, we can estimate tj;(x). Let 0"0 (# 00) be the abscissa of convergence of F(s)= LI f(n)n-" and let a>O,a>/J o ,andx>O.1f 1 f a+iT x' lim ---: F(s)-ds Tcn 2m a-iT S exists, then the limit is equal to L.;J(n), where L' means that in the summation the last term f(x) is replaced by f(x)j2 if x is an in- teger. In many cases, F(s) has a pole at s = 1, and the principal part of the sum is obtained from the residue at s = 1, whereas the residual part is given by a certain contour integral. To estimate tj;(x), we use - ('(s)l((s) as F(s); hence the problem arises of determining the zeros of (s). Riemann conjectured that all the zeros of (s) in the strip 0';;0"';; 1 must be situated on the vertical line 0" = 1 /2. If this so-called tRiemann hypothesis ( 450 Zeta Func- tions) is true, then it follows that n(x) = Li x + O(fi log x). The ultimate validity of Riemann's hypothesis remams in doubt. Concerning this, the most recent major result is the following formula, obtained by I. M. Vinogradov (1958): n(x)= Li x+ 
123 C Distribution of Prime Numbers O(x exp( - clog 3 / 5 x/log logl/5x)), Without using Riemann's hypothesis, J. E. Littlewood (1918) proved that n(x)-Lix limsup >0, xoo (/Iogx)logloglogx n(x)-Lix liminf <0. xac (/logx)logloglogx If we denote by N(T) the number of zeros of (s) in the domain 0<0"< 1, O<t< T, then we have 1 1 +log2n N(T)=-TlogT- T +O(logT). 2n 2n Let No(T) denote the number of zeros of (s) on the interval 0" = 1/2, 0 < t < T. Selberg (1942) obtained the impressive result No(T»cTlogT. E. C. Titchmarsh (1936) showed that there exist 1041 zeros of (s) in the domain 0 < 0" < 1, 0< t < 1468 and that all lie on the line 0" = 1/2. Computers have provided further results that seem to justify the Riemann hypothesis. For example, it has been calculated that there are 75,000,000 zeros of (s) in the domain 0< 0"< 1, 0< t < 32,585,736.4 and that all are simple zeros and lie on the line 0" = 1/2 (R. P. Brent, Math. Comp., 33 (1979)). N. Levinson proved in 1974 by another method that at least one- third of the zeros of the Riemann zeta function are on the line 0" = 1/2. The minim um of the modulus of the imaginary part of the zeros with 0"= 1/2 is t= 14.13..,. C. Twin Primes Let Pn be the nth prime. We know from the prime number theorem that Pnnlogn; more precisely, Pn= nlogn+ nloglogn + O(n). A pair of primes differing only by 2 are called twin primes, It is still unknown whether there exist infinitely many twin primes. There exist in- finitely many n satisfying Pn+l - Pn < c logPn (P. Erdos, 1940). Suppose that W/2+it)=O(ltI C ). A. E. Ingham (1937) proved that Pn+! -Pn<P, 0=(1 +4c)/(2+4c)H, by using the following density theorem related to the zeros of (s): If we denote by N(ex, T) the number of zeros of (s) in the domain ;x0" 1, Ot.s; T(I/2.s;ex.s; 1), then there is a positive constant c such that N(ex, T) = O(T 2 (1+2c)(I-')log5 T). The LindelOfhypothe- sis asserts that the constant c can be made arbitrarily small. If the Riemann hypothesis holds, then the Lindelof hypothesis also holds. It is clear that we can substitute 1/6+1: (1:>0) for c, I: being arbitrarily small. W, Haneke 468 (1963) showed that c can be replaced by 6/37 + f., and conseq uently that 0  61/98 + 1:. D, R, Heath-Brown and H. Iwaniec (1979) proved that 0 11/20+1: by using the tsieve method and the zero density theorem of L-functions (Section E). R. A. Rankin (1935) proved that Pn+l - Pn > C logPnloglogPn log log loglogPn x (log log logPn)-2 holds for infinitely many n. If we denote by n 2 (x) the number of primes p.s; x such that P + 2 is also a prime, then it has been conJec- tured by Hardy and Littlewood (Acta Math., 44 (1922)) that I x du n2(X)C _ I 2 2 og U as x-->oo, where C=2 f1 { 1- } = 1.32032.,.. p>2 (p-1) The numerical evidence provided by the com- putation n2(10 9 )=3424506 (Brent, Math. Comp., 28 (1974)) tends to indicate the truth of this conjecture. At present, 76' 3 139 :i: 1 seems to be the largest known pair of twin prime numbers (H. C. Williams and C. R. Zarnke, Math. Comp., 26 (1972)). D. Prime Numbers in Arithmetic Progressions Let k be a positive integer, x(n) be a residue character modulo k ( 295 Number-Theoretic Functions D), and L(S,X)=L1 x(n)n- S (0"> 1) be the tDirichlet L-function. The function L(s, X) of s defined by this series can be ex- tended to an analytic function in the whole complex plane in the same way as the Rie- mann zeta function In particular, when X is the principal character, then L(s, X), thus ex- tended, is a meromorphic function whose only pole is situated at s = 1 and is simple; otherwise the function L(s, X) is holomorphic on C. Using this function L(s, X) and in connection with his research concerning the tclass num- bers of quadratic forms, P G, L Dirichlet (1837) proved that there exist infinitely many primes in the arithmetical progression 1,1 + k, 1 + 2k, "', where 1 is the initial term and k a common difference relatively prime to I. This result is called the Dirichlet theorem (or prime number theorem for arithmetic progressions). Suppose that a runs over all integral ideals in a tquadratic number field K of dlscrimmant d, Then the tDedekind zeta function (K(S) of K is defined by L(NafS for 0"> 1. By virture of the decomposition law of prime ideals ( 347 Quadratic Fields C), we have (K(S)= (s)L(s, X), where x(n)=(d/n) is the tKronecker symbol. 
469 Utilizing (K(S), we obtain formulas concerning the tclass number h(d) of the field K. If d>O, then h(d)=(jd/2logl:)L(I,X), where I: is the tfundamental unit of K. On the other hand, if d<O, then h(d)=(wj=d/2n)L(I,x), where w denotes the number of the roots of unity contained in K. It folIows that L(1, X)? 2log«1 + J5)/2)/JfdI. Let X be a character modulo k induced by a primitive character X O . Since we have L(s, X) = L(s, X O )II p1k (1 - l(p)p-'), it can be shown that L(I,X)#O for a real character X. It is easy to prove that L(I, X) #0 for a complex character X' These statements then lead to the Dirichlet theorem. The proof was simplified by H. N. Shapiro (1951). Besides Landau's three proofs for L(1, X) # 0 for real character X (1908), there are elegant proofs by T. Estermann (1952), Selberg (1949), and others. For a character xmod k, we always have L(1,X)=O(logk), while L(1, xf l = O(logk) with one possible excep- tion, which may occur only if X is a real char- acter. Even in this case, we have L(I,X)-1 = O(k') (where £ > 0 is arbitrary, but 0 depends on 1:). This result was obtained by K. L. Siegel (1934) from his study concerning class num- bers of imaginary quadratic number fields. His proof was simplified by Estermann (1948) and S. D. Chowla (1950). The importance of the prime number theorem for arithmetic pro- gressions was revealed when it was applied to the Goldbach problem ( 4 Additive Number Theory C). Concerning this problem, the man- ner in which the remainder term depends on the modulus k became an object of investiga- tion. The Page-Siegel-Walfisz theorem is convenient to use: Denote by n(x; k, I) the number of primes not exceeding x and of the form ky+ I, where (k, 1)= 1. If x ?exp(k') (where 1:>0 is arbitrary), then we have _ Lix ( xe-C(,»)IOgX ) n(x, k,I)- +0 . <p(k) <p(k) Further research on the distribution of zeros of L(s, X) is necessary for the study of n(x; k, I) when x takes smaller values. If X is a nonprin- cipal real character, then L(s, X) may have at most one real zero fJI around 1; this is called Siegel's zero. Because of this fact, when x is small we are unable to obtain any formula to indicate the uniform distribution of primes. However, we have the following deep result, obtained by E. Fogels (1962). For a given positive 1:, there exist co(£) and c(l:) such that n(x; k, I) > c(l:)x/<p(k)k'logx, provided that X? k Co ('). On the other hand, Titchmarsh (1930), using the tsieve method, obtained n(x; k, I) =O(x/<p(k)logx) for x?k co . A theorem of this type is called the Burn- Titchmarsh theorem ( Section E). Fogels's theorem is based on the following theorem by Yu. V. Linnik (1947) and 123 E Distribution of Prime Numbers K. Prachar (1957), which is an extension of Page's theorem (1935): We let c5 be the function defined by c5 = 1 - fJI if L(s, X) has an excep- tional real zero fJl> and c5 =c l/log(k(ltl + 2)) otherwise (where C 1 is a suitably small num- ber). If we denote by fJ the real part of any zero ( # fJl) of L(s, X), then the theorem states that fJ<l- cl Io ( cle ) "" log(k(ltl+2» g c5log(k(ltl+2)) , provided that c5Iog(k(ltl+2)cl' Linnik called this result the Deuring-Heilbronn phe- nomenon. Using this theorem and the zero density theorem, Linnik (1944) proved that p(k,I)«kL, where p(k,l) is the least prime in the arithmetic progression I, 1 + k, 1 + 2k, ... and L is a constant. We call L Linnik's constant. M. Jutila (1977) and Chen Jing-Run (1979) proved that L  80 and L  17, respectively. Let s be positive, b j , Zj (1 J  s) be complex, and I, m be real numbers satisfying max Izjl? 1, I?s, and m?O. Under these conditions P. Turan (1953) obtained the following funda- mental theorem, which is called the power sum theorem: max Iblz'i+...+bsz;1 mr[+m ;3 ( 1 ) lminlbl+...+bJ 8e(l+m) l,;;j,;;, This theorem is effective in research on the distribution of zeros of zeta functions. Based on this new method, Turan (1961), S. Knapow- ski (1962), and Fogels (1965) reached the re- sults cited above. Another method of research, considered to be a new sieve method, on the distribution of primes was introduced by Selberg and A. I. Vinogradov. This method was followed by W. B. Jurkat and H. E. Richert (1965). Linnik, A. Renyi (1950), and E. Bombieri (1965) founded stiJl another method, called the large sieve method, by which P. T. Batemann, Chowla, and P Erdos studied the value of L(I, X). E. Sieve Method Let A be a set of integers, and P a set of primes. For each peP, let Op be a set of resi- dues modp, and w(p) the number of residues belonging to Op The sieve method is a device for estimating (from above or below) the number of integers n belonging to the set S(A, P, Op)= {nlneA, nmodpttOp for peP}. The combinatorial methods of the Brun, Bus- tab, and Richert sieves are interesting and efficient but quite complicated. Here, we shall briefly describe the Selberg sieve. As an ex- ample, denote by S(x; q, I) the number of n satisfying n=1 modq, n.s;x, (n,D)= 1, where q 
123 E Distribution of Prime Numbers is a prime not exceeding x, z";; x, D = II p <; z p, and (I, q) = 1; then S(x;q,l)= L L J1(d) n<;x dl(n,D) n=:lmodq ( ) 2 .s; L L Ad , n<;x dl(n,D) n::::::lmodq where ..1.1 = 1 and Ad is arbitrary for d> 1. Thus the problem reduces to the optimization of Ad' Proceeding in this manner, C. Hooley (1967) and Y. Motohashi (1975), using analytic methods, obtained certain deep results relating to the Brun-Titchmarsh theorem. Using the large sieve, H. L. Montgomery and R. C. Vaughan (1973) expressed this theorem in a precise form: n(x, q, I) < 2x/<p(q) log (x/q) for all q<x. Let n l , n 2 , ...,n z be Z natural numbers not exceeding N, and Z(p, a) the number of n/s such that nj=amodp. Renyi (1950) proved that p-I { Z } 2 L p L Z(p,a)-- .s;2NZ p<;.J N !l2 a=l p Set an = 1 for n = n j and an = 0 otherwise, and set S(ex) = L n <;Na n exp(2ninex); then P-I { Z } 2 P-l i ( a )1 2 p  Z(p,a)-- = L S - . a-I P a=1 P In view of this simple fact, Bombieri (1965) and P. X. Gallagher (1968) extended the problem and proved that, in general, q I ( a ) 1 2 L L L anexp 2ni-n q<;Q a=1 M<n<;M+N q (a,q)=1 .s;( N +2Q2) L la n l 2 . M<nM+N Similar results can be obtained for character sums LM<n<;M+NanX(n). In this connection Montgomery proved that S({n; M <n.s;M +N};P,Qp) ";;(N+2Q2) { L fl } -I, q<;Q plqP-w(P) qlP(z) where P(z) = fl p. pEP pZ Using these methods, the following estimate was obtained by Bombieri: L maxmax l n(y,q,l) qxI12(logx)-B yx (q,l= 1 1 I y du I -- - «x(logxr A , <p(q) 2 10 g u where A is arbitrary and B is a certain func- 470 tion of A (Vaughan). These methods, known collectively as the large sieve, were first di- rected toward proving the Renyi theorem stating that every sufficiently large even integer can be represented as the sum of a prime and an almost prime integer (M. B. Barban). After- ward, combining Richert's sieve with this large sieve, Chen ling-Run (1973) improved this result to a remarkable degree ( 4 Addi- tive Number Theory C). Several applications of Bombieri's theorem have been demon- strated by P. D. T. A. Elliot and H Halber- stam (1966): e.g., the estimation of the num- ber of representations of n as p + x 2 + y2 (Hooley, Linnik) and the estimation of Lp<;nd(n- p) (Linnik, B. M. Bredihin). Let N(ex, T,X) denote the number of zeros of L(s, X) in the rectangle ex.s; (J.s; 1, I tl.s; T. Com- bining the large sieve with new Fourier in- tegral techniques and the Turan tpower-sum method, Gallagher (1970) proved that there exists a positive constant c satisfying L N(ex, T,X)«(QT)'(H), x mod Q L L N(ex, T,X)«(QT)'(H). q<;Q x mod q Similar results were also obtained by G. Ha- lasz and Montgomery (1969) using another method, which was further exploited by lutila, M. H. Huxley, and Iwaniec. In particular, Heath-Brown and Iwaniec (1979) deduced that n(x+ y)-n(x)cy(logxrl if yxll/20+'. Combined with the Deuring-Heilbronn phenomenon, the zero density theorem above not only establishes the Linnik theory, but also yields the foIlowing result, due to K. A. Rodoskii, T. Tatuzawa, and A. I. Vinogradov: 1 I x du n(x;q,l)=- - <p(q) 2 log U XI (I) I x UPI ( 1 '- ) -E- -du+O -X I -8, <p(q) 2 log u <p(q) if x exp(logqloglogq), where E= 1 or 0 according as Siegel's zero fJ exists or not, and where  = Max(log q,(logx)2/5 (loglogx)I/5). Throughout these researches, estimates of the type xJLG+it,x)14 «<p(q)(1 tl + 2)log'{ q(ltl + 2)} and L I T I L ( +it,x )1 4 dt Xmodq -T 2 «<p(q)(T + 2)log' q(T + 2) 
471 are of great importance and have been studied by A, F. Lavrik (1968), Linnik (1961), Huxley (1972), and K. Ramachandra (1975). F. The Prime Ideal Theorem in Algebraic Number Fields In an algebraic number field of finite degree, the prime number theorem is replaced by the prime ideal theorem (T. Mitsui, 1956, Fogels, 1962), which is based on the theory of the Hecke tL-function (E. Hecke, 1917; Landau, 1918), Let K be a finite Galois extension over an algebraic number field k of finite degree. Suppose that p is a prime ideal of k and is not ramified in K. The tFrobenius automorphism of a prime divisor of p in K determines a conj ugate class C of the Galois group of K/ k. Let n(x; C) denote the number of prime ideals in k associated with the class C in the above sense and whose norm does not exceed x. Then we have h( C) - n ( x C ) =-Lix + O ( xe-cy'IOgX ) , [K: ' where h(C) is the number of elements con- tained in C and c is a positive constant de- pending on K/k. This is an extension of Che- botarev's theorem (E. Artin, 1923). References L1J R. Ayoub, An introduction to the analytic theory of numbers, Amer. Math. Soc. Math. Surveys, 1963. [2J H. Bohr and H. Cramer, Die neuere Ent- wicklung der analytischen Zahlentheorie, EnzykL Math" II C (8), 1922, [3J K. Chandrasekharan, Introduction to analytic number theory, Springer, 1968. [4J P. Erdos, Some recent advances and cur- rent problems in number theory, Lectures on Modern Mathematics III, T. L. Saaty (ed.), Wiley, 1965, 196-244. [5J E. Heeke, Vorlesungen i.iber die Theorie der algebraischen Zahlen, Akademische Ver- lag, Leipzig, 1923 (Chelsea, 1970). [6J E. Landau, Handbuch der Lehre von der Verteilung der Primzahlen I, II, Teubner, 1909 (Chelsea, 1969). [7] E. Landau, Vorlesungen i.iber Zahlen- theorie I, II, Hirzel, 1927 (Chelsea, 1946). [8] G. H. Hardy, Divergent series, Clarendon Press, 1949. [9J H. M. Edwards, Riemann's zeta function, Academic Press, 1974. [10] A. E. Ingham, The distribution of prime numbers, Cambridge Univ. Press, 1932. 124 A Distribution of Values (Complex Variables) [11] D. N. Lehmer, List of prime numbers from 1 to 10,006,721, Carnegie Institution of Washington, 1914. [12J N. Levinson, More than one third of zeros of Riemann's zeta-function on 0" = 1/2, Advances in Math., 13 (1974), 383-436. [13J K. Prachar, Primzahlverteilung, Springer, 1957. [14] A Selberg, An elementary proof of the prime number theorem for arithmetic pro- gressions, Can ad. J. Math., 2 (1950), 66-78. [J 5J E C. Titchmarsh, The theory of the Riemann zeta-function, Clarendon Press, 1951. [16J I. M. Vinogradov, A new estimate of the function ((1 +it) (in Russian), Izv. Akad. Nauk SSSR, 22 (1958),161-164. [17J M. N. Huxley, The distribution of prime numbers, Oxford Univ. Press, 1972. [18J p, Turin, Eine neue Methode in der Analysis und deren Anwendungen, Akademiai Kiad6, Budapest, 1953. [19J H. L. Montgomery, Topics in multiplica- tive number theory, Lecture notes in math., 227, Springer, 1971. [20J E. Bombieri, Le grand crible dans la theorie analytique des nombres, Societe Math- ematique de France, 1974. [21J H. Halberstam and H. E. Richert, Sieve methods, Academic Press, 1974. [22J C Hooley, Applications of sieve methods to the theory of numbers, Cambridge Univ. Press, 1976. [23J H. E. Richert, Lectures on sieve methods, Tata Inst., 1976. 124 (XI.8) Distribution of Values of Functions of a Complex Variable A. General Remarks Suppose that we are given a function f: A --> B and that the variables z, w take on values in A, B, respectively. A value distribution of f(z) is a set of points z where f(z) takes on a certain value w (called w-points of f(z)). Value distri- bution theory is usualIy concerned with the study of value distributions of complex tana- lytic functions. Value distribution theory has been developed extensively and deeply for the case where A is the finite plane Izi < 00 or the unit disk Izi < 1 and B is the extended complex plane Iwl  00, and there are many interest- ing results in this case ( 272 Meromorphic Functions), Value distributions for analytic functions on general domains or on Riemann 
124 B Distribution of Values (Complex Variables) surfaces or of several complex variables have also been studied. B. The Case of Izi < R  00 F or a ttranscendental entire function f(z), every value (including 00) is a value of the tcluster set of f(z) at the point at infinity (tWeierstrass's theorem). This theorem was improved in the following way by E, Picard in 1879: A transcendental entire function f(z) has an infinite number of w-points for any finite value w except for at most one finite value (tPicard's theorem). A value w for which the w- points are at most finite is said to be a Picard's exceptional value, E. Borel gave a precise form of this theorem, taking into consideration the order of a function, and G. Julia proved the existence of Julia's directions ( 272 Mero- morphic Functions; 429 Transcendental En- tire Functions). After other results in value distribution theory had been obtained by J. Hadamard, G. Valiron, and others, R. Nevan- linna published an important work in 1925 in which he established the so-called Nevanlinna theory of meromorphic functions in Izl <R.s; 00, unifying results obtained up until that time, and which became the starting point of the subsequent value distribution theory ( 272 Meromorphic Functions). T. Shimizu and L. V. Ahlfors ga'Ve a geometric meaning to the Nevanlinna tcharacteristic function T(r). Ahl- fors established the theory of covering surfaces by met rico topological methods in 1935, and applied it to obtain the Nevanlinna theory and many other results on meromorphic func- tions. This theory revealed that the topological meaning of the number 2 of Picard's excep- tional values is closely related to the tEuler characteristic 2 of the sphere. H. Selberg es- tablished the value distribution theory of talgebroidal functions and gave a precise form of G. Remoundos's theorem, which corre- sponds to Picard's theorem for algebroidal functions. Moreover, as an extension of the value distribution theory of meromorphic functions, there is the theory ofholomorphic curves or of systems of entire functions. Let fOJ1' ... fn (n;:' 1) be entire functions without common zeros for all and for which fo: fl : ...:fn is not constant. Put f = (/0 ,11' ... ,In). This is a re- duced representation of a nonconstant holo- morphic curve f: C -->pn(C). For ex=(IX o , ex 1 , ..., exn)E C"+! - {a}, considering the zeros of (exJ) = exofo + exJI +... +exnfn (#0), we can extend Picard's theorem and the Ne- vanlinna theory for meromorphic functions to holomorphic curves. 472 We define the characteristic function T(r) ( = T(rJ)) as 1 f, 2n I ' T(r)=- loglfldO, 2n 0 '0 If I =(lfo 1 2 + Ifl1 2 + ... + IfnI2)1/2, where ro is a fixed positive number and A(r)lo = A(r)- A(r o )' Using a bivector [if'J = (/;.fj' - fj/';') of f and f' =(/J;,..., f:), we have another representation of T(r): 1 f ' ds f, s f 2nl[[{,W T(r)=- - -tdtdO n '0 s 0 0 I fl 2 . f is transcendental when Iim T(r)/Iogr= 00, and the number A=dim{(co,c l , ...,c n )EC"+llc o fo+cJl +... +cJn=O} is the degeneracy index of f It holds that O },.s; n -1. As an extension of Picard's theorem, 1. Dufresnoy stated that, for transcendental holomorphic curves f and among IX in gen- eral position, the zeros of (ex, f) are infinite except for at most n+ ..1.+ 1. X (c C"+1 - {O}) is in general position when any p (  n + 1) vec- tors in X are linearly independent. In connec- tion with this result, there are many Picard- type theorems, and when ..1. > 0, there are some particular results for holomorphic curves. H. Cartan, H. and J. Weyl, and Ahlfors extended the Nevanlinna theory to holomorphic curves as follows. For ex=(exO,ex l , .."CXn)EC"+!-{O}, we put I ex I = the length of ex, Ilexfll=l(exJ)I/lexllfl, (ex,f)#O, and define the proximity function 1 f 2n 1 m(r,ex)=- log-dB 2n 0 Ilexfll and the counting function 1 f, 2n I ' N(r,IX)=- logl(exJ)ldOJ. 2n 0 '0 Then we have the first main theorem: For any ex for which (exJ) '¥=0, T(r) + m(ro,ex) = N(r,ex)+ m(r, IX); and the second main theorem: When ..1. = 0, for any ex l , ex 2 , ... , ex q in general position, q (q-n-l)T(r)< L N(r, ex j )+ S(r), jl where S(r)=O(logrT(r)) except for r in a set e of finite logarithmic measure, (i.e., Sedlogr< 00), For ..1.>0, it is conjectured that "q-n- 1" can be changed into "q-n-A-I." This is unsolved except for some special cases. 
473 Some results show relations between excep- tional values and the order as in the case of meromorphic functions. Nevanlinna theory has been extended to the associated curves fP (p = 1,2, ..., n;f 1 = f) forA = 0, and there have been attempts to extend this theory of holo- morphic curves even further by generalizing domains or ranges. C. The Case of General Domains The value distributions of meromorphic func- tions defined in a general domain or an open Riemann surface depend on the function- theoretic "size" of the set of singularities ( 169 Function-Theoretic NuIl Sets) or the type of the Riemann surface ( 367 Riemann Sur- faces). For instance, we have the following theorem of Picard type: A single-valued mero- morphic function with a set of singularities of tlogarithmic capacity zero takes on every value infinitely often in any neighborhood of each singularity except for at most an Fa- set of values of logarithmic capacity zero (Hallstrom-Kametani theorem). For the study of value distribution at general singularities, it is useful to investigate cluster sets ( 62 Clus- ter Sets). In order to generalize the Nevan- linna theory to the case of general domains or Riemann surfaces, we take their exhaustions depending on a real parameter r and define the tcounting function and so on ( 272 Meromor- phic Functions). G. J. Hallstrom established the value distribution theory of meromor- phic functions defined in the complementary domain D of a compact set E of logarithmic capacity zero by taking D, = {z I v(z) < r} as the exhaustion of D, where v(z) denotes the Evans potential for E, i.e., v(z) is the potential corre- sponding to a positive mass distribution J1 on E of total mass 1 which tends to + 00 as z tends to any point of E. Thus the number of Picard's exceptional values is not greater than 2+, where =limsuPHooF(r)/T(r) with - n(r) = the Euler characteristic of D, and F(r) = S:on(r)dr (Hallstrom-Tsuji theorem). J. Tamura, L. Sario, and others studied the value distributions of meromorphic functions de- fined on Riemann surfaces. Sario succeeded in extending the Nevanlinna theory to analytic mappings of a Riemann surface ffi into another Riemann surface is by introducing a suitable metric in is to define the tproximity function. In the Nevanlinna theory on general do- mains, we must sometimes impose condi- tions that the functions must satisfy in order to obtain a concrete conclusion, for instance, the condition that  be finite in the Hallstrom- Tsuji theorem, But it is also important to determine the domains where some result can 125 A Distributions and Hyperfunctions be obtained without imposing any additional conditions on the functions. The Hallstrom- Kametani theorem is an example. Although the set of exceptional values in this theorem cannot be replaced generally by a smaller set than an Fa-set of logarithmic capacity zero, we have the following theorem: Let E be a tCan- tor set with successive ratios n = 2In/ln-I' where In denotes the length of the segments that remain after repeating n times the process of deleting an open segment from the middle of another segment. Then any single-valued meromorphic function with E as the set of singularities has at most 3 Picard's excep- tional values if limnoo n = 0 and at most 2 if n+l =o() (L. CarIeson, K. Matsumoto). By weakening the conditions on E, one can get several improvements of this result. References [IJ K. Noshiro, Modern theory of functions (in Japanese), Iwanami, 1954. [2J A. Dinghas, Vorlesgungen iiber Funk- tionentheorie, Springer, 1961. [3J W. K. Hayman, Meromorphic functions, Clarendon Press, 1964. [4J H. Cartan, Sur les zeros des combinaisons Iineaires de p fonctions holomorphes donnees, Mathematica,7 (1933), 5-31. [5J L. V. Ahlfors, The theory of meromorphic curves, Acta Soc. Sci. Fenn., ser. A, 3 (1941), 3-31. [6J H. and J. Weyl, Meromorphic functions and analytic curves, Ann. Math. Studies 12, Princeton Univ. Press, 1943. [7J M. Cowen and P. Griffiths, Holomorphic curves and metrics of negative curvature, J. Analyse Math., 29 (1976), 93-153. [8J L. Sario and K. Noshiro, Value distri- bution theory, Van Nostrand, 1966. [9J K, Matsumoto, Existence of perfect Picard sets, Nagoya Math. J., 27 (1966), 213-222. 125 (XII.7) Distributions and Hyperfunctions A. History The advancement of analysis, particularly in the field of partial differential eq uations and harmonic analysis, necessitated the generali- zation of the notion of functions and deriva- tives, For instance, "functions" such as Dirac's tdelta function and tHeaviside's function were used by physicists and engineering scientists 
125 B Distributions and Hyperfunctions even though the former is not a function and the latter is not a differentiable function in the classical sense. The finite parts of divergent integrals, used by J. Hadamard to investigate the fundamental solutions of wave equations (1932), and the Riemann-Liouville integrals due to M. Riesz (1938) were the notions that eventually led to the theory of generalized functions. The rudiments of the idea of distri- bution, however, can also be found in other earlier works. S. Bochner (1932) and T. Carle- man (1944) discussed the Fourier transforms of locally integrable functions on the reals with growth as large as a polynomial. S. L. Sobolev introduced the notion of generalized derivative and also of generalized solution of differential equations by means of integration by parts in studying the Cauchy problem for hyperbolic equations (1936); 1. Leray (1934), K. O. Fried- richs (1939), and C. B. Morrey, Jr. (1940) also discussed generalized derivatives. On the other hand, L. Fantappie (1943) investigated analytic functionals that are elements of the dual of the space of analytic functions and applied them to the theory of partial differential equations. Based on a systematic generalization of these investigations, L. Schwartz [1] established the theory of distributions (1945), which not only provided a mathematical foundation for a number of formal methods that had been used in mathematical physics but also gave new and powerful tools for the theories of differential equations (L. Hormander [2]) and tFourier transforms. Furthermore, it has been applied to trepresentation theory oflocally compact groups, the theory of probability, and the theory of manifolds (G. de Rham [3]). As will be seen in Section B, distributions are defined as continuous tfunctionals on a certain func- tion space, and it is essential to select a func- tion space appropriate to the problems con- cerned. For this reason, 1. M. Gel'fand and G. E. Shilov defined various classes of general- ized functions [4J as a natural extension of Schwartz's theory. In this direction there are also various classes of ultradistributions intro- duced by C. Roumieu [5J and A. Beurling. Another but completely different approach was given by M. Sato [6, 15] in the form of the theory of hyperfunctions (1958). Intuitively a hyperfunction is the sum of (ideal) boundary values of holomorphic functions at the real axis. We obtain in this way an ultimate class of localizable generalized functions. Sato em- ployed the relative (or local) cohomology theory to define the "boundary values" and to prove their localizing property. Such an alge- braic approach to generalized functions led naturally to an algebraic treatment of systems of linear partial differential operators [7], called algebraic analysis by comparison with 474 algebraic geometry. Independently, Horman- der established a similar theory for distri- butions using Fourier analysis ( 274 Micro- local Analysis). B. Definition of Distributions Let <p(x) be a complex-valued function of x == (Xl' ..., x n ) defined on an open set 0 in the n- dimensional Euclidean space R n By the sup- port (or carrier) of <p, denoted by supp <p, we mean the tclosure of {x I rp(x) #O} in 0, For multi-indices P, i.e., n-tuples P= (PI, ..., Pn) of nonnegative integers, we set Ipi = PI + '" + Pn' For a function <p(x) of class Clpl, we write olpl<p(x) DP <p - PI 'p ' oX I ...OX n " (1) In particular, D(O, ,O)<p=<p. To indicate the variables x we shall adopt the notation DJ:. :]1(0) denotes the set of all complex-valued functions of class C"; defined on 0 with tcom- pact support, which is a tlinear space under the usual addition and scalar multiplication in function spaces. A sequence {<Pm} in :]1(0) is said to converge to 0 (the function identically equal to zero) as m-->oo, denoted by <Pm =0, if there exists a compact set E in 0 such that E contains supp <Pm for every m, and for every p, {DP<pm} converges uniformly to 0 as m-->oo. We sometimes abbreviate :]1(0) either as :]1 or, when we want to indicate the variables x, as :]1x. A complex-valued tlinear functional T de- fined on :]1(0) is called a distribution on 0 if it is continuous on :]1(0), i.e., <Pm = 0 implies T(<Pm)-->O. The set of all distributions on 0 is denoted by 9tJ'(0) (or :]1'). For distributions S and T, the sum S + T and scalar multiple exT are defined by (S+ T)(<p)=S(<p)+ T(<p) and (exT)(<p)=exT(<p), respectively, which are also distributions. Hence '(O) is a tlinear space. C. Examples of Distributions (J) Let f(x) be a tmeasurable and tlocally integrable function on O. Then a distribution If is defined by If(<p) = S <p(x)f(x)dx. Here dx is the tLebesgue measure and the domain of integration is 0 (in fact, supp <p). If If = Tq, then {(x)=g(x) tal most everywhere. Thus we can identify the distribution If with the cor- responding function f, and sometimes If will be denoted simply by f (2) Let J1 be a tRadon measure on 0, i.e., a complex-valued tregular completely ad- ditive set function on the tBorel sets in O. Then a distribution I;, is defined by 7;,(<p)= S <p(x)J1(dx). Example (I) is a special case where 
475 J1(dx) = f(x)dx. Two Radon measures J1 and v coincide if T" = Tv. If the measure is concen- trated at the origin, then T,,(<p)=c<p(O), denoted by cc5, and 15 is called Dirac's distribution, Sometimes 15 is denoted by c5 x , c5(x), or c5(x) to indicate that it operates on functions of x. A distribution TE !J1'(Q) is a measure if and only if T(<Pm)-->O whenever the supports supp <Pm of a seq uence <Pm E !J1(Q) are in a fixed compact set in Q and <Pm converges uniformly to zero. A distribution T is called a positive distribu- tion if T(<p);:'O for any <pE!J1(Q) that has non- negative values at all points x in Q. Every positive distribution is equal to a T" corre- sponding to a positive measure J1. (3) For given p the distribution c5(p) is defined by c5(P)(<p) = (-I )lpiDP <p(0). 15(0, .0) = 15. (4) Let g(x) be a function defined but not integrable on an interval (a, b), and assume that for any positive number I; it is integrahle on (a+l;,b). Moreover, assume that n g(x)= L A,(x-a)-A'+h(x), v;;;;; 1 where Rd v > 1, Av is not an integer, and h(x) is integrable on (a, b). Then f b g(x)dx- L A,(Av- J)-II;H'=F(£) aH tends to a finite value as 1;-->0. This limit is called the finite part (in French partie finie) of the integral S g(x)dx, denoted by PfJ g(x)dx: Pf r g(x)dx A f b = - L (b-a)H,+ h(x)dx, v Av - t a In the same way, for every <pE!J1(R 1 ), T(<p)= PfJ; g(x)<p(x)dx is defined, and T is a distribu- tion which will be denoted by Pfg, and which is frequently called a pseudofunction. This notion is extended to the n-dimensional case and used to express the fundamental solutions ( Section EE; Appendix A, Table 15.V) of thyperbolic partial differential eq uations. D. Localization of Distributions Let Q' be an open subset of Q. Every function <p E !J1(Q') can be extended to a function (j5 E !J1(Q) by setting (j5(x)=O for xfj:cQ'. Thus if TE!J1'(Q), then a distribution SE!J1'(Q') is de- fined by S(<p) = T«(j5) for every <pE!J1(Q'), and S is calIed the restriction of T to Q'. Two distri- butions T and S are said to be equal in Q' if their restrictions to Q' are equaL If every point in Q has a neighborhood where T= S, then T = S. In this sense a distribution is determined completely by its local data, although the I25F Distributions and Hyperfunctions notion of pointwise value, having exact mean- ing for functions, has no meaning for distri- butions. Moreover, we can construct a distri- bution with given local data in the following way: Suppose that an topen covering {Qj} of Q and a set of distributions 1)E !J1'(Q) are given and that 1) and 1k are equal in Qj n Qk for any j and k. Then there exists a unique distribu- tion TE!J1'(Q) such that T= 1) in Qj for each). (Define T(<p)=L 1)(!Y. j <p) for a tpartition of unity {exJ subordinate to {Qj})' Namely, the distri- butions !J1' form a tsheaf of linear spaces over R n . It is not tflabby but is soft, i.e., for any distribution T defined on a neighborhood of a closed set Fin R n , there is a distribution on R n that coincides with T on a neighborhood of F. For each distribution TE!J1'(Q) there is a largest open subset Q' of Q on which T van- ishes. Its complement is called the support (or carrier) of T and is denoted by supp T. The support of the distribution given in example (I) coincides with that of the function f The sup- port of c5(p) is the origin. E. Derivatives of Distributions In example (I) in Section C, if [(x) is a func- tion of class C\ then by llltegration by parts TDPf(<P)==( -1)lpl1f(DP<p) for Iplk. The right- hand side defines a distribution even if r is not differentiable. In view of this example, we define derivatives DP T of any distribution T by (DPT)(<p)=( _J)lpIT(DP<p), <pEf!Zx. (2) Any distribution is infinitely differentiable, Any locally integrable function is infinitely differentiable in the sense of distributions, and its derivatives DP1f are ca11ed distribution derivatives (or generalized derivatives or weak derivatives). For example: (1) D P c5 =c5(P); (2) (I-dimen- sional case) dx+/dx = 1, dl/dx=c5, where x+ = max(x,O) and 1 (x) is Heaviside's function, which is eq ual to 1 for x;:' 0 and to 0 for x < O. F. The Operation of Linear Differential Operators on Distributions Let T be a distribution and ex(x) a C'" -function on Q. Then we can define the product exT by (exT)(<p) = T(ex<p), where the right-hand side is a continuous hnear functIOnal on !J1(Q) since the multiplication <pf->ex<p is continuous in !J1(Q). We define the dual operator (or conjugate operator) P'(x, D) of a linear differential opera- tor P(x, D) = L ap(x)DP by P'(x,D)<p= L( _1)IPIDP(a p (x)<p(x)), (3) where the ap(x) are C')-functions on Q. Com- 
125 G Distributions and Hyperfunctions bining differentiation of distributions, multi- plication by functions, and addition, we can apply partial differential operators to distribu- tions, and we have (P(x,D)T)(<p) = T(P'(x,D)<p). Linear differential operators commute with restriction mappings. In particular, we have supp P(x, D) T c supp T. In other words, linear differential operators are tsheaf homomor- phisms on the sheaf £0' of distributions over Q. On the other hand, every continuous sheaf homomorphism on £0' is a linear differential operator whose order is finite on each com- pact set in Q. Even when no continuity is assumed, a sheaf homomorphism on £0' is a linear differential operator except on a discrete subset ofQ (1. Peetre). G. The Topology of £0 and £0' Let 0=KI K2" be a sequence of com- pact sets in Q that exhausts Q. For every nondecreasing sequence of positive numbers {a} = {a o , a 2 ,... } and non decreasing sequence of nonnegative integers {k} = {ko, k l ,...} we set P:a},{k}(<P)=SUP sup supajIDP<p(x)1 jO Ipl';;kjxKj for <pE£0(Q). We define a tlocally convex topol- ogy of the space £0(Q) by taking the totality of the tseminorms P{a},{k} as a fundamental system of continuous seminorms. Then £0(Q) is a tnuclear t(LF)-space, and the convergence <Pm =0 of a sequence <Pm is identical to the convergence <Pm-->O in this topology. A sub- set Be £0(Q) is tbounded in the topology if and only if there exist a compact set E c Q such that supp <P c E for every <pE B, and positive numbers Mp for each p such that sup IDP<p(x)l.s; Mp for every <pE B. The topology of the space £0'(Q) is the tstrong topology of the tdual of £0(Q): the ttopology of uniform convergence on every bounded set in !0(Q). Under this topology £0(Q) is a tnuclear treflexive linear topo- logical space. With respect to this topology, the linear differential operators of Section F are continuous in £0'. By virtue of the following convergence theorems, various limiting processes concern- ing distributions can be treated easily. If the limit Iim 1J(<p) = T(<p) exists for every <pE£0, where {1J} is a sequence of distributions, then TE £0' and {1J} is convergent to the distribu- tion T (convergence theorem). Moreover, for any p, DP 1J is convergent to DP T (theorem of termwise differentiation). Any bounded set in £0 or £0' is totally bounded. Thus weak conver- gence of a sequence {I;} implies strong conver- gence (convergence in the topology of the space £0') (strong convergence theorem). 476 H. Distributions Depending on a Parameter Consider distributions T A depending on a parameter }", where }, ranges over the real line, the complex plane, or more generally an open set in Euclidean space. The convergence theo- rem and the strong convergence theorem also hold in the case of a continuous parameter ..1.. Thus T A is continuous (differentiable) with respect to the parameter}" if TA(<p) is continu- ous (differentiable) with respect to A for any <pE£0. If T A is continuous with respect to ..1., then D!:TA is also continuous with respect to },. If T A is differentiable with respect to a real variable }" then D!: T A is also differentiable, and aD!: TAlaA = D!:(aTAlaA). The same facts hold for the case of several real variables. For a com- plex parameter ..1., T A is holomorphic with re- spect to ..1. if TA(<p) is holomorphic for any <pE£0. The fundamental properties of holomorphic functions also hold in this case. If T;, is defined and continuous in an interval [a, b J, then the integral T = S T A dA with respect to }, exists in £0' and we have (4) T(<p) = r TA(<p)dA, <pE£0. I. Distrihutions with Compact Support We denote by 6"(Q) (abbreviated to 6") the space of all complex valued Coo -functions on Q. 6"(Q) is a nuclear tFrechet space with the locally convex topology defined by the semlllorms PP.K(<P) =sup IDP <p(x)1 XEK (5) as p ranges over all multi-indices and K ranges over all compact sets in Q. We denote by 6"'(Q) (abbreviated to 6"') the tstrong dual of 6"(Q), i.e., the set of all continu- ous linear functionals on 6"(Q) equipped with the topology of uniform convergence on bounded sets in 6"(Q). 6"'(Q) is a nuclear t(DF)- space, If TE 6"'(Q), then its restriction to £0(Q) is a distribution with compact support. Con- versely if T is a distribution with compact support in Q, then choosing exE£0(Q) which is identically equal to 1 on a neighborhood of the support of T, we define a linear functional Son 6"(Q) by S(<p) = T(ex<p). Then SE6"'(Q), and S is independent of the choice of ex. In this sense, we can consider that S is the extension of T to 6"(Q) and identify Sand T. thus 6"'(Q) coincides with the set of all distributions with compact support in Q. J. Structure Theorems for Distributions A distribution TE£0'(Q) is said to be of order at most k if I T(<p)I.s; Pia}, [k}(<P) for {k} = {k, k, k,.,.} 
477 and for some {a}, A distribution of order 0 is a measure. Every distribution of finite order can be represented as a finite linear combination of derivatives (in the sense of distributions) of measures or locally integrable functions. The restriction of any distribution to a relatively compact open subset Q' can be represented in this way because it is of finite order. There- fore the distributions form the smallest class of generalized functions that contains all locally integrable functions, is stable under differentiation, and forms a sheaf. Further- more, for any distribution T there exists a sequence {<pJ c £0 that converges to T in the distribution sense. A closed set F is said to be regular if for every point a in F we have a neighborhood U of a and constants wand I ;:, ex > 0 such that every pair of points x and y in F n U is con- nected by a curve contained in F with length less than or equal to wlx- yl'. If the support of a distribution T is contained in a regular closed set F, then T= '" DPjT L. J1j (the expression IS not necessarily unique, and the sum is locally finite), where the J1j are complex-valued measures with support con- tained in F. In particular, if the support of a distribution T contains only one point a, then T can be represented uniquely as a finite linear combination T= L expDPb(x-a) Ipl,;;m of derivatives of the distribution b(ra) defined by b(x-a)(<P) = <p(a), K. Tensor Products of Distributions Let Qx c R n and Qy c R m be open sets. If TE £0'(Qx) and SE£0'(Qy), then there is a unique distribution T@ S E £0'(Qx x Qy), called the tensor product (or direct product), such that T@ S(<p(x)ljt(y)) = T(<p)S(Ijt) for any <pE£0(QxJ and IjtE£0(Qy). More directly we have Fubini's theorem: T@ S(<p(x, y») == TASy(<p(x, y))) == Sy('T;;(<p(x,y))) for any <pE£0(Qx x Qy). If F and G are linear spaces of distributions on Qx and Qy, respectively, then the ttensor product F @ G is identified with the linear combina- tions of tensor products T@ S of TE F and S E G, which form a linear space of distribu- tions on Qx x Qy. When F and G have tlocally convex topologies, the tfompleted tensor products F @ G and F @ G are also usually identified with subspaces of £0'(Qx x Qy). For example, we have £0'(Qx) @ £0'(Qy) = £0'(Qx x Qy), .&"(Qx) @ .&"(Qy) = .&"(Qx x Q) and .&'(QJ @ .&'(Qy) = .&'(Qx x Qy) (mcluding the topologies). Similarly, we have £0(Qx) @ £0(Qy) = 9?(Qx x Qy), 125M Distributions and Hyperfunctions but the left-hand side has a strictly weaker topology than the right-hand side. Spaces of distributions with parameters ( Section H) are often identified with completed tensor products of spaces of distributions [8-11]. L. The Kernel Theorem Let Qx and Qy be as above. Every distribu- tion K on Qx x Qy induces the continuous linear mapping L K : £0(Qx) -->£0'(Qy) defined by (LK(<p(x))) (Ijt(y)) = K(<p(x)ljt(y), Ijt E £0(Qy)' Conversely every continuous linear mapping L:£0(Qx)-->£0'(Qy) is equal to L K for a KE £0'(Qx x Qy) and the correspondence K f-> L K is a topological isomorphism of the locally con- vex space £0'(Qx x Qy) onto L(£0(Qx), £0'(Q) equipped with the topology of uniform conver- gence on the bounded sets (L. Schwartz's kernel theorem, [8-11 J). K is called the kernel (distribution) of the mapping L K . L K is a continuous linear mapping £0(Qx)--> .&'(Qy) (resp. £0(Qx)-->.&"(Qy)) if and only if K E £0'(Qx) @ .&'(Qy) (resp. £0'(Qx) @ .&"(Qy)), in which case K is said to be regular (resp. compact) in y. L K can be extended to a continuous linear mapping .&"(Qx)-->£0'(Qy) (resp. .&'(Qx)-->£0'(Qy)) if and only if K E.&'(Qx)@ £0'(Qy) (resp. .&"(Qx) @ £0'(Qy)), in which case K is said to be regular (resp. compact) in x. K is said to be regular (resp. compact) if it is regular (resp. compact) both in x and y. L K can be extended to a con- tinuous linear mapping .&"(QxJ-->.&'(Qy) (resp, .&'(Qx)-->.&"(Qy)) if and only if K E.&'(Qx x Qy) (resp. .&"(Qx x Qy)). Then K is said to be regu- larizing (resp. compactifying) [8, 10]. M. Convolution For distributions Sand Ton R n , assume that either S or T has a compact support or more generally that supp Tn ({ x} - supp S) is locally uniformly bounded with respect to x. Then the correspondence <p E £0 --> S(x) 7(y)( <p(x + y)) defines a distribution called the convolution of Sand T and is denoted by S * T. In particular, If * Tg = If.g, where f * 9 is the tconvolution of functions f and g, and If, Tg, and Tf.g denote the distributions corresponding to f, g, and f * g, respectively, For example: T* b = b * T= T, DPT* S= T*(DPS)=DP(T*S). Thus a solution of the partial differential equation P(D) T = S with constant coefficients is given by the convolution S * E of S with a tfundamental solution E of P(D), whenever the convolution exists. If TE£0' and <pE£&, the convolution T*<p is equal to a function f of class C>C (or the 
125N Distributions and Hyperfunctions distribution corresponding to I), and f(y) = I;x)(<p(y - x)). f is called the regularization of T. For distributions Sand T, assume that If(x)g(y - x)1 is integrable on R n for any regu- larization f = S * <P and 9 = T * if;. Then there exists a unique distribution V such that V*(<p*if;)= f f(x)g(y-x)dx. This distribution V is called the general- ized convolution and is denoted by S * T (c. Chevalley), N. Tempered Distributions :?' We denote by !f'(Rn) (abbreviated to .'f') the space of all trapidly decreasing functions of class Coo on Rn. :? is a nuclear Frechet space with the locally convex topology defined by the seminorms Pp,q(<p) = sup Ix P Dq<p(x)l, x as p and q range over all multi-indices, where X P =Xf1... x%". We denote by Y"(R n ) (abbre- viated to :?') the strong dual of :?(R n ). Its elements are called tempered distributions (or slowly increasing distributions) and are identi- fied with their restrictions to !J1(Rn). A distri- bution TE q;' can be continuously extended to ."Ie> if and only if any regularization f = T* <p of T is a slowly increasing continuous func- tion (i.e., there is a polynomial P(x) such that I f(x)1  IP(x)l). Y'(R n ) is a nuclear (DF)-space. Let I < p < 00, and let q be the tconJugate ex- ponent. The strong dual of the tfunction space !J1 L (R n ) is denoted by !J1 (R n ). Since:? is dense in 9&L ,!J1{ is identified ith a linear subspace q p of :?'. The strong dual of q;{ (R n ) is equal to q;Lq(R n ). Let :1d(R n ) be the cloed linear sub- space of £I6'(R n ) consisting of all functions <p such that DP <p E Coo(R n ) for all p ( 168 Func- tion Spaces). It is also the closure of ."Ie> in £16'. The strong dual of iiJ is denoted by !J11 (Rn). Its elements are called integrable distributions because the strong dual of !J11 (Rn) is equal to 26'(R n ) so that the integral T( I) has a meaning. Here I is the function identically equal to 1. tSobolev spaces W(Rn) and tBesov spaces B,q(Rn) are also linear subspaces of :?'(R n ). O. Fourier Transforms The Fourier transform ,:F<p(x) = (J2;)-n f <p«()exp( -ix)d (7) is an isomorphism of  onto Y',;, where x == x I  1 + ... + X n n' The Fourier transform 478 :77TE:?i of TE,Cf:; is defined by (iFT)(<p) = T(:77<p), <pE:? (8) (6) The inverse Fourier transform is defined simi- larly, except that - i is replaced by i. For example: (1):771 = (J2;)nc5, where 1 is the distribution corresponding to the function I. (2) :77(DPT) = i1p1x p :77T. A function <p of class Coo is called a slowly increasing Coo-function if <p and its derivatives of any order are slowly increasing continuous functions. The space (!1M is the set of all such functions. Its image under the Fourier tranS- form :77«(!JM)' denoted by (!I, coincides with the set of all distributions T such that any regularization f = T* <p is a rapidly decreas- ing C"'-function. A member of the space (!Ie is called a rapidly decreasing distribution. If exE(!IM ([3E(!I) and TE:?', then exTE,'/''' and :77(rtT)=(J2;)n:77ex *:77T ([3 * TE:?' and :77([3 * T) = (J2;)" :77[3:77T). If T is a distri- bution with compact support in Rn, then its Fourier transform is the tentire function f(() = (J2;)-nT(exp( -ix()), where (= + irJEC". F or a convex compact set K in R n , define its supporting function H K by HK(rJ)=SUPXEKXrJ, Paley-Wiener theorem. An entire function f(O on C" is the Fourier transform of a distri- bution (resp. a function of class COO) with sup- port in K if and only if (i) for any £ > 0 there is a constant C, such that If«() I  C,exp(HK(rJ)+ Elm on C", and (ii) there are constants C and N such that If()I.s; ql + 1lt (resp. for any N there is a constant C such that If()1  ql + IW- N ) on R n . P. Fourier Series and Distributions on Tori The n-dimensional ttorus Tn is the quotient space of R n with respect to the equivalence relation x j = yj mod Z (j == I, ... , n). The space 9'(Tn) of distributions on Tn is defined to be the strong dual of !J1(Tn)=.&'(T n ). The volume element dx of Tn is defined from that of R n . Thus, for an integrable function f, we can define a distribution 7f by 7f(<p)= f f(x)<p(x)dx, <pE!J1(Tn), T" Consider the family of functions fp(x) = exp(2nipx), where p ranges over all n-tuples of integers. Then any distribution T on Tn has the Fourier series expansion T= L cpfp(x), where the Fourier coefficients c p = T(/_p) are slowly increasing, i.e., ICpl.s; ql + Ipl2l for some k and C. Conversely if {c p } is a slowly increasing sequence, then L cpfp converges to a distribution T on Tn 
479 Q. Substitution Let f =(/1,'" ,1m) be a Coo-mapping from an open set Q in R n into an open set Q in R m , and assume that the trank of the Jacobian matrix (a/;/ax) (i= 1, ...,m;j= 1, ...,n) is equal to m in a neighborhood of the tin verse image E of the support of a distribution S = Sty) E !J1'(Q). In a neighborhood V of E, we can choose u l = gl (x), ..., U n - m = gn-m(x) so that the trans- formation (y, u) = (/(x),g(x)) has the inverse transformation x = tjJ(y, u) of class Coo. If the support of <pE!J1(Q) is contained in V, then defining ip by ip(y) = L,-m <p(tjJ(y,u))J(y,u)du, we have ipE!J1(Q), where J(y, u) is the abso- lute value of the tJacobian of tjJ(y, u), and ip is independent of the choice of 9 I ... , gn-m' Now we define T(<p) = S(ip) ifsupp<pc V, and T(<p) = 0 if supp <p does not intersect E. Since a distribution can be determined by its local behavior, we have a distribution TE!J1'(Q) with support E, which is denoted by T=Sof=S(f) = Sf(X) and is called the substituted distribution of Sty) by y = f(x). It is also called the pullback of S by f and is denoted by f*S. The chain rule for derivatives of composites a m a/; ( as ) -(Sof)= I - -of aX j i1 aX j aYi also holds. F or example: (1) If S = c5lil (y E R m ), then E is the surface fl(x) = f2(X) = ." = fm(x) =0. Assum- ing that (/1, .. . ,1m) satisfies the condition in the previous paragraph, we obtain the distribution c5(q) (/1 , .., ,1m) with support contained in a sur- face, From the fact c5(Q) = Dic5, we can write c5 (Q) (f f, ) - a IQ1 c5(/" ... ,1m) I, ..., m (afd Q1 ...(afm)Qm (Gel'fand-Shilov notation). (2) For the map- ping f(x) = Ax + b, where A is an n x n regular matrix and b is an n-vector, we can define the substituted distribution of S by f = f(x) for any SE!J1'(R n ). For instance, c5(X-b)(<P) = <p(b). c5(x'-c2) =(2C)-I(c5(x_c) + c5(x+ej) (c > 0, XE R I). R. Distributions and Currents on a Differentiable Manifold Let M be an n-dimensional tdifferentiable manifold of class Coo. Let {( V., tjJ,)} be an tatlas. Then a distribution Ton M is defined to be a collection of distributions T. on tjJ,( V,) c Rn such that Tp on tjJp(V,n Vp) is equal to T,o (tjJ,OtjJp-l) for any ex and p. Since the distribu- tions obey the same transformation law as 125 S Distributions and Hyperfunctions functions under coordinate transformation, this definition has an invariant meaning, and every locally integrable function on M is re- garded as a distribution, More generally, a distribution tcross section of a tvector bundle of class CX) is defined in the same way. The most important is the case of the tp-fold ex- terior power of the tcotangent bundle. Its distribution cross sections, i.e" texterior dif- ferential forms of order p with distribution coefficients, are called currents of degree p. If M is toriented, then the space of currents of degree p on M is identified with the dual space of the locally convex space !J1(n- p )(M) of all differential forms of degree n - p, of class C'" and with compact support in M. For example, if C is an (n - p)-dimensional tsingular chain, a current Tc of degree p is defined by Tdex) = Scex. If M is not torientable, we have to consider either the double covering 1\1 of M or the cross sections of the tensor product of the bundle of (n - p)-covectors and the orientation bundle [1,3]. Currents were introduced by de Rham to prove his celebrated isomorphism theo- rem of the tde Rham cohomology groups de- fined by differential forms and the tsingular cohomology groups defined by singular co- chains ( 105 Differentiable Manifolds), (9) S. Gel'fand-Shilov Generalized Functions Let E and F be tlocally convex spaces for which a continuous linear injection i: E-->F with dense range is defined. Then the dual i': F' --> E' is a continuous linear injection with weak*-dense range. Therefore if we shrink the function space E, we obtain a larger space E' of continuous linear functionals. Gel'fand and Shilov [4J introduced many function spaces E, called fundamental spaces or test function spaces, and defined corresponding spaces E' of generalized functions. Their motivations were in applications to the theory of partial dif- ferential equations, where Fourier transforms play an essential role and Schwartz's frame- work of tempered distributions is too restric- tive. They considered a pair of funchon spaces E and E such that the Fourier transformation .f7:E-->E is an isomorphism of locally convex spaces. Then the Fourier transformation de- fined to be the dual of.f7 gives an isomor- phism E' --> E' of spaces of generalized func- tions. The function spaces E and E are often spaces of entire functions, so that the Gel'fand- Shilov generalized functions are not neces- sarily localizable in R n . As typical examples of spaces E and E we shall mention only spaces of type S in the next section. L. Ehrenprels's analytically uniform spaces 
125 T Distributions and Hyperfunctions [12J are also a class of generalized function spaces introduced from a similar point of view. T. Spaces of Type S Let ex, fJ, A, B, A, and B be n-dimensional vectors. By A> B we mean Aj> Bj (j= I,.." n). We use notations AP=A l 1... A', pP'=pf1'1... p"', (i) For ex;:' 0, A> 0 the space S"A consists of a1l COO-functions <P such that seminorms IxPDq<p(x)1 Pq,A(<P)=SUpsup AP P' x P P are finite for a1l A> A and q. For example, So, A is the space of al1 functions of class Coo with support in {x II xjl.s; Aj,J = I, ... , n}. (ii) For fJ;:'O, B>O the space Sp,B consists ofa1l Coo-functions <P such that the semi norms Ix P Dq<p(x)1 Pp B(<P)=SUpsup - p , x q Bqqq are finite for a1l jj> Band q. (iii) For ex, fJ;:' 0 and A, B > 0, the space st:;t consists of a1l C oo _ functions such that the semi norms IxPDq<p(x)1 PA,B(<P)=SUpsupsup APBq P' qp x P q P q are finite for a1l A> A and jj> B. The topol- ogies for these spaces are given in terms of the seminorms (10), (11), and (12), respectively. These spaces are generica1ly ca1led spaces of type S. $'(S,.A)=S"A (ex>O), $'(Sp,B) == Sp.B (fJ > 0), $'(SU)=S;J:; (ex+fJ> 1). $'(So, A) = So, A', $'(SO,B) = SO,B', $'(S{.;t) = S'P:;: (ex + fJ = I), where A'=Aexp(ljA) and B'=Bexp(ljB). We set S == U S SP= U Sp,B and SP== 0[ a,A, , a U st:;t, where A and B range over a1l positive n-dimensional vectors. st # {O} if and only if one of the following conditions is satisfied: (i) ex+fJ;:' 1, ex>O, fJ>O; (ii) ex> 1, fJ=O; (iii) ((=0, fJ> I. In such a case the space St' of general- ized functions contains the tempered distribu- tions Y"(R n ). U. Ultradistributions The localizing property of distributions is proved mainly from the existence of tpartitlOns of unity by functions in £0, Therefore if a test function space admits partitions of unity, the corresponding class of generalized functions 480 (10) forms a sheaf. Roumieu [5] took the space 9tJ iMp ](Q) oftu1tradifferentiab1e functions of class {Mp} with compact support. We can also use the space £0(M p )(Q) of class (M p ). The cor- responding generalized functions are ca1led ultradistributions of class {Mp} and of class (M p ), respectively. Here Mp is a sequence of positive numbers satisfying the logarithmic convexity M;.s; M p _ 1 M p +1 and the Denjoy- CarIeman condition L MpjMp+1 < 00. £0{M p }(Q) (resp. £0(M p )(Q) is the space of al1 functions <pE £0(Q) such that there are constants k and C (resp. for any k > 0 there is a C) for which we have ID'<p(x)l.s; Ckl'l MI'I' In particular, the GeVfey classes {s} = {piS} and (s)=(p!S) defined for s> 1 are important, and they ap- pear often in theory of differential equations. Almost all results for distributions have been extended to u1tradistributions under appropri- ate conditions on Mp, which are satisfied by Gevrey sequences pis ( [14J in particular). Closely connected are A. Beurling's test func- tion spaces £0",(Q) ( [13J), where w is a func- tIOn on R n continuous at the origin, satisfying 0= w(O).s; w( (+ '1).s; w() + w('1) for a1l  and '1 in R n and such thatJ w()(1 + 1m -n-I d < 00. £0",(Q) is defined to be the space of a1l <P E £0(Q) such that the Fourier transform cP satisfies S IcP()1 exp(}.w(md < 00 for any ..1. > O. If w() = log(1 + I I), then £0", = £0; if w() = I II/s, then £0", =£0(,). Let w()=w( - ). The space £0;"(Q) of Beurling's generalized distributions is defined to be the strong dual of £0 w (Q). (11) (12) V. Hyperfunctions In this and the fo1lowing sections we mean by a cone r eRn a convex open cone with vertex at O. For two cones r, , we write   r if Ansn-I is relatively compact in rns n - I , where sn-I = {Ixl = I}. By a wedge we mean an open subset ofC" of the form Q+ir, where QeR n is an open subset and r e R n is a cone. r is ca1led the opening of the wedge and Q its edge. By an infinitesimal wedge (O-wedge for short) or a tuboid of opening r and edge Q, we mean a complex open set U such that U e Q + ir and that for any   r, U contains the part of Q + i which is contained in some complex neighborhood of the edge Q. The symbol Q+ irO wi1l represent anyone of such open sets, and CD(Q + irO) (the inductive limit of) the total- ity of functions holomorphic on some of them. A hyperfunctionf(x) on an open set QeR n is an equivalence class offormal expressions of the form N f(x) == L Fj(x + iIjO), jl (13) where Fj(z)Ee:(Q+iIjO). {Fj(z)} is ca1led a set 
481 of defining functions of f(x). Here the equiva- lence relation is given as +++m=++n (14) if fjn r k # 0, that is, we can contract two terms into a single term as above and, conversely, can decompose, if possible, a term in the inverse way. These are considered to be modi- fications of the expression of the same hyper- function, The totality of hyperfunctions on Q is denoted by £i6'(Q). It is a linear space by virtue of the linear structure naturally induced from that of holomorphic functions, combined with the above equivalence relation. The symbol (x + ifjO), which represents by itself a hyperfunction, is called the boundary value of (z) to the real axis. It is merely formal and does not imply any topological limit, though there is some justification for the terminology as will be seen in Section Z. In the case of one variable, we have only two kinds of wedges Q + iR:t, hence a hyper- function can be expressed by two terms: F+(x + iO) - F_(x - iO), Some examples of hyperfunctions of one vari- able are b(x)== -(2ni)-1«x + iO)-l -(x - iO)-l); l(x)= -(2nW 1 (log( -x-iO)-log( -x+iO)); Pfx- l =«x + iO)-l +(x -iO)-1)j2. W. Localization of Hyperfunctions If Q' c Q is an open subset, the restriction mapping £i6'(Q)-->£i6'(Q') is induced from that for holomorphic functions. With this structure the correspondence Qf->£i6'(Q) becomes a tpresheaf. It is in fact a sheaf, because it can be expressed by the terminology of relative (or local) co- homology as follows: Let Ht(C", ff) denote the kth relative cohomology group of the pair (C", C"" Q) (also called the kth local coho- mology group with support in Q) with coeffi- cients in a sheaf ,W' on C". (It is by definition the kth tderived functor of ff f-> rn(C", ff) = the totality of sections of ,<7F defined on a neighborhood ofQ and with support in Q and is calculated as the kth cohomology group of the tcomplex rn(C", 2), where 2 denotes any flabby resolution (i.e., tresolution by tflabby sheaves) of ff,) Let £'A,(ff) denote the kth derived sheaf of ff to Rn. (It is by definition the sheaf on Rn associated with the presheaf Qf->Ht(C",ff).) Then the cohomological definition of the sheaf of hyperfunctions is £i6' ='IH0) (the orientation being neglected), A fundamental theorem by Sato says that R n c C" is purely n-codimensional with respect to o (i.e., <:Yt"«(())=0 for k#n), and moreover Ht(C", (()) = 0 for k # n for any open set Q eRn, 125 W Distributions and Hyperfunctions Then by the general theory it can be shown that the remaining H(C", 0) agrees with the section module £i6'(Q) of £'R"(0), Since H"(V, (()) = 0 for any open set V c C" (B. Malgrange), it follows further that the sheaf £i6' is flabby, i.e., its sections on any open set can always be extended to the whole space. If U is a tStein neighborhood ofQ, then H(C", (()) can be expressed using the covering cohomology as the quotient space (()(U#Q)!jt (()(U#jQ), (16) where U #Q={zEUlprk(z)ttprk(Q) for alI k}, U#jQ={zEUlprk(z)ttprdQ) fork#j}, (17) and pr k is the projection from C" to the kth coordinate, Then the isomorphism H(C", (()) = £i6'(Q) is induced by the correspondence 0(U #Q)3F(z)f-> [F(z)J = LsgnO"F(x + ir a O)E£i6'(Q), a (18) (15) where ra is the O"-orthant {YERnIO"jYj>O,j= 1, ...,n} and sgnO"=O"l "'O"n' F(z) is calIed a defining function of the corresponding hyper- function. For one variable, any complex neigh- borhood U=:>Q is Stein, and the above isomor- phism reads (()(U,,"-Q)j(()(U)=£i6'(Q), from which the naturality of the sign in (15) follows. Thus the notion of support is also legitimate for hyperfunctions. The sheaf of hyperfunc- tions £i6' does not admit partitions of unity as for £0'. It is, however, flabby. Consequently, given a decomposition of a closed set into locally finite closed subsets E = U AEA E A and a hyperfunction f with support in E. we can always find hyperfunctions fA with support in E;, such that.f = LAEAfk For distributions this property holds only under some regularity assumption for the decomposition, There are several practical criteria to deter- mine whether or not a hyperfunction is zero in some open set Q. These are called the edge of the wedge theorem. A hyperfunction F(x + irO) with single expression is zero if and only if F(z) itself is zero. Fdx +ir l 0)= F 2 (x + ir 2 0) if and only if they stick together to a function in 0(Q + i(r 1 + r 2 )0) (Epstein type). (Note that r l + r 2 is equal to the convex hulI of r 1 U r 2 , e.g., r + ( - r) = R n (Bogolyubov type).) Lfl (x + ifjO) =0 if and only ifthere exist Gjk(Z)E(()(Q+i(fj+ rdO),j, k= I, ...,N, such that Gjd Z ) = Gkiz) and =Ll Gjk,j= I, ...,N (A. Martineau [19J). These are interpretations of cohomology in terms of coverings and have global variants concerning the envelope of holomorphy, The real analytic functions <pEd(Q) on Q are naturally included in £i6'(Q) via the ex- 
125 X Distributions and Hyperfunctions pression <p(x + irO) for any r or with r = Rn. They form a subsheaf. Let f be a hyperfunc- tion on Q. The complement in Q of the largest open subset Q' c Q, where f(x) is real analytic, is called the singular support of f(x) and is denoted by sing supp f If Q is bounded and singsuppfcK, then we can choose an ex- pression of the form (13), where Fiz) can be continued analytically to Q,,-K. IfsuppfcK, then these Fiz) satisfy LFj(x)=O on Q,,-K in the usual sense. X. Operations on Hyperfunctions The derivatives of a hyperfunction f(x) with the expression (13) are defined through the defining functions as DP f(x) == L(Df Fj)(x + ir;O). The product by a real analytic function Ij;(x) is defined by Ij;(x)f(x) == L(Ij;Fj)(X + ir;O). Combining these, we have the operation of a linear partial differential operator with real analytic coefficients P(x, D) on hyperfunctions. It is a tsheaf homomorphism. Let fE(Q) and DcQ be a compact set with piecewise smooth boundary. If singsuppfnDD= 0, then by means of the special expression (13) mentioned at the end of Section W the definite integral is defined as f f(x)dx= ,f, f Fj(z)dz, D j=t DJ where Dj is a path deformed from D in such a way that DDj= aD and Fj(z) is holomorphic on Dj' The result is independent of the choice of deformations or of the boundary value ex- pression employed, If supp.f<G D, the result is also indepen- dent of D; hence it can be written as SR"f(x)dx. If f(x, t) is a hyperfunction of the two groups of variables (x, t)EQ x V such that sing supp fn aD x v= 0, then the integral SDf(x, t)dXE .%',(V) is defined by the same method. It com- mutes with differentiation 0r integration with respect to t. If f(x) == L Fj(x + ir;O), g(x) == L Gk(x + ikO), then we can define the product by f(x)g(x) = L(FjGk)(X+ir;nkO) under the assumption that r; n k # 0 for every pair j, k. Especially, the product f(x)g(t) is always legitimate for two hyperfunctions depending on different groups of variables. (It can also be interpreted as the tensor product.) A real analytic coordinate transformation x=<1>(x):O-->Q extends naturally to a holo- morphic coordinate transformation z = <1>(2) on a complex neighborhood. A O-wedge Q + irO is transformed by <1>-1 to a twisted wedge containing, for each xeQ and (T, a 0- wedge Ox + i(D<1> -I )XO with some (real) neigh- borhood Ox of X == <1>-1 (x), where (D<1>-I)x is the 482 derivative or the Jacobian matrix of the map- ping <1> -I at x. Thus the pullback <1>* f(x) = r(<1>(X))E(O) of f(X)E86'(Q) by the transforma- tion x = <1>(x) can be defined by substituting z == <1>(2) into the defining functions. It is consis- tent with the definition of the coordinate trans- formation for real analytic functions. Also, the law of the change of variables for the definite integral is the same as that for real analytic functions. This can be extended to the general operation of substitution as mentioned in Sec- tion Q for distributions or even to much more general operations ( Section CC). The convolution (/*g)(x)= SR"f(x-t)g(t)dt is defined under the same assumption on sup- port as in the case of distributions. It can be defined either literally as the composition of the above operations, or directly by way of the defining functions: For example, if 9 has com- pact support <G D and L Gdx + ikO) is an expression as mentioned at the end of Section W, then by choosing a suitable deformation Dk of D, we have (/*g)(X)==I [f Fj(z-r)Gdr)dr l . . J,k Dk Z-X+I(rj+dO Y. Hyperfunctions and Analytic Functionals The totality 86'[KJ of hyperfunctions with sup- port in a compact set K becomes a nuclear Frechet space. It is the dual of the nuclear (DF)-space .>¥'(K) of real analytic functions de- fined on a neighborhood of K. Thus 86'[KJ is the space of analytic functionals with tporter in the real compact set K. The duality is given by the definite integral (f,<p)= SR"f(x)<p(x)dx for fE86'[KJ and <pEd(K). A sequence {lk(X)} in 86'[KJ converges if and only if it admits an expression (13) for a common fixed set of 0- wedges Q + ir;O,j = 1, ..., N, such that the defining functions F kj E 0(Q + ir;O) are also holomorphic on a fixed complex neighbor- hood U ofQ"-K and converges locally uni- formly in Q + ir;O U U, where Q is a (real) neighborhood of K. 86'[KJ is isomorphic to HHC", 0), and the above duality is a special case of the Martineau-Harvey duality H(C", 0)= 0(K)' for a Stein compact set K cC n . In the 1- dimensional case this is due to G, Kothe. If K=[al,bIJ x... x [an,b n ], then H(C",0) can be represented, including the topology, by the quotient space 0(U#K)rt 0(U#jK), where U is a Stein neighborhood of K and U # K, U #j K are defined in the same way as (17). If we choose U = U I X ... X Un, then by 
483 way of a defining function F(z)E(r;(U # K) for fE86'[KJ the inner product is given by the contour integral r f(x)<p(x)dx JR' =(-1)"1 ...1 F(z)<p(z)dzl...dz n , JYI JYn where Yjc  is a closed path surrounding [a j , bjJ once in the positive sense. Similar inte- gral formulas are known for some special K of various types. Starting from analytic functionals we can reconstruct the sheaf of hyperfunctions. For example, we can put 86'(0) = the totality of locally finite sums of analytic functionals with porter in Q modulo the rearrangement of supports by decomposition (Martineau [17]). If 0 is bounded, we can also put 86'(0) = 86'[OJ/ 86'[oOJ (Schapira [18J). The proof of local- izability and/or flabbiness is based on the decomposability of support (which is the dual of the exact sequence O-->d(K U L)-->d(K) EB d(L)-->d(K n L)-->O) and the denseness of 86'[ KJ c 86'[LJ (which is the dual of the unique continuation property O-->d(L)--><W"(K)) for a pair K c L with the same family of connected components. Note that in no way is the topol- ogy of 86'[ KJ localizable, or eq uivalently, 86'(0) does not admit a reasonable topology. Z. Embedding of Distributions As the dual of the natural mapping d(K)--> t&"(K) we have the topological embedding t&"'(K) c.d(K)' =86'[ K]. This embedding con- serves the support and hence gives rise to an embedding of sheaf f!iJ' c.86' (R. Harvey). For a distribution T with compact support a set of its defining functions as a hyperfunc- tion is given by Fa(z) = J:(W(z-x, ra)), 0" being the multisignature, where W(z, ra) = SS'-1 nr. W(z, w)dw and W(z, w) is the component of a Radon decomposition of b(x) ( Section CC). If supp Tc K = [a p blJ x ." x [an, bnJ, then as a hyperfunction it is represented by the following element of (i7(c" # K): ( I ) F(z)=T . x (2ni)n(x 1- z d... (x n - zn) It is in fact in (i7«P I )" # K) and vanishes at infinity, where pi = C 1 U {oo} is the Riemann sphere. These formulas are valid also for hyper- functions, and they give defining functions of some canonical types. Especially, the one given by (I9) is called the standard defining function and is characterized by the foregoing prop- erties. For the above-mentioned defining functions 125 AA Distributions and Hyperfunctions for a distribution T, we have the following convergence in the sense of distributions on Rn: Tx= LsgnO"lim Fa(x +isYa) for YaEra. a '10 More generally if the limit Llim Fj(x + isy) for YjEr j dO (20) exists in f!iJ'(Q), then the distribution defined as this limit admits {Fj(z)} as a set of defining functions when it is considered as a hyper- function. However, for an arbitrary set of defining functions for a distribution, (20) does not necessarily converge in f!iJ'(0). (This is be- cause we can add terms with any bad behavior which cancel each other formally.) If a distri- bution admist the boundary value expression with only one term, or if the dimension is one, then the convergence of (20) in f!iJ'(0) neces- sarily holds. The convergence of (20) in f!iJ'(0) is equiva- lent to the locally uniform estimate for the defining functions of the type Fj(z) = O(IYI- M ) for some M> O. These assertions can be gener- alized to ultradistributions. For f!iJ{Mp)(O) (resp. f!iJ(M p ) (0) the last growth condition for the defining functions reads as follows: F/z) = O(exp M*(L/lyl)) for some L (resp. every L > 0), where M*(p)=supplog(pPp!Mo/M p ); espe- cially for Mp=p!S we have M*(p)pl/(s-l). AA. Hyperfunctions on a Real Analytic Manifold (19) Sticking the hyperfunctions on coordinate patches by the transformation law mentioned in Section X, we can define the sheaf of hyper- functions on a real analytic manifold. More generally, for a real analytic vector bundle over a real analytic manifold, we can consider the sheaf of its hyperfunction local cross sec- tions, which is also flabby. Thus, especially on a real analytic manifold M, we can obtain a concrete flabby resolution of the constant sheafC M of length dimM by the sheaves of differential forms with hyperfunction coeffi- cients: O-->C M-->86') 86',U)-'!.... -->86'):JimM)-->o. With this sequence we can calculate the rela- tive cohomology groups of open pairs with coefficients in C by an analytic method. This is an extension of the de Rham theory for distributions [16]. If M is a compact manifold equipped with a nowhere vanishing real analytic density glob- ally defined on M, then we have the topolog- ical duality 86'(M) = d(M)'. The inner product is given by the definite integral with respect to the density. 
125 BB Distributions and Hyperfunctions The Fourier series is an example of hyper- functions on real analytic manifolds. The series L c p exp(2nipx) converges in ,'!6'(Tn) and defines a periodic hyperfunction if and only if c p is of infra-exponential growth, i.e., C p == O(e'lpl) for any e > O. Tn has the global complex neighbor- hood (pl)n, and f(x)E86'(T n ) has the corre- sponding boundary value expression F.(z)E(O( {lzlla1 < 1,..., IZnla, < 1}), which represents the terms in the Fourier series such that O"jPj > O. BB. Fourier Hyperfunctions In place of [J? we take as the basic space the space f!li* of exponentially decreasing real ana- lytic functions in the sense of M. Sato [15]: f(x) E f!li* if and only if there exist b > 0 and e> 0 such that for any b' < band e' < e, f(x) extends holomorphically to the neighborhood {Imzl <b'} of the real axis and has order O(e -"IRez l ) there. f!li* is endowed with the struc- ture of nuclear (DF)-space via the inductive limit for b > 0, e > O. The classical Fourier transform :F acts isomorphically on f!li*. (In fact, band e change their roles under :F.) The strong dual of f!li* is called the space of Fourier hyperfunctions and is denoted by !d. It is a nuclear Frechet space. It contains [J?' as a dense subspace in view of the continuous and dense inclusion f!li* c,/f. It also contains classical locally integrable functions of infra- exponential growth, i,e" of order eelxl for any e > O. Thus by the duality we obtain a wider extension of Fourier transformation on !d. In the following a O-wedge of the form Rn'+ irO will be called a O-wedge of the form Dn + irO at the same time if it is a tubular domain (i.e., with fixed imaginary part rO). Then an element f(X)E!d can be expressed in the form (13), where each Fj(z) is holomorphic in a 0- wedge D n + iIJO and is of infra-exponential growth there locally uniformly in Imz. The inner product of such f(x) with <p E f!li* is given by the definite integral ff(X)<P(X) dx = j JrnFYJ Fj(z)<p(z) dz, where the YjE IJO are fixed. Given a cone  we define its dual cone by o = {17 E Rnl (17, Y) O for all YE }. If Fj(z) are all of exponential decrease in Rez locally uniformly with respect to ImzErjO and Rez/IRezlrtO, then the de- finite integral G«()==(Jh)-nf e-ix(f(x)dx = t (Jh)-n r e-iZ(Fj(z)dz (21) )-1 J.mz=Y J 484 converges locally uniformly for' in some 0- wedge D n - iO, and defines there a holo- morphic function of infra-exponential growth. Thus we obtain a Fourier hyperfunction G(  - iO) that agrees with :Ff calculated by the duality. For a general f(X)E!d, the Fourier transform in the manner of Sato is calculated as follows: First we decompose f(x) into the sum Lfk(X) for which the defining functions of fk(X) decrease exponentially outside , Then we calculate G k «() by (21) and put :Ff= L Gd - ikO). An example of such decom- position is given by multiplication by Xa(x)= rr;=11/(1 +expO"jx j ), which decreases expo- nentially outside fa = {(JjXj O}, The relation between !d and tJB is more com- plicated than the relation [J?' c,'(Rn). The growth condition for !d is interpreted as a condition concerning germs at infinity. Thus !d can be considered to be a sheaf on the direc- tional compactification D n = Rn U S-I such that !dl R' = 86'. Just as the sheaf 86' is obtained from (D, the sheaf !d is obtained as the nth derived sheaf ffl)'(@) from the sheaf @ on Dn + iR n consisting of germs of holomorphic functions of infra-exponential growth with respect to Rez. We have H(Dn+iRn,@)=O for k of- n for any open set Q c D n . Especially, !d is flabby, and the decomposition of support is available to calculate the Fourier transform. The symbol !d employed at the beginning to express the global Fourier hyperfunctions corresponds to !d(Dn), and !d(Rn) = 86'(Rn) by definition. The canonical restriction mapping !d(Dn)-->&6'(Rn) is surjective but not injective. As for tempered distributions [J?', we can introduce various subclasses of Fourier hyper- functions, e.g., exponentially decreasing Fourier hyperfunctions Uo>o exp( - b y' X2 + 1 )!d, real analytic functions of infra-exponential growth @(D n ), etc. We can also consider operations such as convolution and multiplication be- tween adequate pairs, and apply differential operators with suitable coefficients. Concern- ing these we can avail ourselves of the same formulas as given in Section O. A hyperfunction with compact support is naturally considered as a Fourier hyperfunc- tion, and its Fourier transform agrees with the inner product <f(x), (Jh) -n exp ( - ix), which gives an entire function of exponential type. Paley-Wiener theorem. An entire function /(0 is the Fourier transform of a hyperfunc- tion with support in a compact convex set K in Rn if and only if it satisfies condition (i) of Section O. The theory of Fourier hyperfunctions de- scribed above is mainly due to Sato and T. Kawai [20]. They are not the largest class of generalized functions stable under the Fourier 
485 transformation, Following a suggestion of Sato, S. Nagamachi and N. Mugibayashi, Y. Saburi, and Y. Ito have extended them to the modified Fourier hyperfunctions in which the radial compactification D 2 n of C" is employed instead of the horizontal compactification Dn + iRn in the above theory. If we discard the localizing property, they can be extended further to the Fourier ultrahyperfunctions or ultradistributions of 1. Sebastiao e Silva (1958), M. Hasumi (1961), and M. Morimoto (1973). cc. Micro-Analyticity of Hyperfunctions The boundary value expression (13) for a hyperfunction f'(x) can be interpreted con- versely as the description of the state of ana- lytic continuation of f(x) to the complex do- main. Thus we say that f is micro-analytic at (xo, (0) if on a neighborhood of Xo it admits the analytic continuation into the half-space (Imz, o) <0 in the sense that it admits an ex- pression (13) satisfying rjn {(Imz, o) < O} # o for every j. The set of points (xo, o)E Q x sn-I, where f E 86'(Q) is not micro-analytic, is called the singularity spectrum or singular spectrum of f and is denoted by S.S.j We have by definition S.S.F(x + irO)c Q x (rO n sn-I). Micro-analyticity can be characterized by the Fourier transformation as follows: f is micro- analytic at (xo, o) if and only if there exists a Fourier hyperfunction g(x) such that the Fourier transform g() decreases exponentially on a tconical neighborhood of o and that .r - 9 is real analytic on a neighborhood of Xo. A hyperfunction f(x) is real analytic in a neighborhood of Xo if and only if it is micro- analytic at (xo, 0 for any (E sn-I. With this notion we can clarify the oper- ations on hyperfunctions. (In the following sn-I is identified with (Rn\{o} )/R + and + denotes the sum in the latter.) We have S.S.(/g)c {(x), +(I-A)I/)I(x, )ES.S.f, (x, I/)E S.S.g, O.s; A.s; I} U S.S.fU S.S.g, S.S.f(<1>(x))c {(x, tD<1>)I(<1>(x), )ES.S.f}, and these operations are legitimate if and only if 0 does not appear in the direction compo- nent of the right-hand side. We also have S,S. f f(x, t)dx c {(x, ()I(x, t,, O)ES.S.j for at}, S.S.(/ * g)c {(x + y, )I(x, )ES.S.f, (y, )ES.S.g} under suitable conditions for support. The classical decomposition formula of Radon (or the plane wave decompostion of 6) (n-1)! r dw 6(x)= (_2ni)" Js'-t(xw+io)n 125 DD Distributions and Hyperfunctions can be interpreted as the decomposition of 6 into hyperfunctions with S.S. in the single direction w. By the convolution, this formula supplies similar decomposition for a general hyperfunction (called the singular spectrum decomposition). (22) can be generalized to (n-I)! i det(grad", 'I'(x,w)) 6(x)= dw, (-2ni)n 8'-1 (<1>(x, w)+ iO)" (23 where the twisted phase <1>(x, w) satisfies (i) <1>(x, w) is a real analytic function of positive type (i.e., Re<1>(x,w)=O implies Im<1>(x,w)O) and (ii) <1>(x, w) is homogeneous of order 1 in w and <1>(0, w) = 0, grad x <1>(0, w) = w; and the vector 'I' (x, w) is such that ('I' (x, w), x) = <1>(x, w). If <1>(x, w) further satisfies <1>(x, w) # 0 for x#O, then the component becomes a hyperfunction (even a distribution) of x whose S.S. is precisely one point (0, w), and this fact is useful in theoretical applications [7,2IJ ( 274 Microlocal Analysis). DD. Structure Theorems of Hyperfunctions (22) A hyperfunction whose support is concen- trated at the origin is expressed as the infi- nite derivative J(D)6(x)=La p D P 6(x) of the Dirac measure, with the coefficients satisfying lim(laplp!)I/lpl == O. Such an operator J(D) is called a local operator with constant coeffi- cients and acts on 86' as a sheaf homomor- phism. Its total symbol J(O is an entire func- tion of infra-exponential growth or of type [1,0]. By way of such an operator, a general (Fourier) hyperfunction can be written in the form J(D)g(x) with a continuous function on Rn (of infra-exponential growth). 1fOEsuPPf c {(v, x) O}, then SSf3(0, :f::v) (Holmgren type theorem of Kashiwara and Kawai), and furthermore, the direction compo- nent ofS.S.fat 0 has the form of {:f::v}Up-I(E) with some E c sn-2, where p: sn-I "-{:f::v}--> sn-2 is the projection to the equator (the watermelon-slicing theorem of Morimoto, Kashiwara, and K. Kataoka). E is called the reduced S.S. of fat O. These theorems have many applications in the theory of linear par- tial differential equations and also in physics. A hyperfunction f(x) with support in the hyperplane X n = 0 has several further remark- able properties in x.. It admits a formal ex- pansion of the form Lofdx')6(k)(xn)' where x' = (x" ..., xn-d and fk(X') = SaJXJ.r(x)x/k! dx n . The sum converges in the sense of the topol- ogy if f has compact support. It reduces lo- cally to a finite sum if f is a distribution, and the kth term represents the k-ple layer distri- bution of mass, electric charge, etc. For a general f the coefficients { fdx')} do not neces- 
125 EE Distributions and Hyperfunctions sarily determine f, though they are determined by f EE. Complex Powers of Polynomials Among examples of special generalized fUnc- tions the most important are those of the form n, where f+ ==max{l(x),O} (or more generalIy it can be replaced by zero on some connected components of {f(x»O}), and ).EC denotes a holomorphic parameter. (The discussion is the same for f- =max{ - f(x),O}.) The simplest example, xi, is defined as the analytic con- tinuation of the locally integrable function x for Re}, > -I by repeated use of the formula xi =(A+ 1)-1 Dxxi+l, and becomes meromor- phic in), with simple poles at A= -1, -2,.... As a hyperfunction we ha ve x = {( - x + iO)A- (- x - iO)A }/2i sin nA. At a negative integer .Ie = -n, xi has residue (-lr l .5(n-I)(x)/(n-1)! and fmite part [ -(2ni) -I Z -nl og ( -z)]. The latter is often denoted by xn. In general, for a germ of a real-valued real analytic func- tion ((x) we can fmd a differential operator P(A, x, Dx) with polynomial coefficients in A and a monic polynomial b(A) of minimum de- gree such that P(A, x, Dx)f;+1 =b(A)(; (Sato, I. N. Bernshtein, Kashiwara, J.-E. Bjork [22J). This formula gives the analytic con- tinuation off; just as for xi. The polyno- mial b(},) is called the b-function or the Sato- Bernshtein polynomial and contains valuable information regarding the singularity of f It has only negative rational roots (Kashiwara). We haverj' = f;+I; hence f+- 1 - f-- I (suit- ably interpreted as above) gives a solution of the division problem u. f = 1. Thus if f is a polynomial, its inverse Fourier transform gives a tempered fundamental solution of(( -iD). Furthermore, when f is the relative invariant of a tprehomogeneous vector space, we can calculate b(A) explicitly by way of the holo- nomy diagram. Also, the Fourier transform of f: can be calculated explicitly by way of the real holonomy diagram as a linear combina- tion of the corresponding objects for the dual prehomogeneous vector space with coefficients similar to the tMaslov index. The simplest example is g;-x =()-I(+i)e-niA/2r(A+ l)(niO)-H =()-I(+i)r(A+ 1){e+niA/2C+I _eIniA/2:::A-I}. Among practical examples are the following classical formulas: Let P(x) be a nondegener- ate real quadratic form and Q() its tdual form, and let q denote the number of negative 486 eigenvalues of P(x). Then g;-p; == _()-n22Hnnn/H r(A+ 1)r(A+n/2) x IdetPI-I/2{sin7r(A+q/2)QA-n/2 - sin n((n -q)/2)Q :::A-n/2}. Here the arguments in the r-factor (..1.+ 1)().+ n/2) give the b-function of P(x). If q=n-I, we further have, letting P;i: = P; 1(:f:: (x, v») for an eigenvector v corresponding to the unique positive eigenvalue, .'#' P; I == (2n) -n/22 2Hn - 1 nn/2-1 r(A + I) x r(';' + n/2)ldet PI- I / 2 {e+ni(Hn/2)Qi-n/2 + e,"ni(Hn/2)Q;-n/2 + Q:::A-n/2}. From these formulas (taking the finite part if necessary) we obtain the fundamental solution of the wave equation, the Laplacian, and their iterations. These are exactly the distributions introduced by Hadamard, M. Riesz, and others, as mentioned in Section A ( Appen- dix A, Table 15. V). References (24) [IJ L. Schwartz, Theorie des distributions, Hermann, revised edition, 1966. [2J L. Hormander, Linear partial differential operators, Springer, 1963. [3J G. de Rham, Varietes differentiables, Her- mann, 1955. [4J 1. M. Gel'fand and G. E. Shilov, Gen- eralized functions. I, Properties and oper- ations; II, Spaces of fundamental and gen- eralized functions; III, Theory of differential equations; IV (I. M. Gel'fand and N. Ya. Vil- enkin), Applications of harmonic analysis; V (I. M. Gel'fand, M. I. Graev, and N. Ya. Vilen- kin), Integral geometry and representation the- ory, Academic Press, 1964, 1968, 1967, 1964, 1966; VI (I. M. GeI'fand, M. I. Graev, and I. I. Pyatetskii-Shapiro), Representation theory and automorphic functions, Saunders, 1969. (Originals in Russian.) [5J C. Roumieu, Sur quelques extensions de la notion de distribution, Ann. Sci. Ecole Norm. Sup. Paris, 77 (1960),41-121. [6] M. Sato, Theory of hyperfunctions I, II, J. Fac. Sci. Univ. Tokyo, sec. I, 8 (1959),139- 193,387-437. [7J M. Sato, T. Kawai, and M. Kashiwara, Microfunctions and pseudo-differential equa- tions, in [16, pp. 265-529]. [8J L. Schwartz, Espaces de fonctions differ- entiables a valeurs vectorielles, J. Analyse Math"4 (1954-1955), 88-148. [9J A. Grothendieck, Produits tensoriels topo- logiques et espaces nucleaires, Mem. Amer. Math. Soc., 16 (1955). 
487 [lOJ L. Schwartz, Theorie des distributions 11 valeurs vectorielles, Ann. Inst. Fourier, 7 (1957),1-141; 8 (1958),1-209. [IIJ F. Treves, Topological vector spaces, distributions and kernels, Academic Press, 1967. [12J L. Ehrenpreis, Fourier analysis in several complex variables, Wiley-Interscience, 1970. [13J G. Bjorck, Linear partial differential operators and generalized distributions, Ark. Mat., 6 (1966), 351-407. [14J H. Komatsu, Ultradistributions. I, Struc- ture theorems and a characterization; II, The kernel theorem and ultradistributions with support in a submanifold; III, Vector-valued ultradistributions and the theory of kernels, 1. Fac. Sci. Univ. Tokyo, sec. lA, 20 (1973),25- 105; 24 (1977),607-628; 29 (1982), 653-718. [15J M, Sato, Theory of hyperfunctions (in Japanese), Sugaku, 10 (1958),1-27- [16J H. Komatsu (ed.), Hyperfunctions and pseudo-differential equations, Lecture notes in math. 287, Springer, 1973. [17J A. Martineau, Les hyperfonctions de M. Sato, Sem. Bourbaki, 13 (1960-1961), no. 214. [18J P. Schapira, Theorie des hyperfonctions, Lecture notes in math. 126, Springer, 1970. [19J A. Martineau, Le "edge of the wedge theorem" en theorie des hyperfonctions de Sato, Proc. Intern. Conf. Functional Analysis and Related topics, Tokyo, 1969, Univ. Tokyo Press, 1970, 95-106. [20J T. Kawai, On the theory of Fourier hyperfunctions and its applications to partial differential equations with constant coeffi- cients, J. Fac. Sci. Univ. Tokyo, sec. lA, 17 (1970),467-517. [21J K. Kataoka, On the theory of Radon transformations of hyperfunctions, J. Fac. Sci. Univ. Tokyo, sec. lA, 28 (1981), 331-413. [22J J.-E. Bjork, Rings of differential opera- tors, North-Holland, 1979. 126 (IX.22) Dynamical Systems A. History The theory of dynamical systems began with the investigation of the motion of planets in ancient astronomy. Qualitative investigation of mechanics in antiquity and the Middle Ages culminated in the work of J. Kepler and G. Galilei in 17th century. At the end of that century, I. Newton founded his celebrated Newtonian mechanics, by means of which 126 A Dynamical Systems Kepler's law on the motion of planets and Galileo's observations of movement can be explained theoretically. Following this, L. Euler, 1. L. Lagrange, P. S. Laplace, W. R. Hamilton, C. G. J. Jacobi, and others devel- oped the theory using analytical methods, and founded analytical dynamics. From the end of the 18th century through the 19th century, the tthree-body problem attracted the attention of many mathematicians. At the end of the 19th century, H. Bruns and H. Poincare found that general solutions of the three-body problem could not be obtained by tquadrature, and this gave rise to a crisis of analytical dynamics. But this was resolved by Poincare himself. He pointed out the importance of the qualitative theory based on topological methods, and obtained many fundamental results. A. M. Lyapunov with his theory of stability and G. D. Birkhoffwith his many important con- cepts of topological dynamics established foundations of the new qualitative theory, In 1937 A. A. Andronov and L. S. Pontryagin introduced the concept of structural stability, which attracted the attention of S. Lefschetz. Lefschetz's school investigated structural sta- bility and tnonlinear oscillations, and obtained many important results (H, F. de Baggis, L. Markus, M. M. Peixoto, and others). In about 1960, S. Smale initiated study of differentiable dynamical systems under the influence of Lef- schetz's school. Smale and his school founded a new theory of differentiable dynamical systems using tdifferential topology. D. V. Anosov generalized the work of E. Hopf and G. A, Hedlund on tgeodesic flows of closed surfaces of tconstant negative curvature and established the concept of Anosov systems, which played an important role in Smale's theory. The work of Hopf, Hedlund, and Anosov is closely related to tergodic theory, Ya. G. Sinai and R. Bowen obtained impor- tant results in ergodic theory. The concept of structural stability and its generalization are essential in the tcatastrophe theory of R. Thorn ( 51 Catastrophe Theory); the theory of bi- furcation of dynamical systems is another essential part of catastrophe theory. D. Ruelle and F. Takens proposed a new mathematical mechanism for the generation of turbulence using Smale's theory and Hopf bifurcation. The new theory of dynamical systems devel- oped by Smale and others is now applied to the mathematical explanation of chaotic phe- nomena in many branches of science. Finally, we mention that in the 1960s A. N. Kolmo- gorov, V. I. Arnold, and 1. Moser obtained remarkable results on the existence of quasi- periodic solutions for the n-body problem, which turned out to solve the long-standing problem of the stability of the solar system. 
126B Dynamical Systems B. Definitions of Dynamical Systems In the study of the evolution of physical, bio- logical, and other systems, we construct math- ematical models of the systems. Usually, the state of a given system is completely described by a collection of continuous parameters, which may be related in some cases. Thus the space X of all possible states of the system can be regarded as a Euclidean space or a subset of a Euclidean space defined by some equa- tions. In general, we assume that the space X of all possible states of the system forms a ttopological space, and we call it a state space or a phase space. Second, we assume that the law of evolution of states in time is given, by which we can tell the state XI at any time r l if we know the state Xo at time to. Assigning Xl to Xo, we have a mapping n(t l , to): X -->X for any times to and t I' which satisfies the follow- ing conditions: (i) n(t 2 , t I) 0 n(t I ,to) = n(t 2,tO); (ii) n(to,to) = Ix, the identity mapping of X. Finally, we assume that the mapping n(t I, to) depends only on t=t l -to. Writing n,= n(tl,t o ) ift=tl-t o , we have the following conditions from (i) and (ii) above: (i') n,o n, = n s +" s, tER; (ii') no= Ix. In general, the theory of topological dynam- ics can be regarded as the study of topological transformation groups ( 431 Transformation Groups) originating in the topological investi- gations of problems arising from classical mechanics. Here, we restrict our attention to some important special cases. (1) Let X be a topological space and R the additive topological group of real numbers. Let <p: X x R --> X be a continuous mapping. For each tER, we define a mapping <p,:X-->X by <p,(x)== <p(x, t), XE X. If the family of map- pings {<P'}'ER satisfies the following conditions, we say that (X, <p) is a (continuous) R-action, a (continuous) flow, or a (continuous) dynamical system on X, and that X is the phase space: (i) <Ps 0 <p, == <Ps+' for all s, t E R; (ii) <Po = 1 x. Let (X, <p) be a flow. Then <p,: X -->X is a thomeomorphism with (<p,)-l == <p_, for each tER. For each XEX, define a mapping <px:R--> X by <pX(t) = <p(x, t), tER. The mapping <px is called a motion through x, and its image C(x) = {<pX(t) I t E R} is called the orbit or the trajec- tory through x. An orbit is nonempty, and two orbits are either identical or mutually disjoint. The family of orbits fills up the phase space X and is called the phase portrait. Let R + be the additive tsemigroup of all nonnegative real numbers. If we replace R by R+ in the definition of a (continuous) flow, we obtain a definition of a (continuous) semiflow. For a semi flow (X, <p), the mapping <Pt:X -->X, tER+ is in general not a homeomorphism but a continuous mapping. 488 Let (X, <p) and (Y, 1jJ) be flows. A homeo- morphism h: X --> Y is called a topological equivalence from (X, <p) to (Y, 1jJ) if it maps each orbit of <p onto an orbit of IjJ preserving orientations of orbits (i.e., there exists an in- creasing homeomorphism exx:R-->R for each XE X such that h<p(x, t) == ljJ(h(x), exAt)) for all t E R). Two flows are topologically equivalent if there exists a topological equivalence from one to the other. If two flows are topologically equivalent, their phase portraits have the same topological structure. Two flows (X, <p) and (Y, 1jJ) are flow equivalent if there exist a c> 0 and a homeomorphism h: X --> Y such that h<p(x,t)=IjJ(h(x),ct) for all tER. Such an h is a topological equivalence from (X, <p) to (Y, 1jJ). (2) Let X be a topological space and Z the additive group of integers. If we replace R by Z in the definition of a continuous flow, we obtain a definition of a (continuous) Z-action, a discrete flow, or a discrete dynamical system on X. If (X, <p) is a discrete flow, then f = <PI: X--> X is a homeomorphism and <Pn==.f" for all nEZ. Conversely, for a given homeomorphism f: X --> X, define a mapping <p : X X Z--> X by <p(x,n)= .f"(x), XEX and nEZ. Then (X,<p) is a discrete flow such that <Pn = .f" for n E Z. So we identify a homeomorphism with a discrete flow. Thus the orbit of a homeomorphism f:X -->X through X is C(x)= (fn(x) I nEZ}. Let Z + be the additive semigroup of all nonnegative integers. If we replace R by Z+ in the definition of a continuous flow, we obtain a definition of a discrete semi flow on X. For a discrete semi flow (X, <p), the mapping <Pn: X--> X, nEZ+ is in general not a homeomorphism but a continuous mapping. We can identify a continuous mapping f: X --> X with a discrete semiflow (X,<p) in a natural way as above. Let f:X -->X and g: Y --> Y be two homeo- morphisms (continuous mappings). A homeo- morphism h:X --> Y such that ho f=goh is called a topological conjugacy from f to g. And f and 9 are called topologically conjugate if there exists a topological conjugacy from f to g. Topologically conjugate homeomor- phisms have the same phase portrait in a topo- logical sense. (3) Let M be a tdifferentiable manifold of class C' (1.s;r.s; 00 or r==w). A continuous flow (M, <p) is a flow of class C', a C'-flow, a differ- entiable dynamical system of class C', or a one-parameter group of transformations of class C', if <p: M x R --> M is of class C'. A semi- flow of class C' is defined similarly. Let (M,<p) and (N,IjJ) be C'-flows. A topo- logical equivalence h: M -->N from (M, <p) to (N, 1jJ) is called a C'-equivalence if it is a tC'- diffeomorphism. Two flows are C'-equivalent if there is a C'-equivalence from one to the other. Classification ofC'-flows by C'-equivalence is 
489 dificult and sometimes too unwieldy to work with. On the other hand, there are many prob- lems which can be solved by the knowledge of the topological structure of the phase por- trait of C' -flows. (4) Let (M, n) be a discrete flow on a dif- ferentiable manifold M of class C'. If n: M x Z-->M is of class C', then (M, n) is called a Z- action of class C', a discrete flow of class C', a discrete C'-flow, or a discrete dynamical system of class C'. If (M, n) is a discrete C'-flow, then f = n I: M --> M is a C' -diffeomorphism, Con- versely, a C'-diffeomorphism f: M -->M defines a discrete C'-flow in a natural way on M, and we identify a C'-diffeomorphism f:M-->M with the discrete C'-flow on M defined by f A discrete semiflow of class C' is defined simi- larly, and a tC'-mappingf:M-->M is identi- fied with a discrete semiflow of class C' on M defined by f Let f: M -->M and g: N -->N be C'- diffeomorphisms (C'-mappings) of differ- entiable manifolds M and N of class C'. A topological conjugacy from f to 9 is called a C'-conjugacy if it is a C'-diffeomorphism. f and 9 are C' -conjugate if there is a C' -conjugacy from f to g. C. Examples and Remarks (1) Let D be an open set ofRn andf:D-->Rn a continuous mapping, Consider the tautono- mous system of ordinary differential equations dx/dt== f(x), XED. We assume that for each XED there exists a unique tnonextendable solution <p(x, t) with the initial condition <p(x, 0) == X defined on a maximal interval (ax, b x ), -00 .s;a x <0< bx.s; 00 ( 316 Ordinary Differential Equations (Ini- tial Value Problems)). The set {<p(x, t) I ax < t < b x } is called the trajectory through x. By the uniqueness assumption, we have <p(<p(x, t), s)= <p(x, t + s) whenever both sides of the equality are defined. When (ax,bx)=R for all xED, then equation (1) is called complete. If(I) is complete, the mapping <p:D x R-->D defined by the solutions <p(x, t) determines a continu- ous flow (D, <p). Furthermore, if f is of class C', then (D, <p) is of class C'. If (1) is not com- plete, then there exists a continuous positive scalar function ex:D-->R such that dx/dt = ex (x)f(x), xED, is complete. The trajectories of (1) and (2) through X coincide for all xED, and thus the phase portraits of (1) and (2) are the same. (2) Let M be a differentiable manifold of class Coo. A tvector field of class C' on M (1 .s; r.s; 00) gives rise to an autonomous system 126C Dynamical Systems (1) of ordinary differential equations in a coordi- nate neighborhood of each point of M, and it generates a tlocal one-parameter group of local transformations of class C'. If M is tcom- pact, this local one-parameter group of local transformations is uniquely extended to a one- parameter group of transformations of class C' ( 105 Differentiable Manifolds). Thus a vector field of class C' on M generates a unique C'-flow on M if M is compact. There- fore, sometimes we identify a vector field of class C' with the C'-flow generated by it. (3) Let (M, <p) be a C'-flow. Then <PI: M--> M is a C'-diffeomorphism, which we call the time-one mapping (time-one map) of (M, <p). Thus every C'-flow (M, <p) induces a C'- diffeomorphism as a time-one mapping. But the set of CI-diffeomorphisms that are time- one mappings of CI-flows is of the tfirst cate- gory in the space of all CI-diffeomorphisms with C I topology (J. Palis). Thus most C I - diffeomorphisms are not expressed as time-one mappings of CI-flows. (4) Let M be a compact differentiable mani- fold of class Coo with dimension m and (M, <p) a C'-flow (l.s;r.s; 00). An (m-1)-dimensional closed submanifold I: of M is called a cross section of (M, <p) if the following conditions are satisfied: (i) For any XEX, there exist t l >0 and t 2 <0 such that <P'I(X), <p'2(X)EI:; (ii) Fvery orbit intersects I: ttransversally whenever it meets I:. Let I: be a cross-section of (M, <p) and X E I:. Let t I be the least positive number with <P'1 (x) EI:. Such a t] exists for every xEI:, and f:I:-->I: defined by f(x) = <P'I(X), xEI:is a C'- diffeomorphism. We call this diffeomorphism f the first-return mapping (first-return map) or the Poincare mapping (Poincare map) for I:. (5) Let N be a compact differentiable manifold of class Coo and f: N --> N a C'- diffeomorphism (l.s;r.s; 00). Define an equiva- lence relation  on N x R generated by (x, t + 1)  (/(x), t) for x E X, t E R. Then the quo- tient space N (/) == N x R/  has a natural tdifferentiable structure of class C', and a C'- flow (N(/),IjJ) is defined by 1jJ([x,tJ,s)=[x,t+ sJ for XE N, t, SE R, where [x, tJ E N(/) is the equivalence class of (x, t). The flow (N (f), 1jJ) thus obtained is called the suspension of f The suspension (N(f),IjJ) has a cross section I:= {[x, 0] I xEN}, and the Poincare mapping for I: is C'-conjugate to f Conversely, if (M, <p) has a cross section I: and the Poincare map- ping for I: is f:I: -->I:, then the suspension (I:(/), 1jJ) is C'-equivalent to (M, <p). (6) Let U be an open set ofRn and f: U -->Rn a continuous mapping. Consider the tdiffer- ence equation (2) x m +1 = f(x m ), XmE U. (3) For each XE U, let <p(x,m) be the solution of (3) 
126 D Dynamical Systems with <p(x, 0) = x. If f(U) c U, then <p(x, m) exists for a1\ mE Z + and x E U, and <p defines a dis- crete semiflow on U. Iff: U --> U is a homeo- morphism, then <p(x, m) exists for a1\ mEZ and XE U, and <p defines a discrete flow on U ( 104 Difference Equations). D. Basic Concepts For simplicity we assume that the phase spaces of dynamical systems are metric spaces, and we denote their me tries by d. (1) Let (X, <p) be a continuous flow on a metric space X and XEX. A subset A of X is called invariant if <p,(A) c A for a1\ tE R. A subset of X is invariant if and only if it is a union of orbits. A subset A of X is positively (resp. negatively) invariant if <p,(A) c A for a1\ tO(resp. t.s;O). The subset C+(x)={<pX(t)lt O} (resp. C(x)= {<pX(t)1 t.s;O}) is called the positive (resp. negative) semi orbit or the posi- tive (resp. negative) half-trajectory starting from-x. A subset is positively (resp. negatively) invariant if and only if it is a union of positive (resp. negative) semiorbits. A subset is invari- ant if and only if it is both positively and nega- tively invariant. The union and the intersection of invariant sets are invariant. If A is invariant, then its closure A, its interior Int A, its boundary oA, and its complement AC=X -A are invariant. If A is invariant, then <p(A x R) c A and the restriction mapping <p I A x R defines a continu- ous flow (A, <p I A x R) on A. The flow thus obtained is called the restriction of (X, <p) on A. (2) A point x E X is a singular point, an equi- librium point, a critical point, a rest point, or a fixed point if C(x) == {x} ( 290 Nonlinear Osci1\ation). A point is regular or nonsingular if it is not a singular point. The set of all singular points is a closed invariant set, and the set of a1\ nonsingular points is an open invariant set. If A is a positively invariant set which is homeomorphic to the closed unit ba1\ in R n , then there exists a singular point in A (tBrou- wer fixed-point theorem). A point x E X is periodic if there exists a T #0 such that <p(x, t + T) == <p(x, t) holds for all t E R. If x is periodic, the motion <px and the orbit C(x) are said to be periodic. A point x is periodic if and only if there exists a T # 0 with <p(x, T) = x. A singular point is periodic. For a periodic point x, a number T satisfying (4) is ca1\ed a period of x. If x is nonsingular and periodic, then there exists a smallest positive period To of x, and any period is an integral multiple of To. An orbit of a nonsingular periodic point is called a closed 490 orbit. If C is a closed orbit, then a1\ points of C are nonsingular periodic points, and their smallest positive periods coincide. A closed orbit is compact. (3) Let x be a point of X. A point YEX is ca1\ed an w-limit (resp. ex-limit) point or a posi- tive (resp. negative) limit point of x if there exists a sequence {t n } of real numbers such that (i) tn-->oo (resp.t n --> -00) as n-->oo and (ii) <p(x,tn)-->y as n-->oo. The set of all w-limit (resp. a-limit) points of x is denoted by w(x) (resp. ex(x») and is called the w-limit (resp. ex- limit) set of x. For each XE X, w(x) and ex(x) are closed invariant sets, and the following equalities hold: C+(x) =C+(x)Uw(x), C(x) = C (x) U ex(x), and C(x) == C(x) U ex(x) U w(x). If x is a periodic point, C(x) = C(x) = ex(x) = w(x). If C is a closed orbit, then C = C = C(x) = :x (x) == w(x) for all xEC.1f A is a compact in- variant set and XE A, then ex(x) and w(x) are nonempty. Assume that X is tJoca1\y compact and XE X. Then w(x) is connected if it is compact, and none of the connected components of w(x) is compact if w(x) is not compact. (4) Let x be a point of X. Let J +(x) (resp. J_(x») be the set of all points Y satisfying the following condition: There exist a sequence {r.} of numbers and a sequence {x n } of points in X such that (i) tn-->oo (resp. t n --> -00) as n--> 00, (ii) xn-->x as n-->oo, and (iii) <p(xn,tn)-->y as n-->oo. The set J+(x) (resp. L(x» is a closed invariant set containing w(x) (resp. ex(x)), called the first positive (resp. negative) prolongational limit set of x. If X is loca1\y compact, then J +(x) is connected if it is compact, and none of the connected components of J +(x) is compact when J+(x) is not compact. Notions of higher prolongations have been defined and investigated by T. Ura, 1. Aus- lander, and P. Seibert. (5) For a discrete flow, we can similarly define basic notions such as an invariant set, fixed point, periodic point, and so on defined in Sections D(I)-D(4) by replacing R by Z. The propositions and theorems stated above hold for discrete flows, except those concern- ing connectedness. (4) E. Recursive Concepts and Dispersive Concepts (1) Let (X, <p) be a flow on a metric space X. Let x E X be a point such that there exist a neighborhood U of x and a T>O satisfying the condition un <p,( U) = 0 for t  T. Then x is caIled wandering, The set of all wandering points is an open invariant set. A point is nonwandering if it is not wandering. The set Q of all nonwandering points is a closed invar- iant set and is called the nonwandering set. 
491 The following conditions are equivalent: (i) x is nonwandering, (ii) xEJ+(X), (iii) xEJ_(x). The nonwandering set Q contains all singular points, closed orbits, and w(x) and ex(x) for all XEX. (2) A point x E X is positively (resp. nega- tively) Poisson stable if XEW(X) (resp. xEex(X)). It is Poisson stable if it is both positively and negatively Poisson stable. A positively Poisson stable point and a negatively Poisson stable point are nonwandering. The following con- ditions are equivalent: (i) x is positively Pois- son stable, (ii) C+(x) == w(x), (iii) C(x)c w(x), (iv) for any neighborhood U of x and T> 0, there exists at> T with <p(x, t)E U. If X is tcomplete and XEX is positively Poisson stable but not periodic, then w(x)- C(x) is tdense in w(x). This implies that if X is complete, then C(x) is periodic if and only if C(x) = w(x), (3) If all the points of the phase space X are wandering, then (X, <p) is called completely unstable. If all the points of X are nonwander- ing, then (X, <p) is called regionally recurrent. If A is an invariant set such that the restriction of (X, <p) on A is regionally recurrent, then (X, <p) is said to be regionally recurrent on A. If (X, <p) is regionally recurrent and X is locally compact, then the set of all Poisson stable points is dense in X. For a given flow (X, <p), we obtain a se- quence of invariant sets {Qn} and a sequence of restriction flows {(Q., <Po)} such that (i) (Qo, <Po)=(X, <p); (ii) Qn+1 is the nonwandering set of (Qn, <Pn), n  0; and (iii) (Qn+l, <Pn+d is the restriction of (X, <p) on Qn+1' Then X == Qo =:> Q I =:>...=:>Qn=:>.... putQ",=nn Q .. Then Q",is an invariant set of (X, <p), and we denote the re- striction of (X, <p) on Q", by (Q"" <p"')' Starting from (Q"" <p"')' we obtain similarly a sequence of invariant sets {Q",+n} and a sequence of flows {(Q",+n,<P",+n)}' If we obtain an ordinal number y such that Qy = QY+1 # 0 by continu- ing this process, then we call Qy the set of central motions. In this case, the flow (Qy, <py) is regionally recurrent, and every invariant sub- set of X on which (X, <p) is regionally recurrent is contained in Qy. When X is tseparable and QI is compact and nonempty, then the mini- mum of such ordinal y is an tordinal of at most the second number class. (4) Let {jf be the set of all XEX satisfying the following condition: For each £, T> 0, there exist a sequence {xo = x, x" ..., X k = x} of points in X and a sequence {to, t" ... , t k - 1 } of numbers such that ti T and d(<p,;(x;), Xi+l) < £ for O.s; i.s; k - 1. The set {jf is a closed invar- iant set containing the nonwandering set Q and is called the chain recurrent set. If X = {jf, then (X, <p) is called chain recurrent. The re- striction ({jf, <p I {jf x R) of (X, <p) on {jf is chain recurrent (c. C. Conley). 126E Dynamical Systems (5) A nonempty closed invariant set is called a minimal set if none of its nonempty proper closed subsets is invariant. For a nonempty compact subset A of X, the following con- ditions are equivalent: (i) A is minimal, (ii) C(x) = A for all xEA, (iii) C+(x) =A for all xEA, (iv) C(x) =A for all XEA, (v) w(x)=A for all xEA, (vi) ex(x) = A for all XE A. A point XEX is positively (resp. negatively) Lagrange stable if C+(x) (resp. C(x) ) is compact. If C(x) is com- pact, then x is called Lagrange stable. Every nonempty compact invariant set contains a compact minimal set. If XEX is positively (resp. negatively) Lagrange stable, then w(x) (resp. ex(x)) contains a compact minimal set. A closed invariant set A is called a saddle set if there exists a neighborhood U of A such that every neighborhood of A contains a point x such that C+(x)<j1': U and C(x)<j1': U. Otherwise A is called a nonsaddle set. For a point x of X, let E(x) be the subset of X consisting of the points y satisfying the following condition: There exist a sequence {x n } of points in X and two sequences {t n }, {sn} of numbers such that (i) xn-->x, tn-->oo, Sn--> -00 as n-->oo and (ii) <p(x n , t n )--> y, <p(x n , sn)--> y as n--> 00. For a subset 8 of X, put E(8)= UXEBE(X). Let {Sa} be the family of all saddle minimal sets and {F p } the family of all nonsaddle minimal sets. If the phase space X is compact, then the nonwan- dering set Q ==( UP Fp)U (U,E(S,))==(U p F p ) U E(U,S,) (T. Saito). (6) A point x E X is said to be recurrent iffor any £ > 0 there exists a T> 0 such that the t£_ neighborhood U of <pX([t, t + TJ) contains C(x) for all tER. If x is recurrent, the motion <px and the orbit C(x) are said to be recurrent. Every orbit of a compact minimal set is recur- rent, and if the phase space is complete, then the closure of a recurrent orbit is a compact minimal set (Birkhoff). A set D of real numbers is called relatively dense if there exists aT> 0 such that D n (t, t+ T)# 0 for all tER. Assume that XEX is Lagrange stable, then x is recurrent if and only iffor every £ > 0 the set {t I d(x, <p(x, t)) < £} is relatively dense. (7) A flow (X, <p) is dispersive iffor any x, YEX there exist neighborhoods U x of x and U y of Y and a T> 0 such that U x n <p,( U y ) = 0 for all t  T. The following conditions are equiva- lent: (i) (X, <p) is dispersive; (ii) For any x, YEX, there exist neighborhoods U x of x and U y of y and a T> 0 such that U x n <p,( U y ) == 0 if I tl  T; (iii) J+(x)= 0 for all XEX. A flow (X, <p) is parallelizable if there exist a subset S of X and a homeomorphism h: X -->S x R such that (i) <p(S x R)=X and (ii) h 0 <p(x, t) ==(x, t) for all XE Sand t E R. A flow (X, <p) is parallelizable if and only if there is a subset S of X satisfying the following con- 
126 F Dynamical Systems ditions: (i) For each XEX there exists a unique r(x)ER with <p(x, r(x))ES; (ii) r:X -->R is con- tinuous. A paralle1izable flow is dispersive. A flow on a localIy compact separable metric space is parallelizable if and only if it is disper- sive (V. V. Nemytskii, V. V. Stepanov). An open set U of X is called a tube if there exist a T> 0 and a subset S of U satisfying the following conditions: (i) <p(S x (- T, T)) = U, (ii) For each XE U there is a unique r(x)ER with Ir(x)1 < T such that <p(x, r(x»)E S. The set S in the above definition is called a local section, If x E X is a regular point, then there exists a tube containing x (M. Bebutov, H. Whitney). The notion of a local section is a generalization of Poincare's "surface sans contact" [IJ or Birkhoff's "surface of section" [4J, and it is related to the notion of the cross section. (8) For discrete flows (homeomorphisms), basic notions, such as a nonwandering set, Poisson stability, regional recurrence, central motion, and minimal set, are defined similarly, and many of the propositions and theorems in Sections E(1)-E(5) hold similarly for discrete flows (homeomorphisms). F. Stability (J) Let (X, <p) be a continuous flow on a metric space X, A point x E X is called orbitally stable if for any I; > 0 there exists a 6> 0 such that C+(y) is contained in the I;-neighborhood of C+(x) for all y with d(x, y) < 6 ( 394 Sta- bility). A nonempty set A is calIed stable if every neighborhood of A contains a positively invariant neighborhood of A, If A is compact (in particular, if A is a periodic orbit), then orbital stability and stability fo A are equiva- lent. A nonempty set A is called asymptotically stable if A is stable and there exists a neighbor- hood V of A such that w(x) c A for any x E V. If A is stable and w(x) c A for all x E X, then A is called globally asymptotically stable, A point x E X is Lya punov stable if for any I; > 0 there exists a 15>0 such that d(<p,(x), <p,(y))<1; for all t  0 and y with d(x, y) < b. Lyapunov stability implies orbital stability, A point x is uniformly Lyapunov stable if for any 1;>0 there exists a 15 > 0 such that for ZE C(x) and y with d(y, z) < 6 we have d(<p,(z), <p,(y)) < I; for all t  O. Uniform Lyapunov stability implies Lyapunov stability. For a singular point, the notions of uniform Lyapunov stability, Lyapunov stability, orbital stability, and stability are equivalent. Assume that the phase space X is locally compact and A is a nonempty compact subset of X. Then A is asymptotically stable if and only if there exist a neighborhood N of A and a continuous real-valued function Lon N such that (i) L(x) =0 if XEA and L(x»O if xA; (ii) L(<p(x, t)) <L(x) if xA, t>O, and <p({x} x [O,tJ)cN. 492 Such a function L is called a Lyapunov func- tion for A ( 394 Stability). Assume that X is locally compact and A is nonempty, stable, and invariant. Then A is asymptotically stable if and only if there exists a neighborhood U of A such that any invariant set in U is neces- sarily contained in A (Ura). Let A be a non- empty closed invariant set. A is called an at- tractor if it has an open neighborhood U satis- fying the folIowing conditions: (i) U is posi- tively invariant; (ii) For each open neighbor- hood V of A, there is a T>O such that <p,(U)c V for all t  T. Condition (ii) implies that n,;"o<p,(U)=A and w(x)cA for all XE U. Thus an attract or is asymptotically stable. If A is an attractor, the basin of A is the union of all open neighborhoods of A satisfying (i) and (ii), (2) Assume that the phase space X is com- plete, A motion <px (x E X) is called almost periodic if for any I; > 0 there exists a rela- tively dense subset {rn} in R such that d(<pX(t), <px(t+rn))<c for all tER and r n ( 18 Almost Periodic Functions). If <px is almost periodic, then <pY is almost periodic for all YE C(x). If A is a compact minimal set and if <px is almost periodic for some x in A, then every motion <pY, YE A, is almost periodic. An almost periodic motion is recurrent. The converse is not true. But if x is recurrent and Lyapunov stable in C(x) (i,e., in the restriction flow (C(x), <p I C(x) x R)), then <px is almost periodic. If x is uni- formly Lyapunov stable in C(x) and negatively Lagrange stable, then <px is almost periodic (A. A. Markov). G. Singular Points and Closed Orbits In this section we assume that the phase space is a tparacompact differentiable manifold of class Coo with metric d. (J) Let E be a finite-dimensional real vector space and L: E --> E a linear automorphism. L is called hyperbolic if it has no teigenvalues of absolute value 1. If L:E-->E is hyperbolic, there are unique vector subspaces ES and E U satisfying the following conditions: (i) E = E' EB E U , (ii) L(E') == E' and L(E U ) = E U , (iii) if 11'11 is a tnorm on E, then there exist constants C > 0 and 0 < ..1. < 1 such that, for any positive integer m, IILm(v)ll.s;cAmllvll when VEE' and IlL -m(v)ll.s;dmllvll when vEEu. The zero 0 of E is the only fixed point of a hyperbolic linear automorphism. Put s = dim ES and u = dim EU. Then s+u=dimE, and s (resp, u) is the num- ber of eigenvalues of absolute value < 1 (resp. > 1) counted with multiplicity. A topological conjugacy class of a hyperbolic linear auto- morphism L:E-->E is determined by s, u, and the signs of det(L I E') and det(L I E U ), where det(L I E") is the tdeterminant of the restriction 
493 L I £G: £G --> £G of Lon E a (0"= S, u). Further investigations of topological classification of linear automorphisms have been carried out by N. H. Kuiper and J. W. Robbin. (2) Let f:Rn-->Rn be a C'-diffeomorphism (1.s;r.s; 00). Assume that the origin OER n is a fixed point of f Then the tdifferential dfo: R n -->Rn of fat 0 is a linear automorphism. It is given by dfo(x) = Jo(/)x, xERn, where J o (/) is the tJacobian matrix of fat 0 and x is ex- pressed as a column vector. If dfo is hyperbolic, then f is topologicaIly conjugate to dfo in a sufficiently small neighborhood of 0 (P. Hart- man). Assume that f is of class Coo, and let ).1' ..., An (possibly repeated) be the eigenvalues of dfo. If Ai #..1.';1... A;;'" for all 1.s; i.s; n and for all nonnegative integers ml, ...,m n with 2.s; m l + .,. + mn' then in a sufficiently small neighborhood of 0, f is COO -conjugate to dfo (S. Sternberg). (3) Let f: M --> M be a C' -diffeomorphism (1.s; r.s; 00) of a differentiable manifold M of class COO. Let p E M be a fixed point of f Then the tdifferential dfp of fat p is a linear auto- morphism of the ttangent space Tp(M) of Mat p. If dfp is hyperbolic, then p is called a hyper- bolic fixed point of f Let p be a hyperbolic fixed point of f and Tp(M) = E' EB E U be the direct sum decomposition with respect to dfp' Put WS(p)= {XE M If"(x)--> pas n-->oo} and WU(p)={xEMlrn(x)-->p as n-->oo}. WS(p) and WU(p) are invariant sets and images of tinjective immersions of class C' of vector spaces E' and E U , respectively. At the point p, WS(p) and WU(p) are tangent to ES and EU, respectively [21]. We call WS(p) (resp. WU(p)) the stable (resp. unstable) manifold of fat p. In a sufficiently smaIl neighborhood of a hyper- bolic fixed point p of f, f is topologicaIly conjugate to its differential dfp' Therefore a hyperbolic fixed point is an isolated fixed point (i.e., there are no fixed points in its sufficiently smaIl neighborhood except itself). A hyperbo- lic fixed point p is called a source if dim W'(p) =0 and a sink if dim WU(p)=O. Otherwise it is a saddle point. The number of topological conjugacy classes of hyperbolic fixed points on an n-dimensional manifold is 4n. Let pE M be a periodic point of f and m the smallest positive period of p. Replacing f by fm, we obtain notions of hyperbolicity, stable manifold, and so on for the periodic point p. We also obtain propositions and theorems similar to those stated above for periodic points. (4) Let A be a real n x n matrix. Consider the system of linear differential equations dx/dt=Ax, xERn. This equation generates a coo-flow (Rn, <p), and <p,:Rn-->R n is given by <p,(x)=e'A x , where e'A is 126G Dynamical Systems the texponential of the matrix tA. Thus <p, is a linear automorphism for all tE R. The origin OE R n is a singular point of (R n , <p). If none of the real parts of the eigenvalues of A are zero, we calI OERn a hyperbolic singular point of (5). If OE R n is a hyperbolic singular point of (5), then there exist two vector subspaces E' and E" of Rn satisfying the foIlowing conditions: (i) Rn=E' EB E U ; (ii) A(E')= E' and A(E")=E"; (iii) E' and E" are invariant sets of (Rn, <p); (iv) there exist positive constants c and ..1. such that for all tO, 11<p,(x)ll.s;ce-A'llxll when XEE' and 11<p-,(x)ll.s; ce- A ' Ilxll when XE E", where 11'11 is the norm on R n . The origin 0 is a hyperbolic singular point of (5) if and only if the time- one mapping <PI =eA:Rn-->R n of <p is a hyper- bolic linear automorphism. If 0 is a hyperbolic singular point of (5), the above direct sum decomposition Rn = E' EB E U coincides with the one with respect to <p I' Put s = dim E S and u = dim E U . Then s + u = n, and s (resp. u) is the number of all eigenvalues of A with negative (resp. positive) real parts counted with multi- plicity. Also we obtain the following proper- ties: (i) XE E' (resp. E U )= <p,(x)-->O as t--> 00 (resp. t--> -00), (ii) XrtE' (resp. E U )= II <p,(x) II -->00 as t-->oo (resp. t--> -00), (iii) the origin 0 is the only singular point. Let A and B be n x n real matrices. Let (Rn, <PA) and (Rn, <PH) be the flows determined by the equations dx/dt = Ax and dx/dt = Bx, respectively. Assume that 0 E Rn is a hyperbolic singular point for both of the equations. Let s and u (resp. s' and u') be the integers defined for <PA (resp. <PH) in the above paragraph. Then the following conditions are equivalent: (i) (R n , <PA) and (R n , <PH) are flow equivalent, (ii) (Rn, <PA) and (R n , <PH) are topologically equiva- lent, (iii) s=s', (iv) u=u', (v) (<PA)I and (<PH)I are topologicaIly conjugate. Further investigations of the phase portrait of the equation dx/dt = Ax without the assumption of hyperbolicity were done by Kuiper. (5) Let D be an open set of Rn and f: D --> Rn a continuous mapping. Consider the system of differential equations (1), which we write down again here: dx/dt= f(x), xED. (6) (5) A point p E D is a singular point of (6) if the trajectory through p consists of a single point p. If (6) generates a flow (D, <p), then p is a singular point of (6) if and only if p is a sin- gular point of (D, <p), A point p is a singular point of (6) if and only if f(p) = 0, Suppose further that f is a C'-mapping (1 .s; r.s; 00). A singular point p of (6) is called hyperbolic if none of the real parts of the eigen- values of the Jacobian matrix J p (/) of fat p is zero. Let p be a singular point of (6). We as- sume for simplicity that p = 0 and (6) generates 
126G Dynamical Systems a flow (D, <p). Denote the Jacobian matrix of f at 0 by A, and let (Rn, <PA) be the flow generated by the equation dx/dt == Ax. If 0 is a hyperbolic singular point, then in a sufficiently small neighborhood of 0, the flows (D, <p) and (Rn, <PA) are flow equivalent and hence topologically equivalent (Hartman, D. M. Grobman). Let }.I, ,.., An (possibly repeated) be the eigen- values of A. Iff is of class Coo and Ai #mlAI + ... + mn).n for alI 1.s; i.s; n and for alI nonnega- tive integers m l , ..., m n with 2 .s;m l +... + m n , then in a sufficiently smalI neighborhood of 0, two flows (D, <p) and (R n , <PA) are Coo -equivalent (Sternberg). (6) Let M be a paracompact differentiable manifold of class CCL and Va vector field of class C' (l.s;r.s; 00) on M. For simplicity we assume that V generates a C'-flow (M, <p). For each point p of M, take a coordinate neighbor- hood (U, ex) of class Coo around p, where U is an open neighborhood of pin M and ex: U --> R n is a homeomorphism onto an open set D in R n . Using (U, ()(), we can express the vector field V as a system of ordinary differential equations of the form (6) in D. The eigenvalues ..1.1, ... ,).n (possibly repeated) of the Jacobian matrix of f at ex(p) are independent of the choice of local coordinates (U, ex) around p. Thus }'l' ..., An are invariants of the C' -equivalence, but they are not invariants of the topological equivalence. A singular point p of a flow (M, <p) (or a vector field V) is called hyperbolic if ex(p) is a hyperbolic singular point of the corresponding equation (6) (i.e., none of the real parts of the above eigenvalues ..1.1' "', }'n are zero). A sin- gular point of (M, <p) is hyperbolic if and only if it is a hyperbolic fixed point of the time-one mapping <PI of <po A hyperbolic singular point p is an isolated singular point. Let p be a hyperbolic singular point of (M, <p) and Tp(M)=E' 8J EU the direct sum decomposition with respect to <Pl' Put s= dim E' and u = dim EU. Then s (resp. u) is the number of alI eigenvalues Ai of negative (resp. positive) real parts. Put W'(p) = {x E M I <p,(x) -->p as t-->oo} and WU(p)={xEMI<p_,(x)-->p as t--> oo}. W'(p) and WU(p) are invariant sets and images of injective immersions of class C' of vector spaces ES and E U , respectively. At the point p, WS(p) and WU(p) are tangent to E' and E", respectively. We call WS(p) (resp. WU(p)) the stable (resp. unstable) manifold off at p. The stable manifold and the unstable manifold of (M, <p) at a hyperbolic singular point p coincide with those of the time-one mapping <PI at p. A hyperbolic singular point pis calIed a source if dim WS(p) = 0 and a sink if dim WU(p) = O. Otherwise it is called a saddle point. If p is a source (resp. a sink), then WU(p) (resp. WS(p) is a neighborhood of p. A sin- gular point p is a source (resp. a sink) if and 494 only if all real parts of the corresponding eigen- values ..1.1, ..., An are positive (resp, negative). (7) Let V be a vector field of class C' on a Coo-differentiable manifold M of dimension n  2 and (M, <p) the C'-flow generated by V. Let C be a closed orbit of (M, <p), P a point of C, and T> 0 its smallest positive period. Then p is a fixed point of the C'-diffeomorphism <PT: M -->M. Let V(p)E Tp(M) be the value of V at p and (d<pT)p: Tp(M)--> Tp(M) the differen- tial of <PT at p. Then (d<PT)P(V(P)) = V(p), and V(p) #0. Therefore 1 is an eigenvalue of (d<PT)p' The other eigenvalues ..1.1, ..., An-I (possibly repeated) of (d<pT)P do not depend on a choice of p E C and are called the characteristic multi- pliers of e. If none of the characteristic multi- pliers of C is of absolute value 1, we calI C a hyperbolic closed orbit. If C is a hyperbolic closed orbit, then there exist vector subspaces E and E for each pE C satisfying the follow- ing conditions: (i) Tp(M) = L(V(p») 8J E 8J E, where L(V(p» is the I-dimensional subspace generated by V(p); (ii)(d<p,)p(E)=E;,(p) (0"= s, u) for alI tE R, where (d<p,)p: Tp(M)--> T",,(p) (M). In particular, (d<PT)p(E)=E (O"=s, u); (iii) dim E (resp. dim E) is independent of pE C and is equal to the number of Ai of absolute value <1 (resp. >1). Put WS(C)=={xEMI d(<p,(x), C)-->O as t-->oo} and WU(C)={xEMI d(<p_,(x), C)-->O as t--> oo}. Then WS(C) (resp. WU(C)) is an invariant set and is an injectively immersed C'-submanifold of M which is tan- gent to L(V(p» 8J E (resp. L(V(p)) 8J E) for each pEe. WS(C) (resp. WU(C)) is called the stable (resp. unstable) manifold for C. Let p be a point of a closed orbit e. An embedded (n -I)-dimensional disk D of class C' in M containing p is calIed a cross section for a closed orbit C if V is transverse to D (i.e" TAM)=L(V(x» 8J TAD) for each xED) and D n C == {p}. For a given cross section D for C, there exists a neighborhood U of p in D such that for any XE U there exists a r(x»O such that <p(x, r(x»E D and <p(x, t)rf'D for 0 < t < r(x). A mappingf: U-->D, defined by f(x) = <p(x, r(x», XE U, is calIed a Poincare mapping for D. The C'-conjugacy class of the tgerm of a Poincare mapping f: V --> D at p is indepen- dent of the choice of pE C and the cross section D for the closed orbit e. The point p is a fixed point of f, and the eigenvalues of dfp coincide with the characteristic multipliers of C includ- ing multiplicity. Therefore C is hyperbolic if and only if pE C is a hyperbolic fixed point of f In a sufficiently small neighborhood of C, the flow is C'-equivalent to the suspension of f The topological equivalence of the flow in a sufficiently small neighborhood of C is deter- mined by the topological conjugacy class off The number of topological equivalence classes of hyperbolic closed orbits which appear in a 
495 flow on an n-dimensional manifolds is 4(n-l) (M. C. Irwin). (8) Let V be a vector field of class C' (1.s; r.s; 00) on a compact Coo-manifold M. If M has a nonempty boundary, we assume that V points outward at all boundary points. Let p be an isolated singular point of V and denote by i(p) the tindex of the vector field Vat the singular point P. If all singular points are isolated, then the sum Lp i(p) is independent of V and is equal to the tEuler-Poincare charac- teristic X(M) of M (Poincare, H. Hopf) ( 153 Fixed-Point Theorems). A vector field (or a flow) is called nonsingular if it has no singular points. If X(M) # 0, then any vector field (and hence any flow) on M has a singular point. If X(M) = 0, then there exists a nonsingular vector field on M. If M is 2-dimensional and without boundary, then M admits a nonsingular vector field if and only if M is a ttorus or a t Klein bottle. There are many directions in which gen- eralizations of the Poincare-Hopf theorem can be made. For example, an index of an isolated closed orbit of a flow has been defined and investigated by F. B. Fuller. The phase portrait near a singular point or a closed orbit which is not hyperbolic is com- plicated. Many results in general situations have been obtained for planer flows, but there are only a few for higher-dimensional flows. H. Generic Properties and Structural Stability (1) Let M be a compact Coo-manifold without boundary. The set .':l"(M) of all vector fields of class C' (1 .s; r.s; 00) on M forms a real vector space in a natural way. We can give a norm II VII, (called a C'-norm) for V E.':l'(M) using its expressions in a given suitable system of local coordinates of class Coo on M and their par- tial derivatives up to order r. By virtue of this norm, .q['(M) is a Banach space with the topology of uniform C'-convergence ( 168 Function Spaces). A subset of a topological space is called a residual set or a Baire set if it is the intersection of a countable number of dense open sets. A residual set in f£'(M) is a dense set. A proposition P concerning a vector field of class C' is called generic if the set {V E 2['(M) I P(V)} contains a residual set for any M. The following properties are generic prop- erties: (i) All singular points and all closed orbits are hyperbolic [21,22]; (ii) any stable manifold (of a hyperbolic singular point or a hyperbolic closed orbit) meets transversally with any unstable manifold (of a hyperbolic singular point or a hyperbolic closed orbit) at any point of their intersection [21, 22J; (iii) the set of all periodic points (i.e" the union of all 126H Dynamical Systems singular points and all closed orbits) is dense in the nonwandering set 0 for VE.':ll(M) [23J; (iv) there is no regular first integral for v E f£l(M), where a regular first integral of a vec- tor field V on M is a CI-function f:M-->R such that .f is not constant on any open set of M and is constant along any orbit of (the flow generated by) V (R. C. Robinson). Let V (resp. V') be an element of .':l'(M) (resp. .':l'(M')), (M, <p) (resp. (M', <p')) the flow generated by V (resp. V'), and 0 (resp. 0') the nonwandering set of(M, <p) (resp. (M', <p')). V and V' are called topologically equivalent (resp. O-equivalent) if (M, <p) and (M', <p') (resp. the restrictions (0, <p lOx R) and (0', <p' I 0' x R)) are topologically equivalent. A vector field VE.':l'r(M) (or the flow generated by V) is called C'-structurally stable (resp. C'-O-stable) if there exists a neighborhood ,AI of V in .':lr(M) such that any V' in ,AI is topologically equivalent (resp, O-equivalent) to V. If V E .':l'(M) is structurally stable (resp. O-stable), then the topological structure of the phase portrait of (the flow generated by) V in the whole space (resp. the nonwandering set) re- mains invariant under a sufficiently small C'- perturbation of V. The generic properties (i)- (iv) above hold for C1-structurally stable vector fields (Markus, Thom, Peixoto, J. Ar- raut), The generic properties (i) and (iii) above hold for C1-0-stable vector fields. (2) Let M be a compact Coo-manifold with- out boundary. Let F'(M) be the set of all C' -mappings of M into itself with the topol- ogy of uniform C' -con vergence. Let Diff' (M) be the subset of F'(M) consisting of all C'- diffeomorphisms of M onto itself. Then F'(M) is a complete metric space, and Diff'(M) is open in F'(M). Thus Diff'(M) is a tBaire space and also a topological group with respect to this topology. A proposition P concerning f E Diff' (M) is called generic if the set {f E Diff'(M) I P(/)} contains a residual set. The following properties are generic: (i) Every periodic point is hyperbolic [21J; (ii) for each pair of hyperbolic periodic points p and q, the stable manifold WS(p) intersects the unstable manifold WU(q) transversally [21J; (iii) for a C1-diffeomorphism f E Diff 1 (M) the set of all periodic points is dense in the nonwandering set ([23J, Robinson). Let f: M --> M and g: N --> N be C'- diffeomorphisms and 0(/) and O(g) their nonwandering sets. f and 9 are called Q- conjugate if fIO(f):O(/)-->O(/) and gIO(g): O(g)-->O(g) are topologically conjugate. A diffeomorphism fEDiff'(M) is called C'- structurally stable (resp. C'-O-stable) if there exists a neighborhood ,AI of fin Diff'(M) such that any 9 in ,AI is topologically conjugate (resp. O-conjugate) to f Generic properties (i)- 
126 I Dynamical Systems (iii) above hold for CI-structurally stable dif- feomorphisms, and (i) and (iii) above hold for C I-Q-stable diffeomorphisms. I. Low-Dimensional Systems (1) Let S I == {z E c II z I = I} be the unit circle in the complex plane C and p:R-->SI the tcover- ing projection defined by p(x) = e 2nix , x E R. Let f: SI-->SI be an torientation-preserving homeomorphism. Then f can be lifted to a mapping F: R --> R satisfying the following conditions: (i) po F = fop; (ii) F is monotone increasing; (iii) F -I R is a periodic function of period 1, where l R is the identity mapping of R. The limit p(/) == 1imn00 Fn(x)/n exists for all xER, and its tresidue class modulo Z is inde- pendent of F and x. We call p(f) the rotation number of f Let f, g: SI-->SI be orientation- preserving homeomorphisms. If f and 9 are topologically conjugate by an orientation- preserving (resp. 7reversing) homeomorphism, then p(f)= p(g) (resp. p(f)= - p(g)) modulo Z, An orientation preserving homeomorphism f: SI -->S2 has a periodic point of the smallest positive period s if and only if p(/)=r/s, where rand s are relatively prime integers. In this case all periodic points are of the smallest positive period s. If p(/) is irrational, then the w-limit set w(x) of XES I is independent of x, and E = w(x) is either tperfect and tno- where dense or the whole space SI. If p(/) is irrational and E = w(x) = SI, then f is called transitive. If f is transitive, then it is topologi- cally conjugate to the rotation rp(f): SI -->SI defined by rp(f)(eZ.iX) = e 2 . i (x+ p (f», XER. Let f: SI-->SI be of class C I with p(f) irrational. If its derivative f' is of tbounded variation, then f is transitive (A. Denjoy). In particular, if f is of class C 2 with p(f) irrational, then f is topologically conjugate to the rotation rp(f)' However, there are CI-diffeomorphisms of SI onto itself whose nonwandering sets are not the whole space. Those CI-diffeomorphisms are never topologically conjugate to C 2 _ diffeomorphisms. M. R. Herman gave a suffi- cient condition for a diffeomorphism of SI onto itself to be differentiably conjugate to a rotation. A C'-diffeomorphism f: SI -->SI is structur- ally stable if and only if Q(f) is finite (hence Q(/) consists of a finite number of periodic points), and all periodic points are hyper- bolic. The set of all structurally stable C'- diffeomorphisms of Sl onto itself is a dense open set in Diffr(S') (Peixoto). (2) Let D be an open set in R 2 andf:D-->R 2 a continuous mapping. We assume that for each XED there exists a unique nonextend- able solution <p(x, t) with the initial condition 496 <p(x, 0) = x defined on (ax, b x ) for the equation dx/dt= f(x), XED. Suppose that for a given point xED there exists a compact set K in D containing the positive semi orbit C+(x)= {<p(x, t) I O.s; t < bJ of x. Then b x = 00. Further, we assume that the w-1imit set w(x) of x con- tains no singular points. Then we have either (i) C+(x)=w(x) and it is a closed orbit, or (ii) C+(x) # w(x) and w(x) is a closed orbit. In case (ii), w(x) is a tsimple closed curve and C+(x) is a spiral which tends the closed orbit w(x) (Poincare, I. Bendixson). We call such a closed orbit w(x) a limit cycle. Let fl and f2 be poly- nomials of two variables and m the maximum of degrees offl andf2' Let f=(/I J2):R 2 -->R z be the mapping defined by fl and f2' The equation dx/dt== f(x), xER 2 , defined by such an f is called a polynomial system of degree m. The following is Hilbert's 16th problem: Is there a number N(m) depending only on m such that the number of limit cycles for any polynomial system of degree m is bounded by N(m)? Let M be a closed (i.e., compact, without boundary) Coo-manifold of dimension 2 and (M,<p) a C 2 -flow on M. Then a minimal set of (M, <p) is either (i) a singular point, (ii) a closed orbit, or (iii) the whole space M. For case (iii), M is a torus (Poincare, Denjoy, C. L. Siegel, A. J. Schwartz). Let T be a 2-dimensiona1 torus and (T, <p) a C'-flow. Suppose that (T, <p) has a cross section L which is CI-diffeomorphic to SI. Let f: L -->L be the Poincare mapping for L. Then (T, <p) has a closed orbit if and only if the rotation number p(/) of f is rational. If p(f) is irrational and (T, <p) is of class C 2 , then T is a minimal set. Let M be an orientab1e 2-dimensional closed Coo -manifold and V E;?[I (M). Then V is structurally stable if and only if the following conditions are satisfied: (i) There are only a finite number of singular points, all hyperbolic; (ii) There are only a finite number of closed orbits, all hyperbolic; (iii) There are no orbits which connect saddle points; (iv) The ex(x) and w(x) for any XEM are singular points or closed orbits (Peixoto). The above theorem was first proved by Andronov and Pontryagin for analytic vector fields on a 2-dimensiona1 disk which are transverse to the boundary of the disk at any boundary point. The set of all structurally stable vector fields of class CIon M is a dense open set in ;?[I(M) (Peixoto). J. Axiom A Systems In this section we assume that phase spaces are closed Coo-manifolds with metric d and I .s; r.s; 00 unless stated otherwise. 
497 (1) A vector field VEP(M) (resp. a C'-flow (M,<p)) is called a Morse-Smale vector field (resp. a Morse-Smale flow) if the following conditions are satisfied: (i) The nonwandering set is the union of a finite number of singular points and a finite number of closed orbits; (ii) The singular points and closed orbits are all hyperbolic; (iii) The stable manifolds and the unstable manifolds of the singular points and closed orbits intersect each other transversely. If dimM =2, the Morse-Smale vector fields on M are exactly the structurally stable ones discussed in Section 1(2). A Morse-Smale vector field is structurally stable [25]. The set of all Morse-Smale vector fields on M is open but not dense in f'l'r(M) if dim M > 2 (Palis, Smale). However, it contains a dense open subset of the set of all tgradient vector fields with respect to a given tRiemannian metric (Smale). In particular, any closed manifold admits Morse-Smale vector fields and hence structurally stable vector fields. For a Morse- Smale vector field, the tMorse inequalities hold as they do in the tcalculus of variations in the large [24]. (2) A vector field VEf'l'r(M) (or the C'-flow (M, <p) generated by V) is called an Anosov vector field (or an Anosov flow) if the following conditions are satisfied: (i) There is a direct sum decomposition TAM) = L(V(x)) EB E EB E of the tangent space TAM) for each xEM which depends continuously on x EM; (ii) (d<p,UE)=E,(x) and (d<p,)x(E)=E,(x) for all XEM and tER; (iii) There are a Riemannian metric on M and constants c, ..1. >0 such that, for all t>O and XE M, II(d<p,)Aw)ll.s;ce-;" Ilwll when wEE, and II(d<p_,)Aw)ll.s;ce-Atllwll when w E E, where 11'11 is the norm induced by the Riemannian metric. The suspensions of Anosov diffeomorphisms and the tgeodesic flows on Riemannian manifolds of negative curvature are important examples of Anosov flows (1. Hadamard [8J). There are exam- ples of Anosov flows other than the ones stated above (M. Handel and W. P. Thurston, 1. Franks and R. F. Williams). The follow- ing have been proved by Anosov [8]: (i) An Anosov flow is structurally stable; (ii) there are countably many closed orbits for an Anosov flow; (iii) if there exists a smooth in- variant measure (i.e., an tin variant measure which has a tsmooth density with respect to the measure associated with the Riemannian metric), then the set of all closed orbits is dense in M. If we assume further that the flow is of class C 2 , then it is tergodic; (iv) the set of all Anosov vector fields is open in f'l'r(M); (v) {E}, {E} (xEM) define tfoliations on M, which are called Anosov foliations. (3) A vector field VEf'l'r(M) (or the Cr-flow (M, <p) generated by V) is called an Axiom A 126 J Dynamical Systems vector field (or an Axiom A flow) if the follow- ing conditions are satisfied: A(a) The non- wandering set consists of a finite set F of sin- gular points, all hyperbolic, and the closure A of the union of closed orbits, and F n A = 0; A(b) The conditions (i)-(iii) in the definition of Anosov flow in which we 1'eplace the terms "xEM" by" XEA." For each xEM, put WS(x) = {YE M I d(<p,(x), <p,(y»-->O as h oo} and WU(x)= {YE M Id(<p_,(x), <p_,(y)-->O as hOO}. We call WS(x) (resp. WU(x)) the stable (resp. unstable) manifold of V at x. For a subset A of M, put Wa(A)= UXEA Wa(x) (O"=s, u). For XE M, put wwa(X) = Wa(C(x)) (O"=s, u), where C(x) is the orbit of x. If the flow satis- fies Axiom A, then WS(x), WU(x), WWS(x), and WWU(x) are injectively immersed submanifolds for all XEM (M. W. Hirsch and C. C. Pugh). If x E M is a hyperbolic singular point, then WS(x) and WU(x) defined above coincide with those defined before. If C(x) is a hyperbolic closed orbit, then WS(C(x)) and WU(C(x» defined above coincide with those defined before. For an Axiom A flow, there is a decomposi- tion of the nonwandering set Q=QI U... UQm (disjoint union), where each Qi is closed, invar- iant, and transitive (i.e., has a dense orbit), and M = U?'I WS(Qi)== UI WU(Qi) (disjoint union) [7]. This decomposition is called the spectral decomposition of Q, and each Qi is called a basic set. Let Q=Qj U... U Qm be the spectral decomposition of the nonwandering set for an Axiom A flow. Denote Qi.s;Qj if W'(Q;) n WU(Qj) # 0. A sequence of basic sets Qi o ' Qi 1 , ..., Qi, (k> 1) is called a cycle if Qio.s; Qi .s;....s; Qi" Qi = Qi" and otherwise Qi # Q: for p #q. An Axiom A flow which ha no q cycles in the above sense is said to satisfy the no cycle condition. The Q-stability theorem: An Axiom A flow with the no cycle condition is Q-stable [27]. Q-explosion: If an Axiom A flow has a cycle, then it is not Q-stable (Palis). An Axiom A flow is said to satisfy the strong transversality condition if WWS(x) and WWU(x) intersect transversely at any xEM. An Axiom A flow with the strong transversality condi- tion satisfies the no cycle condition [7]. The structural stability theorem: An Axiom A flow of class C l with the strong transversality con- dition is structurally stable [29,31]. Morse- Smale flows and Anosov flows are Axiom A flows with the strong transversality condition. There are Axiom A flows other than Morse- Smale flows and Anosov flows that satisfy the strong transversaIity condition [7]. Stability conjecture: A cr-flow is structurally stable (resp. Q-stable) if and only if it is an Axiom A flow with the strong transversality condition (resp. the no cycle condition). S. Newhouse, V. A. Pliss, Robinson, R. Mane, S. D. Liao, 
126 J Dynamical Systems and A. Sannami have made important contri- butions to the study of the stability conjecture. Neither the set of all Axiom A flows nor the set of all a-stable flows nor the set of structur- ally stable flows is dense in :?l'r(M) if dim M > 2 (R. Abraham and Smale, Newhouse). How- ever, the set of all structurally stable flows is dense in !!l'r(M) in the CO topology (M. Shub, M. M. C. de Oliverira). (4) A diffeomorphism fE Diff'(M) is called a Morse-Smale diffeomorphism if the following conditions are satisfied: (i) The nonwandering set a is a finite set, and hence it consists of periodic points; (ii) all periodic points are hyperbolic; (iii) for each pair p, qEO, W'(p) intersects WU(q) transversally. The Morse- Smale diffeomorphisms on the unit circle SI are exactly the structurally stable ones de- scribed in Section 1(1). A Morse-Smale dif- feomorphism is structurally stable [25]. The set of all Morse-Smale diffeomorphisms on M is open but not dense in Diff'(M) ifdimM2 (Palis, Smale). However, it contains a dense open subset of the set of all time-one mappings of the flows generated by gradient vector fields. In particular, any closed manifold admits Morse-Smale diffeomorphisms and hence structurally stable diffeomorphisms. For a Morse-Smale diffeomorphism, the Morse inequalities hold [24]. Let A be a closed invariant set of f E Diff'(M). A is called hyperbolic if the follow- ing conditions are satisfied: (i) There is a split- ting TAM) = E EB E of the tangent space TAM) for each xEA, which depends continu- ously on xEA; (ii) dfAE)= Ef(x) and dfx(E)== Ef(x) for all x E A; (iii) There is a Riemannian metric on M and constants c > 0, 0 < A < 1 such that, for any integer m>O and xEA, II dfxm(w) II .s;c..1. m llwll when wEE and Ildfx-m(w)ll.s; dmllwll when wEE. A diffeomorphism fE Diff'(M) is called an Anosov diffeomorphism if M itself is hyperbolic for f Manifolds which admit Anosov diffeomorphisms are restricted (Hirsch, K. Shiraiwa). Examples of Anosov diffeomorphisms are given as follows: Let L: R n --> Rn be a hyperbolic linear automor- phism with L(zn)=zn, where zn is the dis- crete subgroup of Rn consisting of all elements with integral coordinates. Then L induces an automorphism f: T n --> Tn of the n-dimensional torus Tn=Rn/zn, which is an Anosov diffeo- morphism. There are similar constructions using hyperbolic automorphisms of simply connected tnilpotent Lie groups and their uni- form discrete subgroups (Smale), A homeo- morphism h: X --> X of a metric space X is called expansive if there is a constant /; > 0 such that x, YE X and d(hn(x), hn(y)) < f. for all nEZ imply x= y. An Anosov diffeomorphism is expansive. For Anosov diffeomorphisms, 498 theorems similar to those stated in Section 1(2) hold. Franks, Newhouse, A. Manning, and 1. N. Mather obtained important results concerning Anosov diffeomorphisms. A diffeomorphism f E Diffr(M) is called an Axiom A diffeomorphism if the following con- ditions are satisfied: A(a) The nonwandering set a is hyperbolic; A(b) the set of all periodic points is dense in O. There are examples that satisfy Axiom A(a) but not Axiom A(b) (A. Dankner, M. Kurata). For xEM, put WS(x) = {YE M I d(/n(x)Jn(y))-->O as n--> oo} and WU(x)= {YEM I d(rn(x),rn(y))-->O as n-->oo}. We call WS(x) (resp. WU(x)) the stable (resp. unstable) manifold of fat x. For a subset A of M, put Wa(A) = UXEA Wa(x) (0"== S, u). Iff is an Axiom A diffeomorphism, WS(x) and WU(x) are injectively immersed submanifolds of M. If x is a hyperbolic periodic point, W'(x) and WU(x) defined here coincide with those defined before. For Axiom A diffeomorphisms, notions such as spectral decomposition, basic sets, the strong transversality condition, and the no cycle condition are defined similarly, and theorems similar to those stated in Section 1(3) hold. Let h: X --> X be a homeomorphism of a met- ric space X and ex, fJ>O. A sequence {Xi}iEZ of points in X is an ex-pseudo-orbit if d(h(x i ), Xi+1) <ex for all iEZ. We say that {X;}iEZ is fJ- shadowed (or fJ-traced) by a point XEX if d(hi(x),Xi).s;fJ for all iEZ. We say that h has the pseudo-orbit tracing property if for any fJ > 0 there exists an ex> 0 so that any ex-pseudo- orbit is fJ-shadowed by some point. If f is an Axiom A diffeomorphism, then fl 0(/): 0(/) -->0(/) has the pseudo-orbit tracing property (Bowen). Axiom A diffeomorphisms with the strong transversality condition (in particular, Anosov diffeomorphisms) have the pseudo- orbit tracing property (Bowen, K. Sawada, K. Kato, A. Morimoto). (5) Let S be a discrete topological space with n elements (n2) and 1:'(S)=TI iEZ S i the dou- bly infinite product of copies Si of S with the product topology. A point X=(X;)iEZ of 1:'(S) is a doubly infinite sequence of points in S. 1:'(S) is a ttotally disconnected, perfect, compact, tmetrizable space (i.e., a tCantor set). Let O":1:'(S)-->1:'(S) be a mapping defined by O"(X) = y, x= (x;), y=(y;), Yi=X i + 1 (iEZ). Then 0" is a homeomorphism of 1:'(S) and is called the shift automorphism with n symbols. Define a diffeomorphism f: S2-->S2 of the 2- dimensional sphere S2 as follows. We consider S2 as R 2 U {oo}. Let U be a region of R 2 con- sisting of the rectangle R (= ABCD) and an upper and a lower cap as shown in Fig. 1. f maps U into itself as in Fig. 1 (f(A)=A',f(B) = B'J(C) = C'J(D) = D', and so on). Here, R nf(R)= PU Q is the union of two rectangles, 
499 P' == f-I(P) and Q' = rl(Q) are rectangles, and f is taffine on each of P' and Q' (stretching in the vertical direction and contracting in the horizontal direction). The lower cap is con- tracted into itself, and every point in it tends to a sink Xo. The upper cap is mapped into the lower one and stays there. The point 00 is the only source, and limnoof-n(x)= 00 for all xER 2 - U. Thus the nonwandering set off consists of a sink Xo, a source 00, and A = nnEZ {"(R). The mapping f constructed here is called the horseshoe diffeomorphism. It is an Axiom A diffeomorphism with the strong transversality condition (and hence structur- ally stable). The restriction Ii A of f on a basic set A is topologically conjugate to the shift automorphism with two symbols. It is neither a Morse-Smale diffeomorphism nor an Anosov diffeomorphism [7]. R D c u Fig. 1 A restriction of a shift automorphism on a closed invariant set is called a subshift. Let A = (Ai) be an n x n matrix with (i,j)-component A ij = 0 or 1 for all i,j== 1, ..., n. It is irreducible if for each i,j there is a positive integer m such that Am has a nonzero (i,j)-component. As- sume that A is irreducible and S == {l, ..., n}. Put LA = {x= (Xi)EL(S) I Ax,x'+1 = 1 for all iEZ}. Then LA is a closed invariant set of L(S). The restriction 0" A == 0" I LA: LA --> LA is called a subshift of finite type or a Markov subshift, and A is called the transition matrix. A topological classification of the subshifts of finite type has been investigated by Wi1liams. Let fEDiffr(M) be an Axiom A diffeomor- phism and A its basic set. Bowen constructed a tMarkov partition for fl A by generalizing the Sinai construction for Anosov diffeomor- phisms. The Markov partition connects fl A with a suitable subshift of finite type and is applied to the study of Axiom A diffeomor- phisms, especially to the ergodic theory of Axiom A diffeomorphisms (Bowen, Ruelle). A similar theory for flows was developed by Bowen, Ruelle, and M. E. Ratner. (6) Let XE M be a hyperbolic fixed point of 126K Dynamical Systems f E Diff'(M). A point pE W'(x) n WU(x) (p # x) is called a homoclinic point. If WS(x) and WU(x) intersect transversally at a homoclinic point p, then p is called a transversal homoclinic point. In a neighborhood of a transversal homoclinic point, there is a closed invariant set A of ,r for some positive integer m such that fm I A is topologically conjugate to the shift automor- phism with two symbols (Smale). There are generalizations of this theorem for semiflows by F. R. Marotto, Shiraiwa, and Kurata. K. Topological Entropy and Zeta Functions (1) The notion of topological entropy was first defined by R. L. Adler, A. G. Konheim, and M. H. McAndrew as an analog to measure- theoretic tentropy. Let X be a compact topo- logical space and ex an topen covering of X. Let N(ex) be the minimum number of members of a tsubcovering of ex. Let f: X --> X be a con- tinuous mapping and ex an open covering of X. Then limnoo(1/n)logN(exv f- l exv... V f-n+lex) exists, where ex v f- l ex v... V f-n+l ex is the open covering {Aon rl(Adn... nr n +1(An_dI A o, AI' ... , An-I E ex}. We denote the above limit by h(f, ex) and call it the topological entropy of I with respect to ex. The topological entropy h(f) of f is defined as the suph( f, ex), where the supremum is taken over all open coverings ex of X. Now assume that X is a compact metric space. For an open covering ex of X, put diamex ==sup{diamA I AEex}, where diamA is the tdiameter of A. If {exn}n;<ol is a sequence of open coverings of X such that diam exn -->0 as n --> 00, then h(f, exn)-->h(f) as n--> 00. The topological entropy of the shift automorphism with n symbols is equal to log n. Let L :R n --> R n be a linear mapping with L(zn) c L(zn) and ..1.1' ..., ).n its eigenvalues. Let f: T n --> Tn be the in- duced endomorphism of the n-dimensional torus Tn == R n /zn. Then the topological entropy is given by h(f) = LIA;I>1 log 1..1.;1. Let f:X -->X be a continuous mapping and M(f) the set of all tf-invariant probabil- ity measures on the tBorel sets of X. For each IIEM(f), denote by h(f) the (measure- theoretic) entropy of f with respect to J1. Then h(f)=sup{h(f)IJ1EMU)} (E. I. Dinaburg, T. N. T. Goodman, L. W. Goodwyn). The following properties hold: (i) If I: X --> X is a homeomorphism, then hU n ) = I n I h(f) for all nEZ. (ii) If (X,<p) is a continuous flow, then h(<p,)== Itlh(<pd for all tER. (iii) Let /;:Xi-->X i (i == 1,2) be continuous. Then hUI x f2) = hUd+ h(f2)' (iv) Let /;:Xi-->X i (i= 1,2) be con- tinuous. If there is a continuous mapping g:X I -X 2 such that g(X I )=X 2 and gofl == f2 0 g, then h(fd? h(f2)' In particular, if fl and f2 are topologically conjugate, then h(fd= 
126 L Dynamical Systems h(/2)' (v) If f:X -->X is a homeomorphism and Q is the nonwandering set of f, then h( f)= h(/IQ). In particular, Q-conjugate homeo- morphisms have the same topological entropy (Bowen). (vi) Iff: M -->M is an Axiom A diffeo morphism of a closed C"'-manifold M, then h(f)=limsuPnoc,(1/n)logNn(/), where N n (/) is the number of all periodic points of period n. And h(/)=O if and only if the nonwandering set is finite (Bowen). Bowen gave alternative definitions of the topological entropy and generalized it for uniformly continuous mappings of (not neces- sarily compact) metric spaces. If f: M --> M is a C'-mapping of an n-dimensional Rie- mannian manifold M, then h( f).s; max {O, nlog sup{ Ildfxlll XE M} }, where Ildfxll is the norm of dfx: TAM)--> Tf(x)(M), and hence h( f) is finite (Bowen, S. Ito). (2) Let f: M -->M be a continuous mapping of a closed CO: -manifold M and f*: H*(M)--> H*(M) (resp. 1*1 : HdM)-->Hd M ») the tinduced homomorphism if f on the thomology group H*(M) (resp. the tfirst homology group HdM)) with coefficients in R. The spectral radius s(L) of a linear mapping L: E --> E of a real vector space E is the maximum of the absolute values of the eigenvalues of L. The following entropy conjecture is still open: If f:M -->M is a CI-diffeomorphism (C'- mapping), then h(.f)? log s( f*). Concerning the entropy conjecture, the following are known: (i) The conjecture holds for a dense open set of Diffr(M) in the CO-topology (Shub); (ii) for any continuous mapping f, h(/)?logs(/*d (Manning); (iii) the conjecture fails for a homeomorphism (Pugh); (iv) the conjecture holds for an Axiom A diffeomorphism with the no cycle condition (Shub and Williams); (v) the conjecture holds for any continuous mapping of the n-dimensional torus (M. Misiurewicz, F. Przytycki); (vi) for a CI- mapping f:M -->M, h(/)?logldegfl, where degf is the tmapping degree of f (Misiurewicz, Przytycki). (3) M. Artin and B. Mazur first defined the zeta function of a diffeomorphism by analogy to tWeil's zeta function. Let f:X -->X be a homeomorphism of a compact metric space X. Assume that the number N m = N m ( f) of all periodic points of period m is finite for all m. Put W)=exp('L.1 Nmtm/m) and call it the zeta function of f For any closed en-manifold M, there is a dense set E of Diffr(M) such that Nm(f).s;ck m for fEE, where N m (/) is the number of isolated periodic points of period m and c and k are positive constants depending only onf(Artin, Mazur). Hence, for such fEE, the series exp('L., Nm(/)t m /m) has a positive radius of convergence. Originally, Artin and Mazur called this the zeta function of t: 500 Let O"A:LA-->LA be a subshift of finite type with transition matrix A; then (t)= l/det(E- tA), where E is the unit matrix (Bowen and O. E. Lanford III). The zeta function of an Axiom A diffeomorphism has a positive radius of convergence and is a rational function (K. Meyer, 1. Guckenheimer, Manning). Let (M, <p) be a nonsingular C'-flow on a closed C"'-manifold M. Let r be the set of all closed orbits and 1(1') the smallest posi- tive period of l' E r. Smale defmed the zeta function of (M, <p) by Z(s)= rriE"rro(l- [exp/(y)J -d). If (M, <p) is the geodesic flow on a surface of constant negative curvature, Z(s) reduces to the tSelberg zeta function. There are generalizations and modifications of the notion of zeta function by Ruelle and Franks. L. Classical Dynamical Systems (1) Let M be a Coc -manifold without bound- ary. A C'-flow (M, <p) or a C'-diffeomorphism f: M --> M with a smooth invariant measure is called a classical dynamical system. Most important classical dynamical systems are Hamiltonian or Lagrangian systems. In the modern formulation, a Hamiltonian system consists of M, a symplectic form w on M (i.e., a tnondegenerate tclosed 2-form), and the vector field X H on M defined by a C'+I-function H:M-->R. We call X H the Hamiltonian vector field with energy function H. Let (M, w, X H ) be a Hamiltonian system. Then the following hold: (i) M is of even dimension and there is a system oflocal coordinates (ql , ..., qn, PI' ..., Pn) such that w = 'L.7=1 dqi /\dpi (1. G. Darboux). In these coordinates X H is expressed by tHamilton's equations dqi/dt = aH/api, dp;/dt == - aH /oqi, i= 1, ..., n; (ii) the smooth measure defined by the tvolume element Q= « _1)[n/2 1 / n !)w n is an invariant measure for the flow generated by X H (1. Liouville); (iii) the energy function H is constant along any trajectory of X H . Especially, H-'(e) (eER) is an invariant set for each e and is called an energy surface. Energy surfaces are submani- folds of codimension one for almost all e E R. Important examples of Hamiltonian sys- tems are given as follows. Let Q be an n- dimensional manifold and T*(Q) the tcotan- gent bundle of Q. Then T*(Q) has a canonical symplectic form Wo. We call Q a configuration space and T*(Q) a momentum phase space. For any C'+1-function H on T*(Q), we have a Hamiltonian system (T*(Q), Wo, X H ) in which Xu is of class C. The Hamiltonian formalism is translated into Lagrangian formalism by using the ttangent bundle T(Q) instead of T*(Q). T(Q) is called a velocity phase space. 
501 For a given differentiable function L on T(Q), the energy function E on T(Q) and the Lagran- gian vector field X E on T(Q) are constructed, If L satisfies a certain condition, there is a system of\oca1 coordinates (q" ..., qn, q' , ... ,qn) such that X E is expressed in these coordinates by the tEuler-Lagrange equations dqi/dt == qi, d/dt(8L/oqi) = OL/oqi, i = 1, ..., n. The projec- tion of an integral curve (i.e., an orbit) of X E into Q is called a base integral curve. Under a suitable condition, the base integral curve of a Lagrangian (or a Hamiltonian) system is the geodesic of the Jacobi metric on Q up to a reparametrization, and the restriction of the flow generated by X x on an energy surface is the geodesic flow described below. (2) Let M be a tcomplete Riemannian mani- fold of class cae and S(M)= {VE T(M) I II vii = I} Let n:S(M)-->M be the projection defined by n(v)=x for VE TAM). Then S(M) is a tsphere bundle over M, which is caIled the unit tangent sphere bundle over M. Each vES(M) deter- mines a unique tgeodesic Cv:R-->M such that Cv(O) == n(v) and the ttangent vector C(O) to C v at t = 0 is equal to v. Let v, be the tangent vector C( t) to C v at t for t E R. Then II v, II = 1 and n( v,) == Cv(t) for all t E R. Define a map- ping <p: S(M) x R -->S(M) by <p(v, t) = v,. Then (S(M), <p) is a coo-flow, which is caIled the geodesic flow on M. By the classical Liouville theorem, a geodesic flow has a smooth invar- iant measure. (3) Let Tn = Rn /zn be the n-dimensional torus and w, , ... , W n real constants. Define a mapping <p: Tn X R--> Tn by <p([x"..., xnJ, t)== [XI +W l t,..., X n +wntJ, where [XI'"'' xnJ E Rn/zn == Tn is the residue class of (x I, ... , X n ) E Rn modulo zn, Then (Tn, <p) is a C"'-flow. We call it the translation flow with frequencies WI, ..., W n . If WI' ..., W n are linearly indepen- dent over Z, they are caIled independent. Every orbit of the translation flow is dense if and only if its frequencies are independent (Poin- care, H. Weyl). A translational flow with inde- pendent frequencies is called quasiperiodic. Let (M, w, X H ) be a Hamiltonian system on a 2n-dimensional manifold M. Under a certain condition, there exist an open set U of M and a diffeomorphism f: U --> Tn X Rn such that the following holds: Identify U with Tn X R n by f, and let (ql, ..., qn, PI' ..., Pn) be the coordi- nates of Tn X Rn. Then the energy function H is independent of q==(ql,... ,qn) so that Hamilton's equations becomes dqi/dt= OH/OPi' dp;/dt = - OH/oqi = 0 for i = 1, ..., n, Therefore the solutions are given by qi(t) = (OH/op;(c))t + qi(O) modulo Z, p;(t) = Pi(O) = c" i= 1,..., n, where c ==(c l , "', cn)' Therefore N c == f- I (Tn X {c} ) is an invariant torus (i.e., an invariant set diffeomorphic to T") of (the flow generated by) X H for all cERn, and the restriction flow on 126M Dynamical Systems N c is the translational flow with frequencies WI == oH/oPI (c), ..., W n == oH/oPn(c) (Arnold), As- sume further that the Hessian det(02 H/OPiOPj) does not vanish at c and WI, .. , , W n are inde- pendent. Let Ii be an energy function obtained by adding a sufficiently small perturbation to H. Then for almost all c' near c, there exists an invariant torus IV of Xii near N c ' such that the restriction flow on IV is differentiably equiva- lent to the translational flow with the same frequencies (Kolmogrov, Arnold, Moser). (4) Generic properties for Hamiltonian systems were investigated by M. Buchner, Markus, Meyer, Pugh, Robinson, Takens, and Newhouse. M. Bifurcation (1) Consider a differential equation with a parameter. For example, let X be a domain ofR n , J=( -1,1), and f:J x X -->R n aC r - mapping. For each J1E J, define fv: X -->R n by fv(x) == f(J1, x), xE X. Consider the differential equation dx/dt= fv(x), XEX and J1EJ. (7) As J1 varies, the topological structure of the phase portrait of (7) may change. Suppose that there exists J10 E J such that the topological structure of the phase portrait of (7) changes at J1=J10 but remains the same when J1o-B<J1< J10 or J10 < J1 < J10 + B for some B > O. Then J10 is called a bifurcation point of (7). Hopf bifurcation: Assume that X = R 2 and the origin OE R 2 is a singular point of (7) for all J1EJ. Assume further that the Jacobian matrix of f" at OER 2 has two distinct complex conjugate eigenvalues A(J1) and A(J1) such that the real part Rd(J1) of A(J1) is positive when J1 > 0, zerO when J1 = 0, and negative when J1 < O. Then OE R 2 is a sink for J1 < 0 and a source for J1 > O. Now assume further that d/dJ1(Rd(J1))lvo is positive and OER 2 is a "vague attractor." Then 0 E J is a bifurcation point, and there exists an asymptotically stable closed orbit for (7) near and around OER 2 which depends continuously on J1 for J1 > 0 [37]. Thus a sink of (7) (J1 < 0) changes to a source and an asymptotically stable closed orbit (J1 > 0) when J1 changes its sign. The Hopf bifurcation theorem can be generalized to a higher-dimensional case, and there is a dif- feomorphism version of the theorem. (2) Bifurcations in more general settings have been investigated by many mathema- ticians, including Thom, Arnold, R. J. Sacker, G. R. Sell, D. H. Sattinger, G. Iooss, Ruelle, and Takens. Generic bifurcations of dynamical systems have been investigated by 1. Soto- 
126N Dynamical Systems mayer, Meyer, P. Brunovsky, and others; bifurcations of Morse-Smale systems by New- house, Palis, Peixoto, and S. Matsumoto; and bifurcations of Axiom A diffeomorphisms by Newhouse and Palis. N. Miscellaneous Topics (1) Let S3 be the 3-dimensional sphere and VE,q[I(S3). H. Seifert proved that if V is suffi- ciently close (in CO topology) to a nonsingular vector field tangent to the fibers of the tHopf fibration S3 -->S2, then V has a closed orbit. He conjectured that every nonsingular vector field VE,q[I(S3) had a closed orbit. (Seifert conjec- ture;  154 Foliations D). If a nonsingular vector field V E,q[1 (S3) is transverse to a codi- mension 1 foliation of class C 2 , then it has a closed orbit (S. p, Novikov). Let M be a 3- dimensional Coo-manifold. Then there exists a nonsingular vector field V E,q[ I (M) with no closed orbit in any thomotopy class of a non- singular vector field on M (P. A. Schweitzer). Thus the Seifert conjecture fails for a vector field of class C I, but the conjecture for a vector field of class C' (r 2) is an open problem. Related work has been done by Fuller, H, Chu, and A. Weinstein. (2) Let M be a closed connected coo_ manifold. A C'-flow (M, <p) (resp. fEDiff'(M)) is a minimal flow (resp. a minimal diffeomor- phism) if M itself is a minimal set. If M admits a tlocally free Sl-action of class Coo, then it admits a minimal Coo-diffeomorphism, and if M admits a locally free special (in particular, a tfree) T2-action of class Coo, then it admits a minimal coo-flow (A. Fathi, Herman, A. B. Katok). Open problems: What are the topo- logical properties of the manifolds admitting minimal flows? Does S3 admit a minimal flow? (3) E. N. Lorenz studied numerical solutions of the following nonlinear equations in R 3 which arose from the convection equation: dx/dt = -(Jx+(JY, dy/dt = -xz +rx- y, dz/dt= xy/bz. When (J=10, r=28, and b=8/3, he found irregular behavior in this dynamical system. R. M. May studied numerical solu- tions of the following difference equation in connection with the growth of biological popu- lations with nonoverlapping generations: X n + 1 =ax n (l-x n ), XnE[O, IJ (l<a<4). He found that the dynamical structure of the above difference equation was delicate and complicated. Y. Ueda and H. Kawakami also found similar phenomena in their numerical study of Duffing's equation of the type d 2 x/dt 2 + k dx/dt + x 3 = B cos t. The phenomena ob- served in these investigations were called chaos, which exhibits strange attractors. These investigations have attracted the 502 attention of many mathematicians and scien- tists, For I-dimensional semidynamical sys- tems such as May's equation, T. Y. Li, 1. A. Yorke, A. N. Sharkovskii, 1. W. Milnor, Thur- ston, and many others have obtained notable results, while for Lorenz's equation we have results by Ruelle, Guckenheimer, Williams, Sinai, and many others. Chaos arising from discretization of differential equations has been studied by M. Yamaguti, S. Ushiki, and others ( 433 Turbulence and Chaos). References [IJ H. Poincare, Memoire sur les courbes definies par une equation differentielle, 1. Math. Pures App!., (3) 7 (1881),375-422; 8 (1882),251-296. (Oeuvres I, Gauthier-Villars, 1928,3-84,) [2J H. Poincare, Sur les courbes definies par les equations differentielles, 1. Math. Pures App!., (4) 1 (1885), 167-244; 2 (1886), 151-217, (Oeuvres I, 90-158,167-222.) [3J H. Poincare, Les methodes nouvelles de la mecanique celeste. I, Solutions periodiques, Non-existence des integrals uniforms, Solu- tions asymptotiques. II, Methodes de MM. Newcomb, Gylden, Lindstedt et Bohlin. III, Invariants integraux, Solutions periodiques du deuxieme genre, Solutions doublement asymp- totiques. Gauthier-Villars, 1,1982; 11,1893; III, 1899; English translation, New methods of celestial mechanics I-III, Clearinghouse for Federal Scientific and Technical Informa- tion, Springfield, 1967. [4J G. D. Birkhoff, Dynamical systems, Amer. Math. Soc. Colloq. Pub!., 1927. [5J A. A. Andronov and L. S. Pontryagin, Systemes grossiers, Dok\. Akad. Nauk SSSR, 14 (1937),247-250. [6J S. Lefschetz, Differential equations: geo- metric theory, Interscience, 1957. [7] S. Smale, Differentiable dynamical systems, Bull. Amer. Math. Soc., 73 (1967), 747-817. [8J D. V. Anosov, Geodesic flows on closed Riemannian manifolds with negative curva- ture, Proc, Steklov Inst. Math., 90 (1967), 1- 235. [9J R. Bowen, Equilibrium states and the ergodic theory of Anosov diffeomorphisms, Lecture notes in math. 470, Springer, 1975. [!OJ D. Ruelle and F. Takens, On the nature of turbulence, Comm. Math. Phys., 20 (1971), 167-192. [IIJ J. Moser, Stable and random motions in dynamical systems, Ann. Math. Studies 77, Princeton Univ. Press, 1973, [12J L. H. Loomis and S. Sternberg, Advanced calculus, Addison-Wesley, 1968. 
503 [13J V. V. Nemytskii and V. V. Stepanov, Qualitative theory of differential equations, Princeton Univ. Press, 1960. (Original in Rus- sian, 1947.) [14J N. P. Bhatia and G. P. Szego, Stability theory of dynamical systems, Springer, 1970. [15J W. H. Gottschalk and G. A. Hedlund, Topological dynamics, Amer. Math. Soc. Colloq. Publ., 1955. [16J R. Abraham and 1. W. Robbin, Transver- sal mappings and flows, Benjamin, 1967. [17J L. Markus, Lectures in differentiable dynamics, revised edition, Regional Con£. Series in Math. 3, Amer. Math. Soc., 1980. [18J Z. Nitecki, Differentiable dynamics, MIT Press, 1971. [19J M. Shub, StabiliH: globale des systemes dynamiques, Asterisque, Soc. Math. France, 56 (1978). [20J M. C. Irwin, Smooth dynamical systems, Academic Press, 1980. [21J S. Smale, Stable manifolds for differential equations and diffeomorphisms, Ann. Scuola Norm. Sup. Pisa, (3) 17 (1963),97-116. [22J I. Kupka, Contribution a la theorie des champs generiques, Contributions to Differ- ential Equations, 2 (1963), 457-482. [23J C. Pugh, An improved closing lemma and a general density theorem, Amer. J. Math., 89 (1967),1010-1021. [24J S. Smale, Morse inequalities for a dynam- ical system, Bull. Amer. Math. Soc., 66 (1960), 43-49. [25J 1. Palis and S. Smale, Structural stability theorems, Amer. Math. Soc. Proc. Symp. Pure Math., 14 (1970), 223-231. [26J M. W. Hirsch and C. C. Pugh, Stable manifolds and hyperbolic sets, Amer. Math. Soc. Proc. Symp. Pure Math., 14 (1970),133- 163. [27J S. Smale, The Q-stability theorem, Amer. Math. Soc. Proc. Symp. Pure Math., 14 (1970), 289-297. [28J S. Smale, Notes on differentiable dynam- ical systems, Amer. Math. Soc. Proc. Symp. Pure Math., 14 (1970), 277-287. [29J 1. W. Robbin, A structural stability theo- rem, Ann. Math., (2) 94 (1971),447-493. [30J 1. W. Robbin, Topological conjugacy and structural stability for discrete dynamical systems, Bull. Amer. Math. Soc., 78 (1972), 923-952. [31J R. C. Robinson, Structural stability ofC 1 flows, Lecture notes in math. 468, Springer, 1975,262-277. [32J M. Shub, Dynamical systems, filtrations and entropy, Bull. Amer. Math. Soc., 80 (1974), 27-41. [33J M. W. Hirsch, C. C. Pugh, and M. Shub, Invariant manifolds, Lecture notes in math. 583, Springer, 1977. 127 A Dynamic Programming [34J R, Bowen, On Axiom A diffeomorphisms, Regional Conf. Series in Math. 35, Amer. Math. Soc., 1978. [35J R. Abraham and J. E. Marsden, Founda- tions of mechanics, second edition, Benjamin, 1978. [36J V. I. Arnold and A. Avez, Problemes ergodiques de la mecanique c1assique, Gauthier- Villars, 1967; English translation, Ergodic problems of classical mechanics, Ben- jamin, 1968. [37J 1. E. Marsden and M. McCracken, The H opf bifurcation and its applications, Applied math. sci. 19, Springer, 1976. 127 (XIX.8) Dynamic Programming A. General Remarks There are two types of multistage tdecision processes. In one of them, an outcome of the whole process is determined at the final stage without any consideration of the outcome for each intermediate stage. The textensive form of a tgame is of this type. In the other type, an outcome is assigned at each stage of a multistage decision process. The theory of dynamic programqIing, dealing with this latter type, has been developed by R. Bellman and others since 1950 and is now one of the fun- damental branches of mathematical program- ming, along with the theories of linear and nonlinear programming. The following exam- ples illustrate some features of multistage deci- sion processes. Multistage allocation process. We are given a quantity x> 0 that can be divided into two parts y and (x- y). From y we obtain a return g(y), and from (x - y) a return h(x - y). In so doing, we expend a certain amount of our original resources and are left with a new quantity, ay + b(x - y), 0 < a, b < 1, with which the process is continued. How do we proceed so as to maximize the total return obtained in a finite or unbounded number of stages? Multistage choice process. Suppose that we possess two gold mines A and B, the first of which contains an amount x of gold, while the second contains an amount y. In addition, we have a single gold mining machine with the property that if used to mine gold in the mine A, there is a probability PI that it will mine a fraction r I of the gold there and remain in working order, and a probability 1 - PI that it will mine no gold and be damaged beyond repair. Similarly, the mine B has associated 
127 B Dynamic Programming with it the corresponding probabilities P2 and 1 - P 2 and fraction r2' How do we proceed in order to maximize the total amount of gold before the machine is defunct? These two processes have the following features in common: (1) In each case we have a physical system characterized in any state by a smaIl set of parameters, the state variables. (2) In each state of either process we have a choice of a number of decisions. (3) The effect of a decision is a transformation of the state vari- ables. (4) The past history of the system is of no importance in determining future actions (tMarkov property). (5) The purpose of the process is to maximize some function of the state variables. A policy is a rule for making decisions that yields an aIlowable sequence of decisions; an optimal policy is a policy that maximizes a preassigned function of the final state vari- ables. A convenient term for this preassigned function of the final state variables is criterion function. One of the characteristic features of BeIlman's methodology of dynamic program- ming is the appeal to the principle of optimal- ity: An optimal policy has the property that whatever the initial state and initial decision are, the remaining decisions must constitute an optimal policy with regard to the state resulting from the first decision, In the multistage aIlocation process the state variables are x (the quantity of resources) and z (the return obtained up to the current stage). The decision at any stage consists of an aIloca- tion of a quantity O.s;y.s; x. This decision has the effect of transforming x into ay + b(x - y) and z into z + g(y) + h(x - y). The purpose of the process is to maximize the final value of z. Denote by fN(X) the N-stage return obtained starting from an initial state x and using an optimal policy. Then we have fl(x)= max [g(y)+h[x-y)], Oyx j,,(x) = max [g(y)+ h(x - y) o yx +fn-j(ay+b(x-y))J, n2. This recurrence relation yields a method for obtaining the sequence {In(x)} inductively. In the stochastic gold-mining process, the state variables are x and y (the present level of the two mines) and z (the amount of gold mined to date). The decision at any stage con- sists of a choice of A and B. If A is chosen, (x,y) goes into «I-r j )x,y) and z into z+rlx, and if B is chosen, (x, y) goes into (x, (1 - r2)y) and z into z + r2Y' The purpose of the process is to maximize the expected value of z ob- tained before the machine becomes defunct. Denote by f(x,y) the expected amount of gold obtained using an optimal sequence of 504 choice. Then we have [( ) _ [ PI {rlx+ f«(1-rdx,y)} ] y-mu . P2{rzy+ f(x,(I-r 2 )y)} The optimal policy can be described in the foIlowing way. We choose A or B according as Plrlx/(I-pd is greater or less than P2r2y/(I- P2)' We can choose either A or B if equality holds. After an operation according to such a choice, the machine may become defunct and terminate the process. If the machine is usable, then we can apply our policy to a new com- bination of the amounts of gold in A and B. B. Discrete Deterministic Processes By a deterministic process we mean a process in which the outcome of a decision is uniquely determined by the decision, We assume that the state of the system, apart from time de- pendence, is described in any stage by an M- dimensional vector P constrained to lie within some region D. Let T= {} (where q runs over a set S) be a set of transformations with the property that pE D implies that (p)E D for all q E S, i.e., any transformation  carries D into itself. The term "discrete" signifies here that we have a process consisting of a finite or de- numerably infinite number of stages, A policy, for the finite process which we consider first, consists of a selection of N transformations in order, P = (TI' T 2 , ..., TN), yielding succes- sively the sequence of states Pi= T;(Pi-d (i = 2,3, ..., N) with PI = TI (p). These transfor- mations are to be chosen to maximize a given function R of the final state PN' Observe that the maximum value of R(PN)' as determined by an optimal policy, wiIl be a function of the initial vector P and the number N of stages only. Let us then define our basic auxiliary functions fN(P) =max R(PN)= the N-stage re- turn obtained starting from an initial state P and using an optimal policy. This sequence is defined for N = 1, 2, .., ,and P ED. The essential uses of the principle of optimality can be ob- served from the foIlowing two features. The first is the use of the embedding principle. The original process is embedded in a family of similar processes. In place of attempting to determine the characteristics of an optimal policy for an isolated process, we attempt to deduce the common properties of the set of optimal policies possessed by the members of the family. The second feature is the derivation of recurrence relations by which the functional equations connecting the members of the se- quence Uk(P)} are established. Assume that we choose some transformation  as a result of our first decision, obtaining in this way a new state vector Tq(p). The maximum return 
505 from the following k -1 stages is, by definition, fk-dTq(p». It follows that, if we wish to maxi- mize the total k-stage return, q must now be chosen to maximize this (k - 1 )-stage return. The result is the basic recurrence relation fdp) =maxqEsfH (Tq(p», for k2, with fdp) = maxqEsR(Tq(p». For the case of an unbounded process, the sequence {fk(P)} is replaced by a single function f(p), the total return obtained by using an optimal policy starting from state p, and the recurrence relation is replaced by the functional equation f(p)=maxqf(Tq(p», C. Discrete Stochastic Processes We again consider a discrete process, but one in which the transformations are stochastic rather than deterministic. The initial vector p is transformed into a stochastic vector z with an associated distribution function dGq(p, z) dependent on p and the choice of q. We as- sume that z is known after the decision has been made and before the next decision is to be made. We agree to measure the value of a policy in terms of some average value of the function of the final state. Let us call this expected value the return. Beginning with the case of a finite process, we define fdp) as before. The expected return as a result of the initial choice of Tq is therefore f fH (z)dGq(p, z). ZED Consequently, the recurrence relation for the sequence {fdp)} is fk(p)=max f fk-dz)dGq(p, z), k 2, qeS zeD with 11 (p) = maxqEs JZEDR(Z)dGq(p, z). Con- sidering the unbounded process, we obtain the functional relation f(p)=max f f(z)dGq(p,z). qeS zeD D. Continuous Deterministic Processc!> A number of interesting processes require that decisions be made at each point of a con- tinuum, such as a time interval. The simplest examples of processes of this character are provided by the tcalculus of variations. Let us denote by f(p; T) the return obtained over a time interval [0, T] starting from the ini- tial state p and employing an optimal policy. Although we consider the process as One con- sisting of choices made at each point t on [0, TJ, it is better to begin with the concept of choosing policies (functions) over intervals, and then pass to the limit as these intervals shrink to points. Let d be an allowable deci- 127 E Dynamic Programming sion made over the interval [0, SJ, and let Pd be the state at S starting from the initial state p and employing d. The application of the prin- ciple of optimality suggests that f(p; S + T) = sup f(Pd, T), D (I) where the supremum is taken over the set D of all allowable decisions d. The limiting form of (1) as S -->0 is a non- linear partial differential equation (Bellman partial differential equation). This expression is important for use in actual analysis. For numerical purposes, S is kept nonzero but small. R. Bellman showed that it is possible to avoid many of the quite difficult rigorous details involved in this limiting procedure if we are interested only in the computational solution of variational processes. E. Markovian Decision Processes Consider a physical system which at any of the times t = 0, L1, 2L1, . .. must lie in One of the states SI, S2' ..., SN' Let Yi(n) be the probability that the system is in Si at times nL1, and let P ij be the probability that the system is in state Sj at t + L1 if it is in state Si at time t, We suppose that the transition probabilities Pij are inde- pendent of t, We assume that the P ij depend on a parameter q, which may be a vector, and that at each stage of the process q is to be chosen so as to maximize the probability that the system is in the state SI' We obtain the nonlinear system N yd n + 1)=max L Plj(q)Yj(n) q j1 N = L Yi(n)Pil (q*); i,:l N y;(n + 1) = L Pij(q*)y)n), i = 2, 3, "', N, j1 where q* = q*(n) in the remaining N -1 equa- tions is one of the values of q that maximize Y I (n + 1). There are similar processes that can be considered as continuous analogs of this type of decision process. These are called Mar- kovian decision processes and were discussed by Bellman. There is, however, another type of Markovian decision process in which a re- ward is given at each stage. For each state Si of the system there are k alternatives 1,2,3, .." k. If we choose the alternative h among these k alternatives, then the transition prob- abilities pt (j = 1, 2, "', n) are determined, and a reward r i ' is associated with each state Sj. Let us denote by vi(n) the total expected re- turn obtained at the nth stage by appealing to an optimal policy when the initial state is Si' Then the principle of optimality in the theory 
127 F Dynamic Programming of dynamic programming yields vi(n+ 1)=mxC pt(ri+Vj(n))). A policy-iteration method involving a value-determination operation with a policy- improvement routine was given by R. A. Howard [4]. F. Dynamic Programming and the Calculus of Varia tions Problems in the calculus of variations can be viewed as multistage decision problems of a continuous type. It can be shown that the dynamic-programming approach yields formal derivations of classical necessary conditions for the calculus of variations. Let us consider the problem of minimizing the functional J(y) = f f(x, y(x), y'(x))dx, where the function y is subject to y(a) = c. We embed this problem within the family of problems generated by aJlowing a and C to be parameters with the ranges of variation -00 < a < b, - 00 < C < 00. Now we define the optimal value function S(a,c)=minyJ(y). Then the principle of optimality yields the functional equation S(a,c) =min(fH f(x,y,y')dx+S(a+.-1,c(y))} where the minimization is taken over all func- tions defined over [a,a+.-1J with y(a)=c and c(y) = y(a + N Then, writing v = y'(a), we get as ( as ) - aa =m!n f(a,c,v)+v ac . This yields the Euler equation, the Legendre condition, the Weierstrass condition, and the Erdman corner conditions. Furthermore, it can be shown that the functional equa- tion characterization yields the Hami1ton- Jacobi partial differential equation of classical mechanics. The dynamic-programming approach can be applied to more general problems in the calculus of variations. G. Dynamic Programming and the Maximum Principle In general, the method of dynamic program- ming carries a more universal character than the maximum principle of optimal control theory. However, in contrast to the latter, this 506 method does not have a rigorous logical foun- dation. V. G. Boltyanskij [6J has presented a justification of the dynamic programming method. Let /;(x,u) (i=O, 1, ...,n) be defined for XE Vc: Rn and UE U c: Rr, where V is an open set, and continuously differentiable on Vx U. Suppose that two points X O and Xl are given in V. Among all the piecewise continuous COn- trols u(t) = (u l (t), ..., Ur(t))E U which transfer the phase point moving in accordance with dx. --i'=/;(x,U(t)), i=I,...,n, from X O = x(t o ) to Xl = x(t d, find the control u(t) for which the functional f 'l J= fo(x(t),u(t))dt '0 takes the smaJlest value, A continuous function w(x) = w(x I, '" , x n ) is caJled a Bellman function relative to a point a E V if it possesses the fOJlowing properties: (1) w(a) = 0; (2) there exists a set M (the singular set of w(x)), which is closed in V and does not contain interior points, such that the function w(x) is continuously differentiable on the set V - M and satisfies the condition EC :: /;(x,u)- fo(x,U))=O, XEV- M. The foJlowing theorem gives a sufficient opti- ma1ity condition. Theorem: Assume that for dx/dt = f(x, u(t)) given in a region V c: R n there exists a Bellman function w(x) relative to the point aE V with a piecewise smooth singular set. Assume, further- more, that for any point X O E V there exists a control u(t) which transfers the phase point from X O = x( to) to a = x( t d and satisfies the relation f 'l fo(x(t),u(t))dt= -w(X O ). '0 Then any such control u(t) is optimal in V. Recently, Vinter and Levis [7J obtained the foJlowing general result in this connection. The sufficient condition is given in terms of a solu- tion to the Bellman partial differential equa- tion. It is shown that if this equation is modi- fied so that it is actuaJly an inequality, and if this inequality is required to be satisfied in a limiting sense only, then the condition is also necessary for optimality. H. Characteristic Features of Dynamic Programming The characteristic features of the dynamic- programming approach can be summarized in 
507 the following five points: (1) the advantage of lower dimensionality in comparison with the enumeration approach; (2) the possibility of finding maxima and/or minima of functions defined over restricted domains for which differential calculus may not work well; (3) the availability of numerical solutions in recursive form; (4) the possibility of formulating certain problems to which classical methods do not apply; and (5) the applicability of the method to most types of problems in mathematical programming, such as tinventory and produc- tion control, optimal searching, and some optimal and adaptive control processes. References [IJ R. E. Bellman, Dynamic programming, Princeton Univ. Press, 1957. [2J R. E. Bellman, Applied dynamic program- ming, Princeton Univ. Press, 1962. [3J R. E. Bellman, Adaptive control processes; a guided tour, Princeton Univ. Press, 1961. [4] R. A. Howard, Dynamic programming and Markov processes, Technology Press and Wiley, 1960. [5J S. E. Dreyfus, Dynamic programming and calculus of variations, Academic Press, 1965. [6J V. G. Boltyanskii, Sufficient conditions for optimality and the justification of the dynamic programming method, SIAM J. Control, 4 (1966),326-361. [7] R. B. Vinter and R. M, Lewis, A necessary and sufficient condition for optimality of dy- namic programming type, making no a priori assumption on the control, SIAM 1. Control and Optimization, 16 (1978), 571-583. 127 Ref. Dynamic Programming 
128A Econometrics 128 (XVII1.15) Econometrics A. General Remarks The term econometrics can be interpreted in various ways, In its widest sense, it means the application of mathematical methods to economic problems and includes mathematical economics, tmathematical programming, etc. However, here we use it to mean statistical methods applied to economic analysis. The object of econometrics is to provide methods to analyze relationships between economic variables. We classify these methods into four categories according to the types of relationships involved: (1) Analysis of causal relations: If a set of variables XI' ..., X n affects an economic variable Y, we can estimate the direction and extent of those effects on Y. (2) Analysis of equilibrium: When a set of economic variables Y 1 ,..., Y m is determined through a market equilibrium mechanism, we can analyze the structure of relationships that determines the equilibrium. (3) Analysis of correlation: When a set of economic vari- ables is affected simultaneously by some (un- known) common factors, we can analyze the correlation structure of the variables. (4) Ana- lysis of time interdependence: A process of development in time of a set of economic variables can be analyzed. There are two types of economic data: (a) macroeconomic data, representing quantities and variables related to a national economy as a whole, usually based on national census; and (b) microeconomic data, representing informa- tion about the economic behavior of individ- ual persons, households, and firms. Macro- economic data are usually given as a set of time series ( 421 Time Series Analysis A), while microeconomic data are obtained main- ly through statistical surveys and are given as cross-sectional data. These two types of data, related to macroeconomic theory and micro- economic theory, respectively, require differ- ent approaches; and sometimes information obtained from both types of data has to be combined; obtaining macroeconomic infor- mation from microeconomic data is called aggregation. B. Regression Analysis The most commOn technique for the first category of problems is tregression analysis, which is applied to both microeconomic and macroeconomic analysis. However, there are 510 problems peculiar to economic analysis, where a factor can seldom be controlled; and usually there are too many highly related independent variables. In such cases, if all possible indepen- dent variables are taken into the model, the accuracy of the testimators of the coefficients becomes extremely poor. Such a phenomenon, called multicollinearity, brings up the problem of selection of independent variables ( 403 Statistical Models), to which no satisfactory solution has been given. Also, assumptions about the error terms may be dubious, the error terms may be correlated, or the variances may be different. If the tvariance-covariance matrix of the errors is given, the tgeneralized least squares method can be applied, but usually such a matrix is not available. C. Systems of Simultaneous Equations The second category of problems is peculiar to economic analysis and applies mainly to macroeconomic data. Suppose that Y = (Y 1 , ..., Y G )' is a vector consisting of G economic variables, among which there exist G relation- ships that determine the equilibrium levels of the variables. We also suppose that there exist K variables Z ==(ZJ, ..., ZK)' that are indepen- dent of the economic relations but affect the equilibrium. The variables Yare called endog- enous variables, and the Z are called exog- enous variables. If we assume linear relation- ships among them, we have an expression such as Y==BY +rz+u, (1) where Band r are matrices with constant coefficients and u is a vector of disturbances or errors. (1) is called the linear structural equa- tion system and is a system of simultaneous equations. By solving the equations formally, we get the so-called reduced form Y=IlZ+v, (2) where Il=(I _B)-I r, v=(I _B)-IU. The relation of Y to Z is determined through the reduced form (2), and if we have enough data on Y and Z we can estimate II. The problem of identification is to decide whether we can determine the unknown parameters in Band r uniquely from the parameters in the re- duced form. A necessary condition for the parameters in one of the equations in (I) to be identifiable is that the number of unknown parameters (or, since known constants in the system are usually set equal to zero, the num- ber of variables appearing in the equation) not be greater than K + 1. If it is exactly equal to K + 1, the equation is said to be just identified, 
511 and if it is less than K + 1, the equation is said to be overidentified. If all the equations in the system are just identified, for arbitrary II there exist unique B and r that satisfy II = (I - B) -1 r. Therefore, if we denote the tleast squares estimator of II by iI, we can estimate Band r from the equation (I - B)iI = t. This procedure is called the indirect least squares method and is equivalent to the tmaximum likelihood method if we assume normality for u. When some of the equations are over- identified, the estimation problem becomes complicated. Three kinds of procedures have been proposed: (1) full system methods, (2) single equation methods, and (3) subsystem methods. In full system methods all the para- meters are considered simultaneously, and if normality is assumed, the maximum likelihood estimator can be obtained by minimizing I(Y- IIZ) (Y - IIZ)' I. Since it is usually difficult to compute the maximum likelihood estimator, a simpler, but asymptotically equivalent, three- stage least squares method has been proposed. The single equation methods and the sub- system methods take into consideration only the information about the parameters in one equation or in a subset of the equations, and estimate the parameters in each equation separately. There is a single equation method, called the limited information maximum like- lihood method, based on the maximum likeli- hood approach, and also a two-stage least squares method, which estimates II first by least squares, computes  = iIz, and then applies the least squares method to the model Y=B +rZ+ii. These two and also some others are asymptot- ically equivalent. Among asymptotically equivalent classes of estimators corresponding to different information structures it has been established that the maximum likelihood estimators have asymptotically higher-order efficiency [5J ( 399 Statistical Estimation) than other estimators, and Monte Carlo and numerical studies show that they are in most cases better than others if properly adjusted for the biases. In many simultaneous equation models which have been applied to actual macrO- economic data, the values of endogenous variables obtained in the past appear on the right-hand sides of equations (1). Such vari- ables are called lagged variables, and they can be treated, at least in the asymptotic theory of inference, as though they were exogenous. Hence exogenous variables and lagged endog- enous variables are jointly called predeter- mined variables. When many lagged variables 128 Ref. Econometrics appear over many time periods and when some structure among the coefficients of those lagged variables can be assumed, such a model is called a distributed lag model. Sometimes it is necessary to include some nonlinear equations in the simultaneous equa- tion model. Such nonlinear simultaneous equation models are difficult to deal with, partly because the solution of the equation may not be unique, and in practical applica- tions ad hoc procedures are applied to obtain estimates of the parameters. D. Multivariate and Time Series Analysis Problems in the third category can be ap- proached by tmultivariate analysis techniques. Sometimes tprincipal component analysis and tcanonical correlation analysis have been applied to analyze the variations of a large amount of data. However, the practical mean- ing of the results obtained is often dubious. The fourth category is the problem of time series analysis. Sophisticated theories of sto- chastic processes have little relevance for economic time series, because usually the time series do not satisfy such conditions as being stationary or having the tMarkov property, etc. Recently, however, autoregressive moving average (tARMA) and multivariate ARMA models [4J have been applied to macro- economic data, especially for the purpose of prediction and for determining the direction of causal relations ( 421 Time Series Analysis). Traditionally, fluctuations of economic time series have been thought to consist of trend, cyclic variation, seasonal variation, and error. Various ad hoc techniques have been used to separate or eliminate such components, but the theoretical treatment of such problems is far from satisfactory. References [IJ W. C. Hood and T. C. Koopmans (eds.), Studies in econometric method, Cowles Com- mission Monograph 14, Wiley, 1953. [2] H. Theil, Economic forecasts and policy, North-Holland, 1958. [3] J. Johnston, Econometric methods, McGraw-Hili, 1963. [4] G. E. p, Box and G. M. Jenkins, Time series analysis: Forecasting and control, Holden-Day, revised edition, 1976. [5] M. Akahira and K. Takeuchi, Asymptotic efficiency of statistical estimators: Concepts and higher order efficiency, Lecture notes in statistics 7, Springer, 1981. 
129 Ref. Einstein, Albert 129 (XXI.21) Einstein, Albert Albert Einstein (March 14, 1879-AprilI8, 1955) was born of Jewish parents in the city of Ulm in southern Germany. He became a Swiss citizen sOOn after graduating from the Eid- genossische Technische Hochschule of Zurich in 1900. Afterward, he obtained a position as examiner of patents at Bern, and while at that post, he published his theories on light quanta, tspecial relativity, and tBrownian motion. After briefly holding professorships at the University of Zurich and the University of Prague, he became a professor at the Univer- sity of Berlin in 1913. His general theory of relativity was announced in 1916, and in 1921 he won the Nobel Prize in physics for his contributions to theoretical physics. To escape Nazi persecution, he fled to the United States in 1933, and until his retirement in 1945 he was a professor at the Institute for Advanced Study at Princeton. He advised President Roosevelt of the feasibility of constructing the atomic bomb, but after World War II, along with others who had been connected with the bomb, he was active in promoting the nuclear disarmament mOvement and the establishment of a world government. The theory of relativity raises fundamental epistemological problems concerning time, space, and matter. The results of the general theory were verified in 1919 by observations of the solar eclipse. Through his latter years, Einstein continued to work on tunified field theory and on the generalization of relativity theory. References [1] P. Frank, Einstein, his life and time, Knopf, 1947. [2] A. Einstein, The meaning of relativity, Princeton Univ. Press, fifth edition, 1956. [3J C. Seelig, Albert Einstein, eine dokumen- tarische Biographie, Europa Verlag, 1954. 130 (XX.16) Electromagnetism A. Maxwell's Equations Mathematical formulation of electromagnetics leads to tinitial value and tboundary value problems for MaxweIl's equations according to the geometric nature of the medium. Max- 512 well's equations for a vacuum are written in the form tooEjot == rotH -J., to div E = P., J1ooHjot= -rotE-J m , J1 o divH=Pm, (1) where E and H are the electric and magnetic field vectors, Pe and Pm the electric and mag- netic charge densities, J e and J m the electric and magnetic current densities, to and J10 con- stants, and the quantity 1j J toJ1o = c the speed of light in vacuum (2.99797 x 10 8 mjs). Charge and current densities must satisfy the equations of continuity °Pejot+divJe=O, oPmjot + div J m = O. (2) Following upon the observation that, appar- ently, Pm=O and Jm=O in nature, we hence- forth set them equal to zero. This causes an asymmetry between the electric and magnetic quantities. On the other hand, the proposition "Pm=O and Jm=O" cannot be deduced from the classical theory itself. I n the presence of matter, additional charge and current appear due to the electric and magnetic polarization P and M of the material. Therefore, in this case, it is necessary to make the foIlowing substitutions in (1): Pe-->P=Pe- divP , Je-->J =J e + oPjot +rotM, H-->H+M. (3) Moreover, if we define the electric flux density (or electric displacement) D and the magnetic flux density (or magnetic induction) B by D =toE + P, B= J1o(H + M), (4) then MaxweIl's equations (1) are transformed into oDjot= rot H -J e , div D = P., oBjot= -rotE, divB=O. (5) In the electromagnetic field in a vacuum there is energy with a density u=(toj2)E 2 +(J1oj2)B 2 , (6) and energy flux with a density expressed by the Poynting vector S = E x H. (7) Between these quantities the following relation holds: oujot+divS=O. (8) An electric charge q moving with velocity v in an electromagnetic field is subject to the Lorentz force F=qE+qv x B. (9) 
513 This force can be interpreted as being caused by the Maxwell stress tensor 7ik = (e o /2)( - EiEk + 2b ik E2) + (J10/2)( - HiHk + 2b ik H 2 ). By introducing the scalar potential Vand the vector potential A, we can express the field vecto rs as B=rotA, E= -grad V-aA/at. (11) Furthermore, if we impose an auxiliary con- dition (Lorentz condition) div A+e o J1 o av/at=o, then we obtain from (1) the wave equations OV= -Pe/e o , OA== -J1 o J e , where 0 =-eoJ1oa2/at2 is calJed the d'Alem- bertian and is sometimes written as <> 2. From (13) we conclude that the electromagnetic field can propagate in a vacuum as a wave with speed c= I/ Je o J1o ' In tquantum theory the potentials Vand A are regarded as being more fundamental than E and H themselves. However, they are not uniquely determined, in the sense that the gauge transformation v-->v+al/1/at, A-->A-gradl/1 with an arbitrary function 1/1 of the space and time variables does not affect the fields. Maxwell's equations are invariant under the tLorentz transformation. Therefore they can be written in 4-dimensional tensor form ( 359 Relativity C). Maxwell's equations can be regarded as the twave equations for tbosons with spin 1 (tphotons). The equations of quantum electro- dynamics are obtained if we regard the field quantities as quantum-mechanical vari- ables (q-numbers) and then perform tsecond quantization. B. Concrete Problems In solving Maxwell's equations concretely, we usually make additional assumptions for polarizations, electric current, and field vectors P=XeE, M=XmH, Je=O'E, where Xe> Xm, and 0' are called the electric sus- ceptibility, magnetic susceptibility, and conduc- tivity, respectively. Then we have D=eE, B=J1H, where e = eo + Xe is the dielectric constant and J1 = J10(1 + Xm) is the magnetic permeability. Therefore the equations (5) become identical to (1) if eo and J10 are replaced by e and fl, respec- tively. (Pm and J m are set equal to zero.) 130B Electromagnetism (10) Some cases of practical importance are given below. (I) Electrostatics. If the fields are time- independent and there is no electric current, then E and Hare mutually independent. The static electric field is calculated from the solu- tion of the boundary value problem of the tPoisson equation L1 V = - Pe/e. Specifically, V takes a constant value in each conductor. (2) Magnetostatics. For zero electric current, the problem of magneto statics is solved in the same way as in electrostatics. For the case of non vanishing stationary electric current, the problem is red uced to that of solving (12) A= -J1Je> divA=O. (17) (13) (3) Electric current in a conductor. A station- ary electric current in a conductor is governed by the equation of continuity div J ==0, Ohm's law J = O'E, and a special case of Maxwell's equation rotE=O. The electric current pro- duces heat (Joule heat) proportional to J. E per unit volume per unit time (Joule's law). For certain substances, the specific resis- tance 0'-1 suddenly becomes negligibly small below a critical temperature. This is calJed superconductivity. (4) Quasistationary electric circuit. The problem appearing most often in electrical en- gineering is that of a quasistationary circuit. Its characteristic feature is that the electric currents exist only in the circuit elements (inductors, capacitors, and resistors) and in the lines connecting them. The current (both J e and aD/at) can be neglected in all other parts of the system. (This could be compared to the situation in dynamics where we consider systems of material points or of rigid bodies having a finite number of degrees of freedom, although every material hody is essentially a continuum.) The system network is con- structed as a tlinear graph with the circuit ele- ments as its branches. Topological tnetwork theory deals with the relation between the structure of the linear graph and the electri- cal characteristics of the network, whereas function-theoretic network theory deals with the relation between current and voltage at each part of the network, In the latter theory, current and voltage are considered as func- tions of the frequency of the sinusoidal alter- nating voltage applied to some point of the network. Together these constitute a unique theoretical system in engineering mathe- matics for designing system networks ( 282 Networks). (5) Theory of electromagnetic waves. The theory of electromagnetic waves deals with the case where the changes of all field quantities are proportional to e i "", and in addition the frequency w is so large that the term aD/at in (14) (15) (16) 
130 Ref. Electromagnetism (5) is of the same order as or larger than J e . In such a situation the electromagnetic field behaves like a wave. Problems of various types arise depending on the geometry of the conducting and dielec- tric substances, on the type of the energy source, etc. Important problems are: (i) radi- ation of a wave from a point source into free space; (ii) scattering of a plane wave by small bodies or cylinders; (iii) diffraction of a wave through holes in a conducting plate; (iv) reflec- tion and refraction of a wave at the boundary between different media; (v) wave propagation along a conducting tube (wave guide); and (vi) resonance of the electromagnetic field in a cavity surrounded by a conducting substance. Theoretical treatment similar to that for ordi- nary networks is possible for microwave cir- cuits consisting of wave guides, cavities, etc. (6) Wave guides. For electromagnetic waves with a harmonic time dependence e- i "" propa- gating inside a hollow tube of uniform cross section extending in the z-direction with per- fectly conducting walls (called a wave guide), the Maxwell equations become rot E = iwB, divB=O, rotB= -iJ1£wE, divE=O, with the boundary condition n x E=O, n. B=O, where n is a unit normal at the boundary surface. A further reduction is gained by Fourier analysis in the z-variable. For har- monic z-dependence e ikz , the transverse com- ponents E, = (e z x E) x e z and B, =(e z x B) x e z are determined from the z-components Ez= e z . E and Hz = e z . H by the following part of the Maxwell equations: ikE, + iwe z x B, = grad, E., ikB, - iJ1£we z x E, = grad, Bz (where grad, is the transverse components of the gradient), up to the solutions for Ez = Bz = 0, called transverse electromagnetic (TEM) waves, for which k = w;;;', B, = ::!:;;;. e z x E" and E, is a solution of the electrostatic problem in two dimensions rot E, = 0, div E, =0, To have E,#O for the TEM solution, it is necessary to have two or more surfaces, such as a coaxial cable (region between two con- centric cylinders) or a parallel-wire trans- mission line. Nonzero longitudinal compo- nents Ez and Bz are determined from the 2-dimensional equations [ (:J 2 + (:yy +(J1£w 2 -k 2 )]{::}=0 (20) 514 with boundary conditions Ez =0 and aBz/an = 0 on the wall, where (a/an) denotes the partial derivative in the normal direction. The solutions with Bz = 0 are called transverse magnetic (TM) waves (or electric (E) waves) and those with Ez = 0 transverse electric (TE) waves (or magnetic (M) waves); for each of these cases the equations determine an eigen- wave number k for a given angular frequency w (typically in a waveguide situation) or eigenfrequency v = w/(2n) for a given wave- length ..1. = 2n/k (typically in a resonant cavity situation), References (18) [IJ 1. D. Jackson, Classical electrodynamics, Wiley, second edition, 1975. [2J W. K. H. Panofsky and M. Phillips, Class- ical electricity and magnetism, Addison- Wesley, second edition, 1961. [3J A. Sommerfeld, Electrodynamics, trans. by E. G. Ramberg, Academic Press, 1952. (Original in German, 1948.) [4J P. Moon and D. E. Spencer, Foundations of electrodynamics, Van Nostrand, 1960. [5J 1. A. Stratton, Electromagnetic theory, McGra w- Hill, 1941. [6J B. I. Bleaney, Electricity and magnetism, Clarendon Press, 1957. [7J R. Fano, L. J. Chu, and R. B. Adler, Electromagnetic fields, energy and forces, Wiley, 1960. [8J E. G. Hallen, Electromagnetic theory, Chapman & Hall, 1962. 131 (X.8) Elementary Functions (19) A. Definition A function of a finite number of real or com- plex variables that is talgebraic, exponential, logarithmic, trigonometric, or inverse trig- onometric, or the composite of a finite number of these, is called an elementary function. Ele- mentary functions comprise the mOst common type of function in elementary calculus. 1. Liouville [IJ defined the elementary func- tions as follows: An algebraic function of a finite number of complex variables is called an elementary function of class O. Then e Z and logz are called elementary functions of class 1. Inductively, we define elementary functions of class n under the assumption that elementary functions of class at most n -1 have already 
515 been defined, Let g(t) and gj(w" ." , w n ) (1.s; j.s; m) be elementary functions of class at most 1 and f(z I' '" , zm) be an elementary function of class at most n - 1. Then the composite functions g(/(z I' ..., zm)) and f(gdw" "', w n ), ..., gm(w p "" w n )) (and only such functions) are called the elementary functions of class at most n. An elementary function of class at most n and not of class at most n - 1 is called an elementary function of class n. A function that is an elementary function of class n for some integer n is called an elementary function. In this article, we explain the properties of the most common elementary functions. B. Exponential and Logarithmic Functions of a Real Variable Let a> 0, a # 1. A function f(x) of a real vari- able satisfying the functional relation f(x+ y)= f(x)f(y), f(I)=a, satisfies f(n) = an for positive integers nand f( -n)= l/a n for negative integers -no In gen- eral,f(n/m)= for every rational number r= n/m. If we assume that f(x) is continuous, then there is a unique strictly monotOne func- tion f(x) defined in ( -00,00) whose range is (0, 00). The function f(x) is called the exponen- tial function with the base a and is denoted by aX, read "a to the power x" and also called a power of a with exponent x. Its inverse function is called the logarithmic function to the base a, and is denoted by log" x. The specific value loga x is called the logarithm of x to the base a. If g(x) = loga x, we have g(xy)=g(x)+g(y), g(a)= 1. Hence we have xy= f(g(x)+g(y)). Therefore we can reduce multiplication to addition by using a numerical table for the logarithmic function. C. Logarithmic Computation The logarithm to the base 10 is called the common logarithm. If two numbers x, yex- pressed in the decimal system differ only in the position of the decimal point (i,e., y = x. Ion for an integer n), they share the same fractional parts in their common logarithms. The in- tegral part of the common logarithm is called the characteristic, and the fractional part is called the mantissa. (We note that the word "mantissa" is also frequently used for the frac- tional part a in the tfloating point representa- tion x =a' Ion, 1O- 1 .s;a < 1, or l.s;a < 10,) The 131D Elementary Functions common logarithms of integers have been computed and published in tables. D. Derivatives of Exponential and Logarithmic Functions The function f(x) = aX is differentiable, and f'(x} = kaf(x), where ka is a constant deter- mined by the base a, If we take the base a to be ( 1 ) v 00 1 e=lim 1+- = L -=2,71828..., V-a') v v=Q v! (1) then we have ke= 1. The number L(l/v!) is usually called Napier's number and is denoted by e after L. Euler (his letter to C. Goldbach of 1731;  Appendix B, Table 6), In 1873, C. Hermite proved that e is a transcendental number. We sometimes denote eX by expx; the term exponential function usually means the function exp x. The function eX is invariant under differentiation, and conversely, a func- tion invariant under differentiation necessarily has the form Ce x . The logarithm to the base e is called the Napierian logarithm (or natural logarithm), and we usually denote it by logx without explicitly naming the base e (some- times it is denoted by In x). The derivative oflog x is l/x, hence we have the integral representation f XdX logx= -. I X (3) (2) The constant factor ka in the derivative of aX is equal to loga. The graphs of y=e x and y=logx are shown in Fig. 1. The functions eX and 10g(1 + x) are expanded in the following Taylor series at x = 0: 00 XV e X == " _ L. " vo v. (4) 00 XV 10g(1 +x)= L (_1)V+1_. Fl V (5) The power series in the right-hand sides of (4) and (5) are called the exponential series and the logarithmic series, respectively. The radii of convergence of (4) and (5) are 00 and 1, respectivel y. y 1.J=e:r: x Fig. 1 
131 E Elementary Functions E. Trigonometric and Inverse Trigonometric Functions of a Real Variable The trigonometric functions of a real variable x are the functions sin x, cos x ( 432 Trig- onometry) and the functions tan x = sin x/cos x, cot x = cos x/sin x, sec x = l/cos x, and cosec x = 1 /sin x derived from sin x and cos x. The derivatives of sin x and cos x are cos x and -sin x, respectively. They have the following Taylor expansions at x = 0: 00 (-1)' sin x == L - x 2v +! v=0(2v+ I)! ' 00 (-1)' cosx= L _x 2v v=0 (2v)! ' The radii of convergence of (6) and (7) are both 00. The inverse functions of sin x, cos x, and tan x are the inverse trigonometric functions and are denoted by arcsinx, arccos x, and arc tan x, respectively. (Instead of this nota- tion, sin- l x, cos- I x, and tan-I x are also used), These functions are infintely multiple- valued, as shown in Fig. 2. But if we restrict their ranges within the part shown by solid lines in Fig. 2, they are considered single- valued functions. To be more precise, we re- strict the range as follows: - n/2.s; arc sin x.s; n/2, O.s;arccosx.s;n, - n/2 <arctan x <n/2, y r -", I  Yd.rcsin x yarccos x ".l?' lI=arctan ..( , , x --- /"",-- , , --- Fig. 2 The functions having these ranges are called the principal values and are sometimes denoted by Arcsin x, Arccosx, and Arctanx, respec- tively. The derivatives of these functions are (l-x 2 r l / 2 , -(I-x2rl/2, (1 +x2rl, respec- tively ( Appendix A, Table 9.1; for the Tay- lor or Laurent expansions of tan x, cot x, see x, cosec x, arc sin x, arc cos x, arc tan x, etc.,  Appendix A, Table IO.lV). F. Hyperbolic Functions Let P be a point on the branch of the hyper- bola x 2 -l = 1, x> 0, and let 0 be the origin and A the vertex (1,0) of the hyperbola. De- note by 0/2 the area of the domain surrounded by the line segments OA, OP, and the arcAP 516 of the hyperbola. Then we define the coordi- nates of P to be (cosh 0, sinh 0) as functions of O. We have cosh x =(e X +e -X)/2, sinh x =(e X -e- X )/2, (8) (6) called the hyperbolic cosine and hyperbolic sine, respectively. As in the case of trigonometric functions, we define the hyperbolic tangent by tanh x = sinh x/cosh x, the hyperbolic cotangent by coth x = cosh x/sinh x, the hyperbolic secant by sech x = l/cosh x, and the hyperbolic cose- cant by cosechx= l/sinhx. They are called the hyperbolic functions. The graphs of sinh x and cosh x are shown in Fig, 3, The trigono- metric functions are sometimes called circular functions, (7) Fig. 3 We now introduce the Gudermannian (or Gudermann function): 0= gd u = 2 arc tan e U - n/2, 1 1 + sin 0 u = gd -10 = log Itan 0 + secOI = -log. 2 1 - Sill 0 x Then the hyperbolic functions can be ex- pressed in terms of the trigonometric func- tions. For example, sinhu=tanO, coshu=secO, tanhu=sinO. G. Elementary Functions of a Complex Variable (Appendix A, Table 10) (1) Exponential function. The power series (4) converges for all finite values if we replace x by the complex variable z and gives an tentire function of z with an tessential singularity at the point at infinity. This is the exponential function e Z of a complex variable z. It satisfies the addition formula (1), e Z1 +Z2 =e Z1 e Z z, and it is also the tanalytic continuation of the ex- ponential function of a real variable. For a purely imaginary number z = iy, we have the Euler formula eiY=cosy+isiny. (9) The function w=e z gives a tconformal map- ping from the z-plane to the w-plane, as shown in Fig. 4, which maps the imaginary axis of 
517 the z-plane onto the unit circle of the w-plane (w=u+iv). For z=x+iy (x,y are real num- bers), we have e Z = eX e iY ; hence e Z is a tsimply periodic function with fundamental period 2ni. "i Y -1 0 x -7ri z-plane w-plane Fig. 4 (2) Logarithmic function. The logarithmic function logz of a complex variable z is the inverse function of e Z . It is an infinitely multiple-valued analytic function that has tlogarithmic singularities at z = 0 and z = 00. AIl possible values are expressed by logz + 2nni (n is an arbitrary integer), where we select a suitable value logz. The principal value of logz is usuaIly taken as log r + ie, where z = re iO (r = Izl, e is the argument of z) and O.s; e < 2n. (Sometimes the range of the argument is taken as - n < e.s; n.) The principal value of logz is sometimes denoted by Logz, The power series (5) gives one of its tfunctional elements, The integral representation (3) holds for a complex variable z. The multi valency of logz results from the selection of a contour of integration; the integral of 1/z around the origin is 2ni, which is the increment oflogz. (3) Power. The exponential function a Z for an arbitrary complex number a is defined to be exp(z loga). Similarly, z. is defined to be exp(alogz). The function z. is an algebraic function if and only if a is rational. In other cases, the function z. is an elementary function of class 2. (4) Trigonometric functions, inverse trig- onometric functions, hyperbolic functions. The trigonometric, inverse trigonometric, and hyperbolic functions of a complex variable are defined by the analytic continuations of the corresponding functions of a real variable, For example, sinz and cosz are defined by the power series (6) and (7), respectively. They are entire functions whose zerO points are nn and (n -t)n (n is an integer), respectively. They are also represented by tWeierstrass's infinite product (Appendix A, Table 10.VI). The functions tan z, cot z, sec z, and cOsec z are tmeromorphic functions of z on the com- plex z-plane, and they are expressed by tMittag-Leffier partial fractions ( Appendix A, Tables lO.IV, 10,V). As can be shown from 132 A Elementary Particles (8) and (9), we have cosz = (e iz +e- iZ )/2, sinz = (e iz _ e- iZ )/2i, u coshz = cos iz, sinh z = (sin iz)/i. (10) Each of these formulas (10) is caIled an Euler formula, For a complex variable, the trig- onometric and hyperbolic functions are com- posites of exponential functions, the inverse trigonometric functions are composites of logarithmic functions, and all of them are elementary functions of class 1. The definition of elementary functions by Liouville described in Section A refers, of course, to the functions of a complex variable. We remark that the inverse function of an elementary function is not necessarily an elementary function. FOr example, the inverse function of y = x - a sin x is not an elementary function ( 309 Orbit Determination B). The derivative of an elementary function is also an elementary function. However, the tprimitive function of an elementary function is not necessarily an elementary function. The primitive function of a rational function or an algebraic function of tgenus 0 is again an elementary function, Similar properties hold for rational functions of trigonometric func- tions. LiouviIle [lJ carried through a deep investigation of the situation where the in- tegral of an elementary function is also an elementary function, References [1J 1. Liouville, Memoire sur la classifica- tion des transcendantes et sur l'impossibilite d'exprimer les racines de certaines equations en fonction finie explicite des coefficients, J. Math. Pures Appl., 2 (1837),56-104; 3 (1838), 523-546. [2J E. G. H. Landau, Einfiihrung in die Dif- ferentialrechung und Intergralrechnung, Noordhoff, 1934; English translation: Dif- ferential and integral calculus, Chelsea, 1965. [3J N. Bourbaki, Elements de mathematique, Fonctions d'une variable reele, ch. 3, Fonc- tions elementaires, Actualites Sci. Ind., 1074b, Hermann, second edition, 1958. Also  references to 106 Differential Calculus. 132 (XX.31) Elementary Particles A. Introduction The word "atom" is derived from the Greek word for indivisible. It turns out that an atom 
132 A Elementary Particles is divisible into its constituent nucleus and electrons. The nucleus, in turn, consists of protons and neutrons (together called nu- cleon). Photons (quanta of electromagnetic waves), electrons, protons, and neutrons (de- noted by y, e, p, and n, respectively), along with many other subsequently discovered sub- nuclear particles, are called elementary parti- cles, while nuclei, atoms, and molecules are the composite particles composed of these elemen- tary particles. States of a particle form an irreducible (uni- tary) representation of the tproper inhomoge- neous Lorentz group with positive energy (or a finite direct sum of such representations). Thus a mass (mO) and a spin (j=0, 1, 1,...) are assigned to each particle ( 258 Lorentz Group). For example, e, p, and n have spin t and nonzero masses, while y has spin 1 and zero mass. Many elementary particles are uIlstable, decaying into other particles. The average lifetime is denoted by r and its inverse is called the half-width. The time in which half of many samples of the same particle decays is called the half-life, given by r log 2. For example, electrons, protons, and photons are supposed to be stable (or at least to have very long life- times), while a neutron decays into a proton, an electron, and a neutrino v ([3-decay) with a lifetime of about 15 minutes. From the study of the relativistic equations (Dirac equations) for wave functions of an elec- tron, P. A. M. Dirac predicted the existence of particles with the same mass as the electron, but of opposite electric charge (Dirac's hole theory, 1930). These were discovered in 1932 and called positrons. Every elementary particle is now believed to be associated with an anti- particle characterized by the opposite sign of the particle's additive quantum numbers, the two being connected by the tPCT theorem. Hence the positron is the antiparticle of the electron. The antiproton, theoretically ex- pected for a long time and experimentally found in 1955, is the antiparticle of the proton. Antiprotons, antineutrons, and positrons are constituents of antimatter. The particles whose additive quantum numbers are all invariant under change of sign, such as photons (y), neu- tral pions (no), etc., seem to be antiparticles of themselves and are said to be self-conjugate. Elementary particles have four distinct types of interaction: gravitational, weak, electro- magnetic, and strong, in increasing order of interaction strength. Gravitational and electro- magnetic interactions were recognized in earlier centuries because these interactions are of long range. A. Einstein put forward the idea of a light quantum or photon as a lump of electromagnetic energy behaving like a par- 518 ticle. Whether a corresponding quantum (grav- iton) exists for gravitational interactions is a question related to the existence of gravita- tional waves themselves, and is not yet settled. The nuclear force is an example of a strong interaction and is studied to elucidate nuclear structure and to derive the tcross sections of various collision processes involving nuclei. H. Yukawa predicted in 1935 the existence of a particle associated with the nuclear force, just as photons are associated with the electro- magnetic interaction. Its mass was predicted, from the range of the nuclear force, to be about 200 times the electron mass, which is intermediate between the masses of electrons and nucleons, and hence the particle was named a mesotron or a meSOn. These were found in cosmic rays in 1947 and, in fact, it was found that the mesons relevant to the nuclear force (now called pions and denoted by n+, n-, nO according to their electric charge) decay into other kinds of particles called muons (denoted by J1 +, J1-) with a charged- pion lifetime of 2.6 x 10- 8 sec (n:1: --> J1:!: + v(v); the neutral pion nO decays, over a shorter life- time, into photons). The muons then decay into electrons and neutrinos (J1:!: -->e:!: + v + v, v indicating antineutrinos) with muon life- times of about 2.2 x 10- 6 sec. Weak inter- actions are relevant to these decays as well as to the [3-decay of the nucleus and the neutron. Since 1962 electron neutrinos V e and muon neutrinos v have been distinguished in these decays, SO that electron and muon numbers may be conserved. Since 1949, many new particles (unstable under weak interactions) have been gradually found in cosmic rays; these are called strange particles. Some of the early ones are hyperons (A,.E,2), of spin 1, and kaons (K, K), of spin O. F or such strange particles, the strangeness quantum number, which is preserved in the strong interaction, has been introduced, and the Nakano-Nishijima-Gell-Mann formula concerning this number is known to hold. This says that Q = 13 +t(B+ S) for each elementary particle, where Q is the electric charge in units of that of the positron, 13 is the third compo- nent of the isospin, B is the baryon number, and S is the strangeness. Quantum-mechanical wave functions for a system of identical particles seem to be either totally symmetric (Bose statistics) or totally antisymmetric (Fermi statistics) under permu- tations of particles. Accordingly, particles are either bosons (or Bose particles) or fermions (or Fermi particles). All bosons seems to have integer spins and all fermions half-odd-integer spins. This is the connection of spin and statis- tics and follows from certain axioms in quan- tum field theory ( 150 Field Theory). There 
519 has been some discussion on intermediate statistics (parabosons and parafermions). B. Families of Elementary Particles Elementary particles are classified into families of leptons, photons, and hadrons. The family of leptons now consists of elec- trons (e), muons (J1), and tau-leptons (r), their accompanying neutrinos (v e , VI" v t ), and their antiparticles; r was discovered in the late 1970s. Experimentally, V t is not well estab- lished nor has the possibility v, = V e yet been excluded. Leptons have spin t. They are char- acterized by having no strong interactions. The family of photons consists of photons and the recently discovered intermediary weak vector bosons, Gluons ( Section C (4)), if they exist, also belong to this family. The family of hadrons has a large number of members which are either mesons or baryons, the former being bosons and the latter fer- mions. We now have, in addition to pions and kaons, many resonant mesons (unstable under strong interaction) such as p-mesons and (J)- mesons. At present we have, in all, more than 20 species of mesons. This number does not count spin, charge, and antiparticle degrees of freedom. The baryon subfamily includes nu- cleons, hyperons, and excited states, now con- sisting of more than 30 species, again not counting spin, charge, and antiparticle degrees of freedom. We have nucleonic resonances with spin as high as 11/2. Hadrons are now well understood as com- posites of subhadronic constituents called quarks (and antiquarks), although free quarks have not been observed. (Hence the problem of quark confinement has been discussed ex- tensively.) Mesons are systems made up of a quark and an antiquark. Baryons are sys- tems made up of three quarks. Therefore, at our present level of knowledge, the elemen- tary particles might be leptons, photons, and quarks (and the corresponding antiparticles). C. Methods in the Theory of Elementary Particles (1) tQuantum Field Theory. Application of the ideas of quantum mechanics to electro- magnetic fields and their interaction with elec- trons resulted in the formulation of quantum electrodynamics (and more generally quan- tum field theory). Application of quantum- mechanical perturbation theory to quantum electrodynamics with the fine-structure con- stant ex==e 2 /(hc) (about 1/137) as an expansion parameter resulted in divergent expressions- 132C Elementary Particles the so-called divergence difficulties. The ultra- violet divergence comes from integration over high momenta (of virtual particles), and the infrared divergence is due to the zero mass of photons. It was later found that the ultraviolet divergence can be combined with a small number of parameters of the theory (i.e., elec- tron mass, (possibly nonzero) photon mass, and electromagnetic coupling constant e) into a revised set of constants (called the renormal- ized masses and coupling constant), which are then equated to the observed finite values of these constants-a procedure called renor- malization. Physically, this renormalization is pictured to be effected by virtual photons (and electron-positron pairs) surrounding (bare) electrons and photons. The infrared divergence is supposed to be a reflection of the fact that electrons can be accompanied by infinitely many photons with negligibly small total energy (a situation made possible by the zero mass of photons); this cannot be experimen- tally analyzed (and is indistinguishable from a single electron). The relativistically covariant formulation of the renormalized perturbation theory of quantum electrodynamics proposed by S. Tomonaga, J. S. Schwinger, and R. P. Feyn- man (independently and in different forms), and in particular the Feynman rules and Feynman diagrams that lead to the Feynman integrals ( 146 Feynman Integrals), made possible detailed theoretical computations, and the computed values (such as the Lamb shift of hydrogen and the anomalous magnetic moment of an electron) fit marvelously well with observed values-an achievement con- sidered a great success of quantum electro- dynamics. F. 1. Dyson more or less showed that the renormaIization procedure really absorbs all the divergences in all orders of the perturbation expansion in terms of renormal- ized constants, though there were later refine- ments and elaborations of the proof. This work also leads to the division of quantum field theories into two classes: renormalizable theories, where (infinitely many) ultraviolet divergences can be absorbed into a finite number of constants by renormaIized pertur- bation theory, and unrenormalizable theories. The question of whether perturbation series converge in some sense is an unsolved ques- tion of quantum electrodynamics. In quantum field theory, the central role is played by quantum fields, which are operator- valued generalized functions of a space-time point. Particle interpretations of any state at infinite past and infinite future are obtained in the theory from the asymptotic behavior of the fields (tasymptotic fields) at time -00 and time +00, and from their relation, expressed by the 
132C Elementary Particles tS-matrix, describing how particles scatter by collision, Thus any model of quantum fields makes, in principle, a prediction about what particles appear and how they behave (asymp- totically) in mutual collisions. After the success of quantum electrodynam- ics, the perturbation theory of quantum fields was applied to systems of pions and nucleons; this proved to be unsuccessful, possibly due to the lack of an appropriate expansion param- eter. This has led some people to study the mathematically rigorous consequences of quantum field theories; these mathematical consequences do not require any perturbation calculation, simply following from a small number of mathematically formulated axioms believed to be satisfied by a large class of quantum field theories. This approach, re- ferred to as taxiomatic quantum field theory, has yielded a few physically meaningful con- sequences of general nature: analyticity of some S-matrix elements, the tpCT theorem, and the connection of spin and statistics ( Section A). While the axiomatic approach provides a general framework, the tconstructive field theory developed later provides examples of quantum field theories that fit into such a framework. Because of its concrete nature, it can make statements about detailed properties of the model, such as the (non- )existence of composite particles, the establishment of per- turbation theory as an asymptotic expansion, and phase-transitions phenomena and the related broken symmetry as the coupling con- stant varies. It has, however, been successful for only 2 and 3 space-time dimensions. (2) Analytic S-Matrix Approach. Due to the failure of the perturbation approach in quan- tum field theory, a new approach was devel- oped based on assumptions about the analy- ticity properties of the S-matrix elements. The assumed analyticity properties were surmised from examination of Feynman integrals and of nonrelativistic potential scattering, and par- tially follow from axiomatic field theory. In this approach, the information that scattering amplitudes, or S-matrix elements, possess certain analytic properties with respect to energies, scattering angles, and so On is ex- pressed by means of integral representations. For example, the forward two-body scattering amplitude f(s) as a function of s = (energy in the center of mass system)2 is written as 1 f ro Imf(s') f(s)=- ----,-ds'. n -00 s - s Imf(s) is related to the total cross section by the optical theorem, which is a statement of 520 the unitarity of the S-matrix. Hence equation (1), called a dispersion relation, gives a relation among observable quantities. An integral rep- resentation for the general two-body scattering amplitude f(s, t, u) (2 incoming and 2 outgoing particles) has been proposed by S. Mandelstam and is called the Mandelstam representation, where S=(PI + P2)2, t =(PI + P3)2, and u = (PI + P4)2 (squares in a Minkowski metric) and PI' P2, P3, P4 are the 4-momenta of the incoming and outgoing particles, with sign reversed for the latter. (Some relations hold among variables: P; = m; with m i the mass, LPi=O, S+t+u=Lm(.) Under the inter- change of incoming and outgoing particles, the corresponding f's are related by analytic continuation. This is called the crossing sym- metry. The study of the S-matrix directly from its analyticity and unitarity is called the S-matrix approach ( 386 S-Matrices). The analyticity of the two-body scatter- ing amplitude f as a function of the angular momentum 1 with a fixed s was investigated by T. Regge for nonrelativistic potential scatter- ing, and later the idea was applied to the S- matrix approach. The poles 1 = I(s) of f (/ is considered to be a function of 1 for each fixed s) are called Regge poles, Regge trajectories 1 = I(s) for variable s.s;O have been shown to play important roles in the high-energy behavior of scattering amplitudes for small values of the variable t (four-momentum-transfer squared). An approximate expression of S-matrix elements, with Regge poles and satisfying the crossing symmetry, was introduced by G. Veneziano and is called the Veneziano model. It has developed into the so-called dual reso- nance model (dual in the sense that s-channel poles are dual to t-channel poles) and has subsequently evolved into the string model of hadrons, according to which hadrons are viewed as systems composed of strings joining quarks (and antiquarks). (1 ) (3) Group-Theoretical Approach. In connec- tion with the symmetry properties shown by the spectra and reaction patterns of hadrons, a group-theoretical approach has been de- veloped. For example, the similarity of the behavior of neutrons and protons in nuclei, apart from the difference in their electro- magnetic properties, was formulated as isospin invariance (under the group SU(2)). The sym- metry properties are sometimes understood in terms of new additive quantum numbers that are conserved or nearly conserved, and these properties are made more concrete in the final stage by the introduction offundamental constituents carrying these quantum numbers, Based on the canonical formalism of quan- tum field theory, (Noether) currents, as 
521 quantum-mechanical generators of the sym- metry, are introduced in association with con- served quantum numbers. The commutation relation of these currents, referred to as a cur- rent algebra, shows the structure of a Lie alge- bra expressing the symmetry of the Lagran- gian of the system of hadrons; for example, SU(3) x SU(3) for three species (flavors) of massless quarks (the first factor for vector currents and the second for axial-vector cur- rents). The approach showed remarkable suc- cess when combined with the hypothesis of the partially conserved axial-vector currents (PCAC), which requires the divergence of the axial-vector currents to be proportional to the pseudoscalar meson fields. Even if a theory has a symmetry under a group G in its formulation, a vacuum state of the theory might not have a symmetry under G. If that occurs, we speak of spontaneously broken symmetry. Under some assumptions, a particle of zero mass (which is connected to the vacuum by the current for the spontaneously broken symmetry) is associated with the spon- taneously broken symmetry. This statement is called Goldstone's theorem, and the relevant particle of zero mass is called the Nambu- Goldstone boson, The PCAC hypothesis is believed to be connected with the spontaneous breakdown of the axial SU(3) symmetry with pions, kaons, and eta-mesons as Nambu- Goldstone bosons, This approach produced the Adler-Weisberger sum rule, which relates the weak axial-vector coupling constants to pion-nucleon scattering cross sections. A group-theoretical attempt to treat fer- mions and bosons on an equal footing resulted in the introduction of super Lie algebras (Zr graded Lie algebras). The basic new ingredient in this approach is a special class of generators roughly interpreted as the square root of the four-momentum, whose anticommutators (instead of commutators) are linear combina- tions of ordinary generators. A supermultiplet, which is an irreducible representation of a super Lie algebra, consists of both fermions and bosons, Extensions to local super Lie algebras (and also incorporation of gravitons into the framework) have been tried, with no realistic model emerging so far. (4) Non-Abelian Gauge Field Theory. Recently, quantum field theory has been revived as non- Abelian tgauge theory, resulting in what are considered to be successful qualitative and semiquantitative predictions. The quantization of the theory was at first carried out in terms of the tFeynman path integral, where fictitious particles, called Faddeev-Popov ghosts, appear through the precise definition of the functional measure of the path integral. The canonical 132D Elementary Particles quantization has also been formulated with the explicit introduction of Faddeev-Popov ghosts from the beginning. Non-Abelian gauge field theory exhibits the very important property of asymptotic freedom, which states that the interaction at asymptotically high energies, or at very short distances, approaches that of frce (noninteracting) theory. This prop- erty is required for the description of hadronic systems made of quarks in view of the experi- mental observation of the scaling behavior of deep inelastic inclusive structure functions, Thus gauge theory is believed to describe the dynamics of systems of quarks in hadrons. It is called quantum chromodynamics, and the quantum of the gauge field is called the gluon. The recent development of gauge field theory has been accompanied by many technical refinements of quantum field theory, which include the methods of dimensional regulariza- tion and renormalization groups. Dimensional regularization starts with Feynman integrals defined for n-dimensional momenta with n#4, in order to give meaning to integrals divergent for n=4. Then Feynman integrals have poles at n=4, which are ab- sorbed into the unrenormalized constants by means of the renormalization procedure. This method is particularly suited for non-Abelian gauge theory, since it is the regularization method that keeps gauge invariance at each step of the calculation. The trenormalization-group equation has been known for a long time. It results from the requirement that physical quantities should be dimensionally covariant under re- normalization of constants in the theory. The renormalization-group equation is usually written as a differential equation for a Green's function, expressing the fact that a change of scale of momenta is balanced by changing coupling constants and masses. Further refine- ments are due to C. G. Callan, K. Symanzik, S. Weinberg, and G. 't Hooft. D. Models of Elementary Particles Hadrons are now considered to be made of more fundamental constituents; at present, at least 15 species of subhadronic, fermionic constituents (quarks) have been proposed. Attempts to understand subhadronic structure have a long history, the main landmarks of which include the Fermi-Yang model (where n-mesons are supposed to be made of protons and neutrons), the Sakata model (where all hadrons are supposed to be made of protons, neutrons, and A-hyperons), and a variety of quark models developed from the eightfold way of M. Gell-Mann and Y. Ne'eman (new 
132 Ref. Elementary Particles assignments ofrepresentations of SU(3) to particles somewhat different from the Sakata model; for example, the octet representation for mesons and low-lying baryons, and the decuplet for excited baryons). Originally, quarks were supposed to come in 3 species (u-quarks, d-quarks, s-quarks), each carrying its own quantum number, now called the flavor quantum number. Then it was suggested that each flavored quark has three additional degrees offreedom, i.e., three new quantum numbers (called color quantum numbers) in order to account for experimental data: (1) the spin-statistics problem of baryonic ground state wave functions, (2) the decay rate of n--> 2y, (3) the Drell ratio (= total cross section for {e +e+ -->anything}, divided by the cross section for e- + e+ -->J1- + J1+). Recently, two new flavor degrees of freedom besides the old 3 flavors (u, d, s) have been discovered, the carriers of which are c-quarks and b-quarks, c being the constituents of J jIfJ-particles, b of Y-particles. The combination of 5 flavors and 3 colors results in 15 quarks, as stated earlier. The so-called standard model is a quantum field theory based on a local, non-Abelian gauge group SU(3) x SU(2) x U(I). The group SU(3) is called the color SU(3) group, which is supposed to be strictly unbroken and ex- presses the invariance of the theory under the local S U (3) transformation of the three color degrees of freedom. Quarks having spin t transform as its 3-dimensional fundamental representation, and the vector gauge bosons transforming as its 8-dimensional regular representation are gluons. This part of the theory is quantum chromodynamics (QCD). The remaining part of the theory, based on the local gauge group SU(2) x U(I), is called the Glashow-Weinberg-Salam model or its hadronic extension quantum flavor dynamics (QFD), and unifies the electromagnetic and the weak interactions. This gauge group is supposed to be spontaneously broken with the only unbroken subgroup U(l)' corresponding to the electromagnetic gauge transformation. The underlying mechanism for the spontaneous breakdown of the gauge group SU(2) x U(I) is not well understood, but conventionally it is assumed to occur through the so-called Higgs mechanism. This is a mechanism proposed by P. W. Higgs, whereby the Goldstone boson acquires a nonzero mass if the broken sym- metry occurs in the presence of an associated massless vector field (called a gauge vector field), which also becomes associated with massive bosons. The gauge vector bosons are identified with photons (y) (unbroken sym- metry and hence massless) and weak inter- mediary bosons (broken symmetry and hence massive), usually denoted by Wand Z. Quarks 522 and leptons transform under the group SU(2) as doublet or singlet representations. The renormalizability of the spontaneously broken gauge field theory has been established by 't Hooft. Major predictions of the G1ashow- Weinberg-Salam model, including the existence of the gauge bosons Wand Z, have been borne out experimentally. Grand unified models attempt to unify QCD and QFD, employing a larger Lie group containing SU(3) x SU(2) x U(l) as a sub- group. The most popular ones are those based on the groups SU(5), SO(IO) and on some of the exceptional groups. Super grand unified models attempt to unify QCD, QFD, and the gravitational interaction, with super Lie groups as a possible basis. Recently, there have been attempts to search for subquark structures. References [IJ 1. D. Bjorken and S. D. DreIl, Relativistic quantum mechanics, McGraw-Hili, 1964. [2] J. D. Bjorken and S. D. Drell, Relativistic quantum fields, McGraw-Hili, 1965. [3] G. F. Chew, The analytic S-matrix, Ben- jamin, 1966. [4] S. L. Adler and R. F. Dashen, Current algebras, Benjamin, 1967. [5J V. de Alfaro, S. Fubini, G. Furlan, and C. Rossetti, Currents in hadron physics, North- HoIland, 1973. [6J M. GeIl-Mann and Y. Ne'eman, The eight- fold way, Benjamin, 1964. [7J R. E. Behrends, J. Dreitlein, C. Fronsdal, and W. Lee, Simple groups and strong interac- tion symmetries, Rev. Mod. Phys., 34 (1962), 1-40. [8J E. Fermi and C. N. Yang, Are mesons elementary particles? Phys. Rev., 76 (1949), 1739-1743. [9J S. Sakata, On a composite model for the new particles, Prog. Theoret. Phys., 16 (1956), 686-688. [10] L. Corwin, Y. Ne'eman, and S. Sternberg, Graded Lie algebra in mathematics and phys- ics, Rev. Mod. Phys., 47 (1975),573-603. [11J M. Jacob (ed.), Dual theory, Physics Reports Reprint Book Series, vol. 1, North- Holland, 1974. [12] J. Scherk, An introduction to the theory of dual models and strings, Rev. Mod. Phys., 47 (1975),123-164. [13J H. Harari, Quarks and leptons, Phys. Rep., 42 (1978), 235-309. [14J E. S. Abers and B, W, Lee, Gauge theo- ries, Phys. Rep., 9 (1973), 1-141. [15J S. Weinberg, Recent progress in gauge theories of the weak, electromagnetic and 
523 strong interactions, Rev. Mod. Phys., 46 (1974),255-277. [16] 1. C. Taylor, Gauge theories of weak interactions, Cambridge Univ. Press, 1976. 133 (XIV.g) Ellipsoidal Harmonics A. Ellipsoidal Coordinates If a>b>c, then for any given (x,y,z)ER 3 , the three roots of the cubic equation in e x 2 y2 Z2 F(e)=-+-+--l=O a 2 + e b 2 + e c 2 + e are real and lie in the intervals e> - c 2 , - c 2 > e> _b 2 , and -b 2 >e> _a 2 . Denoting these three roots by ..1., J1, and v (they are labeled so as to satisfy the inequalities ..1. > - c 2 > J1 > -b 2 >v> _a 2 ), F(A)=O, F(J1) ==0, and F(v)=O represent an ellipsoid, a hyperboloid of one sheet, and a hyperboloid of two sheets, re- spectively. They are confocal with the eIlipsoid x 2 l Z2 -+-+--1 =0 a 2 b 2 c 2 ' pass through the point (x, y, z), and mutually intersect orthogonally. The quantities ..1., J1, v are called the ellip- soidal coordinates of the point (x, y, z). Rect- angular coordinates (x, y, z) are expressed in terms of ellipsoidal coordinates (..1., J1, v) by the formula 2 (a 2 +A)(a 2 +J1)(a 2 +v) x = (a 2 _b 2 )(a 2 _c 2 ) , and two others obtained from (1) by cyclic permutations of (a, b, c) and (x, y, z). B. Ellipsoidal Harmonics When a tharmonic function i/J of three real variables is constant on the surface ..1. = con- stant, J1 = constant, or v = constant in ellipsoidal coordinates, the function i/J is called an ellip- soidal harmonic. A solution of Laplace's equa- tion i/J=O in the form i/J=;\(A)M(J1)N(v) can be obtained by the method of separation of variables, The equation i/J = 0 is written in the form a ( ai/J ) L(J1-V)A aA A aA =0, where the summation is taken over the even permut ations of (..1., J1, v), and A = J (a 2 + A)(b 2 + A)(c 2 + ..1.). 133C Ellipsoidal Harmonics The ordinary differential equation d ( d;\ ) 4A dA A dA = (KA + C);\ (2) is satisfied by ;\ and also by M and N if we replace ..1. by J1 and v, respectively. Equation (2) is called Lame's differential equation, with K and C the separation cOnstants. Let K =n(n+ 1) for n=O, 1,2,.... Then equation (2), for a suitable value (the eigen- value) of C, has a solution that is a polynomial in ..1. or a polynomial multiplied by one, two, or three of Ja 2 +A , Jb 2 +A, and Jc 2 +..1.. Among these solutions 2n + 1 are linearly independent. We denote these solutions by ;\ = j"m(A) (m= 1,2, ...,2n+ 1). They are essen- tially equivalent to the Lame functions, to be defined at the end of this section. To be pre- cise, by setting A+(a 2 +b 2 +c 2 )/3 =, C=B+n(n+ 1)(a 2 +b 2 +c 2 )/3, e l =(b 2 +c 2 -2a 2 )/3,...; el+e2+ e 3=0, we have d 2 ;\ + ( £+£+£ ) d;\ de -el -e2 -e3 d n(n+l)+B ;\. (3) 4( - el)( -e2)( - e 3 ) This can also be written in the form d 2 A du 2 =(n(n+ l)p(u)+B);\ (4) (1 ) by the change of variable  = fJ(u) with the Weierstrass tp-function. The differential equation (3) has =e" e2, e 3 , 00 as tregular singular points. A solution of (3) that is a polynomial in  or a polynomial multiplied by one, two, or three of , J -e2 ' and J-e3 is called a Lame function of the first kind. C. Classification of Lame Functions The 2n + 1 linearly independent solutions j"m(A) of (2) are classified into the following four families. If n is an even number 2p, then p+ 1 solutions j"m(A) among the 2n+ 1 solu- tions are polynomials in ..1. of degree p, and the other 3p functions are polynomials in ..1. of de- gr ee p-l multip lied by J (b 2 + A)(c 2 + ..1.), J (c 2 + A)(a 2 + ..1.), or J (a 2 + A)(b 2 + ..1.). Since all these polynomials are products with real factors of degree 1, the solutions belonging 
133D Ellipsoidal Harmonics to the first family are of the type j"m(},) = (), _ 8d(A - 02)'" (..1. - On/2)' while solutions of the latter kinds are of the type { J(b 2 +A)(C 2 +A) } j"m(A)= J(c2+A)(a2+A) J (a 2 +A)W+A) X(A-Od(A-02)",(A-8n/2-1)' (6) The functions (5) and (6) are calIed Lame functions of the first species and of the third species, respectively. On the other hand, if n is an odd number 2p+ 1, then 3(p+ 1) solutions among the 2n + 1 functions j"m(A) are of the type j"m(A)= { ::: } Jc 2 +A x (..1. - 8 1 )(..1. - 02)'" (..1. - 8(n-1)/2), (7) and the other p functions are of the type j"m(A)= J (a 2 +A)(b 2 +A)(c 2 +..1.) x(A-8d(A-82)...(A-8(n-3)/2)' (8) The functions (7) and (8) are calIed Lame functions of the second species and of the fourth species, respectively. Hence in either case we have 2n + 1 linearly independent Lame functions. When n is even, we obtain an elIipsoidal harmonic n/2 1/1::'= Tl (A-8 p )(J1-0 p )(v-8 p ) p1 by multiplying j"m(A), j"m(J1), and j"m(v) belong- ing to the first family. Also, in this case, by setting x 2 y2 Z2 8 =-+-+--1 p a 2 + 8 p b 2 + 8 p c 2 + 8 p (..1. - 8 p )(J1- 8 p )(v - 8 p ) (a 2 + Op) (b 2 + B p ) (c 2 + 8 p ) we have 1/1::' = 8 1 8 2 ", 8n;2 up to constant coefficients. Utilizing Lame functions of the third species (instead of func- tions of the first species) and formula (I), we find that 1/1::'=(yz or zx or xy) x8 1 8 2 ,.. 8 n / 2 - 1 . (10) For even n, every elIipsoidal harmonic ex- pressible in terms of polynomials in x, y, z of degree n can be written as a linear combina- tion of the functions (9) and (10), which are 524 (5) calIed the ellipsoidal harmonics of the first and of the third species, respectively. Similarly for odd n, the ellipsoidal harmonics of the second and of the fourth species I/1n m =(x or y or z)8 1 8 2 ", 8(n-l)/2, (11) 1/1::'=xyz8 1 8 2 ...8(n-3)/2 (12) are composed of the Lame functions of the second and fourth species, respectively. For odd n, these forms are a complete sys- tem of eIlipsoidal harmonics that are linearly independent and expressible in terms of poly- nomials in x, y, z of degree n, The zeros l' 2'"'' p of the Lame func- tions are real, and i # j (i # j). They never coincide with anyone of e 1 , e 2 , and e3' If e l > e 2 >e 3 , then  b ..., p all lie between eland e 3 . If m is an integer such that O.s; m .s;p, we have one and only one Lame function (with the species given) with m of its zeros lying between e l and e 2 and the remaining p-m zeros between e 2 and e 3 (Stieltjes's theorem). In this way, a complete system of linearly inde- pendent Lame functions of the specified type may be obtained, since m assumes p+ 1 differ- ent values. When the constant B appearing in the differential equation (3) takes specific values so that the equation has Lame func- tions of the first kind as its solution, (3) also yields a solution;\ such that A-->c(n+1)/2 as  --> 00. This function A is calIed the Lame function of the second kind. D. Ellipsoids of Revolution (Spheroids) When the fundamental elIipsoid is a spheroid x 2 + y2 Z2 --z-+z= 1, a c (9) it is convenient to use the spheroidal coordi- nates ( , IJ, <p) given by x=I J(e- 1 )(1-1J2) cos<p, y=I J W -1)(1-1J2)sin<p, (13) z=IIJ, I= Jc 2 -a 2 for a 2 < c 2 (prolate) a nd x= l J(e + 1)(1-1J2) cos<p, y= l J W + 1)(1-1J2) sin <p, z=IIJ, I= Ja 2 -c 2 (14) for a 2 > c 2 (oblate). The solutions of Laplace's equation, which are regular at alI finite points, are given by 1/1 = P::'()P::'(IJ):m<p in the prolate case, and 1/1 = P::'(i)Pnm(IJ):m<p 
525 in the oblate case, Here Pn m is the tassociated Legendre function of the first kind. Solutions which are regular outside a finite ellipsoid can be composed of the tassociated Legendre functions of the second kind, Q::'() or Q::'(i) instead of Pnm() or P;:'(i), respectively, E. Spheroidal Wave Functions Transforming the tHelmholtz equation in prolate spheroidal coordinates (13), we have 1 { a ( 2 a'P ) a ( 2 a'P ) W-'1 2 ) a (-I)az + a'1 (1-'1) +( e 1 + 1 '12 ) : }+K2'P=0, K=kl. (15) By separating variables in the form 'P = X() Y('1)i m<p, we have the equations d ( 2 dX ) ( 22 m2 ) - ( 1- ) - + A-K  -- X=o d d 1 - e ' (l6a) d ( 2 dY ) ( 22 m2 ) - (1-'1)- + A-K '1 -- Y=O, d'1 d'1 1 - '1 2 (16b) which X and Y, respectively, must satisfy. The only difference between equations (16a) and (16b) arises from the fact that the domain of (16a) is given by 1 <  whereas the domain of (l6b) is given by -1 <'1< 1. For the oblate spheroid, utilizing formula (14) we have 1 { a ( 2 a'P ) a ( 2 a'P ) W +'1 2 ) a ( + l)az + a'1 (1-'1 ) + C '12 - e 1 )  }+K2'P=0, By separating variables as before, Y('1) satisfies the same equation as (16b), while X() satisfies equation (l6a) with  replaced by i. All these equations are of the type d ( 2 dU ) ( 22 m2 ) dz (l-z ) dz + A-K z - 1-z2 u=o. A solution of (18) is known as a spheroidal wave function. The equation (18) has :tl as regular singular points and 00 as an irregular singular point of class 1. Hence spheroidal wave functions behave like tLegendre func- tions in the interval [-1,1] and like tBessel functions in the neighborhood of 00. F. The Functions pe;:'(x) and qe;:'(x) When we write a solution of (18), it is custom- ary to write x instead of z when z is contained in the interval [ -1,1]. We denote solutions of 133F Ellipsoidal Harmonics (18) which are regular on the whole domain -1 .s; x.s; 1 by pe;:'(x), and the corresponding eigenvalues by An,m (assuming the boundary condition stated in this paragraph concern- ing singularities). In particular, when K-->O, equation (18) reduces to tLegendre's associ- ated differential equation, the eigenvalues of ..1. become n(n + 1) (n is a positive integer), and the corresponding eigenfunctions become the associated Legendre functions of the first kind: dmp P n m (x)=(I_x 2 )m j 2 dx mn . (19) Hence pe;:'(x) is a solution which tends to a constant multiple of Pnm(x) as K-->O. Using a system of orthogonal functions P;:'(x), we can expand pe;:'(x) as pe;:'(x) = I A:'/Pt(x), l;3-m II-nl=even number. (20) The coefficients A satisfy a recurrence formula ( 2212 + 21- 1 - 2m 2 ) An,m-I(l+ 1)+K (21-1)(21+3) A:, 1 (I-m-l)(l-m) _K 2 Am (21-3)(21-1) n,H _K 2 (l+m+l)(l+m+2) Am =0 (21+3)(21+5) n,I+2 . (21) (17) The functions pe;:'(x) and pet (x) are ortho- gonal in the domain -1 < x < 1. Another solution of (18) exists which corre- sponds to the same eigenvalue An,m' is inde- pendent of pe::'(x), and has the opposite parity: qe;:'(x) = L A:,/Qr(x) 1;3--m,l-n=even + L B:'jPt(x), jm.j-n=odd (22) where the A:'I are the same as in (20) and are determined by the recurrence formula (21), while for j  m + 2, the B:'j satisfy the recur- rence formula (18) ( ..1. " 1 2 2/ + 2j - 1 - 2m 2 ) Bm n,m-J(J+ )+K (2;-1)(2;+3) n,j (j-m-l)(j-m) + K 2 B m . (2j - 3)(2j - 1) n,r 2 K 2 (j+m+l)(j+m+2) B m . =0 + (2j + 3)(2j + 5) n.]+2 . (23) Since the associated Legendre function of the second kind Qr(x)=(I-x 2 )m j 2 dmQddx m is of the form Qr(x)=m(x)log J (I +x)/(I-x) +(1_x2rmj2 x (a polynomial in x) 
133 Ref. Ellipsoidal Harmonics for I  m, the qe::'(x) have x = :t 1 as singular points. By expressing the solution of equation (18) in integral form we find that pe::'(x) satisfies an integral equation in-mvn,mpe:(x) = fl (1-x2t/2(1_e)m/2eiKX{pe::'()d, (24) where the coefficient V n . m is related to P,:"(O) or p;;" (0). In order to extend the domain of defini- tion of pe::'(x) and qe::'(x) beyond the interval [-1, IJ, we adopt, in the domain G obtained by deleting the interval [ -1,1] from the com- plex plane, the Heine-Hobson definition of the associated Legendre function, Pnm(z) = (Z2 - 1 )m/2 d m Pn/dz m , instead of N. M. Ferrers's definition (19), and construct a solution of(18) in G: pe::'(z) = L AIPt(Z), l;3-m Il-nl=even number, which is like (20) and again satisfies the inte- gral equation (24). From this we can obtain the expansion formula j2; (Z2 -1 t/ 2 pe::'(z) = ( ) m vn,m KZ x L (_l)(I-n)/2 (1 +m)! A;;, /I+1/2(KZ) , l;;'m (I-m)!' KZ Il-nl=even number. Multiplying this by a constant, we define o (Z2 -It/ 2 je::'(z) = -V 2 zm x " ( _I ) (I-n)/2 Fm Jl+I/2(KZ) ! " F m L.. 11.1 r:::. L.. 11,1' r;::m y KZ lm Fm = (I+m)! Am n,l (I-m)! n,[' This expression asymptotically assumes the form je::'(z)  sin(Kz - nn/2)/KZ for Izl» 1. In a similar manner we find a solution A (Z2 _l)m/2 nem ( z ) = - - n 2 zm x" ( _I ) (I-n)/2F m Nl+1/2(KZ) ! " Fm L.. 11,1 C L.. 11,1 lm v KZ lm having the asymptotic form ne::'(z) COS(KZ - nn/2)/Kz. 526 Disregarding a constant factor, this coincides with the function defined by (22) with Q;;'(z) = (Z2 -It/2dmQn/dzm in place of the associated Legendre function of the second kind. References (25) [IJ M. J. O. Strutt, Lamesche- Mathieusche- und verwandte Funktionen in Physik and Technik, Erg. Math., Springer, 1932 (Chelsea, 1967). [2J E. W. Hobson, The theory of spherical and ellipsoidal harmonics, Cambridge Univ. Press, 1931 (Chelsea, 1955), [3J Y. Hagihara, Theories of equilibrium fig- ures of a rotating homogeneous fluid mass, revised translation, NASA, Washington D,C., 1970. (Original in Japanese, 1933.) [4J Y. Hagihara, Celestial mechanics, vol. 4, pts. 1,2, Japan Society for the Promotion of Science, 1975. [5] C. Flammer, Spheroidal wave functions, Stanford Univ. Press, 1957, [6J J. A. Stratton, P. M. Morse, L. J. Chu, J. D. C. Little, and F. J. Corbat6, Spheroidal wave functions, including tables of separation constants and coefficients, Wiley, 1956. (26) 134 (XIV.3) Elliptic Functions A. Elliptic Integrals (27) Let cp(z) be a polynomial in z of degree 3 or 4 with complex coefficients and R(z, w) a rational function in z and w. Then R(z,) is called an elliptic irrational function. An integral of the type S R dz is called an elliptic integral. The origin of the name comes from the integral that appears in calculating the arc length of an ellipse. Any elliptic integral can be expressed by a suitable change of variables as a sum of elementary functions and elliptic integrals of the following three kinds: f dz ) (1- z2)(1 - k 2 Z2)' f$ -k 2 Z 2 ------z dz, l-z and f dz (1- a 2 z2» ) (1 - z2)(1 - k 2 Z2) ( Appendix A, Table 16.1). These three kinds of integrals are called elIiptic integrals of the first, second, and third kind, respectively, in Legendre-Jacobi standard form. This classifica- 
527 tion corresponds to that oftAbelian integrals. The constant k is called the modulus of these elliptic integrals, and a is called the parameter. Let the four zeros of <p(z) be ex l , ex 2 , ex 3 , ex 4 (we take one of them as 00 when <p(z) is of degree 3). The tRiemann surface 9i corresponding to the elliptic irrational function has the zeros ex l , ex 2 , ex 3 , ex 4 as tbranch points with degree of ramification 1, and is of two sheets and oftgenus 1. If the integrand does not have a pole with a tresidue, then the integral is multi valued only because the value (called the periodicity modulus) of the integral taken along the normal section (basis of the homol- ogy group) is not equal to zero. B. Elliptic Integrals of the First Kind When R is a function without singularities other than branch points, only the topological structure of R gives rise to the multivaluedness of the integral of R. The standard form is f z dz o J (1-z2)(1-k 2 z 2 ) f '" dcp = - w=F(k,<p), o J 1-k 2 sin 2 <p where z = sin cp. This integral is the inverse function of sn w ( Section J). The periodicity moduli are 2iK' and 4K, where K =K(k)= f l dz o J (1_22)(1-k 2 z 2 ) f n / 2 d<p ( n ) = - F k- o J 1 - k 2 sin 2 <p , 2 ' K'=K(k'), k'2=I-k 2 . We cal1 K(k) a complete elliptic integral of the first kind and F(k, <p) an incomplete elliptic integral of the first kind ( Appendix A, Table 16). Setting . (1 + k')sin <p cos <p mn<pI= , J 1-k 2 sin 2 <p we have the relation 1-k' k - 1- 1 + k " F(k, <p)=(1 + kiJF(k!, <pd/2, which is cal1ed Landen's transformation. Since k l < k when 0 < k < 1, this transformation reduces the calculation of elliptic integrals to those with smal1er values of k. For two given positive numbers a and b, put a o = a, bo=b, and an+! =(a n +b n )/2, b n + l =. Then the sequences {an} and {b n } converge rapidly to the common limit, which is cal1ed the arithmetico-geometric mean of a and b, and is denoted by ag(a, b), The complete 134E Elliptic Functions elliptic integral satisfies the relation K (k) = n/[2 ag(1, J1='k2)J. C. Elliptic Integrals of the Second Kind When R has poles with residue zero, its inte- gral has no singularities other than poles. The standard form is F(Z)= f Z J1-k2:2 dz o l-z = f: Ji-k 2 sin 2 <p d<p=E(k,<p), (3) where z = sin <po We have f u 8'(u) E F(z)= dn 2 udu=-+-u o 8(u) K if we set z = sn u. Here, 8 is Jacobi's theta func- tion, and ( 'K' ) 8(u)=9 4 2 ' ' (1 ) where 9 4 is a theta function to be described in Section I and K, K' are the same as in the case of elliptic integrals of the first kind. The quantity f l $ -k 2 Z 2 E= -dz o 1 - Z2 = f/2 J 1-k 2 sin 2cp dCP=E( k,) is cal1ed a complete elliptic integral of the second kind. D. Elliptic Integrals of the Third Kind When R has poles with nonzero residue its integral has logarithmic singularities. In this case, residues also contribute to multivalued- ness of the integral. The standard form is (2) f z dz F(z) == o (1-a 2 z 2 ) J (1-z2)(I-k 2 z 2 ) f '" d<p = 0 (1-a 2 sin2<p) J 1-k 2 sin2<p' and it is expressed as snex ( 1 8(u-ex) 8'(ex) ) F(z) - -log +u- +u cnexdnex 2 8(u+ex) 8(ex) if we set z = sn u and a 2 = k 2 sn 2 ex ( Appendix A, Table 16). E. Elliptic Functions and Periodic Functions Historically the el1iptic function was first intro- duced as the inverse function of the el1iptic 
134F Elliptic Functions integral. However, since it has been realized that elliptic functions are characterized as functions with double periodicity, it is now customary to define them as doubly periodic functions. If f(x), defined on a linear space X, satisfies the relation f(x+w)= f(x) for some WEX and all XEX, the number W is called a period of f(x), and f(x) with a period other than zero is called a periodic function. The set P of all periods of f(x) forms an additive group con- tained in X. If a basis WI, ..., w n of the additive group P exists, its members are called funda- mental periods of f(x). Any continuous, nonconstant, periodic function of a real variable has only one posi- tive fundamental period and is called a simply periodic function. The ttrigonometric functions are typical examples: sin x and cos x have the fundamental period 2n; tan x and cot x have the fundamental period n ( 159 Fourier Series), A single-valued nonconstant tmeromorphic function of n complex variables cannot have more than 2n fundamental periods that are linearly independent on the real number field. A function of one complex variable with two fundamental periods is called a doubly periodic function. Let w, w' be the fundamental periods of a doubly periodic function. For a given number a, the parallelogram with vertices a, a + w, a + w', a + W + w' is called the fundamental period parallelogram. The complex plane is covered with a network of congruent parallel- ograms, called period parallelograms, 0 btained by translating the fundamental period parallel- ogram through mw+nw' (m,n=O, :t1, :t2,...), A doubly periodic function f(u) meromor- phic on the complex plane is called an ellip- tic function. For simplicity, we usually denote the fundamental periods of an elliptic function by 2w I and 2w 3 , and introduce w 2 defined by the relation WI + W2 + w 3 = O. The first, and therefore also higher, derivatives of any elliptic function are elliptic functions with the same periods, The set of all elliptic functions with the same periods forms a tfield. The number of poles in a period parallelogram is finite. The sum of the orders of the poles is called the order of the elliptic function. An elliptic func- tion with no poles in a period paraJlelogram is merely a constant (Liouville's first theorem). The sum of the residues of an elliptic function at its poles in any period parallelogram is zero (Liouville's second theorem). Hence there can be no eJliptic function of order 1. An elliptic function of order n assumes any value n times in a period parallelogram (Liouville's third theorem). The sum of the zeros minus the 528 sum of the poles is a period (Liouville's fourth theorem). F. Weierstrass's Elliptic Functions Weierstrass defined 1 ( 1 1 ) p(u)= u 2 + L' (U-0)2 - 02 as the simplest kind of elliptic function. Here 0=2mw I +2nw 3 , with m, n integers. The sum- mation L' extends over all integral values (pos- itive, negative, and zero) of m and n, except for m=n=O. P(u) is an elliptic function of order 2 with periods 2w I and 2w 3 , called a Weierstrass p-function. The following func- tions (u) and O"(u) are called the Weierstrass zeta and sigma functions, respectively: 1 ( 1 u 1 ) (u)=-+ L' -+-+ u u _ 0 0 2 0 and O"(u)=u n( (1-5)exP(5+ : ). These have quasiperiodicity, expressed by (u + 2w;)== (u) +2f1.. (4) (5) O"{u+2w i )= _e 2n ,(u+",,)0"(u), f11+f12+'13=0, f1i=(W;), i=I,2,3; and they satisfy the relations p(u)= -('(u) (6) and I' dlogO"(u) O"'(u) .,(u)- du O"(u) (7) The function p(u) is an even function of u, and (u) and O"(u) are odd functions ofu. By considering the integral S (u)du once around the boundary of a fundamental period par- aJlelogram, we have f11 w 3 -f13 W I } f12 W I - '11 W2 = :ti, '13 W 2 -f12 W 3 Im( :: )zo, (8) which is called the Legendre relation. The derivative p'(u)= -2L 1/(u-0)3 of a p-function is an elliptic function of order 3 and bears the foJlowing relatIOn to p(u): (p'(U))2 =4(p(U))3 -g2 p(u) -g3 = 4(p(u) - ed(p(u) - e 2 )(p(u) - e 3 ), g2=60L'I/0 4 , g3=140L'I/0 6 , ei=p(W;), i=I,2,3. (9) 
529 By differentiating this relation successively, we see that p(n)(u) is expressed as a polynomial in p(u) if n is an even number and as a prod- uct of polynomials in p(u) and p'(u) if n is an odd number. In particular, writing p(u) = z in (9) we find that the p-function is the inverse function of the elliptic integral f z dz u= 00 ) 4z 3 -g 2 z-g 3 ( Appendix A, Table 16.IV). Any elliptic function can be expressed in terms of Weierstrass's functions. Specifically, let the poles of f(u) and their orders be ai, a 2 , ..., am and hi, h 2 , ..., h m , respectively, and let the principal part in the expansion of f(u) near the pole a k be  , L.. k = 1, 2, ,.. , m. j=1 (u-akY Then we obtain f(u) = C + kt 1 ( Akl (u-a k ) h, (-1 Y A . ) + L . kJ pU- 2 )(u - a k ) , j=1 (J-l)! where C is a constant depending on f(u). This can be reduced to f(u) = A + Bp'(u) by using the addition theorems ( Appendix A, Table 16) for p and zeta functions, where A and B are rational functions of D(u). Therefore, given two elliptic functions with the same periods, after expressing them as rational functions of p and p' in the above form and eliminating p and p', we obtain an algebraic equation with constant coefficients. In par- ticular, for any elliptic function f(u) we ob- tain an talgebraic differential equation of the first order by using this method, with f'(u) an elliptic function with the same periods. Fur- thermore, the functions f(u + v),j(u), and f(v) satisfy an algebraic equation. Thus for any elliptic function an algebraic addition theorem holds. G. Elliptic Functions of the Second Kind As an extension of the definition of elliptic functions, if a tmeromorphic function f satis- fies the relations f(u+2wd=J1d(u), f(u+2w 3 )=J1J!(u) (12) .. (J11 and J13 are constants) with the fundamental periods 2w 1 , 2w 3 , we call f(u) an elliptic func- tion of the second kind. What we have called 134 I Elliptic Functions simply an elliptic function can now be called an elliptic function of the first kind. For con- stants p and v, the function f(u) = ePuO"(u - v)jd u ) (13) is an example of an elliptic function of the second kind. In this case J1i=e 2p ",,-2v ni , i= 1,2. (14) Furthermore, for given constants J11 and J13, any elliptic function of the second kind is expressed as the product of an elliptic function of the first kind and the function (13) with p and v determined by (14). H. Elliptic Functions of the Third Kind If a meromorphic function f satisfies (10) f(u+2w i )=e a ,u+b i f(u), i= 1,3 (15) (11) (a i and b i are constants) with periods 2w l , 2W3, we call it an elliptic function of the third kind ( 3 Abelian Varieties I). The Weierstrass sigma function O"(u) is an example of an elliptic function of the third kind. The functions 0"" 0"2' and 0"3, defined by the equations O"i(U) = eniuO"(u-W i ) O"(w;) (16) i= 1,2,3, are also elliptic functions of the third kind. In the case of elliptic functions of the second and third kinds, 2w I and 2w 3 are not periods in the strict sense defined earlier, but are conve- niently referred to as the periods. The func- tions O"i (i = 1,2,3) are called cosigma functions. I. Theta Functions The theta functions, or more strictly, elliptic theta functions, are defined by 00 9 1 (v,r)=2 L (_I)"q(n+1/2)2 sin (2n + l)nv, n=O 9 2 (v,r)=2 f q(n+I/2)2cos(2n+l)nv, n=O 00 9 3 (v, r)= 1 +2 L qn 2 cos2nnv, n=l 00 9 4 (v,r)=1+2 L (-I)"qn 2 cos2nnv, (17) n=l where q =e in ', 1m r> O. We call (17) the q- expansion formulas of the theta functions, and we sometimes write 9 0 in place of 9 4 , A theta function is an elliptic function of the third kind with periods 1 and r. Any elliptic function can be expressed as a quotient of theta functions ( Appendix A, Table 16 for specific exam- 
134J Elliptic Functions pIes). The q-expansion formula is quite suit- able for numerical computation because of its rapid convergence; its terms decrease as the n Z powers of q as n --> 00. An elliptic function with the fundamental periods 2w I and 2w 3 can also be viewed as having the periods 2w = 2w 3 , 2w; = - 2w l . Consequently, theta functions formed with the parameter r = W3/WI can be expressed in terms of the parameter r' = W3/W = - WdW3 = -1/r, and we have 9 1 (v, r) = iA9 1 (v/r, -1/r), 9z(v, r) = A9 4 (v/r, -1/r), 9 3 (v, r) = A9 3 (v/r, -1/r), 9 4 (v,r)=A9 2 (v/r, -1/r), A=j0exp( -nivz/r). These are called Jacobi's imaginary trans- formations. If Im( -1/r)>> 1, then Iql  1, and therefore the series in the q-expansion for- mula converges slowly. However, by an imag- inary transformation we get Im(r)>> 1 and Iql ,=;0, so that computations become much easier. Each of the theta functions satisfies, as a function of two variables u and r, the follow- ing partial differential equation of the heat- conduction type: a Z 9(u, r)/au z = 4nia9(u, r)/ar ( Appendix A, Table 16.11). J. Jacobi's Elliptic Functions C. G. J. Jacobi defined elliptic integrals as inverse functions of elli ptic integrals of the first kind in the Legendre-Jacobi standard form (1). They are, in the above notation, sn w = J e 1- e 3 O"(u) = 9 3 (0)9 1 (v) , (20) 0"3(U) 9z (0) 9 4 (v) 0" I (u) 94(0)9z(v) cnw= - (21) 0"3(U) 9z(0) 9 4 (v) , dn w = O"z(u) _ 9 4 (0) 9 3 (v) (22) 0"3(U) 9 3 (0)9 4 (v) , where w= Jel -e3 u and v=u/2w l . These functions satisfy the relations snzw+cn z w= 1, kZsnzw+dnzw = 1, (23) where kZ= e Z -e 3 = (9Z(0))4 (24) el -e 3 (9 4 (0))4' The constants k and k' = j1=k2 are called the modulus and the complementary mod- 530 ulus, respectively. Furthermore, the relation d sn w/dw = en w dn w holds. The function z = sn w is the inverse function of the elliptic integral (1) (Appendix A, Table 16.III). References (18) [IJ S. Lang, Elliptic functions, Addison- Wesley, 1973. [2] D. W. Masser, Elliptic functions and trans- cendence, Springer, 1975. [3J C. G. J. Jacobi, Fundamenta nova theo- riae functionum ellipticarum (1829) (c. G. 1. Jacobis Gesammelte Werke I, G. Reimer, 1881,49-239). [4] G. H. Halphen, Traite des fonctions el- liptiques et de leurs applications I, II, III, Gauthier-Villars, 1886-1891. [5J A. Hurwitz and R. Courant, Vorlesungen liber allgemeine Funktionentheorie und ellip- tische Funktionen, Springer, third edition, 1929. [6] H. E. Rauch and A. Lebowitz, Elliptic functions, theta functions and Riemann sur- faces, Williams & Wilkins, 1973. [7] H. Hancock, Elliptic integral, Dover, 1958. (19) 135 (11.3) Equivalence Relations A. General Remarks Suppose that we are given a relation R be- tween elements of a set X such that for any elements x and y of X, either xRy or its nega- tion holds. The relation R is called an equiva- lence relation (on X) if it satisfies the following three conditions: (1) xRx, (2) xRy implies yRx, and (3) xRy and yRz imply xRz. Conditions (1), (2), and (3) are called the reflexive, sym- metric, and transitive laws, respectively. To- gether, they are called the equivalence prop- erties. Condition (1) can be replaced by the following: (1') For each x there exists an x' such that xRx'. The relation "x is equal to y" is an equivalence relation. If xRy means that x and yare in X, then R is also an equivalence relation. An equivalence relation is often de- noted by the symbol . The relations of con- gruence and similarity between figures are equivalence relations. If X is the set of integers and x = y means that x - y is even, then the relation = is an equivalence relation. 
531 B. Equivalence Classes and Quotient Sets Let R be an equivalence relation. "xRy" is read: "x and y are equivalent" (or "x is equiva- lent to y"). The subset of X consisting of all elements equivalent to an element a is called the equivalence class of a. By (1), (2), and (3), each equivalence class is nonempty, the equiv- alence class of a contains a, and different equivalence classes do not overlap. Namely, X is decomposed into a tdisjoint union of equivalence classes. This tpartition is called the classification of X with respect to R. For example, the set of integers is classified into the equivalence class of even numbers and that of odd numbers by the relation =. Conversely, since the relation "x and y belong to the same member of a partition" is an equivalence re- lation, we can regard any partition as a classi- fication. An element chosen from an equiva- lence class is called a representative of the equivalence class. In the example we can take o and 1 as the representatives of equivalence classes of even and odd numbers, respectively. XjR denotes the set of equivalence classes of X with respect to R, and is called the quotient set of X with respect to R. The mapping p:X--> XjR that carries x in X into the equivalence class of x is called canonical surjection (or projection). The idea of equivalence relations can be generalized to deal with the case when X is a tclass. C. Stronger and Weaker Equivalence Relations Let Rand S be two equivalence relations on X. If xRy always implies xSy, then we say that R is stronger than S, S is weaker than R, the classification with respect to R is finer than the one with respect to S, or the classification with respect to S is coarser than the one with re- spect to R. The relations "x is equal to y" and "x and y are in X" are the strongest and the weakest equivalence relations, respectively. Any two equivalence relations on X are ordered by their strength, and the set of equivalence relations on X forms a tcomplete lattice with respect to this ordering, References [lJ N. Bourbaki, Elements de mathematique I, Theorie des ensembles, ch, 2, Actualites Sci. Ind., 1212c, Hermann, second edition, 1960; English translation, Theory of sets, Addison- Wesley, 1968. Also  references to 381 Sets. 136B Ergodic Theory 136 (XVII.15) Ergodic Theory A. General Remarks The origin of ergodic theory was the so-called ergodic hypothesis, which provided the foun- dation for classical statistical mechanics as created by L. Boltzmann and 1. Gibbs toward the end of the 19th century ( 402 Statistical Mechanics). Attempts by various mathemati- cians to give a rigorous proof of the hypoth- esis resulted in the recurrence theorem of H. Poincare and C. Caratheodory and the ergodic theorems of G. D. Birkhoff and J. von Neumann, which marked the beginnings of ergodic theory as we know it today. As the theory developed it acquired close relation- ships with other branches of mathematics, for example, the theory of dynamical systems, probability theory, functional analysis, number theory, differential topology, and differential geometry. The principal object of modern ergodic theory is to study properties of tmeasurable transformations, particularly transformations with an invariant measure, In most cases, the transformations studied are defined on a Le- besgue measure space with a finite (or O"-finite) measure. A Lebesgue measure space with a finite measure (O"-finite measure) is a tmeasure space that is measure-theoretically isomorphic to a bounded interval (to the real line) with the usual tLebesgue measure, possibly together with an at most countable number of atoms. It is known that any separable complete tmetric space with a complete regular Borel tproba- bility measure is a Lebesgue measure space with a finite measure. We assume, unless stated otherwise, that the measure space (X, f1JJ, m) is a Lebesgue measure space. All the subsets of X mentioned are assumed measurable, and a pair of sets or functions that coincide almost every- where are identified. We use the abbreviation "a.e." to denote "talmost everywhere," B. Ergodic Theorems Let (X,f1JJ,m) be a O"-finite measure space. A transformation <p defined on X is called measurable iffor every BEf1JJ, <p-l(B)Ef1JJ. A tbijective transformation <p on X is called bimeasurable if both <p and <p -I are measur- able. A measurable transformation <p is called measure-preserving (or equivalently, the mea- sure m is invariant under <p) if m(<p -I (B» =m(B) holds for every B. It is called nonsingular if m(<p -1(B))=O whenever m(B)=O, and ergodic 
136 B Ergodic Theory if m«<p -I(B) U B)-(<p -I (B) n B»= 0 implies either m(B)=O or m(X -B)=O. The mean ergodic theorem of von Neumann (Proc. Nat. Acad, Sci. US, 18 (1932») states that if <p is a measure-preserving transformation 01" (X, f1lJ, m), then for every function f belonging to the tHilbert space L 2 (X)=L 2 (X,f1lJ,m) ( 168 Function Spaces), the sequence 1 ( n-l ) An!(x)= k/(<pkX) converges in the tnorm of L 2 (X) as n-->oo to a function f* that satisfies f*(<px) = f*(x) a.e. The individual (or pointwise) ergodic theorem of Birkhoff(Proc. Nat. Acad. Sci. US, 17 (1931)) states that for every f belonging to the tfunc- tion space LI(X), the sequence Anf(x) con- verges a.e. to f*. From either of these theo- rems it follows that for any set E satisfying <p -I (E) = E and m(E) < 00, the limit function f* satisfies IEf* dm = IEfdm. In particular, if m(X) = 1 and <p is ergodic, then the limitf* equals the constant If dm a.e. This fact therefore gives a mathematical justification to tbe ergodic hypothesis, which states that the "time mean" (L:f(<pk x»/n of what is observable over a sufficiently long time can be replaced by the "phase mean" Ifdm. Both von Neumann's theorem and Birkhoff's theorem were sub- sequently generalized in various directions by many authors. (1) Mean ergodic theorems are concerned with the tstrong convergence of the sequence of averages An =(Lt;; Tk)/n of the iterates of a tbounded linear operator T on some tBanach space. A generalization of von Neumann's theorem due to F. Riesz, K. Y osida, and S, Kakutani dispenses with the assumptions that the linear operator T is induced by a measure- preserving transformation <p and that T acts on the Hilbert space L 2 (X). A version of this generalization states that if a linear operator T defined on a Banach space f£ satisfies the conditions (i) sup II  nIl T k II < 00, n;'l nk=O 1 (ii) Iim -II T"II = 0, n_if) n then for an elementfEf£ the sequence of averages An! converges strongly to an element f* Ef£ if and only if there exists a subsequence converging tweakly to f*. From this theorem of Riesz, Y osida, and Kakutani follows the Lp-mean ergodic theorem (1 <p< 00) for so- called Markov operators: If T is a linear oper- ator defined on each of the Banach spaces Lp(X) (l.s;p.s; 00) by means of the formula Tf(x) = I f(Y)P(x, dy), where P(x, B) is the ttran- sition probability of a tMarkov process on 532 (X, f1lJ) leaving the measure m invariant (i.e., I P(x, B)dm=m(B) for every BEf1lJ) ( 261 Markov Processes), then for every f belonging to Lp(X) (1 <p< 00) the sequence Anf con- verges in the norm of Lp(X) to a limit function f*. (2) Birkhoff's ergodic theorem has been extended to the following individual ergodic theorem by E. Hopf (1954): If T is a tpositive linear operator mapping L I (X) into L I (X) and Loo(X) into Loo(X) with II TIII.s; 1 and II TII oo.s; 1, then for every f in Lt<X) the se- quence Anf converges a,e. to a Iimitf*. If T is a Markov operator, then T maps each Lp(X) into itself and satisfies II T II p.s; 1 for each p (1 .s; p.s; 00), and therefore Hopf's er- godic theorem applies to such T. Special cases of this theorem were proved earlier by J. Doob and by Kakutani. Later, N. Dunford and 1. Schwartz showed that the assump- tion of the positivity of T can be dispensed with in Hopf's theorem. For a positive linear operator Ton LI (X) satisfying II Till .s; 1, R. Chacon and D. Ornstein (1960) proved that the ratio ergodic theorem holds: For every pair of functions f and 9 in L t< X) with g:;;' 0 a,e" Iimnoo L: Tkf(x)/L: Tkg(x) exists and is finite a.e. on the set {xIL: Tkg(x»O}. This theorem extends earlier results of Hopf and W. Hurewicz dealing with special classes of operators arising from measurable transfor- mations. Hopf's ergodic theorem can be de- duced from the Chacon-Ornstein theorem, while it is known that there are positive oper- ators Ton LI (X) satisfying II Till .s; 1 for which Iimnoo Anf fails to exist on a set of positive measure for some f ELI (X), This shows that the assumption II T II 00 .s; 1 is crucial in Hopf's theorem, (3) As was the case in the original proof by Birkhoff of his ergodic theorem, every known proof of an individual ergodic theorem de- pends crucially on the so-called maximal ergodic lemma (or maximal inequality). For the case of a positive linear operator Ton L I (X) with II Till .s; 1, Hopf proved the rele- vant maximal ergodic lemma: If E(/) is the set {xISUPn;'1 An!(x»O} for each fin Ll(X), then IE(f)fdm:;;' O. Hopf's original proof of this lemma was quite intricate, but A. Garcia (1965) succeeded in giving an extremely sim- ple and elegant proof. From the maximal er- godic lemma the following so-called domi- nated ergodic theorem can also be obtained: Let T be a linear operator mapping each Lp(X) into Lp(X) satisfying II Tllp.s; 1 for each p (I.s;p.s; 00). If for fin Lp(X) we let j(x) = sUPn;'IIAnf(x)l, then if 1 <p< 00 we have f 2Pp f j(x)pdm.s;- I f(x)IPdm, p-l 
533 while if p= I and m(X)< 00 we have f J (X)dm.s;2[ m(X) + flf(X)llOg+ If(x)ldm J This theorem was obtained first by N. Wiener (1939) for the special case of T induced by a measure-preserving transformation. M. Ak- coglu (1975) proved an isometric "dilation" theorem for positive contractions (i.e., positive linear operators T for which II TII.s; 1) in the space Lp(X) for I <p< 00, by means of which he was able to deduce a dominated ergodic theorem for an arbitrary positive contraction in Lp(X) from the'corresponding theorem for a positive linear isometry in Lp(X) proved earlier by A. Ionescu- Tulcea. From this theorem of Akcoglu one can obtain the following indi- vidual ergodic theorem: If T is a positive con- traction in Lp(X) for some p with I <p< 00, then for every f in Lp(X), the sequence Anf converges a.e. (4) Both mean and individual ergodic theo- rems can be extended without difficulty to a continuous time parameter semigroup {1; I t  O} of bounded linear operators such that 1;1',= 1;+s (To=I), under a suitable con- tinuity assumption on 1; with respect to t, by replacing the discrete time average (L;:; Tk)/n with (H 1',ds)/t. Further extensions to n- parameter semigroups were obtained by N. Wiener and by N. Dunford and A. Zygmund. For mean ergodic theorems, even further extensions were possible to tamenable semi- groups of bounded linear operators. For 1- parameter semigroups, the behavior of the mean at zero, (Jb 1',ds)/t as t10 (local ergodic theorem) or }, J e- AS 1',fds as A10 or ..1.1'00 (Abelian ergodic theorem), has also been in- vestigated by Wiener, U. Krengel, E. Hille, Y osida, and others. Abelian ergodic theorems are related to properties of the tresolvent of the semigroup {1;} ( 378 Semigroups of Operators and Evolution Equations). Further extensions of all these theorems in various directions have been given by many authors; for these extensions and related topics  [5-7]. (5) J. F. C. Kingman (1968) proved an inter- esting and useful extension of (both mean and individual) ergodic theorems, called the sub- additive ergodic theorem. A real-valued tsto- chastic process {Xi,k I O.s; i < k, k = 1,2,3, ... } is called a subadditive process if it satisfies the following conditions: (i) Whenever i <j < k, Xi,k.s; Xi,j+ Xj,k' (ii) The tjoint distribution of {Xi+l.j+d is the same as that of {Xi,J. (iii) The texpectation gk = E(Xo,d exists, and satisfies gk  - Ak for some constant A and for aU k> 1. Kingman proved that if {Xj,k} is a sub- additive stochastic process, then the limit  = Iimnoo(1/n)Xo,n exists a.e. and in the mean, 136C Ergodic Theory and E()=limnoo(1/n)gn' Note that if <p is a measure-preserving transformation on a finite measure space (X,.'?6', m) and if f is an element of LdX), then Xi,k = LJ;:;/ f 0 <pj for O.s;i<k, where k = 1,2,3 ... defines a subadditive pro- cess (in fact, Xi,k = Xi,j + X j . k for i <j < k). The multiplicative ergodic theorem, proved by V. Oseledec, which plays a significant role in ergodic theory of tnonhyperbolic smooth dynamical systems and which has found ap- plications also in talgebraic groups, can be derived from the subadditive ergodic theorem [8,9]. C. Recurrence and Invariant Measures In this section we assume that the measure space (X,.'?6',m) is nonatomic. A nonsingular measurable transformation <p defined on (X,.'?6', m) is called recurrent (infinitely recurrent) if for every set B and for almost all x E B, there exists an nEZ+ (infinitely many nEZ+) such that <pn(X)E B. A set W is called wandering under <p if <p -n(W)n <p -k(W)= 0 for n #k. The transformation <p is called conservative if no sets of positive measure are wandering under <p, and incompressible if B =:> <p -I B implies m(B - <p -1 B) = O. The following statements about a nonsingular measurable transforma- tion <p are equivalent: (i) <p is recurrent; (ii) <p is infinitely recurrent; (iii) <p is incompressible; (iv) <p is conservative. An immediate con- sequence of this is the following recurrence theorem of Poincare (in the form formulated by Caratheodory): A measure-preserving transformation on a finite measure space is infinitely recurrent. In fact, in order for a non- singular measurable transformation <p to be recurrent (and hence infinitely recurrent) it is sufficient that there exist a finite measure II invariant under <p and equivalent to (i.e., mutually tabsolutely continuous with) the given measure m. The invariant measure problem is one of the basic problems in ergodic theory and is for- mulated abstractly in the following way: Given a nonsingular measurable transformation <p on a IT-finite measure space (X,.'?6', m), find neces- sary and sufficient conditions for the existence of a finite (or IT-finite) measure invariant under <p and equivalent to m. The given measure m specifies only the class of equivalent measures among which an invariant measure is to be found. Therefore we can assume without loss of generality that m is a finite measure. For the remainder of this section, unless we explicitly state otherwise, we always mean by an invar- iant measure the one that is equivalent to m. The Poincare recurrence theorem states that <p being recurrent is necessary for the existence 
136C Ergodic Theory of a finite invariant measure. The recurrence of <p is, however, not sufficient, since an ergodic transformation with an infinite but IT-finite invariant measure is recurrent and has no finite invariant measure. A necessary and sufficient condition for the existence of a finite invariant measure was given by a theorem of A. Hajian and Kakutani (1964) which states: <p has a finite invariant measure if and only if <p has no weakly wandering sets (a set W is called weakly wandering under <p if there exists an infinite subset {n k } of Z + such that <p -nk wn <p -nj W = 0, k # j). Hajian also proved that a bimeasurable transformation <p has a finite invariant measure if and only if <p is strongly recurrent in the following sense: For every set E with m(E) > 0, there exists a positive integer k = k(E) such that max m(<pn-jEnE»O Ojk for every n E Z. H. Furstenberg [II] obtained the following striking extension of the Poincare recurrence theorem: If a bimeasurable transformation <p possesses a finite invariant measure, then for every set E with m(E»O and for every integer k  2, there exists n  I such that m(En <pnEn <p 2n En... n <p(k-l) n E) > O. From this theorem one can deduce a difficult theorem of E. Szemeredi on tarithmetic progressions which states: Any subset of the integers having positive tupper density contains arithmetic progressions of arbitrary length. In fact, it is not difficult to show that the theorems of Fi.irstenberg and of Szemeredi are mutually equivalent, and for this reason the theorem of Fi.irstenberg is sometimes referred to as the ergodic SzemerMi theorem. For a nonsingular bimeasurable transfor- mation <p, a pair of sets A and B are said to be countably equivalent under <p if there exist countable decompositions {Ak I kE Z +} and {Bk I kE Z +} for A and B, respectively, and an infinite subset {nk} of Z such that <pnkA k = Bk for each k. A and B are said to be finitely equivalent under <p if finite decompositions {Ak} and {Bk} can be chosen. It was proved by Hopf that (i) <p is recurrent if and only if no set of positive measure is finitely equivalent under <p to one of its proper subsets, and (ii) <p has a finite invariant measure if and only if no set of positive measure is countably equivalent under <p to one of its proper subsets. It can be shown that if <p is ergodic, then <p has no IT-finite invariant measure if and only if every pair of sets of positive measure are countably equiva- lent under <po The first example of a transformation ad- mitting no IT-finite invariant measure was constructed by Ornstein in 1960. Since then, a 534 number of simpler examples have been ob- tained by L. Arnold, BruneI, and others. It is now known that there are many different types of transformations having no IT-finite invariant measure ( Section F). Furthermore, it was shown by 10nescu-Tu1cea that in the group of all nonsingular bimeasurable transformations with a suitable metric, those having a IT-finite invariant measure form a subset of the tfirst category. Various extensions of results on the invar- iant measure problem for nonsingular trans- formations to the case of tMarkov processes having nonsingular transition probabilities were obtained by y, Ito, J. Neveu, S, Foguel, and others [12]. For investigation of detailed properties of particular transformations arising from classi- cal dynamical systems, problems in number theory, and so on, rather than the existence of invariant measures it is more important to determine a specific form of an invariant mea- sure with desirable properties and to develop methods to decide when such a measure is unique. Various people have considered spe- cial classes of transformations; these workers have been able to obtain explicit descriptions of invariant measures with nice properties for the transformations in question and have derived interesting consequences. Most im- portant among these is the so-called Gibbs measure, introduced and investigated by Va, Sinai. He obtained this notion by generalizing the concept of the equilibrium Gibbs distri- bution, which plays a prominent role in tstatis- tical mechanics. It is defined in the following way: Let X be a compact metric space, <p a homeomorphism on X, and J10 a probability measure on X invariant under <po For a func- tion g belonging to Loo(X) and for m, n>O let 2 m . n (glJ10)= Ix eXPCn g(<pk X »)d J1 o (X), and form a sequence of probability measures J1m,n(g) absolutely continuous with respect to J10, for which the tRadon-Nikodym derivative dJ1m,n(x I g) dJ1o(x) eXPL_ng(<pkX) 2 m ,n(g I J10) holds. A measure that is a limit point, in the sense of tweak convergence, of the sequence of measures J1m,n(g) is called a Gibbs measure constructed from J10 and g. It is clear that a Gibbs measure is invariant under <po For mix- ing topological Markov shifts ( Section D), Sinai showed that if one starts with the invar- iant measure J10 of maximal entropy ( Sec- tion H), the existence of which was earlier shown by W. Parry, one can determine a class of functions g for which the Gibbs measure 
535 is unique, i.e., J1(g)=limm.nooJ1m.n(g) exists. Sinai further investigated the properties of the unique measure J1(g) in detail. R. Bowen and D. Ruelle, defining the Gibbs measure some- what differently, investigated the existence and uniqueness of such measures, thereby recapturing the results of Sinai in the case of mixing topological Markov shifts. With the aid of Markov partitions ( Section G), these results On Gibbs measures were carried over to the case of t Anosov and "taxiom A" diffeo- morphisms, and they provided essential tools in the investigation of the ergodic behavior of these transformations. For details  [13,14]. The explicit form of the density of the in- variant measure with respect to the Lebesgue measure for the transformation associated with tcontinued fraction expansion was al- ready known to Gauss, and one can draw from it numerous conclusions about metric properties of continued fractions. The way in which Gauss was able to determine this invar- iant measure, however, was never explained. Recently, Sh. Ito, H. Nakada, and S. Tanaka (Keio Eng. Rep., 30 (1977)) developed an inter- esting method to describe the mechanism used to arrive at this density function for the invar- iant measure for the continued fraction trans- formation. They employed similar methods in subsequent work to determine explicit density functions for invariant measures for other related number-theoretic transformations and for certain classes of continuous mappings over an interval; by means of the explicit forms of invariant measures they were able to de- scribe the metric properties of these transfor- mations in detail. D. Examples and Construction of Measure- Preserving Transformations Examples of measure-preserving transforma- tions appear in many different contexts. We describe some of the important Ones. (1) Let G be a tlocally compact Abelian group satisfying the second axiom oftcounta- bility ( 423 Topological Groups), fJjj a (J- algebra of Borel subsets of G, and m its tHaar measure (normalized if G is compact). Then (G,fJjj,m) is a Lebesgue measure space. For a fixed element goEG, define the transforma- tion <Pgo:G-->G by <Pgo(g)=g+go' Then <Pgo is a bijective measure-preserving transforma- tion On (G, fJjj, m) and is called the rotation On G by the element go. If G is compact, then the rotation <Pgo is ergodic if and only if the cyclic subgroup generated by the element go is dense in G. If this happens, the element go is called the topological generator of G. A group 136D Ergodic Theory is called monothetic if it has a topological generator. (2) If <P is a group tendomorphism of a com- pact Abelian group G, then <P preserves the Haar measure m. If qJ is a group tautomor- phism, th61 it is a bijective measure-preserving transformation on (G,fJjj,m). A continuous group automorphism <P induces a group auto- morphism <p* of the character group G* The measure-preserving transformation <P is ergodic if and only if every character except the identity has an infinite orbit under the induced automorphism <p*. When the group is the n-dimensional torus Tn, a continuous group automorphism <P is uniquely repre- sented by an n x n matrix with integer entries and with determinant :t 1. In this case, <P is ergodic if and only if no roots of unity appear among the teigenvalues of the representing matrix. (3) Let (Y, ,W') be a measurable space, let (1";" 'W'n)=(Y, d) for each nEZ, and define (Y*, d*) to be the tproduct measurable space (iinEz Y;., IInEzd n ). The transformation <P defined On (Y*, d*) by y* = (..., Y -I' Yo, YI,"') --><p(y*)==( ...,y'-I,YO,Y'" ...) with Y = Yn+1 for each n is called the shift transformation. Let J1 be a probability measure on (Y"si) such that (Y,d,J1) is a Lebesgue measure space, let J1n = J1 for each n, and define J1* to be the tproduct measure II nE 7.J1n on (Y*,d*). Then (Y*,d*,J1*) is a Lebesgue measure space, and the shift transformation <P is a bijective measure-preserving transforma- tion. Considered with the product measure, <P is called a generalized Bernoulli shift. When the set Y is at most countable and the measure J1 on Y is given by a sequence {Pj} of positive numbers with Lpj= 1, <P is called a Bernoulli shift. Suppose that P(y, A) is a Markov tran- sition function on (Y, d) and n is a proba- bility measure invariant under P(y, A). Then we can define the Markov measure n* on the product space (Y*, d*) by setting n*(E*) = L+, .., L+1 L n(dYo)P(Yo,dy,) x P(YI' dY2)'" P(Ys-I, dys), for a cylinder set E*=(D 1j) xCJI+s Ak) xCrr, Y;.), and extending it to all of d*. The shift trans- formation <p preserves the Markov measure n* Considered as a measure-preserving trans- formation on (Y*, d*, n*), <p is called a Mar- kov shift. 
136D Ergodic Theory A generalized Bernoulli shift is always ergodic. A Markov shift is ergodic if and only if the corresponding Markov process is tirre- ducible, which is the case if and only if the following property is satisfied: For every pair of sets A and B with n(A)n(B) > 0, there exists an neZ+ such that SApn(y,B)dn>O. There are other measures besides the prod- uct measure and the Markov measure that can be defined on the product space (Y*,d*) and are invariant under the shift transforma- tion <po For example, if Y is a Borel subset of Rand d is the IT-algebra of Borel subsets of Y, then any tstationary stochastic process taking values in Y induces such a measure on (Y*,d*). When considered with a measure of this type, the shift transformation <p is called the shift associated with the stationary process. Properties of the shift associated with a tstationary Gaussian process have been inves- tigated by G. Maruyama, 1. Girsanov, H. Totoki, and others. In particular, it is known that the shift is ergodic if and only if the tspec- tral measure for the tcovariance function of the associated Gaussian process is continuous [15]. (4) Other important examples of measure- preserving transformations arise from classical dynamical systems, which will be described in Section G. (5) There are several ways of constructing new measure-preserving transformations from given ones, We describe important cases. (i) Let <p be a nonsingular, measurable, recurrent transformation (not necessarily measure-preserving) on a IT-finite measure space (X, 28, m), and let A be a set of positive measure. For xEA, let n(x)=min{nEZ+ I <pn(x)eA}. The transformation <PA:A-->A de- fined by <PA(X) = <pn(x)(x) is a nonsingu1ar mea- surable transformation on the measure space (A,28nA,m A ), where mA(B)=m(AnB)/m(A), and it is measure-preserving if <P is. We call <PA the transformation induced by <P on A. It is ergodic if <P is ergodic. (ii) Let <P be a nonsingu1ar measurable transformation On a IT-finite measure space (X, 28, m), and suppose that {An} is a countable (possibly finite) partition of X. Define a func- tionf:X -->z+ by settingf(x)==n for xEA n , and let X={(x,j)lxEX, l.s;j.s;f(x)} be a subspace of the product measure space (X x Z +,28 x CC, m x J1), where CC is the IT-algebra of alJ subsets of Z +, and J1 is the measure on (Z +, CC) defined by J1( {n})= I for each nEZ +. The transformation ip defined on X by ip(x,j) =(x,j + 1) if I.s;j<f(x) and = (<p(x), 1) ifj= f(x) is a nonsingular measurable transforma- tion on the measure space (X, (28 x CC)nX, (m x J1)x), and it is measure-preserving if <p is. We 536 call ip the transformation built from <p with the ceiling function f If <p is ergodic and m(An)-->O as n --> 00, then ip is also ergodic. (iii) Suppose that I/J is a measure-preserving transformation on (X, 28, m) and <px is a measure-preserving transformation on (Y,d,J1) for each XEX. Assume that the map- ping (x, Y)--><Px(y) is measurable with respect to the IT-algebras 28 x d and d. The trans- formation 0 defined on the product space (X x Y,28 x d, m x J1) by O(x, y)=(I/J(x), CfJAy» is measure-preserving and is called the skew product of I/J and {<Px}. If <Px= <p for all XE X, then we get a direct product transformation O(x, y)= (I/J(x), <p(y»), (iv) A measure-preserving transformation <p on (Y, d, J1) is said to be a factor transforma- tion (or a homomorphic image) of a measure- preserving transformation I/J on (X, 28, m) if there exists a measurable transformation 11 from X onto Y such that m0I1-1 =J1 and <P11= 111/J. If 28' is a IT-sub algebra of 28 and I/J leaves 28' invariant (i,e., I/J -I 28' C 28'), then I/J induces a factor transformation <p on the measure space (X, 28', m). Conversely, if a measure-preserving transformation <p on (Y, d, J1) is a factor trans- formation of I/J on (X, 28, m) via a mapping 11, then 28' = 11- 1 d is a IT-subalgebra of 28 invar- iant under I/J. (6) A one-parameter family {<p, I te R} of bijective measure-preserving transformations on a measure space (X, 28, m) is called a flow. A flow is called continuous if the mapping t--> T, is tweakly continuous where {T,} is the one-parameter family of tunitary operators on L 2(X) induced by the flow {<p,}. A flow is called measurable if the mapping (t, x)--><p,(x) is a measurable transformation of R x X into X. A measurable flow is continuous. A. Ver- shik and Maruyama proved that for any con- tinuous flow there exists a measurable flow, unique in a specified sense, which is spatially isomorphic (in the sense specified in Section E) to the given flow. Important examples of flows are given by classical dynamical systems ( Section G), and by continuous-time stationary stochastic processes. An important tool in the study of flows is provided by the theorem of W, Ambrose and Kakutani: Every measurable ergodic flow without a fixed point is spatially isomorphic to an S-flow. A measurable flow {<p,} is called an S-flow (special flow or flow built under a func- tion) if there exist a measure-preserving trans- formation <p of a measure space (X, 28, m) and an R + -valued function f on (X, 28, m) such that each <p, is a measure-preserving trans- formation on the subspace X = {(x, u)1 xeX, O.s;u.s;f(x)} of the product measure space 
537 (X x R+,Y6' x At,m x..i.) given by <p,(x, u) =(x,u+t) if -u.s;t< -u+ f(x), =(<pn(x),u+t- >(<pk(X))) if "-1 11 -u+ L f(q/(x).s;t< -u+ L f(<pk(X», k=O kO =(<p-n(x),u+t+J_/(<pk(X))) if I -I -u- L f(<pk(X)).s;t< -u- L f(<pk(X)), k= -" k== -n+l n;" 1. Here At is the IT-algebra of Borel subsets of R +, and ..i. is the usual Lebesgue measure. E. Isomorphism Problems In this section we assume that the Lebesgue measure spaces considered are probability spaces. For simplicity, following the common usage among Russian mathematicians, we call a measure-preserving transformation on (X, f1JJ, m) an endomorphism and a bijec- tive measure-preserving transformation an automorphism. An automorphism <PI (a flow {<p:')}) on (Xl' f1JJ 1 ,md is said to be spatially isomorphic (or metrically isomorphic) to an automorphism <P2 (a flow {<p,(2)}) on (X 2 , f1JJ 2 , m2) if there exist sets N 1 and N 2 with m 1 (Nd=m 2 (N 2 )=0 and a bijective measurable transformation 0 from X1-N I toX2-N2suchthatm200=ml and O<Pl = <P20 (O<p,(l) = <p,<2)O for each t). Classification of automorphisms and flows into isomorphism classes constitutes the cen- tral problem of modern ergodic theory. Prop- erties of automorphisms and flows that are preserved under spatial isomorphisms are called isomorphism invariants (or metric in- variants). There are several isomorphism in- variants that are essential to the study of the isomorphism problem. We describe these below for the case of automorphisms. There are corresponding invariants for flows as well. (1) Spectral Invariants. Two automorphisms <PI and <P2 are said to be spectrally isomorphic if the tunitary operators TI and T 2 induced by <PI and <P2 on the Hilbert spaces L 2 (Xd and L 2 (X 2 ), respectively, are unitarily equivalent (i.e., there exists an isometric isomorphism V of L 2 (Xd onto L 2 (X 2 ) such that VT I = T 2 V). Properties preserved under spectral isomor- phisms are called spectral invariants (or spec- tral properties), If <PI and <P2 are spatially iso- morphic, it is clear that they are spectrally iso- 136 E Ergodic Theory morphic; but the converse is not true in general. (i) The property of <P being ergodic is a spectral property since <P is ergodic if and only if the number 1 is a simple eigenvalue of the induced unitary operator T. If <P is ergodic, then the set of all eigenvalues of the induced operator T forms a subgroup of the circle group, each eigenvalue is simple, and each teigenfunction has constant absolute value. If the tspectrum of the induced operator T con- sists entirely of eigenvalues, <P is said to have discrete spectrum (or pure point spectrum). A theorem due to von Neumann and P. Halmos - the first theorem on the question of isomor- phism-states that two ergodic automor- phisms <PI and <P2 with discrete spectra are spatially isomorphic if and only if they are spectrally isomorphic, which is the case if and only if the induced operators TI and T 2 have the same set of eigenvalues. Furthermore, every ergodic automorphism with discrete spectrum is spatially isomorphic to an ergodic rotation on a compact Abelian group [2]. Analogous results were obtained by L. Abramov for a bigger class of automorphisms, namely, for ergodic automorphisms having so- called quasidiscrete spectra. (ii) An automorphism <P is ergodic if and only if for every pair of sets A, B, 1 ( n1 ) !;;- kO m(<pk(A)nB) =m(A)m(B). Strengthening this condition, we can define <P to be weakly mixing if for every pair of sets A, B, 1 ( n-l ) !;;- kolm(<pk(A)nB)-m(A)m(B)1 =0; strongly mixing if lim m(<pk(A)nB)=m(A)m(B); koo and k-fold mixing if for arbitrary choice of sets Aj,j=O, 1,..., k, limm(Ao n <pn 1 A I n <pn 2 A 2 ... n <pnkAd = m(Ao)m(Ad", m(Ad, where the limit is taken as n" n2,"', n k --> 00 in such a way that n 1 <n2 < ... <n k and mini <;jo(nj- nj-d--> 00. The property of an automorphism <p being weakly mixing or strongly mixing is a spectral property. For instance, <p is weakly mixing if and only if the number 1 is a simple eigenvalue and is the only eigenvalue of the induced operator T. It is also known that <p is weakly mixing if and only if the direct product automorphism <p x <p is ergodic. The set of all weakly mixing auto- morphisms forms a dense tGo-set in the group 
136E Ergodic Theory of all automorphisms on (X, flJ, m) considered with the so-called weak topology (Halmos's theorem). On the other hand, it was shown by V. Rokhlin that the set of all strongly mix- ing automorphisms is a set of first category with respect to the weak topology. However, there are only a few known examples of auto- morphisms that are weakly mixing but not strongly mixing. (iii) An automorphism <p is said to have countable Lebesgue spectrum if the maximal spectral type of the induced unitary operator T restricted to the torthocomplement of the subspace of constant functions in L 2 (X) is equivalent to the Lebesgue measure and its tmultiplicity is countably infinite. (iv) An automorphism <p is called a K- automorphism (or Kolmogorov automorphism) if there exists a O'-subalgebra f1JJ o of f1JJ such that (a) <pf1JJ o =:>f1JJ o and <pf1JJ o #f1JJ o , (b) VnEZ<pnf1JJ O =flJ, and (c) /\nEZ<pnf1JJ O =JV, where JV is the 0'- subalgebra of f1JJ consisting of null sets and their complements. The notion of a K-flow (or Kolmogorov flow) is defined similarly, K- automorphisms are k-fold mixing for all orders k and have countable Lebesgue spectra. Generalized Bernoulli shifts are all K- automorphisms, An ergodic Markov shift is a K -automorphism if and only if it is strongly mixing, which is the case if and only if the corresponding Markov process is tirreducible and taperiodic ( 260 Markov Chains B). A continuous group automorphism of a compact Abelian group is a K-automorphism if and only if it is ergodic. In particular, a continuous group automorphism of the n-dimensional torus Tn is a K-automorphism if and only if no roots of unity appear among the eigenvalues of the representing matrix. Automorphisms and flows arising from classical dynamical systems also provide examples of K-automorphisms and K-flows. In particular, a tgeodesic flow on a surface of negative curvature is a K -flow, and each automorphism (except the identity) of this flow is a K-automorphism, For the shift transformation <p associated with a stationary Gaussian process. it was shown by Maruyama [15] that <p is (a) weakly mixing if and only if it is ergodic, (b) strongly mixing if and only if the covariance function of the associated Gaussian process tends to 0 as n-->oo, and (c) a K -automorphism if and only if the tspec- tral measure of the covariance function is absolutely continuous with respect to the Lebesgue measure, (v) Examples of automorphisms having various types of spectra have been constructed by a number of authors by using stationary Gaussian processes and the theory of approxi- mation developed by A. Katok and A. Stepin [16]. 538 (vi) An ergodic automorphism with quasi- discrete spectrum has a mixed spectrum, that is, the spectrum of the induced unitary opera- tor T has a continuous component and eigen- values in addition to 1. Anzai (1951) con- structed a special class of skew product auto- morphisms having mixed spectra and showed that in this class there are automorphisms that are spectrally isomorphic but not spatiaIly isomorphic. However, the question of whether two spatially nonisomorphic automorphisms exist among automorphisms having the same purely continuous spectrum remained un- answered for a long time, until in 1958 Kol- mogorov (Dokl. Akad. Nauk SSSR, (5) 119 (1958)) settled it affirmatively by using a new isomorphism invariant called entropy. 2. Generators and Entropy. (i) By a partition  = {A J of the space X we mean a collection of sets AA such that AA n Ax = 0 whenever ..1. #;: and U AA = X. We denote by E the partition of X into individual points, and by v the trivial partition {X}. A partition into a finite (count- able) number of sets is called a finite (count- able) partition. A partition  is said to be finer than another partition (or ( is coarser than ) if for every A E  there is a set BE ( such that A c B. For a collection {.} of partitions of X, we denote by V,, the coarsest partition that is finer than each "' and by /\,, the finest partition that is coarser than each ,. If k = {Ak,n} is a sequence of countable (or finite) partitions, then Vkk is precisely the parti- tion of X into nonempty intersections of the form nk Ak,nk with Ak,nk E k for each k. With a partition  of X we associate a O'-subalgebra f1JJ() of f1JJ which is the O'-algebra of all f1JJ- measurable sets that are a union of elements in . Two partitions  and ( are said to coincide a.e. if f1JJ() = f1JJ( () a.e. (i.e., for every A E f1JJ() there exists a set B in f1JJ( () such that m( AU B - A n B) = 0 and conversely). (ii) Suppose that <p is an endomorphism of (X, f1JJ, m) and  a partition of X. By <p -1  we mean the partition {<p -I (A) I AE }. If <p is an automorphism, we also define <p = {<p(A) I A E }. A partition  is called a generator for an endomorphism <p if Vna; 0 <p -n  = E a.s. If vna; -00 <pn  = E a.s.,  is caIled a two-sided gen- erator for an automorphism <p, An endomorphism <p is said to be periodic at a point x E X if there exists a positive integer n such that <pn(x) = x and aperiodic if the set of points of periodicity has measure zero. If the measure space (X, f1JJ, m) is nonatomic, then every ergodic endomorphism on it is aperi- odic. A theorem of Rokhlin states that every aperiodic automorphism <p has a countable two-sided generator. This implies that every such <p is spatially isomorphic to the shift 
539 transformation on the infinite product space (Y*, d*) considered with some invariant mea- sure J1*, where each coordinate space Yj= Y has at most a countable number of points. Krieger improved this result by showing that if an ergodic automorphism <P has finite entropy, then <P has a finite two-sided generator [17]. (iii) For a finite or countable partition  = {An}, define the entropy H () of the partition to be - Lnm(An)log(m(An)) (the logarithms here and below are natural logarithms). We denote by ::z the set of all partitions  with H()< 00. If E::Z, then for any endomorphism <P the limit h(<P,)=lim ( H ( V <p-k )) n_Cf) n k=O exists and is finite. The entropy h(<p) ofthe endomorphism <P is defined to be sup{ h(<p,) I  E::Z} and is an isomorphism invariant. Prop- erties of entropy have been investigated exten- sively since the notion was introduced by Kolmogorov. We cite a few results. (a) If a partition E::Z is a generator for an endomorphism <p or a two-sided generator for an automorphism <p, then h(<p) = h(<p,) (Sinai's lemma). (b) For every integer n, h(<pn) = Inlh(<p), and for a measurable flow {<p,}, h(<p,) = Itlh(<pI) for every real number t. (c) If an automor- phism <p is periodic, then h( <p) = O. (d) If <PI is a factor transformation of <Pz, then h(<pI).s; h( <pz). (e) h(<pI x <p2)=h(<Pl)+h(<P2)' (There is a more complicated formula (due to Rokhlin) for the entropy of a skew product automorphism.) (f) If <p is a recurrent automorphism and <PA is the automorphism induced by <P on a sub- set A with m(A) >0, then h(<PA) = h(<p)/m(A) (Abramov's formula). (g) If <P is a Bernou\1i shift with probability distribution {Pn}, then h(<p) = - LnPnlogPn' (h) If <p is a Markov shift based on the Markov transition probability Pij (defined on a countable or finite state space) and an invariant measure n i , then h(<p)== - LLniPijlogPij' , J (i) If <p is ergodic and has a pure point spec- trum, or more generally, has a quasidiscrete spectrum, then h(<p)=O. (j) For an ergodic group automorphism <p on an n-dimensional torus, h(<p)=LlogIAI, where the sum is taken over all eigenvalues ..1. of modulus > 1 of the representing matrix. (k) If an automorphism <p has positive entropy, then in L 2 (X) there exists a subspace invariant under the induced uni- tary operator T such that the spectrum of T restricted to this subspace is countable Le- besgue (Rokhlin's theorem). It fo\1ows from (k) that automorphisms with tsingular spectra or spectra of finite multiplicity must have zero entropy. In proving assertion (g), Kolmogorov 136E Ergodic Theory established for the first time the fact that there are uncountably many spatia\1y nonisomor- phic Bernoulli shifts. (iv) An automorphism <p is said to have completely positive entropy if h(<p,) > 0 for every partition  #v. It was shown by Rokhlin and Sinai that an automorphism <p has com- pletely positive entropy if and only if <p is a K- automorphism. M. Pinsker proved that for every automorphism <p there exists a partition, ca\1ed the Pinsker partition, that is invariant under <p and such that the factor transforma- tion of <p with respect to this partition has zero entropy and is the largest among the factor transformations of <p with zero entropy. (v) Rokhlin showed that h(<p)=O for an endomorphism <p if and only if all of its factor transformations are automorphisms. An endo- morphism <p is ca\1ed exact if /\o ({J -nl; = V a.e. Rokhlin introduced a way to associate with each endomorphism a certain automorphism, called the natural extension, which reflects the properties of the endomorphism. For example, an endomorphism and its natural extension are simultaneously ergodic or nonergodic, are mixing of the same order, and have equal entropy. The natural extension of an exact endomorphism is a K -automorphism. (vi) Automorphisms <PI and <pz are said to be weakly isomorphic if each of them is a factor transformation of the other. Sinai proved (1964) that for each ergodic automorphism with positive entropy, there exists a factor automorphism having the same entropy and isomorphic to a Bernoulli shift, and hence it follows in particular that Bernoulli shifts with the same entropy are weakly isomorphic. Ornstein (Adv. in Math., 4 (1970)) went further and succeeded in proving the fo\1owing re- markable result: Two Bernoulli shifts with equal entropy are spatially isomorphic. Par- tial results in this direction were obtained earlier by L. Meshalkin, J. Blum, and D. Han- son. In the proof of Ornstein's theorem, essen- tial use was made of the following theorem of C. Shannon and B. McMillan, which plays a fundamental role in information theory ( 213 Information Theory): Suppose that <p is an ergodic endomorphism on (X,,m) and  is a partition of X. For a point XEX, let An(x) denote the element in the partition V/:::J <p -k  that contains x, Then for almost all x, Iim ( - 1 0gm(An(X)) ) "--+ei) n exists and eq uals h( <p, ). (vii) Techniques and ideas developed by Ornstein in his proof of the isomorphism theorem have been refined and extended fur- ther by himself, B. Weiss, N. Friedman, M. Smorodinsky, and others, and numerous re- 
136 E Ergodic Theory suIts were subsequently obtained on the iso- morphism problem. We describe below some of the main results and basic concepts; for more detailed accounts  [21-24]. A sequence {n} of partitions is said to be independent if for every choice of sets AI, '" , As with AkEnk' m(nt1 A k )= IIt1 m(Ak) when- ever n l' n 2 , ... , ns are all distinct. For a fixed a> 0, two partitions  and ( are said to be a- independent if L Im(A nB)-m(A)m(B)1 <a. AE,BE( A pair (cp, ), where <p is an automorphism of(X,,m) and  is a finite (or countable) partition of X, is called a process (on X). (Vn -00 <pn) is a IT-subalgebra of  invariant under <p, and <p restricted to this IT-subalgebra is a factor automorphism of <p and is isomor- phic to a shift transformation on an infinite product space, where each coordinate space has the same number of points as the number of atoms in . A process (<p,) is caIled a Ber- noulli process (or an independent process) if the sequence of partitions {<pnlnEZ} is indepen- dent, and a weak Bernoulli (W.B.) process if for every a> 0 there exists a k > 0 such that the partitions Vi:'+kn<pi( and V?_n<pi are a- independent for all n  O. Let J be a finite set and ex, [3 be two functions from the set {l,2, ...,N} to J. By d N (ex,[3) we denote (ljN) # {n I ex(n) # [3(n)}, where # denotes the car- dinality. d N defines a metric on the space J{I,2, . ,N} called the Hamming distance. Now, for a process (<p,) On X with = {AjIjEl}, we define for x, YE X, d(x, y) to be equal to dN((X), (y)), where for xeX, (x) is the point (j(x),j(<p(x)), ...,j(<pN-I(X)))EJP,2, ..,N} andj(x) is the indexjEJ for which xEA j . A process (<p, ) is called very weak Bernoulli (V.W.B.) iffor any a >0 there exists an No such that whenever N> No there is a set G with m(G» I-a belonging to  (VnO-oo <pn) satisfying the following condition: for any AE (V=-OO <pn) with A c G and m(A» 0, there exists a probability measure v on X x X satisfying (i) v(E x X)=m(E) and v(X x F)= mA(F) for all E, FE and (ii) Sxxxd(x,y)dv < a. It is not difficult to show that W.B. pro- cesses are V.W.B. For processes (<p,) On (X,, m) and (ip,) On (X,:ii, m), where both  and  are indexed by the same finite set J, we define aN«<p, ), (ip, » to be inf{Jx x XdN((X), (x))dp(x, x)}, where inf is taken over all probability measures p on X x X satisfying p(E x X)=m(E) for all EE and p(X x E)= m(E) for all EE:ii. aN decreases with N, and we define a«ip, ), (ip, ) to be \imNoo a,v«<p, ), (ip, m. FinalIy, a process (rp, 0 is said to be finitely determined (F.D.) if for any a >0 there exists a b> 0 and N such that if (ip, ) is any 540 process satisfying (i)  and  are indexed by the same set J, (ii) h( <p, ) - h(ip,) < b, and (iii) aN«<p,), (<p,»<b, then a«<p,), (ip,))<a. Let us caIl a partition  an F.D. generator for an automorphism <p if  is a two-sided gen- erator for <p and the process (<p,) is finitely determined. A general isomorphism theorem proved by Ornstein states: If automorphisms <p and ip have the same entropy and both have F.D. generators, then <p and ip are isomorphic. Ornstein and Weiss proved further that the process (<p,) is F.D. if and only if it is V.W.B. From these basic results a number of results can be deduced. (a) Nontrivial factors of Ber- noulli shifts are Bernoulli. (b) Strongly mixing Markov shifts have two-sided weak Bernoulli generators and hence are isomorphic to Ber- noulli shifts. (c) Every Bernoulli shift can be embedded in a flow, which implies among other things that Bernoulli shifts have roots of all orders. (d) It was shown by Y. Katznelson that every ergodic group automorphism of an n-dimensional torus has a two-sided generator which can be shown to be V.W.B" and hence is also isomorphic to a Bernoulli shift. R. Adler and B. Weiss earlier proved by entirely differ- ent methods that on a 2-dimensional torus, two ergodic group automorphisms having the same entropy are spatially isomorphic. They have done this by constructing a two-sided generator  for such automorphisms which is also a Markov partition ( Section G). (viii) Among further positive results there are the foIl owing: (a) Except for a trivial change in the time scale, any two Bernoulli flows are isomorphic (Ornstein), (b) Every ergodic group automorphism of a compact group is isomorphic to a Bernoulli shift (Thomas and Miles; Lind). (c) A number of automorphisms and flows arising from classi- cal dynamical systems are shown to be Ber- noulli ( Section G). (d) Examples of exact endomorphisms arise in connection with prob- lems in number theory; for example, tcon- tinued fraction expansion and [3-expansion. The natural extensions of the exact endomor- phisms associated with continued fraction expansion and [3-expansion are now known to be spatially isomorphic to Bernoulli shifts, (ix) Since many of the automorphisms that have been shown to be K -automorphisms are nOw known to be isomorphic to Bernoulli shifts, it is natural to expect that every K- automorphism is in fact Bernoulli. However, Ornstein (1973) constructed an example of a K-automorphism that is not a Bernoulli shift. A lot of work has since been done to investi- gate how bad K -automorphisms can be. It turns out that K-automorphisms share almost none of the finer properties of Bernoulli shifts. For example: (a) There are uncountably many 
541 nonisomorphic K-automorphisms of the same entropy (Ornstein and P. Shields). (b) There is a K-flow that is not Bernoulli, and there are uncountably many nonisomorphic K-flows having the same entropy at time one (Smoro- dinsky). (c) There exists a K-automorphism not isomorphic to its inverse (Ornstein and Shields). (d) There exists an automorphism that cannot be written as a direct product of a K-automorphism and an automorphism with zero entropy, This example shows that a con- jecture made earlier by Pinsker was false (Orn- stein). (e) There are weakly isomorphic K- automorphisms that are not isomorphic (Pol it and Rudolph). F. Weak Equivalence and Monotone Equivalence (1) Weak Equivalence. In order to construct examples of tfactors in the theory of von Neu- mann algebras ( 308 Operator Algebras), F. Murray and von Neumann considered various ergodic groups of bimeasurable non- singular transformations on a finite measure space. In this context a group '!f = {g} of trans- formations is called ergodic if m«g -1 (B)U B)- (g-I(B) n B) = 0 for every gE'!f implies either m(B)=O or m(X -B)=O. A measure J1 is said to be invariant under the group '!f = {g} if J1 is invariant under every transformation 9 in '!f. Murray and von Neumann's construction (the so-called group measure space construction) gives a type II I tfactor if the group admits a finite equivalent invariant measure, a type 1100 factor if it admits an infinite (but IT-finite) equiv- alent invariant measure, and a type III factor if it has no IT-finite equivalent invariant measure. In this connection we note that Hajian and Y. Ita extended the theorem of Hajian and Kaku- tani and proved that an arbitrary group '!f = {g} of nonsingular bimeasurable transforma- tions admits a finite invariant measure if and only if no set of positive measure is weakly wandering under the group '!f (a set W is said to be weakly wandering under a group '!f if there exists an infinite subset {gnlnEZ+} of'!f such that gn(W)n gk(W)= 0 for n #k). Analo- gously to the terminologies used in the theory of von Neumann algebras, we define ergodic countable group '!f (or an ergodic bimeasurable nonsingular transformation <p) to be of type 11 1 ,11 00 , or III if'!f (or <p) has a finite equivalent invariant measure, a IT-finite infinite equivalent invariant measure, or no IT-finite equivalent invariant measure, respectively. For a bimeasurable nonsingular transfor- mation <p, define the full group [<pJ to be the group of all bimeasurable nonsingular trans- formations Ij; such that, for some n = n(x), Ij;(x) 136F Ergodic Theory = <pn(x) for almost all x. Two transformations <PI and <P2 are said to be weakly equivalent if there exists a bimeasurable nonsingular trans- formation (J such that (J[<PI](J-I = [<P2]. H. Dye proved that any pair of type II I transforma- tions are weakly equivalent. It easily follows from this that the same is true for type 1100 transformations. Krieger showed, on the other hand, that among type III transformations there are uncountably many weakly non- equivalent ones. In fact, Krieger [28J intro- duced an invariant for weak equivalence, called the ratio set, by means of which he classified type III transformations into mutu- ally weakly nonequivalent subclasses ilIA for O.s;A.s; 1. Krieger showed further that (a) for O<)..s; 1, every pair of transformations in the class IlIA is weakly equivalent; (b) in the class 1110 there are uncountably many mutually weakly nonequivalent transformations; and (c) two ergodic transformations are weakly equivalent if and only if the corresponding factors constructed via the group measure space construction are h-isomorphic. An ergodic flow of nonsingular transformations (not necessarily measure-preserving) called the associated flow was constructed independently by Krieger [29] and by T. Hamachi, Y. Oka, and M. Osikawa [30], and was shown to give another effective invariant for weak equiva- lence. Krieger showed that the mapping which assigns to each ergodic transformation its associated flow gives a tbijective mapping between the set of all weak equivalence classes of ergodic type 1110 transformations and the set of all isomorphism classes of aperiodic conservative ergodic flows. In this connection the theorem of U. Krengel and I. Kubo on the representation of ergodic flows, extending the theorem of Ambrose and Kakutani mentioned in Section D, plays a significant role. A bimeasurable nonsingular transformation (J is called a normalizer of another transforma- tion <P if it satisfies O[<pJ=[<pJ(J. The set of all normalizers JV(<p) of a transformation <p forms a group called the normalizer group which contains the fuIl group [<pJ as a subgroup. One can introduce a suitable topology to JV(<p) to make it a complete separable metrizable group. Hamachi [31J has shown that, for type III transformations <p, the quotient group uY( <p )/[ <p ] - , where [<p J - denotes the closure of [<pJ, is algebraically and topologically isomor- phic to the tcommutant of the associated flow. Results obtained by Krieger and others mentioned above were motivated in part by corresponding developments in von Neumann algebra theory due mainly to A. Connes ( 308 Operator Algebras). From the recent deep result of Connes on the uniqueness of tapprox- imately finite-dimensional factors of type 1100 
136G Ergodic Theory it follows that every approximately finite- dimensional factor with the exception of type 111 1 is *-isomorphic to a factor constructed from an ergodic transformation via the group measure space construction. (2) Monotone Equivalence. The theory of weak equivalence discussed above deals with the structure of orbits of a transformation or of groups of transformations, In fact, transfor- mations <p and Ij; are weakly equivalent if and only if there exists a bimeasura ble nonsingular transformation 0 mapping the <p-orbit of almost all x onto the Ij;-orbit of O(x). A some- what more stringent notion of equivalence dealing with orbit structure is called mono- tone equivalence or Kakutani equivalence. We say that measurable flows {<p,} and {ip,} of measure-preserving transformations on finite measure spaces (X,:JJ, m) and (X,:Jl, m), respec- tively, are monotonely equivalent if there exists a bimeasurable measure-preserving transformation (J on X to X such that for almost all XEX and all tER, O<p,(x)==ip«t,X)O(x), where r(t, x) is a monotone increasing function of t. Two S-flows ( Section D) built over the same base transformation with different ceiling functions are monotonely equivalent, and it can be shown that monotonely equivalent ergodic flows are isomorphic to S-flows built over the same base transformation. This in- duces an equivalence relation on transforma- tions, also called monotone equivalence or Kakutani equivalence: Two transformations <p and ip are monotonely equivalent if they can serve as base transformations of the same (or equivalent) flow. This notion of equivalence was introduced by Kakutani in [32J, where he showed that <p and ip are monotonely equiva- lent if and only if there are sets E c X and E c X such that the induced transforma- tions ( Section D) <PE and ipE are isomorphic. Abramov's formula mentioned in Section E implies that there are at least three classes of transformations (and flows) that are mutually nonequivalent: transformations (flows) of zero, finite, and infinite entropy. Nothing much was done in this equivalence theory for a num- ber of years until in 1975 1. Feldman and E. Satayev independently showed that there are monotonely nonequivalent transformations within each class of zero, positive, and infinite entropy. Since then extensive work has been done by Feldman, Satayev, A. Katok, Orn- stein, Weiss, D. Rudolph, M. Ratner, and others, and numerous results have been ob- tained, For detailed accounts of this devel- opment  [22,24,33,34]. The main idea employed by Feldman and Satayev was to introduce a new metric called an J-metric and then to define a notion corresponding to 542 V.W.B. of the isomorphism theory described in Section E by using an f- instead of a II- metric. They then showed that this property, called loosely Bernoulli (L.B.) by Feldman and monotonely very weak Bernoulli (M.V.W.B.) by Satayev, stays invariant under monotone equivalence, and that there exist transforma- tions with and without this property. An J- metric is defined by starting off with an fN- metric on the space J{ 1,2, ,N} instead of the Hamming distance d N and proceeding in exactly same way as for the definition of II, namely, by extending fN to f and IN and finaIly to J For ex and fJ in J{I,2" ,N}, fN(ex,fJ) is defined by setting 1 - fN(ex, fJ) equal to l/N times the maximal integer n for which there are positive integersjl <j2<'" <jn' k l <k 2 < '.' < k n with ex(j;) = fJ(k;), i = 1, ,.., n. Ornstein and Weiss discovered that by substituting I for done can develop a theory of monotone equivalence that parallels the isomorphism theory described in Section E. One can define a process (<p,) to be finitely fixed (F. F.) by substituting Ifor din the definition of F.D. process, and, as was mentioned earlier, to be L.B. by doing the same in the definition of the V.W.B. process. Ornstein and Weiss proved: (a) If <p and ip have zero entropy, finite posi- tive, or infinite entropy and if both have F.F. generators, then they are monotonely equiv- alent. (b) Let <p have an F.F. generator. If h(<p) = 0, then <p is monotonely equivalent to an irrational rotation of the circle. If 0 < h(<p) < 00, then <p is monotonely equivalent to a Bernoulli shift of finite entropy. If h(<p) = 00, then <p is monotonely equivalent to a BernouIli shift of infinite entropy. (c) If <p has an F.F. generator, then (<p,) is F.F. for all nontrivial partitions . (d) (<p,) is F.F. if and only if it is L.B. It is known further that within each entropy class there exist uncountably many monotonely nonequivalent transforma- tions. A measurable flow {<p,} is called L.B. if it can be represented as an S-flow built over an L.B. transformation. The L.B. flows of zero entropy are those monotonely equivalent to a tKronecker flow on a 2-dimensional torus, and L.B. flows of finite positive (infinite) entropy are those monotonely equivalent to the Bernoulli flow of finite (infinite) entropy. The direct product of an L.B. flow and a Ber- nouIli flow is L.B., while it was shown by M. Ratner that the horocycle flow ( Section G) is L.B. but its direct product with itself is not. G. Classical Dynamical Systems By a classical dynamical system we mean a tdiffeomorphism or a flow generated by a smooth tvector field on some tdifferentiable 
543 manifold M n . Such a system is nonsingular with respect to a measure defined by any tRiemannian metric on M n . For a fixed Rie- mannian metric, we cal1 measures smooth if they have a smooth density with respect to the measure given by the metric. (1) Among classical dynamical systems, geo- desic flows have been investigated most exten- sively. Let l (M) be the unitary ttangent bun- dle over the manifold Mn. A point (x, e) E I (M) defines a unique tgeodesic through x in the direction of e. The geodesic flow on I (M) is the flow defined by <p,(x, e) = (x" e,), where x, is the point in M n reached from x after time t under a motion with unit speed along the geodesic determined by (x, e), and e, is the unit vector at x, tangent to the geodesic, The classical tLiouville theorem in this con- text implies that the measure on I (M) that is the product of the measure on M n induced by the metric and the Lebesgue measure on the (n-1)-dimensional sphere gives a smooth in- variant measure for the geodesic flow. A wide class of systems arising from mechanics can be described as geodesic flows. Hopf and G. Hedlund proved that if the manifold M n is compact and has constant negative curvature, then the geodesic flow is strongly mixing. Later, by using the theory of group trepresentations, I. Gel'fand and S. F omin proved that the spectrum of a geodesic flow on a compact manifold of constant nega- tive curvature is Lebesgue, and is even count- able Lebesgue in the case where the mani- fold is of dimension 2. F. Mautner and later L. Auslander, L. Green, and F. Hahn extended this algebraic method to flows obtained under the action of some one-parameter subgroup of a tLie group acting on its thomogeneous space and obtained extensive results for the case of tnilpotent and some tsolvable Lie groups [35]. (2) The flow on an n-dimensional torus de- fined by <p,(x l , x 2 ,..., x n ) =(x I +w I t, X 2 +w 2 t,..., X n +wnt) is called a translational flow or a Kronecker flow. The numbers WI' W2, ..., W n are called frequencies. Every orbit of {<p,} is dense in the torus if and only if the frequencies are linearly independent over Z. The motion under a translational flow with independent frequen- cies is called a quasiperiodic motion. A trans- lation flow for a quasiperiodic motion has discrete spectrum. (3) Sinai obtained a useful criterion for a classical dynamical system to be a K-system. Let M n be compact, and suppose that {<p,} is a flow on M n defined by a smooth vector field and preserving some smooth measure J1. A 136G Ergodic Theory one-parameter group {!/t,} of transformations of the space M n , given by a vector field, is called the flow transversal to the flow {<p,} if (i) the decomposition of the space Mn into the trajectories of the flow {!/t,} is invariant under {<p,}; (ii) the limit limso limho01(t, x) - t)/ts = ex(x) exists for the function tv,(t, x), which is defined to be the time length of the segment {<Ps!/tu(x) I O.s; u.s; t} of the trajectory of the flow {!/t,}. Sinai's fundamental theorem states that if a flow {<p,} is ergodic and has a transversal ergodic flow {!/t,} for which J ex(x)dJ1 < 0, then {<p,} is a K-flow. Ifex(x) <0, then we can even drop the assumption that {<p,} is ergodic. If J ex(x)dJ1>O, the theorem holds for the flow { <p-,}. A geodesic flow on a 2-dimensional mani- fold of constant negative curvature always has a transversal flow, called a horocycle flow. The ergodicity of a horocycle flow was proved by Hedlund. It follows from Sinai's fundamental theorem, therefore, that a geodesic flow on a surface of constant negative curvature is a K-flow. Sinai proved even more: A geodesic flow on any surface of negative curvature is a K-flow. There is an extension of the notion of transversal flow to higher dimensions, called transversal field. Using this notion Sinai proved that a geodesic flow on a manifold (of any dimension) of constant negative cur- vature is a K -flow. Finally, Ornstein and Weiss established that geodesic flows on com- pact manifolds of negative curvature are Bernoulli. (4) D. Anosov considered a class of flows and diffeomorphisms satisfying a condition that characterizes unstable motions such as geodesic flows on a manifold of negative cur- vature. They are now called Anosov flows (or Y-flows) and Anosov diffeomorphisms (or Y- diffeomorphisms) ( 126 Dynamica1 Systems), Anosov proved that if an Anosov flow has a smooth invariant measure, then it is ergodic and either it has a continuous nonconstant eigenfunction or it is a K-flow. Anosov diffeo- morphisms with smooth invariant measures are K-automorphisms. Sinai constructed for ttransitive Anosov diffeomorphisms (and for Anosov flows) a special partition of the under- lying manifold M called a Markov partition having desirable properties. The importance of such partitions lies in the fact that they enable one to represent such diffeomorphisms as Markov shifts, Starting with a measure invar- iant for such a Markov shift having the maxi- mal entropy ( Section H), Sinai constructed a Gibbs measure ( Section C), which turned out to be unique in this case and gave rise to a natural invariant measure J1 for the corre- sponding diffeomorphism. He proved further that the diffeomorphism <p considered as an 
136H Ergodic Theory automorphism of the measure space (M, &a, J1) is a K-automorphism. It was shown subse- quently by R. Azencott that <p is in fact Ber- noulli with respect to J1. Sinai investigated further the uniqueness of the invariant mea- sure attaining the maximal entropy for transi- tive Anosov diffeomorphisms, and he was able to show among other things that the set of transitive Coo -Anosov diffeomorphisms that do not have an invariant measure absolutely continuous with respect to the measure in- duced by the Riemannian metric on M con- tains an open dense subset. The methods em- ployed by Sinai in these investigations were extended further by D, Ruelle, Bowen, and others. Bowen, in particular, was able to con- struct Markov partitions for a wider class of diffeomorphisms, namely, those satisfying the so-called axiom A introduced earlier by S. Smale, and characterized the Gibbs measures for diffeomorphisms in this class by means of the variational principle ( Section H). For a more detailed account of the results of Sinai and Bowen  [13, 14J. (5) An important example of a system that is neither an Anosov nor an "axiom A" system because of nonsmoothness has been studied by Sinai: the simplest mechanical model due to Boltzmann and Gibbs of an ideal gas, which is described as a system generated by tiny rigid spherical pellets moving inside a rectangular box and colliding elastically. Sinai succeeded in proving that this is a K-system, thereby giving an affirmative answer to the classical question of the ergodicity of the basic model of statistical mechanics [38]. It was shown subsequently by M. Aizenman, S. Goldstein, and 1. Lebowitz that this system is Bernoulli. Sinai also investigated another nonsmooth system of classical importance: the system describing the motion of a billiard ball on a square table with a finite number of convex obstacles. He showed that such a system, is a K-system. Ornstein and G. Gallavotti then showed that Sinai's methods in fact show that the system is Bernoulli. Sinai's methods were extended further by L. Bunimovich and I. Kubo to study properties of billiard systems in more complicated domains. For more detailed account of these matters  [24]. The results obtained by Sinai and Bowen for Anosov and "axiom A" systems were extended to even wider class of systems by A. Stepin, R. Sacksteder, M. Brin, and Va. Pesin ([24J). H. Miscellany (1) An arbitrary aperiodic automorphism can be approximated by periodic automorphisms. More precisely, if <p is an aperiodic automor- 544 phism, then for any positive integer nand (j > 0, there exists a periodic automorphism Ij; of period n such that m( {x I <p(x) # Ij;(x)} ) < (I In) + (j (theorem of Halmos and Rokhlin). The question as to how quickly this approxi- mation can be carried out has been investi- gated in detail by Katok, Stepin, and others. The rate of this approximation was shown to have a close relationship with the entropy and spectral properties of the automorphism <po By utilizing this relationship, various examples of automorphisms with specified spectral prop- erties have been constructed [16]. (2) When an arbitrary automorphism <p is given on a Lebesgue measure space (X, &a, m), there exists a unique decomposition  = {AA} of X such that (i) each AA is invariant under <p and (ii) except for a negligible set (in a specified sense) of AA' each A;. is turned (in a natural manner) into a Lebesgue measure space, and the restriction of <p to AA is an ergodic automorphism. This decomposition is called the ergodic decomposition of X with respect to <po There is a corresponding decom- position with respect to a flow. A formula also exists that enables us to compute the entropy h(<p) in terms of the entropies of ergodic com- ponents of <po (3) Let X be a compact metric space and <p: X --> X a thomeomorphism. N. Krylov and N. Bogolyubov showed that there always exists on X a Borel probability measure J1 that is invariant under <po Let 9 be the collection of all Borel probability measures on X, and let 9" be the subset of 9 consisting of those invar- iant under <po Then q> and 9" are both convex sets compact with respect to the tweak * topol- ogy. If 8" is the set of all extreme points in 9", then by the tKrein-Milman theorem 8" is not empty. A measure J1 in 9" belongs to 8" if and only if <p is ergodic with respect to J1. When the set 8" consists of a single element, <p is called uniquely ergodic; <p is called minimal if for every point XEX, the orbit of x under <p= orb,,(x)= {<pn(x) I nE Z} is dense in X. <p is called strictly ergodic if it is both minimal and uniquely ergodic. A theorem of J. Oxtoby states that <p is strictly ergodic if and only if for every continuous real-valued function f on X, the sequence of averages (L:::f(<pk(x)))/n converges uniformly to a constant M (/). There are homeomorphisms that are minimal or uniquely ergodic but not strictly ergodic. (4) R. Jewett proved that any weakly mix- ing measure-preserving transformation on a Lebesgue space is spatially isomorphic to a strictly ergodic transformation. Krieger ex- tended the result by showing that if the en- tropy of an ergodic transformation <p is finite, then <p is spatially isomorphic to a strictly ergodic transformation. Similar results were 
545 obtained for flows by K. Jacobs, M. Denker, and E. Eberlein ( [39]). (5) A topological analog of the notion of entropy, caIled topological entropy, was in- troduced by Adler, A. Konheim, and 1. McAndrew. This is defined as foIlows: For every open covering ,r;f of a compact topolog- ical space X, let N(d) be the number of sets in the minimal subcovering of ,r;f. For open coverings ,r;f and &a, let d v &a be the open covering {A n BI A Ed, BE &a}. For any open covering d and a continuous mapping <p on X, the limit limnoo(logN(.r;f v <p -I d v ... V <p-(n-l),r;f))/n = h,op(<p, d) exists. Topo- logical entropy h,op(<p) of the continuous transformation <p is now defined by h,op(<p) = sup{ h top ( <p, d) I ,r;f an open covering of X}. L. Goodwyn showed that h,op(<p)?;h(<p) for any <p-invariant probability measure J1, where h(<p) is the measure-theoretic entropy of <p regarded as a J1-preserving transforma- tion. T. Goodman went further and succeeded in proving that h'op(<p)=sup{h(<p)IJ1 a <p- mvariant probability measure}. In connec- tion with these results there is interest in the question of the existence and uniq ueness of an invariant measure for <p with maximal entropy, i.e., a <p-invariant measure J1 for which h(<p) = h,op(<p). Such a measure does not always exist, and even if it does it may not be unique. The notions of topological en- tropy and of measures with maximal entropy are generalized by RueIle in the foIl owing way. For an open covering d, a continu- ous mapping <p, and a real-valued con- tinuous function g on X, let Zn(d, <p, g) be equal to inf{LoErsuPxEoexPLZ;;g(<pk(x))}, where the inf is taken over all subcover- ings r of the covering d v<p -I d v '" v <p -(n-l ),r;f. Then a finite limit P(d, <p, g) = limnoo l/nlogZn(,r;f, <p,g) exists, and the quantity P(<p,g)=sup{P(d,<p,g)ld an open covering of X} is calIed topological pressure. When 9 = 0, P(<p, g) reduces to h,op(<p). Ruelle proved for texpansive mappings <p that P(<p,g) =sup{h(<p)+ J gdJ1lJ1 is a <p-invariant proba- bility measure}. This assertion is calIed the variational principle for the topological pres- sure. The variation principle was proved for general continuous mappings <p by P. Walters. If a <p-invariant measure J1 satisfies P(<p, g) = h (<p) + J 9 dJ1, then J1 is calIed an equilibrium state for 9 with respect to <po It is known that for expansive mappings <p every continuous function 9 on X has an equilibrium state. (6) Application of ergodic theory to problems in analytic number theory has been made by several authors. Ergodic or mixing properties of particular measure-preserving transforma- tions that arise in connection with various problems in number theory have been ex- 136 Ref. Ergodic Theory ploited to give answers to these problems. New and more striking applications of ideas of ergodic theory to different types of questions in number theory have been started by Yu. Linnik, FUrstenberg, W. Veech, T. Kamae, and others ( [40, 41J). 7. Most of the results discussed in this arti- cle dealt with the action of a cyclic group of transformations or of one-parameter flow. There are significant extensions of many of these results to different types of group ac- tions. For recent developments  [22]. References [I J E. Hopf, Ergodentheorie, Springer, 1937. [2] P. R. Halmos, Lectures on ergodic theory, Publ. Math. Soc. Japan, 1956. [3] P. Billingsley, Ergodic theory and infor- mation, Wiley, 1965. [4] K. Jacobs, Neuere Methoden und Ergeb- nisse der Ergodentheorie, Springer, 1960. [5] N. Dunford and 1. T. Schwartz, Linear operators I, I nterscience, 1958. [6] K. Y osida, Functional analysis, Springer and Kinokuniya, second edition, 1968. [7] U. Krengel, Recent progress on ergodic theorems, Soc. Math. France, Asterisque, 50 (1977),151-192. [8] 1. F. C. Kingman, Subadditive ergodic theory, Ann. Probability, I (1973), 883-909. [9] D. RuelIe, Ergodic theory of differentiable dynamical systems, Publ. Math. Inst. HES, 50 (1979), 27 - 58. [10] N. A. Friedman, Introduction to ergodic theory, Van Nostrand, 1969, [11] H. FUrstenberg, Ergodic behavior of diagonal measures and a theorem of Szeme- n':di on arithmetic progressions, 1. Analyse Math., 31 (1977), 204-256. [12J S. R. Foguel, Ergodic theory of Markov processes, Van Nostrand, 1969. [13] Va. G. Sinai, Gibbs measures in ergodic theory, Russian Math. Surveys, 27 (1972),21- 69. (Original in Russian, 1972.) [14J R. Bowen, Equilibrium states and the ergodic theory of Anosov diffeomorphisms, Lecture notes in math. 470, Springer, 1975. [15J G. Maruyama, The harmonic analysis of stationary stochastic processes, Mem. Fac. Sci. Kyushu Univ., (A) 4 (1949),45-106. [16] A. B. Katok and A. M. Stepin, Approxi- mations in ergodic theory, Russian Math. Sur- veys, 22 (5) (1967), 77-102. (Original in Rus- sian, 1967.) [17J W. Krieger, On entropy and generators of measure-preserving transformations, Trans, Amer. Math. Soc., 149 (1970), 453-464. [18] V. A. Rokhlin (Rohlin), New progress in 
137 Erlangen Program the theory of transformations with invariant measure, Russian Math. Surveys, 15 (4) (1960), 1-22. (Original in Russian, 1960.) [19J V. A. Rokhlin, Lectures on the entropy theory of measure-preserving transformations, Russian Math. Surveys, 22 (5) (1967), 1-52. (Original in Russian, 1967.) [20J W. Parry, Entropy and generators in ergodic theory, Benjamin, 1969. [21] D. S. Ornstein, Ergodic theory, random- ness, and dynamical systems, Yale Univ. Press, 1974. [22J D. S. Ornstein, A survey of some recent results on ergodic theory, Math. Assoc. Amer. Studies in Math., 18 (1978), 229-262, [23J P. Shields, The theory of Bernoulli shifts, Chicago Lectures in Math., Univ. of Chicago Press, 1973. [24] A. B. Katok, Va. G. Sinai, and A. M. Stepin, Theory of dynamical systems and general transformation groups with invariant measure, J. Soviet Math., 7 (J 977),974-1065. (Original in Russian, 1975.) [25J 1. Moser, E. Phillips, and S. Varadhan, Ergodic theory, a seminar, NYU Lecture Notes, 1975. [26J A. M. Vershik and S. A. Yuzvinskii, Dy- namical systems with invariant measure, Pro- gress in Math. VIII, Plenum, 1970, 151-215. (Original in Russian, 1967.) [27] H. A. Dye, On groups of measure- preserving transformations I, II, Amer. 1. Math., 81 (1959), 119-159; 85 (1963),551-576, [28J L. Sucheston (ed.), Contributions to ergodic theory and probability, Lecture notes in math. 160, Springer, 1970. [29J W. Krieger, On ergodic flows and the isomorphism of factors, Math. Ann., 223 (J 976), 19-70. [30J T. Hamachi, Y. Oka, and M. Osikawa, Flows associated with ergodic non-singular transformation groups, Publ. Res. Inst. Math. Sci., 11 (1975), 31-50. [31] T. Hamachi, The norI:lalizer group of an ergodic automorphism of type III and the commutant of an ergodic flow, 1. Functional Anal., 40 (1981),387-403. [32] S. Kakutani, Induced measure preserving transformations, Proc. Imp. Acad. Tokyo, 19 (1943),635-641. [33] A. B. Katok, Monotone equivalence in ergodic theory, Math. USSR-Izv., 11 (1977), 99-146. (Original in Russian, 1977.) [34] E. A. Satayev, An invariant of monotone equivalence determining the quotients of auto- morphisms monotonely equivalent to a Ber- noulli shift, Math. USSR-Izv., 11 (1977), 147- 169. (Original in Russian, 1977.) [35J L. Auslander, L. Green, F. Hahn, et a!., Flows on homogeneous spaces, Ann. Math. Studies, Princeton Univ. Press, 1963. 546 [36] V. I. Arnold and A. Avez, Ergodic prob- lems of classical mechanics, Benjamin, 1968. [37J Va. G. Sinai, Introduction to ergodic theory, Math. Notes, Princeton Univ. Press, 1976. [38J Va. G. Sinai, On the foundations of the ergodic hypothesis for a dynamical system of statistical mechanics, Sov. Math. Dokl., 4 (1963),1818-1822. (Original in Russian, 1963.) [39] M. Denker, C. Grillenberger, K. Sig- mund, Ergodic theory on compact spaces, Lecture notes in math. 527, Springer, 1976. [40J Yu. V. Linnik, Ergodic properties of alge- braic fields, Springer, 1968. [41J W, A, Veech, Topological dynamics, Bull. Amer. Math. Soc., 83 (1977), 775-830. 137 (VI.18) Erlangen Program When F. Klein succeeded K. G, C. von Staudt as professor at the Philosophical Faculty of Erlangen University in 1872, he gave an in- auguration lecture entitled "Comparative Con- sideration of Recent Geometric Researches," which later appeared as an article [I]. In it he developed a penetrating idea, now called the Erlangen program, in which he utilized group-theoretic concepts to unify various kinds of geometries that until that time had been considered separately. The concept of transformation is not new; it was, however, not until the 18th century that the concept of transformation groups was recognized as useful. The theory of tinvariants of linear groups and the tGalois theory of algebraic equations attracted attention in the 19th century. In the same century, tprojective geometry made remarkable progress, for example, when A. Cayley and E. Laguerre discovered that metrical properties of Eu- clidean and tnon-Euclidean geometries can be interpreted in the language of projective geom- etry. Cayley proclaimed, "All geometry is projective geometry." After learning geometry under 1. Plucker, Klein made the acquaintance of S. Lie. Both men understood the impor- tance of the group concept in mathematics. Lie studied the theory of tcontinuous transforma- tion groups, and Klein studied discontinuous transformation groups from a geometric stand- point. Klein was thus led to the idea of the Erlangen program, which provided a bird's- eye view of geometry. Klein's idea can be summarized as follows: A space S is a given set with some geometric structure. Let a transformation group G of S be given. A subset of S, called a figure, may 
547 have various kinds of properties. The study of the properties that are left invariant under all transformations belonging to G is called the geometry of the space S subordinate to the group G. Let this geometry be denoted by (S, G). Two figures of S are said to be congruent in (S, G) if one of them is mapped to the other by a transformation of G. The geometry (S, G) is actually the theory of invariants of Sunder G, with the term invariants to be understood in a wider sense; it means both invariant quan- tities and invariant properties or relations. Replacing G in (S, G) by a subgroup G' of G, we obtain another geometry (S, G'). A series of subgroups of G gives rise to a series of geom- etries. For instance, let A be a figure of S. The elements of G leaving A invariant form a sub- group G(A) of G that operates on S' = S - A. We thus obtain a geometry (S', G(A)) in which A is called an absolute figure. In this way, many geometries are obtained from projective geometry. Klem gave numerous examples. It is noteworthy that he mentioned even the groups of trational and thomeomorphic transformations. Klein's idea not only synthesized the geom- etries known at that time, but also became a guiding principle for the development of new geometries. In 1854 G. F. B. Riemann published his epochmaking idea of Riemannian geometry. This geometry has a metric, but in general lacks congruence transformations (isometries). Thus Riemannian geometry is a geometry that is not included in the framework of the Erlangen program. The importance of Rie- mannian geometry was acknowledged when it was used by A. Einstein in 1916 as a founda- tion of his general theory of relativity. H. Weyl, O. Veblen, and J. A. Schouten dis- covered geometries that are generalizations of affine, projective, and tco,nformal geometries in the same way as Riemannian geometry is a generalization of Euclidean geometry. It became necessary to establish a theory that reconciled the ideas of Klein and Riemann; E. Cartan succeeded in this by introducing the notion of tconnection ( 80 Connections). However, the ErIangen program, which gave an insight into the essential character of clas- sical geometries, still maintains its role as one of the guiding principles of geometry, References [1] F. Klein, Vergleichende Betrachtungen iiber neuere geometrische Forschungen, Math. Ann., 43 (1893), 63-100 (Gesammelte mathe- matische Abhandlungen, Springer, 1921, vol. 1,460-497). 138B Error Analysis [2J G. Fano, Kontinuierliche geometrische Gruppen, Enzykl. Math., Leipzig, 1907-1910, Geometrie III, pt. 1, AB4b, 289-388. [3J F. Klein, V orlesungen iiber nicht- Euklidische Geometrie, Springer, 1928. [4] F. Klein, Vorlesungen iiber hohere Geom- etrie, Springer, 1926. [5] E. Cartan, Les recentes generalisations de la notion d'espace, Bull. Sci. Math., 48 (1924), 294-320. [6] E. Cartan, La theorie des groupes et les recherches recentes de geometrie differentielle, Enseignement Math., 24 (1925),1-18; Proc. Internat. Congr. Math., Toronto, 1 (1928), 85- 94. 138 (XV.3) Error Analysis A. General Remarks The data obtained by observations or mea- surements in astronomy, geodesy, and other sciences do not usually give exact values of the quantities in question. The error is the dif- ference between the approximation and the exact value. The theory of errors originated from systematic work with data accompanied by errors, and the statistical treatment of ex- perimental data was the main concern in the beginning stages ( 397 Statistical Data Anal- ysis). However, due to the recent development of high-speed computers it has become pos- sible to carry out computations on a tremen- dously large scale, and the detailed analysis of errors has become an absolute necessity in modern numerical computation. Hence the analysis of errors in relation to numerical computation has become the center of re- search in error theory. B. Errors One rarely makes a mistake in counting a small number of things; therefore the exact value of the count can be determined. On the other hand, the exact value in decimals is never obtainable for a continuous quantity, say length, no matter how fine measurements are made, and a large or small stochastic error is thus inevitable in measuring a continuous quantity. A discrete finite quantity is a digital quantity, and a continuous quantity is an analog quantity. The natures of these two quantities are quite different. The values of a digital quantity are distributed on some discrete set, while the values of an analog quantity are distributed with a continuous 
138C Error Analysis probability. Thus there is the possibility of error even for treatment of digital quantities, although checking the results for these quan- tities is easy. It is preferable to regard digital quantities as being analog quantities if the possible values are densely distributed. On the other hand, when an appropriate tanalog computer is not available, analog quantities receive treatment similar to digital quantities. They are expressed as x times some unit, and x is expanded in the decimal or binary systems. An approximation to such an expansion is obtained by rounding off a numeral at some place, the position depending on the capacity for computation by available methods. There are two ways of rounding off numbers, the fixed point method and the float- ing point method. The former specifies the place of digits where the rounding off is made, and the latter essentially specifies the number of significant digits. Classification of errors, (1) Errors of input data are the errors included in input data themselves. Such input-data errors include the errors that occur when we represent con- stants such as 1/3, )2, n by finite decimals. (2) Truncation errors occur in approximate expressions for the computation formulas under consideration. (3) Roundoff errors OCCur in taking some finite number of digits from the earlier digits in the numerical value at each step. If the computation of an infinite number of digits were actually possible, no crrors of this type would appear. Recently, it has been considered more preferable to call this "com- putational error." The difference between fixed point and floating point rounding off is that the former is better suited for operations of addition and subtraction and the latter is better for multi- plication and division. In fixed point rounding off, if a number is multiplied many times by numbers less than 1, a so-called underflow may occur, and many digits may disappear; a great deal of information can thus be lost. In com- putation for scientific research that involves frequent multiplication and division, floating point rounding off is preferable. It should be noted that rounding off for addition and sub- traction may also cause a critical loss of in- formation. This phenomenon is calIed cancel- ing digits. For instance, in the subtraction 7.6325071 -7.6318425 =0.0006646, where the subtrahend and minuend share several early significant digits, the difference loses those digits. Thus, relative errors may be magni- fied tremendously. By taking a large number of significant digits, such a situation may be avoided to some extent. So-called high- precision computation shows its effectiveness in such cases. Similarly, when a small number b is 548 added to a large number a, the result may be just a and the information of b could be lost completely. This kind of loss of information can often cause serious trouble. C. Methods of Error Analysis In order to analyze the propagation of errors, let us assume that all numbers are carried to infinitely many digits so that no roundoff error occurs. Suppose that we are to evaluate the function y == f(x" ..., x n ) when XI' X2, ..., X n are assigned. Let 11 be the truncation error of an approximate expression, If an input error r'5 i for Xi exists, then the corresponding error for y is n of -'- ,\,_,Ii '/ I iaXi VI. Moreover, suppose that at the final step we round off to get a result with a finite number of digits, by which an error 8 is introduced. Then the final error r'5 for y is n of r'5==8+11+.L ;::-r'5 i . r=l vX i This procedure is performed for each step needed in the computation. If y = f(x I' .., ,x n ) is a specified step, then the input error r'5 i for that step involves all the errors arising before that step, i.e., b i is an accumulated error. For- ward analysis is a method to estimate the total accumulated error from the initial input data. It is usua\1y quite difficult to obtain precise estimates by means of this method. In con- trast,1. H. Wilkinson proposed the following backward analysis. Here, the computational value y is considered as the exact result for the modified initial data x I, '" , X n , say y = f(x i , ..., x n ), and the estimates for Ix i - xii are given. For example, in the binary floating point arithmetic of u bits, we always have the relation: computational value of a::!: b =a(1 + r'5)::!: b(l +t), with 1r'51, 181.s; 2 -u, even when cancellation or loss of information occurs. Wilkinson has made a deep investigation of error analysis for linear computation, algebraic equations, and eigenvalue problems by means of backward analysis [4,5]. D. Proliferation of Errors The phenomena usually caIled "accumulation of errors" should more appropriately be caIled the proliferation of errors, where the algorithm itself includes a particular mechanism to in- crease a small error indefinitely. An example is the recursion formula for tBessel functions 
549 stated as I n + 1 (x) = (2n/x)J n (x) -I n - 1 (x). It is customary to compute In(x) by this for- mula, starting with the values of Jo(x) and J 1 (x), with x given. By putting I n - 1 (x)= Yn' In(x) = Zn, the recursion formula can be re- garded as a linear transformation of the point Pn(Yn, zn) in a plane into another point P n + l (Yn+l' zn+l)' where Yn+l =Zn, zn+l = - Yn+(2n/x)zn' The teigenvalues AI, }"2 of this tdifference equa- tion satisfy the following: As long as n < I xl, we have 1..1. 1 1=1..1. 2 1=1, while ifn>lxl, Al is greater than 1 and increases rapidly as n tends to infinity. Consequently, even the slightest dis- crepancy in the position of Pn gives rise to a greatly magnified error in the result [3]. Many studies have been made of the propa- gation of errors and of instability phenomena in the numerical solution of ordinary differen- tial equations ( 303 Numerical Solution of Ordinary Differential Equations; [2J). References [I] J. von Neumann and H. H. Goldstine, Numerical inverting of matrices of higher order, Bull. Amer. Math. Soc., 53 (1947),1021- 1099. [2J P. Henrici, Discrete variable methods in ordinary differential equations, Wiley, 1962. [3J T. Uno, The problem of error propagation (in Japanese), Sugaku, 15 (1963), 30-40. [4J 1. H. Wilkinson, Rounding errors in alge- braic processes, Prentice-Hall, 1963. [5] 1. H. Wilkinson, Algebraic eigenvalue problem, Clarendon Press, 1965. [6J L. B. Rail (ed.), Error in digital computa- tion I, II, Wiley, 1965, 1966. 139 (VI.3) Euclidean Geometry A. History Attempts to construct axiomatically the geom- etry of ordinary 3-dimensional space were undertaken by the ancient Greeks, culminating in Euclid's Elements ( 187 Greek Mathemat- ics), The fifth postulate of Euclid's Elements requires that two straight lines in a plane that meet a third line, as shown in Fig. I, in angles ex, fJ whose sum is less than 180°, have a com- mon point. In the Elements, two straight lines in a plane without a common point are said to be parallel. I t can be proved from other 139B Euclidean Geometry axioms in the Elements that if ex + fJ = 180°, the two lines I and I' in Fig. 2 are parallel. Hence given a line I and a point P not lying on I, " Fig. 1 Fig. 2 there exists a line /' passing through P that is parallel to I. The fifth postulate ensures the uniqueness of the parallel/' passing through the given point P. For this reason, the fifth postulate is also called the axiom of parallels. Utilizing this axiom, we can prove the well- known theorems on parallel lines, the sum of interior angles of triangles, etc. The axiom plays an important role in the proof of the Pythagorean theorem in the Elements. The axiom is also called Euclid's axiom. However, Euclid states this axiom in a quite complicated form, and unlike his other axioms, it cannot be verified within a bounded region of the space. Many mathematicians tried in vain to de- duce it from other axioms. Finally the axiom was shown to be independent of other axioms in the Elements by the invention of non- Euclidean geometry in the 19th century ( 285 Non-Euclidean Geometry). The term Euclidean geometry is used in contrast to non-Euclidean geometry to refer to the geometry based on Euclid's axiom of par- allels as well as on other axioms explicit or implicit in Euclid's Elements. It was in the 19th century that a complete system of Euclidean geometry was explicitly formulated ( 155 Foundations of Geometry). From the stand- point of present-day mathematics, it would be natural to define first the group of motions by the axiom offree mobility due to H. Helm- holtz ( Section B) and then, following F. Klein, to define Euclidean geometry as the study of properties of spaces that are invariant under the groups of these motions ( 137 Erlangen Program). B. Group of Motions Let P be an tordered field and An the n- dimensional taffine space over P. Let W be an r-dimensional affine subspace of An, W- 1 an (r-I)-dimensional subs pace of B r , W- 2 an (r-2)-dimensional subspace of W-t, etc. In the sequence of subspaces B r , W- 1 , .", B O , each B k - Bk1 consists of two thalf-spaces (k = r, r - I, '.. , 1). Let H k be one of these half- 
139C Euclidean Geometry spaces. Then the sequence of half-spaces H r , Hr-" ...,H I is called an r-dimensional flag, denoted by f>r (n  r  1), and Br and H' are called the principal space and the principal half-space of $.)r, respectively. If f is a tproper affine transformation of An, f(H'), f(Hr-l), .., ,j(H I ) form an r-dimensional flag ft r . We write f($.)r) = ft r . Let n be the group of all proper affine transformations of An. The subgroup !B n of n with the following two properties is called the group of motions, and any element of!B n is called a motion (or congruent transformation). (1) Let r be an integer between 1 and n, and let f>r, ftr be any two r-dimensional flags. Then there exists an element f of !B n that carries f>r to ftr :f(f>r) = ft'. (2) Let f, 9 be two elements of !B n with f(f>r) = ftr, g(f>r) = ftr, and let p be any point on the principal space of f>'. Then f(p) = g(p), that is, f, 9 have the same "effect" on the principal space. In particular, when r = n, then f = g. That n possesses a subgroup !B n with properties (1) and (2) is called the axiom of free mobility. When n = 1, it is easy to see that the ele- ments of!B 1 are only those elements f of I that can be expressed in the form f(x) = ::!:: x + a (a E P). When n  2, P must satisfy the following condition in order that a subgroup !B n with properties (1) and (2) exists in n: If a, bE P, then P contains an element x such that x 2 = a 2 + b 2 . When this condition is satisfied, the ordered field P is called a Pythagorean field. Every treal closed field (e.g., the field R of real numbers) is Pythagorean. If!B n exists, its uniqueness is assured by (1) and (2). Further- more, if P contains a square root of every positive element (this condition is satisfied, for example, by R), then conditions (1) and (2) are reducible to the case r == n only, i.e., conditions (1) and (2) for other values of r follow from (1) and (2) with r = n. Hereafter, we assume the existence of !B n . Suppose that we have An =:> B' =:> B k (n  r  k  0), and let $.)r be a flag with the prin- cipal space Br: f>r =(H', ..., H k , "" HI). Let ftr be another flag with the same principal space B':ftr=(K r ,... ,K k , ...,KI), where we suppose that Hj = Kj for k  j  1, whereas for r  i  k + 1, we suppose that Hi and K i are different half-spaces on B i divided by B H . The flag ftr is denoted by f>. An element f of!B n with f(f>r) = f> is called a symmetry (or reflec- tion) of B r with respect to B k . It leaves every point on B k invariant, and its effect on B' is determined only by B k independently of the choice of half-spaces in f>r and ftr (subject to the conditions mentioned above). In partic- ular, the symmetry of An with respect to a point A O = pis caJled a central symmetry with respect to the center p; and the symmetry of An 550 with respect to a hyperplane An-I = h is called a hyperplanar symmetry. They are uniquely determined by p and h, respectively, and are denoted by Sp and S(h), respectively. If H(An) is the set of all hyperplanes of An, then !B n is generated by {S(h)lhEH(A n )}. Furthermore, if p, q are two points of An, the composite SpSq is a parallel translation by 2. pq (Fig. 3). The parallel translations generate a normal sub- group'In of!B n , For p, qEA n , the element of 'In that carries p to q is denoted by rpq. The Ii' x x"=s,(x') Fig. 3 subgroup of!B n that leaves a point p of An invariant is denoted by 0;. Obviously, we have 0; = rpq O;r;ql . Thus all the 0; (for pE An) are isomorphic. We call 0; the orthogonal group around p and any element of 0; an orthogonal transformation around p. More generally, any element of!B n that leaves a subspace A k of An invariant is called an orthogonal transforma- tion around the subspace A k . An element of !B n that preserves the orienta- tion of An, i.e., is represented by a tproper affinity with a positive determinant, is called a proper motion. Proper motions form a sub- group !B o of !B n . Rotations are, by definition, orthogonal transformations belonging to !B o . Sometimes !B o is called the group of motions; then !B n is called the group of motions in the wider sense. In this article, however, we shall continue to use the terminology introduced above. The study of the properties of An invariant under !B n is n-dimensional Euclidean geometry. Since n =:> !B n , every proposition in affine geometry ( 7 Affine Geometry) can be con- sidered a proposition in Euclidean geometry, but there are many propositions that are proper to Euclidean geometry. Sometimes the subgroup of n generated by !B n and the homotheties of An, i.e., elements of n repre- sented by tscalar matrices, is called the group of motions in the wider sense, and the study of properties of An invariant under this group is called n-dimensional Euclidean geometry in the wider sense. C. Length of Segments Two figures F, F' in An are said to be congru- ent if there exists an f E!B n such that f(F) = F'. 
551 Then we write F = F'. The congruence relation is an tequivalence relation. Let s = pq , s' = p' q' be two tsegments in An. We say that s, s' have equal length when s == s'. Length is an attri- bute of the equivalence class of segments. The length of s is denoted by Isi. All segments of the form pp are congruent, and we define I pp l = o. If we are given a length and a thalf-Iine starting from a point p, we can find a unique point q on it such that I pq l = the given length (Fig. 4). Let r be a point on the extension of pq. The length I pr l is then uniquely deter- mined by Ipql and Iqrl. It is defined as the P,  q, P (a) . P . q (b) Fig. 4 sum of the lengths: I pr l=l pq l+l qr l. With respect to the addition thus defined, lengths of segments in An form a tcommutative semi- group with the cancellation law, which can be extended to an t Abelian group M with 0 as the identity element ( 190 Groups P). Let Isl O and Is'l be any length. On a half- line starting from p, we can find points q, r with I pq l=lsl, I pr l=ls'l. Then the element pr/pq=AEP ( 7 Affine Geometry) is a posi- tive element of P uniquely determined by Isl and Is'l. We caliA the measure of Is' I with the unit Isl and denote it by Is'l: Isi. If Pis tArchi- medean, ..1. can be represented by a real num- ber ( 149 Fields N). We have (ls'l + Is"I): Isl = (ls'l: Isl) +(ls"l: Isl), (ls"l: Is'I)(ls'l: Is"l) = Is"l: Isl (if Is'l #0). Thus the mapping Is'l--> Is'l: Isl sends the additive semigroup of lengths to that of the positive elements of P. This is actually an isomorphism, which can be extended to an isomorphism of M onto the additive group of the field P. Let <p(X, X', ..., X') be a thomogeneous rational function of a finite number of vari- ables X, x', ..., X'. If a relation <p(A,..1.', ..', A') =0 holds for A=\s\: Iso\,..1.' = \s'\: \so\,..., ..1.' = Is'l: ISol, where ISol is a length # 0, then <p(AI,):I, ...,A)=O holds also for Al =lsl:lsll, ..1.'1 = Is'l: Is 1 I, ..., A = Is'l: Is I I, where ISII is any other length #0. Hence, in this case, the ex- pression <p(lsl, Is'I,..., Is'I)=O is meaningful. D. Angles and Their Measure An angle LAOB is a figure constituted by two half-lines OA, OB starting from the same point o but belonging to different straight lines. The point 0 is called the vertex, and the two half- 139D Euclidean Geometry lines OA, OB are called the sides of LAOB (Fig. 5). Two congruent angles are said to have the same measure, denoted by 1 LAOBI or sometimes simply by ex, Let f>2 =(H2, HI) (HI = half-line QR) be a given 2-dimensional flag and the given measure of an angle. Then we can find a unique half-line QP in the given half-plane H 2 such that I L PQR I = ex (Fig. 6). The angle LPQR is said to "belong" to f>2. Let K 2 be the half-plane separated by the line P U Q containing the half-line QR. Then H 2 n K 2 is called the interior of the angle L o B H2 p. p' R Fig. 5 Fig. 6 l1.'+l1."=a. L PQR. Let L P'QR be another angle belong- ing to f>2. If the interior of the latter angle is a subset of the interior of L PQR and QR  QP', then I LPQRI is said to be greater than I LP'QRI, and we write I LPQRI>I LP'QRI. Actually, it can be shown that> is a relation between the measures of LPQR and LP'QR, and that the set of measures of angles forms a tlinearly ordered set with respect to the rela- tion  defined in the obvious way. When I LPQRI> I LP'QRI, we write I LPQRI= I L PQP'I + I L P'QRI. Actually, these are rela- tions between measures of angles. Further- more, if the measure ex of an angle is given, the set of measures of angles .s; ex forms a linearly ordered set order-isomorphic to a segment and satisfying: (i) If fJ < y then there exists a 11 such that fJ + 11 = y; (ii) fJ + 11 = 11 + fJ; (fJI + fJ2) + fJ3 = fJI + (fJ2 + fJ3) if all these sums exist; and (iii) fJI + b = fJ2 + 11 implies fJI = fJ2' When P is Archimedean, these properties imply that the meaSUre of angles .s; I L PQR I can be repre- sented by positive real numbers .s; k (k is any given positive number) such that the relations of ordering an addition are preserved. This one-to-one correspondence between the measures of angles and a subset S of the interval (0, kJ of real numbers can be extended to a correspondence between the measures of general angles and the subset of R obtained from S. When P=R, then we have S=(O,kJ, and any real number appears as a measure of a general angle. We can choose LPQR and the positive number k arbitrarily, but it is customary to choose them as follows. Suppose we are given an angle LAOB. Let the exten- sions of the half-lines OA and OB in the oppo- site directions be OA' and OB', respectively. The angles LAOB and LA'OB are called supplementary angles of each other, and so are 
139E Euclidean Geometry LAOB and LAOB'. The angles LAOB and LA'OB' are called vertical angles to each other (Fig. 7). Any angle is congruent to its vertical angle, and an angle that is congruent to its supplementary angle has a fixed measure. + rs Fig. 7 Such an angle (or its measure) is caned a right angle. In the description of the measurement of angles, we usually consider the case where the special angle L PQR is a right angle, and we set k = n/2, (The existence and uniqueness of the right angle can be proved.) An angle that is greater (smaller) than a right angle is cal1ed an obtuse (acute) angle. A general angle whose measure is twice (four times) a right angle is cal1ed a straight angle (perigon). Sometimes we choose as the "unit angle" 1/90 of a right angle, which is cal1ed a degree (hence a right angle = 90 degrees, denoted by 90°); 1/60 of a degree is called a minute (1 0 = 60 minutes, denoted by 60'), and 1/60 of a minute is called a second (1' = 60 seconds, denoted by 60"). If, as usual, we put the right angle equal to n/2, then the unit angle is (2/n)(right angle). This is called a radian, and 1 radian = 180 0 /n = 57° 17'44.806..." =; 57.3°. If a straight line m intersects two straight lines I, 1', eight angles ex, [3, y, 8, ex', [3', y', 8' appear, as in (Fig. 8). In this figure, ex and ex', [3 and [3', y and y', and 8 and (j' are called corre- sponding angles, while ex and y', [3 and (j', y and ex', and (j and [3' are called alternate angles to each other. When 1 and I' are parallel, each of these angles is congruent to its corresponding or alternate angle. The Pythagorean theorem asserts that if a triangle l:'>.ABC is given for which LABC is a right angle (Fig. 9), then IABI2 + IBCl 2 = ICAI 2 (which makes sense since X 2 + y 2 - Z2 is a homogeneous polynomial). , L::r Fig. 8 Fig. 9 E. Rectangular Coordinates When two straight lines I, m intersect, two pairs of vertical angles appear. If one of these angles is a right angle, then al1 are. Then we 552 say that 1 and m are orthogonal (or perpendic- ular) to each other, and write Lim. Let 1 be a line and A r an r-dimensiona1 subspace of An (l.s;r.s;n-l) intersecting 1 at a point O=Arnl. If 1 is orthogonal to all lines on A r passing through 0, then 1 is said to be orthogonal to Ar, and we write LiAr (Fig. 10). If An-! is any hyperplane in An, then there exists a unique line 1 through a given point P of An that is orthogonal to An-I; this 1 is called the per- pendicular to An-I through P, and the \ Fig, 10 intersection InA n - 1 is caned the foot of the perpendicular through P. When An-I is given, the mapping from An to A,,-I assigning to every point P of An the foot of the perpendic- ular through P is cal1ed the orthogonal projec- tion from A" to A"-I. Let 5 n =(H n ,H n - l , ...,H I ) be an 11- dimensional flag of An and 0 the initial point of the half-line HI. Then we can find a point E i in Hi (i= 1,2, ...,n) such that OUEi..LOUE j (i # j, i,j = 1, 2, ..., n). Moreover, if lei is any unit of length, then E i can be chosen uniquely so that 10E;\ = lei (i= 1, 2,.", n). Then 0, EI' ..., E" are tindependent points in An, and wehaveAn=OUEIU...UE n . Thus we have a tframe .E = (0; E 1 , ... , En) of An with 0 as origin and the E i as unit points. Such a frame is caned an orthogonal frame. A coordinate system with this frame, called an orthogonal coordinate system adapted to 5 n , is uniquely determined by 5 n . A motion is characterized as an taffinity sending one orthogonal frame onto another or onto itself. Utilizing an orthogonal coordinate system, the lengths of segments and the measures of angles can be expressed simply. Let (XI' ..., x n ) be the coordinates of X with respect to such a coordinate system. Then the length of the segment lOX I (with lei as unit) is equal to (L?=I xfjI/2, and when y is another point, with coordinates (YI"'" Yn), 0 # X, 0 # Y, then we have Li=1 XiYi C OSILXOYI- ( .... X2 ) 1/2 ( ...._ Y . ) 1/2' "'1=1 I ""'"1-1 1 In particular, we have 0 U X..LO U Y if and only if L?=I XiYi = O. We may write x = OX for the tJocation vector of X. Then the taffinity Ax + b is a motion if and only if A is an torthogonal ma- trix. Thus the tinner product (x, y) is invariant 
553 under motions; it therefore has meaning in Euclidean geometry. If we put Ixl = (x, X)I/2, then the right-hand sides of the formulas for 10XI and cosl LXOYI can be written as Ixl and (x, y)/(lxl'lyl), More generally, we have IXYI = Iy -xl. This is the Euclidean distance (or simply distance) between X and Y. Then An becomes a tmetric space with this distance, i.e., a Euclidean space. Historically, the notion of metric spaces was introduced in generalizing Euclidean spaces ( 273 Metric Spaces). F. Area and Volume The subset In of An consisting of points (XI' ..., x n ) with respect to an orthogonal co- ordinate system, with O.s; xi.s; 1, i = 1, ... ,n, is called an n-dimensional unit cube. A function m that assigns to tpolyhedra in the wider sense P, Q,... in An nonnegative real numbers m(P), m(Q), ,.. is called an n-dimensional volume if it satisfies the following four conditions: (1) m(0) =0. (2) m(PUQ)+m(pnQ)=m(P)+m(Q). (3) If P is sent to Q by a translation, then m(P) = m(Q). (4) m(In)= 1. It has been proved that such a function is unique and has the property that P=Q implies m(P)=m(Q). Thus the con- cept of volume can be defined in the frame- work of Euclidean geometry. More generally, if the affinity f(x) = Ax + b sends Ponto Q, then m(Q)=cm(P), where c is the absolute value of the determinant IAI. If P is covered by a finite number of hyperplanes, then m(P) = 0, and if P is a tparallelotope with n independent edges a" ..., an, then m(P) = absla" ..., ani, where I a I, ... , an I is the determinant of the n x n matrix with a i as the ith column vector, and absx is the absolute value of the real number x. If P is an tn-simplex whose vertices have location vectors x o , XI' '" ,x n , then we have 1 1 1 1 m(P)=-abs n! Xo Xl The volume of any polyhedron can be ob- tained by dividing it into n-simplexes and summing their volumes. If P is an r- dimensional polyhedron in An, then the r- dimensional volume of P is obtained by divid- ing Pinto r-simplexes and summing their r-dimensional volumes (in the respective r- dimensional Euclidean spaces containing them). In particular, when r == 1, we speak of length (e.g., the length of a broken line), and when r = 2, of area. If V is the r-dimensional volume of an r-dimensional parallelotope with r edges ai' ..., a" we have the formula J (al,ad (al,a r ) V 2 = (a" ad (a" a r ) 139G Euclidean Geometry The notion of measure of point sets other than polyhedra is a generalization of the notion of vol ume of poly hedra ( 270 Meas ure Theory). G. Orthonormalization Let 0, A" ..., An be n + 1 ependent points in An. Then the n vectors OA i = ai' i == 1, ..., n, are independent. The points 0, A" "', An determine an n-dimensional flag 5 n of An as follows. Let HI be the half-line OA I , H 2 be the half-plane on the plane 0 U Al U A 2 separated by the line 0 U Al in which Az lies, "', Hn be the half-space on An separated by the hyper- plane 0 U Al U ... U An-I in which An lies. Let b l , ... , b n be the unit vectors of the rectangular coordinate system adapted to 5 n . Suppose further that we are given a rectangular coordi- nate system. Then b l , ... ,b n can be obtained from ai' ..., an by the following procedure, called orthonormalization (E. Schmidt): First put b l =al/lad, so that Ibll= 1. Then C2= a 2 -(a 2 , bl)b l satisfies (b" cz) = 0, C 2 # O. Put b 2 =c 2 /lc 2 1. Then we have (b l , bz)=O, I bzl = 1. When b l , ... , bi-I are obtained in this way, so that (b j , b k ) = r5 jk for 1 .s;j, k.s; i-I, then ci=a i -(ai' bdbl -... -(ai' bi-dbi-I satisfies (bj,Ci)=O, ci#O. Hence bi=c;/Icil added to b l , ... , bi-I retains the property (b . b ) =r5. for I .s; J ' k.s;i and this procedure l' k Jk " can be continued to i = n. Two vectors u, v are called orthogonal (de- noted u..lv) if (u, v)= 0, and u is called normal- ized when lul = 1. Thus any two of the vectors b l , "', b n are orthogonal, and each of them is normalized. Between given vectors ai, ... , an and b l ,..., b n we have the relation {a l ,..., a;} ( = the linear space generated by a I' .. . , a;) = {b l b} i=I,...,n. L e ''WI"\JJl be two subspaces of the linear I' Z space \JJl of the vectors of the Euclidean space An, If any element of \JJl I is orthogonal to any element of \JJl 2 , then \JJl I and \JJlz are called orthogonal and written \JJl I ..l\JJl 2 . For any proper subspace \JJl I of \JJl, it can be shown by the method of orthonormalization that there exists a unique proper subspace \JJlz of \JJl such that \JJl=\JJl I U\JJl z , \JJlI..l\JJl Z ' Such a subspace \JJl 2 is called the orthocomplement of\JJl 1 (with respect to \JJl). Then \JJl I n \JJlz = {O} follows, and hence \JJl=\JJl I +\JJl 2 . Every element a of \JJl is therefore written uniquely in the form a l +a 2 , a l E\JJl" azE\JJlz; we call a l the \JJlI- component of a and a z the orthogonal compo- nent of a with respect to \ill I' The mapping from \JJl to \JJl I assigning a I to a is called the orthogonal projection from \JJl to \JJl I ; it is a linear and tidempotent mapping. 
139H Euclidean Geometry H. Distance between Subspaces Since the Euclidean space An is a metric space, the distance is defined between any two non- empty subsets of An ( 273 Metric Spaces). Let A r , B' be two subspaces of dimensions r, S of An, and let d be the distance between them. Then it can be shown that there exist points pE A r , q E B S such that d = pq , and if p' EAr, q' E B' are any other points with d = p' q' , then Pii = p' q'. In particular, when r == 0 and S = n - 1 (i.e., when Ar=p is a point and B'=Bn-1 is a hyperplane), the distance d can be obtained as follows: If (a, x) = b is an equation of Bn-I and p is the location vector of p, then d = I(a, p)- bl/lal. If lal = 1 in this equation of Bn-I, then d is given simply by I(a,p)-bl. An equa- tion (a, x) = b of a hyperplane is said to be in Hesse's normal form if lal = 1. I. Spheres and Subspaces The set of points in a Euclidean space lying at a fixed distance r from a given point is called the sphere of radius r with center at the given point. If p is the location vector of the center of this sphere with respect to a given rectangular coordinate system, then the equation of the sphere is Ix-pl=r or (x,x)-2(p,x)+(p,p)- r 2 = O. The set of points lying at equal dis- tances from k + 1 points with location vectors Po, Ph "', Pk (k?; 1) is a linear subspace of the space (which may be 0 or the entire space). If these points are independent, then the sub- space has dimension n - k, where n is the di- mension of the entire space. In particular, if these points are vertices of an n-dimensional tsimplex, then there is a unique sphere passing through them, called the circumscribing sphere of the simplex. In this case, the simplex is said to be inscribed in the sphere. If Po, PI' ..., Pn are location vectors of the vertices of the sim- plex, then the equation of the circumscribing sphere of the simplex is given by Po PI P6 pf Pn X = O. p; x 2 When n = 2 or 3, there are many classical results concerning the circumscribing circle of a triangle, the circumscribing sphere of a sim- plex, and other figures related to a triangle or a simplex. References [IJ G, D. Birkhoffand R. Beatley, Basic geom- etry, Scott, Foresman, 1941 (Chelsea, third edition, 1959). 554 [2J E. E. Moise, Elementary geometry from an advanced standpoint, Addison-Wesley, 1963. [3J H. Weyl, Mathematische Analyse des Raumproblems, Springer, 1923 (Chelsea, 1960). [4J F. Klein, VorIesungen iiber h6here Geom- etrie, Springer, third edition, 1926 (Chelsea, 1957). [5J 1. Dieudonne, Algebre lineaire et geometrie elementaire, Hermann, second corrected edi- tion, 1964; English translation, Linear algebra and geometry, Hermann, 1969. [6J G. Choquet, L'enseignement de la geome- trie, Hermann, 1964. 140 (VI.4) Euclidean Spaces A space satisfying the axioms of Euclidean geometry is calIed a Euclidean space. An taffine space having as tstandard vector space an n- dimensional Euclidean tinner product space over a real number field R is an n-dimensional Euclidean space En. In an n-dimensional Eu- clidean space En, we fix an torthogonal frame 1:'=(O,E I , ...,En), ei= OE i , (e i ,e)==8ij. The frame 1:' determines trectangular coordinates (X I ,X 2 , ...,x n ) of each point in En. We can thus establish a one-to-one correspondence be- tween En and R n = {(XI' .., ,xn)lxjER}, In this sense we identify En and R n and usually call R n itself a Euclidean space. The 1-dimensiona1 space R I is a straight line, and the tCartesian product of n copies of R I is an n-dimensional Euclidean space (or Cartesian space). Given points X=(X h X2, ...,x n ) and Y=(YI,Yz, ...,Yn) in the Euclidean space R n , the tdistance d(x,y) be tween them is given by J (YI-xd2+ ...+(Yn- x n)2. Thus the distance d(x, y) supplies Rn with the structure of a tmetric space, We call Xi the ith coordinate of the point x, the point (0, '" ,0) the origin of R n , and the set of points {x I -00 < X j < 00; x j = 0, j # i} the xi-axis (or ith co- ordinate axis). For an integer m such that -1 .s; m.s; n, we define m-dimensional tsub- spaces in R n ; a -l-dimensional subspace is the empty set, a O-dimensional subs pace is a point, and a I-dimensional subspace is a straight line. If we take an orthogonal frame, an m-dimensional subspace is represented as an Rm ( 139 Euclidean Geometry; 7 Affine Geometry). As a ttopological space, R n is tlocally com- pact and tconnected. A bounded closed set in R n is t com pact (Bolzano- Weierstrass theorem). Given a point a=(a l , ...,a n ) in R n and a real positive number r, the subset {xld(x,a).s;r} of 
555 Rn is called an n-dimensional solid sphere, solid n-sphere, ball, n-ball, disk, or n-disk with center a and radius r, its tinterior {xld(x,a)<r} an n- dimensional open sphere, open n-sphere, open ball, open n-ball, open disk, or open n-disk, and its tboundary {x I d(x, a) = r} an (n - 1)- dimensional sphere or (n - 1 )-sphere. In partic- ular, a 2-dimensional solid sphere is called a circular disk, its interior an open circle, and its boundary a circumference. A disk or a circum- ference is sometimes called simply a circle. The family of n-dimensional open spheres with center a gives a base for a neighborhood system of the point a. Suppose that we are given a sphere S and two points x, y on S. The points x, yare called antipodal points on the sphere S if there exists a straight line L passing through the center of S such that S n L = {x, y}, The segment(or the length of the segment) whose endpoints are antipodal points is called the diameter of the solid sphere (or of the sphere). The notion of tdiameter ( 273 Metric Spaces) of a solid sphere or of a sphere considered as a subset of the metric space R n coincides with the notion of diameter of the corresponding set defined above, When n  3, the intersection of a sphere and a 2- dimensional plane passing through the center of the sphere is called a great circle of the sphere, For m such that 1 .s; m.s; n, we consider an m-dimensional solid sphere or an (m -1)- dimensional sphere in an m-dimensional plane R m . These spheres are also called m- dimensional solid spheres or (m-l)-dimen- sional spheres in R n . In particular, the solid sphere of radius 1 having the origin as its center is called the unit disk, unit ball, or unit cell, and its boundary is called the unit sphere. (In particular, when we deal with the 2-dimensional space R 2 , we use the term circle instead of sphere, as in unit circle.) The points (0, .,. ,0, 1) and (0, ... , 0, -1) are called the north pole and south pole of the unit sphere, respectively. The (n - 2)-dimen- sional sphere, which is the intersection of the unit sphere and the hyperplane X n = 0, is called the equator; the part of the unit sphere that is "above" this hyperplane (i.e., in the half-space X n  0) is called the northern hemisphere, and the part that is "below" the hyperplane (i.e., in the half-space xn.s; 0) the southern hemisphere. Let ai' b i be real numbers satisfying ai < b i (i=1,2,...,n). The subset {xlai<xi<bi,i= 1, 2, ... , n} of R n is called an open interval of R n , and the subset {x I ai.s; xi.s; b;} a closed interval. They are sometimes cal\ed rectangles (when n = 2), rectangular parallelepipeds, or boxes. An open interval is actually an open set of R n , and a closed interval is a closed set. In particular, the closed interval {x I O.s; Xi.s; I, i = 1, 2, ., . , n} is called the unit cube (or unit n- 141 Euler, Leonhard cube) of R n . We can take the set of open inter- vals as base for a neighborhood system of R n . All the tconvex closed sets (for example, closed intervals) having interior points in R n are homeomorphic to an n-dimensional solid sphere. A topological space In that is homeomorphic to an n-dimensional solid sphere is called an n-dimensional (topological) solid sphere, (topological) n-cell, or n-element. A topological space sn-I homeomorphic to an (n -I)-dimensional sphere is called an (n -1)- dimensional topological sphere (or simply (n - 1 )-dimensional sphere. The spaces In and sn-I are torientable ttopological manifolds whose orientations are determined by assign- ing the generators of the (relative) thomology groups Hn(I n , jn) and Hn-I (sn-l), respectively (both are infinite cyclic groups). By means of the tboundary operator a: Hn(In,in)--> Hn-I (Sn-I), the orientation of In or sn-I deter- mines that of the other. References See references to 139 Euclidean Geometry. 141 (XXI.22) Euler, Leonhard Leonhard Euler (April 15, 1707-September 18, 1783) was born in Basel, Switzerland. In his mathematical development he was greatly influenced by the BernoulIis ( 38 Bernoulli Family). He was invited to the St. Petersburg Academy in 1726 and remained there until 1741, when he was invited to Berlin by Fre- derick the Great (1712-1786). Euler was active at the Berlin Academy until 1766, when he returned to St. Petersburg. Already having lost the sight of his right eye in 1735, he now became blind in his left eye also. This, how- ever, did not impede his research in any way, and he continued to work actively until his death in St. Petersburg. Euler was the central figure in the mathe- matical activities of the 18th century. He was interested in alI fields of mathematics, but especially in analysis in the style oftLeibniz, which had been passed down through the . Bernoullis and was developed by him into a form that led to the mathematics of the 19th century. Through his work analysis became more easily applicable to the fields of physics and dynamics. He developed calculus further and dealt formalIy with complex numbers. He also contributed to such fields as tpartial dif- ferential equations, the theory of telIiptic func- 
141 Ref. Euler, Leonhard tions, and the tcalculus of variations. He con- tributed much to the progress of algebra and theory of numbers in this period, and also did pioneering work in topology. He had, how- ever, little of the concern for rigorous founda- tions that characterized the 19th century. He was the most prolific mathematician of all time, and his collected works are still incomplete, though some seventy volumes have already been published. References [1] L. Euler, Opera omnia, ser. 1, vol. 1-29, ser. 2, vol. 1-30, ser. 3, vol. 1-13, Teubner and O. Fiissli, 1911-1967. 142 (XV.11) Evaluation of Functions A. History By the evaluation of a function f(x) we mean the application of algorithms for obtaining the approximate value of the function. The evalu- ation methods are classified roughly into two groups: (1) evaluation offunctions using ap- proximation formulas, and (2) evaluation using microprogramming techniques. The advent of high-speed computers has brought about drastic changes in the evaluation of functions. Before the introduction of high-speed com- puters in the 1940s, mathematical tables had played a prominent role, The first issue (pub- lished in 1943) of the journal Mathematical tables and other aids to computation (MT AC), the predecessor of the journal Mathematics of computation, was primarily concerned with tables of mathematical functions. One of the aims of this journal was to facilitate the ex- change of information on errors in the tables. The tables, obtained in the past by tedious hand calculation, can now be easily prepared by high-speed computers, and there was a period when more accurate and extensive tables were published one after another. It is ironic, however, that high-speed comput- ers revolutionized numerical analysis and prompted a shift of emphasis in the field, beginning in the 1950s, away from the use of numerical tables and toward exploration of the most efficient methods of approximation of the functions, thus causing a rapid decrease in the need for tables. Recently, a significant trend in computer design has replaced the conventional logic control section with "stored logic," or micro- programmed control, stored in high-speed, 556 nondestructive Read-Only Storage (ROS). For example, the microprogrammed control used for the unified COordinate Rotation DIgital Computer (CORDIC) algorithm is effective in calculating elementary functions because of its simplicity, accuracy, and capability of high- speed execution via parallel processing. It is not clear whether the applications of approxi- mation formulas heretofore in use will be superseded by microprogramming techniques in all kinds of computers. However, the value of the approximation formulas recognized in the 1950s has been declining insofar as elemen- tary functions are concerned. Recently, as a method for the evaluation of functions based on a new viewpoint, some "unrestricted" al- gorithms have been proposed by Brent [IJ, which are useful for the computation of ele- mentary and special functions when the re- quired precision is not known in advance or when high accuracy is necessary. It is expected that methods for the evaluation of functions using approximation formulas will be further developed as microprogramming techniques and unrestricted algorithms see wider use. B. Evaluation of Functions Using Approximation Formulas Suppose we approximate a function f(x) by a function g(x) using the following class of func- tions Pi(X) and qj(x). Let continuous functions pdx), ...,Pn(x) and qdx), ...,qm(x) defined on a closed interval [a, h J satisfy the folIowing conditions: (i) PI, ..., Pn and q l' ..., qm are both linearly independent; (ii) there exist at most a finite number of zeros for Lj1 bjqAx) in [a, bJ for any choice of b l , b 2 , ..., b m , except for the case b 1 = b 2 =... = b m = 0; (iii) there is a con- tinuous function g(x) with a nonzero denomi- nator in [a, bJ representable as n 1 m g(X) = i aiPi(x) j bjqj(x), (1 ) where a" i= 1, ...,n, and bj,j=O, 1, ...,m, are constants. Then g(x) is called a generalized rational function based on a class offunctions {Pi(X)}, i= 1, ...,n, and {qj(x)},j= 1, ...,m. If Pi(X)=X H and qAx)=x j - 1 , (1) is reduced to a rational function. Ifm= 1 and qdx)= 1, (1) is a linear combination of Pi(X) and is called an approximation of linear type to f(x); and further, if Pi(X)=X i -\ (1) is reduced to a poly- nomial. The crux of the approximation prob- lem lies in the criterion to be used in choosing the approximate constants in (1). There are three methods for choosing them, which lead to three types of approximation of major importance: (i) interpolatory approximations, (ii) tleast-squares approximations, and (iii) 
557 min-max error approximations (sometimes called best approximations). As a major aim of computer approximation of a function is to make the maximum error as small as possible, the third type of approximation has been used for digital computers. For every continuous function {(x), there always exist min-max error approximations of the form (1). In generating a min-max error approximation in practice, we essentially depend upon the following conditions and theorems. Let a function space F be a d- dimensional linear space. If for [E F which is not identically zero, there exist at most (d -1) zeros of f(x) in [a, b J, then F is called the unisolvent space or the Haar space. Let one of the best approximations to a continuous function f(x) be g(x), If the linear space D of all the functions of the form {L aiP;(x) + L bkg(x)qk(x)} is unisolvent, then the best approximation is unique. For the case where g(x) is an approximation of linear type to f(x), Haar [2J proved the following theorem. Let g(x) be the best approximation to j(x) and g(x) = LI aiPi(x). A necessary and suffi- cient condition for the best approximation to be always unique is that the m-dimensional linear space generated by the functions PI, P2, .., , Pm be unisolvent. In particular, we have the best unique polynomial approximation (one variable) of f(x) defined on [a, b]. Let the necessary and sufficient condition of the previous theorem be satisfied in a d- dimensional linear space D. A necessary and sufficient condition for g(x) given by (1) to be the best approximation of [(x) is that there exist points a.s;x O <x 1 <... <X d - I <xd.s;b, called deviation points, such that (-1 )i(/(X;)_ g(x i »)= J1 (i = 0, I, ..., d). A great number of algorithms are known by means of which one can calculate the best approximation g(x) of a function f(x) for the given values of nand m and for p;(x) = Xi-I and g}x) = xj1 in (1). However, the following three kinds of algorithms are used most fre- quently for generating min-max approxima- tions on computers: Remes's second algorithm [3J, the differential correction algorithm, and Yamauti's folding-up method, C. Evaluation of Elementary Functions Based on Microprogramming Chen [6J has given a general algorithm for calculating an elementary function z = f(x) as follows. Let F(x, y)= yg(x)+ h(x), where Y is some parameter for evaluating Zo= f(x o ) = F(xo, Yo)' We assume that (x o , Yo) has been given and that Zo is unknown. Suppose that a 142 C Evaluation of Functions new point (Xk+I, Yk+l) is obtained from (x k , Yk) according to a pair of transformations, Xk+l = <p(Xk, Yk) and Yk+l = Ij;(x b Yk), keeping the value of F(x o , Yo) invariant. If X k is forced to converge to x", and if g(x",) = I and h(x",) = 0, then Zo = f(x o ) = F(xo,Yo)= F(x l , Yl)='" = F(x""y",) = y",. In this procedure it is necessary to determine g(x), h(x), <p(x), and Ij;(x) for the given f(x). Most of the elementary func- tions can be evaluated by Chen's algorithm, which is essentially identical with Specker's Sequential Table Look-up (STL) method based on addition formulas [7J, e.g., log x = log(xex) -log ex or eX = eX-PeP. The iteratIOn equations (2) ofCORDIC given later can also be derived by the STL method with complex numbers. The use of coordinate rotation to evaluate elementary functions is not new. In 1956 and 1959 VoIder [8J described the CORDIC for the calculation of trigonometric functions, multiplication, division, and conversion be- tween the binary and the decimal and r-adic number systems, Daggett, in 1959, discussed the use of CORDIC for decimal-binary con- versIOn. It was recognized by Walther [9J in 1971 that these algorithms could be merged into one unified algorithm. Consider coordi- nate systems parameterized by m in which the radius R and angle A of the vector P = (x, y) are defined as R =(x 2 +ml)112 and A =m- 112 arctan(m l / 2 yjx). The basis of Walther's algo- rithm is the coordinate rotation in a linear (m = 0), circular (m = 1), or hyperbolic (m = -I) coordinate system, depending on which func- tion is calculated. Iteration equations of CORDIC are as follows. The point (X i + 1 , Yi+l' Zi+l) is obtained from the point (Xi, Yi, z;) by means of the transformation Xi+! ==x i +mYi 8 i, Yi+1 = Yi- x i 8 i, (2) Zi+l =Zi+(Xi, where m is a parameter for the coordinate system, ex i =m- 1 / 2 arctan (m 1 / 2 8;), and 8 i is a suitable value, e.g., ::I:2- i . The angle Ai+l and radius Ri+l are Ai+! = Ai - ex i and Ri+1 = R i X K i = R i x (I +m8 i 2 )1/2, After n iterations we find An=Ao-ex and Rn=RO x K, and then X n = K {xo cos(exml/ 2 )+ Yom 1 / 2 sin(exm 1 / 2 )}, Yn = K {yo cos(exm1/ 2 ) - x o m- 1 / 2 sin(exm l / 2 )}, Zn=ZO+ex, where ex = L?:J ex i and K = II?:J K i . These relations are summarized in Table 1 for m = 1, m = 0, and m = -1 in the following special cases. (i) The value of A is forced to converge to zero; Yn-->O. (ii) The value of z is forced to converge to zero; Zn -->0. 
142 D Evaluation of Functions Table 1. Input and Output Functions for CORDIC Input Function m Xo Yo Zo sin t 1 I/K 0 t cost 1 I/K 0 t tan -I t 1 1 t 0 xz 0 x 0 Z y/x 0 x Y 0 sinht -1 I/K_I 0 t cosh t -I I/K_I 0 t tanh -I t -1 1 0 558 Quantity to be 0 Output Zn -->0 zn-->O Yn-->O zn-->O Yn-->O zn-->O zn-->O Yn-->O Yn-->sin t x n -->cos t zn-->tan -I t Yn-->xz Zn--> y/x Yn -->sinh t X n -->cosh t Zn--> tanh -j t n-I Kl = IT (1 +<5/)1/2, jO n-I K -I = IT (1- <5/)1/2 jO D. Fast Fourier Transform (FFT) When a function f(x) can be taken to be periodic, it is advantageous to use trigono- metric polynomials as least-squares approxi- mating functions, The summations arising in least-squares approximations based on the trigonometric polynomials play an important role in various applications, and a quite effi- cient algorithm for the evaluation of such sums was developed by Cooley and Tukey [10] in 1965, known as the fast Fourier trans- form (FFT). Let x m , (O.s; m.s; N -1) be a set of complex numbers, and consider N-I Xn=(I/N) L xmexp( -2nimn/N) m=O (O.s;n.s;N-l). (3) Equation (3) is often called the discrete Fourier transform (DFT) of the sequence X m , it being analogous to the continuous tFourier transform, X n =(I/T) fx(t)ex P ( - 2nint/T)dt (O.s;n.s;N), (4) where x(t) is a periodic function of t with period T. X n is called the nth Fourier coeffi- cient of a set of N equally spaced samples of size N for the function x(t) (t=jT/N,O.s;j.s; N). In the same way as for the continuous Fourier transform, the discrete transform can be inverted to yield N-j x n = L X m exp(2ninm/N) (O.s;n.s;N -1). m=O Here X n is called the coefficient of the inverse Fourier transform, and X m and X n thus form a transform pair. The FFT algorithm for a finite sequence of length N in (3) is based on the fact that the calculation of (3) can be performed in stages by using the direct product decomposition if N = N I N 2 , with N I and N 2 relatively prime. For example, the 2-dimensional Fourier transform coefficient is given by X n1 ,n2 = [1/(N I N 2 )J x N t -l N 2 -1 L L exp( -2ni(n l m l /N I +n2m2/N2))xm1,m2 rn 1 =Om2=O (0.s;n l .s;N I -1,0.s;n 2 .s;N 2 -1). If by an elementary operation we mean one complex multiplication and one complex ad- dition, we can evaluate X n1 ,n2 through (N I N 2 )2 such operations using Horner's scheme. By the direct product decomposition method, however, the (N I N 2 )2 operations can be re- duced to only N I N 2 (N I +N 2 ) operations. Because the matrix corresponding to the trans- formation mentioned above is a direct product of N I x N I and N 2 x N 2 matrices, we can per- form the calculations in two stages: first to obtain (m 1 n 2 for O.s;m l .s;N j -I and O.s; n 2 .s;N 2 -1 and then to obtain X n1n2 for O.s; n 2 .s; N 2 -1 and O.s;nl.s;N j -1. We have N 2 -1 m1,n2 ==(I/N 2 ) L exp( -2nin2m2/N2)Xm1,m2' m2=O Nt-l X n1 ,n 2 =(I/Nd L exp(-2ninlmdNdm1.n2' m 1 =O This direct product decomposition method is welI known for 2- or 3-dimensional Fourier transforms. Even for a I-dimensional Fourier transform of length N = N I N 2 , if N l and N 2 are relatively prime, one can use the method of direct product decomposition Even when N I and N 2 are not prime, we can use a method of "pseudo"-direct product decomposition to reduce the number of operations, Namely, we can put m=m l +N l m 2 (O.s;m l <N I ,0.s;m 2 < N 2 ); n=N 2 n l +n 2 (0.s;n 1 <N 1 ,0.s;n 2 <N 2 ). Then, similarly as before, X n in (3) can be rewritten as N t -l Xn=(l/N I ) L exp( -2nin l m 1 /N 2 ) m 1 =O x exp( - 2nim 1 n 2 /(N I N2))m!on2' 
559 where N 2 -1 m n =(I/N 2 ) " exp( -2nin2m2/N2)Xm +N m , l' 2  1 1 2 m 2 =Q which is a DFT of length N 2 . If we put m n =exp ( -2nim l n2/(NIN 2)) m n , l' 2 1- 2 then Nt-l Xn==(l/Nd L exp( -2ninlmdNdm1,n2' m 1 =Q Formula (7) is nothing but a DFT of length N I , Thus an FFT of length N = N 1 N 2 can be calculated by decomposing it into three stages as follows: (i) obtain N 1 transforms of length N 2 in (5); (ii) multiply m1,n2 by exp( -2nim l n 2 /N) in (6) (phase rotation); (iii) calculate N 2 transforms of length N I in (7). If either or both of N I and N 2 can be factored further so that, e.g., N = N I N 2 == N 1 N 21 N 22 = ... , an FFT of length N 2 can be calculated similarly by decomposing it, and so on, and in this way one can reduce the total number of operations. This is the principle of FFT pointed out by H. Takahasi. When N is a power of two, it can be shown that the FFT algorithm requires approximately N log2 N /2 operations. E. Pade Approximation Letf(x)co+Clx+C2x2+... be a formal power series. For any pair of nonnegative integers (p, q), we define the (p, q)th Pade ap- proximation of f(x) as foJlows: The Pade ap- proximation is a rational function (a o +al x +a 2 x 2 +... +apx P ) /(b o +b l x + b 2 x 2 +... + bqx q ) satisfying the condition that all terms in the formal power series (bo+blx+... +bqxq)(co+c 1 x+ ...) -(ao+alx+... +apx P ) should vanish up to the term x p + q . An infi- nite matrix whose (p, q)th entry is the. (p, q)th Parle approximation is caJled the Pade table for f(x). The Pade approximation is uniquely determined, provided that every Hankel determinant C p c P +1 c p + q I c p + 1 c p + 2 c P + q +1 c p + q c p + q + 1 C'2q I never vanishes. When we expand f(x) into a continued 142 Ref. Evaluation of Functions fraction of Stieltjes type, say, col exlxl ex 2 xl f(x)f!+I1++"" (5) then its 2pth and (2p + 1 )th approximate frac- tions are the (p,p)th and (p+ l,p)th Pade approximation of f(x), respectively. (6) References (7) [IJ R. P. Brent, Unrestricted algorithms for elementary and special functions, IFIP, 1980, 613-619. F or the theory of approximation, [2J A. Haar, Die Minkowskische Geometrie und die Annaherung an stetige Funktionen, Math. Ann., (1918),294-311. [3J A. Ralston, Rational Chebyshev approx- imation by Remes' algorithms, Numer. Math., 7 (1965), 322-330. [4J E. W. Chenney, Introduction to approxi- mation theory, McGraw-Hill, 1966. [5J M. 1. D. PoweJl, Approximation theory and methods, Cambridge Univ. Press, 1981. For microprogramming techniques, [6J T. C. Chen, The automatic computation of exponential, logarithms, ratios and square roots, IBM Res. Rep. RJ 970 (1972),32. [7J W. H. Specker, A class of algorithms for In x, exp x, sin x, cos x, tan -I x and cot -I x, IEEE Trans. EJec. Comp., EC-14 (1965),85- 86. [8J 1. E. VoIder, Binary computation algo- rithms for coordinate rotation and function generation, Convair Rep. IAR-I 148 Aero- electronics group, 1956. [9J 1. S. Walther, A unified algorithm for elementary functions, Proc. AFIPS, 1971, Spring Joint Compo Con£., 379-385. For fast Fourier transforms, [10J J. W. Cooley and J. W. Tukey, An al- gorithm for the machine calculation of com- plex Fourier series, Math. Comp., 19 (1965), 297-301. [11J E. C. Brigham, The fast Fourier trans- form, Prentice-Hall, 1974. For the Pade approximation, [12J H. Pade, Sur la representation approchee d'une fonction par des fractions ration nelles, Ann. Sci. Ecole Norm. Sup., (3) 9 (1892), 1-92. [13J G. A. Baker, Jr., The Pade approximation 1,2, Encyclopedia of Mathematics and Its Applications, vols. 13, 14, Addison-Wesley, 1981. For tables of approximation formulas, [14J C. Hastings, Jr., Approximations for digital computers, Princeton Univ. Press, 1955. [15J 1. F. Hart (ed.), Computer approximation, Wiley, 1968. 
143 A Extremal Length 143 (XI.18) Extremal Length A. General Remarks The notable relation between the lengths of certain families of curves in a plane domain and the area of the domain has long been recognized and utilized in function theory. L. V. Ahlfors and A. Beurling formulated this relation by introducing the notion of extremal length for families of curves [I]. Although there are various definitions of extremal length, they are essentially the same except for one due to 1. Hersch [3J and A. Pfluger [2]. The image of an interval under a continuous mapping is called a curve. We say that it is locally rectifiable if every tcontinuous arc of the curve is trectifiable. Let C be a finite or countable collection of locally rectifiable curves in a plane and p, O.s; p.s; 00, be a tBaire function defined in the plane. Represent C in terms of arc length s ( 246 Length and Area), and set (C, p) = Sc pds. For a family r of finite or countable collections C, p is called admis- sible if (C, p)??-1 for every CE r. If no CEr consists of a finite or countable number of points, then p = 00 is always admissible for r. Call inf {J S p2 dx dy}, where p runs over admissible Baire functions, the module of r, and denote it by M(rj. The reciprocaIA(rj= I/M(r) is called the extremal length of r. If no Baire functions are admissible for r, we set A(r) = O. The extremal length is defined equivalently in two other ways as follows: Let p be a nonnegative Baire function, and put L(r,p)=inf{(C,p)ICEr}, then A(rj= sup L(r, p)2/SS p 2 dxdy, where p runs over nonnegative Baire functions. Next, let <1> be the collection of nonnegative Baire func- tions p such that S S p2 dx dy.s; I; then A(r) = sup{L(r,p)2IpE<1>}. We obtain the same value for A(rj if we.require an admissible p to be tlower semicontinuous. If p is required to be continuous, then the extremal length de- fined by Hersch and Pfluger is obtained. As is shown in example (1) of Section B, there is a case where the two definitions actually differ. If an admissible p yields M (rj = S S p2 dx dy, then pldsl is called an extremal metric. Beur- ling gave a necessary and sufficient condition for a metric to be extremal [4]. We list four properties of extremal length: (1) },(rd??-A(r 2 ) ifr l cr 2 . (2) M(Unrn).s; Ln M(r n ). (3) Let {r n } and r be given. Suppose that there are mutually disjoint measurable sets {En} such that each C n E r n is contained in En. If each element of Un r n contains at least one CEr, then M(rj??-LnM(r n ), and 560 hence M( Y rn)=M(rn)' If each C E r contains at least one C n E r n for every n, then A(rj??- Ln A(r n ). (4) Let f be an analytic function in a domain 0 and {C} be given in O. Denote by f(C) the image of C by f Then ..1.( {C}).s; Jc( (f(C)}). The equality holds if f is one-to-one. This shows that ..1.( {C}) is conform ally invariant. B. Extremal Distance Let 0 be a domain in a plane, ao its bound- ary, and Xl' X 2 sets on 0 U ao. The extremal length of the family of curves in 0 connecting points of Xl and points of X 2 is called the extremal distance between Xl and X 2 (relative to 0) and is denoted by }.g(X l , X 2 ). Example (1). Let 0 = {z Ilzl <2}, Xl =ao, and X 2 be a countable set in Izi < 1 such that the set of accumulation points of X 2 coincides with Izl= 1. Then A g (X l ,X 2 )=00, but the extremal distance in the sense of Hersch and Pfluger is equal to (2n)-llog2. Example (2). In a rectangle with sides a and b, the extremal distance between the sides of length a is b/a. Example (3). Let 0 be an annulus r l < Izl <r 2 . The extremal distance ..1. between the two boundary circles of 0 is equal to (2n) -llog(r2/rd. The extremal length of the family of curves in 0 homotopic to the bound- ary circles is equal to 1/..1.. Example (4). Let 0 be a domain in the ex- tended z-plane such that 00 E O. Let Zo E 0 and {Iz -zol =r} c 0, and denote by Ar the ex- tremal distance between {I z - Zo 1= r} and a set X cao relative to O. Then A r -(2n)-llogr increases with r. We call the limit the reduced extremal distance and denote it by 2 g (X, 00), tRobin's constant for tGreen's function in 0 with pole at z = 00 is equal to 2n2 g (aO, 00). Extremal length is also defined on Riemann surfaces. Some classical conformal invariants can be given in a generalized form in terms of extremal length. The notion of extremal length has applications in various branches of func- tion theory, such as tconformal and tquasi- conformal mappings, the tPhragmen-Lindelof theorem, the tcoefficient problem, the ttype problem of Riemann surfaces, and studies of the boundary properties of functions of finite Dirichlet integrals. It is also applied to prob- lems in differential geometry. Extending the notion of extremal length, M, Ohtsuka con- sidered extremal length with weight, and B. Fuglede introduced the notion of generalized 
561 module in higher-dimensional spaces [8]. These notions have useful properties and applications. References [IJ L. V. Ahlfors and A. Beurling, Conformal invariants and function-theoretic null-sets, Acta Math., 83 (1950),101-129. [2] A. Pfluger, Extremalliingen und Kapazitiit, Comment. Math. He1v., 29 (1955), 120-131. [3] J. Hersch, Longuers extremales et theorie des fonctions, Comment. Math. Helv., 29 (J 955), 301-337. [4] M. Ohtsuka, Dirichlet problem, extremal length and prime ends, Van Nostrand, 1970. [5] Y. Kusunoki, Some classes of Riemann surfaces characterized by the extremal length, Proc. Japan Acad., 32 (1956). [6] M. Ohtsuka, On limit of BLD functions along curves, 1. Sci. Hiroshima Univ., 28 (1964). [7] T. Fuji'i'e, Boundary behaviors of Dirich- let functions, J. Math. Kyoto Univ., lO (1970). [8] B. Fuglede, Extremal length and functional completion, Acta Math., 98 (1957),171-219. [9] J. A. Jenkins, Univalent functions and conformal mapping, Erg. Math., Springer, 1958. [lOJ J. Viiisiilii, Lectures on n-dimensional quasiconformal mappings, Lecture notes in math. 229, Springer, 1971. 143 Ref. Extremal Length 
144 Ref. Fermat, Pierre de 144 (XXI.23) Fermat, Pierre de Pierre de Fermat (August 20, 1601-January 12, 1665) was born into a family of leather merchants near Toulouse, France. He became an attorney and in 1631 a member of the Toulouse district assembly. When not engaged in such work, he did research in mathematics, so that he consigned his results only to his correspondence or to unpublished manu- scripts, The manuscripts were published post- humously by his son in 1679 and are known as Varia opera mathematica. His research into number theory, stimulated by Bachet's (1581- 1638) translation of the Arithmetika of Dio- Iphantus (published in 1621), made Fermat's name immortal and initiated modern num- ber theory. He posed the famous Fermat's Problem, which has yet to be solved ( 145 Fermat's Problem), He began analytic geome- try by studying the theory of tconic sections of Apollonius, and utilizing this theory he dealt with the notions of tangent lines, maximal (minimal) values of functions, and quadrature, which made him a pioneer in calculus. He also wrote a precursory work in the theory of prob- ability in the course of his correspondence with tpascal. tFermat's principle is important in the field of optics, where it is known as the law of least action, Unlike tDescartes, he em- phasized the revival rather than the criticism of Greek mathematics. References [IJ p, Fermat, Oeuvres I-V and supplement, P. Tannery and C. Henry (eds,), Gauthier- Villars, 1891-1922. 145 (V.16) Fermat's Problem The last theorem of Fermat (c. 1637) asserts that if n is a natural number greater than 2, then x n + yn = zn has no rational integral solution x, y, z with xyz#O, In the case n=2, equation (I) has integral solutions called Pythagorean numbers ( 118 Diophantine Equations). Fermat read a Latin translation of Diophantus' Arith- metika, in which the problem of finding all Pythagorean numbers is treated. In his per- sonal copy of that book, Fermat wrote his as- 564 (1) sertion as a marginal note at the point at which n = 2 in equation (1) i treated and added the famous words, "I have discovered a truly remarkable proof of this theorem which this margin is too small to contain." It is not known whether Fermat actually had a proof. Fermat's problem asks for a proof or disproof of this conjecture, which itself has not been solved despite centuries of efforts by many mathematicians; but its study has promoted remarkable advances in number theory. In particular, E. E. Kummer's theory of ideal numbers and the development of the theory of tcyclotom!c fields were originally conceived in treating Fermat's problem. In this article, we consider only those in- tegral solutions x, y, z of equation (1) with xyz#O that are relatively prime. We also restrict ourselves to the cases n = 1 (odd prime) and n = 4, without loss of generality. For smaller values of n, the nonsolvability of equation (1) was proved long ago, for n = 3 by L. Euler (1770), and later again by A, M. Le- gendre; for n=4 by Fermat and Euler; for n=5 by P. G. L. Dirichlet and Legendre (1825); and for n = 7 by G. Lame (1839), S. Germain and Legendre found some results on more general cases, but the most remarkable result was ob- tained by Kummer (J. Reine Angew. Math., 40 (1850), Abh. Akad. Wiss. Berlin (1857)), Let 1 be an odd prime, ( a primitive Ith root of unity, and h the tclass number of the cyclo- tomic field Q(o. Then the class number h 2 of the real subfie1d Q«( + C') of Q(O divides h. We call hi = h/h 2 and h z the tfirst and tsecond factors of h, respectively. (1) If 1 is tregular, that is, if(h, 1)= 1, then Xl + yl = Zl has no solution (Kummer, 1850). There are infinitely many irregular primes [3J; those under 100 are 37, 59, and 67. There are 7,128 regular primes and 4,605 irregular primes between 3 and 125,000. It is not yet known whether there are infinitely many regu- lar prime numbers, although the beginning part of the sequence of natural numbers con- tains a larger number of these than the num- ber of irregular prime numbers. The condition (I, h) = 1 is equivalent to saying that the numer- ators of tBernoulli numbers B 2m (m = 1,2, ..., (1- 3)/2) are not divisible by 1 (Kummer, 1850). Kummer obtained a result on irregular primes (1857) which was improved later as follows. Note that if 1 is not regular then hi is divisible by 1 (Kummer, 1850) ( 14 Algebraic Number Fields), (2) If (h 2 , I) = I and the numerators of Ber- noulli numbers B 2ml (m = 1,2, ..., (1- 3)/2) are not divisible by 1 3 , then Xl + yl =Zl has no solution (H, S, Vandiver, Trans. Amer, Math. Soc., 31 (1929)). By computation Vandiver confirmed that Xl + yl = Zl has no solution for 
565 1< 619. At present, this procedure has been continued for I.s; 125,080 using computers by the method of D. H. Lehmer, E. Lehmer, and H. S. Vandiver, Proc. Nat. Acad. Sci US, 40 (1954) (S. S. Wagstaff, Math, Camp., 32 (1978)). When the condition (xyz, 1)= 1 or (xyz, 1)=1 is added, we speak of Case I or Case II, respec- tively. The following theorems hold for Case I. (3) If Xl + / = Zl has a solution in Case I, then B 2n1 J;-2m (t)="O (modi), m= 1,2, ...,(1-3)/2, (2) holds for - t = x/y, yjx, yjz, zjy, x/z, and z/x, where fm(t) = L-;;b r m - t t r , and Bm is the mth BernouIli number. This is caIled Kummer's criterion (D. Mirimanov, 1905). A simplification of the above result is (4a) If Xl+/=Zl has a solution in Case I, then (2I-l-l)/1=0 (modi) (A. Wieferich, J. Reine Angew. Math., 136 (1909)). This result created a sensation at the time of its publication. It was first shown that 1093 and 3511 are the only primes with k 3700 for which the above congruence holds; it is presently known that no other I with I.s; 6 x 10 9 satisfies this congruence. The criterion (4a) was gradually improved by Mirimanov (1910,1911), P. Furtwiingler (1912), Vandiver (1914), G. Frobenius (1914), F. PoIlaczeck (1917), T. Morishima (1931), and J. B. Rosser (1940, 1941). For example: (4b) If Xl+ /=Zl has a solution in Case I, then (mI-l-l)/I=O (modi) holds for all m with 2.s;m.s;43. By means of this result, Rosser (1941) showed for k 41,000,000, and D. H. Lehmer and E. Lehmer (Bull. Amer. Math. Soc., 47 (1941)) showed for k253,747,889 that xl+l=ZI has no solution in Case I. We have hitherto been concerned with rational integral solutions of Xl + / = Zl. We may also consider the problem of proving or disproving that ex l + 13 1 = 1'1 has no solution ex, 13, I' with iXf3y # 0 in the ring of talgebraic integers of Q(O. Case I means the impossibil- ityof ex l + 13 1 +1'1 =0, (exf3y, I) = 1, and Case II means the impossibility of ex l + 13 1 =dnlyl, (exf3y, l) = I, where n is a natural number, G is a tunit in Q(s), and ..1.= 1- S. We have the following results: (1 *) If (h, I) = 1, then neithcr equation (4) nor equation (5) has a solution (Kummer, 1850). 146 A Feynman Integrals (2*) Under the same conditions as in state- ment (2), equation (4) has no solution. If we additionaIly restrict iX, 13, I' to relatively prime integers of Q(, + C I ) and replace ), by (1 - 0(1- C t ), then equation (5) also has no solu- tion (Vandiver, 1929). (4b*) If equation (4) has solutions iX, 13, I' in Q(O, then congruence relation (3) holds for all m with 2.s; m.s; 43 (Morishima, 1934). When I is sufficiently large, we have the results of M. Krasner (c. R. Acad. Sci. Paris (1934)) and Morishima (Proc. Japan Acad., 11 (1935), Bibliographies are given in Vandiver and Wahlin [1] and Vandiver [2]. References [1] H. S. Vandiver and G. E. Wahlin, Alge- braic numbers II, Bull. Nat. Res. Council, no. 62, 1928. [2J H. S. Vandiver, Fermat's last theorem, Amer. Math. Monthly, 53 (1946), 555-578. [3J P. Ribenboim, 13 lectures on Fermat's last theorem, Springer, 1979. 146 (XX.30) Feynman Integrals A. Introduction (3) As the S-matrix or tGreen's function in quan- tum field theory is usuaIly prohibitively dif- ficult to calculate, perturbative expansions in tcrms of coupling constants have been em- ployed since the beginning of the theory (  386 S-Matrices). R. P. Feynman (phys. Rev., 76 (1949)) invented a way of calculating the series in terms of Feynman integrals. His method drasticaIly simplified the preceding method due to S. Tomonaga and J. S. Schwinger, even though, as was later shown, the two methods are theoretically equivalent (F. J. Dyson, Phys. Rev., 75 (1949). A Feynman integral is an integral associated with a Feynman graph according to the Feynman rule explained in Section B. Feynman integrals inherit the troublesome problem of divergence, and some recipe which systematicaIly provides them with a definite meaning is needed. Such a recipe is given by the renormalization the- ory of Tomonaga, Schwinger, Feynman, and Dyson. A mathematicaIly rigorous renormali- zation theory was given by N. N. Bogolyubov and O. S. Parasiuk (Acta Math., 97 (1957)), later supplemented by K. Hepp (Comm. Math. (4) (5) 
146 B Feynman Integrals Phys., 2 (1966)). See also W. Zimmermann (Comm. Math. Phys., 15 (1969)) and S. A. Anikin et al. (Theor. Math. Phys., 17 (1973)). Furthermore, E. R. Speer [IJ gave a mathe- matically convenient recipe of renormalization (under the condition that massless particles are irrelevant). Although the series expansion in coupling constants is a divergent series even after renormalization ( 386 S-Matrices), the study of Feynman integrals has given much insight into the qualitative aspects of the S-matrix, and in particular, into its analytic structure (e.g., R. J. Eden et al. [2J). In this respect the discovery of the Landau-Nakanishi equations, which describe the location of sin- gularities of Feynman integrals, was crucially important (L. D. Landau, N. Nakanishi, and J. Bjorken, 1959;  Section C). Later, R. E. Cutkosky found a formula which gives impor- tant information concerning the ramification of Feynman integrals near their singularities ( Section C). It gave impetus to J. Leray's mathematical study of Feynman integrals from the viewpoint of integration of multi- valued analytic functions (Leray, Bull. Soc. Math. France, 87 (1959)), Such studies were subsequently carried out by D. Fotiadi, M. Froissart, J. Lascoux, F. Pham, etc.;  [3-5J and references cited there for this topic. An extensive study by G. Ponzano, T. Regge, Speer, and J. M. Westwater on the mono- dromy structure of Feynman integrals is closely related to the studies by Pham and others ( Regge in [6J and references cited there). On the other hand, the progress of microlocal analysis has thrown new light on the Feynman integrals and has given a unified foundation to these various other studies ( Section C; also Pham, M, Kashiwara, and T. Kawai in [6J, M. Sato et al. in [6J and refer- ences cited there). B. Definitions First, the notion of Feynman graphs is intro- duced. A Feynman graph is sometimes called a Feynman diagram. A Feynman graph G con- sists of finitely many points (called vertices) {T-j}jl....n" finitely many I-dimensional seg- ments (called internal lines) {L1},=I....,N and finitely many half-lines (called external lines) {L;}r=1. ,n' all of which are located in a 4- dimensional affine space. Each of the end- points W; + and W; - of Ll and the endpoint of L; coincide with some vertex T-j. A four-vector Pr=(Pr,0,Pr,I,Pr,2,Pr,3) is associated with each external line L; and a constant m 1 ? 0 is as- sociated with each internal line Ll' For sim- plicity, we usually suppose, in addition, that each internal line and each external line are 566 oriented, that W; + # W; - and that G is con- nected. The orientation of a line is indicated by the symbol-+- . Given an orientation, we de- fine the incidence number [j: IJ to be + 1 or -1 according to whether Ll ends or starts from T-j. In other cases, [j: IJ is defined to be zero. The incidence number [j: r J is defined in the same way but with L; replacing Ll (Fig. 1). P, P, PI Fig. 1 In this example of a Feynman graph, the internal lines L 2 and L4 do not intersect and this diagram should be drawn in R4, not in R2; this is indicated by '?(. For convenience, multiple lines such as L7 and L8 are usually drawn in a curvilinear manner, as shown. The Feynman rule associates the following integral FG(p) with each Feynman graph G: F. _ f I1';1 b4(L;=1 [j: rJPr+ L=I [j: IJkz) G(p)- N 2 2 11 II1I(kl -m 1 +y -1 0) N xL d 4 kz, (1) 1=1 where k?=ko-L=l kv' Here 1/(k?-m?+ PO) means lim'Jo(1/(k?-m?+p E) ( 125 Distributions and Hyperfunctions). Here we consider the case where the interac- tion Lagrangian density does not contain differential operators (i.e., direct coupling) and all relevant particles are spinless. In general, we should multiply the integrand of FG(p) by a matrix of polynomials of the Pr and k 1 . FG(p) has the form b(Lj,r[j: rJPr)fG(P); we often investigate .fG(p) instead of FG(p). The function fG(p) is studied on M = def{PE R 4 n I Lj,r[j:rJPr=O}, and is called a Feynman amplitude. The integral (1) is not well defined as it stands because of the following prob- lems: (a) Is the product appearing in the inte- grand well defined? (b) Is the integral conver- gent? The first problem is not serious if m 1 # 0 ( 274 Microlocal Analysis E) However, the second problem, called the ultraviolet diver- gence, is serious. The renormalization proce- dure is intended to overcome this difficulty, When some m 1 is equal to zero, even the first problem, called the infrared divergence, is serious. See D. R. Yennie et al. (Ann. Phys., 13 (1961)) and T. Kinoshita (J. Math. Phys., 3 (1962)) for analyses of the infrared divergenee, In this article we always assume for simplicity that every m 1 is strictly positive, even though 
567 such an assumption is too restrictive from the physical viewpoint. Calculations of Feynman integrals are often done by means of the parametric representa- tion of the integral (1), of the form b ( L[j:rJPr )f l j.r 0 f l b(I-L=lexl)IIldex, x 0 U(ex)2(V(p, ex) + J=l 0)-N+2n'-2' where U(ex) and V(p, ex) are determined by the topological structure of the graph G. See [7J for the derivation of this formula and the treatment of the integral written in this form. It is useful not only for the study of the sin- gularity structure of FG(p) but also for the study of spectral representations, etc. (, e.g" [7,8]). Note that several different notations are used in the literature for the parametric representation of the integral. Hence one should be careful in referring to papers which use parametric representations. C. Analytic Properties of Feynman Integrals The celebrated result of Landau (Nuclear Phys., 13 (1959)), Nakanishi (Prog. Theor. Phys., 22 (1959)) and Bjorken (thesis, Stanford Univ., 1959) asserts that the singularities of a Feynman amplitude fG(p) are confined to the subset L +(G) (called a positive-ex Landau- Nakanishi variety) of M = {pER 4n I Lj.r[j: rJPr = O}, defined by the following set of equations (called Landau-Nakanishi equations), where u" Wj, and k, are real four-vectors and ex, is a real number, all of which are to be eliminated to define relations among the Pr (note, however, that a positive-ex Landau-Nakanishi variety is not strictly a subvariety of M, because of the constraint (2e): n' ur=L[j:rJWj (r=I,...,n), j=' n N L [j: rJPr+ L [j: IJk, =0 (j= 1,.", n'), r==l 1===1 n' L [j: I] Wj = ex,k, (/ = J, ... , N), j1 ex,(kf -mf)=O (/= 1,..., N), ex,;;;' 0, with some ex(O. Usually a Landau-Nakanishi variety (resp., equation) is called a Landau variety (resp., equation) for short. The equation (2a) is conventional; (2b) represents the energy- 146 C Feynman Integrals (2a) momentum conservation law at the vertex JIj; (2d) corresponds to the mass-shell constraint (if ex,O) on the internal1ine L,. Since (2c) entails L,F.,(C)ex,k, =0 for a closed circuit (= loop) C of G for some set of values F.,(C) = :t 1 or 0 with F.,( C) being 0 if L, does not be- long to C, the equation (2c) is usually called the closed-circuit condition. By attaching Wj to the vertex JIj and associating ex,k, to the inter- nalline L" we get a diagram representing a multiple collision of classical point particles, since the relations (2a)-(2f) are just the class- ical conditions for such a collision (S. Coleman and R. E. Norton, Nuovo Cimento, 38 (1965)). The fact that the Landau-Nakanishi equations admit such an interpretation is neither acci- dental nor superficial in view of the tmacro- scopic causality of the S-matrix, Note also that there is another interpretation of the Landau- Nakanishi equations, which emphasizes their resemblance to tKirchhoff's law (Nakanishi, Prog. Theor. Phys., 23 (1960)). Such a resem- blance can be used to study the structure of Feynman integrals from the viewpoint of tgraph theory. See [7] and the references cited there for this topic. An important observation by Pham and Sato (1973), which opened a way to the micro- local analysis of Feynman integrals and the S-matrix, is the following: If we consider the Landau-Nakanishi equations to define a sub- variety of S* M, the tspherical cotangent bun- dle of M, by eliminating only w, k, and ex, then the resulting variety describes the tsin- gularity spectrum of fG(p), More precisely, S.S.fG(p) is confined to the set (p; J=l u), where (p;u) satisfies the Landau-Nakanishi equations. The rigorous proof of this state- ment was given by Sato et al. in [6]. The subset of S* M or J=l S* M thus obtained is denoted by 2+(G) and is also called a positive-ex Landau-Nakanishi variety. It is noteworthy that thc microlocalization of the classical result of Landau, Nakanishi, and Bjorken had essentially been achieved in a less sophisticated manner by D. Iagolnitzer and H. P. Stapp (Comm. Math. Phys., 14 (1969)) in the framework of S-matrix theory, The variety defined by (2a)-(2d) and (2f) is denoted by L(G) or 2(G) and is a Landau-Nakanishi variety. In a neighborhood of pO in L +(G), fG(p) is the boundary value of a holomorphic function JG(p) whose domain of definition is determined by u-vectors ( 274 Microlocal Analysis E), Furthermore, in simple cases one can verify that JG(p) can be analytically con- tinued to define a holomorphic function on  the universal covering space U - L( G)c of U - L(G)c for a complex neighborhood U of pO, where L(G)c denotes a complexification of L(G). Hence we can discuss the difference of (2b) (2c) (2d) (2e) (2f) 
146 Ref. Feynman Integrals lG(YP) and lG(p), where Y denotes a loop en- circling L(Gf. Cutkosky (J, Math, Phys" 1 (1960)) observed that it can be expressed (in simple cases) by the integral obtained by replacing 1/(kf-mf+pO) in the right- hand side of(1) by -2nPl5+(kf-mf)=der -2np Y(k 1 ,0)l5(kf-mf), where Y(k1,o) denotes the tHeaviside function. A result of this type is called a discontinuity formula and is now obtained for the S-matrix itself in a suitably modified manner, i.e., the discontinu- ity formula holds beyond the framework of perturbation theory ( 386 S-Matrices). As is mentioned in 386 S-Matrices, the discontinuity formula is closely related to Sato's conjecture on the tholonomic character of the S-matrix (T. Kawai and Stapp in [6J). Actually, M. Kashiwara and Kawai (in [6J) proved that fG(p) satisfies a tholonomic system of linear differential equations whose characteristic variety is confined to the extended Landau- Nakanishi variety L(G), They further proved that the system has regular singularities (  274 Microlocal Analysis G). Their result gives, on one hand, a precise version of Regge's statement to the effect that fG(p) is a generaliza- tion of a thypergeometric function (in Battelle Rencantres, C. DeWitt and J. A. Wheeler (eds.), Benjamin, 1968), and, on the other hand, a rigorous proof of the fact that lG(p) is a Nils- son class function. This fact is closely related to the works of D. Fotiadi, J. Lascoux, Pham, and others. Kashiwara and Kawai (Camm. Math. Phys" 54 (1977)) also showed that the holonomic character of the Feynman ampli- tude is an important clue for understanding the so-called hierarchical principle, which had been proposed and studied in connection with the tMandelstam representation by the Cambridge group (Eden et al. [2J). Further- more, Kashiwara et a\. (Camm. Math. Phys., 60 (1978)) gave a useful expression of fG(p) at several physically important points by analyz- ing the microlocal structure of the holonomic system that fG(p) satisfies. Thus the use of t(micro-) differential equations in analyzing Feynman amplitudes has turned out to be effective in understanding their singularity structures in a unified manner. References [IJ E. R. Speer, Generalized Feynman ampli- tudes, Princeton Univ. Press, 1969. [2J R. J. Eden, P. V. Landshoff, D. I. Olive, and J. C. Polkinghorne, The analytic S-matrix, Cambridge Uni v . Press, 1966. [3J R. C. Hwa and V. L. Teplitz, Homology and Feynman integrals, Benjamin, 1966. 568 [4J F. Pham, Introduction a l'etude topo- logique de singularites de Landau, Gauthier- Villars, 1967. [5J S. Lefschetz, Applications of algebraic topology, Springer, 1975. [6J Pub\. Res. Inst. Math. Sci., 12, suppl. (1977) (Proc. Oji Seminar on Algebraic Analy- sis, 1976). [7J N. Nakanishi, Graph theory and Feynman integrals, Gordon & Breach, 1971. (Original in Japanese.) [8J I. T. Todorov, Analytic properties of Feyn- man diagrams in quantum field theory, Per- gamon, 1971. (Original in Russian, 1966.) 147 (IX.13) Fiber Bundles A. General Remarks E. Stiefel [2J introduced certain tdiffeomor- phism invariants of tdifferentiable manifolds by considering a field of a finite number of linearly independent vectors attached to each point of a manifold; and H, Whitney [3J ob- tained the notion of fiber bundles as a com- pound idea of a manifold and such a field of tangent vectors. S. S. Chern [4J emphasized the global point of view in differential geo- metry by recognizing the relation between the notion oftconnections (due to E. Cart an) and the theory of fiber bundles. The theory of fiber bundles is also applied to various fields of mathematics, for example, the theory oft Lie groups, thomogeneous spaces, tcovering spaces, and general vector bundles, vector bundles of class cr, or analytic vector bundles. Homological properties of fiber bundles are studied by means of tspectral sequences, and cohomology structures of several homoge- neous spaces and several characteristic classes are determined explicitly by means of tcoho- mology operations. Also, the group K(X), formed by eq uivalence classes of vector bun- dles over a finite tCW complex X, is a tgener- alized cohomology group, treated in tK-theory, in which further development is expected (237 K-Theory). B. Definitions Let E, B, F be topological spaces, p: E --> B a continuous mapping, and G an teffective left topological ttransformation group of F. If there exist an topen covering {U.} (exE A) of B and a homeomorphism (jJ,: lJ, x Fp-I(U,) for each exEA having the following three prop- erties, then the system (E, P, B, F, G, U,' <p,) is 
569 call cd a coordinate bundle: (1) p<p,(b, y) = b (bEV"yEF). (2) Define <p',b:Fp-l(b) (bE V,) by <P"b(y) = <p,(b,y); then gp,(b)=<pp,O<P',bEG for bE V, n Vp' (3) gp,: V, n V p --> G is continu- ous. We say that this bundle is equivalent to a coordinate bundle (E, p, B, F, G, V, <p) if g,(b) = <p 0 <P"bE G (bE V, n V) and g,: V, n V --> G is continuous. An equivalence class  = (E, p, B, F, G) of coordinate bundles is called a fiber bundle (or G-bundle), and E is called the total space (or bundle space), p the projection, B the base space, F the fiber, and G the bundle group (or structure group). Also, V, of a coordinate bundle (E, p, B, F, G, V., <p,) belonging to the class  is called the coordinate neighborhood, <p, the coordinate function, and gp, the coordinate transformation (or transition function). Let =(E,p, B, F, G) and '=(E', p', B', F, G) be two fiber bundles with the same fiber and group. A continuous mapping '1': E --> E' is called a bundle mapping (bundle map) from  to ' if the following two conditions are satisfied: (1) There is a continuous mapping t(;: B-->B' with p' 0 'I' = 'I' 0 p. (2) t(;,(b) = <Pbl, 0'1' 0 <P"bE G (bE v,n 1/1-1 (V;), b' = 1/1 (b)), and 1/1w' Van 1/1-1 (V;)-->G is continuous, where {V., <p,} and {V;, <p} are pairs of coordinate neighbor- hoods and functions of  and ', respectively. Moreover, if 1/1 is a homeomorphism, then 'I' is also a homeomorphism and '1'-1 is a bundle mapping. Let =(E,p, B, F, G) and (=(E', p', B,F, G) be two fiber bundles with the same base space, fiber, and group. If there is a bundle map- ping '1': E --> E' such that 1/1: B --> B as described before is the identity mapping, then we say that  is equivalent (or isomorphic) to ( and write  0= '. Take the same coordinate neigh- borhoods {V,}, and let gp, and g'p, be the co- ordinate transformations of  and ', respec- tively. Then  0= ( if and only if there are continuous mappings A,: V,-->G with gp,(b) = )'p(b)gp,(b)A,(W I (bE V, n Vp). For a system {gp,} of coordinate transforma- tions of a fiber bundle, we have gyp(b)gp,(b) = gy,(b) (bE v,n V p n V y ). Conversely, given an open covering {V,} and a system of gp,: V, n V p -->G satisfying, this condition, there is a unique G-bundle (E,p, B, F, G) with {gp,} as a system of coordinate transformations. Actually, E is the tidentification space of E = {(b, y, ex) I bE V,} c B x F x ;\ obtained by identifying two points (b, y, ex), (b', y', [3) with b = b', y' = gp,(b)' y, and p is defined by p{ (b, y, ex)} = b, where the index set ;\ = { ex} is considered a discrete space. C. Principal Fiber Bundles A fiber bundle '1 = (P, q, B, G, G) is called a principal fiber bundle (or simply principal 147 E Fiber Bundles bundle) if G operates on G by left translations. This is also defined by the following con- ditions: G is a right ttopological transfor- mation group of P, and there exist an open covering {V} of B and homeomorphisms <p: V x Gq-I(V) with q<p(b,g)=b, <p(b,g)' g' = <p(b,gg') (bE V; g,g' EG). A bundle mapping '1': P--> P' between two principal bundles '1 = (P, q, B, G) and 1'1' = (P', q', B', G) is also defined as a continuous mapping 'I' with 'I' (x 'g)= '¥(x). g. D. Associated Fiber Bundles Let '1 = (P, q, B, G) be a principal bundle, and let F be a topological space having G as an effective left topological transformation group. Then G is a right topological transformation group of the product space P x F by (x, y). 9 = (x' g,g-I 'y) (xEP,yEF,gEG). Consider the torbit space P xGF =(P x F)jG, and define the continuous mapping p:P xGF -->B by p{(x,y)} =q(x). Then '1 xGF=(P xGF,p, B, F, G) is a fiber bundle, called the associated fiber bundle of '1 with fiber F. On the other hand, '1 is called an associated principal bundle of  = (E, p, B, F, G) if  = '1 x G F. A principal bundle '1 having the same coordinate transformations as  is an associated principal bundle of , and two fiber bundles are equivalent if and only if their associated principal bundles are equiva- lent. Therefore, given a fiber bundle , there exists a principal bundle '1 such that  = '1 x G F. E. Examples of Fiber Bundles (1) Product bundle, (B x F, PI> B, F, G), where PI' the projection of the product space, is called a product bundle if there is just one coordinate neighborhood B and the coordi- nate function is the identity mapping of B x F. A bundle that is equivalent to a product bun- dle is called a trivial bundle. (2) A tcovering (Y, p, Y) is a fiber bundle whose fiber is the discrete s-pace p-l(yO) (Yo E Y), and the structure group is a factor group of the tfundamental group nl(Y,yo)' In particular, a tregular covering is a principal bundle, (3) Hopf bundle, Let ;\ be the real number field R, the complex number field C, or the quaternion field H, }, = dimR;\, and ;\ n+1 the (n + 1 )-dimensionallinear space over;\, Iden- tify two points (zo, '" ,zn), (zo, '" , z) of the subspace ;\"+1 - {O} (0 is the origin) ifthere is a ZE;\ such that Zi = ziz (i = 0, ..., n). Then we obtain the identification space pn(;\), called the n-dimensional projective space over ;\. Let S (=S'«"+I) 1, the (),(n+ I)-I)-sphere) be the unit sphere in ;\"+1 Then S is the topological 
147 F Fiber Bundles transformation group of S by the product of ;\, and the orbit space S/sX is pn(;\). Fur- thermore, (S, q, pn(;\), SX) (q is the projection) is a principal bundle called the Hopf bundle (or Hopf fibering). These comments are valid also for n= 00. When n= 1, pl(;\) is homeomorphic to sA, and the Hopf bundle is (SH-l, q, sA, SA-I) ().= 1,2,4). A Hopfbundle is defined similarly forA = 8 using the tCayley algebra, and the projection q: SH-l-->S; (J. = 2, 4, 8) is the Hopf mapping (Hopf map). (4) Let G be a topological group, Hits closed subgroup, and r: G-->G/H, r(g)=gH the natural projection. If there exist a neighbor- hood V of r(H)E G/H and a continuous map- ping f: V --> G such that r 0 f is the identity mapping, then we say that H has a local cross section fin G, and (G/K, p, G/H, H/K, H/Ko) is a fiber bundle for any closed subgroup K of H (where p is the natural projection gK -->gH and Ko is the largest normal subgroup of H con- tained in K). The associated principal bundle of the latter fiber bundle is (G/Ko, p, G/H, H/Ko). Any closed subgroup H of a tLie group G has a local cross section in G; hence the above bundles can be obtained. F. Vector Bundles A system  = (E, p, B) of topological spaces E, B and a continuous mapping p: E -->B is called an n-dimensional real vector bundle if the foIl ow- ing two conditions are satisfied: (1) p-I(b) is a real vector space for each bE B. (2) There exist an open covering { V,} (ex E;\) of B and a co- ordinate function <p,: V, x F;::; P-l( V,) for each exE;\, where F=R n ; furthermore, the <P"b: R n ;::; p -1 (b) are isomorphisms of vector spaces. In this case, gp,(b) = <Pp 0 <P"b: Rn;::; R n ( bE V, n Vp) is an element of the tgenerallinear group GL(n, R). Hence a vector bundle is a fiber bundle with fiber R n and group GL(n, R), and the converse is also true. A I-dimensional vector bundle is caIled a line bundle. A vector bundle '= (E', p', B) is caIled a subbundle of a vector bundle  = (E, p, B) if E' c E, pIE' = p', and p'-I(b) is a vector subs pace of p-l(b) for each bE B. Let  1 and  2 be two vector bundles of dimension n 1 and n 2 with the same base space B. Let E be the union of the direct sum p1 (b) +pzl(b) for bEB, and define p:E-->B by p(pl(b)+Pzl(b))=b. Take the same coordi- nate neighborhoods V, for  1 and  2, and define <p,: V, x R n 1 +n,--> p-l (V,) by <p,(b, y) = (<P,b+ <P;,b)(Y) (yE R n 1 +n, = R n 1 + R n ,), where the <p: V, x R n ';::;pi- 1 (V,) are the coordi- nate functions of i' Then E is topologized by taking the family {<p,(0)} (0 is open in V, x R n 1 +n 2 ) as the topen base, and we obtain 570 an (n 1 + n 2 )-dimensional real vector bundle (E, p, B), denoted by  I EB  2 and called the Whitney sum of 1 and 2' Similarly, we can define the tensor product  1 (8)  2, the p-fold exterior power I\P  (or bundle (P) of p-vectors), and the bundle of homomorphisms Hom( ,, 2)' of dimension n 1 n2, (;), and n 1 n 2 , respectively, using the tensor product R n 1 (8) R n , = R n 1 n " the p-fold texterior power N Rn = R (;) (the space (Rn)(P) of p-vectors in Rn), and the space of homomorphisms Hom(R n 1,R n ')=R n 1 n ,. (For the last one, we use Hom«<p;.b)-I, <P;,b) instead of Hom(<p,b' <P;,b)') Hom(, £1) = * is caIled the dual (vector) bundle of , where £1 is the trivial line bundle. If we use coordinate transformations, EB, (8), I\P, and * are obtained by the direct sum, tKronecker product, matrix of tp-minors, and ttranspose of matrices, respectively. If 2 is a subbundle of  1> the quotient bundle  1 g 2 of dimension n l - n 2 is defined by using the quotient vector space R n '/R n '=R n 1- n " and 2 EB (1/2) is equivalent to  l' These oper- ations preserve the equivalence relation of bundles. Also, EB and (8) are commutative up to equivalence and satisfy the associative and distributive laws. For each  having a finite- dimensional tCW complex as base space, there is a ' such that  EB ' is trivial. Using the complex number field C or the quaternion field H instead of the real number field R, we can define similarly the complex vector bundle or the quaternion vector bundle and the operations EB, (8), etc, For a complex vector bundle , the complex conjugate bun- dle  is defined by the complex conjugate of matrices. (5) Tangent bundles, tensor bundles. Let M be an n-dimensional tdifferentiable manifold of class e. Consider the ttangent vector space Tp(M) at pE M, set T(M) = UPEM Tp(M), and define n: T(M)-->M by n(Tp(M))=p. For a tcoordinate neighborhood V p of p with local coordinate system (x I' ... , x n ), each point of n-l(V p ) is represented by L?=I/;a/aX;, and n -1 (V p ) has a coordinate system (x 1> ... ,X n ' fl' ... ,In)' Hence T(M) is a e- 1 -manifold, and 2(M)=(T(M), n, M, R n , GL(n, R)) is an n- dimensional real vector bundle. 2(M) is caIled the tangent (vector) bundle, its dual bundle 2*(M) the cotangent (vector) bundle, and the tensor product :!(M) (8) ,., (8) :!*(M) (8) .., a tensor bundle of M. The line bundle I\n:!*(M) is called the canonical bundle of M. For a tcomplex manifold M, T(M) is a complex manifold and :!(M) is a complex vector bundle. Therefore these bundles are defined as complex bundles. (6) Tangent r-frame bundle. In the preceding example, the space of all ttangent r-frames of M is a bundle space with base space M and 
571 group GL(n, R). It is called the tangent r-frame bundle (or the bundle of tangent r-frames) of M. G. The Classification Problem For a fiber bundle  =(E, p, B,F, G) and a continuous mapping if;: B' --> B, consider the subs pace E'={(x,b')EE x B'lp(x)=if;(b')} of E x B' and the projections p': E' --> B' and 'P: E' -->E. Then if; #  =(E',p', B',F, G) is a fiber bundle, and 'P is a bundle mapping from if; #  to ; if; #  is called the induced bundle of  by if;. Let {V,} and {gp,} be the systems of coordi- nate neighborhoods and transformations of. Then {if;-I(V,)} and {gp,oif;} are correspond- ing systems of if; # . If 'P: E' -->E is a bundle mapping from ' to  having if;: B' -->B as the mapping of base spaces, then '= if; # . Also, we have if;#1 =if;#2 if I =2; (if;oif;')#= if; , # (if; # (). If ( is a principal bundle, then if; #  is also principal, and if;#(ry xGF)=(if;#ry) xGF. For a tparacompact space B', if;!  = if;f  if if; I, if;2: B' --> Bare thomotopic. For a topological group G, a principal bundle (n, G)=(E(n, G),p, B(n, G), G) is called an n-universal bundle if E(n, G) is tn-connected (n.s; 00); its base space B(n, G) is called an n- classifying space of G. In particular, (oo, G) = G=(EG'p, B G , G) is called simply a universal bundle and BG a classifying space of G. Then we have the classification theorem: Let B be a CW complex with dim B.s; n; then the set of equivalence classes of principal G-bundles with base space B is in one-to-one correspondence with the thomotopy set n(B; B(n, G)) of con- tinuous mappings of B into B(n, G). Such a correspondence is given by associating with the induced bundle if; # (n, G) a continuous mapping if;: B-->B(n, G), called the character- istic mapping or classifying mapping (charac- teristic map or classifying map) of if; # (n, G). Furthermore, if G is an effective left topo- logical transformation group of F, the set of equivalence classes of G-bundles with base space B and fiber F is in one-to-one corre- spondence with n(B; B(n, G)). The correspon- dence is given by associating if;#((n, G) xGF) to if;. H. Construction of Universal Bundles For an arbitrary topological group G, J. W, Mi1nor [6] constructed a universal bundle (E G , p, B G , G) in the following manner. The join EG = Go... 0 Go.,. of countably infinite copies of G is defined as follows: A point e of EG is the symbol t i1 g" E9... E9 timg im (1.s;i l < i 2 <... <im,m= 1,2,3,...), where t i1 ,..., t im are real numbers satisfying t i ,? 0, t i , +... + t im = 1, 147 I Fiber Bundles and gi k is an element of the i k th copy of G, k = 1, "', m, Here we can omit tikg ik if t" = O. Regard EG as a topological space with a weak topology such that the coordinate functions ef->t i , ef->gi are continuous, Define the right action of G on EG by (t i ,gi 1 EB... EB timgiJ'g= t i1 (gi 1 ' g)E9... E9 tiJgi m ' g). Let BG and p:E G --> BG be the identification space of EG by this action of G and the identification mapping. Then (E G , p, BG' G) is a universal bundle for G, and BG is called the classifying space for G. EG and BG are sometimes written as EG and BG. In particular, a classifying space BG is a count- able CW complex for any countable CW group G (i.e., a topological group that is a countable CW complex such that the mapping g-->g-I of G into G and the product mapping G x G-->G are both cellular). The following examples of classifying spaces for Lie groups are also useful. Note that every CW complex BG of a given G has the same thomotopy type. I. Examples of Universal Bundles (1) G is either O(n), V(n), or Sp(n): Let A and A be as in (3) of Section E, According as A is R, C, or H, we let V(n, A) be the torthogonal group O(n), the tunitary group V(n), or the tsymplectic group Sp(n). Then the tStiefel manifold Vm+n,m(A)= V(m+n, A)/Im x V(n, A) (Im is the unit element of V(m, A)) is ()"(n + 1) - 2)-connected, Hence the principal bun- dle (A(n+ 1) -2, V(m+n, A)) = (Vm+n,m(A), Mm+n,m(A), VIm, A)) from (4) of Section E is a (A(n + 1)- 2)-universal bundle of V(m, A), where the base space Mm+n,m(A)= V(m+n,A)/ V(m, A) x V(n, A) is the tGrassmann manifold. (2) G is either 0(00), V(oo), or Sp(oo), The examples in (1) are valid for m, n = 00. Con- sider the tinductive Jimft group V( 00, A)= Un V(n, A) under the natural inclusion V(n, A) c V(n + 1, A), and supply the infinite classical group V( 00, A) with the weak topol- ogy (this means that a set 0 of V(oo, A) is open if and only if each 0 n V (n, A) is open in V(n, A)). Then the infinite Stiefel manifold Vm+n.m(A) and the infinite Grassmann manifold Mm+n.m(A) (m= 00 or n= 00) are defined as before, and we have M" ,m(A) = U Mm+n,m(A), and so on. Furthermore, these manifolds are CW complexes, and Voo,m(A) (m.s; 00) is 00- connected. Although V(oo,A) is not a Lie group, V(m, A) x V(n, A) has a local cross section in V (m + n, A) for m, n.s; 00 [7]. There- fore, setting n= 00 in (1), (oo, V(m, A)) is a universal bundle of VIm, A), and the infinite Grassmann manifold Moo.m(A) is a classifying 
147 J Fiber Bundles space BU(m,A)' Also, ()"(n+ 1)- 2, V(oo, A)) in (I) is a (A(n+ 1)-2)-universal bundle of V(oo,A) (n.s; 00). (3) G is either SO(m) or a general Lie group. For the trotation group SO(m), we have (n- 1, SO(n))=(Vm+n,m(R), p, Mm+n,m' SO(m)) and BSO(m) = Moo,m' where Mm+n,m = SO(m+n)/ SO(m) x SO(n) is the oriented Grassmann manifold. For any compact Lie group G, we have (n-1, G) = (Vm+n,m(R),p,O(m+n)/G x O(n), G), where G c O(m). For any connected Lie group G, we have (n,G)=(n,Gd x G1 G, where G l is the maximum compact subgroup of G (since G/G I is homeomorphic to a Eu- clidean space, (n, G) reduces to (n, Gd). J. Reduction of Fiber Bundles Let G be a topological group and H its closed subgroup. We say that the structure group of a G-bundle  is reduced to H if  is equivalent to a G-bundle whose coordinate transformations take values in H. For a principal H-bundle /]0 =(P,q, B, H), the associated H-bundle /]0 x H G = (P X H G, p, B, G) with fiber G is defined, where H operates on G by the product of G; it is also a principal G-bundle if we define an operation of G on P xHG by {(x,g)}' g' == {(x, gg')}. For a principal G-bundle /], we say that /] is reducible to an H -bundle if there is a principal H-bundle /]0 with /]=/]o xHG, and we call/]0 a reduced bundle of /], It is easy to see that the group of a G-bundle  is reducible to H if and only if the associated principal G- bundle of  is reducible to H. Also, if /]0 is a reduced bundle of/], then if;#/]o is a reduced bundle of if;#/]. Now, assume that H has a local cross sec- tion in G and G/H is oo-connected. Then for an n-universal bundle (n, H) of H, (n, H) x H G is an n-universal bundle of G (n- connectedness of E(n, H) xHG is shown by the thomotopy exact sequence oftfiber spaces). Therefore, by the classification theorem, the group of any G-bundle is reducible to H, and the equivalence classes of G-bundles are in one-to-one correspondence with those of H- bundles. (I) A G-bundle is trivial if and only if its group is reducible to e (identity element). A 2n- dimensional differentiable manifold M of class Ccc has an tal most complex structure if and only if the group of the tangent bundle :!(M) is reducible to GL(n, q, i.e., :!(M) is considered as an n-dimensional complex vector bundle. (2) Since GL(n,R)O(n) x R n (n+I)/2 and GL(n, C) U(n) x R n2 , n-dimensional real (com- plex) vector bundles can be considered as O(n) (V (n))-bundles with fiber Rn(C"). 572 K. Homotopy and Homology Theory of Bundles Since a fiber bundle is a tlocally trivial fiber space, the exact sequence and the spectral sequence of fiber spaces ( 148 Fiber Spaces) are applicable to fiber bundles. For example, the cohomology structures of homogeneous spaces of classical groups have been deter- mined by A. Borel, J.-P. Serre, and others. (1) Characteristic class. For a classifying space BG of a topological group G, we have an isomorphism nn(B G );;; nn-dG) of homotopy groups and the following classification theo- rem of fiber bundles over the n-sphere sn. The set of the equivalence classes of principal G- bundles or G-bundles with fiber F over the base space sn is in one-to-one correspondence with the set nn-I (G)/no(G) of equivalence classes under the operation of G on nn-I (G) given by the inner automorphisms of G; such a correspondence is given by associating with each principal G-bundle /] = (P, q, sn, G) the class (called the characteristic class of /]) con- taining the image (ln) of a generator lnEnn(sn) by the homomorphism : nn(sn);;; nn(P, G) -->nn-I (G). Take VI and V 2 (the open sets of sn such that the last coordinates t n + 1 are> -1/2 and < 1/2, respectively) as coordinate neighborhoods of/]. Then the restriction T= 9 121 sn-l represents the characteristic class of/], where g12: VI n V 2 -->G is the coordinate transformation and sn-I is the equator of sn. (2) For the principal bundle /] = (SO(n + 1), q, sn, SO(n)), the mapping T: sn-I-->SO(n) is given by ( In-l T(t l , ...,t n )=(In- 2 (t;fj)) 0 -) (In is the unit matrix of degree n). Hence the tmapping degree of the composite q' 0 T: sn-l -->sn-I (of T and the natural projection q': SO(n)-->sn-l) is equal to 0 if n is odd and 2 if n is even. From this fact and the homotopy exact sequence, we have nn(Vm+n,m(R)) = {2 = Z/2Z if m = 1 or n is even if m > 1 and n is odd for the real Stiefel manifold Vm+n,m(R), which is (n - 1 )-connected. (3) Sphere bundles. An O(n + 1 )-bundle with fiber sn is called an n-sphere bundle. The set of equivalence classes of n-sphere bundles with base space sm is in one-to-one correspondence with nm_I(O(n+ 1))/no(O(n+ 1)). For example, any I-sphere bundle over sm (m;::' 3) and any n-sphere bundle over 53 is trivial. Every 3- sphere bundle over S4 is equivalent to one of gm,nlm an integer, n a positive integer}, where 
573 m,n is defined as follows: Let p, (J:S3-->0(4) be defined by p(q)q' = qq' q-l, (J(q)q' = qq' (q, q' are tquaternions of norm 1). Then these mappings represent generators of n3(0(4))n3(S3 x S3)Z+Z, and m,n is the 3-sphere bundle over S4 corresponding to the elemen t m {p} + n{ (J} En 3 (0(4)) (i.e., to the mapping fm.,,: S3--> 0(4) defined by fm,n(q)(q') = qm+nq' q-m). (Here we use the fact that the operation of the ele- ment rEO(4) (r(q)=q-l) is given by rpr- 1 =p, r(Jr- 1 = p(J-I.) L. Cross Sections For a fiber bundle =(E,p,B,F,G), a cross section f: Bo-->E over a subspace Bo (c B) is a continuous mapping such that po f is the identity mapping of Bo; a cross section over B is called a cross section of . A bundle  is trivial if and only if the associated principal bundle of  has a cross section. More gener- ally, given a principal bundle 1] = (P, q, B, G) and a closed subgroup H having a local cross section in G, 1] is reducible to H if and only if the associated bundle 1]( xG(GjH) = (PjH,q',B), GjH) with fiber GjH has a cross section. Suppose that the base space B of the fiber bundle  = (E, P, B, F, G) is a tpolyhedron. We denote the tr-skeleton of B by B r and consider the problem of extending cross sections f,: B r -->E successively for r = 0, 1, .... Clearly, there is a cross section fo. For each r-simplex (J of B, we have (p -1 (0'), p, (J, F) = «(J x F, PI' 0', F) since (J is tcontractible. Hence there is a bundle mapping <Pa:(J X F;:::;;p-I«(J) with p0<pa=PI' Assume the existence of a cross section /..-1: Br-1-->E, and consider the mapping ha = P2 0 <p;;1 0 (/..-1 10'): 0'--> F (P2 : (J x F --> F is the pro- jection and 0' is the boundary of (J). Then if ha is extensible to ha:(J-->F, an extended cross section fa: (J-->E of /..-110' is defined by fa(b) = <Pa(b, ha(b)) (bEO'), and the extension /..: B r --> E of /..-1 is defined by /..1 (J = fa, fr 1 B r - 1 = fr-l' If n r - I (F) = 0, for example, there is an extension ha of ha since «(J, 0') ;:::;;(V r , sr-l), and /..-1 is extensible to a cross section /... Now assume that the base space B of a G- bundle  ==(E,p, B, F, G) is an tarcwise con- nected polyhedron and F is t(n - 1 )-connected. Then there is a cross section f: Bn --> E con- structed by the stepwise method of the previ- ous paragraph. But if nn(F) #0, we have an obstruction to extending f over B n + l . Now we explain how to measure this obstruction. Sup- pose that F is tn-simple. Then for each (n+ 1)- simplex (J of B, the mapping ha:O'-->F, defined by f as in the previous paragraph, determines a unique element c(/)(O') of the homotopy group nn(F). Hence we have a tcochain C(/)E C"+1 (B; nn(F)), and f is extensible to a cross 147M Fiber Bundles section over 8 n + 1 if and only if c(/) = O. Thus there is a cross section over B n +! if and only if the set of c(/) for every cross section f over B n contains the cochain 0; {c(/)} is considered as a measure of the obstruction. Let w: 1-->8 be a tpath. We consider the space I x F and the canonical projection PI : I x F -->1. Then there is a bundle mapping Q: I x F -->E with poQ=wo PI> since w#  == (I x F, PI ,I ,F); and a homeomorphism w # : F;:::;; F is defined by w # = <Po.1o 0 Qo 0 Qi l 0 <P1.b 1 , where b,=w(e) (e=O, 1), <p,: U, x F;:::;;p- 1 (U,) is a coordinate function of a coordinate neigh- borhood U,3b" and Q,( = QI e x F): F;:::;; p-I(b,). The homeomorphism w# induces an isomor- phism w#: nn(F) nn(F), and nn(F) forms a tlocal coefficient on B. Then the cochain c(/) is a tcocycle with the local coefficient nn(F), called the obstruction cocycle of f Further- more, the set {cU)} for every cross section f: Bn-->E is a cohomology class cn+I()E W+ 1 (B; nn(F)) (local coefficient), and Cn+l() is called the primary obstruction to the con- struction of a cross section. There is a cross section over the (n + 1 )-skeleton Bn+l if and only if c n + 1 () = O. The local coefficient nn(F) is trivial if B is tsimply connected or, for example, if the structure group G of  is con- nected (in which case  is called an orientable fiber bundle); when this is true, Cn+I() is an element of Hn+1 (B; nn(F)), where nn(F) is not a local coefficient. Furthermore, if c n + 1 () = 0 and ni(F)=O (n< i <m), then the secondary obstruction om+l() C Hm+l (B; nm(F)) is de- fined similarly ( 305 Obstructions). M. Stiefel-Whitney Classes Let  = (E, p, B, F, O(n)) be an O(n)-bundle over an arcwise connected polyhedron B and let o be its associated principal bundle. Con- sider the Stiefel manifold v",n-k == v". n-dR) = O(n)jl n _ k x O(k), which is (k-l)-connected, and the associated bundle k = o X O(n) v".n-k with fiber v",n-k' The primary obstruction Itk+l () = Ck+1 (k)E Hk+1 (B; nk(v",n-k)) (k = 0, 1, .. . , n - 1) is called the Stiefel-Whitney class of. We have 2Wk+l()==0 unless k=n- 1 and k is odd. Hence we usually consider ltk+d)E Hk+l (B; Z2)'  is orientable, i.e" the group of  is reducible to SO(n), if and only if WI () = O. The Stiefel- Whitney classes of an n-dimensional tdifferentiable manifold M are defined to be those of the tangent bundle 2(M). Since the orientability of M coincides with that of 2(M), M is orientable if and only if WI (M) = O. The condition Itk+l (M) = 0 is necessary for the existence of a continuous field of orthonormal tangent (n - k)-frames over M (if k == n - 1 this condition is also suffi- 
147N Fiber Bundles cient). Also, when M is closed, w,.(M) is equal to X(M)J1, where /l is the fundamental coho- mology class of M and X(M) is the tEuler characteristic of M ( 56 Characteristic Classes B). N. Chern Classes For a U(n)- bundle  =(E,p, B, F, U(n)), the primary obstruction Ck+I(O=C2k+2(k)E H2k+2(B; Z) (k = 0,1, "', n - I) of the associ- ated bundle e = o X U(n) v".n-dC) is called the Chern class of r If we consider  as an O(2n)- bundle by U(n)cO(2n), then W2k+d)=0 and It2d)= Cd) (mod 2). The Chern classes of a real 2n-dimensional almost complex manifold are defined to be those of the tangent bundle :!(M) ( 56 Characteristic Classes C). O. Bundles of Class C', Analytic Bundles A fiber bundle  = (E, p, B, F, G) is called a fiber bundle of class C' (r=O, I,... oo,w) if E, B, F are differentiable manifolds of class C', G is a Lie group and a transformation group of F of class C r , and p and the coordinate functions are differentiable mappings of class cr. Bun- dles of class Co are usual G-bundles, and those of class C'" are real analytic fiber bundles. Similarly, complex analytic fiber bundles are defined by the notions of complex manifolds, tcomplex Lie groups, and tholomorphic map- pings. For example, the universal bundles (n -1, O(m)) and ry(2n, U(m)) are real and com- plex analytic principal bundles, respectively, and the tangent bundle :!(M) of a C'+1 (or complex) manifold is a C' (or complex ana- lytic) vector bundle. The operations of the Whitney sum, etc., are defined analogously for these vector bundles. The equivalence of C' (complex analytic) bundles is defined by means of bundle map- pings that are C'-differentiable (holomorphic). Bundles of class C' (r.s; 00) are classified by C' mappings into a classifying space in the same manner as for bundles of class Co. Also, the connection of class C' ( 80 Connections) in C' bundles is an important notion. For complex analytic bundles, a similar classification has been obtained for restricted spaces by K. Kodaira, Serre, S, Nakano [8J, and others. The classification of complex ana- lytic bundles over a tStein manifold is reduced to that of bundles of class Co (Oka's principle [9J), and similar results are valid for C"'- manifolds. The complex analytic (or holo- morphic) connection does not necessarily exist, and M, F. Atiyah [10J found the condition for its existence and its relation with Chern classes. 574 P. Microbundles A system x:BEB of topological spaces E, B and continuous mappings i.j is called an n- dimensional microbundle over B if for each bE B, there exist a neighborhood U of b, a neighborhood V of i( U), and a homeomor- phism h: V U x Rn with hoil U =i 1 ,j1 V =Pl 0 h (il :UU xOcU xRn, andpl:U x Rn-->u is the projection). Let Ho(n) be the topological group of all homeomorphisms of Rn onto itself fixing the origin with compact-open topology. Then the equivalence classes of n-dimensional microbundles over B are naturally in one-to- one correspondence with the equivalence classes of Ho(n)-bundes with base space Band fiber R n [13]. In the category of polyhedra and PL (tpiece- wise linear) mappings, the notion of a PL microbundle can be defined in the same man- ner. The structural group of n-dimensional PL microbundles is defined, in a generalized sense, as a complete tsemi-simplicial complex [11]. The tangent PL microbundle is defined for any ttopological (PL) manifold. J. Milnor classified tsmoothings of PL manifolds by means of PL microbundles [11] and then showed that the ttangent vector bundles and its tPontryagin classes of smooth manifolds are not topologi- cally invariant [12]. For a tPL embedding f: M -->N between PL manifolds, if there is a neighborhood E of f(M) in N and a PL mappingp:E-->M so that a diagram v: M !..E!..M is a PL microbundle, then v is called a normal PL microbundle of.f In this case, f is tlocally flat. There is a locally flat PL embedding be- tween PL manifolds which admits no normal PL microbundle [14]. Q. Block Bundles As the normal bundle theory for locally flat PL embeddings of PL manifolds, the concept of block bundle was introduced independently by C. P. Rourke and B. J. Sanderson [15J, M. Kato [16J, and C. Morlet [17]. Let E be a tpolyhedron, and let K be a cell complex. A set {Ea I (JE K} of PL balls in E indexed on K is called a q-block structure of E if the following three conditions are satisfied: (1) UaEK Ea = E; (2) for each (JE K, there is a tPL homeomor- phism ha: (J x Iq --> Ea such that ha(r x Iq) = E, for each face r of (J, where 1= [ -1, IJ; and (3) if EanEp#0, then EanEp=E" where r=(Jnp. For (JE K, Ea is called the block over (J, and ha in (2) is called a trivialization of Ea. Then a triple (E, K, {Ea I (J E= K} ) is referred to as a q- block bundle over K and is denoted by / K. Another block bundle '/K =(E', K, {EI (JE 
575 K} ) over the same complex K is said to be isomorphic with /K if there is a PL homeo- morphism g: E --> E', called an isomorphism, such that g(Ea)=E «(JEK). A PL embedding i: 1 K 1--> E is a zero-section of / K iffor each (JEK, there is a trivialization ha: (J x F-->Ea such that ha(x, 0) = i(x) (x E (J). In this case we say that /K is a block bundle with a zero- section i: IKI-->E. There is a unique zero- section of /K up to isomorphism of (/K onto itself. For every tlocalIy flat PL embedding between PL manifolds M and W of codimen- sion q and for any ceIl division K of M, a tderived neighborhood N of f(M) in Wadmits a unique q-block bundle v/K =(N, K, {Nal (JE K}) with f: M --> N as a zero-section up to isomorphism respecting the zero-section [15]. The block bundle v/K is called a normal block bundle of f : M --> W. References [IJ N. E. Steenrod, The topology of fiber bundles, Princeton Univ. Press, 1951. [2] E. Stiefel, Richtungsfelder und Fernparal- lelismus in n-dimensionalen Mannigfaltig- keiten, Comment. Math, Helv., 8 (1936), 3-51. [3] H. Whitney, Topological properties of differentiable manifolds, Bull. Amer. Math. Soc., 43 (1937), 785-805, [4J S. S. Chern, Some new view-points in differential geometry in the large, Bull. Amer. Math. Soc., 52 (1946),1-30. [5J F. Hirzebruch, Topological methods in algebraic geometry, Springer, third edition, 1966. [6J J. W. Milnor, Construction of universal bundles II, Ann. Math., (2) 63 (1956), 430-436. [7J J. C. Moore, Espaces classfiants, Sem. H. Cartan, 12, no. 5 (1959-1960), Ecole Norm. Sup., Secretariat Math6matique, [8] S. Nakano, On complex analytic vector bundles, J. Math. Soc. Japan, 7 (1955), 1-12. [9J H. Grauert, Analytische Faserungen iiber holomorph-volIstiindigen Riiumen, Math. Ann., 135 (1958), 263-273. [1 OJ M. F. Atiyah, Complex analytic connec- tions in fiber bundles, Trans. Amer. Math. Soc., 85 (1957),181-207, [IIJ J. W. Milnor, Microbundles and differ- entiable structures, Lecture notes, Princeton Univ.,1961. [12J J. W. Milnor, Microbundles I, Topology, 3 (1964), Suppl. 1,53-80. [13J J. Kister, Microbundles are fiber bundles, Bull. Amer. Math. Soc., 69 (1963), 854-857. [14J C. P. Rourke and B. J, Sanderson, An imbedding without a normal bundle, Inven- tiones Math., 3 (1967), 293-299. 148 B Fiber Spaces [15J C. P. Rourke and B. J. Sanderson, Block- bundles I-III, Ann. Math., (2) 87 (1968). [16J M. Kato, Combinatorial prebundles I, II, Osaka J. Math., 4 (1967). [17J C. Morlet, Les methodes de la topologie differentielIe dans l'etude des varietes semi- lineairs, Ann. Sci. Ecole Norm. Sup., 1 (1968), 313-394. [18J D. HusemolIer, Fiber bundles, McGraw- HilI, 1966. 148 (IX.1 0) Fiber Spaces A. General Remarks J.- P. Serre [1 J generalized the concept of fiber bundles to that of fiber spaces by utilizing the covering homotopy property ( Section B). He applied the theory of tspectral sequences, due to J. Leray, to the (cubic) tsingular (co)- homology groups of fiber spaces. These are quite useful for determining (co)homology structures and homotopy groups of topo- logical spaces, and are nOw of fundamental importance in algebraic topology. B. Definitions Let p:E-->B be a continuous mapping between topological spaces, and let X be a topological space. Then we say that p has the covering homotopy property with respect to X if for any mapping f: X --> E and thomotopy g,: X --> B (O.s; t.s; 1) with po f = go, there is a homotopy f,:X-->E (O.s;t.s;l) with fo=f and pof,=g,. We calI (E, p, B) a fiber space (or fibration) if p has the covering homotopy property with respect to every cube In={(x 1 , ...,xn)I0.s;Xi.s; I}, n = 0, 1, ... (then p has the covering homo- topy property with respect to every tCW complex). Then E is calIed the total space, p the projection, B the base space, and Fb = p-l(b) the fiber over bEB. Let E, B, F be topological spaces and p: E--> B a continuous mapping. We calI (E,p, B, F) a locally trivial fiber space if for each bE B, there exist an open neighborhood U of band a homeomorphism (jJ: U x F --> p -I (U) with po <p(b', y) = b' (b' E U, YE F). In this case, p has the covering homotopy property with respect to each tparacompact space; hence (E, p, B) is a fiber space. A tfiber bundle is clearly a localIy trivial fiber space. 
148 C Fiber Spaces C. Path Spaces Another important example of a fiber space is a path space. A path in a topological space X is a continuous mapping w: I --> X (I = [0, IJ). Given subsets Ao and AI of X, the path space Q(X; Ao, Ad is the space of all paths w: I -->X satisfying w(e)E Ae (e= 0,1) topologized by tcompact-open topology. De- fine Pe:Q(X; Ao, Ad-->Ae by Pe(w) = w(e) (e= 0,1). Then (Q(X; Ao, AI)' Pe, Ae) is a fiber space; in fact, Pe has the covering homotopy property with respect to every topological space. In particular, the total space of the fiber space (Q(X; X, *),Po,X) (* EX) is tcontractible, and the fiber Po I ( * ) = Q( X; *, *) = QX over * is the t\oop space of X with base point *. For a continuous mapping f: Y -->X, consider the space Ef={(Y, W)E Y x Q(X;X,X)lf(y)= w(O)} and the continuous mapping p: E f-->X de- fined by p(y, w) = w(I). Then Y c E f' and Y is a tdeformation retract of E f; furthermore, (E f' p, X) is a fiber space with f = plY. D. Homotopy Groups of Fiber Spaces For thomotopy groups of fiber spaces, the Hurewicz-Steenrod isomorphism theorem holds: Let (E, P, B) be a fiber space and F = P -I ( * ) the fiber over the base point * E B. Then P*: nn(E, F)-->nn(B) is an isomorphism for n? 2 and a bijection for n = 1. By this theorem, we have the homotopy exact sequence of a fiber space: (  i* P ... -->nn+1 B)-->nn(F)--> nn(E):.:'> nn( B)-->". . Furthermore, the more general exact sequence i P* . ..-->n(Z;QB)o-->n(Z; F)o-4n(Z; E)o-->n(Z; B)o is valid for each CW complex Z, where n(Z; X)o denotes the thomotopy set of map- pings from Z to X relative to the base point. Example (1). A cross section of a fiber space (E,p, B) is a continuous mapping f: B-->E with po f = 1. If (E, p, B) has a tcross section or the fiber F is a tretract of E, then nn(E)nn(B)+ nn(F), If F is contractible in E, then nn(B) nn(E) + nn-I (F) (n? 2). Example (2). (E,p,B) is called an n- connective fiber space if B is tarcwise connec- ted, E is tn-connected, and p*: n;(E)-->n;(B) is an isomorphism, for i> n. For each arcwise connected space B and integer n, there is such a fiber space. Example (3). For a CW complex X, there are topological spaces X n and continuous map- pings fn: X -->X n , qn+l: Xn+1 -->X n (n = 0,1, ...) with the following four properties: (i) X n (O.s; n.s; m) is a point if X is m-connected; (ii) fn': n;(X)  n;(Xn) (i.s; n); (iii) (X n , qn, Xn-d is a fiber 576 space, and its fiber is an tEilenberg-MacLane space K(nn(X),n); (iv) qnofn is thomotopic to fn-I' Such a system {XnJn, qn} is called the Postnikov system of X and is in a sense con- sidered a decomposition of X into Eilenberg- MacLane spaces. E. Spectral Sequences of Fiber Spaces The cohomological properties of fiber spaces are obtained mainly from the following results (which are valid similarly for homology except for properties of products). Assume that the base space B of a given fiber space (E, p, B) is tsimply connected and the fiber F = P -I ( * ) is arcwise connected, and let R be a commuta- tive ring with unit. Then the spectral sequence (of singular cohomology) of the fiber space (E, p, B) (with coefficients in R) is defined to be a sequence {E" d r } satisfying the following properties: (i) Er=Lp,qEf,q are bigraded R-modules, and d r = Lp,qd;,q are R-linear differentials such that dr(Er q ) c Ef+r,q-r+,. (ii) Ef+1 = Kerd;,q/Imdf-r,q+r-I, which means that Er+' = H(E r ). (iii) E;,q =0 for p<O or q<O, Ef,q =E;+1 =." =E;;} for r>max(p,q+ 1). (iv) Er has a product for which E;,q. E;,-q' c Ef+P',q+q' and dr(u' v) = (dru)' v +( -l)P+qu' drv (UE Ef,q). Furthermore, the induced prod- uct in H(Er) coincides with the product in Er+" (v) Eoo is the tbigraded module associated with some filtration of the cohomology module H*(E; R); that is, H"(E; R) = Do,n =:> Dl,n-I =:>,,, =:>Dn,O=:>Dn+I,-1 =0 and Eq=DP,q/Dp+l,q-l. Furthermore, the tcup product '-'" in H*(E; R) satisfies DP,q '-'" DP',q' C Dp+p',q+q' and coincides with the given product in Eoo. (vi) E,q = HP(B; W(F; R)), and the product in E 2 coincides with the cup product in H*(B; H*(F; R)). (vii) The composition of H"(B; R)= E'o -->E 3 '0 -->...-->E,+°l =E':x;°=Dn.OcHn(E;R) is equal to p*, and the composition of Hn(E; R)=Do,n--> E':x;° = E+n2 c". c E,n c Eg,n = H"(F; R) is equal to i* (i: FeE), where each --> is the projection onto the quotient module. In the sequence H"-I (F; R)'!".Hn(E, F; R)t H"(B; R), we have 8*-1 (Imp*)= E.n-', Coimp* = E'o, and dn:E,n-I-->E'o is equal to the trans- gression r* = p*-I 08*: 8*-1 (Imp*)-->Coim p*. Each element of 8*-I(Imp*) is called transgressive. In the folIowing examples, we aSsume that R is a principal ideal ring. Example (4). Let k be a commutative 
577 field, and assume that dimkH*(B; k) and dim k H *(F; k) are finite. Then for the tPoincare polynomial Px(t)=Lnbntn, bn=dimkHn(X;k), we have PE(t) = PB(t)PF(t) - (1 +t)<p(t), where <p(t) is a polynomial with nonnegative coeffi- cients (Leray), In particular, for the tEuler characteristic X(X)=P x ( -1), we have X(E) = X(B)x(F). Also, iU* :Hn(F; k)-->Hn(E; k) is monomorphic for each n > 0, then PE(t) = PB(t)PF(t). Example (5). Isomorphism theorem: If Hn(B; R)=O (O<n<r) and Hn(F; R)=O (O<n <s), then p*:Hn(E,F;R)-->Hn(B;R) is isomor- phic for 0< n<r+s and epimorphic for n =r+ s, and we have the following homology exact sequence: i. p. '" --> Hn(F; R)--> Hn(E; R)-->Hn(B; R) HndF; R)-->..., n<r+s (similarly for the cohomology). Example (6). Assume that Hn(F; R)  H"(sr; R) (sr is the r-sphere, r> 1), Let if = M p be the tmapping cylinder of p: E --> Band j3: if --> B be the con tin uous mapping defined by p. Then the Thom-Gysin isomorphism g: H"-'-I(B; R)H"(if, E; R) (n>O) with g(ex) = j3* (ex) "-' g(l) holds ( 114 Differential Top- ology G). Also, we have the Gysin exact sequence: p' '" -->Hn(B; R)--> H"(E; R)-->H"-'(B; R) !.Hn+I(B; R)-->.. where 9 satisfies g(ex) = ex "-' 0 = 0 "-' ex (0 = g(l) E W+ 1 (B; R)). Here 0 is equal to the image of a generator of H' (F; R) = R by the transgression r*, and 20 = 0 if r is even. (These results hold also for r = 0 and R = Z2 ,) Example (7). Assume that Hn(B; R)  H"(S'; R) (r>2). Then we have H"-r(F; R) Hn(E, F; R) (n > 0) and the Wang exact sequence: ., .. ,-->H"(E; R):""'H"(F; R) -'!.. H"-r+l (F; R)--> H"+I (E; R)-->"., where () satisfies (}(ex  fJ) = (}(ex) "-' [3 + (-I)"(r-l)ex"-'(}([3) (ex, [3EH"(F; R)). Example (8). For a field k of odd character- istic, if H"(E; k) = 0 (n > 0) and the algebra H*(F; k) is generated by a finite number of elements of odd degree, then H*(F; k) Ak(X 1, ..., X,) (the texterior algebra) and H*(B; k) k[y"... ,y,J, where Yi=r*(x;) (A. Borel). References [IJ J.-P, Serre, Homologie singuliere des espaces fibres, Ann. Math., (2) 54 (1951), 425- 505. 149 A Fields [2J S. T, Hu, Homotopy theory, Academic Press, 1959. [3J A. Borel, Sur la cohomologie des espaces fibres principaux et des espaces homogenes de groupes de Lie compacts, Ann, Math., (2) 57 (1953), 115-207. [4J E. H. Spanier, Algebraic topology, McGraw-Hin, 1966. [5J G. W, Whitehead, Elements of homotopy theory, Springer, 1978. 149 (111.6) Fields A. Definition A set K having at least two elements is caned a field if two operations, called taddition ( + ) and tmultiplication (.), are defined in K and satisfy the following three axioms. (1) For any two elements a, b of K, the sum a+b is defined; the associative law (a+b)+ c=a+(b+c) and the commutative law a+b= b + a hold; and there exists for arbitrary a, b a unique element X such that a + X = b; that is, K is an t Abelian group with respect to the ad- dition (the tidentity element of this group is denoted by 0 and is caned the zero element of K). (2) For any two elements a, b of K, the prod- uct ab (= a' b) is defined; the associative law (ab)c = a(bc) and the commutative law ab = ba hold; and there exists for arbitrary a, b with a#O a unique element x such that ax=b, that is, the set K* of all nonzero elements of K is an Abelian group with respect to multiplica- tion. K* is caned the multipli1:ative group of K, while the identity element of K* is denoted by 1 and is caned the unity element (unit ele- ment or identity element) of K. (3) The distributive law a(b + c) = ab + ac holds. In other words, a field is a tcummuta- tive ring whose nonzero elements form a group with respect to multiplication. A noncommutative ring whose nonzero elements form a group is caned a noncom- mutative field (skew field or s-field). It should be noted that sometimes a field is defined as a ring whose nonzero elements form a group without assuming the commutativity of that group, and in this case our "field" defined before is caJled a commutative field. (The term "skew field" is sometimes used to mean either a commutative or a noncommutative field.) In this article we limit ourselves to commutative fields (for noncommutative fields  29 As- sociative Algebras). 
149 B Fields B. General Properties Since a field K is a commutative ring, we have properties such as aO = Oa = 0, ( - a)b = a( - b) = - ab for elements a, bin K. If a tsubring k of K is a field, we say that k is a subfield of K or K is an overfield (extension field or simply extension) of k. If a field K has no subfie1d other than K, K is called a prime field. A map f of a field K into another field K' is called a (field) homomorphism if it is a ring homomorphism, i.e., if it satisfies f( a + b) = f(a) + f(b),f(ab) = f(a)f(b), Since a field is tsimp1e as a ring, every (field) homomorphism is an injection unless it maps everything to zero. A homomorphism of K into K' is called an isomorphism if it is a bijection, and K and K' are called isomorphic if there exists an isomorphism of K onto K'. An isomorphism of K onto itself is called an automorphism of K. If there is a natural number n such that the n sum nl =  of the unity element 1 is 0, then the minimum of such n is a prime number p, called the characteristic of K. On the other hand, if there is no natural number n such that nl = 0, we say that the characteristic of K is 0, C. Examples of Fields The rational number field Q consisting of all rational numbers, the real number field R consisting of all real numbers, and the complex number field C consisting of all complex num- bers, are all fields of characteristic O. A subfield of the complex number field C is called a num- ber field. The rational number field is a prime field, and every prime field of characteristic 0 is isomorphic to the rational number field, For the ring Z of all rational integers, the residue class ring modulo a prime number p is a field Z/pZ= {O, 1,2,... ,p-1 (modp)} of character- istic p, called the residue class field for p. Thus Z/pZ is a prime field, and every prime field of characteristic p is isomorphic to Z/pZ. If the number of the elements of a field K is finite, K is called a finite field. Z/pZ is an example of a finite field. D. Extensions of a Field In order to express that K is an extension field of k, we often use the notation K/k. Subfields of K containing k are called intermediate fields of K/k. Consider two extensions Kdk, and K 2/ k 2 , and let <p: K , --> K 2 be an isomorphism which induces an isomorphism if;: k, -->k 2 . Then we call (jJ an extension of if;, Suppose that k" K 2 are given fields and K 2 contains a subfie1d k 2 isomorphic to k,. Then there exist 578 an extension field K, of k, and an isomor- phism <p : K 1--> K 2 which is an extension of the given isomorphism if;: k, -->k 2 ; construction of the field K, is often called the embedding of k, into K 2 . When K, and K 2 are extensions of k, an isomorphism: K , --> K 2 is called a k- isomorphism if it leaves every element of k invariant. In an extension K/k, let S be a subset of K. The smallest intermediate field of K/k contain- ing S is called the field obtained by adjoining S to k or the field generated by S over k, denoted by k(S). The field k(S) consists of those ele- ments in K each of which is a rational ex- pression in a finite number of elements of S with coefficients in k. An extension field k(t) obtained by adjoining a single element t to k is called a simple extension of k, and in this case t is called a primitive element of the extension. The trational function field k(X) with coeffi- cient field k is a simple extension of k with a primitive element X. When su bfie1ds k A ()" E A) of a field K are given, the smallest subfield of K containing all these subfields exists and is called the com- posite field of the k A' E. Algebraic and Transcendental Extensions An element ex of an extension field K of a field k is called an algebraic element over k if ex is a tzero point of a nonzero polynomial, say, f(X) = a o + a, X + '" + anX n with coefficients in k. If ex is not algebraic over k, then ex is called a transcendental element over k. An algebraic element ex is always a root of an irreducible polynomial over k which is uniquely deter- mined up to a constant factor (Ek*) and is called the minimal polynomial of ex over k. K is called an algebraic extension of k if all elements of K are algebraic over k; otherwise we call K a transcendental extension of k. If K, is an algebraic extension of K and K is an algebraic extension of k, then K, is also an algebraic extension of k. In an arbitrary extension field K of k, the set of all algebraic elements over k forms an algebraic extension field of k. A simple extension k(t) with a transcendental element t is isomorphic to the rational func- tion field of one variable with coefficient field k. If t is an algebraic element over k, k(t) is isomorphic to the tresidue class field of the polynomial ring k[XJ modulo the minimal polynomial f(X) of t over k. F. Finite Extensions An extension field K of a field k is called a finite extension if K has no infinite set of ele- ments that are tlinearly independent over k, 
579 i.e" if K is a finite-dimensional linear space over k, The dimension of the linear space over k is called the degree of Kover k and is de- noted by (K: k) (or [K: kJ). If K is a finite extension of k and L is a finite extension of K, then L is also a finite extension of k and (L: K) (K: k) = (L: k). Every finite extension field of k is an algebraic extension of k and is obtained by adjoining a finite number of algebraic elements to k, Conversely, every field obtained by adjoining a finite number of algebraic elements to k is a finite extension of k. If K = k(ex) with an algebraic element ex, then (K: k) is equal to the degree of the minimal polynomial of ex over k, also called the degree of ex over k. Pvprv p.lp.mp.nt of ldfY\ 1'-' p.ynrp.",,,,p.r1 '"  nr\lv_ ........ '--"-J ................................. ........ "'\VVJ ...... ......'''-1''.-'-'....---............ l"........J nomial in ex with coefficients in k, On the other hand, for any nonconstant polynomial f(X) of k[XJ there exists a simple extension k(ex) such that ex is a root of f(X). G. Normal Extensions An algebraic extension field K of a field k is caned a normal extension of k if every irreduc- ible polynomial of k[XJ which has a root in K can always be decomposed into a product of linear factors in K [X]. An extension field K of k is caned a splitting field of a (nonconstant) polynomial f(X)Ek[XJ if f(X) can be decom- posed as a product of linear polynomials, i.e., f(X)=c(X -IXd(X -ex 2 )...(X -IX n ), cEk, exiEK. A splitting field K of f(X) (/(X)Ek[XJ) is called a minimal splitting field of f(X) if any proper subfield L of K (K =:> L =:> k) is not a splitting field of f(X), A minimal splitting field of f(X) is obtained by adjoining an the zero points of f(X). A finite extension field of k is a normal extension if and only if it is a minimal splitting field of a polynomial of k[X]. For any given (nonconstant) polynomial f(X)E k[X], there exists a minimal splitting field of f(X), and all minimal splitting fields of f(X) are k-isomorphic. H. Separable and Inseparable Extensions An algebraic element IX over k is caned a sepa- rable element or an inseparable element over k according as the minimal polynomial of ex over k is tseparable or tinseparable. An algebraic extension K of k is called a separable extension of k if all the elements of K are separable over k; otherwise, K is called an inseparable exten- sion. An element ex is separable with respect to k if and only if the minimal polynomial of ex over k has no double root in its splitting field. If ex is inseparable, then k has nonzero characteristic p, and the minimal polyno- mial f(X) of ex can be decomposed as f(X) = 149J Fields (X -exdpr(X -ex 2 v'... (X -exmyr, r 1, where ex l , ex 2 , ... , exm are distinct roots of f( X) in its splitting field; between the degree n of f(X) and the number m of distinct roots of.f(X), the relation n = mpr holds. In particular, if ex pr Ek for some r, we cal\ ex a purely inseparable element over k. An algebraic extension of k is caned purely inseparable if all elements of the field are purely inseparable over k. In an algebraic extension K of k the set of all separable elements forms an intermediate field Ko of K/k, The field Ko is called the maximal separable extension of k in K. If K is inseparable over k, i.e., if K # Ko, then the characteristic of k is p #0, and K is purely in- ",pno::lTo:.1hlp r\"pr 1( Thp. itp.crrp.p,", n-rT<  .1<1 .......1-'.......................... ...........I ...,..u. ......... -b._........ -L...,..U.....J and pr= [K: KoJ are denoted by [K:kJs and [K: k];, respectively, A separable extension of a separable extension of k is also separable over k, and every finite separable extension of k is a simple extension. If no inseparable irreducible polynomial in k[XJ exists, we call k a perfect field; otherwise, an imperfect field. Every field of characteristic o is a perfect field. A field of characteristic p ( # 0) is perfect if and only if for each a E k the polynomial X p - a has a root in k. Every alge- braic extension of a perfect field is a sepa- rable extension and a perfect field. Any imper- fect field has an inseparable, in fact purely inseparable, proper extension. I. Algebraically Closed Fields If every nonconstant polynomial of k[XJ can be decomposed into a product of linear poly- nomials of k[XJ, or equivalently, if every irreducible polynomial of k[XJ is linear, k is called an algebraically closed field; k is alge- braicaIly closed if and only if k has no alge- braic extension field other than k, and hence every algebraically closed field is perfect. For any given field k there exists an algebrai- cal\y closed algebraic extension field of k unique up to k-isomorphisms (E. Steinitz); hence we call such a field the algebraic closure of k. To proceed further, suppose that we are given a field k and its extension K. If there is no algebraic element of Kover k outside of k, i.e" if k is the intersection of K and the alge- braic closure of k, then we say that k is alge- braically closed in K. The complex number field is an algebraically closed field (c. F. Gauss's fundamental theorem of algebra;  10 Algebraic Equations). J. Conjugates Let k be a field and K an algebraic extension of k. Two elements ex, [3 of K are caIled conju- 
149 K Fields gate over k if they are roots of the same irre- ducible polynomial of k[XJ (or equivalently, if the minimal polynomials of ex and fJ with respect to k coincide); in this case we call the subfields k(ex), k(fJ) conjugate fields over k. The conjugate fields k(ex) and k(fJ) are k-isomorphic under an isomorphism (f such that (f(ex) = fJ. In particular, if K is a normal extension of k, the number of conjugate elements of an element ex of K is the number of distinct roots of the minimal polynomial.f(X) of ex, which is inde- pendent of the choice of a normal extension K containing k. The element ex is separable if and only if the number of conjugate elements in K is the same as the degree of f(X). On the other hand, k(ex) is normal over k if and only if k(ex) coincides with all its conjugate fields. Let ex be a separable algebraic element over k, and let ex j = ex, ex 2 , "', exn be conjugate ele- ments of ex over k. The product A = ex l ex 2 ... exn and sum B=ex j +2 +... +ex n are elements of k. Indeed, if f(X)=X n +c j X n - I +... +c n is the minimal polynomial of ex with respect to k, we have A =( -1)"c n , B= -C j , and A and Bare called the norm and the trace of ex, respectively, denoted by A = N(ex), B = Tr(ex), Let K be a finite separable extension of degree n over k, and let ex be an element of K. Then the degree m of the minimal polynomial of ex is a divisor of n; that is, n = mr with a positive integer r. We define the norm and the trace of ex with respect to K/k by NK/k(ex) = N(ex)', TrK/k(ex) = rTr(ex), respectively. Then these quantities satisfy NK/k(ex/i) = NK/k(ex)NK/k(fJ), TrK/k(ex + fJ) = TrK/k(ex) + TrK/k(fJ) for ex, fJE K. (For the Galois theory of algebraic extensions  172 Galois Theory.) K. Transcendental Extensions Let K be an extension of k and u j , ..., Un be elements of K. An element v of K is said to be algebraically dependent on the elements U j , U2,"', Un if V is algebraic Over the field k(Uj, U 2 , ..., un)' A subset S of K is called algebraically independent over k if no U E S is algebraically dependent on a finite number of elements of S different from U itself; S is called a transcendence basis of Kover k if S is algebraically independent and K is algebraic over k(S). Furthermore, if K is separable over k(S), then S is called a separating transcendence basis of Kover k. There always exists an alge- braically independent basis of Kover k, and the tcardinal number of S depends only on K and k; this cardinal number is called the tran- scendence degree (or degree of transcendency) of Kover k. (When S is an infinite set, we sometimes say that the transcendence degree is 580 infinite.) In particular, if K = k(S) with an algebraically independent S, K is called a purely transcendental extension of k. An extension K of k is called a separably generated extension, or simply a separable extension, if every finitely generated intermedi- ate field of K/k has a separating transcendence basis over k. If K itself has a separating tran- scendence basis over k, then K is separably generated, but not conversely. A purely transcendental extension field of k having a finite transcendence degree n is also called a rational function field in n variables over k, and a finite extension of such a rational function field is called an algebraic function field in n variables over k. Let K and L be extension fields of k, both contained in a common extension field. We say that K and L are linearly disjoint over k if every subset of K linearly independent over k is also linearly independent over L, or equiva- lently, if every subset of L linearly independent over k is also linearly independent over K. An algebraic function field K = k(xj, X2,"', x n ) over k (whose transcendence degree is .s; n) is called a regular extension of k if K and the algebraic closure k of k are linearly disjoint. In order that K be regular over k it is necessary and sufficient that k be algebraically closed in K and that K be separably generated over k. L. Derivations A map D of a field K into itself is called a derivation of K if it satisfies D(a + b) = D(a) + D(b) and D(ab)=aD(b)+bD(a) for all a,bEK. The set of elements C of K for which D(c) = o is a su bfie1d. If the characteristic of K is p(#O), then D(xP)=O for all xEK. Let k be a subfield of K. A derivation D of K is called a derivation over kif D(c)=O for all cEk; the totality of derivations over k is a tk-module. If K = k(x j , X 2 , "', x n ) is an algebraic function field over k, then the k-module of derivations over k has finite dimension s(.s; n) and we can choose s suitable elements U I , U 2 , ..., Us of K such that K is separably algebraic over k(u 1, U 2 , '" , Us). Generally, the transcendence degree r of Kover k does not exceed s, and K is separably generated over k if and only if r=s. M. Finite Fields Finite fields were first considered by E. Galois (1830), so they are also called Galois fields. There is no finite noncommutative field (Wed- derburn's theorem, J. H. M. Wedderburn, Trans. Amer. Math. Soc., 6 (1905)). A simple 
581 proof for this was given by E. Witt (Abh. Math. Sem, Univ. Hamburg, 8 (1931)). The character- istic of a finite field is a prime p, and the num- ber of elements of the field is a power of p. Conversely, for any given prime number p and natural number ex, there exists a finite field with p' elements. Such a field is unique up to isomorphism, which we denote by GF(p') or Fq(q = p'), For any positive integer m, GF(pm') is an extension field of GF(p') of degree m and a tcyclic extension, Every element a of GF(p') satisfies a P ' = a; hence a has its pth root in GF(p'). Therefore every finite field is perfect. The multiplicative group of GF(p') is a cyclic group of order p' - 1. N. Ordered Fields and Real Fields A field K is called an ordered field if there is given a ttotal order in K such that a> b im- plies a +c> b+c for all c and a> b, c> 0 im- plies ac> bc. The characteristic of an ordered field is always O. An element a of K is called a positive element or a negative element accord- ing as a>O or a<O. For an element a of K the absolute value of a, denoted by I a I, is a or - a according as a;;:' 0 or a.s; O. If we define neigh- borhoods of a by the sets {x I a - £ < X < a + £} with positive elements £, K becomes a tHaus- dorff space, If, for any two positive elements a, b of K, there exists a natural number n such that na> b, then we call K an Archimedean ordered field. Two ordered fields are called similarly isomorphic if there exists an isomor- phism between them under which positive elements are always mapped to positive ele- ments. The rational number field and the real number field are examples of Archime- dean ordered fields, while every Archimedean ordered field is similarly isomorphic to a sub- field of the real number field. (For the struc- ture of non-Archimedean ordered fields, see [8].) A field k is called a formally real field (or simply real field) if -1 (1 is the unity element of k) cannot be expressed as a finite sum of squares of elements of k. The real number field is a model offormally real fields. More gener- ally, every ordered field is a formally real field. A formally real field is called a real closed field if no proper algebraic extension of it is a for- mally real field. The real number field is a real closed field. The algebraic closure of a real closed field is obtained by adjoining a root of the polynomial X 2 + 1. If a is a nonzero ele- ment of a real closed field, then either a or - a can be a square of an element of the field. Every real closed field can be made an ordered field in a unique way, namely, by defining squares of nonzero elements to be positive 150 A Field Theory elements. Since it is known that every formally real field is a subfield of a real closed field, it follows that every formally real field is an ordered field and therefore is of characteristic O. The problem of constructing an ordered field out of a formally real field is closely re- lated to the existence of valuations of a certain type [8]. The notion of formally real fields was introduced by E. Artin (Abh. Math. Sem. Univ. Hamburg, 5 (1927». By making use of the theory of formally real fields, Artin succeeded in solving affirmatively tHilbert's 17th prob- lem, which asked whether every positive definite rational expression (i.e., a rational expression with real coefficients that takes positive values for all real variables) can be expressed as a sum of squares of rational ex- pressions. More precisely, it was shown by A. Pfister that every positive definite function in R(X" ''', X n ) is a sum of at most 2" squares. References [1] E. Steinitz, Algebraische Theorie der Kor- per, J. Reine Angew. Math., 137 (1910),167- 309. [2] E, Steinitz, Algebraische Theorie der Kor- per, Walter de Gruyter, 1930 (Chelsea, 1950). [3] H. Hasse, Hohere Algebra I, II, Sammlung Goschen, Walter de Gruyter, 1926-1927. [4] B. L. van der Waerden, Algebra I, Springer, seventh edition, 1966. [5] A. A. Albert, Fundamental concepts of higher algebra, U niv. of Chicago Press, 1956. [6J N. Bourbaki, Elements de mathematique, Algebre, ch. 5, Actualites Sci, Ind" 1102b, Hermann, second edition, 1959. [7J N. Jacobson, Lectures in abstract algebra III, Van Nostrand, 1964. [8] M. Nagata, Field theory, Dekker, 1977. 150 (XX.28) Field Theory A. History When a quantity if;(x), such as velocity, is defined at every point x in a certain region of a space, we say that a field of the quantity if; is given. This general concept is used in many branches of science. Here we confine ourselves to some branches of physics, in particular to the quantum theory of fields, which describes telementary particles. The ttheories of elasticity and thydro- dynamics (in particular, concerning tEuler's 
150 B Field Theory equation of motion) deal with displacement and velocity fields, respectively. However, a field in a vacuum (ether), which is quite differ- ent from a field in a space filled with matter, first became a subject of physics in telectro- magnetism. M Faraday (1837) introduced the electromagnetic field and discovered its funda- mental laws, and J. C. Maxwell (1837) com- pleted the mathematical formulation. On the basis of this formalism, A. Einstein (1905) established the theory of trelativity and later developed the general theories of relativity and of gravity. Although quantum theory originated from the problem of blackbody radiation, the quan- tum theory of the electromagnetic field was first developed by P. A. M. Dirac (1927) after the development of tquantum mechanics. Along similar lines P. Jordan and E, P. Wigner (1927) quantized the matter wave (electron field), and W. Heisenberg and W. Pauli (1929) developed the quantum theory of wave fields in general. Subsequently, Jordan and Pauli (1927), Pauli (1939), S. Tomonaga (1943), J. Schwinger (1948), and others reformulated the theory in a relativistically covariant manner. Quantum electrodynamics, dealing with an electromagnetic field interacting with electrons (and positrons), has given excelIent agreement with experimental measurements when for- mulated in this way. Divergence difficulties inherent to quantized field theory were by- passed by the renormalization procedure of Tomonaga, Schwinger, R. P. Feynman, and F. J. Dyson (1947). On the other hand, H. Yukawa (1934) ap- plied the concept of the quantized field to the interpretation of nuclear force and predicted the existence of n-mesons. Many kinds of new particles have since been found, including n- mesons and muons. The field theory of n- mesons can explain the qualitative features of the meson-nucleon system. Similar theories can be formulated for other types of unstable particles that have been found in cosmic rays since 1949. During the progress of meson theory, vari- ous types of fields were investigated, and a general theory of elementary particles was developed. Dirac (1936) proposed a general wave equation for elementary particles. and Pauli and M. Fierz (1939) proved the con- nection of spin and statistics ( the end of Section D). Schwinger (1951) derived quantum- mechanical equations of motion and commu- tation relations from a unified variational principle. For the cases where no interaction is present, a general theory of elementary par- ticles consistent with the requirements of rela- tivity and of quantum theory was established as the theory of free fields. 582 B. Relativistically Covariant Classical Fields Relativistically covariant fields <p:(x) on the level of classical theory are functions of the space-time point x with either real or com- plex values depending on the index ex (which distinguishes different fields) [1]. A finite- dimensional representation D' of SL(2, C) ( 258 Lorentz Group) on either a real or com- plex vector space is assigned to each ex, and the index r of <p:(x) refers to components in this representation space, Each model is specified in terms of a Lagrangian density 2'(x) that is a function (typically a polynomial) of <Pr'(x), v<p:(x) (V" denoting V/VX, J1=0, ,.. , 3) and their complex conjugates. 2' is taken to be invariant under the replacement of <p:(x) and v<p:(x) by LsD'(A),,<p:(x) and Ls,vD'(A)rsA(A):vv<p:(x) for all A E SL(2, C) (called the Lorentz invariance of 2') and pos- sibly under the replacement of <p:(x) and v<p:(x) by L U'<pf(x) and L U'Pv<pf(x) (called an internal symmetry). The fields are supposed to satisfy the follow- ing partial differential equation, called the field equation, and are obtained as the tEuler equa- tion of the variational problem for the action integral 1= S 2'(x)dx: 3 v2'(x)/v<p:(x) - L (v/vx)(v2'(x)/vW<Pr'(x)) o =0, 3 v2'(x)/v{j5:(x)- I (v/vx)(v2'(x)/v(vqJ,:'(x))) o =0 (the bar indicates the complex conjugate), where the second equation is identical to the first for a real field, For complex fields. oSf(x) is chosen such that 1= T (possibly except for a surface term) in order to ensure that the above two equations are complex conjugates of each other. The invariance of I under a Lie group of (local and/or volume-preserving point) trans- formations of fields implies a conservation law for a certain quantity (Noether's theorem;  e.g., E. L. Hill, Rev. Mod. Phys., 23 (1951)). For translational invariance (x-> x + a), the energy-momentum (stress) tensor Pv(x) = - bv2'(x) + I {v2'(x)/v(v<p:(x))vv<Pr'(x) ',r + v2'(x)/v(v{j5:(x»vv{j5:(x)} (the complex conjugate terms in all equations to be suppressed for real fields) satisfies the differential conservation law LoV Pv(x)=O as a consequence of the field equations, which in turn implies the time-independence of the 
583 folIowing quantity provided that T k (x) vanishes sufficiently fast at spatial infinity: H= f TO o (x)d 3 x, P k = f TO k (x)d 3 x (xo=t). These are the total field Hamiltonian (energy) and momentum. In a general situation, TV( = LgVPT P' where gV is the Minkowski metric tensor with the signature 1, -1, -I, -I) is not necessarily symmetric in J1 and v. F. J. Belinfante (Physica, 6 (1939); 7 (J 940)) has given a formula (by a change of 2'(x) via so-called 4-divergence) for obtaining a symmetrized energy-momentum tensor e v for which Hand P k are equal to those defined from the above T v and for which the differential conservation law is satisfied. Lorentz invariance implies the conservation law LaMVA(x)=O for the angular momen- tum density MVA(X) = XVPA(X)_ xAPV(x) + L {a2';a(8<p:(x))S,VA<p:(x) a,r,S + 82' j8(o </i:(X))S:,VA </i:(x)} = x'eA(x)-xAeA(x) and the time-independence of the total angular momentum M"A= f MO VA d 3 x (xo=t), where S:::' is antisymmetric in J1, v and D'(A)rs =i5rs+L"SfVevj2 for A(A)v=g,+f.v with an infinitesimal ev' (We have (A(A)xt = Lv A(A)xV and ;\(A)v = LpgpA(AYv') A continuous one-parameter group U(p) of internal symmetries implies the conserva- tion law L"oJ(x)=O for the 4-current den- sity (charge (J1 = 0) and current (J1 = 1,2,3) densi ties) JP(x) = L {82'jo(8 p <p:(x))A' P <p!(x) ,pr + 82'jo(a</i:(x))I'P </i!(x)} and the time-independence of the charge S JO(x)d 3 x (xo = t), where U'(O)"P =.Ie ,p. An example of U(p) is the multiplication by exp iQ' p (calIed a gauge transformation of the first kind), where Q' is an integer with Q' = 0 for real fields <Pr'. Examples of Lagrangian densities for non- interacting fields (calIed free Lagrangian den- sities) are: real scalar field: (Lvg' 8 <p(x)ov<p(x) - m 2 <p(x)2)j2, complex scalar field: (Lvg' op </i(x)ov<p(x) - m 2 </i(x)<p(x)), 150 C Field Theory Dirac field: (ij2)Lws(t/;r(x)tYsot/Js(x) - a t/;r(x)tYst/;s(x) - m Lr t/Jr(x)tt/;s(x) (i/Jr(x)t = L, Js(x)y?" y = (Ys) are tDirac's y- matrices), massive real vector field: -(Ij4) L gKPgA'(oKAA(x)-8AA.(x)) KAJJ-V m2 x (aAv(x) - avA(x)) +____ L9V A(x)A,(x), 2 v where D' is trivial for scalar fields, [I, OJ EB [0, IJ for the Dirac field, and [1, IJ for vector 1':.....1....J..... lIC;IU. For interacting fields, some interaction parts "''''00. ....r1r1ar1 +-....... +-.....0. ",,n-. .......f ....,"".... f...aa 1 ................ro..........ro al\" auuu lV lll\" Ulll Vi U\"l1 11\".0"" L£1.t,lQUt,lQU densities of relevant fields. Some examples are: P(<p)(or g<p4) interaction: P(<p(x)) (or g<p(X)4), Yukawa-type interaction: 9 Lrs t/Jr(x)tYst/J,(x)A(x), Fermi interactions: 9 Lkk' Gkk'(Lrst/J:(X)t(l),t/J!(x») x (Lr,s' t/Jr' (x)t(l),At (x)) «(I)k = I (scalar, nO index k), yP (vector, G kk , = gw)' yyV (tensor, Gkk' =g,gvv')' y5y (pseudovector, Gkk' = g" y5 = iy0y' y2y3), y5 (pseudoscalar, no index k). C. Heuristic Theory of Quantized Fields For free fields, a quantization procedure similar to the usual quantum mechanics ( 351 Quantum Mechanics; 377 Second Quan- tization) leads to the following type of canon- ical commutation or anticommutation rela- tions among fields (and their first time deriva- tives) at time 0; for example, real scalar field: [<p(0, x), ljJ(O, y)J _ = ii5 3 (x - y), [<p(0, x), <prO, y)] _ = [ljJ(O, x), ljJ(O, y)J _ = 0, Dirac field: [i/lr(O, x), t/J,(O, y)J + = i5 rs i5 3 (X - y), [t/Jr(O, x), t/Js(O, y)J + = [i/lr(O, x), i/I,(O, y)] + = O. ([A,Bl t =AB::!:BA, brs=O for r#s and b rr = 1, b\x - y) = IIf=1 i5(x k -l).) The free-field equa- tions then lead to the following 4-dimensional commutation relations: real scalar field: [<p(x), <p(y)] _ = - im(x - y), Dirac field: [t/Jr(x), t/;s(y)tJ + =(L p Yisop - im ors)m(x - y), massive real vector field: [A(x), Av(Y)] _ = i(g, +m-28A)m(x- y). Here m is the tin variant distribution, There is a unique representation of such relations for a 
150D Field Theory field (caIled the Fock representation) with a vector Q (called the free vacuum vector) which is annihilated by f <p(x)f(x)dx, by f t/1r(x)f(x)dx and f r(x)f(x)dx, or by f A(x)f(x)dx when- ever f eiPf(x)dx = 0 for pO> 0 (here, P' x = L9vPXV), and which is cyclic (i.e., polynomials of (smeared-out) fields generate a dense subset ofQ). The free fields in the Fock represen- tatio}l satisfy the Wightman axioms ( Sec- tion D). In the Fock representation of canonical fields a translationally invariant vector must be a free vacuum vector, up to multiplication by a complex number. In order to construct a model of a translationally invariant interaction among canonical fields with a unique vector of minimal energy (a vector which is called the true or interacting vacuum and which must be translationally invariant if unique), one must look for some other suitable representa- tion. Such a no-go theorem is caIled Haag's theorem. The earliest formulation of interacting quan- tized fields was developed heuristically by formal manipulation in the Fock representa- tion, described in textbooks of quantum field theory [2-6]. It can be mathematically justi- fied if the so-called cutoff is introduced by limiting the space to a finite volume, possibly changing the space into a lattice and smooth- ing out fields (effectively cutting off the high- energy part of the interaction) (A. M. Jaffe, O. E. Lanford III, and A. S. Wightman, Comm. Math. Phys., 15 (1969». The full theory is then expected to be obtained by taking a limit of the true vacuum expectation values as various cutoff parameters are removed. This is the aim of constructive field theory ( Section F), which has been achieved for some space-time models of dimension 2 and 3. In quantum field theory, the S-matrix is given in terms of (the mass-sheIl restriction of the Fourier transform of) the vacuum expec- tation value of the time-ordered product of fields, called r-functions ( Section E). In the heuristic approach, it is gi ven in terms of the foIlowing Gell-Mann-Low formula (its imagi- nary time version being mathematically used in constructive field theory): If the field <p(x) = e iHxO tpo(O, x»e- iHXO with H =Ho +H] (Ho is the free Hamiltonian as the generator of the time translation for the free field <Po(x) and H] is the interaction Hamiltonian, such as P(<po) and x? > ... > x, then (Q, <p(x 1)'" <p(xn)Q) = lim (e iH 1' Q O' <p(X 1 )... <p(Xn)e-iHTQO) Too /(Qo' e-2iHTQO), where Qo is the free vacuum. The numerator 584 can be written as (Qo, U(T,x?)<Po(xdU(x?,xg).., <Po(xn)U(x, T)Qo) in terms of U(t,s)=ei'Hoe-i(t-S)He-i,Ho and the canonical field <Po at time O. The covariant perturbation series due to Tomonaga, Schwin- ger, Feynman, and Dyson is obtained by sub- stituting the following expansion, which is the iteration of the Duhamel formula: U(t, s) 00 (t f '" 1 =no(-i)"J. dt t ... S dtnH](td...H](tn). Here, H](t)=eitHoH]e-i'Ho. Each term can be represented by connected tFeynman diagrams (the denominator canceling out all discon- nected graphs) and computed according to the tFeynman rule, yielding tFeynman integrals. Each expression so obtained (formally) may be a divergent integral (in the absence of the cutoff), in which case one tries to cancel it out by modifying the original Hamiltonian with the alteration of parameters such as mass and coupling constant (by an amount cal1ed the renormalization constant, which is diver- gent in the absence of the cutoff) or possibly by the addition of terms again involving renormalization constants. If this can be achieved in terms of a finite number of renor- malization constants, the model or the Hamil- tonian is caIled renormalizable [7]. If the divergent integral appears only in a finite number of graphs (counting the same sub- graph of an infinite number of different dia- grams as one graph), the model is called super- renormalizable. D. Axiomatic Quantum Field Theory Relativistically covariant fields <p,(x) on a Hil- bert space Yt' are called Wightman fields if the following four Wightman axioms are fulfilled [8-10]: (I) The fields <p,(x) are operator-valued distri- butions: For each COO-function f of rapid de- crease on the Minkowski space (j EY'(R 4 ), <p,( f) is an operator defined on a common domain D dense in .1t' and satisfying <p,( f)D c D (the domain of <p,(f)* contains D), <p,(f)* I D = <P.(f) for another index a and ('P, <p,(f)<D) for any 'P, and <D in D is linear and continuous in f relative to the topology of the Schwartz space Y'(R 4 ). (2) Relativistic covariance: There exists a continuous unitary representation U(a, A) (aER 4 and AESL(2,C)) of the universal cover- ing group J of the trestricted inhomogeneous Lorentz group 2!'J on Yt' such that U(a,A)Dc 
585 Dand U(a, A)<p.(/)U(a, A)* == L D(A).p<pp(fa,A)' p where fa,A(X) = f(A(A)-I(x-a)) and (D(A).p) is a finite-dimensional representation of SL(2, C). The index ex usually consists of tundotted and dotted indices, interchanged in rz, and of other indices, interchanged among them in rz.) (3) Locality: If the supports of f and 9 are mutually spacelike, then <P.(/)<Pp(g) = £(ex, [3) <pp(g)<P.U) on D, where £(ex, [3) = :tl. If £(ex, [3) = -1 exactly when D(A)..= D(A)pp = -1 for A= -1 (i.e., when both <P. and <Pp are Fermi fields), they are said to satisfy the normal commutation relations. (4) Spectrum conditions: Let U(a, 1)=e ia ' p . The joint spectrum of P (J1 = 0, 1,2,3) is in the forward cone V+ = {pER4Ip' p:;:' 0, pO:;:, O}, with a point spectrum of multiplicity 1 at p=O. A vector Q belonging to the point spectrum 0 of P is called the true (or interacting) vacuum and is required to be in D. Usually D is taken to be minimal, namely, D is the linear hull of Q and <P'I Ud... %.Un)Q with all possible n, ex I, ..., exn, fl, '" ,In. By means of the nuclear theorem, it is possible to define a linear operator <P'I' "'U) for f(x"..., x n ) in Y'(R 4n ), linear and continuous in f on D such that it coincides with <P'I (/d ... <P..Un) if f(x) = IIj1 fj(Xj)' If such operators are introduced, then the linear huH of Q and <p.I.....U)Q is taken to be D. Under the foregoing axiom, the vacuum expectation values of the products of field operators define tempered distributions J.v.(x) = J.v. 1 ..,(x I' ..., x n ), called Wightman functions, such that J.v.(IIfj) = (Q, <P'1 (/d... <P..(/n)Q). The notion of a connected graph in pertur- bation theory corresponds to the following notion of truncated Wightman functions J.v.T: m J.v.T(I)=I(_l)m-l(m-1)! I fl J.v.(Ik)' {h}k1 m J.v. (I) = I L fl J.v.T (l k)' m ilk} k=1 where I indicates a set of variables XjE M,jE I, {Id is a partition of I into m subsets, with the order of the x's in each Ik remaining the same as in I, and the sum over {Id extending over all possible partitions. If the spectrum of {P} in Q.l is contained in V.;"= {pEM Ip' p:;:,m 2 , pO :;:,m} for some m>O (the mass gap), then J.v.T has an exponential clustering property at spatial infinity. For example J.v.T (XI + ai' ..., X n +an)em'R-->O as a distribution in X if m' <m, aJ =0 for aH j and R =max la j - akl--> 00. 150D Field Theory If the mass operator (p. P) 1/2 has an iso- lated point spectrum at m with the eigenspace yt'l (m), then there are sufficiently many n, ex = (ex I' ... ,ex n ), and f(x I' ... , x n ) such that <P.(/) ° E yt'l (m), and the linear hull of such vectors are dense in yt'l (m). In terms of the Jj satisfying <P'uJfj)QE yt'l (mj) (some of the m's may coin- cide) and the solutions gj(X) = S gj(p)exp i(px- w}p)x O )d 3 p, Wj(p)=(p2 +mJ)I/2, of the Klein- Gordon equation, then the following limits, called out and in states, exist: OU' 'Pin (hl...h n )= lim QI(t,gd...Qn(t,gn)Q, t:too Q}t,gj)= f <P.,)XI +(t, X), ...,Xnj+(t,X)) Xfj(X 1+'" + Xn)g}t, x)dx I ... dxnjdx, h j =(2n)3 gj(P)<P./fj)QEyt'1 (m j ). It defines the S-matrix elements as follows: S(h l .., h n ; h'l ... h,) ==('P0U'(h l ... h n ), 'Pin(h'I'" h,)). This is called Haag-Ruelle scattering theory, and the existence proof is based on the cluster- ing property of J.v.T and an asymptotic esti- mate for the behavior of the g's for large t and all x (R. Haag, Phys. Rev., 112 (1958); D. Ruelle, H elv. Phys. Acta, 35 (1962)). The out and in states can be interpreted in the limit of infinite future and past as the state where n particles are moving at velocities v j related to the spectrum p of P through the relation pO = m/(l-vJ)I/2, p=mjv)(1-v])I/2 (with probabil- ity amplitude proportional to hj(p») (H. Araki and R. Haag, Comm. Math. Phys" 4 (1967)). Since ('P0U'(h l ... h n ), 'P0u'(h'I'" h.)) n = bnn' I fl (h j , h(j)), P j=1 where the sum is over all permutations P, 'P 0u ' can be viewed as a unitary mapping from the tFock space yt'0 over L;' yt'l (m) to the closed subspace yt'0u, spanned by 'P0U'(h l ... h n ), n = 0,1,2,... (Q for n= 0, hI itselffor n = 1). The free fields on yt'0u, and yt'in are called (out and in) asymptotic fields. Likewise, 'Pin is a unitary mapping from yt'0 to yt'in. If yt'in = yt'ou" then the matrix element S can be viewed as a ma- trix element of a unitary operator on yt'o, called an S-matrix. Its unitary transform by 'P 0u ' and by 'Pin coincide and define a unitary operator, sometimes called an S-operator, on yt'in = yt'0u'. The equality yt' = yt'in = yt'0u, is caIled the completeness of the scattering states or asymptotic completeness. If the scattering states are complete, then the following LSZ asymptotic condition, due 
150D Field Theory to Lehmann, Symanzik, and Zimmermann (Nuovo Cimento, 1 (1955)), holds: weak-lim Qit, gj) = a: u ' (h), t_::I:oo in where a*(h) is the tcreation operator (a*(h)' 'P(h 1 ... h n ) = 'P(hhl ... h n ) for either out or in states), q',,).fj)OE YC'dm) is not required, and (0, <p,,).fj) 0) = 0 is assumed instead. This leads to an explicit expression for the S-matrix elements in terms of r-functions, which takes the following simple form if all particles have spin 0, all fields <p, are scalar (D(A),p = b,p), and all n j can be taken to be 1. First define the connected part Sc(hl ... h n ; h ... h.) from S by exactly the same equation as truncated Wight- man functions with S to be set to 0 if n = 0, 1 or n' =0, 1 except when n=n' = 1 (S(h, h')= (h,h')). Then Sc(hl ... hI; hi+! ... h n ) = f[ (Q (Pj' pj-m lJ )i 1 '(PI, ." ,p" - P,+!"", - pn)JjQ (- iZj-I/2hj(p)dOj(pj)), where p7 = (pJ + mlJI/2 (the set of such pis called the positive mass shell; there the vanish- ing factors (Pj' Pj - m 2 ) cancel the poles of in, dOm/p) is the Lorentz invariant measure (2Ip°I)-1 d 3 p (= b(p' P -mJ)d 4 p) on the posi- tive mass shell, hE .J't'l (m) is represented by a measurable function h(p) with the inner prod- uct (h, h')= S h(p) h'(p)dOm(p), Zj is defined by (, <P"j)(.fj)0)=ZF2(h,) for all hEYC'dm) and f(p)=(2n)-3/2 S e ip Xf(x)dx, and il is the Fourier transform of the truncated time- ordered function (r-function) defined by inPI' ...,Pn)= f r,T(x l , ...,Xn)eXPi(t pj'x j ) n X n (2n)-3/2d 4 x j , j1 r;(x) = L O(x o ; C p ) Wl,(x) (x=(x"..., x n )), p Wl,(x)=(O, <P,p,l)(x P (1»)'" <p'P,"/Xp(n»)O)T, in which the sum is over all permutations P, T indicates the truncated Wightman functions, 8(x o ; C p ) is the characteristic function of the cone X(1)  x(2)  '"  x(n) (the time ordering) possibly smeared out by convolution with a COO-function of compact support (so that it can multiply a distribution), where the formula does not actually depend on the smearing functions. (Usually fields <p, are normalized so that Zj= 1.) The r-functions (0, Steinmann, He/v. Phys. Acta, 33 (1960); D. Ruelle, Nuovo Cimento, 19 (1961); H. Araki, J. Math. Phys., 2 (1961)) are 586 defined by ri(x) = L O(X O ; CpjC;JW/,,(x) p ( =8(XO;CpjC;)Wp,,(X)) (often with an additional factor( - on-I), where the Fourier-Laplace transform 8(qO; C p ) of O(x o ; C p ) is a rational function and 8(x o ; CpjC;) is the inverse Fourier transform of its bound- ary value as ImqOE C i tending to 0, and the cone C i and hence r i is specified by a consistent choice of signs of LjEI 1m qJ for all nonempty proper subsets I of(1, ...,n), The Fourier transform of ri(x) is a boundary value of an analytic function common to all ri and coin- cides with i; for pO E C i ' Making use of this relation, some analyticity properties of the S- matrix (including TCP symmetry) have been proved ([11, 12J; J. Bros, H. Epstein, and V, Glaser, Comm. Math. Phys., 1 (1965); Epstein, Glaser, and A. Martin, Comm. Math. Phys., 13 (1969); Epstein, J. Math. Phys" 8 (1967)). The two-point function for scalar fields has the following simple expression in terms of tinvariant distributions, sometimes called the Kiillen-Lehmann representation with the Kiillen-Lehmann weight p: (0, <p(x)<p(y)O) = !ORO K(X - y)dp(K 2 ). Under the Wightman axiom (with the normal commutation relations), there exists an antiunitary operator e, called the TCP operator, which satisfies the relations eo=o, eU(a,A)e-I=U(-a,A), e 2 =U(0, -1), e<p,(x)e- I =( -1)"( _i)N <Pii( -x), where I] is the number of undotted indices in ex, N =0 if D(A)" = 1 for A= -1 (Bose fields), and N == 1 if D(A),,= -1 for A = -1 (Fermi fields). The choice of::!: 1 for t:(ex, ex) in the locality axiom cannot be opposite to the normal com- mutation relations for any ex except for the trivial case <p.(x) = 0 ([8-1 OJ; G. Liiders and B, Zumino, Phys. Rev., 110 (1958); N. Burgoyne, Nuovo Cimento, 8 (1958)). This is called the connection of spin and statistics. For a general choice of::!: 1 for t:(ex, [3), the Wightman axioms imply existence of a certain number of even- oddness conservation laws (i.e., the existence of a unitary representation u of a group (Z2)' for some I such that u(g)O=O, u(g)<p,(f)u(g)*= X,(g)<p.(f) for some characters X, on the group) so that the Klein transforms cp,(x)== U(g,)qJ,(x) (for some choice of g,) of the original fields satisfy the Wightman axioms with the normal 
587 commutation relations ([8-10]; H. Araki, J. Math. Phys., 2 (1961)). E. Theory of Local Observables Except for the technical assumption of operator-valued distributions, physical con- tents of the Wightman axioms have been for- mulated in terms of the von Neumann algebra d(0) for bounded open sets 0, generated by those observables which can be measured in the space-time region 0, as follows: (1) Isotony: If 0 1 =:>0 2 , then d(0d=:>d(0 2 ), (2) Covariance: U(a, A)d(0) U(a, A)* = d(A(A)0 + a). (3) Locality: If Q j Ed(0) and 0 1 is spacelike to O 2 , then [Q 1, Q2J = O. (No signal can propa- gate faster than the speed of light.) In addition, the spectrum condition is as before (the stability of the vacuum) and, since we restrict our attention to the closed span of d(0)Q for all 0, Q is assumed to be cyclic for U(';d(0). (Then the latter is irreducible.) By treating Q(x)= U(x, I)QU(x, 1)* for QuJ1(0) as a (noncovariant but localized) field, Haag- Ruelle scattering theory and the analyticity properties of the S-matrix described above for Wightman fields hold in exactly the same way in the theory of local observables. The notion of an algebra of local observables has been a concern of R. Haag since the late 1950s, with the consequent analogy to Wightman fields being demonstrated by Araki in his Zurich lectures of 1961-1962. Hence the above axioms are sometimes called Haag- Araki axioms. With the help of the additional axiom (d(0dU d(0 2 ))" =d(0 1 U( 2 ), called the addi- tivity, the vacuum vector Q is cyclic and sep- arating for d(0) for any bounded open 0 (Reeh-Schlieder theorem) and .W'( 0) = d( dJ), where dJ is the double cone {x Ilxol + Ixl.s; L} if 0= {x Ilxol + Ixl.s;L, Ixl <£} for any £> 0, for example (Borchers theorem). A merit of the Haag-Araki axioms is that these axioms have direct physical interpre- tation. In particular they always imply the commutativity at spacelike separation of sup- ports, in contrast to the anticommutativity for Fermi fields. This then necessitates the con- sideration of the representations associated with some other states, such as states with an odd number of fermions and representations of the C*-algebra d of quasi local observables (generated by all d(0)), which are nonequiva- lent to the vacuum representation and are called superselection sectors. This viewpoint was introduced by Haag and D. Kastler (J. Math. Phys., 5 (1964)) and the C*-algebra 150 F Field Theory version of the Haag-Araki axioms is called the Haag-Kastler axioms. If (!)' denotes the causal complement of 0 (i.e., the set of all points spaceJike to 0), then the assumption .W'«(!))' = d(0') is called duality and is proved for a certain type of region, which includes the double cone in the case of free fields. With the assumption of the duality for double cones in the vacuum sector, S. Doplicher, Haag, and J. Roberts (Comm. Math. Phys., 13 and 15 (1969); 24 (1971); 35 (1974)) succeeded in the analysis of superselection sectors and clarified the connection of spin and statistics in a much more satisfactory fashion, as well as the anticommutativity of intertwin- ing operators for superselection sectors for Fermi statistics. F. Constructive Field Theory An effort to make mathematical sense out of the heuristic theory of quantized fields and to produce examples of Wightman fields and the associated system of local observables has been pushed forward by J. GJimm and A. Jaffe since the mid 1960s and is known as construc- tive field theory. Since 1972, the Euclidean methods, already known in some sense, have become extremely powerful central tools, are collectively known as Euclidean field theory [13-16]. The Wightman function W(z I ... zn) is ana- lytic at the Schwinger points Zj = (ix7, x) (x E R4) if Xj#Xk for j #k, and its value S(x i .., x n )= W(z 1 .,. Zn) is called the Schwinger function. The axioms for Schwinger functions equiv- alent to Wightman axioms are known as Oster walder-Schrader axioms (Comm. Math. Phys., 31 (1973); 42 (1975)). The positivity axiom reflecting the positive definite metric of the Hilbert space for Wightman fields is known as O-S positivity (or T -positivity or reflection positivity). Since the Schwinger function is symmetric in its variables, it can be viewed as the expec- tation value of the product of (commuting) random fields, called Euclidean fields, if an additional positivity holds, This idea was put forward by K. Symanzik in the 1960s. E. Nel- son then realized that Euclidean fields for free fields have the Markov property, and he devel- oped Euclidean Markov field theory. A work of Guerra in 1972 utilizing Nelson symmetry revealed the extreme power of this approach, and the whole of constructive field theory has been studied in Euclidean formulation with remarkable results for super-renormalizable models in space-time of dimension 2 and 3. The key point is the Feynman-Kac-Nelson formula, which expresses Schwinger functions 
150G Field Theory as functional integrals and reveals a mathe- matical connection between Euclidean field theory and classical statistical mechanics. G. Gauge Theory tElectrodynamics in terms of the 4-vector potential A#(x) is invariant under the local gauge transformations A#(x)-->A)x) + o#A(x) (and the associated transformation of the charged fields) for A satisfying the wave equa- tion DA == O. This leads on one hand to com- plication in the canonical quantization of the fields A#(x) and, on the other hand, to the necessity of an indefinite inner-product space as exemplified by the Gupta-Bleuler formalism of quantum electrodynamics. In such a formal- ism, the physical Hilbert space (with positive definite metric) is introduced by considering a specific subspace (physical subspace) of a semidefinite metric and taking the quotient by its null subspace. The corresponding theory with a non- commutative gauge group is known as the theory of Yang-Mills fields. In order to restore the formal unitarity of the S-matrix in the perturbation series in terms of Feynman dia- grams, 1. D. Faddeev and V. N. Popov (Phys. Leu., 25B (1967)) introduced fictitious particles, called Faddeev-Popov ghosts. T. Kugo and I. Ojima (Prog. Theoret. Phys., Suppl., 66 (1979)) developed a canonical quantization scheme for the Yang-Mills field which naturally intro- duces additional fields corresponding to the Faddeev-Popov ghosts and specifies the phy- sical subs pace by means of the simple condi- tion that it be the kernel of the generator of BRS transformations, earlier introduced by C. Becchi, A. Rouet, and R. Stora. Gauge theory can be formulated on a lattice of finite volume as a kind of classical statistical mechanics. This is known as lattice gauge theory, and the important issue currently being investigated is whether or not its limit, as the lattice interval tends to 0 (the continuum limit) and the volume tends to infinity, produces a nontrivial quantum theory of gauge fields. References [I] 1. D. Landau and E. M. Lifshits, The classical theory of fields, Pergamon, 1975. (Original in Russian, 1973.) [2J J. D. Bjorken and S. D. Drell, Relativistic quantum fields, McGraw-HilI, 1965. [3J S. S. Schweber, An introduction to relativ- istic quantum field theory, Harper & Row, 196\. [4] H. UmeLawa, Quantum field theory, North- Holland, 1956. 588 [5] K. Nishijima, Fields and particles, Ben- jamin, 1969. [6] N. N. Bogolyubov and D. V, Shirkov, Introduction to the theory of quantized fields, Wiley, third edition, 1980. [7] K. Hepp, Theorie de la renormalization, Springer, 1969. [8] R. F. Streater and A, S. Wightman, PCT, spin and statistics, and all that, Benjamin, 1978. [9] R. Jost, The general theory of quantized fields, Amer. Math. Soc., 1965. [IOJ N. N. Bogolyubov, A. A. Logunov, and I. T. Todorov, Introduction to axiomatic quan- tum field theory, Benjamin, 1975. (Original in Russian, 1969.) [II] C. DeWitt and R. Omnes (eds.), Disper- sion relations and elementary particles, Wiley, 1960. [12] M. Chretien and S. Deser (eds.), Axiom- atic field theory, Gordon & Breach, 1966. [13] G. Velo and A. S. Wightman (eds.), Con- structive quantum field theory, Springer, 1973. [14] B. Simon, The P(cp)z Euclidean (quan- tum) field theory, Princeton Univ. Press, 1974. [15] J. Glimm and A. Jaffe, Quantum physics, Springer, 1981. [16] B. Simon, Functional integration and quantum physics, Academic Press, 1979. [17] A. Jaffe and C. Taubes, Vortices and monopoles, Birkhiiuser, 1980. 151 (IV.'4) Finite Groups A. The Number of Finite Groups of a Given Order A group is called a finite group if its order is finite ( 190 Groups), Since the early years of the theory of finite groups, a major problem has been to find the number of distinct isomor- phism classes of groups having a given order. It is almost impossible, however, to find a general solution to the problem unless the values of n are restricted to a (small) subset of the natural numbers. Let f(n) denote the num- ber of isomorphism classes of finite groups of order n. If p is a prime number, then f(p)= 1 and any group of prime order is a cyclic group. If p is prime, then any group of order p2 is an tAbelian group and f(p2)= 2. If p and q are distinct primes and p> q, then f(pq) = 2 or 1 according as p is congruent to 1 modulo q or not. If p= 1 (modq), there is a non-Abelian group of order pq as weII as a cyclic group of order pq. For small n, the value of f(n) is as 
589 follows: n 8 12 16 18 20 24 27 28 30 32 60 f(n) 5 5 14 5 5 15 5 4 4 51 13 For any n, f(n)?: 1. When p is prime, f(pm) is known for m.s; 6:f(p3)= 5,!(p4) = 15 ifp>2. For f(p5) see O. Schreier, Abh. Math. Sem. Univ. Hamburg, 4 (1926). For f(26) see [11]. Set f(pm)= pi and I=Am 3 . Then A-->2/27 as m-->oo (G. Higman, Proc. London Math. Soc., 10 (1960); C. C. Sims, Symposium on Group Theory, Harvard, 1963). B. Fundamental Theorems on Finite Groups The foIl owing are some fundamental theorems useful in studying finite groups. (1) The order of any subgroup of a finite group G divides the order of G (J. 1. La- grange), The converse is not necessarily true. If a finite group G contains a subgroup of order n for any divisor n of the order of G, then G is a tsolvable group. Furthermore, if G contains a unique subgroup of order n for each divisor n of the order of G, then G is a cyclic group. Let p be a prime number. Let the order of a finite group G be pnm, where m is not divisible by p. A subgroup of order pn of G is called a p- Sylow subgroup (or simply a Sylow subgroup) of G. The importance of this concept may be seen from the next theorem, (2) A finite group contains a p-Sylow sub- group for any prime divisor p of the order of the group. Furthermore, p-Sylow subgroups are conjugate to each other. The number of distinct p-Sylow subgroups is congruent to 1 modulo p. In general, the number of distinct p- Sylow subgroups of G that contain a given subgroup whose order is a power of p is con- gruent to 1 modulo p (Sylow's theorems), (3) A p-group is a tnilpotent group (a finite group is caIled a p-group if its order is a power of p). Thus any finite p-group G of order> 1 contains a nonidentity element in the tcenter of G. Furthermore, any proper subgroup of G is different from its tnormalizer. A paper by P. HaIl (Proc. London Math. Soc., (2) 36 (1933)) is a classic and fundamental work on p-groups. A group G of order 8 with 2 generators a, r and relations a 4 = 1, rar- 1 =a- I , a 2 =r 2 is called the quaternion group. This group is isomorphic to the multiplicative group consist- ing of {::!:1, ::!:i, ::!:j, ::!:k} in the tquaternion field. A generalized quaternion group is a group of order 2 n with 2 generators a, r and rela- tions a 2 "-1 = 1, rar-I = a-I, r 2 = a 2 "-2. A non- Abelian group all of whose subgroups are normal subgroups is caIled a Hamilton group. A Hamilton group is the direct product of a quaternion group, an Abelian group of odd 151 D Finite Groups order, and an Abelian group of exponent 2 (i.e., p2 = 1 for each element p). Let G be a finite group, and let Go = G =:> G l =:> ." =:> G r = {I} be a tcomposition series of G. The set of isomorphism classes of the simple groups Gi-I/G i , i = 1, 2, "', r, is uniquely deter- mined (up to arrangement) by the tJordan- Holder theorem. Thus the two most funda- mental problems of finite groups are (i) the study of the simple groups and (ii) the study of a group with a given set of composition fac- tors. The first is one of the leading problems of the theory, although it has been in a state of stagnation until rather recently ( Section J). As to the second problem, initial works by H. Wielandt and others are under way (partic- ularly in the direction of various generaliza- tions of Sylow's theorems). For the class of finite solvable groups, the first problem has a rather trivial solution; only the second prob- lem is important, and even in this case the theory seems to leave something to be desired. C. Finite Nilpotent Groups A finite group is nilpotent if and only if it is the tdirect product of its p-Sylow subgroups, where p ranges over all the prime divisors of the order. Any maximal subgroup of a nilpo- tent group is normal. The converse holds for finite groups; that is, a finite group is nilpotent if and only if all its maximal subgroups are normal. D. Finite Solvable Groups One of the most profound results on finite groups, an affirmative answer to the long- standing Burnside conjecture, is the Feit- Thompson theorem (Pacific J. Math., 13 (1963)): A finite group of odd order is solvable. The index of a maximal subgroup of a finite solvable group is a power of a prime number (E. Galois). But the converse is not true. The unique simple group of order 168 has the property that all maximal subgroups are of prime power index. A finite solvable group contains a self-normalizing nilpotent subgroup (i.e., a nilpotent subgroup H such that NdH) = H), and any two such subgroups are conju- gate (R. W. Carter, Math. Z., 75 (1960); cf. W. Gaschiitz, Math. Z., 80 (1963), for a generali- zation). Such a subgroup is caIled a Carter subgroup and is an analog of a Cart an subal- gebra of a Lie algebra. But unlike Cartan subalgebras, most simple groups do not con- tain any self-normalizing nilpotent subgroups. A finite solvable group of order mn (m, n are relatively prime) contains a subgroup of order m; two subgroups of order m are conjugate; if I 
151 E Finite Groups is a divisor of m, then any subgroup of order I is contained in a subgroup of order m (P. Hall), The converse of the first part of this theorem is also true: A finite group is solvable if it con- tains a subgroup of order m for any decompo- sition of the order in the form mn, (m, n) = 1 (P, Hall's solvability criterion), This generalizes the famous Burnside theorem asserting the solvability of a group of order paqb, where both p and q are prime numbers, If the se- quence of the quotient groups of a tprincipal series of a finite group G consists of cyclic groups, then the group G is called supersolv- able. A finite group is supersolvable if and only if the index of any maximal subgroup is a prime number (B, Huppert, Math. Z, 60 (1954)), If p is the largest prime divisor of the order of a finite supersolvable group G, then a p-Sylow subgroup of G is a normal subgroup, E. Hall Subgroups A subgroup is called a Hall subgroup if its order is relatively prime to its index (see the theorems of p, Hall on finite solvable groups), There is no general theorem known on the existence of a Hall subgroup. If a finite group G has a normal Hall subgroup N, then G con- tains a Hall subgroup H that is a complement of N in the sense that G = N Hand N n H = I; furthermore, any two complements are conju- gate (Schur-Zassenhaus theorem). The analog of Hall's theorem on finite solvable groups fails for nonsolvable groups, But if a finite group, solvable or not, contains a nilpotent Hall subgroup H of order n, then any sub- group of an order dividing n is conjugate to a subgroup of H (Wie1andt, Math. z., 60 (1954); cf. P. Hall, Proc, London Math, Soc" 4 (1954), for a generalization), There are some gener- alizations of these results for maximal n- subgroups which may not be Hall subgroups, F. n-Solvable Groups Let n be a set of prime numbers, Denote the set of prime numbers not in n by n'. A finite group is called a n-group if all the prime divi- sors of the order belong to n, A finite group is called n-soIvabIe if any composition factor is either a n'-group or a solvable n-group, If n= {p} consists of a single prime number p, we use terms such as p-group or p-solvab[e (in- stead of {p}-solvable), Let G be an-solvable group, A series of subgroups Po == 1 c No  PI ;';:N I  '"  cNl= G defined by the prop- erties that P//N i - I is the maximal normal n- subgroup of G/N i _ 1 and N;/P i is the maximal normal n'-subgroup of G/P i is called the n- series of G, and the integer [ is called the n- 590 length of G, A solvable group is n-solvable for any set n of prime numbers, An-solvable group contains a Hall subgroup which is a n- group and also a Hall subgroup which is a n'- group; an analog of Hall's theorem on finite solvable groups holds. Hall and Higman (Proc, London Math. Soc" 7 (1956» discovered deep relations between the p-length of a p-solvable group and invariants of its p-Sylow subgroup, For example, the p-length is 1 if a p-Sylow subgroup is Abelian. G. Permutation Groups The set of all permutations on a set Q of n elements forms a group of order n! whose structure depends only on n, This group is called the symmetric group of degree n, de- noted by Sn' Any subgroup of Sn is a permuta- tion group of degree n, When it is necessary to mention the set Q on which permutations operate, Sn may be denoted as S(Q), and a subgroup of S(Q) is called a permutation group on Q. The set of n elements on which Sn operates is usually assumed to be {I, 2, "" n}, and an element a of Sn is written as ( 1 2 a- I' 2' :) ( I 2 3 4 5 6 ) for example, 2 3 1 5 4 6' where j' is the image of i by a: i' = a(i), The element a may be written as (.., )(abc '" z)(",), which means that a cyclically maps a into b, b into c, and so on, and finally z back into a, In the example, a = (123)(45)(6). It is customary to omit the cycle with only one letter in it, such as (6) in the example, With this conven- tion, a = (1 23)(45) may be an element of Sn for any n;;:' 5, leaving all the letters i;;:, 6 invariant. A cycle of length I is an element a of Sn which moves [letters ai' ,." a 1 cyclicaIly and fixes all the rest; i,e" a = (ai' "', ad. Then an ex- pression such as a = (1 23)(45) is the same as the product of two cycles (1 23) and (4 5), In general, any permutation can be expressed as the product of mutually disjoint cycles (two cycles (a l ,..., al) and (b l ,..., b m ) are said to be disjoint if a i # b j for all i and j), Furthermore, the expression of the permutation as the prod- uct of mutually disjoint cycles is unique up to the order in which these cycles are written, A cycle of length 2 is called a transposition. Any permutation may be written as a product of transpositions. This expression is not unique, but the parity of the number of transpositions in the expression is determined by the permu- tation, A permutation is called even if it is the product of an even number of transpositions 
591 and odd otherwise, The symmetric group Sn contains the same number of even and odd permutations. The totality of even permutations forms a normal subgroup of order (n!)/2(n  2), calJed the alternating group of degree n and usually denoted by An. An even permutation is the product of cycles of length 3. The alternating group As of degree 5 is the nonsolvable group of minimal order. This fact was known to Galois. If n # 4, then the alternating group An is a simple group and a unique proper normal subgroup of Sn' If n=4, A4 contains a normal noncyclic subgroup V of order 4. In this case A4 and V are the only proper normal sub- groups of S4' A noncyclic group of order 4 is called a (Klein) four-group. If n  5, the sym- metric group Sn is not a solvable group. This is the group-theoretic ground for the famous theorem, proved by Ruffini, Abel, and Galois, which asserts the impossibility of an algebraic solution of a general algebraic equation of degree more than four. If n.s; 4, Sn is solvable. The group S4 has a composition series with composition factors of orders 2, 3, 2, 2, and S4 is realized as the group of motions in 3- dimensional space which preserve an octahe- dron. Hence S4 is calIed the octahedral group. Similarly, A4(As) is realized as the group of motions in space which preserve a tetrahedron (icosahedron); thus A4 is called the tetrahedral group and As the icosahedral group. These groups have been extensively studied in view of their geometric aspect. The group of motions of a plane which preserve a regular polygon is called a dihedral group. If a regular polygon has n sides, then the group has order 2n, Sometimes a Klein four-group is included in the class of dihedral groups (for n == 2). The dihedral groups, octahedral group, etc., are called regular polyhedral groups. A finite subgroup of the group of motions in 3- dimensional space is either cyclic or one of the regular polyhedral groups. A dihedral group is generated by two elements of order 2. Conversely, a finite group generated by two elements of order 2 is a dihedral group. This simple fact has surprisingly many conse- quences in the theory of finite groups of even order [15, ch. 9]. A dihedral group of order 2n contains a cyclic normal subgroup of order n, and hence is solvable, If n # 6, every automorphism of Sn is inner. The order of the group of automorphisms of S6 is twice the order of S6' The index (Sn: H) of a subgroup H of Sn is at least equal to n unless H = An' If (Sn: H) = n, then H is isomorphic to Sn-l' If n # 6, Sn contains a uni'}ue conjugate class of subgroups of index n. But S6 contains two such classes, which are exchanged by an automorphism of S6 ( Section I). 151 H Finite Groups. H. Transitive Permutation Groups A permutation group G on a set Q is called a transitive permutation group if for any pair (a, b) of elements of Q, there exists a permuta- tion of G which sends a into b. Otherwise G is said to be intransitive. Let G be a transitive permutation group on a set Q, and let a be an element of Q. The totality of elements of G which leave a invariant forms a subgroup of G calIed the stabilizer of a (in G). The index of the stabilizer is equal to the number of ele- ments of Q, the degree of G. Thus the degree of a transitive permutation group G divides the order of G (a fundamental theorem), The concept of orbits is important. Let G be a permutation group on a set Q. A subset r of Q is called an orbit of G if it is G-invariant and G acts transitively on r. In other words, a subset r of Q is an orbit of G if the folJowing two conditions are satisfied: (i) If a E rand 9 E G, the image g(a) also lies in r; and (ii) if a and b are two elements of r, there exists an element x of G such that b = x(a). Thus each element x of G induces a permutation <p,(x) on r. The set of all these permutations <p,(X)(XEG) forms a permutation group on r, which may be denoted by <p,(G). Then <p,(G) is transitive on r, and <p, is a homomorphism of G onto <p,(G). Thus the number of elements in an orbit r is a divisor of the order of G. It is clear that the set Q on which G acts is the union of mutually disjoint orbits r l , ..., rr of G, This implies that the degree of G is the sum of the numbers of elements in the orbits r i . The resulting equation often contains non- trivial relations. If <Pi denotes the homomor- phism <P'i defined before, then G is isomorphic to a subgroup of the direct product of the groups <Pi ( G), i = I, 2, ... , r. A transitive permutation group is called regular if the stabilizer of any letter is the identity subgroup {I}. A transitive permuta- tion group is regular if and only if its order equals its degree. Any group can be realized as a regular permutation group (Cayley's theorem). A transitive permutation group which is Abelian is always regular. Let G be a transitive permutation group on a set Q. If the stabilizer of an element a of Q is a maximal subgroup, G is called primitive, and otherwise imprimitive. A normal subgroup, which is #{ I}, of a primitive permutation group is transitive, An imprimitive permuta- tion group induces a decomposition of the set Q into the union of mutually disjoint subsets  I, ... , s (s> 1) such that each i contains at least two elements, and if x E G maps an ele- ment a of i onto an element b of j, then x maps every element of i into j:x(;)= j' The set {I' .., , s} is called a system of im- 
151 H Finite Groups primitivity. A subset!! of 0 is called a block if x() n  equals  or the empty set for all x in G. A block is called nontrivial if  #0 and  contains at least two elements. Each member of a system of imprimitivity is a nontrivial block. A transitive permutation group is primitive if and only if there is no nontrivial block. A permutation group G on a set 0 is called k-transitive (or k-ply transitive, where k is a natural number) if for two arbitrary k-tuples (ai, "', ad and (b l , "', bd of distinct elements of 0, there is an element of G which maps a i into b i for all i = 1, 2, " , , k. If k  2, G is called multi- ply transitive. A doubly transitive permutation group is always primitive. The symmetric group Sn of degree n is n-transitive, while the alternating group An is (n - 2)-transitive for n3. Conversely, an (n-2)-transitive permu- tation group on {l,2, ...,n} is either Sn or An' For multiply transitive permutation groups which are simple, see the list in Section I. If k  6, no k-transitive groups are known at present except Sn and An. If the Schreier con- jecture ( Section I) is true, then there are no 6-transitive permutation groups except Sn and An (Wielandt, Math. Z., 74 (1960); H. Nagao, Nagoya J. Math., 27 (1964); O'Nan, Amer. Math. Soc. Notices, 20 (1973)). Two 5-transitive permutation groups other than Sn and An are known: the groups M I2 and M 24 of degrees 12 and 24, respectively, dis- covered by E. L. Mathieu in 1864 and 1871. The stabilizer of a letter in M I2 (M 24 ) is a 4- transitive permutation group of degree 11 (23), denoted by M II (Md. No 4-transitive permu- tation groups other than Sn' An' M i (i = 11, 12, 23, and 24) are known. The groups M12' MIl> M 24 , M 23 and the stabilizer M n of a letter in M 23 are called Mathieu groups. They are simple groups which have quite exceptional properties. For Mathieu groups, see E. Witt, Abh. Math. Sem. Univ. Hamburg, 12 (1938). A k-transitive permutation group G on 0 of degree n and order n(n - 1)... (n - k + 1) has the property that no nonidentity element of G leaves k distinct letters of 0 invariant. If k  4, such a group is one of the following: Sk> Ak+2' M 12 , and Mil (c. Jordan). For k=2 and 3, see H. Zassenhaus, Abh. Math, Sem. Uriiv. Hamburg, 11 (1936). A multiply transitive permutation group G contains a normal subgroup S such that S is a non-Abelian simple group and G is isomorphic to a subgroup of the group AutS of the auto- morphisms of S, except when the degree n of G is a power of a prime number and G contains a regular normal subgroup of order n which is an telementary Abelian group (W. S. Burn- side). Furthermore, in these exceptional cases, 592 G is at most 2-transitive if n is odd, while G is at most 3-transitive if n is even but more than 4. The symmetric group S4 of degree 4 is the only 4-transitive group that contains a proper solvable normal subgroup. A transitive extension of a permutation group H on 0 is defined as follows, Let 00 be a new element not contained in O. A transitive exte!lsion G of H is a transitive permutation group on the set {O, oo} in which the stabilizer of 00 is the given permutation group H on O. Transitive extensions do not exist for some H. Suppose that a permutation group H admits a transitive extension G that is primitive. If H is simple, then G is also simple unless the degree of G is a power of a prime number. Construct- ing transitive extensions has been an effec- tive method for constructing sporadic simple groups. Permutation groups of prime degree have been studied extensively since the last century, partly because of their connection with al- gebraic equations of prime degree. Let p be a prime number. A transitive permutation group of degree p is either multiply transitive and nonsolvable or has a normal subgroup of order p with factor group isomorphic to a cyclic group of order dividing p - 1 (Burnside), Choose two cycles x and y of length p in Sp' If y is not a power of x, then the subgroup <x, y) generated by x and y is a multiply transitive permutation group which is simple. The struc- ture of <x, y) is not known despite its simple definition. More attention has been paid to groups of degree p, where p is a prime number such that (p-l)j2==q is another prime number. The problem is to decide if such nonsolvable groups contain the alternating group Ap. The Mathieu groups Mil and M 23 are the only known exceptions for p> 7. The search for additional exceptions has been aided by the development of high-speed computers. It is known that there is no exceptional group of degree p = 2q + 1 for 23 < p.s; 4079 (P. J. Nikolai and E. T. Parker, Math. Tables Aids Comput., 12 (1958); see N, Ito, Bull. Amer. Math, Soc" 69 (1963), for further results in this direction). A Frobenius group is a nonregular transitive permutation group in which the identity is the only element leaving more than one letter invariant. A Frobenius group of degree n contains exactly n - 1 elements which displace all the letters. These n - 1 elements together with the identity form a regular normal sub- group of order n. This is a theorem of Frobe- nius; all the existing proofs depend on the theory of characters. The regular normal subgroup of a Frobenius group is nilpotent (J. G. Thompson, Proc. Nat. Acad Sci. US, 45 
593 (1959); [15, ch. 10; 16, ch. 3J). A Zassenhaus group is a transitive extension of a Frobenius group. I. Finite Simple Groups All simple groups of finite order were com- pletely classified in February 1982 ( Section J; [23]). These are divided into the following four classes: (I) cyclic groups of prime order, (2) alternating groups of degree :;:'5, (3) sim- ple groups of Lie type, and (4) other simple groups. The subclass (1) consists of cyclic groups of prime order p for any prime number p. Abelian simple groups belong to this subclass. The sub- class (4) consists of twenty-six sporadic simple groups including five Mathieu groups. All sporadic groups, other than the five Mathieu groups, are of recent discovery. Simple groups of Lie type are analogs of simple Lie groups, and include the classical groups as well as the exceptional groups and the groups of twisted type. Classical groups are divided into four types: tlinear, tunitary, tsymplectic, and torthogonal ( 60 Classical Groups). Let q = pr be a power of a prime number p. Consider a vector space V of dimension n:;:' 2 over the field Fq of q elements, except in the unitary case where V is a vector space of dimension n:;:' 2 over the field Fq2 of q2 elements. Let f be a non degener- ate form on V which is tHermitian in the uni- tary case (with respect to the automorphism of order 2 of Fq2 over F q ), tskew symmetric bilinear in the symplectic case, and tquadratic in the orthogonal case. In the orthogonal case, the dimension of V is assumed :;:,3. Consider the group of all linear transformations of V (linear case) of determinant 1, or the group of all linear transformations of determinant 1 which leave the form f invariant (in other cases). In the orthogonal case, take the com- mutator subgroup. With each of these groups, the factor group of it by its center is a simple group with a few exceptions. There are several notations to denote these groups. E. Artin's notation for simple groups, which is reasonably descriptive and simple, follows the name of the simple group: nand q are as described in the preceding paragraph, y is the order of the simple group, and (a, b) denotes the greatest common divisor of two natural numbers a and b. Linear simple group, Ln(q): n g=qn(n-I)/2 fl (qi-l)jd, d=(n,q-l). ;=2 151 I Finite Groups Unitary group, Un(q): n g=qn(n-l)/2 fl (qi_( -INd, d=(n,q+ 1). ;=2 The structure of unitary groups does not de- pend on the form. Symplectic group, Sn(q), n=2m: m g=qm 2 fl (q2i_l)jd, d=(2,q-l). ;=1 In the symplectic case, the dimension n of the space V must be even, so n = 2m, and the struc- ture does not depend on the form. Orthogonal group in odd dimension n= 2m + 1, 02m+1 (q): m g=qm 2 fl (q2i_l)jd, d=(2,q-l). i=l The structure does not depend on the form in odd dimension. Orthogonal groups in even dimension n = 2m: There are two inequivalent forms, one with tindex m (which is maximal) and the other with index m-l. The two orthogonal groups are denoted by 02m(€, q), €=:I: 1, where €= 1 if the form is of maximal index and -1 other- wise. Then m-l g=qm(m-l)(qm_€) fl (q2i_l)jd, i=l d=(4,qm-€). The value of € is determined by the formf: €= I if f is equivalent to L7'=1 X 2 i-! X 2i ' and €= -1 iffxi+yxlX2+X+L7'=2X2i-IX2i' where the polynomial t 2 + yt + 1 is irreducible over Fq. There are other ways to denote these groups. Let X =X(*, *) be a group of nonsin- gular linear transformations of a vector space V. Two asterisks indicate two invariants, such as the dimension of V and the number of ele- ments in the ground field. The notation SX stands for the subgroup of X consisting of linear transformations with determinant 1, and the notation PX stands for the factor group of the linear group X by its center. Thus PX is a subgroup of the group of all projectivities of the projective space formed by the linear subspaces of V. The following list is self- explanatory, except the last term in each row, which is the notation ofL. E. Dickson [1]: Ln(q)= PSL(n, q)=LF(n, q) Un(q)= PSU(n, q)= HO(n,q2) S2n(q)= PSp(n, q)= A(2n,q). (LF: linear fractional group; HO: hyperortho- gonal group; A: Abelian linear group,) Iff is a non degenerate quadratic form, then the sub- 
1511 Finite Groups group of GL(n, q) consisting of a1\ the elements leaving the form f invariant may be denoted by O(n, qJ). Let Q(n, q,f) denote the commu- tator subgroup of O(n, q, f). Set 8 == 1 if f is of maximal index, and 8 == -1 otherwise. Then 0n(8, q)= PQ(n, qJ). Dickson's notation for orthogonal groups is complicated and seldom used. Finite simple groups corresponding to Lie groups of some exceptional type were studied by Dickson early in this century, but C. Chevalley (Tohoku Math. J., (2) 7 (1955)) proved the existence, simplicity, and other properties of groups of any (exceptional) type over any field by a unified method. Simple Lie algebras over the field C of complex numbers are completely classified, and according to the classification theory they are in one-to-one correspondence with the tDynkin diagrams. Let L be a simple Lie algebra (over C) corre- sponding to the tDynkin diagram of type X ( 248 Lie Algebras). Let L= Lo + L L, be a Cartan decomposition of L, where ex ranges over the troot system  of L. It is pos- sible to choose a basis B of L with the following properties (tChevalley's canonical basis): B consists of e,E L,(exE) and a basis of Lo; the structure constants of L with respect to B are all rational integers; the automorphism x,() in the tadjoint group defined by x,()==exp(ade,) «(EC) maps each element of B into a linear combi- nation of elements of B with coefficients which are polynomials in  with integer coefficients. Thus the matrix A,() representing the trans- formation x,() with respect to B has coeffi- cients which are polynomials in  with integer coeffi ci en ts. The elements of B span a Lie algebra Lz over the ring Z of integers. Let F be a field and form L F = F @ zLz. Then L F is a Lie algebra over F, and the set B may be identified with a basis of L F over F, Let t be any element of F, A,(t) be the matrix obtained from A,() by replacing the complex variable  by the ele- ment t, and finally x,(t) be the linear trans- formation of L F represented by the matrix A,(t) with respect to B. The group generated by the x,(t) for each root ex and each element t of F is called the Chevalley group of type X over F. The commutator subgroups of the Chevalley groups are simple, with a few excep- tions which will be stated after the complete list of simple groups of Lie type. Suppose that X = An' Dn, or E6 ( 248 Lie Algebras S). Then the Dynkin diagram of type X has a nontrivial symmetry; let it be ex-->p, Suppose that the field F has an automorphism 0" of the same order as the order of the symmetry of the 594 Dynkin diagram. Let e be the automorphism of the Cheva1\ey group which sends x,(t) to xpW). Let U (resp. V) be the subgroup of the ChevaIley group generated by x,(t) with ex> 0, tEF (xp(t),[3<O), and let U 1 (V 1 ) be the sub- group consisting of all the elements of U (V) which are left invariant bye. The group gen- erated by Uland V 1 is called the group of twisted type. If the order of 0" is i, this group is said to be of twisted type iX. In all but one case, the group of twisted type is simple (see R. Steinberg, Pacific J. Math., 9 (1959)), The value i is 2 except when X = D4' Since D 4 admits symmetries of orders 2 and 3, there are two twisted types. If X =B2' G 2 , or F4' then the diagram has a symmetry. If the character- istic p of the ground field F is 2, 3, or 2 accord- ing as X =B2' G 2 , or F4 and the field F has an automorphism 0" such that (ta)a = t P for any t E F, then a procedure similar to the one de- scribed before is applicable, and the group of twisted type X' is obtain'ed (R. H. Ree, Amer. J. Math., 83 (1961)). The group of twisted type is simple if the field F has more than three elements. The following list contains all the simple groups of Lie type. For each classical group, we list the type foIl owed by identification: An = Ln+l (q) (nl) 2 An= U n + 1 (q) (nl) Bn = 02n+1 (q) (n>l) C n =S2n(q) (n>2) Dn = 02n(1, q) (n>3) 2D n =02n( -1,q) (n>3) For other groups the type of the group is followed by the customary name or notation, if any, and the order g: B Suzuki group, Sz(q), q=2 2n +1 g=q2(q_1)(q2 + 1) 3D 4 g=q12(q8+ q 4 + l)(q6_1)(q2_1) G 2 g==q6(q6_1)(q2_1) G Ree group, Re(q), q=3 2n + 1 g==q3(q3+ l)(q-1) F4 g== q 2 4(q12 _1)(q8 _1)(q6 -1) x(q2-1) F q==2 2n + 1 g=q12(q6+ l)(q4-1) x (q3+ l)(q-l) E6 dg=q36(qI2_1)(q9_1)(q8_1) x (q6 _1)(q5 _1)(q2 -1) d=(3,q-1) 
595 2£6 dg=q36(qI2_I)(q9+1)(q8_1) X (q6 _ l)(q5 + 1 )(q2 -1) d=(3,q+ I) E7 dg=q63(qI8_I)(qI4_I)(qI2_1) x(qlO_1)(q8_1)(q6_1)(l_l) d=(2,q-l) E8 g=q120(q30_I)(q24_I)(q2°_I) X (qI8_I)(qI4_I)(q12_1) x (q8 _1)(q2 -1). B: M, Suzuki, Proc. Nat. Acad. Sci. US, 46 (1960); G and F: Ree, Amer. J. Math., 83 (1961): G 2 : Dickson, Trans, Amer, Math. Sac" 2 (1901), Math. Ann., 60 (1905); other Chevalley groups: Chevalley, T6hoku Math. J., (2) 7 (1955); twisted types: Steinberg, Pacific J. Math" 9 (1959), J, Tits, Seminaire Bourbaki (1958), Publ, Math. Inst. HES (1959), D. Hert- zig, Amer. J, Math., 83 (1961), Proc. Amer. Math. Soc., 12 (1961). Nonsimple cases: L 2 (2), L 2 (3), U 3 (2), and Sz(2) are solvable groups of orders 6, 12, 72, and 20, respectively. The groups 0 5 (2), G 2 (2), G(3), and F(2) contain normal subgroups of indices 2, 2, 3, and 2, respectively, These normal subgroups are simple and identified as L 2 (9), U 3 (3), L 2 (8) in the first three cases. The normal subgroup of F(2) is not in the list of simple groups of Lie type and is quite excep- tional (Tits's simple group, Ann. Math., (2) 80 (1964)). Isomorphisms between various simple groups: L 2 (q)= U 2 (q)=S2(q)=03(q); Os(q)= S4(q); 0 4 (1, q)= L 2 (q) x L 2 (q); 04( -1, q) = L 2 (q2); 0 6 (1, q) = L 4 (q); 06( -1, q) = U 4 (q); 02n+1 (q) = S2n(q) if q is a power of 2; L 2 (2) = S3; L 2 (3)=A 4 ; L2(4)=L2(5)=As; L 2 (7)=L 3 (2); L 2 (9)=A 6 ; L4(2)=A8: U 4 (2)=S4(3). If q is odd and 2n:;:'6, then S2n(q) and 02n+l(q) have the same order but are not isomorphic. L 3 (4) and L4(2) have the same order but are not isomor- phic. There is no other isomorphism or coin- cidence of orders among the known simple groups (Artin, Comm. Pure Appl. Math., 8 (1955», The groups of the automorphisms of simple groups belonging to subclasses (I), (2), and (3) are known. For the simple groups of Lie type, see Steinberg, Canad, J, Math., 10 (1960), The following list of twenty-six groups con- sists of all the simple groups that belong to class (4): Five Mathieu groups whose orders are M l1 :g=7,920=2 4 '3 2 '5'11 M 12 :g=95,040==2 6 . 3 3 . 5.11 M 22 : 9 =443,520=27. 3 2 . 5, 7, 11 1511 Finite Groups M 23: 9 == 10,200,960= 27.3 2 .5' 7.11.23 M 24 :g=244,823,040= 2 10 . 3 3 . 5, 7.11 .23. The other twenty-one groups have been discovered since 1964. Each group is identified by the symbol (x)j, indicating that it is the ith group discovered in the year 19x. The list continues with the name or names of dis- coverers, the order of the group, and a brief description. (64)1: Z. Janko, g= 175,560= 2 3 .3. 5, 7' 11 . 19. A subgroup of the Chevalley group G 2 (1 1). See J. Algebra, 3 (1966). (67h: M. Hall and Z. Janko, 9 == 604,800= 2 7 3 3 5 2 7 t . . . . . " a ransltlve extensIOn of U 3 (3) of degree 100. (67)2: D. G. Higman and Sims, g= 44,352,000 = 2 9 . 3 2 . 53 .7, II, a transitive extension of M 22 of degree 100, It is a normal subgroup of index 2 in the group of automor- phisms of a certain graph with 100 vertices. (67h: Suzuki, 9 =448,345,497,600= 2 13 , 3 7 . 52. 7.11. 13, a transitive extension of G 2 (4); defined from the automorphism group of a graph of 1782 vertices. (67k J. McLaughlin, g=898,128,000= 2 7 . 3 6 .5 3 . 7' 11, a transitive extension of U 4 (3); defined from a graph of 275 vertices. (68)1: G, Higman, Z. Janko, and 1. McKay, 9 = 50,232,960 = 27. 3 5 .5, 17' 19, a transitive extension of the group which is obtained from L 2 (16) by adjoining the field automorphism of order 2. The existence was verified by using a computer. (68b (68h, (68k], H. Conway, 9 == 2 21 .3 9 . 54. 7 2 . 11. 13.23 = 4, 157,776,806,543,360,000, !J = 2 18 .3 6 . 53. 7 . 11 . 23, 9 = 2 10 . 3 7 . 53, 7.11 . 23. The big group is obtained from the automor- phism group of a lattice in 24-dimensional space, and the two smaller ones are subgroups of it. The lattice was defined by 1. Leech in connection with a problem of close packing of spheres in 24 dimensions (Canad. J. Math., 19 (1967». (68)s: B, Fischer, ,q=2 17 . 39. 52. 7.11.13= 70,321,751,654,400, a transitive extension of U 6 (2) derived by means of a certain graph. (69)1: D. Held and others, g=2 10 . 3 3 . 52. 73. 17 = 4,030,387,200. (69}z: B. Fischer, 9 = 2 18 . 3 13 . 52. 7.11 . 13. 17. 23 = 4,089,470,473,293,004,800. (69h: B. Fischer, 9 = 2 21 . 3 16 , 5 2 .7 3 . 11 . 13. 17' 23.29 = 1,255,205,709,190,661,721,292,800. (71)1: R. N. Lyons and C. C. Sims, g= 2 8 . 3 7 . 56. 7.11. 31, 37.67. The existence of(69)1 and (71)1 was verified by 
151 J Finite Groups using computers; (69)2 and (69h were derived by means of certain graphs. For a more de- tailed account of these simple groups, see J. Tits, Semi/wire Bourbaki (1970), No, 375, and the references [18, 19, 20.J (72)1: A, Rudvalis, 9 = 2 14 . 3 3 . 5 3 .7' 13, 19, a transitive extention of Tits's simple group, i.e., a normal subgroup of F(2) of index 2. Concerning this group (72h, see the article of J. H. Conway and D. B. Wales, J. Algebra, 27 (1973). (73)1: M. O'Nan, g=2 9 . 34. 5.7 3 .11.19.31. This group (73)1 was discovered by O'Nan and the existence was verified by C. Sims, using a computer. (73)2: B. Fischer, 9 = 2 41 , 3 1 3. 56 . 7 2 . 11 . J3 . 19.23.31'47. (73h= B. Fischer and R. Griess, 9 = 2 46 . 3 20 . 5 9 '7 6 '11 2 . 13 3 '17' 19.23.29'31'41 ,47, 59 . 71. (74)1: J. G. Thompson, g=2 IS . 310. 53 ,7 2 . 13,19,31. (74)z: K. Harada, g=2 l4 . 3 6 . 56. 7' 11'19. The existence of (73)z was suggested by B. Fischer, and then that of (73h by B. Fischer and R. Griess. Shortly after this, the exis- tence of (74)1 and (74)z was suggested by J. G. Thompson, and then Thompson and Harada proved the existence of these groups with the aid of P. Smith and S. Norton, using com- puters, The existence of (73)z was established in 1976 by Leon and Sims, using a computer. Very recently (July 1980), R. Griess has an- nounced that the group (73h is realized as a group of automorphisms of a 196,883- dimensional commutative nonassociative algebra over the rational numbers, (75h: Z. Janko, g=2 21 . 3 3 . 5. 7'11 3 .23' 29. 31.37. 43. This group (75)1 was discovered by Z, Janko. The existence was verified by using a computer. For a more detailed account of the twenty- six sporadic simple groups, see [21]. Simple groups of order <1000 are As (g= 60), L 2 (7) (168), A 6 (360), L 2 (8) (504), and L 2 (11) (660). AII simple groups of order 20,000 are known. Among the known simple groups, the fol- lowing multiply transitive permutation repre- sentations are known: Alternating groups An (degree 11), As and A7 (degree 15), A6 (degree 10), As (degree 6), Mathieu groups M i (degree i), MIl (degree 12), Ln(q) (degree (qn -1)/ (q-I)), L 2 (p) (degree p for p= 5, 7,11), U 3 (q) (degree 1 +q3), Sz(q) (degree 1 +q2), Re (3 n ) (degree 1 + 3 3n ), S2n(2) (degrees 2 n - I (2 n :l: 1)), and the Higman-Sims group (67)z (degree 176) and, the Conway group (68)4 (degree 276). Among them, An (degree n, n;;:' 5), M i (degree i), Mil (degree 12), L 2 (2 m ) (degree 1 + 2 m ), and L 2 (5) (degree 5) are triply transitive. 596 There are several remarkable properties of known finite simple groups which have been conjectured to hold for arbitrary finite simple groups. One of the most famous is the Schreier conjecture, which asserts that the group of outer automorphisms of a simple group is solvable. This has been verified for all known cases. Another conjecture says that a finite simple group is generated by two elements. This has also been verified for almost all known groups. In many cases, there is a gen- erating set of two elements, one of which has order 2. There is no counterexample known to disprove the universal validity of this property. Except for Sz(q), the orders of known simple groups are divisible by 12. J. Classification of Finite Simple Groups The objective of classification theory is to find the complete list of finite simple groups; this was accomplished in February 1982, following the series of important works mentioned below. The order of a finite non-Abelian simple group is divisible by at least three distinct prime numbers (W. S, Burnside;  Section D). The order of a finite non-Abelian simple group is even (W. Feit and J. G. Thompson;  Sec- tion D). These theorems are special cases of the following theorem: If G is a finite non- Abelian simple group in which the normalizer of any solvable subgroup # {1} is solvable, then G=L 2 (q)(q>3), Sz(2 2n + l )(n;;:' I), A 7 , L3(3), U 3 (3), Mil, or Tits's simple group, In particular, a minimal simple group is isomor- phic to L 2 (p) (p=2 or 3 (mod 5), p>3), L 2 (2 P ), L 2 (3 P ), Sz(2 P ), or L3(3), where p is a prime number (a finite non-Abelian simple group is called a minimal simple group if alI proper subgroups are solvable). This theorem is proved in a series of papers by J. G. Thomp- son (Bull. Amer. Math. Soc., 74 (1968), Pacific J. Math., 33 (1970),39 (1971), 48 (1973), 50 (1974), and 51 (1974)). The method which Thompson used in these papers has since been generalized in various ways by many authors to establish a number of important theorems. There are also some interesting consequences of this theorem concerning solvable groups. For example, a finite group is solvable if and only if every pair of elements generates a sol- vable subgroup. Let G be a non-Abelian simple group of even order and S one of its 2-Sylow subgroups. Then S is neither cyclic nor a generalized quaternion group (W. S. Burnside, R. Brauer, and M. Suzuki). If S is a dihedral group, then G =A7 or L 2 (q) (q odd ;;:'5) (D. Gorenstein and J. H. Walter). These theorems deal with the 
597 cases where S is "small." The study in this direction has culminated in the classification of finite simple groups all of whose 2-subgroups are generated by at most four elements (D. Gorenstein and K. Harada, Mem. Amer. Math. Soc., 147 (1974)). If a 2-Sylow subgroup of a finite non- Abelian simple group G is an Abelian group, then G=L 2 (r) (r=O, 3 or 5 (mod 8), r> 3), or else G possesses an element of order 2 whose centralizer is isomorphic to Z2 x L 2 (q) (q = 3 or 5 (mod8), q> 3) (1. H. Walter). In the latter case, G is called a group of Janko-Ree type (1-R type for short), and if q # 5, it is called a group of Ree type. If q= 5, then G=(64h, the Janko's simple group of order 175,560. The Ree groups Re(q) are groups of Ree type. Since the discovery of Re(q), it has long been an open problem to show that there are no other groups of Ree type. Very recently, the com- bined work of E. Bombiere, J. G. Thompson and others settled the problem (Inventiones Math., 58 (1980)). A surprisingly short proof of the Walter's theorem above is given by H. Bender (Math. z., 117 (1970)). Bender's method applies to a much larger class of groups. A subgroup A of a finite group G is said to be strongly closed if AqnNG(A)cA for each gEG. D. M. Gold- schmidt proved (Ann. Math., 99 (1974)) that if A is a strongly closed Abelian 2-subgroup, then the subgroup Go generated by the con- jugates of A possesses a normal series Go =:> G 1 =:> G 2 with the properties: G 2 is of odd order, GdG 2 is a 2-group and is contained in the center of G O /G 2 , and either Go = G! or GO/G I is the direct product of simple groups on the following list: L 2 (q) (q=O, 3 or 5 (mod 8), q> 3), Sz(2 2n - I ), U 3 (2 n ) (n> 1), and the groups of J-R type. Furthermore, AG 2 /G 2 =:> GdG 2 and AGI/G I is the center of a 2-Sylow sub- group of Go/G!. This theorem generalizes an earlier result of G. Glauberman (the so- called Z*-theorem) which states that if A is a strongly closed subgroup of order 2, then the image of A in the quotient group G/K by the maximal normal subgroup of odd order is a normal subgroup and hence is contained in the center of G/K. These theorems of Glauber- man and Goldschmidt are of fundamental importance in the study of finite simple groups since they provide an effective tool for showing that a given group has a normal Abelian 2- subgroup. Glauberman obtained a criterion for the existence of a strongly closed Abelian p- subgroup # {1} for some prime p [15,22]. For any prime p, the quadratic group Qd(p) is defined to be the semidirect product of the 2- dimensional vector space V(2, p) over the field of p elements by the special linear group 151 J Finite Groups SL(2, p) where the action of SL(2, p) on V(2, p) is taken to be the natural one. A finite group G contains a strongly closed Abelian p-subgroup A # {I} if no section of G is isomorphic to Qd(p) (a section is a quotient group of a sub- group). Furthermore, if p is odd, then we can choose as A a characteristic subgroup of a p- Sylow subgroup S of G which is determined only by the structure of S. Therefore a finite non-Abelian simple group has a section iso- morphic to Qd(2)= S4 except when it is one of the simple groups mentioned in Goldschmidt's theorem. This theorem generalizes an un- published result of J. G. Thompson to the effect that 3 divides the order of finite non- Abelian simple groups except Sz(2 2n + I ). Let G be a 2-transitive permutation group on n + 1 letters, and assume that the stabilizer H of a letter contains a normal subgroup K which is regular on the remaining n letters. Then G contains a normal subgroup N such that G is isomorphic to a subgroup of the automorphism group of N and either N = L 2 (q), U 3 (q), Sz(q) or a group of Ree type, or else N is 2-transitive on the n + 1 letters and no nonidentity element of N leaves two dis- tinct letters invariant (the structure of N in the latter case is also known (H. Zassenhaus;  Section H). This theorem is proved by E. Shult for n even (Illinois J. Math., 16 (1972)) and by C. Hering, W. M. Kantor, and G. M. Seitz for n odd (J. Algebra, 20 (1972)). Its proof depends on the work of many authors who considered various special cases, especially the work of H. Zassenhaus, W. Feit, N. Ito, and M. Suzuki on the classification of Zassenhaus groups, and the work of M. Suzuki on the case where n is even and H/K is of odd order (Ann. Math., 79 (1964)). In this special case which Suzuki handles, the stabilizer H is of even order and H n Hg is of odd order for any 9 E G - H. If a proper subgroup H of an arbitrary finite group G has this property, then H is called a strongly embedded subgroup. Extend- ing the work of Suzuki, H. Bender proved the following theorem (J. Algebra, 17 (1971 )): If a finite group G possesses a strongly embedded subgroup, then either (i) a 2-Sylow subgroup of G is cyclic or a generalized quaternion group, or (ii) G possesses a normal series G = Go =:> G! =:> G 2 such that GO/G I and G 2 are of odd order and GdG 2 =L 2 (2 n ), U 3 (2 n ), or Sz(2 2n - l ) (n> 1). This theorem generalizes another theorem of Suzuki who reached the same conclusion under the assumption that two distinct 2-Sylow subgroups have only the identity element in common. Bender's theorem is of fundamental importance in the classifica- tion theory of finite simple groups, since a strongly embedded subgroup often appears as an obstacle to the proofs of classification 
151 Ref. Finite Groups theorems. For a generalization of Bender's theorem, see a paper by M. Aschbacher (Prac. Amer. Math. Sac" 38 (1973)), which also con- tains an alternative proof of Shult's theorem. The theorem of Shult, Hering, Kantor, and Seitz may be interpreted as a classification of finite groups having a split (B, N )-pair of rank I. Let G be a finite group and let Band N be subgroups of G such that (i) Band N generate G, (ii) T= B n N is a normal subgroup of N, and (iii) W=N/Tis generated by a set S ofele- ments of order 2 such that sBs # Band sBw c BwBU BswB for each SES and each WE W. The subgroups Band N are called a (B, N)- pair of G (the quadruplet (G, B, N, S) is called a Tits system), and the cardinality of the set S is called the rank of the (B, N)-pair. The (B, N)-pair is said to be split if B has a normal subgroup V such that B= TV and Tn V = {I}, and is said to be saturated if T= nnEN B n . If a finite group G has a split saturated (B,N)-pair of rank 1, and if Z = ngEG Bg, then G/Z is a 2- transitive permutation group satisfying the assumption of the theorem of Shult, Hering, Kantor, and Seitz, and information is obtained on the structure of G. In general, the simple groups of Lie type are characterized as simple groups with certain (B, N)-pairs. For (B, N)- pairs of rank 2, see papers by P. Fong and G. M. Seitz (Inventianes Math., 21 (1973),24 (1974)). J. Tits has developed a satisfactory theory on finite groups having a (B, N )-pair of rank at least 3 (Lecture notes in math. 386, Springer). Let G be a finite group generated by a conjugate class D of elements of order 2, and let n be the set of positive integers consisting of the orders of the products of two distinct elements of D. Furthermore, assume that G has no nontrivial solvable normal subgroup. B. Fischer proved (Inventianes Math., 13 (1971)) that if n = {2, 3}, then G contains a normal subgroup isomorphic to one of the following groups: An' S2n(2), 02n(:!: 1,2), O,.(:!: 1,3), V n (2), and the three Fischer's simple groups (68Js, (69}z, (69h. For a generalization of this theorem, see papers by M. Aschbacher (Math. Z., 127 (1972), J. Algebra, 26 (1973)). The most powerful result in this direction is given by F. Timmesfeld (J. Algebra, 33 (1975), 35 (1975)): Suppose n consists of 2,4, and odd positive integers. Furthermore, assume that if d and e are in D and de is of order 4, then (de)2 E D. Then G= A 6 , V 3 (3), the Hall-Janko group (67)1' Ln(q)(n3), 02n+1(q)(n3), 02n(:!: 1, q)(n  4), G 2 (q), 3D 4 (q), F 4 (q), 2E 6 (q), E 6 (q), E 7 (q), or E 8 (q), where q=2 m . Let G be a simple group of even order and let H be the centralizer of an element of order 2. Then the order of G is bounded by a func- tion f of the order h of H (for example, we can 598 choosej(h)={h(h+ I)}!; R. Brauer and K. A. Fowler, Ann. Math., 62 (1955)). In particular, there exist only finitely many isomorphism classes of finite simple groups which contain an element of order 2 with a given centralizer H. This fact is a ground for Brauer's program of studying simple groups of even order in terms of the structure of the centralizers of elements of order 2. There are a number of important results concerning Brauer's pro- gram [20]. For example, nine sporadic simple groups were discovered in related works. Since 1973, Brauer's program has been improved greatly by M. Aschbacher, D. Gorenstein, and others, and the classification of finite simple groups was finally completed in February 1982 [21,23]. References [IJ 1. E. Dickson, Linear groups with an exposition of the Galois field theory, Teubner, 1901 (Dover, 1958), [2J W. Burnside, Theory of groups of finite order, Cambridge Univ. Press, second edition, 1911. [3] H. Zassenhaus, Lehrbuch der Gruppen- theorie, Teubner, 1937; English translation, The theory of groups, Chelsea, 1958. [4J A. Speiser, Die Theorie der Gruppen von endlicher Ordnung, Springer, third edition, 1937. [5J W. Specht, Gruppentheorie, Springer, 1956. [6J M. Suzuki, Structure of a group and the structure of its lattice of subgroups, Erg. Math., Springer, 1956. [7J H. S. M. Coxeter and W. O. J. Moser, Generators and relations for discrete groups, Erg. Math., Springer, 1957. [8J M. Hall, The theory of groups, Macmillan, 1959. [9J C. W, Curtis and I. Reiner, Representation theory of finite groups and associative al- gebras, Interscience, 1962. [lOJ W. R, Scott, Group theory, Prentice-Hall, 1964. [IIJ M. Hall and J. K. Senior, The groups of order 2 n (n.s; 6), Macmillan, 1964. [12J H. Wielandt, Finite permutation groups, Academic Press, 1964. [13] J. Dieudonne, Sur les groupes c1assiques, Actualites Sci. Ind., Hermann, 1948. [14J 1. Dieudonne, La geometrie des groupes c1assiques, Erg. Math., Springer, 1955, [15J D. Gorenstein, Finite groups, Harper & Row, 1968. Second edition, Chelsea, 1980. [16J D. Passman, Permutation groups, Ben- jamin, 1968. 
599 [17] B. Huppert, Endliche Gruppen, Springer, 1967. [18] R. Brauer and C. H. Sah (eds.), Theory of finite groups, a symposium, Benjamin, 1969. [19] M. B. Powell and G. Higman, (eds.), Finite simple groups, Academic Press, 1971. [20] W. Feit, The current situation in the theory of finite simple groups, Actes Congr. Intern. Math., 1970, Nice, Gauthier-Villars, vol. I., p. 55-93. [21J D. Gorenstein, The classification of finite simple groups I, Bull. Amer. Math. Soc., 1 (1979),43-199. [22] G. Glaubermann, Factorization in local subgroups of finite groups, Regional confer- ence series in mathematics, no. 33 (1977). [23] D. Gorenstein, Finite simple groups, Plenum, 1982. 152 (VI1.18) Finsler Spaces A. Definitions Let T(M) be the ttangent vector bundle of an n-dimensional tdifferentiable manifold M. An element of T(M) is denoted by (x,y), where x is a point of M and y is a ttangent vector of M at x. Given a t10cal coordinate system (x 1 , '" ,xn) of M, we can obtain a local coordinate system of T(M) by regarding (Xl, "', x n , yl, ..., yn)= (xi,yJ) as coordinates of the pair (X,Y)E T(M), where (Xl, "', x n ) are coordinates of a point x of M and y=Ly J 8/8x J . A continuous real- valued function L(x,y) defined on T(M) is called a FinsIer metric if the following condi- tions are satisfied: (i) L(x, y) is differentiable at y #0; (ii) L(x, AY)= 1}.IL(x,y) for any element (x,y) of T(M) and any real number ..1.; and (iii) if we put gij(x, y) =(1/2)8 2 L(x, y)2/8yi8 y J, the symmetric matrix (gij(x, y)) is positive de- finite. A differentiable manifold with a Finsler metric is called a Finsler space. There exists a Finsler metric on a manifold M if and only if Mis tparacompact. We call F(x,y)= L(X,y)2 the fundamental form of the Finsler space. When F(x, y) is a quadratic form of (y', ..., yn), L(x,y) is a tRiemannian metric, and F(x,y)= Li,Jgij(x)yii. Therefore a Finsler space is a Riemannian space if and only if gij does not depend on y. The matrix gij is also called the fundamental tensor of the Finsler space (i,j = 1, ... , n). Thus the notion of a Finsler metric is an extension of that of Riemannian metric. The study of differentiable manifolds utilizing such generalized metrics was considered by B. Riemann, but he stated that a Riemannian 152 C Finsler Spaces metric is more convenient for the purpose since "only non geometrical results can be obtained" by using Finsler metrics [7]. P. Finsler initiated the systematic study of Finsler metrics and extended to a Finsler space many concepts and theorems valid in the classical theory of curves and surfaces [5]. B. The Finsler Metric In a Finsler space, the arc length of a curve x=x(t) (a.s;t.s;b) is given by SL(x,dx/dt)dt. Therefore a tgeodesic in a Finsler space is defined as a tstationary curve for the problem oftvariation b SL(x, dx/dt)dt = 0, and the differential equation of the geodesic is given by d 2 x i . ( dX ) dXJ dXk dt 2 + t Y1k x'dt dtdt=O, where yJdx, y) is the tChristoffel symbol of giJ, i.e., . 1" . ( 8 g ia 8gak 8 9 Jk ) YMx,y)=2L;:g'" 8x k + 8xJ - 8xa ' where (gij(x, y)) is the inverse matrix of (gij(x,y)). The distance between two points in a Fin- sler space is defined, as in a Riemannian space, as the infimum of the lengths of curves joining the two points. Many properties of Riemann- ian spaces as metric spaces can be extended to Finsler spaces. The topology defined by the Finsler metric coincides with the original topology of the manifold. A Finsler space M is said to be tcomplete if every Cauchy sequence of M as a metric space is convergent. The following three conditions are equivalent: (i) M is complete; (ii) each bounded closed subset of M is compact; (iii) each geodesic in M is infinitely extendable. In a complete Finsler space, any two points can be joined by the shortest geodesic. A diffeomorphism <p of a Finsler space M preserves the distance between an arbitrary pair of points if and only if the transforma- tion on T(M) induced by <p preserves the Finsler metric L(x, y). Such a transformation is called an tisometry of the Finsler space. In the tcompact-open topology the set of all isome- tries of a Finsler space is a tLie transformation group of dimension at most n(n + 1 )/2. If a Finsler space admits the isometry group of dimension greater than (n(n -1) + 2)/2, it is a Riemannian space of constant curvature [8]. C. The Theory of Connections An important difference between a Finsler space and a Riemannian space relates to their 
152 Ref. Fins\er Spaces properties with respect to the theory of tcon- nections. In the case of a Riemannian space, the Christoffel symbols constructed from the fundamental tensor are exactly the coefficients of a connection, whereas in the case of a Fin- sler space, the Christoffel symbols Y]k(X, y) do not define a connection, for the fundamental tensor gij depends not only on the points of the space but also on the directions of tangent vectors at these points, When we consider notions such as tensors, etc., in a Fins1er space M, it is generally more convenient to take the whole tangent vector bundle T(M) into consideration rather than restricting ourselves to the space M. For ex- ample, let P be the ttangent n-frame bundle over a Finsler space M and Q=p-l(P) be the tprincipa1 fiber bundle over T(M) induced from P by the projection p of T(M) onto M. We call the elements of fiber bundles asso- ciated with Q tensors. In this sense, the fun- damental tensor gij in a Fins1er space is the covariant tensor field of order 2. Therefore it is natural to consider a connection in a Fins1er space as a connection in the principal fiber bundle Q. The connection in a Fins1er space defined by E. Cartan is exactly of this type [3]. Namely, he showed that by assigning to a connection in Q certain conditions related to the Fins1er metric, we can determine uniquely a connection from the fundamental tensor so that the covariant differential of the funda- mental tensor vanishes. Cartan's introduction of the notion of con- nection produced a development in the theory of Fins1er spaces that parallels the develop- ment in the theory of Riemannian spaces, and many important results have been obtained. O. Varga (1941) succeeded in obtaining a Cartan connection in a simpler way by using the notion of osculating Riemannian space. S. S. Chern (1943) studied general Euclidean connections that contain Cart an connections as a special case. Noticing that the tangent space of a Fins1er space is a tnormed linear space, H. Rund (1950) obtained many notions different from those of Cartan. However, as far as the theory of connections is concerned, the two theories do not seem to be essentially different. The theory of curvature in a Fins1er space is more complicated than that in a Riemannian space because we have three curvature tensors in the Cartan connection. Using the fact that, in a local cross section of the tangent vector bundle of a Fins1er space, a Riemannian metric can be introduced by the Fins1er metric, L. Auslander (1955) [2J ex- tended to Fins1er spaces the results of J. L. Synge and S. B, Myers on the curvature and topology of Riemannian spaces. A. Lichnero- wicz extended the tGauss-Bonnet formula to 600 Fins1er spaces by considering an integral on the subbundle of the tangent vector bundle satisfying L(x,y)= 1 [6]. M, H. Akbar-Zadeh studied tho1onomy groups and transformation groups of Fins1er spaces by using the theory of fiber bundles. Connections of Finsler spaces have been investigated by many geometers, but most of them used methods considerably different from those of the modern theory of connections in principal fiber bundles. J. H. Taylor and Synge (1925) defined the covariant differential of a vector field along a curve. L. Berwald (1926) defined a connection from the point of view of the general geometry of paths. A curve on a manifold satisfying the differential equation d 2 x i . ( dX ) -+2G' x - =0 dt 2 ' dt is called a path. The theory was originated by O. Veb1en and T. Y. Thomas and generalized as above by J. Douglas. Characteristically, with respect to a Berwald connection, the covariant differential of the fundamental ten- sor does not vanish. A Fins1er space is a space endowed with a metric for line elements. As a dual concept, we have a Cartan space, which is endowed with a metric for areal elements [4]. A. Kawaguchi (1937) extended these notions further and studied a space of line elements of higher order (or Kawaguchi space). References [IJ M. H. Akbar-Zadeh, Les espaces de Fin- sler et certaines de leurs genera1isations, Ann. Sci, Ecole Norm. Sup., (3) 80 (1963), 1-79. [2J L. Auslander, On curvature in Finsler geometry, Trans. Amer. Math. Soc., 79 (1955), 378-388. [3J E. Cartan, Les espaces de Finsler, Ac- tualites Sci. Ind., Hermann, 1934, [4J E. Cartan, Les espaces metriques fondes sur 1a notion d'aire, Actualites Sci. Ind., Her- mann, 1933. [5J P. Fins1er, Uber Kurven und Fliichen in allgemeinen Riiumen, dissertation, Gottingen, 1918. [6J A. Lichnerowicz, Quelques theoremes de geometrie differentielle globa1e, Comment. Math. He1v., 22 (1949),271-301. [7J B. Riemann, Uber die Hypothesen, welche der Geometrie zu Grunde 1iegen, Habilita- tionsschrift, 1854. (Gesammelte mathematische Werke, Teubner, 1876,254-270; Dover, 1953.) [8J H. C. Wang, On Finsler spaces with com- pletely integrable equations of Killing, J. London Math. Soc., 22 (1947),5-9. 
601 [9J H. Rund, The differential geometry of Finsler spaces, Springer, 1959. [lOJ M. Matsumoto, The theory of Finsler connections, Pub!. Study Group Geom. 5., Okayama Univ., 1970. 153 (IX.7) Fixed-Point Theorems A. General Remarks Given a mapping f of a space X into itself, a point x of X is called a fixed point of f if f(x) =X. When X is a topological space and f is a continuous mapping, we have various theorems concerning the fixed points of f B. Fixed-Point Theorems for polyhedra (1) Brouwer Fixed-Point Theorem. Let X be a tsimplex and f: X --> X a continuous mapping. Then f has a fixed point in X (Math. Ann., 69 (1910), 71 (1912)). (2) Lefschetz Fixed-Point Theorem. Let Hp(X) be the p-dimensional thomology group of a tfinite polyhedron X (with integral coeffi- cients), Tp(X) the ttorsion subgroup of Hp(X), and Bp(X) = Hp(X)/Tp(X), The continuous mapping f: X --> X naturally induces a homo- morphism f* of the free Z-module Bp(X) into itself. Let ex p be the ttrace of f* and A f = L;o( -!)pex p (n=dimX). We call this integer Af the Lefschetz number f We have the Lefschetz fixed-point theorem: (i) Let f, 9 be continuous mapping sending X into itself. Iff, 9 are thomotopic (/""g), then A f = Ag. (ii) If A f # 0, then f has at least one fixed point in X (Trans. Amer. Math. Soc., 28 (J 926)). The condition Af#O is, however, not neces- sary for the existence of a fixed point of f The Brouwer fixed-point theorem is obtained im- mediately from (i) and (ii). in particular, if the mapping f is homotopic to the identity map- ping 1 x, then ex p is the pth tBetti number of X, and Af is equal to the tEuler characteristic X(X) of X. Hence, in this case, if X(X) # 0, then f has a fixed point. When X is a compact oriented manifold without boundary the Lefschetz number Af of f can be interpreted as the tintersection num- ber of the graph of f and the diagonal of X. More generally, let X and Y be compact oriented n-dimensional manifolds without boundary. If f and g: X --> Yare continuous mappings, a point x of X such that f(x) = 153 B Fixed-Point Theorems g(x) is called a coincidence point of f and g. The intersection number A f,g of the graph of.f and that of 9 is called the coincidence number of f and g. If Af,g#O, then f and 9 have at least one coincidence point. The coin- cidence number Af,g is also expressed as L;=o( -ljPtr(/* og<l HP(X)), where g,: HP(X)-->HP(Y) is the tGysin homomorphism of g. Suppose that a finite group G acts on the manifolds X and Y. If f: X --> Y is a mapping, a point x of X such that rf(x) = fr(x) for all rEG is called an equivariant point of f When G is a group of order 2 and acts on X nontrivially, the equivariant point index Af is employed. This index was introduced by Nakaoka (Japan. J. Math. 4 (1978)), using the tequivar- iant cohomology. It has the property that Af#O, implying that f admits an equivariant point. The prototype of this theorem is the Borsuk-Ulam theorem (Fund. Math. 20 (1933)), which states that a continuous mapping f: sn-->R n always admits a point XES n such that {(x) = {( - x). (3) Lefschetz Number and Fixed-Point Indices. Suppose that IKI is an n-dimensional homo- geneous polyhedron (i.e., any simplex of K that is not a face of another simplex of K is of dimension n), and f: I K 1--> I K I is a continuous mapping. Then there exists a continuous mapping g:IKI-->IKI homotopic to f and admitting only isolated fixed points {q I, "', qr}, each of which is an inner point of an n- dimensional simplex of K. The tlocal degree )"i of a mapping 9 at qi is called the fixed- point index of gat qi' Then J f = L;I Ai does not depend on the choice of 9 and is equal to ( -1)"A f . (4) Singularities of a Continuous Vector Field. Let X be an n-dimensional tdifferentiable manifold and F a tcontinuous vector field on X that assigns a tangent vector x p to each point p of X. A point p is called a singular point of F if x p is the zero vector. The vector field F induces in a natural manner a continu- ous mapping f: X -->X that is homotopic to the identity mapping Ix. Then a fixed point of f is a singular point of F, and vice versa. When such a singular point p is isolated, there exists a tcoordinate neighborhood N of p that is homeomorphic to an n-dimensional open ball such that x q is nonzero for every point q in N except for q=p. Let N' be the boundary of N. Then we may consider N'  sn-I and a map- ping FIN' from sn-l to Rn,,{o} ""sn-I. The tdegree of a mapping sn-I  Rn,,{o} sn-l is called the index of the singular point p. This index is equal to the fixed-point index Ap of f at p. Hence, when X is compact and has no 
153C Fixed-Point Theorems boundary, the sum of indices of (isolated) singular points of F is equal to (-I)"X(X). In particular, a compact manifold X without boundary admits a continuous vector field with no singular point if and only if X(X)=O (Hopf's theorem, Math. Ann., 96 (1927». (5) Poincare-Birkhoff Fixed-Point Theorem. In certain cases, a continuous mapping f: X --> X of a finite polyhedron X into itself has fixed points even if Af==O. For example, let X be the annular space {(r, e) I ex.s; r.s; [3} ((r, e) are the polar coordinates of points in a Euclidean plane) and let f:X -->X be a homeomorphism satisfying the following conditions: (i) there exist continuous functions g(e), h(O) such that g(e)< 0, h(e) > e,f(ex, O)=(ex, g(e», f([3, e)= ([3, h(e»; (ii) there exists a continuous positive function p(r, e) defined for ex < r < [3 such that f Lp(r,e)drdO= f Lp(/(r,e))drdO for all measurable sets D. Then f has at least two fixed points. This theorem was conjectured in 1912 by Poincare, who hoped to apply it to solve the trestricted three-body problem. The theorem was later proved by G. D. Birkhoff (Trans. Amer. Math. Soc., 14 (1913); see also M, Brown and W. D. Neumann, Michigan Math. J. 24 (1977») and is called the Poincare-Birkhoff fixed-point theorem or the last theorem of Poincare, C. Atiyah-Bott and Atiyah-Singer Fixed-Point Theorems There is a far-reaching generalization of the Lefschetz formula given by Atiyah and Bott (Ann. Math., (2) 86 (1967),88 (1968)). Let M be a compact differentiable manifold without boundary and r: M --> M a differentiable map- ping with only simple fixed points; that is, it is assumed that det(1 -dfp) #0 for each fixed point p E M of f, where dfp is the differential of f at the point p. The fixed points of f are finite in number. Suppose that an telliptic complex over M do d, d J 1 tS': 0--> r(Eo)--> r(E l )-->... ---> r(E 1 )--> 0 and a sequence of smooth vector bundle homomorphisms <Pi: f* E i --> E i (i = 0, .., , I) are given such that d i = +I d i for each i, where : r(E;)-->r(E;) is defined by s(x)= <PiS(/(X» for sEr(E;). The sequence T=() induces endomorphisms Hi T of the homology groups Hi(tS') of the elliptic complex tS', We define the Lefschetz number L(T) by L(T)= Lo( _l)i tr Hi T, On the other hand, for a fixed point p off, let <pi,p:Ei,p-->E i . p denote the restriction of <Pi on the fiber Ei,p of E i over 602 p. Under the circumstances mentioned above the Lefschetz number L(T) is given by the formula I I (-I)itr<pi,p L(T)= ilet(l_dfp)1 ' where the summation is over the fixed points off Here are some examples of the above for- mula. First, take as tS' the tde Rham complex and as <Pi the obvious one, i.e.. the ith exterior power of the transpose of df In this case, the formula reduces to the classical Lefschetz formula. As a second example consider the tDolbeault complex O-->AO,O(M)!.Ao,I(M)... !.Ao,n(M)-->O of a compact complex manifold M and a holomorphic mapping f: M -->M with only simple fixed points. In this case the formula above reduces to one giving the Lefschetz number of the induced endomorphisms f*: HO'*(M)-->Ho'*(M) of the tDolbeault cohomology: 1 L(/*)=  detdl-dfp) ' where dfp is regarded as a holomorphic differential. If the assumption that the mapping f has only simple fixed points is replaced by the one that f is a diffeomorphism of M contained in a compact ttransformation group G, then there is also a generalized Lefschetz formula, given by Atiyah and Singer (Ann. Math., (2) 87 (1968)). The fixed-point set of such a diffeo- morphism is a closed submanifold of M (con- sisting of several components) Suppose that we are given an elliptic complex tS' over M and a lift of the G-action on M to tS'. The latter implies that, if we define :r(Ei)-->r(E;) by s(x)= r l s(/(x) for sEr(E;), then d i = +l d i holds for each i. Under these circum- stances, the Lefschetz number L(T) can be expressed in the form L(T)= I v(F i ), F, where the summation is over the components {F;} of the fixed-point set Mf of f and where the number v(F;) is written explicitly in terms of the tsymbol of the elliptic complex tS' with G-action, the characteristic classes of the mani- fold F i , the characteristic classes of the normal bundle of F i in M, and the action of g= f- I on the normal vectors. The formula is essentially a reformulation of the tAtiyah-Singer index theorem, In fact, L(T) is the tanalytic index of tS' evaluated on 9 and the number v(FJ is de- duced from the ttopological index of tS' using 
603 the localization theorem, The most useful elIiptic complexes are de Rham complexes, Dolbeau\t complexes, signature operators and Dirac operators. In the case of Dolbeau\t complexes, f is assumed to be an analytic automorphism, and the number v(F;) takes the form f199 ff(F;) v(F;) = det(l_gIN*) [FJ. Here, the normal bundle N of F i has a decom- position N = L9 N(O) into the sum of complex vector bundles such that 9 acts on N(O) as e i9 , and 9 is the characteristic class defined by 1 -i9 -e 9 = f1 I -x -;9' j -e J where the tChern class of N (0) is written as c(N(O))= TI j (1 + x); moreover .;T(F i ) denotes the tT odd class of the complex manifold Fi' and N* denotes the dual bundle of N ( 237 K - Theory H). D. Fixed-Point Theorems for Infinite- Dimensional Spaces Birkhoff and O. D, KelIogg generalized Brouwer's fixed-point theorem to the case of function spaces (Trans. Amer. Math. Soc., 23 (1922)), Their result was utilized to show the existence of solutions of certain differential equations, and has led to a new method in the theory of functional equations. J. P. Schauder obtained the folIowing theo- rem: Let A be a closed convex subset of a Banach space, and assume that there exists a continuous mapping T sending A to a tcount- ably compact subset T(A) of A. Then T has fixed points (Studia Math., 2 (1930). This theorem is calIed the Schauder fixed-point theorem, A. Tikhonov generalized Brouwer's result and obtained the following Tikhonov fixed- point theorem (Math, Ann" 111 (1935»): Let R be a localIy convex ttopologicallinear space, A a compact convex subset of R, and T a con- tinuous mapping sending A into itself. Then T has fixed points. This theorem may be applied to the case where R is the space of continuous mappings sending an m-dimensional Euclidean space Em into a k-dimensional Euclidean space E k to show the existence of solutions of certain differential equations. For example, when m= k = 1, consider the differential equation dyjdx=f(x,y), y(xo)==Yo. We set T(y)=yo+ gof(t,y(t))dt to determine a continuous mapping T:R-->R. Then the fixed points of T are the solutions of the differential 154 A Foliations equation, Now we can apply the theorem of Tikhonov to show the existence of solutions, On the other hand, when we are given prob- lems offunctional analysis, Schauder's fixed- point theorem is usualIy more convenient to apply than Tikhonov's theorem. The following theorem, written in terms of functional analysis, is useful for applications: Let D be a subset of an n-dimensional Eucli- dean space, F the family of continuous func- tions defined on D, and T:F -->F a mapping. Suppose that the folIowing three conditions are satisfied: (i) For fl' f2 E F, 0 < ..1. < 1 implies Afl +(1 - A)f2 E F. (ii) If a series {fd offunc- tions in F converges uniformly in the wider sense to a function f, then f E F; and further- more, the series {Tfk} converges uniformly in the wider sense to Tf (iii) The family T(F) is a tnormal family of functions on D, Then there exists a function f E F such that Tf = f Let R be a topological linear space and T a mapping assigning a closed convex subset T(x) of R to each point x of R. A point x of R is calIed a fixed point of T if XE T(x). The map- ping Tis calIed semicontinuous if the condition xn-->a, Yn-->b (YnE T(x n )) implies that bE T(a). In particular, if K is a bounded closed convex subset of a finite-dimensional Euclidean space Rand T a semicontinuous mapping sending points of K into convex subsets of K, then T admits fixed points (Kakutani fixed-point theorem, Duke Math. J., 8 (1941)). This result was further generalized to the case of locaIly convex topological linear spaces by Ky Fan (Proc. Nat. Acad. Sci. US, 38 (1952)). References [1] B. F. Brown, The Lefschetz fixed point theorem, Scott, Foresman, 1971. [2J J. Cronin, Fixed points and topological degree in nonlinear Analysis, Amer. Math. Soc. Math. Surveys, no. 11, 1964. [3J S. Lefschetz, Topology, Amer. Math. Soc. ColIoq. Publ., vol. XII, 1930. [4J D. R. Smart, Fixed point theorems, Cam- bridge tracts in math. 66, Cambridge Univ. Press, 1974. 154 (lX.21) Foliations A. Introduction A foliation is a kind of geometric structure on manifolds, such as a differentiable or complex structure. The study of foliations evolved from 
154 B Foliations investigation of the behavior oftorbits of a vector field and also of the solutions of ttotal differential equations. Through the early works of C. Ehresmann, G. Reeb, and A, Hae- f1iger, together with the development of mani- fold theory in 1960s, it became an established field of mathematics. Since then, great progress has been made in this field, especially in its topological aspects. At the same time it turns out that many problems in foliation theory are deeply related not only to the geometry of manifolds but also to various other branches of mathematics, such as the theory of differen- tial equations, functional analysis, and group theory. B. Definitions and General Remarks A foliation on a manifold can be defined within various categories: topological, C'- differentiable (1  r  00), real analytic, and holomorphic, For definiteness, however, we restrict ourselves to the C'-differentiable cate- gory in what follows. Furthermore, all mani- folds are assumed to be paracompact. Let M be an n-dimensional Coo-manifold, possibly with boundary, A codimension q, C r _ foliation of M (0  q  n, 0  r  00) is a family ,,¥ = {L,I exEA} of arcwise connected subsets, called leaves, of M with the following prop- erties: (i) L,nL,.=0 if:x#:x'; (ii) U'EAL.= M; (iii) Every point in M has a local coordi- nate system (U, if;) of class C' such that for each leaf L. the arcwise connected components of un L, are described by x n - q + 1 =constant, .." x n =constant, where Xl, x 2 , ..., x n denote the local coordinates in the system (U, if;), In particular, every leaf of ,,¥ is an (n - q)- dimensional tsubmanifold of M, The totality of integral submanifolds of a tcompletely integrable nonsingular system oftPfaffian equations on Rn, wi=a il (x)dxI +ai2(x)dx2 + .., +ain(x)dxn=O(i= 1,2, ...,q) forms a co- dimension q foliation, and the totality of inte- gral curves of a nonsingular vector field of class C' on M (r? 1) constitutes a codimension n -1 C'-foliation. Let :F be a codimension q, C'-foliation of M (r? 1). Then M admits a C'-I tp-plane field consisting of all vectors tangent to the leaves of ,CY;, and, dually, a C'-I q-plane field (p + q = n). Denote the former by r(:F) and the latter by v(:F), and call them the tangent bundle and the normal bundle of :F, respectively. v(:F) is isomorphic to the quotient bundle T(M)/r(:F), ,,¥ is called transversely orientable if v(3') is an orientable vector bundle. A C'-mapping f: N --> M is said to be transverse to the foliation :F if the composite mapping T(N)T(M)--> T(M)/r(:F) is epimorphic at each point of N. 604 In this case,.f induces a codimension q, C'- foliation f- I (:F) of N whose leaves are the arcwise connected components of f-I(L) (LE,'¥). In particular, if Q is a q-dimensional C'- manifold and f: N -->Q is a C'-submersion, f induces a codimension q, C'-foliation of N whose leaves are the arcwise connected com- ponents of rl(x) (XEQ), A cr p-plane field X on M is called involu- tive if, for any C r vector fields X, Yon M such that Xx, YxEX x (x EM), the tLie bracket [X, Y] satisfies [X, YJxEXx' This condition is known as the Frobenius integrability condition for ::r, If X is defined locally by q Pfaffian equa- tions WI = ,., = w q = 0, the above condition is equivalent to the condition that there are C'-I I-forms (}ij (i,j = I, .., , q) such that dW i = LJI 0ij A w j . A C' p-plane field ;£ is said to be completely integrable if it is a tangent bundle of some foliation, When r? 2, ,:r is completely integrable if and only if it is involutive (Frobe- nius theorem) ( 428 Total Differential Equa- tions). There is a topological obstruction to the complete integrability of X ( Section F). A closed C""-manifold M admits a codi- mension I, C'-plane field if and only if the Euler number of M vanishes. In 1.944, Reeb constructed a codimension 1, CW-foliation of the 3-sphere S3 as follows [I]. Let f(x) be a C'lJ- even function defined on the open interval (-I, I), such that d(k) I lim--=O (k=0,1,2,..,). Ixl) dX(k) .f'(x) The graphs of the equations y = f(x) + c ( -I < x< l,cER) together with the lines x=:tl constitute a codimension I, Coo-foliation of [ -1, 1] x R. Then by rotating it around the y- axis in R 3 , we obtain a codimension 1, coo_ foliation of D 2 x R, where D 2 denotes the closed 2-disk. The foliation is invariant under vertical translations and therefore defines a codimension 1, Coo-foliation of D 2 x S). This foliation is called the Reeb component of D 2 x S). Since S3 is a union of two solid tori inter- secting in the common toroidal surface, the Reeb components of each solid torus con- structed above define the so-called Reeb foli- ation of S3 (Fig. 1). Fig. 1 
605 Generalizing the above construction into a differential topological method, one obtains the following results: Every closed 3-dimen- sional manifold admits a codimension 1, Coo_ foliation (S. P. Novikov [3J, W. Lickorish, J. Wood); every odd-dimensional sphere admits a codimension 1, Coo-foliation (I. Tamura, A. Durfee, B. Lawson). On the other hand, every open manifold has a codimension 1, Coo_ foliation induced by a submersion over R ( Section F). Let M be a total space of a C'-bundle over a Coo-manifold B with fiber F. If F is a Coo_ manifold and the tstructure group reduces to a totally disconnected subgroup of Diff' F, the group of all C'-diffeomorphisms of F, then the local sections, which are defined in an obvious manner using the local triviality of this bundle, fit together to give leaves of a codimension q, C'-foliation (q=dimF). In this case, M is caIled a foliated bundle or a flat F-bundle over B. Each leaf of this foliation is diffeomorphic to a covering manifold of B and transverse to the fibers of the bundle M --> B. Foliated bun- dles exhibit a class of foliations; this is espe- cially important in connection with the char- acteristic classes of foliations ( Section G). C. Holonomy The notion of the holonomy of a leaf, given by Ehresmann, is a generalization of the tPoin- care mapping in tdynamical systems. Let :F be a codimension q, C'-foliation of M and L be a leaf of :F. Let N(L) denote the total space of the normal disk bundle of L in M. Choose a C'-immersion i: N(L)--> M such that i restricted to the zero section of N(L) is the natural inclu- sion and i maps the fibers of N(L) transversely to the foliation :F. Then the induced foliation i-I (:F) of N(L) has the properties that the leaves are transverse to the fibers of N(L) and the zero section of N(L) is a leaf. If y is an oriented loop in L based at a point Xo E L, then there is a neighborhood U of 0 in the fiber over Xo satisfying the following: for each point XE U there is a curve Yx: [0, 1J --> N(L) having the properties: (i) Yx(O)=x, (ii) Im(yx) lies on a leaf of i-I (:F), and (iii) noYAt)=y(t) for any tE[O, 1J, where n:N(L)-->L is the bundle pro- jection. The family of curves {YxIXEU} gives a C'-diffeomorphism Hy from U to another open set ofn-1(x o ), which assigns yA1) to x. Let G; denote the group of tgerms at 0 of aU local C' -diffeomorphisms of Rq which fix O. The germ at 0 of the mapping Hy depends only on the thomotopy class of y, and, by identifying n- 1 (x o ) with Rq, we obtain a homomorphism hL:nl(L,xo)-->G;. h L is determined by L up to conjugacy and is called the holonomy homo- 154 D Foliations morphism, or simply the holonomy, of the leaf L. The image of h L is called the holonomy group of L. For r? 1, by differentiating each element of G;, one has a homomorphism dh L : n 1 (L, xo)-->GL(q; R), called the linear holonomy of L. The holonomy of a proper leaf ( Section D) completely characterizes the foliation of a neighborhood of it (Haefliger). D. Topology of Leaves Let :F be a codimension q foliation of M. The leaf topology is a topology of M defined by requiring each connected component of the set of the form un L to be open, where U is an open set in M and LE:F. Leaves are nothing but the connected components of M with respect to this topology. A leaf LE:F is called a compact leaf if L is compact in the leaf top- ology. In general, L is caIled proper if two topologies on L induced from the original and the leaf topologies on M coincide. Any com- pact leaf is proper. A leaf L is called locally dense if Int L# 0. If a leaf is neither proper nor locally dense, it is caUed exceptional. Since we are assuming that M is paracompact, a leaf that is a closed subset of M is proper (Haefliger). There exists a codimension 1, C 1 _ foliation of the 2-torus T 2 that contains exceptional leaves. But in the C' category (r? 2), such a foliation does not exist on T 2 (A, Denjoy, C. Siegel). In higher dimensions, there are examples of Coo-foliations with exceptional leaves (R. Sacksteder). The following result is called the Novikov closed leaf theorem [3]: Any codimension 1, C' -foliation (r? 2) of a closed 3-dimensional manifold M contains a Reeb component if either n 1 (M) is finite or n 2 (M)#0 (M #SI x S2, SI X RP2). In partic- ular, every C 2 -foliation of S3 contains a com- pact leaf homeomorphic to T 2 . The question of whether every codimension 2, C'-foliation of S3 has a compact leaf is known as the Seifert conjecture. There is a counterexample in the C 1 case (P. Schweitzer [7J), but it remains unsolved for r? 2 ( 126 Dynamical Systems N). A compact leaf L is said to be stable if it has an arbitrarily small open neighborhood that is a union of compact leaves. The foIlowing results are calIed the Reeb stability theorems: (1) Let L be a compact leaf of a C'-foliation (r? 0). If the holonomy group of L is finite, then it is stable (Reeb [IJ). (2) Let :F be a transversely orientable codimension 1, C'- foliation (r? 1) of a compact manifold M (tangent to the boundary). If there exists a compact leaf L with HI (L; R) = 0, then M is a fiber bundle over SI or [0, 1], and the leaves of :F are the fibers of this bundle. In particular, 
154 E Foliations every leaf is compact and stable (Reeb [1 J, Thurston [8J). The generalization of the stability theorem for proper leaves has been investigated by T. Inaba and P. Dippolito. E. Haefliger Structures Let ; be the tpseudogroup consisting of all C'-diffeomorphisms 9 of an open subset of Rq to another open subset of Rq, We write r for the set of all germs [gJx of 9 at x, xEdomain of g, 9 E 1;, The sheaf topology of r is the to- pology whose open base is the family of sub- sets of the form UXEdomainof g{[gJx}' With this topology and the multiplication induced from the composition of ;, r is a topological groupoid, i.e., a tgroupoid whose multiplication and inverse mappings are continuous. If one identifies a point x in Rq with [idR,Jx, r con- tains Rq as a subspace. A codimension q, C' - Haefliger structure or a r-structure £ on a topological space X is a maximal covering of X by open sets {U i liE J}, such that for each pair i,jEJ, there is a con- tinuous mapping Yij: uin lj-->r satisfying Yidx)=Yij(x)oYjdx) for XE uin ljn Uk' (*) Since y;;(x) is the germ of the identity mapping for XE Ui> the correspondence x-->y;;(x) defines a continuous mapping/;: Ui-->Rqcr, A co- dimension q, C'-foliation of a Coo-manifold M is the same as a r-structure on M such that each t; is a C'-submersion (Haefliger [5]). If f: Y --> X is continuous and :Yf is a r-structure on X, there is an induced r- structure I-I fe' on Y which is defined by {f-l(U;), Yijofli,jEJ}. Two r-structures  and :Yf 1 on X are said to be homotopic if there exists a r-structure :Yf on X x [0, 1] such that :Yfl x xl'} =Yf, (t=O, 1). Let r(X) be the set of homotopy classes of r-structures on X. There exists a space Br, called the classifying space for r-structures, such that there is a natural one-to-one corre- spondence between r(X) and [X, BrJ for any paracompact space X, where [A, BJ de- notes the set of homotopy classes of continu- ous mappings from A to B, By condition (*) above, ifr>O, the differentials {dYiAx)lxE U i n lj, i,j EJ} define a q-dimensional vector bundle v(:Yf) over X, which is called the nor- mal bundle of :Yf. The correspondence :Yf--> v(.f1') gives a continuous mapping v: Br--> BGL(q; R) among classifying spaces. If r=O, there is also a similar mapping v: Br--> BTopq. Let Bf be the homotopy fiber of the mapping v. Br is a classifying space for the r-structures with trivialized normal bundles. There is a tight connection between the 606 topology of Br and the group structure of Diffr(Rq), which is stated below. Let Diff(Rq) be the topological group of all C'- diffeomorphisms of Rq that are identities outside some compact sets with the C'to- pology. Let Diffk,o(Rq) denote Diff(Rq) with the discrete topology and B Diff (Rq) denote the homotopy fiber of the natural mapping BDiff.o(Rq)-->BDiff(Rq). Then there exists a continuous mapping B Diff (Rq)-->Qq Bf that induces an isomorphism in the ho- mology group with integer coefficient for Or 00 and q'? 1, where Qq denotes the qth tloop space functor (J. N, Mather [12], Thur- ston [10]). Further, it has been proved that Diffto(Rq) is a tsimple group (ifr#q+ 1) and that Br is t(q + 1 )-connected for r # q + 1 (Mather and Thurston, Haefliger [5]), The group HomeoK.oRq is tacyclic (Mather), and hence Br is tcontractible. F. Existence and Classification of Foliations Not every plane field on a manifold is isomor- phic to a completely integrable field (R. Bott). Thus, in general, the existence of a plane field does not guarantee the existence of a foliation of a manifold. Haefliger and Thurston solved the existence and classification problems in foliation in terms of Haefliger structures as follows. Two codimension q foliations ffo and ?I of a Coo-manifold M are said to be concor- dant if there is a codimension q foliation ff on M x [0, 1], that is transverse to M x {t} (t = 0, 1) and induces there the given foliation g;; (t=O, 1). They are said to be integrably homo- topic if one further requires that the foliation ff be transverse to M x { t} for all t E [0, 1] in the definition above. Similarly, two subbundles o, 1 of T(M) are said to be concordant if there is a subbundle  of T(M x [0, 1]) such that  I M x it} =, for t=O, 1, and they are said to be homotopic if one further assumes that  I M x l'] is a subbundle of T(M x {t} ) for all tE[O, 1]. The following theorem is offunda- mental importance. Theorem: Let M be an open (resp. closed) Cue -manifold. Then for each r == 0, 1,.... 00, the integrable homotopy classes (resp. concor- dance classes) of codimension q, C'-foliations of M are in a natural one-to-one correspon- dence with homotopy classes of r-structures .ff on M together with homotopy classes (resp, concordance classes) of subbundles of T(M) isomorphic to v(:Yf). (M. L. Gromov, A. Phillips, Haefliger [5], Thurston [9]). The following are consequences of the theorem: (i) A closed manifold M admits a codimension 1, Coo -foliation if and only if the Euler number of M vanishes. (ii) If a manifold 
607 admits a tq-frame field, then the associated q- plane field is homotopic to the normal bundle of a codimension q foliation of M. (iii) Every dimension q plane field on a Coo-manifold is homotopic to the normal bundle of a C o _ foliation with Coo leaves, G. Characteristic Classes of Foliations Let Br be the classifying space of r- structures. An element of the cohomology group H*(Br; R) is called a (real) character- istic class of codimension q, C'-foliations. If ff is a codimension q, Cr-foliation of M and f: M --> Br is the classifying mapping for ff, then an element ex(ff) = f*exEH*(M; R), exE H*(Br; R), is called the characteristic class of ff corresponding to ex. The first nontrivial characteristic classes of foliations are known as the Godbillon-Vey classes (c. Godbillon and J. Vey [13]) and can be defined as follows. Let ff be a transversely oriented, codimension q, Coo-foliation of M. Then there exists a tq-form Q on M such that on a neighborhood U of each point of M, Q is written as cof A... A co, where cof, ... , co are linearly independent 1- forms that vanish on leaves of ff. By the inte grability condition, there is a I-form IJ such that dQ=IJAQ. Then the Godbillon-Vey class r iF of ff is the de Rham cohomology class in H 2 q+l (M; R) represented by the closed 2q + I form IJA(dlJ)q, The following construction provides a wide class of characteristic classes of foliations (Bott and Haefliger [14], 1. Bernstein and B. Rosen- feld [15]). Let J k be the set oftk-jets at 0 of local Coo -diffeomorphisms of Rq keeping 0 fixed. The set {Jk}O forms an tinverse system of tLie groups with respect to the natural homomorphism Pk: Jk+l -->Jk> and each J k (k'? 1) contains O(q) as a tmaximal compact subgroup. Let P k be the differentiable fiber bundle of k-jets at 0 of local diffeomorphisms of Rq whose domains contain O. It is a tprin- cipal Jk-bundle over Rq. Denote by A(Poo) the tdirect limit of the de Rham complexes of {Pdo' and let A be the subcomplex of A (Pro) consisting of invariant forms with respect to the natural action of rgqoo. A is canonically isomorphic to the tcochain complex A(Aq) of continuous alternating forms on Aq, where Aq is the topological Lie algebra of tformal vector fields on Rq ( 105 Differentiable Manifolds AA). Now let ff be a codimension q, coo_ foliation of M. Let Pk(ff) denote the differ- entiable fiber bundle over M whose fiber over x EM is the space of k-jets at x of the c oo _ submersion f: U -->Rq from an open neighbor- hood U of x to Rq, satisfying (i) f(x)=O, (ii) 154 G Foliations Kerdf= r(ff)I u. This is a principal Jk-bundle over M, and its restriction to U is isomorphic to the pullback by f of the bundle P k . Hence there are homomorphisms from the set of in- variant forms on P k to A(Pk(ff)) that induce a homomorphism AA(Aq)-->IA(Pdff)). This homomorphism is compatible with the action of O(q) and hence induces a homomor- phism of O(q)-basic subcomplexes. Thus one obtains a homomorphism <p,.:H*(Aq;O(q))--> IH*(A(Pk(ff)); O(q))H*(M; R). In fact, <p,. depends only on the 2-jet of the foliation ,IF, and one can think of it as a homomor- phism <p:H*(A q ; O(q))-->H*(Br; R) (r'? 2). The elements in 1m <p are called the smooth characteristic classes of foliations. Let WO q be the differential graded algebra: WO q = E(u p u 3 , ..., u 2 [(q+O/2J-\) @ R[c l' ..., cqJ, where dUi=Ci, dci=O, deg(u;)=2i-l, deg(ci)= 2i, and E denotes the texterior algebra over R generated by the ui's, and R denotes the tpolynomial algebra of the c;'s truncated by the tideal generated by elements of degree > 2q, There exists a homomorphism of dif- ferential graded algebras WOq-->A(Aq; O(q)) which induces an isomorphism in cohomology (I. M. Gel'fand and D. B. Fuks). For a codi- mension q foliation ff of M, the cohomology class determined by C 2i corresponds to the ith Pontryagin class of the normal bundle v(ff) of ff, and the cohomology class U\ cj corre- sponds to the Godbillon-Vey class of ff. In particular, the subring of the cohomology ring H*(M; R) generated by the tPontryagin classes of v(ff) is trivial for degree> 2q (Bott's vanishing theorem). Let Nq+\ be a closed (q + I)-dimensional Riemannian manifold of tconstant negative curvature, The total space T\ N of the unit tangent sphere bundle of N admits a codi- mension q, Coo-foliation associated with the tgeodesic flow of T\ N (t Anosov foliation). It has been shown that the Godbillon-Vey class of this foliation is nontrivial (R. Roussarie, F. Kamber and P. Tondeur, K. Yamato). It is known that many of the smooth characteristic classes are also nontrivial (Bott and Haefliger, Thurston, J. Heitsch). A smooth characteristic class exE H*(Br; R) is called rigid if for any smooth one-parameter family {} of codimension q foliations on a Coo-manifold M, d(ex())/dt=O holds. The elements in the image of the natural homomorphism H*(WOq+d-->H*(WOq)-->H*(Br; R) are rigid (Heitsch). On the other hand, the Godbillon-Vey class is not rigid. In fact, Thur- ston constructed a one-parameter family of codimension q foliations of a certain 
154 H Foliations (2q + I)-dimensional manifold for which the GodbilIon-Vey class varies continuously. The characteristic classes of a simple foliation are often trivial. For example, the Godbillon-Vey class of a codimension 1 foliation of a closed manifold vanishes if it is almost without holo- nomy (i.e" no noncompact leaves have non- trivial holonomy) (M. Herman [16]; T. Mizu- tani, S. Morita, and T. Tsuboi [17]). H. Further Topics (1) Transverse structures. Let B be a q- dimensional manifold and :f' a geometric structure on it, and let 'fJ'l' denote the tpseudo- group generated by the local diffeomorphisms that preserve the structure if'. Replacing 'fJ; by 'fJ'l' in the definition of a C'-Haefliger struc- ture, one obtains definitions of a r'l,-structure and a rIP-foliation. A r 'Ffoliation is called a Riemannian foliation, a transversely real ana- lytic foliation, or a transversely holomorphic foliation if Y is a Riemannian, real analytic, or complex structure on Rq (Cq/2), respectively. The theoris for many such foliations are analogous to those for C'-foliations. For example, many results are known about the characteristic classes of Riemannian or holo- morphic foliations. Haefliger showed that there is no codimension I real analytic foliation on a simply connected closed manifold and that the classifying space Br for codimension 1 trans- versely oriented transversely real analytic foliations has the homotopy type of a tK (n, 1)- space for some uncountable tperfect group n [5J. (2) Foliated cobordism. Two closed oriented n-dimensional Coo-manifolds Mo and M 1 with codimension q, C'-foliations are said to be foliated cobordant if there exist a compact oriented (n + I)-dimensional C"; -manifold W with boundary aW=M l U( -Mo) and a codi- mension q, C'-foliation of W which is trans- verse to aw and induces the given foliations of Mo and MI' The resulting foliated cobordism classes {g;} form a group g;Q.q with respect to the disjoint union, It is known that g;Q2: 1 = {O} and that the Reeb foliation of S3 is cobordant to zero, The characteristic classes of foliations provide invariants of foliated cobordisms. In particular the Godbil1on- Vey number r F[MJ is an invariant of g;Q'2 q +l. q (r? 2, q? 1), and a result of Thurston men- tioned in Section G implies that the homo- morphism g;Q'2 q +1.q-->R defined by {g;}--> r F[MJ is surjective. (3) Growth of leaves, transverse invariant measure, Let ,'F be a Coo-foliation of a com- pact manifold, Fix a tRiemannian metric on M. Then each leaf L of g; has the in- duced metric, and one has a function f(r)= 608 vol(D(x, r)), where D(x, r) is the set of points Y E L whose distance along L from a fixed point x E L is not greater than r. The growth type of the function fL is determined only by L. Many papers have been published that deal with the relation between the behavior of leaves and their growth types in codimension 1 foliations (J. Cantwell and L. Conlon, G. Hec- tor, T. Nishimori, N. Tsuchiya). On the other hand, as a generalization of the notion of asymptotic cycles in dynamical systems, the notion of foliation cycles, or equivalently a transverse invariant measure, has been defined (1. Plante [18], D. Ruelle, D. Sullivan [19]). The existence of a transverse invariant mea- sure for a foliation is closely connected to the growth types of the leaves. The example of Denjoy in Section D leads to the study of tminimal sets of foliations and the structures of foliations (Hector, Cantwell and Conlon). The structure of a codimension 1 foliations which are almost without holonomy has been fairly well investigated (Sacksteder, Hector, H. Imanishi, R. Moussou, Roussaire). (4) Compact foliation. A foliation whose leaves are all compact is called a compact foliation. D. Epstein proved that if g; is a codimension 2, C 2 -compact foliation of a closed 3-manifold, then the leaves of g; are the fibers of a tSeifert fibration of M. In higher dimensions, the situation is more complicated (Sullivan, R. Edwards and K. Millett). (5) Foliated bundles. There are many results on foliated bundles. In particular J. W. Milnor [20] and J. Wood obtained a condition for a circle bundle  over a closed surface L: to have a foliation transverse to fibers. More precisely, if  and L: are orientable, then  admits such a foliation if and only if I X()I  -min{O, X(L:)}, where X denotes the Euler number and X the Euler-Poincare characteristic. Kamber and Tondeur made an extensive study of charac- teristic classes of foliated bundles [21]. (6) Transverse foliations. Two foliations g; and 'fJ of M are said to be transverse to each other if any two leaves K and L of g; and 'fJ are transverse to each other. A foliated bundle has such foliations. D. Hardorp proved that on every orientable closed 3-manifold, there exists a triple of codimension 1 foliations that are pairwise transverse. Tamura and A. Sato classified the codimension 1 foliations that are transverse to the Reeb component of D 2 x Sl. References [IJ G. Reeb, Sur certains proprietes topo- logiques de varietes feuilletees, Actualite Sci. Ind., 1183, Hermann, 1952. [2] C. Ehresmann, Sur la theorie des varietes 
609 feuilletees, Univ. Roma Rend. Mat. Appl., 10 (1951),64-82. [3J S. P. Novikov, Topology of foliations, Amer. Math. Soc. Transl., 1967,286-304. (Original in Russian, 1965.) [4J I. Tamura, Foliations and spinnable struc- tures on manifolds, Ann. Inst. Fourier, 23 (1973), 197-214, [5J A. Haefliger, Homotopy and integrability, Lecture notes in math. 197, Springer, 1971, 133-163. [6J A. Denjoy, Sur les courbes definies par les equations diffeentielles a la surface du tore, J. Math. Pure Appl., 11 (1932), 333-375, [7J P. Schweitzer, Counterexamples to Seifert conjecture and opening closed leaves offoli- ations, Ann, Math., (2) 100 (1974),386-400. [8J W. Thurston, A generalization of Reeb stability theorem, Topology, 13 (1974), 347- 352. [9J W. Thurston, Existence of codimension one foliations, Ann. Math., (2) 104 (1976), 249- 268. [1 OJ W. Thurston, Foliations and groups of diffeomorphisms, Bull. Amer. Math. Soc" 80 (1974),304-307. [11] W. Thurston, Noncobordant foliations of S3, Bull. Amer. Math. Soc., 78 (1972), 511- 514. [12J J. N. Mather, Integrability in codimen- sion 1, Comment. Math. Helv., 48 (1973), 195-233. [13J C. Godbillon and J. Vey, Un invariant des feuilletages de codimension un, C. R. Acad. Sci. Paris, 273 (1971),92-95. [14J R. Bott and A. Haefliger, On character- istic classes of r -foliations, Bull. Amer. Math. Soc., 78 (1972),1039-1044. [15J I. Bernstein and B. Rosenfeld, On charac- teristic classes offoliations, Functional Anal. Appl., 6 (1972),60-62. [16J M, Herman, The GodbiJIon-Vey invari- ant of foliations by planes of T 3 , Lecture notes in math. 597, Springer, 1977,294-307. [17J T. Mizutani, S. Morita, and T. Tsuboi, The GodbilIon- V ey classes of codimension one foliations which are almost without holonomy, Ann. Math., (2) 113 (1981). [18J J. Plante, Foliations with measure-pre- serving holonomy, Ann. Math., (2) 102 (1975), 327-362. [19] D. Sullivan, Cycles for the dynamical study of foliated manifolds and complex mani- folds, Inventiones Math., 36 (1976), 225-255. [20J J. W. MiInor, On the existence of a con- nection with curvature zerO, Comment. Math. Helv., 32 (1958), 215-223. [21] F. Kamber and P. Tondeur, Foliated bundles and characteristic classes, Lecture notes in math. 493, Springer, 1975. [22J B. Lawson, Foliations, Bull. Amer. Math. Soc., 80 (1974),369-418, 155 B Foundations of Geometry [23J B. Lawson, The quantitative theory of foliations, Regional conference series in math. 27, Conference Board of the Mathematical Sciences, 1977, 155 (VI.2) Foundations of Geometry A. Introduction Geometry deals with figures. It depends, there- fore, on Our spatial intuition, but our intuition lacks objectivity. The Greeks originated the idea of developing geometry logicalIy, based on explicitly formulated axioms, without re- sorting to intuition. From this intention re- sulted Euclid's Elements, which was long con- sidered the perfect model of a logical system. As time passed, however, mathematicians came to notice its imperfections. Since the 19th cen- tury especially, with the awakening of a mOre rigorous critical spirit in science and philoso- phy, more systematic criticism of the Elements began to appear. Non-Euclidean geometry was formulated after reexamination of Euclid's axiom of parallels; but it was also discovered that even as a foundation of Euclidean geom- etry, Euclid's system of axioms was far from perfect. Various systems of axioms for Eucli- dean geometry were proposed by mathemati- cians in the latter half of the 19th century, among them one by D. Hilbert [IJ, which became the basis of far-reaching studies. B. Hilbert's System of Axioms Hilbert took as undefined elements points (denoted by A, B, C,...), straight lines (or sim- ply lines, denoted by a, b, C, ...), and planes (denoted by ct., fJ, y,...). Between these ob- jects there exist incidence relations (expressed in phrases such as "A lies on a," "a passes through A," etc.); order relations ("B is be- tween A and C"); congruence relations; and parallel relations. The relations are subject to the following five groups of axioms: (I) Incidence axioms: (1) For two points A, B, there exists a line a through A and B. (2) If A # B, the line a through A, B is uniquely determined, We write a = AU B and call a the join of A, B. (3) Every line contains at least two different points. There exist at least three points that do not lie on a line. (4) If A, B, C are points not on a line, there exists a plane ex through A, B, C. (We also say that A, B, C lie on ct..) For every plane ct., there exists at least one point A on ex. (5) If A, B, C are points 
155 B Foundations of Geometry not on a line, the plane ex through A, B, C is uniquely determined. We write ex=A U BU C and call ex the join of A, B, C. (6) If A, Bare two different points on a line a and if A, B lie on a plane ex, then every point on a lies on ex. (We say that a lies on ex or ex passes through a.) (7) If a point A lies on two planes ex, fJ, there exists at least one other point B on ex and fJ. (8) There exist at least four points not lying on a plane. (II) Ordering axioms: (I) If B is between A and C, then A, B, C are three different points lying on a line; also, B is between C and A. (2) If A, C are two different points, then there exists a point B such that C is between A and B. (3) If B is between A and C, then A is not between Band C. We define a segment as a set of two different points A, B, denoted as AB or BA, and we caIl A and B ends of this segment. The set of points between A, B is called the interior of AB, and the set of points of A U B that are neither ends nor interior points of AB is called the exterior of AB. (4) Let A, B, C be three points not lying on a line. If a line a on the plane A U B U C does not pass through A, B, or C, but passes through a point of the interior of AB, then it also passes through a point of the interior of BC or CA (Pasch's axiom). The following propositions are proved from the above axioms. Given n points AI, A 2 , ..., An on a line (n > 2), we can rearrange them, if necessary, so that the point Aj is be- tween Ai and Ak whenever we have 1 .s; i <j < k  n. There are exactly two ways of arrang- ing the points in this manner (theorem of linear ordering). Let 0 be a point on a line a, and let A, B be two points on the line different from 0, Write A  B when A = B or 0 is not between A and B; write A"" B otherwise. Then  is an equivalence relation between points on the line different from 0; from A"" B, A"" C it follows that B  C. We say that A, B are on the same side or on different sides of 0 on a depending on whether A  B or A"" B. Two subsets a' and an of a defined by a' = {A' I A  A'}, an = {An I A "" An} are called half-lines or rays on a with 0 as the extremity (or starting from 0). Denoting by a, for simplicity, the set of points on a, we have a=a' U {O} U an (disjoint union). U sing axiom II.4, we can also prove the foIl owing: Let a be a line on ex, and let A, B be two points on ex not lying on a. If A = B or if the interior of the segment AB has no point in common with a, we say that A, B are on the same side of a on ex, and write A  B. Other- wise, we say that A, B are on different sides of a on rt. and write A"" B. Then", is an equiva- lence relation between points on ex not lying on mdmA",,A""CBC.T 610 subsets ex'={A'IAA'}, exn={AnIA""An} of ex are called half-planes on ex bounded by a. Again, denoting by ex and a the set of points on ex and the set on a, respectively, we obtain ex= ex' U a U ex n (disjoint union). (III) Congruence axioms: Two segments AB, A' B' can be in a relation of congruence, ex- pressed symbolically as AB=A'B'. (Segments AB and A' B' are then said to be congruent. Since the segment AB is defined as the set {A, B}, the four relations A B == A' B', BA = A' B', AB = B'A', BA = B'A' are equivalent.) This relation is subject to the foIl owing three axioms: (1) Let A, B be two different points on a line a, and Al a point on a line al (al mayor may not be equal to a). Let a'i be a rayon a l starting from AI' Then there exists a unique point BI on a'i such that AB = Al BI' (2) From AIBI =AB and A 2 B 2 =AB follows AIBI = A 2 B 2 . (Hence it follows that = is an equiv- alence relation between segments.) (3) Let A, B, C be three points such that B is between A and C, and let AI' BI' C I be three points such that BI is between Al and C I . Then from AB=AlBI' BC=B I C I follows AC=A I C I . Now let h, k be two different lines in a plane ex and through a point 0, and Jet h', k' be the rays on h, k starting from O. The set of two such rays h', k' is caIled an angle in ex, denoted by L(h', k') or L(k', h'). This angle is also de- noted by LAOB, where A, B are points of h', k', respectively, The rays h', k' are called the sides and the point 0 is called the vertex of this angle. Then h' is a subset of a half-plane on ex bounded by k, and k' is a subset of a palf- plane on ex bounded by h. The intersection of these two half-planes is called the interior of this angle, and the subset of ex-O consisting of points belonging to neither the inside nor the sides of the angle is called the exterior of the angle. Between two angles L(h', k'), L(h'I' k'l) there may exist the relation of congruence, again expressed by the symbol =, as in the case of segments, and subject to the following two axioms: (4) Let L(h', k') be an angle on a plane ex and hi be a line on ex l (ex I mayor may not be equal to ex). Let 0 1 be a point on hi' h'l a rayon ex l starting from 0 1 , and ex'i a half- plane on (XI bounded by hi' Then there exists a unique ray k'i starting from 0 1 and lying in ex'i such that L'(h', k')= L(h'l> k'd. Moreover, L(h',k')= L(h',k') always holds. (Hence it follows that = is an equivalence relation be- tween angles.) (5) Let both A, B, C and AI, Bl> C I be triples of points not lying on a line. Then from AB=AIBI' AC=A I C I , and LBAC= LBIAI C I , it follows that LABC= LAIBI C I . (IV) Axiom of parallels: Suppose that a, b are two different lines. Then it follows from axiom 1.2 that if a and b share a point P, such a point is the unique point lying on both a and 
611 b. In this case we say that a, b intersect at P and write an b = P. On the other hand, if a and b have no point in common and if a, b are on the same plane, we say that a, b are parallel and write a II b. If A, a are on a plane ex and A is not on a, we can prove (utilizing axioms I, II, and III) that there exists a line b passing through A in ex such that a II b. The axiom of parallels postulates the uniqueness of such a b. (V) Axioms of continuity: (1) Let AB, CD be two segments, Then there exist a finite number of points AI, A 2 ".., An on AU B such that CD=AA 1 =A1A2='" =An_IAn and B is be- tween A and An (Archimedes' axiom). (2) The set of points on a line a (again denoted for simplicity by a) is "maximal" in the following sense: It should satisfy axioms 11.1- 11.3, I1I.1, V.1, and the theorem oflinear ordering. If a is a set of points satisfying these axioms such that a=:> a, then a should be =a (axiom of linear completeness). Hence follows the theorem of completeness: the set of points, lines, and planes is maximal in the sense that it is not possible to add further points, lines, or planes to this set with the resulting set still satisfying axioms I-IV and V.l. C. Consistency In formulating the above axioms and proving their consistency, Hilbert assumed the consis- tency of the theory of real numbers ( 156 Foundations of Mathematics). To prove con- sistency, Hilbert constructed a model for the above axioms using the method of analytic geometry. He defined points as triples of real numbers (x I' x 2 , x 3 ), lines and planes as sets of points satisfying suitable systems of linear equations, and relations of ordering, con- gruence, and parallelism in the usual way. It is easy to verify that such a system satisfies all the axioms I-V. Thus the consistency of these axioms is reduced to the consistency of the theory of real numbers ( 35 Axiom Systems). A model for I-IV and V.1 can be obtained in the countable field Ro of all real talgebraic numbers instead ofR. Then Ro can be further restricted to its subfield Po defined as follows: Let F be an arbitrary field, An textension of F of the form F(Jl+}:Z) with AEF is called a Pythagorean extension of F, and F is said to be a Pythagorean field if any Pythagorean exten- sion of F coincides with F (e.g., Ro and Rare Pythagorean). It is easily verified that I-IV are satisfied in the "analytic geometry over any Pythagorean field," On the other hand, we can construct a minimal Pythagorean field con- taining a given field (the Pythagorean closure of the field) in the same way as we construct the talgebraic closure of a field. The field Po is 155 E Foundations of Geometry defined as the Pythagorean closure of the field Q of rational numbers. D. Independence of Axioms In Hilbert's system, the axioms I and II are used to formulate further axioms. On the other hand, it can be shown that each of the groups III, IV, and V is independent from other axioms, The independence of IV is shown by the consistency of non-Euclidean geometry ( 285 Non-Euclidean Geometry). The following model shows the independence of 111.5: In the analytic model for I-V, we replace the defi- nition of distance between two points (X 1 ,X 2 ,X 3 ), (YI,Y2'Y3) by «XI - YI +x 2 - Y2)2 +(x 2 - Y2)2 +(x 3 - Y3)2)1/2. Then I1I.5 does not hold, while all other axioms remain satisfied. The independence of V.2 is shown by the geometry over Ro or Po. The independence of V.1 follows from the existence of the non-Archimedean Pythag- orean field: the Pythagorean closure of any tnon-Archimedean field (e.g" the field of ra- tional functions of one variable over Q with a tnon-Archimedean valuation) is such a field, A geometry in which V.1 does not hold is called a non-Archimedean geometry. E. Completeness of the System of Axioms and Relations between Axioms The tcompleteness of the system of axioms I- V can be shown by introducing coordinates in the geometry with these axioms and represent- ing it as tEuclidean geometry of three dimen- sions. Axiom group V is essential for the intro- duction of coordinates over R. Moreover, we have the following results: (i) The geometry with the axioms I-IV can be represented as "Euclidean geometry" of three dimensions over a Pythagorean field, and the geometry with axioms 1.1- 1.3, II-IV can be represented as "Euclidean geometry" of two dimensions over a Pythagorean field. (ii) The geometry with II, and a stronger axiom of parallels IV* (given a line a and a point A outside a, there exists one and only one line a' passing through A that is parallel to a) can be represented as an taffine geometry over a field K that is not necessarily commutative. (iii) The field K is commutative if and only if the following holds: Suppose that in Fig. I A' U B II AU B', B' U C II B U C. Then it follows that A' U C II AU C (Pascal's theorem). 
155 F Foundations of Geometry .4' () Fig. (iv) The "two-dimensional geometry" with I.1-I.3, II, and IV'" can be embedded in the "three-dimensional geometry" with axioms I, II, and IV* if and only if the following holds: Suppose that in Fig. 2 we have AU B II A' U B', B U ell B' U C. Then it follows that C U A II C U A' (Des argues's theorem). c R o 4.4' Fig. 2 (v) From 1,1-1.3, II, IV*, and Pascal's theo- rem follows Desargues's theorem. (vi) Desargues's theorem is independent of I.1-I.3, II, III.1-1II.4, IV*, and V; that is, we can construct a non-Desarguesian geometry (a geometry in which Desargues's theorem does not hold) in which these axioms are satisfied. Axioms I, II, and IV*, as well as the theo- rems of Pascal and Desargues, are propositions in affine geometry. Each has a corresponding proposition in tprojective geometry, and the results concerning them can be transferred to t he case of projective geometry ( 343 Projec- tive Geometry). F. Polygons and Their Areas Suppose that we are given a finite number of points Ai (i =0,1, .." r) in the geometry with axioms I and II. Then the set of segments (or, more precisely, the union of segments together with their interiors) A i A i +1 (i =0,1, ..., r- 1) is called a broken line joining Ao with Ar. In particular, if Ao = A" then this set is called a polygon with vertices Ai and sides AiAi+l' A polygon with r vertices is called an r-gon. (For r = 3, 4,5,6, r-gons are called triangles, quad- rangles, pentagons, and hexagons, respectively.) A plane polygon is a polygon whose vertices all lie on a plane. A polygon is called simple if any three consecutive vertices do not lie on a line, and two sides A i A i +1 and AjAj+l (i#j) meet only whenj=i+ lor i=j+ I. In this article, 612 we consider only simple plane polygons, and refer to them simply as polygons. tJordan's theorem implies that a polygon in the sense just defined divides the plane into two parts, its interior and its exterior. This special case of Jordan's theorem can be proved by I.1-I.3 and II only. A polygon P is divided into two polygons PI' P 2 by a broken line joining two points on sides of P and lying in the interior of P (Fig. 3). In this case, we say that P is decomposed into PI' P 2 and write P =P I +P 2 , We may again decompose P" P 2 and thereby arrive at a decomposition of the form P = PI + '.. + P k . Axiom III is used to introduce the congruence relation = between polygons. Two polygons P, Q are called de- composition-equal if there exist decomposi- tions P=P l +." +Pk> Q=QI +... +Qk such that PI = Q I' "', P k = Qk' This is expressed by pzQ. We call P, Q supplementation-equal if there exist two polygons P', Q' such that (P+ P')z( Q + Q'), P'zQ'. This will be expressed by PeQ. If we assume IV, we can use result (i) of Section E. Let K be the ground field of the geometry (K is Pythagorean, hence tordered). The area of polygon P is defined as the posi- tive element m(P) of K assigned to P such that m(P+Q)=m(P)+m(Q), and m(P)=m(P') if P=P'. From pzQ or PeQ, it follows that m(P) = m(Q). Under these axioms, it is proved that m(P)=m(Q) implies PeQ. If we also assume V.1, then m(P)=m(Q) implies pzQ. Thus the theory of area of polygon can be con- structed without assuming axiom V.2, though this result cannot be generalized to higher- dimensional cases. For the case of three di- mensions, we can construct two solids of the same volume that are not supp1ementation- equal [2,7]. Fig. 3 G. Geometric Construction by Ruler and a Transferrer of Constant Lengths The geometry with I - IV can be represented as 3-dimensional Euclidean geometry over a Pythagorean field, Conversely, all these axioms are valid in 3-dimensional Euclidean geometry over any Pythagorean field. Thus the minimal system of "quantities" whose existence is assured in geometry with these axioms is the field Po, the Pythagorean closure of Q. Hilbert noticed that the existence of a 
613 geometric object under axioms I-IV can be expressed as its constructibility by ruler (i.e., an instrument to draw a straight line joining two points) and a transferrer of constant lengths. The latter, for a constant length x, is an instru- ment that permits finding the point X on the given ray AB such that AX = x. It is not pos- sible to construct by ruler and transferrer all the points that can be constructed by means of ruler and compass ( 179 Geometric Con- struction). However, it is possible to construct all the lengths h, where ..1. is any element of Po' Hilbert conjectured that an element of Po can be characterized as a ttotally positive algebraic number of degree 2', VE N. This conjecture was proved by Artin [3]. H. Related Topics While Hilbert's foundations are concerned with 3-dimensional Euclidean geometry, it is easy to generalize these results to the case of n-dimensional Euclidean geometry ( 139 Euclidean Geometry). Also, for affine and pro- jective geometries, there are well-organized systems of axioms (at least for the case of di- mensions ? 3). Hilbert [1, Appendix III] showed that plane thyperbolic geometry can be constructed on a modified system of axioms, but for other non-Euclidean geome- tries (in particular, telliptic geometries) there are no known systems of axioms as good as Hilbert's for the Euclidean case. On the other hand, Hilbert [1, Appendix IV] gives another method of constructing Euclidean geometry in characterizing the group of motions as the topological group with certain properties. G. Thomsen [4] rewrote Hilbert's system of axioms in group-theoretical language utiliz- ing the fact that the group of motions is gen- erated by symmetries with respect to points, lines, and planes. Finally, Hilbert's study of the foundations of geometry led him to re- search in the tfoundations of mathematics. References [IJ D. Hilbert, Grundlagen der Geometrie, Teubner, 1899, seventh edition, 1930. [2J M. Dehn, Dber den Rauminhalt, Math. Ann., 55 (1902), 465-478. [3J E. Artin, Dber die Zerlegung definiter Funktionen in Quadrate, Abh. Math. Sem. Univ. Hamburg, 5 (1926), 100-115 (Collected papers, Addison-Wesley, 1965,273-288). [4J G. Thomsen, Grundlagen der Elementar- geometrie, Teubner, 1933. [5J K. Reidemeister, Grundlagen der Geo- metrie, Springer, 1968. 156 B Foundations of Mathematics [6] G. Hessenberg and 1. Diller, Grundlagen der Geometrie, de Gruyter, 1967. [7] V. G. Boltyanskii, Hilbert's third problem, Winston, 1978. 156(1.1) Foundations of Mathematics A. General Remarks The notion of tset, introduced toward the end of the 19th century, has proved to be one of the most fundamental and useful ideas in mathematics. Nonetheless, it has given rise to well-known tpardoxes. Based on this notion, R. Dedekind developed the theories of natural numbers [2] and real numbers [3], defining the latter as "cuts" of the set of rational num- bers. Thus set theory served as a unifying principle of mathematics. It has been noted, however, that some of the most commonly utilized arguments in set theory, which are at the same time the most useful in mathematics and belong almost to the basic framework of formal logic itself, resemble very much those which give rise to paradoxes. This fact has caused many critical mathematicians to question the very nature of mathematical reasoning. Thus a new field, foundations of mathematics, came into being at the beginning of this century. This field was divided at its inception into different doctrines according to the views of its initiators: logicism by B. Russell, intuitionism by L. E. J. Brouwer, and formalism by D. Hilbert. In set theory, which was the origin of this controversy, it was pointed out that the "definition" of set as given by G, Cantor was too naive, and axiomatic treatments of this theory were proposed ( 33 Axiomatic Set Theory). B. Logicism Russell asserted that mathematics is a branch of logic and that paradoxes come from ne- glecting the "types" of concepts. According to his opinion, mathematics deals formally with structures independently of their concrete meanings. Science of this character has been called logic from antiquity. According to him, logic is the youth of mathematics, and math- ematics is the manhood of logic, To construct mathematics from this standpoint, asserted Russell, ordinary language is lengthy and inaccurate, and some proper system of sym- bols should be used instead. Thus he tried to reconstruct mathematics using tsymbolic logic. 
156 C Foundations of Mathematics Attempts to reorganize mathematics using logical symbols had formerly been made by G. Leibniz, who wrote Dissertatio de arte com- binatoria in 1666, as well as by A. de Morgan, G. Boole, C. S. Peirce, E. Schroder, G. Frege, G. Peano, and others. Symbols used by the last two authors resemble those of today. Russell studied these works and published his own theory in a monumental joint work with A. N. Whitehead: Principia mathematica (3 vols., 1st ed. 1910-1913, 2nd ed, 1925-1927), in which the theories of natural numbers and real numbers as well as analytic geometry are developed from the fundamental laws of logic. If this work had been completely successful, it could have eliminated any possibility of the intrusion of paradoxes into mathematics. However, the authors were forced to postulate an "unsatisfactory" axiom in order to con- struct mathematics. They introduced the notion of ttype as follows: An object M defined as the set of all objects of a certain type belongs to a higher type than the types of the elements of M. This serves to eliminate certain paradoxes but brings about inconveniences such as the following. Suppose that we are trying to con- struct the theory of real numbers from that of rational numbers. Each real number can then be considered a tpredicate about rational numbers. If this predicate contains only tquan- tifiers relating to variables running over all rational numbers, then the corresponding real number is said to be predicative, otherwise impredicative. According to Russell, the latter should have a higher type than the former, which makes the theory of real numbers exceedingly complicated. To avoid this dif- ficulty, Russell proposed the axiom of reduci- bility, which says that every predicate can be replaced by a predicative one. With this rather artificial axiom Russell himself expressed dissatisfaction. Russell also postulated the taxi om of infinity and the taxiom of choice, which are also problematic. After examining the philosophical background of the book, H. Weyl wrote about Principia mathematica, "Mathematics is no more based on logic than the utopia built by the logician." Nevertheless, logic as formulated in this book, as well as the theory of types as developed by F. P. Ramsay in the school of Russell and Whitehead, is still an important subject of mathematical logic. C. Intuitionism The intuitionist claims that mathematical objects or truths do not exist independently from mathematically thinking spirit or intui- tion, and that these objects or truths should be directly seized by mental or intuitional activ- 614 ity. The philosophical standpoints of mathe- maticians such as L. Kronecker and H. Poin- care in the 19th century or E. Borel, H. Lebesgue, and N. N. Luzin at the turn of this century can be assimilated to intuitionism, but those of the latter three are often said to be- long to semi-intuitionism or to French empiri- cism. Brouwer took a narrower standpoint, strongly antagonistic to Hilbert's formalism. Today the word "intuitionism" is generally interpreted in Brouwer's sense, Brouwer sharply criticized the usual way of reasoning in mathematics and claimed that indiscriminate use of the law of excluded mid- dle (or tertium non datur) Pv iP cannot be permitted. According to him, the proposi- tion "Either there exists a natural number with a given property P, or else no such num- ber exists" is to be regarded as proved only when an actual construction of a natural num- ber with the property P is given or when the absurdity of the existence of such a natural number can be constructively proved. When neither of these two results can be shown, then one can say nothing about the truth of the above proposition. Thus the usual method of proof, known as the method of reductio ad absurdum, i.e., of proving a proposition P by proving its double negation ii P, is not generally considered valid. It is a difficult but important problem of mathematical logic to determine which parts of usual mathematics can be reconstructed intuitionistically, though it does not seem easy to reconstruct any part of mathematics elegantly from this standpoint. D. Formalism To eliminate paradoxes, Hilbert tried to apply his axiomatic method. From Hilbert's stand- point, any part of mathematics is a deductive system based on its axioms. In the deductive development, however, "logic," including set theory and elementary number theory, is used. Paradoxes appear already in such logic. Hil- bert's idea was to axiomatize such logic and to prove its consistency. Thus one must first formalize the most elementary part of mathe- matics, including logic proper. Hilbert proved the consistency of Euclidean geometry by assuming the consistency of the theory of real numbers. This is an example of a relative consistency proof, which reduces the consistency proof of one system to that of another. Such a proof can be meaningful only when the latter system can somehow be re- garded as based on sounder ground than the former. To carry out the consistency proof of logic proper and set theory, one must reduce it to that of another system with sounder 
615 ground. For this purpose, Hilbert initiated metamathematics and the finitary standpoint. The finitary standpoint recognizes as its foundation only those facts that can be ex- pressed in a finite number of symbols and only those operations that can be actually executed in a finite number of steps. Essentially, it does not differ from the standpoint of intuitionism. The methods based on this standpoint are also called constructive methods. Metamathematics is also called proof theory. Its subject of research is mathematical proof itself. Hilbert was the first to insist on its im- portance. The theory is indispensable for con- sistency proofs of mathematical systems, but it may also be used for other purposes. In fact, the same idea can be seen in the tduality prin- ciple of projective geometry, which dates from long before Hilbert's proclamation of formal- ism. This is not a theorem of projective geome- try deduced from its axioms; rather, it is a proposition about the theorems in projective geometry, based on the type of axioms and proofs in this subject. According to Hilbert's method, one must develop proof theory from the finitary stand- point with the aim of proving the consistency of ax iomati zed mathematics. For this purpose, one must formalize the mathematical theory in question by means of symbolic logic. A theory thus formalized is called a formal system. E. Some Results of Formalist Theory One of the most remarkable results hitherto obtained with Hilbert's method is the con- sistency proof of pure number theory by G. Gentzen [7]. This consistency proof covers the largest domain for which an explicit consistency proof has so far been obtained. However, the methods of formalist proof theory have proved to be most effective in studying the logical structure of mathematical theories and have led to various results on the consistency of formalized mathematical sys- tems, on symbolic logic, and on axiomatic set theory. We give some examples. (1) GOdel's Incompleteness Theorem. K. Godel [6] showed that if a system obtained by for- malizing the theory of natural numbers is consistent, then this system contains a tclosed formula A such that neither A nor its negation I A can be proved within the system. He originally proved this under the assumption that the system is (V-consistent. This is a stron- ger condition for the system than simple con- sistency, but J. B. Rosser [13J succeeded in replacing this by the latter. This result shows the incompleteness not only of the usual 156 E Foundations of Mathematics theory of natural numbers but of any consist- ent theory (from the finitary standpoint) con- taining the theory of these n um bers. At the same time, Godel also obtained the following important result: Let S be any con- sistent formal system containing the theory of natural numbers. Then it is impossible to prove the consistency of S by utilizing only arguments that can be formalized in S, This means that a consistency proof from the fini- tary standpoint of a formal system S inevitably necessitates some argument that cannot be formalized in S. (2) Consistency Proofs for Pure Number Theory. Gentzen [7] called pure number theory the theory of natural numbers not depend- ing on the free use of set theory (differing con- sequently from the usual theory of natural numbers based on tPeano axioms;  294 Numbers) and proved its consistency. W. Ackermann [14] proved the consistency of a similar theory admitting the use of Hilbert's te-symbol. G. Takeuti [15J showed that Gen- tzen's result can be obtained as a corollary to his theorem extending tGentzen's funda- mental theorem on tpredicate logic of the first order to a ttheory of types of a certain kind. According to the result of Godel mentioned in (1) above, some reasoning outside pure num- ber theory must be used to prove its consis- tency. In all consistency proofs of pure number theory mentioned above, ttransfinite induction up to the first te-number eo is used, but all the other reasoning used in these proofs can be presented in pure number theory. This shows that the legitimacy of transfinite induction up to eo cannot be proved in this latter theory. A direct proof of this fact was given by Gentzen [13]. On the other hand, the legitimacy of transfinite induction up to an ordinal num- ber<e o can be proved within pure number theory. Again, transfinite induction is not the only method by which to prove the consistency of pure number theory. Actually, Godel [17] carried out the proof utilizing what he called computable functions of finite type on natural numbers and what we call primitive recursive functionals of finite type. By restricting pure number theory further, one obtains weaker theories of natural num- bers whose consistency can be proved with finitary methods without recourse to such methods as transfinite induction up to eo. M. Presburger [18J proved the consistency of a theory in which only the addition of numbers is considered an operation. Ackermann [19J, J. von Neumann [20], 1. Herbrand [2IJ, and K. Ono [22] proved the consistency of theo- ries in which some restrictions are placed 
156 Ref. Foundations of Mathematics on the use of the axiom of tmathematical induction. On the other hand, K. Schutte [23J gave a consistency prooffor number theory including what he called "infinite induction" from a stronger standpoint than Hilbert's finitary one; he attempted to find a basis that makes such a proof possible. (3) The Consistency of Analysis. No definitive result has yet been obtained from the stand- point of formalism, though many attempts are being made, among which a recent one by C. Spector [24J should be mentioned. (4) Axiomatic Set Theory. There are different kinds of axiom systems ( 33 Axiomatic Set Theory), To give a consistency proof for any of these systems is considered a very difficult problem today, but many interesting results are known concerning the relative consistency or independence of these axioms. (5) The Skolem-Lowenheim Theorem. The metamathematical Skolem - Lowenheim theorem states: Given a consistent system of axioms stated in the first-order predicate logic whose cardinality is at most countable, there always exists an tobject domain consisting of countable objects satisfying all these axioms. For example, axiomatic set theory is stated in predicate logic of the first order, and the cardinality of its axioms is countable. Thus there exists an object domain consisting of countable objects satisfying all these axioms, provided that they are consistent. Such a do- main is called a countable model of axiomatic set theory. On the other hand, from the axioms of this theory one can prove that there exists a family of sets that is more than countable. This should also hold in a model of the theory, in which each object represents a set. This situation is known as the Skolem paradox. This does not imply, however, the inconsis- tency of axiomatic set theory. In fact, the term "countable" is to be interpreted in its math- ematical sense when one says "there exists a family of sets that is more than countable," while it should be interpreted in its metamath- ematical sense when one speaks of a countable model of the theory. It is the confusion of these two different interpretations that leads to the "paradox." (6) Skolem's Theorem on the Impossibility of Characterizing the System of Natural Numbers by Axioms. T. Skolem [27] proved that it is not possible to characterize the system of natural numbers by a countable system of axioms stated in the predicate logic of the first order. More precisely, given any consistent 616 countable system of axioms satisfied by the system of natural numbers, there always exists another tjinearly ordered system satisfying all these axioms and yet not isomorphic to the system of natural numbers as an ordered system. Godel's incompleteness theorem and the Skolem paradox, as well as this result, seem to indicate that there is a certain limit to the effectiveness of the formalist method. On the other hand, nonstandard analysis has originated in this result. References [IJ S. C. Kleene, Introduction to metamath- ematics, Van Nostrand, 1952. [2J R. Dedekind, Was sind und was sollen die Zahlen? Vieweg, 1888. [3J R. Dedekind, Stetigkeit und irrationale Zahlen, Vieweg, 1872. [4J B. Russell, Introduction to mathematical philosophy, Allen & Unwin and Macmillan, 1919. [5J A. Heyting, Intuitionism, North-Holland, 1956. [6J K. Godel, Uber formal unentscheidbare Siitze der Principia Mathematica und ver- wandter Systeme I, Monatsh. Math, Phys., 38 (1931),173-198. [7J G. Gentzen, Die Widerspruchsfreiheit der reinen Zahlentheorie, Math. Ann., 112 (1936), 493-565. [8J D. Hilbert and P. Bernays, Grundlagen der Mathematik, Springer, I, second edition, 1968; II, 1970. [9J A. Tarski, Logic, semantics, metamath- ematics, Clarendon Press, 1956. [lOJ S. C. Kleene, Mathematical logic, Wiley, 1967. [IIJ J. R. Schoenfield, Mathematical logic, Addison-Wesley, 1967. [12J O. Becker, Grundlagen der Mathematik in geschichtlicher Entwicklung, Alber, 1954. [13J J. B. Rosser, Extensions of some theorems of Godel and Church, J. Symbolic Logic, 1 (1936),87-91. [14J W. Ackermann, Zur Widerspruchsfreiheit der Zahlentheorie, Math. Ann., 117 (1940), 162-194. [15J G. Takeuti, On the fundamental conjec- ture of GLC I, J. Math. Soc. Japan, 7 (1955), 249-275, [16J G. Gentzen, Beweisbarkeit und Un- beweisbarkeit von Anfangsfiillen der trans- finiten Induktion in der reinen Zahlentheorie, Math. Ann" 119 (1943),140-161. [17J K. Godel, Uber eine bisher noch nicht benutzte Erweiterung des finiten Standpunk- tes, Dialectica, 12 (1958),280-287. 
617 [18] M, Presburger, Uber die V ollstiindigkeit eines gewissen Systems der Arithmetik ganzer Zahlen, in welchem die Addition als einzige Operation hervortritt, C. R. du I Congres des Math. des Pays Slaves, Warsaw (1929), 92- 101,395. [19] W. Ackermann, Begrundung des "tertium non datur" mittels der Hilbertschen Theorie der Widerspruchsfreiheit, Math. Ann., 93 (1924),1-36. [20] J. von Neumann, Zur Hilbertschen Beweistheorie, Math. Z., 26 (1927),1-46. [21] J. Herbrand, Sur la non-contradiction de l'arithmetique, J. Reine Angew. Math., 166 (1931),1-8. [22] K. Ono, Logische Untersuchungen iiber die Grundlagen der Mathematik, J. Fac. Sci. Univ. Tokyo, 3 (1938),329-389. [23J K. Schutte, Beweistheoretische Erfassung der unendlichen Induktion in der Zahlen- theorie, Math. Ann., 122 (1951), 369-389. [24] C. Spector, provably recursive func- tionals of analysis: a consistency proof of analysis by an extension of principles for- mulated in current intuitionistic mathematics, Recursive function theory, Amer. Math. Soc. Proc. Symp. Pure Math., 5 (1962).1-27. [25J L. Lowenheim, Uber Moglichkeiten in RelativkalkUl, Math. Ann" 76 (1915), 447-470. [26] T. Skolem, Einige Bemerkungen zur axiomatischen Begrundung der Mengenlehre, 5 Congress der Skandinavischen Mathema- tiker, Helsingfors (1923), 217-232. [27] T. Skolem, Dber die Nichtcharakterisier- barkeit der Zahlenreihe mittels endlich oder abzahlbar unendlich vieler Aussagen mit aus- schliesslich Zahlenvariablen, Fund. Math" 23 (1934),150-161. 157 (VI.22) Four-Color Problem A. Brief History It is obvious that four colors are necessary to color some geographical maps on a sphere (or plane), but are four colors sufficient to color every map? This is the so-called four-color problem. The precise formulation of this prob- lem will be given in Section B. The conjecture was made by Francis Guthrie and communi- cated through his brother Frederick Guthrie to A. De Morgan in 1852 [4]. A. Cayley called attention to the problem in 1878. It became famous after J. P. Heawood pointed out a mistake in the proof by A. B. Kempe (1879). Heawood also studied the problem of coloring maps on arbitrary surfaces ( Section E). 157 B Four-Color Problem Though there have been various approaches to the four-color problem, the main stream of investigation has concentrated on obtaining an unavoidable set consisting of reducible ar- rangements ( Section D) in order to correct the mistake made by Kempe. G. D. Birkhoff [7J first discovered the simplest nontrivial reducible arrangement, nowadays called Birkhoff's diamond (Fig. 1). Using all reduc- ible arrangements known up to that time, P. Franklin showed that every map with up to 25 countries must contain a reducible arrange- ment, so that such a map is four-colorable [8]. This limit has been gradually increased; in 1975 a reducible arrangement for 25 countries was found. Fig. I BirkhofT's diamond. H. Heesch invented the method of discharg- ing ( Section D), found criteria for reduci- bility, and finally conjectured the existence of an unavoidable set of reducible arrangements with several thousand elements, but this was too large to construct by hand. W. Haken and K. Appel with J. Koch worked with high- speed computers for a total of 1,200 CPU hours over a period of four and a half years and finally succeeded in constructing and checking an avoidable set of reducible ar- rangements with 1,834 elements, which proved the four-color problem affirmatively (1976; [9, 10J). For early investigations of the four- color problem  [5]; for results up to the 1930s  [3]. B. Precise Formulation of the Problem To formulate the problem precisely, we must state the following two conditions: (i) Every country on a map is a tconnected domain; a connected part of the sea is considered to be a country. (ii) Two countries sharing boundary lines must be colored differently. On the other hand, if two countries share only a finite num- ber of points, then they may share the same color. Actually we can modify the map so that no more than three countries meet at the same point. A map with this condition is called a trivalent or cubic map. In the study of the four- 
157 C Four-Color Problem color problem, we can restrict ourselves to trivalent maps without two-sided countries. In the remainder of this article, we shall assume that the maps considered have these properties. In recent investigations, it has become customary to convert maps into their tdual graphs, where each country is replaced by its capital lying inside, and for each adjacent pair A, B (Fig. 2), the boundary is replaced by a line connecting the representative points of A and B in such a way that it meets the bound- ary only once. " Fig. 2 Dual graph. The formal extension of the four-color prob- lem to higher dimensions is trivial, since we can easily construct arrangements needing arbitrary many colors for coloring. By con- verting to a dual graph, one can see that this is not surprising, since the planarity of a graph is a strong restriction, while the condition of representability of a linear graph in 3- dimensional space imposes no restriction. C. Tait's Algorithm Suppose that a planar trivalent map M is four- colored. Denote the four colors by 0, 1,2, and 3. (In Fig. 3, we use A, B, C, and D instead of 0, 1,2, and 3, respectively). Here we take the operation a EB b -->c. Represent the integers a, b=O, 1,2, or 3 in the binary (2-adic) number system as 2-digit numbers a 1 a o and b 1 b o , where a i and b i are 0 or 1. For each digit we take the operation a i EB bi=c i to be binary Fig. 3 An example of Tait's algorithm, At the vertices, . stands for + and 0 for -. Within the domains, A, B, C, and D stand for four colors corresponding to 0, 1,2, and 3, respectively. 618 addition without carrying, i.e., the operation is that of the logical (exclusive) "or." The number c = c 1 Co in the binary system represents the result of the operation. To each boundary line, we give the num ber corresponding to a EB b, where a and b are the colors for the two coun- tries meeting at the boundary. Then we have the numbering 1,2, and 3 for each edge, where 1,2, and 3 are labeled once and only once for each triple of edges emanating from each vertex. Such numbering for edges is called Tait coloring for edges. Then we give the signature + or - to each vertex, according as the order of the edge numbering is counterclockwise or clockwise. Then the algebraic sum of the signa- tures along the boundary of each country in the map is always a multiple of 3. Conversely, if we give the signature + or - to each vertex in such a way that the algebraic sum of the signatures is always a multiple of 3 along the boundary of each country, then we get a four- coloring of the map by reversing the above procedure. This is called Tait's algorithm, which shows that the four-coloring of a planar map is an tNP problem. As for five-coloring, it is known that an algorithm of polynomial complexity exists. Tait's algorithm also shows that the four- color problem is equivalent to the following apparently elementary geometric proposition: "For any convex polyhedron, we can always cut near some of its vertices in such a way that the resulting polyhedron has only faces such that the number of sides is a multiple of 3." Many other equivalent statements of the four- color problem are known [1,2]. D. Solution of the Four-Color Problem For a given planar trivalent map, denote by v,. the number of the countries with n sides (n? 3). From Euler's theorem on polyhedra, we have immediately the relation L(6-n)V n =12. (1) 11;::3 We easily see from this that every planar map must contain 3-,4-, or 5-sided countries. A family F of the arrangements of countries with the property that every planar map must contain at least one arrangement belonging to F is called an unavoidable set. The family con- sisting of 3-, 4-, and 5-sided countries is the simplest example. In order to obtain an un- avoidable set, Heesch invented the method of discharging. Let us assume the existence of a map M that contains no arrangement of a family F. We give a signed charge of(6-n) to every n-gon in M. Next we divide and move the charges so that the pluses and the minuses cancel out. If all positive charges disappear, 
619 according to the assumption of the existence of the map M, then we have a contradiction of (1), so we can conclude that no such map M exists, A 3-sided country can be ignored in four- coloring. As for a 4-sided country A, Kempe proved (by means of a definite modification procedure) that after four-coloring outside A, we also have a total four-coloring including A. An arrangement of a country or countries A is called reducible if, as in the above case after four-coloring outside A, we get a total' four- coloring including A by suitable modifications. If we have an unavoidable set consisting only of reducible arrangements, then by definition, every planar map will be four-colorable. Kempe believed that he had proved the reducibility of a pentagon (5-sided country), but unfortunately, he missed a particular case. Even though the pentagon is not reducible, much effort has been given to finding other reducible arrangements. Many useful criteria for reducibility have also been studied. If we have enough reducible arrangements, then we can either eventually obtain an unavoid- able set, which proves the four-color problem affirmatively, or find a "minimal counter- example" without reducible arrangements, which disproves it. Haken obtained an unavoidable set consist- ing of arrangements without certain kinds of reduction obstructions. (He called these "geo- graphically good arrangements.") He firmly believed that, applying certain "probabilistic conjectures" to such arrangements, he would conquer the problem by considering arrange- ments of up to 14 countries. With repeated improvements on the discharging process and on the criteria for reducibility, he finally was able to conclude that his speculation was correct, as recounted in Section A. His inves- tigation is not only an example of the use of high-speed computation in pure mathematics, but also one inviting reassessment of the meaning of mathematical proof. E. Coloring Maps on Arbitrary Surfaces On a ttorus, seven colors are sufficient to color any map and that many colors are necessary to color some maps. Heawood (1890) inves- tigated the problem of coloring maps on closed surfaces with tEuler characteristic X < 2, orientable or not. The least number of colors sufficient to color any map on a surface is called the chromatic number of the surface. Heawood proved that the chromatic number is less than or equal to p = [(7 + J 49 - 24X)/2J (X < 2), 158 Fourier, Jean Baptiste Joseph where [ex J means the largest integer a.s; ex. After this was proved in various special cases, 1. W. T. Youngs and G. Ringel finally proved in 1968 that the number p equals the chromatic number except for a Klein bottle (a non- orientable surface with X = 0) [6]. Franklin proved in 1934 that the chromatic num ber for a Klein bottle is 6. References [IJ O. Ore, The four-color problem, Academic Press, 1967. [2J T. Saaty and P. Kainen, The four-color problem-assaults and conquest, McGraw- HilI, 1977. [3J N. L. Biggs (ed.), Graph theory 1736-1936, Oxford Univ. Press, 1976. [4] K. O. May, The origin of four-colour con- jecture, Isis, 56 (1965), 346- 348. [5J W. W. R. Ball, Mathematical recreations and essays, MacmiIlan, 1892; revised twelfth edition, H. S. M. Coxeter (ed.), Toronto Univ. Press, 1958, [6J G. Ringel, Map color theorem, Springer, 1974. [7J G. D. Birkhoff, The reducibility of maps, Amer. J. Math., 35 (1913),115-128. [8J P. Franklin, The four-color problem, Amer. 1. Math., 44 (1922), 225-236. [9] K. Appel and W, Haken, Every planar map is four-colorable I. Discharging, II1inois J. Math., 21 (1977),429-489. [lOJ K. Appel, W. Haken, and J. Koch, Every planar map is four-colorable II, Reducibility, Illinois 1. Math., 21 (1977),490-567. 158 (XXI.24) Fourier, Jean Baptiste Joseph (2) Jean Baptiste Joseph Fourier (March 21, 1768-May 16, 1830) was born in Auxerre France, the son of a tailor, and was orph;ned at the age of eight. In 1790, he was appointed professor at the Ecole Poly technique. In 1798, Napoleon took him on his Egyptian campaign together with G. Monge. On his return to France, he was made governor of the depart- ment ofIsere. With the downfaIl of Napoleon, he lost his position; however, he was later appointed to the French Academy of Science as a result of his research on the transmission of heat. In 1826 he was elected a member of the Academie Franc;aise. His research on heat transmission was begun in 1800. In 1811, he presented a prize- 
158 Ref. Fourier, Jean Baptiste Joseph winning solution to a problem put forth by the Academy of Science. He solved the equation for heat transmission under various tboundary conditions. Fourier also stated (without rigor- ous proof) that an arbitrary function could be represented by ttrigonometric series ( 159 Fourier Series), a statement that gave rise to subsequent developments in analysis. References [IJ J. B. J. Fourier, Oeuvres I, II, G. Darboux (ed.), Gauthier-Villars, 1888-1890. 159 (X.22) Fourier Series Throughout this article, we assume that f(x), g(x), .,. are real-valued functions and that integrals are always Lebesgue integrals. A. Introduction The set of functions I/jh, cosx/;;', sinx/;;', "', coskx/;;', sinkx/;;', ..., which is called the trigonometric system, is an torthonormal system in ( - n, n) ( 317 Orthogonal Functions). Let f(x) be an ele- ment of L 1 ( -n, n) (i.e., tLebesgue integrable in (-n, n)). We put I I n a k =; _/(t)Cosktdt, 1 I n b k =; _/(t)sinktdt, k=O, I,..., and call ak> b k the Fourier coefficients of f The formal series 1 00 -a o + L (akcoskx+bksinkx) 2 kl is called the Fourier series of f and is often denoted by 6(/). To indicate that a formal series 6(/), as above, is the Fourier series of a function f, we write 1 00 f(x)-ao+ L (akcoskx+bksinkx). 2 kl The sign  means that the numbers ak> b k are connected with f by the formula (1); it does not imply that the series is convergent, still less that it converges to f Generally, trigonometric series are those ofform (2), where the ak> b k are arbitrary real numbers. Since the trigono- 620 metric series have period 2n, we assume that the functions considered are extended for all real x by the condition of periodicity f(x + 2n) = f(x). The study of the properties of the series 6(/) and the representation of f by 6(/) are major objects of the theory of Fourier series. Since eix=cosx+isinx, if we set 2c k =a k -ib k , ck=Z\ (k=O, 1,2, ...), we have 1 I n . Ck =- f(t)e -,k' dt, 2n -n k=O, ::1::1,.... Then 6(/) is represented by the complex form L -00 cke ikx , and {e ikX } (k=O, ::1::1, ...) is an orthogonal system in (-n, n). In this complex form, we take symmetric partial sums such as Lk _nCkeikx (n= I, 2, ...). Consider the power series 1ao + Ll (ak- ibk)zk on the unit circle z = e ix in the complex plane. Its real part is the trigonometric series (2), and the imaginary part (with vanishing constant term) is 00 L (aksinkx-bkcoskx), k=l (3) which is called the conjugate series (or al- lied series) of r and is denoted by 6(/). In complex form, the conjugate series is - iL -oo(sgn k)cke ikx . If f and 9 belong to Ll (- n, n) and 00 f(x) L cke ikx , k= - 00 00 g(x)  L dke ikx , k - 00 then I f 2n 00 - f(x-t)g(t)dt L ckdke ikx . 2n 0 k-oo The function (I) 1 f 2n f * g(x)=- f(x-t)g(t)dt 2n 0 is called the convolution of f and g. If f is tabsolutely continuous, then the de- rivative f'(x) satisfies (2) X> f'(x)  i L kCkeikx k= -OJ 00 = L k(bkcoskx-aksinkx). kl If F(x) is an indefinite integral of f, then  C k . k F(x)-cox  C + L.: -;-e' x k= -oc, lk  aksinkx-bkcoskx =c+ L- kl k ' where C is a constant of integration and the sym bol ' indicates that the term k = 0 is omitted from the sum. If fE Ll (-n, n), then the Fourier coefficients 
621 C n converge to 0 as n--> 00 (Riemann-Lebesgue theorem). If f satisfies the tLipschitz condition of order ex (0 < ex.s; 1), then c n = O(n -a), and if f is oftbounded variation, then cn=O(n- 1 ). When fEL 2 ( -n, n), 1 I 2n 00 ; 0 If(xWdx=2 )oo 1ck1 2 1 2  2 2 ==-a o + L.. (a k + bd, 2 kl which is called the Parseval identity. If L 1ck1 2 < 00, then there exists a function f E L 2 ( - n, n) which has the C k as its Fourier coefficients. This converse is implied by the tRiesz-Fischer theorem ( Appendix A, Table 11.1). B. Convergence Tests The nth partial sums sn(x) = sn(x;f) of the Fourier series 6(/) can be written in the form 1 I n Sn(X)=; _/(x+t)Dn(t)dt, where Dn(t) = {sin(n+ 1/2)t}/2sin(t/2). The function Dn(t) is called the Dirichlet kernel. For a fixed point x we set <Px{t) = f(x+ t) + f(x-t)-2f(x); then 1 I n Sn(X) - f(x) =- <Px{t)Dn(t) dt, n 0 Hence if the integral on the right-hand side tends to zero as n......oo, limnoosn(x)==f(x). Iff vanishes in an interval I = (a, b), then 6(/) converges uniformly in any interval I' = (a + [;, b - 8) interior to I, and the sum of 6(/) is 0, This is caned the principle of localization. Here we give four convergence tests. (1) Iff is of bounded variation, 6(/) converges at every point x to the value {I(x+O)+ f(x- O)} /2. In addition, if f is continuous at every point of a closed interval I, 6(/) is uniformly convergent in I (Jordan's test). As a special case of this test, bounded functions having a finite number of maxima and minima and no more than a finite number of points of dis- continuity have convergent Fourier series (Dirichlet's test). (2) If the integral J I <px(t)l/t dt is finite, then 6(/) converges at x to f(x) (Dini's test). (3) If J: l<Px(t)ldt=o(h) and \im r n I<pAt)-<Px(t+ '1)1 dt=O, nO I" t 159 C Fourier Series then 6(/) converges at x to f(x) (Lebesgue's test). Jordan's and Dini's tests are mutually independent, and both are included in Le- besgue's test, which, although not as conve- nient in certain cases, is quite powerful. (4) If f(x) is continuous in (a, b) and its modulus of continuity satisfies the condition w(8) , Jog(1/8) -->0 as 8-->0 in this interval, then 6(/) con- verges uniformly in (a + [;, b - [;) (Dini-Lipschitz test). C. Summability Let sn(x) be the nth partial sum of the Fourier series 6(/), and an (x) = an(x;f) be the first arithmetic mean «C, 1 )-mean) of sn(x) (i.e., an(x) =(sO(X)+SI(X)+... +sn(x))/(n+ 1)). Then we have 1 I n an (x) - f(x) ==- <Px(t) Kn(t) dt, n 0 where K t = ( Sin(n+ l)t/2) ) 2. n() 2(n+1) sin (t/2) The expression Kn(t) is called the Fejer kernel, and the a.(x) are often called Fejer means. If the right and left limits f( x:t 0) exist, 6 (/) is t(C, 1 )-summable at the point x to the value (/(x + 0) + f(x - 0))/2. Iff is contin uous at every point of a closed interval I, 6(/) is uniformly (C, 1 )-summable in I (Fejer theorem, 1904). As we explain in Section H, there exist continuous functions whose Fourier series are divergent at some points. Thus the sum- mability of 6(/) is more important than its convergence. Fejer's theorem remains true if we replace (C, l)-summability by (C, ex)- summability (ex>O). More generally, if fE Ll (- n, n), then 6(/) is (C, ex)-summable for ex >0 to the value f(x) almost everywhere (H. Lebesgue). Since (C,ex)-summabiIity (ex >0) implies tsummability by Abel's method, the result of Fejer's theorem is valid for tA- summability. However, the direct study of A- summability is also important. Let f(r, x) be Abel's mean of 6(/); that is, 1 00 f(r,x)==-ao+ L (akcoskx+bksinkx)r k 2 k=l 1 I n =; _/(x +t)P(r, t) dt, where P(r,t)=(I-r 2 )/2(1-2rcost+r 2 ), O.s; r < 1. We call P(r, t) the Poisson kernel. The function f(r, x) is 1 harmonic inside the unit circle and tends to f(x) as r--> 1 almost every- where. Hence f(r, x) gives the solution of the tDirichlet problem for the case of the unit circle. 
159 D Fourier Series D. The Gibbs Phenomenon Let f(x) be of bounded variation and not continuous at 0 :j(0) = 0, f( +0) = 1 > 0, f( -0) = -I. Then the partial sum sn(x) converges to f(x) in the neighborhood of 0, but not uni- formly. Moreover, 1im Sn (  ) =IG, n-oo n 2 f "Sint G=- -dt = 1.1789.... not Hence as x tends to 0 from above and n tends to 00, the values y = sn(x) accumulate in the interval [0, IGJ, while sn( - x) == - sn(x) in the neighborhood of O. This phenomenon is called the Gibbs phenomenon. If f is of bounded variation, then 6(/) exhibits the Gibbs phe- nomenon at every point of simple discontinuity of f However, the (C, I)-means of 6(/) do not exhibit this phenomenon. E. Conjugate Functions For any integrable f E Lj ( - n, n), the integral J(x) = lIIJ- f ((/(x+t)- f(x- t)) /2 tan)dt exists almost everywhere, The function J(x) is called the conjugate function of f(x). The conjugate series (/) is (C, ex)-summable (ex > 0) to the value J(x) at almost every point, and a fortiori summable by Abel's method. Even if fELl ( - n, n), J does not always be- long to the class Lj ( - n, n). For example, L2 cosnx/logn is the Fourier series of a functionfELj(-n,n), but its conjugate series L2 sin nx/log n is not the Fourier series of a function in Lj (-n, n). However, if both f and J are integrable, 6(/) = 6(]). If f E Lp (p > 1), then JELp and IIJllp.s; Ap Ilfll p ; also, 6(/) = 6(J). If Ifllog+ If I is integrable (such a function is said to belong to the Zygmund class), then J is integrable and 6(/) = 6(J). Moreover, in this case there exist constants A and B such that r 2" e" Jo IJI dx.s;A Jo Ifllog+lfldx+B. If f is merely integrable, so is IJI P for any 0 < p< 1, and IIJllp Bpllfll (O<p < 1). If fELipex (0 < ex < 1), then J E Lip ex, but the theorem fails for ex == 0 and ex = 1. The conjugate function is important for convergence of partial sums of Fourier series, 622 F. Mean Convergence The theorems on conjugate functions enable us to obtain some results for the tmean con- vergence of the partial sums Sn of 6(/). If fE Lp (p> 1), then Ilf -snllp-->O; iff ELI' then Ilf -snllp-->O, IIJ - 5 n ll p -->0 for every 0 < p < 1. Also, if Ifllog+ If I E L j , then Ilf - snll-->O, IIJ - 5 n ll-->0, As a corollary of this result, we obtain the following theorem, which is a gen- eralization of the Parse val identity: If the Fourier coefficients of functions f E Lp and gE Lq (1/p+ l/q= 1) are an' b n and a, b, respec- tively, we have the Parse val formula 1 f 1. 1 00 - f9dx=- 2 aoa+ L (ana+bnb), n 0 n=1 where this series is convergent. G. Analytic Functions of the Class Hp Let p >0. A complex function <p(z) holomor- phic for Izi < 1 is said to belong to the class H p (Hardy class) if there exists a constant M such that I f 2" . 1im 1<p(re '8 )IPdO.s;M. r1 2n 0 When <p(z)EH p , the nontangentia11imit <p(e i8 ) = 1imzeiO <p(z) exists for almost all O. We write this as <p(e i8 ) = f(O) + If(O), where f(O), J(O) belong to the class Lp. Also, ](0) coincides with the conjugate function of f(O) for p  1. For 1 <p< 00, Hp is isomorphic to Lp, but for p= 1 and p= 00, Hp and Lp are different classes. Using the theory of functions of H p' we can discuss some properties of Fourier series. If <p(e i8 ) = f(O) + iJ(O) is of bounded variation, then <p(e i8 ) is absolutely continuous and its Fourier series converges absolutely. We set ( rl ) 1/2 g(O)= Jo (l-r)I<p'(re i8 Wdr , g*(O)= (t (1-r)dr I1. l<p'(rei(o-nw P(r, t)dt) 1/2, where P(r, t) is the Poisson kernel. Then g(O).s; 2g*(0), and there exist constants Ap, B, C, and A such that J:" Ig(O)IPdO.s;A p J:" I <p(e i8)IP dO, p>O, J:" Ig*(O)IP dO.s; Ap J:" I <p(e i8 )IP dO, p> 1, J:" Ig*(O)1 dO.s;B J:" 1<pllog+I<p1 dO+ C, 
623 (h ( en ) # Jo Ig*(O}l#dO.s;A# Jo 1<p(e i8 )ldO , 0<J1<1. Denote by sn(O) and CJn(O), respectively, the partial sums and arithmetic means of the Fourier series of <p(e i8 ), and set ( 00 Is (O)-CJ (0)1 2 ) 1/2 y(O)= L n n n=l n Then O#A I .s;g*(O)/y(O).s; A 2 # 00. From these relations, we can prove that if the indices nk satisfy the conditions fJ> n k +1 Ink > ex > 1,50,(0) converges almost everywhere to <p(e i8 ) for <p(e i8 )EH p (l.s;p). If we set dO)= L'"n'-l +1 c,e iv8 , 15(0) = (Lk'=o lk(OW)I/2, where <p(ei8)Locveiv8, then Ilb(O)llp.s;Apll<pllp (p> 1). If <p(z)EH p (O<p< 1), then Lc n e in8 is (C, p -I - 1 )-summable to <p(e i8 ) almost every- where. These functions and relations were introduced mainly by J. E. Littlewood and R. E. A. C. Paley and were later generalized by A. Zygmund. There are more precise results by E. Stein [7], G. Sunouchi [8], S. Yano [9], and others. H. Almost Everywhere Convergence and Divergence P. du Bois Reymond (1876) first showed that there exits a continuous function whose Fourier series diverges at a point, but the problem of whether Fourier series of continu- ous functions converge almost everywhere (the so-called du Bois Reymond problem) re- mained unsolved for many years. At last in 1966, L. CarIeson [10] proved that the Fourier series of a function belonging to L 2 converges almost everywhere; hence the du Bois Rey- mond problem was solved affirmatively. Using CarIeson's method, R. A. Hunt [12] proved that f 2n ( ) p f 2n o splsn(x)1 dx.s;A p 0 If(x)IPdx, l<p<oo, which implies that the Fourier series of fE Lp (1 < p < 00) converges almost everywhere. Hunt also proved that J:n (sPI(X)I)dX A fnlf(X)I(IOg+ If(x)I)2dx+A. Moreover, P. Sjolin proved that if f 2n o If\ 'log+ Ifl'log+ log+ Ifldx < 00, 159 J Fourier Series then the Fourier series of f(x) converges almost everywhere. On the other hand, A. N. Kolmogorov gave an integrable function with Fourier series diverging everywhere (more precisely, Jim SUPnoo IS2.(x)1 = 00 almost everywhere). Using this example, J. Marcinkiewicz showed that there is an fELl such that sn(x;f) oscil- lates boundedly almost everywhere. Moreover, there exists an integrable function with inte- grable conjugate and almost everywhere diver- ging Fourier series [3]. For any given tnull set E, we can construct a continuous function whose Fourier series diverges on every XE E. 1. Absolute Convergence The convergence of the series (1) L(lanl + IbnD implies the absolute convergence of the trig- onmometric series (2) a o /2+ L(ancosnx+ bnsinnx). Conversely, if the series (2) con- verges absolutely in a set of positive mea- sure, the series (1) converges (Denjoy-Luzin theorem). For the absolute convergence of Fourier series, we have the following tests: If fELipex (ex> 1/2), then 6(f) converges ab- solutely, but for ex = 1/2, this is no longer true. If f(x) is of bounded variation and belongs to Lip ex (ex> 0), 6(f) convcrges absolutely. Suppose that the Fourier series of a function f(x) is absolutely convergent and the value of f(x) belongs to an interval (a, b). If <p(z) is a function of a complex variable holomorphic at every point of the interval (a, b), the F ou- rier series of <p {I(x)} converges absolutely (Wiener-Levy theorem). As a corollary we obtain that if 6(f) converges absolutely and f(x) #0, then 6(1/f) converges absolutely. The converse of the Wiener-Levy theorem was proved by Y. Katznelson [6]. For a given <p(x) defined in [ -1, 1 J, if the Fourier series of <p{l(x)} converges absolutely for every f(x) with absolutely convergent Fourier series (If(x)l.s; 1), then <p(z) is holomorphic at every point of the interval [ -1,1]. Many problems concerning this topic still remain unsolved. In particular, the determi- nation of the structure of the functions with absolutely convergent Fourier series has not been completed. J. Sets of Uniqueness If an cos nx + b n sin nx converges to 0 on a set of positive measure, then an, bn-->O (Cantor- Lebesgue theorem). A point set E c (0, 2n) is called a set of uniqueness (or U-set) if every trigonometric series converging to 0 outside E vanishes identically. A set that is not a U-set is 
159 K Fourier Series called a set of multiplicity (or M-set). G. Can- tor showed that every finite set is a U-set, and W. H. Young showed that every denumerable set is a U-set. It is clear that any set E of posi- tive measure is an M-set, but D. E. Men'shov showed that there are tperfect M -sets of mea- sure O. Moreover, N. K. Bari showed that there exist perfect sets of type U. However, the structure problem of sets of uniqueness has not yet been solved completely. A set E is said to be of type H (or an H-set) if there exists a sequence of positive integers n l < n 2 <... and an interval I such that for each XE E, no point of {nkx}l is in I (mod 2n). H- sets are sets of uniqueness, a fact given by A. Rajchman. I. I. Pyatetskii-Shapiro generalized H-sets to Wm)-sets [3]. Lacunary trigonometric series are series in which very few terms differ from zero. Such series can be written in the form 00 00 L (akcosnkx+bksinnkx)= L An,(x). k=1 k1 S. Sidon established some of the characteristic properties of such series; he generalized them further and obtained the notion of Sidon sets ( 192 Harmonic Analysis). We often define a lacunary series more specifically as a series for which the n k satisfy Hadamard's gaps; that is, nk+1/nk>q> 1. Then ifLda+b) is finite, the series LI An,(x) converges almost every- where. Conversely, if LI An,(x) is convergent in a set of positive measure, then LI (a + b) converges. This theorem is related to the Rademacher series and random Fourier series [4]. K. Multiple Fourier Series Routine extensions to multiple Fourier series from the case of a single variable are easy, but significant results are difficult to obtain. Re- cently, however, there have been several im- portant contributions in this field. References [IJ A. Zygmund, Trigonometrical series, War- saw, 1935. [2] A. Zygmund, Trigonometric series I, II, Cambridge Univ. Press, second edition, 1959. [3] G. H. Hardy and W. W. Rogosinski, Fourier series, Cambridge Univ. Press, 1950. [4J N. K. Bari, A treatise on trigonometric series, Pergamon, 1964. (Original in Russian, 1958.) [5] J.-P. Kahane and R. Salem, Ensembles parfaits et series trigonom6triques, Actualites Sci. Ind., Hermann, 1963. [6] Y. Katznelson, Sur les fonctions operant 624 sur I'algebre des series de Fourier absolument convergentes, C. R. Acad. Sci. Paris, 247 (1958),404-406. [7] E. M. Stein, A maximal function with applications to Fourier series, Ann. Math., (2) 68 (1958),584-603. [8J G. Sunouchi, Theorems on power series of the class HP, T6hoku Math. J., (2) 8 (1956), 125-146. [9] S. Yano, On a lemma of Marcinkiewicz and its applications to Fourier series, T6hoku Math. J., (2) 11 (1959), 191-215. [10] L. Carleson, On convergence and growth of partial sums of Fourier series, Acta Math., 116 (1966),135-157. [11] J.-P. Kahane and Y. Katznelson, Sur les ensembles de divergence des series trigonome- triques, Studia Math., 26 (1966),305-306. [12] R. A. Hunt, On the convergence of Fourier series, Orthogonal expansions and their continuous analogues, Proc. Con£. South- ern Illinois Univ., 1967, Southern Illinois Univ. Press, 1968,235-255. [13] P. Sjolin, An inequality of Paley and convergence a.e. of Walsh-Fourier series, Ark. Mat., 7 (1968), 551-570. [14] R. Salem, Algebraic numbers and Fourier analysis, Heath, 1963. [15] J.-P. Kahane, Some random series of functions, Heath, 1968. 160 (X.23) Fourier Transform A. Fourier Integrals In this article we assume that f(x) is a complex-valued function defined on R = (-00,00) and t(Lebesgue) integrable on any finite interval. If the integral f oo f(x)e-ix'dx= lim f B f(x)e-ixtdx A- - 00 -00 A Boo exists, it is called the trigonometric integral or Fourier integral. We have a general result: Hf(x)ELI( -00,00), K(x) is bounded on (-00,00), and g K(x)dx=o(T) (T --> :too), then J':'oo f(x)K(xt) dx exists and Iim f ro f(x)K(xt)dx=O. t-+co - -00 In particular, it follows that if f(X)E Ld - 00, 00), then J':'oof(x)ei'Xdx exists and ,Yoo f oo f(x)e -itx dx = 0 (Riemann-Lebesgue theorem). 
625 B. Fourier's Integral Theorems Suppose that f(x) is of tbounded variation in an interval including x, or more generally satisfies the assumption for anyone of the convergence tests for Fourier series ( 159 Fourier Series B). Then Fourier's single inte- gral theorem 1 2(f(x+0)+ f(x-O)) 1 f ro sinA(t-x) = Iim - f(t) dt Aoon -00 t-x holds if one of the following three condi- tions is satisfied: (1) f(x)/(1 + Ixl) belongs to LI (-00,00); (2) f(x)/x tends to zero mono- tonically as x--> 1:00; (3) f(x)/x = g(x)sin(px + q), where g(x) tends to zero monotonically as x--> 1:00 (S. Izumi, 1934), The right-hand side of (1) is called Dirichlet's integral. Let f(x) be of bounded variation in an inter- val including x (or satisfy some other conver- gence test for Fourier series). Then Fourier's double integral theorem 1 2(f(x+0)+ f(x-O)) 1 f T f ro =-Iim dt f(u)cost(u-x)du 1! T-+oo 0 - 00 holds if one of the following three conditions is satisfied: (4) f(X)E LI (-00,00); (5) f(x)/(1 + Ixl)E LI (-00,00), and f(x) tends to zero monotonically as x--> 1:00; (6) f(x)/(1 + Ixl)E Ld -00,00) and f(x)=g(x)sin(px+ q), where g(x) tends to zero monotonically as X--> 1:00. If f(X)E Ld -00,00), the formula 1 f ro sin2 A(t-x) f(x) = Iim - f(t) 2 dt Aoo nA -00 A(t-x) holds almost everywhere, and in particular at any x where f(x) is continuous, More gener- ally, the formula f(x) = j A toooo f(t)K(A(t-x))dt = lim f ro f ( x- ) K(t)dt (3) A-a) -00 A holds at any point x where f(x + 0) and f(x- 0) exist, if K(t)E Ld -00,00), J<XJ oo K(t)dt= 1, IK(t)1 <M, K(t)=o(t- I ) as Itl-->oo, and f(X)ELd -00,00), or if K(t)EL 1 ( -00,00), J<XJ oo K(t)dt= 1, and f(x) is bounded. Similarly, the formula lim f ro f (  ) K(X)dX==\JJl{l} f ro K(x)dx AO 0 A 0 holds if \JJl{l} =lim,oo T -I SU(t)dt exists, K(x) is differentiable, Ix 2 K(x)I.s;C (l.s;x), and 160 C Fourier Transform X-I Jolf(t)ldt.s;D, where C, D are constants (Wiener's formula). C. Fourier Transforms (Appendix A, Table 11.11) (1) Letf(x)ELI(O, 00). Then F(t)=  too f(u)cosutdu is called the (Fourier) cosine transform of f(x). Under the same condition as for the validity of (2), the inversion formula of the cosine transform f(x)=Jo F(t)cosxtdt holds where we suppose thatf(x)=!(f(x+O)+ f(x-O)). If we define f( - x) = f(x), then this is equivalent to the formula (2). Analogously, G(t)= l too f(u)sinutdu is called the (Fourier) sine transform of f(x). Under the same condition as for the validity of (2), we get the inversion formula f(x) = Jo G(t)sinxtdt. More generally, for any f(X)ELI( -00,00), 1 f ro . F(t)= M::: f(x)e-'X'dx v 2n - 00 (2) is called the Fourier transform of f(x). Under the same condition as for the validity of (2), the Fourier inversion formula 1 f T . f(x) = M::: lim F (t)e'X' dt v 2n Too -T holds. The cosine transform and the sine transform coincide with the Fourier trans- form when f(x) = f( - x) and - f(x) = f( - x), respectively. If for any f(x), F(t)ELq( -00,00) (l.s;q< 00) exists, for which f ro I 1 f T. I q M::: f(x)e -IX( dt - F(t) dt-->O -00 v 2n -T (T-->oo) is valid (i.e., (l/)J1'd(t)e-iX'dt converges to F(x) as T --> 00 in the mean of order q), then we say that f(x) has the Fourier transform F(t) in Lq( -00,00). Iff(x)EL p ( -00,00) (1 <p.s;2), then f(x) has the Fourier transform F(t) in Lq (l/p+ l/q= 1), and F(t) has the Fourier transform f( - x) in Lp (E. C. Titchmarsh). Moreover, toooo IF(t)lqdx 1 (f rO ) I/(P-I) .s; (2n)(q/2)-1 _ 00 1.f(x)IP dx . 
160 D Fourier Transform Iff(x), G(x)EL p ( -00,00) (1 <p.s;2) and their Fourier transforms in Lq are F(t), g(t), respec- tively, then the Parseval identity foo F(t)G(t)dt= foo f(x)g( -x)dx holds. The Fourier inversion formula holds, in the sense that f x 1 f ro e-iX'_1 F(u)du = M:: f(t)------:-dt, o y' 2n - 00 - It f x 1 f ro eiXU_l f(t)dt= M:: F(u)-----;--du. o y'2n -00 rn The theory of Fourier transforms is valid for cosine and sine transforms. SpecificaIly for p = 2, if fE L 2 (0, 00), then J2F. ff;f(x)' cosxtdt tconverges in the mean in L 2 (0, 00) to the cosine transform F(t) as T --> 00, and con- versely, the cosine transform of F(t) in L 2 is f(x). The transforms f(x), F(x) are connected by the formulas f x Af rO sinxt F(u)du= - f(t)-dt, o not f x Af rO sin xu f(t)dt= - F(u)-du, o n 0 u too If(xW dx = too I F(tW dt. The theory of Fourier transforms was gen- eralized as follows by G. N. Watson (Proc. London Math. Soc., 35 (1933)). We suppose that X(x)/xEL 2 (0, 00) and f ro x(xu)x(yu) . 2 du=mm(x, y). o u If f(X)EL 2 (0, 00), then there exists an F(t)E L 2 (0,00) such that f x f ro x(xu)f(u) F(t)dt= du, o 0 u and the inversion formula f x f ro x(xt)F(t) f(u)du= dt o 0 t and the Parseval identity too If(xWdx= too IF(tWdt hold. F(t) is called the Watson transform of f(x). For any fEL 2 , equality (4) is necessary for the existence of the Watson transform F(t) for which the inversion formula holds. S. Bochner (1934) generalized this theory further to tunitary transformations in L 2 ( 192 Harmonic Analysis). 626 D. Conjugate Functions Corresponding to Fourier's double integral theorem (2), the integral 1 f A f ro Iim - dt sint(u-x)f(u)du l-too 1! 0 -00 is called the conjugate Fourier integral of the integral in the right-hand side of (2). Formally, this is written Iimxoo t- l f(7(I-cOSAt)t-1(f(x + t) - f(x - t))dt. If f(x) is a sufficiently regular function, the part involving COSAt tends to 0 as A--> 00. Now let I , f A f(x+t)- f(x-t) g(x)=- 11m dt. 11; : E t For any fELl (0, 00), the integral exists almost everywhere, and g(x) is called the conjugate function or Hilbert transform of f(x). If f E Lp (p> 1), then g(x)EL p also and we have f( ) - 1 I . f Ag(x+t)-g(X-t) d x - -- 1m t 11; : E t and S'XJ oo Ig(x)IPdx.s; Mpf",oo If(x)IPdx, where Mp is a constant depending only on p. In particular, foo If(xWdx= toooo Ig(xWdx for p=2. E. Boundary Functions of Analytic Functions (4) Suppose that a complex-valued function f(z) (z=x+ iy) is tholomorphic for y>O,f(x+ iy) converges as y-->O for almost all x to f(x) (which is called the boundary function), and f(X)EL p ( -00, 00) (p 1). Moreover, suppose that f(z) is represented by tCauchy's integral formula or tPoisson's integral of f(x) on the real line. If either f(x) E Lp (p  1) and has F(t) as its Fourier transform or f(x) is an Lp- Fourier transform of F(t)ELq (q 1), then a necessary and sufficient condition for the function f(x) to be the boundary function of an analytic function is that F(t) be 0 almost everywhere for t > 0 (N. Wiener, R. E. A. C. Paley, E. HiIle, J. D. Tamarkin). F. Generalized Fourier Integrals Let If(x)I/(l + Ixlk)E Ld -00,00) for a positive integer k and { H (-itx)' L-,lxl.s;l, Lk=Lk(t,x)= FO v! 0, Ixl> I, 1 f en e-itx-Lk Ek(t)=- {(x) . k dx. 2n -ex) (-IX) 
627 The function Ek(t) or one that differs from Ek(t) by a polynomial of degree at most k is called the kth transform of {(x). We write formally f(x) = J'X)if e ix ' d k Ek(t). ActualIy, if we give an appropriate meaning to the integral, this formula itself is valid (H. Hahn, Wiener Berichte, 134 (1925); S. Izumi, Tohoku Science Rep., 23 (1935)). (For the theory and applica- tions of kth transforms  [2, ch. 6].) G. Applications of Fourier Transforms Suppose that f(X)ELd -00,00), F(t) is its Fourier transform, and f(x) == o(e -O(X»), where 8(x) is positive and increasing. If J O(x)x- 2 dx = 00 and F(t) vanishes identically in an interval, then F(t) =0 over (-00,00). If J 8(x)x- 2 dx < 00, then there exists a function fIx) such that F(t) vanishes identicalIy in an interval, but F(t) does not vanish identicalIy in (-00,00) (Wiener, Paley, N. Levinson). These results are applicable to the theory of tquasi- analytic functions. Let f(X)E LI (-00,00). Then a necessary and sufficient condition for any function in Ld -00, ex;) to be approximated as closely as we wish by linear combinations of the trans- lations Lf1 ad(x + hd of f(x) with respect to the Lj-norm is that the Fourier transform of f(x) does not vanish at any real number. When {(X)E L 2 ( -00,00), a necessary and suffi- cient condition that an arbitrary function in L 2( -00,00) can be approximated as closely as we wish by Lfj ak{(x+ hd with respect to the L 2 -norm is that the zeros of the Fou- rier transform of f(x) have measure zero (Wiener). This result was used by Wiener to prove the generalized Tauberian theorem: Suppose that gl (X)E LI (-00,00) and its Fou- rier transform never vanishes. Moreover, let g2(x)ELd -00,00) and p(x) be bounded over (-00, OJ). Then limxoo J'XJex. gdx - t)p(t)dt = AJ'X)oogl(t)dt implies that limxifJ":'oog2(x- t)p(t)dt = A J'X)ex. g2(t)dt. Another type of Wiener theorem is concerned with tStieltjes integrals. Suppose that 00 L sup Igj(x)l<oo n= - cfJ nx<n+l (hence gl (X)E Ld and that the Fourier trans- form of gdx) never vanishes. Moreover, let ex. L sup Ig2(x)1 < 00 n=-oc nx<n+l and let J+jldex(t)1 be bounded, Then ! f:oo gl(x-t)dex(t)=A f:ex. gj(t)dt 160H Fourier Transform implies Iim f ro g2(X- t)dex(t)= A f ro g2(t)dt. xoo -00 - From these general theorems, we can prove various tTauberian theorems about the sum- mation of series. Also, these results were applied to the proof of the tprime number theorem by S, Ikehara and E. Landau (Wiener [IJ). In the general Tauberian theorem, the boundedness of p(t) can be replaced by one-sided bounded- ness (H. R. Pitt, 1938). In fact, the first form of the theorem stilI holds if we replace the con- dition by the folIowing one: gl and g2 are continuous, gl (x) O, the Fourier transform of 9 j (x) does not vanish, g2 (x) satisfies the con- ditions of the second theorem, and p(x)  C. (Concerning Fourier transforms on the topo- logical groups  192 Harmonic Analysis; and concerning Fourier transforms of the distributions  125 Distributions and Hyperfunctions). H. Fourier Transforms of Distributions The Fourier transform is defined in higher- dimension R n by ff(f)=(rn f e-ixY(x)dx, JELl' X( =x I (I +X 2 (2 +... +xn(n' One denotes it by /(). The inverse transform is defined by <( g)=(rn f eiXg(()d(. Differentiation under the integral sign gives D'/()=(rn f e-ix( - ix)"f(x)dx, D'=(% x d'I...(O/ox n )"", under the assumption (1 + Ixl)I'i{(x)ELI (Iexl = ex l + ... + exn)' Roughly speaking, the decreas- ing order of fIx) when Ixl--> 00 is reflected in the differentiability of j@. In the same way, (i)"/(()=(rn f e-iX{D'{(x)dx, which shows that the differentiability of f(x) is reflected in the decreasing order of /( O. The samc statements can evidently be made for the relation between g«() and its inverse Fourier transform. Let .cI' be the space of rapidly decreasing functions <p(x), i.e., such that for alI positive integers k and exO, (l +lxl)kD'<p(x) remains bounded ( 125 Distributions and Hyperfunc- tions). From the facts above, it folIows that the 
160 Ref. Fourier Transform linear mappingf(x)f->ff(f) is a topological isomorphism from  onto ,'/;;. Let Y' be the dual space of Y. Usual function spaces in classical analysis are contained in Y', for instance, the Lp-space and that of functions increasing at most of polynomial order at infinity. Let TEY;. Then the mapping <pE,,/;;f-> T(:? <p) is continuous, and ff TE Y is defined by ffT(<p)= T(ff<p). ff is also a topological isomorphism from Y; onto g. Take as an example the distribution p.v.l/x. This is no longer a function; however, its Fourier trans- form can be calculatt:d as follows: f . dx 2i lim (Jh)-l e-'x_=_ £-0 x Aoo p,,;lxl";A j2; . f A sin(x) { Ai, (>0, xhm -dx= A A-+oo 0 X 1! - 2i, <o, where the limit is taken with respect to the topology of Y'. Using the definition above, classical results can be extended to /1" almost automatically. For example, :#;(D'T)=(i()"ff(T), ff(( -ix)"T) =D':?(T). Moreover, if TErff', then ffT= Jh -n T(e-ix). In particular,, b=(2nrn/2. The Plancherel theorem says that if for f(X)E L 2 one defines its Fourier transform by j«()=I.i.m.j2;-n r e-iXf(x)dx, A--+ex:, JlxlA then the correspondence f(x)f-> j@ is a uni- tary mapping from L 2 onto L2' i.e., flf(XWdX= f,j(Wd. This result can be extended. For any non- negative integer m, the element of Hm ( 168 Function Spaces) is characterized by its Fourier transform: f(X)EH m if and only if (I +1ltj«()EL2' Furthermore, for arbitrary real S, the space H S can be defined as the set of alI elements of [I" whose Fourier transform j«() satisfies (1 +lw s j(()EL 2 . H' and H- S are dual to each other. For fELl' gEL 2 , or f, gEL 2 , the convo- lution makes sense as a function, and it holds that ff(f * g)=ff(f)ff(g). This relation can be extended to distributions, to state ff(S* T)= (ffS)(ff T). This holds for (S, T)E rff' x [I", £02 X £02 (£02 is the dual of £0 L ,j, etc. Fourier transforms are also often defined by j()= f e-iXf(x)dx or = f e- 2nix {f(x)dx. 628 In the former case, the inversion formula takes the form f(x) = (2nrn f eixq@d, and the Parse val identity becomes flf(XWdx=(2nrn flj(Wd(. References [IJ N. Wiener, The Fourier integral and cer- tain of its applications, Cambridge Univ. Press, 1933, [2] S. Bochner, Vorlesungen liber Fouriersche Integrale, Akademische- Verlag, 1932; English translation, Lectures on Fourier integrals, Ann. Math. Studies, Princeton Univ. Press, 1959. [3] E. C. Titchmarsh, Introduction to the theory of Fourier integrals, Clarendon Press, 1937. [4J R. E. A, C. Paley and N. Wiener, Fourier transforms in the complex domain, Amer. Math. Soc. ColIoq. Pub!., 1934. [5] S. Bocher and K. Chandrasekharan, Fourier transforms, Ann. Math. Studies, Princeton Univ. Press, 1949. [6J M. J. LighthilI, Introduction to Fourier analysis and generalized functions, Cambridge Univ. Press, 1958. [7J I. M. Gel'fand and G. E. Shilov (Silo v), Generalized functions I, Academic Press, 1964. (Original in Russian, 1958.) [8J S, Bochner, Harmonic analysis and the theory of probability, Univ. of California Press, 1955. [9J R. R. Goldberg, Fourier transforms, Cam- bridge Tracts, Cambridge Univ. Press, 1961. [lOJ T. Carleman, L'integrale de Fourier et questions qui s'y rattachent, Almqvist and Wiksell, 1944. [IIJ L. Schwartz, Theorie des distributions, Hermann, 1966. For formulas for Fourier transforms  refer- ences to 220 Integral Transforms. 161 (IV.3) Free Groups A. General Remarks A group F is called a free group if it is the tfree product ( 190 Groups M) oPinfinite cyclic groups G l , "', G n generated by ai, ..., an, re- spectively. Then n is called the rank of F. A 
629 free product of tsemigroups is defined similarly to that of groups, and the free product of infinite cyclic semigroups G i = {I, a" af, ... } (i = I, ..., n) is called a free semigroup gener- ated by n elements a i (i = 1, "', n). If a group G is generated by subgroups Hi (i = I, ... , n) isomorphic to G i , then G is a homomorphic image of the free product of the groups G i . A subgroup # {e} of the free product F of groups G i is itself the free prod- uct of a free group and several subgroups, each of which is conjugate in F to a subgroup of some G j (A. G. Kurosh, 1934). Notably, a subgroup # {e} of a free group is itself a free group (0. Schreier, Abh. Math. Sem. Univ. Hamburg, 5 (1927)). A subgroup of index) of a free group of rank n is a free group of rank 1 + j(n - I) (Schreier). Let F be the free group generated by n ele- ments ai, "', an, and let G be a group gen- erated by n elements bJ, "', b.. Then there is a homomorphism of F onto G. Let N be its kernel. If the class of a tword w(a t, ... ,an) belongs to N, then we have w(b l , ... , b n ) = 1. We caIl w(b l , ..., b n ) = 1 a relation among the generators b l , ..., b n . If N is the minimal normal subgroup of F containing the classes of words WI (at, ...,a n ), ..., wm(a l , ...,a n ), then the relations WI (b l , ..., h n )= I, ..., wm(bJ, ..., b n )= I are called defining relations (or funda- mental relations). If generators at, ..., an and words WI (at,..., an),"', wm(a l , ..., an) are given, then there is a group generated by ai' ..., an with defining relations WI (ai' ..., a n )= 1,..., wm(a l ,..., a n )= 1. In fact, let F be the free group generated by ai' ..., an and N the minimal normal subgroup containing the classes of words WI (a I , ..., an), ..., wm(aJ,"', an)' Then the factor group FjN is such a group. A free group is a group with an empty set of defining relations. In the preceding discussion, nand m are not necessarily finite. If both n and m are finite, then G is called finitely presented. B. The Word Problem If a finitely presented group G is given, then a general procedure has to be determined by which it can be decided, in a finite number of computational steps, whether a given word equals the identity element as an element of G. This is called the word problem ( 190 Groups M). A solution to the word problem does not always exist (P. S. Novikov [8], 1955); in fact, there is a group with two generators and 32 defining relations for which the word problem cannot be solved (W. Boone [7J). However, it was shown by V. A. Tartakovskii that the problem can be solved for a large class of 161 Ref. Free Groups groups. W. Magnus (1931) showed that it is solvable for any group with a single defining relation. The word problem is an example of decision problems ( 97 Decision Problem). The word problem for groups is closely related to that for semigroups (A. M. Turing, 1937; E. L. Post, 1947; A. A. Markov, 1947). Similar problems for other algebraic systems can also be considered. The problem of determining a general procedure by which it can be decided, in a finite number of steps, whether two given words interpreted as elements of G can be transformed into each other by an (inner) automorphism of G is called the transforma- tion problem. Let F be a free group of rank nand F = FI =:> ... =:> Fr =:> F r +1 =:> ... be the tlower central series of F. Then Frj Fr+1 is a tfree Abelian group of rank J1n(r)==(Ijr)LdlrJ1(rjd)nd, where J1 is the tM6bius function (E. Witt). The intersec- tion of all subgroups of F of finite index is the identity element. C. The Burnside Problem The original problem of Burnside is: If every element of a group G is of finite order (but not necessarily of bounded order) and G is finitely generated, is G a finite group? E. S. Golod [6] (1964) showed that this problem for p-groups has a negative solution. The following is the more usual form of the Burnside problem: If a group G is finitely generated and the orders of elements of G divide a given integer r, is G finite? Let F be a free group of rank n, N be the normal subgroup of F generated by all the rth powers x r of elements of F, and B(r, n)= FjN. Then the problem is the same as the question of whether B(r,n) is finite. For r= 2,3,4,6 the group is certainly finite (I. N. Sanov, M. Hall). The restricted Burnside prob- lem is the question whether the orders of finite factor groups of B(r, n) are bounded. It was solved affirmatively for r a prime (A. I. Kostrikin [5J, 1959), A group generated by two generators x, y and satisfying the relations x U = yV=(xy)w= 1 (where u, v, ware integers) is infinite if Iju+ 1jv + 1 jw - I :( 0, and is of order 9 if 0 < 1 ju + 1 jv + 1jw-l =2jg. There is also a finitely presented group which is isomorphic to its proper factor group (B. H. Neumann). References [1] A. G. Kurosh, The theory of groups I, II, Chelsea, 1960. (Original in Russian, 1953.) [2] M. Hall, The theory of groups, Macmillan, 1959. 
162 Functional Analysis [3] H. S. M. Coxeter and W. O. 1. Moser, Generators and relations for discrete groups, Erg. Math., Springer, 1957. [4] W. Magnus, A. Karrass, and D. Solitar, Combinatorial group theory, Interscience, 1966. [5J A. I. Kostrikin, The Burnside problem, Amer. Math. Soc. Trans!., (2) 36 (1964),63- 100. (Original in Russian, 1959.) [6J E. S. Golod, On nil-algebras and finitely approximable p-groups, Amer. Math. Soc. Trans!., (2) 48 (1965), 103-106. (Original in Russian, 1964.) [7] W. W. Boone, The word problem, Ann. of Math., (2) 70 (1959), 207-265. [8] P. S. Novikov, On the algorithmic un- solvability of the word problem in group theory, Amer. Math. Soc. Trans!., (2) 9 (1958), 1-122. (Original in Russian, 1955.) Also  references to 190 Groups. 162 (XII.1) Functional Analysis The origin of functional analysis can be traced to the 1887 work of V. Volterra. He stressed the important notion of operations or opera- tors, that is, generalized functions of which the domains as well as the ranges are sets of functions. A typical example is the operator assigning to a function f its derivative t. An operator is called a functional if its values are numbers, as in the case of the operator assign- ing to a function f the value f'(a) or the value If(t)dt. In 1896, Volterra considered the operator mapping a continuous function f to a continuous solution <p of the integral equa- tion f(x) = <p(x) - I: K(x, y)<p(y)dy, where K(x, y) is a continuous function. Defining I<p=<p and (K<p)(x) = I: K(x,y) <p(y)dy, he showed that <p is given by <p =(1 - K)-lf = f + Kf +K2f +'''' where Kn.f=K(Kn-lf). Following this lead, I. Fredholm studied in 1900 the integral equation f(x) = <p(x)- ), Ii K(x, y)<p(y)dy containing a parameter Ie. He proved the so-called talternative theorem: For a given Ao, the operator equation (I - }'oK)<p = f, (K<p)(x) = Ii K(x, y)<p(y)dy, either admits a uniquely determined continuous solution <p for every continuous function f or else (I - AoK)<po =0 admits a nontrivial con- tinuous solution <Po # O. D, Hilbert discussed (1904-1910) a tcontinuous linear operator K defined on the tHilbert space L 2 with values in L2' and he called a complex number ..1.0 a tspectrum of K if (I - }'o K) does not have a continuous linear inverse, He proved that if K is tHermitian, then K admits a tspectral resolution with real spectra only. One of his 630 outstanding contributions was the discovery of the tcontinuous spectrum. In 1918, F. Riesz proved that Fredholm's alternative theorem holds for a tcompletely continuous linear operator in the space of continuous functions, and later the result was extended to Banach spaces. In 1932, three important books by S. Banach [1],1. von Neumann [2J, and M. H. Stone [3J were published. These books treated tclosed linear operators that are not necessarily con- tinuous. The notion of tBanach space was introduced: a tnormed linear space complete with respect to the distance dis(x, y) = Ilx- y II. By making use of the tBaire- Hausdorff theorem and the tHahn-Banach theorem, Banach proved, for closed linear operators in Banach spaces, the fundamental principle consisting of the topen mapping theorem, the tclosed graph theorem, the tuniform bounded- ness theorem, the tresonance theorem, and the tclosed range theorem. These theorems were modified to be applicable in locally convex topological linear spaces by the Bourbaki group beginning in the late 1940s. In 1929, von Neumann proved, as a mathe- matical foundation of quantum mechanics, that a closed linear operator T in a Hilbert space admits spectral resolution with real spectra if and only if T is a tself-adjoint opera- tor ( also Stone [3]). The condition that a closed linear operator is a tfunction of a self- adjoint operator was given by von Neumann, F. Riesz, and Y. Mimura (1934-1936). K. Frie- drichs (1934) proved that a tsemibounded linear operator admits a self-adjoint extension, T. Kato [4] (1950) proved that a Schrodinger- type Hermitian operator is tessentially self- adjoint. Yon Neumann's tmean ergodic theorem in the Hilbert space (1932) was extended to Banach spaces by K. Y osida, S. Kakutani, and Riesz in 1936. G. D. Birkhoff's tpointwise ergodic theorem (1931) was extended by N. Wiener (1939), Y osida (I940), E. Hopf (1954), N. Dunford (1955), R. V. Chacon and D. S. Ornstein (1960), and others in various ways. The t Abelian ergodic theorems were discussed by E. Hille and R. S. Phillips [5J and Yosida [6]. The notion of tBanach algebra was intro- duced by M. Nagumo in 1936. I. M. Gel'fand proved that a commutative Banach algebra with multiplicative unit e satisfying Ilell = 1 (= the normed ring) admits a representation by an algebra of complex-valued continuous functions (1941). The treflexivity as well as the tduality of Banach spaces were studied by S, Kakutani (1939), V. L. Shmul'yan (1940), and W. T. Eberlein (1941). 
631 The notion of tvector lattice (= tRiesz space) was introduced in analysis by Riesz (1930). This was followed by the work of L. V. Kantrovitch (1935) and H. Freudenthal (1936). Kakutani gave two standard types oftBanach lattice: the tabstract (M) space (1940) and the tab- stract (L) space (1941). M. Krein and S. Krein (1940) and Yosida and M. Fukamiya (1940) discussed the (M)-type vector lattices, and Yosida (1941), the (L)-type vector lattices. H. Nakano (1940-1941) and T. Ogasawara and F. Maeda (1942) studied the spectral resolu- tion of the Banach lattice. The connection between tBrownian motion and tpotential theory was clarified by Kaku- tani (1942), and extended by J. L. Doob (1956) and G. A. Hunt (1957-1958). The tone-parameter semigroup of continu- ous linear operators in Banach spaces was studied by Hille and Yosida, and they gave in 1948 a characterization of the tinfinitesimal generator of such semigroups. Its dual was given by Phillips (1955). The one-parameter semigroup of nonlinear tcontractive operators in Hilbert spaces was studied by Y. K6mura (1967), who obtained a nonlinear version of the HilIe- Y osida theorem. This has been ex- tended considerably in Banach spaces by many scholars, e.g., Kato (1967), M. G. Cran- dall and A. Pazy (1969), Crandall and T. Lig- gett (1971), I. Miyadera and S. Oharu (1970), H. Brezis (1973), p, Benilan (1973), and others. In 1936, S. Sobolev gave a generalization of the notion of functions and their derivatives through integration by parts. This general- ization has been extended by L. Schwartz (1945-) [9J to the notion oftdistributions, which are continuous linear functionals defined on the function spaces (Rn) and 9'(R n ), and this extension gives, e.g., a reasonable inter- pretation of Dirac's b-function. Since 1959, Gel'fand [lOJ has been publishing, with his collaborators, books on the distribution the- ory pertaining to function spaces other than (Rn) and 9'(R n ). M. Sato introduced (1959- 1960) [11J the theory ofthyperfunctions, as a generalization of distribution theory. In the case of one independent variable, a hyper- function f may be defined as a generalized boundary value on the real axis R of a holo- morphic function F defined in CI-R I . Hyper- function theory has been refined to tmicro- local analysis and studied extensively by Sato, A. Martineau, H. Komatsu, P. Schapira, T. Kawai, M. Kashiwara, M. Morimoto, A. Ka- neko, and others ( [12J). References [1] S. Banach, Theorie des Operations Lin- eaires, Warsaw, 1932. 163 A Functional-Differential Equations [2J 1. von Neumann, Mathematische Grund- lagen der Quantenmechanik, Springer, 1932. [3J M. H. Stone, Linear transformations in Hilbert spaces and their applications to analy- sis, Amer. Math. Soc., 1932. [4J T. Kato, Perturbation theory for linear operators, Springer, second edition, 1976. [5J E. Hille and R. S. PhiIlips, Functional analysis and semigroups, Amer. Math. Soc., 1957. [6J K. Yosida, Functional analysis, Springer, sixth edition, 1980. [7J N. Dunford and 1. Schwartz, Linear opera- tors, Interscience, I, 1958; II, 1963; III, 1971. [8J H. Brezis, Op6rateurs Maximaux Mono- tones et Semigroupes de Contractions dans les Espaces de Hilbert, American Elsevier, 1973. [9J L. Schwartz, Theorie des Distributions, Hermann, 1966. [10] I. M. Gel'fand, Generalized functions 1- m (with G. E. Shilov), IV (with N. Va. Vilen- kin), V (with M. I. Graev and N. Va. Vilenkin), Academic Press, 1964-1966. [11J M, Sato, Theory of Hyperfunctions I, II, J. Fac. Sci. Univ. Tokyo, sec. I, 8 (1959), 139- 193; 8 (1960), 387-437. [12J M. Sato, T. Kawai, and M. Kashiwara, Microfunctions and pseudodifferential equa- tions, Lecture notes in math. 287, Springer, 1973. 163 (XIII.16) Functional-Differential Equations A. General Remarks In many models, it is assumed that the future behavior of a system under consideration is governed only by its present state and not by past states. However, for various systems arising in practical problems we cannot ignore the effect of the past on the future. Such a phenomenon is often observed in population problems, epidemiology, chemical reactions, system engineering, and so on. The description of such phenomena may involve difference-differential equations x(t)= f(t, x(t), x(t - hd, ..., x(t - h m )), (1) or integrodifferential equations x(t) =g(t, x(t)) + f>(t, s, x(t), x(s))ds. (2) An enormous variety of equations is discussed in the literature, but most of them can be 
163 B Functional-Differential Equations expressed in the form x(t) = f(t, x(s», which is usually called the functional differen- tial equation; here s varies in an interval I, and f depends on the function x of I,. If t is the right or left endpoint of the interval I" then (3) is said to be of retarded or advanced type, respectively. Most of the results obtained have been from work on equations of retarded type. In this case, the maximal length of the interval I, is called the retardation (delay, lag, deviation, etc.), and (3) is often called the retarded dif- ferential equation, delay differential equation, or differential equation with lag (retardation, deviating argument). B. Historical Remarks Functional differential equations of a sort were studied by Johann Bernoulli in the 18th cen- tury in connection with the string problem. Since then, a good deal of work has been done in this field by many people; some of this work was done before the beginning of this century. Among the investigators, Volterra [1, 2J is noteworthy for his systematic study on rather general equations related to problems of predator-prey populations and viscoelasticity, though his results were ignored by his con- temporaries. In the early 1940s, Minorsky [3], in his famous study of ship stabilization, pointed out the importance of delay effects in control theory, with many of the modern issues first appearing in his work [4]. Mishkis [5] studied linear systems extensively, and Driver [6] gave a unified representation for functional differential equations. Important achievements were given by Krasovskii [7J and Bellman and Cooke [8]. They laid the foundation for the qualitative theory of func- tional differential equations. Inspired by these works, many books (such as [9-14]) were published. Owing to these books together with many articles on this field, the theory of func- tional differential equations has become an important branch in the th.:ory of differential equations. Equations of advanced type are treated in, e.g., [5, 8J, but qualitative theory for them has hardly been established. The equation x(t)=ax(t)+bx(At) on O.s;t< 00 (4) is of advanced type if ..1. > 1, and its analytic solution has been studied in detail. Equation (3), whose right-hand side involves differential operators, e.g., x(t) = f(t, x(t), x(t - h), x(t - h)), is said to be of neutral type [5,8, 14J, and is generally considered to be of neither retarded nor advanced type because of its distinctive 632 (3) features. However, Hale [14] originated the study of a class of equations of neutral type for which the general qualitative theory can be developed in the same manner as for equations of retarded type. For the case of infinite retardation, proper choice of the space of functions x contained in the right-hand side of (3) is of great signifi- cance. A general treatment of the phase space for equations with infinite retardation in an axiomatic setting is given in [15]. C. Phase Space For simplicity, let I, always be the interval [t - h, tJ with a finite retardation h > O. A solution of (3) starting at t = r is a function defined on [r - h, a) for a> r such that the solution coincides with a preassigned function (the initial function) on [r - h, r J and is con- tinuously differentiable and satisfies (3) on [r,a). Since a solution starting at t=r must be continuous for tr, it is quite natural, as is done in much of the literature in order to develop a qualitative theory, to choose the space C([t - h, tJ, R n ) of continuous Rn-valued functions on [t - h, tJ as a space of functions x involved in the right-hand side of (3). Intro- ducing a symbol x, which represents an ele- ment ofC==C([ -h,OJ,Rn) defined by x,(s) = x(t + s) (s E [ - h, OJ), we can rewrite (3) in a more convenient form: x(t)=f(t,x,), (E) where the function f(t, <p) is defined on R x C (or on its subspace). Here C is considered to be a Banach space with the norm 11<p11 = maxSE[-h,o]I<p(s)l, 1'1 being a norm in Rn. The space C is said to be the phase space of (E), and the initial condition can be written as x,=(, (r,)ERxC. (5) D. Initial Value Problem Let f(t, <p) be a continuous functions defined on an open domain Dc R x C. The initial value problem is to find a solution of (E)-(5) for a given (r, () ED. Many fundamental results hold as for ordinary differential equations: (a) There always exists a solution of (E)-(5) under the continuity of f on D. Here, the solution means the one to the right. No general existence theorem is given for the solution to the left. (b) The solution of (E)-(5) is unique, if f satisfies the Lipschitz condition: If(t, <p)- f(t, if1)I.s; L 11<p -1/111 in a neighborhood of each point of D, where L is a constant depending on the neigh- borhood. (c) Iff varies in the space C(D, R n ) 
633 equipped with the compact open topology and if the solution of (E)-(5) is unique for every (r, () E D, then the solution is continuous as a function of (r, J). (d) Iff is completely con- tinuous on D, that is, f is a continuous map- ping of bounded subsets of D into bounded sets in R n , then every solution can be con- tinuab1e to the right as long as it remains bounded or stays away from the boundary of D. This assertion is no longer true in the absence of complete continuity. To prove the existence theorem, Schauder's tfixed point theorem is utilized; the Picard successive method is also effective under the Lipschitz condition. Consider the case when f(t, <p) can be writtn as f(t, <p)=g(t, <p(0), <p), where g(t, x, <p) is defined for (t, x, <p) E R x R n x C, and g(t,x,<p)=g(t,x,ijJ) if <p(s)=ijJ(s) on [-h, -15J, 15>0. Equation (1) is one such case, where h==maxh k , 15=minhx' Then, g(t,x,x,) is a function of (t, x) alone on [r, 't' + 15J under the given condition (5), that is, (E) is reduced to an ordinary differential equation. This makes it possible to find a sol ution of (E)-( 5) by matching successively the solutions of ordi- nary differential equations on [r, r + 15J, [r + 15, r + 215J, "" This is the step-by-step method, which is effective even for equations of neutral type with the same property. Under uniqueness, the solution x(t) of (E) induces a mapping T(t, r): G(t, r)-->C, t ?n, which maps X,EG(t, r) to X,EC, where G(t, r)c C is the set of  for which the solution of (E)- (5) is continuable up to t. Then T(t,s)T(s, r)= T(t, r) for tsr, (6) and T(t, 't') is strongly continuous in t. If (E) is autonomous, that is, f(t, <p) is independent of t, then there exists a mapping T(t), tO, satisfy- ing T(t-r)= T(t, r), and {T(t)},,,,o becomes a one-parameter semigroup. E. Linear System When f(t, <p) is continuous on I x C for an interval I and is linear in <p, equation (E) is said to be a linear system (denoted by (L)). In this case, f(t, <p) satisfies the Lipschitz con- dition with L= L(t) on I x C for a continuous function L(t) of I, which also implies that f is completely continuous. Thus the solution of (L) -(5) uniquely exists over [r, 00) n I for any ('t', () E I x C, and the mapping (the fundamental or solution operator) T(t, r) is a tbounded linear operator on C for any t, rEI, tr, and tcompact if t  r + h. T(t, r) corresponds to the fundamental matrix for ordinary linear dif- ferential equations, but it is not invertible in general. A continuous function f(t, <p) linear in <p 163 F Functional-Differential Equations may be expressed in the form f(t, <p) = fh [d s '1(t, s)J <p(s), (7) where '1(t, s) is an n x n matrix defined on I x [ - h, OJ, which is measurable in (t, s) and of tbounded variation in s with a ttotal variation less than L(t), and the integration is that of StieItjes with respect to s. From this fact it foIlows that the range of the initial functions can be extended to the space of tpiecewise- continuous functions, and the solution x(t) of the nonhomogeneous linear system x(t)= f(t, x,) + p(t) through (r, () E I x C can be rep- resented by the constant variational formula x(t) = [T(t, r)(J (0) + f [T(t, s)r] (O)p(s)ds, t  't', where r(s)= E (the unit matrix) for s =0, and r(s) = 0 (the zero matrix) for s < O. F. Autonomous Linear System Let (L) be an autonomous linear system x(t)= f(x,). (AL) In this case, (7) is not different from the Riesz representation theorem f(<p) = Jh[d'1(s)J<p(s). For (AL) the fundamental operator T(t) plays an extremely important role. As was seen, {T(t)},,,,o is a one-parameter semigroup of bounded linear operators T(t) which is strong- ly continuous in t O and compact for t h. Thus the asymptotic behavior of the solutions of (AL) are determined by the distribution of the spectra O'(A) of the infinitesimal generator A of T(t), which is given by A<p = cjJ with the domain D(A)= {<pECI cjJEC and cjJ(O)=f(<p)}, The properties of T(t) assert that: (a) O'(A) consists of point spectra alone. (b) The number of A,= {AEO'(A)IRdex} is at most finite for any exER. (c) For every AEO'(A) the dimension of the generalized eigenspace of ..1. is finite. (d) The spectra of A coincide with the roots (char- acteristic roots) of the characteristic equation of (AL), de{ AE - fh e As d'1(S)] ==0, together with their multiplicities. (e) e A ' E O'(T(t)), t O, if and only if AEO'(A). Let P., ex E R, be the linear space spanned by the generalized eigenfunctions corresponding to a AEA,. Then, (a) P, is invariant under T(t), that is, T(t)P,cP, for tO; (b) the restriction of T(t) to P, is invertible and hence extend- able over tER; (c) if (EP., then [T(t)(J(O)= LA€A,PA(t, )eAt, where PA(t,) are polynomials 
163G Functional-Differential Equations in t and linear in ; (d) there is a direct-sum decomposition C = P, + Q, such that Q, is in- variant under T(t), the projection along Q,n,: C-->P, is bounded, and there exist positive constants K, I: for which II T(tj( 11.s; Ke(,-tj' II (II, t  0 if (E Q,. Hence for any  E C the solution x(t) of(AL) through  at t=O satisfies I X(t)- L pAt, n,)eA' I .s; K(1 + Iln,ll)e(H)' IIII, lEArI; tO, (8) where Iln,11 denotes the operator norm of n,. The set Q = n'ER Q, may contain an element ( other than the zero element, but T(tj( must be identically zero for t  hn. For a linear system (L) with f(t, <p) w- periodic in t, similar conclusions result, with T(w,O) in place of A; and relation (8) holds, where A, is defined similarly by replacing O'(A) by W/w) logJ11 J1E O'(T(w, O))} and PA(t,) are polynomials with w-periodic coefficients. This corresponds to tFloquet's theorem for ordi- nary periodic linear systems. J1EO'(T(w, 0)) and (1/w)logJ1 are said to be the characteristic multiplier and the characteristic exponent, respecti vel y. G. Stability Problem The concept of stability can be defined and studied in the same spirit as for ordinary dif- ferential equations, and the Lyapunov second method also turns out to be effective. For instance, the zero solution of (E) with f defined on D H = [0,00) X {<pEC 111<p11 <H} for H >0 is said to be uniformly asymptotically stable if there are a constant ex >0 and positive func- tions 15(1:) and 0'(1:) of I: > 0 such that any solu- tion x(t) of (E) satisfies Ix(t)1 <I: as long as x(t) exists, whenever flx,11 <15(1:) and tr or Ilx,ll.s;ex and t  r + 0'(1:) for r  O. In the above definition, if f is completely continuous on D H and if 0<1:< H, then the phrase "as long as x(t) exists" is redundant, and (9) is equivalent to the claim that x(t) exists for all t  rand Ix(t)1 < c. Under the Lipschitz condition and the complete continuity on f in (E), the zero solution of (E) is uniformly asymptotically stable if and only if there exists a continuous R-valued function (Lyapunov function) V(t, <p) defined on D H for 0<H1.s;H such that 1 (i) a( I <p(O)l).s; V(t, <p).s; b( II <p II), (ii) D+ V(t,x,).s; -cV(t,x,) as long as (t, X,)E D H1 for a solution x(t) of (E), where a(r), b(r) are continuous functions with a(r) > 0 for r > 0, b(O) = 0, c > 0 is a constant, 634 and D + denotes the upper-right Dini deriva- tive. Furthermore, the Lyapunov function can be endowed with a Lipschitz condition I V(t, <p) - V(t, 1/1)1.s; L111<p- 1/111 for some constant Ll if f in (E) is uniformly Lipschitzian, that is, the coefficient L is constant over the domain. The Lipschitz condition together with (ii) makes it possible for a solution y(t) of the perturbed equation y(t)= f(t,y,) + P(t, y,) (10) to satisfy D+ V(t,y,).s; -cV(t, y,)+ L1IP(t, y,)I. By using this fact, the stability of (1 0) can be discussed. However, for functional differential equa- tions it is quite a difficult problem to con- struct a suitable Lyapunov function for a given (E). Many of the attempts to improve the suffi- ciency condition that have been made up to now are such that a stability property can be verified by means of a simple Lyapunov func- tion. The main effort has been devoted to replacing condition (ii) by another type of condition. One of them is (ii*) D+V(t,x,).s; -c(lx(t)l) under the uniform boundedness of f( t, <p), where c(r) is a continous function with c(r) > 0 for r > O. Another one is for the case when V(t, <p) = W(t, <p(0)), where W(t, x) is defined for (t, x) E R x R n , In this case, (ii) can be replaced by (9) (ii**) D+V(t,x,).s; -cV(t,x,) whenever V(t +S,X'H).s; F (V(t,x,)) for SE [ - h, OJ, where c> 0 is a constant and F(r) is a continuous function satisfying F(r) > r for r > O. The condition (ii**) was given by B. S. Razumi- khin and provides an easier way to construct a Lyapunov function. For a linear system, uniform asymptotic stability implies II T(t, r)ll.s; 1/15(1) for t r and II T(t, r) II .s; 1/2 for t  r + 0'(ex/2) + h. These facts together with (6) show that the zero solu- tion is exponentially stable, that is, Ix(t)l.s; Ke-V('')llx,11 for tr, where v = (log2)/(0'(ex/2) + h), K = 2/15(1). From (8) it follows that the zero solution of (AL) is uniformly asymptotically stable if and only if {).EO'(A) I RdO} = 0. (11 ) H. Equations of Neutral Type To deal with an equation of neutral type, such as x(t)=f(t, x" x,), (12) 
635 a phase space such as C I = C 1 ([ - h, OJ, R n ) of continuously differentiable functions should be chosen. The solution of (E) will become smooth as time elapses if f in (E) is sufficiently smooth. However, this property cannot be expected for (12), and the solution of(12)-(5) need not belong to C l even if ECI without a specific condition on ( such as (O) = f(r, (, ). Such a restriction on initial functions is an obstruction to the development of the general theory. Equations of the form d dt D(t, x,) = f(t, x,) cover a fairly general class of equations of neutral type, where D(t. <p) and fit. <p) are defined on an open set Dc R x C and a solu- tion of (N)-(5) means a continuous function x(t) defined on [r - h, a), a> r, which satisfies (5) and D(t,x,)=D(r,O+ ff(S,X.)dS on [r,a). (E) corresponds to the case where D(t, <p) = <p(0), while x(t)= G(t, x,) + g(t, x,) is reduced to the form (N) if G(t, <p) is linear in <p and continuously differentiable with respect to t by setting D(t, <p) = <p(0) - G(t, <p) and f(t, <p) = g(t, <p)-(ajat)G(t, <p). A continuous function D(t, <p) linear in <p is said to be atomic at 0 if in the representation D(t, <p) = fh [d s J1(t, s)J <p(s) (see eq uation (7)) P( t) = J1( t, 0) - J1( t, - 0) exists and is nonsingular, which is equivalent to D(t, <p)= P(t)<p(O) + fh [d s J1o(t, s)J <p(s) (13) with a nonsingular matrix P(t) and a matrix !lo(t, s) whose total variation with respect to s on [ -0', OJ tends to 0 as 0'-->0 locally uni- formly in t. A nonlinear function D(t, <p) is said to be atomic at 0 if D(t, <p) has a continuous tFrechet derivative D",(t, <p) with respect to <p and D",(t, <p) is atomic at O. In equation (N), D(t, <p) is always assumed to be atomic at 0, and many of the fundamental theorems for (E) are also valid for (N). Let T(t) be the operator solution of the autonomous linear equation d diD(x,) = f(x,). Then {T(t) },:;>o is a strongly continous semi- group of bounded linear operators, and the corresponding tinfinitesimal generator A has the domain D(A) = {<p E CI cP E C and D( cp) == f(<p)}. The properties of the spectra O'(A) are the same as for (AL), except that the character- 163 I Functional-Differential Equations istic equation is now given by de{),(p+ fh e As dJ1o(s))- fh e AS d 11 (S)]=O and the property (b) is true if ex > aD, where D(<p), f(<p) are assumed to be of the forms (13) and (7), respectively, without the argument t, and where aD is given by aD==max{Re).jde{p+ fh e AS d J1 o (.S)J=o}. (N) Similarly, the decomposition C= P,+ Q, is possible if ex > aD' Hence, to obtain the uniform asymptotic stability 0f the zero solution of (14), the condition aD<O should be assumed in addition to (11). The linear function D( <p) is said to be stable if aD<O. If D(<p) is stable, then the Lyapunov method is also applicable, as a sufficient condition, to the stability problem of (N) with linear D(<p) instead of general D(t, <p), but in this case condition (i) for V(t, <p) should be replaced by a( I D( <p)I).s; V(t, <p).s; b( II <p II). I. Infinite Retardation There are some basic differences between cases of finite retardation and of infinite retardation. For example, in the definition of stability, the inequality Ix(t)1 < c in (9) can be replaced by II x, II < c with no difference in the case of finite retardation, but this replacement yields a different concept deeply connected with the choice of the phase space in case of infinite retardation. There are several ways to define a phase space for a functional differential equation with infinite retardation. One of those phase spaces generally used is a linear space X of functions: (-00, OJ -->Rn with a semi norm II' Ilx such that if a function x: ( -00, a)-->R n satisfies x, E X and it is continuous on [r, a J, then (i) X,EX for all tE [r,a), (ii) x, is continuous as a function [r,a)-->X, (iii) mlx(t)l.s; Ilx,llx.s; K maxSE[,.t] Ix(s)1 +M(t-r)llx,llx, (14) where m and K are positive constants and M(t) is a continuous function. If X satisfies the foregoing conditions and if f(t, <p) is continuous on an open domain Dc R x X, then the local properties (a)-(c) in Sec- tion D hold, and so does (d) when D = R x X. On the other hand, if X has a fading memory, namely, (iv) M(l).s; Me-I" in (iii) for positive con- stants M, J1, 
163 Ref. Functional-Differential Equations then the procedure in Section F is applicable by restricting O'(A) to O'(A)== {),E O'(A) I ReA.> - J1} and, hence the zero solution of (AL) with infinite retardation is uniformly asymptotically stable under (11). ForO.s;h.s;oo andO.s;y<oo, both the space Ch,y of continuous functions <p:( - h, OJ-->R" with a finite limit lims -h e YS<p(s) and the space Mh, y of measurable functions <p: ( - h, OJ --> R n with SheYSI<p(s)lds< 00 satisfy the foregoing conditions, where the norms are given by II <pIlch Y = SUP(-h,OJ e YSI<p(s)1 and II <pIIM h " = I <p(0) I + f_heYSI<p(s)tds. These spaces have a fading memory if h < 00 or y>O. In much of the literature in which the equa- tion of infinite retardation has a form like (2) or (4), the space CB of bounded continu- ous functions or the space Co of continuous functions with compact supports are suffi- cient as a phase space under the norm II <P II = sup,,;ol<p(s)l. However, it is to be noted that Co is not complete when CB does not satisfy condition (ii). When CB is chosen as the phase space, in order to show the existence theorem, f(t, x,) should be continuous for any bounded continuous function x in addition to the cont- inuity in (t, <p), This condition is satisfied if I, == [g(t), tJ in (3) for a continuous function g(t).s; t. Equation (2) or (4) with O.s; ..1. < 1 give rise to such a case, but g(t) must be equal to 0 in (2) and (4) with ..1.=0. If g(t)--> 00 as t-->oo, the Razumikhin condition (ii**) D+ V(t,xl).s; -cV(t, x,) whenever V(s, xs).s; F (V(t, Xl)) (SE [g(t), tJ) for a Lyapunov function is effec- tive, but one can conclude only that x(t)-->O as t --> 00 without uniformity. References [1] V. Volterra, Sur la theorie mathematique des phenomenes hereditaires, J. Math. Pures Appl., 7 (1928), 249-298. [2J V. Volterra, Theorie mathematique de la Lutte pourla Vie, Gauthier- ViIlars, 1931. [3J N. Minorsky, Self-excited oscillations in dynamical systems possessing retarded actions, J. Appl. Mech., 9 (1942), 65-71. [4J A. M. Zverkin, G. A. Kamenskii, S. B. Norkin, and L. E. El'sgol'ts, Differential equa- tions with retarded arguments (in Russian), Uspekhi Mat. Nauk., 17 (1962), 77-164. [5J A. D. Mishkis, Lineare Differentialglei- chungen mit nacheilendem Argument, Deut- scher Verlag Wiss., 1955. (Original in Russian, 1957.) [6] R. D. Driver, Existence and stability of solution of a delay-differential system, Arch. Rational Mech. Anal., 10 (1962), 401-426. [7J N. N. Krasovskii, Stability of motion, Stanford Univ. Press, 1963. (Original in Rus- sian, 1959.) 636 [8] R. Bellman and K. L. Cooke, Differential- difference equations, Academic Press, 1963. [9] L. E. El'sgol'ts and S. B. Norkin, Introduc- tion to the theory and application of differen- tial equations with deviating argument, Aca- demic Press, 1971. (Original in Russian, 1963.) [lOJ A. Halanay, Differential equations: Sta- bility, oscillations, time lags, Academic Press, 1965. [11] M. N. Oguztoreli, Time-lag control sys- tems, Academic Press, 1966. [12J T. Yoshizawa, Stability theory by Lia- punov's second method, Math, Soc. Japan, 1966. [13J V. Lakshmikantham and S, Leela, Dif- ferential and integral inequalities II, Academic Press, 1969. [14] J. K. Hale, Theory offunctional differen- tial equations, Springer-Verlag, 1977. [15] J. K. Hale and J. Kato, Phase space for retarded equations with infinite delay, Funk- cial. Ekvac., 21 (1978), 11-41. 164 (XII.18) Function Algebras A. Definition [1-3J Let C(X) be the tBanach algebra of all con- tinuous complex-valued functions on a com- pact Hausdorff space X with pointwise oper- ations and tuniform norm. A function algebra (or uniform algebra) on X is a closed sub- algebra A of C(X) containing the constant functions and separating the points of X, i.e., for any x, YEX, x # Y, there exists fEA with f(x) # f(y). If <Px, XEX, denotes the evaluation mapping f --> f(x) of A, the correspondence X--> <Px is a homeomorphism of X into the tmaxi- mal ideal space \JJl(A) of A. Since J(<px) = f(x) for any x E X and f E A, the tGel'fand represen- tation of A is an isometric isomorphism. By identifying x with <Px, we regard X as a closed subset of \JJl(A) and the Gel'fand transform j, j E A, as a continuous extension of f to \JJl(A). B. Examples [1, 3,4] For a compact plane set K let P(K) (R(K)) be the subalgebra of all functions in C( K) that can be approximated uniformly on K by poly- nomials (rational functions with poles off K). A(K) denotes the subalgebra of all functions in C(K) that are analytic in the interior of K. These are function algebras on K and P(K)c;; R(K)c;;A(K). When K is the unit circle T== 
637 {I zl = I}, P(T) is called the disk algebra, the most typical concrete example. The theory of function algebras emerged from attempts to solve, by means of functional analysis, certain problems in complex analysis, especially problems of uniform approximation, e.g., when does A(K)==P(K) or A(K)=R(K) hold? Another important example is given by the algebra Hoo(U) of all bounded analytic func- tions on a bounded plane domain U with the supremum norm Ilflloo =sup{lf(z)11 ZE U} [5]. Since the Gel'fand representation is an iso- metric isomorphism of Hoo(U) into the algebra C(\JJl(Hoo(U))), Hoo(U) is viewed as a function algebra on the space \JJl(Hoo(U»). Further examples result if we take K or U in a Rie- mann surface or in the complex n-space, We list a few abstract function algebras that reflect certain relevant properties possessed by concrete examples. A is called a Dirichlet (resp. logmodular) algebra if the set Re A = {ReflfEA} (resp.logIA-11={loglfIIJJ-IE A}) is dense in CR(X), the space of all con- tinuous real-valued functions on X. It is called hypo-Dirichlet if the closure of Re A has finite codimension in CR(X), and the linear span of loglA -II is dense in CR(X). In the following, A denotes a function alge- bra on X unless otherwise specified. C. Boundary and Representing Measure [1-4] A subset E of X is a boundary for A if for any fEA there exists XEE such that If(x)l= Ilfll. A closed boundary is a boundary closed in X. G. E. Shilov proved that there is a smallest closed boundary, aA, which is called the Shilov boundary for A. A positive Borel measure J1 on X is a representing measure for <pE\JJl(A) if f(<p) = S f(x) dJ1(x) for all f EA. Each <p E 9Jl(A) has a representing measure supported by aA. The Choquet boundary, c(A), consists of all XE X such that the evaluation <Px at x has a unique representing measure. Then c(A) is a bound- ary, whose closure is aA. If X is metrizable, c(A) is a G set in X and supports a represent- ing measure for every member of \JJl(A). For <pE\JJl(A), M", denotes the set of repre- senting measures for <po It is a tweak* com- pact convex subset of the space of measures on X. M", is a singleton if A is Dirichlet or log- modular. It is finite-dimensional if A is hypo- Dirichlet. The case dimM", < +00 has been studied in detail. Extensive studies for the case dim M", = +00 have been done only for con- crete examples related to polydisks, infinitely connected domains, etc. The notion of boundary reflects the tmaxi- 164 F Function Algebras mum modulus principle for analytic func- tions, and representing measures come from tPoisson's integral formula. The most relevant maximum principle is H. Rossi's local maxi- mum modulus principle: For any closed set K in \JJl(A), we have IIJIIK= IIJllbdKu(Kni'A) for all f E A, where bd K is the topological boundary of K and IIJlls=sup{IJ(s)11 SES}. A corre- sponding result for function spaces was con- sidered by Y. Hirashita and 1. Wada. Closely related to representing measures are the or- thogonal measures for A, i.e., complex Borel measures J1 on X such that Sf dJ1 = 0 for all f E A; they are often useful in studying func- tion algebras by means of the duality tech- nique. The set of orthogonal measures is denoted by A 1.. D. Peak Sets [1-3] A subset K of X is a peak set for A if there exists f E A such that f(x) = 1 for x E K and If(y)1 < 1 for YE X - K. K is a generalized peak set if it is the intersection of peak sets. A point x E X is a (generalized) peak point if the set {x} is a (generalized) peak set. The set of generalized peak points equals the Choquet boundary for A. If X is metrizable, the peak sets and generalized peak sets coincide. There exist X and A such that X is metrizable: X == \JJl(A)=c(A), but A#C(X) (B. Cole). A sub- set E of X is interpolating for A if for any bounded continuous function u on E there exists f E A with fiE = u. Then a closed G set K in X is an interpolating peak set if and only if J1EA1. implies 1J1I(K)==O (E. Bishop). E. Antisymmetric Decomposition [1-3] A subset F of X is a set of antisymmetry for A if every function in A which is real-valued on F is constant on F. Bishop's anti symmetric de- composition then appears as an extension of the tWeierstrass-Stone theorem on uniform approximation. It is a refinement of Shilov's decomposition and reads as follows: If {E,} is the family of maximal sets of anti symmetry for A, it is a partition of X into generalized peak sets such that f E C(X) with fl E, E A I E, for all CJ. belongs to A. An interesting connection was found by 1. Tomiyama between the maximal antisymmetric decomposition of X relative to A and that of 9Jl(A) relative to A. F. Parts and Analytic Structure [1-4] A. M. Gleason defined an equivalence relation  in \JJl(A) by setting <Pif1 ifsup{lj(<p)- J(if1)llfEA, Ilfll < I} <2. Each equivalence class for this relation is a part (or a Gleason 
164G Function Algebras part) for A. Two points <p, i/J belong to the same part if and only if there exist mutually absolutely continuous representing measures Jl for <p and v for i/J such that c- I <dJ1/dv<c for some constant c>O (Bishop). An analytic structure in \JJl(A) is a pair (V, r), with an tana- lytic set V in some open subset of C n and a nonconstant continuous mapping r: V -->\JJl(A) such that J 0 r is analytic on V for all f EA. For such a structure, T(V) is always within a part. But there can be a nontrivial part with no analytic structure, as was shown by G. StolLen berg. A topo1ogi<.:a1 chara<.:teriLation of parts was obtained by J. Garnett. Sample results in the positive direction are: (1) If the ideal A", = {I E A IJ(<p) = O} is finitely generated, there is an analytic structure (V, r) such that r is a homeomorphism of V onto an open neigh- borhood of <p (Gleason); (2) if <p has a unique representing measure and if the part P of <p is not a singleton, there is a bijective continuous mapping r of the open unit disk onto P such that J 0 r is analytic for f E A (J. Wermer, K. Hoffman and G. Lumer); (3) if A is hypo- Dirichlet and if a part P is not a singleton, P can be made into a I-dimensional tanalytic space so that each fE A is analytic on P (J. Wermer and B. V. O'Neill). Analytic structures in tpolynomially convex hulls of curves in C n have also been studied [3,4]. G. Abstract Function Theory [1,6-9] Let <pE':UI(A), and choose mE M"" which is fixed. The generalized Hardy class Hp(m), 0 < p.s; 00, associated with A is the closure (weak* closure, if p= 00) of A in the tL p space Lp(m) on the measure space (X, m). Under suitable restrictions on A, <p, or m, we can recapture some of the important classical facts, most of which have their origins in the works of A. Beurling, R. Nevanlinna, F. and M. Riesz, and G. Szego. In this area, H. Helson and D. Lowdens1ager came up with a powerful method using orthogonal projections in Hil- bert space and gave together with S. Bochner's remark, a strong influence for subsequent development. The modification argument was then devised by Hoffman and Wermer, in- spired by F. Forelli. After Hoffman's detailed study of log modular algebras, Lumer observed that most results remain valid when <p has a unique representing measure. T. P. Srini- vasan and J.-K. Wang ( [8]) introduced the notion of weak * Dirichlet algebra and showed that some major theorems are mutually equiv- alent and are in fact measure-theoretic, With the Hoffman-Rossi complement, their result now states that for fixed mE M", the following are equivalent: (i) A + if is weak* dense in 638 Loo(m), i.e., A is a weak* Dirichlet algebra on (X, m); (ii) if J1 E M", is absolutely continuous with respect to m, then J1 == m; (iii) if a closed subspace M of L2(m) is simply invariant in the sense that A",M!:; M and A",M is not dense in M, then M ==qH 2 (m) with qELoo(m), Iq[ == 1 a.e.; (iv) the set logIHoo(m)-ll coincides with Loc (m; R), the set of real-valued elements in Loo(m); (v) for every WE Ldm), W ;;:,0, we have inf{J\l- fI2wdmlfEA",} ==exp(Slogwdm); (vi) the linear functional h-->S hdm on Hoc(m) has a unique positive extension to Loo(m). Further extensions were subsequently made by use of the conjugation operator ( Section K) by T. Gamelin, H. Konig, Lumer, K. Yabuta, and others. Some more properties of weak* Dirichlet algebras have been obtained by T. Nakazi. H. Generalized Analytic Functions [1,10] Let r be a dense subgroup of the additive group R of the rea1s with discrete topology, and let G be the tcharacter group of r. Each aE r, as a character of G, defines a continuous function Xa on G. Let A be the closed sub- algebra ofC(G) generated by {XalaEr,a;;:,O}. A is a Dirichlet algebra but is far more difficult to describe than the disk algebra. The study of this algebra, especially that of invariant sub- spaces, has evolved from papers by Helson and D. Lowdenslager. Let (J be the tHaar measure of G and H 2 ((J) the closure of A in L 2 «(J). A closed subspace M of L 2 «(J) is called invariant ifXaM!:;M for all aEr, a;;:'O. M is called doubly invariant if XaM!:;M for all a E r. Otherwise, it is called simply invariant. In fact, only the latter i interesting. Let e, E G with t E R be the character of r defined by e,(a) = ei'a. Then the mapping t -->e, is a faith- ful representation of R into G. A cocycle on G is defined to be a Borel function B on G x R such that(i) I B(x, t)1 = 1, (ii) B(x + e" t)= B(x,s) B(x,s+t) for XEG and s, tER. Two cocycles are identified if they differ only on a null set in G x R. For a cocycle B, let M B be the set of fEL 2 ((J) such that B(x, t) f(x+ e,)EH 2 (dt/(1 +t 2 )) for almost all XEG, where H 2 (dt/(1 + t 2 )) is the closure, in the space L2(dt/(1 + t 2 )) on R, of the set of boundary value functions on R of bounded analytic functions on the upper half-plane. Then the mapping B-->M B is a bijection from the set of cocycles onto the set of simply invariant sub- spaces M of L 2 «(J) such that M=n{XaMlaE r,a<O}. Moreover, M B =qH 2 «(J) for some qE Loo((J), Iql = I a.e. if and only if B(x, t) = q(x) q(x + e,), i.e., B is a coboundary. When r l' R, there is a co cycle that is not a coboundary. Further studies have been done by Helson, 
639 Gamelin, 1. Tanaka, and others. On the other hand, F. Forelli observed that tflows in com- pact spaces give rise to a kind of analyticity. In the above case, the algebra A consists of all fEC(G) that are analytic with respect to the flow T,(x)=x+e, for XEG and tER. General- ized analytic functions induced by flows in general have been studied by Forelli and P. S. Muhly. I. The Unit Disk [1,6,7,11,12] Let A be the disk algebra P(T), D the open unit disk {I zi < I}, and m the normalized Le- besgue measure on T. The algebra A has been the most important model in the theory of function algebras, and we find here the origin of many abstract results, Some typical results: (i) every orthogonal measure for A is abso- lutely continuous with respect to m (F. and M. Riesz); (ii) a closed set E in T is an interpolat- ing peak set for the tGel'fand transform A of A if and only if m(E)=O (W. Rudin and L. Carleson); (iii) A is maximal among the closed subalgebras of C(T) (Wermer); (iv) a function f E C(C1 D) belongs to A if f is analytic at every ZED with f(z)#O (T. Rad6). The generalized Hardy class Hp(m), O<p.s; 00, associated with A is viewed as the set of nontangential boundary value functions of elements in the classical tHardy class Hp(D). Here we find the origin of invariant subspace theorems: A closed subspace M (# {O}) of H 2 (m) is invariant, i.e., AM <;; M, if and only if M=qH 2 (m) with qEHoo(m), Iql= 1 a.e. (Beurling). The algebra Hoo = Hoo(m) is a weak* Diri- chlet algebra, whose Shilov boundary is identi- fied with the maximal ideal space X of Loo(m). We have Loo(m; R)==logl(H OO )-II, and afortiori Hoo is logmodular on X. The mapping Z--><Pz embeds the disk D in \JJl(H oo ) as an open set. The structure of \JJl(Hoo) was studied in detail by I. 1. Schark, Hoffman, and others. We finish with three remarkable results: (i) D is dense in \JJl(H oo ) (Carleson). This is the corona theorem and was proved in the following equivalent form: For any f"... ,fnE Hoo(D) with If I I + ... + If1 ;;:,£>0 on D, there exist gl,'" ,gnEHoo(D) with fig I +... + J"gn= 1. A simple proof was discovered by T. Wolff [12]. (ii) The convex combinations of tB1aschke products are uni- formly dense in the unit ball of Hoo(D) (D. Marshall) [12]. (iii) Every closed subalgebra B between Hoo and Loo(m) is a Douglas algebra, i.e., B is generated by Hoo and the complex conjugates of a family of inner functions (S.- Y. Chang and Marshall) [11]. The proof of (iii) is an interesting application of the theory of tbounded mean oscillation. 164 K Function Algebras J. Rational Approximation [1, 2J The problem of rational (polynomial) approxi- mation on a compact plane set K asks when A(K)=R(K) (A(K)= P(K)) holds. An impor- tant tool for such problems is Cauchy's trans- form of measure J1= fi.«() = S(z- () -I dJ1(z). Using this, one can show, for instance, the following: (1) A(K)= P(K) if and only if K has a con- nected complement (S. N. Mergelyan); (2) f E C(K) belongs to R(K) if each point ZEK has a closed neighborhood V with fl KnvER(K n V) (Bishop); (3) A(K)=R(K) if the diameters of the components of C - K are bounded away from zero (Mergelyan); (4) C(K)=R(K) if and only if almost all points of K are peak points for R(K) (Bishop). The last result cannot be extended much because there is a set K such that A(K)#R(K), while A(K) and R(K) have the same peak points (A. M. Davie). A complete characterization for A(K)= R(K) was obtained by A. G. Vitushkin: (5) The following are equivalent: (i) A(K)= R(K); (ii) for any bounded open set D in C, ex(D - K) = ex(D - K O ); (iii) for any Z E bd K, there exists r;;:, 1 such that Jim supoo ex((z; b) -Ko)/ex((z;rb)-K)< +00, where (z;b) is the disk {WEC Ilw-zl.s; b} and ex(E), for any bounded set E in C, is the continuous ana- lytic capacity of E, which is the supremum of 11'( 00)1 for all continuous functions f on the Riemann sphere C U { oo} such that Ifl.s; 1, f( 00) = 0, and f is analytic off a compact sub- set of E. As for uniform or asymptotic ap- proximation on noncom pact closed subsets in C, Carleman's classical study has recently been extended by N. U. Arake1yan, A. A. Nersesyan, A. Stray, and others in an interest- mg way. In connection with rational approximation, we should note detailed studies on pointwise bounded approximation in Hoo(U), U being a bounded open set in C, by O. J. Farrell, L. Rubel and A. Shields, Gamelin and 1. Garnett, and Davie. K. Further Topics (1) Conjugation operator [9, 13]. Take any mE M"" <p E \JJl(A), The conjugation operator then associates with each uEReA the unique element *u in CR(X) such that u+i*uEA and S *u dm = O. After the classical inequalities of Kolmogorov and M. Riesz, we consider the following conditions with constants c p and d p : (K) (SI*uIPdm)I/P.s; cpSlul dm for 0 <p < 1; (M) (SI*uIPdm)l/p.s;dp(Slulpdm)l/p for 1 <p< 00. Let m be arbitrary. Then the inequality (K) is valid for UE ReA, u >0. The inequality (M) is valid for all u E Re A if p is an even integer ;;:, 2; it is 
164 Ref. Function Algebras valid for all U ERe A, U > 0, if p is not an odd integer. All remaining cases have counter- examples. On the other hand, M", always con- tains an m such that log I f(<p)I.s;Jloglfl dm for fEA (Bishop). Such a representing measure is called a Jensen measure for <po If m is Jensen, (K) is valid for all uEReA and all O<p< 1, and (M) is valid for all UE Re A and all 1 < p<oo. (2) Riemann surfaces. For a compact bor- dered Riemann surface R, let A(R) be the algebra of functions, continuous on CI Rand analytic on R. Then A(R) is hypo-Dirichlet on the border bd R of R, and many results for the disk algebra are extended to A(R). Most of the basic results are described in [14]. The maxi- mality of A(R) in C(bdR) was obtained by H. Royden; extreme points of related Hardy classes were discussed by Gamelin and M. Voichick; and invariant subspaces were deter- mined by Forelli, M. Hasumi, D. Sarason, and Voichick. A further extension to infinitely connected surfaces has been obtained by C. W. Neville, Hasumi, and M. Hayashi in the case of open Riemann surfaces R of Parreau- Widom type, which is defined as follows: Let G(a, z) be tGreen's function for R, and let B(a,ex), ex>O, be the first tBetti number of the domain {zERIG(a,z»ex}; then R is of Parreau-Widom type if J B(a, ex)dex < +00. For such surfaces the situation looks favorable: For instance, the Cauchy-Read theorem is valid, and the Brelot-Choquet problem con- cerning Green's lines is solved affirmatively. As for the generalization of approxima- tion theorems of Mergelyan and Arakelyan, we refer to the work of Bishop and L. K. Kodama for compact sets and to that of S. Scheinberg for noncompact closed sets. (3) Higher-dimensional sets. Much attention has been paid to algebras of analytic functions on domains in cn, n;;:' 2, e.g., polydisks, unit balls, and general pseudoconvex domains. Po1ydisk algebras and ball algebras have been studied extensively by W. Rudin, P. Ahern, Forelli, and many others [15,16]. Approxi- mation theorems of Mergelyan type were ob- tained by G. M, Henkin, N, Kerzman, and I. Lieb for strictly pseudoconvex domains with smooth boundary and by L. Hormander and Wermer and L. Nirenberg and R. O. Wells, Jr., in the case of totally real manifolds. Further improvements have been obtained by R. M. Range, A. Sakai, and others. References [IJ T. W. Gamelin, Uniform algebras, Prentice-Hall, 1969. 640 [2] A. Browder, Introduction to function algebras, Benjamin, 1969. [3] E. L. Stout, The theory of uniform alge- bras, Bogden and Quigley, 1971. [4] 1. Wermer, Banach algebras and several complex variables, Markham, 1971. [5] T. W. Gamelin, Lectures on HOO(D), Uni- versidad Nacional de La Plata, 1972. [6] K. Hoffman, Banach spaces of analytic functions, Prentice-Hall, 1962. [7] H. Helson, Lectures on invariant sub- spaces, Academic Press, 1964. [8] F. Birtel (ed.), Function algebras (Proc. Int. Symposium on Function Algebras, Tulane Univ., 1965), Scott, Foresman, 1966. [9J K. Barbey and H. Konig, Abstract analytic function theory and Hardy algebras, Lecture notes in math. 593, Springer, 1977. [10] H. Helson, Analyticity on compact Abelian groups, Algebras in Analysis, J. H. Williamson (ed.), Academic Press, 1975, 1-62. [11] D. Sarason, Function theory on the unit circle, Virginia Poly tech. Inst. and State Univ., 1978. [12] P. Koosis, Introduction to Hp spaces, Cambridge Univ. Press, 1980. [13] T. W. Gamelin, Uniform algebras and Jensen measures, Cambridge Univ, Press, 1978. [14] M. Heins, Hardy classes on Riemann surfaces, Lecture notes in math. 98, Springer, 1969. [15] W. Rudin, Function theory in polydiscs, Benjamin, 1969. [16] W. Rudin, Function theory in the unit ball of en, Springer, 1980. 165 (11.4) Functions A. History Leibniz used the termfunction (Lat.functio) in the 1670s to refer to certain line segments whose lengths depend on lines related to curves. Soon the term was used to refer to dependent quantities or expressions. In 1718, Johann Bernoulli used the notation <pX, and by 1734 the modern functional notationf(x) had been used by Clairaut and by Euler, who defined functions as analytic formulas con- structed from variables and constants (1728) [1]. tCauchy stated (1821) [2]: "When there is a relation among many variables, which deter- mines along with values of one of them the values of the others, we usually consider the others as expressed by the one. We then call 
641 the one an 'independent variable,' and the others 'dependent variables.''' tDirichlet consi- dered a function of xE[a,bJ in his paper (1837) [3] concerning representations of "completely arbitrary functions" and stated that there was no need for the relation between y and x to be given by the same law throughout an interval, nor was it necessary that the relation be given by mathematical formulas. A function was simply a correspondence in which values of one variable determined values of another. B. Functions Today, the word "function" is used generally in mathematics in the same sense as a tmap- ping ( 381 Sets C) or, which is the same thing, a tunivalent correspondence ( 358 Relations B). But this word is sometimes used in a wider sense, to mean a general (not nec- essarily univalent) correspondence, called a many-valued (or multivalued) function; in that case a univalent correspondence is called a single-valued function. Specialists in each branch of mathematics have their respective ways of using the word, In analysis, values of a function are often considered real or complex numbers; such functions are called real-valued functions or complex-valued functions, respectively. Fur- thermore, if the domain of the function is also a set of real or complex numbers, then it is called a real function or a complex function, respectively ( 131 Elementary Functions; 84 Continuous Functions; 198 Holomorphic Functions). If the domain of a real- or complex-valued function is contained in a tfunction space, the function is often called a functional; the tdistribution is an example. In algebra we often fix a tfield, tring, etc., and consider functions whose domains and ranges are in such algebraic systems. Special names are given to functions having special prop- erties, which can be defined according to the structures of the domain and the range. For example, when both domain and range of a functionf are sets of real numbersJis called an even function if f(t) = f( - t), and an odd function if f(t) = - f( - t). A function f that pre- serves the order relation between real num- bers, i.e., such that t l <t 2 implies f(td.s;f(t2), is called a tmonotone increasing function. A mapping from a set I to a set F of func- tions, <p: I --> F, is called a family of functions indexed by I (or simple family offunctions), and is denoted, using the form fA instead of <p(A), by {fA}AEI or {lA}(AEl). In particular, if I is the set of natural numbers, the family is called a sequence of functions. 165 D Functions c. Variables A letter x, for which we can substitute a name of an element of a set X, is called a variable, and X is called the domain of the variable. An element of the domain of a variable x is called a value of x. In particular, if the domain is a set of real numbers or complex numbers, the variable is called a real variable or a complex variable, respectively. On the other hand, a letter that stands for a particular element is called a constant. When the domain and range of a functionf are X and Y, respectively, a variable x whose domain is X is called the independent variable, and a variable y whose domain is Yis called the dependent variable. Then we say y is a function of x, and write y = f(x). When a con- crete method is given by which we make a value of y correspond to each value of x, we say that y is an explicit function of x. When a function is determined only by a tbinary re- lation such as R(x,y)=O, we say that y is an implicit function of x ( 208 Implicit Functions). Given functions!, 9 with an independent variable t, suppose that y is regarded as a function of x defined by relations x= f(t), y = g(t). Then we say that y is a function of x with the variable t as a parameter. A function whose range is a given set C with variable t as its independent variable is often called a para- metric representation of C by t. If the domain of a functionfis contained in a Cartesian product set XI x X 2 X ... X X n , the independent variable is denoted by (x 1, x 2 , ... ,x n ), and f is often called a function of n variables or a function of many variables (when n;;:'2). D. Families and Sequences A function whose domain is a set I, <p: I -->X, is called a family indexed by I (or simply family), and I is called the index set. In the case <p(A)= X A ().E1), the family is denoted by {XA}AEI or {x A}(A E 1). If the range X of a function <p is a set of points, a set of functions, a set of map- pings, or a set of sets, then the family {x A LEI is called a family of points, a family of functions, a family of mappings, or a family of sets, res- pectively. If the set I is a tdirected set, the family is called a directed family. Generally, if J is a subset of I, the family {x A } Ad is called a subfamily of {x A } AEI' In particular, if I is a finite or infinite set of natural numbers, the family indexed by I is called a finite sequence or infinite sequence, respectively. Sequence is a generic name for both, but in many cases it means an infinite sequence, and usually we 
165 Ref. Functions have 1= N. Then the value corresponding to n E N is called the nth term or, generally, a term. For convenIence, the Oth term is often used as well. If each term of a sequence is a number, a point, a function, or a set, the se- quence is called a sequence of numbers, a se- quence of points, a sequence of functions, or a sequence of sets, respectively. A sequence is usually denoted by {an}' Ifit is necessary to show the domain of n explicitly, the sequence is denoted by {an}nEl' If J is a subset of I, a sequence {an} nEJ is called a subsequence of the sequence {an} nEl' And if I = N, the composite {a k .} of {an} and a sequence {k n } of natural numbers with k\ < k 2 < k3 ... is usually called a subsequence of {an}' References [lJ L. Euler, Opera omnia, ser. I, Opera math- ematica VIII: Introductio in analysin infini- to rum I, Teubner, 1922. [2J A. L. Cauchy, Cours d'analyse de l'Ecole Royale Poly technique pt. 1, 1821 (Oeuvres ser. 2, III, Gauthier-Villars, 1897). [3] G. P. L. Dirichlet, Dber die Darstellung ganz willkiirlicher Funktion durch Sinus- und Cosinusreihen, Repertorium der Physik, Bd. 1 (1837),152-174 (Werke, Bd. 1, G. Reimer, 1889, I, p. 133-160). 166 (X.5) Functions of Bounded Variation A. Monotone Functions A function (or mapping) f from an tordered set X to another ordered set Y is called a monotone increasing (monolOne decreasing) function if x\<x 2 implies f(xd.s;f(x 2 ) (Xl <x 2 implies f(xdf(X2))' (1) A monotone increasing (decreasing) function is also called a nondecreasing (nonincreasing) function. In either case, the function f is called simply a monotone function. If X and Yare ttotally ordered sets and the inequality < (» holds in (1) instead of .s; (), then f is called a strictly (monotone) increasing (strictly (mono- tone) decreasing) function. In either case, f is called simply a strictly monotone function. In particular, when X and Yare subsets of the real line R, a monotone function is contin- 642 uous except for at most a countable number of points. Hence it is tRiemann integrable in a finite interval provided that it is bounded. A continuous real function f(x) defined on an interval in R is tinjective if and only if it is strictly monotone. In such a case, the range of the function f(x) is also an interval, and the inverse function is also strictly monotone. Furthermore, a differentiable real function f defined on an interval is monotone if and only if its derivative f' is always O (monotone increasing) or always .s; 0 (monotone decreas- ing). If f' > 0 ( < 0), f is strictly monotone increasing (decreasing). B. Functions of Bounded Variation Let f(x) be a real function defined on a closed interval [a, b J in R. Given a subdivision of the interval a=xo<x\ <x 2 <... <xn=b, we de- note the sum of positive differences f(x i )- f(xi-d by P and the sum of negative differ- ences f(x;) - f(xi-d by - N. Then we easily obtain P-N=f(b)-f(a), P + N = L I f(x;) - f(xi-dI- The suprema of P, N, and P + N for all pos- sible subdivisions of [a, bJ are called the posi- tive variation, the negative variation, and the total variation of the function f(x) in the inter- val [a, b J, respectively. If any of these three val ues is fini te, then all three val ues are finite. In such a case, the function f(x) is called a function of bounded variation. Every function of bounded variation is bounded, but the con- verse is not true. The positive and negative variations n(t), v(t) of the function f(x) in the interval [a, tJ are monotone increasing func- tions with respect to t, and we have f(x)- f(a)=n(x)-v(x) (2) if f(x) is a function of bounded variation. Hence every function of bounded variation has both left and right limits at every point. A monotone function is a function of bounded variation, and the sum, the difference, or the product of two functions of bounded variation is also a function of bounded variation. Hence f(x) is a function of bounded variation if and only if it is the difference of two monotone functions. The representation (2) (representing a function of bounded variation as the differ- ence of two monotone increasing functions) is called the Jordan decomposition of the func- tion f(x). A function of bounded variation is Riemann integrable, continuous except for at 
643 most a countable number of points, and dif- ferentiable talmost everywhere. A continuous function defined on a closed interval is bounded but not necessarily of bounded variation (e.g.,j(x)=xsin(ljx) (XE (0, IJ), 0 (x = 0)). A discontinuous function may be a function of bounded variation on a closed interval (e.g., sgn(x». However, an tabsolutely continuous function, a differenti- able function with bounded derivative, or a function satisfying the tLipschitz condition is a function of bounded variation on a closed interval. The notion of functions of bounded vari- ation was introduced by C. Jordan in connec- tion with the notion of the length of curves (  246 Length and Area). C. Lebesgue-Stieltjes Integral Let f(x) be a right continuous function of bounded variation on a closed interval [a, b]. and f(x)=n(x)- v(x) the Jordan decomposi- tion of f(x). Then n(x) and v(x) are monotone increasing right continuous functions and hence define bounded measures dn(x) and dv(x) on [a, bJ, respectively ( 270 Measure Theory L (v)). The difference dn-dv of these two mea- sures is a tcompletely additive set function on [a, bJ which is often called the (signed) Lebesgue-Stieltjes measure induced by 1, written df For every function 9 integrable with respect to the measure dJ1=dn+dv, we define SEgdf to be equal to hgdn- SEgdv and call it the Lebesgue-Stieltjes integral. In this case, h(x) = S[a,bJg df is of bounded variation on [a, b J and the Lebesgue-Stieltjes measure dh(x) is denoted by g(x)df(x). If fl and f2 are of bounded variation on [a, bJ, then so is the product f = fd2' and we have df(x) = fl (x :t 0) df2 (x) + f2(X +O)dfl (x). (3) The Lebesgue-Stieltjes integral for a continu- ous integrand is often called the Riemann- Stieltjes integral, because it can be defined in an elementary way similar to the definition of the Riemann integral. The notion of bounded variation can also be defined for interval functions on Rn and set functions on an abstract space ( 380 Set Functions). References [IJ H. L. Royden, Real analysis, Macmillan, second edition, 1963. [2J W. Rudin, Real and complex analysis, McGraw-Hill, second edition, 1974. 167 B Functions of Confluent Type 167 (XIV.7) Functions of Confluent Type A. Confluent Hypergeometric Functions If some singularities of an ordinary differential equation of tFuchsian type are confluent to each other, we obtain a confluent differential equation whose solutions are called functions of confluent type. The equations that appear frequently in practical problems are the con- fluent hypergeometric differential equations d 2 w dw z dz 2 +(y-z) dz -exw=O (1 ) and related equations, Equation (1) corre- sponds to the thypergeometric differential equation for which a tregular singular point coincides with the point at infinity and is an tirregular singular point of class 1. For (1), z = 0 is a regular singular point, and a series solution (radius of convergence 00) is given by F(ex, y; z)= IFI (ex, y; z) _ _  ex(ex+l)...(ex+n-l) Z n L... (Izl < 00), (2) non!y(y+ 1)...(y+n-1) where y is not a nonpositive integer. The func- tion I FI in (2) is a tgeneralized hypergeometric function due to Barnes and is called a hyper- geometric function of confluent type or Kum- mer function. If y is not equal to a positive integer, the other solution of (1) independent of (2) is given by zl-y F(1 +ex-y, 2 -y; z) ( Ap- pendix A, Table 19.1), B. Whittaker Functions Equation (I) with w == e z / 2 z-y/2 W, y - 2ex = 2k, y2 _ 2y =4m 2 -1 reduces to Whittaker's dif- ferential equation d 2 W ( 1 k t -m2 ) -+ --+-+- W=O. dz 2 4 Z Z2 (3) If 2m is not equal to an integer, (3) has two series solutions for any finite z: Mk,m(Z)=Z(I/2)+mez/2 F(!+m- k, 1 +2m; z), M k ,_m(z)=z(I/2)-m e -z/2F(!-m-k,1-2m;z). If 2m is an integer, since the functions Mk,m and Mk,-m are linearly dependent, E. T. Whit- taker considered a solution of the form 1 ,m(z)= -- 2 . r(k+1- m )zk e -z/2 m f (o+) ( t ) k-(1/2)+m x 00 (- tr k -(1/2)+m 1 + e- t dt. 
167 C Functions of Confluent Type If k--m is equal to a negative integer, this integral does not exist. The function Zk e -z/2 f ro W; ( z ) = t-k-(1/2)+m k,m r(-k+m) 0 ( t ) k-(1/2)+m x 1 +- e-' dt z for Re(k--m)O is defined for any m, k, and for any z except when z is a negative real number. We call Mk,m and Yv",m the Whittaker functions. tBessel functions are particular cases of these functions, and the relation Z-I/2 1 n (z) = 22n+(1/2)in+(lf2)r(n + 1) M o ,n(2iz) is satisfied. In Whittaker's differential equation, since W-k,m( -z) is also a solution and Yv",m(z)/W-k,m( - z) is not equal to a constant, Yv",m(z) and W-k,m( - z) can be considered a pair of fundamental solutions ( Appendix A, Table 19.I1). C. Parabolic Cylinder Functions Putting x = (e - Yf2)/2 and y = Yf, the curves corresponding to  = constant and to Yf = constant, respectively, constitute families of orthogonal parabolas. The curvilinear coordi- nates (, Yf, z) in three dimensions are caned parabolic cylindrical coordinates. By using parabolic coordinates, separating variables in Laplace's equation into the form f()g(lJ)ei'z, and making a simple transformation, we find that f and 9 satisfy a differential equation of the form d 2 F -+(n+-iz2)F=0. dz By means of the Whittaker function Yv".m(z), a solution Dn(z) of (4) is represented by Dn(z) = 2 n /2+(1/4) z -1/2 Yv,,/2 +(1/4), -1/4( Z2). Equation (4) is caned Weber's differential equation or the Weber-Hermite differential equation, and Dn(z) the Weber function. An- other solution of (4) is D-n-I (iz) or D-n-I (- iz). The solutions of (4) are called parabolic cyl- inder functions. In particular, if n is equal to a nonnegative integer, then Hn(z)=rn/2exp(z2)Dn(J2 z) is the tHermite polynomial of degree n. Solu- tions of differential equations for harmonic oscinators in quantum mechanics are of this form. In general, suppose that three regular sin- gular points are confluent to the point at infin- ity, and that they are reduced to an irregular singular point of class 2. Suppose further that 644 there are no other singularities. Then differen- tial equations of order 2 with these conditions are transformed into the form (4), whose solu- tions are represented by parabolic cylinder functions. Differential equations of the form (4) are reduced to confluent hypergeometric differential equations if Z2 is chosen as an independent variable ( Appendix A, Table 20.III). D. Indefinite Integrals of Elementary Functions Since exponential and trigonometric functions can be represented by particular types of Kummer functions, their indefinite integrals that cannot be represented by elementary functions, e.g., tincomplete r-functions and the error function Erf z = S exp( - t 2 ) dt, can be represented by Kummer or Whittaker func- tions. They are included in a family of tspecial functions of confluent type, The functions defined by 1 f zcost C(z)= M:: ;: dt, V 2n 0 V t 1 f zsint S(z)=- -dt j2;. 0 jt are called Fresnel integrals, which are also represented in terms of the Whittaker function as C(z)-iS(z) (4) _ -I ( e 1/4 -z2/2 . ) _ 1 - ;8 2 e W- 1 / 4 ,1/4(12). Fresnel integrals first appeared in the theory of the diffraction of waves. More recently they have been applied to designing highways for high-speed automobiles. Furthermore, the functions C(u)= J: cosGS2 )dS, S(u)= J: sin(I s2 )dS (obtained by a change of variables 2 = nu 2 (2) are also called Fresnel integrals. Numerical tables are available for them. The curves x = C and y = S with a parameter 2 or u are called Cornu's spiral (Fig. 1). The functions f x dt Lix= - o logt' I x et Eix= -dt, - 00 t where a tprincipal value must be taken at t = 0 if x>O, I x sin t Six= -dt, o t f ro cost and Cix = - -dt x t 
645 Fig. 1 are called the logarithmic integral, exponential integral, sine integral, and cosine integral, or integral logarithm, integral exponent, integral sine, and integral cosine, respectively. They satisfy the relations Eix= Lie x , Eiix=Cix+iSix+(nI2)i. They have important applications: Eix in quantum mechanics, Six and Cix in electrical engineering, and Lix in estimating the number oftprimes less than x ( 123 Distribution of Prime Numbers). Lix is also denoted by lix ( Appendix A, Table 19.11). E. Stokes's Equation Consider a linear differential equation of the second order with five regular singular points including the point at infinity such that the difference of the characteristic indices at every singularity is equal to 1/2. Such equations are caIled generalized Lame's differential equa- tions. F. Klein and M. B6cher have shown that every linear differential equation that is commonly treated in mathematical physics is represented by a confluent type of generalized Lame's equation. Among these equations, if all five singularities are confluent to the point at infinity, the resulting equation is called Stokes's differential equation, which is applied to the investigation of diffraction. This is re- duced to tBessel's differential equation of order 1/3 by suitable transformations of the inde- pendent and dependent variables. References [IJ H. Buchholz, The confluent hypergeo- metric function, with special emphasis on its applications, translated by H, Lichtblau and K. Wentzel, Springer, 1969. (Original in Ger- man, 1953.) 1688 Function Spaces [2J L. J. Slater, Confluent hypergeometric functions, Cambridge Univ. Press, 1960. For the logarithmic integral, etc., [3] N. Nielsen, Theorie der IntegraIlogarith- mus und verwandter Transzendenten, Teub- ner, 1906. [4J Mathematical tables I, British Assoc. Adv. Sci., London, 1931. [5J Nat. Bur. Standards, Tables of sine, cosine and exponential integrals I, II, Washington, D. c., 1940. [6] L. Lewin, Polylogarithms and associated functions, North-Holland, 1958, revised edi- tion,1981. Also  references to 389 Special Functions. 168 (XII.6) Function Spaces A. General Remark., It is a general method in modern analysis to consider a set X of mappings of a space Q into another space A as a space ( 381 Sets) and its elements (namely, mappings of Q into A) as points of the space X, and to investigate them as geometric objects. In particular, it is impor- tant to consider the case where Q is a ttopo- logical space, a tmeasure space, or a tdiffer- entiable manifold and X is a set of real- or complex-valued functions defined on Q and satisfying certain conditions, such as continu- ity, measurability, and differentiability. Such spaces are generally called function spaces; they usually form ttopologicallinear spaces ( 37 Banach Spaces; 197 Hilbert Spaces; 424 Topological Linear Spaces). B. Examples of Function Spaces The following are important examples of func- tion spaces. Throughout this section, all func- tions are real- or complex-valued, and two functions on a measure space are identified whenever they are equal to each other talmost everywhere. (1) The Function Spaces C(Q), Coo(Q), and Co(Q). The totality of continuous functions f(x) defined on a compact tHausdorff space Q is denoted by C(Q). Let f +g and exf (ex a real or complex number) be the functions f(x) + g(x) and exf(x), respectively. Then C(Q) forms a tlinear space, Furthermore, define the norm of f by Ilfll = SUPXEf! If(x)l, Then C(Q) becomes a tBanach space since it is complete in (the 
1688 Function Spaces metric defined by) the norm. The norm is called the supremum norm or uniform norm because limnoo 11f,,- fll =0 means thatf,,(x) converges to f(x) uniformly on Q as n--> 00. Define .fg to be the function f(x)g(x). Then clearly Ilf'gll.s; Ilfllllgll. Hence C(Q) is also a tBanach algebra. Suppose that a subset R of C(Q) satisfies the following three conditions: (i) R is an talgebra over the complex number field with respect to the addition and multiplication defined above and contains the function identically equal to one. (ii) For any two distinct points x and y of Q, there exists a function fER satisfying f(x) # f(y). (iii) For any fER, there exists an f* E R such that f*(x) == f(x) on Q. Then R is dense in C(Q) with respect to the supremum norm (namely, in the sense of uniform conver- gence). This fact is known as the Weierstrass- Stone theorem (or Stone-Gel'fand theorem). A subset E of C(Q) is tprecompact (i.e., any sequence of functions in E contains a sub- sequence that converges uniformly on Q) if and only if E is tuniformly bounded and tequicon- tinuous (Ascoli-Arzela theorem). Since C(Q) is not a treflexive Banach space except for trivial cases ( Section C), precompact sets and relatively compact sets are different in the tweak topology. For important characteriza- tions of the latter sets  [2, 5]. When Q is a topological space that is not necessarily compact, the totality of bounded continuous functions on Q (denoted by BC(Q)) is also a Banach space with respect to the supremum norm Ilfll = sUPXEn If(x)l. Let Q be a loca\1y compact Hausdorff space, Then the space C(Q) of all continuous functions on Q is endowed with the topology of uniform conver- gence on the compact sets, i.e., the tlocally convex topology defined by the tseminorms SUpxEK If(x)1 as K ranges over the compact sets in Q. C(Q) is always a complete locally convex space. It is a tFrechet space ifQ is O"-compact (i.e., Q is the union of a countable family of compact sets). We denote by Coo(Q) the sub- space of all functions f(X)E C(Q) that converge to zero as x tends to infinity (i.e., given an G> 0, there is a compact set K such that If(x)1 < G for x if K). Coo (Q) is a Banach space with the norm sUPXEn If(x)l. It can be regarded as a closed linear subspace of BC(Q). The totality of continuous functions with compact support is denoted by Co(Q) or %(Q), where the support (or carrier) of a function f is the tclosure of the set {x l.fix) # O} in Q and is usually denoted by supp f If Q is not com- pact, Co(Q) is not complete with respect to the supremum norm, but when Q is O"-compact Co(Q) is complete with respect to the strongest locally convex topology with the property that for each compact set K in Q the embed- 646 ding in Co(Q) of the linear subspace %K of all functions with support in K equipped with the supremum norm is continuous. Usually %(Q) denotes the space Co(Q) equipped with this topology. (2) The Lebesgue Spaces Lp(Q) (0 < p < 00). Let (Q, J1) be a tmeasure space. We denote by Lp(Q) the totality of tmeasurable functions f(x) on Q such that If(x)IP is integrable. A function f de- fined on (Q, J1) is called square integrable if f E L 2 (Q, J1). IfQ is the interval (a, b) equipped with the Lebesgue measure, it is sometimes denoted by Lp(a,b). We define the norm Ilfll p by Ilfll = "f'lp=(L If(X)IPdJ1(X)YP. If l.s;p< 00, then Lp(Q) is a Banach space. The triangle inequality for this norm is precisely the tMinkowski inequality. IfO<p< 1, the norm no longer satisfies the triangle inequal- ity but does satisfy the quasinorm inequality lif + gllp.s; 2 1 / P - I (llfll p + Ilgll p ), and if L 11f,,11 < 00, then Lfn converges unconditionally in Lp(Q). Hence Lp(Q), O<p< 1, is a tquasi- Banach space. Iflimnoo Ilfn- fllp=O, we say that the sequence {f,,} converges to f in the mean of order p (or in the mean of power p), and write l.i.m'noof,,= f If {f,,} converges to f in the mean of order 2, we simply say that {f,,} converges to f in the mean. (The nota- tion 1.i.m. mcans the limit in the mean and is used mostly when p=2.) For any 1, gEL 2 (Q), (1, g) = Snf(x) g(x) dJ1(x) is we\1 defined, by the tSchwarz inequality, and has the properties of the tinner product. Hence, L 2 (Q) is a tHilbert space. If 1 < p < 00, then Lp(Q) is a tuniformly convex Banach space and is in particular treflexive. Deepest results on Lp(Rn), 1 < p < 00, are often derived from the Littlewood-Palay theory due to J. E. Littlewood and R. E. A. C. Paley, A. Zygmund [6], and E. M. Stein [7]. Its starting point is the inequality Apllfllp.s; Ilg(f)llp.s;Allfllp, where g(f) is the function (f rO ) 1/2 g(f)(x) = 0 I grad x u(x, tW t dt obtained from the Poisson integral u(x, t) = r(7nI!j2) L t(t 2 + Ix - yI2r(n+I)/2f(y)dy, where Ixl =(xi +... + X;)1/2. The Lp spaces, 1 <p < 00, are generalized in the following way, Let l1>(s) be a convex and nondecreasing function on [0, 00) satisfy- ing 11>(0) = 0 and l1>(s)/s--> 00 as s--> 00. Denote 
647 by Lq,(Q) (La;(Q)) the set of all functions f(x) such that 11>(lf(x)l) is integrable (l1>(klf(x) I) is integrable for some k > 0). Lq,(Q) == La;(Q) if 11>(2s).s; CI1>(s). La;(Q) is a Banach space, called the Orlicz space, under the norm Ilfll = inrfb 0 I f 11>(,{ -1 I f(x)l)dJi(x) .s; I}. Lp(Q), 1 < P < 00, is the Orlicz space for l1>(s) =sp. (3) The Function Space M(Q)=Loo(Q). Let Q and Ji be as in (2). A measurable function f(x) on Q is said to be essentially bounded if there exists a positive number ex such that If(x)l.s; ex almost everywhere on Q. The infimum of such ex is called the essential supremum of f, denoted by ess SUPXEn If(x) I. The totality of essentiaIly bounded measurable functions on Q (denoted by M(Q)) is a Banach space with respect to the norm Ilfll = Ilfll 00 = ess sUPxEnlf(x)[. If Ji(Q) < 00, then M(Q)cLp(Q) for any p>O, and Ilfll 00 =limpoo Ilfll p for any fEM(Q). From this point of view, M(Q) is also denoted by Loo(Q) even when Ji(Q) = 00. This is also the reason why the notation 11'1100 is used for the norm in M (Q). (4) The Lorentz Spaces L(p, q)(Q) (0 < p, q.s; 00). The Lebesgue spaces Lp(Q), 0 < p.s; 00, are rearrangement invariant. Namely, define for a measurable function f on a measure space (Q,Ji) the distribution function Jif(s)=Ji{XE Q Ilf(x)1 > s}, s> 0, and the rearrangement f*(t)=inf{s>OIJif(s).s;t}, t>O. Then fELp(Q) if and only if f* ELp(O, 00) and Ilfll p = Ilf*ll p . Another important class of rearrangement in- variant spaces are the Lorentz spaces L(p,q)(Q), O<p, q.s; 00 (G. G. Lorentz, 1950; R, A. Hunt [8]), which is defined to be the quasi-Banach space of all measurable functions f on Q such that Ilfll(p,q) = II tl/pf*(t)IIL: < 00, where L: is the Lq-space on (0, 00) relative to the measure dt/t. L(p, P)(Q) = Lp(Q) with equal norms. If 1 <p < 00 and l.s;q.s; 00, then L(p,q)(Q) is a Banach space under the equiva- lent norm Iit l / rl ff*(s)dsllL*' Except for these cases, L(p,q)(Q) is not eq'uivalent to a normed space in general [8]. If qO<ql, then L(p,qo)(Q) c L(p,q)Q) with continuous embed- ding. In case Ji(Q) < 00, L(Po,qo)(Q) C L(P1,q)Q) for Po> PI and any qo, q l' The Lorentz spaces play an important role in interpolation and approximation theory ( 224 Interpolation of Operators). (5) The Function Space S (Q). Let (Q, Ji) be a measure space with Ji(Q) < 00. Denote by S(Q) 168 B Function Spaces the totality of measurable functions on Q that take finite value almost everywhere. Then Ilfll = fn(lf(x) 1/(1 + If(x)l))dJi(x) for f ES(Q) has the properties of the tpseudonorm and S (Q) is a tFrechet space (in the sense of Banach) that is not locally convex in general. We have Iimnoo Ilfn - f II = 0 if and only if Iim Ji( {x Ilfn(x)- f(x)1 >B} )=0 noo for any positive number B. Convergence of this type is called convergence in measure (or as- ymptotic convergence) and is the same notion as tconvergence in probability of a sequence of trandom variables ( 342 Probability Theory). If {In} E Lp(Q) converges to f E Lp(Q) in the mean of order p, then {In} converges to f asymptoticaIly, but the converse is not true in general. If a sequence {In} ES(Q) converges to fES(Q) almost everywhere, then {In} con- verges to f asymptotically. Any sequence {In} that converges to f asymptotically contains a subsequence {In,} that converges to f almost everywhere. (6) The Sequence Spaces c, co, Ip (O<p< 00), m == 1 00 , and s. The totality c (resp. co) of se- quences x = {n} that converges (resp. con- verges to zero) as n--> 00 forms a Banach space with respect to the norm Ilxll =suplnl. Co (resp. c) is the space Coo(Q) (resp. C(Q) when Q is (resp. the one-point compactification of) the discrete locaIly compact space {1, 2, 3,...}. The sequence space Ip, O<p< 00 (resp. m=l oo ), is defined to be the spaces Lp(Q) (resp. M(Q)), where Q is the space {I, 2, 3,..., n,...}, of which each point has unit mass, while s de- notes the space S(Q), where Q = {1, 2, ..., n,... }, provided with the measure assigning mass 1/2 n to the point n. s is the set of all sequences equipped with the topology of pointwise con- vergence (s is also used to denote the space of rapidly decreasing sequences;  Section (16)). Assume that the space L 2 (Q) mentioned in (2) is tseparable and that {CPn} is a tcomplete orthonormal set in L 2 (Q). Then putting n= Lf(X) CPn(X) dJi(X) (tFourier coefficients) for any fELz(Q), we have {n} E 1 2 and LII n12 = II f11 2 . Conversely, for any {n}EI2 thereexistsanf=L:InCPnE L 2 (Q) whose Fourier coefficients are the given n (Riesz-Fischer theorem). By means of this correspondence, separable spaces L 2 (Q) and 1 2 are mutually isomorphic as Hilbert spaces. Sometimes we denote by Ip(Q) the function space Lp(Q), where Q is an arbitrary set en- dowed with the measure assigning mass 1 to each point. 
168 B Function Spaces (7) The John-Nirenberg Space BMO. A locally integrable function f(x) on Rn is said to be of bounded mean oscillation if IlfIIBMo=suPISI-Illf(X)-fsldx<oo, (1) where the supremum is taken over all (solid) spheres S in Rn, fs is the mean I SI-I Jsf(x)dx, and ISI denotes the measure of S (F. John and L. Nirenberg, 1961). The set BMO(R n ) of all functions on R n of bounded mean oscillation forms a Banach space under the norm men- tioned above if two functions f and 9 are iden- tified whenever f -g is equal to a constant almost everywhere. Condition (1) is equivalent to SUP(ISI-lllf(X)-fsIPdxYP <00 for any 1.s; p < 00. There are constants Band K > 0 such that l{xESllf(x)- fsl >A} I .s; BISlexp( - KA./llfIIBMO) for any sphere S and A. > O. BM 0 is a slightly larger space than Loo (e.g.loglxIEBMO) and has better properties. For example, the tCalder6n-Zygmund operators are bounded in BMO(Rn). We have n BMO(Rn)=Loo(R n )+ L RjLoo(Rn), j1 where Rj are the tRiesz transforms [9]. This is called the Fefferman-Stein decomposition. The Riesz transforms can be replaced by more general families of singular integral operators (A. Uchiyama, Acta Math. (1982)). (8) The Hardy Spaces Hp (0 < p < 00). The classical theory of Hardy classes ( 159 Fourier Series G) has been reconstructed by the real-analysis method and extended to higher-dimensional cases by E. M. Stein, G. Weiss, C. Fefferman, and others. According to their terminology the elements of the Hardy space Hp are (the complex linear combinations of) the real parts of the boundary values of holomorphic functions of the Hardy class. LetfEY"(R n ) be a ttempered distribution. For a cpEY'(R n ) define the radial maximal function M; f and the non tangential maximal function M;f relative to cp by M; f(x) = sup Icp, * f(x)l, '>0 M;f(x)= sup l<p,* f(y)l, !x-YI";' where CPt (x) = t -n cp(xjt) and * denotes convo- lution. Then the Hardy space Hp(Rn), O<p< 00, is defined to be the space of all tempered distributions f which satisfy the following 648 equivalent conditions: (i) M; fEL p for a cp with J cp(x)dx # 0; (ii) M;f E Lp for a cp with J cpdx #0; (iii) M; fEL p for any cp; (iv) M;fE Lp for any: cpo If a distribution f satisfies (one of) these conditions, then its Poisson integral u(x, t)= cp, * f(x) is a function, where cp(x) = const(1 + Ix1 2 ) -(n+I)/2, and its radial maximal function u+(x)= sUPt>o lu(x, t)1 and nontangen- tial maximal function u*(x)= suP!x-YI<t lu(y, t)1 both belong to Lp. Conversely if u(x, t) is a harmonic function on the upper half-space t > 0 and if ei ther u + or u* belongs to Lp, then its boundary value f = u( ',0) exists in the sense of tempered distribution and f satisfies the above conditions. Define the norm of an fEHp(Rn) by Ilu*ll p . Then Hp(Rn) becomes a q uasi- Banach space. If 1 < p < 00, then Hp(Rn) = Lp(Rn) with equivalent norms. HI(Rn) is a Banach space strictly smaller than L I (Rn), An fELl (R n ) belongs to HI (Rn) if and only if all the Riesz transforms RJ are in LdRn), and IIfllH 1 is equivalent to Ilflll +L IIRjfIII' Simi- lar characterizations of Hp(Rn) are also known for p>O (Fefferman and Stein [9J). Let Ij, j = 1, ... , m, be tproper convex open cones in Rn such that the tpolars IJo cover Rn. Then an f E Y"(Rn) belongs to Hp(Rn) if and only if there are holomorphic functions F;(x + iy) on Rn + iIJ such that sup{ II F;(' + iY)ll p I YEIJ} < 00 and f = L Fk +iIJO) (D. L. Burkholder, R. F. Gundy, and M. L. Silver- stein for n = 1 and L. Carleson for n> 1). Let 0 <p.s; 1. A measurable function a on Rn is said to be a p-atom if there is a sphere S such that suppa c Sand Iiall w.s; ISI I/p and if J a(x)x' dx =0 for all multi-indices ex with lexl"';n(p-I-I). Here a multi-index ex is an n-tuple (exl>"" exn) of nonnegative integers, lexl = ex l +... +ex n and x' = XI" ... x:". A distribution f belongs to Hp(R n ), 0 < p.s; 1, if and only if there are a sequence of p-atoms a j and a sequence of numbers A j ;' 0 in Ip such that f = L Aja j in the sense of distributions, and the norm IlfllH p is equivalent to the infimum of IIAjill (R. R. Coif- man for n = 1 and R, H. Latter fo: n > I), The theory of Hp and BMO has been generalized to more general situations ( Coifman and Weiss, Bull. Amer. Math. Soc. (1977)). From now on we assume that Q is a domain in the n-dimensional Euclidean space Rn (or more generally a differentiable manifold). D' stands for D I "... D:", where Dj=ojox j . (9) The Function Spaces C1(Q) and q(Q) (l = 0, 1,2, ... , 00). The totality of I-times continu- ously differentiable functions in Q (namely, differentiable functions of tclass C 1 in Q) is denoted by C1(Q). We say that a sequence {I,} of functions in C1(Q) converges to 0 in C1(Q) if ID'f,(x)1 converges to 0 uniformly on 
649 every compact subset of Q for every ex satisfy- ing O.s; lexl.s;1 (O.s; lexl < 00 ifl= 00), C/(Q) is a Frechet space. The totality of functions in C/(Q) whose supports are compact subsets of Q is denoted by q(Q). We say that a sequence {Iv} of functions in q(Q) converges to 0 in Cb(Q) if supp fv (v = 1,2,.,. ) is contained in a compact subset of Q independent of v and {fv} converges to 0 in C/(Q). Cb(Q) is an t(LF)- space, When Q is a locally closed set in Rn (or a differentiable manifold with boundary), we denote by C/(Q) the totality of functions f(x) on Q together with their continuous formal derivatives D'f(x), lexl.s;1 (Iexl < 00 if 1= 00) on Q such that for every ex and m with lexl.s;m.s; 1 (m<oo ifl=oo), D'f(x)- L D'+Pf(y)(x- y)P/[3! Ipl<;m-I'1 =o(lx- ylm-I'I) locally uniformly in Q as Ix - YI tends to 0, Convergence in C/(Q) is defined in the same way as above. If Q is a closed set in Rn with a finite number of connected components in each of which two points x and yare connected by an arc of length .s; Clx - yl with a constant C indepen- dent of x and y, then every f in C/(Q) can be extended to an 1 in C1(R n ) (Whitney's extension theorem). (10) The Lipschitz Spaces N. Let s>O, and let k be the least integer greater than s. The Lipschitz (or HOlder) space N(R n ) is the totality of functions f(x) on Rn which satisfy IlfllAS=sup l t (  ) ( -1)jf(X+jy) I/ IYls< 00. x,y JO } When s < 1, this is exactly the tLipschitz (or Holder) condition of order s. But when s = 1, it is strictly weaker than the tLipschitz condition. A function f E;\ I is said to be smooth in the sense of A. Zygmund [6]. Suppose O<h<s is an integer. Then a function f belongs to N if and only if it is h times continuously differenti- able and all the derivatives D'f of order hare in Nh. N(R n ) is a Banach space of functions modulo the polynomials of degree .s; k-1. Suppose that l.s; q < 00 and f is a measur- able function on R n such that SPir(ISI-l-(sq/n) is If(x)-P(xWdx YN < 00, where the supremum is taken over all spheres in R n and the infimum over all polynomials of degree .s;s. Then f(x) is equal to a function l(x) in N(Rn) almost everywhere and the su- premum is equivalent to the norm 11111 AS. Conversely every f E N(R n ) satisfies the above inequality (S, Campanato). 168 B Function Spaces (11) The Sobolev Spaces W(Q), H/(Q), and Hb(Q) (1=0, 1,2,..., l.s;p.s; 00 or -00 <I< 00, 1 < p < 00). Let I;); 0 be an integer and 1 .s; p.s; 00. The Sobolev space W(Q) is the totality of functions f(x) such that for all ex satisfying lexl.s; I, the derivatives D' f(x) in the sense of distribution ( 125 Distributions and Hyper- functions) belong to Lp(Q) with respect to Lebesgue measure in Q, W(Q) is a Banach space with the norm Ilfll = Ilfllw = ( L r I D'f(X)IPdx ) I/P. O<;I'I<;/Jn Clearly WpO(Q) = Lp(Q), Wi(Q) is a Hilbert space with respect to the inner product (I, g) = r L D'f(x)' D'g(x) dx, In 0<;1'1<;/ Sometimes Wi(Q) is denoted by H/(Q). Its closed linear subspace obtained as the comple- tion of CQ'(Q) is denoted by H6(Q). We have Hg(Q) = W 2 0 (Q) = L 2 (Q). However, ifl;); 1, we have Hb(Q) c Wi(Q), and identity does not hold unless Q = R n , The definition of Sobolev spaces has been extended to those with fractional and also negative order -00 < s < 00 in many different ways. When 1 < p < 00, it is natural to define W;(Rn) to be the space of all tempered distri- butions f on Rn such that (1- )'/2f =!F -1«1 + 112)'/2 !Ff)ELp(Rn). If s>O, then W;(Rn) thus defined coincides with the space of all f E Lp(Rn) whose Poisson integral u(x, t) satisfies II (LOO t 4k - 2d Wu(', tW dt Y lip < 00 for some (and any) integer k>s/2. The Poisson integral can be repla.«ed by other regularizations, and thus the definition of Sobolev spaces of fractional orders is ex- tended to arbitrary open set Q with the cone condition (T. Muramatu [13J), Here Q is said to satisfy the cone condition if there are a bounded and uniformly continuous mapping \{J:Rn-->Rn and an B>O such that for any xEQ the convex hull of the B-ball with center at x + \{J(x) and {x} is included in Q. (12) The Besov Spaces B,q (-00 <s < 00, l.s;p, q.s; 00). The effort to make the Sobolev em- bedding theorem [lOJ more precise led S. M. Nikol'skii and O. V. Besov [11] to the other classes of "Sobolev spaces" of fractional order. Let s > 0 and 1 .s; p, q.s; 00, The Besov space B,q(Rn) is the totality of functions f ELp(Rn) such that IflB,q =(fIIJo G)< -l)jf(' +jy)II: ly1:'+ n r q <00 
168 B Function Spaces for some (and any) integer k > s. In terms of the Poisson integral an f E Lp(Rn) belongs to B,q(Rn) if and only if (IOO t(2k-S)q111 kU(', t) II: dt YP < 00 for some (and any) integer k > s/2. The first definition is obviously extended to an arbi- trary domain Q in R n . If Q satisfies the cone condition, then the functions in B,q(Q) are characterized similarly as above by using a suit- able regularization u(x, t) of f(x) (Muramatu [13J). Let Q be a domain with the cone condi- tion. When 0 < h < s is an integer, an f belongs to B.q(Q) if and only if fE W;(Q) and all the derivatives D'f of order h are in B:(Q). B;.q(Q) is a Banach space with the norm Ilfll p + IfIB' . If P = 2, then B 2(Q) coincides with the Sob'olev space Wi'(Q): But in the other cases, B.q(Q) is different from W;(Q) for any q. However, BjQ) or B. oo(Q) is often called a Sobolev space of order s and denoted by W;(Q). Clearly we have B"oo, 00 = Loo n N. The Besov space B,q(Q) of order s.s; 0 is de- fined to be the totality of distributions f which can be represented as f = LI'I k D'J. with f, E B:(Q) for some (and any) integer k such that s + k > O. The norm is defined to be infL,IIJ.IIB'+k(Q)' In terms of the Poisson in- tegrals or oPiher regularizations the same char- acterization holds as above. If Q is a domain in Rn with the cone con- dition, then the restriction B.q(Rn)--> B,q(Q) (resp. W;(Rn)--> W;(Q) for 1 < p < 00) is a bounded linear surjection with a bounded right inverse [13]. From now on we denote by B,q, etc., B;,q(Q), etc. for a domain Qc: Rn with the cone condition. If q < r, then B,q c: B;,,. If 1 .s; p.s; 2, then B,pC: W; c: B.2' If 2.s; p";;;; 00, then B.2 c: W; c: B, p' If s > t, then B, 00 c: B, I' Sobolev-Besov embedding theorems: (i) Let p <p' and s-n/p=s'-n/p'. Then B,qc:B',q, B, p' c: W;:, W p ' c: B" p and, if p' < 00, then W; c: W;:. (ii) Let O<s<n/p and n/p'=n/p-s. Then B,q c: L(p',q) and W; c: L(p',p)( c: L p ')' (iii) Let s = nip. Then B, I c: BC for any p and B, 00 c:BMO for p< 00. (iv) Let O.s;n' <n and 0< s' = s - (n - n')/p. Then there is a bounded trace operator Tr: B,q(Rn)-->B',q(Rn) that extends the restriction mapping Y(Rn)-->Y(R n } Tr is surjective and has a bounded linear right inverse [11-13]. (13) The Function Spaces £ZI, 8, £ZI L , :18, and Y. The spaces of infinitely differentiable functions Co(Q) and COO(Q) are also denoted by £ZI(Q) and 8(Q), respectively. The totality of func- tions f(x) in COO(Q) such that, for all ex, D'f(x) belongs to Lp(Q) with respect to tLebesgue measure is denoted by £ZIL/Q). The neighbor- 650 hood 11,., of 0 in £ZILp(Q) is defined to be the totality of functions f(x) such that IID'fll p <£ for any ex satisfying lexl.s; l. In particular, £ZILjQ) is also denoted by :18(Q). (:18(Q) is also used to denote the space of hyperfunctions.) A function f(x) is called a rapidly decreas- ing Coo -function if it belongs to Coo (R n ) and satisfies lim IxlkID'f(x)I=O Ixloo for any ex and any integer k > O. The totality of rapidly decreasing Coo -functions is denoted by Y. The neighborhood v"k., of 0 in Y is defined to be the totality of functions f(x) such that (I + IxI 2 lID'f(x)1 < £ for any ex satisfying lexl";;;; l. The spaces in this section are tnuclear except for £ZI L and :18, and are employed in the theory of distributions [14J ( 125 Distributions and Hyperfunctions). When Q = R n , we usually omit (Rn); for example, £ZI(R n ) and £ZI L (R n ) are denoted by £ZI and £ZI L , respectively. p p (14) The Function Spaces £ZI{M p )' £ZI(M p )' 8{M p )' 8(M p )' and .91 = C"'. Let {Mp} be a sequence of positive numbers satisfying the logarithmic convexity M;.s; Mp-l Mp+!. 8{M p }(Q) (resp. 8(Mp)(Q)) denotes the totality of Coo-functions f on Q such that for any compact set K in Q there are constants k and C (resp. for any k > 0 there is a constant C) satisfying ID'f(x)I.s;Ckl"M1'1' xEK,lexl>O. 8{pl)(Q) is the totality oftreal analytic func- tions on Q and is denoted by d(Q) or C"'(Q). If {Mp} satisfies the Denjoy-Carleman condi- tion L Mp/Mp+1 < 00, then £ZI{Mp)(Q)= £ZI(Q) n 8{M p }(Q) and £ZI(Mp)(Q) = £ZI(Q) n8(M p )(Q) are dense in £ZI(Q). Conversely, if £ZI{Mp}(Q) is differ- ent from {O}, then {Mp} satisfies the Denjoy- Carleman condition. In this case an f E 8{M p }(Q) (resp. 8(M p )(Q) is sometimes called an ultradifferentiable function of class {Mp} (resp. (M p ))' The most important is the case where M p = p!S for an s> 1. Then an fE 8{M p }(Q) (resp. 8(M p )(Q)) is called a function of GeVfey class {s} (resp. (s)). The topological properties of d(Q) (resp. 8{M p )(Q), etc.) have been discussed by A. Martineau (resp. H. Komatsu). F or function spaces of S type  125 Distri- butions and Hyperfunctions. (15) The Function Spaces 0(Q), 0 p (Q) = Ap(Q), and A(Q). Let Q be an open set in en. The totality of tholomorphic functions on Q is denoted by 0(Q). 0(Q) is a tnuclear Frechet space with the topology of uniform conver- gence on compact sets. It is a closed linear subspace of C(Q) and also of COO(Q). For any p> 1, the totality of functions f 
651 holomorphic in Q and satisfying In If(zW dxdy < 00 (z=x + iy;dxdy is Lebesgue mea- sure), denoted by @p(Q) or Ap(Q), is a Ba- nach space with respect to the norm Ilfll p = (Inlf(z)IPdxdy)l/ p . In particular, it is a Hil- bert space when p = 2. The totality of functions bounded and con- tinuous on the closure of Q and holomorphic in Q (denoted by A(Q) is a Banach space with respect to the norm Ilfll = SUPZEn If(z)l. (16) The Kothe Spaces n A(ex(k) and L A x (ex(k)). Let {ex(k)} be an increasing sequence of se- quences ex(k)==(ex\k), ex),...) of positive numbers. The echelon space n A(OC(k) of G. Kothe [15] is the totality of sequences x = (l> 2'''') such that p(k)(X) = L exl k ) I d < 00 for any k. It is a Frechet space with the topology deter- mined by the seminorms p(k). The co-echelon space"LA x (ex(k J ) is the totality of sequences y = (tll, tl2"") such that Itlil.s; Cexlk) for some C and k. The space s of rapidly decreasing se- quences and the space s' of slowly increasing sequences are the echelon space and the co- echelon space for the sequence exl k ) = i k . If exl k ) = k i , then we obtain the space of power series with infinite radius of convergence and the space of convergent power series, More gener- ally, let ex = (ex i ) be a sequence of positive num- bers. The echelon spaces for exl k ) = exp(kex;) and exp( - k -I ex i ) are called the infinite type power series space and the finite type power series space and are denoted by Aoo(ex) and Al (ex), respectively. Echelon spaces and co-echelon spaces have been employed to construct exam- ples and counterexamples in the theory of tlocally convex spaces by Kothe [15J, Grothen- dieck, Y. and T. K6mura, E. Dubinsky, D. Vogt, and others. C. Dual Spaces When Q is a compact Hausdorff space, any bounded linear functional 11> on C(Q) is ex- pressed by the tStieltjes integral 11>(f) = L f(x)dcp(x),jEC(Q), with a tRadon measure cp, i.e., a (real- or complex-valued) tregular tcountably additive set function defined on the Borel sets in Q. Since cp is of bounded variation, the totality of such cp is denoted by BV(Q). Conversely, any cpEBV(Q) gives a bounded linear functional on C(Q) defined by (2), and 1111> II = the total varia- tion of cp over Q. Hence the dual space of C(Q) is isomorphic to the Banach space BV(Q) with the norm 1111>11. Let Q be a locally compact Hausdorff space. Then the dual space of Coo(Q) is again the 168 C Function Spaces Banach space BV(Q) of Radon measures of bounded variation. On the other hand, the dual space of Co(Q) is the space of all Radon measures on Q. N. Bourbaki takes this fact as the definition of measure. Any bounded linear functional 11> on LI (Q) is expressed as (2) 11> (f) = L f(x) cp (x) dJ1(x), fELI(Q), (3) with a suitable cpE M(Q); and 1111>11 = Ilcpll 00' Conversely, any cp E M (Q) defines a bounded linear functional on LI (Q) by means of (3). Accordingly, the dual space of LI (Q) is isomor- phic to M (Q). The dual space of Lp(Q) (1 <p< 00) is iso- morphic to Li Q ), where q is the real number defined by (lip) + (1Iq) = 1 (accordingly, 1 <q < 00) and is called the conjugate exponent of p. Any bounded linear functional on Lp(Q) is expressible by the formula in (3) (where f E Lp(Q)) with cpELq(Q), and 1111>11 = Ilcpllq' The dual space of M(Q) is isomorphic to the normed linear space of all (real- or complex- valued) finitely additive set functions cp defined on all measurable sets in Q, of bounded vari- ation over Q, and absolutely continuous with respect to the measure J1 given in Q (i.e., J1(N) = o implies cp(N)=O). If 1 <p< 00 and l.s;q< 00, then the dual space of L(p,q)(Q) is isomor- phic to L(p',q.)(Q), where p' and q' are conjugate exponents of p and q, respectively. If Q is nonatomic (i.e., Q has no set of posi- tive measure that cannot be decomposed into two subsets of positive measure), then no con- tinuous linear functionals exist other than zero on S(Q) and on Lp(Q) and L(p,q)(Q) for O<p< 1. The sequence spaces co, Ip (l.s;p<oo), m, and s (the space defined in Section B (6») are special cases of Coo (Q), Lp(Q), M(Q), and S(Q), respectively. Hence their dual spaces can be described explicitly. For example, the dual space of Co (resp. II) is II (resp. m), and if 1 < p < 00, the dual space of Ip is Iq (where (1/p)+ (1Iq)= 1). The coupling of x=(n) and y= (tin) is given by L ntln' The dual space of s is the totality of sequences (tin) such that tin = 0 except for a finite number of n. Let VM O(R n ) denote the closure of Co(Rn) in BMO(Rn). Then the dual space of VMO(R") is identified with HI (Rn). This can be considered to be a generalization of the tF. and M. Riesz theorem. On the other hand, the dual space of HI (Rn) is BM O(Rn) [9]. Hardy spaces Hp(Rn), 0< p < 1, are not locally convex but have suffi- ciently many bounded linear functionals, and their dual spaces are identified with Lipschitz spaces N(Rn), where s = n(p I - 1). Let 1 <p, q < 00. Then the dual space of W;(Rn) is isomorphic to Wp-;-S(R n ) and that of 
168 Ref. Function Spaces B.q(Rn) to B;?q.(Rn), where p' and q' are conju- gate exponents of p and q, respectively. The dual space £ZI' of £ZI is defined to be the space of tdistributions; the dual spaces of lff, £ZI L , and [I' are (algebraically) linear subspaces f £ZI'. Similarly, the dual spaces of £ZI(M p ) and £ZI(M p ) are called the spaces of tultradistributions of classes {Mp} and (M p ), respectively. The dual space of d is identified with the space of thyperfunctions with compact support ( 125 Distributions and Hyperfunctions). A continu- ous linear functional on (Q(Q) is called an analytic functional. For each analytic func- tional <I> there is a compact set L c Q such that 1<I>(f)I.s; C SUpzEL If(z)l. A compact set K is called a porter of <I> if every compact neighbor- hood L of K satisfies this condition. Porters are similar to supports of generalized func- tions, but an analytic functional does not necessarily have a smallest porter. The dual space of an echelon space is the corresponding co-echelon space. References [IJ S. Banach, Theorie des operations line- aires, Warsaw, 1932 (Chelsea, 1963). [2J N. Dunford and J. T. Schwartz, Line.ar operators, Interscience, I, 1958; II, 1963; III, 1971. [3J K. Y osida, Functional analysis, Springer, 1965. [4J J. Lindenstrauss and L. Tzafriri, Classical Banach spaces I, II, Erg. Math., 92 (1977), 97 ( 1979). [5J A. Grothendieck, Criteres de compacite dans les espaces fonctionne1es generaux, Amer. J. Math., 74 (1952), 168-186. [6J A, Zygmund, Trigonometric series, Cam- bridge Univ. Press, second edition, 1959. [7J E. M. Stein, Singular integrals and dif- ferentiability properties of functions, Princeton Univ. Press, 1971. [8J R. A. Hunt, On L(p, q) spaces, Enseigne- ment Math., (2) 12 (1966),249-276. [9J C. Fefferman and E. M. Stein, HP spaces of several variables, Acta Math., 129 (1972),137- 193. [IOJ S. L. Sobolev, Applications of functional analysis in mathematical physics, Amer. Math. Soc., 1963. (Original in Russian, 1950.) [11J O. V. Besov, Investigations of a family of function spaces in connection with theorems of imbedding and extension, Amer. Math. Soc. Trans!., Ser. 2,40 (1964), 85-126. (Original in Russian, 1961.) [12J H. Triebe1, Interpolation theory, function spaces, differential operators, North-Holland, 1978. 652 [13J T. Muramatu, On Besov spaces and Sobolev spaces of generalized functions de- fined on a general region, Pub!. Res. Inst. Math. Sci., 9 (1974), 325-396. [14J L. Schwartz, Theorie des distributions, Hermann, revised edition, 1966. [15J G. Kothe, Topological vector spaces I, Springer, 1969. (Original in German, 1966.) 169 (XI.19) Function-Theoretic Null Sets A. General Remarks By a function-theoretic null set we mean an exceptional set that appears in theorem as the one asserting that a certain property holds with a "small exception." We give below some of the more important examples of exceptional sets. For simplicity, we limit ourselves to the n- dimensional Euclidean space R n (n  2). B. Sets of Harmonic Measure Zero Denote by XE the characteristic function of a set E on the boundary aD of a bounded domain Din Rn. We call the thypofunction !f XE and thyperfunction N XE ( 120 Dirichlet Problem) the inner and outer harmonic mea- sures of E (with respect to D), respectively. When they coincide, the function is called the harmonic measure of E. A necessary and suffi- cient condition for E to be of inner harmonic measure zero is that u.s; 0 hold in D whenever a tsubharmonic function u bounded above in D satisfies lim supu(P).s; 0 as P tends to any point of aD - E. This theorem implies the following uniqueness theorem: If h is bounded and harmonic in D, if E is a set of inner har- / monic measure zero on aD, and if h(P)-->O as P tends to any point of aD - E, then h = O. A necessary and sufficient condition for E to be of outer harmonic measure zero is that there exist a positive tsuperharmonic function v in D such that v(P)--> 00 as P tends to any point of E. (Concerning the existence of a limit for a subharmonic or tharmonic function at every boundary point except those on a set of har- monic measure zero,  193 Harmonic Func- tions and Subharmonic Functions.) C. Sets of Capacity Zero Although there are many kinds of capac- ity ( 48 Capacity), here we consider only t10garithmic capacity and ex-capacity (ex> 0). Let K be a nonempty compact set in R n . Set 
653 W(K) = inf/lH PQ -'dJ1(P)dJ1(Q), where J1 runs through the class of nonnegative tRadon mea- sures of total mass 1 supported by K, and write C,(K)=(W(KW lj ,. Define C,(0)=0 for an empty set 0. For a general set E eRn, define the inner capacity by SUPKcE C,(K) and the outer capacity by the infimum of the inner capacity of an open set containing E. When the inner and outer capacities coincide, the common value is called the ex-capacity (or capacity of order ex) of E and is denoted by CafE). We denote the logarithmic capacity of E by CorE). In order that Co(K)==O (n == 2) or the tNewtonian capacity C n _ 2 (K)==0 (n;d) for a compact set K, it is necessary and sufficient that the harmonic measure of K with respect to G - K vanish for any bounded domain G containing K. Then K is removable for any harmonic function that is bounded or has a finite tDirichlet integral in G - K. In general, K is said to be removable for a family F of functions if for any domain G containing K and f E F defined in G - K, there exists 9 E F defined in G such that 9 = fin G - K. Let K be a compact set in R 2 with Co(K)==O and G be a domain containing K. Let f be a holomorphic function defined in G - K for which every point of K is an tessential singularity. Then the set of texceptional values for f at every point of K is of logarithmic capacity zero. If f is tmeromorphic in Izi < 1 and II 11'12 (1 + If12)2 (1-lzl)' dx dy < 00 (O.s;ex< 1), then f has a finite limit in any angular domain with a vertex at every point of Izi = 1 except for those belonging to a set of ex-capacity zero (logarithmic capacity zero if ex = 0). D. Hausdorff Measure Let ex>O and a set E be given in R n . Denote by 0" a covering of E by a countable number of balls with radii d l , d 2 , "', all of which are smaller than e (>0). As 10-->0, infaLkdf in- creases. The limit is called the Hausdorff mea- sure of E of dimension ex and is denoted by A,(E). In order for a compact set K to be removable for the family of harmonic func- tions defined in a bounded domain and satisfy- ing the tHolder condition of order ex, it is necessary and sufficient that A n -2+,(K)=O. Next, suppose that K is a compact set in a plane and the complement G of K with re- spect to the plane is connected. Set Ilfll q = (S SG Ifl q dx dy)ljq for f holomorphic in G and q, 1.s; q < 00, and Ilflloo = sUPGlfl. Denote by Hq the family off*,0 with Ilfllq<oo. If pis defined by l/p+ l/q= 1, then A 2 _ p (K)< 00 implies Hq= 0 for q, 2<q< 00, and A1(K)==0 169 F Function-Theoretic Null Sets implies H oo = 0. Moreover, H 2 = 0 if and only if Co(K)=O, and Hq= 0 implies C 2 _ p (K) =Oforq,2<q.s;00 [1]. E. Null Sets Defined with Respect to Families of Functions Conversely, L. V. Ahlfors and A. Beurling characterized the size of sets in a plane by means of families of functions [2]. Let D be a domain, and let f represent a holomorphic function in D. Fix a point Zo in D. Set !B=Ullfl.s; I}, :D={fl I Iv 11' 12 dxd y .s;n}, G:=Ulthe area of W;;;'n}, where W is the complement of the trange R of (f(z)- f(ZO»-I. Denote by 6!B, 6:D, 6G: the families consisting of constants and tuniva- lent functions in !B, :D, G:, respectively. Use the notation 3' to represent anyone of these six families, and define Mfj= Mfj(zo; D) by sup{lf'(zo)llfE3'}. Then M'.B==M,,;;;'M'tJ= M 6 ,,;;;'M 6 '.B=M 6 'tJ, and Mfj(zo;D)=O im- plies Mfj(z;D)==O for any ZED. Denote by Nfj the class of compact sets K such that the complement K C of K is con- nected and Mfj(z; KC)=O. We call K E Nfj a null set of class Nfj. In order for K to be re- movable for !B or :D, it is necessary and suffi- cient that K E N'.B or E N'tJ, respectively. We in general have {KI Co(K)==O} N'.BN'tJN6'.B' If Al (K) = 0, then K E N'.B' There exists a set KEN'.B with A1(K»0 (A. G. Vitushkin, Dokl. Akad, Nauk SSSR, 127 (1959); 1. Garnett, Proc. Amer. Math. Soc., 21 (1970». When K is a subset of an analytic arc A, K E N'.B implies A1(K)=0, N'tJ is equal to N 6 '.B' and K be- longs to N D if and only if Co(A)== CorA -K). If an tanalytic function has an essential sin- gularity at every point of K E N'.B' then any compact subset of the set of exceptional values at every point of K belongs to N'.B' A necessary and sufficient condition for K E N'tJ is either that the complement of any one-to-one tcon- formal image of K C be of plane measure zero or that any tunivalent analytic function in K C be reduced to a tlinear fractional function. If the union of at most a countable number of N'.B or N!) sets is compact, it belongs to the same class. But it not true for N 6 '.B sets [3]. F. Analytic Capacity For a compact set K, let D K be the unbounded connected component of K C . The quantity 
169 Ref. Function-Theoretic Null Sets M'B( 00, D K ) is called the analytic capacity of K and is denoted by ex(K). If ex(K) > 0, there exists an extremal function fo(z)=ex(K)z-1 +..., called the tAhlfors function, that maps D K onto a covering surface of the unit disk. In general, ex(K) is not greater than the logarith- mic capacity Co(K). If K is a continuum, then ex(K) = Co(K), and ex(K) is attained by and only by fo(z), which maps D K onto Iwl < I conform- ally and z= 00 to w==O. For a linear set K, ex(K) is equal to a quarter of its length (c. Pommerenke, Arch. Math., 11 (1960)). The concept of analytic capacity is basic for the theory of rational approximation on compact sets. References [IJ L. Carleson, Selected problems on excep- tional sets, Van Nostrand, 1967. [2J L. V. Ahlfors and A. Beurling, Conformal invariants and function-theoretic null-sets, Acta Math., 83 (1950), 101-129. [3J L. Sario and K. Oikawa, Capacity func- tions, Springer, 1969. [4J L. Sario and M. Nakai, Classification theory of Riemann surfaces, Springer, 1970. [5J 1. Garnette, Analytic capacity and mea- sure, Lecture notes in math. 297, Springer, 1970. [6J L. Zalcman, Analytic capacity and rational approximation, Lecture notes in math. 50, Springer, 1968. [7J T. W. Gamelin, Uniform algebras, Pren tice- Hall, 1969. 170 (IX.2) Fundamental Groups A tcontinuous mapping f from the interval 1= {t I O.s; t.s; I} into a topological space Y is called a path connecting the initial point f(O) and the terminal point f(I). In particular, a path satisfying f(O) = f(l) = Yo is called a loop (or closed path) with Yo as the base point. For a path f, the inverse path J of f is defined by J(t) = f(l- t). When the terminal point of f and the initial point of 9 coincide, the path F de- fined by F(t) = f(2t) for O.s; t.s; 1/2 and F(t) = g(2t -1) for 1/2.s; t.s; 1 is called the product (or concatenation) of f and g, and is denoted by f' g. With [f] standing for the equivalence class of a path f under the relation of thomo- topy relative to 0, I (E 1) (i.e., by homotopy with 0 and 1 being fixed), the inverse [fJ -I = [n and the product [f] . [g J = [f. 9 J are defined. In particular, in the set of homotopy 654 classes of loops with one common base point Yo, the product is always defined, and the set forms a group nl(Y,yo). This group is called the fundamental group (or Poincare group) (H. Poincare, 1985) of Y (with respect to Yo). If Y is tarcwise connected, then nl(Y,yo)nl(Y,yd for an arbitrary pair of points Yo, Y b and the structure of the group is independent of the choice of the base point. This group is de- noted simply by ndy). A continuous mapping <P:(Y,Yo)-->(Y',Yo) induces a homomorphism <p*:nl(Y,yo)-->nl(Y',Yo) by sending [f] to <p*[f] = [<pof], and (<p'o<p)*=<po<p* holds for the composite <p' 0 <p of mappings. Thus nl(Y) is a ttopological invariant of Y.lfnl(Y) consists of only one class (the class of the constant path), we say that Y is simply con- nected. For example, cells and spheres sn (n;::, 2) are simply connected. The famous tPoincare conjecture states that a simply con- nected 3-dimensional compact tmanifold is homeomorphic to the 3-dimensional sphere. We have van Kampen's theorem: Let P be a connected polyhedron, PI and P 2 be its con- nected subpolyhedra such that PI n P 2 is con- nected, and P=P I UP 2 . Then ndP) is isomor- phic to the group (tamalgamated product) obtained from the tfree product of n l (PI) and nl (P 2 ) by giving the relations that the images of each element ofnl(P I np 2 ) in ndPd and in n l (P 2 ) are equivalent. Also, the fundamental group of the tproduct spaces is the direct product of the fundamental groups of the spaces involved. Any group is the fundamental group of some tCW complex. The Abelization nl/[n l , nlJ of the fundamental group n l = nl(Y) (Y arcwise connected) is isomorphic to the I-dimensional integral thomology group H l (Y). For example, the fundamental groups of a circle SI and a ttorus Tn are an infinite cyclic group and a free Abelian group of rank n, respectively; the fundamental group of a 1- dimensional CW complex is a free group; and the fundamental group of an orientable 2- dimensional closed surface of tgenus p is a group having 2p generators {ai, ..., a p , bb ..., b p } and a relation a l b l a1 b1 ... apbpa;1 b p - I = 1. If Xo is a fixed point of the circle SI, then the fundamental group can be defined as the set of all homotopy classes of continuous map- pings f: (SI, xo)-->(Y, Yo). Extending the definition of the fundamental group by replacing I, SI with I", sn, we obtain the n-dimensional homotopy group ( 202 Homotopy Theory; 91 Covering Spaces). References [IJ H. Seifert and W. Threlfall, Lehrbuch der Topologie, Teubner, 1934 (Chelsea, 1965). 
655 [2] S. Lefschetz, Introduction to topology, Princeton Univ. Press, 1949. [3] W. S. Massey, Algebraic topology, an introduction, Springer, 1977. [4] E. H. Spanier, Algebraic topology, McGraw-Hili, 1966. 170 Ref. Fundamental Groups 
171 Ref. Galois, Evariste 171 (XXI.25) Galois, Evariste Evariste Galois (October 25, 1811-May 31, 1832) was born in Bourg-la-Reine, a suburb of Paris. In 1828, while still in junior high school, he published a paper on periodic tcontinued fractions. Although he published four papers, his most important works were submitted to the French Academy of Science and either lost or rejected. He was unsuccessful in his attempt to enter the Ecole Poly technique and instead entered the Ecole Normale Superieure in 1829. Active in political affairs, he was expelled from school, imprisoned, and died in a duel soon after his release. The night before the duel, he left his re- search outline and manuscripts to his friend, A. Chevalier. These were published by J. Liouville in J. Math. Pures Appl., first series, 11 (1846). The contents include the concept of groups and what essentially became the tGalois theory of algebraic equations. The manuscript also contained expressions such as "theory of ambiguity," which seems to indicate that Galois intended to study the theory of algebraic functions along the same lines. References [IJ E. Galois, Oeuvres mathematiques, E. Picard (ed.), Gauthier-Villars, 1897. [2] F. Klein, Vorlesungen iiber die Entwick- lung der Mathematik im 19. Jahrhundert I, Springer, 1926 (Chelsea, 1956). [3] R. Bourgne and J.-P. Azra, Ecrits et memo ires mathematiques d'Evariste Galois, Gauthier-Villars, 1962. 172 (III. 7) Galois Theory A. History After the discovery of formulas giving the general solutions of algebraic equations of degrees 3 and 4 in the 16th century, efforts to solve equations of degree 5 remained unsuc- cessful. Early in the 19th century P. Ruffini and N. H. tAbel showed that a general alge- braic solution is impossible. Shortly after- ward, E. tGalois established a general prin- ciple concerning the construction of roots of algebraic equations by radicals. The principle was described in terms of the structure of a certain permutation group (the Galois group) 658 of the roots of the equation. Even in this original form of the theory (Galois theory), Galois not only completed the research started by 1. L. Lagrange, P. Ruffini, and Abel, but also made an epochal discovery that opened the way to modern algebra. J. W. R. Dedekind (Werke III, 1894) interpreted this result as a duality theorem concerning the automorphism groups of a field. It was shown later that Galois theory plays an important role in the general theory of commutative fields estab- lished by E. Steinitz. In the 1920s, W. Krull generalized the idea of Dedekind, using the concept of topological algebraic systems, and obtained the Galois theory of infinite alge- braic extensions (Math. Ann., 100 (1928)). The Galois theory gives a model for a successful theory summarizing the essentials of separable algebraic extensions and has led to analo- gous theories for other algebraic systems. For example, E. R. Kolchin constructed an analo- gous theory for differential fields where the Galois groups are algebraic groups ( 113 Differential Rings, Galois theory of differential fields). Another line of development of this theory, also originated by Dedekind (Werke III, 1876/77), led to the Galois theory of rings, an object of active research by N. Jacobson (Ann. Math., 41 (1940)), T. Nakayama, and others since the 1940s. Also, recently, the Galois theory for some general algebraic sys- tems containing inseparable fields has been constructed by M. E. Sweedler, U. S. Chase, and others in which Galois groups are re- placed by Hopf algebras or bialgebras ( 203 Hopf algebras). B. Definitions Given a group G of tautomorphisms of a given tfield L, the subfield F(G)={aELlaa=a,O"EG} is called the invariant field associated with G. For any extension fields L, L' of K, an isomor- phism of L into L' whose restriction to K is the identity is called a K-isomorphism. If L is a tnormal extension of K, any K-isomorphism of L into L' is a K -automorphism of L. If an algebraic extension L of K is normal, the group G( L/ K) of all K -automorphisms of L is called the Galois group of L/K. A tseparable normal algebraic extension of K is called a Galois extension of K. Let L/K be a finite normal extension; then there exist intermedi- ate fields M and N of L/K such that M/K is a Galois extension and N/K is a tpurely insep- arable extension and L = M 0 K N ( 277 Modules J). Further, G(L/K) = G(L/N)"" G(M/K) and the order of G(L/K) is equal to the tseparable degree of L/K, i.e. [M: K]. A necessary and sufficient condition for L/K to 
659 be a Galois extension is that the invariant field associated with G(LjK) be K. A Galois exten- sion LjK is called an Abelian extension or a cyclic extension when G(LjK) is tAbelian or tcyclic, respectively ( 149 Fields). C. Fundamental Theorem of Galois Theory Let LjK be a finite Galois extension and G its Galois group. Then there exists a tduallattice isomorphism between the set of intermediate fields of LjK and the set of subgroups of G, under which an intermediate field M of LjK corresponds to the subgroup H = G( Lj M); conversely, a subgroup H of G corresponds to M =F(H), The degree of extension [L: MJ is equal to the order of the corresponding sub- group H (in particular, [L:KJ is the order of G), and [M: KJ coincides with the index (G: H). If subfields M and M' are tconjugate over K, then the corresponding subgroups G(LjM) and G(LjM') are conjugate to each other in G, and vice versa. In particular, MjK is a Galois extension if and only if the sub- group H corresponding to M is a tnormal subgroup of G, and in this case, the Galois group G(MjK) is isomorphic to the factor group GjH. D. Extensions of a Ground Field Let LjK be a finite Galois extension, K'jK any extension, and L' the tcomposite field of Land K', Then L'jK' is also a Galois extension, and its Galois group is isomorphic to G(LjL n K') by the restriction map, E. Normal Basis Theorem Let LjK be a finite Galois extension with Galois group G. Then there exists an element u of L such that {u a I 0" E G} forms a basis for L over K called a normal hasis. If we denote by K[GJ the tgroup ring of Gover K, a tK[G]- module structure can be introduced in L by the operation L aaO"(x) = L aaxa; the existence of a normal basis implies that L is isomorphic to K [GJ itself as a K [GJ-module, or in other words, that the K-linear representation of G by means of L is equivalent to the tregular representation of G. F. Examples of Galois Extensions (1) tCyclotomic Fields. Let m be a positive integer not divisible by the tcharacteristic of K; ( a tprimitive mth root of unity, and L = K «(). Then LjK is an Abelian extension, and its Galois group is isomorphic to a subgroup of 172 F Galois Theory the treduced residue class group (ZjmZ)*; in particular, if K = Q, the subgroup coin- cides with (ZjmZ)*, by the irreducibility of tcyclotomic polynomials. Hence the degree [Q«():QJ is equal to <p(m), where <p is tEuler's function. (2) Finite Fields. A tfinite field K has nonzero characteristic p, and the number q of elements of K is a power of p. Also, K is uniquely deter- mined by q (up to isomorphism), hence it is denoted by GF(q) or Fq' Thus GF(qn) is the only extension of GF(q) of degree n; moreover, it is a cyclic extension. (3) Kummer Extensions. Assume that K con- tains a primitive mth root' of unity and the characteristic of K is 0 or is not a divisor of m. Denote by K* the multiplicative group of K. An extension L of K can be expressed in the form L=K(, ...,)(aiEK) if and only if LjK is an Abelian extension and all O"EG(LjK) satisfy O"m = 1; in this case LjK is called a Kum- mer extension of exponent m, There exists a one- to-one correspondence between K um- mer extension L of exponent mover K and finite subgroups Hj(K*)m of the factor group K*j(K*)m, given by the relations H = LmnK*, L= K(vlih Moreover, there exists a canon- ical isomorphism between Hj(K*r and the tcharacter group of G(LjK), so that Hj(K*r is isomorphic to G(LjK). Let L= K(O) be a cyclic Kummer extension of degree m of K, and let 0" be a generator of the Galois group G(LjK). Then the Lagrange resolvent (C a) == a +, oa +... + ,m-I oa m - 1 satisfies (C Or = C I (" 0), (', o)m E K, and a and its conjugates can be expressed in terms of «(, 0). In particular, Lis generated by (C 0) over K. (4) Artin-Schreier Extensions. Assume that K is of characteristic p # O. For any element a of an extension of K, we denote hy [ijJa the element a P -a and by (lj[ijJ)a a root of [ijJX -a=O. A finite extension L of K is of the form L = K «1 j[ijJ)a" ... , (lj[ijJ)a r ) (a i E K) if and only if LjK is a Galois extension whose Galois group is an Abelian group of texponent p; in this case, LjK is called an Artin-Schreier extension. There exists a one-to-one correspondence between Artin-Schreier extensions Lover K and finite subgroups H j&' K of the additive group Kj&'K, given by the relations H == .q7JLn K, L= K«lj[ijJ)H); moreover, Hj&'K is isomorphic to the character group of G(LjK) (therefore also to G(LjK) itself), More gen- erally, for Abelian extensions L of exponent pn (Le., Galois extensions whose Galois groups are Abelian groups of exponent pn), we obtain similar descriptions by using the additive 
172 G Galois Theory group of tWitt vectors of length n instead of K ( 449 Witt Vectors). G. Galois Group of an Equation L/ K is a finite Galois extension if and only if L is a tminimal splitting field of a tseparable polynomial f(X) in K [X]. In this case, we call G( L/ K) the Galois group of the polynomial f(X) or of the algebraic equation f(X) = O. The equation f(X) = 0 is called an Abelian equa- tion or a cyclic equation if its Galois group is Abelian or cyclic, respectively, while f(X) == 0 is called a Galois equation if L is generated by any root of f(X) over K. Generally, G(L/K) can be tfaithfully represented as a permutation group of roots of f(X) =0. If this group is tprimitive, then f(X) = 0 is called a primi- tive equation. The index of the group in the group of all permutations of roots is called the affect of the equation f(X) = 0; if the affect is 1, the equation f(X)=O is called affect- less. Let U 1 , "', Un be talgebraically indepen- dent elements over K. Then for the polyno- mial Fn(X) = X n - UI X n - t +... +( -1)"u n in K(u t , ..., un) [X], the equation Fn(X) = 0 is called a general equation of degree n. The Galois group of Fn(X)=O is isomorphic to the tsymmetric group 6n of degree n, and if K is not of characteristic 2, then the quadratic subfield corresponding to the talternating group 'lln is the field K(jD) obtained by adjoining the quadratic root of the tdiscrimi- nant D of Fn(X). H. Solvability of an Algebraic Equation Assume that K is of characteristic 0, j (X) E K[XJ, and L is the minimal splitting field of f(X). We say that the equation f(X)=O is solvable by radicals if there is a chain of sub- fields K=LocLt c...cLr=L such that L i = L i - t (;) with some a i E L i - \, and this is the case if and only if the Galois group of f(X) is tsolvable (Galois). In particular, Abelian equations are solvable by radicals. Cyclic equations are solved by using the Lagrange resolvent, and theoretically the general solv- able equation can be solved by repeating this procedure. Since 6n is solvable if and only if n.s; 4, it follows that a general equation of de- gree n is solvable only if n= 1,2,3,4 (Abel). (For a method of solving these equations  10 Algebraic Equations D.) Also, a poly- nomial is solvable by square roots if and only if the order of the Galois group is a power of 2. This fact enables us to answer some ques- tions concerning geometric construction prob- lems such as trisection of an angle or division 660 of a circumference in equal parts ( 179 Geometric Construction). I. Infinite Galois Extensions If a Galois extension L/K is infinite, then its Galois group is an infinite group. Let {Mv} be the family of intermediate fields of L/K that are finite and normal over K, and put Hv =G(L/Mv). Then by taking {Hv} as a tbase of a neighborhood system of the unity element, G becomes a ttopo1ogical group. This topology is called the Krull topology (Krull, Math. Ann., 100 (1928)). G is then isomorphic to the tpro- jective limit of the family of finite groups {G/H} and is ttotally disconnected and tcom- pact. There is a one-to-one correspondence between the set of intermediate fields of L/K and the set of closed subgroups of G given by the map (Galois group)+->(invariant field), and thus we have a generalization of Galois theory for finite extensions as described in Section C. Various theories, including the theory of Kum- mer extensions, can be generalized to the case of infinite extensions ( 423 Topological Groups), J. Galois Cohomology Let L/K be a finite Galois extension and G its Galois group. Then both the additive group L and the multiplicative group L * have G- module structures. The tcohomology groups of G with coefficient module L are 0 for all dimensions because of the existence of a normal basis ( 200 Homological Algebra). As for the multiplicative group L *, we have J{o(G, L*) K*/N(L*) (N is the tnorm N LIK ), Ht(G, L*)=O (Hilbert's theorem 90 or the Hilbert-Speiser theorem). In particular, if G is a cyclic group with generator 0", then every element a such that N(a) = 1 can be expressed in the form a=b t - a . H 2 (G, L*) is isomorphic to the tBrauer group of tcentra1 simple alge- bras over K which have L as a tsplitting field. In the case of number fields, a number of G-modules arise, such as tprincipal orders, tunit groups, tideal groups, tidele groups, and so on, whose co homological considerations are important ( 6 Adeles and Ideles; 59 Class Field Theory). Further, let A be a group (not necessarily commutative), and suppose that G acts on A, We denote by aa the action of O"EG on aEA. A is called a G-group if a(ab)=aaab for all a, bE A and O"EG. Then, in the same way as for G-modules, we can define the Oth coho- mology group HO(G, A) to be the subgroup A G of A consisting of all elements of A left fixed by G. The map a: O"f->aa from G into A such that aa, = aa a a, (IT, rEG) is called a 1-cocyc1e with 
661 values in A. Denote by ZI(G,A) the set of the l-cocycles of G with values in A. Two 1- cocycles a and a' in Zl (G, A) are cal1ed co- homologous if there exists an element bE A such that a = b -I aa ab for al1 O"E G. This is an equivalence relation in Zl (G, A) and the 1- cohomology set HI(G, A) of G with values in A is defined to be the set of cohomo1ogous classes of ZI(G, A). HI(G, A) does not have the structure of a group in general, but there does exist an element cal1ed an identity, namely, the cohomo1ogous class containing the trivial 1- cocycle. In general, a set X with an element P of X is cal1ed a pointed set; for any two pointed sets (X, p) and (Y, q), a map f: X --> Y is cal1ed a morphism of pointed sets if f(p) = q. For pointed sets (X, p), (Y, q), and (Z, r), a sequence of morphisms X!... Y!!" z of pointed sets is cal1ed exact if 1m f = Ker g. The I-cohomology set HI (G, A) with the identity can be regarded as a pointed set. Therefore, exact sequences of these sets make sense and possess some of the properties of cohomology groups in the com- mutative case. For example, let I-->A-->B--> C--> 1 be an exact sequence of G-groups (and A is central in B); then HO(G, A)-->Ho(G, B)-->Ho(G, C) -->H I (G, A)-->H I (G, B)-->H I (G, C) (-->H 2 (G, A)) is an exact sequence of pointed sets. For a Galois extension L/K with Galois group G, a linear algebraic L-group defined over K ( 13 Algebraic Groups) has natural1y the structure of a G-group, and we have HI(G, GLn(L))=O. Applying the above exact sequence to I-->SLn(L)-->GLn(L)L *--> I, we have HI(G,SLn(L))=O. Also, we have HI(G, SP2n(L))==O for any integer n  1. It is difficult to define higher cohomology sets natural1y, but various methods to define them have been obtained. In many cases, we are more concerned with the tcategory of Galois extensions of K with K-isomorphisms between them than with a single extension L/K. In other words, we con- sider a tfunctor L--> 3'(L) of the category of Galois extensions of K into the category of (Abelian) groups, and study the cohomology related to G(L/K)-module or G(L/K)-group structures derived from 3'(L). In the case of infinite algebraic extensions, we consider the inductive limit of cohomology of subfie1ds of finite degrees, making use of continuous cocycles of Galois groups relative to the Krul1 topology [8,9]. Let L/K be a Galois extension with Galois group G. Consider objects X, Y defined over K on which the extension of the ground field is defined (such as K -linear spaces with certain tensors on them, algebras over K (associative 172 K Galois Theory or not), K-varieties, or K-a1gebraic groups defined over K). When X and Yare isomor- phic over L, Yis called a L/K-form of X. Let E(L/K, X) be the set of al1 isomorphism classes of X over L. Let A be the group of all auto- morphisms over L of X. For a L/K-form Y of X and an isomorphism f: X --> Yover L, since G acts on X and Y, one can define the isomorphism af: X --> Yover L that satisfies a(f(x)) = af(a x ). In particular, A is a G-group by this action. Further, for (JEG, define aa = f- I . af E A, then a: (Jf-> aa is a l-cocycle of G with values in A that corresponds to a L/K- form Y of X. This induces a bijection from E(L/K,X) onto HI(G,A). Thus L/K-forms of X can be classified if one can determine HI(G,A). For example, H l (G,SP2n(L))=O is equivalent to saying that an L/K-form of a skew-symmetric bilinear form on a K-linear space of dimension 2n is unique up to K- isomorphisms. L/K-forms of a semisimple Lie algebra 9 over K can be classified by the I-cohomology set of the algebraic group Aut(g @ KL). (For the L/K-forms of algebraic groups  13 Algebraic Groups M.) This de- scent theory can also be discussed for more general categories. K. Galois Theory of Rings The theory of tcentralizers in simple algebras can be interpreted as the theory of a certain Galois correspondence with respect to inner automorphism groups. Also, by using tcrossed products, we can deduce from the theory of centralizers in simple algebras the Galois theory of commutative fields. On the other hand, Jacobson obtained a Galois theory of tdivision rings with respect to finite groups of outer automorphisms that is similar to the commutative case. Since then, many alge- braists have proceeded with investigations that aim either at unifying these two theories by admitting inner automorphisms in the group of automorphisms, at extending the theory from division rings to general rings such as simple rings, tprimitive rings, or tsemiprimary rings, or at weakening the finiteness con- ditions. One principal method in these theories lies in considering first the endomorphism ring HomR(S, S) for an extension SIR and then the roles of endomorphisms (or derivations) in it [6] ( 29 Associative Algebras). Let K be a field of characteristic P > 0 and L/K be a finite purely inseparable extension such that LV c K. The set of all tderivations of L/K forms a restricted Lie algebra D(L/K) over K. Then there exists a one-to-one and dua11attice isomorphic correspondence be- tween the set of intermediate fields M of L/ K 
172 Ref. Galois Theory and the set of restricted Lie subalgebras H of D(LjK), given by the relations H = D(LjM) and M == {aEL Id(a)=O, dEH} (Jacobson [7J). Namely, in the case of purely inseparable ex- tensions, derivations or higher derivations play the role of automorphisms, and the bialge- bras defined by such derivations correspond to Galois groups of Galois extension. Now, the group algebra KG of a group Gover K and the (restricted) universal enveloping alge- bra of a (restricted) Lie algebra over K both have the structure of a tHopf algebra. From this fact, unifying the Galois theory of Galois extensions and Jacobson's theory for purely inseparable extensions and using bialgebras or Hopf algebras, one can construct Galois theories of more general objects containing certain nonseparable field extensions (see M. E. Sweedler, Ann. Math., 87 and 88 (1968); S. U. Chase and Sweedler, Lecture notes in math. 97, Springer, 1969). References [IJ J. W. R. Dedekind, Dber die Perm uta- tionen des Korpers alIer algebraischen Zahlen, Gesammelte mathematische Werke, Viewig, 1930-1932, vol. 2, 272-291. [2J E. Artin, Galois theory, Univ. of Notre Dame Press, second edition, 1948. [3J N, Bourbaki, Elements de mathematique, Algebre, ch. 5, Actualites Sci. Ind. l102b, Hermann, second edition, 1959. [4J N. G. Chebotarev, Grundziige der Galois'schen Theorie, Noordhoff, 1950. [5J B. L. van der Waerden, Algebra I, Springer, seventh edition, 1966. [6J N. Jacobson, Structure of rings, Amer. Math. Soc. CoIloq, Publ., 1965. [7J N. Jacobson, Lectures in abstract algebra III, Van Nostrand, 1964; Springer, 1975, [8J J.-P. Serre, Corps locaux, Actualites Sci. Ind., Hermann, 1962; Local fields, Springer, 1979. [9J l-P. Serre, Cohomologie galoisienne, Lectures notes in math. 5, Springer, 1974. 173 (XIX.g) Game Theory A. Introduction and Historical Highlights Game theory consists of mathematical models used in the study of decision making in situ- ations involving conflict and cooperation. A conflict arises when each player in a game 662 selects from a list of alternatives one which, possibly together with chance and random events, leads to various outcomes over which the players have different preferences; thus the behavior of one player aiming at his own favorable outcome might induce unfavorable outcomes for others. Although the potential outcomes usually bring about conflicts among the players, there may be room for cooper- ation among some of them. Game theory attempts to extract that which is common and essential to such situations, to handle them by means of mathematical methods, and to pro- vide a normative guide to rational behavior for each of the players. Game theory thus goes beyond classical theories of probability and decision making that are sufficient to solve games involving just one player and chance. Modern game theory started in 1944 with the publication of the monumental book by von Neumann and Morgenstern [1]. These authors presented many logical classifications of games, including a distinction between two- person and n-person games, between constant- sum (zero-sum) and general-sum games (de- pending upon whether the sum of payoffs to the players is constant (zero) or not), and between noncooperative and cooperative games (depending upon whether any coIlaboration among the players is prohibited or allowed), The second edition included a remarkable expected utility theory, which has become a mainstay of game theory. Von Neumann and Morgernstern also gave three representations of games. The first representation is the so- called extensive form; this representation has been slightly modified by Kuhn [2]. The sec- ond representation is the normal-form game; this is the form which the minimax theorem for two-person zero-sum games was estab- lished [3]. This theorem was generalized to n- person general-sum noncooperative games by Nash [4]. FinaIly, the characteristic-function form was the one in which the authors devel- oped the theory of stable sets. Several other solution concepts, such as the core, the Shap- ley value, and the bargaining set, have since been defined for games in characteristic- function form. HistoricaIly, the development of game theory has been closely related to vari- ous areas of pure mathematics, such as analy- sis, topology, geometry, and the foundations of mathematics. A survey of game theory up to 1957 was presented by Luce and Raiffa [5]. In the 1950s and early 1960s, several major papers appeared in five issues of the Annals of Mathematics Studies [6]. Lucas [7J has pre- sented a good survey of developments to 1972, and Shubik [8J has surveyed the development of the field through 1981. Current articles 
663 appear mainly in the International Journal of Game Theory, and also in the journals in fields such as operations research, manage- ment science, economics, political science, and psychology. B. The Extensive Form An n-person game in extensive form is repre- sented by a game ttree (i,e" a connected graph with no cycles ( 186 Graph Theory)) having the following properties: There is one special vertex corresponding to the starting point of the game, Each non terminal vertex corre- sponds either to a move of one of the n players or to a chance move. The edges ascending from a vertex denote the alternatives of the player at this vertex, For each terminal vertex there is an n-dimensional vector whose components represent the payoffs to each player. The state of a player's information at any stage can be described by certain subsets of the set of all his vertices, called information sets, At each of his moves, player i knows which information set he is in, but not which vertex he occupies within this set. A local strategy for player i is a tprobability distribution over the set of all alternatives at each of his information sets, A behavior strategy for player i is a function which assigns a local strategy to all of his information sets, A pure strategy is a special behavior strategy that assigns a particular choice to each information set. Kuhn [2J showed the existence of pure optimal strategies for n-person general-sum noncooperative games with perfect information (i,e" games in which all information sets contain a single vertex) as a generalization of the result for two-person zero-sum games given in [1 J), Kuhn also proved the existence of equilibrium behavior strategies for games with perfect recall (i,e" games in which player i at any of his information sets remembers all his prior moves but is not aware of the prior choices of the other players), The definition of "equilibrium strategy" is given in the next section. Because of the complexity of n-person general-sum games, research on them has been minimal since the work of Kuhn, Recently a new attack on them has begun; for example, the Nash equilibrium point in extensive forms has been reexamined by Selten [9]. C. The Normal Form (or Strategic Form) An n-person game in normal form is specified by {N, {XtEN, WLEN}' where N=={l, .."n} is a set of players, Xi is the set of player i's strategies, and F i is a real-valued function on 173 C Game Theory the Cartesian product II?1 Xi, called player i's payoff function, Chance is incorporated by invoking an extra set X O of chance moves and a probability distribution on X O , A two-person zero-sum game or matrix game is the simplest case in which the existence of an equilibrium point has been established, Equilibrium fol- lows from von Neumann's minimax theorem: Let M I = {I, .., ,md, M l = {I,... ,m l } be the sets of pure strategies that may be chosen by two players. Let au be player 1 's payoff when strategies i andj are taken by players 1 and 2, A mixed strategy for player i is a probability distribution on M i , Sets of mixed strategies for players 1 and 2 are thus given by Xl = {XERm11 L7'J I X i = 1, XiO ViEM I } and X l = {xER m2 1 L7'='1 Xi = 1, XiO ViEM l }. If players 1 and 2 use the mixed strategies Xl and X2, respectively, the expected payoff for player 1 is FI(X I , X2)= Li Ljxlaijxf, which is a pay- off function for player 1. Von Neumann [3J proved max min FI(XI,Xl)= min max FI(XI,Xl), x 1 eX l x 2 eX 2 x2eX2 x 1 eX l We say that a pair (Xl, Xl) satisfying the foregoing minimax theorem is an equilibrium point. This pair (xl, Xl) turns out to be a saddle point ( 292 Nonlinear Programming A) of F l (Xl, Xl), The duality theorem ( 255 Linear Programming B) is mathematically equivalent to this minimax theorem. A generalization of this equilibrium to n-person general-sum non- cooperative games was presented by Nash [4J. An n-tuple (Xl, .." x n ) (XiE Xi) is a Nash equilib- rium if, for each i, F i (i1, ...,Xi-1,Xi,Xi+t, ...,in) Fi(XI, .."Xi-I,Xi,Xi+l, .."xn) for all XiEX i . That is, no player can improve his payoff by changing his strategy if all other players con- tinue to use the same strategies, Nash demon- strated the existence of this equilibrium for n- person general-sum noncooperative games with finite pure strategies, The existence of Nash equilibria for wider classes of nonco- operative games is proved by means of fixed- point theorems ( 153 Fixed-Point Theorems). Recently, multistage games, such as super- games and stochastic games in which games are played repeatedly, have become a major research topic of noncooperative game theory (with work being done on both extensive and normal forms), In two-person general-sum games or bimatrix games, there arise possibil- ities of cooperation (or bargaining) between two players. A solution concept for such situa- tions, the Nash bargaining solution, was pro- posed by Nash [1 OJ; this solution was sub- sequently discussed by Luce and Raiffa [5]. 
173D Game Theory D. The Characteristic-Function Form (or Coalitional Form) An n-person cooperative game in characteristic- function form is given by a pair (N, v), where N = {I, "', n} is a set of players and v is a real- valued function on a set of all subsets of N with v(0) = 0, called a characteristic function. Sometimes v is assumed to be superadditive, i.e., v(SU T)v(S)+v(T) S, TN with sn T# 0. v(S) represents the worth or the power achievable by the subset (coalition) S when its members cooperate regardless of the be- havior(s) of the players in the complement of S. An n-dimensional vector x = (x I, .., , x n ) is said to be an imputation if it satisfies (i) Xi  v( {i} ) i EN (individual rationality) and (ii) LiENXj = v(N) (group rationality). The set of all imputations, denoted by A, represents all reasonable or realizable ways of distributing the available gains among the n players. An imputation x is said to dominate another im- putation y if there is some nonempty coalition S such that (i) Xi> Yi iES and (ii) LiESXj.s; v(S) (the effectiveness of S with respect to x). The original solution concept for games in characteristic-function form given by von Neumann and Morgenstern [IJ is now called the stable-set or von Neumann-Morgenstern solution. A subset K of A is a stable set if (i) no dominance relation exists between any two elements of K (internal stability) and (ii) any imputation outside K is dominated by some imputation of K (external stability). The existence of stable sets was settled negatively by the ten-person example of Lucas [11]. This example, however, is rather specialized, and thus the existence of stable sets may yet be proved for a large class of games. Stable sets have been characterized for several classes of games, and these sets accurately reflect the coalition-forming processes among the players. Thus stable-set theory remains a ma- jor research topic in the theory of games in characteristic-function form. Many other solution concepts have been developed since that of the stable set. One of these is the core, defined by GiIlies in [6J (its naive idea had already appeared in [IJ). The core is the set C={xEAILiEsXiv(S) SN}. This says that no coalition can protest against or block an imputation in the core on grounds that the coalition can expect more. For superadditive games the core coincides with the set of un- dominated imputations, and thus the core is a subset of any stable set if both exist. The con- dition for nonemptiness of the core was de- rived by Shapley [12J using the duality theo- rem. Another solution concept, defined by Shapley in [6J, is known as the Shapley value. The Shapley value is a function <p(v)=(<pdv), 664 ... , <Pn(v)) sending an arbitrary characteristic function v on N into an n-dimensional Eucli- dean space satisfying the conditions (i) <Pni(nv) = <Pi(V), where n is any permutation of Nand nv(nS)= v(S) SN; (ii) LiES <Pi (v) = v(S) S N such that v(T) = v(S n T) T N; and (iii) <Pi(V+W)== <Pi (v) + <Pi(W) iEN and for any two games v and w. These three axioms uniquely determine the value (s-1)!(n-s)! <Pi(V)= L (v(S)-v(S-{i})) Sfi,N n! S3i for each i, where S is the number of players in S. The Shapley value is an imputation, and can be viewed as a fair-division solution since the three axioms are desirable properties for any equitable allocation scheme. From the above formula, the Shapley value can also be interpreted as the average of the marginal contributions of the players in a coalition. A bargaining-set concept was proposed by Aumann and Maschler in [6]. This set de- scribes what payoffs are stable once a partic- ular coalition structure (a tpartition of N) has formed. Briefly, a payoff associated with a coalition structure is stable or in a bargain- ing set if there is no objection to it from any player; or, even with an objection, if there exists a counterobjection to such an objection from other players. For details  [6]. Since there are many different ways to define objec- tion and counterobjection, there are various types of bargaining sets. Some of them are known to be nonempty. Two additional solu- tion concepts, the kernel and the nucleolus, derive from investigations into particular bargaining sets. The kernel, introduced by Davis and Maschler [13J, is always a non- empty subset of its bargaining set. More impor- tant is the nucleolus defined by Schmeidler [14], which is a unique imputation in the kernel and thus in the bargaining set. It is also in the core if the latter is nonempty. The "excess" of any non empty S  N for an imputa- tion x is defined by e(x, S) = v(S) - LiES Xi' The excess represents the dissatisfaction or the complaint of the coalition S with respect to x. The nucleolus is the imputation which mini- mizes the largest excess. If we have a tie, i.e., if the maximum excess attains a minimum at several imputations, then the next largest excess is to be compared, and so on. That is, the nucleolus is the tlexicographical mini- mum in the ordering of these arrangements. It can be computed by solving a series of linear programs. Several generalizations and variations of the classical von Neumann- Morgenstern for- mulation of games in characteristic-function form have also been investigated, such as the 
665 games without side payments studied by Aumann and others ( e.g., M. Shubik (ed.), Essays in Mathematical Economics: In honor of Oskar Morgenstern, Princeton Univ. Press, 1967); the games in partition-function form proposed by Thrall and Lucas (R. M. Thrall, and W. F. Lucas, Naval Res. Logistics Quart., 10 (1963), 281-298); and the games with in- finitely many players, in connection with which the Shapley value theory has become a major research topic [18]. E. Applications; Related Areas of Mathematics Game theory has been applied to many fields, such as economics, political science, manage- ment science, operations research, information theory, and control theory, as well as to pure mathematics. Games in extensive form are now important tools for analyzing the effects of information, and thus for solving many decision problems with uncertainty. Non- cooperative games in normal form and Nash equilibria have been used in the study of many phenomena, including oligopolistic markets (Friedman [15]), bidding processes, electoral competition, resource allocation, and arms control. Cooperative games have been success- fully applied to economics, and the relation between the core and competitive equilibria sheds further light on the theory of competi- tive economy. It is generally true that com- pctitive equilibria arc containcd in the corc. Debreu and Scarf [16] demonstrated that if the number of players approach infinity in a certain manner, the core shrinks to the set of competitive equilibria. By working in measure-theoretic terms, Aumann [17J was able to identify the core with the set of com- petitive equilibria. It was also demonstrated by Aumann and Shapley [18J that the set of competitive equilibria (and hence also the core) converges to the Shapley value under such formulations, provided certain conditions are satisfied. Another major application of cooperative game theory has arisen ir. pclitical science, wherein value-type soh.t;ons, such as the Shapley value, are widely used as indices of the power of each participant in various voting situations. Major applications are to problems of cost allocation for public goods such as water resources [19J, public transportation systems, and telephone systems; in such appli- cations the core, the Shapley value, and the nucleolus have all been employed. The books [8,20-22J are good references to the most recent applications of game theory. Game theory also has many close relations with various areas of pure mathematics. The following are typical examples. The study of certain types of games in extensive form has 173 Ref. Game Theory increased our understanding of the axiom of choice ( 34 Axiom of Choice and Equiva- lents A) and other foundational questions. Nash equilibrium is closely related to fixed- point theorems ( 153 Fixed-Point Theorems) and to separation theorems ( 89 Convex Sets A). Cooperative game theory also has many connections to functional analysis and to convex analysis. Finally, we mention that the study of dif- ferential games ( 108 Differential Games) is also highly developed and widely used in areas such as economics and control theory. References [1] J. von Neumann and O. Morgenstern, Theory of games and economic behavior, Princeton University Press, 1944; second edition, 1947; third edition, 1953. [2J H. W. Kuhn, Extensive games, Proc. Nat. Acad. Sci. US, 36 (1950), 570-576. [3] J. von Neumann, Zur Theorie der Gessell- schaftsspiele, Math. Ann. 100 (1928),295-320. [4J J. F. Nash, Noncooperative games, Ann. Math., 54 (1951), 286-295. [5] R. Luce and H. Raiffa, Games and deci- sions: Introduction and critical survey, Wiley, 1957. [6J A. W. Tucker et al. (eds.), Contributions to the theory of games, vols, I-IV, and Advanced studies in game theory: Ann. Math. Studies 24, 28, 39,40,52, Princeton University Press, 1950,1953,1957,1959,1964. [7J W. F, Lucas, An overview of the mathe- matical theory of games, Management Sci., 18, no. 5, pt. 2 (1972), P-3-P-19. [8] M. Shubik, Game theory in the social sciences: Concepts and solutions, MIT Press, 1982. [9J R. Selten, Reexamination of the perfectness concept for equilibrium points in extensive games, Int. J. Game Theory, 4 (1975),25-55. [lOJ J. Nash, The bargaining problem, Eco- nometrica, 18 (1950), 155-162. [IIJ W. F. Lucas, A game with no solution, Bull. Amer. Math. Soc., 74 (1968), 237-239. [12] L. S. Shapley, On balanced sets and cores, Naval Res. Logistics Quart., 14 (1967), 453-460. [13] M. D. Davis and M. Maschler, The ker- nel of a cooperative game, Naval Res. Logistics Quart., 12 (1965), 223-259. [14J D. Schmeidler, The nucleolus of a charac- teristic function game, SIAM J. Appl. Math., 17 (1969), 1163-1170 [15J J. W. Friedman, Oligopoly and the the- ory of games, North-Holland, 1977. [16J G. Debreu and H. Scarf, A limit theorem on the core of an economy, Int. Econ. Rev., 4 (1963),235-246. 
174 A Gamma Function [17J R. ], Aumann, Markets with a continuum of traders, Econometrica, 32 (1964), 39-50, [18J R, J. Aumann and L. S, Shapley, Values of non-atomic games, Princeton Univ, Press, 1974, [19J M, Suzuki and M. Nakayama, The cost allocation of the cooperative resource develop- ment: A game-theoretical approach, Manage- ment Sci" 22 (1976),1081-1086, [20J P. C. Ordeshook (ed.), Game theory and political science, New York Univ, Press, 1978, [21J S,], Brams et al. (eds,), Applied game theory, Physica-Verlag, 1979. [22J M, Shubik, A game-theoretic approach to political economy, MIT Press, 1984, 174 (XIV.4) Gamma Function A. The Gamma Function The function r(x) was defined by L. Euler (1729) as the infinite product 1 00 ( I ) X ( X ) -I - n 1+- 1+- x n=1 n n Legendre later called it the gamma function or Euler's integral of the second kind, The latter name is based on the fact that for positive real x, we have r(x)= Ioo e-'t x - l dt, This function satisfies the functional relation r(x+ 1)=xr(x), and hence for positive integral x, we have r(x+ 1)=x!. C. F. Gauss denoted the function r(x+ 1) by II(x) or x!, even when x is not a positive integer. The function x! is also called the factorial function, The gamma function can also be defmed as the solution of the func- tional equation r(x + 1) = xr(x) satisfying the conditions r(l) == 1, . r(x+n) hm-=1. noo r(n)n X Furthermore, we have 1 00 ( x ) _=xe cx n 1 +- e- xln , r(x) n=1 n This expression is known as the Weierstrass canonical form, Here C is Euler's constant C=!im ( 1 ++... +-IOgn ) roo 2 n =0.57721566490153286060651209... , 666 which is conjectured to be ttranscendental, but as yet even its irrationality remains un- proved, However, it is known, that if C were rational, the numerator and the denominator would be integers of more than 30,000 digits (R. p, Brent, 1980), The numerical value of C was calculated by Adams (1878) to 260 decimal places, and recently it has been calculated to more than 20,000 decimal places by means of an elec- tronic computer. Seven thousand digits have been computed by W. A. Beyer and M, S. Waterman (Math. Comp" 28 (1974)), and 20,000 digits have been computed by Brent et al. (1980), r(x) is tholomorphic on the complex x- plane except at the points x=O, -1, -2, ..', where it has simple tpoles, When Re x> 0, we have Hankel's integral representation r(x)= - f (-tf-Ie-'dt, 21smnx C x # integer, where the contour C lies in the complex plane cut along the positive real axis, starting at 00, going around the origin once counterclock- wise, and ending at 00 again, Among various properties of this function ( Appendix A, Table 17.1), the following two formulas are especially useful for numerical calculations: Binet's formula ( 1 ) log2n logr(x)= x- 2 logx-x+ f ro arc tan(t/x) + 2 2 dt, o e t - 1 Re x > 0, and the tasymptotic expansion formula that holds when largxl.s;(n/2)-b (15)0), ( 1 ) log2n logr(x) x- 2 logx-x+ 00 ( _1)n-1 B 2n + n1 2n(2n _1)x 2n - 1 (Stirling's formula), where the Bn are tBernoulli numbers, This last formula can be rewritten as r(x+ 1)=x! xxe-x, which is used for large positive integers x, The integrals f A -, x-I d e t t, o f '" -, x-I d e t t. A Rex>O, are known as the incomplete gamma functions and are used in statistics, the theory of molec- ular structure, and other fields, The texponen- tial integral and the terror function ( 167 Functions of Confluent Type D) are special cases of the incomplete gamma function, 
667 B. Polygamma Functions The derivatives of the logarithm of the gamma function are named the digamma function (or psi function) I/t(x)=dlogqx)/dx; the trigamma function I/t'(x); the tetragamma function I/t"(x); the pentagamma function I/t"'(x), etc. These functions are called polygamma functions. In particular, I/t(x) is the solution of the functional equation I/t(x+ I)-I/t(x)= l/x, 1/1(1)= -C, lim (l/t(x+n)-I/t(1 +n))=O. noo C. The Beta Function Euler's integral of the first kind B(x,y)= f t X - I (I-tr l dt, Rex>O, Rey>O, is called the beta function and is an analytic function of two variables x, y, This function is related to the gamma function as follows: qx)qy) B(x,y)- qx+y)' If the upper limit I in the integral is replaced by a, the result is called the incomplete beta function B,(x, y), References [IJ N. Nielsen, Die Gammafunktion I, II, Chelsea, 1964 (two volumes in one). [2J W. Shibagaki, Theory and applications of the gamma function, with tables of complex arguments (in Japanese), Iwanami, 1952. [3J E. T. Whittaker and G. N. Watson, A course of modern analysis, Cambridge Univ. Press, fourth edition, 1958. [4J K. Pearson, Tables of the incomplete r- function, Cambridge Univ. Press, second edi- tion, 1968. [5J E. Artin, The gamma function, Holt, Rinehart and Winston, 1964. (Original in German, 1931.) Also  references to 389 Special Functions. 175 (XXI.26) Gauss, Carl Friedrich Carl Friedrich Gauss (April 30, 1777- February 23, 1855) was born into a poor family in Braunschweig, Germany. From 176 A Gaussian Processes childhood, Gauss showed genius in mathe- matics. He gained the favor of Grand Duke Wilhelm Ferdinand and under his sponsorship attended the University of G6ttingen. In 1797, on proving the tfundamental theorem of alge- bra, he received his doctorate from the Univer- sity of Halle. From 1807 until his death, he was a professor and director of the Observa- tory at the University of G6ttingen. On March 30, 1796, he made the discovery that it is possible to draw a 17-sided tregular polygon with ruler and compass, which moti- vated his decision to devote himself to mathe- matics. The publication of his Disquisitiones arithmeticae in 1801 opened an entirely new era in number theory. In pure mathematics, he did excellent research on tnon-Euclidean geometry, thypergeometric series, the theory of functions of a complex variable, and the whole theory of telliptic functions. In the field of applied mathematics he made outstanding contributions to astronomy, ge- odesy, and electromagnetism; he also studied the tmethod of least squares, the theory of surfaces ( 111 Differential Geometry of Curves and Surfaces), and the theory of tpo- tential. He considered perfection in papers for publication to be of utmost importance; thus his published works are few relative to his amount of research, However, the scope of his work can be seen in his diary and letters, some of which are included in his complete works, comprising 12 volumes. He is generally consid- ered the greatest mathematician of the first half of the 19th century. References [IJ C. F. Gauss, Werke I-XII, K6nigliche Gesellschaft der Wissenschaften, G6ttingen, 1863-1933. [2J C. F. Gauss, Disquisitiones arithmeticae. German translation, Untersuchungen iiber h6here Arithmetik, Springer, 1889 (Chelsea, 1965); English translation, Yale Univ. Press, 1966. [3J F. Klein, Vorlesungen iiber die Entwick- lung der Mathematik im 19. Jahrhundert I, II, Springer, 1926-1927 (Chelsea, 1956). 176 (XVII.13) Gaussian Processes A. Gaussian Systems A system X={XA(w)IAEA} of real-valued random variables on a probability space 
176 B Gaussian Processes (0, B, P) is said to be Gaussian, or X is a Gauss- ian system, if any finite linear combination of elements X A of X is a tGaussian random variable, Any subsystem of a Gaussian system is again a Gaussian system, In particular, the joint distribution of (XI' X 2 , ..., X n ) for any finite subsystem {X j 11 .s;j.s; n} of a Gaussian system X is a multidimensional Gaussian distribution, This distribution is supported on the whole of R n or on a hyperplane (at most (n - 1 )-dimensional). Let m j be the ex pectation of X j , and let V = Uj,d be the (positive definite) tcovariance matrix of {X j } given by I-j,k = E{(Xj-m)(X k -m k )}, l.s;j, k .s;n, Suppose that the rank of the matrix Vis r, Then the distribution of (XI' X 2 ,..., X n ) is concentrated on an r-dimensional hyperplane ofRn: m+ VRn, m=(m l ,m 2 , ...,m n ), When V is nondegenerate, that is, when r = n, the distribution is supported by the whole of Rn and has a density function of the form (2n)-n/21 VI- I / 2 ex p [ -(x-m) V-I(x-m)} where x=(xl>x 2 , .."xn)ERn, I VI and V-I are the determinant and the inverse, respectively, of V, and where (x-m)' denotes the (column) vector transpose to the (row) vector (x-m), The above expression is the general form of the density of an n-dimensional Gaussian distribution, The characteristic function <p(z) of this distribution is given by <p(z) = eXP[i(m, z) -(Vz, Z)J Z==(Z1,2 2 , ...,zn)ER I1 , where ( , ) denotes the inner product on Rn. This distribution is denoted by N(m, V), If, in particular, m = 0 and if V is the identity matrix, then it is called the n-dimensional standard Gaussian distribution, For a general Gaussian system X = {XAI AEA} we are given the mean vector m A = E(X A ), AEA, and the covariance matrix VA'= E{ (X A -mA)(X -m)}, ..1., JiEA, which is posi- tive definite; for any n  1, complex numbers ex I> ex 2 "." exnEC, and }.I> }'2' "', AnEA, we have n L exjQ(kVA..AkO, j,k=l ) Conversely, given m A , AEA, and a positive definite V=(V;,IA.JiEA), there exists a Gaus- sian system X={XAIAEA}, the mean vector and the covariance matrix of which coin- cide with (m A ) and V=(VA')' respectively, If there exists another system X' with the same 668 property as X, then X and X' have the same distribution, A necessary and sufficient condition for the X A in a Gaussian system to be independent is that V;.. = 0 for every pair }. # Ji, For a Gauss- ian system X = {X n I n  1}, the tconvergence in probability of the sequence {X n } is equiv- alent to tconvergence in the mean square, The limit of the sequence in this case is again Gaussian in distribution, Also for this system, the talmost sure convergence of the sequence implies the convergence in mean square, There are many characterizations of Gauss- ian distributions and of Gaussian systems, (i) A necessary and sufficient condition for a distribution to be Gaussian is that cumulants (tsemi-invariants) Yk of an orders exist and satisfy Yk =0 for all k  3, (ii) Gaussian distri- butions have a self-reproducing property: If X and Yare independent Gaussian random variables, then their sum S = X + Y is also Gaussian, A converse to this property holds: If the sum is Gaussian, then, assuming that they are independent, X and Yare both Gaussian as well (P, Levy, H, Cramer), (iii) If XI and X 2 are independent and if Y l =aX I +bX I and Y 2 =cX I +dX 2 are also independent, then both XI and X 2 are Gaussian except for the trivial case b=c=O or a=d=O, (iv) If for X and Y there exist U independent of X and V independent of Y satisfying Y = aX + U, X = bY + V for some constants a, b, then there are only three possibilities: (1) (X, Y) is Gaussian, (2) X and Yare independent, (3) there is an affine relation between X and y, (v) Suppose that a distribution has a finite mean m and a density function of the form f(x -m), If the tmaximum likelihood estimate of the mean is always given by the arithmetic mean of the samples, then the distribution is Gaussian (c. F, Gauss), For any element X A of a Gaussian system X and for a subsystem X' of X, the conditional expectation E(XA/B') is the orthogonal projec- tion of X A onto X' , where B' is the smallest 0"- field with respect to which an the X in X' are measurable and where X' is the closed linear subspace of U(O,P) spanned by X', A system X == {XAIAEA}, X A =(Xl, "', Xi), of d-dimensional random variables is said to be Gaussian if the colIection {X {IA E A, 1 .s; j.s; d} is Gaussian, B. Complex Gaussian Systems Let Z be a complex-valued random variable with mean m, and denote it in the form Z = X +iY +m, i==J=1, X, Y real. If X and Y are independent and have the same Gaussian distribution with zero mean, then Z is called a 
669 complex Gaussian random variable. A system Z= {ZAIAEA} of complex-valued random variables is said to be complex Gaussian if any finite linear combination L CjZ Aj , CjE C, is complex Gaussian. A complex Gaussian sys- tem has properties similar to those of a Gaus- sian system discussed above. For instance, two complex Gaussian random variables are inde- pendent if and only if they are uncorrelated. Convergence properties are also similar. Fur- thermore, one can define complex Gaussian systems consisting of higher-dimensional ran- dom variables. C. Gaussian Processes A real-valued stochastic process {X,} is called a Gaussian process or a normal process if it forms a Gaussian system. If {X,} is a complex Gaussian system, it is called a complex Gauss- ian process. The most important example of a Gaussian process is tBrownian motion {B,I tO} with the properties E(B,)=O and E(B, - Bs)2 = It -sl. There are several gener- alizations of this process, among them: (i) tBrownian motion with a d-dimensional time parameter, (ii) a tWiener process with d- dimensional time parameter, which is a Gaus- sian system {Xala=(a l , ...,a d ), all ajO} with E(Xa)=O and E(X a , X a ,) = IIjmin{a] ,aJ}, ai=(a\, '" ,a), i= 1,2. Since a multidimensional Gaussian distri- bution is completely determined by its tmean vector and the tcovariance matrix, a Gaussian process is strongly stationary if it is weakly stationary ( 395 Stationary Processes); and such a process is called a stationary Gaussian process. The mean value m and the spectral measure F(dA) are associated with a weakly stationary process. The measure F(dA) is sym- metric with respect to the origin since each X, is real-valued. Conversely, given such F(dA) and m, we can construct a real weakly station- ary process {X,} with mean value m and spec- tral measure F(dA). GeneralIy, such a process {X,} is not determined uniquely; however, if {X,} is Gaussian, then there exists only one stationary Gaussian process with given m and F(dA). In view of this fact, stationary Gaussian processes can be regarded as being typical among weakly stationary processes. The discussions so far on stationary Gauss- ian processes are generalized to the case of stationary complex Gaussian processes, for which symmetry of F(dA) need not be assumed. The trandom measure {M(A)} in the tspectral decomposition of a complex Gaussian process {X,} again forms a complex Gaussian system, Weakly stationary Gaussian random distri- butions can also be introduced. 176 E Gaussian Processes D. White Noise and Gaussian Random Distributions A real-valued weakly stationary process with discrete parameter is called a white noise if the mean m=O and the tcovariance function p(t) = 1 (t =0); = 0 (t #0). Obviously the tspectral measure F(dA) is the tLebesgue measure dJ... In the continuous parameter case, a weakly stationary random distribution ( 407 Sto- chastic Processes C) is called a white noise if m =0 and p=b (15 is tDirac's delta function: b(<p) = <p(0)). The spectral measure is, therefore, the Lebesgue measure. In both cases Gauss- ian white noise {X,} or {X'I'} is determined uniquely. It has independent values at every point in the folIowing sense: In the discrete parameter case, X'I' X", ..., X," are mutualIy independent for any mutually distinct t l , t 2 , ..., t no For a continuous parameter Gauss- ian white noise, often calIed simply white noise if no confusion occurs, X'I'I ' X'I'" . .. , X'I'" are mutually independent if the supports of <PI' <P2, ..., <Pn are disjoint. In the latter case Gaussian white noise is realized by taking the derivative of a Brownian motion. The tcharac- teristic functional c( <p) is given by C(<p)=ex p [ -"<P1121 where II II is the L 2 (R I )-norm. Characteristic functionals of general Gauss- ian random distributions are expressed in the form c(<p) = exp [im(<p)- K(<p, <p)l where m is the mean functional and K is the covariance functional. E. Representations of Gaussian Processes The family of tstochastic integrals X, = JF(t,u)dBu, t>a (t>a if a== -00), based on Brownian motion, defines a Gaussian process {X,lta}. The converse problem is, however, not obvious. Given a Gaussian process {X,lt>a} with E(X,) =0, the problem is to find a Brownian motion {B,} and a kernel F that give a representation of the above form (P. Levy [10]). A more specific problem dis- cussed below is important. If, in addition, the representation is formed in such a way that B,(X) = B,(B) holds for every t, then the repre- sentation is called canonical, where B,(X) is the smalIest 0" -field with respect to which all the X" s.s; t, are measurable. The canonical repre- sentation, if it exists, is unique up to equiva- lence, Le., F(t, U)2 is unique. The existence of the canonical representation is, however, not 
176 F Gaussian Processes always guaranteed. Mean continuous, zero mean, purely nondeterministic stationary Gaussian processes have tbackward moving average representations, which are nothing but the canonical representations. Once the canon- ical representation of {X,} is obtained, the kernel, together with known properties of {B,}, tells us important properties of the given pro- cess, e.g., sample Function properties, Markov properties, and so forth. Let {X, It> a} be a general Gaussian process with E(X,) =0, and let M,(X) be the subspace of U(Q, P) spanned by the X" s.s; t. Assume that (i) {X,} is separable, i.e., Moo (X) is sepa- rable and (ii) {X,} is purely nondeterministic in the sense that nsM.(X)= {O}, Then X, is expressed as a sum of stochastic integrals: X'=L f ' Fj(t,u)dBL, J a where the {BL},j= 1,2, ''', N (N can be 00), are mutually independent tadditive Gaussian processes. In addition, the BL can be taken in such a way that the measures dvJ(u) = E(dBL)2, j = 1,2, ... , N is decreasing in j (i.e., dv 1 » dv 2 »...) and that B,(X) = VjB,(BJ) holds for every t. Such a representation is called a gen- eralized canonical representation of {Xi} [3]. It always exists under the above assumptions (i) and (ii), although it is not unique when N2. The number N, called the multiplicity of {X,}, is independent of the choice of a generalized canonical representation. It is a good question to ask when {X,} has a simple unit multiplicity or when {X,} has the canonical representation. No interesting answer to this question has been given so far except for stationary Gauss- isn processes. F. Gaussian Markov Properties A tsimple Markov Gaussian process {X,lta} has the canonical representation if it is sepa- rable and purely nondeterministic and if its covariance function never vanishes. It is of the form X,= f(t) f g(u)dB(u), f(t) #0, f g(u)2du#0 for t>a. A generalization of the simple Markov prop- erty of a Gaussian process is given in the following manner (suggested by P. Levy [IOJ). If the tconditional expectations E(X,.IB,(X)), for any choice of distinct t" t 2 , ... , t N+J > t (j0) span an N-dimensional subspace of U(Q, P), then the process {Xt} is called an N- pIe Markov Gaussian process. If the canonical representation of an N-ple Markov Gaussian 670 process exists, then it is expressed in the form N f ' X,= J/j(t) a gj(u)dBu, where det(Jj(t;) never vanishes for any choice of distinct t i (> a), 1 .s; i.s; N, and where the gj are linearly independent vectors in L 2([a, tJ) for any t> a. Conversely, a Gaussian process given by the above expression is N -pIe Mar- kov. A stationary N-ple Markov Gaussian process has a tspectral density function f(A) of a specific form, namely, it is expressed in the form [3J f().)=IQ(i).)/P(iAW; P, Q polynomials, degP=N and degQ<N (rational spectral density function). Y. Okabe [14J proved that the roots of P(x)=O are all real, and introduced a multiple Markov prop- erty of a stationary Gaussian process {X,} in am uch wider sense to prove that {X,} enjoys this property if and only if it has a rational spectral density. A somewhat restricted definition of multiple Markov properties for a Gaussian process uses differential operators. Assume that X, is (N -l)-times differentiable with respect to the L 2 (0., P)-norm. If there is an Nth order dif- ferential operator L = Lfo ak(t)D N - k with D=d/dt such that LX, = H" H, = dB,/dt white noise, then {X,} is caUed an N-ple Markov Gaussian process in the restricted sense. Such a process is naturally N -pIe Markov, and the canonical representation always exists. The kernel of the representation is the tRiemann function of the differential operator L that was used in the definition. The spectral density function corre- sponding to this process is of restricted form, namely, it is expressed in the form 1/IP(iAW, where P is a polynomial of degree N, and P(iA) = 0 has no root in the lower half-plane. Many attempts have been made to define a Markov property of a Gaussian random field, namely, a Gaussian system with a multi- dimensional time parameter; however, only two significant approaches are mentioned here, Let {Xa I aE R d } be a Gaussian random field with d-dimensional time parameter. H. P. McKean [12J gave a Markov property in the following manner. Let :Fu be the O'-field gen- erated by the XX, XE U, U( cR d ) open, For a closed set C, we set :Fc= n,:Fc" where C, is the e-neighbourhood of C. If, for any open set U, :Fu and :Fu' become independent under the assumption that :Fau is known, then {Xa} is called a Markov Gaussian random field in the McKean sense. It has been proved that a Brownian motion with d-dimensional time 
671 parameter is Markov in this sense if d is odd, while it is not Markov if d is even. Further detailed investigations of Markov properties for Gaussian random fields have been given by L. Pitt, S. Kotani, Y. Okabe [17], and others. Another definition of Markov properties of Gaussian trandom fields, in fact those of Gaussian random distributions, has been given by E. Nelson [13J in connection with Eucli- dean fields (invariant under Euclidean mo- tion) in quantum theory. Let X", be a Gauss- ian random field, For a smooth surface r in R d let ff r be the O"-fleld generated by X<I>, <I> = IjJ @ b(n), IjJ being a smooth function with compact support and n being normal to r, If ff D and ff D , become independent under the assumption that ff r , r = aD, is given, then {X",} is called a Markov Gaussian random field in the Nelson sense. Given a Gaussian Euclidean field which is Markov in the Nelson sense, then under some reasonable assump- tions, one can form a field (in the sense of tquantum field theory) satisfying the tWight- man axiom by changing imaginary time to real time. G. Sample Function Properties Some finer results about the smoothness of sample functions have been obtained for station- ary Gaussian processes, G. A, Hunt proved that if the spectral measure F(dA) of a stationary Gaussian process {X,} has finite moment of order a, then almost all its sample functions satisfy the following tH61der condition of order a [5]: For every finite interval I and every constant C, there exists a sufficiently small ho = ho(I, C) such that IX'+h - X,I < Clhl'(Iog 1/lhl)l;2 for any tEl and I h I < ho. The following result is due to Yu. K. Belyaev: Sample functions of a stationary Gaussian process are with proba- bility 1 either continuous or unbounded on every finite interval. Other conditions for continuity of sample functions are given in terms of the tcovariance function. Let v(s, t) be the covariance func- tion of a Gaussian process {X, I t E [0, 1] }. If v satisfies Iv(s"t)-v(s2,t)l.s;cl s l-s21' (O<a.s; 1, v(O,s)=O, sl's2E[0,IJ), then ( . IX'I- X '2 1 p 11m sup ,/2 1/2 010.I'I-'21,;;0It l -t 2 1 Iloglt l -t 2 11 <c I c l / 2 /a 2 )= 1, 176 H Gaussian Processes where C l is a constant. This result is due to Z. Ciesielski. Conditions for continuity of sample func- tions have also been given for Gaussian pro- cesses with multidimensional time parameters. For {X, I tE [0, IJn}, set d(s, t) = E[(Xs- X,)2J'/2. If there exists a function <P monotone increas- ing on some interval 0 < u < a such that d(s, t).s; <p(lls - t II), where II II is the metric in R n , and such that f +oo <p(e- X2 )dx < 00, then almost all sample functions are continu- ous (X. Fernique [2]), Conversely, if there exists a monotone increasing function <p satis- fying d(s, t)= <p(lls- tll) and if sample functions are continuous with positive probability, then the above integral should be fmite. Suppose that {X,} is a stationary Gaussian field. R. M, Dudley [11] proved that if there exists a number q> 1 and a neighborhood V = V(O) in Rn such that L q-k(log N(V, q-k))1/2 < 00, k where N(V,e) is the minimum number of e- balls (relative to the metric d) that cover V, then almost all sample fields of {X,} are con- tinuous, and Fernique [18] proved that the converse statement is also true. Thus the pro blem of continuity of the sample fields of stationary Gaussian fields has been completely solved. H. Gaussian Measures Let {X" t E T}, T an interval, be a Gaussian process. It gives a probability distribution on the measurable space (R r, .@), where .@ is the O"-field of Borel subsets of R r . Such a mea- sure P is called a Gaussian measure. Let pi, i= 1, 2, be Gaussian measures induced by Gaussian processes {X;, tE T}, i = 1, 2, respec- tively. Then for these measures concepts such as tabsolute continuity, tequivalence, and tsingularity are introduced in the usual man- ner. The most significant property for Gauss- ian measures is that two measures pi and p2 are either equivalent or singular. A powerful criterion for testing this dichotomy is the tent ropy of p2 with respect to pi, given by { P2(A;) } H(P2Ipl)=Sp P2(A;)log pl(A;) , where the sup is taken over all finite parti- tions a == {A" ..., An}, AiE.@, U Ai == R r (YU. A. Rozanov [15J). The measures pi and p2 are equivalent (resp. singular) if and only if 
176 I Gaussian Processes H(p21 pi) < 00 (resp. = 00). This statement can be paraphrased in terms of the Hilbert spaces H(X i ), i= 1, 2, spanned by {X;}, i= 1, 2, respec- tively. For simplicity, assume that E(X;) = O. Then, pi and p2 are equivalent if and only if the foIlowing two conditions hold: (i) A map- ping A determined by AX; = X,2 is extended to an invertible linear transformation from H(X I ) to H(X 2 ); (ii) There exists a symmetric operator B of tHilbert-Schmidt type such that (A, AI/h =((I - B), 1/)" where (, I/)i= E(I/) is the inner product in H(X i ). Next, suppose that {X;}, i= 1,2, are not centered. Set E(X;)=m;, i = 1, 2. Then P I and p2 are eq uivalent jf and only if {Xf} with X; = X; - m; satisfy the fol- lowing condition: (iii) in addition to conditions (i) and (ii) above, there exists an element (0 in H(X I ) such that m,I -m?==E[oX/J, tET. Given equivalent Gaussian measures pi and p2 on (R T, .@), T = [0, to], it is in general not easy to obtain the tRadon-Nikodym de- rivative; however, if one of them, say pI, is the tWiener measure, then one can obtain <P = dpl jdp2 explicitly (M. Hitsuda [4]). Assume that p2 is derived from {X,}, By assumption, X, has the tcanonical representation X,=B,- I fI l(s,U)dBu+a(S)}ds, tET, where {B,} is the tBrownian motion from which pI is assumed to be derived and where IEL1(T2), aEL 2 (T). With this notation. we can write <P= : =exp[IO fI I(S,U)dBu+a(S)}dB s -Io (I I(s, u)dB u +a(s)Y ds 1 This result yields a remark about general Gaussian measures: If pI and p2 are equiva- lent, {X/, tE T} has a canonical representation if and only if {X,2, tE T} does. I. Nonlinear Functionals Start with nonlinear functionals of a Brownian motion {B" tE R I}, and caIl them Brownian functionals. If {<Pn} is a complete orthonormal system in L 2 (R I), then the coIlection {X n = S <Pn(u)dBu} oftstochastic integrals forms a Gaussian system of independent standard Gaussian random variables. A Fourier- Hermite polynomial based on {<Pn} is a poly- nomial in X n expressible as a finite product in the form c n Hn (X j jj2), c a constant. . J ] The sum II = L IIj is the degree of this polyno- mial. If n > 0, its mean is zero and the variance 672 is Icl 2 II j (n) 2 nj ). Taking c to be II j (n! 2 n r l / 2 , the coIlection of all Fourier-Hermite polyno- mials based on {<Pn} forms a complete ortho- normal system in the Hilbert space (L 2 ) con- sisting of all Brownian functionals with finite variance. Denote by .Ye n the subspace of(L 2 ) spanned by all the Fourier-Hermite polyno- mials of degree n. Then the Hilbert space (L 2 ) admits a direct-sum decomposition: 00 (L 2 )= L EB£;., n=O which is caIled the Wiener-Ito decomposition [7J. The subspace £;. is eventuaIly indepen- dent of the choice of a complete orthonormal system {<Pn}' A member X of £;. is referred to as a multiple Wiener integral of degree n and is expressed in the form X= f  ff(SI,S2,00.,Sn)dB s ,dB s2 ...dB s " where f is a symmetric L 2(Rn)-function. In ad- dition, E(X 2 ) = n! Ilf11 2 , II II being the L 2(Rn)_ norm. Brownian functionals can be expressed as functionals of white noise, so that the Hilbert space L 2(Ji) is viewed as a realization of (L 2), where Ji is the probability distribution of the Gaussian white noise introduced in Section D. Nonlinear functionals of a general Gaussian process can be dealt with in a similar but somewhat more complicated manner. If the process has a canonical representation, then the functionals can easily be rewritten as Brownian functionals. J. Applications to Prediction Theory If a Gaussian process {X, I t  a} has the canon- ical representation X,= f F(t,u)dBu, then the best tpredictor E(X,jBs(X)), s < t, is given by r F(t, u)dBu. This is in agreement with the best tlinear predictor. No systematic approach for nonlinear pre- diction theory has been established so far. We iIlustrate this theory only in a typical case that arises from a stationary Gaussian process. Let {X,} be a real-valued stationary Gaussian process. Without loss of generality, we can as- sume that E(X,)=O. We consider the continu- ous parameter case, since the discrete param- etcr case is easier and can be inferred from it. Let M,(X) be the subspace of L 2 (Q,P) de- 
673 fined in the same way as in Section E, and let L, = L 2(Q, B,(X)). If the process is purely non- deterministic in the sense that n, M,(X)= {O}, then X, has the backward moving average representation X,= foof(t-S)dB" where {B,} is a Brownian motion. Further- more, this representation is canonical, i.e., B,(X)= B,(B). Since B is assumed to be V,B" every Y in (L 2) can be expressed in the form 00 f ro f S' f S2 Y=po -00'" -00 _oof P (S"S2' ... , S p)dB s1 dB" ... dBsp' in terms of multiple Wiener integrals. The predictor of Y based on the known values X s , s.s; 0, that is, the projection of Y on Lo is given by truncating the domain of the integral as 00 f o f S' f S2 po -00'" -00 _oof p (St,S2, ..., sp)dB s , dB s2 '" dBsp' K. Extrapolation and Interpolation Extrapolation of a stationary Gaussian pro- cess is used to fmd the best estimate of the unknown values of a given process when we have been given the only some of the past values of the process. M. G. Krein [8] ob- tained some results by using the theory of the inverse spectral problem discussed by I. M. Gel'fand and B, Levitan. In contrast to extrapolation is the interpola- tion of a stationary Gaussian process. There is an important contribution to this prob- lem by H. Dym and H. P. McKean [1] who developed the Krein theory by applying the de Branges method to the theory of entire functions. References [IJ H. Dym and H. P. McKean, Gaussian processes, function theory, and the inverse spectral problem, Academic Press, 1976. [2J X. Fernique, Continuite des processus gaussiens, C. R. Acad. Sci, Paris, 258 (1964), 6058-6060. [3] T. Hida, Canonical representations of Gaussian processes and their applications, Mem. ColI. Sci. Univ. of Kyoto, (A, Math,) 34 (1960),109-155. [4J M. Hitsuda, Representation of Gaussian processes equivalent to Wiener processes, Osaka J. Math., 5 (1968), 299-312. 177 A Generating Functions [5J G. A. Hunt, Random Fourier transforms, Trans. Amer. Math. Soc., 71 (1951),38-69. [6J I. A. Ibragimov and Yu. A. Rozanov, Gaussian random processes, Springer, 1978. (Original in Russian, 1970.) [7J K. Ita, Multiple Wiener integral, 1. Math. Soc. Japan, 3 (1951),157-169. [8J M. G. Krein, On the fundamental approx- imation problem in the theory of extrapola- tion and the filtration of stationary random processes, Selected Transl. in Math. Stat. Prob., 4 (1963), 127-131. (Original in Russian, 1954.) [9J P. Levy, Processus stochastiques et mouvement brownien, Gauthier-Villars, 1948; second edition, 1965. [10J P. Levy, A special problem of Brownian motion and a general theory of Gaussian random functions, Proc. 3rd Berkeley Symp. Math. Stat. Prob. II, Univ. of Calif. Press, 1956, 133-175. [11] R, M. Dudley, The size of compact subset of Hilbert space and continuity of Gaussian processes, 1. Functional Anal., 1 (1967),290- 330. [12J H. P. McKean, Jr., Brownian motion with a several-dimensional time, Teor. Vero- yatnost i Primenen., 8 (1963), 357-378. [13J E. Nelson, Construction of quantum fields from Markov fields, 1. Functional Anal., 12 (1973), 97 -112. [14J Y. Okabe, On the structure of splitting fields of stationary Gaussian processes with finite multiple Markovian property, Nagoya Math. 1., 54(1974),191-213. [15J Yu. A. Rozanov, Infinite-dimensional Gaussian distribution, Proc. Steklov Inst. Math., 108 (1971). (Original in Russian, 1968.) [16] M. Yor, Existence et unicite de diffusion a valeurs dans un espace Hilbert, Ann. Inst. H. Poincare, (B) 10 (1974),55-88. [17J Y. Okabe, Stationary Gaussian pro- cesses with Markovian property and M. Sato's hyperfunctions, Japan 1. Math., 41 (1973),69-122. [18J X. Fernique, Regularite des trajectories des fonctions aleatoires gaussienes, Lecture notes in math. 480, Springer, 1974, 1-96. 177 (XIV.2) Generating Functions A. General Remarks A power series g(t) = Lo ant n in t that con- verges in a certain neighborhood of t == 0 de- fines the sequence of numbers {an}' The func- tion g(t) is called the generating function of 
177 B Generating Functions the sequence. Similarly, the series K(x, t) = L:'=ofn(x)t n that is convergent for x and t in a certain domain in (x, t)-space is calIed the generating function of the sequence of func- tions {In(x)}. Sometimes the function g(t)= Lo(an/n!)tn is calIed the exponential generat- ing function of the sequence {an}' For exam- ple, the generating functions of the tbinomial coefficients and tLegendre polynomials are (1 + t}n and (1- 2tx + t 2 r l / 2 , respectively. When a generating function of {an} or {fn(x)} is given, we can obtain an or fn(x) by integral expressions; for example, in the latter case we have 1 r K(x,t) fn(X)= 2ni Jc dt, where the contour C is a sufficiently smalI circle going counterclockwise around the origin. A generating function can be continued beyond the domain of convergence of the power series. Simple generating functions are known for many important orthogonal sys- tems of functions ( 317 Orthogonal F unc- tions). Generating functions are widely used because they enable us to derive analyticalIy the properties of sequences of numbers or functions. For a system of numbers or func- tions depending on a continuous parameter instead of the integral parameter n, we define the generating function in the form of a tLa- place or tFourier transform. In particular, for a probability tdistribution function F(x), the exponential generating function f(t) = f"'ooe-'XdF(x) for the moments {an} of F(x) is calIed the moment-generating function of F(x), There is another kind of generating func- tion for the sequence an: L ann- s in the form of tDirichlet series; it is frequently used in number-theoretic problems. B. Bernoulli Polynomials A system of polynomials Bn(x) = kt O G) Bk(O)X n - k is defined by the generating function te X ' 00 tn ---,-= L Bn(x)-. e - 1 nO n! Bn(x) is called the Bernoulli polynomial of degree n. Since Bk(O) is the coefficient of t k /k! in t 00 t n -- " B ( 0 ) - '_ 1 -1... n " e nO n. we have Bo(O) = 1, BI (0) = -1/2, B 2 (0) == 1/6, ...; B2n+1 (0) = 0 for n  1; and ( _1)n-1 Bzn(O) > 0 for n  1. The nth Bernoulli number Bn ( Ap- 674 pendix B, Table 3) is usualIy defined by I Bn(O)I. Sometimes other definitions, such as Bn = Bn(O) or Bn = B 2n (0), are used. BernoulIi polynomials satisfy the relations Bn(x + 1) - Bn(x) = nx n - I , dBn(x)/dx = nBn-l (x), which are used in tinterpolation. For example, a polynomial solution of the tdifference equa- tion f(x + 1) - f(x) = Lo anx n is given by f(x) = LoanBn+1 (x)/(n + 1) + (arbitrary con- stant). In particular, we have 1 n + 2 n + ... + pn = (Bn+1 (p + I) - Bn+1 (1))/(n + I). C. Euler Polynomials A system of polynomials En(x)= f ( n ) akxn-k k=O k is defined by the generating function 2e x ' 00 t n ---,-= L En(x)-. e + 1 FO n! We calI En(x) the Euler polynomial of degree n. Here a k is defined by a k = EdO) and 2 00 t n ---,- 1 = Lan,' e + nO n. so that we have a o = 1, a l = -1/2, a 3 = 1/4, ".; a 2n =0 for n  1. The nth Euler number is sometimes taken as an' but more often it is defined by En=(-I)" f 2 ( n ) a, o J1 i.e., by 2 00 E -=sechx= L (_I)n--"x n eX+e- x nO n! ( Appendix B, Table 3). AII the En are inte- gers, E 2m +! =0 (m=O, 1, ...) and E 2m >0 (m= 0,1,...); in the decimal expressions for En the last digit is 5 for E 4m (m 1) and 1 for E4m+2 (mO). Sometimes E 2n is denoted by En. We have the relations 00 E x n secx= L' n=O n! En(x)+En(x+ 1)=2x n , En(J - x) = (- I)n En(x), dEn(x) =I1En_l(X), and in particular, _I n +2 n _3 n +4 n _... +( _ljPpn =(( -1)P E,,(p+ 1)- E,,(I))/2. 
675 D. Application to Combinatorics Let p(n) denote the number of ways of dividing n similar objects into nonempty class. This is called the number of partitions of n. Euler noticed that the following formula is valid for the generating function of p(n): 1 + n1 p(n)xn=LQ (l-xn)TI, whence we obtain p(n)= p(n-l)+ p(n- 2)- p(n-5)- p(n-7)+ p(n-12)+ p(n-15)-... = Ld -1)H[p(n-(3k 2 -k)/2)+p(n-(3k 2 + k)/2)J ( 328 Partitions of Numbers). Let B(n) denote the number of ways of dividing n completely dissimilar objects into nonempty classes. These are called Bell numbers. The first few are 1, 1,2,5, 15,52,203,877,4140, 21147,... [2]. For this sequence with B(O)= 1, the formal generating function Lo B(n)x n does not converge except at x=O. However, the exponential generating function g(x)= Lo B(n)(n!t l x n is convergent for all complex numbers x and is equal to eeLl. The differen- tial equation g' = e 9 gives rise to a recursive formula B(n+ 1)= t ( n ) B(k). kO k References [1 ] E. Netto, The theory of substitutions and its applications to algebra, revised and trans- lated by F. N. Cole, second edition, Chelsea, 1964. (Original in German, 1882.) [2] E, T. Bell, Algebraic arithmetic, Amer. Math. Soc., 1927. [3] E. B. McBride, Obtaining generating func- tions, Springer, 1971. Also  references to 389 Special Functions, 178 (VII.G) Geodesics A. General Remarks In the study of global differential geometry, geodesics play an essential role because the behavior of geodesics on a complete Riemann- ian m-manifold (m  2) M without boundary heavily influences its topological structure, A significant merit of using geodesics is that one Can make elementary and intuitive observa- tions on M (as in Euclidean geometry), which often yields fruitful results. Riemannian structure induces in a natural way a distance function d on M, and hence M is a metric space. A smooth curve y of constant 178 A Geodesics speed is by definition a geodesic if and only if every point on y is contained in a nontrivial subarc whose length realizes the distance be- tween its endpoints. M is equipped with the Levi-Civita connection V ( 80 Connections) by means of which a geodesic is characterized as an autoparallel curve. In local coordinates, y is obtained as a solution of an ordinary differential equation of order two ( 80 Con- nections L). It follows from the properties of solutions of the equation for geodesics that every point pEM has a neighborhood in which every two points can be joined by a unique shortest geodesic that depends smoothly on both of the endpoints. A classical result of 1. H. C. Whitehead (Quart. J. Math., 3 (1932)) states that every point pE M has a metric ball B centered at p such that every two points in B are joined by a unique shortest geodesic whose image is contained in B. Let Mp be the tangent space to M at p, and let MpcM p be the set of all vectors VE Mp such that the solution of geodesic y with initial condition y(O) = p, y(O) = v is well defined on [0, 1]. Mp contains an open neighborhood of the origin and is tstar- shaped with respect to the origin. The ex- ponential mapping exp p : M p...... M is defined to be exp p v=y(1). This mapping expp is smooth and has the maximal rank at the origin. Small balls around p are obtained as the image under the exponential mapping of the corre- sponding balls in Mp centered at the origin, and the restriction of expp to these balls are diffeomorphisms. Thus the topology of M as a metric space is equivalent to the original one of M. The fundamental Hopf-Rinow theorem (Comment. Math. He/v., 3 (1931)) states that (1) M is complete as a metric space if and only if Mp=Mp for some pEM; (2) M is complete if and only if every closed metric ball is compact; and (3) if M is complete, then every two points can be joined by a shortest geodesic, namely, a geodesic with length realizing the distance between the endpoints. In the following discussion M is assumed to be connected, complete, and without boundary, Let y:[O, IJ......M be a geodesic. A piecewise smooth I-parameter variation V along y is a continuous mapping V: [0, IJ x (-s, s)......M with a finite partition 0 == to < t 1 < ... < t k = 1 such that VI [t i , ti+lJ x (-s, s) is smooth and V(t,O)=y(t) for all tE[O, 1]. Then the first and second variation formulas for V are L'(O) = < Y, y) 16/L(y) and k [f " L"(O)=i 'i-! «Y;', Y;')-<R(Y;,y)y, Y;»dt + <Vo1osY;,Y)I:;-,JI L(y), 
178 B Geodesics respectively, where Y;(t) = 0 V(t, s)/os I FO for t E [t i - I , tJ is the variation vector field as- sociated with VI [t i - I , tJ x (-E., E.), L(s) is the length of the variation curve t --> V(t, s), and Y;' == V y Y; and R is the curvature tensor. A smooth vector field J along a geodesic 1': [0, 1] --> M is called a Jacobi field if and only if it satisfies J" + R(J;y)y = O. The set of all Jacobi fields along I' forms a vector space iso- morphic to My(O) x My(o) by the natural corre- spondence J-->(J(O),J'(O)), Every Jacobi field is associated with a I-parameter geodesic varia- tion V along y. Namely, every variation curve of V is a geodesic, V is smooth, and J(t) = o V(t, s)/os 1 FO for aII tE [0,1]. Conversely, the variation vector field of any I-parameter geodesic variation V along I' is a Jacobi field. EspecialIy, if A is a tangent vector to Mp at y(O), then d(expp)y(O) A = J(1), where J is the Jacobi field associated with the I-parameter geodesic variation V(t, s) = exppt(y(O) + sA), and A is identified under the canonical parallel translation in Mp. A point y(to) is said to be conjugate to P = 1'(0) along I' if and only if there exists a nontrivial Jacobi field along I' that vanishes at 0 and to. A theorem of Morse and Schoenberg states that if the sectional curva- ture K of M satisfies 0 < A.s; K.s; B < 00, then every unit speed geodesic 1': [0, lJ --> M has a point conjugate to 1'(0) if lnlR and has no point conjugate to 1'(0) if l < nlft. EspecialIy, every geodesic I' on M with K.s;O contains no conjugate point on it, and exp p is localIy regu- lar for any pEM. Thus a complete and simply connected M with K.s;O is diffeomorphic to Rm (Hadamard and Cartan). Let I' be a unit speed geodesic with 1'(0) = p. If t I> 0 satisfies d(p, y(t)) = t for all t E [0, t I) and d(p, y(t)) < t for aII t> t I, then the point y(t l ) is called a cut point to p along y. It ap- pears no later than the first conjugate point to p along y. The set C(p) of all cut points to p is called the cut locus of p. Let U be the set of vectors tvEM p , O.s;t< 1, where exppv is the cut point to p along y(t) = expptv. U is a nonempty open set and exppl U: U-->M -C(p) is a diffeo- morphism, where M - C(p) is diffeomorphic to an m-disk. The cut locus possesses essential information on the topology of M. The metric comparison theorems of H. Rauch (Ann. Math., (2) 54 (1951)) and M. Ber- ger (Illinois J. Math., 6 (1962)) are stated as follows: Let infMK =15> -00 and SUPMK =< 00, and let 1': [0, 1 J --> M be a nontrivial geo- desic. Let M(c) be a complete and simply connected space form of constant curvature c. Fix geodesics Yo, I'd: [0, 1J-->M(b), M(), re- spectively, with the same speed as y. If J, J o , and J d are Jacobi fields along 1', Yo, and y respectively such that J(O) = Jo(O) = J d(O) = 0 and IIJ'(O)II = IIJ';(O) II == IIJ(O)II, then IIJd(t)II.s; 676 IIJ(t)ll.s; IIJo(t)11 holds for aII tE [0, tdJ, where Y d(td) is the first conjugate point to Yd(O) along Y d (Rauch). If the Jacobi fields satisfy IIJ(O)II = IIJo(O)11 = IIJd(O)11 and J'(O)=J';(O)=J(O)=O, then iiJd(t)ll.s; IIJ(t)ll.s; 1IJ0(t) II holds for aII t E [0, tJ, where t is the first zero point of J d (Berger). They are often applied to compare curve lengths as follows. Let c: 1--> Rm be a piecewise smooth curve with Ilc(t)lI.s;t d . Fix points pEM, pEM(b), and PdEM() with fixed isometric identifications of the tan- gent spaces with Rm. Then L(expp,oc).s; L(exppoc).s; L(exp p , oc). If P, Po, and P d are unit parallel fields along corresponding geo- desics (J, 0"0' O"d: I -->M, M(b), M(), respec- tively, which have the same speed, and if <P, d") = <Po, do) = (P d , d'd)' then for every piecewise smooth function f: 1-->[0, tJ the curves u(s) = eXPa(s/(s)P(s) and u(s), ud(s) de- fined in the same way on M(b), M(), respec- tively, have lengths L(ud).s;L(u).s;L(u o )' They play important roles in the theory of geodesics. A geodesic triangle is a triple (1'0,1'" 1'2) of shortest geodesics (for convenience they are parametrized on [0, 1]) such that y;(I)=Yi+1(O) for aII i=O, 1,2 with mod3. The vertices Po, PI, P2 and the angles 0 0 , 0 1 , O 2 of a geodesic triangle (1'0,1'1' 12) are defined by Pi+l = Yi(O) and Oi+1 = cos -I {<y;(O), -Yi-d1) )/IIYiIIIIYi-III}, where angles are always taken in [0, n]. The triangle comparison theorem of Toponogov (Amer. Math. Soc. Transl., 37 (1964)) is stated as follows. Let K  15 > -00 be satisfied on M. For any geodesic triangle (Yo, Yl, 1'2) on M there is a geodesic triangle (Yo, YI' Y2) on M(b) such that L(y;) = L(yJ and the angles iI; of this triangle satisfy iI;.s; 0i for i = 0, I, 2. It turns out that if 15 > 0 then the circumference of any geodesic triangle on M does not exceed 2nl fl. F or details of the basic facts stated above we refer the reader to [1- 3]. B. Curvature and Fundamental Groups The fundamental group n I (M) of M ( 170 Fundamental Groups) is influenced by the curvature of M. A basic idea, going back to Hadamard (J, Math. Pures Appl., 4 (1898)) and Cartan, states that every nontrivial tfree homotopy class bEndM) of loops on a com- pact M contains a closed geodesic of minimal length whose preimage under the covering projection n: 1M --> M in the universal Riemann- ian covering 1M is either closed (when 15 is of finite order) or a straight line (when 15 is of infinite order) that is translated along itself by the tdeck transformation b. (I) By using the second variational formula, an even-dimensional compact M with K > 0 
677 either is simply connected or has n 1 (M) = Z2 (Synge, Quart. J. Math., 7 (1936)). A beautiful result of Myers (Duke Math. J., 8 (1941)) states that if the Ricci curvature Ric of M is bounded below by b > 0, then the diameter d(M) is not greater than n/ fl and hence M is compact and n l (M) is finite. It is proved in [3J that if K  0, then there exists a finite normal sub- group <I> of ndM) such that ndM)/<I> is a Bieberbach group ( 92 Crystallographic Groups), The splitting theorem ( Section F(2)) is used in the proof. (2) When K.s;O is satisfied on M, M is dif- feomorphic to Rm, and hence n,(M) =0 for all i  2. A basic tool used in the study of n dM) in this case is that the displacement function x--> d(x, b(x)), xEM, of every isometry b on M is convex [4J, where a function on M is said to be convex if and only if its restriction to every geodesic is convex. A classical result of Preissmann (Comment. Math. Helv., 15 (1943)) states that every nontrivial Abelian subgroup of ndM) of a compact M with K <0 is an infinite cyclic subgroup. It is proved in [4] that if K <0 holds on M and if every deck trans- formation on M translates some geodesic along itself, then nl(M) is a disjoint union of infinite cyclic subgroups, and any two com- muting elements belong to the same cyclic subgroup. Moreover, if M is compact and K.s;O and if ndM) is a direct product r l x r 2 such that nl(M) is tcenterless, then M is iso- metric to the Riemannian product M 1 x M2' and nd M ;) = r; holds for i = 1,2 [5,6]. It is shown in [7J that the fundamental group of M with K .s; 0 occurs as that of an (m + 1)- dimensional M' with K.s;c <0, which is diffeo- morphic to M x R. The warped product [4] is used to construct such a metric on M x R. c. Cut Locus The injectivity radius i(M) of M is defmed to be the infimum of the continuous function x-->d(x, C(x)), XE M. As is seen in Section D, the estimate of injectivity radii provides many fruitful results on the topology of Riemannian manifolds. It follows from Synge's result that an even dimensional compact and orientable M with 0 < K.s; 1 has its injectivity radius i(M)  n [3]. However, such an estimate can- not be obtained in odd dimensions. The ex- amples discussed in [8J show that there are infinitely many homotopically distinct homo- geneous Riemannian 7-manifolds ofposi- tive curvature. The injectivity radii of such examples are estimated precisely in [9J, ac- cording to which there is no positive lower bound for them. The sphere theorem of Kling- enberg (Comment. Math, Helv., 35 (1961)) 178 D Geodesics states that if M is compact and simply con- nected and if 1/4 < K.s; 1, then i(M)n, and M is a topological sphere. This extends the pioneering work by H. Rauch (Ann. Math., (2) 54 (1951 )). A rigidity theorem of M. Berger (Ann. Scuola Norm. Sup. Pisa, 14 (1962)) states that if dim M is even, M is simply connected, and 1/4.s; K .s; 1, then M is homeomorphic to sm (if d(M) > n) or isometric to one of the sym- metric spaces of compact type of rank 1 (if d(M)=n), A slight generalization of the trian- gle comparison theorem is used to prove that for given positive constants d, V, and S, there exists a constant Cm(d, V,S»O such that if K, d(M) and the volume v(M) satisfy IKI <S, d(M)<d, and v(M» V, then i(M)Cm(d, V,S) [10]. D. Finiteness Theorem Finiteness theorems are a natural extension of the sphere theorem. They provide a priori estimates for the number of various topolog- ical types of manifolds (for instance, homology, homotopy, homeomorphism, diffeomorphism types, etc,), which admit certain classes of Riemannian metrics characterized by geo- metric quantities, The basic idea of the esti- mates is to fmd a constant c > 0 that depends only on geometric information which char- acterizes the class of manifolds so that if M be- longs to the class, then i(M)c. Then a number N is found from the information given a priori such that every element M of the class has an open cover of at most N balls w hose radii are all less than c. It is proved in [11 J that for given bE(O, 1) and m, there exists a number N(b,m) such that there are at most N(b,m) homotopy types for the class of all simply connected 2m-dimensional manifolds with b .s; K .s; 1. Furthermore, for given positive numbers d, V, and S, there are at most finitely many homeomorphism (or diffeomorphism) types for the class of all m-manifolds M, each of which has the property that d(M)<d, v(M) > V, and IKI <S [10]. This result is applied to obtain a result in [12], which states that there exists for given m, V> 0 and an e > 0 such that if M is compact, -1-e.s;K.s; -1, and v(M» V, then M admits a metric of constant nega- tive curvature. For an e O, M is said to be e-flat if and only if SUPM K . d(M)2 .s; e is satisfied for M. Every compact flat manifold is e-flat for alleO. M is called a nilmanifold if and only if it admits a transitive action of a tnilpotent Lie group. It is shown in [13J that for a compact M with dim M = m, there exists a number e(m) > 0 such that if Mis e(m)-flat, then (1) there is a maxi- mal nilpotent normal divisor N cndM), (2) 
178 E Geodesics the order of n I (M)/ N is bounded by a constant which depends only on m, and (3) the finite cover of M which corresponds to N is diffeo- morphic to a nilmanifold. Moreover, if Mis F.(m)-flat then M is diffeomorphic to Rm. If M is F.(m)-flat and if n l (M) is commutative, then M is diffeomorphic to a Horus. E. Uniqueness Theorem Uniqueness of topological structures (as in the sphere theorem) of certain classes of compact Riemannian manifolds is discussed here. The discovery of texotic 7-spheres by J. Milnor (Ann. Math., (2) 64 (1956)) gave rise to the question of whether in the sphere theorem the conclusion (homeomorphism to the sphere) could be replaced by diffeomorphism. Since the number of differentiable structures on a topological sphere depends on its dimension ( 114 Differential Topology), it has been thought that in order to get the standard sphere in the sphere theorem the best possible restriction for the curvature might also depend upon dimension. The appropriate differenti- able pinching problem is to find a sequence {m} so that if M is compact and simply con- nected and if b.s; K .s; 1 for some b > m' then M is diffeomorphic to sm, and m is the least possible with this property. D. Gromoll and Y. Shikata proved independently [14, 15J that m < I holds for all m  2. Later it was proved that there exists a b o E(I/4, 1) such that m.s;bo holds for all m  2 [16]. The diffeotopy theo- rem, which plays an essential role in the proof of finding such a b o , provides a sufficient con- dition for a diffeomorphism on sm-I to be tisotopic to the identity mapping. A different idea is put forth in [17], which imitates the Gauss normal mapping of a closed convex hypersurface in Rm+l. It is proved that if the curvature operator is suffi- ciently closed to the identity, then M is diffeo- morphic to sm. This idea is used to obtain a diffeomorphism between M and a spherical space form. A generalization of the sphere theorem states that if M is compact and if (d(M)/n)2infM K > 1/4, then M is a topological sphere [18]. F. Noncompact Manifolds Let M be noncompact. It is due to the nature of noncompactness that through each point on M there passes a ray y: [0, 00)--> M, e.g., y is a unit speed geodesic any of whose subarc is minimizing. (1) Busemann Functions. A Busemann function Fy: M --> R with respect to a ray y is defined by 678 Fy(x) = lim hoo [t -d(x, y(t))]. The original defi- nition of it was used by H. Busemann to define a parallel axiom on straight G-spaces ( Sec- tion H). Fy has the property Fy-I « -00, tlJ) = {XE Fy-I« -00, t 2 J) I d(x, oFy-1 « -00, t 2 J))  t 2 - td for any t 2  t1> and hence Fy(x) = t 2 - d(x,oF y - I « -00,12]) for all t 2 and x with F y (x).s;t 2 . It has been proved, by using the second variation formula, that if Ric O, K  0[19] or if, in the case where M is Kahlerian, the holomorphic bisectional curvature [20J is nonnegative, then Fy is subharmonic (sh), convex or tplurisubharmonic (psh), respec- tively, where the holomorphic bisectional curvature is defined as R(X, JX, JY, Y) for the complex structure J and orthonormal vectors X and Y. If the holomorphic bisectional curva- ture [20] is positive, then Fy is strictly psh. If K  0, then the triangle comparison theorem implies that F=sup{FyIY(O)=p} is convex and is an exhaustion [21], where the sup is taken over all rays emanating from a point pEM, and a function f is said to be an exhaustion if and only if f-I« -oo,aJ) is compact for all aE R. A well-known theorem of H. Grauert (Math. Ann., 140 (1960) states that if M admits a strictly psh exhaustion function, then M is a tStein manifold ( 21 Analytic Functions of Several Complex Variables). In this context, various conditions for curvatures on Kahler manifolds under which they become Stein manifolds have been found [20]. Let H be complete and simply connected, and K .s;0. Busemann functions on Hare differentiable of class C 2 , and the thorosphere Fy-l ({ t}) is a C 2 -surface [22]. On a parabolic visibility manifold ( Section F(3)) the nega- tive of every Busemann function is CI-convex without minimum [7]. (2) Ends and Splitting Theorems. Ends of a noncom pact M are defined as folIows: If Al and A 2 are compact subsets of M with Al C A 2 , then any component of M - A 2 is con- tained in a unique component of M - AI' An end is the limit of an inverse system {compo- nents of M - A; A} directed by the inclusion relation as indicated above and indexed by {AlA compact}. It has been shown that ifRic>O [23] (or O [19J), then M has exactly one end (or at most two ends). A visibility manifold has at most two ends if it is not Fuchsian [7]. If M admits a locally nonconstant convex function, then M has at most two ends [24]. If M has more than one end, then there exists a straight line y:R-->M which is by definition a nontrivial geodesic any of whose subarcs is minimizing, A classical result of Cohn-Vossen (Mat. Sb., 1 (1936)) states that if dim M = 2, K  0 and if M contains a straight 
679 line, then the total curvature is 0 and hence M is isometric to either a plane or a cylinder SI x R. Generalizations of this result state that if M admits k independent straight lines and if K O (Toponogov, Amer. Math. Soc. Transl., 37 (1964)) (or ifRicO [19J), then M is iso- metric to the Riemannian product N x R k . (3) Structure Theorems. The structure theorem of 2-dimensional M with K > 0 was proved by Cohn-Vossen (Compositio Math., 2 (1935)). It has been shown that M is diffeomorphic to Rm if K >0 [23J and that if K O, then there exists a compact totaIly geodesic submanifold S without boundary such that M is homeo- morphic to the total space of the tnormal bundle over S [21]. Here, homeomorphism can be replaced by diffeomorphism, and if K > 0 outside a compact set of M, then M is diffeomorphic to Rm [26]. For a ray y and a point x on H, if (J is a ray with (J(O) = x and if d(y(t), (J(t)), t > 0, is bounded above, then (J is caIled an asymptotic ray (a ray asymptotic) to y. The asymptotic relation is an equivalence relation on the set of all geodesics on H, A point at infinity of H is an equivalence class on geodesics on H, and the set of all points at infinity of H is denoted by H(oo). With a suitable topology, the set of all points at infinity of H constitutes a bounding sphere such that H = H U H( 00) is a closed m-cell. For a properly discontin- uous group D of isometries acting on H, a closed D-invariant limit set L(D) is obtained in H(oo) as the set of all accumulation points of b(p), bED. M =H/D is a visibility manifold if and only if H satisfies the foIlowing axiom: any two distinct points on H( 00) can be joined by at least one geodesic, If K .s; c < 0 is satisfied on M, then M is a visibility manifold [7]. By investigating limit sets, visibility manifolds are classified into three types [7]. (1) M is parabolic; M is diffeomorphic to N x Rand is characterized by the fact that it has a convex function without minimum. (2) M is axial; M is a vector bundle over S I, and hence it is diffeomorphi to either SI X Rm-I or the prod- uct of a Mobius strip with Rm-2. (3) M is Fuchsian; M has more than two ends, and a strong algebraic restriction is imposed on the fundamental group of M. (4) Convex Functions. Some elementary prop- erties of Coo convex functions on M have been investigated and it was proved in [4J that if M admits a Coo convex function with- out minimum and if K .s;0, then M is diffeo- morphic to N x R, where N is a level hyper- surface, When K > 0, F can be replaced by a strongly convex exhaustion function. Namely, for every compact set A eM there is a b> 0 178 G Geodesics such that the second difference quotient along every geodesic at any point on A is bounded below by b [20]. A standard tconvolution smoothing procedure yields smooth convex approximations in neighborhoods of compact sets for every strongly convex function such that their second derivatives along every geo- desic is positive. Global approximations can be constructed [27]. Let <p: M --> R be a convex function which is not constant on any open set. It has recently been proved that (1) if b > infM<P -00, then there is a homeomorphism H: <p -I( {b}) x (infM <p, 00 )-->M - {minimum set of <p, if any} such that <p 0 H(y, ex) = ex for all yE<p -I ({b}) and for all exE(inf M <p, 00); (2) if <p takes a minimum, then the continuous exten- sion H: [inf M <p, 00)--> M of H is proper and surjective [24]. G. Manifolds AlI of Whose Geodesics Are Closed As is well known, every symmetric space of compact type of rank 1 (it is abbreviated CROSS as in [28J) has the property that all geodesics are simply closed and of the same length. Let M be a compact Riemannian mani- fold with the property that all geodesics on M are simply closed and that they have the same length (SC property). The problem discussed here is whether such an M is isometric (or at least the topology of such an M is equivalent) to some CROSS, or if it is possible to classify all such manifolds. An example of a nontrivial SC-manifold was first constructed by Zoll (Math. Ann., 57 (1903)) on S2 as a surface of revolution in R 3 , which is not isometric to a standard sphere. Blaschke conjectured that every SC-structure on PR 2 is the standard real projective space, This has been solved affirmatively by L. W. Green (Ann. Math., 78 (1963)). In higher di- mensions it has been proved that every in- finitesimal deformation of the standard SC- structure on PRm is trivial [29]. A general result for a SC-manifold M states that the volume of M (with dim M = m) with period 2n is the integral multiple of the volume of the standard unit m-sphere and this integer is a topological invariant [30]. For a point pE M, if every geodesic segment with length I emanating from p is a simple geodesic loop at p, then M is caIled a SCP-manifold, It has been proved that the tintegral cohomology ring of every SCP -manifold is isomorphic to one of the CROSS [31J, which is a generalization of Bott's theorem (Ann. Math., 60 (1954)). An essential difference between the metrics of a ZoIl surface and a standard sphere is seen 
178H Geodesics on the cut locus and the conjugate locus of a point. The (tangent) cut locus of any fixed point of a CROSS coincides with the (tangent) first conjugate locus; however, this does not hold on a Zoll surface. This observation gives rise to the definition of a Blaschke manifold: For distinct points P, qEM, let A(p,q)cM q be the set of all unit vectors tangent to the mini- mizing geodesics from q to P, M is said to be a Blaschke manifold at a point P if for every q E C(p), A(p, q) is a great sphere of the unit hypersphere in Mq centered at the origin. M is called a Blaschke manifold if it is so at every point of it. The following statements are equiv- alent [28]: (1) M is a Blaschke manifold at p, (2) the tangent cut locus C p at p is spher- ical in Mp, (3) along every geodesic starting from p, the first conjugate point p appears at a constant length and the multiplicity of it is independent of the initial direction. Then such an M can be exhibited as D U a E, where D is a closed ball with center p, E a Coo-closed k-disk bundle over an (m - k)-dimensional Coo-compact manifold with boundary oE dif- feomorphic to sm-l, and a:oD-->oE an tattach- ing diffeomorphism. Conversely, if M is ex- hibited as D U a E then there exists a metric 9 on M such that (M,g) is a pointed Blaschke manifold at p, where p is the center of D. A generalized Blaschke's conjecture states that every Blaschke manifold is isometric to a CROSS, A partial solution to this conjecture has been obtained by M. Berger and states that if (sm, g) is a Blaschke manifold, then it is isometric to a standard sphere [28]. H. G-Spaces G-spaces were created by Busemann to show that many global theorems of differential geometry are independent of the Riemannian character of the metric and also of smooth- ness. This approach naturally led to novel questions and results. (Many facts hold even when the distance is not symmetric;  [34].) The symbol G suggests that the principal property of G-spaces is the existence of geo- desics with the same properties as in complete Riemannian manifolds without boundary except for differentiability. A G-space X is defined as follows: (I) X is metric with (sym- metric) distance d; (2) every bounded infinite set has at least one point of accumulation; (3) given two distinct points p, rE X, there is a point qEX such that p #q #r and d(p,q) + d(q, r) = d(p, r); (4) to each point XE X a posi- tive number Px is assigned such that for any p, qEX with d(p,x)<px, d(q,x)<px there is a point r with d(p,q)+d(q,r)=d(p,r); (5) if 680 d(p,q)+d(q,r)=d(p,r) and if d(p,q)+d(q,r')= d(p,r'), then d(q,r)=d(q,r') implies r=r'. From (2) and (3) it follows that any two points on X can be joined by a curve called a segment whose length realizes the distance between them. A geodesic arc (or for simplicity geodesic) y: [a, bJ --> X is a curve such that any subarc contained in a Px-ball around some point XEX is a segment. From (4) and (5) it follows that every geodesic arc has a unique infinite extension to both sides, If a geodesic arc is extended infinitely to both sides, then it is called a geodesic line. The absence of smoothness causes an essen- tial difference in the fact that the distance function to a fixed point on X is not necessar- ily convex in a small ball around it, in contrast to the Whitehead theorem ( Section A) for Riemannian manifolds (or tFinsler spaces). Here a function on X is called convex if and only if its restriction to every arc length- parametrized geodesic line is convex. Every two points on X have neighborhoods which are homeomorphic to each other. Thus the topological dimension of X ( 117 Dimen- sion Theory) is well defined, A I-dimensional X is either a circle Sl or the whole real line, If dim X = 2 [33J or if dim X == 3 [36], then X is a topological manifold. It should be noted that G-space theory not only proves known Riemannian theorems under weaker conditions, but also leads to many facts which were either not considered previously or well understood in (or thought to be very different from) the Riemannian case. Among many fruitful topics discussed by Busemann [33], only two are included below. (1) G-Surfaces. For each point p on a 2-dimen- sional G-space if', which is called a G-surface, let Sp= {XE9' Id(x,p)= p}, where O<p <pp/4 is a fixed number. Sp is homeomorphic to a circle and bounds a 2-disk. An angle A at p is the set of all segments emanating from p and passing through the points on a connected subarc of Sp. An angular measure 1'1 for the angles at p is a nonnegative function whichatisfies: (1) I A 1= n if and only if p is the midpoint of the segment joining two points on Sp which bounds A; (2) if the intersection*of two angles Al and A 2 is a unique segment, then IAI UA 2 1 = I All + IA 2 1. If a triple of points (PO,P"P2) on 9' are contained in a sufficiently small ball, then the three segments joining them define a triangle, and each vertex Pi has the angle IPi-IPiPi+tl determined by the two segments. An excess F.(POPIP2) of a triangle (Po, P"P2) is defined to be F.(POPtP2)=IPoplp21+lptP2Pol + Ip2POPII- n, A degenerate triangle has excess O. If a triangle (a, b, c), which consists of geodesic arcs, is simplicially decomposed 
681 into a finite number of triangles (ai' b;, C;), i = 1,..., k, then £(abc) = Ll £(aibic;), and this is independent of the choice of finite decompo- sition. If :7 is compact and orientab1e, the total excess £(:7) of:7 is defined by £(:7) = L=I £(aibic;), where ::I' is simplicially decom- posed into nondegenerate finite triangles (ai, bi,C;), i= 1,..., k. Then £(:7)==2nX(:7), where X(::I') is the tEuler characteristic of ::1'. The results obtained by Cohn- Vossen [35J on the ttotal curvatUre of complete open surfaces have also been generalized to G-surfaces with angular measure uniform at n. Although no angular measure leading to a true analog of the tGauss-Bonnet theorem exists even in G- surfaces [34J, many of its applications, in particular the results by Cohn- Vossen, can be proved by using angular measures which are uniform at n [33]. If every two points on X can be joined by a unique geodesic line, then al1 geodesic lines on X are simultaneously either straight lines or circles of the same length l. If in the latter case dim X > 1, then X has a two-fold universal covering space X whose geodesic lines are al1 circles of the same length 21, and they have the property that al1 geodesics through a point on X meet at the same point at length l. For the Riemannian case it is a Blaschke manifold ( Section G). If every two points on:7 can be joined by a unique geodesic line, then :7 is either a plane or a projective plane. (2) G-Spaces with Nonpositive Curvature. Let (PO,P"P2) be a triple of points on X which are contained in a small ball. X, by definition, is of nonpositive curvature if and only if for any such triple of points d(PI,Pi+I)?2d(p;,p;+I) for i=O, 1,2, (*) where P; is the midpoint of the unique segment joining Pi-I to Pi+l' If the above inequality is strict for any nondegenerate small triangle, then X is said to be of negative curvature. X of curvature 0 is defined in the same way. It should be noted that a G-space of curvature 0 is a local1y Minkowskian space [33]. The sectional curvature of a Riemannian manifold M is nonpositive if and only if (*) holds for all smal1 triangles on it. Because of d(po, p).s; d(po, p'd +d(P'I, p).s; {d(po, PI) + d(po, P2) }/2, the distance function to a point x E X of non- positive curvature is convex on a smal1 bal1 around x. X is cal1ed straight if and only if all nontrivial geodesic lines on X are straight lines. If X has nonpositive curvature, then the universal covering space X is straight. Moreover, for any two geodesic lines y, 0": R--> X, the function t-->d(O"(t), y(t)) is convex. In particular, the distance function to every fixed point on X is convex, and (*) holds for any 178 Ref. Geodesics triangle on X. Thus X is contractible. Every nontrivial element of the homotopy class of loops on X with base point XEX contains a geodesic loop at x which has the minimum length. This fact and the strict convexity of t-->d(O"(t), y(t)) for any two arc length para- metrized geodesic lines y, 0" on the universal covering X of X with negative CUrvature imply that if X is compact, then every Abelian sub- group of the fundamental group n l (X) of X is infmite cyclic. This is a generalization of the Preissmann theorem ( Section B). References [IJ A. D. Alexandrov, Die innere Geometrie der konvexen Fliichen, Akademie- Verlag, 1955. [2J 1. Cheeger and D. Ebin, Comparison theorems in Riemannian geometry, North- Hol1and, 1975. [3] D. Gromoll, W. Klingenberg, and W. Meyer, Riemannsche Geometrie in Grossen, Lecture notes in math. 55, Springer, 1968. [4J R. Bishop and B. O'Neill, Manifolds of negative curvature, Trans. Amer. Math. Soc., 145 (1969),1-49. [5J D. Gromoll and J. Wolf, Some re1atiC'ns between the metric structure and the algebraic structure of the fundamental group in mani- folds of non positive curvature, Bull. Amer. Math, Soc., 77 (1971),545-552. [6J B. Lawson and S. T. Yau, On compact manifolds of nonnegative curvature, J. Differ- ential Geometry, 7 (1972), 211-228. [7] P. Eberlein and B. O'Nei1l, Visibility mani- folds, Pacific J. Math., 46 (1973), 45-109. [8J N. WalIach, Homogeneous Riemannian manifolds of strictly positive curvature, Ann. Math., (2) 75 (1972), 277-295. [9] T. Sakai, Cut loci of Berger's sphere, Hok- kaido Math, J., 10 (1981),143-155. [10] J. Cheeger, Finiteness theorems for Riemannian manifolds, Amer. J. Math., 92 (1970),61-74. [IIJ A. Weinstein, On the homotopy type of positively pinched manifolds, Arch. Math., 18 (1967),523-524. [12J M. Gromov, Manifolds of negative curva- ture, J. Differential Geometry, 13 (1978), 223- 230. [13J M. Gromov, Almost flat manifolds, J. Differential Geometry, 13 (1978), 231-241. [14] D. Gromoll, Differenzierbare Struktur und Metriken positiver Krummung auf Sphiiren, Math. Ann., 164 (1966),353-371. [15J Y. Shikata, On the differentiable pinching problem, Osaka J. Math., 4 (1967), 279-287. [16J M. Sugimoto and K. Shiohama, On the 
179 A Geometric Construction differentiable pinching problem, Math. Ann., 195 (1971),1-16. [17J E. Ruh, Curvature and differentiable structures on spheres, Comment Math. Helv., 46 (1971), 127136. [18J K. Grove and K. Shiohama, A gen- eralized sphere theorem, Ann. Math., (2) 106 (1977),201-211. [19J J. Cheeger and D. Gromoll, The splitting theorem for manifolds of nonnegative Ricci curvature, 1. Differential Geometry, 6 (1971), 119-129. [20J R. Greene and H. Wu, Function theory on manifolds which possess a pole, Lecture notes in math. 699, Springer, 1979. [21J 1. Cheeger and D. GromoU, On the struc- ture of complete manifolds of nonnegative curvature, Ann. Math., (2) 96 (1972), 413-443. [22J H. 1m Hof and E. Heitze, Geometry of horospheres,1. Differential Geometry, 12 (1977),481-492. [23J D. Gromoll and W. Meyer, On complete open manifolds of positive curvature, Ann. Math., (2) 90 (1969), 75-90. [24J R. Greene and K. Shiohama, Convex functions on complete noncompact manifolds; Topological structures, Inventiones Math., 63 (1981),129-157. [25J W. Poor, Some results on nonnegatively curved manifolds, J. Differential Geometry, 9 (I 974), 583-600. [26J H. Wu, An elementary proof in the study of nonnegative curvature, Acta Math., 142 (1979),57-78. [27J R. Greene and H. Wu, Coo convex func- tions and manifolds of positive curvature, Acta Math., 137 (1976), 209-245. [28J A. Besse, Manifolds all of whose geo- desics are closed, Springer, 1978. [29J R. Michel, Probleme d'analyse geome- trique lies a la conjecture de Blaschke, Bull. Soc. Math. France, 101 (1973), 17-69. [30J A. Weinstein, On the volume of manifolds whose geodesics are aU closed, 1. Differential Geometry, 9 (1974), 513-517. [31J H. Nakagawa, A note on theorems of Bott and Samelson, 1. Math. Kyoto Univ., 7 (1967),205-220. [32J C. T. Yang, Odd-dimensional Wieder- sehen manifolds are spheres, J. Differential Geometry, 15 (1980), 91-96. [33J H. Busemann, The geometry of geodesics, Academic Press, 1955. [34J H. Busemann, Recent synthetic differen- tial geometry, Springer, 1970. [35J S. Cohn-Vossen, Kilrzeste Wege und Totalkrilmmung auf Fliichen, Compositio Math., 2 (1935), 63-133. [36J B. Krakus, Any 3-dimensional G-space is a manifold, Bull. Acad. Pol. Sci., 16 (1968), 737-740. 682 179 (VI.5) Geometric Construction A. General Remarks A geometric construction problem is a problem of drawing a figure satisfying given conditions by using certain prescribed tools only a finite number of times. If the problem is solvable, then it is called a possible construction problem; if it is unsolvable, even though there exist figures satisfying the given conditions, then it is an impossible construction problem. If there does not exist a figure satisfying the given conditions, then we say that the problem is inconsistent. Among problems of geometric construction, the oldest and the best known are those of constructing plane figures by means of ruler and compass. In this article, we call these problems simply problems of elementary geo- metric construction. The following are some of the more famous problems of this kind; the first four are possible construction problems. (1) Suppose that we are given three straight lines I, m, n and three points P, Q, R in a plane. Draw a triangle ABC in such a way that ver- tices A, B, C lie on I, m, n and sides BC, CA, AB pass through P, Q, R (Steiner's problem). (2) Suppose that we are given a circle a and three points P, Q, R not lying on O. Draw a triangle ABC inscribed in a in such a way that the sides BC, CA, AB pass through P, Q, R (Cramer-Castillon problem). (3) Draw a circle tangent to all of three given circles (Apollonius' problem). (4) Suppose that we are given a triangle. Draw three circles inside this triangle. in such a way that each is tangent to two sides of the triangle and any two of the circles are tangent to each other (Malfatti's problem). (5) Let n be a natural number. For the divi- sion of the circumference of a circle into n equal parts (consequently, the construction of a tregular n-gon) to be a possible construction problem, it is necessary and sufficient that the representation of n as a product of prime numbers take the form n=2Apl'" Pk> where ).? 0, PI' ..., Pk are all different prime numbers of the form 2 h + 1 (tFermat number) (c. F. Gauss, 1801). (6) The foIl owing are three famous impos- sible construction problems of Greek origin: (i) divide a given angle into three equal parts (trisection of an angle); (ii) construct a cube whose volume is double that of a given cube (duplication of a cube or the Delos problem); and (iii) construct a square whose area is that of a given circle (quadrature of a circle). P. L. Wantzel (1837) proved that problems (i) and 
683 (ii) are impossible except for the special cases of n/2, n/4, etc, for (i); and C. L. F, Lindemann (1882) proved the impossibility of (iii) while proving that the number n is ttranscendental. B. Conditions for Constructibility A problem of elementary geometric construc- tion amounts to a problem of determining a certain number of points by drawing straight lines that pass through given pairs of points, and circles having given points as centers and passing through given points, Let (ai, bd, (a 2 , b 2 ), ''', (an' b n ) be rectangular coordinates of given points, and let K be the smallest tnumber field containing the numbers ai' ,.., b n , Straight lines that join given pairs of points and circles that have given points as centers and that pass through given points are represented by equations of the first or second degree with coefficients belonging to K. Con- sequently, the coordinates of points of intersec- tion of these straight lines and circles be- long to a quadratic extension K' = K(Jd) of K. Let A be the set of coordinates of the points that are to be determined, Then the problem is solvable if and only if any number ex in A is contained in a field L=K(Jd:,jd;, ... , Jd:), where di+l EK(jd;,..., jd;) (i=0, 1,..., r-l), Thus L is a tnormal extension field of K whose degree over K is a power of 2. Using this theorem we can prove the impossibility of trisection of an angle and duplication of a cube, Since the 18th century, besides the problem of construction by ruler and compass, prob- lems of construction by ruler alone or by compass alone have also been studied, We state here some of the more notable results: (1) If by drawing a straight line we mean the process of finding two different points on that line, then we can solve all the problems of elementary geometric construction by means of compass alone (G, Mohr, L, Mascheroni). (2) If by drawing a circle we mean the process of finding its center and a point on its circum- ference, and if a circle and its center are given, then we can solve any problem of elementary geometric construction by means of ruler alone, (3) It is not possible to find the center of a given circle by ruler alone (D. Hilbert). (4) It is impossible to bisect a given segment by ruler alone, (5) When two intersecting circles or concentric circles are given, We can find the centers of these circles by ruler alone, When non intersecting and nonconcentric circles are given, it is not possible to find their centers by ruler alone (D, Cauer). Cases have been considered in which the radius of a circle we can draw by compass or 180 A Geometric Optics the length of a segment we can draw by ruler is required to satisfy certain conditions, Also, various considerations have been made con- cerning cases in which we can use tools other than ruler and compass, For example, it is known that although not all possible elemen- tary geometric construction problems are solvable by ruler alone, all these problems are possible if we have either a pair of parallel rulers, a square, or a triangle having a fixed acute angle, If we use a square and a compass, then the trisection of an angle and the dupli- cation of a cube are possible (L. Bieberbach). Also, when a conic section other than a circle is given, the trisection of an angle and the duplication of a cube become possible by ruler and compass (H. 1. S. Smith and H. Kortum). By ruler and ttransferrer of constant lengths, We can solve Malfatti's problem but not Apol- lonius' problem (Feldblum), Even when a problem is possible, the method of construction may be rather com- plicated and impractical. In these cases, vari- ous methods of highly accurate approximate construction have been investigated, References [lJ H. L. Lebesgue, Lecons sur les construc- tions geometriques, Gauthier-Villars, 1950. [2J L. Bieberbach, Theorie der geometrischen Konstruktionen, Birkhiiuser, 1952. 180 (XX.15) Geometric Optics A. General Remarks Geometric optics is a mathematical theory of light rays. It is not concerned with the prop- erties of light rays as waves (e.g" their wave- length and frequency), but studies their prop- erties as pencils of rays that follow three laws: the law of rectilinear propagation, the law of reflection (i,e., angles of incidence and re- flection on a smooth plane are equal (Euclid», and the law of refraction (i,e., if () and ()' are angles of incidence and refraction of a light ray refracted from a uniform medium to a second uniform medium and if n, n' are the refractive indices of the first and the second medium, respectively, then n sin () = n' sin ()' (R. W. Snell, Descartes)), These three laws follow from Fermat's principle, which states that the path of a light ray traveling from a point A' to A in a medium with refractive index n(P) at P is such that the integral S, n(P)ds attains its 
180 B Geometric Optics extremal value, where ds is the line element along the path. This line integral is called the optimal distance from A' to A. Therefore Fermat's principle can be taken as a foun- dation of geometric optics and is, in a way, similar to the tvariationa1 principle in particle dynamics (Maupertuis's principle), b f J 2h-2U(P)ds=0, which is satisfied by the path of a particle of unit mass having constant energy h passing through a field oftpotential U(P). The quan- tity J 2h - 2 U corresponds to the refractive index n, In an optical system, express the position of a point on the path of a light ray by ortho- gonal coordinate s (x, y, z), a nd define the tLa- grangian L=n J l +X2 + y2 (x = dxldz, y= dyldz), optical direction cosines p = aLia x, q = aLlay, a nd the tHam iltonian H = xp + yq- L = - J n 2 - p2 - q2. Then the tcanonica1 equations of the path are obtained as in parti- cle dynamics; x, y and p, q are called tcanon- ica1 variables, Due to the variational prin- ciple, the integral of the linear differential form pdx+qdy-H dz=w d along a light path is a function S(A', A) of the endpoints A', A of the path, and the optical direction cosines and the Hamiltonians of the system at A and A' are given by as , ----:::-;=-p, ox as ay' = -q', as =H' az' ' as ax =p, as oy =q, as -=-H az . Hence we obtain tHamilton-Jacobi differential equations (  ) 2 ( as ) 2 ( as ) 2 =n,2 a ' +0' + a ' ' x oy z G Y + G Y + G Y =n 2 . As a corollary to these relations we obtain Malus's theorem, which states that a pencil of light rays perpendicular to a common surface (localJy) at a given moment is also perpendic- ular to a common surface (locally) after an arbitrary number of reflections and refractions. Suppose that light rays travel from an ob- ject space into an image space through an optical apparatus. If all the rays starting from anyone point of the object space converge to a point of the image space and if the mapping given by this correspondence is tbijective, then we say that this imaging is perfect. Examples of perfect imaging systems are realized by optical apparatus such as Maxwell's fisheye (having refractive index n(r) = al(b + r 2 ), where 684 r denotes the distance from the center of the system) and Luneburg's lens (n(r) = J a - r 2 ), Perfect imaging conserves optical distance, yields the relation n(A')ds' = n(A)ds, and gives a tconformal mapping, with the magnification inversely proportional to the refractive index. B. Gauss Mappings Consider an optimal system with a symmetri- _ cal axis of rotation, its optical axis, A ray of light that is near the optical axis and has a small inclination to the axis is called a paraxial ray. A mapping realizable by paraxial rays where the canonical variables x, y, p, q can be considered to be infinitesimal variables whose squares are negligible, is called a Gauss map- ping (Gauss map). When the positions of an object point and its image under a Gauss mapping are represented by homogeneous co- ordinates, the mapping is represented as a linear transformation, i.e" a tcollineation, which maps a point to a point and a line to a line. A point in one space corresponding to the point at infinity in the other space is called a focus. If we take a focus as the origin of a coordinate system in each space and use the thomogeneous coordinates Xi such that x = XdX4' y=x2/x4' z=x3/x4, then a Gauss mapping can be represented as x 1= X'I' X 2 =x, x 3 = fx, f'x 4 =x;. The ratio of x to x', i,e., the lateral magnification, is xix' =zlf = f'lz', where x' is the length of an object ortho- gonal to the axis and x is the length of its image. The distance f between a focus and a point where the lateral magnification is 1 (such a point is called a principal point) is calJed focal length in each space. The telescopic mapping, i.e., Xl = X'l, X2 = x, X 3 = ax;, X 4 = bx, is also a Gauss mapping, in which the lateral magnification is constant. C. Aberration When a mapping is realized not only by par- axial rays but also by rays having larger incli- nations, a departure from the Gauss mapping arises. This departure is generally called aber- ration. Suppose that a light ray that passes through the point (x', y', z') of a plane perpen- dicular to the optical axis at a fixed z' and has optical direction cosines p', q' is transformed by the optical apparatus into a light ray that passes through the point (x, y, z) of a plane perpendicular to the optical axis at a fixed z and has optical direction cosines p, q there. Then by the variational principle, pdx+qdy- p' dx' -q' dy' = dW (dW is an texact differen- tial). Therefore the transformation (x', y', p', q') -->(x, y, p, q) is a tcanonical transformation. The 
685 mapping can be described by aw P=a-;' aw q=ay' , aw p = - ax' ' aw q'-- - ay' in terms of Wand can also be represented in terms of V = W + p' x' + q' y' or U = W + p' x' + q' y' - px - q y, For a given optical system, one of these functions W, U, V (called a char- acteristic function or eikonal) can be used to estimate the aberration. By developing such a characteristic function in power series of canonical variables and observing its terms of less than the fifth power, we can single out five kinds of aberration: spherical aberration, curvature of image field, distortion, coma, and astigmatism in a rotationally symmetric opti- cal system, To eliminate these aberrations an optical system must satisfy Abbe's sine con- dition (the elimination of spherical aberration and coma), Petzval's condition (the elimination of astigmatism and curvature of image field), and the tangent condition (the elimination of distortion), The path of a charged particle III an electro- magnetic field can be treated in the same way as the path of a light ray. Let F. represent the specific charge of the particle, h the energy, Ao the electrostatic potential, and Ax, Ay, Az vector po tentials, T hen the index of refrac- tion is J 2(h-F.Ao) + F.(Axdx/ds+ Aydy/ds+ Azdz/ds). In this case, the index of refraction shows the anisotropy caused by the existence of the magnetic field. The paths of paraxial rays are determined by a set of linear differ- ential equations of the second order, and the Gauss mapping is realized as in geometric optics. References [IJ C. Carathcodory, Geometrische Optik, Springer, 1937. [2J M. Herzberger, Modern geometrical op- tics, Interscience, 1958. [3J R. K. Luneburg, Mathematical theory of optics, Univ. of California Press, 1964, [4J 0, N, Stavroudis, The optics of rays, wave- fronts, and caustics, Academic Press, 1972. 181 (VI.1) Geometry The Greek word for geometry, which means measurement of the earth, was used by the 181 Geometry historian Herodotus, who wrote that in an- cient Egypt people used geometry to restore their land after the inundation of the Nile. Thus the theoretical use of figures for practical purposes goes back to pre-Greek antiquity. Tradition holds that Thales of Miletus knew some properties of congruent triangles and used them for indirect measurement, and that the Pythagoreans had the idea of systematiz- ing this knowledge by means of proofs ( 24 Ancient Mathematics; 187 Greek Mathemat- ics). tEuclid's Elements is an outgrowth of this idea [1]. In this work, we can see the entire mathematical knowledge of the time presented as a logical system. It includes a chapter (Book V) on the theory of quantity (Le., the theory of positive real numbers in present-day termi- nology) and chapters on the theory of integers (Books VII-IX), but for the most part, it treats figures in a plane or in space and presents number-theoretic facts in geometric language, Geometry in today's usage means the branch of mathematics dealing with spatial figures. In ancient Greece, however, all of mathematics was regarded as geometry. In later times, the French word geomi!tre or the German word Geometer was sometimes used as a synonym for mathematician, In a fragment of his Pensees, B. Pascal speaks of the esprit de geometrie as opposed to the esprit de finesse, The former means simply the mathematical way of thinking, Algebra was introduced into Europe from the Middle East toward the end of the Middle Ages and was further developed during the Renaissance. In the 17th and the 18th cen- turies, with the development of analysis, geom- etry achieved parity with algebra and analysis. As R. Descartes pointed out, however, fig- ures and numbers are closely related [2]. Geometric figures can be treated algebraically or analytically by means of tcoordinates (the method of analytic geometry, so named by S, F. Lacroix [3J); conversely, algebraic or ana- lytic facts can be expressed geometrically. Ana- lytic geometry was developed in the 18th cen- tury, especially by L. Euler [4J, who for the first time established a complete algebraic theory of tcurves of the second order. Previ- ously, these curves had been studied by Apol- lonius (262-200? B,C.) as tconic sections. The idea of Descartes was fundamental to the development of analysis in the 18th century, Toward the end of that century, analysis was again applied to geometry, For example, G. Monge's contribution [5J can be regarded as a forerunner of tdifferential geometry, However, we cannot say that the analytic method is always the best manner of dealing with geometric prohlems. The method of treat- ing figures directly without using coordinates 
181 Ref. Geometry is called synthetic (or pure) geometry. In this vein, a new field ca\1ed tprojective geometry was created by G. Desargues and B. Pascal in the 17th century. It was further developed in the 19th century by ].- V. Ponce1et, L. N. Car- not, and others. In the same century, ]. Steiner insisted on the importance of this field ( 343 Projective Geometry). On the other hand, the taxiom of para\1e1s in Euclid's Elements has been an object of criticism since ancient times. In the 19th cen- tury, by denying the a priori validity of Eu- clidean geometry, J. Bo1yai and N. 1. Loba- chevskii formulated non-Euclidean geometry, whose logical consistency was shown by models constructed in both Euclidean and projective geometry ( 285 Non-Euclidean Geometry). In analytic geometry, physical spaces and planes, as we know them, are represented as 3-dimensiona1 or 2-dimensiona1 Euclidean spaces E 3 , E 2 . It is easy to generalize these spaces to n-dimensional Euclidean space En. A "point" of En is an n-tuple of real numbers (Xl' ..., x n ). and the distance between two points (xl> ... , x n ), (Y1' ... , Yn) is «y 1 - X 1)2 + ... + (Yn _xn)2)1/2. The geometries of E 2 , E 3 are ca\1ed plane geometry and space (or solid) geometry, respectively. The geometry of En is called n- dimensional Euclidean geometry. We obtain n-dimensional projective or non-Euclidean geometries similarly. F. Klein [7J proposed systematizing aU these geometries in group- theoretic terms. He caUed a "space" a set S on which a group G operates and a "geometry" the study of properties of S invariant under the operations of G ( 137 Erlangen Program). B. Riemann [6J initiated another direction of geometric research when he investigated n- dimensional tmanifo1ds and, in particular, tRiemannian manifolds and their geometries. Some aspects of Riemannian geometry faU outside of geometry in the sense of Klein. It was a starting point for the broad field of modern differential geometry, that is, the geom- etry of tdifferentiab1e manifolds of various types ( 109 Differential Geometry). The reexamination of the system of axioms of Euclid's Elements led to D. Hilbert's tfoun- dations of geometry and to the axiomatic ten- dency of present-day mathematics. The study of algebraic curves, which started with the study of conic sections, developed into the theory of algebraic manifolds, the algebraic geometry that is now developing so rapidly ( 12 Algebraic Geometry). Another branch of geometry is topology, which has developed since the end of the 19th century. Its influence on the whole of mathematics today is con- siderable ( 114 Differential Topology; 426 Topology). Geometry has now permeated aU 686 branches of mathematics, and it is sometimes difficult to distinguish it from algebra or anal- ysis. The importance of geometric intuition, however, has not diminished from antiquity until today. References [IJ T. L. Heath, The thirteen books of Euclid's elements, Cambridge, 1908. [2J R. Descartes, Geometrie, Paris, 1637 (Oeuvres, IV, 1901). [3J S. F. Lacroix, Traite e1ementaire de trigo- nometrie rectiligne et spherique et d'applica- tion de l'a1gebre a la geometrie, Bachelier, 1798-1799. [4JL. Euler, Introductio in ana1ysin infini- torum, Lausanne, 1748 (Opera omnia VIII, IX, 1922); French translation, Introduction a I'analyse infinitesima1e I, II, Paris, 1835; Ger- man translation, Ein1eitung in die Analysis des Unendlichen, Springer, 1885. [5J G. Monge, Application de l'ana1yse a la geometrie, fourth edition, Paris, 1809. [6J B. Riemann, Dber die Hypothese, we1che der Geometrie zu Grunde liegen, Habilita- tionsschrift, 1854 (Gesammelte mathematische Werke, Teubner, 1876,254-269; Dover, 1953). [7J F. Klein, Verg1eichende Betrachtungen liber neuere geometrische Forschungen, Das Erlanger Programm, 1872 (Gesamme1te math- ematicshe Abhandlungen I, Springer, 1921, 460-497). 182 (V.10) Geometry of Numbers A. History H. Minkowski introduced the notions of lat- tice and convex set in the ta1gebraic theory of numbers. He developed a simple yet powerful' method of arithmetic investigation using these geometric notions to simplify the analytic theory of tDiophantine approximation, which had been developed by P. G. L. Dirichlet and C. Hermite. His theory, the geometry of num- bers, has continued its development and con- tributed to various fields of mathematics ( 83 Continued Fractions). B. Lattices Let En be an n-dimensional Euclidean space identified with the linear space R n . For a point 
687 P in En, we denote the corresponding vector in Rn by v(P)='(x I , ...,x n ). A subset A of En is called an n-dimensional (homogeneous) lattice if there exists a basis {VI"'" v n } of R n such that A= {PEE I v(P)= L71 AiViJiEZ}. The set of points {X h ..., Xn} such that V(X;) = Vi (i = 1, "', n) is called a basis of the lattice A. A typical example of a lattice is the point set corresponding to zn in R n . The tfree module generated by Vi (i = 1, ..., n) is denoted by A * and is called the lattice group of A. We have A * ={vERnlv=v(P), PEA}. If {u l , .."un} is an- other basis of the free module A *, then there exists an element (ex i ) of GL(n, Z) (i.e., exijEZ and Idet(exi)1 = 1) such that u j = L71 exijv j , Hence the quantity Idet(v h ...,vn)1 is inde- pendent of the choice of the basis {v I, ... , v n }, We denote this quantity by d(A) and call it the determinant of the lattice. We denote the mini- mum distance between the points belonging to A by b(A). A subset L of the space En is called an in- homogeneous lattice if there exists a homo- geneous lattice A in En and a point Po in En such that L={PEEnlv(p)-v(Po)EA*}. Thus an inhomogeneous lattice is obtained from a homogeneous lattice by translation, In this article we restrict ourselves to the case of homogeneous lattices and henceforth omit the adjective "homogeneous." Suppose we are given a sequence of lattices AI' A 2 , "', in En with bases {XP)}, {X2)}, .... If the sequence of points X!V) converges to Xi (i= 1, ...,n) and the set {XI' ...,X n } forms a basis of a lattice A, we call A the limit of the sequence {Av}; we also say that the se- quence {Av} converges to the lattice A. In this case we have d(A,)-->d(A), b(Av)-->b(A). The notion of convergence of lattices gives rise to a topology of the space Mo of all the lattices in E. A sequence {A,} of lattices is said to be bounded if there exist positive numbers c and c' such that d(A,).s; c, b(A,)? c' for all v, A bounded sequence of lattices has a convergent subsequence. Let S be a subset of the space P. A lattice A is called S-admissible if we have Ansi = {a}, where Si is the interior of S and a is the origin, We denote the set of S-admissible lattices by A(S), Given a closed subset M of Mo, we put (S\M)=infAEA(s)nMd(A) if A(S)nM is non- empty, while if A (S) n M is empty, we put (S\M)= 00. When M =Mo, we write (S\M) = A(S) and call it the critical determinant of S. Generally, a lattice A in M is said to be critical in M with respect to S if AEA(S) and d(A)= (S\M). Suppose that we have O<(S\M)< 00. Then for a lattice critical in M with respect to S to exist it is necessary and sufficien t that there exist a bounded sequence {Av} such that AvEMnA(S) and d(Av)-->(S\M), 182D Geometry of Numbers C. Successive Minima and Minkowski's Theorem A subset S of the space En is called a bounded star body (symmetric with respect to the origin) if there exists a continuous function F defined on the space E satisfying the following four conditions: (i) F(O) ==0; (ii) if X #0, then F(X) > 0; (iii) for an arbitrary real number t and a point X, we have F(tX)=ltlF(X); (iv) S= {X I F(X).s; I}. A bounded closed tcon- vex body that is symmetric with respect to the origin is a bounded star body, If we are given a star body S, the associated function F, and a lattice A, there exist a set of points {PI' "" Pn} in A and a set of positive numbers {PI, "', Pn} satisfying the following four conditons: (1) v(P I ), "', v(Pn) are linearly independent; (2) F(P;)=pdi=I,...,n);(3)PI.s;... .s;Pn; (4)ifPis a point in A and v(P) is not contained in the subspace spanned by {v(Pd, "', v(Pm-d}, then F(P)? Pm' The set {Ph"" Pn} is uniquely determined by Sand A. The numbers Pi are called the successive minima of S in A; the points Pi are the successive minimum points of S in A. Minkowski's theorem: Let A be a lattice in a Euclidean space En and S a bounded subset of En. Then we have the foIlowing: (I) If the volume V(S) is larger than d(A), then there exist points XI and X 2 in S such that XI # X 2 and v(Xd - V(X 2 )E A *. Suppose, moreover, that S is convex and symmetric with respect to the origin. Then, if V(S) > 2 n d(A), there exists a point X in S n A different from the origin. Hence we have 2n(s)? V(S) (n= dim En), (II) Let S be a bounded closed convex body that is symmetric with respect to the origin, and let PI, .,. ,Pn be the successive minima of S in A. Then we have PI .,. Pn' V(S).s; 2 n d (A). D. Minkowski-Hlawka Theorem Suppose that we are given a subset S of the n- dimensional Euclidean space En such that the characteristic function x(X) of S is tintegrable in the sense of Riemann. Then we have the Minkowski-Hlawka theorem: (i) If n? 2 and S is open, then (S).s; V(S), and (ii) if, moreover, S is symmetric with respect to the origin, then 2(S).s; V(S); (iii) if S is a symmetric star body with respect to the origin, then 2((n)(S).s; V(S), where ((n) is the tRiemann zeta function, A prooffor the theorem was given by E. Hlawka (1944); (iii) and (iv) were conjectured by Minkowski. C. L. Siegel obtained another proof (1945), and C. A. Rogers simplified the original proof by Hlawka (1947). There are 
182 E Geometry of Numbers results concerning the estimation of ,1(S)/V(S) for various subsets S. E. Siegel's Mean Value Theorem In an attempt to obtain a proof for the latter half of the Minkowski-Hlawka theorem, Min- kowski observed the necessity of establishing the arithmetic theory of the linear transfor- mation groups. Siegel was inspired by this ob- servation and obtained the following theorem, Siegel's mean value theorem, which implies the Minkowski-H1awka theorem: Let F be a tfun- damental region of the group SL(n, R) with respect to a discrete subgroup SL(n, Z). Let w be the tin variant measure on SL(n, R) such that J pdw = 1 ( 225 Invariant Measures). Let f be a bounded Riemann integrable function with compact tsupport defined on the space R n . Note that the lattice zn is stabilized by the subgroup SL(n, Z). We have fpx/(9'X)dW(9)= I/(X)dX, ,,'0 where the right-hand side of the equation is the usual Riemann integral of the function f A. Wei1 considered this theorem in a more general setting (Summa Brasil. Math., 1 (1946)). F. Diophantine Approximation Minkowski initiated the notion of Diophan- tine approximation in reference to the prob- lem of estimating the absolute value If(x)1 of a given function f, where x varies in Z or in a given ring of ta1gebraic integers. (A tDiophan- tine equation is an equation f(x)=O, where x varies in Z.) Today Diophantine approxima- tion (in the wide sense) refers to the investiga- tion of the scheme of values f(x), where x varies in a suitable ring of algebraic integers. The geometry of lattices is a powerful tool in this investigation. A typical problem in this field of study is that of approximating irra- tional numbers by rational numbers; here tcontinued fractions play an important role ( Section G). For the problem of uniform distri- bution considered by H. Weyl, the analytic method, especially that of trigonometric series, is useful ( Section H). Dirichlet's drawer principle (to put n objects in m drawers with n > m, it is necessary to put more than one object in at least one drawer) is one of the basic principles used in the theory of Diophan- tine approximation. Recently, the theory has been applied to the theory of ttranscendental numbers and the theory of tDiophantine equations. 688 G. Approximation of Irrational Numbers by Rational Numbers Given an irrational number e, we have the problem of finding rational integers x (> 0) and y such that 10- y/xl <six, where s is a given positive number. Suppose that we are given a positive integer N. Using Dirichlet's drawer principle we can show the existence of x (.s; N) and y such that le- y/xl < l/xN. Let M (e) be the supremum of positive numbers M such that the inequality [e- y/x[ < I/Mx2 holds for infinitely many pairs of integers x, y. We have 1.s; M(e) (.s; 00). Two irrational num- bers e and e' are said to be equivalent if there exists an element (a;)E GL(2, Z) such that e' = (all A + a 12 )/(a 21 e + a 22 ). In this case we have M(O)=M'(e). If the irrational number e satisfies the qua- dratic equation ae 2 +bO+c=0 (a,b,c are rational integers), then we have M(e)= k- I Jb 2 -4ac , where k=min{ax2 +bxy+ cy 2 Ix,YEZ, x#O, y#O}.ln general, for an irrational number 0 of degree two, we have M(O)?fi. The equality M(O)=fi holds if 0 is equivalent to e l = (1 + fi)/2. If () is not equivalent to e l , then M(e);;.J8; the equality holds if e is equivalent to O 2 = 1 + fi. Simi- larly, we have e 3 , 0 4 , ...; and M(On)-->3 (n-->oo). If M (e) < 3, there exists a en such that e is equivalent to en. The set of irrational numbers e satisfying M(e)=3 is uncountable (A. A. Markov [13J). We have no information about M(e) for the general algebraic irrational num- ber e. Let J1(0) be the supremum of real num- bers J1 such that the inequality 1 e - y/ xl < l/x holds for infinitely many pairs of integers x, y. Given a number K> 2, it can be shown that the tLebesgue measure of the set of real numbers e such that J1(0)? K is zero. If e is a real alge- braic number of degree n, then J1(0).s; n (J. Liouville). Concerning J1(e), results have been obtained by A. Thue, Siegel, A. O. Gel'fond, and F. J. Dyson. K. F. Roth (1954) proved that J1(0)=2 (Roth's theorem [12J), which settled the problem of J1((}). Roth's theorem means that if K is larger than 2, then there exist only a finite number of pairs x, y satisfying 10- y/xl < l/x". This can be generalized to the case of the approximation of an element e that is algebraic over an A-field k by an element of the field k (S. Lang [9]). (An A-field is either an algebraic number field of finite degree or an algebraic function field in one variable over a finite constant field.) In 1970 W. M. Schmidt [20] obtained theorems on simultaneous approximation which generalize Roth's theorem. Thus, if ex l ,..., exn are real algebraic numbers such that 1, ex l , ..., exn are linearly independent over the 
689 field of rational numbers, then for every B> 0 there are only finitely many positive integers q with Ilq ex lll...ffqex n llq1+'<I, where II  II denotes the distance from a real number  to the nearest integer; in particular, we have lex; _ pdq I <q-(n+I)!n-" i = 1, .." n, for only finitely many n-tuples of rationals PIJq,... ,P.Jq. A dual to this result is as foIlows. Let lXI' ..., IXn, B be as before. Then there are only finitely many n-tuples of nonzero integers ql, ...,qn with IlqlC(1 +...+qnexnll'lql ...qnl1+'<1. This last theorem can be used to prove that if ex is an algebraic number, k a positive integer, and E > 0, then there are only finitely many algebraic numbers w of degree at most k such that lex- wi < H(W)-k-h, where H(w) denotes the height of w. See also [16,22]. The work of Thue, Siegel, and RotH had the basic limitation of noneffectiveness, A, Baker (1968) succeeded in proving that for any alge- braic number 0 of degree n? 3 and any K> /1, there exists an effectively computable number c=c(O, K»O such that 10- yJxl > cx-nexp(logx)1 r for all integers x, y (x> 0) [15]. This result is an immediate consequence of the foIlowing effective version of a classical theorem on binary Diophantine equations (Thue, 1909): Let f = f(x, y) be an irreducible binary form of degree n? 3 with integer coeffi- cients, and suppose that K> n, Then for any positive integer m, all integer solutions x, y of the equationf(x,y)=m satisfy max(lxl, Iy\)< cexp(logm)", where c>O is an effectively com- putable number depending on n, K, and the coefficients of f Baker obtained this result by making use of his theorems which give effec- tive estimates of moduli of linear forms in the logarithms of algebraic numbers with alge- braic coefficients. A typical theorem reads as fol\ows: Let ex l , .." exn be nonzero algebraic numbers with logex l , ..., logexn linearly inde- pendent over the rationals, and let fJo, .., , fJn be algebraic numbers, not all 0, with degrees and heights at most d and H, respectively. Then for any K>n+ 1, we have IfJo+ fJllogexl +... + fJn log exnl >cexp( -(logH)'), where c >0 is an effectively computable number depending only on /1, K, logexl, ..., logiX n , and d [14, 19]. Results of this kind have many important applications in number theory. For instance, we obtain a generalization of the Gel'fond- Schneider theorem on transcendental num- bers. Furthermore, the imaginary quadratic number fields of class number 1 can be com- 182H Geometry of Numbers pletely determined on the basis of Baker's result. This was actually done by Baker (1966) and independently by H. M. Stark (1966) ( 347 Quadratic Fields), Refinements and generalizations of Thue's theorem on the finiteness of solutions of bi- nary Diophantine equations have been ob- tained by Baker and his collaborators. ( [17J, and also [16J). Also, P- and p-adic ana- logs of Baker's results are known [18]. There are a number of unsettled problems on the irrationality of particular numbers, such as the tEuler constant C, n e , or '(2k + 1) with k a positive integer. R. Apery (1978) proved that there exist a positive number E and a sequence of positive integers {qn} such that 0 < II qn W)II < e -,n for all n? 1, so that W) is irrational. H. Uniform Distribution Let 0 be a real number, x a positive integer, and [OxJ the maximum integer not larger than Ox. We write (Ox)==Ox-[OxJ=:::Ox(mod 1). Jacobi showed that if 0 is irrational, then the set {(Ox)lxEN} is densely distributed in the interval (0, 1) (N is the set of positive in- tegers). In general, let f be a real-valued func- tion defined on N. We say that f(x) (mod 1) is uniformly distributed in the unit interval, or f(x) is uniformly distributed (mod 1), if the foIlowing condition is satisfied: Let ex, fJ be an arbitrary pair of real numbers such that O.s; ex < fJ.s; 1, and let N be a given positive integer. Let T(N) be the number of positive integers x such that x.s;N, ex.s;(j(x») < fJ, where (/(x)) = f(x)-[f(x)]. ThenIimNoo T(N)JN=fJ-ex. In order for f(x) (mod 1) to be uniformly dis- tributed, it is necessary and sufficient that IimNoo N- I LZI e2nihf(x) ==0 for any non- zero integer h (Weyl's criterion, 1914), Weyl proved that if 0 is an irrational number, then Ox (mod 1) is uniformly distributed, The following theorem, given by J. G, van der Corput, is often useful: Let f be a real- valued function defined on N. Consider the function fh(X) = f(x+h)- f(x) for an arbitrary positive integer h, If fh(X) (mod 1) is uniformly distributed (mod 1) for all such h, then f(x) (mod 1) is also uniformly distributed (mod 1). Utilizing this theorem, it can be shown that if f(x)== OrX r + Or-I x r - I +... + 0 1 x, where at least one of the coefficients Oi is irrational, then f(x) (mod 1) is uniformly distributed. The notion of uniform distribution of se- quences of real numbers has an analog in compact Hausdorff spaces and in various top- ological groups, A systemmatic treatment of such generalized notions of uniform distri- bution can be found in [21]. 
182 Ref. Geometry of Numbers References [IJ H. Minkowski, Geometrie der Zahlen, Teubner, 1910 (Chelsea, 1953). [2J H. Minkowski, Diophantische Approxi- mationen, Teubner, second edition, 1927 (Chelsea, 1957). [3J H. Minkowski, Gesammelte Abhandlun- gen I, II, Teubner, 1911 (Chelsea, 1967). [4J 1. F, Koksma, Diophantische Approxima- tionen, Erg. Math., Springer, 1936 (Chelsea, 1950). [5J H. Weyl, C. L. Siegel, and K. Mahler, Geometry of numbers, Mimeographed notes, Princeton, 1950. [6J 1. W. S. Cassels, An introduction to Dio- phantine approximation, Cambridge Univ. Press, 1957. [7J J. W. S. Cassels, An introduction to the geometry of numbers, Springer, 1959. [8J T. Schneider, Einfiihrung in die trans- zendentalen Zahlen, Springer, 1957. [9J S. Lang, Diophantine geometry, Inter- science, 1962. [lOJ C. L. Siegel, Dber einige Anwendungen diophantischer Approximationen, Abh. Preuss. Akad. Wiss., no. 1 (1929) (Gesammelte Ab- handlungen, Springer, 1966, vol. 1,209-266). [11J C. L. Siegel, A mean value theorem in geometry of numbers, Ann. Math., (2) 46 (1945),340-347 (Gesammelte Abhandlungen, Springer, 1966, vol. 3, 39-46). [12J K. F. Roth, Rational approximations to algebraic numbers, Mathematika, 2 (1955), 1- 20. [13J A. A. Markoff(A. A. Markov), Sur les formes quadratiques binaires ind6finies, Math. Ann., 15 (1879),381-407. [14J A. Baker, Linear forms in the logarithms of algebraic numbers I, II, III, IV, Mathe- matika, 13 (1966), 204-216; 14 (1967),102- 107; 14 (1967),220-228; 15 (1968), 204-216. [15J A. Baker, Contributions to the theory of Diophantine equations I, II, Philos. Trans. Roy. Soc. London, A 263 (1967-1968),173- 191,193-208. [16J A. Baker, Transcendental number theory, Cambridge Univ. Press, 1975. [17J A. Baker and J. Coates, Integer points on curves of genus 1, Proc. Cambridge Philos. Soc., 67 (1970), 595-602. [18J A. Brumer, On the units of algebraic number fields, Mathematika, 14 (1967), 121- 124. [19J N. I. Fel'dman, Linear form of logarith- mic algebraic numbers (in Russian), Dok\. Akad. Nauk SSSR, 182 (1968),1278-1279. [20J W. M. Schmidt, Simultaneous approxi- mation to algebraic numbers by rationals, Acta Math., 125 (1970),189-201. 690 [21J L. Kuipers and H. Niederreiter, Uniform distribution of sequences, Interscience, 1974. [22J W. M. Schmidt, Diophantine approxi- mation, Lecture notes in math. 785, Springer, 1980. [23J W. M. Schmidt, Approximation to alge- braic numbers, Enseignement Math., 17 (1971), 188-253. [24J C. F. Osgood (ed.), Diophantine approxi- mation and its applications, Academic Press, 1973. 183 (VII.23) Global Analysis Mathematics that treats, by using functional analytical techniques, various problems con- cerning the tcalculus of variations, tsingular- ities, infinite-dimensional Lie groups, or tnon- linear partial differential equations, such as equations of fluids or of gravitation in general relativity, may be called global analysis if it uses as a main tool an infinite-dimensional version of differential geometry and topology analogous to that for finite-dimensional mani- folds. The term global analysis therefore has no precise definition. However, it can be said that it is analysis on manifolds, and the concept of tinfinite-dimensional manifolds is the central abstract idea in it. Suppose one considers a nonlinear differen- tial operator on a finite-dimensional manifold. Then by using functional analytical techniques it often happens that its domain is neither a linear space nor its open subset but an infinite- dimensional manifold, and that such a non- linear operator can be regarded as a tdifferenti- able mapping between infinite-dimensional manifolds. The tdifferential at a point in that source manifold is called a linearized operator, to which one can apply various theories of linear functional analysis. "Global analysis" seems to have first ap- peared in the literature in the late 1960s [1,2]. However, the phrase "infinite-dimensional manifold" has been used widely since about 1960. By that early date, local theories of infinite-dimensional manifolds, sometimes called "general analysis," such as definitions of differentiability, the timplicit function theorem, tTaylor's theorem, the existence and unique- ness of tordinary differential equations, and the tFrobenius theorem, had already been established in tBanach spaces [3]. Therefore, with these regarded as a local theory, the con- cept of infinite-dimensional manifolds could be defined naturally, and the concept of infinite- 
691 dimensional Lie groups as well [4]. A few years later, R. Palais and S, Smale [5J and 1. Ee\1s and J. Sampson [6J showed that such concepts are useful in the calculus of varia- tions, and R. Abraham and 1. Robbin [7J re- marked that ttransversality theorems initiated by R. Thom can be easily proved by using an infinite-dimensional version of tSard's theorem [8]. The so-called tAtiyah-Singer index theorem [9], announced in 1963, gave impetus to the field as people sought the theorem's most natural expression; the work fina\1y resulted in the theorem classifying separable tHilbert manifolds by homotopy type ( 105 Differen- tiable Manifolds), After the appearance of these theorems an- nouncing the nonexistence of differential topol- ogy on separable infinite-dimensional Hilbert manifolds, global analysis moved toward more concrete problems and applications to various branches of mathematics. However, many of these applications are formulated not on tBanach or Hilbert manifolds, but on mani- folds modeled on Frechet spaces (tFrechet manifolds) or on tnuclear spaces. For such situations we have in general no local theories. Neither the implicit function theorem nor the Frobenius theorem holds on such manifolds. However, since these theorems are crucial for nonlinear problems, various kinds of sufficient conditions for the validity of these theorems are being studied by many people ( 286 Nonlinear Functional Analysis). As for the calculus of variations, Yamabe's problem is being studied extensively, for this seems to be a typical problem not satisfying the so-called tCondition C. The original paper of H. Yamabe [13J, insisting that every com- pact Riemannian manifold can be conformally deformed into a manifold of constant scalar curvature, contains a serious gap, and the problem is stilI open, though many cases are known where the statement holds ( 364 Riemannian Manifolds H). The harmonic mappings defined in [6J are also being studied extensively ( 195 Harmonic Mappings). In differential geometry or the tgeneral theory of relativity definitions of various cur- vatures, such as Riemannian, Ricci or scalar, or Gauss, can be sometimes regarded as non- linear differential equations on manifolds. Proof of the global existence of solutions to these equations has long been sought, and several theorems have recently appeared [14- 16]. Infmite-dimensional groups such as GL(E), or GLc(E) with uniform topology, are called tBanach-Lie groups, and they have been studied in the operator calculus. On the other 183 Ref. Global Analysis hand, infinite groups studied by S, Lie and E. Cartan were in fact infinite-dimensional germs of transformation groups defined on a neighborhood of a point in a manifold. These were not groups in the strict sense. Recently, H. Omori [17J has given a definition of ab- stract infinite-dimensional Lie groups that in- cludes Banach-Lie groups and many infinite- dimensional transformation groups studied by Cartan. An application of these groups to tfluid dynamics can be found in [18]. More- over, tunitary representation theories of these groups are now being constructed [19,20]. Though global analysis consists at pre- sent of a rather disorganized combination of many nonlinear problems in analysis on mani- folds and in mathematical physics, it is never- theless one of the most active branches of mathematics, References [IJ J. Eells, A setting for global analysis, Bu\1. Amer. Math, Soc., 72 (1966) 751-807, [2J R. Palais, Foundations of global nonlinear analysis, Benjamin, 1968. [3J 1. Dieudonne, Foundations of modern analysis, Academic Press, 1960. [4J G, Birkhoff, Analytic groups, Trans, Amer, Math. Soc" 43 (1938), 61-101. [5J R. Palais and S. Smale, A generalized Morse theory, Bull. Amer. Math, Soc" 70 (1964), 165-172. [6J J. Eells and 1. Sampson, Harmonic map- pings of Riemannian manifolds, Amer. 1. Math., 86 (1964),109-160. [7J R. Abraham and J. Robbin, Transversal mappings and flows, Benjamin, 1967. [8J S. Smale, An infinite-dimensional version of Sard's theorem, Amer. J. Math" 87 (J 965), 861-866. 9J M. Atiyah and I. Singer, The index of e\1iptic operators on compact manifolds, Bull. Amer. Math. Soc., 69 (1963), 422433. [IOJ N. H. Kuiper, The homotopy type of the unitary group of Hilbert space, Topology, 3 (1965),19-30. [l1J D. Burghelea and N. H. Kuiper, Hilbert manifolds, Ann, Math" (2) 90 (I 969), 335-352. [12J 1. Eells and D, Elworthy, Open embed- dings of certain Banach manifolds, Ann, Math" (2) 91 (1970),465-485. [13J H. Yamabe, On deformation of Rieman- nian structures on compact manifolds, Osaka Math. 1.,12 (1960), 21-37, [14J T. Aubin, Equations differentielles non lineaires et probleme de Yamabe concernant la courbure scalaire, 1. Math. Pures Appl., 55 (1976),269-296. 
184 Ref. Godel, Kurt [15J H. Gluck, The generalized Minkowski problem in differential geometry in the large, Ann. Math., 96 (1972), 245-276. [16J S. Yau, On the Ricci curvature of a compact Kahler manifold and the complex Monge-Ampere equation I, Comm. Pure Appl. Math., 31 (1978),339-411. [17] H. Omori, Infinite-dimensional Lie trans- formation groups, Lecture notes in math, 427, Springer, 1974. [18] D, Ebin and 1. Marsden, Groups of dif- feomorphisms and the motion of an incom- pressible fluid, Ann. Math., (2) 94 (1970), 101- 162. [19J A. M. Vershik, I. M. GeI'fand, and M, I. Graev, Representations of the group of diffeo- morphisms, Russian Math, Surveys, 30 (6) (1975),1-50. (Original in Russian, 1975.) [20J R. S. Ismagilov, Unitary representations of the group of diffeomorphisms of the space Rn, n? 2, Functional. Anal. Appl., 9 (1975), 144-155. (Original in Russian, 1975.) 184 (XXI.27) Godel, Kurt Kurt G6del (April 28, 1906-January 14, 1978) was born in Brno, Czechoslovakia (at that time Brunn, Austria-Hungary). He studied mathematics and physics at the University of Vienna, where he took the Ph.D. degree in 1930. After he had taught mathematics at the University of Vienna from 1933 to 1938, he was invited to the Institute for Advanced Study at Princeton, where he became professor in 1953, In 1976, he was named professor emeritus; he died in Princeton in 1978. G6del contributed important fundamental results covering all aspects of mathematical logic. Among his famous works are the proof of the tcompleteness of the first -order predi- cate calculus, the incompleteness of the con- sistent axiomatic system containing Peano's arithmetic (incompleteness theorem), and the tconsistency of the axiom of choice and the generalized continuum hypothesis. He also introduced the notion of trecursive functions and found the G6del solution of Einstein's equations of relativity. In addition to mathe- matical works, he left philosophical papers on set theory and the foundations of mathematics. Reference [IJ K, G6del, The consistency of the axiom of choice and of the generalized continuum hypothesis, Princeton Univ, Press, 1940, 692 185 (1.8) Godel Numbers A. General Remarks K. G6del [IJ devised the following method to prove his incompleteness theorems ( Section C). Let 6 be a tformal system, In this article, we call its basic symbols, tterms, tformulas, and formal proofs the "constituents" of 6, Let 9 be an tinjection from the constituents of 6 into the natural numbers satisfying the following two conditions: (1) Given a constituent C, we can compute the value g(C) in a finite number of steps, (2) Given a natural number n, there exists a finitary procedure to find out whether there exists a constituent C of 6 such that g(C) = n; furthermore, when such a C exists, it can actually be specified in a finite number of steps. If such a mapping 9 is given for the system 6, then the mapping 9 is calIed a Godel num- bering and the number g(C) is called the Godel number of the constituent C (with respect to g), B. An Example of GOdel Numbers (1) Let ex o , ex l , '" be the basic symbols of 6, With each ex i we associate a distinct odd num- ber qi: g(ex;)=qi (i=O, 1,...), (2) Let F be a con- stituent of 6, If F is constructed from any other constituents Fo, FI' '" , Fk of S by a rule peculiar to 6 (for convenience we write this F=(Fo,FI' .."Fd), and if, for each Fi' g(F,) is already defined, then we put g(F) = (g(Fo), g(Fd, ..., g(F k ), where (a o , ai' ..., ak) denotes the number Po pi 1 '" p%k (Pi is the (i+ l)st prime number). For example, suppose that 6 contains 0, =, v j (variables), and 1 (negation) among the basic symbols, and let their G6del numbers be 7, 9, 1I j +1, and 13, respectively. Since the formula 1 (0 = v) can be analyzed in the form (I, (0, =, v)), its G6del number is (13, (7, 9,1 WI». For details  [1,4]. C. GodeI's Incompleteness Theorems By means of a G6del numbering any meta- mathematical notion about the constituents of a formal system 6 can be interpreted as a number-theoretic notion. For example, the notion "formula" is interpreted as the number- theoretic predicate Form(x) which means that x is the G6del number of a formula. The provability of a formula is interpreted as the number-theoretic predicate Prov(x), which means that x is the G6del number of a prov- 
693 able formula; accordingly, for any formula A of 6, the proposition Prov(g(A)) means that A is provable. This interpretation is called the arithmetization of metamathematics. Let the formal system 6 include formal elementary number theory. Then many meta- mathematically useful number-theoretic predi- cates can be expressed by the respective for- mulas of 6; for example, there exist formulas Form(x) and Prov(x) expressing the predicates Form(x) and Prov(x), respectively. Further- more, Godel proved the existence of a closed formula U such that the formula IProv(g(U))<-> U is provable. This closed formula U is one of the so-called formally undecidable propositions, and in fact it is shown that neither U nor I U is provable in 6 if 6 is consistent in a strong sense, This result is called Godel's first incompleteness theorem. By use of the formulas Form(x) and Prov(x) the consistency of the formal system 6 is expressed by the formula Consis which is an abbreviation of3x(Form(x)/\ IProv(x)). Godel obtained the result that the formula Consis is not provable in 6 if 6 is consistent, on the basis of the following three facts: (1) U is not provable in 6 if 6 is consistent; (2) Consis--> IProv(g(U)) is provable in 6; (3) IProv(g(U))-->U is not provable in 6. This result is called Godel's second incompleteness theorem. The method of arithmetization is important and useful in the study of mathematical logic. The notion of Godel numbers of recursive functions is one of its applications ( 356 Recursive Functions). D. Tarski's Theorem Concerning Truth Definitions Let a consistent formal system 6 and a tmodel of 6 be given. By means of a Godel number- ing, the truth notion of closed formula is inter- preted as the number-theoretic predicate Tr(x) which means that x is the Godel number of a true closed formula; accordingly, for any closed formula A of 6, the proposition Tr(g(A)) means that A is true. In relation to the foregoing fact, if there exists a formula Tr(x) of a single variable and the formula Tr(g(A))<->A is provable for every closed formula A, then that formula Tr(x) is called a truth definition, A, Tarski proved the fact that there is no truth definition in 6 if the formal system 6 is consistent. References [IJ K. Godel, Uber formal unentscheidbare Siitze der Principia Mathematica und ver- 186 B Graph Theory wandter Systeme I, Monatsh. Math. Phys., 38 (1931), 173-198; English translation, On form- ally undecidable propositions of Principia mathematica and related systems, Oliver & Boyd, 1962. [2] A. Tarski, Der Wahrheitsbegriffin den formalisierten Sprachen, Studia Philosophica, 1 (1936),261-405, translated from the Polish original 1933; English translation, The concept of truth in formalized languages, logic, seman- tics, metamathematics, Oxford Univ. Press, 1956. [3J 1. B. Rosser, Extentions of some theorems of Godel and Church, J. Symbolic Logic, 1 (1936), 87-9\. [4J S. C. Kleene, Introduction to metamath- ematics, North-Holland and Noordhoff, 1952. [5J S. C. Kleene, Mathematical logic, Wiley, 1967. [6J 1. R. Shoenfield, Mathematical logic, Addison-Wesley, 1967. 186 (XVI.12) Graph Theory A. Overview of Graph Theory Two aspects of graphs are the object of graph theory, One is that a graph expresses a binary relation over a set V, and the other is the fact that a graph is a CW-complex of I dimen- sion, an object of study in algebraic topology. Because of its special natural as an object of I dimension, we can consider various concrete properties in detail. Hence graph theory has close connection with other areas, such as network theory, system theory, automata theory, and the theory of computational pro- cesses, and it has many useful applications. The notion of a "graph" as currently used in graph theory was first discussed by L. Euler [1]. It is said that J. 1. Sylvester coined the word "graph" as we presently understand it. However, up to now, there were few unifying principles, and graph theory seemed like a large collection of miscellaneous problems and ad hoc techniques. The terminology has not yet been standardized; different usages prevail in different schools. The resulting confusion can be seen in references books such as [2J- [5]. Recently, infinite graphs have also been studied. But here we restrict ourselves to finite graphs, since only they give a typical theory. B. Definition of Graph The notion of a graph G=(V, E, a+, (r) is a composite notion of two finite sets V and E 
186 C Graph Theory and two maps a+ : E --> V and a- : E --> V. An element of V is called a vertex, and an element of E is called an edge. The maps a I are called incidence relations. The terms point, or node instead of vertex, and arc, line, or branch in- stead of edge have also been used (sometimes with slightly different meanings). For an edge eEE, a+eE V is called the initial vertex and a- eE V is called the terminal vertex. Both are called the end vertices of e. The inverse map (jI of a I : V-->2 E is defined by (jIu={eEElaIe== v} and has the following properties: (i) If v#v', then (j+vn(j+v' =(j-vn(j-v' = 0. (ii) UVEV(j+V= UVEv(j-v=E. Conversely, if the maps (jI: V -->2 E have the properties (i) and (ii), then there exist corresponding maps ai, For a vertex VE V, l(j+vl is called the outdegree or positive degree, l(j- vi is called the indegree or negative degree, and the sum l(j+ul + l(j-vl is called the degree. We always have the relation LvEvl(j+ul = LVEvWvl = lEI, and LVEv(Wul + 1(j-vi)=2IEI. The number of vertices with odd degree is always even, Two end vertices of an edge are called mutually adjacent, and two edges with at least one common end vertex are also called mutually adjacent. An edge satisfy- ing a+ e = [r e is caned a self-loop, and a vertex satisfying (j+ v = (j- v = 0 is called an isolated vertex. When permutation groups Pvoperating over V and P E operating over E are given, we can naturally define the permutation (nv, n E ) over the graph G(nvEPv, nEE Pd. A graph is classified into equivalence classes by Pv, PE' When Pv, P E are all the permutations of V, E, respectively, each equivalence class is called an unlabeled graph, whereas when Pv, P E consist only of an identity transformation, G is called a labeled graph, For a given graph G=(V,E, (j+, (j-) and a subset E' c E, we can define the reoriented graph of edges in E' by G' =(V, E, a'+, a'-), where a'I e = aIe if ertE' and ade=a+e if eEE'. We have an equivalence relation by identifying the reoriented graphs. Each class by this equivalence relation is called an un- directed graph or an unoriented graph, In con- trast, the graph in the original sense is called a directed graph or an oriented graph. C. Examples of Special Graphs (1) A complete graph is a graph such that there exists one and only one edge with end vertices v and v' for every different pair of vertices v and v'. (2) A bipartite graph is a graph with a partition V+ U V- = V such that V+ n V- = 0 and for every eEE, we have a+eE v+ and (r eE V-. If there always exists an edge e with c+ e= v, a- e = v' for every pair VE V+ and 694 U' E V-, it is called a complete bipartite graph, (3) A regular graph is a graph whose degree at each vertex is the same. (4) A partial graph is defined as follows: Let E' be a subset of E in a graph G=(V,E,a+,a-). The graph G'= (V', E', a'+, D'-) is called a partial graph of G, where V' = a+ E' U a- E', and a'I are the restrictions of a I on E', respectively. When E' = E, the partial graph is the graph obtained by deleting all isolated vertices from G. Simi- larly, for a subset V' of V, the graph G" = (V", E", a"+, D"-), defined by E" = (j+ V" U(j- V" and with a"I being the restriction of a I on E", is called a section graph of G, D. Representation of a Graph When we want to process by computer a problem concerning a graph, we must repre- sent a graph in a suitable form ( 96 Data Processing). A commonly used method is the following list representation: Let V = {v l' v 2 , ...,UM} and E={e\>e 2 , ...,e n }, For each V,EV (ex = 1, ..., M), we arrange the edges in (j+ v, and in (j- v, in suitable orders. Then (i) for each V,E V, record 1(j+v,l, 1(j-v,l, B:, li:, B;;, and iJ;; , where B! and iJ;c are the first and the last index of the edges in (jI v ., respectively. We define the corresponding values to be 0 if the (jI v , are empty, (ii) For each eKEE, record a; , a; , E; , E; , E; , and E; , where a! are the number of vertices aIe., and E!, E are the number of edges immediately after or before e K among the edges with the same initial (for +) or terminal (for -) vertex as that of e K . If e K is the last or the first among such edges, we define the corresponding values to be 0, E. Deformation of a Graph Let G = (V, E, a+, a-) be a graph. A graph ob- tained by opening an edge e is the graph G' = (V, E - {e}, a'+, a'-), where iJ'I is the restric- tion of a I on E - {e}. The graph obtained by deleting all isolated vertices from G' is a partial graph of Gover E - {e}, A graph obtained by shortening an edge e is the graph G" = (V", E - {e},(j"+,a"-), where V"=(V-{a+e,a-e})U {v}, v being a new vertex not contained in V, and a" Ie = <p . act, <p: V --> V" being defined by <p(v)=v for v#aIe K and =v when v=a+e K or v = a- e K . A graph obtained by opening and shortening several different edges is simi- larly defined, and the result is independent of the order of opening or shortening process. A graph obtained by opening edge(s) or by shortening edge(s) or by both processes is caned a subgraph, a contraction, or a sub- contraction, respectively. 
695 F. Connectedness Let G=(V,E,a+,a-) be a graph, and let v, and vp be two vertices. A path from v, to Vii of length I is a sequence P=(v, = v'o' t.1 e K1 , V'I' t.2eK2' ...,t./eK"v,,=vp), where t.i are +1 or -1, and for every i = 1, ..., I, we have a+ e K , == v"_,, a- e K , = v" if t.i = + 1, and a+ e K , = v", a- e K , = v,,_, if t.i = -1. When v, = v p' it is called a closed path considering P as a cyclic sequence. If in the sequence P of a path no edge appears more than once, it is called a simple path. Similarly, if no vertex appears more than once, it is called elementary. If all t.i = + 1, it is called a direct path. Direct closed paths, etc., are similarly defined. If we define v  v' by the existence of a path from v to v', this  is an equivalence relation. Let us denote the equivalence classes by VI' ..., v.. The section graph G i = (Vi, E;, a7 ,ai) determined by Vi is called a connected com- ponent, or simply a component, of G. The sets E I , ... , Es are mutuaIly disjoint and the union is E. If we denote by v--> v' the existence of a direct path from v to v', the relation --> is a tpseudo-order. The relation v <--> v' defined by v-->v' and v' -->v is an equivalence relation in V, and the equivalence classes VI"'" V, or the section graphs G i = (V;, E" at, (5 i -) determined by V; are called strongly connected components of G, E l , ,., , E, are mutually disjoint, but their union is not always E. Among the sets VI' ..., v" we can define an order relation V;--> t;; by the existence of VE V;, v' E t;;, v-->v'. Classi- fication by <--> is a refinement of that by . When s = 1, the graph G is called connected, and when t = 1, G is caIled strongly connected. For a different pair v, v' E V, a set S c V- {v, v'} is called a separator of v and v' if every path from v to v' contains at least one vertex of S. When every separator S for any pair v, u' has at least k elements, the graph G is called k- connected. For k'?3, there are several vari- ations of the definition of k-connectedness. 1- connectedness is equivalent to connectedness in the sense defined above. A simple path containing all edges of a graph is caIled an Euler path. Euler direct paths, etc., are similarly defined. A graph with an Euler path is caIled an Euler graph. A graph is an Euler graph if and only if (i) it is con- nected, and (ii) the number of vertices with odd degrees is 0 or 2 (Euler's unicursa1 graph theorem [IJ). Similar results are known for Euler closed paths or Euler direct paths. An elementary path containing all vertices of a graph is called a Hamilton path. Hamil- ton closed paths. etc., are similarly defined. The criteria for the existence of a Hamilton path for any given graph is unknown. This is known to be an tNP-comp1ete problem ( 71 186G Graph Theory Complexity of Computations). Various suffi- cient conditions or necessary and sufficient conditions for special graphs have been given (e.g., [6J). G. Tieset, Cutset, Tree, and Cotree For a graph G=(V...c E, a+, a-), we define its incidence matrix [i5 (ex = 1,..., M (= I VI), K= 1, ..., n (= lEI))] by defining D to be 1 if v, = a+ e K # a- e" -1 if v, = a- e K # a+ e" and 0 if a+eK=a-e K or eA(j+v,U(j-v,. Similarly we define its adjacement matrix [q (ex, f3 = 1, "', M)J by defining q to be 0 if v, and vp are not mutually adjacent and 1 if U, and v p are mutu- ally adjacent. A set of edges forming a closed path is caIled a tieset. A set of edges of the form {e I a+ e E W, a-eE V- W} U{ela-eE W,a+eE V- W} for a partition (W, V - W) of V is caIled a cutset. A maximal subset of edges containing no tiesets is caIled a tree, and a maximal subset of edges containing no cutsets is called a cotree. A tree is sometimes called a spanning tree of G. Every tree is the complement (with respect to E) of a cotree, and vice versa. The number of elements of a tree is always the same and equal to the trank m of the in- cidence matrix. This value m is caIled the rank of the graph G. Similarly, the number k of elements of a co tree is always the same, which is called the nullity or the cyclomatic number of the graph G. Always, k = n - m. Let K be a field, K", K M be vector spaces over K of dimensions nand M, respectively, and K n *, KM* be the tdua1 spaces of K n , K M The incidence matrix defines two mutually contragradient linear mappings a: K n --> K M and (j:KM* -->K n * with respect to their canon- ical bases. A minimal set among the family of supports (in E) of nonzero vectors in the kernel Ker a is a tieset corresponding to an elemen- tary closed path. Similarly, a minimal set among the family of supports (in E) of nonzero vectors of the image 1m (j is a minimal cutset. Let T ( c E) be a tree and T = E - T be a cotree. Let us renumber the edges so that T == {e l , ...,e m }, T={e m +l> ...,e n }. For each em+pE f, there is a unique vector R;(p = 1, ..., k( = n- m») in Kera whose support is in {em+p}U T and whose e m + p - equals +1. Similarly, for each eaE T (a = 1,..., m), there is a unique vec- tor D: (a= 1, ...,m) in Ima whose support is in {ea} U f and whose e a - component equals +1. {R, ... , Rn is a basis of Ker a, and {D, ... , D:;'} is a basis oflma. Both matrices R;(k x n) and D:(m x n) are totally unimodular, i.e., every minor determinant is 0, + 1, or -1. Further- more, the foIlowing relations hold: R;+q = (j: (p, q = 1, ... ,k), Dg = (j: (a, b = 1, ... ,m), R + 
186H Graph Theory D+p=O (a= 1,..., m;p= 1,..., k). The matrices R; and D are called the fundamental tieset matrix and the fundamental cutset matrix, respectively, with respect to the tree-co tree pair (T, 1'). The minor of the matrix R; con- sisting of all rows and k columns K 1, , . , , Kk is + I or -1 if and only if {e K1 , ..., eKJ is a cotree, and 0 otherwise. Similarly, the minor of the matrix D consisting of all rows and m columns KI'"'' Km is +1 or -1 if and only if {e K " ...,eKJ is a tree, and 0 otherwise. H. Planarity of a Graph Let there be a natural one-to-one correspon- dence between the sets of edges of two graphs Gi=(Vi,Ei,at ,Oi-) (i= 1,2), and suppose that under the correspondence a tree T 1 in G 1 cor- responds to a tree T 2 in G 2 and that the fun- damental cutset matrices D 1 and D 2 with re- spect to (1;, E i - 1;) are mutually equal. Then G 1 and G 2 are said to be 2-isomorphic. The definition is equivalent to the one given by the equality of fundamental tieset matrices. 2-isomorphism is an equivalence relation. If G l and G 2 are 2-isomorphic, the families of trees, cotrees, tie sets and cutsets are mutu- ally corresponding. The coincidence of one of the families is a sufficient condition for the 2-isomorphism of G 1 and G z as undi- rected graphs. A 3-connected graph has no 2-isomorphic graph other than itself. Similarly, when a tree Tl of G 1 corresponds to a cotree 1'2 of G 2 and if the fundamental cutset matrix of G I with respect to (T 1 , E 1 - Td is equal to the fundamental tieset matrix of G 2 with respect to (E 2 - T2' T 2 ) as matrices, then we say that G 2 is dual to G 1 . In this case, G 1 is dual to G 2 also. If G 1 is dual to G 2 and G 2 is dual to G 3 , then G I and G 3 are 2-isomorphic. The duality is a relation among the equiva- lence classes of graphs by 2-isomorphisms. Any graph G = (V, E, a+ , a-) can be "drawn" in 3-dimensional Euclidean space in the fol- lowing sense: Each vertex is a (distinct) point, and an edge e is an arc connecting two points a+ e and a- e with a direction pointing from a+ e to a- e in such a way that no two arcs intersect. A graph representable on a plane or on a 2-dimensional sphere in the above sense is called a planar graph. A graph G is planar if and only if it has a dual graph (H. Whitney [7J). Another necessary and sufficient con- dition is that as an undirected graph, neither the complete five-point graph Ks nor the bi- partite complete graph of three-three points K3,3' appear in any subcontraction of the graph G. (This is a version of the criterion of C. Kuratowski [8].) 696 I. Coloring of Graphs A coloring of the vertices of a graph G = (V, E, a+, 0-) is a mapping if; from V to the set of integers N satisfying the condition if; (v) # if;(u') for all adjacent vertices v and v'. y(G)= min {Iif;( V)IIif; is a coloring of the vertices of G} is called the chromatic number of G. If the graph G is drawn over a 2-dimensional closed surface with a I-dimensional tBetti number b, we have y(G).s;lO + )1 + 24b) j2J, where L J denotes the integral part. Except for the tKlein bottle (with b = 2), where y(G).s; 6, this is the best possible, i.e., there exists a graph whose y(G) equals the upper bound on the right-hand side. As for b? 1, the inequality was first proved by P. 1. Heawood (1890), and final results were established by 1. W. T. Young and G. Ringel [10]. When b=O, y(G).s; 5 was shown by A. B. Kempe (1879), but the four color conjecture: "y(G) .s;4?" has remained unsolved for more than a hundred years. The conjecture is believed to have been solved affirmatively recently through checking a huge number of cases on a large computer [11,12]. A subset We V is called an independent set or an internally stable set if no two vertices in Ware mutually adjacent. ex(G)= max {I WI I W is an independent set of G} is called the num- ber of independence of G. A subset W of V is called a dominating set or externally stable set of G if every vertex VE V is either VE W or adjacent to a vertex of W. [3( G) = min {I WI I W being a dominating set of G} is called the num- ber of domination of G. For every graph G, we have ex(G)? [3(G) and y(G)' ex(G)? I VI. J. Decision Problems and Graphs There are many interesting topics in decision problems concerning graphs, especially from the standpoint of tcomplexity of computa- tions (e.g" [13J). The following are some typical problems: (i) Problems for which algo- rithms of polynomial order are known: Is the given graph k-connected, strongly con- nected, a Euler graph, or a planar graph? (ii) tNP-complete problems: Is the given graph a Hamilton graph? Do we have ex(G)=k, [3(G) = k, or y(G)=k? Let G I and G 2 be two graphs. The problem of whether they coincide as unlabeled graphs is called the isomorphism problem, and has been studied for many years in connection with problems concerning the structure of chemical compounds. Unfortunately, no al- gorithm of polynomial order is known; nor do we know whether this is an NP-complete problem. As for the isomorphism problem for 
697 planar graphs, algorithms of polynomial order are known. K. Perfectness Theorem Let the number of independence and the chro- matic number of a graph G = (V, E, a+ , a-) be ex(G) and y(G), respectively. Let VI,"" v,. be a disjoint decomposition of v. We denote by 8( G) the minimal number of r such that every section graph of G over V; contains a com- plete graph. Furthermore, we denote by w(G) the maximum value of I WI for a subset We V such that the section graph of Gover W contains a complete graph. We always have ex(G).s;8(G) and y(G).s;w(G), Gis caUed ex- perfect if every section graph H of G satisfies :x(H)= O(H). Similarly, G is called y-perfect if y(H) = w(H) for every section graph H of G. The conjecture that y-perfectness and ex- perfectness are equivalent has recently been solved affirmatively [14]. This is ca\1ed the perfectness theorem, References [IJ L. Euler, Solutio problema tis ad geo- metriam situs pertinentis, Commentarii Aca- demiae Petropolitanae, 8 (1736), 128-140. [2J F. Harary, Graph theory, Addison-Wesley, 1969, [3J C. Berge, Graphes et Hypergraphes, Dunod, second edition, 1974. [4J A. A. Zykov, Finite graph theory (in Rus- sian), Nauka, 1969. [5J M, Iri, Network flow, transportation and scheduling, Academic Press, 1969, [6J H. walther and H.-1. Voss, Dber Kreise in Graphen, VEB Deutscher Verlag der Wissen- schaften, 1974, [7J H. Whitney, Non-separable and planar graphs, Trans, Amer. Math, Soc., 34 (1932), 339-362. [8J C. Kuratowski, Sur Ie rrcLieme des courbes gauches en topologie, Fund. Math., 15 (1930), 271-283. [9J F. Harary and W. Tutte, A dual form of Kuratowski's theorem, Canad, Math. Bull., 8 (1965),17-20,373. [lOJ G. Ringel, Map color theorems, Springer, 1974. [lIJ K. Appel and W. Haken, Every planar map is four colourable, I. Discharging, Illinois 1. Math., 21 (1977),429-489; K, Appel, W, Haken and 1. Koch, II, Reducibility, Illinois J. Math., 21 (1977),490-567. [12J S, Hitotumatu, Four color problem (in Japanese), Kodansha, 1978. 187 Greek Mathematics [13J A. V, Aho, J. E. Hopcroft, and J. D. U\1- man, The design and analysis of computer algorithms, Addison- Wesley, 1974. [14J L. Lovasz, Normal hypergraphs and perfect graph conjecture, Discrete Math" 2 (1972),253-267. 187 (XXI.2) Greek Mathematics It is generally believed that theoretical mathe- matics originated with the Greeks, The Greeks learned the arts of land surveying and com- mercial arithmetic from earlier civilizations; but they developed theoretical mathematics themselves, toward the middle of the 4th cen- tury B,C. The creation of a mathematics that transcends practical purposes was one of the most remarkable events in the history of human culture, and one that had an immense impact on the development of all branches of science. We owe the reestablishment of impor- tant Greek mathematical texts and the re- construction of the development of Greek mathematics to the historians of the 19th century (the oldest extensive exposition of the history of mathematics before Euclid is due to Proklos (Proclus) (410-485)). The earliest known Greek mathematicians are Thales of Miletus (c, 639-546 B.C.) and Pythagoras of Samos (fl, 51O? B.C.). Both were Ionians, but the latter went to what is now southern Italy and founded a semireligious school whose members called themselves Pythagoreans, Their motto was "Everything is number"; their studies were ca\1ed mathema ("what is learned") and consisted of music, astronomy, geometry, and arithmetic (the subject group called the quadrivium, which formed the core of medieval and later higher education) "for the purification of soul." Their research delved into the theories of proportion (in relation to music) and tpolygonal numbers (triangular numbers, square numbers, etc.), and more generally into the theory of numbers and geometric algebra. It is said that they knew of the irrationality of )2, though no evidence of this has been found. Even after the demise of the Pythagorean school, its followers continued to promote mathematics in col- laboration with the Academy of Plato, Another significant school was the Eleatic. Among its members Zeno (c. 490-c, 430 B.C.) is especially important. tZeno's paradoxes are arguments leading to absurdity. Some see within them the origin of logical reasoning and, consequently, of theoretical mathematics [3]. It is chronologically difficult to attribute 
187 Greek Mathematics to Zeno the consideration of the continuum and irrational numbers, but we can find in him the impetus toward atomistic reasoning. The computation of the volume of pyramids (by dividing them into "atomistic" laminae) by Democritus (fl. 4307 B.C.) and the atomistic calculation of the area of circles by Antiphon (fl. 430 B.C.) came shortly after the time of Zeno. The middle decades of the 4th century B.C. are known as the Age of Pericles, the Golden Age of Athens. The ttrisection of an angle, the tduplication of a cube, and the tquadrature of a circle, known at that time as the "three big problems" ( 179 Geometric Construction), were studied by the Sophists. Hippias of Elis (fl. 420 B,C.), Hippocrates of Chios (fl. 430 B.C. in Athens), Archytas of Taras (c. 430- 365 B.C.), Menaechmus (fl. 350 B.c.), and his brother Dinostratus (fl. 350 B.C.) solved these problems using conic sections and the quadratrix (a transcendental curve whose equation is y= xcot(nx/2)). By 400 B.C. Athens had lost its political influence, but it remained the center of Greek culture. It was during this time that Plato's Academy flourished, and Plato (427-347 B.C.) and his followers laid particular importance on mathematics. Archytas, Menaechmus, and Dinostratus belonged to or were closely as- sociated with this school. During the first fifty years of the Academy, research in the follow- ing fields was pursued: methodology of mathe- matics or science in general (Le., dialectics, analysis, synthesis); geometric reconstruction of Mesopotamian algebra; the theory of irra- tionals in relation to the geometrization of algebra (Theodorus of Cyrene (5th century B,C.), who was Plato's teacher, as well as Theai- tetus of Athens (415?-369 B.C.) contributed to this study, and the general theory of propor- tion by Eudoxus of Cnidos (c. 408-c. 355 B.C.) also belongs to this field); the method of ex- haustion (by Eudoxus); and studies of the "three big problems" and conic sections, It was this school in which the term mathema came to be used in its present sense of "mathemat- ics" rather than in the sense of disciplines in general. The conquests of Alexander the Great ac- celerated the already considerable cultural influence of Athens. Later, during the Ptole- maic period, the center of culture moved to Alexandria. The Mouseion at Alexandra, the combined library and university, is said to have possessed hundreds of thouands of volumes. At Alexandria, Euclid (c. 300 B.C.) com- piled his Elements, which became a model for scientific works for centuries to come- Newton's Principia as well as Spinoza's Ethics 698 are modeled on it. Most historians say that Euclid's method derives from Aristotle (384- 322 B.C.), who, after studying at Plato's Acad- emy, founded a new school, the Peripatetics, whose doctrines are at many points opposed to those of the Academy. Others, however, see the origin of Euclid's axiomatic method in the Eleatics [5J; we can find prototypes of some parts of the Elements in both Oenopides, who lived during the time of Zeno, and in Hippocrates, The third century B.C. was the Golden Age of Greek mathematics. Archimedes of Syracuse (c. 282-212 B.C.) was the greatest mathema- tician, mechanic, and technician of antiquity. He did important work in mathematics, study- ing the exact quadrature of the parabola. According to his ephodos (method), he would obtain a result by mechanical experiments and then prove it by the method of "exhaustion." He also computed the value of n; studied spirals and other curves, spheres, and circular cylinders; contributed to the development of statics and optics and their application; and had a profound influence on later mathema- ticians. During the same period, Apollonius of Perga (fl. 210 B.C.) wrote Konikon biblia (Books on Conics) in eight books, of which the last has been lost. The geometric theory of tconic sections contained in this work is not much different from the one we know today; it had a great influence on 17th-century scientists es- pecially. Other mathematicians of this period worth noting are Eratosthenes (c. 275-195 B,C.), who conceived the tsieve method of find- ing prime numbers and who measured the earth, and Hipparchus (fl. 150 B.C.), called the father of astronomy, who made a table of smes. Hellenistic influence began to decline in the first century B.C., and the influence of Alex- andria decreased. The Mouseion burned in 48 B.C., but was rebuilt. Among the mathe- maticians of this time, we may count Heron (fl, 607 A,D,); Menelaus (fl. 100 A.D.), who wrote Sphaerica; Theon of Smyrna (fl. 125 A.D.); Ptolemy (fl. 150 A.D.), the author of Almagest; Nicomachus (50?-1507 A.D.), the author of Arithmetike eisagoge; Diophantus (fl. 250? A.D.), whose career is not fully known but who wrote Arithmetika, of which six of the original thirteen books remained to influence tFermat; and Pappus (fl. 300 A.D.), the last creative mathematician in Greece, who left eight books of the Synagoge, which influenced tDescartes and which stiIl exist today. The period following the fall of the Western Roman Empire was a difficult one for Greco- Egyptian science. The Mouseion was de- stroyed for the second time in 392 A.D. Theon of Alexandria (fl. 380) and his daughter, Hy- 
699 patia (c. 370-415), were at that time working on commentaries on the classics. Among the few remaining works of the period is Procl us' (410-485) commentaries on the first book of Euclid's Elements. The Athenian Academy was closed in 529 by order of the Emperor J usti- nian; the last director was Simplicius, who commented on Aristotle. Soon afterward, Alexandria fell into the hands of the Moors, and many scholars fled as refugees to Constan- tinople, the capital of the Eastern Empire, References [IJ T. L. Heath, A history of Greek mathe- matics I, II, Clarendon Press, 1921. [2J T. L. Heath, A manual of Greek mathe- matics, Oxford Univ. Press, 1931 (Dover, 1963). [3J M. Cantor, Vorlesungen iiber Geschichte der Mathematik I, Teubner, third edition, 1907. [4] B. L. van der Waerden, Zenon und die Grundlagenkrise der griechischen Mathema- tik, Math. Ann" 117 (1940), 141-161. [5] A. Szabo, Anfiinge des euklidischen Axiomensystems, Arch. History Exact Sci., 1 (1960), 37-106. [6J 1. L. Heiberg (ed.), Euc1idis opera omnia I-XIII and suppl., Leipzig, 1883-1916. [7J T. L. Heath, The thirteen books of Euclid's Elements I, II, III, Cambridge Univ. Press, 1908 (Dover, 1956). [8J p, Ver Eecke (trans.), Les oeuvres com- pletes d'Archimede, Desclee de Brouwer & Cie, 1921 (Blanchard, 1961). [9J P. Ver Eecke (trans.), Proc1us Diadochus, les commentaires sur Ie premier livre des Ele- ments d'Euclide, Desclee de Brouwer & Cie, 1948 (Blanchard, 1959). [I OJ P. Tannery, Le geometrie grecque, com- ment son histoire nous est parvenue et ce que nous en savons, Paris, 1887. [l1J B. L. van der Waerden, Science awaken- ing, Noordhoff, 1954, [12J M. Clagett, Greek science in antiquity, Abelard-Schuman, 1955 (Collier, 1963), [13] O. Becker (ed.), Zur Geschichte der grie- chischen Mathematik, Darmstadt, 1965, [14] A. Szabo, Anfiinge der griechischen Mathematik, Oldenbourg, 1969, F or the history of mathematics in general, [15J R. C. Archibald, Outline of the history of mathematics, Mathematical Association of America, sixth edition, 1949, [16] N. Bourbaki, Elements d'histoire des mathematiques, Hermann, second edition, 1969. [17J M. Cantor, Vorlesungen iiber Geschichte der Mathematik I-IV, Teubner. second edi- tion, 1894-1908, I88A Green's Functions [18J E. Montucla, Histoire des mathematiques IIV, Paris, new edition, 1799-1802, [19J D. E. Smith, A source book in mathe- matics, McGraw-Hili, 1929 (Dover, 1959), [20] D. J. Struik, A concise history of mathe- matics, Dover, third edition, 1967. 188 (XII1.28) Green's Functions A. General Remarks Green's functions are usually considered in connection with tboundary value problems for ordinary differential equations and also with telIiptic and tparabolic partial differen- tial equations. For example, consider bound- ary value problems for the tLaplacian in 3- dimensional space: L[uJ=(a2/axi+a2/ax+ a2/ax)u. Let D be a bounded domain with a smooth boundary S and the boundary con- dition B on S be either u(x) = 0 (x E S) (the first kind) or au/an + fJu= 0 (xES)(the third kind), where n is the outer normal of unit length, {J(x)?O, and (i(x);j=O. We say that the function g(x 1 , x 2 , x 3 ; h 2' 3) is the Green's function of L (or the partial differential equation L[uJ =0) relative to the boundary condition B, when (i) g(x,) satisfies Lx[g(x, m = 0 except for x =; (ii) g(x, ()= -1/4nr + w(x, (), where r =('L.dXi - (;)2)1/2 and w(x,) is a regular function, i.e., of class C' for a suitable value v; (iii) g(x,) satisfies the boundary condition B, i,e., g(x,) = 0, XES (the first kind), or (a/an + {J) g(x, 0 =0, XES (the third kind). Conditions (i) and (ii) mean that g(x,) is a Hundamental solution of L, i,e., Lx[g(x, m = b(X- ), where b(X -) is tDirac's measure at the point X =, Note that if g(x, ) is a fundamental solution, then by adding any solution u of the equation L[uJ = 0 to g, we obtain another fundamental solution 9 + u. Thus Green's function is the fundamental solution that satisfies the given boundary condition. To be more precise, in the boundary value problem, if the boundary condition is of the first kind, g(x, ) can be obtained by adding to a fundamental solution -1/4nr the solution w(x,) of the folIowing Dirichlet problem: x(j)(x, ) == 0, w(x, ) == 1/4nr (XES). We remark that there are slightly different definitions for Green's function. For example, there are cases where g(x, ) is de- fined by g(x, ) = 1 /4nr + w(x, ) or by g(x, ) = l/r + w(x, ). In the case in the previous paragraph, Green's functions satisfying the boundary conditions are uniquely determined. In gen- eral, if we are given a Green's function, then 
188 B Green's Functions for any regular function v(x) the function u(x)= Iv g(x, )v()d represents the solution of L[uJ = v with the boundary condition B. More precisely, if v(x) satisfies the tH61der condition I v(x) - v(x') I.s; Llx - x'I' (0 <ex.s; 1) (L, ex positive constants), then u(x) is of class C 2 . Conversely, if u(x) satisfies the equation L[uJ = v and the bound- ary condition B, it is represented by the for- mula for u(x). This means that if we denote the operator that associates u to v by G, then G is the inverse operator of the Laplacian L with the boundary condition B, and Green's function is the tintegral kernel of the operator G. Using this property, the boundary value problem relative to L can be reduced to a problem oftintegral equations. For example, the differential equation with the boundary condition B containing the complex parameter }" L[uJ +).u= J, is equivalent to the integral equation u +AG[UJ = G[fJ, which is obtained by letting G act from the left on the above dif- ferential equation. In this way, the problem can be simplified, In the case of general boundary value problems for higher-order elliptic operators, Green's functions are defined in the same way as before ( 189 Green's Operator). The im- portant case is when L and the boundary condition B define a tself-adjoint operator. In this case, Green's function is symmetric (g(x, )=g(, x)). To obtain Green's function is not easy in general. However, in some cases such functions can be obtained fairly easily ( Appendix A, Table 15.VI). B. Self-Adjoint Ordinary Differential Equations of the Second Order Consider the operator L[uJ = (p(x)u')' + q(x)u (p(x) > 0) defined in the interval a.s; x.s; b, with boundary conditions of the form exu' + flu = 0 at the two endpoints. Then Green's function g(x,) is defined in the folIowing way: (i) For x #, L[g(x, m = 0; (ii) [8g(x, )j8xJ=g = 1jp(); (iii) for  fixed, g(x,) satisfies the homo- geneous boundary conditions at x=a and x=b. Conditions (i) and (ii) mean that L[g(x, )J =(j(x-). We can construct g(x,) in the following way: Let U I (U 2 ) be the solution of L[uJ =0 satisfying the boundary condition at x = a (at x = b). If UI and U 2 are linearly inde- pendent, we can satisfy p( U'I U 2 - U I u) = 1 by choosing the constants suitably. Then Green's function g(x,) is defined by g(x, )= u\ (x)u2() for x.s;, and g(x, )= U I ()U2(X) for .s; x. If U I and U 2 are linearly dependent, there exists no 700 Green's function that satisfies conditions (i), (ii), and (iii), However, by modifying the defi- nition, we can get a generalized Green's func- tion playing a similar role [2]. This method can be applied to the case of ordinary dif- ferential equations of higher order. C. The Laplace Operator When the domain D is the n-dimensional sphere of radius a with center at the origin, Green's function of the Laplacian relative to the boundary condition u = 0 is obtained in the folIowing way. Let E(r) be the folIowing fundamental solution of the Laplacian: E(r)= (2n)-ll og rfor n=2 and E(r)= -«n-2)' w n r n - 2 )-1 for n?3, where w n =2n nf2 qnj2) is the (n - 1 )-dimensional surface area of the n- dimensional unit sphere. Then Green's func- tion g(x, ) is defined by g(x, )=E(r)- E(pr'ja), where p=(Li=1 a)If2, r' =(Lil (Xi- i)2)1/2, ;=(ajp)2i' D. Helmholtz's Differential Equation Let D be an exterior domain with a smooth boundary Sin R 3 . In mathematical physics, the boundary value problem of finding a solu- tion u(x) of Helmholtz's differential equation (+ F)u(x) = f(x) (k> 0) satisfying u(x) = 0 (x E S) is of particular interest. In this case, concerning the behavior of u(x) at infinity, we usually assume Sommerfeld's radiation condition: When Ixl--> +00, u(x)=O(lxl- I ), ( 8 8 u-iku )1 ==o(lxl-I), r Ixlr where 8j8r is the derivative along the radial direction. It is known that this condition en- sures the uniqueness of the solution (Rellich's uniqueness theorem). We can construct Green's function G(x, ) for any k( > 0) so that for smooth f(x) with bounded tsupport, u(x)= Iv G(x,)f()d represents the solution satisfying u(x)=O (XES) and Sommerfeld's radiation condition. Then, with eiklx-I G(x, ) = - + KAx, ), 4nlx--1 where ( 3 ) 1/2 Ix-I= i (xi-i , 
701 Kc!x, () can be obtained by solving an integral equation of tFredholm type [3,4]. In this case, there exists no Green's operator in L 2 space, and G(x, () can be considered to be a general- ized Green's function [4]. E. Stokes's Differential Equation Let D be a bounded domain in R 3 with smooth boundary S, and consider Stokes's differential equation in D ap Jiu.=--px. l 8Xi p 3 au. i=I,2,3, L=O' jl aX j where Ji, P are positive constants. In hydro- dynamics, we consider the boundary value problem of finding solutions (udx), u 2 (x), u 3 (x), p(x)) of Stokes's equation satisfying the bound- ary condition ui(x) = 0, XE S (i = 1, 2, 3). In this case, Green's tensors Gi)x, (), g;(x, () can be constructed, and for smooth functions Xi(x) (i = 1, 2, 3) the unique solution of this bound- ary value problem is represented by Ui(X)=P jtl Iv Gij(x,()X))d(, p(x) '7 P jt 1 Iv gj(x, )Xj(()d( [5]. F. Parabolic Equations Consider the boundary value problem (the initial boundary value problem): au 2 iJ2u L[uJ=--c - 2 =f(x,t), t>O, a<x<b, at ax u(x, 0) = <p(x), where at x = a and x = b, u(x, t) satisfies some homogeneous boundary conditions. In this case, we can construct the function g(x, t; (, r) (t? r) satisfying the fol1owing con- ditions: (i) L [gJ = 0 except for x =. t = r; (ii) exp( - (x- ()2/4c 2 (t -r)) g(x,t;(,r)-  2c y n(t - r) + (regular function) in a neighborhood of x = (, t = r; and (iii) g(x, t; (, r) satisfies the given homogeneous boundary conditions at x = a and x = b. Then u(x,t)= tr g(x,t;,r)f((,r)d(dr + r g(x, t; (, O)<p()d represents the solution of the problem stated 188 G Green's Functions in this section for regular functions f, <po The function g(x, t; (, r) is cal1ed Green's function relative to the boundary value problem. De- tailed consideration of such elementary cases is found in [6]. G. Kernel Functions The kernel function is closely related to Green's function of  (the Laplacian) rela- tive to the first boundary value problem in a domain in R 2 . First we explain the general definitions of the kernel function. Let E be a general set, and let 3' be a tHilbert space of complex-valued functions defined on E with a suitable inner product (f, g). Suppose that we are given a function K(x, y) defined on Ex E satisfying the fol1owing conditions: (i) For any fixed y, K (x, y) regarded as a function of x belongs to 3'; and (ii) for any f(X)E!5, (/(x), K(x, Y))x= f(y). Then K (x, y) is cal1ed a kernel function or reproducing kernel. The kernel function, if it exists, is unique and is tpositive definite Her- mitian; that is, n L K(Yj,Yk)(lk?O. j,kl (1) Conversely, any positive definite function is a reproducing kernel of some Hilbert space. A necessary and sufficient condition for the existence of the kernel function is that for any YE E, the linear functional f --> f(y) be bounded. In this case, the minimum of Ilfll under the condition f(y) = 1 (/ Em is attained by the element K(x, y)/K(y, y), and its value is K(y,y)-1/2. When 15 is a tseparab1e space, then by an torthonorma1 system {<Pv(x)}, we can represent K (x, y) as 00 K(x, y) = L <Pv(x) <p,(y) . v=l (2) As an example of kernel functions, the fol- lowing case is of particular importance. Let E be an n-dimensional tcomplex manifold, <p and i/J be ho10morphic tdifferentia1 forms of degree n on E, and let the fol1owing inner product be gIVen: (<p,i/J)= L <pI\If. Now 3' is given by 3'={<pI<p,<p)< +oo}, and the kernel function is cal1ed the kernel dif- ferential. When E is a domain in en, regarding the coefficients of the differential form as func- tions, we cal1 the kernel function Bergman's kernel function. Moreover, if E can be mapped onto a bounded domain by a one-to-one ho10morphic mapping, then ds 2 = LW10gK(z, z)/azjazddzjazk 
188H Green's Functions is positive definite and gives a Kahler metric which is called the Bergman metric. H. Kernel Functions for Domains in the Complex Plane Let E be a domain D in the complex plane (z = x + iy). Let K (z, 0 be Bergman's kernel function of D, and let G(z, 0 be Green's func- tion of  relative to the first boundary con- dition with a pole at (. Then we have K(z,()= -(2jnWG(z,O/rzor Next, let U (z, 0 be the kernel function of the Hilbert space consisting of the holomor- phic differential forms whose integrals are single-valued, and let N(z,O be Neumann's function of , Le., the function that is har- monic in D - {O, has the same singularity as Gat (, and whose derivative in the normal direction oN jon is constant along the bound- ary. Then we have U(z, ()=(2jnWN(z, Ojozor Now the kernel H(z,O=(N(z,O-G(z,mj2n is the kernel function relative to the Hilbert space consisting of all real tharmonic functions whose integral mean value along the boundary r is 0 and having the inner product (<p,IjJ)= ff ( o<PoljJ + o<paljJ ) dXdY. D ox ax oy ay The kernel H(z,O is called a harmonic kernel function. Suppose that the boundary r is tpiecewise smooth, and consider the space of all holo- morphic functions in D that are continuous on the boundary of D. The inner product of such functions <p and IjJ is given by (<p, 1jJ) == S I <plj/ ds (ds is the element of the arc length of r). Hence we have a Hilbert space. Then the kernel function relative to this Hilbert space is called Szego's kernel function, which has a close relation with tbounded functions. The kernel functions enable us to represent holomorphic mappings that map the domain D onto various canonical domains ( 77 Conformal Mappings). References [IJ R. Courant and D. Hilbert, Methods of mathematical physics, Interscience, I, 1953; II, 1962. [2J K. Y osida, Lectures on differential and integral equations, Interscience, 1960. (Orig- inal in Japanese, 1950.) [3] V. D. Kupradze, Randwertaufgaben der Schwingungstheorie und Integralgleichungen, 702 (3) Deutscher Verlag der Wiss., 1956, (Original in Russian, 1950.) [4J S. Mizohata, Theory of partial differen- tial equations, Cambridge Univ. Press, 1973. (Original in Japanese, 1965.) [5] F. K. G. Odqvist, Ober die Randwertauf- gaben der Hydrodynamik zaher F1iissigkeiten, Math. Z., 32 (1930), 329-375. [6J E. E. Levi, Sull' equazione del calore, Ann. Mat. Pura Appl., (3) 14 (1908), 187-264. For kernel functions, [7J S. Bergman, The kernel function and con- formal mapping, Amer. Math. Soc. Math. Surveys, 1950. [8J S. Bergman and M. Schiffer, Kernel func- tions and elliptic differential equations in maihemalicai physics, Academic Press, 1953. [9J N, Aronszajn, Theory of reproducing kernels, Trans. Amer. Math. Soc., 68 (1950), 337-404. [1OJ H. Meschkowski, Hilbertsche Riiume mit Kernfunktion, Springer, 1962. (4) 189 (XIII.29) Green's Operator A. General Remarks Consider the first and the third boundary value problems for the elliptic equation n OU A[uJ = - u+ L ai(X)-+C(X)U= f(x) i1 OXi ( 323 Partial Differential Equations of Ellip- tic Type). Let D be a bounded domain ofR n whose boundary S consists of a finite number of smooth hypersurfaces. By the tmethod of orthogonal projection, we take the domain (A) as follows: (i) {u(x)IU(X)EHz(D) and u(x)=O for XES} or (ii) {u(x)lu(x)EHz(D) and oujDn+[3(x)u=O for XES} according as we are considering the first or third boundary value problem, where H 2 (D) is the tSobolev space ( 168 Function Spaces). If the operator A is a one-to-one mapping from (A) onto the function space L 2 (D), we call the inverse operator A -I Green's operator relative to the boundary condition, and we denote it by G. In general, the existence of A -I is not guaranteed. However, if we take real t large enough, G, = (A+tI)-1 exists. Consider the general case where ..1. is a com- plex parameter: (AI - A) [uJ = f(x), f(X)E Lz(D). Letting G, act from the left, we have (I -(..1. + t)G,)[uJ= -G,f Conversely, ifuEL 2 (D) is a solution, clearly u(x)E£&(A), and u(x) satis- fies the first partial differential equation and the boundary condition. Since G, is a tcom- 
703 pact operator in L 2 (D), the tRiesz-Schauder theorem can be applied ( 68 Compact and Nuclear Operators). In particular, if }, + t is not an teigenvalue of G" u(X) = (AI - A)-I r = -(1- (}.+t)G,)-lGJ represents a unique solution. In the equations in the first paragraph of this section, if ai(x) = 0 and c(x) and [3(x) are real, then G, is a tself-adjoint operator in L 2 (D), and therefore the tHilbert-Schmidt expansion theorem can be applied, Namely, let {Ai} be the eigenvalues of A such that Awi(x)= }.iWi(X), where {wi(x)} is an torthonormal system in L 2 (D). Then for any f(X)EL 2 (D), f(x) = LI (f, W;)Wi(X), where the right-hand side is taken in the sense of tmean convergence. Furthermore, for f(x)E(A), we have the expansion (Af)(x) = LI }'i(f, w;)w;(x) in the same sense. When G, is not self-adjoint, let G,* be the tadjoint operator of G, in L 2 (D). Then G,* represents Green's operator relative to the equation n a (A*+tI)[vJ= -v- L -( ai(x) v) i1 OX i +(c(X)+t)v =g(x), corresponding to the boundary conditions (i) v(x)=O, XES (first boundary condition) and (ii) (%n)v + [3'(x) v = 0, XES, where n [3'(x) = [3(x) + L a;(x) cos nx i , i=l with n the outer normal (third boundary con- dition) [2]. B. Elliptic Equations of Higher Order Green's operator can be defined for elliptic equations of higher order. Consider the equation A(x, %x)u(x) = f(x), xED; Bj(x,%x)u(x)=O, XES, j = 1,2, .. . , b( = m/2), (I) where A is an telliptic operator of order m and the boundary operators {B j } satisfy: (i) At every point X of S, the normal direction is not tcharacteristic for any B j ; and (ii) the order m j of Bj is less than m, and m j # m k (j # k). The domain (A) of A is defined by (A)={u(x)luEW(D) and B)x, %x)u(x)=O for XES, j=I,2"..,b}. When A is a one-tO-One mapping from £&(A) onto L 2 (D), the inverse G = A -I is called Green's operator, 189 C Green's Operator This general boundary value problem, es- pecially the existence theorem, was treated under some algebraic conditions on A and {Bj} by M. Schechter [5] who showed that G[uJEHm(D) ifu(x)EL 2 (D) and that G[uJ depends continuously on u. In particular, if m> n/2, then by tSobolev's theorem, G is a continuous mapping from L 2 (D) into c°(i)), and G is represented by an tintegral operator of Hilbert-Schmidt type (L. Garding [6]). Namely, for any f(x)EL 2 (D), (Gf)(x) = Iv G(x,)f@d, ff'G(x,WdXd< +00, In general, the function G(x, ), obtained by the kernel representation of Green's operator G, is called Green's function. On the other hand, consider, for exam- ple, the tDirichlet problem of  in R 3 Then Green's function is defined in the following way ( 188 Green's Functions): G(x,) = -(4nlx- W- I + u(x, ), where u(x,) satisfies (i) xu(x,)=O, and (ii) G(x,)lxEs=O. The function defined in this manner coincides with Green's function defined as a kernel represen- tation of Green's operator [1,2]. Suppose that in problem (1) the telliptic operator A(x, a/ax) is independent of x, and let E(x) be a tfundamental solution, i.e., E(x) is a distribution solution of A(%x)E(x)=£5(x) (£5(x) is tDirac's £5-function;  112 Differential Operators). Then E(x) is a COO-function away from the origin. Moreover, in a neighborhood of the origin the following estimates hold: For lexl<m, { CIXlm-n-I,I, m-n-Iexl <0, 1( :J 'E(X)I.s; cloglxl- 1 , m-n-lexl=O, c, m-n-lexl>O, where C is some positive constant. When prob- lem (1) has Green's operator G, we can say the following, using the fundamental solution: Green's function G(x, ) exists and can be written as G(x, )=E(x-U+u(x, ), where for any fixed ED, u(x,) satisfies (i) A(O/ox)u(x,) =0 and (ii) B)x, %x)G(x, )=O, xES,j= 1,2,..., b. C. Hypoelliptic Operators Let A(x,O/ox)= L a,(x) (  ) ', lal'i;m ax ( a ) " 01'1 ox = OXI...0X"' 
189 Ref. Green's Operator be a general partial differential operator with Coo -coefficients. If the kernel E(x, ) satisfies A(x, ajax)E(x,) = £5(x - ), that is, if we have the relation <E(x, ), 'A (x, ajax)<p(x)x = <p() for any <p(X)E!ZJ, then E(x,) is called a funda- mental solution of A, where tA is the transposed operator of A: ( (1 ) ' 'A (x, ajax)v(x) = L (-1)1" -;- (a,(x)v(x»). l'l<;m ox Now if there exists a fundamental solution E'(x,) of 'A (x, ajax) such that (i) E'(x,) de- fines a kernel that gives rise to two continuous mappings, one of which maps the space £& into ,ffx, and the other of which maps the space !ZJ x into ,ff ( 168 Function Spaces), and (ii) for x # (, E'(x, ) a CcN -function of (x, (), then any distribution u(x) satisfying A(x, ajax)u(x) = g(x) is a Coo -function, where g(x) is of class Coo. In general, an operator A with the prop- erty that any solution u(x) of A(x, ajax)u(x) = g(x) is of class Coo whenever g(x) is of class Coo, is called hypoelIiptic. Elliptic and para- bolic operators are both hypoelliptic. L. Hor- mander characterized the hypoelliptic dif- ferential operators with constant coefficients [8] ( 112 Differential Operators). A kernel E(x,) such that A(x, ajax)E(x, )=£5(x )+ w(x, ), with w(x, ) a Coo -function, is called a para- metrix of A. To prove the hypoellipticity of A, it suffices to show the existence of a parametrix E'(x,) of the operator tA having the prop- erties (i) and (ii) mentioned in the previous paragraph. To explain the notion of the fundamental solution for the tevolution equations, suppose that wc arc given an evolution equation am L[uJ =-u(x, t) atm ( 8 ) ' aj + L a,jx,t) _ a _ a ju(x,t)=O, j<m X t XE Rn, to.s; t.s; T. A kernel E(x, t;, to) (to.s; t.s; T) is called a fundamental solution to the evolution equation if Lx,,(E(x, t;, to)) == 0, t> to, and ,iY { O, 0.s;j.s;m-2, hm -----.., E(x, t; , to) == , ''o+oat' £5(x-(), j=m-1. References [IJ H. G. Garnir, Les problemes aux timites de la physique mathematique, Birkhiiuser, 1958. [2J S. Mizohata, Theory of partial differential equations, Cambridge Univ. Press, 1973. (Original in Japanese, 1965.) 704 [3] R. Courant and D. Hilbert, Methods of mathematical physics II, Interscience, 1962. [4] L. Garding, Dirichlet's problem for linear elliptic differential equations, Math. Scand., 1 (1953),55-72. [5] M. Schechter, General boundary value problems for elliptic partial differential equa- tions, Comm. Pure Appl. Math., 12 (1959), 457-486. [6J L. Garding, Applications of the theory of direct integrals of Hilbert spaces to some integral and differential operators, Univ. of Maryland lecture series no. 11, 1954. [7J L. Schwartz, Theorie des distributions, Hermann, second edition, 1966. [SJ L. Hormander, On the theory of general partial differential operators, Acta Math" 94 (1955),161-248. 190 (IV.1) Groups A. Definition Let G be a nonempty set. Suppose that for any elements a, b of G there exists a uniquely deter- mined element c of G, which is called the prod- uct of a and b, written c=ab. We call G a group or multiplicative group if (i) the associa- tive law a(bc)==(ab)c holds, and (ii) for any elements a, bEG there exist uniquely deter- mined elements x, YEG satisfying ax=b, ya= b. Then the mapping (a, b)-->ab is called multiplication in G, Condition (ii) is equivalent to the following two conditions: (iii) There exists an element e (called the identity element or unit element of G) such that ae = ea = a for any element a of G; and (iv) for any element a of G there exists an element x such that ax = xa=e. The element x in condition (iv) is called the inverse (or inverse element) of a, denoted by a -I. The uniqueness of the identity element e and the inverse a-I follows readily from the axioms. The identity element of a multiplica- tive group is sometimes denoted by 1. If ab = ba, then we say that a and b Commute. The commutative law, ab = ba for any elements a, bEG, is not assumed in generaL A group satis- fying the commutative law is called an Abelian group (or commutative group) in honor of N. H, Abel, who made use of commutative groups in his study of the theory of equations. The product in a commutative group is often writ- ten in the form a + b, and in this case the mapping (a, b)-->a + b is called addition. The element a + b is called the sum of a and b, and G is called an additive group. In an additive 
705 group the identity element is usua1ly denoted by 0 and ca1led the zero element, and the in- verse of a is denoted by -a ( 2 Abelian Groups; 277 Modules). To describe the tlaw of composition, we sometimes use notation differ- ent from multiplication or addition ( 409 Structures). B. Examples A tlinear space over a tfield K is an additive group with respect to the usual addition of vectors ( 256 Linear Spaces). A field is an additive group with respect to the addition, and the set of nonzero elements of a field forms a group with respect to the multiplica- tion, which is caUed the multiplicative group of the field ( 149 Fields). A1I tinvertible n x n matrices over a ring R form a group with respect to the usual multi- plication of matrices. This group is ca1led the tgenera1linear group of degree n over R ( 60 Classical Groups). AU one-to-one mappings from a set M onto itself (i.e., an permutations on M) form a group with respect to the composition defined by (j og) (x)= f(g(x)) (x EM). (Sometimes the product fog is denoted by gf and f(x) by xf Then x(gf) = (xg)f) The group ofa1l permuta- tions on M is ca1led the symmetric group on M. A group G is ca1led a permutation group (on M) if every element of G is a permutation on M. For instance, the general linear group of degree n over a field K may be regarded as a permutation group on the set of n-dimensional vectors, and it is also regarded as a permuta- tion group on a ttensor space. An tmotions in a Euclidean space form a group with respect to the usual composition of motions. A1I invertible n x n matrices over K leaving a given tquadratic form invariant form a group with respect to the usual multiplica- tion of matrices. This group is ca1led the torthogonal group belonging to the given quadratic form. If K is the tcomplex number field or the real number field, then these groups are Lie groups ( 13 Algebraic Groups; 151 Finite Groups; 161 Free Groups; 249 Lie Groups; 423 Topological Groups). C. Fundamental Concepts If a group G consists of a finite number of elements, then G is called a finite group; other- wise, G is ca1led an infinite group. The number of elements of G is caUed the order of G. A non empty subset H of G, is ca1led a subgroup of G if H is a group with respect to the multipli- cation of the group G. Hence a non empty sub- 190 C Groups set H is a subgroup of G if and only if a-I bEH for any a, bE H, For a family {H A } of sub- groups of G, the intersection n). H A is also a subgroup. The associative law of multiplication says that elements a" a z , a 3 of G determine the product a l a 2 a 3 , which is the common value of (a l a 2 )a 3 and ada 2 a 3 ). This law can be gen- eralized to say that any ordered set of n ele- ments a" a 2 , ... ,an (n> 2) of G determines their product a I a 2 ... an (general associative law). When a l =a 2 =... =an=a, we denote the product aa ... a by an. If we define a -n for n  0 by aO=e and a-n=(an)-I, we then have ana m = a n + m , (an)m = a nm for any n, mE Z. If there exists a positive integer n such that an = e, then the smaUest positive integer d with ad = e is ca1led the order of the element a. If there is no such n, then a is ca1led an element of infinite order. If a is of infinite order, then its powers a O (= e), a:!: \ a:!:2, ..., are all unequal. If a is of order d, then the different powers of a are a O (= e), a, a 2 , ..., ad-I. A1I the powers of a form a subgroup <a) of G, caned a cyclic subgroup. The order of an element a is the same as the order of the subgroup <a). The group <a) itself is ca1led a cyclic group and is an example of an Abelian group ( 2 Abelian Groups). Let S be a subset of a group G. Then the intersection of a1l subgroups of G containing S is ca1led the subgroup generated by S and is denoted by <S). It is the sma1lest subgroup containing S, and if S is nonempty, <S) con- sists of a1l the elements of the form a1 a'i 2 ... a;" (aiES, miE Z). If <S) = G, the elements of S are caUed generators of G. When G has a finite set of generators, G is said to be finitely generated. When S = {a}, then < S) coincides with <a), and the element a is the generator of the cyclic group <a). Suppose that elements a"..., an of G satisfy an equa- tion of the form ala'i2... a;:'" = 1. This equa- tion is then ca1led a relation among the ele- ments aI' ..., an. If we have a system of gen- erators and an relations among the generators, then they define a group ( 161 Free Groups). It is, however, still an open problem to find a general procedure to decide whether the group determined by a given system of generators and the relations among them contains ele- ments other than the identity ( 161 Free Groups B). For a subset S and an element x of a group G, the set of an elements x -I SX (SES) is de- noted by x -I Sx or SX, and Sand SX are caUed conjugate. We have (abY=axbX, (a-I)X =(aX)-I. If H is a subgroup, then HX is also a subgroup. For a subset S, the set of a1l ele- ments x satisfying SX = S forms a subgroup N (S), caned the normalizer of S. The set of all elements that commute with every element of 
190 D Groups S forms a subgroup Z(S), called the centralizer of S. The centralizer Z of G is called the center of G. The set of all elements conjugate to a given element a of G is called a conjugacy class. A group G is the disjoint union of its conju- gacy classes. Let H be a subgroup of a group G and x an element of G. The set of elements of the form hx (h E H) is denoted by H x and is called a right coset of H. A left coset xH is defined similarly. G is the disjoint union of left (right) co sets of H. The cardinality of the set of left cosets of H eq uals that of the set of right co sets of H; it is called the index of the subgroup H and is denoted by (G: H), Given two subgroups Hand K of G, the set HxK = {hxk I hE H, kEK} is called the double coset of Hand K, and Gis the disjoint union of different double co sets of Hand K. If the left cosets of a subgroup Hare also the right cosets, i.e., if Hx = xH for every XEG, then H is called a normal subgroup (or invariant subgroup) of G. An equivalent con- dition is that H = H x for all x E G. The center of G is always a normal subgroup of G. If H is a normal subgroup of G, the set of all products of an element of Ha and an element of Hb coincides with Hab. Thus if we define the product of two cosets Ha and Hb to be Hab, then the set of cosets of H forms a group. This group is denoted by G/H and is called the factor group (or quotient group) of G modulo H. (When G is an additive group, G/H is also denoted by G - H and is called the difference group.) The group G itself and {e} are normal subgroups of G. If G has no normal subgroup other than these two, then G is called a simple group. A subgroup of finite index contains a normal subgroup of finite index. If H is a subgroup of finite index, then we can find a common complete system of representatives of the left cosets and the right cosets of H. If G is finitely generated, then so is any subgroup of G of finite index. Let R be an tequivalence relation defined in a group G, If xRx' and yRy' always imply (xy)R (x'y'), then we say that R is compatible with the multiplication. The tquotient set G/R is a group with respect to the induced multipli- cation. This group is called the quotient group of G with respect to R. The equivalence class H containing e is a normal subgroup, and xRx' if and only if x -1 x' E H, i.e., x and x' are contained in the same coset of H. Thus G/R coincides with G/H. If G is a finite group of order n, then the order and the index of any subgroup of G, the order of any element of G, the cardinal number of any conjugacy class of G, and the number of different conjugate subgroups of any subgroup of G are all divisors of n. 706 D. Isomorphisms and Homomorphisms If there is a one-to-one mapping a <-+ a' of the elements of a group G onto those of a group G' and if a<-+a' and b<-+b' imply ab<-+ab', then we say that G and G' are isomorphic and write G G'. If we put a' = f(a), then f: G-->G' is a tbijection satisfying f(ab)=f(a)f(b) (a, bE G). More generally, if a mapping f: G--> G' satisfies f(ab)=f(a)f(b) for all a, bEG, then[is called a homomorphism of G to G'. An tinjective (tsur- jective) homomorphism is also called a tmono- morphism (tepimorphism). If there is a surjec- tive homomorphism G-->G', then we say that G' is homomorphic to G. The composite of two homomorphisms is also a homomorphism. If a homomorphism f: G-->G' is a bijection, then f is called an isomorphism. In this case f- I is also an isomorphism, and we have G  G'. For a subgroup H of a group G, the in- jective homomorphism f: H -->G defined by f(a)=a (aEH) is called the canonical injection (or natural injection). For a factor group G/R of G, the surjective homomorphism f: G-->G/R such that aE [(a) (aE G) is called the canonical surjection (or natural surjection). Let f: G--> G' be a homomorphism. Then the image f( G) of f is a subgroup of G, and the kernel H = {aE Glf(a)= e' (the identity of G')} of f is a normal subgroup of G. The equiva- lence classes of the equivalence relation given by f(x) = f(y) are just the co sets of H, and f induces an isomorphism 1: G/H --> f(G). The latter proposition is called the homomorphism theorem of groups. This theorem is extended in the following way: For simplicity let f: G--> G' be a surjective homomorphism. (i) If H' is a normal subgroup of G', then the inverse image H = f-l (H') is a normal subgroup of G, and f induces the isomorphism l: G/H --> G'/H'. (ii) if H is a subgroup and N is a normal subgroup of G, then HN ={hnlhEH,nEN} is a subgroup of G, and the canonical injection H --> H N induces an isomorphism H/H n N -->H N/N. (iii) If Hand N are two normal subgroups of G such that H =:> N, then the canonical surjection G-->G/N induces an isomorphism G/H--> (G/N)/(H/N). Propositions (i), (ii), and (iii) are called the isomorphism theorems of groups, A homomorphism of G to itself is called an endomorphism of G, and an isomorphism of G to itself is called an automorphism of G. The set of automorphisms of G forms a group with respect to the composition of mappings, called the group of automorphisms of G. Given an element a of G, the mapping x-->a- I xa (XE G) yields an automorphism of G which is called an inner automorphism of G. The set of inner automorphisms of G forms a normal subgroup of the group of automorphisms of G, called the 
707 group of inner automorphisms of G, which is isomorphic to the factor group of G modulo its center. The factor group of the group of auto- morphisms of G modulo the group of inner automorphisms of G is called the group of outer automorphisms of G. If a mapping f: G-->G' from a group G to another group G' satisfies f(ab) = f(b)f(a) (a, bE G), then f is called an antihomomor- phism. A bijective anti homomorphism is called an anti-isomorphism. When G = G', f is called an anti-endomorphism or anti-automorphism (e.g.,!:G-->G defined by f(a)=a- 1 is an anti- isomorphism). E. Groups with Operator Domain Let Q be a set and G a group. Suppose that for each eEQ and XEG, the product eXEG is defined and satisfies O(xy) = e(x)(ey). Then Q is called an operator domain of G, and G is called a group with operator domain Q, or simply an Q-group. (We sometimes write X8 instead of ex.) The mapping (e, x)-->Ox from Q x G to Gis called the toperation of Q on G. If G is an Q- group, then any element 0 of Q induces an endomorphism OG:x-->Ox ofG, Conversely, if we are given a mapping e-->e G of Q to the set of endomorphisms of G, then we may regard G as an Q-group. Any group may be regarded as an Q-group with Q equal to the empty set or to the set consisting of the identity automor- phism of G. Thus the general theory of groups can be extended to the theory of groups with operator domain, and in some cases effective use of suitable operator domains can be fruit- ful in the investigation of the properties of groups themselves. ( 2 Abelian Groups; 277 Modules). A subgroup H of an Q-group G is called an Q-subgroup (or admissible subgroup) if ex E H for any OEQ and xEH. In this case, H is also an Q-group. If an equivalence relation R de- fined in G is compatible with the multiplica- tion and also compatible with the operators, namely, if xRx' implies (ex)R(Ox') for any eEQ, then the quotient group G/R is also an Q-group. The equivalence class containing e is an admissible normal subgroup. Conversely, if H is an admissible normal subgroup, then the equivalence relation defined by H is compat- ible with the operators, and the factor group G/H is an Q-group. A homomorphism f: G -->G' of an Q-group G to an Q-group G' is called an Q-homomorphism (admissible homo- morphism or operator homomorphism) if f(Ox) = Df(x) for any OEQ and XEG. If f is an iso- morphism, then f is called an Q-isomorphism (admissible isomorphism or operator isomor- 190G Groups phism). We have the homomorphism theorem and the isomorphism theorems of Q-groups if we consider only admissible subgroups and admissible homomorphisms. F. Sequences of Subgroups Let HI' H 2 ... be an infinite sequence of (normal) subgroups of a group G. If H/i Hi+! (i = I, 2, .. .), then the seq uence is called an ascending chain of (normal) subgroups. If Hi  Hi+l (i = 1,2,...), then it is called a descend- ing chain of (normal) subgroups. If there is no ascending (or descending) chain of (normal) subgroups of G, we say that G satisfies the ascending (or descending) chain condition for (normal) subgroups. These conditions are the same as the ascending (or descending) chain condition in the ordered set of all (normal) subgroups of G ( 311 Ordering C). A group G satisfies the ascending chain condition for subgroups if and only if every subgroup of G is finitely generated. Also, for groups with operator domain we have similar results. The structure of Abelian groups satisfying the ascending (descending) chain condition is completely determined ( 2 Abelian Groups). It is not known whether there is an infinite group satisfying both the ascending and de- scending chain conditions for subgroups. A group satisfying the descending chain con- dition for subgroups has no element of infinite order, but the converse is not true. There is an infinite group which is finitely generated and has no element of infinite order ( 161 Free Groups C). G. Normal Chains A finite sequence G= Go =:> G 1 =:> G 2 =:> ... =:> G r = {e} of subgroups of a group G is called a normal chain if G i is a normal subgroup of G i - I for i = 1,2, " . , r. We call r the length of the chain. The sequence GO/G I , GdG2'"'' Gr-dG r is called the sequence of factor groups of the normal chain. A normal chain G=Ho=:>Hj =:> H 2 =:>... =:> H, = {e} is called a refinement of the chain G = Go =:> G 1 =:>... =:> G r = {e} if every G i appears in this chain. Two normal chains with the same length are called isomorphic if there is a one-to-one correspondence between their sequences of factor groups such that corre- sponding factor groups are isomorphic. Any two normal chains have refinements which are isomorphic to each other (Schreier's refine- ment theorem). A normal chain is called a composition series (or Jordan-Holder sequence) if it consists of different subgroups of G and in 
190H Groups any proper refinement there appear two suc- cessive subgroups which are the same. The sequence of factor groups of a composition series is called a composition factor series, and the factor groups appearing in this series are called composition factors. Any composition factor is a simple group. As a direct conse- quence of the refinement theorem we see that if a group G has a composition series, then the composition factor series is unique up to iso- morphism and the ordering of the factors. (This theorem is due to O. Holder. C. Jordan proved that if G is a finite group, then the set of orders of composition factors is inde- pendent of the choice of composition series. Hence we call the theorem the Jordan-HOlder theorem.) For an Q-group G, if we consider only Q- subgroups, we have definitions and theorems similar to those in this section. When we take the group of inner automorphisms of G as Q, then a composition series of the Q-group G is called a principal series, If we take the group of automorphisms of G as Q, then a composition series is called a characteristic series. An in- finite group G does not always have a compo- sition series, Even if G has a composition series, G may have an infinite normal chain G I c G 2 C ... c G such that each G i is a normal subgroup of G i +1 and U G i = G. In fact, there is a simple group which has such an infinite normal chain (P. Hall). Two groups which have isomorphic composition series are not necessarily isomorphic. A subgroup of a group G is called a subnormal subgroup of G if it may appear in some normal chain. The intersection of two subnormal subgroups is also sub- normal, but their join (i.e., the subgroup gen- erated by both of them) is not necessarily subnormal in an infinite group. The set of subgroups and the set of normal subgroups of a group form tlattices with respect to the inclu- sion relation (for the relationship between these lattices and the group structure  [8]). H. Commutator Subgroups Given two elements a and b of a group G, we call a -I b -I ab = [a, b J the commutator of a and b, The subgroup C generated by all commu- tators in G is called the commutator subgroup (or derived group) of G. The subgroup C is a normal subgroup of G, and the factor group GIC is Abelian. On the other hand, if B is a normal subgroup of G and GIB is Abelian, then B contains C. For two subsets A, B of G, the subgroup generated by the commutators [a, bJ (a E A, bE B) is called the commutator group of A and B and is denoted by [A, B]. If A and B are normal subgroups of G, then C' = [A, BJ is 708 also a normal subgroup of G, and AIC' com- mutes with BIC' elementwise in the factor group GIC'. Furthermore, [A, BJ is the mini- mal normal subgroup with the property. The subgroup [G, GJ is the commutator subgroup ofG. If the commutator subgroup of G is Abelian, then G is called a meta-Abelian group. If a group G has a normal chain G( = Go) =:> G I =:> G 2 ( = {e} ) of length 2 and the factor groups GIG!, G I /G2 are Abelian, then G is meta- Abelian. Meta-Abelian groups are special cases of solvable groups, discussed in Section 1. I. Solvable Groups Suppose that we are given a series of subgroups G i (i = 0, I, 2, .. . ) of G such that G = Go and [G"G;]=G i + l . Then we have a normal chain G=G O =:>G I =:>G 2 =:>.... IfGr={e} for some r, then G is called a solvable group. For the normal chain G (= Go)=:> G l =:> ...G r (= {e}) the factor groups GdG i +! (i=0, I, ...,r-I) are all Abelian. A finite group G is solvable if and only ifG has a composition series G=Ho=:> HI =:> H 2 =:>... =:> Hs= {e} such that the factor groups HdHi+l (i=0, I,... ,s-I) are all of prime order. An tirreducible algebraic equa- tion over a field of tcharacteristic 0 is solvable by radicals if and only if its Galois group is sol vable ( 172 Galois Theory). J. Nilpotent Groups The sequence of subgroups G = Go =:> G! =:> G 2 =:>... defined inductively by setting G r = [G, Gr-IJ (r= 1,2,.,.) is called the lower central series of G. If G n == {e} for some n, then G is called a nilpotent group, and the least number n with G n = {e} is called the class of the nilpotent group G. A nilpotent group is solv- able. Let ZI be the center of G, Z2/Z1 be the center of GIZ I , and so on. Then we have a sequence of subgroups Zo = {e} c ZI C Z2 C ..., called the upper central series of G. A group G is nilpotent if and only if Zm = G for some m, and the least number m with Zm = G is the class of G. For the subgroups G r and Zr (r = 1,2,...), we have [Gi-I,ZJ={e}. IfG is a tLie group, then G is nilpotent if and only if the corresponding tLie algebra 9 is nilpotent, i.e., gn=o. K. Infinite Solvable Groups The concepts of solvability and nil potency are generalized in several ways for infinite groups, For instance, a group G is called a generalized solvable group if any homomorphic image of G 
709 which is unequal to {e} contains an Abelian normal subgroup unequal to {e}, and G is called a generalized nilpotent group if any homomorphic image (#{e}) of G has center uneq ual to {e}. These definitions coincide with the previous ones for finite groups but not for infinite groups [7]. L. Direct Products Let G I , "', G n be a finite number of groups. The set G of all elements (x I, ..., x n ) with XiE G i (i = 1, ... , n) is a group if we define the prod- uct of two elements x=(x I ,..., x n ) and Y= (YI' ...,Yn) to be xY=(XIYI' ...,xnYn)' We call G the direct product of gro ups G I, .. . , G n and write G = G I X ... x G n . If ej is the identity element of G j , then e = (e I' ... ,en) is the identity element of G. The mapping (Xl>"" xn)-->x i from G to G i is a surjective homomorphism, called the canonical surjection. The subgroup H i = {(e l ,..., e i - I , Xi' e i + I ,..., en)1 xiEG i } is isomorphic to G i . The subgroups Hi (i = 1, ." , n) satisfy the following conditions: (i) Hi is a normal subgroup of G. (ii) Hi commutes with Hj elementwise if i # j. (iii) Any element of G can be written uniquely as the product of elements of HI,"" Hn. Conversely, if a group G has subgroups HI, ... , Hn satisfying these three conditions, then G is isomorphic to HI x ... x H.. In this case we also write G = HI X ... x Hn, and we call this a direct decompo- sition of G. Each Hi is called a direct factor of G. Conditions (i), (ii), and (iii) are equivalent to condition (i), (ii') G = HI H 2 ... Hn, and HI'" Hi-I n H i = {e} (i=2,..., n). A group G is called indecomposable if G cannot be decomposed into the direct product of two subgroups unequal to {e}, and com- pletely reducible if G is the direct product of simple groups. If G satisfies the ascending or descending chain condition for normal sub- groups, then G can be decomposed into the direct product of indecomposable groups. Such a decomposition is not unique in general, but if G has two direct product decompo- sitions G=G l x... x Gm=Hl x... x Hn, where G i and Hj are indecomposable and not equal to {e}, then m=n and the factors G i are iso- morphic to the factors Hj for some j; moreover, if G I corresponds, say, with HI' then we have G = HI X G 2 X ... x G m . This fact was first stated by 1. H. M. Wedderburn, and a com- plete proof of the theorem was given by R. Remak and O. Schmidt. Later W. Krull ex- tended it to more general groups (with opera- tor domain), and we call it the Krull-Remak- Schmidt theorem. O. Ore formulated it as a theorem on tmodular lattices. For an infinite number of groups G A (AE A) 190 M Groups we define the direct product II AEA G). of these groups similarly. The set of all elements (... ,x),...) (X).EG A ) such that almost all X A (i.e., all except a finite number of A) are identity elements is a subgroup of the direct product, called the direct sum (or restricted direct prod- uct) of {G A }. M. Free Products Given a family of groups {G)} AEA, we define the most general group G generated by these groups, called the free product of {G,1}, to- gether with canonical injections.fA: G A -->G. Let S be the disjoint union of the sets {G A } AEA, and regard G A as a subset of S. A word is either void or a finite seq uence a I, a 2 , "', an of elements of S, and we denote the set of all words by W. The product of two words wand w' is defined by connecting w with w' so that the tassociative law holds. We write w;>-w' when two words wand w' satisfy one of the following two conditions: (i) The word w has successive members a, b which belong to the same group G A , and the word w' is obtained from w by replacing a, b by the product ab. (ii) Some member of w is an iden- tity element, and w' is the word obtained from w by deleting this member. For two words w and w', we write w = w' if there is a finite se- quence of words w = W o , WI' ..., W n = w' such that for each i (l.s;i.s;n), either Wi-I;>-Wi or Wi;>-W i - I . This relation is an equivalence rela- tion and is compatible with the multiplica- tion. Thus we may define a multiplication for the quotient set G of W by this equivalence relation, and then G is a group whose identity element is the equivalence class containing the void word. Any xEG A is regarded as a word, and we have an injective homomorphism fA: G A --> G by assigning the corresponding class to each element of G)., The group G is called the free product of the system of groups {GALEA' and fA is called the canonical injection. The free product G of {G).} AEA is characterized by the following universal property: Given a group G' and homomorphisms fi: G,1-->G' (},EA), we can find a unique homomorphism g: G --> G' such that gaL = fi. The free product is the dual concept of direct product and is also called the tcoproduct ( 52 Categories and Functors). If each G). is an infinite cyclic group generated by a A , then the free product of the G A is the tfree group generated by {a,1} ( 161 Free Groups). The concept of free product is generalized in the following way, Let H be a fixed group. We consider the famil y of pairs (G ,j), where G is a group andj:H-->G is an injective homomor- phism. A homomorphism of pairs.f: (G,j)--> 
190N Groups (G',j') is defined to be a group homomorphism f: G-->G' such that Iaj=j'. For a given family of pairs {(GA,jA)}, we have the amalgamated product (G,j) of the family and the canonical homomorphism fA: (G A,j A) -->(G ,j), which is characterized by the following universal prop- erty: Given a pair (G'J) and homomorphisms Ii: (GA,jA)-->(G'J), we have a unique homo- morphism g:(G,j)-->(G',j') such that gafA = fi. If H = {e}, then the amalgamated prod- uct is the same as the free product. Now f), is an injection. If we regard G A as a subgroup of G, then G is generated by the subgroups G A and GAn G=h(H)=j(H) (A#J1). The notion of the amalgamated product is useful in constructing groups with interesting properties. For instance, we have a group whose non identity elements are all conjugate (B. H. Neumann and G. Higman), and a group generated by a finite number of elements such that its homomorphic image ( # {e} ) is always an infinite group (Higman) so that we have an infinite simple group generated by a finite number of elements. N. Extensions Let Nand F be groups. A group G is called an extension of F by N if G has a normal sub- group N isomorphic to Nand G j N  F, The problem of finding all extensions was solved by Schreier (Manatsh. Math. Phys., 34 (1926); Abh. Math. Sem. Univ. Hamburg, 4 (1928)). Suppose that (1) to each O"E F there corre- sponds an automorphism Sa of N; (2) there exist elements c a " (0", rEF) of N such that Sa(S,(a)) = ca,,(sa,(a))c;,; (aE N); and (3) ca.,ca"P =sa(c,.p)ca.'P' Then the set G of all symbols aSa (aE N, O"E F) is an extension of F by N if we define multiplication by aSa' bs,== (asa(b)ca,,)sa,' In fact, the set of all elements a=ac1,\sl (aEN) is a normal subgroup N of G such that GjN  F. Any extension can be obtained in this way. A system (sa, C a , ,) satisfying (l), (2), and (3) above is called a factor set belonging to F. Two factor sets (S.,c a ,,) and (ta,d a ,,) are said to be associated if there exist elements aa (O"E F) of N such that ta(a)=sa(aaaa;l) and d a .,= aa(sAa,))c a ,a;,l. In this case, two extensions determined by these factor sets are isomorphic. If (s", c a ,,) is associated with (t a , d a ,,) (d a " = I for any 0", rEF), then we say that the correspond- ing extension is a split extension. In this case, the extension G contains a subgroup F iso- morphic to F, and G=FN, FnN={e}. We call such an extension a semidirect product of Nand F. If N is Abelian, then condition (2) is simply sa(s,(a))=sa,(a), since the only inner automor- phism of N is the identity mapping. The con- 710 ditions of associated factor sets and split ex- tension are also simplified. If N is contained in the center of G, then G is called a central extension of N. O. Transfers Let H be a subgroup of finite index in G and gi (i = I, "', h) be representatives of the right co sets of H. For bEHg i we write gi=7]. Then for x EGan element x = H'II7=1 gi X(gixr l of HjH' is determined uniquely (independent of the choice of representatives), where H' is the commutator subgroup of H. The correspon- dence G' x-->x yields a homomorphism of GjG' to HjH', which is called the transfer from GjG' to HjH'. P. Generalizations The concept of group can be generalized in several ways. A set S in which a multiplication (a,b)-->ab satisfying (ab)c=a(bc) (the associa- tive law) is defined is called a semigroup. If S is a commutative semigroup in which ax = bx implies a = b (the cancellation law), then Scan be embedded in a group G so that the multi- plication in S is preserved in G and any x EGis the quotient of two elements of S: x = a-I b= ba- l (a, bES), Such a group G is determined uniquely by S. We call it the group of quotients of S. The notion of semigroup is obtained by taking only associativity from the group axioms. On the other hand, if Q is a set with a law of composition (a,b)-->ab which is not necessarily associative but satisfies the con- dition that any two among a, b, c in the equa- tion ab = c determine the third uniquely, then Q is called a quasigroup. A quasigroup with an identity element e such that ea = ae = a for every element a is called a loop. For loops, we have an analog of the structure theory of groups (R. H. Bruck, Trans, Amer. Math. Sac" 60 (1946)). If we give up the possibility of forming products for all pairs of elements or the uniqueness of the product in the axioms for groups, then we have the following generaliza- tions of groups. A set M with multiplication under which to any elements a, bEM there corresponds a nonempty subset ab of Mis called a hypergroupoid. Moreover, if the asso- ciative law (ab)c = a(bc) holds and for any elements a, bEM there exist x, YEM such that bExa, bEay, then M is called a hyper- group. A set M is called a mixed group if(l) Mean be partitioned into disjoint subsets Mo, M I' Mz,"'; (2) for aEM o , bEM i (i=0, 1,2,...), 
711 elements ab, a\b of M i are defined such that a(a\b)=b; (3) for b, cEM i , an element b/c of Mo is defined such that(b/c)' c= b; and (4) the associative law (ab)c=a(bc) (a,bEMo,cEM) holds (A. Loewy, 1927), A set M is called a groupoid if (1) M can be partitioned into disjoint subsets M ij (i, j = 1,2,...); (2) for aEMij and bEMjk> an ele- ment abEM ik is defined; (3) for aEMij and bEMik, an element a\bEM jk is defined such that a(a\b)= b; (4) for aE M ij and bE M kj, an element a/bE M ik is defined such that (a/b)b =a; and (5) for aEM ih bEMjk, and cEM k1 , the associative law a(bc)=(ab)c holds (H. Brandt, 1926), These generalized concepts also have some practical applications ( 2 Abelian Groups; 13 Algebraic Groups; 60 Classical Groups; 69 Compact Groups; 92 Crystallo- graphic Groups; 122 Discontinuous Groups; 151 Finite Groups; 161 Free Groups; 243 Lattices; 249 Lie Groups; 277 Modules; 362 Representations; 422 Topological Abelian Groups; 423 Topological Groups; 437 Unitary Representations). Q. History The concept of the group was first introduced in the early 19th century, but its rudiments can be found in antiquity; in fact, it was virtually contained in the concept of motion or trans- formation used in ancient geometry. From the time it took explicit form in the late 19th cen- tury, it has played a fundamental role in all fields of mathematics. In their study of algebraic equations in the late 18th century, 1. L. tLagrange, A. T. Van- dermonde, and P. Ruffini saw the importance of the group of perm utations of roots; using this idea N. H. Abel showed that a general equation of degree  5 cannot be solved alge- braicalIy, A. L. tCauchy studied the group of permutations of roots for its own interest, but a complete description of the relationship between groups and algebraic equations was first given by E. tGalois. C. Jordan developed a detailed exposition of the theory given by Abel and Galois in his TraiN? des substitutions (1870) [1]. Up to that time, a group meant a permutation group; the axiomatic definition of a group was given by A. Cayley (1854) and L. Kronecker (1870). F. tKlein emphasized the significance of group theory in geometry in his tErlangen program (1872), and M. S. tLie developed the theory oft Lie groups in the 1880s. In 1897, W. Burnside published his Theory of groups [3], whose second edition (1911) is one of the classics in group theory and is still valuable. Since 1896, G. Frobenius [2] and others have developed the theory of 191 G-Structures representation of groups by matrices ( 362 Representations). By that time, the theory of finite groups had acquired all its essential features. Among the branches of abstract algebra, the theory of groups was the first to develop; it led to the progress of abstract alge- bra in the 1930s. Since the latter haIr of that decade, the theory of finite groups has been developed further; there has been increased interest in the theory, and many significant results have been obtained, especially since 1955 ( 151 Finite Groups). References [IJ C. Jordan, Traite des substitutions et des equations algebriques, Gauthier-Villars, 1870 (Blanchard, 1957). [2] G. Frobenius, Gesammelte Abhandlungen I-III, Springer, 1968. [3] W. Burnside, Theory of groups of finite order, Cambridge Univ. Press, second edition, 1911. [4] H. Weyl, The classical groups, Princeton Univ. Press, revised edition, 1946. [5] B. L. van der Waerden, Algebra, Springer, I, seventh edition, 1966; II, fifth edition, 1967. [6] B. L. van der Waerden, Gruppen von linearen Transformationen, Erg. Math., Springer, 1935 (Chelsea, 1948). [7] A. G. Kurosh, The theory of groups I, II, Chelsea, 1960. (Original in Russian, 1953.) [8J M. Suzuki, Structure of a group and the structure of its lattice of subgroups, Erg. Math., Springer, 1956. [9J M. HaIl, The theory of groups, Macmillan, 1959. [10] H. Zassenhaus, Lehrbuch der Gruppen- theorie I, Teubner, 1937; English translation, The theory of groups, Chelsea, 1958, [11J A. Speiser, Die Theorie der Gruppen von endlicher Ordnung, Springer, third edition, 1937. [12] W. Specht, Gruppentheorie, Springer, 1956. [13J R. H. Bruck, A survey of binary systems, Erg. Math., Springer, 1958. [14] A. H. Clifford and G. B. Preston, The algebraic theory of semigroups, Amer. Math. Soc. Math. Surveys, I, 1961; II, 1967, [15] E. S. Liapin, Semigroups, Amer. Math. Soc. Trans!. of Math. Monographs, 1963. 191 (VII.8) G-Structures Differential geometry studies differentiable manifolds and geometric objects or structures 
191 A G-Structures on them. Among the geometric structures, the Riemannian and complex structures, with their contacts with other fields of mathematics and with their richness in results, occupy a central position in differential geometry. Perhaps it is impossible to say precisely what a differential geometric structure is or should be. However, the notion of G-structure allows a unified description of many of the interesting known geometric structures, such as Riemannian and complex structures. A. The Notion of G-Structures Let M be an m-dimensional Cae-manifold, and let r: T(M)-->M be its ttangent bundle. For xEM, TAM)=r-l(x) is the vector space of tangent vectors at x. Let n:F(M)-->M denote the tframe bundle of M. It is a GL(m; R)- principal bundle on M and Fx(M)=n-l(x) is the set of linear isomorphisms (called frames at x) ofR m onto Tx(M). Here GL(m;R) is the general linear group of R m , Hence Fx(M) can be naturally identified with the set of ordered bases of Tx(M). Write o=(e l ,..., em) for the canonical form of F(M), which is the R m _ valued I-form on F(M) defined by e(v)= p-l(n*(v)) for any VE Tp(F(M)). Given a dif- feomorphism f: M  N of M onto another C' -manifold N, we can naturally define the bundle isomorphism f(1):F(M)F(N) by f(1)(p)={*op. Let G be a tLie subgroup of GL(m; R). A principal G-subbundle np: P--> M of the frame bundle n:F(M)-->M is called a G-structure over M. Thus P is a regular submanifold of F(M) satisfying the following three conditions: (AI) n(P)= M. (A2) for pEP and O"EGL(m;R), pO"EP if and only if O"E G. (A3) for any XE M, there exist an open neighborhood U of x and a Coo-mapping s: U --> P such that n(s(y)) = y for any y E U, Conversely, if a regular submanifold P of F(M) satisfies the above three conditions (AI), (A2), and (A3), then n p = nip: P-->M is a G-structure over M. When G is closed, then condition (A3) is automatically satisfied, The restriction of e onto P is called the canonical form of P and is also denoted bye. For an open subset U of M, the restriction PI U = n;1 (U) is a G-structure over U. Let np: P-->M and n Q : Q-->N be G-structures over M and N, respectively. A diffeomorphism f: M  N is called a G-isomorphism of Ponto Q iff(1)(P)=Q. When such anfexists, we say that the G-structures P and Q are equivalent. A G-isomorphism of P Onto itself is called a G- automorphism of P. Write Aut(M, P) for the group of G-automorphisms of P. 712 Let 9 be the Lie algebra of G. Since G acts on P from the right, we have the natural Lie algebra homomorphism l:g-->r(P, T(P)). Here r(P, T(P)) is the Lie algebra of Coo-vector fields on P. Actually I is injective, and we write A* for I(A) (AEg). We call A* the fundamental vector field corresponding to A. For xER m , define po(x)EFx(Rm) by Po(x)«v j ))= Lj1 vj(ojoxj)x, where Xl, ..., x m are the natural coordinates of Rm. Any open subset D of Rm has the natural G-structure P(D, G) defined by P(D, G)= {Po (x). O"E Fx(R m ); XE D, O"E G}. We say that a G-structure np: P-->M over M is integrable if for each xEM, there exists a local coordinate system (U, <p, D) around x such that <p is a G-isomorphism of P I U onto P(D, G). Equivalently, if we set <p =(x l , ..., x m ), then the frame {(Djox\, ..., (ojDxm)y} belongs to P for each yE U. B. Examples The following examples (B 1 )-(B7) show that some classical geometric structures can be treated uniformly from the viewpoint of G- structures. (Bl) An absolute parallelism on M is, by definition, a system {XI'"'' Xm} of Coo-vector fields on M such that at each point XEM {XI (x), ..., Xm(x)} is a basis of T(MJx. Clearly this is equivalent to giving an {e}-structure np: P-->M, where e stands for the identity matrix of GL(m; R). Then P is integrable if and only if [Xi' Xa =0, 1.s; i,j .s;m. In this case, Aut(M, P) is a finite-dimensional tLie trans- formation group on M. (B2) Let E be a k-dimensional tdistribution, namely, E is a k-dimensional vector subbundle of T(M). Write R m = R k EB R m - k . Set GL(k, m; R) = {O"EGL(m;R);O"(Rk)cR k } and P={pEF(M); p(R k ) c E}. Then P is a GL(k, m; R)-structure over M. This gives a bijective correspondence between the k-dimensional distributions on M and the GL(k, m; R)-structures on M. More- over, P is integrable if and only if E is tin- volutive (Frobenius theorem). (B3) Let 9 be a Riemannian metric on M. Set O(m) = {O"EGL(m; R); '0"0" = e}. Define P(g) by P(g)= {(VI'"'' vm)EF(M); g(v i , V j ) = Cid. Then P(g) is an O(m)-structure on M. This gives a bijective correspondence between the Riemannian metrics on M and the O(m)- structures over M. Then Aut(M, P(g)) is the group of tisometries of g, and a finite- dimensional Lie transformation group on M. Moreover, P(g) is integrable if and only if 9 is flat in the sense that the Riemannian curvature tensor of 9 is zero. (B4) Two Riemannian metrics gl and g2 are 
713 called conformally equivalent if there exists a COO-function p>O with gl =pg2' This is an equivalence relation among the Riemannian metrics on M. An equivalence class {g} is called a conformal structure on M. Set CO(m) = {aA; AEO(m), aER -{O}}. Define P( {g}) by P( {g})= {(VI'"'' vm)EF(M); g(v l , vJ= a£5 ij , aER - {O} }. Then P( {g}) is a CO(m)- structure on M. This gives a bijective cor- respondence between the conformal struc- tures on M and the CO(m)-structures over M. Then Aut(M, P( {g})) is the group of tcon- formal transformations of {g} and is a finite- dimensional Lie transformation group. More- over P( {g}) is integrable if and only if the conformal structure {g} is conformally flat in the sense that at each point xEM, there exists a local coordinate system (Xl, ..., x m ) around x such that g( ojox\ 0 jox j ) = p£5 ij , where p is a positive function. (B5) Assume that m is even, say m= 21. By an almost symplectic structure on M, we mean a differential 2-form Q of maximal rank, i.e., Q /\ ... /\ Q (I times) never vanishes. Let A be the standard skew-symmetric bilinear form on R 21 defined by A«ut(vj))=(UIV2-U2VI)+... +(u 2 1-1 V21_U21V21-1). The symplectic group Sp(l; R) is defined by Sp(l; R) = {O"E GL(21; R); A (O"u, O"v)= A(u, v), U, vER 21 }. Set P(Q)= {pEF(M); p*A=Qx,x=np(p)}. Then P(Q) is an Sp(l; R)-structure over M. This gives a bijective correspondence between the almost symplectic structures on M and the Sp(l; R)- structures over M. Then Aut(M, P(Q)) is the group of symplectic transformations of Q, which is never a finite-dimensional Lie trans- formation group, Moreover P(Q) is integrable if and only if Q is a symplectic structure, i.e., dQ=O. (B6) Assume that m is even, say m= 21. We identify C 1 with R 21 as a real vector space. Then GL(m; C) is a Lie subgroup of GL(21; R). Let J be an almost complex structure on M. Set P(J)= {pE F(M);Jp(u) = p(iu), uER 21 =Cl}. Then P(J) is a GL(l; C)-structure on M. This gives a bijective correspondence between the almost complex structures on M and the GL(l; C)-structures over M. Then Aut(M, P(J)) is the group of J-analytic transformations on M. Furthermore, Aut(M, P(J)) is a finite- dimensional Lie transformation group when M is compact. Moreover P(J) is integrable if and only if J is a complex structure, (B7) It is easy to see that there is a bijec- tive correspondence between the SL(m; R)- structures over M and the set of volume ele- ments on M. Any SL(m; R)-structure over M is always integrable and Aut(M, P) is the group of diffeomorphisms preserving Q, which is never a finite-dimensional Lie transformation group. 191 D G-Structures C. Structure Functions Let G be a Lie subgroup of GL(m; R) and 9 the Lie algebra of G. For convenience, we write V for R m and V* for the dual space of V. Then V@ /\ 2 V* can be considered as the space of skew-symmetric bilinear mappings from V x V into V, and 9 @ V* can be identified with the space of linear mappings from V into g. Define a linear mapping 0: 9 @ V* --> V@ /\ 2 V* by (oT)(u, v)= - T(u)v+ T(v)u for TEg@ V*, U,VEV, Put H 2 . 1 (g)= V@ /\2 V*jo(g@ V*). Let np: P --> M be a G-structure over M and e the canonical form of P. Take any frame p in P. An m-dimensionallinear subspace H of Tp(P) is said to be a horizontal subspace of P at p if 8 p : H --> V( = R m ) is a linear isomor- phism. Then we write fH for 8 p - l : V-->H. Write tJ(J;,(P)) for the set of horizontal subspaces of Pat p. For HEtJ(Tp(P)), define c(p,H)EV@ /\ 2 V* by c(p, H)(u, V) = d8(fH(U)JH(V)) for u, VE V. Then we have c(p,H I )-c(p,H 2 )EO(g@ V*) for HI' H 2 E f>(Tp(P)). Therefore we have a well-defined element c(p)EH 2 , I(g) by c(p)= {c(p, H)}. We call the mapping c: P --> H 2 , I (g) the structure function of the G-structure np:P-->M. Let Q-->N be another G-structure over N andf:MN be a G-isomorphism of Ponto Q. Then we have c(f(1)(p))=c(p) for pEP. In particular, it is easily seen that if P is inte- grable, then c= O. However, the converse is not true in general. D. Prolongation of Linear Lie Algebras and Groups Let K denote either R or C. Let 9 be a Lie subalgebra of gl(m; K). For k= 0,1,2,..., let gk be the space of K-symmetric multilinear mappings t:Kmx...xKm-->Km '-----y----J (k+ 1) times such that for each fixed u I' .. . , Uk E K m , the linear transformation VE Kmf-> t(v, u l , ..., UdE K m belongs to g. In particular, go = g. We call gk 
191 E G-Structures the kth prolongation of g. If gk = 0, then gH I = gH 2 = ... = O. The first integer k such that gk = 0 is caIled the order of g. If gk # 0 for all k, then 9 is said to be of infinite type. When 9 contains a matrix of rank I as an element, then 9 is of infinite type. When K = C, the converse is also true [3]. However, when K=R, the coverse is not true in general. In fact, if we consider gl(l; q as a real Lie subalgebra of gl(21; R) (example (B6)), then gl(l; q is of in- finite type, having no matrix of rank I in it. In general, a Lie subalgebra 9 of gl(m; R) is said to be of elliptic type if 9 contains no matrix of rank I as an element. We consider gl as an Abelian Lie subalgebra of gl(K m EB g) by the correspondence tEg l f-> tE gl(K m EB g), where t is defined by t(u)=t(.,u) for uEK m , t(A)=O for A Eg. More generally, we consider gHI to be an Abelian Lie subalgebra of gl(K m EB go EB ". EB gk) by virtue of the correspondence tEg HI f-> t E gl(K m EB go EB, ,EB gd, where t is defined by t(U)=t(',...,',U)Eg k for uEK m , t(A)=O for A E 90 EB ... EB 9k' Identifying gl«K m EB go EB ... EB gk-d EB gd with gl(K m EB go EB .., EB gk), we have the important identity (DI) (gkh = gk+!, k = 0,1,2, .... Let G be a Lie subgroup of G L(m; R) and 9 the Lie subalgebra of gl(m; R) corresponding to G. For k= 1,2, "', let G k be the connected Abelian Lie subgroup of G L(R m EB go EB ... EB gk-d corresponding to the Abelian Lie sub- algebra gk; namely, G k consists of the linear mappings O"(t) (tEg k ) defined by O"(t)(u)=u+t(.,,,,,.,u) foruER m , O"(t)(A) = A for A Ego EB ... EB gH' We call G k the kth prolongation of G. We say that G is of finite (resp. infinite) type if 9 is of finite (resp. infinite) type. We say that G is of elliptic type if 9 is. From (DI), we have (D2) (Gk)1 =G HI . We know all irreducible Lie subalgebras 9 of gl(m; K) with order  2 [3,6]. (D3) An irreducible Lie subalgebra 9 of gl(m; q is of infinite type if and only if 9 is one of gl(m;C), sl(m;C), cSP(;C), sP(;C), where csp(m/2; q= C EB sp(m/2, C). 714 (D4) An irreducible Lie algebra 9 of gl(m; R) is of infinite type if and only if 9 is one of gl(;C), SI(;C), cSP(;C). sp (; C ), gl(m;R), sl(m;R), CSP(;R). SP(;R), where csp(m/2; R) = R EB sp(m/2; R), (D5) Let S be an m-dimensional irreducible Hermitian symmetric space of compact type, which is different from the complex projective space. Let L(S) be the identity connected com- ponent of the group of biholomorphic trans- formations of S. Take any point OE S and set Lo(S)= {O"E L(S); 0"(0)= o}. Let g(S) be the linear isotropy Lie subalgebra of gl(m; q cor- responding to L(S). Then g(S) is an irreduc- ible Lie subalgebra of gl(m; q of order 2. Con- versely, every irreducible Lie subalgebra 9 of gl(m; q of finite type with order  2 is equal to g(S) as given above. (D6) Let S be an m-dimensional irreducible symmetric space of compact type. Assume that S admits a finite-dimensional connected Lie transformation group L(S) which strictly con- tains the connected component of the group of isometries of S. Take any point OES, and set Lo(S)= {O"EL(S); O"(o)=o}. Let g(S) be the linear isotropy Lie subalgebra of gl(m; R) cor- responding to Lo(S). If (S, L(S)) #(sphere, pro- jective transformation group) or (S, L(S)) # (complex projective space, complex projective transformation group), then g(S) is an irre- ducible Lie subalgebra of gl(m; R) of order 2. Conversely, every irreducible Lie subalgebra 9 of gl(m; R) of finite type with order  2 is equal to g(s) as given above. For example, if we take an m-dimensional sphere as S and the group of conformal transformations of S as L(S), then g(s) = co(m). E. Prolongation of G-Structures Let G be a Lie subgroup of GL(m; R) and let 9 be the Lie subalgebra of gl(m; R) correspond- ing to G. We choose once and for all a linear subspace C of V@ /\ 2 V* such that V@ /\2 V*=o(g @ V*)EB C. In general there is no natural way of choosing such a C. Let np: P-->M be a G-structure on M. For each horizoantal subspace H of P at p (i.e., HE e(Tp(p)), we define the frame (p, H)E Fp(P) 
715 to be a linear isomorphism (p, H): R m EB g--> Tp(P) given by (p,H)(vEBA)=fH(v)EBA; for vERm,AEg, where A * is the fundamental vector field on P induced from A E g. Let PI = {(p, H)E F(P); H EYf(Tp(P)), c(p, H)E q. Then PI -->P is a GI-structure over P. We call PI-->P the first prolongation of P. The kth prolongation P k of P is defined inductively by P k = (Pk-dl = the first prolongation of P k - I . From (D2), P k is a Gk-structure over P k - I . Take any fEAut(M,P). It can be proved that f(1)E Aut(P, Pd. Since P k == (Pk-Ih, we can define j<k)E Aut(P k _ l , P k ) inductively by j<k) == (/(k-I»)(1). By the correspondence ff-> j<k), we can consider Aut(M, P) as a subgroup of Aut(P k _ l , P k ). If G is connected, then (E1) Aut(M, P)= Aut(Pk-l' P k ), k= 1,2, .... Let Q-->N be another G-structure. By a similar argument to that above, we see that P-->M and Q-->N are equivalent if and only if Pk--> P k - I and Qk-->Qk-I are equivalent. In [1], E. Cart an studied general equivalence problems of two G-structures. In that problem the above prolongation procedure plays an important role. F. Automorphisms of G-Structures of Finite Type S. Kobayashi proved the following: (Fl) Theorem [4]. Let P-->M be an {e}- structure over M. Then Aut(M, P) is a finite- dimensional Lie transformation group on M such that dim Aut(M, P).s;dimM. More precisely, for any point xEM, the mapping O"EAut(M, P)f->O"(x)EM is injective, and its image {O"(x); O"E Aut(M, P)} is a closed sub- manifold of M. The submanifold structure on this image makes Aut(M, P) into a finite- dimensional Lie transformation group on M. Now let G be a Lie subgroup of GL(m; R) of finite type, say Gk={e}. Then Pk-->P k - I is an {e}-structure, From theorem (FI) above, it follows that Aut(P k _ l , Pd is a finite- dimensional Lie transformation group. As explained in Section E, Aut(M, P) is a sub- group of Aut(P k _ l , P k ). Clearly Aut(M, P) is closed in Aut(P k _" P k ). Since every closed subgroup of a Lie group is again a Lie group, we have the following: (F2) Theorem. Let P --> M be a G-structure over M. If G is of finite type, then Aut(M, P) is a Lie transformation group of dimension .s;dim(R m EB 9 EB 91 EB ... EB gk-l)' 191 G G-Structures G. Automorphisms of G-Structures of Elliptic Type The following theorem of R. Palais allows us to prove that the automorphism groups of many general geometric structures are finite- dimensional Lie transformation groups. (G1) Theorem [5]. Let L be a group of Coo_ transformations of a Coo-manifold M. Let 1 be the set of all vector fields X on M which gener- ate global I-parameter groups tp,=exptX of transformations of M such that <p,E L. If the set 1 generates a finite-dimensional Lie alge- bra of vector fields on M, then L is a finite- dimensional Lie transformation group, and I is the Lie algebra of L. Let P-->M be a G-structure over M. For any representation p: G-->GL(V), we write E(p) for the associated vector bundle of P with respect to P, Let PI: G-->GL(gl(m; R)) be the represen- tation defined by PI(O")A=O"AO"-I for O"EG, AEgl(m;R). Let P2:G-->GL(g) be the represen- tation defined by P2(0")A = O"AO"-I for O"E G, A E g. Then P2 is a subrepresentation of PI' since pdO")A = P2(0")A for O"E G, A Eg. We re- mark that E(pd=Hom(T(M), T(M)). We write g(M) for E(P2)' Then g(M) is a vector su bbundle of Hom(T(M), T(M)). We write F for the quotient vector bundle Hom(T(M), T(M))jg(M). Let <1>: Hom(T(M), T(M))-->F be the natural projection. We fix an affine con- nection V on M which preserves P. We write T for the torsion tensor field of V, For each vector field X Er(M, T(M)), we define a Coo_ section VX + T(X,') in r(M,Hom(T(M), T(M)) by UE T(M)xf-> VuX + T(X(x), U)E TAM), Then define a first-order linear differential operator D:r(M, T(M))-->r(M,F) by D(X)=<1>(VX + T(X,')). It is easy to see that (G2) I(M, P) c ker D, where I(M, P) is the Lie algebra of all vector fields X Er(M, T(M)) which generate global I-parameter groups <p, = exp tX of transforma- tions of M such that <p,EAut(M,P). Then we can show: (G3) D is an elliptic operator if and only if G is of elliptic type. Now suppose M is compact. Then from the standard fact on linear elliptic operators on compact manifolds, we know the dimension of ker D is finite if D is elliptic. Thus from (G2), (G3), and theorem (Gl), we have the following: (G4) Theorem [7]. Let P-->M be a G- structure on an m-dimensional compact mani- 
191 H G-Structures fold M. If G is of elliptic type, then Aut(M, P) is a finite-dimensional Lie transformation group. H. Local Equivalence Problem Let np:P-->M and nQ:Q-->N be two G- structures. We say that P-->M and Q-->N are locally equivalent if the following holds: (HI) For arbitrarily given (p,q)EP x Q, there exists a local G-isomorphism f: U  V of P I U onto Q I V such that f(1)(p)= q, where U (resp. V) is an open neighborhood of np(p) (resp. nQ(q). From now on, we assume that the structure function C p (resp. c Q ) of P (resp. Q) is constant. If such an f as in (HI) exists, we must have c Q (f(1)(p'))=c p (p') for p'EPI U. Therefore C p and cQ must be the same constant. Cartan reduced the local equivalence problem be- tween P and Q to that of certain differential systems on P x Q. In fact, let 8 p = (8 1 , ... ,8 m ) and OQ=(tj;I, "', !jJm) be the canonical forms of P and Q respectively. Let ex: P x Q --> P, [3: P x Q-->Q be the canonical projections such that ex(p, q) = p, [3(p, q) = q. Let I: be the differen- tial system on P x Q generated by {ex * 8 1 - [3 * !jJ', ...,ex*om_[3*tj;m}. If X is the m- dimensional regular submanifold of P x Q given by the graph of JlI) in (HI), i.e., X = {(p',Jl1)(p')E P X Q; p' E P I U}, then X is an m-dimensional integral submanifold of I: satisfying the following conditions: (H2) (p,q)EX, (H3) ex * 8 1 , .., ,ex * 8 m are linearly indepen- dent on 7(p,q)(X). Conversely any m-dimensional integral sub- manifold of I: satisfying (H2) and (H3) is the graph of a local G-isomorphism f required in (HI). Therefore the local equivalence problem between P and Q is equivalent to the problem of finding an m-dimensional integral submani- fold X of I: satisfying (H2) and (H3) for any (p, q)EP X Q. We say that G is involutive if there exist linear subspaces 0 = V o C VI C '" C V m = R m such that (H4) dim=k (k=O,...,m), m (H5) dimg l = L dimg(), kO where 9 is the Lie algebra of G and g() = {AEg; A(u)=O for UE }. Now we have: (H6) Proposition. Let P-->M and Q-->N be two G-structures such that their structure functions are constant and equal. The dif- ferential system I: on P x Q defined as above is involutive at any m-dimensional integral 716 element E of I: on which ex * 8 1 ,..., ex * 8 m are linearly independent if and only if G is involutive. Combining proposition (H6) above with the classical tCartan-Kiihler theorem, we obtain the following theorems: (H7) Theorem ([1], also see [8]), Let P-->M and Q-->N be two real analytic G-structures over M and N, respectively, such that their structure functions are constant and equal. Suppose G is involutive. Then P-->M and Q-->N are locally equivalent. (H8) Theorem. Let P-->M be a real analytic G-structure over M. Suppose G is involutive. Then P is integrable if and only if the structure function of P is zero everywhere. Pinally, we remark that for a general Lie subgroup G c GL(m, R), G k is involutive for k?> ko [8]. I. Cartan-Kiihler Theorem on Differential Systems Let U be an open subset of R m , We write Ak( U) for the space of real analytic differential k-forms on U. Put A*(U)=Ao(U)EB... EB Am(U). Then A*(U) is a graded R-algebra with respect to the usual exterior product. By a differential system on U we mean an ideal I: of A*(U) such that (11) I:=Ao(U)nI:EBA1(U)nI:EB... EBI:nAm(U), (12) d(E)cI:, (13) I: is finitely generated as an ideal. For convenience we write Elk)=I:n Ak(U). When I:(O) = {O}, we call I: a restricted dif- ferential system. Let I: be a differential system on U c R m , By a k-dimensional integral manifold of I:, we mean a k-dimensional regular real analytic submanifold of U such that for any exEI:, we have ex I X = O. In the above definition, it is sufficient to know that ex I X = 0 for any exEEl k ), For XE U, we write GdTAU)) for the set of all k-dimensionallinear subspaces of TAU). Set Gk(T(U))= UXEU Gk(TAU)) (disjoint union); then Gk(T( U)) is naturally a real analytic manifold of dimension m+m(m-k), Let I: be a differential system on U. A k-plane EE Gk(TA U)) is called a k-dimensional integral ele- ment of I: at x if for any exEEl k ), we have exlE = O. Thus X is a k-dimensional integral sub- manifold if and only if Tx(X) is a k-dimensional integral element of I: for each XEX. We write Ik(I) for the set of k-dimensional integral ele- ments of I:. In general Ik(I:) is a real analytic subset of GdT( U)). For any k-dimensional 
717 integral element Eat XE U, we define the polar space H(E) by H(E)={UE TAU);ex(u, VI' ..., vd=O for exEE(k+l),v l , ...,VkEE}. Then EeH(E). Moreover FE Gk+1(TAU)) with F=:> E is in Ik+1 (Ej if and only if F e H(E). Let E be a k-dimensional integral element of E. We choose <pI, ..., <pk E A I (U) such that <pIlE, ..., <pkiE are linearly independent. Define 'W(<pl,...,<pk)by 'W(<pl, .", <pk)= {F E Gk(T(U)); <pll F, "" <pkl F linearly independent}. Then 'W(<pl, ..., <pk) is an open neighborhood of E in Gk(T(U». For any element FE'W(<pI, ..., <pk), define F I , "', Fk E F by <pi(F k ) = i5f. Thus {F"..., Fd is the dual basis of <pI IF, ..., <pkIF. Any element exEE(k) defines a real analytic function ex* on 'W(<pl, ..., <pk) by ex*(F) = ex(F I , '" , F k ). We calI E regular if there exist ex I , .. . , ex1EE(k) and an open neighborhood Y of E in 'W(<pl, ..., <pk) such that (14) Ik(E)n Y = {FE Y; ex(F)=... = ex(F) =O}, (15) dex, ... , dex are linearly independent onY, (16) dimH(F) is constant for any FEY. We say that EEldE) is an ordinary element if E contains a (k - 1 )-dimensional regular in- tegral element. The following is the well-known theorem due to Cartan and Kiihler [7]. (17) Theorem. Let E be a restricted dif- ferential system on U eR m . Let X be a k- dimensional integral submanifold of E. Sup- pose that Tx(X) is regular for a point x E X and that there exists a (k+ I)-dimensional integral element F at x with F=:> TAX), Then there exists a (k + 1 )-dimensional integral submani- fold Y of E such that x E Y, TA Y) = F, and X e Y in a neighborhood of x. We say that E is involutive at E E IdE) if there exist O=E o eEl e... eE k - 1 eE such that Ej is a j-dimensional regular integral element of E (j=0, 1, ...,k-l). Applying theorem (G7) inductively, we obtain: Corollary. Let E be a restricted differential system on U e Rm. If E is a k-dimensional involutive integral element of E at x, there exists a k-dimensional integral submanifold of E such that x E X and TAX) == E. In view of this corolIary, it is important to know when E is involutive at EElk(Ej. (18) Lemma [9]. Let E be a restricted dif- ferential system on U e R m , and E be a k- dimensional integral element of E. Then E is 191 Ref. G-Structures involutive at E if and only if there exist 0 = EO eEl e... e E k - I e E such that Ik(Ej is a sub- manifold of Gk(T( U)) near E with dimension m +k(m-k)-Ef-ol t j , where tj=m-dimH(Ej). References [1] E. Cartan, Les problemes d'equivalence, Oeuvres, pt. II, vol. 2, 1311-1334. [2] V. Guillemin and S. Sternberg, An alge- braic model of transitive differential geometry, Bull. Amer. Math. Soc., 70 (1964),16-47. [3J V. GuilIemin, D. Quillen, and S. Sternberg, The classification of the irreducible complex algebras of infinite type, J. Analyse Math" 18 (1967),107-112. [4J S. Kobayashi, Le groupe des transfor- mations qui laissent invariant Ie parallelisme, Colloque de Topologie de Strasbourg, 1954. [5] S. Kobayashi, Transformation groups in differential geometry, Erg. Math., 70, Springer. [6] S. Kobayashi and T. Nagano, On filtered Lie algebras and geometric structures, J. Math. Mech. I, 13 (1964), 875-908; II, 14 (1965),513- 522; III, 14 (1965),679-706; IV, 15 (1966), 163-175; V, 15 (1966), 315-328. [7J M. Spivak, A comprehensive introduction to differential geometry V, Publish or Perish Inc., 1975. [8J T. Ochiai, On the automorphism group of a G-structure, J. Math. Soc. Japan, 18 (1966), 189-193. [9] I. Singer and S, Sternberg, The infinite groups of Lie and Cartan: The transitive groups, J. Analyse Math., 15 (1965),1-114. [lOJ N. Tanaka, On the equivalence problems associated with a certain class of homogeneous spaces, J. Math. Soc. Japan, 17 (1965), 103- 139. 
192 A Harmonic Analysis 192 (X.24) Harmonic Analysis A. Fourier Transforms Let f(x) be an element of the tfunction space L 1 ( -00,00) and t a real number. Then the integral j(t)=(2nrl/2 Loooof(x)e-i'Xdx converges, and the function j(t) is continuous in (-00,00). We call] the tFourier transform of fIt f(X)EL2( -00,00), then f(x)ELd -a, a) for any finite interval ( - a, a), and if wc sct ](t) = (2n)-1/2 J:/(x)e-i'Xdx, then fa tconverges in the mean of order 2 as a --> 00 to a function .f in L 2 . In this case, we define.f to be the tFourier transform off (E L 2 ). Furthermore, in this case, if we set fa(x) = (2n)-1/2 fa j(t)ei'X dt, then l.i.m'aoofa(x)= f(x) (Plancherel theo- rem). Moreover, we have tParseval's identity: S",oo If(xW dx = S",oo 1](tWdt ( 160 Fourier Transform). Suppose that f(x) is periodic with period 2n andfEL 2 ( -n, n). We set a n =(2nr l f/(x)e-inXdx (tFourier coefficients). The nth partial sum of the tFourier series Sn(X)=L:_naveiVX con- verges in the mean of order 2 to f(x), and Parseval's identity I f n 00 2n -n If(xWdx= )oo la n l 2 holds. On the other hand, if {an} is a given sequence such that L -00 la n l 2 < 00, then L: -n ave ivX converges in the mean of order 2 to a function f(x), the Fourier coefficients of f(x) are {an}, and Parse val's identity holds ( 159 Fourier Series). B. Bochner's Theorem and Herglotz's Theorem A complex-valued function f(x) defined on ( - 00, 00) is said to be of positive type (or posi- tive definite) if 'L.j,kd(Xj- xk)lk;;:' 0 for any finite number of reals XI' x 2 , ..., X n and com- plex numbers  I' 2' ..., n' If f(x) is measur- able on (-00,00) and of positive type, then there exists a tmonotone increasing real- 720 valued bounded function a(t) such that f(x) = Loooo ei,xdex(t) (2) (1) for almost an x. If ex( -00) = 0 and ex(t) is right continuous, then ex(t) is unique (Bochner's theorem). Conversely, if ex(t) is nondecreasing and bounded and f(x) is defined by (2), then f(x) (caned the Fourier-Stieltjes transform of ex(t)) is continuous and of positive type, A sequence {an} (-00 < n < 00) is said to be positive definite (or of positive type) if Lj,k1 aj-kJk;;:' 0 for finitely many arbitrarily chosen complex numbers" 2' ...,n' If {an} is of positive type, then there exists a mono- tonc increasing bounded function ex(t) on [ -n, nJ such that a n = fn eintdex(t) (Herglotz's theorem). Conversely, if ex(t) is monotone increasing and bounded and an is defined by the above integral, then the se- quence {an} is of positive type, C. Poisson's Summation Formula If f(X)ELd -00,00) is oftbounded variation and continuous and if ](t) is its Fourier trans- form, then we have fi f f(ak)=Jb f j(bk), k= -00 k=-oo where ab=2n (a>O). This is caned Poisson's summation formula. D. Generalized Tauberian Theorems of Wiener Suppose that we are given a function f(X)E LI (-00,00) whose Fourier transform j(t) is never zero. Then the set of functions given by h(x)= foof(X- y)g(y)dy, where gELd -00,00), is dense in Ld -00,00). Hence we can deduce the tgenera1ized Tauber- ian theorem of Wiener: If the Fourier trans- form kl(t) of kl(X)EL I ( -00,00) does not van- ish for any real t and !L: kdx-y)f(y)dy=C f:oo kl(y)dy for a function f(x) that is bounded and measurable on (-00,00), then for any k 2 E LI (-00,00), ! f:oo k 2 (x- y)f(y)dy= C f:oo k2(y)dy 
721 ( t60 Fourier Transform G). Hence we can deduce tTauberian theorems of the Littlewood type [3]. E. Harmonic Analysis Let ex(A) be a complex-valued function on (-00,00) that is of bounded variation and right continuous, If we have the expression f(t) == too", e iA ' dex(},), then we say that the function f(t) is repre- sented by the superposition of harmonic oscil- lations e W . Conversely, when f(t) is given, we have the problem of finding a function ex(A) as above such that f(t) can be expressed in the form of (2'). When such a function ex(A) exists, it is also an important problem (in harmonic analysis) to find the amplitude ex(A)-ex(A-O) of the component of a proper oscillation. Con- cerning this problem, we have the following three theorems: (I) A necessary and sufficient condition for f(x) to be representable in the form (2') is f e<, If 00 ( sin(t/n) ) 2 . I sup - f(t)e-'A'dt dA< 00. 1 n -00 -00 tin (II) For a function f(t) expressed in the form (2'), we have: (i) for any ..1. 0 , ex(}'o)-ex()'o-O)= lim  f T f(t)e-iAo'dt, Too 2T -T and (ii) if ex(A) is continuous at ),=Ao-O" and ), = AO + 0" (0" > 0), then ex(),o + 0") - ex(},o - 0") Af T sinO"t . = lim - -f(t)e-'Ao'dt. Too n -T t (III) In (2'), suppose that the discontinuity points of ex(A) are ..1.1' }'2'"'' and set an = ex(An)- ex()'n - 0) (n = 1,2, ' . , ); then t I s - co lim - f(t+s)f(s)ds= L la n l 2 eiA". soo S 0 nl F. The Paley-Wiener Theorem In formula (1), if we change the variable from t to a complex variable (= t + iO", then we have F«() == ( 2n r l / 2 f:oo f(x)e -i(x dx, which is called the Fourier-Laplace transform of f(x). In particular, if f(x) has bounded tsup- port, then F«() is an tentire function. Concern- ing Fourier-Laplace transforms, Paley and Wiener proved the following theorems: 192 H Harmonic Analysis (I) A necessary and sufficient condition for an entire function F«() to be the Fourier- Laplace transform of a tfunction of class Coo having its support in a finite interval [ - B, BJ is that for any N, there exists a constant C N > 0 such that IFml.s; C N (1 + Im- N e Blal for all (= t +iO". (II) If g(t)EL 2 (0, 00), then its one-sided tLa- place transform (2') f(z)=(2n)-1/2 !ORO g(t)e-'Zdt satisfies: (i) f(z) is holomorphic in the right half-plane Re z > 0, and (ii)  f:oo If(x+iyWdy< 00. Conversely, if f(z) = f(x + iy) (x> 0) satisfies (i) and (ii), then the boundary function f(iy)E L 2 ( -00,00) exists and is such that lim f oo If(iy)- f(x+iyWdy=O, xjo -00 its Fourier transform in L 2 g(t)=1.i.m.(2n)-1/2 f N f(iy)ei'Ydy N-X1 -N vanishes at almost all negative t, and f(z) is the one-sided Laplace transform of g(t). G. Harmonic Analysis on Locally Compact Abelian Groups The general theory of harmonic analysis on the real line was extended to a theory on lo- cally compact Abelian groups by A. Weil, I. M. Gel'fand, D. A. Raikov, and others. The theory of normed rings was utilized for the development of the theory ( 36 Banach Algebras). This theory is called harmonic analysis on locally compact Abelian groups, and it has been developed as described in the following sections. H. Group Rings Let LI = LdG) be the set of all integrable functions with respect to a tHaar measure on a locally compact Abelian group G. If we define the norm and multiplication in LI (G) by (3) Ilfll = f G If(x)1 dx, f' g(x)= Lf(Xy-l)g(y)d Y , respectively, then LJ has the structure of a commutative tBanach algebra. (We call f' g the convolution (or composition product) of f 
192 I Harmonic Analysis and g.) If the topology of G is not tdiscrete, Ll (G) does not have a unity for multiplication. Hence, adjoining a formal unity 1 to LI (G), we set R = {ext + fl ex is a complex number, fELI(G)}, and Ilext + fll = lexl + Ilfll, (ext + /)+(fJt +g)=(ex+ fJ)t +(f +g), (ext + n-(fH + g) = (exfJ) t +(fJf +exg+ f'g). Then R is a commutative Banach algebra with unity. When G is discrete, R = Ll (G). The Banach algebra R is called the group algebra of G. The group algebra R of G is tsemisimple. R is algebraically isomorphic to a subalgebra of C(Wl), which is the associative algebra of all continuous functions on the compact Haus- dorff space WI consisting of all maximal ideals in R ( 36 Banach Algebras). By this corre- spondence, if <pER corresponds to <p(M), which is a function on \JJl, then SUPME9JlI <p(M)1 .s; 11<p11. LI(G) belongs to \JJl if and only if the group G is not discrete. I. Fourier Transforms According as the group G is discrete or not, we set 91 = \JJl or 9'1 = \JJl- {LI (G)}. Then there exists a one-to-one correspondence between the elements ME 91 and the elements X of the tcharacter group G of G such that the follow- ing formulas are valid: f(M) = L X(x)f(x)dx, fELl' X(y)= fy(M)jf(M), where fy(x) = f(xy-l) and f is a function such that f(M)#O. This correspondence M +-> X gives a homeomorphism between the locally compact space 91 and G. Hence if we identify M with X and set f(M) = !(x), then j is a con- tinuous function on G, called the Fourier trans- form of f(x). Since f --> f(M) is an algebraic isomorphism, (fg) '(X) = /(x)9(x). If ((X) == g(X), then j is equal to g in Ll (G). This is the uniqueness theorem of Fourier transforms. From it we can deduce the tmaximal almost periodicity of locally compact Abelian groups. If G is not discrete, then L 1 (G) E \JJl, and f'(L I (G)) = 0 for fELl (G). Hence {X II/(x)1 ;:, F.} is a compact subset of G. This means that! is a continuous function vanishing at infinity on G, (This is a generalization of the tRiemann- Lebesgue theorem concerning the cases G = R I or T l = RjZ.) Any continuous function u(X) on G vanishing at infinity is approximated uni- formly by !(X), which is the Fourier transform of fELdG). 722 J. Positive Definite Functions A function <p(x) defined on G is said to be positive definite (or of positive type) if the in- equality L'J,k1 <p(XjX k - 1 )ex/i'k;:' 0 holds for arbi- trary elements Xl, .., 'X n in G and arbitrary complex numbers ex l , ..., exn' We denote by P G the set of all positive definite functions on G. If <pE P G , then <p(e) > 0 (e is the identity of G), I <p(x)l.s; <p(e), and <p(x- 1 ) = <p(x) , If G is locally compact, we further assume that <p E P G is measurable with respect to the Haar measure ofG. Then for any fEL1(G), f f <p(xy-l)f(y) f(y) dxdy;:'O. Any <pEP G is equal almost everywhere to a continuous <PI E P G . (Concerning positive de- finite functions on locally compact groups and their relation with unitary representations  437 Unitary Representations B.) K. Harmonic Analysis and the Duality Theorem (4) If G is a locally compact Abelian group, then a function <p(x) on G belongs to P G if and only if there exists a nonnegative measure J1(G) < 00 on G such that <p(x) = IGx(x)dJ1(x). (When G = R I, this theorem is tBochner's theorem, whereas when G=Z, it is Herglotz's theo- rem.) Hence we can prove a spectral resolu- tion of unitary representations of G: V(x) == Ie x(x)dE(x) (a generalization oftStone's theorem).lffEL I (G)nP G , then!(x);:,O,jE Ll (G), and the inversion formula of Fourier transforms f(x) = IG x(x) !(x) dX holds, provided that the Haar measure on G is suitably chosen. If fE LI (G) n L 2 (G), then! E L 2 (G), and Par- seval's identity IG If(x)1 2 dx = IG 1!(x)1 2 dx holds. If we put Vf =! and vj = f, then V is extend- able uniquely to an isometry of L 2 (G) onto L 2 (G) and V is extendable uniquely to its inverse transformation, respectively (Plan- cherel's theorem on locally compact Abelian groups). By the inversion formula and Plan- cherel's theorem, we can prove that the char- acter group G of G is isomorphic to G as a topological group. This is called the Pon- tryagin duality theorem of locally compact Abelian groups ( 422 Topological Abelian Groups). In particular, if G is compact, G is a discrete Abelian group. Then we can normal- ize the Haar measure of G and G so that the measure of G and the measure of each element of G are I. Plancherel's theorem implies that the set of characters of G is a tcomplete tortho- normal set in L 2 (G). (5) 
723 L. poisson's Summation Formula Suppose that G is a locally compact Abelian group, H is its discrete subgroup, and G/H is compact. Then the tannihi1ator r of H is a discrete subgroup of G. For any continuous function f(x) on G, if LYEHf(xy) is convergent absolutely and uniformly (hence fELl (G)) and L{Er f<) is convergent absolutely, then LYEH f(y) = c L{Er j( ), where c is a constant depending on the Haar measures of G and G. This is called Poisson's summation formula on a locally compact Abelian group (a generali- zation oftPoisson's summation formula on G=R). M. Closed Ideals in LdG) In the following, the group operations in G and G are denoted by +, and the value of the character y(x) (x E G, y E G) is written as (x, y). For a function f defined on G and any YE G, the translation operator ry is defined by ryf(x) = f(x- y). A closed subspace of LdG) is an ideal in LdG) if and only if it is invariant under all translations (N, Wiener). A closed ideal I coincides with LI (G) if and only if the set of zeros of I, Le., Z(I)== nfEJ-I (0), is empty (Wiener's Tauberian theorem). A closed ideal I is maximal if and onl y if Z (I) consists of a single point. If the dual G of G is discrete, then the closed ideals in L I (G) are completely characterized by the zeros; that is, tspectral synthesis is possible, but this situation does not hold generally. P. Malliavin's theorem states that if G is not discrete, then there exists a set E in G and two different closed ideals I and J such that Z(I)=Z(J)=E. Such a set E is called a non-S-set. For example, if G = R 3 , the unit sphere is a non-S-set (L. Schwartz). N. Operating Functions Denote by A(G) the set of all Fourier trans- forms of the functions in LdG). Let fEA(G) and <I> an analytic function in a neighborhood of the range of! Furthermore, assume that <1>(0) = 0 if G is not discrete. Then there exists a gEA(G) such that g(y) = <I>(f<y)) for YE G (Wiener-Levy theorem). In general a function <I> defined on a set D in the complex plane is said to operate in a function algebra R or to be an operating function on R if <1>(/) E R for all fER whose range lies in D. The converse of the Wiener-Levy theorem holds in the follow- ing form. Let G be an infinite Abelian group and <I> a function on the interval [ -1, 1]. If <I> operates in A(G), then <I> is analytic in a neigh- borhood of the origin if G is compact, and 192 P Harmonic Analysis analytic in a neighborhood of [ -1, 1] if G is not compact [14,15]. Let E be a subset of G, and IE the set of all fEA(G) such thatf=O on E. Let A(E)=A(G)/IE be the quotient algebra. The set E is called a set of analyticity if every operating function on A(E) is analytic. For a characterization of such a set  [11]. O. Measure Algebras For a locally compact Abelian group G, let M(G) be the set of all tregular bounded com- plex measures. For.le and Ji in M(G), the convolution .Ie * Ji is defined by (..1. * Ji)(E) = SG A(E - y). dJi(Y), where E is a Borel set in G. Then M(G) is a semisimple commutative Banach algebra whose product is defined to be the convolution. The Fourier-Stieltjes trans- form of Ji EM (G) is defined by Jl(y) = L (x,y)dJi(x), YEG. A continuous function on G is positive definite if and only if it is the Fourier-Stieltjes trans- form of a positive measure in M(G) (Bochner's theorem). Assume that G is not discrete. A function on the interval [-1,1] that oper- ates in the Fourier-Stieltjes transforms of measures in M(G) can be extended to an entire function, and a function on the whole complex plane that operates in the tGel'fand repre- sentation of M(G) is an entire function [15]. From this fact, it follows that M(G) is asym- metric and nonregular. Furthermore, there exists a measure JiEM(G) such that Jl(y))o 1 but I/Jl is not a Fourier-Stieltjes transform of M(G). (See also Wiener and Pitt [16], Shreider [17], and Hewitt and Kakutani [18]; for the general description of measure algebra, see Rudin [llJ and Hewitt and Ross [12].) P. Idempotent Measures A measure JiE M (G) is called idempotent if Ji * Ji = Ji, that is, Jl(y) = 0 or 1 for all y E G. Then Jl is the characteristic function of the set {'I' E G I Jl(y) = I}. The smallest ring of subsets of G that contains all open co sets of subgroups of Gis called the coset ring of G. The characteristic function of a set E in G is the Fourier-Stieltjes transform of an idempotent measure in M(G) if and only if E belongs to the coset ring of G [19]. A simple proof of this theorem is given by T. Ito and I. Amemiya (Bull. Amer. Math. Sac" 70 (1964)). When G is the unit circle, the coset ring consists of sequences periodic except at a finite number of points, and for this case the theorem was obtained by H. Helson. Let 
192Q Harmonic Analysis n l , n z , ...,nk be distinct integers and dJi(x) = L=l e injx . dx. Then Ji is an idempotent mea- sure on the unit circle. J. E. Littlewood conjec- tured that the norm of Ji exceeds clog k, where c is a positive constant not depending on the choice of {nJ A partial answer was given by p, 1. Cohen [19J for compact connected Abelian groups and was improved by H. Davenport (Mathematika, 7 (1960)) and E. Hewitt and H, S, Zuckerman (Proc. Amer. Math. Soc., 14 (1963), Q. Mappings of Group Algebras Let G and H be two locaIly compact Abelian groups and <p a nontrivial homomorphism of Ll (G) into M (H). Associated with <p there is a mapping <p* of a subset Y of fJ into G such that <p(/)(y) = J(<p*(y)) for YE Yand =0 for yrje Y, or symbolically <p(/)=/(<p*). A con- tinuous mapping ex of Y into G is said to be piecewise affine if there exist a finite number of mutually disjoint sets Sj, j = 1, ..., n, in the coset ring of fJ and mappings ex j such that (i) Y = Ujl Sj; (ii) ex j is defined on an open coset Kj of fJ, where Kj=:>Sj; (iii) exj=ex on Sj; and (iv) ex)"I'+y' -y")= exiy) + Ct)y')-exj(Y") for all y, y', y" E Kj,j= 1, ..., n. P. J. Cohen's theorem is: If <p is a homomorphism of LdG) into M(H), then Y belongs to the coset ring of fJ and <p* is a piecewise affine mapping of Y into G. Con- versely, for any piecewise affine mapping ex, there is a homomorphism <p of LI(G) into M (H) such that <p* = ex. Related theorems have been studied by A. Beurling, H. Helson, 1.-P. Kahane, Z. L. Leibenson, and W. Rudin [11]. R. Exceptional Sets Let G be a locally compact Abelian group. A subset E is said to be independent if n I x I + '" + nkxk = 0, where the nj are integers and XjE E implies njxj=O,j= 1,..., k, A set E in Gis called a Kronecker set if for every continuous function <p on E of absolute value 1 and 8> 0 there exists a YEG such that I <p(x) -(x, y)1 < 8, x E E. Every Kronecker set is independent and of infinite order, but independent sets are not necessarily Kronecker sets. For a group G whose elements are of finite order p, a set E is called of type K p if for every continuous func- tion <p on E with values exp(2nik/p), k = 0, ..., p - 1, there is a YE G such that <p = yon E. If E is a compact Kronecker set in G and Ji is a mea- sure with support in E, i.e., JiEM(E), then IIJiII == II P II ct)' A compact set E is called a Helson set if there is a constant C such that II Jill .s; q P II ex- for JiEM(E). Every Kp set is also a Helson set. For a Helson set E, C(E) = A(E). A discrete 724 analog of a Helson set is a Sidon set. A subset F of a discrete group G is called a Sidon set if there is a constant C such that LYEF la).s; CSUPxILyEFay(x, Y)I for every polynomial L a)x, y). For example, a tlacunary sequence {nd, nk+l/nk>q> 1, of integers is a Sidon set. These sets are deeply connected with har- monic analysis on groups, and measures con- centrated on these sets have some unexpected pathological properties ( e.g., [11, 13J). S. Tensor Algebras and Group Algebras Let X and Y be compact Hausdorff spaces, and denote by V(X, Y) the projective tensor product C(X) @ C(Y) of continuous func- tion spaces C(X) and C(Y). The norm of <p(x,Y)=Llfi(x)gj(Y) is defined by 11<p11 = infLlllfill oc Ilgjll", where the infimum is taken for all expressions of <po If G is an infi- nite compact group, then there exist two sub- sets K 1 and Kz such that (i) Kl and Kz are homeomorphic to the tCantor ternary set; (ii) the expression Yl +Yz of an element of E = Kl +Kz is unique, where Yl EKI and yzEKz; (iii) Kt n Kz # 0; and (iv) Kl U Kz is a Kronecker set or a set of type K p for some p. Varopoulos' theorem states that the algebra V(K"Kz) is isomorphic to A(E) which denotes the algebra of restriction of functions in A(G) on the set E. By this theorem, the problems of spectral synthesis and operating functions of group algebras are transformed into problems of tensor algebras. For a more precise discussion  [20]. References [IJ E. C. Titchmarsh, Introduction to the theory of Fourier integrals, Clarendon Press, 1937. [2] A. Zygmund, Trigonometric series I, Cam- bridge Univ, Press, second edition, 1959, [3] N. Wiener, The Fourier integrals and certain of its applications, Cambridge Univ. Press, 1933. [4] R. E. A. C. Paley and N. Wiener, Fourier transforms in the complex domain, Amer. Math. Soc. Colloq. Pub!., 1934. [5] K. Y osida, Functional analysis, Springer, 1965, sixth edition, 1980. [6J M. A, Nalmark, Normed rings, Noordhoff, 1959. (Original in Russian, 1956.) [7] L. H. Loomis, An introduction to abstract harmonic analysis, Van Nostrand, 1953, [8] H. Cartan and R. Godement, Theorie de la dualite et analyse harmonique dans les groupes abeliens localement compacts, Ann. Sci. Ecole Norm. Sup., (3) 64 (1947),79-99. 
725 [9] R. Godement, Theon:mes tauberiens et theorie spectrale, Ann. Sci. Ecole Norm. Sup., (3) 64 (1947),118-138. [10] A. Weil, L'integration dans les groupes topologiques et ses applications, Actualites Sci. Ind., Hermann, 1940, second edition, 1951. [11J W. Rudin, Fourier analysis on groups, Interscience, 1962. [12J E. Hewitt and K. A. Ross, Abstract har- monic analysis, Springer, I, 1963; II, 1970. [13] 1.-P. Kahane and R, Salem, Ensembles parfaits et series trigonometriques, Actualites Sci. Ind., Hermann, 1963, [14J Y. Katznelson, Sur Ie calcul symbolique dans quelques algebres de Banach, Ann. Sci. Ecole Norm. Sup., (3) 76 (1959), 83 123. [15J H. Helson, 1.-P. Kahane, Y. Katznelson, and W. Rudin, The functions which operate on Fourier transforms, Acta Math., 102 (1959), 135157. [16J N. Wiener and H. R. Pitt, On absolutely convergent Fourier-Stieltjes transforms, Duke Math. 1., 4 (1938), 420-436. [17J Yu. A. Shrelder (Srelder), The structure of maximal ideals in rings of measures with convolution, Amer. Math. Soc. Transl., 81 (1953). (Original in Russian, 1950.) [18J E. Hewitt and S. Kakutani, Some multi- plicative linear functionals on M(G), Ann. Math., (2) 79 (1964), 489505. [19J P. 1. Cohen, On a conjecture of Little- wood and idempotent measures, Amer. J. Math., 82 (1960),191-212. [20] N. T. Varopoulos, Tensor algebras and harmonic analysis, Acta Math., 119 (1967), 51- 112. 193 (X.29) Harmonic Functions and Subharmonic Functions A. General Remarks A real-valued function u of tclass C 2 defined in a domain D in the n-dimensional Euclidean space R n is called harmonic if it satisfies the Laplace equation cJ 2 u a 2 u u(P)=-;;-z+... +-;;-z=0 (P=(x" ...,x n )) oX j oX n in D. A harmonic function is, by definition, twice continuously differentiable, but turns out to be real analytic. It is not true, however, that the solutions of the Laplace equation are real analytic. For example, for the function u(x, y) = Reexp( _Z-4) (z==x+ iy#O), u(O, 0)==0, U xx and U yy exist everywhere, and u satisfies the 193 C Harmonic Functions and Subharmonic Functions Laplace equation, but is not continuous at the origin. The fundamental properties of harmonic functions do not depend essentially on n. A real-valued function u of class C 2 satisfy- ing the ineq uality u;:' 0 is called subharmonic, For a more general definition of subharmonic functions and their properties  Sections P- U. B. Invariance of Harmonicity Harmonicity in R 2 is invariant under any tconformal transformation. Namely, when there exists a conformal bijection sending a domain D in the xy-plane onto a domain D' in the I]-plane, every harmonic function u(x, y) on D is transformed into a harmonic function of (, 1]) on D'. In Rn for n;:' 3, harmonicity is not generaIly preserved under conformal transformations. However, harmonicity is preserved in the following special case: Let D be a domain in Rn (n;:, 3), and consider the inversion f:D-->D' defined by f(x l , ..., x n ) = (x'l' ..., x) = (a 2 xdr2, ..., a 2 xn/r2), r= (xi + ... +X;)1/2. Let u(x l ,..., x n ) be a harmonic function on D, and let V(X'l , ... ,x) be the func- tion on D' obtained by applying the Kelvin transformation to u. Namely, v(x'" ..., x) = (a/r')"-2 u (a 2 x'dr'2, ..., a 2 x/r'2), where r'2 = x'/ + .., + X2. Then the function v is harmonic on D'. A function u that is harmonic outside a compact set is called regular at the point at infinity if any Kelvin transform of u is har- monic in a neighborhood of the origin, in which case u(P)-->O as OP -->oo. Now let T: X k = Xk(X'j, .., , x), 1 .s; k.s; n, be a one-to-one analytic transformation of a domain D' onto another domain D. If there exists a posi- tive function <P(X'l' .., ,x) in D' such that CP(X'b ..., X)U(XI (x'!, ..., x), ..., X n (X'l, ..., x)) is harmonic for any harmonic function u(Xj, ... , x n ) in D, then T is conformal. A conformal transformation as it is known in differential geometry is either (i) a tsimilarity transforma- tion, (ii) an inversion with respect to a sphere or a plane, or (iii) a finite combination of trans- formations of types (i) and (ii). C. Examples of Harmonic Functions (1) If u is a polynomial in XI, ..., X n and har- monic in Rn, then the terms of degree k in u form a harmonic function for each k;:,O. A harmonic homogeneous polynomial is said to be a spherical harmonic. (2) logr in R 2 and r 2 - n in Rn (n;:'3) are harmonic except at r=O. (3) Every tlogarithmic potential in R 2 and every tNewtonian potential in R n (n;:' 3) is harmonic outside the tsupport of the measure. Con- 
193 D Harmonic Functions and Subharmonic Functions versely, any harmonic function defined on a domain D is represented in an arbitrary rela- tively compact domain D' in D as the sum of a logarithmic (n = 2) or Newtonian (n)o 3) potential of a measure on aD' and the poten- tial of a tdouble layer. (4) Both the real part u and the imaginary part v of an analytic func- tion of a complex variable are harmonic. We caIl v a conjugate harmonic function of u. If u is harmonic on a tsimply connected domain D, then the conjugate v of u is given by f (X,y) ( ou ou ) v(x,y)== --dx+-dy + constant, (a,b) oy ax where (a, b) is a fixed point in D and the path of integration is contained in D. When D is a tmultiply connected domain, v may take many values in accordance with the thomology classes of the paths of integration. D. Green's Formulas In the foIlowing one should substitute "curve" for the term "surface" when n = 2. Let D be a bounded domain whose boundary S consists of a finite number of closed surfaces that are piecewise of class C 1. Let u and v be harmonic in D, and suppose that all the first-order par- tial derivatives of u and v have finite limits at every boundary point. We caIl D the inside of S. Let n be a normal on S toward the out- side of D. Then the relation f ( u v _ }u ) dU=O s on on foIlows immediately from tGauss's formula, where du is the tsurface element on S. In par- ticular, hen v is identicaIly equal to 1, for- mula (1) gives f au -;;-du = O. son Equations (1) and (2) are caIled Green's and Gauss's formulas, respectively. Conversely, u is harmonic in D if u is a function of class C 2 in D and at every point pED, there is a se- quence {rk} decreasing to zero and such that Is(p,r,)(oulon)du =0 (k = 1,2, ...), where S(P, rd is the spherical surface with center at P and radius r k . Another sufficient condition for u to be harmonic is that u is of class C 1 and, at every point PED, there be an rp>O such that IS(p,r)(oulon)du = 0 for every r, 0 < r < rp (Koebe, Bocher). E. Mean Value Theorems We assume that u is a harmonic function, D the domain of definition of u, and S the bound- 726 ary of D. The mean value of u on the surface or the interior of any baIl in D is equal to the value of u at the center of the ball. Namely, 1 f 1 f u(P) =--;; udr =-----;;=T udO", rnr B(P,r) unr S(P,r) where r n and Un are the volume and surface area of a unit baIl in Rn, respectively, B( P, r) is the open baIl with center at P and radius r, and dr is the volume element. These relations are called mean value theorems. Conversely, if v is continuous in D and at every point PED, there is a sequence {rk} decreasing to zero and such that the mean value of v over B(P, r k ) or S(P, rd is equal to v(P) for each k, then v is harmonic in D. This result is caIled Koebe's theorem. From the mean value theorems the maximum principle for harmonic functions foIlows: Any nonconstant u assumes neither maximum nor minimum in D. If both u and v are harmonic in D and have the same finite boundary value at every point on S, then u = v in D by the maximum principle. This is caIled the uniqueness theorem. F. Boundary Value Problems (1) The first boundary value problem (or Dirichlet problem) is the problem of finding a harmonic function defined on D that assumes boundary values prescribed on S ( 120 Dirichlet Prob- lem). The second boundary value problem (or Neumann problem) is the problem of finding a harmonic function u whose normal derivative aulon is equal to a function f prescribed on the piecewise smooth boundary S. The solution, if it exists, is uniquely determined up to an addi- tive constant. In order for the solution to exist, f should satisfy the condition Is! du = O. The third boundary value problem is the problem of finding a harmonic function u on D that satis- fies aul an = hu + f on S, where hand fare functions prescribed on S. AIl these problems can be reduced to certain Wredholm integral equations. There is also the boundary value problem of mixed type, in which the boundary values are prescribed in a part of S and the normal derivatives are prescribed on the rest. (2) G. The Poisson Integral Let D be a bounded domain with smooth boundary Sand u a function harmonic in D and continuous on D U S. Let G(P, Q) be Green's function in D. Then (1) yields I f aG(p,Q) u(P)= -- u(Q) du(Q). Un S on Q 
727 In particular, if D = B(O, r), then r 2 - Op 2 f u(Q) u(P)= - n dO"(Q). O"n r S(O,r) PQ Conversely, given an integrable function f on S(O, r), we set r 2 OP 2 f f(Q) u(P)= - n dO"(Q). O"n r S(O,r) PQ Then u(p) is harmonic in B(O, r) and converges to f(Q) as P tends to any point Q on S(O, r) where f is continuous. We call u a Poisson integral. Sometimes it is possible to represent a harmonic function u in B(O, r) in the following form, which is more general than the Poisson integral: r 2 _ 0p 2 f 1 u(P) = - ndex(Q), O"n r S(O,r) PQ where ex is a signed tRadon measure on S(O, r). In order for u to admit such a repre- sentation, it is necessary and sufficient that Js(O.r') I ul dO" be a bounded function of r' for 0 < r' < r, or equivalently, that the tsubharmonic function lul have a tharmonic majorant. Fur- thermore, if ex is absolutely continuous, then the Poisson integral representation of u is possible, and vice versa. A necessary and suffi- cient condition for the function u to admit the Poisson integral representation is that there exist a positive convex function <p(t) on t>O such that <p(t)/t--> 00 as t-->oo and <p(lul) has a harmonic majorant. When D is a general domain in which Green's function exists, every positive har- monic function u(P) is represented uniquely as the integral J K(P, Q)dJ1(Q), where K(P, Q) is a tMartin kernel and J1 is a Radon measure on the Martin boundary B whose support is contained in a certain essential part of B, each point of which is called a minimal point. A similar integral representation appears in the theory of Markov processes ( 260 Markov Chains I). The representation J K(P, Q)dJ1(Q) is a generalization of (3). In terms of a function similar to <p(t), we can give a necessary and sufficient condition for u(P) to be represented in the form J K(P, Q)f(Q)dv(Q), which corre- sponds to the Poisson integral representa- tion and in which v is determined by 1 = J K(P, Q)dv(Q). This condition is equivalent to the condition that u(P) be quasibounded, i.e., that there exist an increasing sequence of bounded harmonic functions that converges to u [7]. A positive harmonic function u is said to be singular if any nonnegative harmonic minorant of u vanishes identically. Every positive har- monic function can be expressed as the sum of a quasibounded harmonic function and a singular one. 193 J Harmonic Functions and Subharmonic Functions H. Expansion Let Po = (x7, ... , x) be a point in D, and de- note the distance from Po to S by r. Then a harmonic function u is expanded uniquely into a power series La k1 . ,dXt-x7)k1 ...(xn-x)k", k l ,..., knO, in B(P o , (j2 -l)r). Thus u is (real) analytic in D. If u vanishes on an open set in D, then u = 0 in D. If the power series is written as L.k h k with spherical harmonics h k of degree k = 1, 2, ." , then this series converges over all of B(Po, r). I. Sequences of Harmonic Functions (3) In this section, {u m } is a sequence of harmonic functions in a bounded domain D. First, if each U m is bounded and continuous on D U S and {u m } converges uniformly on S, then {u m } converges uniformly in D, and the limit- ing function u is harmonic in D. Moreover iJk1 + , +k"Um/iJx1 .. .iJx:" converges to iJk! + k"U/ iJx! ... iJx" uniformly on any compact subset of D (Harnack's first theorem). Second, if U t .s; u 2 .s; ... in D and there is a point of D at which {u m } is bounded, then {u m } converges uni- formly on any compact subset of D (Harnack's second theorem). The following Harnack's lemma is useful: If u is positive harmonic in D Po is a point of D, and K is a compact subset' of D, then there exist positive constants c and c', depending only on Po and K, such that cu(Po).s; u(P).s; c'u(Po) on K. Any family of (locally) uniformly bounded harmonic functions is tnormaL A family of positive harmonic functions that is bounded at a point is also normal by Harnack's lemma. If Jvluk-umIPdr-->O as k, m-->oo for p 1, then tHolder's inequality implies that {u m } con- verges uniformly on any compact subset of D. It follows that if Jvlgrad(uk-um)IPdr-->O as k, m--> 00 and {un} converges at a point in D, then there exists a harmonic function u in D such that Jv Igrad(u m - u)IP dr-->O as m--> 00 and u m converges to u uniformly on any compact subset of D, where Po is any point in D. Finally, if J v I Un IP dr (p  1) are bounded, then {un} forms a normal family. J. Level Surfaces and Orthogonal Trajectories The set {PI u(P) = constant} is calJed a level surface (niveau or equipotential surface). When a is given as the constant, the level surface is called the a-level surface. Assume that u is not a constant. A point where grad u vanishes is called critical. The set of critical points consists of at most countably many treal ana1ytic 
193 K Harmonic Functions and Subharmonic Functions manifolds of dimension .s; n - 2 (n = dim D, and a manifold of dimension 0 is understood to be a point). Any compact subset of D intersects only a finite number of such manifolds; we express this fact by saying that the manifolds do not cluster in D. Each of these manifolds is contained in a certain level surface. The complement of the critical points with respect to any level surface consists of real analytic manifolds of dimension n - 1 that do not cluster in D. For each noncritical point there exists an analytic curve passing through it such that gradu is parallel to the tangent to the curve at each point on the curve. A maximal curve with this property is called an orthogonal trajec- tory (or line offorce). Along every orthogonal trajectory, u increases strictly in one direction and hence decreases in the other, so that none of the orthogonal trajectories is a closed curve. There is exactly one orthogonal trajectory passing through any noncritical point. There- fore no two orthogonal trajectories intersect, and no orthogonal trajectory terminates at a noncritical point. Moreover, the set of limit points of any orthogonal trajectory in each direction does not contain any noncritical point. When u is a Green's function G(P, Q), every orthogonal trajectory is called a Green line, and a Green line that originates at the pole Q and along which u decreases to 0 is called regular. For any sufficiently large a, the a-level surface La is an analytic closed surface homeomorphic to a spherical surface. Let E be a family of orthogonal trajectories originating at the pole. If the intersection A of E and a closed level surface La is an (n - 1 )-dimensiona1 measurable set, then the tharmonic measure of A at Q with respect to the interior of La is called the Green measure of E. M. Bre10t and G. Choquet proved that all orthogonal trajec- tories originating at the pole except those belonging to a family of Green measure zero are regular. Consider a domain D bounded by two compact sets, and denote by u the har- monic measure of one compact set with re- spect to D. Assume that u is not a constant. Then u changes from 0 to 1 along all ortho- gonal trajectories except those belonging to a family that is small with respect to a measure similar to the Green measure (see "flux" de- fined in Section K). K. Harmonic Flows Denote by La the a-level surface for a harmonic function u, and by Lao the complement of the set of critical points with respect to La' Let 0" be an (n - 1 )-dimensiona1 domain in Lao such that the (n - 2)-dimensional boundary of 0" is piecewise of class C 2 . Suppose that u assumes 728 the value b (> a) on each orthogonal trajec- tory passing through 0". Consider the union of orthogonal trajectories that pass through 0". The subset of this union on which u assumes values between a and b forms a set called a regular tube. The parts of the boundary corre- sponding to a and b are called the lower and upper bases of the tube; accordingly, 0" is the lower base. The integral f(iJu/iJn)dO" on any section (i.e., the part of a level surface in the tube) is constant and is called the flux of the tube. The family of orthogonal trajectories passing through an (n -I)-dimensional domain (not necessarily bounded by a smooth bound- ary) in Lao is called a harmonic flow, and a subfamily is called a harmonic subflow if its intersection A with Lao is measurable (in the (n -I)-dimensional sense). The flux of a har- monic subflow is defined to be SA(iJu/iJn)dO". Then the Green measure of family E of Green lines originating at the pole is equal to the flux of E divided by O"n' We can compute the exact value of the textremallength of any harmonic subflow. L. Isolated Singularities Let u be harmonic in an open ball except at the center 0, It is expressed as the sum of a function h(P) harmonic in the entire ball and L:::=o Hm(P)/ O p 2m+l, where Hm is a tspherica1 harmonic of degree m. If OP 'u(P)-->O as P-->O for ex> 0, then u(P) is equal to h(P)+ c lOP +... +H m (p) /Op 2m + 1 with m<ex-1. In particular, if u is bounded in a neighborhood of 0, then 0 is a removable singularity for u. If u is bounded above (below), then u(P)=h(P)+c /Op n - 2 , where c.s; 0 (c  0), (When n = 2, we have u(p) =h(p)+c1og 1 /0P . For the removability of a set of capacity zero  169 Function-Theoretic Null Sets). If u is harmonic near the point at infinity, i.e., outside some closed ball, then 00 Hm(P) 00 u(P)= m0 0P 2m+1 + mo Um(P), where the first sum is regular at the point at infinity and U m is a spherical harmonic of degree m. If OP -'u(P)-->O as OP --> 00 with rJ,  0, then U m = 0 for all m> ex. If u is bounded above or below, then U m = 0 for all m  1. If u is harmonic in Rn and OP -'u(P)-->O as OP -->oo with ex  0, then u is a polynomial of degree m ( < ex). If u is harmonic and bounded above or below in R n , then u is constant. Brelot called a function u harmonic at the point at infinity if 00 Hm(P) u(P) = constant + L - Zm+1 mIOP (note that m  1) near the point at infinity [1]. 
729 M. Harmonic Continuation If U vanishes in a subdomain of D, then U = 0 in D, If Uk is harmonic in Dk (k = 1,2), Dl n Dz # 0, and U 1 =U 2 in Dl nD 2 , then U 1 and U 2 de- fine a harmonic function in Dl U D2' If the boundaries of mutually disjoint domains Dl and D 2 have a surface So of class Cl in com- mon, Uk is harmonic in Dk (k = 1,2), u 1 = U2 on So, and oudon and - ou 2 /on exist and coincide on So, then U 1 and U z define a har- monic function in the domain Dl U So U D 2 . We express this fact by saying that one of U 1 and U z is a harmonic continuation of the other. It follows that U = 0 in D if the boundary of D contains a surface So of class C 1 and U = oulon = 0 on So' Consider the case n = 2. If the boundary of a Jordan domain contains an analytic arc C and U (or oulon) vanishes on C, then a harmonic continuation of u into a cer- tain domain beyond C is possible. If n = 3, however, nothing is known except in the case where So is a part of a spherical surface or a plane and u=O (or oulon=O) on So. Boundary values of u do not always exist, but in some special cases, u has limits, For instance, a positive harmonic function in a ball has a finite limit at almost every boundary point Q if the variable is restricted to any angular domain with vertex at Q. N. Green Spaces As a generalization of tRiemann surfaces, Brelot and Choquet introduced 6"-spaces [3]. It is required that 6" be a separable connected topological space and satisfy the following two conditions: (i) At each point P there exists a neighborhood V p of P and a homeomorphism between V p and an open set V in the tAlexan- drov compactification Rn U {oo}; (ii) if A = V p , n V P2 # 0 and A is the part of V;, that corre- sponds to A (k = 1,2), then the correspondence between A; and A; via A is a conformal (pos- sibly with the sense of angles reversed) trans- formation when n = 2 and an tisometnc trans- formation (which keeps cr i!'! ,arlant) when n  3. If a tGreen's function exists on 6", then 6" is called a Green space. Harmonic functions and the tDirichlet problem on a Green space have been discussed from various points of view. O. Biharmonic Functions A function v is called polyharmonic if  k v = 0 (k2) and biharmonic if v==O; sometimes, polyharmonic functions are also called bi- harmonic. A biharmonic function in a plane domain D is written as Re(zf(z)+ g(z)), where f 193Q Harmonic Functions and Subharmonic Functions and g are complex analytic in D (Goursat's representation). Biharmonic functions are used in the theory of elasticity and hydrodynamics. P. Sub harmonic Functions Let D be a tdomain in the n-dimensional Eu- clidean space Rn (n2). A real-valued function u(P) in D is calIed subharmonic if (I) -oo.s; u < +00, u ¥= -00; (2) u is tupper semicon- tinuous; and (3) at every point Po of D, the mean value of u over the surface of any closed tball in D with center at Po is not smaller than u(P o ), i.e., U(Po).s; f UdO"=L(Po,r), (Jn rn where O"n is the area of the surface of a unit ball in Rn. Condition (3) can be replaced by: (3') The mean value A(Po, r) of u over the closed ball is  u(P o )' In order that an upper semi- continuous function u be subharmonic it is necessary and sufficient that, for any sub- domain D' of D and for any harmonic function h in D', the maximum principle hold for u-h. We call -u superharmonic when u is sub- harmonic, A harmonic function is subhar- monic and superharmonic. The converse is also true ( Section E). When u is of tc1ass C 2 , then u is subhar- monic if and only if 02U a 2 u u=----z+ ". +----z0 (P=(x l , ...,x n )). oX I oX n When u is an upper semicontinuous function that is not necessarily differentiable, u is sub- harmonic if and only if u interpreted as a tdistribution is a positive tmeaSUre. If u 1 , ..., Uk are subharmonic and aI, ..., a k are positive constants, then a l u 1 + ... + aku k and max(udP), u 2 (P), ..., uk(P)) are subhar- monic. If a subharmonic function u is replaced by the tPoisson integral for the boundary function u inside a closed ball in D, then the resulting function in D is subharmonic, If f(t) is a monotone increasing convex function, then f(u) is subharmonic. If vO and logv is subharmonic, then v is subharmonic. If f(z) is a holomorphic function of the complex vari- able z and ..1.>0, then Aloglf(z)1 and hence If(zW are subharmonic. If h is harmonic, then Ihl is subharmonic. Any tIogarithmic potential (n = 2) or tNewtonian potential (n  3) is super- harmonic in R n . Q. Properties of Mean Values Condition (3) (resp. (3')) can be replaced by the condition that there exists an r(Po) > 0 at any 
193 R Harmonic Functions and Subharmonic Functions Po such that u(Po).s; L(Po, r)(u(Po).s; A(Po, r)) for every r, O<r<r(P o )' The relation -00 < A(P o , r).s; L(Po, r) always holds, and both A(Po, r) and L(Po, r) decrease to u(Po) as r to. On any tcompact subset of D, u is tintegrable. Both A(Po, r) and L(P o , r) increase with rand are convex functions of -Iogr (n == 2) and r 2 - n (n)o 3); hence they are continuous functions of r. If D' is relatively compact in D and ro is the distance between iJD' and iJD, then A(P,r) is a continuous subharmonic function of P in D', where r is fixed in the interval (0, ro). By taking the average of A(P, r) k times, a subharmonic function of class C k is obtained that decreases to u as r to. If <Pr«xf +... + x;) '/2) is suitably chosen, then the +convolution u * <Pr is a sub- harmonic function of class Coo and decreases to u as rtO. R. Sequences of Subharmonic Functions The limit of a decreasing sequence or a downward-directed net of subharmonic func- tions is subharmonic or equal constantly to -00. The limit of a uniformly convergent sequence of subharmonic functions is sub- harmonic. If u l , u 2 , ,.. are subharmonic, then max(u" ..., Uk) is subharmonic for every k, but sup(u" u 2 , .,,) may not be subharmonic. Let U be a family of subharmonic functions in D that are uniformly bounded above on every com- pact subset of D. Then the upper envelope of U, i.e., the function defined by SUPUEU U in D, coincides with a subharmonic function except on a set of tcapacity zero. S. Harmonic Majorants and Riesz Decompositions Suppose that we are given a subharmonic function u in D. If there is a harmonic function h satisfying h)o u in D, then h is called a har- monic majorant of u. When there is a harmonic majorant of u, there exists a least one among them, denoted by hv. For any relatively com- pact subdomain D' of D, h v ' always exists and equals the tperron-Brelot solution in D' for the boundary function u. As D' increases to D, h v ' increases to a function h that is either har- monic or equal constantly to 00. If hv exists, it coincides with h, and hence h is harmonic. Conversely, if h is harmonic, then hv exists and equals h. Generally, there is a unique tRadon measure Ji in D with the following property: Let b be any subdomain of D such that ho and the tGreen's function Go exist in b (b may coincide with D). Then ho - u is equal to the potential So GodJi, and u = ho - So GodJi. In gen- eral, a representation of a superharmonic (subharmonic) function as the sum (difference) 730 of a harmonic function and a potential is called a Riesz decomposition. T. Boundary Values Let D be a domain in which a Green's function exists, and consider the tfine topology on the tMartin compactification of D. Any negative subharmonic function u in D has a finite limit with respect to the fine topology at every point of the tMartin boundary  except at the points of a subset of  of harmonic measure zero (1. L. Doob). When D is a ball, u has a limit in the ordinary sense along almost every radius. However, it may happen that even if u is bounded there exists no angular limit at any point of the boundary. If the mean value of I ul on every concentric smaller balI in D is bounded, then u can be decomposed into the sum of a nonnegative harmonic function and a negative subharmonic function by the Riesz decomposition, and hence u has a limit both radially and with respect to the fine topology. Let D be a domain and K be a compact subset of D of tcapacity zero. If a function u is subharmonic and bounded above in D - K, then u can be extended to be a subharmonic function in D. A function that is equal to a subharmonic function almost everywhere is called almost subharmonic, and an almost subharmonic function satisfying condition (3') is called submedian. Subharmonic functions can be discussed in a space more general than Rn, e,g" a tRiemann surface (n=2), or more generally, an tS-space of dimension n ()o 2) in the sense of Brelot and Choquet. U. The Axiomatic Treatment tNewtonian potentials were the main object of interest in the early stages of tpotential theory. A major part of potential theory can be dis- cussed on the basis of the theory of superhar- monic functions [8]. For example, a tpolar set is defined as a set on which some superhar- monic function assumes the value 00, and a set X is tthin at a point Pof/=X if and only if Po has a positive distance from X or there exists a superharmonic function v(P) in a neighbor- hood of Po such that Iim sup v(P) > v (Po) as PEX tends to Po. Moreover, we can discuss tbalayage, define potentials, and obtain Riesz decompositions. Generalizing results of Doob (1954) and starting from a family of harmonic functions defined axiomatically in a locally compact Hausdorff space, M. Brelot defined superharmonic functions and potentials and discussed balayage, Riesz decompositions, and 
731 the tDirichlet problem (1957). Further pro- gress in axiomatic potential theory has been made by Brelot, H. Bauer, C. Constantinescu, A. Cornea, and others [11,14]. References [1] M. Brelot, Sur Ie role du point a I'infini dans la theorie des fonctions harmoniques, Ann. Sci. Ecole Norm. Sup., (3) 61 (1944),301- 332. [2] M. Brelot, Elements de la theorie classique du potentiel, Centre de Documentation Uni- versitaire, Paris, third edition, 1965. [3] M. Brelot and G. Choquet, Espaces et Iignes de Green, Ann. Inst. Fourier, 3 (1951), 199-263. [4] O. D. Kellogg, Foundations of potential theory, Springer, 1929. [5] M. Nicolescu, Les fonctions polyharmoni- ques, Actualites Sci. Ind" Hermann, 1936. [6] M. Ohtsuka, Extremal length of level surfaces and orthogonal trajectories, 1. Sci. Hiroshima Univ., 28 (1964), 259-270. [7] M. Parreau, Sur les moyennes des fonc- tions harmoniques et analytiques et la classifi- cation des surfaces de Riemann, Ann. Inst. Fourier, 3 (1952),103-197. [8] M. Brelot, Sur la theorie autonome des fonctions sous-harmoniques, Bull. Sci. Math., (2) 65 (1941),72-98. [9] M. Brelot, Lectures on potential theory, Tata Inst., 1960. [10] T. Rad6, Subharmonic functions, Erg. Math., Springer, 1937 (Chelsea, 1970). [IIJ CoIloque International sur la Theorie du Potentiel, Paris-Orsay, 1964, C.N.R.S. (Ann. Inst. Fourier, 15 (1965), fasc. 1.) [12J L. Helms, Introduction to potential theory, Wiley Interscience, 1969. [13] W. K. Hayman and P. B. Kennedy, Sub- harmonic functions I, Academic Press, 1976. [14] C. Constantinescu and A. Cornea, Poten- tial theory on harmonic spaces, Springer, 1972. Also  references to 120 Dirichlet Problem and 338 Potential Theory. 194 (VII.11) Harmonic Integrals A. Introduction tDe Rham's theorem shows that the coho- mology group with real coefficients of a tdif- ferentiable manifold of class Coo is isomorphic to the cohomology group of the cochain com- 194 B Harmonic Integrals plex of tdifferential forms with respect to the exterior derivative d. Thus every element of the cohomology group can be represented by a class of tclosed differential forms. Harmonic forms enable us to choose one definite dif- ferential form in each cohomology class. The theory of harmonic forms, caIled the theory of harmonic integrals, is modeled after the theory of holomorphic differentials and their inte- grals (Abelian integrals) in function theory [2,4,5,8]. B. Definitions Let X be an oriented n-dimensional differ- entiable manifold of class Coo with a tRie- mannian metric ds 2 of class Coo ( 105 Differ- entiable Manifolds, 364 Riemannian Mani- folds A). For every (COO) p-form <p on X we define an (n - p)-form * <p on X as follows: First denote the tvolume element of X by dv. If we choose a basis {w I' .. . , w n } of the space of I-forms on an open set U of X such that ds 2 = Li wf and dv = WI A ... A W n , then <p can be expressed on U in the form <p = (I/p!)L<pi i Wi A...AW i . Ifwelet*<p= l' 'p 1 p (l/(n - P)!)L( * <p )j1' ,jn-p W h !\ ... A W jn _ p , where (*rn). . =(I/P')Lb 12 n. . rn. . then 't'11 In-p . 11 lph In-p't''1 1 .Ip' * <p is an (n - p)-form on U that does not depend on the choice of (w 1, .. . , w n ) and is determined only by <po Since X is covered by open sets as above, * defines a linear map- ping that transforms p-forms to (n - p)-forms. If we let ds 2 = L gjkdxj dXk in terms of the local coordinate system (x 1, ... ,xn) and <p = (l/p!)<Pi i dxi1 A... A dXip, then in the nota- l" , P , tion of tensor calculus, we have * <p = (1/(n - p)!)(* <P)j1 .jn-p dx h A",!\ dx jn - p , ( * rn ) . - r:. g . 15 12 n rnk1 .kp 't"'h jn-p-YY k 1 kpjt. jn-p't" (17 = det(gij)). For two p-forms <p and Iji, we define the inner product by (<p, Iji) = Ix <p A *Iji if the right- hand side converges. In order for the inner product (<p, Iji) to be defined, it suffices that either <p or Iji has compact tsupport. Then (<p, Iji) is a symmetric, positive definite bilinear form. If we let 15 = ( - 1 )"p+n+l * d * operate on p- forms, where d is the texterior derivative, then d and 15 are adjoint to each other with respect to the inner product. That is, if either <p or Iji has compact support, we have (d<p, Iji) = (<p,blji) (tStokes's theorem). We call =db+ bd the Laplace-BeItrami operator, which is a self-adjoint teIliptic differential operator. These operators satisfy relations such as * * =( _!)pln- PI , dd=O, M =0, *=*, 
194 C Harmonic Integrals *b=( -ljPd*, and b*=( _l)n-p+l *d (when they operate on p-forms). A differential form <p is said to be harmonic if d<p = 0 and b<p = O. Then <p = O. Since  is an elliptic operator, a tweak solution <p of the equation <p = J1 is an ordinary solution of class COO on the domain where J1 is of class Coo. Therefore, if <p is harmonic (as a weak solu- tion), <p is of class Coo. C. Harmonic Forms on Compact Manifolds On a compact manifold X, any <p with <p = 0 is harmonic, since (<p, <p) = (d<p, d<p) + (b<p, b<p). Let Lp(X) be the linear space of p-forms of class COO on X, and denote by i!p(X) the com- pletion of Lp(X) with respect to the inner product (<p, Iji), Then i!p(X) is the Hilbert space of square integrable measurable p-forms. Then f>p(X) = {<pEi!p(X)I<p=O (in the weak sense)} is a finite-dimensional subspace of i!p(X) and is contained in Lp(X), as we have seen before. Also, 5 p (X) is closed in i!p(X), and the tpro- jection operator H: i!p(X)-->f>p(X) is an tinte- gral operator with kernel of class Coo. The orthogonal complement of f>p(X) in i!p(X) is mapped onto itself by  and has the inverse operator G of , which is a continuous oper- ator of the Hilbert space. By letting G = 0 on f>p(X), we can extend G to an operator from i!p(X) to i!p(X) that is called Green's operator. It is also denoted by G, maps Lp into itself, commutes with d and b, and satisfies GH=HG=O, H+G=1 (=identitymap- ping). Therefore, for <pELp(X) we have <p=H<p + Gbd<p + dbG<p, which shows that H is thomo- topic to the identity mapping of the tcochain complex (L p Lp(X), d). From this we infer that every cohomology class of de Rham cohomol- ogy contains a unique harmonic form that represents the cohomology class. However, since products of harmonic forms are not always harmonic, it is not appropriate to use harmonic forms to study the ring structure of cohomology. G is also an integral operator with kernel of class CO: outside the diagonal subset in X xX. D. Harmonic Forms on Noncompact Manifolds If X is a noncom pact manifold, let Lp(X) be the space of p-forms of class COO with compact support, and let i!p(X) be its completion. Let !Bp(X) and !B;(X) be the respective closures of dL p _\ (X) and bL p +1 (X) in i!p(X), and let 3 p (X) and );(X) be the respective orthogonal complements ofp(X) and ;(X) in i!p(X), Then :)p(X)n3;(X)=5 p (X) is a subspace of the square integrable harmonic forms, and we 732 have the direct sum decomposition i!p(X) = !Bp(X)+!B;(X)+f>p(X). In this decomposition any component of a form of class Coo is also of class Coo. If X is an open submanifold of another manifold Y, X is compact, and ax = X - X is a closed submanifold of Y, then the theory in this section is just a generalized potential theory with boundary condition <p = 0 on ax. We sometimes treat decompositions of other Hilbert spaces that correspond to other bound- ary conditions. E. Generalization to Complex Manifolds If X is a complex manifold, we consider complex-valued differential forms ( 72 Com- plex Manifolds C). Then the space Lp(X) of p- forms is the direct sum of the spaces Lr.s(X) of forms of type (r, s), and the exterior derivative d has the expression d =d' +d", where d' is of type (1,0) (i.e., Lr,s(X)--> L r +1 ,s(X») and d" is of type (0, 1). If we are given a holomorphic vec- tor bundle E on X, we can define an operator d" on differential forms with values in E, and we have the generalized tDolbeault theorem. If X is compact and has a tHermitian metric, we can define a Hermitian inner product on E as follows: There is an open covering {L0} such that over each U j the vector bundle E is iso- morphic to L0 x cq, A point of E over U j is represented by (x,j)' where XEL0 and jECq. For XE L0n Uk we have (x, j) =(x, k) (the sides are the respective expressions over L0 and Ud if and only if j= gjk(X)k' where gjk(X) is a holomorphic mapping from L0n Uk to GL(q, C) satisfying gjkgkl = gjl on L0 n Uk nu" A differen- tial form <p with values in E is expressed as a family {<pj} of differential forms on L0 with values in cq such that <p)x) = gjd x ) <Pk (x) on L0 n Uk' If we take a positive definite Hermitian matrix h j whose components are Coo-functions on L0 such that 'gjkhihk = h k on ujn Uk> then {tjhlJ determines a Hermitian inner product on each fiber of E. We can also endow the space Lr.,(E, X) (of forms of type (r, s) of class ex with values in E) with a Hermitian inner product by setting (<p,Iji)= fxL,.phj,p<p} A *I/If for <p, IjiE Lr,s(E, X) (where the <pj (:x = 1, ..., q) are the components of <PJ If we denote by b the adjoint operator of d" with respect to this inner product and let A =d"b + bd", then A is a self-adjoint elliptic differential operator, and results similar to those for  mentioned above hold for A. For example, the space f>r,.(E, X) of harmonic forms of type (r, s) is of finite dimension, and there is a continuous linear operator G on i!r,.(E, X), the completion of Lr,,(E, X), that satisfies 1 = H + AG, HG = GH=O, d"G=Gd", and bG=Gb. Here H 
733 denotes the projection .2-->5, which is an integral operator with kernel of class Coo. Also, G maps Lr, ,(E, X) into itself. Therefore H is homotopic to the identity on the co- chain complex (L s Lr..(E, X), d"), and any ele- ment of tDolbeault's cohomology groups (d"- cohomology groups) is represented by a uni- que harmonic form ( 232 Kahler Manifolds B). F. Other Generalizations Even if a manifold X is not of class Coo, if X is a manifold of class C:, we can develop the theory of harmonic forms [6]. We say that X is of class c: if it is of class C 1 and has a set of local coordinate systems whose transition functions have derivatives satisfying the tLip- schitz condition. If X is a real analytic manifold with a real analytic Riemannian metric, then harmonic forms are also real analytic. Using this fact, we can embed real analytically a compact mani- fold with a real analytic Riemannian metric into a Euclidean space (P. Bidal and G. de Rham; this result is now included in the theorems of C. B. Morrey and H. Grauert). We can consider the theory of harmonic forms with singularities [4,5], a generalization of the theory of differential forms of the second and third kinds. Here the notion of tcurrent is very useful. G. Cohomology Vanishing Theorems Since the operator  is closely related to the Riemannian metric, some me tries may admit no harmonic forms of certain degrees except zero. This is important since it means that the corresponding cohomology group of the mani- fold vanishes. The condition for this phenom- enon to occur can be described in terms of the curvature of the metric. This study has its origin in S. Bochner's results [2]. Here is an example of a vanishing theorem: Let B be a holomorphic line bundle on a com- pact complex manifold X of dimension n. If the tChern class of B is expressed by a real closed differential form of type (1, 1) as (J)= J=1 La,pdz' /\az P , where the Hermitian matrix (a,p) is positive definite at every point of X, then Hq(X,QP(B))=O for p+q>n. In this case, ds 2 = 2 L a,pdz' /\ dz P is a tHodge metric on X ( 232 Kahler Manifolds D). References [1 J W. L. Baily, The decomposition theorem for V-manifolds, Amer. 1. Math., 78 (1956), 862-888. 195 B Harmonic Mappings [2J S. Bochner and K. Yano, Curvature and Betti numbers, Ann. Math. Studies 22, Prince- ton Univ. Press, 1953. [3J S. I. Goldberg, Curvature and homology, Academic Press, 1962. [4J W. V, D. Hodge, The theory and applica- tions of harmonic integrals, Cambridge Univ. Press, second edition, 1952. [5J K. Kodaira, Harmonic fields in Rieman- nian manifolds (generalized potential theory), Ann. Math., (2) 50 (1949),587-665. [6J C. B. Morrey and J. Eells, A variational method in the theory of harmonic integrals I, Ann. Math., (2) 63 (1956), 91- 128. [7J G. de Rham, Varietes differentiables, Ac- tualites Sci. Ind., Hermann, second edition, 1960. [8] G. de Rham and K. Kodaira, Harmonic integrals, Lecture notes, Inst. for Advanced Study, Princeton, 1950. 195 (VII.15) Harmonic Mappings A. General Remarks The theory of harmonic mappings between Riemannian manifolds has its origin in the study of tPlateau's problem. The basic prob- lem in the theory is to deform a given map- ping into a harmonic one, which is a problem of the tcalculus of variations and tglobal ana- lysis ( 46 Calculus of Variations, 183 Global Analysis). Recently, the theory of harmonic mappings has been applied to problems in various branches of geometry [7-9, 11]. B. Definitions and Examples Let (M,g) and (N,h) be tRiemannian mani- folds with metrics g = L gijdx i dx j and h = Lh,pdy'dyP, respectively. We define the energy of a Cl-mapping f: M -->N by E(/)=L Idj(xWdx, where Idf(x) I is the tHilbert-Schmidt norm of the differential dfx: Tx(M)--> Tf(x)(N) of fat x E M and dx is the canonical Lebesgue mea- sure defined by 9 on M (assumed compact). Thus E(/) can be considered to be a generali- zation of the classical tDirichlet integral for functions. The integrand e( f)(x) = Idf(xW is called the energy density of f; it measures the sum of the squares of elements of length stretched on a complete set of mutually per- pendicular directions. 
195 C Harmonic Mappings The tEuler-Lagrange differential equations of the energy functional E(/) yield a vector field r(/) along f, i,e., a section of the bundle f# T(N) induced from the tangent bundle T(N) of N by f In fact, given a family ,{, of mappings depending differentiably on t with fo = f, we have frE(J,)I,o= - L (r(f),  I,)dX, where < , > denotes the inner product of tangent vectors along f The vector field r(f) is called the tension field of the mapping f; it indicates the direction in which the energy of f decreases most rapidly. The Euler-Lagrange differential equations r(/) = 0 are a system of tq uasilinear elliptic partial differential equations of the second order. In local coordinates, these can be writ- ten in the form N'+ Lgijr" (f) ojP ojY =O . PY iJx' ox) , where  is the tLaplace-Beltrami operator on M and the r'py(/)(x) are the tChristoffel sym- bols on N at f(x). (The f" are local coordi- nates of the point f(x), and (giJ) is the inverse matrix of (gi)') A C 2 -mapping f: M -->N is said to be harmonic if its tension field r(/) vanishes. Thus, if M is compact, a C 2 -mapping f: M-->N is harmonic if and only if it is an extremal of the energy functional E(/). Examples of harmonic mappings appear in various contexts of differential geometry. For instance: (I) If N =R, then the harmonic mappings M --> R are the tharmonic functions on M. (2) If M is the circle SI, then a harmonic mapping SI -->N is a closed tgeodesic of N parametrized by arc length. (3) Let f: M --> N be an tisometric immersion of Minto N. Then f is harmonic if and only if it is a tminimal immersion. (4) If M and N are tKahler manifolds, then every tholomorphic or antiholomorphic map- ping M -->N is harmonic, where by an anti- holomorphic mapping is meant a mapping whose differential mapping carries a differen- tial form of type (1,0) into that of type (0, I). We note that each (anti-)holomorphic map- ping is an absolute minimum for the energy in its homotopy class. There are also examples of nonholomorphic (and nonantiholomorphic) harmonic mappings between Kahler manifolds ( Section D). C. Fundamental Properties (1) Regularity. Since a harmonic mapping is a solution of a second-order quasilinear elliptic 734 system of partial differential equations r(/)=O ( Section B), it is a smooth (i.e., of class COO) mapping. More generally, it is known that a continuous mapping that satisfies r(/) = 0 in a weak sense is smooth [4, 6]. (2) Unique continuation property. The follow- ing unique continuation theorem is valid for harmonic mappings: If two harmonic map- pings of Minto N agree up to infinitely high order at some point of M, then they are iden- tical (M being assumed connected). In partic- ular, a harmonic mapping that is constant on an open set is a constant mapping. The global natures of harmonic mappings are closely related to the curvatures of the manifolds under consideration. For instance, suppose that M and N are compact and that the sectional curvatures of N are nonpositive everywhere. Then we have: (3) Uniqueness, Let f: M -->N be a harmonic mapping, and assume that there is a point of f(M) where the sectional curvatures of N are negative. Then f is unique in its homotopy class unless f(M) is a closed geodesic y of N; and in this case we have uniqueness up to rotation of y, i.e., an isometry of y which moves each point of y a fixed oriented distance along y. (4) Degeneracy. Suppose further that the tRicci tensor (Rij) of M is positive semi definite everywhere. Then the energy density e(f) is a tsubharmonic function for every harmonic mapping. This implies that any harmonic mapping f: M -->N is ttotally geodesic. More- over, if N is of negative sectional curvature, then f is either constant or maps M onto a closed geodesic of N; if (Rij) is positive definite at some point, then f is constant. (5) Finiteness. Assume now that N is of negative sectional curvature. Then, for each K)o 1, there are only finitely many noncon- stant harmonic mappings f: M --> N of dilata- tion bounded by K. Here, we say that the dila- tation of f is bounded by K if and only if at each point of M we have df ==0 or (A 1 /A 2 )1/2.s; K, Al )0 ..1. 2 )0... > 0 being the positive eigen- values of the pullback quadratic form f* h(x) on TAM) induced from the metric h of N by f D. Harmonic Mappings of a Surface Let M be a compact surface. Then the energy of a mapping M -->N is the tDirichlet-Douglas functional, and harmonic mappings are closely connected with solutions of Plateau's problem ( 334 Plateau's Problem). In fact, if a tcon- formal mapping M -->N minimizes the tarea functional, then it also minimizes the energy functional. Now let M and N be compact orient able 
735 surfaces whose genera are denoted by p and q, respectively. Then the problem of existence (or nonexistence) of harmonic mappings is well understood. In fact: (1) When q # 0, for any metrics 9 and h on M and N, every homotopy class of mappings M --> N contains a harmonic mapping. (2) When q = 0 (i.e., N is the 2-sphere S2), every harmonic mapping whose tdegree d satisfies Idl)o pis holomorphic or antiholo- morphic with respect to the complex struc- tures associated with 9 and h. For example, consider the homotopy classes of mappings from the 2-torus T 2 to S2 with any metrics. Then all classes with degree I dl)o 2 have har- monic representatives, and any such is holo- morphic or antiholomorphic; and the classes with d = :t 1 have no harmonic representatives. (3) When q=O and Idl.s;p-1, we have, for every such p and d, a surface M of genus p and a metric h on S2 such that there exists a har- monic nonholomorphic (and nonantiholo- morphic) mapping of degree d from M to S2. E. Existence Theorems The basic problem in the study of harmonic mappings is to prove their existence in general geometric contexts. (I) In regard to this problem, translating the problem of the elliptic system r(f) == 0 into the tinitial value problem of the corresponding nonlinear tparabolic system of/at = r(/), 1. Eells and J. H. Sampson [1] proved that if M and N are compact and if N has nonposi- tive sectional curvature everywhere, then every homotopy class of mappings M --> N contains a harmonic mapping that minimizes the energy in that class. Subsequently, the uniqueness of these harmonic mappings was established by P. Hartman [2] in the form stated in Sec- tion C. (2) For harmonic mappings of surfaces, more general existence results have been known, First, by the tdirect method of the calculus of variations, L. Lemaire (1977) and others proved that if M and N are compact and if M is 2-dimensional, then every conjugacy class of homomorphisms n l (M)-->n l (N) of the funda- mental groups is induced by a minimizing harmonic mapping, It follows that if, in partic- ular, the second homotopy group n 2 (N) of N is zero, then every homotopy class of map- pings of a compact surface M to N contains a harmonic representative realizing the mini- mum of the energy in that class. On the other hand, by making use of the generalized tMorse theory for a perturbed energy functional, 1. Sacks and K Uhlen- 195 Ref. Harmonic Mappings beck [5] succeeded in giving a satisfactory answer to the structure of n 2 (N), which is a n 1 (N)-module, in terms of harmonic mappings. They proved that there exists a generating set for n 2 (N) consisting of harmonic mappings of spheres that minimize energy and area in their homotopy classes. We note that these har- monic mappings are minimal immersions with tbranch points. (3) Next, we mention the case of harmonic mappings of manifolds with boundary. In this case, we can naturally formulate the tDirichlet and the tNeumann boundary value problem for harmonic mappings. In his study of Plateau's problem on Rie- mannian manifolds, C. Morrey (1948) dis- cussed the Dirichlet problem for harmonic surfaces with boundary. The problem in arbitrary dimensions has been studied by R. S. Hamilton [3], who extended the result of Eells and Sampson mentioned above to the case where M and N have boundaries. In fact, let M and N be com- pact Riemannian manifolds with boundary, and assume that N has nonpositive sectional curvature and that the boundary oN of N is tconvex (or empty). Then there exists a unique minimizing harmonic mapping in each tre1a- tive homotopy class determined by the pre- scribed Dirichlet boundary value. We note that if oN is not convex, then it is easy to formulate Dirichlet problems with no solu- tions, Hamilton also treated the Neumann problem. Subsequently, S. Hildebrandt, H. Kaul, and K-O. Widman [4] gave another existence proof of solutions of the Dirichlet problem that covers the case where N admits positive sectional curvature. References [1] 1. Eells and J. H. Sampson, Harmonic mappings of Riemannian manifolds, Amer. J. Math., 86 (1964),109-160. [2] P. Hartman, On homotopic harmonic maps, Canad. 1. Math., 19 (1967),673-687. [3] R. S. Hamilton, Harmonic maps of mani- folds with boundary, Lecture notes in math. 471, Springer, 1975, [4] S. Hildebrandt, H. Kaul, and K-O. Wid- man, An existence theorem for harmonic mappings of Riemannian manifolds, Acta Math., 138 (1977),1-16. [5J 1. Sacks and K Uhlenbeck, The existence of minimal immersions of two-spheres, Ann. Math., (2) 113 (1981),1-24. [6J J. Eells and L. Lemaire, A report on har- monic maps, Bull. London Math. Soc., 10 (1978), 1-68. 
196 Hilbert, David [7] Y.-T. Siu and S.-T. Yau, Compact Kahler manifolds of positive bisectional curvature, Inventiones Math., 59 (1980), 189-204. [8] Y.- T. Siu, The complex-analyticity of har- monic maps and the strong ridigity of compact Kahler manifolds, Ann. Math., (2) 112 (1980), 73-111. [9] S. Nishikawa and K. Shiga, On the holo- morphic equivalence of bounded domains in complete Kahler manifolds of nonpositive curvature, 1. Math. Soc. Japan, 35 (1983) 273- 278. [10] T. Ishihara, The index of a holomorphic mapping and the index theorem, Proc. Amer. Math. Soc., 66 (1977),169-174. [11] T. Sunada, Rigidity of certain harmonic mappings, Inventiones Math., 51 (1979),297- 307. 196 (XXI.28) Hilbert, David David Hilbert (January 23, 1862-February 14, 1943) was born in Konigsberg, Germany. He attended the University of Konigsberg from 1882 to 1885, when he received his doctoral degree with a thesis on the theory of invar- iants. It was there that he established a life- long friendship with H, Minkowski. In 1892 he became a professor at the University, and in 1895 he was appointed to a professorship at the University of Gottingen, a position he held until his death, He obtained his basic theorem on invariants between 1890 and 1893, and next began research on the foundations of geometry ( 155 Foundations of Geometry) and the theory of talgebraic number fields. Concerning the former, he published Grundlagen der Geo- metrie (first edition, 1899), in which he gave the complete axioms of Euclidean geometry and a logical examination of them. Concerning the latter, he systematized all the important known results of algebraic number theory in his monumental Zahlbericht (1897). In number theory, he enunciated his significant conjecture on tclass field theory. At the international congress of mathematicians held in Paris in 1900, he put forth 23 problems as targets for mathematics of the 20th century (Table 1). Between 1904 and 1906 he conducted research on the tDirichlet principle of tpotential theory and on the direct method in the teal cuI us of variations. Around 1909 he established the foundations of the theory of tHilbert spaces. After 1910 he was chiefly involved in research on the tfoundations of mathematics, and he advocated the standpoint of tformalism. He is one of the greatest mathematicians of the first half of the 20th century. 736 Table 1. The 23 Problems of Hilbert (1) To prove the continuum hypothesis ( 33 Axiomatic Set Theory D). (2) To investigate the consistency of the axioms of arithmetic ( 156 Foundations of Mathematics E). (3) To show that it is impossible to prove the following fact utilizing only congruence axioms: Two tetrahedra having the same alti- tude and base area have the same volume. Solved by M. Dehn (1900). (4) To investigate geometries in which the line segment between any pair of points gives the shortest path between the pair ( 155 F oun- dations of Geometry). (5) To obtain the conditions under which a topological group has the structure of a Lie group ( 423 Topological Groups M). Solved by A. M. Gleason and D. Montgomery and L. Zippin (1952) and H. Yamabe (1953). (6) To axiomatize those physical sciences in which mathematics plays an important role, (7) To establish the transcendence of certain numbers ( 430 Transce_ndental Numbers B). The transcendence of 2 J2 , which was one of the numbers put forth by Hilbert, was shown by A. Gel'fond (1934) and T. Schneider (1935). (8) To investigate problems concerning the distribution of prime numbers; in particular, to show the correctness of the Riemann hypoth- esis ( 450 Zeta Functions). Unsolved. (9) To establish a general law of reciprocity ( 59 Class Field Theory A). Solved by T. Takagi (1921) and E. Artin (1927). (10) To establish effective methods to deter- mine the solvability of Diophantine equations ( 97 Decision Problem; 182 Geometry of Numbers). Solved affirmatively for equations of two unknowns by A. Baker, Phi/os. Trans. Roy. Soc. London, (A) 263 (1968); solved nega- tively for the general case by Yu, V. Matiyasevich (1970). (11) To investigate the theory of quadratic forms over an arbitrary algebraic number field of finite degree ( 348 Quadratic Forms). (12) To construct class fields of algebraic num- ber fields ( 73 Complex Multiplication). (13) To show the impossibility of the solu- tion of the general algebraic equation of the seventh degree by compositions of continu- ous functions of two variables. Solved nega- tively. In general, V. I. Arnold proved that every real, continuous function f(XI' X2, X3) on [0, IJ can be represented in the form Lil h,(g;(x" X2), x 3 ), where hi and gi are real, continuous functions, and A. N. Kolmogorov proved that f(xI' X2, X3) can be represented 
737 in the form L7=1 h;(gil (xd + gi2(XJ +gi3(X3)), where hi and gij are real, continuous func- tions and gij can be chosen once for all inde- pendently of f (Dok/. Akad. Nauk SSSR, 114 (1957), Amer. Math. Soc. Trans/., 28 (1963)). (14) Let k be a field, XI' 00', X n be variables, and {;(x I' ..., x n ) be given polynomials in k[x l , ..., xnJ (i = 1, "', m). Furthermore, let R be the ring formed by rational functions F(X l , ...,X m ) in k(X I , ,00,X m ) such that F(/I, 00. ,1m) E k [X I' ... ,X n ]. The problem is to deter- mine whether the ring R has a finite set of generators. Solved negatively by M. Nagata, Amer. J. Math., 81 (1959). (15) To establish the foundations of algebraic geometry ( 12 Algebraic Geometry). Solved by B. L. van der Waerden (1938-1940), A. Weil (1950), and others. (16) To conduct topological studies of alge- braic curves and surfaces. (17) Let {(XI' ..., x n ) be a rational function with real coefficients that takes a positive value for any real n-tuple (x I, ... , x n ). The problem is to determine whether the function f can be written as the sum of squares of rational func- tions ( 149 Fields 0). Solved in the affirma- tive by E. Artin (1927). (18) To express Euclidean n-space as a disjoint union UA p)" where each P A is congruent to one of a set of given polyhedra. (19) To determine whether the solutions of regular problems in the calculus of variations are necessarily analytic ( 323 Partial Differ- ential Equations of Elliptic Type). Solved by S. N. Bernshtein, I. G. Petrovskii, and others. (20) To investigate the general boundary value problem ( 120 Dirichlet Problem; 323 Par- tial Differential Equations of Elliptic Type). (21) To show that there always exists a linear differential equation of the Fuchsian class with given singular points and monodromic group ( 253 Linear Ordinary Differential Equa- tions (Global Theory)). Solved by H. RohrI and others (1957). (22) To uniformize complex analytic functions by means of automorphic functions ( 367 Riemann Surfaces). Solved for the case of one variable by P. Koebe (1907). (23) To develop the methodology of the calculus of variations ( 46 Calculus of Variations), References [1] D. Hilbert, Gesammelte Abhandlungen I-III, Springer, 1932-1935 (Chelsea, 1967). [2] D. Hilbert, Grundlagen der Geometrie, Teubner, seventh edition, 1930, 197 B Hilbert Spaces [3J D. Hilbert, Grundziige der allgemeinen Theorie der Iinearen Integralgleichungen, Teubner, second edition, 1924 (Chelsea, 1953). [4J D. Hilbert and W. Ackermann, Grundziige der theoretischen Logik, Springer, third edi- tion, 1949; English translation, Principles of mathematical logic, Chelsea, 1950. [5] D. Hilbert and P. Bernays, Grundlagen der Mathematik, Springer, second edition, I, 1968; II, 1970. [6J F. Klein, Vorlesungen iiber die Entwick- lung der Mathematik im 19. lahrhundert I, Springer, 1926 (Chelsea, 1956). [7] H. Weyl, David Hilbert and his mathemat- ical work, Bull. Amer. Math. Soc., 50 (1944), 612-654. [8] C. Reid, Hilbert. With an appreciation of Hilbert's mathematical work by H. Weyl, Springer, 1970. 197 (XI1.2) Hilbert Spaces A. General Remarks The theory of Hilbert spaces arose from prob- lems in the theory oftintegral equations. D. Hilbert noticed that a linear integral equation can be transformed into an infinite system of linear equations for the tFourier coefficients of the unknown function. He considered the linear space /2 consisting of all sequences of numbers {xn} for which LI IXnl2 is finite, and defined for each pair of elements X = {xn}, Y = {Yn} E /2 their inner product as (x, y) = L;I xnYn' The space /2 can be regarded as an infinite-dimensional extension of the notion of a Euclidean space. F. Riesz considered the space of functions now termed Lrspace and succeeded in giving a satisfactory answer to the Fourier expansion problem. In his book [3], 1. von Neumann established a rigorous foundation of quantum mechanics employing Hilbert spaces and the spectral expansion of self-adjoint operators. The following axiomatic definition ( Section B) of Hilbert spaces is due to von Neumann. H. Weyllater Justified the tDirichlet principle of Riemann by the method of orthogonal projection in a Hilbert space, and thus paved the way for the function- analytic study of differential equations. B. Definition Let K be the field of complex or real numbers, the elements of which we denote by Ct:, fJ, '" , 
197 C Hilbert Spaces Let H be a tlinear space over K, and to any pair of elements x, YE H let there correspond a number (x, y) E K satisfying the following five conditions: (i) (Xl +xz,y)=(x l ,Y)+(X2'Y); (ii) (exx, y) = ex(x, y); (iii) (x, y) = (y, x) ; (iv) (x, x) )00; and (v) (x,x)=O=x=O. Then we call H a pre- Hilbert space and (x, y) the inner product of x and y. With the norm Ilxll ==, H is a tnormed linear space. If H is tcomplete with respect to the distance Ilx - YII (i.e., Ilx n - xmll--> o (m, n--> 00) implies the existence of Jimx n =x), then we call H a Hilbert space. According as K is complex or real, we calI H a complex or real Hilbert space. A Hilbert space is a tBanach space. A normed linear space with norm Ilxll can be made a pre-Hilbert space, by defining an inner product (x,y) so that Ilxll = J(x, x) , if and only if the equality Ilx + yl12 + Ilx _ yliZ = 2(llxll z + II Yllz) holds for any x, y. C. Orthonormal Sets Two elements x, YE H are said to be mutu- alIy orthogonal if (x, y) = 0, A subset I of H is calIed an orthogonal set (or system) if 0  I and every distinct pair x, YE I is mutually ortho- gonal. If every element of an orthogonal set I is of norm I, then I is calIed an orthonormal set. Any orthogonal set I = {xJ can be normal- ized into an orthonormal set {xdllxill}. A maximal orthonormal set is calIed a complete orthonormal set or an orthonormal basis, AII the complete orthonormal sets of a given H have the same cardinal number, which we calI the dimension of H. Two Hilbert spaces are isomorphic if and only if they have the same dimension. Let I= {xJ be an orthonormal set. Then for every XE H, its Fourier coefficients (x, Xi) van- ish for all but a countable number of i, and the Bessel inequality Ilxll z )oLiI1x,x;)lz holds. The following three statements are equivalent in a Hilbert space: (i) I is complete; (ii) Parseval's equality II x IIZ = Li I (x, Xi) I Z holds for every x; (iii) every x can be expanded in a Fourier series x == L;(X, X;)Xi ( 317 Orthogonal Functions). D. Examples of Hilbert Spaces The space Iz ( Section A) is a Hilbert space of dimension o. The tfunction space Lz on a measure space (X, Ji) is a Hilbert space if the inner product of f, 9 E Lz is defined by (f, g) = Ix f9 dJi. In the case of the tLebesgue mea- sure in a Euclidean space, L 2 is of dimension o, so that it is a Hilbert space isomorphic to Iz. Further examples of Hilbert spaces are 738 A 2 (Q), Wi(Q) (=H'(Q)), and H(Q) ( 168 Function Spaces). E. Closed Linear Subspaces and Projections Let M bc a closed linear subspace of a Hilbcrt space H, i.e., a linear subspace that is closed in the norm topology of H. It is a Hilbert space with respect to the restriction of the inner product in H. For a given M the set of all xEH such that (x,y)=O for every YEM forms a closed linear subspace M.l called the orthog- onal complement of M. The orthogonal com- plement of M.l is M (i.e., M.l.l = M), and His the direct sum of M and M.l (i.e., every XEH can be uniquely represented as x= Y+z, YE M, zEM.l, and Ilxll z = IIYll z + IlzI1 2 ). Thus the quotient space HIM is isomorphic to M.l and is also a Hilbert space. The operator PM that maps x to y is called the projection (or ortho- gonal projection or projection operator) to M. A bounded linear operator P is a projection if and only if it is idempotent (P2 = P) and self- adjoint «Px,y)=(x,Py) for any x, YEH) ( 251 Linear Operators). F. Conjugate Spaces A linear operator from H to K is called a linear functional. The set H' of alI continuous linear functionals f on H forms a Hilbert space with norm Ilfll = sup{lf(x)llllxll = I}. For every fEH' there exists a unique YEH such that f(x)=(x,y) for all XEH (Riesz's theorem), and the correspondence f --> y gives an tantilinear isometric operator from H' onto H (for tlinear operators on Hilbert spaces  68 Compact and Nuclear Operators; 251 Linear Operators; 390 Spectral Analysis of Operators). References [IJ D. Hilbert, Grundzuge einer allgemeinen Theorie der linearen Integralgleichungen, Teubner, second edition, 1924 (Chelsea, 1953). [2] E. HelIinger and O. Toeplitz, Integral- gleichungen und Gleichungen mit unendlich- vielen Unbekannten, Enzykl. Math., Teubner, 1928 (Chelsea, 1953). [3J 1. von Neumann, Mathematische Grund- lagen der Quantenmechanik, Spnnger, 1932, [4J J. von Neumann, Collected works II, III, Pergamon, 1961. [5J M. H. Stone, Linear transformations in Hilbert space and their applications to analy- sis, Amer. Math. Soc. Colloq. Publ., 1932. 
739 [6J B. Sz.-Nagy, Spektraldarstellung linearer Transformationen des Hilbertschen Raumes, Erg, Math., Springer, 1942. [7] F. Riesz and B. Sz.-Nagy, Leyons d'analyse fonctionelle, Akademiai Kiad6, third edition, 1955; English translation, Functional analysis, Ungar, 1955. [8] N. I. Akhiezer and I. M. Glazman, Theory of linear operators in Hilbert space, Ungar, I, 1961; II, 1963. (Original in Russian, 1950.) [9J K. Y osida, Functional analysis, Springer, 1965. [10] R. Courant and D. Hilbert, Methods of mathematical physics I, Interscience, 1953. [l1J p, R. Halmos, Introduction to Hilbert space and the theory of spectral multiplicity, Chelsea, second edition, 1957. [12J P. R. Halmos, A Hilbert space problem book, Van Nostrand, 1967. 198 (XI.1) Holomorphic Functions A. Differentiation of Complex Functions Let f(z) be a tcomplex-valued function defined in an open set D in the tcomplex plane C. We say that f(z) is differentiable at z if the limit Iim(/(z+h)- f(z))/h= {'(z) hO exists and is finite as the complex number h tends to zero. We call /,(z) the derivative of f(z) at z. This definition is a formal extension of the definition of differentiability of a func- tion of a real variable to that of a complex variable ( 106 Differential Calculus), but it is a much stronger condition than the differentia- bility of a real function, since z + h in (1) may be an arbitrary point in a 2-dimensional neigh- borhood of z. Hence many results essentially different from those for functions of a real variable follow from it. If a function f(z) is differentiable at each point of an open set D, it is said to be holo- morphic (or regular) in D, or {(z) is a holo- morphic function on D. (For the definition of holomorphy of a complex-valued function of several complex variables  21 Analytic Functions of Several Complex Variables C.) Let E be an arbitrary nonem pty subset of C. We say that /(z) is holomorphic on E if it is defined in an open set D containing E and is ho10morphic on D. Some results valid for differentiable real functions also hold for holomorphic functions. For instance, the de- rivative of a sum, product, or quotient is given by the usual rules. The derivative of a com- posite function is determined by the chain rule. 198 A Holomorphic Functions The set of all functions holomorphic in a tdomain D forms a tring. Suppose [(z) is holomorphic on D and /'(zo) #0, zoED. Then two curves that form an angle at Zo are mapped by f to two curves forming the same angle at f(zo). Because of this property, the mapping I is said to be conformal at all points z with f'(z) #0. The following four conditions are equivalent for a function [= u + iv defined on an open set D. (1) f is holomorphic in D. (2) u= u(x,y) and v = v(x, y) are ttotally differentiable at each point z = x + iy and satisfy the Cauchy- Riemann differential equations au/ox = av/oy, ou/oy = - av/ox. (1) (3) f is represented by a tpower series Lo cn(z - a)" in a neighborhood of each point a of D; that is, f(z) is analytic in D. (4) f is continuous and Scf(z)dz=O for every rectifiable Jordan closed curve C whose interior is contained, together with C, in D. The proposition that (1) implies (4) is called Cauchy's integral theorem, and the proposition that (4) implies (1) is called Morera's theorem. The hypothesis of Morera's theorem can be weakened as follows: Let f(z) be continuous in a domain D. If Scf(z)dz=O for every rectangle C in D with sides parallel to the axes and whose interior consists of only points of D, then f(z) is holomorphic in D. In the statement of this theorem, if we let C be an arbitrary circle, we get the same conclusion. The following complex differential operators are often useful: :z = (:x -i : y } :z = (:x +i : y ). Generally, (i.J<p/oz) = o({J/i.Jz. The Cauchy- Riemann equations above can be expressed in a single equation: oflaz = O. If f is holomor- phic, ofloz = f'(z). In order to show that [= u+ iv is holomor- phic in D, assumption (2) can be weakened. Actually, we have the Looman-Men'shov theorem: Suppose that u and v are continuous in D, oulox, ou/oy, av/ox, and av/oy exist at every point of D except for at most a count- able number of points, and the Cauchy- Riemann equations hold in D except for a set of 2-dimensional tmeasure zero; then f = u + iv is holomorphic in D. D. E. Men'shov extended this theorem and obtained various conditions for holomorphy. For example, he proved the following theorem: Iff is a topological map- ping of D and f is conformal in D (i.e., limarg(/(z+h)- f(z»)/h hO exists) except for at most a countable number of points, then f is holomorphic in D. 
198 B Holomorphic Functions As another type of sufficient condition for holomorphy, we have the proposition: Iff is locally tLebesgue integrable in D and satis- fies the Cauchy-Riemann equation ,{z=O in the sense of tdistribution, then there exists a holomorphic function 9 in D such that g = f talmost everywhere. B. Cauchy's Integral Theorem Cauchy's integral theorem can be stated as follows: If f(z) is a holomorphic function in a tsimply connected domain D in the complex plane, the equality Jcj(z)dz =0 holds for every (rectifiable) closed curve C in D. In particular, the integral JU(z)dz (ex, fJ ED) is uniquely deter- mined by ex and fJ provided that its path of integration lies in D. The function F(z)= ]f(Od( (zED) is called the indefinite integral of f. F(z) is holomorphic in D and F'(z) = ['(z). In the proof of his integral theorem, Cauchy assumed the existence and continuity of the derivative /,(z) in D. Ho'Wever, E. Goursat proved the theorem utilizing only the existence of /,(z), Actually, by virtue of the integral formula (2) in this section, the existence of f'(z) in D implies the continuity of {'(z). This is sometimes called Goursat's theorem. Let C, C l , C 2 , ..., C n be rectifiable Jordan curves. Suppose that C 1 , C 2 , "', C n are in the interior of C and that each one lies in the ex- terior of the others. If f(Z) is holomorphic in the region D bounded by these n + I curves and continuous on D U CU C 1 U... U C n = D, then we have f/(Z)dZ= JI Lf(Z)dZ. Here the curvilinear integrals are taken in the positive direction (i.e., we take the direction such that JC<z-afl dz= ]c,(z-afl dz=2ni for a point a in the interior of Cor C k , respec- tively). Henceforth, an integral along a closed curve is taken in the positive direction unless otherwise noted. Cauchy's integral theorem under the assumption that f is holomorphic in D and is continuous on D is sometimes called the stronger form of Cauchy's integral theorem. Under the same assumptions as in the stronger form, we have Cauchy's integral for- mula for zED: f(Z)= f f(O d( - f f(O d(... 2m c ( - z 2m c 1 ( - z - f f(O d(. (2) 2ni c, ( - z This integral formula expresses the value of 740 f(z) at a point z in the domain D in terms of the values of f on the boundary of D. In par- ticular, when n = 0, the integral formula reads as f(Z)= f (0 d(. 2m c!"-z (3) Furthermore, if C is a circle Izi = R (i.e., D is the disk Izl <R), we obtain Poisson's integral formula: ['(z) == 1 f zn. R Z _r 2  f(Re"P) d<p 2n 0 R 2 +r 2 -2Rrcos(O-<p)' f(z) = 1 f 2n . Rei"'+z  Ref(Re'''')dq1+iImf(O), 2n 0 Re''''-z z=re ie , O.s;r<R. (3)' Formula (3)' is valid for a tharmonic function. Let C be a rectifiable Curve and f(O be a continuous function defined on C; then the integral of Cauchy type F(Z)= f (0 d( 2m c!"-Z is holomorphic outside C. The nth derivative F(n) of F is given by (n!/2ni) Jcf(OI«( _z)n+l d(; moreover, F can be expanded in a Taylor series about every point a outside C: en F(z)= L an(z-a)", n==O F(n)(a) an =---;;!, which converges in Iz-al<p, p being the distance from a to C. In particular, formula (3) implies that a holomorphic function f is in- finitely differentiable and is expanded in a Taylor series about every point of D as above. Let C be a closed curve not passing through a point a. Then the integral (1/2ni)Jcdz/(z-a) is an integer. It is calIed the winding number of C about a and is denoted by n(C;a). A cycle y (a finite sum of oriented closed curves) in an open set D is said to be homologous to zero in D if n(y; a) = 0 for all points a in the com- plement of D. The general form of Cauchy's integral theorem is stated as follows. Iff is holomorphic in D, then Lf(z)dz=O for every cycle y which is homologous to zero in D (E. Artin). From this we have the general form of Cauchy's integral formula: if f is holomorphic in a domain D, then , 1 f f(O n(y;z)'j(z)=-----;- -dz, 2m y ( - z zED-y, for every cycle y which is homologous to zero in D. 
741 C. Zero Points Let f(z) be a holomorphic function not identi- cally equal to zero. If f(a) = 0, we call a a zero point of f Every zero point of f is an isolated point, and there exists a unique positive inte- ger k and a function h holomorphic at a such that f(z)=(z-a)kg(z), g(a)#O. We call k the order of the zero point a and a a zero point of the kth order. The equality (4) implies that the tTaylor series of f(z) at a begins with the term ck(z - at Suppose that a is a zero point of ,[(z) - y of the kth order; then we call a a y-point of the kth order. For a function f(z) defined in a neighbor- hood of the tpoint at infinity, we set f(l/w) = g(w) (/(oo)=g(O)) and call f holomorphic at 00 if 9 is holomorphic at 0; f is said to have a zero of order k at 00 if g has a zero of order k at O. If two functions f and 9 are holomorphic in D and f(z)=g(z) on a subset E that has an taccumulation point in D, then f is identically equal to 9 in D (theorem of identity or unique- ness theorem) since the zeros of holomorphic functions must be isolated. D. Isolated Singularities Let f(z) be holomorphic in an annulus D ={zIR I <lz-al<R 2 ,0.s;R I <R 2 .s; +oo}. Then f(z) is expanded in the tLaurent series +00 f(z) = L cn(z - a)". n=" - 00 This is called the Laurent expansion of f about a. The coefficients C n are given by C n = (1/2ni) Scf«()d(/«( -- a)"+! with C = {z Ilz - al = r}, RI <r< R 2 . In particular, If f(z) is holo- morphic in D= {zIO< Iz-al < R} (or, if a = 00, in D= {zl R < Izi < +oo}) but not holomor- phic in D U {a}, we call a an isolated singular point (or isolated singularity) of f By utilizing the tlocal canonical parameter t = z - a (or t = I/z for a = 00), the Laurent expansion (5) of f is then written as f(z) = L;; -00 cnt n + Ln+ cnt n . The second sum is an ordinary power series, called the holomorphic part of f(z). The first sum is a power series of l/t with no constant term, called the singular part of f(z) at a or the principal part of the singularity (or of the Laurent expansion at a). If we have lim Ho (f(z) = 0, the Laurent expansion (5) of f(z) lacks its singular part, and the limit of f(z) exists as t-->O (z-->a) and is equal to Co. If we set f(a) = Co, then the func- tion f(z) is holomorphic in DU {a}. In this case, the point a is called a removable singular- 198 E Holomorphic Functions (4) ity. If f(z) is bounded in a neighborhood of a singularity a, then a is removable (Riemann's theorem). Usually, we assume that the remov- able singularities of a function have already been removed in this way. When the singular part of f(z) at a exists and consists of a finite number of terms, the point a is called a pole; when it consists of an infinite number of terms, the point is called. an essential singularity. If a is a pole, f(z) is represented by the Laurent series L;;: -k cnt n (ck#O) and/(z)-->oo as z-->a. In this case, the index k is called the order of the pole a. Then a relation such as (4) holds, where the index k is replaced by - k. Hence the point a is some- times called a zero point of the - kth order. If a IS an essential singularity, then for an arbitrary number C there exists a sequence Zn converging to a such that Iimnoo f(zn) = C (the Casorati- Weierstrass theorem or simply Weierstrass's theorem). Related to Weierstrass's theorem, we have tPicard's theorem, which gives a de- tailed description of the behavior of a function around its singularities. E. Residues Let a( # 00) be an isolated singularity of f(z). Then the coefficient C I of (z - ar l in the Laurent expansion (5) of f(z) is called the residue of f(z) at a and is denoted by Res[fJa, R( a; f), or R( a) if we need not indicate f We have (5) R(a)=cI = r f«()d(, 2m JI(-al=r where the integral is taken in the positive direction along a path for 0 < r < R. If f(z) is holomorphic at z=a, then R(a)=O, If f(z) has a po Ie of the first order at a, R(a) = lim (z - a)f(z). za The residue at the point at infinity is defined to be -a_I> where a_I is the coefficient of I/z of the Laurent expansion of f(z) at 00 :j(z) = L -00 anz n , and we have -I i -a_I =- 2 ' f«()d(, m 1(Ir R- I <r< +00. Thus the notion of the residue of f(z) is actu- ally related to the differential form f(z)dz and not to f(z) itself. From the first formula in this section and the formula for - a -I, the residue theorem follows (Cauchy, 1825): Let C be a rectifi- able Jordan curve in the complex plane. Let ai' "', am be a finite number of points inside C, and let D be a domain containing C and its 
198 F Holomorphic Functions interior. If f(z) is a function holomorphic in D -{ai' ...,a m }, we have 1 f m ---; f(z)dz= L R(a n ). 2m c n1 Furthermore, if f(z) is holomorphic in the extended complex plane (including the point at infinity) except for a finite number of poles, the sum of all residues is equal to zero. F. Calculus of Residues The calculus of residues is a field of calculus based on application of the notion of residues. For example, we have methods for the calcu- lation of definite integrals. ActuaIly, one of the reasons why Cauchy studied the theory of complex functions was that he believed that the theory would provide a unified method of computing definite integrals. For example, if <p(z) is a rational function without poles on the real axis and with a zero point at infinity whose order is at least 2, then we have f uc <p(x)dx=2ni L R(ex;<p(z)), (6) -CIJ Ima>O f oc eiX<p(x)dx=2ni L R(ex;eiz<p(z)), (7) - Ima>O Here the sums are taken over all the poles in the upper half-plane. Formula (7) is valid also for a rational function <p(z) with a simple zero at infinity. If <p(z) has simple poles at a k (k = 1, .. . , n) on the real axis, then we take the principal values of the integrals at those poles and add niR(a k ) (k = I, ... ,n) to the terms on the right-hand side of (6) and (7). Sometimes we use the residue theorem to obtain the value of the sum of a series (e.g., the tGaussian sum) by expressing it as an integral. Let {(z) be a single-valued function that is tmeromorphic and not identically equal to zero in a domain D, and let <p(z) be a function holomorphic in D. Draw a rcctifiablc Jordan curve C in D such that the interior of C is contained in D and f(z) has neither zeros nor poles on C. Let ex I' , exN and /31> . , /3p be the zeros and poles inside C, respectively (where each of them is repeated as often as its order). Then we have N p I f I'(z) L <p(ex n ) - L <p(fJp) =- 2 ' <p(z) f( ) dz. n1 p1 m c z If <p(z) = I, we get  f dargf(z)=N-P. 2n c This is called the argument principle. Next, let f(z) be a function meromorphic for Izl < 742 R.s; +00,/(0)#0, #00, and Set <p(z)=logz. Take C as a closed curve consisting of the boundary of an annulus 0 < p < Izl < r < R (where p is sufficiently small) and two sides of a suitable tcrosscut joining a point of Izl == p and Izl = r. Then we obtain Jensen's formula: log I "'1> ex 2 ,. , exN I =loglf(O)1 +(N - P)logr fJI,fJ2, ...,fJp - f 2" log If(rei'l') I dl/J, 2n 0 The argument principle can be utilized to prove Rouche's theorem: Let f(z) and g(z) be functions holomorphic in a domain D that contains a rectifiable Jordan curve C and its interior. Suppose that f(z) + Ag(z) never van- ishes on C for any ..1. with O.s;A.s; 1. Then the number of zeros of f(z) in the interior of Cis equal to that of f(z) + g(z). If If(z)1 > Ig(z)1 on C or argf(z)-argg(z)#(2n+ I)n (n is an integer), the hypothesis of Rouche's theorem is satisfied. This theorem is useful in proving the existence of a zero of a complex function (for example, a polynomial) and in finding its position. G. Analytic Continuations Let f(z) be a holomorphic function in a do- main D of the complex plane C and D* be a domain containing D as a proper subset. If there exists a function F(z) ho1omorphic in D* that coincides with f(z) in D, then F(z) is called an analytic continuation (or analytic prolonga- tion) of {(z) from D to D*. By the theorem of identity an analytic continuation of F(z) is uniquely determined if it exists. The function fdz) defined by the power series P(z; a) = Lo an(z - a)" with the radius of convergence r I> 0 is holomorphic in the domain DI :Iz-a!<r l , and at a point b of DI it can be expanded into a power series P(z; b) = Lo bn(z - b)" with the radius of conver- gence r2 (;;'r l -Ib-al). Ifr 2 >r l Ib-al, the domain D 2 :lz-bl<r2 is not entirely contained in Dl' Let 12(Z) be the function defined in D 2 by P(z; b). Then the function F(z) that is equal to 11 (z) in DI and to {2(Z) in D 2 is an analytic continuation of fl (z) from DI to DI U D 2 (a direct analytic continuation by power series). We have the following classical theorems about analytic continuations: Let Dl and D 2 be two disjoint domains, and suppose that their respective boundaries C I and C 2 are trectifiable simple closed curves and that the intersection of C I and C 2 contains an open arc r. If two holomorphic functions fl (z), f2(Z) defined in D j and D2' respectively, have finite common tboundary values at every 
743 point of r, then there exists an analytic con- tinuation F(z) of fl (z) and f2(Z) to DI U r U Dz (Painleve's theorem). We sometimes call fz(z) a continuation of fl (z) beyond r, If r is not rectifiable, the continuation beyond r does not exist, in general. Let f(z) be holomorphic in a tJordan do- main D lying in the half-plane on one side of the real axis and containing an open interval I of the real axis in its boundary. If [(z) has finite real boundary values at every point of I, then it can be continued analytically beyond I to the other side of the real axis; there the continued function is given by f(Z) (Schwarz's principle ofreflection). This theorem can be generalized to the case where the real interval is replaced by an tanalytic curve. A harmonic continuation of tharmonic func- tions is defined analogously to analytic con- tinuation. Let D be a Jordan domain lying in the half-plane on one side of the real axis and having an open interval I on the real axis as a part of its boundary. If u(z) is harmonic in D and has the boundary value 0 at every point of I, then u(z) has a harmonic continuation beyond I. F or other properties of holomorphic func- tions  43 Bounded Functions; 429 Tran- scendental Entire Functions. H. Analytic Functions A real-valued function f(t) of a real variable t is said to be analytic at t = to if it can be repre- sented by a tpower series in t - to in a neigh- borhood of to in R. If f(t) is defined on an open set of R at every point of which it is analytic, then f(t) is called an analytic function, or, more precisely, a real analytic function. Analogously, a complex-valued function f(z) of a complex variable z defined on a tdomain D of the complex plane C is said to be analytic at z = Zo (E D) if it can be represented by a power series in z - Zo in a neighborhood of Zo in C, and f(z) is an analytic function in D if it is analytic at every point of D. In the remainder of this article, we are concerned with analytic functions in this sense, To distinguish them from the real case, they are also called complex analytic functions. A complex analytic function f(z) defined on D is tdifferentiable in D; there- fore it is tholomorphic in D. The converse is also true, Thus the term "analytic function" is synonymous with "holomorphic function" insofar as it concerns a complex function (i.e" a complex-valued function of a complex vari- able) on a domain, but in the theory of func- tions it takes on an additional meaning that is explained in the following section 198 J Holomorphic Functions I. Analytic Functions in the Sense of Weierstrass Let a be a point of the tz-sphere and t the tlocal canonical parameter at a; i.e., t = z - a if a # 00 and t = Z-I if a = 00. If a power series P(z; a) = Lo cntn has a positive radius ra of convergence, we call P(z; a) a function element with center a on the z-sphere, after K. Weier- strass. P(z;a)=LOCn(z-a)" if a# 00, and P(z; a) = Lo cnz- n if a= 00. These represent a holomorphic function in Iz -al <ra or in r a - I < Izl.s; 00, respectively, If b is a point inside the circle of convergence of the function element P(z; a), by the tTaylor expansion of P(z; a) at z=b, we obtain the power series P(z;b) in z - b, which is a direct analytic continuation of P(z; a). Let a and b be two points on the z-sphere, and let c: z = z(s) (O.s; s.s; 1, z(O) = a, z(I)= b) be a curve joining a and b. We say that P(z; a) is analytically continuable along C and that we obtain P(z; b) at the end point b by the analytic continuation of P(z; a) along C if the following two conditions are satisfied: (i) To every SE [0,1] there corresponds a function element P(z; z(s)) with center Z(5); (ii) for every SoE[O, 1], we can take a suitable subarcz= z(s) (I S - So I.s; £, £ > 0) of C contained inside the circle of convergence of P(z; z(so)) such that every function element P(z; z(s)) with Is-sol.s;£ is a direct analytic continuation of P(z; z(so)). When P(z; a) and the curve Care given, the analytic continuation along C is uniquely determined (uniqueness theorem of the analytic continuation). Given a function element P(z; a) with center a, the set of all function elements obtained by every possible analytic continuation along every curve starting from a is called an ana- lytic function in the sense of Weierstrass deter- mined by P(z; a). In this definition, we can restrict the curves to polygonal lines. An ana- lytic function in this sense is completely deter- mined by a single arbitrary function element belonging to it, so two analytic functions are identically equal if they have a common func- tion element. A tgerm of a holomorphic function is iden- tical to a function element, and the set of all germs has the natural structure of a tsheaf (f). In the terminology of sheaves, an analytic function is a connected component of (I.!, and an analytic continuation along a curve C is a continuous curve r in (I.! whose projection is C. J. Values and Branches of Analytic Functions The value of an analytic function at a point b is, by definition, the value at b of its function elements with center b (whose existence is 
198 K HoIomorphic Functions assumed; there may be several such elements). An analytic function is, in general, a multiple- valued function because analytic continua- tions along different curves with the same end points may lead to different function elements. For a given analytic function f(z), if the max- imal number of its function elements with the same center is n, we say it is n-valued, and if n;" 2 we say it is multiple-valued (or many- valued). The number of function elements of f(z) with the same center is at most tcountably infinite, so the value of f(z) at a point is a countable set (Poincare-Volterra theorem). By introducing a tRiemann surface instead of the complex plane as the domain of definition of an analytic function, we can regard multiple- valued analytic functions as single-valued functions defined on a suitable Riemann sur- face ( 367 Riemann Surfaces). Let f(z) be an analytic function and P(z; a) be a function element belonging to [(z), where a is a point of a domain D. The set of all function elements obtained from P(z; a) by every pos- sible analytic continuation along all curves in D is caIled a branch of f(z) in D determined by P(z; a). When D coincides with the whole com- plex plane, the branch of f(z) in D is the func- tion f(z) itself. A function holomorphic in a domain D can be expanded in a power series with any point of D as its center, and the set of these power series (function elements) consti- tutes a branch of an analytic function. If analytic continuations of a function ele- ment are possible along all curves in D, then the analytic continuations along two tho- motopic curves in D lead to the same result (monodromy theorem). In particular, if D is tsimply connected and if analytic continua- tions of P(z; a) are possible along all curves in D starting from a, then the branch of f(z) in D determined by P(z; a) is single-valued. K. Invariance Theorem of Analytic Relations Suppose the following four conditions hold: (1) F(z, w) is a holomorphic function of two vari- ables for ZEl and WE2' where l> 2 are domains in the complex plane. (2) A curve C: z=z(s) (O.s;s.s; l,z(O)==a,z(I)=b) and two sets of function elements P(z;z(s)) and Q(z;z(s)) defined for every s (O.s; s.s; 1) are given. (3) P(z; a) and Q(z; a) can be continued analyti- cally along C using P(z; z(s) and Q(z; z(s)), respectively, (4) There exists a positive number R(s) for every s (O.s;s.s; 1) such that, iflz-z(s)1 < R(s), the values of P(z; z(s)) and Q(z; z(s)) belong to 1 and 2' respectively. Under these conditions, if F(P(z; a), Q(z; a)) == 0 holds for Iz -al < R(O), then F(P(z; b), Q(z; b))=O holds for Iz - bl < R(l). In other words, an analytic 744 relation between function elements belonging to two analytic functions that holds in a neigh- borhood of the starting point of a curve C is conserved for function elements with center at the terminal point b of C. This is caIled the invariance theorem of analytic relations. The same statement is valid for relations among more than two analytic functions and their derivatives (differential equations). L. Inverse Functions Suppose that P(z; a) (a # 00) belongs to an analytic function f(z) and P'(a; a) #0. We consider the inverse function of P(z; a) in a neighborhood of a and let !B(w; ex) (ex = P(a; a)) be its expansion as the power series in w - ex. We calI !B(w; ex) the inverse function element (or simply inverse element) of P(z; a) and the ana- lytic function determined by !B(w; ex) the inverse analytic function (or simply inverse function) of f(z). The inverse function is completely deter- mined by f(z) and is independent of the choice of P(z; a). For example, analytic functions rep- resented by j; or logw are defined as the inverse function of Z2 or e Z , respectively. M. Singularities of Analytic Functions Hereafter, when we speak of a curve C: z = z(s) (O.s; s.s; 1), it is always supposed that C is a curve in the complex plane starting at a and ending at w. Let Kr be the open disk Iz-wl < r; we denote by C r the connected component of C n Kr that contains w, If analytic continu- ations of P(z; a) are possible along any su barc of C with a terminal point arbitrarily near w but impossible along the whole C, we say that the analytic continuation of P(z; a) along C defines a singularity 0 of the coordinate w, and that 0 lies over w. For example, if P(z; a) has a finite radius of convergence, for a suitable point w on the circle of convergence the ana- lytic continuation of P(z; a) along the radius aw defines a singularity over w. Now take a point Zr on C r and denote by Fr(z) the branch of an analytic function determined by P(z; zr) in Kr. Let 0 be a singularity determined by C and P(z; a), and suppose that we are given another singularity 0* over w determined by C* and P*(z; a*). If they define the same branch Fr(z) for every K" by definition, we put 0=0*. Thus Fr(z) defines an tunramified covering surface J-v, of Kr - {w}, and it is single-valued on J-v,. Singularities are classified according to the geometric structure of J-v, and the value distri- bution of Fr(z) on it. First, if J-v, has no trela- tive boundary over 0 < Iz - wi < r for a suitable r, then 0 is called an isolated singularity of the 
745 analytic function. In this case, the number k of points of  lying over a point z in Kr - {OJ} is constant. If k = 00,  has a tlogarithmic branch point over OJ, and Q is called a loga- rithmic singularity. If k < 00, Fr(z) can be repre- sented as a single-valued holomorphic func- tion in 0 < I tl < r 1 / k by putting z = OJ + t k . In this case, if we introduce an additional point Po corresponding to z = OJ, then  U {Po} has only an talgebraic branch point over OJ. Now, taking into account the value of w = Fr(z), we call Q an algebraic singularity if lim w exists. In this case, we have Fr(Z)=L,cntn, and if we admit analytic continuations in the wider sense ( Section 0), P(z; a) is analytically con- tinuable along the whole C. N. The Natural Boundary Given a domain D and an analytic function f(z) holomorphic in D, if all boundary points of Dare singularities of f(z) and f(z) is not continuable to the exterior of D, the boundary of D is called the natural boundary of f(z). This phenomenon was first discovered for telliptic modular functions. Many results are known about power series for which the circle of convergence is the natural boundary ( 339 Power Series). For any given domain D in C, there exists an analytic function whose natural boundary is th boundary of D. The original proof of this fact, given by Weierstrass, con- tained a defect that was corrected by J. Besse. O. Analytic Continuation in the Wider Sense Let two tLaurent series (with parameter t) z = P(t)=Lkantn and w=Q(t)=L,bntn (k and I are integers, and a k b , # 0) converge in 0< I tl <r, and let (P(t d, Q(t d) # (P(t2), Q(t 2 )) if t l # t 2 ; then we say that the pair (P, Q) de- fines a function element in the wider sense. If a change of parameter r=r 1 t+r 2 t 2 +... (rl # o and the radius of convergence > 0) gives P(t) = IT(r), Q(t) = K(r), we say that (IT, K) and (P, Q) define the same function element. By a suitable choice of parameter, any function element can be given in the form z = t k + a (or a = t- k ), w = L,bntn, and the elimination of t gives the representation of w as a tPuiseux series of z. So if k = 1 and I;" 0, it reduces to a holomorphic function element. When k= 1, with 1<0 not excluded, the above element is called a rational element. If k > 1 it is called a ramified element, and if 1<0 it is called a polar element. If P', Q' are the direct analytic continuations of P and Q at to (0 < I to 1< r), i.e., their Taylor expansions at to, the function element (P', Q') is called a direct analytic continuation of (P, Q), 198 P Holomorphic Functions which is also considered its own direct analytic continuation. For a fixed r, the set of all direct continuations of (P, Q) thus obtained is called an analytic neighborhood of (P, Q), and these neighborhoods define a topology in the set of all function elements. A curve in this topolog- ical space is called an analytic continuation in the wider sense, and a tconnected component of this space is called an analytic function in the wider sense. An analytic function in the wider sense is a set of function elements in the wider sense, but it can also be regarded as a function w= [(z) (with an independent variable z and a dependent variable w) defined by each function element p(z, w):z= P(t), w=Q(t). An analytic continuation in the wider sense, p(s) = p(z, w, s); 00 z=z(s)+tk(S), w= L cn(s)t n , O.s;s.s;l, nl(,) is sometimes called an analytic continuation along the curve C:z=z(s) (O.s;s.s; 1) in the complex plane. If all p(s) are holomorphic function elements, this coincides with the analytic continuation along C in the original sense, but if this is not the case, p(O) and C do not necessarily determine p(l) uniquely. Actu- ally, an analytic function in the wider sense is just an analytic function in the original sense with at most a countable number of ramified or polar elements added. P. Singularities of Analytic Functions in the Wider Sense Suppose the following three conditions hold: (1) For every point on C except OJ, that is, for z(s) (O.s; s < 1), a function element in the wider sense p(z, w; s) is given. (2) For every ..1. ( < 1), p(z, w; s) (O.s; s.s; ..1.) constitutes an analytic con- tinuation in the wider sense. (3) It is impossible to find a function element p(z, w; 1) such that p(z, w; s) (O.s; s.s; 1) is an analytic continuation in the wider sense. When these three condi- tions are satisfied, we say that p(z, w; s) (O.s; s < 1) defines a transcendental singularity Q with OJ as its coordinate. The method of determin- ing a branch w = Fr( z) in an open disk with center OJ is completely parallel to the case of holomorphic analytic functions. Because of the appearance of function elements in the wider sense in Fr(z), the covering surface  of Kr defined by Fr(z) may have algebraic branch points, If  has a logarithmic branch point over OJ, Q is called a logarithmic singularity. If  has no point over OJ for suitable r, Q is called a direct transcendental singularity; other- wise, it IS called an indirect transcendental singularity. The logarithmic singularities are 
198Q Holomorphic Functions direct singularities. The inverse function of Z = w sin w has a direct singularity over Z = 00 that is not logarithmic, and the inverse func- tion of z = (sin w)/w has an indirect singularity over z = O. Taking into account the value of w = F,(z), if the tcluster set of F, at Q: SQ = nr>o {Fr(z)} consists of only one point, it is an ordinary singularity; if not, it is an essential singularity. Q. History A function of a complex variable is monogenic in the sense of A. L. Cauchy if it is differenti- able at every point of its domain of definition. It was B, Riemann who succeeded in develop- ing Cauchy's concept. Riemann considered an analytic function as a function defined on a t Riemann surface, that is, a I-dimensional complex analytic manifold. On the other hand, Weierstrass constructed the theory of analytic functions starting from power series, When we speak of single-valued functions defined in a domain of the complex plane, the monogenic functions of Cauchy and the analytic functions of Weierstrass are identical. Although the analytic functions are very special functions, the study of complex analytic functions is usually called the theory of functions of a complex variable, or simply the theory of functions. By considering the folIowing point set C, which is more general than a domain, E. Borel showed that a monogenic function on C is not necessarily holomorphic in the ordinary sense. Take a countable dense subset in a subdomain D' of a domain D and a double sequence of positive numbers {rh)}. Put Sh)= {zllz-znl < rh)} and C(h) = D - UI Sh). By a suitable choice of rh), we can suppose that the C(h) are connected and monotone increasing with respect to h. Put C= U;:;'l C(h). A function defined in C is by definition monogenic if it is differentiable in Oh) for every h. For such a monogenic function, Cauchy's tintegral for- mula in a generalized form holds, and the function is infinitely differentiable. If f(z) and g(z) are monogenic in C and coincide on a curve in C, then they are identical in C. Let D be the set {zIO<Rez< 1,0<Imz< I} and {zn} be all rational points in D (zn=(p+iq)/m). For a natural number h, we define C(h) to be the set D minus the union of open disks with radius exp( - em')/h and center (p + iq)/m. The function 00 m m exp( _ e m4 ) f(z) = L L L . ml po q=O Z - (p + Iq)/m is monogenic in C in the above-mentioned sense, but not holomorphic in C. The study of 746 these functions developed into the theory of tquasi-analytic functions. The concept of tanalytic functions of several complex variables can also be defined analo- gously to the case of one variable. Then non- uniformizable singularities appear that lead to a generalization of the concept of tmanifolds ( 23 Analytic Spaces). References [lJ E, Borel, Le"ons sur les fonctions mono- genes uniformes d'une variable complexe, Gauthier- Villars, 1917. [2J A, Hurwitz and R. Courant, Volesupgen iiber allgemeine Funktionentheorie und eIlip- tische Funktionen, Springer, fourth edition, 1964. [3] L. Bieberbach, Lehrbuch der Funktionen- theorie, Teubner, 1,1921; 11,1927 (Johnson Reprint Co., 1969). [4] E. C. Titchmarsh, The theory of functions, Oxford Univ. Press, second edition, 1939. [5] C. Caratheodory, Funktionentheorie I, II, Birkhiiuser, 1950; English translation, Theory offunctions, Chelsea, I, 1958; II, 1960. [6J S. Saks and A. Zygmund, Analytic func- tions, Warsaw, 1952. [7] L. V. Ahlfors, Complex analysis, McGraw- HiIl, 1953; third edition, 1979. [8J H. Behnke and F. Sommer, Theorie der analytischen Funktionen einer komplexen Veranderlichen, Springer, second edition, 1962. [9] H. Kneser, Funktionentheorie, Vanden- hoeck & Ruprecht, 1958. [10] E, HiIle, Analytic function theory, Ginn, I, 1959; II, 1962. [11] H. P. Cartan, Theorie e1ementaire des fonctions analytiques d'une ou p1usieurs vari- ables complexes, Hermann, 1961; English translation, Elementary theory of analytic functions of one or several complex variables, Addison-Wesley, 1963. [12] A. Dinghas, VorIesungen iiber Funk- tionentheorie, Springer, 1961. [13J R. Nevanlinna and V. Paatero, Einftih- rung in die Funktionentheorie, Birkhiiuser, 1964; English translation, Introduction to complex analysis, Addison-Wesley, 1968. [14] M. Heins, Complex function theory, Academic Press, 1968. [15J B, A, Fuks and V. I. Levin, Functions of a complex variable and some of their applica- tions I, II, Pergamon, 1961. (Original in Rus- sian, 1951.) [16] W, Rudin, Real and complex analysis, McGraw-Hill,1966, [17J S. Lang, Complex analysis, Addison- Wesley, 1977. As to the Looman-Men'shov theorem, 
747 [18] D, Men'shov (Menchoff), Les conditions de monogeneite, Actualites Sci. Ind., Hermann, 1936. [19] D. Men'shov (Menchoff), Sur la generali- sation des conditions de Cauchy-Riemann, Fund. Math., 25 (1935), 5997. [20J S. Saks, Theory of the integral, Warsaw, 1937, 188-201. For a new proof of Cauchy's integral theorem, [21] E. Artin, On the theory of complex func- tions, Notre Dame mathematical lectures, 1944, 57  70. Also  [7, 17]. For problems, [22] L. V olkovyskii, G. Lunts, and I. Aramo- vich, A collection of problems on complex analysis, Addison-Wesley, 1965. For analytic continuation, [23] H. Weyl, Die Idee der Riemannschen Fliiche, Teubner, 1913; third edition, 1955; English translation, The concept of a Riemann surface, Addison-Wesley, 1964. Also  [1- 17]. For the history and concepts of analytic functions, [24] G, Julia, Essai sur Ie developpement de la theorie des fonctions de variables complexes, Gauthier-Villars, 1933. 199 (IV.12) Homogeneous Spaces A. General Remarks Let M be a tdifferentiable manifold. If a tLie group G acts ttransitively on M as a tLie transformation group, the manifold M is said to be a homogeneous space having G as its transformation group ( 431 Transformation Groups). The tstabilizer (isotropy subgroup) Hx of G at a point x of M is a closed subgroup of G, and a one-to-one correspondence be- tween G/ Hx and M preserving the action of G is defined by associating the element sHx (SE G) of G/Hx with the point s(x) of M. This corre- spondence is a tdiffeomorphism between the manifold M and the quotient manifold G/Hx if the number of connected components of G is at most countable. Under this condition we may therefore identify a homogeneous space M with the quotient manifold G/ H of a Lie group G by a closed Lie subgroup H ( 249 Lie Groups). However, H is not uniquely determined by M, and it may be replaced by Hs(x) =sH x S- 1 (SE G), Each element h of the stabilizer Hx at a point x induces a linear transformation fi on the ttangent space V x of M at the point x. The set Fix of all fi is called the linear isotropy group at the point x. 199 B Homogeneous Spaces If we represent the homogeneous space M as G/H, we obtain the canonical map n:s-->sH of G onto M, which we call the projection of G onto M. Let 9 be the tLie algebra of G, and l) the Lie subalgebra corresponding to the closed subgroup H. When we identify 9 with the tangent space at the identity element e of G and E) with its subspace, the projection n in- duces a linear isomorphism of 9/E) with the tangent space V x of M at the point x=n(e). The tadjoint representation of G gives rise to a linear representation h-->Ad(h) modulo E) of the group H on the linear space 9/E). Through the linear isomorphism between 9/E) and the tan- gent space V x defined by the projection n, this representation of H is equivalent to the one which associates with h the linear transforma- tion fi defined by h on the tangent space V x . The homogeneous space G/H is said to be reductive ifthere exists a linear subspace m of 9 such that 9=E)+m (direct sum as linear spaces) and (AdH)mcm. H is said to be re- ductive in 9 if the representation h-->Ad(h) of H in 9 is tcompletely reducible. If a ttensor field P on the homogeneous space M = G/H is G-invariant (namely, invar- iant under the transformations defined by the elements of G), then the value of P at the point x = n(e) is a ttensor over the tangent space V x at x which is invariant under the linear iso- tropy group H. Conversely, such a tensor over V x is uniquely extended to a G-invariant tensor field on M, If G/H is reductive, then G- invariant tensor fields over M are in one-to- one correspondence with FI-invariant tensors over m, For instance, if H is compact, then H is reductive in 9 and an FI-invariant positive definite quadratic form on m defines a G- invariant tRiemannian metric on G/H, We say that the homogeneous space M = G/H is a Riemannian (linearly connected, complex Hermitian, Kahler) homogeneous space if there exists on MaG-invariant Rie- mannian metric (tlinear connection, tHer- mitian metric, tKiihler metric). Concerning such homogeneous spaces, there are various results on their structures and geometric prop- erties [1-5] (412 Symmetric Riemannian Spaces and Real Forms; 427 Topology of Lie Groups and Homogeneous Spaces). B. Examples Stiefel Manifold. A k-frame (1 .s; k.s; n) in a real n-dimensional Euclidean vector space Rn is an ordered system consisting of k linearly inde- pendent vectors. If we regard the real tgen- erallinear group of degree n, GL(n, R), as the regular linear transformation group of Rn, GL(n, R) acts transitively on the set V:,k(R) of 
199 Ref. Homogeneous Spaces all k-frames in Rn. Therefore, if H denotes the subgroup of GL(n, R) consisting of the ele- ments which leave fixed a given k-frame v, we may identify the set V,;,k and the quotient set GL(n,R)/H. Transferring the differenti- able manifold structure of GL(n, R)/H to V,:,k through this identification, we see that V;.k(R) = GL(n,R)/H becomes a homogeneous space (the differentiable manifold structure of V;,k is defined independently of the choice of v). The space V;,k(R) is called the (real) Stiefel manifold of k-frames in Rn. A k-frame is called an orthogonal k-frame if the vectors belonging to the frame are of length 1 and are orthogonal to each other. The set Vn,k(R) of all orthogonal k-frames is a sub- manifold of V;,k(R). The torthogonal group O(n) acts transitively on v,,',dR), which is a homogeneous space represented as Vn,k(R) = O(n)/I k x O(n - k). The manifold V n , dR) is actually the (n - 1 )-dimensional sphere. We call v",k(R) the (real) Stiefel manifold of orthogonal k-frames (or simply Stiefel manifold). The complex Stiefel manifold Vn,k(C) = U(n)/I k x U(n-k) is defined analogously. Grassmann Manifold. Let Mn,k(R) (l.s;k .s;n) be the set of all k-dimensionallinear sub- spaces of R n . The group O(n) acts transitively on Mn,dR), so that we may put Mn,k(R) = O(n)/O(k) x O(n-k). Here O(k) and O(n-k) are identified with the subgroups of O(n) con- sisting of all elements leaving fixed every point of a fixed (n - k)-dimensional subspace and of its orthogonal complement, respectively. In this way, Mn,k(R) is a homogeneous space, which we call the (real) Grassmann manifold. The tproper orthogonal group SO(n) acts transitively on Mn,k(R), and Mn,k(R) may be represented as a homogeneous space having SO(n) as its transformation group. It follows that M n k(R) is connected. The homogeneous space n,k(R) = SO(n)/SO(k) x SO(n - k) is called the Grassmann manifold formed by oriented subspaces. Mn,l (R) and Mn, I (R) may be identified with the (n - 1 )-dimensional real projective space and the (n -I)-dimensional sphere, respectively. Applying the above process for real Grass- mann manifolds to the complex Euclidean vector space C n instead of Rn, we see that the set Mn,k(C) of all k-dimensionallinear sub- spaces in C n is a homogeneous space with the tunitary group U (n) of degree n as its transformation group, and we represent it as U(n)/U(k) x U(n-k). This space is called the complex Grassmann manifold, The mani- fold Mn,k(C) is a simply connected complex manifold and has a cellular decomposition as a tCW complex whose cells are tSchubert varieties ( 56 Characteristic Classes E). On 748 the other hand, Mn,dc) may be regarded as the set of all (k -1)-dimensionallinear sub- spaces in the (n-l)-dimensional complex projective space. Then, by using the tPliicker coordinates of these subspaces, Mn,k(C) can be realized as an talgebraic variety without sin- gularity in the projective space of dimension (:) - 1 ( 90 Coordinates B). Sometimes Mn,dR) is denoted by Gn,dR) or G(n, k). In the same way, the homogeneous space represented as Sp(n)/Sp(k) x Sp(n - k) is called the tquater- nion Grassmann manifold and is denoted by Mn,k(H). Flag Manifold. Let k" ..., k r be a sequence of integers such that n> k 1 > .., > k r > 0, and let F(k" .., , k r ) be the set of all monotone se- quences VI =:>... =:> v" where v,(i == 1, ..., r) is a k,-dimensionallinear subspace in Rn. For the two sequences VI =:> ... =:> V, and Vi =:> .., =:> v,' belonging to F(k l , ..., k r ), there exists an ele- ment sEGL(n,R) such that S(I-';) = 1-';' (i= 1,. .,r). Therefore F(k l , ...,k r ) is a homogene- ous space with GL(n,R) as its transforma- tion group, and is called the proper flag mani- fold. Since the unitary group U(n) of degree n acts transitively on it, F(k"..., k r ) is also re- garded as a homogeneous space admitting U(n) as its transformation group. In this case, putting F(k l , ...,k r )= U(n)/H, H is isomorphic to the direct product U(k l - k 2 ) X U(k 2 - k 3 ) x ... x U(k r ). In particular, when r=n-l, k i = n - i, the homogeneous space is the quo- tient space of the compact Lie group U(n) by a maximal Horus T. In general, the quotient space G/T of a compact connected Lie group G by a maximal torus of G is called a flag manifold. If G acts effectively on G/T, G is a tsemisimple compact Lie group. The complex Lie group G C is a Lie transformation group of tbiregular transformations which acts transi- tively on the flag manifold G/T, a simply con- nected Kiihler homogeneous space. Here G C is a complex Lie group having G as a maximal compact subgroup. If B is a maximal tsolvable Lie subgroup (tBorel subgroup) of G C , G/T is represented as GC/B. References [1J A. Borel and R. Remmert, Dber kompakte homogene Kilhlersche Mannigfaltigkeiten, Math. Ann., 145 (1961-1962), 429-439. [2J Y. Matsushima, Espaces homogenes de Stein des groupes de Lie complexes, Nagoya Math. J., 16 (1960), 205218, [3J K. Nomizu, Invariant affine connections on homogeneous spaces, Amer. J. Math., 76 (1954),33-65. 
749 [4] H. C. Wang, Closed manifolds with homo- geneous complex structure, Amer. J. Math., 76 (1954), 1-32. [5J N. R. Wallach, Harmonic analysis on homogeneous spaces, Dekker, 1973. [6] S, Helgason, Differential geometry, Lie groups, and symmetric spaces, Academic Press, 1978. 200 (111.24) Homological Algebra A. General Remarks Homological algebra is a new branch of math- ematics that developed rapidly after World War II. The introduction of the theory was motivated by the observation that some alge- braic ideas and mechanisms that arose in the development of talgebraic topology, in particular, thomology theory, can provide powerful tools for treating from a unified viewpoint various problems in algebra that previously were treated differently. One of its characteristic features lies in emphasizing, from the standpoint of categories and functors ( 52 Categories and Functors), the functional structure of the objects to be investigated rather than their inner structure. Thus the theory of derived functors constitutes the main theme of homological algebra. This new theory turned out to have wide applications in other areas of mathematics, and the philos- ophy embodied in the theory has been influen- tial in the general progress of mathematics. For general references  [2,5, 6J; for tsheaf cohomology  [3,4,8]. B. Graded Modules and Graded Objects Let A be a tring with unity element and X be a tunitary A-module. Ifwe are given a sequence of A-submodules X n (nEZ) such that X = LneZ X n (tdirect sum), we call X a graded A-module and X n the component of degree n of X. Each element x of a graded A-module X has a unique representation x = LnEZ X n (xn E X n ); we call X n the component of degree n of x. An A-submodule Y of a graded A-module X is called homogeneous if XE Y implies XnE Y (n E Z). In this case, Y = Ln ¥" and the quotient module XIY = LnXnl¥" are graded A-modules, where ¥" = yn Xn. Let X = Ln X n and Y = Ln ¥" be graded A-modules and f:X --> Ybe an A- homomorphism. If there is a fixed integer p such that f(Xn) C ¥,,+p for any nEZ, f is called an A-homomorphism of degree p. In this case, 200C Homological Algebra Ker f = Ln Ker fn and Imf = Ln Imfn-p are homogeneous A-submodules of X and Y, respectively, where fn: X n --> ¥,,+ p is the restric- tion of f on Xn. Sometimes, by a graded module we mean only a sequence {X n } of A-modules X n , with- out considering the direct sum LnX.. Simi- larly, we have the notion of a graded object {en} in any category <:(j. C. Chain Complexes and Homology Modules By a chain complex (X, iJ) over A we mean a graded A-module X = LnXn together with an A-homomorphism a:X-->x of degree -1 such that a 0 0 = O. Hence a chain complex over A is completely determined by a sequence X (\+1 C n ...--> n+1---->X n -->X n - I -->... of A-modules and A-homomorphisms such that anoa n + l =0 for alI n, We call a the bound- ary operator, For a chain complex (X, a), we write Kera= Z(X), Kera n = Zn(X), Ima= B(X), Ima n + 1 = Bn(X). Then Z(X)=LnZn(X), B(X) = Ln Bn(X) are homogeneous submodules of X, calIed the module of cycles and the module of boundaries, respectively. B(X) is a homog- eneous submodule of Z(X), and the quotient modules Z(X)/B(X), Zn(X)/Bn(X) are denoted by H(X), Hn(X), respectively. We call H(X)= Ln Hn(X) the homology module of the chain complex (X, a). If (X, a), (Y, a') are chain complexes over A, an A-homomorphism f:X --> Y of degree o satisfying a' 0 f = f 0 a (i.e" afn = fn-1 an) is caIled a chain mapping of X to Y. For a chain mapping f, we have f(Zn(X))cZn(Y), f(Bn(X))c Bn(Y), and hence f induces an A- homomorphism f*: H(X)-->H(Y) of degree 0, which is caIled the homological mapping in- duced by f We have (1 x)* = 1 H(X), and (g 0 f)* = 9 * 0 f* for chain mappings f: X --> Yand g: Y -->Z. Let f, g: X --> Y be two chain mappings. If there is an A-homomorphism D:X --> Y of degree + 1 such that f -g =D oo+a' 0 D, we say that f is chain homotopic to 9 and write f  g; D is called a chain homotopy of f to g, If f is chain homotopic to g, we have f* = g*: H(X)-->H(Y). For chain complexes X and Y, if there are chain mappings f: X --> Y and g: Y -->X such that fog  If and 9 0 f lx, we say that X is chain equivalent to y, In this case f* :H(X)-->H(Y) is an isomorphism and g*:H(Y)-->H(X) is its inverse. Let (X, a) be a chain complex over A and Y=L n ¥" be a homogeneous A-submodule of X such that aYe Y. Then Y and XIY are chain complexes over A with the boundary operators induced by a. Y is called a chain 
200D Homological Algebra subcomplex of X, and X jY is called the quo- tient chain complex of X by Y or the relative chain complex of X mod Y. For a chain com- plex X and its subcomplex Y we have an texact sequence 0--> Y  X J. X jY -->0, where i is the tcanonical injection and j the tcanonical surjection. Let (W, a'), (X, a), (Y, aN) be chain com- plexes over A, and f: W-->X, g:X --> Ybe chain mappings such that 0--> W!.. X .!.. Y -->0 is exact. Then an A-homomorphism a*: H(Y)--> H( W) of degree -1, calIed the connecting homomorphism, is defined by a*(y+ B(Y) = r 1 oaog-1(y)+ B(W) (yEZ(Y)), and we ha ve the exact sequence of homology: 0* f. q* ,.. -->Hn(W)-->Hn(X)-->Hn(Y) c. f. q* (3. -->H n - 1 (W)--> Hn-l (X)-->Hn-l(Y)-->.... For a tcommutative diagram 0--> W -->X --> Y -->0 L'" L L'" 0--> W'-->X'--> Y'-->O consisting of chain complexes and chain mappings in which each row is exact, we have a* 00/* = <p* 0 a*: H(Y)-->H(W'). For the tinductive limit limX A of chain com- --> plexes X A over A, we have H(IXA)=IH(XA)' A chain complex X is said to be positive if X n = 0 for all n < O. If X is a positive chain com- plex over A and M is an A-module, then we mean by an augmentation of X over M an A- homomorphism £:Xo-->M such that the com- position XlXo-">M is trivial:wa 1 =0. A positive chain complex X together with an augmentation F. of X over M is calIed an aug- mented chain complex over M. It is said to be acyclic if the sequence 0 11 (\ s ,.. -->X n --> X n - 1 -->... --> X 1--> Xo-->M-->O is exact, namely, if Hn(X) = 0 (n #0) and F. induces an A-isomorphism Ho(X)M. In this case X is also called a left resolution of M. Moreover, if each X n is a tprojective A- module, X is caIled a left projective resolution. For any A-module M, there exists a left pro- jective resolution of M. Let ex:M -->M' be an A-homomorphism of A- modules, and X, X' be augmented chain com- plexes over M, M' having augmentations F., F.', respectively. Then a chain mapping f: X --> X' satisfying £' 0 fo = ex 0 £ is called a chain map- ping over ex. If X, X' are left projective reso- lutions of M, M', respectively, then there exist chain mappings of X to X' over IJ., and any two such mappings are chain homotopic. In particular, a left projective resolution of an A- 750 module M is uniquely determined up to chain homotopy. D. Tor Given a right A-module M and a left A- module N, Z-modules Tor;;t (M, N) (n = 0, 1, 2, . .. ), called the torsion products (or Tor groups), are defined as foIlows: Let Y: .. --> y,, y"-l -->...:!. Yo N-->O be a projective resolution of N, and consider the chain complex (M@AY,1 @a):... (1 ) t@o" --> M @ A Y" ---->M @ A y"-l -->... 1@(')1 ---.M @ AYO-->O obtained by forming the ttensor product of M and y, Then we see that the homology module Hn(M @ A Y) is uniquely determined for any choice ofleft projective resolution of N. We define Tor;;t(M, N) = Hn(M @ A Y). In partic- ular, we have Tor(M, N)  M @ AN. Properties of Tor. (1) If M is a tl1at A-module, we have Tor;;t(M, N)=O (n = 1,2,...). (2) An A-homomorphism f: M1-->M l in- duces a homomorphism f* : Tor;;t(M 1 , N)--> Tor;;t(M l , N). We have (I M )* = I, and (go/)* = g* 0 f* for f: M1-->M l , 9 :Ml-->M3' (3) For an exact sequence 0-->M 1 !..M l .!.. M 3 -->0, we have the following exact sequence of Tor: ,'. T A J. T A ...--> orn(M1,N)--> or n (M2' N) Y* A C* A --> Torn (M3' N)--> Torn_dM I' N)-->... A v. -->Tord M 3, N)-->M 1 @ AN-->M l @ AN -->M 3 @ AN -->0, where 0* are the connecting homomorphisms. (4) For a commutative diagram 0--> M 1 --> M l -->M 3 -->0 L'" L L", 0-->M'1-->M'z-->M 3 -->0 of A-modules and A-homomorphisms with exact rows, we have a* 00/* = <p* 0 0*. (5) Tor;;t(LAM A , N)LA Tor;;t(M A , N) (6) Tor;;t(limM A , N)lim Tor;;t(M A , N). --> --> On the other hand, take a left projective resolution X of M and consider the chain complex X @ AN. Then we have Hn(X @ AN) Tor;;t(M, N) for n =0,1,.... Therefore properties similar to (1)-(6) hold with respect to the second variable N of Tor:(M, N). (7) If A 0 is a ring tanti-isomorphic to A, then Tor;;t(M, N)Tor: (N, M). In particular, 
751 if A is commutative, then Tor(M, N) is an A- module and we have Tor(M,N)Tor(N,M). (8) Let A be a tprincipal ideal ring. Then Tor(M,N)=O (n=2,3, ...) and Tor1(M, N) is also denoted by M * AN. For an exact se- quence 0-->M I -->M 2 -->M J -->0, we have the exact sequence O-->M I * AN -->M 2 * AN--> MJ * AN -->M I @ AN -->M 2 @AN-->M J @ A N --> O. In particular, Z * zN = 0 and (ZjnZ) * zN  nN ( = { X E N I nx = O} ), Universal Coefficient Theorem for Homology. If (X, a) is a chain complex over A and N is a left A-module, then (X@ AN,a@ 1) is a chain complex. If A is a principal ideal ring and each X n is a ttorsion-free A-module, then we have a formula Hn(X @ AN)  Hn(X) @ AN + Hn-I (X) * AN, called the universal coefficient theorem. E. Double Chain Complexes By a double chain complex (Xp,q, a', a") over A we mean a family of left A-modules Xp,q (p,qEZ) together with A-homomorphisms a,q:xp,q-->Xp-1.q and O;,q:Xp,q-->Xp,q_1 such that C-I,qOa,q=a;,q-l Oa;,q=a,q_1 Oa;,q +O;_I,qOa,q=o. We define the associ- ated chain complex (X n , 0) by setting X n == Lp+qnXp,q, an==Lp+qna,q+o;,q' We call a the total boundary operator, and a', a" the partial boundary operators, Given a chain complex X consisting of right A-modules and a chain complex Y consisting of left A-modules, a double chain complex (Zp,q, a', a") is defined by setting Zp,q= X p @ A y", a,q= a p @ 1, a;,q =( -ljPl @ a q , where a p , Oq are the boundary operators of X. Y, respectively. It is called the product double chain complex of X and Y, and the homology module of its associated chain complex is denoted by H(X@ A Y). With respect to this homology module, the following facts hold. If X is a left projective resolution of a right A- module M and Y is that of a left A-module N, then Hn(X @ AY)=Tor(M,N). If A is a prin- cipal ideal ring and each X n is a torsion-free A- module, then we have the formula Hn(X@AY) L Hp(X)@AHq(Y) p+qn + L Hp(X) * AHq(y), p+q=n-l the K iinneth theorem F. Cochain Complexes By a cochain complex (X, d) over A we mean a graded A-module X together with an A- 200G Homological Algebra homomorphism d: X --> X of degree + 1 such that do d = 0; d is called the coboundary oper- ator or differential. For a cochain complex (X, d), we denote by xn the component of degree n of X, and by dn, x n -->xn+1 the restric- tion of don X n . Then a chain complex (Y,a) is defined by Y,,==x-n and an: Y,,--> y"-I is equal to d- n :x-n-->x-n+1. For a cochain complex (X,d), we write Kerdn=Zn(X), Kerd= Z(X) (Z(X)=L zn(x)), 1m dn-I = Bn(x), 1m d = B(X) (B(X) = L Bn(x), and zn(X)jBn(x) = H"(X)), Z(X)jB(X) = H(X) (H(X)=LH"(X)). These modules Z(X) (zn(x», B(X) (Bn(x», and H(X) (H"(X» are called the module of cocycles, the module of coboundaries, and the cohomology module of X, respectively. If we consider the associated chain complex (Y, a) of (X, d), then H -n(Y) corresponds to Hn(X), In this way, results on chain complexes give results on cochain com- plexes. Thus the concepts of cochain mapping, cochain homotopy, cochain equivalence, cochain subcomplex, and relative cochain complex can be defined as in the case of chain complexes in B, and we have corresponding results. In par- ticular, given an exact sequence 0--> W!.X  Y -->0 of cochain complexes and co chain map- pings, the connecting homomorphism d*: H n ( Y) -->H n +1 (W) is defined, and the exact sequence of cohomology ...H"(W)H"(X)f!.*.H"(Y) '!.*.H"+1(W):!iH"+t (X)H"+t (Y)'!.*.... exists. For a commutative diagram 0--> W --> X --> Y -->0 L'" L L'" 0--> W' --> X' --> Y'-->O of cochain complexes and cochain mappings with exact rows, we have d* 0 IjJ * = <p* 0 d*. A cochain complex X is said to be positive if Xn=O for n<O. If X is a positive co chain complex over A and M is an A-module, we mean by an augmentation of X over M an A- homomorphism £:M-->X o such that the com- position M ..':.XO:XI is trivial. If the sequence O-->M xo!.. .-->xnxn+t-->.., is exact, X is called a right resolution of M, Moreover, if each xn is an tinjective A- module, X is called a right injective resolution of M. For any A-module M, there exists a right injective resolution of M, and any two such resolutions are cochain homotopic, G. Ext Given left A-modules M and N, Z-modules Ext(M, N) (n =0,1,2,...), called the Ext 
200H Homological Algebra groups, are defined as follows: Let X: .. . --> X n --> X n - 1 -->.., --> X 0--> M -->0 be a projective reso- lution of M, and consider the cochain complex HomA(X, N): O-->HomA(X o , N)-->...-->HomA(X n _ l , N) -->HomA(X n , N)-->... obtained by forming the tmodule of A- homomorphisms, Then we can show that the cohomology module Hn(HomA(X,N» is uniquely determined for any choice of projec- tive resolution of M, We define Ext(M, N) = Hn(HomA(X, N)). This can also be defined as the cohomology module Hn(HomA(M, Y)) of the cochain complex HomA(M, Y):O--> HomA(M, yO)-->...-->HomA(M, yn-I)--> HomA(M, yn)-->..., where Y: O-->N --> yO-->...--> yn-I--> yn-->". is a right injective resolu- tion of N. Furthermore, for a left projectIVe resolution X of M and a right injective re- solution Y of N, we see that Ext(M, N) is isomorphic to the cohomology module Hn(HomA(X, Y)) of the associated cochain complex of the double cochain complex HomA(X, Y)=(HomA(X p , yq), d', dO), where d,q:HomA(Xp, yq)-->HomA(X p +!, yq) and d;,q:HomA(X p , yq)-->HomA(X p , yq+l) are given by d,q(u)=uoap+!, d;,q(u)= (-I)P+q+ld q o U (uEHomA(X p , yq)) by using the boundary operator a of X and the co- boundary operator d of Y. Properties of Ext. (1) We have Ext(M, N) HomA(M,N). (2) If M is a projective A-module or N is an injective A-module, then Ext(M, N) = 0 (n = 1, 2, .., ), (3) An A-homomorphism f:M 1 -->M 2 (resp. f:N I -->N 2 ) induces a homomor- phism f*: Ext(M2' N)-->Ext(M" N) (resp, f* :Ext(M, Nd-->Ext(M, N 2 )). We have 1tr= 1 and (g of)* = f* 0 g* for f: M 1 -->M 2 , g:M 2 -->M 3 (resp,I N .=1 and (gof)*=g*of* for f:N I -->N 2 , g:N 2 -->N 3 ). (4) For an exact sequence 0-->M I -->M 2 -->M 3 -->0 (resp. 0--> N 1--> N 2 -->N 3 -->0), we have the exact sequence of Ext: 0-->Hom A (M 3 , Nj-->Hom A (M 2 , N) -->HomA(M" N)-->Ext(M3' N) -->Ext(M2' N)-->... (resp. O-->HomA(M, Nd-->Hom(M, N 2) -->Hom(M, N3)-->Ext(M, Nd --> Ext(M, N 2)-->'" ) (5) Ext(M., TI N p ) = TI Ext(M"Np). p "p 752 (6) If A is a principal ideal ring, then Ext(M,N)=O (n=2,3, ...), and Ext(M,N) is also denoted by ExtA(M, N). In particu- lar, Extz(Z, N) = 0, Extz(ZlnZ, N)  N InN, Extz(M,Q/Z) =0, Extz(M, ZlnZ) = MlnM, where M = Homz(M, Q/Z). Universal Coefficient Theorem for Coho- mology.1f X is a chain complex over a prin- cipal ideal ring A such that each X n is a free A- module, then for any A-module N we have the formula W(HomA(X,N)) HomA(Hn(X), N)+ ExtA(H n _ 1 (X), N), the universal coefficient theorem. This is gen- eralized as folIows: Let X be a chain complex and Ya cochain complex, both over a prin- cipal ideal ring A. Assume that each X n is a free A-module or that each Y" is an injective A-module. Then we have the formula W(HomA(X, Y))  L HomA(Hp(X), Hq(y)) p+q=n + L ExtA(Hp(X), Hq(Y)) p+q=n-l ( 201 Homology Theory). H. Complexes in Abelian Categories We mainly consider general t Abelian cate- gories . Consideration may, however, be restricted to the tcategory (Ab) of Abelian groups (whose tobjects are Abelian groups and whose tmorphisms are homomorphisms) or the tcategory R.4t of R-modules. A (cochain) complex C in an Abelian cate- gory  is a graded object {C n } in  with differentials d n : cn--> cn+l subject to the con- dition that d n + 1 odn=O (nEZ). The nth coho- mology Hn(C) of C is defined by the texact sequence 0--> B n ( C)-->zn( C)-->H n ( C)-->O, where Bn(C) and zn(c) are objects representing Imd n - I and Kerd n , respectively. The complex C is calIed positive (negative) if C" = 0 for n < 0 (n>O). We sometimes interchange positive superscripts and negative subscripts and write C- n instead of cn. Then the differentials become d n : C n --> C n - I , and C is then calIed a chain complex. The quotient of Kerd n = Zn by Imd n +! =Bn is calIed the nth homology Hn(C). Negative complexes are usualIy described in this manner. When cn, zn, B n , and H n are sets, as in the category R.4t of R-modules, their elements are calIed cochains, cocycIes, coboundaries, and cohomology classes, respec- tively, Similarly, in the group C n of chains, residue classes of cycles (EZ n ) modulo bound- aries (EBn) are calIed homology classes (EH.). 
753 A morphism (or chain transformation) j : C--> C is a tnatural transformation of the com- plexes considered as tfunctors Z-->; i e., f is a family of morphisms f": C n -->c n (n E Z) satisfying .{"+1 0 d n = d n 0 fn. It induces a morphism of cohomology Hn(C)-->Hn(C). A subcomplex of C is an equivalence class oftmonomorphisms D-->C, usually denoted by any representative D of the class. A (chain) homotopy between two chain transforma- tions f, g: C -->C is a family of morphisms hn:cn-->c n - I (nEZ) satisfying {"_gn= h n +1 0 d n + d n - I 0 h n . If there exists a homo- topy between rand g, then f and 9 induce the same morphism of cohomology. A morphism f: C -->C is called a (chain) equivalence ifthere exists a morphism f': C --> C such that f' 0 r and r 0 f' are homotopic to the identities of C and C, respectively, In this case we have H n ( C)  H n ( C). An exact sequence of com- plexes 0--> C --> C --> cn -->0 gives rise to the connecting morphisms H n ( C) --> H n +1 (C) (n E Z), and the resulting sequence... --> H"-I (C) --> H"(C)-->H"( C) --> H"(C")--> Hn+I(C)-->... is exact (the exact cohomology sequence), and similarly for homology instead of cohomology. An object A E defines a complex (also denoted by A) such that AO=A, dO = O. A positive complex C together with a morphism e:A-->C is called a complex over A, and e is the augmentation. A complex C over A is acyclic if 0 --> A  CO --> C 1-->... is exact. An acyclic positive complex over A is called a right resolution of A. Let {C, e}, {C, e'} be complexes over A and A', respectively, and ex a morphism A --> A'. Then a morphism f: C --> C satisfying foe = e' 0 ex is called a morphism over ex. For a negative complex C, we define similarly augmentations e: C -->A, acyclicity, left resolutions, etc. A bicomplex (or double complex) C in  consists of objects cp.q (p, q E Z) and two dif- ferentials d(:Cp.q-->CP+l.q, dn:Cp,q-->Cp.q+1 subject to df = dA = 0 and d(d n == dnd( (some- times replaced by anticommutativity, dld n + dnd l = 0). Morphisms of bicomplexes are de- fined as for single complexes. A blcomplex C becomes a (single) complex if we put c n = Lp+qn cp,q (when the sum exists) and define the differential d to be d(+( -1jPd n on Cp,q. Then d is called the total differential and d(, d n the partial differentials. On the other hand, q = {CP,q(pE Z), d(} constitutes a complex for each q, whose cohomology HP(q) is denoted by H{(Cq). Then d n induces morphisms H{(Cq) -->H{(Cq+I), so that we obtain a complex H{(C). The cohomology of H{(C) is denoted by H(H{(C)). We define H{(Hfi(C)) similarly. The cohomology of C with respect to the total differential is denoted simply by Hn(C). Similar constructions are applied to double chain 200 I Homological Algebra complexes {Cp,q} and further to multiple complexes, as we shall show in the case of bicomplexes. Let T be a tbifunctor (6'1 x 2 -->' and C i be complexes in i (i= 1,2). Then T(C I , C 2 ) is a bicomplex in '. For instance, Hom(C, C) is a positive (bipositive) complex if C(C) is a posi- tive (negative) complex in . If C, C are com- plexes in j(R' R'.4t, respectively, the ttensor product C @ RC' is a complex in (Ab) (the product complex). There is a canonical mor- phism Hp(C) @ Hq(C)-->Hp+q(C @ C). If C n and Bn are tflat for all nEZ, we have the fol- lowing exact sequence (Kiinneth's formula): 0--> L Hp(C) @ Hq(C)-->Hn(C @ C) p+q=n --> L TordHp(C), Hq(C))-->O p+q=n-l (for the definition of Tor  Section D). For C = A E R'.4t, Kiinneth's formula reduces to the exact sequence O-->Hn(C) @ A-->Hn(C @ A)--> Tor dHn-dc), A)-->O (universal coefficient theorem). The corresponding exact sequence for cohomology is O-->Extl(Hn_1 (C), A)-->H"(C, A) --> Hom(Hn( C), A)-->O ( Section G; 201 Homology Theory). I. Satellites and Derived Functors Let  and ' be Abelian categories. All func- tors in this section are tadditive. A tcovariant functor T:  -->' is called exact if T maps every exact sequence in  to an exact sequence in '. T is called half-exact, left exact, or right exact if for every short exact sequence 0 --> A -->B-->C-->O, the sequence T(A)-->T(B)-->T(C), 0--> T(A)--> T(B)--> T(C), or T(A)--> T(B)--> T(C) -->0, respectively, is exact. Similar definitions apply for tcontravariant functors. The functor Hom: x  -->(Ab) (which defines the category ) is left exact in both factors. An object Pis projective if hp(') = Hom(P,') is exact, while Q is injective if hQ(.) = Hom(', Q) is exact. If every object A admits an tepimorphism from a projective object P-->A (resp. tmonomor- phism into an injective object A -->Q), (6' is said to have enough projectives (inJec- tives). An object G is called a generator (cogen- erator) if the natural mapping Hom(A, B)--> Hom(hG(A), hG(B») (Hom(hG(B),hG(A))) is one-to-one. An Abelian category  is called a Gro- thendieck category if (1)  has a generator, (2) tdirect sums always exist, and (3) the identity (U A;)n B = U(Ain B) holds for any object A, its tsubobJect B, and a ttotally ordered family {A,} of subobjects. A Grothendieck cate- 
200J Homological Algebra gory has enough injectives (R. Baer, 1940, for (Ab); A, Grothendieck, 1957, for general (1&'). A monomorphism into an injective ob- Ject f: A -->Q is called an injective envelope if ImfnImg #0 for any nonzero monomor- phism g:B-->Q. Every object A in a Grothen- dieck category admits an injective envelope, which is unique up to isomorphism (B. Eck- mann and A. Schopf, 1953, for R.4t; B. Mit- chell, 1960, for general r&). We say that a covariant a-functor r& -->r&' is given if we have a sequence of covariant func- tors T = {Ti: (I&' -->r&'} and the connecting mor- phisms a: Ti(A")--> Ti+I(A') for an arbitrary short exact sequence O-->A'-->A-->A"-->O satis- fying the following conditions: (i) a 0 Ti(f") = Ti+1 (f') 0 a for a morphism f of short exact sequences; and (ii) the sequence...--> Ti-l (A")!. Ti(A')--> Ti(A)--> Ti(A")!. Ti+1 (A')--> ...constitutes a complex. T={Ti} is called a covariant a*-functor if instead of a there are given a*: Ti(A")--> Ti-l (A') satisfying similar conditions (i*) and (ii*). By taking duals, we define the notion of contravariant a- and a*- functors. They are also called connected se- quences offunctors. A a-(a*-)functor defined for - 00 < i < + 00 is called a cohomological functor (homological functor) if the sequence in condition (ii) (resp. (ii*)) is always exact. A morphism of a-functors f: S --> T consists of natural transfonnations fi: Si --> T i that com- mute with the connecting morphisms. A a- functor S defined for a.s; i < b is called universal if for any a-functor T defined in the same in- terval and any natural transformation <p: sa--> T", there exists one and only one morphism f:S--> T such that f"==<p. Let F:r&-->r&' be a covariant functor and b any positive integer. A universal covariant a-functor S defined for O.s; i < b is caIled a right satellite of F if SO = F (S is then denoted by {Si F}). If such an S exists, then it is unique and satisfies Si+! (F) = Sl (Si F). If r& has enough injective objects, the right satellites always exist, and if F is left exact, then {Si F} is a cohomological functor. The universality of a*-functors is defined by reversing the arrows; the satellites {SiF} are then written as {S-i F} and called the left satellites. Let r& be an Abelian category with enough injectives. An injective resolution of an object A is a right resolution Q = {Qi} such that all Qi are injective. Every A admits an injective reso- lution, which is unique up to chain equivalence (H. Cartan, 1950), For a covariant functor F: r&-->r&', the functor A-->Hi(F(Q)), called the ith right derived functor R i F of F, is independent of Q. {R i F} is a cohomological functor. By the universality of satellites, there exists a mor- phism of a-functors {Si F} --> { R i F} which is an isomorphism if and only if F is left exact. The 754 left derived functors LiF of a contravariant functor F are defined similarly and are isomor- phic to the left satellites when F is right exact. If r& has enough projectives (instead of injec- tives), we define left (right) derived functors of covariant (contravariant) functors via projec- tive resolutions. For a multifunctor, we define partial derived functors as well as (total) de- rived functors of the functor viewed as a func- tor defined in the tproduct category. For instance, let T(A, B):r&l x (1&'2-->r&' be contra- variant in A and covariant in B. When r&2 has enough injectives, we obtain R T(A, B) = Hi(T(A, Q)) using an injective resolution Q of B. Suppose that T satisfies condition (i) A--> T(A, B) is exact for any injective B. Then for a fixed injective B, R T(A, B) is a cohomo- logical functor in A. When A has a projec- tive resolution P in r&" we obtain R\ T(A,B) = Hi(T(P, B)) as well as the equation for the total derived functor RiT(A, B)= Hi(T(P, Q)). We say that a functor T is right balanced if it satisfies (i) and also (ii) B--> T(A, B) is exact for any projective A. In this case, the three derived functors are isomorphic. The left balanced functors are defined similarly. When the right derived functors of the functor Hom (which defines the category) exist, they are denoted by Exti(A, B). J. Spectral Sequences In this section, we deal with cohomology in the category R.4t of R-modules. A similar theory for homology is obtained by modifying the theory in a natural way. Similar construc- tions are also possible for general Abelian categories [3, 8]. A filtration F of a module A is a family of submodules {P(A)lpEZ} such that P(A)=:> Fp+1 (A). We say that the filtration F is con- vergent from above (or exhaustive) if UpFP(A) = A, and F is bounded from below (or dis- crete) if P(A)=O for some p. The tgraded module G(A) = {GP(A) = P(A)/P+I(A) I pE Z} is said to be associated with A. A morphism of filtered modules f: A-->A' is a module homo- morphism such that f(P(A)) c P(A'). It in- duces a homomorphism of the graded modules G(A)-->G(A'). A filtration of a complex C= {C,d} consists of sub complexes P(C)= {P(C)} such that P(C)=:>P+!(C). We assume that the complex C satisfies UpFP(C) = C, and is bounded from below; i.e., for every n there exists some p such that FP(C)=O. In particular, if FO(C) = C, p+1 (CP) == 0, the com- plex C is called canonically bounded. Writing CM = GP(Cp+q), we obtain a tbigraded module {C M}, in which p, q, and p + q are called the 
755 filtration degree, the complementary degree, and the total degree, respectively. A spectral sequence {Er} with a graded module D = {D n } as its limit (denoted by E,q = pDn) consists of a family of doubly graded modules Er={E;,qlp,qEZ} (r:;:,2 or sometimes r:;:' 1) and differentiations d,:Ef'q--> Ef+r,q-r+1 (p, qE Z) of degree (r, 1 -r) satisfy- ing d; = 0 and satisfying the following two conditions: (i) H(E,) (with respect to d r ) is isomorphic to Er+1 (hence there exists a se- quence of graded submodules of E 2 :0= B 2 C B3 c... C Z3 cZ 2 = E 2 such that Zr/Br Er); and (ii) there are submodules Zoo and Boo such that UkBkC Bex, C Z", C nkZk> and Eoo = Zoo/Boo is isomorphic to the doubly graded module associated with a certain filtration F of D (i.e., E!;;q  GP(Dp+q)). We assume that Zoo = nk Zk and Boo = Uk Bk (weak convergence). Suppose that F is convergent from above and bounded from below and that ZdEg,q) is stationary for every p, q. Then {Er} is called regular. {Er} is bounded from below iffor every n there exists a Po such that E,n-p = 0 for P<Po. In particular, if E,q=O (p<O,q<O), then {Er} is called the first quadrant (or coho- mology spectral sequence). In the latter case, the edge homomorphisms E' 0 --> E  0, E?;:--> Eg,q are defined through base terms Ef'° and fiber terms EP'", respectively. A morphism of spectral sequences j : {E" D} --> {E, D'} con- sists of {,: Er--> E of degree (0,0) and f: D --> D' of degree 0 which preserve the mechanism of spectral sequences. When the spectral se- quences are regular, a morphism f'is an iso- morphism if one of the J.. is an isomorphism. Addition is naturally introduced in the set of morphisms so that spectral sequences form an additive category. An additive functor from an Abelian category «j to this category is called a spectral functor. A filtered complex {C, F} gives rise to a spectral sequence E,q = G(H(C)) if we put Zf= {aEP(C)1 daEp+r(C)}, Bf=dZf-r, Ef=Zf/(Z;l + Bf-I)' Er=LpEf. A double complex C == {Cp,q, db d n } admits two natural filtrations F I : F{(C) = Ls:"pLq c,q and F l1 :Fft(C)=L,:"qL p CP,'. By the pro- cedure above, these filtrations give rise to spectral sequences H{(HI1(C))= pHn(C) and H l1 (Hf(C))= qHn(C), respectively. Compari- son of these sequences yields many useful results. Let T be an additive covariant func- tOr from an Abelian category «j to R,41, C be a complex in «j, and Q = {Qp,q} be an injec- tive resolution of C. The double complex Q gives rise to spectral sequences HP(RqT(C)) = H(T(Q)) and RP T(W(C)) = H (T(Q)). The limit H(T(Q)) IS independent of Q and is called the hypercohomology of T with respect to C [2,8]. We can similarly define hyper- cohomology of multifunctors. The theory of 200 K Homological Algebra spectral sequences was initiated by J. Leray (1946), and suitable algebraic formulations were given by J. L. Koszul (1950). K. Categories of Modules The category R,41 (resp. .4t R ) ofleft (right) R- modules over a tunitary ring R is not only an Abelian category but also a Grothendieck category ( 277 Modules). The tfull embed- ding theorem permits us to deduce many pro- positions about general Abelian categories from the consideration of R.4t. An object P of R.4t is projective if and only if it is isomorphic to a direct summand of a tfree module. Any projective module is the direct sum of count- ably generated projective modules (I. Kap- lansky, 1958). It follows that any projective module over a tlocal ring is free. Finitely generated projective modules PI and P 2 are said to be equivalent if there exist finite- ly generated free modules Fl' F 2 such that PI EB FI  P 2 EB F 2 . The equivalence classes then form an Abelian group with respect to the direct sum construction called the projec- tive class group of a ring R. The category of complex tvector bundles over a compact space X is equivalent to the category of projective modules over C(X), the ring of complex- valued continuous functions on X, and simi- larly for other types of spaces and bundles. Many investigations have been made involv- ing the problem of whether every projective module over a polynomial ring is free (J.-P. Serre, 1955). This problem was settled affirma- tively by D. Quillen [13J and independently by A. Suslin. It has been observed that "big" projective modules are often free: for example, nonfinitely generated projective modules over an tindecomposable weakly Noetherian ring are free (Y. Hinohara, 1963). The nth right derived functor of HomR(A, B) is denoted by Ex(A, B) ( Section G). This is a bifunctor R.4t x w41-->(Ab), contravariant in A and covariant in B. Ext is isomorphic to and identified with HomR' An exact sequence O-->A'-->A-->A"-->O gives rise to the con- necting homomorphisms n: Ext(A', B)--> Ext+1(A", B), and the following sequence is exact: ... --> Ext-I (A', B)Ext(A", B)--> Ext(A, B)-->Ext(A', B) Ext+1 (A", B)-->. (the exact sequence of Ext). Similarly, an exact seq uence 0 --> B' --> B --> B" --> 0 gives rise to n: Ext(A, B")--> Ext+1 (A, B') and to an exact sequence of Ext. An extension of A by B (or of B by A) is an exact sequence (E): 0--> B-->X--> A -->0. The set of equivalence classes of exten- sions of A by B is in one-to-one correspon- dence with Ext1(A, B) by assigning to (E) its characteristic class XE =  o(I)E Ext1(A, B), 
200K Homological Algebra where 1 denotes the identity of HomR(B, B), In this correspondence, the sum of two extensions is obtained by a construction called Baer's sum of extensions. Similarly, Ext(A, B) is interpreted as the set of the equivalence c1asse of n-fold extensions 0--> B--> X n - I -->... -->X 0--> A -->0 (exact). This point of view permits us to establish a theory of Ext, etc., in more general (additive) categories lacking enough projec- tives or injectives (N. Y oneda, 1954, 1960). The tensor product A @ RB is a right exact covariant bifunctor .4t R @ R.4t -->(Ab). If the functor t p( . ) = . @ P is exact, P is called a tflat module. A projective module is flat. In general, a flat module is the tinductive limit of finitely generated free modules (M. Lazard, 1964). A flat module P is called tfaithfully flat if P #mP for every maximal ideal m of R. The functors @ and Hom are related by tadjointness ( 52 Categories and Functors). From this view- point @ can be introduced in more general categories. Left-derived functors of A @ RB are denoted by Tor:(A, B) and are called nth tor- sion products of A and B. Torf(A, B) is often denoted by A * RB. The functor @R is left balanced, hence Tor is calculated by using projective resolutions of A, B, or both A and B. We have Tor== @R' An exact sequence 0--> A'-->A-->A"-->O gives rise to n:Tor+1(A",B)--> Tor:(A', B) and the infinite exact sequence of Tor, and similarly for the second variables. From the various relations between Hom and @ follow the corresponding relations between their derived functors. When A and r are algebras over K and Q = A @ r, we can define the external product (T -product), which is a mapping T :Tor(A, B) @ Tor(A', B') --> TO+q(A @ A', B @ B'). In particular, if A and rare K-projective and To(A,A')=O (n>O), then we can define the wedge product (V-product) V:Ext(A,B)@ ExtHA', B')..... Exth+q(A @ A', B @ B'). The latter is described in terms of the composition of module extensions. When K = A = r = Q, the T -product reduces to the internal product, called the m-product. If A is a tHopf alge- bra over K, the tcomultiplication A --> A @ A induces Ext A0A --> Ext A . This, combined with the V-product, yields the cup product (,-,- product),-,: Ext(A, B)@ Ext;i.(A', B')--> Ext+q(A @ A', B @ B'), We define similarly .1 -product, i\ -product, UJ-product, and '-"- product (cap product) [2]. Let A, r, and E be algebras over K, with A K-projective; let AE.4t A0r , BE A .4t E , CE.4t r0E , and assume Tor(A, B)=O (n>O). The natural isomor- phism HomA0dA,HomE(B, C»)Homr0E (A @ AB, C) then yields a spectral sequence 756 Ext0dA, ExtHB, C»=pExt r0E (A @ A B , C) by the double complex argument in Section D. The homological dimension hdimRA, dh R A, or projective dimension proj dim R A of A E R.4t is the supremum ( .s; ex:,) of n such that Ext(A, B)#O for some B. The relation hdimRA.s;O means that A is projective. The injective dimension inj dim R B of BE R.4t is defined similarly by means of the functor Ext(., B), and the weak dimension wdim R C of C E R.4t by the functor Tor(', C). The common value sup {proj dim R A I A E R.4t} = sup{inj dim R B I BE R.4t} is called the left global dimension 1 gl dim R of R. It is identical with the supremum of homological dimensions oftcyclic modules (M. Auslander, 1955). The right global dimension r gl dim R is defined similarly. The common value sup{wdimRA I A E .4t R } = sup {w dim R q C E R.4t} is called the weak global dimension w gl dim R of R. We have w gl dim R.s; 1 gl dim R, r gl dim R. The equality may fail to hold (Kaplansky, 1958). If R is tNoetherian, the three global dimensions coincide (Auslander, 1955) and are called sim ply the global dimension of R: gl dim R. The condition 1 gl dim R = 0 (or r gl dim R = 0) holds if and only if R is an tArtinian semi- simple ring, while w gl dim R == 0 if and only if R is a tregular ring in the sense of J. von Neumann (M. Harada, 1956). A ring R is called left (right) hereditary if 1 gl dim R.s; 1 (r gl dim R .s; 1), and left (right) semihereditary if every finitely generated left (right) ideal is projective. A left and right (semi)hereditary ring is called a (semi)hereditary ring. Since projectivity and tinvertibility of an ideal of a (commutative) integral domain R are equiva- lent, R is hereditary if and only if R is a tDede- kind ring. In this case, the projective class group of R reduces to the tideal class group. An integral domain R is semihereditary if and only if w gl dim R.s; 1 (A. Hattori, 1957), and in that case R is called a Priifer ring. A tmaximal order over a Dedekind ring is hereditary. A commutative semihereditary ring R is charac- terized by the property that flatness of R- modules is equivalent to torsion-freeness (S. Endo, 1961). A Noetherian ring R is left self- injective if and only if R is tquasi-Frobenius (M. Ikeda, 1952), and the global dimension of a quasi-Frobenius ring is 0 or 00 (S. Eilenberg and T. Nakayama, 1955). A polynomial ring R =K[X I , ...,XnJ over a commutative ring K satisfies gl dim R = gl dim K + n. When K is a field, this is a reformulation of Hilbert's theory of syzygy sequences ( 369 Rings of Poly- nomials). In this sense, the study of the global dimension of rings and categories is sometimes called syzygy theory (Eilenberg, 1956). The homological algebra of commutative Noe- therian rings has been studied extensively and 
757 is useful in algebraic geometry. Since gl dim R =suPmgl dimRm (R'll is the tring of quotients relative to m), with m running over the max- imal ideals of R, the problem of determining gl dim R reduces to the case of t10cal rings. A finitely generated flat module over a local ring R is free. If K denotes the residue field Rim, where m is the maximal ideal of the local ring R, TorR(K, K) has the structure of a Hopf algebra (E. F. Assmus, Jr., 1959). De- tailed results concerning the Betti numbers dim Torf(K, K) of R have been obtained (J. Tate, 1957, and others). In particular, R is tregular if and only if gl dim R < 00 (Serre, 1955). A local ring R is called a Gorenstein ring if the injective dimension of R-module R is finite. This is a notion intermediate between regular rings and tMacaulay rings ( 284 Noetherian Rings). Consideration of a ring R in relation to a subring S leads to relative homological algebra, Foundations for this theory were established by Hochschild (1956). An exact sequence of R-modules that tsplits as a sequence of S- modules is caIled an (R, S)-exact sequence. An R-module Pis caIled an (R, S)-projective module if HomR(P") maps any (R, S)-exact sequence to an exact sequence. (R, S)-injective modules are defined similarly. Based on these notions, Ext(R,s) and Tor(R,S) are defined as the relative derived functors of HomR and @ R, respectively. We also have a relative theory from a different viewpoint (S. Takasu, 1957). Relative theory is extended to general cate- gories from various viewpoints [6, 14J ( Section Q). L. Cobomology Theory for Associative Algebras Let A be an talgebra over a commutative ring K and A a ttwo-sided A-module. Let c n be the module of all n-linear mappings of A into A called n-cochains (Co = A). De- fine the co boundary operator t5 n :c n -->c n + 1 by Wf)(A 1 , ..., An+tJ = },d().2, ..., }.n+d + 'L.71 (-I)'f(A1'"'' A;..1.i+1'"'' An+d+ (_I)"+lf().I' ...,An)An+I' We thus obtain a complex whose coho- mology is denoted by Hn(A, A) and is called the nth Hochschild's cohomology group of A relative to the coefficient module A (Hochs- child, 1945). A cochain f is caIled normalized if f(}.I' ..., An)=O whenever one of the Ai is 1. We obtain the same cohomology group Hn(A, A) from the subcomplex of normalized cochains. {Hn(A,')} is a cohomological functor from the category N A of two-sided A-modules to the category K' Using the enveloping algebra A' =A@ KAo, where 11. 0 is an anti-isomorphic 200M Homological Algebra copy of A, A'A can be identified with Ae. (and AfAe). If A is K-projective, {H"(A,')} is isomorphic to {Ext::"e(A,')}' (In [2J, H"(A, A) is defined as Ext::"e(A, A) in general.) We have HO(A,A)={aEAIAa=a)., VAEA}. We call 1- cocycles derivations (or crossed homomor- phisms) of A in A and I-co boundaries inner derivations. Thus HI (A, A) is the derivation class group and is related to the tramification. When K is a field, HI(A,')=O if and only if A is a tseparable algebra. In general, an alge- bra A over a commutative ring K is caIled a separable algebra if A is A'-projective, i.e., if Extle(A,') = 0 (Auslander and O. Goldman, 1960). This is a generalization of the notion of tmaximaIly central algebras (Nakayama and G. Azumaya, 1948). We have a one-to-one correspondence of H 2 (A, A) to the family of algebra extensions of A with kernel A (i.e., K- algebras r containing A as a two-sided ideal such that riA = A) satisfying A 2 = O. Any extension of an algebra A over a field K such that H2(A,) =0 splits over a nilpotent kernel (J. H. C. Whitehead and G. Hochschild). This holds in particular for a separable algebra, and we obtain the tWedderburn-Mal'tsev theorem. There are some interpretations of H 3 (A. A) in terms of extensions. The supremum ( .s; 00) of n such that Hn(A, A)#O for some A is called the cohomo- logical dimension of A and written dim A. For a finite-dimensional algebra A over a field K, dim A < 00 if and only if AI N is separable and gl dimA< 00, where N is the tradical of A (N. Ikeda, H. Nagao, and Nakayama, 1954). The homology groups Hn(A, A) of A relative to a coefficient module A are defined similarly. If A is K-proJective, {Hn(A,')} is isomorphic to {ToreLA)}. M. Cohomology of Groups The pair consisting of an algebra A over K and an algebra homomorphism [;: 11.--> K is called a supplemented algebra [2J (or aug- mented algebra [6J), of which [; is the augmen- tation. The tgroup algebra Z[GJ of a group G over the ring of rational integers is a supple- mented algebra, in which the augmentation is defined by £(x) = 1 (x E G). The category ofIeft G-modules is identified with the category of left Z[G]-modules. For a finite group G, a finitely generated projective G-module is not necessarily free (D. S. Rim, 1959) and is iso- morphic to the direct sum of a free module and a left ideal of Z[G]. It follows that the projective class group of Z[GJ is a finite group (R. G, Swan, 1960). The cohomology groups and homology groups of G relative to A E G' (Eilenberg and S. MacLane, 1943) are defined 
200M Homological Algebra by W(G, A) = Extz[c](Z, A) and Hn(G, A) = Tor[C](Z, A), respectively. Their concrete description is given usua1\y via the Z[GJ- standard resolution of Z. (1) Homogeneous formulation. The group of homogeneous n-chains is the free Abelian group with basis G x .., x G (n + 1 times), on which G operates by x(xo, ... , x n ) = (xxo, ,.. , xX n ), and the boundary operator is defined by d(x o , "', x n ) = L?o( _1)i (x o , ..., Xi, ..., x n ). (2) Nonhomogeneous formulation. The group of nonhomogeneous n-chains is the Z [GJ-free module with basis G x ..' x G (n times), and the boundary operator is defined by d(x I' "', xn)=x I (x 2 ,..., x n )+ L?;;;f( _1)i (XI,''', XiXi+l, ...,x n )+(-I)" (XI' ""xn-d. A non- homogeneous 2-cocycle is sometimes ca1\ed a factor set. HO(G, A) is the submodule A c of A consisting of the G-invariant elements, while Ho(G, A) is the largest residue class module Ac of A on which G acts trivially. Given two groups G and K, an exact sequence of group homomorphisms I-->K-->E-->G--> 1 is called a group extension of G over the kernel K. When K is Abelian, the extension canoni- cally induces a G-module structure on K, and the deviation of K from being a semidirect factor of E is measured by a factor set. The group H 2 (G, A) is thus in one-to-one corre- spondence with the set of equivalence classes of the tgroup extensions of G over A which induce the originally given G-module structure on A ( 190 Groups N). This point of view is essential in the proof of the tSchur-Zassenhaus theorem ( 151 Finite Groups). H 3 (G, A) is interpreted as the set of obstructions for exten- sions (Eilenberg and Mac Lane, 1947). For a tfree group F, Hn(F,A)=O (n> 1). Ifa group G is represented as a factor group FjR of a free group F, we have a group extension 1--> K-->E-->G--> 1, where K=Rj[R,RJ and E= Fj[R, R]. Let EH2(G, K) correspond to this extension. Then for any G-module A, the cup product X-->X followed by the pairing Hom(K, A) @ K-->A provides isomorphisms W(G,Hom(K, A))W+2(G, A) (n>O) (the cup product reduction theorem of Eilenberg and MacLane, 1947); similarly, we have the reduc- tion theorem for the homology. The Z-algebra H(G, Z) = Lo Hn(G, Z) under the multipli- cation defined by the cup product is finitely generated if G is a finite group (B. B. Venkov, 1959; L. Evens, 1961). The following are mappings relative to a subgroup H. (I) The inner automorphism by xEG in- duces an isomorphism of Hn(H, A) and Hn(xHx- l , A) which reduces to the identity of W(G, A) if H == G. Hence if H is a normal . subgroup, Hn(H, A) has the structure of a GjH- module, and similarly for Hn(H, A). 758 (2) When H is a normal subgroup of G, the mapping (x" ...,xn)-->(xIH, ...,xnH) of non- homogeneous chains induces the inflation (or lift) Inf: W(GjH, A H)-->W(G, A) and the deflation Def: Hn(G, A)--> Hn(GjH, A H ). (3) The embedding of nonhomogeneous chains induces the restriction Res: Hn(G, A) -->Hn(H, A) and the injection Inj (or corestric- tion Cor): Hn(H, A)--> Hn(G, A). The theory of tinduced representation gives another con- struction of these mappings; that is, if we put lC(A)=Homz[H](Z[G],A), Res is obtained by the isomorphism Hn(G, lC(A))  Hn(H, A) com- bined with the homomorphism induced by A -->lc(A); while if we put lc(A)=Z[GJ @ z[H]A, Inj is obtained by the isomorphism Hn(H, A)  Hn(G, lc(A)) followed by the homomorphism induced by lc(A)--> A. (4) If (G: H) < 00, we have lC(A) lc(A), The composition of Hn(H, A)--> Hn(G, lc(A)) -->Hn(G, A) defines on the cohomology groups the Inj: Hn(H, A)-->Hn(G, A), while the compo- sition Hn(G, A)-->Hn(G, lC(A))-->Hn(H, A) gives the Res: Hn(G, A)-->Hn(H, A). In particular, Res: HdG, Z)-->HdH, Z) coincides with the ttransfer Gj[G, GJ-->Hj[H, H]. (5) Let H be a normal subgroup of G. Consider the additive relation p (the corre- spondence) between hE zn(H, A)c and f E zn+I(GjH, AH) determined by p(hJ) if and only if there exists agE C(G, A) such that h = Res 9 and Inf r = £5g. If the relation induces a homomorphism Hn(H, A)c-->H n +1(GjH, AH), then it is ca1\ed the transgression. If Hi(H, A) = 0 (0 < i < n), the sequence 0--> Hn(GjH, A H) -->Hn(G, A)--> Hn(H, At --> Hn+1 (GjH, A H )--> Hn+l (G, A) composed of inflation, restric- tion, and transgression mappings is exact (Hochschild and Serre, 1953) and is called the fundamental exact sequence. This exact sequence can be derived from a certain spec- tral sequence Hn(GjH,Hq(H,A))=pHn(G,A) (R. C. Lyndon, 1948; Hochschild and Serre, 1953). The relative (co)homology theory relative to a subgroup (I. T. Adamson, 1954) can be dealt with in terms of the relative Ext and the rela- tive Tor (Hochschild, 1956). Many results in the absolute case are generalized to the rela- tive case: for example, the fundamental exact sequence (Nakayama and Hattori, 1958). The relative theory is further generalized to the cohomology theory of tpermutation represen- tations of G (E. Snapper, 1964). Non-Abelian Cohomology. For a non-Abelian G-group A, the cohomology "set" HI (G, A) (and HO(G, A)) is defined as in the Abelian case by means of the nonhomogeneous cochains ( e.g., [9J). Some efforts are being made toward the construction of a more general non- 
759 Abelian theory (1. Giraud, Cohomologie non abelienne, Springer, 1971). N. Finite Groups Let G be a finite group and A a G-module. Define the norm N:A-->A by N(a)=LxEGxa, and denote Ker N by NA. The kernel of the augmentation €:Z[GJ-->Z is denoted by I. Put jfn(G, A)=W(G, A)(n>O), jfo(G, A)=AGjN A, jf-l(G,A)=NAjIA, and jf-n(G,A)=Hn_dG,A) (n> I). Then {jfn(G,')} forms a cohomological functor, called the Tate cohomology (E. Artin and J. T. Tate), that can be described as the set of cohomology gloups concerning a certain complex caIled a complete free resolution of Z, (Similar arguments are valid more generally for quasi-Frobenius rings (Nakayama, 1957), and a theory of this kind is called complete cohomology theory.) We have jfn(G, A)=O (n E Z) if and only if h dimZ[G] A .s; 1 (Naka- yama, 1957). If A satisfies the conditions (i) jfl(G p , A)=O for any Sylow p-subgroup G p of G, and (ii) there exists a  E jf2(G, A) such that Res  E jf2(G p , A) has the same order as G p and generates all of jf2(G p , A), then the homo- morphisms jfn(H, B)-->jfn+2(H, A @ B) (nEZ) defined by the cup product with Res  are isomorphisms for every subgroup H and every G-module B such that Tor(A, B)=O (Naka- yama, 1957; for B = Z, Tate, 1952). If G is cyclic, the mappings jfn(A)-->jfn+2(A) (nEZ) defined by the cup product with a generator of jf2(Z) are isomorphisms. (The notation is abbreviated by omitting G) If the orders of jfo(A) and jfl(A) are finite, their ratio is called the Herbrand quotient h(A) of A. IfO-->A'-->A -->A"-->O is exact, then h(A)=h(A')h(A"). If A is finite, then h(A) = I. By combining these two facts we obtain Herbrand's lemma: If A' is a G- submodule of A of finite index and h(A') exists, then h(A) also exists and h(A) = h(A'). The periodicity jfn(A)=jfn+p(A) (nEZ, AE G ..4t) holds if and only if every Sylow subgroup is cyclic or a tgeneralized quaternion group (Artin and Tate; [2]). Let Lj K be a finite tGalois extension with the tGalois group G, The cohomology groups of various types of G-modules related to Lj K are caIled the Galois cohomology groups ( 172 Galois Theory), Using continuous cocycIes, a cohomology theory (Tate cohomology) is developed for infinite Galois extensions as well [9,10]. By means of Galois cohomology ( 59 Class Field Theory), the cohomology theory of finite groups and of ttotally disconnected compact groups (which are tprofinite groups) plays an important role in class field theory and its related areas. 200P Homological Algebra O. Cohomology Theory of Lie Algebras Let 9 be a tLie algebra over a commutative ring K, and assume that 9 is K-free. The ten- vel oping algebra V = V(g) is a tsupplemented algebra over K. For a g-module (= V-module) A, Ext(K, A) and Tor(K, A) are called the cohomology groups Hn(g, A) and homology groups Hn(g, A), respectively, of 9 relative to the coefficient module A. They are usually described by means of the V -free resolution V @ AK(g) of K (called the standard com- plex of g) constructed by C. ChevalIey and Eilenberg (1948), where AK(g) is the exterior algebra of the K-module 9 and (denoting 1 @ (Xl/\. ..Ax") by (Xl' ..., x n » the differenti- ation is given by n d(x l ,..., x n )= L (_I)i+1 xi(Xl>"', Xi>"" x n ) j;:l + L (-I)i+j([xi,xJ, XI' ..., 1 j<in Xi, .,. , X j , ..' , X n ). For n> [g: KJ, W(g, A)= Hn(g, A)=O. If 9 is a tsemisimple Lie algebra over a field K of char- acteristic 0, we have Hl(g, A)=O, H 2 (g, A)=O, while H 3 (g, K)#O. Hl(g, A)=O is equivalent to Weyl's theorem, which asserts the complete reducibility of finite-dimensional representa- tions ( 248 Lie Algebras E). H2(g, A) and H 3 (g, A) are interpreted by means of Lie algebra extensions as in the cohomology of groups. The theorem on tLevi decomposition is derived from H 2 (g,A)=0. ChevalIey and Eilenberg constructed this cohomology by algebraization of the cohomology of compact tLie groups. They also introduced the notion of relative cohomology groups Hn(g, E), A) relative to a Lie subalgebra E) of g, which cor- respond to the cohomology of homogeneous spaces. Hn(g, E), A) does not always coincide with Extu(q).u()(K, A) (Hochschild, 1956), but does so in an important case where K is a field of characteristic 0 and E) is treductive in 9 ( 248 Lie Algebras). For ttransformation spaces of tlinear alge- braic groups G over a field K, the rational cohomology groups are introduced using the notion of rational injectivity (Hochschild, 1961). In particular, if G is a tunipotent alge- braic group over a field K of characteristic 0, then H(G, A) is isomorphic to H(g, A), where 9 is the Lie algebra of G. There is also a relative theory, P. Amitsur Cohomology Let R be a commutative ring, and F a co- variant functor from the category '6'R of com- 
200Q Homological Algebra mutative R-algebras to the category of Abelian groups. For SE(t&'R and n=O, 1,2,..., we write sin) = S @ .,. @ S (n-fold tensor product over R). Let 8 i :s(n+1)-->s(n+2) (i=0, 1, ...,n+ 1) be R-morphisms defined by 8i(XO @ ... @ x n ) = xo@ ". @Xi-I @ 1 @Xi@ '" @x n . Defining d n : F(s(n+!»)-->F(s(n+2») by d n = L7;+-J( -1 )iF(8;), we obtain a cochain complex {F(s(n+!»), dn}. This complex and its cohomology groups are called the Amitsur complex and the Amitsur cohomology groups, and are usually denoted by C(SjR, F) and Hn(SjR, F), respectively. If S j R is a finite Galois extension with Galois group G, then the group Hn(SjR, U) of the unit group functor U is naturally iso- morphic to H"(G, U(S)). If SjR is a finite pure- ly inseparable extension, then Hn(SjR, U)=O for n;;:,3, The group H 2 (SjR, U) is related to the tBrauer group B(Sj R) ( 29 Associative Algebras K). Q. Relative Theory In the course of the development of homolog- ical algebra, it has been recognized that the notion of projective (resp. injective) resolutions should be generalized ([14,4]; Hochschild, 1956). In the meantime, a method has been introduced that utilizes simplicial objects in order to define the derived functor of an arbi- trary functor with Abelian category as its range ([15,17]; J. Beck, 1967). As a consequence of these developments, there emerged a view- point, which we describe below, making it possible to unify various known definitions of (co-)homology theories that has been designed for particular cases. Let d be a category and fY a class of ob- jects in d. In this section, we denote the set Hom",,(A, B) of morphisms by d(A, B). A morphism .r: A -> B in d is called a fY- epimorphism if the induced mapping d(PJ) (= Hom(PJ)): d(P, A)-->d(P, B) is surjective for any PEfY. The class .OJ' is called a projective class in d ifthere exist an object PEfY and a fY-epimorphismf:P-->A for each object AEd. To any category d, we associate a preaddi- tive category Z,,//, adding a zero object to d if necessary: Put ObZd =Obd, Zd(A, B) = free Abelian group generated by the set d(A, B). d is regarded as a subcategory of Zd by the natural inclusion J:d -->Zd. Any functor T from d into an Abelian category :!i has a unique additive extension T: Zd -->:!i such that T= TJ, If d is an additive category, there is a canonical projection 0: Zd -->d such that OJ = I d. If furthermore T is additive, then T= TO. Now suppose that a projective class .OJ' in d is given. For AEd, an augmented chain com- 760 plex X-->A in Z,,//, d n dl do -->Xn......Xn-I-->...-->Xo-->A, did i + l ==0, is called a fY-projective resolution of A if (i) XnEfY (n;;:, 0), and (ii) it is ,OJ'-acyclic (i.e., the sequence d n -->Zd(P, Xn)-->Z,'//(P, X n - I )-->... dl do -->Zd(P, Xo) -->Z,'//(P, A)-->O is exact for any PEfY [16]). Note that the fY-acyclicity in this case implies the existence of a contracting homotopy h n : Z,,//(P, Xn)-->Zd(P, X n + I ), n;;:'-I,X_ I =A, satisfying d n +! h n + h n - I d n = 1. Comparison theorem: Given two ,OJ'- projective resolutions X.-->A and Y.-->A, we have that the chain complexes X, and y, are chain equivalent in Z,'//. Let T:,'// -->.r.J8 be a (covariant) functor with f!.$ an Abelian category. The nth left derived func- tor Ln T: d -->:!i (n;;:' 0) of T, with respect to fY, is defined by Ln T(A) = Hn(TX) for a ,OJ'- projective resolution X-->A. The derived func- tors Ln T will remain unaffected if we replace the projective class fY by its enlargement ;ij == {objects in fY together with their retracts}. We can easily verify that (i) Lo T(P) = T(P) and Ln T(P)=O (n>O) for PE:Jl', and (ii) a short exact sequence 0--> T' --> T --> Tn -->0 of functors: ,,// -->f!.$ induces a long exact sequence of derived functors --> Ln T' --> Ln T -> Ln Tn!.. Ln-I T' -->. .. --> Lo Tn -->0. If d is preadditive and has a zero object and kernels, then it is routine to give a fY- projective resolution of any object. If d has finite tlimits (i.e., tfinite products and tequal- izers), it can be proved that there exists a fY-projective resolution for any object [17]. There is a standard functorial construction which provides canonically a projective class .OJ' III a category ,,// and a fY-projective reso- lution of any object in d. Let (G, 8, b) be a cotriple (or comonad [18] or functor coalge- bra) in d. Here G:,,//-->d is an endofunctor, 8: G--> I d and b: G--> G 2 = GG are natural trans- formations such that G80 b = 8G 0 b = 1 G and Gbob=bGob. A cotriple comes usually from a pair (F, U) oftadjoint functors U:d->, F:(t&' -->,,// with' natural bijection ,L'//(F(C), A) (C,U(A)). Putting G=FU:,'//-->d,8(A)= ;'-1 (1U(A»): FU(A)-->A, IJ(C) =A(1F(C»): C--> UF(C), we have a cotriple (G= FU, 8, b == FIJU) in cd, Conversely, it is known that any cotri- pIe in d is induced from a suitable pair of adjoint functors. 
761 Given a cotriple (G, t:, 15) in ,'/!, we define a projective class q}'G={G(A)IAEd} (or its enlargement 'G) in ,'/!, and an augmented simplicial object G *(A)--> A for any object A as follows. Put Gn(A) = G n +! (A) (n;;:, 0), a i = ai' = Git:Gn-i(A): Gn(A)--> G n - I (A) (face operator) and 15 i = 15i' = G i 15G n - i (A): Gn(A)-->Gn+!(A) (de- generacy operator) for O.s; i.s; n, G -I (A) = A. Then G*(A)-->A gives rise to a q}'G (or equiva- lently q;G)-projective resolution of A in Zd with differentials d n = L?=o( _1)i a i . This is the bar resolution (or standard resolution) in a generalized sense, and there are many (co-)homology theories defined by means of such constructions. Most of the above definitions and construc- tions can be dualized so as to give injective classes f, f-injective resolutions, and right derived functors with respect to f, triples (or manads), etc. (See also [19J for generalized (co-)homology). References [IJ Sem. H. Cartan, 19501951, Paris, 1951. [2J H. P. Cartan and S. Eilenberg, Homolog- ical algebra, Princeton Univ. Press, 1956. [3J A. Grothendieck, Sur quelques points d'algebre homologique, T6hoku Math. 1., (2) 9 (1957), 119-221. [4J R. Godement, Topologie algebrique et theorie des faisceaux, Actualites Sci. Ind., Hermann, 1958. [5J D. G. Northcott, An introduction to homo- logical algebra, Cambridge Univ. Press, 1960. [6J S. MacLane, Homology, Springer, 1963. [7J P. Freyd, Abelian categories, Harper, 1964. [8J A. Grothendieck (and J. Dieudonne), Ele- ments de geometrie algebrique I-III, Pub!. Math. Inst. HES, 1960-1963. [9] 1.-P. Serre, Cohomologie galoisienne, Lecture notes in math. 5, Springer, 1964. [lOJ S. Lang, Rapport sur la cohomologie des groupes, Benjamin, 1966, [IIJ E. Weiss, Cohomology of groUPb, Academic Press, 1969. [12J H. Bass, Algebraic K-theory, Benjamin, 1968. [13J D. Quillen, Projective modules over polynomial rings, Inventiones Math., 36 (1976),167-171. [14J S. Eilenberg and 1. C. Moore, Founda- tion of relative homological algebra, Mem. Amer. Math. Soc., 55 (1965). [15] A. Dold and D. Puppe, Homologie Nicht-Additiver Funktoren, Anwendungen, Ann. Inst. Fourier, 11 (1961),201-312. [16J M. Barr and 1. Beck, Homology and standard constructions, Lecture notes in math. RO, Springer, 1969. 201 A Homology Theory [17J M. Tierney and W. Vogel, Simplicial derived functors, Lecture notes in math. 86, Springer, 1969. [18J S. MacLane, Categories for the working mathematician, Springer, 1971. [19J 1. F. Adams, Stable homotopy and gen- eralised homology, Lecture notes, Univ. of Chicago, 1971. 201 (IX.6) Homology Theory A. History Homology theory is the oldest and most ex- tensively developed portion of talgebraic to- pology. Historically, it started with measuring the higher-dimensional connectivity of a space in the sense that the O-dimensional connectiv- ity is the number of tconnected components of the space. For example, take a t2-sphere S2 and a t2-torus T 2 . Then T 2 is distinguished from S2 by the fact that on T 2 a closed curve can be drawn without forming a boundary, while this is not true for S2. In fact, a curve (c l or C'l in Fig. 1) can be drawn on T 2 so that it does not form a boundary of an embedded 2- disk. On more complicated tsurfaces there are many kinds of such closed curves. The maxi- mum number of such closed curves is the 1- dimensional connectivity of the surface; this is a topological property of the surface. For example, the I-dimensional connectivity of 52 is 0, that of T 2 is 2, and that of the surface in Fig. 2 is 6. A more general consideration of the bounding properties of q-dimensional tclosed submanifolds of a manifold led E. Betti (Ann. Mat. Pure Appl., 4 (1871)) to introduce the notion of the q-dimensional connectivity of the manifold, which was a precursor of homology theory.  Fig. 1 3 Fig. 2 The foundation of homology theory was laid by tHo Poincare [1]. He started his study of homology with analytic treatment of mani- folds, which led to a series of complications. 
201 B Homology Theory Poincare then introduced a new method for the study, now called tcombinatorial topology: He decomposed the manifold into elementary pieces or tcells, which adjoin one another in a regular fashion; he then substituted the algebraic notions of tcycles and tboundary operators for the geometrical notions of closed submanifolds and boundaries. Thereby the notion of homology groups acquired an exact logical meaning, and the fundamental for- mulas, which are now called Poincare for- mulas, were proved. After Poincare, much of the development of homology theory centered around the ques- tion of the topological invariance of homology groups, that is, the independence of the ho- mology groups on the choice of tcellular de- composition. Through the development of tsimplicial complexes and their techniques, J. W, Alexander (Trans. Amer. Math. Soc., 28 (1926)) gave the first fully satisfactory prooffor topological invariance of the homology groups of tpolyhedra. In those days, the homology groups themselves were barely recognized; instead, one dealt with numerical invariants such as the tBetti numbers and the ttorsion coefficients [2,3]. During the period 1925-1935 there was a gradual shift of interest from the numerical invariants to the homology groups themselves, and homology theory developed intensively [4,5]. S. Lefschetz (Trans. Amer. Math. Soc., 28 (1926)) added the theory of the tintersection products to the homology of manifolds. He also invented trelative homology theory (Proc. Nat. Acad. Sci. US, 13 (1927)) and generalized the tduality theorems of Poincare and Alex- ander (ibid., 15 (1929)). G. de Rham (J. Math. Pures Appl., lO (1931)) obtained a duality theorem that relates the texterior differential forms in a manifold to the homology groups of the manifold. L. S. Pontryagin (Ann. Math., (2) 35 (1934)) proved the complete group- invariant form of the Alexander duality theo- rem. These duality theorems seemed to reflect the existence of a theory dual to homology theory, and the genesis of this dual theory, now called tcohomology, occurred in 1935, in the work of Alexander and A. N. Kolmo- gorov. It was discovered subsequently by Alexander (Ann. Math., (2) 37 (1936)), E. Cech (ibid.), and H. Whitney (ibid., 39 (1938)) that the cohomology of a polyhedron can be made into a ring. On the other hand, after L. Vietoris (Math. Ann., 97 (1927)) and P. S. Aleksandrov (Ann. Math., (2) 30 (1928)), many devices were inven- ted to extend the homology theory of poly- hedra to general ttopological spaces, and numerous variants of homology theory ap- peared at the hands ofCech (1932), Lefschetz 762 (1933), Alexander (1935), and Kolmogorov (1936). This development served to clarify the relations between combinatorial and set- theoretic methods in topology ( 426 Topol- ogy), whereas it produced complexity and confusion in homology theory [6]. S. Eilen- berg and N. E. Steenrod (Proc. Nat. Acad. Sci. US, 31 (1945) and [7J) cleared the air by treat- ing homology axiomatically. Roughly speaking, a homology theory as- signs tAbelian groups to ttopological spaces and thomomorphisms to tcontinuous map- pings of one space to another. In this way, a homology theory is an algebraic image of topology; it converts topological problems to algebraic problems. Starting from this view- point, Eilenberg and Steenrod selected some fundamental properties as axioms to charac- terize homology theory. This unified homol- ogy theories and allowed systematic treatment of homological problems which had previ- ously been done separately "by hand" in each case. Moreover, it motivated the birth of a new branch of mathematics, called thomo- logical algebra. B. Homology of Chain Complexes A chain complex C = {C q , Oq} is a collection of (additive) Abelian groups C q , one for each integer q, and of homomorphisms Oq: Cq-->C q _ 1 such that Oq 0 0 q+1 = 0 for each q. Elements of C q are called q-chains of C, and Oq is called the boundary operator. A subcomplex C = {C, o} of C is a chain complex such that C c C q and o = Oq I C for each q. The tkernel of 0 q is denoted by Zq( C), and its element is called a q-cycle of C. The timage of u q + 1 is denoted by Bq(C), and its element is called a q-boundary of C. The relation Oq 0 Oq+1 =0 implies Bq(C)cZq(C). The tquotient group Zq(C)/Bq(C) is denoted by Hq(C), the qth ho- mology group of C. Elements of Hq(C) are called q-dimensional homology classes of C. Two cycles representing the same homology class are said to be homologous. The tdirect sum LqHq(C) is denoted by H*(C) and is called the homology group of C. If C = {C q , Oq} and C = {C, o} are chain complexes, a chain mapping (chain map) <p: C --> C is a sequence of homomorphisms <Pq: C q --> C such that u 0 <Pq = <Pq_1 OOq for each q. If <p: C --> C is a chain mapping, then <Pq sends Zq(C) to Zq(C) and Bq(C) to Bq(C), and hence <P induces a homomorphism of Hq(C) to Hq(C). If Hq(C) is tfinitely generated, it can be de- composed into the direct SUm of a free Abelian group Bq(C) and a finite Abelian group Tq(C). Bq(C) and Tq(C) are called the qth Betti group 
763 of C and the qth torsion group of C, respec- tively. The trank Pq of Bq(C) is called the qth Betti numher of C. Tq( C) is isomorphic to the direct sum of r(q) finite cyclic groups of orders Of, OJ, ... , O,iq), where Of> 1 and O? divides Oil for i = 1, "', r(q) -1 ( 2 Abelian Groups). The numbers Of, OJ, ... , O,q) are called the qth tor- sion coefficients of C. If H*(C) is finitely generated, then the num- ber x( C) = L q ( -1)q Pq is called the Euler num- her, the Euler characteristic, or the Euler- Poincare characteristic of C. In this case a tpolynomial Lqpqt q with variable t is called the Poincare polynomial of C. Let C be a chain complex such that, for each q, C q is a tfree Abelian group of finite trank. Then the qth Betti number and the qth torsion coefficients are well defined for each q. Denote the ranks of C q and Bq(C) by ex q and [3q, respec- tively. Then it holds that Pq = ex q - [3q - [3q-l, and hence x( C) = L q ( -1)q ex q . (Euler-Poincare formula). Moreover, there exists a set of bases, one for each C q , with the following properties: For each q, the base for C q is composed of five types of elements, a't (1 .s; i.s; [3q - r q ), b i q (l.s;i.s;r q ), C,! (l.s;i.s;pq), d,! (l.s;i.s;r q _ I ), and e't (1.s;i.s;[3q-1 -r q - 1 ); c q satisfies ca't=O, cb?=O, cc't = 0, cd't == 0i q - I br l , and ce't = ar 1 . Such a set of bases is called the canonical hasis of C. Let C be a chain complex such that each C q is a free Abelian group with a given base {O"n Then the incidence numher [O"i q : 0"/-1] E Z is defined by Cq(O"iq)=Lj[0"?:O"r1JO"r 1 . This notion was commonly used in the early days of topology. C. Homology of Simplicial Complexes Let K be a tsimplicial complex. An oriented q- simplex 0" of K is a q-simplex S E K together with an equivalence class of ttotal orderings of the vertices of s, two orderings being equiva- lent if they differ by an even permutation of the vertices. If a o , ..., a q are the vertices of s, then [a o , a I> .. . ,aqJ denotes the oriented q- simplex of K consisting of the simplex s to- gether with the equivalence class of the order- ing a o < a I < ., , < a q of its vertices. For every vertex a of K there is a unique oriented 0- simplex [a], and to every q-simplex with q 1 there correspond exactly two oriented q- simplexes, which are said to be opposites of one another. Let Cq(K) denote the Abelian group generated by the oriented q-simp1exes of K with the relations 0" + 0"' = 0 if 0"' is the oppo- site of 0". If we choose an oriented q-simplex O"i q for each q-simplex Si q of K, then each element of Cq(K) is written uniquely as a finite sum Li giO"i q with integers gi # 0, and 201 D Homology Theory Cq(K) is a free Abelian group generated by the set {O"n We define a homomorphism c q : Cq(K)-->C q _ 1 (K) by cq[a o , a l ,..., aqJ = L'!o( -1 na o , ..., ai-I' ai+1> ..., a q ]. Then CqOC q + 1 =0 holds, and we have a chain com- plex C(K)= {Cq(K)A}(Cq(K) = {O} ifq<O), called the (oriented) simplicial chain complex. The homology group H*(C(K» is denoted by H*(K) and is called the (integral) homology group of the simplicial complex K. Let Kl and K 2 be simplicial complexes, and let f: K 1 --> K 2 be a tsimplicial mapping. Then for each q, a homomorphism f#q: Cq(Kd--> C q (K 2 ) can be defined by f#q([ao,al> ...,aqJ)= [f(ao)J(ad, ... J(a q )], where the right-hand side is understood to be 0 if f(u o ), f(a 1 ), '" , f(a q ) are not distinct. The sequence f# = {f#q} is a chain mapping of C(Kd to C(K 2 ), and it induces a homomorphism f*:H*(K 1 )-->H*(K 2 ). If K is a tfinite simplicial complex, then the Betti numbers, the torsion coefficients, and the Euler characteristic of K are defined to be those for the chain complex C(K). Let KI and K 2 be a subcomplexes of a sim- plicial complex K. Then we have the follow- ing texact sequence which relates the homol- ogy group of Kl U K 2 to the homology groups Co , of K l' K2' and Kl nK 2 : ...-->Hq(Kl nK 2 )--> p Co Hq(Kl)+Hq(K2)-->Hq(Kl UK 2 )-->H q - 1 (K 1 n K2)'" , where ex, [3, and C * are defined as follows. Let i,:Kl nK 2 -->K[ andj[:K[-->K 1 U K 2 (I = 1,2) be inclusion mappings; then ex(a) = (ilO(a), -i 2o (a)) and [3(a1,a2)=jlO(a1)+j2o(a2)' If Z =C 1 +C2 (C,E C(K[)) is a cycle of C(K 1 UK 2 ) then c(c l )= -c(c 2 ) is a cycle of C(K 1 nK 2 ); c* sends the homology class of z to the homol- ogy class of c(cd. The sequence is called the Mayer-Vietoris exact sequence of the couple {K I , K 2 }, and c* is referred to as the connect- ing homomorphism. The prototype of the Mayer- Vietoris exact sequence was obtained by W. Mayer (Monatsh. Math. Phys., 36 (1929)) and L. Vietoris (ibid., 37 (1930)). The present form is due to Eilenberg and Steenrod [7]. D. Homology of Polyhedra If K is a tsimplicial complex and K' is a tsub- division of K, then there exists a canonical isomorphism H*(K);:; H*(K'). This proves that if K j and K 2 are tsimplicial decompositions of a tpolyhedron then H*(K I ) and H*(K 2 ) are isomorphic, because there exists a common subdivision of KI and K 2 . Thus we may de- fine the (integral) homology groups H*(X) of a polyhedron X to be the homology group H*(K) of a simplicial decomposition K of X. Let X and Y be polyhedra, and let f: X --> Y be a continuous mapping. Take a tsimplicial approximation <p: K --> L of f Then a homo- 
201 E Homology Theory morphism of H*(X) to H*(y) given by the induced homomorphism <p*: H*(K)--> H*(L) is independent of the choice of <p, and is denoted by f*. The following properties hold: (i) 1 * = 1 :H*(X)-->H*(X), where 1 is the identity; (ii) If f: X --> Y and g: Y -->Z are continuous map- pings, then (go f)* =g* 0 f*: H*(X)-->H*(Z); (iii) If ,[, f' : X --> Yare homotopic, then f* = f:H*(X)-+H*(Y). These imply the homo- topy invariance of the homology group stated as follows: If X and Yare polyhedra which are thomotopy equivalent, then H*(X) and H*(Y) are isomorphic. Specifically, the homology group is a topological invariant. Thus if X is a ttriangulable space (for example, if X is a tdifferentiable manifold), then its homology group H*(X) can be defined to be the homol- ogy group H*(K), where (K, t) is a ttriangu- lation of X. This homology group is referred to as the simplicial homology group of X. Similarly, if X is a compact triangulable space, the Betti numbers Pq(X) of X, etc. can be de- fined to be those for the chain complex C(K). If we denote by pt a single point, then Ho(pt) = Z (the group of integers) and Hq(pt) = 0 if q#O. For an n-sphere sn, a 2-torus T l , and a treal projective plane p2, the homology groups can be computed as follows by means of their triangulations: (1) Ho(sn);=:; Hn(sn);=:; Z, and Hq(sn)=o if q #0, n; (2) Ho(T2);=:; H l (T 2 );=:;Z, HI (T 2 );=:;Z+Z, and Hq(T2)=0 if q#O, 1,2; (3) HO(p2);=:;Z, Hd P2 );=:;Z2' and Hq(P2)=0 if q #0, 1, where Z2 = Zj2Z. Two surfaces are homeomorphic if and only if their integral homology groups are isomor- phic (41O Surfaces). E. Singular Homology There are various devices for defining ho- mology groups of general topological spaces. A familiar one is the singular homology theory initiated by S. Lefschetz (Bull. Amer. Math. Soc., 39 (1933)) and improved by S. Eilenberg (Ann. Math., (2) 45 (1944)). The standard q-simplex is the convex set q c Rq+1 consisting of all (q + 1 )-tuples (to,t" ..., t q ) of real numbers with t i ? 0, to + t 1+ ... + t q = 1. Any continuous mapping of q to a topological space X is called a singular q- simplex in X. The ith face of a singular q- simplex O":q-+X is the singular (q-l)-simplex 0" 0 8 i : q-I -->X, where the linear embedding 8i:q-I-->q is defined by 8 i (to, ...,t i - l ,[i+1' ..., t q ) = (to, "', t i - l , 0, t i + l , ..., t q ). For each integer q, let Sq(X) denote the free Abelian group generated by the singular q- simplexes in X (Sq(X)=O if q <0), and define a homomorphism 8q: Sq(X)-->Sq_1 (X) by Dq(O") = 764 Lio( _I)i 0" 0 8 i . Then we have a chain com- plex S(X) = {Sq(X), 8q}, called the singular chain complex of X. The homology group H*(S(X)) is denoted by H*(X) and is called the integral singular homology group of the topological space X. Given a continuous mapping f: X --> Y, a chain mapping f#: S(X)-->S(Y) is defined by sending each singular simplex 0": q-->X to the singular simplex f 00": q--> Y, and it induces the homomorphism f*: H*(X)-->H*(y). The properties (i), (ii), (iii) of f* in Section D hold for continuous mappings of topological spaces, and hence the singular homology group is a homotopy invariant. The homology group H*(K) of a simplicial complex K is isomorphic to the singular ho- mology group H*(IKI) of the polyhedron IKI. Therefore the simplicial homology group of a triangulable space is isomorphic to the sin- gular homology group of the space. If {X;} is the set of tarcwise connected com- ponents of a topological space X, then H*(X) ;=:;LiH*(XJ If X is arcwise connected, then Ho(X);=:; Z. If {AJ is the collection of all the compact subsets of X directed by inclusion, then H*(X) is isomorphic to the tinductive limit IimH*(A,). It is not true that there is a Maye;Vietori sequence in singular homology for any couple {XI, Xl} of subsets of X. How- ever, for certain couples {Xl' X 2 }, there is a Mayer- Vietoris exact sequence of {Xl' X 2 }: * a P ...--:.Hq(XI n X 2 )--> Hq(Xd + H q (X 2 )--> Hq(XI U X2)Hq-l (XI n Xl)"" For example, this holds if X=IntX t UlntX 2 , where Int denotes the tinterior. Let c: X --> pt be the mapping of a topolog- ical space to a single point. Then the kernel of c*: H*(X)-->H*(pt) is denoted by HAX) and is called the reduced homology group of X. It holds that H*(X);=:;H*(X)+H*(pt). Regard the tsuspension SX as the union of two copies of the tcone over X. Then the connecting homomorphism in the Meyer- Vietoris sequence gives an isomorphism Hq(SX);=:; Hq-l (X) for any q. The inverse of this isomorphism is called the suspension iso- morphism for homology. Let M be a tC"'-manifold. A C'-singular q- simplex in M is a singular q-simplex O":q-->M such that 0" extends to a CCXJ-mapping from an open neighborhood of q in {(to, t I' "', tq)E Rq+1 I to + t I + ... + t q = I} to M. The totality of C' -singular simplexes in M generates a subcomplex of S(M), denoted by SOO(M). The inclusion S"'(M)c S(M) induces an isomor- phism H*(SOO(M));=:;H*(M). Let M be an n-dimensional ttopological manifold. Then Hq(M)=O unless O.s;q.s;n, and H*(M) is finitely generated if M is compact. 
765 F. Homology ofCW Complexes Homology theory is tractable in the category oftCW complexes by virtue of the facts stated below. Let X be a topological space and A its sub- set. Then we denote by X/A the quotient space obtained from X by shrinking A to a point, understanding X/0 to be the disjoint union XU pt, If Xl' X 2 are tsubcomplexes of a CW complex, then the Mayer- Vietoris exact se quence of {Xl> X 2 } and the excision isomor- phism i*:H* (Xd(X l nX2))H*«Xl UX 2 )/X 2 ) (i: inclusion) are valid. If A is a subcomplex of a CW complex X, then we have the following reduced homology exact sequence of (X, A):... 0* - 1* - i* _ 0* - i* ......Hq(A)-->Hq(X)-->Hq(X/ A)-->Hq_l (A)-->.." where i* andj* are induced by the inclusion i: A-->X and the collapsingj:X -->X/A, and (\ is given by a commutative diagram , Hq(X/A)Hq_1 (A) ih* ls _ h' _ Hq(X U CA)--*>Hq(SA), Here CA is the cone over A, S is the sus- pension isomorphism, and h: XU CA -->(X U CA)/CA =X/A and h':X UCA-->(XUCA)/X = SA are collapsings. More generally, if A, B are subcomplexes of a CW complex X and A=:> B, then we have the following reduced homology exact sequence of (X, A, B): ...  Hq(A/B)'.".Hq(X/B)Hq(X/ A) Hq-l (A/B) .... Furthermore, the homology group H*(X) of a CW complex X can be computed in the following manner. Let xq denote the q-skeleton of X, i.e., the union of all cells of dimensions .s; q, Put Cq(X)=Hq(Xq/xqI), and let aq:cq(X)--> C q - l (X) be the connecting homomorphism a*:Hq(xq/xq-I)-->Hq_dXq-l/xq-Z) in the re- duced homology exact sequence of (xq, Xq-l, xq-z). Then C(X)= {Cq(X), a q } is a chain com- plex such that Cq(X) is a free Abelian group with one generator for each q-cell of X. If X is a polyhedron IKI, then C(X) coincides with the simplicial chain complex C(K). The ho- mology group H*(C(K)) is called the cellular homology group of the CW complex X. This is isomorphic to the singular homology group H*(X). Since a CW decomposition frequently re- quires fewer cells than a simplicial decompo- sition, the cellular homology groups are use- ful in calculating the homology groups. For example, the tcomplex n-dimensional projec- tive space CPO has a CW decomposition with a single 2i-cell for each i = 0, 1, . .. , n, and hence we see immediately that Hq(cpn)z if q=2i (O.s;i.s;n) and =0 otherwise, If X is a finite CW complex and ex q denotes 201 G Homology Theory the number of q-cells of X, then we have the Euler-Poincare formula x(X) = L q ( -1)qex q , In particular, if X is homeomorphic to S2 then we have the Euler theorem on polyhedra: ex o - ex l + ex2 = 2. This was the first important result in topology (L. Euler, 1752). G. Homology with Coefficients in Abelian Groups Given a chain complex C and an Abelian group G, we have a new chain complex C @ G given by (C @ G)q== C q @ G and aq(c@g)== iJqc @ 9 (CE C q , gE G), where @ is the ttensor product of Abelian groups. For a topological space X, the homology group of the chain complex S(X) @ G is denoted by H*(X; G) and is called the singular homology group of X with coefficients in G. The homology group H*(K; G) of a simplicial complex K with coeffi- cients in G and the cellular homology group H*(C(X); G) of a CW complex X with coeffi- cients in G are similarly defined, The ho- mology group with coefficients in Z is the integral homology group. The previous results for the integral ho- mology groups generalize in a straightforward fashion to homology groups with coefficients in G. We have the homomorphism K:Hq(X)@ G-->Hq(X; G) sending a@gEHq(X)@ G to the homology class of z @ gE Zq(S(X)@ G), where z is a representative cycle of a. The follow- ing theorem is known as the universal coeffi- cient theorem for homology, since it expresses Hq(X; G) in terms of Hq(X), Hq-l (X), and G: There is an exact sequence 0--> Hq(X) @ G Hq(X; G)--> Tor(H q _ 1 (X), G)-->O, and this se- quence is split ( 200 Homological Algebra). Universal coefficient theorems of this type were first shown by S. Eilenberg and S. Mac- Lane (Ann. Math., (2) 43 (1942). Let A be a tring with a unit 1. Then a chain complex over A is a chain complex C such that each C q is a tA-module and each a q is a tA- homomorphism. The homology groups Hq(C) of a chain complex C over A are A-modules, If C is a chain complex, C @ A forms naturally a chain complex over A. In particular, if X is a topological space, then S(X) @ A is a chain complex over A, and Hq(X; A) are A-modules. In this case, the induced homomorphisms f*: Hq(X; A)-->Hq(Y; A) are A-homomorphisms. The homology groups with coefficients in a field k are tvector spaces over k and are useful in applications. If H*(X) is finitely generated, then x(X) == L q ( -l)q dimk Hq(X; k) holds for any field k. 
201 H Homology Theory H. Cohomology A cochain complex C = {cq, b q } is a collection of Abelian groups cq, one for each integer q, and of homomorphisms b q : C q --> cq+1 such that b q + 1 0 b q =0. Elements of C q are called q- cochains, and b q is called the coboundary operator. The notions of subcomplex of a co- chain complex, cocycIe, coboundary, coho- mology group, and cochain mapping are de- fined as in chain complex, Let Hom(A, B) denote the tgroup of homomorphisms from an Abelian group A to an Abelian group B. Given a chain complex C and an Abelian group G, a cochain complex C* = Hom( C, G) is defined by cq = Hom(C q , G) and (bqu)(c) = u(a q + 1 c) (UECq,CECq+d. For a topological space X, the cochain complex Hom(S(X), G) is called the singular cochain complex of X with coefficients in G, and elements of Hom(SiX), G) are called singular q-cochains of X. The cohomology group of Hom(S(X), G) is denoted by H*(X; G) and is called the singular cohomology group of X with coefficients in G. We write H*(X) for H*(X; Z); this is called the integral cohomol- ogy group of X. Similarly, the cohomology group H*(K; G) of a simplicial complex K with coefficients in G and the cellular coho- mology group H*(C(X); G) of a CW complex X with coefficients in G are defined. There are isomorphisms H*(K; G) H*(IKI; G) and H*(C(X); G) H*(X; G). If f: X --> Y is a continuous mapping, then a cochain map f#: Hom(S(Y), G)-->Hom(S(X), G) is defined by (/#u)(c)=uU#c) with UE Hom(Sq(Y), G) and CESq(X). Therefore f in- duces the homomorphism f*: H*(Y; G)--> H*(X; G). The following properties hold: (i) 1 * = 1; (ii) (g 0 fJ* = f* 0 g*; (iii) If r and I' are homotopic, then f* = 1'*. In particular, the singular cohomology groups are homotopy invariants. The tcokerne1 of c*: H*(pt; G)--> H*(X; G) induced by the mapping c: X -->pt is denoted by H*(X; G) and is called the re- duced cohomology group of X. For  E W(X; G) and a E HiX), the Kro- necker index <,a>EG is defined naturally in terms of representatives of  and a. We have the following universal coefficient theorem for cohomology: There is an exact sequence 0--> Ext(H q _ 1 (X), G)-->W(X; G)Hom(Hq(X), G)--> 0, and this sequence is split, where K is given by the Kronecker index ( 200 Homological Algebra). If A is a ring with 1, then a cochain complex over A is defined analogously to a chain com- plex over !\. The singular cochain complex Hom(S(X), A) forms naturally a cochain com- plex over A, and Hq(X; A) are A-modules. For  E Hq(X; A) and aE Hq(X; A), the Kronecker 766 index <,a>EA is defined naturally. Ifk is a field, then the vector spaces Hq(X; k) are identified with the dual space of Hq(X, k) by means of the Kronecker index. If M is a Coo-manifold, then there is a co- chain complex :D(M)= {:Dq(M),d} over the field R of real numbers, where :Dq(M) is the real vector space consisting of the tdifferen- tial forms of degree q on M, and d::Dq(M)--> :D q + 1 (M) is the texterior differentiation. The cochain complex :D(M) is called the de Rham complex of M, and its cohomology group H*(:D(M)) is called the de Rham cohomology group of M, A cochain mapping J::D(M)--> Hom(SOO(M), R) is defined by (J(w»)(O") = f O"*w, d q where WE Dq(M), 0": q-->M is a Coo singular q- simplex in M, and O"*W denotes the tpullback of w by 0". We have isomorphisms H*(:D(M)) H*(Hom(SOO(M),R)).f- H*(M; R), where i* is induced by the inclusion SOO(M)c S(M). This result is called the de Rham theo- rem on the cohomology of manifolds ( 105 Differentiable Manifolds). I. Cohomology Rings Given a topological space X and a ring !\, the cup product u,-,vEHom(Sp+q(X), A) of co- chains UE Hom(Sp(X), A) and VE Hom(Sq(X), A) is defined by (u,-,v)(O")=u(O"ot:)v(O"ot:'), where O":p+q-->X is a singular (p+q)-simp1ex, t::P--> p+q and t:': q -->p+q are given by t:(t o , t 1, ..., t p ) =(to, t 1, ..., t p , 0, ...,0), and t:'(t p , t p + h ...,tp+q)=(O, ...,O,tp,tP+h ...,t p + q )' The prod- uct operation is bilinear, and the formula b(u,-, v) = bu '-' v+ (-1 jPu '-' bv holds. There- fore it gives rise to the cup product  '-' I] E Hp+q(X; A) of cohomology classes  E HP(X; A) and I] E Hq(X; A). This cup product operation makes H*(X; A) into a ring, which is called the singular cohomology ring of X with coeffi- cients in A. If !\ has 1, the cohomology class represented by the O-cocycle taking the value 1 on each singular 0-simp1ex serves as 1 of H*(X; A). If A is commutative, then I] = (-I)pql] holds. The induced homomorphism f*: H*( Y; A)-->H*(X; A) preserves the prod- uct, and hence the cohomology ring is a ho- motopy invariant. If K is a simplicial complex, the cup prod- uct operation '-' : HP(K; A) @ Hq(K; A)--> Hp+q(K; A) is induced from the opera- tion,-,: Hom(Cp(K), A) @ Hom(Cq(K), A)--> Hom(Cp+q(K),!\) defined as follows. Adopt- ing a tlinear ordering of vertices of K, we write all oriented simplexes in this ordering. Then, 
767 for uEHom(Cp(K), A) and vEHom(Cq(K), A), we define U '-' VE Hom(Cp+q(K), A) by (u '-' v)( [a o , a" "', ap+qJ) = u( [a o , al> ... , apJ )v( [a p , a p +1, ..., ap+qJ). The canonical isomorphism from the cohomology of K to the singular cohomology of I KI preserves the cup product. On the de Rham complex :D(M) of a C oo _ manifold M, we have the texterior product W!\BE:DP+q(M) of wE:DP(M) and BE:Dq(M). This makes H*(:D(M)) into a ring, which is calIed the de Rham cohomology ring. The canonical isomorphism of H*(:D(M)) to H*(M; R) preserves the product ( 105 Dif- ferentiable Manifolds). Examples. (1) Let Tn=SI x... X SI denote the n-dimensional torus, and let n i : Tn......SI denote the projection to the ith factor (1.s; i.s; n). Take a generator ( of the A-module HI(SI;A), and put (i=nt«()EHI(Tn;A). Then H*(T n ; A) is the texterior algebra over A gen- erated by (" ..., (n. (2) If we denote by cpn the complex n-dimensional projective space, then Hq(cpn; A) is A if q = 2i (O.s; i.s; n) and 0 otherwise. If  is a generator of the A-module H 2 (cpn; A), then i generates the A-module H 2i (cpn; A) (O.s; i.s; n). Thus H*(cpn; A) is the quotient ring A[(J/(C+ I ) of the tpolynomial ring A[(J by the tideal (("+1). (3) If pn denote the real n-dimensional projective space, then H*(pn;Z2)Z2[(J/(("+1), where  is the gen- erator of HI (pn; Z2)' J. Homology of Product Spaces If C and D are chain complexes, their tensor product C @ D is a chain complex given by (C @ D)n=Lp+q=nCp@ Dq, and an(c @ d)= ap(c) @ d+( -1)Pc @ aq(d) (CE C p , dEDq). The following Eilenberg-Zilber theorem (Amer. J. Math., 75 (1953) is the link between the alge- bra of tensor products and the geometry of product spaces: For the product space X x Y of topological spaces X and Y, there is an isomorphism p*: H*(X x Y; G) H*(S(X)@ S(Y)@ G) induced from a chain mapping p: S(X x Y)-->S(X) @ S(Y) defined as follows: Given a singular n-simplex O":n-->x x Y, we define for each p (O.s; p.s; n) a singular p- simplex O" in X to be the composite Mn X x YX, where E(t o , t l ,..., tp)=(to,t"..., t p' 0, ... ,0) and n I is the projection to the first factor. Similarly, we define for each q (O.s; q.s; n) a singular q-simplex 0"; in Y to be the composite M"::'nx x Y Y, where /;'(tn-q, ... ,tn) = (0, ...,0, t n - q , ... ,tn) and n 2 is the pro- jection to the second factor. Then p is defined by p(O") = Lp+q=n O" @ 0"; and is called the Alexander-Whitney mapping (Alexander- Whitney map). Given a ring A, a chain mapping w(S(X) @ 201 J Homology Theory A) @ (S(Y) @ A)-->S(X) @ S(Y) @ A is de- fined by J1«c @ ic')@ (d @ icn)) = C @d@;:X"; it induces homomorphisms J1*: Hp(X; A)@ Hq(Y; A)-->Hp+q(S(X) @ S(Y) @ A). The cross product a x bE Hp+q(X x Y;A) of aEHp(X;A) and bE Hq(Y; A) is defined to be p;;1 (J1*(a x b)). If A is a commutative ring with 1, then the cross product defines a A-homomorphism x : Hp(X;A)@AHq(Y;A)......Hp+q(X x Y;A), and satisfies (a x b) x c = a x (b x c), T*(a x b) = (-1 )Pqb x a, U x g)*(a x b) = f*(a) x g*(b), where T: X x Y --> Y x X is the mapping inter- changing factors, and f: X -->X', g: Y --> y' are continuous mappings. If A is a tprincipal ideal domain, there is an exact sequence 0--> I Hp(X;A)@AHq(Y;A)Hn(Xx Y;A) p+q=n --> L TorA(Hp(X; A), Hq(Y; A))-->O, p+q=n-l and this sequence is split ( 200 Homolog- ical Algebra). In particular, ifk is a field we have the folIowing isomorphism of vector spaces: x: I Hp(X;k)Q9Hq(Y;k)Hn(Xx Y;k). p+q=n k This is called the Kiinneth theorem, since the prototype was proved by D. Kiinneth (Math. Ann., 90 (1923); ibid., 91 (1924)). The present form was given by H. Cartan and S. Eilenberg [8]. The tensor product C @ D of cochain com- plexes C and D is defined analogously to that of chain complexes. Given topological spaces X, Y and a ring A, a cochain mapping J1: Hom(S(X), A)@ Hom(S(Y), A)-->Hom(S(X) @ S( Y), A) is defined by (J1(u @ v»)(c@ d) =u(c)v(d), where uEHom(Sp(X), A), VE Hom(Sq( Y), A), c E Ss(X), dE S,( Y), and u(c)v(d) is understood to be 0 if(p,q)#(s,t). We then have the composite HP(X; A)@ Hq(y; A) W+q(Hom(S(X)@S(Y),A))W+q(X x Y;A), where p is the Alexander- Whitney mapping. For (E HP(X; A) and 11 E Hq(y; A), the cross product  x 11 E Hp+q(X x Y; A) is defined to be p* J1*«( @ 11). The cohomology cross product satisfies the properties analogous to the homol- ogy cross product. The cup product and the cohomology cross product are given in terms of each other: ('-' 11 =d*(( x 11), ( x 11 = ni(() '-' ni(I1), where d: X --> X x X is given by d(x) = (x, x), and n l : X x Y--> X, n 2 : X x Y --> Yare projections. We have the following Kiinneth theorem for cohomology: If A is a principal ideal domain and each Hq(X; A) is finitely generated over A, then there is an exact sequence 0--> L W(X;A)@AHq(Y;A)H"(Xx Y;A) p+q=n --> I TorA(W(X; A), Hq(y; A)-->O, p+q=n+l 
201 K Homology Theory and this sequence is split ( 200 Homological Algebra), For EW(X;1\), I1EHq(Y;1\), 'E H'(X; 1\),11' E H'(Y; 1\), the formula «( x 11) '-' «(' x 11') = ( -1 )q,( ('-' (') x (11 '-' 11') holds. There- fore, if k is a field and dimk Hq(X; k) < 00 for each q, then the cohomology ring H*(X x Y; k) is determined by the cohomology rings H*(X;k) and H*(Y;k). Wiber bundles can be considered as gen- eralized product spaces. Let E be the total space of a fiber bundle with base B and fiber F. The following Leray-Hirsch theorem (J. Math. Pures Appl., 29 (1950)) asserts that, under certain conditions, the cohomology of E is additively isomorphic to that of B x F: Let 1\ be a principal ideal domain, and assume that H*(F;1\) is free and finitely generated over 1\. Furthermore, assume that there is a homo- morphism O:H*(F;1\)-->H*(E; 1\) such that the composite H*(F; 1\)!..H*(E; 1\)i H*(p -I (b); 1\) is an isomorphism for each bEB, where p:E--> B is the projection and ib:p-l(b)cE. Then an isomorphism <D: H*(B; 1\) @A H*(F; 1\) H*( E; 1\) is given by <D( ( @ 11) = p* ( '-' 0(11), where (EH*(B;1\), I1EH*(F;1\). A general connection between (co)homol- ogy of E and B x F is given by means of spec- tral sequences ( 148 Fiber Spaces). K. Cap and Slant Products There are other products closely related to the cup product or the cross product that involve cohomology and homology together. Given a topological space X and a ring 1\, the cap product v.-- C E Sp(X) @ 1\ of a co- chain vEHom(Sq(X), i\) and a chain c= LiO"i @ AiESp+q(X)@ 1\ is defined by V,--C=LiO"iO E @ v( O"i 0 EV'i' where O"i are singular (p + q)- simplexes in X, A i E1\, and E:P-->P+q, E':q--> p+q are the mappings used in the definition of cup product. For any u E Hom(Sp(X), 1\), theformula <u,-,v,c)=<u,v.--c) holds. The cap product satisfies a(v.--c)=( -1)p6v.--c+ v.-- ac, and hence it induces the cap product (.-- aE Hp(X; 1\) of a cohomology class E Hq(X; 1\) and a homology class aEHp+q(X; 1\). If 1\ is a commutative ring with 1, then the cap product operation is bilinear and satisfies the following properties: «( '-' (').-- a = ( .-- «('.-- a), f*U*I1.--a) = 11'-- f*(a), 1.-- a = a, «( x 11)'-- (a x b)= (-1 jP(t-q)«(.-- a) x (11'-- b), where EW(X; 1\), ('EW'(X; 1\), I1EHq(y; 1\), aEHs(X; 1\), bE H,(Y; 1\), and f:X --> Y is a continuous mapping. Given topological spaces X, Y and a ring 1\, the slant product wjd E Hom(Sp(X), 1\) of a cochain wEHom(S(X) Q9 S(Y)p+q, 1\) and a chain d= Liri @AiESq(Y) @ 1\ is defined 768 by (wjd)(O") = Li(W(O" @ r;))Ai' where 0" is a singular p-simplex in X, r i are singular q- simplexes in Y, and )'i E 1\. The slant operation satisfies 6(wjd)=(6w)jd-( -1)Pwjad. There- fore, under the identification H*(X x Y; 1\) = H*(Hom(S(X) @ S(Y), 1\)), it induces the slant product (jbE HP(X; 1\) of a cohomology class (E Hp+q(X x Y; i\) and a homology class bE Hq(Y; 1\). For any aEHp(X; 1\), it holds that <(jb,a) = «,a x b). Let G, G' and G" be Abelian groups. Given a homomorphism G' @ G" -->G, we write g'g" for the image of g' @ g" E G' @ G" in G. Then the cap product.-- : Hq(X; G') @ Hp+q(X; G")--> Hp(X; G) can be defined in the same way as before. Similar definitions are valid for the cup product, the cross products for homology and cohomology, and the slant product. L. Relative Homology If C' is a subcomplex of a chain complex C, then we have a chain complex CjC' = {CqjC, 8q}, where CqjC denotes the quotient group and 8q is induced from a q by passing to the quotient. CjC' is called the quotient complex of C by C. A topological pair (X, A) is composed of a topological space X and its subset A. Given a topological pair (X, A) and an Abelian group G, we have the chain complex (S(X)jS(A)) @ G and the cochain complex Hom(S(X)jS(A), G). The homology group of (S(X)jS(A)) @ Gis denoted by H*(X, A; G) and is called the rela- tive singular homology group of X modulo A with coefficients in G or the singular homology group of (X, A) with coefficients in G. The homology group H*(X;G)=H*(X,0;G) is sometimes called the absolute homology group. Similar definitions are made for the coho- mology group H* (X, A; G) of the cochain complex Hom(S(X)jS(A), G). A simplicial pair (K, L) is composed of a simplicial complex K and its subcomplex L, and a CW pair (X, A) is composed of a CW complex X and its subcomplex A. The relative homology group H*(K, L; G) = H*«C(K)jC(L))@ G) of a simplicial pair (K,L) is isomorphic to H*(IKI,ILI;G). For a CW pair (X, A), there is an isomorphism H*(X, A; G) H*(XjA; G). Similar statements hold for the cohomology groups. A continuous mapping f:(X, A)-->(Y, B) of topological pairs is a continuous mapping f: X --> Y such that f(A)c B. If f:(X, A)-->(Y, B) is a continuous mapping, then f#:S(X)-->S(Y) sends S(A) to S(B), and hence f induces homo- morphisms f*: H*(X, A; G)-->H*(Y, B; G) and 
769 f* :H*(Y, B; G)-->H*(X, A; G). Since the prop- erties are analogous, we state them below only for the case of relative homology. The following six properties are fundamen- tal. (i) 1 * = 1. (ii) (g 0f)* = g* 0 f*. (iii) Homo- topy property: Iff, f':(X, A)-->(Y, B) are tho- motopic, then f* = f. (iv) Exactness property: There exists a homology exact sequence of 0* i* )* a* (X, A):. ..-->Hq(A; G)-->Hq(X; G)-->HiX, A; G)--> Hq_dA;G)..., where i:AcX,j:(X,0)c (X, A), and 8* sends the homology class of a cycle of(S(X)jS(A))@ G represented by a chain c E S(X) @ G to the homology class of 8c which is a cycle of S(A) @ G. 8* is called the boundary homomorphism or the connecting homomorphism. (v) Naturality of 8*: For any continuous mapping f:(X, A)-->(Y,B), it holds that 8* 0 f* =UI A)* 08*. (vi) Excision pro- perty: If U is a subset of X such that the clo- sure [j is in Int A, then the excision isomor- phism H*(X - U,A- U;G);:;;;H*(X,A; G) is induced by inclusion. The exactness property extends to the hmology exact sequence of a triple (X, A, B): .  A G  G h G  ...-->Hq( ,B;. )-->HiX,B; )-->Hq(X,A; )--> Mq-dA, B; G).... A couple {X" X 2 } of sub- sets of X is said to be excisive if H*(X l , Xl n X2)H*(XI U X 2 , X 2 ) is induced by inclusion. If {XI, X 2 } is excisive, so is {X 2 , Xd. For ex- ample, if X=IntX I UIntX 2 or if Xl and X 2 are subcomplexes of a CW complex, then {XI' X 2 } is excisive. If {XI' X 2 } and {AI' A 2 } are excisive couples such that Al C Xl and A 2 c X 2 , then we have the relative Mayer- Vietoris exact sequence: ... -->Hq(XI n X 2 , Al n A 2 ; G)-->HiXhAI; G) + H q (X 2 , A 2 ; G)-->Hq(XJ U X 2 , Al U A 2 ; G)-->Hq_l (Xl n X 2 , Al n A 2 ; G)-->.... For the case of relative cohomology, we use terms such as cohomology exact sequence and coboundary homomorphism, correspondingly. The universal coefficient theorems are valid for the relative (co)homology groups. Given a homomorphism G' @ G" -->G, if {A, B} is excisive in X, then the cup product '-': HP(X, A; G')@ Hq(X, B; G")-->Hp+q(X, A U B; G) and the cap products,--, : Hq(X, B; G') @ Hp+q(X, A U B; G")-->Hp(X, A; G) can be de- fined. The product (X, A) x (Y, B) is defined to be the pair (X x Y, A x YU X x B). Given a homomorphism G' @ G" --> G, the homology cross products x : Hp(X, A; G') @ HiY, B; G") -->Hp+q«X, A) x (Y, B); G) and the slant prod- ucts j:W+q«X, A) x (Y, B); G')@ HiY,B; G") -->HP(X, A; G) can be defined. If {A x Y, X x B} is excisive, then the cohomology cross products x :HP(X,A;G')@Hq(Y,B;G")--> Hp+q«X, A) x (Y, B); G) can also be defined. If {A x Y, X x B} is excisive, the Kiinneth theorems are valid for the relative (co)homology groups [9, 10, 11]. 201 M Homology Theory M. Cech Homology Theory Another homology theory is commonly used ".long with the singular theory. The theory was originated by E. tech (Fund. Math., 19 (1932)) and was modified by C. H. Dowker (Ann. Math., (2) 51 (1950)). Given a topological space X and an Abelian group G, the Cech homology group H*(X; G) and the Cech coliomology group H*(X; G) are defined as follows. We take the family of a\1 topen coverings of X directed by trefinement, and we consider the tnerve K (U) of each open covering U, that is, the simplicial complex whose simplexes are finite nonempty subsets of U with nonempty intersection. If U' is a refmement of U, then a simplicial mapping n(U, U'): K(U')-->K(U) is obtained by assign- ing to each U' E U' an element U E U such that U' cU. The induced homomorphisms n(U, U')*: H*(K(U'); G)-->H*(K(U); G) and n(U, U')*: H*(K(U); G)-->H*(K(U'); G) are inde- pendent of the choice of n(U, U'), and we have the tin verse system {H*(K(U); G), n(U, U')*} and the tdirect system {H*(K(U); G), n(U, U')*}. We now define H*(X; G) = lim H*(K(U); G) - <- and H*(X; G) = I!ErH*(K(U); G). A continuous mapping f: X --> Y induces homomorphisms f*: H*(X; G)-->H*(Y; G) and f*: H*(Y; G)-->H*(X; G) as fo\1ows. If [! is an open covering of Y then a simplicial mapping f'!l:K(Fl([!»-->K([!) is defined by f'!J(FI(V» = V (V E [!). The ind uced homomorphisms f'!l': H*(K(F I ([!)); G)-->H*(K([!); G) for a\1 open coverings [! of Y gives rise to f* : H*(X; G)-->H*(Y; G). Similarly f induces f*, Another approach to Cech cohomology theory is ca\1ed the Alexander-Kolmogorov construction (Proc. Nat. Acad. Sci., 21 (1935) and C. R. Acad. Sci. Paris, 202 (1936). The approach was improved by E. H. Spanier (Ann. Math., (2) 49 (1948), and the theory is now called the Alexander (or Alexander- Kolmogorov-Spanier) cohomology theory, The Alexander cohomology group H*(X; G) is defmed as follows. Let <Dq(X; G) be the Abelian group of all functions from the (q + 1)- fold product space xq+1 to G with addition defined pointwise. An element <p E <Dq(X; G) is said to be locally zero if there is an open covering U of X such that <p(xo, ..., x q ) van- ishes if Xo, ..., x q are contained simultaneously in some U E U. The subgroup of <Dq(X; G) con- sisting of locally zero functions is denoted by <Do(X; G). We define a homomorphism 6 q : <Dq(X; G)--> <D q +1 (X; G) by (6 q <p)(xo, XI' ..., xq+d == Li6( -1)i<p(xo, ..., Xi-I' Xi+1,"" Xq+d. Then <D(X; G) = {<Dq(X; G), 6 q } is a cochain complex, and <Do(X; G) == {<Do (X; G), 6 q } is its subcomplex. We now define H*(X; G) to be 
201 N Homology Theory the cohomology group of the quotient com- plex <I> (X ; G) = <D(X; G)/<Do(X; G). If f: X --> Y is a continuous mapping, then a cochain map f# :<D(Y; G)--><D(X; G) is defined by (f# <p)(x o , Xl> ..., x q ) = <p(f(xo),J(xd, "" f(x q )), and it induces the homomorphism f*: H*(Y; G)-->H*(X; G). There is a natural iso- morphism H*(X; G)  ii*(X; G). The (co)homology group of a simplicial complex K is isomorphic to the tech (co)- homology group of IKI. If X is a manifold or a CW complex, then its singular (co)homology group and its tech (co)homology group are isomorphic. However, even for compact met- ric spaces X, the singular (co)homology group of X is not necessarily isomorphic with the tech (co)homology group of X. If {X A} is an tin verse system of compact Hausdorff spaces and X=limX)., then there - <- - are isomorphisms limH*(X A ; G)H*(X; G) - <-- and IH*(XA; G) H*(X; G). This is called the continuity property for Cech theory. If A is any closed subset of a manifold M, then there is an isomorphism lim H*(W; G)  H*(A; G), --> where W varies over neighborhoods of A in M directed downward by inclusion. If the tcover- ing dimension of X is n, Hq(X;G)=O for q>n. The cup product in the tech cohomology is introduced simply by passing to the limit with cup products in simplicial complexes, and the cup product in the Alexander cohomology is induced from the operation '-' : <DP(X; G') @ <Dq(X; G")--><Dp+q(X; G) defined by (<p '-' t/J)(x o , Xl' ..., X p + q ) = <p(X o , Xl, ..., Xp)t/J(x p , X p + l , '"  x p + q ). The relative Cech homology group H*(X, A; G) and the relative Cech cohomology group H*(X, A; G) are defined as follows: An open covering of (X, A) is a pair (U, 91) of an open covering U of X and an open covering 91 of A such that me u. To such a pair (U, 91) we assign a simplicial pair (K(U), K'(91)), where K'(91) is the nerve of m n A = {N n A I N Em}. Considering the family of 311 open cover- ings of (X, A), we define now H*(X, A; G) = IimH*(K(U), K'(91); G) and H*(X, A; G)= <- IH*(K(U), K'(91); G), The relative Alexander cohomology group H*(X, A; G) is defined to be the cohomology group of the cochain complex which is the kernel of the cochain mapping i#: <1>(X; G)--> <I>(A; G) induced by inclusion. There is a nat- ural isomorphism H*(X, A; G)ii*(X, A; G). The relative tech (co)homology groups satisfy the properties analogous to the relative singular (co)homology groups except the exactness property for homology ( Section Q), In certain cases, the excision property is strengthened for tech (co)homology. For example, we have the following theorem: As- sume that X and Yare compact Hausdorff 770 spaces, A and B are closed subsets of X and Y, respectively, and f: (X, A)-->(Y, B) is a continu- ous mapping which maps X - A onto Y - B homeomorphically. Then f*: H*(X, A; G)  H*(Y, B; G) and f* :H*(Y,B; G)H*(X,A; G) hold. N. Fundamental Classes of Manifolds For a topological space X and a point X of X, the local homology groups H*(X, X -x) repre- sent a topological property of X around x. The notion of torientation for differentiable manifolds and triangulable manifolds general- izes to ttopological manifolds by using local homology groups as follows. Let M be an n-dimensional (topological) manifold with tboundary aM. If X is a point of the interior Mo = M - aM, then Hq(M, M - x)  Hq(Rn, R n -0) is Z for q=n and is 0 for q#n. We define a local orientation Ox for M at xEMo to be a choice of one of the two possible generators for Hn(M, M -x), and we then define an orien- tation for M to be a function which assigns to each x E Mo a local orientation Ox which varies continuously with X in the following sense: For each X there should exist a compact neighbor- hood NcM o and an element ONEHn(M,M- N) such that iy.(ON)=Oy for each YEN, where iy:(M, M -N)c(M, M - y). If there is an ori- entation for M, then M is said to be orient- ahle, and the pair of M and an orientation is called an oriented manifold. If M is a nonorient- able manifold without boundary, the set of local orientations for M forms an orientable manifold doubly covering M, called the orien- tation manifold of M. If M is an oriented n-dimensional manifold, then for any compact subset K of M there is a unique element OKEHn(M,(M -K)U aM) such that ix*(OK) = Ox for each XE K n Mo, where ix'(M,(M -K)UaM)c(M, M -x). The ele- ment OK is called the fundamental homol- ogy class around K. In particular, if Mis it- self compact, 0MEHn(M,aM) is usually de- noted by [MJ and is called the fundamental homology class of M. A connected compact n-dimensional manifold M is orientable if and only if Hn(M, aM)#O, and in this case Hn(M, aM) is a free cyclic group generated by a fundamental class [M]. If M is an orientable compact n-dimensional manifold, then aM is an orientable compact (n-l)-dimensional manifold without boundary, and the boundary homomorphism a*: Hn(M, aM)-->Hn-1 (aM) sends a fundamental class [MJ to a funda- mental class [a MJ. An n-dimensional manifold M is orientable if and only if there exists an element U E Hn(M x M, M x M -dM) such that, for each XE Mo, 
771 g(U) is a generator of H"(M, M -x), where dM is the diagonal in M x M, andjx:(M, M -x)-->(M x M, M x M -dM) is given by JAY) =(x,Y) (YEM). In fact, U corresponds to an orientation which assigns to each x E Mo a local orientation Ox such that G:(U), ox) = 1. The element U is called the orientation coho- mology class of M. If M is a compact manifold without boundary, it holds that <d*(U), [M]) = X(M), where d*: H"(M x M, M x M -dM)--> H"(M) is induced by the diagonal mapping. The element d*(U)EHn(M) is called the Euler class of M. If we work with the (co)homology groups with coefficients in Z2, the fundamental classes are defined for an arbitrary manifold without making any assumption of orientability. If M is connected and compact, then Hn(M, aM;Z2)Z2 is generated by [M]. O. Duality in Manifolds Let M be a compact n-dimensional manifold, and let M I , M 2 be compact (n-l)-dimensional manifolds such that M I U M 2 = aM and M l n M 2 =aM I ==aM 2 . Assume either that M is oriented or that G = Z2' Then for each q, an isomorphism D:Hq(M,MI;G)Hn_q(M, M 2 ; G) is defined by D() = ( ,.-{M]. In par- ticular, there are the isomorphisms D: Hq(M, aM;G)Hnq(M; G) and Hq(M; G)Hn-iM, aM; G), where the cap product is taken with respect to the homomorphism G @ Z-->G defined by multiplication. This theorem is called the Poincare-Lefschetz duality theorem, and the special case for aM = 0 is often referred to as Poincare duality. Poincare duality implies the following con- sequences for a compact n-dimensional mani- fold M without boundary. If Mis orientable, then the qth Betti number is equal to the (n- q)th Betti number, and the qth torsion coef- ficients are equal to the (n-q-l)th torsion coefficients. Ifn is odd, then X(M)=O, and if M is orientable and n=2 mod4, then X(M) is even. Poincare duality generalizes to the following duality theorem. Let M be an n-dimensional manifold without boundary, and let K be a compact subset of M. Assume either that M is oriented or that G = Z2' Then there is an iso- morphism D:fiq(K; G)Hn-q(M, M -K; G) for any q, which is given as follows: For each open neighborhood W of K, define Dw: Hq(W; G)--> Hn_q(M, M - K; G) by Dw()=k*(,.-.., k;l (OK))' where k* is the excision isomorphism induced by k:(W, W-K)c(M,M -K). Now D is de- fined to be the limit of Dw, where W varies over open neighborhoods of K. The inverse of D up to sign is given in terms of the slant 201 P Homology Theory product with the fundamental cohomology class U of M as follows: For each open neigh- borhood W of K, we define a homomorphism Yw: Hn-q(M, M - W; G)--> Hq(W; G) by yw(a) = j*(U)ja, wherej*: H"(M x M, M x M -dM)--> Hn(W x (M, M - W») is induced by inclusion. Then, passing to the limit, these Yw define the desired one. If we take an n-sphere sn as M in the above duality theorem and use the homology exact sequence of (sn, sn - K), then we have the fol- lowing Alexander duality theorem (Trans. Amer. Math. Soc., 23 (1922)): If K is a closed subset of sn, then the qth reduced tech coho- mology group of K is isomorphic to the (n- q -1)th reduced singular homology group of sn - K for any coefficient group G and any q. In particular, if K is a tneighborhood retract, fiq(K; G)  fi n - q - I (sn - K; G) holds. This shows that H*(sn-K) depends only on K and not on the way K is embedded in sn. The Alexander duality theorem for n = 2 and K = Sl gives the classical t10rdan curve theorem. In view of the duality theorems, certain classical definitions in the homology of mani- folds can be given in terms of cohomology. For example, iff:M-->M' is a continuous mapping of oriented closed manifolds, then the Umkehr homomorphism or Gysin homomor- phism/,: Hq(M'; G)-->Hq+d(M; G) (d=dimM- dimM') (W. Gysin, Comment. Math. He/v., 14 (1941)) can be defined by Do f' = f* 0 D. In co- homology we have ,fr: Hq(M; G)-->Hq-d(M'; G). Similarly, if M is an oriented n-dimensional closed manifold and a E H p( M), bE H q( M), then the intersection product a' bE Hp+q-n(M) of Lefschetz can be defined by a' b = D -I a ,.-.., b = D(D-Ia '-..'D-Ib), If p+q=n, the number a' bEIIo(M)Z is called the intersection number of a and b. The classical definitions are still meaningful today, since they are closer to geometric intuition and therefore possess con- siderable heuristic value. For example, the following fact serves to compute cup products in manifolds. If M is an oriented closed differ- entiable manifold and a, bE H*(M) are repre- sented by closed submanifolds N" N 2 which intersect ttransversally, then i:.a' b is repre- sented by N l n N 2 [11]. See [12J for a rigor- ous discussion of classical intersection theory. P. Cohomology with Compact Supports Let X be a topological space. A subset V of X is said to be cobounded if X - V is compact. A singular q-cochain uEHom(Sq(X), G) is said to have compact support if there exists a co- bounded set V such that u(a)=O for every singular q-simplex a in V. The singular co- chains with compact support form a subcom- 
201 Q Homology Theory plex of the cochain complex Hom(S(X), G). The cohomology group of this subcomplex is denoted by Hi(X; G) and is called the singu- lar cohomology group of X with compact sup- ports. There is an isomorphism Hc*(X; G)::;:;; IH*(X, V; G), where V varies over co- bounded subsets of X. Let K be a simplicial complex. A q-cochain uEHom(Cq(K), G) is called a finite cochain of K if u(O") = 0 except for a finite number of oriented q-simplexes 0" of K. If K is a tlocally finite simplicial complex, then finite cochains of K form a subcomplex of the cochain com- plex Hom(C(K), G) whose cohomology group is isomorphic to Hi(IKI; G). Let X be a tlocally compact Hausdorff space, and let XU { 00 } denote the tone-point compactification of X. Then the Cech coho- mology group of X with compact supports, denoted by it*(X; G) is defined to be the re- duced tech cohomology group of X U {oo} with coefficients in G. There is an isomorphism Hc*(X; G)::;:;;limH*(X, V; G), where V varies over cobounded subsets of X. If X is a mani- fold or a CW complex, then Hc*(X; G)::;:;; Hi(X; G). If X is a compact Hausdorff space and A is closed in X, then Hi(X - A; G)::;:;; H*(X, A; G). The Alexander-Kolmogorov con- struction gives a direct approach to Hc*(X; G) [10,13]. A tproper continuous mapping f: X ...... Y of locally compact Hausdorff spaces induces homomorphisms f*: Hc*(Y; G)--> Hc* (X; G) and f*: Hi(Y; G)-->Hc*(X; G), and if f, f':X --> Yare properly homotopic, then they induce the same homomorphisms. The cohomology with compact supports is useful in order to extend results in the coho- mology of compact spaces to noncompact spaces. For example, the conclusion of the duality theorem on a compact set K c M in Section 0 generalizes to the case of a closed set K c M as follows: There is an isomorphism H;(K;G)::;:;;Hn_q(M, M -K;G) for any q. This implies the following generalization of Poin- care duality: H;(M; G)::;:;; Hn-q(M; G) holds for an orientable n-dimensional manifold M with- out boundary. There are homology theories associated with the cohomology theories with compact supports [13]. Q. Eilenberg-Steenrod Axioms Let H* be a collection of the following three functions: (1) A function assigning to each topological pair (X, A) and each integer q an Abelian group Hq(X, A). (2) A function assign- ing to each continuous mapping f:(X, A)--> (Y, B) and each integer q a homomorphism 772 (*: Hq(y, B)-->Hq(X, A). (3) A function as- signing to each topological pair (X, A) and each integer q a homomorphism 6*:Hq(A)--> Hq+1 (X, A). Then H* is called a cohomology theory on the category of topological pairs if the following seven axioms are satisfied [7]. (i) 1*= 1, where 1 is identity. (ii) (g 0 f)* = j * 0 g*: Hq(Z, ej-->Hq(X, A) for continuous mappings f:(X, A)......(Y, B) and g:(Y, B)--> (Z, C). (iii) Homotopy axiom: If f, f': (X, A)--> (Y, B) are homotopic, then f* == j : Hq( Y, B)...... Hq(X, A). (iv) Exactness axiom: The sequence <5* "* "'" 0* "* ... ......Hq(X, A)!.....Hq(X)!.....Hq(A)-->Hq+1 (X, A)!........ is exact, where i: AcX andj:(X, 0)c(X, A). (v) f* 06* = 6* oUI A)* : Hq(B)-->Hq+I(X, A) for a continuous mapping f: (X, A)-->(Y, B). (vi) Excision axiom: If U is an open set of X such that (J c Int A, then i*: Hq(X, A)::;:;; Hq(X - U, A - U), where i is the inclusion. (vii) Dimension axiom: W(pt)=O if q#O. Axioms (i)-(vii) are called the Eilenberg-Steenrod axioms, and the group HO(pt) is called the coefficient group of the cohomology theory H*. A cohomology theory on the category of pairs of compact Hausdorff spaces is defined similarly. A cohomology theory on the cate- gory of CW pairs (or finite CW pairs) is de- fined similarly except that axiom (vi) is re- placed by the following excision axiom: If {Xl, Xl} is a couple of subcomplexes of a CW complex, then i*:Hq(X l UXl,Xl)::;:;;Hq(X I , Xl n X 2 ), where i is the inclusion, Two coho- mology theories H* and H'* on the same category are isomorphic if there is an isomor- phism hq:Hq(X,A)::;:;;H,q(X, A) for each (X, A) and each q, and they commute with j * and 6*. A homology theory on various categories is defined similarly by dualization. A singular (co)homology theory with coeffi- cients in G is an example of a (co)homology theory on the category of topological pairs. The tech cohomology groups with coefficients in G can be made into a cohomology theory on the category of topological pairs. However, the tech homology groups do not constitute a homology theory on the category of topolog- ical pairs; the homology sequence of any pair (X, A) is defined, but it can be proved only that the composite of any two successive homomorphisms is zero. The tech homology groups with coefficients in a field constitute a homology theory on the category of compact Hausdorff pairs. The Alexander cohomology groups constitute a cohomology theory on the same category of topological pairs, and it is isomorphic to the tech cohomology theory if their coefficient groups are isomorphic [10]. The tech (co)homology constitutes a (co)- homology theory on the category of CW pairs, and it is isomorphic to the singular (co)homology theory on the same category if 
773 the coefficient groups are isomorphic. (Co)- homology theories on the category of finite CW pairs are determined, up to isomorphisms, by their coefficient groups. This fact is called the uniqueness theorem of homology theory on the category of finite CW pairs. Coho- mology theories on the category of pairs of compact Hausdorff spaces which satisfy the following continuity axiom are determined, up to isomorphisms, by their coefficient groups: If {(X A' Ai,)} is an inverse system of pairs of com- pact Hausdorff spaces, then Hq(limX A ,limA A ) 1Hq(XA' Ai,). The tech coho'mologythe- ory satisfies this axiom. During recent years, many (co)homology theories have been developed which satisfy the first six Eilenberg-Steenrod axioms but fail to satisfy the dimension axiom. These are called generalized (co)homology theories, and include various tK-theories, tbordism theories, and tstable homotopy theories ( 202 Homotopy Theory). R. Homology with Coefficients in Local Systems N. E. Steenrod (Ann. Math., (2) 44 (1943») in- troduced the (co)homology group with coef- ficients in a local system of Abelian groups, which is useful in tobstruction theory and in the homology theory of tfiber spaces, A local system ffi of Abelian groups on a topological space X is a set of Abelian groups G x , one for each x E X, together with an iso- morphism 1*: G[(O)-->G[(l) for each tpath I: [0, IJ -->X subject to the foIlowing conditions: (1) If two paths 1 and I' are homotopic with endpoints fixed, then 1* = 1'*. (2) If 1 and m are paths such that /(1) = m(O), then (I' m)* = m* 0 1*, where I. m denotes the tproduct of 1 and m. An example is provided by the thomo- topy groups nn(X,x) for n?2. Let M be an n-dimensional topological manifold. Then x...... Hn(M, M -x) is a local system of infinite cyclic groups. It is called the orientation sheaf of M.'A local system ffi is said to be trivial if 1* == 1'* for any paths I, I' with the same initial and final points. Given a local system (f) of Abelian groups on a topological space X, a chain complex S(X; ffi) = {Sq(X; ffi), (\} is defined as follows: If q <0, then SiX; ffi)= 0, and if q?O, then Sq(X; ffi) is the Abelian group of formal finite sums LgaO", where O":q......X are singular q- simplexes in X and gaEGa(1,O, ,0); the bound- ary operator Oq: Sq(X; ffi)-->Sq-l (X; (5) is given by (\(gaO")=I:(ga)0"080+LYI( -1)igaO"oF. i , where (J 0 Ej is the ith face of (J, and fa: [0, I J ......X is given by la(t) = 0"( 1 - t, t, 0, ... , 0). The homology group of the chain complex S(X; (5) 201 Ref. Homology Theory is denoted by H*(X; (5) and is called the sin- gular homology group of X with coefficients in ffi. Similarly, a cochain complex S*(X; ffi) = {sq(X; (5), b q } is defined as follows: If q < 0, then sq(X; ffi)= 0, and if q? 0, then sq(X; ffi) is the Abelian group of functions u assigning to every singular q-simplex 0" in X an element U(O")EG a (1,O, ,0); the coboundary operator bq:Sq(X; ffi)-->Sq+I(X; ffi) for q?O is given by WU)(O") = 1:- 1 u(O" 0 8 0 ) + LY':+? ( -liu(O"o 8;), where 0" is a singular (q+ I)-simplex in X. The cohomology group of the cochain complex S*(X; ffi) is denoted by H*(X; ffi) and is called the singular cohomology group of X with coef- ficients in ffi. If ffi is trivial, then the (co)homology group with coefficients in ffi coincides with the (co)- homology group with coefficients in G  G x ' The various notions and theorems on the ordinary (co)homology can be extended to (co)homology with coefficients in ffi. The tech (co)homology group with coefficients in ffi is also defined [10]. The cohomology groups with coefficients in ffi are generalized to the cohomology groups with coefficients in a sheaf [10, 14J ( 383 Sheaves). References [IJ H. Poincare, Analysis situs, J. Ecole Poly tech., 1 (1985), 1-121; Rend. Circ. Mat. Palermo, 13 (1899), 285-343. [2J O. Veblen, Analysis situs, Amer. Math. Soc. Colloq. Pub\., 1922. [3J S. Lefschetz, Topology, Amer. Math. Soc. Colloq. Pub\., 1930. [4J H, Seifert and W. Threlfall, Lehrbuch der Topologie, Teubner, 1934 (Academic Press, 1980). [5J p, S. Aleksandrov and H. Hopf, Topologie I, Springer, 1935 (Chelsea, 1965). [6J S. Lefschetz, Algebraic topology, Amer. Math. Soc. Colloq. Pub\., 1942. [7J S. Eilenberg and N. E. Steenrod: Founda- tions of algebraic topology, Princeton Univ. Press, 1952. [8J H. Cartan and S. Eilenberg, Homological algebra, Princeton Univ. Press, 1956. [9J P. J. Hilton and S. Wylie, Homology theory, Cambridge Univ. Press, 1960. [10J E. H. Spanier, Algebraic topology, McGraw-HilI, 1966. [IIJ A. Dold, Lectures on algebraic topology, Springer, 1972. [12J M. Glezerman and L. S. Pontryagin, Intersections in manifolds, Amer. Math. Soc. Trans\., 50 (1951), (Original in Russian, 1947.) [13J W. S. Massey, Homology and coho- mology theory, Dekker, 1978. 
202 A Homotopy Theory [14J R. Godement, Topologie algebrique et theorie des faisceaux, Hermann, 1958. [15] S. Lefschetz, Introduction to topology, Princeton Univ. Press, 1949. [16] 1. G. Hocking and G. Young, Topology, Addison Wesley, 1961. [17] D. G. Bourgin, Modern algebraic topol- ogy, Macmillan, 1963. [18] H, Schubert, Topologie, Teubner, 1964 (Allyn and Bacon, 1968). [19J S. T. Hu, Homology theory, Holden-Day, 1966, [20] M. J. Greenberg, Lectures on algebraic topology, Benjamin, 1967. [21] G, E. Cooke and R. L. Finney, Homology of cell complexes, Princeton Univ. Press, 1967. [22] C. R. F. Maunder, Algebraic topology, Van Nostrand, 1970. [23] A. H. Wallace, Algebraic topology, Ben- jamin, 1970. [24] C. Godbillon, Elements de topo1ogie algebrique, Hermann, 1971. [25] 1. Vick, Homology theory, Academic Press, 1973. [26] J. W. Milnor and 1. D. Stasheff, Charac- teristic classes, Princeton Univ. Press, 1974. [27] W. S. Massey, Singular homology theory, Springer, 1980. 202 (IX.g) Homotopy Theory A. General Remarks Given a topological space X, we utilize the concept of homotopy to define the tfunda- mental group, homotopy groups, and co- homotopy groups of X. These groups, to- gether with (co)homology groups, are useful tools in topology. Since the research of H. Hopf, W. Hurewicz, and H. Freudenthal in the 1930s, homotopy theory has made rapid progress and now plays an important role in topology. B. Homotopy If a family j;: X --> Y (tE I = {t I O.s; t.s; I}) of tcontinuous mappings from a ttopological space X into a topological space Y is also continuous with respect to t, that is, if the mapping F from the product space X x I into Y defined by F(x, t) = j,(x) (x EX, tEl) is con- tinuous, then {j,} or F is called a homotopy. In this case, fo and f1 are said to be homotopic. This relation between fo and f1 is indicated by fo  [I: X --> Y, Or simply fo  fl, and is called 774 the relation of homotopy. Denote by yX the set of all continuous mappings from X into Y. The homotopy relation is an tequivalence relation on yX, and the equivalence class [fJ of a mapping f: X --> Y is called the homotopy class (or mapping class) of f The set of all homotopy classes of mappings of X into Y is called the homotopy set and is denoted by n(X; Y) or [X, Y]. A function y of continuous mappings fE Yx is called a homotopy invariant if fg implies y(/) ==y(g). When X consists of a point * we write n( *; Y) = no( Y). If all con- tinuous mappings in Yx are homotopic to each other, we write n(X; Y) = 0; no(Y) = 0 means that Y is tarcwise connected, A mapping f from a compact space into an n-dimensional sphere sn is called essential if any mapping 9 homotopic to f satisfies g(X) = sn. A mapping is inessential if and only if it is homotopic to the constant mapping. These concepts are generalized as follows: Let Ai and B i (i = 1,2,...) be subspaces of X and Y, respectively, and denote by Yx (AI' A 2 , ...; 8 1 ,8 2 ,,,,) the set of continuous mappings fE Yx satisfying f(A;) cB i . If a homotopy {j,} is such that j,E YX(A i ; B i ), then {,fr} is called a restricted homotopy with respect to Ai, B i or a homotopy from a system of spaces (X, AI, Az, ...) into a system of spaces (Y,B"B 2 , ...). The notation fo fl :(X, AI, A2' ... )-->(Y, B I , B z ,,,,) and the homotopy set n(X, A" A2' .,,; Y, BI' Bz,...) are defined accordingly. For the composite gofEZx(A i ; C i ) of fE YX(A i ; B i ) and gEZY(B i ; Ci),ff' and gg' imply gof  g' of'. Thus the composite fJoex=[gofJEn(X,Ai;Z, C;) of [fJ =exE n(X,A i ; Y,B i ) and [gJ=fJEn(Y,Bi;Z,C i ) is defined. By putting g*[fJ = [go fJ = f*[gJ we induce two mappings, g*:n(X,A i ; Y,B;)......n(X, Ai;Z, C i ), f*: n(Y, B i ; Z, Ci)-->n(X, Ai; Z, q. Then ff' implies f* = f'* and gg' implies g* ==g. Also (go f)* =g* 0 f*, (gof)* == f* 0 g*, and h* 0 g* = g* 0 h*, where hEXw(D i ; A;). The category of pointed topological spaces is defined to be the tcategory in which each object X, which is a topological space, has a point fixed as a base point and each mapping X --> Y carries the base point of X to the base point of Y. In this category, we define a homo- topy set, denoted by n(X; Y)o or [X; YJo, as follows: Denoting the base points by *, we have n(X,Ai'*; Y,B;,*)=n(X,A i ; Y,Bi)o. A continuous mapping f homotopic to the constant mapping X --> * E Y is said to be homotopic to zero (or null-homotopic). This is indicated by f  0, and n(X; Y)o == 0 means that all continuous mappings are homotopic to zero. Let SO be a set of two points; then 
775 n(X;So)o=O means that X is tconnected. In contrast to these specific homo to pies, the usual homotopy is sometimes called a free homotopy. Suppose that a homotopy {fr} (fr: X --> Y) is such that the restriction of fr to a subspace A of X is stationary, that is, fr(a) = fo(a) (aE A, tEl). Then fo, f1 are said to be homotopic relative to A, indicated by fo -:::0:. f1 (reI. A). If a homotopy {fr} (fr: X --> Y) is such that each fr is a homeomorphism into Y, then {fr} is called an isotopy and fo is called isotopic to f1 ( 235 Knot Theory). Research done by L. E. Brouwer, H, Hopf, W, Hurewicz, K. Borsuk, L. S. pontryagin, and S. Eilenberg has contributed to the theory of homotopy, an important field of topology still in the process of development. C. Mapping Spaces We endow the set Yx of all continuous map- pings f: X --> Y, with the tcompact-open topol- ogy, The topological space yX is called a mapping space. In particular we denote y J (0, 1; * , * ) ( * E Y) by Q( Y) = Q( Y, * ) and call it the space of closed paths (or loop space) of Y. Two points 1, 9 of Yx are connected by a tpath in Yx if and only if f -:::o:.g:X --> Y. Thus no(Yx) =n(X; Y) and no(yX(A i ; B;))=n(X, Ai; Y,B;). D. Retracts Let A be a subspace of a topological space X. If there exists an f E AX such that the restric- tion f I A is the identity mapping of A, then A is called a retract of X, and f a retraction. If A is a retract of X, any continuous mapping of A into any topological space can be extended to a continuous mapping of X. If A is a retract of some neighborhood U(A), A is called a neighborhood retract or NR of X. If for any thomeomorphism of a metric space A onto a closed subspace Ao of any metric space X, Ao is a retract (neighborhood retract) of X, then A is called an absolute retract or AR (absolute neighborhood retract or ANR). For example, an n-dimensional simplex or an n-dimensional Euclidean space is an AR. If a retraction f is homotopic to the identity mapping of X (resp. U(A»), we call A a deformation retract (neigh- borhood deformation retract) of X. Moreover, if f -:::0:.1 x (reI. A), then A is called a strong de- formation retract. In particular, if a point Xo is a (strong) deformation retract of X, we say that X is contractible to the point Xo. For example, any tpolyhedron P and any com- pact n-dimensional ttopological manifold are ANRs; any polyhedron Po contained in P in a strong deformation retract of some neighbor- 202 F Homotopy Theory hood in P. X is called locally contractible if each point x of X has a contractible neighbor- hood U of x. E. The Extension Property Let X, Y be topological spaces, A c X,fo, f1 E y X , and {g,: A--> Y} a homotopy such that gi = /; I A (i =0,1). We can extend {g,} to a homotopy {fr} of X if and only if the mapping F:(X x O)U(A x I)U(X x 1)--> Y defined by F(x, i) = fi(X), F(a, t) = g,(a) can be extended to a continuous mapping sending X x I into Y. Therefore the problem of whether f o -:::o:.f1 can be reduced to the problem of whether a con- tinuous mapping defined on a subspace can be extended to the whole space. If for any homo- topy {g,:A--> Y} and any continuous mapping fo: X --> Y into any topological space Y satisfy- ing fo I A = go there exists a homotopy {fr: X --> Y} satisfying fr I A = g" then we say that (X, A) has the homotopy extension property. This occurs if and only if (X x 0) U (A x I) is a re- tract of X x I. A pair (X, A) of ANRs, where A is closed in X, and a pair (P, Po) with P a tCW complex and Po a subcomplex of P have this property. Given a continuous mapping h: B-->A of a subspace B of a topological space Y into a topological space A, we identify bE B with h(b)EA in the tdirect sum AU Yand obtain the tidentification space denoted by A U h Y, which is called an attaching space under h. If (Y, B) has the homotopy extension property, then (Yx X,B x X) and (AU h Y,A) also have the same property. When A consists of a point *, we write * U h Y = Y/B and call the space Y/B a space smashing (shrinking or pinching) B to a point. If Y = B x I, B= B x 0, then we call A Uh(B x I) a mapping cylinder of h, (A Uh(B x I))/(B x I) a mapping cone of h, and the mapping cylinder and mapping cone of h: B--> * the cone over B and suspension of B, respecti vel y. F. Homotopy Type For systems (X, AJ, (Y, B;) of topological spaces, ifthere exist fE YX(A i ; B;), gEXY(B i ; Ai) such that go f and fog are homotopic to the identity mappings of (X, AJ and (Y, B;), respec- tively, then we say that (X, Ai) and (Y, B i ) have the same homotopy type or are homotopy equivalent. Such mappings f and 9 are cal\ed homotopy equivalences. For a homotopy equivalence f, the induced mappings f* and f* are bijective. Therefore, in homotopy theory, systems of spaces having the same homotopy type are considered equivalent. If A is a deformation retract of X, then A and X have the same homotopy type, and the 
202 G Homotopy Theory injection of A into X and the retraction of X onto A are homotopy equivalences. A con- tractible space has the same homotopy type as a point. Spaces having the same homo- topy type have isomorphic homotopy groups and t(co)homology groups. Since the mapping cylinder Zf = YU f(X x I) of f E Yx contains Y a its deformation retract, it has the same homotopy type as Y. By this homotopy equiva- lence, f can be replaced by the injection of X x 1 into Zf' If to each topological space there corresponds a value (which may be some ele- ment of R or some algebraic structure) and the values are the same for homotopy equiva- lent spaces, then the value is called a homotopy type invariant. A homotopy type invariant is a ttopological invariant; for example, n(X; Y) is a homotopy type invariant of X. If a continu- ous mapping f: X --> Y induces isomorphisms of the homotopy groups of each tarcwise Con- nected component, then f is called a weak homotopy equivalence. Conversely, if X and Y are CW complexes, then a weak homotopy equivalence is a homotopy equivalence (J. H. C. Whitehead). Now we consider the category of pointed topological spaces. Let A and B be pointed topological spaces. Then the tdirect sum in this category is the one-point union (or bouquet) A v B obtained from the disjoint union AU B by identifying two base points * A and *B' A v B is identified with the subspace (A x *B)U(* A x B) in A x B. The reduced join (or smash product) of A, B is the space obtained from A x B by smashing its subspace A vB to a point and is denoted by A /\ B. We call A /\ SI the (reduced) suspension of A and de- note it by SA. Repeating the suspension n times, we have the n-fold reduced suspension of A. We call CA=A/\I (1=[0, 1]) the rednced cone of A (l has the base point 1). For a Con- tinuous mapping f: X --> Y, the space obtained by identifying each point (x, 0) of the base of CX with f(X)E Y is called the reduced mapping cone and is denoted by C f = Y U f Cx. The reduced join (or smash product) of mappings f: Y --> X and f': Y' --> X' is the mapping f /\ f': Y /\ Y'-->X /\X' induced from the product mapping f x f': Y x Y' --> X x X'. The reduced join of f: Y -->X and 1: SI-->SI (identity map- ping) is written as Sf = f /\ 1 and is called the suspension of f G. Puppe Exact Sequences For f:X --> Y and g: Y -->Z, we have 9 of O if and only if 9 can be extended to a continuous mapping from C f into Z. In other words, the sequence i* f* n(C f ; Z)o-->n(Y; Z)o......n(X; Z)o 776 is exact (i.e., Imi* = Ker f* = f*-I(O), where i: Y -->C f is the canonical inclusion and 0 is the class of the constant mapping). The inclusion i: Y -->C f gives rise to the reduced mapping cone C i . We also have the canonical inclusion i': Cf-->C i . Adding the term n(C i : Z Yo!:: to the left-hand side of the sequence above, we have a new exact sequence. Continuing this process, we obtain an exact sequence of infinite length. If X, Y satisfy a suitable condition (e.g., X, Y are CW complexes), then C i has the same homotopy type as the reduced suspension SX of X; i'* is equivalent to p*:n(SX;Z)o...... n(C f ; Z)o induced by a mapping p: Cf-->SX smashing Y to a point; and furthermore C p has the same homotopy type as SY, and the inclu- sion io: SX -->C p is equivalent to the suspension Sf: sx -->S Y of f Thus the following Puppe exact sequence is obtained: Sp* 5i* Sf* .. ,---->n(SC f ; Z)o---->n(SY; Z)o-+n(SX; Z)o p* i* f* -->n(C f ; Z)o-->n(Y; Z)o-->n(X; Z)o. In this exact sequence, if Y is a CW complex, X is a subcomplex of Y, and f is the inclusion i: X --> Y, then C i = C f = YU C x is homotopy equivalent to the space CjCX = YjX obtained by smashing CX to a point, and an exact sequence of the following type is obtained: Si* b* p* ... -->n(SX; Z)o-->n(YjX; Z)o-->n(Y; Z)o .!:'.n(X;Z)o. A sequence equivalent to the sequence X!.. Y.!...C f is called a cofibering, for which a similar exact sequence is obtained. For a con- tinuous mapping f: X --> Y, consider the sub- space Ef = {(x, <p) I f(x) = <p(0)} of the product space X x yI, By identifying X with {(x, <Px)1 <Px(l) = f(x)}, we can regard X as a deforma- tion retract of Ef' By putting PI (x, <p) = <p(l), we obtain a tfiber space (Ef,PI, Y). The fiber 7f = p1 (*) is called a mapping track of f Using the tcovering homotopy property, we see that the sequence n(W; Tf)On(W; X)on(W; Y)o is exact, where p(x, <p)=x. This sequence is also extended infinitely to the left as A x (Of). A Y i. P. ...-->n(W;., )o----->n(W;., )o-->(W; 7f)0--> , where i is the inclusion of the loop space 0 Y into Tf and Of: ox -->OY is the correspon- dence of the loops induced from f H. Homotopy Sets that Form Groups If X = SX' or Y = 0 Y' (or, generally, if Y is a thomotopy associative tH-space having a thomotopy inverse), then n(X; Y)o forms a group. In the general case the product of the 
777 loops induces the product of n(X; QY')o' We represent a point of SX by (x, t) (x E X, tEl) and define the mapping Qog: X --+Q Y for each g:SX --> Y by Qog(x)(t)=g(x,t). Hence an isomorphism Qo: n(SX; Y)o  n(X; Q Y)o is obtained. Each of the following pairs of homo- morphisms is equivalent: f* : n( SX; Y)o--> n(SX; Y')o and Qf*: n(X; QY)o-->n(X; QY')o; and h*:n(X';QY)o-->n(X;QY)o and Sh*: n(SX'; Y)o-->n(SX; no. If sn is an n-dimensional sphere, then nn(X) = n(sn; X)o is the n-dimensional homotopy group ( Section J). Let nn(x)= n(X; sn)o. If X is a CW complex of dimension less than 2n -1, nn(x) is the cohomotopy group isomor- phic to n(X;Qsn+1)o ( Section I). Let Kn be an tEilenberg-MacLane space of type (II, n). Then we have Kn = QK n + l , and if (X, A) is a pair of CW complexes, then n(X / A; Kn)o coin- cides with the cohomology group Hn(x, A; II). For the tclassifying space Bo(Bu) of the infinite orthogonal group 0 (infinite unitary group U) ( Section V), n( X/A; Bo) (n( X/A; Bu)) may be considered the KO-group KO(X, A) (K-group K(X,A)) (237 K-Theory). I. Cohomotopy Groups K. Borsuk defined a sum of mapping classes of X into sn (1936), which was named Borsuk's cohomotopy group by E. Spanier. Spanier also studied the duality of the cohomotopy group with the homotopy group and its relations to the usual cohomology groups. A cohomotopy group of (X, A) is defined to be nn(x, A) = n(X,A;sn, *), which forms a group if dim X/A <2n-1. A mapping F:X/A-->sn x sn given by F(x)=(/(x),g(x)) with f, g:X/A-->sn is homotopic to a mapping into sn v sn. If we compose F with a folding mapping of sn v sn onto sn, we obtain a mapping that repre- sents the sum [n + [g]. With each homo- topy class of a continuous mapping f of an n-dimensional tpolyhedron K n into an n- dimensional sphere sn, we associate the image f*(u) of the fundamental class u EHn(sn; Z) under the induced homomorphism f*: Hn(sn; Z)-->H"(K n ; Z). We then obtain a bijec- tive relation nn(Kn)...... Hn(K n ; Z), called Hopf's classification theorem. J. Homotopy Groups Let X be a topological space with a base point * , [" = {t = (t I' t 2' ... , tn) I O.s; t I' t 2' ... , tn.s; 1} be the unit n-cube, and jn its boundary. Write Qn(x, *) =xl"(in, *) (in particular, Ql(X, *) is the loop space), and denote by nn(X, *) or simply nn(X) the set of arcwise connected components of Qn(x, *), i.e., the homotopy 202K Homotopy Theory classes [f]. Using the notation of homotopy sets, we have nn(X, *) = n(1", i"; X, *), If we choose the constant mapping as the base point * ofQn(x,*), then Qm(Qn(x,*),*)= Qm+n(X, *). Thus nm(Qn(x, *), *) = nm+n(X, *). Since n l is the tfundamental group, nn(X, *) = nl(Qn-l(x), *) is also a group, called the n- dimensional homotopy group of X with base point *. "Multiplication" in homotopy groups is defined as follows: Given f" f2 EQn(x, *) we define fl + f2 EQn(x, *) by { fl(2t l , t 2 ,... ,tn), O.s; tl.s;t, Ul + f2)(t) = f 2 1 l,,::: ,,::: 1 2 ( t 1 - , t 2, . .. , t n ), 2 -....::::: t 1 -....::::: (Fig. 1). Then the product or sum of [flJ and [f2J is given by [fl + f2]. The identity is the class of the constant mapping (denoted by 0), and the inverse of [n is [fJ, represented by .1(t) = f(l- t l , t 2 , ..., tn)' The space Qn(x, *) is an I H-space, where multiplication is given by the correspondence (/1,12)--> fl + f2' Since the fundamental group of an H-space is commuta- tive, nn(X, *) is an Abelian group for n? 2, I, I, II I, I,, I II o 1/2 Fig. 1 Fig. 2 Let sn={t=(tj>...,tn+dILt?= I} be the n- sphere, and take * = (1, 0, ...,0) as its base point. Suppose that we are given a continuous mapping o/n:(1", jn)--> (sn, *) such that o/n: ["- jn --> sn - * is homeomorphic. Then the corre- spondence 0/:: n(sn; X)o-->nn(X, *) determined by 0/: [gJ = [g 0 o/nJ is bijective. Thus we can identify the homotopy group nn(X, *) with n(sn; X)o. K. Relative Homotopy Groups Suppose that we are given a topological space X and a subspace A of X sharing the same base point * . Identify ["-I with the face t n = o of 1", and let I n - l be the closure of jn_ 1"-1 (Fig. 2). Denote by nn(X, A, *) the set of homotopy classes of continuous mappings f: (1", jn, In-l)-->(X, A, *). Let Qn(x, A, *) be the mapping space consisting of such map- pings!, and Jet nn(X, A, *)= no(Qn(X,A, *)). Since Qm(Qn(x, A, *), *) is homeomorphic to Qm+n(x, A, *), we have nm(Qn(x, A, *), *)  nm+n(X, A, *). Thus nn(X, A, *) is a group for n? 2 and an Abelian group for n ? 3. This group is called the n-dimensional relative 
202 L Homotopy Theory homotopy group of(X,A) with respect to the base point *, or simply the n-dimensional homotopy group of (X, A). In the same man- ner as in Section J multiplication in this group can be defined using fl + f2' Since Qn(x, *, *) and Qn(x, *) are identical, we have nn(X, *, *) = nn(X, *). Hence homotopy groups are spe- cial cases of relative homotopy groups. Let g: (X, A, * )-->( Y, B, * ) be a continu- ous mapping. Then a correspondence g*: nn(X, A, * )-->nn(Y' B, *) is obtained by g*[f] = [gof], with g* a homomorphism of homo- topy groups for n? 2 and for n= 1, A = *. We call g* the homomorphism induced by g. Let En={t=(t l , ...,tn)ILt;= I} be the unit n-cell with boundary sn-I, Utilizing a suitable rela- tive homeomorphism t/J :(1", j"-l)-->(E n , *), t/J(jn)=sn-I, we obtain a one-to-one corre- spondence t/J*: n(E n , sn-I; X, A)o-->nn(X, A, *), and Qn(x, A, *) is homeomorphic (via t/J) to the mapping space XE"(sn-l, *; A, *). L. Homotopy Exact Sequences Given an element ex = [f] E nn(X, A, *), and letting aex = [[I ["-IJ Enn-l (A, *), we obtain a homomorphism (n? 2) a:nn(X, A, *)--> nn-I (A, *), which is calIed the boundary homomorphism. Furthermore, we have the following exact sequence involving homomor- phisms i*,j* induced by two inclusions i:(A, *) -->(X, * ),j:(X, *, * )-->(X, A, *): iJ ) i. X j, X A ... -->nn(A, * -->nn( ,* )-->nn( , ,*) (; j* (} ( i. ) -->... -->ndX, * )......ndX, A, * )-->no A)-->no(X . This sequence is called the homotopy exact sequence of the pair (X, A). A system of topo- logical spaces X ::::> A ::::> B'3 * is calIed a triple. In this homotopy exact sequence, if we replace (A, *), (X, *) by (A, B, *), (X, B, *), respectively, we obtain an exact sequence, calIed the homo- topy exact sequence of the triple (X, A, B). The homotopy group nn(A x B) of the prod- uct space is isomorphic to the direct sum nn(A)+nn(B), and the projections p(p'): A x B -->A(B) of the product space induce the pro- jections from nn(A x B) onto the direct sum- mands nn(A), nn(B). This is a special case of the tHurewicz-Steenrod isomorphism theorem in fiber spaces ( 148 Fiber Spaces). Setting A v B=(A x * )U( * x B), we obtain a direct sum decomposition nn(A v B)nn(A)+nn(B)+ nn+1 (A x B, A v B). Next we consider a fixed pair (X, A) and move the base point * to inves- tigate its effect on the elements of the homo- topy group. Suppose that we are given a path h: I -->A with terminal point * = h(l) and an element exEnn(X, A, *) (ex = [J]J:(1",in, jn-I) -->(X, A, *)). By the homotopy extension prop- erty, we can construct a homotopy fo: ([",in) 778 -->(X, A) satisfying fo(1"-I) = h(B) and fl = f Then the homotopy class [foJ of j with re- spect to the base point *' = h(O) is determined only by ex and the homotopy class w of the path h. We denote the homotopy class [foJ by ex'" Enn(X, A, * '). The correspondence ex-->ex'" is a group isomorphism, and (ex"')"" = ex"'''''. Thus if A is arcwise connected, nn(X, A, *) is isomor- phic to nn(X, A, * '), Hence, in this case, we may simply write nn(X, A) instead of nn(X, A, *). When * == *', the correspondence ex-->ex'" deter- mines the action of the group n l (A, *) on nn(X,A, *). Given an element exEnn(X, *) and a class w of paths in X, we define ex"'Enn(X, * ') as for relative homotopy, Specifically, if WE ndX, *), then ex'" -ex coincides with the White- head product [w,exJ (when n= 1, we have ex"'. ex-I = [w, exJ = wexw -I ex-I) ( Section P). A pair (X, A) consisting of a topological space X and an arcwise connected subspace A of X is said to be n-simple if the operation of ndA) on nn(X, A) is trivial. Similarly, an arc- wise connected space X is called n-simple if the operation of n dX) on nn(X) is trivial. For example, a pair (X, A) consisting of an H-space X and an H-subspace A is simple, i.e., n-simple for each n. If a topological space X satisfies n;( X) == 0 (O.s; i.s; n), then X is said to be n- connected. O-connectedness coincides with arcwise connectedness and I-connectedness means tsimple connectedness. sn is (n-l)- connected, A pair (X, A) is said to be n- connected if no(A)= no(X)= ni(X, A)=O (1.s; i.s; n), and (En, sn-I) is (n -I)-connected. M. Homotopy Groups of Triads Let (X; A, B, *) be a system, caned a triad, of a topological space X and its subspaces A, B satisfying An B'3 * (base point). Let nn(X; A, B, *) = nn-I (Ql (X, B), QI (A, A n B), *) (n? 2); nn(X; A, B, *) is a group for n? 3 and an Abelian group for n ?4. We calI nn(X; A, B, *) the homotopy group of the triad. From the homotopy exact sequence of the pair, we ob- tain the following homotopy exact sequence of the triad: c i* ... -->nj(A, A n B, * )......niX, B, *) )* D i* -->nj(X; A, B, * )-->n j _ 1 (A, A n B, *)-->,... Assume for simplicity that An B is simply connected, X = Int A U Int B (lnt A is the tin- terior of A), (A, A n B) is m-connected, and (B, A n B) is n-connected. Then (X; A, B) is (m + n)-connected, i.e., nj(X; A, B, *) = 0 (2.s; j.s; m + n) (BIakers- Massey theorem). Furthermore, in this case we have a replica of the texcision isomorphism in homology theory for j < m + n; that is, we have the iso- morphism i*:niA, A nB, * )n)X,B, *) in- 
779 duced by the inclusion i:(A, A n B)-->(X, B). On the other hand, nm+n+1 (X; A, B, *) is isomor- phic to nm+1 (A, A n B, *) @ nn+1(B, A n B, *). This shows that the excision isomorphism does not always hold for homotopy groups, an important difference from homology theory. However, if we replace the excision axiom by the H urewicz-Steenrod isomorphism theorem, which is valid for fiber spaces ( 148 Fiber Spaces), then we can construct homotopy theory axiomaticalIy in the same manner as homology theory ( 201 Homology Theory). N. The Hurewicz Isomorphism Theorem The Hurewicz homomorphism r of nn(X, A) into the n-dimensional integral homology group Hn(X,A) is defined by r([fJ)= f*(E n ) (where En is a generator of Hn(In, jn), Then we have the Hurewicz isomorphism theorem: Sup- pose that the pair (X, A) is n-simp1e (e.g., A = *) and (n-l)-connected. Then we have H,(X,A)=O U<n) and the isomorphism r: nn(X,A)Hn(X,A) (for n= I  170 Funda- mental Groups), Let X, Y be simply connected topological spaces, and let f: X --> Y be a con- tinuous mapping. Then the folIowing two con- ditions are equivalent: (1) f*: ni(X)-->n;( Y) is injective for i < n and surjective for i.s; n. (2) f*: Hi(X)-->H;(Y) is injective for i < n and sur- jective for i.s; n (J. H. C. Whitehead's theorem). I.-P. Serre generalized these theorems as folIows: A family  of Abelian groups satisfy- ing condition (i) is calIed a class of Ahelian groups: (i) If a sequence F -->G-->H of Abelian groups is exact and F, H E, then GE. Fur- thermore, we consider the folIowing condi- tions: (ii) The tensor product G @ F of an arbi- trary Abelian group F with an element GE also belongs to. (ii') If both F, GE, then F@ G, Tor(F, G)E(6', (iii) If GE(6', then its thomo1ogy group Hi(G)E (i>O). Condition (ii') is implied by (ii). A homomorphism f: F -->G is calIed -injective if Ker fE,- surjective if Coker f= G/ImfE, and a- isomorphism if f is -injective and - surjective. Two Abelian groups G and G' are called -isomorphic if there exist ((1- isomorphisms f: F -->G and f': F -->G'. In particular, if the class (6'0 consists of only the trivial group 0, then concepts such as (6'0- isomorphism coincide with the usual concepts of isomorphism, and so on. Let p be the class of finite Abelian groups whose orders are relatively prime to a fixed prime number p. Here, instead of the terms p-isomorphism and so on, we use the terms modp isomorphism and so on. Let 9 be a class of finitely gen- erated Abelian groups. Then p satisfies con- ditions (ii) and (iii), and q; satisfies (ii') and (iii). 2020 Homotopy Theory We have the folIowing generalized Hurewicz theorem: (A) Suppose that a class  satisfies (ii) and (iii) and we are given a 2-connected pair (X, A) of simply connected spaces X, A. If ni(X, A)E (i< n), then Hi(X, A)E, and r:nn(X,A)-->Hn(X,A) is a -isomorphism. (B) Suppose that A = *,  satisfies conditions (ii') and (iii), and X is simply connected. Then an assertion similar to (A) holds. In particular, a simply connected space X having finitely generated homology groups (e.g., a simply connected finite polyhedron) has finitely gen- erated homotopy groups. As a corolIary to theorem (A), we obtain a generalized White- head theorem. In particular, applying the theorem to the class q; n p, we obtain the foIlowing frequently used theorem: Suppose that we are given simply connected spaces X, Y whose homology groups are finitely gen- erated and f: X --> Y satisfies f*n2(X) = n 2 (y). Then the folIo wing two conditions are equiva- lent: (1) f*: ni(X) -->n i ( Y) is a mod p isomor- phism for i < n and a mod p surjection for i = n. (2) f* : Hi(X, Zp)-->H,(Y, Zp) is an isomorphism for i <n and a surjection for i=n (where Zp= Z/pZ). The theory above, which makes use of the notion of class , is an example of Serre's -theory. Concepts such as tspectra1 sequences for fiber spaces and tn-connective fiber spaces are important tools in Serre's- theory ( 148 Fiber Spaces). To calculate homotopy groups, we use notions such as exact sequences, fiber spaces, (co)homology groups of n-connective fiber spaces, and tPostnikov systems. Given an arbitrary group (more generaIly, a Postnikov system), there exists a CW complex having the given group (system) as its homotopy group (Postnikov system) (realization theorem of homotopy groups). For an arbitrary arcwise connected topological space X there exist topological spaces (X, n) and continuous map- pings Pn :(X, n+ 1)-->(X, n) (n = 1,2,...) satisfy- ing the foIl owing two conditions: (i) «X, n + 1), Pn' (X, n)) is a fiber space whose fiber is an tEi1enberg-MacLane space. (ii) (X, 1)= X, and «X, n+ 1), PI 0...0 Pn' X) is an n-connective fiber space. The method of obtaining the homotopy group nn(X)Hn«(X,n» by com- puting (co)homo1ogy groups of (X, n) is caIled a killing method. O. Homotopy Operations Let X, Y, X', Y' be topological spaces. If to each continuous mapping fE yx there corre- sponds a homotopy class <I>(f)En(X'; Y') that is a homotopy invariant of f (satisfying a certain naturality condition), then <I> is caIled a homotopy operation. More generaIly, we 
202 P Homotopy Theory may consider the case where <1> is a mapping from n(X I ; Yd x ... x n(X r ; Y,.) into n(X'; Y'). The naturality of <1> is defined as follows: Con- sider the tcategory  of topological spaces (or its subcategory). Let Y = Y' be an arbitrary tobject of, and fix X and X'. In this case, the naturality of<1>y:n(X; Y)-->n(X'; Y) is de- fined to be the commutativity of the diagram: <l>y n(X; Y)->n(X'; Y) 19. 1 q. <l>z n(X; Z)->n(X'; Z) i.e" 9 * 0 <1>y = <1>z 0 9 * for an arbitrary tmor- phism (i.e" continuous mapping) g: Y --> Z of the category . Similarly, when objects Y, Y' of the category  are fixed and X = X' is an arbitrary object of ((i', to say that a homotopy operation <1>x: n(X; Y)-->n(X; Y') is natural means that h* 0 <1>x= <1>w 0 h* for an arbitrary morphism h: W-->X. We have the folIo wing theorem: In the category of topological spaces and continu- ous mappings, the homotopy operations <1>y: n(X; Y)-->n(X'; Y) and the elements of n(X'; X) are in one-to-one correspondence. The corre- spondence is obtained by associating a homo- topy operation <1>(fJ) = fJ 0 ex (fJE n(X; Y)) with each exEn(X'; X). Similarly, the homotopy operations <1>x: n(X; Y)-->n(X; Y') and the ele- ments of n( Y; Y') are in one-to-one corre- spondence, This theorem holds also for the case involving several variables if we consider n(X'; XI v X 2 V ...) or n( Y I x Y 2 X ...; Y') in- stead of n(X'; Y) or n(X; Y'). The theorem remains valid if we replace the spaces X, Y by systems of spaces. P. Homotopy Operations in Homotopy Groups (1) If X, X' are spheres sn, SP with base points and Y, Y' are topological spaces with base points, a homotopy operation <1>y: nn( Y)--> n p ( Y) is said to be of type (n, p). By the theo- rem in Section 0, the homotopy operations of type (n, p) are in one-to-one correspondence with the elements of the homotopy group of the sphere np(sn), (2) As an example of the 2-variable homo- topy operations <1>y: nm(Y) x nn( Y)-->np(Y) of type (m, n; p) we have the Whitehead product defined as folIows: Suppose that exEnm(Y), fJEnn(Y) are elements represented by f:(Im,im) -->(Y, *) and g:(I",in)......(Y, *), respectively, Define a continuous mapping F from the boundary i m + n = (1 m X in) u (i m X I") of I m + n = 1 m X In into Y by F(x, y) = f(x) for (x, Y)Er x in and F(x, y) = g(y) for (x, y) E i m x I". Since jm+n is homeomorphic to Sm+n-I, we can iden- tify them. The homotopy class represented by F is an element of nm+n-I (Y) determined by ex 780 and fJ, denoted by [ex, fJJ and called the White- head product of ex and fJ (J. H. C. Whitehead, Ann. Math" (2) 42 (1941)). The Whitehead product is a homotopy operation of type (m, n; m + n - 1). Let t/lm: (1m, i m )--> (sm, * ) be a mapping that smashes i m to a point. The pro- duct oft/lm and t/ln defines a mapping t/lm,n: sm+n-I-->sm v sn =(sm X * )U( * X sn). Let IE n m ( sm v sn), I' E nn( sm v sn) be the homotopy classes of the natural inclusions of sm, sn into sm v sn; then the homotopy class of t/lm,n is [I, I']. G. W. Whitehead showed that a direct sum decomposition np(sm v sn) = I*np(sm) + Inp(sn)+ [I, I'J*np(sm+n-l) (1*, I, [I, I'J* are injective) holds for 1 <p<m+n+min(m,n)- 3. Furthermore, P. J. Hilton showed that for general p> I, n p(sm V sn) is the direct sum of the images of injections 1*, I, [I,I'J*, [[I, I'J, I J*, [[I, I'J, I'J *' etc. The homotopy operations of type (m, n; p) are in one-to-one correspondence with the elements of np(sm v sn); hence such operations can be constructed by means of composition and the Whitehead product. The last proposition is also valid for homotopy operations of type (m l , ..., m r ; p). The Whitehead product [a, fJJ (exEnm(X), fJE nn(X)) is distributive with respect to ex (resp. fJ) for m> 1 (n> 1), and we have [fJ,exJ=( -lt n [ex, fJJ and f* [ex, fJJ = [f*ex,!*fJJ for f: X --> Y. Moreover, for y E nr(X) the Jacobi identity holds: (-I tr[[ex, fJJ, yJ + (-1 tn[[fJ, y J, exJ + (-I)'"[[y,exJ,fJJ==O (M. Nakaoka and H. Toda; H. Uehara and W. S. Massey; Hilton). Q. Suspensions and Generalized Hopf Invariants We denote by ex/\fJEn(X /\X', Y/\ Y')o the class of the reduced join of f, g, where f represents exEn(X; Y)o and 9 represents {JEn(X'; Y')o. We call ex /\ fJ the reduced join of ex and fJ. In particular, if Y = Y' = Sl, fJ is the identity mapping of Sl, and ex is represented by f, then ex /\ fJ is calIed the suspension of ex and is de- noted by Sex. Sex is the class of the suspension Sf of f and belongs to n(SX; SY)o, where SX indicates the reduced suspension of X. The suspension Sex is often denoted by Eex in ref- erence to the German term Einhiingung. The identity mapping 1 of SY gives rise to an in- jection i = Qo I sending Y into the loop space Q(SY) determined by the formula i(y)(t) = (y, t). Then we have i* = Qo 0 S: n(X; Y)o-->n(SX; SY)o n(X;QSY)o, and Sand i* are equivalent. Let 1k be the identifying space ykj , where y k is the prod- uct space Y x ... x Y of k copies of Y and  
781 is the equivalence relation determined by (*,Yl,Y2, ""Yk-d(Yl'*'Y2' ""Yk-I)'" (Y"""Yk-I'*)' Denote by Y oo = Uk 1k the limit space with respect to the injection 1k-1 --> 1k given by (y Y ) --> (Yl '" Yk - l *) and call it the 1, ..., k-l " , reduced product space of Y. Let Y be a CW complex of 0- tsection *. The mapIJing i: Y = Y l -->OS Y can then be extended to i: Y 00--> OSY where tis a weak homotopy equiva- , -I lence. If X is also a CW complex, then 0 0 0 t*:n(X; Yoo)o-->n(SX;SY)o is bijective. By smashing the subset Y of Y 2 , we have YA Y= Y 2 /Y. This smashing mapping can be extended to h: Y oo -->( Y A Y)oo (I. M. James). Utilizing h*: n(X; Yoo)o-->n(X; (Y A Y)oo)o and the bijec- tion O(jl 01*, we obtain a correspondence H:n(SX;SY)o-->n(SX;S(YA Y))o. We call H(ex) the generalized Hopf invariant of ex. When X = S2n-2, y=sn-l, H is equivalent to the Hopf invariant y: n 2n - 1 (sn)-->z ( Section U). In general, we have H 0 S = 0, and the exactness of  !!. holds under various conditions. Denote also by 0 the composition of homo- topy classes; then we have S(exofJ)= Sex 0 SfJ and H(exoSfJ)= Hex 0 SfJ, Also, H(SexofJ)= S(exAex)oHfJ. Under the condition i<3n-3, we have (ex I +ex 2 )0 fJ= ex l 0 fJ +ex 2 0 fJ + [ex I' ex 2 J 0 H(fJ) for ex l , ex 2 E nn(X) and fJ Eni(sn) (G. W. Whitehead). Thus the composition ex 0 fJ is not always left distributive but is always right distributive, and ex 0 fJ is left distributive if fJ = S(r. The composition is defined over the stable homotopy groups G r of spheres ( Section U): exofJEG p + q (exEGp,fJEGq).1t is distributive and satisfies fJ 0 ex = ( -ljPq ex 0 fJ. When Yand Y' are Ei1enberg-MacLane spaces, Bo, and Bu, we have tcohomo1ogy operations on cohomology groups H n ( ; II), KO groups, and K groups, respectively. As typical examples there are tSteenrod square operations Sqi:Hn(X; Z2)-->H n + i (X; Z2), tSteen- rod pth power operations f!lJi: Hn(x; Zp)--> H n + 2i (p-I)(X; Zp), tChern characters ch n : K(X) -->H 2n (X; Q) (Q: rational field), tAdams opera- tions <1>i: KO(X)-->KO(X) (K(X)-->K(X)). They are all homomorphisms ( 64 Cohomology Operations; 237 K-Theory). R. Secondary Compositions Suppose that exofJ=O, fJoy=O for YEn(W; X)o, fJEn(X; Y)o, exEn(Y;Z)o. In the com- mutative diagram of Puppe exact sequences ,* p* i* 1. n(SW;Y)o-->n(Cy; Y)o-->n(X; Y)o--> !::x* * to:.". !a. .,_ S"< p I Z I n(SW;Z)o-->n(Cy;Z)o-->n(X; )0--> 202 T Homotopy Theory the set of elements TJ in n(SW; Z) such that p*(TJ)Eex*i*-I(fJ) is denoted by {ex, fJ, 'I} and is called a secondary composition or Toda bracket. If (), I] are elements of n(SW; Y)o, n(SX;Z)o, respectively, then we have {ex,fJ, 'I} +ex*()={ex,fJ,y}, {ex, fJ, 'I} +SY*I] = {ex,fJ,y}. Hence we may consider the set {ex, fJ, 'I} to be a residue class modulo a submodule generated by ex*n(SW; Y)o and Sy*n(SX;Z)o. The secondary composition {ex, fJ, 'I} has the following properties: (i) {ex, fJ, 'I} is linear with respect to ex, fJ, 'I (if the sum is defined); (ii) ex 0 {fJ, 'I, b} = {ex, fJ, 'I} 0 ( - Sb); (iii) S {ex, fJ, 'I} = - {Sex, SfJ, Sy}; (iv) exo {fJ, 'I, b} = {exo fJ, 'I, b}, {exo fJ, 'I, b} = {ex,fJoy, b},...; (v) {{ ex,fJ, 'I}, Sb, S8} + {ex, {fJ, 'I, b}, S8} + {ex, fJ, {'I, 15, 8} } = O. Suppose that the spaces X, Y, Z, Ware spheres. Then by (iii) the secondary composi- tion {ex, fJ, 'I} E Gp+q+r+d(ex 0 Gq+r+l + 'I 0 Gp+q+d is defined in the stable homotopy groups G r = 11 ' m n ( sn ) of S p heres. From this we obtain 71-+00 n+r (vi) {y,fJ,ex} =( _l)Pq+qr+rp+l {ex,fJ, 'I} and (vii) (_l)Pr{ex,fJ, 'I} +( -1)qP{fJ, 'I, ex} +( _l)rq{y,ex,fJ} =0. S. Functional Operations Let <1> be an operation corresponding to ex and y be the class of f We put <1>f(fJ) = {ex,fJ,y} and call <1>f a functional <1>-operation. When <1> is a cohomology operation, <1>f is called a func- tional cohomology operation. Then <1>f(fJ) is defined for fJ satisfying f*(fJ) = <1>(fJ) = 0, and <1>f(fJ) is determined modulo 1m Sf* + 1m <1>. For f:sn+k-->sn, k=2i(p-l)-I, we denote by H (f) ' E Hn+k+l ( sn+k+!.z ) the image p t n +k+l , p of a generator 8n of Hn(sn; Zp) under the functional f!lJj operation. Then the Hopf in- variant modulo p (or modp Hopf invariant) H . n ( sn ) -->z is obtained (we use Sq2i for p P' n+k p . = 2). The following statements are eqUIvalent: (i) The mod2 Hopfinvariant is not trivial (H2 # 0); (ii) there exists a mapping: S2k+l --> Sk+1 of Hopf invariant 1; (iii) Sk is an H-space; (iv) the Whitehead product [I, I] of a genera- tor 1 of ndSk) vanishes. Also, H 2 # 0 if and only if k = 2,4,8 (J. Adams), and for an odd prime p, Hp#O if and only if k = 2p - 3 (A. L. Liulevicius; N. Shimada and T. Yamanoshita). T. Stable Homotopy Groups and Spectra The homotopy set n(snx, snY)o for n-fo1d iterated suspensions snx = X A sn = ssn-l X and sny, forms a group (an Abelian group) if n:;' I (n:;'2). The limit nS(X; Y)=limn(SnX; sn Y)o with respect to the suspension homo- morphisms S: n(snx; snY)o-->n(sn+l X; sn+! Y)o is called a stable homotopy group of X and 
202 U Homotopy Theory Y. For an r-connected space Y and a CW- complex X, S:n(X; Yjo-->n(SX;SY)o is bijec- tive if dim X .s; 2r and surjective if dim X .s; 2r+ 1 (generalized suspension theorem). Thus, if X is a finite-dimensional Cw -complex, nS(X, Y) is isomorphic to n(S"X,snY)o for suf- ficiently large n. To discuss stable homotopy groups more generally, the following concept of spectra is used. A system E = {E k , sd which consists ofCW-complexes Ek and continuous mappings Sk:SEk-->Ek+1 is called a spectrum. When Ek=Sk and Sk= lk+1 :SSk-->Sk+l, S= {Sk, lk+l} is called a sphere spectrum. When Ek = K(G, k) (tEilenberg-MacLane complex) and Sk induces a homotopy equivalence K(G, k)""QK(G, k+ 1), HG = {K(G, k), sd is called an Eilenberg-MacLane spectrum. As in the latter, a spectrum E in which Sk induces a homotopy equivalence Ek "" QEk+1 is called an Q-spectrum. By Bott's periodicity, Q-spectra KU = {Z x Bu, U,Z x Bu, U,...} and KO= {Z x Bo, U/O,Sp/U,Sp,Z x B sp ' U/Sp, SO/U, 0, Z x Bo,... } are obtained ( Section V). Also, using tThom complexes, Thom spectra MU, MO, etc. are obtained. Given a spectrum E, by putting E n (X,A)=limn(Sk(X/A);Ek+do for each pair (X, A) of CW-complexes, we obtain a generalized cohomology theory with E-coefficient; and by putting En(X, A) = limndEk_nA(X/A)), we obtain a generalized homology theory with E-coefficient. tGener- alized (co)homo10gy theory on (finite) CW- complexes can be represented by a suitable spectrum (G. W. Whitehead, E. H. Brown, Adams), Corresponding to E = S, HG, KU, MU, etc., we have stable (co)homotopy groups, G-coefficient (co)homology groups, K-groups, t(co)bordism groups, etc., respec- tively ( 201 Homology Theory). U. Homotopy Groups of Spheres The spheres sn and their homotopy groups are basic objects in homotopy theory. Although much research has been done concerning these objects, there are still open problems. sn is (n-1)-connected: n;(sn) = 0 U<n). The fact that nn(sn)  Z (infinite cyclic group) was obtained from the tBrouwer mapping theorem. Also, n;(SI)=O (i> 1) follows from the fact that the tuniversa1 covering space of SI is contrac- tible. Suppose that we are given a continuous mapping f:S 2n - l -->sn. We approximate it by a tsimplicia1 mapping q>. Then the inverse image <p -I ( * ) of a point * in the interior of an n- simplex of sn is an (n -I)-dimensional tpseudo- manifold which is orientable by means of a suitable generator SE Hn-dq> -I( *)). The boundary isomorphism 8: H n (s2n-l, q> -I (*)) Hn-l (rp-I( *)) and the homomorphism q>*: 782 H n (s2n-1 ,q> -I (* ))-->Hn(S", *) give rise to an in- teger y( q» determined by the relation q>*8 -I (s) = y(q»sn (sn is an orientation of sn). This inte- ger is independent of the choice of <p, so we can set y(/)=y(q». Then f""g implies that y(/) = y(g). We call y(/) the Hopf invariant of f H. Hopfdefined y and showed y:n3(S2)Z (1931); y(n 2n _dS n )) = 0 for odd n; y(n 2 n-1(sn))=:> 2Z for even n; and y(n 2n _dS n )) = Z for n = 4, 8 (1935). H. Freudenthal defined a homomor- phism E: ni(sn)-->n i + 1 (sn+I), E[fJ = [Sf], and proved the Freudenthal theorem: (1) E is an isomorphism for i< 2n - 1; (2) E is a surjection for i = 2n -1; and (3) the image of E coincides with the kernel of y for i = 2n. Furthermore he obtaincd nn+1(sn)Z2 (n p 3) (1937). For n= 2,4, 8, a mapping f: S2n-l-->sn (tHopf map- ping) such that y(/) = 1 (given by Hop£) is the projection of a tfiber bundle s2n-1 over the base space sn, and the correspondence (ex, [3)--> Eex+ f*[3 gives an isomorphism ni_dsn-I)+ n;(s2n-l) (direct sum)n;(Sn). Hence we ob- tain n 4 (S2)=Z2' It was shown by G. W. Whitehead and L. S, Pontryagin that nn+2(sn) (n p 3) is isomorphic to Z2 (1949). Whitehead also defined a generalized Hopf homomorphism H: n;(sn)-->n i (S2n-l) for a range of i < 3n - 3, and this restriction on the dimension was removed by P. J. Hilton and I. M. James. Using H, many nontrivial results concerning ni(sn) have been obtained. Serre obtained the following (1951-1953): n;(sn) is finite except when i = n or i = 4m - 1 and n = 2m. Further- more, n 4m - 1 (S2m) is the direct sum of Z and a finite group. Let p be an odd prime and n be even. Then ni(sn) is Ybp-isomorphic to ni-I (sn-I) + n;(S2n-I). Let n be odd. Then nn+k(sn)EYb p (k<2p-3), and nn+2P_3(sn) is Yb p - isomorphic to Zp. Serre and H. Toda deter- mined nn+k(sn) for k = 3,4,5, and Serre further determined it for k = 6, 7, 8. Utilizing the re- duced product space of sn, James gave the sequence ." -->ni(S")ni+1 (sn+1 )ni+1 (S2n+l) -->ni-I (sn)-->... and showed that it is an exact sequence if n is odd and an exact sequence mod 2 if n is even (1953). Using this exact sequence and the secondary composition, Toda determined nn+k(sn) for k.s; 19 ( Appendix A, Table 6.VI). By the Freudenthal theorem (1), the nn+k(sn) (n > k + 1) for a fixed k are isomorphic to each other. We call nn+k(sn) (n > k + 1) the stable homotopy group of the k-stem of the sphere and denote it by G k . For k=O, 1,2, ..., 15, ''', G k  Z, Z2, Z2' Z24' 0, 0, Z2, Z240, Z2 + Z2, Z2 + Z2 +Z2, Z6, ZS04' 0, Z3, Z2 +Z2' Z480 + Z2' .... For the computation of G k , the notion of n-connective fiber spaces is important. By 
783 utilizing the Adams spectral sequence, We can show that G k is closely related to the cohomol- ogy of the tSteenrod algebra. Let p be an odd prime. There exist the folIowing sequences of elements of order p: {exiE G k (k = 2i(p- 1)- 1 j}, {PiE G k (k = 2(ip+ i - 1 )(p-1)- 2)} and for p> 3{YiEGdk= 2(i p 2 +(i-l)p+ i- 2)(p- 1)- 3)}. The p-component of G k is determined for k <2 p 2(p-I)- 3 by using Steenrod algebra. To compute G k for higher k, relations such as ex l Pf = 0, Pi#[ = 0 (i> 1) are necessary. In gen- eral, each element of G k (k # 0) is nilpotent (G. Nishida). Let n;(sn: p ) be the p-component of ni(sn). To survey this group for the nonstable case (i?o 2n - 1), we utilize Serre's mod p direct sum decomposition (for n even), and we have the following two exact sequences for the case of odd n: '" -->n;(sn)ni+2(sn+2)-->ni(Q2(sn+2), sn)"., ... -->n i + 3 (spn+ p +l : p)ni+l (spn+p-l : p) -->n i (Q2(sn+2), sn: p)-->ni+2(spn+p+l : p)..., where E 2 =EoE and E2(ex)=pex (Ap- pendix A, Table 6.VI). V. Homotopy Groups of Classical Groups Consider the classical group U(n, A), which is either the orthogonal group O(n) (A = R); the unitary group U(n) (A=C); or the symplectic group Spin) (A = H). The infinite classical group U(oo,A) is defined to be the inductive limit group of {Urn, A) I n = 1,2,... } with re- spect to the natural injection U (/I, A) c U (n + I,A). We caIl U(oo,A) the infinite orthogonal group, infinite unitary group, and infinite sym- plectic group for A = R, C, and H, respectively The dimensions of the cells of U( 00, A)- U(n, A) are ?oA(n+ 1)-1, where A=dimRA (== 1 (A = R), =2 (A=C), =4 (A=H)). It foJlows that ndU(n, A)) is isomorphic to ndU(oo,A)) for k<),(n+ 1)-2, which is called the kth stable homotopy group of the classical group. Let 0 = U( 00, R), U = U( 00, C), Sp= U( 00, H). The homotopy groups of the clas- sical groups are periodic (k?oO): nk(U)nk+2(U)Z, k odd,  0, keven, nk(O) nk+4(Sp) nk+S(O), Z, k=3,7(mod8), Z2' k=O,l (mod 8), O, k#O, 1,3,7 (mod 8). This is caIled the Bott periodicity theorem. The relations are deduced from weak homotopy equivalences U -->Q(B u ), Bu x Z-->Q(U), Bo x Z -->Q(U/O), U/O-->Q(Sp/U), Sp/U -->Q(Sp), Sp 203 A Hopf Algebras -->Q(B sp ), Bsp x Z-->Q(U/Sp), U/Sp-->Q(O/U), O/U -->Q(O). This result is applied to non- stable cases; for example, n 2n (U(n)) is a cyclic group of order n! ( Appendix A, Table 6.VI). The 2-dimensional homotopy group n2(G) of any Lie group is trivial. Let exEnk(O(n)), where ex=[f],f:Sk-->O(n). We define 1: Sk X sn-l-->sn-l by l(x, y)=/(x)' y and identify Sk+n with the boundary (Ek+l X sn-l)u (Sk X En) of Ek+l X En. We extend 1 to f:sk+n-->sn so that it maps Ek+l X sn-l, Sk X En into the upper and lower hemisphere of sn(sn I = the equator), respectively. Let J(ex)E nn+k(sn) be the class of the mapping thus ob- tained. This homomorphism J : nd O( n))--> nn+k(sn) is calJed a J-homomorphism of Hopf and Whitehead. For the stable case, J: nk(O)--> G k is injective for k = 0, 1 (mod 8), and the order of the image of J is the denominator of B 2t /4t (B 2t is a tBernoulli number) or its double for k=4t-1 (Adams). References [IJ H. Hopf, Uber die Abbildungen von Spharen auf Spharen niedriger Dimension, Fund. Math., 25 (1935),427-440. [2J W. Hurewicz, Beitrage zur Topologie der Deformationen I-IV, Proc. Acad, Amsterdam, 38 (1935), 112-229, 521528; 39 (1936),117- 125,215-224. [3] H. Freudenthal, Uber die Klassen der Spharen-abbildungen, Compositio Math., 5 (1937), 299314. [4J J.-P, Serre, Groupes d'homotopie et classes des groups abeliens, Ann. Math., (2) 58 (1953), 258-294. [5] R. Bot!, The stable homotopy of the class- ical groups, Ann. Math., (2) 70 (1959),313- 337. [6J H. Toda, Composition methods in homo- topy groups of spheres, Princeton Univ. Press, 1962. [7J E. H. Spanier, Algebrai<.: topology, McGraw-HilI, 1966. [8J R. M. Switzer, Algebraic topology, Springer, 1975. [9J G. W. Whitehead, Elements of homotopy theory, Springer, 1978. 203 (111.25) Hopf Algebras A. General Remarks The concept of Hopf algebras arose from two directions. First in the field of algehraic topol- 
203B Hopf Algebras ogy the notion of Hopf algebras arose from the study of homology and cohomology of Lie groups or, more general1y, H-spaces. It was introduced by H. Hopf [IJ, whose basic struc- ture theorem was generalized and applied to several problems by A, Borel [2]. These Hopf algebras are graded as algebras and coalge- bras, and they are now used as standard tools in algebraic topology. On the other hand, Hopf algebras without grading were studied in connection with affine algebraic groups and formal groups. The study of nongraded Hopf algebras as an algebraic system was initiated by M, E. Sweedler [3J, and many results on Hopf algebras of this type have been applied, not only to the theory of algebraic groups but also to the Galois theory of field extension and to combinatorial theory. Although these two types of Hopf algebras have similar structures, and the same termi- nology is used to describe their properties, they are somewhat different from each other. So to avoid confusion in this article, we dis- tinguish between graded Hopf algebras and Hopf algebras. B. Graded Algebras A tgraded module A = Ln;;<oAn over a field k is said to be of finite type when each An is finite- dimensional, A is connected when an isomor- phism IJ: k  Aa is given. The ttensor product of two graded modules A and B is a graded module with A @ B=Ln(A @ B)n' (A @ B)n= LpA p @ Bn-p' We cal1 A* = L A: (where A: is the tdua1 module of An) the dual graded module of A. When A and B are of finite type, A @ Band A* are also of finite type, and we have(A@B)*=A*@B* and A**=A, When A and B are connected, A * and A @ B are also connected. Let A be a graded module. If there exists a degree-preserving linear mapping q>: A @ A--> A, we cal1 (A, q» a graded algebra, whereas if there exists a degree-preserving linear mapping 1jJ: A --> A @ A, we cal1 (A, 1jJ) a graded coalgebra. We call q> a multiplication, and IjJ a comultipli- cation (or diagonal mapping). Usual1y we write q>(a @ b) = ab (the product of a, bE A), and call ljJ(a) the coproduct of a. Multiplication and comultiplication are dual operations. If A is of finite type and (A,<p) is a graded algebra, then (A*,q>*) (where q>* is the dual mapping of q» is a graded coa1gebra, and vice versa. A multi- plication q> is called associative (commuta- tive) if q>(1 @ <p)==<p(q>@ 1) (q>o T=q», where T: A @ A-->A @ A is the mapping defined by T(a @ b)=( -l)Pqb @ a for aE Ap and bE Aq. Associativity and commutativity of a comulti- plication are defined dually. Let (A, 1jJ) be a 784 graded connected coalgebra of finite type, and identify k with Ao via IJ. Then the graded algebra (A*, 1jJ*) has the unity of k as unity if and only if IjJ satisfies 1jJ(J) = 1 @ 1 and ljJ(x) = 1 @x+x@ 1 + LiX;@ x;' (O<degx;<degx) for degx > 0, In this case we say that IjJ has the unity of k as counity. For graded algebras (A, q» and (B, q>'), if q>" =(q> @ q>') 0 (1 @ T@ 1): A @ B@ A @ B-->A @ B, then (A @ B,q>") is also a graded algebra, which we denote by (A @ B, <p")= (A, <p) @(B,q>'). The tensor prod- uct of graded coalgebras is defined as the dual notion of (A, q» @ (B, <p'). C. Graded Hopf Algebras For simplicity we assume that graded mod- ules are defined over a field, connected, and of finite type. Let a graded module A be equipped with a multiplication <p and a co- . multiplication 1jJ. If q> and IjJ have the unity of k as unity and 1jJ:(A,q»-->(A,q»@(A,q» is an algebra homomorphism, then we call (A, q>, 1jJ) a graded Hopf algebra, The last condition for a graded Hopf algebra is satisfied if and only if <p :(A, 1jJ) @ (A, 1jJ)-->(A, 1jJ) is a homomorphism of graded coa1gebras. The dual (A*, 1jJ*, q>*) is also a graded Hopf algebra, cal1ed the dual Hopf algebra of (A, (P, 1jJ). D. H-Spaces Let X be a topological space, The tcoho- mology group H*(X) (thomology group H*(X)) considered over a field k has a multi- plication d* (comultiplication d*), which is induced by the diagonal mapping d:X -->X X X and hecomes a commutative and associative graded algebra (coalgebra). The groups H*(X) and H*(X) are dual to each other ( 201 Homology Theory I, J). When X is equip- ped with a base point Xo and a base point- preserving continuous mapping h: X x X --> X such that h 0 /i  1 x (thomotopic) for i = 1 and 2 (where /1 (x) =(x, xo) and /2(X) =(x o , x)), we call (X, h) an H-space, h a multiplication, and Xo a homotopy identity of X. Then h induces, through a tKiinneth isomorphism, a co- multiplication h*: H*(X)-->H*(X) @ H*(X) (Hopf comultiplication) and a multiplication h* :H*(X) @ H* (X)--> H*(X) (Pontryagin multiplication). Then h*(ex) (exEH*(X)) is called the Hopf coproduct of ex, and h*([3 @ y) ([3, YEH*(X)) is called the Pontryagin product of fJ and y. When X is tarcwise connected and H*(X) is of finite type, (H*(X),d*,h*) and (H*(X), h*, d*) are graded Hopf algebras dual to each other. In particular, when h is homot- opy associative, i.e., h 0 (h x 1 x)  h 0 (l x x h) (homotopy commutative, i.e., h  h 0 r where 
785 T(X t ,X 2 )=(X 2 ,X t ) for XiEX), then h* and h* are associative (commutative). tTopological groups and tloop spaces are homotopy as- sociative H-spaces. If a continuous mapping g:X -->X satisfies ho(\x xg)ho(g x Ix)c (constant mapping X --> {Xo} ), then 9 is called a homotopy inverse for X, h. Suppose that a graded Hopf algebra A is defined over a field k of characteristic p and equipped with associative and commutative multiplication, and A is generated by a single element aEAn' Then A is a tpolynomial ring k[aJ (n is even when p#2) or a tquotient ring k[aJ/(a 2 ) (n is odd when p # 2) or k[aJ/(a pf ) (only when p # 0; n is even when p # 2). These are called elementary Hopf algebras. Every graded Hopf algebra over a tperfect field k with associative and commutative multiplica- tion is isomorphic (as a graded algebra) to a tensor product of elementary Hopf algebras (Borel's theorem) [2]. In particular, the coho- mology algebra over a field of characteristic 0 of a tcompact connected Lie group is isomor- phic to a tGrassmann algebra generated by elements of odd degrees [I]. E. Steenrod Algebras The tSteenrod algebra ,91 p over Zp is generated by tSteenrod operations Sqi (p = 2), f!lJi (p > 2), and the tBockstein operation /).p (p > 2), with composition of operations defined as multi- plication. Then ,91 p is a connected associative graded algebra of finite type (not commuta- tive). Defining a comultiplication IjJ of d p by IjJ(Sqn) = L Sqi @ Sqn-i, 1jJ(f!lJ n ) = L.Oj'i @ .OJ'n-i, and 1jJ(p)= 1 @ /).p+ p @ I, ,91 p becomes a graded Hopf algebra with an associative and commutative comultiplication. Thus its dual d is a graded Hopf algebra with an associa- tive and commutative multiplication, and we can apply Borel's theorem to d in order to investigate the structure of d p [4]. Let (A, <p, 1jJ) be a graded Hopf algebra with associative multiplication and comulti- plication. Putting c(1)= I and c(a)= -a- La;' c(ai') for dega >0 (where ljJ(a) = 1 @ a + a @ I + L ai @ ai'), We obtain a linear map- ping c:A-->A satisfying c<p=<p(c@c)T. We call c the conjugation mapping of A. When the multiplication or comultiplication is commuta- tive, we obtain the relation c 2 = I, and c is a bijection, The conjugation mapping is utilized in studying Steenrod algebras [4,5]. F. Coalgebras Now we turn to nongraded cases. Let A be a vector space Over a field k, and let Ji: A @ k A --> A and I}:k-->A be linear mappings. Then the 203F Hopf Algebras triple (A, Ji, I}) is said to be an algebra over k if Jio(Ji @ 1A)=Jio(IA @ Ji) and JiO(1A @ I})= Ji 0 (I} @ 1 A) = I A, where 1 A is the identity mapping of A, and A @ kk and k @ kA are identified naturally with A. We call Ji the multi- plication and I} the unit mapping of the alge- bra. DuaIly a triple (C, /)., B) with C a vector space over k, linear mappings /).: C-->C @ kC, and B: C -->k is said to be a coalgebra over k if (\c@ /).)o/).=(/).@ Ido/). and (1c@B)O/).= (B @ Ido/).= Ic. We call /). the comultiplica- tion or the diagonal mapping and B the aug- mentation or the counit of the coalgebra. An algebra (A, Ji, I}) is called commutative if Ji 0 T= Ji, where T is the twist mapping a @ b f-> b @ a. A cocommutative coalgebra is defined dually. Definitions of the tensor prod- uct of two algebras or coalgebras are similar to the definitions in the graded case. If D is a subspace of a coalgebra (C, /)., B) over k satisfy- ing /)'(D)cD @ kD, then (D, /).ID, BID) is a co- algebra and is said to be a snbcoalgebra of C. A subspace I of a coalgebra (C, /)., B) over k is called a coideal of C if /).(I)c C @ J + I @ kC and B(I) = O. Then the quotient space Cj I has a coalgebra structure induced naturally from (C, /)., B) and is said to be a quotient coalgebra of C. The intersection and the sum of sub- co algebras of C are again subcoalgebras of C. If S is a subset of C, then the intersection of all subcoalgebras containing S is said to be the subcoalgebra generated by S. The subcoalgebra generated by any finite set or finite-dimensional subspace of C is finite- dimensional. If (C, /)., B) is a coalgebra over k, then the dual space C* of C has an algebra structure over k with multiplication Ji and unit mapping I} defined naturally from the tdual mappings of  and B respectively. (C*,Ji,I}) is called the dual algebra of (C,/).,B). Suppose that (A, Ji, I}) is an algebra over k, and let A ° be the subset of the dual space A* of A consisting of elements f whose kernel contains an ideal I such that A/I is finite-dimensional. Then (A 0, /)., B) is a co- algebra over k, where /). and B are the linear mappings induced from the dual ones of Ji and I}, respectively. (AO, /)., B) is called the dual co- algebra of (A, Ji, I}). The tfunctors ( )* and ( )0 are adjoint to one another in the sense that there is a natural bijective correspondence between the set of algebra homomorphisms of A to C* and that of coalgebra homomor- phisms of C to AO for any coalgebra C and algebra A, where coalgebra homomorphisms are defined as the dual notion of talgebra homomorphisms. A nonzero subcoalgebra D of a coalgebra C is called simple if D has no nonzero proper subcoalgebras, and the sum of all simple sub- coalgebras of C is called the coradical of C. If 
203G Hopf Algebras C coincides with its coradical, then C is said to be cosemisimple. If C has only one simple subcoalgebra, then C is called irreducible. C is called pointed if all simple subcoalgebras of C are I-dimensional. An element 9 of a coalgebra (C,, E) is called grouplike if (g) = 9 @ 9 and E(g)= 1. The set G(C) of grouplike elements in C is linearly independent over k. G. Bialgebras A system (H, Ji, I},, E) with an algebra structure (H, Ji, I}) and a coalgebra structure (H,, E) is said to be a bialgebra over k if  and E are algebra homomorphisms. This last condition is equivalent to saying that Ji and I} are coalge- bra homomorphisms. If K is a subspace of a bialgebra (H, Ji, I},, E) which is simultaneously a subalgebra and a subcoalgebra of H, then we call K a subbialgebra of H. An ideal I of (H, Ji, I}) which is also a coideal of(H,, E) is called a biideal of H and the quotient space H/I has a bialgebra structure which is said to be a quotient bialgebra of H. A linear mapping between bialgebras is a bialgebra homomor- phism if it is simultaneously an algebra homo- morphism and a coalgebra homomorphism. A bialgebra (H, Ji, I},, E) is called commutative (cocommutative) if(H,Ji,I}) «H,,E)) is com- mutative (co commutative). Examples. Let kG be the vector space with a set G as free basis over a field k. If we define linear mappings : kG-->G @ kkG by (x)= x @ x and E: kG-->k by E(X) = 1 for x in G, then (kG, , B) is a cocommutative coalge- bra over k such that the set G(kG) of group- like elements is equal to G. Moreover if G is a tsemigroup with unit element, then (kG, Ji, I},, E) is a cocommutative bialgebra over k, where Ji(I}) is the multiplication (unit mapping) of the tsemigroup algebra kG. This bialgebra is called a semigroup bialgebra over k, If Lis a tLie algebra over k and U(L) the tuniversal enveloping algebra of L with multi- plication Ji and unit mapping I}, then Lie alge- bra homomorphisms x-->x EB x (L-->LEB L) and x-->O (L-->{O}) induce algebra homomor- phisms: U(L)-->U(LEB L) U(L)@ U(L) and B: U(L)-->k, respectively. Then (U(L),Ji,I},,E) is a cocommutative bialgebra over k, called the universal enveloping bialgebra of L. H. Hopf Algebras Let (A, Ji, I}) be an algebra over k and (C,, E) a coalgebra over k, If f and 9 are in R = Homk(C, A), thenf*g=Ji0(/@g)0 is called the convolution of f and g. Defining Ji': R @ kR-->R and I}': k-->R by Ji'(/@g)=f*g 786 and I}'(ex) = exl} 0 E, then (R, Ji', I}') is an algebra over k. If H is a bialgebra over k with underly- ing coalgebra H C and algebra HA, then R H = Homk(H c , H A) is an algebra over k in the same manner as above. If the identity mapping 1 H of H has the inverse S in R H under the multiplication Ji', then H is said to be a Hopf algebra over k with antipode S. Then S satisfies the following: S(gh) = S(h)S(g) for g, h in H, SOI}=I}, WS=E and To(S@S)o=oS, where T is the twist mapping a@ b-->b @ a. If H is commutative or cocommutative, then SoS= I H . If H' is another Hopf algebra over k with antipode S', then a bialgebra homomorphism f of H to H' such that S'f = fS is called a Hopf algebra homomorphism. If Hand H' are both commutative (cocommutative), then any bi- algebra homomorphism of H to H' is a Hopf algebra homomorphism. The tcategory whose objects are commutative and cocommutative Hopf algebras over a field k and whose mor- phisms are Hopf algebra homomorphisms is an tAbelian category (A. Grothendieck). If His a commutative Hopf algebra over a field of characteristic zero, then the underlying algebra H A has no tnilpotent elements (P. Cartier). If G is a tgroup, then the group bialgebra kG given above has an antipode S defined by S(x)=x- 1 for x in G, and hence kG is a cocommutative Hopf algebra over k. Another example of Hopf algebras is the tcoordinate ring of an talgebraic group defined over k. More generally, let X = Spec(A) be an taffine group scheme over k. Then algebra homomor- phisms : A -->A @ kA, E: A -->k, and S: A-->A are naturally induced from the group structure of X, and (A, Ji, I},, E, S) is a commutative Hopf algebra over k, where Ji and I} are the multiplication and unit mapping of A, respec- tively. Conversely, if (A, Ji, I},, B, S) is a com- mutative Hopf algebra over k, then X = Spec(A) is an affine group scheme over k with the group structure induced from , E, and S. Hence a commutative Hopf algebra over k is nothing but a cogroup object of the category of commutative algebras over k (i.e., a tgroup object of the tdual category). Dually a cocommutative Hopf algebra over k is nothing but a group object of the category of cocommutative coalgebras over k. I. Hyperalgebras If (C,, B) is a pointed irreducible coalgebra over k, then C contains a unique grouplike element g, and kg is the unique simple sub- coalgebra of C. An element a of C satisfying (a) = a @ 9 + 9 @ a is called primitive. The set P(C) of primitive elements in C is a vector 
787 subspace of C. A cocommutative coalgebra C is called colocal if C is irreducible, i.e., if the dual algebra C* of C is a tquasilocal ring. The tdimension of P(C) of a colocal pointed coalge- bra C is finite if and only if the dual algebra C* of C is a tNoetherian complete local ring. A bialgebra (H, Ji, IJ, f., F.) over k is said to be a hyperalgebra over k if the underlying coalgebra is colocal. Then the unique simple subcoalge- bra (grouplike element) of His lJ(k) (1J(1)), and P(H) has a Lie algebra structure defined by [x, yJ =xy- yx for x,y in P(H). The universal enveloping bialgebra U(L) of a Lie algebra Lover k is a hyperalgebra over k such that the set P(U(L)) of primitive elements in U(L) is equal to L. Conversely, if the char- acteristic of k is zero, any hyperalgebra Hover k is isomorphic to the universal enveloping algebra U(P(H)) of the Lie algebra P(H). But in positive-characteristic cases U(P(H)) is generally a proper subbialgebra of H. Another important example of hyperalgebras is the dual coalgebra hy(X)=Ao of the tstalk A at the neutral point of the tstructure sheaf of an talgebraic group scheme X over k. In addition, hy(X) has an algebra structure defined from the group structure of X and is a hyperalgebra over k such that P(hy(X)) is equal to the Lie algebra L(X) of X. Although L(X) plays an important role in the infinitesimal theory of algebraic groups over a field of characteristic zero, it does not give any information on in- finitesimals of orders higher than p in the case of positive characteristic p. In the case of char- acteristic zero we see hy(X)= U(L(X)) and L(X) = P(hy(X)), and so hy(X) is a natural substitute for L(X) in positive-characteristic cases. From this viewpoint many interesting results on hy(X) of an algebraic group scheme X which are paraIlel to those on L(X) in the case of characteristic zero have been obtained [6-8]. References [1] H. Hopf, Uber die Topologie der Gruppen-Mannigfaltigkeiten und ihre VeraIl- gemeinerungen, Ann. Math., (2) 42 (1941), 22-52. [2J A. Borel, Sur la cohomologie des espaces fibres principaux et des espaces homogenes des groupes de Lie compacts, Ann. Math., (2) 57 (1953),115-207. [3] M. E. Sweedler, Hopf algebras, Benjamin, 1969. [4] J. W. MiInor, The Steenrod algebra and its dual, Ann, Math., (2) 67 (1958),150-171. [5] J. W. Milnor and J. C. Moore, On the structure of Hopf algebras, Ann. Math., (2) 81 (1965),211-264. 204B Hydrodynamical Equations [6J J. Dieudonne, Introduction to the theory of formal groups, Dekker, 1973. [7] M. Takeuchi, Tangent coalgebras and hyperalgebras I, Japan. J. Math., 42 (1974),1- 143. [8J H. Yanagihara, Theory of Hopf algebras attached to group schemes, Lecture notes in math. 614, Springer, 1977. [9J E. Abe, Hopf algebras, Cambridge Univ. Press, 1980. (Original in Japanese, 1977.) [lOJ G.-c. Rota, Coalgebras and bialgebras in combinatorics, Lecture notes at the umbral calculus conference, University of Oklahoma, 1978. 204 (XX.1 0) Hydrodynamical Equations A. General Remarks Mathematical analysis of the motion of fluids ( 205 Hydrodynamics) gives rise to various kinds of mathematical problems; the equations that govern flows are amongst the most im- portant and most extensively studied of the nonlinear partial differential equations. Here we review only basic and noteworthy results. Sections B-E are concerned with incompress- ible fluids, while Sections F and G deal with compressible fluids. B. Nonstationary Solutions of the Navier- Stokes Equation Let 0 be a bounded domain in R m (m = 2 or 3) occupied by a fluid, with smooth boundary 00. If the fluid is viscous and incompressible, its motion can be described by means of the velocity u = u(t, x) and the pressure p = p(t, x) with tO and XEO the time variable and the space variable, respectively. For simplicity, we assume that external forces are absent. Then u and p satisfy the Navier-Stokes equation ou at + (uV)u = v,,1u - Vp, (1) and the equation of continuity divu=O, (2) where the positive constant v stands for the (kinetic) viscosity. On 00, u is subject to the boundary condition u I ,'n = [3(t, x). (3) The initial value of u is also prescribed: u I ,o = uo(x). (4) 
204C Hydrodynamical Equations If Q is un bounded, e.g., if Q is an exterior domain outside compact surfaces, then u(t,x)-->Uo(t) (Ixl-->oo) is imposed in addition as the boundary con- dition at infinity. The problem of finding u and p that satisfy (1 )-(4) for given fJ and Uo is called the Navier- Stokes initial value problem (abbreviation, NS initial value problem). Pioneering mathemat- ical studies of this problem were initiated by 1. Leray, and since the 1950s various contri- butions have been made by many authors, including E. Hopf, O. A. Ladyzhenskaya, H. Fujita, T. Kato, and S. Ito ( [13,17, 21J). We state here the result in terms of regular (class- ical) solutions under the simplifying assump- tion that Q is bounded, # = 0, and U o is sole- noidal (divuo=O) and smooth. The situation depends non trivially upon the dimension m. Namely. if m = 2, then a regular solution of the NS initial value problem exists uniquely and globally, i.e., for all time. If m = 3, we can prove only a local existence theorem (i.e., one hold- ing in a finite interval) of a regular solution. This solution is unique in the internal of its existence; however, it can be extended over the whole interval when the Reynolds number is sufficiently small. In other words, the ques- tion of well-posedness of the 3-dimensional NS initial value problem is open at present. C. Weak and Strong Solutions of the Navier- Stokes Equation In 1951 Hopfintroduced the notion of weak solutions of the NS initial value problem and succeeded in proving their global existence (without uniqueness). We here give the defini- tion of Hopf's weak solution: Let Cg: a be the set of vector functions UE C;{'(Q) with div u = O. By H we denote the closure of Ca under the L 2- norm , and by V the closure of Ca under the Wi (Q)-norm (or equivalently, the Dirichlet norm for bounded Q), Then an H-valued func- tion u = u(t) is a weak solution of the NS initial value problem with {1=0 in [0, T) (0< T.s; +CXJ) if (i) UEU'(O, T;H)nL 2 (0, 00; V), (ii) u is weakly continuous from [0, T) to H, and (iii) u satisfies the weak equation IT {(u, <P')L'(n) - v(Vu, V<p)P(!1) +(u'V)<P,U)p(!1)}dt= -(u o ,<p(O))P(!1) (5) for all <P E C;{, ([0, T) x Q) with div (p = O. Note that the pressure p has been eliminated 788 (3') in the weak equation. In general, the unique- ness of Hopf's weak solution is not known except for m = 2. However, when a regular solution does exist, then the weak solution coincides with it (a. e.), and is unique. Solutions of the NS initial value problem which are stronger than the weak solution to the extent that the uniqueness can be proved have been introduced, for instance, by A. A. Kiselev and Ladyzhenskaya [12] and Fujita and Kato [5]. Actually the existence and uniqueness theorems of the aforementioned regular solu- tions are proved using existence theorems of such strong solutions. Recently, Y. Giga and T. Miyakawa succeeded in generalizing the Fujita-Kato theory from U to LV, and thus they have shown that ifuoELm(Q) and is sole- noidal, a unique strong solution exists at least locally for any m. Those strong solutions that satisfy unique- ness, and hence are regular solutions of the NS initial value problem, turn out to be smooth for t>O; they are analytic in t>O and XEQ. D. Stationary Solutions of the Navier-Stokes Equation If the flow is steady, u and p are solutions of the boundary value problem consisting of (1) with au/at omitted, (2) and (3) with # indepen- dent of t (and, in addition, (3') with a constant U o if Q is an exterior domain). The existence of solutions of this boundary value problem for the case of bounded Q was established by Leray as one of the earliest applications of his fixed-point theorem. FOr the case of un- bounded Q, Leray's study was completed and extended by R. Finn, H. Fujita, and others to yield theorems on existence, regularity, and asymptotic behavior in the wakes of solutions ( [4,13,21]). These stationary solutions are unique if the Reynolds number is sufficiently small. On the other hand, under certain cir- cumstances that involve Quette flow, non- uniqueness or bifurcation of stationary solu- tions for large Reynolds numbers has been positively proved, E. Euler's Equation If the fluid is inviscid and incompressible, the Navier-Stokes equation is reduced to Euler's equation au T+(u'V)u= -Vp. ct (6) Then the boundary condition is replaced by the frictionless boundary condition, which, in 
789 the homogeneous case, takes the form unlan=O, where un is the normal component ofu. Regu- lar solutions of the initial value problem con- sisting of (6) with (2), (4), and (7) have been proved to exist for all t if m = 2 and in a finite time if m= 3 [1,11]. F. The General Navier-Stokes Equations If the fluid is compressible, viscous, and heat- conductive, its motion is described in terms of the density p, the velocity u, and some thermo- dynamic quantity, say the absolute tempera- ture e, and is governed by the following system of equations, sometimes called the general Navier-Stokes equations: ap -+div(pu)=O, ot au +(u. V )u=t: ( +VdiV ) U-VP' at p 3 p 00 K O.  -;;-+(u'V)O=-M+-(dIVU)Pe+-. ot cvP cvP cvP Here the pressure P is regarded as a function of p and 0 through the equation of state. The viscosity coefficient Ji, the coefficient of heat conduction K, and c v , the specific heat at constant volume, are positive constants. We have for simplicity assumed that the external force is absent, and that the Stokes condition for viscosity is satisfied. Finally,  is the dis- sipation function: = -Ji(divu)2+t: f ( Uj+U, ) 2 3 2 i,j1 ax, aX j If Q is the whole space, then the initial values Po, u o , eo of p, u, e are given at t = 0, and we obtain the Cauchy problem for the general Navier-Stokes equation. Mathematical study of this Cauchy problem has become active since J. Nash [19J and N. Itaya [8J proved the existence of unique regular solutions local in time. Following Itaya's argument, A, Tani constructed a unique regular solution local in time for the initial boundary value problem consisting of (8)-(10) and boundary conditions imposed on u and O. When it comes to the global existence of solutions, only restricted results are known. The global existence of a regular solution in the I-dimensional version of the Cauchy problem has been established by Ya. KaneI' [lOJ and Itaya under certain simplifying assumptions, such that the fluid is a barotropic gas, i.e., one obeying P = cpy, where C and y:;:' 1 are positive constants. The 204G Hydrodynamical Eqnations (7) I-dimensional initial boundary value problem can also be solved globally if the gas is ideal and polytropic, i.e., one for which p = Rpe, and with the internal energy proportional to e and [9]. For the 3-dimensional case, we can only refer to [18J, where existence of global solu- tions of the Cauchy problem has been proved for initial data close to constants under the as- sumption that the gas is ideal and polytropic, G. Equations for Inviscid Ideal Gases When the gas under consideration is inviscid, ideal, and barotropic, we put Ji = 0 and p = C p y in (9). The equation thus obtained is combined with (8) to yield the following quasilinear hyperbolic system, which admits conservation laws: (8) ap . a;-+dIV(pU)=O, (9) ou C -+(u'V)u+-vPy=O. at p (11) (10) It the initial data Po and U o are smooth to some extent, then the Cauchy problem for (11) has a regular solution local in time. Generally, we cannot expect existence of regular solu- tions global in time, namely, discontinuity is likely to take place in a finite time, which cor- responds to the occurrence of shock waves. Therefore we have to introduce weak solutions that admit discontinuity. By definition a piece- wise continuous function {p, u} is a weak solution of (11) if it satisfies (11) in the distri- bution sense and if its discontinuity is sub- ject to a certain jump condition, called the Rankine-Hugoniot relation as wcll as another condition, called the entropy condition, which two conditions allow us to distinguish a physi- cally realizable solution among many possible discontinuous solutions [3,14]. Global exis- tence of the weak solution has been proved so far only for the I-dimensional problem with initial data close to constants in the sense that their oscillations and total variations are suffi- ciently small. Actually in this case we can apply J. Glimm's method [6J to construct weak solutions by means of a difference ap- proximation which involves random numbers. Little is known regarding the uniqueness of weak solutions. Finally, if we are concerned with steady-state solutions of an inviscid com- pressible fluid and assume that the flow is irrotational, then we are led to a quasilinear partial differential equation of mixed type for the velocity potential <1>, which is elliptic in the subsonic region and hyperbolic in the super- sonic region. Classical results concerning these equations may be found in [2,3]. 
204 Ref. Hydrodynamical Equations References [1] C. Bardos, Euler equation and Berger equation-Relation to turbulence, Lecture notes in math. 648, Springer, 1978, 1-46. [2J L. Bers, Mathematical aspects of subsonic and transonic gas dynamics, Wiley, 1958. [3J R. Courant and K. O. Friedrichs, Super- sonic flow and shock waves, Interscience, 1948. [4] R. Finn, Stationary solutions of the Navier-Stokes equation, Amer. Math. Soc. Proc. Symp. Appl. Math., 17 (1965),121-153. [5J H. Fujita and T. Kato, On the Navier- Stokes initial value problem I, Arch, Rational Mech. Anal., 16 (1964), 269-315. [6J J. Glimm, Solutions in the large for non- linear hyperbolic systems of equations, Comm. Pure Appl. Math., 18 (1965), 697-715. [7] E. Hopf, Dber die Anfangswertaufgabe fur die hydrodynamischen Grundgleichungen, Math. Nachr., 4 (1950-1951), 213-231. [8J N. Itaya, On the Cauchy problem for the system of fundamental equations describing the movement of compressible viscous fluid, Kodai Math. Sem. Rep., 23 (1971), 60-120; corrigenda and addenda, J. Math. Kyoto Univ., 16 (1976), 239-240. [9] A. V. Kazhikhov and V. V. Shelukhin, Overall solvability of single-valued functions for the initial boundary problem for one- dimensional equations of viscous gases (in Russian), Prikl. Mat. Mekh., 41 (1977),282- 291. [10J Va. I. KaneI', A model system ofequa- tions for the movement of gases in one di- mension (in Russian), Differentsial'nye Urav- neniya,4 (1968), 721-734. [IIJ T. Kato, On classical solutions of two- dimensional non stationary Euler equation, Arch. Rational Mech. Anal., 25 (1967), 188- 200. [12J A. A. Kiselev and O. A. Ladyzhenskaya, On the existence and uniqueness of solutions of nonstationary problems for viscous, in- compressible fluids (in Russian), Izv. Akad. Nauk SSSR, 21 (J 957), 655-680. [13] O. A. Ladyzhenskaya, The mathematical theory of viscous incompressible flows, Gor- don & Breach, 1963. (Original in Russian, 1961.) [14] P. D. Lax, Hyperbolic systems of con- servation laws and the mathematical theory of shock waves, SIAM Regional Con£. Ser. Appl. Math., 11 (1973). [15J J. Leray, Etude de diverses equations integrales non lineaires et de quelques prob- Iemes que pose I'hydrodynamique, J. Math. Pures Appl., (9) 12 (1933), 1-82. [16J 1. Leray, Sur Ie mouvement d'un liquid visqueux emplissant I'espace, Acta Math., 63 (1934), 193-248. 790 [17] J. L. Lions, Quelques methodes de reso- lution des problemes aux limites non lineaires, Dunod, 1969. [18J A. Matsumura and T. Nishida, The initial value problem for the equations of motion of viscous and heat-conductive gases, J. Math. Kyoto Univ., 20 (1980), 67-104. [19J J, Nash, Le probleme de Cauchy pour les equations differentielles d'un fluide general, Bull. Soc. Math. France, 90 (1962), 487-497. [20] B. L. Rozhdestvenskii and N. N. Ya- nenko, Systems of quasilinear equations and their application to gas dynamics (in Russian), Nauka, 1978. [21] R. Temam, Navier-Stokes equations, North-Holland, 1977. 205 (xx. g) Hydrodynamics A. General Remarks Gases and liquids are easily deformed, and they share many kinetic properties. They are examples of fluids. By definition, a fluid is a continuous substance having the property that when it is not moving, any part of the sub- stance separated from the rest by a surface exerts an outward force that is perpendicular to the given surface. Hydrodynamics (or fluid dynamics) is con- cerned with the equilibrium and the motion of gases and liquids without considering their molecular structure. In particular, the branch of the theory concerning fluids in equilibrium is called hydrostatics, and hydrodynamics sometimes refers to the branch concerning fluids in motion. There are two methods of describing the motion of a fluid. One regards a fluid as a sys- tem consisting of an infinite number of par- ticles and discusses the motion of each parti- cle as a function of time. This is Lagrange's method. For example, suppose that a fluid particle with the coordinates (x, y, z) = (a, b, c) at the moment t = 0 has coordmates x = ft (a, b, c, t), y= f2(a, b, c, t), z = I(a, b, c, t) at an arbitrary time t. Then the motion of the fluid is perfectly determined by the functions II, f2' and f3' The other is Euler's method, which discusses the values of the velocity v(u, v, w), the density p, the pressure p, etc., of the fluid at arbitrary times and positions. From this standpoint each quantity of the fluid is regarded as a function of a space-time point (x, y, z, t). The rate at which any physical quantity F varies while moving with the fluid particle is 
791 the Lagrangian derivative DF/Dt, which is related to the ordinary partial derivatives by DF cF of of cF -=-+u-+v-+w- Dt Dt ox oy Dz' The three components (u, v, w) and the two state quantities (p, p) (in general, other state quantities, for example, the temperature T and the tentropy S, are assumed to be determined by equations of state such as T= T(p, p), S = S(p, p) are determined by five (= I + 3 + 1) relations derived from the conservation laws of mass, momentum, and energy, namely, the equation of continuity, which corresponds to the conservation of mass, up/Dt + div(pv) = 0; the equation of motion, which corresponds to the conservation of momentum, c(pv)/Dt +div(pv @ vp)= pK, where K is the external force per unit mass, p is the stress tensor, and @ denotes the ttensor product, while tdivergence is applied to each row vector, and by virtue of (1), the equation (2) can be expressed component-wise as Du Dpxx Dpxy cpxz P- D =- c +--;;-+-;-+pKx, t x oy cz Dv CPYX OPyy CPYZ P- D =;;-+--;;-+-;-+pKy, t uX oy uZ Dw oPzx Cpzy (Jpzz P D =;;-+--;;-+-;-+pKz; t eX ey uZ and the energy equation, which corresponds to the conservation of energy, c(pv 2 /2 + pE)/Dt +div(pv(v 2 /2+ E)- v. p +h)= pv' K, (3) or the equation of entropy production, which is another expression of (3), pTDS/Dt= -divh+Q, where E is the internal energy per unit mass, Q the heat generated per unit time and volume, and h the heat flux. Here K, Pib h, and Q or their relations with other quantities (e.g., h==  K grad T, where K is the thermal conductiv- ity) are assumed to be known. B. Perfect Fluids When there is a velocity gradient in the flow, a tangential stress appears which tends to make the velocity uniform, so that p is not a diagonal tensor (- P(5ib i.e., pressure). This property is called fluid viscosity. Generally, Q and h do not vanish in this case. However, in order to simplify the problem we consider a 205B Hydrodynamics non viscous (sometimes also adiabatic) fluid, which is called a perfect fluid and is a good approximation in the large of actual fluids. The motion of a perfect fluid is determined by Euler's equation of motion pDv/Dt= -gradp+pK, (4) (1) which is obtained from (1) and (2) by replacing Pik by the pressure only, and also by the ther- modynamic relation DS/Dt=O obtained from (3') by putting Q = 0 and h == 0 or its integral S = constant in homentropic flow, which is governed by the adiabatic law prxpY, where y denotes the ratio of specific heat at constant pressure to that at constant volume. In partic- ular, for a liquid, the density variation can be neglected. Putting p = constan t in (1). we have divv=O, (5) (2) which, in conjunction with (4), determines four unknowns (u, v, w, p) as functions of (x, y, z, t). A fluid of constant density is called an in- compressible fluid, and one of variable density a compressible fluid. Even though it might seem natural to consider gases as examples of compressible fluids, they can be treated as incompressible fluids if the speed of the flow of the gas q = Ivl is small compared \'lith the velocity c = J dp/dp of sound propagating in the gas. We cal1 q/c= M the Mach number, The vector m«(, I}, 0, which is derived from the velocity vector v as m = rot v, is called the vorticity. A small part of the fluid rotates with angular velocity m/2. If m = 0, the flow is cal1ed irrotational, otherwise rotational. The curves dx:dy:dz=u:v:w and dx:dy:dz=(:I]:( are cal1ed, respectively, stream lines and vortex lines. The line integral fcv,ds along a closed circuit C is cal1ed the circulation around C. In irrotational flow, the velocity is expressed as v = grad <1>, where <1> is cal1ed a velocity poten- tial. When the external force K has a potential Q (K = - gradQ) and p is a definite function of p, we have the pressure equation 0<1> 1 f d P +- 2 q2+ -+Q=constant, ct p which is valid everywhere in the flow. In a steady flow, I f d P _q2 + -+Q==constant 2 p (2') (3') (6) is valid along each stream line or each vortex line; this is called the Bernoulli theorem. These two equations correspond to tenergy integrals of the equation of motion. Furthermore, cor- responding to the conservation of tangular momentum, we have Helmholtz's vorticity theorem: When K == - gradQ and p== f(p), vorticity is neither created nor annihilated in the fl uid. 
205 C Hydrodynamics For the irrotationa1 motion of an incom- pressible fluid, tLap1ace's equation <1> == 0 is derived from (5). Hence the problem reduces to the determination of a tharmonic function <1> under appropriate boundary conditions (e.g., for a fixed wall, normal velocity V n = a<1>jan = 0). For the 2-dimensional problem a stream function \fI is introduced to satisfy (5) by the relation u = a\fl jay, v == - a\fl jax, Since the tCauchy-Riemann equations a<1>jax= rJ\fI jay, a<1>jiJy = - iJ\fI jax are valid in this case, f = <1> + i\fl is an tana1ytic function of z = x + iy. Therefore the theory of 2-dimensiona1 irro- tationa1 motion is essentially equivalent to the theory of complex t a na1ytic functions, and consequently the theory of tcomformal map- ping is a powerful method in the theory of such fluid motion, For irrotationa1 steady flow of a compress- ible fluid in which Q=O, c is determined from (6) as a function of q. Then (1) and (4) yield a tnon1inear partial differential equation for <1>: ( 1_ u2 ) a2<1> + ( I_ v2 ) a2<1> + ( 1- w2 ) a2<1> c 2 {)x 2 c 2 ay2 c 2 az 2 vw {)2<1> wu {)2<1> uv a 2 <1> -2---2---2-=0. (7) c 2 ayaz c 2 azax c 2 axay This equation is telliptic or thyperbo1ic ( 326 Partial Differential Equations of Mixed Type) according as M is less than I (subsonic) or greater than I (supersonic). For 2-dimensional flow, we can introduce a stream function \fI from (I) by u=a<1>jax= (Jjp)(a'Pjay), v=rJ<1>jay= -(ljp)(a\fljax). By utilizing the idea of t Legendre transformation, this system of nonlinear equations for <1> and \fI can be reduced to a system of linear eq uations in the hodograph plane (q, e): a<1> d ( 1 ) a\fl 1- M 2 a\fl aq=q dq pq 8ii= -r;q-8ii' a<1> q a\fl - -- ae p aq (d(pq)jdq = p (I - M 2 )), where the independent variables q and 0 are the magnitude and the inclination of the velocity, respectively. The treatment of 2-dimensional compressible flow on the basis of this system is called the hodo- graph method. For a flow of small M, there is a method of successive approximation (M 2 _ expansion method) which starts from Laplace's equation, neglecting the terms of O(M 2 ) in (7). For uniform flow (velocity U in the x- direction) past a thin wing or slender body where v and ware small, we have thin wing theory or slender body theory, whose first approximation is a 2 <1> a 2 <1> a 2 <1> (I-M2)+--;;-2+- a 2 =0. ox uy Z 792 If M < 1 or M > 1 (although not too large) a linearization (Prandtl-Glauert approximation) is possible by replacing M by the Mach num- ber at infinity Moo = U jc oo . For M> 1, (8) has a tcharacteristic surface, which is the Mach cone whose central axis makes an angle arcsincjq= arcsin IjM with the flow. This can be inter- preted also as an envelope produced by spher- ical sound waves with velocity c from a source drifting with velocity q. For M  1, we put <1>= c*x+<p (c* is the fluid velocity when q=c). Then for an adiabatic gas, (8) is approximated by a partial differential equation of tmixed type: a 2 <p a 2 <P y + 1 a<p a 2 <p -+-=---. ay2 {)Z2 c* ax ax 2 (9) Such a flow in which both domains M Z I co- exist is called the transonic flow, and exact solutions by the hodograph method are known. However, continuous deceleration from M> 1 to M < I generally tends to be unstable or impossible, and the appearance of a shock wave, i.e., a discontinuous surface of state quantities, is not unusual. This can be considered as the tweak solution of (I), (2) (3) for a perfect fluid. In particular, in the coordi- nate system fixed to the surface, its integrated form can be obtained as follows: [PVnJ = 0, [pbin+pvivnJ=O, [q2j2+E+pjpJ=0([ ] is the jump of the quantity at the surface, and n is the normal component). Supplemented by the entropy increase, these formulas give rela- tions between the fluid velocity and the state variables at the front and back of the shock. In an ideal gas they are called the Rankine- Hugoniot relation. Entropy is not uniform behind a curved shock, and the flow is not irrotational. For a weak shock starting from the tip of a pointed slender body, however, the discontinuity is small and approaches the tcharacteristic surface of (8), i.e., the Mach wave (compressive wave, in this case). Rare- factive Mach waves are found in the super- sonic flow of acceleration around a convex surface. Such waves contribute to the drag on an obstacle placed in supersonic flow. C. Viscous Fluids (8) A body moving uniformly in a fluid at rest (with velocity less than that of sound) suffers no drag as long as the viscosity of the fluid is negligible and the flow is con tin uous (d' Alem- bert's paradox). Hence we must take the vis- cosity into account in order to discuss the creation and annihilation of vortices, the gen- eration and structure of shock waves, and the drag acting On obstacles, For this purpose, we extend Newton's law stating that frictional stress is proportional to the velocity gradient 
793 and assume that the stress tensor p is a linear function of the rate-of-strain tensOr e: 2 Pxx= -p+Ji'divv+Jiexx-3Jidivv au ( 2 ) . = - p-f 2Ji-+ Ji' --Ji dIVV,..., ax 3 P = Ji e = Ji( aw + av ) ,.... yz yz ay az The proportionality constants Ji and Ji' are called, respectively, the coefficients of (shear) viscosity and bulk viscosity, The bulk viscosity is sometimes neglected (Stokes's assumption) in the usual hydrodynamics, and then the mean value of the normal stress components equals the pressure. When a fluid satisfies this linear relation between p and e, it is called a New- tonian fluid. Otherwise, it is called a non- Newtonian fluid. Except for a few cases, such as colloid solutions, fluids can be regarded as Newtonian. If we take the viscosity into account, the equation of motion of an incompressible fluid becomes pDvjDt = pK - gradp + JiV, This is called the Navier-Stokes equation. A nondimensional quantity R = pU Lj Ji formed by representative length L, velocity U, density p, and viscosity Ji of a flow is called the Rey- nolds number, In order for two flows with geometrically similar boundaries to share similar kinetic properties, their Reynolds num- bers must be equal. This is called the Reynolds law of similarity, For small R, we can approximate the equa- tion of motion (10) by replacing the accelera- tion DvjDt by avjat (Stokes approximation) or by uvjat + Uavjax (Oseen approximation) for a body placed in the uniform flow of velocity U in the x-direction. For large R, the flow can be regarded as that of a perfect fluid, since we can neglect JiV as long as the velocity gradient is not too large. In the vicinity of a fixed wall, however, the velocity gradient becomes large, because in a very thin layer the velocity decreases rapidly from the value U of a perfect fluid to zero at the waIl, This layer is called the boundary layer. For the boundary layer, Prandtl's bound- ary layer equation au au au au au Ji a 2 u -+u-+v-=-+ U -+--;;-z, at ax ay at ax p oy au av -+-=0 ax ay is valid, where x and yare the coordinates paraIlel and perpendicular to the wall, respec- 205 E Hydrodynamics (10) tively, and U is the velocity outside the bound- ary layer. If aUjax<o, it sometimes happens that the boundary layer separates from the surface of the body, In this case a vortex is generated in the flow, as large vorticities in the boundary layer are carried into the flow, For a body without separation of the boundary layer, the d'Alembert paradox holds and is no longer a "paradox," and the drag is smalL Such bodies are called streamlined. In compressible flow the new problem arises of the interaction of shock waves with the boundary layer. A rapid increase in pressure due to the shock wave formed on the surface of a body invalidates the assumption of a boundary layer and causes its separation. If the Mach number becomes sufficiently large (M;:;: 5, hypersonic flow), the bow shock ap- proaches the body and interferes with the boundary layer. The generation of heat at the boundary layer (e.g., viscous dissipation in Q) requires the consideration of heat transfcr as weIl as viscosity. In this manner, it becomes necessary to treat a complete system of equa- tions which take into account the energy equation (3) as well as the temperature de- pendence of K, Ji, and Ji'. D. Laws of Similarity For such complicated systems, tdimensional analysis is often useful ( 116 Dimensional Analysis). As laws of similarity, we can con- sider not only those like the Reynolds law but also others for bodies which transform similar- ly by taffine transformations, Corresponding to equation (8), the Prandtl-Glauert law of similarity for subsonic flow is famous: The pressure coefficient (nondimensional pres- sure change) for a thin wing of chord (i.e., the length in the direction of flow) 1, span L, and thickness r is Cp(L,r)=ACpo( Jl-M L,rj JI-M ..1.), when ..1. is an arbitrary constant and Cpo is C p for a body of scaled length and thickness placed in an incompressible flow. Correspond- ing to (9), an extension of the famous von Karman transonic similarity is possible: Cp(L,r)=r2/3(y + 1)-1 / 3 xf( J II-M1 L,(y + l)rjll-MI3j2). E. Turbulence (11 ) For low Reynolds numbers, the flow generally has smooth streamlines, For high Reynolds numbers, however, extremely irregular motion in space and time appears, The former is called 
205 F Hydrodynamics laminar flow, and the latter turbulent flow ( 433 Turbulence and Chaos). The transition from laminar to turbulent flow is considered to be due to the instability of the laminar flow, and this transition has been studied by the method of small oscillations. Recently, non- linear effects have also been examined. Regard- ing the internal structure of turbulence, statis- tical theories originated by T. von Karman and G. I. Taylor (Proc. Roy. Soc. London, 151 (1935)) and A. N. Kolmogorov (Dokl. Akad. Nauk. SSSR, 30 (1941)) are of central importance. F. Water Waves tSurface waves that occur on the free surface of water (or other liquids) are ca\1ed water waves; their restoring forces are gravity and surface tension. If we consider waves generated on still water (assumed to be inviscid and incompress- ible) in equilibrium, we can regard the flow field associated with the wave motion as tirro- tational by virtue of tHelmholtz's vorticity theorem. Hence the flow velocity can be de- rived from the tvelocity potential <1> which satisfies tLaplace's equation <1> = 0, together with the boundary conditions D(H+z) Dt a<1> 0 H a<1> a H a<1> +--+-==0 ax ox ay ay oz at z= -H(x,y), (12) D(h - z) ah a<1> ah a<1> oh a<1> -=-+--+----==0 Dt at ax ox ay oy az at z = z(x, y, t), (13) a<1> 1 2 Po - P 0" -+-(grad<1» +gz=-=- at 2 P pRm at z=h(x,y, t), (14) where the Cartesian coordinates x and yare taken in the undisturbed horizontal free sur- face, while the positive z-axis is verticaIly upward. The equations z= -H(x,y) and z= h(x, y, t) denote, respectively, the bottom sur- face (assumed to be known) and the elevation of the free surface measured from the un- disturbed level z = 0, while 9 stands for the gravitational acceleration, 0" the surface ten- sion, Po the atmospheric pressure, and IjR m the tmean curvature of the disturbed free surface expressed as = l{ 1 + ( oh ) 2 } a2h Rm ax iJ y 2 + { 1 + ( ah ) 2 } a2h _2 0h ah a 2 h ] ay ax 2 ax ayoxay 1{I+ G) 2 + G Yr2 794 The conditions (12) and (13) imply, respec- tively, that the fluid does not cross the bottom and the free surface, while (14) expresses the fact that the difference between atmospheric and fluid pressures at the free surface is equal to the normal force (per unit area) due to the surface tension. Thus the problem is formu- lated as a nonlinear boundary value problem for Laplace's equation including the unknown boundary z = h(x, y, t). Let us first consider linear waves for which the wave amplitude of the surface elevation is much smaller than any other characteristic linear dimension such as the wavelength or the water depth H (for simplicity, we assume hereafter H = constant, i.e., a flat horizontal bottom). Linearizing the boundary conditions (12)-(14) with respect to h and grad <1>, and assuming a sinusoidal wave proportional to exp[i(k' r-wt)J, k(k x , ky) and w being respec- tively the (2-dimensional) twave number vector and the tan gular frequency and r = (x, y), we obtain the dispersion relation ( O"k2 ) w 2 = 1 +r;g gktanh(kH). (15) In a layer of stiIl water the waves are isotropic in the horizontal plane and the dispersion relation involves only the magnitude k of the wave number vector. It is readily seen from (15) that the water waves are typical tdispersive waves in which the tphase velocity c p ( = wjk) depends on the wave number k or the wavelength ..1.( = 2njk). It is also evident from (15) that the quantity O"k 2 j(pg) measures the relative importance of surface tension and gravity. Hence for waves with wavelengths much larger than Am = 2n J O"j(pg) (1.7 em for water), the effect of surface tension is negli- gible, and we have gravity waves. Conversely, when A« }.m, the effect of surface tension becomes dominant, and we have capillary waves or ripples. When the water depth H is much larger than the wavelength ..1., we can approximate (15) by w 2 = gk + O"P j p, since tanh(kH) 1. We calI such waves deep water waves. On the other hand, if H is much sma\1er than the wavelength (kH «2n), we have shal- low (or long) water waves for which (15) can be approximated by w 2 =gHk 2 [1 + {O"j(pgH 2 )_ Ij3} (kH)2+...J if O"j(pgH 2 ) = 0(1). In partic- ular, if we neglect 0(kH)2, we recover the well-known dispersionless long gravity waves whose phase velocity is simply J9ii. In all the cases mentioned above, the amplitude function of the velocity potential <1> is proportional to -iwcosh{k(z+H)}j{ksinh(kH)}, so that the flow velocity due to deep water waves de- creases exponentiaIly as one proceeds verti- cally downward from the free surface. In the 
795 limiting case of long gravity waves, however, the fluid motion is nearly horizontal through- out the fluid layer. When the wave amplitude becomes larger, nonlinear effects are no longer negligible. For such waves, various tsingular perturbation methods provide powerful tools, For example, the basic system of equations for weakly non- linear (I-dimensional) shaIlow water waves can be reduced to a simple solvable nonlinear equation caIled the tKorteweg-de Vries equa- tion, whose solitary wave solution is known as a prototype of solitons ( 387 Solitons). Another classical example is a (I-dimensional) deep gravity wave called the Stokes wave, which can be obtained as a power series in the wave steepness (amplitude x wave number), The first term of the series is of the form of a linear sinusoidal wave, and the higher-order terms correspond to the higher harmonics, while the angular frequency is shifted from the linear case and depends not only on the wave number but also on the amplitude, Similar singular perturbation methods have also been applied to various kinds of resonant interac- tions such as nonlinear self-modulation, higher harmonic resonances, and multiwave interac- tions. Finally it should be mentioned that an exact solution representing a (I-dimensional) deep capillary wave was obtained by G. D. Crapper (j, Fluid Mech., 2 (1957) 532-540; extended later to the case of finite depth by W, Kinnersley, j, Fluid Mech., 77 (1976), 229- 241), This is one of the few realistic exact solu- tions obtained so far; a famous exact solution of Gerstner's trochoidal wave does not satisfy the irrotationality condition. References [IJ H. Lamb, Hydrodynamics, Cambridge Univ. Press, sixth edition, 1932. [2] S. Goldstein, Modern developments in fluid dynamics, Clarendon Press, 1938. [3J R. Courant and K. O. Friedrichs, Super- sonic flow and shock waves, Interscience, 1948. [4J L. Howarth, Modern developments in fluid dynamics, High speed flow I, II, Oxford Univ. Press, 1953, [5J K. Oswatitsch, Gas dynamics, Academic Press, 1956. [6J H. W. Liepmann and A, Roshko, Elements of gasdynamics, Wiley, 1956, [7J J. J. Stoker, Water waves, Interscience, 1957. [8J L. D, Landau and E. M. Lifshitz, Fluid mechanics. Pergamon, 1959. (Original in Rus- sian, 1954.) [9J K. G, Guderley, The theory of transonic flow, Pergamon, 1962, 206 A Hypergeometric Functions [IOJ L. Rosenhead, Laminar boundary layers, Clarendon Press, 1963. [l1J G. K. Batchelor, An introduction to fluid dynamics, Cambridge Univ. Press, 1967. [12J G. B. Whitham, Linear and nonlinear waves, Wiley, 1974. [13] M. J. Lighthill, Waves in fluids, Cam- bridge Univ. Press, 1978. [14] H. Schlichting, Boundary-layer theory, McGraw-Hill, 1979. 206 (XIV.5) Hypergeometric Functions A. Hypergeometric Functions The tpower series qy)  qex+n)qfJ+ n ) F(ex fJ Y' z) L. zn , " qex)qfJ).1 n!qy+n) in the complex variable z is called the hyper- geometric series or Gauss's series. It is convergent for any ex, fJ, and y if Izi < 1, and is convergent for Re(ex+ fJ - 1') < 0 if Izi = 1. If z= I, its sum is equal to qy)qy-ex- fJ)/qy - ex)qy - fJ) (except when I' is a nonpositive integer). The hypergeometric functions are obtained as analytic continuations of the functions determined by hypergeometric series that are single-valued analytic functions defined on the domain obtained from the complex plane by deleting a line connecting branch poin ts z = I and z = 00 ( Appendix A, Table 18.1). A hypergeometric function is a solution of the differential equation d 2 w dw z(l-z) dz 2 +(y-(ex+ fJ +l)z)d;-ex fJ w==O, (I) which is ca11ed the hypergeometric differential equation or Gaussian differential equation, This equation is a differential equation of tFuchsian type with tregular singular points at 0, 1, and 00, whose solutions are expressed, in terms of the t P-function of Riemann, by w==p {  I-I' 1 } o z. yex-fJ 00 ex fJ If anyone of the values of y, I' - ex- fJ, or ex- fJ is integral, there exists a series containing logz, representing a solution of the differential equation (1) in a neighborhood of the corre- sponding singular points. When none of the 1', y  ex - fJ, or ex - fJ values is integral, since the linear transformations z' = z, z' = l/z, z' = I -- z, 
206B Hypergeometric Functions z'=z/(z-I), z'=(z-I)/z, z'= 1/(I-z) permute singular points, there exist 24 particular solu- tions around the singular points. The latter fact was first proved by E. E. Kummer (1836), There exist various curves C for which the integral w= Ie u,-I(I-u)Y-,-I(I-zu)-pdu is a solution of (1). Among them we can take the segment [0,1] when Reex>O, Re(y-ex»O. Then the corresponding solution is holomor- phic in the interior of the unit circle, and r(y) F(ex,fJ,y;z) r(ex)r(y-ex) x t u,-I(I-u)Y-'-I(I-zu)-Pdu. Since the integrand has branch points at 0, 1, and l/z, we have the following expression when I' is not an integer: F(ex, fJ, 1'; z) 1 r(y) (l_e 2m (Y-'»(1_e 2m ,) r(ex)r(y-ex) l(1+,O+,I-,O-) x'f u'(I-u)Y-,-I(I-zu)-pdu, where Reex>O, Re(y-ex»O; whereas if I' is an integer, then 1 r(y) F(ex,fJ, y;z)= (l_e2.i') r(ex)r(y-ex) l(I+,o+) x'f u'(I-u)Y-'-I(I-zu)-pdu, where the contour in the first expression en- circles successively each of 1, 0,1, and 0 once with indicated directions. These expressions can be adopted as a definition of the hypergeo- metric functions for the general value of z, Other integral expressions are also known ( Appendix A, Table 18.1). B. The Ladder Method A linear ordinary differential equation of the second order having three regular singular points on the complex sphere is easily trans- formed into an equation of the form (1). To solve such an equation with a parameter, it is often useful to decompose, in two different ways, the main part of the equation into two factors of the first order, and find a recurrence formula involving the parameter, as we shall see in the folIowing example. This method is called the ladder method or factorization method. For example, tLegendre's differential 796 equation Ln[w J =(1- z2)((1 - Z2)W')' + n(n + l)w) = 0 is decomposed as folIows: Ln = Sn' T" + n 2 == T,,+I . Sn+! +(n + 1)2, d T,,=(I-z 2 )-+nz, dz 2 d Sn=(I-z )--nz. dz If W n is a solution of Ln[w J = 0, then multiply- ing both sides of Sn . T" [wnJ + n 2 W n = 0 by T", we find that Tn' Sn(T,,[WnJ) + n 2 (T,,[w n J)= 0, that is, T" [wnJ is a solution of Ln-I [w J = O. Similarly, we see that Sn+![wnJ is a solution of Ln+! [w J = O. In this sense, Sn and T" are caIled, respectively, the step-up operator, or up-ladder, and the step-down operator, or down-ladder, with respect to the parameter n, The above relation constitutes a recurrence formula for Legendre functions ( Appendix A, Table 18.11). C. Extensions of Hypergeometric Functions J. Thomas (1870) proposed the series  ( exdn(ex 2 )n'" (exh)n 1 + L.. zn n1 (Pdn(fJ2)n ..,(fJh)n ' (A)n=A(A+ 1)... (..1.+11-1) as an extension of the hypergeometric series. The sum of this series satisfies the hth-order differential equation dhw d h - I W (l-z)dfh+(A I -BIz) dth-l d h - 2 W +(A 2 -B 2 z) dt h - 2 +... +(Ah-Bhz)w=O, l=logz. When h=2 and fJI = 1, it reduces to the ordi- nary hypergeometric series. The notation pFiexl, ex2, ..., ex p ; PI' P2, ..., fJq; z) = f (exdn(ex 2 )n'" (lX p )n zn, n=O 11! (fJdn( fJ2)n .. . (fJq)n (2) which is due to L. Pochhammer and modi- fied by E. W. Barnes, is used to denote the ex- tended hypergeometric series, and the function defined by (2) is often called Barnes's extended hypergeometric function. For example, Gauss's series in this notation is 2FI (ex, fJ, 1'; z), Corresponding to Barnes's integral expres- sion for hypergeometric functions, it is known that the integral 1 f C+iOO W(z)=---: K()H(Oz-'d, 2m c-;oo where K«()=K( + 1) 
797 and r«( + exl)r«( + ex 2 )... q( + ex h ) Hm= , r( ( + 1 + P d r( ( + 1 + P2) ... r( ( + 1 + fJh) is a solution of the hth-order differential equa- tion at the beginning of this section, The hyper geometric function expressed by the definite integral f('«(-l t «(-Z)'d( has an obvious formal extension f«( -ad b1 «( -a 2 )b,... «( -am)b m «( -z)'d(. On the other hand, the equation m dVw vo <Pv(z) dzv = 0, where ( _l)m-I-v <Pv(z) - (h+m-2)... (h+ l)h (( h+m-V-2 ) x Plml-v\z) m-v-l ( h+m-v-2 ) ) + pbm-,)(z) , m-v PO(z) =(z-ad(z-a 2 )... (z-a m ), ( fJI P2 Pm ) P1(z)=PO(z) -+-+...+- , z-a l z-a 2 z-a m called the Tissot-Pochhammer differential equation, has a solution w(z)= L «(-ad P1 - 1 «(-a 2 )p,-I... x «( -am)Pm- I «( - z)h+m-2 d(, After Pochhammer (1870), this is sometimes called Pochhammer's generalized hypergeo- metric function. As another extension of Gauss's series, H. E. Heine (1846) introduced Heine's series: (l_ q a)(l_ q b) rn ( a b C' . Z ) - 1 + q z 'f' " ,q, - (l-q)(l- q C) (l-qa)(l-qa+I)(I-l)(l-qb+1) 2z + (l_q)(l_ q 2)(I_qC)(I_ q c+1) q +.... Setting q= 1 +£, z=(lj£)logx, and letting <.-->0, we obtain Gauss's series as the limit of Heine's senes. D. Hypergeometric Functions of Several Variables P. Appell (1880) formally extended Gauss's series to the case of two variables and defined 206D Hypergeometric Functions four kinds of functions [3]: FI (ex; p, p'; y;x,y) =   (ex)m+n(P)m{P')n xm n L. L. " ( ) y , m=On=O m.n. Y m+n F 2 (ex; Ii, fJ'; y, y'; x, y) _   (ex)m+n(P)m{P')n m n - L. L. " ( ) ( ' ) x y, m=On=O m.n. Y m Y n F 3 (ex, ex'; P, p'; y; x, y) =   (ex)m(ex')n(P)m{P')n xm n L. L. " ( ) y , m=On=O m.11. y m+n F 4 (ex;P;y, y';x,y) = f f (ex)m+n(P)m+n x m n. m=onom!n!(Y)m(y')n y They are called Appell's hypergeometric func- tions of two variables. Each satisfies a cor- responding system of partial differential equations: x(l-x)r+ y(l-x)s+(y-cx)p-Pyq - expz=O, FI y(l-y)t+x(l- y)s+(y-c'x)q- P'xp -exp'z=O, (x-y)s-p'p+pq=O, { X(I-x)r-x y s+(y-cx)P- pyq-expz=O, F 2 y(l- y)t- xys+(y' -c'y)q - p'xp -exp'z=O, { X(I-X)r+ ys+(y-cx)p-expz=O, F3 y(l- y)t+xs+(y-c"y)q-ex'P'z=O, { X(I_x)r- y2t-2xys+(y-cx)p-cyq -expz=O, F4 y(l- y)t-x 2 r-2xys+(y' -cy)q-cxp -expz=O, where c=ex+P+l, c'=ex+P'+I, c"=ex'+P'+I, p = azj8x, q = azjay, r = a 2 zjax 2 , s=a 2 zjaxay, t=a 2 zja y 2. Appell's hypergeometric functions can also be represented by integrals: for example, F - r(y) ff u P - I VP'-I I r(P)r(P')r(y-p-P') x (1- u - vf-P-P'-l (l-ux- vy) -, dudv, F - r(y)r(y') I I I I UP-1VP'-1 2 - r(P) r(P') r(y - P) r(y' - P') 0 0 x (l-uf- P - I (I-v)Y'-P'-I(I-ux-vy)-'dudv, 
206E Hypergeometric Functions F - qy) ff u p - t P'-l 3 - r(fJ) r(fi') qy - fJ _ fJ') v x (1 - u - vr- P - P '- t (1 - ux) -'(1- vy)-" dudv, where, for Ft and F 3 , the domain of integra- tion is u?O, v?O, l-u-v?O. E. Picard (1881) showed that F 1 can also be expressed by a single integral: Ft = qy) f t u.- t (1- UY-'-1 qex)qex-y) 0 x(l-uxr fl (1-uy)-P'du. G. Lauricella (1893) extended the foregoing functions to the case of more than two vari- ables. More general hypergeometric series of several variables were defined by R. Mellin, J. Horn, and J. Kampe de Feriet [3,4]. Every algebraic equation can be solved analytically in terms of the foregoing functions (Mellin, R. Birkeland) [4]. Also there are studies con- cerning tRiemann's problem and tautomor- phic functions derived from Ft (Picard, T. Terada [5J). E. Hypergeometric Functions with Matrix Argument For symmetric matrices Z of degree m. C. S. Herz defined hypergeometric functions with matrix argument as follows [5]: Denoting by etr Z the exponential exp(tr Z) of the ttrace of Z, let oFo (Z) = etr Z, p+tFq(exl> ...,exP;Pl' ...,fJq;y;Z) = r 1 ( , f etr(-A)pFq(exl,...,exp;fJt,...,(Jq; m Y) A>O AZ)(det AjY-P dAl1 d)'22 ... d}'mm, (3) pF q + 1 ( ex l> ...,ex P ;fJl' ...,fJq;y;;\) rm(y) i = (2 . m(m+O/2 etrZpFq(ex t , ...,ex p ; nl) ReZ=Xo>O fJl, ...,fJ q ;;\Z-t)(detZ)-Ydz t1 dZ 22 ",dz mm , (4) where A=(Ai)i,jt, ,m, Z = «1 + bij)zij/2)i,j1. ,m, rm(y) = n m (m-t)/4qy)qy -1/2)... x r(y-(m-1)/2), and ;\ > 0 means that A is tpositive definitc. The integral (3) converges for - Z > 0 if Rey >(m - 1 )/2, If Rey is sufficiently large, then for suitably chosen Xo, (4) converges in a domain of the space of;\ and represents an analytic function of its argument. In particular, we 798 have tFo(ex;Z) =(det(E - Z))-'. Based on this definition, many special func- tions and formulas are extended to the case of a matrix argument. For example, Ao(Z)= OFt(c5+(m+ 1)/2; -Z)/r m (c5+(m+ 1)/2) (5) is an extension of the tBessel function, and this reduces to (t/2) -0 Jo(t) = Ao«t/2)2) when m== 1. Formula (5) is applied to the tnoncentral Wishart distribution in mathe- matical statistics. References [1] F. Klein, Vorlesungen iiber die hypergeo- metrische Funktionen, Springer, 1933. Also  references to 389 Special Functions. For applications of the ladder method, [2] L. Infeld and T. E. Hull, The factorizatIOn method, Rev. Mod, Phys., 23 (1951), 21-68. For hypergeometric functions of several variables, [3] P. Appell, Sur les fonctions hypergeo- metriques de plusieurs variables, Memor. Sci. Math., Gauthier-Villars, 1925. [4J G. BelardinelIi, Fonctions hypergeome- triques de plusieurs variables et resolution analytique des equations algebriques generales, Memor, Sci. Math., Gauthier-Villars, 1960. [5] T. Terada, Probleme de Riemann et fonc- tions automorphes proven ant des fonctions hypergeometriques de plusieurs variables, J. Math, Kyoto Univ. 13 (1973), 557-578. For the case of a matrix variable, [6J C. S. Herz, Bessel functions of matrix argu- ment, Ann. Math" (2) 61 (1955),474-523. [7J L. J. Slater, Generalized hypergeometric functions, Cambridge Univ. Press, 1966. 
207 A Ideal Boundaries 207 (XI.13) Ideal Boundaries A. Ideal Boundaries For a given Hausdorff space R, a tcompact Hausdorff space R* that contains R as its dense subspace is called a compactification of R, and =R*-R is called an ideal boundary of R. In the present article, we deal mainly with properties (in particular, function- theoretic properties) of ideal boundaries of tRiemann surfaces R. B. Harmonic Boundaries By R we mean an open Riemann surface. The set r of points p* in  such that IiminfR3pp'P(p)=0 for every tpotential P (i.e., a positive tsuperharmonic function P for which the class of nonnegative tharmonic functions smaller than P consists only of the constant function 0) is a compact subset of R*, The set r is called the harmonic boundary of R with respect to R*. For an arbitrary compact subset K in  - r, there exists a finite-valued potential P K with IimR3pp' PK(p) = 00 (p* E K). From this, various kinds of tmaximum prin- ciples are derived. For instance, if u is a har- monic function bounded above for which Iimsuppru(p).s;M holds, then u.s;M on R. There are infinitely many compactifications of R. For two compactifications Rt (i = 1,2) of R, we say that Rj is greater than R! or, equivalently, lies over R!, if the identity map- ping of R can be extended to a continuous mapping of Rj onto R!. In order that deep function-theoretic studies of R* can be carried out, various conditions must be imposed On R*, A compactification R* is said to be of Stoilow type (or of type S) if for every tcon- nected open subset G* in R* whose boundary in R* is contained in R, G* -  is also con- nected. Next suppose that RtOG ( 367 Rie- mann Surfaces E). For a given real-valued function f on , let Ur R ' mr R ') be the class oftsuperharmonic functions s bounded from below (tsubharmonic functions s bounded from above) such that Iim infR3pp' s(p)  f(p*) (Jim sUPR3rp' s(p) .s;f(p*») for every p* E, If these classes are nonempty, then Hf,R'(p)= inf{s(p)lsEU,R'} and Hf'R'(p)=sup{s(p)lsE ll.R'} are harmonic on R, and Hf'R' .s;Hf,R', In particular, if Hf'R' = Hf'R', then the com- mon function is denoted by Hf'R', and the function f is said to be resolutive with respect to R*. A compactification such that every bounded continuous function on  is resolu- tive is called a resolutive compactification. In 800 such a case, a point p* in  is said to be regu- lar with respect to the tDirichlet problem if IimR3Pp' Hf,R'(p) = f(p*) for every bounded continuous function f on  ( 120 Dirichlet Problem). The set r of regular points in  is contained in r. If R* is a resolutive com- pactification, then there exists a unique posi- tive tBorel measure Ji p such that Hf,R'(p)= J,J(p*)dJip(p*) for every bounded continu- ous function f on . This measure is called the harmonic measure with respect to pER. There exists a function P(p, p*) on R x  with dJip(p*) = P(p, p*)dJio(p*) for an arbitrary fixed point 0 in R satisfying the following three conditions: (i) P(p, p*) is harmonic on R as a function of p; (ii) P(p, p*) is Borel measurable as a function of p*; (iii) k( 0, pr l .s; P(p, p*).s; k(o, p), with the Harnack constant k(o, p) of {o, p} relative to R [9]. C. Compactifications Determined by Function Families A family F of real-valued continuous functions on R admitting infinite values is called a sep- arating family on R if there exists an f in F such that f(p) # f(q) for any pair of given distinct points p and q in R. A compactifica- tion R* is called an F-compactification, de- noted by R, if every function in F can be continuously extended to R* and the family of extended functions again constitutes a sep- arating family on R*. The correspondence <p: F -->R defines a single-valued mapping of all separating families F on R onto all F- compactifications of R. If Fl ::::J F 2 , then <p(Fd lies over (fJ(F2)' For any R*, (fJ-I(R*) contains infinitely many separating families, among which the separating families constituting tassociative algebras are important. The fol- lowing are typical examples of compactifica- tions determined by function families: (1) The Aleksandrov compactification is the U-compactification R with the family U of bounded continuous functions on R with compact support. It is the smallest compacti- fication of R and is often used in function theory in discussing Dirichlet problems for relatively noncompact subregions in reference to relative boundaries. (2) The Stone-Cech compactification is the <£:-compactification R with the family <£: of bounded continuous functions on R, It is the largest compactification of R. It is rarely used in function theory, but an application is found in the work of M. Nakai [8]. (3) The Kerekjarto-Stoilow compactification is the 2>-compactification R with the family 2> of bounded continuous functions.r on R 
801 such that there exist compact sets Kf with the property that the f are constants on each connected component of R - K f' This is the smallest compactification of Stoi1ow type. Many applications of this compactification can be found in function theory, among which the investigation done by M. Ohtsuka on the Dirichlet problem and the theory of conformal mappings is typical. (4) The Royden compactification is the 1Ji- compactification R!IJ with the family IJi of bounded COO functions f on R with finite Dirichlet integrals HRdf A *df It was intro- duced by H. L. Royden and developed further by S. Mori, M. Ota, Y. Kusunoki, Nakai, and others. This compactification has been used effectively in the study of HD-functions and the classification problem of Riemann surfaces ( 367 Riemann Surfaces). (5) The Wiener compactification is the Iffi- compactification Ril'D with the family Iffi of bounded continuous functions f on R such that {HJ"} converges to a unique harmonic function independent of the choice of exhaus- tions {G n } of an arbitrary fixed subregion G rt 0G, where the G n are relatively compact sub- regions of G. It is the largest resolu,tive com- pactification, and compactifications smaller than Rifu are always resolutive. This compac- tification was introduced independently by Mori, K. Hayashi, Kusunoki, and C. Constan- tinescu and A. Cornea and is useful for the study of HB-functions and the classification of Riemann surfaces. (6) The Martin compactification is the \JJl- compactification R with the family \JJl of bounded continuous functions f on R such that there exist relatively compact regions Rf with the property that f = H!.-Rf,R.,-Rfj Hf.-Rf,R:;,-R f on R-R f . Here f* coincides with f on Rf and eq uals 0 on R - R, and 1 * is similarly defined. The set R - R is called the Martin boundary of R. If tGreen's func- tion 9 exists on R, then the function m(p, q) = g(p, q)j g(o, q) for an arbitrary fixed 0 E R can be extended continuously to R x R, which is called the Martin kernel. By the metric d 9J1 (q, r) = SUPpERolm(p, q)j(1 + m(p, q)) -m(p, r)j (1 + m(p, r))J with a parametric disk Ro in R, R:JJI is tmetrizable. This compactification was introduced by R. S. Martin, and many appli- cations of it to the study of H P-functions, potential theory, Markov chains, and cluster sets were obtained by M, H. Heins, Z. Kura- mochi, J. L. Doob, Constantinescu and Cor- nea, and others. (7) For a function f on R, (R)3fj3n=0 means that there exists a relatively compact subregion Rf such that f is of class Coo on R outside Rf and the Dirichlet integral of f over R -  is not greater than those of functions 207D Ideal Boundaries on R - Rf that coincide with f on the bound- ary of Rf' The Kuramochi compactification is the R-compactification R with the family R of bounded continuous functions f on R satisfy- ing (R)3fj3n =0. The continuous function k(p, q) on R such that (R)3kj3n = 0 vanishes in a fixed parametric disk Ro in R and is har- monic in R - Ro except for a positive tlogarith- mic singularity at a point q can be extended continuously to R x R, which is called the Kuramochi kernel By the use of this kernel, R is metrizable, as in the case of Martin compactification. This compactification was introduced by Kuramochi, and its important applications to the study of HD-functions, potential theory, and cluster sets were made by Kuramochi, Constantinescu and Cornea, and others. Among compactifications (1)-(7), no bound- ary point in (2), (4), or (5) satisfies the tfirst countability axiom (hence they are not metriz- able), while the others are all metrizable, In (4) and (5), r = r. Fig. 1 shows the relationship among the seven examples. Here A --> B means that A lies over B, and A # B means that in general neither A --> B nor B --> A. R' . .:'-............ . Ra-R", II' "'" Re--R'II '-..R;-RV" Fig. 1 D. Remarks In contrast to topological compactifications (1)-(3), (4)-(7) can be regarded as potential- theoretic and have enough ideal boundary points so that one can solve the Dirichlet problem and introduce vario:.Is measures there. Utilizing Green's functions, R. S. Martin [6] deduced the first important compactifi- cation and gave the integral representation of positive harmonic functions (the extension of the tPoisson integral). Z. Kuramochi [3J ob- tained his compactification similarly by using N-Green functions introduced by himself instead of the usual Green's functions. In this compactification, the ideal boundary points can be considered to be interior points of the surfaces in a potential-theoretic sense. In case of finitely connected domains with smooth boundaries, both the Martin and Kura- mochi boundaries coincide with the usual boundaries. The Royden and Wiener compactifications were introduced as the maximal ideal spaces of respective function algebras. Their ideal boundaries contain extremely many points. However, these compactifications have elegant 
207 Ref. Ideal Boundaries properties and many applications. An analytic mapping <p of a Riemann surface R into an- other R' is called a Dirichlet (Fatou) mapping if <p can be extended to a continuous map- ping of R(Ri1D) to R::(R). For example, a Linde10fian mapping (AD-function) is a Fatou (Dirichlet) mapping. The Dirichlet and Fatou mappings were investigated by Constanti- nescu, Cornea, and others. By using the Martin boundary, Z. Kuramo- chi and M. Nakai proved the extension of the Evans-Selberg theorem to parabolic Riemann surfaces [3,8]. The normal derivatives of H D- functions on the ideal boundaries and their applications were studied by Constantinescu, Cornea, F. Maeda, Y. Kusunoki, and others. The theory of compactification can be gen- eralized to domains in R n , tGreen spaces, and harmonic spaces. References [IJ C. Constantinescu and A. Cornea, Idea1e Riinder Riemannscher F1iichen, Springer, 1963. [2J K. Hayashi, Sur une frontiere des surfaces de Riemann, Proc. Japan Acad., 37 (1961), 469-472. [3J Z. Kuramochi, Mass distributions on the ideal boundaries of abstract Riemann surfaces I, II, Osaka Math, J" 8 (J 956), 119 137, 145- 186. [4J Y. Kusunoki, On a compactification of Green spaces, Dirichlet problem and theorems of Riesz type, J. Math. Kyoto Univ., I (1962), 385402. [5J Y. Kusunoki and S. Mori, On the har- monic boundary of an open Riemann surface I, Japan. J. Math., 29 (1959), 52-56. [6J R. S. Martin, Minimal positive harmonic functions, Trans. Amer. Math. Soc., 49 (1941), 137172. [7J S. Mori, On a compactification of an open Riemann surface and its application, J. Math. Kyoto Univ., 1 (1961), 2142. [8J M. Nakai, On Evans potential, Proc. Japan Acad., 38 (1962),624-629. [9J M, Nakai, Radon-Nikodym densities between harmonic measures on the ideal boundary of an open Riemann surface, Nagoya Math. J., 27 (1966), 71-76. [lOJ H. L. Royden, On the ideal boundary of a Riemann surface, Contributions to the Theory of Riemann Surfaces, Princeton Univ. Press, 1953, 1071O9. [11 J F. Maeda, M. Ohtsuka, et aI., Kuramochi boundaries of Riemann surfaces, Lecture notes in math. 58, Springer, 1968. [12J L. Sario and M. Nakai, Classification theory of Riemann surfaces, Springer, 1970. 802 208 (X.7) Implicit Functions A. General Remarks Historically, a function y of x was called an implicit function of x if there was given a functional relation f(x, y) = 0 between x and y, but no explicit representation of y in terms of x ( 165 Functions). Nowadays, however, the notion of implicit function is rigorously defined as follows: Suppose that a function f(x l' ..., x n , y) is oftclass C 1 in a domain G in the real (n + I)-dimensional Euclidean space Rn+1 and that f(x?,..., x ,yO)=O, fy(x?, ..., x ,yO) # 0 at a point (x?,..., x ,yo) in G. Then there is a unique function g(x l' .., ,x n ) of class C 1 in a neighborhood of the point (X?, '" , x) that satisfies f(XI, ..., x n , g(x 1 , ..., x n )) = 0, yO = g(x?, '" , x) (implicit function theorem). The function 9 is called the implicit function determined by f = O. The partial derivatives of 9 are given by the relation Dgjoxj= -WjDx)/WjDy), where y = g(x t , ,.. , x n ). If the function f is of class cr (1 .s; r.s; 00 or r = w), then the function 9 is also of class cr. In particular, when n = 1, letting Xl be x, we have dgjdx= -fxjfy. B. Jacobian Matrices and Jacobian Determinants A mapping U from a domain Gin Rn into Rm u(x) =(udx 1 ,..., x n ),..., um(x l ,..., x n )), X=(Xl'"'' x n ), is called a mapping of class C' if each compo- nent u h .. . , U m is of class C r (O.s; r.s; 00 or r = w) in G. Given a mapping u of class C 1 from G into Rm, we consider the following matrix, which gives rise to the differential dux of the mapping u ( 105 Differentiable Manifolds I): D(u)jo(x) = (Du)ox k ) I oC;joC;m. I oC;koC;n' (1) This matrix is called the Jacobian matrix of the mapping u at x. If there is another map- ping v of class C 1 from a domain containing the trange U of u into R 1 , then we have the law of composition: (D(v)jD(u))(D(u)jD(x)) = o(v)jD(x). When n = m, the tdeterminant of the matrix (I) is called the Jacobian determinant (or simply Jacobian), and is denoted by D(u)jD(x), D(u t ,..., un)jD(x t ,..., x n ) or D(u l , ...,u n ) D(x l , ...,x n ) 
803 Sometimes the notation a is used instead of D; but in the present article we distinguish the matrix from the determinant, using a for the matrix and D for the determinant. If m = nand D(u)jD(x) never vanishes at any point of the domain G, then U is called a regular (or nonsingular) mapping of class C I. If the Jacobian D(u)jD(x) is 0 at x, we say that u is singular at x. A mapping that is singular at every point in a set S eGis said to be degen- erate on S. For a regular mapping u, the sign of the Jacobian is constant in a connected domain G. If it is positive, the mapping u preserves the orientation of the coordinate system at each point in G, while if it is nega- tive, the mapping changes the orientation. A point where u is degenerate is called a critical point of the mapping u, and its image under u is called a critical value. In general, the image of the mapping is "folded" along the set of critical points. The set of critical values of a mapping u of class C I (sending a domain in R n into Rn) is of Lebesgue measure 0 in Rn (Sard's theorem). If u is a regular mapping, then each point in the domain G of u has a neighbor- hood V such that the restriction of u on V is a ttopological mapping. Its inverse mapping x(u) is also a regular mapping of class C I and satisfies the relation D(u)D(x) --=1 D(x)D(u) (inverse mapping theorem). If u is of class cr (1 .s; r.s; oc or r = w), then so is its inverse mapplllg. C. Functional Relations A function F(u l , ..., un) defined on a domain B in R n is called a function with scattered zeros if F has a zero point (i.e., there exists a point u for which F(u) = 0) and if every open subset of B contains a point u such that F(u) # 0, Every tanalytic function of- 0 has scattered zeros. Let u(x) be a mapping from a domain G in Rn into BeRn. Suppose that there exists a function F(u) defined in B, of class C', with scattered zeros, If F(u(x)) = 0 for every x in G, then we say that the components u 1 , ... , Un of the mapping u have a functional relation of class C r or are functionally dependent of class cr. In such a case, we sometimes say simply that u I' .,. , Un are functionally dependent or that they have a functional relation. If the compo- nents UJ, ... ,Un of a mapping u of class C I are functionally dependent of class Co, then the Jacobian D(u)jD(x) of u must vanish. Con- versely, if the Jacobian D(u)jD(x) of a mapping u of class C I is identically 0 in the domain G, 208D Implicit Functions the components u l , ..., Un are functionally de- pendent of class COO on every compact set in G (Knopp-Schmidt theorem) [1]. D. Implicit Functions Determined by Systems of Functions Suppose that the trank of the Jacobian matrix of (1) is r < m everywhere in G. Suppose that u(x) is a mapping of class C I from a domain Gin Rn into R m , the Jacobian determinant D(u I' .., ,ur)j D(x l' ,.. , x r ) never vanishes in G, and further D(uJ, ..., U" up)jD(x l , ..., x" xa) is identically 0 in G for each p, 0" with r < p.s; m, r < O".s; n. Then the values u, II (x), . - -, u",(x) are determined by the values U I (x), ..., ur(x), and each up is represented as a function of class C I of u 1 , ''', u r . Let u(x) be a mapping of class Cl from a domain G in R n into R m and V the tinverse image of a point u O . To study the properties of the set V, we assume, for simplicity, that U O is the origin. Suppose that the rank of the matrix a(u)jci(x) is r for every point x in G and each of Ur+ I, '" , U m is functionally dependent on UJ, ...,U r . Then each up (r<p.s;m) is a func- tion up(u I , ...,u r ) ofu j , ....u r . The set Vis empty if there is a p such that up(O, ... , 0) # O. On the other hand, if up(O, ...,0) = 0 for all p (r < p.s; m), then V is the set of common zero points of the functions U l (x), ..., ur(x). There- fore, to study the set V, we can assume that r = m.s; n. If m = n, V consists of isolated points only. If m < n, then, changing the order of the variables x I' .. . ,X n if necessary, we can ass ume that D(u j ,..., um)jD(x l ,..., x m ) #0 at a point (x o ) in V. In this case, there is a unique func- tion p(Xm+l' ...,x n ) of class C I (l.s;J1.s;m) in a neighborhood of (x?) satisfying the following two conditions: (i) x = p(X+I' ..., x); (ii) if the point (Xm+I' ..., x n ) is in a neighborhood of (X+I' ..., x), then the point (j(Xm+l' ...,x n ), ""m(Xm+I' ...,x n ), x m + 1 , ...,Xn)E V. Each function p is called an implicit function of x m +1, '" , X n determined by the relations u I = . .. = U m = O. The ttotal derivatives of the " are determined from the system of linear equations m (1u. n cu. L dk+ 2: dx/=O, j= 1, ...,m. k=l UX k /=m+IOX/ The foregoing implicit function theorem is a local one. Among the global implicit function theorems, the following one, due to Hadamard, is useful: Let Xf-> y(x) be a mapping of class C l from Rn into R n such that the inverse <1>-1 (x) 
208E Implicit Functions of its Jacobian matrix $(x) is bounded in R n ; then the mapping is a tdiffeomorphism of class C I from Rn onto Rn. The implicit function theorem holds also in complex spaces. Let h(x l , ... ,X n , Y I' ... , Y p), 1 .s; i.s; p, be a system of tholomorphic functions in a domain in the complex (n + p)-dimensional space C+ p . If (i) /;(x7, ...,x,y7, ...,y)= 0, 1 .s; i.s; p, and (ii) D(/I' ... Jp)/D(y" ..., yp)(X,Y)(XO,yO) # 0, then there exists a unique holomorphic solution Yi = Yi(X I' ..., x n ) (1 .s; i.s; p) in a neighborhood of the point X O = (x, ...,x) that satisfiesh(x l , ...,xn,ydx l , "" x n ), "', Yp(x l ,.., ,xn)=O (l.s; i .s;p), E. Linear Relations Suppose that u(x) is a mapping of class cm-l from R I into R m . Its components (u" ..., u m ) are functions of class C m - l . Then the determinant U l U2 U m U'l U2 u u(m-l) um-l) u(m-l) I m is called the Wronskian determinant (or simply the Wronskian) of the functions U I, .. . , U m and is denoted by W(U l ,U 2 , ...,u m ). If the func- tions u l , ,." U m are tlinearly dependent, i.e" if there exist constants c j not all zero satisfying L]"cl CjU)X) =0 identically, then the Wronskian vanishes identically. Therefore, if W(u 1 ,..., u m ) # 0, then the functions U I' ... 'U m are linearly independent. Conversely, if W(Ul"'" um)=O identically, and further if there is at least one nonvanishing Wronskian for U"..., ui-j, U i +" ..., U m (1 .s; i.s; m), then the functions U 1 , ..., U m are linearly dependent. The necessity of the additional condition is shown by the follow- ing example: U I = X3 and U 2 = I X 1 3 are of class C l in the interval [ -1, IJ and linearly inde- pendent, but they satisfy W(x 3 , Ix1 3 ) = 0 iden- tically. However, the additional condition is unnecessary if the functions u l , ..., U m are analytic. Similar theorems are valid in a do- main in a complex plane. Furthermore, if u(x) is a continuous map- ping from an interval [a, b J in R I into Rm, the determinant (1, 1) ) _ (2,1) G(u",,,,u m - , . , (m,l) (I,m) (2,m) , (m,m) (j,k)= r uix)udx)dx is called the Gramian determinant (or simply Gramian). The Gramian is the tdiscriminant of 804 the quadratic form f b ( t jU)X) ) 2 dx a )-1 of " ... , m and is equal to 1 f b f b -. ... (det(uAx k )))2dx 1 ...dx m . m. a a We always have G(u l ,..., U m );) 0, and G(u" ..., u m ) = 0 if and only if u" ..., U m are linearly dependent. The Gramian is defined if the functions U l , ..., U m are tsquare integrable in the sense of Lebesgue. In that case, the condition G(u" ...,um)=O holds if and only if U l ,..., U m are linearly dependent tal most every- where, i.e., there are constants C I, ." , C m not all zero such that the relation C I U I (x) + ' .. + cmum(x)=O holds except on a set of Lebesgue measure O. References [IJ K. Knopp and R. Schmidt, Funktional- determinanten und Abhangigkeit von Funk- tionen, Math. Z., 25 (1926), 373-381. [2J G. Doetsch, Die Funktionaldeterminante als Deformationsmass einer Abbildung und als Kriterium der Abhangigkeit von Funktionen, Math. Ann., 99 (1928),590-601. [3J W. Rudin, Principles of mathematical analysis, McGraw-Hill, second edition, 1964. [4J H. Cart an, Calcul differentiel, Hermann, 1967. [5J J. T. Schwartz, Nonlinear functional anal- ysis, Gordon & Breach, 1969. 209 (XXI.5) Indian Mathematics India was one of the earliest civilizations, but because it has no precise chronological record of ancient times, it is said to possess no his- tory. Indian mathematics seems to have de- veloped under the influence of the cult of Brahma, as did the calendar. It may also have some relation to the mathematics of the Near East and China, but this is difficult to trace. The word ganita (computation) appears in early religious writings; after the beginning of the Christian Era, it was classified into piitf- galJita (arithmetic), bija-galJita (algebra), and ketra-galJita (geometry), thus showing some systematization. The Buddhists (notably Na- garjuna) had a kind of logic, but it had no re- 
805 1ation to mathematics. Unlike the Greeks, the Indians had no demonstrational geometry, but they had symbolic algebra and a position system of numeration. Indian geometry was computational; Arya- bhata (c. 476-c. 550) computed the value of n as 3.1416; Brahmagupta (598-c. 660) had a formula to compute the area of quadrangles inscribed in a circle; and Bhiiskara (1114- 1185) gave a proof of the Pythagorean theo- rem. In trigonometry, Aryabhata made a table of sines of angles between 0° and 90° for every 3.75° interval. The name "sine" is related to the Sanskritjya, which referred to half of the chord of the double arc. The Indians had a remarkable system of algebra. At the beginning they had no oper- ational symbols and described in words the rules for solving equations. Brahmagupta worked on the tPell equation ax2 + 1 = y2. Bhaskara knew that a quadratic equation can have two roots that can be positive and nega- tive, but did not assign any meaning to the negative root in such cases. Bhiiskara also introduced algebraic symbols. The symbol 0 was used in India from about 200 B.C. to denote the void place in the posi- tion system of numeration; 0 as a number is found in a book by Bakhshiili published in the 3rd century A.D. The number 0 is defined as a - a = 0 in our notation, and the rules a::!:: 0 = a, 0 x a=O, jO=O, O-'-a=O are mentioned. Brahmagupta prohibited division by 0 in arithmetic, but in algebra he called the "quan- tity" a - 0 taccheda. Bhiiskara called it khahara and made it playa role similar to that of our infinity. Some historians assert that the Indians had the ideas of infinity and infinitesimal. Some hold that the Indian position system of numer- ation arose from the circumstance that the names of numbers differed according to their positions. The Indian numeration system was exported to Europe through Arabia, and had great influence on the development of mathematics. References [1] M. Cantor, V orlesungen liber Geschichte der Mathematik I, Teubner, third edition, 1907. [2J G. Sarton, Introduction to the history of science I, From Homer to Omar Khayyam Carnegie Institute of Washington, 1927. [3J B. Datta and A. N. Singh, History of Hindu mathematics I, II, Lahore, 1935-1938. (Asia Publishing House, 1962). 210 B Inductive Limits and Projective Limits 210 (11.25) Inductive Limits and Projective Limits A. General Remarks Inductive and projective limits can be de- fined over any tpreordered set I and in any tcategory. We first explain the definition of these limits in the special case where I is a tdirected set and the category is that of sets, of groups, or of topological spaces. The simplest case is when I is the ordered set N of the natural numbers. B. The Limit of Sets Let I be a directed set. Suppose that we are given a set Xi for each i E I and a mapping <Pji: Xi-->X j for each pair (i,j) of elements of I with i .s;j, such that <pij = 1 Xi (the identity map- ping on X;) and <Pki = <Pkj 0 <Pji (i.s;j.s; k). Then we denote the system by (Xi, <Pji) and call it an inductive system (or direct system) of sets over I. Let S be the tdirect sum U iXi of the sets Xi (iEI), and define an equivalence re- lation in S as follows: XEX i and YEX j are equivalent if and only if there exists a kE I such that i.s;k,j.s;k, <Pki(X) = <Pkj(Y)' Let D be the tquotient set of S by this equivalence rela- tion, and let h: Xi --> D (i E I) be the canonical mappings. Then we have I (I) fj 0 <Pji = h (i .s;j); 1(2) for any set X, and for any system of map- pings gi:Xi-->X (iEI) satisfying gj0<Pji=gi (i .s;j), there exists a unique mapping f: D-->X such that f 0 h = g;(iE I). We call (D,h) the inductive limit (or direct limit) of the inductive system (Xi> <Pj;) over I, and denote it by 1 Xi or ind1imX i (more precisely, by limiElX i or --> ind IimiEl XJ Suppose, dually, that we are given a set Xi for each iEI and a mapping l/Jij:Xj-->X i for each i .s;j, such that I/Jii = 1 Xi and I/Jik = I/Jij 0 I/Jjk (i.s;j.s; k). Then we denote the system by (Xi' I/Jij) and call it a projective system (or inverse system) of sets over I. Let P be the subset of the Cartesian product IIX i defined by P = {(x;)ll/Jij(x)= Xi (i .s;j)}, and let Pi: P-->X i be the canonical mappings. Then we have P(1) I/Jij 0 Pj = Pi (i .s;j); P(2) for any set X, and for any system of mappings qi: X --> Xi satisfying I/Jij 0 qjqi (i .s;j), there exists a unique mapping p: X -->P such that PiO P=qi (iEI). We call (P, p;) the projective limit (or inverse limit) of the projective system (Xi' I/Jij) over I and de- note it by 1 Xi or proj lim Xi' 
210C Inductive Limits and Projective Limits Note that we may replace I by any cofinal subset of I without changing the limits. C. The Limit of Groups and of Topological Spaces If, in the notation of Section B, Xi is a group and <Pji (iJ;i) is a homomorphism, then we say that (Xi, <Pji) «Xi, iJ;i;)) is an inductive (projective) system of groups. The inductive limit (as a set) D = l Xi has the structure of a group for which the canonical mappings.li are homo- morphisms. With this group structure, D is called the inductive limit (group) of the induc- tive system of groups. It satisfies properties I(J) and 1(2) with group X and homomor- phisms !Ii and { Similarly, the projective limit (as a set) P=limX i has a unique group struc- ture such that';;"ach Pi:P-->Xi is a homomor- phism, namely, that of a subgroup of the direct product group IIX i . The group P is called the projective limit (group) of the projective system of groups. When each Xi is a module over a fixed ring A, we get entirely similar results by considering A-homomorphisms instead of group homomorphisms. Next, let Xi be a topological space and (Pji and iJ;ij be continuous mappings. Then (Xi, (Pj;) «Xi> iJ;i;)) is called an inductive (projecthe) system of topological spaces. If we introduce in D = l Xi the topology of a quotient space of the ttopological direct sum of the spaces Xi (iEI), then the t; are continuous, and I(J) and 1(2) hold with sets and mappings replaced by topological spaces and continuous mappings. Similarly, if we view P = l Xi as a subs pace of the tproduct space OX i , then the Pi are continuous and P(I) and P(2) hold with the same modification as before. The spaces D and P are called the inductive limit (space) and the projective limit (space) of the system of topo- logical spaces, respectively. The projective limit of Hausdorff (compact) spaces is also Hausdorff (compact), Furthermore, if the Xi (i E I) form a topolog- ical group and <Pji, iJ;ij are continuous hom- omorphisms, then lim Xi and lim Xi are topo- logical groups, and properties 1(1), 1(2), P( 1), and P(2) are satisfied for topological groups and continuous homomorphisms ( 423 Topological Groups). In particular, projec- tive limits of finite groups are ttotally discon- nected compact groups and are called profinite groups; they occur, e.g., as the ring oftp-adic integers and as the tGalois group of an infinite tGa10is extension. Conversely, the tgerms of continuous functions at a point x in a topolog- ical space X, and other kinds of germs ( 383 Sheaves), are important examples of inductive limits of groups. 806 D. Limits in a Category Let I be a preordered set and «(if a category. If we are given an object Xi of a category C(j for each i E I and a tmorphism <Pji: X i --> X j of C(j for each pair (i,j) of elements of I with i .s;j, and if the conditions <Pii = 1 Xi' <Pki == <Pkj 0 <Pji (i.s;j.s; k) are satisfied, then we call the system (Xi, <Pj;) an inductive system over I in the cate- gory (e. A projective system in «(; is defined dually: It is an inductive system in the tdual category C(j0. Ifwe view I as a category ( 52 Categories and Functors B), then an inductive (projective) system in C(j over the index set I is a tcovariant (tcontravariant) functor from I to ({,. Now if an object DE C{? and morphisms .f.:Xi-->D (iEI) satisfy conditions I(J) and 1(2) with the modification that X is an object and {Ji' fare morphisms in (C, then the system (DJ) is called the inductive limit of (Xi, <Pi;) and is denoted by Iim Xi' Similarly, if an object --> . PEC{? and morphisms Pi:P-->Xi (iEI) satisfy P(J) and P(2) with a similar modification, then (P, p,) is called the projective limit of (Xi' iJ;ii) and is written lim Xi' By 1(2) and P(2), these limits are uniqJ;; if they exist. In the categories of sets, of groups, of modules, and of topological spaces, inductive and projective limits always exist. Note that if the ordering of I is such that i.s; j implies i = j, i,e" if there is no ordering between two distinct elements of I, then the inductive (projective) limit is the tdirect sum (tdircet product) ( 52 Categories and Functors E). Let (Xi, <Pji), (X;, <PJi) be two inductive sys- tems over the same index set I, and let <Pi: Xi-->X; (iEI) be morphisms satisfying <P}i 0 <Pi = <Pi 0 <Pii (i .s;j). Then the system (<Pi) is called a morphism between the inductive systems. Such a morphism is a tnatura1 transformation between the inductive systems viewed as func- tors I -->(t. If li!r Xi and 1iJ;1 Xi exist, then (<p;) induces a morphism lim<pi:limXi-->limX; in --> --> --> a natural way, and similarly for projective limits. For the more abstract notion of limit of a functor  [5]. For the theory of procategories  l4]. References [1] S, Lefschetz, Algebraic topology, Amer. Math. Soc. Colloq. Pub\., 1942. [2J S. Eilenberg and N. Steenrod, Foundations of algebraic topology, Princeton Univ. Press, 1952. [3J H. Cartan and S, Eilenberg, Homological algebra, Princeton Univ. Press, 1956. [4] A. Grothendieck, Technique de descente et theoremes d'existence en geometrie algebrique II. Le theoreme d'existence en theorie formelle 
807 des modules, Sem. Bourbaki, Expose 195, 1959/1960 (Benjamin, 1966). [5] M. Artin, Grothendieck topology, Lecture notes, Harvard Univ. 1962. 211 (X.3) Inequalities A. General Remarks In this article we consider various properties of inequalities between real numbers. An in- equality that holds for every real number (e.g., Xl;" 0) is called an absolute inequality; other- wise it is called a conditional inequality. When we are given a conditional inequality (e.g., x(x - 1) <0), the set of all real numbers that satisfy it is called the solution of the inequality. The process of obtaining a solution is called solving the conditional inequality. B. Solution of a Conditional Inequality Suppose that a conditional inequality is given by f(x) >0 (or f(x);"O), where f is a continu- ous function defined for every real number. If the equation f(x) = 0 has no solution, then we have either f(x»O or f(x)<O for alI x. On the other hand, if ex and fJ (ex < fJ) are adja- cent roots of the equation f(x) = 0, the sign of f(x) is unchanged in the open interval (ex, fi). Therefore the solution of the given inequality depends essentialIy on the solution of the equation f(x) = O. If inequalities involve two variables x, y and are given by {(x, y) > 0, g(x, y) > 0 for continuous functions f and g, the solution is, in general, a domain in the xy- plane bounded by the curves {(x,y)=O and g(x, y) = O. Similar results hold for the case of inequalities involving more than two variables. C. Famous Absolute Inequalities (1) Inequalities concerning means (or averages): Suppose that we are given an n-tuple a = (aI' ...,a n ), ar;"O. We set ( 1 n ) I/r Mr=Mr(a)= - I a . nV=l If at least one a r is 0 and r <0, we put Mr=O. In particular, we put A=M I , ( n ) I/n G=lim M r = IT a v , rCL v""t H=M_ 1 ; 211 C Inequalities these are called the arithmetic mean, geometric mean, and harmonic mean of a v (v = 1, ., . , n), respectively. Except when either all a v are identical or some a, is 0 and r.s; 0, the func- tion M r increases tstrictly monotonically as r increases, and Mr-->mina v (r--> -00), M r --> maxa, (r--> +00). Therefore we always have minav.s; Mr.s; maxa v . In particular, we have H < G < A if the a v are all positive and not all eq ual. Let p(x) (> 0), f(x) (;" 0) be tintegrable func- tions on a tmeasurable set E. We put MrU)=(LpJ'dX! LpdXrr, r#O. Furthermore, if M r (/) is strictly positive for some r> 0, we put Mo(/)= lim Mr(f) r--++O =exp(LplOgf dX ! LPdX). We call M r (/) the mean of degree r of the function f(x) with respect to the weight func- tion p(x). It has properties similar to those of Mr(a). In particular, when the weight function P= 1, the means M 1 (/), Mo(/), M_ 1 (/) are called the arithmetic mean, geometric mean, and harmonic mean of f, respectively. (2) The Holder inequality: Suppose that p #0, 1 and (p-1)(q - 1)= 1; that is, l/p+ l/q = 1, and a v >0, b v >0. Then, in general, we have the Holder inequality: ( ) I/P ( ) l/q avbvz a b , pl, where the inequality signs in the first inequal- ity are taken in accordance with p < 1 or p> 1. The summation may be infinite if the sums are convergent. The inequality sign is replaced by the equality sign if and only if there exist constant factors ..1. and Ji such that Aa = Jib; for all v. The Holder inequality for p = q = 2 is called the Cauchy inequality (or Cauchy- Schwarz inequality). For two measurable positive functions f(x), g(x), we have the Holder integral inequality: LfqdXZ(LJPdX rp(L gqdx r q , p 1, except when there exist two constant factors ..1. and Ji ((..1., Ji) #(0, 0)) such that ijP= Jigq holds talmost everywhere. The above inequality is replaced by eq uality if and only if we are in this exceptional case. The case where p = q = 2 is called the Schwarz inequality (or Bunyakov- skii inequality). (3) The Minkowski inequality: Suppose that p # 0, 1 and a, > 0, b v > O. Then we have the 
211 D Inequalities Minkowski inequality: ( ) I/P ( ) I/P ( ) I/P (av+bv)P Z a + b!' , pl, except when {a v } and {b v } are proportional. The inequality is replaced by equality if and only if we are in the exceptional case. The corresponding integral inequality for positive functions f(x), g(x) is (L (/ +gjPdx r p z(LJPdX r p + (L9PdXrp, pl, except when f(x)/g(x) is constant almost every- where. The inequality is replaced by equality if and only if we are in the exceptional case. D. Related Topics Absolute inequalities are important in anal- ysis, especially in connection with techniques to prove convergence or for error estimates. However, there seldom are general principles for deriving such inequalities, except for a few elementary theorems. For other famous inequalities  Appendix A, Table 8. For related topics  88 Convex Analysis; for convex functions and their ap- plications  255 Linear Programming; for linear inequalities  212 Inequalities in Physics. References [IJ G. H. Hardy, 1. E. Littlewood, and G. P6lya, Inequalities, Cambridge Univ. Press, first edition, 1934; revised edition, 1952. [2J V. I. Levin and S. B. Stech kin (Steckin), Inequalities, Amer. Math, Soc, TransL, (2) 14 (1960), 1-29 (English translation of the appendix to the Russian translation of [IJ). [3J E. F. Beckenbach and R. Bellman, In- equalities, Erg. Math., Springer, second re- vised printing, 1965. [4J E. F. Beckenbach and R. Bellman, An introduction to inequalities, Random House, 1961. [5J N. D. Kazarinoff, Geometric inequalities, Random House, 1961. [6] N. D, Kazarinoff, Analytic inequalities, Holt, Rinehart and Winston, 1961. [7J W. Walter, Differential- und Integral- Ungleichungen, Springer, 1964. [8J O. Shisha (ed.), Inequalities, Academic Press, I, 1967; II, 1970; III, 1972. 808 212 (XX.36) Inequalities in Physics A. Correlation Inequalities Let J1 be a probability measure on a space X (with 0"-fie1d .%') and fj be measurable functions on X, and write <fl...fn)== Lrl(X)...fn(X)dJ1(X), A number of inequalities among such expec- tations under a variety of conditions on the measure J1 and functions fj are known as cor- relation inequalities, after their original occur- rence for correlation functions in the statistical mechanics of lattice gases. The earliest results were obtained by Grif- fiths (J. Math. Phys., 8 (1967)), for the case where X is the product IIf=1 1; of two point sets Ii = {I, -I}, the fare kth coordinate func- tions O"k==O"dx) (== ::1::1) OfxEX, dJ1(x) is the counting measure multiplied by a Gibbs factor Z -I e - PH(x) with a ferromagnetic Hamiltonian N H(x)== - I JijO"i(X)O"j(X), Jij=JjiO, i<j [3  0, and the normalization factor Z = Lxe-PH(X). His conclusions are <O"kO"/) O (Griffiths's first inequality), <O"kO"/O"mO"n)  <O"k(j/) <O"mO"n) (Griffiths's second inequality). These were extended by Kelly and Sherman (J. Math. Phys., 9 (1968) as <O"A) O (GKS first inequality), <(jAO"B)  <O"A) <O"B) (GKS second inequality), where A and B are subsets of {I,..., N}, (jA = II/ EA O"i, and H = - LAcNJAO"A with J A  O. A further generalization (for example, Ii == R) can be found in [1,2]. Under the same situation with N N H= -" J..O".O".-" h.O". J..>-O h.>-O ,  IJ 1 j L.J l " lj....., f..... i<j i=l the following GHS inequality by Griffiths, Hurst, and Sherman (J. Math. Phys., 11 (1970)) holds: <O"iO"jO"k) - <O"i) <O"jO"k) - <O"j) <O"iO"k) - <O"k) <O"iO"j) + 2 <0",) <(jj) < (jk) .s; o. If hi = 0 for all i, then <O"iO"PkO"/) - <0"/0"/) <O"jO"k) - <O"jO"/) <O"iO"k) - < O"k(j/) < O"iO"j) + 2 <O"iO"/) <O"jO"/) < (jkO"/) .s; O. 
809 Further generalizations can be found in [2,3J, and the references quoted therein. Let X be a finite distributive lattice, J1 be a positive measure satisfying the condition J1(x/\Y)J1(xvY)J1(x)J1(Y), and rand 9 be both increasing or both decreasing functions on X. Then the following FKG inequality by For- tuin, Kasteleyn, and Ginibre (Comm. Math. Phys., 22 (1971)) holds: <lg)  (I> (g). B. Inequalities Involving Traces of Matrices Let tr A denote the trace of a matrix A. Some of the earlier results applied to statistical mechanics are as fol1ows, where p  0, 0-  0, A*=A, B*=B: tr(plogp- p logo-- p+ 0-) O (Klein inequality), tr(eA+B)/tre A exp(treA B/tre A ) (Peierls- Bogo lyu bov ineq uaIi ty), tr(eAe B )  tr(e A + B ) (Golden- Thompson inequality). The following inequality is related to the last inequality and was proved for a general m  0 by Lieb and Thirring (Studies in Mathematical Physics, Lieb, Simon, and Wightman (eds.), Princeton Univ. Press, 1976): tr(po-)m.s;tr(pmo-m), pO, o-O. For the Hilbert-Schmidt norm IIAIIH s = (tr A * A)I/2 and the trace-class norm II A lit, = triAl, where IAI =(A* A)I/2, the Powers-Stormer inequality (Comm. Math. Phys., 16 (1970)) holds: Ilp2-0"211,,(llp-o-IIHs)2, pO, o-O. Araki and Yamagami (Comm. Math. Phys., 81 (1981) give IIICI-IDIIiH s. .s;J2IIC-DIIH,s. If C* = C and D* = D, then J2 can be removed. The entropy S(p)= -tr plogp is a concave function of p  0 satisfying the triangular inequality (Araki and Lieb, Comm. Math, Phys., 18 (1970)): S(pd+S(P2) S(PI2) IS(pd-S(p2)1 and strong subadditivity (Lieb and Ruskai, J. Math. Phys., 14 (1973)): S(P123) - S(pd - S(PI3)+ S(Pd.s; 0, where Pl23 is a matrix on the tensor prod- uct space HI @Hz @H 3 , Pjk =tr i p123' Pi= tr j tr k PI 2 3 ( { i,j, k} = {1, 2, 3}) and tr j is a (partial) trace taken only on the space R.i. The 212 C Inequalities in Physics last inequality is based on the concavity of the function f(p)=trexp(A + logp) in p  0 (A* =A) proved by Lieb [4J (also see Epstein, Comm. Math. Phys., 31 (1973», who also proved the joint concavity of tr( C* p 'cO"r) (rO, sO, r+s.s; 1) in p  0 and o-O. This concavity for the case r = s = 1/2 was previ- ously proved by Wigner and Yanase (Proc. Nat. Acad. Sci. US, 49 (1963)), and the general case has been conjectured by Wigner, Yanase, and Dyson. It leads to the joint concavity of the relative entropy S(p, cr) = tr p(1og p -log 0-) (defined to be + 00 if o-Iji = 0 and plji # 0 for some vector Iji) in p  0 and o- 0 as welI as its monotonicity S(ex(p), ex(o-).s; S(p, 0") for any trace-preserving expectation mapping ex. (For entropy, see also [5].) The above results have generalizations in the context of von Neumann algebras (Araki, Publ. Res. Inst. Math, Sci., 11 (1975); 13 (1977); Uhlmann, Comm. Math. Phys., 54 (1977); [6J). C. Operator Monotone and Operator Convex Functions A real-valued function f(x) defined on an interval I (finite or infinite; open, half-open, or closed) is called matrix monotone increasing (decreasing) of order m if f(A) f(B) whenever m x m Hermitian matrices A and B with their eigenvalues contained in I satisfy A  B (A.s; B) and is calIed operator monotone if it is matrix monotone of order n for all positive integers n. f(x) is caIled matrix convex of order m if f[(l - t)A HBJ .s;(1 -t)f(A) + if(B) for O.s; t.s; 1 and alI m x m Hermitian matrices A and B with their eigenvalues in I and is caIled opera- tor convex if it is matrix convex of order n for any positive integer n, The functions x' (O.s;ex.s; 1), log x, and -(x+a)-l (aO) are alI operator monotone increasing in the half- line interval x> O. The functions (x + a)-I and x log x are operator convex in the same interval. A function f(x) is operator monotone in- creasing in an open interval (a, b) if and only if it is analytic in (a, b) and has an analytic con- tinuation to the whole upper half-plane where the function has a nonnegative imaginary part [7]. Another necessary and sufficient condition IS N L f(i+k+1)(X)(i(d(i+k+ I)! O i,k=O for all XE (a, b), for all real (, and for all posi- tive integers N. An operator monotone func- tion f(x) on an open interval ( - R, R) has the integral representation f(x) = f(O) + 1'(0) f x/(I-tx)dJ1(t), 
212 Ref. Inequalities in Physics where J1 is a probability measure with its sup- port contained in [ -R -1, R -IJ and, if f is not a constant, is uniquely determined by f The set of all extremal points of the set of all operator monotone increasing functions on (-1,1) satisfying f(O) = 0 and 1'(0)= 1 is exact- ly the set offunctions x/(1-tx), Itl.s; 1, which appear as integrands in the above formula. An operator monotone function on R must be linear. An operator convex function in an open in- terval (- R, R) has the integral representation f(x) = f(O) + j'(O)x+ (/"(0)/2) f x 2 /(1- tx)dJ1(t), where J1 is a probability measure with its sup- port contained in [ -R -I, R -IJ and, if f is not linear, is uniquely determined by f An operator convex function in R must be at most a quadratic polynomial satisfying 1"(0);;" O. For a continuous real function on an inter- val [0, ex) (O.s; ex.s; 00), the following four condi- tions are mutualIy equivalent: (i) f is operator convex and f(O).s;O, (ii) f(a* Aa).s; a*f(A)a for any self-adjoint operator A with its spec- trum in [0, ex) and any operator a with its norm not exceeding 1, (iii) the preceding condi- tion with a limited to projections, (iv) x -I f(x) is operator monotone increasing in an open interval (0, ex). (See [8-11J and Hansen and Pedersen, Jensen's inequality for operators and Lowner's theorem, Math. Ann., 258 (1982), 229-241. References [IJ 1. Ginibre, General formulation of Grif- fiths' inequality, Comm. Math. Phys., 16 (1970), 310-328. [2J B. Simon, Functional integration and quantum physics, Academic Press, 1979, ch. 4. [3J C. Newman, Gaussian correlation inequal- ities, Z. Wahrscheinlichkeitstheorie und Verw. Gebiete, 33 (1975), 75-93. [4J E. H. Lieb, Convex trace functions and the Wigner-Yanase-Dyson conjecture, Advances in Math" 11 (1973),267-288, [5J A. Wehr!, General properties of entropy, Rev. Mod. Phys., 50 (1978),221-260. [6J H. Araki, Inequalities in von Neumann algebras, CNRS Publication R.C.P., 25 vols., 22 (1975),1-25. [7J K. Lowner, Uber monotone Matrixfunk- tionen, Math. Z., 38 (1934),177-216. [8J 1. Bendat and S, Sherman, Monotone and convex operator functions, Trans. Amer. Math. Soc., 79 (1955),58-71. [9J W. Donoghue, Monotone matrix functions and analytic continuation, Springer, 1974. 810 [1OJ C. Davis, Notions generalizing convexity for functions defined on spaces of matrices, Amer. Math. Soc. Proc. Symposia in Pure Math., 7(1963),187-201. [11J F. Kraus, Uber konvexe Matrixfunk- tionen, Math. Z., 41 (1936), 18-42. 213 (XIX.12) Information Theory A. General Remarks The mathematical theory of information trans- mission in communication systems, first devel- oped by C. E, Shannon [IJ and now called information theory, is one of the most impor- tant fields of mathematical science. It consists of two main themes, channel coding theory and source coding theory. The purpose of channel coding theory is to ascertain the existence of schemes for transmitting information or data reliably over a noisy channel at a fixed trans- mission rate. The purpose of source coding theory is to show the existence of schemes for compressing data emitted from an information source and reproducing them within tolerable limits of distortion. Both theories are based profoundly on the theory of probability, statis- tics, and the theory of stochastic processes. B. Entropy In information theory it is customary to call a finite set A = {ex l , ..., exn} an alphabet and to call its elements the letters of the alphabet. The simplest model of information sources consist- ing of an alphabet A and a tprobability dis- tribution p over A is denoted by X = [A, p], which can be regarded as a tfinite probability space, where the probability of outcome of a letter exiEA from the source is denoted by Pi= p(ex;). A real-valued function defined by I(ex j )= -log p( ex i ) is calIed the self-information of the event ex i E A. The tmean value of the self- information, i,e" n H(X)= I -p(exi)logp(ex i ), i=l is called the entropy of the source. This can be interpreted as a measure of the average a priori uncertainty as to which letter will ema- nate from the source, or a measure of the average amount of information one obtains upon receiving a single letter from the source. The unit of information or entropy for base e logarithms is called a nat, while that for base 2 logarithms is called a bit. 
811 Let A = {exl, ..., exn} and B= {[il,"" fJm} be two alphabets. Let r(ex i , fJ) be a tjoint proba- bility distribution defined on the product AB, and denote the tprobability space by X Y = [AB, r]. Then the joint distribution gives rise to tmarginal distributions p(ex;) = Lj'=1 r(ex i , fJ) and q(fJ) = L?=l r(ex i , fJ) and to tconditional distributions p(exd [ij) = r(ex i , fJ)/q(fJ) and q(fJj I ex;) = r(ex i , fJ)/p(exJ Probability spaces X = [A, P J and Y = [B, qJ are subspaces of the probability space XY== [AB, r]. The entropy of XY= [AB, rJ is defined by n m H(XY)= I I -r(exi,fJj)logr(exi,/J) i::::l j::::l as well. A real-valued function I(exdfJ)= -log p( ex i I fJ) is calIed the conditional self- information. The average of I (ex i I fJ) over ex i defined by n H(XlfJ)= I p(ex;)I(exdfJ j ) i=l is calIed the conditional entropy of X for given fJjE B. The average of H(X I fJ) over fJ j defined by m H(XI Y)= I q(fJ)H(XlfJj) j=1 is calIed the conditional entropy of X for given Y. The conditional entropy of Y for given X, denoted by H(YI X), is defined similarly. Then it can be shown that H(XY)=H(Y)+H(XI Y)=H(X)+H(YIX), H(XY).s;H(X)+H(Y), H(XI Y).s;H(X), H(YIX).s;H(Y), where equalities in the last three expressions hold if and only if r(ex i , fJ)= P(exi)q(fJj) for aII exiEA and fJjEB. C. Information Sources Given a finite alphabet A, we consider the finite product set AN = A x ... x A and the doubly infinite product A Z = II _ 00 A k , whe:'e Ak=A, k=O, i: 1, i:2,.... Let. be the 0"- algebra generated by all tcylinder sets in A Z . Given a tprobability measure P over, an information source is defined as a probability space [A Z , PJ or as a trandom process X = ... X-I XOXI X 2 .... When P is invariant under the tshift transformation Ton A Z , i.e" P(E) = P(TE) for any EE, then [A Z , PJ is said to be a stationary source. In particular, if P(E) = o or 1 whenever TE = E E , then the infor- mation source is said to be ergodic. If X is an tindependently and identically distributed 213D Information Theory random process, the source X = [A Z , PJ is said to be memoryless. For a given X, denote the subsequence XI ... X N of X by X N . Then a probability mea- sure P on AN and a finite probability space X N = [AN, PNJ are naturalIy induced, The en- tropy of the stationary information source X= [k, PJ is defined as H(X)= limNoo H(X N )/ N or as H(X)= limNoo H(XNI X N - I ), because both limits exist and are identical. If X is memoryless, the entropy of X is equivalent to that of XI =[Al,pIJ, i.e., H(X)=L?I - Pi log Pi' where Pi=pl(X I = ex;) and H(X N )= NH(Xl)=NH(X). D. Source Coding Theorem Let x N = XI ... XN be a sequence of N consecu- tive letters from the source [A z , P]. Suppose that we wish to encode such sequences into fixed-length code words u L = U I ... U L consist- ing of L letters from a code alphabet U of size v. The number R=(logvL)!N is calIed the coding rate per source letter. A mapping <p: AN --> U L is calIed an encoder and <p: U L -->A N a decoder. The set U L with specified encoder- decorder pair [<p, I/J J is called a code with rate R=(Llogv)!N. The error probability of the code is defined by Pe[<p, I/J J = P[I/J {<p(X N ) # X N }]. The fixed-length source coding theorem [1,2J states: Let X = [A z , PJ be a stationary ergodic source. Then for any 6> 0, if R ;;" H (X) + 6, there exists a code [<p, I/J J with rate R such that the error probability Pee <p, I/J J can be made arbitrarily smaII by making N suffi- ciently large. Conversely, if R.s;H(X)-6 then for any code with rate R, Pe [<p, I/J J must be- come arbitrarily close to 1 as N --> 00, This theorem is trivial if R;;" log n. For mem- oryless sources, the theorem folIows immedi- ately from the tweak law of large numbers, which implies that for arbitrary s > 0 and 6> 0 there exists an integer No(s, 6) such that for aII N>N o (S,6) p[I _IOgN(XN) H(X)I >6 J<s. The validity of this property for stationary ergodic sources was proved by B. McMillan [2]. In case of memoryless sources, the exact asymptotic form of the error probability for an optimal code with rate R was given by F. Jelinek [3J, I. Csiszar and G. Longo [4J, Blahut [5J, and Longo and A, Sgarro [6J as .1. 11m -logPe[<p,I/JJ= mill D(qllp), Nx. N q H(q)",R where p is the source distribution, q denotes a 
213 E Information Theory probability distribution on A, and n q. D(qllp)= L qjlog-', i1 Pi which is called Kullback's discrimination in- formation [7] or the divergence. Source sequences x N can be encoded into variable-length code words consisting of letters from alphabet U of size v. A set of n N code words is called a prefix condition code if there is no code word which is equivalent to the prefix of any other code word. Denote the length of the code word corresponding to a source sequence x N by L(x N ) and the average length of the code words per source letter by 1 _ 1 -L= L _PN(xN)L(x N ). N xNN The variable length source coding theorem states: Given a memoryless source X with entropy H(X), there is a prefix condition code such that the average length of the code words per source letter satisfies - 1 H(X).s; L log v.s; H(X)+-Iogv. N This theorem is valid for stationary ergodic sources if (logv)jN in the last term is replaced by F.(N), where F.(N)-->O as N --> 00. These two theorems are referred to as noise- less source coding theorems. E. Source Coding Theorem with Fidelity Criterion The noiseless source coding theorem implies that the average number of code letters per source letter can be reduced to the source entropy H(X) under the requirement that the source sequence be exactly reproduced from the encoded sequence. If an approximate re- production of the source sequence to within a given fidelity criterion is required, the coding rate per source letter must be reduced further to a certain value below the source entropy. Suppose that a distortion measure d(exj, ex) is defined for exi' ex j E A, where it is assumed that d(ex i , ex)O and d(ex i , ex;) =0. For blocks x N = XI ... X N and yN =1'1 ... YN, define N N 1  dN(x ,y )=- L.. d(x k , yd. N k1 Any set = {yf,..., yz,}, y};; EA N , ofreproduc- ing words is calIed a source code of block length N. Each source sequence x N of the source X=[AZ,PJ is mapped into whichever code word y;;; EC(6 minimizes dN(X N , yN), i.e., dN(X N , y;;;) = min dN(X N , yN), yNe'C 812 and the suffix m is transmitted. Hence the coding rate per source letter is R = (log M)j N, and the average distortion is dN()= L pN(xN)mindN(xN,yN). yN e {6 xNeAN The problem is how far we can reduce the rate under the condition that the average distortion keeps satisfying a given fidelity criterion, which is specified as a maximum tolerable value d for the average distortion. The source coding theorem with a fidelity criterion states: Let X = [A z, PJ be a memory- less source. For any specified dO, any F.>O, and b > 0, there exists a source code  with rate RR(d)+b and with sufficiently large block length N for which the average distor- tion satisfies dN().s; d H, where R(d) is the rate distortion function de- fined by R(d)= min I(p; W), WE1f(d) n n W( 'Ii) I(p;W)=L LPiWUli)log n } i1 j1 L Pi WU I i) i=t 1Y(d) = { W:i j PiWUli)d(exj,ex).s;d}, where W(j I i) denotes a conditional proba- bility distribution referred to as the test chan- nel, and where I(p; W) is called the mutual in- formation. It should be noted that R(O)= H(X). The rate distortion function R(d), which was first defined by Shannon [I, 8J, is closely re- lated to the F.-entropy introduced by A. N. Kolmogorov [9]. R(d) is a monotonicalIy de- creasing and convex function. The theorem was first proved by Shannon [8J, and was extended by R. G. Gallager [IOJ to the case of stationary ergodic sources with discrete alphabets and by T. Berger [IIJ to stationary ergodic sources with abstract al- phabets. More recently, R. G. Gray and L. D. Davisson [12J have proved source coding theorems without the ergodic assumption for stationary sources su bject to a fidelity criterion. The proof was based on Rokhlin's ergodic decomposition theorem [13]. Other important source coding theorems have been obtained by F. Jelinek [14J for tree codes, A. J. Viterbi and J. K. Omura [15J for trellis codes, and Gray, D. L. Neuhoff, and D. S. Ornstein [16J for sliding block codes. The rate distortion function for memory less Gaussian sources subject to squared error distortion was given by Shannon [8J, and that for autoregressive Gaussian sources was deter- mined by Kolmogorov [9J, M. S. Pinsker 
813 [17J, Berger [11J, Gray [18J, and T. Hashi- moto and S. Arimoto [19]. F. Channel Coding Theory A mathematical model for a channel over which information is transmitted is specified in terms of the set of possible inputs, the set of outputs, and a probability measure on the output events conditional on each input. The simplest channels are the noiseless ones, for which there is a one-to-one correspondence between input and output and no loss of in- formation in transmission through the chan- nel. The second simplest channels are discrete memoryless channels (DMCs) which are de- fined as follows: The input and the output are sequences of letters from finite alphabets (say, exjEA and fJjEB), and the output letter at a given time depends statistically only on the corresponding input letter. That is, a DMC is characterized by a fixed conditional proba- bility distribution W(j I i) = W(fJj I ex;) because the probability measure on the input and output sequences satisfies N WN(yNlx N )= f1 W(Yklx k ), XkEA, Yk EB . kl By U = {I, ... ,M} we denote the set of integers each of which is assigned to each correspond- ing possible message from the source. A map- ping <p: U --> AN induces a collection C(j = {xf, ..., x}, called the block code with rate R = (log M)/ N; each element is called a code word. Only code words are transmitted over the channel. A mapping Iji : B N --> U is called the decoding. Thus, given an encoding and decod- ing pair [<p, Iji J, t he error probability is defined by 1 M Pe[<p, IjiJ= N mY::t L* WN(yNI<p(m)), where the summation L* is taken with respect to all yN such that lji(yN) # m. The capacity for a DMC is defined by C=maxI(p; W) p W(li) =maxLLP j WUli)log n } .. ' P j1 jt LjtPj W(j II) The fundamental theorem of channel coding theory states: Given a DMC with capacity C> 0, there exist a block code with rate R below capacity C, a pair [<p,IjiJ, and a function E(R) > 0 for O.s; R < C such that the error proba- bility satisfies Pe[<p,IjiJ<exp{ -NE(R)}. This was first discovered by Shannon [IJ, and its precise proof was first given by A. 213 F Information Theory Feinstein [20]. The precise expression of E(R) was subsequently given by P. Elias [21J, R. M. Fano [22J, and Gallager [23]. The best ex- pression of E(R), due to Gallager [23J, is E(R)==max{ Er(R), Eex(R +O(N- l ))}, where Er(R)= max max [ -PR Opl p -Iog {Pi WU1i)I/(1+P>}'+P} Eex(R)=supmax [ -PR P;' I p - 10 g LPiPk { L J WUI i) WUI k) } l/P ] r,k ) The converse of the fundamental theorem states: Given a DMC with capacity C, for any block code with rate R above C and any pair [<p, Iji J, the error pro bability satisfies Pe[<p,IjiJ> l-exp{ -NE(R)}, where E(R) is a function positive for R > C. This was first proved by J. Wolfowitz [24J, and the precise expression, found by S. Ari- moto [25J, is described by E(R)= max min [ -PR -lpO p -10g {Pi W(jl i)lj(l+PJ} 1+] The fundamental theorem of coding theory and its converse imply that the capacity is a critical rate; above the capacity, information cannot be transmitted reliably through the channel. Unfortunately, there is, in general, no direct method for computing the capacity, and therefore an iterative method was proposed by Arimoto [26]. Another iterative method for computing the rate distortion function was given by R. E. Blahut [27]. A discrete channel with memory is, in gen- eral, defined by a list of probability measure {Wx,xEA Z } on a tmeasurable space {BZ,ff B } such that for each FE'B' Wx(F) is a tmeasur- able function of x. A channel W is called sta- tionary if WTx(TF) = Wx(F) for all xEA z and all FEff B . Given an input source [A Z , PJ and a channel W, connecting the input to the chan- nel induces a joint process of input and out- put, denoted by [k x B Z , PWJ where PW is a measure on {A Z x B Z , ff AX B}' If a joint process [A Z x B Z , PWJ is stationary the mutual in- formation between input and output is defined by I (X; Y) = H (X) + H (Y) - H (XY). A channel W is called ergodic if for every 
213 Ref. Information Theory ergodic input [A Z , PJ the induced joint process [A Z x B Z , PWJ is ergodic. The channel capac- ity for a discrete stationary channel is defined in various ways. The important ones are the ergodic capacity defined by C c = sup. of I(X; Y) with respect to all stationary ergodic sources X = [A Z , PJ, and the stationary capacity de- fined by C s = sup. of I(X; Y) W.r.t. all station- ary sources. K. R. Parthasarathy [28J proved C e = C, for all discrete stationary channels. In another way 1. Nedoma [29J introduced the operational source/channel block coding capacity C scb , which is defined as the supre- mum of the entropies of all admissible sta- tionary ergodic sources in the sense that there exist source/channel block codes such that the error probabi1ity Pe-->O as block length N --> 00. Nedoma [29J also pointed out an example of a stationary channel where C s > C scb ' Hence block coding theorems have been proved for various channels: for finite memory channels by A. I. Khinchin [30J, K. Takano [31 J, Nedoma [29J, and Feinstein [32J, for d-continuous channels by Gray and Ornstein [33J, and for almost finite memory channels by D. L. Neuhoff and P. C. Shields [34]. References [IJ C. E. Shannon, A mathematical theory of communication, Bell System Tech. J., 27 (1948), 379-423, 623-656. [2J B. McMillan, The basic theorems of in- formation theory, Ann. Math. Statist., 24 (1953),196-219. [3J F. Jelinek, Probabilistic information theory, McGraw-Hili, 1968. [4J I. Csiszar and G. Longo, On the error exponent for source coding, Studia Math. Acad. Hung., 6 (1971), 181-19\. [5J R. E. Blahut, Hypothesis testing and in- formation theory, IEEE Trans. Information Theory, 20(1974), 405-417. [6J G. Longo and A. Sgarro, The source cod- ing theorem revisited: A combinatorial ap- proach, IEEE Trans. Information Theory, 25 (1979), 544- 548. [7J S. Kullback, Information theory and sta- tistics, Wiley, 1957. [8J C. E. Shannon, Coding theorems for a discrete source with a fidelity criterion, IRE National Convention Record, pt. 4 (1959), 142-163. [9J A. N. Kolmogorov, On the Shannon theory of information transmission in the case of continuous signals, IRE Trans. Information Theory, 2 (1956),102-108. [IOJ R. G. Gallager, Information theory and reliable communication theory, Wiley, 1968. [11 J T. Berger, Rate distortion theory for 814 sources with abstract alphabets and memory, Information and Control, 13 (1968),254-273. [12J R. G. Gray and L. D. Davisson, Source coding theorems without the ergodic assump- tion, IEEE Trans. Information Theory, 20 (1974), 502-516. [13J V. A. Rokhlin, On the fundamental ideas of measure theory, Amer. Math, Soc. Trans\., 71 (1952). [14J F. Jelinek, Tree-encoding of memory less time-discrete sources with a fidelity criterion, IEEE Trans. Information Theory, 15 (1969), 584-590. [15J A. 1. Viterbi and J. K. Omura, Trellis encoding of memoryless discrete-time sources with a fidelity criterion, IEEE Trans. Infor- mation Theory, 20 (1974),325-332. [16J R. M. Gray, D. L. Neuhoff, and D. S. Ornstein, Nonblock source coding with a fidelity criterion, Ann. Probability, 3 (1975), 478-49\. [17J M. S. Pinsker, Information and infor- mation stability of random variables and pro- cesses, Holden-Day, 1965. [18J R. M. Gray, Information rates of auto- regressive processes, IEEE Trans. Information Theory, 16 (1970), 412-42\. [19J T. Hashimoto and S. Arimoto, On the rate-distortion function for the nonstationary Gaussian AR process, IEEE Trans. Informa- tion Theory, 26 (1980), 478-480. [20J A. Feinstein, A new basic theorem of information theory, IRE Trans. Information Theory, 4 (1954), 2-22. [21J P. Elias, Coding for noisy channels, IRE National Convention Record, pt. 4 (1955),37- 46. [22J R. M. Fano, Transmission of information, MIT Press and Wiley, 196\. [23J R. G. Gallager, A simple derivation of the coding theorem and some applications, IEEE Trans. Information Theory, 11 (1965), 3-18. [24J J. Wolfowitz, The coding of messages subject to chance errors, Illinois J. Math., I (1957), 591-606. [25J S. Arimoto, On the converse to the cod- ing theorem for discrete memoryless channels, IEEE Trans. Information Theory, 19 (1973), 357-359. [26J S. Arimoto, An algorithm for computing the capacity of arbitrary discrete memory less channels, IEEE Trans. Information Theory, 18 (1972), 14-20. [27J R. E. Blahut, Computation of channel capacity and rate-distortion functions, IEEE Trans. Information Theory, 18 (1972), 460- 473. [28J K. R. Parthasarathy, On the integral representation of the rate of transmission of a stationary channel, Illinois J. Math., 2 (1961), 299-305. 
815 [29J J. Nedoma, The capacity of a discrete channel, Trans. 1st Prague Conf. Information Theory (1957),143-181. [30J A. Ya. Khinchin, Mathematical founda- tions of information theory, Dover, 1957. [31J K. Takano, On the basic theorems of information theory, Ann. Inst. Statist. Math., 9 (1957),53-77. [32J A. Feinstein, On the coding theorem and its converse for finite-memory channels, In- formation and Control, 2 (1959),25-44. [33J R. M. Gray and D. S. Ornstein, Block coding for discrete stationary d-continuous noisy channels, IEEE Trans. Information Theory, 25 (1979),292-306. [34J D. L. Neuhoff and P. C. Shields, Chan- nels with almost finite memory, IEEE Trans, Information Theory, 25 (1979), 440-447. 214 (XVIII.18) Insurance Mathematics A. General Remarks Insurance is a system in which a large number of people contribute a smaIl precalculated amount of money (called a premium) to fill the economic need that arises when a person meets adversity. The amount of economic need fiIled by this system is caIled the amount of insurance (or amount insured). The insurer is the one who implements the system. Actuarial mathematics is the branch of applied mathe- matics that studies the mathematical basis of insurance, one of the first cases in which math- ematics was successfully applied to a social question. Actuarial mathematics can be di- vided into two branches according to its ap- plication. The first includes the calculation of various values of each individual policy, such as premiums or reserves. The second is mainly connected with management of an insurance business and includes the study of reinsurance systems, of the maximum amount of insurance, of the contingency fund, or the analysis of profits. There is only one basic principle in actuarial mathematics, caIled the principle of equivalence. It determines the premium and reserve in each year so that the present value of future premium income of the insurer is equal to the present value of future benefits for each policy. The basic factors of actuarial calculations are (1) probabilities of contingncies, (2) an expected rate of interest in the future (often referred to as the assumed rate of interest), and (3) cost of administration of the system. 214 A Insurance Mathematics Premiums are calculated using these factors and the principle of equivalence. The foIlow- ing is an example of the classical method of calculation for a life insurance policy with the use of "commutation symbols," which is an old device for the convenience of calculations. We write P for the net premium (in which the cost of administration is disregarded), P' for the gross premium, T, for the amount of death benefits payable in the tth year after the policy is issued, E, for the amount of survival benefits payable at the beginning of the tth year, n for the period for which the insurance is effective, and m for the period for which pre- miums are to be paid. Let ex, fJ, and y stand for three positive constants determining the initial expenses=ex(T I or Ed, the premium coIlection expenses = fJP', and the general expenses for maintenance = y( T, or E,). The factor that comes into consideration next is a model of human death and survival (measurement). Assume that Ix is the number of lives attaUlIllg age x, and write qx for the probability that a life of x years wiIl end within one year. Then d x , the number of lives ending within one year out of lx, is Ixqx, and I x + l , the number of lives remaining after one year at age x + 1, is I x - d x =Ix<1 -qx)' The commutation symbols commonly employed are defined as follows: Write v= 1/(1 +i), where i is the assumed rate of interest; then Dx=lxvX, Cx=dxv X + l , n N x = L Dx+" r=O n M x = L CX+<. 1=0 For a policy issued at an insured person's age x, the present value of the insurer's future income can be expressed as P'(N x - N x + m )/ Dx, and the present value of his future payments can be expressed as 1 ( n n+1 - L T,Cx+'-1 + L E,Dx+<_1 Dx ,l ,I n +ex(T I or EdDx+Y L (T, or E,)Dx+'_1 '=0 + fJP'(N x - N x + m )} By assuming that the present value of the future income is equal to the present value of the future payments, the value of the gross premium P' is obtained. (The P' obtained from the assumption ex = fJ = Y = 0 is denoted by P and is caIled the net premium. The difference P' -P is called the loading.) For a policy in which benefits are payable on disability or contingencies other than death, we have only to obtain a model of contingencies and apply a similar calculation. 
214 B Insurance Mathematics B. Liability Reserve During the term of an insurance contract, it often happens that the present value of the future income is less than the present value of the future payments. If this is the case, the difference is to be held by the insurer as the liability reserve. The source of this fund is the past premium income plus interest. The net premium reserve, which disregards expenses, is calculated as 1 ( n n+1 v,==- L T"C xH - I + L ErDxH-I D x + t r=t+ 1 r=t+ 1 -P(N x + 1 -N x + m ). Between the net premium P and the net premium reserve V, we have the relation P=(vV,- V,-d+(1;- V,)vqx+,+E,. The first term of the right-hand side of this formula is called the savings premium, since it is the amount left out of the premium income of the tth year and added to the reserve. The second term is called the cost of insurance or risk premium and is applied to cover the dif- ference between the amount of insurance and that of the existing reserve in case the contin- gency of death arises. The third term is applied to the payment of the survival benefits (or annuities in case of an annuity contract). If 1;- v" the amount of risk insured by the insurer, is positive for all values of t during the period of insurance, the policy is called death insur- ance. On the other hand, if the value of 1; - V, is negative for all values of t, the policy is called survival insurance. If the value of 1; - V, varies between positive and negative according to the different values of t, the policy is called mixed insurance. If 1; is always equal to v" the policy constitutes mere savings. Most of the insurance policies issued today are one or another type of death insurance, while life annuity policies are a type of survival insur- ance, For a long time, studies have been made of the effect on premiums and reserves of changes in the three basic factors (1), (2), and (3) in Section A. C. Risk Theory Risk theory occupies a special position in the field of actuarial mathematics. Actuarial math- ematics was first born from the theory of prob- ability. Since the modern theory of proba- bility based on measure theory was developed by A. N. Kolmogorov and other mathema- ticians ( 342 Probability Theory), new ap- 816 proaches have inevitably been made to ac- tuarial mathematics. An outstanding example is risk theory. Risk theory can be divided into two branches. One is called classical risk theory (or individual risk theory), in which the profit or loss that may result during a certain term of an insurance contract is regarded as a tran- dom variable. Since the insurer's profit equals the sum of these random variables over all the individual contracts, various probability func- tions can be obtained by applying the theory of probability. The second, called collective risk theory, pays no attention to each indiv- idual contract but studies changes in the insurer's balance as a whole with the lapse of time. The basis of collective risk theory was given by F. Lundberg, H. Cramer, and other mathematicians. We explain the collective risk theory follow- ing Cramer [5]. For simplicity we consider an insurer who issues no policies other than death insurance and makes no expenditures except the policy claims. Suppose that during the time interval (t, t + M) contingency occurs with probability AM + o( t) independently of the past. We denote by F(x) the tdistribution function of the amount to be paid by the in- surer when a contingency occurs. Then the number of contingencies in (0, t), N(t), is a tPoisson process with parameter A. and the total expenditure of the insurer during the time interval (0, t), X(t), is a tcompound Pois- son process ( 5 Additive Processes) such that E(eiZX<'))=ex p ( At L'" (e iZX -l)dF(X»). If p is the premium income per unit time, then we have P=AS:f xdF(x), because E(X(t))=pt holds by the principle of equivalence. If u denotes the initial fund of the insurer, then the fund reserved at time t, Y(t), equals u + pt- X(t). Y(t) is a tLevy process ( 5 Additive Processes) such that E(eizY('» =exP(iZu +At {OO (eiZX_1 _ iZX)dF(X). The probability that Y(t) < ° for some time t< 00 is called the ruin probability, which de- pends on the initial fund u, denoted by ljJ(u). This satisfies the following tintegral equation of Volterra type: EIjJ(u)= f ro Q(x)dx+ r u ljJ(x)Q(u-x)dx ..1. U Jo (Q(x) = I-F(x)), from which one can derive an asymptotic rela- tion ljJ(u)Ce-Ru as u-->oo, where Rand C 
817 are positive constants depending only on ..1. and F, See [6J for recent developments in risk theory. References [IJ C. W. Jordan, Society of Actuaries' text- book on life contingencies, Society of Actu- aries, 1952. [2J W. Saxer, Versicherungsmathematik, Springer, I, 1955; II, 1958. [3] E. Zwinggi, Versicherungsmathematik, Birkhauser, 1945. [4J P. F. Hooker and L. H. Longley-Cook, Life and other contingencies I, Cambridge Univ. Press, 1953. [5J H. Cramer, Collective risk theory, Nor- diska Bokhandeln, 1955, [6J H. Blihlmann, Mathematical methods in risk theory, Springer, 1970. 215 (XIX.6) Integer Programming A. General Remarks Integer programming, in its broadest sense, addresses itself to either minimization or maxi- mization of a functional f over some discrete set S in Rn, but it is usually understood as dealing with questions related to linear pro- gramming problems ( 255 Linear Program- ming) with additional integrality conditions on the variables, namely, the problem Po: Minimize {c'x I Ax =b, x:;) 0, x j an integer,j= 1, ...,nl(.s;n)}, where AERmxn, bERm, cERn are given data and x =(x I ,." ,xn)'ERn is a vector. Po is called a pure integer programming problem or an all-integer programming prob- lem ifn=nl, and a mixed integer program- ming problem if n l < n. In particular, Po is called a 0-1 integer programming problem if all the integer variables are restricted to be equal to either ° or 1. We write X O = {xERn I Ax = b, xj:;)O,j = 1,..., n}, Xl =X O n {x ERn I X j is an integer,j= 1,..., n l }, and assume for simplicity that (i) Xl # 0, (ii) X O is bounded, and (iii) all the components of A and b are integers. Po arises not only as a mathematical model for an optimization prob- lem where some of the decision variables have indivisible minimum units but also as one for many optimization problems with some logical and/or combinatorial constraints [1]. 215 C Integer Programming Algorithms for solving Po can be classified into algebraic methods and enumerative methods. Both are based upon linear programming and relaxation techniques [2J in which some of the constraints of Po are temporarily relaxed ( 264 Mathematical Programming). B. Cutting Plane Methods The origin of this class of algorithms is the fractional cutting plane algorithm proposed in 1958 by R. E. Gomory [3], the outline of which is described as: (1) Let X =X o . (2) Solve a linear programming problem: Mini- mize {c' x I x E X}, and let x be its t optimal solution. If the x j are integer for allj.s;n" then stop (x is optimal to Po). Otherwise, go to (3). (3) Generate a half-space H = {x E R n 11t' x:;) no} (1tERn, noE R I ), where H (i) contains Xl and (ii) does not contain X. Return to (2) by replacing X by XnH. Here, a linear programming problem ob- tained by relaxing integrality conditions is solved, and as long as xftX I , an inequality satisfying the two conditions (i), (ii) of (3) is introduced. Such an inequality 1t' x:;) no or equality 1t'X = no is called a cut or a cutting plane. Gomory devised a Gomory cut using (relaxed) integrality conditions on the vari- ables, and showed that the algorithm above produces a point of Xl in finitely many steps. Some of the other algorithms using cutting planes are the all-integer algorithm (Gomory, 1963) and the primal all-integer algorithm (R. D. Young, Operations Res., 16 (1968)). These algorithms, however, are generally slow and behave erratically, so that it is believed that they cannot in practice serve as general- purpose algorithms. C. Other Algebraic Methods Gomory [4], again in 1965, proposed a group- theoretic approach to Po. This method is based upon the following observation: Let x B be the vector of tbasic variables ( 255 Linear Pro- gramming) associated with a tdual feasible basis of a linear programming problem Po: Minimize {C'XIXEX O }, and let Xl be the set generated from Xl by relaxing the nonnegativ- ity constraints on x B . Then Xl can be shown to have a tcyclic group structure, so that a group minimization problem P G : minimize {c'x I XE Xl} can be solved as a tshortest path problem on a directed graph with a special structure ( 186 Graph Theory). If the opti- mal solution x of P G satisfies xB:;)O, then it is optimal for Po. If, on the other hand, xBjO, 
215D Integer Programming then a branch and bound algorithm described later can be applied for a systematic search of integer points near X. This algorithm is reported to produce good results when the size of the associated graph is not excessively large (G. A. Gorry et aI., Management Sci., 17 (1971)). Some of the other important results in this area are: (i) the theory of subadditive cuts (R. Gomoryet aI., Math. Prog., 3 (1972)) and disjunctive cuts (E. Balas, in Nonlinear Programming 2, O. Mangasarian et al. (eds.), Academic Press, 1975), (ii) research on facial structures of the integer polyhedron coX I (convex hull of Xl) for some of the more im- portant integer programming problems, such as that of the corner polyhedron COXI [5J, knapsack polytopes (E. Balas, Math. Prog., 8 (1975)), and traveling-salesman polytopes (M. Grotschel et aI., Math. Prog., 16 (1979)). It should be pointed out, however, that more difficulties of Po have been revealed rather than resolved through the intensive research in this area. Incidentally, the 0-1 integer programming problem is known to be t NP- complete ( 71 Complexity of Computations). D. Enumerative Methods Another large class of algorithms for solving Po consists of the branch and bound methods, first proposed by Land and Doig [6] in 1960. An outline of the improved version (R. J. Dakin, Computer J., 8 (1965)) is as follows. (1) Let ,9> = {Po}, z* = 00, x* = undefined. (2) If ,9> = 0, then stop (x* is optimal to Po). Otherwise, choose from P the problem PI: Minimize {C'XIXEXf}. (3) Solve PI: Minimize {C'XIXEX I }, in which the integrality condition is relaxed from the constraints of PI' If  has an optimal solution X, then go to (4). Otherwise, return to (2). (4) IfXEX{ and c'x<z*, then let x-->x*, c'x--> z*, and return to (2). If XE X{ and c'x  z*, then return to (2). Otherwise, go to (5). (5) Choose j.s; n l . for which x j is not integral and generate the two subproblems +: mini- mize {C'XIXEX{,Xj[xJ+ I}, and -: mini- mize {C'XIXEX{,Xj.s;[xJ}, in both of which [xjJ represents the largest integer not exceeding x j . Let g> U {- ,pn -->,9> and return to (2). The best point of Xl identified during the preceding steps is denoted by x* and called an incumbent. In summary, the branch and bound method chooses one subproblem PI from-the problem list g> and estimates the lower bound of its optimal objective func- tional value. If the lower bound is worse than the current incumbent, then PI is discarded, whereas Pr is separated into two subproblems if no conclusion can be reached. This process 818 is continued until the problem list P becomes empty, thereby implicitly checking all points of Xl. The above method is called an LP-based branch and bound method because linear programming techniques are employed to obtain a lower bound. The branch and bound method tends to require a large amount of storage, but many engineering improvements on the method of choosing (i) a subproblem PI and (ii) a branching variable x j , and several improvements of the bounding techniques, in addition to the substantial progress made in linear programming codes, enable us to solve a problem of size n::::: 100. In particular, an improved version of the implicit enumera- tion method, proposed by E. Balas [7J for 0-1 integer programming problems which uses logical conditions for obtaining lower bounds, is known to be able to solve rather large 0-1 integer programming problems (A. Geoffrion, Operations Res., 17 (1969)). E. Other Topics The partitioning algorithm [8J, in which in- teger variables are varied parametrically, is reported to work well for 0-1 mixed integer problems with relatively few integer variables. As people begin to realize the intrinsic dif- ficulty of Po, they pay more attention to heu- ristic algorithms or approximate algorithms to obtain a good but not necessarily optimal solution. Among heuristic methods, the inte- rior path method [9], which elaborates simple ideas such as rounding of the optimal solution of Po, has been reported to work well for prob- lems in which X O has a nonempty interior. Also, more emphasis is being placed on special purpose algorithms for solving practical prob- lems, such as tset partitioning and the ttravel- ing salesman problem, etc. [1]. Po is a typical nonconvex programming problem, and no practically useful tduality theorem is avail- able. Hence it is difficult to perform sensitivity and/or post-optimality analysis. Some research in this area has emerged recently (e.g., C. J. Piper et aI., Management Sci., 22 (1976)), but it looks as if it will be several years before a reasonably good procedure becomes available. References [I] R. S. Garfinkel and G. L. Nemhauser, Integer programming, Wiley, 1972. [2J A. M. Geoffrion and R. E. Marten, Integer programming: A framework and state-of-the- art survey, Management Sci., 18 (1972), 465- 491. 
819 [3J R. E. Gomory, Outline of an algorithm for integer solutions to linear programs, Bull. Amer. Math. Soc., 64 (1958), 275-278. [4J R. E. Gomory, On the relation between integer and non-integer solutions to linear programs, Proc. Nat. Acad. Sci. US, 53 (1965), 260-265. [5J R. E. Gomory, Some polyhedra related to combinatorial problems, Linear Algebra and Appl., 2 (1969), 451-558. [6J A. H. Land and A. G. Doig, An automatic method for solving discrete programming problems, Econometrica, 28 (1960), 497-520. [7J E. Balas, An additive algorithm for solving linear programs with zero-one variables, Oper- ations Res., 13 (1965),517-546. [8] J. F. Benders, Partitioning procedures for solving mixed-variables programming prob- lems, Numer. Math., 4 (1962), 238-252. [9J F. S. Hillier, Efficient heuristic procedures for integer linear programming with an inte- rior, Operations Res., 17 (1969), 600-637. 216 (X.10) Integral Calculus A. The Riemann Integral Let f(x) be a bounded real-valued function defined on an interval [a, b]. We shall divide this interval I = [a, b J into subintervals Ii = [Xi-I, xJ (i = 1, "', n) by a finite number of points Xi (a=xo<xl <... <xn=b). This divi- sion into subintervals is uniquely determined by the set D = {x;}, called the partition of I. We set Mi=suPxEl,!(X), mi=infxE1,!(x), and put i:T(D)=L?t Mi(xi-Xi-l), Q(D)=L71 mJXi- Xi-I)' Consiering all possible partitions D of I, we set Jf(x)dx=infDi:T(D), Jj(x)dx= sUPDQ(D), which are called the RIemann upper integral and Riemann lower integral of J, re- spectively. If they coincide, then the common value is called the Riemann integral of { on [a, bJ and is denoted by Jf(x)dx. In this case, we say that {is Riemann integrable (or simply integrable) on [a, b J and call f the integrand; a and b are called the lower limit and the upper limit, respectively. In this case, by integrating.f from a to b we mean the process of obtaining the value J%f(x)dx. Darboux's theorem: For each £ > 0 there exists a positive b such that the inequalities 1i:T(D)- r {(X)dxl<c;, \Q(D)- f !(X)dX! <£ 216 B Integral Calculus hold for any partition D with b(D) = maxj(x i -xj_t)<b. In other words, we have lim i:T(D) = f b f(x)dx, o(D)---joO a lim Q(D)= f b f(x)dx. o(D)O a From Darboux's theorem it follows that a necessary and sufficient condition for f(x) to be integrable on [a, b J is that for each positive £ there exist a positive b such that b(D) < b im- plies i:T(D)-Q(D) = L?=t (Mi-m;)(xj-x i - t )<£. We call Mj-m i the oscillation of f on Ii and i:T(D) and Q(D) the Darboux sums. Obviously, if {is integrable on [a, b J, then for each positive s there exists a positive b such that the follow- ing inequality holds for every partition D = {xJ with b(D) < b and for every set of points jElj (j= 1,... ,n): Ijf()(Xj-Xj-tJ- r {(X)dxl<£. The sum Lj=1.{(O(x j -x j - t ) is often called a Riemann sum (or sum of products). A function that is continuous on [a, b J, or bounded and continuous except for a finite number of points in the interval, is integrable. Furthermore, a bounded function that is continuous on [a, b J except for an infinite number of points x}. is integrable if for an arbitrary positive number £ there exist a finite number of intervals Ii of which the total length is less than £ and if the set {x A} of exceptional points is contained in U Ii' Generally, a necessary and sufficient condition for a bounded function defined on [a, b J to be integrable is that the set of points where the function is not continuous be of tmeasure 0 (in the sense of Lebesgue). A func- tion that is either tmonotonic on [a, b J (and consequently bounded) or of tbounded vari- ation is integrable. A function that is inte- grable on [a, b J is integrable on any sub- interval of [a, b J, the integrand being the restriction of the given function to this subinterval. B. Basic Properties of Integrals Let I be the set of all functions integrable on [a, b]. If f, gEl, then for any numbers ex, fJ, we have exf + fJgE I, fgE I, min {J, g} E I, max {f, g} E I, and fig E I provided that there exists a positive constant A such that the inequality Igl>A holds. Furthermore, if{EI, then If I EI; and if fnE I (n= 1,2,...) and fn converges tuniformly to f, then f E I. Cor- responding to these properties, the following formulas hold. 
216 C Integral Calculus (1) Linearity: r (exf(x) + fJg(x))dx =ex r f(x)dx+fJ r f(x)dx, where ex, fJ are constants. (2) Monotonicity: If f(x):;;, 0, then r f(x)dx:;;,O. If, further, f is continuous at a point Xo E [a, b J and f(xo»O, then Sf(x)dx> O. (3) Additivity with respect to intervals: If a, b, and c are points belonging to an interval on which f is integrable and a < c < b, then rf(X)dX= ff(X)dX+ ff(X)dX. Adopting the conventions that S: f(x)dx = 0 and SU(x)dx= - Sf(x)dx, the additivity formula holds independently of the order of a, b, and c. It follows from (2) that ISf(x)dxl.s; Slf(x)1 dx if a < b. Further, if fn(x) converges to f(x) uniformly on [a, b J, then lim f b fn(X)dX= f b f(x)dx. n--+w a a Replacing fn(x) by partial sums of a series, we obtain the following theorem: Let L an (x) be a series in which each term an(x) is inte- grable on an interval [a, b]. If the series con- verges uniformly on [a, b J, then the sum s(x) is integrable on [a, b J, and the series is term wise integrable, that is, f b 00 f b a s(x)dx= nl a an(x)dx. Also, the series LI S: an(t)dt converges uni- formly on [a, b J to the integral S: s(t)dt. As- sume that L an(x) is convergent but not uni- formly convergent. If all an (x), together with s(x) = L an (x), are integrable and there is a constant M independent ofn such that ISn(x)1 .s; M (XE [a, b J) for all n, where sn(x) are par- tial sums, then the series is termwise integrable (c. Arzela). The first mean value theorem: If f(x) is con- tinuous on [a, b J and <p(x) is integrable and of constant sign on [a, b J, then there exists (J (0 < 0< I) such that r f(x)<p(x)dx= f(a+O(b-a)) r <p(x)dx. When <p(x) = 1, we have r f(x)dx=f(a+O(b-a»)(b-a), 820 The second mean value theorem: If J (x) is a positive, monotone decreasing function de- fined on [a, b J and <p(x) is an integrable func- tion, then there exists I] (a < I].s; b) such that r f(x)<p(x)dx=f(a+O) f <p(x)dx. In the hypothesis of the second mean value theorem, if f(x) is assumed to be monotonic but not necessarily positive, then there exists I] (a.s;l].s;b) such that r f(x)<p(x)dx = f(a+O) f <p(x)dx+ f(b-O) r <p(x)dx. H. Okamura (1947) proved that the condition a.s;l].s; b can be replaced by a <I] < b. In the case f(x):;;, 0 on [a, b J, we consider the figure F bounded by the graph of f(x), the x- axis, and the lines x = a and x = b. Then iJ( D) and Q(D) are areas of polygons of which one encloses F and the other is enclosed by F, as shown in Fig. 1. Hence it can be shown that the integrability of f(x) in the sense of Rie- mann is equivalent to the measurability of F in the sense of Jordan. The Riemann integral Sf(x)dx is the area of F with respect to its tJordan measure. / // // / ,/ / a Xj X2 Xn-2 x,,] b Fig. I C. Relation between Differentiation and Integration Suppose that f(x) is integrable on an interval 1. We fix a point a of I and consider the in- tegration F(x)== S:f(t)dt, where x varies in I. The function F(x) is called the indefinite in- tegral of f(x). In contrast with this, the integral on a fixed interval, as considered in the previ- ous sections, is often called the definite in- tegral. The indefinite integral F(x) is continu- ous on the interval I and of bounded varia- tion. If f(x) is continuous at a point Xo in I, then F(x) is differentiable at Xo and F'(x o ) = f(x o ). In general, if a function G(x) satisfies G'(x)= /'(x) everywhere in I, then G(x) is called a primitive function of f(x). If f(x) is continu- 
821 ous, the indefinite integral of f(x) is one of the primitive functions off(x). Furthermore, if a function G(x) is a primitive function of f(x), then any other primitive function can be writ- ten in the form G(x) + C, where C is a con- stant, called an integral constant. For a con- tinuous function f(x) on [a, bJ and anyone of its primitive functions G(x), we have r f(x)dx=G(b)-G(a)=[G(x)J (fundamental theorem of calculus) ( Ap- pendix A, Table 9). From the differentiation formulas we obtain the following integration formulas: Integration by parts: If f(x) and g(x) have continuous derivatives on [a, bJ, then r f(x)g'(x)dx=[f(x)g(x)J- r f'(x)g(x)dx, More generally, if f(x) and g(x) are integrable on [a,bJ, then r g(X)(f f(t)dt)dX = r f(x)dx r g(x)dx r f(X)(f 9(t)dt)dX. Change of variables: If f(x) is integrable on [a, bJ and x = <p(t) and <p'(t) are continuous on [ex,fiJ, where a = <p(ex), b=<p(fi) (a.s;<p(t).s;b), then r f(x)dx= r f(<p(t))<p'(t)dt. D. Improper Integrals The concept of the integral can be generalized to the case where the integrand or the interval on which integration is accomplished is not bounded. Assume that f(x) is not bounded on [a, b) but is bounded and integrable on any interval [a,b-F.J (c [a, b)). If Jg-'f(x)dx has a finite limit for F.-->O, the limit is denoted by Jg.f(x)dx and is called the improper Riemann integral (or simply improper integral) of f(x) on [a, b). For example, if f(x) is continuous on [a,b) and f(x)=O«(b-x)") for some ex (O>ex> -I), where 0 is the tLandau symbol, then the improper integral Jg f'(x)dx exists. On the other hand, if f is integrable on [a + F., bJ for each F. > 0 but not bounded in any neigh bor- hood of a, we can define the integral on (a, bJ in the same way. If f is not bounded in any neighborhood of a or b and if there exists a point C (a < C < b) for which the improper in- tegrals Jf(x)dx and J;f(x)dx exist, then we define J f(x) dx == Jf(x) dx + J; f(x) dx, which is independent of the choice of the point c. Fur- 216 E Integral Calculus thermore, assume that f(x) is not bounded in any neighborhood of each point c j (j = 1, ..., n) (a <C I <... <Cn < b). Then we define f b f C' f C2 a f(x)dx= a f(x)dx+ C 1 f(x)dx+... + f"1 f(x)dx+ ff(X)dX, provided that all improper integrals 1'1 f(x)dx, ..., ff(X)dX exist. Suppose that f(x) is defined on [a, bJ and bounded outside any neighborhood of cE(a,b) but not bounded in either [c-F.,cJ or [c, C + F.] for any F. > O. It may well happen that, although neither lim,oJ-' f(x)dx nor lim,'oJ;H' f(x) dx exists (accordingly, the improper integral Jg f(x)dx does not exist), if we put F. = F.', the limit (1'-' f(x)dx+ J:/(X)dX) does exist. This limit is called Cauchy's prin- cipal value and is denoted by p. v. Jg f(x) dx (v.p. in French), For example, p.v. JI(dx/x) = limHo(J=W/x)dx+ Jl(1/x)dx)=O. E. Integrals on Infinite Intervals Suppose that we are given a function f(x) defined on an infinite interval [a, 00) and integrable on any finite interval [a, b]. If limh'XJ Jg.f(x) dx exists and is finite, then this limit is called the improper integral of f on [a, 00) and is denoted by J: f(x)dx. We define similarly JCNf(x)dx=lima_ooJg f(x)dx, where f is defined on ( -00, bJ and integrable on any interval [a, b]. Furthermore, J':'""f(x)dx is, by definition, J'- oof(x) dx + J.:" f(x) dx, which is independent of the choice of c. Suppose that f(x) is integrable on [a, bJ for a fixed a and an arbitrary b larger than a. If f(x) = O(x') for some ex < -1, then J: f(x) dx exists. Generally, for ex, fi such that -oo.s; ex < 00 and -00 < fi.s; 00, if the improper integral J f(x)dx exists, we say that the integral is convergent; other- wise, it is divergent. Improper integrals also satisfy the three basic properties of integrals (I), (2), and (3) ( Section B). However, the existence of an improper integral of a func- tion f on an interval I does not imply the existence of the improper integral of If I on the same interval I. For example, let f be a function determined by f(O) = 0, f(x) = (l/x)sin(l/x) for 0 < x.s; n. Then JOf(x) dx exists, but J If(x)1 dx does not. On the other hand, if the improper integral of If(x)1 exists, 
216 F Integral Calculus then the improper integral of f(x) exists, and we have If f(X)dxl.s;flf(X)ldX, where -oo.s; ex < fJ.s; +00. In this case, we say that f is absolutely integrable on the interval [ex, fJJ. Assume now that f(x) is defined on ( -00,00) and integrable on any finite interval. If lim,oo SJ(x)dx exists, then it is called Cauchy's principal value of the integral of fin (-00,00). If f(x) is a monotone decreasing, positive, and continuous function defined on [k, 00) (where k is an integer), then according as Ld(v) converges or diverges, so does Sk' f(x)dx. Suppose that a series L;'=l fn(x), where all the functions fn(x) (n = 1,2,...) are defined and nonnegative on an infinite interval [a, 00), satisfies S(L;'dn(x))dx = L;' I S fn(x) dx for arbitrary b> a. Then according as L;'=l S: fn(x)dx converges or diverges, so does S:(Ldn(x))dx. When they converge, the following equality holds: S: L;'dn(x)dx =L;'=l S: fn(x)dx. In this theorem, if S: L;'llfn(x)1 dx or L;'l S;' Ifn(x)1 dx con- verges, then the same conclusion as above will follow even when the fn(x) are not necessarily positive ( Appendix A, Table 9). F. Multiple Integrals Suppose that f(x, y) is a function defined and bounded on an interval 1= {(x, y) I a.s; x.s; b, c.s;y.s;d} in the xy-plane. Partitions {xJ and {Yk} of [a,bJ and [c,dJ with a=xo<xl <... <x m =b and c = Yo <Yl <... <Yn=d determine a "partition," denoted by D, of I into subinter- vals of the form Ijk = {(x, y) I x j _ 1 .s; x.s; x j , Yk-l .s; y.s;yd (j = 1,..., m; k = 1,..., n), Writing M jk = sup f(x, y), (x,y)eljk m jk = inf f(x, y), (x,y)eljk we set m n O'(D) = I I Mjk(xj-xj-d(Yk-Yk-d, jl kl Q(D) = I I mjk(x j - Xj-d(Yk - Yk-\)' jlk=1 Then we obtain infDO'(D)suPDQ(D). If infDO'(D) = sUPDQ(D), then f(x, y) is called inte- grable on I, and the common value is called the double integral off on I and is denoted by SSd(x,y)dxdy. Analogously, we can define n- tuple integrals (or multiple integrals) and the integrability offunctions of n variables. Let K be a bounded set in the xy-plane and 822 I an interval containing K. Let <p(x, y) be the characteristic function of K defined on I, that is, <p is determined by <p(x,y)= 1 for (x,Y)EK, <p(x,y)=O for (x,Y)EI-K. Replacing f(x, y) by this <p(x, y), we consider infDO'(D) (suPDQ(D)). These values can be shown to be independent of the choice of such an interval I and are called the outer area (inner area) of K, respectively. When these two values coincide, K is said to be of definite area, and the common value is called the area of K, A necessary and sufficient condition for K to be of definite area is that the outer area of the tboundary of K be zero. Now consider a bounded function defined on a set K of de- finite area. Then, taking an interval I contain- ing K, define an extension <p(x, y) of f(x, y) as follows: <p(x,y)= f(x,y) for (x,y)EK, <p(x,y)=O for (x,Y)EI-K. If <p(x, y) is integrable on I, then f(x, y) is called integrable on K, and the integral of f on K is defined by SSd(x,y)dxdy=SS/<p(x,y)dxdy, which is independent of the special choice of I. The set K is called the domain of integration. Since K is of definite area, the set of boundary points of K at which <p(x, y) is not continuous can be contained in a union of intervals whose total area can be made smaller than any preas- signed positive number. Consequently, a func- tion bounded on K and continuous at each tinterior point of K is integrable on K. Like integrals of functions of a single variable, multiple integrals satisfy the three basic prop- erties of integrals ( Section B). G. Multiple Integrals and Iterated Integrals Suppose that we are given a function f(x, y) that is continuous on an interval 1= {(x, y) I a.s;x.s;b,c.s;y.s;d}. Then, for a fixed y in [c, d], the function f(x, y), regarded as a func- tion of x, can be integrated with respect to x on the interval [a,bJ, and the integral thus obtained is a continuous function of y. The integral of the function defined on [c, dJ, namely, S(gf(x,y)dx)dy, is called the iterated integral (or repeated integral) of f(x, y) and is often written as S dy Sf(x, y)dx. The following formula gives a representation of a double integral by iterated ones: f Jf(x,y)dXd Y = r dy r f(x,y)dx = r dx r f(x,y)dy. 
823 More generally, <PI (x) and <P2(X) being con- tinuous on [a,bJ and <PI (x).s; <P2(X), consider the following subset K = {(x, y) I a.s; x.s; b, <PI (x).s; y.s; <P2(X)} of the xy-plane. Suppose further that f(x, y) is continuous on K. Then the following equality holds: If f h f "'2(X) f(x, y)dxdy = dx f(x, y)dy. K a 1) In the case of unbounded integrands or unbounded domains of integration, we can still define integrals under suitable restrictions. For instance, assume the following two prop- erties: (1) There exists a sequence {Kn} of sets, each of which is of definite area, satisfying Kl cK 2 c.,. and K = Ul Kn. (2) f(x,y) is bounded and integrable on each Kn (n= 1,2,...). If a finite Jimit limn'" I kf(x, y)dxdy exists and is independent of the choice of {Kn}, then f(x, y) is called integrable on K. This limit is called the integral of f(x, y) on K and is denoted by I Id(x, y)dxdy: Jim If f(X,y)dXdy== I r f(x,y)dxdy. n->C1J Kn JK When the integral thus defined exists, we say that the integral is convergent. If a finite limit Jimnoo I k If(x, y)1 dxdy exists for some se- quence {Kn} with property (1), then f is inte- grable on K. Let {(x,y) be continuous and nonnegative on K = {(x,y)lex < x< fJ, y <y< b}, where -oo.s;ex<fJ.s;oo, -oo.s;y<b.s;oo. Fur- thermore, let f(x, y) be integrable on K, and assume that the improper integral F(x) = S$f(x, y)dy = Jimcjy,dJO S: f(x, y)dy exists and converges uniformly with respect to x as cty, dib. Then It F(x)dx is well defined, and we have I Lf(X,y)dXd Y = r dx r f(x,y)dy. In particular, if ex=a, y=b, fJ=8= 00, then we have 1 00 1 00 f(x,y)dxdy= 1 00 dx 1 00 f(x,y)dy. H. Interchanging the Order of Differentiation and Integration Ifboth {(x,y) and 3f(x,y)j3y are continuous on an interval {(x,y)la.s;x.s;b,Yo-1].s;y.s; Yo+1]}, then we can interchange the order of differentiation and integration as follows: d J h J h 3f(x, y) - f(x,y)dx= -dx for y=Yo. dy a a 3y Assume further that this equality holds for 216 Ref. Integral Calculus every b( > a), the improper integral f ro f(x,y)dx= Jim f h f(x,y)dx hoo a a converges, and the improper integral f "'3 f (X,y) d _ I ' f h3f(X,y) d -x-1m -x a 3y h-oo a 3y converges as b-->oo uniformly for y with Iy- Yol<1]. Then d f oo f 003f(XY) - f(x,y)dx= ------'--dx for y=Yo. dy a a 3y Several other similar theorems are known. Though the previous theorems are written in terms of two variables, analogous theorems hold for n variables. I. Change of Variables in Multiple Integrals Let G be a bounded domain of definite area in an n-dimensional Euclidean space Rn(x). As- sume that a mapping x--> y(x) =(Yl (Xl, ..., X n ), ... , Yn(X 1 , ... , x n )) is of class C 1 from an open set containing the closure G of G into an n- dimensional Euclidean space Rn(y). We de- note the image of G under this mapping by B. If f(y 1> .. . , y n) is continuous on B, then the following formula on change of variables holds: f... If(Yl'''',Yn)dyl...d Y n = I f ! D(Yl, .."Yn) I ... g(x".."x n ) dx1...dx n G D(x l' ...,x n ) where g(x 1 , ...,x n )= f(Yl(X I , ...,x n ), ...,Yn(x 1 , ...,x n )) and D(Yl, ...,Yn)jD(x l , ...,x n ) is the tJacobian determinant of the mapping y(x). This formula is usually utilized in the case where YI' ..., Yn are tfunctionally independent, though otherwise both sides vanish and the formula still holds. For improper integrals, a similar formula will hold under suitable restrictions, for example, if the integrals con- verge absolutely. For related topics  94 CurviJinear In- tegrals and Surface Integrals, 221 Integration Theory, and 270 Measure Theory. References [1J T. M. Apostol, Mathematical analysis, Addison-Wesley, 1957. [2] N. Bourbaki, Elements de mathematique, Fonctions d'une variable reeIle, Actualites Sci. Ind., 1074b, 1132a, Hermann, second edition, 1958, 1961. [3J R. C. Buck, Advanced calculus, McGraw- Hill, second edition, 1965. 
217 A Integral Equations [4] R. Courant, Differential and integral cal- culus I, II, Nordemann, 1938. [5] G. H. Hardy, A course of pure mathemat- ics, Cambridge Univ. Press, seventh edition, 1938. [6] E. Hille, Analysis I, II, Blaisdell, 1964, 1966. [7] W. Kaplan, Advanced calculus, Addison- Wesley, 1952. [8] E. Landau, Einfiihrung in die Differentia1- rechnung und Integralrechnung, Noordhoff, 1934; English translation, Differential and integral calculus, Chelsea, 1965, [9] 1. M. H. Olmsted, Advanced calculus, App1eton-Century-Crofts, 1961. [IOJ A. Ostrowski, Vorlesungen iiber Differentia1- und Integralrechnung I-III, Birkhiiuser, second edition, 1960-1961. [11] M. H. Protter and C. B. Morrey, Modern mathematical analysis, Addison-Wesley, 1964. [12] W. Rudin, Principles of mathematical analysis, McGraw-Hili, second edition, 1964. [13] V. I. Smirnov, A course of higher math- ematics. I, Elementary calculus; II, Advanced calculus, Addison-Wesley, 1964. [14] A. E. Taylor, Advanced calculus, Ginn, 1955. 217 (XIII.30) Integral Equations A. General Remarks Equations including the integrals of unknown functions are called integral equations. The most studied ones are the linear integral equa- tions, i.e., linear in unknown functions. Let D be a domain of n-dimensional Eu- clidean space and {(x) and K (x, y) be func- tions defined for x=(X I ,X 2 , ...,xn)ED, y= (YI, Y2, ..., Yn)E D. Integral equations of Fred- holm type (or Fredholm integral equations) [1] are those of the forms Iv K(x,y)<p(y)dy=f(x), (1) <p(x)- Iv K(x,y)<p(y)dy= f(x), (2) A(x)<p(x)- Iv K(x,y)<p(y)dy=f(x), (3) where <p(x) is an unknown function and JDdy means the n-fold integral J... JDdYI'" dy.. Equations of the forms (I), (2), and (3) are called equations of the first, second, and third kind, respectively. Equations of the second kind have been investigated in great detail. 824 Equations of the third kind, in many cases, can be reduced formally to those of the second kind. The function K (x, y) is called a kernel (or integral kernel) of the integral equation. Integral equations of Volterra type (or Vol- terra integral equations) are those of the forms f K(x,y)<p(y)dy= f(x), (1') <p(x)- r K(x,y)<p(y)dy=f(x), (2') A(x)<p(x) - f K(x, y)<p(y)dy = f(x), (3') where <p(x) is an unknown function. Equations of the forms (1 '), (2'), and (3') are also called equations of the first, second, and third kind, respectively. Integral equations of Volterra type can be regarded as integral equations of Fredholm type having kernels equal to 0 for x < y, but these two types of equations are usually treated separately, since they have considerably different characters. The kernels in equations (1)-(3) and (1')- (3') are frequently written in the form AK(x,y) with a parameter }" in particular when the equations are related to eigenvalue problems, which is explained in Section F, The theory of integral equations was orig- inated in 1823 by N. H. Abel, who investigated the relationship between time and the path of a falling body in the field of gravitation. Let <p(t) be a quantity varying with time, which is connected by some law with its values in some time interval of the past or the future. Then the law of variation of <p(t) can be described mathematically by an integral equation. The situation is the same even when the variable t is not time but a coordinate of the space. In this way, various problems in physics can be reduced to solutions of integral equations. B. Relation to Differential Equations Many problems in differential equations can be reduced to problems related to integral equations, Such reduction often makes the problems easier to handle and clarifies the nature of the solutions. For example, consider the problem of finding a solution of the ordi- nary second-order linear differential equation d 2 yjdx 2 +Ay=Owith the boundary condition y(O)=y(1)=O [4]. Let d 2 yjdx 2 =u(x). If we integrate the equation twice, change the order of integration, and make use of the boundary condition, then we have y= J: (x-)u()d- I x(I-)u@d, 
825 from which we see that the given differential equation can be written in the form U=A L x(l-)u()d-A f(X-)U()d. Decomposing the first integral on the right- hand side into the sum of an integral over (0, x) and one over (x, 1), and combining the integral over (0, x) with the second integral on the right-hand side, we obtain a Fredholm integral equation of the first kind as follows: U=A L G(x, )u()d, { w -x) (O.s;.s;x), G(x,)= x(1-.;) (x.s;.;.s;I). Clearly, the solution of this integral equation is equivalent to that of the original differential equation. The function G is called tGreen's function for the boundary value condition y(O)= y(1)=O in the theory of boundary value problems. Differential equations of higher orders can be treated analogously ( 315 Ordinary Differential Equations (Boundary Value Problems)). tInitial value problems of linear ordinary differential equations can be reduced to the solution of V olterra integral equations in a similar way, As another example [5,6], consider the tDirichlet problem on a plane, i.e., the problem of finding a function U satisfying the conditions (i) U is tharmonic in the interior of the region D bounded by a closed curve C(  == <p(s), 1] = o/(s), O.s;s.s;l); (ii) u(x,y)-->F(s) uniformly with re- spect to (xo, Yo) as (x, y) approaches (xo, Yo) from the inside of D, where (xo, Yo) is an arbi- trary point on C, F(s) is a continuous func- tion given on C, and s is the arc length along C. Put f(s) = F(s)jn and (0/ (s) - 0/ (t»<p'( t) - (<p(s) - <p( t)) 0/'( t) K(s; t) = n«<p(s) _ <p(t))2 + (o/(s) _ 0/(t»)2) Then it is known that a solution u of the prob- lem can be given in the form i l a 1 u(x,y)= J1(s)-log-ds, o an r where r 2 = (<p(s) - X)2 + (o/(s) - yf, n is the inner normal of C, and J1(s) is a continuous solution of the following Fredholm integral equation of the second kind: J1(s) = f(s) - I K(s; t)J1(t)dt. We can treat the tNeumann problem similarly, i.e" the problem in which condition (ii) is re- placed by (ii') (aujan)(x, y)--> F(s) uniformly with respect to (x o , Yo) as (x, y) approaches (x o , Yo) from the inside of D. In the Neumann 217D Integral Eqnations problem, put f(s) = F(s)jn and (o/(s) - o/(t) )<p'(s) - (<p(s) - <p(t)o/'(s) L(s;t)= n«<p(s)_<p(t»2+(0/(s)_0/(t))2) . Then we have a solution u in the form i l 1 u(x,y)= - J1(s)log-ds+c, o r where J1(s) is a solution of the following Fred- holm integral equation of the second kind: . J1(s) = f(s) - I L(s; t)J1(t)dt. A solution of this integral equation, however, exists when and only when JF(s)ds=O. In the expression of a solution u(x, y) of the Neu- mann problem, c is an arbitrary additive con- stant, up to which a solution of the problem is determined uniquely. We can also treat par- tial differential equations oftelliptic type in an analogous way. C. Integral Eqnations with Continnous Kernel We describe some results for integral equa- tions with m-dimensional independent vari- ables, i.e., equations in which D is an m- dimensional closed domain. We assume that K (x, y) and f(x) are continuous in Sections D-H. D. The Method of Snccessive Iteration Among methods of solving Fredholm integral equations of the second kind, the simplest is the method of successive iteration, sometimes ca1\ed the method of snccessive approximation [7]. In the method of successive iteration, we rewrite (2) in the form <p(x) = f(x) + L K(x,y)<p(y)dy and replace the function <p(y) on the right- hand side by the function f(y) + LK(y,Z)<P(Z)dZ. If we repeat the process successively, then we have <p(x) = f(x) + it L Ki(x,y)f(y)dy + L K n +1 (x, y)<p(y)dy, where KI(x,y)=K(x,y), Ki(x,y)= L Ki_dx,s)K(s,y)ds. 
217 E Integral Equations The functions K ;(x, y) are caned the iterated kernels. Assume that LI Kn(x,y) converges uniformly. Then, putting 00 R(x,y)= L Kn(x,y), (4) n=1 we obtain a solution of (2) in the form <p(x) = f(x) + Iv R(x, y)f(y) dy. The series (4) is called a Neumann series. For a given kernel K(x, y), a function R(x, y) satisfying K(x,y)-R(x,y)+ Iv K(x,s)R(s,y)ds==O and K(x,y)-R(x,y)+ Iv R(x,s)K(s,y)ds=O is called a resolvent of K(x, y) (in some cases - R(x, y) is taken as the resolvent). If a re- solvent of K (x, y) exists, the solution of (2) can be given uniquely by (5). If a Neumann series converges uniformly, (4) gives a resolv- ent of K(x,y). If we apply a similar process to Volterra integral equations of the second kind, then we have the iterated kernels defined by Ki+l(x,y)= f K;(x,s)K(s,y)ds (i= 1,2,...). For these iterated kernels, a Neumann series defined by (4) always converges uniformly. E. Fredholm's Method Let D be a bounded closed domain and K(x,y) a continuous kernel. A Neumann series (4) converges uniformly and gives a resolvent if IK(x,y)1 or the region D is sufficiently small, but otherwise it does not necessarily converge. E. I. Fredholm [1, 7J gave a method of con- structing a resolvent for the more general case. Write a kernel in the form },K(x,y), and put ( ) K(Xl,Yd Xl' ...,X" K YI' ...,Yn = K ( ) Xn'Yl K(x" Yn) K(xn,Yn) Define D(A) and D(x, y; ..1.) by D(A)=  (-..1.)" 1 f ( SI,...,Sn ) 1 + L...  .., K dS I ... dS n n=1 n. D D 8 1 ,...,8 n and D(x,y;A)=K(x,y) 00 (-A)" f 1 ( X,SI,...,Sn ) + L  ... K dsl...ds n . n=1 n. D D Y,Sl,""S" 826 The series in these two equations both con- verge uniformly and hence define tentire func- tions of X The functions D(A) and D(x, y; ..1.) are caned Fredholm's determinant and Fredholm's first minor of the kernel K(x,y), respectively. For small 1..1.1, we have (5) D'(A) 00 f -= LAn-I Kn(s,s)ds, D(}.) n1 D where the Kn(x, y) are iterated kernels corre- sponding to K(x,y). Now if D(A)#O, a resolv- ent ).R(x, y; ..1.) of the kernel AK(x, y) can be given by D(x,y;).) - R(x, y; ..1.). D(A) If D(A) = 0, some extension of the method in this section is needed. Fredholm introduced for this purpose Fredholm's rth minor D ( X",..'Xr;A ) YI,'" ,Yr defined by ( x"...,x r ) D '..1. YI,"', y,' ( Xl,...,X r )  (-..1.)" =K +L...- YI, ...,Yr n=1 n! f f ( x" ""X"S" ...,Sn ) X ... K dsl..,ds n . D D Yi,...,Y"si"",sn F. Eigenvalue Problems and Fredholm's Alternative Theorem Consider a homogeneous integral equation of the second kind <p(x)-A Iv K(x,y)<p(y)dy=O, (6) where D is a bounded closed domain and K(x,y) is continuous in D x D, When (6) has a nontrivial solution <p(x) for some ..1., then ..1. is called an eigenvalue corresponding to the kernel K(x, y), and the corresponding non- trivial solution <p(x) is caned an eigenfunction corresponding to the kernel K(x,y). If D(A) #0, then (6) has no nontrivial solution, from which it follows that eigenvalues must be zero points of the entire function D(A). For an arbitrary eigenvalue ..1., there is a set of linearly inde- pendent eigenfunctions corresponding to ..1. such that any eigenfunction corresponding to ..1. can be written as a linear combination of the eigenfunctions belonging to the set under consideration. Such a set of linearly indepen- dent eigenfunctions corresponding to an eigen- value ..1. is caned a fundamental system corre- sponding to the eigenvalue ..1.. The number of elements of the fundamental system is caned 
827 the index of the eigenvalue A. The index of an eigenvalue is always finite. The homogeneous integral equation of the form ljJ(y)-A Iv K(x,y)ljJ(x)dx=O is called an associated (or transposed) integral equation of (6). The associated equation has the same eigenvalues as the original equation. moreover, the index of a common eigenvalue is the same for both equations. For any eigen- value ..1., the order of the zero point ..1. of the entire function D(A) is called the multiplicity of the eigenvalue A. If an eigenvalue ..1. is a pole of R(x,y;A), then we have p+ 1 ?r+q, where r is the order of the pole, p is the multiplicity of ..1., and q is the index of J.. In particular, if ..1. is a simple pole of R(x,y, ..1.), we have p=q, namely, the multiplicity is equal to the index, An exam- ple with this particular property is the integral equation with a symmetric kernel to be dis- cussed in Section G. For the set of eigenvalues there is no finite accumulation point even if thcre are infinitely many eigenvalues. If ..1. is not an eigenvalue, the inhomogeneous equation <p(x)-}, Iv K(x,y)<p(y)dy= f(x) can be solved uniquely for any continuous function f(x), In this case we have D(A) # 0, and the resolvent R(x, y; ..1.) of the kernel },K(x, y) exists. If ..1. is an eigenvalue, we have D(A) = 0, and equation (7) has a solution if and only if Iv ljJ(x)f(x)dx=O for all solutions ljJ(y) of (6') (Iincarly indcpcn- dent solutions ljJ(y) are finite in number). The last statement is called Fredholm's alterna- tive theorem ( 68 Compact and Nuclear Operators). A kernel of the type n K(x, y) = L Xj(x) }j(y) jl is called a separated kernel, degenerate kernel, or Pincherle-Goursat kernel. For such a kernel, we have D(A) =det(<5 jk -..1. JXj(t) (t)dt), and hence we can easily obtain eigenvalues and eigenfunctions. A nondegenerate kernel can be studied using the results obtained for sepa- rated kernels, since we can regard a kernel of the general form as the limit of a sequence of separated kernels. G. Symmetric Kernels A kernel K(x,y) is called a symmetric kernel if it is rcal and K(x,y)=K(y,x). Lct D be a 217 G Integral Equations (6') bounded closed domain and K (x, y) be a con- tinuous symmetric kernel. In this case the as- sociated equation (6') clearly coincides with the original equation (6) [5,6,8]. Corresponding to any nontrivial symmetric kernel K (x, y), there exist at least one eigen- value and one eigenfunction. The eigenvalues are all real, and the eigenfunctions correspond- ing to distinct eigenvalues are mutually ortho- gonal. If we orthonormalize the eigenfunctions belonging to all fundamental systems and number them according to the order of in- creasing absolute values of the corresponding eigenvalues, then we have an orthonormal system {<Pi(X)}, called a complete orthonormal system of fundamental functions or simply a complete orthogonal (or orthonormal) system. If we number the eigenvalues taking their multiplicities into account and according to the order of increasing absolute values, then we have the equality i ) = Iv Iv K 2 (x,y)dxdy. (7) Corresponding to an iterated kernel Km(x,y), we have the eigenvalues {A;"}, and we can choose the corresponding orthonormal system so that it coincides with the one corre- sponding to K(x, y). Eigenvalues and eigen- functions corresponding to an iterated kernel can be obtained in the following way: Put J D Kn(s, s) ds = Un; then the following limit exists: lim U2n/U2n+2 = A 2 < + 00, nOO which gives an eigenvalue of the iterated ker- nel K 2 (x, y). A function <p(x, y) defined by lim },2n K 2n (x, y) = <p(x, y) noo (uniformly convergent) gives the correspond- ing eigenfunction <p(x, c) for any constant c satisfying <p(x, c) =1= O. Let An be an eigenvalue corresponding to an iterated kernel Kn(x, y) and <p(x) be a corre- sponding eigenfunction. Consider the func- tions IjJj(x) (j = 0, 1, ... , n - 1) dcfined by IjJj(x) = <p(x) + :t: £kjA k Iv Kk(x,y)<p(y)dy (j = 0,1,2, .., ,n - 1), where £ is one of the nth primitive roots of 1. The.l for at least one value ofj, £j), is an eigen- value corresponding to the kerne1 K(x,y), and IjJj is a corresponding eigenfunction. This rela- tionship between eigenvalues and eigenfunc- tions corresponding to an iterated kernel and those corresponding to the original kernel is valid even for kernels that are not necessarily symmetric [2]. 
217 H Integral Equations Let a kernel K1n)(x, y) be defined by Kln) ( ) =K ( ) _  <Pi(X)<Pi(Y) x,y x,y L.... ' , i=l Ai where the }'i (i = 1,2, ..., n) are the eigenvalues corresponding to the kernel K(x, y) and the <Pi(X) (i = I, ... , n) are the corresponding ortho- normalized eigenfunctions. Then eigenvalues and eigenfunctions corresponding to K(")(x, y) are those corresponding to K(x, y), with the exception of ..1.1, ..., An and <PI (x), "', <Pn(x). Let <p(x) be any function that satisfies JD(<P(X))2 dx= I. Then the integral J= LL K 2 (x,y)<p(x)<p(y)dxdy assumes the maximum value when <p(x) is an eigenfunction corresponding to K 2 (x, y) with the smallest eigenvalue Ai. Let the eigenvalues }'n of K(x, y) be numbered in order of increas- ing absolute values, so that IAnl.s; IAn+ll. Let <p(x) be any function satisfying Iv o/;(x)<p(x)dx =0 (i = 1,2,..., n), Iv (<p(x))2dx= I for given functions o/i(X) (i = 1, ..., n). Then the maximum value of the integral J above is the least when the set of aU linear combinations of {o/ I (x), ... , o/n(x)} coincides with the set of all linear combinations of {<PI (x), ..., <Pn(x)}, and in this case the maximum of J is attained by some eigenfunction <p(x) corresponding to K 2 (x, y) with the eigenvalue ..1.;+1' The results in this paragraph show that we can obtain eigenvalues by solving a variational problem concerning the integral J. H. Expansion Theorems Let K(x,y) be a continuous symmetric kernel and h(x) be a function square integrable on a bounded closed domain D. Then a function f(x) such that ((x) = Iv K(x,y)h(y)dy can be expanded in the form 00 f(x) = I Cn<Pn(X), n=1 where {<Pi(X)} is a complete orthonormal sys- tem offundamental functions corresponding to K(x,y) and C n == Lf(X)<Pn(X)dX (n= 1,2, ...). 828 The series in the expansion of f(x) converges uniformly. These facts are the content of the Hilbert-Schmidt expansion theorem [5,6,8]. By using this theorem, for a ..1. that is not an eigen- value, we can obtain a solution <p(x) of the Fredholm integral equation (7) with a sym- metric kernel in the form ,  <p;(x) i f( ) ( <p(x) = f(x) + AL., ----=- x <Pi x) dx. II }'i }, D For m  2, iterated kernels can be expanded in the form Km(x,y)= f: <Pi(X)i(Y) i=1 )'i (uniformly convergent). If AR(x, y; ..1.) is a re- solvent of a symmetric kernel AK (x, y), then R(x, y; },) can be expanded as R ( 'A ) =K ( ) + '  <Pi(X)<Pi(y) x,y, x,y ".L.... ' ( ' _ ' ) . 1=1 Ai Ai A If a symmetric kernel K(x, y) satisfies the ineq uality Iv Iv K(x,y)<p(x)<p(y)dxdyO for all <p(x), it is called a positive (semidefinite) kernel. If in this inequality the equality holds only for <p(x) =0, K(x,y) is called a positive definite kernel. For a positive definite kernel, eigenvalues are all positive, and the kernel can be expanded in the form K(x, y) = f: <Pi(X)<Pi(y) i1 Ai (uniformly convergent). This result is called Mercer's theorem. When a real continuous kernel K(x, y) is not symmetric, we consider two positive kernels K'(x, y) and K"(x, y) defined by Iv K(x,s)K(y,s)ds=K'(x,y) and Iv K(s,x)K(s,y)ds=K"(x,y). The eigenvalues corresponding to these ker- nels are the same, and they are all positive. Let ),? (i = 1,2, ...) be these eigenvalues and {<Pi(X)} and {o/i(X)} be the corresponding complete orthonormal systems corresponding to K' and K", respectively. Then we have Ai Iv K(y,X)<Pi(y)dY=o/i(X), Ai Iv K(x, y)o/i(y)dy= <Pi (x). 
829 Let f(x) be an arbitrary function such that f(x) = Iv K(x,y)h(y)dy, where h(x) is a function square integrable on D. The function f(x) can then be expanded in the form 00 f(x) = I Ci<Pi(X), i=l where C i == Iv f(x) <Pi (x) dx (i = 1,2,...). The series in the expansion (8) converges uniformly. The Fredholm integral equation (1) of the first kind with a general kernel (i.e., a kernel that is not necessarily symmetric) has a square integrable solution <p(x) if and only if f(x) has a uniformly convergent expansion (8) and L;;! (C);)2 < +00. When this condition is satisfied, equation (1) has a solution given by L! cnAn<Pn(x) that converges in the sense of tmean convergence. It should be noted that the theory concern- ing symmetric kernels can be extended to complex-valued Hermitian kernels, Le., kernels such that K(x,y)= K(y,x) . Also, we can obtain the theory in Section G and this section, con- cerning Fredholm integral equations with continuous kernels, by using the methods of functional analysis that treat SDK(X,y)<p(y)dy as a tcompact operator in the space of con- tinuous functions ( 68 Compact and Nuclear Operators). I. Kernels of Hilbert-Schmidt Type Kernels of Hilbert-Schmidt type are kernels which are square integrable in the sense of Lebesgue over D x D, where D is an arbitrary domain. Most of the results mentioned in the previous section concerning integral equations with kernels continuous on bounded domains are valid also for equations with kernels of Hilbert-Schmidt type, because every operator mentioned in the previous section is also a compact operator in the space concerned, i.e., the space L2(D) [6] ( 68 Compact and Nuclear Operators). J. Singnlar Kernels For general kernels that are not necessarily continuous, the theory described in the previ- ous sections does not apply properly, but when an iterated kernel Km(x, y) has a resolv- 217 J Integral Eqnations ent Rm(x, y), we can find a resolvent R(x, y) of K(x,y) in the form (8) R(x, y) = Rm(x, y) + Hm(x, y) + Iv Rm(x, s)Hm(s, y)ds, where Hm(X,Y)=L1 Ki(x,y). When a kernel is of the form AK (x, y), the relationship be- tween eigenvalues and eigenfunctions corre- sponding to an iterated kernel and those cor- responding to the original kernel, which was stated in Sections G and H, is still valid for general kernels. Ifa kernel K(x,y) is continu- ous for x # y and has a singularity of the form Ix - yl-' (0 <:x < I) on x = y, the iterated ker- nels Km(x, y) are continuous provided that (1-ex)m::;:, 1. tGreen's functions of partial dif- ferential equations of elliptic type have this property. A kernel that is not square integrable is called a singnlar kernel. An integral equation whose domain of definition is unbounded or whose kernel is singular is called a singnlar integral eqnation [9]. Singular integral equa- tions have some particular properties that are not seen in ordinary integral equations, i.e., integral equations with kernels continuous in a bounded closed domain. For example [10], consider the identity g I rO S inXy ( fi e-'Y+--I--z ) d Y --.); 0 --.}2 ex +y o x = --.} 2 e -'x + ex2 + x 2 ' where ex is an arbitrary real number. This equality shows that for the continuous kernel sinxy, 1 is an eigenvalue and Ffi e-'x + x/(ex 2 + x 2 ) is a corresponding eigenfunction. Since ex is arbitrary, the index of the eigenvalue 1 is evidently infinity. As another example, observe the equality f w 2 e -lx-Yle-i'Y dy =------ze -i'X, -00 l+ex where ex is an arbitrary real number. From this equality, we see that for the continuous kernel e- 1x - Y1 defined on (-00,00), and number ..1.= (1 + ex 2 )/2 greater than or equal to 1/2 is an eigenvalue. In this example, the spectrnm, i.e., the set of eigenvalues, is a continuum, Such a spectrum is called a continnons spectrnm. In applications, an important role is played by integral equations with kernels of Carleman type: K(x, y) = G(x, y)/(y  x), where G(x, y) is a bounded function, In inte- gral equations with such kernels, the integral 
217 K Integral Equations is taken in the sense of the tCauchy principal value [5,9,11,12]. For example, the Riemann- Hilbert problem is the following: We are given a simple closed and smooth curve L in the complex plane and real-valued smooth func- tions a, b, and c defined on L, a + ib never vanishing. The problem is to find a function <p(z) which is holomorphic in the exterior of L, at most of polynomial growth at infinity, and continuous up to L with boundary value <p + such that Re[(a+ ib)<p+J =c on L. This prob- lem is reduced to that of finding a function J1 defined on L satisfying an integral equation J1(Z)- f G(z,() J1(Od(=f(z) L z-( where G is a smooth kernel determined by a + ib, r is a known function depending on c, and the integral is taken in the sense of the Cauchy principal value. The integer K defined by K=(1/n)hd(arg(a-ib) is called the in- dex of this problem. The fulI solution of the Riemann-Hilbert problem was given by I. N. Vekua [11]. A multidimensional analog of the Cauchy principal value is the singular integral of A. P. Calderon and A. Zygmund ( 251 Linear Operators). A smooth function k(x) defined in R n except at x == 0 is called a kernel of Calderon-Zygmund type if k(x) is positively homogeneous of order - n and if its integral mean on the unit sphere is zero. Then the operator K defined by (zEL), Kf(x)==lim r k(y)f(x- y)dy e-tO J Iyl>e is called a Calder6n-Zygmund singular integral operator [13]. K is a bounded linear operator in U(R n ) if 1 <p< +00. If n= 1 and k(x)= 1/(nix), K is nothing but the tHilbert trans- formation. The pseudodifferential operator ( 345 Pseudodifferential Operators) is an exten- sion in some sense of the singular integral operator. K. Systems of Integral Equations A system of Fredholm integral equations of the second kind can always be reduced to a single equation. In fact, as is seen easily, a system of integral equations <pj(x) - Ie  I I Kij(x, y)<pj(y)dy = h(X) J 0 (i = 1,2,..., n) can be reduced to a single equation <l>(x)-A J: K(x,y)<l>(y)dy = F(x) (O.s;x.s;n), 830 where <l>(x) = <p;(x - i + 1), F(x) = h(x - i + 1), K(x,y)=Kij(x-i+ l,y-j+ 1) (i-l.s;x,j-l.s;y.s;j;i,j= 1,2, ...,n). A system of Volterra integral equations of the second kind can be reduced to a single equa- tion by eliminating the unknown functions successively. L. Integral Equations of Volterra Type Consider a Volterra integral equation of the first kind r K(x,y)<p(y)dy= f(x) such that K(x, x)#O and KAx, y) and f'(x) are continuous, If we differentiate both sides of the equation, then we have a Volterra integral eq uation of the second kind: f x KAx, y) f'(x) <p(x)+ a K(x,x) <p(y)dy= K(x,x) ' Abel's integral equation of general form is I x G(x,y) - ( ) ' <p(y)dy= f(x) o x-y (O<ex<1). (9) If G, G x , and f' are -continuous and G(x, x) # 0, equation (9) can be reduced to the equation J: H(u,y)<p(y)dy= J: f(x)(u-xr' dx, where H u = 1 u G(x,y)dx ( ,y) ( ) 1-' ( ) " y u-x x-y . Since H(u, u) =(n/sinexn)G(u, u)#O, it follows that I u Hu(u, y) <p(u) +  ( ) <p(y)dy=g(u), o u, u (9a) where d f a g(u)=H(u,u)-'- f(x)(u-xr'dx du 0 =H(U,Uf'(udf(O) + J: (u - xr 1f'(X) dX). Clearly (9a) is a Volterra integral equation of the second kind. Abel's problem ( Section A) was to find the path of a falling body for a given time of descent. The problem then can be reduced to the solution of equation (9) with G(x, y) = 1 and ex = 1/2. When G(x, y) = 1, we 
831 can solve equation (9) explicitly to get <p(x) = sinnexn (xHf(O) + J: (X-t)'-lf'(t)dt) [5]. M. Nonlinear Integral Equations When a nonlinear integral equation includes a parameter A, it may happen that the parameter has a bifurcation point, i.e., a value Ao such that the number of real solutions is changed when }, varies through Ao taking real values. For example, consider the equation <p(x)-A I <p2(y)dy= 1. This equation has real solutions <p(x)=(I:t )1-4..1. )/2..1. for A.s; 1/4 but no real solution for A> 1/4. Hence ..1.0 = 1/4 is a bifurcation point ( 286 Nonlinear Functional Analysis). Among nonlinear integral equations, Ham- merstein's integral equation has been studied in detail [5,14]. It is an equation of the form <p(x) + Iv K(x,y)f(y,<p(y»dy=O. If K(x, y) and f(y, 0) are square integrable and f(y, u) satisfies a tLipschitz condition in u with a sufficiently small coefficient, then the inte- gral equation (10) can be solved by successive approximations. If K(x, y) is a square inte- grable positive kernel and JDIK(x,yWdy is bounded, then we can prove the existence and uniqueness of a solution of (10) under a con- dition on f(y, u) weaker than a Lipschitz con- dition. We can prove similar results for equa- tion (10) with a non symmetric kernel when K(x,y) is continuous in the mean, that is, !!Tx Iv IK(x',y)-K(x,yWdy=O, Iim r IK(x,y')-K(x,yWdx=O. y'y JD A nonlinear Volterra integral equation of the form <p(x) = f(x) + f F(x, y; <p(y))dy can be solved by successive approximations if F(x, y; u) and f(x) are square integrable, F(x, y; u) satisfies a Lipschitz condition IF(x, y; u') - F(x, y; u")I.s;k(x, Y)lu' -u"l with some square integrable function k(x, y), and J: F(x, y;f(y))dy is majorized by some square integrable function of x. When F(x, y; u) and f(x) are continuous, we can obtain theorems on the existence and uniqueness of continuous solutions and tcomparison theorems similar to 217 N Integral Equations those for initial value problems in ordinary differential equations [16]. N. Numerical Solution For the numerical solution of integral equa- tions, we assume throughout that the func- tions appearing are all continuous and the solution of every equation is unique. Methods of numerical solution can be divided roughly into two classes. Methods of one class try to evaluate numerically the analytical solu- tion described in the preceding sections, and methods of the other try to obtain a solution by transforming the problem to one that is numerically solvable. (1) A Method Based on Numerical Quadra- ture. Consider the integral equation r F(x, y, <p(x), <p(y»dy == O. (10) Let a=x I <x 2 <... <x n =b be points on the interval [a, b J and <PI' <P2, ..., <Pn be the values of <p(x) at Xl> X2, .", x n ' By the use of numer- ical quadrature, we then have the following system of equations in <Pi: n L ajF(xi,x j , <Pi' <p)=0 (i=I,2,...,n). jl The method corresponds to that of solving ordinary differential equations by their dif- ference equation analogs. Hence the errors involved in the solutions obtained by this method can be analyzed similarly to those in the case of numerical solution of ordinary differential equations ( 303 Numerical Solu- tion of Ordinary Differential Equations). If the given integral equation is a Fredholm equation of the second kind, then we have a system of linear equations in <Pi' If we apply quadrature formulas to the integral appearing in the integral equation by using the values of <Pi obtained, then we have a formula by which the solution can be evaluated directly, that is, without using an interpolation formula. (2) A Method Utilizing Recurrence Formulas. Let d n and dn(x, y) be the respective coefficients of An in the expansions of Fredholm's determi- nant D(A) and Fredholm's first minor D(x,yJ). They satisfy the recurrence formulas dn(x,y)=dnK(x,y)+ r K(x,s)dn_1(s,y)ds, 1 f b d n + 1 = - dn(s,s)ds, n+ 1 a d o = I, do(x,y)=K(x,y). By the use of these formulas, we can compute 
217 Ref. Integral Equations d n and dn(x, y) successively and hence evalu- ate D(A) and D(x,y;A) approximately, and by means of these recurrence formulas we can readily obtain a solution of a Fredholm equa- tion of the second kind. (3) A Method Utilizing Approximate Kernels. If we replace a kernel by an approximate one in a Fredholm integral equation of the second kind, then we have an integral equation that has a solution approximately equal to the solution of the original equation. Hence if we can find an approximate kernel for which an integral equation can be solved numeri- cally or ana1ytically, then we can find an ap- proximation to the desired solution by solving the modified equation. For such solutions, a method of error estimation was given by F. G, Tricomi [17]. (4) An Iterative Method. Consider the integral equation <p(x) = r F(x,y, <p(x), <p(y))dy. Let <Po(x) be an adequate function, and define <Pn(x) successively by <Pn+d x ) = r F(x, y, <Pn(x), <Pn(y))dy. If the sequence {<Pn(x)} converges, then the limit 1imn <Pn(x) = <p(x) is a solution of the given equation, and hence we can obtain an approximation to a solution by calculating <Pn(x) for some finite n. This method can be used effectively for Fredholm integral equa- tions of the second kind with a parameter ..1., provided that the absolute value of ..1. is smalJer than the least absolute value of the eigenvalues. (5) Variational Method. If some conditions are fulfilJed, an integral equation of the form G(x, <p(x)) + r F(x, y, <p(x), <p(y))dy=O can be regarded as an tEu1er-Lagrange equa- tion for a variational problem l[uJ = r r E(x,y, u(x), u(y))dxdy + r H(x,u(x))dx=extremaI. (11) In this case, we can find a solution of the given integral equation numericalJy by solving the variational problem (11) numericalJy [18]. (6) Enskog's Method. Suppose that {<Pn(x)} is a complete orthonormal system for the Fred- 832 holm integral equation (7). If we put o/n(x) = <Pn(x)-AJK(y,x)<Pn(y)dy, then from (7) we have r <p(x)o/n(x)dx = r f(x) <Pn(x) dx, (12) and furthermore we see that {o/n(x)} can be orthonormalized to yield a complete ortho- normal system {Xn(x)}, The equality (12) then shows that the Fourier coefficients of a solu- tion <p(x) with respect to the system {Xn(x)} can be obtained readily from the Fourier coefficients of f(x) with respect to the system {<Pn(x)}. This method of obtaining a solution is calJed Enskog's method. For Volterra integral equations, methods (I) and (4) can be used effectively. We usualJy transform equations of the first kind into equations of the second kind by differentiation and then apply the above numerical methods, This is done for the sake of securing stability of the numerical methods. References [IJ I. Fredholm, Sur une c1asse d'equations fonctionnelJes, Acta Math., 27 (1903), 365-390. [2J G. Vivanti and F. Schwank, E1emente der Theorie der Lineare Integralgleichungen, Helwingsche Verlagsbuchhandlung, 1929. [3J E. Hellinger and O. Toeplitz, Integral- gleichungen und G1eichungen mit unendlich- vie1en Unbekannten, Teubner, 1928. [4J R. Courant and D. Hilbert, Methods of mathematical physics I, II, Interscience, 1953, 1962. [5] F. G. Tricomi, Integral equations, Inter- science, 1957. [6J K. Y osida, Lectures on differential and integral equations, Interscience, 1960. (Orig- inal in Japanese, 1950,) [7J E. Goursat, Cours d'analyse mathematique III, Gauthier-Villars, 1928. [8J F. Smithies, Integral equations, Cam- bridge, 1958. [9J T. Carleman, Sur 1es equations inte- gra1es singu1ieres a noyau reel et symetrique, Uppsa1a, 1923. [lOJ T. Lalesco, Introduction a 1a theorie des equations integra1es, Gauthier-VilJars, 1912. [IIJ N. I. Muskhelishvili, Singular integral equations, NoordhotT, 1953. (Original in Rus- sian, 1946.) [12J S. G. Mikh1in, Multidimensional singular integrals and integral equations, Pergamon, 1965. (Original in Russian, 1962.) [I3J A. p, Calderon and A. Zygmund, Sin- gular integral operators and differential equa- tions, Amer. J. Math., 79 (1957), 901-921. 
833 [14] H. Schaefer, Neue Existenzsatze in der Theorie nichtlinearer Integralgleichungen, Akademie-Verlag, 1955. [15J L. Lichtenstein, V orlesungen uber einige Klassen nichtlinearer Integra1gleichungen und Integro-differentia1gleichungen, Springer, 1931. [16J T. Sato, Sur I'equation integra1e non lineaire de Volterra, Compositio Math., 11 (1953),271-290. [17] H. Buchner, Die praktische Behandlung von Integralgleichungen, Springer, 1952. [18] L. Collatz, The numerical treatment of differential equations, Springer, 1960. [19] L. V. Kantorovich and V. I. Krylov, Approximate methods of higher analysis, Interscience, 1958. (Original in Russian, 1949.) 218 (VI1.19) Integral Geometry A. General Remarks Integral geometry, in the broad sense, is the branch of geometry concerned with integrals on manifolds, but the problems considered in integral geometry are usually of a more limited nature. If a tLie group G acts on a tdifferenti- able manifold M as a tLie transformation group, G also acts on various figures on M, by which we mean geometric objects such as tsubmanifolds of M, ttangent r-frame bundles on M, etc. Let :F be a set of such figures on M invariant under G (i.e., gFE:F for gEG, FE:F). Consider the following problems: (i) to know whether any G-invariant tmeasure Ji on :F exists, and how to determine Ji if it exists; (ii) to find the integral J <p(F)dJi(F) of functions <p on :F with respect to the measure Ji. The term integral geometry was introduced by W. Blaschke, who considered the special case of problem (ii) in which <p(F) is a function representing geometric properties of F and the integral is to be evaluated by means of the geometric invariants concerning :F[l]. Prob- lems of so-called geometric probability (such as the pro blem of Buffon's needle) belong to this category. The measure Ji is called the kinetic measure (or kinetic density), and dJi(F) is also denoted by dF. If:F has the structure of an n-dimensional differentiable manifold and the measure Ji is given by a tvolume element w (i.e., a positive tdifferential form of degree n), we denote w also by dF. Problem (i) is simple: If G acts ttransitively on :F, then :F = G/H, where H is the tisotropy subgroup of G. In this case :F has the structure of a differentiable manifold, and if a G-invariant t(Radon) mea- sure exists, it is unique up to a multiplicative 218 C Integral Geometry constant. A condition for the existence of a G- invariant measure Ji can be given by means of tHaar measures of G and H ( 225 Invariant Measures). We now consider some examples. B. Crofton's Formula Let G(p, 1.1) be a straight line defined by the equation Xl cosl.l+x 2 sinO=p with respect to orthogonal coordinates in a Euclidean plane. Let n(p, 1.1) be the number of intersections of G(p, 1.1) with a curve C of length L. Then we have Crofton's formula, f n(p,O)dpdl.l=2L, (1) where dpdO is the texterior product of the differential forms dp, dO of degree 1, and the integral is extended over pE( -00,00) and OE [0, 2n]. C. Poincare's Formula and the Principal Formula of Integral Geometry The kinetic density dF of a figure F congruent (with the same orientation) to a fixed figure in a Euclidean plane is defined as follows: Let R be an orthogonal frame attached to F, (x I' X2) be the coordinates of the origin of R with respect to a fixed orthogonal frame Ro, and 1.1 be the angle between the first axis of Rand the first axis of Ro. If we put dF=dx 1 dx 2 dl.l (exterior product), dF has the following in- variance properties: (i) dF is not changed by displacements of F; (ii) dF is not changed if instead of R we take another orthogonal frame R' attached to F. Let two plane curves C" C 2 of length Ll' L2' respectively, be given, and suppose that C 1 is fixed and C 2 is mobile. If the number of intersections of C 2 in an arbitrary position with C 1 is finite and equal to n, then the in- tegral of n extended over all possible positions of C 2 is given by f ndC 2 =4L 1 L 2 (2) (Poincare's formula). L. A. Santal6 applied this result to give a solution of the tisoperimetric problem (1936). Let C 1, C 2 be two plane tJ ordan curves of length L 1 , L 2 , respectively, and let SI> S2 be the areas of the domains bounded by C l , C 2 , respectively. Suppose that C I is fixed and C 2 mobile, and let X be the number of connected domains common to the domains bounded by C I' C 2 for C 2 in an arbitrary position. Then the integral of X extended over all possible 
218 D Integral Geometry positions of C 2 intersecting C I is given by f XdC 2 =LlL2 +2n(SI +S2) (Blaschke [IJ). This is the principal formnla of integral geometry. Many formulas can be derived from it as special cases or limiting cases. D. Generalization to Dimension n The kinetic density of subspaces of dimension k in a Euclidean space and in a spherical space of dimension n was given by Blaschke, while the generalization of the principal formula (3) to a Euclidean space of dimension n was given by S. S. Chern, applying the methods of E. Cartan. Let (e I, ... , en) be a positively oriented ortho- normal frame with vertex A. The infinitesimal relative displacements are then given by n dA = '\' w.e L,; l t, i=1 n de.= '\' w..e. I  J J' j1 where wi=(dA,eJ, wij=(dei,e)= -Wji are differential forms of degree 1 in the orthogonal coordinates of A and the n( n - 1 )/2 variables that determine e l ,..., en' For various posi- tions of a figure, we take an orthogonal frame (A, e l , ..., en) fixed to this figure and form the texterior product dK = /\ w, /\ w" , 'i i i<j of aU the Wi and (J)ij (i <j). This has the invar- iance properties (i) and (ii) of Section C and is, by definition, the kinetic density of the fig- ure in an n-dimensional Euclidean space. Moreover, the kinetic density of dE of k- dimensional subspaces E can be obtained by considering the orthogonal frames such that the vertex A and e 1, ... , ek lie on E and by forming the exterior product of the corre- sponding w., W'A (ex = 1, ...,k;},=k+ 1, ...,n), dE = !\W,!\W'A' Let L be a compact orientab1e hypersurface of class C 2 in an n-dimensional Euclidean space, and let k, (ex = 1, ... ,n -1) be the tprin- cipa1 curvatures at a point on L. Denote by Si the te1ementary symmetric form of degree i in k, (i = 1, ... , n -1), and put So = 1. Then con- sider the in tegrals over L: M i = J..Sids/(nl} i=0,1,...,n-1, (4) where dS denotes the surface element of L. Let DJ, Dz be the domains bounded by two com- pact orientab1e hypersurfaces L I , L2 of class C 2 with volume VI, V 2 , respectively, and let 834 (3) MP), Mf2) be the integrals M i defined by (4) for LI> L 2 . If LI is fixed, L 2 is mobile, and the tEuler- Poincare characteristic X of DI n D 2 is finite, then the generalization of (3) has the form f X dL 2 = II ... In-I ( M1)1 V 2 + M2)1 VI 1 n- 2 ) + - '\' M (1)M(2)  k n-2-k nk=O (5) (Chern's formnla), where dL 2 is the kinetic density of L2 and Ik (k = I, ..., n -1) is the area of the unit sphere in a Euclidean space of dimension k + 1, with the integral extended over aU positions of L 2 intersecting LI . Let dE be the kinetic density of the sub- spaces E of dimension k intersecting a com- pact orientab1e hypersurface L of class C 2 , and let X be the Euler characteristic of the intersec- tion of E with the domain bounded by L. The integral J X dE extended over all hyperplanes of dimension k intersecting L is proportional to M k relative to the hypersurface L defined by (4). This fact generalizes (1). Further generali- zations were obtained by Chern (1966). E. Other Generalizations For 2-dimensional spaces of constant curva- ture, Santal6 derived formulas analogous to those in a Euclidean plane (1942-1943) and thus solved the isoperimetric problem in these spaces. In 1952, he derived a formula corre- sponding to (5) in n-dimensional spaces of . constant curvature, foUowing Chern's method. He investigated further integral geometry in affine, projective, and Hermitian spaces. Chern and others obtained the results of the previous sections by Cartan's method of gen- eral moving frames and studied integral geom- etry in the setting of the geometry of Lie transformation groups in the sense of F. Klein ( 137 Erlangen Program). Chern, P. Griffiths, and others studied the value distribution theory of ho1omorphic mappings in several complex variables from the point of view of integral geometry (1961) ( 21 Analytic Functions of Several Complex Variables, 124 Distribution of Values of Func- tions of a Complex Variable). F. Radon Transforms Another important topic of integral geome- try is the theory of Radon transforms. Let g; be the set of hyperplanes (w,p)={xERnl(x,w) = p} in the Euclidean space R", where W = (AI, ..., An) is a unit vector, (x, w) = L X)i' and p is real. For a function f defined in R n , define 
835 j()=j(w,p) on.? by .i()= If(X)dX' where dx is the tvolume element on the hyperplane  such that dx /\ L Ai dXi = dx, with dx the volume element of Rn. Then i is called the Radon transform of[ For example, if f is the tcharacteristic function of a bounded domain V, the value i() of the Radon trans- form .i of f at  E.? is the volume of the sec- tion of V by. Now the group G oftmotions of Rn (the tconnected component of the iden- tity of the group of isometries) acts on .9' tran- sitively. For every xERn, the tisotropy sub- group G x of G with respect to x acts transi- tively on the set x= g E.? I XE }. Since G x is compact, there exists a unique normalized G x - invariant measure Ji on x such that Ji(x) = 1. For a function 9 on .?, the conjugate Radon transform g can now be defined by g(x)= r g()dJi() JXES as a function on Rn. The determination of g belongs to problem (ii) of integral geometry mentioned in Section A. In particular, it is important to determine g =cir for 9 = i and to find the relation be- tween (ir and f These problems were solved by J. Radon for n=2, 3 and by F, John in the general case, The results can be formulated as follows: In the case of odd n, Jet Y be the space of trapidly decreasing COO-functions ( 168 Function Spaces). Let .51'*(.?) be the set of g(w,p) EY(.?) such that JCXJoog(w,p)pkdp=O for every natural number k and every w. For every fEY(Rn) and every gEY*(.?), we then have f=c!:J.(n-I)/2«in g=cVn-l)/2((iln, where!:J. is the tLaplacian in Rn and Lg(w,p) = d 2 g(w, p)ld p 2, c= r(n/2) -I (2ni)l-n n n/2. In the case of even n, for every f E Y'(R n), gE9'*(.?), f =C 1 J l «in g=C2 J 2«(iJ)'), where J 1 (/)(x) = LJ(Y)IX-YII-2ndY, J 2 (g)(w,p)= L g(w,p)lp-ql-ndq. These integrals are in general divergent, and they must be interpreted as regularizations defined by analytic continuation with respect to the powers of Ix - yl or Ip - q I [9]. John applied the Radon transform on Rn to the study of partial differential equations in Rn with constant coefficients [8]. 218 G Integral Geometry (6) A formula corresponding to tPlancherel's theorem and an analog of the tpaley- Wiener theorem for the Fourier transform are valid for the Radon transform (I. M. Gel'fand et al. [6]). G. Horospheres (7) The theory of the Radon transform is also important in noncompact tsymmetric Rie- mannian spaces M. The connected component G of the identity in the group of isometries of M is isomorphic to the tadjoint group Ad G and can be considered as a linear group. Maxi- mal tunipotent subgroups of G are conjugate to each other. If N is such a subgroup, we call the torbits on M of gNg- 1 horospheres on M for 9 E G. These correspond to the hyperplanes in Rn, If M is the complex upper half-plane with the thyperbolic non-Euclidean metric, the horospheres are precisely the circles tan- gent to the real axis and the straight lines parallel to the real axis. The group G acts transitively on the set .9' of horospheres on M, and we have.? = GIMoN. Here G=KAN is an tIwasawa de- composition of G, and Mo is the tcentralizer of A in K. For a horosphere E.?, let dx be the volume element on  with respect to the tRiemannian metric on  induced by the Rie- mannian metric on M, and define the Radon transform i of a function f on M by (6) as a function on.? For every xEM, there exists a uniq ue normalized meas ure on x = g E.9' I XE} invariant under the (compact) isotropy subgroup G x of G at x (p(x) = 1). The conju- gate Radon transform g of a function 9 on .? is defined by formula (7) by means of this measure Ji. Then there exists an integrodif- ferential operator /\ such that if /\ * is the adjoint operator, we have the inversion for- mula f = (/\/\ * i)' and Plancherel's theorem: L If(xWdx= L l/\lrWd(, where dx, d are G-invariant measures on M, .?, respectively, and r is an arbitrary Coo_ function with compact support. If the tCartan subgroups of G <ire conjugate to each other, /\ is a differential operator; the inversion formula can then be written in the form f = L«in with some differential operator L on M [9]. S. Helgason applied the Radon transform on noncompact symmetric Riemannian spaces to solve differential equations on these spaces (1973). Horospheres and Radon transforms can be defined not only for symmetric Riemannian spaces GIK, but also for various thomoge- neous spaces GIH of noncom pact semisimple 
218H Integral Geometry Lie groups G. The Radon transform r --> i maps a function f on G/H into a function on the space of horospheres on G/H. If a tunitary representation U of G is realized in a function space over G / H, the Radon transform r -->.1 helps to clarify the properties of U by going over to the function space on ,?iF. In several examples, Gel'fand, He1gason, and others have by this method decomposed U explicitly into direct integrals of irreducible representations [6,7,9,10]. Further work on the Radon transform on tsymmetric Riemannian spaces (e.g., compact symmetric Riemannian spaces of rank one and tGrassmann manifolds) has been done by He1gason and others (1965). A generalization of Radon transforms to differential forms has also been given by Gel'fand and others (1969). H. Another Generalization Integral geometry can also be investigated in spaces admitting no displacement. Let F(X I ,X2'YI'Y2) be positive and homoge- neous of degree 1 with respect to Yl, Y2, and consider a tstationary curve of the integral J F(x 1, X 2 , dx l/dt, dX2/dt)dt. Let Pi = of/oYi (Yi = dxddt) along this curve. Poincare found that for a 2-parameter set of stationary curves, dX I dpl +dx 2 dp2 is not changed by any dis- placement of the line element (Xi' Pi) along a stationary curve, Blaschke took this as the kinetic density of the stationary curve and proved a formula containing formula (1) as a special case. On the other hand, Santa16 intro- duced kinetic density for sets of geodesics on 2-dimensiona1 surfaces and proved a generali- zation of formula (2). The study has been fur- ther extended to tfoliated manifolds. Some uniqueness theorems for various integral geo- metric problems with applications to the study of the earth's internal structure from seismo- logical data have been given by V. G. Roma- nov [11]. References [IJ W. Blaschke, Vorlesungen iiber Integra1- geometrie, Teubner, 1,1936; 11,1937. [2J S. S. Chern, On integral geometry in Klein spaces, Ann. Math., (2) 43 (1942),178-189. [3J S. S. Chern, On the kinematic formula in the Euclidean space of n dimensions, Amer. J. Math., 74 (1952),227-236. [4] L. A. Santalo, Introduction to integral geometry, Hermann, 1953. [5] L. A. Santalo, Integral geometry and geo- metric probability, Addison-Wesley, 1976. [6J 1. M. Gel'fand, M. 1. Graev, and N. Va. Vi1enkin, Generalized functions V, Integral 836 geometry and representation theory, Academic Press, 1966. (Original in Russian, 1962.) [7J I. M. Gel'fand and M. 1. Graev, Geometry of homogeneous spaces, representations of groups in homogeneous spaces and related questions of integral geometry, Amer. Math. Soc. Transl., (2) 37 (1964), 351-429. (Original in Russian, 1959.) [8J F. John, Plane waves and spherical means applied to partial differential equations, Inter- science, 1955. [9J S. He1gason, A duality in integral geome- try; some generalizations of the Radon trans- form, Bull. Amer. Math. Soc., 70 (1964), 435- 446. [10] S. Helgason, A duality for symmetric spaces with applications to group representa- tions, Advances in Math., 5 (1970), 1-154. [I IJ V. G. Romanov, Integral geometry and inverse problems for hyperbolic equations, Springer, 1974. (Original in Russian, 1969.) [12J R. Deltheil, Probabilites geometriques, Gauthier-Villars, 1926. [13J D. G. Kendall and P. A. Moran, Geo- metrical probability, Griffin, 1963. [14J M. I. Stoka, Geometrie integra1e, Gauthier- Villars, 1968. 219 (XIII.14) Integral Invariants A. Poincare's Integral Invariants Let us view a system of differential equations dxddt =X i (X 1 , .., ,X n , t) (i= 1,2,..., n) as defin- ing the motion of a point whose coordinates are (XI, ..., x n ) in the n-dimensional space Rn at time t. Let K be a p-dimensiona1 manifold (1.s; p.s; n) in R n , and let K, be the set occupied at the instant t by the points which occupy K for t = O. If the integral f F(xI,,,,,x"'t)dw, K, (1 ) where dw is the t vo 1ume element of K" does not depend on t for any p-dimensiona1 surface K, then J F(x 1 , ...,xn,t)dw is called an integral invariant of degree P of the original system of differential equations. (The tdifTerential form F dw is also called an integral invariant.) In particular, a necessary and sufficient condi- tion for an integral J M (x, t) dx I ... dXn to be an integral invariant of degree n is oM /ot + L7 I o( M X;j/OXi = O. Furthermore, a neces- sary and sufficient condition for an integral JL71 Mi(x, t)dx i to be an integral invariant of degree 1 is oMdot+L;l«oMdox)Xj+ (DXdox)Mj) =0. If the integral (1) does not 
837 depend on t for any closed p-dimensional surface K, then f F(x"..., x n , t)dw is called a relative integral invariant of degree P (l.s;p.s; n-I). Corresponding to this terminology, an integral invariant is sometimes called an ab- solnte integral invariant. If a differential form 0 is a relative integral invariant of degree P, then its exterior differ- ential dO is an absolute integral invariant of degree P + 1. For a Hamiltonian system dpi/dt= -UH/Jqi, dqi/dt = UH/UPi (2) (i= 1, ..., n,H =H(p,q, t),P=(PI'''' ,Pm),q= (ql,...,qm)), the I-form m w= L Pidqi i=l is a relative integral invariant. The integral of w on a closed curve  L I Pi dq i plays a role in classical quantum theory. The absolute integral invariant Q=dw= LI dpi/\dqi has, as a differential2-form on R 2m, the properties: 0. is a closed form, that is, dQ=O. (4) Let ( be a ttangent vector (at any point), 1£0.«(,1])=0 for any tangent vector I] (at the same point), then (=0. (5) Also, for a Hamiltonian system (2), f . , . fdPI...dPmdql...dqm is an integral invariant. In other words, a 2m-dimensional figure in (PI' ..., Pm' q I, ..., qm)-space may change its form according to the motion of points, but its volume remains unaltered (Lionville's theo- rem), This fact is of importance in applica- tions to tstatistical mechanics. Poincare developed the theory of integral invariants and applied the theory to the tthree- body problem and the problem of tstability [I]. B. Cart an's Extension Poincare treated the position (x I, ." , x n ) and the time t separately. E. Cart an extended Poincare's theory by unifying the treatment of position and time. We may consider the solution curves of a Hamiltonian system (2) through all points of a closed curve C in (2m + I )-dimensional space (p, q, t) together with the tube consisting of these solution curves. For any closed curve C 1 lying on and enclosing the tube, we have fcL=1 Pidqi- H dt= k LI Pidqi-H dt, and L I Pi dqi - H dt is called Cartan's relative integral invariant. If the curve C lies on t = constant, then f w is a relative integral invar- iant, in Poincare's terminology. 219 Ref. Integral Invariants The exterior differential of this form d(w- H dt) = 0. - dH /\ dt has the property: Let (p(t), q(t), t) be a solution curve for (2). For the velocity vector (I =(p(t), q(t), 1) and any vector (2 in (p, q, t) space, we have (Q-dH /\dt)((I, (2) =0. (*) Therefore the integral of the 2-form over a tube consisting of solution curves equals O. Applying this to the tube enclosed by C and C I , we have the former equality, (*) also implies that a solution of (2) is de- rived from the tvariational problem of finding the extrcmal of the functional (3) f 'l [ t Pj(t)qj(t)-H(P(t),q(t),t) ] dt to )-1 for the curves (p(t), q(t), t) (to.s; t.s; t I) satisfying q(ti)=qi (i=0, I) for given to, t l , qO and ql (without any condition for p=p(t)). If H is given by the tLegendre transforma- tion H=LIPAi-(T-U), where T=T(q,q) is a kinetic energy, U = U(q) is a potential, and Pi = OT/Oqi, this variational problem considers a wider class of curves than tHamilton's prin- ciple <5 f(T- U)dt=O (because there are no restrictions on p(t)). However both variational problems give the same extremal [3]. The vector =(p(t),q(t)) is reconstructed by means of 0. and dH as follows. Put (I =«(, 1) and (2 = (I], 0) (I] is a vector in (p, q) space) in (*), then 0.«(,1])= -dH'1] for any 1]. (6) Such a ( is uniquely determined by virtue of (5). C. Symplectic Strnctnre Let M be a 2m-dimensional differentiable manifold. A differential form 0. of degree 2 on M is called a symplectic strncture if it satisfies (4) and (5). And then (M, 0.) is called a symplec- tic manifold. For H: M --> R, we define the vector field ( over M by (6) (at each point XEM and I]E TxM). H is called a Hamiltonian fnDCtion (independent of t) and ( is called a Hamil- tonian vector field. In this case, we also have an argument similar to that for the case of the usual Hamil- tonian systems in Euclidian space. For exam- ple, H is invariant along the flow generated by the vector field (, and 0. is an integral invariant. References [IJ H. Poincare, Les methodes nouvelles de la mecanique celeste III, Gauthier-Villars, 1899. 
220 A Integral Transforms [2J E. Cartan, Lecons sur les invariants in- H:graux, Hermann, 1922. [3J V. 1. Arnold, Mathematical methods of classical mechanics, Springer, 1978, 220 (X.27) Integral Transforms A. General Remarks Given (real- or complex-valued) functions f(y) and K(x, y) such that their product is tinte- grable as a function of y in the interval [a, b], we set g(x)= r K(x,y)f(y)dy. This transformation of f to 9 is called the inte- gral transform with the kernel K(x,y). Now fix the kernel K(x,y). When the correspondence f -->g given by (1) from a set of functions f to a set of functions 9 is bijective, we can con- sider the inverse transform g--> f The for- mula that describes the inverse transform g--> f in terms of an integral transform is called the inversion formula. The kernel K (x, y) can often be written as k(x - y), k(xy), etc., where k(t) is an integrable function. Table 1 contains integral transforms that are important in applications. Table t Kernel Interval Name e ixy (-00,00) Fourier transform cos xy (0,00) Fourier cosine transform smxy (0,00) Fourier sine transform e -xy (0,00) Laplace transform vV JAxy) (0, 00 ) Hankel transform Ij(x-y) (-00,00) Hilbert transform x y - I (0,00) Mellm transform (x+y)-P (0,00) StieltJes transform e-(X-V)' (-00,00) Gauss transform In the Hankel transform, J v is the tBessei function. In the Hilbert transform, the tprin- cipal value is to be taken in the integral. In the Stieltjes transform, p is assumed positive. Since the explanations for the Fourier trans- form and Laplace transform are given in the corresponding articles, we deal here only with generalized Fourier, Hilbert, Mellin, and Stieltjes transforms. 838 B. Generalized Fourier Transform Suppose that the kernels of the transform (1) and its inverse transform are both of the form k(xy). Hence f(x) = r k(xy)g(y)dy. (2) (1 ) In this case, we call the integral transform (1) the generalized Fourier transform of symmetric type or the Watson transform, and k(t) the Fourier kernel of (2). The functions ..j2h cos t, J2Fc sint, and jtJv(t) defined in the interval (0, 00) are examples of such Fourier kernels ( 160 Fourier Transform). The last kernel gives rise to the Hanke) transform. Let k(x) and l(x) be Fourier kernels. Then the integral transform m(y) of l(x) with respect to the kernel k(xy) is called the resultant of k and l. The resultant of two Fourier kernels is also a Fourier kernel. Suppose that a function K(Ij2+it) satisfies K(I/2 + it)K(I/2 - it) = 1, IK(I/2 + it)1 = 1. Then the function k(t)=K(I/2+it)/(1/2-it) belongs to L 2 ( -00,00). We set X f T kl(x)=-l.i.m. k(t)X- 1 / 2 - U dt 2n Too T (the limit is taken in the tmean convergence of order 2), Then for a function f(x) E L 2 (0, 00), d f ro dt g(x) =- k l (xt)f(t)- dx 0 t exists almost everywhere, and g(x)EL 2 (0, 00). In this case, we have the inversion formula d f ro dx f(t) =- d kl (xt)g(x)- t 0 x and the Parseval identity LX) (/(t))2 dt = LX) (g(X))2 dx, If a function f(x) is invariant under a gen- eralized Fourier transform, then f(x) is called a self-reciprocal function. Such a function f(x) is a solution of the homogeneous integral equation f(y) = r k(xy)f(x)dx. The function x -1/2 is an example of a func- tion that is self-reciprocal with respect to the Fourier cosine transform. Using a function that is self-reciprocal with respect to the Han- kel transform, we can derive the lattice-point formula of number theory: Let r(n) be the number of possible ways in which a nonnega- tive integer n can be represented as the sum of 
839 two square numbers. Set P(x)= L' r(n)-nx, O:;;;n:;;;x where L' means that if x is an integer, we take r(n)/2 instead of r(n). Then f(x) =x -3/2. (P(x 2 /2n) -1) is self-reciprocal with respect to the Hankel transform with v = 2. Utilizing this, G. H. Hardy proved (1925) - r:: 00 r(n) c: P(x)=y X n1 J;/ 1(2n y nx). A. Z. Walfisz (1926) and A. Oppenheim (1927) generalized this formula and obtained a for- mula concerning the number of ways in which n can be represented as the sum of p square numbers ( 242 Lattice-Point Problems). C. Mellin Transform If f(X)X k - 1 EL 1 (0, 00), then F(s) = too f(x)x s - I dx, s=k+it, is called the Mellin transform of f If f(x) is of tbounded variation in a neighborhood of x, then the inversion formula f(x+O)+ f(x-O) 1 . i k+iT _ 2 -2. hm F(s)x sds 1[11---+00 k-iT holds. If {(X)X k - I / 2 E L 2 (0, 00), the integral S1/Af(x)x S - 1 dx (s= k+ it) tconverges in the mean of order 2 to a function F(s) for a fixed k and - 00 < t < 00, and the Parseval identity f ro 1 f ro If(xWx 2H dx=- IF(k+it)1 2 dt o 2n - 00 holds. F(s) is also called the Mellin transform of f(x). If f(X)X k - I ;2, g(X)X I / 2 - k E L 2 (0, 00), and F(s), G(s) are the Mellin transforms of f(x), g(x), respectively, then f ro 1 i k+iOO f(x)g(x)dx=- F(s)G(I-s)ds. o 2n k-ioo The theory of the Mellin transform in the function space Lp is analogous to the theory of the Fourier transform [1, ch. 4]. D. Stieltjes Transform For a function ex(t) of bounded variation, f ro dex(t) f(s)= 0 (s+tjP ' is called the Stieltjes transform of ex(t). Usually we assume that p == 1. If the Laplace transform is applied twice in succession to ex(t), we obtain formally the Stieltjes transform of ex(t). The Stieltjes transform has been studied systemati- p>O, 220 Ref. Integral Transforms cally in connection with the theory of the Laplace transform by D. V. Widder, R. P. Boas, and others. Assume that p = 1. Let D be the domain obtained from the complex plane by removing the negative part of the real axis. If the Stieltjes transform converges at a point s = So E D, then it converges uniformly on any compact set in D. The inversion formula is 1 f ' lim -----; (/( -O"-il'J)- f( -0"+ il'J)) dO" n+02nl 0 =(ex(t+O)+ex(t-O)-(ex( +O)+ex( -0)))/2, t>O. If ex(t) = J <p(u) du and <p(t::J: 0) exist, then 1 lim -----;(/( - t - il'J) - f( - t + il'J)) n+02nl =(<p(t+0)+<p(t-0))/2, t>O. E. Hilbert Transform Let <p(z) = U(x,y)+iV(x,y) (z=x+iy) be holomorphic in the upper half-plane and f(x) =U(x,O), g(x)== - V(x,O) be the respective boundary values on the real axis. Then if f, gELI (-00,00), 1 f ro f(x+t) g(xJ=-p.v. -dt, n -00 t 1 f ro g(x+t) {(x) = --p.v. -dt. n -00 t (3) (4) Here p.v. means Cauchy's tprincipal value, that is, p.v. I-oooo F(t)dt = lim (f -t F(t)dt+ f A F(t)dt ) . A-HX),€---+O _ A [; We call 9 the Hilbert transform of f If f E L 2 ( - 00, 00), the inversion formula and the Parseval identity hold. More precisely, for any fEL 2 ( -00,00), relations (3) and (4) above hold almost everywhere, 9 E L 2 ( -00,00), and the Lrnorms of f and 9 are identical ( 160 Fourier Transform). The importance of Hilbert transforms lies in the fact that they establish relations between the real and imaginary parts of an analytic function ( 132 Elementary Particles C). References [1 J E, C. Titchmarsh, Introduction to the theory of Fourier integrals, Oxford Univ. Press, 1937. 
221 A Integration Theory [2] I. I. Hirschman and D. V. Widder, The convolution transform, Princeton Univ. Press, 1955. For formulas of integral transforms, [3] A. Erdelyi (ed.), Tables of integral trans- formations I, II, McGraw-Hill, 1954. [4] G. A. Campbell and R. M. Foster, Fourier integrals for practical applications, Bell Tele- phone Lab., 1931, revised edition, Van Nos- trand, 1948. [5J F, Oberhettinger, Tabellen zur Fourier Transformation, Springer, 1957. 221 (X.13) Integration Theory A. General Remarks The class of Riemann integrable functions is not closed under limits and infinite sums. For example, the function equal to 1 for rationals and to 0 for irrationals, the so-called Dirichlet function, is not Riemann integrable, though it is expressed by Iimmw Iimnoo(cos(m! nx))2n. Hence the concept of Riemann integral is too narrow to be used effectively in modern anal- ysis. To cope with this difficulty, H. Lebesgue [IJ introduced a general integral which is now called the Lebesgue integral. It is not only defined for all useful functions appearing in analysis but also has nice properties, such as exchangeability with limits and infinite sums under some simple conditions which can be checked easily. The definition of the Lebesgue integral is based on the concept of Lebesgue measure ( 270 Measure Theory F), which is a generalization of length, area, or volume. In modern analysis one discusses integrals in an abstract space endowed with a measure and defines the Lebesgue integral as a special case. Integration theory plays a basic role in mod- ern mathematics, in particular in analysis, functional analysis, and probability theory. Let f(x) be a bounded function defined on a bounded interval [a, b]. f(x) is tRiemann inte- grable if and only if it is continuous talmost everywhere with respect to the Lebesgue mea- sure. If ((x) is Riemann integrable on [a,bJ, then f(x) is Lebesgue integrable on [a, bJ and the integrals coincide with each other. How- ever, the improper Riemann integral is not included in the Lebesgue integral; for example, (sin x)jx is not Lebesgue integrable on (0, 00), but limawS(sinx)jxdx==nj2. The theory of Denjoy integral gives insight into this situation ( 100 Denjoy Integrals). 840 B. Definition of Integrals Let X be an arbitrary set. If a tcompletely additive class !B of subsets of X and a tmea- sure Ji defined on !B are given, then we say that a tmeasure space (X,!B, Ji) is defined. For example, the Euclidean space R n , the class \fin of all tLebesgue measurable sets in R n , and the tLebesgue measure m n on Win form the measure space (Rn, \JJln, m n ) ( 270 Measure Theory). We consider only !B-measurable sets and !B-measurable functions in this article, so a !B-measurable set will be called simply a set and a !B-measurable function simply a function. We admit :I:oo for the values of a function. The integral SEf(x)dJi(x) on a set EcX ofa real-valued function f(x) (we write simply SE (dJi or SE/) can be defined in steps as fol- lows. (I) Let f(x)?O be a simple function, that is, a function whose range is a finite set {ad (i== 1,2, ...,n). If {(x)=a i for xEE i , where E= U;I Ej, EjnE k == 0 U#k), then we put SEi = Laji'O ajJi(Ej)' (The value of the integral is a real number or +00. For operations concern- ing :I:oo,  270 Measure Theory D.) (2) For an arbitrary f(x)?O, we define SEf as the tsupremum of SEg, where the supremum is taken for all simple functions 9 such that O.s;g.s;f (3) In general, lettingf(x)=j+(x)- F(x), where {+(x)=max{l(x}-O}'F(x)= max{ - f(x),O}, we define SEi = Sd+ - SEi- if at least one of SEf+ and SEi - is finite and say that f has an integral (or a -integral) on E. In particular, if SEf is finite, then we say f is integrable (or ji-integrable) on E. If the given measure space is (Rn, Win, m n ), the integral defined in this section is called the Lebesgue integral (or simply L-integral), and the function that is integrable in this case is said to be Lebesgue integrable. The integral is often written as SEf(x)dx, and if E is the inter- val [a,bJ, as Sf(x)dx. C. Properties of Integrals (1) The set of all functions integrable on E forms a ttinear space over R, that is, if { and 9 are integrable on E, then for any real ex, {1, exf + fJg is also, and I (exi +fJg)dJi=ex IfdJi+fJ I gdJi. (2) The integrability of 1, of both {+ and F, and of If I are mutually equivalent. (3) If g.s; ( on E, then SEg.s;SEf In particular, ifm.s; f(x).s;/ on E, then mJi(E).s;,fEfdJi.s;/Ji(E). (4) If{(x) is integrable on E (has an integral on E), then it is integrable (has an integral) on 
841 every subset of E. (5) If E = Ul En' Ej n Ek = o (j#k), and hfdJi exists, then the series L;,"=I kfdJi converges and is equal to IddJi (complete additivity of the integral). (6) If f(x)? o and IEfdJi=O, then f(x)=O talmost every- where on E. (7) Modification of the values of f(x) on a tnull set influences neither the integrability nor the value of the integral. Consequently, if the function f is not defined on a null set, we can define the value of f on this set arbitrarily so that the integral has meaning. (8) If f(x) is integrable on E, E=:> En' and Ji(En)-->O, then IimnookfdJi=O. (9) If f,,(x) .s;fn+1 (x) on E, and there exists a <p(x) such that fn(x)? <p(x) and IE <p > -00 (for example, iffn(x)?O), then lim r fn= r ( Iim f n ) . n-!'CD J E J E n--+ CX) (10) Iflimnoofn(x) exists almost everywhere on E, and there exists a <p(x) such that Ifn(x)l.s; <p(x) and IE<P < +00 (for example, if Ji(E) < 00 and Ifn(x)1 < M), then Iim r fn = r ( lim fn ) n-+cOJE JE n-+ctJ (Lebesgue's convergence theorem). (11) If there exists a <p(x) such that f,,(x)? <p(x) on E and IE<P> -00, then liminf r fn? l ( liminff,, ) n-+CX) J E E n-+ctJ (Fatou's theorem). (12) If fn(x)?O on E, then IE Ldn = L;,"=dEf". If fn(x) is real on E, then this equality holds for If" I in place of fn. If the common value is finite, then the equality holds also for the original f". (13) Let <p be a mapping from (X,!B, Ji) to (X', !B', Ji') such that B' E!B' implies <p -I (B')E!B and Ji'(B') = Ji(<p -I (B')). If{(x') is !B'-measurable on E' E !B', then IE' f(x')dJi'(x') = L,-I(E)f(<p(x))dJi(x), where the equality means that if one of these inte- grals exists, then so does the other, and they have the same value, D. Indefinite Integrals If we put F(e) = Ie! for each measurable subset e of E, with f(x) integrable on E, F(e) is a t j1- absolutely continuous completely additive tset function (properties of integrals (4), (5), (8)). We call F(e) the indefinite integral of f(x). In the special case where X is the set R of real num- bers and f(x) is integrable on the interval [a,bJ, the function F(x)= I:f(t)dt defined for x E [a, bJ is also called the indefinite integral of f(x). The F(x) so defined is an tabsolutely continuous function. Conversely, if a function F(x) is absolutely continuous on [a, b J, then it 221 Ref. Integration Theory is differentiable almost everywhere on [a, b J, and we have F(x)- F(a) = I: F'(t)dt. (For the relationship between differentiation and in- tegration in the case of R n , or more generally in the case of an arbitrary measure space,  380 Set Functions.) E. Fubini's Theorem Let (X,!B"Jid and (Y,!B2,Ji2) be two O"-finite measure spaces and (X x Y,!B, Ji) be their tdirect product measure space. Assume that f(x, y) is !B-measurable and integrable (has an integral) on X x Y. Then for almost all fixed YE Y, f(x, y) considered as a function of x is !B I-measurable and integrable (has an inte- gral), and Ix f(x, y)dJiJ!x) is a !B 2 -measurable function of y. Moreover, in this case we have f f(x, y)dJi(x, y) XxY = I(tf(x,y)dJidX))dJi2(y) (Fubini's theorem). This fact also holds with x and y exchanged. The integral on the left-hand side of this eq uation is called a multiple inte- gral, while that on the right-hand side is called an iterated (or repeated) integral. Even if an iterated integral exists, the corre- sponding multiple integral need not always exist. For example, let f(x, y) be defined as (x 2 - y2)j(X2 + y2)2 on (0, 1) x (0, 1), and other- wise O. Then IR2f+ = IR,f- = 00, so that IR,f does not exist, but f ro (f rO f(x,y)dX ) dY= -, -00 -00 4 foo (f:oo {(x,y)dy )dX=. By Fubini's theorem, if f is a nonnegative function defined on a !B-measurable subset of a O"-finite measure space (X,!B, Ji), the !B- measurability of f is eq uivalent to the measur- ability of the ordinate set Ef={(x,y)IO.s;y.s; f(x), x E E} considered as a subset of the mea- sure space (X x R, !B', Ji'), which is the direct product of (X,!B, Ji) and (R, \JJl I , md. In this case we have IEfdJi = Ji'(Ef)' which can serve as a definition of the Lebesgue integral. When (X,!B, Ji) coincides with (R, \JJl I , md, the tJor- dan measure (area) of the ordinate set coin- cides with the Riemann integral. References [IJ H. Lebesgue, Integral, longueur, aire, Ann. Mat. Pura Appl., (3) 7 (1902), 231-359. 
222 A Integrodifferential Equations [2J S. Saks, Theory of the integral, Stechert, second edition, 1937. (Original in French, 1933, 1937.) [3J N. Bourbaki, Elements de mathematique, Integration, chs. 1 9, Actualites Sci. Ind., Hermann, 1965-1969. [4J H. L. Royden, Real analysis, Macmillan, second edition, 1963. [5J W. Rudin, Real and complex analysis, McGraw-Hill, second edition, 1974. 222 (XIII.31) Integrodifferential Equations A. General Remarks A tfunctional equation involving an operator T of the form f(t,x'(t),x(t),(Tx)(t))=O, to.s;t.s;t" (1) x(to) = Xo, (2) reduces to a tdifferential equation, tdifferential- difference equation, tintegral equation, inte- grodifferential equation, or other functional equation when the operator T is given a spe- cial form [1,6]. In particular, by letting f(t, x,y, z)=x -g(t,y)-z, f 'l'(') (Tx)(t) = K(t,s,x(s))ds, '0 we obtain an integrodifferential equation f '1'(') x'(t)= g(t, x(t») + K(t, s, x(s))ds. '0 If <p(t) = constant or <p(t) = t, (5) is said to be an integrodifferential equation of Fredholm type or of V olterra type, respectively. B. The Initial Value Problem Let I be an interval to.s; t.s; t 1,10 the interval to < t.s; t I' C(I) a set of continuous functions on I, T an operator such that Tx E C(Io) if x E C(I), 3' a family of all such T, 3' + a subset of 3' consisting of all. T in 3' for which Tx.s; Ty holds at t = s whenever x, yare functions in C(I) satisfying x(t) <y(t) for to.s; t <s for some s E 1 0 , and Z a set of continuous functions which are differentiable on 10' Let M be the ?Iass of all functions f(t, x, y, z) defined for t E . 1 0 , lxi, Iyl, Izl < ex; and satisfying f(t, x I, y, z d;, f(t,X 2 ,y,Z2) (XI ;'X 2 ,ZI .s;Z2)' Suppose that in (1) f(t, x, y, z) is defined for tE 1o, lxi, Iyl, Izl < 00, and TE 3', Suppose fur- ther that for some y > 0 and two solutions x I 842 and X 2 of (1) with (2) belonging to Z, there always exist a function WE M and an operator QE3'+ such that the inequality w(T, x -X'I, X 2 -XI, Q(x 2 -xl)).s;O holds for every tEl o such that x 2 (t)-X l (t)=y and x 2 (t)-X I (t)<y for to<t<t. Furthermore, suppose that there exists a function p EZ satis- fying the following inequalities: O.s;p.s;y, tEl o ; w(t,p',p,Qp»O, tEl o ; p(to + 0) > x 2 (to + 0)- x I (to + 0). Then equation (1) with (2) has at most one solution XEZ. This result can be established by obtaining an estimate for the difference Ix l (t)-x 2 (t)I, where xi(t) is a solution of (1) with the condition x(to) = i (i = 1,2). For the particular case of integrodifferential equations of Fredholm type, suppose that the following conditions are satisfied: (3) g(t'YI)-g(t'Y2).s;L(t)(YI-Y2)' tEl o ; f ' (K(t,s, wl(s))-K(t,s, w 2 (s)))ds '0 .s; N(t) f 'M(S)(W d S)-W 2 (S))dS, tEl; '0 f 'SM(S)dS<oo, tEl o ; '0 (4) N(t)+tL(t).s; 1 +t 2 M(t), tEl o ; where YI;' Y2, LEC(Io), WI' W 2 , M, N EC(I), WI ;,w 2 , M;'O, and N;'O. Then it is possible to obtain more practical expressions for w, Q, and p: w(t,x,y,z)=x-L(t)y-N(t)z, tEl o , QW= f 'M(S)W(S)dS, '0 p(t)=fJt(1 +t)exp f ' s(1 +s)M(s)ds, '0 (5) where # > 0 is sufficiently small. C. Another Problem A problem analogous,to the tboundary value or teigenvalue problems of linear ordinary differential equations is to find a solution of the linear integrodifferential equation (pU')' -qU+A(PU+ L k(x,y)u(y)dy )=0 with the boundary value U ==0, This equation can be derived from the problem of minimiz- ing the functional D[<pJ under the condition 
843 that H[<pJ is constant, where D[<pJ = L p<p'2dx+ L q<p2dx, H[<pJ= Lp<p2dX + LL k(x,y)<p(x)<p(y)dxdy. The orthogonality condition for this boundary value problem is given by L pui(x)ui x ) dx + f f k(x, y)ui(x)uiy)dxdy= { I, i=j G G 0, i # j [2]. Integrodifferential equations are closely related to problems of mathematical physics and engineering, and there are many investi- gations of such equations in the study of the equilibrium of rotating fluids [3J, Prandtl's integrodifferential equation for aircraft wings in 3-dimensional space [4J, the dynamics of reactors, and so on. In the second example, the circulation of the airflow rry) with constant velocity V around the profile is determined by the equation rry) 1 f b/2 drry') cx(y) = +-p.v.-, nk(y)t(y) V n V -biZ Y - y' called Prandtl's integrodifferential equation, where b is the wingspan, y and y' are variables whose range is [ - b/2, b/2J (y is assumed fixed), t the length of the chord, cx an angle of incidence from the point with buoyancy 0, 2nk the slope of the curve defined from buoyancy by the angle of incidence, and p.v. tCauchy's principal value. As a problem having applications in the theory of tstochastic processes, the existence of solutions that have finite limits as t--> 00 has been investigated for the Wiener-Hopf integro- differential equation kD (D + Adf(x) =)'1 ... An {OO f(x+t)dH(t), x;:, 0, D=d/dx. (6) F or this problem, by means of the method of tsemigroups of operators, equation (6) can be extended to Jl (A+,{k)f(X)=}.I...An{OO (T,f)(x)dH(t), xEI, where A is the tinfinitesimal generator of a semigroup of operators {T,}, and it has been 223 A Interpolation shown that analogous results to those for (6) can be obtained for this more genend equation [5J. References [IJ K. Nickel, Fehlerabschatzungs- und Ein- deutigkeitssiitze fUr Integro-Differentialglei- chungen, Arch. Rational Mech. Anal., 8 (1961), 159 180. [2J R. Courant and D. Hilbert, Methods of mathematical physics I, Interscience, 1953. [3J L. Lichtenstein, Vorlesungen iiber einige Klassen nichtlinearer Integralgleichungen und Integro- Differentialgleich ungen, Springer, 1931. [4J W. F. Durand, Aerodynamic theory II, Springer, 1935. [5J S. Karlin and G. Szego, On certain differential-integral equations, Math, Z., 72 (1959-1960), 205228. [6J M. N. Oguztoreli, Time-lag control sys- tems, Academic Press, 1966. [7J N. I. Muskhelishvili, Singular integral equations, Noordhoff, 1953. (Original in Rus- sian, 1946.) [8J T. L. Saaty, Modern nonlinear equations, McGraw-Hill,1967. 223 (XV.2) Interpolation A. Lagrange Interpolation Assume that the values of a function f(x) with some regularity property (e.g., differentiability up to a certain order) are given at each of n + 1 distinct values Xo, XI' "', x n . The method of finding the values f(x) at X #x i by using the values fo= f(x o ), fl = f(x1),...,in = f(x n ) is called interpolation. A function L(x) that ap- proximates f(x) and coincides with f(x) at X = Xo, X I, ... ,X n is called an interpolation function or interpolation formula. We usuaIJy use a polynomial of degree n as L(x). Such a polynomial is caIled a Lagrange interpolation polynomial, and the method of using such a polynomial is called Lagrange interpolation. If we let Il(x) Ii(x) = ( )Il'() , X-Xi Xi n Il(x) = f1 (X-Xi)' i=o the Lagrange interpolation polynomial can then be expressed in the form n L(x) = L Ii(x)/;. i=O 
223 B Interpolation Ii(x) is a polynomial of degree n and is caned the Lagrange interpolation coefficient. Some- times, by interpolation we mean the method of finding f(x) for x lying between the maximum and the minimum of Xi' When x is located elsewhere, such a method is caned extrapola- tion. The Lagrange interpolation polynomial is uniquely determined, and if f(x) is (n + 1)- times differentiable, the deviation of L(x) from f(x) is given by pn+1)()Il(x)/(n + 1 )!, where  lies between the maximum and the minimum of x, x o , x I, .,. , x n ' Conversely, the method of finding an approximate value of x satisfying f(x) = f for a given value of fusing L(x) is caned inverse interpolation. Suppose that we restrict the interval of interpolation to [-1, 1J and require that the points of interpolation Xi be spaced equally, Xi = 2i/n -1, i = 0,1, ..., n. Then, if the number of points n + 1 is increased, the Lagrange inter- polation polynomial L(x) may not converge to f(x) even if f(x) is an analytic function of x on [ -1,1]. For example, this nonconvergence phenomenon is observed when f(x) = 1/(25 + x 2 ) is interpolated; this is called the Runge phenomenon. On the other hand, if we choose the zeros of the tChebyshev polynomial of degree n+ 1 as Xi' i.e., if we take x;=cos{(2i+ l)n/2(n + I)}, i =0,1, ..., n, which is called Chebyshev interpolation, then L(x) converges uniformly to f(x) as n + 1 tends to infinity, provided that f(x) has a bounded derivative on [-1,1]. B. Iterated Interpolation We can compute the value of the interpolation polynomial for given x by generating a se- quence of interpolants each of which involves one more point than the previous one. The following method is caned the Aitken interpo- lation scheme. First we compute 1 fo xo-x IOi== xi-X O h xi-x i = 1,2, ..., n, Table 1 844 and then successively 1 01 k Xk- X 1 01 .ki= Xi-X k . 1 01 k-li xi-x i=k+ 1,...,n. If we continue successive evaluation of 1 0 " 1 012 ,,,, until the successive values coincide within the desired accuracy, then we can ac- cept the converged value 1 01 . n as an approxI- mate value of the Lagrange interpolation polynomial L(x) of degree n at x. In this case the Xi are not necessarily assumed to be ar- ranged in monotonic order. It is better to ar- range them in order of their distance from X rather than in ascending or descending order of magnitude. C. Interpolation by Finite Differences When the interpolation points are equalIy spaced, the values of the interpolation poly- nomial can be expressed in terms of finite differences. Suppose that for Xi = Xo + ih, where h is the grid size, the values fo'/" ... , in of the function f(x) are known. The differences h = I'.  I'. are then caned ( finite) differences of Ji+l Ji first order. Furthermore, we define the dif- ferences of order k + 1 inductively by k+1 f. = kh+1-kh. If we use the shift operator E defined by Eh = h+l, the difference operator is represented as  == E - 1. Sometimes the back- ward difference V = 1 - E- I is used. In contrast to V, the operator  is called the forward dif- ference. The central difference 15 = E 1/2 - E  1/2 which is defined by bh-I/2 = h- f.-I is also used. Table 1 shows the relations between the finite differences and the differentiation operator D defined by Df(x)=df(x)/dx. Table 2, in which each entry after the second column is the difference of the two entries lying immediately to its left is calIed the dif- ference table. From the relation  = V E or  = bE I / 2 we can express each entry of table 2 in terms of V or b. For example, in the second E  8 V hD I + 8 2 2 + 8p. 1 e hD E E I+ I-V 8 2 V e hD -l  E-l  8p. + - I-V 2  V 2sinh(hD/2) EI/2_E-I/2 8 8 (I+)1/2 (I - V)I/2  8 2 1 V I-- V I-E- I 8p.- - e hD I+ 2 log£ 10g(1 +) 2 arc sinh( 8 /2) -log(1 - V) hD hD £1/2+ E -1/2 1+/2 (1+8 2 /4)1/2 I-V /2 cosh(hD /2) p. 2 (I +)1/2 (1- V)I/2 
845 Table 2. Difference Table f-2 N-2 f-I 2f_2 f-I 3f_2 fo 2f_1 4f_2 No 3f_1 fl 2fo NI f2 column, fk is equal to Vfk+1 and i5fk+I/2' If f(x) is a polynomial of degree k, then f(x) is a polynomial of degree k-l, kf(x) is a con- stant, and k+lf(x) is zero. Therefore, looking at the difference table, we can find the degree of an interpolation polynomial that can satis- factorily approximate f(x). It should also be noted that, if the computation of each entry of the table is carried out with a finite number of significant figures, the error in the val ues foJI, ... ,In is multiplied by binomial coeffi- cients corresponding to the location in the difference table. The following are interpolation formulas for which the difference table is used. Suppose that we want to interpolate the value fp at x=xo+ph. Then fromf p =£Pf o =(1 + MPfo, we have Newton's forward interpolation formula: p(p-l) 2 f p = ro+pNo+ fo + p(p-l)(p-2) 3fo+"" 3! Similarly, from f p = £Pfo =(1 - V)-Pfo' we have Newton's backward interpolation formula: p(p+ 1) 2 fp=fo+pVfo+V fo + p(p+ 1)(p+2) V 3 fo+.... 3! We get these formulas by starting at fo and proceeding downward to the right or upward to the right. On the other hand, by proceeding in a zigzag manner, downward to the right, then upward to the right, then again down- ward to the right, etc" we get another formula, called Gauss's forward interpolation formula: p(p-l) 2 f, p =fo+pNo+- f-I 2! + (p+ l)p(p-l) 3f_1 +.... 3! Similarly, we have Gauss's backward interpoIa- 223 E Interpolation tion formula: (p+l)p 2 fp=fo+pN-I+- f-I 2! + (p+ l)p,(p-l) 3f_2 +.... 3. In addition to these formulas, there are several by Everett, Bessel, Stirling, and others, which are essentially equivalent to the Lagrange interpolation polynomial that uses the same tabular points, although the representations are different. D. Interpolation by Divided Differences For points located at unequal intervals, the values of the interpolation polynomials can also be expressed in terms of divided differ- ences, defined successively as I'.= /;-!j Jij , xi-x j /;j - !jk /;jk==-'.... Xi-Xk The divided difference of order k defined as above can be expressed as k /; fOI k=.L JI ' ( ) ' 1""0 Xi k JI(x) = IT (x-x;). j=O The Lagrange interpolation polynomial of degree n is L(x)= fo+(x -xo)fol +(x-xo)(x -x l )foI2 +... +(x-xo)(x-xd... (x-xn-I)fol with the error given by f(x)- L(x)=(x -xo)(x -xd... (x-X n )fOI2 where f012 nx means that the divided dif- ference of first order is calculated from x and f(x) instead of from x j and !j. E. Hermite Interpolation The polynomial H(x) of degree 2n + 1 that satisfies not only H(x)= f(x) but also H'(x) = f'(x) at x = Xo, X I' ... , X n is called the Hermite interpolation polynomial. The Hermite inter- polation polynomial is n H(x)= L {1-2I;(x i )(x-x i )}I;2(x)f(x i ) i=O n + L (x -x;)I?(x)f'(x i ), i=O where Ii(x) is the Lagrange interpolation coef- ficient. The deviation of H(x) from f(x) is given by f(x)- H(x) == f(2n\)JI2(x)/(2n)!, where  lies between the maximum and the minimum of X,XO,X 1 , ".'Xn° 
223 F Interpolation F. Spline Interpolation Let the interval (-00,00) be divided into n + 1 subintervals not necessarily of equal length by n knots Xo= -00 <XI <x 2 <... <xn<x n + 1 = 00. A function S(x) which coincides with a polynomial of degree at most m on each sub- interval (x h X i +1), i = 1,2, "" n, and has con- tinuous derivatives up to order m - 1 at each knot is called a spline, i.e., a spline S(x) is a C m - I function. If S(x i ) = f(x i ), i = 0,1,2, "', n, then S(x) is an interpolation function. Inter- polation by means of a spline is called spline interpolation. The term "spline" derives from the name of an instrument with which drafts- men fair a curve through points. If a spline coincides with a polynomial of degree 2k - 1 on each subinterval and with a polynomial of degree k -Ion (-00, XIJ and [x n , 00), it is called a natural spline, If the data Yi is given at each knot Xi> i = 1,2, '" ,n, and also an integer k not larger than n is given, then the spline S(x) of degree 2k - 1 that satis- fies S(x;) = Yi, i = 1,2,..., n, is uniquely deter- mined. The spline that is most frequently used in practical problems is the natural cubic spline (k = 2). Let f(x) be an arbitrary function of C k (k.s; n) satisfying f(x i ) = Yi at each knot Xi> i = 1,2, ..., n. Then in any interval [a, b J which in- cludes X"X2, ...,X n , f [S(k)(x)] 2 dx.s; f [f(k)(X)] 2 dx holds, where S(x) is the natural spline of degree 2k -1 that satisfies S(x i )= Yi, i = 1,2, ..., n. This inequality is called the minimum norm prop- erty, and, in particular, the minimum curva- ture property when k = 2, of the natural spline. G. Polynomial Interpolation on Triangles Polynomial interpolation on a triangular region is used in the finite element method. The complete polynomial of degree m, m Pm(x,y)== L avxy" {l+V=O can be used as an interpolation polynomial on a triangular region. The number of the coef- ficients av is (m + l)(m + 2)/2. On the other hand, if we divide each side of the triangle into m equal parts and join the points of sub- division by lines parallel to the sides of the triangle, then we have m 2 congruent small triangles the number of whose vertices is (m + l)(m + 2)/2. Therefore if we choose these ver- tices as the points of interpolation, we can determine the coefficients a, uniquely from the data given at these points. This is the La- 846 grange interpolation polynomial of degree m on a triangle. The interpolation polynomial Pdx,y) of degree m = 1 that is uniquely determined from the data u" U 2 , u 3 given at the vertices (x I' Yd, (x 2 , Y2), (x 3 , YJ), respectively, can be expressed as PI (x, y)= U I  I (x, y)+ U 2 2(X, y)+ U33(X, y), where 1 1 i(X,y)= 2S x x j x k , Y Yj Yk 1 1 s=- XI 2 YI X 2 X3 Y2 Y3 (i,J, k) is any cyclic permutation of (1, 2, 3), and the absolute value of S is equal to the area of the triangle. i(X, y) is a polynomial of degree 1 satisfying { I, j=i, i(Xj'Y)= 0 '--J-' , Jrl, and is called the shape function of degree 1 on the triangle. If we choose the vertices together with the midpoints of the sides as the points of inter- polation, we can determine the six coefficients av of the interpolation polynomial P 2 (x,y) of degree 2 uniquely, and the polynomial can be written as 6 P 2 (x,y)= L uiF)(x,y), i=l where 2), i = 1,2, ." ,6, are the shape func- tions of degree 2 on the triangle; these can be expressed in terms of the shape functions of degree 1 as follows: 2><X,Y)=J2i-l), i=1,2,3, !2)(x,y)=412' 2)=423' 2)=431' The Lagrange interpolation polynomial Pm(x, y) given above is defined locally on each triangle in the finite element method, and the function over the whole triangular network is constructed by connecting these Lagrange interpolation polynomials. This piecewise polynomial function of degree m over the whole network is evidently of class Co, and has no continuity of derivatives of higher order in general. A Cl-function, for example, can be obtained from the complete polynomial Ps(x, y) by determining the 21 coefficients from the values and the derivatives up to order 2 at the vertices and the normal deriva- tives at the midpoints of the sides. Alterna- tively, we can impose three conditions so that the normal derivative reduces to a cubic along each side instead of imposing that it coincide with specified data at the midpoint; then we obtain another Cl-function. A variety of interpolation functions are known for 
847 other elementary regions, such as rectangles or tetrahedra. References [1 J P. 1. Davis, Interpolation and approxima- tion, Blaisdell, 1963. [2J A. Ralston and P. Rabinowitz, A first course in numerical analysis, McGraw-HilI, second edition, 1978. [3J 1. F. Steffensen, Interpolation, Chelsea, 1950. [4J P. J. Davis and I. Polonsky, Numerical interpolation, differentiation and integration, Handbook of Mathematical Functions, M. Abramowitz and I. A, Stegun (eds.), Nat. Bur. Stand. Appl. Math. Ser., 55 (1964), 875-924. [5] R. W. Hamming, Numerical methods for scientists and engineers, McGraw-Hili, second edition, 1973. [6] J. H. Ahlberg, E. N. Nilson, and J. L. Walsh, The theory of splines and their appli- cations, Academic Press, 1967. [7J A. R, Mitchell and R. Wait, The finite element method in partial differential equa- tions, Wiley, 1977. 224 (XII.11) Interpolation of Operators A. General Remarks In 1926 M. Riesz proved that if T is a bounded linear operator Lpo(Q)--> Lqo(Q') with norm No and L p1 (Q)-->L q1 (Q') with norm N h l.s;po, Ph qo, ql « 00, simultaneously, then it is a bounded linear operator Lp(Q)--> Lq(Q') with norm « N -0 Nf whenever l/p=(1-8)/Po+0/pl' l/q=(1-8)/qo+0/ql' for 0 < 0 < 1, provided that q;:' p. In 1939 G. O. Thorin removed the restriction q;:' p by devis- ing a proof based on function theory (Riesz- Thorin theorem). Meanwhile, J. Marcinkie- wicz (1939) announced that the boundedness of T: Lp(Q)--> Lq(Q') holds for a quasilinear operator T under weak type assumptions ( Section E (2). From 1959 to 1964, J.-L. Lions, A. P. Cal- deron, J. Peetre, and others extended these results to linear operators from a couple of Banach spaces to another couple, The inter- polation methods provide a powerful and often essential tool in various fields of mathe- matical analysis where estimates of operators 224C Interpolation of Operators playa central role, such as harmonic analysis [4,8,9J, numerical analysis, approximation theory [7], and the theory of partial differen- tial equations [2]. A compatihle couple is a pair {X o , Xl} of Banach spaces (or more general topological linear spaces) that are continuously embedded in a Hausdorff topological linear space, Let {yo, Y l } be another compatible couple. A linear operator T: Xo + Xl --> Yo + Y l is said to be continuous {XO,XI}-->{Y O ' Y I } with norm {No, N l } iffor each 1=0, 1, T: Xl...... 11 is con- tinuous linear with norm N l . An interpolation method is a tfunctor which assigns to each compatible couple {X o , Xl} of Banach spaces a Banach space X with Xo n Xl eX c Xo + Xl such that every continuous linear operator T: {X o , Xd -->{ Yo, Yd induces a continuous linear operator T: X --> y, X is called an interpolation space of {X o , Xd. There are two important types of interpolation methods, the complex method (due to Cal- deron [5J, S, G. Krein, Lions) and the real method (due to E. Gagliardo, Lions, Lions and peetre [6J, Peetre). These methods generalize the classical results of Riesz and Thorin and of Marcinkiewicz, respectively. B. The Complex Interpolation Method (1 ) (2) In this section {Xo,Xd is assumed to be a compatible couple of complex Banach spaces. Let F(Xo, Xd be the Banach space of all bounded continuous functions fm, , =  + it), on the strip O.s; .s; I with values in Xo + Xl' holomorphic in 0 <  < I, and such that for each 1=0, l,f(l+it)) is a continuous and bounded Xl-valued function. IlfIIF(Xo,Xd = max{sup Ilf(it))IIx o ' sup Ilf(1 + it)llxJ is the norm. The complex interpolation space [XO,XIJo, 0<0< 1, is defined to be the Banach space of values f(8) of fEF(Xo, Xd with the norm Ilxllrxo,Xd, =inf{ IlfIIF(Xo,X t ) Ix = f(8)}, Xo n Xl is dense in [X o , XIJo. Let {XO,X I } and {Yo, Y l } be two compatible couples, and let T:{Xo,Xd-->{Yo,} be a bounded linear operator with norm {No, N l }, Then T: [X o , XIJo--> [Yo, YIJo is bounded linear with norm .s; N -0 Nf (interpolation theorem), If a holomorphic family  1(}, 0 < Re' < 1, of linear operators, acting in Xo + Xl into Yo + Y l , fulfills a certain boundedness requirement, then 19 induces a bounded linear operator [X o , XIJo--> [Yo, Jo (E. Stein [9J). C. The Real Interpolation Method The method of this section applies also to compatible couples of tquasinormed spaces (P. 
224D Interpolation of Operators Kree (1967»). If {X o , Xl} is a compatible couple of Banach spaces (resp. quasinormed spaces), then Xo n Xl and Xo + XI are Banach spaces (resp. quasinormed spaces) under the norms J(t,x)=max{llxllxo' tllxllxj and K(t,x)= inf{ Ilxollxo + tllxlllx,l x = Xo +x d, respectively, for any t >0. Denote by L;, 0 <p.s; 00, the Lp- space on (0, 00) relative to the measure dt/t. Then the real interpolation space (X o , Xdo,p' 0<0< I, l.s;p.s; 00 (resp. O<p.s; 00), is defined to be the Banach space (resp. quasinormed space) of all XE Xo + XI such that t-OK (t, X)E L; with the norm Ilxll(x o 'x1)o.P == Ilt-OK(t,X)IIL (Peetre's K -method). Xo n XI is dense in (X o , Xdo,p if P < 00. The continuous embed- ding (X o , Xdo,p e(X o , Xdo,q holds for p.s; q.s; 00. If T: {Xo, Xl} -->{ Yo, Yd is a bounded linear operator with norm {No, N l }, then T:(Xo,Xdo,p-->(Y o , Ydo,p, 0<0< 1, p.s; 00, is bounded linear with norm .s; NJ -0 NT (inter- polation theorem). The linearity or boundedness requirements of T can be weakened considerably (Kree, T. Holmstedt, H. Komatsu (1981). There are several equivalent definitions of the real interpolation spaces for compatible couples {Xo, Xl} of Banach spaces, The J- method gives (X o , XI),p, 0 < 0 < 1, 1 .s; p.s; 00, which is the space of means x == S u(t)dt/t with the norm Ilxll(xo,xtli.p =infu Ilt-oJ(t, u(t))IIL' where u(t) are strongly measurable functions on (0, 00) with values in Xo n XI such that t- O J(t, U(t))E L;. (X o , Xd,p =(X o , XI )o.P holds with equivalent norms. A discrete version of the J-method applies also to couples {Xo, Xd oftquasl-Banach spaces. D. The Reiteration Theorem Let {X o , Xd be a compatible couple of (quasi-) Banach spaces. A (quasi-) Banach space X is said to be of class Ko(Xo, Xd, O.s; O.s; 1, if xonxi eX eXO+X l with continuous embeddings and if there are constants C K and C J such that K(t,x).s;CKtOllxll x , XEX, and tOllxllx.s; CJJ(t, x), XEX o nx l . The class Ko(Xo, Xl) contains (X o , Xl )O,p, 0 < 0 < 1, O<p.s; 00, and, for Banach spaces X o , XI' [X o , XIJo' If 1: is of class K,(X o , Xd and Y p of class Kp(Xo,Xd and ifO.s; ex <fJ.s; 1, then (1:, Yp)A,p=(Xo,Xdo(A),P' 0<)< I, O<p.s; 00, O().)=(1-A)ex+XfJ, with equivalent norms (reiteration theorem). Set l:=[X o ,X t J, and Yp=[XO,XIJ p for Banach spaces Xo and XI' If Xo n XI is dense in Xo, XI and Yon Y, then [Y" YJA = [X 0' X IJO(A), O.s;)..s; 1, with equal norms (re- iteration theorem). 848 E. Examples and Applications (1) Let Lp = Lp(O, Ji) be the Lp-space on a 0"- finite measure space (0, Ji). Then [LpG' L p1 Jo = Lpo holds with equal norms for l/po =(1- OJ/po + O/PI, 0< 0 < 1, 1 .s; Po, PI .s; 00. Hence follows the Riesz- Thorin theorem. In particular, the Fourier transform maps Lp(R n ), I .s; p.s; 2, continuously into Lp.(Rn), p' = p/(p-l) (the Hausdorff-Young inequality). The convo- lution operator K *, K E Lr(Rn), I .s; r < 00, is bounded from Lp(Rn) into Lq(Rn) with norm .s; II K IlL, if P;:' 1, l/q = l/p+ l/r-l ;:, 0 (Young's inequality). For the tHardy space HI (Rn), [Lp(Rn), HI (Rn)Jo = Lq(Rn), l/q=(1-0)/p+0, l<p.s; 00, 0<0 < 1 (Stein and C. F efferman). (2) Recall that the tLorentz space L(p,q)= L(p,q)(O, Ji), 0 <p, q.s; 00, is the tquasi-Banach space of all measurable functions f with Ilfll(p,q)= Iltl/Pf*(t)IIL:< 00, where f*(t) is the trearrangement of f(w), WEO. L(p,p)= Lp and L(p,q) e L(p,r) for q .s;r. If I <p < 00, 1 .s;q.s; 00, then L(p,q) coincides with the Banach space (Loo, Ll)l/p.q under the equivalent norm Iit l / P - I Sf*(s)dsIIL:- More generaIly, ifO<po, PI < 00, Po#p" 0<0< 1, I/Po=(1-0)jpo+ O/p" O<qo, ql, q.s; 00, then (L(PO,qo)' L(P1.q,»0,q = L(PO,q) with equivalent norms. Let T be an operator which maps a space of measurable functions on (0, Ji) into another on (0', Ji'). The inequality II Tfll (q, 00).s; Nil fll Lp holds if and only if Ji'{ w' EO' II Tf(w') I > s}.s; (NllfIIL.ls)q, s>O. Then Tis said to be of weak type (p, q). Suppose that Tis quasi linear, i.e., that T(/ +g) is uniquely determined whenever Tf and Tg are defined and that I T(/ +g)(w')I.s; K {I Tf(w') I + I Tg(w')I} a.e. holds with a con- stant K independent of f and g. The interpola- tion theorem then holds. Therefore, if a quasi- linear operator T is of weak type (Po, qo) and (PI' qd, qo #q" then Tis of type (p, q), i.e., T: Lp(O)--> Lp(O') is bounded for p, q satisfying (1), (2), and q;:' p (Marcinkiewicz's theorem). When Po #p" the same conclusion is obtained if II Tfll(q" 00).s; Ntllfll(p"r,), 1==0, 1, for some rt (R. A. Hunt). The tHilbert transform and the tCalderon- Zygmund singular integral operators are of weak type (1, 1) and of type (2,2). Hence it follows from these facts together with dual- ity that they are of type (p, p) for 1 < p < 00 [4,8,9]. The convolution operator K *, K E L(r, oo)(R n ), 1 <r< 00, is of type (p,q) if p> 1, I/q= l/p+ l/r - 1 > 0 (R. O'Neil). In particular, the convolu- tion with Ixl,-n, O<ex<n, on R n is of type (p,q) for 1 <p<n/ex and l/q= l/p-rx/n (the Hardy- Littlewood-Sobolev ineq uality). For the tHardy spaces Hp=Hp(Rn) and the 
849 tJohn-Nirenberg space BMO=BMO(Rn), (L",,, Hp)o,q=(BMO, Hp)o.q=Hq and (H po ' Hp)o,po =Hpo if 0<0 < 1,0 <Po, PI < 00, l/q=O/PI and I/Po=(1-0)/Po+0/pl (S. Igari, N. Riviere, Y. Sagher, Fefferman, R, Hanks). A tBesov space B,q(Q) is understood most naturally as the real interpolation space (Lp(Q), Wpm(Q))'/m.q of the Lebesgue space Lp(Q) and the tSobolev space Wpm(Q) [2,3]. (3) Let A be a closed linear operator in a Banach space X such that the resolvent (t + A) -I exists for t > 0 and satisfies II t(t + A) -III .s; M. If D(A m) denotes the domain of the inte- gral mth power Am of A equipped with the graph norm, then D;(A)=(X,D(Am))a/m,p' 0<0" < m, 1 .s; p.s; 00, coincides with the space of XEX such that W(A(t+A)-lrxllxEL; and is independent of m. If - A generates an tequicontinuous semigroup T, of class (CO) (resp. a tho10morphic semigroup T, bounded on a sector), then D;(A) consists also of all XEX such that Ilt-a(I - T,rxllxEL; (resp. Ilt-a+mAmT,xllxEL;). There are similar charac- terizations of elements in (X, nD(A%,p for a commutative family {AI, ... , Ad of such operators (Lions and Peetre, P. Grisvard, P. L. Butzer and H. Berens, Komatsu, T. Muramatu), When A = - or Ai = O/OX i in Rn or in a suitable domain Q in Rn, these results give various equivalent characterizations of the functions in Besov spaces B,q(Q). The Sobo1ev embedding theorem for Besov spaces can also be proved from this point of view (Grisvard, Peetre, A. Yoshikawa, Komatsu). The space D;(A) is closely connected with the domains D(A') oftfractiona1 powers A' of A. Df(A)cD(A')cD(A), O"=Reex>O. If O<RefJ<iJ, then D;(A)={xED(AP)IAP xE D;-Rep(A)}. If the pure imaginary powers Ai are 10cally uniformly bounded, then D(A')= [X,D(Am)]Re,/m,O<Reex<m. F. Duality Suppose that {Xo, Xd is a compatible couple of Banach spaces such that Xo n X I is dense both in Xo and in XI' Denote the dual of X by X'. If one of Xo and XI is reflexive, then so are [XO,XIJo and (XO,XI)o,p, 0<0< 1, l.s;p.s; 00. Furthermore, [Xo,XIJ=[Xb,X;Jo and (X o , Xd,p =(Xb, X; )o.P" p' = p/(p - 1), with equivalent norms (Calderon, Lions and Peetre, H. Morimoto). References [1] 1. Bergh and 1. Lofstrom. Interpolation spaces, Springer, 1976. 225 A Invariant Measures [2] H. Triebe1, Interpolation theory, func- tion spaces, differential operators, North- Holland, 1978. [3] J. Peetre, New thoughts on Besov spaces, Math. Dept., Duke Univ., 1976. [4] A. Zygmund, Trigonometric series, Cam- bridge Univ. Press, second edition, 1959. [5] A. P. Calderon, Intermediate spaces and interpolation: The complex method, Studia Math., 24 (1964) 113-190. [6J 1.-L. Lions and J. Peetre, Sur une classe d'espaces d'interpo1ation, Pub!. Math. Inst. HES, 19 (1964), 5-68. [7] P. Butzer and H, Berens, Semigroups of operators and approximation, Springer, 1967. [8] E. Stein, Singular integrals and differentia- bility properties of functions, Princeton U niv. Press, 1970. [9] E. Stein and G. Weiss, Introduction to Fourier analysis on Euclidean spaces, Prince- ton U niv. Press, 1971. 225 (X.14) Invariant Measures A. Introduction The tLebesgue measure in the Euclidean space R n is invariant under Euclidean motion (translation and rotation), and the measure dJ1(),)=d},/A on the half-line of positive num- bers is invariant under multiplication. If we define usual tspherical coordinates (8, <p) on the unit sphere S2 in R 3 , the measure dJ1(w) =sin 8dOd<p (w=(O, <p)) on S2 is invariant under rotation. More generally, if we define the spherical coordinates (01) ... ,On-I' <p) on the unit sphere sn in Rn+l, the measure dJ1(w) = sin n - l 0 1 sin n - 2 8 2 ", sin 8n-1 d8 1 d0 2 ... d8 n - 1 d<p on sn is invariant under rotation, where the spherical coordinates are related to the rectan- gular coordinates (x h X 2 , ..., X n , xn+d in Rn+1 as follows: X I = sin 0 1 sin O 2 ,,, sin On-I CoS <p, X 2 = sin 8 1 sin 8 2 ", sin 8n-1 sin <p, X 3 = sin 0 1 ... sin 0n-2 cos On-I, X n - I =sin8 1 sin0 2 cos0 3 , X n == sin 8 1 cos 8 2 , X n + 1 = COS 0 1 (O.s; O,.s; n, v = 1,.,., n -1; O.s; <p.s; 2n). 
225 B Invariant Measures The notion of measures invariant under cer- tain transformations is generalized to that of invariant measures, defined in the following section. B. Definitions Let Ji be a tmeasure defined on a to"-algebra !B of subsets of a space X, and G be a transfor- mation group acting on X in such a way that sA E!B for any A E!B and SE G. For every SE G, define a measure y(s)Ji on!B by (y(s)Ji)(sA) = Ji(A). Ify(s)Ji=Ji for all sEG, then Ji is called an invariant measure with respect to G (or G- invariant measure). We consider the case where X is a tlocaUy compact Hausdorff space and G is a ttopo- logical transformation group of X. We further suppose that !B is the smallest 0"-a1gebra con- taining the family <£: of aU compact subsets of X and that Ji(K) < 00 for aU K E<£: ( 270 Measure Theory). Let Co(X) be the space of all real-valued continuous functions with compact tsupport defined on X. For example, if X is an toriented tRiemannian manifold and w is the tvo1ume element associated with the Riemannian metric on X, there exists a unique measure Ji on X such that Lf(X)dJi(X) = Lf W for every f(X)E Co(X). This measure Ji is invar- iant under the group G oftisometries of X. In the case of a nonorientable X, a G-invariant measure can also be defined from the tRie- mannian metric. In the following, we consider G-invariant measures on thomogeneous spaces X of a locally compact Hausdorff topological group (abbreviated to locally compact group) G. C. Haar Measures Most fundamental is the case in which G is 10caUy compact and X =G, with sx (resp.xs- 1 ) defined by the group multiplication law. In this case, a nonzero G-invariant measure on Gis caIled a left- (resp. right-) invariant Haar measure on G. On every locally compact group, there exists a left- (right-) invariant Haar measure, which is unique up to a posi- tive multiplicative constant (Haar's theorem). For example, Haar measures on the additive group R of real numbers and the additive group R n are the usual tLebesgue measures. A Haar measure Ji on the multiplicative group R of positive real numbers is given by r: ((x)dJi(x)== too f(x)dx/x. 850 For an n-dimensional tLie group G, a left- invariant Haar measure p. is defined by for- mula (I) with a left-invariant tdifferential form W of degree n. A Haar measure Ji on a locally compact group G is tregular in the following sense. If D is the set of all open subsets of G, then for every AE!B, we have Ji(A)=sup{}l(K)\ KE<£:, K c A} =inf{Ji(U) I U E!B nD, A c U}, For UE!BnD (U # 0), we have Ji(U) > 0, and Ji(A) < 00 for compact A. The measure p.(s) of one point S is >0 if and only if Gis tdiscrete. The total measure Ji(G) of G is finite if and only if G is compact. D. Modular Functions (1 ) Let Ji be a left-invariant Haar measure on a locally compact group G, and define the mea- sure b(s)Ji by (b (s) Ji)(A) = Ji(As) for every SE G. Since i5(s)Ji is also a left-invariant Haar measure, there exists a positive real number (s) such that i5(s)Ji=(s)Ji, by virtue of the uniqueness of the left-invariant Haar measure. The function  = G on G is called the modular function of G. For an tintegrable function on G with respect to Ji, we have Lf(XS)d}l(X)=(srl Lf(X)dJi(X), Lf(X-I)(xrl d}l(x) = Lf(X)dJi(X). If v is a right-invariant Haar measure on G, we have the formulas Lf(SX)dV(X)=(S) Lf(X)dV(X), Lf(X-l)(X)dV(X)= Lf(X)dV(X). Moreover,  -I Ji is a right-invariant Haar measure, while V is a left-invariant Haar measure. The modular function  of G is a continu- ous homomorphism of G into the multipli- cative group R of positive real numbers. If the modular function  of G is equal to the constant 1, i.e., if a left-invariant Haar measure is also right-invariant, G is said to be uni- modular. G is unimodular if G is compact, commutative, or discrete. If G is a tLie group, we have (s)=ldetAd(s)-II, where s-->Ad(s) is the tadjoint representation. In particular, G is unimodular if G is a tsemisimple Lie group, a connected tnilpotent Lie group, or a Lie group for which Ad G is compact. However, 
851 the group T(n; R) of right triangular matrices (n> 1) is not unimodular. E. Product Measures Let {G,} 'EA be a family of locally compact groups, and let Ji, be a left-invariant Haar measure on G, for every ex E A. Suppose that there exists a finite subset B of A such that G, is compact and Ji,(G,) = I for exEA - B. The product measure Ji = II'EA Ji, is then a left- invariant Haar measure on the tCartesian product G = II'EA G., which is also a locally compact group. Moreover, if , is the modular function of G" then G(x) = I1'EA ,(x,) for X = (X')'EA' F. Product Formula Let H, L be two closed subgroups of a locally compact group G, and suppose that Q == H L contains a neighborhood V of e in G. This means that Q is an open subset of G. If we put D={(s,s)lsEHnL}, then the mapping (s,t)--> st- 1 of H x L into Q induces a one-to-one continuous mapping <p of the quotient space H x L/D onto Q. Suppose that <p is a homeo- morphism. This is the case, for example, if G is tparacompact. Furthermore, if H n L is compact, we have the product formula Lf(W)dJi(W) =a f r f(hl)G(l)dn-ldJiH(h)dJidl), JHXL where Ji, JiH' JiL denote left-invariant Haar measures on G, H, L, respectively, and a> 0 is a constant independent of f G. Weil Measures If A is a measurable subset with respect to a left-invariant Haar measure Ji and Ji(A) > 0, then A I A = {s I tis, tEA} is a neighborhood of the identity element of G, and such subsets form a base for the neighborhood system of the identity. This shows that the topology of a locally compact group is determined by its Haar measure. Conversely, we shall consider the definition of a topology in an abstract group G with a measure Ji. Let Ji be a to"-finite measure defined on a to"-additive family !B in G, such that sA E!B for A E!B and s E G. Ji is called a Weil measure if it satisfies the following two conditions: (WI) Ji(sA) = Ji(A); (W2) if f(x) is !B-measurable, then f(y-l x) is !B x !B-measurable. 225H Invariant Measures If a Weil measure Ji #0 exists in a group G, then {A -1 A I Ji(A) > O} forms a base for the neighborhood system of the identity ele- ment of a topology, which makes G a locally ttotally bounded topological group. If for every SE G there exists an A E!B such that Ji(A nsA) < Ji(A) < 00, Ji(A) > 0, then G is a Hausdorff space. In this case the tcompletion G of G is a locally compact group, and for a suitable left-invariant Haar measure ji on G, we have A = An GE!B and Ji(A) = ji(A) for every A E  (the smallest O"-additive family con- taining the family of all compact subsets of G). H. Relatively Invariant Measures Let G be a transformation group acting on a set X. A measure Ji on X is said to be a rela- tively invariant measure with respect to G if for every SE G, y(s)Ji is proportional to Ji, i.e., y(s)Ji== X(Sfl 'Ji (X(S)E R). If Ji #0, X(s) is uniquely determined by s, and S--> X(s) is a continuous homomorphism from G into the multiplicative group R of positive real numbers. We call X the multi plica tor of the relative invariant measure. We now consider relatively invariant mea- sures with respect to a locally compact group G on the tquotient space G/H of G by a closed subgroup H, Let Ji, fJ be left-invariant Haar measures on G, H, respectively, and let x-->x* be the canonical mapping of G onto G/H. For any measure A on G/H, there exists a unique measure A # on G satisfying the condition LH (L f(Xh)dfJ(h)) d).(x*) = L f(x)dA # (x) for every continuous function f with com- pact support on G. For every hEH, we have i5(h)A # = H(h)A #. Conversely, for a measure v on G such that i5(h)V=H(h)v for every hEH, there exists a unique measure ..1. on G/H such that ..1. # = v. This measure ..1. is called the quo- tient measure of v by fJ and is denoted by ..1. = v/fJ. For a continuous homomorphism X of G into the multiplicative group R of positive real numbers, a necessary and sufficient condi- tion for the existence of a not identically zero, relatively invariant measure on G/H with the multiplicator X is that X(h)= H(h)/dh) for every hE H. If this condition is satisfied, the relatively invariant measure on G/H with multiplicator X is unique up to a multiplicative constant and is given by the quotient measure v'=(XJi)/fJ of XJi by fJ. In particular, for the exis- tence of a G-invariant measure on G/H, it is necessary and sufficient that the modular func- tions G and H coincide on H. Hence, if H is compact or if G and H are unimodular, there exists a G-invariant measure on G/H. 
225 I Invariant Measures I. Weyl's Integral Formula Let G be a compact connected tsemisimple Lie group and H a tCartan subgroup (maxi- mal torus) of G. Then a Haar measure Ji on G can be expressed by means of a Haar measure fJ on H and a G-invariant measure ..1. on G/H. If Ji, fJ, ..1. are all normalized to be of total mea- sure 1, we have the following formula for every continuous function f on G: L ((g)dJi(g) = f f f(ghg-l)J(h)d),(g*)dfJ(h) W H G/H (Weyl's integral formula), where w is the order of the tWeyl group of G and J(h) is given by J (exp X) = I n (e'(X)f2 - e -,(X)/2) 1 2, 'EP with P the set of all tpositive roots ex of G with respect to H and X an arbitrary element of the tLie algebra of H. For an element h of H, the element X with h=expX is not unique, but the function J is a single-valued function. A similar formula is valid on a tsymmetric Rie- mannian manifold. Weyl's integral formula can also be generalized to the case of noncom- pact semisimple Lie groups. However, it is then necessary to replace the right-hand side by a sum extended over a system of repre- sentatives of mutually nonconjugate Cartan subgroups. J. Quasi-Invariant Measure Suppose that a group G acts on a space X in such a way that sAE!B for any AE!B and sEG, where !B is a O"-additive family of subsets of X. A measure Ji defined on !B is called a quasi- invariant measure with respect to G if the mea- sures Ji and y(s)Ji (Section A) are equivalent for every SE G. Here, two measures A and Ji defined on !B are equivalent if ..1. = <PJi (this formula means },(E) = SE <p(x) dJi(x) for any E E!B) for some measurable function <p(x) which is >0 almost everywhere with respect to Ji and Ji-integrable on every A E!B such that Ji(A) < 00. We now consider quasi-invariant measures with respect to a locally compact group G on a quotient space G/ H of G by a closed sub- group H. There are always quasi-invariant measures on G/H with respect to G, and they are all mutually equivalent. They can be con- structed as follows. There exists a positive continuous function p on G such that p(gh) = L'1 H (h)L'1 G (hr l p(g) for gEG, hEH. Then the 852 quotient measure )'=(PJi)/fJ is a nonzero quasi- invariant measure on G/H with respect to G if Ji, fJ are Haar measures on G, H, respectively, and it holds that d},( sx) = p(sg)p(gr 1 dA(x) for xEgHEG/H and sEG. If G is a Lie group, we can take a function p oftclass COO. If X is an infinite-dimensional tlocally convex topo- logical vector space over R, there exists no to"-finite Borel measure on X that is quasi- invariant with respect to translations by the elements of X [6]. References [1] N. Bourbaki, Elements de math6matique, Integration, ch. 7, 8, Actualites Sci. Ind., 1306, Hermann, 1963. [2] A, Weil, L'integration dans les groupes topologiques et ses applications, Actualites Sci, Ind., Hermann, 1940. [3J P. R. Halmos, Measure theory, Van Nos- trand, 1950. [4] S. Helgason, Differential geometry and symmetric spaces, Academic Press, 1962. [5] L. H. Loomis, An introduction to abstract harmonic analysis, Van Nostrand, 1953. [6] Y. Umemura, Measures on infinite dimen- sional vector spaces, Pub!. Res. Inst. Math. Sci., (A) 1 (1965), 147. [7] A, Haar, Der Massbegriff in der Theorie der kontinuierlichen Gruppen, Ann. Math., (2) 34 (1933), 147169. [8] 1. von Neumann, The uniqueness of Haar's measure, Mat. Sb" 1 (1936), 721  734. [9] H. Cartan, Sur la mesure de Haar, C. R. Acad. Sci. Paris, 211 (1940), 759762. 226 (IV.18) Invariants and Covariants A. The General Case Let R be a tcommutative ring. We say that a group G acts on R if (i) each element 0" of G defines an tautomorphism I -->0" r of R, and (ii) O"(rf) = (O"r) f for any 0", rE G,j E R. In this case, an element r of R is said to be G-invariant (or simply invariant) if O"f = f for any 0" E G, An element f is called (G-)semi-invariant if for each 0" in G there is an invariant a(O") such that O"I =a(a)f, that is, if f is invariant up to an invariant multiplier depending on a. A semi- invariant may also be called a relative invar- iant, and an invariant may be called an ab- solute invariant. The correspondence O"-->a(O") mod(O: f) «(O:j)= {XE R I xf ==O}) is a trepre- 
853 sentation of degree 1 of G, and a(O") is called the multiplier of the semi-invariant f, or the character defined by f. B. Invariants of Matrix Groups Let K be a commutative ring with a unity element. Then we can consider a matrix group (or matric group) Gover K, i.e., a subgroup of the group of n x n invertible matrices over K; this latter group is called the general linear group of degree n over K and is denoted by GL(n, K) ( 60 Classical Groups). Assume that R is a commutative ring generated by x I' , " , X n over K and an action of the group G on R is defined such that t( O"X I' ... , O"x n ) == o"'(XI' ..., x n ) (' means the ttranspose of a matrix). In this case, we say that G acts on R as a matrix group. If K is a field, then the smallest talgebraic group G (in GL(n, K)) containing G acts on R as a matrix group, and an element [of R is G-invariant if and only if [is G-invariant, and similarly for semi-in- variants. These results can be generalized to the case where K is not a field. A thomomorphism P of a matrix group G (c: GL(n, K)) into GL(m, K) is called a rational representation of G if there exist rational func- tions <Pkl (1 .s; k, l.s; m) in n 2 variables xij (1 .s; i, j.s; n) with coefficients in K such that P«O"ij)) = (<Pkl(aij)) for all (O"ij)E G. Assume that P is a rational representation of a matrix group G and p(G) acts on a ring R as a matrix group. Then we have an action of G on R defined by af =(pa)[ (O"E G,j E R), called the rational action defined by p. If the following condi- tion is satisfied, then the action is called semi- reductive (or geometrically reductive): If N = fl K + ... + f,.K is a G- tadmissible module (fI,. ., [,.ER) and if[o modN (foER) is G- invariant, then there is a thomogeneous form h in /o,...,f,. of positive degree with coefficients in K such that h is tmonic in fo and is G- invariant. This action is called reductive (or linearly reductive) if h can always be chosen to be a linear form. (1) Rational actions of a matrix group G in each of the following three cases are all reduc- tive. (i) K is either the real number field or the complex number field, and G is a dense subset of a tLie group (c: GL(n, K)) which is either tsemisimple or tcompact. (ii) K is a field of tcharacteristic 0, and letting G be the smallest algebraic group containing G, the tradical of G is a ttorus group ( 13 Algebraic Groups). (iii) K is a field of characteristic p #0, and G (as in (ii)) contains a torus group T of finite tindex that is relatively prime to p. (2) Any action of a finite group is a semire- 226C Invariants and Covariants ductive action. If we omit the condition that the characteristic of K is 0 in (ii), then rational actions of G are semi reductive. This is known as Mumford's conjecture and was proved by W. J. Haboush. (3) Assume that <P is a G-tadmissible homo- morphism of the ring R onto another ring R'. We denote the sets of G-invariants in Rand R' by IG(R) and IG(R'), respectively. If the actions of G are reductive, then (i) <p(IG(R))=IG(R'); (ii) hiEIG(R) implies (LihiR)nldR)=LihJG(R); and (iii) if K is tNoetherian, then IG(R) is finitely generated over K. If rational actions of G are semireductive, then (i) for each element a of IG(R'), there is a natural number t such that a'E<p(IG(R)), and hence Ic;(R') is tintegral over <p(Ic;(R)); (ii) if hiEIG(R) and j E(LihiR)n Ic;(R), then a suit- able power f' of f is in LihJdR); and (iii) if K is a tpseudogeometric ring (in particular, if K is a field), then IG(R) is fillltely generated over K [3]. When IG(R) is generated by f" ... ,is over K, then [I, ..., is are called basic invariants. C. Polynomial Rings Let PI' '" ,Pu be matrix representations of a group G over a commutative ring K of re- spective degrees n l , ..., nu' Let X}i) (l.s; i.s; u, 1 .s;j.s; n;) be L n i talgebraically independent elements over K. Then we define an action of G on the tpolynomial ring K[X\I), ..., xJ by '( ax), .. . , O"xl) = pM) '(x), ... , xl). In this case, a (relative) invariant is the sum of (relative) invariants that are homogeneous in each (x), ..., xm. (Because of this fact, in some of the literature, a (relative) invariant means a (rela- tive) invariant that is homogeneous in each of (x), ..., xl).) On the existence of basic invar- iants, the following theorem is known (be- sides the one on (semi)reductive actions): As- sume that K is a field of characteristic 0, G is dense under the tZariski topology in an algebraic linear group G such that the tuni- potent part (G)u of the radical of G is at most I-dimensional (these conditions hold if G is a I-dimensional Lie group), and that all the Pi are rational representations; then basic invariants exist (R. Weitzenb6ck). Furthermore, if G is a matrix group and each Pi is either the tidentity map or the con- tragredient map A -->'A -I, then the invariants are called vector invariants. If K is a field of characteristic zero, the basic invariants and a basis for the ideal of algebraic relations of the basic invariants are explicitly given in several cases [1,2]. 
226D Invariants and Covariants D. Classical Terminology The classical theory of invariants considers the following objects. Let K be a field of char- acteristic zero (e.g., the real number field or the complex number field), and let G be G L(n, K). Consider a homogeneous form F of degree d in n variables I' ..., n with coeffi- cients in K:F=LCi1 inmi1. in (Li,=d, mil in =(d!/II(i,!))\1 ... n). For each O"EG, we define O"i by (O"I' ""O"n)=(I' ""n)O"-1 and then (O"C)i 1 . in by F=L(O"C)i 1 . i n (O"m i1 i) Then the transformation [O"Jd:(CdO 0' ...,C i1 in' ...,C O Od) -->«O"C)dO 0,...,(aC)i 1 in'''''(O"C)O Od) is a tlinear transformation of a d+n-l C n - I - dimensional taffine space. Let us denote the matrix of the linear transformation by the same symbol [0" Jd: '( .. . , ( O"C)i 1 . in' .. . ) = [0"] d '( .. . , C i, . in' .. . ). Then <Pd: 0"......[0" Jd is a rational representation ofG. Now fix an F such that the coefficients C i1 in are independent variables, and consider the action of G on the pol ynomial ring R = K[ ""C i , in' ...J defined by the rational representation <Pd' If 9 is a relative invariant, then O"Y = a(O")g with a rational representation a-->a(O"). Hence there is an integer w such that a(a) =(det O")w. Then 9 is called an invariant of weight w (note that 9 is an SL(n, K )-invariant); 9 is an absolute invariant if and only if w = O. The group G acts naturally on the ring R[l' ..., nJ also. Then relative invariants in this case are called covariants. The weight of a covariant and the absolute covariant are de- fined as in the case of invariants. When we want to refer to nand d, we call the invariants (covariants) invariants (covariants) of n-ary forms of degree d: binary for n = 2; ternary for n = 3; linear forms for d = 1; quadratic forms for d = 2, etc. For example, (1) if n = 2, d = 2, then the tdis- criminant D=C02C20-cfl is an invariant of weight 2. (2) Assume that d=2 (n arbitrary), and let uij be the coefficient of ij in F, or, more pre- cisely, uij = u ji = C'1 an' where (i) if i = j, then ex i = 2 and the other ex k are zero and (ii) if i # j, then ex i = ex j = 1 and the other ex k are zero. In this case, D = det(uij) is an invariant of weight 2. (3) If n = 2 and d = 4, then g2 = C 4 0C0 4 - 4C 13 C 31 +3d2, g3==C 04 C22 C 40- C 04c51- c40d3 + 2c 13C22C31 -d2 are invariants of respective weights 4, 6. The discriminant is expressed as 23(g - 27g) and is an invariant of weigh t 12. 854 (4) For d = 1, if we denote the coefficient of i in F by C i , then LiCii is an absolute covariant and is also a vector invariant. (5) For arbitrary nand d, det(a2F/aiaj) is a convariant of weight 2 and is termed the Hessian. Instead of one form F, we may take a finite number of homogeneous forms FI' ... , Fr of degree d I, '., , d r such that the coefficients c:. 'n are algebraically independent. Then we consider an action of G L(n, K) on the poly- nomial ring K[...,c) , , ...J [I' ..., nJ given by <P'i:O"-->[O"Jdi on the ;oefficients of F i and by O"-->ta-I on j' as in the case of one form. Invariants in this case are called covariants, and covariants containing no i are called invariants. Weights, absolute covariants, and absolute invariants are defined similarly. The forms F I , ., , , Fr are called ground forms, and the covariant is called a covariant with ground forms FI' .. . , Fr' (6) If r= n, then the tJacobian det(aF;/a) is a covariant of weight 1. E. Multiple Covariants Consider the situation described in Section C for the case of polynomials, and assume that K is a field of characteristic zero, G = G L(n, K), and each Pi is either a <Pd (d arbitrary) or the contragredient map K. Then these invariants are called multiple covariants, and weights and absolute multiple covariants are defined as before. Let s be the number of Pi equal to K. Then the invariants and covariants of the preceding section correspond to the cases s=O and s= 1, respectively. Now assume that Pi == K if and only iU = 1, ... , s. Gram's theorem states: for each ex = 1, '" , s, let H, be a poly- nomial in xY) (i>s) homogeneous in xV), ..., x.? for each i, and assume that the set V of com- mon zeros of HI, ... , H, (in the affine space of dimension Li >, n,) is G-stable . Then there exists a finite number of absolute multiple co- variants C I' ... , c, such that V is the set of (.. ,a)'),...) (i> s) satisfying the condition that (... ,a), ''', a)'), ... ,) is a zero point of C I' ... ,C, for any a) with ex.s; s. Since every rational action of G L(n, K) is reductive, the set I of absolute multiple covar- iants (in a fixed polynomial ring over K) is a finitely generated ring over K. Furthermore, the set of multiple co variants of a given weight is a finitely generated I-module. If we omit the assumption that K is of char- acteristic zero, it is difficult to define <Pd; this difficulty can be avoided by considering transformations of coefficients a'l in of F = Lai1 in\'''' n. Although we can give similar definitions in that case, the theory does not 
855 proceed similarly because, for example, ra- tional actions of GL(n, K) are not reductive. F. Invariants of Lie Groups Let G be a Lie group (hence K must be either the real number field or the complex number field) and p be a tdifferentiab1e representation ofG such that p(G)cGL(n,K), and assume that an action of G on K [x 1, ... , xnJ is defined by p. Then, by means of an infinitesimal trans- formation Xa corresponding to G, an invariant (or semi-invariant) is characterized as an ele- ment r satisfying the condition Xa! =0 (VX a ) (or Xa/=exaf, exaEK (VXa)). For instance, ifG =GL(2,K), then fis an invariant if Xaf =0 for only two infinitesimal transformations that correspond to C ) and G ). respec- tively. Similarly, for each Lie group G, there exists a finite number of Xa such that Xaf =0 for these Xa characterizes! as an invariant G. History In connection with geometry, the theory of invariants, especialIy that of binary forms, was first studied by A. Cayley (J. Reine Angew. Math., 30 (1846)). The theory was further developed by J. J. Sylvester, R. F. A. Clebsch, P. Gordan, and others. Since the theory was originated with applications to projective spaces in mind, homogeneous semi-invariants were important; this is why semi-invariants were called invariants in the classical theory. On the other hand, in the theory of binary quadratic forms, invariants of discontinuous groups were studied from the viewpoint of the theory of numbers. It was D. Hilbert who introduced clearly the notion of invariants for general groups. He proved the existence of basic invariants in the classical case, making use of the tHilbert basis theorem. Hilbert's 14th problem ( 196 Hilbert) is related to this result, but its answer is negative; that is, even in the case of a polynomial ring over the real number field or the complex number field, there are groups acting on the ring without basic invariants (M. Nagata). Though the theory of invariants has not been studied effectively for a long time, it is again under active study because of its importance in alge- braic geometry. References [IJ R, Weitzenbock, Invariantentheorie, Noordhoff, 1923. 227 Inventory Control [2J I. Schur and H. Grunsky, V orlesungen iiber Invariantentheorie, Springer, 1968. [3J H. Weyl, Classical groups, Princeton Univ. Press, revised edition, 1946. [4J M. Nagata and T. Miyata, Remarks on matric groups, 1. Math. Kyoto Univ., 4 (1965), 381-384. [5J D. Mumford, Geometric invariant theory, Springer, 1965. [6J 1. Fogarty, Invariant theory, Benjamin, 1969. [7J J. A. Dieudonne and J. B. Carrell, Invar- iant theory, old and new, Advances in Math., 4 (1970), 1 -80. [8J D. Hilbert, Gesamme1te Abhand1ungen, Band II, Springer, 1933, 1970. 227 (XIX.14) Inventory Control Although inventory control is discussed main- ly in connection with production control and operations research, inventory models have applications in many other fields. For exam- ple, an inventory (or queuing) model can be used to plan the optimal operation of a dam, and may thereby influence the engineering design of the dam. A mathematical theory of inventory control must be based upon an inventory structure in which the following items are considered: (i) costs and revenue, (ii) demand, and (iii) deliv- eries. Item (i) involves (1) order and produc- tion costs, (2) storage costs, (3) discount rates, (4) penalties, (5) revenues (under the assump- tion that price and demand are not controlled by the firm), (6) costs of changing the produc- tion rate, and so on. For item (ii), there are several different situations due to the com- bination of predictability and stability condi- tions of demand for the commodity. Current theories are concerned with two particular situations. In the first, the tprobability distri- bution of demand is known to the firm. In the second, the date of occurrence of orders is a trandom variable with a known probability distribution, while the amount of demand is a known constant. A tminimax principle is applied when some of these probability distri- bution functions are unknown to the firm. Problems in (iii) are due to delays (lead time periods) after an order for inventory is placed or a decision is made to produce for an un- certain demand. The folIowing inventory system has been studied mathematically in much of the litera- ture. Let X n be the initial storage level for the nth period, and n and Zn be the demand and 
227 Ref. Inventory Control the amount of order, respectively, for the nth period. Then Yn = X n + Zn is the storage level after the arrival of ordered items, and Xn+l = max(O, Yn - n) is the initial storage at the (n + 1 )th period. Furthermore, in this period, the stock shortage '1n=n- Yn is incurred if Yn<n' When the demand n is known, the inventory model is a simple deterministic one. When n is a random variable, let its density function be <p(). For constructing the model, the following three cost functions are introduced: The order- ing cost c(z) of order z, the holding cost h(y) of storage Y for one period, and the penalty cost p('1) for a shortage '1. Then the total cost for the period of storage level Y after reorder is I(y)= J: h(y-)<p()d+ fA p(-y)<p()d. Let x be the initial storage for the first period, and let fn(x) represent the minimal total cost of an n-period inventory model; then fn(x) satisfies the following fundamental func- tional equation with a fixed discount rate ex (0<ex<1): fn(x) = min [ min { C(y-X) +I(y) y>x +ex too f,,-dY-)<P()d}, I(x)+ex too f,,-dX-)<P()dJ In this equation, the first term of the right- hand side gives the amount of order at the first period as Yn(x)-x, where Yn(x) is a minimizing value of y, and the second term gives the cost if no order is put in. Policies of the (s, S) type are defined as follows: Order S - x if x < s, and do not order if x;:, s. For the special case when the ordering function is given by c(z) = C + cz (z>O); =0 (z=O), some sufficient conditions for the optimal policy to be of (s, S) type have been studied. For various inventory models where the demand and lead time are random variables, the optimal policy can be derived by using queuing theory. We consider an inventory model with maximum inventory M. The queu- ing situation corresponding to this model is an M-channel queuing system. "Empty shelf space due to demand" corresponds to the arrival of a customer, the ordering instance corresponds to the commencement of service, and the arrival of an ordered item corresponds to the completion of service. Thus the station- ary probability of storage level n, Pn' can be obtained by solving the equilibrium equation of the queuing system; then mean inventory, mean sales, and mean ordering number can be derived. This above ordering system is called 856 "reorder for each item sold"; more general systems have been treated in a similar manner. Another example of the queuing system approach may be illustrated by way of the planning problem for dams mentioned at the beginning of this article: The probability distri- bution of the amount of water stored in the dam is derived by analyzing the functional equation of a queuing system. Let the capacity of the dam be M, the storage of the dam at the nth period be x., and the outflow after inflow Zn be n+l; then the storage at each period is given by Xl =x, O<x<M, x n +1 =[M -n+1 +[xn+zn-MrJ +, where a+ =max(O,a) and a- =min(O, a). The equations for queuing models will be the same as those given above for the inven- tory model if we let X n be the waiting time of the nth customer, Zn the service time, n+l the interarrival time of customers n and the n + 1, and M the service time. References [IJ K. 1. Arrow, S. Karlin, and H. E. Scarf, Studies in the mathematical theory of inven- tory and production, Stanford Univ. Press, 1958. [2] H, E. Scarf, D. M. Gilford. and M. W. Shelly (eds.), Multistage inventory models and techniques, Stanford Univ. Press, 1963. [3J T. M. Whitin, The theory of inventory management, Princeton Univ. Press, 1953. [4J G. Hadley and T. M. Whitin, Analysis of inventory systems, Prentice-Hall, 1963. [5J P. M. Morse, Queues, inventories and maintenance, Wiley, 1958. [6J 1. W. Cohen, Single server queue with uniformly bounded virtual waiting time, 1. Appl. Probability, 5 (1968), 93-122. 228 (X.34) Isoperimetric Problems A. The Classical Isoperimetric Problem Two curves are called isoperimetric if their perimeters are equal. The term curve is used here to mean a tJordan curve. The classical isoperimetric problem is to find, among all curves J with a given perimeter L, the curve enclosing the maximum area. This problem is also called the special isoperimetric problem or Dido's problem. Its solution is a circle. The analogous problem in 3-dimensional space has 
857 a sphere as its solution; that is, among all closed surfaces with a given surface area, the sphere encloses the maximum volume. The following variational problem can be regarded as a generalization of the classical isoperi- metric problem: To find the curve C:y=.f(x) that gives the maximum value of the func- tional Ic F(x, y, y')dx under the subsidiary condition Ic G(x, y, y')dx = constant. This is sometimes called the generalized isoperimetric problem. The classical isoperimetric problem can be solved by variational methods. It can also be solved by using inequalities involving quantities associated with the figure in ques- tion. For example, the inequality U-4nF0 between the area F and the perimeter L of a Jordan curve J solves the problem, since the equality holds only for a circle. For refinements of (1) there are further inequalities due to T. Bonnesen (1921): U -4nF (L - 2nr)2, L 2 -4nF(2nR- L)2, L 2 -4nFn2(R-r)2, where r is the radius of the largest circle in- scribed in the curve J and R the radius of the smallest circumscribed circle. These in- equalities can also be used to solve the iso- perimetric problem. Moreover, we have the following inequality for curves on the sphere of radius a: R-r L 2 -4nF + F 2 a- 2  8na 2 sin 4a(l + 2n) (F. Bernstein, 1905). For curves on the surface of negative constant curvature -1/a 2 , we have L 2 -4nF - F 2 a- 2 a2(4n + Fa- 2 ) ( tanh _ tanh  ) 2 4 2a 2a (L. A. Santal6 [2J), From these inequalities, we see that the circle remains the solution to the isoperimetric problem in each of the non- Euclidean planes. The corresponding problem in the 3-dimen- sional case is more difficult. Without going into detail, we cite the following example: For an tovaloid with surface area S and volume V, S3-36nV20, where the equality holds only for the sphere. B. Isoperimetric Inequalities on Eigenvalues In recent years the concept of isoperimetric inequality has been extended to include all 228 B Isoperimetric Problems inequalities connecting two or more geometric or physical quantities depending on the shape and size of a figure. For example, it includes inequalities on teigenvalues of partial dif- ferential equations under given boundary conditions. Lord Rayleigh conjectured in 1877 that, for the equation L1u+Au =0 for a vibrating mem- brane on a region D of fixed area F (with U = 0 on the boundary), the first eigenvalue ..1.1 is least when D is a circle. This conjecture is true. In fact, in 1923 G. Faber and E. Krahn proved independently that ..1.1 (n/F)i (2) (1 ) and that the equality in (2) holds if and only if the domain D is a circle, where j = 2.4048 ... is the first positive zero of the tBessel function Jo(x). For the second eigenvalue ).2 of the same problem, the circle does not give the minimum value. I. Hong (1954) gave the inequality A2(2n/F)l and showed that ..1. 2 approaches its greatest lower bound, (2n/F)l, as the shape of the domain approaches a figure consisting of two equal mutually tangential circles, each having area F /2 [4]. Many other results were found with regard to isoperimetric inequalities on eigenvalues of partial differential equations, for example, results related to eigenvalues for a membrane under other boundary conditions, such as au/an =0, and for other types of equations, such as L1L1u - AU = O. A method devised by 1. Steiner and called symmetrization is a powerful means of dis- covering isoperimetric inequalities. Steiner's symmetrization with respect to the line l changes the plane domain P into another plane domain Q characterized as follows: Q is symmetric with respect to l, any straight line g perpendicular to l that intersects one of the domains P or Q also intersects the other, both intersections have the same length, and the intersection with Q is a segment bisected by l. This operation can be extended to a space of higher dimension by replacing l with a hyperplane. Steiner's symmetrization pre- serves area (or volume) and diminishes perim- eter (or surface area). Steiner first used these properties to solve the classical isoperimetric problem in 1838. In 1945, G. P61ya and G. Szeg6 found that the electrostatic capacity of a solid is diminished by Steiner's sym- metrization. The concepts developed in their papers made possible a systematic treat- ment of many isoperimetric inequalities and estimations of mathematical and physical quantities. 
228 Ref. Isoperimetric Problems 858 References For the classical isoperimetric problem, [IJ W. Blaschke, Kreis und Kugel, Veit, 1916 (Chelsea, 1949). [2J L. A. Santal6, La desigualdad isoperi- metrica sobre superificies de curvatura con- stante negative, Rev. Univ. Tucuman, (A) 3 (1942),243-259. For isoperimetric inequalities on eigenvalues, [3] G. P61ya and G. Szego, Isoperimetric inequalities in mathematical physics, Prince- ton Univ. Press, 1951. [4J I. Hong, On an inequality concerning the eigenvalue problem of a membrane, Kodai Math. Sem. Rep. (1954),113-114. 
229 Ref. Jacobi, Carl Gustav Jacob 229 (XXI.29) Jacobi, Carl Gustav Jacob Carl Gustav Jacob Jacobi (December 10, 1804-February 18, 1851) was born into a wealthy banking family in Potsdam, Germany. He was well educated at home and was highly cultured in many areas. He entered the Uni- versity of Berlin, studied mathematics largely on his own through Euler's texts, and obtained the doctorate in 1825. The following year he became a private lecturer at the University of Konigsberg, and in 1831 a professor. For the next seventeen years he worked vigorously in Konigsberg, where his influence was consider- able. Toward the end of his life, his health failed; moreover, he lost his property and met with general misfortune because of the polit- ical situation of the time. He made no further contributions after 1843; he died of smallpox at 47. Because he had an intense personality, there were times when he invited the animosity of people; however, he did have early contact with tAbel, and in his later years he enjoyed the friendship oftDirich1et. Jacobi's mathe- matical works lacked formal completeness but were very original and contributed to many fields. The tHamilton-Jacobi equation in dynamics and the t1acobian determinant of a differentiable mapping are well-known prod- ucts of his ideas, but even more noteworthy are his contributions to the theory of elliptic functions and algebraic functions, particularly the introduction of theta functions and the treatment of the inverse problem of hyper- elliptic integrals [2]. References [IJ C. W. Borchardt and K. Weierstrass (eds.), K. G. 1. Jacobi's gesammelte Werke I-VII, Reimer, 1881-1891 (Chelsea, second edition, 1969). [2J G. J. Jacobi, Fundamenta nova theoriae functionum ellipticarum, Borntrager, 1829. [3] F. Klein, V orlesungen iiber die Entwick- lung der Mathematik im 19. Jahrhundert I, II, Springer, 1926, 1927 (Chelsea, 1956). 230 (XXI. 7) Japanese Mathematics (Wasan) Before the introduction of Western mathe- matics, indigenous Japanese mathematics de- 860 veloped along its own characteristic lines. In Japanese, this form of mathematics is called wasan. The first development took place in the 8th century A.D. during the Nara era under the influence of the Tang dynasty in China. After a period of decline came another period of de- velopment from the 13th to the 17th century. During this second wave, Chinese mathemat- ical books such as Suanhsueh chimeng and Suanfa tangtsung were imported, along with the abacus, or soroban as it is called in Japa- nese, and calculating rods, sangi in Japanese, with which Chinese mathematicians performed algebraic operations. Japanese mathematicians absorbed these methods and invented and developed their own written algebra, called endall-jutsu or tenzan. The fundamental concepts of wasan are attributed to Seki Takakazu, or Seki Kowa (Seki is the surname; likewise for other Japa- nese names in this article), Takebe Katahiro, and Kurushima Yoshihiro. Its main develop- ments stem from Ajima Naonobu and Wada Yasushi (or Wad a Nei), among others. Wasan scholars obtained interesting and significant results, but they pursued mathematics as an art in the Japanese manner rather than as a science in the Western sense, Wasan had no philosophical background as did the Greek tradition, nor did it have an intimate relation with the natural sciences. Thus it lacked the character of systematized science and dissolved after the introduction of Western mathematics into the school system by the Meiji govern- ment (1867-1912). Among wasan works of the earlier period, J inkoki by Yoshida Mitsuyoshi (1598-1672), the first edition of which appeared in 1627, contributed much to popularize the soroban and to arouse general interest in mathematics. Seki Takakazu (1642?-1708) was born in Fujioka in the Gunma prefecture, Some his- torians say he learned mathematics from Takahara Y oshitane, while others say he was completely self-taught. His achievements in- clude the following: (1) the invention of endan- jutsu, or written algebra; (2) the discovery of determinants; (3) the solution of numerical equations by a method similar to tHorner's; (4) the invention of an iteration method to solve equations, similar to Newton's; (5) the intro- duction of derivatives and of discriminants of polynomials; (6) the discovery of conditions for the existence of positive and of negative roots of polynomials; (7) a method of finding max- ima and minima; (8) the transformation theory of algebraic equations; (9) continued fractions; (10) the solution of some Diophantine equa- tions; (11) the introduction of tBernoulli num- bers; (12) the study of regular polygons; (13) the calculation of n and the volume of the 
861 sphere; (14) tNewton's interpolation formula; (15) some properties of ellipses; (16) the study of the tspirals of Archimedes; (17) the discov- ery of the Pappus-Guldin theorem; (18) the study of magic squares; and (19) the theoretical study of some questions arising out of math- ematical recreations (called mamako-date, metsuke-ji, etc.). Takebe Takahiro (1664-1739) was a disciple of Seki. He is the author of the book Enri tetsujutsu. (Enri, or circle theory, was one of the favorite subjects of wasan scholars.) It contains the formula ( ! . ) 2 _ x 2 2 2 . x4 2 2 .4 2 . x6 arcsmx - + + +.... 2 2 4 6 He also obtained other formulas of trigono- metry and some approximation formulas, by means of which he compiled trigonometric tables to 11 decimal places. In collaboration with Seki he wrote the 20 volumes of Taisei sankei and FukyCt tetsujutsu, the latter elucidat- ing the methodology of the Seki school. It contains a value of n to 42 decimal places. Kurushima Yoshihiro (?-1757) was an original scholar influenced by Nakane Genkei (1662-1733), a disciple of Takebe. He gen- eralized an approximation formula for sines obtained by Takebe, treated problems of max- ima and minima involving trigonometric func- tions, improved the theory of determinants and the theory of equations, obtained a for- mula for Sp = JP +... + n P without using Ber- noulli numbers, and found a relation among a, b, c, n by eliminating x from x+(x+c)+... +(x+(n-1)c)=a, xk+(x+ct+... +(x +(n -1)c)k =b. Furthermore, he studied tEuler's function <p(n) before Euler and obtained the tLaplace expan- sion theorem for determinants before Laplace. He is said to have contributed to Hoen san- kei, an important work on enri written by Matsunaga Yoshisuke (c. 1694-1744) in which a value of n is given up to the 50th decimal place. The Seki school, continued under Takebe, Nakane, Kurushima, and Matsunaga, became a center of wasan. Scholars of this school lived mainly in Edo (the ancient name for Tokyo), Yamaji Nushizumi (or Shuju) (1704-1772) studied wasan with Nakane, Kurushima, and Matsunaga. Arima Y oriyuki (1712-83), one of his disciples, for the first time made public the teachings of this school in the book ShCtki sampo. The practice of dedi- cating to Shinto shrines or Buddhist temples tablets engraved with solved mathematical problems became popular during this period. Ajima Naonobu (1739-98) was a disciple of Yamaji. He improved enri, simplified its 230 Japanese Mathematics (Wasan) theory, and amplified its applications. He treated problems of finding volumes involving double integrals, discovered the binomial theorem for exponent l/n, compiled a table of logarithms to 14 decimal places, and treated Diophantine problems. No trace of demon- strative geometry is found in wasan, but Ajima and his school did treat geometric problems, such as tMalfatti's problem, dealing with several circles tangent to each other. Wada Yasushi (or Wada Nei, 1787-1840) studied wasan with Kusaka Makoto (1764- 1839), a disciple of Ajima. He made tables containing more than 100 definite integrals, including, for example, II x P (I-x)Qdx, II x P (I-x 2 )Qdx. However, there is no evidence that wasan scholars, even in this period, knew of the fun- damental theorem of infinitesimal calculus, Apart from the Seki school, there were Tanaka Y oshizane (1651-1719), a contempor- ary of Seki, and his disciple Iseki Tomotoki (c. 1690). Tanaka is said to have ranked with Seki in his work on determinants and magic squares, but most of his writings have been lost. Iseki wrote a text on determinants called Sanpo hakki (1690), the first of its kind in the history of mathematics. A little later, Takuma- ryu (the Takuma school) was formed in Osaka. Ino Tadataka (1745-1821), famous for making the first precise map of Japan, had studied wasan with Takahashi Yoshitoki (1764-1804), who belonged to this school. Aida Yasuaki (1747-1817), a contemporary of Ajima, founded Saijo-ryCt, or the "superlative school," and rivaled Fujita Sadasuke (1734-1807) of the Seki school. Toward the end of the Tokugawa era, the study of geometric problems became popular among wasan scholars, such as Hasegawa Kan (1782-1383), Uchida Gokan (1805-82), and his disciple Hodoji Zen (1820-68), who used the method of tin version. Hasegawa wrote Sanpo sinsho, a popular work containing an explanation of the methods of enri, The influence of Western mathematics is hardly recognizable outside astronomy, cal- endar making, and the compilation of loga- rithmic tables. However, some wasan scholars took a more positive attitude and began study- ing Western mathematics toward the close of the Tokugawa period, thus helping to lay the foundations for the development of mathe- matics in Japan in the new era. Following the Meiji restoration, Kikuchi Dairoku (1855-1917), Hayashi Tsuruichi (1873-1935), Fujiwara Matsusaburo (1861- 1933), and recent scholars contributed much to preserve wasan literature and to clarify its 
230 Ref. Japanese Mathematics (Wasan) content, but their undertaking has not yet been completed. References [IJ M. Fujiwara, History of mathematics in Japan before the Meiji era (in Japanese), I-V, under the auspices of the Japan Academy, Iwanami,1954-1960. [2J Y. Mikami, The development of mathe- matics in China and Japan, Teubner, 1913 (Chelsea, 1961). [3] A. Hirayama, K. Shimodaira, and H. Hirose (eds.), Takakazu Seki's collected works, English translation by J. Sudo, Osaka Kyoiku Tosho, 1974. 231 (111.22) Jordan Algebras A. Definitions Let A be a tlinear space over a field K. If there is given a tbilinear mapping (multiplication) A x A --> A, the space A is called a distributive algebra (or nonassociative algebra) over K. In particular, if this multiplication satisfies the associative law, A is called an associative algebra over K ( 29 Associative Algebras). We assume that A is a distributive algebra over K of finite tdimension over K. Denote by E(A) the associative algebra of all K- tendomorphisms of the K-linear space A. With every aEA we associate RaEE(A), LaEE(A) by Ra(x)=xa, La(x)=ax, where x belongs to A. The tsubalgebra of E(A) generated by the Ra, La (aEA) and the tidentity mapping of A is called the enveloping algebra of A. The left, right, and two-sided tideals of A are defined as in the case of associative algebras. An element c of A is said to be in the center of A if (i) ac = ca and (ii) a(bc)=(ab)c, a(cb)=(ac)b, and c(ab)=(ca)b for every a, bin A. We denote the product of two elements a, b in A by a' b in order to distinguish it from multiplication in the case of associative algebras. Denote a' a by a 2 and put A 2 = {a l . a 2 la l , a z EA}. Define A(n) successively by A(O)=A, A(I)=A 2, ...,A(k+I)= (A(k»)'2. Then A is called a solvable algebra if A (n) = 0 for some n and a nilalgebra if every element of A is tnilpotent. A distributive alge- bra A is called a Jordan algebra if the follow- ing two conditions are satisfied for every a, u in A: (i) a' u = u. a and (ii) a 2. (u' a) = (a'Z' u). a, We omit discussion of noncom- mutative Jordan algebras in this article. A distributive algebra A is called an alternative 862 algebra if the following two conditions are satisfied for every a, u in A: (i) a' u 2 = (a. u). u and (ii) u 2. a = u . (u . a). An alternative algebra A is called an alternative field if La(A)= A = Ra(A) for every a in A such that a#O. Gen- eralizing these algebras we obtain the notion of a power associative algebra. A distributive algebra A is a power associative algebra if every element of A generates an associative subalgebra. Jordan algebras and alternative algebras are power associative. We now consider an algebra A over a field K. We assume that the tcharacteristic of K is not 2 and that A is of finite dimension over K. If A and B are associative algebras, a linear mapping 0": A --> B is called a Jordan homomor- phism if(i) (a 2 )a = (aa)2 for every a in A and (ii) (aba)a = aa b a aa for every a, b in A (where we denote by aa the image of aEA by the map- ping 0"), If B does not contain tzero divisors, then (ab)a = aa b a or (abr = b a aa. Let A be an associative algebra. Define a new multiplication' in A by a' b = (ab + ba)/2. We then have a Jordan algebra A +. A sub- algebra of the Jordan algebra A + is called a special Jordan algebra. Let K [x I' ... ,xnJ be the noncommutative free ring in the indetermi- nates Xl' ..., X n (that is, K [Xl"'" xnJ is the associative algebra over K that has as its K- bases the free semigroup with identity element lover the free generators Xl' ..., x n ). The sub- algebra K [x I' . , , , xnJ + generated by 1 and the Xi is called the free special Jordan algebra of n generators and is denoted by J). A Jor- dan algebra A is special if and only if there is an isomorphism of A into B+, where B is some associative algebra. A Jordan algebra that is not special is called exceptional. All homomorphic images of Jb 2 ) are special. De- note by [j£ the ideal of Jb 3 ) generated by X 2_ y'2 (note that Jb 3 )  K[x, y, zJ). Then Jb 3 ) /!Jll is exceptional. A Jordan algebra is special if it contains the unity element and is generated by two elements. If A is an alternative algebra, the associated Jordan algebra A + is special. Condition (ii) for a Jordan algebra A is equivalent to [Ra, Ra 2] =0. In A, we have [Ra, Rb.J + [Rb' Re.aJ + [Re> Ra bJ = 0; and [[Re> RaJ, RbJ =R[a,b,C]' Here we put [S, TJ = ST-TS, [a,b,cJ=(a'b)'c-a-(b'c). Such an equation in A is called an identity in a Jordan algebra. B. Structure of Jordan Algebras A Jordan algebra A has a unique largest solv- able ideal N, which contains all nilpotent ideals of A and is called the radical of A. If N =0, A is called semisimple. The quotient A/N is always semisimple. A semisimple Jordan 
863 algebra A contains the unity element and can be decomposed into a direct sum A = Al EB ... EB Ar of minimal ideals Ai' Each Ai is a tsimple algebra. In particular, if K is of characteristic 0, there is a semisimp1e subalge- bra S of A such that A =S EB N. Let e be an idempotent element of A, and let ).EK. Put Ae(A)={xlxEA,e'x==Ax}. Then we have A = Ae(1) EB Ae( 1/2) EB Ae(O). This decomposition of A is caIled the Peirce decomposition of A relative to e. Suppose that the unity element 1 is expressed as a sum of the mutua1\y orthogonal idempotents e i . Then, putting Ai i = A e (I), Ai }= Ae.(1/2) n Ae 0/2), we have A'= 'L.i} EB Ai.'}' The/Ai,} are c1\ed Peirce spaces. Furthermore, suppose that for every i, Ai, i is of the form Ai, i = K ' e i + N i , with N i a nilpotent ideal of Au, In this case, A is ca1\ed a reduced algebra and the number of the e i is ca1\ed the degree of A. Let D be an alternative algebra with the unity element 1 and with an tinvo1ution -. Furthermore, suppose that there is a tquadra- tic form Q(X) such that x. x = x' x = Q(x)' 1 (xED) and thatf(X, y)=(Q(X + y)-Q(X)- Q(Y))/2 is a tnondegenerate bilinear form. Then D is ca1\ed a composition algebra. Reduced simple Jordan algebras A are classified into three types: (I)A=K'1. (2) There exist idempotents e" e 1 such that A = K. 1 EB K' (e 1 -e 1 ) EB Al,l' where Au # 0; and furthermore, the multiplication of A is determined as fo1\ows: Let x = ex( e I - e 2 )+a 1 ,2' y={3(el-ez)+b1.2 be in A with ex, {3EK, a"l, b u EAt,2 Then x. y=(ex{3+ f(al,l' b u ))' 1, where f is a nondegenerate symmetric bilinear form. (3) Let D be a composition algebra with the involution -, and let Dn be the ttota1 matrix algebra over D of degree n. Let r = diag{r" "', r n } (r i #0) be given in Dn. Define J:Dn-->D n by a J = r' if' r- l . Then A is of the form A={xEDnlx=x J }. Let A be a special, reduced simple Jordan algebra such that Au = Ke i . If A is of degree 2, then A is a tClifford algebra. If A is of de- gree ? 3, then A is classified into five types. Let A be a simple Jordan algebra over a field K. Then there is an extension field P of K of finite degree such that A p ( = A @ KP) is isomorphic to one of the fo1\owing five types: (i) P;;, where Pn is the total matrix algebra of degree n over P; (ii) the suba1gebra of P;; consisting of a1\ symmetric matrices in P;;; (iii) {xEPimlxJ =x}, where x J ==q-l 'xq, q==( m m). with 'x the transpose of x and 1m the unit ma- 231 Ref. Jordan Algebras trix of degree m; (iv) an algebra generated by the generators so, Sj,..., Sn together with the defining relations so' Si = Si, s/ = So, si s}= 0 (i # j); (v) £':, where ff3 is the algebra of a1\ 3 x 3 Hermitian matrices over a tCay1ey algebra. C. Representations of Jordan Algebras A representation S of a Jordan algebra A on a K-linear space M is a K-linear mapping a-->Sa from A into the associative algebra E(M) of a1\ K -endomorphisms of M such that (i) [Sa, Sb oJ + [Sb, Sc aJ + [So Sa bJ = 0 and (ii) SaSb S , + Sc Sb Sa + S(a c) b = SaSb c + SbSa c + ScSa b (for all a, b, c in A). A K-linear space M is called a Jordan module of A if there are given bilinear mappings M x A-->M (denoted by (x,a)-->x' a), A x M -->M (denoted by (a, x)-->a' x) such that for every x E M and every a, b, c E A, (i) x. a=a' x; (ii) (x' a). (b' c)+(x' b)'(a.c)+ (x' c), (a' b)=(x' (b' c)), a+(x' (a' c)), b + (x . (a. b))' c; and (iii) x . a . b' C + x . C' b . a + a' c. b' x =(x' c). (a' b)+(x' a)' (b' c)+(x' b)' (a' c). As usual, there is a natural bijection between the representations of A and the Jordan modules of A. A special representation of a Jordan algebra A is a homomorphism A-->E+, where E is an associative algebra. Among the special representations of A, there exists a unique universal one in the fo1\owing sense. There exists a special representation S: A --> U + with the fo1\owing property: For every special representation 0": A --> E + there exists a unique homomorphism 11: U --> E such that 0" = I1S. The pair (U, S) is uniquely deter- mined. Furthermore, if A is n-dimensional over K, U is of dimension .s; 2 n - lover K. The pair (U, S) is ca1\ed the special universal enveloping algebra of A. A is special if and only if S: A--> U + is injective. A Jordan algebra A has only a finite number of inequivalent tirreducib1e Jordan modules. Suppose that the base field K is of character- istic O. Let S be a representation of A. If N is the radical of A, S(N) is contained in the rad- ical of the associative algebra S(A)* generated by S(A). If A is semisimple, so is S(A)*. In general, the radical R of S(A)* is an ideal of S(A)* generated by S(N), Furthermore, for every semisimp1e subalgebra T of A such that A = T EB N, we have S(A)* = S(T)* EB R. References [1] A. A. Albert, Non-associative algebras I, II, Ann. Math., (2) 43 (1942), 685-707. [2JA. A. Albert, A structure theory for Jordan algebras, Ann. Math., (2) 48 (1947),546-567. 
231 Ref. Jordan Algebras 864 [3J N. Jacobson, General representation theory of Jordan algebras, Trans. Amer. Math. Soc., 70 (1951),509-530. [4J F. D. Jacobson and N, Jacobson, Classifi- cation and representation of semi-simple Jor- dan algebras, Trans. Amer. Math. Soc., 65 (1949),141-169. [5J N. Jacobson and C. E. Rickart, Jordan homomorphisms of rings, Trans. Amer. Math. Soc., 69 (1950), 479-502. [6J N. Jacobson, Structure and representa- tions of Jordan algebras, Amer. Math. Soc. Colloq. Pub I., 1968. [7J H. Braun and M. Koecher, Jordan- Algebren, Springer, 1966. 
232 A Kahler Manifolds 232 (VII.1 0) Kahler Manifolds A. Definitions Let X be a tcomplex manifold of complex dimension n. Let J denote the ttensor field of type (1,1) oftalmost complex structure in- duced by the complex structure of X ( 72 Complex Manifolds A). Considering J as a linear transformation of vector fields on X, we have J2 = -1. A tRiemannian metric 9 of class COO on X is called a Hermitian metric on X if g(x, y)== g(Jx, Jy) for any two vector fields x, y on X. On a tparacompact complex manifold X there always exists a Hermitian metric, If we put Q(x, y) = g(J x, y) for a Hermitian metric g, then Q is an talternating covariant tensor field of order 2, and hence a tdifferential form of degree 2. We call Q the fundamental form associated with the Hermitian metric g. If the texterior derivative of Q vanishes, i.e., d Q = 0, the given Hermitian metric 9 on X is called a Kahler metric on X, and X is called a Kahler manifold. With a tholomorphic local coordinate system (z', '" , zn) on a coordi- nate neighborhood U in X, we can write 9 = L:,p, g,P dz'az P in U, where the matrix (g,p) is an n x n tpositive definite Hermitian matrix. Then the fundamental form can be expressed as Q = (j=!/2)Lg,pdz' 1\ dz p . If a complex manifold X is a Kahler mani- fold with a Kahler metric g, then X has the following properties: (I) For every point p of X there exists a real-valued function Ij; of class COC on a suitable coordinate neighborhood of p such that g,P = iP lj;joz'ozP. (2) For every point p there exists a holomorphic local co- ordinate system at p whose real and imaginary parts form a tgeodesic coordinate system at p in the weak sense. (We say a coordinate system (x" ... , x n ) is a geodesic coordinate system at p in the weak sense if [V'<,!oxi (i1/ox j )]p =0, i,j = 1, ...,n (417 Tensor Calculus).) Each of these properties characterizes a Kahler metric. The Kahler metric was introduced by E. Kah- ler with property (1) as the definition. W. V. D. Hodge applied it to the theory of harmonic integrals [7]. B. Harmonic Forms on Compact Kahler Manifolds (194 Harmonic Integrals) We now consider complex-valued differential forms on a compact complex manifold X of complex dimension n endowed with a Her- mitian metric, The operators d and * on real- valued differential forms can be uniquely 866 extended to complex linear operators, and the inner product of real differential forms can be extended to a Hermitian inner product of the form (<p,Ij;)= Sx<P 1\ *Ij; . Then the tadjoint operator,) of d is also complex linear, and the decomposition d = d' + d" gives rise to ,) = ,)' + ,)", where ,)' and ,)" are the adjoint opera- tors of d' and d", respectively. We also have ,)'=(-I)*d"* and ,)"=(-I)*d'*. Define the operator L by L<p = Q 1\ <p, and denote by A the adjoint operator of L. Then for p-forms, we have A = (-Jy * L *. We say that <p is primitive when A<p=O. Some important properties of L and A follow: (3) For a p-form <p, A<pP=O if and only if Lq<p==O(q==max(n-p+ 1,0)). (4) If A<p == 0 and <p is of type (r, s), then for all O.s;q,,:;;n- p, * Lq<p = {( _Jy(P+l)/2 x (j=!Y-Sq!/(n - p-q)!} U-p-q<p (where p = r + s). (5) A p-form <p can be uniquely decomposed in the form <p = <Po + L<Pl + '.. + L' <Pr with r == [p/2J and <Pi primi- tive (Hodge [7], Weil [13]). Properties (3)-(5) are shared by all Hermitian metrics. When the metric is Kahlerian, we further have Ld-dL=O Ad'-d'A= Cl,)" , V -1 , Ad"-d"A= -j=! ,)', From these we also obtain L=L, A=A, d',)" +,)"d'=O, d",)' +,)'d" =0,  =2(d',)' +,)' d')=2(d",)" +,)" d"), where  is the tLaplace-Beltrami operator  = d,) + ,)d. tGreen's operator G commutes with d', d", 15', and ,)". Thus when the metric is Kah- lerian, we have: (6) Let Lp(X)=Lr+FpLr,,(X) be the decomposition of the space of p-forms into the spaces Lr,s(X) of forms of type (r, s), Then, correspondingly, the space Hp(X) of harmonic p-forms is the direct sum Hp(X) = Lr+FP Hr,,(X) of the spaces Hr,.(X) of har- monic forms of type (r, s). (7) If we let A = d",)" +,)" d", then <p =0 is equivalent to A<p = O. Denote the projection to the space of har- monic forms by H. Then using the formula 1 == H + G ( 194 Harmonic Integrals A), we can infer that not only in the tcochain com- plex (L p Lp(X), d) but also in the cochain com- plexes (L,Lr,,(X), d") and (LrLr,.{X), d'), H is homotopic to the identity mapping and the respective cohomology groups of degree sand degree r of the last two complexes are both isomorphic to Hr,s(X). (8) For a harmonic p- form <p, the forms <Pi in the decomposition (5) are harmonic. (9) The extenor powers Or (r = 0, 1, .., , n) of the fundamental form Q are non- 
867 zero and harmonic. (10) Holomorphic dif- ferential forms are harmonic. C. Further Properties of Compact Kiihler Manifolds The p-dimensional tde Rham cohomology group with complex coefficients of a compact Kahler manifold X is canonically isomorphic to the direct sum of the tDolbeault coho- mology groups of type (r, s), where r + S = p ( 72 Complex Manifolds D). Let b p and h r , " respectively, denote the pth Betti number of X and dime H'(X, or). Then it follows that b p = Lr+,phr,'. If <p is harmonic, then ip is also harmonic, so we have hr,s=h"r. Therefore, if p is odd, b p is even. This is a generalization of the fact that the first Betti number of a closed Riemann surface is even. From (9) we have b p ?:-1 if p is even. Furthermore, a linear com- bination of irreducible analytic subsets of X of (complex) dimension r with positive real coeffi- cients is a cycle of degree 2r that is never hom- ologous to zero on X, since the integral of or on the cycle is never zero. For a compact Kahler manifold X, we can define a tcomplex torUs 1 and a holomorphic mapping },: X -->  such that (i)  is gener- ated by A(X) as a group; (ii) any holomorphic mapping J1: X --> T can be decomposed as J1 = ex 0 A + c, where T is another complex torus, ex is a complex analytic homomorphism from  to T, and c is a point of T.  is called the Albanese variety of X. The set  of complex analytic isomorphism classes of tcomplex line bundles that are trivial as topological bundles has a natural complex structure with a canoni- cally associated structure of a family of vector bundles ( 72 Complex Manifolds G). With this structure  is, in fact, a complex torus and is called the Picard variety of X. Then  and 1 are constructed using H'(X,R) and H 2n -'(X,R), respectively, and are dual com- plex tori. If X is a Hodge manifold ( Sec- tion D),  and  are Abelian varieties ( 3 Abelian Varieties). A small deformation of a compact Kahler manifold is also Kahlerian (a Kahler metric can be taken to be of class C'" with respect to parameters) [9]. But a limit (in the sense of deformation) of a Kahler manifold is not always Kahlerian, An example was given by Hironaka [6]. Generalizing the notion of compact Kahler manifold, A. Fujiki [2,3J introduced the cate- gory C(j by X E «(j if and only if (i) X is a com- pact complex reduced space and (ii) there exist a compact Kahler manifold Y and a surjec- tive holomorphic mapping: Y -->X. Fujiki 232 D Kiihler Manifolds proves among other things that when X EC(} is a manifold, (1) every holomorphic p-form is closed, (2) Hn(x, C) EBpHP(X, O-P) for any n?:-O, and (3) HP(X,01)H"(X,O) as a vector space, where the O denotes the sheaves of holomorphic p-forms on X. Let X be a compact Kahler manifold with Kahler metric g = L gap dz' dz p . Then R(g)= L Rap(g)dz'dz p (where Rap(g) = - 8 2 /oz"ozP(log(det(g,p)))) is the associ- ated Ricci tensor. According to Chern, (J=!/2n)L R,p(g)dz' /\ dZ P is a closed (1,1) form representing the first Chern class of X, Conversely, for a compact Kahler manifold X, E. Calabi conjectured that: (a) Given a real closed (1, 1) form (j'=1/2n)L Rapdz' /\ dz P which represents the first Chern class of X, there exists a unique Kahler metric g on X such that its corre- sponding Ricci tensor R(g) coincides with L Rap dz' dz P and that g determines the same cohomology class as the original Kahler metric. (b) If X has either negative or zero first Chern class, then X admits an Einstein Kahler metric 9 (where "Einstein" means R(g)= (const). g). As a partial answer to (b), T. Aubin proved the existence of an Einstein Kiihler metric on compact complex manifolds with negative first Chern class. Later, S. T. Yau gave a complete affirmative answer for both (a) and (b), devel- oping the method (solving complex Monge- Ampere equations) of Calabi and Aubin. Among several consequences of Yau's work, the following by A, Todorov is one of the most interesting: Every point of SO(3, 19)/SO(2) x SO(l, 19) corresponds to at least one marked Kahler surface via period mapping. Yau also showed that Aubin's previous work solved affirmatively the following con- jecture of F. Severi: Every compact com- plex surface that is homotopic to P2(C) is biholomorphic to P2(C), Yau's proof used a delicate differential geometric analysis of the inequality c, (X)2[XJ.s; 3c 2 (X)[XJ, proved by Y. Miyaoka for compact complex surfaces X of general type. The key observation is that if X is a compac.t complex surface with negative first Chern class and CdX)2[XJ = 3C2(X) [XJ, then the universal covering space of X is an open ball {(Z"Z2)EC 2 ; Iz,12+lz212< I}. For recent developments concerning Calabi's con- jecture  [1, 15]. D. Hodge Manifolds One of the most important examples of Kah- ler manifolds is a projective nonsingular vari- ety over the field of complex numbers. If we 
232 E Kiihler Manifolds let «(0, ..., (N) be a homogeneous coordinate system in the N-dimensional projective space pN (C), then the subset (k # 0 has its holomor- phic coordinate system (Zl, ..., ZN), where Zl = (o/(k> "', Zk = (k-J!(b ZHI = (Hl/(k> ..., ZN = (N/(k. If we let i/tk=(I/n)log(1 + Ljlzjl2) and g,p=a 2 i/t/az'az p , then ds 2 =Lg,pdz'dz P is independent of k and determines a standard Kiihler metric of pN (C) that coincides with the so-called Fubini-Study metric on PN(C) up to a constant multiplier. In this case, we have the following results: (11) Let f> be a hyper- plane of PN. Then the integral Sz Q of the fun- damental form Q on a 2-cycle Z is equal to the tKronecker index KI(Z, f» of the tintersec- tion of Z and f>. Therefore (12) an integral (a period) of Q on a 2-cycle with integral coeffi- cients is an integer. In other words, Q corre- sponds to the cohomology class in H 2 (X, R) of a cocycle with integral coefficients. Property (12) holds for the induced Kahler metric on an analytic submanifold X of pN (C) (which is algebraic by tChow's theorem). Property (11) also holds if we replace f> by the intersection Y of X and f>. Property (8) ( Section B) can be thought of as an expression (in terms of har- monic forms) of Lefchetz's theorem on the topology of projective algebraic varieties. Generally, if a compact complex manifold X admits a Kahler metric with property (12), we say that the metric is a Hodge metric and X is a Hodge manifold. Kodaira's theorem asserts that a Hodge manifold has a biholomorphic embedding into a projective space [8]. Coho- mology vanishing theorems ( Section D) and the properties of tmonoidal transformations of complex manifolds are used to prove this theorem. On a closed Riemann surface 9\, i,e., a 1- dimensional compact complex manifold, any Hermitian metric is Kahlerian, Moreover, a metric with total volume 1 is a Hodge metric. This proves that ffi is isomorphic to an alge- braic curve in a projective space. This is a proof (using Kodaira's theorem) of the exis- tence of noncOnstant meromorphic functions on 9\. The condition on the tRiemann matrix for a complex torus T= C"/D (D is a discrete subgroup of rank 2n) to be a projective alge- braic variety is that T admit a Hodge metric. Let L be the sheaf associated with a positive line bundle (i.e., its first Chern class is repre- sented by a Hodge metric) On a compact com- plex manifold X. Then the Kodaira vanishing theorem states that Hi(X, Kx @ L)=O for any i> 0, where K x is the canonical sheaf, i.e., the sheaf of holomorphic n-forms, n = dim X. Several generalizations are known, For in- stance, let X be a compact complex variety with a(X) = dim X and F be a sheaf associated with a positive vector bundle on X. Take a 868 nonsingular projective variety Y and a bira- tional holomorphic mapping n: Y --> X and define Kx to be the direct image n*(Ky). Then Hi(X, K x @ F)=O for any i>O(S. Nakano [lOJ, H. Grauert and O. Riemenschneider [5J), Ramanujam's vanishing theorem is con- cerned with an effective divisor D on a nOn- singular projective surface S, D is said to be numerically connected if the intersection num- ber Dl . D 2 > 0 for every decomposition D = Dl + D 2 with D i > O. If D 2 > 0 and D is numer- ically connected, then HI(S,fD)=O, where f D is the sheaf of ideals defining D [12]. How- ever, Kodaira's vanishing theorem fails for algebraic manifolds over fields of positive characteristic. E. Examples and Other Properties Concerning differential geometry on compact KaWer manifolds, the analytic transformation group and the tisometric transformation group are also studied. For example, let X be a com- pact Kahler manifold of complex dimension n. Then the Lie group of isometric transforma- tions on X is of dimension .s; n 2 + 2n, and the equality holds if and only if X = pn(C). A nonalgebraic complex torus is the most important example of a compact Kahler mani- fold that is not algebraic. A complex torus is not an algebraic variety if it is not a Hodge manifold. An example of a noncompact Kahler mani- fold is a bounded domain in C" with the Kah- ler metric ds 2 = L (a 2 logK(z, 'Zvaz'azp)dz' dz P , ,.p where K (z, rJ is tBergmim's kernel function. This metric is significant because of its invar- iance under the analytic automorphisms of the domain. More generally, a tStein manifold admits a complete Kahler metric [4]. References [IJ J. P. Bourguignon et aI., Premiere c1asse de Chern et courbure de Ricci: preuve de la con- jecture de Calabi (Seminaire Palaiseau, 1978), Asterisque 58, Societe Mathematique de France. [2J A. Fujiki, Closedness of the Douady space of compact Kahler spaces, Publ. Res. Inst. Math. Sci., 14 (1978),1-52. [3J A. Fujiki, On automorphism groups of compact Kahler manifolds, Inventiones Math., 44 (1978),225-258. [4J H. Grauert, Charakterisierung der Holo- morphiegebiete durch die vollstandige Kah- lersche Metrik, Math. Ann., 131 (1956), 38- 75. 
869 [5J H. Grauert and O. Riemenschneider, Ver- schwindungssatze fUr analytische Kohomo- logiegruppen auf komplexen Raumen, Inven- tiones Math., 11 (1970), 263- 292. [6] H. Hironaka, An example of a non- Kahlerian complex-analytic deformation of Kahlerian complex structures, Ann. Math., (2) 75 (1962), 190-208, [7] W. V, D. Hodge, The theory and applica- tions of harmonic integrals, Cambridge Univ. Press, second edition, 1952. [8J K. Kodaira, On Kahler varieties of re- stricted type (An intrinsic characterization of algebraic varieties), Ann. Math., (2) 60 (1954), 28-48. [9J K. Kodaira and D, C. Spencer, On de- formations of complex analytic structures III, Ann. Math" (2) 71 (1960),43-76. [!OJ S. Nakano, On complex analytic vector bundles, J. Math. Soc. Japan, 7 (1955),1-12. [IIJ c. P. Ramanujam, Remarks on the Kodaira vanishing theorem, J. Indian Math. Soc., 36 (1972), 41-51. [12J C. P. Ramanujam, Supplement to the article "Remarks on the Kodaira vanishing theorem," J. Indian Math. Soc., 38 (1974), 121-124. [13J A Weil, Introduction a l'etude des varietes kahleriennes, Actualites Sci. Ind., Hermann, 1958. [14J R. O. WelIs, Jr., Differential analysis on complex manifolds, Prentice-HalI, 1973. [15] S. T. Yau, Calabi's conjecture and some new results in algebraic geometry, Proc. Nat. Acad. Sci. US, 74 (1977), 1798-1799. [16J S. T. Yau, On the Ricci curvature of a compact Kahler manifold and the complex Monge-Ampere equation I, Comm. Pure Appl. Math., 31 (1978),339-411. 233 (XXI.30) Klein, Felix Felix Klein (May 25, 1849-June 22, 1925) was one of the leading mathematicians in Germany in the latter half of the 19th century. Born in Diisseldorf and graduated from the University of Bonn, Klein went to study in Paris. In 1872 he became a professor at the University of Erlangen, and in 1886 attained a chair at the University of Gottingen, where he was em- ployed until his death. His accomplishments cover alI aspects of mathematics, but his main field was geometry. In his inaugural lecture at the University of Erlangen, he presented a bird's-eye view of all the then known fields of geometry from the standpoint of group theory, which is referred to as the tErlangen program 234 A Kleinian Groups ( 137 Erlangen Program). In it he stated that both Euclidean and non-Euclidean geometry are included in tprojective geometry, Klein said in his last lectures [4J that he spent the greatest part of his energies in the field of tautomorphic functions. These last lectures are important as historical material on the mathe- matics of the 19th century. Klein was a leader of reforms of mathematical education in Ger- many [3]. References [IJ F. Klein, Gesammelte mathematische Abhandlungen I -III, Springer, 1921-1923. [2] R. Fricke and F. Klein, Vorlesungen iiber die Theorie der automorphen Funktionen, Teubner, I, 1897; II, 1901. [3J F. Klein, Elementarmathematik von ho- heren Standpunkte aus, Springer, 1,1924; II, 1925; III, 1928. [4J F. Klein, Vorlesungen iiber die Entwick- lung der Mathematik im 19. Jahrhundert I, II, Springer, 1926-1927 (Chelsea, 1956). [5] F. Klein, Vorlesungen iiber nicht- Euklidische Geometrie, Springer, 1928 (Chel- sea, 1960). [6J F. Klein, Vorlesungen iiber hohere Geo- metrie, Springer, third edition, 1926 (Chelsea, 1957). 234 (XI.17) Kleinian Groups A. Definitions Let H 3 be the upper half-space of the 3- dimensional Euclidean space. The boundary plane of H 3 is regarded as the complex plane C. The one-point compactification H 3 U C of H 3 is denoted by fP, where C is the ex- tended complex plane. Let Mob(H 3 ) be the group of alI the Mobius transformations in fP which are represented by an even number of compositions of reflections with respect to hemispheres in fP orthogonal to C. Here, the plane in H 3 orthogonal to C is regarded as a hemisphere orthogonal to C. The hyperbolic structure can be defined naturalIy in H 3 , and this structure is preserved by any element in Mob(H 3 ). A hemisphere in fP orthogonal to C is a (hyperbolic) plane in this structure. The restriction of any element in Mob(H 3 ) to C is a linear fractional transformation on C; con- versely, for any given linear fractional trans- formation y on C, there is a unique Mobius 
234B Kleinian Groups transformation in Mob(H 3 ) whose restriction to C coincides with y. A Kleinian group r is defined as a discrete subgroup of Mob(H 3 ) and acts discontinu- ously on H 3 . If there exist a point pEH 3 and a sequence {Yn} of distinct elements of r such that the sequence {Yn(P)} converges to a point 'E C in the usual topology of fj3, then the point' is called a limit point of r, and the set A(rj of all the limit points of r is called the limit set of r, which is a closed subset of C. When A(rj = C, the Kleinian group r is of the first kind. Otherwise, r is of the second kind, and the restriction of r to C acts discontinu- ously on the region of discontinuity O(rj = C - A(rj for r. Usually, "a Kleinian group" means one of the second kind. Thus the Klein- ian group r acts discontinuously on H 3 U O(r), the quotient space H 3 /r is a 3-manifold with a hyperbolic structure induced by that of H 3 , and the union of some Riemann surfaces O(rj/r is the boundary of this 3-manifold. A study of Kleinian groups in connection with the theory of 3-manifolds has been done re- cently by W. P. Thurston. If A(r) consists of at most two points, then the Kleinian group r is called elementary; if otherwise, r is nonelementary and A(rj is perfect and nowhere dense in C, and O(rj allows the Poincare metric and has a hyper- bolic structure invariant under r. A Kleinian group whose region of discontinuity has a nonempty connected component invariant under r is often called a function group. B. Examples A Fuchsian group ( 122 Discontinuous Groups) is a Kleinian group which leaves a circular disk in C invariant. Another important Kleinian group is the so-called Schottky group. Let {C j , C;}';= I be /I (  2) pairs of Jordan curves on C, everyone of which lies in the exterior of each of the other 2n - 1 curves, Assume that there are n loxodromic (or hyper- bolic) linear fractional transformations Yj, where Yj maps the interior of C j onto the ex- terior of C; conform ally. The group generated by these {Yj} is a Schottky group whose limit set is totally disconnected and has its nonem- pty subset in the interior of every C j and C;. A Kleinian group is a Schottky group if and only if it consists of only loxodromic or hyperbolic elements, is finitely generated, and is free. A Schottky group is not always Fuchsian, and there are many kinds of Kleinian groups which are not Fuchsian. Quasiconformal de- formation of a Fuchsian group G of the first kind yields the tTeichmtiller space T(G) with 870 the center G, and every point of T(G) is a quasi-Fuchsian group whose limit set is a quasicircle, an image of a circle on C under a quasiconformal mapping of C onto itself. To every point of the boundary of T( G), there corresponds a Kleinian group called a bound- ary group for G. Among them, there is a Klein- ian group whose region of discontinuity is connected and simply connected. Such a Kleinian group is called totally degenerate. A web group is a Kleinian group whose region of discontinuity consists only of Jordan domains. C. Fundamental Sets Let r be a Kleinian group acting on H 3 . There is a tfundamental set for r in H 3 which is bounded by a countable number of hyperbolic planes in H 3 and is convex in the sense of the hyperbolic structure. The closure of this funda- mental set in H 3 is called a convex funda- mental polyhedron for r. For a point PoEH3 not fixed by elements of r except for the identity, the set {pEH3Id(p,Po).s;d(y(p),po) for each YEr}, where d denotes the hyperbolic distance, yields a convex fundamental poly- hedron for r and is called the Dirichlet region with the center Po for r. If r has a convex fundamental polyhedron with a finite number of sides or faces, then r is called geometri- cally finite. Similarly, a Dirichlet region for a Fuchsian group in the upper half complex plane H can be defined, and it is often called a normal polygon for the group, For a Kleinian group r, instead of fundamental sets for r in O(rj, the concept of fundamental domains is sometimes useful. A fundamental domain for a Kleinian group r in O(r) (#0) is an open subset D of O(rj such that r-images of any point in D, except the point itself, are not in D and such that some of the r -images of any point in O(rj lie on the closure of D in O(rj. The Ford fundamental region is also useful. The action of Y E r on C is given by a linear fractional transformation z -->( az + b )/( cz + d), ad-bc=1. The circle {zECllcz+dl=l} is called the isometric circle of Y with c #0, If 00 E O(rj, then the set, every point of which is in the exterior of isometric circles of all y E r with c #0, is a Ford fundamental region for r [7]. D. Finitely Generated Kleinian Groups Much work has been done on finitely gen- erated Kleinian groups [2,5,9,10,12, 13J, while very little has been done on nonfmitely generated Kleinian groups, If a Kleinian group r (of the second kind) is finitely generated, 
871 then the quotient space O(r)jr is a disjoint union of a finite number of compact Riemann surfaces with at most a finite number of punc- tures (Ahlfors's finiteness theorem). Further- more, if the above r is nonelementary and is generated by N generators, then the (hyper- bolic) area of o (r)jr is not greater than 4n(N - 1) (Bers's area theorem). These two theo- rems played important roles in the study of Kleinian groups after 1960. If a Fuchsian group is finitely generated, then every normal polygon for the group has a finite number of sides. In contrast with this fact, the following is suggestive: If a finitely generated Kleinian group r is totally degenerate, then r is not geometrically finite (L. Greenberg). There is a method to obtain a Kleinian group from two simple groups. If r j (j = 1,2) is a finitely generated Kleinian group and if the interior of a connected fundamental domain Dj for r j (j = 1,2) contains the boundary and the exterior of D j (i # j), then the group generated by r 1 and r 2 is again a Kleinian group with a fundamental domain D, n D 2 (Klein's combina- tion theorem). Generalizations of this theorem are discussed by Maskit [14J. Let r be a nonelementary Kleinian group, and let  (# 0) be an invariant union of com- ponents of O(r) under r. Denote by Aq(, r) the Banach space of holomorphic auto- morphic forms <p(z)dz q in  for r satisfying JJ."/r p(zf-ql <p(z)1 dx dy < 00, where Mr is a measurable fundamental set for r in , z = x +iy, and p(z)ldzl is the Poincare metric on, The Banach space Bq(, r) is the totality of holomorphic automorphic forms tjJ(z) dz q for r in  with SUPZE.''.p(zPltjJ(z)1 < 00. If r is finitely generated, then Aq(, r) = Bq(, r), and it is the space of all the cusp forms for r in . If r is not finitely generated, then even the inclusion Aq(O(r), r) c: Bq(O(r), r) does not hold in general. E. Limit Sets of Kleinian Groups Let r be a Kleinian group, and let {OJ be the collection of all the components of O(r). Although it was stated in [8] that A(r) con- sists of boundaries aO j of OJ, it was proved by Abikoff in [3] that the statement is incorrect; that is, the residual limit set Ao(r) = A(r)- Uj aO j of r is not always empty. In fact, there is a web group whose region of discontinuity consists of an infinite number of components, and such a group has a nonempty residual limit set [I]. It was conjectured by Ahlfors that the 2- dimensional Lebesgue measure of A(r) equals zero for every finitely generated Kleinian group 234 Ref. Kleinian Groups r of the second kind. This conjecture is one of the sources of recent developments of the theory of Kleinian groups, and it is still open. Concerning this conjecture, the following fact was proved by Sullivan in [13] from the stand- poin t of the ergodic theory: The Bel trami coefficient for r with support on A(r) is equal to zero. It is also known that, if r is geometri- cally finite, then the conjecture is affirmative. The study of Kleinian groups from the view- point of ergodic theory has also been devel- oped [11,15]. Let M,(r) be the tt-dimensional Hausdorff measure of A(r). The Hausdorff dimension d(r) of A(r) is defined by d(r)=inf{t>OI M,(r)=O}. There are some studies on the Hausdorff dimension of A(r) for several kinds of Kleinian groups r [3,4,6]. References [IJ W. Abikoff, The residual limit sets of Kleinian groups, Acta Math., 130 (1973), 127- 144. [2J L. Ahlfors et aI., Advances in the theory of Riemann surfaces, Ann. math. studies 66, Princeton Univ. Press, 1971. [3] T. Akaza, Local property of the singular sets of some Kleinian groups, Tohoku Math. 1., (2) 25 (1973),1-22. [4J A. F, Beardon, Hausdorff dimension of singular set of properly discontinuous groups, Amer. 1. Math" 88 (1966), 722-736, [5J L. Bers, 1. Kra, et aI., A crash course on Kleinian groups, Lecture notes in math. 400, Springer, 1974, [6J R. Bowen, Hausdorff dimension of quasi- circles, Publ. Math. Inst. HES, 50 (1979), 1-25. [7] L. R. Ford, Automorphic functions, Chel- sea, second edition, 1951. [8J R. Fricke and F. Klein, V orlesungen iiber die Theorie der Automorphen Funktionen I, II, Teubner, 1897. [9J L. Greenberg et aI., Discontinuous groups and Riemann surfaces, Ann. math. studies 79, Princeton Univ. Press, 1974. [!OJ W. J. Harvey, et aI., Discrete groups and automorphic functions, Academic Press, 1977. [IIJ E. Hopf, Ergodentheorie, Springer, 1937. [12J I. Kra, Automorphic forms and Kleinian groups, Benjamin, 1972. [13] I. Kra, B. Maskit, et aI., Riemann surfaces and related topics, Ann. math. studies 97, Princeton Univ. Press, 1980, [14J B. Maskit, On Klein's combination theorem, Trans. Amer. Math. Soc., 120 (1965), 499-509. [15J M. Tsuji, Potential theory in modern function theory, Maruzen, 1959. 
235 A Knot Theory 235 (IX.16) Knot Theory A. General Remarks The knot problem is a special case of the placement problem, stated as follows: Given two homeomorphic topological spaces XI and X 2 and their respective homeomorphic subsets Al and A 2 , is there a homeomor- phism f:X I -->X 2 such that f(A I )== A 2 ? A simple closed curve in a Euclidean 3-space R 3 (or in a 3-sphere S3 obtained from R 3 by one-point compactification, sometimes used in preference to R 3) is called a knot. If two knots KI and K 2 can be mapped from one to the other by a homeomorphism of R 3 (or of S3), they are called equivalent. All knots are classified into knot types by this eq ui valence. Let (x, y, z) be Cartesian coordinates in R 3. Knots eq uivalent to x 2 + l = 1, z == 0 and their knot type are called trivial or unknotted; knots of other types are called knotted. Knots equiv- alent to those given by polygonal curves in R 3 and their knot types are called tame; other knots are called wild. We discuss only tame knots in this article except in Section H. Given a knot K in R 3 , there exists a plane such that the orthogonal projection n on it has the fol- lowing two properties: (1) The image n(K) has no multiple points other than a finite number of double points. (2) The projections of the vertices of K are not double points of n(K). Then n(K) is called a regular knot projection of K. Let z=O be the plane in question. Of the two points of K corresponding to a dou- ble point of n(K), the one with the greater z- coordinate is called the overcrossing point; the one with the smaller z-coordinate is calIed the undercrossing point. Suppose that overcrossing points and undercrossing points appear alternately when we move along K in a fixed direction; then K is caned alternating. Knots KI and K 2 in R 3 are said to be of the same isotopy type if there is an tisotopy {h,} (0.s;t.s;l)ofR 3 such that h,:R 3 -->R 3 are homeomorphisms with the identity ho and such that h l (K I )=K 2 . Knots Kj and K 2 ofR 3 are of the same isotopy type if and only if KI is mapped onto K 2 by an orientation-preserving homeomorphism of R 3. Thus knots of the same isotopy type are equivalent. The con- verse, however, is false. A knot that can be mapped onto itself by an orientation-reversing homeomorphism of R 3 is called amphicheiral. A knot K is called invertible if K is map- ped onto itself by an orientation-preserving 872 homeomorphism h such that hi K reverses the orientation of K. Knots 41,63,83,89,812,817' 8 18 in Appendix A, Table 7 are amphicheiral, and 8 17 is noninvertible. Given a knot K, there is an orientable sur- face F of tgenus h, say, having K as its bound- ary, called a Seifert surface of K [1]. The minimum of the genera of Seifert surfaces of K is called the genus of K, and there is an algorithm with which one can determine the genus of K (W. Haken, Acta Math., 105 (1961)), Using tDehn's lemma and the tsphere theorem, C. D. Papakyriakopoulos showed that for a tame knot, n i (S3 - K) = 0 (i;;;' 2), and further, K is un knotted if and only if n I (S3 - K)Z[2]. The first attempt to list all knots systemati- cally was made by P. G. Tait and C. N. Little in the 19th century; an attempt was also made by J. H. Conway in 1970. At present, the classi- fication has been completed for all knot types with at most ten double points by using the invariants of knots introduced in Sections B-E [3,4]. Given two knots Kj and K 2 separated by a 2-sphere S2, we can tie them together with a narrow strip to obtain a new knot K (Fig. 1), caned the product or composition of K I and K 2 . If the strip is chosen so that the orienta- tions of K I , K2' and K are mutually compat- ible, then the isotopy type of K is uniquely determined by those of KI and K 2 . The set of Isotopy types forms a commutative tsemigroup with respect to this product, in which the trivial knot type is an identity. Any knot is decomposed uniquely (up to isotopy) into finite products of the prime knots, which can- not be decomposed into products of knotted knots (H. Schubert and S.-B. Heidelberger, Akad. Wiss., 3 (1949». m ....::=.... Fig. I Until about 1930 the theory of knots had been studied chiefly by 1. W. Alexander in the United States and by K. Reidemeister, H. Seifert, and others in Germany. Little progress was made, however, until a new development of the theory was introduced by R. H. Fox [5J and his school in the United States. In Japan, significant contributions have been made by T. Homma, S. Kinoshita, K. Murasugi, and others [6]. 
873 B. Knot Groups If K is a knot, the tfundamental group G(K) = nl (R 3 - K) of R 3 - K is called the knot group of K. Any group G is written as a tquotient group F / N with a tfree group F and its tnor- mal subgroup N. By specifying the generating set {XI' X2,'" } of F and a set of elements {rl> r 2 ,... } in F such that N is the minimum normal subgroup of F containing rl> r 2 , ..., we obtain a system (XI' X 2 ,...: r l , r 2 ,...), called a presentation ofG, written G=(XI> X2, ...: r l , r2'''')' For the knot group G(K), there is a standard presentation caIled a Wir- tinger presentation, which is obtained as fol- lows. First, for the sake of convenience, we give an orientation to K. (The orientation is irrelevant, however, since the reverse orienta- tion gives an isomorphic group.) Consider a regular knot projection n(K). Ifn(K) has n double points n(d,), then K is divided into n arcs Zi by n undercrossing points. We con- sider a free group F generated by n letters x" ... ,X n . To each double point n(d,) we asso- ciate a word r i in the foIlowing manner. The two cases near n(d,) are illustrated in Fig. 2. For the first case, we define ri=x;;IXXAX;I; and for the second, r i == X;;I x;l xAX w Then the presentation (x I, X2, ..., X n : r l , r2' ..., r n ) is called a Wirtinger presentation of G(K). Since anyone of these ri is a consequence of the rest, G(K) may be written as (XI' X2, ..., X n : r l ,r2, ... ,rn-I)' "'x., / 'x r. HI =XilXIlX;.X;:1 "".' / 'x rHI =xiJ1x;1 X;"X/l Fig. 2 If G' is the tcommutator subgroup of G(K), then G(K)/G' is an infinite cyclic group Z. A knot group is an invariant of a knot type, but two knots with isomorphic knot groups are not necessarily equivalent. However, it is not known whether or not two prime knots with isomorphic knot groups are equivalent. The affirmative statement is called the general knot conjecture and is one of the fundamental prob- lems in knot theory. The knot complement conjecture is another important conjecture and states that two prime knots with isomorphic knot groups have homeomorphic comple- 235C Knot Theory ments [11]. Nevertheless, the knot group plays a prime role in knot theory. To be more pre- cise, given a knot Kin S3, consider a ttubu- lar neighborhood V of K. V is a solid torus. Choose two simple closed curves m and Ion o V in such a way that (1) m and I intersect at the base point; (2) m bounds a disk in V, but m is not homologous to 0 on oV; (3) I bounds an orientable surface in S3 - V , but I is not homo- logous to 0 on 0 V. A pair {m, I} is called a peripheral system for K, and m is a meridian and I a longitude of K. The meridian m and the longitude I, as elements of G(K), generate a free Abelian subgroup p(K) of G(K), called the peripheral subgroup of G(K). Two knots KI and K 2 are equivalent if and only if there is an isomorphism <p from G(Kd to G(K 2 ) which sends {m l , ld onto a conjugate of {ml, 11} in G(K 2 ) (F. Waldhausen, Ann. Math., (2) 87 (1968». Therefore the group system {G(K), p(K)} is an important invariant of the knot type of K. G(K) has been studied quite extensively since 1960. For example, the commutator subgroup G' of G(K) is tfinitely generated if and only if G' is free, and then the rank of G' is twice the genus of K [7]. S3 - V is a fiber bundle over SI with the fiber an orientable surface if and only if G' is finitely generated (J. Stallings, 1962). Such a knot is called a fibered knot. G(K) is residually finite for any knot K (the result requires Thurston's hyperbolization theorem), and thus the tword problem for G(K) is solv- able. G(K) has no element of finite order [2]. The knot type whose group has a nontrivial tcenter is completely characterized. Also, many problems on 3-manifolds can be formulated in terms of knot groups. For example, the foIlow- ing conjecture, related to tPoincari:'s conjec- ture, still remains unsolved. Property P conjec- ture: For a knotted knot K in S3, let N q be the smaIlest normal subgroup of G(K) containing ml q . If G(K)/N q is a trivial group, then q =0. C. Alexander Invariants and the Seifert Matrix Let (XI> ...,xn;r l , ...,r n - I ) be a presentation of the knot group G of a knot K, F be the free group generated by X I' X 2 , .,., X n , and <p: F--> G, tjJ: G-->H = G/G' be canonical homomor- phisms. Then <p and tjJ can be extended to homomorphisms between tgroup algebras with integral coefficients <p:Z[FJ-->Z[GJ and tjJ:Z[GJ-->Z[H]. For any word r in F, we have the free derivative or/ox i for i = 1, ..., n satisfy- ing the following: OX';OX j = (jij, OXi-I/OXi= -Xi-I, o (rs)/ox i = or/ox i + r' (OS/OX,) 
235D Knot Theory (Fox [6J). Now we have the Alexander matrix A = (a i ) (i= I,... ,n-1;j= I,... ,n) of the knot K defined by aij=tjJo<p(8rd8x). The ideal Ek of Z[HJ generated by all (n-k) x (n-k) minors of A is called the kth elementary ideal of A. OsE 1 sE 2 s... s En=Z[HJ, and in particular, EI is a principal ideal, called the Alexander ideal of K. Since H;;;, Z, any element of Z[HJ is a finite sum of elements of the form mt n (m, nEZ). The generator (t) of the Alexander (principal) ideal is called the Alexander poly- nomial of the knot K. The elementary ideals of A, including (t), are invariants of the knot type of K. (t) is uniquely determined by K up to the factor :t t A and has the following prop- erties: (i) (1)= :tl; (ii) (llt)=tA(t). Con- versely, any polynomial with integral coeffi- cients satisfying (i) and (ii) is the Alexander polynomial of some knot. For example, the knot group G of the trefoil knot (or clover- leaf knot) K (Fig. 3) is presented by G =(a, b; aba(bab)-I). The Alexander matrix of K is (1 - t + t 2 , -1 + t - t Z ), and the Alexander polynomial of K is (t)= I-t+t z . '"Q) B a Fig. 3 For a trivial knot, the Alexander polynomial is 1, but the converse is false. The first example of a nontrivial knot with (t) = 1 was given by Seifert (Fig. 4) [I]. The Kinoshita-Terasaka knot (Fig. 5) is another knot with this property (Osaka J. Math. 9 (1957». Kl K 2  ;[\ 'g  Fig. 4 Fig. 5 The degree of the Alexander polynomial never exceeds twice the genus of the knot, and the equality holds for any alternating knots and for fibered knots. For fibered knots, (O) = :tl and this is also sufficient for alternating knots to be fibered knots. The Seifert surface F of K is used to define an invariant of K as follows, Since HI (F; Z) is a free Abelian group of rank 2h, we can choose 2h oriented simple closed curves ai, a 2 , "', a Zh on F, whose homology classes form a free 874 basis for HI(F;Z). Consider F x [-I, I] in S3, and let v ij denote the tlinking number D(a; x { -I}, aj x {I}). Then the 2h x 2h integral matrix V=(viJ is called the Seifert matrix of K. It has been shown that the determinant of the matrix 'V -tV is the Alexander polynomial (t) of K, and the tsignature of the symmetric matrix 'V + V is a useful knot invariant O"(K), called the signature of K (H. F. Trotter, 1962; [8J). For a trefoil knot K (Fig. 3), O"(K) = -2. Since O"(K) = 0 for an amphicheiral knot, this shows that a trefoil knot is not amphicheiral. D. Link Theory A link L in R 3 is a disjoint union of knots K i in R 3 and is denoted by L = K I U K 2 U " . U Kr. Two links L=KI U K 2 U ... U Kr and L'= K; UK; U '" UK; are said to be equivalent if r = s and each K i is mapped onto K;, i = 1,2, .. ., r, by a homeomorphism of R 3. Further, we can define the isotopy type of a link, as we did in Section A. In link theory, however, we mostly consider oriented links, i.e., each com- ponent is given an orientation. Therefore, in the following, we consider only oriented links, and the link type means an isotopy type which preserves the orientation of each component. Let L = KI U K 2 U ... U Kr be a link in R 3 c: S3. The link group G(L) of L is defined as n l (S3 - L). Using a regular projection, a Wir- tinger presentation of G(L)=(x l , x 2 , "', x n : r 1 , r2,"" rn-d can be obtained in exactly the same manner as was described in Section B. The Alexander matrix A of L is also defined by means of the free derivative from a presen- tation of G(L). Since GIG' is a free Abelian group of rank r, the Alexander matrix A is a matrix over an integral Laurent polynomial ring with r variables t 1 ,t2,"', t" and hence, the first elementary ideal EdA) may not be principal. A generator of the smallest prin- cipal ideal containing the generators of EI (A) is called the Alexander polynomial or the link polynomial of L. It is an integral polynomial (t I, t z , ..., t r ) which is uniquely determined up to the factor :t t;1 t2 ." t;'. The link poly- nomial is an invariant of the link type but is not an invariant of the link group. There are some necessary conditions for a polynomial to be a link polynomial, but they are not suffi- cient to characterize a link polynomial. By putting t 1= t 2 =... = t" we obtain a poly- nomial of one variable .1(t). (1 - t).1(t) is called the reduced link polynomial. .1(t) is divisible by (l-ty-z and V'(t)=.1(t)/(I- tj'-2 is called a Hosokawa polynomial of L. V'(t) is symmetric. n 2 (S3 - L) #0 if and only if L splits, i.e., there is a 2-sphere S2 in S3 which is disjoint from L, but each component of S3 - S2 con- 
875 tains at least one component of L. Given two links, we can define (not uniquely) a product link in the same manner as was described in Section A, and any !ink type is decomposed uniquely into a finite number of prime links. The complement of a prime link cannot be determined from its group. Given a link L of r components, we can find an orientable surface F of genus h, say, whose boundary is L, and thus we can define a Seifert matrix V for L. ( Section C. The only dif- ference is that HI (F; Z) is free Abelian of rank 2h+r-1.) The determinant of'V - tV is, then, the reduced link polynomial. The signature of 'V + V is called the signature of L [8]. For links in S3, we can define a weaker equivalence, called the homotopy type. Two links are said to be homotopic (or of the same homotopy type) if one link is deformed con- tinuously onto another under the condition that during the deformation each component of the link is allowed to cross itself, but no two components are allowed to intersect. A typical homotopy invariant of a link L is the linking number Aij= Lk(Ki' Kj)' The absolute value of Aij is determined by GIG 2 , where G;, i = 1,2,..., denotes the t\ower central series of G(L). Using the group GIGk> k 1, J. Milnor defines a numerical link invariant JiUd2 ...jd, l.s;jl,h, ... ,jk.s; r, called Milnor's invariant (Ann. Math., (2) 59 (1954)). In particular, Ji(ij) = Aij, and Ji completely determines the homotopy type of a link with at most three components. E. Representations and Covering Spaces Let K be a knot (or a link) in S3. A ttransitivc homomorphism <P from the group G(K) into Sn' the tsymmetric group of degree n, is called a representation of G( K) of degree n. Two rep- resentations <PI and <P2 are said to be equiva- lent if there is an tinner automorphism p of Sn such that <P2 = P<PI . Since GIG'  Z, <p(G)I<p(G)' must be cyclic, Conversely, any finite group r with riP cyclic is a homomorphic image of some knot in S3 (F, Gonzalez-Acuna, 1975), If r = Sn or D 2n +1 (the tdihedral group of order 2(2n + 1 )), r Ir' is finite cyclic and some conditions are known for the knot group to be mapped onto r [5]. Given a representation <P of G(K) of degree n, we obtain the n-fold tcovering space C'P of S3 - K. Equivalent representations correspond to homeomorphic covering spaces. C'P can be completed to a covering space L'P of S3 with branch points on K. L'P is an orientable closed 3-manifold and is called the n-fold branched covering space of S3 along K. Let A'P be L'P- C'P' Then A'P is a link in L'P' Any topological 235 E Knot Theory invariant of (L'P' A'P) is an invariant of K. For example, the homology groups of L'P are im- portant invariants of K, and by using these invariants we can show that the two knots KI (Fig. 4) and K 2 (Fig. 5) are distinct (R. Riley, 1971). If Hl(L'P;Z) is a finite group, we can compute the linking number between two components of A'P (R. Hartley and Murasugi, Canad. J. Math., 29 (1977». The set of these rational numbers is called the covering link- age invariants and is used to classify all knot types with ten double points (K. Perko, Jr. (1974)). Since GIG'  Z, every knot group has a unique representation on the cyclic group of order g. Let Eg - Ag be the corresponding g- fold covering space of S3 - K, called the cyclic covering space. Then the I-dimensional ho- mology group Hl(Lg-Ag) is isomorphic to the direct sum of HI (Eg) and Z, and the order e of HI (Lg) is given by 0 = IIJJ A(w\ where w denotes the primitive gth root of unity. If HI (Lg) is infinite, then the tBetti number of Lg is the number of roots of (t)=O, properly counted, that are gth roots of unity. Since every orientable closed 3-manifold is a branched covering space of S3 along a knot or link (Alexander (1928), some attempt has been made to identify tsimply connected 3- manifolds as branched covering spaces of S3. However, all simply connected 3-manifolds obtained as coverings thus far are known to be S3. In 1979 W. Thurston proved that any g-fold (g  2) branched cyclic covering space of S3 (or of a homotopy 3-sphere) along a nontrivial knot is never simply connected, This result set- tles affirmatively the Smith conjecture: The fixed point set of any periodic self-diffeomorphism of S3 is unknotted if it is a circle. The proof of the conjecture is based on the study of hyperbolic structures on 3-manifolds initiated by W. Thurston (about 1977) and on the equivariant loop theorem of Meeks and Yau. A hyperbolic manifold is a Riemannian manifold of constant negative curvature which is complete and of finite volume, A knot (or a link) K whose complement S3 - K admits a hyperbolic-manifold structure is called a hyperbolic knot (or link). Two hyperbolic knots with isomorphic knot groups have homeo- morphic complementary domains. A faithful representation i/J of a knot group G(K) into tPSL(2, C) is called an excellent parabolic rep- resentation if(!) trt/t(m)=:t2 for a meridian m and (2) t/t(G(K)) is a non-Abelian discrete subgroup of PSL(2, C). According to Thur- ston, a knot K is hyperbolic if and only if G(K) has an excellent parabolic representation. The theory of hyperbolic knots has had a most profound influence on knot theory, 
235 F Knot Theory F. Braids The theory of braids has been used as a tool to investigate the theory ofknots. An illustration of a braid of fifth order is given in Fig. 6. In general, consider a cube D 3 written as D 2 x [0, IJ, where D 2 is a disk {(x, y) 10.s;x,y.s; I} . 3 ( i 1 ) , 2 In R . Let P i = -,- be pomts on D and n+ 1 2 define 2n points Ai=P i x {I} and Bi=P i x {O}, i= 1,2, ...,n. Then join Ai with B k " i= 1,2, ..., n, where (k 1 , k 2 , ..., k n ) is a permutation of (1, 2, .." n), by means of mutually disjoint polygonal curves 'i in D 3 in such a way that (I) 'inaD3 = AiU Bk and (2) for each t, O.s; t.s; 1, a disk D,=D2 x (t} intersects 'I at exactly one point. Such a configuration is called a braid of nth order. Two braids 31 and 32 are said to be isotopic, 31 32, if there is a homeomorphism of D 3 onto itself mapping 31 onto 32 such that it is an identity on aD 3 . The isotopy relation is an equivalence relation among braids. Suppose that we are given a braid such as Fig. 6. If we connect Ai and B i by polygonal curves I; on the boundary of D 3 (Fig. 7), we obtain a closed braid. In general, a closed braid is a link. Conversely, any link is equivalent to a closed braid. '.4. 1 -A z -A J -A, -As IT T T T T il J I A-kH / B 1..JhJl. 3.Jl']1.S Fig. 6 Fig. 7 The product 3132 of two braids 31 and 32 is defined as the braid obtained by connecting 31 and 32 as shown in Fig. 8. If 31  3'1 and 32  32, then 31 32  3'1 32' Hence we can define the product [31J [51J = [3132J of equivalence classes of braids of nth order. The totality of [3J forms a group 3n called the braid group of order n. 3n is generated by the equivalence classes O"i (i = 1,2, '" , n - 1) of braids as shown in Fig. 9(a); note O"i- 1 shown in Fig. 9(b). The tfundamental relations between {O"i} are Sji = 1, where Sji=O"j- 1 0"i- 1 0"jO"i or O"j- 1 0"i- 1 0"j- 1 0"iO"jO"i according as Ij-il 2 or li- jl = 1. Suppose that we are given braids 31,32 represented as products of O"i' Then 31 and 32 are equivalent if and only if the two products represent the same element in 3n. The problem of deciding whether 31 and 32 are equivalent reduces to the tword problem in 3n. On the other hand, the problem of deciding whether or not two closed braids are equivalent reduces 876 , ,+1 rnJJJIX[ , 0 0 : : : (a) , 0 , , 0 , rrmm (b 1 Fig. 8 Fig. 9 to the ttransformation problem in n ( 161 Free Groups), The theory of braids was initiated and de- veloped by E. Artin, who also gave a solution to the word problem in 3n (Abh. Math, Sem. Univ. Hamburg (1926)). The transformation problem was solved by F. A. Garside (Quart. J. Math., 78 (1969)). By taking an arbitrary surface F instead of a disk D 2 , we have a braid group over F, de- noted by Bn(F). The structure of the group Bn( F) has been studied since 1962. A presenta- tion of Bn(F) for any 2-manifold is known. In particular, Bn( F) has no elements of finite order except when F is a 2-sphere or a projec- tive plane. Following the discovery of a deep connection between Bn(F) and the mapping class group, the study of Bn(F) has become quite important [9]. G. Higher-Dimensional Knots The problem of knots, that is, the problem of placement of simple closed curves in R 3 , is extended to the problem of placement of q- dimensional spheres in p-space RP or in a p- dimensional sphere SP. In this section, the explanation is restricted to the case of tcombinatorial manifolds and tPL homeomorphisms between them. A simi- lar theory can be developed for other cate- gories ( 114 Differential Topology). Let D n be the n-dimensional disk {(XI' X 2' ..., xn)ERn I Ix;! .s; 1, i= 1,2, "', n}. We identify D n with the subspace D n x {O} of D n + 1 . If sq is a subcomplex of SP, then (SP, sq) (p > q) is called a (p, q) sphere pair or (p, q)-knot. If a q- dimensional cell Bq is a subcomplex of BP and if Bq n BP = Bq, the boundary of Bq, then (BP, Bq) is called a (p, q) ball pair or (p, q)-ball knot. Two pairs X =(XP,X q ) and Y=(YP, yq) are said to be homeomorphic if there is a homeo- morphism h:XP--> YP such that h(X q )= yq. We classify (p, q) ball pairs and (p, q) sphere pairs into equivalence classes via homeo- 
877 morphisms. A (p, q) ball pair (BP, Bq) is said to be unknoued (or flat) if it is homeomorphic to the standard pair P,q = (DP, Dq). A (p, q) sphere pair (SP, sq) is said to be unknoUed if it is homeomorphic to (iJDq+1, iJDP+1). E. C Zeeman showed that if p - q;" 3, then the (p, q) ball pair and the (p, q) sphere pair are both unknotted (Ann. Math., (2) 78 (1963)). Similar results have been obtained by Stallings (Ann. Math., (2) 77 (1963)). Because of these results, the only interesting case is p=q+2. We as- sume further that sq is locally flat in Sq+2. Let Kq be a q-knot, i.e., a q-dimensional sphere, in Sq+2, and let G(Kq)=n 1 (Sq+2_Kq), the group of Kq. Since G(Kq)/G'  Z, we can define the Alexander matrix, the Alexander ideal, and the Alexander polynomial (t) of Kq, as we did in Section C. An alternative descrip- tion of these invariants expressed as invar- iants of homology groups of the infinite cyclic covering space of sq+ 2 - Kq can be found in 1. Levine, Ann. Math., (2) 84 (1966), There are some discrepancies between I-knot theory and q-knot theory, q;" 2: (1) It is not known whether ndS 4 - K2)  Z implies that (S4, K2) is unknotted (unknotting conjecture). (2) ndS q + 2 - Kq) can have an element of finite order. (3) (I)==:t 1, but (t) may not be symmetric. (4) n2(S4-K 2 ) may not be trivial. Although the characterization problem of the I-knot group n 1 (S3 - K) has not yet been solved, the same problem for the q-knot group has been completely settled by M. A. Kervaire (1963) as follows. Let G be a finitely presented group. Then G is the group of some knot Kq in Sq+2, q;,,3, if and only if (i) G/G'Z, (ii) H 2 (G; Z)=O, and (iii) there is an element x in G whose set of conjugates generates G. These conditions are satisfied by any q-knot group, q ;"1, but they are not sufficient for I-knot groups. For q = 2, conditions (i)-(iii) are suffi- cient for G to be a 2-knot group in a thomo- topy 4-sphere. A q-knot Kq in Sq+2 (= iJDq+3) is called null- cobordant if Kq is the boundary of a locally flat embedded disk in Dq+3. The concept of knot cobordism was introduced by Fox and Milnor in 1957 (Bull. Amer. Math. Soc., 63 (1957)) for I-knots and was readily extended to q-knots in Sq+2. Knot cobordism is a weaker equiv- alence than isotopy, but the set of cobordism classes forms an Abelian group C q under the connected sum (joining the knotted spheres by a tube) or the product for I-knots. Any I-knot K that represents zero element in C 1 is called a slice knot. If K is a slice knot, then (t) must be of the form f(t)f(t -1) for some integer polynomial (Fox and Milnor (1957)) and (J(K)=O [8]. For all q ;,,1, C 2q - 1 is not finitely generated, while C 2q = 0 (Kervaire (1964)). 236 Kronecker, Leopold H. Miscellaneous Results Results on wild knots can be found scattered throughout the mathematical literature since 1945 Many strange things can happen with wild knots or wild arcs. For example, the simple closed curve K (Fig, 10) bounds a disk, but n 1 (S3 - K) is not Abelian. The placement problem of graphs in R 3 has also been treated, for instance by Kinoshita (Pacific J. Math., 47 (1975)).  ';JjjJJ::::. Fig. 10 References [1] H. Seifert, Ober das Geschlecht von Knoten, Math. Ann., 110 (1934),571-592, [2] C. D. Papakyriakopoulos, On Dehn's lemma and the asphericity of knots, Ann. Math., 66 (1957), 1-26. [3] K. Reidemeister, Knotentheorie, Erg. Math., Springer, 1932 (Chelsea, 1948). [4] D, Rolfsen, Knots and links, Math. lecture note 7, Publish or Perish, 1976. [5] R. H. Fox, A quick trip through knot theory, Topology of 3-Manifo1ds and Related Topics, M, K. Fort, Jr. (ed.), Prentice Hall, 1962,120-167. [6J R. H. Crowell and R. H. Fox, Introduction to knot theory, Ginn, 1963 (Graduate Texts in Math., Springer, 1977). [7] L. Neuwirth, Knot groups, Ann. math. studies 56, Princeton Univ. Press, 1965. [8J K, Murasugi, On a certain numerical invariant of link types, Trans. Amer. Math. Soc., 117 (1965),387-422. [9] 1. Birman, Braids, links, and mapping class groups, Ann. math, studies 82, Princeton Univ. Press, 1974, [!OJ H. Seifert and W. Threlfall, Lehrbuch der Topologie, Teubner, 1934 (Chelsea, 1965); English translation, A textbook of topology, Academic Press, 1980. [11] K. Murasugi, Knot theory-old and new, preprint, Univ, of Toronto, 1980. 236 (XXI.31) Kronecker, Leopold Leopold Kronecker (December 7, 1823- December 29, 1891) was born in Liegnitz near 
236 Ref. Kronecker, Leopold Breslau in Germany (now Legnica in Poland). He entered the University of Berlin in 1841 but studied at various universities throughout the country, finally studying under E. E. Kum- mer at Breslau. In 1845, he received his doctor- ate with a thesis on units of talgebraic num- ber fields. Then he succeeded to his uncle's business in the management of banks and farms, which kept him away from publishing mathematical papers for eight years. In 1853, he published a paper on algebraic solution of equations, containing the assertions that Abelian extensions of the rational number field are contained in cyclotomic fields and that Abelian extensions of imaginary quadratic fields can be obtained using tcomplex multi- plication. The latter is called "Kronecker's dream in his youth." It remained a conjecture until it was solved by means of tclass field theory. He gave lectures at the University of Berlin, first in his capacity of academician, then as a professor, succeeding his teacher Kummer in 1883. His statement that mathe- matics as a whole should be based solely on the intuition of natural numbers ( 156 Foundations of Mathematics) often brought on disputes with his colleague K. tWeierstrass. His rejection of the bold reasoning of tset theory produced anxieties for G. tCantor. His famous statement, "Natural numbers were made by God; the rest is the work of man," can be put in contrast with the liberal state- ments of Cantor and R. Dedekind. Kronec- ker's works and lectures ranged widely over the theory of numbers, algebra, and analysis. His contribution to the theory of elliptic func- tions and his tlimit formula for zeta functions are well known. He also did pioneering work in topology, References [IJ K. Hensel (ed.), Leopold Kronecker's Werke I-V, Teubner, 1895-1931 (Chelsea, 1968). [2J H. Weber, Leopold Kronecker, Jber. D. M. V., 2 (1891-1892), 5-31. 237 (IX.15) K -Theory A. General Remarks K-theory was introduced by M, F. Atiyah and F. Hirzebruch after the original idea was sug- gested by A. Grothendieck. The tBott period- icity theorem is essential for the development of the theory. There are important applica- 878 tions of K-theory to tdifferential and alge- braic topology, such as the Riemann-Roch theorems for differentiable manifolds (Atiyah and Hirzebruch [6, 7J), the solution of the vector field problem on spheres (J. F. Adams [IJ), applications to immersion and embed- ding problems ( 114 Differential Topology), a simple proof of the solution of the Hopf in- variant problem, and the determination of J- images in stable homotopy groups of spheres ( 202 Homotopy Theory). A review of alge- braic K-theory is also included in this article. B. Construction of KA(X) Let the basic field A be the real number field R, the complex number field C, or the quater- nion field H. Let A' be the tcenter of A, and X be a compact Hausdorff space. Then G'A(X) denotes the set of all isomorphism classes of A- tvector bundles over X and is a commutative semigroup under the tWhitney sum  EB I}. Let FA(X} be the free Abelian group generated by the set G'A(X), and let QA(X) be the subgroup of FA(X) generated by the elements of the form  EB I} -  -I}, Then the Abelian group KA(X) is defined as the quotient group K,,(X)= F,,(X)/QA(X), KA(X) is called the Grothen- dieck group or K-group, and such a construc- tion is called the Grothendieck construction. From this we obtain the canonical mapping O:G'A(X)C FA(X)-->KA(X), which is a homo- morphism of the semigroups. Moreover, the pair (KA(X), 0) is universal in the following sense: Given an Abelian group G and a semi- group homomorphism g: G'A(X)-->G, there exists a unique group homomorphism h: KA(X)-->G such that g=hoO. We call h the extension of g. Let f: X --> Y be a continuous mapping from X into another compact Hausdorff space Y. Then for I}EG'A(Y), the tinduced bundle f*(I})E G'A(Y) is defined. Since the mapping f* :G'A(Y) -->6'A(X) is a semigroup homomorphism, it induces a homomorphism KA(f):KA(Y)--> KA(X), which is the extension of Oof*, so that KA(/)oO=Oof*. Usually, KA(f) is also denoted by f*. Thus K A is a tcontravariant functor. According to whether A = R, C, or H, the notations KG, K, or KSP are often used for K A . If X = {x o }, the semigroup homomorphism 6'A(xo)3(-->dimAEZ induces an isomorphism KA(XO)Z, If X is a finite CW complex with base point X o , then the reduced group KA(X) is defined to be the kernel of i*, where i:xo-->X is the inclusion. Then we have the canonical splitting KA(X)ZEB KA(X), and K A is a functor defined on the category of pointed finite CW complexes. 
879 Two isomorphism classes ( and IJ of 6"A(X) are said to be stably equivalent if there exist trivial bundles 01 and O 2 such that ( EB 01 = IJ EB O 2 , An equivalence class with respect to this relation is called a stable vector bundle. If X is connected, then the set of all stable A- vector bundles can be identified with KA(X). When A = R or C, the ttensor product of vector bundles induces a ring structure on KA(X), andf* becomes a ring homomorphism. The complexification of real vector bundles --> EB R C = i() is a semigroup homomor- phism 6"R(X)-->6"dX) and induces a ring homomorphism i: KO(X)-->K(X) such that ioO=Ooi. If E6"H(X), then  can be viewed as a complex vector bundle under the scalar restriction of basic field, which we shall denote by p()E6"dX). The mapping P:6"H(X)--> 6"dX) induces a homomorphism p: KSP(X)--> K(X). Similarly, the scalar restriction from C to R induces a homomorphism r: K (X)--> KO(X). All these are tnatural transformations, Let  be a complex vector bundle. We can formally write the tChern class ( 56 Charac- teristic Classes) c() of  as c()= TI (1 + x;). Then the Chern character Ch()EH*(X; Q) is defined by ch@=Lexpx i , where Q is the field of rational numbers. The mapping ch: 6"dX)-->H*(X; Q) is extended to a ring homomorphism ch: K (X)--> H*(X; Q). We denote by the same notation ch the ring homo- morphisms choi: KO(X)-->H*(X; Q) and cho p: KSP(X)-->H*(X; Q). These are natural transformations from the functor K A to the functor H*( ; Q). C. Cohomology Theory 0A(n) denotes O(n), U(n), or Sp(n) according as the basic field A is R, C, or H. Let OA be the tinductive limit group with respect to the usual inclusion 0A(n)c:OA(n+ 1). Provided with the weak topology, the group OA becomes a tCW complex, The set of all equivalence classes of stable A-vector bundles corresponds bijec- tively to the set of tprincipal 0 A -bundles. Let B A be the tclassifying space for the group 0A, X, Y be finitc CW complexes with base points, [X, y] be the set of all thomotopy classes of mappings from X to Y, and [X, YJo be the set of all thomotopy classes in the tcategory of pointed topological spaces. Then by the classi- fication theorem of fiber bundles ( 147 Fiber Bundles), we have KA(X)=[X, Z x BAJ and KA(X)=[X,Z x BAJo' The space B A has the structure of a weak tH-space, so that the 237 D K -Theory induced group structure of the homotopy set [X, Z x BAJ coincides with that of KA(X). Iff is a continuous mapping, then the induced homomorphism f* of the homotopy set [X, Z x BAJ coincides with that of KA(X). For a finite CW pair (X, A), we put Kn(x, A) = KA(sn(X/A)), n=O, 1,2,..., where X/A is the space obtained from X by collaps- ing A to a point that becomes the base point of X/A, and sn denotes the tn-fold reduced suspension. This gives rise to a cohomology theory (indexed by nonpositive integers) ( 201 Homology Theory, 202 Homotopy Theory), The tensor product of vector bundles in- duces the following pairing, called the cross product: Km(x, A) @ Kn(y, B) -->K,(m+n)(x x Y,X x BUA x Y). When A = R or C, the cup product Km(x) @ Kn(X)-->K(m+n)(x), K,\m(x)@ K,\n(x, A)-->K(m+n)(x, A) is defined as the composite of the cross prod- uct and the induced homomorphism *, where : X --> X x X is the diagonal map- ping. The comp1exification i: KO -n(X, A)--> K -n(x, A) preserves the cup product. The composite of ch: Kn(X, A)-->H*(sn(x/ A); Q) and the tsuspension isomorphism H*(sn(x / A); Q)-->H*(X, A; Q) is denoted by ch: Kn(X, A)-->H*(X, A; Q). The homomorphism ch preserves the cup product when A=R or C. D. Bott Periodicity Let A be the tcanonical A-Ene bundle over the A-projective line, The elements gc = O(d -1 EK(S2)= K- 2 (x), r-.,., gH=O(H)-1 EKSP(S4)= KSP- 4 (x), and gR = gH x gH(cross product) E KO(S8)= KO- 8 (x) are called Bott generators. The Bott periodicity theorem [1 OJ is as follows: (1) K(S2), KSP(S4), and KO(S8) are infinite cyclic groups generated by gc, gH, and gR, respectively. Moreover, ch(gc) = 0"2, ch(gH) =0"4, and ch(gR) = 0"8, where O"nEHn(sn;z) is a 
237 E K -Theory generator. (2) The cross products K-n(X, A) @ K -2(x)-->K -(n+2)(x, A), Ksp-n(x, A) @ KSP-4(X)-->KO -(n+4)(X, A), KO -n(x, A) @ KO -s(x)-->KO -(n+S)(x, A) are isomorphisms. The isomorphisms K-n(X, A)  K -(n+2)(x, A), Ksp-n(x, A)  KO -(n+4)(X, A), and Ko-n(X,A) Ko-(n+S)(X,A), defined by a-->a x gA, are called Bot! isomor- phisms. They commute with f*, the tcobound- ary operator b, and ch. Identifying K -n with K-(n+2), Ksp-n with Ko-(n+4), and Ko-n with Ko-(n+S), we get periodic cohomology theories K* = LnEz, K n and KO* = LneZ g KO n , which are multiplicative cohomology theories, and ch is a multiplication-preserving homomorphism into H*( ; Q). The cohomology of a point K,\n(x) = KA(sn) =nn(B A ) = nn-dOA) is given by Bott [11] ( Appendix A, Table 6.IV). E. Operations We assume that A is R or C. The exterior powers A q are basic operations in KA(X), For E6"A(X), the pth +exterior power of, AP()E 6"A(X), has the following properties: AO()= 1, A I()= (and AP( EB 1]) = Lq+rp).q() @ Ar(). Let G be the multiplicative group consisting of the formal power series EKA(X){t} whose constant term is 1. The assignment -->A,()= L Aq()tq gives rise to the homomorphism 6"A(X)-->G. Let ..1.,: KA(X)-->G be its extension. The operation Aq:KA(X)-->KA(X) is defined bY A,(x) = Lo<;qAq(xW. An important series of operations Ij;k, called the Adams operations, is derived from the exterior powers. Put dA,(x) I Ij;_,(x)= -t--EKA(X){t}, dt },,(x) and define Ij;k: KA(X)-->KA(X) by Ij;,(x) = LO<;k Ij;k(X)t k . When A = C, define Ij; -I as the extension of -->, where  is the tconjugate complex vector bundle of. The operation Ij;k is a ring homomorphism preserving 1, and the relation Ij;kolj;l=lj;kl holds. If E6"A(X) is a line bundle, then Ij;k(O()) =(e()t If xE KA(X) and ch(x) = Lnchn(X), where Ch n (X)EH 2n (X; Q), then Ch(lj;k(X)) = L knchn(x). n The operations A q and Ij;k commute with the complexification i:KO(X)-->K(X). If fJc:K(X) 880 -->K(S2 X) and fJR: KO(X)--> KO(Ss X) are Bott isomorphisms, then Ij;k 0 fJc = kfJc 0 Ij;k and Ij;k 0 fJR = k 4 fJR 0 Ij;k [1]. F. Thom-Gysin Isomorphisms Let  be a real oriented vector bundle of di- mension n over a finite CW complex X. A reduction of the structure group SO(n) of the bundle  to its universal covering group Spin(n) is called a spin structure of. The bun- dle  admits a spin structure if and only if W2()=0, where W2() denotes the second tStiefel- Whitney class of . The bundle  equipped with a spin structure is called a spin bundle. The bundle  is called a c I -bundle or Spinc bundle if there is given a cohomol- ogy class CI()EH2(X;Z) such that W2()= C, () mod 2. Let X{ be the tThom complex of the vector bundle . The group K(X{) has the structure of a K(X)-module. If we assume that  is a c, -bundle when A = C and a spin bundle when A = R, then there exists a canonical K(X)-module isomorphism <p: K(X)-->K+dim{(x{) such that ch<p(I) =<p'«,"()exp(cI()/2))-I) when A=C and ch<p(I)=<p'(d()-I) when A=R [6]. Here, <p': H*(X; Q)-->H*(X'; Q) is the usual tThom- Gysin isomorphism, and si() is the d- characteristic class of the bundle  defined as follows: Write the tPontryagin class p@ of  formally as p() = II (1 + xl); then the class si() is given by ,ci() = fl (x i /2)/(sinh(x;/2)). If  is a complex vector bundle, then its first tChern class CI() gives a cl-bundle structure to r()E 6"R(X), In this case, the class ,'1@= ,J41'( )exp(c 1/2) is the Todd class of the com- plex vector bundle . G. Riemann-Roch Theorems for Differentiable Manifolds Let M and N be connected closed differenti- able manifolds. A continuous mapping f: M -->N is called a spin mapping (spin map) if wl(M)=f*wl(N) and w 2 (M)= f*w 2 (N). If wdM)= f*wdN) and there is given a class c, EH 2 (M; Z) such that w 2 (M)- f*w 2 (N)= c 1 (mod 2), f is called a c I-mapping. If we as- sume that f is a c I-mapping when A = C and a spin mapping when A = R, then there is a canonical homomorphism f,: K(M)-->K+dimN-dimM(N) such that f,( {*(x). y) = x' f,(y) 
881 and ch(f,(y))d(N) = f,(ch(y)d(M)exp(c l /2)) for YEKh,(M) and xEK(N). This is the Riemann-Roch theorem for differentiable mani- folds [6] ( 366 Riemann-Roch Theorems). In the second formula, the homomorphism j,:H*(M;Q)-->H*(N;Q) on the right-hand side is the usual tGysin homomorphism, and if A=R we set C l =0, The homomorphism f, de- pends only on the homotopy class of f and has the usual functorial properties 1, = 1 and (/og)= f,og,. H. The Atiyah-Singer Index Theorem Let X be an n-dimensional compact differenti- able manifold of class Coo without boundary. As we shall see later, any elliptic differential operator (or, more generally, any elliptic com- plex) d on X has analytic index inda(d) and topological index ind,(d), the latter of which is deeply related to K-theory. The Atiyah-Singer index theorem asserts that inda(d) = ind,(d) [8]. We shall describe the details of the definitions and the theorem. Let E and F be complex vector bundles of class Coo over X with dim E = s and dim F = t ( 147 Fiber Bundles). Let r(E) and r(F) be the linear spaces over C consisting of Coo-cross sections of E and F, respectively. A linear mapping d from r(E) to r(F) is called a dif- ferential operator of the kth order if d is locally expressed by some differential operator of the kth order. This means that if we choose a local coordinate neighborhood U of X and triviali- zations of E and F on U such that E I U  U x C' and FlU  U X C', then d is a differential operator of the kth order from COO( U, CS) to C OO ( U, C'). Thus d is locally expressed by the matrix form (LI'I';;kai,j)(x)D') (i= 1, ..., t;j= 1, ... , s), each component of which is a differen- tial operator ( 112 Differential Operators). Using this expression we define the symbol O"(d) of d as follows: Let T*(X) be the cotan- gent bundle of X. Given any 'lxE Tx*(X), put O"(d)('lx) = (LI'Ik a,j)(x)'l), where 'l stands for 'l1 .. .'l" for any multi-index ex = (ex l , ... , exn) and for a local coordinate expression 'lx = ('l, ..., 'l). We call O"(d) the principal symbol of d. Now a differential operator d is called ellip- tic if for each 'lx#O the linear mapping O"(d)('lx) gives an isomorphism from Ex onto Fx. For the elliptic differential operator d, we have dimKerd< 00 and dimCokerd< 00 [5]. The analytic index inda(d) is defined to be the inte- ger dim Ker d - dim Coker d, and it has the characteristic property that inda(d) is invariant under deformation of d, More generally, an elliptic complex ,g on X 237 H K -Theory and the analytic index inda(,g) can be defined as follows: Given a finite number of smooth complex vector bundles {EJil, ,/ on X and differential operators d i : r(E;)--> r(Ei+i)' we call ,g = {Ei' di}i I, h ,/ an elliptic complex on X if the following two conditions are satisfied: (i) di+i odi=O; (ii) for any 'lxE Tx*(X), 'lx #0, h .. b I E a(d,Jlnx) t e (pnnclpal) sym 0 sequence --> i,x---> Ei+l,x--> is exact. For an elliptic complex ,g, we have dimHi(,g)=dim(KerdJImd i _ I )< 00 [5]. The integer L( -1)idimHi(,g) is called the analytic index of ,g. An elliptic complex with the form 0--> r(E) r(F)-->O is an elliptic operator. An important example of an ellip- tic complex arises from de Rham theory ( 105 Differentiable Manifolds Q): Take E i = NT*(X) and the exterior differentiations as differential operators. The elliptic complex thus obtained is the de Rham complex. The topological index ind,(,g) of the elliptic complex,g is introduced in the following way: By virtue of the exactness for 'lx # 0, the sym- b I E a(d,Jlnx) E d . o sequence --> i,x-----> i+l,x--> etermmes a definite element [O"(d)J of K(X'), where X' is the tThom complex associated with the cotangent bundle of X. Embed X in some Euclidean space R N . Then the mappingj: T*(X)--> T*(R N )  R 2N canonically induces the homomorphism j,: K (X')--> K«RNj')  K(S2N)  Z, andj, is obtained as in Section G fromj by using a canonical complex vector bundle struc- ture of T*(X) in T*(R N ). We set ind,(,g)= j,[O"(,g)J and call this the topological index of ,g. We have ind,(,g) = ch([ O"(d)] ),r(X) [X'J, where ch([O"(d)J) (E H*(X'; Q)) is the Chern character of O"(d), .9"(X) (E H*(X; Q)) is the Todd class of T(X)@C, and [X'] is the funda- mental cycle of X' [8]. The Atiyah-Singer index theorem, in general form, asserts that inda(,g) = ind,( ,g). For the de Rham complex E, it follows from the definition that ind,(E) is equal to the Euler characteristic of X. Let X be a compact complex manifold and W be a complex analytic vector bundle on X. Applying the theorem to the Dolbeault complex with value in W...-->Ao.i(W)i. AO.i+i(W)-->... ( 72 Complex Manifolds), we can conclude that Hirzebruch's formulation of Riemann-Roch theorem ( 366 Riemann- Roch Theorems B) holds not only for projec- tive algebraic manifolds but also for compact complex manifolds. Moreover, from the index theorem we can deduce the Hirzebruch index theorem ( 56 Characteristic Classes G) and various integrability theorems [8]. Any char- acteristic number which takes integral values on all oriented manifolds or weakly almost complex manifolds can be derived from the 
237 I K -Theory Atiyah-Singer index theorem (R. Stong and A. Hattori). The Hirzebruch index theorem and the Atiyah-Singer index theorem can be extended to compact manifolds with boundary in the framework of Riemannian geometry, giving a formula analogous to the Gauss-Bonnet theorem for manifolds with boundary [9]. Another generalization of the theorem is its equivariant version. Let X be a compact Hausdorff space on which a compact Lie group G acts, i.e., a tG-space. A real or com- plex vector bundle n: E--> X is called a G-vector bundle if E is a G-space, n is an tequivariant mapping and G-actions are fiberwise linear. The set of isomorphism classes of G-vector bundles over X is an additive semigroup with respect to Whitney sums, and the Grothen- dieck construction on this semigroup gives an Abelian group, which is denoted by KG(X) or K 0dX), depending on the scalar field of bun- dles, called equivariant K-group of X. KG(X) and KOG(X) have commutative ring structures given by the tensor products, Now consider the case where X is a one-point space pt. Then a G-vector bundle over pt is a finite- dimensional complex or real linear represent- ation of G. Therefore KG(pt) or KOG(pt) is the Grothendieck group of isomorphism classes of linear representations. This group also has a ring structure. It is called the representation ring ofG and is denoted by R(G) or RO(G), respectively. When X = G/H, a homogeneous space of G by a closed subgroup H, the iso- morphism KG(G/H)R(H) (or KOG(G/H) RO(H) holds true. Let X be a compact smooth G-manifold and E and F smooth complex G-vector bundles over X, i.e., G acts smoothly on X, E, and F. A differential operator d: r(E)--> r(F) is called equivariant when d commutes with induced G- actions on r(E) and r(F), respectively. Sup- pose that d: r(E)--> r(F) is an equivariant elliptic differential operator. As r(E) and r(F) are G-modules, Kerd and Cokerd are finite- dimensional G-modules, and the analytic index of d is defined by ind.(d)= Kerd -CokerdE R(G). An argument parallel to that for the inequivar- iant case defines the topological index of d as ind,(d)E KG(V) KG(pt)  R(G), where V is a suitable finite-dimensional com- plex G-module. Both analytic and topological indices are also generalized for equivariant elliptic G-complexes, and the equivariant Atiyah-Singer index theorem asserts that ind a (8') == ind,(8')E R(G) for any elliptic G-complex g [8]. 882 The equivariant Atiyah-Singer index theo- rem has a close relation with the Lefschetz fixed-point theorem (of Atiyah-Bott [5]) and generalizes the Hirzebruch index theorem (also called the Hirzebruch signature theorem) to its equivariant form, the so-called G-signature theorem ( 153 Fixed-Point Theorems). I. J-Groups and the Adams Conjecture Let ( and IJ be real vector bundles over a finite CW complex X.  and IJ are called fiber homo- topy equivalent if there exist fiber-preserving mappings f: S()-->S(IJ), g: S(IJ)-->S() between associated sphere bundles, and fiber-preserving homotopies go f""- 1, fog ""- 1.  and IJ are called stably fiber homotopy equivalent if there exist trivial bundles nand m such that ( EB n and IJ EB m are fiber homotopy equivalent [24]. Stable fiber homotopy equivalence is an equivalence relation, and the set of all stable fiber homotopy equivalence classes of real vector bundles over X is denoted by J(X) and is called the J-group of X. J(X) is an Abelian group with addition induced by Whitney sums of vector bundles, and we can express J(X)= KO(X)/T(X) as a quotient group. The natural projection J: KO(X)-->J(X) = KO(X)/T(X) is called the J -homomorphism. When X = S\ the k-sphere, the J-homomorphism can be identified with the classical stable J- homomorphism J: nk-dO)-->ne-l of Hopf and Whitehead ( 202 Homotopy Theory). Adams [2] proposed to compute J(X) by introducing two factor groups J'(X) and J"(X) of KO(X) in such a way that these are comput- able and that the epimorphisms J"(X)-->J(X)--> J'(X) hold whenever the following conjecture is true: Adams conjecture. Let k be any integer and YEKO(X). There exists a nonnegative integer e=e(k,y) such that J(k e (ljJk-l)y)=O. Adams [2] proved this conjecture for line and plane bundles. In 1970, D. G. Quillen [25J proved this conjecture in its full generality. By intensive use of the Brauer induction theo- rem, Quillen reduced the problem to the case of bundles with finite structure groups and then to the case of line or plane bundles where Adams' proof [2] applies. Since then, many different proofs of the conjecture have appeared. Adams' theory on J(X) and Quillen's theo- rem (Adams conjecture) are utilized in deter- mining completely the J-images in stable homotopy groups of spheres ( 202 Homo- topy Theory). 
883 J. Algebraic K-Theory Algebraic K-theory is a branch of algebra concerned mainly with a series of Abelian group valued functors Kn of rings (and, more generaIly, of certain categories), which have certain features of generalized homology theory. It originated in the K-group construc- tion used in Grothendieck's work on the Riemann-Roch theorem. The theory was ini- tiated in the early sixties by H. Bass, who in- troduced Kl and extensively studied Ko and Kl in coIlaboration with other researchers [15 18]. Then K 2 was introduced by J. Milnor [26J, and higher K-theories were constructed by QuiIlen and others from various view- points [22, I]. There is also a K-theory with respect to Hermitian structure [22, III]. Algebraic K-theory is intimately related to various other branches of mathematics, such as topology, algebraic geometry, and number theory. The Grothendieck group Ko(A) of a ring A is the Abelian group generated by the set of isomorphism classes [PJ of finitely generated tprojective A-modules subject to the relation [P EB p'J = [PJ + [P'J for every pair of projec- tive modules P, P'. If A is finitely generated as a Z-algebra, then Ko(A) is a finitely generated group. The assignment n-->n[AJ defines a homomorphism Z-->Ko(A) whose cokernel is the tprojective class group of A. If A is commutative, a similar construction for the category of rank 1 projective A-modules with res pect to the tensor prod uct (8) leads to the Picard group Pic(A). We then have an epimor- phism Ko(A)-->Pic(A) defined by P-->A'P, where P is of rank r [16]. If X is a compact Hausdorff space, the topological K(X) is iso- morphic to the algebraic Ko(A), where A is the ring of complex-valued continuous func- tions on X. The Whitehead group Kl (A) is defined as follows_ Let GL(A) be the direct limit of the sequence ...-->GLn(A)GLn+l(A)-->..., where in(X)=( } Let En(A) be the subgroup of GLn(A) generated by all elementary matrices 1 +ae ij (i#j,aEA), where the eij are matrix units. The limit E(A) of En(A) coincides with the commutator subgroup of GL(A). Now define K1(A)=GL(A)jE(A) [15,18]. For A= Zn, the integral group algebra of a group n, the cokernel of the natural homomorphism :t n--> KI (A) is denoted by Wh(n). The tor- sion invariant of 1. H. C. Whitehead is de- fined in Wh(n) [20]. If A is commutative, we put SKI (A)=SL(A)jE(A), where SL(A)= limSLn(A), By the determinant homomor- --> phism, we have K1(A)=,SK1(A)EB U(A), where U(A) is the group of units of A. SKI (A)=O 237 J K -Theory when A is a field or a local ring A. A deeper result states that SK 1 (A) = 0 for the ring of integers of an algebraic number field (Bass, Milnor, and Serre [17J). This and related re- sults have been applied to investigate SK, (Zn) for a finite group n. We define K 2 (A)==H 2 (E(A);Z) (the Schur multiplier of E(A)) [21]. This yields a universal central extension of E(A), defined by 0-->K 2 (A) -->St(A)-->E(A)-->O, where St(A)=IStn(A) and Stn(A) (n?o 3) is the Steinberg group gen- erated by xij(a) (EO A; i,j = 1, ..., n, i # j) subject to the relations (i) xij(a)xij(b) =xij(a + b), and (ii) the commutator (xij(a), xk/(h)) equals xi/(ab) for j=k, i#l; and equals I for j#k, i#l. Let F be a field. A bimultiplicative mapping s: F* x F*-->C (C an Abelian group) satisfying s(x, I - x) = 1 (x #0, I) is called a (Steinberg) symbol on F. There exists a universal symbol F* x F*-->K 2 (F) which, followed by homomor- phisms K 2 (F)-->C, yields all C-valued symbols on F. Since certain Steinberg symbols, such as the tHilbert symbol, are important in number theory, the group K 2 (F) of a global or local field F is intimately related to the arithmetic of F. If F is an algebraic number field and R its ring of integers, we have an exact sequence 0 -->K 2 (R)->K 2 (F)-dL p (Rjp)*->0 (the next-to- last term being the direct sum over all prime ideals p of R), and K 2 (R) is a finite group. Quillen [26,27J defined higher algebraic K- theory based on the following topological construction. Let X be a CW complex given with a perfect normal subgroup N of n l (X). Attaching 2- and 3-cells suitably to X, he con- structed a complex X+ and a mapping f:X--> X+ so that ndf) is epimorphic, Kerndf)= N, and f*: H*(XJ*L) ,;>:;H*(X+, L) for any local coefficients L over X +. (X + J) is univer- sal for pairs (Y, g), g: X --> Y, satisfying ndg)(N) = 0, i.e., there exists a mapping 9 + : X + --> Y unique up to homotopy such that g+ of g. Quillen applied the above construction to X=BGL(A) and N =E(A), and defined K,,(A) =nn(BGL(A)+) for n?o 1 (the first definition). The universality implies that the inclusion E(A)cGL(A) induces a mapping BE(At--> BGL(A)+ which is the same as the universal covering mapping up to homotopy, which implies that K 2 (A) ';>:; H 2 (E(A); Z), i.e., Quillen's K 2 coincides with Milnor's. It is also known that K 3 (A),;>:;H 3 (St(A);Z). QuilIen [22, I] defined higher algebraic K- theory for certain additive categories endowed with a class of short exact sequences, called exact categories. For an exact category M, he defined another category QM having the same objects as M but with morphisms changed. Making use of Segal's classifying spaces B of categories, he defincd Ki(M)=n i + 1 (BQM, 0), where 0 is a zero object of M (the second de- 
237 Ref. K -Theory finition). When M = PA, the exact category of finitely generated projective A-modules, Ki(A) Ki(P A ) [23], whereby Quillen intro- duced the notions of symmetric monoidal categories S and their localizations S-I Sand showed the homotopy equivalences BS-1S A  Ko(A)xBGL(A)+ andQBQPABS-ISA' where SA = Iso P A is the subcategory of P A whose morphisms are all isomorphisms of PA- The second definition is used to generalize many classical results in algebraic K-theory to higher K-theory [22, I]. References [1] J. F, Adams, Vector fields on spheres, Ann. Math., (2) 75 (1962), 603-632. [2] 1. F. Adams, On the groups J(X) I-IV, Topology, 2 (1963),181-195; 3 (1965),137- 171,193-222; 5 (1966), 21-71. [3J M. F. Atiyah, K-theory, Benjamin, 1967. [4] M. F. Atiyah and R. Bott, The index prob- lem for manifolds with boundary, Bombay Colloquium on Differential Analysis, Oxford Univ. Press (1964),175-186. [5] M. F. Atiyah and R. Bott, A Lefschetz fixed point formula for elliptic complexes I, Ann. Math., (2) 86 (1967), 374-407. [6] M. F. Atiyah and F. Hirzebruch, Riemann- Roch theorems for differentiable manifolds, Bull. Amer. Math. Soc., 65 (1959), 276-281. [7] M. F. Atiyah and F. Hirzebruch, Vector bundles and homogeneous spaces, Amer. Math. Soc., Proc. Symposia in Pure Math. (1961),7-38. [8] M. F. Atiyah and I. M. Singer, The index of elliptic operators I-III, Ann. Math., (2) 87 (1968), 484-604. [9] M. F. Atiyah, V. K. Patodi, and I. M. Singer, Spectral asymmetry and Riemannian geometry I, Math. Proc. Cambridge Philos. Soc., 77 (1975), 43-69. [10] R. Bott, Quelques remarques sur les theoremes de periodicite, Bull. Soc. Math. France, 87 (1959), 293-310. [11 J R. Bott, The stable homotopy of the classical groups, Ann. Math., (2) 70 (1959), 313-337. [12] R. Bott, Lectures on K(X), Benjamin, 1969. [13] P. E. Conner and E. E. Floyd, The re- lation of cobordism to K-theories, Lecture notes in math. 28, Springer, 1966. [14] E. Dyer, Cohomology theories, Benjamin, 1969. [15] H. Bass, K-theory and stable algebra, Publ. Math. Inst. HES, 22 (1964),5-60. [16] H. Bass and M. P. Murthy, Grothendieck groups and Picard groups of Abelian group rings, Ann. Math., (2) 86 (1967), 16-73. 884 [17] H. Bass, J. Milnor, and 1.-P. Serre, Solu- tion of the congruence subgroup problem for SLn(n;d) and SP2n(n2), Publ. Math. Inst. HES, 33 (1968), 59-13'i. [18] H. Bass, Algebraic K-theory, Benjamin, 1968. [19] R. Swan, K -theory of finite groups and orders, Lecture notes in math. 149, Springer, 1970. [20] 1. Milnor, Whitehead torsion, Bull. Amer. Math. Soc., 72 (1966), 358-426. [21] J. Milnor, Introduction to algebraic K- theory, Ann. Math. Studies, Princeton Univ. Press, 1971. [22] Algebraic K-theory, Battelle Inst. Con£. 1972, I-III, Lecture notes in math. 341, 342, 343, Springer, 1973. [23] Algebraic K-theory, Evanston 1976, Lecture notes in math. 551, Springer, 1976. [24J M. F. Atiyah, Thom complexes, Proc. London Math. Soc., 11 (1961),291-310. [25J D. Quillen, The Adams conjecture, Top- ology, 10 (1971),67-80. [26] D. Quillen, Cohomology of groups, Actes Congres Intern. Math., 1970, t. 2,47-51. [27] 1.-L. Loday, K -theorie algebrique et representations de groupes, Ann. Sci. Ecole Norm. Sup., 9 (1976), 309-377. 
238 Ref. Lagrange, Joseph Louis 238 (XXI.32) Lagrange, Joseph Louis Joseph Louis Lagrange (January 25, 1736- April 10, 1813) was born in Turin, Italy, and became an instructor at the military school there in 1753. In 1766, he was invited to Prus- sia by King Frederick the Great (1712-1786) and moved to Berlin, where he filled the post (formerly occupied by L. Euler) of chairman of the mathematics department in the graduate division of the University of Berlin. In 1787, he moved to Paris, where he became a pro- fessor at the recently founded Ecole Normale Superieure; he remained in France for the rest of his life. Lagrange was chiefly responsible for the establishment of the metric system. In 1795, he became the first president of the newly established Ecole Poly technique. Dur- ing the later stages of the Napoleonic Era, he was made a count. Mathematically, his position is between Euler and Laplace, and he is considered one of the major mathematicians of the late 18th and early 19th centuries. His notable achievements in analysis-the initiation of the tcalculus of variations resulting from his research in the tisoperimetric problem, the founding oftana- Iytical dynamics with the introduction of gen- eralized coordinates, and the solving of the equations now known as tLagrange's equa- tions of motion-all have a strong algebraic flavor. Lagrange attempted to base calculus on tformal power series. He also conducted re- search on the solution of algebraic equations, and his work on the tpermutation group of the roots of algebraic equations can be regarded as a forerunner of the achievements of Abel and Galois. References [IJ J. L. Lagrange, Oeuvres I-XIV, M. 1. A. Serret (ed.), Gauthier-Villars, 1867-1892. [2J J. L. Lagrange, Theorie des fonctions analytiques, contenant les principes du calcul differentiel, Paris, 1797. 239 (XXI.33) Laplace, Pierre Simon Pierre Simon Laplace (March 23, 1749- March 5, 1827) was born into a farming family in Beaumont en Auge in Normandy, France, His genius was recognized early, and in 1767 he moved to Paris, where he enjoyed the favor of 886 J. d'Alembert. He became a professor at the Ecole Normale and the Ecole Poly technique. During the Napoleonic Era, he was nominated for the post of Minister of the Interior; he later became a count. Following the fall of Napoleon, he became a marquis under Louis XVIII. He is often said to have lacked pro- fessional integrity, particularly in the matter of claiming priority, but on the other hand he sometimes showed independence of character and was generous to his pupils toward the close of his life. His achievements in mathe- matics, physics, and astronomy were so well recognized that he reached the highest social position. Laplace's achievements reached a peak in the field of analysis, which had been initiated in the 17th century and developed in the 18th century by Euler and the mathematicians of the Bernoulli family. He applied the methods of analysis to tcelestial mechanics, tpotential theory, and tprobability theory, obtaining remarkable results. Without the use of formulas and in a flow- ing and elegant literary style [3,5], he eluci- dated his various results. Concerning the origin of the solar system, in 1796 he published the nebular hypothesis-the so-called Kant- Laplace nebular hypothesis-which is famous as the predecessor of the theory of the evo- I ution of the universe. References [1] P. S. Laplace, Oeuvres completes I-XIV, Gauthier-Villars, 1878-1912. [2] P. S. Laplace, Mecanique celeste, Paris, 1798-1825; English translation, Celestial mechanics I-IV, Chelsea, 1966; V (in French), Chelsea, 1969. [3J P. S. Laplace, Exposition du systeme du monde, Paris, 1796. [4J P. S. Laplace, Theorie analytique des probabilites, Courcier, 1812. [5J P. S. Laplace, Essai philosophique sur les probabilites, Courcier, 1814. 240 (X.26) Laplace Transform A. General Remarks The notion of the Laplace transform can be regarded as a generalization of the notion of Dirichlet series. L. Euler applied the Laplace transform to solve certain differential equa- tions (1737); later, independently, P. S. Laplace 
887 applied it to solve differential and difference equations in his famous book Theorie analy- tique des probabilitl!s, vol. 1 (1812). In this cen- tury, the Laplace transform has been used to justify Heaviside's toperational calculus, and the notion has become an important tool in applied mathematics. Let ex(t) be a function of tbounded variation in the interval O.s; t.s; R for every positive R. If L(s) = 2(ex(t)) (s) = Ioo e -s' dex(t) = Jim f R e-stdex(t) R-oo 0 converges for some complex number So' then it converges for all s satisfying Res> Res o . We calI L(s) the Laplace-Stieltjes transform of ex(t). If ex(t) = I <p(u)du (where <p(u) is tLebesgue integrable in the interval O.s; t.s; R for every R), then we call L(s) = 2(<p)(s) = Ioo e-s'<p(t)dt the Laplace transform of <p(t) ( Appendix A, Table 12.1). B. Regions of Convergence Given a Laplace transform L(s) of ex(t), there exists a real number (or :t 00) O"c such that the maximal region of convergence of L(s) is the set of all s such that Res>O"c' In extreme cases, when the integral never converges, we write 0", == + 00, and when it converges every- where, we write 0",= -00. The number O"c is called the abscissa of convergence of L(s), and the Jine Re s = 0", the axis of convergence of L(s). A formula to determine the abscissa of convergence in terms of ex(t) is known. If k = limsuPhoo(log!ex(t)I)/t#O, then O"c=k; if k=O and ex(t) does not converge as t tends to infin- ity, then O"c = O. If L(s) has a nonnegative ab- scissa 0"" then O"c = Jim sup,oo (log I ex(t)l)/t, and if 0",<0, then O"c=limsuPHoo(loglex(oo)-ex(t)l)/t (E. Landau, S. Pincherle). GeneralIy, . 10glex(t)-ex([tJ)1 hmsup - O"n t-x t where [ J is the tGauss symbol (K. Kurosu, T. Kojima, T. Ugaeri, K. Knopp). If I;;" e-S'ldex(t)1 < 00, then the Laplace- Stieltjes integral L(s) = I;;" e -s' dex(t) is said to be absolutely convergent. There exists a real num- ber O"a such that L(s) converges absolutely for Re s > O"a and does not converge absolutely for Res < O"a' We call O"a the abscissa of absolute convergence of L(s), There exists a real number O"u such that L(s) converges uniformly for Re s)o O"u + c (for every c > 0) and fails to do so 240D Laplace Transform for Res)oO"u-s. We call O"u the abscissa of uniform convergence of L(s). It is clear that O",.s; O"u.s; O"a' Formulas determining O"u and O"a are analogous to the Dirichlet series formulas given by Kojima and M. Kunieda ( 121 Dirichlet Series B) [1]. C. Regularity In the region of convergence Re s > 0"" L(s) = I;;" e -s' dex(t) is tholomorphic, and we have L(k)(S) = I;;" e- S '( - t)kdex(t) in Res> 0",. If ex(t) is monotonic, then the real point s = O"c on the axis of convergence is a singular point of L(s). However, there may not be any singular point on the axis of convergence in general. The abscissa of regularity is the infimum of all 0" such that L(s) is holomorphic in Re s > 0". Also, L(O"+ir)=O(lrl) uniformly in O"c+b.s;O"< 00 for every positive b as Irl-->oo. Any analytic function that is holomorphic at 00 can be rep- resented by a Laplace transform. To be pre- cise, let f(s) = f( 00) + Lo(ann!)/sn+l (isl >c). Then the function <p(t) = Lo ant n is entire, f(s) = f( 00) + I e -" <p(t) dt, and O"c> C. D. Inversion Formulas We say that a function ex(t) (t)o 0) that is of tbounded variation on any interval is normal- ized if ex(O) = ex( + 0)=0 and ex(t)==(ex(t+ 0)+ ex(t - 0))/2. A normalized function ex(t) (t> 0) is uniquely determined by its Laplace transform. Moreover, the inversion formula determining ex(t) in terms of L(s) is known. That is, if L(s) = It e- S ' dex(t), then for c > max(o-n 0), 1 f '+i1 L(s) { ex(t), t>O, lim -----: - est ds = ex( + 0)/2, t = 0, 1''" 2m c-iT S 0, t<O. The integral on the left-hand side is often called the Bromwich integral. Suppose that ex(t) = I <p(u)du, L(s) converges absolutely on Res =c, and <p(u) is of bounded variation in a neighborhood of u = t (t)o 0). Then we have 1 f C+iT lim -----: L(s)e st ds Too 2m c-iT { (<P(t+0)+<p(t-0))/2, t>O, = <p( + 0)/2, t=O, 0, t<O. There is another form of the inversion for- mula by E. L. Post and D. V. Widder. Namely, set Lk,t[f(x)] =( -IN(k)(k/t)(k/t)k+l /k! for a COO-function f(x) (t>O, k is a positive 
240E Laplace Transform integer). Then lim f ' Lk.u[L(x)Jdu=cx(t)-cx( +0). k-oo 0 If cx(t) = S<p(u)du, then for almost alI t (>0), lim Lk,,[L(x)J = <p(t) koo ( Appendix A, Table 12.1). E. Representation Theorem If f(x) is of class Coo on (a, b) and (-In(k)(x))o o for k = I, 2, ... , then f(x) is called completely monotonic in (a, b). Moreover, if f(x) is con- tinuous on [a, h], then f(x) is caIled cOT.1pletely monotonic on [a, b]. A necessary and suffi- cient condition for a function f(x) to be com- pletely monotonic in O.s; x < 00 is that f(x) = Se-X'dcx(t), where cx(t) is bounded and in- creasing and the integral converges for O.s; x < 00 (Bernshtein's theorem). A necessary and sufficient condition for f(x) to be represent- able in the form So' e- xI <p(t)dt, where <p(t)E Lp(O, 00) (p> I), is that (i) f(x) have derivatives of all orders in 0 < x < 00, (ii) f(x) vanish at infinity, and (iii) there exist a constant M such that So' ILk,,[f(x)JlPdt<M for k= 1,2,3,.... In the representation f(x) = S e -x, <p(t) dt, a necessary and sufficient condition for <p(t) to be bounded in 0 < t < 00 is that f(x) is of class Coo in 0 < x < 00 and there exists a constant M such that ILk.,[f(x)JI<M and I xf(x) I <M for O<x < 00. In order that <p(t)E L I (O.s; t < 00), it is necessary and sufficient that (i) f(x) be of class Coo in 0 < x < 00, (ii) f(x) vanish at infinity, and (iii) So' ILk.,[f(x)Jldt< 00 and lim f ex: ILk.,[f(x)J-Lj.,[f(x)Jldt=O k: 0 (Widder). F. Operations on Laplace Transforms Let the Laplace transform of f(x) be 2(/(t))(s) = So' e-S'f(t)dt. It is important in operational calculus to know the formula for the Laplace transform of <pI, where <p is an operation. We mention here some important formulas: 2(/(at - b))(s) =exp ( -s) 2(f(t))(sja) (/(at-b)=O if at<b, a>O, b)oO); 2(t f(r)dr }S) =2(f(t»)(S) (Res >max(O, O"cJ); 888 and 2(/'(t))(s) = s2(/)(s) - f(O), provided that 2(/'(t))(s) converges at s (> 0) and f(t)--> f(O) as t--> + O. Generally speaking, if f( + 0), ... ,j(k-l)( + 0) exist and the Laplace transforms 2(/(k)(t))(S) converge at s (>0), then 2(/(k)(t))(S) =Sk 2(/(t))(s) - f( + O)sk-l -f'( + 0)Sk-2 -... - r(k-l)( + 0). Furthermore, given functions fl and f2' if 2(/d(s) and 2(/2)(S) are both convergent and if one of them converges absolutely or !i'(/1 * f2)(S) converges, then 2(/1 * f2) = 2(/1) 2(/2)' G. Asymptotic Properties of the Laplace Transform If L(s) = S e- s ' dcx(t) (s > 0), then for any c)o 0 and any constant A, we have limsup Isc L(s)-AI s-+o .s;limsuplcx(t)t-'qc+ I)-AI, 1-00 limsupls'L(s)-AI SCfj .s; lim sup Icx(t)t-cqc+ 1)- AI. t-+O I n particular, if we set c = 0 and assume that cx(t)-->A as t-->oo, then f(s)-->A as s--> +0; and if we assume cx(t)AtCjqc+ 1) as t-->oo (or t--> + 0), then f(s)  As- c as s--> +0 (or s--> 00). These results are called Abelian theorems, because if we choose cx(t) appropriately and change variables, we get tAbel's continuity theorem on power series: If L an converges to s, then L anx n tends to s as x tends to 1 - O. More generally, we have the folIowing theo- rem: If r e- st dcx(t) = L(s)-->A as s--> + 0, then lim1-oo cx(t)=A if and only if P(t)= Sudcx(u)= o(t) (t-->oo). H. Bilateral Laplace Transform If cx(t) is of bounded variation in every finite interval and if for some s Urn f R e -sl dcx(t), R-cc 0 Jim f o e-"dcx(t) R'-cc -R' exist, we set L(s)= S'ooe-S'dcx(t). If L(s) converges at SI =0"1 + ir l , S2 = 0"2 + ir 2 , then L(s) converges in the vertical strip 0"1 < Res<o-2' If L(s) converges in the strip 0-:< Res<O"; and diverges for Res>O"; and Res< 0":, then each of the numbers 0": and 0":' is called 
889 an abscissa of convergence of L(s). If . 10glcx(t)1 hmsup - k#O, t-oo t lim inf logl cx(t)1 = I # 0 t- - 00 t with k < I, then k and I are the abscissas of convergence. If cx(t) is a normalized function of bounded variation in every finite interval and L(s) converges in the strip k < Re s < I, then for alI t 1 f C+iT L(s) lim - -es'ds Too 2ni c-iT S = { cx(t)-CX( -00), cx(t)-cx(oo), c>O, c<O, k<c<l, k<c<l. Suppose that cx(t) == J <p(u)du, L(s) converges absolutely on the line Re s = c, and <p(t) is of bounded variation in some neighborhood of t=t o . Then 1 f C+iT lim ---; L(s)estuds T-oo 2nl c-iT =(<p(t o + 0) + <p(to -0))/2. There are other formulas analogous to those for the ordinary Laplace transform. T. Application to the Theory of Semigroups of Operators Applying a Laplace transform to a one- parameter semigroup of bounded operators {U(t)},;;,o on a Banach space yields a natural correspondence between the infinitesimal gen- erator A of {U(t)} and its resolvent (s-Ar l . Namely, given a continuous one-parameter semigroup {U(t)},;;,o on a Banach space X ( 378 Semigroups of Operators and Evolu- tion Equations), the infinitesimal generator A is defined by Ax==limh+oh-l(U(h)-I)x, and A is a densely defined closed operator on X for which there exist some positive numbers M and {J such that lI(s - A)-nll.s; M(Re s - {Jr n (n = 1,2, ...), is valid, provided that Res> {J. The Laplace transform of {U(t)},;;,o is defined by 2'(U(t))(s)x= tev e-S'U(t)xdt (x EX, Res>{J), and the foIlowing identity is valid: 2(U)(s)==(s-Ar l . Furthermore, the inversion formula 1 f '+iT U(t)x=lim---; e"(s-Aflxds Toc 2m c-iT (c>{J, t>O) 240J Laplace Transform holds for every x in the domain of A, and the convergence is compact-uniform in t>O. If A is bounded, then U(t)= Lk=O t k Ak/k! (norm convergent), and we can write U(t) = etA. If A is not bounded, U(t) is still regarded as an ex- ponential function of tA, because we have U(t) =limnoo(1 +tA/nfn or U(t)=limnooe-tA", An = A(1 + tA/n)-l being bounded (both strongly convergent). Hence the correspondence be- tween {U(t)} and (S-A)-l is considered a generalization of the formula 2(e at )(s) =(s - cxr l (Res>{J). In order for us to get the inversion formula for every XEX, {U(t)} must satisfy some ad- ditional conditions. Namely, if {U(t)} is a holomorphic semigroup, the inversion formula is valid for every x E X, t?o O. J. Laplace Transform of Distributions The Laplace transform in R is defined by 2(/)(+ilJ)= r e-x(Uin)[(x)dx=Ji'(e-xY)(IJ) JR when e-x{f(x)ELl (160 Fourier Transform). For n == 1, this definition is equivalent to the bilateral Laplace transform. For a given func- tion f(x), the set 6 of ( for which e -x{[ be- longs to Ll is convex in R. g(') == g(( + ilJ) = 2(/)(  + ilJ) is holomorphic in X + iR pro- vided that X is not empty. Differentiation under the integral sign gives D(g(O= r e- X ,( -x)"f(x)dx, JR and the integral converges uniformly on K + iR, K being a compact subset of X. Especi- ally, if e-x{[(x) belongs to Y'(Rn) for some , then 2(/)(+ilJ) is defined on 6f+iR, where 6f= {I e-X{[(x)EY'(R)}, which is also con- vex, 2(/)(  + ilJ) belongs to .'f'(R n ) for every fixed  E f' Hence the definition of Laplace transform can be extended to some class of distributions. Namely, for every distribution TEfW'(R), the set T= {E Rn I e-x TEY"(R)} is convex. Thus the Laplace transform of T for which 61' is not empty is defined by 2(T)( + ilJ) = Ji'(e- X { T)(IJ). This definition can be rewritten by the use of test functions as folIows. For any <pEY'(R), <L(T), <p >n = ((e-x{ T), <p >n = <e-xT, Ji'(<p) >x =(T,JR,e-X({+in)<p(IJ)dlJ)x. Moreover, if 61' has an interior point, 2(T) has an explicit expression: take a  in Xl, and let 0 < B.s; dist(,a6l); then e-,<x>EY'(R), e,<x>TE Y"(R), and 2(T)( + ilJ) = (T, e-x({+in»x= 
240K Laplace Transform <e'<x> T, e-,<x>-x({+in»x, where <x) = . In the following, assume that TEE!i'(R) and Q=1 is not empty. Then h(O=2'(T)((+il/) is (i) holomorphic in Q + iR and (ii) slowly increasing in 1/. More precisely, for any com- pact subset K ofQ, there exist a positive num- ber C K and an integer m K such that for every ( + iI/ E K + iR, the following inequality holds: Ih(+il/)I.s;CK(1 +ll/l)m K . Conversely, for a nonempty open convex set Q and a function h«( + il/) defined on Q + iR satisfying conditions (i) and (ii) above, the cor- responding inverse Laplace transform of h is 2'-1 (h)(x) = e X {g;;,-1 (h)(x) (E Q), the right-hand side being independent of  in Q. T=2'-I(h) is a distribution and T=:>Q. Moreover, 2'(T) = h in Q+ iR. The following identities hold as in the case of functions: 2'( (a J T }O=('2'(T)((), 2'(X'T)(()=( - :,J 2'(T). Roughly speaking, differentiability of T is reflected in the decreasing order of 2'(T) as 11/1-->00. Let r be a cone in R and Br be the ball with center 0 and radius r in R. The dual cone of r is denoted by P. Assume that the support of T is contained in Br+ r', and Q =6 1 is not empty. Then (i) Q + r c Q, i.e., Q extends in the direction of r, and (ii) for any C E Q and any [; > 0, there exist some C and m such that for any  + iI/ E o + r + iR, we have 12'(T)( + il/)I.s; Ce(r+')II(l + II/I)m. Conversely, if there exist a cone r, a domain Q, and a holomorphic function h«() on Q+ iR satisfying (i) and (ii) with h(() replacing 2'(T), then the support of 2'-1 (h) is contained in Br+r'. The convolution.r * T of a function fin .'I'(R) and a distribution T in Y"(R) belong o 0  0 to 9'''(R), and if fnT'" 0, then f*1 =:>fn T and 2'(/* T)=2'(f)2'(T) in 6fn6r+iR. More generally, convolution of two distri- butions T and S for which 6 T n 6s =Q '" 0 holds is defined by T * S = 2'-1 (2'(T)2'(S)). The formula 2'(T* S)= 2'(T)2'(S) is valid by virtue of the definition. If Q + r c Q for some cone r, we have supp(T* S)csuppT +suppS + P. 890 K. Moment Problem Given a real sequence {Jin}g', the problem of finding a function cx(t) of bounded variation normalized on the interval [0,1] satisfying Jin = Sb tndcx(t) (n =0,1, ...), is called Haus- dorff's moment problem. Such a function, if it exists, is unique. This problem is a discrete analog of the Laplace inversion formula, for if we put t=e- U and I1=S, we have Jis= ]g' e-'Ud[ -cx(e- u )]. Let Am,n=()( -J)m- n 6 m - n Jin (m,I1=O, 1, ...), where NJin=L}o( -li(})Jin+H (k=O, 1, ...). Then the problem has a solution cx(t) if and only if there exists a positive number L such that L;;'ol).m,nl.s;L (m=O, 1,...). Moreover, cx(t) is nondecreasing if and only if Am,n)o O. In this case {Jin} is called completely monotonic. When dcx(t) = <p(t)dt, <p(t) belongs to U(O, 1) (p> 1) if and only if(m+ ljP-l L;;'=o 1)'m,nIP.s;L (m = 0, 1, ... ), and <p(t) is bounded if and only if (m+ 1)1).m,nl.s;L (m, 11 =0,1,...). Given a real sequence {Jin}g'. the problem of finding a nondecreasing function cx(t) on R satisfying Jin=]'=mtndcx(t) (11=0, 1,...) is called Hamburger's moment problem, and the simi- lar problem obtained by replacing the condi- tion by Jin = fg' t n dcx(t) is called Stieltjes's mo- ment problem. In these problems uniqueness is not valid. The following is a counterexam- ple for the Stieltjes problem given by Stieltjes himself: cxl(t)= ]exp( -U 1 / 4 )du and cx 2 (t) = ] exp( - U 1 / 4 ) [1 -sin(u 1 / 4 )J du (t> 0) are non- decreasing in (0, 00), and both correspond to the sequence Jin =4(4n + 3)! in Stieltjes's problem. Carleman showed that the condition L Jiz;/2n = 00 is sufficient for uniqueness in Hamburger's problem, Hamburger's problem has a solution if and only if the quadratic forms LLoJii+j(ij (11=0, 1, ...) are all non- negative definite. Stieltjes's problem has a solution if and only if the quadratic forms L;,j=o Jii+ji j and L;,j=o Jii+j+l ij (11 = 0, 1, ... ) are non-negative definite. If, in Hamburger's or Stieltjes's problem, we require only that the solution is a function of bounded variation instead of a nondecreasing one, then every sequence {Jin}g' has a solution (R. P. Boas, Jr.). References [IJ D. V. Widder, Laplace transform, Prince- ton U niv. Press, 1941. [2] G. Doetsch, Theorie und Anwendung der Laplace- Transformation, Springer, 1937. [3J B. Van der Pol and H. Bremmer, Opera- tional calculus based on the two-sided La- place integral, Cambridge Univ. Press, econd edition, 1964. 
891 [4J N. Dunford and 1. T. Schwartz, Linear operators, pt. 1, General theory, Interscience, 1958. [5J K. Y osida, Functional analysis, Springer, sixth edition, 1980. [6J J. Leray, Hyperbolic differential equations, Princeton Lecture notes, Inst. for Adv. Study, 1952. [7J L. Schwartz, Theorie des distributions, Hermann, 1966. [8J 1. A. Shohat and 1. D. Tamarkin, The problem of moments, Amer. Math. Soc. Math. Surveys, no. 1, 1943. 241 (XVI.11) Latin Squares A. Definition and Classification A latin square over a set A = {at, a 2 , ..., an}, or of order n, is an arrangement of elements of A in a square of side n such that each symbol in A occurs exactly once in each row and in each column. It is in standard form, or reduced, if the first row and the first col umn consist of the natural permutation. There are three different interpretations oflatin squares. (1) If we iden- tify the set of indices with A and write z = x 0 Y to indicate that the symbol at the row x and the column y is z, then we have a quasigroup, denoted by (A,o) ( Section C). (2) The set of the n 2 triplets xyz in the above relation is an terror-detecting code over the alphabet A, of word length 3 with the minimum Hamming distance 2. (3) The set of the n permutations Pi moving the natural permutation to the ith row satisfies the condition that Pi-I  is a discord- ant permutation if i # j. Two latin squares Land L' are isotopic if under convention (2) there are three permu- tations p, q, r of A such that L = {xyz}, L' = {x'y'z'}, x' = p(x), y' = q(y), z' =r(z). An iso- topy class can contain more than one reduced latin square. The number of isotopy classes is denoted by L:. If we admit another equiva- lence due to rearrangement of the three com- ponents of the words in the code (2), then we have main classes; the number of main classes is denoted by L:*. The total number of latin squares of order n is given by n!(n-l)!Ln for the number Ln ofreduced latin squares. It is known that LI =L 2 = L3 = 1, L 4 =4, Ls = 56, L6 = 9408, L7 = 16,942,080, L8 = 535,281,401,856, L9=377, 597,570,964,258,816; Lj=L==q=l, L!==q==2, L==22, q= 564, Q=I,676,257; Lf*=Q*=Q*=I, L!* =Q*=2, L*=12, q*= 147. 241 D Latin Squares B. Orthogonality Suppose that we superimpose two latin squares L=(A, 0) and L' =(A, e), and ifthere is no coincidence for the pairs of the symbols, i.e., if xo y=x' 0 y' and xe y=x' e y' implies x = x', y = y', then Land L' are mutually ortho- gonal, and the resulting square is an Euler square. Actually there exists a pair of mutually orthogonallatin squares of order n, provided that n # 2 or 6; this was established by Bose, Shrikhande, and Parker only in 1959, contrary to a long-standing conjecture of Euler himself that such a set does not exist if n = 2 (mod 4). A set of mutualIy orthogonallatin squares of order n cannot contain more than n-1 squares, and a set of n - 1 squares is a com- plete set of mutually orthogonallatin squares. This is equivalent to an error-correcting code of n 2 words of word length n + 1 with mini- mum distance n. Again this is equivalent to a finite projective plane of order n, i.e., each line containing exactly n + 1 points. If n is a power of a prime, then there exists a projective plane of order n. In general, if pr is the minimum of the prime-power components of n, then there is a set of pr - 1 mutually orthogonallatin squares of order n. C. Quasigroups A quasigroup (A, 0) is a set A bestowed with a binary operation 0, satisfying both cancella- tion laws; xoy=xoz implies y=z; and yox= zox implies y==z. A finite quasigroup is a group if and only if it satisfies the associative law (xoy)oz=xo(yoz). A quasigroup is isotopic to a group if and only if it satisfies Brandt's law: xoy=zow, xoy'=zow', x'oy = z' 0 w implies x' 0 y' = z' 0 w'. Two groups are isotopic if and only if they are is-om orphic. A biunique mapping f of a finite group G is caned a complete mapping if x-->x-If(x) is also a biunique mapping. In this case if we denote by G' the quasigroup defined by x 0 y = xf(y), then the two latin squares corresponding to G and G' are mutualIy orthogonal. A group of odd order has a complete mapping, while a group of even order with a cyclic 2-Sylow subgroup does not have a complete mapping. A solvable group of even order with a non- cyclic 2-Sylow subgroup also has a complete mapping, and it is conjectured that the solv- ability condition here is redundant. D. Room Squares A quasigroup (A, 0) is idempotent if x ox==x for any x and is commutative if xo y= yox for any x and y. Now a Room square of order 2n 
241 E Latin Squares is a distribution of the n(2n - 1) unordered pairs of elements from A = {a o , ai' "', a2n-l} among the (2n - 1)2 cells of a square of side length 2n - 1, such that each element of A appears exactly once in each row and in each column. The remaining (2n-l)(n-l) cells should be empty. According to Bruck this is equivalent to a pair of two idempotent com- mutative quasigroups (A; 0) and (A;.) defined over A' = {a I' ... , a2n- d, satisfying an ortho- gonality condition similar to that of latin squares, namely, x 0 y=x' 0 y', x. y=x'. y' implies x =x', y= y' or x= y', y=x'. A Room square of order 2n exists if n)o 4. E. Number of Latin Squares Little is known about the total number of latin squares in general. The first k rows of a latin square of order n is a k x nlatin rect- angle. The number of k x n 1atin rectangles is asymptotic to (n!texp( -k(k-l)j2) for k< n l / 3 (Erdos, Kaplansky, and Yamamoto) and to (n!)k exp ( -() -()j(n- 1) _()j(n21)) for k < n 5 / 12 . They suggest an analogous asymp- totic relation for the number of latin squares. The following congruences are known for the number Ln of reduced latin squares; Lp= 1 (modp), Ln=O (modp) for n)o2p ifp is a prime. References [1] J. Dimes and A. D. Keedwell, Latin squares and their applications, Akademiai Kiad6, Budapest, and Academic Press, 1974. [2] W. D. Wallis, A. F. Street, and J. S. Wallis, Combinatorics: Room squares, sum-free sets, Hadamard matrices, Lecture notes in math. 292, Springer, 1972. 242 (V.8) Lattice-Point Problems A. General Remarks Suppose that we are given in a Euclidean plane a closed tJordan curve C of length L that bounds a tregion of area F. We denote by A the number of tlattice points on the curve C or in the region bounded by C. In many cases it can be verified that A = F + O(L) (0 is the tLandau symbol). Specifically, if we take a circle whose center is the origin and whose radius is fi, then A(x) == nx + O(fi). Next, consider the closed region defined by uv.s; x, u)o 1, and v)o I on the uv-plane. Let D(x) be 892 the number of lattice points lying in this closed region. Then D(x)=xlogx +(2C -1)x+ O(fi), where C is the tEuler constant. To observe another aspect of the problem of estimating the number of lattice points in a given region, consider the series +w I (m 2 + n 2 ) -, = f(s), ('::'n):O) (Jln-J =g(s). Then we have f(s) = LI r(n)n -s, g(s)= LI d(n)n- S , where r(n) is the number of integral solutions (u, v) of the equation u 2 + v 2 = nand d(n) is the number of positive integral solutions (u, v) of the equation uv == n. Thus the problem of estimating A(x) is identical to that of estimating H(x) = LnJ(n); this is called Gauss's circle problem. We also have D(x)= Ln,;;xd(n), and the problem of estimating the latter is called Dirichlet's divisor problem. We set P(x)=A(x)-nx and 6(x)=D(x) -(xlogx+(2C-l)x). W. Sierpiflski (1906) showed that P(x)=0(X I / 3 ), and G. Voronoi (1903) showed that 6(x)=0(x l / 3 10gx). There are further investigations concerning the esti- mations of P(x) and 6(x). J. G. van der Cor- put and E. C. Titchmarsh devised methods to estimate more general ttrigonometric sums. For instance, let f(x) be a real-valued func- tion oftclass C k (k)o3). If O<)..s;f(k)(X).s; h). or O<A.s; - f(k)(x).s;hA in the interval a.s;x.s;h (with b-a)o 1), then I eXp(2nif(x)) = 0(h 2d (b _a)A(2 k -2)-1 an,,;;b + (b _ a)I-22-k). -(2 L 2)-I). As of 1968, we have an estimate slightly better than 0(X I3 / 40 +<) for P(x) and 6(x) obtained by L. K. Hua (1942), C. J. Cheng (1963), and W. L. Yin (1959). It is conjectured that P(x) and (x) are 0(X I / 4H ), where £ is an arbitrary positive number. On the other hand, M. Tsuji (1953) proved that HP(y)/ydy=O(I). G. H. Hardy (1916) and A. E. Ingham (1941) showed that . P(x) hmsup=oo, x-+oo x .. P(x) hm mf 1/4 1/4 < 0, xoo x log x 6(x) limsup 1/4 1/4 >0, x log x . , 6(x) hmmf= -00. x-+ ro X H. Cramer (1926) showed that 1 I x - IP(y)ldy=0(x l / 4 ), X I 1 I x - [6(y)ldy=0(X I / 4 ). X I G. V oronoi (1904) proved that if x is positive, 
893 then c.  r(n) r:.::: L' r(n)=nx+..; x L --;=J I (2n..; nx), n:e::;x n::::::l y'n I' d(n)=x10gx+(2C -1)x+ 1/4 nx CN d(n) +fi n1 ;;; F(2n), where L' means that when x is an integer m, the mth term d(m) is replaced by (1/2)d(m). Here J I (x) is the tBessel function of the first kind, and F(x) = (2/n)f;' cos(xu)sin(x/u) duo There are many proofs for these expansion formulas. E. Landau's proof (1920) of the estimation of Lnxr(n) is interesting from the geometric point of view, and W. Rogosinski's proof (1922) of the estimation of Lxd(n) uses real analytic methods in an ingenious man- ner. A. Oppenheim (1926) generalized these problems. B. Other Extensions Let av be rational, av= av, and Q(u l , ..., un) = L,v1 avuuv be a tpositive definite qua- dratic form with tdiscriminant D. As an ex- tension of Gauss's circle problem, it is nat- ural to consider the number of lattice points (m 1 , ..., m n ) satisfying Q(m l , ..., mn).s; x. In connection with the tEpstein zeta function, this problem was extended to that of estimat- ing the sum F(x)= I exp(2ni(1X 1 m l +...+lX n m n )). Q(mj, ,mn):,,;x Namely, the weight exp(2ni(IX I m l +... + IXnmn)) is placed at each lattice point. We now define 15 such that 15 = I if lXI, ..., IXn are all integers and 15=0 otherwise. Landau (1915) obtained three exquisite proofs of nn/2 F(x)=b x"/2+0(x n (n-I I /2(n+I»). fi l(n/2+ I) I. M. Vinogradov (1960) obtained a deeper result for the special case n=3: Lu2+v2+w2zxl = (4/3)nx 3 / 2 + O(X 19 / 28 +'). Four times the tDedekind zeta function of the Gaussian field Q(J=1) is equal to LI r(n)n-'. Hence Gauss's circle problem can be extended to that of estimating H(x) for the Dedekind zeta function. The generalized divi- sor problem, including Gauss's circle problem and Dirichlet's divisor problem, was the prin- cipal theme of Landau's research after 1912. We now consider the case where the Dirichlet series L::"l F(n)n- S is a finite product of Dede- kind zeta functions. With a slight modification the following result is valid for the product of tHecke L-functions: Let k j (I .s;j.s; r) be an 242 Ref. Lattice-Point Problems algebraic number field of degree n j , (j(s) be the tDedekind zeta function of kj> and Pi be the residue at the pole s = 1 of (j(s), Further, we assume that n 1 +n z + ...+n,=N, 00 (I (S)(2(S)... (,(s) = L F(n)n -S, 11=1 H(x)= I F(n). n>;x Then H(x)=x(allog,-I X+... +a'_llogx+a,) + O(X(N-I)/(N+l) 10g'-1 x), a 1 =P1P2 ...p,/(r-I)!, and the remainder O-term of the right-hand side cannot be replaced by O(xo) for 0< 1/2 -1/2N. There are some algebraic results (Z. Suetuna (1929), H. Hasse and Suetuna (1931)) concerning the estimation of H(x). In particu- lar, if in the definition of F(n), the (j(s) are aJl equal to the Riemann zeta function, then we obtain (S)k = LI dk(n)n- S , where dk(n) is the number of ways of expressing n as a product of k factors. In this special case the remainder term can be replaced by O(x c +'), where c= max(I/2, (k-I)/(k+2)) (k;d) (G. H. Hardy and J. E. Littlewood, 1922). An appropriate application of the t Artin L-function was ob- tained by Suetuna (J. Fac. Sci. Univ. Tokyo, 1925), who extended to algebraic number fields of finite degree the result obtained by Landau (I912)-which states that the number ofposi- tive integers not larger than x that can be expressed as the sum of two squares is ap- proximately equal to ax/ Jlogx , where a is a positive constant. References [I] G. H. Hardy and J. E. Littlewood, The approximate functional equation in the theory of the zeta-function, with application to the divisor-problems of Dirichlet and Piltz, Proc. London Math. Soc., (2) 21 (1923),39-74. [2] L. K. Hua, Die Abschiitzung von Ex- ponentialsummen und ihre Anwendung in der Zahlentheorie, Enzykl. Math" Bd. I, 2, Heft 13, Teil I, 1959. [3] E. G. H. Landau, Zur analytischen Zahlen- theorie der definiten quadratischen Formen, S.-B. Preuss. Akad. Wiss. (1915),458-476. [4] E. G. H. Landau, Einfiihrung in die ele- mentare und analytische Theorie der alge- braischen Zahlen und der Ideale, Teubner, 1918, second edition, 1927. [5] E. G. H. Landau, Vorlesungen iiber Zah- lentheorie II, Hirzel, 1927 (Chelsea, 1969). 
243 A Lattices [6J E. C. Titchmarsh, The theory of the Rie- mann zeta-function, Clarendon Press, 1951. [7J E. G. H. Landau, Ausgewiihlte Abhand- 1ungen zur Gitterpunktlehre, A. Walfisz (ed,), Deutscher Verlag der Wiss., 1962. [8J A. Wa1fisz, Gitterpunkte in mehrdimen- siona1en Kuge1n, Warsaw, 1957. 243 (11.14) Lattices A. Definitions When x and yare elements of an tordered set L, the tsupremum and tinfimum of {x,y}, whenever they exist, are called the join and meet of x, y and denoted by xU y and x n y, respectively, Lis cal1ed a lattice (or lattice- ordered set) when every pair of its elements has a join and a meet. The following three laws hold in any lattice L: (i) xU y= yUx, x ny= ynx (commutative law); (ii) xU(yUz)=(xU y)Uz, x n(ynz)=(x ny)nz (associative law); and (iii) xU(ynx)=(xUy)nx=x (absorption law). Conversely, if in a set L two operations U, n are given that satisfy (i)-(iii), then the conditions xU y == y and x n y = x are equiv- alent and define an tordering x.s; y in L with respect to which L becomes a lattice. The supremum and infimum of {x,y} in this lattice coincide with the elements xU y and x ny, res- pectively. Accordingly, a lattice Can also be de- fined as an talgebraic system with operations U, n satisfying laws (i)-(iii). The idempotent law xux=xnx=x holds in any lattice. An ordered set L is called an upper semilat- tice if each pair of elements x, y always has a join (supremum) xU y, and a lower semilattice if each pair of elements x, y always has a meet (infimum) x n y. B. Examples The set (S) of all subsets of a given set S is a complete and distributive lattice with respect to the inclusion relation ( Sections D, E). The set of al1 tnorma1 subgroups of a given tgroup is a complete and modular lattice ( Section F) with respect to the inclusion re- lation. This remains true if the normal sub- groups are replaced by the tadmissib1e sub- groups with respect to a given toperator domain. This applies in particular to the case of the set of all tidea1s in a given com- mutative tring. The set of all subspaces of a given tprojective space is a modular lattice. 894 C. Further Definitions A mapping f of a lattice L into a lattice L' that satisfies the conditions f(x U y)= f(x)U f(y) and f(xny)= f(x)nf(y) is called a lattice homomorphism (or simply homomorphism). A tbijective lattice homomorphism f is called a lattice isomorphism (or simply isomorphism); its inverse mapping is also a lattice isomor- phism. When such an f exists, the lattices L and L' are said to be isomorphic. More gener- ally, a mapping f between ordered sets is said to be order-preserving when it satisfies the condition: x.s; y implies f(x) .s;f(y). Any lat- tice homomorphism is order-preserving, but the converse is not always true; however, an order-preserving bijection is an isomorphism. If the ordering in a lattice L is replaced by the tdual ordering, then the join and the meet are interchanged, and a new lattice L' is ob- tained. This new lattice is called the dual lattice for L. A mapping f of a lattice L into a lattice L' satisfying the conditions f(x n y) = f(x) U f(y) andf(xUy)= f(x)nf(y) is cal1ed a dual homo- morphism (or antihomomorphism), Moreover, when f is a bijection, f is cal1ed a dual iso- morphism (or anti-isomorphism), and we say that Land L' are dually isomorphic (or anti- isomorphic) to each other. When a lattice L' is a subset of a lattice L and the canonical injection L' --> L is a lattice homomorphism, L' is cal1ed a sublattice of L. If a subset L' of a lattice L satisfies the con- dition that x, YEL' implies xUy, xnYEL', then two operations U, n can be induced in L' so that L' becomes a sublattice. For example, when a, b are given elements of a lattice L, the set of elements x satisfying a.s;x.s;b is a sub- lattice, denoted by [a, bJ and called an interval of L. When the quotient set L/R of a lattice L by an equivalence relation R in L is also a lattice and the canonical surjection L-->L/R is a homomorphism, then L/ R is cal1ed a quo- tient lattice of L. If an equivalence relation R in a lattice L satisfies the condition that x = x', y=y' (modR) implies xUy=x'U y', xny= x' n y' (mod R), then two operations U, n can be induced in L/R so that L/R becomes a quotient lattice. The Cartesian product L = I1 iE1 L i of a family {L i } iEl of lattices becomes a lattice if the operations U, n are defined by (x;) U (Yi) = (Xi U Yi), (x;) n (y;) = (Xi n y;). This lattice is caIled the direct product of lattices {LJiEl' D. Complete Lattices An ordered set L is cal1ed a complete lattice if every nonempty subset of L has a supremum 
895 and an infimum in L, and a O"-complete lattice if every nonempty countable subset has a supremum and an infimum. Naturally, such ordered sets are lattices. And a sort of con- verse holds: Any lattice is a sublattice of some complete lattice. An ordered set L is said to be conditionally complete when every subset tbounded from above (below) has a supremum (infimum) in L, and conditionally O"-complete when every countable subset bounded from above (below) has a supremum (infimum). For any ordered set L there exist a complete lattice L and an order-preserving injection f:L-->L satisfying the condition that each element (E L is the supremum and infimum of the images f(X) andf(y), respectively, for some sets X, Y c L. This condition is equivalent to the con- dition that for any complete lattice L' and order-preserving injection 1': L --> L', there exists an order-preserving injection <p: L--> I' for which <pof= 1'. Hence (LJ) is unique up to lattice isomorphisms. L is called the com- pletion of the ordered set L. For example, the set of real numbers supplemented by +00 and -00 is the completion of the set of rational numbers. E. Distributive Lattices A lattice L is said to be distributive when it satisfies the following equivalent conditions (distributive laws) for x, y, ZEL: (i) xU(ynz) =(xU y)n(xU z); (ii) xn(yUz)=(x ny)U (x n z); and (iii) (x U y)n(yU z)n(zU x)= (xny)U(ynz)U(znx). The dual lattices, sublattices, quotient lattices, and direct prod- ucts of distributive lattices are distributive. The set (S) of subsets of a given set S is a distributive lattice, and each of its sublattices is called a lattice of sets in S. A distributive lat- tice is isomorphic to a certain lattice of sets. A homomorphism from a distributive lattice L into (S) is called a representation of L in S. A lattice L is said to be complemented when a greatest element I and a least element 0 exist in L and for every element x, there exists an element x' satisfying x U x' = I, x n x' = O. Such an x' is called a complement of x. A lattice that is distributive and complemented is called a Boolean lattice (or tBoolean algebra). In a Boolean lattice, every element has a unique complement. The lattice (S) of all subsets of a given set S is a Boolean lattice, in which S is the greatest element and 0 the least element. A sublattice of (S) that con- tains the complement of each of its elements is also a Boolean lattice and is called a Boolean lattice of sets, Any Boolean lattice can be rep- resented isomorphically by some Boolean lattice of sets ( 42 Boolean Algebra). 243 F Lattices An element a of a lattice is called a neutral element if for any pair of elements x and y, the sublattice generated by a, x, y is distributive. When a is neutral and has a complement, a is called a central element. The set of all central elements of a lattice L is called the center of L. F. Modular Lattices A lattice L is said to be modular if the follow- ing condition (modular law) is satisfied: x.s;z implies xU(ynz)=(xUy)nz. A distributive lattice is always modular. The dual lattices, sublattices, quotient lattices, and direct prod- uct of modular lattices are also modular. The tJordan-Holder theorem and the refine- ment theorem of O. Schreier on normal sub- groups ( 190 Groups) are generalized as follows to the case of any modular lattice: A pair of elements x, y in a lattice L satisfying x)o y is called a quotient and is denoted by x/Yo In particular, when x> y and there exists no element z such that x> z > y, then x/y is called a prime quotient, x is said to be prime over y, and y is said to be prime under X. A sequence C: X o , Xl' ..., Xk of elements of L satisfying the conditions X i - l )0 Xi (I.s; i.s; k) is called a descending chain, and k is called its length. Each xi-dxi is called a quotient deter- mined by C. When each of these quotients is prime, C is called a composition series. A de- scending chain D: Yo, Y I, ... , Yl is called a re- finement of a descending chain C: X o , x I' ... , X k when Xo = Yo, X k = Yl and each Xi is equal to some yj' We now define an equivalence relation between descending chains. First, a relation x/yx'jy' between quotients x/y and x'/y' is defined to mean that either the con- dition x = x' U y, y' = x' n y or the condition x' = x U y', y = x n y' holds, Then x/yand x'/y' are called equivalent if there exists a finite number of quotients qo, ql>"', q, satisfying the conditions x/y=qo, x'/y' = qr and qi-l qi (l.s; i.s; r). Descending chains C and C' are said to be equivalent if they have the same length and the set of quotients determined by C is mapped bijectively to the set of quotients determined by C' so that the quotient and its image are equivalent. Now let L be a modular lattice. If two quotients x/y and x' /y' in L are equivalent, the intervals [y, x] and [y', x'], considered as lattices, are isomorphic (Dede- kind's principle). If two descending chains C:x o , XI' ..., X k and C':Yo, YI' ..., Yl have the same ends Xo = Yo and Xk = y" then there exist a refinement of C and a refinement of C' which are equivalent. In particular, any two compo- sition series connecting the same elements are equivalent ( 85 Continuous Geometry). In a modular lattice L with a least element 
243G Lattices 0, if there exists a composition series connect- ing 0 and a given element a, then all such composition series have a common length k, denoted by d(a) and called the height of the element a. If no such composition series exists, the height is defined to be d(a) = 00. If two elements a, h of L satisfy d(a U b) < 00, then d(aUb) + d(anh)=d(a)+d(h), This fact is called the dimension theorem of modular lat- tices, If L has a greatest element I, d(I) is called the height of the lattice L. In a complemented modular lattice L, an element which is prime over the least element o is called an atomic element. A complemented modular lattice L is said to be irreducible if any two atomic elements have a common complement. G. Lattice-Ordered Groups An ordered set G in which a group operation is defined is called an ordered group when x.s; y implies xz.s; yz and zx.s; zy for all x, y, z in G. Moreover, if it is a ttotally ordered set, the ordered group G is called a totally ordered group. If G is a lattice, the condition for an ordered group is equivalent to the condition that (x U y)z= xz U yz, (x n y)z =xz n yz, z(x U y) = zx U zy, and z(x n y) = zx n zy. In this case, G is called a lattice-ordered group. A lattice-ordered group is a distributive lattice and has neither a grcatest nor a least elemcnt. If {Xi] has a supremum in a lattice-ordered group, then we have (SUPiX;)nY=SUpJXiny) and its dual (complete distributive law). The lattice-theoretic structure of a lattice- ordered group was clarified by P. Lorentzen, A. H. Clifford, and T. Nakayama. In particular, a lattice-ordered commutative group is isomor- phic (as a lattice-ordered group) to some sub- group of a direct product of totally ordered groups. A lattice-ordered group has no ele- ment of finite order other than the identity element. Conversely, a commutative group which has no element of finite order other than the identity element can be made a lattice- ordered group with respect to some total ordering, Any free group can also be made a lattice-ordered group with respect to some total ordering. K. Iwasawa (1948) and others have done further research on totally ordered groups. An element x (#e) of a lattice-ordered group G is called positive (negative) when x e (x.s;e), where e is the identity. G is called an Archimedean lattice-ordered group when the following condition is satisfied: if, for some y, xn.s; V for all natural numbers n, then x.s; e. An Archimedean lattice-ordered group is 896 isomorphic to some subgroup of a complete lattice-ordered group. Conversely, a complete lattice-ordered group is Archimedean; more- over, it is commutative and isomorphic to the direct product of some tlattice-ordered linear spaces and copies of the lattice-ordered group of rational integers (Iwasawa, 1943). In particular, any totally ordered Archimedean lattice-ordered group is isomorphic to a sub- group of the lattice-ordered group of all real numbers. If the tminimal condition holds for the set of all positive elements in a lattice-ordered group, then the group is commutative, and each of its elements can be decomposed uniquely into a product of powers of elements that are prime over the identity element. The set of all tfrac- tional ideals of an algebraic number field is a typical example of such a lattice-ordered group. For further reference  310 Ordered Linear Spaces, 85 Continuous Geometry. References [IJ G. Birkhoff, Lattice theory, Amer. Math. Soc. Colloq. Publ., 1940, revised edition, 1948. [2J P. Dubreil and M. L. Dubreil-Jacotin, Lei;ons d'algebre moderne, Dunod, second edition, 1961; English translation. Lectures on modern algebra, Oliver & Boyd, 1967. 244 (XXI.34) Lebesgue, Henri Leon Henri Lebesgue (June 28, 1875-July 26,1941) was born in Beauvais (Oise), studied at the Ecole Normale Superieure, and received in 1902 the doctoral degree for his thesis con- cerning integration [1]. After teaching in Rennes and Poitiers, he came to Paris where he was nominated for a professorship at the Faculty of Science in 1920 and then at the College de France in 1921. He was one of the most influential French analysts of this cen- tury and is known as the inventor of the tLe- besgue integral. With deep insight based on intuitive geometric conccptions, he was able to initiate a new era in analysis by creating the theory of this integral. Not only was this theory the start of modern integration, it was also a turning point in the theory oftFourier series and tpotential theory. The notions of tLebesgue dimension of topological spaces and of tLebesgue number of compact sets are due to him. Lebesgue also made significant contri- butions to the tDirichlet problem. 
897 References [1] H. Lebesgue, Integrale, longueur, aire, Ann. Mat. Pura Appl., (3) 7 (1920), 231-359. [2] H. Lebesgue, Sue les fonctions represent- abIes analytiquement, J. Math. Pures Appl., (6) I (1905), 139-216. [3] H. Lebesgue, Lei;ons sur I'integration et la recherche des fonctions primitives, Gauthier- Villars, 1904, second edition, 1928. 245 (XXI.35) Leibniz, Gottfried Wilhelm Baron Gottfried Wilhelm von Leibniz (July I, 1646-November 4, 1716) was born the son of a professor and grew up to be a genius with encyclopedic knowledge. He took part in politics and touched all scholarly fields, con- tributing creatively to the development of technology as well. His posthumously pub- lished works cover theology, philosophy, mathematics, the natural sciences, history, and technology, and are classified into 41 fields. A complete edition of his works has yet to be published. Ars combinatoria, written upon his graduation from Altdorf in 1666, was a scheme to systematize the various fields using mathe- matics as a model. During his stay in Paris (1672-1676), when not involved in politics, he studied the works of tDescartes and tPascal, as suggested by C. Huygens. He discovered the tfundamental theorem of differential and in- tegral calculus and set up a basis for calculus with the introduction of an ingenious system of notation. After 1676, he worked on histor- ical compilations under the Duke of Hanover. Leibniz worked not only on the synthesis of mechanistic philosophy and medieval theolog- ical philosophy but also on the reconciliation of Protestantism and Catholicism. With his monadism he attempted to unify the old and new philosophies. In addition he worked on plans for a world academy for the develop- ment of learning and on the unification of all knowledge. This was to be accomplished using, for example, universal symbolism and universal linguistics. Under his influence, the Berlin Academy was established in 1700. After his death, his conceptions of tsymbolic logic and tcomputers were realized, References [IJ C. I. Gerhardt (ed.), Leibnizens mathema- tische Schriften I-VII, Schmidt, 1849-1861 [2] C. I. Gerhardt (ed.), G. W. Leibniz, Philo- sophische Schriften I-VII, Berlin, 1875-1890. 246 A Length and Area 246 (X.16) Length and Area A. Length of a Curve A continuous mapping C sending each point U of an interval I={ula.s;u.s;b} to a point p= p(u) =(x i (u), "', xk(u)) of the k-dimensional Euclidean space R k (k2) is called a tcontin- uous arc or continuous curve, denoted some- times by C:p=p(u). The supremum of the length of a polygonal curve inscribed in C is called the length of C (in the sense of Jordan) and is denoted by 1(C). Namely, it is equal to sUPoL7'llp(u;)-p(ui-dl, where b is a par- tition of I:a=uo<u l ... <um=b. Let C:p= p(u), and Cn:P=Pn(u), n= 1,2,... (uEI) be continuous arcs. If Pn(u)-->p(u) on I, then 1(C).s;liminfnl(C n ). This property is called the lower semicontinuity of length. Given two continuous arcs C:p=p(u) (uEI) and CI:p =q(v) (vEIl)' if for any £>0, there exists a homeomorphism u=h,(v) of II onto I such that Ip(h,(v))-q(v)1 <£ on II, then C and C I are called equivalent in the sense of F rechet. Equivalent continuous arcs have the same length. A continuous image of an open interval is ca1\ed an topen arc. The length of an open arc is defined to be the supremum ofthe length of a continuous arc contained in the open arc, and the notion of the equivalence of open arcs is defined in the same way as for continuous arcs. An equivalence class of continuous ares (or open arcs) is ca1\ed a Frechet curve, and its length is defined uniquely. Suppose that a continuous arc C is ex- pressed by (xdu), , ... xd u )), uEI. Then the length 1(C) is finite if and only if every x;(u) is of tbounded variation, When I(C) is finite, C is called rectifiable. In this case each axdau exists talmost everywhere on I, and the inequality f b ( k ( ax ) 2 ) 1/2 I(C) .  du a ,-I aU (1 ) holds. The equality holds if and only if each x;(u) is absolutely continuous. Among the continuous arcs equivalent to C, there exists a unique continuous arc C I such that C I : q = q(s) (O.s;s.s;I(C)) and the length of every subarcq= q(s) (O.s;s.s;s'(.s;I(C))) is equal to s'. C I is called the representation in terms of arc length of C. For C I , the equality holds in (1). A similar argument is valid for any open arc. When every su barc of C is rectifiable, C is called locally rectifiable. If Al is the I-dimensional tHausdorff measure in R k and n(p) is the number of points on I corresponding to pERk, then I(C)= S n(p)dAI (p) (M. Ohtsuka, 1951). 
246B Length and Area B. Surface Area In this section, we deal with area of tsurfaces in R 3 , using [1] as the main reference. In con- trast to the situation for curves, the area of a polyhedral surface P inscribed in a given sur- face does not necessarily tend to a fixed value as P approximates the surface. In a letter of 1880, H. A. Schwarz gave the folIowing example: Approximate a circular cylinder with height h and radius r by a sequence {Pn} of inscribed polyhedral surfaces, each Pn consist- ing of similar triangles of height b n and base length an' If bnla; is suitably chosen, then the surface area of Pn tends to an arbitrary value not smaller than 2nrh (Fig. 1), Fig. 1 C. Lebesgue Area Suppose that we are given a plane domain A and a continuous mapping T of A into R 3 . The pair (T, A) is called a surface. Let d(T, T, B)=SUPWEBI T(w)- T(w)1 for surfaces (T, A), (T', A') and a set Be A n A', Let (T, A), (T 1 , Ad, (T2' A 2 ), ... be given. If Ani A and d(T, 1;"An)-->O, then (1;" An) (or simply 1;,) is said to converge to (T, A) (or T), and the con- vergence is expressed by Tn --> T. In particular, if A consists of a finite number of triangles and T is linear on each triangle, i.e., the image of A under T consists of triangles, then the nota- tion (P,F) is used for (T, A), and the area of T(A) is denoted by a(P, F). Given a surface (T, A), denote the totality of sequences {(Pn,Fn)} converging to (T,A) by <1>, and calI inf",liminfna(P n , Fn) the Lebesgue area of (T, A). This area is denoted by L(T, A). By virtue of the definition there exists a sequence {(Pn,Fn)} con- verging to (T, A) such that a(Pn,Fn)-->L(T, A). Like length, Lebesgue area has the lower semi- continuity property. Namely, 1;,--> T implies L(T, A).s;liminfnL(T n , An). When A is a Jordan domain, the same value L(T, A) is obtained if <1> is replaced by the set <1>* of all seq uences {(Pn,Fn)} such that Fn i A and Pn(w)--> T(w). Let f(x, y) be a continuous function defined on O.s; x.s; 1, O.s;y.s; 1. Regard f(x, y) as a func- tion of y (resp. x) for a fixed x (y), and denote 898 it by fAY)(f,,(x)) and its ttotal variation by V(X)(VdY)). WhenS6 V(x)dx+S6 V1(y)dy<00, f(x, y) is said to be of bounded variation in the sense of Tonelli. Furthermore, if fAy) and fy(x) are tabsolutely continuous for almost every x and y, respectively, then f(x, y) is said to be absolutely continuous in the sense of Tonelli. Similar definitions are also given when f is defined in a general domain. Suppose that a surface (T, A) is expressed by a set of three functions x==x(u, v), y== y(u, v), z=z(u, v), alI of which are absolutely continuous in the sense of TonelIi, and that the partial derivatives xu, xv, ..., Zv are square integrable. Then L(T, A) = HA J dudv < 00 (as was shown by C. B. Mor- rey), where J = (Jt + Ji + J)1/2 with tfunctional determinants J 1 , J 2 , J 3 of the transformations (u, v)-->(y, z), (z, x), (x, y). D. The Geocze Problem The Geocze problem is the problem of deter- mining whether L(T, A) coincides with the area obtained by using (instead of <1» the set of all sequences {(Pn, Fn)} such that (Pn, Fn) converges to (T, A) and each (Pn,Fn) is inscribed in (T, A). The answer is affirmative when A is a Jordan domain with L(T, A) < 00. Let T be a surface expressed by a function z=F(x,y) (O.s;x.s; 1, O.s; y.s; 1) that is absolutely continuous in the sense of TonelIi, and let {(Pn,Fn)} be as before. If the ratio of the length of the largest side and the smalIest height of each triangle in (Pn, Fn) is uniformly bounded, then a(P n , Fn) tends to L(T, A) [1, p. 74]. E. Geocze Area Consider a surface (T, A). Let EI' E2' E3 be coordinate planes in R 3 , and denote by T; (i = 1, 2, 3) the composition of the mapping T and the projection of R 3 onto E i . Let on be the positively oriented boundary of a polygonal domain n in A and C i be the oriented image of on by T;. Then the torder O(z; C;) of z with respect to C i is a measurable function of z, Set vi(T; n)=v j = HE,IO(z; C;)I dxdy (z=x+ iy) and v(T;n)=v=(v+v+vW/2. The quantity V(T;A)=sup L v(T;n) S 1tES (2) is called the Geocze area of (T, A), where S is a finite collection of polygonal domains in A such that no two of them overlap. If V(T, A) < 00, u i is defined by J JE,O(Z; C;)dxdy, and V (T, A) is defined as in (2) by means of u i , then U(T,A)== V(T, A). The inequalities V(T;,A).s; V(T, A).s; V(T 1 , A) + V(T 2 , A)+ V(T3, A) hold trivially. 
899 F. Peano Area Consider (T, A) and n as in the previous sec- tion. Let r be the projection of R 3 onto a plane E, and denote by C' the image of the bound- ary of n under the composite mapping roT. Set v(T, n, E)= J h 10(z; C')I dO", where dO" is the surface element on E, and set i/J(T, n)= suPEv(T, n,E). Define P(T, A)=sup L i/J(T, n) S nES as in (2). This is called the Peano area of (T, A). H. Okamura defined area by integrating the tmapping degree instead of 10(z; C')I [5]. L, V, P all coincide, and hence L and V are invariant under any orthogonal transforma- tion ofR 3 . G. Other Definitions of Area As in the definition of Peano area, consider n, r, E, and denote the Lebesgue measure of r 0 T(n) by m(n, E). Ifwe set J1(n)=suP E m(n, E), then we can define the area of (T, A) by SUPE LnES J1(n). If v(n) ==(m 2 (n, Ed + m2(n, E 2 ) + m 2 (n, E3))I;2 is used instead of J1(n), then the Banach area of (T, A) is obtained, and if J J I O(z; C;)I dx dy is used instead of m(n, E;), then the Geocze area is obtained. Let us define various kinds of area for an arbitary tBorel set X in R 3 . Divide R 3 into meshes Mt, M 2 ,... which are half-open cubes with diameter of equal length d, and denote by mF) (i = 1,2,3) the tLebesgue measure of the projection of MjnX onto the ith coordinate plane. The limit of Li(m'l»)2 + (m»2 + (m»)2)1/2 as d-->O is called the Janzen area of X. Denote by m j the supremum with respect to the set of planes E in R 3 of the Lebesgue measure of the projec- tion of MjnX to E. Then limLjm j as d-->O is caIled the Gross area of X. C. Caratheodory covered X by a countable number of convex sets KI' K 2 ,... each of whose diameters is less than 15 > 0, denoted by mj the supremum of the Lebesgue area of the projection on Kj into E, and adopted IimL j mj as 15-->0 as his defini- tion of area of X. If the Kj are restricted to be spheres, then Caratheodory's area divided by n/4 is identical with the tHausdorff measure A 2 (X). (For other definitions and mutual relations  [4].) We give a measure-theoretic definition of an area for a surface (T, A) as follows. Denote by n(p) the number of points in A corresponding to a point pin R 3 , and call n(p) the multiplic- ity function of the mapping T. The integral A(T, A)=(n/4)J n(p)dA 2 (p) can be taken as a definition of the area. However, a different definition of the multiplicity function is needed for the integral to be equal to qT, A) [3,6]. 246H Length and Area In fact, if x = <p(u), y = i/J(u) (O.s; u.s; 1) represent a tpeano curve filling the square O.s; x.s; 1, O.s; y.s; 1, then the Lebesgue area of the sur- face (T, A) defined by A = {0<J1 < 1,0 < v < I} and T:x=<p(u), y=i/J(u), z=O is zero, but A(T,A);d. H. Mappings of Bounded Variation Let T be a mapping of a domain A in the w-plane into the z-plane, and define n, 0 = O(z; C), and S as in Section E. Set O+(z; C)=(IOI+ 0)/2, O-(z; C)=(101-0)/2, v(T, n)= f fIO(Z; c)1 dxdy, vI(T, n)== f f OI(Z; C)dxdy (the same signs correspond to each other), V(T, A)=sups LnESV(T, n), VI(T, A)=suPsLnEsvI(T, n), N(z; T, A)=sups LnEsIO(z; C) I, and NI(Z; T, A)=sups LnESOI(Z; C). Then N, NI are lower semicontinuous in the z-plane. The integrals W(T, A)= Sf N dxdy, W+(T, A)= JJ N+ dxdy, and W-(T,A)= J J N- dxdy are called the total variation, positive variation, and negative variation of T, respectively, and the equalities W = W+ + W-, V=W,and VI=W:I: hold. When W(T,A)< 00, T is said to be of bounded variation. A related notion is defined as follows: T is ab- solutely continuous if the following two con- ditions hold. (1) For any given E> 0 there exists a 15 > 0 such that LnES v(T, n).s; E when- ever the sum of the areas of n E S is .s; b. (2) For any polygonal domain no such that no U an o c A and any polygonal subdivision S of no, V(T, no) = LnES V(T, n). If the area of A is finite and T is absolutely continuous, then T is of bounded variation. Let T be a continuous mapping of bounded variation of a domain A in the w-plane into the z-plane. The derivatives V'(w), V (w), v' (w) of the set functions V(T, A), V+(T,A), V-(T,A) exist talmost everywhere (a.e.) in A and are finite, The difference J(w)= V(w)- V.'(w) is called the generalized Jacobian, and the rela- tion J(w) = V'(w) holds a.e. If x(u, v), y(u, v) are differentiable a.e., then J(w) coincides with the ordinary tfunctional determinant a.e. Next, let T be a continuous mapping of A into R 3 with V(T, A)< 00, and denote by Ji(w) the gen- eralized Jacobian of (1;, A). Then J(w) =(Jf(w) 
246 I Length and Area + Jl(w) + J? (W)) 1/2 is called the generalized Jacobian of(T,A). The relation J(w)=::!: V'(w) holds a.e, in A. Therefore V(T,A) f L J(w)dudv is valid. The equality holds if and only if each (1;, A) is absolutely continuous. I. Frechet Distance Let (Tj, Ad and (Tz, A 2 ) be surfaces, and as- sume that the set H of homeomorphisms be- tween Al and A 2 is nonempty. Define the Frechet distance between two surfaces by IIT I , T 2 11 = inf sup ITdw)-T2(h(w))I. hER wEA 1 It satisfies the three axioms of distance ( 273 Metric Spaces). When II T I , T 2 11 = 0, TI and T 2 are called equivalent (in the sense of Frechet). A set of all equivalent surfaces is called a Frechet surface. Equivalent surfaces have equal Lebesgue areas; hence Lebesgue area is weIl defined for any Frechet surface. Given a surface (T, A) with L(T, A) < 00, there exists a pair (TI' Ad, equivalent to (T, A) such that the functional determinants Ji(w) exist for (T I , AI) and V(T"Ad= f L1 J(w)dudv, where j2(w) = Lf1 Jf(w), as before. The prob- lem of finding such a pair (TI' Ad is called the representation problem. Moreover, we can choose (T!, Ad to be a generalized conformal mapping in the following sense: Express (T" AI) by x =x(u, v), y= y(u, v), z=z(u, v). Then xu, xv, ..., Zv exist a.e. in Al and are square integrable, x + y + z = x; + y; + z;, and XuXv + YuYv+ zuzv=O a.e. in AI' J. Higher-Dimensional Case In higher-dimensional spaces, many results are also known, such as area and coarea formulas [11] and a generalization of Morrey's result on Lebesgue area (C. Goffman and W. P. Ziemer, 1970). K. Hausdorff Dimension A. S. Besikovich [9J has shown that for every set S in a Euclidean space, there exists a real value D such that the td-dimensional Haus- dorff measure is infinite for d < D and van- ishes for d> D. This D is called the Hausdorff dimension, F. Hausdorff had already considered this measure for tCantor sets and Koch curves. 900 (3) A fractal [IOJ is defined as a set for which the Hausdorff dimension strictly exceeds the tdimension of the set, which is topologically defined in a Euclidean space. We show only one example, the coastline of a triadic Koch island constructed in the fol- lowing way. Consider an equilateral triangle with sides of unit length. Remove the middle third of each side, and attach in its place a V- shaped peninsula bounded by two sides of an equilateral triangle with side length 1/3. We thus get a Star of David. Repeat the same process of formation of peninsulas for each segment of the star's sides, If we continue this process indefinitely, then we get a complicated coastline whose Hausdorff dimension is 1.2618. Of course the tlength of this coastline IS 00. B. Mandelbrot mentions many interesting examples of fractals having fractional dimen- sions in his book [10]. References [I] L. Cesari, Surface area, Ann. Math. Studies, Princeton Univ. Press, 1956. [2J L. Cesari, Recent results in surface area theory, Amer. Math. Monthly, 66 (1959), 173- 192. [3J H. Federer, Measure and area, Bull. Amer. Math. Soc., 58 (1952), 306-378. [4J G. N6beling, Dber die Flachenmasse im Euklidischen Raum, Math. Ann., 118 (1943), 687 -701. [5J H. Okamura, On the surface integral and Gauss-Green's theorem, Mem. ColI. Sci. Univ. Kyoto, (A., Math.) 26 (1950), 5-14. [6J T. Rad6, Length and area, Amer. Math. Soc. Colloq. Pub\., 1948. [7] T. Rad6, Lebesgue area and Hausdorff measure, Fund, Math., 44 (1957),198-237. [8J S. Saks, Theory of the integral, Warsaw, 1937. [9J A. S. Besikovich (Besicovitch) and H. D. Ursell, Sets of fractional dimensions V, On dimensional numbers of some continuous curves, J. London Math. Soc., 12 (1937),18- 25, [1 OJ B. B. Mandelbrot, Fractals: Form, chance, and dimension, Freeman, 1977. [I I] H. Federer, Geometric measure theory, Springer, 1969. 247 (XXI.36) Lie, Marius Sophus Marius Sophus Lie (December 17, 1842- February 18, 1899), a Norwegian mathema- 
901 tician, is famous as the founder of the theory oft Lie groups From 1869 to 1870 he colla- borated with F. Klein on tsphere geometry, which led Lie to develop the concept of con- tinuous groups. This discovery was the step- ping stone that allowed Klein to complete his ideas for the tErlangen program. In 1872, Lie became a professor at the University of Chris- tiania (now Oslo) In 1886, Lie succeeded Klein in the chair of mathematics at Leipzig, where he remained until 1898, Then he returned to Christiania, where a post was created for him, but he died one year later. The continuous groups that Lie dealt with are today called the tLie transformation group germs. With the free use of geometric concepts and analytic methods (especially the theory of tditTerential equations) he was able to develop his theory and apply it to the theory of dif- ferential equations The significance of his work was not recognized until after his death. Early in the 20th century, E. tCartan and H. tWeyl were able to complete the theory of Lie groups, and by the middle of the century, the characteristics of the Lie group as a ttopolog- ical group were clarified. References [IJ F. Engel and P. Heegaard (eds.), Sophus Lie, Gesammelte Abhandlungen I-VI, Teub- ner,1922-1937. [2J S, Lie and F. Engel, Theorie der Transfor- mationsgruppen I-III, Teubner, 1888-1893 (Chelsea, second edition, 1970). [3J S. Lie and G. Scheffers, V orlesungen tiber continuierlicher Gruppen, Teubner, 1893 (Chelsea, second edition, 1971). 248 (IV.10) Lie Algebras A. Basic Concepts Let K be a tcommutative ring with unity. A set 9 is called a Lie algebra over K if the following four condi tlOns are satisfied: (i) g is a tleft K- module, where we assume that the unity of K acts on g as the identity operator. (ii) There is given a K-bilinear mapping (called the bracket product) (X, Y)-->[ X, YJ from 9 x 9 into 9: [IIXiXi,I#j YjJ= IIXi[3JXi' YjJ for all lXi, [3j in K and Xi, Yj in 9. (iii) [X, XJ =0 for every X in 9, (Hence [X, Y] = - [Y, XJ for every X, Y in g (alternating law)) (iv) [X, [Y,ZJJ + [Y, [Z, XJJ + [Z, [X, YJJ =0 for 248 B Lie Algebras every X, Y, Z in 9 (Jacobi identity). In partic- ular, if K =C (the complex number field) or K = R (the real number field), 9 is called a complex Lie algebra or a real Lie algebra, respectively. For example, let  be an tassociative alge- bra over K. Putting [X, YJ=XY- YX, we can supply 1 with the structure of a Lie alge- bra over K, which is called the Lie algebra associated with 1. In particular, if 1 is the ttotal matrix algebra Kn of degree n over K, then the Lie algebra associated with Kn is called the general linear Lie algebra of degree n over K and is denoted by 91(n, K). Let 9 be a Lie algebra over K and a, b be tK-submodules of g. The subset of 9 consisting of elements of the form L [A, BJ (finite sum) with AEa, BEb is denoted by [a,b], which is a K-submodule of 9 A K-submodule a of 9 is called a Lie subalgebra of 9 if [a, aJ ca. A sub- algebra a of 9 is called an ideal of 9 if [a, gJ c a (this condition is equivalent to [9, aJ c a). If a is a subalgebra of 9, the restriction of the bracket product of 9 on a makes a a Lie alge- bra over K. If a is an ideal of 9, the tquotient K-module g/a is a Lie algebra over K relative to the bracket product [X + a, Y + aJ = [X, YJ + a. This Lie algebra g/a is called the quotient Lie algebra of 9 modulo a. Let 91> 92 be Lie algebras over K. A map- pingf:91-->92 is called a homomorphism of 91 into 92 if f is K -linear and [([X, YJ) = [f(X),j(Y)J for every X, Yin 91' A bijective homomorphism is called an isomorphism. Then 91 is said to be isomorphic to 92 if there exists an isomorphism from 91 onto 92, and we write 91 92' If f:91-->92 is a homomor- phism, then {(9d is a subalgebra of 92, and the kernel a = j -I (0) of {is an ideal of 91' Fur- thermore, the homomorphism {induces an isomorphism 1: 9 d a --> f(9 d (homomorphism theorem ). The direct sum 9 I + 92 of two Lie algebras 91> 92 over K is defined as in the case of as- sociative algebras. Then 91> 92 are ideals of 91 +92' The set A (9) of all automorphisms of a Lie algebra 9 is a subgroup of the general linear group GL(9). A(9) is called the (full) automor- phism group of 9. B. Representations Let g be a Lie algebra over K, and let V be a K-module. Denote by i!(V) the associative algebra consisting of all K -linear mappings from V into V. Denote by 91(V) the Lie algebra associated with E(V). (Note that if V has a basis consisting of m elements over K, then 91(V)91(m, K).) A homomorphism P:9-->91(V) 
248C Lie Algebras is called a representation (more precisely, a linear representation) of 9 over V, and V is called the representation space of p. If V is a tfree K -module of rank m, then m is called the degree of the representation p. We also use (p, V) instead of p to mention the representa- tion space explicitly. The concepts concerning the representations such as tequivalence, tirre- ducibility, or complete reducibility are similar to the corresponding concepts found in the representation theory of associative algebras ( 362 Representations). In particular, by taking V = 9 and putting p(X) Y = [X, YJ (X, YEg), we obtain a representation of g, called the adjoint representation of g, and p(X) is denoted by ad(X). Then ad(g) = {ad(X) I X Eg} is a subalgebra of gl(g) and is called the adjoint Lie algebra of g. Let (p, V) be a representation of g. Then there is associated with this representation a tsymmetric bilinear form B p: 9 x 9 --> K given by Bp(X, Y)=trp(X)p(Y), where Bp satisfies the invariance property Bp([X,ZJ, Y)= Bp(X, [Z, YJ). In particular, if p = ad, then we write B instead of Bp, and B is called the Killing form of g. Let 9 be a Lie algebra over R of dimension n, and let ad' g-->gl(g) be the adjoint represen- tation of g. Put det(tI - ad(X)) = L'lo t j (X) for every element X E!1. Then the (X) are polynomial functions on g, and Pn= 1. Let I be the least integer such that P, # O. Then 1= trank !1. An element X of!1 is called regular (singular) if P,(X) # 0 (P, (X) = 0). The subset g' of 9 con- sisting of all regular elements of 9 is open and dense in g. The subset 9 - g' of 9 consisting of all singular elements of 9 is of measure zero with respect to the tLebesgue measure of g, which is obtained uniquely up to positive scalar multiples by means of a linear isomor- phism of 9 onto R n using any basis of 9 over R. Now suppose that !1 is reductive ( Section G). Then an element X E!1 is regular if and only if the centralizer 3x = {Y Eg I ad(X) Y =O} of X is a Cartan subalgebra ( Section I) of g. Furthermore, if X E!1 is regular, then ad(X) is a tsemisimple linear endomorphism of g. C. Structure of Lie Algebras Suppose that a, b are ideals of a Lie algebra g. Then [a, bJ is also an ideal of g. In particular, !1 has the following ideals: g' = [g, !1J, g" = [g', !1'J,..., g(i+I) = [g(i), g(i)J,..., Furthermore, we have 9 =:>!1' =:>!1" =:> .... This series is called the derived series of g, and g' is called the derived algebra of!1. The Lie algebra !1 is said to be Abelian if !1' = 0 and solvable if !1(k) = 0 for some k. Now put gl = g, g2 = [g, glJ, g3 = [g, g2J,..., !1 i + 1 = [!1, !1 i J,.... Then !1 1 , !1 2 ,... 902 are all ideals of g, and we have!1 1 =:>!1 2 =:> g3 =:> .... This series is called the descending cen- tral series of g, and 9 is said to be nilpotent if gk = 0 for some k. An ideal a of 9 is called Abelian (solvable, nilpotent) if the subalgebra a is Abelian (solvable, nilpotent). Put 3= {AEg I [X, AJ =0 for every X in g}. Then 3 is an Abelian ideal of g, called the center of g, and is the kernel of the adjoint representation of g. Define the ideals 31,32,... of 9 as follows: 31 is the center of g, 32/31 is the center of !1/31' ..., 3i+I/3i is the center of g/3i,"" Then we have 0 c 31 C 32 c.... This series is called the ascending central series of g, and 9 is nilpotent if and only if 3k = 9 for some k. We assume that K is a field of characteristic o and Lie algebras over K are of finite dimen- sion, Let XI"'" X n be a basis of 9 over K. Then the n 3 elements cf j in K defined by [Xi' XjJ = L cfjXk are called the structural constants of 9 relative to the basis (X;). D. Radicals and Largest Nilpotent Ideals The union r of all solvable ideals of 9 is also a solvable ideal of g, called the radical of g. The union n of all nilpotent ideals of 9 is also a nilpotent ideal of g, called the largest nilpotent ideal of g. The ideal s = [r, gJ is called the nilpotent radical of g. We have !1 =:> r =:> n =:> s. E. Semisimplicity A Lie algebra is called semisimple if its radical is O. A semisimple Lie algebra 9 over K is called simple if 9 has no ideals other than 9 and O. If r is the radical of a Lie algebra g, then g/r is semisimple. Every semisimple Lie algebra is a direct sum of simple Lie algebras. For example, put t(n,K)= {A =(aij)E!1I(n,K)1 a ij = 0 for every i <)} and n(n, K) = {A = (ai)E t(n, K) I all = a 2 2 = ... = ann =O}. Note that t(n, K) is the set of all lower triangular ma- trices and n(n, K) is the set of nilpotent lower triangular matrices. Then t(n, K) is a solvable subalgebra of gl(n, K), and n(n, K) is a nilpo- tent subalgebra of gl(n, K), Put sl(n, K) = {A Egl(n, K)I tr A =O}. Then sl(n, K) is an ideal of gl(n, K). For n;:: 2, sl(n, K) is a simple Lie algebra F. Theorems The following theorems are fundamental in the theory of Lie algebras: (I) Engel's theorem (valid even if K is of positive characteristic): Let V be a finite- dimensional vector space over a field K such that V # {O}. Let !1 be a subalgebra of !11(V) 
903 consisting of nilpotent elements. Then there is a nonzero element v in V such that Xv=O for every X in g. (Thus by choosing a suitable basis of V and identifying gl(V) with gl(n, K), we have 9 c n(n, K), where n = dim v.) (2) Lie's theorem: Let (p, V) be an irreduc- ible representation of a solvable Lie algebra g. Then p(g) is Abelian. In particular, if K is algebraically closed, then dim V == 1. (Thus for every representation (p, V) of a solvable Lie algebra 9 over an talgebraically closed field K, we have p(g) c t(n, K) by choosing a suitable basis of V.) (3) Cartan's criterion of solvability: Let 9 be a subalgebra of gl(n, K). Then 9 is solvable if and only iftr XY=O for every XEg and YE[g,g]. (4) Cartan's criterion of semisimplicity: A Lie algebra 9 is semisimple if and only if the Killing form B of 9 is tnondegenerate (i.e., B(X, g)=O, X Eg implics X =0). (5) Weyl's theorem: Every representation (of finite degree) of a semisimple Lie algebra is completely reducible (6) Levi decomposition: Let r be the radical of a Lie algebra g, Then there is a semisimple subalgebra s of 9 such that 9 = r + s, r n s = O. Furthermore, such a subalgebra s is unique up to automorphisms of 9 (A. I. Mal'tsev), (7) Ado's theorem (orginally proved only for the case of characteristic 0 for K; the case of positive characteristic was proved by K. Iwa- sawa): Let 9 be a finite-dimensional Lie alge- bra over a field K. Then there exists a repre- sentation (p, V) of 9 of finite degree such that gp(g). G. Reductive Lie Algebras A Lie algebra 9 is called reductive if the radical r of 9 coincides with the center 3 of g. The following four conditions for a Lie algebra 9 are mutually equivalent: (i) 9 is reductive; (ii) the nilpotent radical s of 9 is 0; (iji) the adjoint representation of 9 is complete1y reducible; and (iv) the derived algebra [g, gJ of 9 is seml- simple and 9 = 3 + [g, gJ (direct sum), where 3 is the center of g. A representation (p, V) of a reductive Lie algebra 9 is completely reducible if and only if p(X) is diagonalizable for every X in 3. For example, the Lie algebra \'J1(n, K) is reductive H. Derivations A linear mapping t5 : 9 --> 9 is called a derivation of the Lie algebra 9 if t5([X, YJ)= [t5(X), YJ + [X,o(Y)] for every X, Y in g. The set '1)(g) of all derivations of 9 is a subalgebra of gl(g), and :D(g) is called the Lie algebra of deriva- 248 J Lie Algebras tions of g. The adjoint Lie algebra ad(g) is an ideal of :D(g), and elements of ad(g) are called inner derivations of g. If 9 is semisimple, then :D(g)=ad(g)g. Now suppose that K == R (K = C). Then the group A(g) of automorphisms of 9 is a Lie group (complex Lie group), and the Lie alge- bra of A(g) is given by :D(g). Lie t5E:D(g) (cgl(g». Then expt5 (EGL(g)) is in A(g). The connected subgroup I(g) of A(g) generated by {expt5It5Ead(g)} is a tLie subgroup of A(g). Furthermore, I(g) is a normal subgroup of A(g), called the group of inner automorphisms of 9 or the adjoint group of g. Thus ad(g) is the Lie algebra associated with I(g). The quotient group A (g)/ I (g) is called the group of outer automorphisms of g. If 9 is semisimple, then I(g) coincides with the identity component of A(g). I. Cartan Subalgebras A subalgebra  of a Lie algebra 9 over K is called a Cartan subalgebra of 9 if (i)  is nilpo- tent and (ii) the normalizer n of  in 9 (i.e., n = {X Eg I [X, J c}) coincides with . If K is algebraically closed, then for every two Cartan subalgebras " 2 of 9 there exists an auto- morphism 0" of 9 such that O"(I)= 2' Fur- thermore, for such a 0" we can take an auto- morphism of the form 0" = exp(ad(Ad)... exp(ad(A,) (AI' ''', ArE g), where all the ad(A,) are nilpotent. J. Universal Enveloping Algebras Let 9 be a Lie algebra over a field K. Regard- ing 9 as a vector space over K, let T(\'J) be the ttensor algebra over g. Let J be the ttwo-sided ideal of T(g) generated by all elements of the form X @ Y - Y@X -[X, Y] (X, YEg). The quotient associative algebra U(g)== T(g)jJ is called the universal enveloping algebra of g. The composite of the natural mappings g--> T(g)--> U(g) is an injection g--> U(g), and we identify 9 with a linear subspace of U(g) by this mapping. Then we have [X, Y] =XY- YX (X, Yeg) in U(g). The algebra U(g) has no tzero divisors. In particular, if 9 is the Lie alge- bra of a connected tLie group G, then U(g) is isomorphic to the associative algebra of all tleft-invariant differential operators on G. For every subalgebra  of g, the universal envelop- ing algebra U() of  is isomorphic to the sub- algebra of the associative algebra U (g) gener- ated by 1 and . If 9 is the direct sum of two Lie algebras gl, g2, then U(g) is isomorphic to the ttensor product U(gl) @K U(g2)' Let a be an ideal of g, and let { be the two-sided ideal of U(g) generated by a. Then we have U(g)/{ 
248K Lie Algebras V(g/a). Now put V o = K 1. Define a linear sub- space Vi of V (g) by Vi = K . 1 + 9 + g' 9 + .., + U' i Then we have Voc VI c..., ViVjc V i + j , Ui Vi = V(g). Thus {VJ defines a tfiltration of V(g). Denote by G=Go+G t +G 2 +... (G o = V o , G i == VJV i - l ) the tgraded ring associated with this filtration, Then we have g=G t cG. Let X I' .,. ,X n be a basis of g, and let S == K [Y I , .." y"J be a t p olynomia1 ring on K in n indeterminates Y t ,..., Y". Then there exists a unique algebra homomorphism w:S-->G such that w(I)= 1, w(Y;)= Xi (i = 1, ..., n). Further- more, w is bijective, and the ith homogeneous component Si is mapped by w onto G i . Thus the set of monomials {Xl X2... X"} (it O, ... , in  0) forms a basis of V (g) over K (the Poincare-Birkhoff-Witt theorem). Every representation (p, V) of 9 over K can be extended to a ulllque representation (p', V) of V(g). Furthermore, P is irreducible (com- pletely reducible) if and only if p' is irreducible (completely reducible). Given two representa- tions PI' P2 of g, PI is equivalent to P2 if and only if P'I is equivalent to p. Now suppose that g is semisimple, and let X t, ..., X n be a basis of g. Using the Killing form B of g, put gij= B(X i , X). Denote the inverse matrix of (gij) by (gij). Define CE V(g) by C=L gijXiX j . The element c, called the Casimir element of the Lie algebra, is inde- pendent of the choice of the basis (Xi), is a well-defined element of V(g), and belongs to the center of V(g) For every absolutely irredu- cible representation P of V(g), p(c) is a scalar operator, and tr p(c) is a positive rational number. K. Complex Semisimple Lie Algebras We assume that K =C, although there is no essential change if we assume that K is an algebraically closed field of characteristic 0, A subalgebra I) of a complex semisimple Lie algebra g is a Cart an subalgebra of 9 if and only if I) is a maximal Abelian suba1gebra of 9 such that ad(H) is diagonalizable for every H in I). We fix a Cart an subalgebra I); dim I) is called the rank of g, and we denote the linear space consisting of all C-valued forms on I) by 1)*. For every cx in 1)*, let g,= {X Eg I ad(H)X =cx(H)X for all H in I)}. Then g, is a linear subs pace of g, and go = I). Define a subset  of 1)* by  = {CXEI)* I cx #0, (J, # {O}} Then  is a finite set. Elements of  are called 904 roots of 9 relative to I), and  is called the root system of 9 relative to I). For every root cx, g, is of dimension one, and 9 is decomposed into a direct sum of linear subspaces: g=l)+ L g,. 'EA For each root cx, g, is called the root subspace corresponding to cx. The restriction BI) of the Killing form B of 9 on I) is nondegenerate. Hence for every}, in 1)* there exists a unique element H A in I) such that ),(H)=B(HA, H) for all H in I). Thus we get a linear bijection 1)*-->1) defined by j,-->H;,. Via this bijection, BI) gives rise to a symmetric bIlinear form ()" Ji) = (H A , H) ()" JiEI)*) on 1)*. Denote by I) the real linear subspace of 1)* spanned by . Then the inner product (..1., Ji) defined on 1)* is positive definite on I). Hence with respect to this inner product, I) is an [- dimensional Euclidean space, where I = dim I). The root system  is a finite subset of the Euclidean space I). L. Properties of Root Systems (i) CXE implies -CXE Furthermore, among the scalar multiples of cx, only ::!::cx belong to . (ii) Let cx, fJE. Then 2(cx, fJ)/(cx, cx) is a rational integer. (iii) Let cx, fJ E  and fJ # ::!::cx. Then there exist unique nonnegative integers j, i such that {fJ + val VEZ} n == {fj -jcx, fJ -(j -l)cx, ..., fJ- cx, fJ, fJ + cx, ..., fJ + icx}. Furthermore,j - i = 2(cx, fJ)/(cx, cx), i + j.s; 3. The set {fJ + Zcx} n  is called the cx-string of fJ. Now let CXE. Denote by w, the treflection mapping of I) with respect to the hyperplane P,={xEI)I(cx,x)=O}, which is orthogonal to cx. Then we have w,(fJ) = fJ -(cx*, fJ)CXE for every fJ in , where cx* = 2cx/( cx, cx) Thus we have w.()= for every cx in. (iv) Let cx, fJE and fJ # ::!:: cx. Then the angle 0 between cx and fJ is one of the following: 30°,45°,60°,90', 120°, 135°,150°. Suppose, moreover, that 00.s;90° and (cx, cx) .s;(fJ, fJ). Then we have the following criteria: 0=30 0 =3(cx,cx)=(fJ,fJ); 0=45°= 2(cx,cx)=(fJ,fJ); 0=60° =(cx,cx)=(fJ,fJ). (v) Let cx, fJ, CX+fJE. Then [g.,gpJ =g,+p' Conversely, suppose that a finite subset  of a finite-dimensional Euclidean space E satisfies conditions (i) and (ii) together with a part of (iii): w.() =  for every cx in . Then  is a root system of some complex semisimple Lie algebra. M. Lexicographic Linear Ordering in I) Let A I' . .. ,AI be a basis of I) over R. Define a linear ordering}. > Ji on I) as follows: If j, = L().i' Ji=LI))i (i,l)i all in R), then A>Ji if 
905 and only if there exists an index s (1 .s; s.s; I) such that (i= /]i for i = 1,..., s-1 and (,> /]S' This linear ordering is called the lexicographic linear ordering of  associated with the basis (X;). Relative to this linear ordering, a root cx is called a positive (negative) root if cx > 0 (cx < 0). We denote the subset of  consisting of all positive (negative) roots by  + (-). A subset S of  coincides with  + for some lexico- graphic linear ordering of  if and only if the following conditions are satisfied:  = S U (-S); Sn( -S)= 0; cx, fJES, cx+ fJE imply CX+ fJES A positive root CXE + is called a simple root if cx cannot be expressed as the sum of two posi- tive roots. N. Fundamental Root Systems Let II be a subset of  consisting of 1 roots cx I' .., cx,. Then II is called a fundamental root system of  if (i) every element cx of  is ex- pressed uniquely as an integral linear combi- nation of the CXi (cx = L micx i ), and (ii) in this ex- pression, m l , ..., m, are either all ;:: 0 or all .s; O. For any lexicographic linear ordering of , the set of all simple roots forms a fundamental root system of  Moreover, every funda- mental root system of  is obtained in this manner. Let II = { cx I' . ., , CXI} be a fundamental root system of . Then L g" + L 9 _', generates g. The Lie algebra A is not simple if and only if II admits an orthogonal partition, i.e., II = III UII 2 , III # 0, II 2 # 0, III nII 2 = 0, and (cx,#)=O for every cxEII 1 and fJEII 2 . The F integers au = - 2( CX i , cx)/(CXj' CXj)( 1 .s; i, J .s; I) are called the Cartan integers of g relative to the fundamental root system II. Then we have au = -2, aij;::O for i #j. O. Borel Subalgebras and Parabolic Subalgebras Let b =  + La>o g,. Then b is a maximal solv- able subalgebra of g. The group I(g) acts transitively on the set of all maximal solvable subalgebras of 9, A maximal solvable subalge- bra of 9 is called a Borel subalgebra of g. A subalgebra of 9 is called a parabolic subalgebra if it contains a Borel subalgebra of g, Now let cl! be any subset of a given fundamental root system II == {CXl, ..., cxd. Denote by  -(cl!) the set of all negative roots cx = L nicx i such that n j = 0 for all cx j in <1>. Then P<I> = b + L'EA-(<I» g, is a parabolic subalgebra. Thus we get 2' para- bolic subalgebras {P<I> I <1> c II}. Every parabolic subalgebra is conjugate under I(g) to one and only one of the parabolic subalgebras {P<I> I <1> c II}. 248 Q Lie Algebras P. Weyl's Canonical Basis Let HI' '" ,H, be a basis of , and let E. be a basis of g, for each root cx. Then we have a basis {Hi, E,} of g. Such a basis is called Weyl's canonical basis if the following three conditions are satisfied: (i) cx(Hj)E R (j = 1, ..., I) for every CXE, (ii) the Killing form B of 9 satisfies B(E., E _a) = -1 for every cx E; and (iii) if cx, fJ, cx + fJE and [E" EpJ = N".pE,+p (N"pEC), then N"p is in Rand N"p=N_,,_p' The Lie algebra 9 always has Weyl's canonical basis. For such a basis {Hi' E,}, the linear space gu==LRJ=l Hj+L R(E,+E,) + LR(J=l (E,-L,)) is a semisimple Lie algebra over R. The Killing form of gu is negative definite. Every connected Lie group whose Lie algebra is gu is always compact. Furthermore, 9 = gu + J=l gu, gunJ=l 9u=0. Thus 9 is isomorphic to the Lie algebra g; = C @Rgu over C obtained from gu by extending the basic field R to C, and gu is called the unitary restriction of 9 relative to WeyJ's canonical basis {H;, E,}. A Lie algebra a over R is called a real form of 9 if a C = C @R a is isomorphic to g. When this is the case, A is called a complex form (or the complexification) of a. Note that a real form a of 9 can be regarded as a real subalge- bra of g such that 9 = a + J=l a, an J=1 a = O. A real Lie algebra a is called a compact real Lie algebra if its Killing form is negative definite. A real Lie algebra is compact if and only if it is semisimple and is the Lie algebra of some compact Lie group. The Lie algebra a of a compact Lie group A is the direct sum of its center 3 and some compact Lie algebra; hence a is reductive. A compact real form gu of a complex semi- simple Lie algebra g is called a compact form of g. The group I(g) of all inner automor- phisms of 9 acts transitively on the set of all compact real forms of 9 (regarding the real forms of g as real subalgebras of g). Q. Chevalley's Canonical Basis A complex semisimple Lie algebra 9 always has a basis {Hi, E,} (consisting of a basis HI, '" , HI of  together with a basis E, of g, for each root cx) such that (i) cx(H;)EZ for every CXE and i = 1,..., I; (ii) B(E., E_,)= 2/(cx, cx) for every CXE; and (iii) if cx, fJ, rx+ fJE and [E., EpJ =N"pE,+p (N"pEC), then N,.pEZ and N"p = - N_" -/1' Such a basis {Hi, E,} is called Chevalley's canonical basis. When we take this basis, the structural constants of 9 relative to {Hi, E,} are all integers. Thus gz = L ZH i + 
248 R Lie Algebras L ZE, is a Lie algebra over Z. Furthermore, gR = L RH i + L RE, is a real form of 9 with the property that there exists a Cartan subalgebra R of gR such that for each element H in R' all the eigenvalues of ad(H) on gR are contained in R. (In fact, we may take LRH i as R') Such a real form of 9 is called a normal real form. The group I(g) acts transitively on the set of all normal real forms of g. R. Weyl Groups The reflections w, (cx E) of the Euclidean space  generate a subgroup W of the group of all tcongruent transformations of. W is called the Weyl group of g relative to  and is repre- sented faithfully as a tpermutation group over the finite set . Hence W is a finite group, Let II = {CXl>"" cx t } be a fundamental root system of . Then W is generated by w, ,..., w" . The root system  coincides with th set {w(cx) I WE W, CXE II}. Let 3' be the set of all fundamen- tal root systems of . Then W acts on 3' and is tsimply transitive on 3'. If 9 is simple, two roots cx, fJ are conjugate under W if and only if (cx, cx)=(fJ, fJ). Now let r be the complement in  of the union of all the hyperplanes P, (CXE) orthogonal to cx. Then r is a W-stable open subset of . A connected component of r is called a Weyl chamber. W acts on the set 3'0 of all Weyl chambers and is simply transitive on 3'0' Let II = {CXI' ..., cx t } be a fundamental root system. Then the set {x E  I (x, cx;) > 0 for i == 1, ..., I} is a Weyl chamber, called the posi. tive Weyl chamber associated with II. Now fix any lexicographic linear ordering of  that has II as the set of simple roots. For WE W, put  = {CXE + I W(CX)E -}. Denote the car- dinality of  by n(w). Then n(w)=O=w= 1. Furthermore, w can be expressed as a product of n(w) factors w = w" ... W'j' where each factor is taken from {W'I' ... , w,) admitting repeti- tions, In fact, n(w) is the minimum length of the expression of was a product w = w,... w, (i,j = 1, .., , I). With respect to the genertors J W'I' ,.. , waf' W has the following system of tdefining relations: w;,=I, l.s;i.s;l, (w"w,)mij= 1, l.s;i<j.s;l, where m ij is the order of w" w'r Thus if Oij is the angle between CX i and cx j , we have mij = nj(n - OiJ Denote by T the set of all linear transfor- mations 0" of  such that O"() =. Then Tis also the subgroup of all congruent transfor- mations of t furthermore, T is a finite group, and W is a normal subgroup of T, Let II be a fundamental root system of  and put P = {CXE TIO"(Il)=II}. Then P is a subgroup of T, 906 and we have a semidirect product T= p. W. Elements of P are called particular transfor- mations relative to II. The group P TjW is isomorphic to the group A(g)jI(g) of outer automorphisms of g. S. Classification of Complex Simple Lie Algebras Let  be a Cartan subalgebra of a complex semisimple Lie algebra g, and let II = {cx l , ". , CXt} be a fundamental root system relative to . We associate with II the diagram (tl- dimensional complex) indicated in Fig. 1. This diagram is called the Dynkin diagram of 9 (also called the Schliitli diagram or Coxeter diagram). It is constructed as follows: with each CX i there is associated a vertex (denoted by a small open circle). These I vertices are con- nected by several segments as foIlows. Let Oij be the angle between CX i and cx j . (i) If Oij= 150", CX i and CXj are connected by three oriented seg- ments as in (1) of Fig. 1, where the orienta- tion means (cx i , cx;) > (CXj, cx). (ii) If Oij= 135°, CXi and CXj are connected by two oriented segments as in (2) of Fig. 1. (iii) If Oij= 120°, CX i and cx j are connected by a nonoriented single segment as in (3) of Fig. 1. (iv) If Oij = 90°, CXi and cx j are not connected. Ql Q) (1) ();;5$() Qj Q) at Q; (2) 0-=0 (3) 0---0 Fig. 1 A Dynkin diagram. The Dynkin diagram of g is independent of the choice of, II. Furthermore, two complex semisimple Lie algebras are isomorphic if and only if they have the same Dynkin diagram. A complex semisimple Lie algebra is simple if and only if its Dynkin diagram is connected. Fig. 2 gives all possible Dynkin diagrams A I U=l,2, Q) Q.! QJ Qf-l QI ------o--o BI (1=2,3,' QJ Q.: Q) Q, j Q, ------o=-o C I (1=3,4, ..) Q) a.! a \ at I a, -----(X=O    __n  1 1. QI I DI (1=4,5, )  a l Eb' b a 1 E; ba J EK baJ Qj Q.! Q{ Q-l FJ 0) Q, G, Fig. 2 Dynkin diagrams of simple Lie algebras. Note that dimrJ is A,' J2 + 21; B , :2J2 +1; C , 2J2+ 1, D, 2J2 -1; E 6 :78, E7: 133, E 8 .248, F 4 .S2; G 2 '14. 
907 associated with complex semisimp1e simple Lie algebras. There are seven categories, (The index 1 in Al means the rank of g.) Among these Al (l 1), BI (l 2), C I (l 3), Dl (l4) are called classical complex simple Lie algebras. EI (l = 6,7,8), F4' and G 2 are cal1ed exceptional complex simple algebras Note that Al  BI  C I , B 2 C2' A3D3' D 2 =AI +AI' Al (resp. BI' C I , D I ) is the Lie algebra of the complex Lie group SL(l + 1, C) (SO(21 + 1, C), Sp(l, C), SO(21, C). T. Classification of Real Simple Lie Algebras We refer the reader to [3] and the references at the end of [3] for the classification of simple Lie algebras over a general field k; in partic- ular, for k=R (Appendix A, Tables 5.1, 5.11), the algebras are closely related to the classification of irreducible symmetric Rieman- nian manifolds ( 412 Symmetric Riemannian Spaces and Real Forms), In particular, since compact semisimp1e real Lie algebras 9 are in one-to-one correspon- dence (up to isomorphism) with complex semi- simple Lie algebras gC obtained as the com- plexification of g, the classification of compact real simple Lie algebras reduces to the classifi- cation of complex simple Lie algebras. Hence they are also represented by the same Dynkin diagrams. Compact real simple Lie algebras of the types AI, B I , C I , and DI are called classical compact real simple Lie algebras. They are the Lie algebras of the compact Lie groups SU(l + 1), SO(21 + 1), Sp(l), and SO(21), respectively. Compact real simple Lie algebras of the type EI (I = 6,7,8), F4' and G 2 are called exceptional compact real simple Lie algebras. U. Satake Diagrams of Real Semisimple Lie Algebras Let 9 be a real semisimple Lie algebra, f be the suba1gebra associated with a tmaxima1 com- pact subgroup of the tadjoint group of g, and p be the orthogonal complement of f in 9 relative to the Killing form of g. Let a be a maximal Abelian suba1gebra contained in p, and let I) be a Cartan suba1gebra of 9 contain- ing a. Denote by gC, I)c the comp1exifications of g, I), respectively. Let (J be the tsemi1inear tautomorphism of gC defined by (J(X + P Y)=X -p Y(X, YEg). Then we have (J(l)c) = I)c. Thus (J acts on the root system d of gC relative to I)c as follows: «(Jcx)(H) = cx«(JH) (cx Ed), Thus (J acts on (I)c). There is a lexico- graphic linear ordering of (I) c) such that CXE + and (JCX # -IX imply (JcxEd +. Fix such an ordering, and let II be the set of simple roots 248W Lie Algebras relative to the ordering. Put (J = pw, where p is a particular transformation relative to II and W is an element in the Wey1 group W. Then p induces a permutation of order 2 on the set {cxEIII (JIX# -IX}. Suppose that a vertex be- longing to the Dynkin diagram of II corre- sponds to a simple root cx such that (JCX = -cx. Then replace the vertex by a small filled circle. Also, if two vertices are mapped to each other by p, then connect the two vertices by an arc with two arrows on the end. The diagram thus obtained is called the Satake diagram of g. The Satake diagram of 9 is independent of the choice off, a, I) and of the ordering of(l)c). Two real semisimple Lie algebras are isomor- phic if and only if they have the same Satake diagram, and 9 is simple if and only if its Sa- take diagram is connected. Thus real simple Lie algebras are classified by their Satake diagrams [16]. V. Iwasawa Decomposition of Real Semisimple Lie Algebras Let 9 be a real semisimp1e Lie algebra. Take f, a, I) and the ordering of (I)c) as in the con- struction of Satake diagrams ( Section U). Let n be the intersection of 9 with the sub- space L(gc)., where the sum is taken over CXE d + such that (JCX # -cx. Then n is a nilpotent suba1gebra of g, and we have a decomposition of 9 into the direct sum of linear spaces: 9 = f + a + n, This decomposition is called an Iwa- sawa decomposition of g. Iwasawa decom- positions are unique in the following sense: Let 9 = f' + a' + n' be another Iwasawa decompo- sition; then there exists an inner automor- phism A ofg such that Af=f', Aa=a', An=n'. For the cohomology theory of Lie algebras and Lie algebras over fields of characteristic p > 0, in particular the theory of restricted Lie algebras  [3]. For the relationship between Lie algebras and the theory of finite groups (e.g., Chevalley's simple groups, tBurnside problems)  [7] and the references therein. W. Representations Let 9 be a complex semisimp1e Lie algebra and I) be a Cartan suba1gebra of g. We fix I) and a lexicographic linear ordering on I). Let II = {cx I' ... ,IX I } be the set of simple roots. Since every representation of 9 is completely reduc- ible, we restrict ourselves to the explanation of irreducible representations. Let (p, V) be a representation of g, For each AEI)*, put VA = {VE Vlp(H)v=A(H)v for all H EI)}. Then VA is a linear subspace of V, and ..1. is called a weight of the representation p (relative to I)) if 
248 X Lie Algebras VA # {O}; then dim VA is called the multiplicity of the weight X The set of all weights of P is a finite W-stable subset of . Denote this set by {AI, ..., Ar}' Then V is decomposed into a direct sum: V =  +... + VA . The maximum element among p\, ... Jr} ith respect to the given ordering of is called the highest weight of p. The following two theorems are basic in determining irreducible representations. (1) Cartan's theorem: Let A I' Az be the highest weights of irreducible representations PI' pz of g, respectively. Then PI is equivalent to pz if and only if Al =A 2 . (2) Cartan-Weyl theorem: Let AE, Then there exists an irreducible representation P of 9 which has ..1. as its highest weight if and only if (i) 2(A,cx)/(cx,cx)EZ for every root CXE (such an element AE is called an integral form on ); and (ii) w(A).s; A for every WE W (such an ele- ment AE is called dominant). These theorems lead to the concept of the system of fundamental representations. Put cxt = 2cx;/(cx i , cx i ), and let AI, ..., Al be the basis of  dual to cx!, ...,cxt «(Ai, cxj) =biJ Then the tfree Abelian group L ZA i coincides with the module P of all integral forms. An element L miA i (m i E Z) is dominant if and only if m I ;:: 0, .., ,ml;:: O. Denote by P + the semigroup in P consisting of all dominant elements in P. For j = 1, .,. , I, let (Pj, Vi) be the irreducible representation of 9 which has Aj as its highest weight. The system {PI' ..., pJ is called the fundamental system of irreducible representa- tions associated with II. The irreducible rep- resentation that has A = L miAiE p+ as its highest weight is constructed as follows: Put Vim = Vi @ '" @ Vi (mth tensor power of Vi) and V = V;" 1 @ ... @ Vt'. Then V can be regarded as a representation space of 9 in a natural manner. Let V be the smallest g-stable sub- space of V containing VA = (vd: @ ... @ (V;):. Then V gives an irreducible representation of 9 with highest weight A Thus by decomposing VII @... @ vt', we get all irreducible rep- resentations of g. This is why {PI, ..., PI} is called the fundamental system of irreducible representations. X. Relation with Representations of Compact Lie Groups Let G be a compact, connected, semisimple Lie group. Then every Cartan subalgebra  of the Lie algebra g of G is Abelian. We call dim  the rank of g or of G. Let H be the connected Lie subgroup of G associated with . Then H is a maximal torus (toroidal subgroup) of G. Fur- thermore, every maximal torus of G is conju- gate to H in G. Also, every element of G is 908 conjugate to an element of H in G, Let N = N(H) be the tnormalizer of H in G. Then every O"E N induces an automorphism Ad(O") of g, which induces an automorphism (denoted by the same symbol Ad(O")) of the complexifi- cation gC of g. We have Ad(O")(c) = c, and Ad(O") preserves the root system of c. Fur- thermore, the restriction of Ad(O") on (c) is an element W a of the Weyl group W of gC relative to c. Via this mapping O"-->W a , we have N / H  W. Thus we identify N / H with W. Since (c = P , we can define a lexico- graphic linear ordering using a basis of. Fix such an ordering. Now every representation P of G over a complex vector space V induces a representation dp of 9 over V, where dp is the differential of p ( 249 Lie Groups). Then we have a representation dp of gC over V. We also call a weight ..1. relative to c of the representa- tion dp of gC a weight of the representation P of G relative to H. Representations PI' pz of G are equivalent if and only if representations dPI, dpz of gC are equivalent. Denote by AI' A 2 the highest weights of dPI, dpz, respective- ly. Then PI is equivalent to pz if and only if Al =Az' The module P of all integral forms in (c) coincides with the set of all elements in (c) that are weights (relative to c) of some repre- sentation of gC. Denote by P G the subset of P consisting of all elements in P that are weights (relative to H) of some representation of G, Then P G is a submodule of P such that [P: PGJ < 00. Put Pl; =PGnp+, Then the mapping (representation p) --> (the highest weight A of p) induces a bijective mapping from the set of all classes of irreducible representations of G onto Pl; . The exponential mapping exp: --> H is a surjective homomorphism from  to H. The kernel r G of this homomorphism is a tlattice group in  of rank 1 ( = dim ); a basis HI>"" HI of r G over Z is also a basis of  over R. Now define linear forms AI, ... ,AI on  by ().i' H) = bij' Then we have P G =2nPLZA j . In other words, an element AE(c) is in P G if and only if A(H)E2np Z for every element H in r G . This characterization of P G also char- acterizes pl; = P G n P+. In particular, Gis simply connected =P= P G =r G =2np. LZcxt, where CX I , .,., cx I are simple roots and cxt = 2cx;/(cx i ,cx;) for i= 1, ...,1. We also have G = I (g) = the group of inner automorphisms of g=P G =LZCX i =r G =2np LZGi, where £1' ..., G{ are elements of (gc) defined by (cx;, G) = bij' In general, we have P =:> P G =:> L ZCXi, 2n.J=!LZCX;Cr G c2nj=1 Z[;i' Fur- thermore, the tfundamental group n l (G) of G is given by P/PGrG/(2npLZCXn Also, 
909 the kernel of the adjoint representation G-->I(G)=I(g) is given by PGiLZCXi(2nj=1. LZF.;)/r G . Y. Invariant Measure on G Keeping the definitions in this section the same as in the previous section, we see that every root cx defines a representation h--> X,(h) of the group H of dimension lover (gc), as follows: Let E, be a basis of (gc),. Then Ad(h)E, = X,(h)E,. We have X,(h) = e'(X) if h =expX (XEI)), i.e., X,oexp=e' We let e' stand for X,: e'(h) = X,(h), hE H. Now let dg, dh, dm be the tin variant measures on G, H, M = G/ H, respectively, normalized by L dg= 1, L dh= 1, L dm== 1. Then for every continuous function f on G, we have Lf(9)d9= L(Lf(m,h)dm)Q(h)dh, (1) where w is the order of the Weyl group Wand f(m, h) is a function on M x H defined by f(m, h)= f(ghg- I ), m =gH. Note that f(m, h) is well defined. Finally, Q(h) is a function on H defined by Q(h) = IT (e,(X)/2 _e-'(X)/2) 'EA for h=expX (X EI)), and Q(h) is called the density on H. Denote by D(h) the same prod- uct as Q(h), letting cx range over  +. Then we have Q(h)=D(h) D(h) = ID(hW;::O. In particular, if r is a tclass function on G (i.e., f(xyx-I)=f(y) for every x, YE G), f(m, h)= f(h). Hence (1) is simplified into the following integral formula for class functions: f f(9)d9= f f(h)Q(h)dh. G w H Z. Weyl's Character Formula Keeping the definitions in this section the same as those in the previous section, we let (p, V) be an irreducible representation of G and X p be the character of p(Xp(g) = tr p(g)). Let A be the highest weight of P, Then since A determines P up to equivalence, X p must be determined by A In fact, X p is given via A by Weyl's character formula (3). (Note that G = U gHg- 1 . Hence X p is determined by its restriction on H.) Now for hE H, we have ( h ) = (AH(h) X p (o(h) , 248 Ref. Lie Algebras where (; VEP) is the alternating sum (A(h)== L det(w)e(W(A»)(X), h=expX. WEW (4) Finally, 15 in (3) is given by 1 15=- L 0(, 2 :IEd+ that is, 15 is the half-sum of the positive roots. In particular, we have (o(h) = D(h), Denote by mAV) the multiplicity dim VA of a weight A of p. Then we have Xp(h)=LAmA(},)e;(h), Further- more, we have mA(A)= 1, mA(A)=mA(w(l)) (WE IV), and mA(A) is given by Kostant's formula: mA(.')= L det(w)P(w(A+15)-(A+15)), (5) WEW where for each JiE P, P(Ji) is the number of ways Ji can be expressed as a sum of positive roots. Thus P(Ji) is the number of nonnegative integral solutions {k,} of Ji = L'EA+ k 7 cx. Now suppose that (PI' Vd, (P2, V 2 ) are irreducible representations of G. Their tensor product (PI @ P2, VI @ V 2 ) is decomposed into a direct sum of irreducible constituents: PI @ P2 = L m(Ji)pp, where P p is the irreducible rep- resentation of G which has Ji as its highest weight and m(Ji) is the multiplicity of P P in PI @ P2 Then m(Ji) is given by Steinberg's formula: m(Ji) = L L det(ww')P (w(A I +(5) weW w'eW +W'(i\2 + (5)-(Ji + 2(5)), (6) where PI = PAl' P2 = PA 2 The partition function P(Ji) that appears in (5) and (6) satisfies the following recursive formula (B. Kostant): P(Ji) = - L det(w)P(Ji- (15 - w(15))). (7) WEvv,v#-l (2) References (3) [1] Sem. "Sophus Lie," Ecole Norm. Sup, 1954-1955. [2] N. Bourbaki, Elements de mathematique, Groupe et a1gebres de Lie., Actualites Sci. Ind., Hermann, ch, 1-3, 1960; ch. 4-6, 1968. [3] N. Jacobson, Lie algebras, Interscience, 1962. [4] S. He1gason, Differential geometry, Lie groups, and symmetric spaces, Academic Press, 1978. [5] E. Cartan, Oeuvres completes pt. I, Gauthier-Villars, 1952. [6] C. Chevalley, Theory of Lie groups I, Princeton Univ. Press, 1946; Theorie des groupes de Lie, Actualites Sci. Ind., Hermann, 11,1951; III, 1954. 
249 A Lie Groups [7] I. Kaplansky, Lie algebras, Lectures on modern mathematics 1, T. L. Saaty (ed.), Wiley, 1963, 115-132. [8] H. Wey1, Theorie der Darstellung kon- tinuierIicher halb-einfacher Gruppen durch linearen Transformationen I-III, Math. Z., 23 (1925),271-309; 24 (1926), 328376, 377-395 (Gesammelte Abhand1ungen II, Springer, 1968, 543-647). [9J H. Weyl, The classical groups, Princeton Univ. Press, revised edition, 1946. [10] I. D. Ado, Dber die Darstellung von Lieschen Gruppen durch Jinearen Substitu- tionen, Bull. Soc. Physico-Math. Kazan, (3) 7 (1934-1935), 343. [11] F. R. Gantmacher, On the classification of real simple Lie groups, Mat. Sb., 5 (47) (1939),217-250. [12] E. B. Dynkin, The structure of semis imp Ie Lie algebras, Amer. Math. Soc. Transl., 17 (1950). (Original in Russian, 1947.) [13] R. Bott, Homogeneous vector bundles, Ann. Math., (2) 66 (1957), 203-248. [14] I. Satake, On representations and com- pactifications of symmetric Riemannian spaces, Ann. Math., (2) 71 (1960),77-110. [15] H. Freudenthal, Oktaven, Ausnahme- gruppen und Oktavengeometrie, Mathemat- isch Instituut der RiJksuniversiteitte, Utrecht, mimeographed note, 1951. [16] S. Araki, On root systems and an in- finitesimal classification of irreducible sym- metric spaces, 1. Math. Osaka City Univ., 13 (1962), 1-34. [17J l-P. Serre, Lie algebras and Lie groups, Benjamin, 1965. [18] l-P. Serre, A1gebres de Lie semi-simples complexes, Benjamin, 1966. [19] 1. E. Humphreys, Introduction to Lie algebras and representation theory, Springer, 1972. Also  references in [3,4]. 249 (IV.9) Lie Groups A. Definitions A set G is called a Lie group if there is given on G a structure satisfying the following three axioms: (i) G is a group; (ii) G is a tparacom- pact, treal analytic manifold (G need not be connected); and (iii) the mapping G x G-->G defined by (x,y)-->xy-l is trea1 analytic ( 105 Differentiable Manifolds). For simplicity, real analyticity is denoted by C"': for example, C"'-functions, C"'-mappings. If we replace real analyticity by tcomp1ex 910 analyticity in axioms (ii) and (iii), then we have the axioms (i), (ii'), and (iii') of a complex Lie group. We consider the real analytic case, since the complex analytic case can be dealt with similarly. Every element (J of G defines a mapping G -->G given by X-->(JX (x-->x(J), denoted by La (Ra) and called the left (right) translation of G by (J. La, Ra are automorphisms of G as a C"'- manifold. Therefore, given a tvector field X on G, a tdifferentia1 form w on G, or, in general, a ttensor field Ton G, we can apply La and Ra in a natural manner to the given tensor field and obtain a tensor field La T, Ra Ton G. A tensor field Ton G is called left (right) invar- iant if La T= T (Ra T= T) for every (JE G. A tensor field on G is of class C'" if it is left or right invariant. B. Lie Algebras of Lie Groups Let G be a Lie group. Then the set .I( G) of all C"'-vector fields on G has the structure of a vector space over the real number field R. Furthermore, with respect to the bracket oper- ation [X, Y]=XY- YX (X, YE.I(G)), .I(G) forms a tLie algebra over R. Denote by 9 the subset of .I(G) consisting of all left invariant vector fields on G. Then 9 is a tsuba1gebra of the Lie algebra .I(G). Thus 9 is also a Lie algebra over R. This Lie algebra 9 is called the Lie algebra of the Lie group G. The linear mapping from 9 into the ttangent space I;,(G) of G at the identity element e given by X --> Xe is bijective. Hence dim 9 = dim G. The Lie algebra 9 is often identified with I;,(G) via this bijection. C. Simply Connected Covering Lie Groups For any finite-dimensional Lie algebra 9 over R, there exists a tconnected Lie group G that has g as its Lie algebra. Such Lie groups are all tlocally Isomorphic. Among these groups, there exists a tsimply connected one that is unique up to isomorphism. This group is called the simply connected covering Lie group of the Lie algebra g. D. Lie Subgroups A subgroup H of a Lie group G is called a Lie subgroup if (i) H has the structure of a Lie group and (ii) the inclusion mapping <p: H-->G is a C"'-mapping, and the tdifferentia1 d<p is injective at every point of H (i.e., as a manifold, H is a tsubmanifold of G). Moreover, if H is a connected manifold, it is ca1\ed a connected Lie subgroup. For a subgroup H of a Lie group 
911 G, there exists at most one structure of a Lie group on H that makes H a connected Lie subgroup. (If H is not assumed to be con- nected, then this uniqueness statement is not generally true.) A subgroup H of a Lie group G is a connected Lie subgroup if and only if H is arc-wise connected 1M. Kuranishi and H. Y amabe). Let H be a Lie subgroup of a Lie group G. Then the tangent space T,,(H) is a subspace of T,,(G). Under the isomorphism Te(G)g men- tioned in Section B, the subspace T,,(H) corre- sponds to , which is a tsuba1gebra of the Lie algebra g. We callI) the subalgebra of 9 asso- ciated with the Lie subgroup H, and  can be identified with the Lie algebra of H in a natural manner. By this mapping H -->, we have a bijection from the set of all connected Lie subgroups of G onto the set of all suba1ge- bras of g. For example, the connected Lie subgroup G' of G corresponding to the tde- rived algebra g' of 9 is the tcommutator sub- group of G. The Lie algebra of a normal sub- group of G is an tidea1 of g. Conversely, if the Lie algebra I) of a connected Lie subgroup H of a connected Lie group G is an ideal of g, then H is a normal subgroup of G. A connected Lie group G is called semi- simple, simple, solvable, nilpotent, or Abelian (or commutative), respectively, whenever the Lie algebra 9 is ( 248 Lie Algebras). This definition of solvability, nilpotency, and com- mutativity agrees with the corresponding definition in which G is regarded as an ab- stract group. E. Closed Subgroups The topology of a Lie subgroup H (which we call the inner topology of H) as a submanifold of a Lie group G need not coincide with the relative topology of H regarded as a subspace of a topological space G. The inner topology of H coincides with the relative topology of H if and only if H is closed in G. Conversely, for every closed subgroup H of G, H has the struc- ture of a Lie subgroup such that the inner topology and relative topology coincide (E. Cartan). Moreover, such a structure of a Lie subgroup on H is unique. We always regard closed subgroups of a Lie group as Lie sub- groups in this sense. Also, we denote the Lie algebras of Lie groups G, H,... by the corre- sponding lower-case German letters g, , ... . F. Homogeneous Spaces Let G be a Lie group, and let H be a closed subgroup of G. Then the tquotient topological space (coset space) M =G/H of the topological 249J Lie Groups group G modulo H has the structure of a C"'- manifold such that the canonical mapping n: G--> M and the action of G on M: G x M--> M (defined by (g, xH)-->gxH) are both C"'- mappings. Furthermore, such a C"'-manifo1d structure on M is unique, The C"'-manifold M thus obtained is called the homogeneous space of Gover H ( 199 Homogeneous Spaces). Put n( e) = p. Then I) is the kernel of the dif- ferential mapping T,,(G) = g--> Tp(M). Thus we can identify the tangent space Tp(M) of M = G/H at p with the quotient linear space g/ via this differential mapping. G. Quotient Lie Groups Suppose that H is a closed normal subgroup of a Lie group G. Then G/H has the structure of a quotient group together with the structure of a manifold as a homogeneous space. Now G/H is a Lie group with respect to these two structures. The Lie group G/H is called the quotient Lie group of Gover H. In this case,  is an tidea1 of g, and the quotient Lie algebra g/ is isomorphic to the Lie algebra of the Lie group G/H. H. Direct Product of Lie Groups Let G I , G 2 be Lie groups. Then the direct product G 1 x G 2 satisfies axioms (i), (ii), and (iii) of a Lie group ( Section A) in a natural manner. The Lie group G I x G 2 thus obtained is called the direct product of the Lie groups G I , G 2 . The Lie algebra of G I x G 2 can be identified with the direct sum of gl, g2' I. Cartan Subgroups A subgroup H of a group G is called a Cartan subgroup of G if H is a maximal nilpotent subgroup of G and moreover, for every sub- group HI of H of finite index in H, [N(HI): HIJ < 00, where N(Hd={gEGlgHIg- I =H I } is the tnormalizer of HI in G. A connected Lie group G always has a Cartan subgroup. Fur- thermore, every Cartan subgroup H of G is closed; hence H is a Lie subgroup. The Lie algebra I) of H is a tCartan subalgebra of the Lie algebra g. This mapping H --> is a bijection from the set of all Cartan subgroups of G onto the set of all Cartan suba1gebras of 9 [7]. J. Borel Subgroups Let G be a complex connected semisimple Lie group. A maximal connected solvable Lie subgroup of G is called a Borel subgroup of G. 
249K Lie Groups Any two Borel subgroups are conjugate in G. A subgroup P of G is called a parabolic sub- group if P contains a Borel subgroup of G. Every parabolic subgroup is a connected closed Lie subgroup of G. K. Simple Examples Let V be a finite-dimensional vector space over R. Let .\!(V) be the tassociative algebra of linear endomorphisms of V. The tgeneral linear group GL(V)={xE.\!(V)ldetx#O} over V is a Lie group. The Lie algebra of GL(V) is denoted by gl(V). We can identify gl(V) with E(V), and we have [X, Y]=XY- YX for all X, YEgl(V). If dim V=n, GL(V) may be identi- fied with the group GL(n, R) of all real nonsin- gular n x n matrices. Then gl(V) may be identi- fied with the Lie algebra gl(n, R) of all n x n real matrices. The following are examples of (closed) Lie subgroups of the general linear group: the tspeciallinear group SL(n, R), whose Lie alge- bra is {XEgl(n,R)ltr X =O}; the tunitary group U(n), whose Lie algebra is {X Egl(n, c)1 'X +X=O}; the torthogonal group O(n), whose Lie algebra is {XEgl(n, R)I 'X +X =O}; and the tsymplectic group Sp(n), whose Lie algebra is {X Egl(2n,c)IJX +'XJ =0, 'X +X =O}, where J=(_ ) and I is the n x n unit matrix. Examples of complex Lie groups are: the complex tgenerallinear group GL(n, C), whose Lie algebra is gl(n, C); the complex tortho- gonal group O(n,C), whose Lie algebra is {X E91(n, CJI'X +X =O}; and the complex tsympletic group Sp(n, C), whose Lie algebra is {X Egl(2n, C)IJX + 'XJ =O}, where J is the matrix given above. L. Compact Simple Lie Groups If a connected Lie group G has a tcompact real Lie algebra, then G is compact Thus for each compact real simple Lie algebra 9, the simply connected covering Lie group G is compact, and the center of G is a finite group. The connected compact Lie groups SU(l+ 1), SO(21 + I), Sp(l), SO(21) have, respectively, the compact real simple Lie algebras Al (I:;:; 1), BI (l:;:;2), C I (1:;:;3), DI (1:;:;4) as their Lie algebras; these groups are called classical compact simple Lie groups. Sp(n) (n:;:;2) and SU(n) (n:;:;2) are simply connected, but SO(n) (n=3 or n:;:; 5) is not, and has the fundamental group of order 2. The universal covering group of SO(n) (i e., the simply connected, connected 912 Lie group that has the same Lie algebra as SO(n)) is Spin(n). The connected compact Lie groups that have EI (l = 6,7,8), F 4 , or G 2 as their Lie algebras are called exceptional com- pact simple Lie groups. The simply connected covering Lie groups of E 8 , F 4 , G 2 have the identity group {e} as their center. Hence they coincide with their tadjomt groups. The simply connected covering Lie groups E 6 , E7 have as their centers the groups of orders 3 and 2, respectively. Hence their adjoint groups are not simply connected. M. Complex Simple Lie Groups For a complex Lie group G, the definition of the Lie algebra 9 is similar to that for real Lie groups. Then 9 is a Lie algebra over the com- plex number field C and is calIed the complex Lie algebra of the complex Lie group G. The complex connected Lie grou ps SL(l + 1, C), SO(21 + 1, C), Sp(l, C), SO(21, C), which have the tclassical complex simple Lie algebras AI> BI> C I , DI as their Lie algebras, respectively, are called classical complex simple Lie groups. Complex connected Lie groups that have texceptional complex simple Lie algebras EI (l = 6, 7, 8), F4' G 2 as their Lie algebras are called exceptional complex simple Lie groups. N. Homomorphisms A mapping <P:G 1 -->G 2 from a Lie group G 1 into a Lie group G 2 is called an analytic homo- morphism (C"'-homomorphism) if (i) <p is a homomorphism between groups and (ii) <p is a C"'-mapping between manifolds. Moreover, if <p is bijective and <p -I is also a C"'-mapping, then <p is called an analytic isomorphism (C"'- isomorphism), Two Lie groups G j , G 2 are said to be isomorphic as Lie groups if there exists a C"'-isomorphism from G l onto G 2 ; we denote this situation by G 1 G2' Now let <P:G 1 -->G 2 be a C"'-homomorphism. Then the tdifferential (d<P)e 1 : 7;,1 (Gd--> T.,(G 2 ) induces a Lie algebra homomorphism d<p: 91 -->g2' The kernel of d<p is the Lie algebra of the kernel of <po Suppose that G 1 is connected and simply connected, Then for every Lie group G 2 and every Lie algebra homomorphism t/1: 91 -->g2, there exists a unique C"'-homomorphism <p: G 1 -->G 2 such that t/1 = d<p. We can replace condition (Ii) in the definition ofC"'-homomorphism by the following weaker one: (ii') <p is a continuous mapping. In particular, for a given topological group G, there exists at most one structure of a Lie group on G preservmg the group structure and the topology. A topological group G admits a structure of a Lie group (preserving the group structure and the topology) if and 
913 only if G is a tlocally Euclidean topological group ( 423 Topological Groups N). O. Representations Let G be a Lie group, and let V be a finite- dimensional vector space over C(R). Then a continuous (hence C W _) homomorphism p: G -->GL( V) is called a complex (real) represen- tation of G. The linear space V is called the representation space of P, and dim V is called the degree of p. To be more precise, a represen- tation is denoted by (p, V) instead of p. The function X p : G-->C defined by Xp(g) = tr p(g) (g E G) is called the character of p ( 362 Rep- resentations). The representation (p, V) of G gives rise to a representation (dp, V) of g. Sup- pose G is connected Then two representations (Pi, 1-';) (i = 1,2) of G are tequivalent if and only if the representations dpi and dp2 of A are equivalent. For the tdirect sum of representa- tions and the contragredient representation, we have d(PI +p2)=dPI +dP2' d('p- I )= -'(dp). For the ttensor product of representations, we have (d(PI (8) P2))(X)=(dpd(X) (8) I V2 + I V1 (8) (dp2)(X), (For representations of compact Lie groups  248 Lie Algebras W.) P. Adjoint Representations An element (J of a Lie group G defines an analytic automorphism <Pa: X-->(JX(J-I of G The tdifferential Ad«(J) of <Pa is an automor- phism of the Lie algebra A. Since Ad«(J)EGL(g), we have a representation (J-->Ad«(J) of G whose representation space is g. This is called the adjoint representation of G, and the image Ad(G) of G is called the adjoint group of G. Ad(G) is a Lie subgroup of GL(g). The tcenter Z of G is a closed subgroup of G, and GjZ  Ad(G). The analytic homomorphism (J--> Ad«(J) from G into GL(A) gives rise to a Lie algebra homomorphism g-->gl(g) by taking the differential of (J-->Ad«(J). Denote this Lie alge- bra homomorphism by X -->ad(X) (X Eg). Then ad(X) Y= [X, YJ for every X, Y in g, Thus ad: g-->gl(g) coincides with the tadjoint representation of the Lie algebra g. If G is connected, then the Lie subalgebra ad(g) (the tadjoint Lie algebra of g) of gl(A) is associated with the connected Lie subgroup Ad(G) of GL(g). Thus the adjoint group Ad(G) coincides with the group I (g) of tinner automorphisms of 9 (the tadjoint group of g). If G is connected and semisimple, 9 is also semisimple. Furthermore, we have :D(g) = ad(g). Hence the adjoint group Ad(G) = I(g) of G coincides with the connected component of the tautomorphism group A (g) of 9 contain- ing the identity element. If G is compact or 249Q Lie Groups semisimple, then every representation of G is completely reducible. Let G be a connected Lie group of dimen- sion n, 9 be the tLie algebra of G, and Ad: G -->GL(A) be the adjoint representation of G. Put det«t+ 1)I-Ad(x))=LjotjDj(x) for every x E G. Then each Dj is an analytic func- tion on G, and Dn = 1. Let I be the least integer such that Dl # 0 Then I = rank G = rank g. An element x of G is called regular (singular) if D1(x) #0 (D1(x) = 0). The subset G* of G con- sisting of all regular elements of G is open and dense in G, The subset G - G* of G consisting of all singular elements of G is of measure zero with respect to the left-invariant tHaar mea- sure of G. Now suppose that 9 is treductive. Then an element XE G is regular if and only if the cen- tralizer 3x = {X Eg I Ad(x)X = O} is a tCartan subalgebra of g. Furthermore, if XE G is regu- lar, then Ad(x) is a semisimple linear endomor- phism of g. Q. Canonical Coordinates With each element X of the Lie algebra A of the Lie group G there is associated a one- parameter subgroup of G, i.e., a continuous homomorphism t--><p(t) from the additive group R of real numbers into G, such that d<p(Xo)=X, where Xo=djdt is the basis of the Lie algebra of R. Furthermore, the con- tinuous homomorphism <p is unique. Putting <p(I) = exp X, we define the exponential map- ping exp:g-->G. Then we have <p(t)=exptX for every t E R. In particular, for the case G = GL(n,C), g=gl(n,C), we have expX =LXmjm! for every X Eg. Thus expX coincides with the usual exponential mapping of matrices ( 269 Matrices). The mapping exp:g-->G is a C'"- mapping whose differential at X =0 is a bijec- tion from To(g) = 9 onto 7;,( G). Thus for each basis Xl'''' X n of A, there exists a positive real number £ with the following property: {exp(LXiX;) I Ix;! <£ (i = 1,..., n)} is an open neighborhood of the identity element e in G on which (J=exp(LxiXi)-->(Xl,' ., x n ) (Ixil <£, i = 1,. ., n) is a tIccal coordinate system, These local coordinates are called the canonical coordinates of the first kind associated with the basis (X;) of g. Similarly, we have a local coordinate system r = (exp Xl X d... (expxnX n ) -->(x I, .., , x n ) in a neighborhood of e; these x I' . , X n are called the canonical coordinates of the second kind associated with the basis (X;)ofg. Let <p: G I -->G 2 be a continuous homomor- phism from a Lie group G l into a Lie group G 2 . Then <p(exp X) = exp(d<p(X)) for every X Eg l . The Lie subalgebra I) associated with a 
249R Lie Groups connected Lie subgroup H of G is character- ized by using the exponential mapping as fo1\ows:  = {X E!11 exptX EH for all tER}. R. Multiplication Functions Fix a basis Xl' ..., X n of the Lie algebra !1 of a Lie group G. Let (u 1 , ..., un) be the canonical coordinates of the first kind associated with the basis (XJ With respect to this coordinate system, let (x;), (y;), (z;) be the coordinates of the elements 0", T, O"T, respectively. Then each Zi is a real analytic function in (x I> ... , X n ; Yl, ,.., Yn)' The tTaylor expansion of Zi at XI = X 2 ==... = Yl = Y2 =... =0 is given as follows: Put S== LXiXi' T= LYiXi' Then  I k I Zi= L.. ,,(S T u;)o k,lok,1. =x i + Yi + (STu;)o + (terms of degree :;:'3). Furthermore, let (Wi) be the coordinates of T-lO"-ITO'. Then we have wi=([T,SJuJo+(terms of degree :;:,3). Let g* be the set of all R-va1ued linear forms on !1, that is, g* is the dual vector space of !1. We can identify the elements of!1* with the left-invariant tdifferential forms of degree 1 on G. These forms are called Maurer-Cartan differential forms. Let OJ 1, " . , OJ n be a basis of A* which is dual to the basis Xl"'" X n of A: Wi(X j ) = b ij (1.s; i,j.s; n). The texterior derivative dOJ k of OJk is a left-invariant differential form of degree 2 on G and is expressed as a linear combination of the texterior products OJ i /\ w j as follows: dOJ k = -" ck.w /\ OJ. 2 l) 1 J' I.J where (ct) are the tstructural constants of 9 with respect to the basis (Xi), i.e" [Xi' XD =LkCtXk' Using the canonical coordinates of the first kind (u i ) associated with the basis (XJ, put OJ k == OJk(u, du) = LJ Ak/U)duj. Then the matrix  = (Ak/ u )) is given by .I .I 2 111=1 +-+-+ ..., 2! 31 where .I = (Ci/ U ), Ci/U) = LkCjkUk> i,e" -.I is the matrix of ad(L UkX k ) relative to the basis (Xi)' Thus, in particular, if G is a nilpotent group, .I is a tnilpotent matrix. Once the functions Ak/u) are known, then the functions Zi= h(xl> ...,xn;y" "',Yn) de- scribing the multiplication in G (note that 0" +-> (x;), r+->(y;), ar+->(zi)) are obtained as folIows. By the left-invariance of the OJ i , we have the following Maurer-Cartan system 914 of differential equations: OJi(Z, dz) = OJ,(Y, dy), 1 .s; i.s; n. (3) Regarding Xl' ... , X n as parameters, put Zi = <Pi(Yl' ...,Yn)' Then (3) is equivalent to o<Pj LAi/z)-:;-==Aik(y), l.s;i, k.s;n. j °Yk (3') Now (3') is tcompletely integrable, by (1). By solving the system (3') of differential equations under the initial conditions <p/O, ... ,0) = Xj' 1 .s;j.s; n, (4) we obtain the multiplication functions Zi =h(x 1 , ""Xn;YI> ...,Yn) [1]. By the exponential mapping exp:g-->G from the Lie algebra 9 of a Lie group G, a suffi- ciently small neighborhood U of the zero element of!1 is mapped bijectively and bi- real-anal ytically onto a neigh borhood V of the identity element of G ( Section Q above). Now by taking U sufficiently smalI, the prod- uct (exp X) (exp Y) for X, Y in U can be explicitly given in the form expZ by the Campbell-Hausdorffformula. Namely, Z Eg is given by Z=cl(X, Y)+c 2 (X, Y)+..., where cdX, Y)==X + Y and C2, c 3 ,... are defined recursively by 1 (n+ l)c n +1(X, Y)=-[X - Y,cnJ 2 + L KzpL'[Ck1,[...[Ck2P'X+YJ...]' p:;'1.2p<;n (1) where the second summation is taken over all 2p-tuples (k 1 , ..., k 2p ) of positive integers kl> ..., k 2P satisfying k 1 +... + k 2P = n, Here the K 2 , K4' K6'''' are rational numbers defined by the Taylor expansion Z 1 w ---z=l+ L K Z2p. 1-e- z 2 pl 2p For example, (2) 1 c 2 =2"[X, YJ, 1 C3=-[[X, Y], YJ 12 1 - 12 [[X, YJ, XJ, 1 C 4 = - 48 [Y, [X, [X, Y JJJ 1 --[X, [Y, [X, YJJJ, 48 etc. [23]. 
915 S. Maximal Compact Subgroups Any compact subgroup of a connected Lie group G is contained in a maximal compact subgroup of G. Any two maximal compact subgroups of G are conjugate in G. Let K be a maximal compact subgroup of G. Then K is connected and G is homeomorphic to the direct product of K with a Euclidean space R m (Cartan-Mal'tsev-Iwasawa theorem). T. Iwasawa Decomposition Let G be a connected semisimple Lie group, Suppose that the center of G is a finite group. Let 9 = f + a + n be an tIwasawa decompo- sition of the Lie algebra 9 of G ( 248 Lie Algebras). Then the connected Lie subgroups K, A, N associated with f, a, n, respectively, are aJl closed subgroups of G. Furthermore, K is a maximal compact subgroup of G, A is isomor- phic to the additive group RS for a suitable s, and N is a simply connected nilpotent Lie group. The mapping K x A x N -->G given by (k, a, n)-->kan is bijective and is an isomorphism of analytic manifolds. The decomposition G == KAN is caJled an Iwasawa decomposition of G. An Iwasawa decomposition is unique in the following sense: Let G = K' A' N' be another Iwasawa decomposition. Then there exists an element 9 in G such that gKg- I =K', gAg-I =A', gNg- 1 =N' [9,3]. U. Complexification of Compact Lie Groups Let G be a compact Lie group. Denote by C(G) the commutative associative algebra of aJl C-valued contmuous functions defined on G relative to the usual multiplication in C(G). Then for each O"EG, La(Ra) acts on C(G) by Laf= foLa (RJ = foRa). Now put o(G)= {lEC(G)1 dim LaEGCLJ< oo}. Then o(G) is a subalgebra of C(G), called the representative ring of G. Elements of o(G) are called represen- tative functions of G because for an element f E C( G), f E o( G) if and only if there exists a continuous matrix representation O"-->(dij(O")) of G such that f is a C-linear combination of the dij. For each O"EG, La and Ra preserve o(G). With respect to the uniform norm Ilflloo = maxaEGlf(O")1 of C(G), o(G) is everywhere dense in C(G) (Peter-Weyl theory) ( 69 Com- pact Groups B). This implies the existence of a faithful ( = injective) representation p : G--> GL(n, C) of G. Furthermore, o(G) is a finitely generated algebra over C. Thus there exists a representation O"-->(dij(O")) such that o(G) is generated over C by the dij' Such a representa- tion is caJled a generating representation. 249U Lie Groups Denote the group of alJ automorphisms of the algebra o(G) by A. Let G be the centralizer in A of the subgroup {LaIO"EG} of A. Then we have a bijective mapping cx-->cx' defined by cx'(/) = (cx(/))(e) (/EO(G)) from the group G onto the set Homdo(G), C), which is an affine variety. Thus every generating representation p: O"-->(di)O"))t i,jn defines a faithful matrix representation p of G by p(cx) = (cx'(di))t i,}n by means of the bijection cx -->cx' from G onto Homdo(G), C). Furthermore, the image p(G) is an talgebraic subgroup of GL(n, C) given by {[3(diJti,jnl[3EHomdo(G),C)}. Thus G has the structure of a tlinear algebraic group, which is actually independent of the choice of a generating representation. (Hence G also has the structure of a complex Lie group,) Now 0" -->Ra defines an injective continuous homo- morphism from G into G, and we can regard G as a subgroup of G. Then for CXE G, cx is in G if and only if cx(J) = cx(/) for every f in o(G) (Tannaka duality theorem) ( 69 Compact Groups D). G is a maximal compact subgroup of G, and every maximal compact subgroup of G is conjugate to G in G. Then complex Lie algebra 9 of G is isomorphic to the complexifi- cation gC = C @Rg of the Lie algebra 9 of G. G is homeomorphic to the direct product of G with a Euclidean space R N , where N =dim G. For a complex analytic function <p defined on G, <p = 0 -= <p IG = O. In particular, G is the clo- sure of G relative to the tZariski topology of G. The group G is called the Chevalley com- plexification of G, which we denote by G C . Let G 1 , G 2 be compact Lie groups, and let <p:G 1 -->G 2 be a continuous homomorphism. Then <p can be extended uniquely to a rational homo- morphism <p c: G f --> G f In particular, every complex representation (p, V) can be extended uniquely to a trational representation (pc, V) of G C . Every complex analytic representation p of G C is a rational representation, and p = p c, where p == piG' Thus P is completely reducible. Also, we have a bijection from the classes of irreducible continuous representa- tions of G onto the classes of irreducible complex analytic representations of G C . For a closed subgroup H of G, the complexification H C of H coincides with the closure of H in G C relative to the Zariski topology of G C . If (p, V) is a generating representation of G, then GCpc(Gc)cGL(V). Furthermore, pC(G c ) is an algebraic subgroup of GL(V) that is completely reducible on V. Conversely, let F be an algebraic subgroup of GL(V) that is completely reducible on V. Then there exists a compact Lie group G such that GC F (as algebraic groups). If an algebraic subgroup F of GL(n, C) satisfies 'F= F, then F Gc, where G=Fn U(n). Now let F be a connected semi- simple complex Lie group. Then FGc for 
249 V Lie Groups every maximal compact subgroup G of F. In particular, F has a faithful representation. For example, let G U(n), O(n), SO(n), Sp(n), respectively. Then G C  GL(n, C), O(n, C), SO(n,C), Sp(n,C), respectively [1]. V. History S. Lie was the first to consider Lie groups, in the late 19th century; at that time they were called continuous groups. His motivation was to treat the various geometries from a group- theoretic point of view ( 137 Erlangen Pro- gram) and to investigate the relationship be- tween differential equations and the group of transformations preserving their solutions. Lie groups were studied locally, and the notion of Lie algebras was introduced. It was observed that the properties of Lie groups reflect the properties of Lie algebras to a large degree. Even in this early stage, the notions of solvable and semisimple Lie algebras were introduced together with proofs of basic properties. The Galois theory of linear differential equations (by E. Vessiot and others) is contained in Lie's theory. Early in the 20th century, the theory of infinite-dimensional Lie groups was studied by E. Cartan. After him, however, there was a long lull until it was taken up again by M. Kuranishi and others in the 1950s. We re- strict ourselves to the consideration of finite- dimensional Lie groups. From 1900 to 1930, E. Cartan and H. Weyl obtained a complete classification of the semi- simple Lie algebras and determined their representations and characters. They also devised useful methods for investigating the structure of these algebras, and did pIOneering work on global structures of the underlying manifolds of Lie groups. After them (1930- 1950), these resu1ts were systematized and refined by C. Chevalley, Harish-Chandra, and others. In the same period, K. Iwasawa [9J clarified Cart an's idea, showing that the only Lie groups that are topologically important are compact Lie groups. He also obtained the Iwasawa decomposition, which has become a basic tool in the study of semisimple Lie groups At the same time, Iwasawa contri- buted to Hilbert's fifth problem, which seeks to characterize Lie groups among topological groups. This problem was solved by A. M. Gleason, D. Montgomery, L. Zippin, and H. Yamabe in 1952. Since 1950, the topological properties of Lie groups have attracted con- siderable attention. The methods of Cartan, who postulated de Rham's theory, and H. Hopf, who used the properties of groups ex- tensively, were succeeded by systematic appli- 916 catIOns of the general theory of algebraic top- ology. In particular, the topological theory of tfiber bundles was applied to homogeneous spaces G/H by A. Borel and others. Thus homology groups of Lie groups were com- pletely determined, and homotopy groups of Lie groups were determined to a considerable extent. At the present time, the following fields, rather than Lie groups themselves, are objects of extensive study: structure and analysis of homogeneous spaces, algebraic groups, infinite-dimensional unitary representations, and finite or discrete subgroups ( 13 Alge- braic Groups, 109 Differential Geometry, 427 Topology of Lie Groups and Homogen- eous Spaces, 437 Unitary Representations). References [IJ C. Chevalley, Theory of Lie groups I, Princeton Univ. Press, 1946. [2J L. S. pontryagin, Topological groups, first edition, Princeton Univ. Press, 1939, second edition, Gordon & Breach, 1966. (Original in Russian, 1954.) [3J S. Helgason, Differential geometry and symmetric spaces, Academic Press, 1962. [4] E. Cartan, Sur la structure des groupes de transformations finis et continus, Thesis, Paris, Nony, 1894. (Oeuvres completes, Gauthier- Villars, 1952, pt I, vol 1, p. 137-287.) [5J H. Weyl, The structure and representation of continuous groups, Lecture notes, Institute for Advanced Study, Princeton, 1935. [6J H. Weyl, The classical groups, Princeton Univ. Press, revised edition, 1946 [7J S. Lie, Theorie der Transformationsgrup- pen, Teubner, 1,1888; 11,1890; III, 1893 (Chel- sea, 1970). [8J F Peter and H. Weyl, Die Vollstiindigkeit der primitiven Darstellungen einer geschlos- senen kontinuierlichen Gruppe, Math. Ann., 97 (1927), 737-755, (WeyL Gesammelte Ab- handlungen, Springer, 1968, voL 3, p. 58-75.) [9] K. Iwasawa, On some types of topological groups, Ann. Math., (2) 50 (1949),507-558. [1 OJ G. D. Mostow, A new proof of E. Cartan's theorem on the topology of semi- simple groups, BulL Amer. Math. Soc., 55 (1949). 969-980. [11J C. Chevalley, On the topological struc- ture of solvable groups, Ann. Math., (2) 42 (1941),668-675. [12] D. Montgomery and L. Zippin, Topolog- ical transformation groups, Interscience, 1955. [13J A, Borel, Topology of Lie groups and characteristic classes, BulL Amer. Math. Soc., 61 (1955),397-432. 
917 [14J Y. Matsushima, Differentiable manifolds, Dekker, 1972. (Original in Japanese, 1965.) [15J G. P. Hochschild, The structure of Lie groups, Holden-Day, 1965. [16J P. M. Cohn, Lie groups, Cambridge Univ. Press, 1957. [17J 1.-P Serre, Lie algebras and Lie groups, Benjamin, 1965. [18J H. Freudenthal, Linear Lie groups, Academic Press, 1969. [19J N, Bourbaki, Elements de mathematique, groupes et algebras de Lie, Actualites Sci. Ind., Hermann, 1960-1975, chs. 1-8. [20] 1. F. Adams, Lectures on Lie groups, Benjamin, 1969. [21J G. Warner, Harmonic analysis on semi- simple Lie Groups I, II, Springer, 1972. [22J N. R. Wallach, Harmonic analysis on homogeneous spaces, Dekker, 1973. [23J V. S. VaradaraJan, Lie groups, Lie alge- bras, and their representations, Prentice-Hall, 1974. [24J S. Helgason, Differential geometry, Lie groups, and symmetric spaces, Academic Press,. 1978. 250 (XVII.3) Limit Theorems in Probability Theory A. General Remarks Let Sn be the number of successes in n tBer- noulli trials with probability p for success. Then the classical de Moivre-Laplace limit theorem implies that ( Sn-np ) I f x 2 / 2 P <x -->- e- U du J np(1-p) J2; -00 (n--> 00), for all x. It is also known that for any I: > 0 P(ln-ISn-PI>f.)-->O (n-->oo), which is a special case of the law oflarge num- bers. General forms of such convergences of sequences of trandom variables .are explained in this article. In tprobability theory, theorems concerning tconvergence in distribution, tconvergence in probability, and talmost sure convergence of sequences of trandom variables are sub- sumed under the generic term of limit theo- rems. When the tprobability distributions of a sequence of random variables converge to the distribution F ( 341 Probability Measures F), then F is called the limit distribution. 250 B Limit Theorems in Probability Theory B. Convergence in Distribution of Sums of Independent Random Variables A sequence of random variables {X nk} (1 .s; k.s; kn,n?; 1) (k n --> 00) is said to be infinitesimal if maxI ';;k<;k. P(IX nkl ?; £) -->0 (n --> 00) for every 1:>0. When X nl ,..., X nk . are tmdependent for every nand {Xnd (1.s; k.s; k n , n?; 1) is infi- nitesimal, the set of all limit distributions for the sums Sn=X n1 +... +X nk . -An (the An are suitable constants) coincides with the class of infinitely divisible distributions ( 341 Probability Measures G). Suppose that the tcharacteristic function f of an infinitely divis- ible distribution F is given by tLevy's canon- ical form (12 t 2 logf(t)=iyt-- 2 f O- ( itx ) + e itX -l--------z dM(x) -00 l+x f ro ( itx ) + e itx -1 - -------z dN(x), 0+ I +x and the distribution function of X nk is Fndx). Then a necessary and sufficient condition for the distribution of Sn -Yn to converge to F for some sequence {Yn} is that (i) for every tcon- tinuity point x of M(x) and N(x), k. L Fndx)-->M(x) (x<O), k1 k. L (Fnk(x)-l)-->N(x) (x>O), k=l limlimsup I ( r x 2 dF nk (x) t:-O n-oo k:::::l JIXI<t: _ ( r XdF nk (X» ) 2 ) JIXI<' =Iimliminf I ( r x 2 dF nk (x) t:-O n_oo k=l JIX!<t: _ ( r XdF nk (X) ) 2 ) J1X!q =(12. Applying these results to special distribution functions, we obtain various kinds of limit distributions. (1) The Central Limit Theorem. In the cen- trallimit theorem, the limit distribution is a tnormal distribution. A necessary and suffi- cient condition for the distributions of the sums Sn= Bn- l (Xl +... +X n )- An (Bn--> 00) of a sequence of independent random variables {X n } to converge to the tstandard normal distribution N(O, 1) and for {B;I X k } (l.s; k.s; n, 
250B Limit Theorems in Probability Theory n:;:' 1) to become infinitesimal is that lim t r dFn(x)=O and n-+a:J k=l JIXI>CB n lim Bn- 2 f ( r X2 dFk(x) n--+oc k=l J 1 x l<FB I1 - ( r XdF k (X» ) 2 ) = 1, J1XI<fB n where Fk(X) is the distribution function of X k . When the tvariance of each X k is finite, we put n n B; = L V(X k ), An = B;;1 L E(Xd, kl kl where V(X), E(X) stand for the variance and mean of X, respectively. Then the necessary and sufficient condition is replaced by the Lindeberg condition: lim Bn- 2 f r x 2 dF k (x+E(X k ))=0 n-+C1J k=l Jlxl>£B n for every c> O. In particular, if the X n have the same distri- bution with finite variance, the corresponding Lindeberg condition is satisfied, and the cen- trallimit theorem holds. When the variables give outcomes of independent tBernoulli trials, the proposition is reduced to the classical theorem of de Moivre and Laplace. Moreover, if X n has the finite tabsolute moment mno == E(IX n l2+0) of order 2+6 and E(X n ) = 0, then the Lindeberg condition is implied by the Lyapunov condition n B;;(2+0) L mo-->O (n-->oo). k1 (2) The Law of Small Numbers. In order that the distributions of the sums Sn = X n1 + .., + X nk , of infinitesimal independent random variables converge to the tPoisson distribution P(}.) with mean A, it is necessary and sufficient that for every CE(O, 1), k, f ! k1 R, dFnk(x)=O, R,={xllx-11:;:'c, Ixl:;:,c}, k, 1 lim L dFnk(x) = A; n-+::xJk=l Ix-ll<f k, f lim L xdFnk(x)=O; "-+'n k=l Ixl<£ lim I ( r X2 dFnk(x) n-+c£ k=l JIXI<C and - ( r x dF nk (X» ) 2 ) == O. JIXI<' From this proposition follows the classical law of small numbers, If the probability of success 918 Pk for the kth outcome of independent trials satisfies kpk = A with constant}. independent of k, then the total number Sn of success up to the nth trial converges in distribution to the Pois- son distribution with mean }., (3) The Law of Large Numbers. In the law of large numbers the limit distribution is the tunit distribution. Given a sequence of independent random variables X n with distribution func- tion Fn(x) with mean an, a necessary and suffi- cient condition for n- 1 LZ1 (X k -a k ) to con- verge to 0 in probability is that lim t r dFk(x+a k ) ==0, n-+x k=l J Ixl>n 1 n f lim - I x dFk(x + a k ) = 0, and n-'<I::nk=l Ix!<n lim  f ( r x 2 dF k (x+ad n-+ex n k=l J Ixl<n - ( r x dFk(x + a k » ) 2 ) = O. Jlxl<n In particular, this is the case when (i) X n has the finite variance V(X n ) and n-2LZ1 V(X k ) -->0; or (ii) the X n , n:;:, 1, obey the same distri- butIOn with finite mean. (4) Convergence to Quasistable Distributions. The set of limit distributions for the sums Sn = Bn- 1 (Xl +.. + Xn)- An of identically distri- buted independent random variables {X n } (with suitable constants An' Bn) coincides with the set of quasistable distributions ( 341 Probability Measures G), If G(x) is the distri- bution function of the Xi, in order for the limit distribution to be normal it is necessary and sufficient that lim K 2 r dG(X) ! r x 2 dG(x) = O. K«. J1xl>K Jlxl<K In order for the limit distribution to be quasi- stable with index cx (0 < cx < 2) it IS necessary and sufficient that . G(-x) C 1 hm-=- C l +c 2 >0; and xx 1 - G(x) c 2 ' . I-G(x)+G(-x) hm =a' for every a >0 HX 1-G(ax)+G(-ax) In this case the characteristic function of the limit distributIOn IS given by Levy's canonical form with M(x)=c 1 Ixl-X, N(x)= -c 2 Ixl-', 0"=0 (5) Refinement of Central Limit Theorem. Let {X n } be a sequence of Identically distributed independent random variables with E(X;) = 0, 0"2 = V(X i ), E(IXd3)< 00. Let <l>n(x) be the distribution function of Sn=(X 1 +... +X n )/ 
919 O"j;; and <I>(x) the distribution function of the normal distribution N(O, 1). Then we have <I>n(x) - <I>(x) exp( -x 2 /2) ( 1 ) =  (Q(x)+R(x))+o j;; uniformly in x, where Q(x)=(1-x 2 )E(Xf)/60"3; R(x) is identically zero when Xi does not have a tIattice distribution; but R(x) = dO"-1 RI ((x + an)O"j;; d- I ), RI(x)=[xJ-x+ 1/2, and a n = - j;; aO"-1 when Xi has the lattice distribu- tion with maximal span d: P(XiE {a + kdl k= 0, ::!:: 1, ." } ) = 1. More accurate asymptotic expansions for <I>n(x) are derived when the Xi have absolute moments of higher orders [1]. Asymptotic expressions are also obtained when the Xi are not identicaIly distributed [2J or the limit distribution is tstable [3]. (6) The Local Limit Theorem. The local limit theorem is concerned with the density function of the limit distribution. Let {X n } be a se- quence of identicaIly distributed lattice vari- ables with finite variances as in (5). Let Sn=(O"j;;)-1 f (Xk-E(X k )), k1 Sn,j=(O"j;;)-1 d(j -nE(Xd). Then uniformly in}, O"j;; P(Sn = Snj) (2n)-1/2 exp( - S;j/2)--> 0 (n --> 00). The classical de Moine-Laplace theorem for Bernoulli trials is a special case of this theo- rem. When Xi has a density function, a cor- responding limit theorem holds. Local limit theorems are derived also when (i) higher moments exist, (ii) the limit distribution is stable, or (iii) the component variables are not identicaIly distributed [2,3]. (7) Large Deviations. Let {X n } be a sequence of identically distributed independent random variables with E(X;) = 0,0 < 0"2 = V(X;) < 00. Let <I>n(x) be the distribution function of Sn = (Xl +... + Xn)/O"j;; and <I>(x) the distribu- tion function of the standard normal distribu- tion N(O, 1). In many problems in the field of applications of probability theory, the asymp- totic behavior of 1 - <I>n(x n ) as X n --> 00 with n is of interest. This is ca\1ed the problem of large deviation. Assume that E(exp(aIX,j))< 00 for some a> O. Then for x n ?: 1, X n = o( j;;) and n--> 00 the following assertions hold: l-<I>n(x n ) - ex p ( x }' (  ))( I +O (  )) , 1-tlJ(x n ) j;; jn j;; :((::; - ex p ( )1 A( j? ) )(1 +O ( )). 250D Limit Theorems in Probability Theory where A(Z) is a power series involving the tsemi-invariants of Xi and is convergent for sufficiently smalllzi [4,5]. This result has been generalized for the case of random variables not identically distnbuted [5,6J Bounds for 1 -<I>n(x n ) have also been derived by several authors ( [7J). C. The Strong Law of Large Numbers and Its Refinements A sequence of random variables {X n } is said to obey the strong law of large numbers if n -I LZI (X k - a k ) tends to zero with proba- bility I as n-->oo, when constants a k are pro- perly chosen. When the component variables of the sequence are independent, a useful suffi- cient condition for validity of the strong law of large numbers is ex) 1 I zE(X n -E(X n ))2<00. n=l n According to Birkhoff's individual ergodic theorem ( 136 Ergodic Theory B), it is neces- sary and sufficient that the mean of a compo- nent variable exist for a sequence of identically distributed independent variables in order for it to obey the strong law of large numbers. Suppose that {X n } is a sequence of inde- pendent random variables with E(X n ) = 0, 0"; = E(X;) < 00, h; = O" + ... + 0"; --> 00, and P(I Xnl < lnbn, n = 1,2,...) = 1 for a decreasing sequence }.n tending to 0 as n-->oo. If An= 0(1/<p3(b;)) for a certain increasing continu- ous function <p, the probability that XI + ... + X n > bn<p(b;) for infinitely many n equals 0 or f ro 1 I according as the integral _<p(t)e-<p 2 (t)/2 dt I t converges or diverges [8]. In particular, when <p(t) =(210g(2) t + 310g(3) t + 210g(4)t +... + 210g(k-l) t + (2 + F.)log(k)t)I/2, where log(2) t = log logt, etc., the relevant prob- ability is equal to 0 or 1 according as F. is positive or negative, If we take <p(t)=2log(2)t, we are led to Khinchin's law of the iterated logarithm: If IXnl = o(b n / Jlog(2)b; ), then ( Xl + ... + X n ) P lim sup - 1 = 1. noo J 2b; 10g(2)b; D. Functionals of Sums of Independent Variables (1) Consider a recurrent event which occurs at successive time periods "C I, "C I + '2, ... , where 
250E Limit Theorems in Probability Theory {'n} is a sequence of nonnegative independent random variables with the same distribution. Let F be the distribution function of TI' m = E(,d, (J2 = V(,d < 00, and N n be the number of occurrences to time n. Then I . ( n (Jfi ) 1 f x -u2 / 2 1m P Nn,?;---x =- e duo noo m m 3 / 2 -oo When (J2 = 00 and P('I < 00)= 1, a necessary and sufficient condition for suitably norma- lized N n to converge in distribution is that for every a>O the relation (I-F(x))/(I-F(ax)) -->a'(x--> 00),0.s; cx < 2, hold. When O.s; cx < 1, P(N n (1-F(n))<x)-->'P,(x). When 1 <cx<2, P«N n -nm- l )m1+I/'b n - 1 <x)-->'P,(x) for b n such that 1-F(bn)n-\ where 'Po (x) = 1- e-X(x>O), 'P,(x)= l-<I>,(x- I /') (x>O) for 0< cx< 1, 'P,(x) = 1 -<1>,( -x) for 1 < cx<2, and <1>, is the quasistable distribution function whose characteristic function is exp( -Itl'(cos 2 -1 ncx - i sgn tsin2 -I ncx)r(1- cx)). In particular, we have 'PI/2( x)=J2F. J: exp( -u 2 /2)du. Moreover, the strong law of large numbers and the law of the iterated logarithm con- nected with N n have been obtained [9]. (2) Let {X n } be a sequence of independent identically distributed random variables taking integer values with the same distribution, and put Sn=X I +... + X.. If the maximum span of the Xi is d and O<E(X;).s; +00, then ECI X{nd}(Sd)-->d/E(X;) (n-->oo), -->0 (n--> -00); t-lE(JI X(O,,)(Sk)).....I/E(XJ (t --> 00), where X}, is the tindicator function of the set E, and I/E(X;) is defined as zero when E(X;) = 00. If the limit distribution of the normalized sums Sn of {X n } is stable with index fJ (l.s;fJ.s;2), Xi is symmetric for (3= 1, and E(Xi)=O for fJ> I, then the event that Sn returns to 0 is a recur- rent event. For these cases, setting cx = 1- fJ-l, the conditions in (1) imposed on the distri- bution of the recurrence time 'I are satisfied, and the limit distribution for N n = x{o}(Sd+ ... + X{O}(Sn) is determined. In this connection, we can show that a similar result is valid when X{O} is replaced by a member of a considerably wider class of functions, and that the limit distribution is determined as well for M n which is the number of occurrences of the event (Sk,?;O,Sk+I.s;O), I.s;k.s;n, Similar theorems are established for the case when Xi has nonlattice distribution [10-12]. 920 (3) Let {X n } be a sequence of independent random variables with the same distribution, and put Sn =X l +... +X n , Ln= X(O, oo)(Sd+... + X(O, oo)(Sn)' If n -I E(Ln)-->cx (O.s; cx.s; 1), then P(Ln.s; nx)-->G,(x), where G, is a distribution function on the unit interval [0,1] such that Go(O+ )-Go(O-)= I, G,(x)=n- I sinncx LX u'-I(I-u)-'du for O<cx< 1, and Gdl +0)-G I (1-0)= 1. When cx = 1/2, the corresponding limit distri- bution is given by G l / 2 (x)=2n- 1 arcsinp (the arcsine law), These Ln and N n , M n in (2) are considered as tadditive functionals of the tMarkov process S.. When E(Xi)=O and E(xl) = 1, we have the limiting relation lim P ( max Sk<X fi ) n-oo 1 kn  fA f "p( < )d" (x.s;O), (x>O), lim P ( max ISkl <x fi ) n-toc 1 kn 4 00 (_I)m ( 2m + 1)2 n 2 ) =- L -exp - n m02m+ 1 8x 2 (x'?; 0). Furthermore, the limit distributions of n -I (S +,.. + S;) and n-3/2(ISII +... + ISnl) are also determined explicitly [14]. When XI is non- negative, let V x be the number of Sn (n = 1,2,...) not exceeding x, and write Yx=Svx+1 -x, Y;= x-S vx . Then a fundamental theorem oftre- newal theory guarantees the existence of the limit distributions of Y x , Y; as x--> 00 [15]. Extensive studies have also been devoted to limit theorems for sums of dependent ran- dom variables, especially for tMarkov chains [16-20]. E. Convergence in Distribution for Stochastic Processes (1) General Theory. Let T be a finite or infi- nite time interval, and let (Q, !B, P) and S be a probability space and topological state space, respectively. Given an S-valued stochastic process X =(X(t,w), tE T,WEQ), we denote by !B(ST) the (J-algebra generated by the tcylinder sets and denote by Px and p1 '", respectively, the probability measures over !B(ST) induced by the processes X and (X(tl,w),... ,X(tbW)), Suppose that there is a sequel1ce of stochastic processes X n (n= 1,2,...) whose tsample paths are contained in a subset E of ST. We can introduce a topology, on E so that if!B n (n = 
921 1,2,...) is the Px,-completion of !B(ST), the topological O'-algebra !Bk on E becomes a subfamily of!B n n E (n = 1,2,...). If there exists a probability measure Po over (E, !Bk) such that lim f f(y)dPx, = f f(y)dPo n---+iX) E E for every bounded ,-continuous real function f on E, then the probability distributions of {X n } are said to converge to Po. Let (E, p) be a tcomplete metric separable space and P' the Lebesgue measure on Q' = [0,1]. If a sequence of E-valued random vari- ables {Xn(w), 1 .s; n < oo} has the probability distribution Po over the O'-algebra !B of Borel sets of (E, p) as its limit distribution, then it can be shown that there is a sequence of E- valued random variables gn(w')} (W'EQ';O.s; n < 00) such that the probability distribution of n(w') coincides with that of Xn(w) (l.s; n < 00), o(w') has the probability distribution Po, and P'(n(w')-->o(w'))= 1 [21]. Consider a sequence of real-valued sto- chastic processes {Xn(t, w)}. If (i) there is a probability measure Po on !B(R T ) such that P" 'k converges to Pb l .', for every choice of (t 1 , ...,t k ) and (ii) limlimsup sup P(IXn(s)-Xn(t)I>f.)=O hjO noo I'-'Ih for every f. > 0, then there exists a sequence of real-valued processes {n(t, w')} (w' EQ'; O.s; n < 00) such that for every t E T, n(t) converges in probability to o(t), (n and X n (l.s; n < 00) induce the same probability measure over !B(R T ), and (0 has the probability distribution Po and is continuous in probability. (2) Convergence in Distribution of Stochastic Processes Whose Sample Functions Have tDis- continuities of Only the First Kind. Yu. V. Pro- khorov and A. V. Skorokhod carried out a sys- tematic study of convergence in distribution for those processes whose sample functions are (S, p)-valued and have discontinuities nf 0t1ly the first kind, where (S, p) is a complete metric separable space. Let D be the set of functions x(t) from T= [0,1] to S having discontinuities of only the first kind and right continuous for O.s; t < 1 with x(l- 0) = x(l), and introduce a metric PD by PD(X, y) = inf) {SUP'ET I t - }-(t) I + sup'ETP(x(t),y(A(t)))}, where the infimum is taken over all homeomorphisms ..1. on T. Then (i) in the notation of (1), !B};D=!B(ST)n D, (ii) (D,PD) is tseparable but not neces- sarily tcomplete, and (iii) there is a complete separable metric P in D which is equiva- lent to PD (i.e., PD and P induce the same topology in D). When almost all sample func- tions of X are elements of D, we write X E 250 E Limit Theorems in Probability Theory D. Given XnED (n:;:'O), a necessary and suf- ficient condition for P x , to converge to PX o over !B};D is that (i) pI,""k converge to pIO 'k for any (t 1, ..., tn) eN, where N is a dense subset of [0,1] containing 0, 1; and that (ii) limhjo lim SUPnoo PxJD(h, X n ) > b) = 0 for every 15>0, where D(h,x)==sup{min(p(x(td, x(t)), p(x(t 2 ), x(t))) It - h < t 1 < t < t 2 <t + h}. If C is a subspace of D consisting of an con- tinuous functions, PD is equivalent to pc(x, y) = max{p(x(t), y(t))1 O.s; t.s; I}, and c(h,x)= max {p(x(s), x(t)) lis - tl.s; h} corresponds to D' For a sequence of processes XnEC, a necessary and sufficient condition for Px to converge to Px over !Bf' is that (i) P" .'k 'con- verge to pio' 'k for any (t 1 , ...,tk)cN (N is a dense set as in the case of D); and that (ii) lim hjo limsuPnoo PxJc(h, X n » b)=O for every b> O. If P x , converges to PX o over !B'ff (over !BfC), then there exists a sequence n(t,w')ED (EC) (tE T, w' EQ') for O.s; n < 00 such that p{, = P x , for O.s; n < 00 and P'(PD(n' o)-->O) = 1 (P'(Pc«(n'O)-->O)= 1). The following theorem, due to Prokhorov, is useful for practical applications. Let {X,(t) I O.s; t < 1, cx E A} be a family of real-valued pro- cesses that satisfy E(IX,(t) -X,(s)I') .s;clt- sl1+b, cxEA, where a, b, c are positive constants inde- pendent of cx, If the family of distributions of {X,(O) I cx E A} is tight ( 341 Probability Measures F), then X.(t)EC (cxEA), and {PxJ is a tight family of probability distributions over C. (3) Convergence in Distribution of Markov Processes. Consider a Markov process {X (t), O.s; t.s; I} with complete metric separable space (S, p) as its state space and with topo- logical O'-algebra !B p . Suppose that its tran- sition probabilities P(S, x; t, A), O.s;s<t.s; I, A E!B p are measurable in XES. Define V'(x) = {Ylp(x,y):;:,e}, and introduce two kinds of conditions: (D) for every f. > 0 and O.s; t.s; 1, limhjOsuPxEs"I"zP(t 1 ,x;t 2 , V'(x)) =0, where t l , t 2 are subject to the conditions t.s;t l <t 2 .s;t+ h, t - h.s; t 1 < t 2 < t, 1 - h.s; t 1 < t 2 .s; 1; (C) for every e> 0, supxES,t-sh P(S, x; t, V'(x)).s; h'fl,(h), where 'fl,(h) is such that 'fl,(h)lO as hlO for every e> O. According as {X (t), O.s; t.s; I} satis- fies conditions (D) or (C), there is a process X'(t) which is equivalent ( 407 Stochastic Processes) to X (t) belonging respectively to (D) or (C). Convergence in distribution of Markov processes is based on this fact. Suppose that a sequence of Markov processes {X n(t) I O.s; t.s; I}, O.s;n< 00, satisfies (i) condition (D) uni- formly in n (condition (C) with 'fl, indepen- dent of n), (ii) with N as in (2), pI, "'k con- 
250 F Limit Theorems in Probability Theory verges to p10 "for every (t I' ..., td c N. Then P x , converges to PX o over !B'fl (over !BC). Con- vergence in distribution of Markov processes is also implied by the convergence of the tgen- erators of the Markov processes [24,25]. (4) Donsker's Invariance Principle. Donsker's invariance principle is a kind of central limit theorem for stochastic processes Let {n} be a sequence of identically distributed independent (real-valued) random variables with E(;)=O and 0<0"2= V(;)< 00, and let Sn=1 +... +"' n I, So=O. Define random elements XnEC= ceO, 1], n  I, by Xn(t,w) I I = r.:: S[ntj(w)+(nt-[ntJ) r.:: [nt]+1(w) O"yn (Jyn (o.s;t.s; 1), when [ntJ denotes the integer part of the real number nt. Thus Xn(i/n,w)=SjO"-fi, and X n is linear on the interval [(i -l)/n, i/nJ for i = 1,2, ..., n. The C-valued random variables X n induce their probability distributions P x , over !Bc, n 1. Let X(t), O.s;t.s; 1, be a (real- valued) tWiener process ( 45 Brownian Motion A) with X(O) =0 and Px its probability distribution over !B(. Px is called a Wiener measure. Then Donsker's invariance principle [26] asserts that P x , converges to Px. Ifwe define XED, n 1, by I X(t,w)= r:.. S[ntj(w) O"y n (O.s; t.s; 1), and if we consider Px as a probability measure on the space (D, 'E'fl) which is concentrated on C (i.e., Px( C) = 1), then the above convergence theorem holds also for the distributions Px; of X, n  1, over !Bt D [20]. The arcsine law ( Section D (3)) is a nice example of an application of the invariance principle. Define a function f on C by f(x) = J6X(O,cr)(x(t))dt, xEC. Then it is shown that f is measurable and continuous except on a set of Wiener measure O. Hence the invariance principle implies that the probability distri- butions of U(X n )} converge to the distribution of f(X). It is known that f(x) obeys the arcsine law P(/(X)<a)=2n- 1 arcsin, O.s;a.s; I ( 45 Brownian Motion E). In the random walk case, where P(i= 1)=P((i= -1)= 1/2, nf(Xn) is the number of i, 1.s; i.s; n, for which Si-I and S; are both nonnegative. (5) Strassen's Invariance Principle. An in- variance principle for the law of the iterated logarithm is explained here. Let {n}' {Sn}, and {Xn(t)} be the same as in subsection (4) above. 922 Define T,,(t, w) = (2loglog n) -1/2 Xn(t, w) (O.s;t.s;l, n3). Strassen's invariance principle [28J asserts that with probability 1 the set {T",n 3} is trela- tively compact in C = C[O, 1], and the set of its limit points coincides with the set K of ab- solutely continuous functions x on the inter- val [0, lJ such that x(O)=O and Jbx'(t)2dt.s; 1. This principle is based on the following fact. Let X (t), O.s; t < 00, be a tWiener process with X(O)=O, and Zn(t) = (2n log log n) -1/2 X (nt), (o.s;t.s;l, n3). Then with probability 1 the set {Zn,n3} is relatively compact in C with its limit set equal to K. As applications of the invariance prin- ciple we obtain, e.g., (i) the ordinary law of the iterated logarithm P(liSPSn/(2nlOgIOgn)I/2=0")= 1, and (ii) if 0"= 1, for any a 1, P(lip(2nlOglOgn)-a/2n-1 it Is;!a =2(a+2)(a/2)-l a -a/2 (f l  ) -a ) = 1, o in particular, P(lip(2nl0gI0gn)-1/2n-l;t IS;I = r l / 2 )= 1. F. Convergence of Empirical Distributions Let {Xk(w)II.s;k.s;n}, {}j(w)ll.s;j.s;m} be independent random variables with the same distribution function F. Let Nn(x, w) be the number of k (k= 1, ...,n) such that Xk.s;x. Then Fn(x,w)=n-INn(x,w) is a distribution function in x called an empirical distribution function. According to the Glivenko-Cantelli theorem, Hn+ =suPxlFn(x,w)-F(x)I-->O (n-->oo) with probability 1. Let Gm(x, w) be the empir- ical distribution function constructed from the }j, and put Hn = sup (Fn(x, w) - F(x)), H+(n, m) =sup I Fn(x, w)- Gm(x, w)l, H(n,m)=sup(Fn(x, w)- Gm(x, w)). If F is continuous, then the following results 
923 hold. According to Kolmogorov's theorem, as n--> 00 we have P(jn Hn+ < x)-->K(x), where 00 K(x)= L (-llexp( _2k 2 x 2 ) (x>O); k= - 00 =0 (x.s;O). On the other hand, Smirnov's theorem asserts that (i) P( Hn<x)=O (x.s;O); = 1-(I-n-I/2x)"-x nf nn ( n ) k=r+1 k x (k - x)k(n - k + xr-k-1 (O<x.s;, r=[xJ); =1 (x>), and, as n--> 00, P( Hn<x)-->L(x), L(x)= l-exp( _2x 2 ) (x > 0), =0 (x.s;O); (ii) when n-->oo, mn-I-->r, r a constant, P«(nm) 1;2(n + m) -1;2 H + (n, m) < x)--> K (x), P«nm)I/2(n + m) -1/2 H (n, m) < x)--> L(x). In terms of convergence in distribution for Markov processes, these results are interpreted as follows: F(X,) is uniformly distributed over [0,1]. Let vn(t) be the number of k (k= 1,2, ..., n) such that F(Xk).s; t, and put Xn(t) = (n-IVn(t)-t) (O<t.s; 1), Xn(O)=O. Then E(Xn(t)) = 0, E(Xn(t)Xm(t))==min(s, t)-st, Hn+ =sup,IXn(t)l, and Hn(t)=sup,Xn(t). Let {X(t)IO.s;t.s;1} be a tGaussian process with E(X(t»)=O, E(X(s)X(t)) = min(s, t)- st, and put H+ = sup, IX(t)l, H=sup,X(t). Then p x . con- verges to Px over !BI'F. Therefore P( Hn+ <x)-->P(H+ <x) (n-->oo), P( Hn<x)-->P(H <x) (n-->oo). Similarly, if we denote the number ofj such that F(}j).s;t by Jim(t), then H+(n,m)= sup, I n -I vn(t) - m -I Jim(t) I, and therefore P«nm)I/2(n + m) -1;2 H+(n, m) < x) -->P((l +r(I/2 sup ,IX(t)-';;: Y(t)\<x) when n-->oo, nm-I-->r, where {Y(t)IO.s;t.s; 1} is independent of {X (t) I O.s; t.s; I} and distributed according to the same law as for the latter. The exact as well as asymptotic expressions for the distributions Hn+, H+(n,m) are also ob- tained [33,34]. Thus the limit distributions can be calculated by using the distributions of 250 Ref. Limit Theorems in Probability Theory the processes {X(t)} and {Y(t)}. The results above are obtained by applying Donsker's invariance principle. The Gaussian process {X(t)1 O.s; t.s; 1} introduced above is called a Brownian bridge because it is obtained from a Brownian motion B(t) by conditioning B(O)=B(I)=O. References [IJ B. V. Gnedenko and A. N. Kolmogorov, Limit distributions for Sums of independent random variables, Addison-Wesley, 1954. (Original in Russian, 1949.) [2J V. V. Petrov, Asymptotic expansions for distributions of sums of independent random variables, Theory of Prob. App\., 4 (1959), 208-211. (Original in Russian, 1959.) [3] H. Cramer, On asymptotic expansions for sums of independent random variables with a limiting stable distribution, Sankhya, (A) 25 (1963), 13-24. [4J H. Cramer, Sur un nouveau theoreme- limite de la theorie des probabilites, Actualites Sci. Ind., 736, Hermann, 1938, 5-23. [5J V. V. Petrov, A generalization of Cramer's limit theorem, Selected Trans\. Math. Stat. and Prob., Amer. Math, Soc., 6 (1966), 1-8. (Original in Russian, 1954.) [6J Yu. V. Linnik, On the probability oflarge deviations for the sums of independent vari- ables, Proc. 4th Berkeley Symp. on Math. Stat. and Prob., II, Univ. of California Press, 1961, 289-306. [7J S. V. Nagaev, Large deviations of sums of independent random variables, Ann. Proba- bility, 7 (1979), 745-789. [8J W. Feller, The general form of the so- <.:alled law of the iterated logarithm, Trans. Amer. Math. Soc., 54 (1943), 373402. [9J W, Feller, Fluctuation theory of recurrent events, Trans. Amer. Math. Soc., 67 (1949), 98- 119. [10J G. Maruyama, Fourier analytic treatment of some problems on the sums of random variables, Natural Sci. Report, Ochanomizu Univ., 6 (1955), 7-24. [11J N. Ikeda, Fluctuation of sums of in de- pendent random variables, Mem. Fac. Sci. Kyushu Univ., (A) 10 (1956),15-28. [12J G. Kallianpur and H. Robbins, The se- quence of sums of independent random vari- ables, Duke Math. J., 21 (1954),285-307. [13] F. Spitzer, A combinatorial lemma and its application to probability theory, Trans. Amer. Math. Soc., 82 (1956), 323 339. [14J p, Erdos and M, Kac, On certain limit theorems of the theory of probability, Bull. Amer. Math. Soc., 52 (1946), 292302. [15J E. B. Dynkin, Some limit theorems for 
251 A Linear Operators sums of independent random variables with infinite mathematical expectations, Selected Trans!. Math. Stat. and Prob., Amer. Math. Soc., 1 (1961), 171-189. (Original in Russian, 1955.) [16] R. L. Dobrushin, Central limit theorem for nonstationary Markov chains I, II, Theory of Prob. Appl., 1 (1956), 65-80, 329-383. (Original in Russian, 1956.) [17] S. V. Nagaev, Some limit theorems for stationary Markov chains, Theory of Prob. Appl., 2 (1957),378-406. (Original in Russian, 1957.) [18] R. L. Dobrushin, Limit theorems for a Markov chain of two states, Selected Trans!. Math. Stat. and Prob., Amer. Math. Soc., 1 (1961),97-134. (Original in Russian, 1953.) [19] I. A. Ibragimov and Yu. V. Linnik, Inde- pendent and stationary sequences of random variables, Wolters-Noordhoff, 1971. (Original in Russian, 1965.) [20J P. Billingsley, Convergence of probability measures, Wiley, 1968. [21] A. V. Skorokhod, Limit theorems for stochastic processes, Theory of Prob. App!., 1 (1956),261-290. (Original in Russian, 1956.) [22] Yu. V. Prokhorov, Convergence of ran- dom processes and limit theorems in prob- ability theory, Theory of Prob. App!., I (1956), 155-214. (Original in Russian, 1956.) [23] A. V. Skorokhod, Limit theorems for Markov processes, Theory of Prob. Appl., 3 (1958),202-246. (Original in Russian, 1958.) [24J A. V. Skorokhod, Studies in the theory of random processes, Addison-Wesley, 1965. (Original in Russian, 196\.) [25J D. W. Stroock and S. R. S. Varadhan, Multidimensional diffusion processes, Springer, 1979. [26] M. D. Donsker, An invariance principle for certain probability limit theorems, Mem. Amer. Math. Soc., 6 (1951),1-12. [27] D. Freedman, Brownian motion and diffusion, Holden-Day, 1971. [28] V. Strassen, An invariance principle for the law of the interated logarithm, Z. Wahr- scheinlichkeitstheorie und Verw. Gebiete, 3 (1964),211-226. [29J A. N. Kolmogorov, Sulla determinazione empirica di una legge di distribuzione, Giorn. 1st. Ital. Attuari, 4 (1933),83-91. [30J N, V. Smirnov, Approximate laws of random variables from empirical data (in Russian), Uspekhi Mat. Nauk, (old ser.) 10 (1944),179-206. [31] W. Feller, On the Kolmogorov-Smirnov limit theorems for empirical distributions, Ann. Math. Statist., 19 (1948), 177-189. [32J 1. L. Doob, Heuristic approach to the Kolmogorov-Smirnov theorems, Ann. Math. Statist., 20 (1949),393-403. 924 [33] V. S. Korolyuk, On the discrepancy of empiric distributions for the case of two inde- pendent samples, Selected Transl. Math. Stat. and Prob., Amer. Math. Soc., 4 (1963), 105- 12\. (Original in Russian, 1955.) [34] V. S. Korolyuk, Asymptotic expansions for the criteria of fit of A. N. Kolmogorov and N. V. Smirnov (in Russian), Izv. Akad. Nauk SSSR, Ser. Mat., 19 (1955),103-124. 251 (XII.g) Linear Operators A. General Remarks In tfunctional analysis, when we talk about an operator or a mapping T from one space to another, it is important to specify not only the domain D(T) on which T is defined and the range R(T) onto which T maps D(T) but also the spaces X and Yof which D(T) and R(T), respectively, are subsets. Thus a mapping T from a subset D(T) of a (real or complex) linear space X to a linear space Y is called an operator from X to Y. The image of x E X under T is customarily denoted by Tx. An operator T from X to Y is said to be a linear operator (linear transformation or additive operator) if (i) D(T) is a tlinear subspace of X and (ii) T(cxlx l +CX2X2)=CX\ TX I +cx 2 TX2 for all XI' X2ED(T) and all scalars CXl> CX 2 . The set of all continuous linear operators T from X to Y with D(T)== X is denoted by L(X, Y). We denote L(X, X) by L(X). For simplicity we suppose throughout this article that X and Yare tBanach spaces. In this case L(X, Y) consists of all tbounded linear operators from X to Y. Hence L(X, Y) and L(X) are denoted by B(X, Y) and B(X), respectively Some of the statements given remain true in more general situations ( 424 Topological Linear Spaces). More information about operators in B(X, Y) can be found else- where ( 37 Banach Spaces; 68 Compact and Nuclear Operators). Examples are grouped together in Section O. B. Operations on Operators When T\ and T 2 are operators from X to Y, the sum TI + T 2 is the operator defined by D(T\ + T 2 ) = D(TtJ n D(T2) and (T\ + T 2 )x = T\ x + T 2 x, x E D( T\ + T 2 ). When T\ is from X to Y and T 2 is from Y to Z, the product T 2 TI is the operator defined by D(T 2 T I ) == {xED(T\)1 T\XED(T 2 )} and T 2 T\x= T 2 (T\x), x E D( T 2 T\). The product cx T of a scalar cx and 
925 an operator T is defined in a similar and obvi- ous way. These operators are linear whenever TI and T 2 are linear. For an operator T from X to Ythe subset r(T)={(x, Tx)lxED(T)} of the product space X x Y is said to be the graph of T. If r(T 1 ) C r(T 2 ), the operator Tz is said to be an extension of TI (we write TI c Tz). If a linear operator T from X to Y satisfies the condition that x#O implies Tx #0, then T has the inverse operator T- 1 , which is a linear operator satisfying D(r-1)=R(T) and R(r- 1 ) =D(T). C. Convergence of Operators The following three topologies are most fre- quently used in the linear space B(X, Y), which becomes a tlocally convex topological linear space under each of them: (1) the uniform operator topology, (2) the strong operator topol- ogy, and (3) the weak operator topology. These topologies are determined by the respective fundamental systems of neighborhoods of 0 consisting of all sets of type (1) {T III T II < f.}, (2) {TIIITxll < f., XEX}, and (3) {Tllf(Tx)I<f., XEXJEY'}, where f. varies over all positive numbers, X over all finite subsets of X, and Y' over all finite subsets of Y', the tdual space of Y. The uniform operator topology is thus the metric topology determined by the tnorm in B(X, Y) ( 37 Banach Spaces C). Convergence of 7;, to T with respect to one of these topol- ogies is referred to as (1) uniform convergence, (2) strong convergence, or (3) weak convergence. We have such convergence if and only if(l) 117;, - T 11-->0, (2) II (7;, - T)x 11-->0 for every x E X, or (3) If(7;,x- Tx)I-->O for every XEX and fEY'. D. Closed Operators Closed operators play an important role when we deal, as is frequent in applications, with operators that are not necessarily continuous. An operator T from X to Y is said to be a closed operator if the graph of T is closed in X x Y, or equivalently, if XnE D(T), Xn-->X, and Txn-->y imply xED(T) and Tx= y. If T is con- tinuous and D(T) is closed, then T is closed. Conversely, if a linear operator T is closed and D(T) is closed, then T is necessarily continu- ous (the closed graph theorem). An operator T from X to Y is said to be a closable operator if T has a closed extension. A linear operator is closable if and only if XnE D(T), xn-->O, and Tx n --> Y imply y =0. The closure of the graph of a closable operator T is the graph of the smallest closed extension T of T. Thus f is also called the closure of T. When T is a linear 251 E Linear Operators operator from X to Y with D(T) dense in X, the dual operator (or conjugate operator or adjoint operator) T' of T is defined to be the operator from Y' to X' determined by the relations D( T') = {f E Y' I there exists 9 EX' such that g(x) = f(Tx) for every xED(T)} and T'f = g, f E D(T') ( 37 Banach Spaces). T'is always closed. If T is a closable linear operator with dense domain, then its smallest closed extension is obtained as the bidual T" re- stricted to D(T)= {x EX n D(T") I T"XE Y}. For a closed linear operator T we define its nullity nul T to be equal to dimN(T), where N(T)= {xED(T)1 Tx =O} is the null space (i.e., the tkernel) of T, and its deficiency defT to be equal to dim YjR(T). When at least one of nul T and defT is finite, we define the index of T by ind T=nul T-defT. A closed linear operator T is said to be a Fred- holm operator if R(T) is closed, nu1(T) < 00, and def(T) < 00 ( 68 Compact and Nuclear Operators). The domain D(T) of a closed linear operator T from a Banach space X to another Y turns out to be a Banach space under the graph norm I!xllx + II Txll y , and T is looked upon as a bounded linear operator from this Banach space D(T) to Y. E. Operators between Hilbert Spaces Throughout this section we suppose that X and Yare complex tHilbert spaces. Let T be a densely defined linear operator from X to Y. Instead of the dual T', it is sometimes conve- nient to use the operator T* from Y to X determined by the relation (x, T*y) =(Tx, y), xED(T). The operator T* is called the adjoint operator (or Hilbert space adjoint) of T. By means of the an tilinear isomorphism n x from X onto X' given by (nxx)(u)=(u, x) (tRiesz's theorem), T* is related to T' by T*=nxl T'n y . The correspondence T --> T' is linear, while the correspondence T --> T* is antilinear. D(T*) is dense in Y if and only if T is closable, and in this case the smallest closed extension f of T coincides with T** =(T*)*. If a densely defined linear operator T in X (i.e., T is from X to X) satisfies Tc T*, then T is said to be a symmetric operator (or Her- mitian operator). If T= T*, then T is said to be a self-adjoint operator. A symmetric operator is always closable. A symmetric operator Tis said to be essentially self-adjoint if the closure of T is self-adjoint. A self-adjoint operator T is said to be nonnegative or positive or positive semidefinite if (Tx, x);;, 0 for every x E D(T). An operator TE B(X, Y) is said to be partially isometric if there exists a closed linear sub- space M of X such that T is isometric on M 
251 F Linear Operators (i.e., II Txll = Ilxll, xEM) and zero on the tortho- gonal complement of M. The closed linear subspace M (TM) is called the initial (final) set of T. An operator T E B(X, Y) is partially isometric if and only if T*T and TT* are (orthogonal) tproJections. The ranges of these projections are the initial and final sets of T, respectively. In particular, if M = X, then T is said to be an isometric operator or an iso- metry. If X == Y = M == TM or, more generally, if X = M and Y = TM, then T is said to be a unitary operator. 'fEB(X) is unitary if and only if T* = T- 1 . A closed linear operator T with dense domain is said to be normal if T* T = TT* The set of self-adjoint (or unitary) operators in X forms a subclass of the class of all normal operators. The structure of nor- mal operators, especially that of self-adjoint operators, has been studied in detail by means of spectral analysis ( 390 Spectral Analysis of Operators). Let T be a densely defined closed linear operator from X to Y. Then there exist a nonnegative self-adjoint operator P in X and a partially isometric operator W with initial set R(P) = R(T*) such that T= W P. The operators P and Ware determined uniquely by these requirements. This is called the canonical de- composition or the polar decomposition of T. For linear operators T in a complex Hilbert space X, the notion of numerical range W( T) = {(Tx,x)lxED(T), Ilxll = 1} is also impor- tant. W(T) is a convex set such that II TII/2.s; sup I W(T)I = sup{ I},II },E W(T)}.s; II TII. If Tis normal, then the closure of W(T) coincides with the tclosed convex hull of the tspectrum O"(T), and we have sup I W(T)I = II n. A nor- mal operator T is a nonnegative self-adjoint operator if and only if its numerical range is in the positive ray ..1.;;'0. F. Resolvents and Spectra Let T be a closed linear operator in a complex Banach space X, and I be the identity in X. The set p(T) of all complex numbers A such that AI - T has an inverse in B(X) is called the resolvent set of T, and the complement O"(T) of p(T) is called the spectrum of T. For AE p(T) the operator R(A; T)=(U - T)-1 EB(X) (or sometimes -R(A; T)) is called the resolvent of T. IPoE p(T), then V IIX-Xol < IIR(Ao; T)II- l } cp(T). Hence p(T) is an open, and O"(T) a closed, set. If T is bounded, then O"(T) is a nonempty compact set. However, p(T) may be empty (example (2) in Section 0) or the whole plane C in general. The operator R(},; T) is an tanalytic operator function of ), in p(T) and satisfies the (first) resolvent equation R(}'l; T)- R(},2; T) =(A 2 - AdR(A1; T)R(X 2 ; T) 926 for every )'1' i. 2 Ep(T). For every TEB(X) the limit r(T) = Jim II T" 11 1 !n.s; II n, n--> 00, exists and coincides with the tspectral radius supIO"(T)1 of T. An operator TEB(X) with r(T)=O is called quasinilpotent or generalized nilpotent. Con- cerning the dual operator, we have p(T) = p(T) and R(A; T')=R(A; T)'. If X is a Hil- bert space, then p(T*)= p(T) and R(i.; T*)= R(X; T)*, where - stands for the complex conjugate. G. Operational Calculus In operator theory, the term operational cal- culus generally indicates a way of defining "functions" f(T) of an operator T so that a kind of algebraic homomorphism is estab- lished between a set of complex-valued func- tions f and the corresponding set of operators f(T). The functions and operators that must be taken into consideration depend on the nature of the pro blems to be solved, and ac- cordingly there are several versions of oper- ational calculus. We describe two typical ones. (J) Let T be a self-adjoint operator in a com- plex Hilbert space X with the tspectral resolu- tion T = Ice en i.dE(X), and let f be a complex- valued Borel measurable function on R. Then the operator f(T) is uniquely determined by the following relations ( 390 Spectral Analy- sis of Operators): D(f(T))={xlf:oc 1 f(),Wd(E(},)x,x)< oo}, (j(T)x,y)= L'J", f(A)d(E(}.)x,y), xED(/(T)), YEX. Then f(T) is normal and the correspondence ff-> f(T) satisfies the following relations: (i) (Clf + f3g)(T)=:> exf(T) + [3g(T); (ii) (fq)(T) => f(T)g(T); and (iii) f(T)* = J(T). If 9 is a bounded func- tion, the extensions in (i) and (ii) can be re- placed by equalities. (2) Let X be a complex Banach space, TE B(X), and ff(T) the set of all functions holo- morphic in a neighborhood of O"(T). We define an operator f(T)EB(X),fEiF(T), by f(T)= f f(t)R(t; T)dt, (*) 2m c where C is a closed curve consisting of a finite number of rectifiable Jordan arcs encircling a domain that contains O"(T) in its interior and lies with its boundary completely in the domain in which f is holomorphic. The inte- gral does not depend on the curve. In this case relations (i) and (ii) hold with equality in place of extension. Instead of (iii) we have (iv) Y(T) = ,'F(T) and f(T') = f(T)'. The integral 
927 appearing in ( * ) is sometimes called the Dun- ford integral. In both situations described in (I) and (2) the spectral mapping theorem O"(f(T)) = j(O"(T)) holds. When these two ways of defin- ing f(T) are possible, the resulting operators coincide. Another kind of operational calculus can be constructed, for example, when T is the gen- erator of a certain semigroup of operators [1,2,4]. H. Isolated Singularities of the Resolvent Let T be a densely defined closed linear oper- ator in a complex Banach space X and )'0 an isolated point of O"(TI. Take a sufficiently small circle C around Ao and put E= f R(}'; T)di., 2m c which is a projection in X. Then the Laurent expansion around Ao of the resolvent is given by u:, R(A; T)= I An(}.-Ao)n, with An = (-I )"+1 ().O f  Tj -(n+1) E for n <0. When the dimension v of the range of E is finite, Ao is a tpole of R().; T) with order not exceeding v, and ).0 is an teigenvalue of T with tmultiplicity not exceeding v. Furthermore, E is then a projection onto the troot subspace belonging to the eigenvalue ).0' I. Extension of Symmetric Operators In applications we frequently encounter the problem of finding self-adjoint extensions of a given symmetric operator. Let T be a closed symmetric operator in a complex Hilbert space X. Then T::!:: if is one-to-one, and its range R c!c is a closed linear subspace of X. The operator V 7 =(T-iI)(T+iI)-1 fromR+ ontoR_ is isometric, and (I - Vr)R+ is dense in X. We calI V T the Cayley transform of T. Conversely, let V be an isometric operator from a closed linear subs pace M of X onto another one N such that (I - V)M is dense in X. Then the operator T= i(I + V)(I - V)-l is a closed sym- metric operator satisfying V 7 = V. Thus the cor- respondence T --> V T is one-to-one onto; TcS if and only if V T c V s , and T is self-adjoint if and only if V 7 is unitary The dimension n r of the subspaces X eN c!c ={xl T*x= ::!::ix} are called the deficiency indices of T. Denoting the tresidual spectrum of T by O"r(T) and putting II c!c ={Allm).O}, we have the following propositions: (i) According as n+ >0 or n+ =0, II+ co-,(T) or II+ c p(T) (similarly for n_ in 251 J Linear Operators place of n+); (ii) T has a self-adjoint extension if and only if n+ = n_; and (iii) T is self-adjoint if and only if n+ = n_ =0, or, equivalently, if and only if II c!c cp(T). When T is symmetric but not closed, the previous arguments can be applied to the closure f of T. The deficiency indices of t are also called the deficiency indices of T. These arguments can be per- formed similarly with A and I (ImA#O) in place of i and -i. We now describe two more concrete criteria for T to have self-adjoint extensions. (1) Semi- bounded operators: If there exists a real num- ber y such that (Tx,x);;'yllxf for every XE D(T), then T has a self-adjoint extension satisfying a similar inequality with the same constant i'. The structure of all self-adjoint extensions of T with the same lower bound y was studied in detail by M, G, Krein [8]. Among such extensions the Friedrichs exten- sion is distinguished as the one having the smallest form domain. (2) Real operators: If T commutes with a conjugation in X, namely, if there exists an antilinear mapping J from X onto X such that (Jx,Jy)=(y,x), J2 =1, and JT= T J, then T has a self-adjoint extension ( Section 0 (2)). J. Dissipative Operators A linear operator T in a Hilbert space X is said to be dissipative (resp accretive) if Re(Tx,x).s;O (resp. ;;'0) for every xED(T). To extend the definition to operators in a Banach space X, we define, for each XEX, Fx to be the set of all x' in the dual space X' such that <x, x') = Ilx11 2 = Ilx'11 2 . Fx is not empty by virtue of the Hahn-Banach extension theorem ( 37 Banach Spaces) The multivalued map- ping F: x --> x' is called the duality mapping. A linear operator T in X is called dissipative (resp. accretive) if for each XE D(T) there exists an x'EFx such that Re<Tx,x').s;O(resp, ;;'0), or equivalently if Ilx-ATxll;;' Ilxll for every XE D(T) and i.>O (resp. A<O). A dissipative operator T is called m-dissipative if R (I -). T) = X for a (and all) i. >0. Then the half-plane Re}. > 0 is in the resol ven t set p( T), and we have II(I.I - Tj-111.s; l/Rd, ReA >0. In particu- lar, T is closed, and if X is treflexive, then the domain D(T) is dense A dissipative operator is called a maximal dissipative operator if it has no strict extension that is dissipative. An m- dissipative operator is a maximal dissipative operator. If X is a Hilbert space, then con- versely a maximal dissipative operator with dense domain is m-dissipative. Every dissipa- tive operator with dense domain in a Hilbert space can be extended to an m-dissipative operator. A linear operator T in a Banach 
251 K Linear Operators space is the tinfinitesimal generator of a con- traction semigroup of class (Co), Le., a tsemi- group {T, It;;' O} of class (Co) such that II T,11.s; 1 for all t;;' 0, if and only if T is an m-dissipative operator with dense domain. The notion of dissipative and accretive operators is also defined for nonlinear opera- tors ( 286 Nonlinear Functional Analysis). K. Subnormal Operators and Hyponormal Operators A bounded linear operator T in a complex Hilbert space X is said to be subnormal if there exists a bounded normal operator N in a complex Hilbert space Y, containing X as a closed linear subspace such that Tx = N x for all x E X. The operator N is called a normal extension of T. T is subnormal if and only if Lm,n(T*m Tn x m , x.);;' 0 for every finite sequence {xn} of X. Then its normal extension N is determined uniquely up to tunitary equiva- lence under the minima1ity requirement that there is no treducing subspace ( Section L) for N between X and Y. Every normal or isometric operator is subnormal. The spectrum of a subnormal operator T is obtained from the spectrum of its minimal normal extension N by adding some bounded components of its complement. An operator TEB(X) is said to be hyponormal if the self-commutator [T*, TJ = T* T - TT* is nonnegative. Every sub- normal operator is hyponormal. If a hypo- normal operator T has a cyclic element x, i.e., the tlinear span of {Tn X I n = 0, I, ... } is dense in X, then the self-commutator is a ttrace class operator (c. Berger and B. Shaw). The planar Lebesgue measure of the spectrum of any hyponorma1 operator T is not less than nll[T*, TJII (Putnam's theorem) [10]. L. Invariant Subspaces A closed linear subs pace M of a Banach space X is said to be invariant under an operator TE B(X) if T maps Minto M. If M is invariant under alI operators in B(X) that commute with T, then M is said to be hyperinvariant under T. When X is a Hilbert space and M is invari- ant under both T and T*, then M is said to reduce T. That M reduces T is characterized by the commutativity TP = PT, where P is the orthogonal projection to M. The question of whether an arbitrary nonzero bounded linear operator in a separable infinite-dimensional Hilbert space has a nontrivial invariant sub- space still remains open but work has pro- gressed significantly in recent years. (1) Every nonzero tcompact operator has a nontrivial 928 hyperinvariant subspace (V. I. Lomonosov). (2) An operator T has a nontrivial invariant sub- space if it is subnormal or if II TII.s; 1 and the spectrum O"(T) covers the whole unit disk of the complex plane (S. Brown, 1979). (3) If T is hyponorma1, a certain power Tn has a non- trivial invariant subspace (Berger, 1979). The complete description of all invariant subspaces is usually difficult even for an oper- ator of quite simple type. For the tshift oper- ator ( Section 0 (6)) on the tHardy space Hz on the unit circle this was done by A. Beurl- ing [11,12]. For the tintegra1 operator on the space Lz(O, 1) with tkerne1 k(t, s) = max(t -s, 0) the set of alI invariant subspaces is ttotally ordered with respect to inclusion [13]. M. Dilations Let X be a Hilbert space and TEB(X). A bounded linear operator U in a Hilbert space Y, containing X as a closed linear subspace, is said to be a dilation (also strong or power dilation) of T if Tnx=Punx for all XEX, n= 1, 2, ..., where P is the orthogonal projection from Y to X. If U is unitary, it is called a uni- tary dilation of T. Every contraction TEB(X), i.e., an operator T with II TII.s; 1, has a unitary dilation U in a suitably constructed Hilbert space Y (the dilation theorem). Such a unitary dilation U is uniquely determined up to tuni- tary equivalence under the minima1ity require- ment that there is no reducing subspace for U between X and Y. A corollary is the von Neumann inequality: For any contraction T and any complex polynomial p(A), the in- equality II p(T)II.s; sUPIAI<llp(l,)1 holds. Further- more, if an operator S E B(X) commutes with a contraction T, then there exists a dilation V of Sin B(Y) such that II VII = 11511 and V com- mutes with the unitary dilation U of T (the lifting theorem). If a contraction T is com- pletely nonunitary in the sense that the restric- tion of T to any nontrivial reducing subspace is not unitary, then the minimal unitary dila- tion has tabsolute1y continuous spectrum, and the operational calculus pf-> p(T) is ex- tended to all functions in the tHardy space Hex, [15]. N. Functional Models for Contractions A canonical model for an operator is a "natu- ral" representation of the operator in terms of simpler operators and in a context in which more structure is present. There are several canonical models for bounded linear operators [13-16]. Here, we follow the approach of B. Sz.-Nagy and C. Foia that was developed 
929 in connection with unitary dilations of con- tractions [15, 16]. An tanalytic operator function 0(},) defined on the open unit disk with values in B(X, Y), where X and Yare separable complex Hilbert spaces, is denoted by {X, Y,0}. An analytic operator function {X, Y, 0} is said to be con- tractive if II o (}.) II .s; I for all Ie. If, in addition, 110(0)xll < !lxll for all nonzero XEX, then it is said to be purely contractive. Each contractive {X, Y,0} is uniquely decomposed into the direct sum 0(..1.) = 0 0 (..1.) EB V, where {X o , Yo, 00} is a purely contractive analytic operator func- tion with closed linear subspaces Xo S; X and Yo S Y, and V is an isometric operator from X 8 Xo into Y 8 Yo. The operator function 0 0 is called the purely contractive part of 0. Let {X, Y, 0} be purely contractive. Then the boundary value 0«()=s-lim H I 0(rO exists for almost all ( with respect to the Lebesgue measure on the unit circle. Let A(O = (I- 0*(00(m l / 2 for 1(1 = I. Then the operator functions 0(0 and A(O defined on the unit circle induce, respectively, the operators 0 E B(Lf, L) and EB(LD defined by (0f)(0= 0(Of(0 and (f)(O=A(Of(O for 1(1 = I and fELf, where Lf is the tHilbert space of X- valued square integrable functions on the unit circle, Consider the Hilbert space K = L EB R() and the unitary operator U de- fined by U({EB g)(O = (f(0 EB (g(O. Con- sider the linear subspace H = [Hi EB R() J 8 {0fEBflfEHn ofK, where Hi is the X- valued tHardy space that is a subspace of Lf, Then the operator T = PH U restricted to H, where PH is the orthogonal projection from K to H, is a completely non unitary contraction with U as its minimal unitary dilation. T is called the contraction generated by {X, Y,0}. Two purely contractive analytic operator functions {XI' YJ,0d and {X 2 , Y 2 ,0 2 } can generate contractions which are unitarily equivalent if and only if there are isometries VI from Y I onto Y 2 and V 2 from XI onto X 2 such that VI 0 1 (..1.)==0 2 (..1.) V 2 for all A in the unit disk. In this case, 0 1 and O 2 are said to coincide. Now let a completely nonunitary contrac- tion T be given, Consider the operator D 1' = (I - T*T)I/2 and the closed linear subspace :D 1' = R(D 1' ) , and define similarly D 1' * and :D1'* by using T* instead of T. Then the function 0 1' (A) = - T + ADT*(l- ). T*) -I Dr restricted to :D 1' becomes a purely contractive analytic opera- tor function with values in B(:D 1' , :D1'*) and is called the characteristic operator function of T. The contraction generated by the character- istic operator function is unitarily equivalent to T and is called the functional model (or the Sz.-Nagy-Foia model) for T. The spectrum of T coincides with the set consisting of the Ao 2510 Linear Operators in the open unit disk at which 0 1' (A o ) fails to have bounded inverse, together with the (0 on the unit circle at which 0 1 (.) fails to have an analytic extension to a neighborhood Q of (0 which is unitary on the intersection of Q and the unit circle. There is a one-to-one corre- spondence between the invariant subspaces M for T and the regular factorizations of 01' as a product 0 1' (A)==0 2 (A)'0 1 (}.) of two contractive analytic operator functions {Y, :D1'*' O 2 } and {:D 1' , Y, 0d with a suitable Hilbert space Y. Here the regularity of the factorization means that for almost all ( in the unit circle the range of (I -0H00 2 (m l / 2 and the range of (I - 0 1 (0 0t(()) 1/2 are linearly independent. More- over, the characteristic operator function of the restriction of T to M coincides with the purely con tractive part of 0 I' while the characteristic operator function of PM T restricted to M  = X 8 M coincides with the purely contractive part of O 2 , where PM is orthogonal projection to M. O. Examples of Linear Operators (1) Integral Operators. Let Ej,)= I, 2, be linear spaces consisting of measurable functions defined on measure spaces Qj with measures Jij' Let k(t, s) be a measurable function on Q2 x QI, and define D(R) to be the set of all XE E 1 such that (Kx)(t) = Jill k(t, s)x(s)dJids) belongs to E 2 , where the integral is assumed to be absolutely convergent almost everywhere. The mapping that assigns Kx to each xED(R) determines a linear operator K from EI to E 2 with domain D(K). K is called an integral operator, and k(t, s) the kernel (or integral kernel) (of K). As an example, let Ej = Lp(QJ. 1 .s; p.s; 00, and suppose there exists an M > 0 such that r Ik(t,s)ldJiI(s).s;M, Jill r Ik(t,s)ldJi2(t).s;M. Jill Then KEB(EI,E2) with IIKII.s;M (68 Com- pact and Nuclear Operators C; [I 7J), An integral operator is said to be Hermitian if the kernel satisfies k(t, s) = k(s,t) . A bounded Hermitian integral operator is self-adjoint in L 2 (Q). (2) Differential Operators. For X = L 2(0, I), let :Do be the set of all XE C 2 (0, I) with com- pact support in (0, I) and :D I the set of all XE CI(O, t) such that x'(t) is absolutely continu- ous in (0, 1) with x" E X. Then the operators 7j,j=O, 1, determined by (7jX)(t) = -x"(t), x E :Dj, are linear in X. To* = TI' so that To is a 
251 Ref. Linear Operators symmetric operator. Furthermore, To is a real operator with respect to the conjugation x-->x. Since two linearly independent solutions of (T 1 -U)x=O both belong to X, the deficiency indices n+ of To are 2. (Note that p(Td = 0.) Self-adjoint extensions of To are obtained by restricting the domain of T 1 by boundary conditions ( 112 Differential Operators; [1,4]). (3) Fourier Transforms. For every x E Lp(R n ), l.s;p.s;2, (U x)(t) = lim (2n)-/1/2 r exp( - its)x(s)ds m---+OC' J1tl:(m converges in the norm of Lq(Rn), p-I +q-l = 1. The operator U thus defined belongs to B(Lp, Lq). When p == q = 2, U is a unitary oper- ator in L 2 ( 160 Fourier Transform). (4) Singular Integral Operators. Let Q(t) be a bounded measurable function on Rn homoge- neous of degree O. Suppose that the integral of Q over the unit sphere vanishes and that Q satisfies the condition e 15- 1 ( SUp I Q(t)-Q(S)I ) d15 < 00. Jo l'Ilsl=1 I'-sl<o Then for every x (t) E Lp(Rn), 1 < p < 00, I Q(t-s) (Tx)(t) = lim -----:-nx(s)ds do 1'-'I>,lt-,\1 converges in the norm of Lp(Rn), and the oper- ator T so defined is a bounded linear opera- tor in Lp(Rn). If n = 1 and Q(t) = n- 1 t/ltl, then T is the tHilbert transform. If n > 1 and Q(t) = r(n + 1 )/2)n-(n+1);2 t)1 tl,j = 1, ..., n, then Tis called the Riesz transform. In general, integral operators on Lp(Rn) defined by kernels k(t, s) satisfying the esti- mate Ik(t, s)l.s; Clt - sl-n are called Calderon- Zygmund singular integral operators. These have been widely investigated [18, 19]. (5) Pseudodifferential Operators. Let a(t, ,) be a function defined on Rn x Rn. Then the oper- ator T defined by (Tx)(t)=(2n)-n/2 J eitra(t, ,)x(,)d, is called a pseudodifferential operator, and a(t, ,) is called the symbol of T, where x de- notes the Fourier transform of x. This is a generalization of the Calder6n-Zygmund singular integral operator. The following is a sufficient condition for T to be a bounded linear operator in Lp(Rn) for every I <p < 000: There exists a constant C such that the inequality la;a(t, ,)I.s; C(l + 1,1)-1'1 930 holds for every tmulti-index C>: with lexl.s; n + 2, and there exists a function CV(15) such that JJCV(15)2 15 - 1 d15<00 and that sup la;a(t, ,)-a;a(s, ,)I.s;cv(15)(1 + 1,1)-1'1 I'-slo for every multi-index C>: with lexl.s; n + 2 (T. Muramatu and M. Nagase, Proc. Japan Acad. (1979); [19]). Pseudodifferential operators play a decisive role in the modern theory of partial differential equations ( 345 Pseudodifferen- tialOperators). (6) Toeplitz Operators. Let L 2 be the L 2 -space on the unit circle in the complex plane with respect to the linear Lebesgue measure, and let H 2 be the tHardy space. Each bounded measurable function <p on the unit circle gives rise to the Toeplitz operator T", in Hz defined by T",f = P(<pf) for fE H 2 , where P is the ortho- gonal projection from L 2 to Hz. The matrix (ex nm ) of the Toeplitz operator T", with respect to the complete orthonormal basis {Xn I O.s; n < oo}, where Xn(()==(n for 1(1 == 1, is given by C>:nm = 1>n-m where 1>k is the kth tFourier coeffi- cient of <po If <p is in the tHardy space H L , then the Toeplitz operator T", is subnormal. When <p(O=(, the Toeplitz operator T", is called the shift operator (or shift). A Toeplitz operator T", cannot be compact except when <p = O. When <p is a continuous function, the Toeplitz operator T becomes a Fredholm operator if and only '" d . if <p does not vanish on the unit circle, an III this case the index of T", is equal to minus the winding number of the curve traced out by <p with respect to the origin. Toeplitz opera- tors play an important role in approximation theory and prediction theory [20]. References [I] N. Dunford and 1. T. Schwartz, Linear operators, Interscience, I, 1958; II, 1963; III, 1971. [2] E. Hille and R. S. Phillips, Functional analysis and semigroups, Amer. Math. Soc. Colloq. Publ., second edition, 1957. [3J F. Riesz and B. Sz.-Nagy, Functional analysis, Ungar, 1955. (Original in French, 1952.) [4] K. Y osida, Functional analysis, Springer, sixth edition, 1978. [5J M. H. Stone, Linear transformations in Hilbert space and their applications, Amer, Math. Soc. Colloq. Publ., 1932. [6] N. I. Akhiezer and U. M. Glazman, The- ory of linear operators in Hilbert space, Pit- man, 1980. (Original in Russian, 1966.) [7J I. C. Gokhberg (Gohberg) and M. G Krein, Introduction to the theory of linear non-self-adjoint operators in Hilbert space, 
931 Amer. Math. Soc. Trans!. of Math. Mono- graphs, 1969. (Original in Russian, 1965.) [8] M. G. Krein, Theory of self-adjoint ex- tensions of semibounded Hermitian opera- tors and its application I, II (in Russian), Math. Sb., 20 (62) (1947),431-495; 21 (63) (1947),365-404. [9J P. R. Halmos, A Hilbert space problem book, Springer, second edition, 1982. [10] K. Clancey, Seminormal operators, Springer, 1979. [11] H. Helson, Lectures on invariant sub- spaces, Academic Press, 1964. [12] H Radjavi and P. Rosenthal, Invariant subspaces, Springer, 1973. [13J M. S. Brodskii, Triangular and Jordan representations of linear operators, Amer. Math. Soc. Trans!. of Math Monographs, 1971. (Original in Russian, 1969.) [14J C Pearcy (ed.), Topics in operator theory, Amer. Math. Soc. Surveys, 1974. [15] B. Sz.-Nagy and C. Foia, Harmonic analysis of operators on Hilbert space, North- Holland and Akademia Kiado, 1970. [16J N. K. Nikolskii, Lectures on shift opera- tors (in Russian), Nauka, 1980. [17J M. A. Krasnoselskii, p, P. Zabreiko, E 1. Pustylnik, and P. E. Sobolevskii, Integral operators in spaces of summable functions, Noordhoff, 1976. (Original in Russian, 1966.) [18J E. M. Stein, Singular integrals and differ- entiability properties offunctions, Princeton Univ. Press, 1970, [19J R. R. Coifman and Y. Meyer, Au del:'! des operateurs pseudodifferetiels, Asterisque, 57 (1982), [20J R. G. Douglas, Banach algebra tech- niques in operator theory, Academic Press, 1972. 252 (XIII.6) Linear Ordinary Differential Equations A. General Remarks Let PI (x), ..., Pn(X)' q(x) be known functions of a real (or complex) variable x, An ordinary differential equation i n )+Pl(X)i n - 1 )+. .+Pn(x)y=q(x) (1) containing an unknown function Y and its derivatives y', y", ..., y(n) of order up to n is called a linear ordinary differential equation of the nth order. In particular, a linear differential equation y(n)+Pl(x)i n - 1 )+ ..+Pn(x)y=O 252 B Linear Ordinary Differential Equations with q(x) = 0 is said to be homogeneous. If q(x)#O, (I) is said to be inhomogeneous. The existence and uniqueness theorems for solu- tions to the initial value problem are valid for (1) ( 316 Ordinary Differential Equations (Initial Value Problems)). Moreover, the fol- lowing theorem holds: Let D denote an inter- val in the real line or a domain in the complex plane. If the coefficients Pk(X), q(x) are continu- ous in an interval D, then every solution of (I) has D as its interval of definition. If Pk(X), q(x) are holomorphic in a domain D, then every solution of (1) is continued analytically along any path in D Combining this theorem and the existence and uniqueness theorems we have the following theorem: Let Pk(X), q(x) be continuous in an interval D. Then for every point Xo in D and every n-tuple of num- bers 11, 11',..., 11(n-l), there exists one and only one solution y(x) of (1) satisfying the initial conditions y(x o ) = 11, y'(xo) = 11', ..., y(n-l)(x o ) = 11(n-l) (2) and such that y(x), y'(x), ..., y(n)(x) are all con- tinuous in D. If pdx), q(x) are holomorphic in D, then there exists one and only one solution y(x) of (1) satisfying (2) and such that y(x) is tcomplex analytic (not necessarily single- valued) in D It follows that every point of discontinuity (or singular point) of a solution of (1) is a point of discontinuity (or singular point) for at least one of the functions pdx), q(x) ( 254 Linear Ordinary Differential Equations (Local Theory)). B. Fundamental Systems of Solutions The totality of solutions of a homogeneous linear ordinary differential equation forms a linear space (over the real or complex field). That is, any linear combination y(x) = L;"1 CiYi(X) of the solutions Yh yz,..., Ym of (I'), where the C i are arbitrary constants, is also a solution of(I'). This is called the prin- ciple of superposition More than n + 1 solu- tions of (1') are always linearly dependent, that is, if m;;' n + 1, we can find m constants C l' C 2 , ..., Cm' not all equal to zero, such that L;"1 CiYi(X)=O, Equation (1') has n linearly independent solutions For instance, the n solutions y" Y2,"" Yn defined by the initial conditions Ydxo)= I, y'dxo)=O, ...,y\n-l)(x o )=O, Y2(X O )=0, Yz(x o )= 1,..., y-I)(XO)=O, Yn(XO)=O, y(xo)=O,.. ,yn-l)(xo)= 1 (3) (1') are linearly independent. Such a system of n 
252C Linear Ordinary Differential Equations linearly independent solutions Yl, "', Yn of (1 ') is called a fundamental system of solutions of (1 '). In terms of a fundamental system of solutions Yl, ...,Yn, each solution Y of (1') is represented uniquely in the form y(x) = L?=l CiYi(X). C. Liouville's Formula In order for n solutions Yl, Y2, ...,Yn of(1') to be linearly independent, it is necessary and sufficient that the tWronskian determinant W(y" Y2, ..., Yn) #0 in D, Furthermore, the coefficients pdx) can be represented in terms of an arbitrary fundamental system of solutions YI' Yz, ...,Yn since the coefficient of y(n-k) in the expansion of (-I)"W(Y,Ydx),Yz(x),.", Yn(x)) W(YI (X)'Y2(X),.,. ,Yn(x)) is identically equal to pdx) in D. Using this equality for PI (x), we obtain Liouville's formula: W(YI (x), ..., Yn(x)) = W(YI (x o ),..., Yn(xo))exp ( - I>l (t)dt). D. Lagrange's Method of Variation of Constants The difference of two solutions of the inhomo- geneous equation (I) is a solution of the homo- geneous equation (1 '). Consequently, the tgen- eral solution of (1) can be represented as the sum of a tparticular solution of (I) and the tgeneral solution of (1 '). Since a particular solution of (1) can be obtained from an arbi- trary fundamental system of solutions Yl, Y2, ..., Yn of (1 '), (I) can be solved if a funda- mental system of solutions for (1 ') is known, In fact, if we consider C l , C z , ... , C n in the repre- sentation Y= L?l Ciy;(x), not as constants, but as functions of x, and determine them by the conditions Yl(X)C;(X)+ Yz(x)C;(x) + ... + Yn(x)C(x)=O, y'l (x)C; (x) + y'z(x)C;(x) +.., + y(x)Qx)=O, yln-l)(x)C; (x) + y-1)(x)C;(x) +... + yn-l )(x)Qx) = q(x), (6) then y(x)=L7=1 C,(x)y;(x) is a solution of (1). This is always possible because from (6) we obtain dC i q(x) W;(x) dx W(Ydx),...,Yn(x))' 932 where W;(x) is the tcofactor of yn-l)(x) in the determinant W(ydx), ..., Yn(x). This method is called Lagrange's method of variation of con- stants (or variation of parameters). E. Linear Ordinary Differential Equations with Constant Coefficients (4) A linear ordinary differential equation y(n)+aly<n-l)+... +qn-Iy' +any=O (8) with constant coefficients a i has Y = exp rx as a solution if r is a root of the algebraic equation f(r)=rn+alr n - I +... +an-lr+an=O, (9) called the characteristic equation of (8), Let r I, r z , .,. , r m be the distinct roots of (9), and sup- pose that the root r i has multiplicity Jii (i = 1,2, ." , m). Then the set of functions er1X, xer1x, ...,x/11-1ertx; ...; errnx, xerm\ ..., Xflm-1 e rmx (10) is a fundamental system of solutions of (8). (5) F. D' Alembert's Method of Reduction of Order (7) Let Yl (x) be a solution, not identically equal to zero, of the homogeneous equation (1'). By substituting Y = YI z into (1'), we see that z' satisfies a linear differential equation of order n - \. This method is called d' Alembert's method of reduction of order. Since linear ordinary differential equations of the first order can be integrated by quadrature ( Appendix A, Table 14.1), a homogeneous linear ordinary differential equation of the second order can be integrated completely if one solution of the equation that does not identically vanish is known. So far we have outlined a general theory of solutions of (1) in the domain where solutions are continuous or complex analytic, but in order to have thorough knowledge of all the solutions, we have to examine their behavior also in the neighborhood of a singular point (which is a discontinuity point or a singular point for at least one of the coefficients) ( 253 Linear Ordinary Differential Equations (Global Theory), 254 Linear Ordinary Dif- ferential Equations (Local Theory)). Also, boundary value problems are important as well as initial value problems described before, especially for second-order equations in con- nection with mathematical physics [7]. For these  315 Ordinary Differential Equations (Boundary Value Problems), 390 Spectral Analysis of Operators. 
933 G. Systems of Linear Ordinary Differential Equations of the First Order Let the hix) be known functions. A system of linear differential equations of the first order dYl/dx= ft dX)Yt +... + ftn(x)Yn+gt(x), dYz/dx= f2dx)Yt +... + f2n(X)Yn+g2(X), dYn/ dx = fndx)Yt +... +fnn(x)Yn +gn(x) with n unknowns Yt, Yz, ..., Yn contains (1) as a special case, since (1) is transformed into (11) by setting Y = Y l' y' = Yz, ..., yrn-t) = Yn' A sys- tem (11) with gi(X)=O, Le., a system dYt/ dx = ftt (x)Yt +... + ftn(x)Yn, dYn/ dx = fnt (x)Yt +... + fnn(x)Yn (11 ') is said to be homogeneous, while (11) is said to be inhomogeneous. Suppose that the h;(X), gi(X) are continuous in an interval D, Then the following theorem holds: To every point Xo in D and every initial condition Yl (xo) = b l , Y2(XO) = b 2 ,..., Yn(XO) = b n , there corresponds one and only one solution that is continuous in D, If the hj(x), gi(X) are holomorphic in a domain D, there corresponds one and only one solution that is complex analytic (not necessarily single-valued) in D. It follows that every point of discontinuity (singular point) of a solution of (11) is a dis- continuity (singular) point for at least one of the coefficients hix), gi(X), H. Fundamental Systems of Solutions If m;;' n + 1, m solutions (Yti, Yzi, ..., Yn;) (i = 1,2, ." , m) of (11 ') are linearly dependent, Le., we can find constants C t , C 2 , .., , Cm, not all equal to zero, such that L7'=t CiYki(X)=O (k= 1,2, ...,n). System (11') has n linearly inde- pendent solutions. To see this, we have only to choose the initial conditions so that Yik(X O ) = I, i=k, =0, i#k. Such a system of n linearly independent solu- tions (Yli,Y2i, ...,Yn;) (i= 1,2, ...,n) is called a fundamental system ofsolutions of (11'). In terms of this fundamental system, any solution (y t, .., , Yn) of (11 ') is represented uniquely in the form Yk(X) = L.71 GjYkix) (k= 1,2, ..., n). The linear independence of n solutions (YlI, ..., Ytn), .", (Ynl, ..., Ynn) is equivalent to 252 J Linear Ordinary Differential Equations the condition that the determinant Yll(X) Yt2(X) A(x)= Yzd x ) yu(x) Ynd x ) Yn2(X) Ytn(X) Y2n(X) Ynn(X) does not vanish in D. Corresponding to Liouville's formula (5), we have (11) A(X)=A(Xo)eXpC [hi(t)d} I. Method of Variation of Constants The general solution (Y I , Y 2 , ..., ¥,,) of the inhomogeneous equation (11) is given as the sum of the general solution (Yt, Y2, ..., Yn) of (11 ') and one particular solution (Y t 0, Y 20 , . .., ¥"o) of (11), i.e., in the form (Yl + Y lO , Y2 + Y 20 , "', Yn + ¥"o). To obtain the particular solution () 0' Y 20 , ..., ¥"o), we take any fundamental system of solutions Yt = <Plk(x), Y2 = <P2k(X), ...,Yn=<Pnk(X), k = 1,2, ..., n, of (11') and consider the constants Uk in the linear combination n Yi= L <Pik(X)Uk> i= 1,2, ...,n, (12) kt as functions of x. Substituting (12) into (11), we obtain a system of differential equations n L <Pik(X)U(X)=gi(X), i= I, 2, ...,n, (13) kt with unknowns Uk' Since the Yi form a funda- mental system, the determinant of the matrix with elements <Pik(X) does not vanish. Hence (13) can be solved in the form u(x)=Gdx), k=I,2,...,n, and the uk(x) can be obtained by quadrature, Consequently, it follows that one particular solution can be given, in terms of the uk(x), in the form (12). This method is also called the method of variation of constants. J. Systems of Linear Ordinary Differential Equations with Constant Coefficients Suppose that the coefficients k in (11') are all constants. Then n columns of the matrix exp(Fx) form a system of fundamental solu- tions of (11'), where F denotes the matrix [hJ. Therefore the general solution has the form m Yj=L k(x)eA'X, j=I,2,...,n. k=t 
252 K Linear Ordinary Differential Equations Here AI, A2' ..., )'m are the distinct roots of the characteristic equation f11- A f12 fin f21 f22 - A f2n =0, fnl .f2 fnn - ;, and if Ak has multiplicity e k (Lj1 e j = n), k(X) is a polynomial of degree at most e k - 1 which contains e k arbitrary constants. Suppose that the coefficients fjdx) in (11') are al1 periodic functions having the same period OJ. Then there exists a linear trans- formation yj = L qjk(X)Zk in which the qjk are periodic with period OJ, such that the original equation is reduced to dz)dx = LCjkZk> where the Cjk are constants (Floquet's theorem). Hence if we can find such a linear transfor- mation, we can integrate the original equation. K. Adjoint Differential Equations Consider a linear homogeneous ordinary differential equation F(y)= Po in) + Ply(n-I) +." + PnY=O, Integration by parts of JzF(y)dx gives zF(y)- yG(z)=d[R(y, z)]jdx, where G(z) = (-I)"«p o z)ln) -(p I z)(n-I) + ". +(-I)"Pn z ), n k-l R(y, z) = I I (-1 )hy<k-h-I)(Pn_kZ)(h), klhO The equation G(z) = 0 is cal1ed the adjoint differential equation of F(y)=O, and the iden- tity (14) is cal1ed the Lagrange identity, By integrating (14), we have Green's formula f X 1 (zF(y)- y G(z)) dx= R [y,zJ (xd x" -R[y,zJ(xo)' The adjoint differential equation of the adjoint equation of F(y) =0 coincides with F(y)=O. If y is a solution of F(y) = 0, then the solution z of the adjoint differential equation satis- fies the (n - 1 )st-order differential equation R(y,z)=constant When G(y)=F(y), F(y)=O is cal1ed a self-adjoint differential equation, In the case of second-order equations with real coefficients, its general form is F(y) = d(pdyjdx)jdx + qy =0. For systems of differential equations, the adjoint system of(II') is defined by dz)dx= -luzl- 'h,jZZ- ...- fnjzn, j = 1, 2, ... , n. (16) 934 Conversely, (11') is the adjoint system to (16). If(YI,Y2, ...,Yn)' (ZI,Z2, ...,zn) are solutions of (11 '), (16), respectively, then we have L}=I yjZj = constant. The system (11') is cal1ed a self- adjoint system of differential equations if (11') coincides with (16), i,e., if fjdx) = - hj(x), L. Laplace and Euler Transforms When the coefficients Pi(X) (i= 1,2, ""n) in (I') are rational functions, it often happens that we can find a solution in the form y(x)= r v(t)eX1dt. (17) Namely, we can often find a suitable function v(t) so that the tLap1ace transform (J 7) of v(t) is a solution of (1 '). Similarly, it is often possible to find a solution of (1') as the Euler transform y(x)= r v(t)(I-xjP-ldt (18) of some suitable v(t), These transforms are used for the integral representation of special functions. (14) M. Linear Ordinary Differential Equations and Special Functions A number of transcendental functions, such as thypergeometric functions, tBesse1 functions, tLegendre functions, etc., and Hermite poly- nomials, Laguerre polynomials, Jacobi poly- nomials, etc" are defined by linear ordinary differential equations of the second order ( 389 Special Functions). References (15) [1] E, A. Coddington and N. Levinson, Theory of ordinary differential equations, McGraw- Hil1, 1955, [2J K. O. Friedrichs, Lectures on advanced ordinary differential equations, Gordon & Breach, 1965 [3] E, L. Ince, Ordinary differential equations, Dover, 1956, [4] P. Hartman, Ordinary differential equa- tions, Wiley, 1964. [5J E. Hil1e, Lectures on ordinary differential equations, Addison-Wesley, 1968, [6] M. A. Naimark (Neumark), Linear dif- ferential operators, I, II, Ungar, 1967, 1968. (Original in Russian, 1954.) [7] K. Y osida, Lectures on differential and integral equations, Wiley, 1960. (Original in Japanese, 1950.) 
935 253 (XIII.8) Linear Ordinary Differential Equations (Global Theory) A. General Remarks Let there be given a linear differential equation of the nth order y ln ) + a l (x)y'n-l) +... + an(x)y =0, or a system of linear differential equations y' = A(x)y, which is the vector-matrix expression of n yj=Lajk(x)Yk> j=1.".,n, k1 where the coefficients ak(x), ajk(x) are complex analytic functions of x in a certain complex domain D, The solutions of (1) or (2) are known to be holomorphic when the coeffi- cients are all holomorphic. However, at a singular point of at least one of the coefficients, a tbranch point of the solution usually ap- pears, Thus a solution of (I) or (2) is, in gen- eral, a tmultiple-val ued analytic function of x. The object of global theory is the function- theoretic study of this function-that is, deter- mination of its tRiemann surface and investi- gation of its behavior on the Riemann surface. At a tregular singular point of (1) or (2), any solution can be expressed explicitly by the combination of elementary functions and power series convergent within a circle around the singular point. In the presence of an tirre- gular singular point, instead of a convergent expression, we can construct an asymptotic expansion valid within a certain sector whose vertex is situated at the singular point ( 254 Linear Ordinary Differential Equations (Local Theory)). Once such expressions have been obtained, the remaining task is to find the relations connecting those locally valid ex- pressions, Therein lies the main and most dif- ficult part of global theory. The problem of determining the relations, called the connection formulas, is the connection problem. Equation (1) or the system (2) is said to be of Fuchsian type if all singular points of (1) or (2) are regular singular points. If (1) is an equation of Fuchsian type defined on the Riemann sphere and having singularities at a j , a 2 , "', am, a m + l = 00, then we have the Fuchs relation m+l n L L Pjk=(m-l)n(n-I)/2, j1 k=l where Pjl, .., , Pjn are the texponents at aj of (l). 253 B Linear ODEs (Global Theory) B. Monodromy Groups (1) Suppose that the coefficients of equation (2) are defined on a certain Riemann surface i)' with singular points ai' a 2 , '''' By deleting ai' a z ,... from i)', another Riemann surface 3" is obtained. Choose a point x on i)", and let Y(x) be any fixed branch of a tfundamental system of solutions of (2). Also, let r be a circuit on 3" starting from x, By an analytic continuation along r, another branch of Y(x) is obtained, which we denote by Y(xr). It is known that in this case these two branches are connected by the relation Y(xq= Y(x)C r , where C r is an n x n constant matrix, and also that if r I and r z are thomotopic circuits, C r1 = C r2 , So the branch Y(xn and the matrix C r are deter- mined by the thomotopy class Y to which r belongs Thus we can write Y(xy) or C y instead of Y(xq or Cr. Now let G be the tfundamental group of 3". Since Y(xY2Yd= Y(XY2)C Y1 = Y(x)C y2 C Y1 for any y" Y2EG, the correspondence y-->C y defines a trepresen tation of G. The gro up 9 = { C y lYE G}, which is naturally homomorphic to G, is called the monodromy group of equation (2). For equation (1), we can also define the monodromy group of the equation by 9 = {CyIYEG}, where C y is a matrix such that (ydxy),. " Yn(xy)) = (YI (x), "., Yn(x))C y , where YI (x), ..., Yn(x) are linearly independent solu- tions of (1). If the equation is of Fuchsian type, the global problem can be regarded as solved when the monodromy group of the equation has been completely determined, If 3' is a complex sphere and the equation is of Fuchsian type, the number of singular points of the coefficients is of course finite. Let a j,.. , am be those singular points, and h be the homotopy class of 3" determined by a closed curve r k that encloses only one sin- gular point a k . Then the monodromy group g is generated by the matrices C y1 , .,., C ym ' Obviously C y1 , ..., C Ym are not necessarily in- dependent. At least one relation C y1 ' ..., C Ym = I (a unit matrix) always holds, In this case, tJordan canonical forms of C y , are all de- termined from the convergent expression for the fundamental system of solutions valid around a k constructed by the famous tFro- benius method. However, the calculation of C, itself is generally impossible. If n = 2, m = 3, and the coefficients are all rational functions of x, equation (1) or (2) is completely determined if we fix the roots of tindicial equations at every a k , as long as the equation is of Fuchsian type. Therefore the monodromy group is determined by the values of the roots of indicial equations. Since these roots are calculated purely algebraically, the (2) (2') 
253C Linear ODEs (Global Theory) monodromy group is determined by algebraic procedure in this case [5]. Let n = 2, m = 3 in equation (1). Denote by a, b, c the three singular points and by ..1., ic'; Ji, Ji'; v, v' the roots of indicial equations at a, b, c, respectively. As was mentioned in the pre- vious paragraph, the equation is determined uniquely by these nine quantities, Hence they also determine a family of functions consisting of all the solutions of the equation. This family is usually written as p{, b Ji Ji' v' c v x} and is called the P-function of Riemann [2]. A simple transformation of variables reduces it to the totality of solutions of Gauss's thyper- geometric differential equation x(1-x)y" +(y-(ex+ fJ + l)x)y' -exfJy=O. (3) Some solutions of(3) are expressed by a hyper- geometric integral L t'-Y(t-1y-P-l(t-xr'dt, where C is a suitably chosen path of integra- tion [2J ( 206 Hypergeometric Functions). Calculation of the monodromy group of the equation (I) or (2) is still an unsolved problem except for the case n = 2, m = 3 and a few other particular cases, C. Equations with an Irregular Singular Point In the presence of an irregular singular point, complete knowledge of the monodromy group is still insufficient for the solution of the global problem, It is only the structure of the Rie- mann surface that is known from the mono- dromy group, and the behavior of the solution on the Riemann surface still remains to be studied. At an irregular singular point, the solution can be expressed only by an tasymp- totic series valid within a certain sector, and the same solution possesses completely differ- ent expressions in different sectors. This is called tStokes's phenomenon ( 254 Linear Ordinary Differential Equations (Local The- ory)). Thus, to complete the global theory, connection formulas between different asymp- totic expressions must be established. For a second-order linear equation with two singular points, one of which is regular and the other irregular of the first rank, the problem is completely solved. In this case, the equation can be reduced to a tconfluent hypergeometric differential equation [2J ( 167 Functions of Confluent Type). The problem is also partly solved for a linear equation of higher order 936 with two singular points, one of which is regu- lar and the other irregular (K. Okubo, J, Math. Soc. Japan, 1963). If two singular points are both irregular, even the monodromy group cannot be cal- culated in general. For such a case, G, D. Birkhoff proposed a method of reducing one of the singular points to a regular one. He showed that this procedure is possible under certain assumptions on the monodromy ma- trix (Math. Ann., 1913). D. Riemann's Problem As a noteworthy result for equation (1) of Fuchsian type with algebraic coefficients, Poincare's theory deserves special mention. According to his theory, a solution of (1) can be uniformized in the form y = f(z), x = g(z), where f and 9 are single-valued analytic func- tions of z. Although it is known generally that any analytic function admits such uniformiza- tion ( 367 Riemann Surfaces), Poincare's theory gives a more explicit and efficient uni- formizing construction. As uniformizing para- meter z, we may take a ratio of two indepen- dent solutions of a certain linear differential equation of the second order that is deter- mined from (1), and f and 9 are, in general, tFuchsian functions, i.e., tautomorphic func- tions for a certain tFuchsian group, save for a few exceptional cases in which they are ra- tional or telliptic functions. Brief mention should be made of Riemann's problem as a problem closely related to the global theory of linear differential equations. This problem was taken up by tHilbert in his famous Paris lecture as the 21st problem, and hence is often called the Riemann-Hilbert problem. The problem can be stated as fol- lows: Suppose that we are given a Riemann surface 3', points ai' a2' .., on 3', and a group 9 of n x n matrices homomorphic to the funda- mental group of 3' - {ai, az, ...}. Then find an equation of the form (2) such that (i) the coeffi- cient A(x) is single-valued and meromorphic on 3'; (ii) the singular points are all regular and situated at ai' a2, ...; and (iii) the monodromy group of the equation coincides with 9 if a fundamental system of solutions is suitably chosen. Extensive research was done by many mathematicians, and finally H. Rohrl suc- ceeded in solving the problem (Math. Ann., 1957). E. Isomonodromic Deformations M. E. R. Fuchs considered the equation d 2 yldx 2 =p(x)y, (4) 
937 where p(x) is given by a b c d p(x)=-++-+- x 2 (x-l)2 (X-t)2 x(x-l) 3 IX + 4(X-le)2 + x(x-l)(x-t) lJ + x(x-l)(x-A) and he proposed the following problem: Ob- tain conditions among the parameters a, b, c, d, t, }" cx, fJ such that the monodromy group of (4) is kept invariant when these parameters vary, under the hypothesis that a fundamental system of solutions around x = ..1. of (4) does not contain logarithmic terms. It is clear that a, b, c, d remain constant. Fuchs obtained a necessary and sufficient condition which is stated as follows: ..1., cx, f3 are considered as functions of t and ..1. satisfies the tPainleve equation (VI) and cx, f3 are rational functions of Ie and dA/dt (Math. Ann., 63 (1907)). The result of Fuchs was extended by R. Garnier into two directions: Considering equa- tions of the form (4) with irregular singular points, Garnier derived the Painleve equations (I)-(V). Then from equation (4) with a b N C. d p(x)=-++ I +- x 2 (x_I)2 j=dx-t/ x(x-l)  3  cx j +L., 2 +L., j14(X-A) jl x(x-1)(x-t) + f f3j j=l x(x-1)(x-Al he obtained a system of partial differential equations called the Garnier system (Ann. Sci, Ecole Norm. Sup., 1912), For equations with irregular singular points, the problem must be modified as follows: Deform equations so that not only the mono- dromy group but also the system of tStokes multipliers are kept invariant. This modi- fied problem is called the isomonodromic deformation. L. Schlesinger studied the isomonodromic deformation of the system dy = ( f ) y, dx j=l x - t j and he derived a tPfaffian system, called the Schlesinger equations (Crel/es J., 1912), The research of the theory of isomono- dromic deformation has recently become active. K. Okamoto has investigated the iso- monodromic deformation of Painleve equa- tions, Garnier systems, and linear ordinary differential equations in detail, and T. Miwa and M. limbo have extended the Schlesinger 254 A Linear ODEs (Local Theory) equations. Also  288 Nonlinear Ordinary Differential Equations (Global Theory), References [IJ E. A. Coddington and N. Levinson, Theory of ordinary differential equations, McGraw-Hili, 1955. [2J E. Hille, Ordinary differential equations in the complex domain, Wiley, 1976. [3J L. Schlesinger, Handbuch der Theorie der linearen Differentialgleichungen I, III, 11 2 , Teubner, 1895-1898. [4J E. Hilb, Lineare Differentialgleichungen in Komplexen Gebiet, EnzykI. Math., Bd. II, 2, Heft 4 (1913), 471-562. [5J C. E. Picard, Traite d'analyse III, Gauthier-Villars, 1896. [6J A. Lappo-Danilevskii, Memoires SUr la theorie des systemes des equations differen- tieJles Iineaires, Chelsea, 1953. [7J N. I. MusheIishvili, Singular integral equa- tions, Noordhoff, 1953. (Original in Russian, 1946.) [8J l. Plemelj, Problems in the sense of Riemann and Klein, Wiley, 1964. 254 (XIII.7) Linear Ordinary Differential Equations (Local Theory) A. Singular Points Consider a system of n linear ordinary dif- ferential equations dy dx = A(x)y, (1 ) where the independent variable x belongs to a domain in the tRiemann sphere, y is a complex n-dimensional column vector (YdX),Y2(X), "', Yn(x)), and the n x n matrix A(x) has tcomplex analytic functions as elements. A singular point x = a (#00) of A(x) is called a singular point of the system (I), Let dy/dt=B(t)y (2) be the system transformed from (1) by the change of variable t = l/x. The point x = 00 is a singular point of the system (1) if t = 0 is a singular point of (2). By definition a point x = a or x = 00 is a singular point of (l) if t = 0 is a singular point of the transformed equation (2) by the change of variable t==x-a or t= l/x. It follows that by the use of tlocal coordinates the notion of singular points of the system can 
254 B Linear ODEs (Local Theory) be extended to the case when A(x) is complex analytic on a tRiemann surface Consider a single nth-order differential equation defined on a Riemann surface y<n) + a l (x)y(n-l) +". + an(x)y = 0, The above definition of singular points may easily be adapted for use in equation (3), because (3) can be converted into a system (I) by a simple transformation. B. Classification of Singular Points Suppose that A(x) is holomorphic in a deleted neighborhood O<lxl < R of x=o. Then any solution of (!) can be continued analytically in 0< Ixl < R and is not necessarily single-valued ( 252 Linear Ordinary Differential Equa- tions). For simplicity, we denote by xe 2 "i the terminal point of a closed path starting and ending at x and surrounding x = 0 once in the positive sense. Let y dx), ..., Yn(x) be a funda- mental system of solutions of (1). Then the fundamental matrix solutIOn Y(x) = (y d-x), ..., Yn(x)) undergoes a linear transformation Y(xe 2 "i) = Y(x)M when x is transformed to xe 2 "i, where M is a nonsingular constant ma- trix. The matrix M is the monodromy matrix (or circuit matrix) of Y(x) at x =0 If we take a matrix S such that M = e hiS , then there exists a matrix P(x) whose elements are holomor- phic functions in 0 < Ixl < R and such that Y(x) has the form Y(x) = P(x)x s , where x S is defined by X S =exp(Slogx). If for a solution y(x) of (I) and an arbitrary sector 1: there is a positive number r such that limxolxlry(x)=O, x=O is a regular singular point of the solution y(x); and if there is no such number, x =0 is an irregular singular point of y(x), If x = 0 is a regular singular point of all the solutions of (1), it is a regular singular point of the system (1). If some of the solutions have x = 0 as an irregular smgular point, then it is an irregular singular point of the system (1). A necessary and sufficient condition for x = 0 to be a regular singular point of the sys- tem is that an arbitrary fundamental matrix solution Y(x) of(!) in the form Y(x)=P(x)X' as described above have x =0 as a pole of P(x). In the same way, we can give the definition of regular singular points and irregular singular points for equation (3), C. Regular Singular Points Consider equation (3). A necessary and suffi- cient condition for x == 0 to be a regular sin- 938 (3) gular point of (3) is that every adx) have a pole of order at most k at x =0, Consequently, we can write (3) in the form xny(n) + A 1 (x)xn-J y(n-l) +." + An(x)y=O, (4) where each Ak(x) is holomorphic at x=o, Equation (4) has a fundamental system of solutions of the form y = xl', Pdx, log x), where fJk are n roots of the indicial equation at x = 0 of (4): p(p-l)...(p-n+ !)+AdO)p(p-1)... (p-n + 2)+... + An-l (O)p+ An(O)=O, and Pk(x, L) is a polynomial in L of degree at most equal to the number of roots of the indicial equation that are congruent to Pk (modulo integers) with coefficients holomor- phic functions of x. The quantities fJk are called the exponents at x = 0 of (3). If the real part of Pk is largest among the real parts of the roots Pi that are congruent to fJk modulo in- tegers, there exists a solution with the expo- nent fJk and not containing the logarithmic term. (A number a is congruent to b modulo integers when a - b is an integer.) In particular, no solution contains the logarithmic terms when no pair of roots of the indicial equatIOn is congruent modulo integers, Frobenius's method is convenient for finding these solu- tions ( Appendix A, Table 14). We return to system (1). The following theorem gives a simple sufficient condition for x = 0 to be a regular singular point, but there is no simple necessary condition: x = 0 is a regular singular point of (1) if x = 0 is a simple pole of A (x), Then (I) is written as x(dyjdx) = C(x)y, (5) where C(x) is holomorphic at x=O. Equation (5) has a fundamental system of solutions of the form Yk = XP'Pk(X, log x), k = 1, "" n, where the exponents Pk are n roots of the indicial equation at x=o of (5): det(C(O)- pI)=O, and the Pk(X, L) are vector functions whose compo- nents are polynomials in L with coefficients holomorphic at x = O. If none of the exponent- differences Pi - Pk (.i # k) is equal to an integer, then there is a fundamental system of solutions of the form Yk=XP'Pk(X) The above result is derived from the following theorem: There exists a transformation y = P(x)z that takes (5) into a system x(dzjdx)=Dz, where P(x) is a matrix holomorphic at x=o and D is a constant matrix. If all solutions of (1) or (3) are meromorphic at x = 0, then x = 0 is called an apparent sin- gular point. 
939 D. Irregular Singular Points If at least one of the coefficients a)x) of equa- tion (3) has a pole of order more than j, then x =0 is an irregular singular point of (3), Let m j be the order of the pole of a)x) at x = 0, with the convention that mj= 00 when aj(x)=O; and assume that mo = 0 for simplicity. Let Av (v = 0, 1, '" , n) be the points with coordinates (v, m,) in the Euclidean plane and II be the upper half of the boundary polygon of the tconvex hull of the set {Ao,A", "An}. This polygon is calIed the Newton diagram of equa- tion (3). Let (v,rv) be the intersection of the straight line x = v with IT, and consider the nonincreasing sequence of numbers {O"j}, 0",= rv - r v - I (ro =0). We assume Ji to be an integer such that 0"1  0" Z  '"  O" > I  0"+1  ...  0".. To the first Ji sections of the diagram IT with slopes O"j> 1 there correspond Ji formal solu- tions of the folIowing form that are formally linearly independent: y=exp(Xk(x»)xPkPdx,logx), where the Xk(x) are polynomials in fractional powers of X-I and the Pk(x,L) are polynomials in L with coefficients given by formal series in fractional powers of x, Associated with the sections of IT corresponding to those n - Ji numbers O"j.s; I, there are n-Jilinearly inde- pendent formal solutions of similar form but without the exponential term, Consider a system x r + 1 (dyjdx)= A(x)y, where r is a positive integer and A(x) is holo- morphic at x = O. The system (6) has n formalIy linearly independent solutions: Yk = exp(}'k(X))XPkPk(X, log x), where the }.k(X) are polynomials in fractional powers of X-I and the Pk(x,L) are n-vectors whose components are polynomials in L with coefficients given by formal power series in fractional powers of x, This result is obtained from the following theoreln: There exists a transformation y = P(x)z that changes (6) formally into the system xr+l (dzjdx)=(A(x) +Jxr)z, where P(x) is a matrix with elements given by formal power series in fractional powers of x, A(x) is a diagonal matrix with diagonal ele- ments xr+l A(X), and J is a constant matrix in Jordan canonical form. A necessary and suffi- cient condition for x = 0 to be a regular singu- lar point of (6) is that A(x) vanish identically. Unfortunately, formal power senes appear- ing in the formal solutions of (3) and (6) are in general divergent series. H. Poincare, by intro- 254 E Linear ODEs (Local Theory) ducing the notion of tasymptotic expansions, first proved under a very restrictive hypothesis that these formal solutions represent asymp- totically actual solutions in any small sector. Contributions in this direction had been made, notably by W, 1. Trjitzinski and 1. Malm- quist, but a decisive result was obtained by M, Hukuhara, whose method is also applicable to the study of regular singular points, Let yJ(x), ..., y (x) (<1>1 (X)) be a fundamental system of solutions of (3) (fundamental matrix solution of(6)) that are expressed asymptoti- cally by n formal solutions in a sector Dl and yf(x), ..., y(x) (<1>2(X)) be another fundamental system of solutions of (3) (fundamental ma- trix solution of (6)) of the same nature ex- pressed by the same formal solutions but in a different sector D 2 . Then the two systems (two matrix solutions) are, in general, not the same system (matrix solution), but they are connected by a constant matrix C: (yl(x),..., y(x))=(yf(x), "', y(x))C (<1> 1 (x) = <1>2(X)C). (6) This is the so-called Stokes phenomenon, and the elements of the matrix Care calIed Stokes multipliers. The problem of determining these multipliers is a kind of tconnection problem, and the above equality is a tconnection for- mula ( 253 Linear Ordinary Differential Equations (Global Theory)). For an equation of the form (3) (system of the form (6)) with an irregular singular point at x =0, the polynomials Xdx), ... ';'n(x), the quantities PI, "', Pn' and a set of Stokes multi- pliers form a complete system of invariants under linear transformations of the form 2 =qdx)y+ qz(x)y' +... + qn(x)y<n-l) (2 = Q(x)y), where the qlx) (the elements of Q(x)) are mero- morphic at x=O, Even if equation (3) has x = 0 as an irregular singular point, it may happen that (3) admits solutions holomorphic at x=O, This was first studied by 0, Perron and then by F. Letten- meyer, M, Hukuhara, and Iwano, and by H. Komatsu. G, D. Birkhoffproved that when the monodromy matrix at zero of the system (6) can be diagonalized, there is a nonsingular matrix P(x) (det P(O) #0) such that the linear transformation y = P(x)z transforms (6) into d ( r-l ) x r = I Bk Xk Z, dx k=O E. Singularities with Respect to a Parameter An analogous theory has been obtained for a system of first-order linear differential equa- 
254 F Linear ODEs (Local Theory) tions with a small complex parameter s: hdy s -=A(x,s)y, dx 00 A(x,s) L Adx)sk, k=O where h is a positive integer, the n x n matrices Adx) (k=O, 1,. ,.) are single-valued holomor- phic functions of x in a neighborhood D of the origin, and the tasymptotic expansion is valid when s-->O in a sector E. When all eigen- values of the matrix Ao(O) are distinct, there is a matrix of formal solutions of the form P(x, s)eQ(x.,), where P(x, s) is a formal series of the form A(x, s), Q(x, s) is a diagonal matrix with polynomials in lis of degree h as diagonal elements, and all the coefficients are single- valued holomorphic functions of x on D* (c D). In particular, the coefficient of s -h in the jth diagonal element of Q(x, s) is g Jij(t)dt, where Jij(x) is an eigenvalue of the matrix Ao(x). If we take a certain subsector E* of E, the matrix P(x, s)eQ(x.,) represents an actual matrix solution in E* When there is no tturn- ing point ( Section F), it is always possible, even if there are multiple eigenvalues in Ao(O), to construct formal solutions that are asymp- totic representations of some solutions in some sector. Similar theories were developed for cases where more than two parameters appear. F. Turning Points Consider the point x=O in the system (7), and set n=2, h= 1, and Ao(X)=C ). Then Ao(x) has a multiple root when x=O and dis- tinct roots when x #0. As in this example, when the Jordan canonical form of the leading matrix Ao(x) has different structure for x=O and for x # 0, the coefficients of the formal power series in £ are not single-valued holo- morphic functions of x, and have worse sin- gularities as the order becomes higher. Con- sequently, in the neighborhood of x = 0, we cannot construct actual solutions that can be represented asymptotically by these formal solutions, Such a point is called a turning point (or transition point) of the system (7). If there is a nonsingular formal transforma- tion y= T(x,s)z; T(X,S)=L 7i(X)Sk (det To (0) # 0) having similar analytic properties to those of A(x, s), and if the transformed system has a well-known form, it is possible to give ana- lytic meaning to the formal transformation T(x, G), Then the transformed system is called a related differential equation of (7); in the above example, s(dzldx) = Ao(x)z is a related 940 (7) differential equation that has well-known solu- tions expressed by tBessel functions, What is meant by well-known here is that the behavior of all solutions is known in the entire complex plane for a fixed £. However, it is not easy to find a suitable related differential equation for an arbitrarily given system. References [1] E. A. Coddington and N. Levinson, Theory of ordinary differential equations, McGraw-Hill, 1955. [2] E. L. Ince, Ordinary differential equations, Dover, 1956, [3] E. Hille, Ordinary differential equations in the complex domain, Wiley, 1976. [4] L. Bieberbach, Theorie der gewohnlichen Differentialgleichungen auf funktionentheore- tischen Grundlage dargestellt, Springer, 1953. [5J W. Wasow, Asymptotic expansions for ordinary differential equations, Wiley, 1965. [6J A. Erdelyi, Asymptotic expansions, Dover, 1956. [7] P. Hartman, Ordinary differential equa- tions, Wiley, 1964. 255 (XIX.2) Linear Programming A. Problems A linear programming problem is a special type of mathematical programming ( 264 Mathe- matical Programming) in which all the func- tions involved, i.e., the objective function and the constraints, are linear in the variables. The simplest typical case is formulated as follows, Problem I: Maximize z = c'x, under the condition (C) Ax=b and x;;'O, where x ERn is the vector to be determined, c is a constant n-real vector, b is a constant m- vector, and A is an m x n real matrix. Without any loss of generality we can as- sume that the rank of A is equal to m, because otherwise the condition is either redundant or inconsistent. This also implies that m.s; n. Any vector which satisfies condition (C) is called a feasible solution, and that which maxi- mizes z among all feasible solutions is called an optimal solution. Let Al be an m x m non- singular submatrix of A, i,e., a matrix consist- ing of m linearly independent columns of the matrix A. Denote by A 2 the m x (nm) matrix 
941 formed by the remaining columns of A. Let Xl be the m-vector consisting of the components of X corresponding to Al and Xz be the vector of the remaining components. Then the equa- tion Ax = b can be written as Al Xl + A 2 X 2 = b, and one of its solutions is given by Xl = A I - 1 band X 2 = 0, which is called a basic solu- tion. Furthermore, if Xl ;;,0, the basic solution is called a basic feasible solution, and if it is also optimal, it is a basic optimal solution. Then the following theorem can be proved. Theorem: If there exists a feasible solution, there is also a basic feasible solution, and if there exists an optimal solution, there is also a basic optimal solution. Corresponding to a basic solution, we can derive the expression Xl + A I - l A 2 X 2 = All b; z-(c -C'l All A 2 )x 2 =0, Such an expression is called a basic form of the problem, and the components of Xl are called the basic vari- ables. The basic solution Xl = Al-lb, x 2 =0, is an optimal solution if and only if (Q) Al-lb;;,O and q=cz-AAI-l<cl .s;0, which is called the optimality criterion, Since there are at most nCm basic solutions, we can always find an optimal solution, if there is one, in a finite number of steps. Another form of the linear programming problem is as follows. Problem II: Maximize z = c'x, under the condition (C') Ax.s;b and x;;,O. The two formulations are equivalent be- cause Problem II can be transformed into Problem I by introducing a new nonnegative m-vector s and writing the equation as Ax + s = b. Such a vector s is called the vector of slack variables. Conversely, Problem I can be formulated as Problem II by imposing the conditions Ax.s; b and Ax;;' b instead of the equality Ax = b. Sometimes it happens that for some of the variables nonnegativity conditions are not assumed. Then for the vector X O of such vari- ables we can define X + - X - == x O , X + ;;, 0, and X - ;;, 0, and we can assume that all the vari- ables are nonnegative. B. Duality Problem II has the following dual problem. Problem III: Minimize w = b'y, under the condition (C") A'y;;,c and y;;,O. F or Problem I, if we restate the equality condition as Ax.s; band - Ax.s; - b, then the dual problem can be written as: Minimize 255B Linear Programming w = b'y + - b'y - under the condition A'y + - A'y- ;;'c; y+ ;;,0, y- ;;,0. Then by putting y= y+ -y-, we can formulate the dual problem as follows. Problem IV: Minimize w=b'y, under the condition (C"') A'y;;' c, with no restriction on the sign ofy. In contrast to the dual problem, the original problem is called the primary problem. The following theorem is basic to the theory of linear programming. Duality theorem: If either one of the primary or dual problems has an optimal solution, then the other also has an optimal solution, and it holds that max z = min w, Moreover, if x* and y* are optimal solutions of the two problems, we have c'x*=b'y*=y*'Ax*. Conversely, if x* and y* are feasible solu- tions of the primary and dual problems and if we have c'x*=b'y*=y*'Ax*, then x* and y* are optimal solutions of the respective problems. Let x! = All b, x! = 0 be a basic optimal solution of Problem I. Then the condition in its dual problem is written as A; y ;;, c I and Ay;;' c 2 . Put y* = A;I,C I ; then from the opt i- mality condition it is shown that A;y*;;' c 2 , and also A; y* == c l ; hence y = y* is a feasible solution of the dual problem. It is obvious that c'x* =C'l x! =c l A;l b= b'y* = y*' Al x! = y*'Ax*, and for any feasible solution y of the dual problem, we have b'y = x*' A; y;;' x*'c = b'y*, which establishes that y* is an optimal solution. The duality theorem can be formulated in a more general way: Let Vand W be closed convex cones in Rm and Rn, respectively. Then we state the following problems, Primary problem: Maximize z=c'x, xERn, under the condition b - Ax E V and x E W. Dual problem: Minimize w=b'y, YERm, under the condition A'y-cE W* and YE V*, where W* and V* are the dual cones of Wand V, respectively. We can also write max z = 00 if the value of z is not bounded, and max z = -00 if there is no x which satisfies the condition; similarly, min w = -00 if w is not bounded from below, and min w = 00 if there is no y satisfying the condition, Theorem: If either max z # -00 or min w # 00, we have max z = min w. Moreover, if -00 < max z = min w = c'x* = b'y* < 00, we have y*' Ax* = c'x* = b'y*. Define the expression <p(x, y) = c'x + b'y- y' Ax=c'x + y'(b- Ax)= b'y-x'(A'y-c) for XE Wand yE V*. qJ can be regarded as a Lagrangian form for both the primary and dual problems. And if (x*, y*) is a pair of op- 
255C Linear Programming timal solutions, we have <p(x*,y)<p(x*,y*) <p(x, y*) for all XE Wand YE V*, which means that (x*, y*) is a saddle point of the function <po Now suppose that in the primary problem the constraint vector b can change, and consider max z to be a function of b, denoted by z(b). Then under a small change of b at least some of the optimal solutions of the dual problem will remain optimal; hence for any vector a we have 1 inf a'y*.s;lim-(y(b+ta)-z(b)).s; sup a'y*, y*eY* t Y*EY. where y* is the set of the optimal solutions of the dual problem. Because of this property the solution of the dual problem is called the vector of shadow prices or of the imputed costs of the constraints, SymmetricaIly, the solution of the primary problem describes the rate of change in the value of the objective function under a small change in the constraint vector in the dual problem. The duality theorem is closely related to some theorems on systems of linear inequal- ities and convex cones in a finite-dimensional Euclidean space; especiaIly, the foIl owing are equivalent to 01 easily derivable from the duality theorem. Minkowski-Farkas theo- rem: Given an equation Ax = b, where b is an element of R n , a necessary and sufficient condi- tion for a solution x  0 to exist is that u'b  o hold for any vector u such that u' A  O. Stiemke theorem: For a matrix A one of the following two alternatives holds: (i) Ax == 0, x> o have a solution; (ii) u' A  0 has a solution, Tucker's theorem on complementary slackness: For any matrix A, the two systems of linear inequalities (i) Ax = 0, x  0, and (ii) u' A  0 have solutions x, u satisfying A'u + x > 0, The minimax theorem for tzero-sum two-person games ( 173 Games Theory C) with finite num ber of strategies for both players is also shown to be equivalent to the duality theorem. It is formulated as min max y' Mx = max min y'Mx, yeS2 xeSt xeS t Y ES 2 where M is the payoff matrix and SI and S2 are tsimplexes of mixed strategies. C. Algorithms The most commonly used method for solving linear programming problems numericaIly is the simplex method introduced by van Dantzig [IJ and its variations. It gives a procedure starting from one basic feasible solution to reach an optimal basic solution in a finite number of steps by improving the value of the 942 objective function at each step. Let X.+" d..x.= g . i EI , I L.. lJ J I' jeJ z+ L !jXj=u jeJ be a basic form for Problem I, where I denotes the set of the basic variables and J the set of non basic variables. Then the feasibility implies that d i  0 for all i E I. Furthermore, if Jj  0 for alljEJ, the basic solution Xi=gi for i in I and x j = 0 for j in J is optimal. If Jj* < 0 for some j* in J, define r i = gi/dij* for i in I and for which dij> O. Let r i * = min r i ; then we can delete i* from the set I and add j* to it and get a new basis, and by simple algebraic calculation we get a new basic form corresponding to the new basic feasible solution, for which the value of z is increased by -.!j*ri*' If r i * is always positive, we can get an optimal basic solution, since the above procedure cannot continue in- definitely; and even when r i becomes zero (the degenerate case), we can avoid infinite circular repetition by using a method proposed by R. G, Bland [16]. And !j*<0 and dij.s;O for all i implies that the value of z is unbounded. The array of the coefficients of the basic form is called the simplex tableau, and the method is called the simplex method. In order to obtain a basic feasible solution, the two-phase simplex method is used. Assume that in Problem I the vector b is nonnegative. Then we introduce a new m-vector u caIled the vector of artificial variables, and formulate an additional prob- lem as follows, Pro blem Ia: Maximize t = -I'u under the condition Ax+u=b and xO, uO. This problem can be solved by the simplex method starting from the basic solution u == b, x = 0, and if we get to an optimal solution with t = 0, we have a feasible solution for the original problem, from whence we can proceed with the original problem. If there some inequalities in the condition we transform them to equalities by introduc- ing slack variables, and then apply the simplex procedure. Once an optimal basic solution is obtained, a solution of the dual problem is easily obtained from the relation y* = At-t b. The dual simplex method utilizes the primary- dual relationship. Recently, L. G. Khachiyan [7J proposed a new linear-programming method that gives an optimal solution within predetermined ac- curacy of approximation in anum ber of steps bounded by a polynomial in the numbers of the variables and constraints, a property the simplex algorithm fails to have. His method is basicaIly an iterative procedure to find a solu- tion x satisfying a system of strict inequalities 
943 aix < hi, i= 1,..., m, where the b i and the com- ponents of the a i are all integers. Let x(O) be any vector and B(O) = KI, where K is a posi- tive constant defined in terms of the a i and the b i , and the sequence of vectors X(k) and matrices B(k) are defined by 1 B(k) X(k+l)=X(k)_ a i n + 1 J aB(k)a. ' I I n 2 ( 2 B(k)a.aB(k» ) B(k+l)__ B(k)_ I I - n 2 -1 n+ 1 a;B(k)a i ' when the vector X(k) violates the inequality aix < hi' and continue until a solution is obtained or the number of steps reaches some constant. In the latter case it can be proved that the system of inequalities does not have any feasi- ble solution. This algorithm can be applied to solve Problem II in the following way: Due to the duality theorem, the optimal solution of Problem II can be characterized as vectors satisfying b'y.s;c'x, Ax.s;b, -A'y.s; -c, xO, Y  O. Then the system is approximated by another system with integer coefficients, and then by a system of strict inequalities, and the above algorithm can be applied to this ap- proximated system. This method is based on a principle entirely different from the simplex method of computing successively the centers of ellipsoids of indefinitely decreasing size and containing a subset of feasible solutions, Although Khachiyan's method has a mathe- matically appealing property, in practical applications the simplex method and its vari- ations still seem to be the most efficient gen- eral method for the numerical solution of linear programming problems. Some special types of linear programming problems allow specific algorithm for solution, e.g., the transportation problem has the struc- ture: Minimize LiLjCijXij under the condition Ljxija;, Lixij.s;bj, xijO. This can be solved by anyone of several simple intuitive methods. Various types of linear programming problems are discussed as problems of maximizing flows on networks ( 281 Network Flow Problems). Also, problems with the further condition that some or all of the variables are integers can profitably be discussed separately ( 215 Integer Programming). D. Generalizations and Applications Linear programming in the sequence space (1)= {X = {xJ I L]:;" 1 Ix) < oo} was treated by P. C. Rosenbloom [12]. In this case, the require- ments for variables X are given in terms of tlinear functionals AiE(I)* = (m) (i = 1,2, ..., k), for example, as (i) equalities ).i(X) = c i or (i') 255 D Linear Programming inequalities Ai(X).s; c i , and (ii) x j  0 (\ij). The space (I) is supplied with the tweak topol- ogy as the conjugate space of (co) = {X = {xJ llimj", x j == O}. Let 3' be the set of all X satisfying (i) (or (i')) and (ii). Let }.(X) be a tweakly upper semicontinuous functional, and suppose that the AJX) are tweakly lower semicontinuous, 3' # 0, and },(X) is upper bounded. Then the maximal value of A(X) is attained at an extreme point of 3', and further- more, if the space is completely regular, then the solution is unique. If 3' is bounded, it is the convex hull of its extreme points, i.e" the smallest closed convex set containing its ex- treme points. By applying the theory in this section to the family of functions that can be expressed as f(x) = Ljl aj<pj(x) in terms of a given system of functions <Pj(x) (j = 1,2, ... ) on R, S, N, Bernshtein's approximation theory of function systems, the theory of absolutely monotonic functions, and several inequalities in the theory of functions of a complex vari- able can be treated in a unified fashion. If the theory is further extended to the case of tfinitely additive measures defined by means of linear functionals on the Banach space of bounded functions, we may treat the extremal problems of linear functionals on the func- tion space of all f (x) that can be expressed as ((X) = SsK(x,s)dJ1(s), and apply it to obtain the interpolation formula of nonnegative thar- monic functions, results of Caratheodory and Fejer on its Fourier coefficients, an analog of Harnack's theorem for the theat conduction equation, and so on. The extension of linear programming theory to linear topological spaces is due to L. Hur- wicz [5]. Let Y be a linear space, qlj, ::z be linear topological spaces, Py, Pz the nonnega- tivity cones of qy, :?Z, respectively, which are closed convex cones containing inner points, and D a convex set in Y. We assume that F and A are linear mappings from D into '!If and :2:, qlj = R, :?Z is tlocally convex, and Px, Pz are closed convex cones in !'[ and :?Z, respectively, and we consider the following. Problem L: Maximize F(X)=X*(X) under the condition G(X)= A(X)-BO, XO, BE:?Z. Put <I>(X,Z*)=X*(X)+Z*(A(X)-B), Ifwe define a linear mapping T from 11' =:f£' x R into Y=!L x 11' by T«X,p))=(A(X)-pB, (X,p)), where pER, then under the condition that the image under T* of the nonnegativity cone of "//'* is a tregularly convex set in 11'*, Xo is a solution of problem L if and only if there exists a Z6 E:?Z* such that (X o , Z) is a nonnegative saddle point of <1>. Under conditions expressed in simpler terms, Isii [6] proved the coincidence of the 
255 E Linear Programming supremum of the objective function and infz* SUPXED of the tLagrangian form, gave conditions for the supremum and the infimum to be attained, and developed a theory that generalizes the tChebyshev inequality and the tNeyman-Pearson fundamental lemma for testing statistical hypotheses. E. History The theory of linear programming is closely related to the theory of convex cones, convex polyhedra, and systems of linear inequalities. Concerning tpolyhedral convex cones, tconvex polyhedra in R n , and the algebraic theory of systems of linear inequalities, we have classical results due to P. Gordon (1873), J. Farkas [2J, E, Stiemke [13J, H. Weyl [15J, etc., and a later refinement due to A. W. Tucker [10, paper I]. The application of linear systems to eco- nomics was made possible through the works of J. von Neumann, especially his tgame the- ory and balanced linear growth model [14]. These and the interindustrial input-output analysis of W. Leontief [IIJ led to the works assembled in 1951 by T. C. Koopmans and others [9]. Concerning practical computa- tion and applications to industry, there are isolated and long-neglected works by L. V. Kantorovich [8]; however, the main part of the method was developed in the United States, especially after the discovery of the simplex method by G. B. Dantzig [I] and his followers. Linear programming has become one of the most important techniques in operations re- search, and, following upon the development of computers, has found wide application to practical problems. Most contemporary large- scale computers are equipped with programs to solve linear systems problems. References [IJ G. B. Dantzig, Linear programming and extension, Princeton Univ. Press, 1963. [2J J. Farkas, Ober die Theorie der einfachen Ungleichungen,1. Reine Angew. Math., 124 (1902),1-27. [3J S. I. Gass, Linear programming: Methods and applications, McGraw-Hill, second edi- tion, 1964. [4J A. Haar, Dber linear Ungleichungen, Acta Sci. Math. Szeged., 2 (1924),1-14. [5J L. Hurwicz, Programming in linear spaces, Studies in Linear and Non-Linear Program- ming, K. J. Arrow, L. Hurwicz, and H. Uzawa (eds.), Stanford Univ. Press, 1958,38-102. [6J K. Isii, Inequalities of the types of Cheby- 944 shev and Cramer-Rao and mathematical pro- gramming, Ann. Inst. Stat. Math., 16 (1964), 277-293. [7J L. G. Khachiyan, A polynomial algorithm in linear programming, Soviet Math. DokI., 20 (1979),191-194. (Original in Russian, 1979.) [8J L. V. Kantorovich, Mathematical methods of organization and planning production, Management Sci., 6 (1960), 366-422. (Original in Russian, 1904.) [9J T. C. Koopmans (ed.), Activity analysis of production and allocation, Wiley, 1951. [IOJ H. W. Kuhn and A. W. Tucker (eds.), Linear inequalities and related systems, Ann. Math. Studies, Princeton Univ. Press, 1956. [IIJ W. Leontief, The structure of American economy 1919-1939, Oxford Univ, Press, second edition, 1951. [12] P. C. Rosenbloom, Quelques classes de problemes extremaux, Bull. Soc. Math. France, 79 (1951), 1-58; 80 (1952), 183-215. [13J E. Stiemke, Ober positive Losungen homogener linearer Gleichungen, Math. Ann., 76 (1914-1915), 340-342. [14J 1. von Neumann, Ober ein okonomisches Gleichungssystem und eine Verallgemeinerung des Brouwerschen Fixpunktsatzes, Ergebnisse eines Math. Kolloquiums, 8 (1937), 73-83; English translation, A model of general eco- nomic equilibrium, Rev. Economic Studies, 13 (1945-1946), 1-9 (Collected works, Pergamon, 1963, vol. 6,29-37). [15J H. Weyl, Elementare Theorie der kon- vexen Polyeder, Comment. Math. Helv., 7 (1934-1935),290-306. [16] R. G. Bland, New finite pivoting rules for the simplex method, Core discussion papers 7612, Center for Operations Research and Economics, Univ. Catholique de Louvain, 1976. 256 (111.8) Linear Spaces A. Definition Suppose that we are given a set L and a tfield K satisfying the following two requirements: (i) Given an arbitrary pair (a, b) of elements in L, there exists a unique element a + b (called the sum of a, b) in L; (ii) given an arbitrary element ex in K and an arbitrary element a in L, there exists a unique element exa (called the scalar multiple of a by ex) in L. The set L is called a linear space over K (or vector space over K) if the following eight conditions are satisfied: (i) (a + b) + c = a + (b + c); (ii) there exists an ele- 
945 ment OE L, called the zero element of L, such that a + 0 =0 + a = a for all aE L; (iii) For any aE L, there exists an element x = -aE L satisfy- ing a+x=x+a=O; (iv) a+b=b+a; (v) a(a+ b)= aa+ ab; (vi)(afija = rx(fia); (vii) (a + fija= aa + fia; (viii) I a = a (where 1 is the tunity element of K). An element of K is called a scalar, and an element of L is called a vector. K is called the field of scalars (basic field or ground field) of the linear space L. In the definition of linear spaces, K can be noncommutative. L is also called a left linear space over K since the scalars act on L from the left (a-->aa, aE L, aE K). A right linear space is similarly defined. Actually, a left (right) linear space is a unitary left (right) K-module. If K is commutative, it is not necessary to specify left or right, since we can identify aa and aa. In this article we consider only linear spaces over commutative fields. A similar theory can be established for linear spaces over noncommutative fields ( 277 Modules). If K is the field of real numbers R or the field of complex numbers C, a linear space over K is called a real linear space or complex linear space, respectively. In the following discussion, we fix a field K, and by a linear space we mean a linear space over K. Examples. (1) Geometric vectors: In a tEu- clidean space or, more generally, an taffine --> space, the set of vectors PQ associated with points P, Q in the space forms a linear space. (2) n-tuples in K: K n denotes the set of all sequences (a I' ... , an) of n elements in a field K. Defining two operations by (a I, ... , a,,) + (fil, . .. , fin) = (a I + fib .. . , an + fin), A (a I , . .. , an) = (Aa I' '" , Aa n ) (), E K), the set K n forms a linear space over K. An element of K n is called an n- tuple in K, and a i is called the ith component of (rx l ,..., an)' In general, the inner product of a= (ai, ..., an), b = (#b ..., fin) is defined by (a, b) = L7=1 aifii' However, when K = C (complex number field), we usually define (a, b) to be L7=1 a}i' (3) Sequences in K: All infinite sequences in a field K form a linear space over K under the operations defined in example (2). (4) K-valued functions: Given a nonempty set I and a field K, let K I be the set of all K- valued functions defined on I. Defining two operations by (/ + g)(x) = f(x) + g(x), (}J)(x) = Af(x) (XEl,AEK), the set K 1 forms a linear space over K. In particular, if we put 1== {I, "', n}, then the space K 1 can be identified with the space of n-tuples given in example (2), and if we put 1= N (natural numbers), then we obtain the space given in example (3). Let K be the field R of real numbers and I an interval in R. The set C(I) of all continuous functions on 256C Linear Spaces I, the set D(I) of all differentiable functions on I, and the set A(I) of all treal analytic func- tions on I are all linear spaces contained in the space R I . (5) Polynomials in K: K [Xl' .,. ,XnJ denotes the set of all polynomials of n variables with coefficients in a field K. This forms a linear space under the usual operations. B. Linear Mappings Let L, M be linear spaces over a field K. A mapping <p from L to M is called a linear map- ping or linear operator if <p satisfies the follow- ing two conditions; (i) <pia + b) = <p(a) + <p(b); and (ii) <p(Aa)=A<p(a) (a, bEL, }.EK), Namely, a linear mapping is a K-homomorphism be- tween K-modules (277 Modules), Regard- ing K as a linear space, a linear mapping L --> K is called a linear form. A linear mapping from L to L is called a linear transformation of L. The identity mapping of L is a linear transformation. Given linear spaces L, M, N, and linear mappings <p:L-->M and if;:M-->N, the composite if; 0 <p: L --> N is also a linear mapping. If a linear mapping <p: L --> M is tbijective, then the inverse mapping <p-I: M--> L is also a linear mapping. Such a mapping <p is called an isomorphism, and we write L  M if there exists an isomorphism L --> M. A linear transformation L -->L is called regular (or nonsingular) if it is an isomorphism. Examples. (1) Let L be the linear space formed by all geometric vectors in a Euclidean space (affine space) E. Then a tmotion (taffine trans- formation) of E induces a linear transforma- tion L-->L. (2) Let (aij) be an m x n tmatrix in K. As- signing ('11' "','1m) to (I' ""n)' where '1i== Lj1 aijj (1 .s; i.s; m), we have a linear map- ping Kn-->K m . (3) Assigning the tderivative f' to a real- valued differentiable function f on an interval I, we have a linear mapping D(l)-->R 1 . C. Linear Combinations Let L be a linear space over a field K. An element of L of the form a l a l + ... + an an (rxiE K, aiE L) is called a linear combination of ai, "', an' A sequence ai' ..., an of elements in L is called linearly dependent if there exists a sequence ai, ..., an of elements in K such that not all the a i are equal to 0 and a l a l + ... + ana n = O. A sequence of elements in L is called linearly independent if it is not linearly depen- dent. Suppose that there exists a linearly in- dependent sequence of n elements in a linear 
256 D Linear Spaces space L, and no sequence of n + 1 elements in L is linearly independent. Then n is called the dimension of L and is denoted by dim L. If there exists such a number n, L is said to be finite-dimensional. Otherwise, L is said to be infinite-dimensional. In an infinite-dimensional linear space, there exist linearly independent sequences of elements having arbitrary length. The linear space K n of n-tuples in K is of di- mensIOn n. A sequence (ai' .." an) of elements in a linear space L is called a basis if every element a of L is uniquely written in the form a=cx l a l +... + cxna n (CXiE K, i = 1, ..., n). This means that the linear mapping K n --> L assigning cx I a I + ., . + cxnanELto (cx" ...,cxn)EKn is bijective and hence an isomorphism. The condition that (a I ,.,., an) is a basis of L is equivalent to any two of the following three conditions: (i) (a l ,.,., an) is linearly independent; (ii) every ele- ment of L is a linear com bination of a I, ... , an; (iii) L is of dimension n. It follows that the length n of a basis (a I, .,. , an) is equal to the dimension and hence is independent of the choice of basis. In the expression a = L cxiai, CX i is called the ith component (or ith coordinate) of the element a relative to the basis (a" ..., an)' D. Spaces of Linear Mappings (Finite- Dimensional Case) Let L, M, and N be finite-dimensional linear spaces over a field K. The set HomK(L, M) of all linear mappings L-->M is a linear space under the operations defined by (<p + <p')(a) = <p(a) + <p'(a), (ic<p)(a)=},<p(a) (aEL, AEK). Let (a I ,..., a/), (b"..., b m ) be bases of L, M, respectively. Then any linear mapping <p: L --> M can be represented by an m x I matrix (cxij) determined by <p(a) = Lil bicxij (1.s;j.s; I). This assignment <p-->(cxij) gives an isomorphism from the linear space HomK(L, M) to the linear space of all m x 1 matrices ( 269 Ma- trices). In addition, let (e I, ... , en) be a basis of N, and let the n x m matrix ([3k;) represent a linear mapping if;: M --> N. Then the composite mapping if; 0 <p: L--> N is represented by the product ([3ki)(CX i J of the matrices ([3k;) and (cxiJ. The set of all linear transformations of a linear space N of dimension n forms an tassociative algebra over K which is isomorphic to the ttotal matrix algebra Mn(K) of degree n under the correspondence <p-->(cxiJ Its tinvertible elements are regular linear transformations, and they form a group which is denoted by GL(N) and called the tgeneral linear group on N. This corresponds to the group GL(n, K) formed by all n x n tinvertible matrices under the isomorphism <p-->(cxij)' 946 E. Infinite-Dimensional Linear Spaces In this section, we consider only the algebraic aspects of infinite-dimensional linear spaces (for the topological aspects  422 Topological Abelian Groups L; 424 Topological Linear Spaces), Let {a} LEA be a family of elements in a linear space L. A linear combination of the family is an element of L in the form LAEA cxAa}, (CXAEK, where CX A =0 except for a finite number of ic), The family {a). LEA is called linearly in- dependent if no linear combination L),EA CXAaA is equal to 0 unless all the coefficients CX), are equal to O. The family {a) LEA is called a basis of L if every element of L is uniquely written in the form LAEA cxAa), These notions are gen- eralizations of those defined for finite A. In general, A is an infinite set. Any linear space L has a basis ( 34 Axiom of Choice and Equivalents C). The cardinality of a basis is determined by L. Two linear spaces are iso- morphic if and only if their bases have the same cardinality. F. Subspaces and Quotient Spaces Let L be a linear space over a field K. A non- empty subset N of L forms a linear space over K under the induced operations if the follow- ing two conditions hold: (i) a, bE N implies a+bEN; and (ii) },EK, aEN imply icaEN. In this case, the subset N is called a linear sub- space of L (or simply subspace of L). The canonical mapping <p: N --> L defined by <p(a) =a (aEN) is an injective linear mapping. Let S be a nonempty subset of L. The set of all linear combinations of elements in S forms the smallest subspace of L containing S; this space is called the subspace generated (or spanned) by S. For subspaces N, N' of L, the intersection N n N' and the sum N + N' = { a + a' I a E N, a' EN'} are both subspaces. Similar propositions hold for an arbitrary number of subspaces. If N, N' are of finite dimension, the equality dimN +dimN' =dim(N nN')+ dim(N + N') holds. We say that L is decom- posed into the direct sum of N, N' if every element of L can be uniquely written in the form a + a' (a EN, a' EN'). This is the case if and only if L is generated by Nand N' and N n N' = {O}, In this case, N' is called a com- plementary subspace of N. Any subspace has a complementary subspace. For direct products and sums of linear spaces  277 Modules F. An equivalence relation R in a linear space L is said to be compatible with the operations in L if the following two conditions hold: (i) R(a,a') and R(b, b') imply R(a+b,a' + b'); (ii) R(a, a') implies R(),a, ica') (},E K). Then the tquotient set L/R, namely, the set of all 
947 equivalence classes, forms a linear space over K under the induced operations; this is called the quotient linear space (or simply quotient space) of L with respect to R. The canonical mapping <p: L--> L/R, given by aE <p(a) (aE L), is a surjective linear mapping. The equivalence class N containing 0 forms a subspace of L, and the equivalence class containing a E L is the coset a + N = {a +b IbEN}. We have R(a, a') if and only if a-a' EN. Conversely, for any subspace N, there exists an equivalence relation R compatible with the operations determined by R(a, a') if and only if a - a' EN. The quotient linear space L/ R thus obtained is denoted by L/N and called the quotient (linear) space of L by N. If L/N is finite-dimensional, its dimension is called the codimension of N relative to L and is denoted by codim N. Let <p: L-->M be a linear mapping of linear spaces. Its image <p(L) is a subspace of M, and the kernel N = {aE L I <p(a) = O} is a subspace of L. The mapping <p induces an isomorphism <p: L/ N --><p(L) ( 277 Modules E). If Lis finite- dimensional, dim L-dimN =dim<p(L). The dimension of the image of <p is called the rank of <p, and the dimension of the kernel of <p is called the nullity of <po The rank and nullity of an m x n matrix (ex ij ) ( 269 Matrices) are the respective rank and nullity of the linear map- ping Kn-->Km represented by the matrix (ex u ) ( Section B, example (2)). G. Dual Spaces Let L be a linear space over a field K. The set HomK(L, K) of all linear forms on L is a linear space, denoted by L * and called the dual (linear) space of L. The space L* is the tdual module of L as a K-module ( 277 Modules). For elements a of Land a* of L*, we denote the element a*(a) by (a,a*) and call it the inner product of a and a* For a linear map- ping <p: L--> M, we define a linear mapping t<p: M*-->L* by ('<p)(b*)=b*o<p (b*EM*). The mapping '<p is called the dual mapping (trans- posed mapping or transpose) of <p, and is deter- mined by the relation (a, '<p(b*) = (<p(a), b*) (aEL, b*EM*). We havet(<pI +<P2)='<PJ +'<Pz, t(if; 0 <p) = '<p o 'if;, tl L = 1L" If <p is tsurjective, then '<p is tinjective, and if <p is injective, then t<p is surjective. If <p is bijective, then '<p is also bijective. The rank of t<p coincides with the rank of <p if the rank of rp is finite. For an isomorphism <p:L-->M, the inverse mapping '(p-l =rfJ:L*-->M* oft<p is called the contragre- dient of <po We have (if; 0 <pr= J; 0 rfJ, Given a subspace N of a linear space L, the subspace {a*cL*I(a,a*)==O(acN)} of L* is denoted by N'c and is called the subspace (of L*) orthogonal to N. Then we have the canon- 256 H Linear Spaces ical isomorphisms (L/N)*N'c, N* L*/N'c. Similarly, given a subspace N' of L*, we obtain the subspace N''c of L orthogonal to N':N''c = {aE LI (a, a*) =0 (a*E N')}. Thus we have a one-to-one correspondence between the finite-codimensional subspaces of L and the finite-dimensional subspaces of L* by assign- ing N'c to Nand N''c to N'. The codimension of N is equal to the dimension of N'c. If Lis finite-dimensional, we have a canonical iso- morphism L(L*)* and a one-to-one corre- spondence N-->N'=N'c between the set {N} of all subspaces of L and the set {N'} of all sub- spaces of L*. These properties of correspon- dence between the su bspaces of Land L * are called duality properties of the linear space. Let (e J, .., , en) be a basis of a linear space L. Then the system of elements (ej, ..., e:) in L * defined by the relation (e j , en = 0 (i # j), (e;, en = 1 forms a basis of L*, called the dual basis of (e l , ..., en)' Thus a finite-dimensional linear space L can be identified with its dual space utilizing the isomorphism given by as- signing each element of the dual basis to the corresponding element of the basis in a natural manner. H. Multilinear Mappings Let L, M, N be linear spaces over a field K and f be a mapping from the Cartesian prod- uct M x N to L. Suppose that for any fixed bE N, the mapping M -->L assigning j(x, b)E L to x EM is linear, and for any fixed aE M, the mapping N -->L assigning {(a,Y)E L to YEN is also linear. Then f is called a bilinear mapping from M x N to L. The set of all bilinear map- pings from M x N to L forms a linear space under the operations (I + g) (x, y) = f(x, y) + g(x,y). (Af)(X, y) = Af(x, y) (AEK); this space is denoted by 2'(M, N; L). In general, for linear spaces MJ, ..., M n , a mapping f:M l x ... x M n --> L is called a multilinear mapping if it is linear in each variable. The set of all multilinear mappings from M l x... x M n to Lforms a linear space, denoted by 2'(Ml' ...,Mn;L).1f L = K, a bilinear mapping and a multilinear mapping are called a bilinear form and multi- linear form, respectively. Suppose, in particular, that M l = ,., = M n = M. A multilinear mapping f: MJ x ... x M n --> L is called symmetric if f(x U (1)' ..., xu(n») = f(x J , ... , x n ) (Xi E M) for any permutation 0" of {I, ,.., n}. Also, f is called alternating if f(x I' ... , Xi' ... , Xj, ... , X n ) = 0 for Xi = Xj' i # j. In this case, j (X u (ll' .,., xu(n») = sgn 0"' f(x I, ..., x n ) for any permutation 0" (sgn (J is + I if (J is teven and -1 if 0" is todd). On the other hand, r is called skew-symmetric (or anti symmetric) if it satisfies this equality. Therefore, if the charac- 
256 I Linear Spaces teristic of the field K is different from 2, a skew-symmetric mapping is alternating. Let M, N be linear spaces over a field K and <I> be a bilinear form on M x N. The mappings dq,: N -->M*, Sq,: M --> N* defined by <I>(x, y) = (dq,(y))(x) = (sq,(x)(y) (XE M, YE N) are linear mappings. If M, N are finite-dimensional, then dq, and Sq, have the same rank, called the rank of <1>, Let (x I' ..., x m ), (YI, ..., Yn) be bases of M, Nand (xf,..., x), (Yf,... ,Y:) be their dual bases, Then we have dq,(Y) = Ll xt<l>(x j , Yj), sq,(x,) = L'i=l <I>(x j , Y)Yt The matrix (<I>(x i , Y) is called the matrix of a bilinear form <I> relative to the given bases, and its rank coincides with the rank of <1>. If dq" Sq, are both injective, they are also bijec- tive, and in this case <I> is said to be nondegen- erate. Then each of dq, and Sq, can be regarded as the transpose of the other, and M, N can be identified with N*, M*, respectively. In particular, if <I> is a nondegenerate bilinear form on M x M, we have an isomorphism from M to its dual space M*; identifying M with M* by this isomorphism, M is said to be self-dual. Let M be a linear space over a field K. A mapping Q: M --> K is called a quadratic form on M if the following two conditions hold: (i) Q(exx) =ex 2 Q(x) (exEK, XE M); and (ii) the map- ping <1>: M x M --> K defined by <I>(x, y) = Q(x + y) - Q(x) - Q(y) (x, Y E M) is a bilinear form on M x M. In this case, <I> is called the bilinear form associated witb tbe quadratic form Q, and it can be shown to be symmetric. We have <I>(x,x)==2Q(x) (X EM) and Q(x)=(lj2)<I>(x,x) if the characteristic of K # 2. In general, for any bilinear form f: M x M -->K, the mapping Q: M --> K defined by Q(x) = f(x, x) is a quadratic form. If (x I' ..., x n ) is a basis of M, a quadratic form Q is expressed as follows: Q(L?1 (iX;) = L{i,j} exijij (the sum over all unordered pairs {i,j}), where ex;;=Q(x,), exij=<I>(xj,x j ) (i # j) ( 348 Quadratic Forms), A metric vector space is a linear space M supplied with a nondegenerate quadratic form Q on M, and is denoted by (M, Q). The bilinear form <I> associated with Q gives an inner product <I>(x, y) (x,YEM). I. Tensor Products Let M, N be linear spaces over a field K. The tensor product M @ N of M, N is defined as follows and can be used to "linearize" bilinear mappings from M x N to any linear space. Let F be the linear space generated by M x Nand R be the subspace of F generated by all ele- ments of the forms (x+x',y)(x,y)-(x',y), (x, Y + y') - (x, y) - (x,y'), (ex x, y) - ex(x, y), (x, exy) -ex(x,y) (x,x'EM, y,y'EN, exEK). Then the 948 quotient space F j R is denoted by M @ N, and the canonical projection F -->M @ N is de- noted by Ij;. Given an element (X,Y)E M x N, we denote the image Ij;( (x, y)) by x @ y. The bilinear mapping M x N --> M @ N assigning x @ y to (x, y) is called the canonical bilinear mapping. We have, by definition, (x +x') @ y =x@y+x'@ y, x@(y+ y')=x @ y+x @ y', (exx)@ y=ex(x@ y)=x@(exy) (exEK), To em- phasize the basic field K, we sometimes write M @KN instead of M @ N. The tensor product can be characterized by the property that for any linear space Land any bilinear mapping f: M x N --> L, there exists a unique linear mapping <p: M @ N --> L satisfying f(x, y) = <p(x @ y). Thus assigning the bilinear mapping f: M x N --> L defined by f(x, y) = <p(x @ y) to a linear mapping <p: M @ N --> L, we obtain an isomorphism Hom(M @ N, L);;:: 2'(M, N; L). Every element of M @ N can be expressed as a finite sum of elements of the form x @ y (XE M, YE N). If {XdiEl' {yjLEJ are bases of M, N, respectively, then the family {Xi @ yj LE/,jEJ forms a basis of M @ N. Hence if M and N are of finite dimen- sion, dim(M @ N)=dimMdimN, Let M l , M2' ... be linear spaces over a field K. We have a unique isomorphism M l @ M 2 -->M 2 @ M l that assigns X2 @ Xl to Xl @ x 2 (XiEMJ We also have a unique isomorphism (M I @ M2)@ M3-->MI @(M 2 @ M 3 ) that as- signs Xl @(x 2 @ x 3 ) to (Xl @x 2 )@ x 3 (XiEM;); hence we can identify (M l @ M 2 ) @ M3 and M j @ (M 2 @ M 3 ), and we denote them sim- ply by M l @ M 2 @ M3' In general, assigning Xl @... @x n to (X" ...,x n ), we obtain the canonical multilinear mapping M l x... x M n -->M l @... @ M n . As before, given any linear space L, we have the natural isomorphism Hom(M I @... @ Mn,L)2'(Ml'"'' Mn;L). Conversely, given linear spaces M j ,..., Mn' the space M l @ ... @ M n cn be characterized as a linear space N with a given multilinear map- ping Ij;:M l x... x Mn-->N such that (i) N is generated by the image Ij;(M I x ... x M n ); and (ii) for any multilinear mapping f: M l x '" x M n --> L, there exists a unique linear mapping j': N --> L satisfying j = j' 0 Ij;. The tensor prod- uct M l @ .., @ M n is sometimes written as @? I M i , and an element X I @ ... @ x n is writ- ten as @?l Xi' Given linear mappings /;: M i --> M (1 .s; i.s; n), thcre exists a unique linear mapping f: M l @... @ Mn-->M'l @... @ M satisfying f(x l @ '" @ x n )= fl(XtJ@ ... @ fn(x n ); we denote the mapping f by fl @ ... @ j or @71.{; and call it the tensor product of the j; (1 .s; i.s; n). The assignment (/1 , .. . , in) --> fl @ ... @ fn gives an isomorphism (8)1=1 Hom(M;, M;)--> Hom(@?=l M i , @Il M;) if the M i are finite- dimensional. If in particular M'l = ... = M = 
949 K, we have an isomorphism @7=1 Mt--> (@71 M;)* under the identification @7=1 M; = K given by the assignment X'I @ '" @ x--> X'I .., x. Explicitly, the isomorphism f: @71 Mt-->(@71 M;)* is determined by f( @71 xtH @71 Xi) = I171 <Xi, xt) (xt E Mt, XiEMJ J. Tensors Let E(A) (I .s; A.s; k) be linear spaces over a field K. If E(l)=,.. = E(k) = E, then @11 E(A) is written @kE and called the tensor space of degree k of E (@oE denotes K). Also, (@PE) @ (@qE*) is written T:(E), where E* is the dual space of E. We have T6'(E) = @PE, o(E)= @ qE*, and Too (E) = K. T:(E) is called the tensor space of type (p, q) of E, and each of its elements is called a tensor of type (p, q). In particular, a tensor of type (p, 0) is called a contravariant tensor of degree p, and a tensor of type (0, q) is called a covariant tensor of degree q, A tensor of type (0,0) is a scalar. An element of TOI (E) = E is called a contravariant vector, and an element of T?( E) = E* is called a covariant vector. If p # 0, q #0, a tensor of type (p, q) is called a mixed tensor. Let (eJ,"', en) be a basis of E and (/1, ... ,In) be the basis of E* dual to (e l' ... ,en)' Then the tensors e i1 @ ." @ ei p @fh @ ... @Fq (i A' j = 1, ... ,n; ..1. = 1, ... , p; J1 = 1, ... , q) form a basis of P(E). Therefore any tensor of type (p, q) can be written uniquely in the form t=LJ:' /.ei1@...@eip@fj1@...@fjq. Also, ;: .: is called the component of t relative to the basis (e I' . .. , en), the index i A is called a contravariant index, and the indexj is called a covariant index. Let (e I' ..., en) be another basis of E and (JI, ... In) be its dual basis. Suppose that we have n - L j e.= a.e.  t J' j1 n Ji= L [3JF j1 Then we have n " i k <i L. [3k a j=Uj k1 and the transformation formula ;;i 1 .i p =" "[3i1...[3ipal.'...alqk1' kp h .Jq L.. L.. k 1 kp h J q It...lq , kt. ,kp ll' .lq where the .'.:iP are the components of t rela- tive to (e l , ..., en) and the i.'. ip are the com p o- h ..Jq nents of t relative to (e l , "', en)' In the tensor calculus, an index appearing after the symbol L is called a dummy index if it appears in both the upper and the lower positions. For example, in the expression 256 K Linear Spaces L71 iyi, the index i is a dummy. As a conven- tion, we sometimes omit the symbol L7=1 for a dummy index i; for example, by iyi we mean the sum L7=1 iyi. This convention is called Einstein's convention. Using it, we write the previous transformation formula as ;;1' ip={3il...{3ip(Xl...(Xqkl' k p . it lq k 1 kp it Jq It...lq We have a nondegenerate bilinear form <1> on P(E) x Tpq(E) determined by ( p q q P ) <1> Xi@YJ'Yj@xt P q = IT <Xi,Xt) IT <Yj,yn. i=1 )=1 Thus the space P(E) can be identified with the dual space of Tpq( E), and vice versa ( Section H), In this identification, the basis (e i1 @... @e ip @f j1 @... @fjq) ofP(E) and the basis (e h @... @e jq @fi 1 @... @fi p ) of Tpq(E) are dual to each other. In addition, combining the natural isomorphism Tpq(E)*;::; 2'(I1qE, I1PE*; K) with the duality Tpq(E)* = P(E), we have a natural isomorphism T:(E) -->2'(I1qE,I1PE*; K), Explicitly, identifying an element tE P(E) with the multilinear form I1qE x I1PE*-->K corresponding to it under the natural isomorphism, we have t ( x X Y * Y * ) - ri1- i p ;tj1 ;tjqn I p h .." q' l' ..., P -Sjl...jql ...Sq 'lit ...17i p ' where gn is the component of XAEE= TOI(E), {I'/r} is the component of y*EE*= T?(E), and . .  {(;:::j:} is the component of tE T:(E). By the natural isomorphisms T6'(E);::; 2'(I1PE*, K), TpO(E) = 2'(I1PE, K), a contravariant tensor of degree p and a covariant tensor of degree p can be identified with a multilinear p-form on E* and on E, respectively. K. Tensor Algebras There exists a unique bilinear mapping P(E) x 7;'(E)--> :r(E) which assigns the element XI @... @x p @ Yl @... @y,@xf@... @ x: @ yf @... @ y: to the pair (XI @ ,,, @ x p @ xj @ ... @x:, Yl @ ." @ y,@ Yt @ .., @ yn we denote the element assigned to the latter by t@u, where t=x l @... @ xp@xt @... @x:' u=YI @... @ y,@ yt @... @ y:" and call it the product of t and u. If the components of t, u, t @ u are { i[ ip } { nk1 k, } 11 Jq' 'Il t , l ' {,:. :;::}, then we have (it ipkt k r _ il" ip k 1 k r jl" jqlt . is - J 1 j/ 7l t is' Let T(E) be the direct sum of P(E) (p,q == 0, 1,2, ...). Then T( E) is an associative algebra over K whose product is a natural 
256L Linear Spaces extension of the product @. We call T(E) the tensor algebra on E. The direct sum of T6'(E) (p=O, 1,2, ...) forms a subalgebra of T(E), also called the (contravariant) tensor algebra. L. Contractions The contraction of T:(E) relative to the kth contravariant index and the lth covariant index is by definition the linear mapping ct: T:(E)--> T:_I (E) determined by assigning <Xk>xi)x I @... @X k - I @Xk+1 @... @ xp@xf @... @Xt-I @Xt+I'" @x; to XI @... @x p @ xf @ ... @ x;, where <Xk, xi) is the inner prod uct of Xk> xt. For a tensor t of type (p, q), the tensor ct(t) of type (p-1, q-I) is called the contracted tensor of t, If the components of t are  } i1 ip, the components of ct(t) are 1 }q given by n 171 ip-l = '" 1 ..i. k - 1 si. k .. p- J. h }q-l L.. J1 )1-l S jl Jq-l .\::::1 M. Tensor Representations For a linear mapping f: E -->F, the tensor productf@... @f:T6'(E)= @PE-->@PF = T6'(F) is denoted by fP. The fP (p= 0, 1,2, ...) give an algebra homomorphism L:o T6'(E)--> L:o T6'(F). Next, let f be an isomorphism and j = 'f-I be its contragre- dient. Then fq denotes the tensor product j@... @]:TqO(E)= @qE*-->@qF*=TqO(F), and f/ the tensor product f P @ fq: T,t( E) --> T:(F). The mapping J: is an isomorphism, and the system {f/} (p, q = 0, 1,2, ... ) gives rise to an algebra isomorphism T(E)--> T(F). In particular, if.r is a nonsingular linear trans- formation of E, then f: is a nonsingular linear transformation of the linear space T:(E), and the assignment f --> f: gives a group homo- morphism GL(E)-->GL(T:(E)); this homomor- phism is called a tensor representation of the group GL(E). N. Symmetric and Alternating Tensors A contravariant tensor of degree p is called symmetric (alternating) if the corresponding multilinear p-form, under the natural isomor- phism T6'(E);::; 2'(I1PE*, K), is symmetric (alternating), A covariant tensor is also called symmetric (alternating) if the corresponding multilinear form is symmetric (alternating). A skew-symmetric (or antisymmetric) tensor is defined similarly. We reformulate these defi- nitions under the assumption that the field K is not of characteristic 2, Let p be the group of all permutations of 1, .,., p (the tsymmetric group of degree p), For any O"E 6 p , we have 950 a unique linear transformation T6'(E)-->T6'(E) assigning xu-1(1) @ ... @ XU-1(P) to x I @ ... @ xp' This transformation is nonsingular and is also denoted by 0". Similarly, we have a unique nonsingular linear transformation of Tpo(E), also denoted by 0". An element tE T6'(E) (or E Tpo(E)) is symmetric if and only if O"t = t for all 0" E 6 p , while t is alternating if and only if O"t = (sgn O")t for all O"E 6 p' Let the i1 ip (or the i1 . i) be the components of t. Then t is symmetric (alternating) if and only if the components are symmetric (alternating) relative to permutations of the indices ii' ." , ip. The linear transformation Sp = L UE 6 p 0" of T6'(E) or Tpo(E) is called the symmetrizer, and A p = L UE 6 p (sgn 0")0" is called the alternizer. For any t, Spt is a symmetric tensor, and Apt is an alternating tensor. The subspace of T6'(E) consisting of all symmetric (or alternating) tensors is invar- iant under the transformation R: G L(E)--> GL(T6'(E)), the tensor representation where <p(t) = R(<p)(t) for <pE GL(E) and tE Td'(E). O. Exterior Product For simplicity, we assume that the basic field K is of characteristic O. We denote by N p the kernel of the alternizer Ap: Td'(E)--> T6'(E), namely, the subspace consisting of all t satisfy- ing Apt=O, and by I\PE the quotient space T6'(E)/N p ' The image of XI @... @ x p (XiEE) under the natural mapping T6'(E)-->I\PE is denoted by XI/\'" /\ x p and is called the ex- terior product of X I, . , , , x p' The linear space I\PE is called the p-fold exterior power of E. We have X I /\ .../\(xi+x;)/\ .../\x p = X I /\ ... /\ Xi /\ '" /\ X p + x I /\ ... /\ xi /\ ... /\ X p' X l /\ .../\(exx i )/\ .../\x p =ex(X I /\ .../\X i /\ .../\x p ), exEK, and for every O"E 6 p , X U (1) /\ ... /\ XU(P) = (sgn 0") X I /\... /\ xp' Ap induces a natural isomorphism I\PE;::; P, where p consists of all contravariant alternating tensors of degree p. Thus an ele- ment of I\PE can be identified with a con- travariant alternating tensor of degree p. Then we have Ap(x I @... @xp)=x I /\ ",/\x p ' Similarly, I\P E* is identified with the linear space consisting of all covariant alternating tensors of degree p. An element of I\PE is sometimes called a p-vector, and an element of I\P E* is called a p-covector ( 90 Coordi- nates B). If (el' "', en) is a basis of E, then the ei 1 /\.. ./\ei p (il < i 2 <... < ip) form a basis of I\PE, and any element of I\PE is written in 
951 theformt=Li 1 <. < ip ex i1 ipei1/\.../\eiport =(1/P!)Li 1 , ,ipexi1 i pei1 /\ .../\e ip ' In the latter form, ail' ip is alternating relative to permuta- tions of ii' ..., ip, and it is the component of t. The dimension of I\PE is equal to G} and I\P E = 0 if P > n. If (/1, ... ,In) is the dual basis of (e 1, ..., en), the inner product of an element t =Li 1 <...<ipexi1' i pei1 /\." /\e i EI\PE and an e1e- . p . ment S = Li 1 <... < ip fJi 1 " i p f'l/\ ... /\ f'p E I\P E* is defined by <s,t)= L ex i1 iPfJi1 ,ip' i 1 < <i p Then we have <x 1 /\ ,,,/\X p 'Yl/\ .../\ Y p ) =det«xi,y)), where <xi,Y) is the inner product of XiEE and YjEE*. By this inner product, we can identify I\PE* with the dual space of I\PE. The tensor product @PE x @qE-->@p+qE induces naturally a mapping <1>: @PE/N p x @qE/Nq-->@p+qE/N p + q . Using this bilinear mapping <1>: I\P Ex 1\ q E --> I\p+q E, we define the exterior product t /\ s of an element t E I\P E and an element s E 1\ q E by t /\ S = <I>(t, s). Then t /\ s is an element of I\p+q E, and we have t /\ S = (-I) pq s/\t, (x i /\ .../\xp)/\(x p + l /\ .../\x p + q )= x 1 /\", /\x p + q ' We denote by I\E the direct sum of I\PE (p = 0,1,2, ''', n), and define the product of two elements x== L;=o xP, Y = L;o yp (x p , yPE I\PE) by X/\ Y=L;+FOXP/\ y q . Then the product /\ satisfies the associative law. We call1\E the exterior algebra (or Grassmann algebra) of the linear space E. If E is of dimension n, I\E is of dimension 2 n . If (e 1 , ..., en) is a basis of E relative to K, I\E is sometimes written as I\K(e 1 , "', en)' The exterior algebra I\E* of the dual space E* is similarly defined and can be considered as the dual space of I\E. P. Semilinear Mappings Let L be a linear space over a field K and L' be a linear space over a field K'. A pair (<p, p) consisting of a mapping <p: L--> L' and a map- ping p: K --> K' is called a semiIine-dr mapping if the following four conditions hold (for conve- nience here we write the scalars to the right of the vectors): (i) <p(a + b) = <p(a) + <p(b); (ii) <p(aA) = <p(a)p(A); (iii) p(ex + {3) = p(ex) + p(fJ); and (iv) p(rx{3) = p(rx)p(fl) (a,bEL; A,rx,fJEK). In this case, <p is sometimes said to be semilinear relative to p ( 277 Modules L). Conditions (iii) and (iv) mean that p is a field homomor- phism. If K = K' and p is the identity, then the semilinear mapping <p: L--> L' is a linear map- ping. If L = L', K == K' (and p is an automor- phism), <p is called a semilinear transformation relative to p. 256Q Linear Spaces For semilinear mappings (<p, p): L--> L', K--> K' and (<p', p'): L' -->L", K' -->K", where L" is a linear space over K", the composite (<p' 0 <p, p' 0 p) is also a semilinear mapping. If a basis (e l' .., , en) for Lover K and a basis (e'l, ..., e,) for L' over K' are given, a semi- linear mapping (<p, p): L-->L', K -->K' deter- mines a matrix (ex i ) by the relation <p(e) = L?1 e;exij (1 .s;j.s; n). Conversely, a homomor- phism p and an n' x n matrix A = (exij) deter- mine a semilinear mapping <p by this relation. Hence for fixed bases a semilinear mapping is represented by a pair (A, p), where A is an n' x n matrix. If a semilinear mapping <p' relative to p' is represented by (A', p'), the composite (<p' 0 <p, p' 0 p) is represented by (A'AP', p' 0 p), where AP' is the matrix (p'(exij»' Let <p: L--> L be a semilinear transformation relative to an automorphism p: K --> K. Sup- pose that <p is represented by (A, p) relative to a basis (e I' ..., en) for L and by (B, p) relative to another basis (/1' .. . , fn)' If we define a matrix P=(Pij) by the relation jj=L?1 eiPij (1.s;j.s;n), we have B= p- l ApP. Two pairs (A,p), (B, p) having the relation B = p-l App are said to be similar. Q. Sesquilinear Forms Let K be a field (not necessarily commutative) and J be its tantiautomorphism. For left linear spaces M, N over K, a mapping <1>: M x N --> K is called a (right) sesquilinear form relative to J if the following four conditions are satisfied: (i) <I>(x + x', y) = <I>(x, y) + <I>(x', y); (ii) <I>(x, y + y') == <I>(x, y) + <I>(x, y'); (iii) <I>(exx, y) == ex<l>(x, y); and (iv) <I>(x, exy) = <I>(x, y)ex J (x, x' EM; y, y' E N; ex E K). If J is the identity automorphism, then K is necessarily commutative and <I> is a bilinear form ( Section H). As an example of K and J, we may take K as the field of com- plex numbers and J as complex conjugation. In general, for a left linear space E over K, we denote by E J the right linear space with the scalar multiplication xA = A r 1 x (x E E, A E K). Then condition (iv) becomes (iv') <I>(x, ya) = <I>(x, y)ex; and if K is commutative, <I> is a bilinear form on M x N J . For a sesquilinear form <I> on M x N, we have the linear map- pings dq,:NJ-->M*, sq,:M r' -->N* defined by the relation <I>(x, y) = <x, dq,(y) = <y, sq,(x)J (xEM,YEN), If M, N are finite-dimensional, dq" Sq, have the same rank, which is called the rank of <1>. We assume that all linear spaces are finite-dimensional. Let (Xl' ...,X m ), (YI' ...,Yn) be bases of M, N and (xL. .. , x), (yf, ... , y:) be their dual bases. Then we have dq,(yJ= L1 xt<l>(Xj, y), sq,(x j ) = LJ=1 yj<l>(x j ,y)J-l. The matrix (<I>(xj,y)) is called the matrix of the sesquilinear form 
256 Ref. Linear Spaces <I> relative to the given bases; its rank is equal to the rank of <1>. If d<f>' S<f> are both injective (therefore bijective), <I> is said to be nondegen- erate. Let <1>': M' x N' --> K be another sesqui- linear form relative to J. Then for any linear mapping u: M --> M', there exists a unique linear mapping u*: N' -->N such that <I>'(u(x), y') = <I>(x, u*(y')) (x E M, y' EN'); this is called the left-adjoint linear mapping of u. Similarly, for any linear mapping v: N --> N', there exists a unique linear mapping v*: M' --> M such that <I>'(x', v( y)) = <I>(v*(x'), y) (x' EM', YEN); this is called the right-adjoint linear mapping of v. We have u* = d;1 0 'u 0 d<f>" v* = s; I 0 'v 0 S<f>" In particular, let u, v be isomor- phisms. Then we have <I>(x, y) == <I>'(u(x), v(y)) (x EM, YEN) if and only ifu- I =v*, V-I ==u*. A sesquilinear form on M x M is called simply a sesquilinear form on M. Let J be an tinvolution (namely J =J- I ), and write A J = A (.icE K), If condition (v) <I>(x, y) = <I>(y, x) (x,YEM) holds, <I> is called a Hermitian form on M. On the other hand, if the condition (v') <I>(x, y) = - <I>(y, x) holds, <I> is called an anti- Hermitian form (or skew-Hermitian form) on M. In particular, if J = l K , then a Hermitian form (anti-Hermitian form) is a symmetric bilinear form (antisymmetric bilinear form). A linear space M supplied with a nondegenerate Hermitian form <I> is called a Hermitian linear space, and <I>(x, y) is called the Hermitian inner product (or simply inner product) of x, yEM. References [1] O. Schreier and E. Sperner, Einfiihrung in die analytische Geometrie und Algebra I, II, Vandenhoeck & Ruprecht, fourth edition, 1959; English translation, Introduction to modern algebra and matrix theory, Chelsea, 1959. [2] N. Bourbaki, Elements de mathematique, Algebre, ch. 2, 3, 7, 8, 9, Actualites Sci. Ind. 1236b, 1044, 1179a, 1261a, 1272a, Hermann, 1958-1964. [3] 1. M. Gel'fand, Lectures on linear algebra, Interscience, 1961. (Original in Russian, 1951.) [4] N. Jacobson, Lectures in abstract algebra II, Van Nostrand, 1953. [5] C. Chevalley, Fundamental concepts of algebra, Academic Press, 1956. [6] W, H. Graeub, Lineare Algebra, Springer, 1958. [7] P. R. Halmos, Finite-dimensional vector spaces, Van Nostrand, second edition, 1958. [8] R. Godement, Cours d'algebre, Hermann, 1963. [9J D. A. RaIkov, Vector spaces, Noordhoff, 1965. (Original in Russian, 1962.) [10] S. Lang, Algebra, Addison-Wesley, 1965. 952 257 (V.17) Local Fields A. General Remarks A field k that is tcomplete with respect to a tdiscrete valuation is called a local field if its field of residue classes is finite. (Real and com- plex number fields are sometimes also called local fields; these, however, are not considered in this article.) A local field k is isomorphic either to the tcompletion with respect to a tp_ adic valuation determined by a prime ideal p of a number field of finite degree or to the field of tformal power series of one variable over a finite field. In the former case, k is caIled a p- adic number field. We let 0 stand for the tvalu- ation ring of k, p stand for the tvaluation ideal of k, p stand for the tcharacteristic of the field o/p ofresidue classes, and N(p) stand for the number of the elements of o/p. An tadditive valuation of k whose set of values coincides with the set of all rational integers, is denoted by ordex (ex Ek); here we understand ordO= 00, The tnormal (multiplicative) valuation of k is defined by lexl = (N(pJr°rd' ( 439 Valuations). B. Construction of Local Fields A p-adic number field k is an extension of finite degree of the p-adic field Qp. If n = [k:QpJ =ef, where e is the tramification in- dex of k/Qp and f is the trelative degree of k/Qp ( Section D), then there exists one and only one field F such that k =:> F=:> Qp, [k: FJ == e, [F:QpJ = f, and F/Qp is tunramified. The field K of residue classes of F is isomorphic to GF(p/), and F is uniquely determined by K by means of Witt vectors ( 449 Witt Vectors). Every residue class (#0) of 0F/PF  K contains one and only one mth root of unity (m is a divisor of pi - 1), and F is obtained by adjom- ing to Qp a primitive (pi -1)th root. Then k is a totally ramified extension of F and is ob- tained by adjoining to F a root of an tEisen- stein polynomial ( 337 Polynomials F). C. The Topology of k Taking pm (m=O, 1,2,...) as a tbase for a neighborhood system of 0, k becomes a tlo- cally compact ttotally disconnected ttopolog- ical field, and pm (m = 0,1,2,...) are compact subgroups of the additive group k. The multi- plicative group k x of nonzero elements of k is a locally compact tAbelian group, and the u(m) ={exEolex= 1 (mod pm)} (m=O, 1,2'00') form a base for the neighborhood system of 1. The 
953 tcharacter group in the sense of Pontryagin of the additive group k is isomorphic to k. This isomorphism is obtained by the following natural correspondence: For a p-adic field k, denote by <p the composition of the natural mapping ofQp onto Qp/Zp (Z[I/pJ/Z c Q/Z) and the ttrace Tr from k to Qp, and put Xx(Y) =exp(2nJ=l <p(xy)) (YEk); for the field k of the formal power series over a finite field K, put XAy) = i/;(xy) (YE k) with i/;(ex) = exp(2nJ=l. Tr(Resex)/p) (exEk), where Res ex is the residue of ex E k and Tr is the trace from K to Z/pZ. Then in both cases Xx is a character of k, and X-->Xx gives an isomorphism between k and the character group. D. Ramification Theory A valuation v of k has a unique tprolongation to an extension K of finite degree over k (we denote the prolongation also by v). K is com- plete under the valuation and is therefore a local field. Denoting by f and K the field of residue classes of K and k, respectively, we call [f: KJ = f the relative degree of K/k, and e= [v(K X): v(k')J the ramification index of K/k. Then we have the equality [K:kJ=ef If e= 1, we call K/k an unramified extension. An unramified extension K/k is tnormal, and its tGalois group is a cyclic group gen- erated by the Frobenius automorphism, i.e., the element 0" of the Galois group of K/k such that exu=exN(P) (modp) for any element ex in the valuation ring of K. For a given natural num- ber f, there exists one and only one unramified extension of degree f over k in an algebraic closure of k. Let K/k be a normal extension of finite degree with Galois group G, let II be a tprime element of K, that is, a generator of the tvalu- ation ideall.p of K, and put V(i)={O"EGIII u = II (mod I.pi+l)}. Then V(i) is independent of the choice of II. We call V(O) the inertia group and V(i) the ith ramification group. Then V(i) is normal in G, [G: V(O)J =,f, [V(O): 1] =e, and V(1) is the p-Sylow subgroup of V(O). Further- more, G/V(O) and V(O)/V(l) are cyclic, and V(i)/V(i+l) (i= 1,2,...) is an Abelian group of type (p, p, ..., p). Ramification theory for Abel- ian extensions is described in Section F. For xEk, the series exp(x)=Loxn/n! (resp. 10g(1 + x)= Ld _1)"-1 xn/n) converges for ordx>e/(p-l), where e is the ramification index of k/Qp (resp. ordx>O). The additive group pm and the multiplicative group u(m) (m> e/(p - 1)) are isomorphic as topological groups under the mappings X--> y = exp(x), y -->x=log(y). If we fix an element nEk with ord n = 1, then an arbitrary element x E k with ordx=r is uniquely expressed in the form x 257E Local Fields =n'(ex, (pf-l = 1, exEU(1). The group u(1) is a multiplicative group on which Zp operates, and the structure of u (1) as a Zp-group can be determined explicitly ( [2, ch. IIJ). E. Cohomology For a normal extension K/k with Galois group G, we may consider the tcohomology groups W(G, K X) (r = 1,2,...) of G operating on the multiplicative group K x. In particular, the 2- cohomology group H2(G, K X) is important in local class field theory and theory of algebras over k. If e is the separable algebraic closure of k, i.e., the tmaximal separable field over k in the talgebraic closure of k, and r is the Galois group of Cjk, we can consider the 2-cohom- ology group H 2 (r, eX). Here we take as cocycles only those mappings f(O", ,) of r x r into e X that are continuous with respect to the tKrull topology of r and the discrete top- ology of ex. A fundamental theorem about the structure of H 2 (r, eX) states that the cocycles of H2(r, eX) that split in an extension K/k of degree n are exactly those cocycles that split in the unramified extension of degree n over k. Here a cocycle is said to split in K if it belongs to the tkernel of the homomorphism H 2 (r, eX)H2(H, eX), where H is the sub- group of r corresponding to K, and res is the mapping obtained by restricting 0", , of f(O", ,) to the elements of H. Let K/k be a normal extension of degree n with the Galois group G, and let H be the subgroup of r corresponding to K. Then the fundamental theorem combined with the exact sequence 0-->H 2 (G, K X)-->H 2 (r, CX)-->H 2 (H, CX) (1) known in the theory of Galois cohomology ( 59 Class Field Theory H, 200 Homological Algebra I), yields H2(G, K X)Z/nZ (cyclic group of order n), and H 2 (r, eX)  Q/Z, where Q is the additive group of rational numbers and Z is the additive group of rational in- tegers. These isomorphisms can be given in canonical form as follows: Denote the unrami- fied extension of degree n by Kn/k, and the Frobenius automorphism of Kn/k by 0", Then an element c of H2(G n , Kn X ) (where G n is the Galois group of Kn/k) is represented by the cocycle f(O"i, O"j) = a[(i+j)/nJ-[i/n]-u/nJ, i,j E Z, with aEk x , and conversely, every aEk x deter- mines an element c of H2(G n , K:) in this manner. Under these conditions, the corre- spondence between c and a gives rise to an isomorphism H2(G n , KnX)k' /NK./dKnX). 
257F Local Fields Next define an element inv C of Q/Z by inv c = (ord a)/n (modZ). Then since cEH 2 (r, C X) splits in an unramified extension of degree n by the fundamental theorem, the exact sequence (1) determines in a natural way an element c' of H2(G n , Kn X ) corresponding to c. Putting inv c =invc', we can show that H 2 (r, C X )03c-->invc gives an isomorphism H 2 (r, CX)'Q/Z. We call in v c the invariant of c E H 2 (r, eX). The invariant of an element of the cohomology group H2(G, K X) is defined to be the invariant of the corresponding element of H 2 (r, eX), which is determined by the exact sequence (1). Mapping an element of H2(G, K X) to its invar- iant, we obtain an isomorphism H2(G, K X ):;;; Z/nZ. F. Local Class Field Theory Let K/k be a normal extension of degree n with the Galois group G, let f(O", ,) be a cocycle representing the element of H 2 (G,K X ) with the invariant l/n, and put ( K/k ) - = IT {("0")-1. (J rEG Then 0"-->( K;k ) gives an isomorphism be- tween G/G' (where G' is the commutator sub- group of G) and k x /NK/k(K X), It follows from this that [e :NE;dP)J.s;[E:kJ for any exten- sion Elk of finite degree, and the equality holds if and only if E/k is Abelian. The inverse map- ( K/k ) ping of G 3 0"--> ---;;- E e / N K / k ( K X) for an Abelian extension K/k is written as k X O3a--> (a, K/k)E G, and (a, K/k) is called the norm- residue symbol. If L/k is Abelian and K/k is a subfield of L, then the restriction of (a, L/k) to K coincides with (a, K/k). Let ka be the max- imal Abelian extension of k, i.e" the union of all Abelian extensions of finite degree over k. Then for any aE k x, an element (a, k) of the Galois group G(ka/k) of ka/k is uniquely deter- mined by (a,k)(y)=(a,k(y)/k)(y), YEka. The mapping e O3a-->(a,k)EG(k a /k) is a one-to-one continuous homomorphism, and the image is tdense in G(ka/k). It has also been proved that there exists one and only one Abelian exten- sion K/k with NK/k(KX)=A for any given closed subgroup A of finite index of k x. There- fore closed subgroups A of finite index of e are in one-to-one correspondence with finite Abelian extensions K of k through the relation A = N K / k ( K X), and in this case A is called the subgroup of k X corresponding to K/k. Let K/k be an Abelian extension of finite degree, A be its corresponding subgroup of k x ,and V(i) (i = 0, 1,2, .. . ) be ramification groups of K/k. Furthermore, define constants 954 VI, V2'''', V r by V(O)= V(t)=... = V(vtl V(V 1 +t) = ... = V(V2) '"  V(V, 1+1)= '" = V(V,)  V(V,+t) = (1), denote the order of V(Vi+l) by n i (i = 1,2, ..., r), and put up = V o + (no/no)(v i - Vo) + ... + (np_I/no)(v p -Vp-l), P = 1,2, ..., r (here we understand V o = -1, no = [V(O): IJ). Then Ul' ... , u, are rational integers, and we have Au(O)= ... =Au(U1)Au(U1+1)= .,. =Au(U2)... Au(U,+I)=A (H, Hasse). If m is the smallest integer with A ;; u(m), then pm is called the conductor of K/k. The above results of Hasse show that m = U r + 1. On the other hand, it is known that the correspondence between Au(U p ) and V(V p ) (p = 1,2, ... ,r) is given by the norm-residue symbol (a, K/k) (--" 59 Class Field Theory), G. Theory of Algebras By the general theory oftcrossed products of algebras, the structure of the tBrauer group formed by the classes of tnormal simple alge- bras over a local field k is obtained directly from results concerning cohomology ( Sec- tion F). Namely, a tnormal simple algebra over k splits over a separable extension of degree n if and only if it splits over the unrami- fied extension of degree n, and the Brauer group of k is isomorphic to Q/Z. Furthermore, the texponent (the order as an element of the Brauer group) of a normal simple algebra  over k coincides with the tSchur index, and if [Ill: kJ = n 2 , then  is expressed as a crossed product with respect to any normal extension of degree n over k. The invariant of the factor set (2-cocycle) that appears in this crossed product expression is called the invariant of  ( 29 Associative Algebras; for the properties of  as a topological ring and as a topological group of the group of invertible elements of   6 Adeles and Ideles). H. Explicit Formulas Let K/k be an Abelian extension of finite de- gree. When we have an explicit formula for the norm-residue symbol (a, K/k), we say that we have an explicit reciprocity law. Let k be a p-adic number field containing a primitive mth root m of unity, and let fJ be an element in k X . Let p be a prime number con- tained in p, Since the Kummer extension K = k(::j7i) over k is Abelian, the Hilbert norm- residue symbol (a, fJ)m is defined by 
955 (ex,K/k)(::fii)=(ex,fJ)m::fii, where (ex,fJ)m is an mth root of unity. In this case, the pro blem of obtaining an explicit reciprocity law is solved if we can express the symbol (ex, fJ)m in terms of ex, fJ and suitable parameters depending on the ground field k. In particular, if p=2, m=2, we have the simple formula given by Hasse, (ex, fJh = (_lf r «'-I)(P-l)/4), where ex, fJ are two units in k satisfying ex = fJ = 1 (mod 2) and Tr is the trace from k to Qp. Similar formulas for the comple- mentary laws are also known [10]. On the other hand, if m = p is an odd prime and k = Qp«(p), we have the following formulas for a prime element Ap = 1- (p and two units ex, fJ satisfying ex= 1 (mod p 2), fJ= 1 (modp): (ex,fJ)p=(V/p)Tr«(plOg, DlogP\ «(p, fJ)p = (I/p)Tr(IOgp), (},p, fJ)p = (;(I/p)Tr«(p/Ap)logp), where DlogfJ=(1/fJ)L1 ihi},-l, while the b i are determined by the Ap-expansion fJ= Lob), biEZ p [5]. Furthermore, we have an explicit Kummer-Hilbert formula deduced from (2) in terms of Kummer's logarithmic differential quotients [8, 10]. Concerning the complementary laws (3), (4), the following Artin-Hasse formulas are known for k = Qp«(p') and fJ= 1 (modp): «(p" fJ)p' = (/p')Tr(logp), (A p " fJ)p' = (;p/p')Tr«(p'/Ap,)logp), where Apn = 1 - (pn [9]. Utilizing these for- mulas, K. Iwasawa obtained a formula for (rJ., fJ)p' that is a natural generalization of (2) [13]. A. Wiles has shown that a generalization of the Artin-Hasse-Iwasawa formulas can be obtained in terms of the Lubin-Tate formal groups (Ann. Math" (2) 107 (1978)). Concerning (ex, fJ)p" we have Shafarevich's reciprocity law [11]. To explain it, let ko be the inertia field in k, i.e., the maximal subfield in k that is unramified over Qp. For an arbitrary integer a in ko, we consider the Artin-Hasse function E(a,x)=exp( -L(a, x)), where L(a,x) = Lo(aai/pi)xP' with the tFrobenius auto- morphism 0" of ko/Qp. We choose a prime element if in ko such that (p' = E(I, if) and an integer a in a suitable unramified extension field of ko such that aa - a = a for a given integer a in ko. Given any pn-primary element x in k (i.e., k(x l / p ') is unramified over k), there exists an integer a in ko such that x  E(a) =E(pna, ii), where x y (x, YEP) means x = Y' u for an element u in k x p'. Furthermore, if b is an element of k x, we have the canonical decomposition formula bnd*E(d) n E(dj,n j ), I j<eop (j.P) I 257 Ref. Local Fields where n is a prime element in k, d*EZ, d, d i are integers in k o , and eo = e/(p - 1) with the ramifi- cation index e of k. The explicit formula due to Shafarevich and Hasse is expressed as (a, [3)pn = (;ro(a*h-ah*+c), where Tro is the trace from ko to Qp and a, a* (resp. b, b*) are determined by the canonical decompositions of ex (resp. fJ) as stated above. The element c in ko is determined by n 1 i,j<eop (i,P)=(j.P) I E(iaib j , ni+j)=yE(c) n E(c j , n j ) lj<eop (j,P) I (2) (3) (4) for odd p, and by ( -1 (b* n E(iaibj> ni+j) 1i,j<2eo (i,2)(J,2) I X fI E«i2-I+j2V-I)arbr,ni2P+j2') 11. v=l == y  E(c) n E(cj, n j ) 1j<2eo (j,2)= 1 (3') (4') for p==2. Several formulas for the case where k is a function field are also known [1,5]. Let k be a general local field. When K is an extension field of k obtained by adjoining nn_ division points {A} of the Lubin-Tate formal group F defined over the integer ring of k, the extension K/k is totally ramified and Abelian, and we have an explicit formula: (u- 1 , K/k)A = [uJF(A), where u is a unit in k and [UJF is an endomorphism of F corresponding to the unit u. In particular, this formula implies the cyclo- tomic reciprocity law [12]. The problem of obtaining an explicit re- ciprocity law arises in the problem of obtain- ing the reciprocity law for power residues from the law of reciprocity in global class field theory, The problem is closely connected with T. Kubota's recent results (e.g., [14]), which clarify the analytic meaning of the reciprocity law in algebraic num ber fields. References [IJ E. Artin, Algebraic numbers and algebraic functions, Lecture notes, Princeton Univ., 1950-1951 (Gordon & Breach, 1967). [2J H. Hasse, Zahlentheorie, Akademie- Verlag, third edition, 1969; English translation, N um ber theory, Springer, 1980. [3] H. Hasse, Normenresttheorie galoisscher Zahlkorper mit Anwendungen auf Fiihrer und Diskriminante abelscher Zahlkorper, J. Fac. Sci. Univ. Tokyo, 2 (1934), 477-498. [4J M. Deuring, Algebren, Erg. Math., Springer, second edition, 1968. [5J E. Artin and 1. Tate, Class field theory, Lecture notes, Princeton Univ., 1951 (Ben- jamin, 1967), 
258 A Lorentz Group [6J J.-P, Serre, Corps locaux, Actualites Sci. Ind., Hermann, 1962; English translation, Local fields, Springer, 1979. [7J O. F. G. Schilling, The theory of valu- ations, Amer. Math. Soc. Math. Surveys, 1950_ [8J T. Takagi, On the law of reciprocity in the cyclotomic corpus, Proc. Phys.-Math. Soc. Japan, 4 (1922),173-182. [9J E. Artin and H. Hasse, Die beiden Ergan- zungssatze zum Reziprozitatsgesetz der In-ten Potenzreste im Korper der In-ten Einheitswur- zeln, Abh. Math. Sem. Univ. Hamburg, 6 (1928), 146-162. [10J H. Hasse, Bericht iiber neuere Untersu- chungen und Probleme aus der Teorie der algebraischen Zahlkorper II, Jber. Deutsch. Math. Verein. (1930) (Physica Verlag, 1965). [11J I. R. Shafarevich, A general reciprocity law, Amer. Math. Soc. Trans\., 4 (1950), 73- 106. (Original in Russian, 1950.) [12J J. Lubin and J. Tate, Formal complex multiplication in local fields, Ann. Math., (2) 81 (1965), 380-387. [13J K. Iwasawa, On explicit formulas for the norm residue symbol, J. Math. Soc. Japan, 20 (1968),151-165. [14J T. Kubota, Ein arithmetischer Satz iiber eine Matrizengruppe, J. Reine Angew. Math., 222 (1966), 55-57. [15J J. W. S. Cassels and A. Frolich (eds.), Algebraic number theory, Academic Press, 1967, 258 (XX.24) Lorentz Group A. Lorentz Group (1) Minkowski space. A 4-dimensional real vector space M with an indefinite inner prod- uct between two vectors x and y defined by x' y=xG'y= X O yO _Xl y' _x 2 y2 _x3 y3, G (1 o o -1 o  ) = (gJ, -1 o -1 o o is called a Minkowski space. A vector x E M is classified according to the signs of x. x and X O as follows. X.X>O, xO>O: future timelike } timelike, past timelike future lightlike } origin IightIike, past lightlike X.X>O, xO<O: x'x=O, xO>O: x'x=O,XO=O: x'x=O, xo<O: x . x < 0: spacelike. 956 The set of all future (or past) timelike vectors is called the future (or past) cone and is denoted by V+ (or V_). The set of light like vectors is called the light cone. The set of spacelike vec- tors is called the side cone. Minkowski space is used in the tspecial theory of relativity (with units such that the velocity oflight is 1), where {(x - y)' (x _ y)}'/2 is called the proper time between two mutually timelike events x and y. (2) Inhomogeneous Lorentz group. The group of all one-to-one mappings of the Min- kowski space onto itself preserving the proper time between any timelike pair of points is called the full inhomogeneous Lorentz group or Poincare group and is denoted by .OJ'. An ele- ment 9 of & is a linear transformation 3 (gx)= L Ax'+a (J1=0,1,2,3), v=o where aEM and A=(A) is any real matrix satisfying AG('A)=G. We write g=(a,A) and gx=Ax+a. Then (a l ,Ad(a 2 ,A 2 )=(a l +A l a 2 ,A I A 2 ). The normal subgroup of & consisting of (a, I), aEM, is called the translation group on M. The subgroup of & consisting of (0, A) is called the full homogeneous Lorentz group and is denoted by 2 or @(l,3). It consists of the following four connected components: 21 = {AE2" I detA= 1, AV+ = V+}, 2) == { A E 2" I det A == 1 A V = V } + , + -, 2 = {AE2" I detA== -1, A V+ = V+}, 2 ={AE2"ldetA= -1,AV+ = V_}. The identity mapping, the space-time inversion x--> - x, the space inversion X O -->X O , x i --> - Xi (i = 1,2,3), and the time reversal X O --> - x O , Xi-->X i are their typical elements, respectively. If detA = 1, AE2 is called proper. If A V+ = V+, AEL is called orthochronous. The identity component 21 is called the restricted homoge- neous Lorentz group. (This usage in mathe- matical physics may differ from the general terminology for @(m, n), in which the identity component is called proper.) The same terminology is used for &, which also consists of four components gp1, &, gp, and gp . (3) Universal covering group. For XE M, let 3 x= " xO" L.. , o ( 1 0 ) ( 0 1 ) ( 0 -i ) 0"0 = 0 1 ' 0" I = 1 0 ' 0" 2 = i 0' 0"3=G _). 
957 where (j = (0" 1,0"2,0"3) is called the Pauli spin matrix. For any 2 x 2 matrix A with determi- nant I (i.e., A ESL(2, C)), AxA*=x A , xA=A(A)x, defines J\(A)E2t, and A-->J\(A) is a two-to- one mapping from SL(2, C) onto 2. By this covering mapping, SL(2, C) becomes the uni- versal covering group for £'!. If A is unitary (i.e., AESU(2)), J\(A) leaves X O invariant, and SU(2) is the universal covering group of @(3)+ (the 3-dimensional proper orthogonal group or proper rotation group). For a complex vector ZE M + iM, Az+ B=ZA,B, zA,B=J\(A, B)z defines a complex Lorentz transformation A(A, B), and (A, B)-->J\(A, B) gives a covering mapping from SL(2, C) x SL(2, C) onto the proper complex Lorentz group 2(C) +, consist- ing of complex 4 x 4 matrices A satisfying J\G('J\) = G and detJ\ = 1. B. Finite-Dimensional Representations (1) 2. Any continuous representation of SL(2, C) on a finite-dimensional complex vector space is a direct sum of irreducible representations. Let E = C 2 be the 2-dimensional complex vector space on which SL(2, C) is acting. Let a representation nk,n of SL(2, C) on the (k + n)- fold tensor product E0(k+n) be nk,n(A) = A Ok @(A)0 n , where A is the complex conjugate of A. A vector  in this representation space is called a mixed spinor of rank (k, n) and its components are written as e 1 AkP,1 .p,', with undotted indices ..1.1' ..., Ak and dotted indices fll ... fln all taking values 1 or 2. If n = 0 (or k = 0), it is called an undotted (or dotted) spinor of rank k (or n). A spinor with upper indices, called a contra- variant spinor, can be converted to a spin or with lower indices (partially or totally), called a covariant spinor, by f.=(f.A)=( _ } For example, i'Al P,1. =" i'A 1 KP, IV f. f... S 2 2 J.1.2  S 'KA2 'VJ.1.2" K,V The restriction of nk,n to the subspace of spinors which are invariant under permuta- tions of undotted indices and under those of dotted indices is an irreducible representation of dimension (k + l)(n+ 1), which is denoted by [k, n]. The set of [k, n]. k==O, I,..., n ==0,1,..., exhausts all finite-dimensional irreducible representations of SL(2, C). 258C Lorentz Group The tensor product of two irreducible repre- sentations is decomposed as follows: [k l , nlJ @ [k 2 , n2J = L 8J [k, nJ, where [k, nJ appears with multiplicity 1 for k, n satisfying k l +k2klkl-k21, n l +n2n In I - n 2 1. The complex conjugate representa- tion of [k, nJ is en, k]. For xEM, x is a mixed spinor of rank (1, 1). (2) SU(2). The restriction of [k, OJ (k = 0,1,...,) to the subgroup SU(2) is an irreduc- ible representation (called the spinor represen- tation of rank k) and exhausts all irreducible representations of SU(2). [0, nJ is equivalent to [n,OJ for SU(2) due to A =f.Af.- l for AESU(2). [k, nJ for SU(2) is equivalent to k+n [k,OJ@[n,OJ= L 8J [r,O]. rlk-nl An explicit coefficient in this decomposition is called a tClebsch-Gordan coefficient. Any representation of @(3)+ is a representa- tion of SU(2). The irreducible representation of SU(2) with rank r is an irreducible repre- sentation of @(3)+ if r is even. It is an irreduc- ible double-valued representation of @(3)+ if r is odd. Likewise, the irreducible representa- tion of SL(2, C) with rank (k, n) is an irreduc- ible representation of 2 if k + n is even and double-valued if k + n is odd. (3) 2 1 = 2 U 21. The space inversion P induces an automorphism of £' by J\......PJ\P, that of SL(2, C) by A -->f.Af. -I and the asso- ciated mapping U(A)--> V(A)= U(f.Af.- 1 ) of irreducible representations [k, nJ -->[n, k]. Thus an irreducible representation of 2 1 is obtained on [k,nJ EB [n,kJ ifn#k, and two inequivalent irreducible representations on en, nJ corre- sponding to two choices (:t 1) for the operator representing P. A vector in [k, nJ EB en, kJ is called a bispinor of rank (k, n). The wave func- tion of the Dirac equation is a bispinor of rank (1,0). C. Unitary Representations (1) Invariance in quantum mechanics. In the special theory ofrelativity, each gEf?l'J repre- sents a symmetry of the system, namely, a transformation of states if; of the system to states gif; such that "physical relations" be- tween two states if;l and if;2 should be the same as those between gif;l and gif;2' In quantum mechanics, a (pure) state is represented by a unit ray (i.e., a set of vectors e iB ,¥, O.s; 0 < 2n for a unit vector '1') in a Hilbert space, and 1('1'" '¥2W is supposed to be an obser- vable quantity, called the transition prob- ability. Thus the special theory of relativity in quantum-mechanical situation implies a con- tinuous representation of.9'1 by bijective map- 
258C Lorentz Group pings of unit rays in a Hilbert space preserv- ing the transition probability. By the Wigner theorem, such a mapping is implemented by either a unitary or anti unitary operator U(g) as a mapping {eie\ll}-->{eieU(g)\II} ( e.g., [12]). In order for this to be a group of transformations of unit rays, V should be a projective representation: U(gl)U(g2)= U(gl,g2)U(glg2)' where U(gl,g2) is a 2- cocycle with a complex value of modulus 1. In the case of.qp1, each U(g) has to be uni- tary (true for any connected Lie group), and there is a choice of U(a, A) from the unitary ray {e i6 U(a,J\(A))10.s;0<2n} so as to make U(a, A) (aE M, AESL(2, C)) a continuous uni- tary representation of the universal covering group &1 = {(a, A)laEM, AESL(2, C)} of &J [2]. (2) &J. Any continuous unitary represen- tation of & on a separable Hilbert space is a direct integral of continuous irreducible uni- tary representations of &1 (called irreducible representations in the following). Any con- tinuous unitary representation of the trans- lation group {(a, 1)laEM}, the group being commutative, is of the form (U(a, 1)\II)(p)=e ia P\II(p), \II E f H(p)dJi(p). The parameter pE M is called the energy- momentum 4-vector or 4-momentum. In an irreducible representation of .9>J, the measure dJi is equivalent to an invariant measure on an orbit of the conjugacy action g( a, l)g -I = (J\(A)a, I), 9 = (h, A)E&J, i.e., on one of the following orbits. mot: p2==p' p=m 2 (m>O), :tpo>O, Or: p2=0, :tpo>O, O:p=O, im:p2=_m 2 (m>O). The parametcr (p2)1/2 is called the mass. F or each orbit the subgroup of all A E SL(2, C) that do not move a predetermined point q) on the orbit A, is called the little group (of A Or of qJ, Up to isomorphism, the little group contains the foIlowing. m-t: SU(2), consisting of all unitary A E SL(2, C), which leaves qm + = (:t m, 0, 0, 0) invariant. - O:!:: The 2-fold covering group of the 2- dimensional Euclidean group, consisting of all A(O,z)=(iO :;:z), ZEC, OER (q0ot =(:t 1,0,0, :tl), with the product A(0"zdA(02,z2)=A(01 +02,ZI +e2iOlz2)' 958 0: SL(2, C). im: SL(2, C) consisting of all real A E SL(2, C), which leaves qim=(O, 0, m, 0) invariant. Any irreducible representation of P?l is uniquely determined (up to unitary equiva- lence) by an orbit ..1. and an irreducible repre- sentation D of the little group G A C SL(2, C) on a Hilbert space K as follows (as an tinduced represen ta tion): \II = f \II(p)dJi(p)E H = f H(p)dJi(p), [U(a, A)\IIJ(p)=e ia PD(R(A,p))\II(;\(A)-1 p), R(A, p) = L(p) -I AL(J\(A -I )p), where H(p)= K, dJi(p) is a measure (unique up to a multiplicative constant) on A, invariant under the action of £'1 on }, (p--> J\p,;\ E £'1), L(p) is a fixed element of SL(2, C) for each pEA such that J\(L(P))qA = p, and R(A, p) is called the Wigner rotation. (3) The orbit m:!:. The irreducible represen- tation with the orbit mjc and the irreducible representation of S V (2) with rank 2j (j = 0,1/2,1,...) is denoted by [mot,j], wherej is called the spin. It is given by [U(a, A)\IIJ (p) = e ip a A 02j\ll(J\(A) -I p), (\II, <1» = f (\II (p), (m/:t ji) 02j <l>(p)) X (::I:: 2 p O) -I dpl dp2 d p 3, where \II(p)EC 2 j for each p and p is restricted to the orbit m x . (4) The orbit Ox' Irreducible representations of the little group can be classified by the orbit in the spectrum of the normal Abelian sub- group consisting of A(O,z), zEC. Since the spectrum is a plane and A(O, z) acts as a rota- tion by an angle 20, an orbit is a circle of radius p. The case p = 0 is further classified completely by the spin j = 0, :t 1/2, ::I:: 1, ... (used for the description of a massless particle of spin UI and helicity 0" = signj). It is denoted by [0 x' j] and is given by [U(a, A) \II] (p) = e ip a(Aa) 02U1 \II(;\(A)-1 p), (\II, <1» = f (\II (p), (:t jia)0 2 ljl <I>(p) x (:t 2 p o) -I dpl dp2 d p 3, where Aa=A or A and ji"=I;'ji6- 1 or ji de- pending on 0" = 1 or  1, and \II(p) belongs to the quotient of cn and the subspace consisting of all X with (X, (jia)0 2 UI X) = 0 (the quotient is of one dimension). The case p#-O (caIled continuous spin) is completely classified by p and U(O, -1) = :t 1. 
959 The representation D of the little group for p #0 is given by [D(A(z, O))\PJ (<p) =exp i{p Re e-i<l'z + kG} \P(<p - 20), where k=O or 1 (accordingly U(O, -1)= lor -1), <pER mod2n and \PEL 2 ([0,2n],d<p). (5) The orbit O. Irreducible representations of the little group SL(2, C) are as foIlows: The principal series with two parameters -00 < p< 00 and m=O, :t 1,... are given by [D(A)fJ (z) = (bz + d)mlbz + dI2ip-m-Z f('Az), A=(: } tAz =(az+ c)/(bz+ d), where f is an L 2 -function of z = x + iy with respect to dxdy. The supplementary series with a parameter 0 < p < 1 is given by [D(A)fJ (z)= Ibz+ dI 2 p- z fCAz), (/1'/2)= f fd z d fz(Z2) x IZI -z21- z - 2P dx I dXzdYI dyz. Together with the identity representation D(A) = I, they exhaust all possibilities up to unitary equivalence. (6) The orbit im. Irreducible representations of the little group SL(2, R) are as foIlows: The principal continuous series with parameters 0"=0, l,pOforO"=Oandp>OforO"=1 are given by [D(A)fJ (x) = fCAx)lbx + dliP-1 (sign(bx + dW, where fELz« -00, oo),dx), The supplementary series with a parameter O.s; P < 1 are given by [D(A)fJ(x)= Ibx + dl-1-Pf('Ax), (/1'/2)== f f f(xd f(x2)lxl-xzl-I-PdXI dx z . The principal discrete series with a parameter n = 1,2,3, ... are given by [D(A)fJ (z) = (bz + d)-nf('Az), (/1>1z) = r<. dx f yn- 2 d yfd z ) fz(z), where f is holomorphic in either the upper or lower half-plane, (so that the y-integration is over (0, 00) or (-00,0) accordingly), z=x+ iy, and the two choices give rise to two in- equivalent representations for each n. Together with the identity representation D(A)= I, they exhaust all possibilities up to unitary equivalence. 258 Ref. Lorentz Group D. Infinitesimal Generators In any continuous unitary representation V of g;1 +, i-llim t-I(U(y(t))-I) Hoo (defined on the dense set of all those vectors on which the limit exist) for anyone-parameter subgroup y(t) of g;! is a self-adjoint operator called the infinitesimal generator for y(t). If y(t) = (ta, 1) (aE M), the generator is p. a, where the components pO, pi, p2, p3 of P mutually commute and are caIled energy-momentum operators (or 4-momentum operators). If y(t) = (O,exp(it O"k/2)), k= 1,2,3 (the rotation by an angle - t around the kth axis), the generator is written as Jk = Mjl == - M1j «jlk) is a cyclic permutation of (123)) and is called the angular momentum operator. If y(t) = (0, exp(tO"k/2)), k = 1, 2, 3 (the pure Lorentz transformation along the kth axis with a velocity tanh t), it is written as N k = M Ok = - M kO , and MPV (Ji, v = 0, 1,2,3) is called the angular momentum tensor. The value of the scalar p. P is the square of the mass and, when p. P.s; 0, the alternatives of pO being positive or negative definite or zero give the invariants of irreduc- ible representations, distinguishing different orbits, The angular momentum in the center of mass coordinates, w K =2- 1 L[';K;PVPAMpv> gives another invariant W' w, called the PauIi- Lubanski vector, where the quantity with lower indices are obtained from those with upper indices by contraction with the Minkowski metric g, e.g., Pv = Lp pp gpv; ['; is totaIly anti- symmetric in its indices and [';0123 = 1. If p. P>O, then W' w= - j(j + I)P' P defines the spin j. In the representation with orbit 0:t and p = 0, w P = O"j pp defines the spin j  0 and the helicity 0" = :t 1. References [IJ V. Bargmann, Irreducible unitary repre- sentations of the Lorentz group, Ann. Math., 48 (1947), 568-640. [2J V. Bargmann, On unitary ray represen- tations of continuous groups, Ann, Math., 59 (1954),1-46. [3J C. ChevaIley, The algebraic theory of spinors, Columbia Univ. Press, 1954. [4J I. M. Gel'fand, R. A. Minlos, and Z. Ya. Shapiro, Representations of the rotation and Lorentz groups and their applications, Per- gamon Press, 1963. (Original in Russian, 1958.) [5J I. M. Gel'fand and M. A. Naimark, Uni- 
258 Ref. Lorentz Group 960 tary representations of the Lorentz group, 1. Phys., 10 (1946), 93-94; Izv. Akad. Nauk SSSR, 11 (1947),411-504. [6] M. A. Naimark, Linear representations of the Lorentz group, Pergamon Press, 1964. (Original in Russian, 1958.) [7J M. Schaaf, The reduction of the product of two irreducible unitary representations of the proper orthochronous quantum-mechanical Poincare group, Lecture notes in physics 5, Springer, 1970. [8J A. S. Wightman, L'invariance dans la mecanique quantique relativiste, Dispersion Relations and Elementary Particles, C. De Witt and R. Omnes (eds.), Wiley, 1960, 161- 226. [9J E. P. Wigner, Unitary representations of the inhomogeneous Lorentz group, Ann. Math., 40 (1939), 149-204. [lOJ E. P. Wigner, Group theory and its ap- plications to quantum mechanics of atomic spectra, Academic Press, 1959. (Original in German, 1931.) [IIJ M. Sugiura, Unitary representations and harmonic analysis, Kodansha, Halsted Press, 1975. [12J U. Uhlhorn, Representation of symmetry transformations in quantum mechanics, Arkiv for Physik, 23 (1963), 307-340. 
259 Ref. Magnetohydrodynamics 259 (XX.11) Magnetohydrodynamics Magnetohydrodynamics (also called hydromag- netics or magnetofluid dynamics) is concerned with the motion of an electrically conductive fluid in the presence of a magnetic field. An electromotive force (e.m.f.), induced by the motion of the fluid in the magnetic field, in turn induces an electric current that perturbs the original magnetic field. On the other hand, an electromagnetic force due to the magnetic field deforms the original motion, Many im- portant and interesting phenomena result from such interactions of the magnetic field and the motion of the fluid. In ordinary magnetohydrodynamics we assume that (i) the fluid is continuous, (ii) electric conductivity (J is not negligible, and (iii) fluid velocity is small compared with the velocity of light, i.e., max(Uj(c 2 T 2 ), U 2 jc 2 )« min(l, Rm), where L is a representative length, T a representative time, U a representative velocity, and Rm = O"flU L (11 is the magnetic permeability) is a nondimensional number called the magnetic Reynolds number. In this case we can neglect in the tMaxwell equations the displacement and convective currents in comparison with the conductive current J, and write divB=O, rotH=J, rotE= -aBjat, (I) Pe=divsE, (2) where 8 is the dielectric constant, and B= flH, J =(J(E+v x B), Here the latter equation, Ohm's law for a moving medium, is valid when the effects of temperature gradient, the Hall effect, etc., are small. The motion of fluids ( 205 Hydro- dynamics) is governed by the tequation of continuity apjat+div pv=O, the tequation of motion a(pv)jat = - div(pv @ v - P - T) (P is the mechanical stress tensor, T is the Maxwell stress tensor Tij= fl(H;Hj - H2Dij)) or pDvjDt=divP+K, K=J x B, the tequation of state, and the tenergy equa- tion. (From assumption (iii), the force PeE on the electric charge Pe can be neglected com- pared with the force J x B on the electric cur- rent, and (2) can be separated from the other equations in order to determine Pe') When fl and 0" are uniform, we can eliminate E and J from (1) and Ohm's law to obtain the 962 induction equation aBjat = rot(v x B) + ),B, (6) where =graddiv-rotrot, -1.= Ij(flO"). This is of the same form as the equation arojat = rot(v x ro) + v ro (v is the kinematic viscosity) for the tvorticity ro of an ordinary tincom- pressible viscous fluid. We call ..1. = 1 j(flO") the magnetic viscosity, and the ratio of the first term (the convection term) to the second one (the diffusion term) in the right-hand side of (6) is the magnetic Reynolds number Rm = U Lj}, = flO" U L. Rm = 00 corresponds to the tperfect- fluid case as 0"-->00 or L-->oo. In this case, the magnetic flux moves with the fluid as if both were frozen together, as in the tHelmholtz theorem about vorticity. The existence of a transverse wave of velocity cx = J flH 2 j P along magnetic lines of force in the fluid, owing to the tension flH 2 (Tij except for the magnetic pressure - t flH 2 D i ) of the magnetic flux frozen to the fluid, was noted for the first time by H. Alfven (1943), and this wave is called the Alfven wave. In a compressible perfect fluid (Rm == 00, P ij = - pD ij , where p is the pressure), (1)-(5) reduce to a system of thyperbolic partial differential equations, yielding as tcharacteristic surfaces in addition to the pure Alfven wave two magnetosonic waves of phase velocities (3) aT = C ) x (a 2 + cx 2 :t J (a 2 + cx2)_4a2cx2 cos 2 () ) 1/2 (0 is the angle between the magnetic field and the wave normal, a is the velocity of sound interfering with the Alfven wave), We call a+ and a_ the fast wave and the slow wave, re- spectively. Hydromagnetic dynamo theory explains the generation and maintenance of the magnetic field inside the earth on the basis of (6). Applications are also made to cosmic problems and MHD generation of electricity. Mercury, liquid sodium, etc., can be used to verify the theoretical results. Extra- polation may be made to the plasma used in thermonuclear fusion to the extent that the hydrodynamic treatment is valid as a first approximation. (4) (5) (5') References [IJ T. G. Cowling, Magnetohydrodynamics, Interscience, 1957. [2J H. Alfven and C. G. Falthammar, Cosmi- cal electrodynamics, Clarendon Press, second edition, 1963, [3J L. D, Landau and E, M. Lifshitz, Electro- 
963 dynamics of continuous media, Course of theoretical physics 8, Pergamon Press, 1960. [4] H. Cabannes, Theoretical magnetolluid- dynamics, Academic Press, 1970. [5] Progress of theoretical physics, suppl. no. 24, 1962. 260 (XVII.6) Markov Chains A. General Remarks We consider a random process {X,} (t  0 or t = 0, 1,2, ...) that is governed by some prob- ability law. One of the most important is the process whose probability law of X, under the condition X S1 =a l , ""Xs,=a n (Sl <S2<'" < Sn < t) coincides with that under the condition X s , = an' This is called the Markov property, and a process with this property is called a Markov process ( 261 Markov Processes). If the process takes place in a finite or countable set S, it is called a Markov chain. A Markov process is specified by its ttran- sition probability (i.e., the system of the prob- ability laws of X, given Xs = x for S < t) and initial distribution ( 261 Markov Processes). The transition probability of a Markov chain is denoted by Ps)x, y), while that of a general Markov process is denoted by P(s, x, t, A). Before proceeding to a sophisticated definition of Markov chains, we give some examples. (a) Suppose that l> 2, ... are mutually tindepen- dent real trandom variables on a probability space (Q,!B, P) and fl, f2, '" are Borel measur- able functions from R 2 to R. The process {X n } which is defined by the recurrence relation Xo = o' X n = fn(n' Xn-d is a Markov process with its transition probability given by P(n- l,x,n,A)=PUn(n,x)EA}. In particular, if the possible values of each n are at most countable, {X n } becomes a Markov chain. It is ttemporally homogeneous, i.e., Pn,n+l (x, y) is independent of n, if the gn} are tidentically distributed and fl = f2 = ... . Moreover, if each n is integer-valued and fn(x, y) = x+ y, {X n } is a I-dimensional random walk. (b) Ehrenfest model of diffusion. Suppose that N molecules are distributed in two containers A and B. At each trial a molecule is chosen at random and moved from its container to the other. Let X n be the number of molecules in A after the nth trial. Then {X n } is a temporally homo- geneous Markov chain with Pn,n+! (i, i-I) = i/N, Pn,n+! (i, i+ 1) =(N - i)/N for O.s; i.s; N (Pn,n+! (i,11= 0 otherwise). (c) Population model. Consider a population in which there is no interaction among individuals. Suppose that during a time interval (t, t + tJ each individ- 260 A Markov Chains ual gives birth to a new one with probability ;,t+o(t) and dies with probability Ji!1t+ o(M). Let X, be the population size at time t. Then the process {X,} becomes a birth and death process, the most typical of Markov chains with continuous time parameter ( Section G). Numerous Markov chains with special structure are extensively studied in various fields of applications of probability theory such as queuing theory ( Section H), the theory of tbranching processes, the sto- chastic theory of tpopulation genetics, and others [1,3]. In this article, however, we are mainly concerned with the theoretical aspects of Markov chains. Hereafter, we consider only ttemporally homogeneous Markov chains, which are re- formulated as a system of tstochastic pro- cesses in the following way. Let S be a finite or countable set (called the tstate space of the motion) and T be {O, 1,2,... } or [0, 00) (called the time parameter space). A family (X" PxJ'ET,XES is called a Markov chain if Px, the tprobability law of the process X, starting at x, is subjected to the condition PAX S1 +',= Yl, ... ,X SmH , = Yml X'I =X I , ...,X"""x n ) = P X ,(X'1 = YI,"" X Sm = Ym), (1) for every t j , Sk E T (j = 1, ... , n; k = 1, .. " m) such that t l <t 2 <... <tn' O<SI <S2'" <Sm' Then the function defined by p,(x,y)=PAX,= y), tET, x, YES satisfies O.s;p,(X,y).s; 1, (2) (3) L p,(X, y) = 1, YES p,+.(X,y)= L p,(x,z)p.(z,y) ZES (4) because of (1) and the general properties of probability laws. We call p,(x, y) (tE T,X,YES) the transition probability (or transition func- tion) of the Markov chain, and relation (4) is called the Chapman-Kolmogorov equation. Furthermore, the matrix p, = (p,(x, y)) is called the transition matrix. Conversely, for a given p,(x, y) with prop- erties (2), (3), and (4), we can construct a Mar- kov chain that satisfies PAX" =X 1 , ...,X,,=x n ) = P'I (x, XdP'Z-'1 (Xl' X 2 )... P"-"-1 (Xn-l> X n ) by using Kolmogorov's extension theorem, Such a Markov chain is essentially unique. If we are given a stochastic matrix P=(PX,y), i.e., a matrix satisfying O.s;PX,y.s; I and LyESPx,y= 1, the components Pn(x, y) of the iterated matrix pn satisfy (2)-(4), and hence there exists a 
260B Markov Chains Markov chain with discrete parameter having Pn(x, y) as its transition probability. Let flx = P(X o = x) be the initial distribution over S at time O. Then the distribution fl) = P(X, = y) at time t is obtained from fl) = LXES flxP,(x, y). In particular, if fl) is indepen- dent of t, i.e., fly = L flxP,(x, y), XES then fl is called the invariant distribution of the Markov chain. An arbitrary real nonnegative solution of (5) is called an invariant measure. When S is the set of all d-dimensionallattice points and Px,y =n x - y , the associated Markov chain is called a (general) random walk. In particular, if n x = 2 -n for every neighboring point x of the origin and = 0 otherwise, it is called a standard random walk (or simply random walk). Now, when the equal sign in (3) is replaced by <, we consider the space S* obtained by adjoining 8 (tdeath point) to S, and define p;"(x,y)=p,(x,y), X,YES; p;"(x, 8)= 1- L p,(x, y), XES; YES p;"(8,8)=I; p;"(8,x) =0. Then we can associate a Markov chain (X" P/) on S* with {pi}. If X, equals 8, the particle of the process is regarded as extinct at the random time' = inf{ t I X, = 8}, called the life- time (or killing time). In this case, the process X, restricted to t smaller than' is also called a Markov chain on S with the transition probabilities {p,(x, y)}, Then the conditions LyEsP,(x,y)= 1 and PA' = 00)= 1 are equiva- lent, and the chain is called conservative if PA' = 00) = 1 for every x. In this section we have restricted ourselves to the temporally homogeneous case. In the temporally inhomogeneous case, we have to consider the probability laws P x " of the path starting from XES at time t, instead of Px. Equation (1) becomes P X ,,(X S1 = YI, '.. ,X Sm = Yml X'I =x 1 ,... ,X," =x n ) =PX".dXSl = YI,'" ,X Sm = Ym), t 1 <... <tn<SI <... <sn' For the rest of this article, we consider only the homogeneous case. B. Markov Chains with Discrete Parameter Hereafter, the one-step and the n-step tran- sition probability of a Markov chain with discrete parameter will be denoted by P(x, y) 964 (5) and Pn(x, y) instead of PI (x, y) and Pn(x, y), respectively. Let .r = (X n , PxJ be a conservative Markov chain. For a subset A of the state space S, O"A = min{n 11 XnEA} (min 0 = +00) is called the hitting time for the set A. If Px!O"y < 00 »0 (= 0), we write x--> y (X-f> y). Then x-->y is equivalent to the existence of n  1 with Pn(x, y) >0. When x-->y and y-->x, we write x.....y. The set of all x for which there exists y # x with X--> Y and y-f>X is denoted by F and is called the dissipative part of S. For elements of S - F, the relation..... is an equivalence relation. Each equivalence class E, is called an ergodic class. When F = 0 and S consists of a single class, the chain .r is called irreducible (or ergodic). For an ergodic class E, the greatest common divisor d of {n> IIPn(x,x»O} for XEE does not depend on the choice of XE E and is called the period of the class E. A class with period 1 is called aperiodic. Set Gn={YEEIPkd+n(x,y» o for some kO} (n= 1,2, ...,d) for a fixed XEE. Then we have a decomposition of E: E = U G n , G n n G m = 0 (n # m), and LzEG"+ 1 P(y, z) = 1 (yE G n ). Each G n is called a cyclic part. The decomposition may depend on the choice of x but is unique up to ordering. The point x is called recurrent or nonrecur- rent (transient) according as Px!O"x< 00)= lor <1. A necessary and sufficient condition for x to be recurrent is Lo Pn(x, x) = 00. The prob- ability Px (X n == X occurs infinitely often) is 1 or o according as x is recurrent or nonrecurrent. A chain is called recurrent or nonrecurrent ac- cording as every point in S is recurrent or nonrecurrent. A recurrent point x is called positive recurrent or null recurrent according as m x == Ex!O"x) is finite or infinite. Let E be an ergodic class. If there exists a positive recur- rent element in E, then all elements in E are also positive recurrent, and the class E is called positive recurrent. We can define null recur- rence or non recurrence of an ergodic class similarly. For a finite state space, every Mar- kov chain has at least one ergodic class, and all ergodic classes are positive recurrent. C. Limit Theorems for Markov Chains, and Recurrent Events The properties of a recurrent chain are re- duced to those of an irreducible recurrent chain, since a recurrent chain is decomposed into ergodic classes. We assume that .r = (X n , Px) is an irreducible recurrent chain. Concerning the transition function we have the following limit theorems: Let d be the period of.r and S = UI G r be the decom- position into cyclic parts, If x E G d , then 
965 limnoo Pnd+r(x,y) = dm}-I (YEG r ), =0 (yrtGr)' These results are sometimes referred to as the basic limit theorem of Pn(x, y). For every x, YES, Iimnoo N- I LI Pn(x, y)=m;l. Further- more, rx,y = limNoo(L:=1 Pn(y, Y)/L1 Pn(x, x)) exists and is finite, Then rx,y is an invariant measure as a function of y, and every invariant measure is a constant multiple of it. The chain is positive recurrent if and only if Lyrx,y < 00, and then we have rx,y = mxm;l. Consequently there exists a real invariant measure Jix (real solution of (5) such that 0 < LXES Iflxl < 00 if and only if the chain is positive recurrent, and in this case flx is a constant multiple of m;1 . Let fl be an invariant measure, and set I(f) = LxESflJ(X), For I, 9 such that I(lfl)< 00, O<I(lgl)< 00, and I(g)#O, we have Px(l (JI f(Xn>/.tl g(X n ») = I(f)/I(9))= 1. The tlaw of large numbers, the tcentrallimit theorem, and the tlaw of the iterated logarithm hold for the asymptotic behavior of the sum L=J(Xn) as N-->oo. Let Iff = {EI' E 2 , ... } be a sequence of events on a probability space (Q, @, P), and let Un = P( En). The time intervals {'k} between the successive occurrences of t&' are called the recurrence time of Iff. More precisely, the {Ok} are successively defined by 'I =inf{n liEn occurs}, 'k=inf{n'l + "'+'k-l + liEn occurS}-('1 +"'+'n-d(k2). To avoid complications we assume here that all the 'k are finite talmost surely. The sequence Iff is called a (persistent) recurrent event if the, k (k  1) are mutually tindependent trandom variables with a common tdistribution Un}' More generally, if the distribution of, I is allowed to be different from Un}, Iff is called a delayed recurrent event. The basic relation on a recurrent event is the renewal equation Un = b n + ulfn-I + u 2 fn-z +... + un-If" where {b n } is the distribution of, I' Suppose that the greatest common divisor of {n  11 fn >O} is d. Then we obtain lim U nd + r = dfJr/ fl, r = 1, 2, ... d, noo where fl = L nfn is the mean recurrence time and fJr=Lnobnd+r' This fact is known as the renewal theorem. Let .r = (X n , Px) be an irreduc- ible recurrent Markov chain. For fixed x and y, consider the events En = {X n = y} under the measure Px. Then, Iff == {E l , E2' ... } is a delayed recurrent event with U n = Pn(x, y), fn = PiO"y= n), and b n = PAO"y= n). Applying (6), we obtain the basic limit theorem of Pn(x, y) stated at the beginning of this section. We next consider the number N n of occurrences of a recurrent event 260 D Markov Chains Iff up to time n, The limiting behavior of N n has been extensively studied ( 250 Limit Theorems in Probability Theory D). D. Potential Theory for Markov Chains For a given Markov chain, 00 G(x,y)= L Pn(x,y) n=O (6) is well defined, admitting possibly the value 00. If G(x, y) is not identically 00, we can define a (generalized) tpotential with kernel G(x, y) ( 45 Brownian Motion; 261 Markov Processes; 338 Potential Theory). For a real function <p over S, the function G<p(x) = LyES G(x, y) <p(y) is called the potential with charge <p if the sum exists. Even if G<p does not exist, the infinite sum L;:;"=o Pn<P may exist. In this case this sum is also called the potential with charge <po A real function f (-00 <f.s; +00) over S is defined to be superharmonic (or super- regular) if Pf.s;f Here P is the operator asso- ciated with the kernel P(x, y), that is, Pf(x) = LyESP(X, y)f(y). Furthermore, if fO and f Pj; f is called excessive, and if -00 <f < +00 and f = PI, f is called harmonic. If Pf.s;f at a point x, f is called superharmonic at x, etc. A potential with charge <p  0 is always excessive. For an irreducible recurrent chain, every nonnegative superharmonic function is constant. (1) For a nonrecurrent chain we can consider the potential f = G<p for every function <p with finite support. G satisfies (P-I)G= -I and limnoo PnG = 0, where I is the unit matrix. Consequently, the operator P - I corresponds to the tLaplacian  of tNewtonian potential theory, and the equation (P-I)f =0 corre- sponds to the Laplace equation f = O. If the limit w = limnoo Pnf exists for a function f on S, f can be expressed uniquely as the sum of the potential G<p (<p = f - PI) and a harmonic function W. This decomposition is called the Riesz decomposition, following the terminology used in Newtonian potential theory ( 338 Potential Theory). Let E be a finite subset of S and 0"1 =min{n 01 XnEE}. Then f(x) = Px!O"I < 00) is a (unique) potential which is harmonic at XEE' and takes the value 1 on E. This is called the equilibrium potential of the set E, and its total charge C(E) = LXES(f(X)-Pf(x)) is called the capacity of E. The tmaximum principle and the tbalayage principle ( 338 Potential Theory) are valid in this potential theory. (2) For a recurrent chain, we cannot define the potential kernel as in (1), since G(x, x) == 00. However, we can define a kernel analogous 
260E Markov Chains to the ease of the t10garithmic potential. We assume that the chain is irreducible. When the limit A(x, y)= lim (Gn(x, X)flyfl;1 - Gn(x, y)), noo n Gn(x,y)= L Pk(x,y), k=O exists for an invariant measure fl, the chain is called (right) normal. A chain is normal if and only if icd.)=lim L Pn(x,Z)Pz(O"EE') n........oo ZES exists. If }'E(') exists, it is independent of x. Every positive recurrent chain is normal. For a nor- mal chain we obtain the folIo wing results. Let <P be a function with finite support. The poten- tial f of <P exists if and only if Lxesflx<P(X) = 0, and f = - A <P in this case. A <P satisfies the conditions (P-l)A<p=<p and IimnooPnA<p = O. A function .f is a bo unded potential of a function with finite support E if and only if f is bounded and harmonic in E' and satisfies LxAE(x)f(x) =0. An irreducible recurrent chain is not necessarily normal. For every such chain, however, there exists a kernel W(x, y) such that (P -I) W<p = - <p for any function <p with finite support and nulI charge (i.e., LXEsflx<P(X)=O). Such a kernel Wis called a weak potential kernel. E. Random Walks Consider a random walk defined on the set S of alIlattice points in a d-dimensional Eucli- dean space. Let S + = { x 10 --> x }, S == { z I z = x - y, x, yE S+}. F. Spitzer obtained the folio\\!- ing results for the random walk with S=S. The random walk is recurrent if the following conditions are satisfied: (i) d= 1, LlxIP(O,x)< 00 (Ixl is the distance between 0 and x), and m== LXP(O, x) == 0; (ii) d =2, m=O, and 0"2 == LlXl 2 P(O,x)< 00. When d:;;,3, the random walk is always nonrecurrent. The measure flx == 1 is invariant whether the random walk is recurrent or not. Every recurrent random walk is right normal, and the potential kernel A satisfies (P -I)A = I. Several interesting results are known on the uniqueness of the kernel A satisfying (P - I)A = I [7]. For the case m=O, there are a number of results similar to those for tBrownian motion, including: (i) When d= 1 and 0<0"< 00, Iim Po(maxOk"'n IXkl.s; O" x)= 1- F(x- 2 ), noo x>O, 966 where 4 00 (-ll ( n2 2 ) F(x)= 1-- L -exp --(2k+ 1) x ; n k=02k+ 1 8 (ii) The arc sine law: Let Tn be the number of k.s;n for which X k becomes >0. When d= 1 and 0< 0" < 00, 2 . r:: Iim P o (7;,.s;nx)=-arcslll y x, n-ex; n O.s;x.s;l; (iii) The Wiener test: The set E is called re- current if P'(O"E< 00)= 1 holds for every x. When d = 3 and 0" < 00, a set E is recurrent if and only if LI C(En)2 -n = 00, where En = En {xl2 n .s;lxl < 2n+I}. F. Markov Chains with Continuous Time Parameter Suppose that the transition probability p,(x,y) is measurable in t, Then p,(x,y) is uni- formly continuous in a complement of any neighborhood of t = 0, and there exists mx,y = limtJoo p,(x, y) for which m x y = Lzesmx,zp,(z, y) = Lzesp,(x,z)m z . y ' If p,(x,y) =0 for all XES and t > 0, y is called a fictitious state. Let F be the set of all fictitious states of S. Then the restric- tion of p,(x, y) to S - F gives a transition prob- ability on S - F. If F = 0 and the family p = {p,(-,y)lt>O,YES} separates points of S, the transition probability p,(x,y) is calIed stan- dard. Then p,(x, y) is standard if and only if Jim P ( X ,, ) = b . When F = 0 and p does qo t,..r X,Y not separate, points of S can be reduced to the standard case by a suitable identification of states. We assume that p,(x, y) is standard. Then lim'Jot-l(p,(x,y)-bx,y)=qx,y exists and satisfies O.s; qx.y < 00 if x # y. The matrix Q = (qx,y) over S is called the Q-matrix of the tran- sition matrix p,=(p,(x,y)), and we write Q= p. Set qx = -qx,x (:;;,0). We call x a stable state if q T < 00 and an instantaneous state if qx = 00. If qx < 00, p;(x, y) (the derivative with respect to t) exists and is continuous in t > O. When every point of S is stable, n(x, y) de- fined by ( ) { qx,yq;1 n x, y = 0 if x#y if x=y satisfies O.s;n(x,y).s; 1, n(x, x)=O, Ln.s;I. m YES From the tKolmogorov-Chapman equation we can formalIy derive Kolmogorov's backward equation =-+n  ZES 
967 and its dual, Kolomogorov's forward equation p;(X,y)= -qyP(X,y)+ L p,(x,z)qzn(z,y). (9) ZES Strictly speaking, these equations hold only under suitable conditions: For in- stance, a conservative transition probability p,(x, y) satisfies (8) if and only if LYESn(X, y) == 1. If there exists x >0 (XE S) such that LXESXP,(x,y).s;y (\It >0), then P,(X,y) satisfies (9) if and only ifLYEsyqyn(y,x)=x' Con- versely, given O.s; qx < 00 and n satisfying (7), there exist in general many solutions of (8) and (9) with the initial condition lim'jo p,(x, y) == (\,y' Among them, the minimal solution p?(x, y) exists. The chain with p?(x, y) as its transition probability is called the minimal chain associated with {q, n}. For the path of a Markov chain with stan- dard transition probability, there exists a separable measurable tmodification X" but in general there exists neither a right continuous modification nor a modification having the tstrong Markov property for all tstopping times. (For detailed properties of paths  [8,9].) Set '0 =0, 'n =inf{ 1 >nn-II X, #X,"_J (n? 1), and '00 = limnoo Tn' If each point x of S is stable, T I is subject to the exponential distri- bution PA'I >t)=e- qx ', so that Ex('d=q;" PAX'1 = y) = n(x, y), and 'I and X, are indepen- dent with respect to the Px-measre. Further- more p?(x, y) = PAX, = y, '00 > t) is the minimal solution. For the minimal chain there exists a right continuous modification with left-hand limits, which has the strong Markov property. For a finite Markov chain with a standard transition probability, all states are stable and both (8) and (9) are fulfilled. Furthermore, the transition probability satisying (8) and (9) is the unique minimal solution. D. Williams [10] obtained the following remarkable result concerning Markov chains having only instantaneous states. Let Q be a matrix on S with - q = 00 ( \Ix ) O'< q < 00 x,x , -.....::::: x,y (\Ix, y; x # y). Then Q is the Q-matrix of a tran- sition matrix if and only if the following two conditions are satisfied: ( i ) L min (q q ) Z¥=-X,y X,Z, y,z < 00 (\lx,y; x # y), (ii) for some infinite subset A of S, LYEA" {x} qx,y < 00 (\Ix). Williams [II] also studied a counterpart of Kolmogorov's back- ward equation for the instantaneous case. G. Birth and Death Processes A Markov chain .r with state space S = {O, 1, 2, '" } is called a birth and death process if its Q-matrix Q = (qn,m) satisfies the following conditions: O.s; qo = qO,1 < 00, 0 < qn = qn,n-I + qn,n+l < 00 (n? 1) and qn,m =0 (m #n+ 1, n- 260H Markov Chains 1), where qn= -qn,n as before. Usually we write An for qn,n+1 and fln for qn,n-I' The para- meters An and fln are called the infinitesimal birth and death rates, respectively. In partic- ular, the chain .r is called a birth process if fln == o and a death process if An = 0, The birth and death process satisfying An> 0 and fln > 0 for n? 1 has a number of properties similar to those of a I-dimensional t diffusion process. For the rest of this section we assume that },o =0 (i.e., 0 is a ttrap) and An >0, fln >0 (n? 1). The sequence x n , called the natural scale is defined as follows: XI = fl1 1 , Xl = fll 1 + ..1.'1 1 , X n = fll l + All +... +(fl2'" fln-d(A I ... An-d- I (n? 3), Xoo = limnoo xn' The measure m with the mass mn=AI'" A n -dfl2'" flnf l (n? 2) and = 1 (n = 1) at the point x n is called the canon- ical measure. Then P,(Xi, xd = p,(i, k)m k - l is a transition probability on E = {XI' X 2 , ...} and f(x i , t) = p,(x;, x k ) satisfies a differential- difference equation af/at=Dmr, (10) which is equivalent to (8). Here,f+(x n )= (f(Xn+d- f(xn))(Xn+I-Xnfl and Dmg(x n ) = (g(xn)-g(xn-d)m;l. The operator f -->Dm' f+ is similar to FelIer's expression for the infini- tesimal generators of I-dimensional diffusion processes. Every birth and death process is obtained from tBrownian motion by ttime change. Furthermore, Xoo is regarded as a boundary point and is classified as a natural, exit, en- trance, or regular boundary point. Every birth and death process is determined by (10) and the boundary condition at Xoo ( 115 Diffu- sion Processes). H. Markov Chains in Queuing Theory A queue is formed when customers arrive at random times at some facility and request service of some kind. Such phenomenon often arises in service systems. The queuing model generally embodies the following mathemat- ical structure. Suppose that the nth customer arrives at time Tn' waits for a time interval W n until the beginning of his service, and departs after a service time Vn- Usually the models we consider are specified by the following assump- tions. The interarrival times Un = 'n+1 - Tn (n? 1) are mutually tindependent trandom variables distributed according to a common tdistribution function F(x) with tmean ..1. -I. Similarly, {v n } is a sequence of independent random variables with a common distribution function G(x) with mean fl-I. The sequences {un} and {v n } are mutually independent. The queuing discipline is "first come, first served." 
260H Markov Chains In case of many servers, this means that the first customer waiting in the queue is served as soon as anyone of the servers becomes free. Besides this standard queuing model, various other queuing models have been studied in different fields of application. There is a huge literature on queuing theory, Le., the mathe- matical analysis of queuing models. A com- pendium of results obtained by around the 1960s, together with an extensive bibliography on this theory, is contained in [15]. Some recent aspects of queuing theory are developed in [17]. Here, we are concerned only with some simple queuing models and their asso- ciated Markov chains. There is a standardized notation, introduced by D. G. Kendall [14J, for identifying stan- dard queuing models. In Kendall's notation A/ B/s, s represents the number of servers, while A and B indicate the types of distri- butions F(x) and G(x), respectively. Distri- butions frequently used in the first two places of Kendall's notation are the following: M = an texponential distribution, D == a tunit distri- bution, Ek = a tgamma distribution of order k (called a k-Erlang distribution in queuing theory), G(or GI) = a general distribution, and so on. The model M/I is often said to have Poisson input, for the number A, of arrivals during a time interval (0, tJ forms a tPoisson process. One of the important problems in queuing theory is to analyze the number Q, of cus- tomers waiting or being served at time t. The process Q, is called the queue length. If s = 00, then Q, indicates the number of busy servers at time t. The simplest and most extensively studied queuing model is M/M/s. In this case, the process Q, becomes a birth and death process with Aj=A, Jij==jfl (1.s;j.s;s), and =Sfl (j > s). Kolmogorov's forward equation is given by p;(i, 0) = - Ap,(i, 0) + flP,(i, 1), p;(i,j) = Ap,(i,j - 1) - (..1. + jfl)P,(i,j) +(j+ l)flP,(i,j+ 1), l.s;j.s;s-l, p;(i,j) = Ap,(i,j -1) -(..1. + Sfl)P,(i,j) + SflP,(i,j + 1), jS. This equation can be solved explicitly by the method of tgenerating functions. The limit distribution lim Hoo p,(i,j) = Pj, independent of i, exists if and only if Sfl > A. In particular, { (1-P)pj if s= I, Pj= e-Ppj/j! if s=oo, where p = A/fl. (In the single-server case, p is called the traffic intensity.) Unless the queuing model is M/M/s, Q, is no longer a Markov chain. In some special cases, such as Ek/ M /1, 968 the properties of Q, can be reduced to those of some birth and death process. The analysis of Q, itself is difficult in general. However, if either F(x) or G(x) is of exponen- tial type, the method of embedded ( = imbedded) Markov chains is useful. For example, in the system M/G/I we examine the queue length only at times to = 0, t l , t 2 , ..., where t n is the departure time of the nth customer. This em- bedded process X n = Q" becomes a Markov chain on S = {O, 1,2,...}. For practical pur- poses we could content ourselves with results concerning {X n} instead of the original process Q,. The transition probability P(i,j) of {X n } is given by P(O,j)=k j , P(i,j)=kj_i+I' i 1, (11) where _ {f rO e-Ax (A)j dG(x) k j - 0 J. kj=O if jO, if j<O. This chain is irreducible and aperiodic. More- over, it is transient, null recurrent, or positive recurrent according as the traffic intensity p = ..1./ Ji > 1, = 1, or < 1. In the last case, the generating function P(s) of the limit distri- bution {P;} is given by P ( s ) _ (1- p)(J -s)K(s) - K(s)-s ' where K(s) is the generating function of {k j }. Similarly, if the system is GI/M/I, the embed- ded process X n = Q,,_ is the Markov chain with transition probability P(i, 0) = ex i , P(i,j) = li- j+1, j  1, where {f rO e-x (fl)j dF(x) lj = 0 j! lj=O if jO, if j<O, and ex i = Lj>i lj' The limiting behavior of this chain is analogous to that of the preceed- ing chain (11). In the positive recurrent case (p< 1), the limit distribution is given by Pj= (1 - ()(j, where (is the solution of L(s) = sin (0,1) for the generating function L(s) of {IJ These results are extended to the cases with many servers. For the general single-server queuing model GI/G/l, when we consider the waiting times {w n } instead of Q" we obtain the recurrence relation W n +1 = max {O, W n + V n - un}' This im- plies that {W n } is a Markov process with dis- crete parameter, over S = [0, 00) an uncount- able state space, having the structure of a Markov chain with transition probability (11), Several methods are exploited for the analysis of {w n } [3, 16, 17]. 
969 I. Boundary Value Problems for Markov Chains To discuss the behavior of the path of a Mar- kov chain beyond the time roo, we have to introduce a suitable boundary of the state space S. Among several conceivable bound- aries, the Martin boundary, which is most frequently utilized, is explained later in this section. The name comes from its similarity to the tMartin boundary in the theory of har- monic functions. Most results stated in this section are also valid for the discrete time parameter case, Let ;(=(X"P x ) be the minimal, nonrecur- rent chain associated with {q, n} for which roo equals the lifetime (. Harmonic functions, etc., are defined as in the discrete time parameter case (with p replaced by n). Let y == y( x) be a measure such that 0 < yG(y) == Lx y(x)G(x, y) < 00, where G(x,y)= S;,;, p,(x,y)dt. Let PI be the metric in the one-point compactification of the state space S equipped with the discrete topol- ogy, Set K (x, y) = G(x, y)jyG(y), and define , _  1 IK(xn,y)-K(xn,y')1 P2(y,y)-L. 2 n 1 I K K( ' )1 ' n=1 + (xn,y)- xn,y {xn}=S. The set as of all points adjoined to S by the completion of S relative to P = PI + P2 is called the Martin boundary of S, M = S U as is a compact separable space. By the definition of P, K(x, )= Iimy{K(x, y) exists, is continuous in , and is superharmonic in x. A nonnegative superharmonic function u is called minimal if every superharmonic function v such that O.s;v.s;u is a constant multiple of u. The set aS I = {  I K (. , ) is minimal harmonic}, caIled the essential part of as, is an tFa-set. Then every y- integrable nonnegative superharmonic func- tion u is represented by u(x)= S K(x, )f1(d) by means of a unique measure f1 on M I =SU as l , caned the canonical measure of u. In particular, if u is harmonic, f1 is concentrated on as. A number of representation problems in analysis, such as the Hausdorff tmoment problem, can be considered to be representation prob- lems of suitable Markov chains. Let u be a y- integrable nonnegative superharmonic func- tion and (X" P;) the Markov chain (called the u-chain) having p(x, y) = u(x) -I u(y)p,(x, y) (OjO=O) as its transition probability, Then X_ = Iim,)(X, exists and P;(X",_EB)=u(X)-1 I K(x,)f1(d), where f1 is the canonical measure of u. A meaSure v on S is called a super harmonic measure (harmonic measure) if vq (n - I) .s; 0 (= 0), that is, LyES vyqy(nyx - Dyx).s;O ( = 0). Fix a 260 J Markov Chains function g  0 such that 0 < Gg < 00 ( Section D), and set K*(x, y) = G(x, y)Gg(xr l . Define the metric pi similar to P2, using the function family {K*(' ,y)} (YES). The set adjoined to S by the completion relative to p* = PI + pi is called the dual Martin boundary. Extend K* to S U as* and de no te by aSt the set of aliI] E as* such that K*(I],') is a minimal superharmonic meaSUre. Then every superharmonic measure v with S v(dx)g(x) < 00 is represented uniquely as v = S f1(dl])K*(I],') in terms of a measure f1 on suaSr Let  E M I , and denote the K (. , )-chain by (X" P:'), Then P;«( < 00)=0 or =1. We call  a passive boundary point in the first case and an exit boundary point in the second. On the other hand, denote by (X" p:,n) the chain having p*,n(x, y) = K*(I], xr l K*(I], y)p,(y, x) as its transition probability. Then p:,n«( < 00) =0 or =1. We callI] a dual passive boundary point in the first case and an entrance boundary point in the second. Fix {q, n}, and let ;(=(X" Px) be the minimal chain associated with {q, n}. Let (8S)ex and (as*)en be the sets of exit and entrance bound- ary points, respectively, If Px(X{- E( as)ex) == 0, (8) has no solution other than the minimal solution. But if Px(X,- E(aS)ex» 0 for some XES, (8) has infinitely many solutions. Further- more, if (as*)en # 0, we can obtain infini:e1y many solutions satisfying (8) and (9). There are many open problems in this connection. J. Concluding Remarks (1) Literature and some historical remarks. Besides the monographs on countable Markov chains, such as [6, 8J, many of the standard textbooks on stochastic processes contain chapters on Markov chains (e.g., [1-4J). Among others, W. FeIler [IJ gave an elemen- tary and elegant treatment of the basic theory of Markov chains with discrete parameter, which the bulk of the recent textbooks foIlow. The terminology for the classification of states we use here fol1ows the first edition of [1]. Since the second edition, Feller has used the terms "persistent" and "ergodic" instead of "recurrent" and "positive recurrent," respec- tively. S. Karlin's book [3J contains some excellent chapters on applications of Markov chains. The potential theory of nonrecurrent Mar- kov chains is essential1y contained in that of general nonrecurrent Markov processes due to G. A. Hunt [5]. The potential theory of recurrent chains was developed by J. G. Kemeny and J. L. Snell (J. Math. Anal. Appl., 3 (1961)). A greater part of Kemeny, Snell, and 
260 Ref. Markov Chains A. W. Knapp [6] is devoted to the potential- theoretic aspects of Markov chains, including boundary theory. F. Spitzer's book [7] is a definitive work on the theory and application of random walks. The proof of normality of recurrent random walk is the highlight of the book. P. Levy's article [9] is a pioneering work on Markov chains with continuous parameter involving instantaneous states. Feller and H. p, McKean first gave an exam- ple of Markov chains having only instantane- ous states (Proc. Nat. Acad. Sci. US, 42 (1956»). K. L. Chung's monograph [8] is the only book devoted to the foundation of the theory of Markov chains with continuous parameter. The exposition of birth and death processes in this article follows Feller [12]. (For details of birth and death processes  [3].) The first systematic treatment of queuing theory from the point of view of stochastic processes is due to D. G. Kendall [13, 14], and this has greatly influenced subsequent work in this field. As the standard textbooks of queuing theory we mention N, U. Prabhu [16] and ch. 14 of Karlin [3]. The Martin boundary theory of nonrecurrent Markov chains was developed by J. L. Doob [18], Hunt [19], and T. Watanabe [20]. There are many papers on the extension of the boundary theory to more general Markov processes (e.g., H. Kunita and T. Watanabe, Illinois J. Math., 9 (1965)). The exposition in this article was taken from Kunita and Watanabe (Sugaku, 13 (1961); 14 (1962)). Feller (Trans. Amer. Math. Soc., 83 (1956)) was the first to introduce the notion of an tideal boundary of Markov chains. He also studied the tboundary conditions for Kolmogorov's equations (Ann. Math., (2) 65 (1957). (2) General Markov chains. A Markov pro- cess with discrete parameter is often called a Markov chain whether the state space is count- able or not. We here call such a process a general Markov chain. A considerable amount of the results on Markov chains-on potential theory, limit theorems, random walks, and so on-are extended to general Markov chains. Classical results are found in Doob [2]. Later, Sparre E. Andersen (Math. Scand" 1 (1953); 2 (1954)) initiated the fluctuation theory of 1- dimensional non lattice random walks. T. E. Harris carried out an important study on the existence of invariant measures of recurrent general Markov chains (Proc. 3rd Berkeley Symp. on Math. Stat. Prob., vol. 2, 1956). S. Orey (Pacific J. Math., 9 (1959)) extended the limit theorems of Markov chains to a class of recurrent general Markov chains (called Harris chains). A systematic treatment of gen- eral Markov chains is given in D. Revuz [21]. 970 References [1] W. Feller, An introduction to probability theory and its applications, I, Wiley, third edition, 1968. [2] J. L. Doob, Stochastic processes, Wiley, 1953. [3] S. Karlin, A first course in stochastic pro- cesses, Academic Press, 1966; second edition, with H. M. Taylor, Academic Press, 1975; Karlin and Taylor, A second course in sto- chastic processes, Academic Press, 1981. [4] L. Breiman, Probability, Addison-Wesley, 1968. [5] G, A. Hunt, Markov processes and poten- tials, Illinois J. Math., 1 (1957),44-93, 316- 369; 2 (1958),151-213. [6] J. G. Kemeny, J. L. Snell, and A. W. Knapp, Denumerable Markov chains, Van Nostrand, 1966. [7] F. Spitzer, Principles of random walk, Springer, second edition, 1976. [8] K. L. Chung, Markov chains with station- ary transition probabilities, Springer, 1960. [9] P. Levy, Systemes markoviens et station- aires, Cas denombrable, Ann. Sci. Ecole Norm, Sup., (3) 68 (1951), 327-381. [10] D. Williams, The Q-matrix problem for Markov chains, Bull. Amer. Math. Soc., 81 (1975),1115-1118. [11] D. Williams, The Q-matrix problem 2: Kolmogorov backward equations, Lecture notes in math. 511, Springer, 1976,505-519. [12] W. Feller, The birth and death processes as diffusion processes, J. Math. Pures Appl., (9) 38 (1959), 301-345. [13] D. G. Kendall, Some problems in the theory of queues, J. Roy. Stat. Soc., (B) 13 (1951),151-185. [14] D. G. Kendall, Stochastic processes occurring in the theory of queues and their analysis by the method of the imbedded Mar- kov chain, Ann. Math. Statist, 24 (1954), 338- 354. [15] T. L. Saaty, Elements of queuing theory with applications, McGraw-Hill, 1961. [16] N. U. Prabhu, Queues and inventories, Wiley, 1965. [17] A. A. Borovkov, Stochastic processes in queuing theory, Springer, 1976. (Original in Russian, 1972.) [18] 1. L. Doob, Discrete potential theory and boundaries, J. Math. Mech., 8 (1959). [19J G. A, Hunt, Markov chains and Martin boundaries, Illinois J. Math., 4 (1960),313- 340. [20] T. Watanabe, On the theory of Martin boundaries induced by countable Markov processes, Mem. ColI. Sci, Univ, Kyoto, 33 (1960-1961),39-108. 
971 [21] D. Revuz, Markov chains, North- Holland and Elsevier, 1975. 261 (XVII.5) Markov Processes A. General Remarks Let {X'}'ET be a tstochastic process defined on the tprobability space (Q, !B, P). The tstate space S of X, is the set of real numbers R or N- dimensional Euclidian space R N . In general, S may be a tlocally compact Hausdorff space satisfying the second countability axiom. Tis an interval [D, 00) or a set {O, 1,2,...}. (T may also be any interval in the real line or a set { ..., - 2,1,0, 1,2, ... }.) We call this {X'}'ET a Markov process if, for any choice of points s I < S2 < ." < Sn < t in T, the tconditional prob- ability distribution of X, relative to X s1 , X S2 ' ''', X" is equal to the conditional probability distribution of X, relative to X s ,' Namely, for AE!B(S) and XI' ...,XnES, (1) P(X,EA IX s, =X" ""Xs,=x n ) =P(X,EAIXs,=x n ), where !B(S) is the least to"-algebra that contains all open sets of S. For a Markov process {X'}'ET, the distri- bution of Xo is called the initial distribution of {X,},ET' The conditional probability distri- bution PiX, E A I X, = x) is denoted by pis, x, t, A) and is called the transition probability of {X'}'ET' This is a function of s, tE T (s.s; t), XES, and AE!B(S) that has the following properties: (2) For fixed sand t, Pis, x, t, A) is a tprobabil- ity measure in A and is !B(S)-measurable III x. (3) P(s,x,s,A)= 1 if xEA, =0 if x1A. (4) The Chapman-Kolmogorov equality, P(s,x,u,A)= is P(s,X, t,dy)P(t,y,u, A) (s<t<u). In view of (1), the tfinite-dimensional distri- bution of a Markov process is completely determined by its initial distribution and its transition probability. Moreover, for a given function P(s, x, t, A) satisfying (2), (3), and (4), and for a given prob- ability distribution f1 on !B(S), there exists a 261 A Markov Processes Markov process {X,},eT with transition proba- bility P(s, x, t, A) and initial distribution Ji. In this sense, the transition probability is a char- acteristic quantity of the Markov process. If the transition probability depends only on the difference between sand t, that is, if there exists a function P(t, x, A) of t, x, and A such that P(s,x,t,A)=P(t-s,x,A), then the Mar- kov process is called temporally homogeneous. If S is an additive group and there exists a function P(s, t, A) of s, t, and A such that P(s,x,t,A)=P(s,t,A-x), where A-x={y- xIYEA}, then the Markov process is called spatially homogeneous. When S = R N , a spa- tially homogeneous Markov process is an tad- ditive process. A Markov process whose state space S is countable is called a tMarkov chain ( 260 Markov Chains). A Markov process whose sample path ( Section B) is continu- ous with probability 1 ( 407 Stochastic Pro- cesses) is called a tdiffusion process ( 115 Diffusion Processes). Consider the case T= [0, 00) and S = R N , and assume that the transi- tion probability has a tdensity pis, x, t, y) with respect to Lebesgue measure and satisfies certain analytic conditions that ensure the continuity of the sample path, etc. Then A. N. Kolmogorov proved that pis, x, t, y) satisfies the tFokker-Planck partial differential equa- tion, Conversely, he raised the problem of finding conditions for the existence and uniqueness of the transition probability satis- fying the given Fokker-Planck equation [1] ( 115 Diffusion Processes). W. Feller ex- tended this equation to an tintegrodifferen- tial equation of a certain type and solved the problem partially by classical analytic methods [2]. On the other hand, S. N. Bern- stein [3] and P. Levy [4] made probabilistic approaches to the problem, and K. It6 con- structed Markov processes directly by solv- ing the corresponding stochastic differential equations [5] ( 406 Stochastic Differential Equations). A profound study of the structure of Brown- ian motion and the additive process by P. Levy [4, 6J, the theory of tmartingales initiated by J. L. Doob [7], and the tstochastic calculus due to K. 1t6 [5J have served as useful appara- tus in analyzing the structure of Markov pro- cesses probabilistically. The theory of Markov processes has also retained a close relationship to various aspects of mathematical analysis- especially to functional analysis and tpotential theory. The link lies in an obvious fact that, by virtue of the formula (5) T,f(x) = is pit, x, dy)f(y), the transition probability pit, X, A) of a (tem- 
261 B Markov Processes porarily homogeneous) Markov process in- duces a semi group {1;, t  O} of linear positive operators on a function space, e.g., on the space B(S) of all bounded measurable func- tions on S. The study of diffusion processes has played important roles in the theory of Markov pro- cesses. When S = R I, the structure of diffu- sion processes was completely clarified by W. Feller [8J, E. B. Dynkin [9J, K. !to and H. P. McKean [IOJ, and others ( 115 Diffusion Processes). In succession, G. A. Hunt [IlJ and E. B. Dynkin [9J isolated fairly general and practically useful concepts in Markov pro- cesses, including the right continuity of sample paths and the strong Markov property. Based on these properties, probabilistic potential theory and the theory of additive functionals were developed. We now present a formulation for the tem- porally homogeneous Markov process with continuous time parameter [9,11-14]. B. Fundamental Notions Adjoin a point 0 to S as a point at infinity when S is noncom pact and as an isolated point when S is compact, set S =su {a}, and let !B(S) be the O"-algebra that consists of all the Borel sets in S. Let W be the set of all right continuous functions w(t) whose tdiscontinu- ities are at most of the first kind and such that w(t) = 0 for t  s if w(s) = a. By convention, we set w(oo)=o. Let «w) be the minimum of the t-values such that w(t)=a. For WEW, w,- and w,+ in Ware defined as w,- (s) = w(min(t, s)) and w,+ (s) = w(s + t), respectively. Let W be a subset of W that is closed under the operation w--> w,+ and w--> w,- , and let !B =!B( W) be the 0"- algebra generated by the sets {wEWlw(t)EA} (AE!B(S)). We often write X,(w) for w(t). The subclass of!B that consists of sets represented by {wEWlw,- EB} (BE !B) is denoted by !B,. Suppose that the family of probability mea- sures {Px} (x E S) on (W, B) satisfies the follow- ing conditions: (6) For a fixed B in !B, Px(B) is !B(S)- measurable in x. (7) PAXo(w)=x)= 1 for XES. (8) The Markov property PAw,+ E B I !B,) = px,(w)(B) holds with Px-measure 1 for BE!B. Then the triple \JJl=(X" W, Px I XES) is called a Markov process. This is a mathematical model for the random motion of a particle moving in S whose tprobability law is inde- pendent of its past history once the present position of the particle is known ( 260 Mar- 972 kov Chains). We call S the state space of \JJl, W the path space, and an element w in W a path, respectively. Px represents the probability law of a particle that starts from x. In view of the interpretation that the particle vanishes if it reaches a, we call (w) the lifetime (or terminal time) and a the terminal point. Now, for any Sl <S2 <... <sn and t, PAXs,+,EAIX s " ...,XsJ == PAXs,+,EA I XsJ== Px,,(X,EA) holds with Px-measure 1 for any x. This equal- ity means that {X,},;,o is a Markov process on S in the sense mentioned before with initial distribution (\ ( a unit measure at x) and the transition probability given by P(t, x, A) = PA X, E A). The restriction of this transition probability to (t, X, A)E [0,00) x S x !B(S) satis- fies the following properties: (2') For a fixed tO, P(t, x, A) is !B(S)- measurable in x and is a measure in A E !B(S) with P(t, x, S).s; 1. (3') P(O,x,A)=bAA), XES, AE!B(S). (4') P(Ht,x,A)= 1 P(s, x, dy)P(t, y, A). (9) lim'jo 1;f(x) = f(x), XES, holds for any bounded continuous function f on S, where 1;f is defined by formula (5). {1;} is called the semigroup of the Markov process \JJl because it enjoys the property 7;+, = I; 1; on B(S). By convention, any function f E B(S) is extended to a function on S by setting f( a) == o. (9) then follows from the expression 1;f(x) = EAf(X,)), where EA ) represents the integral with respect to Px- A Markov process \JJl is called conservative if P(t, x, S) = 1 holds for every X in S and every t. A point X in S is called a recurrent point if for every neighborhood U of x and every t > 0, the path starting from x returns to U after time t with probability 1. Under some conditions, x is recurrent if and only if S;7 p(t, x, U)dt = 00 for any neighborhood U of x. \JJl is called recur- rent if all points in S are recurrent. Otherwise, it is called transient. \JJl is of course conserva- tive if it is recurrent. There are other instances in which {X,} is called recurrent, if a particle starting from any point in S reaches any neigh- borhood of any other point in S in finite time with probability 1. In this case, under suitable conditions for regularity, the tmixing property holds, whence follows the tergodic property ( 136 Ergodic Theory). If the transition probability of {X,} has an tinvariant mea- sure with total mass 1, tBirkhoff's individual ergodic theorem holds [7]. Let {3',} be an increasing family of 0"- subalgebras of!B( W) such that X, is 3',- 
973 measurable for each t  O. A [0, 00 )-valued function 0" on W is said to be an ({ts',}-)Markov time (or stopping time) if {wi O"(w).s;t} E3'" tO. The O"-algebra 3'a is then defined by 3'a={BE!BIBn {O".s;t}E3', for any tO}. If there exists a family {3',} as above and if for every {3',}-Markov time 0" and for any XES, s>O, and AE!B(S), (10) PAXa+sEA l3'a)=PdXsEA) holds with Px-measure 1, then it is said that \JJl has the strong Markov property with respect to {ts',}, and such an \JJl is called a strong Markov process. Since a constant time is also a Mar- kov time, the strong Markov property imposes stronger conditions than the Markov prop- erty. A sufficient condition for a Markov process \JJl to have the strong Markov prop- erty is that the space C(S) of all bounded continuous functions on S. be invariant under the semigroup of\JJl. In this case, we can take 6, = np, !B s . It is convenient to enlarge the algebra !B, as follows. Let fl be a measure on (5, !B(5)), and write !B(S)# for the completion of !B(S) by fl. Write !B(S) for the intersection of all (!B(S))#, where fl runs over all probability measures on !B(5). Let P# be the measure on (W,!B) defined by P#(i\) = Is fl(dx)PAi\); and let ,= n#(!B,)# and  = n"(!B)#, where (!B)" is the completion of!B by p" and (!B,)" is the O"-algebra obtained by adjoining to !B, all P"-null sets in (!B)w A Markov process 911 is called a Hunt process if 1!3, = ns>' I!3 s for any t  0, \JJl has the strong Markov property with respect to {1!3,}, and \JJl has left quasicontinuity in the following sense: If {O"n} is any increasing sequence of {,}- Markov times and 0" = lim O"n, then for 0" < 00, Iim Xa, = Xa holds except for a set of P x - measure 0 for any XES. For a set A of S, let O"A = inf{t I t >0, X,E A} if such a t exists, = 00 if x,f/:A for all t >0. Then O"A is called the hitting time for A. (Some- times the condition t > 0 in the definition of O"A is replaced by tO.) If\JJl is a Hunt process, then O"A is a {,}-Markov time for any Borel set A [14,15]. For a subset A, r A =O"A' is called an exit time from A. If \JJl is a strong Markov process, the exit time ra from a point a is sub- ject to the exponential distribution Pa(ra>t)=e-A(a)" O.s;A(a).s; 00. In particular, a is called an instantaneous state if ..1. (a) = 00 and a trap if .ic(a)=O. For a Hunt process \JJl, PAB)=O or 1 if B is in !B o . This is called Blumenthal's zero-one law. A function P(t, X, A) of t  0, XE S, and A E B(S) is said to be a transition function on S 261 C Markov Processes if it satisfies (2'), (3'), and (4'). Denote by Coo (S) the set of those functions in C(S) vanishing at infinity. A transition function defines by (5) a linear operator on B(S) which satisfies the semigroup property T. T, = 7;.+,. If T, leaves the space Coo(S) invariant for every tO and if(9) holds for any f E Coo (S), then the transition function (resp. semigroup T,) is called a Feller transition function (resp. Feller semigroup). Any Feller transition function on S admits a Hunt process; namely, for a given Feller tran- sition function P(t, X, A), there exists a Hunt process jJR (on 5) whose transition probability coincides with P(t, X, A) for tO, XES, and A E !B(S). Such a process \JJl is often called a Feller process. For instance, any spatially homogeneous Markov process on R N is a Feller process. If a Feller transition function satisfies an additional condition that, for any compact set K and neighborhood G of K, t -I P(t, X, S - G)-->O, t-->O, uniformly on K, then the associated Hunt process \JJl is a diffusion, namely, Px (X, is continuous in t«)= 1, XES [2,7]. C. Generators of Markov Processes For a given Markov process \JJl, the generator of \JJl is defined by (11) <fJf(x)=limt-I(T,f(x)- f(x)), XES, tjO where {T,} is the semi group of \JJl. The choice of the domain :D(<fJ) of <fJ depends on the situation. When {T,} is a Feller semigroup, it is a strongly continuous semigroup on C",,(S). Let <fJ be its infinitesimal generator in the sense of Hille and Y osida ( 378 Semigroups of Operators and Evolution Equations). Then :D(<fJ) consists of those functions f E C",,(S) for which the convergence in the right-hand side of(ll) takes place uniformly on S. A necessary and sufficient condition for a linear operator on Coo (S) to be the infinitesimal generator of a Feller semigroup is known [8]. For a general strong Markov process 911, one way of defining :D(<fJ) is as follows [16, 17]. Denote by C(S) the set of all functions f E B(S) which are finely continuous in the sense that (12) PAf(X,) is right continuous in tO)== 1, XES. :D(<fJ) is then defined as the set of those func- tions f E C(S) for which the convergence in the right-hand side of (11) takes place boundedly in X with limit functions belonging to C(S). (Ij is called the generator of the strong Markov process \JJl. The following three conditions are 
261 D Markov Processes equivalent: (13) f E !:i(<fJ), <fJf = g. (14) f, gEC(S) and T,f(x)- f(x) = I T,g(x)ds, tO. (15) j',gEC(S)andf(X,)-f(X o )- I9(Xs)dS is a tmartingale on (W, !B" Px) for each XES. Furthermore, let 0" be a Markov time such that EAO") < 00. Then we have Dynkin's formula: for fE !:i(<fJ), f(x) = -Ex(f: (Df(X,)dt) + EAf(X a )). If <fJf is continuous at x, then <fJf(x)= lim EAf(X,u))- f(x) if X is not a trap, ui{x} EAru) =0 if x is a trap, where U is an open neighborhood of x. This representation of <fJ suggests that, when S is a smooth manifold, <fJ f for smooth func- tions f often becomes an elliptic partial dif- ferential operator or a mixture of it and an integral operator satisfying a certain maximum principle. In many practical cases, the data given to us are the coefficients of such a linear operator L carrying a certain linear space !:i(L) c B(S) into B(S). Thus a problem arises as to the existence and uniqueness of a strong Markov process \JJl whose generator <fJ is an extension of L. In connection with (15), Stroock and Varadhan [18] formulated this problem as the martingale problem concerning the existence and uniqueness of a probability measure Px on W such that PAX o = x) = 1 and f(X,)- f(Xo)- SLf(Xs)ds is a martingale on (W, !B" PxJ for every fE !:i(L). Since this formu- lation refers directly to probability measures on W, it is useful in the study of stochastic dif- ferential equations and of the convergence of Markov processes as well. Let m be a positive Radon measure on S which is strictly positive on each nonempty open set. A Markov process \JJl is said to be m- symmetric if 1 T,f(x)g(x)m(dx) = I f (X)T,g(x)m(dx) (.s; +00) holds for any t >0 and nonnegative mea- surable functions 1, g. Then {1;} is realized uniquely as a strongly continuous semigroup of symmetric operators on the real L 2 -space 974 U(S; m). Let <fJ be its infinitesimal generator. <fJ is then a nonpositive definite self-adjoint operator on L 2(S; m), and the folJowing defi- nition of the symmetric form tff' on L 2(S; m) makes sense: !:i[tff'] = !:i(), tff'(f,g)= (f, J -ti) g),f, gE!:i[tff'], where ( , ) denotes the inner product in L z(S; m). tff' is called the Dirichlet form of an m-symmetric Markov process \JJl. A Dirichlet form (on L 2(S; m)) is by definition a closed symmetric form tff' on L 2 (S;m) such that uE!:i[tff'J implies v= min(max(u, 0), I)E!:i[tff'] and tff'(v,v).s;tff'(u,u). A Dirichlet form is called regular if the space !:i[tff'] n Co(S) is dense in !:i[tff'] and in Co(S), where Co(S) is the space of continuous func- tions with compact support. For a given regu- lar Dirichlet form tff', there exists uniquely in a certain sense an m-symmetric Hunt process whose Dirichlet form is the given tff' [19]. In many cases of symmetric Markov processes, the form tff'(j',f) rather than the generator <fJf admits an explicit expression for smooth func- tions f When \JJl is the N-dimensional Brownian motion, the generator of its semigroup on L 2 (R N ) is given by <fJu=iu, !:i(<fJ) = {UE L2(RN)IuEL2(RN)}, and its Dirichlet form is given by tff'(u, v) = j grad u. grad vdx, 2JR N !:i[tff']= { UEL2(RN) 1 au EL2(RN), l .s;i.s;N } . aX i D. Markov Processes and Potential Theory Analytic notions and relations in classical potential theory can be interpreted in terms of Brownian motion ( 45 Brownian Motion). The relevant probabilistic notions and re- lations have been formulated not only for Brownian motion but also for a general Hunt process under the name of probabilistic potential theory [11,14,20]. Let \JJl be a Hunt process. A set A cS is called nearly Borel measurable if for each probability measure J1 on S there exist AI, Az E!B(S) such that Al cAcA z and Pp(X,E Az -AI for some tO)=O. !Bn(S) denotes the family of all nearly Borel subsets of S. Then !B(S) c !Bn(S) c !B(S) . The hitting time O"A for a nearly Borel set A is still a {!B,}-Markov time, and the probability PX(O"A>O) is either zero or one according to Blumenthal's zero-one law. x is said to be a regular point of A in the former case and an irregular point of A in the latter case. Let A r denote the totality of regular points of A. A set A c S is called finely open if for any x E A there exists a set C = CtX)E!B n containing 
975 S - A such that x rt C', Any open set is finely open by virtue of the right continuity of the paths. The fine topology defined by the fine open subsets of S is therefore finer than the original topology, A nearly Borel measurable function r on S is finely continuous if and only if it satisfies (12). A nonnegative - measurable function f is called :x-excessive «(XO) if e-"T,j.s;f for any t and e-atT,f(x)--> f(x) (t -->0). A O-excessive function is called simply excessive. Any (X-excessive function is nearly Borel measurable and finely continuous. The function f(x) = G,g(x) = Sgo e-atT,g(x)dt (G, is called the resolvent of \m) is (X-excessive for any nonnegative !B(S) -measurable func- tion g. If f is (X-excessive and A c S is nearly Borel measurable, then the function pf(x) = EsJe-,aAf(XaJ; O"A < 00) is (X-excessive again. When WI is the Brownian motion on R N (N  3), the class of excessive functions coincides with the class of nonnegative tsuperharmonic functions. Also, the fine topology is identical with the tCartan fine topology. A set A c S is called polar if A is contained in a set DE!B n such that PsJO"D< 00)=0 for any XES. A is called thin if A is contained in a set DE!B n such that Dr is empty. A set contained in a countable union of thin sets is called semipolar. If A is semi polar, then X,EA occurs for at most countably many values of t with Px-measure 1, XES. The set A - A r is semipolar for any AE!B n . Any polar set is semipolar. The converse is also true when \JJl is Brownian motion but not true when WI is the uniform motion to the right on R I. Under certain conditions of duality or sym- metry for the Hunt process, the notion of capacity can be introduced by generalizing the classical tNewtonian capacity. Then a set of zero capacity is identified with a polar set or its weaker version [11,19]. The stochastic solution of the classical tDirichlet problem can also be formulated for a wide class of diffu- sions. Thus, for a domain DeS, the solution of the equation <fJu(x)=O, xED, with boundary function f is expressed as u(x) = p?Df(x), xED, and the boundary behavior of u is studied probabilistically [9]. We have so far discussed mainly potential theory for transient Markov processes. How- ever, we can also establish potential theory for recurrent Markov processes following the model of the tlogarithmic potential in the case of 2-dimensional Brownian motion. E. Additive Functionals The notion of additive functionals was first introduced in relation to the study of time change of Markov processes, and in particular, 261 E Markov Processes to the study of the tlocal time of Brownian motion, Later it was studied in relation to potential theory and tmartingale theory, Addi- tive functionals play an important role in the study of Markov processes. For a Markov process \JJl, a function <p = <p,(w) of t and w is called a (right continuous and homogeneous) additive functional if the following conditions are satisfied: (1) -00 < <p,(w).s; 00; (2) <p,(w) is right continuous in t (t < (), <P,_ exists, and <p, = <p,_ for t  (; (3) <p,(w) is !B,-measurable in w for fixed t; and (4) for any t and s, (16) <Ps+,(w) = <Ps(w) + <p,(w s +) holds with P x - measure 1 for any XES. We call <p a perfect additive functional, if it satisfies (1), (2), (3) and satisfies (16) for aU w. Two additive functionals are equivalent if they are equal except on a set of Px-measure 0 for any t and x, and <p is called nonnegative if there exists a nonnegative additive functional which is equivalent to <po The concept of continuous additive functionals can be defined similarly. The function (X = (X,(w) of t and w is called a (right continuous and homogeneous) multi- plicative functional if it can be expressed by an additive functional <p as (17) (X,(w)=exp( -<p,(w)). An additive functional <p is said to be natural if <p, and the path X, of M have no jumps in common with Px-measure 1 for any XES. For a nonnegative additive functional <p, (18) U,(X)=E x (l OO e-atd<p,) is (X-excessive. Let WI be a Hunt process. A finite (X-excessive function u, can be expressed in the form (18) by a unique nonnegative natural additive functional <p if for every increasing sequence {O"n} of Markov times such that O";i(, EAe-M,u,(Xa,))-->O holds. It is possible to choose a continuous <p cor- responding to u, if and only if for every in- creasing sequence {O"n} of Markov times, EAe'a'u,(Xa.J)-->EAe-'au,(Xa)) holds, where 0" = lim 0".. In particular, u, may correspond to a continuous <p if the following two con- ditions are satisfied: (1) u, is bounded and e -at T,u,(x)-->O (t --> 00), (2) e -at T,u,(x)-->u,(x) (t -->0) uniformly in x [4]. In the case of Brownian motion, any nonnegative continu- ous additive functional <p can be characterized by a certain O"-finite measure f1 on S charging no polar set, which is called a smooth measure [9,21]. Such a characterization can be ex- tended to more general Hunt processes satis- fying certain conditions of duality or sym- metry with respect to a basic measure m, and the correspondence between <p and f1 is speci- fied by the following formula of Revuz [22]: 
261 F Markov Processes For any function fO, (19) l imEm (f 'f(Xs)d<Ps ) = r f(x)J1(dx). '10 t 0 Js An additive functional <p of a Hunt process Wl is called a martingale additive functional if EA<p(2) < 00 and EA<p,)=O for any t>O and XES. In fact, <p, is then a square integrable tmartingale on (W, {!B,},o' Px) for each XES. Let ./1{ be the set of all martingale additive functiona1s. The study of the class .41 consti- tutes a special aspect in the general theory of tsemimartingales [15, 16,23-26]. In particular, the quadratic variation (<p) of <pEA can be realized as a nonnegative continuous additive functional that satisfies EA(<p),) = EA<p;) for all XES and t>O. The additive functional (<p,if;) is similarly defined for <p, if; EA. Con- sider for <pEA a measurable function f on S such that E x (jhf(X s )2d(<p)s) is finite. Then the stochastic integral f' <p can be defined as an element of A satisfying (20) EA(f' <p),if;,) = Ex(f>(Xs)d(<P, if;>,), if;E 41. (f. <p), is often denoted by Sf(X,)d(ps' The space of all continuous functiona1s in A is denoted by Ae' Its orthogonal complement in the sense of ( , ) is denoted by Ad' The struc- ture of Ad is known, and each functional in Ad can be represented by means of the so- called Levy system, which is a pair consisting of a certain kernel on S and a certain nonnega- tive continuous additive functional [24,25]. If .4t is an N-dimensional Brownian motion, then At = A" and any <p E A takes the form (21) <p,= i L bi(Xs)dX:, where the integral appearing On the right-hand side is defined by (20) for the ith coordinate process X: - Xb E Jf{ and for a measurable function b i On R N with EAShb;(X s )2ds)< 00, xER N [23,28]. Replacing (16) by the "associative law" <p:(w) + <p(w) = <p(w), we can also define temporally inhomogeneous additive functiona1s. F. Transformation of Markov Processes There are several methods by which a given Markov process can be transformed to a new One. Here we mention some important transformations. Transformation by a Multiplicative Functional. For a Markov process \JJl, let a be a multiplica- 976 tive functional such that EAa,).s; 1 and PAa o = 1) = 1. Set P"(t, X, r) = EAa,Xr(X,)) (r E !B(S)), P"(t, X, {8}) = 1 - P"(t, X, S). Then P"(t, X, E) is a transition probability on S and corresponds to a Markov process \JJl" = (S, W", P x , XE S). We caIl \JJl" a transfor- mation of \JJl by a multiplicative functional :x. It is possible to choose W" = W. If \JJl is a strong Markov process, sO is Wl". Conversely, let \JJl and \JJl' be two Markov processes with the same path space W On the same state space S, If the probability law P of \JJl' is absolutely continuous relative to Px of \JJl, then \JJl' is a transformation of \JJl by a certain multiplica- tive functional of \JJl. This transformation includes killing, transformation by drift, and superharmonic transformation as special cases. (1) Ki1Iing. Transformation by a multiplica- tive functional a is called killing if O.s; a,.s; 1 holds. In fact, \JJl" can be constructed as fol- lows: A particle going along a path w of\JJl is "killed" (jumped to 8) in such a way that its surviving probability up to time t is a,(w). For a nonnegative bounded continuous function c(x) On S, let a,(w)=ex p ( - L C(Xs(W))dS). Then a satisfies the condition given previously. If the semigroup {T,} of \JJl is a Feller semi- group and has the generator <fJ, then the scmi- group {T,} of \JJl" is also a Feller semigroup, its generator <fJ" has the same domain as <fJ, and <fJ" = <fJ - C [29]. (2) Transformation by drift. For <pEA, let a, =exp{<p,-(<p),j2}. Then a, is a multiplicative functional. The transformation determined by a is called a transformation by drift. Let \JJl be the N-dimensional Brownian motion and <pEA be the functional expressed as in (21) with bounded b l , ..., b N ; then (<p),= Lit bl(Xs)ds, and the above formula for a gives a transfor- mation by drift. Moreover, if b l , ... , b N are in Coo (S), then the semigrou p of \JJl" is a Feller semigroup, and for a bounded function f with bounded continuous derivatives up to thc second order, f is in the domain of <fJ" and <fJ"f =(lj2)N + Lf=1 b i (8fj8x i ) [9,16]. (3) Superharmonic transformation. Let u be an excessive function of \JJl and A = {x I 0 < u(x) < oo}. Set a,(w) = u(X,(w»ju(Xo(w») if Xo(w)EA =0 if Xo(w)$A. Then a, is a multiplicative functional. The 
977 transformation defined by cx, first introduced by Doob, is caned a superharmonic transfor- mation. The transition probability P(t, x, r) of \JJl' is equal to u(xr l Sr P(t, x, dy)u(y) if XE A, 0 if xA and t>O, and bAr) if xA and t=O, for r in !B(S). In particular, if \JJl is a Feller process and u is a continuous function such that 0< c.s; u.s; k < 00, then \JJl' is also a Feller process and <fJ'f = u- I <fJ(uf), where the do- main of <fJ' is the set of f for which u{ is in the domain of <fJ. Time Change. We understand the term time change in a broad sense, including the follow- ing two important special cases. (4) Time change by an additive functional. Let \JJl be a Hunt process and <p be a nonnega- tive continuous additive functional such that PA<po=O)= 1. Set S*=={xIPx(<p,(w»O for every t > 0) = 1 }, and assume that S* is locally compact. Let W* be the set of all right con- tinuous functions on S*. They have discon- tinuities of at most the first kind, and !B* is the O"-algebra on W* generated by all tBorel cylinder sets. Set P;(B)=PAX",,-I(w)(W)EB) (BE!B*), where <p,-I is a right continuous in- verse function of <p,. Then \JJl* = {S*, W*, P;, XES*} is a Hunt process on S*, and we say that \JJl* is obtained by time change from \JJl by <po Roughly speaking, \JJl* can be con- sidered to be a Markov process with paths X,*(W) = X",,-'(W)(w). The resolvent of\JJl* is given by EASe-A""f(X,)d<p,). Suppose that a(x) is a continuous function on S such that O<c.s;a(x).s;k< 00, and set <p,(w) = Sa(Xs(w))ds, Then S*=S, W* = W, and !B* =!B, and \JJl* has the same fine topology as \JJl. The domain of the generator <fJ* of \JJl* coincides with that of <fJ of \JJl and <fJ*f = a-I <fJf Let \JJl and \JJl' be Markov processes with the same state space and the same path space W. If they have the same hitting proba- bilities {HK(x,') = PAX.KE')}, then each one can be obtained from the other by time change by a strictly increasing additive functional. The converse is also true [14]. (5) Subordination. This concept, introduced by Bochner, was then extended as follows: Let e -'P(A) be the tLaplace transform of an tinfinitely divisible distribution with support [0, 00), and let F,('), be the distribution with Laplace transform e -''P(A). Let {T,} be a Hille- y osida semigroup on a certain tBanach space, and set T,'P= S 7;F,(ds) (tBochner integral). Then {T,'P} is also a semigroup and is called the subordination of {T,} by if;. If <fJ is a gen- erator of {T,}, then - if; ( - <fJ) is a generator of { T,'P}. In particular, we can assume {1;} to be a nonnegative semigroup on C(S)(Coo(S)) such that 1; I = 1, and {X,} to be a Markov process 261 Ref. Markov Processes corresponding to the semigroup T,. Let {'P(t)} be an additive process that is independent of {X,} and satisfies E(e-A'I'(t)) = e-''I'(A). Set Y,(w) = X'I'(',W)(w); then {Y,} is a Markov process corresponding to the semigroup {T,'P}. The operation by which we obtain {Y,} from {X,} by using {'P(t)} is also called subordination. In particular, if {'P(t)} is a one-sided stable pro- cess of the cxth order ( 5 Additive Processes), this operation is called the subordination of the cxth order. If {X,} is an additive process, then the process obtained from it by subordination is also an additive process. The subordination of the cxth order of Brownian motion gives a tsymmetric stable process of the 2cxth order. Let {if;l(t)} and {if;2(t)} be independent of {X,}. Then the superposition of two subordinations of {X,} by if;1(t) and if;2(t) coincides with the subordination by {'Pd'P 2 (t,w),w)} [30]. (6) Reversed processes. Let {X'}'ET be a Markov process on (Q, !B, P) and X,* = X _, for tE T* = {tl-tE T}. Then {X'*}'ET* is a Markov process and is called a reversed process of {X,}. If the state space of S is countable, then the transition probability P(s, x, t, y) of {X,} and P*(s, x, t, y) of {X,*} satisfy the following condition: P*(s, x, t, y) = Q( - t, y)P( - t, y, -s,x)Q( -s,xr l , where Q(t,x)=P(X,(w)=x) and we assume Q(t,x)#O. For a given temporally homogeneous Mar- kov process {X,} with state space S, let {T,} be the semigroup corresponding to {X,}. Then a to"-finite measure m on (S, !B(S)) is called a subinvariant measure (or excessive measure) if the inequality Ss T,f(x)m(dx).s;Ssf(x)m(dx) holds for every nonnegative function f The measure m is called an invariant measure if the equality holds instead. Let {X,*} be an- other Markov process whose state space is S and whose semigroup is {T,*}. If for some O"-finite measure m and for every nonnega- tive f and 9 the equality Ss T,f(x)g(x)m(dx) = Ssf(x) T,* g(x)m(dx) holds, then {X,*} is called the dual process of {X,}. m is then a subinvariant measure of {X,} and {X,*}. The concept of reversed process is related to that of the dual process. For example, if m is an invariant probability measure of {X,} and if the distribution of Xo is m, then the distri- bution of X, is also m for every t, and the reversed process of {X,} coincides with the dual process [23]. References [1] A. N. Kolmogorov, Ober die analytischen Methoden in der Wahrscheinlichkeitsrech- nung, Math. Ann., 104 (1931), 415-458. [2] W. FeHer, 2m Theorie der stochastichen Prozesse, Math. Ann., 113 (1936),113-160. 
262 A Martingales [3] S. Bernstein, Equations differentielles stochastiques, Actualites Sci. Ind., Hermann, 1938, 531. [4] P. Levy, Processus stochastiques et mouvement brownien, Gauthier-Villars, 1948. [5] K. Ito, On stochastic differential equations, Mem. Amer. Math. Soc., 1951. [6] P. Levy, Theorie de l'addition des vari- ables aleatoires, Gauthier-Villars, 1937. [7] L. Doob, Stochastic processes, Wiley, 1953. [8] W. Feller, On second order differential operators, Ann. Math., (2) 61 (1955),90-105. [9] E. B. Dynkin, Markov Processes I, II, Springer, 1965. (Original in Russian, 1963.) [10] K. Ito and H. P. McKean, Jr., Diffusion processes and their sample paths, Springer, 1965. [11] G. A. Hunt, Markov processes and po- tentials, Illinois 1. Math., 1 (1957),44-93,316- 369; 2 (1958),151-213. [12] K. Ito, Lectures on stochastic processes, Tata Inst. of Fundamental Research, Bombay, 1960. [13] P. A. Meyer, Processus de Markov, Lec- ture notes in math. 26, Springer, 1967. [14] R. M. Blumenthal and R. K. Getoor, Markov processes and potential theory, Academic Press, 1968. [15J C. Dellacherie and P. A. Meyer, Prob- abilites et potentiel, two volumes, Hermann, 1975, 1980. [16] N. Ikeda and S. Watanabe, Stochastic differential equations and diffusion processes, Kodansha and North-Holland, 1981. [17] K. Ito, Stochastic processes, Lecture Notes Series, no. 16 Aarhus Univ., Matematisk Institute, 1969. [18] D. W. Stroock and S. R. S. Varadhan, Multidimensional diffusion processes, Springer, 1979. [19] M. Fukushima, Dirichlet forms and Mar- kov processes, Kodansha and North-Holland, 1980. [20] 1. L. Doob, Semimartingales and sub- harmonic functions, Trans. Amer. Math. Soc" 77 (1954), 86-121. [21J H. P. McKean and H. Tanaka, Additive functionals of the Brownian path, Mem, Coil. Sci. Univ. Kyoto, (A. Math.) 33 (1961),479- 506. [22] D. Revuz, Mesures associees aux fonc- tionnelles additives de Markov I, Trans. Amer. Math. Soc., 148 (1970),501-531. [23] M. Motoo and S. Watanabe, On a class of additive functionals of Markov processes, J. Math. Kyoto Univ., 4 (1965), 429-469. [24] H. Kunita and S. Watanabe, On square integrable martingales, Nagoya Math. 1., 30 (1967),209-245. [25] S. Watanabe, On discontinuous additive 978 functionals and Levy measures of a Markov process, Japan, 1. Math" 34 (1964), 53-70. [26] P. A. Meyer, Integral stochastiques I-III, Lecture notes in math. 39, Springer, 1967. [27] P. A. Meyer, Martingales local fonction- neles additives I, Lecture notes in math. 649, Springer, 1978. [28] A. D. Wenzell, Additive functionals of the multidimensional Wiener process, DoH Akad. Nauk SSSR, 139 (1961),13-16; 142 (1962), 1223-1226. [29] M. Kac, On some connections between probability theory and differential and integral equations, Proc. 2nd Berkeley Symp. Math. Stat. Prob., Univ. of California Press (1951), 189-215. [30] S. Bochner, Harmonic analysis and the theory of probability, Univ. of Califomia Press, 1955. 262 (XVI1.11) Martingales A. General Remarks Let (Q,!B, P) be a tprobability space and T a time parameter set. For each t E T, let 3't be a to"-algebra such that 3's c 3', c!B (s < t). With- out loss of generality we can assume that the probability space (Q,!B, P) is tcomplete and each 3', contains all measurable subsets of Q with P-measure zero. A real-valued tstochastic process {X,}'ET on (Q,!B, P) (which is also denoted by (X" tE T)) is called a martingale with respect to 3't provided that (i) X, is 3',- measurable and E(lX,I) < 00; and (ii) if s < t, then E(X,I3's)=X, (a.s.), (I) where (a,s.) means talmost surely, which will be omitted when there is no room for confusion. In this case, we also say that (X" 3'" t E T) is a martingale. If the eq uality in (I) is replaced by the inequality .s; (), {X'}'ET is called a super- martingale (submartingale). For the case of martingales the values of X, may be complex numbers. We write martingale, submartingale, and supermartingale as (M), (SbM), and (SpM), respectively, for short. If (X" 3'" tE T) is an (M) then (X" !B" t E T) is also an (M), where !B, = !B(X u , U E T, u.s; t). When the family of 0"- algebras involved in the definition of an (SbM) {X'}'ET is not explicitly mentioned, we under- stand that {X'}'ET is an (M) with respect to !B t . This convention is used for (SbM) and (SpM) also. The term martingale is due to 1. Ville. P. Levy had already made use of the concept in his work, but it was J. L. Doob who estab- 
979 Ii shed a systematic theory of martingales [2]. These concepts now provide us with not only basic tools but also fundamental principles in the theory of stochastic processes. The terminology "super-" and "submartin- gales" comes from the analogy to super- and subharmonic functions. As a mathematical model of games, however, they correspond on the contrary to subfair and superfair games, respectively. One of the fundamental mathe- matical principles in games was formulated by Doob as the following optional sampling theorem for martingales. To state it, we sup- pose for the moment that T= Z + = {O, 1,2,...}; a similar result also holds if T= R + and X, is right-continuous. Let 0" and r be bounded (3'n)- tstopping times (0" is bounded if mE Z + exists such that O".s;m a.s.). If(Xn,nEZ+) is an (M) «SbM), (SpM)) with respect to (3'n), then XaM=E(Xam',) (.s;, ). In particular, E(X aM )== E(X a ) (.s;, ). Con- versely, these properties characterize martin- gales: If X n is 3'n-measurable, E(IXnl)< 00 and E(Xa",)=E(Xa) (.s;,) for any bounded stop- ping times 0" and r, then (X n , 3'n,nEZ+) is an (M) «(SbM), (SpM)). The following are some basic properties of martingales: (1) For any given (SbM) X" m(t) = E(X,) is an increasing function of t, and X, is an (M) if and only if m(t) == constant. (2) Let X, and Y, be (SbM). Then aX, + bY, (a,bO) and sup(X" Y,) are (SbM). (3) Let X, be (SbM) and f(x) an increasing tconvex function defined in (-00,00). For any given toET, if E(lf(X'o)l) < 00, then (f(X,), tE( -00, to] n T) is an (SbM) and furthermore, when X, is an (M), (f(X,), tE( -00, to] n T) is an (SbM) even if f(x) is not increasing. In particular, X,+ = sup(X" 0) is an (SbM) if X, is an (SbM), and IX,I is an (SbM) if Xl is an (M). (4) Let X, be an (SbM). If a, b, tE T and a< t<b, then E(IX,I).s; 2E(IX b l)-E(X a ). (5) If X, is an (SbM) and X,  ° (t E T) and tIE T, then the family of trandom variables {X, I tE( -00, t IJ n T} is uniformly integrable (see (6)). (6) If X, is an (SbM), tnET and tnL then {X,j is uniformly integrable if and only if\imnoo E(X,) > -00. Here a family of random variables {X'}'EJ is said to be uniformly integrable if we have ! sp t"" IX,(w)ldP(w)=O, (2) where A n ,,= {w IIX,(w)1 > n}. Example 1. For any sequence ofrandom variables Y 1 , Yz, ..., if the relations E(Y,,+1IYI'''''Y,,)O, n=I,2,..., (3) hold, then n X n = L 1: v=l 262B Martingales is an (SbM). If the inequality sign in (3) is replaced by the equality sign, then X n is an (M). In particular, if {Y;,} is a sequence of independent random variables such that E( Yn) = 0, then n X="'Y n L, v v=l is an (M). B. Martingale Inequalities and Convergence Theorems The following inequalities, which are conse- quences of the above optional sampling theo- rem, are due to Doob. Let (X j , 1 .s;j.s; n) be a nonnegative (SbM) and X n * =max 1 j9Xj' Then AP(X:>A).s;E(Xn;X:>A).s;E(X n ) (..1.>0). From this we have that if (X j , I .s;}.s; n) is an (M) such that E(IXnIP)< 00, then P(IXI:>A).s;A-PE(IXnIP) (p I) and E(IXI:P).s; (  ) P E(IXnIP) (p> 1), p-l where IXI:=max 1 jnIXJ If U(I) is the up- crossing number of an interval I = [a, b] by a sample sequence of an (SbM) (X j , I .s;j.s; n) (i.e., the number of pairs (i,j), 1.s; i.s;j.s; n, such that Xi.s;u, Xjb and u<Xk<b for kk<j), then I E(U(I)).s; b_a (E[(Xn-a)+] -E[(X 1 -a)+]). Using these inequalities, we have the fol- lowing convergence theorems: (i) Let (X n , I.s; n < 00) be an (SbM). (a) If sUPn E(X n +) < 00, then limnooXn=Xoo exists with probability I and E (I X 00 I) < 00. In particular, every non- positive (SbM) and nonnegative (SpM) con- verge to integrable random variables a.s. (b) Furthermore, if {X n 11.s; n< oo} is uni- formly integrable, then limnoo X n = Xoo exists with probability I by (a), and (X n II .s; n.s; 00) is also an (SbM). (c) If X n is an (SbM) such that {E(IXnl)} is bounded, then limnoo X n exists, and if (X n , 1.s;n.s; 00) is an (SbM), then limnoo E(Xn).s; E(X oo )' where the equality holds if and only If {X n II .s; n < oo} is uniformly integrable. (ii) If(Xn, - 00 < n.s; -1) is an (SbM), then limn_ooXn=X-oo exists and -00 .s;X- oo < 00. Furthermore, if E(X_oo) > -00, then -oo<X_oo<oo and (X n , -oo.s;n.s; -I) is a uniformly integrable (SbM). (iii) Let (Xl> Xz, ..., Z) be an (SbM). (a) limnoo X n = Xoo exists and IimnooE(Xn).s;E(Xoo).s;E(Z). (b) 1imn00 E(Xn) = E(Xoo) if and only if {Xnl 1 .s; n < 00 } is uniformly integrable, and in 
262 C Martingales this case (X I ,X 2 , ...,Xoo,Z) is an (SbM). (iv) Let {ts'n} (-00 < n < 00 and 3'n c 3'n+1 c !B) be a sequence of O"-algebras on (Q,!B, P). Put 3'- 00 = nn 3'n and 3'+ 00 = Vn 3'n (the smallest O"-algebra containing all 3'n)' If Z is a random variable with E(IZI) < 00, then limn:too E(ZI 3'n)=E(ZI3':too) (a.s.). We mention some applications of these convergence theorems. Example 2. Let (Q, !B, P) be a probability space and {nn} (n = 1,2,...) a sequence of par- titions of Q into !B-measurable sets with posi- tive P-measure such that for each n, nn+1 is finer than nn' Let nn be {Mi n ), M1 n ), ... }, denote the smallest O"-algebra containing {M}")L 1,2, by 3'n for each n, and set 3'00 = Vnn' For a given tcompletely additive set function <p on (Q, 3'00)' if we define Xn(w) by Xn(W) = <p(M}"»)/P(M}"») for wEM}"),j = 1,2,..., then (X n ,3'n, 1 .s; n < 00) is an (M), and limnoo X n = X 00 exists. If P is the restriction of P to 3'00' then <p is tabsolutely continuous with respect to P if and only if {X n 11 .s; n < 00 } is uniformly integrable, and in this case, Xoo =d<p/dP with P-measure 1. If <p is tsingular with respect to P, then Xoo =0 with P-measure 1. Example 3. Let XI' X 2 , ... be any sequence of random variables and Z an integrable ran- dom variable that is measurable with respect to !B(X I , X 2 ,...). Then limnoo £(ZI X" X 2 , "', X n ) = Z (a.s,). In particular, if XI' X 2 ,... are independent and Z is !B(Xn, X n +1'''')- measurable for every n, then Z is equal to a constant a,s. This is the so-called tKolmo- gorov zero-one law. Recently, much work has been done to develop an analog of the classical theory of HP-spaces of harmonic functions in the frame- work of martingale theory. Let (Q, 3', P) and {3'n}nEZ+ be fixed, and let X = (X n ) be a uni- formly integrable (M) with respect to {3'n}' Then Xn==E(Xooln)' where Xoo=limnooXn, and we can identify X = (X n ) with Xoo. Set X* =suPnlXnl and [X,XJ =L0(Xn-Xn_d2 (X-I =0). By Doob's inequality above, ( P-l ) P IIX*llp.s; IIXoollp.s; IIX*ll p , 1 <p< 00, where II lip is the usual U(Q, P)-norm, 1.s; p.s; 00. Burkholder and Gundy, and Davis obtained the following inequality: There exist positive constants c p and C p depending only on p such that cpll[X, XJ I / 2 1I p .s; IIX*llp.s; Cpll[X, XJ I / 2 1I p , l.s;p<oo. If we set HP= {X IliXllw= II[X,XJ I / 2 1I p < oo}, p;;:' 1, then HP is a Banach space which can be identi- fied (by the identification of X and Xoo) with 980 U for p> 1. HI '* L I , however, and if we set BMO= {X IIIXIIMO=SUP IIE(IX oo n -X n - I 121Fn)11 00 < oo}, then BMO is a Banach space which can be identified with HI', the dual space of HI. In particular, Fefferman's inequality holds: IE(X oo Y oo )I.s;)2II X IIHIII YIIBMO, XEH I , YEBMO. This is an analog of the classical Fefferman's theorem, and X E BMO is an analog of a func- tion of bounded mean oscillation, the notion of which is due to John and Nirenberg. For details  [1, 3J; for an approach using con- formal martingales in continuous time  [4]. C. Sample Functions Let (X" 3'" tE T) be an (SbM). Here the para- meter set T may be an arbitrary subset of (-00,00). However, we can always find an interval I=:> T and for each tEl a O"-algebra rs:, and a random variable X, such that (X" rs:" tEI) is an (SbM) and P(X,=X,)= 1, rs:,= 3', for every t E T. Therefore we Can assume with- out loss of generality that the parameter set T is an interval. Furthermore, we assume that the stochastic process {X'}'ET is tsepa- rable. Using inequalities and convergence theorems for the sample sequences of (SbM)'s with discrete parameters, we obtain the follow- ing properties of the tsample functions of (SbM)'s with continuous parameters: (i) The sample function of an (SbM) {X,} is bounded on every finite interval [a, b JeT with proba- bility 1. (ii) Let To be the interior of T. Then P(X,+o and X,-o exist for all tET o )= I, and for each tE To, lim s l,E(X,).s;E(X,-o).s;E(X,).s; E(X,+o).s;limsj,E(X s )' (iii) Let D be the set of fixed discontinuity points of {X,} (t is called a tfixed discontinuity point of {X,} if P(X,-o = Xt=X,+o)# 1). Then D is an at most count- able set. We assume for simplicity that the parameter set is R + = [0, 00) and the sample functions of {X,} are right continuous with probability \. In this case, if(X" 3'" tER +) is an (SbM), then (X" 3',+, tE R +) is an (SbM), where 3',+ = ns>, 3's' Therefore we can assume that 3', = 3',+ for all tE R +. Let A be an interval and {r.} 'EA a family of stopping times such that r..s; rp < 00 whenever cx < p. Put X: = X" and 3': = 3''0' Then (X:, 3':, CXE A) is called the stochastic process obtained by an optional sampling (or a time change) from (X" 3'" tE T). By the optional sampling theorem of Section A, we can conclude that (X:, 3':, cxEA) is also an (SbM) if at least one of the following con- 
981 ditions is satisfied: (1) X,.s;O (a.s.) for all tER +; (2) {X" tE R +} is uniformly integrable; (3) for each exEA, r. is bounded with probability 1. D. Decompositions of Submartingales [1,6J If an (SbM) (X" 3'" tE R +) is uniformly inte- grable, lim,oo Xt = Xoo exists and (X" 3'" t E [0, ooJ) is an (SbM) for which 3'00 = V,3',. In addition, when the family {X,lrE:!} is uni- formly integrable, it is said to belong to class (D). Here:! denotes the collection of all stop- ping times with respect to (3',). If for each a (O<a< 00) the family {X,lrE:! and r:(a} is uniformly integrable, the family is said to belong locally to class (D). If an (SbM) X, is uniformly integrable, X, is decomposed as X,= E(X oo 13',)- (E(Xoo 13',)- X,), and if we take appropriate tversions of con- ditional expectations, E(X oo 13',) becomes a right continuous (M). In the decomposition (4), M, = E(X oo 13',) is an (M) and Z, = E(X oo 13',)- X, is a potential, i.e., a nonnegative right continuous (SpM) with Iim Hoo Z, =0 (a.s.), The decomposition (4) is called the Riesz decompo- sition of the (SbM) X,; the names "potential" and "Riesz decomposition" come from tpoten- tial theory in view of the obvious similarity. A stochastic process (A" tER +) on (Q,!B, P) is called a (right continuous) increasing process provided that (i) A, is 3',-measurable, E(A,) < 00 for each t, and (ii) with probability 1, the sample function is a right continuous and increasing function with Ao = O. If E(Aoo) < 00, where Aoo = lim,oo A" the stochastic process is said to be integrable. We have the following Doob-Meyer decomposition theorem: (i) A potential X, is decomposed as X, = E(A oo 13',) - A, by a suitably chosen integrable increasing process if and only if Xt belongs to class (D). (ii) An (SbM) X, is decomposed into X,=X; + A" where X; is an (M) and At is an increas- ing process if and only if X, belongs locally to class (D). (iii) In (i) and (ii), A, can be chosen to be tpredictable; under this condition, these decompositions are unique. Furthermore, At can be chosen to be continuous if and only if X, is regular in the sense that for any sequence rnE:!suchthatrnir limn ooE(X )=E(X ) for every a>O. '  ',AU 'AU E. Semi martingales [1,6, 7J tU:vy processes and tIto processes are natu- rally generalized to a class of stochastic pro- cesses called semimartingales. This class of 262 Ref. Martingales processes is important since it appears to be the most general class for which a systematic theory of stochastic calculus can be developed ( 406 Stochastic Differential Equations). Furthermore, we can define some quantities called local characteristics of semimartingales which are, in a sense, a generalization of Levy- Khinchin characteristics for Levy processes. In many interesting cases, the law of a semimar- tingale can be determined if we specify its local characteristics to be given functionals of the sample paths. This way of characterizing sto- chastic processes is known as a martingale problem, a concept introduced by Stroock and Varadhan. A stochastic process X" tER +, on (Q,!B,P) and (3',) is called a semimartingale if it Can be represented as (4) X,=Xo+M,+ v" where Xo is 3'o-measurable, M, (Mo =0) is a right continuous local martingale with respect to (3',), i.e., there exists O"n E:t such that O"n i 00 and M,u'=M'M, is a uniformly integrable (M) with respect to (3',) for each n, and V, (VO =0) is a right continuous t(3',)-adapted process such that t E [0, TJ f-> V, is of bounded vari- ation a.s. for every T> O. The class of semimar- tingales is known to be invariant under C 2 _ transformations (Ito's formula), time changes, and absolutely continuous changes of the basic probability P (Girsanov's theorem). Example 4. Let X, = (X,' , X/, ... , X;) be a system of semimartingales such that all xi, XiX/-JUt, i,j = 1,2, ..., n, are continuous (local) martingales with respect to (3',). Then X, is an n-dimensional Wiener process such that !B(X u - Xv, u, v  t) and the 3't are inde- pendent for every t. For this reason such an X, is often called a Wiener martingale. Example 5. Let X, be a semimartingale whose sample paths are increasing step func- tions with jumps of size 1 a.s. If X, -ct (c >0) is an (M) with respect to (3',) then X, is a tPois- son process with the same independence prop- erty as Example 4. For a similar characteri- zation of Levy processes in the martingale framework  [5]. The notion of semimartingales can also be defined for processes taking values in a dif- ferentiable manifold [8]. (5) References [IJ C. Dellacherie and P. A. Meyer, Theorie des martingales, Hermann, 1980, chs. 5, 8. [2J J. L. Doob, Stochastic processes, Wiley, 1953. [3J A. M. Garsia, Martingale inequalities, Benjamin, 1973. 
263 A Mathematical Models in Biology [4J R. K. Getoor and M. J. Sharpe, Conformal martingales, Inventiones Math., 16 (1972), 271-308. [5J B. Grigelionis, On the martingale charac- terization of stochastic processes with inde- pendent increments, Lit. Math. J., 17 (1977), 75-86. [6J N. Ikeda and S. Watanabe, Stochastic differential equations and diffusion processes, Kodansha and North-Holland, 1980, [7J 1. Jacod, Ca\cul stochastique et problemes de martingales, Lecture notes in math. 714, Springer, 1979, [8J L. Schwartz, Semi-martingales sur varietes, et martingales conformes sur des varietes analytiques complexes, Lecture notes in math, 780, Springer, 1980. 263 (XVI.B) Mathematical Models in Biology A. History In certain quantitative studies in biology, particularly in epidemiology, population ecol- ogy, and developmental biology, there arise mathematically describable models. Such mathematical descriptions of biological phe- nomena have sometimes been proposed by mathematicians and sometimes by biologists. We call these descriptions mathematical mod- els in biology. The first mathematical model was proposed by R. Ross [IJ in 1911. He undertook a theo- retical investigation of the propagation of malaria, His object was to give an analysis of the propagation of malaria in a certain locality under somewhat simplified conditions. His assumption was that both emigration and immigration were negligible and that there was no increase of population. In such a local- ity, the propagation of malaria is considered to be determined in general by two factors which evolve continuously and simultane- ously. On the one hand, the number of new in- fections depends upon the number and infec- tivity of the mosquitoes; on the other hand, at the same time the infectivity of mosquitoes is determined by the number of people in the given locality and the frequency of infection among them. Ross expressed the uninterrupted and simultaneous dependence of the first com- ponent on the second and that of the second on the first by means of a system of first-order differential equations. As a much simpler case, following P. F. Verhulst [2J, R. Pearl and L. 1. Reed [3J pro- 982 posed in 1920 a population model describing how a population of animals living in a fixed region initially increases, but after some period approaches a saturation point. Their model is the simple differential equation du -=(A-ku)u dt ' (1 ) where u is the population at time t and A and k are positive constants. The solution of this equation with positive initial data U o is ex- pressed by a monotone-increasing curve that tends to the saturation value A/k as t tends to infinity, We call this equation a logistic equa- tion. The solution fits the growth pattern of some real populations of insects; but we also have another model, which was exploited by experimental ecologists who were engaged in the study of a kind of bean weevil. That model is expressed by the difference equation [4J u n + 1 =f(u n ), (2) where Un is a population of the insects in the nth generation and f(u) is called the reproduc- tion function; the latter is often a simple func- tion, but is not necessarily monotonic, B. Population Model with Two Species V. Volterra [5J proposed the following model for the prey-predator relation between two species. We denote the prey population by u, and the predator population by v. These populations satisfy the system of equations du -= ( A-kv ) u dt ' dv dt = -(B-hu)v, (3) where A, B, k, and h are positive constants. The orbit of the solution of this system passing through (u o , v o ) in the first quadrant is closed, enclosing the point (B/h, A/ K) and staying in the first quadrant. An integral for this system is given by u-Bv-Aehu+kv = C, (4) where C is a constant of integration. Conse- quently, the solution starting at a point in the first quadrant is periodic, with the period depending on the initial data. The average populations over one period, 1 f '+T u=- u()d, T, 1 f '+T v=- v@d, T, (5) do not depend on the initial data. Many mathematical models in biology are expressed in terms of ordinary differential equations [6,7]. 
983 C. Fundamental Equations Some models are used to describe the spatial distributions, that is, the spatial patterns of populations. For example, we can consider equations for the pattern of a population with migration. We denote by p(t, x) the population den- sity at time t and position x in some region V in R 2 , v(t, x) is the velocity of the migra- tion.f(t,x) is a source (or supply) term for the population, Then we get - d d r p dx = - f pvn ds + r f dx, d v DV J v where n is the outer unit normal to the bound- ary ov. Consequently, we get a partial dif- ferential equation op &+V'(pv)=f Here we assume, for example, that v is a func- tion of x, p; then we get a thyperbolic equation op &+ V' (v(x,p)p)= f Also, if we assume v = -d(x, p)Vpjp and f = f(p), then we get op &=V(d(x,p)Vp)+f(p) on V, which is an equation of tparabolic type. We impose some boundary condition at the boundary OV; for example, the homogeneous tNeumann condition op -=0 on ov. on In this case, if V is a convex region and d(x, p) = constant. then H. Matano [8J showed that only a constant solution of the stationary problem dp+ f(p)=O and (8) can be stable as a limit of the solution of the nonstationary problem (7) and (8) as t tends to 00, (He assumed that f(p) is a smooth function of p.) D. A Diffusive Prey and Predator Population Model R. May [4J and M. Mimura [IOJ indepen- dently proposed mathematical models to ex- plain some aspects of the population pat- terns of plankton in water. Their work can be considered to be the completion of earlier attempts by the ecologist J. H. Steele [9]; Mimura and Nishida proved that Steele's 263 D Mathematical Models in Biology original model does not yield a solution de- scribing the observed population patterns, and Mimura and May introduced corrections arising from their ecological viewpoint. The model is the system op 02p 2 apq -=d -+kp-Jip - ot IOX 2 I+bp' 8q_d 02q 2 apq &- 2 0x 2 - YQ + I+bp ' on [0,1] x [0, 00), (10) (6) where p(t, x) and q(t, x) are the prey and pre- dator populations, respectively, and d j and d 2 are diffusion coefficients. k, Ji, y, a, and bare positive constants. If b, d j , d 2 , and Ji are zero, we get Volterra's prey and predator equation. Mimura considered the Cauchy problem with boundary conditions op op -(t,O)=-(t, 1)=0, ox at oq aq ox (t,O)=&(t, 1)=0. (II) (7) In this case, under some assumptions on d l , d 2 and on k, Ji, y, a, Mimura [11] succeeded in proving some stationary patterns of p and q which are stable but not constant for the vari- able x. He used bifurcation theory to prove existence of a stationary solution near the equilibrium point. For the case in which d l is sufficiently small, he used the singular pertur- bation theory introduced by Fife [12]. The same procedure can be applied to the following system of partial differential equa- tions, which was proposed by Gierer and Meinhardt in developmental biology [14]: oa 02a cpa 2 &=Da ox 2 +PoP+T- Jia, (8) Dh 0 2 h ,,2 -=D -+c p a -vh Dt h ox 2 ' (12) (9) where a(t, x) is the concentration of a short- range activator and h(t, x) is that of a long- range inhibitor. Da, Dh' Po, P, c, c', Ji, and v are all positive constants. Gierer and Meinhardt showed numerically that a system such as (12) exhibits some interesting spatial patterns, Under the boundary conditions (II), Mimura obtained a rigorous proof of the numerical results [13]. These models may not be realistic from the biological viewpoint, but they do afford some insight, so that biologists may now begin to devise more realistic ones. For example, the model given by (10) and (11) is one by means of which we can explain pattern formation in spatially homogenous environments. (We call the existence of such 
263 E Mathematical Models in Biology patterns "patchiness.") The model shows that without any local environmental inhomogene- ity, we can still find some patchiness in the population pattern of the plankton, There are many models that explain bio- logical phenomena only metaphorically. An example is the use of catastrophe theory by R. Thom [15] to model morphogenesis. E. Population Genetics Consider the development of a genetic struc- ture in a population consisting of N individ- uals. We assume two kinds, a and A, of genes; and an individual has one of the genotypes aa, AA, aA. We assume that every generation is of a constant size N, and we consider the number of genes, not genotypes. The num- ber X n of a genes in the nth generation is the result of a stochastic process. In the simpest model it is a tMarkov chain with the ttran- sition probability Pij from Xn=i to X n + 1 =j being expressed by a tbinomial distribution: _ ( 2N ) . 2N-j Pij- j p{(I- p;) , i Pi= 2N ' Models that take mutation, natural selection, and migration into account can be given by appropriate changes of Pi- Since the works of R, A. Fisher and S. Wright, natural selection and migration models have been the object of much research. A powerful method in the analysis of these Markov chains is the diffu- sion approximation [16-18]. Consider the time t and the ratio x = i/(2N) of a genes as continuous variables. Then the partial differen- tial equation op 1 0 2 0 ai= 4N oy2 {y(l- y)p} - oy {a(y)p} (13) is obtained for the transition probability den- sity p(t, x, y) of the ratio of a genes. Here, a(y) is a polynomial with degree .s;2. M. Kimura found a solution of (13) in terms of special functions and solved many problems in ab- sorption probability, limit distribution, speed of convergence, etc. Equation (13) is tKolmo- gorov's forward equation in the theory of tdiffusion processes, but the coefficients are degenerate at the boundaries (0 and 1). S. Karlin and J. McGregor [19J found that the foregoing discrete models can be taken as tbranching processes under the condition that the number of individuals in each generation is 2N. The relation between discrete and con- tinuous models has been studied in general dimensions as a problem in the convergence of stochastic processes [20,21]. For related topics  40 Biometrics. 984 References [I] R. Ross, The prevention of malaria, Lon- don, 1911. [2] P. F. Verhulst, Notice sur la loi que la population suit dans son accroisement, Corr. Math. Phys, Pub!. A, 10 (1838), 113, [3] R. Pearl and L. J. Reed, On the rate of growth of the population of the United States since 1790 and its mathematical representa- tion, Proc. Nat. Acad. Sci. US, 6 (1920), 275. [4J R. M, May, Theoretical ecology: Principles and applications, Blackwell, 1976. [5] V. Volterra, Le<;on sur la theorie mathe- matique de la lutte pour la vie, Gauthier- Villars, 1931. [6] E, C. Pielou, An introduction to mathe- matical ecology, Wiley, 1969. [7] J. M. Smith, Models in ecology, Cam- bridge Univ. Press, 1974. [8] H. Matano, Asymptotic behavior and stability of solutions of semi-linear diffusion equations, Publ. Res. Inst. Math. Sci., 15 (2) (1979), 401-454. [9] 1. H, Steele, Spatial heterogeneity and population stability, Nature, 248 (1974), 83. [10] M. Mimura, Y. Nishiura, and M. Yama- guti, Some diffusive prey and predator sys- tems and their bifurcation problems, Bifur- cation Theory and Applications in Scienti- fic Disciplines, O. Gurel and O. E. Rossler (eds.), Ann. New York Acad. Sci., 316 (1979), 490-510. [11] M. Mimura, Asymptotic behaviors of a parabolic system related to a planktonic prey and predator model, SIAM J. Appl. Math., 37, no. 3 (1979), 499-512. [12] P. C. Fife, Boundary and interior tran- sition layer phenomena for pairs of second- order differential equations, J. Math. Anal. Appl., 54 (1976), 497-521. [13] M. Mimura and M. Yamaguti, Pattern formation in interacting and diffusing system in population biology, Advances in Biophysics, M. Kotani (ed.), vol. 15, Tokyo Univ. Press, 1982. [14J G. Meinhardt, A theory of biological pattern formation, Kybernetik, 12 (1972), 30- 39, [15] R. Thom, La stabilite structurelle et la morphogenese, Benjamim, 1972. [16J 1. F, Crow and M, Kimura, An introduc- tion to population genetics theory, Harper & Row, 1970. [17] W. Feller, Diffusion processes in genet- ics, Proc. 2nd Berkeley Symp. Math. Statist. Probab., Univ. of California Press, 1951,227- 246. [18J W, 1. Ewens, Mathematical population genetics, Springer, 1979. 
985 [19J S. Karlin and J. McGregor, Direct prod- uct branching processes and related induced Markov chains I, Springer, 1965, 111-145. [20J K. Sato, Diffusion processes and a class of Markov chains related to population genet- ics, Osaka 1. Math., 13 (1976), 631-659. [21J S, N. Ethier, A class of degenerate diffu- sion processes occurring in population genet- ics, Comm. Pure Appl. Math., 29 (1976), 483- 493. 264 (XIX.1 ) Mathematical Programming A. Introduction Mathematical programming is a method useful in operations research, industrial en- gineering, systems engineering, etc., and it is concerned with optimization in general. If we want to minimize cost or loss or to maximize some effect or profit under various circum- stances, the corresponding mathematical prob- lem is usually formulated in the form of a mathematical-programming problem. Com- putational methods in mathematical program- ming have seen great development hand in hand with remarkable progress in computer technology, and we are now able to deal with large-scale problems practically. B. General Definitions A mathematical-programming problem in the broadest sense of the word is one of finding a maximum or minimum of a given function f: X --> R (where X is a set and R is an ordered set). However, in a narrower sense, it usually means a problem where X is a closed subset of the n-demensional Euclidean space R n (or, more generally, a Banach space) and f is a real-valued continuous function. Often, X is defined as the set of points in Rn that satisfy a system of equalities and inequalities of the form gi(X).s;O, ?:O, or =0 for some given real- valued functions gi (i = 1, ... ,m) defined on Rn. In mathematical programming, special termi- nology is sometimes used; e.g., X is conven- tionally called the feasible region, a point of X a feasible solution, the solution of the problem itself the optimal solution, f the objective func- tion, and the equalities and inequalities defined in tcrms of the gi are the constraints. 264 D Mathematical Programming C. Types of Mathematical-Programming Problems Mathematical-programming problems can be classified from various viewpoints, from which the problems get their various names. (i) Linear programming ( 255 Linear Pro- gramming): The objective function f and all the functions gi of constraints are linear. Non- linear programming: Some of f and the gi are nonlinear. Convex programming: f and all the gi are convex with inequalities of the form gi(X).s;O (and, consequently, X is a convex set); the problem is to minimize f ( 88 Convex Analysis, 255 Linear Programming, 292 Non- linear Programming). (ii) Disjunctive pro- gramming: X is not connected. Integer pro- gramming: X is a subset of the lattice points of integer coordinates in R n ( 215 Integer Pro- gramming). (iii) Parametric programming: f and/or the gi contain parameters, and the problem is to analyze the behavior of the optimal solution and/or the feasible region when the parameters vary. (iv) Stochastic programming: The p,,:-ameters in (iii) are ran- dom variables. (v) Multiobjective programming: A vector-valued function f: Rn--> R k (k?: 2) is taken as the objective function, where a certain order relation is defined in R k (such as the Cartesian product of the order in R or the lexicographic order based on the order in R). (vi) when the constraints and/or the objective function are endowed with special mathemat- ical structures, special names are accordingly given. For example, tnetwork flow problems, whose f and gi are defined with reference to a graph ( 186 Graph Theory), are called net- work programming; if f and the gi have an iterative or repetitive structure, the name multistage programming is used; dynamic programming ( 127 Dynamic Program- ming) can be regarded as a kind of multistage programmlllg. D. Mathematical-Programming Problems of Special Type In this encyclopedia, independent articles appear for those types of mathematical- programming problems that have been math- ematically well investigated and are most frequently used in practice ( 127 Dynamic Programming, 215 Integer Programming, 255 Linear Programming, 292 Nonlinear Pro- gramming, 349 Quadratic Programming). The following are some other typical problems that have been systematically studied. (i) Fractional programming: The objective function to bc minimizcd within the set X 
264 Ref. Mathematical Programming takes the form f(x) = C(x)jD(x) (C, D: Rn-->R), where D(x) is assumed to be positive in X. If X is convex, C is convex and of positive value, and D is concave, the function of a real para- meter A defined as h(A) == minxEx[C(x) - AD(x)J is monotone decreasing and convex in A. The problem of determining h(A) for each value of ..1. is a convex-programming problem, and the solution of the convex-programming problem forA = ..1.0 such that h(Ao) = 0 is the optimal solution of the original problem, In particular, if C and D are linear, i.e., if C(x) = c' . x + Co and D(x) = d' . x + do, and, moreover, if X = {xERnIAx.s;b,xO}, where AERmxn, bERm and the inequalities are to be read component- wise (linear fractional programming), then by introducing new variables y =xjD(x) and Yo = IjD(x), we can reduce the original problem to the linear-programming problem min(c'y + CoYo I Ay - byo .s;O,d" y+doyo = I, Y O,Yo O}. (ii) Geometric programming: The objective function f(x) = go(x) and the feasible region X={xERnlgi(x).s;O,i= I, .",m;xO} are defined in terms of functions gJx) of the so- called polynomial type: g;(x) = Lk CikjX;'k j (i = 0, I,..., m). The dual ( 292 Nonlinear Pro- gramming) of a geometric-programming prob- lem is easier to treat than the original, because it is a problem of finding the minimum of a convex function under constraints of linear equalities and inequalities. (iii) Nonconvex quadratic programming and bilinear programming: The former is the prob- lem of minimizing an objective function of the form f(x)=c"x +!x'Qx (where Q is a symmetric matrix not necessarily nonnegative definite) under linear constraints (equalities and inequalities), and the latter is to minimize f(x 1 , x 2 )=c', . x, +c. X 2 +X'l QX 2 over the region X = {(XI' X 2 )E R n l x R n2 1 Al XI .s; bj, A 2 x 2 .s;b 2 , XI O, x 2 O}, where QE Rn 1 x n " AiERmix"', and biERm i (i= 1,2). These two types of problems are related to each other, and a number of computational techniques, such as specially elaborated versions of the cutting-plane method ( 215 Integer Pro- gramming B), have been developed for them. References [IJ G. B. Dantzing, Linear programming and extensions, Princeton Univ. Press, 1963. [2] D. G. Luenberger, Optimization by vector space methods, Wiley, 1969. [3] J. F. Shapiro, Mathematical programming -Structures and algorithms, Wiley, 1979. [4J W. Dinkelbach, On nonlinear fractional programming, Management Sci., 13 (1967), 492-498, 986 [5J S. Schaible, Duality in fractional program- ming-A unified approach, Operations Res., 24 (1976), 452-46 \. [6J S. Schaible, Fractional programming 1- Duality, Management Sci., 22 (1976),858-867. [7] R. J. Duffin, E. L. Peterson, and C. Zener, Geometric programming-Theory and appli- cations, Wiley, 1967. [8J H. Konno, A cutting plane algorithm for solving bilinear programs, Math. Program., 11 (1976), 14-27. [9J H. Konno, Maximization of a convex quadratic function under linear constraints, Math. Program., 11 (1976), 117-128. 265 (XXl.g) Mathematics in the 17th Century The 17th century abounds in remarkable events in the history of science: the work on mechanics by Galileo (1564-1642); the discov- ery of analytic geometry by R. tDescartes (1596-1650); the early research in the theory of probability by P. de tFermat (1601-1665) and B. tPascal (1623-1662); the discovery of tmathematical induction by Pascal; and the discovery of the infinitesimal calculus (i.e., tdifferentlal calculus and tintegral calculus) by I. tNewton (1642-1727) and G. W. tLeib- niz (1646-1716). Compared with these events, the results of mathematical research from the Middle Ages to the 16th century seem minute. These results nonetheless exist, and historians of mathematics are now reevaluat- ing them, particularly those of the 15th and 16th centuries. Before Galileo, Tycho Brahe (1546-1601) kept precise records of astronomical observa- tions. J. Kepler (1571-1630), motivated by a mystic faith in the "harmony of the universe," studied Brahe's records and discovered three laws on the motion of planets. He also treated a question of cubature in his paper on the form and volume of the wine barrel (1615). His contemporaries J. Napier (1550-1617) and J. Burgi (1552-1632) discovered logarithms, which helped astronomers tremendously in their calculations. Napier used the concept of velocity in his introduction of logarithms; thus analysis began to germinate. Galileo founded the modern approach to the concepts of veloc- . ity and acceleration in his Dialogue on two new sciences (1638). Using a telescope he built, he discovered four of Jupiter's moons and ob- served sunspots. His espousal of the heliocen- tric theory of Copernicus (1473-1543) led to his denunciation before the Inquisition, which 
987 ordered him to refrain from holding or de- fending the theory. This is the most famous episode of his life, but his most significant contribution to science lies in his foundations of theoretical mechanics, which he freed from the Aristotelian tradition, thereby opening the way for Newton. F. Cavalieri (1598-1647), a disciple of Galileo, applied the notion of indivi- sibilis (originating in scholastic philosophy) to questions of quadrature in his Geometry of the indivisible (1635). This idea influenced Pascal and J. Wallis (1616-1703). Descartes established the method of analytic geometry in his Geometry (1637), published as an appendix to his Discourse on method. The use of tcoordinates can be traced back to Apollonius of Perga; Fermat used them occa- sionally, but Descartes made the first clear formulation of the method of representing general figures by means of equations, an essential step beyond Greek geometry. He also surpassed F. tViete (1540-1603) by abolishing the restriction that quantities represented by letters should be of one dimension. Contemporary to Descartes, Fermat made remarkable contributions to tnumber the- ory and Pascal to tprojective geometry, and through their correspondence there began the theory of probability; both also made precur- sory contributions to analysis. Fermat treated questions on maxima and minima of functions and tangents of curves; Pascal answered some questions on tangents, centers of gravity, quadrature, and cubature concerning tcy- cloids. Pascal also contributed to hydrostatics, made positive use of the idea of the point of infinity in projective geometry, and clearly formulated the principle of mathematical induction in his theory of arithmetic triangles, the so-called tpascal's triangles. (Freudenthal [4J established that the first discovery of the principle of mathematical induction is due to Pascal; the exact date of the discovery was studied by Hara [5].) In England, Wallis and 1. Barrow (1630- 1677) preceded Newton. Wallis solved ques- tions concerning quadrature and cubature (by bold use of the methods established by Cavalieri), infinite series, and interpolation, Barrow was Newton's teacher. He came close to the fundamental theorem of calculus, and Newton certainly owed some ideas which led to his discovery to Barrow's suggestions. New- ton completed his method of fluxions, corre- sponding to our differential calculus, toward 1669-1671, but his paper on this method was published only after his death (1736). In his main work, Principia mathematica philosophiae naturalis (1687), he used this method and its converse, without naming them, to solve the +two-body problem. The work begins with 265 Ref. Mathematics in the 17th Century three laws of mechanics and covers the motion of the moon and hydromechanics. Leibniz discovered infinitesimal calculus slightly later than Newton but independently. He invented convenient new symbols that gave great im- petus to the development of calculus: the sym- bols dx and J are due to him. Leibniz was in Paris in 1672-1677, where he made the ac- quaintance of Father Arnauld (1612-1694) of Port Royal (the monastery to which Pascal belonged) and the Dutch physicist C. Huygens (1629-1695). Through their suggestions, he studied the work of Descartes and Pascal. Leibniz's first papers on calculus were pub- lished in 1684 in the scientific journal Acta Eruditorum, which he also edited. The methods of calculus he initiated were transmitted to mathematicians of the tBernoulli family and then to L. tEuler, who developed them into the wide field of analysis. Thus the mathematics of the 17th century went clearly beyond Greek mathematics, The importance of numbers over diagrams was recognized, and mathematicians were no longer hesitant to use infinity. Moreover, people became aware of the importance of experimental methods in science. The position of mathematics as an important method of natural science was established; mathematics became a rational basis of scientific research. It was also in this century that a peculiar kind of mathematics was developed in Japan by T. Seki (1642?-1708). However, it lacked the Greektradition of viewing logical founda- tions as being important, and furthermore had no connection with natural sciences; con- sequently, it did not see subsequent develop- ment comparable to that of Western mathe- matics ( 230 Japanese Mathematics (Wasan)). References [IJ M. Cantor, Vorlesungen liber Geschichte der Mathematik, Teubner, II, 1892; III, 1898. [2J P. L. Boutroux, L'ideal scientifique des mathematiciens dans l'antiquite et les temps moderns, Presses Universitaires de France, new edition, 1955. [3J D. T. Whiteside, Patterns of mathematical thought in the later seventeenth century, Arch. History Exact Sci., 1 (1961), 179-388. [4J H. Freudenthal, Zur Geschichte der voll- stiindigen Induktion, Arch. Internat. Histoire Sci., 6, no. 22 (1953),17-37. [5J K. Hara, Pascal et l'induction mathema- tique, Rev. Histoire Sci. Appl., 15 (1962), 287- 302. [6J N, Bourbaki, Elements d'histoire des mathematiques, Hermann, 1960. 
266 Ref. Mathematics in the 18th Century 266 (XXI. 1 0) Mathematics in the 18th Century During the Age of Enlightenment, mathe- matical analysis developed steadily after its initiation in the preceding century. It found numerous applications in theoretical physics and contributed to the growth of rationalistic thought. The central figures in mathematics during the late 17th and early 18th centuries were I. tNewton (1642-1727) and G. W. tLeibniz (1646-1716). C. Maclaurin (1698-1746) of Scotland followed Newton, but no mathema- tician of Newton's stature appeared in Great Britain. An unfortunate dispute over the prior- ity of discovery of infinitesimal calculus arose between Newton and Leibniz, after which their followers came into conflict. This prevented the members of the English school from giving up their inconvenient notation system, which hindered their progress in calculus. On the Continent, Leibniz was succeeded by the mathematicians of the tBernoulli family and by L. tEuler (1707-1783), who brought about brilliant developments in calculus and its applications. They solved various kinds of tdifferential equations and invented the tcal- culus of variations. F. tViete used the term analysis in the sense of algebra as a heuristic method; the same term meant algebraic treat- ment of infinite series in Newton's usage. It was during this century that analysis secured a position as a branch of mathematics indepen- dent of algebra and geometry. t Analytical dynamics, initiated by Euler, was further de- veloped by J. L. tLagrange (1736-1813) and P. S. de tLaplace (1749-1827). Laplace, in systematizing tcelestial mechanics and the ttheory of probability, showed what a power- ful instrument analysis was. A. M. Legendre (1752-1833) investigated telliptic integrals and opened the way for C. F. tGauss and other mathematicians of the next century. The growth of the Ecole Poly technique, established during the time of the French Revolution, contributed to the brilliant pro- gress of French mathematics. Lagrange, La- place, and Legendre were all active in Paris during this period, as were S. D. Poisson (1781-1840) and 1. B. J. tFourier (1768-1830), both of whom made major contributions to analysis, and G. Monge (1746-1818), L. Car- not (1753-1823), and 1. V. Poncelet (1788- 1867). A problem proposed by Fourier in his theory of hea t propagation gave rise to an important question of analysis, One that later formed the basis of tharmonic analysis. Fourier and Poisson aimed at clarifying the 988 laws of nature, while Monge, Carnot, and Poncelet developed tprojective geometry and tdescriptive geometry for their purely geo- metric interest. Monge also did precursory work on tdifferential geometry. The mathematics of this century left many remarkable results in geometry and analysis and their applications; however, it inherited its methods from the preceding century and lacked critical spirit. Mathematicians were more interested in obtaining new results than in reflecting upon the rigor of their methods. Reexamination and reestablishment of the foundations of mathematics were left to the next century. References [IJ M. Cantor, Vorlesungen iiber Geschichte der Mathematik III, Teubner, 1898. [2J D. J. Struik, A concise history of mathe- matics, Dover, 1948, third edition, 1967. [3J N. Bourbaki, Elements d'histoire des mathematiques, Hermann, 1960. 267 (XXI.11) Mathematics in the 19th Century The 19th century was a critical period in the history of mathematics. When the century began, the memory of the French Revolution was still fresh, and World War I followed closely upon the turn of the 20th century. During this time mathematics made enormous progress and left a tremendous legacy to the present century. Increased personal liberty released people from traditions and allowed culture to spread to wider classes of society, producing a greater reservoir of talent. Re- search activities were intensified in the univer- sities, and many specialists collaborated or competed with each other. The century can be divided into three periods: the first 20 years, during which many new fields of mathematics arose; the next 30 years, a period of further development; and the latter half of the century, when these fields attained maturity. In 1801, Disquisitiones arithmeticae by the young C. F. tGauss (1777-1855) appeared. It contained a systematized theory of num- bers, ushering in a new era of mathematics. In France, many mathematicians studied at the Ecole Poly technique, established during the French Revolution. Among them, A. L. tCauchy (1789-1857) was one of the most prominent. He gave the exact definitions of 
989 limit and convergence, thus giving solid foun- dations to calculus some 150 years after its discovery. N, H. tAbel (1802-1829) and C. G. J. tJacobi (1804-1851) studied telliptic func- tions during the same time, producing results sensational to their contemporaries. Gauss gave a rigorous proof of the existence of roots of algebraic equations in the field of complex numbers; Abel proved the algebraic nonsolva- bility of algebraic equations of degree;;" 5; and E. tGalois (1811-1832) created his theory of algebraic equations, which began a new phase in algebra, J, V, Poncelet (1788-1867), another graduate of the Ecole Poly technique, devel- oped tprojective geometry along the lines pursued by G. Monge (1746-1818). His re- search was continued in Germany by A. F. Mobius (1790-1868), J. Steiner (1796-1863), and J. Plucker (1801-1868), Steiner investi- gated, in particular, talgebraic curves and surfaces by synthetic methods; Plucker intro- duced projective coordinates, thus enlarging the usage of analytic methods in geometry. These brilliant results in the first period of the 19th century were all achieved by young mathematicians, most of them still in their twenties. The new geometry was developed in the period 1830-1840 by K. G. C. von Staudt (1798-1867) in Germany and M. Chasles (1793-1880) in France. In the 1840s, the the- ory of tinvariants was taken up in connection with geometry; outstanding in this domain were the English mathematicians A. Cayley (1821-1895) and J. J. Sylvester (1814-1897). P. G. L. tDirichlet (1805-1859) endeavored to simplify Gauss's number theory and intro- duced the tDirichlet series in his computation of tclass numbers of binary tquadratic forms. He also initiated the theory of ttrigonometric series by giving a rigorous proof of a theorem on expansion in tFourier series, introduced by J. B. J. tFourier (1768-1830) in his theory of heat propagation. Another notable event was the independent and almost simultaneous discovery oftnon-Euclidean geometry by J. Bolyai (1802-1860) and N. I. Lobachevskii (1793-1856), which aroused philosophical interest since it changed the character of axioms. The invention of tquaternions by W. R. Hamilton (1805-1865), publication of Ausdehungslehre (theory of extensions) by H. G. Grassmann (1809-1877), and development of the algebra ofIogic by G. Boole (1815- 1864) also occurred during this period, but these notions received neither full comprehen- sion nor sympathy until later. During the latter half of the century, G. F. B. tRiemann (1826-1866) and K. T. W. tWeierstrass (1815-1897) were prominent. Both have had great influence on the mathe- 267 Mathematics in the 19th Century matics of the 20th century, the former by his brilliant and abundant production, the latter by his mature and critical spirit. Riemann lived only 40 years, and published, in rapid suc- cession, his epoch-making ideas on the theory of functions of a complex variable, t Abelian functions, trigonometric series, tfoundations of geometry, tdistribution of primes, and tzeta functions. Weierstrass was already 49 when, after teaching in a country gymnasium (second- ary or college preparatory school), he became a professor at the University of Berlin. The theory of the functions of a complex variable, initiated by Cauchy in the I 820s, had to wait for the contribution of these men to attain completion in the form of the theory oftellip- tic functions. Riemann's influence is also con- siderable in algebraic geometry and the theory of tdifferential equations. Weierstrass reformed the tcalculus of variations. His critical ap- proach uncovered pathological functions, such as continuous nowhere differentiable functions ( 106 Differential Calculus) and Peano space- filling curves ( 93 Curves J), and to real analysis, based largely on the set theory of G. tCantor (1848-1918). Concerning the foundations of mathemat- ics, Cantor, M. C. Meray, and 1. W. R. Dede- kind (1831-1916) established the theory of irrational numbers. Dedekind and G. Peano (1858-1932) developed the theory of natural numbers; their results brought about the "arithmetization" of mathematics and led to the research in the foundations of mathematics of the present century. Cayley and F. tKlein (1849-1925) inter- preted non-Euclidean geometry by means of metrics introduced in projective geometry. Toward the cnd of the ccntury, D. tHilbert (1862-1943) examined the roles of axioms of congruence, continuity, and parallelism in Euclidean geometry, thus initiating the study of taxiom systems in general. The theory of tgroups, in particular finite groups, was developed around 1870 by C. Jordan (1838-1922), G. Frobenius (1849- 1917), and W. S. Burnside (1852-1927). M. S. tLie (1842-1899) applied infinitesimal trans- formations to differential equations, and Klein applied the groups of linear transformations to geometry, The discovery of tautomorphic functions by Klein and H, tPoincare (1854- 1912) was another brilliant application of the theory of groups. In the algebraic theory of numbers, originated by Gauss, E. E, tKummer (1810-1893) developed the idea of "ideal num- bers" ( 14 Algebraic Number Fields); Dede- kind then established the theory of tideals. L. tKronecker (1823-1891), an admirer of Abel's work, studied algebraic equations and dis- covered that every t Abelian extension of the 
267 Ref. Mathematics in the 19th Century rational number field is contained in a tcy- clotomic field; he believed that relations of a similar kind would hold between tmodular equations of elliptic functions with tcomplex multiplications and Abelian extensions of timaginary quadratic fields, and enunciated a famous conjecture known as his "dream in his youth." Finally, we note the appearance of another important mathematician, S, V. Kovalevskaya (1850-1891). After studying with Weierstrass, in 1884 she was invited by G. M. Mittag- Leffier (1846-1927) to teach at the University of Stockholm, where she remained until her death. Toward the end of the 19th century, the subjects of mathematical research became highly diversified. Branches were further rami- fied into more specialized branches, while unexpected relations were found between previously unconncctcd fields. Thc situation became so complicated that it was difficult to view mathematics as a whole. It was under these circumstances that in 1898, at the sugges- tion of F. Meyer and under the sponsorship of the Academies of Gottingen, Berlin, and Vienna, a project was initiated to compile an encyclopedia of the mathematical sciences. The Enzyklopiidie der mathematischen Wissen- schaften was completed in 20 years; it provides a useful overview of the mathematics of the 19th century. Toward the end of the century, the Interna- tional Congress of Mathematicians (ICM) was established to foster communication among mathematicians from all parts of the world, Before World War I broke out, the ICM met in Zurich (1896), Paris (1900), Heidelberg (1904), Rome (1908), and Cam- bridge, Mass. (1912). During this period, mathe- matical societies were formed in many coun- tries, e.g., the London Mathematical Society (1865), the Societe Mathematique de France (1872), the American Mathematical Society (1888), the Deutsche Mathematiker Vereini- gung (1907), and the Mathematical Society of Tokyo (1877), which later became the Physico- Mathematical Society of Japan and then was split in two in 1946, The present Mathematical Society of Japan evolved from this division. Five years after the 1872 reform of the Japa- nese educational system, the University of Tokyo was established, and D. Kikuchi (1855- 1917) and R. Fujisawa (1861-1933) taught at the Department of Mathematics during its early ears. Under their influence, Japanese research in European-style mathematics (based on Greek traditions) began. The Universities of Kyoto and T6hoku were established in 1897 and 1911, respectively. From the beginning of the 20th century, original results were ob- 990 tained and published in the Proceedings of the Physico-Mathematical Society of Japan and in the journals of the faculties of science of these universities, In 1911, the T6hoku Mathematical Journal was founded by T. Hayashi (1873- 1935). In 1920, a paper on tclass field theory by T. tTakagi (1875-1960) was published in the Journal of the College of Science Qfthe Univer- sity of Tokyo. Thus the position of Japanese mathematics gradually came to be established. References [IJ F. Klein, Vorlesungen uber die Entwick- lung der Mathematik im 19. Jahrhundert, Springer, I, 1926; II, 1927 (Chelsea, 1956). [2J N. Bourbaki, Les elements d'histoire des mathematiques, Hermann, second edition, 1969. [3] Enzyklopadie der mathematischen Wissen- schaften mit Einschluss ihrer Anwendungen, Teubner, 1898-1934. 268 (XIV.10) Mathieu Functions A. Mathieu's Differential Equation The 2-dimensional tHe1mholtz equation (+ k 2 )'l' =0 (= fP jox 2 + fP joy2), sepa- rated in telliptic coordinates , 11 given by x = c cosh  cos 11, Y == c sinh  sin 11, has a solution of the form 'l' =X() Y(IJ) whose factors X@, Y(I1) satisfy d 2 ujdz 2 +(a- 2qcos2z)u=0, d 2 ujdz 2 -(a - 2qcosh2z)u=0, (1 ) (2) respectively, where a is an arbitrary constant and q=k 2 c 2 j4. By the substitution z--> :!:iz, (1) becomes (2), (1) and its solutions are known as Mathieu's differential equation and the Mathieu functions, and (2) and its solutions are calIed the modified Mathieu differcntial equa- tion and the modified Mathieu functions. B. Hill's Differential Equation Hill's differential equation is a linear ordinary differential equation of the second order: d 2 ujdx2 + F(x)u = 0, (3) with F(x+2n)=F(x). It is named after G. W. Hill, who investigated it in his study of lunar motion. This equation includes Mathieu's differential equation and tLame's differential equation as particular cases, and by suitable 
991 transformations, tLegendre's differential equa- tion and the tcontluent hypergeometric dif- ferential equations as well. While F(x) is periodic, solutions of (3) are not necessarily so. There always exists, how- ever, a particular solution that is quasiperiodic in the sense that u(x+2n)=(Ju(x), (J = constant (Floquet's theorem). That is, the differential equation (3) has a solution of the form u(x)=ex<p(x), where <p(x+2n)= <p(x) and Ji (defined by (J= e2n) is called the characteristic exponent. Being a particular case of Hill's equation, (1) has a general solution of the form U(z) = AeZ <p(z) + Be-z<p( - z), where <p(z + n) = <p(z). For those values of a (called eigenvalues) that make the character- istic exponent Ji equal to 0 or i, u(z) has period n or 2n and is called a Mathieu function of the first kind, also called an elliptic cylinder func- tion when employed in problems of diffraction by an elliptic cylinder. Sometimes it is called simply the Mathieu function, and other solu- tions of (1) are referred to as general Mathieu functions. The formula en F(x) = I ame imx m=-oo suggests, in conjunction with Floquet's theo- rem, a solution of (3) in the form 00 U = eX L bne inx . (8) n= - 00 Substituting (7) and (8) into (3) and comparing coefficients of e(+in)x, we have infinitely many linear equations 00 (Ji + in)2 b n + L amb n - m = 0, m=-oo n=...,-2,-1,0,1,2,.... (9) By eliminating the b n in (9), we also obtain an infinite determinantal equation called Hill's determinantal equation, (Ji)= IB,sl =0, where the elements B,s of Hill's determinant (Ji) are such that { I if r=s, Brs= (Ji+;)s+ao if r#s. Here an infinite determinant D = I Bmn I(m, n = - 00, ... , 00) is defined as the limit, if it exists, of Dm = det(B i ) (i,j = - m, ..., m) as m--> 00, The formula (10) can be reduced to a simpler form sin2niJi=(0)sin2n. (11) 268C Mathieu Functions This determines a characteristic exponent Ji, which in turn determines the b n in (9) and a solution (8). This procedure is called Hill's method of solution. Applying Hill's method of solution to (1), we have an equation for the characteristic expo- nent Ji: (4) sin 2 (n/2) iJi = MO)sin2(n/2), (12) where the infinite determinant (O)=IBmnl has elements such that (5) 1 1 -q B == a-4m 2 mn o if m=n if n=m:f:: 1 otherwise (m,n=..., -1,0, 1,...). (6) When q-->O, we have n n (O)= 1 +2q2  cot-+O(q4). 8 a(l-a) 2 (13) Thus if q=O, we have (O)= 1, and a=4n 2 , (2n + 1)2 correspond to Ji = 0, i in (12). C. Mathieu Functions of the First Kind (7) Mathieu functions of the first kind are further classified into the following four types: ce 2n (z,q)= IA;2n)cos2rz, (a 2n ), se 2n + 1 (z, q) = L Bin+-+;1)sin(2r + l)z, (14.1 ) (b 2n +d, ce 2n + 1 (z,q)= LAW)cos(2r+ l)z, (14.2) (a 2n +d, se 2n + 2 (z, q) = L Bi 2 :',+l)sin(2r+ 2)z, (b 2n + 2 ) (14.3) (14.4) (10) (n, r = 0,1,2,...), where the appended terms in parentheses are eigenvalues, ordered by a2n < b 2n + 1 <a 2n + 1 <b 2n + 2 for a given q, and increas- ing with n. Each of these series converges ab- solutely and uniformly for all finite z and has n zeros in 0 <z < n/2. In addition, orthonormal- ity relations f 2n o cem(x)sen(x)dx =0, f 2n f 2n o cem(x)cen(x)dx= 0 sem(x)sen(x)dx =n()mn hold. When q-->O, ceo (z)--> 1/.j2, cem(z)--> cosmz, and sem(z)-->sinmz, For small q we assume that the quantities involved have power series expansions in q, 
268 D Mathieu Functions e.g., a=m 2 +rxq+ fIq2 +..., cem(z)=cosmz+qFdz) +q2 F 2 (z) + ..., and we substitute them in (I) to determine successively rx, [3, .." FI (z), F 2 (z),... (Mathieu's method). For larger q, (14) is substituted in (1) to give recurrence formulas for the coefficients, e.g., for ce 2n (z), -aA\fn)+qAb 2n )=O, 2qAb 2n ) +(4 - a) Ai 2n ) +qA!fn) =0, qAfrn)2 +(4r 2 -a)Af:)+qAin2 =0, r> 1. After we eliminate the Af..n), the formulas lead to an equation for the eigenvalues a 2n : a -q - 2q a - 4 o -q o -q a-16 o 0 o 0 -q 0 =0, or, equivalently, to a tcontinued fraction 2 q 2 q2 q2 a- a-4 - a-16 - a-36 -.... Given q, we can find the a 2n from (17) and determine the A in) for each a 2n from (15) (Ince-Goldstein method). D. Mathieu Functions of the Second Kind and Modified Mathieu Functions There exists only one (half-)periodic solution of (1) corresponding to each (half-)periodic eigen- value ( Section E). Therefore other solu- tions corresponding to the same am or h m and independent of cem(z, q) or sem(z, q) are non- periodic. They are called the Mathieu functions of the second kind and are denoted by fem(z, q) or gem(z, q). By the substitution z-->iz in (14) we obtain formulas for the modified Mathieu functions of the first kind, Cem(z, q)= cem(iz, q), Sem(z, q) = - isem(iz, q). When q-->O, then Ceo(z)--> 1/)2, Cem(z)--> coshmz, Sem(z)-->sinhmz, Similarly, by the substitution z -->iz we obtain modified Mathieu functions of the second kind from Mathieu functions of the second kind. In addition, we introduce modified Mathieu functions of the third kind as those linear combinations of modified Mathieu functions of the first and second kinds that have the asymptotic form e- Y y-l/2 (y==ql/2 e z) as z-->oo.ln addition to the Fourier expansion (7), expansion of the Mathieu functions in terms of tBessel functions 992 is possible, e.g., after taking q=h 2 , Ce 2n (z, q)= 'L A 2 ,cosh2rz =(AO)-l ce 2n (n/2, q) x I (-I)' A 2 J2r(2hcosh z) =(Ao) -I ce 2n (0, q) X IA 2 ,12r(2hsinhz) =(Ao) -2 ce2n (0, q)ce 2n (n/2, q) x D -1)' A 2 ,1,(he-V,(he Z ). (19.1) (19.2) (19.3) (19.4) (15) These series converge absolutely and uni- formly for all finite z. Replacing the J on the right-hand sides of (19) by N 2 r(2hcoshz), N 2r (2h sinh z), J,(he-Z)N,(he Z ), respectively, in an obvious way we obtain infinite series for a function that again satisfies (2) and is denoted by FeY2n(z,q). In a similar manner other modi- fied Mathieu functions of the second kind GeY2n+dz, q), FeY2n+1 (z, q), GeY2n+2(Z, q) can be obtained. These are more convenient for prac- tical applications than fem(iz, q) and gem(iz, q) since they converge more rapidly. The equations (16) (17) d 2 u/dz 2 +(a +2qcos2z)u == 0, d 2 u/dz 2 -(a+2qcosh2z)u=0 (20) (21) obtained from (1) and (2) by the substitution q--> -q are the results of separating (1'1- k 2 )<p =0. In general, iff(z,q) is a solution of (1), then f(n/2 - z, q) is a solution of (20). Thus the formulas ce2n(z, -q)=( -1)"ce2n(n/2-z,q), (22.1) ce 2n + l (z, -q)=( -1)"se 2n + 1 (n/2-z,q), (22.2) se 2n + 1 (z, - q) =( -I )"ce 2n + 1 (n/2 - z, q), (22.3) se 2n + 2 (z, -q)=( _1)n se 2n +2(n/2 - z, q) (22.4) (18) can be adopted as definitions of cem(z, q), sem(z, q) for q < 0 (Ince's definition). Accord- ingly, an expansion of Ce holds in terms of modified Bessel functions 1m in place of the J m in (19), which in turn becomes a solution of (21) if we replace 1m by (-1)mKm/n: Fek 2n (z, - q) =( -1)"(nAo) -I ce 2n (n/2, q) x I A 2r K 2r (2hsinh z). (23) In a similar manner, Fek 2n + 1 (z, -q), Gek2n+dz, -q), Gek2n+2(z, -q) can be defined. They decrease exponentially as z --> 00 and hence are precisely the modified Mathieu functions of the third kind. E. Stability Let UI(X), u 2 (x) be a fundamental system of solutions of (3) such that udO) = 1, U'I (0) =0; 
993 u 2 (0)=0, u(O)= 1; then (1, Ji in (4), (5) are given by (1=e2n, 2nJi=arccoshA, 2A =ud2n)+u(2n), There are two values of Ji satisfying (24); let them be Jil and Ji2 (Ji2 = - Jid. If F(x) is a real function, then U I (x), U2(X) are also real func- tions and A is a real number. It can be seen from (24) that A < -1,1 <A <0, O<A < 1, 1 < A correspond to :t2nJi = arc cosh IA I + ni, i(arccosIAI+n), i arccosA, arccoshA, respec- tively. Consequently, when IA 1< 1, the general solution of (3) neither diverges nor vanishes as x tends to infinity. Such solutions are called stable solutions of Hill's equation, or some- times Hill's functions. When IA I > 1, either eIX or e2X tends to infinity with x. Such solutions are called unstable solutions. If A = I ( -1), we have Ji = 0 (ij2) and a solution of the form of u = <p(x) (e ix / 2 <p(x» with the period 2n (4n), called a periodic (half-periodic) solution. When we apply the Mathieu functions to physical and engineering sciences, such as the theory of oscillation and quantum mechanics, it is convenient to modify (3) in the form d 2 ujdx 2 +(..1. +y<l>(x»u =0 involving parameters ..1., y, Then with y fixed, there exist countably many values of ..1. (called eigenvalues) corresponding to periodic or half- periodic solutions of (25). Let them be Ao.s; }'I .s; ... or Al .s; A 2 .s; ... , respectively; then we have }'O<Al .s;A 2 < i".s; A 2 <... <I 2k -1 .s;I 2k < A2k-1 .s; A 2k < ... . The values of A in the open intervals (A2k-2' A2k-d and (Izk> }'zk-d correspond to stable solutions, while the }:s in other intervals correspond to unstable solutions (Haupt's theorems). When both ). and y vary, the Ay-plane is divided into regions corresponding to stable or unstable solutions according as the character- istic exponent Ji is purely imaginary or not. For example, when <I>(x)=2cosx in equation (25), we obtain the Mathieu equation. In this case the Ay-plane is divided as shown in Fig. 1, where shaded (unshaded) regions correspond to stable (unstable) solutions and boundary curves give eigenvalues corresponding to peri- odic or half-periodic solutions. References [1] E. T. Whittaker and G. N. Watson, A course of modern analysis, Cambridge Univ. Press, fourth edition, 1958, 269 A Matrices (24) Fig. 1 (25) [2J M. J. O. Strutt, Lamesche-, Mathieusche-, und verwandte Funktionen in Physik und Technik, Erg. Math., Springer, 1932 (Chelsea, 1967). [3J N. W. McLachlan, Theory and application of Mathieu functions, Clarendon Press, 1947. [4J J. Meixner and F. W. Schiifke, Mathieu- sche Funktionen und Sphiiroidfunktionen, Springer, 1954. Also  references to 133 Ellipsoidal Harmonics. 269 (111.2) Matrices A. General Remarks Let K be a tring or a tfield. As examples of such K, we may take the real number field R and the complex number field C. By a matrix with elements in K, we mean an array of mn elements a ik (i== 1,... ,m; k= 1,..., n) of K ar- ranged in a rectangular form: ( al! a 21 aml a ln ) a 2n a mn a 12 22 a m2 The element a ik is called the (i, k)-element (entry or component). (Sometimes, instead of using parentheses, we use II II or [ ].) More pre- cisely, this matrix is said to be an m by n ma- trix (m x n matrix or matrix of (m, n)-type). In particular, an n x n matrix is called a square matrix of degree (or order) n, while a matrix in general is sometimes called a rectangular ma- trix. Each horizontal n-tuple in an m x n ma- trix is called a row of the matrix, and each vertical m-tuple is called a column of the matrix. We often abbreviate the notation for the matrix given previously by writing (aid or 
269B Matrices simply A. A square matrix is called a diagonal matrix if all its components are zero except possibly for diagonal components a;;, that is, if a ik =0 for i # k. If the components a ii of a diagonal matrix are all equal, it is called a scalar matrix. An n x n matrix whose (i, k)- element is equal to (jik is called the unit matrix (or identity matrix) of degree n, where (jik is the Kronecker delta, which is defined by (jii = 1 and (jij=O for i #j. We denote this matrix by In, or simply by I if there is no need to specify n. A I x n matrix (ai' a2,"', an) is called a row vector of dimension n, and an m x I matrix (JJ is called a column vector of dimension m, The m rows and n columns of an m x n matrix A are called row vectors and column vectors of A. B. Operations on Matrices Two matrices A = (a ik ) and B = (b ik ) are said to be equal if and only if they are of the same type and a ik = b ik (i = I, ." , m; k = 1, ... , n), If both A and Bare m x n matrices, we define the sum of A and B by A + B = (a ik + b ik ). The prod- uct of two matrices A and B is defined by AB = (Cik), Cik = Ljaijb jk , provided that the number of columns of A is equal to the number of rOws of B. We further define the (left and right) multiplication of a matrix A by an element a of K by aA =(aaik) and Aa =(aika). The set of all matrices over K of the same type forms a t K- module. The multiplication of matrices satis- fies the associative law and the left and right distributive laws with respect to addition. Thus the set of all n x n matrices over K forms a ring, which is called the total matrix algebra (or full matrix algebra) of degree n over K; it is usually denoted by Mn(K) or Kn. If K has the tunity element 1, then In is the unity element of Mn(K). The matrix whose components are all 0 is called the zero matrix and is denoted by the same symbol O. Suppose that K has the unity element 1. Let Eik be the matrix whose (i, k)- element is I and whose other elements are all O. Then every matrix A =(a ik ) in Mn(K) can be expressed uniquely as A = L aikEik> a linear combination of E;k' The matrix Eik is called a matrix unit. We have EijEkl=O ifj#k, EikEkl =Eil' and aEik=Eika for all aEK. Let A be a sq uare matrix in K. If there exists a matrix A -I such that AA -I =A -I A =1, then A -I is caIled the inverse matrix of A, and A is called a regular matrix (nonsingular matrix or invertible matrix). The inverse A -I is unique if it exists. When K is commutative, A is regular 994 if and only if its tdeterminant IA I is a tregular element of K; in particular, when K is a field, A is regular if and only if IAI #0. A square matrix that is not regular is called a singular matrix. Let A = (a ik ) be an m x n matrix. Then the n x m matrix (b ik ) such that b ik = a ki for all i and k is called the transposed matrix of A, and is usually denoted by 'A. Transposing a matrix amounts to changing rows into columns and vice versa. When K is commutative, AB = e implies 'e = 'B'A. A square matrix such that 'A = A is called a symmetric matrix, and one such that 'A = - A is called an alternating (skew-symmetric or antisymmetric) matrix, A square matrix A = (a ik ) is called an upper (lower) triangular matrix if a ik = 0 for i> k (kk). C. The Kronecker Product of Matrices We assume that the ring K is commutative. Let A be an m x n matrix (a ik ), let B be an r x s matrix (b jl ) in K, and write cA,Il=aikbjl by means of indexes .Ie = (i, j) and Ji = (k, 1). The Kronecker product of A and B, usually denoted by A @ B, is defined as the mr x ns matrix e = (c A ,Il)' By an appropriate choice of ..1. and Ji, it can be expressed as (""B a l2 B '''B) a 21 B a 22 B a2n B amlB a m2 B amnB Or C'A b l2 A h"A ) b 21 A b 22 A b 2s A br1A b r2 A brsA We have the formulas A @(B I +B 2 )=A @B I +A @B 2 , (AI +A2)@B=Al @B+A2 @B, c(A @B)=(cA)@B=A @(cB), (AI @Bd(A2@B2)=(AIA2)@(BIB2), provided that the sums and products can be defined. Matrices correspond to tJinear mappings of free K-modules ( Section L). The Kronecker product corresponds to the ttensor product of the corresponding linear mappings. D. The Rank of a Matrix Let A be an m x n matrix (a ik ) in a field K. If there exists a nonzero tminor of A of degree r, and if all minors of degree;;, r + 1 are equal to 
995 0, then the number r is calIed the rank of A. Denoting the rank of a matrix A by p(A), we have p(PAQ).s;p(A) for any matrices P, Q for which the product PAQ exists; the equality holds if P and Q are regular (square) matrices. The rank r of A is equal to the maximum number oftlinearly independent row vectors of A, and it is also equal to the maximum num- ber of linearly independent column vectors of A. The number n - p(A) is calIed the (column) nullity of the matrix A, It is equa1 to the di- mension of the linear space consisting of solu- tions of homogeneous linear equations Ax = 0, that is, the number offundamental solu- tions of these equations. (The row nullity m- p(A) of A is equal to the dimension of the linear space consisting of solutions of xA =0,) Even if K is not commutative, we can define the rank of a matrix A as the maximum num- ber of left (right) linearly independent row (column) vectors of A ( 256 Linear Spaces F). E. Elementary Divisors Let 0 be a tprincipa1 ideal domain (e.g., the ring Z of rational integers, or the tpolynomial ring K[xJ over a field K), and let A be a ma- trix with elements in o. Then, multiplying ap- propriate invertible matrices from the left and right, we can transform A into a diagonal matrix of the form e 1 o e 2 e, o o o where r denotes the rank of A, each ei#O, and each e i + 1 is divisible by e i , These el, e 2 , ..., e, are ca11ed the elementary divisors of A and are uniquely determined up to unit factors. This fact can be generalized to some extent for the case where 0 is noncommutative ( 256 Linear Spaces P). Using the concept of deter- minant, the elementary divisors can also be defined as fo11ows. Let d k denote the greatest common divisor of alI minors of degree k of A. The elements ei=ddd i - I (i = 1, ..., r; do = 1) of 0 are the elementary divisors of A. The numbers d i (1 .s; i.s; r) are ca11ed the determinant factors of A. F. Characteristic Polynomials and Eigenvalues Let A =(aid be a square matrix of degree n in a field K. The determinant F(x) = IxI - AI is a polynomial in x with coefficients in K, 269G Matrices called the characteristic polynomial of A. The algebraic equation F(x) = 0 is calIed the characteristic equation of A, and its roots ..1.1 , ).2, "', ).n are calIed the eigenvalues (proper valnes or characteristic roots) of A. The tdeter- minant of A, det A, is equal to I17=1 Ai' the product of the eigenvalues. The sum of eigen- values, L71 Ai = L7=1 a i ;, is calIed the trace (Ger., Spur) or diagonal sum of A, and is de- noted by tr A or Sp(A). In case K is talge- braicalIy closed, there exists a nonzero vector x that satisfies the equations Ax = AX if and only if A is an eigenvalue of A. This solution x is calIed an eigenvector (proper vector or charac- teristic vector) belonging to the eigenvalue ..1.. In particular, if all the a ik are real and A is symmetric, then alI eigenvalues of A are real, and the characteristic equation F(x) = 0 of A is called a secular equation. Every square matrix A satisfies its characteristic equation, i.e., F(A)=O. This is called the Hamilton-Cayley theorem, and it is useful in numerical calcula- tion of inverse matrices. Let A be a square matrix. As is clear from above, there exist monic polynomia1s f(x) ( #0) in K[xJ such thatf(A)=O. Let <p(x) be such a polynomial of the least degree, Then every f(x) is divisible by <p(x), This <p(x) is called a mini- mal polynomial of A. Let e 1 (x), ..., en(x) be the elementary divisors of the matrix xl - A. Then en(x) = <p(x). Now if F(x)=x n , A is said to be a nilpotent matrix, and if F(x)=(x-l)", A is said to be a unipotent matrix. G. Jordan Normal Form Two square matrices A and B are said to be similar to each other if B = P -I AP with a regular matrix P. The matrices A and Bare similar if and only if xl - A and xl - B have the same elementary divisors. Now suppose that A has elements in a field K and that every eigenvalue of A is in K. Then there is a regular matrix P with elements in K such that p-l AP is of the form ( AI P-1AP= 0 A 2 .) where ( A, 1" 0 ) A A, ,= .'1 ' o At (Ai is a square matrix of a certain degree, say m i ; ifmi= 1 then Ai=(XJ) This matrix P-IAP is called the Jordan normal form of A. It is a diagonal matrix if and only if the minimal 
269H Matrices polynomial of A has no multiple roots. If this is the case, A is said to be semisimple. H. The Exponential Function of a Matrix Let Av=(a;») (v =0,1,2, ...) be square matrices with elements in the complex number field C. The series Ao + Al +... + A, +... is said to be convergent if the series of components aJ) + aU) + .., + a!;) + .., is convergent for each (i,j). For every square matrix A, the following series is convergent: 1 2 I, I+A+A +...+-A +.... 2! v! We denote this series by expA. Then exp('A) ='(expA); exp(-A)=(expA)-I; and if AB =BA, exp(A+B)=expAexpB. Moreover, det(expA) = exp(tr A). Furthermore, if we set F(t) =exp(tA), then dF(t)/dt = AF(t) ( 431 Transformation Groups). I. Normal Matrices, Unitary Matrices, and Hermitian Matrices Let A = (a ik ) be a square matrix with elements in the complex number field C. Then the ad- joint matrix A * of A is the transposed conju- gate 'X = (a k ;), where Ii is the complex conju- gate of a. If AA * = A * A, then A is said to be a normal matrix. A matrix V such that V*V = I, that is, V-I = V*, is normal. Following G. Frobenius, we call it a unitary matrix (in the following discussion, we shall call it simply a u-matrix). A matrix H such that H* = H is also normal. It is called a Hermitian matrix (which we shall refer to as an h-matrix). An h- matrix P such that p2 = P is called a projec- tion matrix. The set of all u-matrices forms a tgroup with respect to matrix multiplication. If A is a normal matrix and V is au-matrix, V* AV is a normal matrix. If H is an h-matrix, Q*HQ is also an h-matrix for any matrix Q. Every normal matrix A can be transformed into a diagonal matrix by a suitable u-matrix (i.e., there exists a u-matrix V such that V* A V = V-I AV is a diagonal matrix), and con- versely, every matrix with this property is a normal matrix. Thus, in particular, any h- matrix or u-matrix can be transformed into a diagonal matrix by a suitable u-matrix. Moreover, every real symmetric matrix, which is naturally an h-matrix, can be transformed into a diagonal matrix by an orthogonal ma- trix; this will be defined later. If A I' .., , Am are mutually commutative normal matrices, we can transform them into diagonal matrices by the same u-matrix; that is, there exists a u- 996 matrix V such that V-IAjV is of diagonal form for all i. All eigenvalues of an h-matrix are real, and all eigenvalues of a u-matrix have absolute value 1. If all eigenvalues of an h-matrix H are posi- tive (positive or zero), then H is said to be positive definite (positive semidefinite). For an h-matrix H, expH is a positive definite h- matrix, and conversely, every positive definite h-matrix can be expressed as expH by a unique h-matrix H. Furthermore, every regular matrix A can be uniquely expressed as A = V H (or A==H'V') by a u-matrix V (or V') and a positive definite h-matrix H (or H'), and A is normal if and only if V H = H V. A square matrix A such that A * = - A is called a skew-Hermitian matrix (simply skew h-matrix or anti-Hermitian matrix). All eigen- values of a skew h-matrix are purely imaginary numbers. If A is a skew h-matrix, expA is a u- matrix. Conversely, if a u-matrix V lies in a sufficiently small neighborhood of the identity matrix I (i.e., if all elements of V - I have sufficiently small absolute values), V can be uniquely expressed in the form expA with a skew h-matrix A. J. Orthogonal Matrices An h-matrix whose components are all real is necessarily symmetric. A u-matrix whose com- ponents are all real, that is, a real matrix R such that R -I == t R, is called an orthogonal matrix. The totality of orthogonal matrices forms a group with respect to matrix multi- plication, If S is a real symmetric matrix, there exists an orthogonal matrix T for which T -1 ST is a diagonal matrix; that is, S can be transformed into a diagonal form by T. Every orthogonal matrix R can be transformed by an orthogonal matrix T into the diagonal form T-IRT -1 -1 PI P, _ ( cosOj Sin(}j ) p- J -sinO j cos(}j The determinant I R I of an orthogonal ma- trix R is either 1 or -1. If IRI = 1, then R is called a proper orthogonal matrix. If A is a real alternating matrix, then exp A is a pro- per orthogonal matrix; conversely, any pro- for j = 1, ... , t. 
997 per orthogonal matrix in a sufficiently small neighborhood of the identity matrix I can be expressed uniquely in this form. Moreover, there exists a one-to-one correspondence be- tween real alternating matrices A and proper orthogonal matrices R without eigenvalues equal to -1, given by R=(I -A)(I +A)-I and A =(1 -R)(I + R)-I. This is called a Cayley transformation. A (complex) square matrix Twith the property 'T= T -I is called a complex ortho- gonal matrix. This matrix can be uniquely expressed as T= R exp(iA), where R is an orthogonal matrix, A is a real alternating matrix, and i 2 = -1. K. Infinite Matrices By an infinite matrix with elements in a ring K, we mean an array of elements of K with infinite numbers of rows and columns as follows: c: aa, "' ) , where 0", r are indices denoting the row and column for the element aa, and each index ranges over an infinite set r. Equality, ad- dition, and multiplication by an element of K of infinite matrices are defined in the same manner as for ordinary matrices. Generally, however, multiplication of infinite matrices cannot be defined. If the elements of each row (column) of an infinite matrix are zero except for a finite number of them, then it is called a row (column) finite matrix. For row (column) finite matrices A = (aa,) and B=(b a ,), the prod- uct AB=(c a ,) is defined by ca,='L.,aa,b n for all 0", rE r. By this definition the totality of such infinite matrices forms a ring. Now let K be the complex number field and r the set of natural numbers, and consider an infinite matrix (a ik ). This is called a bounded matrix if the following inequality holds for arbitrary Xi and Yk: I m,n I i'1 aikxiYk ( m ) 1/2 ( n ) 1/2 .s;M iIIXiI2 kIIYkI2 , where M is a constant. The set of all bounded matrices forms a ring. If <p" <P2' ... is a com- plete orthonormal system of a tHilbert space f), then for any continuous linear operator A we have A<pk = LI <Piaik' and A corresponds to a bounded matrix (aid. By this correspon- dence we have a ring isomorphism between 269L Matrices the ring of continuous linear operators of f) and the ring of bounded matrices ( 197 Hilbert Spaces). L. Linear Mappings Let L, L' be free right K-modules with bases ai, ..., am and b l , "', b n , respectively. Then, to a tlinear mapping f of L into L', there corre- sponds an n x m matrix A such that f:'LaiCi=(al,...,am) ( I ) f->(bl,...,bn)A ( I C m C m Conversely, for any n x m matrix A with ele- ments in K, there is a unique linear mapping f as above (with respect to these bases). In this sense, if a linear mapping 9 of L' (into some free K-module) corresponds to a matrix B, then the product go f corresponds to the prod- uct BA. (For left modules, we can observe transpositions.) If a'1, ... , a and b'I' "', b are another pair of bases, then there are regu- lar matrices P, Q such that (a'" ... , a) = (a" ..., am)P and (b'" ..., b)= (b l ,..., bn)Q. With respect to these new bases, the corre- sponding matrix to f is Q -I AP. In the partic- ular case where L == L', a i == b i (for all i), we see that (1) under a fixed basis of L, a linear trans- formation of L is nothing but a matrix of degree m and (2) a change of basis corresponds to the transformation A f-> p- I AP by the base- change matrix P. Thus, notions applicable to matrices can be applied to linear mappings. For instance, an eigenvector (characteristic vector or proper vector) of a linear transformation <P of L is an element a( #0) of L such that <p(a) =a(X «(X E K). If K is a field, then the characteristic polynomial X(X), and therefore also the eigen- values (proper values or characteristic roots), are invariant under base change. We add a little more on the case where K is a field. For an eigenvalue (X of <p, the subspace N. = {aELI<p(a)=(Xa} of L is called the eigenspace belonging to (x. Furthermore, the space N = {aELI(<p-(Xl (a)=O for some k>O} is a subspace of L containing N. and is sometimes called an eigenspace in a weaker sense. If all roots of X(X)=O are in K, then L is decom- posed into the direct sum of N , ... ,N , where (XI, '" , (xs are the distinct oots of'the equation X(X)=O. The dimension of N. is equal to the multiplicity of the root (Xi i the equation X(X)=O. This fact is the basis of the Jordan normal form. Minimal polynomials of linear mappings are well defined. A linear transformation <P of L is called semisimple if L has the structure of a tsemi- 
269M Matrices simple K[X]-module determined by Xa= <p(a). Hence <p is semisimple if and only if the minimal polynomial J1(X) of <p has no square factor different from constants in K[X]. In particular, the condition that all roots of J1(X) =0 are in K and simple is sufficient for semi- simplicity of <po This condition is equivalent to the condition that <p is represented by a tdiagonal matrix relative to some basis of L. Then L is decomposed into the direct sum of the eigenspaces N., and <p is said to be diagonalizable. If K is a tperfect field, any linear transfor- mation <p of L is represented as the sum of a semisimple linear transformation <Ps and a nilpotent linear transformation <Pn: <p = <Ps + <Pn (Jordan decomposition). Also, <Ps and <Pn com- mute with each other, and they are uniquely determined by <po We call <Ps and <Pn the semi- simple and nilpotent component of <P, respec- tively. Furthermore, <Ps and <Pn can be repre- sented as polynomials of <P without a constant term. The transformation <P is nonsingular if and only if <Ps is nonsingular. A nonsingular linear transformation <P is called unipotent if <Ps is equal to the identity transformation l L . Any nonsingular linear transformation <P is uniquely represented as a product of a semi- simple linear transformation and a unipotent linear transformation, which are commutative: <P = <Ps<Pu (multiplicative Jordan decomposition). Here <Ps is the semisimple component and <pu = 1 L + <Ps-l <Pn is unipotent (<Pu is called the unipotent component of <P;  13 Algebraic Groups). M. Linear Equations Let K be a tfield, h= ailX I +... + ainX n (i= 1, ..., m) be m tlinear forms: Kn......K, and b l , ... , b m be m given elements of K. Then a set of n elements x I' ... ,X n of K, or an n-tuple x =(x" ..., Xn)E K n , satisfying the system of m linear equations ail XI +... + ainX n =b n , i = 1,..., m, (1) is called a solution of equations (1). In partic- ular, a system with b l = ... = b m = 0, i.e., ail Xl +... +ainxn=O, i = 1,..., m, (2) is called a system of linear homogeneous equa- tions. In the theory of linear equations, the field K need not be commutative. In those cases where K is noncommutative, we have to distinguish between right multiplication and left multiplication. We make this distinction by adding the word "right" or "left" in paren- theses whenever it is necessary to do so. If XI, ..., x, are solutions of the system (2), 998 then any of their (right) linear combinations LixiCi (CiEK) is also a solution of (2), so that the solutions of (2) form a t(right) linear space Lover K. L is the kernel of the (left) linear mapping Kn-->K m given by x-->(fdx), ..., fm(x)), Let s=dimL. If s=O, the system (2) has only X = 0 as a solution, called the trivial solution. If s > 0, L has a basis {Xl' ..., xs}. Then x" ... , Xs are (right) linearly independent solutions of (2), and every solution of (2) is a (right) linear combination of them. We say that Xl' ... 'X s form a system of fundamental solutions of (2). Let r denote the maximum number of (left) linearly independent forms in the set UI '/2' ... ,1m}. Then we have s + r = n, so that the system (2) has a nontrivial solu- tion if and only ifr<n, and the number of linearly independent fundamental solutions is nr. Accordingly, if the number of equations is less than the number of unknown quantities, there always exists a nontrivial solution. Suppose now that the system (1) has a solu- tion Xo. Then every solution X of (1) can be written Xo + y for a solution y of (2); and con- versely, for any solution y of (2), Xo + y is a solution of (1). Furthermore, in order for (1) to have a solution, it is necessary and sufficient that Licibi=O whenever LiCih=O (CiEK), i.e., that the following two matrices have the same trank: (al! "'l A= ami a mn ( a" a ln h, ) A= ami a mn b m In particular, equation (1) has a unique solu- tion if and only if A and A have the same rank n. If m = n (i.e., A is a square matrix), (1) has a unique solution if and only if A has the inverse A -I, and then the solution is given by 'x=A -I 'b, where b=(b l ,b 2 , ...,b n ). We have also the following result. Let K' be an textension field of K. If a system of linear equations in K has a solution in K', it has a solution contained in K. In particular, if (2) in K has a nontrivial solution in K', it has a nontrivial solution already in K, and any system of fundamental solutions in K is itself a system of fundamental solutions in K'. Suppose now that K is commutative. Then we have an explicit formula for solving equa- tion (1) by means of tdeterminants. Consider first the case In = n. Let  = I A I, the determinant of the matrix A. If  # 0, then (1) has a unique solution, given by xk=d (k == 1, ..., n), where k is the determinant ob- tained from  by replacing its kth column, 
999 a 1k , a 2b "', a nk by b l , b 2 , ..., b n . This is called Cramer's rule. To consider the general case, let r be the common rank of A and A. We may assume that I aikl # 0 (i, k = 1, ... , r), appropri- ately changing the order of the equations and unknowns. Then by Cramer's rule we can solve the first r equations for x I' '" ,x" assign- ing arbitrary values to x r +!, ..., xn' In order for equation (2) to have a nontrivial solution, it is necessary and sufficient that the rank of A be less than n, and hence that I A I = 0 if A is a square matrix. For geometric applications of the theory of linear equations  7 Affine Geometry, and for numerical solution  302 Numerical Solution of Linear Equations. N. Positive Matrices A positive (or nonnegative) matrix is a matrix whose elements are all positive (or nonnega- tive) real numbers. A nonnegative square ma- trix A is said to be reducible (or, more pre- cisely, reducible by permutations) if by certain permutations of rows and columns A is re- duced to a form ( AI 0 ) o A 2 with square matrices AI, A 2 ; otherwise A is called irreducible. Theorem (Perron). For each positive matrix A, there is a positive real number r such that (1) r is a simple root of the characteristic equation for A and (2) every other eigenvalue of A has absolute value less than r. Furthermore, (3) if v is an eigenvector corresponding to r, then the components of v are all positive or all negative. Theorem (Frobenius). If A is an irreducible nonnegative matrix, then there is a positive number r satisfying (1) above and such that every eigenvalue of A has absolute value at most r. Furthermore, (3) above also holds in this case. Theorem. If A is a nonnegative matrix, then there is a nonnegative real number r that is an eigenvalue and such that absolute values of other eigenvalues are at most r. If v is an eigenvector corresponding to this r, then the components of v are all nonnegative or all nonpositive. References See references to 103 Determinants. 270C Measure Theory 270 (X.12) Measure Theory A. General Remarks Roughly speaking, a measure is a nonnegative function of subsets of a space completely addi- tive in the sense that the measure of the union of a sequence of mutually disjoint sets is the sum of the measures of the sets. This concept was introduced as a mathematical abstraction of length, area, volume, mass distribution, probability distribution, etc. Measure theory is the basis of integration theory, and these two theories playa fundamental role in modern mathematics, particularly in analysis, func- tional analysis, and probability theory. B. Important Classes of Sets Let X be an abstract space. The power set of X, the class of all subsets of X, is denoted by 2 x . Let  be a nonempty subclass of 2 x . If  is closed under intersections, i.e., if A, B E  implies An B E, then  is caIled a multipli- cative class on X. If  is closed under finite unions and complements, then  is called an algebra (or finitely additive class or field) on X. If  is closed under countable unions and complements,  is called a O"-algebra (count- ably additive class, completely additive class, or Borel field). If  is closed under montone limits, i.e., if for every sequence AnE, n= 1, 2, ... , monotone increasing or monotone de- creasing, the union or intersection of the se- quence belongs to , then  is caIled a mono- tone class on X. If X E  and if  is closed under countable disjoint unions and proper differences (A........B for A =:> B), then  is called the Dynkin class on X. For any <£:c2 x the least O"-algebra that includes <£: is called the 0"- algebra generated by <£:; this is written as 0" [<£:]. Similarly, we denote by \JJl[<£:J (D[<£:J) the monotone class (Dynkin class) generated by <£:. The foIl owing theorems are often useful. The monotone class theorem: If <£: is an algebra, then \JJl[<£:J =O"[(£:]. The Dynkin class theorem: If <£: is a multiplica- tive class, then 1) [<£:] = 0" [<£:]. C. Measurable Spaces (Borel Spaces) A space X endowed with a O"-algebra !B on X is called a measurable space (or Borel space) and is denoted by (X, !B). A set belonging to !B is calIed a !B-measurable set (or Borel set) in (X, !B). It is obvious that the empty set 0 and 
270 D Measure Theory the whole space X are !B-measurable. !B- measurability is preserved by complements, namely, if A is !B-measurable, then so is A C . Similarly !B-measurability is preserved by countable set operations such as countable unions, countable intersections, differences, etc., but not always by uncountable unions. For a sequence {An} (nEN) of subsets of X, the superior limit (or limit superior) and the inferior limit (or limit inferior) are de- fined by limsuPnoo An= n:,(UmAn) and liminfnoo An = Ud nmAn)' respectively. The symbols lim and lim may be used in place of Iim sup and liminf, respectively. The inferior limit is always contained in the superior limit, and when they coincide, they define the limit of {An}, which is denoted by limnoo An" If {An} is monotone increasing (or decreasing), limnoo An is U, An (or n, An) ( 87 Con- vergence L). !B-measurability is preserved by these limits. Let Y be a subset of a measurable space (X, !B). Then the c1ass!Bn Y={Bn YIBE!B} is a O"-algebra on Y. Y is regarded as a measur- able space endowed with !B n Y unless stated otherwise. Suppose that we are given a family of measurable spaces (X,t.!B.J, ..1.EA, where A is an arbitrary index set. Let X be the tCartesian product of X A , },EA, and n A the projection from X to X A for each ..1.EA. Then X is re- garded as a measurable space endowed with the O"-algebra generated by the class {n;'(B A )I..1.EA, B;,E!BJ, denoted by I1 AEA !B A , unless stated otherwise. Let T be a topological space. The O"-algebra on T generated by the open subsets of Tis called the topological O"-algebra (Borel field) on T, denoted by !B(T). Some authors define !B(T) to be the O"-algebra generated by compact subsets of T, which is equivalent to the defi- nition mentioned above if T is locally compact and O"-compact. T is regarded as a measurable space endowed with !B(T) unless stated other- wise. Hence R n is a measurable space (R n , !B n ), where !B n = !B(R n ), A !B(T)-measurable subset of T is called a Borel subset of T. If the tchar- acteristic function of a set is a tBaire function, then the set is a Borel set. Such a set is called a Borel set in the strict sense (or a Baire set). The union (intersection) of at most a countable number of closed (open) sets is called an Fa set (Go set). Both Fa sets and Go sets are Borel sets. Denote by 3'0 the class of all closed subsets of X and by ffio the class of all open subsets, and define classes 3'{ (ffi{) for an ordinal number  to be the sets expressible as a countable union (intersection) of the sets belonging to Un<{3'n(Un<{ffi n ). Then by the construction, every set belonging to either 3'  or ffi { is a Borel set. Ifw, «, then 3'{=3'w" ffi{=ffiw,. (For the definition of w,  312 Ordinal Num- 1000 bers.) In particular, in a tcompletely normal topological space (for example, a metric space), 3'. c ffi p , ffi. c 3'p if ex < [3, and both U{<w, 3'{ and U{<w, ffi{ coincide with the Borel field. In this case, an arbitrary Borel set B is a Baire set, and thus can be classified according to the tBaire class of the characteristic function of B. It can also be classified according to the smallest ordinal number  with respect to which BE3'{ (or BEffi{). If S is a subset of a topological space T, then S is regarded as a topological space with the relative topology and so is regarded as a measurable space (S, !B(S)). Hence every subset of R n is regarded as a measurable space. S is also a measurable space (S, !B(T) n S), as men- tioned above. Since !B(S)=!B(T)nS, no con- tradiction arises. Let T A , ..1.E A be a family of topological spaces. Then the topological prod- uct T of this family is regarded as a measur- able space (T, !B(T)) or (T, I1 AEA !B(T A )). It should be noted that these two O"-algebras may be different. They are the same if T has a countable open base. Let (Xi' !B;), i = 1, 2, be measurable spaces. A bijective mapping f from Xl to X 2 is called a Borel isomorphic mapping if f(!Bd= {f(B,) I B, EO!B,} =!B2' If there exists such an f, then (X 2 , !B2) is said to be Borel isomorphic to (Xl' !B d. Borel isomorphism is an equivalence relation. A measurable space is called a stan- dard measurable space (or a standard Borel space) if it is Borel isomorphic to a Borel sub- set of R' (viewed as a measurable space as explained above). A striking fact is that a standard measurable space is Borel isomor- phic to one of the following spaces: {I, 2, .., , n} (n= 1,2,...), N, [0, 1]. A measurable space which is Borel isomorphic to an tanalytic sub- set of R' is called an analytic measurable space. Every complete separable metric space, viewed as a measurable space, is standard. More generally, every tLuzin space is a stan- dard measurable space. Hence the spaces of tdistributions, 9' and [I", are standard mea- surable spaces, being Luzin spaces. Every tSuslin space, viewed as a measurable space, is analytic. D. Measure A real-valued set function m defined on a finitely additive class \JJl on a space X is called a finitely additive measure (or Jordan measure) if it satisfies the following two conditions: (I) O.s;m(E).s; 00, m(0)=0; (11 0 ) A, BE\JJl, An B=0 imply m(AUB)=m(A)+m(B). A set function 11 defined on a completely additive class !B in a space X is called a mea- sure (completely additive measure or O"-additive 
1001 measure) on 1.8 (or on X) if it satisfies the fol- lowing two conditions: (I) OJl(E) 00, Jl(0) =0; (II) E.E!B (n= 1,2,...), Ejn Ek = 0 (j #-k) imply Jl(U.%l E.) = 1:.%1 Jl(EJ (complete addi- tivity or u-additivity or countable additivity). The triple (X, 1.8, Jl) or the pair (X, Jl), where X is a space, 1.8 a completely additive class of subsets of X, and Jl a measure, is ca1led a measure space. We ca1l Jl or (X, 1.8, Jl) finite or bounded if Jl(X) < 00 and u-finite if there exists a sequence {X.} satisfying Jl(X.) < 00 and U.%lX.=X. A set AE!B is ca1ledatomic if (i) Jl(A»O and (ii) BE!B and BcA imply that Jl(B)=O or Jl(B) = Jl(A). A measure space (X, 1.8, Jl) is ca1led nonatomic if no element of 1.8 is atomic. The simplest bounded measure is given by defining m(A)= 1 if aEA and m(A)=O if aA, where a is a given point in X. Such a measure is ca1led the Dirac (j-measure. A mea- sure m with m(X) = 1 is ca1led a tprobability measure ( 342 Probability Theory). Equalities appearing in (11 0 ) and (II) include the possibility that both sides equal 00. In measure theory and integration theory, ad- dition and multiplication involving :too are carried out as fo1lows (a denotes a real num- ber): (:too)+a=a+(:too)= :too; (:too)-a= :too, a-(:too)= =too; (:too)+(:too)= :too; (:too)-(=too)= :too; a.(:too)=(:too).a= :too if a>O; a.(:too)=(:too)'a= =too if a< 0; ( + 00 ) . ( + 00 ) = + 00. (Sometimes it is further agreed to put O.(:too)=(:too)'O=O.) A !B-measurable set in a measure space (X, 1.8, Jl) is also ca1led Jl-measurable (or sim- ply measurable). For a sequence {A.} of Jl- measurable sets the fo1lowing conditions hold: (i) Jl(lim inf.oo A.)  lim inf.oo Jl(A.); (ii) if Jl(U.%'oA.) < +00 for some no, then Jl(1im sup.oo A.)  lim sup.oo Jl(A.); and (iii) if lim.oo A. exists and the hypothesis of (ii) holds, then Jl(lim.oo A.) = lim.oo Jl(A.). For a finitely additive measure Jl on a completely additive class 1.8 to be a completely additive measure, it is necessary and sufficient that Jl(lim.oo A.) = lim.oo Jl(A.) hold for an arbi- trary monotone increasing sequence {A.} of sets in !B. A set satisfying Jl(E) = 0 is ca1led a Jl-null set or a null set. The u-algebra generated by 1.8 and the Jl-nu1l sets is ca1led the Jl-completion of 1.8, denoted by !t. The measure obtained by extending Jl onto !t is ca1led the completion of Jl, denoted by ji, and the measure space (X, !t,ji) is ca1led the completion (or Lebesgue extension) of (X, 1.8, Jl). In particular, if!t =1.8 (and so ji = Jl), i.e., if a subset of a Jl-nu1l set always belongs to 1.8, then Jl is ca1led a com- plete measure and (X, 1.8, Jl) is ca1led a complete measure space. If (X, 1.8, Jl) is complete, then 1.8 is closed under tanalytic operations [2]. Lei P be a property concerning a point of a measure 270G Measure Theory space (X, 1.8, Jl). If the set of a1l points for which a property P fails to hold is a nu1l set, then we say that P holds almost everywhere (a.e. or at almost aU points). Sometimes we use the expression "almost P" to describe the same situation. E. Construction of Measures A set function Jl*(A) defined for every subset of X is ca1led a Caratheodory outer measure (or outer measure) on X if it has the fo1lowing three properties: (i) OJl*(A) 00, Jl*(0) =0; (ii) A c B => Jl*(A)  Jl*(B); (iii) Jl*(U.%1 A.)  1:.%1 Jl*(A.). Because of (iii), the inequality Jl*(B)  Jl*(B n A) + Jl*(B n A C ) always holds; if the equality holds for every B, then A is ca1led measurable with respect to Jl*. It fo1lows from this definition that a set A satisfying Jl*(A) = 0 is always measurable with respect to Jl*. The class 1.8 of an sets measurable with respect to Jl* is a completely additive class, and if we define Jl(A) = Jl*(A) for A belonging to 1.8, then Jl(A) gives a complete measure on !B. This measure is ca1led the Caratheodory measure indced by Jl* or the generalized Lebesgue measure. A finitely additive measure m defined on a finitely additive class 9Jl is ca1led completely additive when A.E9Jl, AjnA k = 0 (j#-k), U.%l A.E9Jl imply m( U:'=1 A.) = 1:.%1 m(A.). If by means of such an m we define Jl*(A) for an arbitrary A c X to be the infimum of a1l pos- sible values 1:.%1 m(A.), where A c U.%l A. (A.E9Jl), then Jl* gives a Caratheodory outer measure. From this Jl* a measure Jl on a com- pletely additive class 1.8 is induced as described above, and we have 1.8:::> 9Jl and Jl(A) = m(A) for AE9Jl. Therefore m can be extended to a mea- sure Jl on u[9Jl] (E. Hopf's extension theorem). In particular, if X is a countable union of sets of finite measure, then this extension is unique. F. Regularity of Measures Let (X, !B) be a measurable space. The comple- tion of a measure on 1.8 is ca1led a !B-regular (or regular) measure on X. Let T be a Haus- dorff topological space and Jl be a !B(T)- regular measure. If for every AE!B(T) with Jl(A) < 00 and every 8>0 there exists a com- pact set K c A such that Jl(A - K) < 8, then Jl is ca1led a K-regular measure. If T is a Sus- lin space, every !B(T)-regular measure is K- regular [4]. G. Lebesgue Measure In the n-dimensional Euclidean space R', the volume of a left open interval 1= {x I a k < 
270H Measure Theory X k :(;b k , k= 1,2,..., n} (-00 :(;a k <b k :(; 00) is defined by m(I) = I1kdbk - a k ). Let 9Jl o be the collection of all sets that can be represented as the finite union of disjoint left-open intervals, and for such expression A = Ujl Ij in 9Jl o define m(A) = Lj=1 m(Ij)' Let 9Jl = 9Jl o U {0} and m(0)=0. Then m gives a finitely additive measure on 9Jl that is completely additive. Therefore m determines an outer measure J1*, which in turn determines a measure J1. This J1* is called the Lebesgue outer measure (or simply outer measure). Sets that are measurable with respect to J1* are said to be Lebesgue measur- able (or simply measurable), and the measure J1 is called the (n-dimensional) Lebesgue mea- sure (or simply measure). Every interval is measurable, and its measure coincides with its volume. Open sets, closed sets, and Borel sets are all measurable. More generally, suppose that an outer measure J1* defined on a metric space X with a metric d satisfies the condi- tion: If d(A, B) > 0, then J1*(A U B) = J1*(A) + J1*(B). Then every closed subset of X is J1*- measurable, and therefore so is every Borel subset. Here d(A, B)= inf{d(a, b) I aEA, bEB} denotes the distance between the two sets A and B. A measure J1 defined on the class of all Borel subsets of a topological space X is called a Borel measure. The cardinality of the set of all Lebesgue measurable subsets of Rn is 2', while the cardinality of the class of Borel sets is c (here c is the cardinal number of the tcontinuum, i.e., the cardinal number of R). Therefore there exists a Lebesgue measur- able set that is not a Borel set. It follows immediately from the definition that the Lebesgue measure is K-regular, so for every Lebesgue measurable set A we can find an tF a - set B and a tG.-set C such that Be A e C and J1(C-B)=O. Historically, for a bounded subset A of R n , C. Jordan defined m(A) to be the infimum of all possible values m(B), where BE9Jl and B=> A, and m(A) to be supremum of all possible values m(B), where BE9Jl and BeA. He called m(A) the outer volume of A and m(A) the inner volume of A (in the case of R 2 , the outer area and inner area, respectively). When m(A) = m(A), A is called Jordan measurable, and this common value is defined to be the Jordan mea- sure (Jordan content) of A. Jordan measure is only finitely additive, and was found to be un- satisfactory in many respects. It was Lebesgue who modified this notion and introduced completely additive measures. Jordan measur- able sets are always Lebesgue measurable. Using the taxiom of choice, a set that is not Lebesgue measurable can be constructed (G. Vitali). For example, a set obtained by choos- ing exactly one element from each coset of the additive group of all rationals in the addi- 1002 tive group of the reals is not measurable [3, pp.67-70]. H. Product Measure When two a-finite complete measures J1x and J1y are defined on completely additive classes x and y on X and Y, respectively, an ele- ment C belonging to the smallest finitely addi- tive class R in the Cartesian product that contains {A x BIAEX,BEy} can be repre- sented as a finite disjoint union C = Uj=1 (Aj x B j ) (AjEX,BjEy). If we define v(C)= Lj=1 J1x(A)J1y(B) (here we agree to put O' 00 = 0), then this value is independent of the way the set C is represented, and v defines a com- pletely additive measure on R By extending v by means of Hopf's extension theorem, we obtain a (complete) measure space, called the (complete) product measure space obtained from the measure spaces (X,x,J1x) and (Y, y, J1y). The measure obtained in this way on the space X x Y is called the product mea- Sure of J1x and J1y and is denoted by J1x x J1y. If we denote by 9Jl p the class of all Lebesgue measurable subsets of the p-dimensional Eu- clidean space RP and by mp the p-dimensional Lebesgue measure, then the (complete) product measure space of (RP, 9Jl p , m p ) and (Rq, 9Jl q , m q ) is (R(p+q), 9Jl p + q , m p + q )' The product measure space of any finite number of measure spaces (Xi'i,J1i) (i= 1, ...,n) is defined similarly. Let X), (AEA) be spaces with an index set of arbitrary cardinality. For the product space X = I1),EA X)" an n-cylinder set, or simply a cylinder set, is a set of the form A x I1),fP,....,),.) X), (A e X),j X ... x X),J If a finitely additive class 'lI) is given for each X)" the class of all subsets that can be represented as the union of a finite number of cylinder sets of the form AI x A 2 X ... X An X I1),fPj.....),n)X), (AjE'lI),j'j= 1,2, ...,n) is a finitely additive class in the space X. When each 'lI), is completely additive, the completely additive class 'lI generated by this finitely additive class is called the product of the completely additive classes 'lI), and is de- noted by I1),EA 'lI),. When a measure space (X)" )"J1),) with J1AX;.) = 1 is given for AEA, a mea- sure J1 can be defined in the following way on the completely additive class  = I1),EA ), in the product space X: To begin with, for a cyl- inder set of the form AI x .., x An X X' (here AjE),j and X'=I1 MP1 .....),.}X),), we define J1(A1 x... x An X X')=J1)j(A 1 ) ...J1),,(An). If we extend J1 to the finitely additive class (t con- sisting of all sets that can be represented as the finite union of such cylinder sets, then this extension gives a completely additive measure on (t, and therefore, by Hopf's extension theo- rem, there exists a unique extension to a mea- 
1003 sure 11 on , and 11 satisfies I1(X) = 1. We denote this 11 by 11 = I1 AEA I1A' I. Radon Measure Let X be a tlocally compact Hausdorff space,  the topological a-algebra on X, and Co(X) the real linear space of all real-valued con- tinuous functions f on X having tcompact support (i.e., the closure of the set {x I {(x) l' O} is compact). A (real) tlinear functional ({J de- fined on Co(X) is called a positive Radon mea- sure if ({J(f)0 whenever fO. For such a functional ({J there corresponds a Borel mea- sure 11 on X for which ((J(f) = Ix f dl1 holds for every f E Co(X). Lebesgue measure m n is re- garded as a positive Radon measure on Rn. If X is a-compact (i.e., X can be represented as the countable union of compact sets), then the property ((J(f) = S x f dl1 for all f E C o ( X) defines the measure 11 uniquely on the class of Borel sets of X. A linear functional on Co(X) that can be written as the difference of two positive Radon measures is called a Radon measure. For a linear functional ({J defined on Co(X) to be a Radon measure, it is necessary and sufficient that for an arbitrary f E Co (X), the set {((J(g) Ilgl  Ifl,gE Co(X)} be bounded. Equiva- lently, it is necessary and sufficient that the restriction of ({J to the subspace of all functions in Co(X) having their support in a fixed com- pact subset of X must be continuous with respect to the ttopology of uniform conver- gence. Therefore, if X is compact, an arbitrary continuous linear functional on C(X) is a Radon measure. L. Schwartz investigated Radon measures on spaces that are not locally compact [6]. J. Measurable Functions When a completely additive class  on a space X is given, a function f defined on a - measurable set E and taking real (and possibly :too) values is called a -rneasurable function on E if for an arbitrary real number IX, the set {xlf(x»IX} is -measurable. The condtion f> IX may be replaced by f IX,! < IX,!  IX. A function f can also be defined to be - measurable if the tin verse image under f of any Borel set is -measurable. When f and g are -measurable, so are af +bg (a,b con- stants),!' g, fig, max(,f, g), min(f, g), Ifl P (p a constant), whenever they are well defined. The superior and inferior limits of a sequence of- measurable functions are also -measurable. In a complete product measure space of two a- finite measure spaces, a function f(x, y) may fail to be measurable as a function of two variables even if it is measurable with respect 270 J Measure Theory to each of the variables x and y separately. For example, let d be the class of all closed subsets F of R 2 satisfying m 2 (F» 0, Fe [0,1] x [0, 1]. Using the fact that the cardinality of IY does not exceed the cardinal number of the con- tinuum, we can prove by ttransfinite induction hat the elements of !1 can be indexed as F,  < c,  being the cardinality of , and that for every F E '6 we can pick two points z = (x, y ), z = (x, y) in such a way that if F l' F" then x, x, x" x are all distinct and y, y, y" y are all distinct. Furthermore, we can prove that the set E consisting of all such z  is not measurable. Therefore, denoting the charac- teristic function of the set E by f(x, y), f(x, y) is not measurable; but if we fix x (resp. y), then as a function of y (resp. x), f(x, y) is measurable, since f(x;) (resp. (C, y)) is always 0 except possibly at one point. If  is the class of all Lebesgue measurable sets or the class of all Borel sets, then a -measurable function is called a Lebesgue measurable function (or simply measurable function) or a Borel mea- surable function, respectively. In Euclidean space, the class of all Borel measurable functions coincides with the class of all Baire functions, and an arbitrary Le- besgue measurable function is equal almost everywhere to a Baire function of at most the second class. For a function f that is finite almost everywhere on a Lebesgue measur- able set E to be Lebesgue measurable, it is necessary and sufficient that for an arbitrary I: > 0 we can find a closed subset F such that m(E - F) < f. and f is continuous on F (Luzin's theorem). If a sequence {fn} of -measurable func- tions on a measure space (X,, 11) converges almost everywhere to f on a set E with I1(E) < 00, then for an arbitrary f. >0 we can find a set F (F c E,F E) such that I1(E - F) < f. and fn converges uniformly on F If X = Rn and  is either the class of Borel sets or the class of Lebesgue measurable sets, then the set F can be chosen to be a closed set (Egorov's theorem). For a finite measurable function f(x) defined on the real line, there exists a sequence {h n } such that limnoof(x + h n ) = f(x) almost every- where (H. Auerbach). The functional equation f(x + y) = f(x) + f(y) has infinitely many nonmeasurable solu- tions (G. Hamel;  388 Special Functional Equations). Let \!Ii, i = 1, 2, be a-algebras on X;, i = 1, 2, respectively. A mappingf:X 1 ->X 2 is said to be measurable \!It!\!I 2 if f-1(A 2 )E\!I 1 for every A 2 E \!I2' When \!Iz generates \!I2,!: XI ->X 2 is measurable \!It!\!I 2 if r l (A;)E\!I 1 for every A; E\!I;. For example, a mapping f:R1->R I is Borel (Lebesgue) measurable if and only if 
270K Measure Theory fis measurable 1/I(9Jlr!I), where 1 and 9Jl 1 denote the Borel subsets and the Le- besgue measurable subsets of R 1. Measurabil- ity of mappings is preserved by compositions, namely, if f:X 1 ->X 2 and g:X 2 ->X 3 are mea- surable \!I 1 /\!I 2 and \!I 2 /\!I 3 , respectively, then gof:X,->X 3 is measurable \!I 1 /\!I 3 . If the pro- jection na: I1;EI X;->Xa is measurable, then I1; E1 \!Ij\!I a . Since n;-I(A,) (iEI, AiE\!I,) generate I1;EI \!I;, a mapping f: X -> I1 iE1 X; is measur- able \!I/I1 iE1 \!Ii if ni 0 f: X -> Xi is measurable \!I;/\!I for every i E I, where \!I is a a-algebra onX. K. Image Measures Let 11 be a regular measure on a measurable space (X, xJ, i.e., the completion of a measure on x, Then every mapping f: X -> Y measur- able x/y induces a a-algebra on Y:  = {Be YIF 1 (B) is l1-measurable}, and a measure on : v(B) = I1U- 1 (B)). The measure v is called the image measure of 11 under the mapping f; this is denoted by I1F' or f'l1. Obviously, v(Y)=I1(X), and v is complete. Although  includes y by virtue of the measurability of 1, v is not regular in general. However, v is regular if 11 is a bounded measure and if both (X, x) and (Y, y) are analytic measurable spaces ( 22 Analytic Sets I). Therefore the image measure of a regular probability measure under a measur- able mapping is also a regular probability measure in practically all useful cases. L. Related Topics (i) Integration. For a nonnegative measurable function f(x) on (X,, m), we can define the integral of f(x) on a set E E, denoted by hf(x)l1(dx), hf(x)dl1(x), hf dl1, or JEi For a real measurable function f(x), the integral is defined to be eq ual to J d + dl1- J E r dl1, if at least one of these integrals is finite, where f+ and f- are the positive and negative parts of f ( 221 Integration Theory B). (ii) Fubini's theorem for measures. Let (X, , 11) be the complete product measure space of (Xi' ;, 11,) (i = 1,2), where both 111 and 112 are a-finite and complete. For EE we define the sections E(xrJ and E(X2) by E(xrJ= {x21 (XI' X 2 )E E}, E(x 2 ) = {xII (x" X 2 )E E}. Then we have E(XrJE2 for almost all (111)X1 E 1004 XI' E(X2)E1 for almost all (112)x 2 EX 2 , and I1(E) = f 112(E(xrJ)111 (dx 1 ) X j = f ill (E(x 2 ))112(dx 2 ) X 2 ( 221 Integration Theory E). (iii) The Radon-Nikodym theorem and the Lebesgue decomposition theorem for measures. Let 11 and v be a-finite measures on (X, ). If I1(E)=O implies v(E)=O, then v is said to be absolutely continuous with respect to 11, denoted by v-<I1' v-<11 if and oniy if v is ex- pressible in the form v(E) = hf dl1, where f is a nonnegative -measurable function (the Radon-Nikodym theorem). Such an [is uniquely determined up to l1-measure 0 and is called the Radon-Nikodym derivative of v with respect to 11, denoted by dv/dl1. The concept opposite to absolute continuity is singularity. v is said to be singular with respect to 11 if there exists an EE such that V(£<)=I1(E)=O. If 11 and v are two arbitrary a-finite measures, then v is expressible uniq uely as the sum of an absolutely continuous measure VI and a sin- gular measure V 2 with respect to 11 (Lebesgue decomposition theorem) ( 221 Integration Theory D, 380 Set Functions C). (iv) Invariant Measures. Let (X,) be a measurable space and G a group of Borel isomorphic mappings from (X,) to itself. A measure 11 on (X,) is said to be invariant under G if I1(E) = l1(g -1(E)) for every E E  and every gEG. For example, the Lebesgue mea- sure m on (R", n) is invariant under the group of congruent transformations ( 225 Invariant Measures). (v) The Lebesgue-StieItjes measure. Let f be a right continuous monotone increasing func- tion on R. Then there exists a unique measure 11 on (R, I) such that 11«a, b]) = f(b) - [(a) for a < b. This measure is called the Lebesgue- Stieltjes measure ind uced by f ( 166 Func- tions of Bounded Variation B). (vi) Baire measurable functions and univer- sally measurable functions. Let f(x) be a real- valued function defined on a topological space X. If for every open set 0 the inverse image f- I (0) has the tBaire property (resp. is mea- surable with respect to the completion of any a-infinite tBorel measure on X), then f is said to be Baire measurable (resp. universally mea- surable or absolutely measurable). Universally measurable functions can be defined similarly on a measurable space. (vii) Disintegration. Let (S, 6) and (T,::!) be standard measurable spaces, v a a-finite mea- sure On (S, 6), and p: S -> T a v-measurable mapping. Ifthe image measure 11= Vf-1 is a- finite, there exists a unique family of measures 
1005 {V"tET} on (S, 6) such that (1) tf->v,(E) is universally measurable for every EE 6, (2) v, is concentrated on p-l(t) for almost every t with respect to J1, and (3) the following equality holds: v(E)= t J1 (dt)V,(E), EE6. This expression is called the disintegration of v with respect to J1. Every v-measurable function j on S is v,-measurable for almost every t with respect to J1, and the integral of j is expressible in the form of iterated integrals as follows: is j(s)v(ds) = t J1(dt) is j(s)v,(ds). This corresponds to Fubini's theorem on product measures. When v is a probability measure on (S, 6), p(s) is regarded as a (T, ::!)- valued random variable on (S, 6, v), and v, is the conditional probability measure under the condition p(s) = t. ( 342 Probability Theory E. For disintegration of measures on a topo- logical space  [3, ch. 9].) References [IJ H. Lebesgue, Integral, longueur, aire, Ann. Mat. Pura Appl., (3) 7 (1902), 231-359. [2J S. Saks, Theory of the integral, Warsaw, 1937. [3J P. R. Halmos, Measure theory, Van Nos- trand, 1950. [4J N. Bourbaki, Elements de mathematique, Integration Chap. 1-9, Actualites Sci. Ind., Hermann, 1965-1969. [5J H. L. Royden, Real analysis, Macmillan, second edition, 1963. [6J W. Rudin, Real and complex analysis, McGraw-Hili, second edition, 1974. [7J L. Schwartz, Probabilites cylindriques et applications radonifiantes, J. Fac. Sci. Univ. Tokyo, lA, 18 (1971),139-286. 271 (XX.4) Mechanics A. Newton's Three Laws of Motion The study of laws governing the motion of bodies began with the laws of falling bodies, of which the first exact formulation was made by Galileo. But the general relationship between force and acceleration was first described by tNewton, who established Newton's three laws of motion; the mechanics based on them is Newtonian mechanics. Newton expounded 271 B Mechanics these laws in his famous book Principia mathe- matica philosophiae naturalis (1686-1687), where the law of gravitation and its applica- tion, problems of fluid motion, motions of the planets in the solar system, etc., were systemat- ically treated. Newton's first law, A body continues its state of rest or uniform motion in a straight line unless it is compelled to change that state by external action (i.e., force). This is also called the law of inertia. Newton's second law. The rate of change of momentum is proportional to force and is in the direction in which the force acts. Here, momentum is defined as the product mv of the mass m and the velocity v. This law can be expressed as d(mv)/dt = F, where F is the force expressed in an appropriately chosen system of units. Since dv/dt is the acceleration a, the law takes the form ma = F, when the mass is con- stant. These equations are called equations of motion. The second law is often simply called the law of motion. Newton's third law. When two bodies 1 and 2 in the same system interact, the force exerted on body I by body 2 is equal and opposite in direction to that exerted on body 2 by body I. This law is called the law of action and reac- tion, or simply the law of reaction. Various attempts at a rigorous axiomati- zation of Newtonian mechanics have been made, beginning with E. Mach's work. At the beginning of the 20th century, it was found that Newtonian mechanics requires modifica- tion when bodies travel at speeds approach- ing the speed of light or when it is applied to physical systems of molecular size or smaller. These modifications led to the establishment of the theory of trelativity and tquantum mechan- ics. In contrast to these later theories, New- tonian mechanics is called classical mechanics. B. Newton's Law of Gravitation Kepler discovered the following three laws for the motion of planets around the sun (valid within the accuracy in observation available at the time): Kepler's first law. The orbit of a planet is an ellipse with the sun at one of its foci. Kepler's second law. The area swept per unit time by the straight line segment joining the planet and the sun is independent of the posi- tion of the planet in its orbit. Kepler's third law. The square of the period (the time needed for the planet to go around the orbit once) is proportional to the cube of the major axis of the orbit. From these empirical laws, Newton deduced his law of universal gravitation: Between any 
271 C Mechanics pair of point particles, with masses m 1 and m 2 and at a distance r, there arises an attrac- tive force along the line joining the two points; the magnitude of this force is given by F = Gm 1 m 2 r- 2 , where G is a universal constant (approximately 6.670 x 10- 8 dyne cm 2 g-2) called the gravitational constant. C. Kinetic and Potential Energies If the force F on a particle is a function of the position x of the particle and is the gradient - VU of a time-independent (scalar) function U(x), called the potential, then 1 (v _ _ d d X t ) E =-mv 2 + U(x) 2 is a constant of the motion of the particle. E is called the total energy, while the first and second terms on the right-hand side are called the kinetic energy and potential energy, respectively. For a system of n particles at points x(1), ..., x(') with masses m 1 , "', m n , suppose that the force acting on the particle at xU) is - VU)U (the gradient of U relative to xu») for a com- mon potential function U (x(1), "', x(n»). Then n 1 E= I -m.v. 2 +U(x(1) x('») j= 1 2 J J , ... , ( _ dX U » ) Vj - dt is the constant total energy of the motion. For example, Newton's gravitational force acting among a number of particles can be described by the Newtonian potential u= I -Gmjmjlx(i)-xU)I- 1 . i<j A potential U which is a sum of functions depending only on a pair of coordinates as in the above example is called a two-body interaction. D. Apparent Force The coordinate system in which Newton's three laws of motion hold is called an iner- tial system. In some cases (e.g., on a rotating sphere such as the earth) it is more convenient to use a moving coordinate system, in which the equation of motion derived from Newton's second law by a coordinate transformation (from an inertial system to the moving coordi- nate system) takes a form similar to the second law except for an additional apparent force to be added to the force in the original equation of motion. For a coordinate system rotating at a constant angular velocity w (relative to an inertial system), the apparent force consists of the centrifugal force, of magnitude mw 2 p (p being the distance to the axis of rotation), 1006 which pushes the particle away from the axis of rotation (along the perpendicular), and the Coriolis force 2mv x w (x denotes the tvector product), which bends the motion of the par- ticle in a direction perpendicular to both the axis of rotation and the velocity of the particle. E. Dynamics of Rigid Bodies A rigid body is defined as a system of particles whose mutual distances are permanently fixed. Like a point particle, it is an ideal concept introduced into mechanics to simplify the theoretical treatment. Actual solid bodies can in most cases be regarded as rigid under the action of forces of ordinary magnitudes. Since a rigid body can be imagined to be made up of an infinite number of particles, the equations of motion for a system of particles can also be applied to it. Thus the motion of a rigid body can be completely determined by the theorems of momentum and angular momentum. The momentum of a rigid body is defined by f dr Q = K dt dm, where dm is the mass of the volume element at a point r of the rigid body K and dr/dt is its velocity. If the external forces acting on K are denoted by F j (i = 1,2,...), we have dQ/dt=IF i , which expresses the theorem of momentum. If the velocity and acceleration of the cen- ter of gravity (center of mass or barycenter) J r dm/J dm of the rigid body are denoted by V G and AG' respectively, and the mass J dm by m, we have Q=mV G , and the theorem of mom en- tum becomes mdVG/dt=mAG='LF j . The angular momentum of a rigid body K about an arbitrary point ro is defined by f dr H= (r-ro)x-dm. K dt If Pj is the vector from ro to the point at which F j acts, we have dH/dt= I (Pi x F;)=G. This is called the theorem of angular momentum. For the case of a rigid body with one point ro fixed, the angular momentum H and the angular velocity ware related by Hx = Aw x - Fw y - Ew z , Hy= -Fwx+Bwy-Dwz. Hz= -Ewx-Dwy+Cw" where Hx, Hy, Hz; w x , Wy, W z are the compo- nents of Hand w in the xyz-coordinate system 
1007 fixed in space with its origin at the fixed point ro, and A= f(yZ+zZ)dm, c= f(xZ+yZ)dm, E= fzxdm, B= f(zZ+xZ)dm, D= f yzdm, F= f xydm, with the integrals taken over the whole rigid body. We ca1! A, B, and C the moments of inertia about the X-, Yo, and z-axes, respec- tively, and D, E, F the corresponding products of inertia. The rotational motion of a rigid body with one axis fixed is completely deter- mined by the theorem of angular momentum. However, for rotation about a fixed point, the theorem is not very convenient to use, because A, B, C, D, E, and Fare genera1!y unknown functions of time. Thetquadric Axz + B y 2 + Cz z + 2Dyz + 2Ezx + 2Fxy = 1 represents an te1!ipsoid with its center at the origin, ca11ed the ellipsoid of inertia. If the principal axes , 1], and ( are taken as coordi- nate axes, the equation of the e1!ipsoid of inertia becomes Ae + B1]z + r(Z = 1, where A, B, r are the moments of inertia about the -, 1]-, (-axes and are ca1!ed the principal moments of inertia, while the  -, 1]-, (-axes themselves are ca1!ed the principal axes of inertia. If the components of the angular momentum Hand angular velocity in the direction of the prin- cipal axes of inertia are denoted by (HI' Hz, H 3 ) and (w j ,W Z ,w 3 ), respectively, then HI =Aw 1 , Hz=Bw z , H 3 =fw 3 . Furthermore, if the -, 1]-, (-components of the resultant moment of the external forces G = L (Pi X FJ are denoted by (G p G z , G 3 ), then dH/dt= G becomes Adwr/dt=G j +(B-f)w Z w 3 , Bdwz/dt=Gz+(f -A)w 3 wj, r dw 3 /dt = G 3 + (A - B)w 1 W z . These are ca1!ed Euler's differential equations. The study of the motion of a rigid body can be reduced mostly to the study of the motion of its e11i psoid of inertia, since the latter is attached to the rigid body. The method of de- scribing the motion of a rigid body by means of its e1!ipsoid of inertia is known as Poinsot's representation. The motions of two bodies having equal e1!ipsoids of inertia are the same if the external forces acting on them have equal resultant moments, even if their geo- metric forms are different. When a rigid body moves under no con- straint, the motion of its center of gravity G 271 F Mechanics can be determined by the use of the theorem of momentum. Also, the rotational motion about the center of gravity can be found from dH'/dt = G', which is a modification of the theorem of angular momentum. Here H' and G' are re- spectively the angular momentum and mo- ment of external forces about the center of gravity. In this case also, simplification can be achieved by considering the equation in the reference system that coincides with the prin- cipal axes of the e1!ipsoid of inertia with its center at the center of gravity. F. Analytical Dynamics Mechanics as origina1!y formulated by New- ton was geometric in nature, but later L. Euler, J. L. Lagrange, and others developed the ana- lytical method of treating mechanics that is now ca1!ed analytical dynamics. Lagrange introduced generalized coordinates qj (j = 1, 2, ... ,j, where f is the number of degrees of freedom of the system considered), which uniquely represent the configuration of the dynamical system, and derived Lagrange's equations of motion: d ( a52 ) a52 di aqj - aqj =0, where qj = dq)dt, and 52 = T - U (T= kinetic energy, U = potential energy) is a function of qj and qj ca1!ed the Lagrangian function. Later, W. R. Hamilton introduced j = 1,2, .., ,j, Pj=aT/aqj, H = LPAj-52=H(P1"" ,Pf;qj, ...,qf) and transformed the equations to Hamilton's canonical equations: dqj aH - - dt apj' dpj dt aH aq/ j= 1,2,... J Here Pj is the generalized momentum conjugate to qj' and qj, Pj are ca1!ed canonical variables. If the functions representing the configuration of the dynamical system in terms of qj do not explicitly contain the time t, the Hamiltonian function (or Hamiltonian) H coincides with the total energy of the system T + U. The transformation (p, q)->(P, Q) under which canonical equations preserve their form is ca1!ed a canonical transformation. It is given by aw aw aw Pj= aq/ lj= - aQ/ K =H +ar' where W= W(qj, ...,qf;Q1, ...,Qf) and K js the Hamiltonian of the transformed system. The set of canonical transformations forms a group, ca1!ed a group of canonical transform a- 
271 G Mechanics tions. An infinitesimal transformation is given by as dpj= - f.- a ' qj as dq.=e- J apj' where f. is an infinitesimal constant. Here S is an arbitrary function and is said to be the generating function of the infinitesimal trans- formation. Canonical equations can be inter- preted to mean that the variations of P and q during the time interval f.=dt are the infini- tesimal canonical transformations whose gen- erating function is H(p, q, t). The variation of an arbitrary function F(p, q) under an infinitesimal transformation is given by dF=f.(F,S), where (U'V)=L (  riv _ oU  ) =L ri(u,v) j aqjoPj ripjoqj j o(qj'p) is Poisson's bracket. Therefore the time rate of change of a dynamical quantity F(p, q) can be written as dFjdt=(F,H). Thus the function F(p,q) that satisfies (F,H)=O is an tintegral of the canonical equations. If a canonical transformation (p, q)->(P, Q) such that  = rx j , Qj = (Jj are constant is found, the motion of the system can be determined by riW Pj=----;;;-' uqj oW (Jj=a' rx j where W is the tcomplete solution of the Hamilton-Jacobi differential equation: °oW +H ( W ,..., oW ; qr,...,qf,t ) =O ut uq1 oqf ( 82 Contact Transformations). G. Theory of Elasticity (1) General remarks. Suppose that a solid body is deformed elastically by the action of external forces. We may inquire about the magnitudes of deformation, stress, and strain caused by the external forces at each point of the body. The theory of elasticity studies this problem, assuming the body to be a con- tinuum and utilizing classical mechanics as a basis, and "endeavours to obtain results which shall be practically important in applications to architecture, engineering and aII other use- ful arts in which the material of construction is solid" [ll Cartesian coordinates (x, y, z) are employed for defining the 3-dimensional space contain- 1008 ing the body. We confine ourselves to the small-displacement theory of elasticity, in which the components of the displacement u = (u, v, w) of an arbitrary point P(x, y, z) of the body are assumed to be small enough to jus- tify the linearization of the governing differen- tial equations and boundary conditions. (2) Stress. Consider an infinitesimal rectan- gular parallelepiped enclosed by the following six surfaces: x = const, y = const, z = const, x + dx = const, y + d y = const, z + dz = const, in the body. The stress at the point P(x,y,z) is defined as those internal forces per unit area acting on the six surfaces of the parallelepiped. It has nine components, which are usually represented by ( ax Tyx Tzx ) Txy (Jy Tzy , '[xx Lyz (Iz where the a and T are called normal and shear- ing (or tangential) stresses, respectively. These nine quantities form a tensor called the stress tensor. Since it can be shown that TXY = Tyx, Tyz = Tzy, Tzx = TxZ' the stress tensor is a symmetric tensor. By considering equilibrium conditions of the parallelepiped, the equations of equilib- rium are found to be ria x OT yx OT zx - + oy +&+X=O,...,..., where X,... are body forces per unit volume. (3) Strain. The infinitesimal rectangular parallelepiped fixed to the body at the point P before deformation is transformed, after de- formation, into an infinitesimal parallelepiped which is no longer rectangular. The strain at the point P is defined as the changes caused by the deformation of the parallelepiped: exten- sions of the three sides and changes from right angles of the three angles formed by the three sides of the parallelepiped. Thus the strain has six components, usually represented by (f. x ' f.Y' F. z , Yyz, Yzx, YXy), where f. and yare called elongation and shear- ing strains, respectively. These six quantities, with slight modification, form a symmetric tensor called the strain tensor. (4) Strain-displacement relations. In small- displacement theory, the strain-displacement relations are given in linear form by ou OV t x =-, f.y=-, ox riy ou av ..., yxy=-+-;;-. oy ox (5) Stress-strain relations. In the theory of elasticity, stress and strain are assumed to 
1009 obey a linear relationship called Hooke's law: {a}=[AJ{r.}, where {(J} 1 = [ax, (Jy, (Iz, Tyz, '[zx, Txy], {f.} T = [8x, C: y , 8z, Yyz, Yzx, YXy], and [AJ is a symmetric positive definite ma- trix. For an isotropic body, the relations are Es x = axv(ay+ a z ), GYxy=Txy, where E, v, and G = E/2(1 + v) are elastic con- stants called the modulus of elasticity in tension or Young's modulus, Poisson's ratio, and the modulus of elasticity in shear or the modulus of rigidity, respectively. (6) Boundary conditions. The surface of the body can be divided into two parts with re- gard to boundary conditions: the part 51 over which the boundary conditions are prescribed in terms of external forces and the part 52 over which the boundary conditions are prescribed in terms of displacements. Obviously av = 51 + 52' where a v is the whole surface of the body. (7) Small-displacement theory of elastic- ity. We have seen that the equations which govern the problem are 3 equations of equilib- rium, 6 strain-displacement relations, and 6 stress-strain relations in terms of 15 unknowns, namely, 6 stress components, 6 strain compo- nents, and 3 displacement components. Thus our problem is reduced to a boundary value problem in which these 15 field equations are to be solved under the specified boundary conditions. Since all the field equations and boundary conditions are linear with respect to the unknowns under the assumption that the displacements are small, we obtain linear relationships between the external load and resulting deformation of the body; this is the small-displacement theory of elasticity. It should be remembered, however, that the assumption of small displacement sets limits to the application of the theory to practical problems. (8) Variational principles. Several variational principles have been formulated in the small- displacement theory of elasticity. These include the principle of minimum potential energy [uJ, the generalized principle [a, 8, U], the Hellinger- Reissner principle [a, u], the prin- ciple of minimum complementary energy [a J, and so forth, where the symbols in the brackets represent independent functions subject to variation. In connection with the aforementioned variational principles, vari- ational principles with relaxed continuity re- quirements have also been formulated by re- 271 Ref. Mechanics laxing the continuity requirements imposed on admissible functions. One of the practical advantages of these variational principles is that they often provide the problem with approximate formulations and approximate methods of solution, among which the Rayleigh-Ritz method is well known. Theories of beams, plates, shells, and multi- component structures are typical examples of such approximate formulations. Recently, these variational principles have been found to provide effective bases for the formulations of the tfinite element method. (9) Notation. Various symbols are used for stress and strain. For example, a, T and 8, yare commonly used in the engineering literature. However, in Love's treatise [6J, XX, Y n Zx and e xx , e xy , e xz are used in place of ax, T xy ' T xz and 8" Yxy' Yxz. respectively. Also, various nota- tions are used for elastic constants. E is widely used, while G and v are less common. Love uses J1 and a for G and v, respectively. In the engineering literature the reciprocal of Pois- son's ratio m (= l/v) is used and is called the Poisson number. (10) Finite-displacement theory of elastic- ity. When the displacement of the body is no longer small (infinitesimal) but is finite, we should abandon small-displacement the- ory and instead employ finite-displacement theory, in which stress and strain are defined in a manner different from that in small- displacement theory, keeping in mind the difference between spatial and material vari- ables. Thus equations of equilibrium, strain- displacement relations, and the boundary conditions on 51 become nonlinear equations with stress and displacement components as unknowns, although the stress-strain relations remain linear. Thus the problem is reduced to solving a nonlinear boundary value problem, sometimes called a nonlinear elasticity prob- lem. Variational principles have been formu- lated for finite-displacement theory and are frequently used in the formulation of approxi- mate methods of solution. When the stress becomes large enough to exceed the so-called elastic limit, where the linear stress-strain relationships cease to hold, the theory of elasticity is no longer valid and should be replaced by the theory of plasticity [9, 10]. References [1J E. T. Whittaker, A treatise on the analyt- ical dynamics of particles and rigid bodies, Cambridge Univ. Press, fourth edition, 1927. 
272 A Meromorphic Functions [2] A. Sommerfeld, Mechanics, translated by M. O. Stern, Academic Press, 1952. (Original in German, 1943.) [3] J. B. Marion, Classical dynamics of par- ticles and systems, Academic Press, 1965. [4] V. I. Arnold, Mathematical methods of classical mechanics, translated by K. V ogt- mann and A. Weinstein, Springer, 1978. (Orig- inal in Russian, 1974.) [5] R. Abraham and J. E. Marsden, Founda- tions of mechanics, Benjamin/Cummings, second edition, 1978. [6] A. E. H. Love, A treatise on the mathemat- ical theory of elasticity, Cambridge Univ. Press, fourth edition, 1934. [7] S. Timoshenko and J. N. Goodier, Theory of elasticity, McGraw-Hill, 1951. [8] A. E. Green and W. Zerna, Theoretical elasticity, Oxford Univ. Press, 1954. [9] R. Hil1, Mathematical theory of plasticity, Oxford Univ. Press, 1950. [10] K. Washizu, Variational methods in elasticity and plasticity, Pergamon, second edition, 1975. 272 (XI. 7) Meromorphic Functions A. General Remarks A single-valued tanalytic function in a domain D in the complex plane C is cal1ed meromor- phic in D if it has no singularities other than tpoles. A function that is meromorphic in the whole complex plane including the point at infinity is a rational function (Liouville's theorem). Specifical1y, if a function is mero- morphic in the domain C, then the function is cal1ed simply a meromorphic function, and a meromorphic function that is not a rational function is cal1ed a transcendental meromor- phic function. A meromorphic function j(z) can be represented as a quotient of two tentire functions. Let {Zk} (k = 1,2, '" ) be poles of j(z), and let J;.(z) = a:) /(z - Zk)"k + ... + a\k) /(z - zd denote the tsingular parts of j(z) at Zk (k = 1,2, ...). Then j(z) can also be written in the form OJ j(z)=g(z)+ L (J;.(z)-pd z )), k' where g(z) is an entire function and the pdz) (k = 1,2, ...) are rational entire functions (Weierstrass's theorem). Assume that a se- quence {Zk} (k = 1,2, ...) converges only to the point at infinity and that jk(1/(Z - zk)) (k = 1,2, ...) are rational entire functions of 1/(z- zd which have no constant terms. Then 1010 there exists a meromorphic function of z with J;.(1/(z - Zk)) as its tsingular part at Zk (Mittag- Leffler's theorem). B. Nevanlinna Theory The theory of meromorphic functions can be considered an extension of the theory of entire functions. In particular, value distribution theory, originating in Picard's theorem, was studied by many people, and in 1925 R. Nevan- linna published a systematic theory unifying the results obtained until then. This is cal1ed Nevanlinna theory. We let j(z) denote a meromorphic function in Izl < R  +00, and when we say that j(z) takes on a value, the value may be 00. For a value a, n(r, a) denotes the number of a-points of j(z), i.e., points z with j(z) = a, in 1 zl  r < R, where each a-point is counted with its multi- plicity. We set f r n(t, a) - n(O, a) N(r,a)= dt+n(O,a)logr, o t m(r,a)=- 2 1 f 2nlOg+ I j i I d() n 0 (re )-a if a i= 00, and f ' n(t, 00) - n(O, 00) N(r, 00)= dt+n(O,oo)logr, o t m(r, 00 )= f 2n log+ Ij(reiO)ld() 2n 0 if a= 00, where log+ a=max(loga,O) for a>O. The functions Nand mare cal1ed the counting function and proximity function of j(z), respec- tively, and T(r)= T(r,j)=m(r, oo)+N(r, 00) is the order function (or characteristic function) of j(z). T(r) is an increasing function of r and a tconvex function oflogr and is useful for ex- pressing f(z) as an infinite product, etc. The fol1owing relation holds among T(r), m(r, a), and N (r, a) for any a: T(r)=m(r, a)+ N(r, a)+ 0(1), (1 ) where 0(1) is tLandau's symbol (Nevanlinna's first fundamental theorem). By this theorem, if a bounded remainder is disregarded, then m(r, a) + N (r, a) is equal to T(r) for all a. This equality thus demonstrates a beautiful1y bal- anced distribution of a-points. We see that N (r, a) is in a sense the mean value of the number of a-points in Izl  r, and m(r, a) is the mean proximity to a of the val ue j(z) on Izl = r. If the term 10g+ in the definition of the proximity function is replaced by the logarithm of the reciprocal of the chordal distance between j(re iO ) and a on the com- plex sphere, then the remainder term in (1) is 
1011 eliminated. Hence the definition of the proxim- ity function is sometimes given in this form. C. The Order of Meromorphic Functions For an entire function j(z), the equality . log T(r). loglogM(r) hmsup-=hmsup roo logr ,oo logr holds, where M(r)=M(r,f)=maxlzlrlj(z)l. Since the right-hand side is the order of j(z) ( 429 Transcendental Entire Functions), we define the order (lower order) p of a mero- morphic function j(z) by . logT(r) (11 ' ml ' nf lOgT(r) ). p=hmsup- roo logr ,oo logr The order of a meromorphic function in Izi < R is also defined by . logT(r) p=hmsup ,R log(1/(R-r)) D. Meromorphic Functions on a Disk The order function T(r) is bounded if and only if j(z) can be represented as the quo- tient of two bounded holomorphic functions hdz), h 2 (z) in Izl < R (Nevanlinna). If T(r) is bounded, Iim'Rj(rei9) exists and is finite for every e, 0  0 < 2n, except possibly for a set with tlinear measure zero (P. Fatou and Nevanlinna). Among functions j(z) such that limrR T(r) = 00, those satisfying . T(r) hmsup = 00 rR log(I/(R-r)) have properties similar to those of transcen- dental meromorphic functions. E. Meromorphic Functions in the Whole Finite Plane Any meromorphic function such that T(r) limsup-< K HR logr is a rational function. Ifj(z) is a meromorphic function of order p and {r j ( IX) }, r j ( IX)  r j + 1 (IX) (j = 1,2, ... ) is the set of absolute values of IX- points, then L1 (1h(IX))P+' converges for any IX. Furthermore, j(z) = ZkeP(z) ( l- ) exp ( +... + zPp ) x lim TI ( ". ) t \ r--+oo la1 r z Z z Ib.l<;r I- bv exp b v +...+ pb: 272 F Meromorphic Functions where p is the smallest integer satisfying p + 1 > p, the a p and b v are the zeros and poles of j(z), respectively, k is an integer, and P(z) is a polynomial of degree at most p (Hadamard's theorem). Let lXI' ..., IXq (q  3) be distinct values. Then for any meromorphic function in Izl < R  00, q (q-2)T(r)< I N(r, IXj)-Ndr)+ D(r), j1 Or<R. Here f rndt)-ndO) N[(r)= dt+n 1 (O)logr, o t n[ (r) is the number oftmultiple points in Izi  r (a multiple point of order k is counted k-l times), and D(r) is the remainder such that if R = 00, then D(r) < K (log T(r) + log r) for some K except possibly for values of r belonging to the union of a countable number of intervals with finite total length, and if R < 00, then D(r) < K(log T(r) + log(I/(R -r))) except possibly for the union of a countable number of intervals {IJ with LJljd(l/(R-r))< 00 (Nevanlinna's second fundamental theorem). Several theorems on value distribution of meromorphic functions can be obtained di- rectly from this theorem. For instance, if j(z) is a transcendental meromorphic function, the equation j(z) = IX has an infinite number of roots for every value IX except for at most two values called Picard's exceptional values (Picard's theorem). For a meromorphic func- tion of order p, limroo Lrj<;,(rj(IXJr A (), <p) diverges for every value IX except for at most two values (Borel's theorem). A value IX for which the series converges is called a Borel exceptional value. We call b(IX)=b(IX,f) = 1 -lim sUProo N(r, IX)/T(r) the defect of f It always satisfies Ob(IX) 1, and the values with b(IX»O are called Nevanlinna's excep- tional values. The number of values IX (may be 00) with b( IX) > 0 is at most countable for any meromorphic function j(z), and l:r;1 b(IX;)  2. There are many studies concerning the values IX with b(IX) = 0 F. Julia Directions Among functions that have an essential sin- gularity at the point at infinity and are mero- morphic in the whole plane, there are some that possess no tJulia directions. These func- tions, called Julia exceptional functions, are of order O. A necessary and sufficient condi- tion for j(z) to be a Julia exceptional func- tion is that j(z) can be written in the form zmI1 p (l-z/a p )/I1v(1-z/b v ) (A. Ostrowski, 
272 G Meromorphic Functions 1925). Concerning zeros a# and poles b v of ((z), the following three properties are obtained using the theory of tnormal families due to P. Montel: (1) There are constants K[. K2' and K3 independent of r such that I n(r, 00) - n(r, 0) I <KI' n(2r, oo)-n(r, 00)<K 2 , and n(2r,0)- n(r, 0) < K3' (2) There are constants K4 and Ks such that for any p and q, l a I I l a I la 1 m Il -.!' Il -.!' <K p la"I<lapl a# Ib,I<lapl b v 4' I b I I I b I Ibql- m Il --'!.. Il ...'l.. <Ks. Ib,I<lbql b v la"I<lbql a# (3) There exists an £ > 0 satisfying I ap/b q - 11  £>0 for any p and q. G. Valiron gave a precise form of Julia directions that corresponds to Borel's theorem (Acta Math., 52 (1928), [3]). Namely, if the order p of a meromorphic function j(z) in Izi < 00 is positive and finite, then there exists a direction J defined by argz=IX such that the zeros zv(a,) of j(z) - a in any angular domain : largz- IXI <15 containing J have the prop- erty Lvlzv(a,)I-(P-')= 00 for any £>0 except for at most two values of a. The direction J is called a Borel direction. G. Relations between Two or More Meromorphic Functions Borel's unicity theorem can be stated as fol- lows: Let jj (j = 1, "', n) be non vanishing entire functions satisfying L'Jljj= 1; then for some (c 1 , ..., cn) #(0, ..., 0), L'J1 cjjj=O. This is contained in the following theorem: Let jj (j = 1, ..., n) be transcendental entire functions such that L'J=ljj= 1; then L'J1 b(O,jj)n-1. If two meromorphic functions j1 (z), j2(Z) have the same IXj-points for five distinct values IXj (j = 1,...,5) (where multiplicity is not taken into account), then they coincide everywhere. If the tRiemann surface of the talgebraic function w(z) defined by a polynomial P(z, w) = ° of z, w is oftgenus > 1, it is impossible to find mero- morphic functions z= j(O, w=g(O that satisfy P(f(O,g(m=O (tuniformization by meromor- phic functions). H. Asymptotic Values If a meromorphic function f(z)-> IX as Z-> 00 along a curve C, the value IX and the curve C are called an asymptotic value and asymp- totic path, respectively. Each (Picard's) excep- tional value of j(z) is an asymptotic value. For meromorphic functions, no simple relation is known between their order and the number of their asymptotic values. There exists a merom- orphic function of order ° with an infinite 1012 number of asymptotic finite values. Some results analogous to those for entire functions are obtained for meromorphic functions by applying the theory of normal families. F. Marty established a systematic theory of normal families of meromorphic functions by using spherical distance. I. Inverse Functions Generally, the inverse function of a meromor- phic function w = j(z) is infinitely multiple- valued. Let P(w, wo) be a function element of the inverse function with center at w o , and let C be an arbitrary curve starting at W o and with OJ its terminal point. For any domain S con- taining C, P(w, w o ) can be continued analyti- cally in S up to a point arbitrarily near OJ (Iversen's theorem). Continue the function element analytically along each half-line start- ing at its center. Then the set of arguments of half-lines along which the analytic continu- ation meets a singularity at a finite point is of zero linear measure (Gross's theorem). By considering the inverse image of the suitably cut Riemann surface of the inverse function, the z-plane can be divided into fun- damental domains such that each domain is the inverse image of the whole w-plane (with suitable slits removed) and has a boundary each point of which is taccessibIe from the inside of the domain, and the boundary curves of fundamental domains cluster nowhere in the plane. For a meromorphic function j(z), the set of functions z' = qJ(z) defined by j(z') = j(z) (i.e., transformations between points that give j(z) the same value) has the property of a thypergroup. If qJ(z) is single-valued, then it is a linear entire function, and if it is finitely multiple-valued, then it is an algebraic func- tion. The tcluster set of the inverse function at a transcendental singularity consists of only one point, 00, that is, it is an tordinary sin- gularity. To an analytic continuation along a curve that determines a transcendental sin- gularity of the inverse function there corre- sponds a curve in the z-plane terminating at 00. This curve is an asymptotic path off(z). Namely, the value j(z) tends to the coordinate of the transcendental singularity as z-> 00 along this path. Each asymptotic value of a transcendental meromorphic function w = j(z) corresponds to a transcendental singularity of its inverse function z = qJ(w), and if we consider two asymptotic paths to be the same if they correspond to the same singularity, then there exists a one-to-one correspondence between the set of asymptotic paths and the set of transcendental singularities of the inverse 
1013 function. The inverse function of any mero- morphic function of order p has at most 2p tdirect transcendental singularities if p  1/2 and at most 1 such singularity if p < 1/2 (L. V. Ahlfors). J. Theory of Covering Surfaces Ahlfors established the theory of covering surfaces by a metricotopological method and in applying it, obtained Nevanlinna theory and many other results on meromorphic functions. Let F, denote the covering surface of the Riemann sphere Fa with radius 1/2; F, is the image of Izl:s; r under a meromorphic function w = j(z). The area of F, divided by n, where n is the area of Fa, is given by A ( r ) - J ' r If'(zW d dO - n JIZI<,(I +lj(zW)2 p p , z=pe ie , and is called the mean number of sheets of F,. The length of the boundary of F, is given by L(r) = r 11'(z) I 2 ldzl . Jlzl' 1 + I [(z) 1 The relation f A(r) T(r)= ---;:-dr+O(I) holds (T. Shimizu, Ahlfors). Consider the Riemann surface of the inverse function of a meromorphic function w = j(z) in Izl < R:S; +00. It has a countable number of components Qv over a domain on the w-plane. Let v denote the inverse image of Qv on the z- plane. If v together with its boundary is con- tained in Izl <R, the component Qv is called an island, and otherwise, a peninsula. Let D be a simply connected domain of the w-plane, n(r, D) be the sum of the sheet num- bers of the islands of F, over D, and m(r, D) be the sum of the areas of the peninsulas of F, over D divided by the area of D. Then m(r, D)+n(r, D)= A(r)+ O(L(r)). Let Dj (j = 1, ..., q) (q  3) be disjoint simply connected domains on the w-plane. Then q q I n(r,D)- I n 1 (r,D»(q-2)A(r)-O(L(r)), j1 j1 where n dD) is the sum of the orders of branch points in all islands of F, over D. For a meromorphic function j(z), L(r) < A(r)I/2+€, where r satisfies O:S;r< 00 except for rE UJj for some intervals Ij' Hence in the case where Dj is a point rx j , this inequality 272 K Meromorphic Functions yields q q I n(r,rx)- I n 1 (r,CI:) j1 j1 >(q- 2)A(r) -O(A(r)'/2+€) with some exceptional intervals of values of r. This latter important inequality corresponds to Nevanlinna's second fundamental theorem. Let Dj (j = 1, ... ,q) (q  3) be disjoint simply connected domains on the w-plane. If every simply connected island over Dj has at least J1j sheets, then L1dl-(1/J1j)):S;2 (disk theorem). It follows from this theorem that given three disjoint disks Dj on the Riemann sphere, there is at least one Dj that has an infinite number of islands over it, and also that given five Dj, there exists at least one Dj that has a I-sheeted island over it (Ahlfors's tive-disk theorem). This theorem corresponds to tBloch's theorem for entire functions. These theorems can also be obtained for meromorphic functions on a disk. Ahlfors established a more important theory by introducing a differential metric. K. Recent Development The Nevanlinna brothers raised several im- portant problems, which gave strong motiva- tion for later investigations. The first major breakthrough after World War II was given by A. Gol'dberg in 1956. He gave an example which has infinitely many deficient values (Nevanlinna's exceptional values). W. Hayman proved that L b(a,j)" converges for rx> 1/3, and there are examples of meromorphic func- tions for which the series diverges for rx < 1/3. Finally, A. Weitsman showed that the series converges for rx = 1/3. The second major break- through was given by A. Edrei and W. Fuchs in 1959, whose works concern the following Nevanlinna theorem: Let K(f) be . N(r,O)+N(r,oo) hm sup ,w T(r,j) and K(p) = inf K(f), where inf is taken over all meromorphic function j of order p. Then K(p) > 0 if p is neither a positive integer nor 00. Furthermore, Nevanlinna gave a conjecture for an exact value of K(p). This conjecture is still open, although several estimates have appeared. In the case of entire functions hav- ing only negative zeros the conjecture was positively solved by S. Hellerstein and J. Wil- liamson. Edrei and Fuchs proved that K(p) = 1 for O:S; p:S; 1/2 and K(p) = sin np for 1/2 < p:S; 1. They also proved the ellipse theorem: Let j(z) be a transcendental meromorphic function of order p (O:s;p:S; 1). Put u = I-b(a,j), v= 1- b(b,j). Then u, UE[O, IJ and u 2 -2uucosnp+ u 2  sin 2 np. If further u < cos np, then U = 1. 
272 L Meromorphic Functions For the sum of deficiencies Edrei proved that 2:b(a,J) { l-csnp (0p1/2) 2 - sm np (1/2 <p  1) unless the number of deficient values is one. It was conjectured that L b(a,J) = 2 implies that the order p of f is a half integer, the number v of deficient values is at most 2p, and the value of b(aJ) is a multiple of lip. For the case of entire functions this conjecture is true (A. Pfluger). Weitsman proved that v  2p. Other cases remain open. All the above results still hold even if the order is replaced by the lower order. The inverse problem was completely solved by D. Drasin. Many of the above re- sults depend on the concept of P6lya peaks. N. V. Govorov and V. Petrenko proved independently that . . 10gM(r,J) hmmf np for p> 1/2 '-00 T(r,J) for every entire function of order p. This was a conjecture made by R. Paley. For p  1/2, an exact upper bound nplsin np was given by Valiron. Furthermore, the following result was proved by Edrei and Fuchs: . N(r,O) sinnp hmsup ;;;.- (Op< 1). '-00 10gM(r, f) np L. History The value distribution theory of meromorphic functions had its inception with the classical Picard theorem. It first appeared as the value distribution theory of entire functions and was developed into a well-organized field by way of the Nevanlinna theory and the AhIfors theory of covering surfaces. In recent years, emphasis has also been placed on the study of meromorphic functions on open Riemann surfaces ( 367 Riemann Surfaces). The value distribution of a set of several meromorphic functions was studied first by A. Bloch and developed into the study of mer om orphic curves by Ahlfors, and H. and J. Weyl [2]. The behavior of meromorphic functions in neigh- borhoods of general singularities has also been studied. An example of results in that field is the theory of tcluster sets. References [lJ R. H. Nevanlinna, Eindeutige analytische Funktionen, Springer, 1935, revised edition, 1953; English translation, Analytic functions, Springer, 1970. 1014 [2J H. Weyl and J. Weyl, Meromorphic func- tions and analytic curves, Princeton Univ. Press, 1943. [3J G. Valiron, Directions de Borel des fonc- tions meromorphes, Memor. Sci. Math., Gauthier-Villars, 1938. [4J W. K. Hayman, Meromorphic functions, Clarendon Press, 1964. [5J W. Fuchs, Developments in the classical Nevanlinna theory of meromorphic functions, Bull. Amer. Math. Soc., 73 (1967), 275-291. [6J A. Edrei and W. Fuchs, The deficiencies of meromorphic functions of order less than one, Duke Math. 1., 27 (1960), 233-249. [7J S. Hellerstein and J. Williamson, Entire functions with negative zeroS and a problem of R. Nevanlinna, J. Anal. Math., 22 (1969), 233- 267. [8J A. Edrei, Solution of the deficiency prob- lem for functions of small order, Proc. Lon- don Math. Soc., 26 (1973), 435-445. [9J A. Weitsman, Meromorphic functions with maximal deficiency sum and a conjecture of F. Nevanlinna, Acta Math., 123 (1969), 115-139. [10J A. Weitsman, A theorem on Nevanlinna deficiencies, Acta Math., 128 (1972), 41-52. [l1J D. Drasin, The inverse problem ofNe- vanlinna theory, Acta Math., 138 (1977), 83- 151. [12J A. Gol'dberg, Meromorphic functions, J. Soviet Math., 4 (1975),157-216. [13J V. Petrenko, Growth of meromorphic functions of finite lower order (in Russian), Izv. Akad. Nauk SSSR, 33 (1969), 414-454. Also  references to 124 Distribution of Values of Functions of a Complex Variable. 273 (11.17) Metric Spaces A. General Remarks The distance between two points x = (x 1, ... , x n ) and y = (y l' .. . , y n) in the n-dimensional tE uclidean space Rn is defin ed by p(x, y) = j (Y1 -xd 2 +... +(Yn- x n)2. The function p(x, y) is nonnegative for every pair (x, y) and has the following properties: (i) p(x, y) = 0 if and only if x = y; (ii) p(x, y) = p(y, x); and (iii) p(x,z)p(x,y)+p(y,z) for any three points x, y, z. Property (iii) is called the triangle inequality. B. Definition of Metric Spaces Abstracting the notion of distance from Eu- clidean spaces, M. Frechet defined metric 
1015 spaces [1] (1906). A metric on a set X is a nonnegative function P on X x X that satisfies (i), (ii), and (iii) of Section A, and a metric space (X,p), or simply X, is a set X provided with a metric p. The members of X are called points, P is called the distance function, and p(x, y) is called the distance from x to y. The distance function is sometimes denoted by d(x, y) or dis(x, y). If (i) is replaced by its weaker form (i') p(x,x)=O, the function p is called a pseudo- metric (or pseudodistance function), and X is called a pseudometric space. Examples of metric spaces: (1) The n-dimensional Euclidean space Rn, in particular the real number system R with Po(x, y)= Ix - yi. (2) The tfunction space Lp(O). (3) The tfunction space C(O). (4) The tsequence space s, i.e., the space R N of all se- quences of real numbers with metric p(x, y) = L1 2- n l xn - Ynl/O + IX n - Ynl), where x = (X 1 ,X 2 , ...) and Y=(Y1,J2, ...). (5) The tse- quence space m, i.e., the space of all bounded sequences of real numbers with metric p(x,y)=suPnlxn-Ynl for x=(xr.xz, ...) and Y=(Yl' Yz,...). (6) A Haire zero-dimensional space (ON, p), where 0 is a set and the dis- tance p(x,y) between X=(X1'X Z ' ...) and Y= (Y1'Y2' ...) is equal to the reciprocal of the minimum n such that X n i= Yn' When the car- dinal number T of Q is specified, the Baire space is denoted by B(T). (7) For any set X, define P by setting p(x,x)=O and p(x,y)= 1 when x i= y. Then (X, p) is a metric space, called a discrete metric space. (8) For any set X, define p by setting p(x,y)=O for any mem- bers x and y. Then p is a pseudometric, and the resulting space X is called an indiscrete pseudometric space. F or a subset M of a metric space X, sup{p(x,y)lx,YEM} is called the diameter of M (denoted by d(M)), and M is said to be bounded if its diameter is finite (includ- ing M = 0). For two subsets A, B of X, inf{p(x,y)lxEA,yEB} is called the distance between A and B, denoted by p(A, B). We have p(A, B) = p(B, A). When a family \In = {MA I AE A} of subsets of X is a covering of X, i.e., X = UAMA' the supremum of the dia- meters d(M A ) of M A in \In, sup{d(MA)I).EA}, called the mesh of the covering \In. For a posi- tive number 8, a covering whose mesh is less than f. is called an f.-covering. A metric space X is calIed totally bounded (or precompact) (F. Hausdorff, 1927) if for each positive num- ber 8 there exists a finite 8-covering of X. A subset X 1 of a metric space X becomes a metric space if we define its metric PI by set- ting PI (x, y) = p(x, y) for x, Y E X r. where p is the metric of X. The space (X l' pd is called a metric subspace of (X, p). A subset of X is called totally bounded (or precompact) if it is 273C Metric Spaces totally bounded as a metric subspace. Any totally bounded subset is bounded. Converse- ly, in the Euclidean space R n any bounded subset is totalIy bounded. A bijection f from a metric space (XI' PI) onto a metric space (Xz,pz) is called an iso- metric mapping if f preserves the metric, i.e., P2(f(X),f(y)) = PI (x, y) for any points x, YE XI; and X, and X 2 are called isometric if there is an isometric mapping from XI onto Xz' Let X be a metric space with metric p, and left f be an injection from a set Y into X. Then the function p'(Yr. Yz) = p(f(YrJJ(Y2)) (Y1' Yz E Y) is a distance function on Y, and with this metric the set Y becomes a metric space called the metric space induced by f; f is an isometric mapping from (Y, p') onto (f(Y),p). For a finite number of metric spaces (X l' PI), ..., (X n , Pn), we can define a metric p on their Cartesian product X = XI X ... X X n by setting p(x,y)= J P1(X 1 'Y1)2 +... + Pn(x n , Yn)z , for two points X=(X1, ...,x n ), Y=(Y1' ...,Yn) of X. Thus we obtain a metric space (X, p), called the product metric space of (X l' PrJ, "', (X n , Pn). The n-dimensional Euclidean space Rn is the product metric space of n copies of the real line (R, Po). C. Topology for Metric Spaces For a point x of a metric space (X, p) and any positive number 8, the set Ue(x) of all points Y such that p(x, y) < 8 is called the 8- neighborhood (or 8-sphere) of x. We can intro- duce a topology for X by taking the family of all 8-neighborhoods as a tbase for the neigh- borhood system ( 425 Topological Spaces). Then the following five propositions hold, any one of which can be used to define the same topology: (i) A subset 0 is topen if and only if for any point x in 0 there is a positive number 8 such that the 8-neighborhood of x is con- tained in O. (ii) A subset F is tclosed if and only if any point whose every 8-neighborhood contains at least one point of F is contained in F. (iii) A subset U is a neighborhood of a point x if and only if U contains some 8- neighborhood of x. (iv) A point x is an tinte- rior point of a subset A if and only if some f.- neighborhood of x is contained in A; the inte- rior A i of A is the set of all such points. (v) A point x is adherent to a subset A if every f.- neighborhood of x contains at least one point of A; the closure A of A is the set of all such points, and XE A if and only if p(x, A)=O. Every metric space X satisfies the tfirst countability axiom. A metric space X is a 
273 D Metric Spaces tHausdorff space and, more specifically, a tper- fectly normal space; it is also tparacompact. In the same way, we can define a topology for each pseudometric space that satisfies the first countability axiom, but a pseudometric space is not necessarily Hausdorff. D. Convergence of Sequences A sequence {xn} of points in a metric space is said to converge to a point x (written limn00 X n =x) if p(xn,x) tends to zero as n-too. The point x is called the limit of {x n }. This conver- gence is equivalent to convergence with re- spect to the topology defined in Section C ( 87 Convergence). As the first countability axiom is satisfied, we may define the topology by means of convergent sequences of points: the closure A of a subset A is the set of all limits of sequences of points in A. E. Separable Metric Spaces For a metric space X the following three con- ditions are equivalent: (i) There exists a coun- table family 1'10 of open sets of X such that each open set of X is the union of members of 1'10 (tsecond countability axiom). (ii) X is tsep- arable, that is, X has a countable subset that is tdense in X. (iii) Every open covering of X has a countable subcovering (tLindelof space). A metric space with any of these properties is called a separable metric space. The sequence space s is separable. Any separable metric space can be isometrically embedded in the sequence space m, i.e., is isometric to a sub- space of m ( 168 Function Spaces B). F. Compact Metric Spaces For a metric space X, the following five con- ditions are equivalent: (i) X is tcompact, that is, every open covering of X has a finite sub- covering. (ii) X is tcountably compact, that is, every countable open covering of X has a finite subcovering. (iii) X is tsequentially com- pact, that is, any sequence of points in X has a convergent subsequence. (iv) Every nested family FI :::J F 2 :::J '" of nonempty closed sets of X has a nonempty intersection. (v) Every infinite subset M of X has an accumulation point x, i.e., x E Air =- {x} . A metric space satis- fying any of these conditions is called a com- pact metric space (M. Frechet [IJ). Every real- valued continuous function defined on a com- pact metric space has a maximum and a min- imum. A metric space is compact if and only if it is totally bounded and complete ( Sec- tion J). Every totaIly bounded metric space is 1016 separable. In particular, every compact metric space is separable. Let U = {U",} be an open covering of a com- pact metric space X. There exists a positive number b such that every set with d(A) < b is contained in some U",. The number (j is called the Lebesgue number of the open covering U. A subset A of a metric space is said to be compact if it is compact as a metric subs pace, and A is said to be relatively compact if its closure is compact. Bounded closed sets in R n , in particular closed intervals of real numbers, are compact. For these sets, conditions (i), (iv), and (v) are called the Heine-Borel theorem (or Borel-Lebesgue theorem), Cantor's intersection theorem, and the Bolzano-Weierstrass theorem, respectively. G. Product Spaces of Metric Spaces Let (XI, PI)' ..., (X n , Pn) be metric spaces. Then the Cartesian product X =X I x... X X n has distance functions pp(X, y) = {PI (XI' YI)P +... + Pn(xn, Yn)P} liP, p;;:,1 and p",,(X, y)= max {PI (XI' YI), ..., Pn(x n , Yn)}, where x =(x I ,..., x n ) and Y=(YI'''' ,Yn)' The topology of X induced by each one of these metrics coincides with the product topology. In particular, for the n-dimensional Euclidean space Rn, the metrics Pp (p;;:' 1) and P00 define the same topology. Let (X b ptJ, ..., (X n , Pn),'" be a countable number of metric spaces. If we define a metric P on the Cartesian product X = IT:  I X n by p(X, y) = I  Pn(Xn, Yn) , n1 2 1 + Pn(x n , Yn) where x=(X 1 ,X2'''') and Y=(YI,Y2'''')' then the topology defined by P is identical with the product topology. For the Cartesian product of an uncountable number of metric spaces, we cannot construct a metric P such that the topology induced by P agrees with the product topology in general. H. Uniformity of Metric Spaces Every metric space X is a tuniform space, for which we may take a countable number of subsets {(x,Y)1 p(x,y)< r n }, n= 1, 2,..., of X x X as a base of tuniformity ( 436 U ni- form Spaces). 
1017 I. Uniform Continuity A mapping.ffrom a metric space (X,p) into a metric space (Y,a) is tcontinuous iffor any point x in X and any positive number f. there is a positive number b such that !(UJ(x)) C v,,(J(x)), where UJ(x) is a b-neighborhood for p and v,,(y) is an f.-neighborhood for a; that is, p(x, x') < <5 implies a(J(x),.f(x')) < f.. In this case, we must generany choose b depending on x and f.. In the special case where we can choose b depending only on f., independently of x, we can! uniformly continuous in X. (The notion of uniform continuity may be general- ized to uniform spaces.) Not every continuous mapping is necessarily uniformly continuous, but every continuous mapping from a com- pact metric space into a metric space is uni- formly continuous. J. Complete Metric Spaces A sequence {x n } of points in a metric space (X, p) is caned a fundamental sequence (or Cauchy sequence) if p(xn,xm)->O as n, m->oc. Every convergent sequence is a fundamental sequence, but the converse is not always true. A metric space is caned complete if every fun- damental sequence in the space converges to some point of the space (M. Frechet [1]). A topological space that is homeomorphic with a complete separable metric space is sometimes caned a Polish space ( 22 Analytic Sets I). The metric spaces introduced in examples (1) through (5) of Section B are complete. (In example (3) we must assume that the space Q is a compact Hausdorff space.) A metric space is compact if and only if it is complete and totally bounded. A tIocany compact metric space is homeomorphic to a complete metric space. For a metric space X, we can construct a complete metric space Y such that there is an isometric mapping ({J from X onto a dense subspace XI of Y (F. Hausdorff, 191 L1 ). Slch a pair (Y, ({J) is caned a completion of X. If X has two completions (Y" ({JrJ and (Y 2 , ({J2), then there is an isometric mapping r from Y I onto Y 2 with ({J2 = ! 0 ({Jt . In this sense the comple- tion of X is unique. By identifying X with ({J(X) when (Y, ({J) is the completion of X, any metric space can be regarded as a dense sub- space of a complete metric space. For example, the completion of the rational number system Q is the real number system R. A metric space is tot any bounded (= precompact) if and only if its completion is compact. Haire-Hausdorff theorem: In a complete metric space every set of the Hirst category is a tboundary set. That is, every set that can be 273 K Metric Spaces expressed as the union of a countable number of sets whose closures have no interior point has no interior point. In other words, if the union U:, Fn of closed sets F" F2"" of X has an interior point, then at least one of the Fn must have an interior point. K. The Metrization Problem A topological space X is caned metrizable if we can introduce a suitable metric for X which induces a topology identical to the original one. A tT 1 -space satisfying the second counta- bility axiom is metrizable if and only if it is tregular (Uryson-Tikhonov theorem; P. S. Uryson, Math. Ann., 94 (1925), A. Tikhonov, Math. Ann., 95 (1925)). However, a metric space does not necessarily satisfy the second countability axiom. Therefore, the Uryson- Tikhonov theorem does not provide a neces- sary and sufficient condition for metrizability. The fonowing are some necessary and sufficient conditions for a topological space X to be metrizable: (1) There exists a nonnegative real-valued function d on X x X satisfying the first two axioms given in Section A and the fonowing condition: There exists a real-valued function ((J(w) that converges to zero as w->O such that, for any three points x, y, z and any positive number f., d(x, y) < ((J(f.) and d(y, z) < ((J(f.) imply d(x, z) < f. (E. W. Chittenden, Trans. Amer. Math. Soc., 18 (1917)). (2) X is a T 1 -space that has a countable number of open coverings Wit, \In 2 , ... satisfy- ing the fonowing two conditions: (i) If U t , U 2 E\In n + 1 have a common point, there is a set U Effil n with U:::J U 1 U U 2 ; (ii) for any point X in X, if Un is any member of \Inn containing x, the family {U n }F1.2. is a base for the neigh- borhood system of x (P. S. Aleksandrov and Uryson, C. R. Acad. Sci., Paris, 177 (1923), and N. Aronszajn). When X is a uniform space, this amounts to saying that X has a metric compatible with the uniform structure if and only if X is a T 1 -space and has a countable base of uniformity. (3) X is a T 1 -space that admits a countable number of open coverings WI" \In2'''' such that {S(S(x, \In;), \Inj)1 i,j = 1,2, ... } is a base for the neighborhood system of x at each point of X, where S(A, \In) is the tstar of A relative to WI (R. L. Moore, Fund. Math., 25 (1935); K. Morita, Proc. Japan Acad., 27 (1951); A. H. Stone, Pacific J. Math., 10 (1960); A. V. Arkh- angel'skii, Dok/. Akad. Nauk SSSR, 2 (1961)). (4) X is regular and has a ta-locany finite open base (1. Nagata, J. lnst. Poly tech. Osaka City Uniu., 1 (1950); Yu. M. Smirnov, Uspekhi Mat. Nauk, 6 (1951)). 
273 Ref. Metric Spaces (5) X is regular and has a ta-discrete open base (R. H. Bing, Can ad. J. Math., 3 (1951)). (6) X is a tcollectionwise normal Moore space ( below; Bing, ibid.). (7) X is a tperfect image of a subspace of a Baire's zero-dimensional space (Morita, Sci. Rep. Tokyo Kyoiku Daigaku, sec. A, 5 (1955)). (8) X is a Hausdorfft M -space such that the diagonal is a tGJ-set in the direct product X xX (A. Okuyama, Proc. Japan Acad. 40 (1964); C. J. R. Borges, Pacific J. Math., 17 (1966); 1. Chaber, Fund. Math., 94 (1977)). A regular space is said to be a Moore space if it has a countable number of open coverings \In, such that {S(x, \Inj)} is a base for the neigh- borhood system of x for any point x. A Moore space is not necessarily metrizable. F. B. Jones (Bull. Amer. Math. Soc., 43 (1937)) proved under the assumption 2"0 < 2"1 that every separable normal Moore space is metrizable and asked whether or not every normal Moore space is metrizable. (3) and (6) are partial answers to the question. A normal Moore space is metrizable if it is locally compact and tlocally connected (G. Reed and P. Zenor). The existence of a nonmetrizable separable normal Moore space is consistent with and indepen- dent of the axioms of the usual ZFC set theory, the tZermelo-Fraenke1 set theory with the taxiom of choice (F. D. Tall). W. G. Fleissner (Trans. Amer. Math. Soc., 273 (1982)) con- structed a normal nonmetrizable Moore space assuming an axiom weaker than the contin- uum hypothesis, while P. J. Nyikos (1980) has proved that every normal space with the first countability axiom is collectionwise normal from the strong axiom of set theory. In connection with (7) the following result is known. Let f be a tclosed continuous mapping from a metric space X onto a topological space Y. Then the following conditions are equivalent: (I) Y is metrizable; (2) For each y E Y the tboundary af-l (y) of the inverse image is compact; (3) Y satisfies the first coun- tability axiom (Morita and S. Hanai, Proc. Japan Acad., 32 (1956); Stone, Proc. Amer. Math. Soc., 7 (1956); I. A. Vainstein, Dok/. Akad. Nauk SSSR, 57 (1947)). In particular, perfect images of metric spaces are metrizable. For quotient topological spaces of metric spaces  425 Topological Spaces Cc. References [IJ M. Frechet, Sur quelques points du calcul fonctionnel, Rend. Circ. Mat. Palermo, 22 (1906), 1-74. Also  references to 425 Topological Spaces. For metrization problem see in particular 1018 [2J 1. Nagata, Modern general topology, North-Holland, 1968. [3J R. Engelking, General topology, Polish Scientific Publishers, 1977. [4J Y. Kodama and K. Nagami, Theory of topological spaces (in Japanese), Iwanami, 1974. [5J A. V. Arkhangel'skii, Mappings and spaces, Russian Math. Surveys, 21 (1966), 115-162. 274 (XII.16) Microlocal Analysis A. General Remarks Let X be an open set ofRn. Then X x(Rn"o) can be identified with T* X ,,0, the tcotangent bundle of X minus the zero-section. To every locally integrable function (or distribution or hyperfunction ( 125 Distributions and Hyperfunctions)) f(x) defined on X, one can assign closed subsets of X x (Rn"o) called the wave front set of f and the singularity spec- trum (or analytic wave front set or essential support) of f The wave front set (resp. the singularity spectrum) of f describes in detail the singularity of f modulo the infinitely dif- ferentiable functions (resp. the real analytic functions). One can further associate with f more refined objects defined on X x (Rn"O), such as microfunctions. In many cases one can recover knowledge of the structure of f by analyzing these objects defined on X x (Rn"o). Such an analysis on the cotangent bundle of X is called microlocal analysis. Microlocal analy- sis is particularly successful if f is a solution of a system of linear (pseudo- )differential equa- tions, because in that case one can use vari- ous linear transformations, such as differential operators, pseudodifferential operators ( 345 Pseudodifferential Operators), or microdiffer- ential operators and Fourier integral opera- tors, or quantized contact transformations. B. Microlocal Analysis for Distributions Let u be a distribution defined in an open subset X of Rn. The ave front set WF(u) of u is defined as the complement in X x (Rn"O) of the collection of all (x o , o) in X x (R n ,,0) such that for some neighborhood U of x o , V of o we have for each qJEC(U) and each N>O, (u, qJexp( -iu' » = O(T- N ) as T-> 00, uniformly in  E V (L. Hormander; [IJ). WF(u) is considered to be a subset of 
1019 T*X,,{O}. IfWF(u)=0, then u is a Coo_ function. Let n: X x (R n "O)->X be the natural projection. If n(WF(u)) contains Xo, then u is not a COO-function in any small neighborhood of Xo. Thus WF(u) is the obstruction for u to be infinitely differentiable. Let p(x, D) be a tlinear partial differential operator of order m. Assume that p(x, D)u = f Then WF(f)c WF(u)c WF(f)U p/;/(O), where Pm :(x, ()->Pm(x, () is the tprincipal sym bol of p(x, D). Such a technique of localizing the problem on the cotangent bundle has been used in the form of the estimation of the Fourier transform of f since the advent of the tsingular integral operators of A. P. Calderon and A. Zygmund [3,4J (see also S. Mizohata [5, 6J). The formula above contains as a special case the classical result that u is a COO-function if p(x, D) is telliptic and if f is a COO-function. In obtaining useful results of microlocal analysis for distributions one often uses Fou- rier integral operators and pseudodifferential operators (or singular integral operators) ( 345 Pseudodifferential Operators). C. Fourier Integral Operators [1,2,7-9J A Fourier integral operator B: Co(Rn)->'(Rn) is a locally finite sum of linear operators of the type Af(x) = (2nf(n+N)/2 r a(x, Ii, y) JRII+N x exp(iqJ(x, 0, y))f(y)dy dO. Here a(x, 0, y) is a COO-function satisfying the inequality IDDC DJa(x, Ii, y)1 :;:;; C(1 + Ilil)m-pIPI+(I-p)(I.I+IYI) for some fixed m and p, 1  P > 1 j2, and any triple of tmulti-indices IX, {3, y, and qJ(x, 0, y) is a real-valued COO-function which is homoge- neous of degree 1 in Ii for I Ii I > 1. The function qJ is called the phase function and a the ampli- tude function. Let C", = {(x, 0, y)1 doqJ(x, Ii, y)= 0, Ii ""O} and W = {(x, Y)ER n X Rn 131i",,0 such that (x, Ii, y)E C",}. If dx.o.yqJ(x, Ii, y) "" ° for Ii "" 0, then the kernel distribution k(x, y) of A is of class Coo outside W. A phase function qJ is called non- degenerate if the dx,o,y(aqJ(x, 0, y)jali),j = 1,2, ... , N, are linearly independent at every point of C",. In this case, C", is a smooth manifold in Rn+N+n and the mapping <1>: C", 3(X, Ii, y)-> (x, y, (, 11), (= dxqJ(x, Ii, y), 11 = dyqJ(x, Ii, y), is an immersion of C", to T*(Rn x Rn)"o, the cotangent bundle of R n x R n minus its zero- section. The image <l>C", = A", is a conic La- grangian manifold, i.e., the canonical 2-form a 274C Microlocal Analysis = Ljd(j/\ dX j - Ljd11j/\ dYj vanishes on A", and the multiplicative group of positive numbers acts on A",. Let AI, ,1,2' ..., A 2n be a system of local coordinates in A",. These, together with oqJjali 1 , aqJjali 2 , ..., aqJjaO N , constitute a sys- tem of local coordinate functions of Rn+N+n in a neighborhood of C",. Let J denote the Jacobian determinant ( , oqJ aqJ ) D AhA2'''' A2n,-, "',-;;j1 aO I G U N D(x,O,y) The function at.. = jJ ale. <1>-1 exp(nMij4) is called the symbol of A. Here ale is the restric- tion of a to C", and M is an integer called the Keller-Maslov index [10,11]. The conic La- grangian manifold A",=A",(A) and the symbol a/\ =0/\ (A) essentially determine the singular- ity. of th tkernel distribution k(x, y) of the Fourier integral operator A. Conversely, given a conic Lagrangian manifold A in T*(Rn x Rn)"o and a function a/\ on it, one can con- struct locally a Fourier integral operator A such that A",(A)=A and a/\.(A)=a/\. For global construction of such a Fourier integral operator one requires detailed consideration of the Keller-Maslov index. A globally defined Fourier integral operator A with A",(A)=A and a/\. (A) = a/\ exists if and only if a/\ is not a function on A but a section of the complex line bundle 01/2 @ L, where 0 1 ;2 is the bundle of square roots of the volume elements of A and L is a Z4 bundle over A called the Maslov bundle. The factor J1 exp(nMij4) in the de- finition of a/\. above appears as the effect of trivialization of the bundle ° 1;2 @ L. Those Fourier integral operators whose associated conic Lagrangian manifolds are the graphs of thomogeneous canonical transformations of T*(Rn) are most frequently used in the theory of linear partial differential equations. Let A be a Fourier integral operator such that A",(A) is the graph of a homogeneous canon- ical transformation X. Then the adjoint of A is a Fourier integral operator such that the associated conic Lagrangian manifold is the graph of the inverse transformation X-I. Let AI be another such operator; if A",(Ar) is the graph of XI, then the composed operator AI A is also a Fourier integral operator and A",(A 1 A) is the graph of the composed homo- geneous canonical transformation XIX. Consider the kernel distribution k(x, y) of A. If the phase function qJ of A is nondegenerate, then WF(k) is contained in A",(A). Moreover, if the symbol a/\ (A) does not vanish, then WF(k)= A",(A). Let u be a distribution and A be a Fourier integral operator such that Aq>(A) is the graph of a homogeneous canonical transformation X. Then WF(Au) c X(WF(u)). 
274 D Microlocal Analysis A pseudodifferential operator of class S; l_p(Rn) is a particular type of Fourier inte- gral operator ( 345 Pseudodifferential Oper- ators). In fact, a Fourier integral operator A is a pseudodifferential operator of class S; l_p(Rn) if and only if A<p(A) is the graph of the identity mapping of T*(R n ). Hence for any Fourier integral operator A, A*A and AA* are pseudodifferential operators. The following theorem is due to Yu. V. Egorov [12]: Let P(x, D) be a pseudodifferen- tial operator of class S; l_p(Rn) with the sym- bol p(x, ), and let A be a Fourier integral oper- ator such that the associated conic Lagrangian manifold A<p(A) is the graph of a homogeneous canonical transformation X of T*(Rn). Then there exists a pseudodifferential operator Q(x, D) with the symbol q(x, )E S;, l_p(R n ) such that P(x, D)A = AQ(x, D) and q(x,)- p(X(x, mES;+l(Rn). Note that m-2p+ l<m. Assume that m= I, p= I, and that Pl(X,) is a real-valued C"-function, homogeneous of degree I in  for I  I> I, such that p(x, ) - PI (x, )ES?o(Rn) and dPl (XO, o)#O at (xo. o), where pd x o, o) =0. Then one can find a Fourier integral operator A such that the function q(x, ) of Egorov's theorem satisfies the relation q(x,) - I E S?o(R n ). The boundedness of Fourier integral oper- ators in the space Lz(Rn) (or the spaces H'(Rn)) has also been studied in several cases. Some sufficient conditions for boundedness can be found in [7,8, 14-16]. The theory of Fourier integral operators has its origin in the asymptotic representation of solutions of the wave equation, (see, e.g., [17,18]). Fourier integral operators were first used by G. I. Eskin [7]. D. Essential Support or Analytic Wave Front Set of a Distribution Inspired by the physical idea introduced by C. Chandler and H. P. Stapp, 1. Bros and D. Iagolnitzer introduced the notion of the essen- tial support of a distribution, which is a closed subset of X x (Rn"O) [19]. Let U be a distri- bution defined on an open set X of Rn and X be a C"-function with compact support around Xo E X which is locally analytic and different from ° at Xo. Let Lx(U) be the subset ofRn"o of which the complement is defined as follows. A point I] is in the complement of Lx(U) if there exist a conic neighborhood U of 1], constants Cl., Yo> 0, and C" such that I <u,Xexp{ -ix'  - yl llx - XOI2} >1 CN(1 +IW-Ne-'YII 1020 for all  E U, 0< Y < Yo, and all positive integers N. The essential support Lxo(U) of U at xo, is the limit of LAu) when the width Isupp xl of the support of X around Xo tends to O. The es- sential support L(U) of U is the closed subset UXEX {x} x L.{U) of X x Rn"O. .E(u) is the ob- struction for U to be real analytic. L. Horman- del' [20] also defined the analytic wave front set of a distribution, which is also the obstruc- tion for a distribution to be real analytic. The definition of an analytic wave front set is quite different from that of essential support. How- ever they coincide with each other [21]. More- over, both of them coincide with the singular- ity spectrum of U if the distribution U is re- garded as a hyperfunction. E. Microlocal Analysis for Hyperfunctions [22] (1) Microfunctions. Let N be a real analytic manifold of dimension n + d and M its sub- manifold of codimension d. In what follows, TMN and T,JN denote the normal bundle of M and the conormal bundle supported by M, respectively. Here the normal bundle TMN is defined to be the quotient bundle TN I M/TM of the tangent bundle and the conormal bundle T,J N to be the subbundle of the cotangent bundle T * N I M that annihilates TM. Identify- ing N with {(x, V)E TN I v=O} or {(x, )E T* N I  = O}, we define the tangential sphere bundle SN and cotangential sphere bundle S*N by (TN"N)/R (= UXEN(TxN,,{O} )/R) and (T*N "N)/R( = UXEN(TX*N,,{O} )/R), re- spectively. The normal sphere bundle SMN = (TMN "M)/R and conormal sphere bundle SZr N = (T,J N "M)/ R  are defined in the same manner. In parallel with the algebraic geom- etry ( 16 Algebraic Varieties) we define the real monoidal transform of N with center M to be the manifold (N"M)USMN with boundary, in which the center M is blown up to SMN by the polar coordinates. We denote it by MN. We mainly use this notion when N is a tcom- plexification X of M, regarding X as a 2n- dimensional real manifold. In this case we can canonically identify TMX with J=1 TM, and hence SMX with j=t SM. We denote by x+ j=t vO the point in S,wX that corresponds to (x, j=t v) in j=t SM by this identification. An open subset W of X" M is called a conoidal neighborhood of a subset U of j=t SM if WUj=t SM is a neighborhood of U in MX. Let E, if, and i denote respectively the canon- ical embedding mappings from X "M to X, from MX "j=t SM to MX and from j=t SM to 'WK. We then define the tsheaves tr: and ,;} 
1021 by 8*8- 1 (9x and 0IJ_ISM(=r 1 0 x ). Here 0x denotes the sheaf of germs of holomorphic functions on X. We also define the sheaf!!2 on p SM by £';_iSM(,-I(9X)' where, is the canonical projection from MX to X and £ denotes the pth tderived functor of the functor rs of taking the sections with support in S. Then!!2 is isomorphic to d /,-ld for the sheaf d of real analytic functions on M. The sheaf d is, so to speak, the sheaf of "boundary values" of holomorphic functions. Actually there exists a canonical mapping b from d to ,-I :JiJ M , where :JiJ M denotes the sheaf of thyper- functions on M ( 125 Distributions and Hyperfunctions). Thus we see that !!2 describes the singularities of the boundary value of a holomorphic function. These sheaves d and !!2 are easy to understand intuitively. However, they are defined on p SM, while p. S* M is more important in analysis. Our final goal, namely, the sheaf of microfunctions, is constructed on p S* M through cohomo- logical machinery starting from !!2' In order to do this, we introduce the disk bundle DM by {(x+P vO, (x,p oo)) EP SM xMP S*MI(v,OO}. Here P SM xMP S*M denotes the tfiber product of P SM and P S* M over M and the symbol oo is used to em- phasize that  designates the codirection, which is dual to the infinitesimally small quantity vO. The canonical projections from DM to P S*M and from P SM to M are both denoted by,. Similarly, the pro- jections from DM to p SM and from P S*M to M are denoted by n. We denote by a the antipodal mapping on P S* M, namely, a(x,p oo)o=(x, -P oo). For a sheaf g; on p S*M, we also denote a*,( =a- I g;) by g;a. In the following, Rj,*, etc., denotes the jth tright-derived functor of the functor '* of taking the direct image of sheaves, etc. ( 383 Sheaves). Now the sheaf 'Il M of micro functions is defined on p S*M by (R n - I ,*n- 1 !!2)" @ n- 1 w M , where W M de- notes the torientation sheaf of M. Note that Rj,*n- I !!2=0 holds for j#n-l. Remark: Here we have defined the sheaf 'f,M of micro functions on p S*M. However, it is sometimes more convenient to define the sheaf on p T*M (= T,::X) by the follow- ing convention: (6'M.(x.J-=i) = 'IlM.(X.J1W) if  #0, and :JiJ M . x , if  = O. Several authors (e.g., [23J) call this sheaf c(jM the sheaf of micro- functions and denote it by ({5M' (2) Basic Properties of Microfunctions. The sheaf ({5M defined above has the following properties: (i) The sheaf ({5 M is a tflabby sheaf 274E Microlocal Analysis on p S*M. (ii) For each (x,p oo) in p S* M, there exists a surjective mapping from :JiJ M x to ({5M.(x.,/-=-i w), This mapping is denoted by sp. The mapping from :JiJ M to n* 'Il M is also denoted by sp. (iii) We have the following exact sequence: O-->dM-->:JiJ M '.!; n/{5M-->O. (iv) Rkn* 'Il M = 0 holds for k #0. The exact sequence (iii) shows that the singu- larities of hyperfunctions are dispersed over p S* M and that the dispersed object is described by the sheaf 'Il M of microfunc- tions. For a hyperfunction f E:JiJM(M), we call sp(f) (E'f,(P S*M)) the spectrum of f We denote suppsp(f) by S.S.f and call it the singularity spectrum of f or the singular spec- trum. It is known [21 J that this coincides with the analytic wave front set of f and with the essential support of f if f is a distribution. (v) The following sequence is an exact se- quence on p SM: O-->',{M-"> ,-I J8 M -->n*,-1 ({5M -->0. In the following, a subset A of the (n-1)- dimensional sphere sn-1 is said to be convex ifw- 1 (A)U{0} 'is convex, where w is the ca- nonical projection from Rn,--{o} to sn-r, and ifw- 1 (A)U{0} is COnvex and includes no straight line, A is said to be properly convex. A subset Z of P SM (resp., p S*M) is also said to be (properly) convex if, -1(X) n Z (resp., n- 1 (x)nZ) is so for each x in M. For a subset Z of A SM its polar set ZC is, by definition, {(x,p xoo)EJ=I S*M I (v x , x> > 0 holds for each x + p V x 0 in Z}. The polar set zo of a subset Z of p. S* M is defined in the same way. (vi) Let V be an open subset of P SM such that ,-I(x)n V is a nonvoid connected set for each x in M. Let V denote V ao . Then we have (a) The restriction mapping p: r( V; ,,{ )--> r(V; ,,{) is a bijection. Here r(V; ,,{), etc. de- notes the space of global sections of ,,{ over V, etc. (b) O-->,'{(V)-->:JiJ(M)'Il(P S* M - VO) is an exact sequence. (vii) Let f(x) be a hyper- function on M. Then the following two state- ments are equivalent: (a) sp(f)(xo.I-locx,)=O. (b) There exist a finite family of open subsets V j of p SM whose polar set V/ does not contain (xo,p ooo) and qJj in r(Vj,d) such that f = L.jb(qJJ. Then we say that f is micro-analytic at (xo, P o 00). (3) Operations on Microfunctions. Let M and N be real analytic manifolds, and let f be a real analytic mapping from N to M. We de- note by T; M the kernel of the natural map- ping from N x M T*M to T*N. It is also called a conormal bundle supported by N. The as- sociated sphere bundle is denoted by SM. Denote by p and w the natural mappings from N xMP S* M'--P SM to P S* N and from N xMP S*M,--P SM to P S* M, respectively. (i) Let ,'!4 denote the 
274F Microlocal Analysis sheaf {UEflgMI S.S. unp SM = 0}. Then we have the fo11owing two canonical homo- morphisms: f*: flg->flgN and f*: p,ro- 1 'I1 M -> 'I1 N . Here and in what fo11ows, for a continuous mapping cp from N to M and a sheaf g; on N, CPIg; denotes the sheaf on M defined by assign- ing {SE r(cp -I (U); g;) I cplsupps: supps->U is tproper} to each open subset U of M. These two homomorphisms are consistent. We ca11 each of them a substitution (homomorphism) and denote it by (f*u)(y) = u(f(y)). (ii) Let V M denote the sheaf of tdensities. Then there exist the fo11owing two canonical homo- morphisms: f* :f,(flgN @"'NVN)->flgM 0"'M V M and f* :ro,p -I ('I1 N @"'NVN)->'I1M 0"'M v M . These two homomorphisms are consistent. We ca11 each an integration along a fiber and denote it by (f*u)(x) = Sr'(X) u. (iii) Using the result in (i), we can define the product u 1 U z of two hyper- functions U 1 and u z , if S.S. U 1 n (S.S. u z )" = 0. Furthermore, the singularity spectrum ofu1u z is contained in {(X,p(el +(1- O)z)oo)l(x,P I oo)ES.S.U I , (x,p zoo) ES.S.U z , OO I}US.S.u 1 US.S.u z ' F. Microdifferential Operators [22J (1) Microlocal Operators. Let M be a real analytic manifold, and define the sheaf 2 M on p S:f(M x M)=P S*M by £9-::jSM(MXM)('I1 MxM @V M ). A section K(x,y)dy ot2 M natura11y determines an integral oper- ator  : u(y)-> S K (x, y)u(y) dy. An operator thus obtained is called a microlocal operator, because it acts on the sheaf '11 M of microfunc- tions as a sheaf homomorphism. Usua11y we identify an operator .% and a kernel function K(x,y)dy. (i) 2 M is a sheaf ofrings by the natural composition. The unit element of 2 M is b(x - y) dy. It acts on '11 M as an identity operator. (ii) Let K(x,y)dy be the kernel func- tion of a microlocal operator . defined near (x o , p o 00). Then its adjoint operator .%* is, by definition, the microlocal operator de- fined near (x o , - p o 00) with the kernel function K(y, x)dy. The operation * is a sheaf isomorphism between 2 M and 2, where a denotes the antipodal mapping ( Section E). (2) Microdifferential Operators. A micro- differential operator is an analog of a micro- local operator in the complex domain. By a procedure similar to that used to defl,!1e the sheaf of microfunctions we first define the sheaf'l1 y1x of holomorphic microfunctions for a submanifold Y of X ( [22, definition 1.1.7 on p. 319], where it is denoted by (Ylx)' It follows from the definition that 'I1;fx is sup- 1022 ported by Pj!'(Y x X), which is identified with Y x x P* X. Here and in what fo11ows P* X, etc., denotes the cotangential projective bundle of X, etc. Then the sheaf,fff of microdifferen- tial operators (of infinite order) is, by defini- tion, 'IlXj'x@x @(1)x Qx , where Qx denotes the sheaf of holomorphic dim X -forms. Remark. Several notations are used to de- note ,fff in the literature. For example, [22J uses the symbol g>x and ca11s it the sheaf of pseudodifferential operators. As in the case of microfunctions, SOme authors use the symbol ,fff to denote a sheaf on T * X. In this case ,fff I x is, by definition, ::Zf, the sheaf of linear differential operators (of infinite order). One should be careful in these notational confu- sions in referring to papers using microdif- ferential operators. We note also that the sym- bols ::z and ,ff have nothing to do with the symbols in distribution theory. We now list the basic properties of micro- differential operators. (i) When X is a complexification of a real analytic manifold M,,fff I J-=!S*M is a subring of 2 M , (ii) Let Q be an open subset of p* X. Using a local coordinate system (x) on X, we define Q by {(x, )EC" X (C n _ {O}) I(x, oo)EQ}. Let {pix, ) }jeZ be a seq uence of holomorphic functions on Q satisfying the fo11owing condi- tions: (1) Pj(x,) is homogeneous of degree j in . (2) For each 8> 0 and each compact sub- set K of Q, there exists a constant C,.K such that SUPK Ipj(x, )I  c'.Kdj! (j;::'0) holds. (3) For each compact subset K of Q, there exists a constant R K such that SUPK Ipj(x, )I  Ri(j( -j)!(j < 0) holds. Then there is a one-to- one correspondence between the space of such sequences and the space of sections of ,fff over Q. (iii) A sequence satisfying the conditions in (ii) is ca11ed a symbol sequence, and the corresponding section of ,fff is denoted by L.jeZPiX, Dx). If we define a subsheaf ,ffx(m) of ,fff by {P=L.jPj(x,Dx)E,ffflpix,)=O(j;::' m + I)}, it is independent of the choice of the local coordinate systems. A microdifferential operator belonging to ,ffx(m) is said to be of order (at most) m. We denote Um,ffx(m) by ,ffx and can a section of ,ffx a microdifferential operator of finite order. (iv) Let <1> ,,(z) denote r(A)/( - z)A, where its branch is chosen so that <1>A(-l)=r(A). When A = 0, -1, - 2, " . , we consider its tfinite part. Let Q be a complex neighborhood of (xo,j=1 ooo)EP S*RnRn x p sn-I. Using a symbol sequence {Pj(z,)} on Q, we define a multivalued holomorphic function K(z, w,O by Ljp)Z, O<Dn+j(z-w,O) and consider its boundary value from the domain 
1023 Re<z -w, 0 <0. We denote the resulting micro function by I pix, J=1 )<1>n+j(J=1«x - y, 0 j + PO)). Then K(x,y)=(2n)-n f {Pix,J=1 )<1>n+j«x-y, J=1 O-O)}W() is a well-defined microfunction in a neighbor- hood of(xo,xo;J=1(o, -o)oo) whose sup- port is contained in the antidiagonal set a = {(x,y;J=1(,I])OO)EP 5*(M x M)lx= y, +I]=O}. Here w() is the volume element of the (n - 1 )-dimensional sphere 5 n - l . Hence K(x,y)dy defines a microlocal operator. The mapping which associates K(x,y)dy with {Pj(x,)} is compatible with the inclusion mapping stated in (i). (v) By using the tplane wave decomposition of the 8-function due to F. John ( 125 Distri- butions and Hyperfunctions CC), we find that microdifferential operators are a natural gen- eralization of tlinear differential operators: A linear differential operator corresponds to a symbol sequence {Pj(x, )L;,o, where Pj is a polynomial with respect to . (vi) (a) Let P=jpix,Dx) and Q= k qk(X, Dx) be microdifferential operators. Then their com position R = Po Q is a micro- differential operator with the symbol sequence {r[} [EZ given by 1 r[(x,) = I I" Dpix, )Dqdx, ). [= j+k-Iol a. Here D = alol/arl ... a' and a! = (Xl! ... an' for the tmulti-index a =(a l , ..., (Xn). (b) Let P = jPj(x, Dx) be a microdifferential operator. Let 8(x - y) denote the residue class [lJ of the left ,sx x x-Module ,sx x X/(l ,sx x x(x k - Yd + =I ,sxx X(a/aXk+ a/ayd). ("Module" means sheaf of modules.) Then there exists a unique microdifferential operator R = [r[(y, Dy) such that P(x, Dx)8(x - y) = R(y, Dy)8(x - y). Fur- thermore, r[(x, ) is given by (-l)j 0 0 I DDxpix, ). [j-Iol (X! R is called the adjoint operator of P and is denoted by P*. When X is a complexification of the real manifold M, it coincides with the adjoint operator P* E!l'. (vii) For a microdifferential operator P in ,sx(m), we define its principal symbol am(P) by Pm(x, ). The principal symbol am(P) is in- 274 F Microlocal Analysis dependent of the choice of local coordinate system. It gives an isomorphism between ,sx(m)/,sx(m - 1) and {(;1 * x(m), the sheaf of holomorphic functions on T* X which are homogeneous of degree m with respect to . (viii) (a) Let P be in ,sx(m)(Xo'o)' Assume that am(p)(x o , 0)#0. Then its inverse p- I (i.e., pp-I = p- 1 P= 1) exists in ,sx( - m)(Xo'o)' (b) Let P and G be in ,sx(m)(Xo.o" ,sx(l)(xo'o)' re- spectively. Suppose that H,(G)(am(P))(xO' o)= o (j = 0, ... ,p - 1) and that H:,(G)(am(p)) (x o , o) #0. (Here Hf(g) is, by definition, the tPoisson bracket {f, g} of rand g ( 82 Contact Trans- formations)). Then for each 5 in ,sx.(Xo.o) we can find Q and R in ,sx.(xo'o) so that 5 = QP + p R with (adGYR, [G,R], ...J =0 holds. This result is usually referred to as the Spath-type division theorem (for microdifferen- tial operators). In particular, when G = X n and (x o , o) = (0; 1,0, ... ,0), R has the form f;;;b R(k)(X, D')D. Here R(k)(X, D') = R(k)(X, DI' ..., Dn-I)' As a corollary to this expression we find the following (Weierstrass-type) pre- paration theorem (for microdifferential oper- ators): Let P be as above, and let G = X n . Then we can find Q and Win ,sX.(O.l.O. .0) such that P=QW with invertible Q and W=D:+ f;;;b W(k)(X, D')D, where W(k) belongs to ,sJ(p - k) and ap_k(W(k») (0; 1,0, .., ,0) = O. (ix) Quantized contact transformation. (a) Let X be an n-dimensional complex manifold and o an open subset of p* X. Let Ij (j = 1,2, ..., n) be in ,sx(1) (0) and Qj (j = 1, ..., n) in ,sx(O)(O). Assume that [Ij, PkJ = [Qj, QkJ = 0 and [Ij, QkJ = 8jk hold (1 j, k  n). Let cp be the contact transformation from 0 to p*cn defined by pr->(ao(Q 1 )(p), ..., aO(Qn)(P), adPd(p), "" a 1 (Pn)(P)). Then there exists a unique C-algebra homomorphism <1>: cp -I ,sc,->,sxln such that <1>(x) = Qj and <1>(D j ) = Ij (j = 1, ..., n). Further- more, <1> is an isomorphism, <1>,sc,(m) = ,sx(m) holds, and a m (<1>(R)) = am(R) 0 cp holds for R in ,se(m). We call the pair (cp, <1» a quantized con- tact transformation. In the above situation, the ,sxxC'-Module utt=,sxxc, !( i ,sxxe(xj-Q) }=I + j ,s x x e( - Dj - Ij)) is a simple holonomic system ( Section H) whose support is the graph of cpa. Let u be the canonical generator of jt, i.e., the residue class of 1 in..4l. Then R*u=<1>(R)u holds for RE,sC'. (b) Conversely, let cp be a contact transfor- mation from a neighborhood of P in P* X to p*cn, and let u be a generator of a simple holonomic system whose support is the graph 
274G Microlocal Analysis of (pa. Then a C-algebra isomorphism CI>: Ip -I rffc-4tffx is defined in a neighborhood of P through R*u=<1>(R)u (RE8d, and (1p,<1» becomes a quantized contact transformation. (c) In particular, let P be a point in J=1 S*M and Q its complex neighborhood. Let (Ip, CI» be a real quantized contact transformation de- fined on Q (i.e., Ip maps J=1 S* M to J=1. S*Rn). Then, in a neighborhood of (p, lp(p)a)E J-=1 S*(M x R n ), there exists a microfunction solution K(y,x) #0 of the equations xjK(y,x)= QjK(y,x), -(%x)K(y,x)=K(y,x) (j= 1,..., n) (XE Rn, YE M). Such a microfunction is unique up to constant multiple. The integral operator .ff: v(x)f-> S K(y, x)v(x)dx gives rise to a sheaf isomorphism between Ip -I <gR' and <gM in a neighborhood of p. Furthermore, we have %(Rv) = <1> (R)(%v) for vE'6'Rn and REtffen. This .ff is the counterpart of the Fourier inte- gral operator ( Section C). (x) Algebraic properties of tff;' and 8x. (a) tff x is tcoherent as a left tffx-Module, and its stalk is a t1eft Noetherian ring. (b) tff; is tfaithfully flat over gx. (c) Ifx is tflat over tffx(O). (d) tff x is flat over n- t :!tJ x . G. Microdifferential Equations [22,24J (1) Background. A system . of microdifferen- tial equations (of finite order) is by definition a tcoherent left (or right) 8x-Module, i.e., there exists locally an exact sequence of the form gk-4tff'J:-4 -40. For a coherent 8-Module , the support Supp of. is an impor- tant geometric object associated with .41. It is called the characteristic variety of .41. For a coherent -Module , its characteristic variety is by definition Supp(tff @'J< ). It is often denoted by S.S. .. Since a microdifferen- tial operator is a microlocal operator, the result in (viii) (a) of Section F (3) asserts that &d ),(, '6'M) is supported by the characteristic variety of M intersected with J=! S*M. Now, it is known that V = Supp  is tinvolu- tory (= involutive, in involution) in p* X, namely, II v=gl v=O entails {j;g} I v=O [22, theorem 5.3.2 on p. 453]. One of the most important problems in microlocal analysis is to study how much information V can give concerning the structure of  itself, and hence that of tffx/(, '6'M)' The epoch-making dis- covery of [22J is that V determines the struc- ture of tffw @g. at generic points of V as follows. (2) Structure Theorems. The fundamental result of [22, theorem 5.3.7 on p. 455J is the following theorem for coherent tff-Modules: Structure theorem 1. Let jl be a coherent IfrModule satisfying the following conditions: 1024 . dcf (1) IfJ"X<jt,tffx)=O for J#d. (2) V=Supp is regular at pE V in the sense that V is non- singular near p and that wi v(p) # ° for the tcanonical I-form w. Then, through a quan- tized contact transformation (Ip, <1», gw @  is isomorphic to a direct summand of a direct sum of finite copies of partial de Rham sys- tem .;Va = 8e,/(LJ=t tffc\%z) with Ip(p) = (0; 0,... ,0, 1)EP*e. By studying the canonical form of V under real contact transformations, [22J further gives the foIlowing structure theorem in a real domain, i.e., in J=l S* M for a real analytic manifold M. Structure theorem 2. Let X be a complexifi- cation of a real analytic manifold M, and let  be as in structure theorem 1. Let p be a point in vn J=! S* M. Suppose that the following three conditions are satisfied: (3) vn V is regu- lar at p. (4) (V)n (V)= (Vn V) holds for each q in vn V. Here V denotes the complex conjugate of V (with respect to J=1 S* M) and (V), etc., denotes the tangent space of V, etc., at q. (5) The generalized Levi form of V is of constant tsignature (a, b) near p, where the generalized Levi form of V is, by definition, the Hermitian form L(v)= L {pj,pd (x, J=1 )V})k j,k for the Pj such that V= njpj-t(O). Then IfW @ jl is isomorphic to a direct summand of a direct sum of finite copies of the system g L @ou1/ considered in a neighborhood of (x,J=! )=(0,J=l(0, ...,0, 1))Ej=! S*Rn, where ./1/ is given by a - f=O (j= 1, ...,r), iJx j ( 0 /1 0 ) +y-l- 1=0 OX'+2k-t OX,+2k (k= 1,... ,s), ( iJ 0 ) +J=1 X,+2,+I- 1=0 OX,.+2s+1 GX n (/= 1,... ,a), ( a 11 0 ) -y -1 X d 2,+I-::;- 1=0 CXr+2+1 OX n (/ = a + 1, .. . , a + b). Here, r=2codim V-codim(Vn V) and s= codim(Vn V)-codim V-(a+b). The first (resp. second) type of equation in the above are called (partial) de Rham equa- tions (resp. (partial) Cauchy-Riemann equa- tions). The third and the fourth are called Lewy-Mizohata equations (of type (a, b)) after these authors' pioneering works [25,26]. Thus any system is seen to be microlocally isomor- phic to a mixture of these three types of equa- tions, generically speaking. As a corollary to 
1025 this, structure theorem 2 clarifies the structure of microfunction solutions of ./If as follows: Structure theorem 3. Let M, X, ., V, and P be as in structure theorem 2. Then 87ih(8; @ ., 'iM)=O (j#a) holds, and the remaining cohomology group .'F = 87;{ (8; @ ,4/, (G'M) has the following structure in a neigh- borhood U of p: There exists an s-dimen- sional complex manifold Y, a real analytic manifold N, and a tsmooth mapping cp from vn U n J'=1 S* M to Y x J=1 S* N such that :IF = cp-1'fj for a sheaf 'fj on Yx J=l. S*N that is a direct summand of ,yeN, with.ff being the solution sheaf of the partial Cauchy- Riemann equations associated with Y. H. Holonomic Systems A coherent (left) 8-Module, is called holo- nomic if Supp. is tLagrangian. A coherent (left) !:0-Module ,Ii is called holonomic if 8' @ './ j{ is so. Even though the term "holo- nomic" is currently used, another term, "maxi- mally overdetermined," is used to describe the same object in some of the literature, including [22]. The importance of such a system lies in the fact that the space of its microfunction solutions is finite-dimensional [27,29]. I n this sense it resembles an ordinary differential equation. A holonomic system, however, does not satisfy condition (2) of structure theorem I of Section G, and its structure is rather com- plicated. A result which corresponds to struc- ture theorem I in Section G is the following: Let V be an involutory submanifold of T* X, and let 8v be the subring of 8 generated by {PE G'x(l)la l (P)I v = O}. Then a coherent 8x- Module Ai defined on an open subset Q of T* X is said to have regular singularities along V if for any point P of Q, there exist a neigh- borhood U of p and an 8v-sub-Module ,o of ./t{ defined on U which is coherent over 8(0), and which generates 11 as an 0'x-Module. A holonomic system is said to have R.S., which is an abbreviation for regular singular- ities, if it has R.S. along its support. Then for an arbitrary holonomic 8-Module Ai we can find a holonomic 0'-Module 'reg with R.S. such that 8 @J ./I{reg  8'" @J Ai holds. See [28] for the proof of this striking result and related topics on holonomic systems with reg- ular singularities. An elementary class of holonomic systems is that of simple holonomic systems. A holonomic 8-Module ./t{ is called simple if there exists a left Ideal .fi such that Ai = 81J and that the symbol Ideal {a(P) I PE,-'} coincides with the defining Ideal of Supp.. Let u denote the generator I mod.-' of a simple holonomic 274 H Microlocal Analysis system (/;//1. Suppose that A=SUPPdi! is nonsingular. Then the principal symbol aA(u) is defined as follows: For P=L.jp)X,Dx)E8(m), let Lp denote the first-order linear differential operator ( 1 (")2 Pm(X, ) ) Lp=Hprn(x.)+ Pm-1(.X')-- 2 I _ _ . i OXiC i Here H denotes the Hamiltonian vector field , Pm defined by {Pm,' }. Let QA and Qx denote the sheaf of n-forms of A and X, respectively, and let Q)!/2 (resp. QI/2 @ Q -1/ 2 ) denote a line bundle L such that L0 2 is isomorphic to QA (resp. QA @ Q-I). Since the line bundles QI/2 and Q112 @ Q-1/2 do not exist globally in general, aII the equations among their sections should be understood up to a constant multi- plicative factor. When A is a purely imaginary Lagrangian submanifold of J=l S* M for a real analytic manifold M, these line bun- dles can be constructed globally by using the Keller-Maslov index ( Section C). Let v and t/J denote respectively the first term and the second term in the right-hand side of the above definition of L p . Then Lp:Q1/2...... QI/2 is given by L p (s)=(1/2s)LJs 2 )+t/Js for SE QI/2, where LJS2) denotes the tLie deriva- tive of 52 along v. One can then prove that the system of equations L p 5 = 0 (PE,-') admits locally one and exactly one nonzero solution s in QI/2 up to a constant factor. Then the principal symbol a1\(u) of u equals s @  E QI/2 @ Q-1/2 by definition. The principal symbol a,.du) is homogeneous with respect to , and its homogeneous degree is called the order of u and is denoted by ordA(u). The microlocal structure of a simple holonomic 8 -Module with a nonsingular characteristic variety is determined by the order of its gen- erator as follows: (a) Let Ii'u and 8v denote simple holonomic 8-Modules with the same characteristic variety A. Then 8u is isomorphic to 8v if and only if ordA(u) - ordA(v) is an integer. (b) Let 6'u be a simple holonomic system, and let ac denote the order of u. Then through a suitable quantized contact trans- formation, 8u is isomorphic to 8C"w, where w satisfies (ZI a1 +( ac+D )w=o, cw -=0 (j=2,..., n). aZ j Thus the microlocal structure of a simple holonomic system Ai = 8u is fairly simple at nonsingular points of its characteristic variety. Moreover a Hartogs-type theorem for micro- differential equations [28, ch. I, 92J entails that if Supp .,1{ has the form AI U A 2 with Lagrangian manifolds Al and A 2 such that 
274 I Microlocal Analysis codimA,(A t nA2)2; then At has the form '1 EB At 2 with supp.-4ij=A j (j= 1,2). Hence the case where codim A , (A, n A 2 ) = 1 is impor- tant. Suppose Al and A 2 are nonsingular in this case and TxAl =1= TxA2 at a point xEA 1 nA 2 . Then, through a quantized contact transfor- mation (Ij;, cD), the system . is isomorphic to 6"cnw, with w satisfying the following equations defined near (0; dz 1 (0): (ZI a1 +(a 1 +D )w=O, (z2+(at -a 2 + ) Iw=o, \ aZ 2 \. 1. ) aw -=0 (j=3,...,n), aZ j where aj = ordA/u) (j = 1,2) with Ij;(A 1 ) = {ZI =0, 2 =... =n=O} and Ij;(A 2 ) = {ZI = Z2 = 0, 3 = ... = n = O}. For more general cases and an application  [30]. The theory of holonomic systems and its applications are being studied most inten- sively, raising the hope of establishing a uni- fied theory of special functions of several variables;  [23J and references cited therein. I. History Microlocal analysis means local analysis on the cotangent bundle. It emphasizes the im- portance of localization in cotangent bun- dles in analysis, which was pointed out by S. Mizohata [5, 6J immediately after the advent of singular integral operators in the works of A. P. Calderon and A. Zygmund [3,4]. Since then, localization in the cotangent bundle has been used frequently in the theory of linear partial differential equations. R. T. Seeley [31J proved that the symbol of a singular integral operator is well defined on the cotangent bundle. Works by J. 1. Kohn and L. Nirenberg [32J and L. Hormander (Comm. Pure Appl. Math., 18 (1965» strengthened the trend of localizing the problem on the cotangent bun- dle. Although it seems that the term "micro- local analysis" first appeared in the literature in 1973 (T. Kawai, Asterisque, 2 and 3 (1973», the basic part of the theory had been con- structed during the period from 1969 to 1972 by M. Sato (Proc. Intern. Con! Functional Anal. and Related Topics, 1969), Yu. V. Egorov [12], Hormander [8, 20J, J. 1. Duistermaat and Hormander (Acta Math., 128 (1972», M. Kashiwara and Kawai (Proc. Japan Acad., 46 (1970», and Sato, Kawai, and Kashiwara [22]. Apparently the work of V. P. Maslov [IIJ had an important influence on the work of Egorov. The most important part of Sato's 1026 contribution was the fact that, through the construction of the sheaf of microfunctions he found that the singularities of hyperfunctions can be canonically dispersed over the cotan- gent bundle ( Section E) and that a hyper- function solution u of a linear differential equation pu=o is concentrated on the charac- teristic variety when its singularities are thus dispersed ( Section G). The last-stated fact was also formulated by Hormander [8,20J in the framework of distribution theory. The most important part of the contribution of Egorov [12J was the discovery that one can 11C'P 'JIn ;nt,c.lTr'Jol tr<:1ncofArn"r:ltIA...... l......tr........1"".""'.rl h" u...... u.u: J.11""""5J.UJ. l-J.UJ.J0.3JVIU.J.ULJVIIIIILIVUU,"",""U uy V. I. Eskin [7J to find a transformation of pseudo differential operators compatible with a homogeneous canonical transformation, i.e., a contact transformation, so that the commut- ation relations and the orders of the operators can be preserved. Hormander (Acta Math., 121 (1969» independently introduced integral transformations of the same type, calling them Fourier integral operators. Egorov [13J and L. Nirenberg and F. Treves [33J successfully used the transformation of operators to study the regularity and existence of solutions. Sub- sequently, Hormander [8J elaborated the theory of Fourier integral operators. Kashi- wara and Kawai (Proc. Japan Acad., 46 (1970» observed that a pseudodifferential operator in their sense (now called a microdifferential operator;  Section F) gives rise to a sheaf homomorphism on the sheaf of microfunctions and that the structure of the microfunction solutions of pseudodifferential equations is determined by the principal symbol of the operator in question if it has simple character- istics. Then Sato, Kawai, and Kashiwara [22J succeeded in amalgamating these two theories, namely, the theory of microfunctions and the theory of the transformation of operators. (They called the transformation a quantized contact transformation in [22J). Such an amal- gamation was also done independently by Hormander [8, 20J, who introduced the notion of the wave front set for distributions as a substitute for the support of microfunctions. Incidentally, it is noteworthy that C. Chandler and H. P. Stapp (J. Math. Phys., 10 (1969» and D. Iagolnitzer and Stapp (Comm. Math. Phys., 14 (1969» obtained a notion similar to the singularity spectrum in a physical context. Their results were later elaborated (around 1971-1973) by J. Bros and Iagolnitzer [19]. With the aid of the above-mentioned amal- gamation of the theories, the works of Sato, Kawai, and Kashiwara [22], Hormander [20J, Duistermaat and Hormander (Acta Math., 121 (1969)), Kawai (Publ. Res. Inst. Math. Sci., 7 (1971-1972» and K. G. Andersson (Trans. Amer. Math. Soc., 177 (1973» have clarified the 
1027 importance of the bicharacteristic strip, which is a submanifold of a cotangent bundle, not a base manifold, as a carrier of singularities of solutions of pseudodifferential equations ( Section G); and the works of Sato, Kawai, and Kashiwara [22], Sato (Actes Congr. Internat. Math. Nice, 1971) and Kawai (Publ. Res. Inst. Math. Sci., 7 (1971-1972)) revealed the hidden mechanism of the celebrated counterexample of H. Lewy [25]. Among these, the contri- bution of Sato, Kawai, and Kashiwara [22] was most decisive and fundamental in that it first clarified the structure of a general system of pseudodifferential equations at geneiic points of the characteristic variety and then derived from it the above-quoted results on the structure of the solutions ( Section G). Microlocal analysis has now become one of the most important and basic concepts in the theory of linear partial differential equations and theoretical physics. For recent develop- ments  Hormander [34] and V. Guillemin, Kashiwara, and Kawai [23] and references cited therein. References [1] J. J. Duistermaat, Fourier integral opera- tors, Lecture notes, Courant Inst. Math. Sci., 1973. [2] V. GuilIemin and S. Sternberg, Geometric asymptotics, Amer. Math. Soc. Math. Surveys 14 (1977). [3] A. P. Calderon and A. Zygmund, Singular integral operators and differential equations, Amer. J. Math., 79 (1957), 901-921. [4] A. P. Calderon, Uniqueness in the Cauchy problem for partial differential equations, Amer. J. Math., 80 (1958), 16-36. [5] S. Mizohata, Systemes hyperboliques, J. Math. Soc. Japan, 11 (1959),205-233. [6] S. Mizohata, Note sur Ie traitement par les operateurs d'integrale singuliere de Cauchy, J. Math. Soc. Japan, 11 (1959), 234-240. [7] G. 1. Eskin, The Cauchy problem for hyperbolic systems in convolutions, Math. USSR-Sb., 3 (1967), 243-277. [8] L. Hormander, Fourier integral operators I, Acta Math., 127 (1971), 79-183. [9] 1. Chazarain (ed.), Fourier integral opera- tors and partial differential equations, Lecture notes in math. 459, Springer, 1975. [10] 1. B. KelIer, Corrected Bohr Sommerfeld conditions for nonseparable systems, Ann. Phys., 4 (1958), 180-188. [11] V. P. Maslov, Theorie des perturbations et methodes asymptotiques, Gauthier-Villars, 1972. (Original in Russian, 1965.) [12] Yu. V. Egorov, On canonical transfor- mation of pseudodifferential operators (in 274 Ref. Microlocal Analysis Russian), Uspekhi Mat. Nauk, 25 (1969), 235- 236. [13J Yu. V. Egorov, Conditions for solvability of pseudodifferential equations, Soviet Math. Dok!., 10 (1969), 1020-1022. [14] D. Fujiwara, On the boundedness of integral operators with highly oscillatory kernels, Proc. Japan Acad., 51 (1975), 96-99. [15] H. Kumano-go, A calculus of Fourier integral operators on R" and the fundamental solution for an operator of hyperbolic type, Comm. Partial Diff. Eq., 1 (1976), 1-44. [16] K. Asada and D. Fujiwara, On some oscillatory integral transformations in L 2 (R"), Japan. J. Math., 4 (1978), 299-361. [17] A. Sommerfeld, Optics, Lectures on Theoretical Physics, vo!. 4, Academic Press, 1964. [18] P. D. Lax, Asymptotic solution of oscil- latory initial value problems, Duke Math. J., 24 (1957),627-646. [19] D. Iagolnitzer, Analytic structure of distributions and essential support theory, Structural Analysis and Col1ision Amplitude, North-Hol1and, 1976,295-358. [20] L. Hormander, Uniqueness theorem and wave front sets for solution of linear differen- tial equation with analytic coefficients, Comm. Pure App!. Math., 24 (1971),671-704. [21] 1. M. Bony, Equivalence des diverses notions de spectre singu1ier analytique, Semi- naire Goulaouic-Schwartz 1976- 77, Ecole Poly technique. [22] M. Sato, T. Kawai, and M. Kashiwara, Microfunctions and pseudodifferential equa- tions, Lecture notes in math. 287, Springer, 1973,265-529. [23] V. Guillemin, M. Kashiwara, and T. Kawai, Seminar on microlocal analysis, Ann. math. studies 93, Princeton Univ. Press, 1979. [24] J. E. Bjork, Rings of differential opera- tors, North-Hol1and, 1979. [25] H. Lewy, An example of a smooth linear partial differential equation without solution, Ann. Math., 66 (1957), 155-158. [26] S. Mizohata, Solutions nul1es et solutions nonanalytiques, J. Math. Kyoto Univ., 1 (1962), 271- 302. [27] M. Kashiwara, On the maximalIy over- determined system of1inear differential equa- tions I, Pub!. Res. Inst. Math. Sci., 10 (1974- 1975), 563-579. [28] M. Kashiwara and T. Kawai, On holo- nomic systems of microdifferential equations III, Systems with regular singularities, Pub!. Res. Inst. Math. Sci., 17(1981),813-979. [29] M. Kashiwara and T. Kawai, Finiteness theorem for holonomic systems of micro- differential equations, Proc. Japan Acad., 52 (1976),341-343. [30] M. Sato, M. Kashiwara, T. Kimura, and 
275 A Minimal Submanifolds T. Oshima, Microlocal analysis of prehomo- geneous vector spaces, Inventiones Math., 62 (1980),117-179. [31J R. T. Seeley, Singular integrals on com- pact manifolds, Amer. J. Math., 81 (1959),658- 690. [32J J. J. Kohn and L. Nirenberg, An algebra of pseudodifferential operators, Comm. Pure AppJ. Math., 18 (1965), 269-305. [33J L. Nirenberg and F. Treves, On local solvability of linear partial differential eq ua- tions II, Comm. Pure AppI. Math., 23 (1970), 459-510. [34J L. Hormander, Seminar on singularities of solutions of linear partial differential equa- tions, Ann. math. studies 91, Princeton Univ. Press, 1979. 275 (V11.14) Minimal Submanifolds A. General Remarks An immersion f of an m-dimensional manifold M with boundary aM (possibly empty) into a Riemannian manifold N is caIled minimal if the tmean curvature vector field H of M with respect to the induced Riemannian metric vanishes identically. Then Mis caIled a mini- mal sub manifold of N. This definition comes from the foIlowing variational problem: By a smooth tvariation of I is meant a smooth map- ping F:I x M ->N, where I =( -1,1), such that each;; = F( t, . ): M -> N is an immersion, fo = f, and f, I aM = II aM for all tEl. Let dV, be the volume element of the metric induced by;;, and set V(t)= SM dV" the volume of M at time t. Then the first variation of the volume is expressed as dV ! f ( a I ) - = -m H,F*- dVo, dt 1=0 M at 1=0 where a/at denotes the canonical vector field along the I factor in I x M. Thus the mean curvature vector field H of f vanishes identi- caIly if and only if d V/dt I t=O = 0 for all vari- ations of f Therefore a minimal submanifold gives an extremal of the volume integral, though neither necessarily minimal nor of the least vol ume. In the case N = R", an immersion x: M -> Rn is viewed as a vector-valued function, and the mean curvature vector field H is expressed as H = t1x/m, where t1 denotes the tLaplace- Beltrami operator -gijvv. Thus x is minimal if and only if each compent function of x is harmonic. In particular, there is no compact minimal submanifold without boundary in R n . 1028 The history of the theory of minimal sub- manifolds goes back to J. L. Lagrange, who studied minimal surfaces in the 3-dimensional Euclidean space R 3 . In 1762 he developed his algorithm for the tcalculus of variations, which can be applied to higher-dimensional problems and is now known as the tEuler- Lagrange equation. For instance, let D be a domain in the plane R 2 and Z = f(x, y), (x, y) E D, the equation of a surface in R 3 . As a neces- sary condition for the surface to have the least area among the surfaces with fixed boundary, Lagrange obtained a tquasilinear eIliptic par- tial differential equation of the second order, called the minimal surface equation: (1 +z;)zxx-2zxzyzxy+(1 + z;)Zyy =0. Before this, in 1744, L. Euler had found that a tcatenoid is a minimal surface. In 1766, J. B. M. C. Meusnier showed that a right thelicoid is a minimal surface. Besides catenoids and helicoids, in 1834, H. F. Scherk found that the surface defined by z=log(cosy)-Iog(cosx) is a minimal surface, which is called Scherk's surface. In the latter half of the 19th century, tPla- teau's problem ( Section C; 334 Plateau's Problem) was studied extensively by O. Bon- net, B. Riemann, K. Weierstrass, A. Enneper, G. Darboux, and others. The problem is stated as foIlows: Given a tJordan curve f in R 3 (or in R n ), find a surface of least area having f as its boundary. On the other hand, in 1866, Weierstrass gave a general formula, caIled the Weierstrass-Enneper formula ( Section B (5)), to express a simply connected minimal surface in terms of a complex analytic function and a meromorphic function with certain properties. The formula aIlows one to con- struct a great variety of minimal surfaces by choosing those functions. The existence of a minimal surface of disk type having a prescribed boundary curve was first obtained in 1930 by 1. Douglas and T. Rad6 independently as a solution to Plateau's problem, admitting singularities. The result was improved by R. Courant for the case of finitely many boundary curves by the method of tDirichlet integrals. The method was carried out further by C. B. Morrey for the gener- alized Plateau's problem in a Riemannian manifold ( Section C (5)). Indeed, the Euclid- ean space was replaced by any complete Rie- mannian manifold which is metricaIly weIl behaved at infinity. For example, any compact or any thomogeneous Riemannian manifold is in this class. The existence proof of minimal surfaces cannot in general be applied directly to the case of higher-dimensional minimal submani- folds. The notion of varifolds ( Section G (2)) 
1029 and the corresponding generalization of a minimal submanifold, called a minimal variety ( Section G (2)), were then introduced by F. Almgren, who proved the existence of a mini- mal variety in a compact Riemannian mani- fold. It was also proved that a minimal variety is approximated by regular minimal submani- folds. This work has been developed as geo- metric measure theory by E. R. Reifenberg, W. H. Fleming, H. Federer, F. Almgren, and others ( Section G). The study of minimal surfaces given as the graph of a real-valued function of two vari- ables leads to that of solutions of the minimal surface equation. In 1915, S. Bernshtein proved that a minimal surface z=.f(x,y) defined on the entire plane R 2 must be a plane. Sub- sequently, this was generalized to the following problem: Is a minimal hyper surface xn+l = f(xj,... ,x n ) in Rn+l defined on the entire space R n a hyperplane? The answer turns out to be affirmative for n  7 and negative for n  8 ( Section F (1)). In general, when a bounded domain D in Rn and a continuous function cp on its boundary aD are given, the problem of finding a minimal hypersurface M defined by the graph of a real- valued function f on 15 with fl aD = cp gives rise to a typical tDirichlet problem. The basic questions are those of existence, uniqueness, and regularity of solutions. These were first studied by Rad6 for n = 2 and later by L. Bers, R. Finn, H. Jenkins, J. Serrin, R. Osserman, and others ( Section D (1)). Minimal submanifolds of a sphere have interesting properties; some are analogous to those in Euclidean space but some are not. Among them, 1. Simons, in 1968, gave a dif- ferential equation that the norm of the second fundamental form of a minimal submanifold in a sphere should satisfy. He then showed that a totally geodesic submanifold is isolated among the compact minimal submanifolds in the sphere. S. S. Chern, M. P. do Carmo, S. Kobayashi, N. Ejiri, T. Ito, T. Otsuki, N. Wal- lach, and others have made further contri- butions to this subject ( Section F (2)). B. Minimal Surfaces (1) Branched minimal surfaces. A minimal surface M in Rn is an immersed surface with vanishing mean curvature vector field. M equipped with an atlas of tisothermal coor- dinates is viewed as a tRiemann surface. A branch point of a tharmonic mapping f: M --> R n is a point at which the differential dfp is zero. A harmonic mapping}: M __R n of a Riemann surface M is called a branched (or 275 B Minimal Submanifolds generalized) minimal immersion if it is con- formal except at the branch points, and the image f(M) is then called a branched minimal surface. The solution to Plateau's problem given by Douglas and Rad6 is a branched minimal surface. (2) Maximum principle. When n = 3, the following maximum principle for minimal surfaces holds: If M 1 and M 2 are two con- nected branched minimal surfaces in R 3 such that for a point pEM l n M 2 , the surface M 1 locally lies on one side of M 2 near p, then M 1 and M 2 coincide near p. (3) Convex hull property. In general, every branched minimal surface with boundary in R n lies in the convex hull of its boundary curve, the smallest closed convex set containing the boundary. (4) Reflection principle. If the boundary curve of a branched minimal surface contains a straight line y, then the surface can be ana- lytically continued as a branched minimal surface by reflection across y. Based on this principle, the following holds: Let r be an analytic Jordan curve in Rn and f: M __Rn a branched minimal immersion with boundary f. Then f is analytic up to the boundary, i.e., f(M) is contained in the interior of a larger branched minimal surface (H. Lewy). The smooth version of this theorem was given by S. Hildebrandt: If f: M --> Rn is a branched minimal immersion with smooth boundary curve, then f is smooth up to the boundary. (5) Weierstrass-Enneper formula. Every simply connected minimal surface in R 3 is represented in the form Xk(O=Re{f k(Z)dZ}+Cb where 1 = f(1 - g2)/2, 2 = J=1 f(1 + g2)/2, 3 = ft.], and C k is a constant. Here, g(O is a meromorphic function on a domain D in the complex (-plane, and frO is an analytic func- tion on D with the property that at each point (, where g«() has a pole of order m, frO has a zero of order 2m, D being either the unit disk or the entire plane. This formula is quite use- ful for constructing various minimal surfaces. For instance, on setting f == 1 and g(O =" Enneper's surface is obtained: k=I,2,3, U 3 2 v 3 2 2 2 (X 1 ,X 2 ,X 3 )=(U- 3 +uv ,v-"3+VU ,U -v), (u,V)ER 2 . Another feature of the formula is that general theorems about minimal surfaces can be ob- tained by translating statements about analy- tic functions into the corresponding minimal- surface ones. For example, a surface f: M --> R 3 is minimal if and only if the tGauss mapping 
275C Minimal Submanifolds G: M ->5 2 is antiholomorphic, i.e., the complex conjugate of the mapping is holomorphic, when 52 is viewed as the Riemann sphere with complex structure by the stereographic projec- tion (from the south pole) onto the complex plane. As a consequence, a complete minimal surface in R 3 is a plane or else the normals to the surface are everywhere dense in 52 (R. Osserman). (6) Stability. A minimal submanifold Mis called stable iffor every compact region on M all the second variations of the volume are positive. For instance, if M -> Rn is a minimal hypersurface and fv is a variation vector field, v being the unit normal vector field of M and f a smooth function with compact support on M, then the second variation formula for the volume Vet) is given by d 2  I = r (lV'fI2-IAI2f2)dv o , dt tO JM where I A I denotes the norm of the second fundamental form of M. In particular, when n=3, then -IAI2/2=K, the Gauss curvature of M. Therefore a minimal surface M in R 3 is stable if and only if L (1V'f1 2 +2KP)dV>0 for any smooth function f with compact sup- port on M. It follows that the stability of mini- mal surface M in R 3 is related to the bound- ary value problem of a linear elliptic operator L =  - 2K,  being the Laplacian on M. Namely, a minimal surface M in R 3 is stable if and only if the first eigenvalue A. 1 (D) of L on any bounded domain D in M is nonnegative. In connection with the tGauss mapping, a sufficient condition for a domain D in a mini- mal surface to be stable was obtained by H. Schwarz in 1885: If a minimal surface M in R 3 has one-to-one Gauss mapping G: M ->5 2 , then a domain Dc M such that G(D) is con- tained in a hemisphere of 52 is stable. This was generalized by J. L. Barbosa and M. do Carmo as follows: Ifthe area A(G(D)) of the spherical image G(D) is less than 2n, then D is stable. This result is sharp in the sense that for any F. > 0 there exists an unstable domain D with A(G(D))=2n+F.. As an analog to Bernshtein's theorem, the following holds: A complete and stable minimal surface in R 3 is a plane. H. Mori has also obtained results in this regard. As for the existence of unstable minimal surfaces, it is known that if there are two dis- tinct stable minimal surfaces with the same smooth boundary curve fin R 3 , then there exists an unstable branched minimal surface with f as its boundary (M. Morse and C. Tompkins, M. Schiffman). 1030 C. Plateau's Problem ( 334 Plateau's Problem) Let f be a trectifiable tJordan curve in Rn and D = {(x, Y)E R 2 1 x 2 + y2 < 1}. Then there exists a continuous mappingf:15->R n such that (a) fl aD maps homeomorphically onto f; (b) f I D is harmonic and almost conformal, i.e., <fx'!y) =0 and Ifxl=lfyl in D with 1df1>0 except at isolated branch points; (c) the in- duced area of f is the least among the class of piecewise smooth surfaces bounding f with (a). This mapping f is called the classical solu- tion or the Douglas-Rado solution to Plateau's problem for f, which may have singularities called tbranch points. The resulting surface 5 is a branched minimal disk. This theorem establishes the existence of a surface of least area among all surfaces homeomorphic to a disk. It has been generalized by R. Courant for f consisting of finitely many rectifiable Jordan curves. For a branched minimal disk 5 bounded by smooth fin R 3 , there is a relation, called the Gauss-Bonnet-Sasaki-Nitsche formula, among the ttotal curvature K(f) of f, the total curva- ture of 5, and the orders of branch points: 1 + I(m x -l)+ IMp+ r IKldAK(f), 2n J s 2n where K denotes the Gauss curvature of 5, m x - 1 the orders of the interior branch points, and 2M p the orders of the boundary branch points, which must be even. (1) Regularity. A minimal disk ofleast area in R 3 has no boundary branch points when f is real analytic (R. Gulliver and F. Leslie) or when f is smooth and the total curvature K(f) is less than 4n (1. C. C. Nitsche). In general, a classical solution for smooth fin R 3 cannot have infinitely many branch points. A re- markable fact is that every classical solution to Plateau's problem in R 3 is free of branch points in its interior, i.e., is a regular immer- sion (R. Osserman). (2) Embeddedness. A classical solution is not necessarily an embedding; it may have self-intersections. For instance, if f is knot- ted in R3, then every solution must have self- intersections. It is known, however, that im- mersed minimal disks of least area in R 3 which can self-intersect only in their interiors are embeddings. In particular, if f is an extremal Jordan curve, i.e., if f lies on the boundary of its tconvex hull, then the classical solution for f is an embedding (F. Tomi and A. 1. Tromba; W. H. Meeks and S. T. Yau). Ifthe topological type is not specified, then there always exists an embedded minimal surface. That is, if f is the union of any finite collection of disjoint 
1031 smooth Jordan curves in R 3 , then there exists a compact embedded minimal surface with boundary r which is smooth up to the bound- ary (R. Hardt and L. Simon). However, there exists an unknotted Jordan curve that never bounds an embedded minimal disk. (3) Uniqueness. In general, the classical solutions to Plateau's problem do not have the uniqueness property. Rad6 was the first who gave a condition on a boundary curve to guarantee the uniqueness of the minimal disk. Namely, if a Jordan curve f in R n admits a one-to-one orthogonal projection onto a con- vex curve in a plane R 2 in R n , then the classical solution to Plateau's problem for f is free of branch points and can be expressed as the graph over this plane. When n = 3, the solution is unique. Another geometric condition on r has been given by Nitsche: An analytic Jordan curve f in R 3 with total curvature K(r)  4n (or a smooth f with K(r)<4n) bounds a unique immersed minimal disk. Moreover, the generic uniqueness holds: In the space .sf of al1 smooth Jordan curves in R n with suitable topology, there exists an open and dense sub- set ;;g such that for any f in ;;g there exists a unique area-minimizing minimal disk (F. Morgan and A. 1. Tromba). (4) Finiteness. As for the finiteness of the classical solutions to Plateau's problem, several conditions on boundary curves are known. An analytic Jordan curve in R 3 bounds only finitely many minimal disks of least area (F. Tomi). An analytic textremal Jordan curve in R 3 bounds only finitely many minimal disks with relative minima of area (H. Beeson). A smooth Jordan curve r with total curvature K(q  6n bounds only finitely many minimal disks (Nitsche). Moreover, generical1y, i.e., for an open and dense subset of boundary curves, there are at most finitely many minimal surfaces with given boundary, relative minima or not (R. Bohme and Tromba). (5) Generalized Plateau's problem. C. B. Morrey's setting of the generalized Plateau's problem is as fol1ows: A homotopical1y trivial rectifiable Jordan curve f is given in an n- dimensional Riemannian manifold N. Let D denote a disk in R 2 . Find a mappingf:D->N such that (a) fi aD maps homeomorphical1y onto r, (b) the induced area of f is the least among the class of piecewise smooth surfaces in N bounded by r with (a). Obviously when N = R", this is the classical Plateau's problem. A solution was given by Morrey under the assumption that N is homogeneously regular; i.e., that there exist 0 < k < K such that for any point YE N there is a local coordinate system (V, <1» around y for which <1>(V) = {xER"lllxll < 1} and the Riemannian metric 275D Minimal Submanifolds L g,P dx' dx P satisfies k I (V')2 Ig,pv'vP KI (V')2 for any x and (vr, ..., V")E R". Any compact or any homogeneous Riemannian manifold is homogeneously regular. Morrey's solution is as fol1ows: If N is a homogeneously regular Riemannian manifold and if r is a homotopi- cal1y trivial rectifiable Jordan curve in N, then there exists a branched minimal immersion f: D->N with least area bounded by r with (a). The regularity of Morrey's solution is simi- lar to the classical one. If r is smooth, then so is the solution f up to the boundary. If, furthermore, dim N = 3, then the solution f is an immersion in its interior. If Nand rare real analytic and dim N = 3, then the solution f is an immersion up to the boundary. D. Existence Problems of Minimal Submanifolds (I) The Dirichlet problem for the minimal surface equation. When the graph of a (vector- valued) function is a minimal submanifold in some Euclidean space, the function must satisfy the so-cal1ed tminimal surface equation. To be precise, let D be a (bounded) domain in R", f: D-> R k a (vector-valued) function, and put F:D->Rn+k:F(x)=(x,j(x)) for XED. Let M be the graph off:M =F(D). Then M is mini- mal if and only if f (or F) satisfies one of the fol1owing equations: if n = 2 and k = 1, (1 + f/)fxx- 2 fxf y fxy+(1 + f/)fyy=O; (1) if n is arbitrary and k = 1, " a ( 1 a f ) I ----; -------z = 0 where i=laX wax W= ) l +IVfI 2 ; (2) if n = 2 and k is arbitrary, (1 + If y l2)fxx - 2(fx' fy)fxy+ (1 + IfxI 2 ).fyy= 0; (3) and if nand k are arbitrary, n " a 2 f aF aF I g')=O where gij=---O-----', (4) i.j=1 ax'ax) ax' ax J and (gi is the inverse matrix of (g;)- The basic problem for this class of equations is the tDirichlet problem. For n=2 and k= 1, the fol1owing theorem of Rad6 and Finn is fundamental: There exists a solution f of the Dirichlet problem corresponding to an arbi- trary continuous function ({J On the boundary aD of D if and only if D is convex. Since the difference of any two solutions of eq. (1) satis- fies the maximum principle, there can be at most one solution f of the Dirichlet problem 
275 E Minimal Submanifolds corresponding to a given continuous function ({J on aD. As for the removability of isolated singularities, the following is known: Every solution of eq. (1) in 0< x 2 + y2 < 1 extends continuously to the origin. The extended func- tion is smooth at the origin and satisfies eq. (1) in the full disk x 2 + y2 < 1 (Finn). For arbitrary nand k = 1, namely, for the case of minimal hypersurfaces, the following is known: Let D be a bounded domain in Rn with smooth boundary. The Dirichlet problem for eq. (2) has a solution for any continuous func- tion ({J on aD if and only if the mean curvature l""\f;Jf) Ul'ith rp.cnp.r>t trio +1,,,,, Inn""'r n.......-r...............1 " ................. >JJ. l/.L/ n 110.11 1.....p..........L LV Lll..... lUll...... 11Vl1ual 1.::') lIVU- negative at every point. As for the uniqueness and regularity, the same results as above hold (Jenkins and Serrin). For n = 2 and arbitrary k, namely, for the case of minimal surfaces in R 2+\ exactly the same result of existence as in the classical case k = 1 holds. A solution exists for an arbitrary continuous vector-valued function on aD if and only if D is convex (Osserman). Though the removability of singularities holds under suitable restrictions, the uniqueness fails in this case. F or the last case, n> 2 and k> 1, there are essentially no results on either existence or uniqueness for the Dirichlet problem. (2) Existence of minimal surfaces and top- ology. For a compact Riemannian manifold N, there are results on the existence of minimal surfaces related to the homotopy groups of N. Let f be a continuous mapping from a tRiemann surface Eg of tgenus g into N. If the tinduced mappingf# :ndEq)-->n,(N) of f on the tfundamental groups is injective, then there exists a branched minimal immersion h : Eg-->N such that h# = f# on n,(E.) and the induced area of h is the least among all mappings with the same action on n dE.). If furthermore n 2 (N)=0, then h can be deformed fromj continuously (R. Schoen and S. T. Yau). If n 2 (N)#0, then there exists a generating set for n 2 (N) consisting of branched minimal immer- sions of spheres that minimize energy and area in their homotopy classes (1. Sacks and K. Uhlenbeck). If, furthermore, dimN = 3, then the above minimal immersion h in the homo- topy class is an embedding or a two-to-one covering mapping. In the second case, its image is an embedded real projective plane. If h' is another such least-area mapping, then either h' is equal to h up to a conformal re- parametrization or the images of hand h' are disjoint (Meeks and Yau). E. Gauss Mapping of Minimal Surfaces On a connected, orient able surface M there exist local isothermal coordinates (x,y) at each 1032 point. On putting z = x + J=l y, the metric has the form ds 2 =2FldzI 2 , and M is viewed as a Riemann surface. A conformal immersion f: M -->Rn is minimal if and only if t1f =0, or equivalently a z 8zf = 0. In the case n = 3, the tGauss mapping G: M -->S2 of a surface f: M --> R 3 is defined by assigning to each poin t pE M the unit normal vector translated paral- lel to the origin of R 3 . A surface j: M -->R 3 is minimal if and only if the Gauss mapping is antiholomorphic. In the general case, the Gauss mapping G is defined to be a mapping assigning to each point p E M the oriented tangent space f* J;,Ol1) eRn. Thus G is a map- ping from M into the tGrassmann manifold M n . 2 (R) = SO(n)jSO(2) x SO(n - 2) of oriented planes in R n , which is naturally diffeomorphic to the tcomplex quadric Qn-2 in the complex projective space cpn-1. An immersion f: M -->R n is minimal if and only if its Gauss mapping is antiholomorphic. Let f: M -->R" be a complete orientable minimal surface. Let X be the Euler character- istic and - nC = J M KdA, the total curvature of j(M). Assume that the total curvature is finite. Then the following results of Chern and Osser- man are fundamental: (a) M is conformally a compact Riemann surface M with finite num- ber, say r, of points deleted; (b) C is an even in teger and satisfies C;::, 2(r - X) = 4g + 4r - 4, where g is the genus of M (= genus of M) ; (c) if f(M) does not lie in any proper affine subspace ofR", then C;::'4g+r+n-3;::'4g+n-2;::,n-2 (F. Gackstauer); (d) if f(M) is simply connected and nondegenerate, i.e., its image under the Gauss mapping does not lie in a hyperplane of cpn-l, then C;::, 2n - 2 and this inequality is sharp; (e) when n = 3, C is a multiple of 4, with the minimum value 4 attained only for Enneper's surface and the catenoid; (f) the Gauss mapping G of M extends to a mapping of M whose image G(M) is an algebraic curve in Cp"-1 lying in Qn-2; the total curvature of f(M) is equal in absolute value to the area of G( M) , counting multiplicity; (g) G(M) inter- sects a fixed number of times, say m (counting multiplicity), every hyperplane in CP"-1 except for those hyperplanes containing any of the finite number of points of G( M - M); the total curvature of f(M) equals -2nm. In particular, if a complete minimal surface in R" has k tends, then the total curvature never exceeds 2n(x - k). Enneper's surface and the catenoid are the only two complete mini- mal surfaces in R 3 whose Gauss mapping is one-to-one. If the Gauss mapping of a complete minimal surface of finite total cUrvature in R 3 omits more than 3 points of S2, then it is a plane (Osserman). F. Xavier proved that the Gauss mapping of any complete nonl1at minimal 
1033 surface in R 3 can omit at most 6 points of S2. It is an open question whether this can be improved to 4 points. The Gauss mapping of Scherk's surface omits exactly 4 points of S2. F. Minimal Submanifolds (1) Bernshtein's problem. As was mentioned in Section A, a minimal surface in R 3 represented as the graph of a function on the entire plane R 2 is a plane. This is known as Bernshtein's theorem (1915). This is, however, not true in R 4 , namely, there is a minimal surface defined as the graph of a vector-valued function on the entire plane, which is not a plane. The gen- eralized Bernshtein problem is stated as fo\!ows: Letf:Rn--R be a function satisfying the mini- mal hypersurface equation n 8 ( 1 8 f ) i 8xi W8xi =0, where W = J 1 + I Vf1 2 . Is the graph of fan affine hyperplane? The answer is affirma- tive for n  7 (E. De Giorgi, F. J. Almgren, 1. Simons). For n;::' 8, it is negative (E. Bombieri, De Giorgi, E. Giusti). Though the problem turns out to be negative for n;::' 8, no concrete counterexample is known, even for n = 8. (2) Minimal submanifolds in the sphere. Minimal submanifolds in the unit sphere sn have some special properties. For example, there is no compact minimal submanifold of sn contained in an open hemisphere. On the other hand, there is no stable minimal closed submanifold in sn (B. Lawson, Simons). More generally, any closed minimal hypersur- face in a Riemannian manifold with positive Ricci curvature is unstable (Simons). As for the existence of minimal surfaces in a sphere, the following theorem of Lawson is general: Any closed surface of any genus, except the real projective plane, can be minima\!y immersed into the unit sphere S3. As an analog to Bern- shtein's theorem, a sphere S2, which is mini- mally immersed in S3, is necessarily tota\!y geodesic. On the other hand, by making use of higher-order second fundamental forms and the relations among them, a complete descrip- tion of the minimal immersions of R 2 into sn has been given by K. Kenmotsu. As a general theorem which gives a neces- sary and sufficient condition for a Riemann- ian manifold to be minimally immersed in the unit sphere sn, the following is fundamental: An isometric immersion x of an m-dimensional Riemannian manifold Minto sn, viewed as a vector-valued function into Rn+l, is minimal if and only if tlx = mx (T. Takahashi). As its immediate application, any m-dimensional compact homogeneous space whose linear 275F Minimal Submanifolds isotropy group is irreducible can be minimally immersed into the n-sphere of curvature A/m, corresponding to any eigenvalue ( t= 0) of the Laplacian, where n + 1 is the dimension of the eigenspace corresponding to A (W. Y. Hsiang, Takahashi). Since for the tsymmetric spaces of trank 1 the eigenvalues and the corresponding eigen- spaces of the Laplacian are known, the mini- mal immersions of such spaces into the unit sphere have been determined as we\! as the trigidity of such minimal immersions (N. Wal- lach). For example, if the m-sphere of constant curvature c is minima\!y immersed into the unit n-sphere, but not contained in any great hypersphere, then for each nonnegative integer k, c=m/k(k+m-1) and n+ 1 (2k+m-l)' [(k+m-2)!/k!(m-l)!]. The immersion is rigid if and only if m = 2 or k  3. Similar results have been known for the projective spaces over real numbers, complexes, quaternions, or Cayley numbers. Simons showed that the scalar curvature p of a compact minimal submanifold M of dimension m in the unit (m + p)-sphere is not greater than m(m - 1). Furthermore, if p;::' m(m-l)-mp/(2p-l), then either p=m(m-1) or p=m(m-l)-mp/(2p-l). In the former case, M is tota\!y geodesic and therefore is the unit sphere sm. In the latter case, M is either a hypersurface of the unit (m + 1 )-sphere which i s isometri c to the product Sk(jk7;) X sm-k( J(m - k)/m ) of spheres of radius jk7; and J (m - k)/m, respectively, called the gen- eralized Clifford torus or the Veronese surface in S4. Here the Clifford torus is the torus Sl(J1;2) x Sl(J1;2)cS 3 cR 4 , and the Ver- onese surface is defined as follows: Let (x, y, z) be the natural coordinate system in R 3 and (U1,U2,U3,U4,U5) that in R 5 . The mapping defined by u 1 =yz/j3, u 2 =zx/j3, u 3 =xy/j3, U 4 =(X 2 _ y2)/2j3, u 5 =(x 2 + y2 _ 2Z2)/6 gives an isometric immersion of S2(j3) into S4. Two points (x, y, z) and ( - x, - y, - z) of S2(j3) are mapped into the same point of S4, and thus the mapping defines an embedding of the real projective plane Rp2 into S4. This embedded real projective plane in S4 is called the Veronese surface. T. Otsuki proved the following: Let M be a complete minimal hypersurface immersed in the unit (n + 1 )-sphere with two principal cur- vatures. If their multiplicities are m and n- m;::' 2, then M is congruent to the generaliz ed Clifford torus sm() x sn-m( J (n - m)/n) c sn+l C R n + 2 . If one of their multiplicities is 1, then M is a hypersurface of sn+1 in Rn+2 = Rn x R 2 whose orthogonal projection into R 2 
275 G Minimal Submanifolds is a curve of which the tsupport function x(t) is a solution of the following nonlinear differ- ential equation of the second order: nx(1 -x 2 )W x/dt 2 ) + (dx/dt)2 +(I-x 2 )(nx 2 -I) =0. Furthermore, there are countably many com- pact minimal hypersurfaces immersed but not embedded in S"+1. Only S"-I( J(n-I)/n ) x SI() is minimally embedded in S"+1 This just corresponds to the trivial solution x(t)= 1/ of the above equation. (3) Minimal submanifolds in Riemannian manifolds. In general, it is difficult to deter- mine all the complete stable minimal sub- manifolds in a given Riemannian manifold. If, however, some curvature conditions are given to the Riemannian manifold, then a classifica- tion has been given: Let N be a complete 3- dimensional Riemannian manifold with non- negative scalar curvature p, and let M be a complete, stable, and orientable minimal sur- face in N. (a) If M is compact, then either M is confor- mally equivalent to the tRiemann sphere S2 or else it is a totally geodesic flat torus. Fur- thermore, if p > 0, then the latter case never occurs. (b) If M is not compact, then M is confor- mally equivalent to the complex plane C or a cylinder (D. Fischer Colbrie; R. Schoen). As a generalization of the procedure for generating periodic minimal surfaces in R 3 with octahedral or tetrahedral symmetry (Schwarz, 1867), T. Nagano and B. Smyth gave a construction procedure to generate periodic minimal surfaces in R" or n-tori with sym- metry corresponding to any tWeyl group of rank n. G. Minimal Varieties Recent progress in the study of Plateau's prob- lem in higher dimensions is closely related to the point of view of geometric measure theory and tthe calculus of variations. (1) Integral currents. Let T be a tcurrent of degree k, or simply a k-current, defined on an open set U of R n , and let aT be the boundary of T, the exterior derivative of T. For simplic- ity, in the following only currents with com- pact support are considered. The mass M(T) of a k-current T is the dual norm M(T)= sup{ T(q» I M(q» I} of the comass M(q» on k-forms, which is defined by M(q»=sup{IIq>(x)lllxEU} with 11q>(x)11 = sup{ (q>(x), VI 1\ ... 1\ v k ) I VI, ..., V k are ortho- normal vectors at XE U}. It follows that M(T) coincides with k-dimensional volume when T 1034 is a k-current defined by integration on a k- dimensional simplex. We say that T is normal if M(T) and M(aT) are finite. Normal cur- rents form a tBanach space with norm N(T)= M(T)+M(aT). A current T is called rectifiable if it can be approximated in the mass nOrm by currents of type f*y, where y is a finite polyhedral chain with integer coefficients, f:suppy->U is a tLipshitz mapping of the support of y into U, and f*y is the current defined by means of f*y(q» = y(f*q». If both T and aT are recti- fiable, T is called an integral current. Inte- gral currents give the appropriate notion of generalized manifolds with boundary in the study of higher-dimensional Plateau problems. One of the fundamental properties of in- tegral currents is the compactness, stated as follows: Given a compact subset K c U and a number c > 0, the set of integral currents T such that suppTcK and N(T)c is sequen- tially compact in the tweak topology. Since the mass M(T) is tlower semicontinuous in the weak topology, it follows that Plateau's problem can always be solved in the space of integral currents ( Section C; 334 Plateau's Problem). The question arises: To what extent is this a satisfactory solution to the problem? For codimension 1, it is known from the work of Federer and others that an integral (n - 1)- current of least mass in R" is nonsingular in codimension < 7. In particular, in R", n  7, every integral (n - 1 )-current ofleast mass is an analytic manifold. In general codimensions, it is known that the set of regular points is dense (Reifenberg, Almgren. and others). The definition of rectifiable and integral currents carries over to those on a Riemann- ian manifold M in a natural way. Then the space of integral currents I*(M) on M with the boundary operator a, for which a 2 = 0, form a chain complex. It is then a fundamental theorem, due to Federer and Fleming, in homological integration theory that there is a natural isomorphism H*(I*(M))-::;;H*(M;Z) of the homology of the complex of integral cur- rents with the integral singular homology groups of M. From this it follows that if M is a compact Riemannian manifold M, then each class ClEHp(I*(M))-::;;Hp(M;Z) contains an integral current ofleast mass among all in- tegral curren ts in Cl. For a homology class of codimension 1, it has also been proved by Almgren that if M is a compact Riemannian manifold of dimension  7, then for each ClEH n _ 1 (M; Z) there exists a finite collection of mutually disjoint, compact oriented minimal hypersurfaces Sj, ..., SI em- bedded in M and integers n 1, ... ,n l such that the integral current L;1 njs j represents Cl. On the other hand, it has been recently proved 
1035 that every compact Riemannian manifold of dimension";; 6 contains a nonempty closed embedded minimal hypersurface (1. T. Pitts). (2) Varifolds. The theory of integral currents developed by Federer, Fleming, and others yields reasonable spaces for purposes of the calculus of variations. However, it is not en- tirely feasible to use these in the study of the actual soap films that occur in physical ex- periments. It turns out that we should work in a more set-theoretic fashion and give up the notion of orientability and the boundary operator. A convenient theory for describing these physical experiments is the theory of varifolds developed by Almgren. A k-dimensional varifold On a Riemann- ian manifold M is a tRadon measure on the bundle Gk(M) of unoriented tangent k-planes of M. For simplicity, only varifolds with com- pact support are considered. These are re- garded as continuous linear functionals on the space C(GdM)) of continuous functions on Gk(M). Thus a k-dimensional submanifold S of finite volume embedded in M determines a varifold V s by integration: Vs(f) = Lf(TxS)d£,k(X), fEC(GdM)), where £'k denotes the k-dimensional tHaus- dorffmeasure on M. More genera11y, to any rectifiable current there corresponds an underlying varifold obtained by neglecting orientations. Let Vk(M) denote the space of k-dimensional varifolds on M. Given a varifold VE Vk(M), the mass M(V) is defined to be the V-measure of the total space, i.e., M (V) = V( 1). Let f: M -> M be a C 1 -mapping. Then f induces natura11y a mapping f* of Vi(M) into itself. In particular, if X is a smooth vector field on M with asso- ciated flow 1" then there is a smooth function M(t)=M(1,. V). A varifold VE J;k(M) is then called a k-dimensional minimal variety in M if the first variation (djdt)M(t) I '=0 = 0 for a11 smooth vector fields on M. For example, minimal submanifolds, complex analytic sub- varieties of Kahler manifolds and integral currents of least mass are minimal varieties. An appropriate notion of rectifiable and integral varifolds is defined, and an analog of the compactness theorem can be obtained. As a consequence, it was proved by Almgren using tMorse theory methods that if M is an n- dimensional compact Riemannian manifold, then for each p, 1 ,,;;p";;n-l, there exists at least one minimal variety of dimension p. As for the regularity of minimal varieties, it is known that if V is a k-dimensional variety, then in the support of V there is a relatively open dense subset that is a regular minimal submanifold of dimension k (W. K. A11ard). 275 Ref. Minimal Submanifolds H. Maximal Hypersurfaces in Minkowski Space Let Ln+l be a Minkowski space, i.e., Ln+1 = {(xj, ...,x",t)l(xj, ...,xn)ER",tER} with the tLorentzian metric L7=1 (dX;)2 _(dt)2. Let M be a tspacelike hypersurface of L"+1 so that the induced metric on M is Riemannian. If the mean curvature vector field H of M, defined in the same way as in the Riemannian case, vanishes identica11y, then M is ca11ed maximal. In contrast to the Riemannian case, by the variation in the normal direction of H, the volume increases, provided H #0. However, the equation describing a maximal hyper- surface is similar to that for the Riemannian case ( Sections D (1), F (1)). Indeed, let D be a domain in R n andf:D->R a smooth func- tion. Then the graph of fin Ln+1 is a maximal hypersurface if and only if f satisfies the fol- lowing quasilinear partial differential equation of the second order: n 02 f "o if O ij 02 f (l-Wfl l ) I ----,:+ L ---=0. i1 OX i i.j=1 OX i OX j OxJ}Xj This is te11iptic for IVfl < 1, since M is space- like. E. Calabi proposed a problem similar to Bernshtein's in 1968. Contrary to Bernshtein's case, the answer is affirmative for any n. Namely, a maximal hypersurface in L"+1 de- fined as the graph of a function on the entire space R" is a hyperplane in Rn+1. More gener- a11y, a maximal hypersurface, which is a closed set in L"+1, is a hyperplane (S. Y. Cheng and S. T. Yau). It is also known that any maximal hypersurface in L"+l is stable. References [1] F. J. Almgren, Plateau's problem, An invitation to varifold geometry, Benjamin, 1966. [2] R. Courant, Dirichlet's principle, COn- formal mapping, and minimal surfaces, Inter- science, 1950. [3] H. Federer, Geometric measure theory, Springer, 1969. [4] H. Federer, Colloquium lectures on geo- metric measure theory, Bu11. Amer. Math. Soc., 84 (1978) 291-338. [5] D. Hoffman and R. Osserman, The geome- try of the generalized Gauss map, Mem. Amer. Math. Soc., 236 (1980). [6] H. B. Lawson, Minimal varieties, Proc. Symp. Pure Math., 27 (1975), 143 175. [7] H. B. Lawson, Lectures on minimal sub- manifolds I, Publish or Perish, 1980. [8] C. B. Morrey, Multiple integrals in the calculus of variations, Springer, 1966. 
276 A Model Theory [9J J. c. C. Nitsche, On new results in the theory of minimal surfaces, Bull. Amer. Math. Soc., 71 (1965), 195270. [IOJ 1. C. C. Nitsche, Vor1esungen i.iber Mini- mall1achen, Springer, 1975. [l1J R. Osserman, Minimal varieties, Bull. Amer. Math. Soc., 75 (1969), 1092 1120. [12J R. Osserman, A survey of minimal sur- faces, Van Nostrand, 1969. [13J T. Rad6, On the problem of Plateau, Springer, 1933. [14J A. 1. Tromba, On the number of simply connected minimal surfaces spanning a given curve, Mem. Amer. Math. Soc., 194 (1977). [15J Minimal submanifolds and geodesics, Proc. Japan-US Seminar, Tokyo, 1977, North- Holland, 1979. [16J S. Nishikawa, On maximal spacelike hypersurfaces in a Lorentzian manifold, Nagoya Math. J., 95 (1984), 117-124. [17J O. Kobayashi, Maximal surfaces with conelike singularities, 1. Math. Soc. Japan, 36 (1984),609-617. 276 (1.6) Model Theory A. Language Every mathematical theory has an appro- priate language. To determine a language for a theory means to determine a language for the related mathematical system. Such a language consists of the fo11owing symbols (the symbols given here are examples of only one notational system). (1) Symbols that express logical concepts (t1ogical symbols): V, 3, I, 1\, v, -->; (2) tfree variables: ao, aI, a2, ...; (3) tbound variables: x o , XI' X 2 ,...; (4) symbols that denote individual objects (individual constants): Co, C I. C 2 , ..., C, ...; (5) tfunction symbols: fO,[1 ,[2, ... ,fa, ...; (6) tpredicate symbols: Po, P J , P 2 , ... , Po. .... The tcardinalities of the sets of symbols in (4), (5), and (6) are arbitrary, except that there must be at least one predicate symbol. It is assumed that each set of symbols is twe11 ordered. Also, it is understood that to each fj in (5) there corresponds a positive integer i j , while to each  there corresponds a nonnega- tive integer (these integers are called the num- ber of arguments of fj and , respectively). In practice, other kinds of languages are also dealt with. One example is a system with infinitely long expressions, which permits 1036 ttransfinite ordinals for the numbers of argu- ments of fj and , and which has the extended concepts !\a<P' Va<p, 3x 1 ... 3xa ... for ex < {J, and VXJ VX 2 ... VX a ... for ex < {J, where ex and {J are transfinite ordinals. Another language includes variables of higher ttype as well as V and 3 over those variables. Free and bound variables may not be distinguished typograph- ical1y. In that case a variable not bound by V or 3 in a tformula is ca11ed free. (The notion of a formula is defined later.) To simplify this discussion, however, we restrict ourselves to the tfirst-order predicate language with a typographic distinction between free and bound variables. We also assume that there are only a countable number of variables, and hence we use natural numbers as subscripts. Set LJ = {logical symbols}, L 2 = {aO,a I. a 2 ,...}, L3={XO,XPX2''''}' L 4 ={co,c l , c 2 ,...}, L s ={fo,ft,f2,"'}, L 6 ={P O 'P I. P 2 ,...}, L=(LJ,L2,L3,L4,Ls,L6)' To determine a language L is to specify such a list L. Since V, 3, I, 1\, v, -->, are normalIy used for L1' ao, at. a 2 ,... for L 2 , and X o , XI. X 2 ,,,. for L3' we may assume that these are fixed, and hence to determine a language is to determine (L 4 , Ls, L 6 ). We take (L I. L2' L 3 ) Just described and assume an arbitrary but fixed L= (L 4 , Ls, L 6 ). First we define the notions term of L (or L-term) and formula of L (or L-formula). Definition of the terms of L (L-terms): (1) Each free variable a j is an L-term. (2) Each individual constant c j of L is an L-term. (3) If jj is a function symbol of L, ij is the number of arguments of fj, and each of t 1, '" ,t i is an L- term, then fj(t I. ... , t i ) is also an L- trm. (4) The L-terms are only those constructed by (1) (3). A term that does not contain a free variable is cal1ed a closed term. Definition of the formulas of L (or L- formulas): (1) Let  be a predicate symbol of L and ij be the corresponding natural number. If each of t I. ... , t i is an L-term, then (t I' "', t i ) is an L-formula This type of formula is called a prime formula (or atomic formula). (2) If A and Bare L-formulas, then each of I(A), (A) 1\ (B), (A)v(B), and (A)-->(B) is an L- formula. (3) Let F be an L-formula and Xi be a bound variable that does not occur in F. Then an expression obtained by putting ( ) around F, replacing some occurrences in F of a free variable, say a j , by Xi> and prefixing VX j or 3x i is an L-formula. (4) The L-formulas are only those constructed by (1)(3). A formula that has no occurrence of a free variable is cal1ed a closed formula. The paren- theses used in the formation of a formula may be omitted if no ambiguity arises thereby. 
1037 B. Structures Let L be a specific language as described in the previous section. Then Wl = [M : p; a; T] defined by (1)-(4) below is called a structure for L (or L-structure). (1) M is a nonempty set. (M is called the universe of Wl.) (2) p is a mapping from L4 into M. (3) Let L = Ui I the number of arguments offjis i}. Then Ls=LUqU...ULU... provides a partition of Ls. Let i be the set of all mappings from M i = M x... x M (i times) into M and a i be a mapping from L into 6i' We define a for an arbitrary f of Ls by a(f) =ai(f), where i is the number of arguments of .f Then a is obviously a mapping from Ls into Ur;16i' (4) Decompose L6 into L U L...U L U... as in (3). Let Pi be the set of all subsets of M i and 'i be a mapping from L into Pi' where Po is the set {M, 0} (0 is the empty set). Then T is defined for every i and for an arbitrary P of L by T(P)=Ti(P), Ifwe denote p(c) by c, aU) by 1, and T(P) by P, then we may understand that p is repre- sented by Co, c 1 , ..., a is represented by 10' 11' "', and T is represented by Po, fir, .., . Therefore Wl is normally expressed as Wl= [M: Co, c 1 , ... ;JoJr....; Po, fir,...]. C. Satisfiability We fix not only a language L but also a struc- ture Wl for L. Then the property that an L- formula is satisfiable is defined by the follow- ing procedure: Let m, n, '.. stand for tsequences of the elements of M, say (mO,m1' ...), (no,nJ, ...), ..., called WI-sequences. We write m='=n to indi- cate that each entry of m except the ith one is equal to the corresponding entry of n. Using these concepts, the value of an L-term at an WI-sequence m, denoted by t[m], is defined as follows: (1) If t is a free variable a j , then t[m] = m j . (2) If t is an individual constant c j , then t[m]=c j . (3) If t is of the form fj(tr."', t,), then t[m] = (t 1 em], ..., t,em]). If t is an L-term, then evidently t[m] is an element of M. Based on this definition of t[m], the relation A is satisfiable by m in Wl, denoted by Wl, m 1= A, is defined for an arbitrary L-formula A and an arbitrary Wl-sequence m as follows: it) Wl, m 1= 1J(t 1, ..., t,)-= (ti[m], ..., ti[m] > EPj' (2) Wl, m 1= I B -= WI, m 1= B is false. 276D Model Theory (3) Wl, m 1= B 1\ C -= WI, m 1= B and WI, ml=C. (4) WI, m 1= Bv C -=Wl, m 1= B or WI, 1111= C. (5) Wl, m 1= B->C-=\JJl, m 1= B implies Wl, 1111= C. (6) Wl, m I=VxjF(xj)-=Wl,nI=F(a,) for an arbitrary n that satisfies mn, where a i has the least index among the free variables that do not occur in F(xJ (7) Wl, m 1= 3x j F(x j )-=there exists an n such that n  m and Wl, 111= F(a,), where a i satisfies the same condition as in (6). Following are some consequences of this definition. (1) For an arbitrary L-formula A and an arbitrary Wl-sequence m, exactly one ofWl, m 1= A and Wl, m 1= I A holds. (2) Let a j " ... ,aj; include all free variables that occur in A, and let m and n be Wl- seq uences for which m h = n j " "', mj; = nji' Then Wl, m 1= A and Wl, n 1= A are equivalent. So we may write Wll= A [ aj" ..., a ji ] instead m j1 ,...,mh ofWl, ml=A, for any formula A whose free variables are among a h , .,. ,a ji , and any Wl- sequence m. (3) If A is a closed formula, then for an arbitrary pair of sequences m and 11, Wl, m 1= A and Wl, n 1= A are equivalent. Therefore, for a closed formula A, we may express the state- ment "for some (or, equivalently, for all) Wl- sequence m, Wl, m 1= A holds" by Wll= A. (4) Let a i be an arbitrary variable that does not occur in VxF(x) or 3xF(x). Then Wl, 1111= VxF(x) is equivalent to Wl, n 1= F(a,) for an arbitrary n such that n ='= m. Likewise, Wl, m 1= 3xF(x) is equivalent to the statement that there is an n such that n ='= m and Wl, nI=F(a;). D. Models Here again we fix a language L. Let A be a closed L-formula and Wl an L-structure. If Wll= A, then Wl is called a model of A. Furthermore, if r = {Ao, AI, ....} is an arbi- trary set of closed formulas and Wll= Ai for all Ai in r, then the structure Wl is called a model of f. (1) Consistency. Consider a logical system whose language is L. If there is a model of the set of all provable closed formulas of the sys- tem, then the system is tconsistent. In partic- ular, the tfirst-ordered predicate calculus is consisten t. (2) Completeness. A logical system is said to be complete if every closed formula that is satisfied in every structure is provable in the 
276 E Model Theory system. In particular, the first-order predicate calculus is complete. K. Godel proved (2). Later L. Henkin gave an alternative proof whose essential idea con- tributed to proving the following proposi- tion: If a set r of closed L-formulas is con- sistent, then there is a model of r. Henkin also introduced a (nonstandard) second-order semantics, relative to which the tsecond- order predicate calculus is complete. This can be shown by extending Henkin's tech- nique (for the first order) to the second-order language. (3) Here we extend the language slightly by adding the second-order free predicate vari- ables a., a. 2 , ...,a.,... (n= 1,2, ...) and the second-order bound predicate variables cp, CP2' ...,cp,... (n= 1,2, ...), where n indicates the number of arguments' of a variable. Other- wise the definition of the language is the same as for the case of the first-order predicate calculus. For simplicity, however, we assume that there are no individual constants, function symbols, or predicate symbols. The structure is defined as follows: Put Wl= [M: 5 1 ,5 2 , ..., 5n, ...], where M is a nonempty set and 5n is a set of subsets of M x ... x M (n times). An Wl-sequence m is defined as before, and Sn denotes (s, S2' "', s7, ...), where each s7 is a member of 5n. The concept of satisfiability is defined as follows: Wl, (m, SI,"', sn,"') pa.)(x i "... ,X i )= (mil"'" mi)ES j ". illl, (m,sl' ...,S., ...)pVcprA(cpr)=for an arbitrary s for which s b Sn' Wl, (m, SI' "', s,...) p A (a.J), where a.) has the smallest index among the free predicate variables that do not occur in A(cpr). illl, (m,sl' "',Sn, ...) p3cprA(cpr)=there exists an s such that s b Sn and Wl, (m, sj, ''', S, , ...) p A (a.j), where a.j satisfies the same condition as in the previous clause. Satisfiability for other cases is defined as for first-order predicate language. A structure Wl is called normal if all axioms of the second- order predicate calculus are true in Wl. Com pleteness of the second-order predicate calculus: Every closed formula that is satisfi- able in all normal structures is provable in the second-order predicate calculus. (4) Let the cardinality of L4 be T, and r be an arbitrary set of closed L-formulas. If r has a model, then r has a model of cardinal- ity max (T, X o ). This follows from Henkin's method. Historically, however, it was first proved by Th. Skolem and L. Lowenheim for a special case, and was later generalized by A. I. Mal'tsev and A. Robinson. (5) The following results are all due to A. Tarski and R. L. Vaught. 1038 Definition 1. Two L-structures Wl and 91 are said to be elementarily (arithmetically) equiva- lent if for an arbitrary closed L-formula A, WlpA=91pA. Definition 2. Let Wl = [M: qo, q1, ...; go, gj, ...; Qo, Q1' ... ] and 91 = [N: ro, r 1 , .., ; h o , hI, ... ; Ro, R1' ... ] be two structures. WI is an elementary exten- sion of 91 if the following two conditions are satisfied: (i) M::::> N; qj= r j (j = 0,1, ...); the restriction of gj to N is identical to h j (j = 0,1, ...); the restriction of Qj to N is identi- cal to Rj (j = 0,1, ...). (If this condition holds, then Wl is said to be an extension of 91.) (ii) For an arbitrary L-formula A and an arbitrary 91- sequence n, if 91, n p A then Wl, n p A. Theorem 1. Let Wl be an extension of 91. A necessary and sufficient condition for Wl to be an elementary extension of 91 is that for an arbitrary L-formula of the form 3xF(x) and an arbitrary 91-sequence n, if Wl, n p 3xF(x), then there is some element n of N such that for the Wl-sequence m for which m'; nand m i = n, Wl, m p F(aJ, where ai is an arbitrary free variable that does not occur in F(x). Theorem 2. Here we place a condition on L that each set of symbols be at most countable and arranged in the (I)-type ( 312 Ordinal Numbers). Let the cardinality of the universe M of Wl be an infinite cardinal a, M' be a subset of M of cardinality c, and b be an in- finite cardinal that satisfies c  b  a. Then there exists an L-structure 91 whose universe N has cardinality b and such that M' c Nand Wl is an elementary extension of 91. Theorem 3. Suppose that L satisfies the same condition as in Theorem 2. Let the cardinality of the universe M of Wl be a (a is an infinite cardinal) and b be a cardinal for which a  b. Then there exists an L-structure 91 that is a proper elementary extension of Wl and whose universe has cardinality b. E. Ultraproducts Assume that for a set of L-structures I and a set of indices I, there is a mapping 8 from I onto I. If a. is a member of I, Wl is a member of I, and 8(a.) = Wl, then Wl may be denoted by Wl". It should be noted that there may be more than one a. corresponding to the same structure. If D is a tmaximal filter of I and Wl" 
1039 is expressed as 9Jl"=[M":c",...:]",...: P",...], then 11.EI M" is defined by Il M" = {I.{J II.{J is a mapping from "E1 [into U M", where I.{J(O:)EM}. "El For any two elements I.{J and if; of1LE/M", I.{J g if; is defined by D I.{J= if;=-{o: I I.{J(O:) = if;(o:)} ED. Then I.{J g if; is an equivalence relation between the elements of IT"EI M". Furthermore, the set IT"EI M partitioned by g is expressed by IT"EIM"ID, and each element m ofIT"ElM"ID is expressed by m= [I.{J], where I.{J is a repre- senting element of m. Next we define an operator that produces a new structure from I. Put M = IT"EI M"ID. For an individual constant c of L, let c = [I.{J], where I.{J(O:) = c" for every 0:. For an n-ary function f of L and arbitrary elements m l = [I.{JI], ..., m n = [l.{Jn] of M, define [(ml' ..., mn)=[i/;], where if; (0:) = !" (I.{J 1 (0:), ... ,l.{Jn(O:)) for every 0:. For an n-ary predicate P of L, define (m1, ...,mn)EP by (m 1 ,...,m n )EP =- {o: I (I.{JI (0:), ..., I.{Jn(O:) E PO} E D. According to these definitions, put 9Jl=[M:c, ...,], ...,P, ...], and denote it by IT"E19Jl"ID, called the ultra- product of {WI"} "EI (with respect to D). 9Jl is an L-structure. Fundamental Theorem of Ultra products. Let 9Jl = IT"EI 9Jl" I D be the ultraproduct of {9Jl"}"E/, m=(mj,m2' ...) be an 9Jl-sequence, l.{Ji be a representing element of mi, and A be an arbitrary formula. Then 9Jl, m f= A=- {o: 19J1", (I.{JI (0:), 1.{J2(0:), ...) f= A} ED. By using this fundamental theorem we have the following result: Suppose that r is a set of closed formulas in L such that every finite subset of f has a model. Let [ be the set of all the finite subsets of r. For each 0: EI and each A E r, let 9Jl" be a model of 0: and A the set of all the finite sets in I which contain A as a member. Let F= {AlAE r}. Since F has the finite intersection property, there is a maximal filter D such that D ;1 F. Let 9Jl be the ultra- product IT"E/9Jl"ID. Since A is a subset of the set {o: 19J1" f= A} and A belongs to D, the set {o: 19J1" f= A} belongs to D, for each A E r. Hence we have that M is a model of r, by the foregoing fundamental theorem of ultra- 276 E Model Theory products. Therefore, we have the following theorem. Compactness Theorem. A set r of closed for- mulas has a model if and only if every finite subset of it has a model. In the case where all the structures 9Jl" coincide with the single structure 91, the ultra- product of {9Jl"}"d (with respect to D) may be written 91 1 1 D and called the ultrapower of 91 (with respect to D). Let 9Jl=[M; qo, qj, ...; 90' 91, ...; Qo, QI' ...] and 9"1=[N; f o , fj, ...; ho, hI' ...; R o , R I , ...] be two structures. 9Jl and 91 are said to be isomorphic if there is a bijection f from M to N such that the following three conditions hold: (i) f(qO)=f o ,f(ql)=f 1 , .... (ii) The se- quences 90, 91'''' and h o , hI"" are of the same type andf(9j(a l , ..., an)) = hj(f(ad, ..., f(a n )) holds for every n-tuple aI' ..., an in M. (iii) The sequences Qo, Q I' ... and R o , R j, .. . are of the same type and Rj = {(f(a 1 ), ..., f(a n ) I (a l ,... ,an> EQj}' Letj be the function from N to NIID de- fined by j(a) = [l.{Ja] for each aE N, where l.{Ja is the constant function from I to N such that l.{Ja( 0:) = a for each 0: E I. Let 9Jl be the substruc- ture of 91 liD whose universe is the range ofj. Thenj is an isomorphism of 91 to 9Jl. In the foI1owing we identify a and j(a) for each a E N. Then 91 is an elementary substructure of 91 l iD by the fundamental theorem of ultraproducts. If 9Jl and 91 are isomorphic, then 9Jl and 91 are elementarily equivalent. By using this fact and the fundamental theorem of ultraproducts, we have the following result. Let WI and 91 be two structures. If there is a nonempty set I and a maximal filter Don [ such that 9JlIID and 91 I I D are isomorphic, then 9Jl and 91 are elementarily equivalent. H. J. Keisler proved the converse of this proposition by using the G.C.H. (generalized continuum hypothesis), and later S. Shelah proved it without the G.C.H. Keisler-Shelah isomorphism theorem; Let 9Jl and 91 be two structures. Then 9Jl and 91 are elementarily equivalent if and only if there is a nonempty set I and a maximal filter D on I such that I)JlIID and 91 I ID are isomorphic. The ultraproduct operation has various applications in number theory, algebraic geo- metry, and analysis. Here we give an example due to J. Ax and S. Kochen. Let P be the set of prime numbers. Let Qp and Zp((t)) be the field of p-adic numbers and the field of formal 
276F Model Theory power series over Zp = {O, 1, ..., p - I} for each pin P, respectively. Ax-Kochen isomorphism theorem: Suppose that D is a nonprincipal maximal filter on P. Then I1 PEp Qp/D and I1 pEp Z p ((t))/D are isomorphic. As an immediate consequence of this theo- rem, we have the following partial solution of Artin's conjecture on Diophantine equations. Theorem: For each positive integer d, there exists a finite set Y of primes such that every homogeneous polynomial f(t 1, ... , tn) of degree d over Qp, with n> d 2 , has a nontrivial zero in Qp for every p rt Y ( 118 Diophantine Equations). We give another example in nonstandard analysis. A. Robinson developed the general theory of nonstandard analysis in [10]. Here we explain a theorem due to A. R. Bernstein. Let X be a nonempty set and U(X) be the smallest transitive set (i.e., aE b and bE U(X) imply aE U(X)) which has X as a member and is closed under the following operations: pair- ing, union, power set, and subset operation (i.e., aEU(X) and be a imply bEU(X)). Let L be the first-order predicate logic with equality whose set of nonlogical constants consists of a binary predicate symbol E and individual constant symbols C a for aE U(X) ( 411 Sym- bolic Logic F). Then the first-order structure \H = [U (X); a(a E U (X)); EJ is an L-structure, where E is the E relation on the set U(X). Let Wl= [U(X)I/D:a(aE U(X)); e/DJ be the ultra power 'RI/D of'll with respect to a non- principal ultrafilter D on a set I. For each aE U(X), let a* be the set of all elements [cpJ in U(X)I/D such that {iE1Icp(i)Ea}ED. Then a is a proper subset of a* if a is infinite. Since'll and Wl are elementarily equivalent, these two sets a and a* have common first-order prop- erties in the following sense: for each formula <1>( a o ) in L, (Vb Ea)('R F <1>[goJ)= (Vb Ea*)(Wl F <1>[goJ). From this it follows that if r is a relation on a set a in'll, then r* is a relation on the set a*; and if f is a mapping from a to bin'll, then f* is a mapping from a* to b*. Hence a* is a mathematical object which greatly resembles a. By using this type of resemblance between a and a* we have the following result. Let H be a Hilbert space over the complex number field C such that dim(H)=w and let T be a bounded linear operator on H. Let X = H U C and consider the first -order structure Wl as above. Since R (the set of all real num- bers) and N (the set of all natural numbers) are infinite sets which belong to U(X), R* and N* have elements which do not belong to Rand N, respectively. Such elements are called non- standard real numbers and nonstandard natural numbers, respectively. By the fundamental 1040 theorem of ultraproducts we can conclude that there are many nonstandard real numbers rx such that O<*rx<*a in R* for any aER. Such a nonstandard real number rx is called an infinitesimal real number. Since the norm operator II II is a mapping from H to R, II II * is a mapping from H* to R*. If II xii * (x E H*) is infinitesimal, then x is said to be infinitesimal in H*. Let S be the set of all linear subspaces of H. For a linear subspace K of H* that is contained in S* let KO be the set of elements xEH such that x-x o is infinitesimal for some Xo in K. Then KO is a closed linear subspace of H. Let e = {e;} iEN be an ortho- normal basis of the Hilbert space H; e can be considered as a mapping from N to H, and hence e* = {eJjEN' is a mapping from N* to H*. For eachjEN*, let Hj be the linear sub- space of H* spanned by {e k I k j}. For a given bounded linear operator T on H, T* is a linear operator on H*. We define 7]=  T*, where  is the projection from H* to Hj' Since dim(HJ=j is a (nonstandard) natural number, there exists a tower J OJ e J Ij e ... e J jj = Hj of closed, 7]-invariant linear subspaces of Hj such that dim (J kj ) = k(k j). Then J kj ° is a closed, T-invariant linear subs pace of H. If there is a polynomial p(x) such that p(T) is a compact operator, then we get a nonstandard natural number j such that J kj ° is a proper subspace of H for some k j. This gives the follow- ing result, which is an affirmative solution of a problem of K. Smith and P. R. Halmos. Theorem (Bernstein [4J): Let T be a bounded linear operator on an infinite-dimensional Hilbert space H over the complex numbers and let p(x) t= 0 be a polynomial with complex coefficients such that p(T) is compact. Then T leaves invariant at least one closed linear subspace of H other than H or {O}. F. Categoricity in Powers Let f be a set of closed formulas in a first- order language L which has a designated binary predicate symbol Po' In the following, we assume that the interpretation Po of Po by Wl is the equality relation on the universe of Wl for every L-structure Wl. f is said to be categorical if all the models of f are iso- morphic. By Theorem 3 in Section D, any f having a model of infinite cardinality is not categorical. Hence, there exists no interesting f which is categorical. Therefore, we consider the weaker notion of categoricity in powers. Let K be an infinite cardinal and n(f, K) be the number of nonisomorphic models of f of cardinality K. Then f is said to be categorical in Kif n(f, K) = 1, i.e., if all the models of f of cardinality K are isomorphic. There exist many 
1041 interesting r's which are categorical in K for some K. For exam pIe, the set of axioms of algebraically closed fields of characteristic 0 is categorical in  1 , and the set of axioms of dense linear orderings without endpoints is categorical in o' With respect to this notion, J. Los conjectllred that if r is categorical in K for some K> I (the cardinalit y of L), then r is categorical in K for all K> L. This conjecture was slved affirmatively by M. Morley in the case L = o' and later by S. Shelah in the general case Theorem (1): Let r be a set of closed formulas_in L. Then r is categorical in K for some K > L if and only if r is categorical in K for all K > L. We also have the following interesting theorem, due to J. T. Baldwin and A. H. Lachlan. Theorem (2): Let r be a set of closed formulas in L such that L = o' If r is categorical in 1' then n(r, o) = 1 or o' As for n(r, K) there are two famous conjec- tures due to R. Vaught and others. A set r of closed formulas in L is called complete in L if it has a model and, for any closed formula A in L, either AEr or IAEr. Conjecture 1. There is no complete set r of closed formulas in L such that o < n(r, o) < 2"°. Conjecture 2. There is no finite set r of closed formulas in L such that n(r, o) = o and n(r, 1)= 1. G. Omitting Type Theorem By ao-formulas (or a-formulas), we mean for- mulas which have no free variables except aD. Suppose that 91/ is an L-structure and I is a set of a-formulas in L. Then, 91/ realizes I if there is an element m of the universe of 9Jl such that 9Jl F A [::'J for all A in I, and 9Jl omits I if 9Jl does not realize £. For example, if L is a first-order language such that L4 = {co, C I}, Ls={/o, I'}, L 6 ={P O ,P 1 }, where/o,ft. Po, PI are all binary, and 91 is the L-structure [9/;0, I; +,x; =, <J, where 91 is the set of natural numbers, and I= {PI (n,a) I n=O, 1, 2,...}, whereO=co,T=/o(O,cd,..., n+ I = lo(n, c 1)' .,. , then clearly 91 omits £. Also, if WI is an L-structure, m is an element of the uni- verse of WI, and £ m = {A I A is an a-formula such that 91/ F A [::'J}, then clearly 9Jl realizes £m' where Im is called the type ofm in WI. Suppose that r is a set of closed formulas in L. Then, by the completeness theorem of L, we can easily see that r has a model realizing £ if and only if r U I is consistent. On the other hand, it is rather difficult to obtain a necessary and sufficient condition for r to have a model omitting £. The following is a sufficient con- 277 A Modules dition: r is said to locally omit £ if there is no a-formula A such that P{A} is consistent and, for any formula B in £, r u {A, I B} is not consistent. Theorem. Suppose that r is a consistent set of closed formulas in a countable language L, and £ is a set of ao-formulas in L. If r locally omits I, then r has a countable model which omits I. Also, if r has a model of power greater than ::1" omitting £ for each  < (J) 1 ' then r has a model omitting £ in each infinite power, where ::1" is defined by ::10 = o, ::1,,+1 = 2', ::1a = sUP"<a::1,, if (J is a limit ordinal. References [IJ J. Ax and S. Kochen, Diophantine prob- lems over local fields I, Amer. J. Math., 87 (1965),605-630. [2J J. Ax and S. Kochen, Diophantine prob- lems over local fields II, Amer. 1. Math., 87 (1965), 630-648. [3J 1. Ax and S. Kochen, Diophantine prob- lems over local fields III, Ann. Math., (2) 83 (1966),437-456. [4J A. R. Bernstein, Non-standard analysis, Studies in Model Theory 8, M. D. Morley (ed.), Mathematical Association of America, 1973,35-58. [5J C. C. Chang and H. J. Keisler, Model theory, North-Holland, 1973. [6J S. C. Kleene, Introduction to metamath- ematics, Van Nostrand, 1952. [7J M. Machover and J. Hirschfelt, Lectures on non-standard analysis, Lecture notes in math. 94, Springer, 1969. [8J M. Morley, Categoricity in powers, Trans. Am. Math. Soc., 114 (1965), 514- 538. [9J M. Morley, Omitting types of elements, The Theory of Models, J. W. Addison et al. (eds.), North-Holland, 1965,265-273. [lOJ A. Robinson, Non-standard analysis, North-Holland, 1966. [IIJ G. E. Sacks, Saturated model theory, Benjamin, 1972. [12J S. Shelah, Classification theory, North- Holland, 1978. 277 (111.23) Modules A. General Remarks In this article, we consider mainly modules with operator domain (Section C), in partic- ular modules over a tring. Modules over a 
277 B Modules field are linear spaces ( 256 Linear Spaces). Modules over a commutative ring are impor- tant in algebraic geometry ( 16 Algebraic Varieties, 67 Commutative Rings, 284 Noe- therian Rings). The theory of modules over a tgroup ring can be identified with the theory of linear representations of a group ( 362 Representations). Modules without operator domain may be regarded as modules over the ring Z of rational integers, and the theory of finitely generated Abelian groups can be gen- eralized to the theory of modules over a tprin- cipal ideal domain ( 52 Categories and Functors, 200 Homological Algebra). B. Modules A module (without operator domain) is a tcom- mutative group M whose law of composition is written additively: a + b = b + a (a, bE M); the tidentity element is denoted by 0, and the inverse element of a by -a. Every subgroup H of M is a normal subgroup. For any aE M, the left and right cosets of H containing the ele- ment a are identical: H +a=a+ H ( 190 Groups A). In the set N M of all mappings of a set M to a module N, we define an addition by the sums of values: (f + g)(x) = f(x) + g(x). Then N M forms a module. The set Hom(M, N) of all homomorphisms of a module M to a module N forms a subgroup of the module N M , called the module of homomorphisms of M to N. The composite of homomorphisms is a homomor- phism. Hence the set Hom(M, M) = t&'(M) of all endomorphisms of M forms a tring with re- spect to the addition and the multiplication defined by composition; this is called the endo- morphism ring of M. The tunity element of t&'(M) is the identity mapping of M, and the tinvertible elements of t&'(M) are the automor- phisms of M. Let {x A} AeA be a family of elements in a module M. The sum L).eAXA is well defined if X A =0 ().EA) except for a finite number of A. F or any family {NALeA of subsets of M, LAeA N), denotes the set of all elements of the form L),eAX). (x),EN)), where x A =0 except for a finite number of A. If all the N A are subgroups of M, then N = LAeA N A is also a subgroup, called the sum of {N),LeA' If every element of N can be written uniquely in the form LAEAX A (x), E N A ), N is called the direct sum of {N A } AeA' When the N), are subgroups, this is equivalent to the condition that N A n LA#/iEA N/i = {O} (AEA). C. Modules with Operator Domain Suppose that we are given a set A and a module M. If with each pair of elements a E A 1042 and XE M there is associated a unique element aXEM satisfying the condition (1) a(x+ y)= ax+ay (aEA;x,YEM), we say that A is an operator domain of M and M is a module with operator domain A (module over A or A- module) ( 190 Groups E). The mapping A x M -+ M given by (a, x)-+ax is called the toperation of A on M. Any a E A induces an endomorphism aM:x-+ax of M as a module (not as an A-module). To give the structure of an A-module to a module M amounts to giving a mapping A-+t&'(M) (a-+a M ). If N is a subgroup of an A-module M such that ax EN for any aEA and xEN, then N forms an A-module, called an A-submodule (or allowed submodule) of M. If {N A } AeA is a family of A-submodules of an A-module M, then the intersection n)EA N A and the sum LAEA N A are both A-submodules of M. Let R be an tequivalence relation in an A- module M such that if aEA and xRy, then axRay. Then R is said to be compatible with the operation of A. In this case, an operation of A is induced on the quotient set MIR. Moreover, if R is compatible with the addition, namely, xRx' and yRy' imply (x+ y)R(x' + y'), then MIR forms an A-module, called a factor A-module of M. The equivalence class N con- taining 0 is an A-submodule of M, and MIR coincides with MIN. D. Modules over a Group or a Ring If a (multiplicative) group structure is given to the operator domain A of a module M, we always assume (in addition to condition (1) III Section C) that the following two condi- tions are satisfied: (2) (ab)x=a(bx); (3) 1x=x (a,bEA,xEM). If a ring structure is given to A, we always assume (besides conditions (1) and (2)) that the following condition holds: (4) (a+b)x=ax+bx (a,bEA,xEM). This means that the mapping A-+t&'(M) (a-+a M ) is a tring homomorphism. If the ring A has unity element 1, and I M =iden- tity mapping (namely, condition (3) holds), then the A-module M is called unitary. We consider only unitary A-modules. Any module M can be regarded as a Z-module or as an t&'(M)-module. When M is a module over a ring A, an element of A is called a scalar, A itself is called the ring of scalars (basic ring or ground ring), and the operation A x M -+ M is called the scalar multiplication. The elements ax (aEA) are called scalar multiples of x, and the totality of these elements is denoted by Ax. Let {x ALEA be a family of elements in M. An element of the form L)eA a), x;, where the a). are elements of A and equal to 0 except for a finite number 
1043 of }., is called a linear com bination of {x.} .EA' The set N of linear combinations of {X.}.EA is the smallest A-submodule of M containing all the x. (}.EA) and is equal to the sum L.EAAx.. The A-module N is said to be generated by {x.} J.EA' and {XJ.};.E!. is called a system of gen- erators of N. A module having a finite number of generators is said to be finitely generated (of finite type or simply finite). The module Ax generated by a single element x is called mono- mial. If A is a tfield (which may be noncom- mutative), an A-module is a linear space over A ( 256 Linear Spaces). Let a be an element of an A-module M. If there exists a nonzero divisor A of A such that J.a = 0, then a is called a torsion element of M. We say M is a torsion A-module if every ele- ment of M is a torsion element, and M is torsion free if M has no torsion element other than O. An element a of M is called divisible if for any nonzero divisor A E A there exists an element bEM such that a=Ab. M is called a divisible A-module if every element of M is divisible. Strictly speaking, the A-modules we have considered so far are called left A-modules. If we define the operation of A on M by xa (a E A, x EM) instead of ax and modify con- ditions (1 )-(4) appropriately (in particular, condition (2) becomes x(ab) = (xa)b), then M is called a right A-module. If AQ is a group or ring anti-isomorphic to A, then a left A- module can be naturally identified with a right AO-module. If A is a commutative group or ring, we can disregard the distinction between left and right A-modules. Let A and B be groups or rings. Sometimes we consider an A-module structure and a B- module structure simultaneously on the same module M. If the operations of A and B com- mute with each other, namely, a(bx)=b(ax) (a E A, bE B, x EM), it is convenient to put one of the operations to the right. If M has a left A-module structure and a right B-module structure, satisfying condition (5) (ax)b=a(xb), then M is called an A-B-bimodule. If G is a group and K is a commutative ring, the G-K- bimodule structure is equivalent to the left K[G]-module structure, where K[G] is the tgroup ring. E. Operator Homomorphisms A homomorphism f:M->N of A-modules M and N such that f(ax)=af(x) (aE A, XE M) is called an A-homomorphism (operator homo- morphism or allowed homomorphism). If A is a ring, f is also called an A-linear mapping. Regarding A as an A-module, an A-linear mapping M ->A is called a linear form on M. 277 F Modules The composite of A-homomorphisms is an A- homomorphism. Letf:M->L be an A-homomorphism of A- modules. The A-submodule Imf = f(M) of Lis called the image of f, and the A-submodule Kerf={xlxEM,f(x)=O} of M is called the kernel of f Coim f = M /Ker f is called the co- image of f, and Coker f = L/lmf the cokernel of f The binary relation xRy on M defined by f(x) = fry) (x,YEM) coincides with the equiva- lence relation defined by x - YEN = Ker f (x,YEM), and the mapping f induces an A- isomorphism J: M/N -> f(M). A sequence of A-homomorphisms of A- modules M n (nEZ) M f,-, M f, M ... nl----+ ,,---;, n+l ... is called an exact sequence if 1m f" - 1 = Ker fn for all n. The A-module {O} is denoted by O. Exactness of O->N!..M or M .'!.L->O means that the mapping f:N ->M is injective or the mapping g: M ->L is surjective, respectively. In an tinductive (tpro]ective) system {M.,f.} of A-modules, where every f.:M.->M A<J1 (A> J1) is an A -homomorphism, the limit M = l M J (l M.) is also an A-module. If 0-> L. -> M J -> N. ->0 is exact for every A and LA ->MJ ->N A 1 1 1 L"""M......N is a tcommutative diagram, then O->IL.-> l M. ->I N. ->0 is also exact. For the projective limit, however, we can only state the exactness ofO->IL).->IM.......lN.. The set of all A-homomorphisms of an A- module M to an A-module N, denoted by HomA(M, N), is a subgroup of the module Hom(M, N), and is called the module of A- homomorphisms. The set HomA(M,M)=t&'A(M) of all A-endomorphisms of an A-module M forms a tsubring of the ring t&'(M) and coin- cides with the set of all elements commuting with any aM (aEA). We denote by GL(M) the group of all tinvertible elements in t&'A (M). If A is a commutative ring, HomA(M,N) can be regarded as an A-module by defining (af)(x) = af(x), namely, af=aNof In particular, t&'A(M) is an tassociative algebra over A. If M is an A-B-bimodule, HomA(M,N) forms a left B- module by (bf)(x) = f(xb). If N is an A-B- bimodule, HomA(M,N) forms a right B- module by (fb)(x) = f(x)b. F. Direct Products and Direct Sums In the Cartesian product P = IT. E !. M. of a family {M.LE!. of A-modules, we define ad- dition and an A-operation as follows: {x.}+ {y.} = {x). + Y.}, a {x;.} = {ax.}. Then P forms 
277 G Modules an A-module. We call IT)EA M), the direct product of modules {M),}),EA' The canonical projection assigning x), to {x;,} is denoted by p),:PM),- Suppose that an A-module M and A-homomorphisms f),:MM),(AEA) are given. Then there exists a unique A-homo- morphism f:MP such that p), of = f), (AEA); f is given by f(x) = {J),(x)}. In the direct product IT),EA M)" the set S of all elements whose components x), are equal to o except for a finite number of A is called the direct sum of modules {M),} ),EA and is denoted by L),EA M), (or U ),EA M), or EB ),EA M),). The canonical injection assigning { ...,0, X;., 0, ...} ES to x), EM), is denoted by j),:M),S. If an A-module M and A-homomorphismsf).:M), M are given, then there exists a unique A- homomorphism f: S  M such that f oj), = f), (AEA), defined by f( {x),})= L),EAf),(x),). When M is an A-module and {N),} ),EA is a family of A-submodules of M, the A-homomorphism f:L).EAN),M defined by j({x),})=L),EAX), is an A-isomorphism if and only if M is the direct sum of {N).}. If M). =M for all AEA, IT),EAM). and L)EAM), are denoted by M A and M(A), respec- tively. M A can be regarded as the set of all mappings of A to M. The direct product M 1 x ... x M n and direct sum M 1 EB ... EB M n of a finite number of A-modules Ml' ...,Mn, can be Identified with each other and, if M i = M (1 in), we simply denote it by M n . G. Free Modules Let A be a ring. A family {x;,} ),EA of elements in an A-module M is called linearly indepen- dent if L),EAa),X), = 0 (a), EA) implies a), =0 for all AEA. This is equivalent to saying that the mapping A(A)M that assigns L),EAa),x),EM to {a),} is injective. A linearly independent family {x),} ).EA generating M is called a basis of M. A family {X),}),EA is a basis if and only if every element of M can be written uniquely in the form L),EAa),X). (a),EA). An A-module that has a basis is called a free module over A. If A is a field (which may be noncommutative), every A-module is a free module ( 256 Linear Spaces). The tcardi- nality of a basis of a free module M over A depends only on M if A is a field (which may be noncommutative) or a commutative ring; this number is called the rank (or dimension) of M. Any submodule of a free module over a tprincipal ideal domain is a free module. H. Simple Modules and Semisimple Modules An A-module M is called simple if M #0 and M has no A-submodules except M and O. 1044 If M and N are simple A-modules, any A- homomorphism of M to N is an isomor- phism or the zero homomorphism (i.e., one which sends every element of M to 0) (Schur's lemma). If an A-module M is the sum of a family {M),} )EA of simple submodules, M is the direct sum of a suitable subfamily {M).} )'EA' (A' c A). In this case, M is called semisimple (or completely reducible). If an A-module M can be decomposed into the direct sum of A-submodules Nand N', then N' is called a complementary submodule of N. An A-module Mis semisimple if and only jf every A-submodule of M has a comple- mentary submodule. Let A be a ring. Then the A-module A is semisimple if and only if every A-module is semisimple. In this case A is called a tsemisimple ring ( 368 Rings G). Every simple module over a semisimple ring A is A-isomorphic to a tminimalleft ideal of A. I. Chain Conditions The set of all A-submodules of an A-module M forms an tordered set under the inclusion relation. An A-module is called a Noetherian module if the ordered set satisfies the tmaximal condition and an Artinian module if it satisfies the tminimal condition ( 311 Ordering C). Let N be an A-submodule of an A-module M. Then M is Noetherian (Artinian) if and only if Nand M/N are both Noetherian (Ar- tinian). A ring A is called a tleft Noetherian ring (tleft Artinian ring) if A is Noetherian (Artinian) as a left A-module, and similarly for right Noetherian and Artinian rings. Every finitely generated module over a Noetherian (Artinian) ring is Noetherian (Artinian). Over an arbitrary ring A, a module M is Noetherian if and only if every A-submodule of M is fi- nitely generated. A finite sequence {MJO<;i<;' of A-submodules of an A-module M is called a tJordan-Holder sequence if M=Mo, Mj=>M i + I , Mr={O}, and the MJ M j + 1 (0  i< r) are simple. If such a sequence exists, M is said to be of finite length. The number r, called the length of M, depends only on M. The quotient modules MJM i + 1 (0  i < r) are uniquely determined by M up to A-isomorphism and permutation of the indices (c. Jordan and O. Holder). An A- module M is of finite length if and only if M is Noetherian and Artinian. A semisimple A- module is of finite length if and only if it is finitely generated. An A-module M is called indecomposable if M cannot be decomposed into the direct sum of two A-submodules different from M and {O}. Any A-module of finite length can be decomposed into the direct sum of a finite 
1045 sequence N 1 , "', N n of indecomposable A- submodules different from {O}. The direct summands N i (1 in) are unique up to A- isomorphism and permutation of the indices (W. KruIl, R. Remak, and O. Schmidt). J. Tensor Products Let A be a ring. Given a right A-module M and a left A-module N, we construct a module M @A N (caIled the tensor product of M and N) and a canonical mapping cp: M x N --> M @AN as foIlows. Let F be a free Z-module (free Abelian additive group) generated by M x N, and R be the subgroup generated by the elements of the forms (x+x',y)-(x,y)- (x',y), (x,y+ y')-(x,y)-(x,y'), (xa,y)-(x, ay) (x,x'EM, y,y'EN, aEA). We define M @AN = F / R, and calI the natural projection cp. If we denote cp(x,y) by x @ y, then we have (XI + x 2 )@ y=x 1 @y+x 2 @ y,X@(Yl +Y2)= X@Yl +X@Y2, and (xa)@y=x@(ay). Any element of M @ A N is written in the form LXi@ Yi (xjEM,Yi EN ). The tensor product M @AN of M and N and the canonical mapping cp: M x N--> M @ A N can be characterized as foIlows: For a module L, a mapping f: M x N -->L is caIled biadditive if the conditions f(x + x', y) = f(x,y)+ f(x',y),j(x,y+ y')= f(x,y) + f(x,y') hold. A biadditive mapping f satisfying the condition f(xa,y) = f(x,ay) is caIled an A- balanced mapping. Then we have (i) the canon- ical mapping cp:M x N -->M @AN is A- balanced; and (ii) for any module L and any A- balanced mapping f: M x N --> L, there exists a unique homomorphism f*: M @ AN --> L such that f(x,y)= f*(x @ y) (xEM,YEN). A right (left) A-module can be regarded as a left (right) A a-module, where A ° is the ring anti-isomorphic to A. In this sense, we have M@ANN@AoM. Let A be a commutative ring. For A- modules M, N, and L, a mappingf:M x N--> L is caIled a bilinear mapping if f is biaddi- tive and satisfies f(ax, y) = f(x, ay) = aJ(x, y) (aEA,xEM,YEN). The set S!(M,N; L) of all bilinear mappings M x N --> L forms A- submodule of the A-module L M x N. A bilinear mapping M x N --> A is caIled a bilinear form on M x N. The tensor product M @ A N be- comes an A -module if we define a(x @ y) = (ax)@ y (= x@ (ay)), and the canonical map- ping M x N -->M @AN is bilinear. For any A- module L and bilinear mapping f: M x N--> L, there exists a unique A-linear mapping f*:M @AN -->L satisfyingf(x,y) = f*(x @ y). By this correspondence f...... f*, we get an A- isomorphism S!(M,N;L)HomA(M@AN,L). If A is a field, M @AN coincides with the ten- 277K Modules sor product M @ N as a linear space ( 256 Linear Spaces H, I). In general, let M be a B-A-bimodule and N be a left A-module. Then M@AN becomes a left B-module if we define b(x @ y) = (bx) @ y. Let N be an A-B-bimodule and M be a right A-module. Then M @AN becomes a right B- module if we define (x @ y)b = x @ (yb). In particular, we have A @A N  N, M @A A  M. Let M, M' be right A-modules and N, N' be left A-modules. For A-homomorphisms f: M --> M' and g: N --> N', there exists a unique homomorphism h: M @AN --> M' @AN' satisfy- ing h(x @ y) = f(x)@ g(y); h is caIled the tensor product of f, 9 and is denoted by f@ g. We give here some simple examples (also  Sec- tion L). Examples. (1) Let M, N be free modules (linear spaces, for example) over a commuta- tive ring A. If {X;}iEI and {yJjEJ are bases of M and N, respectively, M@AN is also a free module with a basis {Xi @ Yj}iI.jEJ' If the dimensions dimM, dimN are finite, dimM@AN=dimMdimN. (2) For an tideal a of a commutative ring A, the tfactor ring M = A/a can be regarded as an A-module, and we have M@ANN/aN. For instance, (Z/mZ) @ z(Z/nZ)  Z/(m, n)Z, where (m, n) denotes the greatest common divisor of m and n. K. Hom and @ We continue to consider modules over a ring A. Concerning the direct sum and product, we have H omA ( IMA,flN )  flHomA(MA,N) A  A. and ( IM A ) @A ( IN )  I (MA @AN). )  A. Concerning projective and inductive limits we have HomA(IMA,1N)  IHomA(MA,N) and (IMA)@A(IN) =1(MA@AN). An A-homomorphism f:M -->M' induces a homomorphism HomA(M', N)-->HomA(M, N) by the assignment g-->goj An exact sequence M' --> M --> M" -->0 gives rise to the exact sequence O-->HomA(M", N)-->HomA(M, N) -->HomA(M',N). (I) 
277 K Modules An A-homomorphism f: N --> N' induces a homomorphism HomA(M, N)--> HomA(M, N') by the assignments g......f 0 g, and an exact sequence O-->N'-->N-->N" gives rise to the exact sequence O-->HomA(M, N')--> HomA(M, N) --> Hom A (M, N"). Let M be a right A-module and N', N, N" be left A-modules. An A-homomorphismf: N --> N' induces the homomorphism 1 M @f: M @AN -->M @AN', and an exact sequence N' --> N --> N" -->0 gives rise to the exact sequence M @AN'-->M @AN-->M @AN"-->O. (3) Exchanging left and right, we obtain similar results ( 52 Categories and Functors B; 200 Homological Algebra). Let Q be an A-module. Iffor any exact sequence of A-modules 0--> M' --> M --> M" -->0, the induced sequence 0--> HomA(M", Q)-->HomA(M, Q) -->HomA(M', Q)-->O is exact, then Q is called an injective A-module. This is equivalent to the condition that if M' is an A-submodule of an A-module M, then any A-homomorphism M' -->Q can be extended to an A-homomorphism M-->Q. Any A-module is an A-submodule of some injective A-module, and any injective A-module is a divisible A- module. If A is a tOedekind domain, any divisible A-module is an injective A-module. Let P be an A-module If for any exact sequence (4), the induced sequence O-->HomA(P, M')-->HomA(P. M) -->HomA(P, M")-->O is exact, then P is called a projective A-module. This is equivalent to the condition that for any surjective A -homomorphism g: M --> M" and any A-homomorphism [: P-->M", there exists an A-homomorphism h: P-->M satisfying goh = f. Any A-module is a factor A-module of some projective A-module. A projective A- module has no torsion element. A free A- module is a prOjective A-module. In general, an A-module is a projective A-module if and only if it is a direct summand of a free A- module. Let R be a right A-module, Iffor any exact sequence (4), the induced sequence O-->R @A M' -->R @AM-->R @A M " -->0 (7) is exact, then R is called a flat A-module. Any projective A-module is a flat A-module. A flat A-module R is called faithfully flat if R @AM 1046 (2) = {O} implies M = {O}. A flat right A-module R is faithfully flat if and only if R i= Rill for any left ideal III (i= A) of A. Let A be a tprincipal ideal domain. Then an A-module R is flat if and only if R has no torsion element, and R is faithfully flat if and only if R has no torsion element and R i= Rp for any tprime element p of A. We have the following important examples: (1) For a commutative ringA and its multi- plicatively closed subset S, the tring of quo- tients As is flat as an A-module. However, As is not faithfully flat. For instance, the field of rational numbers Q is not faithfully flat as a Z-module. (2) Let A be a tsemilocal ring and Abe its completion. Then A is faithfully flat as an A- module ( 284 Noetherian Rings; also [1, 7J). In the exact sequence (4), ifIm cp = Ker Ijt is a direct summand of the A-module M, we say that (4) splits. Then (5), (6). and (7) are exact for any A-modules Q, p. R. The exact sequence (4) splits if M' is injective or M" is projective. By AM, MA' and AMB' we mean that M is a left A-module, a right A-module, and an A-B- bimodule, respectively As already stated, AMB and AN imply B(HomA(M, N)), and AM and AN B imply (HomA(M, N))B' Similarly BMA and N A imply (HomA(M, N))B' and M A and BNA imply B(HomA(M, N)). Furthermore, BMA and AN imply B(M@AN), and M A and ANB imply (M @AN)B' With BLA' AM, and BN, we have (4) (5) HomA(M, HomB(L, N))  HomB(L @AM, N). (8) Similarly, for AMB' LA, and N B , we have Hom A ( L, Hom B ( M, N))  HomB(L @AM, N). (8') (6) If B is a commutative ring, (8) and (8') are B- isomorphisms. Furthermore, with LA, AMB' and BN, we have (L@AM)@BN L@A(M @BN). (9) We denote by M* the set HomA(M,A) of all linear forms on an A-module M. Then AM implies Mt and M A implies AM*; the A- module M* is called the dual module of M. A as a left A-module is dual to A as a fight A- module, and vice versa. For a family of A- modules {M)LEA. we have a canonical corre- spondence (LAEA M A )*  fLEA M!. From this, we have a canonical isomorphism (M*)*  M for any finitely generated projective A-module M Many facts concerning this tduality are similar to those valid for linear spaces ( 256 Linear Spaces G). Let A be a commutative ring. Letting A = B = N in (8) and (8'), we have the canonical A- 
1047 isomorphisms HomA(M, L*)HomA(L, M*) (L Q9AM)*= i!(L, M;A), namely, any bilinear form on Lx M is repre- sented by a linear mapping M->L* or L->M*. L. Extension and Restriction of a Basic Ring Fix a ring homomorphism p:A->B. We regard B as a B-A-bimodule by defining a right oper- ation of A on B by b'a=bp(a) (aEA, bE B). This bimodule is denoted by Bp. For every left A-module M, we construct the left B-module p*(M)=BpQ9AM, which is called the scalar extension of M by p. Every A-h:-Jmomorphism of A-modules f: M-> M'induces the B-homomorphism p*(f)= I B Q9 f:p*(M)->p*(M'). For every left B-module N, we construct the left A-module p*(N)=HomB(Bp,N), which is called the scalar restriction (or scalar change) of N by p. By the assignment h->h(1), we have a module isomorphism HomB(Bp,N)=N We identify p*(N) with N under this isomorphism. The operation of A on N is then given as a . y =p(a)y (aEA,YEN). If A is a subring of Band p is the canonical injection, then the opera- tion of A on p*(N) is the restriction of the operation of Bon N. Every B-homomor- phism of B-modules f: N -> N' induces the A- homomorphism p*(J):p*(N)->p*(N'). For any left A-module M and left B-module N, an A-linear mapping f: M ->p*(N)= N is called a semilinear mapping with respect to p. This means that f is an additive homomorphism satisfying f(ax)= p(a)f(x) (aEA,xEM). The extension and the restriction of a basic ring are related by the canonical isomorphism HomA(M,p*(N))HomB(p*(M), N) for an A- module M and a B-module N ( equation (8')). An element rx of the left-hand side and an element fJ of the right-hand side are associated by the relation rx(x) = {3(1 Q9 x) (x E M) ( 52 Categories and Functors). Let A and B be commutative rings. Then for A-modules M and M', we have the canonical B-linear mapping p*: B Q9A HomA(M, M') ->HomB(BQ9AM,BQ9AM'), which is a B- isomorphism if M is a finitely generated free (or more generally, projective) A- module. Using the notation p*, we have p*(HomA(M, M'))HomB(p*(M), p*(M')). We now give some examples where the basic rings are noncommutative. Let G be a group and H its subgroup Let p denote the homo- morphism of group rings K [HJ ->K [GJ in- duced by the canonical injection H ->G, where K is a commutative ring. For any K[H]- module M, p*(M) is called the induced module 278 A Monge-Ampere Equations of M. The representation of G associated with p*(M) is the tinduced representation of the representation of H associated with M. Next, we fix a group G and consider a homomor- phism p:K[GJ->K[GJ induced by a homo- morphism of commutative rings (J:K->K. If K = K and (J is an automorphism, then the representation associated with the "scalar ex- tension" p*(M) of a K [G]-module M is the tconjugate representation to the representa- tion associated with M. If K = KjlJ1 (1J1 is an ideal of K) and (J is the canonical projection, then the representation over K associated with the scalar extension p*(M) of a K [G]- module M is the reduction modulo IJ1 of the representation over K associated with M, and p*(M) is canonically isomorphic to MjIJ1M. Furthermore, the tlocalization and the tcom- pletion can also be treated under the formula- tion of scalar extension ( 67 Commutative Rings G, 284 Noetherian Rings B). References [IJ M. Nagata, Local rings, Interscience, 1962. [2J H. P. Cart an and S. Eilenberg, Homolog- ical algebra, Princeton Univ Press, 1956. [3J D. G. Northcott, An introduction to homo- logical algebra, Cambridge Univ. Press, 1960. [4J S. MacLane, Homology, Springer, 1963. [5J C. Chevalley, Fundamental concepts of algebra, Academic Press, 1956. [6J N. Bourbaki, Elements de mathematique, Algebre, ch. 2, Actualites Sci. Ind., 1236b, Her- mann, third edition, 1962. [7J N. Bourbaki, Elements de math6matique, Algebre commutative, ch 1,2, Actualites Sci. Ind., 1290a, Hermann, 1961. [8J C. W. Curtis and 1. Reiner, Representation theory of finite groups and associative alge- bras, Interscience, 1962. [9J R. Godement, Cours d'algebre, Hermann, 1963. [lOJ S. Lang, Algebra, Addison-Wesley, 1965. [IIJ S. T. Hu, Elements of modern algebra, Holden-Day, 1965. 278 (XIII.23) Monge-Ampere Equations A. Monge-Ampere Equations A Monge-Ampere differential equation is a second-order partial differential equation of the form Hr+2Ks+Lt+M +N(rt-s 2 )=0, (I) 
278 B Monge-Ampere Equations where H, K, L, M, and N are functions of x, y, z, p, and q, and r, S, t, p, and q represent the partial derivatives 02Z r = ox 2 ' oz p= ox ' 02Z s=, oxoy oz q=- oy' 02Z t=- oi' The characteristic manifolds are integrals of a system of differential equations defined as follows: Case (i) N i= 0. Ndp+Ldx+A 1 dy=0, N dq+A 2 dx+H dy=O, dz-pdx-qdy=O, Ndp+Ldx+A 2 dy=0, Ndq+A 1 dx+Hdy=0, dz- pdx-qdy=O, where AI and A 2 are the two roots of the equation A2+2KA+H L-MN =0. Case (ii) N = 0, H i= 0. dY=Aj dx, H dp+HA2dq+M dx=O, dz- pdx-qdy=O, dY=A2dx, H dp+HA j dq+M dx=O, dz-pdx-qdy=O, where Al and ;(2 are the two roots of the equation H)" 2 - 2KA + L = 0. Case (iii) N =0, H =0, Li=O. dx=O, Mdy+2Kdp+Ldq=0, dz-pdx-qdy=O, 2Kdy-Ldx=0, M dy+Ldq=O, dz- pdx-qdy=O. Case (iv) N =0, H =0, L=O. dx=O, 2Kdp+Mdy=0, dz-pdx-qdy=O, dy=O, 2Kdq+Mdx=0, dz- pdx-qdy=O. A manifold X(A), y(A), Z(A), p(A), q(A) that satisfies the system (2), (3) of differential equa- tions for case (i), (4), (5) for case (ii), (6), (7) for case (iii), or (8), (9) for case (iv) is a character- istic manifold of eq uation (1). The following result is known concerning Monge-Ampere equations: The union of sur- face elements of an integral surface of (1) is gencrated in two ways by characteristic mani- folds depending on one parameter. Conversely, 1048 if a manifold, each element of which is com- posed of a point of a surface S and the tangent plane at that point, is generated by a family of characteristic manifolds depending on one parameter, then the surface S is an integral surface of (1). B. Intermediate Integrals If a relation dV(x, y, Z,p, q) = ° is a consequence of a system of differential equations of charac- teristic manifolds, for example, in case (i) when V satisfies (2) N(oVjox + poVjoz) - LoVjop- A 2 0VjOq=0, N(oVjoy + qoVjoz) - AI oVjop - HoVjoq = 0, (3) V(x, y, z, p, q) = c (c an arbitrary constant) is called an integral of the system of differential equations. (i) If V = c is an integral of a system of differential equations of characteristic mani- folds, the solution z(x, y) of V = c considered anew as a partial differential equation of the first order is a solution of (1). Conversely, if every solution of V = c (excepting tsingular ones) satisfies equation (1), V=c is an integral of a system of differential equations of char- acteristic manifolds. (ii) If u(x, y, z, p, q) and v(x, y, z, p, q) are two integrals of a system of differential equations of characteristic mani- folds, then for any arbitrary function cp, cp(u, v) is also an integral. Thus, as a consequence of (i), every solution z(x,y) of cp(u, v)=O is also a solution of (1). The converse is also true, namely, for every solution z of (1) we can find a function cp such that z(x, y) is also a solution of cp(u, v) = 0. The relation cp(u, v) = ° is called an intermediate integral of (1). Sometimes an integral of a system of differential equations of characteristic manifolds is also called an inter- mediate integral. If each of the two systems of differential equations defining the character- istic manifolds has an tntermediate integral, then the two intermediate integrals cp(u, v) = ° and tjJ(u, v)=O form a tcomplete system of partial differential equations of the first order. Integrating this complete system, the tgeneral solution of equation (1) is obtained. (4) (5) (6) (7) (8) (9) References [IJ E. Goursat, Cours d'analyse mathematique III, Gauthier-Villars, fourth edition, 1927. [2J E. Goursat, Le90ns sur I'integration des equations aux derivees partielles du second ordre a deux variables independantes I, Her- mann, 1896. 
1049 279 (VII.16) Morse Theory A. General Remarks We are interested in smooth functions on a smooth manifold of dimension m that have only the simplest critical points. Here, "smooth" means differentiable of class ceo. Such a function f: M ->R enables us to investi- gate the topology of M. The decomposition of M into level sets of f contains a lot of topo- logical information on M. For instance, it is shown that M is thomotopy equivalent to a tCw complex that is determined by critical points of f; furthermore the tEuler character- istic of M can be computed by means of j, because the critical points are related to the thomology groups of M. These types of inves- tigations, called Morse theory, were originated by H. Poincare [1] and G. D. Birkhoff [2] and then developed into the form we see today by M. Morse [3]. An excellent exposition of this theory has been given by 1. Milnor [4]. R. S. Palais [5] has extended the theory to Hilbert manifolds. Morse theory has been fruitfully applied to differential topology and differential geometry. B. Critical Points of Functions on Manifolds For a point pEM, Mp is by definition the tangent space to Mat p. Let f be a smooth real-valued function on M. A point pEM is called a critical point of f if the induced map- ping f* : Mp->Rf(p) is zero at p. For every tlocal chart (x 1, "', x m ) around a critical point p of f with p=(O,... ,0), 8f 8f -;;-(0)=... =-(0)=0. OX 1 8X m The value f(p) is then called a critical value of f If the matrix (8 2 fj8x i 8x/O)) is invertible, then the point is ca1led a nondegenerate critical point, and if the matrix is not invertible, then the point is said to be degent:fate. These no- tions are independent of the choice of local charts around p. The above matrix is called the Hessian of fat p. The nullity and index of the Hessian of f at p are called the nullity and index of fat p, respectively. The function f has a local minimum at a nondegenerate critical point of index 0, and a local maximum at a nondegenerate critical point of index n. A smooth function f on M is caned a Morse function if it satisfies: (A 1) Every critical point of f is nondegenerate. The Morse lemma states that if p is a nondegenerate critical point of f 279C Morse Theory and if the index of fat p is A, then there exists a local chart (y 1, ... ,Ym) around p with p = (0, '" ,0) such that f is expressed as {(y)= f(p)-(Yd 2 -... -(YiJ 2 +(YHlf +... +(Ym)2. C. The Existence of Morse Functions Let M be a compact smooth m-dimensional manifold without boundary. The existence of Morse functions on M is shown by tembed- ding Minto R n for a sufficiently large n. For convenience, M will be here identified with its embedded image. At each point pEM the set v p of all unit normals forms an (n - m - 1)- dimensional unit sphere, and v(M)= UPEMV p is a smooth (n-1)-dimensional compact manifold. Let S be the unit hypersphere in R" around the origin. The Gauss normal mapping q>:v(M)->S sends each point (p, v(p))EV(M) to VE S via a canonical parallel translation of R n . Since q> is smooth, the tSard theorem implies that the set E of all critical values of q> has measure zero on S. Thus q>* at every (p, V(p))E q>-I(V) has maximal rank if v is taken in S-E. For every fixed VES - E let hv: M ->R be de- fined by hv(x)=<V,X),XEM, where < , ) de- notes the canonical inner product of Rn. Then hv is a Morse function. To see this let dE, dv, and dM be volume elements of S, v, and M, re- spectively. Then dv /\ dM is a volume element of v(M), and the function G:v(M)->R is obtained by the relation q>*dE= G(x, v(x))dv /\dM. G(x, v(x)) is called the Lipschitz-Killing curva- ture at V(X)EV x ' If A(v(x)) denotes the tsecond fundamental form of M with respect to v(x), then G(x, v(x)) =( -1 )mdet(A(v(x))) (Chern and Lashof [6J). Every critical point pE M of hv for VE S - E has a normal v(p) to M at p, and the Hessian of hv is given by A(v(p)). Thus G(p, v(p)) # ° ensures the nondegeneracy at p [6]. Another approach to the construction of Morse functions is to find for a given embed- ding of Minto R" an open dense set U c R" such that the Euclidean distance d x from any fixed point x on U to points on M has no degenerate critical points on M. In this case d;1 (( -00, aJ) is compact for al1 aE R. It is also possible to construct Morse functions on compact manifolds with boundary as wel1 as on noncompact manifolds. But a striking result of A. Phillips states that there exists a Morse function on every noncompact mani- fold which has no critical points [7]. For a pair of smooth manifolds M, N let Ceo(M,N) be the set of all smooth mappings from M to N. The weak (or compact-open) Ceo 
279 D Morse Theory topology on CD(M, N) is generated by the sets defined as follows. Let f E COO(M, N), and let (U, cp) and (V, t/J) be charts of M and N, respec- tively. Let K cUbe a compact set such that f(K) c V. For BE(O, 00), a weak tsubbasic neighborhood N°O(f;(U,cp),(V, t/J), K,B) of f is defined to be the set of all smooth map- pings gECOO(M,N) such that (1) g(K)c V, (2) IIDk(cpO forI) (x)-D k (cpogor 1 ) (x)11 <I:, xEcp(K), k = 1,2, ..., where D k denotes the kth derivative with respect to the coordinates. The weak cx; topology is thus defined on CX (M, N), and the space with this topology is denoted by C'(M, N). It follows from the construction of the em bedding in the Whitney embedding theorem that the set of all smooth embeddings of compact Minto R" is dense in C(M, R") if n> 2dim M. Moreover, the set of all Morse functions on a compact M forms an open dense set in C::;(M,R) (Nagano [8], Hirsh [9], Auslander and MacKenzie [10]) D. Decomposition of M by a Morse Function To decompose M into levels of a Morse func- tion we now consider the process of tattaching a handle. Let M be a compact manifold with tboundary aM. Let D S be the s-disk, and let g:(aD S ) x Dm-s-+aM be an tembedding. Then a tmanifold X(M; g; s) with a handle attached by 9 is defined as the quotient set obtained from the disjoint union M U (D S x D m - s ) by identify- ing points in aDs x D m - s and their images under 9 and equipped with a natural differenti- able structure. Similarly, if gi:(aDi') x Di n ,-+ aM (i = [, .. . , k) are em beddings with disjoint images, we can define the thandlebody X(M; 9 I , '" , gk; s I, .. . , ski [11]. For an fECX(M, R) and for a, bEf(M) with a <b let M"= {xEM'f(x)a} and M;= {XE M la f(x) b}. M; is called a level set of f For a Morse function f on a compact manifold M without boundary, the following fundamental facts are known [4]: (1) The first fundamental theorem. If MZ contains no crit- ical points, then MZ is diffeomorphic to M x [a,b]; (2) The second fundamental theorem Let CEra, b) be a unique critical value in [a, b], and let PI" ., Pk E M be critical points of f with Pi having index Ai' Then M b is diffeo- morphic to X(M a ; fr,... ,J,., Ar,..., Ak) for suitable embeddings 11, "', k It follows from the existence of Morse functions and from (1) and (2) that every compact manifold can be obtained by successively attaching handles to a disk ( 114 Differential Topology). Fur- thermore, if M admits a Morse function with only two critical points, then M is homeo- morphic to a sphere (Reeb [12]). Concerning the homotopy types of M and 1050 M a we have the following: (3) If cEf(M) is a critical value off with points PI' "', Pk with Pi having index Ai and if M: is compact for E> ° and contains no critical points other than PI, .. . , Pb then Me+< has the same homotopy type as MC-CUeA,U... Ue A " where eAj(J= 1, "', k) are Arcells. (4) Assume that a Morse function f on a (not necessarily compact) M satisfies: (A2) M" is compact for all a E R. Then by choosing a seq uence a 1 < a 2 < ... of regular values of f and applying (3) to each M:+1, we see that M has the same homotopy type as that of a Cw complex" where the number of A-cells belonging to , is the same as that of critical points of index A of f Moreover, we have the following Morse inequalities: (5) Let f be a Morse function on a compact M. If Ml. is the number of critical points of f on M of index ;., and if Rl. is a A- dimensional tBetti number of M, then Mo  Ro, MI-MoR1-Ro, Ml.-Ml.- 1 +...+(-I) AM o RA-RA_I+...+(-I)ARo, I<A<n-l, M"-M n - 1 +... +( -I)"Mo =Rn- R "-1 +... +( -1)"Ro. In particular, we have M k  Rk for all k. By using these facts S. Smale obtained an affirma- tive solution of the tPoincare conjecture in high dimensions [11,13]. The concept of critical manifolds in the sense of Bott [14] is stated as follows: Let M be a compact manifold embedded into an open set U c: R k . Let f: U -+R be a smooth function. M is called a nondegenerate critical manifold of f on U if (1) all points of Mare critical points of f and (2) the nullity of all XE M is equal to dim M. When M is such a nondegenerate critical manifold of f on U, fis constant on M, and the index A of f at x is well defined and is the same at all points on M. The tPoincare polynomial is expressed as P(M; t)= LtkdimHk(M). The Morse poly- nomial is defined by 1JJl(J; t) = L,v tl. N PIN; t), where N runs over all critical manifolds of f and )'N is the index of N. Under certain con- ditions for orientability along each nondegen- erate critical manifold of 1; we have again the Morse inequality 1JJl(J; t) P(M; t). E. Morse Theory on Hilbert Manifolds Let M be a tHilbert manifold, and let f: M -+ R be a smooth function. The Hessian a 2 fp of f at a critical point of f is a symmetric bilinear 
1051 form on M p given by a 2 fp(u, v) = d 2 (f 0 cp -1 )(dcpp(u), dcpp(v)), u,vEM p , where cp is a coordinate mapping of a local chart around p. The index (coindex) of f at a critical point pE M of f is defined to be the supremum of dimensions of subspaces of Mp on which a 2 f p is negative (positive) defi- nite. The self-adjoint bounded operator A which represents a 2 fp is given by a 2 fp(u, v) = (u, A(v)p' If A is invertible, then f is said to be nondegenerate at p. These notions do not depend on the choice of local charts around p. The Morse lemma has been generalized to a Riemannian Hilbert manifold M by R. Palais and S. Smale [15J as follows. Let f be a smooth function defined on M. If XE M is a nondegenerate critical point of f then there exist a chart cp around x and a projection P such that for y near x f(y)= f(x) + IIPcp(y)11 2 -11(1 - P)cp(y) II 2. The above result has been extended to critical manifolds [16]. A connected submanifold N of a Hilbert manifold M is called a nondegenerate critical manifold of f: M --+ R if (1) every point p E M is a critical point off and (2) for each pE M there exists a closed subspace Ep of Mp such that Mp = N p EB Ep and the Hessian form restricted to Ep is nondegenerate. Let v = (n, E, N) be a smooth Hilbert-space bundle over a compact connected manifold N, and let ( , ) be a Riemannian structure for v. Assume that a smooth function f: E --+ R has the zero section of vas a non degenerate critical mani- fold. Then there exist a tubular neighborhood v, = {VE E III vii < d of the zero section in vand a fiber-preserving diffeomorphism 1/1: v,--+I/1(v,) and an orthogonal bundle projection P such that for VE V p f 0 l/1(v) = f(N)+ IIPvl1 2 -11(1 - P)vI1 2 . Let us assume that (B) M is a complete Riemannian manifold, and (C) (Palais-Smale condition) iff is bounded on a set S c M and if the norm II Vf II of the tgradient vector of f has infimum ° on S, then there is a critical point of f on the closure S of S. Since M is not locally compact, condition (C) is required. In order to prove the first and second funda- mental theorems of Morse theory it is neces- sary that the integra I curves of'Vf exist, and (C) ensures their existence. Namely, under conditions (A 1), (B), and (C), one of the follow- ing two facts follows for any bEf(M) and for any regular point pEM b : (1) The integral curve e: [0, r J --+ M b of'Vf with e(O) = p exists, and f 0 c(r) = b holds for some r E [0, w), (2) the integral curve c: [0, w)--+M h of 'Vf exists and 279 F Morse Theory lim,_ooc(t) exists such that the limit point c(w) = limb 00 c(t)E M b is a critical point of f These facts imply that the first fundamental theorem of Morse theory holds if M does not contain any critical point of f, and also the second fundamental theorem holds if M has only nondegenerate critical points on M for some cE(a, b). Furthermore, assume thatf:M--+R and M satisfies (A 1), (B), and (C), and let a, bEf(M) be regular values with a<b. If for each nonnegative integer Ie, R, is the Ath Betti number of M, and if M, is the number of critical points of index }. of f in M, then the Morse inequality holds: MoRo, M I -MoR1-Ro, , , L (-l)HMk L (-l)HR b kO kO . 00 00 L (-l)kM k = L (-l)kR k . kO kO In particular, M,R; holds for all ),. If f is bounded below, then the }.th Betti number R! of M h and the number Mj of all critical points of f with index Ie in M b satisfy Mj  R! for all }.. F. Morse Theory of Path Spaces Morse theory on Hilbert manifolds applies to the energy functions on Riemannian Hilbert manifolds which consist of all HI-curves on a compact Riemannian manifold M, and the theory is useful for proving the existence of closed geodesics on M. Let M be a smooth manifold, and let 1= [0,1]. Let Hd/, M) be the set of all continuous curves a: 1 --+ M such that for each local chart (V, cp) of M, cp 0 a is absolutely continuous and II(cpoa)'ll is locally square integrable. In par- ticular, if M = R m , then HI (I, R m ) is a Hilbert space with the inner product (a,p)I:=(a(O),p(O))+ f (a(t),p(t))dt, a,pEH)(1,R m ). For each aEHI(/, M), set HdI, M)a= {X E H) (1, TM)I X(t)EMa(ll' tEI}, where TM is by definition the ttangent bundle over M. For any fixed pair of points p, qEM, let Q(M;p,q)= {aE Hd/, M)I a(O)= p, a(l)= q}, and for each aEQ(M;p,q), let Q(M;p,q)a= {X EH 1 (1, M)al X(O)=OEM p ,X(1)=OEM q }. Then H 1 (1,M)a forms a vector space, and 0.(M; p, q)a is a sub- space of HI (1, M)a' M can be embedded into R n for sufficiently large n by the Whitney 
279G Morse Theory embedding theorem [17]. Then we have the following facts [17]: (1) HdI, M) = {aE HI (1, R") I a(1) eM}, (2) HI (1, M) is a closed submanifold of HI (1, R"), (3) Q(M;p,q) is a closed submanifold of HI (I, M), (4) the tangent space to HI (1, M) at each point a E HI (1, M) is HI (1, M)a, (5) the tangent space to Q(M; p, q) at each point aEQ(M;p,q) is Q(M;p,q)a' Here M is identified with the embedded image in R n . Thus HI (1, M) and Q(M; p, q) carry the structure of a Hilbert manifold, and they are independent of the choice of embeddings [17]. The energy functions on HI (I, M) and Q(lvl; p, q) play important roles in the develop- ment of Morse theory. M is now assumed to be a complete Riemannian manifold. It follows from the Nash isometric embedding theorem ( 365 Riemannian Submanifolds B) that M is isometrically embedded into R n ' for a sufficiently large n'. HI (1, R"') carries a com- plete Riemannian metric in a natural way, and HI (1, M) becomes a complete Riemann- ian manifold with the metric induced from HdI,RN} Similarly Q(M;p,q) admits a com- plete Riemannian metric. Then the energy function E: HdI, M)->R is defined to be E(a):=  r (a', a') dt, 2J[ where ( , ) is the inner product induced from the Riemannian structure of M. Then the Palais-Smale condition (C) is satisfied for E on HI (1, M). That is, if a sequence {ad on HdI,M) satisfies: (1) {E(ad} is bounded above; and (2) V'E(ad->O as k-> 00, then there is a subsequence {ad of {ad such that {ad con- verges to a critical point aE HI (I, M) of E. Also, condition (C) is fulfilled for the energy function on Q(M;p,q). For a fixed pair of points p, qE M, let Q = Q(M,p,q). Let aEQ be a critical point of E. A tangent vector W to Q at a: 1-> M with a(O) =p, a(1)=q is a tpiecewise differentiable vec- tor field along a such that W(O) = OE Mp and W(1)=OEM q . A proper variation a:( - 8, 8) X I ->M along a which is associated with W satis- fies the following: (1) a(O, t) = a(t), tEl; (2) there exists a finite partition 0= to < t 1 <... < t k = 1 of I such that a I ( - 8, 8) X [t;-I' tJ is differenti- able for all i = 1, ..., K; (3) a(u, 0) = p and a(u, 1) =q hold for all UE( -8,8); and (4) oa(O,t)jou = W(t), tEl. It follows from the first variation formula ( 178 Geodesics A) that aE Q is a critical point of E if and only if a is a geodesic on M. Let WI' W2EQa, and let U be an open set in R 2 around the origin. Then a proper variation a: U x I ->M along a that is asso- ciated with WI and W 2 satisfies the follow- ing: (1) IX(O,O,t)=(J"(t), tEl; (2) there exists a finite partition 0= to < t 1 < ... < t k = 1 of I such 1052 that a I U x [t;-I' t i ] is differentiable for all i = 1, ...,k; (3) a(u 1 ,u2'0)=P, a(uj,u2' 1)=q for all (u 1 , U2)E U; and (4) Ga(O, 0, t)jGU1 = WI (t), oa(O, 0, t)jOU2 = W 2 (t), tE I. Then the Hessian E** of E at a is given by _ o2EW(U1' U 2 )) 1 E**(W 1 ,W 2 )- 2 0 3 ' U 1 (U 2 (0.0) where (u I' U 2 )E Q is by definition the curve a(u 1 ,u 2 )(t)=a(u1,u2,t). The second variation formula ( 178 Geodesics A) then gives E**(W 1 , W 2 )= - L (W2(t),, W{(t) f' -1 (W 2 , WI' +R(W 1 ,a')a')dt, where, W{(t) = W{(t+O)- W{(t-O). We have the Morse index theorem, which states: (1) The tnull space of E at a critical point a is the linear space spanned by all Jacobi fields ( 178 Geodesics A) along a that vanish at ° and 1; (2) if {a(sd,a(s2)' ...,a(sl)} (O<SI <S2 < ... < SI < 1) is the set of all points conjugate to a(O) along a and if Ai is the multiplicity of the conjugate point a(si)' then the index of E at a is equal to AI +... + .!cl' It follows from the Sard theorem together with the differentia- bility of the exponential mapping on M that except for a set of measure zero in M x M, p, q can be chosen so that p, q is not a conjugate pair along any geodesic in Q. Then all critical points of E are nondegenerate, and for any c>O, QC= {wEQIE(w)=c} contains at most fimtely many nondegenerate critical points with finite indices. G. Existence of Closed Geodesics Let M be a compact Riemannian manifold. By replacing I with the circle S 1, we consIder a Hilbert manifold A(M)=H 1 (SI, M). A(M) carries the structure of a complete Riemannian manifold. Every point pE M is naturally em- bedded in A(M) as a point curve, and M is a totally geodesic submanifold of A(M). A point aEA(M) is a critIcal point of the energy func- tion E: A(M)->R if and only if a is a closed geodesic of M. It is known that E satisfies the Palais-Smale condition (C). The index and n:.J1lity of E at a critical point a is finite. Since M is compact, the fundamental length fi of M is positive and N = {aE A(M) I E(a)  c} is a tdeformation retract of Me A(M) by means of the deformation along the integral curves of -V'E. If M is not simply connected, then each nontrivial element of the tfundamental group of M represents a class of homotopic curves in 
1053 which there is a closed geodesic whose length realizes the infim um of all these curves. If M is simply connected, then there is a mini- mum integer k for which ndM)#O. Clearly, nk_I(A(M))ndM)#O. Suppose that E has no positive critical value. Then A(M) is a defor- mation retract of M, a contradiction. Therefore there exists at least one closed geodesic on every compact Riemannian manifold (Lyuster- nik and Fet [18J). Proof of the existence of many closed geo- desics on M is difficult due to the follow- ing: (1) There is a continuous 0(2) action on A(M) that assigns aEA(M) and IXEO(2) to the curve t->a(t+ IX), tES I ; (2) for each integer k and for each critical point aEA(M), the curve t->a(kt) is a critical point with energy k 2 E(a). A remarkable result has been obtained by Lyusternik and Shnirel'man [19J, which states that there are at least three closed geo- desics on every simply connected compact manifold of dimension 2. Fet then proved that there exist at least two closed geodesics on a compact manifold if all critical points of E on A(M) are nondegenerate [20]. By developing a precise argument concerning the Morse lemma around an isolated degenerate critical point aEA(M) of E, Gromoll and Meyer have proved that there exist infinitely many closed geodesics if the sequence of Betti numbers {bi(A( M))} with respect to any field is un- bounded [21]. If A: M -> M is a certain isome- try, then there are also infinitely many A- invariant closed geodesics if the Betti numbers of the space of A-invariant HI-curves are not bounded [22,23]. By investigating the Z2- cohomology of A(M) of compact symmetric spaces, Ziller has proved that if M has the same homotopy type as that of a symmetric space of rank  2, then M has infinitely many closed geodesics [24]. Many attempts have been made by W. Klingenberg and others to prove the existence of infinitely many geometrically distinct closed geodesics on every compact Riemannian mani- fold [25]. References [IJ H. Poincare, Sur les lignes geodesiques des surfaces convexes, Trans. Amer. Math. Soc., 6 (1905),237-274. (Oeuvres, Gauthier-Villars, 1953, vo!. 6, 38-85.) [2J G. D Birkhoff, Dynamical systems, Amer. Math. Soc. Colloq. Pub!., 1927. [3) M. Morse, Calculus of variations in the large, Amer. Math. Soc. Colloq. Pub!., 1934. 279 Ref. Morse Theory [4J J. Milnor, Morse theory, Ann. math. studies 51, Princeton Univ. Press, 1963. [5J R. S. Palais, Morse theory on Hilbert manifolds, Topology, 2 (1963),299-340. [6J S. S. Chern and R. Lashof, On the total curvature of immersed manifolds, Amer. J. Math., 79 (1959),306-318. [7J A. Phillips, Submersions of open mani- folds, Topology, 6 (1967),171-206 [8J T. Nagano, Variational theory in the large (in Japanese), Kyoritu, 197\. [9J M. W. Hirsch, Immersions of manifolds, Trans. Amer. Math. Soc., 93 (1959), 242-276. [IOJ L. Auslander and R. E. MacKenzie, In- troduction to differentiable manifolds, McGraw-Hill,1963. [IIJ S. Smale, Generalized Poincare's conjec- ture in dimensions greater than four, Ann. Math., (2) 74 (1961),391-406. [12J G. Reeb, Sur certains proprietes topo- logiques des vari6tes feuilletees, Actualites Sci. Ind., II (1952),91-154. [13J J. Milnor, Lectures on the h-cobordism theorem, Pnnceton Univ. Press, 1965. [14J R. Bott, Nondegenerate critical mani- folds, Ann. Math., (2) 60 (1954), 248-26\. [15J R. Palais and S. Smale, A generalized Morse theory, Bul!. Amer. Math. Soc., 70 (1964),165-172. [16J W. Meyer, Kritische Mannigfaltigkeiten in Hilbertmannigfaltigkeiten, Math. Ann., 170 (1967),45-66. [17J J. T. Schwarz, Nonlinear functional ana- lysis, Gordon & Breach, 1969. [18J L. Lyusternik and A. I. Fet, Variational problems on closed manifolds (in Russian), Dokl. Akad. Nauk SSSR, 81 (1951), 17-18. [19J L. Lyusternik and L. Shnirel'man, Methodes topologiques dans les problems variations, Actualites Sci. Ind., Hermann, 1934. [20J A. I. Fet, Variation problems on closed manifolds, Amer. Math. Soc. Trans!., ser. 1,6 (1962),147-206. (Original in Russian, 1952.) [21J D. Gromoll and W. Meyer, Periodic geodesics on compact Riemannian manifolds, J. Differential Geom., 3 (1969),493-510. [22J K. Grove, Condition (C) for the energy integral on certain path spaces and applica- tions to the theory of geodesics, J. Differential Geom., 8 (1973), 207-223. [23J M. Tanaka, On the existence of infinitely many isometry-invariant geodesics, J. Differen- tial Geometry, 17 (1982) 171-184. [24J W. Ziller, Geschlossene Geodatische auf global symmetrischen und homogen Raume, Bonner Math. Schr., 85, 1976. [25J W. Klingenberg, Lectures on closed geo- desics, Grundlagen der math. Wiss. 230, Springer, 1978. 
280 A Multivariate Analysis 280 (XVlIl.g) Multivariate Analysis A. General Remarks M ultivaria te analysis consists of methods of statistical analysis of multivariate observations represented by a collection of points in a finite- dimensional Euclidean space RP. With the development of powerful computers, multi- variate techniques are beginning to be utilized in many fields of science and technology. B. The Multivariate Linear Model The multivariate linear model is an immediate extension of the univariate linear model. Sup- pose that X =(X(1). . x<n») denotes the p x n matrix of n observations of p-dimensional data. Suppose that it can be expressed as X=BZ+U, where B is a p x m matrix of unknown para- meters, Z is a known m x n matrix of inde- pendent variables, and U is a p x n matrix of errors. We assume that (i) the texpectations of the elements of U are zero, that is, the p x n matrix E(U)=O, and call the relation (1) a multivariate linear model. We usualIy assume further that (ii) the column vectors U(i), i = I,. ., n, of U are independent and identically distributed, and that (iii) U(i) is distributed according to a multivariate tnormal distri- bution with tcovariance matrix };. Analogous to the univariate case, the tleast squares es- timator B of B is defined to be the p x m ma- trix that minimizes tr(X - BZ) (X - BZ)' and is given explicitly by B=XZ'(ZZ,)-1 when IZZ'I i'0. Here the symbol' means the transpose of a matrix. Also, an unbiased estimator of}; is given by 1: = Qe/(n - m), Qe = XX' - BZX'. B is an unbiased estimator of B under the assumption (i) above and is the tbest linear unbiased estimator under (i) and (ii), while 1: is unbiased when (i) and (ii) are assumed. Under the assumptions (i)-(iii), Band 1: form a set of complete tsufficient statistics; hence they are tuniformly minimum variance unbiased es- timators. Also under (i)-(iii), elements of Bare normally distributed, and their covariance can be expressed by }; <8> M (<8> denotes the tKronecker product). 1054 (1) where M =(ZZ')-1 Applying tCochran's theorem for the multivariate case, Qe is shown to be distributed according to a tWishart distribution with n - m degrees of freedom. To test the hypothesis B = Bo under (i)-(iii), we put QB=(B-Bo)ZZ'(B-BoY and have (X -BoZ)(X-BoZ)' =QB+Q.. where QB and Qe are independently distributed The distribution of QB is a Wishart distribution with m degrees of freedom when the hypothesis is true, and a tnoncentral Wishart distribution when Bi'Bo. Based on this fact, several procedures have been proposed. If we require the invariance of procedures with respect to linear transforma- tions of the coordinates of p-dimensional vectors, the roots XI' ..., X p of the tcharacter- istlc equation IQB - ),Qel = 0 form a tmaximal invariant statistic; hence the testing proce- dures should be defined in terms of these roots ( 396 Statistic I). Also, the consideration of tpower leads to procedures that reject the hypothesis when these roots are large. Com- monly used test statistics are (1) the t1ikelihood ratio test W=IQel/IQB+Qel=fL(1 +A;)-t (S. S. Wilks); (2) trQ;tQB=LAi (D. N. Lawley and H. Hotelling); (3) maxA i (S. N. Roy); (4) trQB(QB+Qe)-1 =L)'i(1 +.!ci)-t (K. C. S. Pillai). Pill ai's trace test (4) is locally the most powerful invariant; Wilks's likelihood ratio test (1) has the maximum Bahadur efficiency. The power functions of the tests (1 )-(3) have the monotonicity property, namely, they are monotonically nondecreasing with respect to each eigenvalue OfO=};-I(B-B o )ZZ'(B- Bo)', the matrix of noncentrality parameters for QB' The monotonicity for Pill ai's test (4) is known to hold only for restricted cases where the critica1 value c for the acceptance region tr QB(BB + Qe) -1 <:;: C should satisfy 0 <:;: c <:;: 1 All the tests (1 )-(4) are unbiased. With respect to 0-1 loss, the tests (1) and (4) are tadmissible Bayes and the tests (2) and (3) are tadmissible. Tests based on min Ai are inadmissible. SmalI-sample distributions of these statis- tics are complicated but when n--+CXJ, nlog W, ntrQBQ;t, and ntrQB(QB+Qe)-1 are as- ymptoticalIy distributed according to a chi- square distribution with pm degrees of free- dom under the null hypothesis. Even under the alternative hypothesis that the matrix of noncentrality parameters Q = 0(1) for large n, the asymptotic distributions remain the same. When Q = O( I), they are noncentral chi- square distributions of pm degrees of freedom and noncentrality parameter b = tr Q. If Q = O(n), they are normal distributions, namely, -fi(log W+logl1 +BI), fi(trQBQ;I-trB) and fi(trQB(QB+Qe)-t -trB(I +O)-t) for f)=limQ/n have asymptotically normal distributions of zero means and variances given by 2 tr(I - (I + B) - 2), 2 tr (20 + ( 2 ), and 
1055 2 tr«(I + (1) -2 -(I +0)-4-), respectively. As a special case, if m = 1 there exists only one non- zero A, and the procedures in this paragraph all coincide and are equivalent to one based on T 2 = M -1 (H - Bo)' .t- 1 (H - Bo). It is known that under the hypothesis, (n - p) T 2 /(n - l)p is distributed according to an t F -distribution with (p, n - p) degrees of freedom. When p = 2 (resp. m=2), (n-m-1)(I-jW)/(mjW) (resp. (n - p-1)(I-jW)/(PjW») is distributed according to an F -distribution with degrees of freedom (2m,2(n-m-l)) (resp. (2p,2(n-p -1))). Simultaneous tconfidence regions of B can be derived from the testing procedures in this paragraph, that is, trQ;I(B-H)22'(B-H)' <c. Moreover, when the matrix B is decom- posed as B = (B) : 8 2 ), where B 1 and B 2 are a p x q matrix and a p x (m - q) matrix, respec- tively, and the hypothesis to be tested is of the form B 1 = 0, the test procedures can be ob- tained as follows: Decompose 2 as 2= (J, where 2 1 is a q x n matrix and 2 2 is an (m - q) x n matrix, and put Hi = X2; (2 2 2;) -I. Q*=XX'-Hi2 2 X', QB,=Q*-Qe. Then QB 1 and Qe are independent, QB, is distributed according to a Wishart distribution with q degrees of freedom when the hypothesis is true, and we can apply the procedures in the previous paragraph, simply replacing QB byQB,. Such a procedure is called multivariate analysis of variance (or MANGV A, for short). Various standard situations can be treated in this way (after some linear transformation of variables, if necessary). Some examples are (1) X = (X(1) ... x(n»), where the X(i) (i= 1,..., n) are distributed independently according to a p-dimensional normal distribution N (p, 1:). We can express X as X =pI' + U, and the estimators are given by p=X=Xl/n,.t= (X-Xl')(X-XI')'j(n-1). In this case, we obtain a test for the hypothesis P = Po based on HoteIling's T 2 statistic, i.e., the test with a tcritical region of the fonn T 2 =n(X- Po)'.t- 1 (X- Po»c. (2) Suppose that p x n i matrices Xi, i = 1, "', k, are sam pIes of size n i from p-dimensional normal distributions N(Pi' L:) with common covariance matrix L:. The tests for the hypoth- esis PI =... = Pk are obtained from the follow- ing observation: Let Qe = Li(X i - Xi 1')(X i - XiI')', where X i =X i ljn i , Q"=Lni(Xi-X) 280C Multivariate Analysis (Xi-X)', X=LniX;/Ln i . Then L(Xi-XI') (Xi-XI,),=Q"+Qe' We call Qe the matrix of the sum of squares within classes, and Q" the matrix of the sum of squares between classes. The latter is distributed according to a Wish- art distribution when the hypothesis is true. (3) Suppose that Xij are p-dimensional vectors, and that Xij=P+'Xi+P j + U ij , i= 1,... ,m, j= 1, ...,n, where p, 'Xi, Pj are p-dimensional constant vectors such that L'Xi=O and Lpj=O, and the Uij are independently distributed according to a p-dimensional normal distribution N(O,1:). We set Q,=nI(X i -X)(X i -X)', Qp=mI(X j-X)(X j-X)', Qe=II(Xij-X i -Xj+X) x (Xij-X i -X j+X)', where X. = 'X / n 1  I] , j x = ' X / m J  IJ ' i X= IIXjmn Then we have LL(Xij-X)(Xij-X)' =Q,+ Qp + Qe, and Q" Qp, Qe are distributed inde- pendently according to (noncentral) Wishart distributions with degrees of freedom m - 1, n-l, and (n-1)(m-l), respectively. The tests for the hypothesis 'Xi=O (i= 1, ...,m) or Pj=O (j = 1, "', n) are obtained from these matrices. C. Tests for Covariance Matrices Let Xij(p x 1),]= 1, ...,N i , be a random sample from p-variate normal distribution N (Pi' L:J for i = 1, "', k. For testing the hypothesis Ho: L:, =... = l:k with unknown mean vectors Pi against all alternatives, the likelihood ratio statistic is given by . flISiIN;/2 flN i N ;/2 II S iI Nj2 ' N=N 1 +...+N k , (2) where Si=LfdXij-Xi)(Xij-X;)' for X i = Lf1 Xij/N i . If we replace the sample size N i by the degrees of freedom n i = N i - 1 and N by n = N - k in (2), the modified likelihood ratio test is unbiased for general p and k [16]. For k = 1, the hypothesis specifies Ho: L: 1 = L:o (a given positive definite matrix) and the likelihood ratio statistic is given by ISI NI2 etr( - Lo! S/2) (etr(lX) = exp(trlX)). Again replacing N by n = N - 1 yields an unbiased test. Moreover the power function of this modified likelihood ratio test depends only on the eigenvalues of L: 1 L:o 1 and is increasing with respect to the absolute deviation of each 
2800 Multivariate Analysis eigenvalue from 1, that is, Ich)",'l1.'ol-11. For k = 2, it is conjectured that the modified likeli- hood ratio test has a power function monoton- ically increasing with respect to Ichi1.'j 1.'2- 1 -11. This conjecture has not yet been proved; we know only that the power is increasing if the maximum eigenvalue chI 1.'1 1.'2- 1 increases from 1 or the minimum eigenvalue ch p 1.' I 1.';1 decreases from 1. D. Estimation of Mean Vector and Covariance Matrix Let X(p x 1) have p-variate normal distri- bution N{jl,I). For estimating the unknown mean, take the sum of squared errors S as a loss L(p, d) = lip - d11 2 . Since each component Xi of X is distributed independently according to N (Pi, 1) for p = (l1b ... ,11.)', it is natural to suppose that X is a good estimator. In fact X is a minimax estimator for p. However, Stein showed that X is admissible for p= 1 and 2 but inadmissible for p 3. James and Stein showed that the estimator ( P-2 ) l-1iXf X dominates X when p  3. This estimator is further dominated by Stein's positive part estimator (1-(p- 2)/IIXf)+X, where (a)+ means a if a is nonnegative and zero if a is negative. The class of estimators X + V 10gf(X), where V=(%x 1 , ...,%x p )' for X'=(x 1 , ...,x p ) are all minimax for p dominating X, if f(X) O and J f(X) is superharmonic (V 2 J f(X) <:;:0), satisfying E[IIVlogf(X)f] < 00 and E[Llo 2 f(X)/oxfl/f(X)] < 00. Putting f(X) = IIXII-(P-2) yields the James-Stein estimator (3). For this problem a class of monotone estimators is essentially complete, where an estimator d(X) is called monotone if d(X) <:;: d(Y) whenever X <:;:Y (defined componentwise). Stein's positive-part estimator is not mono- tone and is still inadmissible [8]. Let X;(p xl), i = 1, ... , n, be a random sam- ple taken from the p-variate normal distri- bution N(O,L'). Then the maximum likeli- hood estimator for 1.' is given by Sin, where S= L7=1 XiX;. S has Wishart distribution J.v,,(n, 1.'). To study the estimation problem of 1.'. take two loss functions Ld1.', d) = trd1.'-l -logld1.'- I I- p and L2(1.', d) = (1/2)tr(d1.'-1 _1)2. Under L 1 -loss, the best scalar multiple ofS is given by Sin, and under L 2 -loss it is given by S/(n+ p+ 1). The James-Stein mini- max estimator for L 1 -loss is given by d 1 (S) = K/',.K', where K is the lower triangular ma- trix with positive diagonal element for which S=KK' and /',.=diag(/',.I' ...,/',.p) with /',.i= 1/(n + p + 1 - 2i). A minimax estimator for 1056 L 2 -loss is given by d 2 (S)=K/',.K' with the same lower triangular matrix K, but /',. is the solu- tion of the linear equation A/',. = b, where A is given by l (n+ p -1)(n+ p +l) n+p-3 A= n+p+3 (n+p-3)(n+p-l) . . . . . . n-p+1 n-p+1 (3) =: ] (n-p+ l)(n- p+3) and b'=(n+p-l, n+p-3, ...,n-p+ 1). The minimax estimators dr(S) and d 2 (S) dominate the best scalar multiples of S. However, they are inadmissible. Let P be a permutation ma- trix, then the estimator 1.'p P'di(PSP')P/p! dom- inates d;(S) because of the convexity of the loss function L j for i= 1 and 2. The estimator hdS) =(S+ub(u)C)/n, where u= 1/trCS- 1 and Cis a fixed positive-definite matrix, dominates Sin under L 1 -loss if 0<:;: b(u) <:;: 2(p -1 )/n and b(') is nonincreasing. Under L 2 -loss (S+bCjtrCS-1)/ (n+p+ 1) dominates S/(n+p+ 1), ifO<:;:b<:;: 2(p - 1 )/( n - p + 3) [7]. The Haff estimator hI (S) is dominated by the James-Stein esti- mator d 1 (S) if p6. The estimators (S+ b(trCS- 1 /trCS- 2 )C)/n for 0<:;:b<:;:2(p-l)/n under L I-loss also dominate Sin and are not dominated by d 1 (S). Similar results hold under L 2 -loss. E. Correlation among Variables In order to represent the interrelationships among the p variates, the population corre- lation matrix P = {Pij} is used. Also, we can calculate the population tmultiple correlation coefficient of the ith component Xi and all the rest of the var iables by Pi 1 .(i) p= J 1- P/P ii , and the tpartial correlation coefficient of Xi and X j , given an others, by P .. 1 ( ' ) U) = - P.. / IP.1>. lJ L P IJ V rUrjj, where P ij are the cofactors of P ( 397 Statis- tical Data Analysis). The sample correlation matrix is calculated from the data, and the sample multiple correla- tion and the sample partial correlation coeffi- cients are calculated from the sample correla- tion matrix in exactly the same way as the population coefficients are calculated from the population correlation matrix. When X = (X(I) '" x(n») is a sample of size n from a multivariate normal population, the sampling distributions of R i 1 Ii) P and R ij I (i) U) pare known ( 374 Sampling Distributions). 
1057 The determinant of the covariance matrix 11:'1 (or ISI), called the (sample) generalized var- iance, is a measure of the dispersion of a p- dimensional distribution. The distance of two distributions with mean vectors 1'-1 and 1'-2, respectively, and with common variance 1:' is often expressed by b =(1'-1 -1'-2)'1:'-1(#1 - #2), which is called the Mahalanobis generalized distance. When the data consists of (p + q)- dimensional vectors (J with q < p, the inter- \1'/ relation of X and Y as a whole can be ex- pressed in the following way: Let the covar- iance matrix be partitioned as 1:' = ( LXX LYX ) LXY Lyy and the nonzero roots of the equation IpLyy- LYXLX Lxy I = 0 be PI' ..., Pq. Then pfl2, ..., pi l2 , called the canonical correlation coeffi- cients, are the maximal invariant parameters with respect to linear transformation of X and Y. Also, if we denote the eigenvector cor- responding to a root Pi by 'Ii, i.e., (PiLYY)'1i = (Lyx1:'x LXY)"i' the linear function "i Y and "i Lyx 1:'x X are called the canonical variates. F. Principal Components An important problem in multivariate analysis is to express the variations of many variables by a small number of indices. Principal compo- nent analysis is a technique of dealing with this problem. Let X be a p x n matrix with n column vec- tors of p-dimensional data. A linear transform of X, T=AX (A an r x p matrix, r<p, T an r x n matrix) is called the principal component if A is chosen so as to maximize the sum of the squares of the sample multiple correlation coefficients of each of the row vectors of X to those of T, namely, if A is an r x p matrix formed by the r eigenvectors of the sample correlation matrix of X corresponding to the r largest eigenvalues or any nonsingular linear transformation of them. This is a characteriza- tion of principal components in terms of cor- relation optimality. The principal component T=AX is also characterized by the information-loss op- timality in that all eigenvalues of (X - CY - bI)(X - CYbl)' are simultaneously minimized subject to the condition: C is a p x r matrix, b is a p x 1 vector and Y is an r x n matrix. The solution is given by CY + bl = A'T + xl, 280G Multivariate Analysis where x = X1nln. The variation optimality of the principal component is given by maximiz- mg simultaneously all the eIgenvalues of the matrix C'(X - Xl)' (X - Xl)' C, subject to the condition that C is a p x r matrix such that C'C = Ir. The solution is given by C = A', namely, C'X=T [15]. If all the correlations between the components of X are positive, the largest eigenvalue of the covariance matrix of X is simple and positive. All the coefficients of the first principal component (components of the eigenvector) can be taken to be positive (Perron-Frobenius theorem). \Vhen we assume normality, the eigenvalues of the sample correlation matrix R are the tmaximum likelihood estimators of the eigen- values of the population correlation matrix, and their sampling distributions can be ob- tained. A hypothesis relevant to principal component analysis is, for example, that the smallest p - r eigenvalues of the correlation matrix are equal, which can be tested by the statistic Rp-r =IRI!()-I'" Ar«p-A I -... -Ar)/(p-rj)P-r), where AI, '" , Ar are the r largest eigenvalues of R. Under the hypothesis, - c logRp-, (c a con- stant) is asymptotically distributed according to a chi-square distribution when n--+ 00. Variations of principal component analysis can be obtained by taking the eigenvectors of the covariance matrix of the raw data or of a multiple of it by some weight matrix. G. Factor Analysis Factor analysis is closely related to principal component analysis. We assume a model X=BF+U, where Band F are unknown p x rand r x n matrices of constants (p> r) and U is a p x n matrix of independent errors. F is called the matrix of factor scores and B the matrix of factor loadings. We assume F F' = nI. If E(UU') =n<1> is known, then by applying the least squares principle, we can determine Band F so as to minimize the trace of (X - BF)'<1> -I (X - BF). Then B is obtained by taking the r eigenvectors of <1> -I XX' corresponding to the r largest eigenvalues. When <1> is diagonal but unknown, we can solve the simultaneous equa- tion for Band <1>, whose solutions are the matrix B with columns equal to eigenvectors of &-1 XX' and the diagonal matrix $ with elements equal to the diagonal elements of XX'ln-BB'. If we assume that U is normal, the proce- dure in the preceding paragraph for the case 
280H Multivariate Analysis when <1> is known is equivalent to the maxi- mum likelihood method. When <1> is unknown, we can assume further that the columns m F are also multivariate normal vectors distri- buted independently of U, which implie that the columns of X are also normal vectors with the covariance matrix}; = BB' + <1>. Band <1> are estimated from the sample covariance matrix, and the solutions of the simultaneous equation for Band <1> gives the maximum likelihood estimators. However, there is the so-called identification problem of determining whether for given}; and r the decomposition };=BB' + <1> is unique. This problem has not yet been completely settled. If the solution $ is posi- tive definite and HH' is positive semi definite, it is called proper. When <1> is not positive defi- nite, it is called a Haywood case, and when HH' is not positive semidefinite it is called a complex case. Sometimes iterative procedures lead to improper solutions. H. Canonical Correlation Analysis Canonical correlation analysis can also be used for descriptive purposes. Sample canon- ical variates have various descriptive implica- tions. Suppose that,,; Y and ; X are the first canonical variates corresponding to the largest canonical correlation pt/ 2 . Then pt/ 2 is the largest possible correlation between a linear function of X and a linear function of Y and is actually equal to the correlation of,,; Y and ; X. Similarly, the second canonical correla- tion is equal to the largest possible correla- tion between linear functions in X and in Y which are orthogonal to ; X and,,; Y, respec- tively, and so forth. As a second interpretation of the canonical variates, we consider the linear regression model Y=BX+U, where Y, B, U are q x n, q x p, q x n matrices, respectively, and rank B = r < q. Then there exist an r x p matrix A and a q x r matrix C satisfying C'};-lC = I such that B = CA, where E(UU')=n};. Putting T=AX, we get a re- gression model Y = CT + U, regarding T as a matrix of regressor variables. If }; is known, least squares considerations lead to minimiz- ing tr(Y - BX)'};-l (Y - BX) with the condition rank B = r. The resulting row vectors of A consist of the r eigenvectors corresponding to the r largest eigenvalues of the matrix S;) Sxy};-l SyX and column vectors of C consist of the r eigenvectors corresponding to the r largest eigenvalues of the matrix SyxS;; SXy};-I. If }; is replaced by Syy, T = AX and Z = C'S;yl Y are equal to the matrices of canonical variates. 1058 If we assume that U is normal, this proce- dure is equivalent to the maximum likelihood method It should be remarked that although the model here is not symmetric in X and Y, the results are symmetric in X and Y, and therefore they will be the same if the roles of X and Yare interchanged in this model. I. Linear Discriminants and Problems of Classification Let p x n i matrices Xi (i = 1, ... , k) be the set of observations for k distinct populations with a common covariance matrix. We determine a vector a such that T i = a'X i reveals the dif- ferences of the k populations as much as pos- sible, or, more precisely, so that the ratio of the sum of squares between classes of T to the sum of squares within classes is maximized. If the matrices of the sums of squares between and within classes are Qb and Qw, respectively, the ratio is equal to 1= a'Qbaja'Qwa, which is maximized when a is equal to the eigenvector of Q;;;l Qb corresponding to the largest eigenvalue. The linear function t = a'X is called the linear discriminant function. When k=2, a is given by a=Q;;;I(X 1 -X 2 ), where Xl and X 2 are sample mean vectors. When k > 2, we let A be the matrix formed by the r eigenvectors corresponding to the first r largest eigenvalues of Q;;;l Qb' and set Ti=AX i . From this we can construct the r- dimensional discriminant function. These functions can be used to locate the k popu- lations in r-dimensional space, and also to decide to which population a new observa- tion belongs. For the latter problem we can also construct k quadratic functions 5 i = (X-Xi),Q;;;I(X-X,), where Xi is the sample mean vector of the ith population, and X a new observation. Then we can decide whether X belongs to the population correspondmg to the minimum 5i' Such a method is called a classification procedure. J. Discrete Multivariate Analysis Let X ijk be an observed frequency on three characteristics, each belonging to l)kth class for 1 <:;: i <:;: I, 1 <:;:} <:;: J and 1 <:;: k <:;: K. Assume that Xijk is a sample from multinomial distri- bution having Pijk as a probability of occur- ence for an (i,j, k) cell, where P = L Pijk = I. The multinomial observation X ijk with proba- bility Pijk is called an I x J x K contingency table. If we further assume that log Pijk = f1 + lJ. i + f3 j + Yk + lJ. ij + f3 j k + hi + b ijk 
1059 with the restrictions on parameters Li lJ(i = Li 'l-ij = Lj 'l-ij = 0, similarly on {j's and y's and LiOijk = LjOijk = Lk Oijk = 0, we say that a saturated log-linear model is given. Here the number of observations and the number of parameters are equal, and no errors can be estImated The parameter Oijk is called the three-factor effect, or equivalently, the second- order interaction. Similarly, lJ(ij' #jk, hi are called the two-factor effects or the first-order interactions. Finally 'l-i' fi j , "h are called the main effects. A simple model is obtained when all the first- and second-order interactions vanish This is equivalent to the independence model: Pijk = Pi P j P k and the maximum like- lihood estimators for 11, IJ(;, fJ j , {k are obtained from Pijk = Xi X j'X kln3, where n = X .. A nontrivial model is obtained by putting a1l the second-order interactions equal to zero. The likelihood equations for this model are given by npij=X ij , npik=X ik , npjk=X jk , (4) Bartlett first described a solution of (4) when I = J = K = 2. For a 2 x 2 x 2 table, the solution can be expressed by Pijk = (Xijk I (})/n because of the constraints for Xij , Xi k, and X jk' Putting ,\ II = () for Pijk yields a cubic equation for 0: (XIII + O)(X l22 + 0)(X 221 + 0)(X 212 +0) =(X 121 -0)(X 1l2 - O)(X 21 I - O)(X 222 - ()). For a general I x J x K table, the equations (4) have a unique solution within the no-three- factor effect model if there exists qijk > 0 satisfy- ing (4). The unique solution maximizes the likelihood. To solve the likelihood equa- tions (4), standard iterative procedures, such as the Newton-Raphson method, can be ap- plied. However, the following iterative scaling method of Deming and Stephan is more useful: (3m+')_ (3m) X ij npijk - Pijk (3m) ' Pij (3m+2)_ (3m+I) npijk - Pijk (3m+!) , Pi k (3m+3)_ (3m+2) npijk - Pijk (3m+2)' P jk The first iteration adjusts p\t m ) by fitting pro- portionally with respect to k so that np\]m+l) is equal to Xij , and similarly for the second and third iterations. Starting with any initial values satisfying the first-order interaction model, the iterative scaling method (5) converges to the unique solution of (4) as m-4 'XJ K. Other Problems The sampling distributions associated with the procedures discussed here are usually very 280 Ref. Multivariate Analysis complicated, and often only asymptotic prop- erties are known ( 374 Sampling Distri- butIOns) Nonparametric rank analogs of many multivariate techniques for normal distribution are found in [18]. Robustness of the distributions of test statistics or of latent roots is investigated in [3,10,14]. Multivariate data analysis is found in [5]. References (5) [IJ T. W. Anderson, An introduction to multi- variate statistical analysis, Wiley, second edi- tion, 1984. [2J Y. M. M. Bishop, S. E. Flenberg, and P W. Holland, Discrete multivariate analysis, MIT Press, 1976. [3J Y. FUJikoshi, Asymptotic expansions for the distributions of the sample roots under nonnorma1ity, Biometrika, 67 (1980), 4551. [4J N. C. Giri, Multivariate statistical in- ference, Academic Press, 1977. [5J R. Gnanadesikan, Methods for statistical data analysis of multivariate observations, Wiley, 1977 [6J S. J. Haberman, The analysis of frequency data, Univ. of Chicago Press, 1974. [7J L. R. Haff, Empirical Bayes estimation of the multivariate normal covariance matrix, Ann. Statist., 8 (1980), 586597. [8J I. Hashimoto, Estimation for distribu- tions with monotone likelihood ratio: Case of vector-valued parameter, Commun. Statist., AIO(1981),1901-1913. [9J W. James and C. Stein, Estimation with quadratic loss, Proc. Fourth Berkeley Symp. Math. Statist. and Prob., 1 (1961), 361 379 [IOJ T. Kariya, Robustness of multivariate tests, Ann. Statist., 9 (1981), 1267-1275. [11J M. G. Kendall, Multivariate analysis, Griffin, 1975. [12J A. M. Kshirsagar, MuJtivariate analysis, Dekker, 1972 [13J D N Lawley and A. E. Maxwell, Factor analysis as a statistical method, Butterworth, 1963. [14J R. J. Muirhead and C. M. Waternaux, Asymptotic distributions in canonical correl- ation analysis and other multivariate proce- dures for nonnormal populations, Biometrika, 67 (1980), 31 43. [15J M Okamoto, Optimality of principal components, Multivariate Analysis II, P. R. Krishnaiah (ed.), Academic Press, 1968, 673 685. [16J M D Perlman, Unbiasedness of the likelihood ratio tests for equality of several covariance matrices and equality of several multivariate normal populations, Ann. Statist., 8 (1980), 24 7 263. 
280 Ref. Multivariate Analysis 1060 [17J M. D. Perlman and I. Olkin, Unbiased- ness of invariant tests for MANOV A and other multivariate problems, Ann. Statist., 8 (1980),1326-1341. [18J M. L. Puri and P. K. Sen, Nonparametric methods in multivariate analysis, Wiley, 1971. [19J C. R. Rao, Linear statistical inference and its applications, Wiley, 1973. [20J R. Schwartz, Admissible tests in multi- variate analysis of variance, Ann. Math. Sta- tist., 38 (1967), 698-710. [21J M. S. Srivastava and C. G. Khatri, An introduction to multivariate statistics, North- Holland,1979. [22J C. Stein, Estimation of the mean of a multivariate normal distribution, Ann. Statist., 9 (1981),1135-1151. 
281 A Network Flow Problems 281 (XIX.5) Network Flow Problems A. Introduction The network flow problem is a special kind of mathematical-programming problem ( 255 Linear Programming, 264 Mathematical Pro- gramming, 292 Nonlinear Programming), where the variables of the objective function and the constraints are all defined in terms of a graph ( 186 Graph Theory). Owing to its special structure, the mathematical properties of the network flow problem as well as the solution algorithms have been investigated in detail. Network flow problems have a variety of useful applications in fields such as trans- portation, scheduling, and resource alloca- tion, and in operations research in general, so that they now constitute an important class of mathematical programming. By the term "network" (Netzwerk in German, reseau in French, set' in Russian) we usually mean a graph with some physical attributes attached to its edges and vertices. B. Basic Form ofthe Problem Let G=(V,E,a+,a-) be a graph with vertex set V, edge set E, and incidence relations a +, C - : E -. V, and let R be the field of real num- bers. (Most of the statements in the following are valid if we take for R an ordered field or ordered additive group more general than the field of real numbers.) Furthermore, we regard the collection 2 of all functions : E -. R as a vector space of dimension lEI, denoting (eK) by K if E = {ej, ..., en}' Similarly, the collec- tion Q of all functions w: V -. R is a vector space of dimension I VI. If we define the map- ping bet: V-.2 E by b:tv={ela:te=v} ( 186 Graph Theory), a linear mapping a: 2-.Q is naturally introduced through the relation (G)(V)=LeEJ+V(e)-LeEo-V(e). A vector E2 is called a flow on G if a =0. The dual space 2* of 2 and the dual Q* of Q are identified with the collection of functions 1/: E-.R and that of (: V -.R, respectively, under the obvi- ous correspondence. Then the linear mapping b:Q*-.2*, which is contragredient to a, is defined by means of the relation (bO(e) = ((c+ e)-((i;- e). A vector I/E2*, which is the image of ( under 15, i.e., which satisfies 1/ = bC is called a tension on G, and ( is called the potential on G corresponding to the tension 1/. (Sometimes, (e) is called a flow in edge e, I/(e) a tension across e, and ((v) a potential at vertex v.) A continuous curve C ( c R x R) on the 1062 Euclidean plane R x R is said to be mono- tone if(X 1 -X 2 )(Y1 - Y2)?0 for any (x 1 ,Yd, (X 2 'Y2)EC. A monotone curve C is called a characteristic curve if its projection to each coordinate axis, i.e., Cx=={xl(x,Y)EC} and C y == {y I (x, Y)E C}, is a closed interval. For a characteristic curve C, two convex functions, ip(x) = S:o(Y- Yo)dx and ljJ(y) = S:o(x-xo)dy, are defined, where (xo, Yo) is a point fixed on C and the integrals are taken along (x, y) E C. (It is understood that ip(x) = 00 if xrjC x and ljJ(y) = 00 if yrjC y .) These two functions defined for a fixed C are conjugate to each other in FencheJ's sense, and they satisfy ')(x) + ;jJ(y}  (x - xo)(y - Yo) for any (x, y) E R x R, where the inequality reduced to an equality if and only if (X,y)EC ( 88 Convex Analysis, 292 Non- linear Programming). A network N in network flow theory is a graph G to each edge e K whose edge set E = {e l' ... ,en} is given a characteristic curve C K . On a network N, the following three problems, PI, PII, and PIlI, are defined. PI: Find a flow  that minimizes ct>()==Ll ipK(K) under the constraints K == (eK)E C; (K = 1, ..., n). PII: Find a tension 1/ which minimizes \}I(I/)== L=1 ipK(I/K) under the constraints I/K ==1/(eK)E C; (K= 1,..., n). PIlI: Find a pair (, 1/) of a flow  and a tension 1/ such that (K, I/K)E C' for all K = 1, .., ,n. (With respect to PI or PlI, a flow or a tension satisfying the "constraints" is ordinarily called a feasible flow or tension, respectively.) Then, as a special case of the (Karush-)Kuhn-Tucker theorem and the dual- ity theorem in nonlinear programming, we have the following theorems (A) and (B). (A) For a given network N, one and only one of the following four alternatives is the case: (i) There is a feasible flow in PI, and at the same time, there is a feasible tension in PlI. In this case, both PI and PlI have a solution, and, for any solution  of PI and any solution  of PlI, the pair (,) is a solution of PIlI; and, conversely, for any solution (,) of PIlI, the flow  is a solution of PI and the tension  is a solution of PlI. (ii) There is a feasible tension in PlI, whereas there is no feasible flow in PI. In this case, ct>() of PH is not bounded below for feasible tensions . (iii) There is a feasible tension in PH, whereas there is no feasible flow in PI. I n this case, \}I(I/) of PII is not bounded below for feasible tensions 1/. (iv) There is neither a feasible flow in PI nor a feasible tension in PII. (B) Let C; = [b" cKJ (b K  c K ; b K can be -00, and c" (0) and C;=[dKJKJ (d K can be -00, and i" (0). Then a necessary and sufficient condition for PI to have a feasible flow is that 
1063 LI e' - L2 b'?O for every cutset (cocycle or cocircuit) w of G, where the summation L I (resp. L2) is taken over all the edges e, (E E) that lie in w in the positive (resp. negative) direction. Similarly, a necessary and sufficient condition for PH to have a feasible tension is that Ll f,- L2d,?0 for every tieset (cycle Or circuit) 0 of G, where L I (resp. L2) is taken over all the edges e, that lie in (] in the posi- tive (resp. negative) direction. C. Shortest Paths, Maximum Flows, and Minimum-Cost Flows We choose one of the edges of G, say e I' as the reference edge and assign it the parametric characteristic curve C 1 (IX) = {(x, y) I y= x + IX}. Then if there is a feasible flow on the network obtained from G by contracting (i.e., short- circuiting) e I, and if at the same time there is a feasible tension on the network obtained from G by deleting (i.e., open-circuiting) el' then problem PIlI has a solution ((IX),(IX») for every real IX, and I(IX) and 1(1X) are uniquely determined for each IX. The problem of deter- mining these parametric solutions is the two- terminal problem for the two-terminal network N I (which is obtained from G by deleting e t ) with the vertex a- e 1 as the entrance (or source) and the vertex a+ el as the exit (or sink). The curve C = {(l (IX), I (IX») IIXE R} enjoys the properties of a characteristic curve, and is called the two-terminal characteristic of N I with respect to the entrance a - e I and exit a + e l' A two-terminal network for which only the projections to the x-axis C; = [b', e'] of the edge characteristics are specified (K = 2, ..., n) is called a capacitated network, and the maximum-flow problem for a capacitated network N 1 can be mathematica1\y formulated as the problem of determining the projection to the x-axis Cx=[b,e] of the two-terminal characteristics of N I . For the maximum-flow problem, the relation e = min {L'1 e' - L b'} holds, where the minimum is taken over an the cutsets that contain e I in the negative di- rection and where L'l (resp. L) denotes the summation over all the edges, except e{. lying in a cutset in the positive (resp. negative) direc- tion. This relation is caned the maximum-flow minimum-cut theorem. (A similar relation holds also for b.) Similarly, or dua1\y, the problem of deter- mining the y-projection C y = [d.JJ of the two- terminal characteristic of a two-terminal net- work N I for which the y-projections C; = [d,.J'] of the characteristics are specified to the nonreference edges e, (K = 2, ... ,n) is a network flow formulation of the shortest-path problem. For the shortest-path problem, the 281 D Network Flow Problems relation f =min{L'1 h- L2 d K }, called the maximum-separation minimum-distance theo- rem, holds, where the minimum is taken over all the tiesets that contain e I in the positive direction and where L'I (resp. L2) denotes the summation over all the edges, except e I' lying in a tie set in the positive (resp. nega- tive) direction. The minimum-cost flow problem is to deter- mine the two-terminal characteristic of N I w hen all the C (K of- 1) are of staircase form. A number of algorithms exist for time com- plexity ( 71 Complexity of Computations) 0(1 V13) for the shortest-path problem [4]; the algorithm proposed by E. W. Dijkstra [7J for the case d,  0 fK for all K( of- 1) is of complex- ity 0(1 VI 2 ). The algorithm of A. V. Karzanov [8,9] for the maximum-flow problem is of complexity 0(1 vj3). The minimum-cost flow problem can be solved by alternately solving the subproblems of the shortest-path type and those of the maximum-flow type, but no al- gorithm of time complexity polynomial in lEI and I VI has been found. D. Transportation and Scheduling Let us make the edges of a graph G = (V, E, a + ,(1-) correspond to the transportation routes and the flow to the stream of a com- modity; impose a capacity constraint of the form 0 '  e K on the flow K in each edge e" and assume a cost function of the form ip,(C) =f,' for each edge. (e' is a constant called the capacity of edge e" and f, is a constant caned the unit cost of edge e,.) Furthermore, let us specify a subset VI (c V) of vertices as the set of entrances and another subset V 2 ( c V) as the set of exits, where VI n V 2 = 0, and prescribe the amount of inflow q(v) to each entrance VE VI and the amount of outflow q(v) from each exit vE V 2 , where we must have LVEv, q(v)= L VE V 2 q(v). Planning a trans- portation plan that satisfies all the above- prescribed conditions and that minimizes the total cost L'EE ipK(') can be reduced to finding a minimum-cost flow on the extended network G = (v, E, a +,15 -), defined as follows, such that the flow in the reference edge is to be maxi- mum: 17= VU{s}U{t} (s,tV), E=EUE 1 U E 2 U {eo} (E 1 nE= E 2 n E= 0, eoEU EI U E 2 ; eo is the reference edge, 15 + eo = t, a - eo = s; E 1 = {el15+e=s, jj-e=v, VE Vr}, C(e)={(O,y)\ yO} U {(x, 0) I O x  q(15- e)} U {(q(v), y)1 Oy}, where 15-e=vE VI; E 2 ={el15+e=v, 15-e=t,vE V 2 }, C(e)={(O,Y)lyO}U {(x,O)IO xq(v)} U{(q(v),y)IOy}, where a+e=vE V 2 ), 3 ic I E= a", C(e,) = {(O, y) IYf,} U {(x,.f,) I Oxe'}U{(c"'y)ly?f,} for e,EE. For the project-scheduling problem with an 
281 E Network Flow Problems acyclic graph G = (V, E, a +, a -) as the arrow diagram for which the start node SE V and the completion node are specified, we make an edge erE E) correspond to a job (or activity), and a vertex correspond to the event that all the jobs corresponding to b - v have been finished, whereas those corresponding to b + v are ready to commence. Then we interpret the negative of the tension of an edge as the time (i.e., duration) spent on the correspond- ing job and the potential at a vertex as the time (i.e., instant) of the corresponding event taking place. Furthermore, we assume that to each edge e K (or the corresponding job) are given the normal duration - d K ( > 0), the crash duration - fK(>O, < -d K ), and the unit cost cK(>O) we have to pay in order to decrease the job time by one unit, where the job time necessarily lies between the normal and the crash duration. Under the above-listed speci- fications, we consider the relation between the total project time (i.e., the duration from the event corresponding to the start node to that corresponding to the completion node) and the extra cost to be paid for decreasing the project time below the normal one. This prob- lem can be reduced to the two-terminal char- acteristic problem for the following network G: G is obtained from G by adding the refer- ence edge eo (a+ eo = t, a- eo=s); C = {(O,Y)I ydK} u {(x, dK)IO x c K } U {(c", y) I d K y fK} U {(X.JK) I c K  x}. E. Applications to Combinatorial Optimization Many problems in combinatorial optimization can be reduced to network flow problems. The problem of finding a maximum matching on a bipartite graph G ( 186 Graph Theory) is reduced to the maximum-flow problem for the graph representing the transportation prob- lem on G with one of the two vertex sets as the entrance set and the other as the exit set, where all the edges have a unit capacity and the amount of inflow/outflow to/from each entrance/exit vertex is equal to unity (the cost being irrelevant). (The existence of an integer solution to this kind of maximum-flow prob- lem is proved constructively on the basis of the solution algorithm.) The maximum-flow minimum-cut theorem in this particular case can be stated as follows: The maximum car- dinality of matchings on a bipartite graph is equal to the minimum cardinality of ver- tex subsets which cover all the edges (this is known as the Konig-Egervary theorem). The Dilworth theorem for a partially ordered set, the criterion for the existence of a graph with 1064 prescribed vertex degrees, etc., are known to be reducible to network flow problems [2]. F. Generalizations The network flow problem may be general- ized in various directions. Replacing a graph by a matroid ( 66 Combinatorics) or con- sidering stronger conditions on the feasibility of flows and tensions are natural extensions [4,6,10,11]. Another extension is to con- sider several kinds of flow, instead of a single kind, that simultaneously affect the capacities of edges. This latter problem is called the multicommodity flow problem in contrast to the single-commodity flow problem that was treated above [5]. It can be said that any ex- tension of the network flow problem aims at a mathematical model that has wider appli- cation without losing the advantage of having simple effective solution algorithms. References [1] G. 1. Minty, Monotone networks, Proc. Roy. Soc. London, A257 (1960), 194-212. [2] L. R. Ford, Jr., and D. R. Fulkerson, Flows in networks, Princeton Univ. Press, 1962. [3] C. Berge and A. Ghouila-Houri, Pro- grammes, jeux et reseaux de transport, Dunod, 1962. [4] M. Iri, Network flow, transportation and scheduling- Theory and algorithms, Aca- demic Press, 1969. [5] T. C. Hu, Integer programming and net- work flows, Addison-Wesley, 1969. [6] E. L. Lawler, Combinatorial optimization -Networks and matroids, Holt, Rinehart and Winston, 1976. [7] E. W. Dijkstra, A note on two problems in connexion with graphs, Numerische Math., 1 (1959),269-271. [8] E. A. Dinits, Algorithm solutions to maximum network flow problems by power analysis (in Russian), Dokl. Akad. Nauk SSSR, 194 (1970), 754-757. [9] A. V. Karzanov, Calculation of maximum network flow by power methods (in Russian), Dokl. Akad. Nauk SSSR, 215 (1974), 49-52. [10] M. Iri and S. Fujishige, Use of matroids in operations research, circuits and systems theory, Intern. 1. Syst. Sci., 12 (1981),27-54. [11] C. P. Bruter, Elements de la theorie des matroi'des, Lecture notes in math. 387, Springer, 1974. 
1065 282 (XX.17) Networks A. Linear Graphs (186 Graph Theory) A linear graph (or simply graph) is an object composed of (i) a finite set {B K } (K= 1, ...,n) of elements called branches (or edges), (ii) a finite set {N a } (a = 1, ..., m) of elements called nodes (or vertices), and (iii) an incidence rela- tion between branches and nodes represented by a function [B K : NaJ from {B K } x {N a } to {O, 1, -I} such that for every K, there exist exactly one N a with [B K : NaJ = 1 and exactly one N a with [B K : NaJ = -1, with all the other [B K : NaJ equal to 0. (Intuitively, a branch B K starts from the node N a with [B K : NaJ = 1 and ends at the node N b with [B K : NbJ = -1.) In terms of topology, a (linear) graph is a 1- dimensional finite tsimplicial complex ( 70 Complexes, 186 Graph Theory). A network in the wide sense is a linear graph whose branches and nodes are endowed with some physical properties. B. Networks A contact network, one of the simplest kinds of networks, is an abstraction of a circuit whose branches correspond to contact points of relays and switches that are allowed to take a finite number of physical states, e.g., the two states "on" and "off." The theory of contact networks is developed by means of tBoolean algebra and is applied to switching networks, such as telephone exchange networks and the logical networks of digital computers. In most cases, to the branches B K of a net- work two kinds of real quantities i K and E K are assigned (which may be variables or functions of time) satisfying the conditions: n (i) I [BK:NaJiK=O (a= 1, ...,m); K=l (ii) there exist Ea such that m I [BK:NaJEa=E K (K=I,...,n). a=l In the case of electric networks, where i., E.. and Ea are the current in branch B., the vol- tage across branch B., and the potential at node N a , respectively, these conditions are known as Kirchhoff's laws. The network flow problem, which has a number of practically important applications in toperations research (e.g., transportation problems, project-scheduling problems) and is a special case of tmathematical programming, 282 C Networks can be formulated as the problem of minimiz- ing L:;'=dK(iK) under condition (i) (or minimiz- ing LdK(EK) under condition (ii)), where for each branch B., fK is a given tconvex function defined on a given interval [a., bKJ. C. Electric Networks Since there has been a great deal of research on electric networks, "network" often means "electric network." We call a branch in which the current is a given function of time a cur- rent source, and one across which the voltage is a given function of time a voltage source. A branch that is either a current source or a voltage source is called a source branch. A network with M source branches is called an M-port network. If the currents i K in and the voltages E K across the non-source branches (n' in number) are related by n' EK=IzK;)A (K=I,...,n') A1 or n' i K = I YKAEA (K=l,...,n'), A1 where the Zd or Yd are linear tintegrodifferen- tial operators, the network is said to be linear. If ZAK =Zd or YKA = YAK' it is said to be recipro- calor bilateral; if the Zd or Yd are invariant under the change of the origin of time, it is said to be timeinvariant; and if a linear time- invariant network satisfies the condition r ro s iJr)EJr) d,  0 for every t and for every choice of functions of time for i K or E K associated with the source branches B K in S, provided that the current- voltage relations for non-source branches are satisfied, then the network is said to be passive. Under certain nonsingularity conditions, for the currents in and the voltages across the source branches (denoted by I K and e K ) of a linear M-port network, we have the relations e K = I ZdIA' B;..ES I K = I Yde A (BKES), B;..ES where the matrices Zd and Y d of linear inte- grodifferential operators are called the port- impedance matrix and the port-admittance matrix of the network, respectively. Analysis determines ZKA' Y KA from a given linear graph and given Zd' Yd, while synthesis finds a net- work (i.e., Zd or YKA' as wen as a linear graph) when part of Zd' Y d , or some relations to be satisfied by them are given. (In synthesis, the Zd or YK; are usually confined to some special class.) In analysis as well as synthesis we usu- 
282 Ref. Networks ally deal with the tLaplace transforms Zd(S), Yd(S), ZK'(S), 1:,(s) instead of Zd' Yd, Zd' Y K ) themselves, where the characteristics Zd(S), YK,(s) of a network are determined by the topological properties of its linear graph and the properties of Zd(S), YK).(S) as functions of the complex variable s. (If w is the angular frequency, S = iw.) The following fundamental facts are known in regard to analysis and synthesis: (1) If ZK'(S), 1:,(s) are rational functions of S holomorphic on the open right half-plane, a necessary and sufficient condition' for a net- work to be passive is that for arbitrary real numbers " L K,Zd(S) or BI(.B).ES L K' YK)(s) BK.B).ES is a positive real function of s, i.e., a function whose value is real when S is real and lies on the right half-plane when S lies on the right half-plane. (2) Every passive one-port network can be synthesized by using a finite number of three kinds of branches, i.e., positive resistors (E K = R K i" R K > 0), positive capacitors (iK = CKdEK/dt, C K > 0), and positive inductors (EK=LKdiK/dt, LK>O). (3) Every passive M-port network (M?2) can be synthesized by using, in addition to the three kinds of branches mentioned in (2), ideal transformers (an ideal transformer is a pair of branches (B" B,) such that i K = ni" E, = nE" and n=real number) and ideal gyrators (an ideal gyrator is a pair of branches (B" B ,) such that i K = E)., i, = - E K ); ideal gyrators are not needed, however, to synthesize a reciprocal network. However, very little is known about the synthesis of passive M -port networks without using ideal transformers and gyrators. Topo- logical methods are expected to be powerful for such synthesis problems. The linear graph structure of a network loses its significance if ideal transformers are admitted. References [IJ K. Kondo (ed.), RAAG memoirs of the unifying study of basic problems in engineer- ing and the physical sciences by means of geometry, Association for Science Documents Information, Tokyo, 1,1955; 11,1958; III, 1962. [2J W. Cauer, Synthesis of linear communica- tion networks, McGraw-Hill, second edition, 1958. (Original in German, 1941.) [3J 1. E. Storyr, Passive network synthesis, McGraw-Hill, 1957. [4J L. Weinberg, Network analysis and syn- thesis, McGra w- Hill, 1962. 1066 [5J O. Brune, Synthesis of a finite two- terminal network whose driving-point im- pedance is a prescribed function of frequency, 1. Math. Phys., 10 (1931),191-236. [6J Y. Oono and K. Yasuura, Synthesis of finite passive 2n-terminal networks with pre- scribed scattering matrices, Mem. Fac. Eng. Kyushu Univ., 14 (1954), 125-177. [7J S. Seshu and M. B. Reed, Linear graphs and electrical networks, Addison-Wesley, 1961. [8J W. H. Kim and R. T. Chien, Topological analysis and synthesis of communication net- works, Columbia Univ. Press, 1962. [9J C. Berge and A. Ghouila-Houri, Program- ming, games and transportation networks, Methuen, 1965. (Original in French, 1962.) [tOJ L. R. Ford, Jr., and D. R. Fulkerson, Flows in networks, Princeton Univ. Press, 1962. [11J R. W. Newcomb, Linear multi port syn- thesis, McGraw-Hill, 1966. 283 (XXI.37) Newton, Isaac Sir Isaac Newton (December 25, 1642-March 20, 1727), the English mathematician and physicist, was born into a family of farmers in W oolsthorpe, Lincolnshire. In 1661, he entered Cambridge University, where he was greatly influenced by the professor who was teaching geometry, I. Barrow, and where he began research in Kepler's optics and Descartes's geometry. In 1665, he discovered the tbinomial theo- rem, and in the same year, during a stay at his birthplace to escape the plague, he began work on his three great discoveries-the spec- tral decomposition of light, the universal law of gravity, and differential and integral cal- culus. He returned to Cambridge University in 1667, and in the following year invented the reflecting telescope and proposed his theory of light particles. During this period, he suc- ceeded Barrow as professor and lectured on optics. At the same time, he probed deeper into the calculus. Guided by Barrow's insight that differentiation and integration were in- verse operations and also by his own research on infinite series, Newton obtained the tfunda- mental theorem of calculus. Leibniz obtained the same theorem a little later, and a struggle resulted between the two over priority. The two discoveries were independent, but because Leibniz's notation was superior, the later development of calculus owes more to him. The dynamic elucidation of the heliocentric theory was accomplished in Newton's main 
1067 work, Principia mathematica philosophiae natura lis (1686-1687), in which Kepler's law on the movement of the planets, Galileo's theory of movement, and Huygens's theory of oscillation were unified into the three laws of Newtonian dynamics. These natural laws, which deal with all dynamic phenomena in the universe. are the most superlative realization of Descartes's concept of exploring the mathe- matical structure of nature; they had an es- sential influence on the later development of the natural sciences. The style of writing is similar to that in Euclid's Stoicheia. In the Principia, Newton also sets forth his philo- sophical position. In 1695, Newton moved to London and became engrossed in theology. He was ap- pointed Master of the Mint and was presi- dent of the Royal Society from 1703 until his death. While he is sometimes said to have divorced himself from science, many of his notes on geometry date from this time. References [lJ D. T. Whiteside (ed.), Sir Isaac Newton, Mathematical works, Johnson Reprint, I, 1964; II, 1969. [2J D. T. Whiteside (ed.), Sir Isaac Newton, Mathematical papers, Cambridge Univ. Press, 1,1967; 11,1968; III, 1969; IV, 1971; V, 1972: VI, 1974; VII, 1976; VIII, 1981. [3J I. Newton, Philosophiae naturalis prin- cipia mathematica, London, 1687; English translation, Mathematical principles of natural philosophy, trans. by A. Motte in 1729, Univ. of California Press, 1934. [4J D. Brewster, Memoirs of the life, writings, and discoveries of Sir Isaac Newton, Con- stable, 1855. 284 (II1.12) Noetherian Rings A. General Remarks In this article, we mean by ring a commutative ring with unity element. Thus a Noetherian ring is a commutative ring with unity element that satisfies the tmaximum condition for its ideals; if it is also an tintegral domain, then it is called a Noetherian integral domain or Noetherian domain (for right and left Noe- therian rings  368 Rings F). A ring is Noetherian if and only if every prime ideal of the ring has a tfinite basis (Cohen's theorem). A ring is Noetherian if it is 284 B Noetherian Rings generated by a finite number of elements over a Noetherian ring (Hilbert's basis theorem). The following three conditions for a ring R are equivalent: (i) R is an Artinian ring, that is, it satisfies the tminimum condition for its ideals (for right and left Artinian rings  368 Rings F). (ii) R is a Noetherian ring and every prime ideal of R is a maximal ideal. (iii) There exist a finite number of Noetherian rings Rj(i = 1, .., ,n) whose tmaximal ideals are tnilpotent such that R is the direct sum of the Rj(i = 1, ..., n). We say that the restricted minimum condition holds in a ring R if R/a is an Artinian ring for every nonzero ideal a of R; the latter condition is satisfied if and only if R is either an Artinian ring or a Noetherian domain of tKrull dimension 1. Every ideal of a Noetherian ring R can be expressed as the intersection of a finite number of tprimary ideals. Given a ring R and an R-module M, a submodule P of M is said to be a primary submodule of M if every element a of R that is a zero divisor with respect to M/P (i.e., there exists an mEM/P such that m#O and am=O) is nilpotent with respect to M/P (i.e., there exists a natural number n such that an(M/P) =0). The property of ideals in Noetherian rings stated above can be generalized to the case of Noetherian modules: If an R-module M is a tNoetherian module, then every submodule of M can be expressed as the intersection of a finite number of primary submodules. Let R be a Noetherian ring, a an ideal of R, M a finite R-module, and Nand N' submod- ules of M. Then we have (i) the Artin-Rees lemma: There exists a natural number l' such that for all n>r, anN n N' =a n -,. (arN n N'). (ii) Krull's intersection theorem: n;;<',1 an M = {mE M 13aEa such that (l-a)m=O} (hence, in particular, if m is the tJ aco bson radical or R, then n:=1 mnM={O}. (iii) Krull's altitude theorem: If a is generated by s elements and p is a tminimal prime divisor of a, then the height of p  s. B. Topology Defined by an Ideal Let R be a ring, a an ideal of R, and M an R- module. Then the a-adic topology of Mis defined to be the topology on M such that {an M I n = 1,2, ... } is a tbase for the neighbor- hood system of zero. In particular, let R be Noetherian, M a finite R-module, and N a submodule of M. Then by the Artin-Rees lemma, the a-adic topology of N coincides with the topology on N as a subspace of M with the a-adic topology. Returning to the general case, M is a tTo-space (under the a- adic topology) if and only if n:1 anM = {O}, 
284C Noetherian Rings or, in other words, if and only if M is a tmetric space, where the tdistance d(a, b) between points a, b in M is defined to be inf {2 -n I a- bEanM}. Moreover, if N is a submodule of M, then MIN is a To-space if and only if N is a closed subset of M, that is, if and only if ndN +anM)=N. A sequence {a n }= (aI' a z , ..., an' ...) is called a Cauchy sequence (under the a-adic topology) ifVn3NVrVs (with each of them a natural number) a N + r -aN+sEanM; for this it is sufficient that Vn3NVr a N + r +1 -aN+rEanM. If this sequence converges to zero (i.e., Vn3NVr aN+rEanM), it is called a null sequence. The set \Ill of an Cauchy sequences in M becomes an R-module if we define their sum and multiplication by an element of R by {an} + {b n } = {an + b n }, c{ an} = {can}' Then the set \J! of all null sequences is a submodule ofID/. An element m of Mis identified with the sequence (m, m, "', m, ...), and we regard M as a submodule of \Ill. Then the R-module M = \IIlI\J! is the tcompletion of MI(nnanM) (as a metric space under the a- adic topology). M is called the a-adic com- pletion of M. If a has a finite basis, then the topology of M (as the completion) coincides with its a-adic topology. If M = R, we define a multiplication in \Ill by {an} {b n } = {anb n }. In this case, \Ill is a ring in which \J! is an ideal, and hence the completion R = M is a ring. If a = 1:1 aiR, then considering the tring of formal power series R = R [[XI' "', x,]] and its ideal n= nd1:dai- x;)R + anR), we have RRln. C. Zariski Rings For a Noetherian ring R and an ideal a of R, every element b of R such that I-bEa has an inverse in R if and only if every ideal of R is a closed subset of R under the a-adic topology. When this condition is satisfied, the ring R with the a-adic topology is called a Zariski ring; we often express this by saying that (R, a) is a Zariski ring. A Zariski ring is called com- plete if it is a complete topological space. The completion R of the Zariski ring (R, a) has the aR-adic topology, and (R, aR) is a Zariski ring. Furthermore, (i) R is a tfaithfully flat R- module; and (ii) when N is a submodule of a finite R-module M, then N is a closed sub- space of M (under their a-adic topologies), and their completions are identified with N @ RR, M @ RR. D. Local Rings Suppose that R is a Noetherian ring having only a finite number of maximal ideals and J is 1068 the Jacobson radical of R. Then the Zariski ring (R, J) is caned a semilocal ring. Further- more, if R has only one maximal ideal, then (R,J) is caned a local ring. A ring that has only a finite number of maximal ideals is caned a quasisemilocal ring; if it has only one maximal ideal, it is caned a quasilocal ring. (In some literature, the terms local ring and semi/ocal ring are used under weaker conditions; in the weakest case, quasisemilocal rings and quasi- local rings are simply called semilocal rings and local rings, respectively, and local rings and semilocal rings in our sense are caned Noetherian local rings and Noetherian semi- local rings, respectively.) Assume that R is a semilocal ring with max- imal ideals m 1 , "', m n and the Jacobson rad- ical J =m 1 n... n mn' For every finite R- module M, we introduce the J-adic topology as its natural topology. The completion R of R is a semilocal ring with maximal ideals m 1 R, "', mnR and is naturally isomorphic to the direct sum of the completions of the local rings Rm (i = 1, ..., n). Since R is a Zariski ring, (1) R is fithfuny flat; (2) submodules of Mare closed subsets of M; and (3) the completion of M is identified with M @ RR. If (R, m) is a complete local ring (i.e., a local ring and a complete Zariski ring at the same time), then R contains a subring I with the fonowing properties: (i) I is a complete local ring, and II(m n I) = Rim; and (ii) for the tcharacteristic p of Rim (p is either zero or a prime number), m n I = pl. Therefore, if m is generated by n elements, then R is a homomorphic image of the ring of formal power series in n variables over I. This theorem is caned the structure theorem of complete local rings, and I is caned a coefficient ring of R. If R contains a field, then I is a field, called a coefficient field of R. A complete local ring is a tHensel ring. When (R, m) is a local ring and 1:i1 xiR is m-primary, then we have an inequality r?(tKrun dimension of R); if the equality holds, then we say that XI' ..., x, form a sys- tem of parameters of R. Furthermore, if m = 1:1 xiR, then we say that XI, "', X r form a regular system of parameters of R. A local ring that has a regular system of parameters is called a regular local ring (cf. tJacobian crite- rion). A regular local ring is a tunique factori- zation domain. Let d be the Krull dimension of a local ring (R, m). Then R is a regular local ring if and only if one of the following holds: (1) every R-module has finite thomological dimension, (2) every R-module has homolog- ical dimension of at most d, or (3) the homo- logical dimension of Rim (as an R-module) is finite (and actually coincides with d). A Noe- therian ring R' is called a regular ring if R.is 
1069 a regular local ring for every prime ideal p'. A regular local ring is a regular ring. Consider a local ring (R, m) and an m- primary ideal q. Then the length l(n) of Rlq" (as an R-module) is a function of n. For a sufficiently large n, the length l(n) can be ex- pressed as a polynomial f(n) in n with rational coefficients. The degree of f(n) coincides with the Krull dimension d of R. The multiplicity of q is (coefficient ofn d inf(n)) x (d!).1f Xj, ...,Xd form a system of parameters, then (multi- plicity of L11 XiR) (length of RIL XiR); if the eq uality holds, then we call XI, . .. , X d a distinct system of parameters. A local ring that has a distinct system of parameters is called a Macaulay local ring. A local ring is a Macaulay local ring if and only if one of the following holds: (1) every system of parameters is a distinct system of parameters, or (2) if an ideal a of height s is generated by s elements, then every tprime divisor of a is of height s. A regular local ring is a Macaulay local ring. The notion of multiplicity can be also de- fined in general Noetherian rings [4]. Let R be a Noetherian ring. If Rm is a Macaulay local ring for every maximal ideal m, then R is called a locally Macaulay ring. Furthermore, if height m = Krull dim R for every maximal ideal m, then R is called a Macaulay ring. If R is a locally Macaulay ring, then the poly- nomial ring in a finite number of variables over R is also a locally Macaulay ring. In general, an ideal a is called an unmixed ideal (or pure ideal) if the height of every prime divisor of a coincides with height a; otherwise, a is called a mixed ideal. Thus if R is a locally Macaulay ring, an ideal a of the polynomial ring R [x" .., ,xnJ over R is generated by r elements, and height a = r, then a is unmixed (unmixedness theorem). If the completion of a local ring R is a tnor- mal ring, then we say that R is analytically normal. If the completion of a semilocal ring R has no nilpotent element except zero, then we say that R is analytically unramified. A semi- local integral domain R which is a ring of quotients of a finitely generated ring over a field is analytically unramified. If R is a normal local ring, then R is analytically normal (0. Zariski). Let (R, m) be a local ring, and let q be an m- primary ideal. Set F i = qilqi+' (i = 0,1,2, ..., qO=R). Let a=a' (modqi+l)EFi and b=b' (modqj+1)EFj. We put ab=a'b' (mod q i+ j +1)E Fi+j. Then the direct sum of modules F= Lo F i becomes a graded ring generated by F, over Fo, in which F i is the module of homo- geneous elements of degree i. F, called the form ring (or associated graded ring) of R with respect to q, plays an important role in the 284G Noetherian Rings theory of local rings, particularly in the theory of multiplicity. E. Chains of Prime Ideals Let R be a Noetherian ring with prime ideals p, q such that pc q. Consider the length n of a chain of prime ideals p = Po  PI  ...  Pn= q which cannot be refined any more. It is not true in general that n is uniquely deter- mined by p and q (M. Nagata). However, n is uniquely determined for a rather large class of Noetherian rings, for instance the rings that are homomorphic images of locally Macaulay rings and, in particular, finitely generated rings over a tDedekind domain. F. Integral Closures Let R be a Noetherian integral domain with the field of quotients k, let K be a finite al- gebraic extension of k, and let R be the tin- tegral closure of R in K. Then (i) If R is of Krull dimension 1, then for an arbitrary ring R' such that R c R' c K and for every nonzero ideal a' of R', the quotient R'/a' is a finite R/( a' n R)-module. In particular, R' is a Noe- therian domain satisfying the restricted mini- mum condition. (ii) If R is of Krull dimension 2, then R is Noetherian. (iii) In general, R is a tKrull ring, and for an arbitrary prime ideal p of R there are only a finite number of prime ideals 13 of R such that p = 13 n R. For any of such 13, the field of quotients of Rip is a finite algebraic extension of that of Rip. Result (i) is called the Krull-Akizuki theorem. We say that R satisfies the finiteness condition for integral extensions if R is a finite R-module for any choice of K. A Noetherian ring R is called a pseudogeometric ring, or a universally Japa- nese ring, if Rip satisfies the finiteness con- dition for integral extensions for every prime ideal p. A ring is pseudogeometric if it is gen- erated by a finite number of elements over a pseudogeometric ring. G. History J. W. R. Dedekind first introduced the concept of ideals in the theory of integers. The main objects studied in ring theory were subrings of number fields or function fields until M. Sono (Mem. Call. Sri. Univ. Kyoto, 2 (1917),3 (1918- 1919)) originated an abstract study of Dede- kind domains, which was followed by E. Noether (Math. Ann., 83 (1921),96 (1926)), who originated the theory of Noetherian rings. 
284 Ref. Noetherian Rings W. Krull made further important contri- butions to the development of the theory of Noetherian rings and general commutative rings [1]. Many other authors, including E. Artin, Y. Akizuki, and S. Mori, also contri- buted to the theory. The theory of local rings was originated by Krull (J. Reine Angew. Math., 179 (1938)) and developed by C. Che- valley (Ann. Math., 44 (1943)), I. S. Cohen (Trans. Amer. Math. Soc., 59 (1946)), and Zari- ski (Ann. Inst. Fourier, 2 (1950)), and later by many authors, including P. Samuel, Nagata, M. Auslander, D. A. Buchsbaum, and l.-P. Serre [4]. The theory of Noetherian rings is applied to algebraic geometry. References [1 J W. Krull, Idealtheorie, Erg. Math., Springer, second edition, 1968. [2J B. L. van der Waerden, Algebra I, II, Springer, 1966-1967. [3J O. Zariski and P. Samuel, Commutative algebra I, II, Van Nostrand, 1958-1960. [4J M. Nagata, Local rings, Wiley, 1962 (Krieger, 1975). [5J N. Bourbaki, Elements de mathematique, Algebre, ch. 1,8, Actualites Sci. Ind., 1144b, 1261a, Hermann, 1964, 1958. [6J N. Bourbaki, Elements de mathematique, Algebre commutative, ch. 1-7, Actualites Sci. Ind., Hermann; ch. 1,2, 1290a, 1961; ch. 3,4, 1293a, 1967; ch. 5,6,1308,1964; ch. 7,1314, 1965. [7J J.-P. Serre, Algebre locale, multiplicites, Lecture notes in math. 11, Springer, 1965. [8J D. G. Northcott, Lessons on rings, mod- ules and multiplicities, Cambridge Univ. Press, 1968. [9J H. Matsumura, Commutative algebra, Benjamin, 1970; second edition, 1980. 285 (VI.16) Non-Euclidean Geometry A. History The validity of the fifth postulate of Euclid's Elements, the taxiom of parallels, has been a subject of argument ever since it was for- mulated ( 139 Euclidean Geometry). At the beginning of the 18th century, G. Saccheri tried to prove the postulate by assuming the validity of other axioms. Under the hypothesis that the axiom does not hold, he deduced various extraordinary results. Although he \070 was mistaken in thinking that he had ob- tained a contradiction, his work is regarded as a forerunner to the study of non-Euclidean geometry. At the beginning of the 19th century, N. 1. Lobachevskii and J. Bolyai opened up the impasse by establishing a geometry based on postulates that contradict the fifth postulate. This geometry is called hyperbolic geometry or Lobachevskii's non-Euclidean geometry. Actu- ally, a similar idea had been conceived by C. F. Gauss, but he refrained from publishing it because of likely misunderstanding by a public still strongly influenced by I. Kant's philoso- phy. On the other hand, B. Riemann con- structed so-called elliptic geometry (or Rie- mann's non-Euclidean geometry), which is different from both Euclidean and hyperbolic geometry. Euclidean geometry (including the theory of similarity) is sometimes called para- bolic geometry. In general, a space satisfying axioms that contradict Euclid's postulates is called a non-Euclidean space. Around the turn of the 20th century, A. Cayley, F. Klein, and H. Poincare constructed models of non-Euclidean spaces that are sub- sets of Euclidean spaces, and E. Beltrami con- structed a differential geometric model. By means of these models, it was established that non-Euclidean geometries are consistent as long as there is no inconsistency in the under- lying Euclidean geometries. On the other hand, D. Hilbert established a complete system of axioms for Euclidean geometry and showed, by constructing non-Euclidean models, that the axiom of parallels is independent of the other axioms ( 155 Foundations of Geom- etry). The logical foundation of non-Euclidean geometries was thus clarified. Moreover, A Einstein showed in his ttheory of relativity that actual space-time does not satisfy Eu- clidean axioms. Together with Euclidean spaces, non-Euclidean spaces are often used as fundamental models both in the problem of tspace forms and in the theory of tsymmetric spaces. B. Axiomatic Considerations Hilbert's system of axioms of plane Euclidean geometry consists of axioms of incidence, order, congruence, parallels, and continuity ( 155 Foundations of Geometry). Specifically, the axiom of parallels is stated as follows: Suppose we are given a straight line and a point in a plane. If the straight line does not contain the point, then there exists only one line through the point that does not intersect the line. 
1071 The system of axioms of hyperbolic geome- try is obtained by replacing the axiom of par- allels by the following: Suppose we are given a straight line and a point in a plane. If the line does not contain the point, then there exist at least two lines that pass through the point without intersecting the line. (The other four groups of axioms are unaltered.) In this case, if [ is a given line that does not contain a given point C, then there exist exactly two lines parallel to [ that pass through C. We denote them by XYand X'Y', and they are character- ized as follows (Fig. 1): Any line that passes through C and lies in LX'CYnecessari1y intersects [; by contrast, neither the two lines XY, X'Y' nor any line in LXCX' intersects [. Euclidean geometry can be considered as a "limit" of this geometry where the lines X Y and X' Y' coincide. . X' Y Fig. 1 In el1iptic geometry, the axiom of parallels is replaced by the following: Suppose we are given a straight line and a point in a plane. If the line does not contain the point, then any line passing through the point intersects the line. In this case, the lines are closed curves, and the axioms of order must be modified. Specifically, in Euclidean geometry, the axioms of order are based on the notion of a point A lying between points Band C, where A, B, C are distinct points on a line. In elliptic geome- try, however, to define order we utilize the notion of a pair A, C of points separating another pair B, D (and vice versa), where A, B, C, D are distinct points on a line. The axioms of order are modified accordingly. The sum of inner angles of a triangle is smaller or greater than two right angles ac- cording as we use the axioms of hyperbolic or elliptic geometry. C. The Projective-Geometric Point of View We take tprojective coordinates in an n- dimensional, real tprojective space pn and consider a quadric hypersurface defined by Q:ax6 +xi + '" +x; =0, a =10 ( 90 Coordi- nates; 343 Projective Geometry). We denote by G the group consisting of the totality of pro- jective transformations of pn that leave Q 285C Non-Euclidean Geometry invariant. We call Q the absolute and call G the group of congruent transformations. When a < 0, then this Q is a real quadric hypersur- face. In this case, there exists a domain H n (the totality of points inside Q) whose boundary coincides with Q, and the group G acts ttransi- tively on H". The pair {G, H"} provides a model of hyperbolic geometry, and the n- dimensional hyperbolic space H n is homeo- morphic to an n-dimensional open cel1. Points of H n , points on Q, and points outside Q are called ordinary points, points at infinity, and ultrainfinite points (or ideal points), respec- tively. Two lines on H n are said to be parallel if they intersect on the absolute Q. Next, when a>O, the absolute Q is an imaginary quadric hypersurface, and the group G acts transitively on pn. The pair {G, pn} provides a model of elliptic geometry. The n-dimensional elliptic space pn is homeomorphic to n-dimensional real projective space; hence it is compact. In elliptic geometry, any two distinct lines in a plane necessarily intersect at a point. The above models {G, pn} of non-Euclidean geo- metries are called Klein's models. Let {G, H"} be a Klein's model, A, B distinct points in pn, [ the line containing A, B, and I, J be two points where the line [ meets Q (Fig. 2). If we denote by (A, B,I,J) the tanhar- monic ratio of these four points, then the non- Euclidean distance p between the points A and B is given by p=alog(A,B,I,J), a constant. Next, let [, 9 be lines in H n intersecting at a point D. In the plane determined by [and g, we draw two imaginary tangents u, v to the absolute through D. Denoting by (I, g, U, v) the anharmonic ratio of these four lines, the non- Euclidean angle (} between the lines [ and 9 is given by (}=(1/2i)log(l,q,u, v), i=P. Gen- erally, let 0 be a point in the projective space pn, and denote by p the tpolar of 0 with re- spect to the absolute. By counting the polar p doubly, it can be regarded as a quadric hypersurface, which wi11 be denoted by So. A quadric hypersurface S of H n is called a non- Euclidean hypersphere if it belongs to the tpen- cH of quadric hypersurfaces determined by Q and So. S is called a proper hypersphere, a limiting hypersphere, or an equidistant hyper- surface according as the center 0 is an ordi- q X Y' X' Y Fig. 2 
285D Non-Euclidean Geometry nary point, a point at infinity, or an ultrain- finite point, respectively. In the case of e11ip- tic geometry {G, PO}, the distance p between two points A and B is given by p = (aji) . log (A, B,I, J) as I, J are imaginary points. Moreover, we may get parabolic geometry as the "limit" (a--> CX)) of the geometry given by Klein's models. D. The Conformal-Geometric Point of View Let sn be an n-dimensional tconformal space. If we take suitable (n + 2)-hyperspherical co- ordinates in So, the space sn can be realized as a quadric hypersurface xf + x +... + x- 2xoxoo =0 in (n + I)-dimensional real projec- tive space p"+I. A point in P"+! represents a hypersphere in S" ( 76 Conformal Geometry; 90 Coordinates). We denote by G the group consisting of the totality oftMobius trans- formations leaving invariant a (real, point, or imaginary) hypersphere Q. When Q is a real hypersphere, the space sn is divided into Q and the two open cells H", H;. If we denote by G the totality of transformations of the group G that do not interchange H" and H;, then G is a subgroup of index 2 of G. In this case, each of the pairs {G, H"} and {G, H;} provides a model of hyperbolic geometry. When Q is a point hypersphere, the space E" obtained from S" by omitting the point Q is homeomorphic to an open ce11, and the pair {G, En} provides a model of parabolic geometry. On the other hand, when Q is an imaginary hypersphere, G is isomorphic to the torthogonal group O(n + 1). We ca11 the pair {G, SO} a spherical geom- etry and S" an n-dimensional spherical space. In this case two points x and x' are ca11ed equivalent if x' is the image of x by symmetry with respect to Q. The space P" obtained from S" by identifying equivalent points is homeomorphic to the n-dimensional real projective space. If we denote by G the group obtained from G by making its actions effec- tive on po, then G is the factor group of G by a cyclic group Z2 = Z/2Z. The pair {G, PO} pro- vides a model of elIiptic geometry. These models are ca11ed Poincare's models. They were introduced as a result of research on tautomorphic functions in the case n = 2. In Poincare's model, every straight line is represented either by a circle orthogonal to Q or by a circle passing through Q according as Q is a (real or imaginary) hypersphere or a point hypersphere. In spherical geometry, however, straight lines are usua11y calIed great circles, and two distinct great circles lying on a 2-dimensional sphere necessarily intersect at two points that are symmetric with respect to Q. Also, in Poincare's model the distance 1072 between two points A and B is defined as before, by making use of the anharmonic ratio of four points A, B, I, J on a circle (Fig. 3) ( 74 Complex Numbers G). Fig. 3 E. The Differential-Geometric Point of View An n-dimensional space M of tconstant curva- ture is by definition a tRiemannian manifold whose line element ds is given by d 2 dxf + dx + ... + dx s -(1+  (xi+x+...+x;)) with respect to appropriate local coordinates, where K is a constant caIled the tsectional curvature ( 364 Riemannian Manifolds). According as K is positive, zero, or negative, M can be considered locally as an elliptic space, Euclidean space, or hyperbolic space, respectively. In this case, lines are tgeodesics of M, and the non- Euclidean distance and non-Euclidean angle are those defined in the Riemannian manifold. When n = 2, a tsim- ply connected and tcomplete space of posi- tive constant curvature is tern bedded in 3- dimensional Euclidean space as a sphere, and a space of negative constant curvature is tlocalIy isometric to a pseudosphere (Fig. 4), which is a surface of revolution obtained by rotating a ttractrix around its asymptote. tComplete n- dimensional spaces of constant curvature (n? 2) are ca11ed space forms. A simply con- nected space form is necessarily one of spher- ical space, Euclidean space, or hyperbolic space. Each of these is a tuniversal covering manifold of a general connected space form with a curvature of the same sign, and the group of tcovering transformations is isomor- Fig. 4 
1073 phic to a tdiscontinuous subgroup of the group of congruent transformations. Each of the spaces, Euclidean, non- Euclidean, and spherical, is a thomogeneous space on which the corresponding group of congruent transformations acts transitively. Actually, each of these spaces has the structure of a tsymmetric Riemannian homogeneous space of rank 1. References [IJ D. M. Y. Sommerville, The elements of non-Euclidean geometry, Bell, 1914. [2J F. Klein, Vorlesungen iiber nicht- Euklidische Geometrie, Springer, 1928 (Chel- sea, 1960). [3J E. Cart an, Leons sur la geometrie des espaces de Riemann, Gauthier-Villars, second edition, 1963. [4J H. S. M. Coxeter, Non-Euclidean geome- try, Univ. of Toronto Press, fifth edition, 1965. [5J F. Engel and P. Stiickel, Urkunden zur Geschichte der nicht-Euklidischen Geometrie I, II, Teubner, 1898, 1913. 286 (XII.21) Nonlinear Functional Analysis A. General Remarks At present, the theory of nonlinear problems is still not unified, and many individual results obtained for specific classes of problems are stated in the languages of the corresponding fields of study. However, there are some funda- mental facts and methods of a general nature concerning nonlinear problems, which may be referred to as the subject matter of nonlinear functional analysis. B. Iterative Methods Let X be a tBanach space. Consider a non- linear mapping G:X ->X and the equation GX=O (XEX). Set F = I-G. Then (1) can be written as x=Fx. F satisfies the Lipschitz condition if there exists a constant rx such that IIFx-Fyll ::;;rxllx-yll 286D Nonlinear Functional Analysis for all x, y in X. In particular, the mapping F is said to be nonexpansive if 0 < rx::;; 1. If 0 < rx < 1, then F is called a contraction. (Sometimes a nonexpansive mapping is also called a con- traction.) A contraction F satisfies the contrac- tion principle: F has a unique fixed point Xo, and the iteration xn+l=Fxn (n=I,2,...) with an arbitrary initial element XI always converges to Xo [1-3]. Similar results hold for a contraction that is defined only on a tball and leaves the ball invariant. This leads to tNewton's iterative process and the timplicit function theorem. C. Methods of Monotonicity By definition, a nonlinear mapping G from a tHilbert space H to H is a monotone or accre- tive operator if Re(Gx-Gy,x- y)O (x,YEH). G. Minty [4J proved that ifG:H-->H is mono- tone and continuous, then AI + G is a mapping onto H for any Ie> 0, and its inverse (AI + G)-1 is nonexpansive. He has also shown that in the hypothesis of the theorem, we can replace the continuity requirement for G by maximality of G within the class of accretive operators that are possibly multivalued. Various develop- ments of Minty's ideas, including generaliza- tion of his results to Banach spaces and appli- cations to partial differential equations, have been obtained by F. Browder (A mer. Math. Soc. Proc. Symposia in Appl. Math., 17 (1965)), 1. Leray and J. L. Lions (Bull. Soc. Math. France, 93 (1965)), J. L. Lions [5J, W. Strauss, H. Brezis (A mer. Math. Soc. Proc. Symposia in Pure Math., 18 (1970)), and others. A mapping A is said to be dissipative if - A is accretive. Dissipative mappings playa cen- tral role in the theory of nonexpansive semi- groups ( Section X). D. Topological Methods (1 ) In the geometric study of ordinary differen- tial equations [6J some familiar theorems of topology and tdifferential topology have been strong tools, e.g., tBrouwer's fixed-point theo- rem is utilized to establish the existence of periodic solutions. However, in order to deal with nonlinear partial differential equations we have to generalize these theorems to infinite- dimensional cases. For example, a fixed-point theorem in an infinite-dimensional space was first obtained by G. D. Birkhoff and O. D. (2) (3) 
286 E Nonlinear Functional Analysis Kellogg (Trans. Amer. Math. Soc., 23 (1922), 95-115). The theory of the degree of mappings was generalized to the case of Banach spaces by J. Leray and J. Schauder [33J for the class of mappings of the form 1- F, where F is a compact continuous mapping, i.e., the image by F of any bounded set is relatively compact. Let D be an open bounded set in a Banach space X and aD be its boundary. Let F:D->X be a continuous compact mapping and cJ> denote I-F. If a point p in X does not be- long to cJ>(aD), then we can define the Leray- Schauder degree d(cJ>, p, D) of cJ> relative to p [1-3]. The Leray-Schauder degree d(<I>, p, D) is an integer with the following properties: (i) d(l,p,D)=1 ifpED. IfpcJ>(D), then d(cJ>,p, D) = 0. (ii) (Homotopy invariance) d(cJ>, p, D) de- pends only on the compact homotopy class of cJ>:cD->X"-{p}. More precisely, let K:[O, IJ x aD->X be a continuous compact mapping such that x+ K(t, x) # p for any tE [0, IJ and any x E aD. Let cJ>o(x) = x + K (0, x) and cJ>1 (x) = x+K(I,x). Then d(cJ>o,p,D)=d(cJ>hP,D). (iii) If p and p' are in the same component of X,,-cJ>(aD), then d(cJ>,p, D) = d(cJ>,p', D). (iv) (Con- tinuity) d(cJ>, p, D) is a continuous 10cal1y con- stant function of cJ> (with respect to uniform convergence) and of pE X "-cJ>(aD). (V) (Domain decomposition) If D is the union of finite num- ber of open disjoint sets Dj (j = 1, 2, ..., N) with aDF aD and cJ>(x) # p on Uf1 aD j , then d(cJ>, p, D) = Lf1 d(cJ>, p, DJ (vi) (Excision) If  is a closed subset of D on which cJ>(x) # p, then d(cJ>,p,D)=d(cJ>,p,D,,-). (vii) (Cartesian prod- uct formula) If X = XI EB Xz with D i C Xi' cJ> = (cJ>I' cJ>z) with cJ>i: Di->X i (i = 1,2), D = DI X Dz, and p =(Ph pz), then d(cJ>, p, D) = d(cJ>, , PI' D,) X d(cJ>z, Pz, Dz), provided that the right-hand side is well defined. The degree of mapping of cJ> is also defined for some proper Fredholm mapping cJ> ( Section E; K. D. Elworthy and A. J. Tromba [38J; [32J). Brouwer's fixed-point theorem ( 153 Fixed-Point Theorems) is generalized to Schauder's fixed-point theorem: A compact mapping F of a closed bounded convex set K in a Banach space X into itself has a fixed point. Using the Leray-Schauder degree theory, one has the Leray-Schauder fixed-point theorem: Let D be a bounded open set of a Banach space X containing the origin O. Let F(x, t): 15 x [0, 1 J -> X be a compact mapping such that F(x,O)=O. Suppose that F(x,t)#x for any xEaD and tE[O, 1]. Then the compact mapping F(x, 1) has a fixed point x in D [1-3]. Other homotopy invariants, such as thomo- topy groups and tcohomotopy groups, are also used in nonlinear functional analysis [1,3]. For instance we have the following 1074 theorem of Shvarts (1964) [3]: Let X and Y be two Banach spaces. Let D= {XEX I II xii < 1} be the unit ball and oD={xEXlllxll = I} be the unit sphere of X. Suppose L is a fixed continu- ous linear tFredholm operator from X to Y of index p? 0. Let P L be the set of compact perturbation of L mapping aD into Y,,-O, i.e., P L = {cJ> = L + K I K is a continuous compact mapping of aD to Y such that cJ>(x)=Lx+ K(x)#O for xEDD}. Two mappings cJ>o and cJ>1 in PL are said to belong to the same com- pact homotopy class on aD if there exists a continuous compact mapping h: [0,1 J x aD-> Y such that Lx + h(t, x) # 0 for x in (D, cDo(x) = Lx+ h(O, x), and cJ» (x)= Lx+ h(l, x). Shvarts's theorem: Let L be a fixed continu- ous linear Fredholm mapping from X to Y of index p? 0. Then the compact homotopy classes on aD of P L are in one-to-one corre- spondence with the elements of the pth stable homotopy group nn+p(sn) (n? p + 1) ( 202 Homotopy Theory H). Warning: The topological structure of an infinite-dimensional Hilbert or Banach space is quite different from that of a finite-dimensional Euclidean space. For instance, let X be a Hil- bert space of infinite dimension, and let D = {XEX I II xii < I} be its unit ball and iJD= {XE X Illxll = 1} be the unit sphere of X. S. Kakutani (Proc. Imp. Acad. Tokyo, 19 (1943)) gave a fixed-point free continuous mapping of D into itself if X is separable [1-3]. Thus naive generalization of the Brouwer fixed- point theorem is no longer true in infinite- dimensional spaces. V. Klee and C. Bessaga [17J proved that the unit sphere aD is e x _ diffeomorphic to X for an arbitrary Hilbert space X of infinite dimension. N. H. Kuiper proved that the group of invertible continuous linear operators on X is contractible if X is separable [3]. All this is in striking contrast to the well-known facts for finite-dimensional spaces ( 202 Homotopy Theory, 427 Topol- ogy of Lie Groups and Homogeneous Spaces). This is the reason why compactness assump- tions are made in the theorems mentioned above. E. Calculus in Banach (or Locally Convex) Spaces When one considers a nonlinear operator, it often happens that the domain and the range are neither linear spaces nor their open subsets. The domain might be a space of all smooth mappings of a compact manifold into another, and so might be the range. Such spaces have no linear structure, and hence linearity or semilinearity do not make sense in 
1075 general. The concept of infinite-dimensional manifolds is therefore introduced of necessity in nonlinear functional analysis. Definition of differentiable mappings. Let E and F be real Banach spaces and let L(E, F) (= L(E) if E = F) be the Banach space of all bounded linear operators with uniform oper- ator norm. Let U be an open subset of E and x a point of V. A mapping (= nonlinear oper- ator) f of V into F is called Gateaux differ- entiable at x if Iim ho t- I (/'(x + ty)-f(x))= df(x, y) exists for any yE E. df(x, y) is called the Gateaux derivative off at x. f is called Frechet differentiable at x if there exists a linear oper- ator AEL(E, F) such that IimyO Ilf(x+ y)- {(x)-Ayll/llyll =0. A is called the Frechet derivative of f at x and is denoted by df(x) or .f'(x). f is Frechet differentiable in V if and only if it is Gateaux differentiable, df(x, y) is linear in y, and SUPy#O Ildf(x,y)II/llyll is bounded [10]. Let U be an open subset of E. A mapping f of U into F is said to be of class CO if it is continuous and to be of class C I if it is Frechet-differentiable at each point XE U and the differential df(x)EL(E,F) is continuous as a mapping of V into L(E, F). The differential df(x) is also called the linearized operator. If the mapping df: U ->L(E, F) is of class C'-I, thenf is said to be of class cr. d(d'-If)(x) is written as d'f(x), and called the rth differential at x. d'f(x) is an r-linear, bounded, symmetric operator of Ex... x E (r times) into F. f is said to be of class C' if j is of class C' for every r. For an open subset V of F, f: V -> V is called a C' diffeomorphism if f is a bijection and both f and r 1 are of class C'. A C' mapping f of V into F is called a Fred- holm mapping (Fredholm map) if df(x)E L(E, F) is a linear tFredholm operator for every XE V. Since Ind df(x) is constant if V is connected, that integer Ind df(x) is called the index of f F. Taylor's Theorem and Its Converse Let f: V -> F be of class C' (r;;" I). A general- ized Taylor theorem claims that f can be ap- proximated by a polynomial mapping: Let XE V and YE E be sufficiently close to 0 so that x+tyEVforOt1. Then _'Ik f 1 (l-t)'-1 f(x+y)-  _ k id f(x)(y,...,y)+ (-I)' k-O . 0 r . x {d'f(x + ty) - d'.f(x)} (y, ..., y)dt. Let Lym(E, F) be the Banach space of all k- linear, bounded, symmetric operators of E x '" x E into F with the uniform topology. If for every k, 0  k  r, there exists a continuous 286H Nonlinear Functional Analysis mapping IfJk: V -> Lym(E, F) such that , 1 f(x+ y)= kO kTlfJk(x)(y,... ,y)+o(llyll') at every XE V and y sufficiently close to 0, then f: V->F is of class C', and dkf(x) = IfJdx). G. The Implicit Function Theorem Using the notation above, let f: V -> V be of class C', r;;" I, and assume that OE V, OE V, and ((O) = 0, where 0 is the origin of E or F. Sup- pose there is an A E L(F, E), called the right inverse of 4f(0), such that df(O)A = 1 F (the iden- tity). Then the following assertions hold: (i) The image of F under A, AF, is a closed sub- space of E, and E = Ker 4{(0) EB AF. (ii) There are neighborhoods VI' U z , V' of the zeros of Kerdf(O), AF, F, respectively, such that VI EB U 2 c V and such that the mapping g: VI EB U 2 -> VI EB V' defined by g(u, u) = (u,f(u, u)) is a C'-diffeomorphism. Therefore, denoting the inverse of 9 by h=(hl,h z ), we have hdu, w)=u and f(u, hz(u, w))=w. The latter means that the nonlinear equation f(u, v)= w can be solved with respect to u. H. Existence and Uniqueness of Integral Curves Using the notation above, let f be a C' map- ping (r;;" I) of V into E. Since V x E is the ttan- gent bundle of V, (x, f(x)) can be regarded as a C' tangent vector field on V. The equation of tintegral curves is (d/dt)x(t) = f(x(t)). A local existence and uniqueness of solutions is stated as follows: For an arbitrarily fixed x E V, there are c > 0 and an open neighborhood W of x such that there exists uniquely a C' mapping h of Wx (-c,c) into V satisfying (d/dt) h(w,t)= f(h(w,t)) and h(w, 0)= w. Using this fact, one can prove the Frobenius theorem: Let E' be a closed linear subspace of E with a direct summand En, and let f: V--> L(E', E") be a C' mapping (r;;" I) such that f(O) =0. To each XE V one associates a closed linear subspace Dx= {(v,f(x)v) 1 VE E'}. The disjoint union D= UXEUDX can be regarded as a subbundle of the tangent bundle U x E. A mapping U of V into E is called a cross sec- tion of D if U(X)E Dx for every XE V. D is caIled involutive if for any two C I cross sections ii. u of D, the Lie bracket product [ii, u] defined by [ii, u] (x) = dii(x)(v(x))- dv(x)(ii(x)) is again a cross section of D. Now suppose D is an in- volutive subbundle of U x E. Then for an arbi- trarily fixed XE U, there are a neighborhood W 
286 I Nonlinear Functional Analysis of x and a C' diffeomorphism h of W onto a neighborhood of ° of E such that dh(x)(Dx! == E'. This fact shows that an involutive sub- bundle can be trivialized by a suitable change of local coordinate systems. I. Local Theories on Locally Convex Spaces All local theories mentioned in Sections E-H are constructed on Banach spaces. However, it is important in concrete applications to con- struct these theories on a wider class of locally convex topological linear spaces. Let E, F be tIocally convex topological linear spaces, and let U be an open subset of E. A mapping f: U -> F is said to be of class CO if it is continuous. f is said to be of class C' if f is of class C'-1 and the following is fulfilled: For every XE U, there is an r-Iinear continuous symmetric mapping d r f(x) of Ex... x E into F such that drf: U xE x... x E->F is continuous. If we put F(y)=f(x+y)- f(x)-df(x)(y)-... 1 --drf(x)(y,... ,y) r! for every y sufficiently close to OE E, then the mapping C defined by Crt, y) = { F(ty)/t', t o, (tE R) 0, t-O, is continuous at (O,O)ER x E. The definitions ofC'" mappings and C' diffeomorphisms are given as in Section E. J. Implicit Function Theorems in Locally Convex Spaces The implicit function theorem does not hold in general locally convex spaces. However, since it is useful for nonlinear problems, sev- eral sufficient conditions are presently being studied. The following are some of them. We assume that f: U -> F is a C' mapping (r;;:' 1) such that frO) = ° and that df(O): E -> F has a continuous right inverse. (i) If dim F < 00, the implicit function theo- rem holds just as in Section G. (ii) An implicit function theorem that can be applied to nonlinear elliptic differential operators can be restated as follows. Suppose E, Fare tprojective limit of families of Banach spaces {E\k;;:'d}, {F\k;;:'d} and U =En U d , where U d is an open subset of Ed. Iff: U->F can be extended to a C' mapping (r» 2) of E k n U d into F k for every k;;:, d, if f satisfies the following inequalities, called a linear estimate 1076 with respect to Ilk: Idf(x + Y)(V)lk:::;; C(lylkl vl d + Iyldlvlk) +PdIYlk-l)lvl k - 1 , k>d, Wf(x + y)(u, V)lk:::;; C(lylkluldl vl d + Iyldlulklvld + Iyldluldlvlk) + Pk(lylk-dlulk-ll vlk-l, k> d, (4) where I Ik is the norm in E k or F\ C is a posi- tive constant independent of k, and P k is a polynomial with positive coefficients, and if a right inverse A of df(O) satisfies the inequality of Girding type, IAulk<C'lulk+Dklulk-J' k>d, (5) where C' > ° independent of k and Dk > 0, then the implicit function theorem holds just as in Section G, and the obtained mapping h satis- fies the same inequalities as (4) [11]. (iii) Nash-Moser implicit function theorem. Though linear estimates such as (4) hold for many differential operators, the second in- equality (5) is sometimes out of order, espe- cially if f is a nonlinear hyperbolic operator. However, one can often obtain instead of (5) a weaker inequality: IAulkC'lulk+s+Dklulk+s-j, s>O. J. Nash [37J and 1. Moser [12J approximated such an operator A by some smoothing oper- ators and proved an implicit function theo- rem under a certain additional condition. The Nash-Moser implicit function theorem was successfully applied to many difficult problems, e.g., the embedding problem of Riemannian manifolds [37], the small divi- sor problem of celestial mechanics [36J, free boundary problems (e.g., L. Hormander Arch. Rational Mech. Anal., 62 (1976),1-52), and other problems (e.g., S. Klainerman, Comm. Pure Appl. Math., 33 (1980), 43-101; M. Kura- nishi, Amer. Math. Soc. Proc. Symposia in Pure Math., 30 (1977), 97-105). (iv) Analytic implicit function theorem. In cases (ii) and (iii), the spaces E, F were given as projective limits of Banach spaces. On the contrary, H. Jacobowitz considered the case where E, Fare tinductive limits of Banach spaces. For instance, the space E of the smooth functions can be approximated by a family of Banach spaces {E. I I: > O} of all real analytic functions with I: as the radius of con- vergence. Under this circumstance, certain conditions for f and the right inverse of df(O) yield an implicit function theorem [13]. (v) Mather's implicit function theorem [14]. The difficulty of implicit function theorems in Frechet spaces is concentrated in the following fact: Even if df(O) has a right inverse and even if x is sufficiently close to 0, df(x) may not have 
1077 a right inverse. If the implicit function theorem holds, such a phenomenon should not happen. In cases (ii)-(iv), several functional-analytic conditions exclude this pathological phenom- enon. In some special cases, these conditions can be replaced by algebraic ones. J. Mather, using his division theorem, proved an im- plicit function theorem that was applied to thf' theory of singularities. K. Infinite-Dimensional Manifolds A Hausdorff space M is called a C' Banach manifold modeled on a Banach space E if the following conditions are satisfied: (a) Mis covered by a family of open subsets {U a } aEA' For each U a there are an open subset II;. of E and a homeomorphism ljJa of U a onto Va. Such a pair is called a local coordinate system or a local chart of M. (b) If U a n Up =I 0, ljJaljJp1 is a C' -diffeomorphism of IjJp(U a n Up) onto ljJa(Un Up). (c) The index set A is maximal among those that satisfy (a) and (b). If M (resp. M') is a C' Banach manifold modeled on E (resp. E'), then so is M x M' modeled on E EB E'. A mapping f:M ->M' is said to be of class C' if f expressed through local coordinate systems is of class cr. Suppose M is covered by a family of local charts {( Ua, ljJa)} aEA' On the disjoint union UaEA U x E, define an equivalence rela- tion  as follows: For (x, U)E U a x E, (y, V)E Up x E, (x, u)  (y, v) if and only if x = y and d(ljJaljJpl)(x)v=u. The set of equivalence classes is called the tangent bundle of M. which will be denoted by T M . There is another definition of the tangent bundle which uses the ring of C' functions on M. However, since E is not reflexive in general, the latter gives us a differ- ent vector bundle. TM is a C'-1 Banach mani- fold modeled on E EB E. The correspondence which sends (x, u) E UaEA U a x E to x induces a C'1 mapping of TM onto M, called the pro- jection of TM' denoted by n. A topological group G is called a Banach- Lie group if G is a COO Banach manifold and the group operation (g,h)->g-lh is a COO mapping of G x G into G. If E is a Banach space, then GL(E), the group of all invertible bounded linear operators, is a Banach-Lie group under the uniform topology. If E is a Hilbert space, then the group of all unitary operators is also a Banach-Lie group under the same topology. The concepts of manifolds and Lie groups are similarly defined when the model space is a Hilbert space or a Frechet space. These are called a Hilbert manifold and a Frechet manifold, respectively. For some differential 286N Nonlinear Functional Analysis topologies on separable Hilbert manifolds  279 Morse Theory E. L. Structures on Infinite-Dimensional Manifolds Suppose M is a C'+1 Hilbert manifold modeled on E. At each XE M, the tangent space TxM = n -1 (x) is a Hilbert space, linear- homeomorphic to E. M is called a C' Riemann- ian manifold if there is defined an inner prod- uct (u, v>x on each TxM such that (', . >x is of class C' with respect to x. Existence of such a structure is ensured by using a partition of unity if M is paracompact. Let M be a C' Banach manifold (r 1) modeled on a Banach space E. Each tangent space M is linear homeomorphic to E. M is called a C' Finsler manifold if there is a norm Iulx defined on each TxM such that I Ix is continuous with respect to x. A paracompact C 1 Banach manifold can have a C 1 Finsler structure. Let M be a C'+2 Frechet manifold (rO) and C'+l(M), P+1(TM) the spaces of all C'+1 functions and of all C'+1 vector fields on M, respectively. A bilinear mapping V of r,+1 (T M ) x p+ 1 (TM) into r'(T M ) is called an affine connection on M if V satisfies VI. V = fV.v, Vufv=(iif)v+ fVuv for every ii, vEf"OHl(T M ), fEC'+1(M). For an affine connection V, T(ii,v) = Vuv - Vvii - [ii, u] is called the torsion tensor and R(ii, v) = V.Vv-VvV.-V[u.V] is called the' curvature tensor of V. If M is a Riemannian manifold, then there exists a unique affine connection without torsion which leaves the Riemannian inner product parallel. M. Local Linearization Theorems Let M be a C'+1 Banach manifold (r 1) modeled on E. Let ii be a C' vector field on M such that ii(x) =I 0 at XEM. Then there are a neighborhood U x of x and a C' diffeomor- phism IjJ of U x onto an open subset V of E such that dljJii(IjJ-1(y))=(y, v), vEE, for every yEV, where v does not depend on y. N. Morse Lemma Let M be a C,+2 Hilbert manifold and f be an R-valued C'+2 function. Suppose x is a critical point of f, i.e., df(x) = O. x is called a non degen- erate critical point if d 2 f(x) is a nondegenerate bilinear form. For such x there are a neighbor- hood U x of x and a C' diffeomorphism IjJ of U x onto an open neighborhood of 0 of the model space E such that ljJ(x)=O, and f(IjJ-1(y)) = IPyI2_1(1-P)yI2, where P is an orthogonal 
2860 Nonlinear Functional Analysis projection in E. i x =dim(l- P)E (Oix::fj) is called the index of the critical point x of f o. Submanifolds A subset N of a C' Banach manifold M mod- eled on E is called a C' submanifold if at each point XE N there are a neighborhood U x of x and a C'-diffeomorphism i/; of U x onto an open neighborhood V of 0 of E such that i/;(x) =0 and i/;(UxnN)= VnF, where F is a closed linear subspace of E. There are some other definitions of submanifolds. One of them re- quires in addition that F be a direct summand of E, and another uses instead of U x n Nits connected component containing x. In the latter definition, a submanifold is not neces- sarily locally closed. P. Sard-Smale Theorem Although it is not easy to define nontrivial measures on infinite-dimensional manifolds, the concept "almost everywhere" can some- times be replaced by that of residual sets. A subset of a topological space is called a tresidual set if it contains an intersection of countably many open dense subsets. A re- sidual set in a complete metric space or in a tBaire space is dense. S. Smale [15J ex- tended tSard's theorem to infinite-dimensional manifolds as follows: Let M, N be C' Ba- nach manifolds and f: M ->N be a C' Fred- holm mapping. If M is separable and r> max{O, Ind(df(x))} for each XE M, then Rf= N - f(C) is a residual subset of N, where C is the set of critical points off, i.e., a point x where df(x) is not surjective. Q. Calculus of Variations and Infinite- Dimensional Manifolds Many problems in the calculus of variations can be understood as problems seeking crit- ical points of functions defined on infinite- dimensional manifolds. R. Palais and S. Smale set up the following Condition C and fixed a category of functions where the critical points can be chased through gradient-like vector fields [6]. Palais-Smale Condition C. Let f be a C 1 function on a C 1 Finsler manifold M. If S is any subset of M on which f is bounded but on which I df(x) I is not bounded away from 0, then there is a critical point of f adherent to S. In general, it is not easy to examine Con- dition C for a concrete f However, many concrete problems, where the Euler equations are nonlinear elliptic, satisfy Condition C. 1078 Morse theory. Let M be a Cx" complete Riemannian manifold and f a C'"' function bounded below satisfying Condition C and having only nondegenerate critical points. Then, using the Morse lemma, one can make a thandlebody decomposition of M by the same method as in the case of finite-dimensional manifolds ( 279 Morse Theory). Lyusternik-Shnirel'man theory. This theory, constructed on finite-dimensional manifolds, can be extended naturally to Finsler mani- folds. Let M be a complete C 2 Finsler mani- fold and f a C 2 function satisfying Condition C and bounded below. Then f has at least cat(M) critical points, where cat(M) = m means that M can be covered by m closed contrac- tible subsets of M but not by m - 1 ones. If there is no such integer, then we set cat( M) = 00. Both Morse theory and Lyusternik- Shnirel'man theory have been successfully employed in the global theory of the calculus of variations. R. Bifurcation Theory Bifurcation theory concerns itself with the structure of the zeros of the functional equa- tion of w with a parameter k G(A.,w)=O. (6) In general, the state w satisfying (6) represents the equilibrium (time-independent or station- ary) solution of the tevolution equation w,=G()., w) and w(O)=w o ' (7) Here the evolution equation itself stems from a mathematical model describing natural phe- nomena, w = w(t) stands for the state at time t, and ), is the set of parameters representing the physical environment. For example, in the tNavier-Stokes equation appearing in fluid dynamics, w(t) represents the unknown veloc- ity field at time t and ). is the tReynolds num- ber. It is important to study bifurcation phe- nomena because they typically accompany the transition to instability of the state when some characteristic parameter passes through a certain value, called a critical value. Let X, Y, and A be real Banach spaces, and let G(A., w) be a mapping from A x X to Y. Suppose that there exists a mapping W(A): A-> X satisfying G(A, W(A.)) =0. One calls ()., w(A.)) a trivial solution of (6). (),o, w(A.o)) is called a bifurcation point of G(A, w) (with respect to the trivial solution) if in any neighborhood of (A o , W(Ao)) there exists a nontrivial solution of (6). (In general, there may appear another type of solution that is not connected with the trivial solution [20J). 
1079 Assume that G(.A., w) is of class C 1 in some neighborhood of (.A. o , W(Ao)) in A x X. Then it follows from the timplicit function theorem that (Ao, W(A o )) is not a bifurcation point if Gw(.A.o, W(A o )), the tFrechet derivative of G with respect to w, is nonsingular. S. The Principle of Linearized Stability Closely tied to the phenomenon of bifurcation is the property of stability. Suppose that the dynamics of a physical system are governed by (7). Let w(t; w o ) denote the solution of (7). An equilibrium solution W = w().) is called stable if for any f. > ° there exists a 6> Osuch that II w(t; wo) - W II < /; for all t > ° whenever Ilwo-wll <6. Furthermore, W is said to be asymptotically stable if, in addition, w(t; w o )-> was t-> XJ. By the principle of linearized stability we mean that the stability of an equilibrium solu- tion W is determined formally by the tspec- trum of the linearized operator Gw(.A., w(.A.)). As- sume that Gw(A, w(.A.)) has only a tpoint spec- trum. Then (i) if w(l) is stable, the spectrum of G w ()', w(m is contained in {ZEC I Rez O}, and (ii) if the spectrum of GJ;L, W(A)) is contained in {zECI Rez<O}, W is asymptotically stable. Suppose that as l crosses a certain value A o , one or more eigenvalues of Gw(A, W(A)) cross the imaginary axis from the left to the right half-plane, where W(A) is the known equilib- rium solution. This is precisely the situation when w().) becomes unstable. For notational simplicity, we put U=W-W(A) and F(A,U)= G(A,U+W(A)). T. Bifurcation from Simple Eigenvalues Take A = R. Let FE C 2 (R x X, Y) be such that F(A, 0) =0 for any real A. Set Lo = Fu(/co, 0), LI = FA. u(.A.o , 0), and suppose that (i) Ker(L o ) is spanned by uo#O; (ii) codimR(L o )= 1; and (iii) LI uariR(Lo), where R(L o ) denotes the trange of L o ' Then there exists a C'-curve ()., 1/1): ( - 6,6) -> R x X defined on some inter- val ( - 6, 6) such that ).(0) = A o , 1/1(0) = 0, and F(A(S), s(u o + l/1(s)))=O for any SE( -6,6). Moreover, in a neighborhood of (.A. o , 0) any zero of F either lies on this curve or is a trivial solution (M. G. Crandall and P. H. Rabino- witz, J. Functional Anal., 8 (1971)). In the above case, there exist only three possible situations of the curves (A, 1fJ) and (A,O), called subcritical, supercritical, and transcritical bifurcations. In the third case, there occurs the so-called exchange of stability [19-21]. 286 V Nonlinear Functional Analysis U. Bifurcation of Periodic Solutions (Hopf Bifurcation Theorem) If U(A) loses stability by virtue of a pair of complex conjugate eigenvalues crossing the imaginary axis, then under suitable conditions one can prove the existence of bifurcating time- periodic solutions of (7). Rewrite equation (7) as U,+LOU+g(A, u)=O. (8) Suppose (i) Lo:D(Lo)cX ->X is a densely de- fined linear operator on X such that - Lo generates a strongly continuous semigroup on X, which is holomorphic on XC ( = the com- plexification of X). Lo has compact resolvent; i is a simple eigenvalue of La, and ni riu(L o ), the spectrum of Lo, for n = 0. 2, 3, ... . As a con- sequence of (i), ifr> -Rd for any AEU(L o ), then the fractional power (rI + L o !" for IX;;:'O is well defined. Because their domains are in- dependent of r, one can set X. = D( (rI + L o )"), which are Banach spaces under the norm Iluli. = II(rI + Lo).ull. Suppose (ii) there exist an IX E [0, 1) and a neighborhood (I) of (0, 0) in R x X such that 9EC2({I), X.), where Ck({I), X.) denotes the space of all X-valued C k functions defined on (!). Moreover g(A,O)=O if(A,O)E(!), and 9u(0,0)=0. (iii) Let (3={3(),) be a continu- ously differentiable function defined in a neigh- borhood ofO such that (3(A)EU(L o +g u (A,O)) and (3(0) = i. Suppose that Re (3'(0) # 0. If as- sumptions (i), (ii) and (iii) are satisfied, then there exist a positive 6 and continuously dif- ferentiable functions (p, A, u):( -6, 6)-> R 2 x CO(R, X.) such that (a) for 0< Is I < 6, u(s) is a 2np(s)-periodic solution of period 2np(s) of (8) corresponding to A=).(S); (b) p(O)= 1, A(O)=O, u(O)=O, and u«)#O if s#O; and (c) any 2np- periodic solution 01 (8) in qp(R, X.) (= the space of 2np-periodic continuous functions with value in X.) with Ip-11, IAI and Ilull suffi- ciently small is of the above form for some I sl < 6 up to a translation of the real line. Moreover, if 9ECk+1({I),X.), then the functions p, A, u are of class C k . V. The Lyapunov-Schmidt Procedure Suppose that L(A) = Fu(A, 0) has at A = ° an n- fold eigenvalue at the origin, i.e., dim Ker L(O) = n, and assume that Xc Y. Let Ker L(O) be spanned by IfJI' ..., 1fJ, and let P be the projec- tion onto this linear subs pace that commutes with L(O). Then P must take the form Pu = LJ=1 (u, 1fJ/>lfJj' where the IfJJ* E Y* c X* are null functions of the adjoint operator L(O)* and < IfJj' 1fJt> = i5 jk . P is a linear operator from Y to Y; hence it can be regarded as a mapping from X to X as well, and Q = I - P is a projec- 
286W Nonlinear Functional Analysis tion onto the range of L(O) in Y. Using P and Q, the equation F(A,U)=O can be decomposed into the system of equations QF(A,v+!/t)=O and PF().,v+!/t)=O, (9) where v = Pu and !/t = Qu. One solves the first equation for !/t = !/t(A, v) by the implicit function theorem, and then, substituting this into the second, one has the bifurcation equation F(Je, v) = PF(A, v +!/t(A, V))=O. Solutions of the bifurcation equation are in one-to-one correspondence with solutions of the original system sufficiently dose to the bifurcation point. There exists another method of reducing an infinite-dimensional problem to a finite- dimensional one, called the center manifold theorem [21, 39]. W. A Global Result The following global result is due to P. H. Rabinowitz (J. Functional Anal., 7 (1971)). Assume that F(A,U)=U-ALu+H(A,U), where L is a compact linear operator and H: R x X -> X is a compact mapping with H(A,U)=o(llull) at 0 uniformly on bounded A-intervals. Then, if /1-1 is an eigenvalue of L of odd multiplic- ity, (/1,0) is a bifurcation point for F with respect to the trivial solution. Moreover, the closure of the set of nontrivial zeros of F con- tains a component that meets (/1,0) and either is unbounded in R x X or meets ({i., 0), where W;<o {i. and {i. -1 is an eigenvalue of L. The beginning of bifurcation theory seems to be in the celebrated work of H. Poincare (Acta Math., 7 (1885)). X. Abstract Cauchy Problems Suppose that we are given an abstract Cauchy problem du -=Au (t>O), dt u( +O)=a in a Banach space X, where aEX and the non- linear operator A is assumed, for simplicity, to be independent of t. If the domain D(A) of A coincides with X and A is tLipschitz continu- ous, we can reduce the abstract Cauchy prob- lem to the tintegral equation of Volterra type u(t)=a+ t Au(s)ds, and by applying the iteration procedure we can easily show that the abstract Cauchy problem has a uniquely determined solution. 1080 (10) A similar treatment works for a singular but mildly nonlinear A of the form A = L + N if the linear operator L is the generator of a tstrong- ly continuous semigroup e'L in the sense of Hille- Y osida theory ( 378 Semigroups of Operators and Evolution Equations) and the nonlinear mapping N is Lipschitz continuous. In this case, we reduce the problem to the integral equation u(t)=e'La+ t e(h)LNu(s)ds. Here, if we merely have to assure ourselves of the iocal existence of the solution, then N can merely be locally Lipschitz continuous; and in this case, N can even be singular to some extent if elL is tholomorphic in t [22,23]. A typical application of this procedure was made by P. Sobolevskii, T. Kato, and H. Fujita to the Navier-Stokes equation ( 204 Hydro- dynamical Equations; 205 Hydrodynamics) to construct regular solutions [22]. tGalerkin's method ( 304 Numerical Solu- tion of Partial Differential Equations) is some- times quite convenient for obtaining (tweak) solutions of (11) and (12) [5]. Again, typical applications were made to the Navier-Stokes equations by E. Hopf and others [5,24,25]. To prove the convergence of approximate solu- tions constructed by Galerkin's method and subject to so-called energy estimates, we often make use of Aubin's compactness theorem concerning vector-valued functions [5]. Remarkable developments have taken place since 1964 for the case where A is dissipative, i.e., - A is accretive. F. Browder [26J proved that if X is a Hilbert space and A is dissipative, then the mere continuity of A is sufficient for the twell-posedness of(11) and (12). Then Y. K6mura [27J brought about a crucial advance by showing that if A is a (possibly multi valued and) maximal dissipative operator with D(A) dense in the Hilbert space X, then (11) and (12) are uniquely solvable for any aED(A). Furthermore, he founded the theory of non- linear semigroups of operators by establishing a tno n linear version of the Hille- Y osida the- ory for semigroups of tnonexpansive opera- tors in Hilbert spaces [28]. Subsequent de- velopments and applications were made in various directions by T. Kato (J. Math. Soc. Japan, 19 (1967)), M. Crandall and A. Pazy (J. Functional Anal., 3 (1969)), H. Brezis and A. Pazy (J. Functional Anal., 6 (1970)), S. Oharu (J. Math. Soc. Japan, 22 (1970)), M. Crandall and T. Liggett [29J, y. Konishi (Proc. Japan Acad., 47 (1971)), and others. While K6mura's original proof was based on the Yosida approximation A(I -F.A)-I(F.->O) for A, it is proved in [29J that in any Banach space a dissipative operator A that is maxi- (11 ) (12) 
1081 mal in a certain sense generates a nonlinear semigroup T, by the exponential formula T,x = lim ( I - A ) -n x. n--+oo n The scope of applications of the generating theorem (13) can be seen, e.g., in B. K. Quinn (Comm. Pure Appl. Math., 24 (1971)), M. G. Crandal1 (Israel J. Math., 12 (1972)), Y. Koni- shi (Proc. Japan Acad., 48 (1972) and J. Math. Soc. Japan, 25 (1973)), and S. Aizawa (Hiro- shima Math. J., 6 (1976)). Y. Nonlinear Semigroups in Banach Lattices Let X be a Banach lattice ( 310 Ordered Linear Spaces). An operator A:X::>D(A)-->X is said to be dispersive iffor al1 x, YED(A), II(x- y)+ II  II(x- y-A(Ax-Ay))+ II (;.>O). When a dispersive operator A satisfies the range condition R(I -AA)::> D(A) for any X> 0, it generates an order-preserving semi group T, = etA on D(A) : 11(T,x - T,y) + II  II(x - y) + II for t> O. We have therefore the preservation of order: x  y implies T,x  T,y. We can prove in par- ticular that the order of initial data is inherited by the solutions of a nonlinear heat equation (Y. Konishi). Remark. Various pathological phenomena arise when we do not restrict the form of A in the abstract Cauchy problem (11), (12). We cite merely the "blowing up" of solutions of Cauchy problems for tnonlinear heat equa- tions ( 291 Nonlinear Problems) and the nonlinear wave propagations described by nonlinear tSchrodinger equations (1. B. Bail- lon, T. Cazenave, and M. Figueira). Z. Abstract Cauchy-Kovalevskaya Theorem in a Scale of Banach Spaces T. Yamanaka (Comment. Math. Univ. St. Paul, 9 (1960)), L. V. Obsyannikov (Soviet Math. Doklady, 6 (1965) and 12 (1971)), F. Treves (Trans. Amer. Math. Soc., 150 (1970)), L. Niren- berg (J. Differential Geometry, 6 (1972)), and T. Nishida [30J discussed abstract treatments of classical Cauchy-Kovalevskaya theorem for partial differential equations: Let S = {Bp} p>O be a col1ection of Banach spaces depending on the real parameter p > O. Let Ilulip denote the norm of an element UE Bp. The col1ection S is cal1ed a scale of Banach spaces if, for any p and p'<p, BpcB p ' and Ilullp' Ilulip for any u in Bp' Consider in S the initial value problem of the form du dt = F(u(t),t), Itl<15, and u(O)=O. (14) 286 Ref. Nonlinear Functional Analysis (13) Assume the fol1owing conditions on F: (i) For some numbers R > 0, 1] > 0, Po> 0 and every pair of numbers p, p' such that 0 < p' < p < Po, (u, t)--> F(u, t) is a continuous mapping of {uEBplllullp<R} x {tlltl <1]} into Bp" (ii) For any p' <P<Po and al1 u, vEBp with Ilullp< R, II vll p < R, and for any t, It I < 1], F satisfies IIF(u,t)-F(v,t)llp.<Cllu-vllp/(p-p'), where C is a constant independent of t, u, v, p, or p'. (iii) F(O, t) is a continuous function of t, It 1 <1], with val ues in B p for every p < Po and satisfies, with a fixed constant K, IIF(O,t)llpK/(po-p), O P<Po' Abstract Cauchy-Kovalevskaya theorem. Under the preceding hypotheses there is a positive constant M such that there exists a unique function u(t) which for every positive P < Po and It I < M(po - p) is a continuously differentiable function of t with values in Bp, Ilu(t)llp<R, and satisfies (14) (also [31J). These results cover a theorem of M. Na- gumo (Japan. J. Math., 18 (1948)), which gen- eralizes the classical Cauchy-Kovalevskaya theorem. References [IJ L. Nirenberg, Topics in nonlinear func- tional analysis, Lecture notes, New York Univ., 1973-1974. [2J 1. T. Schwartz, Nonlinear functional analy- sis, Gordon & Breach, 1969. [3J M. Berger, Nonlinearity and functional analysis, Academic Press, 1977. [4J G. J. Minty, Monotone (nonlinear) oper- ators in Hilbert space, Duke Math. 1., 29 (1962),341-346. [5J 1. L. Lions, Quelques methodes de resolu- tion des problemes aux limites nonlineaires, Dunod,1969. [6J 1. K. Hale, Ordinary differential equations, Interscience, 1969. [7J M. A. Krasnosel'skii, Topological methods in the theory of nonlinear integral equations, Pergamon, 1964. (Original in Russian, 1956.) [8J A. Friedman, Partial differential equations of parabolic type, Prentice-Hall, 1964. [9J O. A. Ladyzhenskaya and N. N. Ural'- tseva, Linear and quasilinear el1iptic equations, Academic Press, 1968. (Original in Russian, 1956.) [IOJ E. Hille and R. S. Phil1ips, Functional analysis and semigroups, Amer. Math. Soc. Coil. Pub!., 31 (1957). [IIJ H. Omori, Infinite-dimensional Lie trans- formation groups, Lecture notes in math. 427, Springer, 1974. [12J 1. Moser, A new technique for construc- tion of solutions of nonlinear differential equa- 
287 A Nonlinear Lattice Dynamics tions, Proc. Nat. Acad. Sci. US, 47 (1961), 1824-1831. [13J H. Jacobowitz, Implicit function theo- rems and isometric embeddings, Ann. Math., 95 (1972),191-225. [14J 1. Mather, Stability ofC'" mappings I, Ann. Math., 87 (1968), 89-104. [15J S. Smale, An infinite-dimensional version of Sard's theorem, Amer. 1. Math., 87 (1965), 861-866. [16J R. Palais and S. Smale, A generalized Morse theory, Bull. Amer. Math. Soc., 70 (1964),165-172. [17J C. Bessaga, Every infinite-dimensional Hilbert space is diffeomorphic with its unit sphere, Bull. Acad. Sci. Polon., 14 (1966),27- 31. [18J R. Palais, Lusternik-Schnirelman theory on Banach manifolds, Topology, 5 (1966), 115-132. [19J M. G. Crandall and P. H. Rabinowitz, Mathematical theory of bifurcation, Bifurca- tion Phenomena in Mathematical Physics and Related Topics, C. Bardos and D. Bessis (eds.), Reidel, 1980. [20J 1. E. Marsden, Qualitative methods in bifurcation theory, Bull. Amer. Math. Soc., 84 (1978),1125-1148. [21J D. H. Sattinger, Bifurcation and sym- metry breaking in applied mathematics, Bull. Amer. Math. Soc., new ser., 3 (1980), 779- 819. [22J T. Kato, Nonlinear evolution equations in Banach spaces, Amer. Math. Soc. Proc. Symposia in Appl. Math., 17 (1965),50-67. [23J P. E. Sobolevskii, Equations of parabolic type in a Banach space, Amer. Math. Soc. Transl., (2) 49 (1966),1-62. (Original in Rus- sian, 1961.) [24J E. Hopf, Uber die Anfangswertaufgabe fUr die hydrodynamischen Grundgleichungen, Math. Nachr., 4 (1951),213-231. [25J O. A. Ladyzhenskaya, The mathemat- ical theory of viscous incompressible flow, Gordon & Breach, 1963. (Original in Russian, 1961). [26J F. Browder, Nonlinear equations of evo- lution, Ann. Math., 80 (1964), 485-523. [27J Y. Komura, Nonlinear semigroups in Hilbert space, J. Math. Soc. Japan, 19 (1967), 493-507. [28J Y. Komura, Differentiability of nonlinear semigroups, J. Math. Soc. Japan, 21 (1969), 375-402. [29J M. G. Crandall and T. M. Liggett, Gener- ation of semigroups of nonlinear transfor- mations on general Banach spaces, Amer. 1. Math., 93 (1971), 265-298. [30] T. Nishida, A note on a theorem of Niren- berg, J. Differential Geometry, 12 (1977), 629- 633. 1082 [31 J T. Kano and T. Nishida, Sur les ondes de surface de l'eau avec une justification math€:- matique des equations des ondes en eau peu profonde,1. Math. Kyoto Univ., 19 (1979), 335-370. [32J Yu. G. Borisovich, V. G. Zvyagin, and Yu. I. Sapronov, Nonlinear Fredholm maps and the Leray-Schauder theory, Russian Math. Surveys, 32 (1977),1-54. (Original in Russian, 1977.) [33J 1. Leray and J. Schauder, Topologie et equation fonctionnelles, Ann. Sci. Ecole Norm. Sup., 51 (1934),45-78. [34J L. Lyusternik and L. Shnirel'man, Methode topologique dans les problemes variationnels, Herman, 1934. (Original in Russian, 1930.) [35J M. Morse, Calculus of variations in the large, Amer. Math. Soc. Colloq. Pub\., 18 (1934). [36] 1. Moser, A rapidly convergent iteration method and nonlinear partial differential equations I, II, Ann. Scuola Norm. Pisa, 20 (1966),265-315,499-535. [37J 1. Nash, The embedding problem for Riemannian manifolds, Ann. Math., 63 (1965), 20-63. [38J K. D. Elworthy and A. J. Tromba, Dif- ferential structures and Fredholm maps, Amer. Math. Soc. Proc. Symposia in Pure Math., 15 (1970),45-94. [39J 1. Carr, Applications of Centre manifold theory, Springer, 1981. 287 (XX.33) Nonlinear Lattice Dynamics A. Lattice Dynamics In order to elucidate certain characteristic features of nonlinear waves, one-dimensional lattice models have been studied. Around 1953, E. Fermi et al. performed computer experiments on nonlinear lattices to verify a generally accepted belief that nonlinear cou- pling between the tnormal modes of harmonic oscillators would lead to complete energy sharing between these modes. To their sur- prise, their nonlinear lattices yielded very little energy sharing at all; on the contrary, the interactions resulted in the recurrence of the initial state. These results were later inter- preted in terms of solitons ( 387 Solitons), i.e., nonlinear waves that preserve identity despite mutual interaction. The equations of motion for a uniform 1- dimensional chain ofpartic1es of mass m with 
1083 nearest-neighbor interaction can be written as m d;tn = -ep'(Qn-Qn-d+ep'(Qn+I-Qn) (n=...,1,2,...). It was shown that a lattice with a nonlinear interaction of the form ep(r) = e -r + r + const. admits solutions in closed form. Later, it was shown that this lattice (the exponential lattice or the Toda lattice) is a completely integrable system. It is convenient to introduce Sn, which is the generalized momentum canonically conjugate to the mutual displacement rn=Qn+1 -Qn- For a lattice with exponential interaction, e -r. -1 = dsn/dt, and if we introduce Sn= f sn dt , then the equations of motion can be written as 10g(1 +d 2 S n /dt 2 )=Sn+1 +Sn-I -2S n , and the displacements are given by Qn =Sn-Sn+I' We have a solution Sn=log{1 +e 2 (2n+ P tH)}, where rx and 15 are arbitrary constants and fJ = ::!:: sinh rx. The associated wave e-'.-1 =f32 sec h 2 (rxn+f3t+15) represents a solitary wave or soliton. The multi soliton (N-soliton) solution is given by Sn=logdet\Iln, where \Iln is an N x N matrix whose elements are ( \I' ) _ s: + (ZjZdn+1 W+.)t Tn jk - Ujk CjC k 1 _ Z;Zk e ' with Zj= ::!::e- 2j , f3 j = +sinhrx j . Asymptotically the wave reduces as t---> +00 to an assembly of solitons N e-'. - I = I f3} sech 2 (rx j n + f3 j t +15}-). j=1 B. Conserved Quantities The equations of motion for a periodic ex- ponentiallattice of N particles can be written 287 C Nonlinear Lattice Dynamics in a Lax representation dL/dt = BL - LB, where Land Bare N x N matrices with the elements L1Jn=b n , Ln,n+l =L n + 1 ,n=a n , L1.N=LN.I =a N , 8 n ,n+l = -8n+1.n= -an, B1.N=-B N . I =a N (the other elements of Land B are all zero), with an =e-(Q.+I -Q.)/2, h n = !Pn (Pn=dQn/dt). The eigenvalues X of L can be shown to be independent of time, and so the motion of the lattice is a spectrum-preserving deformation. Now, if we define {Id by det(A1- L)= )"N + },N-1 II +... +AI N _ I +IN' the n{II} are polynomials of an and h n . These are constants of motion that were discovered independently by M. Henon and H. Flaschka. Thus the lattice has N conserved quantities; /1 is related to the total momentum and 1 2 to the total energy, but higher-index conserved quantities have no physical interpretation. C. Method of Integration Let Land B be the infinite matrices obtained from the foregoing ones in the limit N ---> 00. The eigenvalues ). of the equation Lep = )dp are independent of time, and the time evolu- tion of ep is given by the equation dep/dt = Bep. If the motion in the lattice is restricted to a finite region, we can clearly speak of the scat- tering of the wave ep due to the deformation in the lattice. For a given initial motion Qn(O) and Pn(O), or L(O), we calculate the initial scattering data of asymptotic form ep  zn (n---> 00). The scattering data consist of the reflection coeffi- cient R(z), the bound state eigenvalue Aj = -(Zj+zj-I)/2 (Aj< -1 or Aj> I), and the co- efficient c j of the normalized bound state eigen- function of asymptotic form CjZ j n for n---> 00. From the initial data and the equations of motion for n---> +00, we get the scattering data at a later time t. In effect, we construct the kernel I 1, - F(m)= 2ni f R(z,O)e-(Z-Z I)tzm-t dz + Ic](O)e-(Z,-Zj-l lt zr j 
287 Ref. Nonlinear Lattice Dynamics of the discrete integral equation (Gel'£and- Levitan-Morchenko equation) 00 K(n,m)+F(n+m)+ L K(n,n')F(n'+m)=O, n'=n+l mn+ 1. After solving this equation for K(n, m), we calculate K(n, n), given by 1 00 2 -1 + F(2n) + L K(n, n')F(n' + n). [K(n, n)J n'n+l Then the initial value problem is solved in the fonn e- 1Q .- Q . d= [ K(n,n) ] 2 K(n-l,n-l) The solution can be given as dQn/dt=snSn+l' with Sn = K(n-l, n). The simplest case R(z) = 0 yields the multi- soliton solution. For the periodic case also, eigenvalues of the equations LIfJ = J11fJ and dlfJ/dt = BIfJ, under suitable boundary conditions and for certain initial data, give sufficient information to construct a solution to the initial value prob- lem. Such a method of obtaining a general solution for the periodic lattice was developed by E. Date and S. Tanaka, and independently by M. Kac and P. Van Moerbeke. Following Date and Tanaka, the solution can be written in terms of the multi variable theta function, or the Riemann theta function 9, as d .9(exn + (3t + (5) Pn=-log +const., dt .9(ex(n+ 1)+{3t+i5) where ex, {3, and 15 are certain vector constants. There has been much activity recently to- ward interpreting the integrability of the Toda lattice in terms of Lie algebras (B. Kostant [9J). References [IJ E. Fermi, J. Pasta, and S. Ulam, Studies of non-linear problems, Los Alamos Report LA- 1940 (1955); Collected papers of Enrico Fermi, vol. 11, Univ. of Chicago Press, 1965,978-988. [2J M. Toda, Vibration of a chain with non- linear interaction, 1. Phys. Soc. Japan, 22 (1967),431-436. [3J M. Toda, Studies of a nonlinear lattice, Phys. Rep., 18C (1975), 1-123. [4J M. Henon, Integrals of the Toda lattice, Phys. Rev., B9 (1974),1921-1923. [5J H. Flaschka, On the Toda lattice. I, Exis- tence of integrals, Phys. Rev., B9 (1974), 1924- 1925; II, Inverse-scattering solution, Prog. Theor. Phys., 51 (1974),703-716. 1084 [6J E. Date and S. Tanaka, Analogue of in- verse scattering theory for the discrete Hill's equation and exact solutions for the periodic Toda lattice, Prog. Theor. Phys., 55 (1976), 457-465. [7J M. Kac and P. Van Moerbeke, A complete solution of the periodic Toda problem, Proc. Nat. Acad. Sci. US, 72 (1975), 2879-2880. [8J M. Toda, Theory of nonlinear lattices, Springer, 1981. (Original in Japanese, 1978.) [9J B. Kostant, The solution to a generalized Toda lattice and representation theory, Ad- vances in Math., 34 (1979), 195-338. 288 (X1I1.1 0) Nonlinear Ordinary Differential Equations (Global Theory) A. General Remarks Many well-known functions (with the notable exception of the tr -function), such as the ex- ponential, trigonometric, telIiptic, and tau to- morphic functions, satisfy ordinary differential equations of simple forms. For the purpose of finding new transcendental functions, P. Painleve initiated the systematic study of the equation F(x, y, y', .., .in)) = 0 (1) in the complex domain. To investigate the solution in its whole domain of definition, he assumed that F is a polynomial in y, y', ..., in) whose coefficients are analytic in x. Such an equation is called an algebraic differential equation. If F is linear in in), then (1) is written as (n) _ P(x, y, y', ... ,i n - 1 )) y Q( , (n-1» ) ' x,y,y, ...,y (2) where P and Q are polynomials in y, y', ..., i n -') with coefficients that are analytic func- tions of x. Equation (2) is called a rational differential equation. If F is linear in y, y', "', in), i.e., (1) is a linear differential equation, and if the coefficient of in) is 1, then singular points of solutions are situated at the singular points of the coeffi- cients ( 254 Linear Ordinary Differential Equations (Local Theory)). If F is not linear, then singular points of solutions of (1) are divided into two categories, one consisting of those points whose positions are determined by the equation itself and are independent of individual solutions, and the other consisting of those points whose positions depend on the 
1085 choice of particular solutions. In other words, the singularities of the first category appear independently of the choice of arbitrary con- stants involved in the general solution, while those of the second category depend on the choice of the arbitrary constants. The former are called fixed singularities and the latter movable singularities. The linear differential equation (1) has fixed singularities only, which are situated at the singularities of the coeffi- cients. In the same way, tbranch points of solutions can be classified into two kinds, fixed branch points and movable branch points. B. Algebraic Differential Equations of the First Order Consider the equation P(x,y) y'=- Q(x,y)' where P and Q are relatively prime polyno- mials in x, y. The fixed singular points of (3) are defined to be points , ' with the following properties: (i) Q(,y)=O. (ii) Q(',y)¥O, P(',y) =Q(Cy)=O have a root y=r( (iii) If, in (ii), we substitute llz for y and if the same relation (ii) holds for x= and z=O, we count such a value 1 as a fixed singular point. (iv) We transform equation (3) by setting x = lit. If the value t = 0 satisfies (i) or (ii) for this trans- formed equation, we count 00 as a singular point  or ' accordingly. The points , tare, in general, ttranscendental singularities of solu- tions, and the points ' cannot be tessential singularities of solutions but may be tordinary transcendental singularities. A singular point of a solution different from  and ' is an talge- braic singular point, and for any point dis- tinct from  and C equation (3) admits a solution with an algebraic singularity at this point. A necessary and sufficient condition that (3) has no movable branch point is that (3) be a tRiccati equation. Consider the algebraic differential equation of the first order F(x, y, y') = o. After defining the fixed singular points  and ' of (4), where the algebraic function of x, y defined by (4) has bad singularities, Painleve proved that movable singularities of solutions are algebraic. Before Painlev6's work, L. Fuchs gave a necessary and sufficient condition that (4) has no movable branch points, and then H. Poincare showed that if this condition is satis- fied, then (4) is either reducible to the Riccati equation if 9 = 0, integrable by the use of ellip- tic functions if 9 = 1, or algebraically integrable if g> 1, where, with x the independent vari- 288B Nonlinear ODEs (Global Theory) (3) able, 9 denotes the tgenus of the algebraic curve defined by (4). Painleve found that there were gaps in the proofs of Fuchs and Poincare and completed these by proving his theorem and the following one: Let <p(x, Yo, xo) be the solution of (4) satisfying the initial condition y(x o ) = Yo. Let X, Xo be points different from  and ', and let L be a curve connecting Xo to x and not passing through any  or t. If we denote by <PL(X, Yo, xo) the value at x = x of the branch obtained by continuing <p(x, Yo, xo) analytically in a neighborhood of L and re- gard <PL(X, Yo, xo) as a function of Yo, then <PL(X, Yo, xo) coincides, in a neighborhood of every point Yo = b, with several branches of an talgebroid function of Yo. Painleve studied the case when the general solution is finitely many- valued and gave a condition for <p(x, Yo, xo) to be an algebraic function of Yo. When equation (4) does not contain x ex- plicitly, no movable branch points appear if and only if all solutions are single-valued, and then the solutions are expressible in terms of rational, exponential, and elliptic functions. Such an equation is called a Briot-Bouquet differential equation. J. Malmquist proved, by using P. Bou- troux's method of studying the behavior of solutions in the neighborhood of a fixed sin- gularity, that if equation (4) admits at least one solution that has an essential singularity and is finitely many-valued and free from movable branch points around this singularity, then (4) is an equation without movable branch points. If (4) admits a solution that is a finitely many- valued transcendental function, then an alge- braic transformation may be applied to (4) so that it will become an equation without movable branch points. It is an immediate consequence of the first assertion that if (3) admits a solution that has an essential sin- gularity and is finitely many-valued and free from movable branch points around the sin- gularity, then (3) is a Riccati equation. Later, equations (3) and (4) were studied by M. Hukuhara, K. Y osida, T. Sat5, T. Kimura, and T. Matuda. The following results are due to Kimura. If a solution <p(x) of (3) has an essential singularity at x = (, then, in an arbi- trary neighborhood of (, <p(x) assumes every value with the exception of the roots of P(, y) = O. If (3) is not a Riccati equation, it is deter- mined by a finite number of algebraic pro- cesses whether or not (3) admits a solution that has an essential singularity at x = ( and has no movable branch point around (. If (3) admits such a solution, then the singularity is a tlogarithmic branch point. For an essential singularity of a solution there exists, in gen- eral, a direction similar to a tJulia's direction, which was investigated by Hukuhara and (4) 
288C Nonlinear ODEs (Global Theory) Kimura. Matuda studied in detail the behavior of solutions as x tends to  along a half-line and concluded that, except for some special cases, any solution tends to a certain value as x tends to  along a half-line. To obtain algebraic solutions C. Briot and 1. C. Bou- quet devised a method similar to tPuiseux expansion in the theory of algebraic functions. Hukuhara improved their method and suc- ceeded in reducing (3) to several differential equations of standard forms in a neighbor- hood of x =. This enables us to apply the local theory to the global study. H ukuhara's method was used by Kimura and Matuda to obtain the results discussed in this paragraph. C. Algebraic Differential Equations of the Second Order For second-order algebraic differential equa- tions we can pose the same problems as for first-order equations: When do these equations have single-valued or finitely many-valued general solutions? What new transcendental functions are needed to integrate such equa- tions? These problems, studied by E. Picard and Painleve, are difficult because of the exis- tence of movable transcendental singularities. However, painleve succeeded in determining rational differential equations of the second order without movable branch points. Such equations, with the exception of those that are integrated by the use of solutions of the first-order and linear differential equations, can be transformed by rational transforma- tions into one of the following six differential equations: (1) y"=6 y 2+ X , (II) y"=2l +xy+ IX, y'2 y' lXy2+/3 15 y" =---+-+yl +-, y x x y (III) (IV) y'2 3 y 3 f3 y" =-+_+4 xy 2 +2(x 2 -IX)Y+-, 2y 2 y ( 1 I ) Y' y"=y'2 2y + y-l - (Y_I)2 ( f3 ) yy oy(y+l) +- IXY+- +-+ , x 2 Y x y-l y'2 ( 1 1 I ) Y"=T y+ y-l + y-x ( 1 I 1 ) y(y-I)(y-x) -y' -+-+- + x x-I y-x X 2 (X-1)2 x { 1X+ f3 x + y(x -:] + 15X(X-l) } i (y_I)2 (Y_X)2 ' (V) (VI) where IX, Ii, y, and 0 are constants. These equa- 1086 tions and their solutions are called Painleve equations and Painleve transcendental func- tions, respectively. Equation (VI) was dis- covered by B. Gambier, who found an omis- sion in Painlev6's calculations. All solutions of (I) are single-valued, and their properties were investigated by Boutroux. The solutions of (VI) have, in general, logarithmic branch points at x = 0, 1, 00 and were studied by R. Garnier. The case when the equation is of degree 2 with respect to y" was studied by Malmquist and F. Tricomi. The following facts are known concerning movable transcendental singularities of the rational equation y" = P(x, y, y')J(Q(x, y, y'), where P and Q are relatively prime poly- nomials (Kimura). Let p, q be the degrees of P and Q with respect to y'. If p > q + 2, then any solution rp(x) admits no movable essential singularities, but its derivative rp'(x) may admit such singularities. If p > q + 2 and Q is not decomposable as Q1(X,y)Q2(X,y,y'), then neither rp nor rp' admits movable essential singularities. If p  q + 2, then both rp and rp' may have movable essential singularities. If pq+2 and Q is not decomposable as above, then rp' has no movable essential singularities. If, for a solution rp(x), x = a is a movable essen- tial singularity of rp(x) or rp'(x), the rp(x) or rp'(x) assumes all values other than a finite number of exceptional values in an arbitrary neighborhood of x = a. If p > q + 2, then every solution possesses tIversen's property, and hence the set of movable singularities is not a continuum. D. Higher-Order Equations and Other Equations Painlev6's method of obtaining the second- order equations without movable branch points is applicable to higher-order equations. The determination of third-order equations without movable branch points was attempted by Painleve, J. Chazy, and Garnier by the use of this method, but is not yet complete. Chazy studied in detail an equation of the form (1 IJn) ,,2 y'" -, y + b(y)y'y" + C(y)y'3 y and showed that when n = - 2 and b(y) = 0, tFuchsian and tKleinian functions are obtained as solutions ( 32 Automorphic Functions). R. Fuchs, a son of L. Fuchs, derived equa- tion (VI), at almost the saine time as Gam- bier, from the study of tmonodromy groups. He showed that the monodromy group of the 
1087 equation 1 d 2 Z ( rx f3 ')' ;d(2= [2+ (t-1)2 + (t-x)2 b 3 a +-++ t(t-1) 4(t-y)2 t(t-1)(t-x) + t(t-1(t-yJ remains invariant as the singularity x varies if and only if rx, {J, ')', and b remain constant; the singularity y, considered as a function of x, satisfies equation (VI); and a and b are rational functions of x and Y and y'. Investigating a second-order linear differential equation of Fuchsian type with tregular singularities 0, I, 00, xl' ...,X n 'Y1' ...,Yn where Yl' ...,Yn are ap- parent fixed singularities, Garnier was led, under the hypothesis that the monodromy group of this equation remains invariant as Xl' ... ,X n vary, to a tcompletely integrable system of partial differential equations and showed that a symmetric function of Yl, ..., Yn' con- sidered as a function of anyone of XI, .., ,x n , satisfies an equation without movable branch points ( 253 Linear Ordinary Differential Equations (Global Theory)). For nonalgebraic equations, movable branch points appear even in the first-order case. Kimura obtained a sufficient condition that for an equation F(x,y,y')=O, where F is a polynomial of y' with meromorphic coeffi- cients in x, y, every solution has Iversen's property in a domain of the complex plane. It was shown by O. Holder that the ["- function satisfies no algebraic differential equation. Also, if a function meromorphic in the unit circle is of torder 00, then it satisfies no algebraic differential equation. References [1] P. Painleve, Oeuvres de Paul Painleve, 1, 2,3, Centre Nat. Recherche Sci. France,. 1972, 1974,1975. [2] P. Boutroux, Leyons sur les fonctions definies par les equations differentielles du premier ordre, Gauthier-Villars, 1908. [3] E. L. Ince, Ordinary differential equations, Longmans-Green, 1927. [4] L. Bieberbach, Theorie der gewohnlichen Differentialgleichungen, Springer, second edition, 1965. [5] M. Hukuhara, T. Kimura, and T. Matuda, Equations differentielles ordinaires du premier ordre dans Ie champ complexe, Pub!. Math. Soc. Japan, 1961. [6] J. Malmquist, Sur les fonctions a un nombre fini de branches definies par les equa- 289 B Nonlinear ODEs (Local Theory) tions differentielles du premier ordre, Acta Math., 36 (1913), 297-343; 74 (1941),175-196. [7] E. Hille, Ordinary differential equations in the complex domain, Wiley, 1976. 289 (XIII.9) Nonlinear Ordinary Differential Equations (Local Theory) A. General Remarks Consider a system of n differential equations dYj/dx= Jj(X'Yl' ...,Yn), j= 1, ...,n, (I) where the Jj are analytic functions of x, Y 1, ..., Yn' To simplify the notation, we use vector notation y instead of (y l' ... ,Yn)' If all the Jj are holomorphic at a point (x, y) = (a, b), there exists one and only one solution y(x) for (I) such that y->b as x->a and y(x) is holo- morphic at x = a. We say that a point (a, b) is a singular point of the system (1) if it is a sin- gular point of Jj for some j. The well-known tCauchy's existence theorem can no longer be applied to the case when (a, b) is a singular point of system (1). I n this case, the following three problems arise naturally: (i) to determine whether solutions y(x) such that y(x)->b as x ->a exist, and if they exist, to determine the number of independent solutions; (ii) to con- struct analytic expressions for solutions y(x) such that y(x)->b as x->a or, in a slightly more general way, analytic expressions for bounded solutions y(x) such that the values of (x, y(x)) stay in a neighborhood of the singular point (a, b); (iii) to investigate the properties of these solutions. These three problems are called local problems, since only those solutions in a neighborhood of the singular point (a, b) are considered. However, even when n = 1, the study of local problems is very difficult except for the case of singular points of par- ticular types at which the functions Jj are meromorphic. When n> 1, the problem becomes even harder; research on this case lags that for n = 1. In the subsequent discussion we assume with- out loss of generality that a = 0, b = O. B. The Case of a Single Equation Consider the equation dy/dx= Y(x,y)/X(x,y), (2) where X and Yare holomorphic functions of (x,y) at (0,0) x, YEC. When X(O,O)=O and 
289 C Nonlinear ODEs (Local Theory) Y(O,O) #0, we can rewrite equation (2) in the form dx/dy=X(x,y)/Y(x,y), and we see that (i) if X(O,y)¥, 0, equation (2) has one and only one solution y(x) which is algebraic at x=O and tends to ° as x -->0; (ii) if X (0, y) == 0, there is no solution of equation (2) such that y-->O as x -->0. The case where X(O,O)=O and Y(O,O)=O was first studied by C. A. A. Briot and 1. C. Bouquet. In order to obtain algebraic solu- tions they introduced a method similar to tPuiseux expansion in the theory of algebraic functions. A. R. Forsyth and J. Malmquist studied a problem of reduction for equation (2) by using the Briot-Bouquet method. The theory of reduction was completed by M. Hukuhara, who divided a neighborhood of (0,0) into a finite number of subdomains in such a way that (i) the union of these sub- domains covers the given neighborhood of (0,0) completely; (ii) in each of these sub- domains equation (2) takes one of eight canon- ical forms. Hukuhara investigated the prop- erties of the solutions for these canonical forms. Among them, the following two are well known as the Briot-Bouquet differential equations: xdy/dx= f(x,y), f(O,O) =0, x a +1 dy/dx = f(x, y), f(O, 0) = 0, (J  1 is an integer. C. Systems of Differential Equations When y is a vector with components (y), equa- tions (3) and (4) can be written as xdy)dx=f)x,Yt....,Yn), j=l,...,n, (5) x a +1dYj/dx=!i(X'Yl,...,Yn), j=l,...,n. (6) After Briot and Bouquet, the singular points of these types were studied by many authors, including Poincare, E. Picard, H. Dulac, Malmquist, W. J. Trjitzinski, and Hukuhara. A method of constructing solutions of these equations consists of two parts: (i) formally transforming the given equations into simpli- fied or reduced equations of the simplest pos- sible form by applying a formal transforma- tion of the type _"\' k I y- L.,PklX Z or Y . - "\' Pk k, Xko Zk l l ... Z" , J-L., ok,. n j=1,2,...,n; (8) (ii) verifying convergence or the validity of tasymptotic expansions for the formal solu- tions of the given equations, which are ob- tained by substituting bounded solutions of the equations satisfied by z or Zj into (7) or (8). 1088 By studying these analytic expressions, the properties of the sol utions can be clarified. D. Properties of Solutions of Briot-Bouquet Differential Equations For equation (3), the character of the simpli- fied equation depends on the value of A = fy(O, 0). We have the following four cases: (i) }, is neither ° nor negative. A suitable formal transformation (7) changes (3) to xdz/dx=),z+bx A . In particular, if A is not equal to a positive integer, then b=O. The double power series (7) is uniformly convergent. There exists a func- tion rp(x, z) of (x, z) holomorphic at (0,0) such that y= rp(x, xA(b logx + C» is a general solu- tion of (3) with an integration constant C. (ii) ),=0. The equation satisfied by z takes the form xdz/dx =zm+'(b +b'zm), m 1. If b = 0, b' necessarily vanishes and x = ° is a holomorphic point of (3). A general solution is given by (3) Z(x)=(C-mblogx)-l l m, b#O, b'=O, and (4) ( b' ( mb2 1 )) -l l m Z(x)= bO Tlog+C , bb'#O. Here, (= o(t) is the branch (of the inverse function of ( -log ( = t) such that o(t) - tlog t -t-->O as t-->oo. Then there exists a holo- morphic function rp(x, z) of (x, z) for Ixl < 15, Imargz+ argbl <3n/2 -I;, Izi < such that rp(x, Z(x» is a general solution of (3). (iii) A is a negative rational number - j1/v. The equation satisfied by z is written as vx dz/dx = z( - j1 + b(x zv)m + b'(xzv)2m). A general solution has the form Z(x)=x A (C-mblogx)-I/mv, b'=O, and ( b' ( vmb2 1 ) ) -l/mV Z(x)=x A b O Tlog+C , bb'#O. (7) In this case, there exists a holomorphic func- tion rp(x,z) of(x,z) for Imj1argx+mvargz+ argb=tn/21 <n-/;, Ixl <15, Izi < such that rp(x, Z(x» is a general solution of (3). M. Iwano expressed this solution in the form Ij;«x Z(x)T, x,Z(x)), where Ij;(w,x,z) is holomorphic for I arg w =+= I < IT - f., ° < I wi < b, Ixl <, Izi <. In particular, if b =b' =0, rp(x, z) is holomorphic at (0,0). (iv) I. is a negative 
1089 irrational number. The equation in z has the form xdzldx=h The formal transformation (7) may either diverge or converge. Dulac proved that if (7) diverges and if there exists a solution y(x) such that y(x)-->O as x-->O along a suitable path L, then Ix'y(x)p argxl-->00 and Ix'y(x)Pargy(x)1 -->00 as x-->O, XEL for any r:J. and p. However, the existence of such a solution is not yet verified. C. L. Siegel proved that if A satisfies certain inequalities, the formal transformation (7) is divergent. An example such that (7) is divergent was first given by Dulac. A very simple example for such a case was given by Y. Sibuya. In the case f(0, 0) = 0 and ), = 1;,(0,0) =I 0 for (4), the most complete result was obtained by Hukuhara, namely: (i) A suitable formal trans- formation (7) reduces equation (4) to X,,"+1 dzldx = z(r:J. o + Cl: 1 x + ... + r:J. o X ""), r:J. o = ).. A general solution is given by Z(x) = C. x""'eA(x), where A(x) is a polynomial in 1/x of degree (J. (ii) A general solution of (4) is ex- pressed by a uniformly convergent power series of the form L rpk(X)Z(X)k, where the rpk(X) are holomorphic functions of x for a certain sectorial neighborhood of x = 0 and have tasymptotic expansions L PjkXk as x -->0. Assume jj(O, 0, ...,0) = 0 for equations (5). Put Ajk = OjjIOYk(O, 0, ...,0). Denote by AI, ..., )'n the eigenvalues of an n x n matrix with elements {).jk}' Then (i) if an angle w can be chosen so that for some m  n, all of I wi, larg)'I-wl, ...,Iarg)'m-wl are less than n12, equations (5) possess solutions that are expressed as uniformly convergent (m + 1)- tuple power series of x, ZI (x), ..., Zm(x): "\' P X ko Z k, Z km l..J jkokl...km 1 ... m' Here the Zk(X) are general solutions of the simplified equations and have the expression Zk(X)=X I ' k (Ck+a polynomial of Cj,"', Ck_"logx), k= 1,2, ...,m. (ii) Moreover, Iwano extended the result of (i) as follows: When there exists one and only one zero among the other n - m eigenvalues, equa- tions (5) have solutions, depending on m+ 1 arbitrary constants, that are expressed as (m+ 1 )-tuple uniformly convergent power series LPj k ok,...k)Zm+1 (x))x ko ZI (X)k, ... Zm(x)k m . Here the coefficients Pjkok,...kJZm+d are holo- morphic functions of Zm+1 in a sectorial neigh- borhood of Zm+1 = 0 and admit asymptotic expansions in powers of zm+l as zm+l -->0. In this case, the functions Z 1 (x), .., ,Zm+ I (x) can- 289 E Nonlinear ODEs (Local Theory) not generally be integrated by quadratures, but as Zm+1 (x)-->O, the power series expansions coincide with the expressions obtained in (i). (iii) In the case when the Jacobian matrix (Ajk) is the zero matrix, Iwano constructed, under additional assumptions, a convergent analytic expression of a general solution for equations (5). Let the right-hand side of (6) be holomor- phic at (0,0, ...,0). In 1939 Trjitzinski proved the existence of solutions that admit asymp- totic expansions in powers of n arbitrary con- stants. In 1940 and 1941, Malmquist proved, under strong conditions, the existence of solutions that are expressed as uniformly convergent power series of Z.(x), ..., Zp(x): L Pjk, kp(X) Z.(xt'... Zp(X)k P . Here the Zk(X) are polynomials of x and log x of the form Zdx)=eAk(x)x},x (Ck+a polynomial of Ck-j, ..., CO' log x), k = r:J., ... ,f, where the coefficients admit asymptotic expan- sions in powers of x. Trjitzinski's result is contained in Malmquist's result as a special case. On the other hand, under much weaker assumptions than Malmquist's, Hukuhara solved a problem on the formal simplification of equations (6) and formal solutions. Iwano improved Hukuhara's result on formal solu- tions and discussed the convergence of formal solutions under weaker conditions than Malm- quist's (Ann. Mat. Pura Appl., 1957, 1959). The Pjk x kp(X) are holomorphic functions of x in a sectorial neighborhood of x = 0 and admit asymptotic expansions in powers of x as x -->0. The angle of the sector in which the asymptotic expansions are valid is largest for Iwano's method. If L (Jj < n, eq uations of the form x,,"jdYjldx = jj(x, y" ..., Yn), j = 1, "', n, possess at least n - L (Jj solutions that are holomorphic for Ixl <15 (R. W. Bass, Amer. J. Math., 77 (1955)). This result is analogous to those obtained by O. Perron, F. Lettenmyer, and Hukuhara and Iwano in the linear case ( 254 Linear Ordinary Differential Equa- tions (Local Theory)). E. Singular Perturbations The terms on the right-hand side of the non- linear differential equations <.""JdYjldx= jj(X,YI, ...,Yn,<'), j= 1,2, ...,n, (9) are holomorphic functions of(x,y,<.) for Ixl <a, Ilyll <b, 0<1<.1 <C, larg<.1 <d, and admit uni- formly convergent expansions in powers of Y with coefficients asymptotically developable in 
289 Ref. Nonlinear ODEs (Local Theory) powers of E. The (Jj are nonnegative integers. W. R. Wasow, W. A. Harris, Sibuya, and Iwano and T. Saito discussed problems on constructing asymptotic or convergent expan- sions for bounded solutions that are depen- dent on several arbitrary constants. In equations (9), the hand 0h/OYk are con- tinuous functions of (x, y, E) for -00 < x < +00, Ilyll < b, lEI < c, and periodic functions of period T with respect to x. Moreover, assume that a system of degenerate algebraic equations 0= h(x,yr. ...,Yn,O), j= 1, ...,n, (10) has a periodic solution Yj= p)x) of period T for -00 <x < +00. Then if equations (9) have periodic solutions Yj = p)x, E) of period T such that Yr->p)x) as E-->O, the Pj(x, E) are called singular perturbations of p)x) for equations (9). Concerning this problem, see I. M. Volk (Prikl. Mat. Mekh. SSSR, 10 (1946)). The work of Wasow (1950) on a single equation E a in) = f(x, y, y', "', i"'), E), (J > 0, n > m  0, (11 ) is remarkable. References [1] H. Dulac, Points singuliers des equa- tions differentielles, Memor. Sci. Math., 61, Gauthier- Villars, 1934. [2] E. Picard, Traite d'analyse III, Gauthier- Villars, 1896. [3] E. L. Ince, Ordinary differential equations, Dover reprint, 1956. [4] M. Hukuhara, T. Kimura, and T. Matuda, Equations differentielles ordinaires du premier ordre dans Ie champ complex, Math. Soc. Japan, 1961. [5] E. Hille, Ordinary differential equations in the complex domain, Wiley, 1976. 290 (XIII.11) Nonlinear Oscillation A. General Remarks By nonlinear oscillation we usually mean oscil- lation described by periodic or talmost peri- odic solutions of nonlinear ordinary differential equations. The theory of nonlinear oscillation is sometimes called nonlinear mechanics. In connection with oscillations in tdynamical systems and electrical circuits, the theory of nonlinear oscillation has been studied inten- sively in the Soviet Union under the direction 1090 ofN. M. Krylov and N. N. Bogolyubov since aLound 1930, and after World War II research in this field became active in Western countries also. Written as first-order systems, the differen- tial equations in this theory take one of the forms dx/dt = X(x) (1) or dx(dt=X(x,t), (2) where x is a vector and t is a scalar. A dif- feren tial eq uation of the form (1) is said to be autonomous. In a differential equation of the form (2), X(x, t) is usually assumed to be periodic or almost periodic in 1. In the former case, the differential equation (2) is said to be periodic, and in the latter case, almost periodic. Oscillations in physical systems are described mostly by periodic or almost periodic dif- ferential equations; therefore it is a principal problem in the theory of nonlinear oscillation to find a periodic or almost periodic solution of these differential equations. However, an oscillation described by a solution of a dif- ferential equation can be actually realized only when the solution is tstable ( 394 Stability) under a small variation of the initial value. Therefore it is important to investigate the stability of periodic or almost periodic solu- tions. In view of the fact that an actual phe- nomenon may be only approximately de- scribed by mathematical equations, sometimes it is necessary to require a certain stability of the system so that solutions of a peri- odic or almost periodic equation stay stable under a small variation of the equation itself. Such stability is called structural stability, the investigation of which is also important. It may happen that a differential equation possesses neither a periodic solution nor an almost periodic solution, but that it has an almost periodic tintegral manifold (i.e., a mani- fold x=f(t,O) in tx-space, where 0 is a para- meter, such that f(t, 0) is periodic or almost periodic in t and periodic in 0) containing the ttrajectory of the differential equation passing through an arbitrary point of the manifold. In this case, we can consider that a solution corresponding to a trajectory lying on the manifold describes an oscillation. Therefore it is important to find a periodic or almost periodic integral manifold of a differential equation and to investigate the stability of such a manifold. The methods used most frequently in re- search on nonlinear oscillation are: (i) geo- metric methods, (ii) analytic methods, and (iii) numerical methods. 
1091 B. Linear Oscillations Let SI and Si be the spaces of all w-periodic solutions of the w-periodic linear system dx/dt=A(t)x, A(t+w)=A(t), and its adjoint system, respectively. Then the space Pro of continuous w-periodic functions R  Rn has direct sum decompositions Pro = SI +S2 and Pro =Si + Sf so that (x, y)=O for every XESj, YES 2 or XESi, YESf, where (x,y) =(I/w)Sg't x (s)y(s)ds. Let gr, ...,rn} and {1]1, ...,1]rn} (m may be 0) be bases of SI and Si orthonormal with respect to (', '). Then for pE Pro there is a unique w-periodic solution x of dx/dt = A(t)x + p(t) belonging to S2 under the condition (1]\ p) = 0 (k = 1, ..., m), which is represented in the form x= G[pJ by a bounded linear operator G:Pro Pro. Also, if (3) and hence its adjoint system have no nontrivial w-periodic solution, then (4) always has a unique w-periodic solution gi ven by x = G [p], and this is true even if the w-periodicity is replaced by almost periodicity. The almost periodic system (3) is said to be regular if (4) has almost periodic solution for an arbitrary almost periodic function p(t). A necessary and sufficient condition for (3) to be regular is that (3) induce an exponential di- chotomy, that is, the solution space S of (3) have a direct sum decomposition S=S_ +S+ such that I x(t)1  Me -rlt-s1Ix(s)1 holds for -00 <st<oo ifxES+ and for -00 <ts< 00 if XES_, where M and yare positive constants. An autonomous (resp. periodic) system (3) is regular if and only if no tcharacteristic roots (resp. characteristic exponents) have zero real parts. C. Geometric Methods Geometric methods are used frequently for finding a periodic solution in an autonomous or periodic case. In the autonomous case (1), a periodic solution describes a closed orbit (t is a parameter of a curve) in x-space, which is usually called a phase space. The geometric method is used to show the existence and the stability of a closed orbit by investigating geometrically the behavior of orbits in the phase space. In such an approach, the prop- erties oPcritical points and tlimit sets ( 126 Dynamical Systems) are utilized frequently. This method is effective especially for 2- dimensional cases on the basis of the t Poincare- Bendixson theorem, and various results are given for a generalized Li{mard's (or Duffing's) 290D Nonlinear Oscillation (3) differential equation x + j(x)x + g(x)=O, which includes the van der Pol differential equation x-A(I-x 2 )x+x=0 [1,2]. In the periodic case (2), let w( > 0) be a period of X(x, t) with respect to t and x = rp(t, IX) be a solution of (2) such that rp(O, IX) = IX. Then a periodic solution of (2) is given by x = rp(t, IX o ), where IXo satisfies rp(w, IXo) = IXo' Thus the existence of a periodic solution can be shown by investigating geometrically the existence of a tfixed point of the mapping x x' = rp(w, x) in the phase space. In such an approach, tBrouwer's fixed-point theorem is utilized frequently. In the mapping xx', it may happen that there is no fixed point but that there exists an tin variant manifold. In this case, we get a periodic integral manifold. Geometric methods give information on qualitative properties, but usually not on quantitative properties like the shape of an oscillation. Thus these methods are in general not sufficient for the analysis of the phenom- ena met in practice. (4) D. Analytic Methods At present analytic methods are used most frequently in the study of nonlinear oscilla- tions because, in comparison with geometric methods, they enable us to get many quantita- tive results in addition to the qualitative ones. However, these methods are usually efficient only for weakly nonlinear differential equations, that is, differential equations differing only slightly from linear differential equations (gen- eral nonlinear differential equations are called sometimes strongly nonlinear differential equa- tions). In this sense, analytic methods are all tperturbation methods in the wider sense [3J, and the variety of the methods lies in the form of perturbation and the method of calculation. To make use of analytic methods, we always reduce the given differential equation to a differential equation of the form x = Ax + £X(x, t, G). Here £ is a parameter with small absolute value, A is a matrix of the form A = diag(Op, B), where Op is a p x p zero matrix and B is a matrix whose eigenvalues have all nonzero real parts, and X(x, t, £) is periodic or almost periodic in t. (i) When X(x, t, £) is periodic in t, Poincare's perturbation method is used frequently. (ii) In practical problems, we frequently meet the case A =0, that is, the case where (5) is of the form (5) x = £X(x, t, G). (6) 
290E Nonlinear Oscillation In this case, the average Xo(x, £) = limTo(I/T)SlX(x, t, £)dt exists. If dx/dt = Xo(x,O) has a periodic solution (t) and the related variational linear system is regular, then (6) has an (almost) periodic integral mani- fold x = f,(t, 0) such that f,(t, O)->(e) uniformly as £->0. The method of averaging based on this fact was devised by Bogolyubov and Mitropol'skii [4] and is used frequently. The equation X + OJ2 x =£f(x, X, t,£) is one of the examples that can be reduced to an equation of the form (6). Equation (7) ap- pears frequently in practical problems, and hence various convenient techniques are de- vised, such as the method of linearization, the asymptotic method, the method of harmonic balance [4], etc., through which we can apply the method of averaging directly to the given equation (7). (iii) Consider an OJ-periodic system dx/dt = A(t)x + f(x, t). For any X(t)E P w ' dx/dt = A(t)x + f(x(t), t) is of the form (4) and it has an OJ-periodic solution Lkl ake + G[Nx], or Tx = kl ake + G l Nx - kl (1]\ NX)1]k J (9) under the condition (1]\ Nx)=O (k = 1,..., m) as in Section B, where N: Pw->Pw is defined by Nx = f(x('), .), and the OJ-periodic solutions of (8) correspond to the fixed points of the trans- formation T:Pw->Pw induced by (9), where the a k in (9) are given by a k = (\ x), as is required when x is a fixed point. This is the alternative or bifurcation method [6-9], which is extend- able to the case of almost periodic systems under the regularity of (3) [10]. In order to obtain a fixed point of T, various kinds of fixed point theorems can be utilized. (iv) In Section C and (iii) above, the search for a fixed point of an appropriate mapping is a principal device. However, the choice of a suitable domain for the mapping is a crucial problem, and usually an a priori bound for solutions is looked for. The concept of stability and hence Lyapunov's second method are effective in such situations [11]. E. Numerical Methods Numerical methods are used for obtaining explicit forms of the oscillations. They are convenient in practical applications, since they can be used efficiently whether or not the nonlinearity of the system is weak. For an autonomous or periodic system, one can uti- lize the following methods as efficient means 1092 (7) of computing periodic solutions: Newton's iterative method, the finite-element method, the Lindstedt-Poincare method, the method of multiple scales, the method of harmonic bal- ance, the Galerkin method, etc. [12-15]. For weakly nonlinear cases, analytic meth- ods (that is, perturbation methods) are very efficient. However, when a parameter is fixed beforehand, it is not easy to know whether the conclusion obtained by the perturba- tion method is valid for the given value. For strongly nonlinear cases, it is very difficult to analyze the problems by analytic methods, and at present hardly any efficient 111ethods exist. Numerical methods win therefore be- come exceedingly important in research on nonlinear oscillations. These are now of toler- able efficiency and reliability, due to progress in high-speed machine computation. F. Nonstationary Oscillations (8) Physically speaking, an oscillation is a station- ary state. However, when a system contains a parameter varying slowly with time, the oscil- lation also varies slowly with time (for exam- ple, when the length of a pendulum varies slowly, the amplitude of the pendulum also varies slowly). Such variation of oscilIations in the course of time is represented by dx/dt = £f(x, t, a, £), where f(x, t, s, £) is (almost) peri- odic in t, s. Such cases provide important problems in the theory of nonlinear oscilla- tions; one such problem has been posed by Mitropol'skii [16] under the name nonstation- ary oscillations and has been investigated by means of the method of multiple scales [15]. References [1] G. Sansone and R. Conti, Nonlinear dif- ferential equations, Pergamon, 1964. (Original in Italian, 1956.) [2] S. Lefschetz, Differential equations: Geo- metric theory, Interscience, 1959. [3] G. E. O. Giacaglia, Perturbation methods in non-linear systems, Springer, 1972. [4] N. N. Bogolyubov and Yu. A. Mitro- pol'skii, Asymptotic methods in the theory of nonlinear oscillations, Gordon & Breach, 1961. (Original in Russian, 1958.) [5] N. Minorsky, Nonlinear oscillations, Van Nostrand, 1962. [6] 1. K. Hale, Ordinary differential equations, Wiley, 1969. [7] M. A. Krasnosel'skii, Translation along trajectories of differential equations, Amer. Math. Soc., 1968. (Original in Russian, 1966.) 
1093 [8J L. Cesari, R. Kannan, and J. D. Schuur, Nonlinear functional analysis and differential equations, Dekker, 1976. [9J N. Rouche and J. Mawhin, Equations differentielles ordinaires II, Masson, 1973. [IOJ M. A. Krasnosel'skii, V. Sh. Burd, and Yu. S. Kolesov, Nonlinear almost periodic oscillations, Wiley, 1973. (Original in Russian, 1970.) [11J T. Yoshizawa, Stability theory and the existence of periodic solutions and almost periodic solutions, Springer, 1975. [12J C. Hayashi, Nonlinear oscillations in physical systems, McGraw-Hill, 1964. [13J A. Andronov, A. Vitt, and S. Khaikin, Theory of oscillations, Addison-Wesley, 1966. (Original in Russian, 1959.) [14J M. Urabe, Nonlinear autonomous oscil- lations, Academic Press, 1967. [15J A. H. Nayfeh and D. T. Mook, Nonlinear oscillations, Wiley, 1979. [16J Yu. A. Mitropol'skii, Problems of the asymptotic theory of non stationary vibrations, Daniel, 1965. (Original in Russian, 1964.) 291 (XIII.12) Nonlinear Problems A. General Remarks Nonlinear problems deal with nonlinear map- pings or operators and the related equations. Until recently it was customary to consider nonlinear problems as belonging to applied mathematics and the physical sciences. How- ever, nonlinear problems now belong to mod- ern mathematics. Many phenomena in math- ematical physics are essentially described by nonlinear equations, e.g., the motions of several particles or of viscous or compressible fluids ( 420 Three-Body Problem, 204 Hy- drodynamical Equations). Some of these equa- tions are approximated by linear equations only when the variables appearing in the equa- tions are restricted to very small domains; they are treated by perturbation methods when the variables stay in comparatively small domains. If these requirements cannot be met and we have to deal with equations in which the variation of the variables are not negli- gible, nonlinear problems certainly arise. The methods of solution of nonlinear prob- lems are not as powerful or general as those for linear differential equations. For instance, the tprinciple of superposition of solutions does not hold for nonlinear problems, and therefore Fourier methods are no longer ap- plicable. Indeed, some nonlinear problems can 291 C Nonlinear Problems be dealt with only by means of very particular or ad hoc devices. B. Methods Used in Nonlinear Problems Consider a nonlinear equation G(x)=O, (1 ) where G is a nonlinear mapping or operator of a subset S of a linear space X into itself. If we put G=I -F (I is the identity), equation (1) becomes x=F(x). (2) Then a solution of (2) is a fixed point of F. Therefore fixed-point theorems of various kinds are useful for solving (2) ( 286 Nonlinear Functional Analysis). Let x(O) E S, and suppose that we can define X(k), k= 1,2, ..., by x(k)=F(x(k-1»), k= 1,2,.... If F is continuous and the sequence X(k) con- verges to a point XES, then x is a fixed point of (2). Such a method of constructing an ap- proximate sequence by iteration is called the iterative method. tNewton's iterative process, given by X(k) = X(k-1) _ [G'(X(k-1»)] -1 G(X(k-l»), is one such method, where G' denotes the tFrechet derivative of G. If X is an infinite-dimensional space, many concepts and methods of the theory of func- tional analysis can be used for a number of nonlinear problems ( 286 Nonlinear Func- tional Analysis). We note that there exist nonlinear trans- formations that change nonlinear equations into linear ones. For example, the thodograph method, which is often applied in hydrody- namics, consists of reducing a system of tquasi- linear partial differential equations of the form Ai(u, v)u x + Bi(u, v)u y + Ci(u, v)v x + Di(u, v)v y = 0 (i= 1,2) to linear differential equations Aiy,,-BiX,,-CiYu+DiXu=O (i=1,2) by means of the thodograph transformation, which changes the independent variables from x, y to u, v ( 205 Hydrodynamics). C. Nonlinear Algebraic and Transcendental Equations Consider a system of equations [,(x 1 , ...,xn)=O (i= 1, ...,n). (3) 
291 D Nonlinear Problems Newton's iterative process can be applied as follows. Starting with a point x(O) = (x\O), ... , xO)) lying near the desired solution, we define X(k)= (X\k), ..., Xk)), k = 1,2, ..., by solving the system of equations nor. I 2L(x(k-l))(XY)_xjk-1))+ Jj(x(k-J)) =0 iJ aXi (j= 1, ...,n). Under some conditions the iteration {X(k)} converges to the solution ( 301 Numerical Solution of Algebraic Equations). If the system (3) is a real one, (3) is equiva- lent to l:J? = o. It is clear that a method of obtaining a minimum of a function f(x J , "', x n ) is applicable to solving l:J?=0. Taking a point x(O), we define Xk=X(k-l)-Ak_JVf(x(k-l)), k= 1,2, ..., where Vf = (of/ax!, "', of/ax"), and let Ak-l satisfy f(x(k-)) -Ak-l Vf(X(k-l))) <,J(X(H) - AVf(x(k-)))) (AO). Under suitable assumptions, a subsequence X(k j ) converges to x such that Vf(x) = 0 and f(X(kj)) decreases monotonically to f(x) [1]. Let f be a continuous mapping from a domain D of R n or C" into itself. Then for any x(O)ED, the iteration X(k) can always be defined by X(k) = f(X(k-J)). The problem of the behavior of X(k) is an interesting and important one in pure mathematics; recent work in the physical sciences is yielding many new concepts related to this problem ( 126 Dynamical Systems, 433 Turbulence and Chaos). D. Nonlinear Differential Equations Consider a nonlinear system of ordinary dif- ferential equations dx/dt=f(t,x) (XERn). The initial value problem with initial condi- tion x("t) =  is equivalent to the problem of solving the nonlinear integral equation x(t)=+ ff(S,X(S))dS. The solution of (4) is a fixed point of the operator T: rp(t) f-->  + S;f(s, rp(s)) ds defined for a suitable function space. Therefore fixed- point theorems are applicable to T, and the iterative method for T is called the tmethod of successive approximation. Also the method of the tCauchy polygon is useful ( 316 Ordi- nary Differential Equations (Initial Value Problems)). These ideas are extended to an 1094 abstract Cauchy problem, du/dt=A(t)u (t>O), u( +O)=a, in a Banach space X, where aEX and A(t) is a nonlinear operator ( 286 Nonlinear Func- tional Analysis). For extensive studies of nonlinear ordinary and partial differential equations  314 Ordi- nary Differential Equations (Asymptotic Behavior of Solutions), 290 Nonlinear Oscil- lation, 394 Stability, 321 Partial Differential Equations (Initial Value Problems), 323 Par- tial Differential Equations of Elliptic Type, 325 Partial Differential Equations of Hyper- bolic Type. Nonlinear differential equations of special types appear in many fields of pure and ap- plied mathematics, e.g., the Monge-Ampere equation and the equation for minimal sur- faces in differential geometry ( 183 Global Analysis, 275 Minimal Submanifolds), and the Toda lattice equation and the Korteweg- de Vries equation in mathematical physics ( 287 Nonlinear Lattice Dynamics, 387 Solitons). E. Nonlinear Problems of Control Systems (4) The basic equation for a tcontrol system in which the state of the controlled object can be represented by an n-vector x is given by the following system of differential equations [3]: x=Ax-rp(O")b, =rp(O"), O"=c'x-y( (5) where x and  stand for dx/dt and d/dt, re- spectively,  is a scalar function representing the control, b, c are constant n-vectors, y is a constant number, c' is the transpose of c, and A is a constant n x n matrix whose character- istic roots have negative real parts. Further- more, we assume that when the control has no effect upon the system, x is determined by x= Ax. The quantities under consideration are all real. Finally, the function rp = rp(O") is a scalar function characteristic of the control mechanism. Generally, rp is nonlinear in 0", and hence equation (5) is nonlinear. Normally we assume that rp has the following properties: (i) rp(O") is a real-valued continuous function on (-00,00) with rp(O)=O and O"rp(O"»O for 0"=1- 0; (ii) Sl'oo rp(O") dO" = +00. We say that (5) is absolutely stable if x (t)...... 0, W)......O as t......+oo for any choice of rp subject to (i) and (ii) and for every solution of (5). In the study of control systems an important problem is to obtain a necessary and sufficient condition for the system to be absolutely stable. In this connec- tion, we have the following result, due to M. v. 
1095 Popov: (5) is absolutely stable if there exists a nonnegative q such that Re{(1 + iwq)(c'(iwl-A)-I b)} + qy;;'O (6) for any real w, where I is the n x n identity matrix. Con versely, if the absolute stability of (5) is given by means of the tLyapunov func- tion ( 394 Stability) V(x,(J)=x'BX+Ct.(J2+ rx f'X+fJ f: 1fi«(J)d(J, where B is a constant matrix and f is a con- stant vector, then there exists a nonnegative q for which (6) is satisfied for all real w. F. Nonlinear Equations in Applied Mathematics Some examples of nonlinear problems are given here ( also 205 Hydrodynamics; 318 Oscillations). (1) The nonlinear tdifferential-difference equation du(t)ldt=(a-u(t-l))u(t) is called the Cherwell-Wright differential equa- tion [4]. Given the initial condition u(t) = g(t) (Ot 1, g(t) is a given continuous function), its solution is uniquely determined for O t < 00. If aO and g(I»O, then u(t).....O as t..... 00; if a =0 and g(l) < 0, then u(t)..... -00 as t..... 00. For a> 0, u(t)..... -00 as t..... 00 if g(1) < 0, while u(t) either approaches a monotonically (0< a  lie) or oscillates (boundedly) around a (a> lie) if g(I» O. In particular, we have damping oscillations for a  3/2, while oscilla- tory solutions without damping appear for a> n12. (2) If we regard a star as a gas sphere and assume the polytropic relation p = Kp Y (K and yare constants) between the pressure p and the density p at each point inside the star, then we have a differential equation of the second order that determines the density distribu- tion. This equation constitutes the basis of the classical theory of the internal structure of the stars and is called Emden's differential equa- tion or the polytropic differential equation. It reads: (1/e)d(edOld)ld= _on, where y= 1 + 11n, p=J..on, r=rx, rx=«n+ 1)KJ..Y- 2 /(4nG))1/2, with r the distance from the center, G the universal gravitational con- stant, and A an arbitrary constant. The solu- tion that satisfies the conditions 0 = 1 and dOld= 1 at =O is called the Lane-Emden function of index n. Emden's equation is in- variant under the transformation .....A, 0..... A -2/('-')0 (with A an arbitrary constant). 291 Ref. Nonlinear Problems Although Emden's equation can be reduced to an equation of the first order, it has not been possible to solve it analytically except for the cases n=O, 1, and 5. For certain values ofn between 0.5 and 6, Emden gave numerical solutions, which were refined later by Green, D. H. Sadler, and D. C. Miller [5]. (3) CaianielIo's differential equations, which describe the state of a network of neurons [6J, are xi(t+r)= Y[ a\.?xj(t-rr)-oiJ where the function xi(t) represents the state of the ith neuron at the time t and takes only the values 0 and 1. The state of the system is to be considered at the discrete times t = 0, r, 2r, .., . The (real) coefficient a\.? represents the weight of the effect of hysteresis in the relay process from the ceIlj to the cell i. The nonnegative integer Oi is the threshold value of the cell i. Y[xJ is the unit step function that is equal to 1 for x;;, 0 and vanishes for x < O. (4) The following Hodgkin-Huxley differen- tial equation arises in the study of conduction and excitation in nerve systems [7]: ij2 V 2r ( 0 V ox2 = R: C oar +g l m 3 h(V- VI) +g2 n4 (V- V 2 )+g3(V- V 3 )} om ar= -(rxdV)+fJdV))m+rxdV), iJh -:;-= -(rx 2 ( V) + fJ2(V))h + rx 2 (V), ot on -= -(rx 3 (V)+fJ3(V))n+rx 3 (V), ot where rx i , fJi (1  i  3) are given functions of V, and ro, R o , Co, gj, Vi (1  i  3) are constants. The unknown function V is sought in the domain 0 < x < 00, 0 < t < 00, while the initial values of V, m, h, n, and the boundary value of V are given at t = 0 and x = 0, respectively. References [IJ T. L. Saaty and J. Bram, Nonlinear mathe- matics, McGraw-Hill, 1964 (Dover, 1981). [2J T. L. Saaty, Modern nonlinear equations, McGraw-Hili, 1967 (Dover, 1981). [3J S. Lefschetz, Stability of nonlinear control systems, Academic Press, 1965. [4J S. Kakutani and L. Markus, On the non- linear difference-differential equation y'(t) = (A - By(t - r))y(t), Contributions to the theory of nonlinear oscillations, Ann. Math. Studies, Princeton Univ. Press, 1958, vol. 4,1-18. 
292 A Nonlinear Programming [5] R. Bellman, Stability theory of differential equations, McGraw-Hill, 1953. [6] E. R. Caianiello, Outline of a theory of thought-process and thinking machines, J. Theoret. BioI., I (1961),204-235. [7] A. L. Hodgkin and A. F. Huxley, A quan- titative description of membrane current and its application to conduction and excitation in nerve, 1. Physiol. (London), 117 (1952), 500- 544. 292 (XIX.4) Nonlinear Programming A. Problems A nonlinear programming problem is a type of mathematical programming problem where it is required to minimize or maximize a non- linear function O(x) of n-vector variable x defined in a closed connected set XO with a Set of linear or nonlinear constraints. Minimiza- tion or maximization of a continuously dif- ferentiable function O(x) under the equality condition expressed by a set of continuously differentiable functions has been traditionally dealt with by the method of Lagrange multi- pliers. Hence a typical nonlinear programming problem is usually formulated as follows. (NLP) Minimize O(x) under the condition XEXo e R n and g;(x) O for i= 1,2,... ,m. Or, equivalently, determine the set of all x such that O(x)=minxEcO(x), where C= {XIXEXO and g(x)O}. Here we need only consider minimization, since maximization problems can be converted to minimization problems by virtue of the obvious relation maxO= -min( -0). The set C is known as the feasible region or the constraint set, and x is called an optimal solution or simply a solution. In many non- linear programming problems XO is R n . If XO = Rn and 0 and 9 are linear functions on Rn, then the problem becomes a linear program- ming problem ( 255 Linear Programming). The problem of minimizing a quadratic func- tion subject to linear constraints is called the quadratic programming problem ( 349 Qua- dratic Programming). If XO is a convex set, 0 is convex (or concave), and the g; are convex on XO, then the minimization (or maximiza- tion) problem is known as a convex (or con- cave) programming problem. Convex and con- cave functions are important in nonlinear programming, because they admit reason- ably straightforward sufficient conditions for optimality and also because they constitute the only important class of functions for which 1096 necessary optimality conditions can be given without the differentiability condition. Subject to suitable modifications, the method of La- grange multipliers can also be applied to the solution of nonlinear programming problems. The Lagrangian function Ij; associated with the minimization problem (NLP) is defined by Ij;(x, u)=O(x)+u'g(x), where u = (u l' ... , Urn) and u' denotes the vector of Lagrange multipliers. A pair (x, u) is called a saddle point of Ij;(x, u), provided XEXO, ueRm, u;;,O, and Ij;(x, u) Ij;(x, U)Ij;(x, u) for all XEXo and all uER m such that u;;' O. It follows easily that: (1) (H. Uzawa [18]) If(x,u) is a saddle point of Ij;(x, u), then x is an optimal solution. (2) (Kuhn and Tucker [11]) Assume that XO is open and convex, and that 0 and 9 are dif- ferentiable and convex. If there exists a pair (x, u) such that VO(x)+ u'Vg(x) = 0, HC, u'g(x)=O, u;;'O (where V8(x) and Vg(x) denote, respectively, the gradient vector of 0 at x and the Jacobian matrix of 9 at x), then x is an optimal solution. B. Necessary Conditions for Optimality In Section A sufficient conditions for opti- mality were given. Some necessary conditions are also known. (3) When O(x) is continuously differentiable and all the gi are linear, that is, when the con- dition is expressed as Ax  b and x;;, 0, where A is an m x n matrix and b an m-vector, x is an optimal solution only if x is an optimal solution of the following linear programming problem. (Lpo) Minimize v'x under the condition that Axb, x;;' 0, where v=VO(x). By applying the duality theorem of linear programming, it can be proved that there exists a vector u;;,O such that v rp(x, u) = VO(x) + u' A = O. (4) (F. John [9]) Assume that XO is open, and that 0 and 9 are differentiable. Then, if x is a solution of the problem (NLP), there exist u o ;;' 0 and UE R m such that UoVO(x) +u'Vg(x)=O, g(x)O, u'g(x) = 0, u;;,O. Now we consider the following: The vector- valued function 9 is said to satisfy Guignard's constraint qualification at an inner point x of XO if any vector y satisfying the linear inequal- ities Vgi(x)' y O for iE I = {ilg;(x) = O} is in the convex hull spanned by the vectors tangent to the set C at x. 
1097 (5) (Guignard [7J) Let XO be an open set, let x be an optimal solution of (NLP), let 0 and 9 be differentiable at X, and assume that 9 satis- fies the Guignard constraint qualification. Then there exists u E R m such that V8(x) + u'Vg(x) = 0, g(x)O, and u'g(x)=O, u?:O. Guignard's constraint qualification is satis- fied if neither of the following conditions hold: (L) Vectors gi(X) for iEI are linearly independent. (S) (Slater's constraint qualification) The gi are convex and XO is a convex set; and there exists a vector x such that gi(X) < ° for all i. With convexity of 8 and gi, differentiability is not required: (6) (Kuhn and Tucker [IIJ) Let XO be a convex set, let 0 and 9 be convex on Xu, and assume that 9 satisfies Slater's constraint qualification. If x is an optimal solution, then there exists uER m , u?:O, such that u'g(x) = ° and (x, u) is a saddle point of Ij; (x, v) = 8(x) + u' g(x). C. Sensitivity Analysis Now consider the following class of problems. (Pc) Minimize the function 8(x) under the condition that g(x)  c, where c is a real m- vector. We denote by Xc the set of the solutions of the problem (Pc) and denote 0: = 8(x c ), xcE XC' Suppose that Guignard's condition is satisfied for each c and that the set of La- grange multipliers Ac is nonempty. Then for any real vector a, we have 1 infa'uclim-(8c*+I -8c*)supa'uc. UcEAc. 1-0 t " Therefore, if the Lagrange multiplier is unique- ly determined for some c, then U c represents the vector of the rates of increments of the objective function to the small increments of the components of the constraints vector. Hence the components of the Lagrange- multiplier vector are called the imputed prices or shadow costs of the constramts; these have important economic ;mpiications, espe- cially when the objective function is expressed in terms of money or profits. More generally, we can consider the follow- ing class of problems. (GPc): Minimize the function O(x, c) under the condition that g(x, c)  0, where c is a real parameter. Denote by x(c) and u(c) the solution and the Lagrange multiplier of (G Pc) corresponding to the parameter c, respectively, and let O*(c) = 8(x(c)). Then under a set of regularity condi- tions we have V8*(c)= VcO(x(c), c) + u(c)'Vcg(x(c), c), 292 D Nonlinear Programming where V c denotes the gradient of 8 or 9 with respect to the parameter. D. Duality A duality theorem in mathematical program- ming is the statement of a certain relationship between two problems. This relationship has the following two aspects: (i) one problem is a constrained minimization problem and the other is a constrained maximization problem; (ii) the existence of a solution to one of these problems ensures the existence of a solution to the other, and in this case their respective values are equal. Let Ij;(x, u) be the Lagrangian form of (NLP), and define w(u)=infxExolj;(x,u). Then we can state the following two problems. (P) (primary problem) Miminize 8(x) under the condition that x E XU and g(x)  ° (D) (dual problem) Maximize w(u) under the condition u?: 0. If (x, u) gives a saddle point of Ij;(x, u), then infO(x)= B(x)= supw(u) = w(u) = Ij;(x, u). The dual problem can be formulated alterna- tively as follows. (5) Maximize Ij;(x, u)= 8(x) + u' g(x) sub- ject to (x, U)E Y = {(x, u) I XE Xu, UE Rm, u?: 0, Vxlj;(x, u)=O} (where Vxlj;(x, u) denotes the vector whose components are the partial de- rivatives olj;(x,u)jox i for i= 1, ...,n). There are a number of duality theorems related to problems (P) and (5); two such theorems are as follows: (1) (P. Wolfe [20J) Suppose that XU is open and convex, that 0 and 9 are differentiable and convex, and that 9 satisfies the Kuhn-Tucker constraint qualification. Then, if x is a solution of (P), there exists a UE Rm such that (x, u) is a solution of(5) and 8(x) = Ij; (x, U). (2) (0. L. Mangasarian and 1. Ponstein [13J) Suppose that Xu is open and convex, that 8 and 9 are differentiable and convex, and that Cx, 12) is a solution of (5). If Ij;(x, 12) is strictly convex in some neighborhood of X, then x is a solution of (P) and 8(x)= Ij;(x, 12). The above two problems (P) and (D) are not symmetric. The notion of symmetric duality was introduced by G. B. Dantzig, E. Eisen- berg, and R. W. Cottle: Primary: Minimize F(x, u)=K(x, u)-u'VuK(x, u), subject to the constraints VuK(x,u)O, x?:O, and u?:O; Dual: Maximize G(x, u)= K(x, u)- x'VxK(x, u), subject to the constraints VxK(x, u)?: 0, x ?:O, and u ?:O, 
292 E Nonlinear Programming where K is continuously differentiable in (x,u)ER" x Rm. (3) Dantzig et al. proved [6] the existence of a common optimal solution (x, u) to both the primary and dual problems, provided (i) an optimal solution (x, u) to the primary problem exists, (ii) K is con vex in x for each U and concave in U for each x, and (iii) K is twice differentiable and the matrix of second partials (a 2 Kjau i au j ) is negative definite at (x,u). Rockafellar [15] gave another expression of the duality relation: Define F(x, y)= O(x) if g(x) <:;; y and = 00 otherwise, and denote tp(y) = infxEXo F(x,y). Then O(x)=tp(O). For any nonlinear function tp(y), the conjugate tp*(ry) is defined by tp*(ry) = sup (ry'y - tp(y)). y (4) (Rockafellar [15]) supw(u)= tp**(O) = clco tp(O), where clco tp(y) denotes the closed convex hull or maximum convex minorant of tp(y) defined by clcotp(y)=suP/I'y{I'z<:;; tp(z) for all z}. It follows from the above that inW(x)= supw(u) if and only if tp(O) = clco tp(O), which holds true if tp(y) is convex. Further forms of the duality theorem hold for linear or quadratic programming problems ( 255 Linear Programming, 349 Quadratic Programming). E. Algorithms In a limited class of problems, i.e., when the objective function is quadratic and the con- straints are linear ( 349 Quadratic Program- ming) the optimal solution can be obtained by solving a system of linear equations by the simplex method or other algorithms; but in most nonlinear programming the solution is calculated by some kind of iterative procedure. Note that even when the constraints are given as equalities and all the functions involved are continuously differentiable, so that the optimal solution is explicitly given as a solution of a set of simultaneous equations, we usually require some iteration procedure, such as the Newton- Raphson algorithm, to obtain numerically the solution with preassigned accuracy; and the iteration is not always easy if the functions are sufficiently complex. Several iterative procedures for solving non- linear programming problems have been pro- posed. Since the simplex method is a powerful tool in linear programming, one type of ap- proach is to obtain an approximately optimal solution by approximating the objective and the constraint functions by piecewise linear functions and then applying linear program- ming techniques to get the approximately optimal solution within each region. Another 1098 is to change the constrained problem to a nonconstrained one by introducing a suffi- ciently large number M, called the penalty, and then maximizing OM (x) = O(x)+ MI max(gi(x), 0) j without the constraint. This is called the pen- alty method. The third and most generally applicable technique is the gradient method, of which several variations are known: (i) Arrow-Hurwicz-Uzawa gradient method [4]. Concave or convex programming prob- lems can be solved by finding a saddle point of the Lagrangian function Ij;(x, u). Let tp(x, u) be strictly concave and of class C 2 in n-vector x;;, 0 and con vex and of class C 2 in m- vector u;;, 0 and possess a saddle point (x, u). To approach a saddle point of tp(x, u) it is natural to devise a gradient process of the form dX i atp dU j dt ax; , dt atp aU j (1 ) To keep the variables in the positive orthant, we need to modify (1), and we consider the following system of differential equations: { o dx -i= atp aX i if xi=O and atp <0 ax; otherwise { 0 duo d/ = _ atp aU j (i = 1, ... , n), . otp Ifuj=O and ->0 aU j otherwise (j= 1,... ,m). Under certain regularity hypotheses, there exists a unique solution (x(t), u(t)) of the sys- tem with any initial point (xo, uo), and the x component x(t) of the solution converges to x as t-> 00. Applying the above results to the Lagran- gian function Ij;(x,u) we can solve the concave or convex programming problem. (ii) Rosen's gradient projection method [16]. If a point Xo of the feasible region does not give a solution for the minimization problem (P), then we look for a feasible point with a lower function value by proceeding from Xo in the direction of the gradient of the function - O(x). The method fails if Xo is a boundary point and if the gradient vector points toward the exterior of the feasible region. Rosen's method [16] is to project the gradient onto the boundary of the feasible region and then pro- ceed in the direction of this projection. In this manner, we remain on the boundary of the feasible region. 
1099 (iii) Methods of feasible directions. These were first described by G. Zoutendijk [21]. Consider the problem of minimizing O(x) subject to the constraint XES eRn, where S is a closed, connected set satisfying certain regu- larity conditions and 0 is a continuously dif- ferentiable function of the n-vector x, such that, for some rx, the set {xESIO(x)rx} is bounded and nonempty. A method of feasible directions is any recipe for solving this problem by proceeding along the following lines: (1) Start with some xo E S such that O(Xo) < sup rx. (2) Pass from the kth iteration point Xk to Xk+1 by first determining a direction Sk in Xk such that the ray Xk + ASk lies in S for all sufficiently small A> O. (3) Then determine the step length Ak' thus obtaining the (k+ l)st iterate xk+! =Xk+AkSk. (4) Repeat this procedure until some prescribed stopping condition is satisfied. There are many methods of determining the s\ and in most cases the Ak are then deter- mined by solving a one-dimensional mini- mum problem along the direction so obtained. Zoutendijk has unified the various possible methods offeasible directions and the relevant normalization rules that yield the optimal directions s\ and has investigated this subject in detail from the viewpoint of computational technique [21,22]. F. Generalizations The extension of the Kuhn-Tucker theory to linear topological spaces is due to L. Hurwicz [8]. Let :J: be a linear space, OY, :!Z be linear topological spaces, Py, Pz the non negativity cones of OY, :!Z, respectively, which are closed convex cones containing inner points, D a convex set in:J:, and F, G concave mappings ( 88 Convex Analysis A) from D into OY,:!Z, respectively, such that G(D) contains an inner point of Pz. If F(X) attains its maximal point when X =X o and X o satisfies G(Xo);'O, XoED, then there exist Y o *;' 0, Z6 ;, 0 such that <IJ(X,Z*)= Yo*(F(X)) + Z6(G(X)) has a saddle point at (X o , Z6)' The condition that Pyand Pz have inner points can be weakened to cover the cases of (lp), (L p ), (s), (S) (H urwicz and Uzawa). P. P. Varaiya [19J considered the following nonlinear programming problem in Banach space. (B) Maximize f(x) subject to x E A, g(X)E Ay, where X, Yare real Banach spaces, x EX, g: X --> Y is a Frechet differentiable mapping, f is a real-valued differentiable function, A is a subset of X, and A y is a convex set in Y. The main results are similar to the Kuhn- Tucker necessary conditions. Varaiya also exhibited a saddle value problem related to 292 Ref. Nonlinear Programming (B), when Ay is a closed convex cone. L. W. Neustadt [14J investigated nonlinear pro- gramming problems in linear vector spaces and gave an application to the theory of optimal control. The main results are: (i) Kuhn-Tucker type conditions which are both necessary and sufficient for optimality, (ii) a duality theory for obtaining multipliers in the generalized Kuhn-Tucker conditions, and (iii) an application to optimal control theory. References [IJ 1. Abadie (ed.), Nonlinear programming, North-Holland, 1967. [2J J. Abadie (ed.), Integer and nonlinear programming, North-Holland, 1970. [3J K. J. Arrow, L. Hurwicz, and H. Uzawa (eds.), Studies in linear and nonlinear pro- gramming, Stanford Univ. Press, 1958. [4J K. J. Arrow and L. Hurwicz, Gradient method for concave programming I, in [3, pp. 117-126J; H. Uzawa, Gradient method for concave programming II, in [3, pp. 127-132]. [5J E. M. Beale, On minimizing a convex function subject to linear inequalities, J. Roy. Statist. Soc., (B) 17 (1955),173-184. [6J G. B. Dantzig, E. Eisenberg, and R. W. Cottle, Symmetric dual nonlinear programs, Pacific J. Math., 15 (1965), 809-812. [7J M. Guignard, Generalized Kuhn-Tucker conditions for mathematical programming problems in a Banach space, SIAM 1. Control, 7 (1969),232-241. [8J L. Hurwicz, Programming in linear spaces, in [3, pp. 38-102]. [9J F. John, Extremum problems with inequal- ities with subsidiary conditions, Studies and Essays: Courant Anniversay Volume, Inter- science, 1948, 187-204. [IOJ H. P. Kiinzi and W. KreIle, Nichtlineare Programmierung, Springer, 1962. [IIJ H. W. Kuhn and A. W. Tucker, Non- linear programming, Proc. 2nd Berkeley Symp. Math. Statist. Prob., 1951,481-492. [12J O. L. Mangasarian, Nonlinear program- ming, McGraw-Hili, 1969. [13J O. L. Mangasarian and J. Ponstein, Mini- max and duality in nonlinear programming, J. Math. Anal. Appl., 11 (1965), 504-518. [14J L. W. Neustadt, Sufficiency conditions and a duality theory for mathematical pro- gramming problems in arbitrary linear spaces, in [17, pp. 323-348]. [15J R. T. RockafelIar, Augmented Lagrange multiplier functions and duality in nonconvex programming, SIAM J. Control, 12 (1974), 293-322. [16J J. B. Rosen, The gradient projection method for nonlinear programming, 1. Soc. 
293 A Nonstandard Analysis Indust. Appl. Math., 8 (1960),181-217; 9 (1961),514-553. [17J J. B. Rosen, O. L. Mangasarian, and K. Ritter (eds.), Nonlinear programming, Aca- demic Press, 1970. [18J H. Uzawa, The Kuhn-Tucker theorem in concave programming, in [3, pp. 32-37]. [19J P. P. Varaiya, Nonlinear programming in Banach space, SIAM J. Appl. Math., 15 (1967), 284-293. [20J P. Wolfe, A duality theorem lor nonlinear programming, Quart. Appl. Math., 19 (1961), 239-244. [21 J G. Zoutendijk, Methods of feasible direc- tions, Elsevier, 1960. [22J G. Zoutendijk, Nonlinear programming, computational methods, in [2, pp. 37-86]. [23J G. Zoutendijk, Some algorithms based on the principle offeasible directions, in [17, pp.93-121]. 293 (1.7) Nonstandard Analysis A. General Remarks Nonstandard analysis is a new field of research that has branched off from model theory and that provides a powerful method applicable in almost all fields of mathematical science. About 1960, A. Robinson successfully used a nonstandard model of the real number field R to justify Leibniz-type infinitesimal calculus. After this, using higher-order logic, he devel- oped a stronger theory of nonstandard analy- sis, and applied it to other mathematical fields [1]. In this article, we adopt a first-order logic over a universe. In the final section we present a theory called nonstandard set theory; this is a conservative extension of Zermelo-Fraenkel set theory with the axiom of choice. B. Axioms for Nonstandard Analysis A nonempty set U is called a universe if the following four conditions are satisfied: (a) XEU, YEX YE U; (b) X, YE U  {X,Y}E U; (C)XEUUXEU; (d) XEU  P(X)EU. In what follows, U wi\] be a universe contain- ing the field R of real numbers. We construct a language 2 describing mathematics in U. The alphabet consists of the 1100 following symbols: (1) countably many vari- ables. (2) constants corresponding to all ele- ments in U. (3) predicate symbols = and E. (4) logical symbols I, /\, v, ->, 3, V. (5) auxiliary symbols [ ,]. The last two symbols will be omitted where there is no danger of confusion. Definition of formulas. (1) If t and s are terms (variables or constants), then t = sand tES are formulas (atomic formulas). (2) If I/J and I/J are formulas, then II/J, I/JI\I/J, I/Jvl/J, I/J->I/J are formulas. (3) If I/J is a formula and x is a variable, then 3x[I/JJ and Vx[I/JJ are for- mulas. (4) The formulas are those that can be constructed by the above procedure. A formulas I/J containing at most n free variables is called an n-ary formula. In this case, we often write I/J(x 1, '" , x n ) for I/J. A sen- tence is a O-ary formula. We write p I/J if a sentence I/J is true under the usual inter- pretation. Consider a quadruple (U, *U, *E, *) where *U is a set, *E is a binary relation on *U, and * is a mapping: a *a from U to *U. An element CI: of * U is called standard if CI: = * a for some aE U, and nonstandard if not. Adjoin to !f' constants representing all nonstandard elements of * U. Then we have a language * !f' for * U. A sentence in 2 is a sentence in * 2. Taking *U as the scope of quantifiers, we can interpret every * 2-sentence in *U. We write *p I/J if an * 2-sentence I/J is true under this interpretation. Axiom 1 (transfer): For every sentence I/J in !f', we have pl/J=*pl/J. Definition: Let I/J(x, y) be a binary formula in 2 (resp. in * 2). We say that I/J(x, y) is concur- rent in U (resp. in *U), if, for every finite num- ber of elements ai' . .. , an in U (resp. in * U), there exists an element b in U (resp. in *U) such that pl/J(a;, b) (resp. *pl/J(a;.b)) for l<i<n. Axiom 2 (enlargement): If a binary formula I/J(x, y) in 2 is concurrent in U, there exists an element fJ in *U such that *pl/J(a,fJ) for all a in U. A quadruple (U, *U, *E, *) or simply *U is called an enlargement of U if it satisfies Axioms 1 and 2. These axioms are strong enough to develop the basic theory of nonstandard analysis, but sometimes we require a stronger axiom. Let K be an infinite cardinal. Axiom 3 (K-saturation): Let I/J(x, y) be a binary * ff:'-formula concurrent in *U. Then, for every subset A of * U with cardinality at most K, there exists an element fJ of *U such that *1= (CI:, /3) for all Cl:E A. * U is called a K-saturated model if it satisfies Axioms 1 and 3, and a K-saturated enlarge- 
1101 ment if it satisfies Axioms 1, 2, and 3. If K is not less than the cardinality of V, then a K- saturated model is necessarily an enlargement. In the remainder of this article, * V will be an enlargement of V, if the contrary is not explicitly mentioned. For rx in *V, &. is the set of E*V such that *Erx:&.=gE*VI*Erx}. For simplicity, we write E for *E and identify &. with rx. Under this identification, a subset A of *V is called inter- nal if it belongs to * V and external if not. Axiom 1 and 2 imply the following results. (1) There exist infinite hypernatural num- bers, namely, elements of *N bigger than ail *n, nEN. (2) There are positive infinitesimal hyperreal numbers. (3) For every set A in V, there exists a hyperfinite internal subset of * A containing all *a, aEA. Here, r is called hyperfinite if we have *F ift(r), where ift(x) is a formula that says "x is a finite set." (4) Let A be a set in V. Then * A has non- standard elements if and only if A is an infinite set. (5) If a family :F of sets in V has the finite intersection property, then the family {* A I A E :F} in *V has a non empty intersection. (6) Let rx: *N-->*R be an internal hyper- sequence. If rx(*n) is infinitesimal for every nE N, then there exists an infinite hypernatural number A such that rx(v) is infinitesimal for every hypernatural number v less than A. If we moreover assume Axiom 3, we have the following results. (7) The cardinality of an internal set is either finite or more than K. (8) If a family :F of internal sets in * V has the finite intersection property and cardinality K at most, :F has a nonempty intersection. (9) Introduce the order topology in *R. Then every subset of *R with cardinality at most K is bounded and discrete. (10) Let rx, {3 be internal sets and C a subset of rx with cardinality at most K. Then every mapping from C to {3 can be prolonged to an internal mapping from rx to {3. (10') In particular, every external sequence N ->{3 can be prolonged to an internal se- quence *N --> {3. This foIlows from the assump- tion of countable saturation. This fact plays an essential role in nonstandard probability theory, which is now in the process of rapid development ( Section D (3)). C. Construction of Ultrapower Models Let I be an infinite set. A mapping rx from I to V is a family of elements in V with indices in I. So we write <rx(i)iEI or simply <rx(i) for rx. 293D Nonstandard Analysis Let .'F be a nonprincipal ultrafilter on I and define an equivalence relation - Y' on the set V I of all mappings from I to V: <rx(i) -" <{3(i) <0> {iE I I rx(i) = {3(i)} E:F. Denote by *V the quotient set of V under the relation -Y" The class of <rx(i));EI will be denoted by [rx(i)JiEI or simply [rx(i)]. Define a binary relation *E on * V: [rx(i)J *E [{3(i)J <0> {i E I I rx(i) E {3(i)} E:F. Let * be the diagonal mapping from V to * V and consider the quadruple (V, *V, *E, *). Theorem (Los): The foregoing quadruple satisfies Axiom 1. Moreover, * V is a countably saturated model. In particular, let I be the set of all finite subsets of V. For iEI, put j1(i)={jEIli c j}. Then the family of sets  = {j1(i) liE I} has the finite intersection property, and therefore there exists an ultrafilter ,'F on I including  (use tZorn's lemma). If we construct * V from the ultrafilter ,'F, then * V is a countably saturated enlargement of V [1]. I t is not easy to construct a K-saturated enlargement for an arbitrary cardinal K. We have two ways: to use a K-good ultrafilter (the proof of its existence is difficult) [2, 3J or to use an ultralimit (iteration of ultra powers) [4, 16]. D. Applications (1) Infinitesimal Calculus. A hyperreal number (X (element of *R) is called infinitesimal if Irxl is smaller than every positive real number. If (X - {3 is infinitesimal, rx and {3 are said to be infinitely close to each other; this is written rx;::,; {3. If l/rx is not infinitesimal, rx is called finite. Every finite hyperreal number rx is in- finitely close to a real number a (completeness of R). We call a the standard part of rx and write st(rx). Every finite hyperinteger is an integer. Let f be a real-valued function on a real interval I. Then *{ is a hyperreal-valued func- tion on the hyperreal interval *1. The function f is continuous if and only if fix);::,; *f(1]) for every XE I, 1]E * I with x;::,; 1]. The function f is uniformly continuous if *f();::,; {(1]) for every ,1]E*I with ;::';1]. Let bx be a variable ranging over nonzero infinitesimals. The function bf = *f(a + bX)- {(a) of bx will be called the infinitesimal increment of fat a. f is differentiable at a if and only if the quotient b{/bX is of infinitesimal variation. The common standard part of bjjbx is the derivative j'(a). The higher-order differential b} is also justified as a higher-order infinitesimal dif- ference, with the standard part of b"f/bX n being 
293 E Nonstandard Analysis f(n)(a). We can define the Riemann integral as the standard part of a Riemann hypersum with respect to a hyperfinite partition of infini- tesimal width. This type of reformulation permits us to rewrite the whole of calculus, and many teachers are trying to adapt this theory to elementary calculus [5,6]. (2) Topological Spaces. Let X be a topological space in U and let a be a point of X. The intersection of * A, A varying over the neigh- borhoods of a is called the monad of a and is denoted by Mon(a). There exist hyperneigh- borhoods of a contained in Mon(a) (infini- tesimal neighborhoods). The topology is deter- mined by the system of monads <Mon(a)aEx' We can thus rewrite the theory of topological spaces. For example, X is Hausdorff if and only if Mon(a) n Mon(b)= 0 for every a#b in X. X is compact if and only if every point of * X belongs to the monad of some element in X. This characterization is very useful; using it, Robinson and Bernstein were able to solve the mvariant subspace problem for a special class of operators on a Hilbert space [7]. We can also construct a Haar measure very naturally and simply [2]. (3) Measures and Probability Theory. Let (X, B, m) be a measure space. Then there exist a hyperfinite subset r of * X and a positive hyperreal-valued internal function qJ such that we have f {dm= I *f()qJ() x Er for every integrable function f [8]. The right- hand side is a hyperfinite sum. It foHows that the measure m can be extended to a finitely additive measure defined over all subsets of X. If in particular every measurable finite set is of measure 0, then there exist a hyperfinite subset r of * X including X and a hyper- natural number p such that we have J fdm I *f() x P Er for every integrable function f [8]. If m(X) = 1, then p can be taken as the hypercardinality of r. Here, the right-hand side is nothing but the mean value of *f on a hyperfinite set. This idea leads us to a simple description of probability theory. The above method is rather formal; but P. Loeb [9J has pioneered a new approach in probability theory by constructing an external measure space from a hyperfinitely additive internal measure space (X,d, v) in *U. In the following, *U is supposed to be countably saturated. 1102 Let IJ(,w) be the external countably additive algebra of subsets of X generated by d. Then the mapping d f-> st(v(d)) from d to R can be extended to a measure on IJ(d). The com- pletion of this measure space is denoted (X, L(d), L(v)). In cases where X is hyper- finite, d is the totality of internal subsets of X, and v(X)= 1, then the space (X, L(,w), L(v)) is called a Loeb space and L(v) a Loeb measure. Every Radon probability space can be re- presented as the image of a Loeb space by a measure-preserving mapping. Therefore the probability theory on a Radon probability space can be reduced to that on a Loeb space. For example, for tLebesgue measure on the interval [O,lJ, take an infinite hypernatural number A. and put X={,u/AI,uE*N,O,u A-I}. If we assign 1jA. to every point of X, we have an internal probability measure v on X. Then the mapping st: Xf->st(x) from X to [O,IJ serves as a measure-preserving mapping from the Loeb space (X, L(.w), L(v)) onto the Lebes- gue measure space on [0, I]. The notion of lifting plays a key role in probability theory on Loeb spaces. Let f be a real-valued function on X, and F an internal hyperreal-valued function on X. F is called a lifting of f if we have f(x)=st(F(x)) almost everywhere with respect to L(v). Hence f is measurable if and only if it has a lifting. Shuttling between an internal probability space and its Loeb space by lifting and standard-part mapping, we can develop, sim- ply and partially hyperfinitely, probability theory on the Loeb space. Among others, Anderson [10, 11J and Keisler [12J applied this method quite successfully to Brownian motions, Ita integrals, stochastic differential eq uations, etc. [17]. E. Nonstandard Set Theory In our formulation in B, we must construct *U from a fixed universe U. Nelson [13J, Hrbacek [14J and Cuda [15J independently invented theories that nonstandardize the whole of set theory. In these theories, there is only one real number field R, and R already contains in- finitesimal numbers. Compare this to Robinson-type infinitesimal analysis, where infinitesimal numbers are introduced as an ad hoc tool. The new theories may demand a reflection on mathematical description in the natural sciences. In the following, we outline (intuitively) Hrbacek's theory, as strengthened and im- proved by Kawai [16]. We start from ZFC, the Zermelo-FraenkeJ set theory plus the axiom of choice. The lan- guage of Kawai's theory NST is that of ZFC 
1103 plus two constants S and I. We understand S as the totality of standard sets and I as the totality of internal sets. Let i/J be a formula of ZFC, that is, a for- mula of NST without S and I. We write si/J (resp. I i/J), the formula of NST obtained by restricting the scope of variables in i/J to S (resp. I). The mathematical axioms and axiom schemes of NST are substantially as follows. (1) If i/J is an axiom of ZFC, then si/J is an axiom of NST. In other words, all the axioms of ZFC are valid in the universe of standard sets. (2) In the universe of all sets, all the axioms of ZFC are valid except the regularity axiom. (3) Every standard set is internal. Every element of an internal set is internal. (4) (transfer) For every n-ary formula i/J(x 1 , "', x n ) of ZFC, we have 'Ix, ES... VXnES[si/J(Xj, ...,x n ) <-> Ii/J(X1' ..., X n )]. (5) (saturation) For every set of size at most that of S, the scheme corresponding to Axiom 3 (K-saturation) in Section B holds. Therefore the scheme corresponding to Axiom 2 (enlarge- ment) holds also. (6) (standardization) For every set a in- cluded in a standard set, there exists a stan- dard set b such that we have VXES[xEa<-> xEb]. This completes the description of NST. Theorem (Nelson-Hrbacek-Kawai): NST is a conservative extension of ZFC That is, let i/J be a sentence of ZFC If si/J can be proved in NST, then i/J can be proved in ZFC Corollary: If ZFC is consistent, so is NST. References [IJ A. Robinson, Non-standard analysis, North-Holland, 1966; second edition, 1974. [2J M. Saito, Ultraproducts and non-standard analysis (in Japanese), Tokyo Tosho, 1976. [3J C C Chang and H. J. Keisler, Model theory, North-Holland, 1973. [4J K. D. Stroyan and W. A. 1. Luxemburg, Introduction to the theory of infinitesimals, Academic Press, 1976. [5J H. J. Keisler, Elementary calculus, Prindle, Weber & Schmidt, 1976. [6J H. J. Keisler, Foundations of infinitesimal calculus, Prindle, Weber & Schmidt, 1977. [7J M. Davis, Applied nonstandard analysis, Wiley, 1976. [8J M. Saito, On the non-standard represen- tation of linear mappings from a function space, Comm. Math. Univ. Sancti Pauli, 26 (1977),165-185. 294 A Numbers [9J P. A. Loeb, Conversion from nonstandard to standard measure spaces and applications in probability theory, Trans. Amer. Math. Soc., 211 (1975), 113122. [IOJ R. M. Anderson, A nonstandard repre- sentation for Brownian motion and Ita in- tegration, Israel J. Math., 25 (1976), 15-46. [IIJ R. M. Anderson, Star-finite probability theory, Dissertation, Yale Univ., 1977. [12J H. J. Keisler, An infinitesimal approach to stochastic analysis, Mem. Amer. Math. Soc. 297 (1984). [13J E. Nelson, Internal set theory, Bull. Amer. Math. Soc., 83 (1977), 1165 1198. [14J K. Hrbacek, Axiomatic foundations for nonstandard analysis, Fund. Math., 98 (1978), 119. [15J K. Cuda, A nonstandard set theory, Comm. Math. Univ. Carolinae, 17 (1976), 647663. [16J T. Kawai, Nonstandard analysis by axiomatic method, Southeast Asian Confer- ence on Logic, C-T. Chong and M. J. Wicks (eds.), Elsevier, 1983. [17J N. J. Cutland, Nonstandard measure theory and its applications, Bull. London Math. Soc., 15 (1983), 529589. 294 (11.9) Numbers A. General Remarks From counting, a primitive mental activity, came the natural numbers (N) ( Section B), which serve to denote the number of items or the order in which these items are arranged. We can extend this concept to define, step by step, the tintegers (Z), trational numbers (Q), treal numbers (R), and tcomplex numbers (C) ( 355 Real Numbers; 74 Complex Numbers). The extensions up to the rationals are carried out to attain a domain within which the operations of addition, subtraction, multi- plication, and division, namely, the four arith- metic operations (or rational operations) can be performed indefinitely, with division by o the only exception. To develop the theory of natural numbers there are two well-known methods, one being G. Peano's system of axioms [4], which will be stated below, and the other being R. Dedekind's set-theoretic treatment [5]. The domain of rational num- bers can be extended, taking continuity into consideration, to that of real numbers by several methods of which the best known are those of the Dedekind cut (1872) [8J, which will be described below, and of G. Cantor's 
294B Numbers fundamental sequences (1872) [9]. There is also a way based on infinite series, given by K. Weierstrass in his lectures (1859-1860). Though in the domain of real numbers (1) the four arithmetic operations can be per- formed indefinitely and (2) an order relation for magnitude is defined, the eq uation x 2 + 1 = 0 has no root. Introducing numbers expressible as a + ib (i = j=!), it is possible to solve every equation of the second order. Such numbers, once called imaginary, have been used since the days of G. Cardano [6J in the 16th century. L. Euler also made good use of complex numbers as convenient tools in many calculations and obtained among other things his formula expiO=cos8+isinO. Indeed, the notation i = P was used for the first time by Euler (1777). Furthermore, C. F. Gauss, giving imaginary numbers the name complex numbers, showed that any algebraic equa- tion with numerical coefficients always has roots in the domain of complex num bers. The discovery of the geometric representation of complex numbers by several mathematicians in the late 18th and early 19th centuries and their use in many applications have made complex numbers indispensable in mathematics. Though there are extensions of complex numbers, such as tHamilton's quaternions or tCayley numbers, it is generally accepted that when we speak of a number we usually mean a complex number. B. Natural Numbers Peano, basing his system on a specific natural number, 1, and a function such that to each natural number x corresponds a natural num- ber x+ 1 (hereafter denoted by x' and called the successor of x), formulated the funda- mental properties of the set N of natural num- bers in the following five axioms, called the Peano postulates: (J) I EN; (2) if xEN, then x' E N; (3) if XE N, then x' #- 1; (4) if x' = y' (x, YEN), then x = Y; and (5) if a set M satisfies the two conditions: 1 EM, and XEM implies x' E M, then N c M. Since these postulates determine N uniquely up to isomorphism, they can be regarded as a definition of N. Elements of N are ca1led natural numbers. Owing to Peano's fifth postulate, regarding a certain property P(n) for natural numbers n we can deduce that P(n) is true for every n if we prove both of the fo1lowing conditions: (i) P(I) is true; and (ii) for any natural number k, if P(k) is true, then P(k + J) is true. Such rea- soning is called mathematical induction (or complete induction). Accordingly, Peano's fifth postulate is called the axiom of mathematical induction. Double mathematical induction, a 1104 generalization of mathematical induction, is as follows: to show that the property P(m, n) is true for every pair of natural numbers m and n, we have only to show that (iii) P(m, 1) and P( I, n) are true for every m and every n; and (iv) for every pair of natural numbers k and I, if P(k + I, I) and P(k, 1 + I) are true, then P(k + 1, 1 + I) is true. This axiom can be form ulated in several other ways and can be generalized further to n-tuple mathematical inductions (n = 2,3,4,...), generically called multiple mathematical inductions. Assume for a set M that a mapping f from the tCartesian product N x Minto M is given. Then a mapping cp from N into M such that (v) cp(l) = a; and (vi) cp(x') = f(x, cp(x)) (XE N) exists and is unique. Defining cp by (v) and (vi) is called the definition of cp by mathematical induction. In particular, given a natural number a, the mapping cp:N-->N defined by (vii) cp(I)=a'; and (viii) cp(X') = cp(x)' is ca1led addition by a. We shall write cp(b)=a+b, whence x'=x+ 1. Addition thus defined obeys the following laws: a+b=b+a (commutative law); (a+b) +c=a+(b+c) (associative law). Peano's postulates are thus equivalent to the following: (J') lEN; (2') for each pair a, bEN, a+bEN is defined so that addition obeys the commuta- tive and associative laws; (3') for each pair of natural numbers a and b, one and only one of the following three relations holds: a = b + C (cEN); a=b; a+c=b (cEN); (4') the same as (5) (mathematical induction). From (1')-(4') follows the cancellation law: a+c=b+c= a=b. Define a b if and only if a=b or a= b+c (a,b,cEN). Then, from (3'), N becomes a ttotally ordered set and ab=a+cb+c. For each aE N, the mapping cp: N-->N defined by cp(I)=a and cp(x') = cp(x) +a is called multiplication by a, and we write cp(b) = ab (or a' b). Multiplication obeys the fo1low- ing laws: ab= ba (commutative law); (ab)c = a(bc) (associative law); a(b+c)=ab+ac, (a+b)c=ac+bc (distributive laws); and ac=bc=a=b (cancellation law). The state- ment a'l = l'a=a also holds. Natural numbers, which have been intro- duced thus far as tordinal numbers, also have the properties of tcardinal nu m ber s. Denoting {1, 2, '.:..:2n} =M .. we ha Mn=Mm =m=n, Mm + Mn = Mm +n' M m x Mn = M m n (49 Cardinal Numbers). C. Integers Introducing new numbers which are not in N = {l, 2,... }, represented by the notations 0, - I, - 2, "', - n, '" , we write Z = { '" , 
1105 -11, ..., -2, -1,0,1,2, ...,I1,...}. An element of Z is called an integer (or rational integer). Algebraically we can construct Z from N as follows: Let the set of all tordered pairs (k, I) of natural numbers k, I be M = N x N, and define in M an tequivalence relation (k, I)  (m,l1) by k+l1=m+1. Let the equivalence class of(k,l) be K(k,I), and construct the tquotient space, M /  = M*. Then the mapping cp: Z-->M* defined by cp(l1) = K(k+l1, k), cp(O) = K(k, k), cp( -11)= K(k, k+ 11) is bijective. Setting K(k, 1)+ K(m, 11)= K(k+m, 1+11), addition can be defined in M* (and accordingly in Z) which is an extension of that in N. Since K(k, 1)- K(m, 11)= K(k+l1, I+m), subtrac- tion can be defined in Z. This makes Z an tAbelian group with respect to addition. An order relation in Z is defined by K (k,l)  K(m,I1)<o>k+l1m+l, which makes Z a totally ordered set. This order relation is an extension of that in N. In particular, N = {aEZI a>O}. Furthermore, settmg K(k, I) x K(m,I1)= K(km+ 111, kl1+ 1m), multiplication in Z can be defined. It is an extension of that in N and obeys commutative, associative, and distributive laws. Also, for each a, bE Z, we have ab=O<o>(a=O or b=O). Thus, Z becomes an tintegral domain. D. Rational Numbers Let P be the set of all ordered pairs (a, b) of integers a, b with b # 0, and define in P an equivalence relation (a,b)(c,d) by ad=bc. Each equivalence class determined by this relation is called a rational number. Denoting by L(a, b) the equivalence class to which (a,b) belongs, we can define the sum x+ y, the difference x- y, the product xy, and the quo- tient x/y of rational numbers x = L(a, b), y = L(e, d), as in the cases of addition and multi- plication of integers, in the following way: x+ y=L(ad+bc, bd), x- y= L(ad-bc,bd), xy=L(ae, bd), x/y= L(ad, be), where the quo- tient is defined only when c #0. Thus the set Q of all rational numbers becomes a tfield. For the same reason that we have identified integers of a special type with natural num- bers, we now identify a rational number ex- pressible as L(a, 1) with an integer a. Hence- forth, any rational number L(a, b)(b #0) can be expressed in the form of a quotient a/b (b #0) of integers a and b. From L(a, b)= L(ca, cb) (e #0), it is always possible to assume b > 0 in the representation L(a, b) of a rational number x. For any two rational numbers x=L(a,b), y=L(e,d) with b > 0, d > 0, define an ordering in Q by x  Y <0> ad  be, which is an extension of the order- ing of integers. Thus (i) Q becomes totally 294 E Numbers ordered, and we have (ii) x  y =>x + z y +z and (iii) x  y and z  0 = xz  yz. The ration- al x is called positive if x> 0 and negative if x<O. E. Real Numbers Two typical methods of constructing real numbers from rational numbers are those of Dedekind and of Cantor. Dedekind's Theory of Real Numbers. We call a pair (A[, A 2 ) of subsets A [, A 2 of the set Q of all rational numbers a cut of Q if they satisfy the following conditions: (i) Al # 0, A 2 # 0; (ii) Q=A[ U A 2 ; (iii) a[ EA" a 2 EA 2 => a, < a 2 . Then the following three cases are distin- guished: there is (i) a maximum in A, with no minimum in A 2 ; (ii) no maximum in A[ with a minimum in A 2 ; or (iii) no maximum in A[ with no minimum in A 2 . A cut with either condition (i) or (iii) is called a real number (in the sense of Dedekind); condition (ii) can be converted to (i). The set of all real numbers is denoted hereafter by R and each real number by rx or (J or .... A real number with property (i) is called a rational real number, and a real number with property (iii) an irrational real number. Any rational real number is uniquely determined by the maximum a* of A[, and the mapping: a-->a*=(A I ,A 2 ) from the set Q of rational numbers onto the set Q* of rational real numbers is bijective. I. For real numbers rx=(A[.A 2 ) and {J= (B I , B 2 ) we define rx  (J if and only if A[ c B[. By this ordering , R becomes a totally ordered set. II. For real numbers rx=(A[. A 2 ) and {J= (B[.B 2 ), put C 2 ={a+blaEA 2 ,bEB 2 } and C I =R-C 2 ; then (C" C 2 )=Y is a real num- ber. Define the sum rx + {J by setting rx + (J =:J. Addition thus defined obeys commutative and associative laws, and R becomes an t Abelian group with 0* as its zero element. Furthermore, for real numbers rx=(A[.A 2 ) and (J=(B[. B 2 ) with 0*  rx, 0*  {J, put D 2 = {ablaEA 2 ,bEB 2 }, D, =R-D 2 ; then (D[.D 2 )= c5 becomes a real number. Define the product rx{J by setting rx{J = c5. According as 0* > rx, 0*  {J; 0*  rx, 0* > {J; or 0* > rx, 0* > {J, define rx{J = - (( - rx){J); rx{J= - (rx( - (J)); and rx{J=( -rx)( - (J), respectively. Multiplication thus defined obeys the commutative, associative, and distributive laws, and R becomes a field with 1 * as its unity element. III. For ordering and arithmetic operations, we have (1) rx{J=>rx+y{J+y; and (2) rx {J, r O* => rxy  {Jy. By letting each rational number a corre- 
294 F Numbers spond to a rationally real number a*, we can set up a bijection between the set Q of all rational numbers and Q*. Furthermore, in this correspondence, the sum, product, 0, and 1 of Q are mapped to the sum, product, zero ele- ment, and unity element of Q*, respectively; in addition, the ordering is preserved. Thus Q and Q* are isomorphic with respect to both arithmetic operations and ordering. We shall hereafter identify the element a* with a. Ac- cordingly we call rational real numbers simply rational numbers and similarly irrational real numbers simply irrational numbers. IV. Similarly as for Q, we can define a cut (A!, A 2 ) of the set of all real numbers R (pre- cisely, by the conditions AI #- 0, A 2 #- 0; R=A1 UA 2 ; d 1 EA1' d 2 EA 2 =d! <d 2 ). For each cut (AI' A 2 ) of R, either there is a max- imum in AI and no minimum in A 2 or no maximum in AI and a minimum in A 2 (Dede- kind). This property is called the connectedness (or continuity) of real numbers. By using definitions (1)-(IV), any real num- ber rx can be represented (i) as the tleast upper bound of some set A of rational numbers: rx = sup A; and also (ii) as the limit of a sequence {an} of rational numbers: rx=liman- Cantor's Theory of Real Numbers. A sequence {an} of rational numbers is called a funda- mental sequence (or Cauchy sequence) if it satisfies the following condition: for each posi- tive rational number E, there exists a natural number no such that - E < an - am < E for every am and an with m>n o and n>no. For two Cauchy sequences {an} and {b n }, we can write {an}{bn} if {aJ,b!,a 2 ,b 2 , ...,amb n ,...} is again a Cauchy sequence. The relation  is an equivalence relation. Let R' be the set of all equivalence classes obtained by classifying, with respect to , the set of all Cauchy se- quences, and call an element of R' (an equiva- lence class) a real number (in the sense of Can- tor). We shall write [{ an}] hereafter for the equivalence class of {an}' In particular, {an} with an =a (n= 1,2,...) being a Cauchy se- quence, we denote a real number [{ an}] by a**. An element of R' of this type is called a rational real number (in the sense of Cantor), while one not of this type is called an irra- tional real number (in the sense of Cantor). For each pair of real numbers rx = [{an} J and fJ = [{ b n } J, their sum and product are uniquely defined by rx + fJ = [{ an + b n } J and rxfJ = [{ an b n }], where both {an + b n } and {an b n } are shown to be Cauchy sequences. Further, for rx and {J, define rx < fJ if an < b n hold for all n larger than some number no. As regards arithmetic operations and ordering, R' has the properties (I)-(IV) of Dedekind's theory. Fur- ther, any Cauchy sequence of which the terms 1106 all are real numbers has a limit in R' (com- pleteness of real numbers). The sets of real numbers obtained by the above two methods are isomorphic with re- spect to arithmetic operations and ordering ( 355 Real Numbers). F. Complex Numbers To construct complex numbers from real ones, several methods have been devised, among which the one stated below is due to W. R. Hamilton. An ordered pair (a,b) of real numbers a and b will be called a complex number. Arithmetic operations among complex numbers are de- fined as follows: (a,b)+(e,d)=(a+e,b+d), (a,b)-(e,d)=(a-c, b-d), (a,b)'(e,d)=(ae- bd, he + ad), and (a, b) _ ( ae+bd be-ad ) 2 2' 2 2' (e,d) #-(0, 0). (c,d)- e +d e +d According to these definitions, the set C of all complex numbers becomes a tfield with (0,0) and (1,0) as its tzero element and tunity ele- ment, respectively. R*={(a,O)laER} is a t su bfield of C, and the mapping cp : R -> R * defined by cp(a)=(a,O) proves to be a field tisomorphism. Thus, identifying the element (a, 0) of R* with the element a of R, C can be regarded as an toverfield of R (R c C). Hence, the zero element of C is the real number ° and the unity element is the real number 1. The complex number (0, I) is called the imaginary unit and denoted by i. Thus we can write a+bi as usual for a complex number (a,b) ( 74 Complex Numbers). References [IJ O. Perron, Irrationalzahlen, de Gruyter, second edition, 1929 (Chelsea, 1948). [2J E. G. H. Landau, Grundlagen der Analy- sis, Akademische Verlag., 1934; English trans- lation, Foundations of analysis, Chelsea, 1960. [3J N. Bourbaki, Elements de mathematique, I. Theorie des ensembles, ch. 3, Actualites Sci. Ind., 1243b, second edition, 1967; III. Topo- logie genera Ie, ch. 4, 1143c, third edition, 1960; ch. 8, 1235b, third edition, 1963; English trans- lation, Theory of sets, Addison- Wesley, 1968; General topology, pt. I, Addison-Wesley, 1966. [4J G. Peano, Arithmetices principia nova methodo exposit a, Bocca, 1889 (Opere sceIte II, Cremona, 1958). [5J R. Dedekind, Was sind und was sollen die Zahlen? Vieweg, 1887 (Gesammelte mathe- matische Werke 3, Vieweg, 1932); English translation, Essays on the theory of numbers, Open Court, 1901. 
1107 [6J G. Cardano, Artis magnae sive de regulis algebraicis, N uremberg, 1545. [7J W. R. Hamilton, Lectures on quaternions, Hodges and Smith, 1853. [8J R. Dedekind, Stetigkeit und irrationale Zahlen, Vieweg, 1872 (Gesammelte math- ematische Werke 3, Vieweg, 1932); English translation, Essays on the theory of numbers, Open Court, 1901. [9J G. Cantor, Dber die Ausdehnung eines Satzes aus der Theorie der trigonometrischen Reihen, Math. Ann., 5 (1872),123-132 (Gesam- melte Abhandlungen, Springer, 1932). 295 (V.4) Number-Theoretic Functions A. Recurrent Sequences A (complex-valued) function that has the set of nonnegative integers (or the set N of natural numbers) as its tdomain is called a number-theoretic (or arithmetic) function. Thus it can be regarded as a tsequence of numbers. We first consider recurrent se- quences. Let f(xo, XI, ..., xr-rJ be a complex- valued function of r variables. Put U o = a o , Uj = a 1 ,..., U,_j = a,-I, and successively de- fine U i +,= f(u i , U i + 1 , ..., Ui+'-I) (i = 0,1,2, ...). The sequence {un} thus defined is called a re- current sequence of order r determined by the initial values a o , aj, ..., a,-l and the function f In particular, when the defining function f is given by "L,:6 biXi, {un} is called a linear re- current sequence. The Fibonacci sequence is a special linear recurrent sequence with initial values a o , a 1 and the defining function Xo + XI' Let 0:=(1 +J5)/2, /3=(1-J5)/2 be two roots of 1 + x= xZ, and let C 1 and C z be deter- mined by c 1 +cz=a o and c 1 0:+c z /3=a 1 . Then the Fibonacci sequence with the initial values a o , a 1 is given by putting Un = C 1 o:n + c z /3 n (n= 0,1,2, ...). If we put a o = 1, a j = 1, then we obtain Binet's formula: U n =  { C +2 J5 )"+1 - C -2 J5 )"+1 B. Multiplicative Functions A number-theoretic function f with domain N is said to be multiplicative if f(1)= 1,f(mn) = f(m)f(n) for (m, n)= 1, and to be completely multiplicative if f(1)= 1, f(mn)=f(m)f(n) for any m and n (EN). Similarly, f is said to be additive if f(l)= O,f(mn) = f(m) + f(n) for (m, n) = 1, and to be completely additive if f( 1) = 0 295 C Number-Theoretic Functions and f(mn) = f(m) + f(n) for any m and n (EN). If f is multiplicative and "L,rf(n) is absolutely convergent, then "L,:;of(pV) is absolutely con- vergent for any prime p. Moreover, we have J1 f(n) = IJ (to f(p V)), where the infinite product on the right-hand side is also absolutely convergent. Further- more, if f(n) is completely multiplicative, then we have 00 I f(n) = f1 (1- f(pW j . n=l p In particular, for f(n)=n- s , which is com- pletely multiplicative (with s = u + it a complex variable), we get the tEuler product formula for the tRiemann zeta function: 00 ((s)= I n- S = f1(1_p-S)-1 n=l p for u> 1. C. Convolutions If f and 9 are number-theoretic functions, the convolution f * 9 is defined by (f * g)(n) = I f(d)g(n/d), din with the summation carried over all divisors d of n. For any number-theoretic functions f, g, h, we have f *g=g* f and (f *g)*h= f *(g*h). If f and 9 are multiplicative number-theoretic functions, f * 9 is again a multiplicative number-theoretic function. The Mobius function p(n) is defined as fol- lows: p(1) = 1, p(n) = 0 if n is divisible by the sq uare of a prime, and p( n) = ( - 1)' if n is the product of r distinct primes. It can easily be proved that p is multiplicative and that " { I (n=I), L, p(d) = din 0 (n>I). Let e and p be functions defined by e(l)= 1, e(n) = 0 (n> 1), and p(n) = 1 for every n. Then e is the identity element for the convolution *, and p * p = p * p = e. It follows that f * p = 9 is equivalent to f = 9 * p, that is, g(n)= If(d) din implies that f(n)= I p(d)g(n/d), din and vice versa. We call the latter the Mobius inversion formula. Similarly, for complex- valued functions F, G defined on [1, +00), G(X)= I F(x/n) nx 
295D Number-Theoretic Functions is equivalent to F(x)= I j1(n)G(xln). nx Let (p(n) be the number of integers m not greater than n and such that (m, n) = 1. The function cp(n), called the Euler function, is multiplicative, and we have ( 1 ) j1(d) cp(n)=nTI 1-- =nI-, pin P din d and cp(p") = p" - p"-' for every prime p. Let v be the function defined by v(n)=n for every n. Then \ve have !-1 * v = cp and v == P * Q, and hence L.dl" cp(d) = n. The generalized divisor function ddn) is the number of ways of expressing n as a product of k factors. Thus we have dk(n) = I I=Id H (m), "1"2 ..nkn mln and d k = {J* ... * p (k factors). Therefore, d k is multiplicative. For simplicity, we write d(n) instead of d 2 (n). Then d(n) is the number of divisors of n, and we call it the divisor function. For example, d(!2)=6. We denote by IJ,(n) the sum of ()(th powers of divisors of n. If n= IIplnpl, then we have p'(l+I)-1 IJ,(n)=Id'=TI , ' din pin p-l and IJ, is also multiplicative. We write IJ I (n) = IJ(n). A number n satisfying IJ(n) = 2n is said to be a tperfect number. Let n = IIp1nplp. The functions win) = L.plnl (the number of distinct prime factors of n) and Q(n) = L.PI" lp (the num- ber of prime factors of n) are often used, the former being additive and the latter com- pletely additive. D. Residue Characters and Gaussian Sums Let k be a positive integer and X be a com- pletely multiplicative function such that x(n) = 0 for (n, k) > 1 and x(n,) = x(n2) for nl = n2 (mod k). The function X is called a Dirichlet character (or residue character) with modulus k (or modulo k). There exist cp(k) distinct charac- ters modulo k for a given k. The character satisfying x(n) = 1 for every n coprime to k is called the principal character modulo k, and we denote it by Xo. It is easily proved that k-' { CP(k) I x(n)= "o 0 { CP(k) Ix(n)= x 0 (X=Xo), (x i= Xo), (n= l(modk)), (n", l(modk)). Suppose that k= k l k 2 ... k, ((k j , k)= 1. i i=j). Then there exist r characters Xi modulo k i permitting the unique decomposition x= X, X2'" X,. 1108 Let X be a character modulo k and p = exp(2nilk). The Gaussian sum modulo k is defined by G(a,x)= I x(n)pan, n(modJ..) where n runs over a complete system modulo m ( 297 Number Theory, Elementary, G). Hence if a = b (mod k), then G(a, X) = G(h, X). Suppose that k=k l k 2 ...k, ((kj,k)= 1, ii=j) and aj=a (modk;). Then we have G(a, X) = t:G(a l , x,)G(a2' X2)'" G(a" X,), f. = X, (kl k,) X2 (klk 2 )... x,(klk,), where X = X I X2 ..' X, is the decomposition of X mentioned above. Let ko be a divisor of k. If x(n)= 1 whenever (n,k)= 1 and n= 1 (modk o ), then we say that X is defined mod ko. The least positive integer r = fix) modulo to which X is defined is called the conductor of X. If the conductor of X is k itself, then X is said to be a primitive character modulo k. When X has the decomposition x= x, '" X" the conductor of X is also decom- posed as ((X) = f(x,) f (X2) .. ..f(X,). If D is a square-free integer, then the tdiscriminant d of the quadratic number field Q(JD) (where Q is the rational number field) is either D (D= 1 (mod 4)) or 4D (D =2, 3 (mod4)). The integers d so represented are caned the fundamental discriminants. The tKronecker symbol (din) ( 347 Quadratic Fields) is defined only for such fundamental discriminants. Z. Suetuna and A. Z. Walfisz (1936) proved that if x(n) is a real primitive character modulo k, then we necessarily have one of the fonowing cases: (i) k = PI P2'" p,; (ii) k=4PIP2'" p,; or (iii) k= 8p, P2 ... p, (with the pj distinct odd primes). In case (i), { (kin) x(n)= (-kin) (k= 1 (mod4)), (k= -1 (mod4)); in case (ii), { ( -kin) (kI4= 1 (mod4 ) ), x(n)= (kin) (kI4= -1 (mod4)); and in case (iii), x(n) is either (kin) or (-kin). If X is a primitive character modulo k, then G(a,x)=x(a)G(I,x), G(1,X)G(1,X)=k. In par- ticular, if X is a real primitive character, then G(1,X)= { Jk (X(-I)=I), iJk (X(-I)=-I). Sometimes S(a, k) = L. n (mod k)pan 2 is caned the Gaussian sum, where p=exp(2nilk). If(a,k j ) = 1 (i = 1,2) and (k l , k 2 ) = 1, then S(a, k l k 2 ) =S(ak 2 ,k,)S(ak"k 2 ). If(a,2)= 1, then { o (r=I), S(a, 2') = (1 + iO)2'/2 (r even), 2"ja/4.2(,+I)/2 (r odd> 1). 
1109 Let p be an odd prime and (a, p) = 1. Then we have { (a IP )S(1,P) S(a,p')= p,j2 p(r-I)/2S(a,p) (r=I), (r even), (r odd> 1), where (alp) is the tLegendre symbol ( 297 Number Theory, Elementary, H). The well- known Gauss formula is stated as follows: { jP (p= 1 (mod4)), S(I,p)= ijP (p=3 (mod 4)). V arious proofs of this formula have been given by many authors [5]. Let G(a, Xo) = I' exp(2niahlk), h(mod k) called the Ramanujan sum, be denoted by ck(a). The sum ;'(modk) means that h runs through a reduced residue system modulo m ( 297 Number Theory, Elementary, G). It follows that ck(l) = J1(k) and ck(a)= I J1 ( - d k ) d. dll'.u) E. Analytic Methods Let f be a number-theoretic function. One of the problems in analytic number theory is to estimate n<;.J(n) or to expand it into series. We assume that f(x) and g(x) are real-valued functions defined for x  1 and g(x) is of class C 1 . Then we have n/(n)g(n)=F(x)g(X)- f F(t)g'(t)dt, where F(x) = n<;. J(n). If we take f(x) = 1, g(x) = logx or llx in this formula, then we obtain Ilogn=xlogx-x+O(logx) n::'i;x or I n- 1 =logx+C+O(x- 1 ) n'( (where C is the tEuler constant), respectively. We now construct from the function (the Dirichlet series C£ F(8)= I f(n)n-' "=1 or the power series  P(x)= I {(n)x". n=1 There are called the generating functions of f The consideration of these functions makes possible the application offunction-theoretic 295 E N umber-Theoretic Functions methods to the problem in this section. Some- times more complicated functions are used. We mainly consider the generating function represented by the Dirichlet series. The gen- erating functions of p, J1, d, X ( Sections C, D) are ((8) = 1 n -s, I J1(n)n -', 1 d(n)n- S , L(s, X)= l x(n)n -s, respectively. The tab- scissa of absolute convergence of each of these Dirichlet series is 1. The function L(s, X) is called the tDirichlet L-function ( 123 Distri- bution of Prime Numbers; 450 Zeta Func- tions). When k = 1 and X = Xo, L(s, X) is pre- cisely (s). Let F(s), G(s), and H(s) be the generating functions of f, g, and h = f * g. Let f and 9 be multiplicative, so that h is also multiplica- tive. Moreover, if F(s) and G(s) are absolutely convergent for IJ  IJo, then H(s) is also ab- solutely convergent for IJIJo, and F(s)G(s)= H (8) for IJ  IJ o . It follows from this that :=1 J1(n)n -s = C 1 (s), l J1(n)x(n)n- S =L(s,X)-\ I dk(n)n-s=((s)\ and :=1 d 2 (n)n -s = (4(8)/((28). The last result was given by S. Ramanujan. More generally, :=1 d'(n)n -s = (2 r (8)qJ(S), where qJ(s) is an analytic function for IJ> 1;2. By utilizing ana- lytic methods, we can deduce from this that n<;.xdr(n)x(cdlogx)2r-l +... +c2r) (8. M. Wilson, 1923) ( 123 Distribution of Prime Numbers; 242 Lattice-Point Problems). There are many formulas known as the Euler summation formula. Among them the following one is convenient to use (E. Landau, L. 1. Mordell, H. Davenport): Let fo(x) = x - [x] -1/2 (when x is not an integer), =0 (when x is an integer). We successively con- struct continuous functions fl (x), f2(X), .., of period 1 such that j;(x) =.f..-l (x) (r 1) if x is not an integer, and H.f..(x)dx=O. If F(x) is of class C h on [a, b], then using these auxiliary functions, we have aJ;h F(m)= f F(x)dx h-l + I ( -1 ),(.f..(a)F(')(a) r=O - .f..(b)pr)(b)) +( _1)h-! f h-l (X)F(h)(x)dx, where ' means that the term corresponding to m=a or m=b in the sum is to be replaced by F(a)/2 or F(b)/2 whenever a or h is an in- teger. Since the tFourier series of fo(x) is _ f sin(2nnx) n=1 nn (which is convergent), .f..(x) can be expressed by _ I' exp(2ninx) n=-lf (2nin)'+1 ' 
295 Ref. Number-Theoretic Functions where L' means that the term with n = 0 is omitted and this sum actually means  ( eX P (2ninX) + exp( - 2ninX) ) . /-;;'1 (2nin)'+1 (-2nin)'+1 Hence if h= 1, we have ab F(n)= f F(x)dx 1 00 1 f b -- I - F'(x)sin(2nnx)dx. n n=1 n a For instance, if we put h= 1, a= 1, b=N, F(x)=x-' and let N tend to 00, then we have the formula 00 (s)= I n-' n=1 =+-s f oo fo(X) dx for u> 1. s - 1 2 1 xs+l Utilizing the following integration by parts, the integral on the right-hand side can be extended so as to become holomorphic in s in the whole complex plane: f '" fo,(:: dx= - f1(1)+(S+ l) f oo f1,; dx 1 X 1 X Probabi1istic considerations are also used for the study of various number-theoretic functions. If f(n) is w(n) or Q(n), then we have LnJ(n)= xloglogx +cx+o(x). Therefore the average order of w(n) or Q(n) is estimated as 10glogn. Let A(x; ex, {3) be the n umber of n satisfying nx and 10glogn +ex J loglogn < f(n)<loglogn+{3 J loglogn. Then I . A(x; ex, {3) 1 f p -u2/2 d 1m - - e u. xoo x J2.; . For f(n)=w(n), the result was proved by P. Erdos and M. Kac (1940) by using the tcentral limit theorem and V. Brun's tsieve method. Further general formulas were obtained by M. Tanaka (1955). Further development and the present state of probabilistic number theory can be seen in [7,8]. Finally, we mention the well-known estima- tion formulas. Let 8 be an arbitrary positive number and n be a positive integer. Then d(n) = O(n'), where 0 (tLandau's symbol) depends on 8. We know that lim sUPnoo(Iogd(n))' (loglogn)jlogn=log2 (S. Wigert, 1907), and liminfnoo qJ(n)' (Ioglogn)jn = e- c . The result w(n)=O(lognjloglogn) is often used. Let X be a primitive character modulo k, and let Sm = L;:'l x(n). We can prove that ISml <Jk logk for aIJ m (I. Schur, 1918) and Sm=Q(jk log log k) (R. Paley, 1932). This formula, with the symbol 1110 Q, means that there exist infinitely many k satisfying ISml >cJk loglogk by taking m and X suitably, with c any positive constant. Let X be a character modulo k, and Xo be a primitive character associated with X. Then we have L(s, X) = L(s, XO)I1 P1k (l- XO(p)p -'). It follows that n I x(m)= I J1(d)x°(d) I xO(m) m=1 d.dlk,dKj mn!d if the conductor of XO is fA. G. Postnikov investigated (1956) the sum of characters. The least integer that is not a quadratic residue modulo p does not exceed pl/2 J e (Iogp)2, where p is a sufficiently large prime (I. M. Vinogra- dov, 1926). The least integer that is a tprimi- tive root modulo p does not exceed 2 m + 1 jP, where m is the number of distinct prime divi- sors of p-l, with p a prime (L. K. Hua, 1942). D. A. Burgess (1962) deals with the latter re- sults. We conclude with Artin's conjecture: If w is a given square-free integer, then there exist infinitely many primes p such that w is a primi- tive root modulo p. In 1967, C. Hooley proved this conjecture subject to the assumption that the general tRiemann hypothesis holds for tDedekind zeta functions over certain tKum- mer extensions of Q. He also obtained the asymptotic formula for the number of such primes not exceeding x. Recently the tsieve method has been widely applied to the various investigations in the theory of number theore- tic functions ( 123 Distribution of Prime Numbers E). References [1] R. G. Ayoub, An introduction to the ana- lytic theory of numbers, Amer. Math. Soc. Math. Surveys, 1963. [2] A. O. Gel'fond and Yu. V. Linnik, Elemen- tary methods in analytic number theory, Rand McNally, 1965. (Original in Russian, 1962.) [3] G. H. Hardy and E. M. Wright, An intro- duction to the theory of numbers, Clarendon Press, fourth edition, 1965. [4] H. Hasse, V orlesungen tiber Zahlentheorie, Springer, 1950. [5] E. G. H. Landau, Vorlesungen tiber Zahlentheorie I, Hirzel, 1927 (Chelsea, 1946). [6] W. J. Leveque, Topics in number theory, Addison-Wesley, 1956. [7] J. Kubilius, Probabilistic methods in the theory of numbers, Amer. Math. Soc. Trans!. Math. Monographs, 1964. (Original in Russian, 1962.) [8] P. D. T. A. Elliott, Probabilistic number theory I, II, Springer, 1979. 
1111 296 (V.1) Number Theory A. History Simple and curious relations among integers were discovered and admired from antiquity. For example, we have the relation 3 2 +4 2 =5 2 , which has geometric meaning concerning right triangles. The Pythagoreans ( 187 Greek Mathematics) sought similar relations. They were also interested in tperfect numbers (num- bers eq ual to the sum of their divisors, such as 28 = 1 + 2 + 4 + 7 + 14). Modern arithmetic inherits from the Greeks the proof of the existence of an infinite number of primes, the tEuclidean algorithm to obtain the greatest common divisor of two integers (both given in Euclid's Elements), and Eratosthenes' tsieve for finding primes. In the 3rd century A.D., Dio- phantus of Alexandria discovered a method of solving indeterminate eq uations of degrees 1 and 2; this marked the origin of Diophantine analysis. The ancient Chinese also knew how to solve equations of the first degree in some special cases. Arithmetic also developed in India from an early period; its relation to Greek mathematics is not yet entirely clear. In the 12th century, the Indian mathematician Bhaskara knew how to solve tPell equations using a method much like Lagrange's. Interest in integers was reborn in Europe in the 17th century. During this period, Bachet de Meziriac (1581-1638) rediscovered the solution of the tDiophantine equation of the first degree and published it in his famous book on mathematical recreations [1]. Primes of the form 2 P - 1, closely related to perfect numbers and called tMersenne numbers, at- tracted considerable interest. tFermat, some- times called the father of number theory, announced numerous results without giving proofs; the most famous among them is the so- called Fermat's last theorem ( 145 Fermat's Problem). Another famous conjecture of his is that every integer is expressible as the sum of at most n n-gonal numbers, i.e., numbers of the form k(k-l)n/2-k(k-2), kEN. This was proved by Gauss (for the case n= 3), Jacobi (n=4), and Cauchy (for the general case). In the 18th century, remarkable progress was made by Euler and Lagrange. The second part of Euler's Algebra [2J contains rich re- sults of miscellaneous sorts in the field. La- grange developed the theory of tcontinued fractions and applied it to arithmetic. Toward the end of the 18th century, Legendre com- piled his comprehensive book [3J, from whose title originates the term number theory. 296 B Number Theory Gauss's Disquisitiones [4J appeared at about the same time as Legendre's book. The theo- retical arithmetic of today originates from this work of Gauss. The book includes the theories oftquadratic residues, tquadratic forms, and cyclotomy (i.e., arithmetic theory of the roots of unity in the field of complex numbers), all of which appeared as well-developed theories of a remarkably high standard. The work was received with more respect than comprehen- sion. Dirichlet made a lifelong effort to pop- ularize the Disquisitiones; he also applied analytical methods to compute the tclass number of quadratic forms, thus giving num- ber theory a new direction, the analytic theory of numbers. Gauss treated only binary qua- dratic forms; Eisenstein, Minkowski, and Siegel generalized the theory to the case of n variables. The algebraic theory of numbers has its origin in Gauss's paper on biquadratic residues. ( 14 Algebraic Number Fields; 59 Class Field Theory; 118 Diophantine Equa- tions; 182 Geometry of Numbers; 297 Num- ber Theory, Elementary; 347 Quadratic Fields; 430 Transcendental Numbers.) B. Analytic Methods Analytic methods are sometimes used to solve arithmetic problems. For example, Legendre conjectured that any arithmetic progression of integers a, a + d, a + 2d, ." contains an infinite number of primes if a and d are relatively prime. The conjecture was first proved by Dirichlet in 1837; in the proof he used an t L- function. Recently, proofs that do not use L- functions have been obtained, although these proofs are still not purely arithmetical. Analy- sis is indispensable in the formulation of some arithmetic problems. For example, let XER and n(x) be the number of primes not exceed- ing x. Euclid's Elements already give the re- sult n(x)->CXJ as X->CXJ, but to describe the behavior of n(x) as x->CXJ, notions of analysis are needed. Gauss conjectured that Iim=1. xoo x/logx This tprime number theorem was first proved in 1896, using the results of the theory of func- tions of a complex variable; more elementary proofs have been obtained in recent years. Analysis is sometimes needed to solve or simplify certain problems in number theory. The branch of mathematics treating such problems is calIed analytic number theory. The question of the extent to which analysis is really needed in dealing with such problems is itself an interesting one. 
296 Ref. Number Theory In the 20th century, analytic number theory has made rapid progress. The problem of distribution of primes has been generalized to the case of algebraic number fields. t Additive number theory dealing with tWaring's prob. lem, tGoldbach's problem, and other prob- lems has developed and formed a new field. We also have geometric number theory, which deals with tlattice-point problems. ( 4 Addi- tive Number Theory; 123 Distribution of Prime Numbers; 242 Lattice-Point Problems; 295 Number-Theoretic Functions; 328 Par- titions of Numbers; 450 Zeta Functions.) References [IJ C. G. Bachet de Meziriac, Problemes plaisants et delectables qui se font par les numbres, Lyon, 1612. [2J L. Euler, Vollstandige Anleitung zur Alge- bra, Petersburg, 1770. [3J A. M. Legendre, Essai sur la theorie des nombres, Paris, 1798. [4J K. F. Gauss, Disquisitiones arithmeticae, Leipzig, 1801; English translation, Yale Univ. Press, 1966. [5J M. B. Cantor, Vorlesungen liber Geschich- te der Mathematik, Teubner, 1894-1908. [6J L. E. Dickson, History of the theory of numbers I, II, II1, Carnegie Institution of Washington, 1919-1923 (Chelsea, 1952). [7J P. G. L. Dirichlet, Vorlesungen liber Zah- lentheorie, herausgegeben und mit Zusatzen versehen von R. Dedekind, Braunschweig, fourth edition, 1894. [8J J. LeVeque, Reviews of papers in number theory, vols. 1-6, Amer. Math. Soc., 1974. 297 (V.2) Number Theory, Elementary A. The Euclidean Algorithm We denote the set of natural n um bers 1,2, 3, '" by N and the set of rational integers 0, ::f:: L ::f::2, ... by Z. Evidently Z is an ordered tcommutative ring and also an tintegral do- main with respect to ordinary addition and multiplication. For any aEZ and bEN, there exists a unique pair of integers q and r such that a=qb+r (Or<b) (division algorithm); q is called the quotient and r the remainder of the division of a by b. When the remainder r is zero, we say that a is a multiple of b, b is a divisor of a, and a is divisible by b. We denote this relation by b I a. After a finite number of divisions a=q1b+r l , b=q2rj +r2' rl =q3r2+ r3' ... (b> r l > rz ... > 0), we reach an equa- 1112 tion of the form r k = qk+Zrk+l' The remainder r = rk+1, uniquely determined in this manner by a and b, is the greatest common divisor (G. C. D.) of a and b. It can be expressed as r=ax+ by with integers x and y. This method of obtaining the G. C. D. is called the Euclidean algorithm. The greatest common divisor of a and b is denoted by (a, b). If an integer c divides both a and b, then c I (a, b). When (a,b)= 1, we say that a and b are relatively prime, and there are integral solutions x, y of the equation ax + by = 1. Given a pair of posi- tive integers a and b, there exists a unique positive integer c that divides any common multiple of a and b, called the least common multiple (L. C. M.) of a and b. If d and I denote the G. C. D. and L. C. M. of a and b, respec- tively, we have ab = dl. B. Prime l\"umbers An integer p is called a prime number if p is larger than 1 and has no positive divisors other than 1 and itself. A positive integer is called a composite number if it has positive divisors other than 1 and itself. The follow- ing method of selecting the prime numbers from the sequence 2,3,4,5, ..., known to the Greeks, is called Eratosthenes' sieve: First we discard multiples of two, thus reaching the second prime number, 3. Then we discard multiples of three and reach the next prime, 5. Continuing this process, we find the sequence of primes 2.3,5,7,11,... by sifting out all multiples. c. Decomposition into Primes Factors Every integer n> I can be uniquely expressed as the product of primes; the resulting decom- position can be written as n=p"qPrY... with distinct prime factors p, q, r, .., and corre- sponding exponents ct., fJ, y, ... uniquely de- termined by n. This theorem is called the fundamental theorem of elementary number theory. D. Perfect Numbers We denote by O"(n) the sum of positive divisors of n (including 1 and n itself). According as O"(n) is greater than, equal to, or less than 2n, we call n an abundant number, perfect number, or deficient number. An even number is perfect if and only if it can be represented as n = 2"-1 . (2" - 1) with prime 2" - 1 (L. Euler). The exis- tence of an odd perfect number still constitutes an open question. It is well known that, if n is odd and perfect, then n must be of the 
1113 form n = pop' ...pt, where Po ==G o == 1 (mod4), and a( is even for i>O. Recently it has been proved that t must be  7 (P. H. Hagis, Jr., Math. Comp., 35 (1980)). Many necessary conditions for an odd number to be perfect have been given and repeatedly improved. It has been recently proved that there exists no odd perfect number less than 10 50 (Hagis, Math. Comp., 27 (1973)). Two numbers m and n are said to form a pair of amicable numbers if O"(n) - n = m and O"(m)-m=n (e.g., m=220 and n=284). Euler listed 61 such pairs. The following two num- bers m, n, both having 152 digits, constitute the largest amicable pair currently known: m=3 4 .5 .11. 528U 9 . 29, 89(2,1291, 528U9 -1), n = 3 4 . 5, 11 . 5281 19 (2 3 . 3 3 . 52 . 1291 .5281 19 (H. J. J. te Riele, Math. Comp., 28 (1974)). E. Lionnet considered the numbers n such that the product I1dlnd is equal to n 2 and called such numbers perfect numbers of the second kind, e.g., n=p3, n=pp', where p and p' are unequal prime numbers. It is also known that there are numbers n such that O"(n)/n is an integer. For example, 2 5 .3. 7,2 15 . 3 5 . 52. 7 2 . 11 . 13 . 17 . 19. 31 . 43 . 257 have this property. E. Mersenne Numbers A number of the form 2 e - 1, where e is a prime, is called a Mersenne number. For the number 2 e - 1 to be prime, it is necessary but not sufficient that e be prime. If a Mersenne number is a prime, it is called a Mersenne prime. It is not known whether there are in- finitely many Mersenne primes; it has been verified, however, that for e < 44500, there are 27 cases of e such that 2 e - 1 is a Mersenne prime; e=2, 3, 5, 7,13,17,19,31,61,89,107, 127,521,607,1279,2203,2281,3217,4253, 4423,9689,9941,11213,21701,23209,and 44497. Until the 18th century, the verifications for e  31 were done by direct calculation. For 61  e  127, the "Lucas test" was utilized to execute the computation. The remaining cases were calculated by means of electronic com- puters. The number 244497 - 1, which has 13395 digits, is at present the largest known prime. F. Fermat Numbers Numbers of the form 2 2 ' + 1 are called Fer- mat numbers. For a number p=2 e + 1 to be a prime, it is necessary that e be a power of 2. 297 G Number Theory, Elementary Fermat conjectured that numbers of the form 2 2 ' + 1 are all primes. In fact, for v=O, 1,2,3, 4, the corresponding p = 3,5, 17,257,65537 are primes; however, 2 25 + 1 is divisible by 641. It is not yet known whether there exist Fermat primes other than these five primes. For fur- ther details  [10]. Fermat numbers are closely connected with the problem of tgeo- metric construction of regular polygons. G. Congruence -1) Let m be a positive integer. Two integers a and b are said to be congruent modulo m if their difference is divisible by m; we denote this relation by the congruence a==b (modm) (or simply by a == b (m)) and caIl m the modulus of this congruence. Congruence modulo m is an tequivalence relation, which is compatible with the ring operations and classifies Z into m classes. We thus obtain the tresidue class ring Z/mZ with m elements. If p is a prime, then Z/pZ is a field which is isomorphic to the tprime field with tcharacteristic p. A complete system of representatives of the quotient set Z/mZ is called a complete residue system modulo m. On the other hand, a set of cp(m) elements n, (cp is tEuler's function) such that nd=nj (m) (i#j) and (n"m)= 1 is called a re- duced residue system modulo m. The set of all residue classes represented by a reduced re- sidue system modulo m forms a multiplicative Abelian group of order cp(m); we denote this group by (Z/mZ)*. If(a,m)= 1, then a",(m)==l (modm). If p is a prime and (a,p)= 1, then aP-1 == 1 (modp) (Fermat's theorem), since cp(p)=p-l. When m is 2,4, pk or 2 p k (p #2; k = 1, 2, ...), (Z/mZ)* is a tcyclic group, whose tgenerator 9 is called a primitive root modulo m ( Appendix B, Table 1). For any a prime to m and generator 9 of (Z/mZ)*, there exists a unique number J1 (1 jJ  cp(m)) such that a==g (modm). We call jJ the index of a with respect to the basis g, and write jJ = Ind g G ( Appendix B, Table 2). The group (Z/2kZ)* (k  3) is Abelian of type (2, 2k-2), a basis of which is formed by the residue classes repre- sented by -1 and 5. From this follows (p -1)! == -1 (mod p) (Wilson's theorem). Generally, if m = I1i=l m(, (m(, m) = 1 for i # j, then we have Z/mZZ/m1Z+ .., +Z/m,Z (direct sum) and (Z/mZ)* (Z/m1 Z)* x '" x (Z/m,Z)* (di- rect product). The congruence Gx==b (modm) with (a,m) = d is solvable if and only if d I b, and when it is solvable, the solution is unique modulo mid. The simultaneous congruences X==G( (modmJ (i = 1,2, ..., k) are solvable if and only if a( == a j (mod(m(, m j )) (i,j = 1,..., k), and when they are solvable, the solution is unique modulo the 
297 H Number Theory, Elementary greatest common multiple of m[,..., m k . In particular, when (mi' m) = 1 (i # j), the solution is unique modulo m 1 m 2 ... m k (Chinese remain- der theorem). If m = pl '" pi? is the factoriza- tion of m, solving the congruence f(x) == 0 (modm), where f(x) is a polynomial with in- tegral coefficients, can be reduced to solving f(x)==O (modpfi) (i= 1, ...,k). Also, solving a quadratic congruence can be reduced to solv- ing a linear congruence and a congruence of the form x 2 == a (mod m). H. Quadratic Residues When the congruence x 2 == a (mod m), where (a, m) = 1, is solvable, a is said to be a quadratic residue modulo m; otherwise, a is said to be a quadratic nonresidue modulo m. The following two conditions are necessary and sufficient for a to be a quadratic residue modulo m: (i) a is a quadratic residue with respect to each of the prime factors p (#2); and (ii) a== 1 (mod4) or a == 1 (mod 8) according as 41 m or 81 m. Given a prime number p and integer a prime to p, the Legendre symbol (a/p) is by definition 1 or -1 according as a is a qua- dratic residue modulo p or not. The value of this symbol is determined by a (mod p), and we have (ab/p) = (a/p)(b/p). Hence the symbol determines a tcharacter of (Z/pZ)* of order 2. Furthermore, the congruence (a/p)==d P - 1 )/2 (modp) holds (Euler's criterion). I. Reciprocity Law For odd primes p and q (p#q), the formulas (p/q)(q/p) = (-1 )((P-l)/2)((Q-1)/2), (-I/p) =( -1 )(P-I)/2, (2/p) = (-1 )(p2-1)/8 are called the law of quadratic reciprocity of the Legendre symbol, the first complementary law, and the second complementary law, re- spectively. These laws were conjectured by Euler and first proved by Gauss, who gave seven different proofs. We now have more than fifty different proofs of these laws. P. Bachmann (Niedere Zahlentheorie I (1902)) lists the 47 different proofs of the laws known at the time. Gauss's first proof was elementary; his second proof used quadratic forms. The latter has been reformulated utilizing the theory of quadratic fields [4]. The fourth proof used tGaussian sums, and the sixth proof used algebraic congruences with integral coeffi- cients. His seventh proof, contained in his posthumous works, used congruences of higher degree [4]. His fourth, sixth, and seventh proofs are related to the arithmetic of tcyc1o- 1114 tomic fields [4]. The third and fifth proofs are the most elementary and simple. They are based on Gauss's lemma: Let r l ,r 2 , ...,r(p-1)/2 be the residues of divisions of la, 2a, ..., (p- l)a/2 by an odd prime p, and let n be the number of these residues that are greater than p/2. Then we have (a/p)=( -1)". T. Takagi gave a simplified exposition of the third proof using geometric figures (1904). The same method was rediscovered by G. Frobenius (1914). When m is an odd integer such that m = :tI1;p1 i , (m,n)= 1, we call (n/m) = IT, (n/p,jei Jacobi's symbol. If m has no sq uare factor, it is a character of (Z/mZ)*. If we put sgnm= + 1 or -1 according as m>O or m<O, we have the following law of quadratic reciprocity of Jacobi's symbol and its complementary laws: If m and n are relatively prime odd numbers, then (m/n)(n/m) = ( -1 )((m-1 )/2)((n-1 )/2)+((,gnm-1 )/2)((sgnn-1 )/2), (-I/n) =( _1)(n-I)/2+(sgnn-l)/2, (2/n) = ( -1 )(n'-I )/4, where n* = ( _1)(n-I)/2 n. Furthermore, tKronecker's symbol, another generalization of the Legendre symbol, is used in the theory of quadratic number fields ( 347 Quadratic Fields). References [1J C. F. Gauss, Werke 2, Konigliche Gesell- schaft der Wissenschaften, Gottingen, 1863. [2J C. F. Gauss, Sechs Beweise des Funda- mentaltheorems liber quadratische Reste, Ostwald's Klassiker Nr. 122, Wilhelm Engel- mann, 1901. [3J L. E. Dickson, History of the theory of numbers I, Carnegie Institution of Washing- ton, 1919 (Chelsea, 1952). [4J P. G. L. Dirichlet, Vorlesungen liber Zah- lentheorie, Herausgegeben und mit Zusatzen verse hen von R. Dedekind, Vieweg, fourth edition, 1894 (Chelsea, 1969). [5J H. Hasse, Vorlesungen liber Zahlentheorie, Springer, 1950. [6J E. G. H. Landau, Vorlesungen liber Zahlen- theorie I, Hirzel, 1927 (Chelsea, 1946). [7J W. J. LeVeque, Topics in number theory, Addison-Wesley, 1956. [8J H. Rademacher, Lectures on elementary number theory, Blaisdell, 1964. [9J Z. I. Borevich and I. R. Shafarevich, Num- ber theory, Academic Press, 1966. (Original in Russian, 1964.) 
1115 [10J W. Sierpiilski, Elementary theory of numbers, Hafner, 1964. 298 (XV.6) Numerical Computation of Eigenvalues A. General Remarks Numerical computation of t eigenvalues and teigenvectors of a matrix provides a basic technique for the numerical solution of various eigenvalue problems. Roughly speaking, there are two kinds of method. In methods of the first kind, one determines the tcharacteristic polynomial p(A)= det(AI -A) (where I is the unit matrix) of A (or to give an algorithm to calculate the value of p(A) for an arbitrary A), then solves the algebraic equation p(A)= ° numerically to obtain the eigenvalues All (J.l= 1,2, ...) ( 301 Numerical Solution of Alge- braic Equations), and finally to one deter- mines the eigenvectors XII by means of the equations (AJ - A)x ll = 0. In methods of the second kind, one obtains eigenvalues and eigenvectors directly without resorting to the solution of an algebraic equation, relying in- stead upon repeated application of similarity transformations. (The power method does not fit into this classification, being a different approach altogether;  Section C.) In partic- ular, a real symmetric or complex Hermitian matrix is reduced approximately to a diagonal matrix. In this article we denote a given square matrix of order n by A = (a;) (i,j = 1, ..., n). B. The Jacobi Method The Jacobi method is an iterative technique for determining all the eigenvalues and eigen- vectors of a real symmetric matrix [7]. Before the advent of high-speed computers it was not considered practical, but at present it is one of the most compact and elegant methods. The following algorithm can be extended to Hermitian matrices by replacing tortho- gonal transformations by suitable tunitary transformations. Roughly, the method transforms a given matrix A = (a;) (a j ;= a;j; i,j = 1, "', n) into a diagonal one by repeated application of 2- dimensional rotations of the reference axes. We first put A(O)=A, U(O)=I, and compute A (I), A (2), ... , U(I), U(2), .., successively as follows: (1) Select an off-diagonal element of A (I) = 298B Numerical Computation of Eigenvalues (a\j!) with the maximum absolute value and denote it by a. (2) Compute 2d') sgn(d') -d ' )) tan (j - pq pp qq I d') -d') 1 + J ( d') _a(l» ) 2 +4 ( a(I» ) 2 H  H  " (where sgn X = 1, 0, or -1 according as x> 0, =0, or <0), cos (j =(1 + tan 2 (j) 1/2, and sin (j = tan 0 . cos (j; and form T(I) = (tj!), where t = t = cos 0, tW = 1 for i¥'p, q, -t=t'=sin(j, and tW=O for aU other (i,j). (3) Determine A(I+1) and U(1+1) by A(I+1) = T(I)' A(I)T(') (' denotes the transpose) and U(I+1) = U(I)T(I). In this process, T(I) represents an orthogonal transformation (rotation) in the plane spanned by the pth and qth coordinate axes such that a:I) = a;1) is nullified. If we put N(B)=L;.jb;] and M(B) = Lii'A] , then N(B) is invariant under an orthogonal trans- formation, so that N(A(I))=N(A). Further- more, since a:+I)=aW (i ¥'p, q) and (a;I»)2 + (aq+1))2 = (a)2 + (am 2 + 2(a)2, we have M(A(I+1))=M(A(I))-2(af Since a has the maximum absolute value among all the off- diagonal elements, we have (a)2;;" M(A(I))! (n 2 - n). Therefore M(A('+1))::::;(1-2j(n 2 -n))M(A(I)) ::::;(l-2j(n 2 _n))'+1 M(A) <M(A)exp( -2(/+I)j(n 2 -n)). It has been proved a fortiori [13J that, after M(A(I)) comes down to below a certain thresh- old value, the convergence of the iteration process becomes quadratic, i.e., there is a num- ber c determined by the order n of A and the arrangement of the eigenvalues of A such that M(A(I+n(n-1)/2)) <c(M(A(I»))2. Since the set of eigenvalues of A(I) coincides with that of A and the eigenvalues of an arbitrary symmetric matrix B can be made to correspond in a one- to-one way to its diagonal elements in such a way that the difference between an eigenvalue and the corresponding diagonal element is not greater than M(B)I/2, aW tends to ).; (i= 1, ..., n) as / tends to infinity. Moreover, as / tends to infinity, each column vector of U(I) = (um tends to the corresponding eigenvector, in the sense that LZ1 a;kUr) - AjuY)-,O. The number of arithmetic computations required to obtain A(I+1) and U(l+1) from A(I) and U(I) is at most proportional to n, so that for a given" > 0, the arithmetic required to reduce maxilaW - Ail below c;(M(A))I/2 is at most proportional to n 3 (because / is at most 
298C Numerical Computation of Eigenvalues proportional to n 2 ). On the other hand, the search for an off-diagonal element of A(I) with the maximum absolute value, if it is done by simply comparing all the elements, will require effort proportional to n 2 , so that the amount of work required by the searching process is proportional to n 4 . To bypass this searching process, the cyclic Jacobi method and the threshold Jacobi method are often used. The former method adopts as a the off-diagonal element for which q> p and 1= (p - 1) (n - p/2) +(q - p), that is, aW, a\2, "', a\nn- 1 ), at a:I), ... are adopted in this sequence. The latter method adopts as a off-diagonal elements in a sequence similar to the one above as long as they exceed a given threshold value; but if an element is less than that threshold value, then the next element in the sequence is a can- didate for adoption as a, where the thresh- old value is made to decrease gradually as the iteration process proceeds. However, the search for an element with the maximum absolute value can be done more effectively by taking account of the fact that only the matrix elements lying in rows p and q and in columns p and q change their values when we transform A(I) into A(I+1). In fact, we can record for each row the value as well as the position of the (off-diagonal) element with the maximum absolute value in that row. By so doing, the effort of searching for an off-diagonal element with the maximum absolute value can be reduced to something proportional to n on the average. C. The Power Method The power method is suitable for obtaining only the eigenvalue of maximum absolute value [6]. Let us assume that the eigenvalues Ie j, .., ,An of A are arranged so that I A 11 > IA 2 1  IA 3 1 ...  IAnl, with AI real, and de- note by Y1 the left eigenvector corresponding to AI (which means Y1 (Ar! - A) =0). Starting from an arbitrary (real) vector x(O) such that (Y1,X(O))#0 and x!)= 1 for a prescribed i o , we compute 8(0), 8(1),... and x(l), X(2),... by Ax(l)=8(1)x(I+1) (1=0, 1,2, ...;x!+1)= 1). Then we have lim Hoo O(l)=A 1 and lim Hoo x(l)=x 1 (the eigenvector corresponding to the eigen- value ArJ. The rate of convergence depends on IAI/A21 if the telementary divisor of A cor- responding to AI is linear, but in the case of a nonlinear elementary divisor, the convergence is too slow for practical purposes. If AI =1= A2 and IA11=IA21>IA31...IAnl, then the se- quences of 8(1) and x(l) computed by the for- mulas above do not converge but in general oscillate. However, from 8(1), 8(1+1), x(l), x(l+1), X(l+2) for a sufficiently large I, we can obtain 1116 approximate eigenvalues AI and A 2 as the two roots of the quadratic equation in 2: X!I) xl'+1) X!'+2) xY) 8(1)8(1+1) xy+ 1) 8(1) A xY+ 2 ) A 2 =0 (i and j arbitrary, i #j). The corresponding eigenvectors are given by XI =A 2 X(l+I)-X(l+2) and X 2 =A 1 X(H1)-X(l+2). Complex conjugate pairs of eigenvalues can be dealt with in this manner. This is useful also when IA 1 1 '=; IA 2 1. The extension to the case of more than two eigenvalues with the same maximum absolute value is obvious. In order to determine the remaining eigen- values by the power method we have to com- bine it with the deflation or transformation of matrices, as discussed later in this section and in the following section. The amount of com- putation depends on the arrangement of the eigenvalues of A and the required accuracy. We note that the multiplication of a matrix by a vector requires an amount of computation proportional to n 2 . Improvement in the approximations can be incorporated into the power method. If v = x p + 0(£) is an approximation to the eigenvector x p corresponding to an eigenvalue Ap of a real symmetric matrix A, then the Rayleigh quo- tient AR = (v, Av)/(v, v) affords a good approxi- mation to Ap- In fact, we have lAp - ARI = 0(£2). If Ai (i = 1, '" , n) is an eigenvalue of A and Xi the corresponding eigenvector, then P(A i ) (i = 1, "', n) is an eigenvalue of P(A) and Xi the corresponding eigenvector, where P() is a polynomial in  and  -1. This fact can be utilized to transform the magnitudes of eigen- values to accelerate the convergence of the power method, to separate eigenvalues with the same absolute value, or to obtain inter- mediate eigenvalues. The choice P(A)=A-c or (A - c) -1 is particularly useful, where C is an appropriate constant. In fact, an efficient algorithm known as inverse iteration exists for computing an approximate eigenvector when a good approximation to an eigenvalue is known. Given a trial eigenvector X cor- responding to a computed eigenvalue c, one computes an improved approximate eigen- vector Y by solving (A-cI)y=x or y=(A- cI)-1 x. Aitken's ,F-method is also efficient in accel- erating the convergence of the power method. When an eigenvalue-eigenvector pair is known, another eigenvalue-eigenvector pair can be computed using the process known as deflation. If an eigenvalue Ap and the corre- sponding eigenvector x p (and also the corre- sponding left eigenvector 'yp if necessary) of A are known, it is possible to "subtract" them 
1117 from A to get a problem containing only the remaining eigenvalues. Such deflations are often used in combination with the power method. The following are two examples of deflation methods. (1) Assuming that x and y are normalized in such a way that (y, x) = 1, form B = A - AXy. Then B has the same set of eigen- values and eigenvectors as A except for Aw The eigenvalue and the eigenvector of B corre- sponding to 2 of A are 0 and x, respectively. This kind of deflation process can be gen- eralized to the case of nonlinear elementary divisors, but that becomes somewhat more complicated. (2) After normalizing x so that its nth com- ponent xn is equal to 1, form B=(bi):bij=aij - xianj (i,j = 1, ..., n -1). Then B has the same set of eigenvalues as A except for AW If W k is the eigenvector corresponding to the eigen- value }'k of B, then the corresponding eigen- vector Xk of A is given by X ki = W ki + dkXi (i = 1, ..., n - 1), Xkn = db where d k is determined from L?::f niwki=(Ak-A)dk if Ak7'd w If 2k=2 and r=L?=1 aniW ki = 0, then we can put dk=O. If Ak = 2 and r # 0, A has a nonlinear elemen- tary divisor for Ak = )., and the Xk defined by Xki = wkjr and Xkn = 0 is a generalized eigen- vector of A in the sense that AX k = AXk + x. D. Transformation of Matrices There are a number of methods of transform- ing a given matrix A by means of a suitable similarity transformation A-.B=S-IAS into another matrix B for which it is easier to solve the eigenvalue problem. The Givens method [9J transforms a symmetric matrix A into a tridiagonal matrix B (i.e., a matrix such that bij = 0 for Ii - jl  2) by means of a matrix S which is the product of 2-dimensional rotation ma- trices. The Householder method [1OJ also trans- forms a symmetric A into a tridiagonal B by means of an orthogonal matrix S of special type, i.e., a reflection matrix I. 2uu* (u*u = I, u* = conjugate transpose of u), and the Lanc- zos method [8J transforms a general A into a tridiagonal B. To general matrices the follow- ing methods are also applicable: (1) The Dani- levskii method [IJ transforms A into its com- panion matrix B by repeated application of elimination operations. Here, row interchanges can be combined to increase numerical sta- bility [14]. (2) The Hessenberg method [3J transforms A into a B such that bij = 0 for i - j  2 with a triangular S. (3) The Givens method [9J transforms A into a B of the same form as in (2) by repeated application of 2- dimensional rotations. This method now tends to give way to the Householder method or to 298E Numerical Computation of Eigenvalues the elimination method with row interchange. All these methods require an amount of com- putation proportional to n 3 . In general, special treatment is necessary for the case of multiple eigenvalues. As an example, we explain the Givens method for general matrices. Let N = (n-l)(n-2)/2. For 1=0, 1, ...,N -1, choose (p, q) = (3, 2), (4,2), "', (n, 2); (4,3), (5,3), "', (n, 3); .. . ; (n - 1, n - 2), (n, n - 2); (n, n - 1) in this order. Using T(I) of the same form as in the Jacobi method, and setting tan e = aq-r!a!q-1 in this case, calculate A(O)=A, U(O)=I, A(l+!) = T(I)'A(I)T(I), U(l+I)= U(I)T(I) (1=0, 1, ...,N -1), and put B= A(N). Then we have bij=O for i- j 2. We can solve the eigenvalue problem for this simplified B and then retransform the eigenvectors of B thus obtained into those of A by means of U(N). It should be noted that the method of bisection based on tSturm's theorem is effectively used to solve the char- acteristic equation of a real tridiagonal ma- trix [14]. This method is remarkably stable numerically. It is used when the number of eigenvalues to be computed is small relative to the order of the given matrix. If all eigenvalues are desired, an alternative method such as the QR method ( Section F) is recommended. E. The Lanczos Method A more detailed exposition of the Lanczos method is now given. Let A be a given real matrix of order n. Pick two vectors C 1 and C 1 . Determine recursively the vectors c i +! and C i + h i = 1, "', n, that satisfy the following con- ditions: (i) Yi+1 C i +! = Ai -Cl.iC i =- Pi C i-1 == bi+l' Yi+l C i + 1 = A'Ci-a.iC i - f3J!i-l =b i + 1 , i = 2, ..., n; (ii) C i + 1 is orthogonal to C i - 1 and c i ; (iii) c i +! is orthogonal to C i - 1 and c" where CI." Pi' ii" and 11, are scalars, and where Yi and Yi are nor- malizing scalars. Actually, Cl. i = c;Ac,jc;c i = iii, Pi = Yi C ;C,jC;-1 Ci-j, Pi = Yi c icjci-1 C i - 1 , where c;c i , i = 1, ..., n, are assumed nonzero. It can be shown that C i + 1 is orthogonal to every c h I j  i, and that C i + 1 is orthogonal to every c j , 1 j  i (c n +! = C,,+! = 0). The conclusion is that the given matrix A is similar to the tridiagonal matrix H = (hi)' where hii=Cl.i (i= 1,..., n), h i . i +! = Pi+!, hi+1.i = Yi+1 (i = 2, ''', n). In fact, if C denotes the matrix whose jth column eq uals c j (j = 1, ''', n), one obtains C- 1 AC = H. Thus the eigenvalues of A are identical to those of H. In principle, the Lanczos method transforms a given matrix to a tridiagonal matrix similar to it if every C;Ci is nonzero (i = 1,..., n). If cic i does vanish for a certain i, one selects C 1 and c 1 again and restarts. On the other hand, if hi+1 = 0 one chooses an arbitra!y vector C i + 1 orthogonal to c 1 , ..., c i and if b i + 1 = 0 one selects an arbitrary vector C i + 1 orthogonal to 
298F Numerical Computation of Eigenvalues C 1 , ...,c j . In the actual numerical computation, the distinction between zero vectors and non- zero vectors is usually blurred by rounding errors. In the application of the LanclOs method one often observes the loss of orthogonality c;c j = ° (i # j). This usually results from cancel- lation errors in the computation of b j + 1 and bj+!' If the orthogonality is lost, C- 1 AC may significantly deviate from a tridiagonal matrix. As a practical means for preserving the in- dicated orthogonality one can reorthogonalize the vectors c j and c j . Indeed, one can add to the computed c j + 1 a linear combination of C 1 , .,., C j so that the sum is orthogonal to C 1, ..., c j , then take the sum as c j +! after pro- perly normalizing. A similar process applies to c j + 1 ' The Lanczos method is often applied in double precision. It has been suggested that one first reduces the given matrix to an upper Hessenberg form by using the Hessen- berg method with row interchange before applying the LanclOs method [15]. A further remark on the Lanczos method is in order. Recall that Yi is determined from Ac i , Cil' C'li-l, and {3il; Pi from C i , c i , C i - 1 , Cil' and )lj; and (Xi from Ac;, C j , and c j . This shows that the LanclOs method applied to a sparse matrix (a matrix whose elements are almost all zero) requires only a memory proportional to n. The Householder method, on the other hand, requires memory proportional to n 2 even for a sparse matrix. For a real symmetric matrix one can modify the LanclOs method for a general matrix so that C is orthogonal and H is real, symmetric, and tridiagonal (the details are omitted). F. The QR Method The QR method was discovered independently by 1. G. F. Francis and by V. N. Kublanov- skaya in 1961 [15]. The method has been improved and extended considerably since then. In the usual application to matrix eigen- value problems, the QR method provides the most efficient iterative process for finding all eigenvalues of a given matrix. The matrix is reduced by means of a similarity transforma- tion to Hessenberg form or to tridiagonal form before application of the QR method; the re- duction process may be effected by the House- holder method or by the elimination method with row interchange. (If the given matrix is a complex matrix, the latter is preferable.) The reason is that one step of the QR process applied to a full matrix is prohibitively expen- sive, requiring a number of operations propor- 1118 tional to n 3 , while one step of the QR process applied to a Hessenberg matrix requires a number of operations proportional to n 2 . We now describe one of the most useful versions of the QR method. Let A = Ao be a given matrix, and define a seq uence of matrices AI, A2' ... obtained from Ao as follows. At the ith (i = 0, 1, ...) step, choose an appropriate constant Sj, called an origin shift, according to the process described below, and decompose A j - sJ into the product of a unitary matrix Qj and an upper Hessenberg matrix Rj:A j - sJ = QjR j . The matrix A j +! is then defined by A j +! =RjQi+SJ =Qr AjQj. Hence A j +! is simi- lar to A j . The QR method is closely related to the power method and to the inverse power method. We describe this relationship in order to obtain an insight into the nature of the QR method. To this end, we first state a well- known theorem: Let A be diagonalizable, and let its eigenvalues Aj (i = 1, ..., n) have distinct moduli, say IA 1 1> IA 2 1 > .., > IAnl. Let X-lAX = diag{21' A2' ..., An}, and let X have an LU- decomposition X =LU, where Land U are, respectively, lower triangular matrix and an upper triangular matrix. Then the QR al- gorithm without an origin shift (Sj = 0, i = 0, 1,2, ... ) behaves as follows: a -->0 (p > q); ab-->Ap; a oscillates (p<q), p, q= 1,2, ...,n, i --> 00, where a denotes the (p, q)th element of Aj. In other words, A j approaches an upper triangular matrix as i --> 00 so that the diagonal elements of A j converge to the eigenvalues of A. The relationship between the QR algorithm and the power method is given by (QOQI'" Qj) (R j R j _ 1 ... Ro)=(A -soI)(A -SIl)... (A -sJ). Ifs o =sl =",=Sj=O, then the right-hand side reduces to A j + 1 , and since the product R i ... Ro is upper triangular, the first column of (L == QOQ1 ... Qj equals a scalar multiple of A j + 1 e 1 , where e 1 = (1,0,0, ... , 0)'. Therefore, by the power method, the first column of Qj converges to an eigenvector corresponding to the eigen- value AI of A that has the largest modulus, under a certain fairly mild condition. Since A j + 1 =Qr AQj, A j + 1 e 1 =Qr AQ j e 1 A1e1 for large i. In other words, the first column of Ai converges to the vector (Aj,O, 0, ...,0)' as i-->oo. The relationship of the QR method to the inverse power method and to the Rayleigh quotient is now explained. From the definition of the QR algorithm with an origin shift, we have Qj=[(A-SJ)-IJ*Rt. Since Rr is a lower triangular matrix, the nth column of Qj equals Q j e n =[(Aj-SJ)-IJ* Rten=(A -SJ)-1 '(a scalar multiple of en), where en = (0, 0, .., ,0, 1)'. This last process of obtaining the last column of Qj represents a process known as the inverse power method. If Sj is close to an eigenvalue of 
1119 Ai (and hence to an eigenvalue of A), the last column of Ai gives a good approximate eigen- vector of A corresponding to the eigenvalue under consideration. Now, if x is a given col- umn vector such that x*x = 1, the value of 2 which minimizes (Ax - h)*(Ax - h) is given by the Rayleigh quotient x* Ax. If x(x*x = 1) happens to be an eigenvector of A, the Ray- leigh quotient equals an eigenvalue of A. If we take x = en, the corresponding Rayleigh quotient equals ann' Therefore if we take the (n, n)th element of Ai as the origin shift Si' then Si can be regarded as the best approximation to an eigenvalue of Ai (hence of A) when en is taken as an approximate eigenvector of Ai' in the sense that A = Si minimizes the functional (Aie n - Aen)*(Aie n - Ae n ). Under the same condition as in the preceed- ing theorem, the rate of convergence of the QR method with an origin shift is given as fol- lows: a (n  p  q  1) is asymptotically pro- portional to (Ap/Aq)' for Si=O (i=O, 1,...) (no origin shift); and with the origin shift Si' the behavior of a at the ith step is determined by (A p - SJ/(A q - sJ If Si =: An ( = the eigenvalue of A with the least modulus), each element in the nth row of Ai+! except ain+1) exhibits rapid decrease in modulus. A natural and practical choice of the origin shift Si is given as follows: (i) for a real tridiagonal matrix, Si is taken to be that eigenvalue of the 2 x 2 matrix situated at the lower right corner of Ai that is closer to a [14]; (ii) for a real upper Hessenberg matrix, the two eigenvalues of the 2 x 2 matrix situated at the lower right corner of Ai as Si and Si+! [14]; (iii) for a complex upper Hessenberg matrix, Si is chosen as in (i) [17, COMQR]. The QR method with origin shift would eventualIy make each element in the nth row, except aiJ, smaller in modulus than a pre- scribed positive number. At this stage of itera- tion, aJ is taken as an approximate eigenvalue of A. The QR method is then applied anew to the (n-l) x (n-l) matrix obtained from Ai by deleting the nth row and the nth column of Ai, and another approximate eigenvalue of A is obtained. The method proceeds similarly until all the eigenvalues of A are computed. For maximum accuracy the eigenvalues of the given matrix should be computed in the order of increasing modulus. A word of caution is in order. When the given matrix A has elements of greatly varing modulus, rearrangement of elements of A may be necessary before apply- ing the QR method with an explicit origin shift, where Ai - sJ (i = 0, 1, .., ) is explicitly computed. The reader is referred to [14-16] for details of the QR method. 298 G Numerical Computation of Eigenvalues G. Generalized Eigenvalue Problem Ax = 2Bx An eigenvalue problem of the type Ax = 2Bx is called a generalized eigenvalue problem and is often encountered in applied mathematics. A necessary and sufficient condition for A to be an eigenvalue is det(A-AB)=O. If B- 1 exists, then AX=ABx is equivalent to B- 1 Ax=h and hence has n eigenvalues, where n is the order of A. If B- 1 does not exist, the eigenvalue prob- lem AX=ABx has at most n-l eigenvalues. We restrict ourselves to one of the most im- portant cases: that where A is real and sym- metric and B is real, symmetric, and positive definite. In this case one could solve the prob- lem by reformulating it as B- 1 AX=AX, where B- 1 A is explicitly computed. However, B- 1 A is not in general symmetric. Moreover, when B has eigenvalues of widely different moduli, the elements of B- 1 may have widely different moduli as well, which would in turn make the computation of those eigenvalues of B- 1 A of smaller moduli difficult. An efficient and nu- merically stable method is known which obvi- ates the aforementioned difficulty by exploit- ing the symmetry of A and B. An outline of this procedure is now given. Since B is positive definite, a lower triangular matrix L exists such that LL' = B. This is called the Cholesky decomposition of B. The elements of L can be computed by equating the correspon.ding ele- ments in LL' = B. The eigenvalue problem Ax =ABx is now equivalent to L -1 A(L')-1 Y= AY with y=L'x, where the matrix L -IA(L')-1 = P is computed in two stages by solving LX =A and PL'=X. In this last equation it is enough to compute the upper right half of P, because P is symmetric while X is not gener- ally symmetric. The upper right half of P can be computed by equating the corresponding elements in PL'=X. The eigenvalues A of AX=ABx are given by the eigenvalues of the matrix P. Other types of eigenvalue problems, such as y'AB=Ay', BAY=AY, and x'BA=h', often appear in practice, where A is real and symmetric and B is real, symmetric, and posi- tive definite. By using the Cholesky decomposi- tion of B, one can reduce anyone of these eigenvalue problems to the ordinary eigen- value problem for a real symmetric matrix [4]. If A and B are general matrices in AX=ABx, the following method is known to be effective [18]. First, reduce B to an upper triangular matrix by applying n -1 Householder trans- formations from the left. This reduces the eigenvalue problem AX=ABx to the case where B is upper triangular. Next, apply to A a sequence of plane rotations of Givens type from the left, thereby reducing the eigenvalue problem AX=ABx to the case where A is an 
298 Ref. Numerical Computation of Eigenvalues upper Hessenberg matrix and B is an upper triangular matrix. Then apply the QR method to B- 1 Ax = AX without explicitly computing B- 1 A to reduce the eigenvalue problem Ax = i.Bx to the case where A and B are both approximately upper triangular. The eigen- values of Ax = 2Bx are then easily computed as ratios of corresponding diagonal elements. This method is called the QZ method [18]. A collection of about 50 excellent FOR- TRAN subroutines for various types of ma- trix eigenvalue problems is contained in [18]. These subroutines are in most part trans- lations from ALGOL procedures given in [14]. References [IJ H. Wayland, Expansion of determinantal equations into polynomial form, Quart. Appl. Math., 2 (1945), 277-306. [2J P. S. Dwyer, Linear computation, Wiley, 1951. [3] R. Zurmiihl, Matrizen, Springer, 1950. [4J L. Collatz, Eigenwertaufgaben mit techni- schen Anwendungen, Akademische Verlag., 1949. [5] R. V. Mises and H. Geiringer, Praktische Verfahren der Gleichungsauflosung, Z. Angew. Math. Mech., 9 (1929), 58-77,152-164. [6J E. Bodewig, Matrix calculus, Interscience, 1956. [7] C. G. J. Jacobi, aber ein leichtes Verfah- ren, die in der Theorie der Si.ikularstorungen vorkommenden Gleichungen numerisch auf- zulosen,1. Reine Angew. Math., 30 (1846), 51- 95. [8] C. Lanczos, An iteration method for the solution of the eigenvalue problem of linear differential and integral operators, 1. Res. Nat. Bur. Standards, 45 (1950), 255-282. [9J W. Givens, Computation of plane unitary rotations transforming a general matrix to triangular form, SIAM 1., 6 (1958), 26-50. [10] J. H. Wilkinson, Householder's method for symmetric matrices, Numer. Math., 4 (1963),354-361. [1\] P. A. White, The computation of eigen- values and eigenvectors of a matrix, SIAM 1., 6 (1958),393-437. [12] S. H. Crandall, Engineering analysis, McGraw- Hill, 1956. [13] A. Schonhage, Zur quadratischen Kon- vergenz des Jacobi-Verfahrens, Numer. Math., 6 (1964), 410-412. [14J 1. H. Wilkinson and C. Reinsch, Hand- book for automatic computation II, Linear algebra, Springer, 1971. [15] 1. H. Wilkinson, The algebraic eigenvalue problem, Clarendon Press, 1965. 1120 [16J G. W. Stewart, Introduction to matrix computations, Academic Press, 1973. [17J B. T. Smith, 1. M. Boyle, 1. 1. Dongarra, B. S. Garbow, Y. Ikebe, V. C. Klema, and C. B. Moler, Matrix eigensystem routines- EISPACK guide, Springer, second edition, 1976. [18J C. B. Moler and G. W. Stewart, An al- gorithm for generalized matrix eigenvalue problems, SIAM J. Numer. Anal., 10 (1973), 241-256. 299 (XV. 7) Numerical Integration A. Interpolatory Integration Formulas Numerical integration is a method of finding an approximate numerical value of a definite integral of a given function f(x). Usually the integral S%f(x)dx or S%f(x)w(x)dx (w(x) is the weight function) is approximated by a linear combination L?=1 f(x') of the values of the integrand at the points X 1 'X 2 ' ...,x n . Integra- tion formulas are divided into two groups, the interpolatory formulas and the formulas based on variable transformation. In order to obtain an interpolatory formula, we interpolate over the integrand f(x) at n poin ts XI, X 2' ... ,X n by means of the t Lagrange interpolation polynomial of degree n -1, and then integrate the polynomial over [a, b]. Depending on the selection of the points Xi and the weights , we have several kinds of formulas. (1) Newton-Cotes Formulas. We assume w(x) =constant and Xi = Xo + ih (i= 0,1, "', n). The weights  are so determined that the value of the integral can be calculated accurately if the integrand f(x) is a polynomial whose degree does not exceed n. There formulas are called the Newton-Cotes formulas: Df(x)dx=(fo + frJh/2 for n= 1 (trapezoidal rule), Df(x)dx= (fo +4f1 + f2)h/3 for n= 2 (Simpson's 1/3 rule), and S;gf(x)dx =(fo +3f1 +3f2 + f3)3h/8 for n = 3 (Simpson's 3/8 rule). The truncation errors of these three formulas are given by h 3 f(2)()/12, h 5 J<4)()/90, 3h 5 f(4)()/80, respec- tively, where  is a number in the interval of integration and f(i) denotes the ith derivative of f (differentiability is assumed). For an even n, the polynomial of degree n + 1 can also be integrated accurately by these formulas. In Table 1 the coefficients A and B i of the Newton-Cotes formulas hA i=o BJ(xJ, h = (x n - xo)/n, and the coefficien ts C of the error 
1121 299 A Numerical Integration Table 1 n A Bo B 1 B 2 B3 B4 B5 B6 B7 B8 C 1 1/2 I 1 -1/12 2 1/3 1 4 1 - 1/90 3 3/8 1 3 3 1 -3/80 4 '1/45 7 32 12 32 7 - 8/945 5 5/288 19 75 50 50 75 19 -275/12096 6 1/140 41 216 27 272 27 216 41 -9/1400 7 7/17280 751 3577 1323 2989 2989 1323 3577 751 -8183/518400 8 4/14175 989 5888 -928 10496 -4540 10496 -928 5888 989 -2368/467775 term ChP+1pP)(), where p=n+ 2 if n is even and p = n + 1 if n is odd, are given. When the interval [a, b] of integration is large, we usually divide it into small subinter- vals and apply formulas for small n for each part rather than formulas for large n for the whole interval. For example, if the interval is divided into m equal subintervals, we get the following formula by applying the trapezoidal rule for each subinterval: f f(x)dx=h(Uo+ fm)/2+U1 + f2 +... + fm-rJ), where xo=a, xm=b, h=(b-a)/m, with trun- cation error (b-a)3p2)/(12m 2 ). (Here, as in the rest of the article, Pi) stands for f(i)(), with differentiability assumed as before.) By apply- ing Simpson's 1/3 rule we obtain the formula f f(x)dx 2 =3 h (Uo + f2m)/2 + 2U1 + f3 +... + f2m-rJ +( f2 + f4 +... + f2m-2))' where xo=a, x 2m =b, h=(b-a)/2m, with truncation error (b _a)5 f(4)/180(2m)4. For the integral of a periodic analytic func- tion over a single period the trapezoidal rule with equally spaced points gives the best result asymptotically, as the number of points tends to infinity. Newton-Cotes formulas can also be ob- tained by integrating over the interval [a, b] the interpolation polynomial for equally spaced points. The formulas mentioned so far are called closed formulas since they use values at the two endpoints of the interval. We can also use open formulas, which do not use values at the ends. For example, we have gf(x)dx = 4h(2f l - f2 + 2f3)/3 with trun- cation error 14h 5 f(4)/45. Open formulas are useful for numerical solution of differential equations. There are also formulas that in- clude values outside the interval, for example, (f-1 + 13fo+ 13fl - fz)h/24 for J:f(x)dx. Applying these formulas to the m subintervals of [a, b], we obtain a trapezoidal formula with the correction term hU1 - f-rJ/24 + hUm- fm+rJ/24=h(No+N-rJ/24-h(Nm-1 + Nm)/24, where l1fi means fi+1 - fi. (2) Chebyshev Formulas. The Chebyshev for- mulas are a family of integration formulas in which all the weights W; of L?, W;f(xJ are equal, while the abscissas Xi are chosen so that the integral can be evaluated exactly when f(x) is an arbitrary polynomial whose degree does not exceed n. When Jd(x)dx  WLf=d(x;), it is easy to see that W = 2/n since the right-hand side must be equal to the left-hand side when f(x) == 1. It is known that the abscissas Xi for n";; 7 and n = 9 are real, while for n = 8 and n  10 at least one of the abscissas becomes complex. It is easy to see that Chebyshev for- mulas are interpolatory. (3) Gauss Formulas. In the Gauss formulas both the weights H-; and the abscissas Xi are chosen so that we obtain the accurate value of the integral when the integrand is any poly- nomial whose degree does not exceed 2n - I. If we put Il(x) = n?1 (x - x;), an arbitrary poly- nomial of degree 2n - 1 can be expressed in the form n Il(x) n-' f(x) = k'I:. 1 (x _ Xi) Il'(xd fk + Il(x) Jo akx\ where f(xd= fk and the first term is the tLa- grange interpolation polynomial. By the as- sumption that the integral of f(x) with weight w(x) equals Lk1 h, we obtain  = f(w(x)Il(x)/(x - xdIl'(xd)dx and the relations f w(x)Il(x)xkdx=O, k=O, 1, ...,n-1. Accordingly, the abscissas Xi are determined as the roots of the polynomial Il(x) of degree n that is orthogonal to Xk (k = 0, 1, '" ,n - 1) with respect to the weight function w(x). The following are typical examples of inte- gration formulas of Gaussian type. (i) Gauss integration formulas (in the narrow sense). For w(x)= I and the interval [ -1, 1], Il(x) is the tLegendre polynomial Pn(x)=(1/2 n n!)d n (x 2 - 1 J"/dx n . The error is (n!)42 2n +1 f(2n)/(2n + 1)«(2n)!)3. 
299 B Numerical Integration (ii) Gauss-Laguerre formulas. For the weight w(x)=exp( -x) and the interval [0,00), Jl(x) is the tLaguerre polynomial Ln(x)= (expx)dn(xnexp( - x))/dx n . (iii) Gauss-Hermite formulas. For the weight function w(x)=exp( _x 2 ) and the interval (-00,00), Jl(x) is the tHermite polynomial Hn(x)=( -1)n expx 2 'dnexp( -x 2 )/dx n . (iv) Gauss-Chebyshev formulas. For the weight function w(x)=(1-X 2 )-1/2 and the interval [-1,1], we use the Chebyshev poly- nomial T,,(x)=r(n-I)cos(narccosx). In this case, the H-; are al1 eq ual to n/n. From the definition of  we see that the Gauss formulas are interpolatory. (4) Clenshaw-Curtis Formulas. Although the Gauss formula is in general more accurate than the Newton-Cotes formula with the same number of points of interpolation, the points of interpolation for a Gauss formula of any order are distinct from those of any other order except the point zero, which appears in all formulas of odd order. The Clenshaw-Curtis formulas are interpolatory, with the points chosen so that the distribution of the points is similar to that of the Gauss formula and such that, in proceeding from a computation of order n to that of order 2n, al1 the function values evaluated in the former computation be used in the latter. For w(x)= 1, the inter- val [ -1,1], and even n, the points X k and the weights  are given by kn xk=cos-, k=O, 1,... ,n; n 1 W o = H-;. = ----z-; n -1 4 n/, 1 2njs = W n - s =- I ----;zcos, njO 1-4} n n s=I,2, ...,-. 2 L" meas that the first and the last terms in the sum are to be halved. There are some different types of Clenshaw-Curtis formulas depending on the selection of the points X k [1]. B. Integration Formulas Based on Variable Transformation If the integrand has some singularity at the endpoint, any interpolatory formula based on interpolation with a polynomial does not give a good result. In such cases integration for- mulas based on variable transformation are quite effective. 1122 (1) IMT Formula. The tEuler-Maclaurin formula is given by f b { I n-I I } f(x)dx=h - f(a) + I f(a+kh)+- f(b) a 2 k1 2 m h 2r B - I {f(2,-j)(b) F1 (2r)! - P2'-I)(a)} + Rm' b-a h=- n ' h2m+l f 1 { n-I } Rm= 2 ' B 2m (t) I p2m)(a+kh+ht) dt, (m). 0 k-O where the Bn(t) are tBernoulli polynomials of degree n and the Bn are tBernoulli num- bers (Bo = 1, B[ = -1/2, B 2 = 1/6, B3 =0, B4 = -1/30, ...). This formula suggests that if the higher derivatives of the integrand vanish at the both endpoints, the error of the trape- zoidal rule with equally spaced points be- comes very smal1. The IMT formula is based on the idea of transforming the variable x of Sb f(x)dx in such a way that al1 the deriva- tives of the new integrand vanish at both end- points by taking x=cp(t), cp(t)=K- 1 Shl/J("r)d7:, K = SbI/J(7:)d7:, 1/J(7:)=exp( -7: -I -(1-7:)-1). Then the trapezoidal rule with h = l/n is applied to the transformed integral to obtain (1/Kn)L;11I/JUln)f(cpUln)) [1]. The asymptotic expression of the error for the IMT formula is proportional to exp( - C) with a positive constant C. (2) Double Exponential Formula. The trape- zoidal rule with equal1y spaced points ap- plied to the integral of an analytic function over ( -00,00) gives in general a result of high accuracy. The double exponential for- mula is based on the idea of transforming SI f(x)dx to So:.",f(cp(t»)cp'(t)dt with x = cp(t) = tanh(tnsinh t) and applying the trape- zoidal rule with a mesh size h, which results in hL_",j(cp(nh))cp'(nh) [4]. The name of the double exponential formula is attributed to the decay of cp'(t) at t---> ::1::00, which is approxi- mately proportional to exp( -Cexpltl) with a positive constant C. The transformations x = exp(nsinh t) and x = sinh(tnsinh t) give the double exponential formulas for the infinite integrals S;;' f(x)dx and So:.",f(x)dx, respec- tively. In the actual computation the infinite summation is truncated at appropriate upper and lower bounds. The asymptotic expres- sion of the error for the double exponential formula in terms of the number N of the sam- pling points actual1y used is proportional to exp( - CN /log N) with a positive constant C. The IMT formula and the double exponential 
1123 formula are robust against the singularities at the endpoints. C. Automatic Integration By an automatic integration scheme we mean a computer program for numerical integration of S:f(x)dx in which the user gives the limits of integration a and b, a subroutine for com- puting f(x), and an error tolerance 8. Then the program gives a value of the integral which is expected to be correct within the tolerance 8. Usually in an automatic integration scheme the mesh size is halved until the desired ac- curacy is attained. Automatic integration schemes are classified into two groups, non- adaptive schemes and adaptive schemes. In a nonadaptive scheme, the sequence of integra- tion points is chosen according to some fixed rule independent of the shape of the inte- grand. Newton-Cotes formulas, Clenshaw- Curtis formulas, IMT formulas, and double exponential formulas can be used as base formulas for nonadaptive schemes. From the historical point of view, Romberg integration should be mentioned; it is a kind of nonadaptive automatic integrator. Consider an integral 1= S:f(x)dx. Divide the interval [a, b J of integration into 2 k subintervals and apply the trapezoidal rule with the mesh size h=(b-a)/2 k , which we denote by Td k ). Then, starting from the values obtained for Td k ), k = 0,1, ... , we compute the sequence 4 m T(k+1) _ T.(k) T.(k) m-' m-l m 4 m -1 m=I,2,.... From the tEuler-Maclaurin formula the as- ymptotic error for the trapezoidal rule TJk) can be expressed as 1- Tdk)=C 1 h 2 + c 2 h 4 + ... + c m h 2m + "', where C m = const x (f(2m-1)(b)_ f(2m-l)(a)) does not depend on h. If we com- pute Tt) =(4Tdk+1) - Td k »)/3 using the values Tdk+1) and Td k ), then we see that the asymp- totic error expression of Tl k ) becomes 1- Tt) = Ch4 + c 3 h 6 + .., + c;"h 2m +.... Romberg inte- gration is based on the idea of eliminating the term with h 2m in the expression of the error by successive application of Tk). This is an application of Richardson's extrapolation procedure. In the adaptive scheme, the points are chosen in a way that depends on the shape of the integrand. The Newton-Cotes formula of order 8, for example, is used as the base for- mula for the adaptive scheme. D. Approximate Multiple Integration If a region is a product region, such as a rect- angular parallelepiped, a product rule ob- 299 Ref. Numerical Integration tained by forming the product of I-dimen- sional formulas is useful. For integrals over a cube or a phere there are monomial rules which are exact for a certain family of mono- mials. These rules can be used for integrals of dimension lower than 5 or 6. For integrals of higher dimension only methods based on sampling make sense. E. Numerical Differentiation In order to find the numerical value of the derivative f = f'(x p ) at a point x p from the tabulated values fk = f(x k ), we usually use the derivative of the tLagrange interpolation formula. This gives JI' ( ) 1 f= I x P , f p + I -fb k""p(xp-XdJl(xk) k""pXp-Xk where Jl(x) = nl (x - xd. When we compute the derivative of a func- tion which can be evaluated at any point in a given interval, the approximation f(x+h)- f(x-h) f'(x) 2h is useful; similarly, we can use f (x + h) - 2f(x) + f(x - h) f"(x) 2 . h It must be noted that, as h tends to zero, the difference f(x + h) - f(x - h) comes to contain fewer significant digits, so that it is meaning- less to carry out {J(x + h) - f(x - h)} /2h be- yond a reasonable value of h. References [1J P. J. Davis and P. Rabinowitz, Methods of numerical integration, Academic Press, 1975. [2J P. J. Davis and I. Polonsky, Numerical interpolation, differentiation and integration, Handbook of Mathematical Functions, M. Abramowitz and I. A. Stegun (eds.), Nat. Bur. Stand. Appl. Math. Ser., 55 (1964),875-924. [3J V. I. Krylov, Approximate calculation of integrals, translated by A. H. Stroud, Mac- millan, 1962. [4J H. Takahasi and M. Mori, Double ex- ponential formulas for numerical integration, Publ. Res. Inst. Math. Sci., 9 (1974), 721-741. [5J J. R. Rice, A meta-algorithm for adap- tive quadrature, J. ACM, 22 (1975),61-82. [6J A. H. Stroud and D. H. Secrest, Gaussian quadrature formulas, Prentice-Hall, 1966. [7J A. H. Stroud, Approximate calculation of multiple integrals, Prentice-Hall, 1971. 
300 Ref. Numerical Methods 300 (XV.1) Numerical Methods In the earlier history of mathematics, the de- velopment of methods of numerical calcula- tion was one of the main purposes of research. Until the beginning of this century, logarith- mic calculation played a central role in numer- ical calculation, and the main topic of this field was to make tables of values of functions. The digital electronic computer ( 75 Computers), which made its first appearance in the 1940s and has been developing at an exponential rate, has caused drastic changes in numerical technique. Problem solving by numerical methods has now become one of the fundamen- tal means of research in the physical sciences and engineering, and also in the social sciences and humanities. In this article we give exam- ples of the changes in numerical methods brought about by the availability of digital computers and portable calculators. Computers may be effectively utilized for calculating individual values offunctions. This has led to the reexamination and, in some cases, modification of approximate formulas for evaluating functions ( 142 Evaluation of Functions). For familiar functions, such as tlogarithmic, texponential, and ttrigonometric functions, tables have been almost completely replaced by function keys on electronic cal- culators. Microprogramming algorithms for obtaining values of these functions have also been devised [1]. A complex function-theoretic error-estimation method for use with numer- ical integration formulas is given in [2]. In this method, graphical outputs of the com- puter are utilized. For problems in which the existence and uniqueness of solutions have been established, as for algebraic equations and ordinary differential equations, fairly good numerical calculation methods and their error estimates have been established ( 301 Nu- merical Solution of Algebraic Equations; 303 Numerical Solution of Ordinary Differen- tial Equations; [3,4J). A method for estimating arithmetic taccumulation errors for operations involving finite numbers of digits has been systematized ( 138 Error Analysis; [5J). It is not unusual nowadays for linear equa- tions with 10,000 or more unknowns to be solved. As new computing systems, such as the virtual memory system and the vector oper- ation system, come into existence, new al- gorithms are examined; a numerical method that is optimal for today's technology may wel1 be suboptimal for the next generation of computers. General-purpose program packages of linear problems, including eigen- value problems, have been developed ( 298 1124 Numerical Computation of Eigenvalues; [6J). Partial differential equations seem to have become more familiar because of the visualiza- tion of their numerical solutions in graphical computer outputs. In [7J, which appeared much earlier than computers, partial difference equations for the fundamental linear prob- lems in mathematical physics were discussed with a suggested variational treatment ( 304 Numerical Solution of Partial Differential Equations; [8J). The tfinite element method, which started as a calculation technique in structural mechanics and is based on the tcal- culus of variations, is widely accepted as an efficient approximation method for partial differential equations [9,10]. The term tsimu- lation is used often to describe procedures in which partial differential equations describing time-dependent phenomena are discretized; the resulting difference system can be solved for long periods of time [11]. Numerical analysis has heretofore long been considered to be the only numerical method (for error analysis in particular), and has been carried out mainly by means of the discretiza- tion of equations. Nowadays, however, mathe- matical modeling, taking into account both the phenomena to be described and the capabi- lities of the computers to be used, has become an important numerical method. References [IJ J. S. Walther, A unified algorithm for elementary functions, Conference Proceedings, Spring Joint Computer Conference, AFIPS 38 (1971), AFIPS Press, 1971, 379-385. [2J H. Takahashi and M. Mori, Error estima- tion in the numerical integration of analytic functions, Rep. Computing Centre, Univ. of Tokyo, 3 (1970),41-108. [3J P. Henrici, Elements of numerical analysis, Wiley, 1974. [4J P. Henrici, Discrete variable methods in ordinary differential equations, Wiley, second edition, 1968. [5J J. H. Wilkinson, Rounding errors in alge- braic processes, HMSO, 1963. [6J J. H. Wilkinson and C. Reinsch (eds.), Handbook for automatic computation 2, pt. II, Linear algebra, Springer, 1971. [7J R. Courant, K. Frienrichs, and H. Lewy, On the partial difference equations of math- ematical physics, IBM 1. Res. Develop., 11 (1967),215-234. (Original in German, 1928.) [8J F. John, Lectures on advanced numerical analysis, Gordon & Breach, 1967. [9J G. Strang and G. J. Fix, An analysis of the finite element method, Prentice-Hal1, 1973. 
1125 [IOJ P. G. Ciar1et, The finite element method for elliptic problems, North-Holland, 1978. [11J R. J. Roache, Computational fluid dy- namics, Hermosa, 1972. 301 (XV.S) Numerical Solution of Algebraic Equations A. General Remarks Methods for finding roots of an equation f(X) =0 (where f(x) is not necessarily a poly- nomial, but is assumed to be a function with some regularity) by numerical calculation can in general be divided into the following two types: (i) The first has as its goal the finding of good approximate values of the roots; examples are the BernoulIi method (Section J) and the Graeffe method (Section N). (ii) The second improves the accuracy of estimates of the roots; an example is the Newton-Raphson method (Section D). The methods belonging to (i) and (ii) can be used separately. However, in (i) convergence of the approximations may be excessively slow when a pair of nearly equal roots is present, and the size of numbers in- volved in the calculation may grow exponen- tially as we proceed through the iterations. In (ii) convergence of the approximate values is not assured unless the initial approximate value is suitably chosen. Accordingly, when an electronic computer is utilized, it is advisable to combine both types of method. Because of the development of computers, solutions with global convergence have become important [1,2,15,17,21]. B. Successive Substitutions When an equation f(x) = ° is transformed into x = F(x) and the roots are obtained by iterative calculation of Xi+1 =F(x;) (i=0, 1,2,...), suffi- cient conditions for its convergence are as follows: Let one of the roots of the equation be rx. Then Xi+! -rx=F(x;)-rx, and therefore (X i + 1 -rx)/(xi-rx)=F'() (Xi<<rx or Xi> >rx). Accordingly, rx converges monotonically ifO< F'()< 1, while it converges with oscillation ifO>F'(» -1. IfIF'()I> 1 and F- 1 (x) de- notes the inverse function of F(x), the iteration X i + 1 =F-1(x;) converges. To define the speed of convergence by iterative processes the follow- ing notion of order is utilized: When lim i _ oo Xi = rx, the speed of convergence of the sequence {x.} is said to be ofthe kth order if lim i _ oo (Xi+! - rx)/(x i - rx)k = c # 0. Necessary and sufficient 301 D Numerical Solution of Algebraic Equations conditions for the speed of convergence of {xJ to be of the kth order are rx = F(rx), F'(rx) = F"(rx) = ... =F(k-I)(rx)=O, F(k)(rx)#O. The main iterative processes used in numer- ical calculation are described in the following sections. C. Regula Falsi Regula falsi (or the method offalse position) is a process of obtaining the real root rx of an equation f(x) =0 by approaching the root from both sides. The calculation procedure is as follows: Assume that f(x p ) > ° and f(x q ) < 0, where x p and x q are approximate values of rx such that rx lies between them. A new approxi- mate value x is then obtained from x=(xqf(x p ) -xpf(xq))/(J(x p ) - f(x q )). If f(x) > 0, then x--+ x p ' and if f(x)<O, then x--+x q , and the pro- cedure is repeated (the symbol --+ means re- placement). The conditions and speed of con- vergence of regula falsi are as follows: Let F(x) = (xqf(x) - xf(xq))/(J(x) - f(x q )); then F'(rx) = (J(x q ) + (rx - xq)f'(rx))/f(x q ). If f' and f" exist and are continuous near rx, then f(x q ) = f(rx) + (Xq-rx)f'(rx)+(1/2)(xq-rx)2f"(), where rx> >Xq or rx<<Xq' Accordingly, assume that F'(rx)=(1/2)(xq-rx)2f"()/f(xq)#0. Then IF'(x)1 < 1 can be satisfied if X and x q are near rx. Therefore, if the initial value is appropriate, the convergence is of the first order. In regula falsi, only calculation of f(x) is necessary, while that of f'(x) is unnecessary. Furthermore, if f(x) =0 has nearby real roots, i.e., rx, {3,... close to each other and f'(x) small near the roots, there can be no mistake such as the neighbor- ing root {3 being obtained in the process of finding the root rx by this method. This method belongs to the inverse linear interpolation method. This kind of inverse interpolation includes MulIer's method [3], which uses tLagrange's interpolation formula, the Torii- Miyakoda method [4J, which uses tHer- mite's interpolation formula, and Whittaker's method [5J, which uses tStir1ing's interpola- tion formula. These methods converge more rapidly but have more complicated formulas than inverse linear interpolation. The Sturm method (in which the interval where the roots exist is narrowed by tSturm's theorem) and the Horner method (which obtains the decimal part digit by digit), both used to obtain the real roots of a high-order algebraic equation f(x) = 0, are also of this type. D. The Newton-Raphson Method In obtaining the real roots of an equation f(x) =0, the Newton-Raphson method (or the New- 
301 E Numerical Solution of Algebraic Equations ton iterative process), which converges rapidly, is used when f'(x) =1= ° is computable. Let Xi be a sufficiently close ith approximation of the root IJ(; then x i +1 = Xi - f(x;)/f'(x;) is closer than Xi to the true solution. The process is repeated untillx i + 1 -xii is smaIl enough. Conditions and the speed of convergence are as foIlows [6,7]: Let F(x)=x- f(x)/f'(x); then F'(x) = f(x)f"(x)/(f'(x)f and F'(IJ() =0. Accordingly, F"(IJ() =1=0 if f'(IJ() =1=0 and f"(IJ() =1=0, and the convergence is of the second order if it is pos- sible to determine an appropriate approxi- mate value Xi that is close to IJ( and satisfies I F'(x;) I < 1. In particular, if f(xo)f'(x o ) =1=0, ho = - f(xo)/f'(x o ), I f"(x) I  M, and If'(x o )I ? 2lholM, then every Newton approximation Xi starting with Xo is contained in the interval I = [XI -Ihol, XI + Iholl the equation has only one root IJ( in I, and xi-+IJ(. Besides, IIJ( -x'+ll  Mlxi-xi-rl2/21f'(x,)l, i = 1,2,.... With regard to convergence and evaluation of the error, including roundoff error in practical computations, M. Urabe's studies should be consulted [8,9]. In general, the convergence is of the third order in the Newton-Raphson method, which uses not only f'(x) but also f"(x) [10]. When f'(x) =0, we must assume f"(x) =1=0 and use the value f"(x). With regard to this case, the study by S. Hitotumatu should be consulted [11]. In addition, W. Kizner (SI AM J. Appl. Math., 12 (1964)) reported an itera- tive process in which the convergence is of the fifth order without using any derived function higher in order than f"(x). The essential part of his method lies in a numerical integration of the integral part by the tRunge-Kutta formula, based on the fact that if X 1 is the first approxi- mate value of a root x of f(x) = 0, then x= f O dx df +x 1 . !(x\) df There may be several iterative processes for solving f(x) = 0, even if the order k of the con- vergence is fixed. For example, in obtaining the positive root of f(x)=x 2 -a =0 (a> 0), i.e., the square root IJ( = a 1 / 2 , both iterative pro- cesses X i + 1 = (Xi + a/x;)/2 and X i +1 = 2xt /(3 x? - a) give convergence of the second order. In obtaining the real root of {(x)=x 3 -a=0, i.e., the cube root IJ(=a 1 / 3 , Xi+1 =xj+(a/x;-x,)/3 converges of the second order, while x i +1 = x;/2 +(a + a/2)j(2x; + a/x,) converges of the third order. The Newton-Raphson method is applicable also to holomorphic functions of complex variables. Take simultaneous equations of two vari- ables,j(x,y) =0 and g(x,y)=O. If it is possible to transform the equations into x=ep(x,y) and 1126 y = ljI(x, y), and if loep/iJxI + loljl /oxl < 1, loep/oyl + I oljl/oy I < 1, then the iterative processes X i + 1 =ep(x"y;) and Yi+1 = ljI(x" y;) are applicable. In the Newton-Raphson method, let the correc- tions to the ith approximate values Xi and Yi be Xi and Yi' and solve {AXi'Yi)Xi+ fy(X"Yi)Yi= - f(xi,y;), gAXi,Y;)Xi+gy(Xi'Y;)Yi= -g(x"y;). Then we can take Xi+1 =Xi+Xi and Yi+1 = Yi +Yi as the next approximate values. This process is repeated until both Ix;\ and IYil are sufficiently small. More generally, the Newton-Raphson method for solving a system of n nonlinear equations /;(x 1 , ..., x n ) = 0, i = 1,2, ... , n, is defined by x(k+1) =X(k) _J(X(k))-l F(X(k)), k = 0,1,2, ..., where X(k)= (x\k), ..., Xk))" F(x) = (fdxj, "', x n ), ..' In(x 1 ,..., x n ))" J(x)=(o/;(x 1 ,..., xn)/ox j ) (the tJacobian matrix of F), and where x(o) is chosen appropriately [23]. E. The Bairstow Method Corresponding to a pair of complex roots IJ(::!:: ifJ of an algebraic equation f(x) = aoxn + a1 x n - 1 + ... + an = ° with real coefficients, there is a real quadratic factor x 2 + p*x + q*. The Bair- stow method (or Hitchcock method) obtains the coefficients p* and q* of this quadratic factor. To do this, first choose an appropriate qua- dratic factor x 2 + px + q as a candidate. Then through synthetic division by the quadratic factor, b j and c j are computed by means of the formulas bj=aj-pb j _ 1 -qb j - 2 , j=O, 1, ...,n, and Cj=bj-PCj-l-qCj_2, j=O,I,...,n-l, where b_ 1 =b_ 2 =c- 1 =c- 2 =0. By solving the simultaneous equations Cn-2P + cn-3q = b n - 1 , C n - 1 P+Cn-2q = b n , where C n - 1 =c n - 1 -b n - 1 , the quantities p and q are obtained. Then taking jj=p+p, q=q+q, we have x 2 + jjx + q as a new candidate. This operation is repeated until p and q are sufficiently small. Since this method corresponds to the Newton- Raphson method in the case of two variables, and, accordingly, the convergence is of the 
1127 second order, a choice of suitable initial values of p and q leads to rapid convergence. The key to this method lies in choosing p and q so that RI(p,q)=O and R 2 (p,q)=0, where RI and R 2 are such that R 1 X + R 2 is the remainder of f(x) divided by the trial quadratic factor x 2 + px+ q. This method was generalized by A. A. Grau (SIAM J. Appl. Math., 11 (1963)). Namely, when f(x) = (x 2 + px + q)g(x) + r(x) with r(x) = r)xk+1 +r2x\ the functions r 1 and r 2 can be used instead of R) and R 2 . The process cor- responds to the Bairstow method when k = 0, and to the McAuley method (SIAM J. Appl. Math., 10 (1962)) when k=n-2. F. The Durand-Kerner Method The Durand-Kerner (DK) method [12J for solving an algebraic equation f(z) =zn + a) zn- I + '" + an = ° (an =1= 0) with complex coefficients is an iterative method defined by (k+I) _ (k) f(Z\k)) z. -Z.- , 'njI.N1 (z!k)-zn' i=I,2,...,n, k=0,1,2,.... (1) A feature of this method is that it can deter- mine simultaneously aU the roots of f(z) =0. Let qJm(zj, ...,zn), m= 1,2, ...,n, be the mth elementary symmetric functions with respect to 2 1 , ...,2n: CfJm(Zl"",Zn) L ZilZi2...Zimo it <i 2 <. <im Set fm(z)=( -ltqJm(z),..., zn)-a m . Then cx= (cx 1 , ..., cx n )" a set of the roots of f(z) =0, is a solution of a system of the n equations fm(z) = 0, m = 1,2, .., ,n. I. O. Kerner [12J showed that the DK method is the Newton-Raphson method applied to the system of nonlinear equations fm(z) =0, m= 1,2,..., n. Therefore the speed of convergence of (1) is of the second order, provided that it converges. The initial values z\O), ... ,zO) for the DK method are usuaUy chosen as foUows: Let g(z) = f(z - adn) =zn+ C2Z n-2+... +c n and h(z)=zn-lc 2 Iz n - 2 -... -Icnl. If (C2'''' ,C n ) =1=(0, ...,0) (n;:' 2), then it can be shown that h(z)=O has exactly one positive root r, and all the roots of f(z)=O lie in the disk Iz+adnlr. Now, with a positive ro;:' rand {) = n/(2n), put a l [( 2(i-l)n ) 11 ] z!O)= --;+roex p n +0 v' -1 , i = 1, 2, ... , n. (2) Such a choice was proposed by O. Aberth [13]. Hence the process (1) together with (2) can be caUed the Durand-Kerner-Aberth (DKA) method. It is shown for the DKA 301 G Numerical Solution of Algebraic Equations method that, if r o is large enough, then Z\k) + a: = ( 1 _  y (z!O) + a: ). i = 1,2, ... , n, hold nearly for the first several steps. Thus the DKA method has a certain kind of global convergence property that renders it one of the most powerful methods for solving algebraic equations. A variant of the Durand-Kerner method is the Ehrlich-Aberth method [13, 14J; it is de- fined for i= 1,2, "', n, k=O, 1,2, ... by Z(k+1) , f(Zk)) f'(Z!k)) - f(Z\k)) Lj1.Ni 1/(z\k) - zt))' The speed of convergence of this method is of the third order. Further variants of the Durand-Kerner and the Ehrlich-Aberth methods have been proposed by M. Iri et al. [15J and A. W. M. Nourein [16]. =Z\k) G. The Dejon-Nickel Method Let f(z) be a polynomial of degree n with complex coefficients, and take Zo such that j(zo) =1=0. Then we can write f(zo+h)= f(zo){1 +b i h i +b i +)h i +1 +... +bnh n } (bi=l=O) = f(zo){1 +b i h i (1 +O)}, {)=o(h). Now, choose h sufficiently smaU so that Ibihil < 1,101 < 1, and arg(bihi)=n. Then 11 + bihil = l-Ib i h i l and IbihiOI<lbihil so that If(zo+h)1  1 f(zo)I{I1 +bihil + Ibihi{)l} < If(zo)l. Hence, if 20 is chosen so that min If(z)1 = If(zo)l, then we must have f(zo) =0. This is the out- line of Cauchy's existence proof for the roots of algebraic equations. The Dejon-Nickel method [17J is a method in which h is chosen as foUows: h =( -1/b k )l/\ where min {IIM 1Ij l b j =1= 0, ij n} = 11/bd 11k (=r,say). If several such k exist, take the smallest one. The branch of the multivalued function (-I/b k )llk is chosen such that zllk is positive for z positive. If If(zo + h)1  (1-1;)lf(zo)1 with a preassigned constant I; such that 0< I; < I, then put z) = Zo + h. If the inequality does not hold, then for each integer m;:' 1 choose the small- est integer 1= l(m) such that max {I bjl(r/2m)j I ij n} =lb/l(r/2 m )/, and put 2m =zo + (r/2m) (-lbd/b/)II/. Find the smallest integer m;:' 1 such that If(zo)1  (1 - 2 -1 (r/2 m il bd)lf(zo)l, and put 
301 H Numerical Solution of Algebraic Equations ZI =zm' By continuing this process, a sequence {Zj} is constructed such that If(zo)1 > If(zl)1 > If(z2)1> .... It converges toward some root of the equation f(z) =0. S. Hirano has pro- posed a similar method. H. Methods for Finding Good First Estimates of Roots Some of the principal methods for obtaining good first approximate values of the roots are given in the following sections I-N. I. Matrix Methods The problem of solving an algebraic equation f(z) =zn + a 1 zn-1 +... + an=O with complex coefficients is equivalent to that of finding the eigenvalues of the companion matrix r a . -a2 -a 3 -Qn-1 al 0 0 0 :" A= 0 1 0 0 0 0 0 Therefore numerical methods for solving the eigenvalue problems for non symmetric ma- trices are applicable ( 298 Numerical Com- putation of Eigenvalues). However, it should be remarked that this might be inefficient because most of the elements of the matrix A are zero [18]. J. The Bernoulli Method In the Bernoulli method the iterative formulas SI = -aI, Sk= -(al Sk-1 +a 2 S k - 2 +... + a k - 1 S 1 +kad (k=2,3, ...,n), Sk= -a 1 S k - 1 -a 2 S k - 2 - ... - anSk-n (k = n + I, n + 2,...) are repeatedly applied to an algebraic equation f(x) = x n + a 1 x n - 1 + a 2 x n - 2 +... + an =0. When the roots of f(x)=O are 0(j,0(2,'" ,O(n (10(11  10(21 ...  100nl), Sk/Sk-1 --+0(1 if 0(1 is a real and simple root and 10(11) 10(21. When O(r. 0(2 are complex roots, we put 0(1 = Re ie , 0(2 = Re- ie . If 10(31 < R, then we have S-SH1Sk-1 2 2 --+R, Sk-1 - SkSk- 2 Sk S k-1 -Sk+1 S k-2 0 2 --+ 2R cos , Sk-l - SkSk- 2 and hence the two roots of x 2 - (2R cos O)x + R 2 =0 are 0(1 and 0(2' Sk is the sum of the kth powers of the n roots of the equation, 0(1,0(2, "',O(n' C. Lanczos used xj'(x)/f(x)=n+S 1 /x+ S2/X2 + ... + Sk/Xk + .., to compute Sk [10, pp. 26-30]. Let the eigenvalues of the com- panion matrix A be AI, A2' "', An (IAr!  IA21  1128 ...  I An I). Then the Bernoulli method is the same as the tpower method for obtaining the maximum characteristic value AI ( 298 Nu- merical Computation of Eigenvalues C). K. The Lehmer Method The Lehmer method [9] is a general method for finding the n roots of an algebraic equation with complex coefficients f(z) = a o + a 1 z+ a2z2 +... + anz n =0 in the complex plane by repeating the follow- ing procedure: First, draw a circle whose cen- ter is at the origin and whose radius is R = r2- e , where r is an arbitrary given number and 0 an arbitrary given integer. Then, utiliz- ing the Lehmer theorem, observe whether a root 0( of f(z) = 0 lies inside the circle. If there is no such root, replace R with 2R, whereas if such a root exists, replace R with R/2. Con- tinue this process until an R for which a root 0( exists within the annulus R < Iz 1< 2R is obtained. Second, draw circles with the centers /3k = (5R/3)exp(i2nk/8) (k =0,1, ...,7) and com- mon radius J1=(5R/3)/2 and cover the annulus obtained by the first step. Then find out which of the circles has the root rJ. in its interior. If 0( is in the interior of the circle for k = j, the origin is shifted to /3j and the operation is started again from the first step. If R 1 satisfy- ing R 1 < I z - /3) < 2R 1 is obtained by the first step, we have Rl «5/12)R. Therefore, when the first step is repeated N times, the root 0( is confined in a small circle whose radius is smaller than 2(5/12tR. Then the center /3 of this small circle gives a good approximate value of 0(. L. The Downhill Method The downhill method is a method for obtain- ing the extreme values of a function of many variables. It is also applicable in obtaining approximate solutions of a system of equa- tions. Let us consider the downhill method in the case of two variables. The problem of obtaining the real roots of simultaneous equa- tions f(x,y) =0 and g(x,y)=O can be reduced to the problem of obtaining the coordinates (0(, /3) which give the extreme value 0 of <I>(x, y), where <I>(x, y) = r 2 + g2. The values of <I>(x, y) are calculated at 3 2 points obtained by the combination of x = XI, XI i: h; y = Yl' Y1 i: h, where (x l' Y d are arbitrary approximate values of (ex,) constituting the centers of those sets of points, and h is the given step size. Utilizing the values of the function at the 3 2 points, 
1129 <I>(x, y) is approximated by a quadratic surface b o + b l x+ b 2 y+ b l d3x 2 - 2) + b 22 (3 y 2 - 2) + b I2 XY. The values of b o , bi' b 2 , b 1 j, b 22 , b 12 are cal- culated by the tmethod of least squares, and the center (xi, yj) of the approximate quadra- tic surface is obtained. Then the first approxi- mations XI and YI are replaced by the sec- ond approximations x, +xj and y, + yj. This process is repeated while the step size h is made appropriately smaller. This method is an improvement on the method of successive experimental planning used by G. E. P. Box and K. B. Wilson (1954) to obtain optimum conditions in the exploration of response curved surfaces. By using another minimization technique, 1. A. Grant and G. D. Hitchins [21J gave an ever-convergent algorithm for determining initial approximations to the roots of algebraic equations with real coefficients. The prac- tical implementation of this method is given in J. A. Grant and G. D. Hitchins (Comput. J., 18 (1975)). M. Continuation Methods Let F(x)=(f, (x),... In(x))'=O (x=(Xi''''' xn))t be a system of n equations. Suppose that no reasonable approximation for a solution exists. Then, take arbitrary x(O) and define a one- parameter family of equations H(x, t)F(x)-(I- t)F(x(O») =0, O";;t";; 1. Suppose that for each t the equation has a solution x(t) which depends continuously on t. Observe then that x(O) = x(O) and x(1) is a solution for the equation F(x)=O. Partition the interval [0, 1 J by the points 0 = to < t 1 < ... < t N = 1. First, solve H(x, t I) =0 by some itera- tive method using x(O) as a first approximation. Let x(J) be a solution thus obtained. Next, solve H (x, t 2) = 0 by some iterative m<:,;wu using x(1) as a first approxim3!:;:',i, and so on. Finally, solve H(x, t N ) =0 by some iterative method which uses X(N-l) as a first approxima- tion. Then X(N), a solution thus obtained, can be used as a first approximation in an itera- tive method applied to the equation F(x)=O. This method is called a continuation method [22-24]. As another approach, differentiate (3) with respect to t. Then J(x(t))x'(t) + F(x(O») =0, where J(x(t)) is the Jacobian matrix of F, evaluated at x=x(t). Hence x(t) is the solution of the system of ordinary differential equations x'(t) = -J(X(t))-1 F(x(O»), 0,,;; t";; 1, 301 Ref. Numerical Solution of Algebraic Equations subject to the initial condition x(O) = x(O), pro- vided that J(x(t)) is nonsingular. Therefore numerical solution at t = 1 gives a good ap- proximation for a solution of F(x)=O. This is called Davidenko's method of differentiation with respect to a parameter. These methods can be used to find initial approximations of the Durand-Kerner and the Ehrlich-Aberth methods. They are also applicable to a single algebraic equation J(x) =0, which is the special case n = 1. N. Other Methods To obtain the first approximate values of the roots of an algebraic equation, the Graeffe method has been used, in which the roots of the equation are separated by successively forming an equation whose roots are the squares of the roots of the preceding equation. In computer calculations, however, the other methods described in previous sections are more convenient than the Graeffe method. The Lanczos method [1 OJ is a method in which g(x), an approximate function of J(x), is obtained by the process of approximating functions, and the roots of g(x) = 0 are taken as approximate values of the roots of J(x) = O. The Garside-Jarratt-Mack method [25J is a modification of the LanclOs method and ap- proximates J(x)/f'(x) by a rational function g(x)=(x-a)/(b+c x ). O. Error Bounds for Computed Solutions (3) Let Z I' ..., Zn be computed solutions of an algebraic equation J(z)=O which were ob- tained by some method. If z I, ... ,Zn are dis- tinct, then the following result due to B. T. Smith [26J is quite useful for estimating the errors of Zi: Let r ' l _ I ' = nIJ(zi)1 j" Z Zj -....;:: fi n rrjI.Hi IZi- zjl Then the union of the disks f i contains all the roots ()(" ..., ()(n of J(z) =0. Any connected component of U7'1 f i , which consists of just m disks f i , contains exactly m roots of J(z) = O. Hence if r i n Ij = 0 for every j =I i, then I ()(i- zil,,;; Ii' Smith obtained a more general result for the case where z" ..., Zn are not necessarily distinct. References [IJ J. F. Traub, A class of global convergent iteration functions for solution of polynomial equations, Math. Comp., 20 (1966), 113-138. [2J B. Dejon and P. Henrici (eds.), Construc- 
302 A Numerical Solution of Linear Equations tive aspects of the fundamental theorem of algebra, Wiley- Interscience, 1969. [3J D. E. Muller, A method for solving alge- braic equations using an automatic computer, Math. Tables Aids Comput., 10 (1956),208- 215. [4J T. Torii and T. Miyakoda, A practical root-finding algorithm based on the cubic Her- mite interpolation, Information Processing in Japan, J3 (1973),143-148. [5J E. T. Whittaker and G. Robinson, Cal- culus of observations, Van Nostrand, 1924. [6J L. B. Rail, Computational solution of nonlinear operator equations, Krieger, 1979. [7J A. M. Ostrowski, Solution of equations in Euclidean and Banach spaces, Academic Press, 1973. [8J M. Urabe, Error estimation in numerical solution of equations by iteration process, J. Sci. Hiroshima Univ., ser. A., 26 (1962), 77-91. [9J M. Urabe, Component-wise error analysis of iterative methods practiced on a floating- point system, Mem. Fac. Sci. Kyushu Univ., 27 (1973), 23-64. [IOJ C. Lanczos, Applied analysis, Prentice- Hall, 1956. [11 J S. Hitotumatu, A method of successive approximation based on the expansion of second order, Math. Japonicae, 7 (1962), 31- 50. [12J I. O. Kerner, Ein Gesamtschrittverfahren zur Berechnung der Nullstellen von Polyno- men, Numer. Math., 8 (1966),290-294. [13J O. Aberth, Iteration methods for finding all zeros of a polynomial simultaneously, Math. Comp., 27 (1973), 339-344. [14J L. W. Ehrlich, A modified Newton method for polynomials, Comm. ACM, 10 (1967),107-108. [15J M. Iri, H. Yamashita, T. Terano, and H. Ono, An algebraic-equation solver with global convergence property, Pub!. Res. Inst. Math. Sci., 339 (1978), 43-69. [16J A. W. M. Nourein, An improvement on two iteration methods for simultaneous deter- mination of the zeros of a polynomial, Int. J. Computer Math., 6 (1977), 241-252. [17J B. Dejon and K. Nickel, A never failing, fast convergent root-finding algorithm, Con- structive Aspects of the Fundamental Theo- rem of Algebra, B. Dejon and P. Henrici (eds.), Wiley-Interscience, 1969, 1-35. [18J D. M. Young and R. T. Gregory, A sur- vey of numerical mathematics I, Addison- Wesley, 1972. [19J D. H. Lehmer, A machine method for solving polynomial equations, J. ACM, 8 (1961), 151-162. [20J O. L. Rasmussen, Solution of polynomial equations by the method of D. H. Lehmer, BIT, 4 (1964),250-260. 1130 [21J J. A. Grant and G. D. Hitchins, An always convergent minimization technique for the solution of polynomial equations, J. Inst. Math. App!., 8 (1971),122-129. [22J J. M. Ortega, Numerical analysis, a sec- ond course, Academic Press, 1972. [23J J. M. Ortega and W. C. Rheinboldt, Iterative solution of nonlinear equations in several variables, Academic Press, 1970. [24J R. Wait, The numerical solution of alge- braic equations, Wiley, 1979. [25J G. R. Garside, P. Jarratt, and C. Mack, A new method for solving polynomial equations, Comput. 1.,11 (1968),87-89. [26J B. T. Smith, Error bounds for zeros of a polynomial based upon Gerschgorin's theo- rems, J. ACM, 17 (1970), 661-674. [27J A. S. Householder, The numerical treat- ment of a single nonlinear equation, McGraw- Hill, 1970. [28J J. H. Wilkinson, Rounding errors in alge- braic processes, HMSO, 1963. [29J J. H. Wilkinson, Algebraic eigenvalue problem, Oxford Univ. Press, 1965. [30J P. Rabinowitz (ed.), Numerical methods for nonlinear algebraic equations, Gordon & Breach, 1970. 302 (XV.4) Numerical Solution of Linear Equations A. Condition of Linear Systems The solution of the system of linear algebraic equations n I aijx j = bi; i = 1,..., n, aij, b,real, (1) j1 which may be written in the matrix form Ax=b; A = (aij), X = (Xi), b=(b i ), (1') is expressed as quotients of determinants by tCramer's rule. In practice, however, this form of the solution is of little value for numerical computation, because the direct evaluation of the determinants involves (n + 1 )!(n - 1) multi- plications which, even for a moderate-sized system, amount to a prohibitive number of arithmetic operations to be executed, even with modern high-speed computers; besides, it requires high-precision calculation. There are a variety of practical methods of solving efficiently the system (1) with finite-precision calculation. These numerical methods are generally divided into two classes: direct meth- ods and iterative methods. 
1131 Whatever method is used, inherent diffi- culties are encountered when the solution of the system is unstable. The instability is usu- ally measured by the condition number of the coefficient matrix, which is defined by condA=al/a n (= 1lAIIIIA -III  I), (2) where the a i (a 1 a2 ... an O) are the non- negative square roots of the teigenvalues of A t A (called the singular values of A), II A II = max{IIAxll/llxll :x#O}, and Ilxll =. This definition is valid also for an m x n matrix A (mn). The condition number condA satisfies IIt5xll  condA ( + IIt5bll ) Ilxll '" IIt5AII IIAII Ilbll' 1 --cond A IIAII where Ax=b, (A +M)(x+t5x)=b+t5b, and II<5AIIIIA -111 < 1. As the condition number increases, solution processes become more susceptible to errors. If cond A is large, the system is called ill-conditioned. 8. Direct Methods A direct method is one that yields an exact solution in a finite number of arithmetic oper- ations if they are performed without roundoff error. Among the existing direct methods, the one known as Gaussian elimination with pivot- ing, which is based on systematic elimination of unknowns of the equation (1), is found to be the best with respect to time or accuracy. Its dependability was re-established by means of backward error analysis ( 138 Error Analy- sis; [1-3J). The method generates successive vectors b(k) = (b i k ) and matrices A(k)=(at), typi- cally of the form * * * * * * * A(2)= 0 * * * 0 * * * 0 * * * when n = 5 and k = 2. At the kth stage, a pivo- tal element at- 1 #0 (i,jk) is chosen in one way or another. Then the ith and kth rows and thejth and kth columns of A(k-I) are interchanged so that at-I becomes atk' The ith and kth elements of b(k-I) are also inter- changed. The pivot atk is then used to elimi- nate all the nonzero entries in its column below the diagonal as follows: at=at- I , i=1,...,k, j=1,...,n; bik=bik-1, i= 1, ...,k; at=ar 1 -(afk- 1 /atk)at j , i = k + 1, .. . ,n, j = k, . .. , n; b i k = b i k - I - (afk- 1 latdbk\ i = k + 1, ... n. 3028 Numerical Solution of Linear Equations Starting with A (0) = A and b(O) = b, a system of linear equations with an upper-triangular coefficient matrix, A(n-I) x* = b(n-I), is produced at the stage k = n - 1, where x* is a permutation of x, caused by interchanging columns. This part of the process is called forward elimination. If at- 1 =0 for all i,jk at some stage, then A is a singular matrix of trank k - 1 and the system (1) admits infinitely many solutions if b i k - I =0 for i=k, ...,n, and no solution otherwise. If this is not the case, A is a nonsingular matrix with det A = a1 a:;:l .. a;', and the solution of (1) is given by Xt= ( b;-I- I ak-1xt )l a?-1 k=i+l for i = n, n - 1, ... , 1, which is called back sub- stitution. Taking the pivot to be an element of the largest absolute value among column elements afk- I (ik) at each stage is called the partial pivoting strategy. In complete pivoting, the pivot is taken to be an element of the largest absolute value among at- 1 (i,j  k). These pivotings are introduced to prevent loss of accuracy due to rapid growth of elements of successive A(k). Although a smaller bound for the growth factor is obtained for complete pivoting, in practice partial pivoting appears to be entirely adequate. If rows or columns are not interchanged in the process, forward elimination effectively produces a factorization of A into the product of a lower triangular matrix L and an upper triangular matrix U, i.e., A=LU, (3) where L has unit diagonal elements and U = A(n-I). This factorization is computed directly by the Doolittle method without calculating the intermediate A(k). The Crout method also produces a similar factorization (3) in which U has unit diagonal elements. The Cholesky method determines a similar factorization (3) of a tpositive definite matrix A, in which U = L'. Once the triangular factorizations (3) are formed, the sol ution of the system (1) is deter- mined by solving the two triangular systems Ly=h and Ux= y successively. The number of multiplicative operations required for these factorizations are about n 3 /3 for forward elimination, Dootittle's method, and Crout's method, and about n 3 /6 for Cho- lesky's method. The solution of each triangu- lar system requires about n 2 /2 multiplicative operations. Special properties of A, such as a banded structure, can be exploited to reduce the number of operations and memory re- quirements considerably [4]. Gauss-Jordan elimination is similar to Gaussian elimina- 
302C Numerical Solution of Linear Equations tion, but the elements above the diagonal are also eliminated to dispense with back sub- stitution. However, it requires about n 3 /2 multiplications. Modern computers can easily handle problems of size n = 100 by these direct methods. C. Iterative Methods An iterative method is a dynamical process that generates a sequence of approximations {Xk} converging to the exact solution. At each step of the iteration, an improved approxima- tion is obtained from the previous ones. The accuracy of the solution depends on the num- ber of iterations performed. Most iterative methods retain the coefficient matrix in its original form throughout the process and hence have the advantage of requiring minima! memory. They are suitable for solving large sparse systems arising in finite-dimensional approximations to tpartial differential equa- tions, where A is sparse if most of its elements are zero. Linear stationary iterative processes are the most frequently used iterative methods. A method in this class is written in the form X k =X k - 1 +R(b-Ax k - 1 ), k= 1, 2, .", (4) where R is chosen to approximate the inverse of A. Usually, Rand b-Ax k -' are not ex- pressed explicitly in the actual algorithms. If the tspectral radius of the iteration matrix I- RA is less than one, the method converges to the solution for an arbitrary initial ap- proximation XO [5]. The key matrix R can be chosen quite freely as long as this condition is satisfied. In the Richardson method, R = ()(A" 0<()«2/IIAI1 2 R=(A +E)-1 is the usual choice, in which E is a perturbing matrix that makes A + E easily invertible. In the Gauss- Seidel method and the Jacobi method, R is chosen to be the inverse of the lower triangular and the diagonal sub matrices of A, respec- tively, i.e., R=(L+D)-1 and R=D- 1 , where L = (aij) (i> j) and D =(a;J Direct methods can be combined with the iterative method (4) to obtain an approxi- mate inverse R. In the course of factorization by a direct method, an artificial perturbation E is introduced to produce an incomplete factorization, [a=A+E, so that [ and a are of low tcomplexity of computation. A combined method is then constructed by putting R= a-I [-I. Even if E is not added intentionally, each of the di- rect methods actually produces an incomplete factorization (5), in which E is a matrix of 1132 small elements accounting for the effect of roundoff errors [1-4]. In this case, the com- bined method is called the iterative improve- ment of the direct method. If applied to a first solution XO = a-I L -1 b, it produces more accu- rate solution in a few steps with only a mod- est increase in computation time, when the system is not too ill-conditioned [4]. It is essential, however, that the residual b - Axo be computed with higher precision. Various convergence criteria have been established for a variety of methods [5,6]. For example, the Gauss-Seidel method is conver- gent for a symmetric positive definite matrix A. The smaller the spectral radius, the faster the convergence of the method. In general, a larger amount of computation is required in each iteration to get faster convergence. If the spec- tral radius p(l-RA) is close to one, the con- vergence is slow, and an acceleration of the process is needed. SOR (successive over- relaxation) is an accelerated version of the Gauss-Seidel method, in which Rw = (L + w -1 D) -1 and the optimum acceleration parameter w (0 < w < 2) is chosen to minimize p(l- Rw A ) ( 304 Numerical Solution of Partial Differential Equations; [6]). There is an adaptive acceleration method [7], which, if applied to a scalar sequence, reduces to Aitken's t5 2 -method. D. The Conjugate Gradient Method The conjugate gradient (CG) method is a non- linear stationary iterative method for solving a system with a symmetric positive definite coefficient matrix. The method generates {Xk}, {r k }, and {pk} by means of the formulas ()(k = (r k , rk)/(p\ Apk), Xk+1 =Xk+()(kpk, rk+1 =b-Axk+1 =rk-()(kApk, (6) (5) fJk =(rk+r, rk+I)/(r\ r k ), pk+1 =rk+1 +fJkpk, where XO is arbitrary and pO = rO = h - Axo, (r,s)=s'r. The CG method shares a feature with the direct method. In theory, {Xk} converges to the solution in less than n steps, and the pi are mutually conjugate, i.e., (pj)'Api = 0 (i #j). When Hestenes and Stiefel [9] proposed the method in 1952, they created a great sensation because of the method's theoretical elegance. The CG method turned out, however, to be highly sensitive to roundoff errors. In practice, nice theoretical properties, such as finite ter- mination, do not hold in the presence of error. Recently, the CG method has regained its 
1133 popularity as an iterative method for solving large sparse systems. The iteration (6) is usu- ally restarted periodically to accelerate the convergence. It may converge in a small num- ber of steps, as it takes advantage of the distri- bution of eigenvalues of A adaptively. The CG method is most effective when it is used to accelerate linear stationary iterative methods or when it is applied to the preconditioned system [,-IA([,-I)'y=L -Ib, x=(L -l)t y , (7) where [, is computed by the incomplete Cho- lesky factorization ['f' = A + E. The matrix [,-' A([,-')t is not to be formed explicitly in the CG method. E. Linear Least Squares Problem Let Ax = b be an overdetermined system of linear equations, where A is an m x n matrix (m?n). The linear least squares problem is to find the x that minimizes the Euclidean norm lib-Axil. We assume here that A is of rank n. The most straightforward method for solving the problem is to apply the Cholesky method to the normal equation (A'A)x=A'b. This may result in ill-conditioning of the sys- tem, since condAtA =(condA)2. III-conditioning can sometimes be avoided by forming the factorization A = LV by Gaussian elimination, where L is an m x n lower trapezoidal matrix and V is an upper triangular matrix. The least squares solution is then obtained by solving successively the systems VLy=Vb and V x = y. Another approach to avoiding ill- conditioning is based on orthogonalization. The modified Gram-Schmidt orthogonaliza- tion method produces in an effective manner the factorization A=QV, where Q is an m x n matrix whose columns are a set of torthonormal vectors and V is an upper triangular matrix [10,11]. The least squares solution is obtained by solving Vx= QI b. A seq uence of Householder transforma- tions, P k =1- 2Wk wUllwkl1 2 , can produce the factorization PnP n -,... P 2 P I A =(). where V is an n x n upper triangular matrix [12]. The least squares solution is obtained by solving V x = b, where n is formed by the first n elements of the vector PnPn-, ... P 2 PI b. A 302 Ref. Numerical Solution of Linear Equations similar factorization can also be produced by using a sequence of Givens transformations, which have the advantage of exploiting the sparseness of A. The matrices V in (9) and (10) are unique and coincide with the transpose V of the lower triangular matrix L of the Cho- lesky factorization, AlA = LV, of AlA. The multiplicative operations required for the factorizations (9) and (10) are about mn 2 and mn 2 - n 3 /3, respectively. When the rank of the matrix is unknown, we can determine an "effective rank" p of A, based on the singular value decomposition (SYD): A = V D V" (8) where Vand V are torthogonal matrices of dimensions m and n, respectively, and D is an m x n diagonal matrix whose diagonal ele- ments d ii are the singular values (Ji of A [13]. If singular values smaller than (Jp can be ignored, the approximate least squares solution is given by x P = V f5; VI b, where the ith element dE of the diagonal matrix f5; is (Ji- I if ip and 0 otherwise. These methods certainly solve the system (1) but generally require more computational work than those based on the triangular fac- torization. The least squares solution is com- puted also by applying the CG method (6) to the normal equation (8), in which AlA is not to be formed explicitly [9]. Iterative methods for solving linear systems with singular and rect- angular coefficient matrices are characterized in terms of the trange of RA and the tnull space of AR [15]. Besides the methods discussed here, numer- ous other methods have been proposed for solving the system (1) [16-18]. Recently, vari- ous sparse techniques have been developed for direct methods to control the growth of non- zero entries (fiU-in) in the process of matrix factorization [19]. In general, direct methods have an advantage over iterative methods; however, their relative computational effi- ciency varies according to the scale, sparsity, and type of the coefficient matrix and to the available computational devices. (9) References (10) [1] J. H. Wilkinson, Error analysis of direct methods of matrix inversion, J. ACM, 8 (1961), 281-330. [2] 1. H. Wilkinson, Rounding errors in alge- braic processes, Prentice-Hall, 1963. [3] 1. H. Wilkinson, The algebraic eigenvalue problem, Oxford Univ. Press, 1965. [4] G. E. Forsythe and C. B. Moler, Computer solution of linear algebraic systems, Prentice- Hall,1967. 
303 A Numerical Solution of ODEs [5J R. S. Varga, Matrix iterative analysis, Prentice-Hall, 1962. [6J D. M. Young, Iterative solution of large linear systems, Academic Press, 1971. [7] K. Tanabe, An adaptive acceleration of general linear iterative process for solving a system of linear equations, Proc. Fifth Hawaii Int. Conf. System Sci., (1972),116-118. [8J C. LanclOs, An iteration method for the solution of the eigenvalue problem of linear differential and integral operators, J. Res. Nat. Bur. Standards, sec. B, 45 (1950), 255-282. [9J M. R. Hestenes and E. Stiefel, Methods of conjugate gradients for solving linear systems, J. Res. Nat. Bur. Standards, sec. B, 49 (1952), 409-436. [IOJ F. L. Bauer, Elimination with weighted row combinations for solving linear equations and least squares problems, Numer. Math., 7 (1965),338- 352. [11J A. Bjorck, Solving linear least squares problems by Gram-Schmidt orthogonaliza- tion, BIT, 7 (1967),1-21. [12J G. H. Golub, Numerical methods for solving linear least squares problems, Numer. Math., 7 (1965),206-216. [13J G. Golub and W. Kahan, Calculating the singular values and pseudoinverse of a matrix, SIAM 1. Numer. Anal., 2 (1965), 205-224. [14J C. L. Lawson and R. J. Hanson, Solving least squares problems, Prentice-Hall, 1974. [15J K. Tanabe, Characterization oflinear stationary iterative processes for solving a singular system of linear equations, Numer. Math., 22 (1974), 349- 359. [16J D. K. Faddeev and V. N. Faddeeva, Computational methods of linear algebra, Freeman, 1963. (Original in Russian, 1963.) [17J G. E. Forsythe, Solving linear algebraic equations can be interesting, Bull. Amer. Math. Soc., 59 (1953), 299-329. [18J J. R. Westlake, A handbook of numerical matrix inversion and solution of linear equa- tions, Wiley, 1968. [19J J. K. Reid (ed.), Large sparse sets of linear equations, Academic Press, 1971. 303 (XV.8) Numerical Solution of Ordinary Differential Equations A. General Remarks Applications of the numerical solution of tordinary differential equations include tinitial value problems, tboundary value problems, 1134 and teigenvalue problems. Although solution by an tanalog computer is among the numer- ical methods in the wide sense, we usually mean by "numerical solution" a solution ob- tained by approximating a problem of infinite degrees of freedom and a continuous variable by one of finite degrees of freedom. The ap- proximation of an arbitrary function by a linear combination of a given finite system of functions is an example ( Section I). Succes- sive substitution and power series expansion are also used but have no particular advan- tages over other methods. Among various methods of approximation, the so-called dif- ference methods (or discrete variable methods) are the most flexible and have the largest field of application. A difference method reduces a given problem to an approximate problem in which we deal only with a systematically chosen set of discrete values of the variable and with values of the unknown functions at the chosen points. A difference method is usually carried out by a simple iterative com- putation, so that it is suitable for tdigital com- puters. We confine ourselves almost exclu- sively to difference methods [1-7]. B. Initial Value Problems To solve numerically general initial value problems, it suffices to consider the problem of determining numerically the values on an interval [a, b J of m functions yi(x) (i = 1, ... ,m) that satisfy a system of ordinary differential equations of the form yi'(x) = Pix, i (x), ..., ym(x)) and the initial conditions yi(a) = I/i (i = 1, '" ,m), where the t are given functions with appropriate smoothness, and a, b, and I/i are given constants. We define the mesh points x n = a + nh (n = 0, 1, ... ), call h the step size, and determine numerically the values y approxi- mating i(x n ). Assuming that the y were computed with infinite accuracy, i.e., without troundoff error, we call e==y- yi(x n ) the global truncation error (or global discretization error), while we call r == y - y the glo bal roundoff error, where the y are the values we actually have by means of finite-accuracy computation. On the other hand, if the assumption holds locally that the solutions at the previous steps are exact, we call the e and r the local truncation error (or local discretization error) and the local roundoff error, respectively. To avoid confusion, we denote the local truncation error by t. The numerical solution must have the property that, for every XE [a, b J, limh_o.x":xe = 0, which is called the condition of conver- gence. In particular, ift=O(hP+I) (xn=x,h--+ 0, where O(h P + 1 ) is Landau's symbol), then p 
1135 is ca1led the order of the solution process. We sha1l show the typical methods of numer- ical solution, using the abbreviation in i for f'(x., y). C. Overall Approximation If we rewrite the differential equations on the interval [x., x n + p ] as y'(x n + q ) - yi(X n ) = J;;+q f'(x, yj(x)) dx, q = 1, ... , P, and substitute suitable Inumerical integration formulas for the integrals on the right-hand side, then we have the overall approximation formulas: P y+q-y=h I Cq'/nih' q=I,...,P, r=O from which we can obtain y+" ... ,y+P when the y are given. Since ji+, contains y+" these equations are nonlinear in y+q, so that we have to use, for example, the iterative proce- dure of starting from suitable Oth approxima- tions and adopting as the lth approximations of the y+q values computed from the overall approximation formulas by substituting the (1- l)th approximations for the y+, in the f:h on the right-hand sides. These formulas are often used to determine a set of starting values for a multistep method. The truncation error of the formulas depends on that of the numer- ical integration formula substituted for the integral. A few examples: P= 1, (CIO,C I d= (1/2,1/2) (error h 3 yi(3)(x n )/12 + O(h 4 )); P = 2, (c,o,cll,cnJ=(5/12,8/12, -1/12) (error _h 4 y i(4)(X n ) + O(h 5 )), (c 20 , c 2l , c 22 ) =(1/3, 4/3, 1/3) (error h 5 y'(5)(X n ) + O(h 6 )); P = 3, (c I 0, c I " c 12 ,c 13 )=(9/24, 19/24, -5/24,1/24), (C 20 'C 2 ', czz,c 23 )=(1/3,4/3, 1/3,0), (C30,C31,C3Z,C33)= (3/8,9/8,9/8,3/8). D. Runge-Kutta Methods By a general Runge-Kutta method we mean a method of determining yb = lJi, y, y,... suc- cessively by means of a formula y+ I - y = h<I>i(X., y; h), where the functions <I>i are de- fined in terms of parameters cx, and #" as <I>i (x, yj; h) = L;o cx,k:, kb = f'(x, yj), k: = f'(x + #,ohj + h(#,o - L;=I #,s)k6 + hL;=1 #"kl) (r = 1, "', P). A Runge-Kutta method is called ex- plicit if #" = ° for r";; s, and implicit otherwise. In the latter case, if {J" = ° for r < s, the method is ca1led semi-explicit (or semi-implicit). Unless otherwise stated, the Runge-Kutta methods considered below are assumed to be explicit. In order to proceed one step from y to y+1 we have to compute each function f' (P+ 1) times. Hence this is ca1led the (P + 1 )-stage method. If we denote by Zi(t) the solutions 303D Numerical Solution of ODEs of Zi'(t) =fi(t, zj(t)) with the initial condi- tions Zi(X) = y' and put ht-..i(x, yj; h) = Zi(X + h)- Zi(X), then we have <I>i(x,yj;h)-t-..i(x,yj;h)= hpq;i(X,yj)+O(h P + I ), where the q;i(X,yj) are expressible in terms of the ['(x, yj) and their (partial) derivatives. Various Runge-Kutta methods have been devised by searching for values of the cx, and #" that make p as large as possible for a given p (p is called the order of the method). When searching for these values of cx, and {Jm we usua1ly impose at least one of the fo1lowing conditions: (1) cx, and #" are simple; (2) the truncation error is small; (3) the region of absolute stability ( Section G) is large; (4) cx, and #,s give sma1ler roundoff errors; (5) only a small computer memory is required. The ordered pairs (P, p), where p is the highest order that can be attained by the (P + I)-stage method, are (P";; 3, P + 1), (4,4), (5,5), (6,6), (7,6), (8,7), (9-10,8), (p+ 2";;(P+ 1)";;(1/2) (pz-2p+4), p9). The following formulas are frequently used since they are accurate and have simple parameters: for P= 1 and p = 2, the formulas with CX o = 0, cx I = 1, #10= 1/2 (modified Euler method), with CX o = cx I = 1/2, #'0 = 1 (improved Euler method) and with CX o = 1/4, cx I = 3/4, #10 = 2/3 (Ralston's second-order method); for P=2 and p=3, the formulas with CX o = 2/9, cx I = 1/3, CX z = 4/9, #10= 1/2, #20=3/4, #21 =3/4 (Ralston's third- order method), and with CX o = 1/4, cx I = 0, CX 2 = 3/4, #t 0= 1/3, {Jzo = 2/3, #ZI = 2/3 (He un's third-order method). For P=3, p=4, the for- mula with CX o = CX 3 = 1/6, cx I =CX 2 = 1/3, #10 = #20 = #21 = 1/2, #30 = #32 = I, #31 = ° is we1l known and is frequently referred to as "the fourth- order Runge-Kutta method" or "the Runge- Kutta method." It has various desirable fea- tures [8]. Gill's modification of the classical Runge-Kutta method (sometimes ca1led the Runge-Kutta-Gill method) has some advantage in regard to roundoff errors and to the neces- sary computer memory size, while Ralston's secondo, third-, and fourth-order methods have minimum error bounds in Lotkin's sense for the local truncation error [7]. There exist "substantia1ly fifth-order" methods for P = 4 [9]. Today we frequently prefer to use higher- order methods, especia1ly fifth-order methods, instead of lower-order methods, because the former require less computation time for solu- tions of given accuracy [1OJ. For the estimation of the local truncation error, the fo1lowing two practical techniques are well known: (i) one-step-two-half-steps error estimate. If we let y, and y+1 denote the approximations that are computed with an mth-order method by taking two half-steps and one full step (= h n ), respectively, then an estimate of the local truncation error per half-step associated with y:'1 is given by the 
303E Numerical Solution of ODEs formula t+l y(hn/2)m+l (Yl - Y+I1/2(1- 2 m ), where we assume that y does not vary much over the interval [xn, xn+lJ. (ii) The formula of embedding form. The following formula of embedding form is more effective than the traditional method above. By use of the pth- order method Y+l = Y + Lo cx,k; and the (p+ 1)th-order method Yl = Y + L=o cx: k;, we obtain an estimate of the local truncation error in Y+l as t+l  Y+l - Yl [I 1-13]. At present, formulas of this type are known for p = 2-8, \vhere the method for p == 5 is said to be the most efficient. Recently, a similar method for the estimation of the global truncation error has been investigated [14]. In addition, multistep generalizations of explicit Runge- Kutta methods, which are known as pseudo- Runge-Kutta methods, have also been inves- tiga ted [15]. E. Multistep Methods A linear multistep method approximates given differential equations by difference equations p(E)y=ha(E)f:, where E is the operator of increasing n by 1, p(() = CXk(k + CXk-1 (k-l + ... +CX 1 ( +cx o , a(()=fJk(k+PH (H +... + PI (+ Po, CXk #0, and Icxol + IPol #0. (If p and a are of degree k in (, we speak of a linear k- step method.) By means of these difference equations we can determine y+k from y+k-l' y+k-2' ..., y. Since the Y+k are determined explicitly if Pk=O and implicitly if Pk#O, the difference equations are called explicit and implicit, respectively. In the implicit case, if Ihl < I(CXk/PdLI, where L is a tLipschitz constant for P(x, i), then the Y+k can be calculated by successive substitutions. In order to obtain the Y successively by means of a k-step method with k;:, 0, it is necessary, to begin with, to give the k -I sets of m values y;, ..., Y-l (called the starting values) in addition to the initial values Yb = I{ To determine the starting val ues, Runge-Kutta methods, overall approximation formulas, etc., are ordinarily used. The number of times we have to compute the P to pro- ceed one step from Y to Y+l is only I for an explicit case, while for an implicit case it is equal to the number of iterations required to secure the convergence of the successive substi- tutions (ordinarily, the step size h as well as the Oth approximation for Y+k are chosen so that the convergence is attained after a few itera- tions). We have lim h _ o x xe=O (xE[a,bJ) for any P and IJi and n"y starting values such that limh_oY=lJi (j1=0, I, ...,k-l), if and only if the polynomials p and (J satisfy the following two conditions: (i) consistency: p(I) = 0, p'(I)=a(1); (ii) zero stability: Every root 1136 of p(()=O lies on or inside the unit circle, and every root lying on the unit circle is simple. We call t =(p(E)i(xn) - ha(E)i'(x n )/a(l) the local truncation error. For a consistent and stable method determined by (p, a) there exist a real constant C p + l (# 0) and an integer p (;:' 1) such that t=hP+1 Cp+1i(p+l)(xn)/a(I)+ O(h p + 2 ); p is called the order of the method and C == Cp+da(I) the error constant. For a given p satisfying (ii), a is chosen to make the order as small as possible and then to make the error constant as small as possible. How- ever, p cannot exceed k + I if k is odd or k + 2 if k is even; moreover, p can be eq ual to k + 2 only if all the roots of p(()=O lie on the unit circle. The following are examples of multistep methods. (1) Explicit Methods. (a) Adams-Bashforth methods. k=l, p(()=(-I, a(()=I, p=l, C = 1/2 (Euler method); k = 2, p(() = ((( - I), a(() =(3( -1 )/2, p=2, C=5/12; k=3, p(()=e(( -1), a(()=(23e -16( + 5)/12, p= 3, C=3/8; etc. (b) Midpoint rule. k=2, p(()=(2_1, a(()=2(, p=2, C = 1/6. (c) Milne's predictor. k=4, p(()=(4_1, a(() =(8(3 _4(2 + 8013, p=4, C=7/90. (2) Implicit Methods. (a) Adams-Moulton methods. k=l, p(()=(-I, a(()=((+I)/2, p=2, C= -1/12 (trapezoidal rule); k=2, p(()=(((-I), am =(5(2 + 8( -1 )/12, p=3, C= -1/24; etc. (b) Milne's corrector (or the Milne-Simpson formula). k=2, p(()=(2_1, a(()=((2+4( + 1)/3, p=4, C= -1/90. 
1137 When an implicit formula (p, a) is used to obtain Y+k' the Oth approximation Y!k for Y+k is usually determined by an explicit for- mula (p*, a*) of the same order as (p, a) (where (p*, a*) itself may not necessarily be stable). This kind of combination of implicit and ex- plicit methods is called a predictor-corrector method (or PC method), where the formula (p*, a*) is called the predictor and (p, a) the corrector. Typical combinations are the mid- point rule and the trapezoidal rule, Milne's predictor and Milne's corrector (this combina- tion is called Milne's method), and an Adams- Bashforth method and an Adams-Moulton method of the same order. A predictor- corrector method has the advantage that the local truncation error (of the corrector) can be estimated in the course of calculation without extra computation. In fact, if the orders of the predictor (p*, a*) and the corrector (p, a) are equal to p and their error constants to C* and C, respectively, then the local truncation error t can be estimated by t = KD + O(h P + 2 ) (n = 0,1,...), where D == Y!k - Y+k is the differ- ence between the value Y!k at xn+k obtained by the predictor and the Y+k obtained by the corrector, and K = ()(:C/[(C - C*)a*(I)] [16]. The term multistep methods originates from the fact that these use the values of the depen- dent variables at more than two different mesh points in order to proceed one step. These are also called multivalue methods since they use more than one value of the dependent vari- able. The multi value method is, however, a more general concept than the multistep method. Linear multistep methods are not only ex- amples of the PC method; they are also exam- ples of the variable-step variable-order algo- rithms (VSVO algorithms), where the order of the formula as well as the step size are auto- matically chosen according to the behavior of the solution. In practical VSVO algorithms, the Adams-Bashforth-Moulton family of PC pairs of order 1 to 13 are usually used, and for solving stiff systems ( Section G) those correctors with large regions of absolute sta- bility are used. In these algorithms we use the multivalue method, which saves the informa- tion at different steps in a form convenient for the change of order and of step size. The multivalue method also facilitates error esti- mation and stability, and results in an efficient use of memory and reduction of the computa- tional cost. For example, in Gear's algorithm the information required for computing Yn+J is saved in the following form: Yn =(Yn, hy,..., (hk/k!)yk)), where y'(x) is the polynomial Pk.n(X) interpolating fn' fn-1' ... In-k+J, where Y) = di-l P k . n (x)/dx i - 1 1 FX" hold. All the local trun- cation error estimators are based essentially 303 G Numerical Solution of ODEs on Milne's device. In the VSVO algorithms heuristics play an important role [2,17,18]. F. Extrapolation Methods Let us assume that the numerical solution with the step size h at some fixed point x has the form m y(x, h) = y(x) + I Lihd + O(hr(m+J)), i=l where r is a positive integer. Then we can approximate y(x, h) by a function Rm(x, h) with (m+ 1) unknowns determined by the require- ment that Rm(x, h j ) = y(x, h),j = 0,1,2, ..., m, hi> h j (i <j) in order to approximate y(x) by Rm(x, 0). If Rm(x, h) is a polynomial of degree mr in h, we call the foregoing method a poly- nomial extrapolation method, where Rm(x, h) = y(x) + o(hr(m+J)) as h tends to O. Defining Rb = y(x, h j ) and Rm(x, O)=R for j= i, i + 1, ..., i + m, we can calculate the R by the re- cursion relation R=R;:!1 + (R;:!1 -R-d/ «(hJhi+mr -1), m 1. It is well known that, for the following method (Gragg's method). there exists an asymptotic expansion with r=2 [19]: xn=nh, rJ(O,h)= y(O), rJ(xJ,h)= yo+f(yo,O), rJ(xn+J, h)=rJ(Xn-1' h) +2hf(rJ(xn, h),x n ), n = 1,2, ..., N -1, where X N =x, y(x, h) = 1/2[I](x N - 1 , h) +I](XN, h) + hf(I](XN,h),XN)]. If Rm(x,h) is a rational function Rm(x, hj)=(po + PI h+ p2h2+... + pph P ) /(QO+q1h+q2h2+... +q,h'), where J1= [m/2], v=m- J1= [(m+ 1)/2],j= i, i + 1, ..., i + m, we call this method a rational extrapolation method. For r = 2, the following formulas were derived by Bulirsch and Stoer [20]: R=R;:!1 + (R;:!1 -R_d/((h;/hi+m)2 X [1-(R;:!1 -R-d/(R;:!l -R;:!2)] -1), mI,Ri_1=0, R=y(t,hJ G. Stability Consider the application of the general k-step method at the nth point (which is consistent and O-stable) k k .I ()(jYn+j=h I l3;fn+j FO j=O to y' = f(x, y), y(Xo) = Yo. Let Yn be the numer- ical solution at x = Xn' Let en = y(x n ) - Yn be the global error, and let qJn be the total error at the nth application. Then we find k _ k ar _ I ()(je n + j = h I  iJ (Xn+j' n+)f3jen+j + qJn+ko FO jO Y 
303 H Numerical Solution of ODEs where n+j lies in the open interval whose endpoints are Yn+j and y(x n +). If we make two assumptions, aj/aY=A (const), rpn=rp (const), the above equation reduces to L7o(()(j- h),fJ)e n + j = rp, whose general solution is given by k k en = L dsr; - rp/hA L fJj' 5=0 )=0 where the d s are arbitrary constants and the r, are the roots, assumed distinct, of the poly- nomial equation k L (()(j - hAfJj)r j = 0. jO We call the linear k-step method absolutely stable for a given hA iflr,l< 1, s= 1,2, ...,k. On the other hand, we call the linear k-step method relatively stable for a given hA if Ir,1 < 1r11 (or Ir,l<ei.h), s=2,3, ...,k, where r 1 is the root corresponding to the theoretical solu- tion. We call the region Sa = {hA II rsl < 1, s = 1,2,... ,k} (or S,= {h), Ilrsl <ei.h (or Irr I), s= 2,3, ... ,k} ) in the complex plane the region of absolute stability (or the region of relative stability) of the linear k-step method. Also we call Sa n R (or S, n R) the interval of absolute stability (or the interval of relative stability) of the linear multistep method, where R is the real line. The explicit methods and the PC methods have a finite interval of absolute stability. The implicit methods usually have larger intervals of absolute stability than the corresponding explicit methods. The higher- order PC methods have smaller intervals, while Runge-Kutta methods do not. The PECE methods usually have larger intervals of stability than the corresponding PEC or P(EC)2 methods, where P indicates an appli- cation of predictor, C an application of cor- rector, and E an evaluation of 1[21,22]. Let us consider the linear system y' = Ay + tp(x), where A is an m x m constant matrix and y(x), y'(x), tp(X)E Rm. If A possesses m dis- tinct eigenvalues A, = J1, + iv" t = 1,2, ... ,m, the theoretical solution of this system is given by m y(x)= L K,exp((J1, + iv,)x)C,+ !fI(x), t=1 where K, and C" t = 1, 2, ..., m, are, respec- tively, arbitrary constants and the eigenvectors corresponding to A, and !fI(x) is a particu- lar solution. We call the linear system stiff if (i) J1, < 0, t = 1,2, ..., m, (ii) s» 1, where s = maxI <;'<;m 1J1,I/min 1 <;,<;m 1J1,1. We call the non- linear system y' = I(x, y), y(x), y"(x), {(x, y)E Rm stiff in an interval I of x if, for every x E I, the eigenvalues ),,(x) of the tJacobian matrix of I satisfy (i) and (ii). The ratio s is called the stiff- ness ratio. To solve stiff systems effectively a variety of methods with infinite stability 1138 regions have been proposed. Some of them are: (1) A-stability (Dahlquist): Sa;2 {hA I Re(hA) <a} (2) Stiff-stability (Gear): Sa;2 SI U S2' SI = {h), I Re(hA) < -a < a}, S2 = {hA I -a";; Re(hA)";; b, -c,,;;Im(hA)";;c,b;:,O,c>O} (3) A(()()-stability (Widlund): Sa;2 {hA I-()« n- arg(hA) < er:, IXE(O, n12)} (4) Ao-stability (Cryer): Sa;2 {h}, I Re(hA) < O,Im(hA)=O}. The order p and the step number k of linear multistep methods are restricted by the follow- ing stability requirements: (i) A-stability: implicit, p";; 2; trapezoidal rule is the most accurate method. (ii) Stiff-stability: implicit, p";; k; backward differentiation formulas L7o()(jYn+j=hln+k are stiffly stable for p= k = 1, 2, ...,6, O-unstable for k=6. (iii) A(()()-stability: implicit; there exist high- order A(O)-stable linear multistep methods. (A method is said to be A(O)-stable if it is A(()()- stable for some sufficiently small ()(E(O, n/2).) For the Runge-Kutta (P, p) methods, we can write e n + 1 ={1 +hA+(hA)2/2!+... + (hAjP/p! P+1 } + L yq(hA)q e n +rpn+1' q=p+l where al/aY=A (const) and Y q are functions of the coefficients of the method in use. We call a region R = {hA 111 + hA + (h)i/2! +... + (hAjP/p! + L:+1 yq(hA)ql < I} a region of absolute stability of the Runge-Kutta (P, p) method. The implicit Runge-Kutta methods have a larger region of absolute stability than the cor- responding explicit Runge-Kutta methods. Therefore the implicit Runge-Kutta methods are suitable for stiff equations, and the explicit Runge-Kutta methods are suitable for nonstiff or mildly stiff equations. Recently, Yamaguti and others have pointed out that the instabilities occurring in the nu- merical solution of ordinary differential equa- tions are closely connected to the phenomena of chaos studied by Li, Yorke, and others [23,24]. H. Boundary Value Problems A boundary value problem is generally for- mulated as the problem of obtaining functions y'(x) that satisfy the differential eq uations y"(x) = f'(x,yl(x),... ,ym(x)) and boundary conditions Bk(i (x n ), ... ,ym(x n ); y1 (x n ), '" , ym(x n2 ); ...;l(x n ), ...,ym(xn))=O (i,k== 1, "', m), where the f' and Bk are given functions. If the f' and Bk are linear in the y', we can 
1139 first calculate the solutions yh(x) of the differ- ential equations under an appropriate initial condition, as well as a set of m independent solutions yf(x) (l = I, .., ,m) of the homogene- ous differential equations (say, under the ini- tial conditions y;(a)=<5f at some point x=a), and then substitute the expression for the desired solution yi(x) = yb(x) + L7'J ellY!(x) in the boundary conditions to obtain simulta- neous linear equations for the unknowns ell ( 302 Numerical Solution of Linear Equa- tions). No universally powerful method is known for the case where the t or the Bk or both are nonlinear. Ordinarily, we resort to the trial-and-error method of solving the differential equations iteratively under differ- ent initial conditions until the solution fully satisfies the boundary conditions, or to ap- proximating the differential operators by suitable difference operators to obtain a set of (generally nonlinear) simultaneous equations approximating at the same time the differential equations and the boundary conditions. For related topics  298 Numerical Computa- tion of Eigenvalues; 315 Ordinary Differen- tial Equations (Boundary Value Problems): [25-27]. I. Methods Other than Difference Methods Besides difference methods there are frequently used methods for finding in a given finite- dimensional function space a function that best satisfies the differential equations as well as the initial or boundary conditions. Denote the equations by L[y(x)] =0 and assume the conditions to be linear. We choose a function Yo(x) that satisfies the conditions and that is considered to approximate the exact solution, and also functions YI(X) (l = 1, "', q) that are considered to represent the typical deviations of Yo(x) from the exact solution and each of which satisfies the homogeneous conditions. We then determine the ell in such a way that y(x) = Yo(x) + Lf=l elIYI(X) best satisfies the differential equations. Corresponding to differ- ent interpretations of the words "best satisfy" there are different methods. The collocation method determines the ell so as to nullify the values of L[y(x)] at some prescribed points Xi; the method of least squares minimizes the integral of I L[y(x)]!l over the considered region; the Galerkin method makes L[y(x)] orthogonal to every YI(X) (I = 1, ..., q); and the Ritz method considers the variational problem <5J[y(x)J =0 whose tEuler equation is L[y(x)] =0 ( 46 Calculus of Variations F), if such a problem exists, and determines the ell so as to have J[y(x)] take an extremum value with y(x)= Yo(x) + Lf1 ellYI(X). 303 Ref. Numerical Solution of ODEs References [1] J. D. Lambert, Computational methods in ordinary differential equations, Wiley, 1973. [2] C. W. Gear, Numerical initial value prob- lems in ordinary differential equations, Prentice-Hall, 1971. [3] P. Henrici, Discrete variable methods in ordinary differential equations, Wiley, 1962. [4] G. Hall and J. M. Watt (eds.), Modern numerical methods for ordinary differential equations, Clarendon Press, 1976. [5] H. J. Stetter, Analysis of discretization methods for ordinary differential equations, Springer, 1970. [6] L. Lapidus and J. H. Seinfeld, Numerical solution of ordinary differential equations, Academic Press, 1971. [7] A. Ralston and P. Rabinowitz, A first course in numerical analysis, McGraw-Hill, second edition, 1978. [8] M. Tanaka, T. Arakawa, and S. Yama- shita, On the characteristics of Runge-Kutta methods, Information Processing in Japan, 17 (1977), 175181. [9] E. B. Shanks, Solutions of differential equations by evaluations of functions, Math. Comp., 20 (1966),21-38. [10] K. R. Jackson, W. H. Enright, and T. E. Hull, A theoretical criterion for comparing Runge-Kutta formulas, SIAM J. Numer. Anal., 15 (1978), 618-641. [11] M. Tanaka, On Kutta-Merson process and its allied processes, Information Process- ing in Japan, 8 (1968), 44-52. [12] 1. H. Verner, Explicit Runge-Kutta meth- ods with estimates of the local truncation error, SIAM J. Numer. Anal., 15 (1978), 772- 790. [13] H. Shintani, Two step processes by one step methods of order 3 and order 4, 1. Sci. Hiroshima Univ., ser. A-I, 30 (1966),183-195. [14] P. Merluzzi and C. Brosilow, Runge- Kutta integration algorithms with built-in estimates of the accumulated truncation error, Computing, 20 (1978), 1-16. [15] G. D. Byrne and R. J. Lambert, Pseudo- Runge-Kutta methods involving two points, J. Assoc. Comput. Math. 13 (1966), 114-123. [16] H. J. Stetter, Global error estimation in Adams pC codes, ACM Trans. Math. Soft- ware, 5 (1979), 415-430. [17J L. F. Shampine and M. K. Gordon, Com- puter solution of ordinary differential equa- tions, the initial value problem, Freeman, 1975. [18] F. T. Krogh, A variable-step variable- order multistep method for the numerical solution of ordinary differential equations, Proc. Information Processing 68, North- Holland, 1969, vol. 1. 
304 A Numerical Solution of PDEs [19J W. B. Gragg, On extrapolation algo- rithms for ordinary initial value problems, SIAM 1. Numer. Ana1., 2 (1965), 384-403. [20J R. Bulirsch and J. Stoer, Numerical treat- ment of ordinary differential equations by extrapolation methods, Numer. Math., 8 (1966). [21J G. Dahlquist, Stability questions for some numerical methods for ordinary dif- ferential equations, Amer. Math. Soc. Proc. Symp. in Applied Math. 15, 1963. [22J M. Iri, A stabilizing device for unstable numerical solutions of ordinary differential equationsDesign principle and applications of a "filter," Information Processing in Japan, 4 (1964), 65- 73. [23J T. Y. Li and J. A. Yorke, Period three implies chaos, Amer. Math. Monthly, 82 (1975),985-992. [24J M. Yamaguti and H. Matano, Euler's finite difference scheme and chaos, Proc. Japan Acad., ser. A, 55 (1979), 78-80. [25J A. K. Aziz (ed.), Numerical solutions of boundary-value problem, Academic Press, 1975. [26J H. B. KeI1er, Numerical methods for two- point boundary-value problems, B1aisdeI1, 1968. [27J H. B. KeI1er, Numerical solutions of two- point boundary-value problems, SIAM Re- gional Conf. Ser. in Appl. Math., 1976. 304 (XV.g) Numerical Solution of Partial Differential Equations A. General Remarks The numerical solution of partial differen- tial equations became practical in the 1950s with the advent of automatic digital compu- ters. Nowadays, by means of modern high- performance computers, the numerical solu- tion of partial differential equations is carried out extensively and often on a very large scale for problems in physics, engineering, and other fields of applied analysis, in order to obtain approximate solutions of rigorous equations or to simulate real phenomena by means of numerical experiments. Various numerical-solution schemes have been pro- posed, applied, and studied, and programming techniques required to implement numerical solutions are being invented regularly; some are of a general nature, and others are de- vised for particular problems. The variational method, which includes the Ritz-Galerkin method and the finite element method, and 1140 the difference method deserve special mention here because of their generality and scope of application. B. The Ritz-Galerkin Method Having been applied mainly to boundary value problems for eI1iptic partial differen- tial equations, the Ritz method is a classical method, which has been used since the old era of mechanical calculators, and is mathemati- caI1y nothing other than the tdirect method in the calculus of variations ( 46 Calculus of Variations F) applied to tvariational problems that are equivalent to the original boundary value problems. In order to exemplify the method, let Q be a bounded domain in R N with piecewise smooth boundary S, and con- sider the Dirichlet boundary value problem comprised of the Poisson equation U = - f (1) and the boundary condition uls={3. (2) Here rand {3 are given functions on Q and S, respectively. (In the fonowing, given functions are assumed to be sufficiently smooth.) The boundary value problem (I), (2) is equivalent to the variational problem of minimizing the tfunctional 1 J[uJ=2:a(u,u)-(u,J) (3) within the set D(J) of admissible functions u subject to (2). In (3) the following notation is used: a(u, v)=(Vu, VVh"2(rJ)= In Vu' Vvdx and (U,V)=(U,V)L 2 (rJ)= In uvdx, while D(J)= {UE Lz(Q)1 VUELz(Q), uls= {3}. This variational problem is further reduced to the condition: UED(J), a(u, rp)=(j, rp) (VrpE V), (4) where V stands for the tSobolev space HJ(Q) ( 168 Function Spaces B), that is, V= HJ (Q) = {UE Lz(Q)1 VUELz(Q), uls=O}. Note that V is equal to D(J) with {3=0. The condition (4) is often called a weak form of the boundary value problem. In applying the Ritz 
1141 method to this problem, we introduce un- known parameters lXI' IX 2 , ..., IXn and set the approximate solution Un in the form Un = F(x; lXI, IX 2 , ..., IXn)E D(l) and minimize l[u n ] as a function of the n vari- ables lXI, IX 2 , ..., IXn' A standard way to form Un is to choose first a bE D(l) subject to the inhomogeneous boundary condition and <PI' <P2,.", <PnE V subject to homogeneous bound- ary condition, and then to set U n =b+IX 1 <P1 + IX 2 <P2 +... +IXn<Pn' The functions <Pj U = 1,2, . .. , n) are called basis functions or coordinate functions in the Ritz method. Denoting by Dn the set of all possible functions appearing on the right-hand side of (5) and by V n the linear space generated by <PI' <P2' ..., <Pn' we can state the condition to determine the approximate solution Un as: UnEDn a(u n , <p)=([, <p) (\f<pE V n ). Thus the Ritz method is a kind of projective approximation method which approximates the weak equation (4) by its projection on a finite- dimensional space v". The condition (6) is equivalent to the equations a(U n , <p)=(J, <p) U= 1,2, ...,n). Particularly when fJ == 0 in (2), Dn coincides with v", and the equations (7) that deter- mine the coefficients IX j are reduced to a linear equation KIX=Y for IX = t(IX 1 , IX 2 , ..., IXn), where the matrix K = (K ij) and the vector Y = t(YI, Y2"'" Yn) are given respectively by Kij=a(<pi,<pj) and Yj=(f,<PJ Solving (8) numerically by the tGauss elimina- tion method or by titeration, one eventually obtains the approximate solution un by the Ritz method. The Ritz method is also appli- cable, for instance, to the eigenvalue problem u = AU, U I, = 0, since this eigenvalue problem is equivalent to the variational problem of finding a stationary Rayleigh's quotient R[u] = a(u, u)/(u, u) with Vas the set of admissible functions. In the approximation to restrict the admissible functions to v", the approximate eigenvalue J.(n) is determined through the ma- trix eigenvalue problem K IX = A (n) MIX, where the matrix K is as above and the matrix M = (M i ) is given by Mij=(<Pi,<PJ Sometimes, optimization of l[u n ] or R[u n ] is carried out more directly, e.g., by the tgradient method, particularly when Un contains free parameters IX j in some nonlinear way. 304 B Numerical Solution of PDEs (5) Many studies have been made of conver- gence and error estimation for the Ritz method in general (e.g., [1-4]). For instance, it is known that, if # = 0, f E L 2(0.), and {<Ph <P2' ... ,<Pn' ... } spans a linear subspace dense in V=HJ(0.), then the approximate solution Un obtained through (6) (or through (12) below) converges to the rigorous solution in the H1_ topology (and hence in the L 2-topology). Nevertheless, success in using the method in practical applications can be gained only by a clever choice of trial functions. A typical but systematic example of such a choice is made by the finite element method (---'" Sec- tion C). We proceed to the Galerkin method [I]. Suppose that we are to solve the equation L[u] =u +(b. V)u= - f, (10) (6) which is the equation in (1) perturbed by a lower-order term (the convection term). Here b =(bl(x), ...,bn(x)) and (b. V)U = LA(x) (Ju/JxJ For simplicity, the boundary condition (2) is assumed to be homogeneous (i.e., fJ == 0). Although the boundary value problem (10), (2) cannot be reduced to a variational problem since (10) is not symmetric, it is equivalent to its weaker form a(u,<p)-((b'V)u,<p)=(f,<p) (\f<pE V), (11) (7) provided that UE V. Note that (11) is ob- tained from (L [u] + f, <P )L 2 = 0 by transforming (L [u], <p) through integration by parts. Ac- tually, this boundary value problem has a unique solution u for any f E L 2 (0.) if Idiv bl is sufficiently small. According to the Galerkin method as applied to the present problem, one replaces V by V n in (11) and determines the approximate solution Un = IX 1 <PI + '" + IXn<Pn E v" by the condition (8) a(u n , <p)-((b' V)u n , <p)=(1,<p) (\f<pE v,,), (12) which is equivalent to the equations a(u n , <Pj) -((b' V)u n , <Pj)= (f, <p) U = 1,2, ..., n). (13) (9) Sometimes it is convenient to project (L[u] + f, <p) = 0 onto the fini te-dimensional space and to deal directly with (L [un] + 1, <Pj) = 0 U = 1,2,..., n). When the coefficients of the equa- tion and the given function f are periodic in space variables and when sine or cosine func- tions are chosen as basis functions, the Galer- kin method turns out to be the same as the so-called Fourier approximation method and can be efficiently implemented with the aid of a tfast Fourier transform (FFT) to yield the required Fourier coefficients numerically [5]. The Galerkin method can be applied to the numerical solution of evolution equa- tions, namely, in order to approximate time- 
304C Numerical Solution of PDEs dependent solutions. This is exemplified through the following initial-boundary value problem for the diffusion equation (heat equation): au -=u (t;;:,O, xEQ) at with the homogeneous boundary condition uls=O and the initial condition ult=o=a(x) (XEQ). We first set the required approximate solution Un = un(t, x) in terms of the basis functions rpl, rp2, ..., rpn of v" as U n =!X1(t)rpl +!X 2 (t)rp2 +... +!Xn(t)rpn, (17) and then determine the coefficients !XI (t),!X 2 (t), ... , iXn(t) from the conditions d di(un,rp)= -a(un,rp) (t;;:'O, VrpE v,,) (18) and (un(O), rp)=(a, rp) (VrpE v,,). Since (18) is equivalent to the equations d di(un,rpj)= -a(un,rp) (j=1,2,...,n), the vector function !X(t) = !(!X 1 (t), !X 2 (t), .", !Xn(t)) is obtained by solving the ordinary differential equation d M-(X= -K!X dt ' where the matrices M and K are those in (8) and (9). The initial value !X(O) of !X should be given in accordance with (19). Sometimes the procedure above is caIled a semidiscrete ap- proximation based on the Galerkin method. Obviously, this method can be applied to more general evolution equations, for instance, to the diffusion-convection equation au -=L[u] =u+(b' V)u, at which governs time-dependent states corre- sponding to stationary states subject to (10). As a concrete example, we can refer to the finite element approximation for evolution equations described in Section D. If the geom- etry of Q is simple enough, e.g., for I-dimen- sional Q, one can adopt eigenfunctions of L [ ] as the basis functions; the resulting version of the semidiscrete Galerkin method is called a spectral method. In order to obtain un(t) one has to carry out numerical integration for (20), discretizing the time variable. The Galerkin approxima- 1142 (14) tion with discretization of the space and time variables is caIled a full discrete approximation. Finally, an advantage of the Ritz-Galerkin method is that if the boundary condition imposed on rigorous solutions is a natural boundary condition, then the admissible func- tions and particularly the basis functions need not satisfy it [4]. (15) (16) C. Finite Element Methods for Boundary Value Problems (19) In recent years the method of numerical solu- tion of partial differential equations most extensively employed in structural mechanics and many other fields of engineering has been the finite element method. In its standard form, the finite element method can be regarded, at least mathematically, as a type of Ritz- Galerkin method that adopts as its basis func- tions piecewise polynomials of low degree and with narrow supports. Although the idea leading to the finite element method can be traced back to a paper by R. Courant in 1943 (Bull. Amer. Math. Soc.), the method acquired its popularity in the late 1950s when it was rediscovered by engineers on the basis of me- chanical considerations [6]. Here we apply the method to the boundary value problem (l), (2), assuming that p = 0 and Q a polygonal domain in R 2 [7-9]. First, n=QU S is divided into small triangles of which the length of any side does not exceed h > O. Each triangle T appearing in this decomposition of n is called a triangular element, or simply an element (following the terminology in structural me- chanics), and we denote the set of all triangular elements by :Yh' h being equal to max TEy , {the longest side of T}. The set of all nodal points, i.e., the vertices of the triangular elements, is denoted by N, and we put (20) N;={PENIPEQ}={P 1 ,P 2 ,...,P n }. (22) (21) Then a standard choice of admissible functions is to adopt as v" in (6) the foIlowing v" c V = HJ(Q): v" = {VhE C(n) I V h is linear on each element and vhls=O}, (23) namely, v" is the set of all elementwise linear continuous functions satisfying the boundary condition. Now the approximate solution UhE v" is determined by the condition a(uh' rph)=(f, rph) (Vrph E v,,). (24) A basis of v" is formed by pyramidal functions rpj (j = 1, 2, "', n)E v" such that rpi)= 1 and rp}Q)=O (QEN,,{}). (25) 
1143 In order to obtain U h , one has to solve (8) numerica1!y for a. Since the support of <Pj is confined to elements adjacent to , the matrix K turns out to be a sparse matrix of multi- diagonal type, and hence particular elimination procedures can be applied with high efficiency. In the finite element method, the matrix K is often called the stiffness matrix. The conver- gence of Uh to U as h->O is guaranteed if :T h (0 < h =( ho) satisfies a certain regularity con- dition. For instance, if any angle of a triangle T is not smaller than eo> 0, then Uh converges in the HI (a)-topology. Moreover, when a is convex, it holds that Ilu-UhlIHI =( ChllfllL2, Ilu- uhllL 2 =( Ch211fllL2 [7,9]. Uniform convergence of Uh to U and Len- estimation of the error have also been estab- lished (e.g., [7, chap. 3J). Uh is subject to the maximum principle, provided that the trian- gulation :T h is of acute type. Here, :T h is said to be of acute type (or of strongly acute type) if any angle 0 of TE:T h is in 0<0 =( (nI2) (0 < IJ=(IJ 1 < (nI2). To acquire higher accuracy, more sophisticated admissible functions that are piecewise polynomials of higher degree are used. To take account of curved boundaries, several devices, including the isoparametric method, have been proposed. In dealing with biharmonic equations or other equations of higher order, one can con- veniently apply a modified version of the finite element method of mixed type or of noncon- forming type; in the latter the approximate solution is sought within a class of functions which are of less regularity at the interface of elements and hence do not belong to the domain of the original variational problem. i.e., which are not admissible in the classical sense [10,11]. D. Finite Element Methods for Initial Value Problems The most basic type of finite element method applicable to evolution problems is described here, and concerns the initial boundary value problem (14)-(16), assuming that a is the same as in Section B [8,9]. In the semidiscrete finite element approximation one uses the functions <Pj of the preceding section as the basis for the approximate solution, now denoted by Uh' and then determines U h through (17)-(19) with v" replaced by V h . Then Uh converges to U as h->O, provided that the triangulation :T h is regular. Furthermore, under the same condition that makes (26) hold, the error is given by the fol- 304 D Numerical Solution of PDEs lowing estimate [12]: Ilu-UhIIL2=(Ch21Iallult (t>O), (27) which reflects the smoothing property of the diffusion equation. In the terminology of the finite element method, the matrix M in (20) is called the mass matrix. In order to gain a fully discrete finite element approximation U'.h for u, one discretizes the time variable with the tmesh length, and solves the difference analog of (20), i.e., either M(a(t+T)-a(t))/T= -Ka(t) (28) or M(a(t+T)-a(t))/T= -Ka(t+T), (29) (26) where t is restricted to t= kr (k = 0,1,2,...). The difference schemes (28) and (29) are of forward type and of backward type, respec- tively. From (29) follows Ila(kr)11 =( Ila(O)II, and consequently Ilu,.h(t)III' 2 =( CIIak. In the gener- ally accepted terminology, the approximation is stable if for each T> ° there exists a positive constant C T depending on T such that Ilu,.h(t)11 =( CTllal1 (O=( t = kr =( T). (30) Thus the approximation through (29) is un- conditionally stable, while the one through (28) is stable only under certain conditions on the triangulation :T h and on the time mesh T. For instance, the forward scheme (28) is stable if :T h is regular and satisfies the inverse assumption h sup max < +00, O<h>;;h o ToY, (longest side of T) and if the ratio Tlh 2 is sufficiently small [13,14]. Sometimes, the mass matrix M in (28) and (29) is modified by the following procedure, ca!1ed mass lumping: For each , one joins alternately the center of gravity of the ele- ments with  as one of their vertices and the middle point of the sides with  as one of their endpoints, thus forming a closed broken line Ij surrounding . We denote by qJj the char- acteristic function of the polygonal domain Bj bounded by Ij. When mass lumping is ap- plied, the matrix M is replaced by the matrix M =((qJi' qJj))' Generally, mass lumping re- laxes the conditions for the stability and the maximum principle to hold, although it may lower the accuracy to some extent (H. Fujii, Proc. US-Japan Seminar, Univ. Tokyo Press, 1973). Usually, schemes without mass lumping are ca1!ed consistent mass schemes. In principle, there is no difficulty in applying the finite element method to the diffusion- convection equation (21). However, when lib II is large, the stability condition becomes strin- 
304 E Numerical Solution of PDEs gent. To meet these difficulties, M. Tabata, T. Ikeda [15J, and others have devised some schemes that enjoy better stability, the maxi- mum principle, and even the law of conserva- tion of mass by improving mass lumping and introducing the idea of an upstream approxi- mation or an artificial viscosity for discretizing the convection term. Moreover, finite element methods are currently being applied to wave equations, the Navier-Stokes equations, and various other evolution equations. E. Difference Methods for Boundary Value Problems In difference methods for solving partial dif- ferential equations we reduce the original equation to its difference analog, replacing tdifferential quotients of the unknown function by the corresponding difference quotients. The difference analog, usually a system of algebraic equations, is then solved numerically, yielding the desired approximate solution. Since the last part of the method, i.e., the numerical solution of the difference analog, requires a large amount of computation, difference methods were not feasible until the develop- ment of automatic computers. In terms of the variable x, difference ap- proximations for the derivative df/dx are the forward difference (D;f)(x)=(J(x+h)- f(x))/h, the backward difference (DJ)(x)=(J(x)- j(x-h))/h, and the central difference (symmetric difference) (c5;f)(x)=(f(x+h)- f(x-h)/2h. In order to exemplify the difference method applied to boundary value problems, we re- turn to equations (1) and (2), supposing that Q is a 2-dimensional square Q= {(x,y)1 0 <x < 1, O<y < 1} [1,16-18]. Then we cover Q U S by a square net (lattice, grid) with mesh length h = 1/ N, for N a large integer. The set of net points Pm.n=(mh,nh) lying in Q is denoted by Qh' and Sh is the set of net points on S. We want to have a net function Uh on Rh=QhUSh that approximates the solution U of (1) and (2), where Uh is determined by the difference equation LihUh(X,y)= - f(x,y) «X,y)EQh)' (31) and the boundary condition Uh(x,y)=P(x,y) ((X,Y)ES h ). (32) Here Li h is a difference operator, the 5-point 1144 difference analog of Li, which is defined by Lih<P=DD;<p + DD;<p =++<p-+<py+ + <p(x, y- h)-4<p(x, y))/h 2 . In view of the maximum principle for net functions on Rh' it is seen that (31) and (32) admit only Uh = 0 as a solution if f = {3 = 0, which implies that the solution Uh of(31) and (32) exists uniquely for all f and {3. When we are concerned with the convergence of U h to U as h-->O, we have to consider (31) and (32) for all small h> O. Such a family of difference equations is called a difference scheme. Actu- ally, Uh obtained through (31) and (32) con- verges to U as h-->O, and we can say that the difference scheme (31) and (32) is convergent. In general, the order of the error U - Uh de- pends on the smoothness of u. For instance, if UE C 4 (Q), then Iu - uhl:( Ch 2 holds uniformly on Rh' When the asymptotic behavior of the error is known to satisfy U - Uh = wh 2 + o(h 2 ) for some w=w(x,y), it is possible to attain a higher accuracy by eliminating the h 2 -order term if we compute U h for two different values of h and form an appropriate linear combi- nation of the Uh thus obtained (Richardson's extrapolation;  e.g., [17J). The convergence of difference schemes applied to boundary value problems is not affected if rectangular nets are used instead of square nets. Many techniques have been devised to treat curved boundaries. There have been attempts recently to map Q into a domain Q of simpler geom- etry, say, a rectangular one, and then discretize the arising partial differential equations with variable coefficients in Q (e.g., J. F. Thompson et aI., J. Comput. Phys., 1982.) Moreover, when information regarding the location and nature of singularities of solutions is available, one can refine the net near these points. The difference analog of elliptic equations described above is a system of linear equa- tions with the values of the U h as its unknowns. Since the coefficient matrix of this system is of multidiagonal type, direct elimination methods can be efficiently applied to solve it. Partic- ularly, some elimination algorithms employ- ing vector computations have been invented [19]. Iteration methods of the Gauss-Seidel type can also be applied, where acceleration of convergence by SOR is effective ( 302 Numerical Solution of Linear Equations C). F. Difference Methods for Initial Value Problems To demonstrate characteristics of the dif- ference method applied to evolution equa- 
1145 tions [17,20,21J, we first consider the initial boundary value problem (14)-(16) for the dif- fusion equation supposing that Q=(O, I), a I-dimensional interval. Then we cover Q = [0, ex)) x Q by a rectangular net t = m (n = 0,1,2,...) and x=jh (j=0, 1,...) with mesh lengths M = T and x = h. The value of the approximate function U'.h at the net point (m,jh) is denoted by Up. Then the simplest difference scheme offorward type for (14) is written as "+1_ Up UP+I + "-l -2UP T h 2 Namely, we adopt the difference operator L"h[ J =D,- DD; as the difference analog of (a/at)-(a 2 /ax 2 ). Equation (33) is consistent in the sense that L,.h[UJ = "the residual of u"->O (T,h->O) for any smooth solution u of (14). This is the case also with the difference scheme of back ward type n+I_" Up+1 + "+:/ -2Up+1 T h 2 To compute the approximate solution U"= {n I Oj N}, (33) or (34) must be combined with the approximate initial condition cor- responding to (16), say o=a(jh), and with the boundary condition U& = U = 0. Then one can proceed from UO to U 1 , from U I to U 2 , and so on. Actually, (33) is rewritten in the form n+1 =icUP+I + J,UI'-1 +(1-2J.)UP, (35) with J. = T/h2, which gives U n +1 explicitly in terms of un. Hence (33) is an explicit scheme. On the other hand, in order to acquire U n + 1 from un according to (34), it is necessary to solve a system of linear equation with un+1 as its unknown. Hence (34) is an implicit scheme. Since the coefficient matrix of the system of linear equations to determine U n + 1 according to (34) is tridiagonal, one can employ a partic- ularly efficient elimination algorithm. We introduce the maximum norm II<phllh= maxRhl<p(jh)1 for net functions <Ph on Rh=QhU Sh={x=jhIOjN}. Then, as is obvious from (35), the approximate solution U" = u,.h(m) obtained through (33) satisfies II Unll h  llUo Il h (n=O, 1,...), provided that O<;.=-;. h 2 Generally, a difference scheme approximating an initial value problem is said to be stable if the approximate solution U'.h for the initial value a h = U"h(O) satisfies, for small T and h, Ilu,.h(t)llhMTllahllh (Ot=mT) (37) for any T> 0, M T being a constant depending 304 F Numerical Solution of PDEs (33) on T. Lax's equivalence theorem asserts that if the original initial value problem is twell- posed and if the approximating difference scheme is consistent, then the stability of the difference scheme is necessary and sufficient for the convergence of the approximate solu- tion U'.h to the rigorous solution U as T, h->O. For instance the explicit difference scheme (33) for the I-dimensional diffusion equation is convergent under the mesh condition (36). On the other hand, the implicit scheme (34) is unconditionally stable and hence is conver- gent. Many difference schemes more sophisti- cated than (33) and (34), of higher accuracy or with other favorable properties, have been proposed and studied. All these difference schemes can be generalized to the case of many-dimensional Q. For instance, ifQcR 2 , then by means of the difference operator h in (31), the forward scheme and the backward scheme are given by D,U"=hU" and D,U"= h un+1, respectively. The former is stable if ° <;. = T/h2  1/4, while the latter is uncon- ditionally stable. Moreover, in a method called the ADI method (alternating direction implicit method) one introduces U n +1/2 on fractional steps t=m+T/2 and discretizes (14) as (34) un+I/2 un - D'!.D h un + D'!.Dh Un+I/2 '[/2 X x y y , Un+1 _ U"+1/2 -D'!.D h U"+1 + D'!.Dh un +1/2 ( 38 ) T/2 x x y y . Then, starting from Uo, the computation goes as U°->U1/2->UI->...->un->U"+1/2... by solving at each step alternatively a difference equation implicit with respect to x and y, for which a particular elimination method can be used with good efficiency. Furthermore, the ADI method is unconditionally stable. Various types of fractional-step method that generalize the ADI method have been proposed [12, 17]. We now proceed to hyperbolic equations. First consider the Cauchy problem where the spatial domain is the whole R I. Noting that the wave equation v" = V xx , for instance, can be rewritten as au = ( 0 l ) au at 1 ° ax (36) by putting U = '(v" vxJ, we here deal with a hyperbolic system of the form au au -=A- at ax' (39) where the unknown u is an m-vector u = '(uj,u 2 ,... ,urn), and A is a constant m xm matrix that is supposed to be real and sym- metric. For hyperbolic equations, explicit schemes are generally preferred, and as a typ- 
304 Ref. Numerical Solution of PDEs ical one we mention the Friedrichs scheme. In the Friedrichs scheme applied to (39), the approximate solution U = t(U 1 , U 2 ,..., Urn) is determined by U(t+T,X)--!-(U(t, x+ h)+ U(t, x-h)) T =Ab;U(t,x). Here t is restricted to t = m (n = 0,1,...), while x is regarded as a variable ranging over R 1. Introducing translation operators Th and T-h by I;.: rp(X)-Hp(X + h) and Lh:rp(x)-->rp(x-h), we obtain from (40) U(t+T, ')=ShU(t, '), where, with the mesh ratio A=T/h, Sh is given by Sh=G+A )I;.+G-A )Lh' (42) Generally, if a difference scheme is reduced to the form of (41), then the operator Sh' which yields evolution of the solution of the dif- ference equation for one time step, is called the amplification operator of the scheme. The stability of the difference scheme is implied by the uniform boundedness of the operator norm IISII of sn acting in the Hilbert space (L 2 (R l)r for 0:( n:( T/T. Consequently, the scheme is stable if IIShll:( 1 + CT, C being a constant. The tsymbol S"hm of Sh with respect to the Fourier transform is called the amplifi- cation matrix of the scheme. The L 2 -stability of the scheme mentioned above is equivalent to the uniform boundedness of the matrix norm IIS()II for O:(m:( T and ER1. There- fore, denoting by rh() the spectral radius of Sh()' one can give a necessary condition for the stability by Irh()I:( 1 + CT, which is called the von Neumann condition. Concerning gen- eral critera for the uniform boundedness of powers of Sh() when Sh@ is not necessarily normal, a fundamental theorem was given by H. O. Kreiss in 1962 [21]. If the largest-modulus eigenvalue of A is denoted by 110, then the Friedrichs scheme is stable and hence is convergent under the con- dition on the mesh ratio A = T/h given by A:( 1/11101. In particular, (43) implies that for the stable Friedrichs scheme propagation speed of the difference scheme  propagation speed of the original equation. (44) Condition (44) is a necessary condition for general difference schemes approximating 1146 hyperbolic equations to be stable, and is known as the CFL condition (Courant- Friedrichs-Lewy condition) [22]. For a scheme more accurate than the Friedrichs scheme we refer to the Lax- Wendroff scheme, which pro- ceeds by way of the following amplification operator Sh: (40) T 2 S - I+ TAbh + -A2DII.Dh h- x 2 x x A ).2 =1 +-A(T h + L h )+-(T h -21 + L h ). 2 2 (41 ) The Lax-Wendroff scheme is stabie under (43). There are many other difference schemes to approximate (39), most of which are appli- cable to higher-dimensional spaces. Some of these schemes can be conveniently employed to solve nonlinear hyperbolic equations, for instance, those arising in the gas dynamics of compressible fluids, for which the Friedrichs scheme and the Lax- Wendroff scheme were originally intended [23]. Concerning difference schemes with variable coefficients which ap- proximate a tregularly hyperbolic system with an x-dependent principal part, criteria for the stability of the scheme in terms of the symbol Sh(X,) of Sh have been obtained, making use of the theory of tpseudodifferential operators, by P. D. Lax and L. Nirenberg (Comm. Pure Appl. Math., 1966), M. Yamaguti and 1. Nogi (Publ. Res. Inst. Math. Sci., 1967), R. Vaillancourt (Math. Comput., 1970), Z. Koshiba (J. Math. Anal. Appl., 1981), and others. References (43) [1] L. V. Kantorovich and V. 1. Krylov, Ap- proximate methods of higher analysis, Inter- science and Noordhoff, 1958. (Original in Rus- sian, 1950.) [2] J. Necas, Les methodes directes en theorie des equations elliptiques, Masson, 1967. [3] R. Temam, Numerical analysis, Reidel, 1973. [4] R. Courant and D. Hilbert, Methods of mathematical physics, Interscience, I, 1953; II, 1962. [5] J. W. Cooley and J. W. Tukey, An algo- rithm for the machine calculation of complex Fourier Series, Math. Comp., 19 (1964), 297- 301. [6] M. J. Turner, R. W. Clough, H. C. Martin, and L. J. Topp, Stiffness and deflection anal- ysis of complex structures, J. Aero. Sci., 23 (1956),805-823. [7] P. G. Ciarlet, The finite element method for elliptic problems, North-Holland, 1978. [8] A. R. Mitchell and R. Wait, The finite element method in partial differential equa- tions, Wiley, 1977. 
1147 [9J G. Strang and G. Fix, An analysis of the finite element method, Prentice-Hall, 1973. [IOJ F. Kikuchi and Y. Ando, On the conver- gence of a mixed finite scheme for plate bend- ing, Nuclear Eng. Design, 24 (1973),357-373. [IIJ T. Miyoshi, A mixed finite element method for the solution of the von Karman equations, Numer. Math., 26 (1976), 255-269. [12J H. Fujita and A. Mizutani, On the finite element method for parabolic equations I, Approximation of holomorphic semi-groups, J. Math. Soc. Japan, 28 (1976), 749-771. [13J T. Kato, Perturbation theory for linear operators, Springer, second edition, 1976. [14J T. Ushijima, Approximation theory for semi-groups of linear operators and its appli- cation to approximation of the wave equa- tion, Japan. J. Math., 1 (1975), 185-224. [15J T. Ikeda, Maximum principle in finite element models for convection-diffusion phe- nomena, Kinokuniya and North-Holland, 1983. [16J G. E. Forsythe and W. R. Wasow, Finite- difference methods for partial differential equa- tions, Wiley, 1960. [17J G. I. Marchuk, Methods of numerical mathematics, Springer, second edition, 1982. [18J G. D. Smith, Numerical solution of par- tial differential equations: Finite difference methods, Clarendon Press, second edition, 1978. [19J D. L. Book (ed.), Finite-difference tech- niques for vectorized fluid dynamics calcula- tions, Springer, 1981. [20J F. John, Lectures on advanced numerical analysis, Gordon & Breach, 1967. [21J R. D. Richtmyer and K. W. Morton, Difference method for initial value problems, Interscience, 1957. [22J R. Courant, K. Friedrichs, and H. Lewy, Dber die partiellen Differenzengleichungen der mathematischen Physik, Math. Ann., 100 (1928),32-74. [23J R. Peyret and T. D. Taylor, Computa- tional methods for fluid flow, Springer, 1983. [24J B. Carnahan, H. A. Luther, and J. O. Wilkes, Applied numerical methods, Wiley, 1969. 304 Ref. Numerical Solution of PDEs 
305 A Obstructions 305 (lX.11) Obstructions A. History The theory of obstructions aims at measuring the extensibility of mappings by means of algebraic tools. Such classical results as the tBrouwer mapping theorem and Hopf's exten- sion and tclassification theorems in homotopy theory might be regarded as the origins of this theory. A systematic study of the theory was initiated by S. Eilenberg [lJ in connection with the notions of thomotopy and tcoho- mology groups, which were introduced at the same time. A. Komatu and P. alum [2J ex- tended the theory to mappings into spaces not necessarily tn-simple. For mappings of poly- hedra into certain special spaces, the thomo- topy classification problem, closely related to the theory of obstructions, was solved in the following cases (Km denotes an m-dimensional polyhedron): Kn+l-->sn (N. Steenrod [5J), Kn+2 -->sn (.I. Adem), Kn+k--> Y, where 7rj(Y)=O for i < nand n < i < n + k (M. Nakaoka). There are similar results by L. S. Pontryagin, M. Postnikov, and S. Eilenberg and S. Mac Lane. Except for the special cases already noted, it is extremely difficult to discuss higher obstruc- tions in general since they involve many com- plexities. Nevertheless, it is significant that the idea of obstructions has given rise to various important notions in modern algebraic topol- ogy, including cohomology operations ( 64 Cohomology Operations) and characteristic classes ( 56 Characteristic Classes). The notion of obstruction is also very useful in the treatment of cross sections of fiber bun- dles ( 147 Fiber Bundles), tdiffeomorphisms of differentiable manifolds, etc. B. General Theory for an n-Simple Space Y The question of whether two (continuous) mappings of a topological space X into an- other space Yare thomotopic to each other can be reduced to the extensibility of the given mapping: (X x {O} )U(X x {1})-> Y to a map- ping of the product X x I of X and the unit interval 1=[0, 1 J into Y. Therefore the pro b- lem of classifying mappings can be treated in the same way as that of the extension of mappmgs. Let K be a tpolyhedron, L a subpolyhedron of K, and ](n = L U K n the union of L and the tn-skeleton K n of K. Let Y be an tarcwise connected n-simple space, and J' be a mapping of L into Y. Denote by <l>n(f') the set of map- pings of [(n into Y that are extensions of J', 1150 and by <l>n(f') the set of thomotopy classes of mappings in <f>n(f') relative to L The set <f>0(f') consists of a single element because of the arcwise connectedness of Y, <f>1 (f') is non- empty, and <f>n(!') (n:;. 2) may be empty. Let.f" be an element of <l>n(f'). If we consider the restriction of fn to the boundary d"n+! of an oriented (n + 1 )-cell (jn+1 of K, then fn: d"n+! --> Y determines an element c(fn, (jn+ I) of the thomotopy group 7r n (Y) ( 202 Homotopy Theory). This element gives a measure of ob- struction for extending fn to the interior of (jn+1. We obtain an (n+ l)-tcocycle cn+l(fn) of the tsimplicial pair (K, L) with coefficients in 7r n ( Y), called the obstruction cocycle of fn, by assigning c(fn, (jn+1) to each (n+ I)-cell (jn+1. This obstruction cocycle c n +! (fn) is the mea- sure of obstruction for extending f" to [(n+1. A necessary and sufficient condition for the extensibility is given by cn+l(fn)==o. Clearly, c n + 1 un) is uniquely determined for each ele- ment f of <f>n(f'). The set of all c n + 1 (fn) with fn E<I>n(f') forms a subset on+1 (f') of the group of cocycles zn+! (K, L; 7r n (Y)). <l>n+ l(f') is non- empty if and only if on+1 (f') contains the zero element O. Let KD = K x I, LD =(K x O)L (L x I)U (K x 1). Given two mappings fo, f1: K --> Y satisfying fo I L = f11 L, we can define a natural mapping F':L0--> Y such that an element F n of <l>n(F') corresponds to a thomotopy h n - 1 relative to L connecting fo I K n - 1 with f11 Kn-1. Given an element F n E([ln(F'), we have the element c n + 1 (F") of zn+ 1 (KO, LO; 7r n (Y)), which we identify with zn(K, L; 7r n (Y)) through the natural isomorphis m of chain groups of the pair (KD, LD) to those of the pair (K, L). Thus we can regard cn+!(F") as an element of zn(K, L; 7r n (Y)), which is denoted by dn(fo, h n - 1 J1), and call it the separation (or difference) cocycIe. IfIo I ](n-1 = III [(n-I, we have the canonical mapping Fn:LoU(KD)" --> Y, and the separation co cycle is denoted simply by d n (fOJ1)' The set of;eparation cocycles corresponding to elem ents of <l>n(F') is considered to be a subset of Zn(K, L; 7r n (Y)) and is denoted by on( foJd. A l1ecessaryand sufficient condition for h n - 1 to be extensible to a homotopy on](n is d n (fo,h n - 1 J1)=0. Therefore a necessary and sufficient condition for fol[(nfll](n (reIL) (i.e., relative to L) is OEon(foJd. Given fJ:Jt: [(n_. Y with IJ:I L = m L, then dn(fo", hn-l Jr"J (EO' (fJ:Jt")) is an element of zn(K n , L; 7r n ( Y)). which is also considered to be a cochain of the pair (K, L). In this sense, we call dn(fJ:, h n -1 Jr"J the sepa- ration (or deformation) cochain over (K, L). The co boundary of the separation cochain dn(fJ:, h n - 1 , fn coincides (excf:pt possibly for sign) with c n + 1 (fJ:)-c n +! (fn. For a fixed If: E<I>n(f'), any n-cochain d n 
1151 of the pair (K, L) with coefficients in 7t n (Y) is expressible as a separation cochain d n = dn(fon,fr") where ftE<bn(f') is a suitable map- ping such that 10"1 /(n-1 = IIKn-1 (existence theorem). Therefore if we take an element In-1 of iii n - 1 (f') whose obstruction cocycle c n (fn-1) is zero, the set of an obstruction cocycles c n + 1 (fn) of an such InE<bn(f') that are exten- sions of f"-1 forms a subset of on +1 (f') and coincides with a coset of zn+1 (K,L; 1t n (Y)) factored by Bn+1(K, L; 1t n (Y)). Thus a coho- mology class c n + 1 (fn-1)E Hn+1(K, L; 7t n (Y)) corresponds to an f"-1 E <b n - 1 (f') such that c n (fn-1)=0, and c n +1(fn-1)=0 is a necessary and sufficient condition for In-1 to be exten- sible to K n + 1 (first extension theorem). For the separation cocycle, ([n(fo, h n - 2 ,f1)E H"(K, L; 1t"(Y)) corresponds to each homotopy h n - 2 on K"-2 such that d n - 1 (fo,h n - 2 ,f1)=0, and ([n(fo,h n - 2 ,f1)=0 is a necessary and sufficient condition for h"-2 to be extensible to a homotopy on /(n (first homotopy theorem). The subset of H"+l(K, L; 1t"(Y)) correspond- ing to 0"+1(f') is denoted by 0"+1 (f') and is caned the obstruction to an (n + 1 )-dimensional extension of f'. Similarly, the subset on(fo,f1) of Hn(K, L; 7t"(Y)) corresponding to on(fo,f1) is caned the obstruction to an n-dimensional homotopy connecting 10 with II' Clearly, a condition for f' to be extensible to Kn+1 is given by OEO n +1(f'), and a necessary and sufficient condition for 10 I K" I11 K" (rel L) is given by OEO n (fO,f1)' A continuous mapping q>:(K',L')-->(K,L) induces homomorphisms of cohomology groups q>*: H n +1(K, L; 1t n (Y))-->H n + 1 (K',L'; 1t"(Y)), H"(K, L; 1t"(Y))-->H"(K',L'; 1t n (Y)). Then for f':L+- Y, on+1(f'oq»=>q>*on+1(f'), and for 10' II : K --> Y such that 10 I L = III L, O"(fo 0 q>,f1 0 q» => q>* on(fo,f1)' Therefore we also find that the obstruction to an extension and the obstruction to a homotopy are independent of the choice of subdivisions of K, L, and con- sequently are topological invariants. Let 10, II, and 12 be mappings K --> Y such that 10 I L = III L = 121 L. Given homotopies h o j 1 :Io /K n -1 I1IKn-1 (relL), h;1 :I1/Kn-1  121/("-1 (reI L), then for the composite h21 = h1;1 ohoj1, we have d"(fo' h11 ,II) + d n (f1' h21 ,f2) =dn(fo, h 02 1 ,f2)' and for the inverse homotopy h(/ :III /(n-1  10 1/("-1 of hoj1 , clearly d"(fr, h;1 ,fo)= -d"(fo, h o1 1 ,II)' Therefore on(fo,fo) forms a subgroup of Hn(K, L; 1t n (Y)) that is determined by the 30SC Obstructions general, if on(fo,f1) is nonempty, it is a coset of H"(K, L; 1t n ( Y)) factored by the subgroup on(fo,fo). Combined with the existence theo- rem on separation cochains, this can be uti- lized to show the fonowing theorem. Assume that <b"(f') is nonempty. The set of an elements iiin(f') that are extensions of an element of iii n -1 (f') is put in one-to-one correspondence with the quotient group of H"(K n , L; 1t"( Y)) modulo on(fo" ,frf) by pairing the obstruction on(f/; ,I") with each f" for a fixed 10". Among such elements of iiin(f'), the set of f" that are extensible to Kn+1 is in one- to-one correspondence with the quotient group of H"(K n +1, L; 1t n (Y)) = H"(K, L; 1t n (Y)) modulo the subgroup on (frf +1 ,fon+1), assuming that 10" is extended to 10"+1 (first classification theorem). C. Primary Obstructions Assume that Hi+1(K,L; 1t i (Y)) = Hi(K,L; 7ti(Y)) = 0, where 0 < i< p (e.g., 7ti( Y) = 0,0 < i< p). In this case, by consecutive use of the first exten. sion theorem and the first homotopy theorem, we can show that each iiii(f') (i < p) consists of a single element and Op+1 (f') also consists of a single element C P +1(f')E HP+1(K, L; 1t p (Y)). The element c p + 1 (f'), caned the primary obstruc- tion of f', vanishes if and only if f' can be extended to /(P+1 (second extension theorem). When Hi+1(K, L; 1t i (Y)) =0 for i> p (for exam- ple, when 1ti(Y)=O for p<i<dim(K -L)), f' is extendable to K if and only if the first ob- struction of f' vanishes (third extension theorem). Correspondingly, if Hi(K, L; 1t i ( Y)) = H i - 1 (K, L; 1ti(Y)) = 0 (0< i<p), then for any two mappings 10, II : K --> Y, 10 I L = III L, OP(fOJ1) consists of a single element ([P(Io,f1) E HP(K, L; 1t p ( Y)), which we can the primary difference of 10 and II' This element vanishes if and only if 10 IKPI11 KP (relL) (second homotopy theorem). Moreover, when Hi(K,L; 7t i (Y))=O (i > p), the primary difference is zero if and only if 10  II (reI L) (third homotopy theorem). Assume that the hypotheses of the second extension theorem and second homotopy theorem are satisfied. If we assign to each element I P of iiip(f') the primary difference of I P and the fixed element It, then iiip(f') is in one-to-one correspondence with HP(KP, L; 1t p (Y)) by the first classification theorem (sec- ond classification theorem). Similarly, assume that the hypotheses of the third extension theorem and third homotopy theorem are satisfied. If Jo: K --> Y, J' = Jo I L, then homotopy classes relative to L of extensions I of f' are 
305 D Obstructions elements of HP(K, L; 1t p (Y)) by pairing dP(.f,fo) with f (third classification theorem). D. Secondary Obstructions For simplicity, assume that 1ti(Y)=O (i<p and p < i < q). If the primary obstruction C P +1(f')E HP+J(K, L; 1t p (Y)) of f': L-. Y vanishes, we can define oq+J(.f')c Hq+'(K,L; 1t q (Y)), which we call the secondary obstruction of f'. When Y = SP, q = p + 1, p > 2, the secondary obstruction OP+2(f') coincides with a coset of HP+2(K, L; Z2) modulo the subgroup Sq2(HP(K, L; Z)), where Sq2 denotes the tSteenrod square operation [5]. In this case, if L = KP, then OP+2( f') reduces to a cohomology class, Sq2(i*flf'*(rJ) with i:L->K, where rJ is a gen- erator of HP(SP, Z) (in this case (i*)-If'*(rJ) # o is equivalent to c p + 1 (f')=0) [5]. Moreover, if Sq2f'*(rJ) =0, then there exists a suitable extension fP+2: KP+2-. Y = SP off'. The set of obstruction cocycles of all such fP+2 defines the tertiary obstruction OP+3(f'), which coin- cides with a coset of HP+3(K,L;Z2) modulo the subgroup Sq2(Hp+J (K, L; Z2))' By using the tsecondary cohomology operation <I> of J. Adem, it can be expressed as <I>((i*r 1 f'*(rJ)) ( 64 Cohomology Operations). All the propositions in this article remain true if we take tCw complexes instead of polyhedra K. References [IJ S. Eilenberg, Cohomology and continuous mappings, Ann. Math.. (2) 41 (1940),231-251. [2J P. alum, Obstructions to extensions and homotopies, Ann. Math., (2) 52 (1950), 1-50. [3J P. alum, On mappings into spaces in which certain homotopy groups vanish, Ann. Math., (2) 57 (1953), 561-574. [4J N. E. Steenrod, The topology of fibre bundles, Princeton Univ. Press, 1951. [5J N. E. Steenrod, Products of cocycles and extensions of mappings, Ann. Math., (2) 48 (1947),290-320. [6J E. H. Spanier, Algebraic topology, McGraw-Hill,1966. 306 (X11.20) Operational Calculus A. General Remarks The term "operational calculus" in the usual sense means a method for solving tlinear dif- 1152 ferential equations by reducing the operations of differentiation and integration into alge- braic ones in a symbolic manner. The idea was initiated by P. S. Laplace in his Theorie analytique des probabilites (1812), but the method has acquired popularity sil1ce O. Heaviside used it systematically in the late 19th century to solve electric-circuit problems. The method is therefore also callec Heaviside calculus, but Heaviside gave only a formal method of calculus without bothering with rigorous arguments. The mathematical foun- dations were given in later years, first in terms of tLaplace transforms, then by ap::>lying the theory of tdistributions. One of the motiva- tions behind L. Schwartz's creation of this latter theory in the 1940s was to gi ve a sound foundation for the formal method, but the theory obtained has had a much larger range of applications. Schwartz's theory was based on the newly developed theory oft topological linear spaces. On the other hand, 1. Mikusin- ski gave another foundation, based only on elementary algebraic notions and on Titch- marsh's theorem, whose proof has recently been much simplified. In this article, we first explain the simple theory established by Mikusinski [2J and later discuss its relation to the classical Laplace transform method. B. The Operational Calculus of Mikusinski The set '{; of all continuous complex-valued functions a = {a(t)} defined on t  0 is a tlinear space with the usual addition and scalar multi- plication. '{; is a tcommutative algebra with multiplication a' h defined by the tconvolution {Ja(t-s)b(s)ds}. The ring ((j has no t zero divisors (Titchmarsh's theorem). (There have been several interesting proofs of Titchmarsh's theorem since the first demonstration given by Titchmarsh himself [3]. For example, a simple proof has been published by C. Ryll- Nardzewski (1952).) Hence we can construct the tquotient field :l2 of the ring '{;'. An element of:l2 is called a Mikusinski operator, or simply an operator. If we define a(t)=O for t<O for the elements {a(t)} in ((j, then '{; is a subalgebra of °11, which is the set of all locally integrable (locally L 1 ) functions in (-00,00) ....hose tsup- port is bounded below. Here we ide'ntify two functions that coincide almost everywhere. The algebra J1I has no zero divisor, and its quotient field is also fl. The unity element for multiplication in 12, denoted by (5 = bib (b # 0), plays the role of the tDirac (i-function. It is sometimes called the impulse function. The opera tor 1 = { I} E ((j is the 
1153 function that takes the values 0 and 1 accord- ing as t < 0 or t > O. This operator is Heavi- side's function and is sometimes called the unit function. Usually it is denoted by I(t) or simply I. The value 1(0) may be arbitrary, but usually it is defined as 1/2, the mean of the limit values from both sides. The operator 1 is an integral operator, because, as an operator carrying a into I. a, it yields I. a = {I a(S)dS} = the integral of a over [0, t]. More generally, the operator {tA-1/r(A)} (ReA. > 0) gives the Ath-order integral. The operator S = b/I, which is the inverse operator of I, is a differential operator. If aEC(? is of tclass C 1 , then we have s. a =a' +a( + O)b =a' + {a( + O)}/{ I}. (1) Similarly, if aEC(? is of class cn, we have sn. a=a(n) + d n -l)(O)b + d n - 2 )(0)s + ... + a(O)sn-I. The operator a->s' a can be applied to func- tions a that are not differentiable in the ordi- nary sense, and considering the application of s to be the operation of differentiation, we can treat the differential operator algebraically in the field fl.. In particular, we have s . 1 = b, and this relation is frequently represented by the formula dl(t)/dt = b(t). A rational function of s whose numerator is of lower degree than its denominator is an telementary function of t. For example, we have the relations 1/(s - IX)n = {tn-I e"'/(n -I)!}, 1/(s2 + {32) = {{3-1 sin {3t}, S/(S2 + {32) = {cos {3t}. The solution of an ordinary linear differ- ential equation with constant coefficients =oIXrcp(r)(t)= f(t) under the tinitial condition cp(i)(O)=Yi (0::::; i::::; n-l) is thus reduced to an equation in s by using formulas (1) and (2), and is computed by decomposing the following operator into partial fractions: { } {3n_1sn-1 +... +{30+ f cp(t) = n ' IXnS +... +IX 1 S+IX o where {3r= IXr+1 Yo + IX r +2YI +... + IXnYn-r-j, O::::;r::::;n-l. The general solution is repre- sented by (5) if we consider the constants Yo, '" , Yn-1 or {30, ... ,{3n-1 as arbitrary parameters. If the rational function in the right-hand side of (5) is M (s)/D(s) and the degree of the numer- ator is less than that of the denominator, then the right-hand side of (5) is explicitly repre- 306C Operational Calculus sented by 1-1 t l - r - 1 [ dr ( M(A) )] L - e Aot r=0(I-r-1)!r! dA r D(A)/(A-A o )' A=AO  M(AJ At + L.,-e', i=1 D(AJ where we assume that .1. 0 , .1.10"" Am exhaust the roots of the equation D(A) = 0, .1.0 is a multiple root of degree I, and all other roots are simple (m=n-I). The above formula is called the expansion theorem. C. Limits of Operators (2) A sequence an of operators is said to converge to the limit a = b/q if there exists an operator q( # 0) such that q' an EC(? and the sequence offunctions q' an converges tuniformly to b on compact sets. The limit a is determined uniquely without depending on the operator q. Based on this notion of limits of operators, we can construct the theory of series of operators and differential and integral calculus of func- tions of an independent variable A whose values are operators. They are completely parallel to the usual theories of elementary calculus ( 106 Differential Calculus; 216 Integral Calculus; 379 Series). A linear partial differential equation in the function cp(x, t) of two variables, in particular its initial value problem, reduces to a linear ordinary differen- tial equation of an operational-valued func- tion of an independent variable x. For a given operator w, the solution (if it exists) of the differential equation cp'(A) = W. cp(A) with the initial condition cp(O) = b is unique, is called the exponential function of an operator w, and is denoted by cp(A) = e AW . If the power series (3) (4) 00 I Anwn/n! n=O (6) converges, the limit is identical to the exponen- tial function e AW . However, there are several cases in which e AW exists even when the series (6) of operators does not converge. For example, for w= -j;, we have e-Aj; = {(A./2j;rt3)exp( _.1. 2 /4t)}, (7) (5) and for w= -s, we have e-AS=hA=s' HA(t), (8) where the function HA(t) takes the values 0 and 1 according as t<A or t>A- HA(t) belongs to the ring o/i and is called the jump function at A- For f(t)Eo/i, we have h 1 . {f(t)} = {f(t - A)}, and hence we call (8) the translation operator 
306D Operational Calculus (or shift operator). For w = -s, the series (6) does not converge, but if we apply the formal relation e-AS=,';"=o( -).s)"/n! to f(t), we have a formal Taylor expansion 00 f(t-A)= L pn)(t)(-).)"/n!. n==O The solution of linear tdifference equations are represented by rational functions of h A . The power series  (Xnhn of operators always converges. This fact gives an explicit example of a representation by formal power series. Note that the operators e- AS and e- AJS play an essential role in the solution of the twa ve equation ;]2 cp (X2;p cp - ox 2 ot 2 and the theat equation 02cp (X20cp - -- ox 2 ot The operator (7) converges to c5 for A->O, and this gives a tregularization of the Dirac c5- function. D. Laplace Transform For every function {f(t)} E<{?, the limit lim f p e-ASf(A)dA= f oo e-ASf(A)dA (9) {J+oo 0 0 always exists (in the sense of the limit of oper- ators), and as an operator coincides with the original function {f(t)}. Therefore, if the usual Laplace transform ( 240 Laplace Trans- form) of the function f().) exists and (9) is a function g(s), then as a function of the differen- tial operator s, g(s) is the operator that is given by the inverse Laplace transform f(t) of g(s). Formulas (4) and (7) are indeed typical examples of this relation, where the left-hand side is the usual Laplace transform of the right-hand side. In the practical computation of (5), we can compute the Laplace inverse transform of the right-hand side. However, if we took the Laplace transform as the founda- tion of the theory, it would not only be com- plicated but also be subject to the artificial restriction caused by the convergence condi- tion on Laplace transforms. In the theory of operational calculus, the transform g(p)=p Iro e-P'f(t)dt is sometimes used instead of the Laplace trans- form itself. But the difference is not essen- tial; we obtain the latter transform merely by 1154 changing the variable from s to p and multi- plying the former transform by p. E. Relation to Distributions For f E<{?, an operator of the form h A . Sk. f is identified with a distribution of L. Schwartz with support bounded from below We can identify with a Schwartz distribution the limit of a sequence fn (or a suitable equivalence class of sequences) of operators of the form h A . Sk. f such that fn' fn+!, .., are identical in the inter- val (-n, n). The notions of Schwar1z distri- butions and of Mikusinski operators do not include each other, but both are generaliza- tions of the notions of functions an d their derivatives. For formulas and examples  Appendix A, Table 12. References [1J O. Heaviside, On operators in physical mathematics I, II, Proc. Roy. Soc. London, ser. A, 52 (1893),504-529; 54 (1893),105-442. [2J J. Mikusinski, Operational ca1culus, Per- gamon, 1959. (Original in Polish, 1953.) [3J E. C. Titchmarsh, The zeros of certain integral functions, Proc. London Math. Soc., 25 (1926), 283-302. [4J K. Y os ida, Operational calculus: A theory of hyperfunctions, Springer, 1984. 307 (XIX.13) Operations Research A. General Remarks (10) Operations research in the most general sense can be characterized as the application of scientific methods, techniques, and tools to the operations of systems so as to pro\ide those in control with optimum solutions to problems. This definition is due to Churchman, Ackoff, and Arnoff [1]. Operational research began in a military context in the United Kingdom during World War II, and it was quickly taken up under the name operations research (OR) in the United States. After the war it evolved in connection with industrial organization, and its many techniques have found expand- ing areas of application in the Uni1ed States, the United Kingdom, and in other industrial countries. Nowadays OR is used ""idely in industry for solving practical problems, such as planning, scheduling, inventory, transpor- tation, and marketing. It also has various im- portant applications in the fields 01' agricul- 
1155 ture, commerce, economics, education, public service, etc., and some techniques developed in OR have influenced other fields of science and technology. B. Applications Applications of OR to practical problems are often carried out by project teams because knowledge of disparate aspects of the prob- lems are required, and interdisciplinary co- operation is indispensable. The following are the major phases of an OR project: (i) formu- lating the problem, (ii) constructing a mathe- matical model to represent the system under study, (iii) deriving a solution from the model, (iv) testing the model and the solution derived from it, (v) establishing controls over the solu- tion and putting it to work (implementation). When the mathematical model that has been constructed in phase (ii) is complicated and/or the amount of data to be handled is large, a digital tcomputer is often utilized in phases (Hi) and (iv). C. Mathematical Models [2] Typical mathematical models and tools that appear frequently in OR are: (1) Optimization model ( 264 Mathematical Programming). This model is characterized by one or more real-valued functions, which are called objective functions, to be minimized (or maximized) under some constraints. According to the number of objective functions, the types of objective functions, and the types of COn- straints, this model is classified roughly as follows: (i) Single-objective model, which in- cludes linear, quadratic, convex, nonlinear and integer programming models ( 215 Integer Programming, 255 Linear Programming, 292 Nonlinear Programming, 349 Quadratic Pro- gramming); (ii) multi-objective model; (iii) sto- chastic programming model ( 408 Stochastic Programming); (iv) dynamic programming model ( 127 Dynamic Programming); (v) network flow model ( 281 Network Flow Problems). (2) Game-theoretic model ( 173 Game Theory). Game theory is a powerful tool for deriving a solution to practical problems in which more than one person is involved, with each player having different objectives. (3) Inventory model ( 227 Inventory Con- trol). It is necessary for most firms to control stocks of resources, products, etc., in order to carry out their activities smoothly; various inventory models have been developed for such problems. Mathematically, optimization techniques ( 264 Mathematical Program- 307C Operations Research ming), Markovian decision processes ( 127 Dynamic Programming, 261 Markov Pro- cesses), and basic tprobability theory are used to construct models for these problems. (4) Queuing model ( 260 Markov Chains H). In a telephone system, calls made when all the lines of the system are busy are lost. The problem of computing the probability of loss involved was first solved in the pioneering article on tqueuing theory by A. K. Erlang in 1917. For systems in which calls can be put on hold when all lines are busy, one deals with the twaiting time distribution instead of the proba- bility of loss. In the 1930s, F. Pollaczeck and A. Ya. Khinchin gave explicit formulas for the characteristic function of the waiting time distribution. In many situations, such as in telephone systems, air and surface traffic, production lines, and computer systems, vari- ous congestion phenomena are often observed, and many kinds of queuing models are utilized to analyze the congestion. Mathematically, almost all such models are formulated by using Markov processes. For practical uses, approximation and computational methods are important as well as theoretical results. (5) Scheduling model ( 376 Scheduling and Production Planning). Network scheduling is used to schedule complicated projects (for example, construction of buildings) that consist of a large number of jobs related to each other in some natural order. PERT (pro- gram evaluation and review technique) and CPM (critical path method) are popular com- putational methods for this model ( 281 Network Flow Problems). Job shop scheduling is used when we have m jobs and n machines and each job requires some of the machines in a given order. The model allows us to find an optimal order (in some certain sense) of jobs to be implemented On each machine. (6) Replacement model. There are two typical cases. One is the preventive maintenance model, which is suitable when replacements are done under a routine policy because a replacement or a repair before a failure is more effective than after a failure. Probabilistic treatments are mainly used, and this model resembles those for queues and tMarkov processes. The other is a model for deciding whether to re- place a piece of equipment in use. In this case, we need to compare costs of both used and new equipment, and evaluations of various types of present cost are important. (7) tSimulation. This is a numerical experi- ment in a simulated model of a phenomenon which we want to analyze. Simulation is one of the most popular techniques in OR. (8) Other models. Besides the models listed above, many problems are formulated by way of various other models in OR. In modeling, 
307 Ref. Operations Research tprobability theory, and mathematical tstatis- tics, especially, tMarkov chain, tmultivariate analysis, tdesign of experiments, tregression analysis, ttime series analysis, etc. often play important roles. References [IJ C. W. Churchman, R. L. Ackoff, and E. L. Arnoff, Introduction to operations research, Wiley, 1957. [2J H. M. Wagner, Principles of operations research, Prentice-Hall, 1975. 308 (XII.19) Operator Algebras A. Preliminaries Let  be a tHilbert space. The set of tbounded linear operators on  is denoted by J(J() = .. It contains the identity operator 1. The notions of operator sum A + B, operator product AB, and tadJoint A* are defined on it. A subalge- bra of () is called an operator algebra. In this article we consider mainly von Neumann algebras. For C*-algebras  36 Banach Alge- bras G- K. Any tHermitian operator A (i.e., an operator such that A = A*) has the property that (Ax, x) is always real for any XE. If(Ax,x)O for any x, A is called positive, and we write A  O. When Hermitian operators A and B satisfy A -BO, we write A B. Thus we introduce an ordering A B between Hermitian operators. A set {A,} of positive Hermitian operators is said to be an increasing directed set if any two of them A" Ap always have a common major- ant Ay, that is, A,';:; Ay and A p ';:;Ay- If a Her- mitian operator A satisfies (Ax, x) = sup(A,x, x) for such a set, it is called the supremum and is denoted by sup A,. The supremum supA, exists if and only if the sup(A,x, x) is finite for any x, and then A, converges to A with respect to the weak and strong operator topologies. B. Topologies in ;% Various topologies are introduced in 2fJ = .cf6'(): the tuniform operator topology, the tstrong operator topology, and the tweak operator topology ( 251 Linear Operators). These topologies are listed above in order of decreasing fineness. The operation in 28 of taking the adjoint, A ->A *, is continuous with respect to the uniform operator topology and 1156 weak operator topology, but not with respect to the strong operator topology. The opera- tion from 3D x 2fJ to &8 of taking the product, (A, B)-> AB, is continuous with respect to the uniform operator topology, is continuous with respect to the strong operator topology when the first factor is restricted to a set bounded in the operator norm, but is not contmuous on .dd x &8. It is continuous with respect to the weak operator topology when one of the fac- tors is fixed (i.e., it is separately continuous). The set !JJ is a tBanach space with respect to the operator norm, or, more precisely, a tC*- algebra. It is a tlocall y convex topological linear space with respect to the strong or weak operator topology. The Banach space &8 is the tdual of the Banach space!YJ* of all tnuclear operators in  ( 68 Compact and Nuclear Operators I). The weak* topology in :Yi as the dual of ,OJJ* is called the IT-weak topology. C. Yon Neumann Algebras A subset .4£ of!lJ is called a *-subalgebra if it is a subalgebra (i.e., A, B EJfl implis )A + fl,B, AB E .41) and contains the adjoint A * of any AE.4Y. The com mutant .#' of a subset.# of:Yi is the set of operators that commure with both A and A* for AEd. The commutant is a *- subalgebra, and .#'=d"'. A von Neumann algebra .lit is a ",-subalgebra of:JB that is defined by one of the follow- ing four equivalent conditions: (i) .J1 is a *- subalgebra of:JB containing 1, closed under the weak operator topology; (ii) .lit is a *- sub algebra of;Jj) containing 1, c1os(.d under the strong operator topology; (iii) . U is the commutant of a subset of:JB (or, equivalently, .4{ = .t"); (iv) . is a *-subalgebra of ,:J} con- taining 1, closed under the uniform operator topology, and, as a Banach space, coinciding with the conjugate space of some Banach space. Note that a *-subalgebra of!JlJ, closed under the uniform operator topology, is a C*-algebra. Von Neumann algebms are also called rings of operators or W*-algebras. The latter term is usually used for a C* .algebra *-isomorphic to a von Neumann algebra in contrast to a concrete von Neumann algebra. The study of these algebras was "tarted by 1. von Neumann in 1929. He showed the equiva- lence of conditions (i)-(iii) (von Neumann's density theorem), and established a foundation for the theory named after him [IJ The notion of von Neumann algebras can be rlgarded as a natural extension of the notion of matrix algebras in a finite-dimensional space, and therein lies the importance of the theory. The 
1157 fourth condition of the definition was given by S. Sakai (Pacific J. Math., 1956). The fonowing theorem is of use in the the- ory of von Neumann algebras: Given a *- subalgebrad of fJI containing I, its closure .A with respect to the weak or strong operator topology is von Neumann algebra; and when we denote the set of elements of operator norm ::::; 1 in .'71 (resp. .A) by .'71 1 (resp. .AI), .AI is likewise the closure of .'71 1 with respect to the weak or strong operator topology (Kaplan- sky's density theorem (Ann. Math., 1951). If E is a projection operator in a von Neu- mann algebra.A, then E.AE={EAEIAE.A} is a *-subalgebra of fJI closed with respect to the weak operator topology. It is not a von Neumann algebra because it does not contain I, but since its elements operate exclusively in the closed subspace E55, we can regard it as an algebra of operators on E$). In this sense, E.AE can be regarded as a von Neumann algebra on E55, which we can the reduced von Neumann algebra of.A on E$) and write .A E . If E is a projection operator in .A', E.AE= E.A restricted to the subspace E55 is caned the induced yon Neumann algebra of.A on E55 and is denoted also by .A E . In the latter case, the mapping AE.A--+EAE.A E is a *- homomorphism and is caned the induction of .A onto .A E . The tensor product 551 @ 552 of two Hilbert spaces 55i (i = 1, 2) is the tcompletion of their ttensor product as a complex linear space equipped with the unique tinner product satis- fying (JI @/2,gl @ g2)=(JI,gIM/2,g2h for an J;, giE55i' For any A i EfJI(55i)' there exists a unique operator in fJI(551 @ 552) denoted by Al @ A 2 satisfying (AI @ A 2 )(f1 @ 12)= Adl @ A 2 /2 for an J;E55 i . For von Neumann algebras .Ai C fJI(55i)' the von Neumann alge- bra generated by Al @ A 2 with AiE.A i is de- noted by .AI @ .A 2 and is caned the tensor product of .AI and .A 2 . The *-isomorphism AE .A --+ A @ I E.A @ I, where I is the trivial von Neumann algebra consisting solely of complex multiples of the identity operator, is caned an amplification. For two von Neumann algebras .A i cfJI(55J (i = 1, 2), a *-isomorphism n from .AI into .A 2 is caned spatial if there exists a unitary (i.e., a bijective isometric linear) mapping U from 55i to 552 such that UAU*=nA for an AE.A[, and a *-homomorphism n is caned normal if sup.nA. = n(sup.A.) whenever A. is a bounded increasing net in .A. Any normal *-homomor- phism is continuous in the strong and weak operator topologies and is a product of an amplification, a spatial *-isomorphism, and an induction. Its kernel is of the form E.A, where E is a projection operator belonging to the center :!.t =.A n.A' of .A. This gives a com- 308D Operator Algebras plete description of an possible normal repre- sentations (i.e., a normal *-homomorphism into some fJI) of a von Neumann algebra. D. States, Weights, and Traces A state cp of a C*-algebra .'71 is a complex- valued function on .'71 that is (1) complex linear: cp(A + B) = cp(A) + cp(B), cp(cA) = ccp(A) for A, BEd, c E C, (2) positive: cp(A * A)  0 for AEd, and (3) normalized: Ilcpll = 1 (equiva- lent to cp(l)= 1 if I Ed). For any positive linear functional cp on .'71, there exists a triplet (55q>' nq>' q» (unique up to the unitary equiva- lence) of a Hilbert space 55q>' a representa- tion nq> (i.e., a *-homomorphism into fJI(55q») of .'71, and a vector q> in 55q> such that cp(A)= (nq>(A)q>, q» and $)q> is the closure of nq>(d)q>. The space 55q> is constructed by defining the inner product (I] (A), I](B)) = cp(B* A) in the quotient ofd by its left ideal {A I cp(A * A) = O}, where I] is the quotient mapping, and by completion. Then nq> is defined by nq>(A)I](B) = I](AB). This is caned the GNS construction after its originators I. M. Gel'fand, M. A. Naimark, and I. E. Siegel. A weight cp on a von Neumann algebra .A is a function defined on the positive elements of .A, with positive real or infinite values, which is additive and homogeneous (cp(A + B) = cp(A) + cp(B) and cp(AA) = Acp(A) for an A, BE .A and AO with the convention O' 00 =0 and 00 + a = 00). It is said to be faithful if it does not vanish except for cp(O) =0, normal if cp(A) = sup cp(A.) whenever A. is an increasing net of positive elements of.A and A = sup A., and semifinite if the left ideal 91q>, consisting of an elements A E.A for which cp( A * A) is finite, has the property that the linear span \JJlq> of 91:91q> is dense in .A. The restriction of cp to positive elements of \JJlq> has a unique extension to a linear functional on \JJlq>' which we denote by the same letter cpo Canonicany associated with a normal semifinite weight cp, there exists a Hilbert space 55q>, a normal *-representation nq> of .A, and a complex linear mapping I] from 91.. into a dense subset of 55.. such that (I](B'), I](B))=cp(B* B), and nq>(A)I](B)=I](AB), where B, B' E 91q> and A E.A. If cp is finite (i.e., 91q> = .A), then its extension to .A is a positive linear functional for which the triplet (55q>' nq>, 1](1)) is given by the GNS construction. The linear span of an normal states of a von Neumann algebra .A is a norm-closed sub- space of its dual .A*, caned its predual and denoted by .A*, because .A turns out to be the dual of .A*. A trace t on a von Neumann algebra.A is a weight satisfying t(UAU*)= t(A) for U unitary 
308E Operator Algebras in oil and for all positive A in ..$I (equivalently, t(A*A)=t(AA*) for all AE..$I). E. The von Neumann Classification A van Neumann algebra for which a semi- finite normal trace does not exist is called a purely infinite von Neumann algebra or von Neumann algebra of type III. In contrast to this, a von Neumann algebra ..$I is called semifinite (resp. finite) if for each positive Her- mitian operator A (#0) in ./It there is a semi- finite (resp. finite) normal trace t such that t(A)#O. Every Abelian von Neumann algebra is finite. If there are no central projection oper- ators E # 0 such that ..$IE is finite, ..$I is called properly infinite. Purely infinite ..$I and ga(f) for infinite f), for example, are properly in- finite. A nonzero projection operator E in ..$I is called Abelian when ..$IE is Abelian. We call ..$I a von Neumann algebra of type I (or discrete) when it contains an Abelian projection E for which I is the only central projection P cover- ing E (i.e., E  P). A von Neumann algebra is of type II if it is semifinite and contains no Abelian projection. A von Neumann algebra of type J[ is called of type II 1 if it is finite and of type 1100 if it is properly infinite. A finite von Neumann algebra is also characterized as a von Neumann algebra in which the operation of taking the adjoint is continuous with re- spect to the strong operator topology on bounded spheres (Sakai, Proc. Japan Acad., 1957). A properly infinite von Neumann alge- bra ..$I is characterized by the property that ,/It and oil @ ga(f) for any separable f) is *- isomorphic. Given a von Neumann algebra ..$I, there exist mutually orthogonal projections E I , En, EIII in the center :!Z of ./It such that EI + En + EIII = I, and ..$IE" ..$lEn' ..$IE", are von Neumann algebras of type I, type II, type III, respec- tively. This decomposition is unique. There also exist unique central projections EJ and E. such that ./lt E is fmite, ..$IE is properly in- I fl. . finite, and EJ+Ei= 1. The two decompositIOns can be combined. (If some of the projections E are 0, the condition on the corresponding..$lE is to be waived.) F. Factors A von Neumann algebra whose center consists exclusively of scalar multiples of the identity operator is called a factor. Von Neumann's reduction theory ( Section G) reduces the study of arbitrary von Neumann algebras on a separable Hilbert space more or less to the study of factors. Factors are classified into types In (n = 00, 1,2, ... ), II 1 (i.e., of type II and 1158 finite), 1100 (i.e., of type II and not finite), and III. A factor of type In is isomorphic to the algebra ga(f) of an bounded operators on an n-dimensional Hilbert space f). Since the dis- covery of two nonisomorphic examples of factors of type II 1 by F. J. M Ufray and von Neumann (1943), classification offactors has been a central problem in the theory of von Neumann algebras. After 1967, great progress was made in the investigation of isomorphism classes offactors, and we have uncountably many nonisomorphic examples of factors of types IIr, 11 00 , and III. After the discovery of the third to ninth nonisomorphic examples of factors of type II 1 by J. Schwartz (1963) (the third example, 1963), W. Ching (the fourth), Sakai (the fifth), 1. Dixmier and E. C. Lance (the sixth and seventh), and G. Zelb-Meier (the eighth and ninth), D. McDuff showed that there exist countably many nonisornorphic examples of factors of type II l' and finally McDuff(Ann. Math., 1969) and Sakai (J. Func- tional Anal., 1970) showed the existence of uncountably many nonisomorphic examples of factors of type II l' For type III factors  Section I. G. The Integral Direct Sum and Decomposition into Factors The Hilbert spaces considered in this section are all tseparable. Let (9Jl, C, fl) be a tmeasure space; with each (E9Jl we associate a Hilbert space NO. We consider functions x(O on 9Jl whose values are in f)(O for each (. Let K be a set of these functions having the following properties: (i) Ilx(OII is measurable for x(OEK; (ii) if for a function y(O, the numerical func- tion (x((),y(m is measurable any x(OEK, then Y(()EK; (iii) there is a countable family {X1(O,X Z (O,...} of functions in K such that for each fixed (E9Jl, the set {xd(),xz(O,...} is dense in f)(O. Then K is a linear spa.ce. We call each function in K a measurable vector func- tion. We introduce in the set of measurable vector functions x(() with f IIx(OIIZ dfl(O< 00 an equivalence relation by defining x(O and y(O as equivalent when f Ilx(O- y(OIIZdfl(O=O. Thus we obtain a space of equivalence classes which we denote by f). f) is a Hilbert space with the inner product (x,y)= f(x(O,y(mdfl(O, 
1159 which is called the integral direct sum (or direct integral) of i)(O. An operator function A(O that associates with each (EIDl a bounded linear operator A(O on i)«() is called measur- able if for any measurable x(O, A(Ox(O is also measurable. If, moreover, IIA(m is bounded, A(() transforms a function in i) to a function in i) and thus defines a bounded linear oper- ator on i). This operator is called the integral direct sum (or direct integral) of A(O, and an operator on i) that can be reduced to this form is called decomposable. Generally, let i) be a Hilbert space, and consider an Abelian von Neumann alge- bra d on i). Then we construct a measure space (IDl, Iff, /1) and represent d as the set of bounded measurable functions on IDl. (This is possible in different ways. The Gel'fand repre- sentation is an example.) Then a Hilbert space i)(() can be constructed so that i) is repre- sented as the integral direct sum of i)((). Oper- ators in d are all decomposable and are called diagonalizable. A von Neumann algebra .A on i) whose elements are all decomposable is characterized by .A cd'. The A(O yielded by the decomposition of operators A in .A generate a von Neumann algebra .A(O on i)((). If we take as d the center :?E of .A, then almost all the .A(O are factors (von Neumann's reduction theory), and if we take as d a maxi- mal Abelian von Neumann algebra contained in .A', then almost all the.A(O are type I factors (F. I. Mautner, Ann. Math., 1950). H. Tomita-Takesaki Theory Motivated by the problem of proving the commutant theorem for tensor products (i.e., (.AI @ .A 2 )' =.A; @ .A), which remained unsolved for algebras of type III up until that time, Tomita succeeded in 1967, after years of effort, in generalizing the theory of Hilbert algebras, previously developed'only for semi- finite von Neumann algebras. The most im- portant ingredient of this theory lies in certain one-parameter groups of *-automorphisms of a von Neumann algebra, called modular auto- morphisms (see below), each one-parameter group of modular automorphisms being in- trinsically associated with a faithful semifinite normal weight of the algebra. Tomita's theory was perfected by Takesaki [13], who also showed that modular automorphisms satisfy (and are characterized by) a condition origi- nally introduced in statistical mechanics by the physicists R. Kubo, P. C. Martin, and J. Schwinger and accordingly known as the KMS condition. In the mathematical founda- tions of statistical mechanics, this condition characterizes equilibrium states of a physical 308 I Operator Algebras system for a given one-parameter group of automorphisms (of a C*-algebra) describing the time-development of the system. It was a fortunate coincidence that this condition was formulated in a so-called C*-algebra approach to statistical mechanics by R. Haag, N. M. Hugenholtz, and M. Winnink [14] at about the same time that Tomita's work appeared in 1967. The original proofs of the Tomita- Takesaki theory have been simplified con- siderably by the work of M. Rieffel [16] and A. Van Daele [15]. Deeper insight into the significance of modular automorphisms is also provided by the work of A. Connes [19], showing that the group of modular automor- phisms (up to inner automorphisms) is intrin- sic to the von Neumann algebra (i.e., inde- pendent of the weight) and belongs to the center of Out .A (the group Aut.A of all *- automorphisms of the von Neumann algebra .A modulo the subgroup Int.A of all inner *-automorphisms ). Some of the basic definitions and results of the Tomita-Takesaki theory are as follows. If cp is a normal semifinite faithful weight on .A, the antilinear operator S"" defined on a dense subset '1(91", n 91;) of i)", ( Section D) by the relation S",'1(A) = '1(A *), is tclosable and the polar decomposition S", = J ",A.Y2 of its closure defines a positive self-adjoint operator A."" called a modular operator, and an anti unitary involution J",. The principal results of the Tomita-Takesaki theory are (1) if xE.A, then ol(A) == A.AA.;it E.A for all real t, and this defines a continuous one-parameter group of *-automorphisms (It'" of .A, called modular automorphisms, and (2) if A E.A, thenj",(A)== J",AJ",E.A', andj", is a conjugate-linear iso- morphism of.A onto .A'. A weight cp on .A is said to satisfy the KMS condition at {3 (a real number) relative to a one-parameter group of *-automorphisms (It of.A if, for every pair A, BE91",n91;, there exists a bounded con- tinuous function F(z) (depending on A, B), on 0::::; 1m z::::; {3 holomorphic in 0 < 1m z < {3 and such that F(t) = cp(Aut(B)) and F(t + i{3) = cp(ut(B)A). A given one-parameter group (It coincides with a group of modular automor- phisms u t '" if and only if cp satisfies the KMS condition at {3 = -1 relative to Ut. (In statistical mechanics, {3 = (kT) -1, where k is the Boltz- mann constant and T the absolute tempera- ture of the system.) I. Structure and Classification of Factors of Type III At the Baton Rouge Conference in 1967, R. T. Powers reported his results [17] on non- isomorphism of the one-parameter family 
308J Operator Algebras of factors of type III (now called Powers's factors), which had been constructed by von Neumann in 1938 in terms of an infinite ten- sor product of factors of type I (abbreviated as ITPF]). Prior to Powers's work only three different kinds of factors of type III, along with the same number of factors of type II" had been distinguished. A systematic classifica- tion of ITPFIs was subsequently given by H. Araki and E. J. Woods [18J in terms of two invariants, i.e., the asymptotic ratio set roo (0/11) and the p-set p(Jlf) of the von Neumann algebra jt. Using the Tomita-Takesaki the- ory, Connes [19J introduced two invariants, namely the S-set S(-",I) (the intersection of the spectra of all modular operators) and the T-set T(o/H) (the set of all real t for which the modu- lar automorphism CJ,'" is inner), and, when ,//1 is an ITPFI, proved the equality S(,A"I)=r'X)(.) and the relation T(A'I)=2nllogp(A't)I-I. The S-set S(,41) of a factor of type III on a separa- ble Hilbert space is either the set of all non- negative reals (type 111 1 ), the set of all integral powers of}, (where 0<),< 1) and 0 (type III}), or the set {O, I} (type II10). The work of Araki and Woods shows that there exists only one ITPFI of type III}. for each A.E(O, 1) (the exam- ples of Powers) as well as for A. = 1, while there exist continuously many ITPFIs of type IIIo. Woods [20J has shown that the classification ofITPFIs of type IIIo is not smooth. A structural analysis of factors of type III, given independently by Connes [19], Takesaki (Acta Math., 1973), and Araki (Publ. Res. Inst. Math. Sci., 1973) expressed independently a certain class of factors of type III as a kind of crossed product of semifinite von Neumann algebras with their injective endomorphisms (automorphism in the case of Connes). These analyses led Takesaki [21 J to the discovery of a duality theorem for crossed products of von Neumann algcbras with locally compact groups of theit *-automorphisms ( Section J) and its application to the following structure theorem for von Neumann algebras of type III. The crossed product of a von Neumann alge- bra. with the group of modular automor- phisms CJt is a von Neumann algebra ./V of type II"" with a canonical action 0, of the dual group as a one-parameter group of *- automorphisms which is trace-scaling, i.e., ToO,=e-'T for some faithful normal trace T. If j{ is properly infinite, the crossed product of Ai with 0, is isomorphic to the original von Neumann algebra .41. In particular, any von Neumann algebra j{ of type III can be writ- ten as the crossed product of a von Neumann algebra 0/0/ of type II" with a one-parameter group of trace-scaling *-automorphisms 0,. The isomorphism class of j{ is determined by the isomorphism class of 0/0/ together with the 1160 conjugacy class of 0, modulo inner automor- phisms. The restriction at of 0, to the center ::E of A is of special importance. j{ is a factor if and only if a, is ergodic. In that case, j{ is of type III I if JV is a factor, j{ is of type III), O<}, < 1, if 0, is periodic with period -log A., and j{ is of type Illo if a, is aperiodic and not isomorphic to the one-parameter group of *-automorphisms of Loo(R) induced by the translations of the real line R. (The ,xcluded case does not occur for j{ of type ]] I.) A von Neumann algebra on a separable Hilbert space is called approximately finite- dimensional (or approximately finite or hyper- finite) if it is generated by an increa,ing se- quence of finite-dimensional *-subalgebras. This class of von Neumann algebras includes many important examples, such as ITPFI and the von Neumann algebra generated by any representation of canonical commu1ation (or anticommutation) relations on a separable Hilbert space. The classification of a pproxi- mately finite-dimensional factors is almost complete. In fact the uniqueness of an approxi- mately tinite-dimensional factor of type II I has been known since the work of von Neumann. It is called the hyperfinite factor. Th;: unique- ness of approximately finite-dimensional fac- tors of type IIoo (which is then the tensor prod- uct of the hyperfinite factor with .'?a(:i))) and of type II]A, 0 <}, < 1, (which are then Powers's factors) has been demonstrated by Connes [22]. Approximately finite-dimensional fac- tors of type IIIo are classified exactly by the isomorphism classes of the ergodic groups a, of *-automorphisms of commutative von Neu- mann algebras ::E. Any such factor i:; a Krie- ger's factor, i.e., a crossed product of a com- mutative von Neumann algebra with a single *-automorphism. Examples of such factors have been extensively studied by Krieger, who has also shown [23J that isomorphi:im of a Krieger's factor is equivalent to weak equiva- lence of the associated nonsingular transfor- mation of the standard measure spa,:e. A von Neumann algebra on a separa- ble Hilbert space is approximately finite- dimensional if and only if it is injective ( 36 Banach Algebras H). J. Crossed Products The crossed product j{ @.G of a vo t1 Neu- mann algebra j{ (acting on a Hilbert space ) and a locally compact Abelian group G relative to a continuous action IX of G on .41 (by *-automorphisms IX g , gE G) is the von Neumann algebra J1I' generated by 1he opera- tors n(A), AEvt1 and ).(h), hEG, defined on the Hilbert space L2(G,) of all -valm'd Lr 
1161 functions on G (relative to the Haar measure) by [1t(A)] (g) = Cf. g - 1 (A)(g), [A(h)] (g) = (h -1 g), where ELz(G,S5). The canonical action IX of the dual eon JV is defined by IXp(B) = f1(p)Bf1(p)* for BE JV and pEe, w here f1(p) is defined by [f1(p)](g) = <g, p) (g). The duality theorem of Takesaki [21] asserts that [vU @. G] @"e is isomorphic to vU @ 8l(Lz(G)), where the second factor 8l(Lz(G)) is the algebra of a11 bounded linear operators on Lz(G). K. Natural Positive Cone The closure V' of the set of vectors Li'1(A) for all positive A in 91cp n 91: reflects certain properties of the von Neumann algebra vU for 0::::; Cf.::::; 1/2 [24,25]. In particular, V 1 / 4 is ca11ed the natural positive cone. It is a self-dual closed convex cone, and is intrinsic to the von Neumann algebra vU (i.e., independent of the weight cp). Every normal positive linear functional t/J on vU has a unique representa- tive (t/J) in this cone (i.e., t/J(A) = (ncp(A)(t/J), (t/J))), and the mapping  is a concave, mono- tone, bijective homeomorphism, homo- geneous of degree 1/2. The group of a11 *- automorphisms of vU has a natural unitary representation U(g), gEAutvU, satisfying the relations U(g)AU(g)*=g(A), U(g)(cp)= (cpog-1). L. C*-Algebras and von Neumann Algebras Let a C*-algebra .'71 be given. A *-represen- tation x-+ Tx gives rise to a von Neumann algebra vU, generated by Tx, xEd. The type of this representation is defined according to the type of vU. A C*-algebra is called a C*-algebra of type I if its *-representations are always of type I. It is known that this class is exactly the class of GCR algebras ( 36 Banach Algebras H). It is also known that a separable non-type I C*-algebra has a representation of type II and a general non-type I C*-algebra has a representation of type III (J. Glimm, Ann. Math., 1961; Sakai [8]). For a C*-algebra .'71, a11 its representations generate tinjective von Neumann algebras if and only ifd is tnuclear [26,27]. M. Topological Groups and von Neumann Algebras Consider a unitary representation g-+U g of a 10ca11y compact Hausdorff group G ( 423 308 Ref. Operator Algebras Topological Groups). If this representation is a tfactor representation, the type of this repre- sentation is defined according to the type of the von Neumann algebra vU generated by U g , 9 E G. A tgroup of type I is a group whose factor- representations are a11 of type I. For example, connected semisimple Lie groups and connected nilpotent Lie groups are of type I (Harish-Chandra, Trans. Amer. Math. Soc., 1953; J. M. G. Fe11, Proc. Amer. Math. Soc., 1962). Examples of groups that are not of type I are known ( 437 Unitary Representation E). References [1] J. von Neumann, Co11ected works II, III, Pergamon, 1961. [2] W. Arveson, An invitation to C*-algebras, Springer, 1976. [3] O. Bratte1i and D. W. Robinson, Operator algebras and quantum statistical mechanics, Springer, 1979. [4] J. Dixmier, Von Neumann algebras, North-Ho11and, 1981. [5] J. Dixmier, Les C*-algebres et leurs repre- sentations, Gauthier- Vi11ars, 1964. [6] I. Kaplansky, Rings of operators, Ben- jamin, 1968. [7] G. K. Pedersen, C*-algebras and their automorphism groups, Academic Press, 1979. [8] S. Sakai, C*-algebras and W*-algebras, Springer, 1971. [9] J. T. Schwartz, W*-algebras, Gordon & Breach, 1967. [10] S. Stratila and L. Zsido, Lectures on von Neumann algebras, Editura Academiei/Abacus Press, 1975. [11] M. Takesaki, Theory of operator algebras I, Springer, 1979. [12] Y. Nakagami and M. Takesaki, Duality for crossed products of von Neumann alge- bras, Lecture notes in math. 731, Springer, 1979. [13] M. Takesaki, Tomita's theory of modular Hilbert algebras and its applications, Lecture notes in math. 128, Springer, 1970. [14] R. Haag, N. M. Hugenholtz, and M. Winnink, On the equilibrium states in quan- tum statistical mechanics, Comm. Math. Phys., 5 (1967), 215-236. [15] A. Van Dae1e, A new approach to the Tomita- Takesaki theory of generalized Hilbert algebras, J. Functional Anal., 15 (1974), 378- 393. [16] M. Rieffel and A. van Dae1e, A bounded approach to Tomita-Takesaki theory, Pacific J. Math., 69 (1977),187-221. [17] R. T. Powers, Representations of uni- formly hyperfinite algebras and their asso- 
309 A Orbit Determination ciated von Neumann rings, Ann. Math., (2) 86 (1967), 138-171. [18J H. Araki and E. 1. Woods, A classifica- tion of factors, Pub!. Res. Inst. Math. Sci., (A) 4 (1968). 51-130. [19J A. Connes, Une classification des facteurs de type III, Ann. Sci. Ecole Norm. Sup., (4) 6 (1973), 133-252. [20J E. J. Woods, The classification of factors is not smooth, Canad. J. Math., 25 (1973), 96- 102. [21J M. Takesaki, Duality for crossed prod- ucts and the structure of von Neumann alge- bras of type III, Acta Math., 131 (1973), 249-310. [22J A. Connes, Classification of injective factors, Ann. Math., 104 (1976),73-115. [23J W. Krieger, On ergodic flows and the isomorphism of factors, Math. Ann., 223 (1976), 19-70. [24J H. Araki, Some properties of modular conjugation operators of von Neumann alge- bras and a non-commutative Radon-Nikodym theorem with a chain rule. Pacific J. Math., 50 (1974),309-354. [25J A. Connes, Caracterisation des espaces vectoriels ordonnes sous-jacents aux algebres de von Neumann, Ann. Inst. Fourier, 24, Fasc. 4 (1974). 127-156. [26J M. Choi and E. Effros, Separable nuclear C*-algebras and injectivity, Duke Math. J., 43 (1976),309-322. [27J M. Choi and E. Effros, Nuclear C*- algebras and injectivity: The general case, Indiana Univ. Math. J., 26 (1977), 443-446. [28J R. V. Kadison and J. R. Ringrose, Funda- mentals of the theory of operators I, Academic Press, 1983. [29J S. Stratila, Modular theory in operator algebras, Editura Academieil Abacus Press, 1981. 309 (XX.7) Orbit Determination A. General Remarks The purposes of the theory of orbit determi- nation are (1) to study properties of orbits of celestial bodies, (2) to determine orbital ele- ments from observed positions of the celestial bodies, and (3) to compute their predicted positions utilizing the orbital elements. Celes- tial bodies to which the theory is applied are mainly planets, asteroids, comets, satellites, and artificial satellites in the solar system, although orbits of meteors and visual, photo- 1162 metric, and eclipsing binaries can be deter- mined by similar methods. B. Kepler's Orbital Elements Consider, for example, an asteroid moving on an ellipse with one focus at the SUITI. The ellip- tic orbit is fixed by the initial conditions of the motion or the integration constant, of the tHamilton-Jacobi equation ( 55 Celestial Mechanics) and is described by Kepler's orbital elements a, e, (J), i, Q, and to (Fig. 1). north Fig. 1 Orbital elements The size and shape of the ellipse are deter- mined by the semimajor axis (half rhe tmajor axis) a and the teccentricity e, while the argu- ment (J) of perihelion, measured from the as- cending node to the perihelion, shows the direction of the major axis. (Sometimes, we adopt as one of the main parameters the peri- helion distance q = a(l-e) instead of the semi- major axis a.) The position of the orbital plane is determined by the inclination angle i to the ecliptic and the longitude Q of the ascending node, and then the position of the asteroid on the orbit is determined by the time to of the perihelion passage. The period T of one revolution, or mean motion n = 2n/T, which is the mean angular velocity, is computed by Kepler's third law n 2 a 3 = fl., with fI. a constant depending on the mass of the asteroid. The mean motion is a fundamental frequency in the solution of the Hamilton-Jacobi equation and is obtained by differentiating the energy constant - fI./2a with respect to an action variable j;;.. To express the position of the asteroid on the ellipse as a function of time, we use the true anomaly v, which is the angular distance of the asteroid from the perihelion. the eccen- tric anomaly E, and the mean anomaly M = n(t-t o ). Of these three anomalies the mean anomaly can be derived directly from Kepler's elements, although it must be tran;formed to the true anomaly or to the eccentric anomaly when we compute the coordinates of the aster- 
1163 oid. Kepler's equation E-esinE=M hOlds between E and M. Solving this equation, we obtain an expression for E as a function of M: '" 2 E=M + L -In(ne)sinnM, n=1 n where I n is the tBessel function of order n. However, in practical computations, we often solve equation (1) directly by numerical methods or by using tables. C. Orbit Determination An astrometric observation of a celestial body usua11y consists of measurements of two co- ordinates (right ascension and declination) on the celestial sphere. Therefore, to derive six orbital elements, three sets of observations should be made at three moments separated by appropriate time intervals. If the topocen- tric distance of the celestial body is known, the orbital elements can be computed directly from observations. HOwever, since the distance is not usua11y known, special methods have had to be developed. A method for orbit deter- mination was worked out by C. F. tGauss at the beginning ofthe 19th century to find the orbit of Ceres, the first asteroid to be dis- covered. Although the topocentric distances are not known, we know that orbits of aster- oids are planar, and Kepler's second law, the law of conservation of areal velocity, holds approximately. Therefore we can assume that the area of the triangle made by the sun and the two positions of the asteroid observed at different moments is proportional to the cor- responding time interval. Using this property of the orbit we can derive the topocentric distance and then the orbital elements. This method is ca11ed the indirect method, and similar methods can be developed for para- bolic and hyperbolic orbits. D. Osculating Elements and Orbit Improvement For the ttwo-body problem the orbit is a fixed and invariable e11ipse, and therefore Kepler's orbital elements are constants. On the other hand, when gravitational interactions from other bodies cannot be disregarded, the orbital elements are found to be variable by comput- ing the tperturbations by the tmethod of vari- ation of constants. The perturbations are expressed as sums of periodic, tsecular, and long-periodic terms. 309E Orbit Determination (1 ) Because of the perturbations, the orbit deviates from the fixed el1ipse, although at every moment the instantaneous velocity and position of the asteroid determines an el1ipse. The orbital elements of the e11ipse thus defined at each moment, ca11ed osculating elements, are variable with time. To compute perturbations that cause this change of osculating elements, it is necessary to observe the initial conditions of motion, i.e., the osculating elements at the initial moment. During a time interval shorter than the period of one revolution, the vari- ations of the osculating elements are usual1y very small. Therefore, by three sets of observa- tions made at three moments at short inter- vals, it is possible to determine the orbital elements that can .be identified with the oscu- lating elements observed at the mean moment. However, if the intervals are very short, errors in the determined values often are very large, and it becomes necessary to carry out obser- vations at distant moments also. When such additional observations are made, those data are compared with the respective values that fo11ow from the initial observations, and the perturbations computed from them; then the method of least squares is used to improve the estimation of the orbital elements. (2) E. Artificial Satellites Since the periods of revolution of asteroids are of the order of a few years, the osculating elements change very little in a few weeks. On the other hand, for artificial sate11ites moving around the earth, the periodic as we11 as secu- lar perturbations become very large after a few hours because the period of revolution may be as short as two hours. Therefore, to determine orbital elements for artificial satel- lites, observed positions should be corrected by subtracting the effects of periodic pertur- bations computed from approximate orbital elements already known. By using the obser- vations thus corrected, mean orbital elements are derived by the method of orbit improve- ment. The approximate orbital elements can be computed if the launching conditions of the sate11ites are known. In this manner, mean orbital elements can be derived every day, and variations of the mean orbital elements, or amounts of secular perturbations, for a certain period (say, for 100 days) are found. From them, information on atmospheric density and the gravitational potential of the earth are derived. It should be remarked that for arti- ficial satel1ites distance measurements have been made by radar, and velocity determi- nations have been made by measuring the Doppler effect. 
309F Orbit Determination For satellites of other planets, measurements of two coordinates with respect to the centers of the planets are made. Masses of planets can be computed by Kepler's third law when the orbital elements of satellites are known, and gravitational potentials of the planets can be determined from their secular perturbations. F. Binaries In the study of visual binaries, methods similar to those for satellites can be applied, although the exact estimation of the distances to bina- ries is often impossible. For photometric binaries radial components of velocities are derived by measuring the Doppler effect; and for eclipsing binaries important information, such as their masses, densities, and sizes, as well as data regarding their internal consti- tutions, can be derived from the observed orbital elements. References [IJ G. Stracke, Bahnbestimmung der Planeten und Kometen, Springer, 1929. [2J A. D. Dubyago, ";'he determination of orbits, Macmillan, 1961. [3J C. F. Gauss, Theoria motus, Dover, 1963. [4J P. R. Escobal, Methods of orbit determi- nation, Wiley, 1965. 310 (XII.4) Ordered Linear Spaces A. History Many spaces used in functional analysis, such as tHilbert spaces, tBanach spaces, and ttopo- logical linear spaces, are generalizations of Euclidean spaces, where the leading idea has been to generalize the distance in Euclidean spaces in various ways. On the other hand, generalizing the order concept for real num- bers has led to spaces of another kind: ordered linear spaces and vector lattices. The theory of vector lattices was presented in a lecture by F. Riesz at the International Congress of Mathe- maticians in 1928 [IJ and has been developed by many authors. Among them we cite H. Freudenthal, L. V. Kantorovitch (Mat. Sb., 2 (44) (1937)), Riesz (Ann. Math., (2) 41 (1940)), S. Kakutani, F. Bohnenblust, G. Birkhoff [2J, H. Nakano [3], B. C. Vulikh [5], and H. H. Schaefer [8]. Vector lattices have been used in lattice-theoretic treatments of integration ( 1164 Section I), tspectral resolution, and tergodic theory. The central notion is BanaGh lattices ( Section F), but the theory has been ex- tended to the case where E is a tlocall y convex topological linear space with the sl ructure of a vector lattice [6-8J. B. Definitions A real tlinear space E is said to be an ordered linear space if E is supplied with an torder relation? with the following two properties: (i) x? y=>x+z? y+z; (ii) x? y and Jc?O (A is a real number)=>Ax?}.y. If, in addition, E forms a tlaMice under this order ?, we call E a vector lattice (Riesz space or lattice ordered linear space). For Sections B through E, we a,.sume E to be a vector lattice. For any x, YE E, the tjoin and tmeet of x, yare denoted by x v y and x/\y respectively. The following relations are obvious: (x+z)v(y+z)=(xv y)+z, (x +z) /\(y +z) = (x /\y) +z, hVAY=)"(xvy), ).X/\AY=)"(X/\Y) (A?O), hVAY=).(X/\Y), AX/\AY=Jc(XVY) (AO) and (x v y) /\z =(x /\z) v(y /\z), (x /\y) V Z =(x v z) /\(y V z). The last relation means that E is a tdistribu- tive lattice. For xEE, the elements xvO, (-x)vO, and xv( -x) are called respectively the positive part, negative part, and absolute value of the element x, and are denoted respectively by x +, x-, and Ixi. The following identities hold: x= x+ X- (Jordan decomposition), Ixi =x+ + x-, x+ /\x- =0, xvy+x/\y=x+y, Ihl= IAllxl, and Ix- yl =xv y-x/\y. For a, bEE with ab, the set {xlaxh} is called an interval and is denoted by [a, b]. A subset of E is called (order) bounded if it is included in an interval. An element e of E is said to be a unit or an Archimedean unit if for any XEE there exists a natural number n such that x  ne. A linear subspace I of E is called an ideal (or order ideal) of E if XEI and Iyl Ixi imply YEI. C. Order Limits Given a subset {x.} of E, if an element x of E is an upper bound of {x.} and any upper bound y of {x.} satisfies the relation y?x, then it is called the least upper bound (or su- 
1165 premum) of {x.} and is denoted by sup.x. or V.x.. The greatest lower bound (or infimum) of {x.}, denoted by info x. or /\. x., is defined dually. A sequence {Xn} (xnE E) is said to be order convergent to x if there exists a nonincreasing sequence {un} (unEE) such that /\nun=O and IXn - xl  Un' In this case x is called the order limit of {xn} and is denoted by x=o-limx n . For order convergent sequences {xn} and {Yn}, we can show the following relations: o-lim(Ax n + llYn) =A(o-limx n ) + ll(o-limYn) and o-lim(xnXYn) = (o-lim xn)x(o-limYn). We say that E is Archi- medean if the relations O nxy (n = 1,2,3,...) imply x = O. If E is Archimedean, then the relations x= o-limx n and A=limAn imply Ax= o-limAnxn. We say that E is complete (CJ- complete) if any (countable) subset of E that is bounded above has a least upper bound. A CJ-complete vectOr lattice is always Archime- dean. If E is CJ-complete, for any sequence {xn} bounded above, o-lim sup X n is defined to be /\n Vmo;,nxm; we define o-liminfx n similarly. With these definitions, x=o-limx n is equiva- lent to x=o-limsupx n =o-liminfx n . Any Archimedean vector lattice can be extended to a complete vector lattice in the same way as the real numbers are constructed from the rational numbers by Dedekind cuts ( 294 Numbers). D. Examples of Vector Lattices tSequence spaces, such as c, m, and Ip, and tfunction spaces, such as C, M, and Lp, form vector lattices under pointwise ordering ( 168 Function Spaces). Among these spaces c and C are not u-complete, but the others are complete. We give two examples. First, let I be a CJ-algebra of subsets of a space Q, and let A(Q, I) be the set of all finite CJ-additive tset functions defined on L. Then A(Q, I) is a complete vector lattice if we define III  112 to mean 1l1(S)1l2(S) for any SEI. The second example is an ordered space consisting of all tbounded symmetric operators T on a Hil- bert space H, where we define Tl  T 2 to mean (T1X,X)(T2X,X) for any XEH. In general, this space is not a vector lattice. However, if A is a commutative tW*-algebra of operators on Hand S is the set of tsymmetric operators be- longing to A, then S is a complete vector lattice under the ordering just defined. We can re- place the conditions of finiteness in A(Q, I) and boundedness in S with weaker ones and sti11 obtain the same situation. The tRadon- Nikodym theorem in A(Q,I) and the tspectral resolution theorem of symmetric operators in S can be extended to theorems of tspectral representations in general vector lattices. 310F Ordered Linear Spaces Let El be a linear space of functions defined on a set Q ordered pointwise. If there exists a bijective mapping defined on a vector lattice E onto El that is linear and order isomorphic, we cal1 El a representation of E. If E has an Archimedean unit and is simple (which means that E and {O} are the only ideals of E), then E can be represented as the set of real numbers such that the Archimedean unit of E is repre- sented by the number 1 (H. Freudenthal, Proc. Akad. Amsterdam, 39 (1936)). E. Dual Spaces Let £(E, F) be the set of order bounded linear mappings of a vector lattice E into a vector lattice F, where order boundedness means that any bounded (in the sense of the order) subset of E is mapped into a bounded set of F. For any CfJ1, CfJ2 E £(E, F), define CfJl  CfJ2 to mean CfJl(X)CfJ2(X)(X0,XEE). IfF is complete, then £(E, F) is a complete vector lattice. An element CfJ E £(E, F) is called a positive operator if CfJ  O. If F is the set of real numbers R, then £(E, F) is the set of al1 (order) bounded tlinear func- tionals on E. This space, cal1ed the dual lattice of the vector lattice E and denoted by E b , is a complete vector lattice. For fEE b and XO, xEE, we have j+(X)= sup f(Y), r(x)= - inf f(y), Oyx Oyx Ifl(x) = sup f(Y). lylO;;x F. Banach Lattices A linear space E is cal1ed a normed vector lattice if E is a vector lattice having the struc- ture of a tnormed space satisfying Ixi  I yl= Ilxll  Ilyll. Furthermore if a normed vector lattice E is complete relative to the norm, we call E a Banach lattice. The examples in Sec- tion D are Banach lattices (for IlEA(Q,I) we define 111111 = 11l1(Q)). In Banach lattices, Ilxn-xll-+O and II Yn- yll -+0 imply IlxnXYn-xxyll->O. Among relations between order convergence and nOrm conver- gence in Banach lattices, the following is one of the most fundamental: In a Banach lattice E, norm convergence of a sequence {xn} to x is equivalent to relative uniform star convergence of {xnrto x, i.e., for any subsequence {xn(m)} of {xn}, there exists a subsequence {Xn(m(l))} of {xn(m)} and an element Y of E satisfying the relations IXn((l)) - xl y/I (I = 1, 2,...). Any set bounded relative to the order is bounded relative to the norm, but the con- verse does not hold in genera1. For a linear functional, however, these two concepts of boundedness coincide, and the order dual of 
3IOG Ordered Linear Spaces E is the same as the norm dual of E. More- over, the dual (in any sense) of a Banach lattice is also a Banach lattice. G. Abstract M Spaces and Abstract L Spaces For a Banach lattice E, we consider the follow- ing three conditions: (M) x, y?O=>llxv YII = max(llxll, Ilyll). (L) x, y?O=>llx+ yll = Ilxll + II yll. (Lpl x /\y=O=>llx+ ylIP= Ilxil P + II yilP (1 < p < iXJ). If E satisfies one of the conditions (M), (L), or (L p ), we say that E is an abstract M space, and abstract L space, or an abstract Lp space, written AM, AL, and ALp, respectively. If the unit ball of an AM space has a tgreatest element, it is called the Kakutani unit of E. The duals of AM spaces, AI. spaces, and ALp spaces are AI. spaces, AM spaces with the Kakutani units, and ALq spaces (l/p+ 1/q = 1), respectively. An AM space with a Kakutani unit is represented by C(Q), i.e., the set of all real-valued continuous functions defined on a compact Hausdorff space Q. The AI. spaces and ALp spaces are represented by Ll and Lp, respectively, on a tmeasure space. Here the representation of a Banach lattice means a representation of a vector lattice preserving the norm (Kakutani, Ann. Math., (2) 42 (1941); Bohnenblust, Duke Mt.lth. J., 6 (1940)). H. Spectral Properties of Positive Operators The n-dimensional real vector space En is a vector lattice under pointwise order ( Sec- tion D). An element x in En is called strictly positive if Xi > 0 for all i. A square matrix A = (aiJ of order n is called positive if ai.j?O for all i and). It corresponds to a positive opera- tor in En ( Section E). A is called irreducible if there exists no permutation matrix P such ( A, A2 ) that p- I AP = 0 A 3 ' where AI and A3 are square matrices of order n i (1  n i < n). We denote by CJ(A) the tspectrum of A and by r(A) the tspectral radius of A, i.e., sup{ IIXIIIXECJ(A)}. The spectral circle of A is the circle of radius r(A) having the origin as its center. O. Perron (Math. Ann., 64 (1907)) and G. Frobenius (S. B. Preuss. Akad. Wiss., 1908 and 1912) established the following remarkable result on the spec- tral properties of positive matrices. Theorem (Perron-Frobenius). Let A be a positive square matrix. Then its spectral radius r(A) belongs to CJ(A), and for this spectrum r(A) there exists an eigenvector x? O. Assume further that A is irreducible and the order of A is greater than 1. Then r( A) > 0 and the eigen- space of A for r(A) is a I-dimentional subs pace 1166 spanned by a strictly positive element. In this case the eigenvalues of A on the spectral circle are the kth roots of unity for some k multiplied by r(A), each of which is a simple root of the eigenequation of A. Since a positive matrix of order n corre- sponds to a positive operator in En' exten- sions of this theorem to positive operators in ordered linear spaces have been studied by many mathematicians. For these extensions, see the following articles: M. G. Krein and M. A. Rutman (Amer. Math. Soc. Transl., 26 (1950); original in Russian, 1948), F. F. Bonsall (J. London Math. Soc., 30 (1955)), S. Karlin (J. Math. Mech., 8 (1959)), T. Ando (J. Fac. Sci. Hokkaido Univ., ser. 1, 13 (1957»), H. H. Schae- fer [8], H. P. Lotz (Math. Z., 108 (1968)), F. Niiro and I. Sawashima (Sci. Pap. ColI. Gen. Educ., Univ. Tokyo, 1966), I. Sawashima and F. Niiro (Nat. Rep. Ochanomizu Univ., 30 (1979)), and S. Miyajima (J. Fac. Sci. Univ. Tokyo, 27 (1980) ). I. Integrals Based on Ordering As applications of ordered linear space theory, we state the integrals of Daniel-Stone and of Banach. Let us begin with a set if of real- valued functions defined on an abstract space S and assume that G: is a vector lattice with respect to the usual order relation, addition, and scalar multiplication. Assume further that a functional E(f) defined on G: satisfies the following conditions: (i) additivity, i.e., E(f +g) =E(f)+E(g); (ii) positivity, i.e.,.f;O implies E(f)?O; (iii) f, fnEG: (n= 1,2,...) and If I  ,Ifnl imply E(lfl);;", E(lfnl), where If I means r v ( - f). A functional on G: satisfying both conditions (i) and (ii) is called a positive linear functional. A positive linear functional on G: satisfies M. H. Stone's condition (iii) if and only if it satisfies P. J. Daniell', condition (iii)': f1 ?f2? .., and limn_wfn =0 imply limn_wE(fn)=O. Next, we define, for every function q> on S admitting :!:iXJ as values, a functional N(q» as follows: N(q» = inf{J1 E(lfnl)jInEG:, 1q>1 ;nllfnl}. Here we put N(q» = +iXJ when for a function q> there are no functions Un} such that 1q>1  ;;",Ifnl. A function q> is, by defini:ion, a null function if N(q»=O holds, and a set A is a null set if its tcharacteristic function is a null function. Since each function of 3'(1 = { q> I N (q» < +iXJ} takes finite values except on a null set, we can define addition and the scalar multipli- cation for such functions except on a null set. Let !Y be the set of equivalence cla,ses of !Yo 
1167 with respect to the relation cp - t/J defined as N(cp-t/J) =0. Then  is a Banach lattice with the norm N, and (f is included in  (by iden- tifying f and 9 of (f when E( I f - 9 I) = 0). Let us denote now by f! the closure of (f in . Then any function cp belonging to f! is said to be Daniell-Stone integrable, and L(cp)=N(cp+)- N(cp-)(cp+ =t(lcpl+cp), cp- =t(lcpl-cp)) is ca11ed the Daniell-Stone integral of cpo The integral L thus defined is, as a functional on f!, an extension of the functional E on (f. For this integral, Lebesgue's convergence theorem is easily proved, and a result corresponding to Fubini's theorem has been obtained [9]. Fur- thermore, the concepts of measurable func- tions, measurable sets, and measure can be defined by using Land f!. Also, the relation between L and the Lebesgue integral with respect to this measure is known [9]. Since f! is an tabstract L-space, the Danie11-Stone integral L(cp) is represented by the Lebesgue integral of cp on a certain measure space. The Danie11-Stone integral introduced above is due to M. H. Stone [9J. Danie11 (Ann. Math., (2) 19 (1917-1918)) origina11y defined the upper integral /(cp) by using E(f) on (f satisfying conditions (i), (ii), and (iii'). Also, he defined the set f! of Danie11-Stone integrable func- tions by L={cpl/(cp)= -/(-cp)}. S. Banach defined an integral by using methods similar to Danie11's, replacing condition (iii') for a posi- tive linear functional E(f) on (f by condi- tion (iii"): lim n _o:Jn=O, Ifni ::::;g, andf", gE(f imply limn_oo E(fn) = 0 [10]. Furthermore, N. Bourbaki [l1J and E. J. McShane (Proc. Nat. Acad. Sci. US, 1946) have defined a more gen- eral integral than the Daniell-Stone with a condition analogous to (iii'), replacing the se- quence in it by a tdirected family offunctions {f.}. Specifica11y if, in the Daniel1-Stone integral, S is a loca11y compact Hausdorff space and (f is the set of continuous functions with compact supports, then a functional E(f) on (f satisfy- ing conditions (i) and (ii) is proved to satisfy the condition (iii'), and the Danie11-Stone integral L(cp) can be constructed from E(f) [11]. Banach also defined another integral for a11 real-valued bounded functions on [0,1) by using the tHahn-Banach extension theorem [12]. His definition is as fo11ows: Let  be the set of a11 real-valued bounded functions on [0, 1) and I!I be the family of a11 finite sets of real numbers IX = (IX 1, 1X 2 , ... , IXn). Furthermore, we define, for X(S)E and IXEI!I, 1 n M(x,IX)= sup - I X(S+lX k ), -oo<s<+oo nk=l where x(s) is considered as the periodic exten- sion to (-00, +(0) of the function defined 310 Ref. Ordered Linear Spaces origina11y on [0,1) and p(x)= inf M(x, IX). .E Then, by the Hahn-Banach extension theorem, there exists a linear functional F on  satisfy- ing F(x)::::; p(x). If we write S x(s)ds for F(x), then we can prove immediately that S x(s)ds has the fo11owing properties: (1) S {ax (s) + by(x)} ds = as x(s)ds +b S y(s)ds, where a and b are real constants. (2) x(s)O implies S x(s)dsO. (3) S x(s+ so)ds = S x(s)ds, where So is an arbitrary real number. (4) J1 ds = I. If necessary, we can add the property (5) S x(l- s)ds= S x(s)ds by defining f X(S)dS={F(X(S))+F(X(I-S))}. Then F(x)= f x(s)ds or {F(X(S))+ F(x(l-s))} = f x(s)ds is ca11ed the Banach integral of x(s). The construction of the Danie11-Stone inte- gral and the Banach integral opened avenues to several other abstract integrals based on the order relation, such as an integral for more general functions with values in a vector lat- tice, or an integral considered as a mapping from a vector lattice into another vector lattice (or from an ordered set into another ordered set). Indeed, if the function takes values in a complete vector lattice, then almost all results in this section (e.g., the Hahn-Banach exten- sion theorem) hold trivially. For discussions of these and other abstract integrals  [13-15]. References [IJ F. Riesz, Sur la decomposition des oper- ations fonctionne11es lineaires, Atti del Con- gresso Internazionale dei Matematici, 1928, Bologna, 3, 143-148. [2J G. Birkhoff, Lattice theory, Amer. Math. Soc. Co11oq. Pub!., revised edition, 1948. [3J H. Nakano, Modulared semi-ordered linear spaces, Maruzen, 1950. [4J M. M. Day, Normed linear spaces, Erg. Math., Springer, 1958. [5J B. C. Vu1ikh, Introduction to the theory of partia11y ordered vector spaces, Gronigen, 1967. [6J I. Namioka, Partia11y ordered linear top- ological spaces, Mem. Amer. Math. Soc., 1957. [7J A. L. Peressini, Ordered topological vector spaces, Harper & Row, 1967. 
311 A Ordering [8J H. H. Schaefer, Banach lattices and posi- tive operators, Springer, 1974. [9J M. H. Stone, Notes on integration, Proc. Nat. Acad. Sci. US I-III, 34 (1948),336-342, 447-455,483-490; IV, 35 (1949), 50-58. [IOJ S. Banach, The Lebesgue integral in abstract spaces, Theory of the Integral, S. Saks (ed.), Stechert, 1937,320-330 (Dover, 1964). [11J N. Bourbaki, Elements de mathematique, Integration, ch. 1-9, Actualites Sci. Ind., Her- mann: ch. 1-4, 117a, second edition, 1965; ch. 5, 1244b, second edition, 1967; ch. 6, 1281a, 1959; ch. 7, 8, 1306, 1963; ch. 9, 1343, 1969. [12J S. Banach, Theorie des operations lineaires, Warsaw, 1932, 30-32. (Chelsea, 1963). [13J S. Izumi, N. Matuyama, M. Nakamura, M. Orihara, and G. Sunouchi, An abstract integral, Proc. Imp. Acad. Tokyo, I-III, 16 (1940),21-25,87-89,518-523; IV, 17 (1941), 1-4; V-X, 18 (1942), 45-49, 50-52, 53-56, 535-538,539-542,543-547. [14J E. J. McShane, Order-preserving maps and integration processes, Ann. Math. Studies, Princeton Univ. Press, 1953. [15J T. H. Hildebrandt, Integration in abstract spaces, Bull. Amer. Math. Soc., 59 (1953), 111- 139. [16J N. Dunford and J. T. Schwartz, Linear operators, Wiley, I (1958), III (1971). [17J K. Y osida, Functional analysis, Springer, 1965. 311 (11.12) Ordering A. Ordering The concept of ordering is abstracted from various relations, such as the inequality rela- tion between real numbers and the inclusion relation between sets. Suppose that we are given a set X = {x, y, z, ... }; the relation be- tween the elements of X, denoted by  or other symbols, is called an ordering (partial ordering. semiordering, order relation, or sim- ply order) if the following three laws hold: (i) the reflexive law, x  x; (ii) the antisymmetric law, x  V and y  x imply x = y; and (iii) the transitive law, x  y and y  z imply x  z. A set X with an ordering between its ele- ments is called an ordered set (partially ordered set or semiordered set). A subset of an ordered set X is also an ordered set with respect to the same ordering as in X. Iffor an arbitrary pair of elements x, y of an ordered set A either xy or yx must hold, then the ordering  1168 is called a total (or linear) ordering, and A is called a totally ordered (or linearly ordered) set. We sometimes write xy as yx. The binary relation  is called the dual ordering of ; it is also an ordering. More generally, the duals of concepts, conditions, and propositions concerning an ordering are defined by replac- ing the ordering with its dual. For example, x < y means that x  y and x =f. y, while x> y means that xy and x=f.y; and> is the dual of <. If a universal proposition concerning the ordering is true, then its dual is also true; this principle is called the duality princil)le for ordering. IncidentalIy, x  y is equivalent to the statement x <y or x = y, according to the definition of <. B. Definitions A subset of an ordered set X of the form {xla<x<h} is denoted by (a,h), and a set of the form (a,h), {xlx<a}, or {xlx>a} is called an interval. In particular, S(c) = {x I x <c} is calIed the segment of X determined by c. A pair of elements a, b satisfying a  b is calIed a quotient of X and is denoted by bla. When a<c<b or b<c<a, c is said to lie between a and b. A totalIy ordered ;et A is said to be dense if for any pair of distinct elements a and b in A there exists a third element c lying between a and b. When a < h and there is no element lying between a and b, then a is calIed a predecessor of b, and b a successor of a. In this manner, most of the terminology as- sociated with the inequality of numbers is carried over to general ordering. In an ordered set A, an element a is calIed an upper bound of a subset X if x  a for every element x of X. When an upper bound exists, X is said to be bounded from above (or bounded above). The dual concept of an upper bound is a lower bound of the subset; and if the subset has a lower bound, it is said to be bounded from below (or bounded below). A set bounded both from above and from below is simply said to be bounded. When a is an upper bound of X and aEX, then a is calIed the greatest element (or maximum element) of X. Such an element a (if it exists) is unique and is denoted by max X; its dual is the least element (or minimum element) and is denoted by min X. If there is a least element in the set of upper bounds of X, it is calIed the least upper bound (or supremum) of X and is denoted by I.u.b. X or sup X. Its dual is the greatest lower bound (or infimum) and is denoted by g.l.b.X orinfX. If the ordered set X is the image rp{A) of a set I\. under a tmapping qJ, where I\. is of the form {Ie I C(},)}, then sup X is also written 
1169 SUPC(A) cp(A) and is called the supremum of cp(A) for all A that satisfy C(A). When there is no danger of misunderstanding, it may be written as SUPA cp(A) or sup cp(A) and called simply the supremum of cp(A); similar conventions hold for inf, max, min, etc. An element a of a set X is called a maximal element if a<x never holds for any element x of X; its dual is a minimal element. If the greatest neast) element exists, it is the only maximal (minimal) element. But in general, a maximal (minimal) element is not necessarily umque. C. Chain Conditions An ordered set X is said to satisfy the minimal condition if every nonempty subset of X has a minimal element. The dual condition is called the maximal condition. An infinite sequence {aI' a 2 , ..., an, ... } of elements of an ordered set X such that a( < a2 < ... < an < ... is called an ascending chain, and the condition that X has no ascending chain is called the ascending chain condition. The notions dual to those of ascending chain and ascending chain condition are descending chain and descending chain condition, respectively. By the chain condition, we mean either the ascending or the descending chain condition. Under the taxiom of choice, the maximal condition is equivalent to the ascending chain condition, and the minimal condition to the descending chain condition. If a totally ordered set X satisfies the mini- mal condition or, equivalently, if every non- empty subset of X has a least element, then the set X is called a well-ordered set, and its order- ing is called a well-ordering. The following theorem is called the principle of transfinite induction: Let P(x) be a propo- sition concerning an element x of a well- ordered set X such that (i) P(x o ) is true for the least element Xo of X, and (ii) P(x) is true if P(y) is true for all y satisfying y <x. Then P(x) is true for all x in X. tMathematical induction is a special case of this principle, where X is the set of all natural numbers. To define a mapping F from a well-ordered set X into a set Y, we may use the following principle: Suppose that F(x o ) is defined for the least element Xo of X, and for each element x of X there is given a method to associate an ele- ment G(f) of Y uniquely with each mapping f: S(x)-+ Y with domain S(x), where S(x) is the segment of X determined by x. Then there exists a unique mapping F: X -+ Y satisfying F(x) = G(F I S(x)) for all x. The definition of the mapping F by this principle is called a definition by transfinite induction. The prin- 311 G Ordering ciples of induction are often used for proving propositions or giving definitions concerning ordinal numbers ( 312 Ordinal Numbers). D. Directed Sets An ordered set (or in general a preordered set ( Section H)) in which every finite subset is bounded from above is called a directed set. Let B be a subset of a directed set A. If {b I b  a} n B #- 0 for every element a of A, then B is said to be cofinal in A; such a subset B is itself a directed set. If {b I b  a} c B for some element a of A, then B is said to be re- sidual in A; such a subset B is also cofinal in A. The condition that B is co final in A is equiva- lent to the condition that A - B is not residual in A. E. Order-Preserving Mappings A mapping cp:A-+A' of an ordered set A into an ordered set A' is called an order-preserving mapping (monotone mapping or order homo- morphism) if a b always implies cp(a) cp(b). Moreover, if cp is bijective and cp -( is also an order-preserving mapping from A' onto A, then cp is called an order isomorphism. A' is said to be order homomorphic (order isomor- phic) to A when there exists an order homo- morphism (order isomorphism) cp such that A' = cp(A). If a mapping cp :A-+A' gives an order isomorphism of A to the dual of A', cp is called a dual isomorphism (or anti-isomorphism). F. Direct Sum and Direct Product Let S be a set that is the tdisjoint union of a family {AA} AEA of its subsets, and suppose that each A A is an ordered set. F or a, bE S, define ab to mean that a, bEA;. for some AEA and ab with respect to the ordering in AA' The ordered set S obtained in this way is called the direct sum (or cardinal sum) of the family {AA} AEA of ordered sets. When (aAhEA and (b AhEA are elements of the tCartesian product P = ILEA AA of a family {AA} AEA of ordered sets, we define (aA)AEA  (b AhEA to mean that a A  b A holds for all AEA. The ordered set P obtained in this way is called the direct product (or cardinal product) of the family {AA} AEA of ordered sets. G. Ordinal Sum and Ordinal Product Suppose that I!I = {A, B, ...} is a family of mutually disjoint ordered sets and is itself an 
311 H Ordering ordered set. Then an ordering  can be de- fined in the disjoint union S = U X (X E I![) as follows: x  Y in S means that either (i) there exists an A satisfying x, YEA E I![ and x  Y holds with respect to the ordering in A; or (ii) for A and B satisfying xEAEI![, YEBEI![, we have A < B. The ordered set S obtained in this way is called the ordinal sum obtained from I![ and is denoted by '!I X. In particular, if I![ = {A, B} and A < B, the ordinal sum is denoted by A +B. Suppose that X is a subset of the Cartesian product II. X). of a family of ordered sets indexed by an ordered set A, and the subset {A I x). ,,",yJ of A has a least element whenever x=(XJ).EA and Y=(Y).)).EA are two distinct elements of X. The ordering in X defined by setting x < Y when x < Y  for the least ele- ment fl of {A I x). ""' Y J is called the lexico- graphic ordering in X. It can be applied to X = IL.X). if A is well-ordered; X is then called the ordinal product. When A, B, '" are ordered sets, AB . .. is often used to denote the ordinal product obtained from Xl = A, X 2 = B, . ., with the ordering 1 < 2 < '" of indices; the ordering in this ordinal product is called the lexicographic ordering in the Cartesian product AxBx .... H. Preordering A relation R between elements of a set X is called a preordering (or pseudoordering) if it satisfies the reflexive law and the transitive law, but not necessarily the antisymmetric law. By defining (X 1 'Y1)R (xz,Yz)-=x 1 xz, for example, a preordering of pairs (x, y) of real numbers is obtained. From a preordering R an equivalence relation  can be defined in X by xy-=(xRyandyRx). Let [XJ=X/ be the tquotient set of set X by this equivalence relation, and let [x], [yJ be the equivalence classes determined by x, YEX; then an order- ing  can be defined in [XJ by [xJ  [yJ-= xRy. (For further topics  52 Categories and Functors; 409 Structures.) References [lJ N. Bourbaki, Elements de mathematique, I. Theorie des ensembles, ch. 3, Actualites Sci. Ind., 1243b, Hermann, second edition, 1967; English translation, Theory of sets, Addison- Wesley, 1968. [2J G. Birkhoff, Lattice theory, Amer. Math. Soc. Colloq. Publ., 1940, revised edition, 1948. Also  references to 381 Sets. 1170 312 (11.13) Ordinal Numbers A. General Remarks Let A  B mean that two tordered 8ets A, B are torder isomorphic; then the relation  is an tequivalence relation. An equivalence class under this relation is called an ordlr type, and the class to which an ordered set A belongs is called the order type of A. Historically, an ordinal number was first defined as the order type of a twell-ordered set (Cantor [2J). How- ever, it was found that a contradic1ion occurs if order type defined in this way arl considered to form a set. Hence, another definition was given by J. von Neumann [3J, which is stated in Section B. A similar situation was found concerning the definition of tcardinal numbers, which led to a new definition of cardinal num- bers using ordinal numbers, which is given in Section D. B. Definitions A set IX is called an ordinal number if it satisfies the following two conditions: (i) IX]S a well- ordered set with the tbinary relation E as its ordering; and (ii) {3 E IX implies {3 co:. Accord- ing to this definition, the empty se1 is an ordinal number, which is denoted by O. Also, 1 ={O}, 2={0, 1}, 3={0, 1,2},... are ordinal numbers. These ordinal numbers, which are finite sets, are called finite ordinalllumbers. The finite ordinal numbers are identified with the natural numbers (including 0). The set w = {O, 1,2, ... } of all natural numbers is also an ordinal number. An ordinal numbr that is an infinite set, like w, is called a traostinite ordinal number. For every well-ordered set A, there exists one and only one ordinal number order iso- morphic to A. This ordinal number is called the ordinal number of A. (Throughout this article, lower-case Greek letters denote ordinal numbers.) We also write IXE {3 as IX < {3, which defines an tordering of the ordinal numbers. The least ordinal number is 0, and the order- ing of the finite ordinal numbers coincides with the usual ordering of the natural num- bers. The least transfinite ordinal number is w. The ordering ,introduced by defining IX  {3 to mean either IX < {3 or IX = {3, is a tlinear order- ing and, in fact, a twell-ordering of the ordinal numbers. Therefore ttransfinite induction can be applied to ordinal numbers. For any ordinal number IX, the set IX' = g I   IX} is also an ordinal numb(:r, and is the tsuccessor of IX. There exists at most one 
1171 ordinal number that is the tpredecessor of IX. A transfinite ordinal number without a prede- cessor is called a limit ordinal number, and all the other ordinal numbers are called isolated ordinal numbers. The first limit ordinal num- ber is w, For any set A of ordinal numbers, m 3f1 (<flEA)} is an ordinal number and is sup A, the tsupremum of A. C. Sum, Product, and Power The sum IX + 13, the product IX' 13 (or IXI3), and the power IX P of ordinal numbers IX, 13 are de- fined by transfinite induction on 13 and have the following properties: IX+O=IX, IX +I3'=(1X +13)', IX+Y = sup {IX + I  <'I}; IX'O=O, IX' 13' =IX' 13+ IX, IX'Y=SUp{IX' I  <'I}; lX o =l, IXP'=IXP'IX, IXY=sup{IXI<Y}, Here 'I is a limit ordinal number, and for the power we assume that IX> O. The sum and product thus defined satisfy the associative laws (IX + 13)+'1 = IX +(13 +y), (IX' f3)'y = IX' (13' 'I) and the left distributive law IX' (13 + 'I) = IX' 13 + IX' 'I; the power satisfies the laws IX P + Y = IX P 'IX Y , IXPY = (IXP)Y. If IX and 13 are the ordinal numbers of the well-ordered sets A and B, respectively, then IX + 13 is the ordinal number of the tordinal sum A + B, and IX' 13 is the ordinal number of the tordinal product BA. When n> 1, any ordinal number IX can be written uniquely in the form IX = n P " 'II + n P2 . '12 + ... + n Pn . 'In; f31>f32>...f3n>O, O<Yi<n, lin, which is called the n-adic normal form for IX; when n = w, it is called Cantor's normal form. Let f be an ordinal number-valued function of ordinal numbers. We say that f is strictly monotone when IX < 13 implies f(lX) <f(f3). Iff is strictly monotone, then IX f(IX), We say that f is continuous when f(y)=sup{f()1  <'I} for each limit ordinal number y. A strictly mono- tone continuous function is called a normal function. If f is a normal function, then for any IX there exists a 13 that satisfies f( 13) = 13 > IX. In fact, it suffices to define f3n(n < w) by 130 = f(1X + 1), f3n+! = f(f3n) and put 13= sup{f3n I n <w}. Since f(IX)=w. is a normal function, there exists an B that satisfies w t = B. Such an ordinal number B is called an B-number. We say that 13 is cofinal to IX when there exists a mOnotone function f that satisfies IX = sup{fW' I  <f3}. The first ordinal number that is cofinal to IX is 312 E Ordinal Numbers called the cofinality of IX and is denoted by Cf(IX), D. Cardinal Numbers Let M  N mean that a one-to-one corre- spondence exists between the two sets M and N. An ordinal number IX with the property that IX   implies IX   is called an initial number or a cardinal number. With the taxiom of choice, it can be shown that for each set M there exists one and only one cardinal number IX satisfying M  IX. This unique IX is called the cardinality (or car din al number) of the set M and is denoted by M. All finite ordinal numbers are cardinal num- bers, and w is the least transfinite cardinal number. There exists one and only one mono- tone function that maps the class of ordinal numbers onto the class of transfinite cardinal numbers, and it is a normal function. The value of this function corresponding to IX is denoted by . (aleph alpha) or w.. In partic- ular, o = w, and  1 is both the smallest un- countable cardinal number and the smallest uncountable ordinal number. A finite ordinal number is called an ordinal number of the first number class, and an ordinal number IX satisfy- ing o  IX <  1 is called an ordinal number of the second number class. The concept of ordinal number ofthe third (or higher) number class is defined similarly, E. Inaccessible Ordinal Numbers The cofinality cf(lX) of IX always satisfies cf(lX)  IX. An ordinal number is said to be regular when cf(lX) = IX and singular when cf(lX) < IX. For any ordinal number IX, cf(lX) is a regular car- dinal number; therefore any regular ordinal number is a cardinal number. When IX=W p is regular and 13 is a limit ordinal number, IX is said to be weakly inaccessible. Let R be the set-valued function of ordinal numbers, de- fined by R(O) = 0 and R(IX) = U{I.13(R())1  <IX} (by ttransfinite induction), where I.13(M) de- notes the tpower set of M. A regular ordinal number IX is said to be strongly inaccessible when IX> wand the following condition is satisfied: If x, yare a pair of sets such that XE R(IX), yc R(IX), and there exists a mapping of x onto y, then yE R(IX). If a regular ordinal number IX is strongly inaccessible, it is weakly inaccessible. A strongly inaccessible ordinal number is usually defined as a reg ular number IX> w such that 13 < IX implies 1.13(13) < IX. Under the axiom of choice, this definition is equi- valent to the one given here. Moreover, under the tgeneralized continuum hypothesis, strong 
312 Ref. Ordinal Numbers inaccessibility and weak inaccessibility are equivalent. References [IJ N. Bourbaki, Elements de mathematique, I. Theorie des ensembles, ch. 3, Actualites Sci. Ind., 1243b, second edition, Hermann, 1967; English translation, Theory of sets, Addison- Wesley, 1968. [2J G. Cantor, Beitrage zur Begrundung der transfiniten Mengenlehre II, Math. Ann., 49 (1897),207-246. (Gesammelte Abhandlungen, Springer, 1932; English translation, Contri- butions to the founding of the theory of trans- finite numbers, Open Court, 1915.) [3J J. von Neumann, Zur EinfUhrung der transfiniten Zahlen, Acts Sci. Math. Szeged., 1 (1923), 199-208. (Collected works I, Per- gamon, 1961.) Also  references to 381 Sets. 313 (XII1.2) Ordinary Differential Equations A. General Remarks Let x be a real (complex) variable and y a real (complex) function of x. Assume that y = F(x) is a differentiable function of class C" if x, yare real, and a holomorphic function if x, yare complex. We write y', y", . .. ,y(") for the first n derivatives of y. A relation among x, y, y', ... ,y(n), f(x,y,y', ...,y!n»)=o (which holds identically with respect to x), is called an ordinary differential equation for the function y = F(x). Here we assume that the function f in the left-hand side of (I) is a real (complex) function of the n + 2 variables x, y, y', ..., y!n) and is defined in a given domain of R"+2 (C"+2). Usually we assume further that f has a certain regularity, such as being of class C' (r = 0, I, ... , (0), treal analytic, or tcomplex analytic, A function y = F(x) that satisfies (1) is called a solution of (I). To find a solution of (1) is to solve or integrate it. Ordinary differential equations may be contrasted to partial dif- ferential equations, which are equations similar to (I) but in which y is a function of two or more variables XI' x 2 ,... and which contain the partial derivatives ayjaxr, ayjax 2 , ... (- 320 Partial Differential Equations). Ordinarily, the term differential equation refers to an ordinary or partial differential equation. In this article, 1172 since we are concerned only with ordinary differential equations, we omit the word "ordi- nary." If the left-hand side f of (1) contains y(n) explicitly or afjay(") =1=0, then we say that the order of (1) is n, and if further f is a poly- nomial in y, y', '" ,y(") that is of degree m with respect to y(n), we say that the degre of (1) is m. In particular, if f is a linear form in y, y', ..., y(n), then (I) is said to be linear. i\ differ- ential equation that is not linear is said to be nonlinear ( 252 Linear Ordinary Differ- ential Equations; 291 Nonlinear Problems). Let cp(x, y, C l' '" ,c n ) be a function of the n + 2 variables x, y, C 1, "', C" of class C' in a domain D, and let (xo,Yo,c?, ...,()ED, cp(x o , Yo, c?, ..., c) = 0, and Cpy(x o , YJ, c?, ..., c) =1= O. Then the equation cp(x, y, c?, ... ,c) = 0 defines an timplicit function y(x) of class C' satisfying the condition y(x o ) = Yo. Consider c 1 , ""C n to be constants in cp(x,y,cr. ""c,,)=O and differentiate cp n times with respect to x. Then we obtain a system of n equa1ions in the variables x, y, y', ...,y("),c 1 , ...,c", If we can eliminate c l' ... ,C" from these n equ ations and cp = 0, then we obtain an nth-order differential equation of the form (1). Conversel)', a solu- tion of an nth-order differential eqlLation can usually be written in the form cp(x, y, C l' ... , c,,) = 0, (2) (I) which contains n arbitrary constants c l' ... ,c" (sometimes called integration constants). A solution containing n arbitrary constants of the form (2) of an nth-order differential equa- tion is called a general solution, and a solu- tion cp(x, y, c?, '" ,c) = 0 obtained from a general solution cp = 0 by giving particular values c?, ... , c to the arbitrary co 1stants is called a particular solution. Some equations admit solutions that are not particular solu- tions. They are called singular solutions (for example, tClairaut differential equations;  Appendix A, Table 14.1). B. Systems of Differential Equations A set of n differential equations containing n unknown functions y l' ... ,y" of a variable x is called a system of ordinary differential equa- tions. Here each equation of the system has a form similar to (I), but each left-hand side contains Yr. "', y" and their derivarives. A set of n functions y 1, . .. , Y n of x is called a solution if the functions satisfy the given sy5,tem of dif- ferential equations. The highest order of de- rivatives in the left-hand sides is called the order of the system of differential equations. We consider most frequently a first-order system of the form y;=,t;(x, Y1 ,..., Yn), i= 1,2,..., n. (3) 
1173 If we put Y= Yl, y'= Yz, ...,yln-l)= Yn and solve (1) with respect to yln) to get yln) = In(x, Y1, "', Yn), then (1) is equivalent to a system of equations of the form (3), where 11 = Yz,fz = Y3' ... In = f In an analogous way, a general system of equations can be transformed to a system of the form (3). Therefore (3) is caned the normal form of differential equations. C. The Geometric Interpretation When x, Y1' ...,Yn are real, (3) can be inter- preted as fonows: Let I = (a, b) be an open interval and D a domain of R n . Let Yi=ipi(X,Cb ...,c n ), i= 1,2, ...,n, (4) be functions of class Cl defined for (x, C l' ..., Cn)E I x D, and let !:i(xo) be the image in the Y1, ..., Yn-space of D under the mapping Yi=ipi(XO,C b ...,c n ) (i= 1,2, ...,n) for a fixed xoEI. We assume that for each fixed xoEI we have 8(ip1' ...,ipn)/8(c 1 , ...,cn)#O in D. Then for every x = Xo E I, C l' ... , C n are considered to be functions of (Y1' ..., Yn) defined in a neighborhood of every point (y, ... ,Y) of !:i(x o ), and we have Y; = ip;(x, C 1 ,..., cn) = !;(X'Y1' ...,Yn) (i= 1,2, ...,n), i.e., Yb ...,Yn satisfy a system of differential equations of the form (3). On the other hand, (4) represents a family of curves of class C 1 in the x, Y1' ..., Yn- space Rn+1 containing n parameters C 1 ,..., Cn' for which (y'I' ..., Y) is the tangent vector (in the terminology of physics, (y;, ... ,Y) gives the speed and the direction of a stationary flow in Rn+1 at each point). By solving (3) we find the family of curves of class C' in Rn+1 (in the terminology of physics, we find a stationary flow of which the speed and the direction are given at each point). A solution containing n parameters analogous to (4) is caned a general solution of (3), and a solution obtained from a general solution by giving particular values to the n parameters is caned a particular solution. As may be imagined by the interpretation in this section, there exists in general one and only one particular solution passing through the point (x o ' Y, ..., Y) for XoE I, M,..., Y)E !:i(x o )' The problem of finding this solution, i.e., the solution of (3) for which Yi(XO) = Y? for X=Xo' is caned the initial value problem ( 316 Ordinary Differential Equations (Initial Value Problems)). D. Methods of Integration We have different methods of solving differen- tial equations. To solve differential equations by a finite number of integrations is caned the method of quadrature. This method is useful 314A ODEs (Asymptotic Behavior of Solutions) for some special types of differential equations ( Appendix A, Table 14.1). S. Lie gave theo- retical foundations for this method by using Lie transformation groups ( 431 Transfor- mation Groups; Appendix A, Table 14.111). There are many other methods, for exam- ple, power series methods (assuming that the solution can be expanded in a power series l:av(x-a», substituting the series for Y in (1), and finding its coefficients); methods of suc- cessive approximation; methods using tLa- place transforms or tFourier transforms; tper- turbation methods; numerical methods; etc. ( 303 Numerical Solution of Ordinary Dif- ferential Equations). Historicany, finding explicit solutions of various kinds of differential equations has been the main object of the theory. Recently, however, the importance of qualitative studies, in particular theorems on the existence and uniqueness of solutions, has been recognized. For example, if a solution with a property A is given, and if the uniqueness of the solution having the property A and the existence of solutions having the properties A and B can be shown, then the given solution necessarily has the property B. In this way, topological and analytic studies of differential equations are applied to find their solutions ( 314 Or- dinary Differential Equations (Asymptotic Behavior of Solutions); 315 (Boundary Value Problems); 316 (Initial Value Problems); 126 Dynamical Systems). References [IJ W. Kaplan, Ordinary differential equa- tions, Addison-Wesley, 1958. [2J P. Hartman, Ordinary differential equa- tions, Wiley, 1964. [3J E. Hine, Lectures on ordinary differential equations, Addition-Wesley, 1968. Also  references to 316 Ordinary Differential Equations (Initial Value Problems). 314 (XIII.5) Ordinary Differential Equations (Asymptotic Behavior of Solutions) A. Linear Differential Equations A system of1inear ordinary differential equa- tions can be written as x' = A(t)x, (1 ) 
314 B ODEs (Asymptotic Behavior of Solutions) where t is a real independent variable, x = (x 1, ... ,x n ) is an n-dimensional complex vec- tor function of t, and A(t) is an n x n matrix whose elements are complex-valued functions of t. If A(t) is a continuous function of t de- fined on an open interval I, any solution of (1) is continuously differentiable for tE I. The question naturally arises as to how the solu- tions behave as t approaches either one of the endpoints of I; that is, the question of the asymptotic properties of the solutions. The interval I can always be taken to be O,s; t < 00, by applying a suitable transformation of the independent variable if necessary. The study of the tasymptotic expansions of solutions when the coefficient A(t) is an ana- lytic function of t was initiated by H. Poincare in 1880. This work was continued by 1. Horn, J. C. C. A. Kneser, and others in the direction of removing assumptions on the structure of A(t) and extending the domain where the ex- pansions are valid. The theory has been almost completed by W. 1. Trjitzinsky, J. Malmquist, and M. Hukuhara ( 254 Linear Ordinary Differential Equations (Local Theory)). On the other hand, O. Perron initiated a new direc- tion of research by weakening the regularity conditions on the coefficients. His work was continued by F. Lettenmeyer, R. A. Spath, Hukuhara, and others. The methods used in these two lines of investigation were originally distinct, but Hukuhara established a unified method of treating the problems arising in these two different types of investigations. Furthermore, he succeeded in sharpening those results previously obtained. Here we assume that A(t) need not be ana- lytic. The foIlowing asymptotic properties of a solution x(t) as t-> 00 are considered: (i) boundedness of Iim sup t- 1 10glx(t)l; (ii) boundedness of solution: Iim suplx(t)1 < 00; (iii) convergence of solution: limx(t); (iv) inte- grability: S"'lx(s)IPds< 00, etc. We caIl x(x)= limsupt- 1 10glx(t)1 the type number (or Lya- punov characteristic number) of the solu- tion x(t). The fact that all solutions of (1) are bounded is equivalent to the tstability of the solution x = 0, and the fact that all solutions of (1) tend to zero as t-> 00 is equivalent to the tasymptotic stability of the solution x = o. B. Constant Coefficients and Periodic Coefficients We begin with the particular case of (1), where A(t) is a constant matrix: x' = Ax. (2) To study the asymptotic properties of the 1174 solutions of (2), it suffices to transform the matrix A into a tJordan canonical form, since the structure of the solution space of (2) is completely determined by the Jordan canon- ical form of A. Thus all solutions of (2) are bounded if and only if every eigenvalue of A has a real part not greater than zero, and those with zero real parts are of simple type, that is, the corresponding blocks in the Jordan canonical form are all 1 x 1 matrices; all solu- tions of (2) tend to zero as t -> 00 if and only if every eigenvalue of A has negative real part. Consider the linear system x' = Ao(t) x, (3) where Ao(t) is a periodic matrix function of period w. According to tFloquet's theorem, (3) is transformed into a system with constant coefficients by means of a suitable transfor- mation x = P(t)y, where P(t) is a nonsingular periodic matrix of period w. Thus, at least theoreticaIly, the information on the asymp- totic behavior of the solutions of the periodic system (3) can be derived from the correspond- ing theory for the system with comtant coeffi- cients (2). C. Asymptotic Integration Suppose that A(t) is bounded. Then the type number x(x) is finite for any nontrivial solu- tion x(t) of (1), and the number of distinct type numbers does not exceed n. Consider the linear system x'=[A+B(t)Jx, (4) where A is a constant matrix and B(t) is a matrix function such that S:+1IIB(s)11 ds->O as t-> 00. For any nontrivial solution x(t) of (4), the limit 11 = lim t- 1 10glx(t)1 exists and is equal to the real part of one of the eigenvalues of A. Conversely, if at least one eigenvalue of A has real part 11, then there exists a nontrivial solu- tion x(t) of (4) satisfying limt- 1 10g Ix(t)1 = 11. Suppose in addition that B(t)->O as t-> 00. Let 111 ,s; 112 ,s; .,. ,s; I1n be the real parts of the eigen- values of A. Then there exists a tfundamental system of solutions of(4), {x 1 (t),.. ,xn(t)}, such that for any C j , C k =1= 0, loglc 1 XI (t) +... + CkXk(t)1 = I1kt + o(t). A sharp estimate of the term o(t) was given by Hukuhara. Next consider the linear system x' = [A(t) +B(t)Jx, (5) where the matrices A(t) and B(t) satisfy S"'IIA'(s)11 ds< 00 and S"' IIB(s)11 ds< 00. Let A1(t), ...,An(t) and Ab ...,An' ;'k=limAk(t), 
1175 be the eigenvalues of A(t) and A=limA(t), respectively. N. Levinson proved the fo11ow- ing theorem: Assume that AI' ..., An are mutu- ally distinct and Mjk(t)=ReI[Ais)-Ak(S)]ds satisfy either Mjk(t)--> 00 as t--> 00 and for each pair (j, k), Mjk(t2)-Mjk(t1);i - K for t 1 < t 2 , or Mjk(t)--> -00 as t-->oo and Mjk(t2)-Mjk(t1),,;;,K for t 1 <t 2 , or IM jk (t 2 ) - M jk (t 1 )I,,;;,K for a11 tj, t 2 , where K is a positive constant. Then (5) has a fundamental system of solutions {x 1 (t), ... , xn(t)} such that xj(t)=exp(I AiS)dS) [j+o(1)], j= 1, ...,n, where j is an eigenvector of A corresponding to Aj' D. Roundedness and Convergence of Solutions Consider again the linear system (5) satisfying S'XJ II A'(s) II ds < 00 and Ioo IIB(s)11 ds < 00. Suppose that a11 eigenvalues of A(t) have non positive real parts and that the eigenvalues of A = limA(t) whose real parts vanish are simple. Then a11 solutions of (5) are bounded. This result is a generalization, due to L. Cesari, of the so-ca11ed Dini-Hukuhara theorem. In the case of general A(t), it is known that not all solutions of (5) are bounded even if a11 solutions of (1) tend to zero as t --> 00 and if the matrix B(t) is such that Ioo IIB(s)11 ds < 00 and B(t)-->O as t-->oo. However, if A(t) is periodic or satisfies liminfRe I'tr A(s)ds> -00, then under the assumption that Ioo II B(s) II ds < 00, the boundedness of a11 solutions of (1) implies the boundedness of a11 solutions of (5). The fo11owing inequalities often provide useful information about the asymptotic behavior of solutions of (1): Ix(O)1 exp ( - I J1[ - A(s)] dS)";;' Ix(t)1 ";;'lx(O)lexp(I J1[A(S)]dS} tO, where J1[A(t)] = limh+o [III +hA(t)ll-l]/h. (J1[A(t)] was introduced by Lozinskii.) If limsup f J1 [A (s)] ds < 00, then aU solutions of (1) are bounded; if lim II J1[A(s)] ds exists, then for every sOlution x(t) of (1), Ix(t)1 tends to a finite limit as t--> 00; and iflim II J1[A(s)] ds = -00, then a11 solutions of (1) tend to zero as t--> 00. It can be shown that every solution of (1) tends to a finite limit as t-->oo, provided that IooIIA(s)1I ds<oo. If a11 solutions of (1) are bounded, then limsup Re Iltr A(s)ds < 00. IfIiminfRe jtr A(s) ds> -00, then (1) has a solution x(t) with the property that limsuplx(t)1 >0. When limlx(t)1 314 E ODEs (Asymptotic Behavior of Solutions) exists for every solution x(t) of (1), if there exists a nontrivial solution x(t) of (1) such that lim x(t) = 0, then lim Re I' tr A(s) ds = -00, but if there is no such solution, then Re f tr A(s) ds is bounded. E. Nonlinear Differential Equations Consider a system of nonlinear differential equations of the form x'=Ax+f(t,x), (6) where A is an n x n constant matrix and f(t, x) is an n-vector function that is continuous for t  0, I x I < L\, and that satisfies f( t, 0) = O. Sup- pose that f(t,x)/Ixl-->O as Ixl-->O and t-->oo. Then for every eventua11y nontrivial solution x(t) of (6) that tends to zero as t--> 00, J1 = lim t- 1 1oglx(t)1 exists and equals the real part of one of the eigenvalues of A. Conversely, if at least one eigenvalue of A has real part J1 < 0, then there exists a sOlution x(t) of (6) such that limt- 1 1oglx(t)1 = J1. Suppose that f(t, x)/Ixl-->O as Ixl-->O uniformly with respect to t. Then if all eigenvalues of A have negative real parts, the zero solution x(t) = 0 of (6) is asymptotica11y stable, and if A has an eigen- value whose real part is positive, then the zero sOlution of (6) is tunstable. Suppose that fx(t, x) = (8Jj(t, X)/8Xk)--> 0 as Ixl-->O uniformly with respect to t. In this case, if A is a matrix such that its k eigenvalues have negative real parts and the other n-k eigenvalues have positive real parts, then there exists a k-dimensional manifold S containing the origin with the fo11owing property: For to sufficiently large, any solution x(t) of (6) tends to zero as t--> 00, provided that x(to)eS, and ifx(to)S, x(t) cannot remain in the vicinity of the origin no matter how close x(t o ) is to the origin. In the nonlinear system x' = F(t, x), (7) suppose that F(t, x) is of period w with respect to t and has continuous partial derivatives with respect to x. Suppose, moreover, that (7) has a solution p(t) of period w. If an the tchar- acteristic exponents of the tvariational system of (7) with respect to p(t), y' = Fx(t, p(t))y with F x (t,x)=(8Fj(t,x)/8x k ), have negative real parts, then the periodic solution p(t) is asymptoti- ca11y stable. If an autonomous system x' = F(x) has a periodic sOlution p(t) and the corre- sponding variational system y' = Fx(p(t))y has n-l characteristic exponents with negative eigenvalues, then there exists an 1»0 such that for any solution x(t) satisfying Ix(t 1)- p(to)1 <I> for some to and tj, we have Ix(t)-p(t+c)I--+O as t--> 00 for a suitable choice of c (asymptotic phase). 
314 F ODEs (Asymptotic Behavior of Solutions) F. Scalar Differential Equations The aforementioned results can be specialized to the case of higher-order scalar (or single) ordinary differential equations. Much sharper results can often be derived through direct analysis of scalar equations themselves. In particular, detailed and deep results have been obtained for second-order linear differential equations of the form x" +q(t)x=O, e.g., tMathieu's equation. If ]X 5Iq(s)1 ds < 00, then (8) has a funda- mental system of solutions {XI (t), xz(t)} satisfy- ing, for t-> 00, x 1 (t)= 1 +0(1), x z (t)=t[1 +o(1)J, x'I(t)=l-lO(I), x(t)=I+o(I); if ]'" I q(8) + 11 ds < 00, then (8) has a funda- mental system of solutions satisfying x 1 (t)=e'[1 +o(l)J, x z (t)=e-'[1 +o(1)J, x'I(t)=e'[1 +o(1)J, x(t)= -1'-'[1 +o(1)J; and if J" Iq(s)-llds< 00, then (8) has a funda- mental system of solutions xl(t)=e it [1 +o(l)J, x z (t)=e- it [1 +o(I)J, X'1 (t)= ie it [1 + 0(1 )J, x(t) = - ie- it [1 + 0(1)]. Suppose that q(t)->c>O as t->oo and ]Xlq'(s)1 ds < 00. Then x(t) and x'(t) are bounded for every solution x(t) of (8). The same is true if q(t) is a positive periodic function of period w such that W]q(s)ds';;;4. Ifq(t) is nega- tive, then (8) always has both bounded and unbounded monotone solutions. The number of linearly independent solu- tions xlt) 01'(8) satisfying ]"'lx(sWds< 00 plays an important role in teigenvalue prob- lems. It is known that the ordinary differen- tial operator l[xJ = x" + q(t)x is of tlimit point type at infinity if there exist a positive func- tion M(t) and positive constants k p k z such that q(1),;;;k 1 M(t), IM'(t)M- 3 / z (t)1 ';;;k z , and f" M- 1 / z (s)ds= 00, and that l[xJ is oftlimit circle type at infinity if q(t»O,]OO q-l/z(s)ds= 00, and ]XI[q-3/z(s)q'(s)]' +(1/4)q-5/z(s)q,z(s)1 ds < 00. Finally consider the nonlinear equation x" + q(t)lxIYsgnx =0, where )' is a positive constant and q(t) is a positive function. If q'(t);:' 0, then all solutions of (9) are bounded; if either q'(1);:' 0 and lim q(t) < 00 or q'(t)';;;O and lim q(t) > 0, then all solu- tions x(t) of (9) are bounded together with their derivatives x'(t); and if q'(t);:, 0, Jim q(t) = 00 and either q"(1);:'O or q"(t)';;;O, then all solutions 01'(9) converge to zero as t->oo. 1176 Equation (9) is said to be oscillatory if every solution of (9) that is continuable to t = 00 has arbitrarily large zeros. If (9) is oscillatory and if q1 (t) ;:'q(t), then the equation x" + {j 1 (t)lxIYsgnx =0 is also oscillatory. When y= 1, (9) is oscil- latory if q(t);:'(1 +£)/4t Z for some 1:>0, and is not oscillatory if q(t)';;; 1/4t z . A necessary and sufficient condition for equation (9) with y =1= 1 to be oscillatory is as follows: ]00 sq(s) ds = 00 ify>I;]X s Y q (s)ds=oo ifO<y<1. (8) References [IJ M. Hukuhara, Sur les points singuliers des equations differentielles lineaires; domaine reel, 1. Fac. Sci. Hokkaido Univ., (I) 2 (1934), 13- 88. [2J R. Bellman, Stability theory of differential equations, McGraw-Hili, 1953. [3J L. Cesari, Asymptotic behavior and sta- bility problems in ordinary differential equa- tions, Springer, third edition, 1970. [4J W. A. Coppel, Stability and asymptotic behavior of differential equations, Heath, 1965. [5J P. Hartman, Ordinary differential equa- tions, Wiley, 1964. [6J E. A. Coddington and N. Levinson, Theory of ordinary differential equations, McGraw-Hill, 1955. [7J 1. S. W. Wong, On the generalized Emden- Fowler equation, SIAM Rev., 17 (1975),339- 360. 315 (XIII.4) Ordinary Differential Equations (Boundary Value Problems) A. General Remarks Consider the differential equation in the real variable x f(x, y, y', ... ,y(n)) = O. (1) (9) Let aI' . .. ,a k be points in an inter val Ie Rand consider several relations between nk values y(a,), y'(a,), ..., y(n-1)(a i ), i = 1, ..., k. The prob- lem of finding solutions of (1) satisfying these relations is called a boundary value problem of (I), and the relations considered are called boundary conditions. When k=2 and aI' a z are the endpoints of I, the problem, called a two- point boundary value problem, has been a main subject of study. We can consider boundary value problems in the same way for systems of differential equations. 
1177 B. Linear Differential Equations Consider a linear ordinary differential opera- tor L defined by L[y] = po(x)y(n) + Pl (x)y<n-1) +... + Pn(x)y, where Pk(X) is a complex-valued function of class en- k defined on a compact interval axb and Po(x)#O for any xE[a,b]. We define a system of linear boundary operators U b ... , Urn by n n U;[y] = L Mijy(j-1)(a) + L Nijy(j-1)(b). j=1 j1 Given a function f(x) and complex constants Y1, '" , Yrn' the linear boundary value problem defined by L[y]=f(x), Uj[Y]=Yj, i=I,...,n, (2) is a two-point boundary value problem. When f(x)=O, Yj=O, i = 1, ..., n, the problem is caned homogeneous; otherwise it is caned inhomoge- neous. Let L * [y] be a formany tadjoint dif- ferential operator of L [y]. A set of m* linear boundary conditions U/ [y] = 0, i = 1, '" ,m*, is said to be an adjoint boundary condition of U;[y] = 0, i = 1, ..., m, iffor any function y of class en satisfying U j [y] = 0, i = 1, ... , m, and any function y* of class en satisfying U j * [y*] =0, i= 1,..., m*, we have S:L[y] y* dx= S yL*[y*] dx. The boundary value problem L*[y] =0, Uny] =0, i= 1, ...,m*, (3) is said to be an adjoint bonndary value problem of L[y]=O, U;[y] =0, i=I,...,m. (4) We say that the problem (4) is self-adjoint if L [y] = L * [y] and the conditions U j [y] = 0, i = 1, ..., m, are equivalent to the conditions Uj*[y] =0, i= 1, ...,m*. The boundary value problem containing a parameter A L[Y]=AY, U;[y] =0, i=I,...,n, (5) admits nontrivial sOlutions only for special values of A.. Such values of A are caned the eigenvalues (or proper values) of (5), and the corresponding solutions *0 are caned the eigenfunctions (or proper fnnctions) of (5). For any value of A that is not an eigenvalue, there exists a unique function G(x, , A) such that the conditions L[y]= AY + 1, U;[y] = 0 are equiva- lent to y = S: G(x, ,A)f() d. The function G(x, , A) is caned the Green's function of (5). If A=O is not an eigenvalue, then (5) is equivalent to Y(X)=A r G(x,)y()d, 315 C ODEs (Boundary Value Problems) where G(x, )= G(x, ,O). For the Green's function G(x, , A) of (5) and the Green's func- tion G*(x, , A) of L * [y] = AY, Uny] = 0, we have the relation G(x,, A)= G*(, x, X). Under the assumption that (5) is self-adjoint, we have the fonowing four propositions: (i) Problem (5) has only real eigenvalues which form a finite or countably infinite discrete set; (ii) two eigen- functions corresponding to two different eigen- values are orthogonal to each other; (iii) if {<"Pn} is an torthonormal set of eigenfunctions such that no eigenfunction is linearly independent of {<"Pn}, then the system {<"Pn} is a complete orthonormal set in the Hilbert space L 2 (a, b) consisting of functions that are square inte- grable on (a, b), and hence for the tFourier expansion f =a 1 <"Pl +a 2 <"P2 +... of any fE L 2 (a, b) the tParseval equality hOlds; and (iv) if f is a function of class en satisfying Uj[f] = 0, then the Fourier expansion of f converges uniformly to f on [a,b]. The boundary value problem for a second- order equation (p(x)y')' + (q(x) + Ar(x))y=O, lXy(a) + py'(a) =0, yy(b) + <>y'(b) =0 is caned a Sturm-Liouville problem, where p, q, r are real-valued functions defined on [a, b] and IX, 13, y, are real constants. Suppose that P, q, r are continuous and p(x»O, r(x»O on [a, b]. Then (i) the eigenvalues form an increas- ing sequence tending to +00; (ii) the eigenfunc- tion <"Pn(x) associated with An has precisely n zeros in a < x < b, and there exists between two adjacent zeros of <"Pn(x) a zero of <"Pn-1 (x); and (iii) the set of eigenfunctions is an orthogonal set on [a,b] with weight function r(x): r r(x)<"Prn(x)<"Pn(x)dx=O, m#n. When the coefficients Po, "', Pn of L are defined in an open interval - 00  a < x < b  00 and Pk is of class e n -\ L defines in a natural way operators in the Hilbert space consisting of functions that are square inte- grable in a < x < b, and the general theory is based on operator theory in Hilbert space ( 390 Spectral Analysis of Operators). C. Nonlinear Differential Equations Boundary value problems for nonlinear dif- ferential equations are very difficult, and re- sults are obtained only for equations of special form. Consider, for example, the second-order equation y" = f(x, y, y') (6) 
315 Ref. ODEs (Boundary Value Problems) and boundary conditions y(a) = A, y(b) = B. The following theorem has been proved: Suppose that f(x, y, y') is continuous for a x b, (()(x)yw(x), -00 <y' < +00, and If(x,y,y')IM(l + y'2); (()"(x) > f(x,(()(x), (()'(x)) and w" <f(x,w(x), w'(x)) for axb; and (()(a)Aw(a) and (()(b)Bw(b). Then (6) admits a solution y(x) such that y(a) = A, y(b) = B, and (()(x)  y(x)  w(x) for a  x  b. If in addition f is an increasing function with respect to y, the solution is unique. More- over, under suitable conditions, the solution is obtainable by the method of successive approximations. The boundary value problem y'" + 2yy" +2.1e(k 2 - y'2)=0, y(O) = y'(O) =0, y'(x)-->k (x-->oo), where .Ie and k are constants, appears in the theory of fluid dynamics. It is known that if .Ie;:, 0, the problem has a solution, and that if o .Ie  1, the solution is unique. Consider the system of differential equations yj = jj( x, Y 10 ... , y n), j = 1, ... , n. The problem of finding a solution such that Yiaj)=bj,j= 1,..., n, called Hukuhara's prob- lem, reduces to the initial value problem when the a j coincide. The problem of solving yen) = f(x, y, y', ..., y(nI)), y(a)=b j , j= 1,..., n, is reduced to Hukuhara's problem by a suit- able change of variables. The following result is a generalization of tPerron's theorem: Let (()j(x), wix),j= 1,... ,n, be continuous and right and left differentiable functions and CtJj(x)  wix) for ax b. Suppose that the .Ij(x, YI' ..., Yn) are continuous for rx  x  f3 and %(x)  Yk wix), k = 1, "', n; satisfy (x - aj)(D Iwix)- jj(X,YI,,,, ,Yn));:'O for yj=Wj(x) and %(x) YkWk(X), ki=j; satisfy (x-a)(DICtJj(x)- .Ij(x, YI' ...,Yn))O for Yj=CtJj(x) and (()dX)Yk wk(x), k i= j; and satisfy CtJj(a j )  b j  wj(a j ). Then there exists a solution y(x) such that Yj(a)=b j and (()ix)y(x)Wj(x). This theory was ap- plied by M. Hukuhara to the study of singular points of ordinary differential equations. References [lJ E. A. Coddington and N. Levinson, Theory of ordinary differential equations, McGraw-Hill, 1955. [2J P. Hartman, Ordinary differential equa- tions, Wiley, 1964. [3J E. Hille, Lectures on ordinary differential equations, Addison-Wesley, 1969. 1178 [4J M. A. Naimark (Neumark), Linear dif- ferential operators I, II, Ungar, 1967, 1968. (Original in Russian, 1954.) [5J N. Dunford and 1. T. Schwartz, Linear operators II, Interscience, 1963. [6J B. M. Levitan and I. S. Sargsyan (Sargsjan), Introduction to spectral theory: Self-adjoint ordinary differential operators, Amr. Math. Soc. TransI. of Math. Monographs, 1975. (Original in Russian, 1970.) 316 (XIII.3) Ordinary Differential Equations (Initial Value Problems) A. General Remarks Consider a system of ordinary differential equations dyddx=/;(x,yj, ...,Yn), i= 1, ...,n. (1) A. L. Cauchy first gave a rigorous proof for the existence and uniqueness of solutions: If /;, i = 1,..., n, and their derivativ(:s a/;jaYk are continuous in a neighborhood of a point (a,b l , ...,b n ), then there exists a unique solu- tion of (1) satisfying the conditions Yi(a) = b;, i = 1, .., ,n. These conditions are called initial conditions, and the values a, bj, "', b n initial values. The problem of finding solutions that satisfy initial conditions is caned an initial value problem (or Cauchy problem) If we con- sider (x, Yj, "', Yn) as the coordinates of a point in the (n + I)-dimensional space Rn H, then a solution of (1) represents a curve in this space caned a solution curve (or integral ('urve). The statement that a solution satisfies initial con- ditions Yi(a) = bj, i = 1, ..., n, means that the integral curve represented by it passes through the point (a, b l , "', b n ). Since, in general, we can transform a dif- ferential equation of higher order into a sys- tem of differential equations of the form (1) by introducing new dependent variables, all defi- nitions and theorems concerning the system (1) can be interpreted as applying to a higher- order equation. For example, for the equation yen) = f(x, y, y', '.. ,y'n-I)) the conditions y(a) = b, y'(a) = b', "', y(n-I)(a)= b(n-I) constitute initial conditions, and the values a, b, b', ..., b(n-I) are initial values. Iff and its derivatives at/ay(j) are continuous, then there exists a unique solution satisfying given initial conditions. Suppose that the /; are continuous. Then a system offunctions (YI(X), ...,Yn(XI) is a solu- 
1179 tion of (1) if and only if y,(x)=b i + r j;(X'Y1(X), ...,Yn(x))dx, j = 1,..., n. When the J; are not continuous, we define (Yl (x), ..., Yn(x)) to be a s01ution of (1) for the initial value problem YI(a) = b l if (Ydx), ..., Yn(x)) satisfied the integral equation just given. We use the vectorial notation: Y=(Yl'''' Yn), f =(fl, .., ,In) together with IIYII 2 = YI + ... + y;. The equations (1) are then written as the single equation y' = f(x,y). B. Equations in the Real and Complex Domains We state main theorems for differential equa- tions in the real domain in Sections C- F and in the complex domain in Section G. C. Existence Theorems Suppose that f(x, y) is continuous for Ix -al r and Ily-bil p, and that IIf(x,Y)11 M there. Then equation (1) admits a solution satisfying y(a)=b and defined in an intervallx-al min(r, p/M) (existence theorem). There are two methods of proving this theorem, one using Cauchy polygons and one using tfixed- point theorems for function spaces. From this theorem we deduce that iff(x, y) is continuous in a domain D of R n +1, then there exists a solution curve passing through any point of D. Let y=1/11(X) and y=1/12(X) be solutions of (1) defined in the intervals II and 1 2 , respectively. If II eI 2 and 1/11 (x) = 1/12 (x) for xEl 1 , we say that 1/12 is a prolongation or extension of 1/11' Given a s01ution of (1), there exists a nonex- tendable solution that is an extension of the solution. The solution curve of a nonexten- dable solution tends to the boundary ofD as x tends to anyone of the ends of its interval of definition. D, Uniqueness Theorems Continuity does not imply uniqueness of the solution. If (1) admits at most one solution satisfying a given condition, we call this con- dition a uniqueness condition. Various kinds of uniqueness theorems, which state uniqueness conditions, are known. The Lipscbitz condition: Ilf(x, y)-f(x, z)11 L Ily-zll, L>O a constant, 316E ODEs (Initial Value Problems) is one of the simplest. When f is continu- ous and satisfies the Lipschitz condition, the method of successive approximation, initiated by C. E. Picard, is often used to prove the existence of s01utions. This method is as f01- lows: We choose a suitable function, for ex- ample yo(x)=b, and then define Yk(X), k= 1, 2,. ", recursively by Yix)=b + J:f(X'Yk-l(X»)dx. Then {Yk(X)} is uniformly convergent, and its limit is a solution of (1) satisfying y(a)=b. Assuming the continuity of f, H. Okamura gave a necessary and sufficient condition for uniqueness: Suppose that f is continuous in D. Then a necessary and sufficient condition for there to exist a unique solution curve of (1) going from any point of D to the right is that there exist a C 1 -function cp(x, y, z) defined for (x, y, z) such that (x, y) and (x, Z)E D and satisfying the conditions cp(x, y, z) = 0 for y = z, cp(x, y, z) > 0 for y #- z, and acp acp acp -+ L - j;(x,y)+ L -J;(x,z)O. ax ,aYi i aZ i E. Perron's Theorem Consider the scalar equation y' = f(x, y). We have Perron's theorem: Let (Q(x) and w(x) be continuous functions that are right differenti- able in IXX <P and satisfY(Q(x)w(x), and let f be a continuous function defined on D: IX  x<p, (Q(x)yw(x). Suppose that D+(Q(x) f(x,(Q(x)) and D+w(x)f(x,w(x)). (D+w de- notes the tright derivative of w.) Then for any (a, b) E D there exists a solution defined on a  x < {3 and satisfying y(a) = b. The fact that the interval of definition is a  x < p can be expressed by saying that if we denote the set 00 <x < p, lyl < 00 by 0, then D is closed in 0 and there exists, among solution curves going from a point in D to the right, a curve that reaches the boundary of O. Perron's theorem was generalized by M. Hukuhara and M. Nagumo. Let 0 be an open set in R n +1, D a closed set in n, and f a con- tinuous function in D. A necessary and suffi- cient condition for (1) to admit a solution curve going from any point (a, b) in D to the right is that there exist a sequence of points in D, {(ak,b k )}, such that akta and (bk-b)/(ak-a) -->f(a, b). Moreover, every solution curve is prolonged to the right to the boundary of O. Let SlY) be a continuous tsubadditive and positively homogeneous function and w(x) a function continuous and right differentiable on IX x < p. A sufficient condition for D: IX  X < {3, Sly) w(x) to possess the property in the statement of Perron's theorem is given by D+ w(x)  S(f(x, y)) for II yll = w(x). A continuous function w(x) is said to be a 
316 F ODEs (Initial Value Problems) right majorizing function of (1) with respect to S(y) if for any solution qJ(x), S(qJ(a)):s; w(a) implies S(qJ(x)):s; w(x) for x a if both S(qJ(x)) and w(x) are defined. In order for w(x) to be a right majorizing function, it suffices that D+ w(x) > S(f(x, y)) for Ilyll =w(x). A function satisfying this inequality is called a right su- perior function of (1) with respect to S(y). If F(x, S(y)) > S(f(x, y)), then any solution of y' = F(x, y) is a right superior function of (1). Theorems stating such facts are called com- parison theorems. If (1) has a unique solution, the condition D+w(x):;;'S(f(x,y)) for S(y)=w(x) implies that w(x) is a right majorizing function of (1). Con- versely, we can derive from comparison theo- rems general uniqueness theorems, one of which we state. Suppose that G(x, y) is continu- ous for rJ.<x<{3 and O:s;y<r(x); G(x,O)=O; a solution of y' = G(x, y) such that y = o(r(x)) as x->rJ.+O vanishes identically; and finally that S(f(x'Y1)-f(x'Y2)):S;G(x,S(Y1 -Y2))' Then for two solutions qJ1, qJ2 of(1) such that S(qJ1-qJ2) =o(r(x)) as x->rJ.+O, we have qJ1 =qJ2' Assum- ing that f is continuous at (a, b) and taking yj(x-a) as G, we obtain Nagumo's condition (x-a)S(f(x, Y1)-f(x, Y2)) :S;S(Y1 -Y2)' G. Peano proved the following theorem: With the same notation and assumption as in Perron's theorem, there exist a maximum solution ip and a minimum solution p of y' = f(x, y) such that y(rJ.)=b for (()(rJ.):S; b:s; w(rJ.), and such that there exists a solution curve passing through any point in rJ.:S; X < {3, p(x):S; y:S;ip(x). This theorem was extended by Huku- hara as follows. Suppose that f(x, y) is con- tinuous and bounded in D:rJ.:S;x:S;{3, IIYII < 00. Let C be a tcontinuum in D, and let (C) de- note the set of solutions intersecting C. Then (C) is a continuum of the tfunction space C([rJ., {3J). From this theorem we can deduce the Kneser-Nagumo theorem, which says that the intersection of the set of points belonging to the mem bel's of (C) and a hyperplane x =  is a continuum. It was proved by Hukuhara that if C is in the hyperplane x = rJ., then (1) admits a solution connecting the two hyper- planes x = rJ. and x = {3 and passing through the boundary of the set of points belonging to the members of (C). F. Equations Containing Parameters We assume uniqueness of the solution of y'=f(x,y,A), A=(Aj,...,Am), where f is a continuous function of (x, y, A). Let q>(x, a, b, 1) denote the solution of (2) satisfying y(a) = b. Then qJ(x, a, b, A) is continuous with respect to (x, a, b, A) in its region of definition. 1180 If the derivatives afjoYk are also continuous, then qJ(x, a, b, A) is a continuously differentiable function of (x, b); Zjk = o({!)abbJ = 1, ..., n, satisfy the system of linear ordinary differential equations and the initial condition d _ n ( afj ) - d Zjk- L _ a Zib x 11 YI zjd a ) = D jk ; and Zj = a({!)aa, J = 1, ... , n, satisfy the same system with the initial condition z)a) = - fj(a, b, A), where (afjjaYI) means (aJjjaYI) (x, qJ(x, a, b, A), A). If f further admits continu- ous derivatives afjaie l , then qJ(x, a, b, A) is con- tinuously differentiable with respect to AI, and moreover, w jl = a({!jjaAI,J = 1, ..., n, :;atisfy the system Wjl= f ( afj ) Wkl+ ( afj ) , dx k1 aYk a)'1 These differential systems are called the varia- tional equations of (1). C. Caratheodory proved the existence of solutions of (1) under the less restrictive as- sumption that f is continuous with respect to y for any fixed x and measurable with respect to x for any fixed y. Suppose that f is continuous and satisfies a Lipschitz condition. Let z(x) be a function such that z(a)=b and Ilz;(x)- j;(x,z(x))11 :S;£(x), and let y(x) be a solution of (1) such that y(a) = b. Then we obtain wjlla) = 0. Ilzi(X)-Yi(X)11 :s;eL,x-a'lf £(x)e-LIX-aldxl, which gives approximate solutions of (1). G. Equations in the Complex Domain (2) We assume that the variables x, Y l' ... , Yn all have complex values. We have the following theorem: If f is holomorphic at (a, IJ.), then (1) has a unique solution that is holomorphic at x = a and takes the value b at x =, a. This theorem can be proved by utilizing the method of successive approximations and fixed-point theorems. Cauchy proved the theorem by using majorant series. This method., called the method of majorants, proceeds for the scalar equations as follows: Let f(x, y) =: ajdx- a)j(y-W and y=1:c n (x-a)". Substituting the latter series into both members. we can successively determine the coefficients C n by the method of undetermined coefficien1s. Assum- ing that If I < M for Ix -al < rand Iy-bl <p, consider the solution Y = 1: C,,(x - a)" of dY M dx (1-(x-a)jr)(1-(Y -b)jp) satisfying Y(a)=b. We have Cnlc,,1 for any n, 
1181 which shows that :l:C.(x-a)" is a majorant series of:l:c.(x-a)". We have the fo11owing uniqueness theorem: Suppose that f(x, y) is holomorphic at (a, b). Let C be a curve having the point a as one of its ends and tp(x) be a solution with the fo11ow- ing properties: tp is holomorphic on C except possibly at x = a, and there exists a sequence of points on C, {a k }, such that ak-->a and tp(a k )--> b. Then tp(x) is holomorphic at a. By a ttheo- rem of identity, the analytic continuation of a solution continues to be a solution if it does not encounter any singularity of f. If a solution tp(x) is holomorphic on a smooth curve x = X(t), with 0  t  to and X(O) = a, and tp(a)=b, then y=tp(X(t)) is a solution for O t  to of y' = X'(x)f(X(t), y) satisfying y(O) = b. Conversely, if (3) has a solu- tion y = 'fI(t) defined in 0  t  to and satisfy- ing y(O) = b, and if f(x, y) is holomorphic at (X(t), 'fI(t)) for 0  t  to, then (1) has a solution tp(x) holomorphic on C and 'fI(t) = tp(X(t)) for Otto' Suppose that f= frlf2' where f1 and f2 are holomorphic at (a, b). If f1(a, b) # 0,f2(a, y) ¥=O, and f2(a, b)=O, then the equation y' = f(x,y) admits a unique solution such that y-->b as x-->a, and this solution can be expanded into a tPuiseux series: 00 y = L: c.(x - a)"/'. ,,=0 Iff is holomorphic at (a, b), then the solu- tion y=tp(x, x o , Yo) of (1) satisfying y(x o ) = Yo is holomorphic with respect to (x,x o , Yo) at (a, a, b). If f(x, y, 1) is holomorphic at (a, b, 1 0 ), then the solution of (2), y=tp(x,x o , Yo, 1), satis- fying y(x o ) = Yo is holomorphic at (a, a, b, 1 0 ), Suppose that x is a real variable and y is a complex vector. Iff is continuous with respect to (x, y) and holomorphic with respect to y, then the solution tp(x, x o , Yo) is holomorphic with respect to Yo. If f(x, y, 1) is continuous with respect to (x, y, 1) and holomorphic with respect to (y,l), then tp(x, x o , Yo, 1) is holomor- phic with respect to (Yo, 1). References [1] E. Kamke, Differentialgleichungen ree1ler Funktionen, Akademische Verlag, 1930. [2] E. A. Coddington and N. Levinson, Theory of ordinary differential equations, McGraw-Hi11, 1955. [3] P. Hartman, Ordinary differential equa- tions, Wiley, 1964. [4] H. Okamura, Condition necessaire et suffisante remplie par les equations differen- 317 A Orthogonal Functions tie11es ordinaires sans points de Peano, Mem. Col1. Sci. Univ'. Kyoto, (A) 24 (1941). [5] O. Perron, Ein neuer Existenz Beweis fUr die Integrale der Differentialgleichung y' = f(x,y), Math. Ann., 15 (1945). [6] M. Hukuhara, Sur la theorie des equations differentie11es ordinaires, J. Fac. Sci. Univ. Tokyo, sec. I, vo1. VII, pt. 5, 1958. [7] M. Nagumo, Uber die Lage der Integral- kurven gewohnlicher Differentialgleichungen, Proc. Phys.-Math. Soc. Japan, 24 (1943). [8] E. Hi11e, Ordinary differential equations in the complex domain, Wiley, 1976. [9] R. Abraham and J. E. Marsden, Founda- tions of mechanics, Benjamin-Cummings, second edition, 1978. (3) 317 (X.21) Orthogonal Functions A. Orthogonal Systems Let (X,)1) be a tmeasure space. For complex- valued functions J, 9 on X belonging to the tfunction space L 2 (X), we define the inner product (J,g)= fxf(x) g(x) d)1(x) and the norm Ilfll = (J,1)1/2. If(J,g)=O, then we say thatf and 9 are orthogonal on X with respect to the measure )1. If X is a subset of a Euclidean space and )1 is the tLebesgue measure m, then we simply say that they are orthogona1. If the measure has a tdensity function cp(x) with respect to the Lebesgue measure and (J, g) = fxf(x) g(x) <p(x) dm(x) =0, we say that they are orthogonal with respect to the weight function cp(x). If IIfl1 2 = 1, then I is said to be normalized. A set of functions {I. (x) } (n = 1, 2, . . . ) is said to be an orthogonal system (or orthogonal set), and we write {In} EO(X), if any pair off unctions in the set are orthogona1. The orthogonal set {f,,(x)} is said to be ortho- normal, and we write {In} EON(X) if each In is normalized. Let {In} be a set of linearly independent functions in L 2 (X), and let R be a subset of L 2 (X). If we can approximate any function fER arbitrarily closely by a finite linear com- bination of the fn(x) with respect to the norm in L 2 (X), we say that {f,,} is total in R. Let {In} EO(X). If (cp,f,,)=O for a11 n implies cp(x) =0 almost everywhere (a.e.), then {In} is said to be complete in L 2 (X). An orthogonal system {In} is complete in L 2 (X) if and only if the system is total in L 2 (X). If {In} E O(X), then the series :l:'%1 cnf,,(x) is ca11ed an orthogonal series. If the series con- verges to cp(x) tin the mean of order 2, then C n = (cp, f,,)/llfnI1 2 . We ca11 the c. (n = 1, 2, ...) the 
317 B Orthogonal Functions expansion coefficients or Fourier coefficients of cp(x) with respect to {In}. If {gn} C L 2 (X) are linearly independent, we can construct an orthonormal system {fn} by forming suitable linear combinations of the gn; U;'} spans the same subspace as {gn}' For this purpose we set fdx)=gdx)/llg111, f,,(x) = CPn(x)/11 CPn II, where CPn(x) = gn(x)- I:::::[(gnJv)fv(x), n  2. This procedure is caned Schmidt orthogonalization or Gram- Schmidt orthogonalization. If the C are Fourier coefficients of cp(X)E L 2 (X) with respect to {In(X)} EON(X), then we have the tBessel inequality I::=llc n I 2  11<1111 2 . Equality in the Bessel inequality for an cpEL 2 (X) (the tParseval identity) is equivalent to completeness of {In} in L 2 (X). In this case I::=1 cnf,,(x) is caned the orthogonal expansion of cP with respect to {In}, and conversely, for any sequence {cn} such that I:lc n I 2 < 00, there is a function cP E L 2 (X) that has the C n as its Fourier coefficients, and f Ic n I 2 =llcpI12, n=l <11 (x) = 1}... Ct1 cmfm(x) ). This is caned the tRiesz-Fischer theorem. B. Orthogonal Systems on the Real Line We assume that X is a finite interval (a, b) and that functions on X are real-valued. We write O(a, b) or ON(a, b) instead of O(X), ON (X). (1) If {In} EON(a, b), Ifn(x)1 M (const.), and I:cnf,,(x) converges ta.e., then cn-+O as n-+oo. (2) We can construct a complete orthonormal system {In(x)} and a function cp(x)ELda, b) such that its orthogonal expansion I:cnfn(x) diverges everywhere. (3) If {f,,(x)} EON(a, b) and I:c;log 2 n< 00, then I:cnfn(x) converges a.e. The factor log2 n cannot be replaced by any other monotone increasing factor w(n) satisfying 0  w(n) = o(log 2 n) (Rademacher- Men'shov theorem). K. Tandori proved that if cnto and I:cnCPn converges a.e. for any ortho- normal system {CPn}, then I: Ic n l 2 10g2 n < 00. (4) If the orthogonal expansion of a function cP E L 2 (a, b) is tsummable by Abel's method on a set E, then it is t(C, 1)-summable a.e. on E. (C, 1 )-summability a.e. of the orthogonal ex- pansion of a function of cP E L 2 (a, b) is equiva- lent to convergence a.e. of the partial sums S2.(X) (n = 1, 2, ... ) of its expansion. (5) Sup- pose that {In(X)} EON (a, b), Ifn(x)IM. Then: (i) If the an are the Fourier coefficients of cp(x) with respect to {In(x)}, then ( ) IIP' (f b ) IIP nllanIP' M(2-p)IP II Icp(xjlPdx : 1182 where 1 <p2, l/p+ l/p'= 1. Conversely, if (I: la n l P )llp < 00 (1 <p 2), there exists a func- tion cp(x) which has the an as its Fourier coeffi- cients and such that (f b ) IIP' ( 00 ) I IP II Icp(xjIP' dx M(2-p)/p nllanlP (F. Riesz's theorem). When the orthonormal system is the trigonometric system, this is caned the Hausdorff-Young theorem. (ii) Let {an be the decreasing rearrangem(:nt of {Ianl}; then J1 a: P n P - 2 "'-;;A p f'CP(X)IPdX (1 <p2). If q  2 and I: a: q n q- 2 < 00, then thtre exists a function cp(x) which has the an as its Fourier coefficients and such that f b 00 II Icp(xWdxAqn1 a: q n q - 2 (Paley's theorem). When the system is trigono- metric, this is caned the Hardy-Littlewood theorem. (6) If for some positive E we have I:lc n I 2 -'< 00, then I:cnfn(x) converges a.e. (7) If we set s* (x) = sUPn I I:=1 cJv(x)l, then Ils*llqAq(I:cqvq-2)llq (q>2), wh(:re {c:} is the decreasing rearrangement of {Icnl}. C. Examples of Orthogonal Systems (1) {cosnx} EO(O,n), {sinnx} EO(O, /L). (2) {1,cosnx,sinnx} EO(O, 2n) ( 159 Fourier Series). (3) Suppose that A(x) is positive and continuous, and let Yn(x) be solutions of y"(x) + AnA(x)y(x) =0 satisfying the condition Yn(a) = Yn(b)=O, where An is any teigenvalue. Then { J A(x) Yn(x)} EO(a, b) (for orthogonality of eigenfunctions  315 Ordinary Differential Equations (Boundary Value Problems) B). (4) Set rn(x) = -lor 1 according as th(: nth digit of the binary expansion of x (0 < x",-;; 1) is 1 or 0, and rn(x) =0 if x is expandable in two ways. Then {rn(x)} E 0(0, 1). This is caned Rade- macher's system of orthogonal functions. The system is not complete, but it is interpreted as a tsample space of coin tossing. Rade- macher's system is useful for constructing various counter-examples. (5) Let the binary expansion of n be n = 2 V, + 2 V ' + .., + 2 V p (VI < V 2 <... < v p ), and set wn(x) =r V, +1 (x)r V2 +1 (x) ... Tv +1 (x). Then {wn(x)} is a complete or- tho;ormal system caned Walsh's system of orthogonal functions. This system is interpreted as a system of characters of the group of binary numbers, and there are many theorems for this system analogous to those for the 1 rigono- 
1183 metric system. (6) In the interval [0,1], set x(x) = j?, =-j?, =0, xE((k-l)j2 m , (k-lj2)j2 m ) xE((k-lj2)j2 m , kj2 m ) xE((I-l)j2 m , Ij2 m ), l"fk, 1 12m. The orthonormal system X(x) (1  k  2 m , 1  m) is called Haar's system of orthogonal functions. The Haar expansion of the continu- ous function f(x) converges to f(x) uniformly. D. Orthogonal Polynomials (Appendix A, Table 20) Suppose that we are given a weight function CP(x);;;oO (cp(x) >0 a.e.) defined On (a, b) and that the inner product offunctions f, 9 on (a, b) is defined by (f, g) = Jf(x)g(x)cp(x) dx. We ortho- gonalize {x'} by Schmidt orthogonalization and obtain polynomials Pn(x} of degree n. Here the sign of P.(x) can be determined so that the sign of the coefficient of the highest power of x is positive. We call {P.(x)} the system of orthogonal polynomials belonging to the weight function cp(x). This system is complete in L<{')(a, b), which is defined to be the space offunctions f such that Jlf(xWcp(x)dx< 00. In other words, the system { J cp(x) P.(x)} is a complete orthonormal system in the ordinary L 2 (a, b) space. Concerning the convergence problem of the orthogonal expansion by {Pn(x)}, the Christoffel-Darboux formula f (t) ( ) _ an . Pn(X)Pn+1 (t) - P.(t)Pn+1 (x) L..., Pk Pk X -- k=O a.+! t-x plays an important role. Several important special functions in clas- sical mathematical physics are given by ortho- gonal polynomials: (1) Setting cp(x)=(I-x)"(1 +4 (a> -1,[3> -1) in [ -1, 1], we get the Jacobi polynomials, although they are sometimes defined in [0,1] with respect to cp(x) = x"(1 - x)P ( Appendix A, Table 20.V). If we set a = f3 in the Jacobi polynomials, we get the ultraspherical poly- nomials (or Gegenbauer polynomials) ( Appendix A, Table 20.1). Furthermore, if a = [3 =0, then we get the tLegendre polynomials, and if a = [3 = -lj2, we get the Chebyshev polynomials T,.(x) = cos(n arc cos x). The T,.(x) also appear in the best approximation prob- lem ( Appendix A, Table 20.1I; 336 Poly- nomial Approximation). (2) If we set cp(x)=x"e- X in (0, (0), we get the Sonine polynomials (or associated Laguerre polynomials) with appropriate constant factors. 317 Ref. Orthogonal Functions If a is a positive integer m, we get sm)(x) =x-me x ( :" (xn+me- X »). The particular case m = 0 gives the Laguerre polynomials. In this case, however, it is cus- tomary to normalize them as L.(x) = (eXjn!)(d'jdxn)(x'e- X ) ( Appendix A, Table 20.VI). Laguerre polynomials are used in tnumerical integrations of a Gaussian type in (0, (0). Furthermore, associated Laguerre polynomials appear in the solutions of the Schrodinger equation for the behavior of hydrogen atoms. This system of orthogonal polynomials is useful in the expansions of approximate eigenfunctions of atoms analo- gous to hydrogen, velocity distribution func- tions of molecules in gas theory, and so on. (3) Setting cp(x)=e- x2 (or e- X2(2 ) in (-00, (0), we get Hermite polynomials H.(x) = (_I)'e X2 (d'e- X2 jdx'), modulo constant fac- tors ( Appendix A, Table 20.VI). Hermite polynomials are special cases of parabolic cylinder functions ( 167 Functions of Con- fluent Type). These polynomials appear as eigenfunctions of the Schrodinger equation for harmonic oscillators. They are also connected with probability integrals and are used in mathematical statistics. (4) Replacing the integral by a finite sum I:=of(m) g(m) in the definition of inner pro- duct, we get so-called orthogonality for a finite sum. (Regarding orthogonal polynomials with respect to a finite sum ( Appendix A, Table 20.VII) and their application to the mean square approximation  19 Analog Compu- tation F.) Since orthogonal polynomials with respect to a finite sum are often called simply orthogonal polynomials by engineers, one must be careful not to confuse these with the ordinary ones. References [1] S. Kaczmarz and H. Steinhaus, Theorie der Orthogonalreihen, Warsaw, 1935 (Chelsea, 1951). [2] G. Szego, Orthogonal polynomials, Amer. Math. Soc. Colloq. Pub!., 1939. [3] R. Courant and D. Hilbert, Methods of mathematical physics I, Interscience, 1953. [4] F. G. Tricom Vorlesungen tiber Ortho- gonalreihen, Springer, 1955. [5] G. Alexits, Convergence problems of or- thogonal series, Pergamon, 1961, revised trans- lation of the German edition of 1960. [6] G. Sansone, Orthogonal functions, Inter- science, revised English edition, 1959. 
318 A Oscillations 318 (XX.14) Oscillations A. General Remarks A vibration or oscillation is a phenomenon that repeats periodically, either exactly or ap- proximately. Exactly periodic oscillations are studied in the theory of tperiodic solutions of differential equations. The tperiod of a solu- tion f(t) is called the period of the oscillation and its reciprocal the frequency. The differene between the greatest and least values of f(t) (globally or in an interval) is the amplitude. The theory of vibrations has its origin in the study of mechanical vibrations, but its nomen- clature has been used also for electric circuits. As examples of practical applications of the theory of oscillations, we mention, in engineer- ing, the prevention of vibrations and the gen- eration of stable sustained oscillations, and in geophysics, investigations concerning the free oscillation of the earth, the existence of which has recently been confirmed. B. Linear Oscillation Periodic solutions of tlinear differential equa- tions have been studied in detail for a long time. Perhaps the simplest case of such an oscillation is represented by the differential equation d 2 x/dt 2 + n 2 x =0, where the restitutive force is proportional to the displacement from the equilibrium posi- tion. Typical examples are the free vibration of a simple pendulum with small amplitude and an electric circuit composed of a self- inductance and capacity (without resistance). The solution is given by x = A cos(nt + IX). This is called harmonic oscillation or simple har- monic motion. Here the amplitude is A, the period is 2n/n, n is the circular frequency, and IX is the initial phase. A system of m tdegrees of freedom (x 1, '" , x m ) is said to be in free harmonic oscillation if the coordinates can be expressed as m Xi= L Aikcos(nkt+lXk), i=I,2,...,m. k1 Each of these simple harmonic oscillations is called a normal vibration. As a limiting case, where the number of degrees of freedom is infinite, we have the vibration of a string: 8 2 u(x, t) 2 a 2 u(x, t) -=n- 8t 2 Dx2 ' u(O, t) = u(l, t) = 0. 1184 The solution is given by a series L Ak sin(knx/l) cos(knnt/l), k which is just the superposition of tt,e funda- mental vibration (corresponding to k= 1) and simply harmonic motions of frequencies equal to multiples of the fundamental frequency. If a resisting force proportional to the veloc- ity is acting, the equation becomes d 2 x/dt 2 + 2edx/dt + n 2 x =0, n > E, (2) whose solution x = Ae-" cos(O"t+ IX), 0"= Jn 2 -e 2 (3) (1) is not periodic. However, x becomes zero at a fixed interval n/O", and the extrem.II values in the intervals decrease to zero in a geo- metrical progression with the common ratio v = exp( - ne/O"). This phenomenon is called damped oscillation with damping ratio v and logarithmic decrement logv= -ne/lT. In this case, too, 2n/0" is called the period. When a driving force term cp(t) is present in the right-hand side of (2), the solution takes on the additional term e :" (sin O"t f cp(t)e" cos O"t dt -cosO"t f cp(t)e"sinO"tdt). which represents the forced oscillation due to cp(t). If e <0 in (2) (negative resistance). the solu- tion (3) increases in amplitude, so that a small disturbance is amplified, resulting i-1 an auto- matic generation of osci\lation. This phenom- enon is called self-excited vibration. Besides being caused by some special kinds of circuit elements (e.g., tunnel diodes), such a situation often occurs when the vibrating system has time delay characteristics ( 163 Functional- Differential Equations). Among sustained vibrations, other than forced oscillations and self-excited vibrations are the parametrically sustained vibrations ' caused by periodic variation of a parameter of the vibrating system. Electric wires and panto- graphs for use in high-speed electric railways must be designed to prevent unwanted para- metrically sustained vibrations. On the other hand, a parametron is an electric element utilizing parametrically sustained vibration. C. Nonlinear Oscillation Actual vibrating systems contain more or less nonlinear elements, which give rise to various kinds of oscillations different from those de- scribed by the linear theory ( 290 Non- 
1185 linear Oscillation). For example, d 2 x/dt 2 - E(I-x 2 )dx/dt+x=0 (E>O) represents a stable sustained oscillation such that for large values of E, two nearly stationary states occur alter- nately, the transition from one to the other taking place abruptly. This is called relaxation oscillation. References [1J Lord Rayleigh, The theory of sound, Mac- millan, second revised edition, I, 1894; II, 1896 (Dover, 1945). [2J A. A. Andronov and C. E. Chaikin, Theory of oscillations, Princeton Univ. Press, 1949. (Original in Russian, 1937.) [3J L. S. Pontryagin, Ordinary differential equations, Addison-Wesley, 1962. (Original in Russian, 1961.) Also  references to 287 Nonlinear Lattice Dynamics, 290 Nonlinear Oscillation, 291 Nonlinear Problems. 318 Ref. Oscillations 
319 A Paradoxes 319 (1.3) Paradoxes A. General Remarks A statement that is apparently absurd but not easily disproved is called a paradox. A con- tradiction between a proposition and its neg- ation is called an antinomy if both statements can be supported by logically equivalent rea- soning. In practical use, however, "paradox" and "antinomy" often mean the same thing. B. Paradoxes in Set Theory (1) The Russell Paradox (1903). We classify sets into two kinds as follows: Any set that does not contain itself as an element is called a set of the first kind, and any set that contains itself as an element is called a set of the second kind. Every set is either a set of the first kind or of the second kind. Denote the set of all sets of the first kind by M. If M is a set of the first kind, M cannot be an element of M. But if M is of the first kind, then M must be an element of M, by definition. This is contradictory. On the other hand, if M is a set of the second kind, M must be an element of M; but since M is an element of M, M is a set of the first kind, so M cannot be an element of M, by definition. This is contradictory. Since the kind of reasoning employed in this paradox is very simple and is often utilized in mathematics, it became popular in set theory. To remove this paradox from set theory, Rus- sell suggested tramified type theory. If we adopt this theory, however, it becomes very hard to develop even an ordinary theory of real numbers ( 156 Foundations of Math- ematics). On the other hand, this paradox, together with the Burali-Forti paradox, indi- cates that the definition of a set should be restrictive. This realization led to the develop- ment of taxiomatic set theory. (2) The Burali-Forti Paradox (1897). Let W = {O, 1,2,..., (1), .., } be the twell-ordered set ( 312 Ordinal Numbers A) of all tordinal numbers. Let Q be the ordinal number of W Then every ordinal number, being an element of W, is less than Q. But Q is an ordinal num- ber. Hence, Q<Q. This is contradictory. (3) The Richard Paradox (1905). The expres- sions in the English language can be enum- erated by the device that is applied to the usual enumeration of the algebraic equations 1188 with integral coefficients. From the specified enumeration of all expressions in the English language, by striking out those which do not define a real number in the interval (0, 1], we obtain an enumeration of those which do. Consider the following expressiDn: "The greatest real number represented by a proper nonterminating decimal fraction whose nth digit, for any natural number n, is not equal to the nth digit of the nonterminating decimal fraction representing the real number defined by the nth expression in the last-described enumeration." Then we have befon: us a defi- nition of a real number in the interval (0, 1] by means of an expression in the English lan- guage. This real number, by its definition, must differ from every real number definable by an expression in the English language. This is contradictory. The fol1owing paradox was given by Berry (1906): "The least natural number not name- able in fewer than twenty-two syllables" is actually named by this expression, which has twenty-one syllables. The Epimenides paradox is a traditional ancient Greek paradox of this kind: Epimenides (a Cretan) said, "Cretans are always liars. ..." C. Paradoxes of the Continuum The problem of the continuum is important in both mathematics and philosophy. There are several paradoxes of Zeno concerning the continuum, among which the following two are best known: (l) Assume that Achilles and a tortoise start simultaneously from the points A and B, re- spectively, Achil1es running after the tortoise. When Achi11es reaches the point B, the tortoise advances to a point B l' When Achil1es reaches the point B 1, the tortoise advances further to a point Bz. Thus Achilles can never overtake the tortoise. (2) A flying arrow occupies a certain point at each moment. In other words, at each moment the arrow stands still. Therefore the arrow can never move. References [1] A. N. Whitehead and B. Russell, Principia mathematica, second edition, Cambridge Univ. Press, 1925. [2] S. C. Kleene, Introduction to metamath- ematics, Van Nostrand, 1952. [3] E. Mendelson, Introduction to mathemat- icallogic, Van Nostrand, 1966. 
1189 320 (XII1.19) Partial Differential Equations A. General Remarks A partial differential equation is a functional equation ( az a z z a z z ) F XI,...,X.,z' ax 1 ''''' axfax 1 ax Z '''' =0 that involves a function z of independent vari- ables XI' Xz, "', X n , its tpartial derivatives, and the independent variables Xj,..., Xn' The definition of a system of partial differential equations is similar to that of a system of tordinary differential equations. (The partial differential equation becomes an ordinary differential equation if the number of inde- pendent variables is one.) The torder of the highest derivative appear- ing in a partial differential equation is caned the order of the partial differential equation. Usuany we write Pi for azjaxj, X for XI' and y for Xz when n = 2, and P = azjax, q = azjay, r=aZzjax Z , s=aZzjaxay, t=aZzjayz when z is a function of X and y. A partial differential equation is caned linear if it is a linear relation with respect to z and its partial derivatives. For example, the equation A(x, y)r + B(x, y)s + C(x, y)t + D(x, y)p + E(x,y)q +F(x,y)z= G(x, y) is a linear partial differential equation. A par- tial differential equation is quasilinear if it is a linear relation with respect to the highest- order partial derivatives. A partial differential equation is caned nonlinear if it is not linear ( 291 Nonlinear Problems). A function z= cp(x j, Xz, ..., x n ) that satisfies the given partial differential equation is caned a solution of the partial differential equation. Obtaining such a solution for a given partial differen- tial equation is called solving this equation, and by analogy to the case n = 2, the integral hypersurface of the equation is z- CP(X1' X z ,"', xn)=O. For a system of partial differential equations, we define solutions in the same manner. Example 1. Xj, X z ,"', X n are functions of n independent variables XI' xz, "', X n . Then solving the partial differential equation az az az X 1 - +Xz-+... +Xn-=O aXI axz aX n is equivalent to sOlving the system of ordinary differential equations dX 1 dx z - - XI Xz dX n X n 320B Partial Differential Equations In other words, iff1Jz, ...,f.-1 are n-l in- dependent tintegrals of (2), then for an arbi- trary function <1>, z = <I>(f1' '" , f.-I) is a general solution of (1) ( Section C). Example 2. If Pj, P z , ..., P n , R are functions of independent variables X j, ..., X n and the dependent variable z, and if the quasilinear partial differential equation (Lagrange's dif- ferential equation) az az az P 1 - a +Pz- a +",+Pn-=R XI Xz aX n (3) has an integral hypersurface V(z, XI' .., ,x n ) = 0, then we have av av av av P l - a +Pz- a +...+Pn- a +R- a =0, (4) Xl Xz X n Z which is an equation of type (1). From this we can obtain a general solution by the method of example 1. The same procedure is applicable to solving other systems of partial differential equations. B. Characteristic Manifolds We consider a partial differential equation of the nth order of two independent variables x,y: F(X,y,Z,PIO,POj, ...,PnO' "',POn)=O, aj+k z wherepjk= axjayk ' (5) With this equation, we associate a manifold defined by a real parameter 2, x=x(2), y=y(2), Pjk=Pjd2), j, k=O, 1, ...,n-l; j+kn-l, (6) and consider the fonowing problem: Find a solution cp(x, y) of (5) that satisfies aj+k cp(x, y) ax j ayk = Pjk(2), j, k=O, 1,2,..., n-l; j + kn-l (I) on the curve x=x(2), y= y(2). We can this problem the Cauchy problem for equation (5). If F vanishes for a system of values x o , yO, P (j, k =0,1,2,..., n; p80 =ZO;j + k  n) and is tholomorphic in a neighborhood of this sys- tem of values, if x, y, Pjk are holomorphic functions of 2 in a neighborhood of 2 = 0 and take the respective values xo, yO, P at 2=0, and if pnodyn- P n - 1 . 1 dxd y n-1 +... (2) +(-I)nPondxn#O (7) 
320C Partial Differential Equations for (XO, yO, pJ,J, then we have a unique holo- morphic solution of this Cauchy problem in a neighborhood of (xo, yO, pJ,J (Here we use the notation P.ik = aF japjk') This is Cauchy's existence theorem. We cannot apply this theo- rem when the left-hand side of (7) vanishes at (XO, yO, pJ,,). At such a point, uniqueness of the solution fails, and there may be several solutions through the point (XO, yO, pJk)' There are n curves on which the left-hand side of (7) vanishes on the integral surface Z = cp(x, y). These curves are called characteristic curves of (5). We associate the values , aj+k cp(x, y) I Pjd A )=- j k ' ax oy FX(A),YY(A) j, k=O, 1, ...,n; j+k:'(n, with each point (x, y) on these curves. The manifold {x(l), y(l), p(l)} (p(l) = {Pjk(l)}) of the parameter l is called a characteristic mani- fold of equation (5). Cauchy's existence theo- rem cannot be applied on the characteristic manifold. The foregoing considerations can be ex- tended, to some extent, to the space of higher dimensions Rn or e. Let P be a linear partial differential operator of order m: P(x, D) = L a"(x)D", 1"Im " = ( _ ) "l ( !...- ) ". D ... , aX j aX n IIXI =1X 1 +... +lXn' The coefficients are assumed smooth and real in Rn or holomorphic in cn. Its homogeneous part of order m, denoted by Pm(x, D), is called the principal part of P. Let S be a hypersurface defined by cp(x) = 0 with grad cp(x) # O. S is called a characteristic surface if Pm (x, grad cp(x)) =0 holds for x in a neighborhood of S or merely for XES, and cp(x) is called a phase function. A real vector o ( # 0), or a complex vector o # 0 is called a characteristic direction at the point X o if P m(X O ' o) = O. The zeros (x, ) (#O) of Pm(x,) are called the characteristic set. Furthermore (x o , O) is called simple if gradPm(xo' o) #0. Suppose that (x o , O) is simple. The integral curve (x(t), W)) that satisfies x = grad Pm(x, ),  = - grad x Pm (x, ) is called the bicharacteristic strip of Pm. Evi- dently Pm(x, ) is constant along the bicharac- teristic strip. In particular, if this constant is zero, it is said to be null-bicharacteristic. The phase function cp(x) can be obtained at least locally by using the bicharacteristic strip. 1190 C. Classification of Solutions First, we consider a partial differential equa- tion of the first order of two independent variables: F(x, y, Z, p, q) =0. (8) A solution of (8) that contains two arbitrary constants is called a complete solution. If we get one complete solution of (8), then we can obtain all the sOlutions by differentiations and eliminations. Let (9) be a complete solution: V(x,y,z,a,b)=O. (9) Differentiating this, we get av av ax +Pa;-=O, av av ay +qa;-=O. (10) Eliminating a and b from (9) and (10), we get the original equation (8). Furthermore, solving equation (8) is equivalent to getting three func- tions a, b, Z of x, y from (8), (9), and (10). If we regard a, b as functions of x, y, in (9), we get avaa aVab --+--=0 aa ax ab ax ' a v aa a v ab --+--=0. aa ay ab ay (11 ) Therefore we can replace (9) and (10) by (9) and (11). Hence we have the following three cases: (i) When a, b are constants, we get a complete solution. (ii) When V = 0, a Vjaa = 0, and a Vjab = 0, we get a solution that does not contain arbitrary constants, because z, a, bare all functions of x and y. We call this solution a singular solution of (8). (iii) When a Vj aa, a Vjab do not vanish simultaneously, the Jacobian D(a, b)jD(x, y) vanishes because of (ll). This means that there exists a functional relation between a and b. If there are two such rela- tions, a and b are constants and the solution z becomes a complete solution. Therefore we assume that there is only one such relation between a and b, whose form is assumed to be b = cp(a). Then we get V(x, y, z, a, cp(a)) =0, av av aa +8"bcp'(a) =0. (12) If we solve (12) for the unknowns a and z, we get a solution Z of (8) that contains an arbi- trary function cp instead of arbitrary constants. Such a solution is called a general solution of the partial differential equation (8) of the first order. By specializing this function cp, we ob- tain a particular solution of (8). Thus (i)-(iii) exhaust all the cases, and by obtain.ing a com- plete solution of (8) we can get all the solu- tions of (8). The number of complete solutions may be more than 1, or it may be infinite. These complete solutions Can be transformed into each other by tcontact transformations. 
1191 Moreover, they are contained in the general solutions. Now we consider the case where the number of independent variables is n. Take an equa- tion that contains n - r + 1 arbitrary constants a 1 , a 2 ,..., a n - r + 1 : V(x 1 , x 2 ,..., x n :a 1 , a 2 ,..., an-r+!)=O. (13) Differentiate this equation assuming that aI' ..., a n - r +! take fixed values. Then we get oV oV ;:;- + Pr" = 0, UXi uZ oz Pi= OX" , i= 1,2, ...,n. If we eliminate aI' a 2 , ...,a n - r +! from (13) and (14), we obtain r partial differential equations of the first order: fj(xj, X2'"'' Xn,Z, PI ,P2"'" Pn)= 0, j= 1,2,... ,r. (15) We can (13) a complete solution of(15). In this case, as in the case when n = 2, we get an the solutions of (15) from a complete solution (13) of(15). We have the same classification as in the case n = 2: (i) When we regard aj, ..., a n - r +! as constants, then we have a complete solution of (15). (ii) When we can eliminate the con- stants a 1 ,..., a n - r +! from equations V=o, oV oV ,,--= 0, "', -;---= 0, ua 1 ua n - r + 1 we get a solution that does not contain an arbitrary constant. Such a solution is caned a singular solution of (15). (iii) If not an of the oV/oa i vanish, there exists at least one func- tional relation among aI' a 2 , ...,a n - r +!. We assume that there exist exactly k (  n - r) relations among aI' a2' ... ,a n - r +!: Jj(a 1 ,a 2 , ...,a n - r + 1 )=0, j=l, ...,k. (16) Then there exist numbers AI' A2'"'' Ak such that av 011 oik ,,=A1 ,,+... +A k -, ual ual oal 1= 1, ...,n-r+ 1. (17) Hence, by eliminating aI' a 2 ,..., a n - r + 1 , Aj, A2' ...,Ak from (13), (16), and (17), we generaIly obtain exactly one relation between XI"'" X n and z. This is a solution of (15) that contains exactly k arbitrary functions II .J2' ...,k Such a solution is caned a general solution of (15). In particular, if k = n-r+ 1, then it is a complete solution. We might think that there are n-r general solutions corresponding to k = 1, ... , n -r. But these general solutions are not essen- tially different. For the partial equation of the second order F(x, y, z, p, q, r, S, t) =0, this defi- 320D Partial Differential Equations (14) nition of a general solution is not applicable since we cannot successfuIly define a general solution by using the number of arbitrary functions contained in a solution. Instead, we now use the fonowing definition, due to J. G. Darboux: A solution of a general partial differential equation is caned a general solution if by specializing its arbitrary func- tions and constants appropriately we obtain a solution whose existence is established by Cauchy's existence theorem. A solution z = ep(x, y) of a general partial differential equa- tion is caned a singular solution if Cauchy's existence theorem cannot be applied on any curves on the manifold formed by z = ep(x, y), p=oep/ox, q=oep/oy. D. Cauchy's Method We can regard equation (8) as a relation be- tween the point (x, y, z) on the integral surface S and the direction cosines of a tangent plane at that point. Therefore the tangent planes at an points of the surface form a one-parameter family. They envelop a cone (T) whose vertex is (x, y, z) on S. The tangent plane at a point M on the integral surface S is tangent to this cone (T) along one generating line G of (T). A curve on S whose tangents are an generat- ing lines of(T) is a characteristic curve. If we write of of of of of X = ox ' Y = oy ' Z = iJz ' P = op ' Q = oq , then the characteristic curve is given by the system of ordinary differential equations: dx = dy === -dp P Q Pp+Qq X +pZ Y+qZ (18) We can this system the characteristic differen- tial equation or Charpit subsidiary (auxiliary) equation of the partial differential equation (8) of the first order. System (18) determines not only x, y, z but also p and q. The set of these tsurface elements (x, y, z, p, q) is the character- istic manifold. This characteristic manifold is considered as a part of the integral surface with infinitesimal width, and in this case we can it the characteristic strip. The character- istic strip is represented by the equations x= X(A), y = y(A), z = Z(A), P = p(A), q = q(A) con- taining a parameter. On the integral surface z=z(x,y), we have dz oz dx ozdy dA = oxdA + OydA and dz=pdx+qdy. (19) 
320E Partial Differential Equations Equation (19) is called the strip condition. The equations (18) evidently satisfy this condition. The solution u(t, x) = u(t, x[, ..., x n ) of U t + f{t,x 1 ,..., X n , u x "..., ux)=O, u(O,X)=qJ(X) is obtained (at least locally) as follows. Let the solution of the differential equations dx ---.: = ff (t,x 1 , ""Xn'I' ''''n)' dt " di ' -= - fx(t,x 1 , ""xn'[' ""n)' 1 in, dt ' issuing from (x o , o) at t = 0 be (x(t; X o , o), (t; X O , o)). Then specializing jO = qJx/x o ) (1 jn), the solution u(t, x) is obtained by quadrature along these curves (characteristic strips) from the relation du n - d = - f(t, x, ) + ?: JP, x, ). t J1 In particular, when f(t, x,) is homogeneous of degree 1 in , by Euler's identity the right- hand side is identically zero. This means that u is constant along the characteristic strip. E. Homogeneous Partial Differential Equations Assume that f( 1, z, "', m) is a thorn ogene- ous polynomial of m independent variables  l'  z, ... , m' We denote the differential oper- ator a/ax, by D,. Then consider a homogene- ous partial differential equation f(D[,Dz, ...,Dm)u=O. We can obtain a homogeneous equation from an inhomogeneous partial differential equation by transformation of the dependent variable. For example, Dw = Dz w becomes the homoge- neous partial differential equation (D - DzD3)U = 0 by the transformation of the dependent variable u = eX'w. The equation (D - DzD3)U = o corresponds to the homogeneous polynomial r(" z, 3) = i - Z3' Generally, for equa- tion (20), we consider the solution u=F(O[,Oz, ...,0,), where ° 1 , 0z, ..., Os are functions of XI' ..., X m and F is an arbitrary function of 0;. Such a solution is caned a primary solution of (20). For example, for the equation (D - DDu = 0, there are two primary solutions, u=F(x[ +X2) and u= f(x[ -xz). For tLaplace's equation azu azu azu ,i u=- a i++- a z =0, x oy z we have a primary solution u= F(z+ ixcoslX+ iysinlX), where IX is a parameter. 1192 Second, for the twave equation azu azu azu 1 azu -+-+----=0 ax z ayZ az z OZ at Z ' (22) we have a solution u = yf( IX, fJ), 1 y=-, r z-r fJ = -----:- , x+zy (23) r lX=t-- 0' where r Z = XZ + yZ + zZ, f is an arbitrary func- tion, and y is a particular solution of (22). Furthermore, v = f(lX, fJ) satisfies the equation of a characteristic curve of (22): ( av ) Z + ( av ) Z + ( av ) Z = ( av ) Z ax ay az OZ at Such a solution, which is the product of par- ticular solutions and some function that con- tains an arbitrary function, is called a primitive solution of the original equation. Equation (22) has another primitive solution of the type u=g ( t+, z- ) , r 0 x + zy where 9 is an arbitrary function. Laplace's equation has a primitive solution 1 ( z-r ) u=-f - . r x+iy (20) A basic equation is an equation, such as La- place's equation, that has a primary solution and a primitive solution. A solution of an equation that has the same characteristic curves as a basic equation can be obtained from a particular solution of the basic equa- tion by integrations and additions. For exam- ple, if we choose a particular solution u = r-[ of Laplace's equation, which is a specialization of the primitive solution u=r- 1 f«z--r)/(x+iy)), then u= f f f [(x-)z+(Y-'1f+(z-,n-1/Z x F(,'1,()dd'1d' is a solution of ,iu + 4nF(x, y, z) = 0 (21) in the interior of the domain of integration. F. Determined Systems The general form of a system of partial dif- ferential equations in two independent vari- ables is Fi(x, y, u(1), u(Z),..., ut m ), u[), u1),..., um), um), uV,...)=O, i=1,2,...,h, (24) i.e., a system of h equations for m functions 
1193 u!1), U(2), ...,U(m) ofthe independent variables x and y. The system is caned a determined sys- tem if h = m, an overdetermined system if h > m, and an underdetermined system if h < m. An example of a determined system is the tCauchy-Riemann equation UX-vy=O, uy+vx=O, for u(x, y), v(x, y), which can be further reduced to two determined equations u=O and v= O. A simple example of an overdetermined system is U x = f(x, y), u y = f(x, y). A necessary and sufficient condition for the existence of a sOlution of this system is h = fx' The Cauchy-Riemann differential equations for a holomorphic function f(z l' Z2)=U + iv of two complex variables ZI =X 1 +iYb Z2=X2 +iY2 are U X1 =V y1 , U X2 =V y2 , u y1 =-V X1 ' U Y2 = - V X2 ' which can be reduced to U XjXj +uy,y, =0, U X ,X 2 + U YjY2 = 0, U X2X2 + U Y2Y2 = 0, U -u =0 XIY2 X2Yl ' which is also an overdetermined system. An example of an underdetermined system is J(U, v) J(x,y) =UXvy-UyVx=O. If this equation holds, there exists a functional relation w(u, v)=O that can be regarded as a solution of this underdetermined system. G. General Theory of Differential Operators In recent developments of the theory of par- tial differential equations, there is a trend to construct a general theory for tdifferential operators regardless of the classical types of differential equations ( 112 Differential Operators). For example, we take a property that is satisfied by some equation of classical type (e.g., an teniptic differential equation) and proceed to characterize an equations that have this property. (For example, thypoenipticity is a property of classical tparabolic and eniptic equations.) There are several basic problems in this general theory: the existence of a funda- mental solution, the existence of a local solu- tion, unique continuation of solutions, the differentiability and analyticity of solutions, and the propagation of smoothness. We ex- plain here only two of them: the fundamental solution and the local existence of solutions. 320H Partial Differential Equations H. Fundamental Solutions L is assumed to be a linear partial differential operator with constant coefficients. If a tdistri- bution E satisfies the equation LE(x) = <'5 (x), where <'5(x) is the tDirac <'5-function, then we can E(x) a fundamental solution (or elementary solution) of L. Also, if L is a linear differential operator and E satisfies the equation LxE(x, y) = <'5 (x - y), then we can this distribution E(x, y) a funda- mental kernel (or elementary kernel) of L. Let L be a differential operator with con- stant coefficients and E(x) be a fundamental solution of L. Then E(x, y) is a fundamental kernel of L. Sometimes E(x, y) itself is caned a fundamental solution. L. Ehrenpreis and B. Malgrange proved that any linear differential operator with constant coefficients has a fun- damental solution [4]. If we take a fundamental solution (funda- mental kernel) E and add to it an arbitrary solution of the equation Lu=O, then we get another fundamental solution (fundamental kernel) of L. This freedom of the fundamental solution (fundamental kernel) can be used to construct tGreen's functions of the boundary value problem of eniptic equations and of the mixed initial-boundary value problem for parabolic equations. A Green's function is a fundamental solution (fundamental kernel) that satisfies given boundary conditions ( 188 Green's Functions; 189 Green's Operator). The fundamental solutions (fundamental ker- nels) relative to the Cauchy problem are also defined as in this section. For example, con- sider a fundamental solution of the Cauchy problem concerning the future behavior of a differential operator L=JjJt-P(JjJx), namely, a distribution E(t, x) that satisfies LE = o (t>O) and E(t,x)lt=o=<'5(x). If we put E(t,x)= E(t, x)(t:;" 0) and E(t, x)=O (t <0), then E(t, x) is a fundamental solution (or kernel) of the differential operator L, that is, LE=<'5(t,x). Sometimes a fundamental solution of the Cauchy problem for a parabolic equation is caned a Green's function. On the other hand, a fundamental solution (or kernel) of the Cauchy problem for a hyperbolic equation is caned a tRiemann function. A Riemann function actu- any is not always a function; in general it is a distribution. Example 1. A fundamental solution of the 3- dimensional Laplacian J2 J2 J2 =-+-+- Jxi Jx Jx is E(x) = -lj4nr, where r= J xi +x +x. 
320 I Partial Differential Equations Example 2. A fundamental solution of the Cauchy problem for the future of the 3- dimensional wave operator a 2 3 a 2 L=z-- I, at i1 ax; i.e., a distribution E(t,x) (t;::'O) that satisfies LE=O (t>O), E(O,x)=O, and (a/at)E(O, x) = t5(x), is given by E (t, x) = (1/4nt)t5(r- t) (t> 0), r =  + x + x. A fundamental solution for L is given by E(t, x) = E(t, x) (t;::' 0); = o (t<O) ( Appendix A, Table 15.V). I. Existence of Local Solutions Given a linear differential operator L and the equation Lu = 1, we have the problem of deter- mining whether this equation always has a solution in some neighborhood of a given point. If the coefficients of Land fare holo- morphic in a neighborhood of this point and if the homogeneous part of highest order does not vanish, then there exists a solution that is holomorphic in a neighborhood of the given point (tCauchy-Kovalevskaya existence theorem). If L is a linear differential operator with constant coefficients, E is a fundamental solu- tion of L, and r is a function (or distribution) that is zero outside of a compact set, then we have a solution u that is the tconvolution of E and f: u = E * f On the other hand, H. Lewy proposed the following example [3]: au au au Lu=--+i-+2i(X1 +iX2)-= f(X3), aXl aX2 aX3 where f is a real function of X3' He showed that if this equation has a solution that is of class C 1 , then r must be real analytic. There- fore, if f is of class Coo but not real analytic, then this equation has no C 1 -solution. Actu- ally, no solution exists even in the distribution sense. (Note that, since the coefficients of L are now complex-valued, the results mentioned at the beginning of this section are no longer applicable.) For linear differential operators L, a study by L. Hormander gives some necessary con- ditions and also some sufficient conditions for the local existence of a solution of the equa- tion Lu = f for sufficiently general f [4]. This result has been developed and completed by L. Nirenberg and F. Treves [18] and by R. Beals and C. Fefferman [19]. The operator considered by S. Mizohata (J. Math. Kyoto Univ.,l (1962)), a . k a L=-+ IX 1-, ax; aX 2 serves as a standard model in this problem. In 1194 the neighborhood of the origin, if k is even, L is locally solvable, and if k is odd, it is not locally solvable. However, for linear partial differential operators with multiple character- istics, the problem of local solvability becomes extremely difficult. References [1] E. Goursat, Cours d'analyse mathema- tique, Gauthier-Villars, II, second Idition, 1911; III, fourth edition, 1927. [2] R. Courant and D. Hilbert, Methods of mathematical physics II, Interscience, 1962. [3] H. Lewy, An example of a smooth linear partial differential equation without solution, Ann. Math., (2) 66 (1957), 155158. [4] L. Hormander, Linear partial differential operators, Springer, 1963. [5] S. Mizohata, Theory of partial differen- tial equations, Cambridge Univ. Press, 1973. (Original in Japanese, 1965.) [6] C. Caratheodory, Calculus ofyariations and partial differential equations of the first order, Holden-Day, 1967. [7] I. G. Petrovskii, Ober das CauGhysche Problem fUr ein System linearer pa rtieller Differentialgleichungen im Gebiett: der nicht- analytischen Funktionen, Bull. Univ. Moscou, ser. internat., sec. A, vol. 1, fasc. 7 (1938),1-74. [8] J. Hadamard, Le probleme de Cauchy et les eq uations aux derivees partielles lineaires hyperboliques, Hermann, 1932. [9] I. G. Petrovskii, Lectures on partial dif- ferential equations, Interscience, 1954. (Orig- inal in Russian, 1953.) [10] S. L. Sobolev, Partial differential equa- tions of mathematical physics, Pergamon, 1964. (Original in Russian, 1954.) [11] 1. L. Lions, Equations differelltielles operationelles et problemes aux limites, Springer, 1961. [12] J. L. Lions and E. Magenes, Problemes aux limites non homo genes et applications, Dunod, I, 1968; II, 1968; III, 1969. [13] A. Friedman, Partial differen1ial equa- tions, Holt, Rinehart and Winston, 1969. [14] F. Treves, Locally convex spa.ces and linear partial differential equatiom., Springer, 1967. [15] F. Treves, Linear partial differential equa- tions with constant coefficients, Gordon & Breach, 1966. [16] F. John, Partial differential equations, Springer, 1971. [17] L. Bers, F. John, and M. Schechter, Par- tial differential equations, Interscimce, 1964. [18] L. Nirenberg and F. Treves, On local solvability oflinear partial differential equa- tions I, II, Comm. Pure Appl. Math., 23 (1970),1-38,459-510. 
1195 [19] R. Beals and C. Fefferman, On local solvability of linear partial differential equa- tions, Ann. Math., 97 (1973), 482-498. [20] F. Cardoso and F. Treves, A necessary condition of local solvability for pseudodiffer- ential equations with double characteristics, Ann. Inst. Fourier, 24 (1974), 225-292. 321 (XIII.21) Partial Differential Equations (Initial Value Problems) A. General Remarks First, we give two examples of initial value problems for tpartial differential equations. (I) Consider the partial differential equation U x - u y = 0 of two independent variables x and y. If the function q>(y) is of class C 1 , then u = q>(x + y) is a solution of this equation that satisfies u(O, y) = q>(y). (II) We denote a point ofR.+! (or en+!) by (t,x), X=(Xl, ...,x.). Let L be a linear partial differential operator of order m: am alvl L=-+ I a (t x) atm Ivl<;;m VoV,'''V n ' atVOaxi'... ax;n' Ivl=vo+vl +...+v., vo<m, where the coefficients avov,...vJt, x) are func- tions of class C'" (i.e., treal analytic functions or holomorphic functions) in a neighborhood of(t, x) = (0, 0). If the functions v(t, x) and wk(x) (0  k  m - 1) are of class C'" in a neighbor- hood of (t, x) =(0, 0), then there exists a unique solution u(t, x) of class C'" in a neighborhood of (t, x) = (0, 0) that satisfies L[u] =v(t, x), aku ""8tk(0, x)= wk(x), Okm-1. This is caned the Cauchy-Kovalevskaya exis- tence theorem (for linear partial differential equations). As in (II) we choose one of the independent variables as the principal variable and regard the others as parameters. When we assign a value a to the principal variable t, the values of the dependent variables (unknown functions) and their derivatives are caned initial values, initial data, or Cauchy data. Conditions to determine initial values are caned initial con- ditions. The problem of finding a solution of (1) under given initial conditions is caned an initial value problem or Cauchy problem. We may consider initial value problems not only for initial conditions on a hyperplane t = a, but 321 B PDEs (Initial Value Problems) also for initial conditions on a hypersurface, caned an initial surface ( Section C). Let a(x, D) be a linear partial differential operator of order m: a(x, D) = I a,(x)D', I,j<;;m al'l D'= ax' ... axn' liXl =iX l +... +iX., where the coefficients a,(x) are of class C'" in a neighborhood of x=O. Its characteristic poly- nomial is h(x,)= I a,(x)', '=f'...n. 1,1=m Let S:s(x)=O be a regular surface (i.e., Sx= (as/ax[. ..., as/ax.) #(0)) of codimension 1. We suppose that S is a tnoncharacteristic surface, that is, h(x, sx) # 0 on S. Let v(x) and wk(x) (Okm-l) be the functions of class C'" in a neighborhood of x = 0 and on S, respec- tively. We consider the Cauchy problem aku a(x, D)u(x) = v(x), _ a k ) = wk(x) on S, n (x Okm-l, (1) (1) where n is the outward normal direction of S. S is thus the initial surface. Then there exists a unique solution u(x) of class C'" in a neighbor- hood of x=O. In fact, by the change ofvari- abIes Xl =s(x), Xj=Xj (2in) if as/ax 1 # o on S, this problem can be reduced to the Cauchy problem (1) by taking account of the fact that h(x,sx)#O on S. The Cauchy-Kovalevskaya theorem asserts the local existence of solution when the initial values are of class C"'. Indeed, J. Hadamard noted that if the initial values are not of class C"', the initial value problem does not always have a solution. For example, consider the initial value problem a 2 u a 2 u a 2 u -+-+-=0 ax 2 ay2 az 2 with the initial values au _ 0. u(O, y, z) = w(y, z), ax(O, y, z) If the function w(y, z) is not of class C'" in any neighborhood of y=z=O, the sOlution of this problem can never exist in (or even on one side x  0 of) any neighborhood of x = y = z = O. B. The Cauchy-Kovalevskaya Existence Theorem for a System of Partial Differential Equations in the Normal Form The Cauchy-Kovalevskaya existence theorem (1) is extended for more general systems of 
321 C PDEs (Initial Value Problems) partial differential equations in the normal form studied by Kovalevskaya. Consider rP,u. ( =Fi t,x,Ut,...,u m ,..., atP, a1vlU j ) V o V 1 i v n '''' , ot ax] ... eX n where 1 i,jm, Ivi = V o + VI +... +VnPj, V o < Pj, and x =(x 1 , ... ,x n ). We assume that Fj (1  i  m) are functions of class cm with respect to arguments t, x, u l ,..., urn"'" OlvIUj/ctVoiJxl ... h", . .. , in a neighborhood of (0, 0, . . . ,0). If thefunctions wjk(x) (1 im,Okpj-l) are of c1ass C m in a neighborhood of x = 0, then there exists a unique solution u l (t, x), ..., urn(t,x) of class c m in a neighborhood of(t,x)= (0,0) that satisfies the equations and the initial values (kU. .(O,x)=wjdx), 1 im, Okpj-l ct [2, 4]. C. Single Equations of the First Order F or a single partial differential equation given in the normal form ru ( ' au au ) --:-=F X'Yl,""Yk'U,--,..., ' ex CY1 0Yk we assume that F(x,y], ""Yk,u,q" ...,qd is a real-valued function of class C 2 in a neighbor- hood of x = a, Yj = bi, U = C, qi = d j , and that cp(y], .., , Yk) is also a function of class C 2 such that cp==c, r](p/DYj=d j at Yi=b j . Then there is a solution u of (2) in a neighborhood of x= a and Yj = b i that satisfies u(a, y" "', Yk) = cp(}'". " Yk) and is of class C 2 . For the uniqueness of solutions of this Cauchy problem, A. Haar (Atti del Congresso lnternazionale dei Matematici, 1928, Bologna, vol. 3) showed that if F satisfies the tLipschitz condItIOn I F(x, Y, u', q') - F(x, y, u, q)1 k A I Iq;-q;i+Blu'-ul, i=l then the solution of the initial value prob- lem for (2) is unique. To obtain this result he studied the partial differential inequality I i'u I k I au I -:;- A I -;;-- +Blul+C. ox j1 uY; Next, we consider more general equations of the first order ( i':u ru ) F Xl,...,Xk'U,,..., =0. eX I cX k 1196 Suppose that F(x 1 , ..., X b U, p" .... Pk) is a real- valued function of class C 2 in a neighborhood of Xi =aj, u=b, pj=c;; qJ(x"... ,x k ), S(x 1 , ...,x k ) are functions of class C 2 in a neighborhood of Xi = a j that satisfy b = qJ(a 1 , ''', a k ), C j = (aqJ/ax;)x=a, S(a 1 , .,., ad = 0; and Lt] :; ::J H'Ub.P=C #0. (5) Then there exists a solution u of (4) of class C 2 in a neighborhood of x=a that satisfies u= qJ(x) on the hypersurface S(x) = O. Furthermore, if F, qJ, S are of cl2.sS C] and satisfy (5), then there is at most one solution u of (4) of class C 1 in a neighborhood of a that satisfies u = qJ(x) on the hypersurfae S(x) = O. These facts can be proved in the following way. By choosing SI (x), ... ,Sk-1 (x) and then S(x) so that the tJacobian a(S, S], ..., Sk-rJ/ c(x I' ..., xd does not vanish and by changing variables from x to S, SI' .., ,Sk-1' we obtain a normal form solved for au/as by condition (5) (this condition means that the hypersurface S(x) = 0 is not tangent to the tcharacteristic curves). D. Quasilinear Equations of the Second Order (2) Consider the equation k a 2 u .I a;/x,u,p)- a . a +b(x,u,p)=O, I.Jl x, xJ where x=(x 1 , ...,xd, p=(p" ...,pd, and pj= au/ax j . We assume that the initial conditions are u=qJ(x) on S(x)=O and k as au I ,,-=I/1(x) j=1 (;X j ax; on the same hypersurface. Taking the other functions Sdx), "', Sk-1 (x) and then S(x) so that the Jacobian a(S, SI' ..., Sk-rJ/a(X 1 , "', xd does not vanish, we change the variables x to SO,S" ...,Sk-1 (so=S,Sj=S;). Then we get k-] a 2 u I Q(S,Sv)-;;-- a +b*=O, Il.V=O oSf.1 Sv (6) where  as as v Q(S, Sv) = ..;.... ajj- a . _ a .' l,J=t Xl XJ (3) k-1 as au p;=I---.Il.-, o aX j as u k-' k a 2 s au b*=b+ I I a.-. .=0 ;.j=' IJ aXjaX j as. (4) When So = 0, the initial conditions are trans- 
1197 formed to u= qJ*(Sj,..., sk-d= qJ(x(O, sl>"', Sk-1))' au _,1,* _ a -'I' (SI' ""sk-d So [ k-1 ( as ) 2 J -l [ = .I _ a t/J(X(O,SI' ...,Sk-l)) =1 Xi k-1 at/J* k as). as ] -I-I-- . ).=1 as). i=1 ax; ax; Suppose that Q(S, S) # 0, and set au/as i = qi' Then equation (6) added to the initial con- ditions u= qJ*(SI' ..., sk-d, qo = t/J*(SI , ..., sk-d at So = ° is equivalent to the system in the normal form of partial differential equations of the first order au _ a =qo, So aql' aqo. as o asl" J1=I,...,k-l, aqo = _ [ kII ( kf Q(S S) aql' as o Q(S, S) 1'=1 v1 1" v aS v - 2Q(SI" S) :) - b* ] with initial conditions u = qJ*(SI' "', Sk-d, qo = t/J*(SI"'" Sk-1)' qu=aqJ*/aSI' when So = 0. Thus, if the coefficients are of class C"', the preceding theory applies to this equation. E. Continuous Dependence of Solutions on the Initial Values First, we consider the fo11owing simple exam- ple ofa linear equation. The wave equation a 2 v _ 2 a 2 v ail- C ax 2 for v(x, t) is the simplest thyperbolic equation. Its solution satisfying the initial conditions v(x, 0) = f(x), (av/at)(x, 0) = g(x) is given by 1 1 f x+ct v(x,t)=2"(f(x+ct)+ f(x-ct))+- g(A)dA. 2 x-c' It is obvious from this expression that if we regard f(x) and g(x) as elements of the tfunc- tion space C(R) of continuous functions of xER with the topology oftuniform conver- gence on compact sets, then v(x, t) is deter- mined as the value of a linear operator from C(R 2 ) to C(R2) on xt-space. If such continuous dependence on the initial values is established, or more precisely, if there is a unique solution for sufficiently smooth initial values that de- pends on the initial values continuously in a 321 E PDEs (Initial Value Problems) suitable sense, we say that the initial value problem is well posed (properly posed or cor- rectly posed). Systematic research on the we11-posedness of Cauchy problems was initiated by I. G. Petrovskii, who considered the fo11owing sys- tem of partial differential equations, which is more general than differential eq uations of the normal form. (The coefficients are a11 assumed to be functions of t only.) a'ju. N aVo+Vj+,,+v'u ( xt ) --2 -" " a ( t ) k , a t'j -   jkvOVI' v. at VO a v, a v. k-1Ivl';m Xl ... X, + BlXj, ...,X"t), j= 1, ...,N, Ivl=v o +v 1 +...+V,' vO<nk' (7) This has a normal form if n k = m (k = 1, "', N). Now, taking the derivatives ( a ) 'r l at ulx, t), (:t ) ulx, t) ( a ) 'r 2 at uj(x, t), ... , as new unknowns, we get another system: aU j N' aVI+' +V n -=" "a' k ( t ) u ( X t ) a t   J vI,,,vn a v a v k , k=llvl';m XII... X,n +Cj(x,t), j= 1, ...,N'. (8) We take as the space of initial values the ttopologicallinear space composed of a11 func- tions whose derivatives up to a sufficiently large order are bounded on the whole space R' and equipped with the topology determined by the tseminorms that are the maximums of derivatives up to a given order on the whole space, and we take as the range space a simi- lar space on the xt-space, where xER' and °  t  T. Then we can formulate a necessary and sufficient condition for the we11-posedness of the initial value problem for the future (the problem is regarded as specifying a mapping that assigns to the initial values on t = ° the values u(x, t) for t > 0). To give such a con- dition we consider the fo11owing system of ordinary differential equations, which are given by a tFourier transformation on the x-space of system (8): dU j N' - dt = I I a jkv ,.. vJt) kllvl';m x (2nidV, '" (2ni,rnUk(' t) + Cj(' t), j=I,...,N'. (9) If the tfundamental system of solutions of (9) is VP1(, t) (i= 1,..., N',j = 1, ..., N'), then the condition is that these functions satisfy the ineq ualities Idj)(,t)IC(I+II)L, OtT, (10) 
321 F PDEs (Initial Value Problems) where C and L are constants independent of . This is the condition obtained by PetrovskiI. If system (7) is of normal form and is well posed for the future, it is also well posed for the past. In this case, equation (7) is said to be of hyperbolic type ( 325 Partial Differential Equations of Hyperbolic Type). An example that is not of a normal form and is well posed is a parabolic equation ( 327 Partial Dif- ferential Equations of Parabolic Type). In Petrovskifs theory it is sufficient to assume that the coefficients in (7) are continuous and bounded, and we can take T arbitrarily large provided that (10) is satisfied. Hence Petrov- skiI's theory guarantees global existence of solutions. F. Uniqueness of Solutions If a linear partial differential equation of the first order of normal form has coefficients of class Co, then the solution of class C 1 satisfy- ing the prescribed initial conditions is unique (Holmgren's uniqueness theorem, 1901). Every system of partial differential equations of higher order of normal form can be reduced to a system of the first order of normal form. Therefore, if the coefficients are of class CW, there is only one solution for the original initial value problem with continuous partial derivatives up to the order of the equation. Moreover, if an analytic manifold S of dimen- sion n - 1 (n is the number of independent variables) is not tcharacteristic for the given equation of order m, there is at most one solu- tion whose derivatives of order up to m - 1 coincide with given functions on the manifold S. The proof of this fact relies on the Cauchy- Kovalevskaya existence theorem. The uniqueness problem for the initial value problem is in general very difficult even for linear equations when the coefficients are not of class Cwo In particular, if the number of independent variables is 2 and the coefficients are all real, then we have a result of T. Carleman (1939) about the system: au m au m a: = V1 apv(x,y) a; + V1 bpv(x,y)u" (11) where the a pv are of class C 2 and the b pv are continuous. He proved that if the eigenvalues of the matrix (a pv ) are all distinct, then even when some of the eigenvalues are complex, there is at most one solution of class C1 for the initial value problem. If we omit the assump- tion about the eigenvalues, however, the theo- rem does not hold in general, because we have a counterexample due to A. Plis (1954) where all coefficients are of class Coo, m = 2, b pv == O. 1198 A. P. Calderon showed that Caileman's result can be extended to the case n > 2 (A mer. J. Math., 80 (1958)). Consider the following linear partial differential equation of the kth order: :: + jt 1j(x,y, :J[::j- J+B[li]=O, (12) where 1j(x, y,) is a homogeneous polynomial of degree j of  = (1' ..., n) with real coeffi- cients and B is a differential operation in (x, y) of order at most k-l. We assume that the coefficients of 1j(x, y, ) are functions of (x, y) of class C 1 , their derivatives are Holder continu- ous, and the coefficients of B are bounded and continuous. If the characteristic equation of (12), k Jek+ 2: 1j(x,y,)Jek-j=O, j1 has only distinct roots for  #0, then the C k _ solution of the Cauchy problem is unique in a neighborhood of x=a. Calderon proved this except for the cases k;:;, 4, n = 2, where a cer- tain topological difficulty arises. S. Mizohata (J. Math. Soc. Japan, 11 (1959)) succeeded in obtaining the proof for the excepti onal cases. This result can be extended to systems of equations under similar assumptions. See L. Hormander [4] for an extension to the com- plex coefficient case. S. Mizohata, T. Shirota, and H. Kumanogo discuss the uniqueness theorem for equations of double character- istics or of parabolic type. For nonlinear equations there are, in gen- eral, very few results about the global existence of solutions. For example, if the function Fin equation (2) in the normal form satisfies the Lipschitz condition with constants A and B that are independent of x, then Wf: get a global existence theorem. The method of proof of this theorem is as follows: First, we prove the existence for Ixl  /;1 and sufficiently small /;1 by Picard's tsuccessive iteration method. Then we regard x = /;1 as the hyperplane on which the initial values are assigned and proved the existence of a solution on /;1  Ix 1 I  /;2' and so on. The same method can be applied to non- linear systems of the first order if :hey are special types of quasilinear systems. G. Construction of Solutions by Asymptotic Expansion Let a(x, D) = LI.I,;;ma.(x)D. be a linear par- tial differential operator of order In, with coefficients of class Coo. We write a(x, ) = LI.I';;ma.(xW=h(x,)+h'(x,)+... with h and h' homogeneous in  of degree m and m- I, respectively. Let K: qJ(x) = 0 be a regular 
1199 surface of co dimension 1. We assume that K is a simple characteristic, i.e., hex, q>x) = 0 and (8h/8Ux, q>x)) #(0) on K. Let f.i(t) (j = 0,1, ...) be a sequence offunctions satisfying df.i/dt(t) = f.i-dt),j= 1,2, .... Then the equation a(x, D)u(x) = 0 has a formal solution in the form 00 u(x)= L f.i(q>(x))ui x ). j=O The coefficients uj(x) are obtained by solving successively the equations m L 1 [ujJ = L L k [u j -k+1J, k=2 where n 8h 1 n 8 2 h L 1 =. ;y-(x, q>x)Di +2:.  8 J:. 8 J:. (x, q>x)q>x,Xj -1 t l,j-l l I + h'(x, q>x) and Lk (2  k  m) are differential operators of order k depending only on a and q>. By this method of asymptotic expansion of solution, fundamental solutions and singularities of solutions for hyperbolic equations have been studied (Hadamard [5,6J; R. Courant and P. Lax, Proc. Nat. Acad. Sci. US, 42 (1956); Lax, Duke Math. J., 24 (1957); D. Ludwig, Comm. Pure Appl. Math., 22 (1960); S. Mizohata, J. Math. Kyoto Univ., 1 (1962);  325 Partial Differential Equation of Hyperbolic Type). If a(x, D) is an operator with analytic coeffi- cients, this formal solution is convergent. By using this fact, Mizohata constructed t n u11 solutions. In fact, put f.i(t) = tP+m+j/r(p+ m+ j) (pO),j =0,1, .... Define u(x) by u(x) = 0 for q>(x) O and u(x) = I:jof.i(q>(x))uJ(x) for q>(x)O obtained by the preceding process. Then u(x) is a nu11 solution of a(x, D)u(x) = 0 (Mizohata, J. Math. Kyoto Univ., 1 (1962)). The Cauchy problem in the case when the initial surface has characteristic points had been studied by 1. Leray and by L. Giirding, T. Kotake, and Leray. Let a(x, D) be a differential operator with holomorphic coefficients in a complex domain. Let S:s(x)=O be a regular surface and T be a subvariety of codimension 1 of S. Suppose that S is noncharacteristic on S - T, but characteristic on T. Consider the Cauchy problem (1) of Section A. Then the solution u(x) is ramified around a char- acteristic surface K that is tangent to S on T, and it can be uniformized (Leray, Bull. Soc. Math. France, 85 (1957); Giirding, Kotake, and Leray, Bull. Soc. Math. France, 92 (1964)). Next we consider the Cauchy problem (1) when the initial surface S (XI =0) is nonchar- acteristic, but the initial values wk(x) (0  k  m-l) have singularities on a regular sub- variety T (XI =X 2 =0) of S. We assume that 321 G PDEs (Initial Value Problems) h(O'I' 1,0, ...,0)=0 has m distinct roots. Then there exist m characteristic surfaces K j :q>i(X) = o (1  i  m) originating from T. q>j(x) is ob- tained by solving the tHamilton-Jacobi equa- tion hex, q>x)=O, q>(0, X2'"'' X n )=X2' Now, if wk(x) (0km-1) has a pole along T, the Cauchy problem (1) has a unique solution in the form (13) u(x)= i { [jPi + G i (X)10 g q>i(X)} +H(x), where Fj(x), G,(x) (I im) and H(x) are holo- morphic functions in a neighborhood of X = o and Pi (1  i  m) are integers> O. In order to obtain this solution, we set u(x) in the form u(x) = I::"=1 I:jof.i(q>j(X))UijX), where fo(t) is a function with a pole or a logarithmic sin- gularityat t=O chosen so that u(x) satisfies the initial conditions. Thus we can determine the coefficients u,jx) by solving successively the equations (13) on each K j (1  i  m) (Y. Hamada, Pub/. Res. Inst. Math. Sci., 5 (1969); C. Wagschal, J. Math. Pures Appl., 51 (1972)). When the multiplicity of characteristic roots is more than 1, the situation is not the same. For example, consider the Cauchy problem Dfu-D 2 u=0, 1 U(0,X2)=2' X D, u(O, X2) = o. The solution is 00 u(x)= L (-I)nn!xfn/(2n)!x+' k=O with essential singularities along X 2 = O. This situation occurs quite genera11y. We factor h(x,) = hI (x, )', ... hs(x, )\ where hi (1  i  s) are irreducible polynomials of degree m i in  with holomorphic coefficients. We assume that the equation nr=1 hj(O, j, 1,0, ... ,0) = 0 has p distinct roots (p=m 1 +... +ms). Then there exist p characteristic surfaces K i (1  i p) originating from T. We suppose more gener- a11y that the initial values wdx) are multi- valued functions ramified around T. Then the Cauchy problem (1) has a unique holomorphic solution on the universal covering space of V - Uf=1 K i , where V is a neighborhood of x = O. In fact, this is solved by transforming this problem into tintegrodifferential equations. Such a method of solution is closely related to the method discussed in this section. See Hamada, Leray, and Wagschal (J. Math. Pures Appl., 55 (1976)). In this case, even if the initial values have only poles, the solution in general may have essential singularities along Uf=1 Kj, but if a(x, D) satisfies Levi's condition (a(x, D) is well decomposable), the solution does not yield essential singularities along Uf=1 K j . See J. De Paris (J. Math. Pures Appl., 51 (1972)). 
321 Ref. PDEs (Initial Value Problems) References [lJ R. Courant and D. Hilbert, Methods of mathematical physics II, Interscience, 1962. [2J I. G. Petrovskii, Lectures on partial dif- ferential equations, Interscience, 1954. (Orig- inal in Russian, second edition, 1953.) [3J L. Hormander, Linear partial differential operators, Springer, 1963. [4J E. Goursat, Cours d'analyse mathema- tique, Gauthier-Villars, II, second edition, 1911; III, fourth edition, 1927. [5J J. Hadamard, Le90ns sur la propagation des ones et les equations de l'hydrodynamique, Hermann, 1903 (Chelsea, 1949). [6J J. Hadamard, Le probleme de Cauchy et les equations aux derivees partielles lineaires hyperboliques, Hermann, 1932. [7J C. Caratheodory, Calculus of variations and partial differential equations of first order, Holden-Day, 1965. (Original in German, 1935.) [8J S. Mizohata, Theory of partial differential equations, Cambridge University Press, 1973. (Original in Japanese, 1965.) [9J F. Treves, Linear partial differential equa- tions with constant coefficients, Gordon & Breach, 1966. [10J F. John, Partial differential equations, Springer, 1971. [llJ J. Dieudonne, Elements d'analyse, 4, 7, 8, Gauthier-Villars, 1971-1978. 322 (XIII.20) Partial Differential Equations (Methods of Integration) A. General Remarks The methods of integrating partial differential equations are not as simple as those for tordi- nary differential equations. For ordinary dif- ferential equations, we often obtain the desired solution by first finding the general solution containing several arbitrary constants and then specializing those constants to satisfy prescribed additional conditions. The situa- tion is more complicated, however, for partial differential equations. In the formal general solution of a partial differential equation we have arbitrary functions instead of arbitrary constants, and there are cases where it is im- possible, or very difficult, to find a suitable spe- cialization of these functions so that the given additional conditions are fulfilled. For this reason, methods of solution are rather specific and are classified according to the types of additional conditions. In many cases, we as- 1200 sume that a problem is mathematically well posed, guess the possible solutions, and verify it directly. A problem is said to be mathematically well posed (properly posed or correctly posed) if, under assigned additional conditions, the solution (i) exists, (ii) is uniquely determined, and (iii) depends continuously on the assigned data. By carefully examining problems in physics and engineering, we usually obtain many well-posed and important problems. In these problems, usually the data are suffi- ciently smooth functions. In these cases, part (iii) of the above definition (the continuous dependence of the solutions on th, data) fol- lows often from assumptions (i) and (ii). For example, telliptic equations like tLaplace's equation uxx+uyy=O for u(x,y) dt:scribe laws of static or stationary phenomena such as the field of universal gravitation, the dectrostatic field, the magneto static field, the steady flow of incompressible fluids without vortices, and the steady flow of electricity or heat. For this equation, tboundary value problems are well posed, but tinitial value problems are not ( 323 Partial Differential Equations of Elliptic Type). By contrast, for thyperbolic equations like U tt - U xx = 0 for u(x, t) and tpar.lbolic equa- tions like u, - U xx = 0 for u(x, t) which control the change (in reference to time) of the various stationary phenomena, initial value problems or mixed problems with both boundary con- ditions and initial conditions are well posed ( 325 Partial Differential Equations of Para- bolic Type). For the rest of this article we explain funda- mental and typical methods of integration ( Appendix A, Table 15). B. The Lagrange-Charpit Method For the partial differential equation of the first order F(x,y, u,p,q)=O, au p= ax ' (1 ) au q=' ay ' we consider a system of ordinary differential equations called tcharacteristic differential equations: dx dy du Fp Fq pFp+qF q -dp -dq -- Fx+pFu Fy+qFu (2) If we obtain at least one tintegral of this sys- tem containing p, q, and an<arbitrary constant a in the form G(x, y, u, p, q) = a, (3) and if we find p and q from (1) and (3), then du =pdx+qdy is an texact differential form, and by integrating it we get a solution <I>(x, y, u, a, b) 
1201 =0 of (1). A solution of (1) containing two arbitrary constants is called a tcomplete solu- tion. Here, setting b=g(a) and eliminating a from the two equations l1>(x, y, u, a, g(a») =0 and <l>a+<I>bg'(a)=O, we obtain a solution involving an arbitrary function. Such a solu- tion is caned a tgeneral solution of (1). For example, consider pq= 1. Since the character- istic differential equations are dx = dy  -dp = -dq q p 2pq 0 0' we have p=a (constant), and hence from the original equation we get q=a- 1 . Then du= adx+a- 1 dy is an exact differential form, and by integrating it a complete solution u = ax+a- 1 y+ b is obtained. A general solution is found by eliminating a from u=ax+a- 1 y+ g(a) and 0=x-a- 2 y+g'(a). Likewise, when we have n independent vari- ables XI' ..., X n in the equation F(x 1 , ..., X n ' U,Pt, .'. ,Pn)=O, au Pi=- a ' Xi we can use the tcharacteristic differential equa- tions to find a complete solution and a gen- eral sOlution (the Lagrange-Charpit method;  82 Contact Transformations). C. Separation of Variables and the Principle of Superposition The simplest and most useful method is the separation of variables. Concerning the equa- tion u; + u; = 1 in u(x, y), for example, by set- ting u= q>(X) + t/J(y) we obtain q>'(X)2 + t/J'(y)2 = lor q>'(X)2 = I-t/J'(y)2. Since the right-hand side is independent of X and the left-hand side is independent of y, both sides must be equal to the same constant 1X2. From this we get a (complete) solution involving two arbitrary constants IX, fJ: U=IXX+ y+fJ. For tlinear equations, it is often effective to write the solution as a product u = q>(x)t/J(y). From the theat equation u y = U xx ' we obtain by this process a relation t/J'(y)jt/J(y) = q>"(x)jq>(x), from which we get a particular solution u= ae-. 2y sin v(x -IX) containing a parameter v. Next, when the equation is linear and homogeneous, by forming a linear combina- tion of particular solutions that correspond to various values of a parameter v, we obtain a new solution (the principle of superposi- tion). For example, by integrating the solution e _.2y cos vx with respect to the parameter v between the limits -00 and 00 (namely, by a superposition consisting of a linear combina- tion and a limiting process), we obtain a new 322D PDEs (Methods of Integration) solution u= f oo e-.2Ycosvxdv= Fi exp ( _ x 2 ) -00 y 4y (y>O), (4) which is the tfundamental solution of the heat equation. This name refers to the fact that the function (4) can be used to obtain solutions of the heat equation under some initial condi- tions. More exactly, a solution ofuy-uxx=O that is a function of class C 2 in y > 0, is con- tinuous in yO, and coincides with a bounded continuous function q>(x) on y = 0 can be ob- tained by a superposition of the solution (4) such as (1') 1 f oo ( (X_)2 ) u(x,y)= C q>()exp -- 4 d. 2y11:y -00 Y The method of separation of variables ap- plied after a suitable transformation of vari- ables is often successful. In particular, by using orthogonal coordinates, tpolar coordinates, or tcylindrical coordinates according to the form of boundary, we often obtain satisfactory results. For example, concerning the boundary value problem for Laplace's equation u=uxx + U yy = 0, which is smooth in the circle r 2 = x 2 + y2 < 1 and takes the value of a given con- tinuous function g(O) on the circumference r = 1, we can use polar coordinates to rewrite the equation in the form u, I u=urr+-+2U66=0 r r and apply the method of separation of vari- ables to obtain particular solutions rncosnO, r n sin nO. Hence it is reasonable to suspect that by a superposition of these particular solutions we can obtain the desired solution: a 00 u(x,y)=+ I (ancosnO+bnsinnO)r n . 2 nl In fact, this series is a desired solution if the coefficients an' b n can be chosen so that the series converges uniformly for 0:::; r:::; I and can be differentiated twice term by term for 0:::; r < 1, and if we have a 00 g(O)=+ I (ancosnO + bnsin nO). 2 n1 By virtue of the uniqueness of the solution of an eniptic equation, this is the unique desired solution. The boundary value problem in the preceeding paragraph is wen posed. D. Mixed Problems For linear homogeneous equations of hyper- bolic or parabolic type, mixed problems fre- 
322D PDEs (Methods of Integration) quently appear, i.e., problems in which both initial and boundary conditions are assigned. These problems are, furthermore, classified into two types. For the first type, homogeneous boundary conditions are assigned. For example, in vibra- tion problems of a nonhomogeneous string between O::o;x::o;l, we must find the solution of the equation (T(x)ux)x = pUtt satisfying an initial condition u = f(x), u, = g(x) for t = 0, under homogeneous boundary con- ditions: (i) u = 0 for x = 0 and x = 1 in the case of two fixed ends; (ii) U x = 0 for x = 0 and x = 1 in the case of two free ends; (iii) - U x + <10 U = o for x=O, and u x +<11 u=O for x=1 (<1 0 >0, <11 >0), where the two ends are tied elastically to the fixed points; (iv) T(O)u(O) = T(l)u(l), T(O)u'(O) = T(l)u'(l) (periodicity condition); (v) u, u' are finite at x = 0 and x = 1 (regularity condition). The method of separation of variables is applicable to problems of this type also. For example, suppose that we have two fixed ends. Disregarding the initial condition for a while, we find a particular solution fulfilling only the boundary condition by setting u(x, t) = y(x)exp ivt. Here, y(x) must satisfy (T(x)y')'+v 2 p(x)y=0, y(O)=y(l)=O. (5) Except when the solution is trivial (i.e., u(x, t) =0), y(x)¥,O must hold. But in the special case T(x)== 1, p(x) == 1,1 = 1[, the values of v for which functions of this kind exist are only 1,2,3,... (the corresponding y(x) is sin vx). Also. in more general cases, nontrivial solu- tions y(x) exist only for some discrete values of v. These v are called teigenvalues of (5), and the corresponding solutions y(x) are called teigenfunctions for the eigenvalues v. That is, the desired particular solution can be obtained by solving the teigenvalue problem (5). Again, when T(x)==p(x)== 1,1=1[, particular solutions are sin nxexpint, sinnxexp( - int) (n = 1, 2,...). We consider 00 u(x,t)= L (ancosnt+bnsinnt)sinnx, (6) "=1 which is obtained by a superposition of these particular solutions. If we can determine the coefficients an, b n so that this series converges uniformly and is twice differentiable term by term, and 00 00 f(x) = L ansinnx, g(x)= L nb n sin nx, "=1 "=1 then the series (6) is a solution of the mixed problem in question. If the uniqueness of the solution of the mixed problem is proved, it is not necessary to look for any solution other 1202 than the one obtained by combining the method of separation of variables and the principle of superposition. Furthermore, the method described in this section is applicable to solving the following nonhomogeneous equation, which charac- terizes the motion of a string under the in- fluence of an external force f(x, t): U"-U xx = f(x, t) (7) under the boundary condition for the first type of mixed problem and the initial condition u= u t = 0 for t = O. In this case, we expand the unknown u and the function f(x, t) in terms of the system of eigenfunctions {sin nx}, and by substituting 00 u = L an(t)sin nx, n=l 00 f(x, t) = L An(t)sin nx n=l into (7), we reduce the problem to determining an(t) (n= 1,2, ...). When the external force varies with a harmonic oscillation over time as in f(x, t) = - q>(x)exp( - iwt), a sim ilar method can be applied by setting u=v(x)exp( -iwt). For mixed problems of the second type, the homogeneous initial condition u =0 0, u, = 0 for t = 0 is assigned, but the boundary condition is nonhomogeneous. For example, when an oscillating string is at rest until t =0 0, and for t > 0 its right end is fixed and its left end moves subject to an assigned rule, the be havior of the string is described by the solution of U" - U xx = o under the boundary condition u(O, t) = f(t), u(l, t) = 0 (t> 0). This is called a transient prob- lem. If we now choose an arbitrar y function B(x, t) that fulfills all the boundary and initial conditions, and if we set u - B(x, t} = v(x, t) and Bxx - B" = f(x, t), then v satisfies v,,-vxx=f(x,t) for t>O with V=V t =0 for t=O; v =0 for xo=O and x =1. Then v(x, t) describes the oscillation of a string that is at rest until t = 0 and moves under the effect of an external force represented by f(x, t) for t>O. Problems of this kind often appear in electrical engineering. Such problems can be reduced w problems of the first type in the manner described in the previous paragraph, but there are some direct methods that are more effective, the first being Duhamel's method. Consider the case where f(t) is the unit impulse function: { I, t>O, f(t)= 0, t<O, and let U(x, t) be a solution corre:;ponding to this case and vanishing for t::O; 0, -' > O. Then a solution corresponding to the general case is given by i t a u= -U(x,t-"[)f("[)d"[. oat 
1203 The second method is based on application of the tLaplace transformation. Denoting the Laplace transform of a solution u(x, t) by 1 00 - p t d V(x,p) ue t=-, o p we multiply both sides of u,,-uxx=O by e- pt and integrate the result with respect to t from o to 00. Then, taking account of the initial condition, we have v xx - p 2 V =0 with the boundary condition v(O,p)=p Loo f(t)e-Ptdt, v(l,p) =0. If for p=rx+ifJ (rx > rxo) we can find a solution v(x, p) of this boundary value problem for the ordinary differential equation, then the desired solution is given by 1 f v(x,p) u(x, t)=----; -ept dp, 2m L p where L is a path in rx > rxo para1le1 to the imaginary axis. This is ca1led the Bromwich integral. E. Green's Formula and Application of Fundamental Solutions Given a linear partial differential operator L[u]==Iap(x)DP u , P=(P1' ...,P.), P DP=a pt +... +Pnjaxf' ... dx%n, we ca1l the operator L*[v] == I( -1)P' +...+ PnDP(ap(x)v) P the adjoint operator of L, and the operator tL[v] == I( -1)Pt +... + PnDP(ap(x)v) P the transposed operator of L. Sometimes 'L is also ca1led the adjoint operator of L. In the complex Hilbert space, the adjoint operators are more appropriate than the transposed operators. In this case, the operator L * defined above is usua1ly ca1led the formal adjoint oper- ator to distinguish it from the one defined by (L[u],v)=(u,L*[v]) for a1l uED(L), D(L) being the domain of the definition of L (251 Linear Operators). These trans- posed or adjoint operators are often used to represent (at least loca1ly) the solutions u of L[u] = f We explain in more detail the specific case where the number of independent variables is 322E PDEs (Methods of Integration) 2. For the linear partial differential operator a 2 u a 2 u a 2 u L(u) = A(x, y) ax 2 + 2B(x, y) ax ay + C(x, y) ay2 au au + D(x, y) ax + E(x, y) ay + F(x, y)u and its adjoint partial differential operator M(v)= a 2 (Av) +2 a2 (BV) + a 2 (Cv) ax 2 axay ay2 a(Dv) a(Ev) ----+Fv ax ay , we have tGreen's formula: f L (vL(u)-uM(v))dxdy f ( ax a y ) =- P-+Q- ds, aD an an (8) where P=v{Au x + Buy} -u{(Av)x+(Bv)y} + Duv, Q=v{Bu x + CUy} -u{(Bv)x+(Cv)y} + Euv, (9) and aD denotes the boundary curve of the domain D, n the internal normal of aD at a point of aD, and s the arc length. We can apply this formula for solving a nonhomogeneous equation L(u) = f as fo1lows. Assume that there exists a solution of L(u)= f satisfying the assigned additional condition, and choose a tfundamental solution of M(v)= o having an adequate singularity at a point (xo, Yo) of D and fulfi1ling a suitable boundary condition. Then, if we substitute these solu- tions u and v into (8), we obtain an explicit representation of u(x o , Yo). If we can verify that the function u(x,y) thus obtained is a solution of L(u) = f fulfi1ling the assigned additional conditions, then we see, under the assumption of uniqueness of solutions, that this and only this function u(x,y) is the desired solution. For example, consider a boundary problem for a 2 u a 2 u a 2 v a 2 v L(u)=- a 2+- a 2,forwhichM(V)=+- a 2. x y ax y The problem is, for a circle of radius r with center at the origin, to find a function u(x, y) that is continuous in the interior and on the circumference C r of the circle, and that satis- fies L(u) = f in the interior of the circle (f is bounded and continuous in the interior of the circle) and is equal to a given continuous function 9 on the circumference Cr. In this case, we consider the circumference K. of sufficiently sma1l radius e with center (xo,Yo) contained in the interior of the first circle, and set v(x, y) =(lj2n)log Ijp + h(x,y), p =((x -xo)2 +(y- YO)2)112. (10) 
322 Ref. PDEs (Methods of Integration) We apply formula (8) to the domain D, en- closed by the circumferences K, and Cr. If, in (10), h satisfies M(v)=O in the interior of the circle enclosed by C r and vanishes on C" then we get ff vfdxdy= - f (VUn-UVn)dS+ f gvds. DE Kr C r By letting [; tend to zero, the first term on the right-hand side yields f 2" 1 alogp -u(xo,Yo) --pdO= -u(xo,Yo), o 2n ap by the logarithmic singularity of v at the point (x o , Yo). Therefore we have an explicit repre- sentation of u, U(XO,yo)= f gVndS- ff vfdsdy, (11) C r x2 + y2 r2 and it is easily verified that this is the desired solution. As stated in the previous paragraph, to apply Green's formula it is necessary to find a solution v of M (v) = 0 possessing a fundamen- tal singularity as the logarithmic singularity, i.e., a so-called fundamental solution. As fUn- damental singularities for av a 2 v a 2 v a 2 v M(v)=-- and M(v)=- By ax 2 ay2 ax 2 of parabolic type and of hyperbolic type, we must take those given respectively by v(x, y) { I ( exp = 2.. ./ n(y - Yo) 0, yYo, (X_XO)2 ) 4(y-yo) , y>Yo, and { 1/2, Ix-xol<y-yo, v(x, y) = 0, Ix-xol;;:'y-yo ( 323 Partial Differential Equations of Ellip- tic Type; 325 Partial Differential Equations of Hyperbolic Type; 327 Partial Differential Equations of Parabolic Type). References [IJ R. Courant and D. Hilbert, Methods of mathematical physics, Interscience, I, 1953; II, 1962. [2J E. Goursat, Cours d'analyse mathema- tique, Gauthier-Villars, III, 1927. [3J A. G. Webster and G. Szego, Partielle Differentialgleichungen der mathematischen Physik, Teubner, 1930. [4J L. Schwartz, Mathematics for the physical sciences, Hermann, revised edition, 1966. 1204 [5J J. L. Lions, Equations differentielles opera- tionelles et problemes aux limites, Springer, 1961. [6J A. Friedman, Partial differential equations, Holt, Rinehart and Winston, 1969. 323 (XIII.24) Partial Differential Equations of Elliptic Type A. General Remarks Suppose that we are given a tlinear second- order partial differential equation n a 2 U n ilu L[uJ == i.1 aij(x) aXiaXj + i b,(x)a x i +c(x)u = f(x), (1) where x = (x 1, ... ,X n ), a ji = aij' If the quadratic form :E aijij in  is tpositive definite at every point x of a domain G, this equation (or the operator L) is said to be elliptic (or of elliptic type) in G. For n = 2, all (x)a 22 (x).-(au(x))2 > o is the condition of ellipticity. In this case, by a change of independent variables, the equation is transformed locally into the canon- ical form a 2 u a 2 u au au ax 2 + a y 2 + b d x ,y) ax +b 2 (x,y) ay +c(x,y)u = f(x, y). The operator :E?=1 a 2 laxl, denoted by ,i, is called Laplace's operator (or the Laplacian). The simplest examples of elliptic equations are ,iu = 0 (Laplace's differential equation) and ,iu = f(x) (Poisson's differential equation) ( Appendix A, Table 15). B. Fundamental Solutions Let K be the n-dimensional ball with radius R, center Xo, boundary Q (an (n-l)-dimensional sphere), and area Sn' and let r be the distance from Xo to x. Then for any function u(x) of class C 2 we have 1 i Rn-1 u(xo)=- udS Sn n (n-2)Sn x t (r 2 - n - R2-n),iudx for n>2, I i I f R u(xo)=- udS -- log-,iudx 2n n 2n K r for n=2. 
1205 Thus, if u is a solution of Poisson's equation u = f(x), we have a representation of u(xo) by replacing u by f in the integrals just given. Next, concerning the solutions of the equation u + cu =0 (c> 0, constant), the folIowing relation holds:  r udS= r(n/2)Jn,(R) u(x o ), Sn Jo (R/2)"' where r is the tgamma function, n' = (n - 2)/2, and J. is the tBessel function of order v. Now, if we put V(x,)= I f ((xi-i y-n/2 =r 2 - n , n;d, ( 2 ) -1/2 1 log .L (xi-i =log-, n=2, r=l r then the function u(x) defined by u(x)= - f V(x, )f()d, W n G n-2 W =2nn/2_ n r(n/2) represents a tparticular solution of u = f(x), where V(x,) as a function of the variables x satisfies  V(x, ) = 0 except at x =. Consider now the more general case (1). A function E(x,) is calIed a fundamental solution (or elementary solution) of (1) or of L if u(x)= L E(x, )fWd provides a solution of (1) for any f(x)eC OO with compact support. A fundamental solution E(x,) is a solution of the equation L[E] =0 having a singularity at x =  of the form _W;I r 2 - n (n3) or w21logr (n = 2), where a()=det(aij(m and r= (I:ail()(xi-i)(Xj-j»)1/2 and (ail) is the inverse matrix of (aij)' Roughly, there are three different methods of constructing fundamental solutions. The first and most general is to use pseudodif- ferential operators ( 345 Pseudodifferential Operators). The second is that of J. Hadamard [1], which uses the geodesic distance between two points x and  with respect to the Rie- mannian metric I: aiJ(x)dxidXj. This is impor- tant in applications of elIiptic equations to geometry. The third method is due to E. E. Levi [2] and is as folIows: Let { r, n3, V(x, )= 2- l n - ogr, n=2. To obtain a solution of L[u] = f(x) we set u= L V(x,)q>()d. 323C PDEs of Elliptic Type Writing L[V(x, m = X(x, ), we obtain the folIowing tintegral equation of Fredholm type in q>(x): 1 f 1 q>(x)-- X(x,)q>()d=-f(x). W I1 G W n If we denote the tresolvent of this equation by X(x, ) and put y(x, )= V(x, )-wn L V(x, OX(', ) J a(') d', then E(x,)= -y(x,)/Wn is seen to be a fundamental solution of L. Thus Levi's method enables us to construct a fundamental solution 10calIy by successive approximation, because the integral equation in q> as above has a unique solution expressible by Neumann series if the domain G is smaIl enough ( 189 Green's Operator). C. The Dirichlet Problem Let G be a bounded domain with boundary r. We calI the problem of finding a solution u of the given eIliptic equation in G that is con- tinuous on G U r and takes the assigned con- tinuous boundary values on T the first bound- ary value problem (or Dirichlet problem). In particular, the Dirichlet problem for u=O has been studied in detail ( 120 Dirichlet Problem). If c(x) < 0 and f(x):::; 0 or if c(x):::; 0 and f(x) < 0 in (1), a solution u does not attain its local negative minimum in G. This is calIed the strong maximum principle ( [3,4] for Hopf's maximum principle and Giraud's theo- rem). The maximum principle is one of the most powerful tools available for the treat- ment of elIiptic equations of the second order with real coefficients. From this it folIows that the solution of the Dirichlet problem for (1) is unique if c(x) < O. Furthermore, concerning the uniqueness of the solution, we have the folIow- ing criterion: If there exists a function w(x) > o of class C 2 in G and continuous on G U r such that L[w(x)] <0, then the solution of the Dirichlet problem is unique. Let G be a tregular domain in the plane. If we denote tGreen's function of  relative to the Dirichlet problem in G by K(x,y;,IJ), then the solution u of u = f(x, y) vanishing on r is given by u(x,y)= - 2 1 n f L K(x,y;, lJ)f(,IJ)ddlJ, where f(x, y) satisfies a tHOlder condition. The Dirichlet problem for au au L[u] ==u+a(x,y)-+b(x, y)-+c(x,y)u ax ay = f(x,y) 
323D PDEs of Elliptic Type with prescribed boundary values reduces to the problem in the previous paragraph. In fact, let h(x, y) be a function that coincides with the prescribed boundary value on f. Then if we put U = h(x, y)+ v, the problem is reduced to finding a solution v satisfying an equation similar to the one for L[u] (replacing f by f - L [h]) and vanishing on r. Now suppose that f consists of a finite number of tJor- dan curves whose tcurvatures vary continu- ously, a and b are continuously differentiable, and c and f satisfy Holder conditions. Let K(x, y;, 1]) be Green's function for the Diri- chlet problem relative to. To find a solution of L[u] =.f vanishing on r we set u= -2kIt K(x,y;,I])p(,I])ddl]. Then p is a solution of the integral equation p(x,y)+ It H(x,y,,I])p(,I])ddl]=f(x,y), where H(x,y,, 1])=( -lj2n){a(x,y)KAx,y; ,I])+b(x, y)Ky(x,y;, 1]) +c(x,y)K(x,y;, I])}. Therefore we have the following alternatives: Either L[u] = f has a unique solution for any f and any given boundary values on f, or L[u] = 0 has nontrivial solutions vanishing on r (in this case the number of linearly indepen- dent solutions is finite). More general equations of type (1) have been studied by 1. Schauder and others. Schaucler proved, first, that when the bj(x) and c(x) are zero, aj)x) and f(x) satisfy Holder conditions, and the boundary r of G is of class C 2 , then there exists a unique solution u(x) vanishing on r. Next, when the bJx) and c(x) are continuous and aij and f satisfy Holder conditions, he showed the following alterna- tives: Either L[u] =.f admits a unique solution vanishing on r for every!, or L[u] =0 has nontrivial solutions vanishing on r (in this case, the number of linearly independent solu- tions is finite). In (1), suppose that a jj , bi, and c are Holder continuous of exponent (X (0 < (X < 1) uniformly on G and that f is of class C2+.. Then we have the following inequality (Schauder's estimate) for any UEC2+'(G): IluI12+a.G K 1 (1IL[u] II..G + Il u ll2+..rJ +K21Iullo.G' (2) where 11'11k,S stands for the norm in the func- tion space C\S) ( 168 Function Spaces). Kl and K. 1 are positive constants depending on L, G, and (X but independent of u. More precisely, K 1 depends only on the ellipticity constant A of L defined as the smallest number;:' 1 such that , -1 1 1' 1 2 " 2 Ie " L.,aj)xgjjAI1 1206 for any (x, )E G x R n . The inequality (2) is one of the most important a priori estimates in the theory of elliptic equations [5,6] ( inequality (9) in Section H). D. Quasilinear Partial Differential Equations Consider the second-order partial differential equation in U(Xb ..., x n ): F(x 1 , ""Xn,U,Pl' "',Pn'Pl1' ""Plj' ...,Pnn)=O, where Pj = oujox j , Pij= 0 2 ujOXjOX j . If, for a solution u(x), L7.j1 (oFjopjjKj is a positive definite form, we say that the equation is ellip- tic at u(x). Moreover, if for any values of u P Pij this quadratic form is positive definite, e" say simply that the equation is of elliptic type. The equation is called quasilinear if F is linear in Pij' For example, the equation of tminimal surfaces ( 334 Plateau's Problem) (1 + pDp11 - 2P1P2P12 +(1 + Pi)P22 =0 is a quasi linear elliptic equation. Furthermore, u = f(x, y, u, u x , u y ) is also elliptic. E. Picard solved this equation by the method of successive appro!(imation [7]. Specifically, let h(x, y) be the thannonic func- tion taking the assigned boundary values on r. Starting from uo(x, y) = h(x, y), we define the functions un(x, y) successively as the solutions of un = f(x, y, Un-I' oun_rlo x , oun-rlay), coinciding with h on r. Let K(x, y;, 1]) be Green's function in G and in the region de- fined by lu-h(x,y)IA, Ip-hxlB, Iq- hyl B, (X,y)EG, and let the supremum of If(x,y,u,p,q)1 be C. Assume now that 2 It Kddl]A,  ItIKxlddl]B, 2 I t IKylddl]B, and that f satisfies a Holder condition in (x, y). Moreover, let If(x,y, u',p',q')- f(x, y, u, P, q)1 Llu-u'l + L'(lp' -pi +Iq' -ql). Assume finally that It (LK +L'(IKxl +IKyl))ddl]Y< 1. Under these assumptions, {un(x, y)} is uni- formly convergent, and the limit IJ(x,y) coin- ciding with h(x, y) on r satisfies (3) u = f(x, y, u, oujox, oujoy). 
1207 Furthermore, it is known that the solution is unique within the region mentioned above. This method can be applied when Gis small and the values of hx and hy are limited. When f does not contain P and q, the fol- lowing method is known. Let (g(x, y) and w(x, y) both be continuous on G U r and of class C 2 in G. Suppose that f(x, y, u) satisfies a Holder condition in (g(x,y):::;u:::;w(x,y), and that (g f(x,y,(g(x, y)), w:::;f(x,y,w(x,y)). Then; given a continuous function qJ on r such that (g:::;qJ:::;w, there exists a unique solution u of u= f(x,y, u) such that (g(x,y):::;u(x,y):::;w(x,y) in G, u(x,y)=qJ on r. Finally, consider the equation 02U 02U 02U A-+2B-+C-=0 OX2 oxoy oyl , where A, B, and C are functions of x, y, u, P, q and AC - B 2 >0. Under the following condi- tions, there exists a solution of the Dirichlet problem: A, B, C are of class C 2 , and their derivatives of order 2 always satisfy Holder conditions; G is tconvex; and the boundary value qJ along r considered as a curve in xyu- space represented by the parameter of arc length is of class C3+a (IX> 0). Moreover, any plane having 3 common points with this curve has slope less than a fixed number A. The proof of this theorem is carried out in the following way: For any function u satisfying lu(x, y)l:::; maxlqJl, IUxl :::;A, IUyl :::;A, replacing u, p, q by u(x, y), ux(x, y), uy(x, y), respectively, in A, B, and C, we have the linear equation in v: 02V 02V 02V A[uJ OX2 + 2B[uJ oxoy + C[uJ oy2 =0. We can obtain the solution v taking the boundary value qJ on r. Thus we have a mapping u-+v. Applying the tfixed-point theorem in function space to this mapping, we have the desired solution v(x, y) = u(x, y). Concerning the Dirichlet problem for the second-order semilinear elliptic partial dif- ferential equation n 02U ( ou ou ) . aix)=f x,u,-;-, """ ' ',j=1 uXfux j UXI uX n a work by M. Nagumo (Osaka Math. J., 6 (1954)) establishes a general existence theorem. For the general nonlinear equation F(x 1 , ...,X n ,U,P1' ...,Pn,PI!> ,,,,Pij' ...,Pnn)=O, ifL.(oF/op;j)ij>O, F.:::; 0, the solution of the 323E PDEs of Elliptic Type Dirichlet problem for this equation is unique. Even when F.:::;O does not hold, the conclu- sion remains the same if we can reduce the equation to this case by a suitable change of variables. Quasilinear equations in divergence form n 0 ( ou ou ) .L "af x, u,-;-,... ,-;- .=1 uX; uX 1 uX n = f ( x,u,, ..., u ) , OX 1 uX n (4) or more general1y, any quasilinear el1iptic equation f a;j ( x,u,,..., )  f.j=1 oX 1 oX n oxfox j =f ( X,U,,..., u ) , (5) OX1 UX n and even quasilinear el1iptic systems have been treated in detail in several recent works [8,9J. J. Serrin [IOJ treated the Dirichlet problem and established the existence and the unique- ness of solutions for some classes of equations of type (5) containing the minimal surface equation. His method is to estimate the maxi- mum norms of u and of its first derivatives, to apply a result of O. A. Ladyzhenskaya and N. N. Ural'tseva [9J and the Schauder estimate (2), and final1y to use the Leray-Schauder fixed-point theorem [11]. E. Relation to the Calculus of Variations Consider the bilinear form f ( ou ou J = La;j(x)"" G UX i uX j ou ) + 2 L b;(x) ox; u + c(x)u 2 + 2f(x)u dx, where we assume L.aolx);j > O. Under the boundary conditions imposed on u, if there exists a function u that makes J minimum, then assuming some differentiability condi- tion on afj(x), bf(x), c(x), we have the tEuler- Lagrange equation o ( ou ) " afj(x)" J UX) l uX i - ( C(X)- L  b;(X) ) U- f(x)=O, ; ox; which is a linear second-order self-adjoint elliptic equation. B. Riemann treated the simplest case, where aix)=b!, b;(x)=c(x)=O, i.e., the case u=O. He proved, assuming the existence of the mini- mum of J, the existence of the solution of u= o with assigned boundary values. This result, 
323 F PDEs of Elliptic Type called Dirichlet's principle, was used by D. Hilbert, R Courant, H. Weyl, O. Nikodym, and others to show the existence of solutions for linear self-adjoint elliptic equations. If F(x" ...,Xn,U,Pl, ...,Pn) satisfies L.(i!2 F/i!Piap)ij> ° (and F has some regu- larity), then the function that minimizes the integral f ( au au ) J= F X"...,Xn,U,-, ..,- dXt....,dxn G aXl aX n with the given boundary condition satisfies the Euler-Lagrange equation (of type (4» n ;Pu n au n I FpiPJ+ I Fup'- a + I Fx,p,-Fu=O i,j=l OXiOXj i=l Xi i='l (Pi = au/ax;) and the boundary condition as well. This is also an ellIptIc partial differential equation in u. The case where F is a function of P alone (in particular, the case of tmini- mal surfaces) has been studied by A. Haar, T. Rad6, J. Serrin, and others, particularly for F = (1 + pi +... + p;;)1/2 in the case of the mini- mal surface equation. F. The Second and Third Boundary Value Problems Let G be a domain in R n with a smooth boundary consisting of a finite number of hypersurfaces. Also, let B[u] be the boundary operator defined by au B[u] = a-;;-+ pu, ov where a = (L.71 (L.;l aijcos(vox)2)1/2, Vo is the outer normal of unit length at the point XE S, and v is the conormal defined by n cos(vx;)= I aijcos(vox)/a, i= 1, ...,n. j=' The problem of finding the solution u(x) of the equation L[u] = f continuous on the closed domain G and satIsfying B[u] = cp on the boundary S of G is called the second boundary value problem (or Neumann problem) when {3 == 0, and the third boundary value problem (or Robin problem) when fJ ¥= 0. In general, in boundary value problems, the condition that the solutions must satisfy at the boundary is called the boundary condition. We assume that the boundary S of G is expressed locally by a function with tHolder continuous first deriva- tives (G is then called a domain of class ct.}'). Assume that G is such a domain, C  0, p  0, and at least one of c and fJ is not identically 0. Then the second and third boundary value problems admit one and only One solution. When (' == ° and fJ == 0, the solutions of the 1208 second boundary value problem are deter- mined uniquely up to additive constants. Fur- thermore, let M be the tadjoint operator of L, and let av B'[v] =a-+(fJ -b)v, av where b= i COS(VOXi{b i - j J . Then if the boundary S of G is of class C 1 and .r, cp are continuous, in order that there exist at least one solution u of the second or third boundary value problem relative to L[u] = f, it is necessary and sufficient that f fVdx- f cpvdS=O, G S where v is any solution of M[v] =0 with the boundary condition B' [v] = 0. Here the neces- sity is easily derived from Green's formula. G. Giraud used the notion offundamental solu- tion to reduce the second and third boundary value problems relative to L[u] = f to a prob- lem of integral equations, under the assump- tions that G is a domain of class C 1, the co- efficients of Land f satisfy Holder conditions, and cp and fJ are continuous [3; also 12]. G. Method of Orthogonal Projection (6) The theory of tHilbert spaces is applicable to the boundary value problems in Section F. In general, let Hm(G) be the space of functions in L 2 (G) whose partial derivatives in the sense of tdistributions up to order m belong to L2(G). For elements f and g in Hm(G), w define the following inner product: (f, g)m = I f Daf(x) Dag(x) dx, lal';;m G where alai D a = ,IIJ(I =1J(1 +... +lJ(n aX1 .., ax: n ( 168 Function Spaces). With respect to this inner product, Hm(G) is a Hilbert space. When u satisfies L[u] = f (fEL 2 (G» and cp(x) is an arbitrary element of HI (G), Green's formula yields n ( au acp ) n ( , au ) -. I aij- a .' _ a . +. bi(x)- a " cp 1,)=1 X, XJ 1-1 XI f au +(c(x)u,cp)+ s a av ' ipdS=(f,cp), where b;(x)=bj(x)= - 'L j (8auf8x j ). Taking account of the boundary condition on u: 
1209 a8uj8v+ {Ju=O, we get ?: ( ajj 8 8U. , 8 8(P ) -  ( bi 8 8U , cp ) -(CU, cp) l,J X, xJ I x, + !s{JUq>dS= -(f,cp). Thus the problem is reduced to finding U(X)E H1(G) satisfying this equation for all cpEH 1 (G). This equation can be regarded as an equation in H 1 (G). If necessary, by replacing c(x) by c(x) - t for a large t, we can show that for any f(X)EL2(G), there exists a unique solution u(x)EH 1 (G) [12, 13]. Now, for a solution of this functional equation, if we take cp(X)E .@(G), we have (u,L*[cpJ)=(f,cp), where L* is the tadjoint operator of L. This means that u(x) is a solution of L[u] = f in the sense of distributions, and we call such a u(x) a weak solution. Such a treatment may be called the method of orthogonal projection, following Weyl. In this case, it can be shown that if we assume smoothness of the coefficients, the boundary S, and {J, then the solution U(X)E H1(G) belongs to Hs+2(G) when f(X)E HS(G) (s = 0, 1, ...). Thus, if we apply Green's for- mula, we can see that u satisfies the boundary condition a 8uj8v + {Ju = O. In particular, when s> nj2, we see that U(X)E C 2 (G) by tSobolev's theorem [12]. In other words, u(x) is a gen- uine solution. Next, we introduce the complex parameter A and consider the boundary value problem (L + AI) [u] = f, fE L 2 (G), a8uj8v + {Ju=O. If t is large, (L-tl) is a one-to-one mapping from the domain 2t1(L) = {uEH 2 (G)la8uj8v+ {Ju=O} onto L 2 (G). Thus, denoting its inverse, which acts on the equation from the left, by G" we have (I +(A+ t)G,)[u] = G.f Conversely, since the solution u(x) contained in L 2 (G) (hence also contained in 2t1(L)) satis- fies the equation and the boundary condition, the problem is reduced to the displayed equa- tion in L 2 (G) considered above. Now, since G is bounded and G, is a continuous mapping from L 2 (G) into H 2 (G), Rellich's theorem yields that G, is a tcompact operator when it is re- garded as an operator in L 2 (G). So we can apply the tRiesz-Schauder theorem ( 189 Green's Operator). H. Elliptic Equations of Higher Order The differential operator of order m: 8 1 . 1 L = L a.(x) D., D "- , 1.I<;m 8x'...8x:n loci =OC 1 + '" +oc., 323H PDEs of Elliptic Type is called an elliptic operator if :E1.I=m aa(x)a # 0 ( # 0). In particular, if Re L aa(xWcllm, c>O, lal=m L is called a strongly elliptic operator. In this case, m is even. L. Garding studied the Diri- chlet problem for strongly elliptic operators [14]. If we put m=2b, the boundary value condition is stated as 8 j uj8v j = Jj(x) (j=0, 1, .., , b - 1), where v is the normal of unit length at the boundary. Using the notion offunction space, this boundary condition means that the solutions belong to the dosure Hb(G) of 2t1(G) in Hb(G) ( 168 Function Spaces). In this treatment, Girding's inequaiity (-l)bRe(L[u], u)bllull -clluI1 2 , uEHb(G), (7) where band c are positive constants, plays an important role. In general, for an elliptic operator L defined in an open set G, if u(x) satisfies L[u] = f(x) and f(x) belongs to H S on any compact set in G, then u(x) belongs to Hm+s on every compact set in G (Friedrich's theorem [15]). General boundary value problems for elliptic equations of higher order have been consid- ered by S. Agmon, A. Douglis, and L. Niren- berg [16], M. Schechter [17], and others. These problems are formulated as follows: L[u]=f(x), Bix,D)u(x)=cpj(X)' XES, j=I,2,...,b(=mj2), (8) where the Bix, D) are differential operators at the boundary and f and {cpj} are given functions. Under certain algebraic conditions (Shapiro-Lopatinskii conditions) on (L, {B j }), the problems are treated also in Hm(G); hence the L 2 a priori estimates playa fundamental role: IfUEHm(G), b lIullm:::;K(jILull + L IIBjulim-mj-(1/2).S+ lIull), j=1 (9) where K is a constant determined by (L, Bj, G), 11'11k,S is the norm in Hk(S), and m j are the orders of Bj (compare (9) with (2)). Under these estimates the boundary value problem is said to be coercive. In applications, the theory of interpolation of function spaces are also used [18] ( 168 Function Spaces). Variational general boundary value problems have been treated by D. Fujiwara and N. Shimakura (J. Math. Pures Appl., 49 (1970)) and others. For systems of such equations, there are works by F. E. Browder (Ann. math. studies 33, Prince- ton Univ. Press, 1954, 15-51) and others. 
323 I PDEs of Elliptic Type I. Analyticity of Solutions In a linear elliptic equation Lu = j; suppose that all the...coefficients and f are of class CX! (resp. real analytic) in an open set G and that u is a distribution solution in G. Then u is also of class Coo (resp. real analytic) in G [19]. Hence the linear elliptic operators are hypoelliptic (resp. analytically hypoelliptic) ( 112 Dif- ferential Operators). In particular, tharmonic functions (i.e., solutions of u = 0) are (real) analytic in the domain of existence, whatever the boundary values may be. Hilbert conjectured that when F(x, y, u, p, q, r, s, t) (P=Pr. q= P2, r= PI r. s= P12' t= P22) is analytic in the arguments, then any solu- tion u of the elliptic equation F = 0 is analytic on the domain of existence (1900, Hilbert's 19th problem;  196 Hilbert). This conjec- ture was proved by S. Bernshtein, Rado, and others, and then H. Lewy proposed a method of extending this equation to a complex do- main so that it can be regarded as a thyper- bolic equation (Math. Ann., 101 (1929)). This result was further extended by I. G. Petrovskii to a general system of nonlinear differential equations of elliptic type (Mat. Sb., 5 (47), (1939)). J. The Unique Continuation Theorem Since all the solutions of Laplace's equa- tion u = 0 are analytic, it follows that if u(x) vanishes on an open set in a domain, then u(x) vanishes identically in this domain. This unique continuation theorem can be extended to linear elliptic partial differential equations with analytic coefficients in view of the analyticity of solutions. This fact is also proved by apply- ing tHolmgren's uniqueness theorem ( 321 Partial Differential Equations (Initial Value Problems)). The unique continuation theorem, first established by T. Carleman for second- order elliptic partial differential equations L[u] =0 with C 1 -coefficients in the case of two independent variables, was extended to second-order linear elliptic equations with C 2 _ coefficients in the case of any number of inde- pendent variables by C. Muller, E. Heinz, and finally by N. Aronszajn [20]. This re- search was extended by A. P. Calderon [21] and others in the direction of establishing the uniqueness of the Cauchy problem. However, it is to be remarked that even if we assume that the coefficients are of class Coo, we cannot affirm the unique continuation property for general elliptic equations. A counterexample was given by A. Plis (Comm. Pure Appl. Math., 14 (1961)). See also the work of K. Watanabe (Tohoku Math J., 23 (1971)). 1210 K. Elliptic Pseudodifferential Operators and the Index A pseudodifferential operator P(x, D) with symbol p(x, c;)ES;;'.1 ( 345 Pseudodifferential Operators) is said to be elliptic, provided there exists a positive constant c such that Ip(x, )I;o c(1 + I W m for all XE R" and I I;oc -1. The no- tion of ellipticity can be extended to oper- ators on a manifold. The theory of elliptic pseudodifferential operators has been widely applied to the study of elliptic differential equations, and is particularly useful in the calculation of the tindex of elliptic operators. B. R. Vainberg and V. V. Grushin [22] cal- culated the index i of the tcoercive boundary value problem for an elliptic opera10r by showing that i is equal to the index of some elliptic pseudodifferential operator on the boundary. Example [23]: Given a real vector field (VI' V2) on the unit circle xi + x = 1, suppose the vector (vr(x), V2(X) rotates I times around the origin as the point x = (x l' x 2 ) moves once around the unit circle in the positi"e direc- tion. Then the index of the boundary value problem ( ::i + aa: )U(X)= f(x), au au v 1 (x)- a +V2(X)- a =g(x), XI X 2 xi + x <1, xi + x = 1, is equal to 2 - 2/. M. F. Atiyah and I. M. Singer determined the index of a general elliptic operator on a manifold in terms of certain topological invar- iants of the manifold ( 237 K -Theory H). The index of noncoercive boundary value problems has also been studied by Vain berg and Grushin, R. Seeley (Topics in pseudo- differential operators, C.I.M.E. 1968,335- 375), and others. L. The Giorgi-Nash-Moser Result Let us state the following result (1. Moser [24]): Let L be of the form " a { Du } Lu=I _ a aij(x)- a ' 1.;=1 X j Xi where aij=a j ; are real-valued, of class Loo(G), and such that the ellipticity condition (3) holds at almost everywhere in G with some A.;o 1. Also, let G' be any subdomain of G whose distance from aG is not smaller tha n b > O. Then, for any weak solution uEH 1 (G) of the equation Lu=O and for any two points x and y in G', we have the inequality lu(x)-u(y)1 A Ix- YI"lluIIL2(G)' 
1211 where A and IX (A> 0, 0 < IX  1) depend only on (n, A, 15) and are independent of the particular choice of (L, G, G', u). Moser proved that the above inequality is a corollary to a Harnack- type inequality ( 327 Partial Differential Equations of Parabolic Type G). M. Asymptotic Distribution of Eigenvalues Let L be an e11iptic operator on G of order m with smooth coefficients realized as a self- adjoint operator in L 2(G) under a nice bound- ary condition, where G is a bounded domain in R" with smooth boundary. Let N(T)(T>O) be the number of eigenvalues of L smal1er than T. Then, it hOlds that N(T)=CT"/m+ an error term, as T --> +00, c=(2n)-" f dx f a(x,)-"/mdS, (10) G 8 n - 1 where a(x,) is the tprincipal symbol of Land S"-1 is the unit sphere on R". C is independent of the boundary condition and, if L is of con- stant coefficients, of the shape of G. Formula (10) was at first established by H. Weyl [25] for the case of L = Laplacian, and hence it is often called Weyl's formula. Weyl's method is based on the minimax prin- ciple [26]. T. Carleman (Ber. Math.-Phys. Klasse der Sikhs. Akad. Wiss. Leipzig, 88 (1936)) studied the behavior of the trace of the Green's function of zI-L as Izl--> 00 in the complex plane ( [27]). S. Minakshisun- daram (Canad. J. Math., 1 (1947)) discussed this fonnula in connection with the heat equa- tion; see also S. Mizohata and R. Arima (J. Math. Kyoto Univ., 4 (1964)). L. Honnander (Acta Math., 121 (1968)) treated the case of compact manifolds without boundary and obtained the best possible error estimate. H. P. McKean and I. M. Singer (J. Differential Geometry, 1 (1967)) treated the case of mani- folds and discussed the geometric meaning of this formula. In general, N(T) is no more than O(T"/m) if L is of degenerate el1iptic type (c. Nordin, Ark. Mat., 10 (1972)). N. Equations of Degenerate Elliptic Type An operator L of the fonn (1) is said to be degenerate at Xo E G in the direction  E R" if  is a null vector of the matrix (aiixO)). L is said to be of degenerate elliptic type if (a;ix)) is nonnegative definite at any xEG and if L is degenerate at some point of G in some direction. Suppose that the coefficients of L and the boundary r of G are smooth enough. At x E r, denote by v(x) =(V1 (x),..., v"(x)) the unit outer 323 Ref. PDEs of Elliptic Type normal vector to G. Let L3 be the set of x E r at which L is not degenerate in the normal direction. Also, let L1' L2' and LO be the sets of XEr\L3 at which b(x»O, <0, and =0, respectively, where b(x) is defined by " { " oa. (X) } b(x)= -; vi(x) bi(X)-) ;j . Then the Dirichlet problem for equation (1) is to find a function u(x) defined on G U L2 U L3 satisfying (11) L[u]=f in G, (1 ) (12) u=g on L 2 UL 3 , where f and g are given functions. Let l<p<oo. We set q=p/(p-1). We also put * ( ) ( )  ob;(x)  02aiix) c x=cx-L..,-+ L..,-. ;=1 ox; ;.)=1 oxiox) (13) We have the following existence theorem [28]: If (i) either c <0 on G or c* <0 on G and if (ii) pc + qc* < 0 on G, the Dirichlet problem (1) and (12) (with g=O) has a weak solution u E U( G) for any f E U( G). The regularity of solutions is also discussed in [28]. The value of b(x) is closely related to the regularity near the point x E r if L is degenerate at x in the nor- mal direction ( also M. S. Baouendi and C. Goulaouic, Arch. Rational Mech. Anal., 34 (1969)). Degenerate el1iptic equations of type (1) have also been investigated from the proba- bilistic viewpoint ( 115 Diffusion Processes). The general boundary value problems for degenerate elliptic equations of higher order have been treated by M. I. Vishik and V. V. Grushin [29], N. Shimakura (J. Math. Kyoto Univ., 9 (1969)), P. Bolley and 1. Camus (Ann. Scuola Norm. Sup. Pisa, IV-1 (1974)), and others. References [1] 1. Hadamard, Lectures on Cauchy's prob- lem in linear partial differential equations, Yale Univ. Press and Oxford Univ. Press, 1923. [2] E. E. Levi, Sul1e equazioni lineari total- mente el1iptiche aile derivate parziali, Rend. Circ. Mat. Palermo, 24 (1907), 275-317. . [3] C. Miranda, Partial differential equations of elliptic type, Springer, 1970. [4] M. H. Protter and H. F. Weinberger, Maximum principles in differential equations, Prentice-Hall, 1967. [5] J. Schauder, (Jber lineare el1iptische Dif- ferentialgleichungen zweiter Ordnung, Math. Z., 38 (1934), 257-282. [6] 1. Schauder, Numerische Abschiitzungen in 
324 A PDEs of First Order elliptischen linearen Differentialgleichungen, Studia Math., 5 (1934),34-42. [7] E. Picard, Leon sur quelques problemes aux limites de la theorie des equations differen- tielles, Gauthier-Villars, 1930. [8] C. B. Morrey, Multiple integrals in the calculus of variations, Springer, 1966. [9] O. A. Ladyzhenskaya and N. N. Ural'ts- eva, Linear and quasilinear elliptic equations, Academic Press, 1968. (Original in Russian, 1964.) [10] J. Serrin, The problem of Dirichlet for quasilinear elliptic equations, Philos. Trans. Roy. Soc. London, (A) 264 (1969), 413-493. [11] 1. Leray and 1. Schauder, Topologie et equations fonctionnelles, Ann. Ecole Norm. Sup., 51 (1934),45-78. [12] S. Mizohata, The theory of partial dif- ferential equations, Cambridge Univ. Press, 1973. (Original in Japanese, 1965.) [13] L. Nirenberg, Remarks on strongly el1ip- tic partial differential equations, Comm. Pure Appl. Math., 8 (1955), 649-675. [14] L. Girding, Dirichlet's problem for linear elliptic partial differential equations, Math. Scand., 1 (1953), 55-72. [15] K. O. Friedrichs, On the differentiability of solutions of linear el1iptic equations, Comm. Pure Appl. Math., 6 (1953), 299-325. [16] S. Agmon, A. Doug!is, and L. Nirenberg, Estimates near the boundary for sOlutions of elliptic partial differential equations satisfying general boundary conditions I, II, Comm. Pure App1. Math., 12 (1959), 623-727; 17 (1964),35-92. [17] M. Schechter, General boundary value problems for el1iptic equations, Comm. Pure App1. Math., 12 (1959), 457-486. [18] 1. L. Lions and E. Magenes, Problemes aux !imites non homogenes et applications I-III, Dunod, 1968. [19] F. John, Plane waves and spherical means applied to partial differential equations, Interscience, 1955. [20] N. Aronszajn, A unique continuation theorem for solutions of elliptic equations or inequalities of second order, J. Math. Pure App1., 36 (1957),235-249. [21] A. P. Calderon, Uniqueness in the Cauchy problem for partial differential equa- tions, Amer. 1. Math., 80 (1958),16-36. [22] B. R. Vain berg and V. V. Grushin, Uni- formly nonelliptic problems I, II, Math. USSR-Sb., 1 (1967),543-568; 2 (1967), 111- 133. (Original in Russian, 1967.) [23] I. N. Vekua, System von Differential- gleichungen erster Ordnung vom el1iptischen Typus und Randwertaufgaben, Berlin, 1956. [24] J. Moser, On Harnack's theorem for elliptic differential equations, Comm. Pure App1. Math., 14 (1961),577-591. 1212 [25] H. Weyl, Das asymptotische Verteilungs- gesetze der Eigenverte linearer partieller Dif- ferentialgleichungen, Math. Ann., 71 (1912), 441-479. [26] R. Courant and D. Hilbert, Methods of mathematical physics I, II, Interscience, 1962. [27] S. Agmon, Lectures on el1iptic boundary value problems, Van Nostrand, 1965. [28] O. A. Oleinik and E. V. Radkevich, Second-order equations with non-negative characteristic form, Plenum, 1973. I Original in Russian, 1971.) [29] M. I. Vishik and V. V. Grushin, Bound- ary value problems for el1iptic equations de- generate on the boundary of a domain, Math. USSR-Sb., 9 (1969), 423-454. (Original in Russian, 1969.) 324 (XIII.22) Partial Differential Equations of First Order A. Quasilinear Partial Differential Eqnations and Their Characteristic Cnrves Suppose that we are given a tquasilinear partial differential equation n OU ; P;(x,u) OX; =Q(x,u), x=(xj,. .,x n ). (1) A curve defined by a solution xi=x;(t), u=u(t) of the system of ordinary differential equations dx. df= P;(x, u), i=l,...,n; du dt =Q(x,u) is cal1ed a characteristic curve of (1) ( 320 Partial Differential Equations; 322 Partial Differential Equations (Methods or Integra- tion)). A necessary and sufficient condition for u=u(x) to be a solution of (1) is that the char- acteristic curve passing through any point on the hypersurface u = u(x) (in the (n + 1)- dimensional xu-space) always be contained in this hypersurface. For example, the' character- istic curve of I:71 xiou/ox; = ku is .x; =x?e', u = UO e k ' (a solution of xi = Xi' U' =" u). There- fore the solution u = u(x) is a function such that u(h 1 ,..., h n )=J. k U(X 1 , ..., x n ) (J. = e' >0), i.e., a homogeneous function of degree k. B. Nonlinear Partial Differential E.qnations and Their Characteristic Strips We denote the value of ou/ox; by Pi and define the snrface element (or hypersnrfac,e element) 
1213 by the (2n + I)-dimensional vector (x, u,p)= (x I' ... , x n , u, PI' '" , Pn)' Consider the partial differential equation F(x b .., ,Xn,U,PI' .., ,Pn)=O, Pi = OU/ox i . A set (x (t), u(t), p(t)) of surface elements de- pending on a parameter t and satisfying the system of ordinary differential equations dx. df=F pi , du n dpi - d = L PiFpi' - d = -(Fx,+Pi F .) t j=1 t is called a characteristic strip of equation (2), and the curve x = x(t), u = u(t) is called a char- acteristic curve of (2). For quasilinear equa- tions, this definition of characteristic curve coincides with the one mentioned in Section A. Furthermore, an r-dimensional tdifferentiable manifold consisting of surface elements satisfy- ing the relation n du- L pjdXi=O i=l is called an r-dimensional union of surface elements. A solution of the partial differential equation (2) is, in general, formed by the set of all characteristic strips possessing, as ini- tial values, surface elements belonging to an (n - I)-dimensional union of surface elements satisfying F(x, u,p)=O. An example of a nonlinear partial differen- tial equation of first order is pq - z = 0 (where XI =x, X2 = y, U=Z,PI = P,P2 =q). The equa- tions of the characteristic strip are x' = q, y' = P, z' = 2pq, p' = P, q' = q, and therefore the char- acteristic strip is given by y = Yo + (Po/qo)x, z=zo+2Pox+(p/qo)X2, P= Po +(Po/qo)x, q = qo + x (if we take x as an independent vari- able and impose an initial condition y= Yo' Z=Zo, P= Po, q=qo at x=O). Putting Zo = W(Yo)(an arbitrary function) for x=O, we have, furthermore, y = Yo,+ W(yo)/(W '(YO»2, z = W(Yo)+ 2W(yo)/W'(yo)+ W(Yo)x 2 /(W'(Yo»2. The elimination of Yo from these expressions yields a general solution z = z(x, y). In this case, a tcomplete solution is 4az=(x+ay+b)2 (where a, b are constants), and a tsingular solution is z=O. C. Complete Systems of Linear Partial Differential Equations Forfunctions Pi(X) (i= 1, ...,X=(XI, ...,x n » of class Coo, define a differential operator X by n a X = L p.(x)-. .=1 ax. We call k differential operators X i = I:=I P:(x)%x. (i = 1, ..., k) mutually inde- 324D PDEs of First Order pendent when the rank of the matrix (P:) is equal to k. If a system of k independent linear partial differential equations involving one unknown function f(x), (2) Xd=O,...,XJ=O, (3) has the maximum number n - k of independent tintegrals, then the system (3) is called a com- plete system. A necessary and sufficient condi- tion for the system (3) to be a complete system is that there exist k 3 functions A.ij(x) of class Coo such that k [Xj,XJ=XjXi-XjXj= L A.iix)X., \1=1 that is, n ( OPi oPj ) k L P-P = L A.ijP;. h=1 oX h oX h .=1 Here, [X j , XJ is a differential operator of first order, called the commutator of the differential operators Xi' X j or the Poisson bracket. D. Involutory Systems For two functions F(x,u,p), G(x,u,p) of x, u, P of class Coo, we define the Lagrange bracket [F, G] by n ( oF ( OG OG ) [F,G] = L - -+P.- .=1 OP. ax. au oG ( oF OF )) -- -+P- ap. ax. · au . If F, G do not contain u and are homogeneous linear forms with respect to p, then F and G are differential operators F = XI u and G = X 2 u with respect to u (for P. = au/ax.), and we see that [F, G] = [XI , X 2 ].. This bracket has the following properties: [F,G] = -[G,F], k oqJ [F, qJ(G 1 ,..., G k )] = i aG i [F, GJ, [[F, G],H] +[[G,H],F]+ [[H,F], G] aF aG aH = au [G,H] + au [H,F]+a;[F,G]. When F, G are functions of x and P only, we usually use the notation (F, G) and call it also the Poisson bracket. In this case, the right- hand side of the third relation vanishes. Consider k partial differential equations involving one unknown function u(x 1, .,. ,X n ), Fi(x,u,p)=O, i=I,...,k; au P.=. uX. (4) If a common solution u(x) of these equations exists, it is also a solution of [Fi' Fj] =0 (i,j = 
324 E PDEs of First Order 1,..., k). Therefore, from the equations thus obtained, we take independent equations and add them to the original system. If we then have more than n + 1 equations, the original system has no solution. Otherwise, we obtain a system Fj=O (j = 1, ..., r) for which the Fj are independent (i.e., the rank of (aFi/apv) is equal to r), and all [Fi' Fj] =0 can be derived from Fj=O. A system (4) such that [Fi' Fj] =0 for all i, j is called an involntive (or involutory) system. We always treat a system by extending it to an involutory system. When k = 1, we regard the equation itself as an involutory system. When the equations (4) are mutually inde- pendent, a necessary and sufficient condition for them to have in common a solution with n - k degrees offreedom (a solution that coin- cides with an arbitrary function on an ade- quate manifold of dimension n - k) is that the system (4) be a system of equations involving unknowns P and equivalent to an involutory system. An involutory system (4) can be extended to an involutory system consisting of n inde- pendent equations by adding n - k suitable equations Fk+ 1 (x, U, p) =ak+1' ..., Fn(x, U, p) = an' (4') That is, we can find successively f = FI (l = k + 1, '" , n) such that FI satisfies the system of equations [Fj,f] =0, i=I,...,I-I, which is a complete system of linear partial differential equations for f, and the F i (i = 1, ..., n) are mutually independent. Then, if we find that Pv as functions of(x, u, a) (a = (ak+h ..., an)) from (4) and (4'), the system of ttotal differential equations au -=Pv(x,u,a), v= 1, ...,n, ax v is tcompletely integrable, and we can find U as a solution containing essentially n - k + 1 parameters c, ak+!' ... ,an' that is, a complete solution of (4). Moreover, if we have an in- volutory system of n + 1 independent equa- tions F 1 =0,..., Fk =0, Fk+1 =ak+!' ..., Fn+! = a n +!, then we find a complete solution by eliminating PI' ..., Pn between the equations. This method of integrating an involutory system is called Jacobi's second method of integration. E. Relation to the Calcnlns of Variations Consider a partial differential equation of first order F(Xl' ...,Xn,U,Pb ...,Pn)=O.!fa solution u(x) of this equation is given as an implicit 1214 function by cp(x 1 ,... ,X n , u)=O, we have F ( X1' ..., X n , U, _ acp/ aX 1 , ..., _ aq)/aX n ) = 0. acp/au aw/au This gives formally a partial differential equa- tion with independent variables u, Xh ..., X n and a dependent variable cpo It does not con- tain cp explicitly. That is, this equation has the form F(x 1 , ..., Xn+1' ocp/OX l' ..., 0cp/OX nt rJ= 0. Then, by finding a partial derivati ve, say acp/ox n + h as a function of the remaining ones from the displayed equation, we get a partial differential equation of the form acp/at + H(t, x, acp/ax) =0, which is called the normal form of the partial differential equation of first order. Setting Pi = acp/axj, the equations of the characteristic curve of this equation are dx. -i=Hp.(t,x,p), dpi - d = -Hx (t,x. p), t ' i= 1, ...,n, which are called Hamilton's differential equations. Now, consider the tEuler-Lagrange dif- ferential equations dFx;!dt -Fx,=O, i= 1,..., n, for the integral i t, J= F(t,x,x')dt, to dx X'=7it. Under the assumption that det(Fx'x);fO, we put F x ,= Pi' and solve these relation ith respect to x; in the form, say, X;='Pi(t,X,P). Furthermore, if we put ( f XFx;-F ) =H(t,x,p), v=l X'=cp(t,x,p) then the Euler-Lagrange equations are equiva- lent to Hamilton's differential equations dxi/dt = Hp" dpi/dt= -H x " i= 1, ...,n, since F(t, x, x') = I:i1 PiHpi - H. A curve represented by a solution of the Euler-Lagrange equations is called a station- ary curve. Now consider a family of stationary curves in a domain G of the (n+ I)-dimensional tx-space such that passing through every point of G there is one and only one curve in this family, and suppose that the family is ttrans- versal to an r-dimensional manifold 'l£ (r  n) (that is, F t5t - I: F x ,t5x i = ° for the differentials t5t, t5x i along 'l£; in particular, if'l£ consists of only one point (r = 0), a stationary curve pass- ing through this point is transversal to 'l£). In 
1215 this case, if we denote by V(t, x) the value of the integral J along the stationary curve from 'l£ to any point (t, x) of G, then the equation oVjot+ H(t, x, oVjox)=O holds. This equation is caned the Hamilton- Jacobi differential equation or the canonical or eikonal equation. Conversely, a solution of this equation is equal to the value of the integral J for a family of stationary curves transversal to an adequate 'l£. F. The Monge Differential Equation Consider the partial differential equation (2). By eliminating p and t between dx. =F dt Pi' du n -= L p;Fp, and F(x,u,p)=O dt i=1 . (for example, by eliminating p between dx;/dx I = Fp,lFp, and F=O when Fp, #0), we obtain M(x l , ... ,x n , U, OX2jOX 1 , ..., oxnjoxl)=O. This equation is caned the Monge differential equation, and the curve represented by its solu- tion is caned an integral curve of the equation. In the (n+ I)-dimensional tx-space, a curve that is an envelope of a I-parameter family of characteristic curves of the partial differential equation (2) is a solution of the Monge equa- tion. A characteristic curve is also an integral curve. When n=2, an integral curve that is not a characteristic curve is a Hine of regression of the surface generated by the family of charac- teristic curves tangent to the integral curve under consideration (which is an integral sur- face of F(x, u, p) = 0). If F is linear in Pi' i.e., quasilinear, an integral curves coincide with characteristic curves. References [1] R. Courant and D. Hilbert, Methods of mathematical physics II, Interscience, 1962. [2] I. G. Petrovskii, Lectures on partial dif- ferential equations, Interscience, 1954. (Orig- inal in Russian, 1953.) [3] E. Goursat, Cours d'analyse mathema- tique, Gauthier-Vi11ars, II, second edition, 1911; III, fourth edition, 1927. [4] F. John, Partial differential equations, Springer, 1971. [5J C. Caratheodory, Variationsrechnung und Partiene Differentialgleichungen Erste Ordnung, Teubner, 1935; English translation, Calculus of variations and partial differential equations of the first order, Holden-Day, 1967. 325 A PDEs of Hyperbolic Type 325 (XIII.25) Partial Differential Equations of Hyperbolic Type A. Second-Order Linear Hyperbolic Equations A t1inear partial differential equation in n+ 1 variables t, x =(x l , ..., x n ) of the second order, 02U n 02U n 02U L[u] =---z- L a Oi -- L aij- ot i1 otox; i.j=1 oxiox j OU n OU -a o -- L ai--au=O, ot ;=1 OXi (1) with coefficients a Oi , .'., a that are functions in (t, x) is said to be hyperbolic (or of hyperbolic type) (with respect to the t-direction) in tx- space if the characteristic equation of equation (1) considered at each point of tx-space, n n H(t,X;A,)=A2- L aOi)- L aijij=O, i=l i,j=l (2) has two distinct real roots A = Al (t, x, ), A2(t, x,) for any n-tuple of real numbers  = (b ... , n) # (0, ... ,0). In particular, (1) is caned regularly hyperbolic if these two roots are separated uniformly, that is, lim IA,(t,X,)-A2(t,X,)I=c>0. ('.x).II=1 A typical example of hyperbolic equations is the wave equation 02U 02U 02U DU=af- oXI - ... - ox; =0. Equation (3) is also caned the equation of a vibrating string, the equation of a vibrating membrane, or the equation of sound propaga- tion according as n= 1, 2, or 3. Another exam- ple is (3) 02U 02U OU --c 2 --2-=0, ot 2 ox 2 ot which describes the propagation of electric current in a conducting wire with leakage and is caned the telegraph equation ( Appendix A, Table 15). A hyperplane A(t - t°)+ ... + n(xn - x) = 0 passing through a point pO=(tO,XO) in tx-space and having normal direction (A,) is caned a characteristic hyperplane of (1) at pO if the direction (A,) satisfies the characteristic equa- tion at pO: H(tO, xO; A,) =0. A hypersurface S: s(t, x) =0 in tx-space is a characteristic hyper- surface of (1) if at each point of S the tangent hyperplane of S is a characteristic hyperplane of (1), that is, H(t,x;s"sx)=O everywhere on S. According to the theory of first-order par- 
325B PDEs of Hyperbolic Type tial differential equations, a characteristic hypersurface of S is generated by so-called bicharacteristic curves, i.e., solution curves t = t(-r), x(-r) of a system of ordinary differential equations dt dT = H;., ..., dXn --H dT - "' dA -=-H dT " dn =-H dT Xn' ..., H(t,X;A,)=O. Now if (1) is hyperbolic, the set of all charac- teristic hyperplanes at pO: {A(t - to) + ... + n(xn - x) =0 I H(tO, x o ; A, )=O} has as its envelope a cone C(pO) with the vertex pO. Moreover, since the intersection of any hyperplane t = constant and the cone C(po) is an (n-l)- dimensional ellipsoid or two points for n= 1, a conical body D+(po) (D_(pO)) is determined, whose boundary consists of the part of C (po) with t  to (t  to) and the interior of the ellip- soid on the hyperplane t = constant. A tsmooth curve y in tx-space is called timelike if the tangent vector of y at each point p on y be- longs to D+(p) or D_(p). Consider the set of points that can be connected with the point pO by a time like curve. We call its closure an emission, and a subset 2t1+(po) (2t1_(po)) of the closure for which t  to (t  to) a forward (back- ward) emission. An emission is a conical body surrounded by characteristic hyperplanes in some neighborhood of the vertex pO. If the co- efficients of (1) are bounded functions, the emissions 2t1 :!:(po) are contained in a conical body K = {(t,X)1 Aax(t-tO)2  i (X i -X?)2} independently of the situation of pO, where A. max = max(,.x).IIr(IA1 (t, x, m IA2(t, x, )I). B. The Canchy Problem Important for the hyperbolic equation (1) is the tCauchy problem, i.e., the problem of finding a function u=u(t,x) that satisfies (1) in t > 0 and the initial conditions u(O, X) = uo(x), 8u/8t(0,x)=u 1 (x), (4) where the functions uo(x) and ur(x) are given on the initial hyperplane t = O. Suppose that (1) is regularly hyperbolic and the coefficients are bounded and sufficiently smooth (i.e., of class CV with v sufficiently large). Then for the Cauchy problem the fol- lowing theorem holds. Theorem (C): There exists a positive integer 1(= [n/2] + 3), depend- ing on the dimension n + 1 of the tx-space, such that if the functions uo(x) and ur(x) in (4) 1216 are of class C 1 , then there exists a unique solu- tion u = u(t, x) of class C 2 in the domain 0  t < 00, -00 <x i < 00 (1  in). Moreover, this correspondence {uo(x),uj(X)}--+U(I,X) is con- tinuous in the following sense: If a sequence of initial functions {UOk(x), U[k(x)} (k = 1,2,...) and their derivatives up to the lth order tend to 0 uniformly on every compact set in the hyperplane t = 0, then the sequence of corre- sponding solutions uk(t, x) also tends to 0 uniformly on every compact set in each hyper- plane t = constant. In other words the Cauchy problem for regularly hyperbolic equations is twell posed in the sense of Hadamard [2]. F or dependence of the solution on initial data, the following proposition is valid: The values of the solution u at a point pO = (to, xo) depend only on the initial data on a domain Go (domain of dependence) of the initial hyper- plane, which is determined as the intersection of the backward emission 2t1_(po) and the initial hyperplane. We have the following dual proposition: A change in the initial conditi'ons in a neighborhood of a point Qo of the initial hyperplane induces a change of values of the solution only in some neighborhood of the forward emission 2t1 +(Qo) (domain of inflnence). If the coefficients of the equation a.re bounded, the intersection of emissions 2t1 ct(po) and the hyperplane t = constant is always compact. It follows that the domain of dependence and the domain of influence are bounded. In some special cases, there exists a proper subdomain of Go such that the solution depends only on the initial data on the subdomain. For exam- ple, for the wave equation (3) with n = 3, the solution for the Cauchy problem a.t a point pO = (to, x o ) ( Section D) is determined, as can be seen from the solution forITIula (12), by the initial data in a neighborhood of the cone with vertex pO: (t - t o )2 = I:71 (Xi - X?)2, namely, in a neighborhood of the] ntersection of bicharacteristic curves (lines, in this case) passing through pO and the initial hyperplane. If the solution of the Cauchy problem has such a property, it is said that Hnygens's prin- ciple is valid, or that diffusion of waves does not occur. For the wave equation (3), Huy- gens's principle is valid only for odd n> 1. C. The Energy Ineqnality The energy conservation law for the wave equation (3), E(t) = troro ...f:ro ( G Y +i (::J )dxl...dx n = constant, 
1217 is generalized to the so-called energy inequal- ity for hyperbolic equations, which plays an essential role in deducing the well-posedness of the Cauchy problem and the properties of the domain of dependence of the solution. Let the coefficients of a hyperbolic equation (1) be bounded functions, and let G(.) be the inter- section of the conical body K = {(t, x)l A;ax(t- t 1 )2  I:7=dxt-xl)2} with the hyperplane t= . (. < t 1 ). The k( l)th-order energy integral of the solution u(t, x) of (1) on G(.) is defined by Ek)(U, G(.)) f I ol"lu 1 2 = L (.,x) dx. G«) "0+'" +"n",k ot"o... ox: n Then the following inequality holds: Ek)(U,G(.)):::;CE:)(u,G(tO)), to:::;.<t1, (5) where the constant C is independent of u. We call (5) the energy inequality (1. Schauder [5]). For the wave equation (3), the hypothesis 1= [n/2] + 3 in theorem (C) can be replaced by a weaker condition 1= [n/2] + 2, but if we take 1= [n/2] + 1, there is an example for which no global solution of class Cl exists. In general, even though the initial functions are of class C I , the solution in the Cauchy problem for hyperbolic equations may not be of class C1, while if the energy of the initial functions is bounded, the energy of the solution is also bounded. D. Representation Formulas for Solutions of the Cauchy Problem We consider solution formulas that represent solutions of the Cauchy problem explicitly as functionals of the initial functions. The prob- lem of solving (1) under the condition (4), or more generally, the problem of solving an equation L[u] = f(t, x) under (4), can be re- duced, by transforming the unknown function u and applying tDuhamel's method, to the Cauchy problem with initial conditions on the hyperplane t = .; u(.,x)=O, ou/ot(., x) = q>(x) for arbitrary.. We define a function Xq(s) of a real variable S by Xq(S) = Isl q /4(2ni)"-1 q! (n odd), = -sqloglsl/(2ni)"q! (n even), (6) where n i, the dimension of the x-space and q is a positive integer such that q + n is even. Then for a function q>(x) of class CV (with v sufficiently large) with compact support, the 325D PD Es of Hyperbolic Type following equality holds [7]: q>(X)= f oo L\n+q)f2q>(y)dy r Xi(x-y)w)dw, -00 Jlwl=l (7) where L\y is the tLaplacian with respect to the variables Y=(Yl' ...,Yn) and dw is the surface element of the unit sphere Iwl = 1 in x-space. Now, since the tprinciple of superposition is valid because (1) is linear, we can infer from formula (7) that the Cauchy problem (1) with initial condition (4') can be reduced to the one for initial conditions with parameters y,w: u(.,x)=O, ou/ot(.,x)=xi(x-y)w). (4") In fact, since Xq(s) is (q -I)-times differentiable by definition, the Cauchy problem (1) with initial condition (4") has a unique solution Rq(t, x; ., Y; w) for q chosen large enough so that theorem (C) can be applied to (1) with initial condition (4"). Moreover, Rq(t, x;" y;w) is a function of (t, x,., Y, w) of class C', and v increases with q. Now, let q>(x) be a function of class CV with sufficiently large v and with compact support. Then, by (7) and the defini- tion of Rq, the integral f oo L\n+q)f2q>(y)dy r Rq(t,x;.,y;w)dw (8) -00 JIW!=1 is a solution of the Cauchy problem (1) with initial condition (4'). Therefore, when Rq is found explicitly, (8) yields a solution formula of the Cauchy problem (1) with initial condi- tion (4') as a functional of the initial func- tions. Since in (8), the integral JI W I=1 Rqdw is not necessarily of class cn+ q as a function of ) A (n+q)f2 J R d . . I t (t, x;" Y , '-'y Iwl=1 q W IS III genera no a function in the ordinary sense. But we denote it by R(t, x;" y) formally, and understand that a linear operator u(t,x)= f R(t,x;.,y)q>(y)dy (9) (4') is defined by (8). The kernel R(t, x; ., y) in this sense is called a fundamental solution or Rie- mann function of the Cauchy problem. If we extend the function Jlwl=1 Rq(t, x;" y; w)dw de- fined for t . to t <" assigning it the value 0 there, then R(t, x;" y)=L\n+q)f2 JIWIl Rq(t, x; ., y; w)dw can be considered a tdistribution on (t, x;" y)-space, and the equality L(t, x, %t, %x)R(t, x;,, y) =L*(., y, 0/0., %y)R(t, x; ., y)=c5(t -.)c5(x - y) is valid, where L* is the tadjoint operator of Land 15 is tDirac's 15- function. In other words, R(t, x;" y) is a fun- damental solution of L in the sense of distri- bution theory. The fundamental solution R(t, x;,, y) can be analyzed using the asymptotic expansion with 
325E PDEs of Hyperbolic Type respect to the sequence offunctions {Xq(s)}, and we have the following important result: If the coefficients of (1) are of class Coo (resp. real analytic), then the fundamental solu- tion R(t, x; 1", y) is of class Coo (real analytic) in (t, x) except for points that are on bichar- acteristic curves of (1) passing through the point (1", y). In the language of the Cauchy problem, the smoothness of the solution u at a point p=(t,x) depends only on the smooth- ness of the initial conditions on a neighbor- hood of the intersection of the initial hyper- plane and all bicharacteristic curves pass- ing through p. This fact is called Huygens's principle in the wider sense. Behavior of the fundamental solution R(t, x; 1", y) near dis- continuous points has also been investi- gated [2]. For the wave equation (3), the fundamental solution can be constructed, and therefore we can write the solution formula explicitly. The solution formula for (3) with initial condition (4') for n)o 3 is u(t,x) 1 on-2 I ' =--- ( t 2 _1"2 ) (n-3)/21" Q( x 1" ) d1" (n-2)! ot n - 2 0 ' , Q(x,1")= r cp(x + 1"W)dw, W n J1W!=1 where W n = 2# jf'( nj2) is the surface area of the unit sphere of n-dimensional space. Solutions of the Cauchy problem (3) with initial condition (4) for n = 1, 2, and 3 are, respectively, ( ) uo(x+t)+uo(x-t) 1 f x+t U t,x = +- ud)d 2 2 x-' (10) (d' Alembert's solntion), u(t, XI, x 2 ) = I UO(I' 2)d1 d2 2not C, J t2_(X1-rJ2_(X2-2)2 + 1 I udJ, 2)d1 d2 (11) 2n c, J t2_(X1-rJ2_(X2-2)2 (Poisson's solntion), and U ( t x x x ) = f UO(I'2'3) d , l' 2, 3 4 O W, n t D, t +r UdI'2'3) dw (12) 4n In, t ' (Kirchhoff's solution), where C, is a disk in the l 2-plane with center (xJ, x 2 ) and radius t, D, is a sphere in the  1 2 3-space with center (X 1 ,X2,X 3 ) and radius t, and dw, is the surface element of Dr. 1218 E. Second-Order Nonlinear Hyperbolic Eqnations A second-order nonlinear differential equation 02 =A ( t,x,U' u ,, 02 U , 02 U _ ) , ot ot oX i otox i oxiox j 1,,;; i, j";; n, (13) is called hyperbolic in a neighborhood of a function U(t, x) if the linear equation of the form (1) obtained from (13) is hyperbolic, where aOi(t, x) and aiit, x) are determined by substituting u by U(t, x) in the partial de- rivatives of A with respect to 02uji'tox i and 02 u jox i ox j , respectively. If the functions A in (13) and U =xo(x)+tudx) determined by (4) are sufficiently smooth with respect to t, x, u, '" , 0 2 ujoxiox j , the Cauchy problem for (13) with initial condition (4) has a unique solution in some neighborhood of the initial hyperplane under the condition that equation (13) is hyperbolic in a neighborhood of U. In general, initial value problems for nonlinear equations have only local solutions. F. Higher-Order Hyperbolic Eqnations An Nth-order linear differential equation in n + 1 variables t, x =(x 1 , "', x n ) with constant coefficients L (  ' 0 0 ) u= I a. . (  ) 'o (  ) ' u ot x 'o+I.I';;N 0 ot ox =0, (14) where a=(a 1 ,..., an), lal =a 1 +... +a n , and ( 0 ) . 0 1 . 1 ax - aXl ...ox:'" is hyperbolic in the sense of Girding if the following two conditions are satisfied: (i) the partial derivative oNjot N appears in L; (ii) the real parts of the roots A =), 1 (), .., , AN() of the characteristic eqnation L()" i) = 0 3 re bounded functions of real variables  = (1" .,  n)' When L is a homogeneous equation of the Nth order, condition (ii) is equivalent to the following condition: (ii') AI (), ..., AN() are purel y imaginary for all real  = (1' ... , n) # (0, ... ,0). The principal part (consisting of the highest-order terms) of a hyperbolic equation is also hyperbolic. If (14) is hyperbolic, a theorem analogous to theorem (C) holds for the Cauchy problem for (14) with iaitial conditions okujotk(O, x) = Uk (x), O";;k";;n-l; (15) that is, the Cauchy problem for (14) with initial condition (15) is well posed in the sense of Hadamard. Conversely, if the Cauchy problem 
1219 for (14) with initial condition (15) is wen posed, then (14) must satisfy the hyperbolicity con- ditions (i) and (ii) in the sense of Girding. In other words, wen-posedness of the Cauchy problem is equivalent to hyperbolicity in the sense of L. Girding [14]. Girding's conditions for hyperbolicity cannot be generalized to the case of variable coefficients, since the influence of the lower-order terms in the equation is taken into account in the definition of hyper- bolicity. However, in the case of constant coefficients, an Nth-order homogeneous equa- tion remains hyperbolic for any addition of lower-order terms if and only if the character- istic equation has N distinct purely imaginary roots for any real  =(1'"'' n)#(O,..., 0). In this case the equation is caned hyperbolic in the strict sense. Thus a linear equation with variable coefficients L[u] = ° O N  + L a"o"(t, X) ( O O ) "0 ( 0 0 ) " u t "o+I"I';N t x iXo<N =0 (16) is caned hyperbolic in the sense of Petrovskii if the characteristic equation A N + L a"O"(t,x)A"O(i)"=O "o+I"I=N has N distinct purely imaginary roots (caned characteristic roots) AI (t, x, ), ... , A N( t, x, ) for each point p = (t, x) and each  #0. Moreover, if the characteristic roots AI' ... , AN are sepa- rated uniformly, i.e., the inequality li. I Aj(t, x,) - Ak(t, x, )I =c > ° (t.x).II-l.dk hOlds, (16) is said to be regularly hyperbolic. In the second-order case, this definition is equiva- lent to the previous one. Theorem (C) hOlds for the Cauchy problem for a regularly hyper- bOlic equation (16) with initial conditions (15). For the domain of dependence of the solution, a result analogous to the case of the second-order equation can be obtained using an energy inequality [9, 10]. If the coefficients are of class Coo, Huygens's principle in the wider sense is valid, that is, discontinuity of the solution is carried over only along bicharacter- istic curves. G. Systems of Hyperbolic Equations For systems of equations I L LiJUJ=O, lil, j=1 where the Lij are higher-order linear differen- tial operators of the form (16), several types of hyperbolicity are formulated in connection with the wen-posedness of the Cauchy prob- 325H PD Es of Hyperbolic Type lem. We take up two important types, hyper- bolicity in the sense of Petrovskii and sym- metric hyperbolicity due to Friedrichs. We can a system of linear differential equations I .. ( 0 ) "0 ( 0 ) " .L L ao"(t,x) ot _ 0 Uj=O, }=1 a:o+lalnj x 1 il, (17) a system of hyperbolic differential equations (in the sense of Petrovskii) if the determinant ( .. ( 0 ) "0 ( 0 ) " ) det L ao"(t,x) _ 0 _ 0 ' "o+I"I';nj t x (18) calculated formany using the matrix of dif- ferential operators in the system, is hyperbolic in the sense of Petrovskii as a single equation of N (= I:J1 nj)th order. Petrovskii showed that the Cauchy problem for a system that is hyperbolic in this sense is wen posed [1OJ. There were some imperfections in his argu- ment, which have been corrected by others ( s. Mizohata [12]). In the case of constant coefficients, the Cauchy problem for (17) is wen posed if and only if (18) is hyperbolic in the sense of Girding. K. O. Friedrichs, observing that the energy inequality played an essential role in Petrov- skifs research, studied symmetric hyper- bolic systems of equations, since for them the energy inequalities are valid most natu- rany. A system of first-order linear differential equations OU n OU Ao(t,x)-= L A;(t,x)-+Bu ot ;1 ox; (19) is caned symmetric hyperbolic (in the sense of Friedrichs) if the matrices A;(t, x) (O i  n) are symmetric and Ao(t, x) is positive definite. A typical example is provided by Maxwen's equations. For this system it has been shown that the Cauchy problem is wen posed and the domain of dependence of the solution is bounded [13]. H. Weakly Hyperbolic Operators We adopt the fonowing definition of hyper- bolicity: a linear differential operator of Nth order ON ( 0 ) "0 ( 0 ) " L=- o N + L a"o"(t,x) _ 0 _ 0 t "o+I"I.;N t x a:oN-l is caned hyperbolic if the Cauchy problem for L[u] =0 with initial condition (15) is wen posed in Hadamard's sense. A hyperbolic operator L is caned strongly hyperbolic if L remains hyperbolic for any addition of 10wer- order terms, and weakly hyperbolic otherwise. 
325 I PD Es of Hyperbolic Type A necessary condition for hyperbolicity is that all characteristic roots of LN(t, x, A, ) = 0 be real for any (t, x,) (P. D. Lax [15J, Mizo- hata [16J). In the case of constant coefficients, strongly hyperbolic operators have been char- acterized by K. Kasahara and M. Yamaguti (Mem. Coll. Sci. Kyoto Univ. (1960)). As for operators with variable coefficients, it is known that not only regularly hyperbolic operators but also some special classes of not regularly hyperbolic operators are strongly hyperbolic (V. Ya. Ivrii, Moscow Math. Soc., 1976). Let us consider the hyperbolicity of an operator L which is not regularly hyperbolic under the assumption that the multiplicities of the characteristic roots are constant and at most 2; namely, the characteristic polynomial LN(t,X).,)=AN+ I aaoa(t,X)AaOa ao+lal=N Cl:oN-l is decomposed in the following way: LN(t, x).,) s N-s =n().-.1.p,x,))2 n (.1.-A)t,x,m, j=l j=s+l lim I)'j(t, x,) - .1. k (t, x, )I = c >0. (t.x) 11I.ji"k Assume that the ))t, x, ), j = 1,2, ... ,N - s, are real. Then L is hyperbolic if and only if it satisfies E. E. Levi's condition, i.e., [ 1 ( a 2 L N aA j lit,x,)= L N - 1 +"2 aA 2 at + I a 2 L N aA j )] =0 a=la.1.aaaxa ,= forall (t,x,), j=I,2,...,s, where L N - 1 denotes the homogeneous part of (N -1)th order among lower-order terms of L (Mizohata and Y. Ohya [17J). Thus, for a weakly hyperbolic operator with vari- able coefficients, even the principal part is not necessarily hyperbolic. J. Chazarain [18] has studied weakly hyperbolic operators with characteristic roots of arbitrary constant multiplicity. O. A. Oleinik [19J studied the Cauchy problem with nonconstant multiplic- ities for second-order equations. For higher- order equations, if the multiplicity of charac- teristic roots at (t, x) is at most p, or more precisely, if there exist positive rational num- bers q and r (q? r) such that (:.1. r L,v(t, x, 0, ) # 0 and ( a ) a o ( . a ) a ( a ) PO ( a ) p - - - - LN(t,x,O,)=O aA /j at ax (v E R n \ 0) 1220 for a o + lal + qpo +rlPI <p, then it is necessary for the well posedness of the Cauchy problem that ( a ) a o ( a ) a ( a ) PO ( a ) p A A a). a at ax LN-s(t,x,O,)=O (v E R n \0) for a o +lal+qpo+rIPI+s(1 +q)<p are satis- fied, where L N - S (1 sN) are the homoge- neous lower-order terms of order N - s of L (Ivrii and V. M. Petko v [20J). On the other hand, the sufficien1 condition in the case of multiplicity 2 is given by some conditions related to the subprincipal symbol 1 ( a 2 LN n a 2 LN ) L N - 1 -- -+I-, 2 aAat a=laaaXa which corresponds to Levi's condition in the case of constant multiplicity, and to the tPoisson brackets (A. Menikoff, Amer. J. Math., 1976; Ohya, Ann. Scuola Norm. Sup. Pisa, 1977; L. Hormander, J. Anal. Math., 1977). The Cauchy problem for a weakly hyper- bolic system of equations is more complicated, because of the essential difficulty that the matrix structure ( I aYoa(t, X)Aaoa ) Clo+lal=nj associated with (17) is not clear in general ( references in [20]). I. Gevrey Classes Classically, the functions of tc1ass s (s? 1) of Gevrey ( 58 COO-Functions and Quasi- Analytic Functions G; 168 Function Spaces B (14) were introduced into the studies of the fundamental solution for the heat equation: Y\c(Rn)= {cp(x)ECOO(R n ) I for any compact subset K of R n and any multi-indices a,there exist constants C K and A such that SUPK l(a/ax)"cplx)1  CKAlallaj!'}. This class of functions was used efficiently in the studies of the Cauchy problem for tweakly hyperbolic partial differential equations: L ( t,x,, ) U(t,X)= j(t,x) at ax in [0, TJ x R n =Q, GJ U(O,X)=uj(X), j=O,l,...,]I,'-1. We assume that the multiplicities of the char- acteristic roots are constant, i.e., k LN(t, x, A,) = fl (). - Ai(t, x, m\ i=l 
1221 where Vi is constant for any (t, x, )EQ x R n , Ai(t, x,) is real and distinct, and I:=I Vi = N. Let max 1 ';i';k Vi = p. If we suppose that L(t, x, a/at, %x)- L(t, x, a/at, a/ox) is a (tpseudo) differential operator of order at most q, where _ p ( a a ) L= D a i t,x,-;;-,,, ,I ut uX is a (pseudo)differential operator with ai(t, x, a/at, a/ax) being strictly hyperbolic (pseudo) differential operators associated with L N , then for any s such that 1 s<p/q, the Cauchy problem is weU posed in l'!c(Q), provided that aU a.o.(t, x) of Land f(t, x) belong to l'\c(Q), and that {Uj(x) }O';j';N-I are given in 1'1c(Rn) (Ohya [21], J. Leray and Ohya [22]). This result was proved even for the case of arbitrary nonconstant multiplicity of characteristic roots by M. D. Bronshtein [23]. J. Lacunas for Hyperbolic Operators The theory of lacunas of fundamental solutions of hyperbolic operators, initiated by Petrovskii [24], has been developed further in a paper [25] by M. F. Atiyah, R. Bott, and L. Garding. Let L()=LN()+ M() be a hyperbolic polynomial with respect to the vector 8ER n -O, where LN() is the principal part of L; this means that LN(8)#0 and L( +(8)#0 when 11m t I is sufficiently large. Then L has a funda- mental solution E =E(L, 8, x) in the form E(L,8,x)=(2n)-n f L(_ic8)-leiX(-ic9)d, where c is sufficiently large and the integral is taken in the sense of tdistribution. The con- vex huU of the support of E, denoted by K = K(L,8)=K(A,8), is a cone depending only on the real part ReA of the complex hyper- surface A: L() = 0, and contained in the union of the origin and the half-space x. 8>0. Let A be the Hangent conoid of A at , transported to the origin, and define the wavefront surface W(A, 8) by the union of aU K(A, 8) for  #0. Then it can be shown that the singular sup- ports of E(L, 8) and all the E(Lt, 8) are con- tained in W(A, 8) and, moreover, that they are 10caUy holomorphic outside W. In [25], the Herglotz-Petrovskii-Leray formulas are gen- eralized to any nonstrict L(,). Thus we have VII E(L,lI, x) =constf..(x.)qIIL()-kW(), q>O, (20) Dfl E(Lt, 8, x) =const f (x.)qeL()-kw(), qO tx.oa* 325K PDEs of Hyperbolic Type when xEK(A,8)\ W(A, 8). Here, q=mk-IJ3I-n is the degree of homogeneity of the left-hand side, and w= I:j=l( -1}i-1jd1 A... A;R;A ... A dn' The integrands are closed hational (n-l)-forms on (n-l)-dimensional complex tprojective space and are integrated over cer- tain thomology classes oc* = oc(A, 8, x)* and t x ' aoc*. These formulas provide means of obtain- ing topological criteria for lacunas. Let fi> c It be a maximal connected open set, where E(L, 8, x) is holomorphic. fi> is said to be a weak (strong) lacuna of L if E(L, 8, x) is the restriction of an entire function to fi>(E(L, 8, x) = ° in fi». In [25] it is shown that x belongs to a weak lacuna for aU E(Lt, 8,') if and only if aoc* = 0. The sufficiency directly foUows from (20') and the necessity foUows from a theo- rem of A. Grothendieck (Publ. Math. Inst. HES, 1966) which implies that the rational forms which appear in (20') span aU the tco- homology classes in question. K. Mixed Initial-Boundary Valoe Problems Let Q be a domain in R n with a sufficiently smooth boundary r, let L(t, x, ajat, a/ax) be a linear hyperbolic operator of Nth order de- fined in [0, 00) x 0= {(t, x) I tE [0,00), XEO}, and let B)t, x, a/at, a/ax),j= 1,2,..., b, be linear differential operators of Njth order defined in a neighborhood of [0, 00) x r. The problem of finding a function u(t, x) satisfying the conditions (20') L[u]=O in (O,oo)xQ, BJu]=O on (0,00) x r, j= 1,2, ...,b, aku/atk(O,x)=uk(x), OkN-l, (21) is caUed a mixed initial-boundary value prob- lem. A typical example of such a mixed prob- lem is provided by the case L = 0 (n = 2) and B[u] = u(t, x), which describes the vibration of membranes with a fixed boundary. The mixed problem (21) is said to be well posed iffor any initial data Uk(X)E C""(O), OkN -1, which are compatible with the boundary conditions, there exists a unique solution U(t,X)EC""([O, 00) x 0). Mixed problems for second-order hyper- bolic equations are considered in [6]. In regard to mixed problems for hyperbolic equations of higher order, we make the foUowing four assumptions: (i) Q=R",. = {(x',xn)1 x' ERn-I, xn>O}; (ii) r={xlxn=O} is not characteristic for Lor B j ; (iii) L is regularly hyperbolic; (iv) NjN -1 and Nj#N k ifj#k. We denote the tprincipal parts of Land Bj by LN(t, x', x n , a/at, a/ax', a/ax n ) and Bjo(t, x', ajat, a/ax', a/ax n ), respectively. By the hyper- 
325 L PDEs of Hyperbolic Type bolicity of L, an equation in K LN(t,X',O).,',K)=O for Im.1.<O, 'ERn-l has J1 roots Kt with 1m Kt > 0, and N - J1 roots Kj- with 1m Kj- < 0, and the number J1 is inde- pendent of (t, x') and (.1., '). A necessary con- dition for the wen-posedness of the mixed problem (21) is that the number of boundary conditions coincide with this integer J1. The function R defined by R(t, x', A, 0 d [ Bjo(t, x',.1., ', K)K 1 - 1 J = et + dK c L (t,x',.1.,', K) j.l=1.2.. . where L + = rrf1 (K - Kt (t, x', A, 0) and C is a contour enclosing an Kt, is caned a Lopatinski determinant. We say that Land Bj satisfy the uniform Lopatinski condition if inf IR(t,x',A,OI=c>O. (t,x'),lml<O 1'I+IAI=l When the uniform Lopatinski condition is satisfied, the mixed problem (21) is wen posed, and (21) represents a phenomenon with a finite propagation speed, which is the same as that of the Cauchy problem for L[uJ =0 (T. Bala- ban [26J, H. O. Kreiss [27], and R. Sakamoto [28J). An analogous result holds in the case of a domain Q with a compact boundary r, provided that Land Bj satisfy the uniform Lopatinski condition at every point of r. In the treatment of mixed problems for L and Bj not satisfying the uniform Lopatin- ski condition, the wen-posed problems have been characterized for operators with con- stant coefficients when Q = R"+- (Sakamoto [29J). For general domains, however, the well- posedness of mixed problems depends not only on the properties of the Lopatinski deter- minant but also on the shape of the domain (M. Ikawa [30J). L. Asymptotic Solntions In order to explain some properties of phe- nomenon governed by hyperbolic equations, asymptotic solutions play an important role. Consider, for example, the aconstic problem Du=O in (O,oo)xQ, u=O on (0, (0) x r. Let w(t, x; k) be a function defined in (0, (0) x Q with parameter k?: 1 of the form N w(t, x; k) = eik(cp(x)-t) L vj(t, x)k- j , j=O 1222 where cP is a smooth function satisf)ing the eikonal eqnation IV cpl2 = 1. If vj,j = 0,1, ..., N, satisfy the transport eqnations aV. 2+2Vcp' Vvj+L\cpv j = -iDv j - 1 , V-I =0, at we have Dw=O(k- N ). Then w of (22) is an approximate solution of Du = ° for large k, and it represents a wave propagating in the direction Vcp. When supp w n (0, (0) x r =I cp, if w hits the boundary r transversany, we can construct N w+(t,x;k)=eik(CP+(X)-t) L vt(t,x)k- j jO such that IVcp+1 2 = 1 in Q, cp+ =cp, and aq. -->0 on r, a and vt satisfy the transport equations and vt = - v j on (0,00) x r, where v is the unit inner normal of r. Then w + w + is an approximate solution, and w+ represents a reflected wave propagating in the direction V cp + = Vcp - 2(Vcp' vivo These asymptotic solutions show that the high-frequency waves propagate approxi- mately according to the laws of tgeometric optics. If asymptotic solution (22) has a ('anstic, i.e., {{x+IVcp(x)IIER}lxEsuppw} has an envel- ope, w of the form (22) cannot be an asympto- tic solution near the caustic. The asymptotic behavior of high-frequency solutions near the caustic was first considered by G. B Airy (Trans. Cambridge Phi/os. Soc., 1838). Under the condition that the principal curvatures of the caustic are positive, w in the form (22) can be prolonged to a domain containing the caustic satisfying the asymptotic solution w(t, x; k) = eik(O(X)-t) { Ai( - k 2 / 3 p(x))go(t, x; k) + ik- 1 / 3 Ai'( - k 2 / 3 p(x))gdt, x; k)}, (23) where Ai is the Airy function A i(z)= f  ei(zt+t3/3)dt 2n -00 and gl(t, x; k) = L9lj(t, x)k 1/6-j j (22) (D. Ludwig [31J). Concerning the reflection of grazing rays by 
1223 strictly convex obstacles, the reflected wave can be constructed by the superposition of asymptotic solutions of the type (23). The methods of construction of asymptotic solutions of the forms (22) and (23) are also applicable to Maxwell equations or more general hyperbolic systems (R. K. Luneberg [32]; Ludwig and Granoff, J. Math. Anal. Appl., 1968; Guillemin and Sternberg, Amer. Math. Soc. Math. Surveys, 14 (1977»). M. Propagation of Singularities Let L be a hyperbolic operator with Coo coeffi- cients and consider the Cauchy problem L[u] =0 with initial condition (15). When the initial data have singularities, the solution also has singularities for t>O, which is a property of hyperbolic equations quite different from the properties of parabolic ones. It should be noted that the tpropagation of singularities cannot be derived from the Huygens prin- ciple in the wider sense, i.e., even for regu- larly hyperbolic operators of second order we cannot determine the location and the type of singularities of the solutions for ini- tial data with singularities directly from the singularities of the fundamental solution R(t, x; 7:, y). Suppose that the multiplicities of character- istic roots of L are constant. Assume that Uk' k = 0, 1, ... ,N -1, have, on either side of a suffi- ciently smooth (n-l)-dimensional manifold r, continuous derivatives of sufficiently high order to suffer jump discontinuities across r. Then the solution u has continuous partial derivatives of sufficiently high order every- where except on the characteristic surfaces of L issuing from r, and across these the partial derivatives of u have jump discontinuities (Courant and Hilbert [1]). For more general singularities of initial data, it is known that the twavefront propa- gates along the tbicharacteristic strips satisfy- ing <Pr - AAt,x, V<p) =0,1= 1,2, ..., N -s, that is, WFu(', t) is contained in {(x(t), (t))E T*(R n ) I (dxj/dt)(s) =(aAda)(s, x, ), (dj/dt)(s)= -(oAdoxj)(s, x, ), (x(OH(O))E Uk WF(u k )} (J. Chazarain [18]). The propagation of singularities is more complicated in mixed problems because of the reflections of singularities at the boundary. For the tacoustic problem, R. B. Melrose [33] showed the following: Suppose (!J = CO c {x Ilxl <R} for some R>O, and all the broken rays according to the geometric optics starting from oR=on{lxl<R} go out of OR in a fixed time. Then for initial data with singularities in 325 Ref. PDEs of Hyperbolic Type OR' the solution becomes smooth in OR for sufficiently large t. References [1] R. Courant and D. Hilbert, Methods of mathematical physics II, Interscience, 1962. [2] 1. Hadamard, Le probleme de Cauchy et les equations aux derivees partielles lineaires hyperboliques, Hermann, 1932. [3] J. Hadamard, Lecons sur la propaga- tion des ondes et les equations de l'hydro- dynamique, Hermann, 1903. [4] R. Courant and K. O. Friedrichs, Super- sonic flow and shock waves, Interscience, 1948. [5] J. Schauder, Das Anfangswertproblem einer quasilinearen hyperbolischen Differential- gleichung zweiter Ordnung, Fund. Math., 24 (1935),213-246. [6] M. Krzyzanski and J. Schauder, Quasi- lineare Differentialgleichungen zweiter Ord- nung vom hyperbolischen Typus, gemischte Randwertaufgaben, Studia Math., 6 (1936), 162-189. [7] F. John, Plane waves and spherical means applied to partial differential equations, Inter- science, 1955. [8] S. L. Sobolev, Applications of functional analysis in mathematical physics, Amer. Math. Soc., 1963. (Original in Russian, 1950.) [9] J. Leray, Hyperbolic differential equations, Lecture notes, Institute for Advanced Study, Princeton, 1952. [10] I. G. Petrovskii, Dber das Cauchysche Problem fur Systeme von partiellen Differen- tialgleichungen, Mat. Sb. (N.S.), 2 (44) (1937), 815-870. [11] I. G. Petrovskii, Lectures on partial dif- ferential equations, Interscience, 1954. (Orig- inal in Russian, 1953.) [12] S. Mizohata, The theory of partial dif- ferential equations, Cambridge, 1973. (Original in Japanese, 1965.) [13] K. O. Friedrichs, Symmetric hyperbolic linear differential equations, Comm. Pure App\. Math., 7 (1954), 345-392. [14] L. G:hding, Linear hyperbolic equations with constant coefficients, Acta Math., 85 (1951),1-62. [15] P. D. Lax, Asymptotic solutions ofoscil- latory initial value problems, Duke Math. J., 24 (1957), 627-646. [16] S. Mizohata, Some remarks on the Cauchy problem, 1. Math. Kyoto Univ., 1 (1961), 109-127. [17] S. Mizohata and Y. Ohya, Sur la con- dition de E. E. Levi concernant des equations hyperboliques, Pub\. Res. Inst. Math. Sci., 4 (1968), 511-526; Sur la condition d'hyper- 
326 A PDEs of Mixed Type bolicite pour les equations a caracteristiques multiples II, Japan. J. Math., 40 (1971),63- 104. [18J 1. Chazarain, Operateurs hyperboliques a caracteristiques de multiplicite constante, Ann. Inst. Fourier, 24 (1974),173-202; Propagation des singularites pour une classe d'operateurs a caracteristiques multiples et resolubilite locale, Ann. Inst. Fourier, 24 (1974), 203-223. [19J O. A. Oleinik, On the Cauchy problem for weakly hyperbolic equations, Comm. Pure Appl. Math., 23 (1970), 569-586. [20J V. Ya. Ivrii and V. M. Petkov, Necessary conditions for the correctness of the Cauchy problem for nonstrictly hyperbolic equations, Russian Math. Surveys, 29 (1974),1-70. (Orig- inal in Russian, 1974.) [21J Y. Ohya, Le probleme de Cauchy pour les equations hyperboliques a caracteristique multiple, 1. Math. Soc. Japan, 16 (1964), 268- 286. [22J 1. Leray and Y. Ohya, Systemes lineaires, hyperboliques non stricts, 2 ieme Co11oque sur l'Analyse Fonctionne11e, CBRM (1964), 105- 144. [23J M. D. Bronshtein, The parametrix of the Cauchy problem for hyperbolic operators with characteristics of variable multiplicity, Functional Anal. Appl., 10 (1976) 4, 83-84. (Original in Russian, 1976.) [24J I. G. Petrovskii, On the diffusion of waves and the lacunas for hyperbolic equations, Mat. Sb. (N.S.), 17 (59) (1945), 289-370. [25J M. F. Atiyah, R. Bott, and L. G:hding, Lacunas for hyperbolic differential operators with constant coefficients I, II, Acta Math., 124(1970),109-189; 134(1973), 145-206. [26J T. Balaban, On the mixed problem for a hyperbolic equation, Mem. Amer. Math. Soc., 112 (1971). [27J H. O. Kreiss, Initial boundary value problems for hyperbolic systems, Comm. Pure Appl. Math., 23 (1970), 277-298. [28J R. Sakamoto, Mixed problems for hyper- bolic equations I, II, 1. Math. Kyoto Univ., 10 (1970),349-373,403-417. [29J R. Sakamoto, 6"-we11 posedness for hyper- bolic mixed problems with constant coeffi- cients,1. Math. Kyoto Univ., 14 (1974), 93- 118. [30J M. Ikawa, On the mixed problems for the wave equation in an interior domain II, Osaka J. Math., 17 (1980),253-279. [31J D. Ludwig, Uniform asymptotic expan- sions at a caustic, Comm. Pure Appl. Math., 19 (1966), 215-250. [32J R. K. Luneberg, Mathematical theory of optics, Brown Univ. Press, 1944. [33] R. B. Melrose, Singularities and energy decay in acoustical scattering, Duke Math. 1., 45 (1979), 43-59. 1224 326 (XIII.27) Partial Differential Equations of Mixed Type A. General Remarks Let A [u(x)J =0 be a tquasi1inear second-order partial differential equation. The type (te11iptic, thyperbolic, or tparabolic) of the equation depends on the location of the poin t x. If the type varies as the point x moves, thl: equation is said to be of mixed type. An example is the equation ( 1- u2 ) a2cp _ 2uv a 2 cp + ( 1- V2 ) 82cp =0 c 2 ox 2 c 2 oxoy c 2 oy2 (1) of 2-dimensional stationary flow without rotation of a compressible fluid without vis- cosity, where cp is the velocity potential, u = ocpjox and v = ocpjoy are the velocity compo- nents, and c is the local speed of sound, which is a known function of the speed q =, (u 2 + V 2 )1/2 of the flow. Equation (1) is of e11iptic type if q < c, i.e., the flow is subsonic, and of hyperbolic type if q > c, i.e., the flow is supersonic. If there exist points where the flow is subsonic as we11 as points where it is supersonic, (1) IS of mixed type. The study of equations of mixed type has become important with the development of high-speed jet planes. B. Chaplygin's Differential Equation It is difficult to solve equation (1) directly since it is nonlinear. However, we can linearize it by taking q and e = arc tan (vju) as independent variables (the hodograph transformation). The linearized equation takes the form 02Z 02Z OX2 -K(X) oy2 =0, xK(x)O, (2) which is ca11ed Chaplygin's differential equa- tion. Equation (2) is hyperbolic for .( > 0 and e11iptic for x <0. The study of general equa- tions of mixed type, even when the} are linear, is much more difficult and less developed than the study of equations of nonmixed type. Almost a11 research so far has been on equa- tion (2) or slight modifications of it. C. Tricomi's Differential Equation The simplest equation of the form (2) is 02Z 02z --x-=O ox 2 oy2 ' (3) 
1225 which is called Tricomi's differential equation. F. G. Tricomi considered the following bound- ary value problem for (3): In Fig. 1, AC and BC are two tcharacteristic curves of (3) and (J is a Jordan curve connecting A and B. We seek a solution of(3) in the domain D bounded by AC, BC, and (J that takes given values on (J and on one of the two characteristic curves, say on AC. This boundary value problem is called the Tricomi problem. Tricomi proved the existence and uniqueness of the solution of his problem under some conditions on the shape of (J and the smoothness of the boundary values. After Tricomi, much research has been done on his and similar problems for equations ofform (2) [2]. We can also consider problems such as finding a solution of(3) (or of(2)) satisfying the initial conditions Z(O, Y)=Zl (y), (oz/ox)(O, Y)=Z2(Y) on the common boundary x = 0 of the e11iptic domain and the hyperbolic domain of the equation. This is ca11ed the singular initial value problem. S. Bergman [3] obtained an integral formula for the solution under the condition that Zl(Y) and Z2(Y) are real analytic. y A c x Fig. 1 D. Friedrichs's Theory For the study of equations of mixed type it would of course be most convenient if there existed a general theory of boundary value problems independent of the type of the equa- tion. However, constructing such a general theory is considered very difficult, because the twe11-posedness of boundary conditions as we11 as the analytic properties of solutions are quite different according to the type. The first contributor to the solution of this difficult problem was K. O. Friedrichs [4], who noticed that although the methods of solving the tCauchy problem and the tDirichlet problem are quite different, both methods utilize energy integrals in the proof of the uniqueness of solutions. Using this observation, he succeeded in constructing a unified theory that enables us to treat various types (including the mixed type) of linear equations in a single scheme- an admissible boundary value problem for a symmetric positive system of first-order linear 326 Ref. PD Es of Mixed Type partial differential equations. There is, how- ever, a difficulty in Friedrichs's theory since it does not give a unified procedure for reducing a given boundary value problem for a given equation to an admissible boundary value problem for a symmetric positive system of partial differential equations. The study of equations of mixed type that are of more general form than (2) by means of Friedrichs's theory is an open problem. E. Further Studies Work on equations of more general type than (2) or (3) has appeared (not all depending on Friedrichs's theory). For example, the fol- lowing equations are treated in [5,6,7], respectively: n 0 2 Z t OZ L -+--=0 ;=1 OX? It I ot ' ( 02 Y 02 ) n OX 2 +fYI Oy2 Z=O, 02Z 02Z G(y) OX 2 + Oy2 - K(y)z = h, where Z=(Zl' ""zn) and G(y) and K(y) are symmetric matrices. References [1] F. Tricomi, Su11e equazioni Iineari a11e derivate parZiali di 2° ordine di typo misto, Atti Acad. Naz. Lincei, (5) 14 (1923), 133-247. [2] L. Bers, Mathematical aspects of subsonic and transonic gas dynamics, Wiley, 1958. [3] S. Bergman, An initial value problem for a class of equations of mixed type, Bul!. Amer. Math. Soc., 55 (1949), 165-174. [4] K. O. Friedrichs, Symmetric positive linear differential equations, Comm. Pure App!. Math., 11 (1958), 333-418. [5] A. V. Bitsadze, On the problem ofEqua- tions of mixed type in multi-dimensional domains (in Russian), Dok\. Akad. Nauk SSSR, 110 (1956), 901-902. [6] V. I. Zhegalov, Boundary value problem for higher-order equations of mixed type (in Russian), Dok\. Akad. Nauk SSSR, 136 (1961), 274-276. [7] V. P. Didenko, Some systems of differen- tial equations of mixed type (in Russian), Dok\. Akad. Nauk SSSR, 144 (1962), 709-712. [8] A. V. Bitsadze, Equations of the mixed type, Pergamon, 1964. (Original in Russian, 1959.) [9] M. M. Smirnov, Equations of mixed type, Amer. Math. Soc. Trans!., 1978. (Original in Russian, 1970.) 
327 A PDEs of Parabolic Type 327 (XII1.26) Partial Differential Equations of Parabolic Type A. General Remarks Consider a second-order linear partial dif- ferential equation n l iJ2u n ou J i. I a,i x , t) OXjOX j + j bj(x, t) oX j OU +c(x,t)u--=f (I) ot for an unknown function u of (n + I) indepen- dent variables (x,t)=(X1' ..., xn,t), where aij= a jj . This equation is said to be parabolic (or of parabolic type) if and only if the quadratic form I: aijjj in  is positive definite at each point (x, t) of the region under consideration. x and t are sometimes called the variables of space and of time, respectively. The most widely studied of the parabolic equations is the eqnation of heat conduction (or the heat equation): OU L[u]=u--=O ot ' where I = I:i=, (J2 lax; is the Laplacian taken over the space variables. B. The Eqnation of Heat Conduction The I-dimensional case of the heat equation is 02U OU L[u]= (Jx 2 - ot =0 for the temperature u(x, t) in a rod, considered as a function of the distance x measured along the rod and the time t. Equation (3) was one of the first treated in the theory of partial differ- ential equations. Consider a finite rod with constant temperature 0 at its ends x = a and x = h. Thermodynamics suggests that the initial temperature cp(x) (cp(a) = cp(b) = 0) prescribed at t = 0 is sufficient to determine the distribution of heat u(x, t) in the rod at all later times t > O. On such physical grounds, we can expect that a solution to the following problem exists: Find a continuous function u(x, t) that satisfies equation (3) for a  x  b, t > 0, and the bound- ary conditions u(a, t) = u(b,t) = 0, !imu(x,t)=cp(x), axb. lja According to J. Fourier the answer to this problem is expressed in a series LI cnu n con- structed by superposition of the particular 1226 solutions un=sinjT. (x-a)exp( --Ant). Here the An are the roots ofsinjT. (b-a)=O, and the C n are chosen so that L1 cnun(' 0) = cp(x). In fact, if cp(x) is continuously differentiable, then L1 cnun(x, t) is the required solution. Thus we are led to the problem of expanding a given function cp(x' in a tFouriel series. The temperature distribution in an infinite rod is given by a continuous function u(x, t) that satisfies equation (3) for t > 0 and that, for t = 0, takes values given by cp(x), where !im u(x, t) = cp(x). lja (5) If cp(x) is bounded, then it can also be repre- sented by superposition of particular solutions e-(a-x)2/ 41 of (3) as !  f OO cp(ex)e-(a-x)'/4Idex, 2j;i -00 u(x,t)= t>O, cp(x), t = o. (6) (2) Partial differential equations of parabolic type are important because of thei r connec- tion with various phenomena in the physical world; they include not only equations that govern the flow of heat but also those that describe diffusion processes ( 115 Diffusion Processes). C. Partial Differential Equations of Parabolic Type in Two Variables (3) We are concerned mainly with the partial differential equation of parabolic ':ype in two variables: 02U 02U 02U a(x, t)+ 2b(x, t)-+c(x, t)-z uX oxot ot OU OU +d(x, t);-+e(x, t)-+ f(x, t)u=g, (7) uX ot with ac=b 2 . In the region where lal + Icl >0, equation (7) can be reduced to the: form a 2 w ow ow (Je +d'(,T)az+e'(,'r)a;+f'(,T)W=9' (7') by an appropriate change of varia bles  = U(x, t), T= V(x, t). If e' <0 in this region, we can assume without loss of generRlity that our equation takes the canonical form 02U OU OU a(x, t) ox 2 +b(x, t) ox +c(x,t)u-a,=g, (8) (4) with a> 0, from the outset. It has the single family of tcharacteristics t = constan t. (9) There are four typical problems to be posed with regard to equation (8). 
1227 The first consists of determining, in some neighborhood of a given curve C nowhere tangent to a characteristic, a solution u that possesses prescribed values of u and ou/on, or of a linear combination of u and ou/on, along C. For instance, the problem of finding a solution u(x, t) such that u(x o , t) = g(t) and ux(x o , t) = h(t) for given functions g(t) and h(t) is a problem of this type. Consider the equation oZu ou ---=0 ox 2 ot in the region a<t<b, xo"-;;x. According to E. Holmgren, a solution u(x, t) satisfying the conditions lim u(x, t) = g(t), lim ux(x, t) = h(t), X!Xo X!Xo where g'(t) is bounded and continuous, exists if and only if h(t)+  f '  d,=k(t) y1r .yt-, is of class Coo and satisfies I k(n)(t)I"-;; M (n!)z /rn for positive constants M and r. In the second type of problem we are re- quired to find in a region of the form CP1(t)"-;;X"-;;CPZ(t), t 1 <t<t z , a solution of (8) that takes prescribed values on part of the boundary of that region. Here we impose the hypothesis on the curves x = CP1 (t) and x= cpz(t) that they are nowhere tan- gent to a characteristic (9). M. Gevrey [3J showed that if such a solution does exist, the functions CP1 (t) and cpz(t) must satisfy the tH61- der condition with exponent r:t. > 1/2: ICPi(t+h)-CPi(t)I"-;;clhl", c=constant, (15) for sufficiently sma1l h. The problem of heat conduction mentioned in Section B corre- sponds to the particular case CP1 (t) = constant, CP2(t) = constant, for which condition (15) is automatica1ly fulfi1led. The third type of problem is to find in a region of the form a"-;;x,,-;;b, t>O, a sOlution of(8) that satisfies the conditions lim u(x, t) = cp(x); ,jO ou --hu=O for x=a, h=constant >0; ox ou ox +Hu=O for x=b, H=constant >0. (16) 327D PDEs of Parabolic Type This problem, posed for equation (10), is also a mathematical formulation of the problem of heat conduction in a rod [4]. If cp(x) is of class Cr, then the solution to this problem can be expanded as 00 u(x,t)= I cne-An'CPn(X), n=l (10) c n = r cp(x)CPn(x)dx, where CPn(x) is a normalized function (J cp;(x)dx = 1) that satisfies the boundary conditions (17) cp(a)-hcpn(a)=O, cp(b)+ HCPn(b)=O (18) (11) and the equation d 2 CPn(x) ---z-= -AnCPn(X). dx The fourth type of problem is to find a function u(x, t) that satisfies (8) for t> 0 and the initial condition lim'jo u(x, t) = cp(x). It corresponds to the problem of heat conduc- tion in an infinite rod. (19) (12) (13) D. Green's Formula (14) The tadjoint of the differential operator L[uJ in (3) is given by ozv ov M[v J = ox z +8(' (20) Integration by parts yields the identity It (ljJL[cpJ-cpM[ljJJ)dxdt = L cpljJdx+ L(IjJ : -cp  )dt, (21) (14') where G is the region bounded by the closed curve C, and the line integral on the right is evaluated in the counterclockwise direction over C. We ca1l (21) Green's formula for the partial differential equation of parabolic type (3). As in the case of partial differential equa- tions of elliptic type ( 323 Partial Differen- tial Equations of E1liptic Type), this formula is used to establish the uniqueness of the solu- tion of (3) and to derive an integral repre- sentation for it. For example, the uniqueness of the solution is established in the fo1lowing way: Let the curve AD and fiE in Fig. 1 be such that no characteristic meets either of them in more than one point. If u(x, t) is continuous in the closed region (ABED), vanishes on AD, fiE, and the segment AB , and satisfies equation (3) in (ABED) except on AB , then it vanishes identica1ly. Green's formula (21), applied to the region G=(ABQP) and the functions IjJ == 1, 
327 E PDEs of Parabolic Type rp == u 2 , yields 2 It G: r dxdt+ tQ u 2 dx=0. In a similar way, by extending Green's for- mula suitably, we are able to prove unique- ness theorems for the four problems stated in Section C for more general linear parabolic equations. ____________E A B Fig. 1 To obtain a representation for solutions we proceed as follows: Let u(x, t) be a solution of (3) and Let 1 U(fJ., fJ, X, t) = e-(.-X)2/4(P-t) (22) J 4n(fJ-t) be a particular solution. Applying Green's for- mula to the region (PABQMP) and the func- tions rp = u(x, t), IjJ = U (x o , to + h, x, t), where h is a positive number and M is a point with coordinates (x o ' to), we obtain f 2 h dx u(x,to)e-(Xo-X) /4 _ PQ j4;;h = 1 -- u(x,t)U(xo,to+h,x,t)dx PABQ + ( u U -u aU ) dt. ox ax Since the integral in the left-hand side of this equality approaches u(x o ' to) as htO, we can establish the basic representation formula u(x o , to) = 1 - u(x, t) U(Xo, to, x, t)dx PABQ ( au au ) + U--u- dt ax ax for solutions u of (3). Formula (23) shows that u(x o , to) is determined in terms of the partic- ular solution (22) if we know the values of u and au/ax on the part PABQ of the boundary of the region (ABED). The function (22) is called the fnndamental solntion of (3) because it plays the same role as the fundamental solution log r (r = ((IX - X)2 +(fJ - y)2)1/2) of Laplace's equation a 2 u/ax 2 + a 2 u/a y 2 = 0. Similarly, the following function E (called the Ganss kernel) is a fundamental solution of equation (2): E(IX, Ii; x, t) =(4n(fJ _ t)) -n/2 exp { _ IIX - X I2 } , 4(fJ - t) (24) 1228 where (x, IX)E R n x R n and t < fJ. For equation (3), the following maximum princilile holds: In the region (ABED) of Fig. 1, suppose that L[u] O and that u takes its maximum value K at an interior point M. Then u is identically equal to K on the segment QP and in the region (ABQP). More generally, various ver- sions of the maximum principle are known for equation (1) [5]. E. The Laplace Transform Method Let u(x, t) be a solution of (3) for t >0 and v(x,).)= IX) e-Atu(x,t)dt, ,1.>0, (25) be its tLaplace transform with respect to t. Utilizing integration by parts, we have f -At ( )d [ -At ( )] t=oo 000 e u, x, t t = e u x, t ,o +.1Ioo e-A'u(x,t)dt = - rp(x) + .1v(x, A), (26) provided that lim'_ooe-Atu(x,t)=O and lim,t o u(x, t)= rp(x). We find in view of(3): i} 2 v ax 2 = .1v(x,.1)-rp(x). (26') Once the solution of (26') has been found, the desired solution of (3) can be derived by invert- ing the Laplace transform (25). This idea can also be applied to the solution of parabolic equations with constant coefficients in (n + 1) variables, such as (1). F. General Second-Order Eqnations of Parabolic Type (23) Consider the equation (1) with f =,0, which can be written as au -=A(t)u, at (27) where A(t) is a second-order telliptic operator with parameter t. Let D be a regim (bounded or unbounded) of points x whose boundary is a smooth hypersurface S. We pose the fol- lowing initial boundary value problem for (27): Find a function u(x, t) that satisfies in D x (0, (0) equation (27) together with the conditions limu(x,t)=rp(x), xED, ,to au(x,t)/an+h(x,t)u(x,t)= f(X,t), XES, (16') where a/an is the directional derivative in the 
1229 outward tconormal direction at (x, t) (XES), and h(x,t)O. The Laplace transform is not suitable for solving problems (27) and (16'). Instead, the theory of I-parameter semigroups of linear operators ( 378 Semigroups of Operators and Evolution Equations) can be applied to establish similar fundamental results. Let m be a large positive integer. For t > 0, put t k = kb for k=O, 1,..., m-l with b=t/m By the La- place transform method as described above, . we can associate with I/J a unique solution v of A(tk)v=AV-I/J with A= l/b. We put RkI/J=AV. By iterating this procedure m times, we have a function um(x, t) = Rm-t Rm-2 ... Ro<p start- ing from the initial value <p at t = O. Then we obtain a solution u(x, t) of (27) and (16') as the limit of um(x, t) as m-+ 00. The following results are known [6,7]: (i) There exists a u= U(,"C,x,t) (x,ED,t>"CO) that, as afunc- tion of X and t, satisfies equation (27) and the homogeneous boundary conditions (16') with <p = 0, f = o. (ii) The function u(x, t) defined by u(x,t)= Iv <p()U(,O,x,t)d + I d"C Lf(,"C)( U(,"C,x,t) au(,"C,x't) ) dS (23') an  is a solution of(27) and satisfies (16'). Thus U(, "C, X, t) is a generalization of the function (24), called the fundamental solution of the linear parabolic equation (27) with boundary conditions (16'). Besides the properties (i) and (ii), the fundamental solution satisfies U(, "C, X, t)O, JD U(, "C,z, w)U(z, w, x, t)dz= U(,"C,x,t) ("C<w<t), and further JDU(,"C, x, t) dx = 1 under some additional assump- tions. Therefore this theory is of consider- able significance from the point of view of the theory of probability ( 115 Diffusion Processes). It can be shown that a weak solution of the parabolic equation (27) is a genuine solution. That is, if u(x, t) is locally summable and LX> Iv U(X,t) (a<p:,t) + A(t)*<p(x, t))dxdt=O for any function <p(x, t) of class C 2 in D x (0,00) and vanishing outside a compact subset of D (A(t)* is the adjoint of the partial differential operator A(t) and dx=dx1...dx"_1)' then u(x, t) satisfies (27) in D x (0, 00) in the usual sense. In particular, when the coefficients of A(t) are infinitely differentiable, any solu- tion u( x, t) of (27) in the distribution sense is a genuine solution ( 125 Distributions and H yperfunctions). 327G PD Es of Parabolic Type If the function h in the boundary conditions (16') and the coefficients of A are independent of t, then the fundamental solution U(, "C, X, t) depends only on, x, and t-"C and is written as U(, x; t-"C) (t>"C). Furthermore, if A is tself-adjoint, then there exist a sequence of teigenfunctions {l/Jp(x; A) Ip = 1,2,...} (AI/Jp+ AI/Jp=O) and a sequence {pp(A)} of measures on the real line for which the following hold: (1) The fundamental solution U(, x; t) is ex- pressed in the form U(,x;t)= lfoo e-J.'I/Jp(;A)I/Jp(X;A)dpp(A). (2) The solution u(x, t) of (27) satisfying (16') with f(x, t) == 0 is expanded as 00 f oo u(x,t)= PJ;l -00 e-J.'I/Jp(X;A)<Pp(A)dpP), where <pP)= Iv I/Jp(x;A)<p(x)dx (<p(x) is the function given in (16'). G. Nash's Results Let us consider a parabolic equation au " a { au } " t =.I " aij(x,t) ' u .,}=l uX j uX, (28) where aij= a ji are real-valued functions of class Coo and equal to constllIts outside a fixed compact set of R" for all t  to (this regularity assumption can be relaxed). Suppose that there exists a constant A 1 such that A -1112 ,,-;;I:ailx,t)ij"-;;AI12 for all (t,X,)E(to, 00) x R" x R". Then, for any bounded solution u(x, t) of (28) in (to, 00) x R" and for any (x, y, t, s) such that xER", YER", and to < t"-;;s, the inequality lu(x, t)- u(y, s)1 {(  ) . (  ) /i } ,,-;;AB  + , yt-t o t-t o (29) hOlds, where B=sup{lu(x, t)11 tto, xER"} and p = ex/(2ex + 2). In this inequality, the constants ex and A are positive, depending on (n, A) but independent of the particular choice of(a ij ), to and of u [9]. As a corollary to this theorem, J. Nash proved that, if the aij do not contain t and if v(x) is a bounded solution in R" of the elliptic equation obtained by replacing au/at by 0 in (28), the inequality IV(x)-v(y)1 ,,-;;A'B'lx- yl" (30) holds for any (x,Y)ER" x R", where a= ex/(ex + 1) with ex in (29) and B' = sup Iv(x)l. The constant 
327 H PDEs of Parabolic Type A' depends only on (n, A) ( 323 Partial Dif- ferential Equations of El!iptic Type L). H. Partial Differential Equations of p- Parabolic Type Let p and m be given positive integers. Let us consider an equation for an unknown function u of (n + 1) independent variables (x, t) of the type amu aa aju -+" a . ( x t ) --= j, atm L.. a.J ' a a a t j , a,J X where a=(aj, ..., a.) and aajax a =(ajax 1 )"' ... (ajax n )"". We write also lal = a 1 +... + a.. In (31), I:.. i is the summation taken over the (a,j) such that pj+lalpm and Oj<m. Let us denote by {Adx, t,) }=1 the roots A of the equation ),m + I' aajx, t)(i)" A j = 0, a,j where L.j is the summation over the (a,j)'s such that pj+ lal =pm and that Oj<m. We say that the equation (31) is p-parabolic (or of p-parabolic type) in the sense of I. G. Petrov- skii if and only if there exists a positive num- ber 8 such that ReAk(x, t, ) -bIW, I km, (33) for any (x, t) in the region under consideration and for any ER'. The integer p is then seen to be even. Equation (27) is p-parabolic if - A (t) is strongly el!iptic of order p. The heat equa- tion (2) is 2-parabolic in this sense. Similarly, we can define the p-parabolic systems of equa- tions [10]. p-parabolic equations are known to be thypoe11iptic if the coefficients are of class Coo [11]. S. D. Eidel'man obtained precise estimates of the fundamental solutions and of their derivatives for p-parabolic equa- tions [10]. The mixed initial boundary value problems are investigated in detail also by Eidel'man [10] and by R. Arima (J. Math. Kyoto Univ., 4 (1964)). References [1] L. Bers, S. Bochner, and F. John, Contri- butions to the theory of partial differential equations, Ann. Math. Studies, Princeton Univ. Press, 1954. [2] A. Friedman, Partial differential equations of parabolic type, Prentice-Hal!, 1964. [3] M. Gevrey, Les equations aux derivees partiel!es du type paraboliques, J. Math. Pures Appl., 9 (1913),305-471; 10 (1914),105-147. 1230 (31 ) [4] R. Courant and D. Hilbert, Methods of mathematical physics, Interscience, I, 1953, II, 1962. [5] M. H. Protter and H. F. Weinberger, Max- imum principles in differential equations, Prentice-Hal!, 1967. [6] S. Ito, Fundamental solutions of parabolic differential equations and boundary value problems, Japan, 1. Math., 27 (1957),55-102. [7] A. M. I1'in, A. S. Kalashnikov, and O. A. Oleinik, Linear equations of the second order of parabolic type, Russian Math. Surveys, 17-3 (1962),1-143. (Original in Russian, 1962.) [8] 1. L. Lions, Equations differentiel!es operationnel!es et problemes aux limites, Springer, 1961. [9] 1. Nash, Continuity of solutions of para- bolic and e11iptic equations, Amer. J. Math., 80 (1958),931-954. [10] S. D. Eidel'man, Parabolic systems, North-Hol!and, 1969. (Original in Russian, 1963.) [11] S. Mizohata, Hypoel!ipticite des equa- tions paraboliques, Bull. Soc. Math. France, 85 (1957),15-50. (32) 328 (V.6) Partitions of Numbers A partition of a positive integer n i, an ex- pression of n as the sum of positive integers. The number of partitions of n, where the order of the summands is ignored and repetition is permitted, is denoted by p(n) and is cal!ed the number of partitions of n. For example p(5) = 7 n5=4+1=3+2=3+1+1=2+2+1= 2 + 1 + 1 + 1 = 1 + 1 + 1 + 1 + 1. Therefore, p(n) equals the number of tconjugate classes of the tsymmetric group of order n and is closely related to the trepresentation theory of this group. The tgenerating function of p(n) is f(x) = 1 + n1 p(n)x'= (l (1_X')f1 The unit circle Ixl = 1 is the tnatural boundary of f(x), which is holomorphic in Ixl < 1. The Dedekind eta function, which is closely related to f(x), is defined by the fol!owing formula for the complex variable, taking values in the upper half-plane: 00 1](') = exp(niTj12) II (1-exp(2nim)). n=l Hence 1](, + 1) = exp(nijI2)1](,). L. Euler (1748) obtained the fol!owing formula (cal!ed the pentagonal number theorem because n(3n - 1 )j2 
1231 is a tpentagonal number): 00 IT (I-x n ) n=1 00 = 1 + L (_l)n(x n (3n-1)/2 + x n (3n+I)/2). n=1 This fo11ows easily from the Jacobi-Biehler equality 00 IT {(1-q2n)(1 +q2n-1 z)(l +q2n-1 z-l)} n=1 00 =1+ L qn 2 (zn+ z -n) (iql<l, z#O). n=1 By using the ttransformation formula for 9- functions, we can infer from the pentagonal number theorem that 1'/( -1/1:)= 1'/(1:). Hence, if a, b, c, d are integers satisfying ad- bc= 1 and c>O, then ( a1:+b )  1:+d 1'/ - =e -----:-1'/(1:), c1:+d I where e is a 24th root of unity. It is known that 1'/(1:) is a tcusp form of weight -1/2 with respect to the fu11 tmodular group r(1) [9, 11]. C. L. Siegel (1955) gave a simple proof of the formula 1'/( -1/1:)= 1/(1:). Later S. Iseki (1957) gave another proof by using a new method, known as the !1.- fJ formula [12]. The size of pen) increases rapidly with n; for instance p(1 0) = 42 and p( 1 (0) = 190,569,292. By making use of a remarkable identity, G. H. Hardy and S. Ramanujan (1918) proved the following inequalities, where A and Bare suitable constants: (Aln)e 2Jn < pen) < (Bln)e 2j2 J"n. Subsequently they obtained pen) - (1/4)3 n)exp(n J2nI3) . After, P. Erdos (1942) and D. J. Newman (1951), A. G. Postnikov [8J succeeded in proving _ exp(nJ2;;3) ( ( 10gn )) pen) ;:; 1 +0  4y 3 n n by means of an elementary function-theoretic method. Multiplying both sides of Euler's formula by the generating function of pen) and comparing the coefficients, we obtain L (-l)H p (n-w k )=O, Orok:::;n where w k = k(3k-1)/2 (k =0, :t 1, :t2, ...) is a pentagonal number. From this formula we can calculate pen) successively; in fact P. A. MacMahon obtained in this way the values of pen) for n up to 200. Hardy and Ramanujan proved the following transformation formula for the generating 328 Partitions of Numbers function f(x) of pen): f( exp C:ih _ 2Z) ) =  k-fiexp ( - ) . 12kz 12k ( ( 2nih' 2n )) xf exp ---y:-- kz ' (h,k)= 1, hh'= -l(modk), where ,k is defined by .k =exp(nis(h, k)) and the value of s(h, k) was given by Radema- cher in the form k-j m (( hm )) s(h,k)= L - - . m=1 k k Here the symbol «t)) in the sum denotes the function that is 0 for integral t and t - [tJ - 1/2 otherwise ([ J is the tGauss symbol). With regard to the Dedekind sum s(h,k), we have the reciprocity law for Dedekind sums: 1 1 ( h k 1 ) S ( h k ) +s ( k h ) = --+- -+-+- . , , 4 12 k h hk If we make substitutions a=h', b=(hh' + l)(k, c=k, and d= -h in the Hardy-Ramanujan transformation formula, then the e appearing in the transformation formula of 1'/(1:) is seen to be equal to exp( -nis(c, d) + ni(a + d)/12c). A direct proof of the transformation formula and the reciprocity law was given by K. Iseki (1952). According to Cauchy's integral formula, pen) can be represented as an integral: _ 2 1 . r f dx, mJr x where the contour r is taken inside the unit circle around the origin. The generating func- tion f(x) varies greatly: namely, letting r-+ 1 - 0 in x=rexp(2niplq), where p and q are fixed integers, it fo11ows from the transformation formula that f(x) - exp(n 2 /6q2(1 - r)). Never- theless, we can deal with the integral by the tcircle method, introduced by Hardy and Ramanujan, which threw light on recent addi- tive number theory. Hardy and Ramanujan thus obtained 1 d ( eXP(CA n ) ) r.: p(n)= 2nj2dn An +O(exp(Dyn)), where An= Jn-1/24 , C=nJ2j3, D>C/2. The theory was improved by Rademacher (1937, 1943), who expanded pen) into the 
328 Ref. Partitions of Numbers series _ 1  1/2 d ( sinh(CAn/k) ) p(n)- - h L.., Ak(n)k - d 1 ' ny 2 kl n An where Ak(n) = L .k exp( - 2nihn/k). h(modk).(h,k)=1 Rademacher (1954) proved further that ( n ) 3/2 p(n)=2n 12 x f Adn)k-Sf2L3/2 (( - l n ) 2(24n_1) ) , k=1 2k where (1) Zn Lv(z)= L , . n'=on.r(v+n+ 1) Rademacher (1943) had developed an inge- nious proof by taking "Ford's circle" as the contour r. Ramanujan observed that p(5m+4)=0 (mod 5), p(7m+5)=0 (mod 7), and p(l1m+ 6)=0 (mod 11). Rademacher (1942) and New- man studied these cases by using I1(T). More generally, A. O. L. Atkin proved that if 24n = 1 (mod5 a 7 b l F) then p(n)=O (mod5"71+[ b/2 1I Ie) (Glasgow Math. J., 8 (1967)). At present, this is the best result. Let n=n 1 +n 2 +... +ns be a partition of n. Many problems arise when we put additional conditions on the n j . For instance, we may require that the nj satisfy certain congruence relations (L. K. Hua (1942), S. Iseki (1959)) or are powers of integers (E. M. 1. Wright (1934), L. Schoenfeld (1944), S. Iseki (1959)) or are powers of primes (T. Mitsui (1957)). The par- tition problem can also be extended to the case of an algebraic number field of finite degree (Rademacher, G. Meinardus (1953), Mitsui (1978)). References [IJ R. G. Ayoub, An introduction to the ana- lytic theory of numbers, Amer. Math. Soc. Math. Surveys, 1963. [2J G. H. Hardy and S. Ramanujan, Asymp- totic formulae in combinatory analysis, Proc. London Math. Soc., (2) 17 (1918), 75-115. [3J G. H. Hardy and E. M. Wright, An intro- duction to the theory of numbers, Clarendon Press, fourth edition, 1965. [4J K. Iseki, A proof of a transformation for- mula in the theory of partitions, 1. Math. Soc. Japan, 4 (1952),14-26. [5] H. A. Rademacher, Lectures on analytic number theory, Tata Inst. Fund. Res., 1954- 1955. 1232 [6J S. Ramanujan, Collected papers of Srini- vasa Ramanujan, Cambridge Univ. Press, 1927 (Chelsea, 1962). [7J H. Peters son, Ober Modulfunktionen und Partitionenprobleme, Abh. Deutsch. Akad. Wiss., Berlin K1. Math. Allg. Nat., ] 954, no. 2. [8J A. G. Postnikov, Introduction 10 the ana- lytical theory of numbers (in Russian), Mos- cow, 1971. [9J M. I. Knopp, Modular functions in analy- tic number theory, Markham, 1970. [IOJ T. Mitsui, On the partition pwblem in an algebraic number field, Tokyo 1. Math., 1 ( 1978). [11J H. A. Rademacher, Topics in ;malytic number theory, Springer, 1973. [12J S. Iseki, The transformation formula for the Dedekind modular function and related functional equations, Duke Math. .r., 24 (1957), 653-662. 329 (XXI.38) Pascal, Blaise Blaise Pascal (June 19, 1623-August 19, 1662) was born in Clermont-Ferrand in southern France. He lost his mother when still an infant and was brought up by his father, Etienne Pascal (discoverer of the curve caned Pascal's t1ima90n). As a youth, he demonstrated a remarkable ability for mathematic,. In 1640, under the influence of Desargues, he dis- covered tPascal's theorem on conic sections, and in 1642 invented an adding machine. After hearing of Toricelli's experiments in 1646, he became interested in the theory ofl1uids and on his own began to conduct experiments; this research put to rest the prevailing opinion that nature abhors a vacuum and that, therefore, a vacuum cannot exist. Pascal formulated the principle stating that pressure, when applied at any point within a contained liquid, is trans- mitted throughout the fluid. By means of this principle, he explained various phenomena concerning fluids such as the atmosphere and laid the foundations for hydrostatics. Between 1652 and 1654, Pascal was pre- occupied with social affairs, but subsequently he began to devote himself to religion. He entered the Abbey Port-Royal of the Jansenist sect, where he remained until his death. Im- mediately before his entry, however, he and Fermat exchanged correspondence about games of chance, and these letters proved to be the beginning of the theory of tprobabi1ity. Concerning games of chance, Pascal had con- ducted research on tPascal's triangle, and in this study he formulated and used tmathemat- 
1233 ical induction. He also indicated a way to obtain the sum of the mth powers of the con- secutive terms of an arithmetic progression, and with an intuitive idea of limits obtained the formula Jg x m dx = am+! /(m + 1). While in Port-Royal, he published Lettres provinciales (1657), in which he carried on a dispute with the Jesuits. His book Pensees shows his deep involvement with religion; however, he did not abandon mathematics. In 1658, he determined the area enclosed by a tcycloid and its base, the barycenter and area of the figure enclosed by a cycloid and straight lines parallel to its base, and the volume of the figure obtained by rotating it around these lines. The study of the methods used by Pascal to obtain these re- sults, which were forerunners of differential and integral calculus, led tLeibniz to discover the fundamental theorem of calculus. Pascal also formulated clear ideas about axioms. References [1] L. Brunschvicg (ed.), B. Pascal, Oeuvres I-XIV, Hachette, 1904-1914. [2] J. Chevalier (ed.), B. Pascal, Oeuvres com- pletes, Bibliotheque de Pleiades, Gallimard, 1954. [3] Kokiti Hara, L'Oeuvre mathematique de Pascal, Mem. de la Faculte des Lettres de l'Univ. d'Osaka, no. 21, 1981. 330 (11.8) Permutations and Combinations Let there be given a set 0 of n elements. If we choose k distinct elements of 0 and ar- range them in a row, we have a k-array or k- permutation of elements of O. The number of such arrays is (n)k= n(n-l)... (n - k+ 1). The polynomial (X)k= x(x -1)... (x -k + 1) in x of degree k is called the Jordan factorial of degree k. In particular, (n)o=n!, n factorial, is the number of permutations of O. A subset of o is called a k-subset if it contains exactly k elements. The number of k-subsets (or k- combinations) of 0 is G) = (n)k/k!. The binomial coefficients (:) are defined by the generating function (1 + z)X = Lo (:) zO. For any complex number x, the series is conver- 330 Ref. Permutations and Combinations gent for Izl < 1, and it is verified that (:)= (x)o/n! in terms of the Jordan factorial (x)o' The same results hold in a tcomplete field with tvaluation, in particular in a tp-adic num- ber field. In any case, we have the recursive relation (:)=(x:l)+(:=D (nI), G)=I, and in general f ( x ) ( y ) _ ( x+ y ) k=O k n - k - n ' which leads to many identities involving binomial coefficients. The recursive relation allows us to compute the values of (:) easily for small integers n, k, as was noticed by Pascal. The arrangement of these values in a triangular form: 1 1 1 1 2 1 1 3 3 1 14641 is ca1!ed Pascal's triangle. For integral values x, (1 + zt are polynomials, and we have (a + b)" = LZ=o G)aO-kb k (binomial theorem). As a generalization, we have n! (a1+...+am)o=I , af'...a!:.m PI! '''Pm' (multinomial theorem), where the sum is ex- tended over all nonnegative Pi with LP/=n. The number of ways of choosing k elements, allowing repetition, from a set of n elements is ( -n ) ( n+k-l ) ( - l)k k = k . This is also the number of nonnegative integral solutions of Lf=1 XI = k. As an example of binomial coeffi- cients with noninteger arguments, we have (- /2)=( -1)02-20Cnn). References [1] E. Netto, Lehrbuch der Combinatorik, Teubner, second edition, 1927 (Chelsea, 1958). [2] P. A. MacMahon, Combinatory analysis I, II, Cambridge Univ. Press, 1915-1916 (Chel- sea, 1960). [3] J. Riordan, Combinatorial identities, Wiley, 1968. 
331 A Perturbation of Linear Operators 331 (XII.13) Perturbation of Linear Operators A. General Remarks Historically, the perturbation method was developed as an approximation device in classical and quantum mechanics. In the per- turbation theory of eigenvalues and eigenfunc- tions, created by L. Rayleigh and E. Schrodin- ger, the main concern was to find solutions as power series in a parameter K that could be regarded as small. In the perturbation theory for linear operators, however, we are con- cerned more genera11y with the behavior of spectral properties of linear operators when the operators undergo small change. The foundation of the mathematical theory, includ- ing a complete convergence proof of perturba- tion series, was laid down by F. Re11ich [IJ and T. Kato [2,3]. Another major topic in the perturbation theory for linear operators is the perturbation of continuous spectra, which 'Was initiated by K. O. Friedrichs (Math. Ann., 115 (1938); [4J). It is closely related to scattering theory and is discussed more fully in 375 Scattering Theory. A standard reference in this field is [5J (also  [6, 7J). Most of the material presented in this article is taken from [5]. For problems in Hilbert spaces there are two general frameworks in which to formulate perturbation situations: the operator formula- tion and the form formulation. In the former we deal with a family of operators T(K) direct- ly, while in the latter, we deal with associated semi bounded Hermitian (or, more generally, sectorial) forms t(K). The latter is applicable only when there is semiboundedness (or a sectorial property) inherent in the problem, but is usually more general than the former in such problems, since the latter (resp. the former) requires roughly the constancy of the domain of the "square root" of T(K) (resp. the domain of T(K)). In this article we discuss problems in the operator formulation. For the form formulation  [5J and [7]. In this article X, Y,..., are complex Banach spaces and 1', A, ... are linear operators unless other specifications are made. The fo11owing notations defined in 251 Linear Operators are used without further explanation: D(T), R(T), B(X, Y), B(X), qT) (the graph of 1'), (J(T) (the spectrum of 1'), p(T) (the resolvent set of 1'), and R((; 1') (the resolvent of 1'). We also use C(X, Y) (resp. A(X, Y)) to denote the set of all tclosed linear operators (resp. all tlinear operators) from X to Y and C(X)=C(X,X). 1234 B. Stability of Basic Properties (1) Let TEC(X, Y). Important notions for characterizing the sma11ness of A E 4(X, Z) relative to I' are the fo11owing. (i) A is said to be relatively bounded with respect to I' (or simply T-bounded) if D(A) => D(T) and there exist a, b;;;: 0 such that (*) IIAullzallullx + bll Tully for a11 uED(T). The infimum, denoted by II A II T, of b for which (*) holds with some a is caIled the T-bound of A. (ii) A is said to be relatively compact with respect to l' (or T-compact) if D(A)=>D(T) and A is compact from D(T) with the graph norm of I' to Z ( 68 Compact and Nuclear operators F). T-compactness of A implies 1'- boundedness (and in Hilbert space;; IIAIIT=O). (2) Let TEC(X, Y), and let AEA'X, Y) be 1'- bounded. (i) If IIAII T< 1 (or if A is T-compact), then 1'+ A E C(X, Y). (ii) If, in addi1 ion, X = Y is a Hilbert space, Tis tself-adjoint, and A is tsymmetric, then I' + A is self-adjoint (Rellich- Kato theorem) [1,8]. (iii) Suppose that Tis a tFredholm operator. If either A is T-compact or the inequality (*) holds with constants a, b satisfying bp + a < p for a certain positive number p determined by 1', then T+A is a Fredholm operator and ind(T+A)=ind I' (for ind 1', nul 1', and defT  251 Linear Operators). In the latter case when bp+a<p, we also have nul(T+A)nul I' and def(T+ A)defT. C. Continuity and Analyticity of Families of Closed Operators In order to handle unbounded operators, which are important in application:" it is neces- sary to introduce generalized notions of con- vergence and analyticity of families of closed operators (1) C(X, Y) becomes a tmetric space by a distance function d(S, 1') having the property that b(qS), qT))d(S, T)2b(qS),qT)), where for closed subspaces M and N we put b(M, N) =max[b(M, N), b(N, M)J, b(M,N)= sup dist(u,N);b(O,N)=O. uEM.llull =1 b(M, N) is called the gap between M and N [5]. When Tn --> I' in this metric, 1'" is sa id to con- verge to I' in the generalized sense. This gen- eralized convergence coincides with the norm convergence if Tn' TEB(X, Y). If X = Yand p(T) # 0, then 1',,--> Tin the generalized sense if and only if for some (or eq Ulvalently all) (Ep(T) we have (Ep(T,.) for sufficintly large nand IIR((; Tn) - R((; 1')11-->0, n--> Gel. This is called norm resolvent convergence. 
1235 (2) When X = Y, there is also the notion of strong convergence in the generalized sense [5], which is roughly the strong convergence of resolvents. In particular, when T" and Tare self-adjoint operators in a Hilbert space, T" -+ T strongly in the generalized sense if R(,; T,,)-+ R(,; T) strongly for some (or equivalently an) , with 1m' =1= O. This is caned strong resolvent convergence. (3) Let DeC be a domain. The notion of analyticity (holomorphic property) of a family T(K)EB(X, Y), KED, of bounded operators is wen known ( 37 Banach Spaces K). This notion is generalized to a family T(K)E C(X, Y), KED, of closed operators [1,5]. Namely, T(K) is said to be holomorphic in D if at each Ko E D there exist a Banach space Z and U(K)EB(Z,X), V(K)EB(Z, Y), defined near Ko, such that (i) U(K) and V(K) are holomor- phic at Ko as families of bounded operators; (ii) U(K) is one to one and onto D(T(K)); (iii) T(K)U(K)= V(K). Let us mention several spe- cial cases. (I) if X = Yand if' Ep(T(K)) for an KED, then T(K) is holomorphic in D if and only if R(,; T(K)) is holomorphic in D. (II) If D(T(K)) is independent of K and if T(K)U is holomorphic in D for every uED(T(K)) then T(K) is holomorphic in D (holomorphic family of type (A) [5]). (III) Let TE C(X, Y), and let T(n)EA(X, Y) such that D(T(n»)=:ID(T) and IIT(n)ull cn-1(allull +bIITull), uED(T), where a, b, cO. Then T(K)U= Tu+ KT(l)U+ ... + KnT(n) + .." uED(T) defines a holomorphic family of type (A) inD={KIIKI«b+c)-I}. (IV) If X= Y is a Hilbert space and if T(K) is self-adjoint for real K, T(K) is said to be a self-adjoint family. In particular, the family discussed in (III) is a self-adjoint holomorphic family if T is self- adjoint and T(n) is symmetric. D. Perturbation of Isolated Eigenvalues (1) Separation of the spectrum. Let TEC(X). Suppose that a bounded subset  of a(T) is separated from the rest of a(T) by a simple closed contour r (i.e., r ep(T) and (a(T)\) lies inside (outside) of r). Then the operator P=- 2 1 . r R(,; T)d', mJr which is independent of r, is a projection (i.e., P E B(X) and p2 = P). The closed subspaces XI =PX and X 2 =(I-P)X treduce T and give rise to the decomposition T= Tlx, EB Tlx 2 = T 1 EB T 2 . In particular, a(TrJ = a(T) n {inside of r} and a(T 2 )=a(T)n {outside of q. (2) Let T(K) be holomorphic in D. We as- sume that OED and regard T(O)= T(O) as the unperturbed operator. Suppose that  and r are as in (1) with T replaced by T(O). Then 331 D Perturbation of Linear Operators there exists <5>0 such that IKI<<5 implies rep (T(K)). This fonows from the upper continu- ity of compact components of the spectrum with respect to the metric J. of C(X) [5]. Thus the separation of the spectrum discussed in (1) is applicable to T(K). In particular, corre- sponding to the projection P(K)=- 2 1 . r R(CT(K))d', IKI<<5, mJr T(K) is decomposed as T(K) = T 1 (K) EB T 2 (K); and the problem of determining the spectrum of T(K) inside r is reduced to the problem of determining the spectrum of T 1 (K) (IKI <<5). Suppose now that  = {Ao} is an isolated eigenvalue of T(O) and that m=dimP(O)X < 00. Then dim P(K)X = m, I KI < <5. Moreover, a base {41dK), ..., 41m(K)} of P(K)X can be constructed in such a way that the 41iK) are holomorphic in {I KI < <5'  <5} [3,5]. Thus the problem for T 1 (K) in this case is just the finite-dimensional eigenvalue problem det{A<5 jk -(T(K)41j(K), 41k(K))} =0. The totality PiK)} of solutions of this equation, i.e., the totality of eigen- values of T(K) near Ao, is expressed by one or several power series of K 11p with a suitable integer p> O. If T(K) is a self-adjoint family, we can take p= 1 so that the eigenvalues are holomorphic near Ao. If H(K) =H(O) + KH(I) + ... is a self-adjoint holomorphic family described in example (IV) in (3) of Section C and if m = 1, the power series A( K) = LAj K j can be explicitly computed as AI =(H(l)u o , u o ), A 2 = (H(2)u o , uoJ-t (SH(I)U O ' H(I)U O ), ..., where H(O)Uo=AoU o with !luoll = 1 and where S = lim'Ao R(,; H(O))(I - P(O)) is the reduced resolvent. This series is known as the Rayleigh- Schrodinger series. The power series for the associated eigenvectors U(K) = L Kju j can also be computed. For details, including the case of a degenerate Ao (m> 1), in which the situation becomes more complicated due to the splitting of eigenvalues,  [5]. The perturbation theory discussed in this subsection is caned analytic (or regular) perturbation theory. (3) Even when a problem cannot be handled by means of analytic perturbation, it may happen that the coefficients Aj and u j of formal power series can be computed up to a cer- tainj. In many such cases it can be shown under general assumptions that an asymptotic expansion such as A(K) =Ao + AI K + O(K) is valid as long as the coefficients involved can be computed legitimately [2,5]. Estimates for O(K) can also be given. This provides a rigor- ous foundation for the perturbation method in many important practical problems. The case of degenerate Ao can be treated similarly. The strong convergence in the generalized Sense mentioned in (2) of Section C is used here. This theory is caned asymptotic perturbation theory. 
331 E Perturbation of Linear Operators E. Perturbation of Continuous Spectra For continuous spectra, studying the mode of change under perturbation is not usually a tractable problem. Rather, certain parts of continuous spectra tend to be stable under perturbation; and the study of this stability has been a major topic in perturbation theory (also  375 Scattering Theory). In this section we discuss only self-adjoint operators and let H = f A dE(A), Ho, ..., be self-adjoint opera- tors in a Hilbert space X. For Bp and II lip to be used below  68 Compact and Nuclear Operators. (1) The essential spectrum ( 390 Spectral Analysis of Operators E) is stable under com- pact perturbation. Namely, if H = Ho + K with compact K, then (J'e(H)=a'e(Ho) (H. Weyl, Rend. Circ. Mat. Palermo, 27 (1909)). More generally, it suffices to assume that R(,; H) - R(,; Ho) is compact for some (or equivalently all) 'Ep(H)np(H o )' Conversely, if X is sepa- rable and if (J'e(H) = (J'e(Ho), then there exist a unitary operator U and a compact operator K such that H = U Ho U -1 + K (J. von Neumann, Actualites Sci. Ind., 229 (1935)). Moreover, any self-adjoint operator H in a separable Hilbert space can be changed into H + K with a pure point spectrum by adding a KEBp with IIKllp <I> for any p> 1 and 1»0 (S. T. Kuroda, Proc. Japan Acad., 34 (1958)). I. D. Berg (1971), W. Sikonia (1971), J. Voigt (1977), and D. Voicu- lescu (1979) have extended these results to normal operators and m-tuples of commuta- tive self-adjoint operators. Also  390 Spec- tral Analysis of Operators I, 1. (2) The absolutely continuous spectrum ( 390 Spectral Analysis of Operators E) is stable under perturbation by the ttrace class. Name- ly, if H=Ho+K, with KEBh then the abso- lutely continuous parts of Ho and Hare tuni- tarily equivalent, and in particular (J'ac(H) = (J'ac(Ho) (M. Rosenblum, Pacific J. Math., 7 (1957); T. Kato, Proc. Japan Acad., 33 (1957)). Among generalizations we mention the follow- ing two. (i) If R(';H)-R(';Ho)EB1 for some 'E p(H) n p(Ho), then the absolutely continuous parts of qJ(Ho) and qJ(H) are unitarily equiva- lent for any smooth strictly increasing real function qJ (M. Sh. Birman, Izv. Akad. Nauk SSSR, ser. mat., 27, (1963); T. Kato, Pacific J. Math., 15 (1965)). (ii) If Ho and H act in different Hilbert spaces Xo and X, respectively, and if there exists JEB(Xo,X) such that JD(Ho)c D(H) and such that the closure of HJ -JH o belongs to B 1 (X o , X), then the same conclusion as in (i) holds (D. Pearson, J. Functional Anal., 28 (1978)). (i) can be derived from (ii). Perturba- tion theory for absolutely continuous spectra is closely related to the study of generalized wave operators in scattering theory. In fact, 1236 the existence and the completeness of the latter implies the unitary equivalence of absolutely continuous parts. All the results me'ntioned above are proved by scattering-theoretic methods, either by the wave operator ap- proach or by the abstract stationary approach ( 375 Scattering Theory, esp. B, C). F. Some Other Topics (1) For the perturbation theory for semigroups of operators and evolution equations, not discussed in this article,  [5,7,9]. (2) The detailed structure of continuous spectra is hard to analyze. An eigenvalue Ao of Ho which is embedded in the continuous spec- trum may diffuse into the continuous spectrum in the presence of a perturbation. In such a case, H(K), K #0, has no eigenvalue:; near Ao but may have a continuous spectrum highly concentrated around AD. This phenomenon of spectral concentration is studied, especially for some concrete problems, in relation to resonance poles (or poles of the holomorphic continuation of the resolvent or the scattering matrix). In some problems, it is proved that the first few terms of the perturbation series for A(K) that are still computable are wlated to the real part of the resonance. Some problems of resonance can be treated by the technique of dilation analyticity, a technique which is also effective in other problems of spectral analysis (1. Aguilar and J. M. Combes, Comm. Math. Phys., 22 (1971)). (3) A vast quantity of results in ttle spectral theory of the Schrodinger operators appear- ing in the tSchrodinger equation in quantum mechanics can be obtained by perturbation methods. For the topics mentioned in (2) and (3)  [7]. References [1] F. Rellich, Storungstheorie del Spektral- zerlegung I-V, Math. Ann., 113 (1937), 600- 619; 113 (1937), 677-685; 116 (1939), 555-570; 117 (1940),356-382; 118 (1942),462-484. [2] T. Kato, On the convergence of the per- turbation method, J. Fac. Sci. Univ. Tokyo, sec. I, 6 (1951), 145-226. [3] T. Kato, On the perturbation 1heory of closed linear operators, 1. Math. Soc. Japan, 4 (1952),323-337. [4] K. O. Friedrichs, On the perturbation of continuous spectra, Comm. Pure AppI. Math., 1 (1948), 361-406. [5] T. Kato, Perturbation theory for linear operators, Springer, 1966. 
1237 [6] N. Dunford and J. T. Schwartz, Linear operators I-III, Wiley-Interscience, 1958, 1963, 1971. [7] M. Reed and B. Simon, Methods of mod- ern mathematical physics I-IV, Academic Press, 1972, 1975, 1979, 1978. [8] T. Kato, Fundamental properties of Hamil- tonian operators of Schrodinger type, Trans. Amer. Math. Soc., 70 (1951),195-211. [9] H. Tanabe, Equation of evolution Pitman 1979. (Original in Japanese, 1975.)' , 332 (VI. 7) Pi (n) The ratio of the circumference of a circle to its diameter in a Euclidean plane is denoted by n, the initialletter of neplJ1.e1:pOr; (perimeter). Thus n can be defined as 2 J: dx/ J1-x 2 . The symbol n has been used since W. Jones (1675-1749) and L. Euler. The fact that this ratio is a constant is stated in Euclid's Ele- ments; however, Euclid gave no statement about the numerical value of 11:. As an approxi- mate value of n, 3 has been used from antiq- uity. According to the Rhind Papyrus, (4/3)4 was used in ancient Egypt. Let L.(I.) be the perimeter of a regular n-gon circumscribed about (inscribed in) a circle of radius 1. Then the relations L.>n>I., 2 1 1 -=-+- L 2 . L. I; 1 2 .= JI.L 2 . hold. Archimedes obtained 3 ? < n < 3 by calculating L96 and 196' In 3rd-century China Liu Hui used n,=:3.14. In 5th-century China, Tsu Chung-Chih mentioned 22/7 as an inaccu- rate approximate value and 355/113 as an accurate approximate value of n. These values were obtained by methods similar to those of Archimedes. In 5th-century India, Aryabhatta obtained n '=: 3.1416, and in 16th-century Europe, Adriaen van Roomen obtained n,=:355/113. F. Viete represented 2/n in the fo11owing infinite product: fI cos= fi J+ fi .=2 2. -{i 2 2V2 x + J+A .... 332 Pi (n) Using this formula, L. van Ceulen (1540-1610) calculated n to 35 decimals. In the 17th and 18th centuries, the Japanese mathematicians T. Seki, K. Takebe, and Y. Matunaga computed n to 50 decimals. Since the 17th century, many formulas that represent n as a sum of infinite series or as a limit have been used to obtain more accurate approximate values. The fol- lowing are representations of n known in those days: n 2.2'4'4.6.6... 2 1.3'3.5.5.7... (1. Wa11is) n 1 F 3 2 52 --- -- 41+2+2+2+... the notation  83 Continued Fractions) (W. Brouncker; for =1-1/3+ 1/5-1/7+ ... (J. Gregory, G. W. F. Leibniz) =4 Arctan 1/5-Arctan 1/239 (J. Machin). A formula combining Machin's representation of n and the power series Arc tan x = x- (1/3)x 3 + (1/5)x 5 - ... is ca11ed Machin's for- mula and was often used for calculating an approximate value of n. By utilizing this for- mula, in 1873 W. Shanks obtained an approxi- mate value of n up to 707 decimals. No im- provement of his approximation was obtained unti11946 when D. F. Ferguson calculated 710 digits of n and found that Shanks's value was correct only up to the 527th digit. The com- putation of an accurate approximate value of n has been made easier by the recent develop- ment of computing machines, and an approxi- mate value up to 1,000,000 decimals has been obtained. P. Beckmann [2] gives a detailed and humorous historical account of the calculation of n from ancient times up to the present com- puter age. Various numerical results obtained by electronic computers are not forma11y pub- lished, some being deposited in the UMT repository of the editorial office of the journal Mathematics of computation. Choong et a1. [3], using information in [1], obtained the first 21,230 partial denominators of the regular continued fraction representation of nand described how their numerical evidence ta11ies with theoretical results, obtained by the met- rical theory of continued fractions, which is valid for almost a11 irrational numbers (e.g.  [4]). In 1761, J. H. Lambert used Brouncker's expression of n in a continued fraction to prove that n is irrational. In 1882, C. L. F. Lindemann proved that n is a ttranscendental number using Euler's formula en! = -1. The approximate value of n up to 50 decimals is 3.141592653589793238462643383279502884197 16939937510 ... ( Appendix B, Table 6). 
332 Ref. Pi(n) References [1] D. Shanks and 1. W. Wrench, Calculation of n to 100,000 decimals, Math. Comp., 16 (1962),76-99. [2] P. Beckmann, A history of pi, second edition, Golem Press, 1971. [3] K. Y. Choong, D. E. Daykin, and C. R. Rathbone, Rational approximations to n, Math. Comp., 25 (1971), 387-392. [4] A. Khinchin, Continued fractions, Noord- hoff, 1963. 333 (11.18) Plane Domains A. Domains in the Complex Plane A tdomain (i.e., a tconnected open set) in the tcomplex plane or on the tcomplex sphere is called a plane domain. The tclosure of such a domain is called a closed plane domain. In this article, we consider only subsets of the com- plex plane (or sphere), and a plain domain is called simply a domain. The tinterior of a tJordan curve J in the complex plane is a domain called a Jordan domain. In a domain D, a Jordan arc whose two endpoints lie on the boundary of D is called a cross cut of D. For a domain D, each of the following three conditions is equivalent to the condition that D is tsimply connected: (1) For every cross cut Q of D, D - Q has exactly two tconnected components. (2) Every Jordan curve in D is thomotopic to one point, that is, it can always be continuously deformed to a point. (3) The tmonodromy theorem holds in D. If D is a domain on a complex sphere, each of the following three conditions is equivalent to the condition that D is simply connected: (4) The boundary of D consists of a single tcon- tinuum or a single point. (5) For every Jordan curve C in D, either the interior or the exterior of C is contained in D. (6) The complement of D with respect to the complex sphere is a connected (not necessarily arcwise con- nected) closed set. Jordan domains are simply connected. Let n;;:' 2 be an integer and D a plane domain. The thomology group Hr(D, Z) is identical to zn-I if and only if the complement of D in the complex sphere has n connected components. Then D is said to be n-ply con- nected or multiply connected without speci- fying n. If D is an n-ply connected domain, there exist n-l suitable m:Jtually disjoint cross cuts Q 1, ... , Qn-1 such that D- (Q 1 U ... U Qn-1) is simply connected. Some typical examples of domains are as 1238 follows: (1) Circular domain: I z - c 1< r. (2) Half-plane: Rez>O, or Imz>O. (3) Angu- lar domain: rx < arg(z -c) < p. (4) Annular domain: r< Iz-cl <R. (5) Slit domain: a domain obtained by excluding a Jordan arc [" from a domain D, where all points on [" (except an endpoint lying on the houndary of D) are contained in D. In this case, the Jordan arc [" is called the slit of the domain. B. Boundary Elements A boundary point P of a domain D is called accessible if there exists a sequence of points Pv tending to P such that the line segments PI P 2 , ... lie completely in D. For t:xample, for the domain obtained by removin, x= 1/(n+ 1), O<y 1/2 (n= 1, 2,...) from th, square O<x< 1, O<y< 1, the boundary points with x=O, Oy< 1/2 are all inaccessible (Fig.!). P, Fig. 1 Let the domain D be bounded by a smooth Jordan curve, and let P be a boundary point of D. Take an angular domain D' with vertex at the point P and the initial parts of the two sides of D' lying in D. A curve in D converging to the point P from the interior or the angular domain D' is called a Stolz's path or a nontan- gential path ending at the point P. Let D be a simply connected domain. A sequence {qJ of cross cuts mutually disjoint except for their endpoints is called a funda- mental sequence of cross cuts if it satisfies the following two conditions (Fig. 2): (1) Every qv separates qV-1 and qv+! on D. (2) For v--+ 00, the sequence qv tends to a point on the bound- ary. Let {qv} be a fundamental sequence of cross cuts, and denote by Dv the subdomain of D separated by qv that contains qv+1' The inter- section n Dv consists only of the boundary points of D. Two fundamental sequences {qv}, {q} of cross cuts are equivalent ir every Dn contains all q except for a finite number of v, and every D contains all qv except for a finite number of v. Here Dn' D are the mbdomains constructed from qn and q as above. This condition determines an equivalence relation, under which the equivalence class of funda- mental sequences of cross cuts is called a boundary element. This notion is due to C. Caratheodory [2]. The boundary element of a 
1239 multiply connected domain is defined similarly for each isolated component of the boundary. For example, each point of a slit domain, except for the endpoint of r lying on the boundary of the domain, determines two distinct boundary elements on each side. A closed domain is usua11y considered to be the union of a domain and the set of a11 its bound- ary elements. Various notions of tideal bound- ary COme from considering suitable boundary elements for various purposes ( 207 Ideal Boundaries). Fig. 2 C. Domain Kernels Let {G v } be a sequence of domains containing the origin O. If a suitable neighborhood of the origin is contained in G v for a11 v, there exists a domain G such that every closed domain containing the origin and contained in Gis contained in G v except for a finite number of v. The union K of such domains G is ca11ed the domain kernel of the sequence {G v } (Caratheo- dory). If there is no neighborhood of the origin contained in G v for a11 v, we put K = {O}. If every infinite subsequence of {G v } has the same domain kernel K, then we say that the sequence {G v } converges to K. The notion of domain kernel is important in considering the limits of a sequence of conformal mappings ( 77 Conformal Mappings). References [1] M. H. A. Newman, Elements of the to- pology of plane sets of points, Cambridge Univ. Press, second edition, 1951. [2] C. Caratheodory, Uber die Begrenzung einfach zusammenhangender Gebiete, Math. Ann., 73 (1913), 323370. 334 (X.33) Plateau's Problem A. Origin Because of surface tension, a soap membrane bounded by a given closed space curve takes 334B Plateau's Problem the shape of a minimal surface, i.e., a surface of the least area. This experiment was performed by the Belgian scientist J. A. Plateau (1873) to realize minimal surfaces; hence Plateau's problem is that of determining the minimal surfaces bounded by given closed space curves. It is a problem of the tca1culus of variations. B. Formulation Let r be a tsimple closed curve in xyz-space such that its projection C on the xy-plane is also a simple closed curve. Let D be the finite domain bounded by C. We consider surfaces z = z(x, y) having common boundary r. Then under suitable assumptions on the smooth- ness of z(x, y), the problem is to minimize the tfunctional J[z]= It J l+p2+q 2 dxdy; oz P= ox ' oz q= oy ' with the condition that z=z(x,y) has r as its boundary. The tEuler-Lagrange differential equation for the functional J[z] is  p + q - 0 oX J l+ p2 +q2 oy J l+ p2 +q2 ' or (1 +q2)r-2pqs+(1 + p2)t=0, r=o2 z / ox 2, S=02Z/oxoy, t=02 Z / oy 2, which is a second- order tquasi1inear partial differential equation of e1!iptic type and whose geometric interpre- tation had already been given by M. C. Meus- nier (1776). To formulate the problem more genera11y, let a surface be expressed in vector form :r = :r(u, v) by means of parameters u, v. Let its tfirst fundamental form be d:r 2 = E du 2 + 2F dudv+ Gdv 2 and its tsecond fundamental form be -d:rdn=Ldu 2 +2Mdudv+Ndv 2 , with n = n(u, v) the unit normal vector. By equating to zero the tfirst variation of the areal functional f f J EG-F 2 dudv based on infinitesimal displacement in the normal direction, we obtain the Euler- Lagrange equation in the form 2H=(NE-2MF +LG)/(EG-F 2 )=0, where H=(R 1 1 +R;I)/2 is the tmean curva- ture of the surface and R 1 and R 2 are the tradii of principal curvature. Since tBeltrami's second differential form satisfies 8. 2 :r=Hn, the condition for a minimal surface becomes Ii:r = o (with 8. the tLaplace operator) provided that isothermal parameters u, v satisfying E = 
334C Plateau's Problem G, F =0 are chosen, i.e., the vector :r(u, v) rep- resenting a minimal surface is harmonic (the components of this vector are tharmonic func- tions of u, v). Let t)(u, v) be a harmonic vector conjugate to :r(u, v). Then isothermality is ex- pressed by the condition that the analytic vector ty(w) = :r(u, v) + it)(u, v) (w = u + iv, i = ) satisfies ty'(W)2 =0 (Weierstrass). In general, a minimal surface is defined as a sur- face with everywhere vanishing mean curva- ture, and Plateau's problem is to determine the minimal surface with a preassigned boundary. In this formulation, the problem can be easily generalized to an n-dimensional Euclidean space R" ( 275 Minimal Submanifolds). C. Existence of a Solution The existence of a solution of Plateau's prob- lem was discussed by S. N. Bernshtein (1910) from the viewpoint of a tboundary value prob- lem of the first kind for the e11iptic partial differential equation in the previous section. A. Haar (1927) dealt with the minimal problem for the functional J[z] by a tdirect method in the calculus of variations. Previously, Riemann, Weierstrass, Schwarz, and others dis- cussed the case where the given space curve r is a polygon, in connection with the tmono- dromy group concerning a second-order linear ordinary differential equation. Subsequently, R. Garnier (1928) investigated the existence of a solution by the limit process when r is a simple closed curve with bounded curvature. However, when r is assumed merely to be trectifiable, the existence of a solution was first shown by the limit process by T. Rad6 (1930). He further discussed the general case where r can bound a surface with finite area. On the other hand, by introducing a new functional depending on boundary values instead of the areal functional, 1. Douglas (1931) succeeded in giving a satisfactory result for the existence problem. R. Courant (1937) gave another existence proof by reducing Plateau's problem to the tDirichlet principle [3]. At present, the methods of discussing the existence of solutions of Plateau's problem can be classified into the following three sorts (represented, respectively, by Rad6, Courant, and Douglas): (1) The first method is to minim ize directly the areal functional Sf J EG-F 2 dudv. The variational equation of the areal functional becomes H = o. (2) Dirichlet's functional for a scalar function f(u, v) is defined by D[n = f fUu 2 + f})dudv, and for an n-dimensional vector function f(u, v) with components.h(u,v) (j= 1, ...,n) by D[f] = j1 D Lfj]. The existence of a solution of Plateau's problem can be discussed by starting 1240 from the variational problem of minimizing D[f]. The variational equation of D[fJ is f= O. (3) An analytic vector (3'(w) is representable in terms of the boundary values of its real part. For instance, if the domain of w is the unit disk Iwl < 1, then Poisson's integral formula 1 f 2" eiO+w ty(w)= (e)de+ilmty(w) 2n 0 e' -w with the boundary function (e) =, Re (3'(e iO ) can be used. On the other hand, the vector function that minimizes the Dirichlet integral among functions with fixed boundary values is harmonic. Based on these facts, Douglas trans- formed Dirichlet's functional with harmonic argument functions into a functional whose arguments are boundary functions. Specifi- cally, by starting from the problem of minimiz- ing Douglas's functional 1 f 2" f 2n (9(e)-(qJ))2 A [ h ] = dedm ') 4 4 . 2 (e )/2 'Y, n 0 0 sm - qJ we can prove the existence of solution of Plateau's problem satisfactorily. D. The Generalized Case Up to now we have been concerned with Plateau's problem in the case of a single simple closed curve. Douglas, Courant, and others treated the generalized case of a fmite number of boundary curves, where tgenus and orienta- bility are assigned as the topological structure of the surface to be found. The existence of a solution has been shown in this case also. The problem is further generalized from the case of fixed boundary to the case where the bound- ary is merely restricted to lie on a given mani- fold [3]. On the other hand, C. B Morrey (1948) generalized the problem by replacing the ambient space R" by an n-dimensional tRiemannian manifold and gave the existence proof in considerable generality [6]. E. Relation to Conformal Mappings There is a notable relation betweln Plateau's problem and conformal mapping when the dimension of the space is 2. Namely, the exis- tence proof of the solution of the former for a Jordan domain implies tRiemann's mapping theorem together with W. F. Osgood and C. Caratheodory's result on boundary corre- spondence ( 77 Conformal Mappings). F. New Developments Among recent contributions to the study of Plateau's problem, the following remarkable 
1241 results have emerged. One of them is con- nected with the final result of Douglas (1939) on the existence of solution surfaces. The mapping of a 2-dimensional manifold with boundary into R" defining Douglas's solution of the Plateau problem for a finite number of simple closed curves is a tminimal immersion with the possible exception of isolated points where it fails to be an immersion. These points are called branch points. It was then proved by R. Osserman (1970) and R. D. Gulliver (1973) that for n = 3 the mapping of Douglas's theorem, which is a surface of least area, is free of branch points, i.e., is an immersion. Osserman also gave examples of generalized minimal surfaces in R" (n > 3) with true branch points. In this connection, Gulliver also dealt with an analogous problem for surfaces of prescribed mean curvature. Next, we mention the question of boundary regularity. H. Lewy (1951) proved that if the boundary of a minimal surface is analytic, then the surface is analytic up to the bound- ary. Subsequently, S. Hildebrandt (1969) and others proved that if the boundary is of class cm.., m;:' 1, the surface is also of class cm.. up to the boundary. There are also some recent results on the number of solutions of Plateau's problem. For instance, J. C. C. Nitsche (1973) proved the uniqueness of solutions for analytic boundaries of ttotal curvature at most 4n. Further developments in connection with Plateau's problem have emerged in the work of E. R. Reifenberg (1960) and others, who sought to minimize the tHausdorIf measure among general classes of geometric objects, not as parametrized manifolds, but as subsets of R". The existence and regularity of solutions of Plateau's problem from this point of view have been discussed by H. Federer (1969), W. H. Fleming, F. J. Almgren, and others [10]. References [1] T. Rad6, On the problem of Plateau, Erg. Math., Springer, 1933. [2] 1. Douglas, Solution of the problem of Plateau, Trans. Amer. Math. Soc., 33 (1931), 263-321. [3] R. Courant, Dirichlet's principle, con- formal mapping and minimal surfaces, Inter- science, 1950. [4] 1. C. C. Nitsche, V orlesungen iiber Mini- ma1f1iichen, Springer, 1975. [5] R. Osserman, A survey of minimal sur- faces, Van Nostrand, 1969. [6] C. B. Morrey, Multiple integrals in the calculus of variations, Springer, 1966. [7] R. Osserman, A proof of the regularity everywhere of the classical solution to Pla- 335 Ref. Poincare, Henri teau's problem, Ann. Math., (2) 91 (1970), 550- 569. [8] S. Hildebrandt, Boundary behavior of minimal surfaces, Arch. Rational Mech. Anal., 35 (1969), 47-82. [9] J. C. C. Nitsche, A new uniqueness theo- rem for minimal surfaces, Arch. Rational Mech. Anal., 52 (1973),319-329. [10] H. Federer, Geometric measure theory, Springer, 1969. [11] F. J. Almgren, Plateau's problem. An invitation to varifold geometry, Benjamin, 1966. 335 (XXI.39) Poincare, Henri Henri Poincare (April 29, 1854-July 17, 1912) was born in Nancy, France. After graduating from the Ecole Poly technique, he taught at the University of Caen in 1879, then at the Uni- versity of Paris in 1881. He was made a mem- ber of the Academie des Sciences in 1887 and of the Academie Fran<;aise in 1908. He died in Paris. His achievements center on analysis and applications to theoretical physics and astron- omy. However, his work covered many fields of mathematics, including arithmetic, algebraic geometry, spectral theory, and topology. His tuniformization of analytic functions by means of the theory of tautomorphic functions in 1880 is especially notable. His paper on the tthree-body problem won the prize offered by the king of Sweden in 1889. The methods he developed in his three- volume Mecanique celeste (1892-1899) began a new epoch in celestial mechanics. In addi- tion, Poincare opened the road to talgebraic topology and made suggestive contributions to the ttheory of relativity and tquantum theory. He asserted that science is for science's own sake [4], and his popular philosophical works concerning the foundations of natural science and mathematics exhibit a lucid style. References [1] H. Poincare, Oeuvres I-XI, Gauthier- Villars, 1916-1956. [2] H. Poincare, Les methodes nouvelles de la mecanique celeste 1- III, Gauthier-Villars, 1892-1899. [3] H. Poincare, Science et hypothese, Flam- marion, 1903. [4] H. Poincare, La valeur de la science, Flam- marion, 1914. 
336 A Polynomial Approximation [5] H. Poincare, Science et methode, Flam- marion, 1908. 336 (X.20) Polynomial Approximation A. General Remarks On the existence of polynomial approxima- tions, we have Weierstrass's approximation theorem, which is formulated in the following two forms: (i) If f(x) is a function that is con- tinuous in the finite interval [a, b], then for every t; > 0 there exists a polynomial Pn(x) of degree n = n(t;) such that the inequality If(x) - Pn(x)1  c; holds throughout the interval [a, b]. (ii) If f(O) is a continuous function of period 2n, then corresponding to every posi- tive number F. there exists a trigonometric polynomial of degree n = n(t;), n Pn(O)=a o + L (akcoskO+bksinkO), (1) k1 such that the inequality If(O)- Pn(O)1 F. holds for all values of O. The second form of Weier- strass's theorem follows from the first, and conversely. M. H. Stone obtained a theorem that generalizes Weierstrass's theorem to the case of functions of several variables. Of the many direct proofs now available for Weier- strass's theorem, we mention two simple ones. To prove version (i) of the theorem, we can assume that the given function f(x) is defined in the segment [0,1]. Consider the Bernshtein polynomial Bn(x) = f f (  ) ( n ) Xk( 1 _ x)n-k. kO n k Then Bn(x) converges to f(x) tuniformly. To prove (Ii), we can apply tFejer's theorem on tFourier series. We have the following gen- eralization of (i): Let PI, P2, ... be a sequence of positive numbers such that lim Pn = 00. Then linear combinations of xO = 1, x p1 , 'x P1 , ... can uniformly approximate each continuous func- tion on [0,1] with arbitrary precision if and only if  p;;1 = -00 (Muntz's theorem). B. Best Approximations Let !Polx), !PI (x), ... be a sequence of linearly independent continuous functions on a bounded closed domain A III Rn. For any given continuous function f(x), a function Pn(x) = Lo Ck!Pk(X) attains inf maxl[(x)-Pn(x)1 co. 'Cn JCEA 1242 is called the best approximation of ((x) by a linear combination of {!Pk(X)}. For any given n there is a best approximation of I(x) by a linear combination of !Po (x), ..., !Pn1x), but such an approximation is not always unique. For such an approximation to be unique it is neces- sary and sufficient that the determinant of the matrix (!Pk(X i )) (k, i = 0, 1,2, ... ,n) is not zero, where x o , XI' ..., X n are n + 1 arbitrary distinct points of A (Haar's condition) (Math. Ann., 78 (1918)). If {!Pk(X)} satisfies this condition, the system of functions {!Pkho. .n is called a Chebyshev system (or unisolvent system). The sets {1,x,x 2 , ...,xn} on [a,b], {1,cosx, ..., cosnx} on [O,n] and {sinx,...,sinnx} on [O,n] are Chebyshev systems. For a Chebyshev sys- tem {!pdx)} on [a, b], let Pn(x) be a linear com- bination of !Po (x), "', !Pn(x) that is not identical to the functionfEC[a,b]. Then Pn(x) is the best approximation for f(x) if and only if there are at least n + 2 distinct points XI <... < Xn+2 of [a, b], where If(x) - Pn(x)l attains its maxi- mum (these points are called deviation points) and (f(x;)- Pn(x i )) (f(X i + 1 ) - Pn(x i + )) <0 (i= 1, ..., n + 1) (Chebyshev's theorem). For example, consider the polynomial Pn(x) =an_IXn-1 +... +alx+a o with real coeffi- cients such that max Ixn-an_Ixn-1_... -aol -lxl takes its smallest value. Then x n - Pn(x) = r<n-l)T,,(x), where T,,(x) = cos(n arc cosx) is the Chebyshev polynomial of degree n. Since the best approximation is desired for numerical computation, several mthods have been developed to find it ( 300 Numerical Methods). However, when the set .4 c Rn (n ;;" 2) contains three nonintersecting arcs emanating from a common point, A admits no Chebyshev system. Thus we do not always have a unique best-approximation polynomial [ 16]. C. Degrees of Approximation and Moduli of Continuity For a continuous function [(x) defined on [a, b], the modulus of continuity of kth order is defined by wk(f;t)= J. It (-I)k- v C)f(x+Vh)! a!Sxb ax+khb for t  (b - a)/k. In particular, WI is the ordi- nary modulus of continuity. Put E:(f) = infao. .a,.b , . .b, maxa';Y;;blf(x) - Pn(x)l, where f is a continuous periodic function of period 2" and Pn(x) is a trigonometric polynomial of the form (1). Then E:U) ckwd (; 1/(n + I)) (Jackson's theorem [1]), where Ck is indepen- 
1243 dent of f The best possible coefficient C k has been determined by J. Favard [2]. Further investigations on the relation between E:U) and wkU; t) have been carried out by S. N. Bernshtein [3J and A. Zygmund [4]. S. B. Stechkin obtained the following results: ( 1 ) C " w k f;- -f L: (v+1)k-1E:U) n n v=O [5,6J. For the approximation of fEC'([ -1, IJ) by polynomials, there exists a polynomial p"(x) of degree at most n such that for any xE[-l,lJ, If(x)- p"(x) I  Mr(t(x))' WI u(r); t(x)), where M r is a constant not depending on f, x, and n, f(r)(x) is the rth derivative of f(x), and t(x)=(I/n)( + (Ixl/n»). We also have theorems evaluating wku(r); t) in terms of If(x) - p"(x)l. For the proof of these theorems, estimation of the magnitude of the derivative of the polynomial of degree n plays an essen- tial role. For example, we have the Beroshtein inequality max" I T:(x) I  nmax,,1 T,,(x) I for any trigonometric polynomial T,,(x) of degree n and the Markov inequality IP(x)1 nmin[I/, nJ max 1P"(x) I -l::s;,;x::s;,;l for XE[ -1, 1J and any polynomial p"(x) of degree n. D. Approximation by Fourier Expansions If {4J"(x)} is an torthonormal system of func- tions in L 2 (a, b) and f is any function in L 2 (a, b), then among all linear combinations of 4Jo(x),..., 4J"(x) the one that gives the best mean square approximation to f (i.e., the one for which the integral r (f(X)- kt O C k 4Jk(X)Y dx attains its minimum) is the Fourier polynomial LZ=o ak4Jk(x), where a k = ff(x)4Jk(X)dx. Con- sequently, the least square approximation (or best approximation with respect to the Lr norm) by trigonometric polynomials is given by the partial sum s"(x) of the Fourier series of f(x). For Lp (1 <p< (0), s"(x) also gives the best approximation up to a constant factor, but in the case of uniform approximation we have If(x)-s"(x)1 A(logn)wkU; n- 1 ), and this result cannot be improved in general. There is no linear operation that gives the best trig- onometric approximation. In approximation with a linear combination of 4Jo(x),..., 4J"(x), the saturation phenomenon of approximation often appears. For example, observe the arith- metic means of s"(x) (i.e., tFejer means u"(x)). If 336F Polynomial Approximation fELiplX (O<IX< 1) ( 84 Continuous Func- tions A), then If(x)- u"(x)1 = O(n-O). However, If(x) - u"(x)1 = O(n -1) if and only if the tconju- gate functionf(x)ELip 1; If(x)-u"(x)l=o(n- 1 ) if and only if f(x) is constant (M. Zamansky [7J, G. Sunouchi and C. Watari [8J). E. Trigonometric Interpolation Polynomials Since the trigonometric system is a Chebyshev system, given 2n + 1 distinct points Xo, Xj, ..., X2" and arbitrary numbers Co, C 1 ,..., C 2 ", there is always a unique trigonometric poly- nomial of degree n with prescribed values C k at the points Xk' Given any continuous function f(x) with period 2n, the trigonometric poly- nomial that coincides with f(x) at the points Xk is called the trigonometric interpolation polynomial with nodes at Xk. If Xk = 2nk/(2n + 1) (k=O, 1, ...,2n), then the interpolating trigo- nometric polynomial is given by (I. 1  I: sin( (n + 1/2)(x - Xj)) U x)=- L.. J(x,) ", 2n+ 1 j =0 J sin((x-x j )/2) 1 en sin((n+ 1/2)(x-t)) = 2n Jo f(t) sin((x-t)/2) d4J"(t), where 4J"(t) is a step function that has the value 2njj(2n + 1) in [2njj(2n + 1), 2n(j + 1)/ (2n + 1)]. U"(f, x) resembles the partial sum s"(x) of the Fourier series of f(x). If f(x) is con- tinuous and of tbounded variation, then U"(f, x) converges uniformly to f(x) (D. Jack- son [lJ). Although the partial sum s"(x) of the Fourier series of a continuous function f(x) converges almost everywhere to f(x), there is a continuous function for which U"(f. x) diverges everywhere (J. Marcinkiewicz [9J). Moreover, there exists a continuous function for which (l/n)(LZ=1 Uk(f, x)) diverges everywhere (P. Erdos [IOJ, G. Grunwald [IIJ). Restating these facts for the algebraic polynomial case, we can conclude that there is a continuous function defined in [ -1, 1 J for which the tLa- grange interpolation polynomial and its arith- metic mean are both divergent everywhere if we take as nodes the roots of the Chebyshev polynomial of degree n. F. The Case ofa Complex Domain If a given function f(z) is holomorphic in a bounded tsimply connected domain E in the complex plane and continuous in if, then f(z) is approximated uniformly by polynomials on any compact set in E (Runge's theorem). This theorem was first studied by C. Runge, and his results were developed by J. L. Walsh and M. V. Keldysh (e.g., [12J). When E contains no interior point, the polynomial approxima- 
336G polynomial Approximation tion of a continuous function defined in E was given by M. A. Lavrent'ev. Unifying these two results, S. N. Merge1yan obtained the following theorem [13]: A necessary and suffi- cient condition for an arbitrary function con- tinuous on a compact set E and holomorphic inside E to be approximated on E uniformly by polynomials is that the set E does not divide the complex plane. On the degree of approximation of poly- nomials to f(z) on a simply connected domain, there are the following results: Let D be a closed bounded set whose complement K is connected and regular in the sense that K possesses a tGreen's function G(x, y) with a pole at infinity. Let DR be the locus G(x, y) = 10gR >0. When }(z) is holomorphic on D, there exists the largest number p with the following property: {(z) is single-valued and holomorphic at every interior point of Dp. If R <p, there exist polynomials P,,(z) of degree n (n= 1,2,...) such that I}(z)- P,,(z)l.:(:MIR" for zED, where M is a constant independent of n and z. On the other hand, there exist no such polynomials Pn(z) on D for R > P (Bernshtein and Walsh [12J). G. Lagrange's Interpolation Formula F or each n (n = 0, 1, .., ), let z\"!, z), ... ,zn2, be a given set of real or complex numbers, and let f(z) be an arbitrary function. Then there is a unique polynomial of degree n that coincides with f(z) at each point zL") (k = 1, ... ,n + 1). This is called Lagrange's interpolation poly- nomial and is given by ,,+1 w(z) p (z ) - " f( z(") ) " - 1.... k ( (n) ) ' ( In) ) ' k=1 Z-Zk W Zk W(z) = (z - zit))... (z - Z"21)' The sequence Pn(z) does not always converge to {(z) For example, if we take f(z) = I/z and the (n + I )st roots of I as zL"), then P,,(z) = z" and Pn(z) converges to f(z) only at the point 1. For real variables also, there are examples of divergent Pn(z). However, if {(z) is holo- morphic in Izl <p (p > 1), then Pn(z) with the (n+ I)st roots of 1 as nodes converges to I(z) uniformly in Izl.:(: 1. When 4 n ) is independent of the choice of n, Pn(z) coincides with the sum of the first n terms of tNewton's interpolation formula. In this case, P,,(z) is called Newton's interpolation polynomial and is given by Pn(z) = aD + a l (z - z, )+a 2 (z-z,)(z-z2)+'" + a,,(z - z rJ... (z - z,,), where aD = f(zJJ; a 1 = (I'(Z2)- f(zrJ)/(Z2 -Zl) (Z2 #Zl)' a 1 = f'(ZI) (Z2 = Zl); and so on. Suc- 1244 cessive coefficients of the polynomial P,,(z) can be calculated by tfinite differences. Conver- gence of Newton's interpolation polynomial is closely connected to convergence of tDirichlet series. H. Chebyshev Approximation Let D be a bounded closed subset of the com- plex plane, and f(z) a continuous function on D. Then there exists a polynomial n,,(z) of de- gree n such that max zED If(z) - n,,(z)1 attains the infimum EnU). The polynomial n,,(z) is unique and is called the best approximation polynomial (in the sense of Chebyshev). If D is simply connected and f(z) is single-valued and holomorphic on D, then n,,(z) conyerges to f(z) uniformly on D. Moreover, in this case there exist a number M that does not depend on n and a number R> 1 such that If(zl- nn(z)l.:(: M I R". Assuming that f(z) satisfies certain ad- ditional conditions, W. E. Sewell [14J proved the existence of a constant r such that 1 {(z)- n,,(z)l.:(: Mln' R". Furthermore, by approximat- ingf(z)=z" by polynomials of de,ree n  1, we can show that there exists a polynomial 7;,(z) of degree n such that min {EX Iz" +a l Z,,-I +... + a"I}='1 T,,(z)l. 7;,(z) is called a Chebyshev polynomial of de- gree n with respect to the domain D. Similar statements are valid for functions of a real variable. In particular, when D = [ -I, I J, we have 7;, (x) = cos(narc cosx)/2 n - l , which is the ordinary (real) Chebyshev poly- nomial. Generally, the limit ( ) 1/" lim max 17;,(z)1 = p(D) n---+OC; D exists, and p(D) coincides with the tcapacity and ttransfinite diameter of D [15]. For new results and applications of Chebyshev poly- nomials  [17]. If the method of evaluating the degree of approxImation using the absolute value I{(z)- n,,(z)1 is replaced by methods using a tcurvi- linear integral or tsurface integral, as explained below, we still obtain similar results. Let D be a closed domain in the complex plane with a boundary C that is a rectifiable Jordan curve. If {(z) is single-valued and holomorphic on D, then there exists a polynomial n,,(z) of degree n that minimizes the integral Scu(z)ll(z)- nn(z)IPldzl (p > 0), where u(z) is a given posi- tive continuous function On C. Moreover, II(z)-nn(z)I.:(:MIR" for some R> I (actually {n,,(z)} is toverconvergent). If D is a closed 
1245 Jordan domain and if J(z) is single-valued and holomorphic in D, then there exists a poly- nomial nn(z) of degree n that minimizes the integral f f D u(z)IJ(z) - nn(z)IP dS, where u(z) is a given positive continuous function on D. Moreover, IJ(z)-nn(z)1 MjRn for some R> 1 on D. I. Approximation by Orthogonal Polynomials on a Curve Let C be a rectifiable Jordan curve in the complex plane, and let Pk(Z) EtL 2 (C). If fCPk(Z )Pn(z) ldzl = 8 kn , then {Pk(Z)} is called an orthonormal system on C. Given a holo- morphic function J(z) on D, we set a k = fCJ(Z)Pk(Z)ldzl and consider the formal series r.ro akPk(z). If we denote the nth partial sum of this series by sn(z), then sn(z) is the least square approximation by a linear combination of Po(z), "', Pn(z). This and other results, such as tBessel's inequality, the tRiesz-Fischer theorem, etc., are all valid here as in the theory of general torthogonal systems. In particular, if we take Izl=R as C, then {1,z,z2,...} is an orthogonal system. Since in this case 1 f -k 1 f J(z) a k - 2k+1 J(z)z Idzl =------; k+T1dzl 2nR c 2m C Z and s,,(z) = a o + alz +... + anz n , the tTaylor expansion of J(z) coincides with the ortho- gonal expansion of J(z). Given a compact domain D and a holomor- phic function on D, if there exist orthogonal polynomials Pn(z) such that the orthogonal expansion of J(Z} with respect to Pn(z) con- verges to J(z) uniformly on D, we say that {Pn(z)} belongs to the domain D. The problem of existence and determination of such poly- nomials for any given domain was proposed and first solved by G. Faber. Generalizations were given by G. Szego, T. Carleman, and Walsh. Roughly speaking, Pn(z) is given by the orthogonalization of the system {t, z, Z2, ... } with respect to the curvilinear integral on C = aD or the surface integral on D. J. Numerical Approximation of Functions The accuracy of the approximation of a given function J(z) by the partial sums of its tTaylor expansion r. an(x - xo)" decreases rapidly as the distance I x - Xo I increases. The accuracy of the approximation of J(x) defined on a com- pact interval [A, B] by a (polynomial) function qJ(x) can be evaluated by means of the least square approximation, the best approximation with respect to the uniform norm, and so on. The second method is best suited to numer- 336 Ref. Polynomial Approximation ical calculation of functions. To get the best approximating polynomial qJ(x) = Pn(x) = r.Z=o CkqJk(X) of J(x) ( Section B), we must determine coefficients C k that satisfy the con- ditions of Chebyshev's theorem. The first step in this process is the orthogonal development of J(x) by Chebyshev polynomials {7;.(x)}: qJn(x) = r.Z=oa k (u), u=(x-(A + B)j2)j((B- A)j2). The error IJ(x) - qJn(x) I is estimated by a constant multiple of 7;.+1 (u): IJ(x) - qJn(x) I  KI 7;.+du)l. This Chebyshev interpolation is actually given by a o = N -I r.J(xJ, a k = 2N- I r.J(Xi)(UJ (k= 1,... ,n), where N = n+ 1 and the Ui=(xi-(A +B)j2)j((B-A)j2) (i = 1, "', N) are the roots of TN(U). Let M be the extremum of the error IJ(x) - qJn(x)1 of such an approximation, and set J(xJ- qJn(Xi) = ::!:: M i (i = 1, 2,..., N). Consider a func- tion ipn(x) = r. ak (u) satisfying J(xJ - ip(x i ) = ::!:: M. Then solve the linear equation ipn(xJ- qJn(xJ = ::!:: (M - MJ with respect to ak = a k - a k and M. Repeat this process until ak becomes sufficiently small. A computer can perform the division very quickly, and the rational approximation of a function, for example by its tcontinued frac- tion expansion, is often useful. References [1] D. Jackson, The theory of approximation, Amer. Math. Soc. Colloq. Publ., 1930. [2] 1. Favard, Sur les meilleurs procedes d'approximation des certains classes des fonc- tions par des polynomes trigonometrique, Bull. Sci. Math., 61 (1937),209-224,243-265. [3] S. N. Bernshtein (Bernstein), Sur l'ordre de meilleure approximation des fonctions con- tinues par des polynomes de degre donne, Mem. Acad. Roy. Belgique, ser. 2,4 (1912), 1-104. [4] A. Zygmund, Smooth functions, Duke Math. J., 12 (1945), 47-75. [5] N. I. Achieser, Theory of approximation, Unger, 1956. (Original in Russian, 1947.) [6] A. Zygmund, Trigonometric series, Cam- bridge, 1959. [7] M. Zamansky, Class de saturation de certains procedes d'approximation des series de Fourier des fonctions continues et applica- tions a quelques problemes d'approximation, Ann. Sci. Ecole Norm. Sup., ser. 3, 66 (1949), 19-93. [8] G. Sunouchi and C. Watari, On the deter- mination of the class of saturation in the the- ory of approximation of functions II, Tahoku Math. J., 11 (1959),480-488. [9] J. Marcinkiewicz, Sur la divergence des polynomes d'interpolation, Acta Sci. Math. Szeged., 8 (1937),131-135. 
337 A Polynomials [IOJ P. Erdos, Some theorems and remarks on interpolations, Acta Sci. Math. Szeged., 12 (1950), 11-17. [IIJ G. Grunwald, Uber Divergenzerschei- nungen der Lagrangeschen Interpolations- polynome der stetigen Functionen, Ann. Math., (2) 37 (1936), 908-918. [12J J. 1.. Walsh, Interpolation and approxi- mation by rational functions in the complex domain, Amer. Math. Soc. Colloq. Publ. 1935; fifth edition, 1981. [I3J S. N. Mergelyan (Mergeljan), On the representation of functions by series of poly- nomials on closed sets, Amer. Math. Soc. Transl., ser. I, 3 (1962), 287-293. (Original in Russian, 1951.) [14J W. E. Sewell, Degree of approximation by polynomials in the complex domain, Ann. math. studies 9, Princeton Univ. Press, 1942. [15J E. Hille, Analytic function theory II, Ginn, 1962. [16J J. R. Rice, The approximation of func- tions, Addison-Wesley, I, 1964; 11,1969. [17J S. J. Karlin and W. J. Sneddon, Tcheby- sheff systems with applications in analysis and statistics, Interscience, 1966. 337 (111.4) Polynomials A. Polynomials in One Variable Let R be a commutative tring and ao, aI, "', an elements of R. An expression f(X) of the form f(X)=a O +a 1 X +... +anX n is called a polynomial in a'variable X over R; if an to 0, the number n is called the degree of the polynomial f(X) and is denoted by degf If an = I, the polynomial (1) is called a monic poly- nomial. The totality of polynomials in X over R forms a commutative ring with respect to ordinary addition and multiplication (whose definition will be given later). It is called the ring of polynomials (or the polynomial ring) of X over R and is denoted by R[X]. We say that we adjoin X to R to obtain R[X]. B. Polynomials in Several Variables Let R[X, Y] denote the ring R[XJ[Y]' name- ly, the ring obtained by adjoining Y to R [X]. An element of R[X, Y] can then be expressed as a", x" Y'. This expression is called a polynomial in X and Y over R. Generally, R[X"... ,XmJ = R[X 1 ,... ,X m - 1 J[X m J is called the polynomial ring in m variables (on m inde- 1246 terminates) XI'''' ,X m over R, and its element F(X 1 ,X 2 ,...,X m )='a" ,XlX2...Xm L.. 12m (2) ( denotes a finite sum for nonnegative in- tegral Vi beginning with VI = V 2 = .., = V m = 0) is called a polynomial in m variables XI' .,. , X m over R. We call each summand a tfrm of the polynomial, a vlv ,. V m the coefficient and VI + V 2 + .., + V m the degree of this term. The greatest degree of terms is called the degree of the polynomial F. The term a o 0 of degree 0 is called the constant term of F. If a polyno- mial F in XI' ..., X m is composed of terms of the same degree n, then F is called a homoge- neous polynomial (or form) of degree n; a polynomial consisting of a single term, such as aXIX'... Xm, is called a monomial. Now let ()(" ()(2, ." , ()(m be elements of R (or a commutative ring S containing R), and let F(()(" ()(2, ..., ()(m) denote the element of R (or S) obtained by substitu tion of ()(, , ()(2, ... , O(m for X"X 2 , ""X m in F(X 1 ,X 2 , ...,X m ). It is also called a polynomial in 0(1,0(2, ..., ()(m. If F(()(1,()(2, "',()(m)=O, then (()(1,()(2, "',()(m) is called a zero point (in S) of the polynomial F(X"X 2 , ...,X m ) (or a solution of the alge- braic equation F(X" ...,Xm)=O). 111 the case of one variable, it is called a root ot F(Xd (or of F(Xd=O). C. polynomial Rings (1) Addition and multiplication in R[XJ are defined by (aiXi)+(bjXj)= (ai +bi)X i , (aiXi)( bjXj) = di+Naibj)Xk. A polynomial f(X)ER[XJ can be regarded as a function of a commutative ring R' con- taining R into itself such that Cf-4 f(c). In this sense,j(X)+g(X) andf(X)g(X) are the func- tions such that Cf-4 f(c) + g(c) and (f-4 f(c)g(c), respecti vel y. It holds that deg(J(X)+g(X)) max{degf(X),degg(X)}, degf(X)g(X) degf(X)+degg(X). If R is an tintegral do- main, then the latter inequality is an equality, and therefore R[XJ is an integral domain. For these inequalities and for convenience elsewhere, we define the degree of 0 to be indefinite. Assume that R is a field. For given j,gER[XJ (degg;;;: I), we can find unique q,rER[XJ such that f = gq +r and degr<degg or 1'=0 (divi- sion algorithm). This q is called the integral quotient of f by g, and r is called the remainder of f divided by g. The same fact remains true in the general R[XJ if g(X) is monic. ( 369 Rings of Polynomials). 
1247 D. Factorization into Primes Let k be an integral domain. Since k[X] and hence k[Xr,..., Xm] are integral domains, we can define the concepts concerning divisibility (such as a divisor, a multiple, etc.) (67 Com- mutative Rings). If k is a tunique factorization domain, then so are k[X] and k[Xr,..., Xm]. A polynomial over k is said to be primitive if the greatest common divisor of an the coeffi- cients is equal to 1. Every polynomial Over k can be uniquely expressed as a product of some primitive polynomials and an element of k; a product of primitive polynomials is primi- tive (Gauss's theorem). If k is a field, then k[X] and k[Xr, ... ,X m ] are unique factorization domains. Further- more, to find the greatest common divisor (J, g) of J, 9 E k [X], we can USe the Euclidean algorithm, that is, apply the division algorithm repeatedly to obtain f=gq1 +rr, g=r1q2+ r 2, r1 =r2q3+r3, ..., degg>degr1 >deg r 2 >..., so that after a finite number of steps we attain r V -1 =r v qV+1(r v +1 =0). Then rv =(J, g). Accord- ingly, k[X] is a tprincipal ideal domain. This algorithm is applied to Euclid rings ( 67 Commutative Rings L). E. The Remainder Theorem Let k be an integral domain, f(X)Ek[X], and let g(X)=X -1X(IXEk). Then using the division algorithm, we get f(X)=(X -1X)q(X)+r, q(X)Ek[X], rEk. Therefore, f(lX) = r; that is, the remainder of f(X) divided by X -IX is equal to f(IX). This is called the remainder theorem. If f(lX) =0, then f(X) is divisible by X -IX in k[X]. F. Irreducible Polynomials Let k be a field. A polynomial f(X)Ek[X] of degree n is said to be reducible over k if f is divisible by a polynomial of degree v < n in k[X] (v#O); otherwise, it is said to be irreduc- ible over k. Any polynomial of degree 1 is irreducible. A polynomial f is a tprime element of k[X] if and only if f is irreducible over k. Let I be a unique factorization domain. If f(X) is a polynomial (1) in I[X] such that for a prime element p in I, a. '/= 0 (mod p), a.- 1 =a.-2 =... =ao =0 (modp) but a o ,/=0 (mod p 2), then f(X) is irreducible over the field of quotients of I (Eisenstein's theorem). If a polynomial (2) in m variables over an 337 I Polynomials talgebraic number field k is irreducible, we can obtain an irreducible polynomial in Xr,..., X p (O<Jj<m) from the polynomial F(Xr, ... ,X m ) by assigning appropriate values in k to Xp+r,..., X m (Hilbert's irreducibility theorem, J. Reine Angew. Math., 110 (1892». These two theorems have been generalized in many ways and given precise formulations. In Hilbert's irreducibility theorem, the algebraic number field may be replaced, for example, by any infinite field that is tfinitely generated over its tprime field (K. Dorge, W. Franz, E. Inaba). G. Derivatives Given a polynomial f(X 1 , ... ,X m )= Ia V ,v 2 vmXr1X;2... x::,m over a field k, we define the (formal) derivative of f with respect to Xi as h(Xr, ... ,X m) = L v i a v ,v 2 Vm XIi ...X7 i - 1 ...X;;'m and denote it by af/ax i . The map fi--> af/ax i is caned the (formal) derivative with respect to Xi' In particular, if m= 1, then af/ax is denoted by df/dX. The usual rules oftderivatives also hold for the formal derivative. If df/dX =0 for an irreducible polynomial f(X) in k[X], then f(X) is said to be inseparable; otherwise, f(X) is separable. If the tcharacteristic of the field k is 0, then every irreducible polynomial f(X) ( # 0) is separable. When k is of characteristic p#O, an irreducible polynomial f(X) is insep- arable if and only if we can write f(X)=g(XP). H. Rational Expressions  The tfield of quotients of the polynomial ring k[X 1 , ...,X.] over a field k is denoted by k(X 1, ... ,X.) and is called the field of ra- tional expressions (or field of rational func- tions) in variables Xr,..., X. over k. Its element is called a rational expression in Xr,..., X.. It can be written as a quotient of one poly- nomial f(X 1 , ... ,X.) by another polyno- mial g(X 1 , ...,X.)#O. Also, an expression f(1X1' ..., 1X.)/g(IXr,..., IX.) obtained by replac- ing XI'''.' X. with elements 1Xr, ..., IX. of k in the above expression is called a rational expression in IX l' .,. , IX. (provided that g(IXr,..., IXn)#O). I. Symmetric polynomials and Alternating Polynomials Let f(X 1 ,... ,X.) be a polynomial in vari- ables Xr, ..., X. over an integral domain I. If 
337 J Polynomials {(XI' ..., X n ) is invariant under every per- mutation of XI, ..., X n , it is called a sym- metric polynomial (or symmetric function) of X" "', Xn. If f(X 1 , ..., X n ) is transformed into -{(XI'"'' X n ) by every todd permuta- tion of Xr, ..., X n , it is called an alternating polynomial (or alternating function). Also, an expression {(rY.r,..., rY. n ) obtained from f(X 1 , ..., X n ) by replacing X" ..., X n with elements 'rY. l' .. . , rY. n of 1 is called a sym- metric (alternating) function of rY. 1 , ..., rY. n if f(X"..., X n ) is symmetric (alternating). Let the coefficient of xn-k in the expansion of (X - X,).. .(X - X n ) be denoted by (- I )kITk' Then we have IT 1 =LX i =X 1 + .. +X n ,a 2 = LX i X j =X,X 2 +X,X 3 +. . +X n - 1 X n ' ...,ITn= X 1 X 2 ' ...,X", Obviously, these are sym- metric polynomials of XI' ..., Xn. Moreover, for every element rp of the polynomial ring I[Y" Y 2 , ..., Y n ], rp(IT 1 ,a2, ...,ITn) is a sym- metric polynomial of XI' ..., X", Conversely, every symmetric polynomial of XI' " ,X n can be uniquely expressed as a polynomial rp(IT!, IT 2 , ..., an)' Thus the totality of symmetric polynomials of XI, ..., X n is identical with the ring 1[17,,172' ..,ITn]. This is called the funda- mental theorem on symmetric polynomials, and IT l' IT 2, . .. , ITn are called elementary symmetric polynomials (or elementary symmetric func- tions). For example, for s, = LX;" (v= 1,2,. .), we have s, =aps2 =a -2IT2,S3 =a - 317 1 17 2 + 317 3 , S4 = ai-4aa2 + 2a +417, 173 -417 4 , Concerning the elementary symmetric poly- nomials and the SV, we have the relations Sv-a1sv-1 +a2sv-2 -... +( -1)'-l av _1 s 1 + (-I)'vav=O (v= 1,2,.. .), and sn - a, sn- 1 +... +(--l)"a n 'n-n=O (Jl=n+ 1,n+2, ...) (Newton's formulas). Letp(X1'" ,X n )=(X 2 -X,)(X 3 -X 1 )... (X n - X,)(X 3 - X 2 )... (X n - X n - 1 ) be the product of n (n - I )/2 differences between X r, ... , X n' Then the polynomial p is invariant under even permutations of Xr,..., X n , and p becomes - p under odd permutations. Hence p is an alter- nating polynomial of Xr, ..., Xn. It is called the simplest alternating polynomial of these vari- ables. Because of its particular expression, p is also called the difference product of X" ..., Xn. If the characteristic of 1 is different from 2, an alternating polynomial f is divisible by the simplest alternating polynomial p; it can be written as f = ps, where s stands for a sym- metric polynomial. J. Discriminants The square D(X 1 , ..., X n ) = p2(Xr, "', X n ) of the simplest alternating polynomial p is a symmetric polynomial, and it is therefore a polynomial in a" ...,a n . D(rY."rY.2' ...,rY.n)=O 1248 gives a criterion for the condition tr at some of rY." ..., rY. n are equal. If rY. 1 , ..., rY. n are the roots of an talgebraic equation aoxn +a, xn-1 +... + an = 0 of degree n, then D( , , . .. , rY. n ) is called the discriminant of the equation. It can be ex pressed in terms of coefficients a o , a, , ... , an of the equation. For instance, if n = 2, we have a5D=a-4aoa2; ifn=3, we have aD= aa + 18a o a, a2a3 - 4aoa -4a a 3 -- 27a5a. References [lJ B. L. van der Waerden, Algebra I, Springer, seventh edition, 1966. [2J N. Bourbaki, Elements de mathematique, Algebre, ch. 4, Actualites Sci. Ind., 1102b, Hermann, second edition, 1959. [3J A. G. Kurosh, Lectures on general algebra, Chelsea, 1970. (Original in Russian, 1955.) [4J R. Godement, Cours d'algebre, Hermann, 1963. For Hilbert's irreducibility theorem, [5J S. Lang, Diophantine geometry, Inter- science, 1962, ch. 8. 338 (X.28) Potential Theory A. Newtonian Potential In dynamics, a potential means a fUllction u of n variables XI' ..., X n such that - grad u = -(Du/Dxr,..., ou/ox n ) gives a field of force in the n-dimensional (n;;;: 2) Euclidean space Rn Given a point P in R" and a measure Jl, the functions u(P) given by the integral:; u(P)= - fl0g PQ dJl(Q), n=2, u(P)= f PQ 2- n dJl(Q), n;;;:3, are typical examples of potential fu Ktions. They are called the logarithmic pott:ntial and Newtonian potential, respectively. However, some authors mean by Newtonian potential the function u(P) = S PQ -1 dJl(Q) in R 3 . Usu- ally, the measure Jl is taken to be a nonnega- tive tRadon measure with compact tsupport. These potentials are tsuperharmonic in R" and harmonic outside the support of Jl. Conversely, any harmonic function defined on a domain in R" can be expressed as the sum of a potential of a single layer and a potential of a double layer (defined in the next paragraph). Because of this close relation between potentials and harmonic functions, sometimes pot,ntial 
1249 theory means the study of harmonic functions ( 193 Harmonic Functions and Subhar- monic Functions). (For the representation by potentials of superharmonic functions  193 Harmonic Functions and Sub harmonic Func- tions S.) Suppose that the measure /1 of R 3 satisfies d/1=pd, with sufficiently smooth density p and volume element dr. Then the Newtonian potential u of /1 satisfies Poisson's equation u = -4np. If the support of /1 is contained in a surface Sand d/1 = P d(J with density p and surface element d(J, then the potential u of /1 is caned the potential of a single layer (or simple distribution). If p is continuous on S, then u is continuous in the whole space, and the tdirec- tional derivative of u at a point P in the direc- tion of the normal line to S at Po tends to -2n p (p o )+ f P- a a 1 I d(J(Q) S np PQ PPo as P approaches Po along the normal line. Therefore, as P moves on the line, the direc- tional derivative jumps by -4np(P o ) at Po. If p satisfies the tH6lder condition at Po E S, then the derivative at P in the direction of any fixed tangent line at Po has a finite limit as P tends to Po along the normal line. The integral U(p)= f P- a a 1 d(J(Q) S nQ PQ is caned the potential of a double layer (or double distribution). If p is continuous on S, then the limits at Po of u from the two direc- tions along the normal line at Po exist and are 2np(P o )+ u(Po) and - 2np(P o )+ u(Po). If, fur- ther, p is of class C 2 on S, then each partial derivative of u has a finite limit as P tends to a point of S. B. Generalized Potential The classical notion of potentials is generalized as fonows: Let a be a space supplied with a measure /1 (;::' 0) and <I>(P, Q) a real-valued function on the product space a x a. When the integral f<l>(P, Q)d/1(Q) is wen defined at each point PEa, it is caned the potential of /1 with kernel <I> and is denoted by <I>(P, /1) or <I>/1(P). The function <I>(P, Q)=<I>(Q, P) is caned the adjoint kernel of <1>. When (/1, v)= f f <I>(P,Q)d/1(Q)dv(P) = f <I>(P, /1)dv(P) exists for measures /1, v;::, 0, the value is caned the mutual energy of /1 and v. In particular, (/1, /1) is caned the energy of /1. The definition of 338C Potential Theory potential given above may be too general, and some restrictions are caned for. We assume that a is a Hocany compact Hausdorff space; <I> is a Hower semicontinuous function on a x a satisfying -00 < <I>  00; and /1, v, and A are nonnegative Radon measures with compact support in a. In particular, when a = Rn, the potential with the kernel <I>(P, Q) = PQ -' (0 < IX < n) is caned a potential of order IX (some- times of order n -IX) or a Riesz potential. C. The Maximum Principle and the Continuity Principle Let a = Rn, and let <I>(P, /1) be the kernel of the Newtonian potential. Then <I>(P, /1) satis- fies the fonowing principles: (1) Frostman's maximum principle (first maximum principle); sUPPen <I>(P, /1)  sUPPeS" <I>(P, /1) for any /1, where SI' is the support of /1. (2) Ugaheri's maximum principle (dilated maximum prin- ciple): There is a constant c;::' 0 such that sUPpen<l>(P,/1)csuPPeS" <I>(P,/1) for any /1. (3) A variation of U gaheri's maxim urn principle: Given any compact set K c a, there exists a constant c which may depend on K such that sUPPeK<I>(P,/1)csuPPeS" <I>(P,/1) for any /1 with SI' c K. (4) Upper boundedness principle: If <I>(P, /1) is bounded from above on SI" then it is bounded on a also. (5) For any compact set K c a and any /1 with SI' c K, if <I>(P, /1) is bounded above on SI" then it is bounded on K also. (6) Continuity principle: If <I>(P, /1) is con- tinuous as a function on SI" then it is also con- tinuous in a. Generany, the relations shown in Fig. 1 hold among principles (1)-(6), where (a)->(b) means that (a) implies (b), and (c)-#(d) means the negation of (c)-> (d). 11' (3)  (1)  (2) (5)  (6)  (4) 1? Fig. 1 If the continuity principle holds for a gen- eral kernel <I>(p, /1) of a potential and if for any /1 there is a sequence {P k } of points in a - SI' that has an accumulation point on SI' and along which <1>( P k , /1)->suPn-s" <I>(P, /1), then (1) holds also. The second condition is valid, for instance, when a=R n , <I>(P,/1) is tsubharmonic in Rn-sl" and limsup<l>(P,/1)<suPQeRn<l>(Q,/1) as P tends to the point at infinity. T. Ugaheri, G. Choquet, and N. Ninomiya studied (2) and (6). Ugaheri showed that for any nonnegative decreasing function cp(r) defined in [0, (0) and satisfying cp(O) = 00, the kernel <1>( P, Q) = cp( PQ ) satisfies (2) in R n . M. Ohtsuka established 
338D Potential Theory that (6)-.(5) and (5)-+>(6) in general and proved (5)->(6) in the special case where <I> is contin- uous on Q x Q in the wider sense (i.e., <I> may have 00 as its value) and <I> is finite outside the diagonal set of Q x Q. The examples in Sec- tions F, I, and J show that the potentials with kernels satisfying a weak condition such as (6) possess a number of important properties. We note that (6) does not necessarily hold in gen- eral. (For literature on (1 )-(6) and other re- lated principles  [18].) D. The Energy Principle Denote by E the class of all measures of finite energy. A symmetric kernel is called positive definite (or of positive type) if(fl- v, fl- v)= (fl,fl)+(V, v)-2(fl, V);;:' 0 for any fl, VEE. If the equality (fl-v,fl-v)=O always implies t1= v, then the kernel is said to satisfy the energy principle. Some characterizations for a kernel to be of positive type or to satisfy the energy principle were given by Ninomiya. Using them, he showed that a symmetric kernel that satisfies Frostman's maximum principle or the domination principle ( Section L) and a certain additional condition is of positive type. Choquet and Ohtsuka generalized these defi- nitions and results [18]. E. Topologies on Classes of Measures Let Co be the space of continuous functions with compact support in Q and Mo+ be the class of measures in Q. The tseminorms fl- v -> I Sf d/l- Sf dv I (f E Co) define the vague topology on Mo+ . The class of unit distribu- tions can be topologized by the vague topol- ogy, which induces a topology on Q itself. This topology coincides with the original topology in Q. A subclass M of Mo+ is relatively com- pact with respect to the vague topology if on every compact set in Q, the values of the mea- sures of M are bounded. Denote by L the class of measures ). such that ()., fl) is finite for all flE Mo+, and define the fine topology on Mo+ by the seminorms fl-v->I(A,fl)-(A, v)1 (AEL). This topology was introduced by H. Cartan [4J for the Newtonian kernel. It is the weakest topol- ogy that makes each <l>(PJ), AEL, continu- ous. Further, when the kernel is of positive type, the weak topology is defined on E by the seminorms fl-v->I(A,fl)-(A, v)1 (AEE). The strong t opology is de fined on E by the semi- norm J(fl-V,fl-V). For the Newtonian kernel, Cartan [5J showed that vague-«fine-«weak-«strong on E, where vague-<fine, for instance, means that the fine topology is stronger than the vague 1250 topology. He proved that the fine, weak, and strong convergences are equivalent for any sequence {fln} of measures with bounded energies. B. Fuglede [16J called a kernel of positive type consistent when any tCauchy net with respect to the strong topology that converges vaguely to a measure converges strongly to the same measure. This notion is used to give conditions for E to be tcomplete with respect to the strong topology [11,18]. Moreover, Fuglede called a consistnt kernel satisfying the energy principle perfct, and studied the cases where tconvolution kernels on a locally compact topological group are consistent or perfect [11]. For instance, PQ -a (0 < IX < n) in R n and the Bessel kernel, which was studied by N. Aronszajn and K. T. Smith, a re perfect. F. Convergence of Sequences of Potentials We are concerned with determining when a family of potentials corresponding to a class of measures {flm} with indices OJ in a directed set converges. If all Sw are contained in a fixed compact set and flm converge5. vaguely to flo, then liminf<l>(P,flm);;:'<I>(P,flo) in Q. If, moreover, <I> is continuous in the wider sense and both <I> and $ satisfy the continuity prin- ciple, then equality holds quasi-everywhere (q.e.) in Q in this inequality [18J; w, now de- fine the notion q.e. First, for a nonempty com- pact set K in Q, define W(K) to be nf(fl, fl), where S cK and fl(K)= 1, and W(0) to be 00 for the empty set 0. Next, set 11(X) = inf Kcx W(K) for an arbitrary set X in Q and J.v,,(X)=suPXcG Tt;(G), where G is an open set in Q. When a property holds except on a set X such that J.v,,(X) = 00 (resp. Tt;(X) = 00), we say that the property holds quasi-everywhere (q.e.) (resp. nearly everywhere (n.e.)). The terms q.e. and n.e. are also used in the theory oftcapac- ity, although their meaning is not t:le same as here. (For results on the convergence of se- quences of potentials  [18].) G. Thin Sets A set Xc Q is called thin at Po when either Po is an isolated point of the set XU {Po} with respect to the original topology of Q or there exists a measure fl such that lim inf<l>(P, fl) > <I>(Po,fl) as PEX -{Po} tends to Po. If Po is an isolated point of X U {Po} with respect to the topology weakest among those stronger than both the original topology and the fine topology, then X is thin with respect to the adjoint kernel $. The converse is true in a special case [5]. The notion of thinness was 
1251 introduced in 1940 by M. Brelot and inves- tigated in detail in [3]. Let <p(r) be a positive decreasing function. Suppose there exist posi- tive numbers ro, <5, a such that <p(r)  a<p((1 + <5)r) in 0 < r < roo Assume that 0 is a metric space with distance p, and take <p(p(P, Q)) as the kernel <I>(P, Q). Then a necessary and sufficient condition for X to be thin at Po is ;;1 sjjf-Y.(X) < 00 (s> 1) [19], where Xj= {PE XI sj  <p(p(P, Po)) sj+1}. This criterion was obtained by N. Wiener in 1924 and uti- lized to give a condition for a boundary point Po of a domain D in R3 to be tregular with respect to the tDirichlet problem for D. In this situation, Po is regular if and only if the com- plement of D is thin at Po. When every com- pact subset of X is thin at Po, X is caned inter- nally thin at Po. A necessary and sufficient condition for X to be intern any thin at Po is sjjW;(X)< 00. H. Polar Sets Brelot (1941) caned a set A polar when there is a measure fl for which <I>(P, fl)= 00 on A. Consider U(O, K) (= V(K, 0)) as defined in 48 Capacity C, and define Ue(O, X) by sUPGooXinfKC:G U(O, K) for an arbitrary set X, where G is an open set. Then for any fl, X = {P I <I>(P, fl) = 00 } is a G set for which the value of Ue(O, X) is infinite. Conversely, given a G. set A of tNewtonian outer capacity zero in Rn (n;::' 3), there is a measure fl such that the set of points where the Newtonian potential of fl is equal to 00 coincides with A and fl(Rn - A) = 0 (Choquet [7]). This result is caned Evans's theorem (or the Evans-Selberg theorem) ( 48 Capacity) in the special case where A is compact. I. Quasicontinuity A function fin 0 is caned quasicontinuous if there is an open subset G in 0 of arbitrarily sman capacity such that the restriction of f to 0- G is continuous. Natural1y, quasicontinu- ity depends on the definition of capacity. Sup- pose that whenever the potential of a measure fl with kernel <I> is continuous as a function on 51' it is quasicontinuous in 0; <I> is then said to satisfy the quasicontinuity principle. Assuming in addition that <I> is positive symmetric and taking IjU(O, G) as the capacity of G, we find that every potential is quasicontinuous in 0 (M. Kishi). A similar result is valid for a non- symmetric kernel if the continuity principle is assumed (Choquet [6]). 338L Potential Theory J. The Gauss Variational Problem Given a compact set K and a function f on K, the problem of minimizing the Gauss integral (fl, fl) - 2 J f dfl for a measure fl such that 51' c K is caned the Gauss variational problem. When an additional condition is imposed on fl, the problem is caned conditional. Among many results obtained for this problem [18], the fonowing is typical: If <I> is symmetric, K sup- ports a nonzero measure of finite energy, and f is finite upper semicontinuous on K, then there exists a fl such that f(P)  <I>(P, fl) n.e. on K and f(P);::' <I>(P, fl) on 51" When <I> is not sym- metric, the same relations hold for some fl if <I> is positive and ell satisfies the continuity principle (Kishi [15]), although the method using Gauss variation is not applicable. When <I> is symmetric and of positive type, the Gauss integral with f(P) = <I>(P, v) (vEE) is equal to Ilfl-vI1 2 -llvI1 2 , and the minimizing problem is equivalent to finding the projection of v to {flE EI 51' c K}. In some cases, this projection is equal to the measure obtained by the balayage of v to K ( Section L). K. Equilibrium Mass Distributions A unit measure fl supported by a compact set K is caned an equilibrium mass distribution on K if <I>(P, fl) is equal to a constant a n.e. on K and <I>(P, fl)  a in O. The kernel is said to satisfy the equilibrium principle if there exists an equilibrium mass distribution on every compact set. If a > 0, Ija can be regarded as a kind of capacity, and flja is caned a capacitary mass distribution. Corresponding to tinner and outer capacities, inner and outer capaci- tary mass distributions and their coincidence can be discussed [11]. When <I> is symmetric, Frostman's maximum principle is equivalent to the equilibrium principle. L. The Sweeping-Out Principle A kernel is said to satisfy the sweeping-out principle (or balayage principle) iffor any com- pact set K and measure fl there exists a mea- sure v supported by K such that <I>(P, v)= <I>(P, fl) n.e. on K and <I>(P, v)  <I>(P, fl) in O. When we find such a v, we say that we sweep out fl to K, and finding v is caned a sweeping- out process (or balayage). This terminology originated in the classical process for the New- tonian potential in which the exterior of K is covered by a countable number of bans and the masses inside the bans are repeatedly swept out onto the spherical surfaces. For any general kernel, the balayage principle implies 
338M Potential Theory the domination principle (also called Cartan's maximum principle), which asserts that if the inequality <1>(P,):( <1>(P, v) is valid on S for EE and any 11, then the same inequality holds in Q. The converse is true if <1> is positive, sym- metric, continuous in the wider sense, and finite outside the diagonal set. In contrast to the domination principle, <1> is said to satisfy the inverse domination principle if the inequal- ity <1>(P,II):(<1>(P, v) on Sv for EE and any v implies the same inequality in Q. In a special case, the domination principle implies Frost- man's maximum principle [17]. For the equi- librium and domination principles for non- symmetric kernels  [14]. Corresponding to inner and outer capaci- tary mass distributions, we can examine inner and outer balayage mass distributions and inner and outer Gauss variational problems and their coincidences [5, 18]. With respect to the Newtonian potential, a point P is called an internally (externally) irregular point of X if the inner (outer) balayage mass distribution to X of the unit measure f.p at P is different from Ep. X is thin (internally thin) at P if and only if P is an externally (internally) irregular point of X (Cartan [5J). M. Other Principles A kernel <1> is said to satisfy the uniqueness principle if  == v follows from the equality <1>(P,)=<1>(P, v), which is valid n.e in Q. Nino- miya and Kishi studied this principle. A ker- nel <1> is said to satisfy the lower envelope prin- ciple if given  and v, there is a ic such that <1>(P,;,)=min(<1>(P,), <1>(P, v)). If <1> satisfies the domination principle and <i> satisfies the con- tinuity principle, then <1> satisfies the lower envelope principle on every compact set con- sidered as a space. Conversely, if <1> satisfies the lower envelope principle on every compact set considered as a space, then <1> satisfies the domination principle or the inverse domi- nation principle under some additional con- ditions (Kishi). A kernel is said to satisfy the complete maximum principle if the inequality <1>(P,):(<1>(P, v)+a on S implies the same inequality in Q for  E E, any v, and a  0 (Car- tan and J. Deny, 1950). This principle implies both Frostman's maximum principle and the domination principle. Potentials of order li (n-2:(li<n) in R" and the Yukawa potential with kernel a PQ -l exp( -;, PQ ) in R 3 satisfy the complete maximum principle. Relations between this principle and some other prin- ciples were studied by Kishi [14]. While an principles discussed so far are global, Choquet and Ohtsuka made a local stud y. 1252 N. Diffusion Kernels By means of the bilinear form J f d (f E C,  E Mo), we now introduce weak topologies in the space C of continuous functions in Q and in the class Mo of Radon measures of general sign with compact support. Similarly, we in- troduce weak topologies in Co and in the class M of measures of general sign with not neces- sarily compact support. A positive linear map- ping G of Mo into M that is continuous with respect to these two weak topologies is called a diffusion kernel. The linear mappi I1g G* of Co into C that is determined by Sf dG = J G*f d is called a transposed mapping. We can define the balayage principle for G and the domination principle for G* as in the case where kernels are functions. Then G satisfies the balayage principle if and only if G* satis- fies the domination principle (Choquet and Deny). The complete maximum principle has been defined and studied for G* (Deny). G. A. Hunt obtained a relation between this prin- ciple and the representation of G*[ in the form J P'f dt with a tsemigroup . His result is important in the theory of tstochastic pro- cesses ( 261 Markov Processes). O. Convolution Kernels A diffusion kernel G on a locally cO'11pact Abelian group induces a convolution kernel K if G is translation-invariant. It is called a Hunt kernel when there exists a vaguely continuous semigroup {lit},o such that K= Jgo c,dt and lio = Eo, the Dirac measure at the origin. A Hunt kernel satisfies the domination principle and the balayage principle to all open sets, and it satisfies the complete maximum principle if and only if {lit},o is sub-Markovian, Jdli,:( 1. The Fourier transform of such a sernigroup has a closed connection with a negative- definite function [1]. For a convolution kernel K satisfying the domination principle, or, equivalen1ly, the balayage principle, the inequality K(Wr)K(W z ):(K(W 1 -Wr)K(W 1 +w z ) is valid for all relatively compact open sets WI and Wz [8]. It has a unique decomposition K= <P' (Ko + K 1 ), where <p is a continuous ex- ponential function, Ko is equal to 0 or a Hunt kernel satisfying the complete maxi 11um prin- ciple, and K 1 is a singular kernel satisfying the domination principle such that K 1 Hx=K1 for every XES ko [12]. P. Potentials with Distribution Kernels A function f in R" is called slowly increasing in the sense of Deny if there exists a positive 
1253 integer q such thatJf(P)(1 + OP 2 pd't"(P) < 00. A tdistribution K is ca11ed a distribution kernel if the tFourier transform F K of K is a function kO and both k and Ijk are slowly increasing in the sense of Deny. Given such a distribution K, the family W of distributions T for which FT are functions and II TI12 = J k(FT)2 d't" < 00 (this is ca11ed the energy of T) is a tHilbert space with inner product (T 1 , T 2 ) = J kFT 1 FT 2 d't". However, the family of Newtonian potentials of measures with finite energy is not a Hilbert space (Cartan). The family of functions of class Coo with compact support is tdense in W. For every TE W the function FK x FT is slowly increasing in the sense of Deny. There exists a distribution U = U T that satisfies FU = FK x FT, ca11ed the K -potential of T. Since W is complete, the method of projection is applicable, and prob- lems of equilibrium, balayage, and capacity can be examined. For instance, if tDirac's distribution (j is taken as a distribution kernel, then the corresponding capacity is the Le- besgue measure. In the case of the Newtonian kernel, 8U T j8x j =h is defined a.e. in Rn for any TE W. These h are square integrable, T= -cnj1 8h18xj' and II TI12 =Cnj=1 JW 2 d't", where Ijc n = 2(n - 2)nn/2 jr(nj2). Furthermore, U T =(n-2) t (xj-Yj)h(Q) PQ -nd't"(Q), where Xj' Yj are components of P, Q, respec- tively. Every ordinary potential of a double layer is a special case of U T . Conversely, let f be a function in Rn that is absolutely continu- ous along almost every line para11el to each coordinate axis and whose partial derivatives are square integrable. Then f is equal to the potential of some TE W with Newtonian ker- nel up to an additive constant. These results are due to Deny [9]. Q. Dirichlet Spaces In this section, functions are assumed to be complex-valued. Let a be a 10ca11y compact Hausdorff space,  O a Radon measure in a, and Co the space of continuous functions with compact support. A Hilbert space D consisting of1oca11y -integrable functions is ca11ed a Dirichlet space if Co n D is dense in both Co and D, the relations Iv(P)-v(Q)1  lu(P)- u(Q)1 and Iv(P)1  lu(P)1 for uED and a function v always imply vED and IIvll  lIull, and for any compact set K c a, there exists a constant A(K) such that JK lul d  A(K) Ilull for every UE D. The notion of Dirichlet space was in- troduced by A. Beuding. A function uED is ca11ed a potential if there exists a Radon mea- sure /1 such that (u, <p)= J ipd/1 holds for every 338 Ref. Potential Theory <pEC O nD. If in addition /10, then u is ca11ed a pure potential. For any pure potential, the lower envelope principle, the equilibrium principle, the balayage principle, and the com- plete maximum principle hold [2]. Suppose that a is a 10ca11y compact Abelian group and D is a Dirichlet space such that Uyu(x)=u(x- Y)ED and II Uyull = Ilull for every uED and YEa. Then we ca11 D special and characterize it in terms of a real-valued continuous func- tion on a [2]. (For axiomatic potential theory  193 Harmonic Functions and Subharmonic Functions.) References [1] C. Berg and G. Forst, Potential theory on 10ca11y compact Abelian groups, Springer, 1975. [2] A. Beuding and J. Deny, Dirichlet spaces, Proc. Nat. Acad. Sci. US, 45 (1959), 208-215. [3] M. Brelot, Sur les ensembles effiles, Bull. Sci. Math., (2) 68 (1944), 12-36. [4] M. Brelot, Elements de la theorie classique du potentiel, Centre de Documentation Uni- versitaire, third edition, 1965. [5] H. Cartan, Theorie geneale du balayage en potentiel newtonien, Ann. Univ. Grenoble (N.S.), 22 (1946), 221-280. [6] G. Choquet, Sur les fondements de la theorie fine du potentiel, C. R. Acad. Sci. Paris, 244 (1957), 1606-1609. [7] G. Choquet, Potentiels sur un ensemble de capacite nu11e, Suites de potentiels, C. R. Acad. Sci. Paris, 244 (1957),1707-1710. [8] G. Choquet and J. Deny, Noyaux de convolution et balayage sur tout ouvert, Lec- ture notes in math. 404, Springer, 1974, 60- 112. [9] J. Deny, Les potentiels d'energie finie, Acta Math., 82 (1950),107-183. [10] O. Frostman, Potentiel d'equilibre et capacite des ensembles avec quelques applica- tions :i la theorie des fonctions, Medd. Lunds Univ. Mat. Sem., 3 (1935). [11] B. Fuglede, On the theory of potentials in 10ca11y compact spaces, Acta Math., 103 (1960), 139-215. [12] M. 1t6, Caracterisation du principe de domination pour les noyaux de convolution non-bornes, Nagoya Math. J., 57 (1975), 167- 197. [13] O. D. Ke11ogg, Foundations of potential theory, Springer, 1929. [14] M. Kishi, Maximum principles in the potential theory, Nagoya Math. J., 23 (1963), 165-187. [15] M. Kishi, An existence theorem in poten- tial theory, Nagoya Math. J., 27 (1966), 133- 137. 
339 A Power Series [16J N. S. Landkof, Foundations of modern potential theory, Springer, 1972. [17J N. Ninomiya, Etude sur la theorie du potentiel pris par rapport au noyau syme- trique, J. Inst. Poly tech. Osaka City Univ., (A) 8 (1957),147-179. [18J M. Ohtsuka, On potentials in locally compact spaces, J. Sci. Hiroshima Univ., 25 (1961),135-352. [19J M. Ohtsuka, On thin sets in potential theory, Sem. Anal. Funtions, Inst. for Adv. Study, Princeton, 1957, vol. 1, p. 302-313. 339 (XI.2) Power Series A. General Remarks Let a and Co, Cj, c 2 , ... be elements of a tfield K and z be a variable. A series of the form P == Lo cn(z - a)n is called a power series (in one variable). We assume that K is the field of complex numbers. For a given power series P, we can determine a unique real number R (OR  co) such that P converges if Iz- al < R and diverges if R < 1 z - a I. We call R the radius of convergence and the circle Iz -al == R (some- times 1 z -. a I < R) the circle of convergence of P. The value of R is given by R== l/limsup (Cauchy-Hadamard formula) with the conven- tions 0 == 11 co, co == 1/0. Also R = lim ienlcn+1l, provided that the limit on the right-hand side exists. A power series tconverges absolutely and uniformly on every compact subset inside its circle of convergence and defines there a single-valued complex function. Since the series is ttermwise differentiable, the function is actuaJly a holomorphic function of a complex variable. Conversely, any function j(z) holo- morphic in a domain can be represented by a power series in a neighborhood of each point a of the domain. Such a representation is called the Taylor expansion of j(z) at a (or in the neighborhood of a). A power series that represents a holomorphic function is called a holomorphic function element. K. Weierstrass defined an analytic function as the set of all elements that can be obtained by tanalytic continuations starting from a given function element (- 198 Holomorphic Functions). Besides the series Lo cn(z - a)", a series of the form Q == L:'=o Cnz -n is called a power series with center at the point at infinity, and its value at co is defined to be Co. By putting z - a == t when its center a is a finite point and Z-l = t when its center is co, every power series can be 1254 written in the form Locntn, and such a t is called a local canonical parameter. When t is a local canonical parameter, a series of the form L _ 00 C n t n is called a Lau- rent series and a series L_oocntnlk (k a fixed natural number) is called a Puiseux series, after the French mathematicians A. Lau rent and V. A. Puiseux. Power series are sometimes called Taylor series. If we perform tanalytic continuations of a power series from its center along radii of its circle of convergence, we encounter a tsingular- ity on the circumference for at least one radius. For a power series with the circle or conver- gence /z/==R, the argument IX of the singularity on Izl = R nearest z = R is given in t he follow- ing way. Suppose, for simplicity, that the radius of convergence R of L cnz n equals 1, and put Pn(h) = I ( n ) cn-vh V, v=O V P(h) =limsup IPn(h)l . n-oo Then IX is obtained from COSIX=P(O)= lim (P(h)-I)jh h-+O (S. Mandelbrojt, 1937). In particular, if all the C n are real and nonnegative, z = R is a singular- ity (Vivanti's theorem). B. Abel's Continuity Theorem As a property of the power series on the circle of convergence, we have Abel's continuity theorem: If the radius ofconvergencl ofj(z)== LoanZn is equal to 1 and Loan co verges (or is t(C, k)-summable (k> -1)) to A, then j(z)A when z approaches 1 in any sector {z Ilzl < 1, larg(l-z)1 < (nj2)-b}, b >0 (tStolz's path). The converse of this theorem is not always true. The existence of lim z _ 1 j( z) does not necessarily lead to the convergence of Loan' Even Cesaro summability of Lo an does not always follow from the existence oflimz_d(z). If an =o(ljn) and j(z)A when z approaches 1 along a curve ending at z = 1, then I: an con- verges to A (Tauber's theorem, 1897). The theorems concerning additional sufficient conditions for the validity of the converse of Abel's theorem are called theorems of Tauber- ian type (or Tauberian theorems). In Tauber's theorem, the hypothesis on the an may be replaced by a n =O(ljn) or nRea.. nlma n may be bounded from above but not nect:ssarily from below (G. H. Hardy and 1. E. little- wood). Here, the condition a n =O(ljn) cannot be weakened (Littlewood). Sufficient condi- 
1255 tions for a. to be tsummable for various summation methods are also known. C. Lambert Series A series of the form 00 z' '1 a. l _ z " Izli'l, is called a Lambert series. If  a. is convergent, (1) converges for any z with Izi i' 1, and more- over, it converges uniformly on any compact set contained in Izl < 1 or Izl > 1. If  a. is divergent, (1) and the power series  a.z' converge or diverge simultaneously for Izl i' 1. There is a detailed study of Lambert series by K. Knopp (1913). If R is the radius of con- vergence of  a.z' and dl' ad = A. is the sum extending over all divisors of n, then we have the reciprocity relation 00 z" 00 '1 a. 1- z' = '1 A.z', which holds for Izl < min(R, I). As special cases of this relation, we have 00 z' z = I Jl(n)------;;, .=1 1- z z 00 z' (l-Z)2 =J1 <p(n) l_z" where /1 and <p are the tM6bius function and tEuler function, respectively. If the na. are real and bounded from below, 00 z' lim I (l-z)na -=s 2-1-0 n =1 "l-zll implies I:a.=s. Hardy and Littlewood (1921) showed that this theorem of Tauberian type is equivalent to the tprime number theorem. D. Singularities of Power Series Given a power series P =  a.z', if the tbranch in Izi < R* of the analytic function J(z) deter- mined by P is single-valued meromorphic but the branch in Iz 1< R' with R' > R* has sin- gularities other than poles, then R* is called the radius of meromorplty of P and I z I = R * (sometimes I z I < R *) is called the circle of meromorphy of P. R* can be computed in the following way. Put lp=limsupll, .oo a. a.+! a.+ p D(p)- a.+l a.+ 2 a.+ p +! . - a.+ p a.+ p + 1 a.+2p 339D Power Series (1) Then the sequence of numbers lp-r/lp is in- creasing, and liml p _ 1 11 p =R*. If Rt, R2' '" are different valued of lp_r/lp, then J(z) is 11010- morphic at points not lying on Izl = R. (Hada- mard's theorem, 1892). When a point a and a set A of points in the complex plane are given, the set of points that can be joined with a by a segment disjoint from A is called the star region determined by a and A. Take any half-line starting at a; the point of A lying on it and nearest to a is called a vertex. For a power series I: c.z', the set of centers of the function elements obtained by analytic continuations along half-lines starting at the origin is called the star region of  c.z' with respect to the origin. Let {a} and {{1} be the set of vertices of the star regions with respect to the origin of  a.z' and  b.z', respectively. Then the star region determined by the origin and the set {a{1} is contained in the star region of  a.b.z' (Hadamard's multiplication theo- rem, 1892). The following are some results concerning conditions for the coincidence of the circle of convergence of a power series and its tnatural boundary. Let the a. be positive numbers and b a natural number greater than 1. If the radius of convergence of ;'=o a.z b ' is equal to I, then Izi = 1 is its natural boundary (Weier- strass, E. I. Fredholm). If the radius of con- vergence of ;'=o a.zJ.. (with {A..} an increas- ing sequence of natural numbers) is 1 and liminf.oo(A..+!-A..)IA..>O, then Izl = 1 is the natural boundary (Hadamard's gap theo- rem). The latter condition was weakened to liminf.oo(A..+t-A..)/A>O by E. Borel (1896). E. Fabry (1896) showed that with the radius of convergence of I:;'=oa.z' being 1, if there exist a suitable sequence of natural num- bers m 1 <m2 <... and a number () (0< ()< I) such that limsJmj=O, where the Si are the number of nonzero a. contained in the inter- val (m i (I-(}),m i (1 +(})), then Izl = 1 is the natural boundary of  a.z'. By applying this theorem to  a.z An with radius of con- vergence I, it can be shown that iflim.oo A..ln = 00, then Izl = 1 is its natural boundary. These theorems are called gap theorems because they concern power series with gaps in their exponents. A generalization of Fabry's theorem was obtained by G. P61ya. It is known that Fabry's last condition above is in a sense the best possible (P6Iya, 1942). Regarding the natural boundary of a power series, we also have the following result: When the radius of convergence of  a.z' is 1, by a suitable choice of the sequence {E.} (E. = :t 1), the series I:E.a.z' has Izl = 1 as its natural boundary (A. Hurwitz, P. Fatou, P61ya). 
339 E Power Series E. ()verconvergence If the radius of convergence of [(z) = L anz n is 1, the sequence of partial sums S(I,z), S(2,z), ..., where S(n, z) = Lo avz v , is naturally di- vergent for Izl > 1, but a suitable subsequence S(nk, z) (k = 1,2, ...) may still be convergent. A. Ostrowski [7,8] called this phenomenon overconvergence and proved the following re- sult: By definition,j(z) = L.%o anz)' has a lacunary structure if the sequence {A. n } has a subsequence {An,} such that Ank+1 > )'nk(1 + 0) (0 > 0). If this situation occurs, then in a suffi- ciently small neighborhood of a point on Izi = 1 where j(z) is holomorphic, S(A nk , z) (k = 1,2, '" ) converges uniformly. This result in- cludes Hadamard's gap theorem as a special case. Conversely, any power series for which overconvergence takes place can be repre- sented as the sum of a power series having a lacunary structure and a power series whose radius of convergence is greater than 1. G. Bourion [9] gave a unified theory of these results using tsuperharmonic functions. R. Jentsch [10] showed that all singularities of a power series on its circle of convergence are accumulation points of the zeros of the partial sums. On the other hand, if the zeros of a subsequence S(nk,z) (k= 1,2, ...) has no accumulation point on Izl = I, then the power series has a lacunary structure and overcon- vergence takes place for S(nk, z) (k = 1,2, .,. ). If lognk+l = O(n k ) and S(nk, z) (k = 1,2, ...) is overconvergent, then all boundary points of the domain of overconvergence are accumula- tion points of the zeros of S(nk, z) (Ostrowski [8]). A power series is completely determined by its coefficients, but little is known about the relations between the arithmetical properties of its coefficients and the function-theoretic properties of the function represented by the series. A known result is that if the power series L cnz n with rational coefficients repre- sents a branch of an talgebraic function, then we can find alJ integer y such that the C n yn (n;? 1) are all integers (Eisenstein's theorem, 1852). For power series of several variables  21 Analytic Functions of Several Complex Vari- ables; for formal power series  370 Rings of Power Series. For power series expansions  Appendix A, Table IO.IV. References For the general theory of power series, [1] E. G. H. Landau, Darstellung und Begriin- dung einiger neuerer Ergebnisse der Funk- tionentheorie, Springer, second edition, 1929. 1256 [2] P. Dienes, The Taylor series, Clarendon Press, 1931. [3] L. Bieberbach, Analytische Fortsetzung, Springer, 1955. Also  references to 198 Holomorphic Functions. For singularities, [4] S. Mandelbrojt, Les singularite., des fonc- tions analytiques representees par line serie de Taylor, Gauthier-Villars, 1932. For theorems of Tauberian type, [5] N. Wiener, Tauberian theorems, Ann. Math., (2) 33 (1932),1-100. [6] H. R. Pitt, Tauberian theorems, Oxford Univ. Press, 1958. For overconvergence, [7] A. Ostrowski, Uber eine Eigenschaft ge- wisser Potenzreihen mit unendlichvielen ver- schwinden der Koeffizienten, S.-B. Preuss. Akad. Wiss., (1921),557-565. [8] A. Ostrowski, Uber vollstiindige Gebiete gleichmiissiger Konvergenz von Folgen ana- Iytischer Funktionen, Abh. Math. Sem. Univ. Hamburg, 1 (1922),327-350. [9] G. Bourion, L'ultraconvergence dans les series de Taylor, Actualites Sci. Ind., Hermann, 1937. [10] R. Jentsch, Fortgesetzte Untersuchungen iiber die Abschnitte von Potenzreihen, Acta Math., 41 (1918),253-270. 340 (XVI1.17) Probabilistic Methods in Statistical Mechanics A. Introduction Probabilistic methods are often very useful in the rigorous treatment of the math,matical foundations of statistical mechanics and also in some other problems related to statistical mechanics. As examples of such methods, we explain here (1) Ising models, (2) Markov statistical mechanics, (3) percolation processes, (4) random Schr6dinger equations, and (5) the tBoltzmann equation. B. Ising Models The Ising model was proposed by E. Ising [1] to explain the phenomena of phase transitions of a ferromagnet, in which either a + or - spin is put on each site of a crystal lattice, and interaction between nearest neighboring sites is taken into consideration. Let V be a cube in the d-dimensional integer lattice space Zd and Xv={ +1, -I? be the totality of spin configurations in V. Each ele- 
1257 ment of X y is denoted by 0" = {O"iLEY (O"i = + 1 or -1). We suppose that a spin configuration 0" has a potential of the type Uy(O")=- I O"iO"j+hIO"i, (i.J)cy iEY where (i,j) means that (i,j) is a nearest neigh- boring pair of sites and h stands for the para- meter of an external field. A tprobability mea- sure on Xy is caIled a state on V. For each state /1, the free energy is defined by 1 F y (/1)= I /1(O")U y (O")+- I /1(O")log/1(O"), aeX y (J aeX v where P= IjkT (k is the tBoltzmann constant, T is the tabsolute temperature). Then there exists a unique state P h( exp(-PUy(O")) gy' 0") - , YEXV exp( - PUy(Y)) on V which minimizes the free energy F y (vari- ational principle). gp.h is called a Gibbs state on V of the Ising model with parameter (P, h). PhysicaIly, a Gibbs state is an equilibrium state, in which various physical quantities are calculated. Since for each Zd-homogeneous (i.e., trans- lationaIly invariant with respect to Zd) state /1 the mean free energy O"EX y , f(/1)= lim FY(/1) y _Zd !VI is well defined, a (limiting) Gibbs state can also be defined for the infinite domain V = Zd by the above mentioned variational principle. However, at present the probabilistic defi- nition of Gibbs states given by Dobrushin [2] and Lanford and RueIle [3] is prevalent. It is known that if an external field is present (i.e., h i'0) there is only one Gibbs state for any p. On the other hand when an external field is absent (h = 0) and d;::, 2, there are at least two Gibbs states, i.e., a phase transition occurs, for a sufficiently low temperature. FinaIly, we mention some known facts in this field. In the following we assume h=O. (1) In the I-dimensional case, the phase transition never occurs. (2) For d;::, 2, there exists a crit- ical value PAd) such that the phase transition does not occur for any P < Pc(d) but it occurs for every P> PAd). The calculation of PcP) has been carried out by Onsager [4]. (3) In the 2- dimensional case, every Gibbs state is Z2_ homogeneous [5,6]. (4) For d;::, 3, there is a Zd-inhomogeneous Gibbs state for sufficiently large P [7]. C. Markov Statistical Mechanics Stochastic Ising models, infinite interacting particle systems, and many models occurring 340D Probabilistic Methods in Statistical Mechanics in physics, biology, and sociology are formu- lated as a class of infinite-dimensional tMarkov processes. The field of investigation of station- ary states and statistical or ergodic properties of these processes is called Markov statistical mechanics, which has made rapid progress during the last decade. We explain this field by looking at a typical class of processes. Let Zd be the d-dimensional integer lattice space. Putting + or - spin on each site on Zd, let us consider a random motion of spins which evolves while interacting with neighboring spins. Let X = { + 1, -1 }Zd be the totality of spin configurations and an element of X be denoted by 1J={1J(i)};EZ' (1J(i) = +1 or -1). The process is described in terms of a coIlection of nonnegative functions c i (1J) defined for iEZ d and 1JEX. For the configuration 1J, at time t, 1J,(i) changes to -1J,(i) in the time interval [t,t+M] with probability ci(1J,)M+o(M). This process on X is caIled a spin-flip model. For an initial distribution /1 we denote by /1, the distribution at time t. If /1, = /1 for all t;::, 0, /1 is caIled a stationary state. Example 1. Stochastic Ising models. A sto- chastic Ising model was proposed by Glauber [8] to describe the random motion in a fer- romagnet upon contact with a heat bath. Then the flip rate {Ci(1J)} is defined by the potential of the Ising model. It is known that any Gibbs state of the Ising model is a reversible station- ary state of the stochastic Ising model, and the converse is also valid. Free energy plays an important role in the study of the ergodic properties of these models. In particular, the mean free energy is a nondecreasing functional along the distributions /1" 0  t < 00. Example 2. Contact processes. A contact process was introduced by Harris [11] to investigate the spread of infection. The flip rate of the contact process is given by { I if c i (1J)= kA if 1J(i) = +1, 1J(i)= -1 and # Ullj-il = 1, 1J(j)= +1} =k. Here +1 denotes an infected individual and -1 denotes a,healthy one. Denote by -1 the configuration at which all sites are healthy and by <>-1 the unit point mass at -1; then <>-1 is a stationary state. The most important result is the foIlowing: There exists a critical value Ad (0 < Ad < (0) such that if A < Ad 8_ 1 is a unique stationary state, and if A> Ad there is another stationary state /1 satisfying /1[1JEX; 1J(i)= +1 for infinitely many i] = 1. D. Percolation Processes A percolation process is a mathematical model which describes the random spread of a fluid 
340E Probabilistic Methods in Statistical Mechanics through a medium. It can be used to describe phenomena such as the penetration through a porous solid by a liquid or the spread of an infectious disease [14]. Usually, the process is identified as a site percolation process or a bond percolation process. Here we describe the latter only. Let L = {5, B} be a countable connected tgraph with a set of sites (tvertices) 5 and a set of bonds (tedges) B. Each bond b is open with probability P and closed with probability 1- P independently of all other bonds. Suppose that a fixed point 0 is the source of a fluid which flows from 0 along open bonds only. Let l1(p) be the probability that the fluid spreads infinitely far, and define the critical percolation probability PH=inf{pIO(p»O}. Then it is known that (1) PH= 1/2 for the square lattice [16], (2) PH=2sin(n/18) for the triangular lattice, and (3) PH= 1-2sin(n/18) for the honeycomb lattice. E. Random Schrodinger Equations Random SchrOdinger equations are tSchrodin- ger equations in R n with random potentials U(x, w); therefore the corresponding operators are of the form A(w) = -+ U(',w), where IV denotes a random parameter in a probability space (0, 3', P) and  denotes the tLaplacian in Rn. It is assumed that this sys- tem of potentials forms a spatially homoge- neous random field with the tergodic property. This system of equations is considered to be a model describing the motion of quantum- mechanical particles in a random medium. Mathematically, the problem is to investigate various spectral properties of the self-adjoint operators A(w). Since A(w) and its shifted operator A(w(' + x)) are tunitarily equiva- lent, the above assumption on the potentials U(', w) implies that the spectral structures are independent of each sample w a.s. if their structures of A(w) are measurable with respect to (O,m. Several rigorous results have been obtained. In the I-dimensional case, if the potentials U(', w) are functionals of a strongly ergodic tMarkov process, then it is proved that A(w) has only a pure point spectrum [17], and each eigenfunction decays exponentially fast [18]. In multidimensional cases, asymptotic behav- ior at the left edge of the mean of the resolu- tions of the identity for a certain A(w) have been investigated [19]. It is assumed that the random potential for this A(w) takes the form U(x,w)= r cp(x- y)n(dy,w), JR" 1258 where {n(dy,w)} is a tPoisson random mea- sure with mean measure dy and cp(x) is a non- negative measurable function satisfying cp(x) = o(lxl- n - 2 ) as Ixl-+'oo. Let E.(x,y,w) be the continuous kernel for the resolution of the identity for A(w), and denote by 1\"(A) the mean of E)(O, 0, w). Then N(A) is a nondecreasing function vanishing on ( - 00, 0) and has the following asymptotic form at A=O: A n / 2 10gN().)-+-Y7/ 2 as ).-+0, where Y1 is the first eigenvalue of ._ with a Dirichlet boundary condition on the ball in Rn with unit volume. The quantity N(A-) can be identified with a limiting state density of A(w), namely, the limit function of NV(A, w) = # UI Aj (w):o(A}/1V1 as V tends to Rn regularly, where : Aj(W)} is the set of eigenvalues of A(w) in a smooth bounded domain V in Rn with a Dirichlet boundary condition and I VI is the volume of V. To obtain the above asymptotic behavior of N(A), the theory of t1arge deviation for Mar- kov processes plays a crucial role [20]. F. Boltzmann Equation In the kinetic theory of gases the tBoltzmann equation is derived from the tLiouville equa- tion by considering the BBGKY hierarchy of particle distribution functions for N particles and then by taking the limit N -+ co under certain conditions ( 402 Statistical Me- chanics). Mathematica11y rigorom. discussions were given by O. E. Lanford [21] for a gas of hard spheres; he showed that solutions of the BBGKY hierarchy converge to those of the Boltzmann hierarchy for sma11 time under the Boltzmann-Grad limit (N-+oo, Nd 2 -+1, d= the diameter of the hard spheres). An approach to the Boltzmann equation in the spatia11y homogeneous case can also be based on a tmaster equation. M. Kac [22] considered a Poisson-like process describing the random time evolution of the n-tuple of the velocities of n particles. For a gas of hard spheres this is determined by the master equation a a-tu(t, x j, ..., x n ) = I f {u(t'X]'''''X;'''''X;'''',Xn) n 1 i<jn S2 -u(t,x), ...,X n )} l(xi-xj,I)ldl, t>O, xj,....x n ER3, (1 where 52 is the 2-dimensional unit sphere, dl is 
1259 the uniform distribution on S2 and X'l =xi+(xj-x;,I)I, xj=x j -(x j -x i ,l)1. Let (J be a positive constant and S() denote the (3n -1)-dimensional sphere with center 0 and radius . Given a symmetric probability density Un on S() for each n  1, a seq uence {un} is said to have Boltz- mann's property or to be chaotic (or u-chaotic to stress u), if there exists a probability density u on R3 such that lim f <Pl (xd... <Pm(xm)Un(Xl'.'" Xn)dXl noo S(jI!a) ...dx n = Ii r <Pk(x)u(x)dx k=l J R3 for each m 1 and <PkECo(R3), 1 km. Kac's assertion is that the Boltzmann equation is to be derived from the master equation via the propagation of chaos; more precisely, if {un} is a u-chaotic sequence, then {un(t)} is also u(t)- chaotic, where un(t) is the solution of the mas- ter equation (1) with un (0) = Un and u(t) is the solution ofthe fol1owing Boltzmann equation with u(O) = u: a a U(t'X)= f {u(t,x')u(t,y')-u(t,x)u(t,y)} t S'xR3 x l(x-y,I)ldldy, x' = x+(y-x, 1)1, y' =y-(y- x, 1)1. The propagation of chaos was verified by Kac [22], H. P. McKean [24], and others for a considerably wide class of nonlinear equations of Boltzmann type (with cutoff). The propagation of chaos is the stage correspond- ing to the t1aw of large numbers. The next stage is the tcentrallimit theorem or fluctu- ation theory, which was also discussed by Kac [23] and McKean [25]. Moreover, based on Kac's work [22], McKean [26] introduced a class of tMarkov processes associated with certain nonlinear evolution equations includ- ing the Boltzmann equation; a process of this type describes the time evol ution of the veloc- ity of a particle interacting with other similar particles. In the case of the spatially homo- geneous Boltzmann eq uation of Maxwellian molecules without cutoff, such a Markov process was constructed by solving a certain tstochastic differential equation [27]. This implies the existence of probability measure- valued solutions of the equation. References [1] E. Ising, Beitrag zur Theorie des Ferro- magnetism us, z. Phys., 31 (1925), 253- 264. 340 Ref. Probabilistic Methods in Statistical Mechanics [2] R. L. Dobrushin, The description of a random field by means of conditional proba- bilities and conditions of its regularity, Theory Prob. Appl., 13 (1968),197-224. (Original in Russian, 1968.) [3] O. E. Lanford III and D. Ruelle, Obser- vables at infinity and states with short range correlations in statistical mechanics, Comm. Math. Phys., 13 (1969),194-215. [4] L. Onsager, Crystal statistics I: A two- dimensional model with an order-disorder transition, Phys. Rev., 65 (1944), 117-149. [5] M. Aizenman, Translation in variance and insta bility of phase coexistence in the two dimensional Ising system, Comm. Math. Phys., 73 (1980), 83-94. [6] Y. Higuchi, On the absence of non- translationally invariant Gibbs states for the two-dimensional Ising model, Proc. Conf. on Random Fields, Esztergom, North-Holland, 1981. [7] R. L. Dobrushin, Gibbs state describing coexistence of phases for a three-dimensional Ising model, Theory Prob. Appl., 17 (1972), 582-600. (Original in Russian, 1972.) [8] R. J. Glauber, Time-dependent statistics of the Ising model, J. Math. Phys., 4 (1963), 294- 307. [9] R. L. Dobrushin, Markov processes with a large number of locally interacting compo- nents: Existence of a limit process and its er- godicity, Problems of Information Transmis- sion, 7 (1971), 149-164; Markov processes with many locally interacting components: The reversible case and some generalizations, ibid., 235-241. (Originals in Russian, 1971.) [to] F. Spitzer, Interaction of Markov pro- cesses, Advances in Math., 5 (1970), 246-290. [11] T. E. Harris, Contact interactions on a lattice, Ann. Probability, 2 (1974), 969-988. [12] R. Holley and T. M. Liggett, The survival of contact processes, Ann. Probability, 6 (1978),198-206. [13] T. M. Liggett, The stochastic evolution of infinite systems of interacting particles, Lecture notes in math. 598, Springer, 1977, 187-248. [14] S. R. Broadbent and J. M. Hammersley, Percolation processes I: Crystals and mazes, Proc. Cambridge Phil os. Soc., 53 (1957), 629- 641. [15] R. T. Smythe and J. C. Wierman, First- passage percolation on the square lattice, Lecture notes in math. 671, Springer, 1978. [16] H. Kesten, The critical probability of bond percolation on the square lattice equals 1/2, Comm. Math. Phys., 74 (1980), 41-59. [17] I. 1. Goldseid, S. A. MoIchanov, and L. A. Pastur, One-dimensional random Schrodinger operator with purely point spectrum, Func- tional Anal. Appl., 11 (1977), 1-10. (Original in Russian, 1977.) 
341 A Probability Measures [18] S. A. Mo1chanov, The structure of eigen- functions of one-dimensional unordered structures, Math. USSR-Izv., 12 (1978), 69- 10 l. (Original in Russian, 1978.) [19] S. Nakao, On the spectral distribution of the Schrodinger operator with random poten- tial, Japan. J. Math., 3 (1977), 111-139. [20] M. D. Donsker and S. R. S. Varadhan, Asymptotics for the Wiener sausage, Comm. Pure Appl. Math., 28 (1975), 525-565. [21] O. E. Lanford III, Time evolution of large classical systems, Lecture notes in physics 38, Springer, 1975. [22] M. Kac, Probability and related topics in physical sciences, Interscience, 1959. [23] M. Kac, Some probabilistic aspects of the Boltzmann-equation, Acta Phys. Austrica Supp. 10, Springer, 1973, 379-400. [24] H. P. McKean, Speed of approach to equilibrium for Kac's caricature of a Max- wellian gas, Arch. Rational Mech. Anal., 21 (1966),347-367. [25] H. P. McKean, Fluctuations in the kine- tic theory of gases, Comm. Pure Appl. Math., 28 (1975), 435-455. [26] H. P. McKean, A class of Markov pro- cesses associated with nonlinear parabolic equations, Proc. Nat. Acad. Sci. US, 56 (1966), 1907-1911. [27] H. Tanaka, Probabilistic treatment of the Boltzmann equation of Maxwellian molecules, Z. Wahrscheinlichkeitstheorie und Verw. Gebiete, 46 (1978),67-105. 341 (XVII.2) Probability Measures A. General Remarks A probability measure <1> on a tmeasurable space (5,6) is defined to be a imeasure on (5,6) with <1>(5)= I ( 270 Measure Theory). In probability theory, probability measure appears usually as the tprobability distribution of a trandom variable ( 342 Probability Theory). Unless stated otherwise, we regard a +topological space T as a measurable space endowed with the topological a-algebra 123(T) on T, i.e., the ta-algebra generated by the topen subsets of T. Hence the distribution of an Rn-valued random variable is a probability measure on (R", 123" = I23(R")). From this proba- bilistic background we often call a probability measure on (R", 123") an n-dimensional (proba- bility) distribution. For probability measures on topological spaces  270 Measure Theory. 1260 B. Quantities Characterizing Probability Distributions Several different quantities characterize the properties of probability distributions in one dimension: the mean (or mathematical expecta- tion) m = Joc xd<1>(x), the variance 0 2 = J'XJ'lO Ix- ml 2 d<1>(x), the standard deviation a, the kth moment CXk = J':" ", Xk d<1>(x), the kth absolute moment Ilk = J". rye I xl k d<1>(x), the kth moment about the mean J1.k = J,:",,(x -m)kd<J>(x), etc. A one-to-one correspondence exists be- tween a I-dimensional distribution <1> and its (cumulative) distribution function F defined by F(x)= <1>« -00, x]). A distribution function is characterized by the following properties: (1) It is monotone nondecreasing; (2) it IS right con- tinuous; (3) lim<-_a F(x)=O and lim x _+ w F(x) = I. Similar statements hold for the multi- dimensional case. Let X(w) be a real random variable on a tprobability space (Q, ,P). Then lhe distri- bution of X is given by <1>(E)=P({wIX(W)E E}), E E123 1 , and the characteristic quantities of <1> defined above are given in terms of X(w) as follows: m=E(X), a 2 =E(X _m)2, F(x)= P({wl X(w):s:; x}), etc. The moments and the moments about the mean are connected by the relation J1.,= LoG)CXY-d _m)k (r= 1,2, ...). When <1> is an n-dimensional distribution, the following quantities are frequently used: the mean vector, which is an n-dimensional vector whose ith component is given by m i = J x j d<1>(x); the covariance matrix, which is an n x n matrix whose (i,j)-element is aij= J(x j -m j )(x j -m)d<1>(x); the moment matrix, which is an n x n matrix whose (i,) )-element is m ij = J x j x j d<1>(x). (The covariance matrix is also called the variance matrix or the variance- covariance matrix.) The covariance matrix and the moment matrix are tpositive definite and symmetric. The quantities listed above are defined only under some integrability conditions. C. Characteristic Functions Consider a probability measure <t. defined on a measurable space (Rn, 123"), where 'Bn is the a- algebra of all tBorel sets in R". The character- istic function of <1> is the tFourier transform <p defined by <p(z)= r e j (z.X)d<1>(x), zERn, JR" (I) where (z, x) denotes the tscalar product of z and x (z, x E R"). Let X be an n-dimensional random variable with probability distribution <1> defined on a tprobability space (Q, 123, P). 
1261 Then the Fourier transform of $ is also caned the characteristic function of X, which can also be written as E(ei(z.X)) ( 342 Probability Theory). The fonowing properties playa fundamental role in the study of the relationship between probability distributions and characteristic functions: (i) the correspondence defined by (1) between the n-dimensional probability distri- bution $ and its characteristic function qJ is one-to-one. (ii) For any a p ' bpER, ap<b p (p= 1,2, ...,n), we have L. Ll f(x p ; a p ' bp)d$(x) _' (  ) n I e I e n e-ibpzp_e-iapzp -hm ... n . C--I'OO 2n -c -c p=l -lZp X qJ(ZI' ...,zn)dz 1 , ...,dz n , (2) where f(t; a, b) denotes the modified indicator function of [a, b] defined by { I, tE(a,b), f(t;a,b)= 1/2, t=aorb, 0, t<t(a,b], and x =(x 1 ,..., xn)ERn. If an n-dimensional interval 1= [aj, ... ,an; b j, .. . , b n ] defined by ai Xi b i (i = 1,2,..., n) is an interval of con- tinuity for the probability distribution $, i.e., $(J/)=O, where JI denotes the boundary of I, then the left-hand side of (2) is equal to $(/). Equation (2) is caned the inversion formula for the characteristic function qJ. The characteristic function qJ of an n- dimensional probability distribution has the fonowing properties: (i) For any points z(l), ..., z(p) of the n-dimensional space Rn and any complex numbers aI' '" ,a p ' we have p I qJ(z(j) - z(k»)aii k ?- O. j.k=1 (ii) qJ(Z(k)) converges to qJ(O) as Z(k)->O. (iii) qJ(O) = 1. A complex-valued function qJ of zERn is caned tpositive definite if it satisfies the in- equality in (i). Any continuous positive defi- nite function qJ on R n such that qJ(O) = 1 is the characteristic function of an n-dimensional probability distribution (tBochner's theorem) ( 192 Harmonic Analysis). A counterpart to Bochner's theorem holds for any positive definite sequence as wen (tHerglotz's theorem). The characteristic function is often useful for giving probability distributions explicitly. (For characteristic functions of typical probability distributions  Appendix A, Table 22. For general information about criteria that can be used to decide whether a given function is a characteristic function  [8].) The moment generating function defined by f(z) = S exp( -(z, x)) d$(x)(z E R n ) does not 341 D Probability Measures necessarily exist for an n-dimensional distri- butions but does exist for a number of useful probability distributions $, and then f(z) uniquely determines $. Given a I-dimensional distribution $ with {3k < +00, we denote by Yk the coefficient of (izNk! in the tMac1aurin expansion of 10gqJ(z). We can Yk the (kth order) semi-invariant of $. The moments and semi-invariants are con- nected by the relations 1'1 =1Xj, ')'2=1X 2 -lXi= (J2, ')'3 =1X 3 - 31X 1 1X 2 +21XL ')'4 =1X 4 - 3IXi -41X 1 1X 3 + 12IXiIX2 - 61X1,.... D. Specific Distributions Given an n-dimensional distribution $, a point a with $( {a}) > 0 is caned a disconti- nuity point of $. The set D of an discontinuity points of $ is at most countable. When $(D) = 1, $ is caned a purely discontinuous distribu- tion. In particular, if D is a lattice, $ is caned a lattice distribution. If the distribution function of $ is a continuous function, $ is caned a continuous distribution. By virtue of the tLebesgue decomposition theorem, every probability distribution can be expressed in the form $=a 1 $1 +a 2 $2+ a 3$3' aj,a 2 ,a 3 ?-0, a 1 +a 2 +a 3 =I, where $1 is purely discontinuous, $2 is tab- solutely continuous with respect to tLebesgue measure, and $3 is continuous and tsingular. Let $ be an absolutely continuous distri- bution. Then there exists a unique (up to Le- besgue measure zero) measurable nonnega- tive function f(x) (XE R n ) such that $(E)= hf(x)dx. This function f(x) is caned the probability density of $. We now list some frequently used 1- dimensionallattice distributions (for explicit data  Appendix A, Table 22): the unit distri- bution with $( {O}) = 1; the binomial distribu- tion Bin(n,p) with parameters nand p; the Poisson distribution P(A) with parameter A; the geometric distribution G(p) with parameter p; the hypergeometric distribution H(N,n,p) with parameters N, n, and p; and the negative binomial distribution NB(m, q) with parameters m and q. The fonowing k-dimensionallattice distributions are used frequently: the multi- nomial distribution M(n,p) with parameters n and p; the multiple hypergeometric distribution; the negative multinomial distribution; etc. The fonowing I-dimensional distributions are absolutely continuous: the normal distri- bution (or Gaussian distribution) N (fl, (J2) with mean fl and variance (J2 (sometimes N(O, 1) is caned the standard normal distribution); the 
341 E Probability Measures Cauchy distribution C(J1, 0") with parameters J1 (tmedian) and 0"; the uniform distribution U(cx, fJ) on an interval ['X, [3J; the exponen- tial distribution e(O") with parameter 0"; the gamma distribution f(p, 0"); the tx 2 distribu- tion x 2 (n); the beta distribution B(p, q); the F- distribution F(m, n); the Z-distribution Z(m, n); the t-distribution t(n); etc. Furthermore, there are several k-dimensional absolutely continu- ous distributions, such as the k-dimensional normal distribution N(J1,.[) with mean vector J1=(J11,J12, ...,J1d and covariance matrix.[= (O"i), the Dirichlet distribution, etc. E. Convolution Given any two n-dimensional distributions <1>[, <1>2' the n-dimensional distribution <1>(£) = JR"XE(X+ y)d<1> 1 (x)d<1>2(Y) is called the com- position (or convolution) of <1>1 and <1>2 and is denoted by <1>1 * <1>2' where XE is the Indicator function of the set E. Let XI and X 2 be tinde- pendent random variables with distributions <1>, and <1>2' Then the distribution of XI +X 2 is <1>, * <1>2' When F i is the distribution function of <1>; (i = 1,2), the distribution function F 1 * F 2 of <1>1 *<1>2 is expressed in the form FI *F2(X)= JR' FI (x - y)dF 2 (y). If <1> I has a density fl (x), then <1>, * <1>2 has a density f(x) = JR,fl (x- y)dF 2 CI'). If <p(z) is the characteristic function of the convolution of two proba bility distri- butions <1>, and <1>2 with characteristic func- tions <PI and <P2' then <P is the product of <PI and <P2: <p(z) = <PI (z) . <P2(Z). Therefore, for every k, the kth order semi-invariant of the convo- lution of two distributions is equal to the sum of their kth order semi-invariants. Suppose that we are given a family of distributions <1> = {<1>(cx, [3, ...)} indexed with parameters cx,{J,.... Iffor (rx 1 JJ[,...) and (rx 2 ,fJ2"") there exists ('X 3 ,fJ3'''') such that <1>(rx p [3I' ...)* <1>('X 2 , {J2,''') = <1>(rx 3 , fJ3'''')' then we say that <1> has a reproducing property. Some of the distri- butions listed above have the reproducing property: P(A I )*P(A2)=P(A'1 +,12), Bin(n 1 ,p)* Bin(n 2 ,p)=Bin(n l +n 2 ,p), NB(mj,q)* NB(m 2 ,q)=NB(m l +m 2 ,q), N(J1j,O"D* N(J12' 0"5:)= N(J11 + J12, O"f +0"5:), f(PI' 0")* f(P2' 0-) = f(p I + P2' 0"), C(J1I' 0" I) * C(J12' 0"2) = C(J1, + J12, 0"1 +0"2), etc. Given a I-dimensional distribution function F(x). QfU)= max (F(x+/)-F(x-/», 1>0, - 00 <x< 00 is called the maximal concentration function of F (P. Levy [6J). Since it satisfies the relation QF,*F2(/)QF,(l) (i= 1,2), we can use it to study the properties of sums of independent 1262 random variables. The mean conc'ntration function defined by 1 f T CF(l)=- (F(x+/)-F(x-/»2dx 2/ -oc is also useful for similar purposes. Let N (m, v) be the l-dimension2l normal distribution with mean m and vanance v, and let P()" a) be the distribution obtained through translation by a of the Poisson distribution with parameter X If I-dimensional distri- butions <1>b 'Pk (k = 1,2) exist such that N(m, v) = <1>, * <1>2' P()., a) = 'P 1* 'P 2' we have <1>k = N(mk' v.), 'Pk = P()'bad (k= 1,2) for some m k , L\, Ab a k (k = 1,2). These are known, respec- tively, as Cramer's theorem and Raikov's theorem. Yu. V. Linnik proved a similar fact (the decomposition theorem) for 2 more gen- eral family with reproducing property by using the theory of analytic functions [9]. F. Convergence of Proba bility Distributions The concept of convergence of distributions plays an important role in limit theorems and other fields of probability theory. When Q is a topological space, we consider cOllvergence of probability measures on Q with respect to the tweak topology introduced in the space of measures on Q ( 37 Banach Spaces). Such convergence is called weak convergence in probability theory. For a sequence of n- dimensional distributions <1>k (k = 1,2,...) to converge to <1> weakly, each of the following conditions is necessary and sufficient. (I) For every continuous function with compact sup- port, limkoc,JR,f(x)d<1>dx)= JR,f(x)d<1>(x). (2) At every continuity point of the distribution function F(x" ... ,x n ) of <1>, limka: Fk(x l , '" ,x n ) = F(x[, ..., x n ) (F k is the distribution func- tion of <1>d. (3) For every continui ty set E of <1> (namely, a set such that <1>(£ - EO) = 0), limkoc <1>dE) = <1>(£). (4) For all open G eRn, liminfkoc <1>k(G)?:<1>(G). (5) For all closed Fe Rn, lim SUPkoc <1>k(F)  <1> (F). (6) limkoo P(<1>b <1» =0, where p is a metric defined in the follow- ing way: Given any n-dimensional distribu- tions <1>[, <1>2, we put eij= inf{el <1»F) < <1>i(F')+ e for every closed F} (F' is the e-neighborhood of F) and define P(<1>I,<1>2)=max("12,f. 2 d. The metric p, called the Levy distance. was intro- duced by Levy [6J in one dimension and by Yu. V. Prokhorov in metric spac:s [10]. Each of these conditions except (2) is still necessary and sufficient for <1>. to converge weakly to <1> when <1>n and <1> are probability measures on a tcomplete separable metric space It should also be noted that the probability measures on a complete separable metric space consti- 
1263 tute a complete separable metric space with respect to the Levy distance. A family $.(IXEA) of probability measures on a complete separable metric space is said to be tight if for every e > 0 there exists a tcom- pact set K =K(e) such that $.(KC)<e for all IXEA. A family $. (IXEA) is tight if and only if it is ttotally bounded with respect to the topol- ogy induced by the Levy distance. Hence a tight family $. (IX E A) has a weakly convergent su bseq uence. We can give a criterion for the convergence of pro bability measures in terms of their char- acteristic functions. Suppose that $k and $ are n-dimensional probability measures with char- acteristic functions <Pk and <po Then $k con- verges weakly to $ if and only iff or every z, Iim k <Pk(Z) = <p(z). Let <Pk be the characteristic function of an n-dimensional probability dis- tribution $k' If the sequence {<pd converges pointwise to a limit function <P and the conver- gence of <Pk is uniform in some neighborhood of the origin, then <P is also the characteristic function of an n-dimensional probability dis- tribution $ and the sequence {$k} converges weakly to $ [7] (Levy's continuity theorem). For any probability distribution concen- trated on [0, (0), the use of tLaplace trans- forms as a substitute for Fourier transforms provides a powerful tool. The method of probability generating functions is available for the study of arbitrary probability distri- bution concentrated on the nonnegative inte- gers [14]. The method of moment-generating functions is also useful. There are many results on the relation between these functions, proba- bility distributions, and their convergence [14- 16]. Let $1' $2"" and $ be I-dimensional distributions. If all absolute moments exist, '2:.r;1 p j - 1 Jj = 00 for Pj= Joo I x I j d$(x) < 00, and IimkooJooxjd$dx)= Jooxjd$(x) (j= 0, 1,2, ...), then $k weakly converges to $. This condition is sufficient but not necessary. G. Infinitely Divisible Distributions An n-dimensional probability distribution $ is called infinitely divisible if for every posi- tive interger k, there exists a probability dis- tribution $k such that $ = $k * $k * ... * $k (= $t k ). Both normal distributions and Pois- son distributions are infinitely divisible. If an n-dimensional distribution $ satisfies the con- dition Slxl>,$(dx)<e, we say that $Ev(e). Then $ is an infinitely divisible distribution if and only iffor every E > 0 we can find $1' $2' ... , $k E v(e) such that $ = $ 1 * $2 * ... * $k' 341 G Probability Measures Let X ki , i = 1,2, ..., n(k), be independent random variables for every k, and assume that the distribution of X ki belongs to v(ed, i= 1,2, ...,n(k), where ekO as koo. Ifthe probability distributions of the sums X k = '2:.7!"kl X ki converge to a probability distribu- tion as k 00, then the limit distribution is infinitely divisible. The characteristic function of a I-dimen- sional infinitely divisible distribution can be written in the form <p(Z) =exP(irz-z2 + f oo A(u, z) 1 :2 U2 dG(U)). (3) where')' is a constant, v is a nonnegative con- stant, G(u) is a nondecreasing bounded func- tion with G(-oo)=O, A(u,z)=exp(iuz)-I- izuj(1 +u 2 ), and the value of A(u,z)(1 +U 2 )ju 2 at u=O is defined to be -z2j2. Formula (3) is called Khinchin's canonical form. For the char- acteristic function of an infinitely divisible n- dimensional distribution, the canonical form is as follows: <P(Z)=exP(i(m,Z)- P.1 CpqZpZ q +f ( eiCz.X)_I_ i (Z,X)2 ) n(dX) ) , JR" 1 + Ixl Z=(ZI' ...,z,)ER', (4) where mER', (c pq ) is a positive semi definite matrix, and n(dx) is a measure on R' such that n( {O} )=0 and 1 Ixl2 -----zn(dx) < 00. R" 1+lxl Formula (4) is called Levy's canonical form. If a I-dimensional infinitely divisible distribution $ satisfies SRI x 2 d$(x)< 00, then its character- istic function is given by <P(Z)=eXP(imZ-Z2 f oo. 1 ) + (e'Z" -1- izu)zdK(u) , (5) -00 u where m is a real constant, v is a nonnega- tive constant, and K(u) is a nondecreasing bounded function such that K( -(0)=0. It is called Kolmogorov's canonical form. (For infinitely divisible distributions on a homo- geneous space  5 Additive Processes.) Let $ and '¥ be n-dimensional distributions. If for some ,bO, '¥(E)=$(AE) (AE= {A I E E} ) for every set E, we say that $ and '¥ are equivalent. Let $ and '¥ be probability distri- butions with distribution functions F and G 
341 H Probability Measures and characteristic functions (p and 1jJ. Then the following three statements are equivalent: (1) <P and 'P are equivalent; (2) G(x)=F(icx) for every x; and (3) IjJ(AZ) = <p(z) for every z. We call <P a stable distribution if for every pair of distributions <Ph <P 2 equivalent to <P, the con- volution <PI * <P 2 is equivalent to <P. If <P is stable, every distribution equivalent to <P is also stable. We can characterize stable distri- butions in terms of their characteristic func- tions <p(z) as follows: For every pair ),[,1. 2 > 0, there exists a ic=ic(A 1 J2»0 such that <p(icz) = <p(A l Z)<p(A 2 Z). We can restate this characteri- zation as follows: <P is stable if and only if for every pair of independent random vari- ables XI and X 2 with identical distribution <P and for any positive numbers ;-1 and 1. 2 , there exists a positive number X such that (A[ Xl + ;-2X2)j)- has the distribution <P. By the defi- nition we see that all stable distributions are infinitely divisible. In the I-dimensional case, putting <p(z) = expljJ(z), we have 1jJ()_z) = IjJ (I. 1 z)+IjJ()-2z), which implies ljJ(z)=( -co+i(z/Izl)cdlzl"' where Co ;;'0, -00 < CI < 00, ° <:x:S; 2. The parameter:x is called the exponent (or index) of the stable distribution. The stable distribu- tions with exponent :x = 2 are the normal dis- tributions, and the stable distributions with exponent :x = 1 are the Cauchy distributions. We have ljJ(z) = -colzl" for a symmetric stable distribution. (For the stable distribution with exponent 1/2  Appendix A, Table 22). Generalizing stable distributions, we can define quasistable distributions, which B. V. Gnedenko and A. N. Kolmogorov [17J called stable distributions also. Let F be the distri- bution function of a I-dimensional distri- bution <P. <P is said to be quasistable if to every b [ > 0, b 2 > ° and real AI, 1. 2 there corre- spond a positive number b and a real number I. such that we have the relation F((x -).d/bd* F((x- ic 2 )/b 2 ) = F(x- A)/b). Let {XJ be a sequence of independent ran- dom variables with identical distribution. If for suitably chosen constants An and Bn the distri- butions of the sums Bn-[(l:71 X i )- An converge to a distribution, the limit distribution is a quasistable distribution (Levy). A necessary and sufficient condition for a distribution to be quasistable is that its characteristic func- tion <p(z) satisfy the relation <p(b 1 z)<p(b 2 z) = <p(bz)e iYz (')' = I. - A 1 - 1. 2 ), The characteristic function of a quasistable distribution has the canonical representation <p(z) = exp ljJ(z), ljJ(z) = imz- clzl 7 (1 + iP(z/lzl)w(z, :X)), where m is a real number, c;;. 0, ° <:x :S; 2, IPI:S; I, and w(z,:x) = tan(n:x/2) (:x =1-1), w(z,:x) = 1264 (2/n)loglzl (:x = 1). The parameter:x is called the exponent of the quasistable distribution. A quasi stable distribution with IX =I- 1 is obtained from a stable distribution by tran;lation, but quasistable distributions with :x = 1 are not. Semistable distributions are another gen- eralization of stable distributions. A distri- bution is called semistable if its characteristic function <p(z) satisfies the relation ljJ(qz) = q71jJ(Z) for a positive number q (=1-1), where <p(z) = exp(ljJ(z)). Also in this case, the general form was obtained by Levy [6]. A I-dimensional probability distribution <P is called an L-distribution if the di;tribution function F of <P is the convolutior. of F(x/a) and some other distribution function Fa(x) for every 0< a < 1. <P is an L-distribution if and only if there exists a sequence of independent random variables {X k } such that for suitably chosen constants Bn > ° and An the distri- butions of the sums Bn- I (l:Z=[ X k ) - An con- verge to <P and SUPI <;;k<;;n P(IXk/Bnl > f.)-+O as n-> 00 for every f. > 0. Quasistable distribu- tions are L-distributions. H. The Shape of Distributions Let F(x) be a I-dimensional distn bution func- tion. The quantity (p such that F((p-O):S;p:S; F (( p) (0 < p < 1) is called the quan1 He of order p of F. In particular, the quantity ([/2 is called a median. If F satisfies the relation 1 - F(m + x) =F(m-x), it is called symmetric. In any 1- dimensional symmetric distribution, every moment of odd order about the mean (if it exists) is equal to zero. The ratio ')'[ = /l3/(J3 is used as a measure of departure from symmetry of a distribution and is called the coefficient of skewness. Further- more, the ratio ')'2 = /l3/(J4 - 3 is called the coefficient of excess. For the normal distri- bution, we have ')'[ = 12 = 0. If 12 =I- 0, ')'2 ex- presses the degree of deviation from the nor- mal distribution. A distribution function F(x) is called uni- modal if there exists one value x =, a such that F(x) is convex for x<a and concave for x>a. An L-distributions (and hence quasistable distributions) are unimodal [18]. I. Kolmogorov's Extension Theorem Let Q = RT, where T is an arbitrary index set. We associate with R T the (J-algebra I!3 T gen- erated by the cylinder sets, i.e., {WEQ I n'l (W)E El' ..., n,JW)E En}, where n,(cv) de.notes the tth coordinate ofcv, EkEI!3(Rl), 1 :S;k:S;n, t l <t 2 < ... < tn, and n = 1,2, .,. . Given a probability 
1265 measure ct> on (R T,  T), we can define a finite- dimensional tmarginal distribution ct>s for any finite subset 5 of T by ct>s(E)=ct>(7ri1(E»), EE s, where 7rs is the natural tprojection 7r s : R T ->R s . The measures {ct>s} satisfy the follow- ing consistency condition: If 51 C 52 ( c T) are finite and if EEs" then ct>s, (E) = ct>s2(7ri'2(E»), where 7rSlS2:Rs2->Rsl is the natural projection. Conversely, if we are given a family of finite- dimensional probability measures {ct>s} which satisfies the consistency condition (6), then Kolmogorov's extension theorem [1] asserts that there exists a unique probability measure ct> on (R T , T) such that ct>s(E)=ct>(7ri1(E)), E Es, for any finite 5 cT. This theorem is useful in constructing tsto- chastic processes. For example, let ct>""2....'" be the tproduct measure of k copies of a given probability measure ct> on R 1. Then the family {ct>'I"2....',,;n 1 ,n 2 , ...,nkEZ,kEN} satisfies the consistency condition and hence, by Kolmo- gorov's extension theorem, determines a prob- ability measure on R Z , which is denoted by ct>z. Thus X,(W) = 7r,(w), nEZ, (wE(RZ,Z,ct>Z)), are independent identically ct>-distributed ran- dom variables. Kolmogorov's extension theorem is gen- eralized to the case where the component spaces are tstandard measurable spaces ( 270 Measure Theory) instead of R 1, and also to the case where product spaces are replaced by tprojective systems [19]. J. Characteristic Functionals on Infinite- Dimensional Spaces Contrary to the finite-dimensional case, Bochner's theorem does not necessarily hold in infinite-dimensional spaces. For example, let (T, 11'11) be an infinite-dimensional tHilbert space and cp(t) =exp( -lltI1 2 ). cp is continuous and positive definite, and cp(O) = 1. But it is known that there is no probability measure on T= T* (topological dual of T) which corre- sponds to cp. Bochner's theorem is generalized to infinite-dimensional spaces as follows. Let T be a real tvector space endowed with the topology i defined by a system of tHilbert- ian semi norms {II'II.,IXEA}. Define a new topology I(i) of T by all Hilbertian semi- norms 11'11 each of which is HS-dominated by some 11'11., IXE A, i.e., sup{ (;IIejI12)1/2; {e j }: IX- orthonormal} < 00. If I(i)= i, then (T, i) is called a tnuclear space. Let '4* be the topological dual of (T, t) (i.e., the set of all i-continuous real valued linear functionals on T). Define a Borel structure 341 Ref. Probability Measures ('4*) of '4* as the (J-algebra generated by the system of half-spaces {XE '4*; x(t)<a}, tE T, aER 1 . For a probability measure ct> on ('4*, ('4*)), define cp(t)= f exp(i<x,t»)dct>(x), tET, T (6) which is called the characteristic functional of ct>. A functional cp on T is the characteristic functional of a probability measure ct> on ('4*, ('4*)) such that ct>*(U, 1;.;) = 1 for some sequence {IX,} c A, where ct>* is the touter measure ( 270 Measure Theory) and 1;.* is the topological dual of (T, 11'11.), if and only if cp is positive definite, cp(O) = 1, and continuous with respect to the topology I(i) [23]. As special cases of the foregoing theorem, we have the following. If (T, t) is a nuclear space, then every positive definite i-continuous functional cp with cp(O) = 1 is the characteristic functional of a probability measure on '4* (Minlos [24]). Schwartz's spaces Y'(R') and (R') are nuclear. Let (T, i= 11'11) be a Hilbert space. A tHilbert-Schmidt operator U is, by definition, a tbounded linear operator on T such that dl Ue i l1 2 < 00, by any tcomplete orthonormal system {e i } (this quantity does not depend on the choice of {e j }). Define a seminorm Il'IIu by Iltll u = II Utll, tE T, for a Hilbert-Schmidt operator U. Then the topol- ogy I(t) coincides with the topology induced by the system of semi norms 11'11 u, where U are Hilbert-Schmidt operators, which is called the Sazonov topology. Thus every functional cp on T, which is positive definite, cp(O) = 1, and continuous with respect to the Sazonov topology, is the characteristic functional of a probability measure on '4* (Sazonov [25]). The probability measure on Y" with the characteristic functional exp( - J  00 If(s)1 2 ds), fEY', is the probability measure of a Gaussian twhite noise on Y". References [1] A. N. Kolmogorov, Grundbegriffe der Wahrscheinlichkeitsrechnung, Springer, 1933; English translation, Foundations of the theory of probability, Chelsea, 1950. [2] H. Cramer, Mathematical methods of statistics, Princeton Univ. Press, 1946. [3] J. L. Doob, Stochastic processes, Wiley, 1953. [4] M. M. Loeve, Probability theory, Van Nostrand, third edition, 1963. [5] W. Feller, An introduction to probability theory and its applications II, Wiley, 1966. [6] P. Levy, Theorie de l'addition des vari- ables aleatoires, Gauthier-Villars, 1937. 
342 A Probability Theory [7J P. Levy, Ca1cul des probabilites, Gauthier- Villars, 1925. [8J E. Lukacs, Characteristic functions, Haf- ner, 1960. [9J Yu. V. Linnik, Decomposition of proba- bility distributions, Oliver & Boyd, 1964. (Original in Russian, 1960.) [10J Yu. V. Prokhorov, Convergence of ran- dom processes and limit theorems in proba- bility theory, Theory of Prob. Appl., 1 (1956), 155-214. (Original in Russian, 1956.) [11J L. LeCam, Convergence in distribution of stochastic processes, Univ. California Publ. Statist., 2 (1957),207-236. [12J K. R. Parthasarathy, Probability mea- sures on metric spaces, Academic Press, 1967. [13J P. Billingsley, Convergence of probability measures, Wiley, 1968. [14J W. Feller, An introduction to probability theory and its applications I, Wiley, second edition, 1957. [15J J. A. Shohat and J. D. Tamarkin, The problem of moments, Amer. Math. Soc. Math. Surveys, 1943. [16J D. V. Widder, Laplace transform, Prince- ton Univ. Press, 1941. [17J B. V. Gnedenko and A. N. Kolmogorov, Limit distributions for sums of independent random variables, Addison-Wesley, 1954. (Original in Russian, 1949.) [18J M. Yamazato, Unimodality of infinitely divisible distribution functions of class L, Ann. Prob., 6 (1978), 523-531. [19J S. Bochner, Harmonic analysis and the theory of probability, Univ. of California Press, 1955. [20J E. Nelson, Regular probability measures on function space, Ann. Math., (2) 69 (1959), 630-643. [21J I. M. Gel'fand and N. Ya. Vilenkin, Generalized functions IV, Applications of harmonic analysis, Academic Press, 1964. (Original in Russian, 1961.) [22J Yu. V. Prokhorov, The method of charac- teristic functionals, Proc. 4th Berkeley Symp. Math. Stat. Prob. II, Univ. of California Press, 1961,403-419. [23J A. N. Kolmogorov, A note on the papers of R. A. Minlos and V. Sazonov, Theory of Prob. Appl., 4 (1959), 221-223. (Original in Russian, 1959.) [24J R. A. Minlos, Generalized random pro- cesses and their extension to a measure, Selec- ted Transl. Math. Stat. and Prob., Amer. Math. Soc., 3 (1962), 291-313. (Original in Russian, 1959.) [25J V. Sazonov, A remark on characteristic functionals, Theory of Prob. Appl., 3 (1958), 188-192. (Original in Russian, 1958.) [26J M. Kac, Probability and related topics in physical sciences, Interscience, 1959. 1266 342 (XVII.1) Probability Theory A. History The origin of the theory of probability goes back to the mathematical problems connected with dice throwing that were discussed in letters exchanged by B. Pascal and P. de Fer- mat in the 17th century. These problems were concerned primarily with concepts such as tpermutations, tcombinations, and tbinomial coefficients, whose theory was esta blished at about the same time [1]. This elementary theory of probability was later enriched by the work of scholars such as Jakob Bnnoulli [2J, A. de Moivre [3], T. Bayes, L. de Buffon, Daniel Bernoulli, A. M. Legendre, and J. L. Lagrange. Finally, P. S. Laplace completed the classical theory of probability in his book Theorie analytique des probabilites (1812). In this work, Laplace not only systematized but also greatly extended previous important results by introducing new methods, such as the use of tdifference equations and tgenerat- ing functions. Since the 19th century, the theory of probability has been extensively applied to the natural sciences and even to the social sciences. The definition of a priori probability due to Laplace provoked a great deal of argument when it was applied. For example, R. von Mises advocated an empirical theory of proba- bility based on the notion of Kollektiv (col- lective), which is a mathematical model of mass phenomena [5]. However, these argu- ments are concerned with philosophical rather than mathematical aspects. Nowadays, the main concern of mathematicians lies not in the intuitive or practical meaning of probability but in the logical setup governing probability. From this viewpoint the mathematical model of a random phenomenon is given by a proba- bility measure space (Q, 1!3, P), whe re Q is the set of all possible outcomes of the phenom- enon, P(E) represents the probability that an outcome belonging to E be realized, and 1!3 is a (J"-algebra consisting of all sets E for which P(E) is defined. All probabilistic concepts, such as random variables, independence, etc., are defined on (Q, 1!3, P) in terms of measure theory. Such a measure-theoretic basis of probability theory is due to A. Kolmogorov [6J, though similar consideratiom had been made before him for special probkms, for example, in the work of E. Borel concerning the strong law of large numbers [7J and in the rigorous definition of Brownian motion by N. Wiener [8]. Ever since probability theory was given 
1267 solid foundations by Kolmogorov, it has made tremendous progress. The most important concept in today's probability theory is that of tstochastic processes, which correspond to functions in analysis. In applications a sto- chastic process is used as the mathematical model of a random phenomenon varying with time. The fonowing types of stochastic pro- cesses have been investigated extensively: tadditive processes, tMarkov processes and tMarkov chains, tmartingales, tstationary processes, and tGaussian processes. tBrownian motion and tbranching processes are impor- tant special stochastic processes. In the same way as functions are often defined by differen- tial equations, there are stochastic processes which can be defined by tstochastic differen- tial equations. The theory of stochastic pro- cesses and stochastic differential equations can be applied to tstochastic control, tstochas- tic filtering, and tstatistical mechanics. The tergodic theory that originated in statistical mechanics is now regarded as an important branch of probability theory closely related to the theory of stationary processes. B. Probability Spaces Let 0 be an tabs tract space and  be a t(J- algebra of subsets of O. A probability measure (or probability distribution) over O() is a set function peE) defined for EE and satisfying the fonowing conditions: (PI) peE) o; (P2) for every sequence {En} (n = 1,2, ...) of pairwise disjoint sets in , p( Y En) = P(En); (P3) P(O) = 1. The triple (0, ,P) is caned a probability space. The space 0 (resp. each element w of 0) is caned the basic space, space of elementary events, or sample space (resp. sample point or elementary event). We say that a condition s(w) involving a generic sample point w is an event; in particular, it is caned a measurable event or random event if the set E of an sample points satisfying s(w) belongs to . We assume that an event is always a measurable one, since we encounter only measurable events in the theory of probability. Because of the obvious one-to-one corre- spondence between measurable events and - measurable sets (i.e., the correspondence of each event s with the set E of an sample points w satisfying s), a -measurable set itself is frequently caned an event. If s is an event and E is the -measurable set corresponding to s, we can peE) or Pres) the probability that the event s occurs, i.e., the probability of the event E. The complementary event (resp. impossible 342B Probability Theory event, sure event) is the complementary set g (empty set 0, whole space 0). For a finite or infinite family {E,d (AE A), the sum event (resp. intersection or product event) of E;. is the set U;.E;. (n;.E;.). If EnF= 0, then we say that E and Fare mutuany exclusive or that they are exclusive events. By the definition of P, we have OP(E) 1 for any event E, P(0)=0, and P(O)= 1. Moreover, if {En} (n = 1,2, ...) is a sequence of pairwise exclusive events and E is the sum event of En' we have 00 peE) = I PeEn)' n=l This property is cal1ed the additivity of proba- bility. If peE) = 1, the event E is said to occur almost certainly (almost surely (abbrev. a.s.), for almost all w, or with probability 1). Given a finite sequence {En} (n = 1,2, ..., N) of events, we say that the events En (n = 1,2, "', N) are mutual1y independent or that the sequence {En} (n = 1,2, ... , N) is independent if every subsequence satisfies k P(E;, n E;2 n... n E;.) = n peE;). j=l Given an infinite family {E;.} (A E A) of events, we say that the events E;. (AEA) are mutual1y independent or that the family {E;.} (AEA) is independent if every finite subfamily is inde- pendent. The concept of independence of events can be generalized to a family {;.} (AEA) of (J-subalgebras of  as fonows. A family {;.} (AEA) of (J-subalgebras of events is said to be independent iffor every choice of E;.E;., the family {E;.} (AEA) of events is independent. For a sequence {En} (n = 1,2, ...) of events, the sets limsuPnEn and liminfnEn are caned the superior limit event and inferior limit event, respectively. The stiperior limit event (inferior limit event) is the set of an w for which in- finitely many events among En (an events except finitely many En) occur. Therefore P(lim inf n En) is the probability that infi- nitely many events among En occur, and P(1imsuPn En) is the probability that the events En occur for an n after some number no, where no depends on w in general. The Borel-Cantelli lemma, which is concerned with the evaluation of P(lim sUPn En), reads as fonows: Given a sequence {En} (n = 1,2, ... ) of events, we have (i) whether the events En (n = 1,2, ...) are mutu- any independent or not, n PeEn) < 00 implies that P(1im sUPn En) = 0; and (ii) if the events En (n = 1,2, ...) are mutuany independent, nP(En)= 00 imples that P(limsuPn En) = 1. Frequently, applications of part (ii) are greatly hampered by the requirement of independence; a number of sufficient conditions for depen- 
342 C Probability Theory dent events to have the same conclusion as (ii) have been discovered. The Chung-Erdos theorem [9J is quite useful in this connection. C. Random Variables Let (O.IB, P) be a probability space. A random variable is a real-valued function X defined on o that is IB-measurable (i.e., for every real number a, the set {wIX(w)<a} is in IB). If Xl' X 2 , ..., X n are random variables, the map- ping X =(Xj.X 2 , ...,X n ) from 0 into Rn is said to be an n-dimensional (or Rn-valued) random variable. More generally, a mapping X from (0, IB) into another tmeasurable space (S, G:) is called an (S, G:)-valued random variable if it is measurable, that is, for every set A of G:, the set {WIX(W)EA} belongs to lB. Let IB I be the u-algebra of all tBorel subsets of the real line R. Then each random variable X induces a probability measure <1> on (R, 1B1) such that <1>(A) = P( {w I X(w)EA}), AEIBI. The measure <1> is called the (I-dimensional) probability distribution of the random variable X or simply the distribution of X. The point function F defined by F(a)=P({wIX(w):S;a}), aER, is a monotone nondecreasing and right con- tinuous function such that lima_cxc F(a) = 1, lima_ -cxc F(a) = 0. The function F is called the cumulative distribution function (or simply the distribution function) of the random variable X. Similarly, an n-dimensional random variable X =(X"..., X n ) induces its n-dimensional prob- ability distribution (or simply n-dimensional distribution) and its n-dimensional distribution function F(a" ..., an)=P( {w I Xdw) :S;aj,"', Xn(w), an}). If the X n (n = 1,2, ''', N) are k n - dimensional random variables (11 = 1,2, ... , N), we say that the I( = =l kn)-dimensional ran- dom variable X = (Xl' X 2 , ..., X N ) is the joint random variable of X n (n = 1,2,..., N) and that the (I-dimensional) distribution <1> of X is the joint distribution (or simultaneous distribution) of X n (n = 1,2, ... , N). On the other hand, the k -dimensional distribution <1>n of X n is called the marginal distribution of the I-dimensional distribution <1>. Given a finite sequence {X n } (n= 1,2, ...,N) of random variables, if the relation P( {w I Xn(W)EAn (n= 1,2,..., N)}) N = TI P({wIXn(w)EAn}) n=l holds for every choice of I-dimensional tBorel sets An (n = 1, 2, "', N), we say that the random variables X n (n = 1,2, "', N) are mutually inde- 1268 (1 ) pendent or that the sequence {X n ) (11= 1,2, "', N) is independent. Given an infinite family {X.!} (AEA) of random variables, we say that the random variables are mutually independent or that the family is independent if every finite subfamily is independent. The latter definition of independence of random vari- ables is compatible with the previous defini- tion of independence of u-subalgebras of IB; i.e., if IB[X.!J denotes the u-subalgebras of IB generated by the sets {wIX.!(WIEA.!), with A.! an arbitrary I-dimensional Bore! set, the independence of the family {X.!} ().EA) in the latter sense is equivalent to the independence of the family {IB[X.!J} (AEA) in the previous sense. If the X n (n = 1,2, "', N) (X.l (},EA)) are k n - (k.!-) dimensional random variables, then the independence of the family {X n } (n= 1,2, "', N) ({X.!} (AEA)) is defined similarly; it is enough to take k n - (kd dimensional Borel sets An (A.!) for I-dimensional Borel sets in equation (1). Given a family {X.!} (},EA) ofr.mdom vari- ables, the smallest u-algebra with respect to which every X.! is measurable is called the u- algebra generated by {X.!} (AEA) and is de- noted by IB[X.!I).EA]. Each element of this class is said to be measurable with respect to the family {X.!} (AEA) of random variables. Since a random variable X is a IB- measurable function, we can speak of the tintegral of X relative to the meawre P on lB. If X is integrable relative to P, the integral of X over A is denoted by E(X; A). E(X; 0), usu- .ally denoted by E(X), is called tht mean, ex- pectation, or expected value of X, denoted also by M(X) or mx. If(X -E(X))2 is mtegrable, V(X)=E((X _E(X))2) is called the variance of X, denoted by u 2 (X). The standard deviation of X is the nonnegative square root u(X) of the variance. If X and Y are two random variables for which E((X- E(X))(Y - E(Y))) exists, the value E((X- E (X)) (Y - E( Y))) is called the covariance of X and Y. When X and Y have finite variances, the correlation coefficient of X and Y is defined by E(X -E(X))(Y-E(Y))) pIX, Y) {E((X _ E(X))2)E((Y _ E(y))2)} '/2' It follows that E(aX +bY)=aE(X)+bE(Y), V(aX +b)=a 2 V(X) for any real numbers a, b, and that, in particular, E(XY)==E(X)E(Y), V(X + Y)= V(X)+ V(Y) for mutually indepen- dent random variables X and Y. It also fol- lows from the definition that -1, pIX, Y):S; 1 in all cases. The independence of X and Y implies that pIX, Y)=O, but the converse is false in general. The variance is important because of the well-known Chebyshev inequal- 
1269 ity: If X is a random variable with finite vari- ance (J2, P(IX -E(X)I;;'c)(J2/C2 for every positive number c. D. Convergence of Random Variables If P(1im n _", X n = X",) = 1, the sequence {X n } is said to converge almost everywhere (almost certainly, almost surely (a.s.), or with proba- bility 1) to X",. If lim n _", P(I X n - X", I> e) = 0 for every positive number e, the sequence {X n } is said to converge in probability to X",. For a given positive number p the sequence {X n } is said to converge in the mean of order p to X", if lim n _", E(I X n - X", IP) = O. Finally, if the ran- dom variables X n (n = 1, 2, .., , CXJ) have distri- butions <1>n (n = 1,2, "', CXJ), respectively, and if ! f oo f(x)d<1>n(x) = f: f(x) d<1>", (x) for every continuous function f with compact support, the sequence {X n } is said to converge in distribution to X"" Note that the sequence of random variables converging in distribution may not converge in any ordinary sense. For example, random variables converging in distribution may even be defined on different probability spaces. On one hand, almost sure convergence does not in general imply conver- gence in the mean. On the other hand, either almost sure convergence or convergence in the mean implies convergence in probability, and convergence in probability implies conver- gence in distribution. However, P. Levy [10] proved that if the X n (n = 1,2, ...) are mutually independent, the sequence k Y,,= L X n , k= 1,2, "', n=l is convergent almost everywhere if and only if it is convergent in distribution (or in proba- bility). The famous three-series theorem of Khinchin and Kolmogorov [11] claims that the series nXn with XI' X 2 ,... independent is convergent almost surely if and only if there exists a sequence of independent random variables X;, X;,... such that each of the three series LP(Xn#X), LE(X), L V(X) n n n is convergent. E. Conditional Probability and Conditional Expectation Let (0, 1!3, P) be a probability space and IY a (J- subalgebra of 1!3. If X is a random variable 342E Probability Theory with finite mean, the function fl(E) = t X(w)dP, EE'ij, defines on (0, 'ij) a completely additive set function which is tabsolutely continuous with respect to P. Therefore, by the tRadon- Nikodym theorem, there is an IY-measurable function f such that fl(E) = tf(W)dP for every EEIJ. This function is unique up to a set of p. measure zero and is called the conditional expectation (or conditional mean) of X relative to IY, denoted by E(X I IY). When IY is gen- erated by a random variable Y, we also write E(X I Y) for E(X I IY) and calI it the conditional expectation of X relative to Y. In this case, there is a tBorel measurable function f such that E(X I Y) = f(Y(w)), and we write E(X I Y = y) for fey). The same fact holds when Y is a multidimensional random variable. It follows from the definition that the conditional expec- tation has the following properties, up to a set of P-measure zero: (i) if X ;;'0, then E(X 1m;;. 0; (ii) E(aX +bYi'ij)=aE(Xi'ij)+bE(YIIY); (iii) E(E(X I 'ij))=E(X); (iv) if X and IY are mutu- ally independent, i.e., I!3[X] and IY are mutu- ally indepent, then E(XIIY)=E(X); (v) if X is IY-measurable, then E(X IIY) = X and E(XYlIY) =XE(YIIY);(vi)ifIimncoXn=X", with IXnl Y and Y is an integrable random variable, then limn_co E(X n I IY)= E(XX) I IY); (vii) if (I) is a (J-subalgebra of IY, then E(E(X I IY)I (1))= E(X I (I)); (viii) if X 2 is integrable and Y is any IY-measurable random variable, then E((X- E(XIIY))2)E((X _ y)2). When X is the indicator function (i.e., the tcharacteristic function) XE of a set E in 1!3, E(XEIIY) is called the conditional probability of E relative to IY and is denoted by peE I IY). In particular, if IY = {F, F C , 0, O} with 1 > P(F) >0, peE I 'ij) is the simple function which takes the values P( En F)/ P( F) on F and peE n P)/P(F C ) on F C . These values are de- noted respectively by peE I F) and peE I F C ). The definition of peE I Y) or peE I Y = y) is also the same as in the case of the conditional expectation. Let IJ be a (J-subalgebra of 1!3 and Ya real random variable. According to the foregoing definition, PlY EEl IY) or P(y-I(E)I 'ij) is the conditional probability of the occurrence of the event Y E E under IJ. Since P( Y EEl IY) is determined except on a P-null set depending on E, an arbitrary version of P(YEE I IY), viewed as a function of E, does not always satisfy the conditions of a probability measure. However, we can prove that there exists a nice version of P(Y E E IIJ) which is a proba- 
342 F Probability Theory bility measure in E EIB 1 for every WE!"! and that such a version is unique almost surely. This version is called a regular conditional probability of Y E E under IY or the conditional probability distribution of Y under IY; this is written as 1'y(EIIY). 1'(YEEIX), 1'(YEEIX =x), 1'y(EIX), and 1'y(EIX=x) are interpreted similarly. The conditional probability distri- bution can be defined not only for real ran- dom variables but also for every random variable which takes values in an tanalytic measurable space. F. Bayes's Formula Let E1.El, ...,En be pairwise exclusive events, and assume that one of them must occur. If E is another random event, we have 1'(£IE) - 1'(EJ1'(EIEJ I 1'(E1)1'(EIE1)+'" +1'(En)1'(E 1 En) , where 1'(E;) is the probability of the event E j and 1'(E 1 E;) is the conditional probability of E under the assumption that the event E j has occurred. This is called Bayes's formula. In practical applications El' ... , En usually repre- sent n unknown hypotheses. Suppose that the probabilities on the right-hand side of the formula are given. We then apply Bayes's formula to reevaluate the probability of each hypothesis E j knowing that some event E has occurred as the result of a trial. This is why 1'(E;) (1'(E j 1 E)) is called the a priori (a pos- teriori) probability. However, the determina- tion of the values of a priori probabilities is sometimes difficult, and we often set 1'(E j ) = l/n in practical applications, although this has caused a great deal of criticism. When X is a random variable subject to the distribution with continuous probability den- sity f(x), Bayes's formula is extended to the following form: f(xo)1'(E I X = xo) f(x o I EI - Jro 1'(E 1 X = x)f(x)dx ' where f(x I E) is the conditional probability density of the random variable X under the assumption that the event E has occurred, and P(E I X = xo) is the conditional probability of E relative to X. G. Zero-One Laws In probability theory there are many theorems claiming that an event with certain properties has probability 0 or 1. Such theorems are called zero-one laws. Here, we mention two famous examples, Kolmogorov's zero-one law 1270 [6J and the Hewitt-Savage zero-oo.e law. Let 1X=IX(X 1 ,X l ,...) be an event concerning a se- quence of random variables {X n }. IX is called a tail event concerning {X n } if for every n, occur- rence or nonoccurrence of IX depends only on {X n , X n + 1 ,...}. For example, {limnro X n =O} is a tail event. IX is called a symmetric event con- cerning {X n } if occurrence or nonoccurrence of IX is invariant under every finite permutation of Xl, Xl.... For example, the event that k-l X k >0 for infinitely many n's is a symmetric event. Kolmogorov's zero-one law: Every tail event concerning a sequence of indepen- dent random variables has probability 0 or 1. Hewitt-Savage zero-one law: Every symmetric event concerning a sequence of independent and identically distributed random variables has probability 0 or 1. Kolmogorov's zero-one law can be extended as follows: Let IYn' n = 1,2, ..., be a sequence of independent (J-subalgebras of lB. Then the (J- algebra X = nk Un>k IYn, called the: tail (J- algebra of {/Yn}, is trivial, i.e., 1'(A) = 0 or 1 for every A EX. Kolmogorov's zero-one law is a special case where IYn is the (J-algebra gen- erated by X n for every n. References [IJ 1. Todhunter, A history of the mathemat- ical theory of probability from the time of Pascal to that of Laplace, Macmillan, 1865 (Chelsea, 1949). [2J J. Bernoulli, Ars conjectandi, ] 713. [3J A. de Moivre, The doctrine of chances, 1718. [4J P. S. Laplace, Theorie analytique des probabilites, Paris, 1812. [5J R. von Mises, Vorlesungen au, dem Gebiete der angewandten Mathematik I, Wahrscheinlichkeitsrechnung und ihre An- wendung in der Statistik und theoretischen Physik, Franz Deuticke, 1931. [6J A. Kolmogorov (KolmogorofI), Grund- befriffe der Wahrscheinlichkeitsrec:hnung, Erg. Math., Springer, 1933; English trao.slation, Foundations of the theory of probability, Chelsea, 1950. [7J E. Borel, Les probabilites denombrables et leur application arithmetiques, Rend. Circ. Mat. Palermo, 27 (1909),247-271 [8J N. Wiener, Differential space, J. Math. Phys., 2 (1923),131-174. [9J K. L. Chung and P. Erdos, On the appli- cation of the Borel-Cantelli lemma, Trans. Amer. Math. Soc., 72 (1952),179-186. [IOJ P. Levy, Theorie de I'addition des vari- ables aleatoires, Gauthier-Villars, 1937. [IIJ J. L. Doob, Stochastic processes, Wiley, 1953. 
1271 [12] M. M. Loeve, Probability theory, Van Nostrand, third edition, 1963. [13] W. Fel1er, An introduction to probability theory and its applications, Wiley, I, second edition, 1957; 11,1966. [14] L. Breiman, Probability, Addison-Wesley, 1968. [15] J. Lamperti, Probability, Benjamin, 1966. 343 (VI.14) Projective Geometry A. Introduction Projective geometry is the most fundamental of classical geometries and one of the first examples of ax iomati zed mathematics. B. Construction of Projective Geometry We construct projective geometry axiomati- cal1y [4]. Given two sets P, Q and a trelation r c P x Q, consider the triple 'P = {p, Q, q. We cal1 each element of P a point and each ele- ment of Q a line. If (p, I)E r holds for a point P and a line I, then we say that the line I contains the point p. When two lines II and 1 2 contain a point P, we say that they intersect at p. When several points are contained in the same line, these points are said to be collinear, and when severallines contain the same point, these lines are said to be concurrent. For 'P we impose the following axioms: (I) There exists one and only one line that contains two given distinct points. (II) Suppose that we are given noncol1inear points Po, PI' and P2' and distinct points qj, q2' Now suppose that {PO,P1,qr} and {Po, P2' q2} are col1inear triples. Then the line con- taining PI' P2 and the line containing q I, q 2 necessarily intersect (Fig. 1). A---- P, P, Fig. 1 (III) Every line contains at least three dis- tinct points. The 'P that satisfy axioms (I) and (II) and axioms (I), (II), and (III) are cal1ed general projective geometry and projective geometry, 343B Projective Geometry respectively. The set of al1 points that are contained in a line is cal1ed the point range with the line as its base. In projective geome- try, there exists a one-to-one correspondence between the set of lines and the set of point ranges, so we can identify every line with a point range. In this case a line IEQ is repre- sented as a subset of P, and the relation (p,I)E r means that the point P belongs to the set l. Let S be a subset of P and PI' P2 be any two distinct points of S. If the line that contains PI and P2 is always contained in S, then S is caned a subspace. Points and lines are sub- spaces. Now we impose the fol1owing axiom: (IV) There exist a finite number of points such that any subspace that contains al1 of them contains P. We cal1 a projective geometry satisfying axiom (IV) a finite-dimensional projective geometry, which from this point on wil1 be the sole object of our consideration. We can P a projective space. Consider sequences of subspaces of the type P  p.- 1  ...  PI  Po =1= 0, where 0 is the empty set. The num- ber n of the longest sequence is cal1ed the dimension of P. If P is of dimension n, we write p' instead of P. We cal1 pI a projec- tive line and p2 a projective plane. Each sub- space S of P, together with the set of lines of P contained in S, gives a finite-dimensional projective geometry, and so S is a projective space. Lines and points are projective spaces of dimensions 1 and 0, respectively. By con- vention, the empty set is a ( -I)-dimensional projective space. We cal1 each 2-dimensional subspace a plane and each (n -I)-dimensional subspace in p' a hyperplane. Let M, N be subspaces of P, and for a pair of points P E M, q E N consider the set P U q of al1 points on the line that contains P and q. The set {pUq IPEM, qEN} is denoted by MUN, and we can it the set spanned by M and N. By convention, we put 0 U M = M and pUp=p. Then prUp s is the projective space of the lowest dimension which contains pr and p s . On the other hand, if we denote the intersection of pr and p s by pr n p s , then it is the projective space of highest dimension that is contained in both of them. We cal1 pr UPS and pr n p s the join and the intersec- tion of pr and r, respectively. When the dimension of the space spanned by r + 1 points is r, we say that these points are inde- pendent; otherwise they are dependent. If any r + 1 points of a given subset M of p' are independent for each r  n, we say that points of M lie in a general position. The space pr necessarily contains r+ 1 independent points, and there necessarily exists a pr that contains r + 1 arbitrary given points in a projective space; it is unique if the points are indepen- 
343C Projective Geometry dent. If pr uP' = P' and P' n P' = p u , then r + s = t + u. We call the latter the dimension theorem (or intersection theorem) of projective geometry. The set El of all hyperplanes that contain a P O - 2 in pn is called a pencil of hyperplanes, and the P O - 2 common to them is called the center of El' If a pencil of hyperplanes contains two distinct hyperplanes of pn, then the pencil is determined uniquely by these two. When n = 2 and 3, it is called a pencil of lines and a pencil of planes, respectively. Each pencil of hyperplanes of pn, or more generally, each pencil of hyperplanes of a subspace of an arbi- trary dimension in P", is called a linear funda- mental figure of pn or simply a fundamental figure. In P", the set E, of all P"-', p n - dl , ..., pn-" that contain the same P o - r - l is called the star with center po-r-l. Each set that con- sists of the totality of subspaces of an arbitrary demension in the same P' or a subset of it is called a P' -figure. Undcr the assumption that P' and P' do not have points in common, the operation of con- structing pr UP' from P' and P' is called pro- jecting P' from P'. Assuming that P' and P' have points in common the operation of con- structing P' n ?" from P' and P' is called cut- ting P' by P'. Suppose that we are given spaces Po, PI' and P 2 , and a fundamental figure E in the space PI' By projecting E from Po and then cutting it by P 2 , we can construct a funda- mental figure E' on P 2 . This operation is called projection of E from Po onto P 2 , and we call Po the center of projection (Fig. 2). In this case, we say that E and E' are in perspective and denote the relation by EOX E'. Hfor two fundamental figures E and E' there exist a finite number of fundamental figures F i (1  i  I) such that EOX F, OX... OX F; OX E', then we say that E and E' are projectively related to each other and denote this by E7\ E' (Fig. 3). Now for arbitrary sub- spaces PI, P; (Orn), we take PO-,-I that n Pj P, Fig. 2 F' Fig. 3 1272 have no points in common with trem and project each point of PI onto P; from PO-,-I. The one-to-one correspondence P' -> P; thus obtained is called a perspective mapping. If a one-to-one correspondence PI-> P; is repre- sented as the composite of a finite number of perspective mappings, then we call it a projec- tive mapping. These mappings are extended to those of fundamental figures, too. Suppose that in a proposition or a figure in pn, we interchange pr and pn-,-I (0  r  n) and also interchange contains and is contained (and related terms). The proposition or the figure thus obtained is said to be dual to the original one. In projective geometry, if a pro- position is true, then its dual proposition is also true (duality principle). This is assured because propositions dual to axioms (I)-(lV) hold; and pr and E, are dual to each other. The projective space P; obtained by the prin- ciple of duality, by regarding the hyperplanes of P" as points of P;, is called the dual space of p n . C. Projective Coordinates Here we introduce projective coordinates in P". Consider Desargues's theorem: Suppose that PI, P2, P3 and ql' q2' q3' are two sets of points in pn, each of which is independent and satis- fies Pi #qi (i = 1,2,3). If the three lines Pi U qi (i = 1,2,3) are concurrent, then the three points (P2 U P3) n (q2 U q3), (P3 U PI! n (q3 U qI!, (PI U P2) n (ql U q2) are collinear. The converse is also true. This theorem holds for n? 3 gen- erally. However, when n=2, there exist projec- tive geometries for which it does not hold; we call these non-Desarguesian geomdries. In such cases it is impossible to introduce coordi- nates, so we assume Desargues's theorem for n=2. When four points Pi (1  i 4) in pn lie on the same plane and in general position, we call the figure that consists of these four points and the six straight lines 9ij = Pi U Pj (1 'S: i j  4) a complete quadrangle PIP2P3P4; ea,h Pi is called a vertex, and each 9ij is called a side. If six points qi (1  i  6) on a line 1 are points of intersection of six sides 912' 913' 914' 934' 924, 923 of a complete quadrangle with I, we call Pj q, Fig. 4 
1273 them a quadrangular set of six points (Fig. 4). By Desargues's theorem, we can show that if there are given three fixed distinct points on a line I, then any pair of distinct points on 1 determines uniquely a point on 1 such that the six points thus obtained constitute a quadran- gular set. The quadrangular property is invari- ant under projective mappings. On a line 1 we fix three mutually distinct points Po, PI' Poo' For any two points PX' Py different from Poo on I, we take the point s such that Poo' Px' Po, P oo ' P y , s constitute a quadrangular set of six points and can s the sum of Px and Py with respect to [Po, Poo' PI] (Fig. 5). On the other hand, the point t such that Po, Px, PI' Poo' P y , t con- stitute a quadrangular set of six points is caned the product of Px and Py with respect to [PO,Poo,P1] (Fig. 6). When we are given a fixed triple [Po, Poo' PI] on a line 1, as before, the set of points on 1 not equal to Poo is called a point range of the number system, provided that we exclude Poo from the point range. We can the set of three points [Po, Poo' Pi] a frame (or projective frame) of 1, and we can Po the origin, PI the unit point, and Poo the supporting point. g, Fig. 5 g" P" P I Fig. 6 A point range of the number system consti- tutes a tfield (which may be noncommutative) with respect to the previously defined sum and product. We can the field a Staudt algebra, and an abstract algebra isomorphic to it is caned a coefficient field of p.. We denote by K(po, Poo, Pi) the Staudt algebra that is deter- mined by a frame [Po, Poo' PI]. A projective mapping of 1 onto itself that leaves invariant each of three distinct points Po, Poo' PIon 1 is necessarily an tinner automorphism of the field K(PO,Poo,P1)' Denoting by [PO,Poo,P1] a frame on a line 1 of p. with coefficient field K, we can each isomorphism e:K(PO,Poo,Pl)-+K a co- ordinate system of I. For each point P on 1 we can the element  = 8(p) of K the inhomoge- 343C Projective Geometry neous coordinate of P with respect to this frame. Also, we call the pair (XO,Xl) such that xO, X1EK and x 1 (XO)-1 = homogeneous co- ordinates of p. Since the supporting point Poo is excluded from K(po, Poo' PI), we fix (O,x l ) such that Xl #0 as the homogeneous coordinates of Poo' In order for (XO,x 1 ) and (yO,yl) to be homogeneous coordinates of the same point, it is necessary and sufficient that there exist an element A40 of K such that y. = x. A. (IX = 0, 1). In conformity with these results, we now introduce coordinates in p.. A set /y=[ao, aI' .,. , a., u] of ordered n + 2 points in a gen- eral position is called a frame (or projective frame) of p.; each of a. (O:!i; 1X:!i; n) is caned a fundamental point, and u is caned a unit point. For A =a. o U .., U a., (O:!i; lXo <... <lXr:!i; n), we denote by A * the space spanned by the re- maining fundamental points. For any point P that is not contained in A*, we put PA = An (p U A *) and can it the component of P on A. Then /YA = [a. o "'" a." u A ] is a frame of A. Hereafter, we shall omit U A for brevity. Suppose that isomorphisms 8. p :K(a.,a p )-+K are assigned for each pair IX, {3 (O:!i; IX < {3:!i; n). Under a certain condition, the system {e. p } is determined by one of the 8. p . In this case we denote {8. p } by 8 and can {/y, 8} a projective coordinate system of p.. For any point P of p. not contained in Ao = a 1 U .., U a., we denote by Pi the component of P on aoUa j (1 :!i;i:!i;n), and we put i = 8(p;). The elements of the ordered set (r, e, ..., .) are called the in- homogeneous coordinates of P with respect to /Y, and those of the set (XO, Xl,..., X.) such that xi(XO) -1 = j are caned the homogeneous co- ordinates of p. When P is contained in A o ' we define (0, Xl, ... ,x.) as homogeneous coordi- nates of P with respect to /Y, provided that (Xl,..., x.) are homogeneous coordinates of P with respect to /YAo' Now we represent the point whose coordi- nates are (XO,xl, ...,x.) simply by x. In p., when coordinates are introduced, a necessary and sufficient condition for points z to be on the line that passes through two distinct points x and y is that z. = x...1. + y. Jl (O:!i; lX:!i;n), when )., JlE K are parameters. More generany, a point z is contained in the space spanned by r+ 1 independent points xp (O:!i;{3:!i;r) in p. if and only if Z.=Lp=oXpA. P (O:!i;IX:!i;n,A.PEK). In particular, the equation of a hyperplane is represented in the form L:=oX.Z. =0 (X.EK) with respect to variable coordinates z.. There- fore each hyperplane is uniquely determined by the ratio of X o , Xj, ...,X.. We can X o , XI' "', X. hyperplane coordinates of the hyper- plane. If n = 2, they are called coordinates of a line, and if n = 3, plane coordinates of a plane. (For coordinates of pr in p.  90 Coordinates B.) 
343D Projective Geometry D. Projective Transformations A one-to-one correspondence qJ between the point sets of two projective spaces P" and fin is called a coIlineation in the wider sense if for any three points PI' P2, P3 that are collinear, qJ(pJ (i = 1,2,3) are also c01linear and vice versa. If fin = PZ, we ca1l qJ a correlation; if fin = P", we ca1l qJ a collineation. If we denote a correlation by to, any other correlation is obtained as a composite of to and a c01lineation. If t is a correlation, it naturally induces a mapping PZ --+ P", which we also denote by t. Then tOt is a c01lineation. If tOt is an identity, we ca1l t an involutive correlation. Suppose that qJ: p n --+ pn is a c01lineation in the wider sense and O:S; r:S; n - 1. Then qJ ind uces a one-to-one cor- respondence between the set of r-dimensional subspaces of P" and the set of r-dimensional subspaces of fin; and if p':::J p s in P", then qJP':::J qJp s . Next, suppose that we are given two projec- tive spaces pn and pn that are subspaces of a space p N (n < N). (When Desargues's theorem holds, any two projective spaces of the same dimension can be identified with subs paces of a projective space of higher dimension.) In this case, when a c01lineation in the wider sense qJ: P"--+ j'in is a projective mapping, we call it a projective collineation in the wider sense. A projective collineation is also ca1led a projective transformation. The totality of c01lineations of P" constitutes a ttransformation group and is ca1led the group of collineations of P"; we de- note it by (£(P"). The totality of projective transformations of P" constitutes a tnormal subgroup of (£(P"); we denote it by <fJ(P") and ca1l it the group of projective transformations. The totality of projective transformations that leave invariant a frame IY of P" constitutes a subgroup <fJZ+1 (!j). It is isomorphic to the group of tinner automorphisms 3(K) of the coefficient field K of P". A c01lineation is not necessarily a projective transformation. The former is obtained as a composite of a projec- tive transformation and an automorphism of the coefficient field. Specifica1ly, if we denote the group of tautomorphisms of K by IlI(K), then (£(P")/<fJ(P")IlI(K)/3(K). Hence in order for a1l c01lineations to be projective transfor- mations, it is necessary and sufficient that a1l automorphisms of the coefficient field be inner automorphisms. If the coefficient field is the real number field, then collineations are always projective transformations. For the complex number field, however, this is not necessarily true. Now, we consider the f01l0wing three pro- positions: (1) The coefficient field of P" is com- mutative. (2) Given frames !j and !j' of P", there exists a unique projective transformation 1274 sending  onto '. (3) Given two distinct lines /1 and /2 contained in a plane in P" and two sets of three distinct points Pi (i = 1,2,3) and qi (i = 1,2,3) that lie on /1 and /2 respectively, then the three points (P2 U q3) n (P3 J q2), (P3Uql)n(PI Uq3), and (PI Uq2)n(P2Uql) are collinear. These three propositions are mutu- a1ly equivalent. We call proposition (2) the fundamental theorem of projective geometry and proposition (3) the theorem of Pappus. If the coefficient field is the real (complex) num- ber field, we ca1l the projective space a real (complex) projective space. In classical geom- etry, only these cases were studied. Suppose that the coefficient field is com- mutative. Then, if we assign an isomorphism 8 0 :K(Po,Poo,PI)--+K for the Staudt algebra K (Po, Poo' P d on a line in a space, then the isomorphism 8 of the Staudt algebra K(qo, qoo, q I) on an arbitrary line onto K can be uniquely determined so that 8- 1 00 0 is a pro- jective mapping. Utilizing such isomorphisms, we can determine homogeneous coordinates in an arbitrary subspace of P" by a frame on it. Suppose that the coefficient field is a com- mutative field whose characteristic is not 2. For four c01linear points Pi (1:S; i :S;4) in P", where Pb P2' P3 are distinct and P4 0/; Pb we consider a frame such that PI' P2, and P3 are, respectively, the supporting point, the origin, and the unit point. The inhomogeneous co- ordinate A of P4 with respect to this frame is ca1led the anharmonic ratio (cross ratio or double ratio) of these four points and is de- noted by [PbP2;P3,P4]. If we denote the inhomogeneous coordinates of Pi with respect to a general frame by (x?, xl) (i = 1,2,3,4), then }, can be expressed as A=[Pj'P2;p3,p4] (x?x1-xgxD(xxl-xx1) (x?xi -xxD(xx1-xgx1J' Moreover, if we interchange the order of the four points, then we have A=[P2,PI;P4,P3] =[P3,P4;PI,P2] =[P4,P3;P2,PI], 1 [PI, P2; P4' P3] =-;-, A [PI,P3;P2,P4]= I-A, 1 [P1,P3;P4,P2]= I-A ' A-I [PI,P4;P2,P3]=T' A [pj,P4;P3,P2] =--;-- 1 ' A- 
1275 In general, these six values are different; how- ever, there are the following two exceptions: when A.= -1,1/2, and 2; and when A. is a root of A.z-A.+ 1 =0. When A.= -1, these four points are called a harmonic range of points, and the points P3' P4 are called harmonic conjugates with respect to PI' pz; or PI' pz and P3, P4 are said to be harmonically separated from each other. When A. z - A. + 1 = 0, these four points are said to be an equianharmonic range of points. For the dual of these, we can consider the anharmonic ratio offour hyper- planes of a pencil of hyperplanes. The concept of the anharmonic ratio can be extended fur- ther to the case of four elements of funda- mental figures in general. The anharmonic ratio is a quantity that is invariant under projective transformations. Each projective transformation x->x is expressed with respect to homogeneous co- ordinates x. (0:::; oc:::; n) of p. as n px -.= " tp . x P , P t EK L... ' fj , p=o p #- 0, det(tfj) #- O. Conversely, if T=(tfj) is a tregular matrix (tpEK), then (1) determines a projective trans- formation. So there is a one-to-one corre- spondence between projective transformations and tequivalence classes of the regular ma- trices T=(tp) with the tequivalence relation TA.T(A.EK'-{O}). Therefore, when K is com- mutative, the group of projective transforma- tions ffi(pn) of pn is isomorphic to the factor group PGL(n+ 1, K) of the tgenerallinear group GL(n + 1, K) with the coefficient field K by its center {pI I pEK '-{O}}; that is, ffi(pn)  PGL(n+ 1,K). Extending the definition of projective trans- formations, we call the transformation repre- sented by (1), with an arbitrary square matrix that is not necessarily regular, a projective transformation. When T is regular, it is called a regular projective transformation, and when T is not regular, it is called a singular projec- tive transformation. In particular, if the trank of Tis n + 1- h, then we say that the projective transformation is singular of the hth species. If (1) is singular of the hth species, n+ 1 hyper- planes Lp=otfjXP=O (O:::;oc:::;n) have a space p h - 1 in common. We call p h - 1 the singular subspace of this transformation. A projective transformation is not defined on its singular subspace. A singular projective transformation of the h th species is the composite of the pro- jection of pn onto some pn-h with the singular subspace as its center and a regular projective transformation of pn-h, If the coordinates of a point are denoted by (x.) and hyperplane coordinates with 343E Projective Geometry respect to some frame by (X.), then the linear transformation n ,*:pX.= I t:px P , p, t:pEK, p=o is a correlation. (Here also, we extend the definition of correlation and include the case where T* = (t: p ) is not regular.) The condition that '* is an involutive correlation is given by T*= :!:'T*, When T*= _'T*, the involutive correlation '* is called a null system. The correlation '* is a nuII system if and only if any point x of pn is contained in the hyper- plane "t*(x). When T* = 'T*, we call the involu- tive correlation '* a polar system. For a polar system "t*, the set of points x that are con- tained in hyperplanes ,*(x) constitutes a qua- dric hypersurface (or hyperquadric). (2) E. Quadric Hypersurfaces (1 ) Let '* be a polar system, and let Q-1 be the totality of points x contained in "t*(x). Then the equation of the quadric hypersurface <rz- 1 is given by n I t:px.x P =0. ..p=o (3) For such a correlation '* we call a relation between the set of points x of pn and the set of hyperplanes ,*(x) a polarity with respect to Q-I. We calI "t* (x) the polar of x with respect to <rz-1, and x the pole of-r*(x) with respect to Q-I. If the points of intersection of a line passing through a point x with Q-1 and "t*(x) are denoted by z l' zz; y, then x, y; z l' Zz is a harmonic range of points. When a point x lies on the polar of a point y, we say that x and y are mutually conjugate. Each point on Q-1 is conjugate with itself, and the converse is also true. We call the polar of a point on Q-1 the tangent hyperplane of <rz- 1 at that point. If '* is regular or singular of the hth species, we calI the corresponding quadric hypersur- face regular or singular of the h th species. If "t* is singular of the hth species, its singular sub- space is contained in Q-I. We call points on this singular subspace singular points of <rz- 1 . Q-I, which is singular of the first species (i.e., <rz- 1 with just one singular point), is called a cone. We call a subspace contained in Q-1 a generating space. If it is a line we call it a generating line. We put q=(n - 2)/2 or (n -1)/2 according as n is even or odd. Then, if the coefficient field is an talgebraicalIy closed field, for each regular Q2- 1 there necessarily exist q-dimensional generating spaces. Also, Q is a truled surface covered by two families 
343F Projective Geometry of generating lines, and Qk is covered by two families of k-dimensional generating spaces. If Pi (1  i  5) are five points in a general position in a plane, then there exists one and only one Q passing through these points; we ca11 Q a conic. In order that six points Pi (1  i  6) in a plane lie on QL it is necessary and sufficient that the three points (PI U P2) n (P4 UPs), (P2 U P3)n(pS U P6), and (P3 U P4)n (P6 U PI) be co11inear (Pascal's theorem). The dual of the last theorem is ca11ed Brianchon's theorem. Given two hypersurfaces Q-I, Q-1 in pn, we consider another Q-1 such that the yolar of an arbitrary point x with respect to Q-1 belongs to the pencil of hyperplanes deter- mined by polars of x with resrect to Q-1 and Q-I. The set of a11 such Q-1 is ca11ed a pencil of quadric hypersurfaces. It is the set of a11 Q-1 that pass through the intersection of Q-1 and Q-I. In the cases n=2 and 3, we ca11 it a pencil of conics and a pencil of quad- rics, respectively. Denoting by lij and Tij (0  i j  3) the tPliicker coordinates of two straight lines 1 and Tin p3, we put (I, T) = 1° 1 [23 _1°2[13 + 1° 3 [12 + T01 [23 _ T02113 + T03112. Then (I, I) = 0 holds. If we regard these lij as homogeneous coordinates of p s , then there exists a one-to-one correspondence between the points on the regular quadric hypersurface Q1 defined by (I, I) = 0 and the lines in p3 . We say that each point of Q1 is the image of the line corresponding to it in p3. Two lines 1 and T intersect if and only if (I, T)=O. Geometrical1y, this means that the images of 1 and T are con- jugate with respect to Q1. Therefore the line passing through the images of 1 and T is a generating line of Q1. The image of a pencil of lines in p3 is a generating line of Q1. Quadric hypersurfaces and sets of lines in p3 are im- portant objects of study in both projective and algebraic geometry. In particular, linear line congruences (linear line complexes) that are families of lines dependent upon two (three) parameters are of great interest. In these theories, quadric hypersurfaces playa fun- damental role. When the coefficient field is noncommutative, the above theory has to be greatly modified. F. Projective Geometry and the Erlangen Program From the standpoint of the tEriangen pro- gram of F. Klein, the aim of projective geom- etry is to study properties that are invariant under the group of projective transformations. 1276 Utilizing various subgroups of this group, we can reconstruct various classical geometries. For example, consider the projective space P" whose coefficient field is the real number field, and fix a hyperplane IIoo. Let 6(P") be the subgroup of projective transformations formed by a11 projective transformations that leave IIoo invariant. Then the geometry that belongs to this group is taffine geometry. Similarly, fix an imaginary regular quadric hypersurface Q-2 in IIoo and consider the geometry that belongs to the subgroup of 6(P") leaving this Q-2 invariant. We thus obtain Eudidean geometry. Moreover, if we assign some regular quadric hypersurface Q-I, then the geometry belonging to the subgroup of ffi(P") that leaves the Q-1 invariant is a tnon-Euclidean or tconformal geometry according as the trans- forma tion space is the set of inner points of Q-1 or the whole Q-I. G. Projective Geometry and Modular Lattices (4) tLattices (lattice-ordered sets) and projective geometry are intimately related. The totality of subspaces of each dimension in general projec- tive geometry \13 constitutes a tcornplete tmodular lattice L(\I3) with respect to the in- clusion relation. If \13 is a finite-dimensional projective geometry, then it is an tirreducible complemented modular lattice of finite theight. Conversely, suppose that L is a modular lat- tice with tminimum element <1>, and denote by P the totality of elements P tprime over <1> (i.e., tatomic elements) and by Q the totality of elements 1 prime over atomic elements. Then, if p<1 and (p,I)Er, \I3(L)={P,Q,r} is a general projective geometry. If L is an irreducible complemented modular lattice of fmite height, then \I3(L) is a finite-dimensional projective geometry; in this case we have \13  \I3(L(\I3)) and L:::::; L(\I3(L)). So we may consider projec- tive geometry and irreducible complemented modular lattices as having the same mathe- matical structure. If a lattice L is an n- dimensional projective geometry, its tduallat- tice is also an n-dimensional projective geom- etry, and this is the principle of duality. H. Analytic Representations of Projective Geometry Let K be an arbitrary field, commutative or noncommutative. For an arbitrary natural number n, we consider an (n+ 1)-dimensional (for the noncommutative case, right or left) linear space V"+1(K) over K. The totality of linear subspaces in it constitutes an irreducible complemented modular lattice P"(K) with 
1277 respect to the inclusion relation, and P"(K) gives rise to an n-dimensional projective geom- etry. We call it a right or left projective space. Points of P"(K) correspond to (right or left) l-dimensional1inear subspaces. Conversely, it can be shown that an n-dimensional projec- tive geometry over K is isomorphic to P"(K). Therefore projective geometries can be com- pletely classified by means of the natural number n and the field K except when n=2 and the geometry is non-Desarguesian. We may restate this fact as fol1ows: We consider a space P= V"+I(K)-{O}. If we fix a basis of V"+1(K), then we can represent P={x=(XO, Xl, ...,X")IX"EK, 0:::; iX:::; n}, x#(O,O, ...,0). If there exists a nonzero element A. of K such that Y=XA., then the elements x and yare called equivalent; we write x y. We denote by P"(K) the factor set of P under the foregoing equiva- lence relation, and by [x] the equivalence class that contains x. We put l([x], [y])={[z]lz"= x" A. + y" jl, \fA., jlE K} and Q = {l([x], [y])1 [x], [Y]EP"(K)}. We call each element ofP"(K) a point and each element of Q a line. Then these points and lines and the natural inclusion relation satisfy axioms (I)-(IV) and give an n- dimensional projective geometry. When K is a ttopological field (e.g., the real number field, the complex number field, or the tquaternion field), we may define the topology of P"(K) as the factor space P"(K) = PI . In particular, if K is the real number field R, then P"(R) is homeomorphic to the factor space obtained from the n-dimensional hypersphere S":(XO)2+ ... + (X")2 = 1 in the (n + 1)-dimensional Eucli- dean space E"+1 by identifying the end points of each diameter. Hence P"(R) is compact. Similar facts hold for the cases of the complex and quaternion number fields. Since the group of projective transformations ffi(P"(K» acts ttransitively on P"(K), if K is a topological field we can regard P"(K) as a thomogeneous space of the topological group ffi(P"(K»). Moreover, the totality of r-dimensional sub- spaces in P"(K) constitutes a tGrassmann manifold. In algebraic geometry the tdirect product of two projective spaces is important; we call it a biprojective space. I. Tits's Theory of Buildings (Generalization of Projective Geometry) In a situation when a triple (G, B, N) consisting of a group G and its subgroups B, N satisfies the axioms of a BN -pair or Tits system ( 13 Algebraic Groups R), a new geometric object, called a "building," was introduced by 1. Tits [9]. His theory contains projective geometry as a particular case. The theory of buildings has deep connection with algebraic groups. 344 A Pseudoconformal Geometry The Tits system corresponds to a projective geometry in the following case. Let k be any commutative field, and let G be the general linear group of degree n over k, i.e., G consists of aU nonsingular square matrices of degree n with entries in k. Let B be the subgroup of G consisting of all upper triangular matrices (i.e., matrices whose entries below the principal diagonal are aU zero). Let N be the subgroup of G consisting of all monomial matrices (i.e., matrices such that each column and each row contain just one nonzero entry). Then (G, B, N) forms a Tits system called type (A"_I)' The corresponding theory of buildings of the type above is nothing but the projective geometry. Thus by means of Tits's theory of buildings the relationships among projective geometry and other geometries have been clarified [9]. References [1] G. Birkhoff, Lattice theory, Amer. Math. Soc. Colloq. Pub!., revised edition, 1967. [2] W. V. D. Hodge and D. Pedoe, Methods of algebraic geometry I, Cambridge, 1947. [3] O. Schreier and E. Spemer, Einftihrung in die analytische Geometrie und Algebra, Van- denhoeck & Ruprecht, 11,1951; English trans- lation, Projective geometry of n-dimensions, Chelsea, 1961. [4] O. Veblen and J. W. Young, Projective geometry I, II, Ginn, 1910-1938. [5] E. Artin, Geometric algebra, Interscience, 1957. [6] S. Iyanaga and K. Matsuzaka, Affine geometry and projective geometry, J. Fac. Sci. Univ. Tokyo, 14(1967), 171-196. [7] A. Seidenberg, Lectures in projective geom- etry, Van Nostrand, 1962. [8] R. Hartshorne, Foundations of projective geometry, Benjamin, 1967. [9] 1. Tits, Buildings of spherical type and finite BN-pairs, Lecture notes in math. 386, Springer, 1974. 344 (VII.22) Pseudoconformal Geometry A. Definitions Let A and A' be subsets (with relative topol- ogy) oftcomplex manifolds X and X' of dimen- sion n, respectively. A homeomorphism f of A onto A' is caned a pseudoconformal trans- formation if there exists a tbiholomorphic mapping 1 of an open neighborhood of A in X onto an open neighborhood of A' in X' such 
344B Pseudoconformal Geometry that J(x) = f(x) for XE A. If there exists such a mapping t, A is said to be pseudoconformally equivalent to A'. Pseudoconformal geometry is a geometry that studies geometric properties invariant under the pseudoconformal equiva- lence. However, most studies in pseudocon- formal geometry so far have concentrated mainly on the investigation of smooth hyper- surfaces in a complex manifold-more specifi- cally, the smooth (or real analytic) boundaries of bounded domains in e. In fact, to pseudo- conformal geometry on hypersurfaces we can apply the methods of differential geometry as well as those of the theory of functions of several complex variables. H. Poincare [IJ studied perturbations of the boundary of the unit ball in C 2 that are pseudoconformally equivalent. E. Cartan [2J studied the equivalence problem of hyper- surfaces in C 2 and gave the complete list of all simply connected hypersurfaces on which the group of pseudoconformal automorphisms acts transitively. Such a hypersurface is called homogeneous. Let M be a smooth hypersurface in a com- plex manifold X with the talmost complex structure tensor J, i.e., J x : TxX -. TxX is an involutive linear automorphism of the tangent space TxX of X at x induced by the complex structure of X. Put HxM = T,;M nJ x TxM for xEM. The union of all HxM is called the bundle of holomorphic tangent vectors of M and is denoted by H(M). H(M) is also called the CR (Cauchy-Riemann) structure of M. Let M' be a smooth hypersurface in a complex manifold X'. A diffeomorphismf:M-.M' is called a CR-equivalence if the tdifferential mapping Tf: TM -. TM' of f preserves the CR-structures, where TM denotes the ttangent bundle of M. Iff: M -.M' is a pseudocon- formal transformation, then f is clearly a CR- equivalence. Let Ex be the annihilator of HxM in Tx*(M). Then the union of Ex (xEM) defines a tline bundle E over M. The Levi form Lx at x E M, defined only up to a multiplier, is the quadratic form on HxM defined by LAu, v) = dO(u, v) for u, VE HxM, where IJ is a nonvanish- ing section of E in a neighborhood of x. If the Levi form is nondegenerate at every point of M, M is called a nondegenerate hypersurface. In particular, if the Levi form is definite, then M is called strictly pseudoconvex. B. Equivalence Problem Cartan studied the equivalence problem for the case n = 2, and obtained a criterion for two hypersurfaces in C 2 to be pseudoconformally equivalent. N. Tanaka (1965) generalized the method of Cartan for the case n:;d and ob- 1278 tained a criterion in terms of tCartan connec- tions in some fiber bundle over the hypersur- faces. However, he did not publish the proof of his result until S. S. Chern and J. Moser [4J, in- dependently of Tanaka [3J, obtained a similar result and gave the first proof of this result. Let M be a real analytic hypersurface in cn+1 (n? 1) whose Levi form has p positive and q negative eigenvalues (p + q = n). Let H be the subgroup of SU(p+ l,q+ 1) leaving the point (1,0, "', 0)ECn+2 fixed. According to the Cartan-Tanaka-Chern-Moser result, we can construct functorially a principal fiber bundle Y over M with structure group H and a Car- tan connection W on Y with values in the Lie algebra of SU(p+ l,q+ 1) such that if M and M' are pseudoconformally equivalent, then there is a bundle isomorphism qJ of Y to Y' preserving the Cartan connections: qJ*w' = w, where Y' is the corresponding principal fiber bundle over M' and w' is the Carta,n connec- tion on Y'. Conversely, if there is a bundle isomorphism qJ of Y to y' such tha t qJ*w' = w, then M and M' are pseudoconformally equiva- lent. By using this solution of the equivalence problem, we can prove that the group A(M) of all pseudoconformal automorphisms of a nondegenerate real analytic hypen.urface M in a complex manifold X of dimension n is a Lie transformation group of dimension not exceed- ing n 2 + 2n. H. Jacobowitz [5J con8tructed a similar bundle B over M and a Cartan con- nection on B in a different way from that of Chern and Moser. We do not know whether Band Y actually coincide. C. Classification Cartan (1932) classified all simply connected homogeneous hypersurfaces in C 2 . In partic- ular, he proved that if M is a compact homo- geneous strictly pseudoconvex hypersurface with dim M = 3, then M is pseudoconformally equivalent to either (1) S3 or its quotient by the action of a root of unity or (2) the hyper- surface given in the 2-dimensional projective space by the equation in homogeneous coordi- nates: (ZIZI +Z2Z2+Z3Z3)2 = m2Izi+z+z12 (m> 1) or the double covering of such a sur- face. A. Morimoto and T. Nagano [6J and later H. Rossi [7J tried to generalize this result and obtained a partial classification of simply connected compact homogeneous hypersurfaces with dimension? 5 D. Burns and S. Shnider [8J classified all simply connected compact homogeneous strictly pseudoconvex hypersurfaces M with dimM = 2n + I ? 5. They proved that M is pseudo- conformally equivalent to s2n+1 or the tangent sphere bundle of a rank one tsymmetric space 
1279 or the unit circle bundle of a homogeneous negative line bundle over a homogeneous algebraic manifold. On the other hand, a real hypersurface M in a complex manifold X of complex dimension n+ 1 is called spherical if at every point pEM, there is a neighborhood of p in X such that un M is pseudoconformally equivalent to an open sub manifold of s2n+1. The hyperquad- ric Qn+1 =au n + 1 is spherical, where U n +1 = {(z j,"', Zn+1)E C"+1/ Im(zn+1) > IZ112 +... + IZnI2}. If M is spherical, then the universal covering space M of M is also spherical. If M is a homogeneous spherical hypersurface, then there is a covering into mapping I: M ->s2n+1, and I(M) is a homogeneous domain in s2n+1. We know that the only compact simply con- nected spherical M is s2n+1. Burns and Shnider classified all homogeneous domains M in S2n+1: M is pseudoconformally equivalent to (I) or (II) of the following: (Ia) S2n+1_ vn S2n+1, where V is a complex vector sub- space of C"+1 with 0  dime V  n. (Ib) Qn+1 - Lm . 0, 0  m  n, where Lm is a certain sub- group of SU(n + 1, l)j(center). (II) S2n+1_ s2n+1 n R n + 1 . At present, it seems difficult to extend Cartan's classification of all simply connected homogeneous hypersurfaces to higher dimensions. K. Yamaguchi (1976) treated a hypersurface M in a complex mani- fold of dimension n with a large automorphism group A(M). He showed that if dimA(M) = n 2 + 2n, then M is a real hyperquadric in the n-dimensional complex projective space Pn C ( Section B). He then showed that the sec- ond largest dimension for A(M) is equal to n 2 + 1 except when n = 3 and the index r = 1, for which dimA(M)= 11( =n 2 + 1). Under the additional assumption that M is homoge- neous, he showed that if dimA(M)=n 2 + 1, then M is the affine part of a real hyperquadric in Pn C (except when n = 5 and r = 2). He also obtained a similar result in the nonhomoge- neous case. D. Relations to Other Equivalences Let D 1 and D 2 be bounded domains in C" with smooth boundary aD; = M j (i= 1,2) for which we denote by H(M j ) the CR structures. We consider the following propositions (A)-(D) for these domains: (A) D 1 is biholomorphically equivalent to D 2 . (B) M 1 is CR equivalent to M 2 . (C) M 1 is pseudoconformally equivalent to M 2 . (D) There is a diffeomorphism 1:15 1 ->15 2 such that II D, :D 1 ->D 2 is biholomorphic. It is clear that (C) implies (D) and that (D) implies (A). On the other hand, we can prove that (B) is equivalent to (D). When does (A) imply (B) and when does (B) imply (C)? 344E Pseudoconformal Geometry C. Fefferman [1 OJ proved that (A) implies (D) when D 1 and D 2 are strictly pseudoconvex. S. Bell generalized the result of F efferman in the case when one of D 1 and D 2 is strictly pseudoconvex. Bell and E. Ligocka [IIJ proved that if M 1 and M 2 are real analytic and if D 1 and D 2 are pseudoconvex, then (A) im- plies (D). When D 1 and D 2 are not strictly pseudoconvex and M j is not real analytic, we do not know whether (A) implies (D) or not. As remarked by Burns, Shnider, and Wells (1978), by using the theorem of Fefferman, we can prove that (A) implies (C) when M 1 and M 2 are real analytic and if D 1 and D 2 are strictly pseudoconvex. I. Naruki [12J obtained the same result. We do not know whether (A) implies (C) when M 1 and M 2 are real analytic and D 1 and D 2 are pseudoconvex, though we know that (A) implies (B). We do not know whether (B) implies (C) in general. S. I. Pinchuk [13J proved the following: Let D, D' be strictly pseudoconvex domains in C" with simply connected real analytic bound- aries aD, aD'. Let I: U ->C" be a nonconstant holomorphic mapping from a connected neigh- borhood U of a point pE aD in C" into C" such that I(U n aD) c aD'. Then we can find a holo- morphic mapping l: D-> D' such that J(x) = I(x) for xEDn U. Combining this theorem with Fefferman's result we see that for two domains as above, D is biholomorphically equivalent to D' if and only if aD is locally pseudoconformally equivalent to aD', i.e, there are neighborhoods U and V of a point pEaD and qE aD', respectively, such that un aD and vn aD' are pseudoconformally equivalent. Concerning the tproper holomorphic map- pings rather than diffeomorphisms, Burns and Shnider (1979) proved the following theorem: Let M j = aD; (i= 1,2) be strictly pseudoconvex, and let I: D 1 -> D 2 be a proper holomorphic mapping. (a) If D 1 = D2' then I extends smoothly up to the boundary D 1 . (b) If aD j is real analy- tic for i = 1, 2, then I extends holomorphically past the boundary. E. Deformations of Domains Let M be a compact connected strictly pseudo- convex real hypersurface in a complex mani- fold X of dimension n + 1. Let q> be a smooth strictly tplurisubharmonic function defined on a neighborhood V of M such that M = {XE VI q>(x)=O} and dq>#O on M. Let U = {x E VI -8 < q>(x) < 8} for small 8> 0 such that U is compact and a U is smooth. Let £iP( U) be the open set in COO(U) of strictly plurisub- harmonic functions t/t with dt/t A at/t A (dat/t)n #- 0 on U. Let Be R k be a small open ball around O. We denote by £iP(U x 8) the set of t/tECOO(U 
344F Pseudoconformal Geometry x B) such that t/J(x,t) = t/J,(x)E!l'( U) for all tE B. For t/JE!?J(U x B) we set M,..={XE Ult/J,(x)= b}. After introducing these notations, Burns, Shnider, and Wells (1978) proved the following theorem. There exists an open dense set y c &'( U x B) with qJ E Y and a set of tsecond category i3f c y such that for every t/JE i3f, tiEB and biER small enough, (i) M,!..! is CR- equivalent to M".., if and only ift, =t 2 , b 1 = b 2 . (ii) The group of CR-automorphisms of M,!..! reduces to the identity only. For t/JE &'(U x B), taking tEB, bER small enough, M,.. is a compact connected strictly pseudoconvex hypersurface in X. If M bounds the relatively compact region D in X then M,.. also bounds a relatively compact region D,... In particular, there exist smooth families of deformations of the unit ball in e+' of arbitrary high di- mension. There are arbitrary small perturba- tions of the unit sphere in e+ 1 that admit no pseudoconformal transformations other than the identity. F. Topics Related to Pseudoconformal Geometry (1) Pinchuk (1975) proved the following: Let D1' D 2 be strictly pseudoconvex domains in c n with CW boundary oD 1 , oD 2 . Let U be a neigh- borhood of a point pEoD 1 in e. If there is a C 1 -mappingf: un D 1 -+C" such that f is holomorphic on UnD 1 andf(UnoDr}coD2, then there is a holomorphic mapping]: U'-+ e of a neighborhood U' of un oD! into C" such that ](x) = f(x) for x E un D 1 n U'. This result is related to the implication (B) => (C) in Section D. (2) H. Alexander [14] proved the follow- ing: Let U be a connected neighborhood of a point PES 2n - 1 in c n and f: U -+c n a holomor- phic mapping such that f( un s2n-1) c S2n-1. Then either f is a constant mapping or there is a biholomorphic automorphism ]:Bn-+Bn of the unit open ball Bn such that ](x) = f(x) for x E U n Bn. He also proved that every proper holomorphic mapping f: Bn-+Bn is necessarily an automorphism of Bn if n > 1. G. M. Henkin (1973) proved that every proper holomorphic mapping f: D 1 -+ D 2 of a strictly pseudoconvex domain D 1 into a strictly pseudoconvex D 2 can be extended continuously to a function]: D 1 -+ D 2 . More precisely, there is a constant c>O such that If(z rJ - f(z2)1:S; clz, -z21 1 / 2 for every ZH Z2ED!. (3) Let M be a real hypersurface in e+ 1 with H(M) the bundle ofholomorphic tan- gent vectors to M. We take a real non vanish- ing I-form 0 that annihilates H(M). S. M. Webster (1978) called the pair (M, 0) a pseudo- Hermitian manifold. He considered the equiv- 1280 alence problem of pseudo-Hermitian mani- folds by applying Cartan's method of equiv- alence. He proves, among other things, that the group of all pseudo-Hermitian transforma- tions of the non degenerate pseudo-Hermitian manifold (M, 0) of dimension 2n + I is a Lie transformation group of dimension not ex- ceeding (n+ 1)2. Webster considered the rela- tion between pseudo-Hermitian manifolds and pseudoconformal geometry and proved that for n? 2 the ellipsoid E given by the equa- tion A,xi+B!Yi+... + A n +1 x ;+1 + B n +1Y;+1 = 1, where Zk=X k + iYk (k= 1, ..., n+ I) is pseudo- conformally equivalent to the hypersphere s2n+1 if and only if Ak=Bk (k= 1,.. ,n+ I). This result gives, by virtue of Fefferman's theorem, a necessary and sufficient condition for an ellipsoidal domain to be biholomorphi- cally equivalent to the unit ball. References [1] H. Poincare, Les fonctions analytiques de deux variables et la representation ,onforme, Rend. Circ. Mat. Palermo, 23 (1907),185-220. [2] E. Cartan, Sur la geometrie pseudo- conforme des hypersurfaces de I'espace de deux variables complexes I, II, Ann. Mat. Pura Appl., 11 (1932),17-90; Ann. Scuola Norm. Sup. Pisa, 1 (1932), 333-354. [3J N. Tanaka, On nondegenerate real hyper- surfaces, graded Lie algebras and Cartan connections, Japan. 1. Math., 2 (1976), 131- 190. [4J S. S. Chern and 1. Moser, Real hypersur- faces in complex manifolds, Acta Math., 133 (1974),219-27\. [5J H. Jacobowitz, Induced connections on hypersurfaces in e, Inventiones Math., 43 (1977),109-123. [6J A. Morimoto and T. Nagano, On pseudo- conformal transformations of hypersurfaces, 1. Math. Soc. Japan, 14 (1963), 289-300. [7J H. Rossi, Homogeneous strongly pseudo- convex hypersurfaces, Rice Univ. Studies, 59 (1973),131-145. [8J D. Burns and S. Shnider, Spherical hyper- surfaces in complex manifolds, Inventiones Math., 33 (1976), 223-246. [9J K. Yamaguchi, Non-degenerate hypersur- faces in complex manifolds admitting large groups of pseudoconformal transformations I, Nagoya Math. J., 62 (1976), 55-96. [IOJ C. Fefferman, The Bergman kernel and biholomorphic mappings of pseudoconvex domains, Inventiones Math., 26 (1974),1-65. [IIJ S. Bell and E. Ligocka, A simplification and extension of Fefferman's theorem on biholomorphic mappings, Inventiones Math., 57 (1980), 283-289. 
1281 [12] I. Naruki, On extendability of isomor- phisms of Cartan connections and biholomor- phic mappings of bounded domains, Tohoku Math. J., 28 (1976),117-122. [13] S. I. Pinchuk, On holomorphic mappings of real analytic hypersurfaces, Math. USSR- Sb., 34 (1978),503-519. [14] H. Alexander, Proper holomorphic map- pings in Co, Indiana Math. J., 26 (1977), 137- 146. [15] S. M. Webster, Pseudo-Hermitian struc- ture on a real hypersurface, J. Differential Geometry, 13 (1978), 25-41. 345 (XIII.33) PseudodifferentialOperators A. Pseudodifferential Operators Pseudodifferential operators are a natural extension of linear partial differential opera- tors. The theory of pseudodifferential opera- tors grew out of the study of singular integral operators, and developed rapidly after 1965 with the systematic studies by J. 1. Kohn and L. Nirenberg [1], L. Hormander [2], and others. The term "pseudodifferential operator" first appeared in Kohn and Nirenberg [1]. Let P be a tlinear partial differential opera- tor of the form P=p(x,Dx)= I a.(x)D, 1.I';m and let u(x) be a function of class CO'(Q) ( c CO'(R"». Then by means of the tFourier inversion formula, Pu(x) can be written in the form PU(x) = (2n)-"/2 [ exp(ix')p(x,)u()d, (2) JRn where u() denotes the tFourier transform of u(x) ( 160 Fourier Transform H). But this representation of Pu(x) has a meaning even if p(x,) is not a polynomial in . Thus, for a general function p(x, ), the pseudodifferential operator P = p(x, Dx) with the symbol p(x,) is defined by (2). A symbol class is determined in accordance with various purposes, but it is always required that the corresponding opera- tors have essential properties in common with partial differential operators. Horman- der [3] defined a symbol class S;'d(Q) for real numbers m, p, and b with p  0 and b  0 in the following way: Let p(x,) be a COO-function defined in Q x R". If for any pair of multi- indices iX, {3 and any compact set K c R", there 345 A Pseudodifferential Operators exists a constant C.. fJ . K such that IDDtp(x, )I  C.,fJ,K(l + IWm+dl.i-PlfJl, xEK, ER", then p(x,) is said to be of class S;'d(Q). The operator P defined by (2) is called a pseudo- differential operator (of order m) of class S;'d(Q) and is often denoted by P= p(x, Dx)E S;'d(Q). When Q=R" and constants C.,fJ,K= C.,fJ are independent of K, we denote S;'d(R") simply by S;'d' and set S-oo= n Si.o ( = n S;'d ) , -oo<m<oo -oo<m<oo S;'d = U S;'d' -oo<m<oo Differential operators (1) with coefficients of class P4 ( 168 Function Spaces B(13» belong to S,o' The complex power (1- y/2 of 1-  = 1- I.j=l 0 2 lax] is defined as a pseudo- differential operator of class St by the sym- bol (1 + 112y/2. Operators of class S;'d are continuous mappings of f/ into f/. Therefore, for any real s, the operator (1_)'/2 can be uniquely extended to be a mapping of f/' into f/' by the relation «1-Sf/2U, v)= <u,(I-M/ 2 v), UEf/', VEf/. For any 1 r 00 and real s, the tSobolev space H S " is defined by H S " = {UEf/' I (1- )S/2UE L,(R")}, (1) which is a Banach space provided with the norm lIulls,,= 1I(1-)'/2uIIL,. In particular, HS=Hs,2 is a Hilbert space with the norm Ilull s = Ilull'.2' Set H-oo.,= U H S ", H-oo=H-oo. 2 , -oo<s<oo H OO ,,= n H S ", H oo =H oo ,2. - 00 <5< 00 Then f/' => H-OO =>tff', H-OO => L 2 (R") =>Hoo( cP4). Choosing the Hormander class S;'d in the case Obp 1 and b < 1 as a model class, we here list the main results of the theory of pseudodifferential operators: (i) Pseudolocal property. The operator P of class S;'d in general does not have the local property U E f/' => supp Pu c supp u, but if p > 0, then P has the pseudolocal property uEf/' => singsuppPucsingsuppu [3]. (ii) Algebra of pseudodifferential operators. Let P=p(X,Dx)ES;'d and P.J=pix,Dx)ES;;d' j = I, 2. Then there exist P* = p*(x, DX)eS;'d and Q=q(X,D x )ES;,'d+ m2 such that (Pu, v)= (u, P*v) for u, VEf/, i.e., P* is the formal ad- 
345 A Pseudodifferential Operators joint of P, and Q=P 1 P 2 . Furthermore, if we set p(x, )=D(iD{)" p(x, ) and qa(x, 0= (iD{)a p1 (x, ODP2(X, ), then for any integer N we have 1 p*(x,)- I ,P:(X,)ES;'(P-,j)N (3) lal<NO(. and 1 q(x,)- I ,qa(X,)Es;,+m2-(p-")N. (4) lal<NO(. Hence the operator class S';. is an algebra in the sense PES';", ES;'j", j=I,2, => p* E S,;", PI + P 2 E S;', PI P 2 E S;' +m 2 , where m o =max(mj,m 2 ). In particular, ifO fJ<p I, we have m-(p-fJ)N---+ -00 and m j +m 2 -(p-fJ)N---+-00 as N---+oo. Then, we say that p*(x,) and q(x, 0 have asymptotic ex- pansions in the sense of (3) and (4), respectively: 1 p*(x, ) I ,p:(x,) a 0(. and 1 q(x, ) I ,qa(x,) a 0(. [3,4]. (iii) H'-boundedness. For PES';/j and any real s there exists a constant C, such that IIPullsCsllulls+m' UEW+ m [4]. (iv) A sharp form of Garding's inequality. Let p(x, )=(Pjk(X, );j, k= 1,..., I) be a Her- mitian symmetric and nonnegative matrix of hk(X, )E S;,,,. Then for P = p(x, DxJ there exists a constant C such that Re(Pu,u):;:, -qullm-(p-/j))/2' where u =(u 1 ,..., u 1 ) with u j EH m / 2 ,j= 1,..., I, and Ilull/2 = LJlllujll/2' (v) Invariance under coordinate transfor- mations. Assume that °  1 - p  fJ  p  1. Let x(y) =(xdY), ..., xn(y)) be a Coo-coordinate transformation from R; onto R: such that axdy)/2YjE,j, k= 1,..., n, and C- 1  Idet(oyx(y)) I  C for a constant C > 0, where det«7yx(y)) denotes the determinant of the Jacobian matrix (Oyx(y)) = (l3xdy)/rJYj)' Then for any P = p(x, Dx)E S;, /j in R:, there exists an operator Q = q(x, Dx) E S;,,, in R; such that (Qw)(y)=(Pu)(x(y)) for w(y)=u(X(y))E.'I'. This fact enables us to define pseudoditTerential operators on COO-manifolds [3,4]. (vi) Parametrix. For a given PES';", an operator E E S;:" is called a left (resp. right) parametrix of P if EP-I (resp. PE-l) is of class S- 00. If E is a left and right parametrix of 1282 P, we call it a parametrix of P. For a differen- tial operator P, the existence of a \eft (resp. right) parametrix is a sufficient condition for P to be thypoelliptic if p > 0, (resp. the eq uation Pu = f E 0!' is locally solvable). The estimate (5), in particular, when m=s= 0, has been obtained by Hormandn [3J, V. V. Grushin (Functional Anal. Appl., 4 (1970)), H. Kumano-go (J. Fac. Sci. Univ. Tokyo, 17 (1970)) when O fJ <p  1, and A. P. Calderon and R. Vaillancourt [5J, H. O. Cordes (J. Functional Anal. 18 (1975)), T. Kato (Osaka J. Math., 13 (1976)), Kumano-go [4J, and others when °  fJ  p  1 and fJ < 1 A sharp form of Garding's ineq uality has been proved by Hormander [6J, P. D. Lax and L. Niren- berg [7], and sharpened by A. Melin (Ark. Mat., 9 (1971)), C. FetTerman and D. H. Phong (Proc. Nat. Acad. Sci. US, 76 (19791), and Hormander [8]. A general sufficient condition for the existence of a parametrix for an opera- tor of class S;,,, was obtained by Hormander [3]. Let P=p(x, Dx) belong to S;,,, with O fJ < p  1. Assume that the symbol p(x,) satis- fies the fol1owing conditions: (i) for some Co> 0, real m' (  m), and R > 0, we have Ip(x,) I:;:, Collm' (II:;:'R); (ii) for any 0(, fJ there exists a constant C o . a . p such that IDD{p(x, Wp(x, )I  Co.a.pl I"IPI-plal, II:;:,R. (5) Then there exists a parametrix Q = q(x, Dx) of P in the class s;f. By means of operators of class S. o(Q) we can define the wave front set ofuE@'(Q), which enables us to resolve singsuppu on T*(Q) ,,0, the cotangent bundle of Q minus its zero section. An operator P = p(x, Dx) E S. o(Q) is said to be microlocally elliptic at (xo, O)E T*(Q) ,,0 if lim,oo Ip(xO, To)I/ITOlm > 0. For a distribution uE0!'(Q), we say that a point (xo, o) of T*(Q) ,,0 does not belong to the wave front set (or the singular spec1rum) of u, denoted by WF(u), if there exist a(x), b(X)E C;;o(Q), a(xo)#O, b(xO)#O, and PES.o(Q), which is microlocally elliptic at (xc, O), such that aPbUE Co(Q). Then we easily see that WF(u) is a closed conic subset of T*(Q) ,,0. An important fact is that the relation smgsuppu = Projx WF(u) (the projection ofWF(u) on Q) holds, from which we can perform a so-called microlocal analysis, the analysis on T*(Q)"O, of sing suppu. As the sharp form of the pseudo- local property of an operator PE S;, " , if °  fJ < p  1, P has the micro-pseudolocal prop- erty: UE.'I" => WF(Pu)c WF(u). PseudoditTerential operators of multiple symbol have been defined by K. O. Friedrichs (Courant lnst., 1968) and Kumano-go [4]. More refined and useful classes of pseudo- (6) 
1283 differential operators have been defined by R. Beals (Duke Math. J., 42 (1975)) Hormander [8], and others. The theory of pseudodifferential operators has found many fields of application, such as M. F. Atiyah and R. Bott (Ann. Math., 86 (1967)) on the tLefschetz fixed-point formula; Friedrichs and P. D. Lax (Comm. Pure Appl., 18 (1965)) on symmetrizable systems; Horman- der [6], Yu. V. Egorov (Russian Math. Surveys, 30 (1975)) on subelliptic operators; Kumano- go (Comm. Pure Appl. Math., 22 (1969)), F. Treves (A mer. J. Math., 94 (1972)), S. J. Alin- hac and M. S. Bouendi (Amer. J. Math., 102 (1980)) on uniqueness of the Cauchy problem; S. Mizohata and Y. Ohya (Pub I. Res. Inst. Math. Sci., 4 (1968)), Hormander (J. Analyse Math., 32 (1977)) on tweakly hyperbolic equa- tions; C. Morawetz, J. V. Ralston, and W. A. Strauss (Comm. Pure Appl. Math., 30 (1977)), M. Ikawa (Pub. Res. Inst. Math. Sci., 14 (1978)) on the exponential decay of solutions; and Nirenberg and Treves [16], Beals and Feffer- man [17] on local solvability theory. For recent developments in the theory of pseudodifferential operators and its applica- tions  Kumano-go [4], M. Taylor (Prince- ton Univ. Press, 1981), Treves (Plenum, 1981), and others. B. Fourier Integral Operators A Fourier integral operator A: Co(Rn)->'(Rn) is a locally finite sum of linear operators of the type Af(x) = (2n) -(n+N)/2 r exp(iqJ(x, 8, y)) JR N + n xa(x, 8, y)f(y)dyd8. (7) Here a(x, 8, y) is a COO-function satisfying the inequality I IY;VC D;a(x, 8, y)1  C(1 + 18I)m-pIPI+(1-p)(lal+IYI) for some fixed m and p, 1/2  p  1, and any triple of multi-indices iX, {3, y, and for qJ(x, 8, y) a real-valued function of class Coo for 8 #- ° and homogeneous of degree 1 in 8 there. The func- tion qJ is called the phase fnnction and a the amplitnde function. Let Cq>={(x,8,y)ld o qJ(x,8,y)=0, 8#-0} and w= {(x,y)ER n x R n 138#-0 such that(x, 8,y)E Cq>}' If dx.o.yqJ(x, 8, y) #-0 for 8#-0, then the kernel distribution k(x, y) of A is of class Coo outside W. There have been detailed studies of the case where the d x . o j8qJ(x, 8, y)/88), j = 1,2, ..., N, are linearly independent at every point of Cq>' In this case, Cq> is a smooth manifold in Rn x (R N ,,0) x R n , and the mapping <1>: Cq>3(X, e, y)->(x, y,, 1]),  =dxqJ(x, 8, y), I] = 345B Pseudodifferential Operators dyqJ(x, 8, y), is an immersion of Cq> to T*(R n x R n ) ,,0, the cotangent bundle of R n x R n minus its zero section. The image <1>Cq> = Aq> is a conic Lagrange manifold, i.e., the canonical 2-form (j= Ljdj/\dxj- Ljdl]j/\dYj v?Lnishes on Aq> and the multiplicative group of positive num- bers acts on Aq>' Let AI, ... , A 2n be a system of local coordinates in Aq>' These, together with 8qJ/88j, ..., 8qJ/88 N , constitute a system of local coordinate functions of Rn x (R N ,,0) x Rn in a conic neighborhood of Cq>' Let J denote the Jacobian determinant ( 8qJ 8qJ ) D AI, ...,A2n'aB;:' ''''aB; D(x,8,y) The function aA =jJ alc <1>-1 is called the local symbol of A. Here alc. is the restriction of . a to Cq>' The conic Lagrange manifold Aq> = Aq>(A) and the symbol a A =a A (A) essentially determine the singularity. of th kernel distri- bution k(x, y) of the Fourier integral operator A. Conversely, given a conic Lagrange mani- fold A in T*(Rn x Rn)"O and a function a A on it, one can construct a Fourier integral operator A such that Aq>(A)=A and aA.(A)= a A . Those Fourier integral operators whose associated conic Lagrange manifolds are the graphs of homogeneous tcanonical transfor- mations of T*(Rn)"O are most frequently used in the theory of linear partial differential equa- tions. Let A be a Fourier integral operator such that Aq>(A) is the graph of a homoge- neous canonical transformation X. Then the adjoint of A is a Fourier integral operator such that the associated conic Lagrange mani- fold is the graph of the inverse transformation X -I. Let Al be another such operator; if the associated conic Lagrange manifold is the graph of XI' then the composed operator Al A is also a Fourier integral operator and the associated conic Lagrange manifold is the graph of the composed homogeneous canon- ical transformation XIX. A pseudodifferential operator of class S;;'.1 p(Rn) is a particular type of Fourier inte- gral operator. In fact, a Fourier integral oper- ator A is a pseudodifferential operator of class S; l_p(Rn) if and only if Aq>(A) is the graph of the identity mapping of T*(R n ) ,,0. Hence for any Fourier integral operator A, A * A and AA * are pseudodifferential operators. The following theorem is due to Egorov [11]: Let P and Q be pseudodifferential opera- tors of class S;;'.I_p(R n ) with the symbols p(x, ) and q(x, ), respectively, and let A be a Fourier integral operator such that the asso- ciated conic Lagrange manifold Aq>(A) is the graph of a homogeneous canonical transfor- mation X of T*(Rn)"O. If the equality P A = AQ 
345 Ref. Pseudo differential Operators holds, then q(x, 0 - p(X(x, 0) belongs to the class s;,t2p(Rn). Assume that m = 1, p = 1, and that PI (x, () is a real-valued C"-function, homogeneous of degree 1 in ( for I (I> 1, such that p(x, () -pdx, )ESf.o(Rn) and dpdxo, (0) #0 at (XO, O), Iol > 1, where PI (XO, (°)=0. Then one can find a Fourier integral operator A such that the function q(x, 0 of Egorov's theorem satisfies the relation q(x, () -, E Sf. o(Rn). The theory of Fourier integral operators has its origin in the asymptotic representation of solutions of the wave equation ( 325 Partial Differential Equations of Hyperbolic Type L; also, e.g., [12,13, 14J). G. 1. Eskin (Math. USSR-Sb., 3 (1976)) used a type of Fourier integral operator in deriving the energy esti- mates and constructing the fundamental solu- tions for strict hyperbolic operators. Hor- mander (Acta Math., 121 (1968)) introduced the term "Fourier integral operators," and applied these operators to the derivation of highly accurate asymptotic formulas for spec- tral functions of el1iptic operators. Egorov (Math. USSR-Sb., 11 (1970)) applied his theo- rem and the corol1ary stated above to the study of hypoel1ipticity and local solvability for pseudodifferential operators of principal type. Using Egorov's theorem and the same corol1ary, Nirenberg and Treves [16J obtained decisive results concerning local solvability for linear partial differential operators of principal type; these results were completed by Beals and FetTerman [17]. Hormander and J. 1. Duistennaat [9, 15J constructed a general global theory of Fourier integral operators making use of Maslov's theory [14J, which was original1y published in 1965. By virtue of this research, the Fourier integral oper- ator has come to be recognized as a powerful tool in the theory of linear partial differential operators. An interesting application of the global theory of Fourier integral operators appeared in J. Chazarain (lnventiones Math., 24 (1974)). The boundedness of Fourier inte- gral operators in the spaces L 2 (R n ) (or the space H S ) has been studied in several cases. Some sufficient conditions for boundedness have been obtained by Eskin (Math. USSR- Sb., 3 (1967)), Hormander [9J, D. Fujiwara [18J, Kumano-go (Comm. Partial DifJ Eq., 1 (1976)), K. Asada and Fujiwara (Japan. J. Math., 4 (1978)), and others. A calculus of Fourier integral operators in R n was given in Kumano-go [4]. The propagation of wa ve front sets by means of a Fourier integral operator is de- scribed as fol1ows. Let us consider a phase function of the form <p(x, (, y) = S(x, () - y' ( in R x R x R;, and let a(x, 0 be an amplitude 1284 function independent of y of class S.o. Then by (7) the Fourier integral operator A = As is defined by A s u(x)=(2n)-n/2 r exp(iS(x, O)a(;(, Ou(Od(. JR" (8) Let T be the canonical transforma1:ion with the tgenerating function S(x, (), i.e., T is de- fined by y='YS(x,I'/), ='YxS(x,I'/). Then for the Fourier integral operator As we have WF(Asu) c {(x, () = T(y, 1'/) I (y, I'/)E WF(u)}, UE/I". (9) Next consider a hyperbolic operator L=D,+ p(t, x, DxJ for a real-valued symbol p(t, x, ()E .'?6'°([0, ToJ; S:.o) with some To> O. For a smal1 0< TO;; To the solution S(t, x,) of the eikonal equation c,S+ p(t,x, 'YxS)=O on [0, TJ with the initial condition SI,o =X'  exists in fJlI ([0, TJ; Sl.o). Consider the Cauchy problem Lu=O on [0, T], ul,o=uo' Then t1ere exists an amplitude function e(t, x, ()EfJl I ([0, TJ; Sf.o) such that the solution u(t) is found in the form u(t) = Es(t)u o . On the other hand, let (x, ()=(X(t,y, 1'/), E(t, y, 1'/)) be the bichar- acteristic strip defined by tHamilton's ca- nonical equation dx/dt = 'Y p(t, x, (), d(/dt = - 'Y xp(t, x,) with (x, ()I'O = (y, 1'/). Then (X (t, y, 1'/), E(t, y, 1'/)) can be solved by means of the relations y='YS(t,X,I'/), E= 'YxS(t,X,I'/), as a family of canonical transformations with a parameter tE[O, T]. Thus by means of(9) we have WF(u(t))c {(x, ()=(X(t, y, I1),E(t,y, 1'/)) I (y,I'/)EWF(u o )}, (10) which is the fundamental result in the study of the propagation of wave front sets as solutions of general hyperbolic equations (-. 325 Partial Differential Equations of Hyperbolic Type M). The works of Egorov, Nirenberg and Treves, and Hormander motivated the theory of hyperfunctions developed by M. Sato and gave rise to the concept of tquantized contact transformations, which correspond to Fourier integral operators in the theory of distribu- tions. The above-stated transformation theo- rem of Egorov has been studied in detail with reference to systems of pseudodifferential equations with analytic coefficient:; [19] ( 274 Microlocal Analysis). References [1] J. 1. Kohn and L. Nirenberg, An algebra of pseudo-differential operators, Cornm. Pure App1. Math., 18 (1965), 269305. 
1285 [2] L. Hormander, Pseudo-differential opera- tors, Comm. Pure App1. Math., 18 (1965), 501-517. [3] L. Hormander, Pseudo-differential opera- tors and hypoe11iptic equations, Amer. Math. Soc. Proc. Symp. Pure Math., 10 (1967), 138- 183. [4] H. Kumano-go, Pseudo-differential oper- ators, MIT Press, 1981. (Original in Japanese, 1974.) [5] A. P. Calderon and R. Vail1ancourt, A class of bounded pseudo-differential operators, Proc. Nat. Acad. Sci. US, 69 (1972), 1185- 1187. [6] L. Hormander, Pseudo-differential opera- tors and non-e11iptic boundary problems, Ann. Math., 83 (1966), 129-209. [7] P. D. Lax and L. Nirenberg, On stability for difference schemes, a sharp form of Gard- ing's inequality, Comm. Pure App1. Math., 19 (1966),473-492. [8] L. Hormander, The Weyl calculus of pseudo-differential operators, Comm. Pure App1. Math., 32 (1979), 359-443. [9] L. Hormander, Fourier integral operators I, Acta Math., 128 (1971), 79-183. [10] J. J. Duistermaat, Fourier integral oper- ators, Lecture notes, Courant Institute, 1973. [11] Yu. V. Egorov, On canonical transfor- mations of pseudo-differential operators (in Russian), Uspekhi Mat. Nauk, 25 (1969),235- 236. [12] P. D. Lax, Asymptotic solutions of oscil- latory initial value problems, Duke Math. J., 24 (1957),627-646. [13] D. Ludwig, Exact and asymptotic solu- tions of the Cauchy problem, Comm. Pure App1. Math., 13 (1960), 473-508. [14] V. P. Maslov, Theorie des perturbations et methodes asymptotiques, Gauthier- ViI1ars, 1972. [15] J. J. Duistermaat and L. H6rmander, Fourier integral operators II, Acta Math., 128 (1972),183-269. [16] L. Nirenberg and F. Treves, On local solvability of linear partial differential equa- tions I, II, Comm. Pure App1. Math., 23 (1970), 1-38,459-510. [17] R. Beals and C. Fefferman, On local sol vability of linear partial differential equations, Ann. Math., (2) 97 (1973), 482- 498. [18] D. Fujiwara, On the boundedness of integral transformations with highly osci11a- tory kernels, Proc. Japan Acad., 51 (1975), 96-99. [19] H. Komatsu (ed.), Hyperfunctions and pseudo-differential equations, Lecture notes in math. 287, Springer, 1973. 346C Psychometrics 346 (XVIII.17) Psychometrics A. General Remarks Psychometrics is a co11ection of methods for drawing statistical conclusions from vari- ous psychological phenomena which are ex- pressed numerica11y or quantitatively. It con- sists chiefly of statistical methods to deal with psychological measurements and of theories dealing with mathematical models concerning learning processes, social attitudes, and mental abilities. B. Sensory Tests A measurement wherein human senses are taken as the gauge is ca11ed a sensory test. The panel of judges must be composed appropri- ately, and the examining circumstances must be contro11ed. Various methods of psycholog- ical measurements are applied. In the fo11owing sections we describe the basic statistical proce- dures used in sensory testing. C. Paired Comparison When there are t objects (treatments or stimuli in some cases) 0 1 , 02' ...,0" the method of comparing them two at a time in every pos- sible way is ca11ed paired comparison. The fo11owing are typical mathematical models of this method. (1) Thurstone-Mosteller Model. Suppose that the probability that Oi is preferred to OJ for a pair (Oi' OJ) is Pij' Of the n judges who compare this pair, the number who prefer Oi is nij' and the number who prefer OJ is n ji = n - nij. In this comparison it is assumed that the strengths of the stimuli 0i' OJ to the senses are random variables Xi' Xj, and Oi is preferred when Xi> Xj. Furthermore, it is assumed that the joint probability distribution of Xi and X j is the 2- dimensional tnormal distribution with l1i and (J2 as mean and variance of Xi' and p as corre- lation coefficient of Xi and Xj. There is no loss of generality in assuming that 2(J2(1- p) = 1 and I::=l l1i=O. Let <I>(x) be the standardized normal distribution function and pij=<I>(l1i- 11). Using Plj= nijln as estimates ofthe true Pij' we can obtain the estimates Pi. Using pij = <I>(Pi - p j ) and Plj we can test the hypothesis that 111 = 112 =... = 11,. (2) The Bradley-Terry Model. The experi- mental method in the Bradley-Terry model 
346D Psychometrics is the same as in the Thurstone-Mosteller model. It is postulated that, associated with 0 1 , O 2 , ... , 0" there exist parameters n; for 0; (n;;::'O, L:1 n;= 1) such that p;j=nj(n;+n). Obtaining the tmaximum likelihood estimator of n;, we can test the appropriateness of the models. (3) Scheffe's Model. Each pair 0;, OJ is pre- sented to 2n judges; n of them examine 0; first and OJ next, and the remaining n examine the pair in the opposite order. A judgment is re- corded on a 7-point (or 9-, 5-, or 3-point) scale. In the 7-point scaling system a judge pre- sented with the ordered pair (0;, 0) marks one of the seven points 3, 2, 1, 0, -1, - 2, - 3, meaning, respectively, that 0; is strongly pre- ferable to OJ' 0; is moderately preferable to OJ, 0; is slightly preferable to OJ, no preference, OJ is slightly preferable to 0;, etc. The mark given by the kth judge on his preference of 0; to OJ is denoted by X;jk' which can be regarded as the sum of a main effect, deviation of subtractivity, order effect, and error. Significance tests for these effects and estimates of various para- meters are given by using statistical tlinear models. tBIBD, tPBIBD, etc., can also be applied to paired comparisons. D. The Pair Test, Triangle Test, and Duo-Trio Test The pair test, triangle test, and duo-trio test are sensory difference tests. The methods are as follows. Pair test: A judge is requested to designate a preference between the paired samples A and B. Triangle test: A judge is requested to select two samples of the same kind out of A, A, B. Duo-trio test: A judge is first acquainted with a sample A and then is requested to choose from A and B the one he has seen in the previous step. In a1l the above cases, the hypothesis that A and B are different and that the judge has no ability to determine the difference between them is tested by using the tbinomial distribution. E. Scaling (1) One-Dimensional Case. Psychometric scaling methods are procedures for construct- ing scales for psychological phenomena. Some of them require judgments concerning a par- ticular attitude that is considered unidimen- sional. Under the assumption that a psycho- logical phenomenon is a random variable with some distribution law and the parameters of the distribution law determine psycholog- ical scales, a psychological scaling is given by estimating the parameters. The Thurstone- 1286 Mosteller model is a method for scaling a set of stimuli by means of observable proportions. (2) Multidimensional Scaling (MDS). Multidi- mensional scaling is a co1lection of methods to deal with data consisting of many measure- ments on many objects and to characterize the mutual distance (dissimilarity), or closeness (affinity), by representing those objects by a small number of indices or by points in a sma1l-dimensional Euclidean space. It has seen useful applications in the analysis of people's attitude and perception and their characteri- zations by means of a few numbers or points in a space of low dimension. Historically, MDS was first developed by Torgerson (1958) and refined further by Shepard (1962) and Kruskal (1964). The method developed by Torgerson and also the INDSCAL method by Carrol and Chang (1970) are called metric multidimensional scal- ing, while the method by Shepard and Kruskal is called the non metric MDS. The former is applied when the data are represented in con- tinuous scales and the latter when the data are in discrete nominal or ordinal scale;;. Tech- niques of multidimensional scaling are closely related and sometimes actua1ly equivalent to various methods of multivariate analysis, especia1ly principal component analysis, ca- nonical correlation analysis, and discriminant analysis ( 280 Multivariate Analysis). F. Factor Analysis Though factor analysis can be considered to be a method to deal with multivariate data in general ( 280 Multivariate Analysis), it has had close connections with psychometric studies, in both theoretical developments and applications. Historically, it was initiated by Spearman (1927) and developed further by Thurston (1945) in order to measure human abilities from test scores. Mathematically, the model of factor analysis is formulated as fol- lows: Let Zjk be the standardized score of the jth test achieved by the kth subject, j = 1, ... ,p; k = 1,..., N; then it is assumed that it can be represented as a linear combination of r com- mon factors and one specific factor as Zjk = Gjrflk+ G j2 f2k + ... + Gjrfrk + UjV;k' (1) where J;k' i = 1,..., r, k = 1, ..., N, represents the magnitude of the ith common factor (ability) in the kth subject and G j ; is the size of contri- bution of the ith factor to the score of the jth test. Usually it is assumed that (i) V(z) = 1, (ii) V(J;) = 1 and Cov(J;,J;.) =0 for i i= t, (iii) V(v) = 1 and Cov(v j , vj')= O,j i= j', (iv) Cov(J;, v)= o. 
1287 Then it fo11ows that r ii , = Cor(zj' zj')= a jl aj'1 +a j2 aj'2 +... +ajraj'n and 1 =aA +aJ2 +... + afr+ u/ =hJ + uJ, where hJ is ca11ed the communality of the Jth variable Zj and uJ is ca11ed the specificity. It fo11ows easily from (1) that any orthogonal transformation of the scores does not affect the model. Problems offactor analysis are classified into three types: (1) Estimation of communality: There are several methods of determining communality or initial estimates of it when some iterative procedure is used. (2) Determination of factor loadings, which is the estimation of the a j ;: A number of methods have been proposed, among which those often used are the MINRES by Harman (1967), the varimax by Kaiser (1958), and the tmaximum likelihood based on the normal model by Lawley and Maxwel1. (3) Estimation of factor scores J;k: Usua11y fac- tor scores are estimated after factor loadings have been determined. Thurstone proposed F=ZR-IA and Harman F=ZA(A'A)-r, where F, Z, A are the matrices offactor scores, test scores, factor loadings, respectively, and R is the correlation matrix of the z's. G. Learning Theory (1) General Description. Assume that a se- quence of trials is done in order to study some given behavior and that on each trial particular events occur (stimuli, responses, reinforcements, etc.) that influence the en- suing behavior. Then the behavior itself is modified by such a sequence of trials. Learning models refer to such processes of behavior modification, and they are frequently repre- sented by recursive formulas for response probabilities. Assume that two mutua11y exclusive re- sponse alternatives AI and A 2 occur on the nth trial (n = 1,2, ...) with respective probabilities Pn and 1- P., and that an event E i occurs on the nth trial with Pr(Bn = E;) = 11:; (i = 1,2, ..., t; L:=I 11:i= 1). Then the recursive formula for Pn is of the form P n +! = f(Pn; B., Bn-j, ..., B I ). If the formula can be written as f = f(Pn; B., B n - 1 , ... ,Bn-d)' then the response probability is ca11ed d-trial path dependent. In the special case d = 0, it is ca11ed path independent. For simplicity we write f(Pn; Bn = Ei' Bn-I = Ej' ..., (tl = Ek)=fij...k(P n ) (i,j, ...,.k=I,2, ...,t). If the response probability is path independent, then J;j...k(P n ) = J;Jj... fk(P1), where f.(Pn) = 346G Psychometrics f(Pn; Bn = E.) (v = i,j, ..., k). When the recursive formula can be expressed as f = f(Pn; B., n), the response probability is said to be quasi- independent of path [11]. In the recursive for- mula f, two events E i and Ej (i =I- j) are said to be commutative if f..i...j...(Pn) = f..j...i...(Pn)' If any two events are commutative, the condi- tion of event commutativity is satisfied. By makin f explicit with respect to n, we write Pn=F(n;Bn-1' ...,B1;Pr). Under the condition of event commutativity, the explicit formula can be written Pn=F(Nj,N 2 , ...,N,;Pr), where N i is the frequency of occurrence of E i in the first (n -1) trials (L:1 N i = n -1). If both event commutativity and path independence of re- sponse probability are satisfied, the explicit formula Pn = fIN 1 ff2 '" it'(PI) can be obtained. (2) Linear Models. In a linear model, the re- cursive formula is written as a linear function of Pn' Example (1). Bush-Mosteller model [6]. The Bush-Moste11er model assumes the response probability to be path independent. The re- cursive formula is expressed as J;(Pn) = !XiPn + (1 - !X;)Ai' Bn = E i . Here !Xi (0';;; !Xi';;; 1) represents the degree of ineffectiveness of E i for learning and Ai (0';;; A;';;; 1) is the tfixed point of J;. A necessary and sufficient condition for E i and Ej (i =I- j) to be commutative is that either J; or Jj be an tidentity operator or Ai = Aj. Example (2). Estes's stimulus-sampling model [7]. We can consider the stimulus as a set composed ofm elements, each of which corresponds to either response AI or A 2 ; the manner of their correspondences depends on each trial. If I n elements correspond to AI on the nth trial, then we have Pn=Jnlm. Sup- pose that on the nth trial s ( ,;;; m) elements are sampled, among which X n elements corre- spond to A2' and the remaining X (=s-X n ) elements correspond to AI' As a result of the nth trial, if AI is reinforced, we set Y" = 1; otherwise, we set Y" = O. Furthermore, assume that I n + 1 = I n + X n Y" - X(I- Y,,). Hence, we obtain the recursive formula P n +! = Pn + {X n Y" - X( 1- y")} 1m. In this model, the response probability is path independent and Bn = (X., y"). Other linear models have been pro- posed in which the response probability is either quasi-independent of path [8] or path dependent [10]. (3) Nonlinear Models. In nonlinear models the recursive formula cannot be written as a linear function of Pn' Example (3). Luce's p-model [9]. Let the response strengths of AI and A 2 on the nth trial be v n and v, respectively (both positive), and assume that P., the response probability of AI' is expressed as vn/(v n + v). The response 
346 Ref. Psychometrics strengths V n and v depend on each tria!. Under the assumption that the response strength is path independent and that V n + 1 changes in- dependently from v, the recursive formula of V n is written as V n +! = qJi(V n ), tff n = E i . Here, if we assume qJ;(v»O for v>O and qJi(CV)=CqJ;(v) for v>O and c>O, then qJi(Vn)=P;v n with Pi>O. In a similar way, the recursive formula for v can be expressed as qJ:(v)=p:v (P:>O). Therefore we have Pn+1 = Pn/{ Pn + b;(1 - P n )} (b i = PUP;), tff n = E;. This model is nonlinear, and the re- sponse probability is path independent. By making the recursive formula explicit, we obtain Pn = P 1 /{P 1 +(1- Pdexp(L:=1 N;logb;)}. Hence it is clear that the events are commuta- tive. Other nonlinear models in which the re- sponse probability is either quasi-independent of path [5] or path dependent [9] have also been proposed. Here we have taken up only the case in which the number of response alternatives is 2, but we can generalize to the case of more than two alternatives. For fitting a model and experimental data, expected response prob- abilities and various other statistics deduced from the model (total error, trial number of first success or last error, and seq uential sta- tistics such as length of response trun or tauto- correlation between responses) are used. Es- timation methods have also been devised for the parameters involved. References For sensory tests, [1] H. A. David, The method of paired com- parisons, Hafner, 1963. [2] J. P. Guilford, Psychometric methods, McGraw-Hil!, second edition, 1954. [3] M. G. Kendal!, Rank correlation methods, Griffin, third edition, 1962. [4] W. S. Torgerson, Theory and method of scaling, Wiley, 1958. For learning theory, [5] R.1. Audley and A. R. Jonckheere, The statistical analysis of the learning process, Brit. 1. Statist. Psycho!., 9 (1956), 87-94. [6] R. R. Buch and F. Mostel!er, Stochastic models for learning, Wiley, 1955. [7] W. K. Estes, Component and pattern models with Markovian interpretations, Studies in Mathematical Learning Theory, R. R. Bush and W. K. Estes (eds.), Stanford Univ. Press, 1959,9-52. [8] M. I. Hanania, A generalization of the Bush-Mostel!er model with some significance tests, Psychometrika, 24 (1959), 53-68. [9] R. D. Luce, Individual choice behavior: A theoretical analysis, Wiley, 1959. 1288 [10] S. H. Sternberg, A path-dependent linear model, Studies in Mathematical Learning Theory, R. R. Bush and W. K. Estes (eds.), Stanford Univ. Press, 1959,308-339. [11] S. H. Sternberg, Stochastic learning theory, Handbook of Mathematical Psy- chology II, R. D. Luce, R. R. Bush, and E. Galanter (eds.), Wiley, 1963, 1-120. For MDS, [12] J. B. Kruskal, Nonmetric multidimen- sional scaling: A numerical method, Psycho- metrika, 29 (1964),115-129. [13] R. N. Shepard, The analysis of proxim- ities: Multidimensional scaling with an un- known distance function, Psychometrika, 27 (1962),125-219. [14] L. Guttman, Multiple rectinear predic- tion and the resolution into components, psychometrika, 5 (1940), 75-99. [15] 1. B. Carrol and 1. 1. Chang, Analysis of individual difference in multidimem;ional scaling via an N-way generalization of Eckart- Young decomposition, Psychometrika, 35 (1970),283-319. For factor analysis, [16] C. Spearman, The abilities of roan, Mac- millan, 1927. [17] H. H. Harman, Modern factor analysis, Univ. of Chicago Press, 1967. [18] H. F. Kaiser, The varimax crit'rion for analytic rotation in factor analysis, Psycho- metrika, 23 (1958), 187-240. [19] D. N. Lawley and A. E. Maxwel!, Factor analysis as a statistical criterion, Butterworth, 1963. [20] K. Takeuchi, H. Yanai, and B. N. Muk- herjee, Foundations of multivaria te analysis, Wiley, 1982. 
347 A Quadratic Fields 347 (V.12) Quadratic Fields A. General Remarks Any textension field of the rational number field Q of degree 2 is called a quadratic field. Any quadratic field k is obtained from Q by adjoining a square root of a square-free integer (i.e., an integer ye 0, 1 with no sq uare factor) m:ko=Q(";;). Ifm is positive (negative), k is called a real (imaginary or complex) quadratic field ( 14 Algebraic Number Fields). Let w= { (1 +";;)/2 for m: 1 (mod4), ..;; for m = 2,3 (mod 4). Then (1, w) is a tminimal basis of k. That is, any talgebraic integer  of k has the unique expression  = a + bw with a, bE Z. The tdis- criminant d of k is given by d = m in case m =:= I (mod 4) and d = 4m in case m =:= 2, 3 (mod 4). The conjugate element of an element  = a + b (a, bl=Q) of k over Q is given by '=a - b';;. The mapping (J:->' is an tautomorphism of the field k. B. Units Let k be an imaginary quadratic field. The tunits of k are ::!:: 1, :::I:: i in case m = - 1; ::!:: 1, ::!::wo, ::!::wii(w o =(1 +)=3)/2) in case m= -3; and i: I in all other cases. Let k be a real quadratic field. There exists a unit f: o that is the smallest one among the units (> 1) of k. Any unit E of k can be uniquely expressed in the form f. = ::!:: EO (n E Z). That is, :t f. J is a tfundamental unit of k. The fun- damental unit Eo = (x + yjd )/2 (> 1) can be calculated by finding a minimal positive inte- gral solution (x, y) of t Pell's equation x 2 - dy2 = ::!:: 4 by using continued fractions ( 83 Continued Fractions; for a table of the fun- damental unit of k for m < I 00  [I J). C. Prime Ideals The decomposition of a prime number p in the tprincipal order 0 of k is given as follows: (i) Let pi d, where d is the discriminant of k. Then p is decomposed in 0 in the form (p) = p2. (ii) Let p -I--- 2 and (p, m)= 1. If (m/p) = 1, then (p)= pp' in 0 (pyep') and N(p)=N(p')=p. If(m!p)= --I, then (p)=p in 0 and N(p)=p2. Here p, p' are prime ideals of 0, N means the tnorm, and (m/p) is the tLegendre symbol. (iii) Let 2..t d, tbat is, m=. I (mod 4). If m=:= 1 (mod 8), then (2)=pp'(p yep') and N(p)= N(p')=2. Um=:= 5 (mod8), then (2)=p and N(p)=4. 1290 D. The Kronecker Symbol Let k = Q(";;), and let d be the discriminant of k. We define the symbol X for k as follows: (i) X(p) = 1 if (p) = pp' (p -I--- p') in 0; (ii) X(p) = - 1 if(p)=p in 0; (iii) X(p)=O if(p)=J'2 in 0; and (iv) x(n) = ITiX(pJ"i for n = ITipfi> O. In partic- ular, we define X(1)= 1. If(n,d)= I, the sym- bol X can also be defined using the tJacobi symbol as follows: Um=:= 1 (mod4), then x(n)= (n/lml); and ifm=:=2,3 (mod4), thetl x(n)= x!(n)(n/lm'\) for d=2 e m', where (i) x!(n) = (_lyn-1)/2 for e = 2, m' =:= 3 (moM); (ii) x!(n) = ( _1)(n 2 -( )/8 for e = 3, m' =:= 1 (mod 4); and (iii) xi(n)= (- 1)(n 2 -1)/8+(n-1)/2 for e = 3, m' =:= 3 (mod 4). U(n,d)ye 1, then x(n)=O. For a nega- tive integer - n, we define x( - nl = (sgn d Jx(n). The symbol x(n) for n E Z is called the Kro- necker symbol for k. The Kronecker symbol for k has the follow- ing four properties: (1) x(n)= 0 jI" (n, d) -1---1, and x(n)=:!: 1 if(n,d)=l; (2) x(m)=x(n) ifm==n (mod d); (3) X(mn) = X(m)x(n); (4) x(n) = 1 if and only if n =:= N (0) (mod d) for some integral ideal o of k such that (o,(d))= 1. (Property (4) shows that a quadratic field provides ;l class field;  59 Class Field Theory.) E. Ideal Classes The tclass number h of k was calculated by P. G. L. tDirichlet (1840) by analytica] meth- ods as follows: 1 d-1 rn hlogEo= -- I x(r)logsin- for m>O, 2'1 d -w Idl-l h=- dl L: x(r)r form<O, 21 r= I where d is the discriminant of k, f.o is the positive fundamental unit (> [) of k, w is the number of roots of unity in k, and X is the Kronecker symbol. Denote by hid) the class number of the imaginary quadratic fIeld wit':J discriminant d. It was conjectured from the time of C. F. tGauss that h(d)->CXJ as Idl--> CXJ. This conjec- ture was proved by H. Heilbl'Onn (1934). More precisely, C. L. Siegel (Acta Arith., 1 (1935)) proved lim (logh(d))/(logJjdl) = I IdlC0 h(d)= 1 holds for Id/=3,4, 7,8,11,19,43,67, 163. In 1934, Heilbronn and E. H. Linfoot proved that there can be at most one more such d, and finally A. Baker and H. M. Stark independently proved that these nine numbers are the only ones for which h(d)= 1 (Baker, Mathematica, 13 (1966); Stak, Michigan Math 
1291 J., 14 (1967)). Also, Baker and Stark proved independently (Ann. Math., (2) 94 (1971); Math. Comp., 29 (1975)) that h(d)=2 hOlds only for Idl = 15,20,24,35,40,51,52,88,91,115,123, 148,187,232,235,267,403, and 427. For real quadratic fields, we have lim (1og(h(d)loge d ))/(log Jd)= 1, doo where e d is the fundamental unit of k = Q(Jd) (ed> 1). However, it is not yet determined whether there exist infinitely many d with h(d) = 1 ( Appendix B, Table 4). F. Genera Let G be the group of an (tfractional) ideals of k, and let H be the group of an tprincipal ideals (IX) of k such that N(IX) >0. Each coset of G modulo H is caned an ideal class in the narrow sense. (This notion is a special case of the notion of ideal classes in the narrow sense of algebraic number fields;  14 Algebraic Number Fields G.) In the cases (i) m < 0 and (ii) m>O, N(eo)= -1, the usual classification of ideals and classification of ideals in the narrow sense are identical. When m>O, N(eo) = 1, each ideal class is divided into two ideal classes in the narrow sense. We can an ideal class in the narrow sen simply an ideal class. Let PI' .., ,p, be the set of an prime numbers dividing d. For nEZ with (n, d)= 1, we define xdn),..., x,(n) as fonows: For Pi #-2, we define Xi(n) = (n/Pi); for Pi=2, we identify Xi with X! in the definition of the Kronecker symbo1. In order that XI (n)=... = x,(n) = 1 for nE Z with (n,d)= 1, it is necessary and sufficient that n:: N(IX) (mod d) for an integer IX of k (where N(IX) = IXIX'). Since XIX2 ...X, is equal to the Kronecker symbol, it follows that n::N(a) (mod d) for an integral ideal a of k is a neces- sary and sufficient condition for XI (n)... x,(n) = 1 to hold for nE Z with (n, d) = 1. Put ei=Xi(N(a)) (i= 1, ...,t), where a is an integral ideal with (a, (d))= 1. Then (eJ, '" ,e,) is uni- quely determined for the ide'al class C contain- ing a and does not depend on the choice of a. The set f> of all ideal classes of k such that (eJ, ... ,e,) = (1, ...,1) is ca11ed the principal genus of k, and f> is a subgroup of the ideal class group (£ of k. Each coset of (£ modulo f> is caned a genus of k. For each genus, the values e i = xlN(a)) (i = 1, ..., t) are uniquely determined. We ca11 (e I' .., ,e,) the character system of this genus, and each genus is uni- quely determined by its character system. A necessary and sufficient condition for (eJ, ..., e,) to be a character system for some genus is that e i = :!:: 1 and el e 2 .,. e, = 1. Hence there are 2'-1 347H Quadratic Fields genera of k, and (£/f> is an Abelian group of type (2,2, .., , 2). In order that an ideal class C belong to the principal genus, it is necessary and sufficient that C=ct-<7=ci hold for some ideal class C 1 . From this it fo11ows that there are t-l tinvariants of the ideal class group (£ of k that are powers of 2. Each ideal class C such that C = C is ca11ed an ambig class of k. There are 2'-1 ambig classes, and they form an Abelian group of type (2,2, ...,2). Each ideal a of k with a<7 = a is ca11ed an ambig ideal of k. Let (Pi) = pf (i = 1, '" ,t). Then each ambig ideal is uniquely expressed in the form a = pi' ... p,v'(a) by some aEQ and vi=O, 1. Each ambig class contains exactly two ambig ideals of the form pi'...p,vt. For example, for k= Q( J -65 ) we have d= -2 2 '5.13, t=3, h= 8, (£= {E, A, A 2 , A 3 , B, AB, A 2 B, A 3 B}, where A 4 =E, B2=E, A<7=A3, and B<7=B; the prin- cipal genus is f>= {E, A 2 }, and the ambig classes are {E, A 2, B, A 2 B}. G. Norm Residues A quadratic field k is the tclass field over Q for an ideal group H. The tconductor of H is said to be the conductor of k/Q or simply the con- duct0l". of k. The conductor 1 = npl p of k = Q(Jm) is given by I=d for m>O and 1= dpoo for m<O. That is, the p-conductor Ip=p for pi d, p#-2; and 12 =2 e for 21d, d=2 e m' ((2,m')= 1). By means of the tHilbert norm- residue symbol, the Kronecker symbol X is expressed by { n ( a, d ) x(a)= d P for (a, d) = 1, for (a, d) #- 1. H. History The arithmetic of quadratic fields was origi- na11y developed in terms of the theory of binary quadratic forms with rational integral coeffi- cients by Gauss and Dirichlet [2]. The theory was then translated into the terms of ideal theory by J. W. R. Dedekind [2] ( 348 Qua- dratic Forms M). For example, the theory of genera for quadratic fields explained in Section F was first developed by Gauss in terms of binary quadratic forms, and the class number formula was obtained by Dirichlet as a formula for binary quadratic forms. Hilbert [4, ch. 2] developed the arithmetic of quadratic fields systematica11y by introducing the Hilbert norm-residue symbol ( [1,5,6]). Later the arithmetic of quadratic fields assumed the aspect of a simple example of class field theory ( 59 Class Field Theory). 
347 Ref. Quadratic Fields References [1] Z. I. Borevich and I. R. Shafarevich, Num- ber theory, Academic Press, 1966. (Original in Russian, 1964.) [2] P. G. L. Dirichlet, Vorlesungen iiber Zah- lentheorie, herausgegeben und mit Zusiitzen versehen von R. Dedekind, Viewig, fourth edition, 1894 (Chelsea, 1969). [3] H. Hasse, Vorlesungen iiber Zahlentheorie, Springer, 1950. [4] D. Hilbert, Die Theorie der algebraischen Zahlkorper, Jber. Deutsch. Math. Verein. 4, 1- XVIII (1897),175-546. (Gesammelte Abhand- lungen I, Springer, 1932,63-363; Chelsea, 1967.) [5] B. W. Jones, The arithmetic theory of quadratic forms, Math. Assoc. of America and Wiley, 1950. [6] E. G. H. Landau, Vorlesungen iiber Zah- lentheorie I, II, III, Hirzel, 1927 (Chelsea, 1969). 348 (111.15) Quadratic Forms A. General Remarks A quadratic form Q is a quadratic homoge- neous polynomial with coefficients in a tfield K, written Q(x 1 , ...,x n )= L CikXiX k . likn If the coefficients C ik belong to the field of real (complex) numbers, we call Q a real (complex) quadratic form. Let V be an n-dimensional vector space over K. For a vector x in V whose coordinates are XI' ... ,x n , we put Q(x) = Q(x h ..., x n ). This gives rise to a mapping x->Q(x) of V into K. Such a mapping satisfies the following two conditions: (i) Q(ax)=a 2 Q(x) (aEK); and (ii) Q(x+ y)-Q(x)-Q(y)=B(x,y) is a tsymmetric bilinear form on V. Conversely, if a mapping Q: V -> K satisfies these two con- ditions. then Q must come from a quadratic form (- 256 Linear Spaces). B(x, y) is called the symmetric bilinear form associated with Q. We assume that the tcharacteristic of K is not 2. Putting aik=aki=cik/2 (i<k), aii=c ii (i = 1, ..., n), we have Q(x) = L?k=1 aikxixk' The matrix A = (a ik ) is the matrix of the quadratic form Q, and the determinant IA I is the dis- criminant of Q, denoted by (Q). (Sometimes, instead of 1 A I, we call (_l)n(n-l)/22 n IA I, the discriminant of Q.) The rank of the matrix A is called the rank of Q. Using the notation for the tinner product of vectors, we can write Q(x) 1292 =(x,Ax)='xAx; 2- 1 B(x,y)=(x,Ay)='xAy. We say that Q is nondegenerate if B(x, y) is nondegenerate (i.e., if IAI #0). Consider a linear substitution x i = Lj=l Pijxj (i.e., x=Px', with an n x m matrix P). Then we get a new quadratic form Q'(x') with the ma- trix tp AP. If each Pij belongs to th, field K (to a subring R of K that contains the: unit ele- ment of K), we say that Q represents Q' over K (resp. R). A basic problem in the theory of quadratic forms is to determine the exact conditions under which a given quadratic form Q represents another quadratic form Q'. The problem of representing numbers by a quadra- tic form (representation problem) is the par- ticular case corresponding to m = I. Any qua- dratic form Q represents 0 by taking the zero matrix as P. Hence, by the expresion "Q represents 0 over K," we usually mean the nontrivial representation of 0 over K by Q, i.e., Q(x) = 0 for some nonzero vector x. If Q is non degenerate and represents 0, then it repre- sents any element of K*. Given an element J1 in K*, we consider the quadratic form Q' de- fined by Q'(X1'"'' x n +J1 = Q(x) - J1X+1' Then Q represents J1 if and only if Q' represents O. Another important special case is that of n = m. Then the discriminant of Q' is given by !P121A I. In particular, if IPI #0 (IPI is an inver- tible element of R), we say that Q is equiva- lent to Q' over K (resp. R). This gives rise to an equivalence relation. Equivalent forms have the same rank. On the other hand, if the rank of Q is r, then Q is equivalem to a form LI=l aixf over K (ai#O, i= 1,... ,r). GeneralIy, for elements a and b in K* = K - {O}, we write ab if a' b- 1 E(K*)2. Then if Q is equivalent to Q', we have (Q)  (Q'). When we specify a field K, we assume that the coefficients of the quadratic forms and the coordinates of linear transformations are all contained in the field K. In particular, the equivalence of the forms is equival ence over K. B. Complex Quadratic Forms If K is the field of complex numbers C, then a form of rank r is equivalent to the form LI=1 xl; hence over C two forms of the same dimension are equivalent if and only if they are of the same rank. C. Real Quadratic Forms Now let K be the field of real num bers R. If Q is of rank r, then it is equivalent to the form Lf=1 xf - LJ=1 x+j (p+q =r). Hen, P and q are uniquely determined by Q (Sylvester's law of inertia). We call (p, q) the signature of Q. Two quadratic forms of the same dimension 
1293 are equivalent if and only if they have the same signature. A quadratic form with the signature (n,O) (resp. (0, n)) is caned a positive (negative) definite quadratic form. Q is caned a definite quadratic form if it is either positive or negative definite; otherwise it is an indefinite quadratic form. Each of the fonowing con- ditions is necessary and sufficient for a form Q to be positive definite: (i) for any nonzero real vector x we have Q(x»O; (ii) an the tprincipal minors of the matrix of Q are positive. Q is negative definite if and only if -Q is positive definite. A form with n variables is caned a positive (negative) semidefinite quadratic form if its signature is (r,O) (resp. (0, r)), where 1  r < n. A linear transformation x' --+ X = Px' that leaves invariant the unit form L?1 xf is an orthogonal transformation. Then P is an tor- thogonal matrix. Any quadratic form can be transformed into a diagonal form L?=1 aixf via an orthogonal transformation. Here aI' ..., an are the teigenvalues of the matrix of the form. Two forms Q and Q' are equivalent with re- spect to an orthogonal transformation if and only if the corresponding matrices have the same eigenvalues. D. QUj\dratic Forms over Finite Fields and p-adic Number Fields Let Q and Q' be nondegenerate quadratic forms over the tfinite field Fq. They are equiva- lent if and only if they have the same rank and (Q)-(Q'). Moreover, if the rank of Q is not less than 3, then Q represents O. Next, suppose that Q and Q' are nondegen- erate quadratic forms over the tp-adic number field K. They are equivalent if and only if they have the same rank, (Q)-(Q'), and they have the same Minkowski-Hasse character X, where X is defined as fonows: Let C(Q) be the tClitTord algebra of Q, and let C*(Q) denote C(Q) ifn is even and C+(Q) ifn is odd. Then X(Q)= lor -1 according as C*(Q)M,(K) or M,(K) @ D(K). (Here M, is the total matrix algebra of degree t over K and D(K) is the unique tquaternion algebra over K.) Also, if Q has rank not less than 5, then Q represents 0 in K. E. Quadratic Forms over a General Field K The fonowing facts are valid on any field K whose characteristic is not 2. Given a quadra- tic form Q 1 with variables XI' ... , X n and an- other form Q2 with variables X n +1' ..., Xn+m' we get a new quadratic form Q1 EB Q2 or QI +Q2 defined by Q1 EB Q2(X 1 , ...'X n + m )= 348E Quadratic Forms Q1(XI> ...,X n )+Q2(X n +1, ...,x n + m ). Q1 EB Q2 is caned the direct sum of Q1 and Q2' The matrix of Q 1 EB Q2 is the direct sum of the matrices of Q1 and Q2' If Q1 and Q'1 are equivalent and Q1 EB Q2 and Q'1 EB Q are also equivalent, then Q2 and Q are equivalent (Witt's theorem). The quadratic form X1X2 +X 3 X4 +... + X2,-1 X2, is caned the kernel form and is de- noted by N,. Any nondegenerate quadratic form Q(x l' ... ,x n ) is equivalent to the direct sum of a kernel form N,(x I> ..., X2') and a form QO(X2'+I' ...,x n ), where ifQo#O, i.e., n>2r, we have Qo(x 2 '+1' "', x n ) = 0 only if X 2 '+1 = ... = X n =0. N, and Qo are uniquely determined by Q up to equivalence. The decomposition N, EB Qo is caned the Witt decomposition of Q (E. Witt [5]). The number r is caned the index of Q. An element x in V is said to be singular with re- spect to Q if Q(x)=O. A subspace W of V is said to be totally singular if an the elements in Ware singular. Let B be the symmetric bilinear form associated with Q. Then x is singular with respect to Q if and only if B(x, x) =0 (characteristic of K #2). We say that x is isotropic if B(x,x)=O. Thus a subspace W is totany singular if and only if it is totally iso- tropic (i.e., B(x,y)=O for an X,YE W). The index r of Q is the dimension of a maximal totany singular subspace of V. In particular, if K = Rand (p, q) is the signature of Q, then the index r = min(p, q). Here we must be careful, since some authors can the number p - q or p or q the index of Q. To make the distinction clear, we also can our r the index of total isotropy, and the number p-q the index of inertia. Necessary and sufficient conditions for a nondegenerate Q to be a kernel form are: n=the rank of Q=O (mod2) when K =C; n=O (mod 2) and p-q=O when K=R; n= o (mod2) and (Q)-1 when K =Fq; n=O (mod 2), (Q)-I, and X(Q)= 1 when K is a p-adic number field. Let N, EB Qo, N s EB Qo be Witt decompo- sitions of Q and Q', respectively. We say that Q and Q' belong to the same type if Qo and Qo are equivalent and denote the set of types of nondegenerate quadratic forms over K by W. We define the sum of the types of Q and Q' as the type of Q EB Q', and this gives W the struc- ture of a commutative group. The type of a kernel form is the identity element of this group W, caned the Witt group. The structure of the Witt group depends on K. If K =C, then W  Z/2Z; if K = R, then W  Z; if K is a t10cal field with a tnon-Archimedean valua- tion, then W is a finite group; if K = Fq, then W(Z/2Z)+(Z/2Z) if q= 1 (mod4), WZ/4Z if q=3 (mod 4), and WZ/2Z if q is a power of2. 
348 F Quadratic Forms F. Hermitian Forms An expression H(x) = "Li.k' aikxixk is called a Hermitian form if a ik E C, a ik = a ki . (Here a ib Xi are the complex conjugates of a ib X h respec- tively.) The value H(x) is a real number. As for quadratic forms, we define the notions of the matrix of H and the discriminant, rank, and tsesquilinear form associated with H. The matrix A of H is a tHermitian matrix whose principal minors are real numbers. If we apply a linear transformation P(x') = x, we obtain a Hermitian form with respect to x' whose matrix is given by 'PAP. Any Her- mitian form is equivalent to a form "Lf=1 XiX i - "LJl xp+jx p + j ; (p, q) is called the signature of H. We define the notions of positive definite, negative definite, and indefinite Hermitian forms as we did for quadratic forms over the field of real numbers. Each of the following conditions is necessary and sufficient for H to be positive definite: (i) H(x»O for any non- zero complex vector x; (ii) all the principal minors of the matrix of H are positive. The definition of a semidefinite Hermitian form is given in the same manner as for a quadratic form. A linear transformation that leaves the Hermitian form "Li, XiX i invariant is called a unitary transformation, and its matrix is a tunitary matrix. Any Hermitian form can be transformed via a unitary transformation into a diagonal form "Li, aixix i , where a l , ..., an are the eigenvalues of the matrix of the Her- mitian form. The notion of Hermitian forms can be gen- eralized as follows: Suppose that K is a tdivi- sion ring with an involution a-->a (aEK) (i.e., a-->a is a linear mapping of K onto itself such that a = a, ab = ba). Then a Hennitian form H over K is defined by n H(x)= I Xiaikxb J,k=l where XiEK, aikEK, aik=a ki . In particular, if we have, for any given vector x whose coordi- nates belong to K, an element a in K such that H(x)=a+a, then we have a Witt decompo- sition for H. Two examples of such K having involutions that differ from the identity map- ping are a separable quadratic extension K of a field L and a tquaternion algebra K over a field L. G. Quadratic Forms over Algebraic Number Fields Let K be an algebraic number field of fi- nite degree, P be an t Archimedean or non- Archimedean tprime divisor of K, Kp be the 1294 p-completion of K, and Q and Q' be nonde- generate quadratic forms over K. Then Q rep- resents Q' over K if and only if Q represents Q' over Kp for all p, and Q represents 0 in K (i.e., there exists a nonzero vector ;c whose co- ordinates belong to K such that Q(x) = 0) if and only if it represents 0 in Kp for all p [3,4J. In particular, Q and Q' are equivalent over K if and only if they are equivalent over Kp for all p. Hence the invariants with respect to equivalence over K of a nondegenlrate qua- dratic form Q over K are n = the rank of Q,  = the discriminant of Q, the Minkowski-Hasse character X p for tprime divisors p of K, and the index ofinertiaj;. ofQ over Kpcn.;. for each treal infinite prime divisor PT.;' (I, = 1,..., rd of K. Here the following properties hold: (i) X p = 1 for all but finitely many p; (ii) I1 p x p = 1 (this is equivalent to the tproduct formula of norm- residue symbols); (iii)  (_1)(n2 + j;)f2 in K PT ;.; and (iv) X Pro ,;. = 1 if j; =0, 1,2, 7 (mod 8), = -.: 1 if j;. = 3, 4, 5, 6 (mod 8) [3,4]. Conversely, if the system {n'Xp,Xpro.))A'} satislies condi- tions (i)-(iv), then it is the set of invariants of a quadratic form over K (Minkowski-Hasse theorem). In general, if a property concerning K holds if and only if it holds for all Kp, we say that the Hasse principle holds for the property. H. Class and Genus of a Quadratic Form Let K be an algebraic number field of finite degree. Quadratic forms Q and Q' over K are said to be of the same class if they are equiva- lent over the tprincipal order 0 in K. On the other hand, Q and Q' are said to be of the same genus if (i) they are equivalent over the principle order op in Kp for all non- Archimedean prime divisors p of K and (ii) they are equivalent over Kp for all the Archi- me dean prime divisors p of K. A genus is decomposed into a finite number of classes. For example, if K is the field of rational num- bers, the number of classes in the genus of "L=lxtis 1 form8,whileitis 2form>8. I. Reduction of Real Quadratic Forms Let A be an m x m matrix and X an m x n matrix. We put A[X] =tXAX. Then we can write Q(x) = Sex] = txS x , where S is the matrix of the quadratic form Q. In this section we put K = R and define two forms to belong to the same class if they are equivalent over the ring of rational integers. We identify the form Q with its matrix S = (sij)' Let S be a positive definite form in m variables. Then S is said to be a reduced quadratic form if S [gk]  Skk and Sll+l 0(1 km, 1 lm-l), where gk is an 
1295 arbitrary vector whose coordinates 9 l' '" , gm are integers such that (gb ..., gm) = 1. Any class of positive definite quadratic forms contains at least one (and generany only one) reduced form. For a reduced form R = (r k ,), the fonow- ing inequalities hold: O<r 11 r22 ... rmm; :t2rk,rll' k<l; rllr22...rmm<c(m)IRI, where c(m) depends only on m. The set of an sym- metric matrices of degree m forms a linear space of dimension m(m + 1)/2 in which the subset  formed by the positive definite sym- metric matrices is a convex open subset. More- over, the subset 9\ formed by an reduced positive definite symmetric matrices is a con- vex cone whose boundary consists of finitely many hypersurfaces and whose vertex is the origin. Let S be an indefinite quadratic form whose signature is (n, m - n). The set of positive definite quadratic forms H such that S-[ [H] = S fonns a variety of dimension n(m - n), which is denoted by H(S). We say that S is reduced if H(S) n 9\ # 0. Given a natural number D, there are only a finite number of definite or indefinite reduced quadratic forms with rational integral coefficients whose dis- criminant is :tD. Hence the number of classes of quadratic forms with rational integral coef- ficients and discriminant :tD is also finite. J. Units Let S be a symmetric matrix of degree m with rational coordinates. Let O(S) be the set of an real m x m matrices W for which SEW] = S, and qS) be the subset of O(S) consisting of the integral matrices. An element of qS) is caned a unit of S. qS) is a finite group if S is definite, but otherwise it is infinite (except for the case m = 2, -ISI = r 2 , with rational r). O(S) is a tLie group, and qS) is a discrete subgroup with a finite number of generators. The thomo- geneous space O(S)/qS) is of finite measure with respect to a tHaar measure defined on the space. K. Minkowski-Siegel- Tamagawa Theory Let Sand T be rational integral positive de- finite symmetric matrices of degree m and n, respectively (m:;' n). Let A (S, T) be the number of rational integral solutions for the equation SEX] = T, and E(S) be the order of the group of units qS). We put A(SI> T) A(S2' T) M(S,T) - E(Sr) + E(S2) +..., 1 1 M(S) = E(SI) + E(S2) +..., M(S, T) Ao(S, T)= M(S) , 348K Quadratic Forms where S I> S2, ... is a complete system ofre- presentatives of the classes in the genus of S. M (S) is caned the measure of genus of S. On the other hand, for a natural number q, we denote by AiS, T) the number of the solu- tions of the congruence equation SEX] = T (modq). If q is a prime power pa, then the ratio em.nq -mn+n(n+1)J2 Aq(S, T) takes a constant value IXp(S, T) for sufficiently large a (where e m . n = 1/2 if m = n:;' 2; = 1 otherwise). Furthermore, let us consider a domain B in the Euclidean space of dimension n(n + 1 )/2 formed by the set of n x n symmetric matrices containing T, and let B 1 be the domain formed by the matrices X such that S[ X] E B. Then B[ is a domain in the space of dimension mn formed by an m x n matrices. Let IX",(S, T) be the limit of the ratio v(B 1 )/v(B) of the volumes of Band B[ as the domain B shrinks toward the point T. Then Siegel's theorem states: IX",(S, T)I1pIXiS, T)= eAo(S, T), where e=2 ifm=n+ lor m=n:;,2 and e = 1 otherwise. The infinite product of the left-hand side of this equation does not con- verge absolutely if either m = n = 2 or m = n + 2, and in those cases the order of the product I1 p is considered to be the natural order of the primes p. A special case of Siegel's theorem was proved by H. Minkowski, but it was C. L. Siege! [6] who proved it in its general form. Except for a finite number of p, the numbers IXp(S, T) have been calculated. The eXplicit form of IX", (S, T) is also known. In particular, if we take the identity matrix E(m) of degree m as S, then the formula in Siegel's theorem is re- lated to the pro blem of expressing natural numbers as sums ofm squares. For m= 2,3, '" , 8, the genus of E(m) contains only one class. Hence, putting n= 1, T=t (=a natural number), we obtain from Siegel's theorem the number of ways in which we can express t as the sum of m squares [6, pt. I]. Siegel's result was generalized by Siegel himself to the case where the form S is indefinite [6, pt. II] and where the coefficients of the forms are elements of an algebraic number field of finite degree [6, pt. III]. Also, regarding the number of possible ways to express a natural number t as a sum of m squares, the fonowing formula was obtained by C. G. J. Jacobi for the case where m=4, n= 1: A(E(4),t)=8 ( Id- I d ) . dlt 4dlt For the case m = 3, n = 1, it is known that if tis odd and A(E(3),t»O, then t=/=7 (mod 8) (for details  P. T. Batemann, Trans. Amer. Math. Soc., 71 (1951)). T. Tamagawa used the theory of tadelized algebraic groups and proved that the tTama- gawa number T(SO(n,S)) of the special ortho- 
348 L Quadratic Forms gonal group is 2. He also showed that from this fact, Siegel's theory in this section can be deduced ( 13 Algebraic Groups P) [10]. L. Theta Series Let Q(x l ,..., x m ) be a positive definite form with integral coefficients. For a complex num- ber z, we put F(z, Q) = L exp(2niQ(x l , "', xm)z), XI' . 'X m where XI' ... 'X m run over all the integers. If Imz>O, the series converges and represents an tentire function of z. These series are called theta series. If we denote by A(n) the number of integral solutions of the equation Q(x l ,... ,X m )=II, we have oc F(z, Q) = L A (n)e hinz . n:::::O Moreover, if m = 2k, we ha\>e the following transformation formula: ( az + b ) F --,Q =£(d)(cz+dlF(z,Q), cz+d where a, b, c, d are integers such that ad - be = 1, c==O (modN), N is a natural number determined by Q, and £ is a character mod N. In other words, F(z, Q) is a tmodular form with respect to the tcongruence subgroup of level N. Using the theory of modular forms, E. Hecks showed that A(n) = Ao(n) + O(n k / 2 ), where Ao(n) is a number-theoretic function of II determined by the genus of Q. M. Binary Quadratic Forms with Integral Coefficients Now put m = 2. Given a form Q(x, y) = ax 2 + bxy+ cy 2, we put D(Q)=b 2 -4ac and call it the discriminant of Q (i.e., D(Q) = -4(Q)). Q is said to be primitive if(a,b,c)= 1. When D(Q) is not a square, the theory concerning Q is closely related to the arithmetic theory of the tquadratic field Q(Jl}) = k. Let d be the tdis- criminant of k and put D=df2. When f = 1, there is a one-to-one correspondence between the tideal classes of k and the classes of qua- dratic forms with discriminant d (when D < 0, we consider the classes of positive definite forms). The correspondence is given in the following manner: If a is an ideal in k with a basis ?(I' ?(2' then the corresponding form is given by Q(x, y) = N(a)-I N(?(I x + ?(2Y), where N is the absolute tnorm. If f> 1, we must replace the ring of integers 0 by the torder of the tconductor r That is, if we consider the ring formed by the elements x + j yw (x, yare rational integers; the meaning of w is ex- 1296 plained shortly), we again have a one-to-one correspondence between the classes of ideals of this order and the classes of quadratic forms [3]. We let w=fll2 if d==O (mod4);=(1 + fl)/2 if d == 1 (mod 4). When D > 0, we can introduce the notion of proper equivalence as follows: Q and Q' are properly equivalent if the matrix of Q is transformed to the matrix of Q' by a linear transformation P whose determi- nant is 1. Then, in the correspondence for the case f = 1, if we take?(1 =r>0,?(2 =s+ tw, t>O (r. s, t E Q), then we get a relation between the classification of the forms and the classification in the finer sense of the ideals in k Suppose that D > 0 is not a square. Let (t, u) be an integral solution oftPell's equation t 2 _ Du 2 = :t4. Then the units of the form Q(x, y) =ax 2 +bxy+ ey 2 with discriminant Dare given by :t ( (t - bu)/2 au -cu ) (t + bu)/2 . Let (to, u o ) be the smallest positive integral solution of t 2 - Du 2 =4, put £0 =(/0+ u o JD)/2, and let ho be the class number in rhe finer sense of the forms of discriminant D. Then the following formula holds (Dirichlel): 1 00 1 ( D ) /n holog£o= L - - , v D "I n n where (Din) is the tKronecker symbol (here D =f 2 d; we put (Dln)=O if(f,n)# 1, and (Din) = (din) if (f, n) = 1). For D < 0, the order W o of the units is known: it is 6 if D = - 3; 4 if D = -4; and 2 otherwise. We also have 2n ho_ f I ( D ) JfDT Wo - '=1  -; . With respect to the numbers ho al1d £0, little else is known. References [IJ B. W. Jones, The arithmetic theory of quadratic forms, Wiley, 1950. [2J N. Bourbaki, Elements de mathematique, Algebre, ch. 9, Actualites Sci. Ind., 1272a, Hermann, 1959. [3J M. Eichler, Quadratische Formen und orthogonale Gruppen, Springer, ] 952. [4J O. T. O'Meara, Introduction to quadratic forms, Springer, 1963. [5J E. Witt, Theorie del' quadratischen For- men in beliebigen Korpern, J. Reine Angew. Math., 176 (1937),31-44. [6J C. L. Siegel, -aber die analyti5che Theorie del' quadratischen F ormen I, II, HI, Ann. Math., (2) 36 (1935), 527-606; 370936),230- 263; 38 (1937), 212-291 (Gesammelte Abhand- 
1297 lungen, Springer, 1966, vol. 1,326-405,410- 443,469-548). [7] C. L. Siegel, On the theory of indefinite quadratic forms, Ann. Math., (2) 45 (1944), 577-622. (Gesammelte Abhandlungen, Springer, 1966, vol. 2,421-466.) [8] E. Heeke, Analytische Arithmetik der positiven quadratischen Formen, Kgl. Danske Videnskab. Selskab, Mat.-Fys. Medd., XIII, 12 (1940). (Mathematische Werke, Vandenhoeck & Ruprecht, 1959, 789-918.) [9] G. L. Watson, Integral quadratic forms, Cambridge Univ. Press, 1960. [10] T. Tamagawa, Adeles, Amer. Math. Soc. Proc. Symp. Pure Math., 9 (1966),113-121. For binary quadratic forms, [11] P. G. L. Dirichlet, Vorlesungen tiber Zahlentheorie, Vieweg, fourth edition, 1894 (Chelsea, 1969). 349 (XIX.3) Quadratic Programming A. Problems A quadratic programming problem is a special type of mathematical programming ( 264 Mathematical Programming) where the objec- tive function is quadratic while the constraints are linear. A typical formulation of the prob- lem is as follows. (Q) Maximize z=c'x-!x'Dx under the condition Axb and xO, xERn. Let the Lagrangian form for this problem be cp(x, 1) = c'x -!x'Dx + l'(b - Ax). Then, from the general properties of Lagran- gian forms, the following theorem can be proved. Theorem: If x = x* is an optimal solution of the problem (Q), there exists a vector A. * satis- fying the conditions -Dx*+cl*, 1*0; b'l* =(c- Dx*)'x*. Moreover, if the matrix D is nonnegative de- finite, the above conditions are also sufficient for x = x* to be optimal. The second condition can be shown to be equivalent to l*'(b-Ax*)=O and x*'(A'l*+Dx*-c)=O. By introducing the slack vectors uO and v  0, the conditions can be expressed as xO, yO, uO, vO; ) Ax+u= A+Dx-v= y'u=O, x'v=O. 349C Quadratic Programming When D is nonnegative definite, any feasible solution of the above system of equalities gives an optimal solution of the primary problem (Q), and when D is positive definite, the solu- tion is unique. When D is not nonnegative definite, the optimal solution of (Q), if it exists, is one of the feasible solutions of (C). The last line of (C) implies that the solution must be a basic solution of the linear system of equalities, and it also restricts the possible combinations of the basic variables. Since there exist only a finite number of possible combinations of the basic variables, the quadratic programming problem can be solved in a finite number of steps, if it has an optimal solution. B. Duality The dual problem of (Q) is the following. (QD) Minimize w=b'y+!x'Dx under the condition A'y+Dxc and xO, yO. If D is nonnegative definite, the following theorem holds. Theorem: If the primary problem (Q) has a solution x=x*, then the dual problem has a solution x=x* and y=y*, and maxz=minw. A more general form of the quadratic pro- gramming problem can be given as follows. (Q) Maximize z=c'x-!x'Dx under the condition x E V and b - Ax E W, where Vand Ware closed convex cones in R n and R m , respectively. Then the dual problem is expressed as follows. (QD) Minimize w=b'y-!x'Dx under the condition XEV, YE W* and A'y+Dx-CE V*, where V* and W* are the dual cones of Vand W. The above theorem holds for both (Q) and (QD). C. Algorithms Various algorithms have been proposed for quadratic programming [1,2,4], most of which are based on condition (C). Wolfe [4] proposed a method based on the simplex method for linear programming. If we intro- duce the artificial vectors  and 1], we can find a feasible solution of (C) by solving the follow- ing linear programming problem. (LQ) Maximize z= -1'-1'1l under the condition that Ax+u-= A+Dx-v+ll= x y u v  1l y'u=O, x'v=O. (LQ) can be solved by applying the simplex algorithm with the only modification being 
349 Ref. Quadratic Programming that the last line of the condition restricts the possible changes in the basic variables. When D is positive definite, we can always obtain a solution if there is a feasible solution of the original problem, and Wolfe proposed a modification of the foregoing algorithm for the case when D is nonnegative definite that tells whether or not it has an optimal solution, and gives it if it has one. Some other algorithms are also effective when D is positive or nonnega- tive definite, but when D is not nonnegative definite, no simple effective method has been found to reach the optimal solution even when its existence has been established. References [I] E. M. L. Beale, On quadratic program- ming, Naval Res. Logistic Quart., 6 (1959), 227-243. [2J C. Hildreth, A quadratic programming procedure, Naval Res. Logistic Quart., 4 (1957), 79-85. [3J W. S. Dorn, Duality in quadratic pro- gramming, Quart. Appl. Mat., 18 (1960), 155- 162. [4J P. Wolfe, The simplex method for quadra- tic programming, Econometrica, 27 (1959), 382- 398. 350 (VI. 1 0) Quadric Surfaces A. Introduction A subset F of a 3-dimensional Euclidean space £3 is called a quadric surface (surface of the second order or simply quadric) if F is the set of zeros of a quadratic equation G(x, y, z) = 0, where the coefficients of G are real numbers. The equation G(x,y,z)=O is written as ax 2 + hy2 + cx 2 +d + 2fyz+ 2gzx + 2hxy+ 2f'x + 2g'y+ 2h'z=0. In general, a straight line intersects a quadric surface at two points. If it intersects the sur- face at more than two points, then the whole straight line lies on the surface. Suppose that we are given a quadric surface and a point O. Suppose further that we are given a straight line passing through the point 0 and intersect- ing the quadric surface at two points A and A'. If AO = 0 A' for all such straight lines, then the point 0 is called the center of the quadric surface. 1298 B. Classification The subset defined by equation (1:' may be empty; for example, x 2 + l + Z2 + 1 = O. In this article, we consider only quadric surfaces that are not the empty set. When a quadric surface F without tsingular points has a center or centers, we say that F is central. If we choose a suitable rectangular coordi- nate system, the equation of a central quadric surface is written in one of the foHowing forms: XZ yZ Z2 (2) "2+ h2 +"2= 1, a c x2 yZ ZZ (3) -+---= 1 a 2 h 2 C Z ' x 2 yZ Z2 (4) ----+-= 1 a 2 b 2 C Z ' XZ v 2 (5) a2 + b 2 = 1, x2 y2 (6) ---=1 a 2 h 2 ' x 2 "2=1. (7) a (1 ) When the equation takes the form (2), (3), (4), (5), or (6), we call the quadric surface an ellip- soid, hyperboloid of one sheet, hyperboloid of two sheets, elliptic cylinder (or elliptic cylin- drical surface), or hyperbolic c)linder (hyper- bolic cylindrical surface), respectively. When the equation takes the form (7\ the surface coincides with a pair of parallel planes. If a = bin (2), (3), (4), or (5), the surface is a tsurface of revolution with the z-axis as the axis of revo- lution. In this case, we call the surface an ellipsoid of revolution, hyperboloid of revolution of one sheet, hyperboloid of revolution of two sheets, or circular cylinder (or circular cylin- drical surface), respectively. If a=b=c for an ellipsoid of revolution, then tile surface is a sphere with radius a. If we choose a suitable rec1 angular coordi- nate system, the equation of a noncentral surface of the second order i" written in one of the following forms: x2 y2 2z="2+"2' a b (8) x2 y2 2z = --- a 2 h 2 ' (9) x 2 2Z=2' a (JO) When the equation takes the form (8), (9), or (10), we call the surface an dliptic paraboloid, hyperbolic paraboloid, or parabolic cylinder (or parabolic cylindrical surfac(), respectively. If a 
1299 = b in (8), the surface is called an elliptic para- boloid of revolution. Among these, (2), (3), (4), (8), and (9) are sometimes called proper quadric surfaces, and the others degenerate quadric surfaces. Equations (2)-(10) are called the canonical forms of the equations of these surfaces (a, b, C in canonical forms should not be confused with a, b, C in (1)). The planes x=O, y=O, and z=O in surfaces (2), (3), and (4) and the planes x = 0 and y=O in surfaces (8) and (9) are called principal planes of the respective surfaces; and lines of intersection of principal planes are called principal axes. For a surface ofrevo- lution, positions of principal planes and prin- cipal axes are indeterminate. We call a, b, C in equations of canonical form the lengths of the principal axes, or simply the principal axes. If F is a hyperboloid of one sheet or a hyperbolic paraboloid, there are two systems of straight lines lying on F; two straight. lines belonging to the same system never meet (and are not parallel), and two straight lines belonging to different systems always meet (or are parallel). If F satisfies (3), these systems of straight lines are given by { -=A( 1-) { _=(1 +) ::+= ( 1 + ) ::+= ( 1- ) . aCA b ac b If F satisfies (9), then two such systems are given by { -=A x y 2z -+-=- a b A { X y -+-= a b -=  (A and  are parameters). We call these straight lines generating lines of the respective surfaces. A hyperboloid of one sheet and a hyperbolic paraboloid are truled surfaces described by these generating lines. When a quadric surface has singular points, they are double points. The set of double points of a quadric surface F is either a single point 0, a straight line I, or a plane n. In the second case, F consists of two planes passing through I or I itself, and in the third case, F coincides with n. In the first case, we say that F is a quadric conical surface (or quadric cone) with vertex O. Its equation is written in the form AX2 + B y 2 + Cz 2 =0 (ABC #0). When A, B, C are of the same sign, F consists of only one point O. Otherwise, we can assume that A, B>O, C = -1. In this case, if A =B, F is called a right circular cone, and if A # B, F is called an oblique circular cone. Given hyperboloids (3) and (4), we call the 350D Quadric Surfaces quadric cones x 2 y2 Z2 -+---=0 a 2 b 2 c 2 (3') and x 2 y2 Z2 ----+-=0 a 2 b 2 c 2 (4') asymptotic cones of (3) and (4), respectively. C. Poles and Polar Planes Suppose that we are given a straight line S passing through a fixed point P not contained in a quadric surface F, and S intersects the surface at two points X, Y. The locus of the point Q that is the tharmonic conjugate of P with respect to X and Y is a plane. We call this plane n the polar plane of P with respect to the quadric surface F, and P the pole of the plane n. If the polar plane of a point P contains a point Q, then the polar plane of Q contains P. In this case, we say that the two points P and Q are conjugate to each other with respect to the quadric surface. When the point P is on the quadric surface, the tangent plane at Pis regarded as the polar plane of P. If the polar plane (with respect to a quadric surface) of each vertex of a tetrahedron is the face corre- sponding to that vertex, we call this tetra- hedron a self-polar tetrahedron. If the polar planes (with respect to a quadric surface) of four vertices of a tetrahedron A are four faces of a tetrahedron B, the same property holds when we interchange A and B. We say that such tetrahedrons are polar tetrahedrons with respect to the quadric surface. Suppose that we are given a quadric surface and two planes. If the pole (with respect to the quadric surface) of one plane is on the other plane, these two planes are said to be conjugate with respect to the surface. When we are given two tpencils of planes in tprojective correspondence, the locus of lines of intersection of two corresponding planes is generally a hyperboloid of one sheet or a hyperbolic paraboloid. In particular, if the axes of these pencils of planes intersect, the locus is a quadric conical surface, and if the axes are parallel, the locus is a quadric cylin- drical surface (i.e., an eJ1iptic or hyperbolic cylinder). When there exists a projective corre- spondence between two straight lines not on a plane, the locus of lines joining corresponding points is a quadric surface (M. Chasles). D. Surfaces of the Second Class A surface Fin E 3 is called a surface of the second class if it admits two tangent planes 
350 E Quadric Surfaces passing through an arbitrary straight line L provided that F n L = 0. This surface can be represented as the set of zeros of a homoge- neous equation of the second order in tplane coordinates u 1 , U 2 , U 3 , U 4 . It is possible that a surface of the second class degenerates into a conic or two points. In general, quadrics are surfaces of the second class, and vice versa. As in the case of quadrics, we can define poles, polar planes, and polar tetrahedrons with reference to surfaces of the second class. Four straight lines joining corresponding vertices of two tetrahedrons polar with respect to a surface of the second class are on the same quadric. We say that two such tetrahedrons are in hyperboloid position. E. Confocal Quadrics A family of central quadrics represented by the folIowing equations is calIed a family of con- focal quadrics: x 2 i Z2 -+-+-=1 a>b>c>O, (11) a+k b+k c+k ' where k is a parameter. For a quadric belong- ing to this family, any point on the elIipse x 2 j(a-c)+ y2j(b -c)= 1, z =0 or the hyperbola x 2 j(a-b)-z2j(b-c)=.1, y=O is calIed a focus. Tis elIipse and hyperbola are calIed focal conics of the quadric. Given an elIipsoid F and a point X (x, y, z) not contained in the principal plane, we can draw three quadrics F', F", F'" passing through X and confocal with F. These surfaces F', F", F'" intersect each other and are mutualIy per- pendicular. One of them is an elIipsoid, an- other one a hyperboloid of one sheet, and the third a hyperboloid of two sheets. Let kj, k 2 , k3 be the values of the parameter k in (11) corresponding to these three surfaces. Then the coordinates x, y, z of the point X are given by x= f(a+k 1 )(a+k 2 )(a+k 3 ) , .y- (b-a)(c-a) v= f0+kd(b+k 2 )(b+k 3 ) . .y- (a-b)(c-b) , z= Kc. '+ktJ(c+k 2 )(c+k 3 ) . .y- (a-c)(b-c) We calI k 1 , k 2 , k3 the elliptic coordinates of the point X. Two points (x, y, z), (x', y', z') are calIed corre- sponding points if they belong to confocal quadrics of the same kind, x 2 i Z2 -+-+-= 1, a b c (X')2 (y')2 (Z')2 -+-+-=1, a+k b+k c+k 1300 and satisfy x x' y y' - jb Jb+k' fi Ja+k ' z z' fi fi+k' If PI, P 2 ; Qj, Q2 are corresponding points, then P 1 Q2=P 2 QI (J. Ivory). F. Circular Sections When the intersection of a plane and a quadric is a circle, the intersection is called a circular section. In general, circular sections are cut ofT by two systems of paralIel planes through a quadric. The point of contact on the tangent plane paralIel to these is an tumbilical point of the quadric. G. Quadric Hypersurfaces A subset F of an n-dimensional Euclidean space En is calIed a quadric hypersurface (or simply hyperquadric) if it is the set of points (x 1, ... ,x n ) satisfying the folIowing equation of the second degree: n n I aikXiXk+2 I bixi+C=O, i,k=l i=l (12) where a ik , b;, c are all real numbers. We can assume without loss of generality that the matrix A = (a ik ) is symmetric. Assume that A is not a zero matrix. In the case n = 2, F is a conic, and in the case n = 3, it is a quadric surface. The theory of classification of quad- ric surfaces can be generalized to 1 he n- dimensional case as folIows: Let r(A *) = r* be the rank of the (n+ 1) x (n+ 1) ma1 rix l al1 a1n b1 ] A*= anI ann b n b 1 b n C {, 1 ] b n ' b n C A and put r(A)=r. Then we have thl folIowing three cases: (I) r=r*; (II) r+ 1 =r*; and (III) r+ 2 = r*. Corresponding to each cas,, equation (12) can be simplified (by a coordinate trans- formation in En) to the foIlowing canonical forms, respectively: (I) r I AiX(=O, i=1 (II) r  Xx2 + 1=0  I I , i=1 r (III) I AiX( + 2Xr+1 = 0, ;=1 
1301 where (AI' ..., A" 0, ...,0) (with n - r zeros) is proportional to the teigenvalues of the matrix A. In general, we have 1  r  n. In the cases where r=n in forms (I) and (II) and r+ 1 =n in (III), the hypersurface is called a properly (n- 1 )-dimensional quadric hypersurface, and in other cases, a quadric cylindrical hypersurface. In cases (I) and (II), the quadric cylindrical hypersurface is the locus of (n - r)-dimensional subspaces passing through each point of a prope'rly (r - I)-dimensional quadric hypersur- face and parallel to a fixed (n-r)-dimensional subspace. In case (III), the quadric cylindrical hypersurface is the locus of(n-r-l)-dimen- sional subspaces passing through each point of a properly r-dimensional quadric hypersur- face and being parallel to a fixed (n-r-l)- dimensional subspace. For form (I) with A;>O (i = 1, ..., n), a properly (n - I)-dimensional quadric hypersurface reduces to a point in En; for form (II) with A;>O (i= 1, ...,n) it becomes the empty set. Suppose that we are given a quadric hypersurface F that is neither a point nor empty. Then the system {A!,..., Ar} as- sociated with F in its canonical equation is unique up to order (and signature in form (III)). Suppose that we are given a quadric surface F and a point P on F. Suppose further that if a point X other than P is on F, then the whole straight line P X lies on F. In this case, the hypersurface is called a quadric conical hypersurface (or simply quadric cone). For example, for case (I), we can take P= 0 (the origin), and the hypersurface is a quadric cone. In cases (I) and (II), the hypersurface is sym- metric with respect to the origin. In these cases, a hypersurface is called a central quadric hypersurface, and in case (III), it is called a noncentral quadric hypersurface or parabolic quadric hypersurface. When we cut a parabolic quadric hypersurface by a (2-dimensional) plane containing the xn+1-axis, the section is a parabola. If A; < 0 (i = 1, ... , r) in form (II), then the surface is called an elliptic quadric hyper- surface, and if there are both positive and negative numbers among the Ai' the surface is called a hyperbolic quadric hypersurface. The section of an elliptic quadric hypersurface by a plane is always an ellipse. The section of a hyperbolic quadric hypersurface by a plane is an ellipse, a hyperbola, or two straight lines. In general, the section of a quadric hypersurface by a subspace is a quadric hypersurface on that subspace. H. Quadric Hypersurfaces in an Affine Space In Section G we considered a quadric hyper- surface defined by (12) in an n-dimensional Euclidean space En and transformed the equa- tion to canonical form by an orthogonal 350 Ref. Quadric Surfaces transformation of coordinates in En. Ifwe regard En as an n-dimensional taffine space over the real number field and reduce (12) to the simplest form by a coordinate transforma- tion in the affine space, we have the following canonical forms corresponding to cases (I), (II), and (III) discussed in Section G: s r (I) (s, t): L xf - L xJ =0, i=l j=s+l s r (II) (s, t): L xf - L xJ + 1 = 0, i=! j=s+l s r (III) (s,t): L xf- L x]+2xr+1 =0, i=l j=s+l where Osr and r-s=t. The terms properly (n - 1 )-dimensional, cylindrical, conical, para- bolic, elliptic, and hyperbolic can be defined in terms of this affine classification. For example, a cone is of type (I), a parabolic hypersurface is of type (III), an elliptic hypersurface is of type (II) (0, r), a hyperbolic hypersurface is of type (II) (s, t) (s, t >0), and type (II) (s, 0) represents the empty set. A necessary and sufficient con- dition for a (nonempty) hypersurface to be represented by two canonical forms N(s, t), N'(s', t') is that (i) N = N' and (ii) if N = (I) or N =(111), then s=s', t=t' or s=t', t=s', and if N =(11), then s=s', t= t'. I. Quadric Hypersurfaces in a Projective Space Suppose that we are given a field K of charac- teristic not equal to 2 and an n-dimensional tprojective space pn over K. A subset F ofpn is called a quadric hypersurface (or simply hyper- quadric) if F is represented by a homogeneous equation of the second degree L.k=Oaikxixk = 0, where (xo, XI' ..., x n ) are homogeneous coordinates in pn and aikEK; A = (a;k) is a nonzero symmetric matrix. The problem of classifying such surfaces is reduced to that of tquadratic forms or, equivalently, to that of symmetrix matrices in K. Two symmetric matrices A and B are equivalent if there exists a regular matrix T such that B = ITA T ( 348 Quadratic Forms). In particular, when K is an talgebraically closed field or a treal closed field, a simple result is obtained. If K is an algebraically closed field, then the equation of the quadric hypersurface is reduced to the canonical form L=oxf=O, where r=r(A)=the rank of A. When K is a real closed field, then the canonical form is Li=o xf - LJ=S+1 xJ = o. References [1] G. Salmon, A treatise on the analytic geom- etry of three dimensions, Hodges, Figgis & Co., seventh edition, 1928 (Chelsea, 1958). 
351 A Quantum Mechanics [2J G. Salmon and W. Fiedler, Analytische Geometrie des Raumes I, Teubner, fourth edition, 1898. [3J H. F. Baker, Principles of geometry III, Solid geometry, Cambridge Univ. Press, 1922. [4J O. Schreier and E. Sperner, Einfiihrung in die analytische Geometrie und Algebra II, Vandenhoeck & Ruprecht, 1951; English translation, Projective geometry of n- dimensions, Chelsea, 1961. [5J M. Protter and C. Morrey, Analytic geom- etry, Addison-Wesley, 1975. 351 (XX.23) Quantum Mechanics A. Historical Remarks tNewtonian mechanics (classical mechanics) successfully explained the motion of mechan- ical objects, both celestial and terrestrial, on a macroscopic scale. It failed, however, to explain blackbody radiation, which was dis- covered in the last decade of the 19th century. M. Planck introduced a hypothesis of discrete energy quanta, each of which contains an amount of energy E equal to the frequency of the radiation v multiplied by a universal con- stant h (called Planck's constant). He applied this hypothesis to derive a new formula for radiation that gives predictions in good agreement with observations. A. Einstein proposed the hypothesis of the photon as a particlelike discrete unit of light rays. Assum- ing that many physical quantities, including energy, have only discrete values, N. H. Bohr explained the stability of electronic states in atoms. As illustrated in these examples, quan- tum mechanics is applied to study the motion of microscopic objects, including molecules, atoms. nuclei, and elementary particles. B. Quantum-Mechanical Measurement Fundamental differences between the new mechanics and classical mechanics are due to the facts that many physical quantities, for example, energy, can take only discrete values in the microscopic world, and that states of microscopic objects are disturbed by observation. A (pure) state at a certain time is expressed by a unit vector IjJ in a tHilbert space ff, and observables, or physical quantities, are ex- pressed by tself-adjoint operators in such a space. Let an (n = 1,2, ... ) be teigenvalues of an observable A, and let Pn be a tprojection opera- 1302 tor onto the eigenspace spanned by teigen- vectors belonging to the eigenvalue an' Sup- pose that A =LnanPn. Then the hypothesis on measurement in quantum mechanics is given as follows. When an observable A is observed in a state 1jJ, one of the eigenvalues an is found with probability proportional to (1jJ, PnljJ)2. When an eigenvalue an is once observed, a :;tate jumps from IjJ to an eigenstate PnljJ which belongs to the eigenvalue an' Quantum mechanics pre- dicts only a probability Pn with which a certain value an is found when an observable A is observed. This probability, given by (1jJ, PnljJ), is not changed even if IjJ is replace:d by eiljJ, 01.I2n. Therefore e i8 1jJ represents the same state as 1jJ. The set of e i8 1jJ, 0  1.1  2n, for a fixed IjJ (111jJ11 = 1) is called a unit ray. If Pn is I-dimensional and Pn cP ,= cp, II cp II = I, then (1jJ, PnljJ) = 1(1jJ, cp)1 2 is called the transition probability between the two staten. The expectation value (or expectation) of an operator A in a state 1jJ, usually normalized to (1jJ,1jJ)= I, is defined to be <A) =(I/J,AIjJ) = LnanPn. A general self-adjoint operator A can be written as A = J AdP(A). When A is observed in a state 1jJ, the probability for a val ue to be found between AI and A2 (A2 > AI: A2 included and AI excluded if P(A) is right continuous) is (IjJ(P(A2) - P(A 1 )), 1jJ) ( 390 Spectral Analysis of Operators). The quantity (cp, AIjJ) is called the matrix element of A between the two states cp and 1jJ. A state IjJ can be viewed as a functionalljJ(A) = <A) on the set of all observables A (its value being the expectation), which is linear in A, positive in the sense IjJ(A* A) O for any opera- tor A, and normalized: 1jJ(1) = 1. If 0  A  I and IjJ(A)=AljJdA)+(I-A)1jJ2(A) for all obser- vables A, then the state IjJ is called a mixture of states IjJ 1 and 1jJ2 with weights A al1d (1- A). If a state is not a nontrivial (i.e., k;iO, 1,1jJ1 ¥-1jJ2) mixture, it is called pure. The state <A) = (1jJ, AIjJ) on the set of all self-adjoint operators A given by a vector IjJ is pure in this sense. If sup.IjJ(A.) = IjJ(A) whenever A is an increasing net of positive operators with A a s its limit, then IjJ is called normal. Any normal positive linear functional on the set of all self-adjoint operators can be described by a trace-class positive operator p, called the density matrix, as <A)=tr(Ap). If {ljJn} is a complete ortho- normal set, where each IjJn is an eigenvector of p belonging to the eigenvalue An' then <A)= Ln An (IjJn, AljJn). C. Canonical Commutation Relations In quantum mechanics, canonical variables are represented by the self-adjoint operators 
1303 Qk (coordinates) and P k (momenta), k= 1, ..., N, which satisfy the canonical commutation relations [Qk, pa = iM k1 l, where 1 is the identity operator, [A, B] de- notes the commutator AB-BA, h=h/(2n), and the relation is supposed to hold on a certain dense domain of vectors. Self-adjoint opera- tors Qk and P k satisfying the above relations are unique up to quasi-equivalence under a suitable domain assumption, e.g., if Lk Q + LkPf' is essentia11y self-adjoint on a dense domain invariant under multiplication of the Q's and P's and on which the above relations are satisfied. Under such an assumption, Qk and P k are unitarily equivalent to a direct sum of the Schrodinger representation on L2(RN,dxl...dxN)' where Qk is multiplication by the kth coordinate Xk and P k is the differen- tiation - ih(8/8x k ) (Rellich-Dixmier theorem). The above Schrodinger representation is caned the position representation (or q- representation). The formulation using the function space L 2 of real variables Pk, k = 1, 2, ... , N, on which the operators P k act as multiplications by Pk, is ca11ed the momentum representation (or p-representation). If Hermitian operators A and B satisfy the canonical commutation relations in the form (At/J, Bt/J)-(Bt/J, At/J)= ih(t/J, t/J), then the fo11ow- ing Heisenberg uncertainty relation holds for the expectation: h 2 «A -<AW) «B-<BW):;'4"' This gives the uncertainty in observations, which means that two observables A and B cannot simultaneously be observed with ac- curacy. This is another important property of microscopic motion that cannot be found in macroscopic motion. In a direct sum of the Schrodinger repre- sentation of the canonical commutation re- lations, the unitary operators V(a)=expi>kQk' V(b) =exp il)kPk k k with real parameters a k and b k , k=l, ...,N, satisfy the fo11owing Weyl form of the canon- . ical commutation relations: V(a) V(a') = V(a+a'), V(b) V(b') = V(b+b'), V(a) V(b) = V(b)V(a)ex p ( -iakb} Conversely, any pair of families of unitary operators V(a) and V(b) satisfying these rela- tions and depending continuously on para- meters a and b are unitarily equivalent to those obtained as above (von Neumann unique- ness theorem). 351 D Quantum Mechanics D. Time Evolution and the Schrooinger Equation The time t of an observation is fixed in the foregoing discussion. A state changes, how- ever, as the time t changes, in such a way that the transition probability between states is preserved. By Wigner's theorem ( Section H), this time evolution of states can be im- plemented by unitary operators V, defined by the transformation of vectors t/J-+V,t/J = t/J,. Furthermore, under some continuity assump- tion, such as that of (qJ, V,t/J), V, can be made a continuous one-parameter group. By Stone's theorem, V, = e -iHt/h for a self-adjoint operator H. This operator is caned the Hamiltonian operator (or simply Hamiltonian) determined by the structure of a system. An infinitesimal change in t/J corresponding to an infinitesimal change in t can be generated by this operator H as fo11ows: ih at/J' =Ht/J,. at This equation is ca11ed the time-dependent Schrodinger equation. A state t/J changes but observables do not change with time in the Schrodinger pic- ture above. The other picture, known as the Heisenberg picture, is equa11y possible. In this picture, the state is expressed by a time- independent vector, while operators A vary with time as fo11ows: A-+Vr* AV,=A(t). Rates of change of operators A(t) can be calculated by means of the equation dA f t ) i dt=r;[H,A(t)], which is ca11ed the Heisenberg equation of motion. When time t changes, the expectation value of an operator A in a state t/J changes in both pictures according to d<A)/dt=i<[H, A])/h. According to classical analytical dynamics, a change of a dynamical quantity that is a func- tion of tcanonical variables qj (positions) and Pi (momenta) is given by dA/dt= -(H,A), where H is a tHamiltonian function and the parentheses ( , ) denote the tPoisson bracket. A replacement of the Poisson bracket (A, B) by [A, B]/ih transforms this classical equation into the quantum-mechanical equation above. It should be noticed that the mathematical structure of the Poisson bracket is similar to that of commutator. In this transition from classical to quantum mechanics the corre- spondence principle can be used. This requires that the laws of quantum mechanics must lead 
351 E Quantum Mechanics to the equations of classical mechanics in the classical situation, where many quanta are involved and h can be regarded as infinites- imalIy small in the commutation relation. The correspondence principle suggests that Hamiltonian operators in quantum mechanics can be obtained from Hamiltonian functions H(Pb qk) of canonical variables Pk and qk in classical mechanics after replacing Pk and qk by the operators P k and Qk in the Schrodinger representation (up to uncertainty of about the order of operators). This process of moving from canonical variables and the Hamiltonian function in classical mechanics to canonical operators and the Hamiltonian in quantum mechanics is calIed quantization. Taking a system of s particles and letting x k , Yb and Zk be the Cartesian coordinates of the kth parti- cle, we usuaIly write the equation of motion as . oljJ ( , h 2 Ih--= - I -k ot k1 2m k + V(X 1 'Y1,ZI' ...,X"Y"Zs))IjJ, which is a second-order partial differential equation. Here m k is the mass of the kth par- ticle; k is the tLaplacian of Xk, Yb and Zk; and V is a real function called the potential energy. This equation is the time-dependent Schrodin- ger equation. The partial differential operator on the right-hand side is ca1\ed the (s-body) Schrodinger operator an d IjJ (x j, Y 1 , Z 1, ... , x" y" Z" t) is called a wave function. The prob- ability of finding a particle in the volume dxkdYkdzk bounded by X k , X k +dx k , Yk' Yk +dYb and Zb Zk+dzk is proportional to /1jJ(Xj,Y1,ZI, ...,x"y"z"tW. Usually 11jJ1 2 is normalized so that its integral over the whole space is 1. We sometimes call1jJ the probability ampli- tude. When IjJ is given by e-iEt/hcp(xj, ...,z,), the expectation value of an operator A in a state 1jJ, (A) = JIjJ* AIjJ dX 1 '" dz" does not de- pend on time. When this is the case, IjJ is called a stationary state. A real value E and a function cp(x 1, ..., Zs) are found by solving an teigenvalue prob- lem H cp = Ecp. This equation is the time- independent Schrodinger equation, which is a second-order partial differential equation. Since the Hamiltonian H stands for the energy of this system, the eigenvalues E are the energy values that this system can take. When a potential function V is given, it is a nontrivial matter to prove that the (s-body) Schrodinger operator with the given V is es- sentially self-adjoint on the set of, for example, all COO-functions with compact supports so that its closure H defines mathematically the continuous one-parameter group of unitaries U,=e- iHl / h for the time evolution of the quan- 1304 tum system of s particles with the given inter- action potential V. If V satisfies an estimate II VIjJ II  }dl HoljJ II + flll IjJ II (called thl Kato per- turbation on Ho) for some nonnegative A < 1 and fl;::' 0 and for alIljJ in a dense domain on which Ho is essentialIy self-adjoint, where Ho denotes the Schrodinger operator with V=O (called the free Hamiltonian or the kinetic energy term), then Ho + V is essentially self- adjoint on the same domain. Fo! the case where V consists of Coulomb interactions between electrons and Coulomb potentials on electrons by fixed nuclei, for example, such an estimate and hence the essential self- adjointness of Hamiltonians for atoms and molecules were established first by T. Kato (Trans. Amer. Math. Soc., 70(1951)). For a I-body Schrodinger operator (or 2- body Schrodinger operator after the center of mass motion has been separated out), the point spectrum is that of the particle trapped by the potential, and the state represented by its eigenvector is called a bound state. The eigenvalue is nonpositive for a reasonable class of potentials V (for example, if V(x) (xER 3 ) is continuous and (O(lxl- 1 -') as Ixl--+oo for some f. > 0), and its absolute value -is calIed the bind- ing energy. The eigensolutions of the Schro- dinger equation are what have been called stationary states above. There are also stationary solutions that do not correspond to the point spectrum and hence are not square integrabl,e. They are used in the stationary methods of scattering theory ( 375 Scattering Theory). E. Some Exact Solutions for the I-Body Schrodinger Equation (1) Harmonic oscillator. First consider the case in which the space is of 1 dimension, so that the Laplacian  is (d/dx)2 Let m be the mass of the particle and V(x)= mw 2 X2 /2 for a posi- tive constant w (calIed the angular frequency). The Hamiltonian H= -(h 2 /2m)(d/dx)2 + mw 2 x 2 /2 has simple eigenvalues En = hw(n +(1/2)), n =0,1,2, ..., with a complete orthonormal set of eigen- functions IjJn(X) = cnHn(q)e-q2/2, q = (mw/h) 1/2 X, where Hn(q) is a tHermite polynomial and C n is the normalization constant: [n/2] Hn(q)= I (-I)k n !(2q)"-2k/{(n-2k)!k!}, kO C n = {22n(n!)2 n h/(mw)} -1/4. 
1305 When the space is of r dimensions, n in En is replaced by n 1 + ... + n, with nonnegative integers n 1 , ..., n" and the corresponding eigen- function is n;=1 I/Inj(Xj). (2) One-dimensional square-well potential. Let V(X) = V for Ixi <::;aj2 and V(x)=O for Ixi > aj2 (xER).1f V:;, 0, there are no point spectra. If V < 0 and N -1 <a( -2mV)1/2j(nh)<::;N, then there are N eigenvalues (N = 1,2,...) obtained as the roots E < 0 of one of the fol- lowing equations: {(VjE) -1 } 1/2 = tan {a( - 2mE)1 / 2j(2h)}, {(VjE) -I} 1/2 = -cot{a( - 2mE)1 / 2 j(2h)}. (3) Separation of angular dependence for central potential. If V(x) (xER 3 ) depends only on r= Ixi = (Ll=1 (X/) 1/ 2 (caned a central poten- tial), then an eigenvalues E and a basis for eigenfunctions I/I(x) can be obtained in terms of the polar coordinate r, 8, qJ (XI =rsin8cosqJ, X2 =rsin8sinqJ, X3 =rcos8) as 1/1 (x) = Yim(8, qJ)r-Iu(r), -(h 2 j(2m))u"(r) + WI(l+ 1)j(2mr) + V(r)-E}u(r)=O, 111/111 = I" lu(rWdr< 00, where the angular function Yim is an eigenfunc- tion of the square L 2 of the orbital angular momentum L = - ix x V: Yim = c1mP!"(cos 8)e im <P, Cl m =( -In(21+ 1)(l-m)!j(4n(l+m)!W I2 , m=l, 1-1, .'., -1+ 1, -I, 1=0,1,2, .... Here P!,,(x) is an tassociated Legendre polynomial: P!,,(x) =(1 - x2)mI2(d 1 + m (x2 - l)ljdxl+m)j(t I!). The above equation for u(r) is caned the radial equation. The nonnegative integer I is the azimuthal qnantum nnmber, and the integer m is the orbital magnetic quantum number. The wave function I/I(x) with the angular depen- dence Yim(8, qJ) is caned the S-wave, P-wave, D- wave,... according as 1=0, 1,2, .... (4) Hydrogen-type atom. Let V(r) = -Ze 2 jr (Z>O). For each I and m, there are eigenvalues _e 2 Z2j(2an 2 ) with eigenfunctions I/Inlm= r-Iunl(r) Yim(8, qJ), where n= 1+ 1, 1+ 2,... is the principal quantum number, un1(r) = cnlLW1)(s)sl+1 e -sI2, S = 2Zrj(na), C n1 = - {(n-l-l)![2Zj(na(n+ 1)!)]3 j(2nWi2, and L(x) is the flth derivative of the tLaguerre 351 F Quantum Mechanics polynomial N LN(x)= I (N!)2( -x)'j{(N -V)!(V!)2}. v=O The eigenvalue is determined by n, and its multiplicity is n 2 , corresponding to the differ- ent possible values of I and m. F. Path Integrals R. P. Feynman (Rev. Mod. Phys., 20 (1948)) has given the solution of the Schrodinger equation as an integral of e iLlh over an possible paths q(t), where L= L(q, q) (q = (djdt)q(t)) is the classical Lagrangian for the Hamiltonian system. This integral is caned the Feynman path integral. Mathematical reformulation of the formula in terms of the Wiener measure has been given by M. Kac (Proc. 2nd Berkeley Symp. Math. Statist. Probability, 1950; Proba- bility and related topics in the physical sciences, Wiley, 1959). Consider the I-body Schrodinger operator H = Ho + V (form sum), where V is the sum of a locany integrable function bounded below and a Kato perturbation on Ho. Let b(t) (t:;'O) be the Wiener process and q(t) = hb(t)j(2m) 1/ 2 . For any L 2 functions f, (e -tHlhf)(x) =E(f(x+q(t))ex p { - t V(x+q(S))dSjh}) for almost an x, where E denotes the expecta- tion for the Wiener process. If V is a sum of L 2 and Loo functions (for spatial dimension <::;3), then the right-hand side is continuous in X for t> O. This is caned the Feynman-Kac formula. Let Lo be the Hamiltonian for a I-dimen- sional harmonic osci11ator with m = ill = h = 1 and % be the eigenfunction %(x) = n- 1/4 exp( -x 2 j2). Consider H =Lo + V (form sum), where V is a sum of a locany integrable function bounded below and a Kato pertur- bation on Lo. Let q(t) (tER) be Gaussian ran- dom variables with mean 0 and covariance E(q(t)q(s)) =2lexp( -It -sl) (caned the oscil- lator process). For any fj(x) in L 2 (R, 1/15 dx) (j = 1, ... ,n) and to <::; t I.'. <::; t n <::; tn+1' (l/Io,e-(t 1 -to)H fl e-(t 2 -t,)H f2" .J.e-(tn+l -tn)HI/IO) =E( {I] fj(q(tj))}ex p { - [+I V(q(S))dS}). The above path integral formulas are closely related to the Trotter product formula e-t(A+B)= lim (e -tAln e -tBlnr (t:;, 0), noo where A and B are self-adjoint operators 
351 G Quantum Mechanics bounded below and A + B is essentially self- adjoint. (The same formula holds without the boundedness assumption when tEiR.) G. The Dirac Equation The Schrodinger equation is not relativistically invariant. The Klein-Gordon equation (0 _K 2 )1jJ=0, 1 a 2 0=-2 at2 ' mc K=- h is obtained by replacing Pk by (h/i)a/ax k (k = 1,2,3) and E by ihajct in the relativistic identity E2=m 2 c4.+ p 2C 2 , where c is the speed of light. Wave functions of free particles are believed to satisfy this eq ua- tion. P. A. M. Dirac assumed that the IjJ of a free electron is expressed in terms of a tspinor with four components satisfying a linear dif- ferential equation that automatically implies the Klein-Gordon equation. Relativity re- quires the equal handling of space and time. The Dirac equation 3 aljJ L y-+iKIjJ=O O ax (XO = ct) satisfies these requirements. The coefficients y can be so determined that every component of IjJ also satisfies the Klein- Gordon equation. Thus the y are found to be 4 x 4 matrices satisfying the commutation relations y"y v + yV y "= 2g"V (/1, v =0,1,2,3), where gv=O for J1#v and gOo= _gkk= 1 (k= 1,2,3). Sixteen linearly independent matrices are obtained by repeated multiplication of five matrices, which include the four matrices yO, yl, y2, y3 and the identity matrix. Any 4 x 4 matrix can be expressed as a linear combina- tion of these sixteen matrices. The Dirac equation has tplane wave solutions ljJ(x, t) = u exp((i/h)p' x - (ifh)Et), wher e the energy eigenvalues E are ::!:J"m 2 c4. + p 2 C 2 . There are four independent eigensolutions u(l), U(2), u(3), and u(4.), because u has four compo- nents. Two of them are of positive energy and the other two are of negative energy. Although the negative energy case is physically undesir- able, it has to be taken into account in order to obtain a mathematically complete set. To solve this difficulty, Dirac proposed the hypoth- esis (Dirac's hole theory) that all the negative energy states are filled up by an infinite num- 1306 (8) ber of electrons in the normal state of the vacuum. The absence from the vacuum of a negatively charged electron in a negative energy state could then be expected to mani- fest itself as a positively charged particle (posi- tron) with positive mass and kinetic energy. If gamma rays are absorbed to excit, an electron from a negative energy state into a positive energy state, an electron-position pair must be created. Y. Nishina and O. Klein calculated the cross section of Compton scattering (the Klein-Nishina formula) by use of the Dirac equation and found good agreement with observations, thus providing evidence that the Dirac equation is correct. The eXJstence of negative energy states, however, forces us to give up considering the Dirac equation as an equation of one electron. The positron theory is introduced, and the Dirac equation is considered as the classical field of electron waves and is second-quantized (. 377 Second Quantization). The Klein-Gordon equation can also be considered to be the classical wave equation of matter and can be second-quantized. Motions of particles with zero spin, pi mtsons (n) for example, obey this equation. We can rewrite the Dirac equation as ihaljJ/at=HIjJ, H=ccx.p+mc 2 /3, where yk= /3cx k (k = 1, 2, 3), and yO = /3. Since H cannot commute with the orbital anguJar momentum of an electron L=r x p= -ihr)( V, L is not conserved. However, the total angular momen- tum J = L + (h/2) a can be conserved when a is a vector whose components can be given as ( (Jk 0 ) ( 1 0 ) ( 0 (Jk ) o (Jk ,where /3= 0 -1 ,CX k = (Jk 0 ' and the Uk- called Pauli spin matrices, are given ( 0 1 ) ( 0 - i ) ( 1 0 ) bY(Jx= 1 0 ,(Jy= i 0 ,(Jz= 0 -1 . (The y are called Dirac's y matrices.) The quantity S=a is the intrinsic angular momen- 2 tum of the electron, also called the spin. Many particles besides the electron, the neutron for example have spin. The matri( 8 2 = s; + s; + S; is diagonal and is equal to s(s + l)h 2 f (f is the identity matrix). For the electron s = 1/2, and h/2 is called the absolute value of the spin. Therefore we say that electrons are particles of spin h/2. This was predicted in the theory of light spectra. When the speed of an electron is very small, so that (V/C)2 can be neglected, states of the electron can be expressed in terms of two- component wave functions. This approxi- mation is called the Pauli approximation. If the spin-orbit term that app(:ars in the Pauli approximation is also neglected, these two components become independent of each 
1307 other and individually satisfy the Schrodinger equation. H. Application of Representation Theory of Lie Groups A symmetry (with active interpretation) is a bijective mapping of pure states (repre- sented by unit rays in a Hilbert space) preserv- ing transition probabilities between them. Wigner's theorem says that any symmetry can be implemented by either a unitary or an anti unitary mapping of the underlying Hilbert space as a mapping of unit rays onto them- selves. Furthermore a connected Lie group of symmetries is implemented by unitaries, which form a projective representation. Eigen- states t/J of the Schrodinger equation are func- tions of the coordinates of each particle. Let these coordinates be denoted together as x. Suppose that an operator T operates on the x, as, for example, a rotation of the coordinate system or a permutation of the labels of the particles. If T commutes with H or H is invar- iant under the transformation x--+x' = Tx, then Tt/J(x) = t/J'(x) = t/J(T- 1 x) satisfies the same Schrodinger equation as t/J, where the trans- formation of the function is defined by t/J'(x') = t/J(x). The set of transformations x--+x' = Tx forms a tgroup {T}, and the corresponding transformations t/J --+ Tt/J give a (generally infinite-dimensional) representation of this group, which should be tunitary on L 2 -space relative to the Lebesgue measure dx if T leaves the measure invariant. There are degeneracies of the energy eigenvalues, each of which is equal to the dimension of the corresponding representation of the group {T}. If the repre- sentation for each eigenvalue is decomposed into irreducible ones, then the decomposed stationary state can be labeled by an tirreduc- ible representation. When H is spherically symmetric, i.e., H is invariant under the 3-dimensional rotation group, states are classified by the irreducible representations D L of the trotation group ( 258 Lorentz Group). The square of the sum L of all orbital angular momenta has eigenvalues L(L + l)h 2 , where L must be 0 or a positive integer. There are 2L + 1 degenerate states, each of which belongs to a different M, the z-component of L, where M ranges from L to -L by unit steps. Even when there is an inter- action between the orbital angular momentum and the spin angular momentum, states are labeled by the irreducible representations DJ of the rotation group, where J is the sum of the orbital angular momentum L and the spin angular momentum S (J = L + S) and eigen- 351 H Quantum Mechanics values of J2 are given by J (J + l)h 2 . Each J must be zero, a positive integer, or a half- integer. Adding inversions to the pure rota- tions, we obtain the 3-dimensional orthogonal group ( 60 Classical Groups I). Irreducible representations of this group are written as Df, where::!:: corresponds to the characters of the inversion relative to the origin. States with + are called even states and those with -, odd states. For example, energy levels of atoms and nuclei can be classified by Df. To obtain matrix elements of observables between two stationary states, group repre- sentation theory is useful. The transformation of every observable obeys a certain rule under the transformation of coordinates. The scalar is transformed according to Do+ , the vector according to D 1 , the pseudovector according to Dt, and the traceless tensor according to D;. Ifthe transformation of an observable is given by DJo then a matrix element of this observable between the states belonging to Dr and Dr vanishes unless the tensor product representation DJ @ Dr contains as a factor a representation equivalent to Dr. In electro- magnetic transitions in atoms or nuclei, D 1 @ Dr=Dr+1 +Dr+Dr-1 (J' 1) if the electric dipole transition dominates (J = 1). This im- plies the selection rule J' --+J' + 1, J', J'-1. When J' = 0, only the transition 0--+ 1 is pos- sible. More general selection rules can be obtained in the same way for general multi- pole transitions. Representation theory is useful in determining general formulas of transition strengths. There is a class of particles, many of which can occupy the same state, called bosons. There is another class of particles, of which only one can occupy a given state, called fer- mions. For example, the electron, neutron, and proton are fermions, while the photon and pi meson are bosons. Two identical particles, both of which are either fermions of the same kind or bosons of the same kind, cannot be distinguished. Therefore the Hamiltonian should be invariant under permutations be- tween identical fermions, or between identical bosons. A system consisting of N identical particles can be classified by the irreducible representations of the tsymmetric group SN of N elements. When the particles are fermions, two of them cannot occupy the same state (this law is called the Pauli principle), so that only totally antisymmetric states are permissible for fermions. When a system consists only of fermions of the same kind with spin h/2 and a Hamiltonian of this system does not include terms depending on spins, then the wave func- tions are just products of spin and orbital parts. In order to make wave functions totally 
351 I Quantum Mechanics antisymmetric, orbital wave functions with a total spin vj2 should be limited to those corre- sponding to the ty oung diagram [2N", 1 "J = T(2, 2, ... , 1, 1). I. Polyatomic Molecules The group generated by the 2-dimensional rotations about the axis connecting two atoms and the reflections with respect to the planes containing this axis is used to classify states of diatomic molecules. Stationary states can be classified by the absolute value A. of the angular momentum of the diatomic system around this axis, which can be zero or a posi- tive integer. When A.;::, 1, the corresponding state has twofold degeneracy, whereas when A. = 0, two states (labeled by :!:) arise, depending on the character of the reflections. If these two atoms are identical, the molecular states can be classified further as even and odd according to the character of reflections with respect to the plane containing the center of mass and perpendicular to the axis. The classification of the spectral terms of a polyatomic molecule is related to its symmetry, described by the set of all transformations that interchange identical atoms. For exam pIe, stationary states of meth- ane molecules CH 4 are classified by the irre- ducible representations of the group T.1 (which is generated by adjoining reflection symmetry to the ttetrahedral group T). Level structures of a crystal are classified by the irreducible rep- resentations of its symmetry groups. In the approximation of many-body prob- lems by means of independent particles, the wave function of the total system is con- structed by multiplying the wave functions of the individual particles. To construct such a wave function, it is useful to consider the re- duction to irreducible parts of the ttensor products of representations of the groups attached to the individual particles. For exam- ple, an atom with two electrons carrying the angular momenta J 1 and J 2 has 2J' + 1 differ- ent angular momentum states, where J' = min(J 1 , J 2 ) corresponding to the decompo- sition D J , @D J , =D J , +J, +D J [ +J21 +... + D 1J ,-J 2 1' The right-hand side of this equation gives all possible states of the atom. J. Charge Symmetry The proton and the neutron can be considered to be different states of the same particle, called the nucleon, because these two particles have very similar natures except for their charges, masses, etc. As an approximation, the Hamiltonian of a system consisting of protons 1308 and neutrons may be taken to be invariant under the interchange of protons and neu- trons. This invariance is called charge sym- metry. Analogous to ordinary spin, isospin can be introduced to describe the two states of nu- cleons. The up state of the isospin corresponds to the proton and the down state corresponds to the neutron. Consider transformations belonging to the tspecial unitary group SU(2) in the 2-dimensional space spanned by the proton state and the neutron state. If a Hamil- tonian of N nucleons is invariant under any transformation belonging to SU(2), then the eigenstate of these N nucleons is classified by its irreducible representations DT' where T stands for the total isospin of each state. This invariance, .called isospin invariance, holds in the nucleus and elementary particles if electromagnetic and weak interactions, and possibly the interaction responsible for the proton-neutron mass difference, are neglected. When a state of N nucleons has an isospin T= vj2, the orbital-spin wave function of this state must correspond to the Young diagram [2N", 1 "J, since the isospin wave function multiplied by the orbital-spin wave function is a totaIly antisymmetric wave funcl:ion. If this Hamiltonian is also independent of spin, it is invariant under unitary transformations in the 4-dimensional space spanned by the four inter- nal states of the nucleon: up and down spins, up and down isospins. Therefore the states of N nucleons can be classified by th, irreducible representations of the group U(4) (Wigner's supermultiplet theory). K. The C*-Algebra Approach The uniqueness of operators satisfying the canonical commutation relations (represen- tations of CCR's) up to quasi-equivalence ( Section C) no longer holds if the number of canonical variables become infinit, (a so-called system of infinitely many degrees of freedom), a point first emphasized by K. O. Friedrichs (Mathematical aspects of the quantum theory of fields, Interscience, 1953), and physical examples ilIustrating this point were given by L. van Hove (Physica, 18 (1952)) and R. Haag (Mat. Fys. Medd. Danske Vid. Selsk., 29 (1955)). The use ofC*-algebras in physics was first advocated by I. E. Segal (Ann Math., 48 (1947)), and the physical relationship among all inequivalent representations of a C*-algebra was first discussed by R. Haag and D. Kastler (J. Math. Phys., 5 (1964)). In C*-algebra approach, a physical obser- vable is an element of a C*-algebra  and a state is a functional cp on  (its value cp(A) is 
1309 the expectation value of the observable A E  when measured in that state) that is linear, positive in the sense cp(A* A):;' 0 for any A E, and normalized, i.e., Ilcpll = lor equivalently cp(I)= 1 if 1 E. The tGNS construction associ- ates with every state cp a Hilbert space H"" a representation n",(A), A E, of  by bounded linear operators on H"" and a unit cyclic vec- tor Q", in H", such that cp(A)=(Q"" n",(A)Q",). Two states cp and 1/1 (or rather n", and nip) are called disjoint if there is no nonzero mapping T from H", to Hip such that Tn",(A)=nlp(A)T for all A E. Abundant disjoint states occur for a system of infinitely many degrees of freedom, e.g., equilibrium states of a infinitely extended system with different temperatures ( 402 Statistical Mechanics), superselection sectors explained below ( 150 Field Theory) and equilibrium or ground states with broken symmetry. Because actual measurement can be per- formed only on a finite number of observables (though chosen at will from an infinite number of possibilities) and only with nonzero experi- mental errors, information on any state cp can be obtained by measurements only up to a neighborhood in the weak topology: Icp(A j )- ad < B;, j = 1, ... , n. A set K 1 of states can describe measured information on states at least equally well as another set K 2 (K1 physi- cally contains K 2 ) if the closure of K 1 in the weak topology contains K 2 . From another viewpoint, all states of K 2 are weak limits of states in K 1 and are physically relevant if states in K 1 are physically relevant. The set of all mixtures of vector states (1/1, n(A)I/I) for any fixed faithful representation n of  is weakly dense in the set of all states of , a point em- phasized by Haag and Kastler as a foundation of the algebraic viewpoint in the formulation of quantum theory. Under 360° rotation a vector representing a state of a particle with spin "/2 acquires a factor -1 ( 258 Lorentz Group), while the vacuum vector would be unchanged. A non- trivial linear combination (superposition) of these two would then be changed to a vector in a different ray. If the 360° rotation is not to produce a physically observable effect, then we should either forbid nontrivial superpositions of states of the two classes or, equivalently, restrict observables to those leaving the sub- space spanned by vectors in each class invar- iant so that the relevant linear combinations of vectors, when considered as states on the algebra of observables in the form of expecta- tion functionals (1/1, AI/I), are actually mixtures (rather than superpositions) of states in two classes and are invariant under the 360 0 rota- tion. This is called the univalence superselec- tion rule and has been pointed out by A. S. 351 L Quantum Mechanics Wightman, G. C. Wick, and E. P. Wigner (Phys. Rev., 88 (1952)). In quantum field theory, the vacuum state can be taken to be pure (by central decompo- sition if necessary) and in the associated GNS representation (called the vacuum sector) all vectors can be assumed to be physically rele- vant pure states. In principle, all physically relevant information is in the vacuum repre- sentation; for example, a particle with spin "/2 can also be discussed in the vacuum sector if we consider a state of this particle in the pre- sence of its antiparticle at a far distance, such as behind the moon (the behind-the-moon argument). However, it is mathematically more convenient to consider the states of the par- ticle without any compensating object (in the same way that an infinitely extended gas is more convenient for some purposes than a finitely extended gas surrounded by walls), which can be obtained as weak limits of states in the vacuum sector by removing the com- pensating particle to spatial infinity and which produce inequivalent representations called superselection sectors. L. Foundation of Quantum Mechanics Hilbert spaces and the underlying field of complex numbers, which constitute a mathe- matical background for quantum mechanics, are not immediately discernible from physical observations, and hence there are various attempts to find axioms for quantum mechan- ics that imply the usual mathematical struc- ture and at the same time allow direct physical interpretation. One approach of this kind focuses attention on the set of all observables that have only two possible measured values 1 (yes) and 0 (no), called questions, together with their order structure (logical implications) and associated lattice structure (join, meet, and orthocomple- mentation as logical sum, product, and nega- tion). This is called quantum logic in contra- distinction to the situation in classical physics, where it would form a Boolean lattice. The lattice of all orthogonal projections (corre- sponding to all closed subspaces of a Hilbert space) in quantum mechanics is a tcomplete, torthocomplemented, weakly modular (also called orthomodular) tatomic lattice satisfy- ing the covering law, where weak modularity means c/\(c'vb)=b and bv(b' /\c)=c when- ever b  c, and the covering law means that every b=ftO possesses an atom p under it (pb) and that if an atom q satisfies q /\ b = 0, then any c between qvb and b (q vb:;, c:;' b) is b or q v b. Conversely any such lattice is a di- 
351 Ref. Quantum Mechanics rect sum of irreducible ones, each of which, if of dimension (length of longest chain) > 3, can be obtained as the lattice of subspaces V satisfying (V.l).l = V in a vector space over a (generally noncommutative) field with an anti- automorphic involution *, equipped with a nondegenerate Hermitian form. In this ap- proach, an additional requirement is needed to restrict the underlying field and its * to be more familiar ones, such as real, complex, or quaternion fields and their usual conjuga- tions *. If that is done, then the set of all pro b- ability measures on the lattice (i.e., assign- ment of expectation values 0  fl(a)  1 for all elements a in the lattice such that fl(Via i ) = Li fl(a i ) if a; 1- a J for all pairs i of- j, fl(a);::, 0, fl(I)= 1) is exactly the restriction to questions of states p(A) = tr(pA) given by the density matrices p (Gleason's theorem). It is also possible to characterize the set of all states equipped with the convex structure (mixtures) geometrically. The set of all states (without the normalization condition) of a finite-dimensional, formally real, irreducible tJordan algebra over the field of reals (the positivity of a state cp is defined by cp(a 2 );::, 0) has been characterized as a transitively homo- geneous self-dual cone in a finite-dimensional real vector space (a cone V is transitively homogeneous if the group of all nonsingular linear transformations leaving V invariant is transitive on the topological interior of V) by E. B. Vinberg (Trans. Moscow Math. Soc., 12 (1963); 13 (1965)), where the relevant Jordan algebras were completely classified earlier by P. Jordan, J. von Neumann, and E. P. Wigner (Ann. Math., 36 (1934)) as direct sums of the following irreducible ones: the Jordan algebra (with the product AoB=(AB+BA)/2) of all Hermitian n x n matrices over the real, com- plex, or quaternion field, all 3 x 3 Hermitian matrices over octanions, or the so-called spin balls (the set of all normalized states being a ball) linearly generated by the identity and Yj (j= 1, ...,n) satisfying YjOYk=O ifjof-k and yJ = 1. In infinite-dimensional cases, this type of characterization extends to the "natural" posi- tive cones of vectors (A. Connes, Ann. Inst. Fourier, 24 (1974); J. Bellissard and B. Iochum, Ann. Inst. Fourier, 28 (1978)); while the convex cone of all states (without normalization) of Jordan algebras and C*-algebras have been characterized in terms of a certain class of projections associated with faces of the cone, called P-projections, by E. M. Alfsen, F. W. Shultz, and others (Acta Math., 140 (1978); 144 (1980)). In finite-dimensional cases, Araki (Commun. Math. Phys., 75 (1980)) has given a characterization allowing direct physical inter- pretation by replacing P-projection with a 1310 notion of filtering corresponding to q uantum- mechanical measurement. Due to some features of quantum- mechanical measurement not in conformity with common sense, there have arisen hidden variable theories that are deterministic and reproduce the quantum-mechanical prediction. For a situation where a pair of (correlated) particles in states a and bare creat{:d and their spins (lor -1, i.e., up or down spin) measured at positions distant from each other, the expectation value E(a, b) for the product would be given in a hidden variables theory by E(a, b)= f Aa(..1.)Bb(..1.)dp(..1.) for a probability measure p and the functions Aa and Bb of hidden variables ..1., representing spins and hence satisfying IAal  1 and IBbl  I. Then the following Bell's inequality holds: IE(a, b)- E(a,b')+E(a',b)+ E(a', b')1 2. This contradicts both quantum-mechanical predictions and experimental resul1s, so that hidden variable theories of this typl have been rejected. References [1] P. A. M. Dirac, Principles of quantum mechanics, Clarendon Press, fourth edition, 1958. [2] A. Messiah, Quantum mechanics I, II, North-Holland, 1961, 1962. (Original in French, 1959.) [3J W. Pauli, General principles of quantum mechanics, Springer, 1980. (Original in Ger- man, 1958.) [4] S. Tomonaga, Quantum mechanics I, II, North-Holland, 1962-1966. [5] 1. von Neumann, Mathematical founda- tions of quantum mechanics, Princeton Univ. Press, 1955. (Original in German, 1932.) [6] B. L. van der Waerden, Sources of quan- tum mechanics, Dover, 1967. [7] B. L. van der Waerden, Group theory and quantum mechanics, Springer, 1974. [8] H. Weyl, The theory of groups and quan- tum mechanics, Dover, 1949. (Original in German, 1928.) [9] E. P. Wigner, Group theory and its appli- cation to the quantum mechanics of atomic spectra, Academic Press, 1959. [10] B. Simon, Functional integration and quantum mechanics, Academic Press, 1979. [11] G. G. Emch, Algebraic methods in statis- tical mechanics and quantum field theory, Wiley, 1972. [12] T. Bastin (ed.), Quantum theory and beyond, Cambridge Univ. Press, 1971. 
1311 [13] B. d'Espagnat, Conceptual foundations of quantum mechanics, Benjamin, second edition, 1976. Also  references to 150 Field Theory, 375 Scattering Theory, 377 Second Quantization, and 386 S-Matrices. 352 (XI.15) Quasiconformal Mappings A. History H. Grotzsch (1928) introduced quasiconformal mappings as a generalization of conformal mappings. Let I(z) be a continuously different- iable homeomorphism with positive Jacobian between plane domains. The image of an in- finitesimal circle Idzl = constant is an infini- tesimal eUipse with major axis of length (IIzI + 1h-l)ldzl and minor axis of length (IIzI- Ih-Dldzl. When the ratio K(z)=(IIzI+ Ih-D/(IIzI-Ih-D is bounded, I is called quasi- conformal. If K == 1, then I is conformal. Grotzsch noticed that Picard's theorem stiI1 holds under the weaker condition; he deter- mined the q uasiconformal mappings between two given domains, which are not conformally equivalent to each other, providing the smallest sup K, that is, those closest to conformality [1]. We cannot speak of the history of q uasi- conformal mappings without mentioning the discovery of extremal length by A. BeurIing and L. V. Ahlfors ( 143 Extremal Length), which has led to the precise definition for quasiconformality itself. Quasiconformal mappings have less rigidity than conformal mappings, and for this reason they have been utilized for the type problem or the classification of open Riemann surfaces (Ahlfors, S. Kakutani, O. Teichmiiller, K. I. Virtanen, Y. Toki;  367 Riemann Surfaces). Quasiconformal mappings have important applications in other fields of mathematics, e.g., in the theory of tpartial differential equa- tions of elliptic type (M. A. Lavrent'ev [2]) and especially in the problem of moduli of Riemann surfaces, including the theory of Teichmiiller spaces ( 416 Teichmiiller Spaces). These applications are explained in Sections C and D. B. Definitions The current definitions of quasiconformality, which dispense with continuous differentia- bility, are due to Ahlfors [3], A. Mori [4], and 352C Quasiconformal Mappings L. Bers [5] ( C. B. Morrey [6]). Consider an orientation-preserving topological mapping I of a domain D on the z( = x + iy)-plane. The quasiconformality of I is defined as fol1ows. (1) (the geometric defintion) Let Q be a curvilinear quadrilateral, i.e., a closed Jordan domain with four specified points on the boundary, and let the interior of Q be mapped conformally onto a rectangular domain I. The ratio (1) of the sides of I, cal1ed the modulus of Q and denoted by mod Q, is uniq uely determined. If modI(Q)':;;KmodQ for any curvilinear quadrilateral Q in D, then I is called a K- quasiconformal mapping of D. This is equiva- lent to: (2) (the analytic definition) I is ab- solutely continuous on almost every line seg- ment parallel to the coordinate axes contained in D (this condition is often referred to as ACL in D) and satisfies the inequality Ih-I.:;; K - 1 1 1' I K+1 Jz almost everywhere in D with some constant K  1. When the value of K is irrelevant to the problem considered, K-quasiconformal map- pings are simply said to be quasiconformal. The K -q uasiconformal mapping I satisfies the so-called Beltrami differential equation h- = /1Iz almost everywhere in D with the measurable coefficient /1. The maximal dilatation (1 + 11/11100)/(1-11/11100) does not exceed K. Some- times I is called, for short a /1-conformal map- ping. These notions are also defined for map- pings between tRiemann surfaces, where the (-1, I)-form /1dZdz- 1 is independent of the choice of the local parameter z. If in the above statements I is not neces- sarily topological but merely a continuous function satisfying the same requirements, we cal1 it a J1-conformal function. (If in addi- tion 11/11100 «K -l)/(K + 1), we call it a K- quasiregular function or K-pseudoanalytic function.) A /1-conformal function is repre- sented as the composite go h of an analytic function g with a /1-conformal mapping h. C. Principal Properties and Results The inverse mapping of a K -quasiconformal mapping is also K-quasiconformal. The com- posite mapping 12 0 II of a KI-q uasiconformal mapping II with a Krquasiconformal mapping 12' if it can be defined, is K 1 Kr quasiconformal. A l-quasiconformal mapping is conformal. Every quasiconformal mapping is ttotally differentiable a.e. (almost every- where), its Jacobian is positive a.e., and (IIzI + IhD/(IIzI-I.I)':;; K a.e. Let I be a K-quasiconformal mapping of Izi < 1 onto Iwl < 1. Then I extends to a homeo- 
352 C Quasiconformal Mappings morphism of I z I  1 onto I w I  1. If, further- more, .f(0) = 0, then the Holder condition ( IZI -Z21 ) K - 1.f(zl)- .f(z2)1 161z1 -z21 1 / K holds for IZII  1, IZ21  I, and 16 is the best coefficient obtainable independently of K (Mori). This shows that any family of K- quasiconformal mappings of Izi < I onto Iwl < 1 is tnormal. For further properties and bibliography  O. Lehto and Virtanen [7J and Ahlfors [8]. (a) Boundary Correspondences and Extensions. Ahlfors and Beurling characterized the corre- spondence between Izi = I and Iwl = I induced by.f [9]. What amounts to the same thing, the following theorem holds: Let J1(x) be a real- valued monotone increasing continuous func- tion on R such that limx_:too J1(x) = ::1::00. Then there exists a quasiconformal mapping of the upper half-plane y > 0 onto itself with bound- ary correspondence xc--. J1(x) if and only if 1 J1(x +t) - J1(x) - p P J1(x)-J1(x-t) for some constant p:;:" 1 and for all x, t E R. Theorem of quasiconformal reflection (Ahl- fors [10]). Let L denote a curve which passes through 00 and divides CU {oo} into two domains Q, Q* such that Q U L U Q* = CU {oo}. Then there exists an orientation-reversing quasiconformal mapping of Q onto Q* which keeps every point of L fixed if and only if some constant C exists satisfying 1(3 - (1IM2- (II  C for any three points (I, (2, (3 on L such that (3E62' . (b) Mapping Problem. Given a measurable function J1 in a simply connected domain D with 11J11100 < I, there exists a J1-conformal mapping of D onto a plane domain L1 which is unique up to conformal mappings of L1 [8]. When J1 is real analytic and the derivatives of functions are defined in the usual manner, a classical result concerning the tconformal mapping of surfaces asserts the existence of a solution of Beltrami's differential equation .fz=J1fz. Concerning the dependence of J1-conformal mapping on J1, Ahlfors and Bers [IIJ obtained the following important result: Denote by .f# a J1-conformal mapping of the whole finite plane onto itself that preserves 0 and 1. The space of functions J1 has the structure of a Banach space with Loo -norm, and the space of map- pings .f# also has the structure of a Banach space with respect to a suitable norm. If {J1(t) = J1(z; t)} is a family of J1 depending on the parameter t with 11J1(t)ll", k< I and J1(t) is 1312 continuous (resp. continuously differentiable, real analytic, complex analytic) in t, then .f#(I) is also continuous (continuously differentiable, real analytic, complex analytic). For the proofs of these important results, which have opened up a new way to study tTeichmiiller space, the extension and reflection of quasico1.formal mappings are made essential use oC (c) Extremal Quasiconformal Mappings. Let K(f) denote the maximal dilatation of a quasi- conformal mapping f Suppose that a family g; = {.f} of quasiconformal mappings is given. If some .fo E g; exists such that K (10) attains the infimum of K (f) for all.f E g;, then .fo is called an extremal quasiconformal mapping in g; Let R= {(x,y)IO<x<a,O<y<b}, R' = {(x',y')IO<x' <a',O<y' <b'} be a pair of rectangular domains. Let g; be the family of all quasiconformal mappings of R onto R' which map each vertex to a vertex with (0,0) c--. (0, 0). Then the unique extremal quasi- conformal mapping for g; is the affine map- ping x' = (a'/a)x, y' = (b'/b)y (Grotzsch [IJ). Next suppose that we are given rwo homeo- morphic closed Riemann surfaces R, S and a thomotopy class g; of orientation-preserving homeomorphisms of R onto S. Then g; con- tains a unique extremal quasiconformal map- ping. More precisely, either Rand S are con- formally equivalent to each other or else R admits an essentially unique analyric (2,0)- form <1> such that the respective local co- ordinates z, w of R, S satisfy the differential equation (cw/iJz)/(iJw/iJz) = [(K -1)/(K + 1)J'1>/I<1>1 (1) with some constant K> I everywhere on R, at which <1> # 0 (Teichmiiller [12J, Ahlfors [3J). This turns out to be a generalization of Grotzsch's extremal affine mapping. The ex- tremal mapping.f satisfying equation (I) is sometimes referred to as the TeichmiiIler mapping. Consider again a J1-conformal mapping 9 of the unit disk D: Izi < 1 onto itself which induces a topological automorphi,m of the boundary Izl = 1. If we define g; as the family of all quasiconformal automorphi,ms.f of D satisfying .f(e iO ) =g(e iO ), then the extremal quasiconformal mapping in g; exi:,ts but is not always determined uniquely (K. Strebel [13J). As to the Teichmiiller mapping, the unique- ness theorem is as follows: If the norm 11<1>11 = HDI<1>(z)ldxdy of <1> in (I) is finite, the Teich- miiller mapping is the unique extremal quasi- conformal mapping in g;. Otherw'se, the uniqueness does not hold in general (Strebel [13J). On the other hand, a necessuy and sufficient condition is proved for the Beltrami 
1313 coefficient J1 of a quasiconformal mapping of :F to be extremal (R. S. Hamilton [14], E. Reich and Strebel in [15]). Moreover, this last result can be extended to the extremal quasiconformal mapping between arbitrary Riemann surfaces. D. Applications In the earlier stage of development of this theory, quasiconformal mappings were applied only to the ttype problem of simply connected Riemann surfaces and to the classification of Riemann surfaces of infinite genus ( 367 Riemann Surfaces). This application is based on the fact that it is often possible to find a quasiconformal mapping with the prescribed boundary correspondence even when no equivalent conformal mapping exists and the fact that the classes OG and OnD ( 367 Riemann Surfaces) of Riemann surfaces are invariant under quasiconformal mappings, as they are under conformal mappings. It is worth remarking that the investigation of quasiconformal mappings is intimately connected with the recent development of the theory of tKleinian groups via Teichmli11er spaces. The theory of quasiconformal mappings was also applied by Lavrent'ev [16] and Bers [2] to partial differential equations, particularly to those concerning the behavior of fluids. They utilized the fact that if the density and its reciprocal are bounded in a steady flow of a 2-dimensional tcompressible fluid, then the mapping of the physical plane to the potential plane (the plane on which the values of the tvelocity potential and the tstream function are taken as coordinates) is quasiconformal, and that if in addition the supremum of the tMach number is sma11er than 1, then the mapping from the physical plane to the thodo- graph plane is pseudoanalytic. E. Similar Notions The term quasiconformal was used differently by Lavrent'ev, as fo11ows: A topological map- ping f = u + iv is ca11ed quasiconformal with respect to a certain system of linear partial differential equations when u and v satisfy the system. This is a generalized definition because the system may not be equivalent to a Beltrami equation. However, it is reduced to a q uasiconformal mapping if the system is uniformly e11iptic. Bers used the term psendo- analytic to describe a certain function related to linear partial differential eq uations of ellip- tic type. This function is pseudoanalytic in the sense of Section B on every relatively 352 Ref. Quasiconformal Mappings compact subset and has properties similar to those of analytic functions. Analytic transformations in the theory of functions of several variables are ca11ed pseudoconformal by some mathematicians, and there is a similar term quasi-analytic. The latter is an entirely different notion from the one discussed in this article. F. Generalization to Higher Dimensions Let f be a continuous ACL-mapping of a subdomain G of R" into R" whose Jacobian matrix is denoted by f'(x). Furthermore, the operator norm and the determinant of f' are denoted by II f' II and det f', respectively. Then f is said to be quasiregular if a11 the partial derivatives of fare 10ca11y of class L" on G and if there exists a constant K  1 such that (11f'II(x))" K' detf'(x) almost everywhere in G. The sma11est K 1 for which this inequality is true is called the outer dilatation of f and is denoted by Ko(J). Iff is quasiregular, then the sma11est K  1 for which the inequal- ity detf'(x)K' [minIYIIIf'(x+ y)j"] holds almost everywhere in G is ca11ed the inner dilatation of f and is denoted by K[(J). If max(K[(J), Ko(J))K', then fis said to be K' -quasiregular. An orientation-preserving mapping is ca11ed K-quasiconformal (1. Vai- sala [17]) if it is a K-quasiregular homeomor- phism. When n = 2, these definitions agree with those given in Section B. For n  3 the fo11owing properties also sti11 hold: A quasiregular mapping is discrete, open, tota11y differentiable a.e. and is abso- lutely continuous (0. Martio, S. Rickman, and Vaisala [18]). Quasiconformal extension of higher-dimensional half-spaces have been studied by Ahlfors and L. Carleson [15]. References [1] H. Grotzsch, Uber moglichst konforme Abbildungen von schlichten Bereichen, Ber. Verh. Sachs. Akad. Wiss. Leipzig, 84 (1932), 114-120. [2] M. A. Lavrent'ev, Varational methods for boundary value problems for systems of e11ip- tic equations, Noordhoff, 1963. (Original in Russian, 1962.) [3] L. V. Ahlfors, On quasiconformal map- pings, J. Analyse Math., 3 (1954),1-58,207- 208. [4] A. Mori, On quasi-conformality and pseudo-analyticity, Trans. Amer. Math. Soc., 84 (1957),56-77. [5] L. Bers, On a theorem of Mori and the definition of quasi-con formality, Trans. Amer. Math. Soc., 84 (1957),78-84. 
352 Ref. Quasiconformal Mappings 1314 [6] C. B. Morrey, On the solutions of quasi- linear elliptic partial differential equations, Trans. Amer. Math. Soc., 43 (1938),126-166. [7] O. Lehto and K. I. Virtanen, Quasikon- forme Abbildungen, Springer, 1965; English translation, Quasiconformal mappings in the plane, Springer, second edition, 1973. [8] L. V. Ahlfors, Lectures on quasiconformal mappings, Van Nostrand, 1966. [9] A. Beurling and L. V. Ahlfors, The bound- ary correspondence under quasiconformal mappings, Acta Math., 96 (1956), 125-142. [10] L. V. Ahlfors, Quasiconformal reflections, Acta Math., 109 (1963), 291301. [11] L. V. Ahlfors and L. Bers, Riemann's mapping theorem for variable metrics, Ann. Math., (2) 72 (1960), 385-404. [12] O. Teichmiiller, Extremale quasikon- forme Abbildungen und quadratische Differen- tiale, Abh. Preuss. Akad. Wiss. Math.-Nat. KI., 22 (1939),1-197. [13] K. Strebel, Zur Frage der Eindeutigkeit extremaler quasikonformer Abbildungen des Einheitskreises, Comment. Math. Helv., 36 (1962), 306323; 39 (1964), 77-89. [14] R. S. Hamilton, Extremal quasiconformal mappings with prescribed boundary values, Trans. Amer. Math. Soc., 138 (1969), 399-406. [15] L. V. Ahlfors et aI., Contributions to analysis, Academic Press, 1974. [16] L. Bers, Mathematical aspects of subsonic and transonic gas dynamics, Wiley, 1958. [17] 1. Viiisiilii, Lectures on n-dimensional quasiconformal mappings, Lecture notes in math. 229, Springer, 1971. [18] O. Martio, S. Rickman. and J. Viiisiilii, Definitions for quasiregular mappings, Ann. Acad. Sci. Fenn., 448 (1969),1-40. [19] L. Bers, A new proof of a fundamental inequality for quasiconformal mappings, J. Analyse Math., 36 (1979), 15-30. 
353 A Racah Algebra 353 (XX.25) Racah Algebra A. General Remarks Racah algebra is a systematic method of cal- culating the tmatrix element (1jI, AIjI') in tquan- tum mechanics, where A is a dynamical quan- tity and IjI and 1jI' are irreducible components of the state obtained by combining n tangular momenta. The angular momentum j has X-, yo, z-componentsjx,jy,jn respectively. Each component is i times the infinitesimal rotation around the respective axis and is the generator of the infinitesimal rotation for every irreduc- ible component 1jI. The addition of two angu- lar momenta leads to a ttensor representation D(jrJ <8J D(j2) of two tirreducible representa- tions of the 3-dimensional rotation group. The problem is to decompose this tensor rep- resentation into the direct sum of irreducible representations. B. Irreducible Representations of the Three- Dimensional Rotation Group Irreducible representations of the group SO(3) of 3-dimensional rotations can be obtained from irreducible representations D(j) (j = 0, 1,2, '" ) of its tuniversal covering group SU (2) of 2 x 2 matrices with determinant 1, through the 2-fold covering isomorphism SO(3)  SU (2)/{::!::l} ( 60 Classical Groups I). The representation D(j) (j = 0,1/2,1,3/2,...) of SU(2) is the 2j-fold tensor product A <8J... <8J A of A E S U (2) restricted to the totally symmetric part of the 2j-fold tensor product space. Let u=(b) and v=(?) be a basis for the complex 2- dimensional space on which SU(2) operates. The symmetrized tensor product of(;+m)-fold u and (i-m)-fold v multiplied by a positive normalization constant (m = j,j - 1, ..., - j) defines an orthonormal basis of the represen- tation space of D(j), which we shall denote by ljI(jm). Decomposition of the tensor prod uct of two irreducible representations D(j rJ and D(h) into irreducible components leads to D(jr)<8J D(j2)= I.D(j), ;=j1+j2, j1+h- l ,,,,,U1-j21. F or the basis we can write !jJ(;m) = I !jJ(jl m 1)1jI(j2 m 2)(j1 m 1j2 m 2U1j2jm), m 1 ,m 2 and the coefficients are called the Clebsch- Gordan coefficients or Wigner coefficients. The vectors ljI(jm) in each irreducible represen- 1316 tat ion space are determined only up to an overall phase factor (a complex number of modulus 1). By a suitable choice of the result- ing arbitrary phase (which may depend on jj.j2';)' the coefficients are given by j1 +j2:;:,j:;:,U1-hl, (j1m1j2m2Ilrhjm)=b(m1 +m 2 ,m) (2j+ 1)(j, +h -j)!(j+j1 -j2)!(j+h -jr)! x (j1+j2+j+l)! x I. ( ( _lr J(j1 +m 1 )!(j1 -mrJ!(j! +m2)! v V !(j1 +j2-j-V)!(jl- m 1- v )! J (j2- m 2)!(j+m)!(j-m)I ) x . (j2+m2-v)!(j-h+m1 +V)!(j-j1- m 2+ v )! They satisfy torthogonality relations. Another concrete expression for the same coefficients, but of a different appearance, was obtained earlier by Wigner. Wigner introduced the 3j- symbol, given by ( jlhh ) =( -1)j,-j2-m3(2j3+ 1)-1/2 m 1 m 2 m 3 X(j1m1j2m2U1j2j3 -m 3 ) for m 1 + m 2 + m 3 = 0 and zero otherwise. This is invariant under cyclic permutations of 1, 2, 3 and is multiplied by ( -1)11 + j,+ h under trans- positions of indices as well as und<:r the simul- taneous sign change of alI the m's. The 3j- symbol multiplied by (_I)h+h-j, is the V- coefficient of Racah. There are two ways, (D(j 1) <8J D(j2)) <8J D(h) and D(jrJ <8J (D(j2) <8J D(h)), to reduce the tensor prod uct of three irred ucible represen ta- tions, and two corresponding sets of basis vectors. The transformation coefficient for the two ways of reduction is written in the form (j1 h(j12)h;; I j1' hj3(j 23);j) = J (2jI2+ 1)(2j23 + 1) W(j1hj h;jd23)' Here W(abed; ef), called the Racah coefficient, can be written as the sum of products of four Wigner coefficients. W has the following sym- metry properties: W( abed; ef) = W( bade; ef) = W(edab; ef) = W(aebd;Je) = (_1)e+ f-H W(ebef; ad) =( _1)e+ f -h- C W(aefd; he) and satisfies an orthogonality relation. The 6j- symbol {m is related to the Racah coefficient by { abe } W(abde; ef) = (_1)a+h+c+d cd! . 
1317 C. Irreducible Tensors A dynamical quantity Tqk (q=k,k-l, ..., -k) that transforms in the same way as the basis of D(k) under rotation of coordinates is cal1ed an irreducible tenso r of rank k. That is, it satisfies [jx::!::ijy, Tqk]= J (k+q)(k::H+ 1) Tq\I' [j" Tqk] = qTqk. Here [a, b] = ab - ba. The matrix element of this quantity between two irreducible compo- nents can be written in the form (ajml Tqk I a'j'm') = (I/ J2j + 1 )(ajll T(k)lla'j')(j'm'kql j'kjm), where a is a parameter to distinguish multiple components with the same j, and components of different a are assumed to be orthogonal. In this formula the Clebsch-Gordan coeffi- cients are determined from group theory, while (aj II T(k) II a'j') depends on the dynamics of the system. When T(k) and U(k) operate only on the state vectors in the subs paces HI and Hz, respec- tively, of the total space (tHi1bert space) H = HI X Hz, their scalar product (T(k J , U(k») = :Eq( -1)q Tqk U; has the matrix element (a 1 a z j1hjml(T(k), U(k») I a'l azj'rfzjm) =( -1)h +h-jWUljz/lj; jk) x (a 1 jlll T(k)lla'rfd(a z j211 U(k)llazj). For an irreducible component of the tensor prod uct of two irred ucible tensors, [T(k , J @ U(k 2 )]: = L Tq2U:(klqlkzqzlklkzkq), qt +q2=q the matrix can be written as (aj dzjll[T(k , ) @ U(k2)]<k)llaj' ljj') = J (2k + 1 )(2j + 1 )(2j' + 1) xL (ajlll T(k')11a''j'd(a'Jz II u(k2)llaJ) ." t .1 jz j } Xj'ljj'. k 1 k z k The last factor, the 9j-symbol, is defined as the matrix element between basis vectors of [DUd x D(h)] x [D(h) x D(4)] and [D(1) x D(h)] x [D(h) x D(4)]: (j 1 hUI2)hj4U34)jm I j1hUI3)jzj4UZ4)j m) = J (2jl2 + 1)(2j34 + 1)(2jl3 + 1)(2jz4 + 1) j12 } 34 . { jl j2 X h j4 j13 jZ4 354 B Random Numbers The 9j-symbol can be written as a weighted sum of the products of the three W's. See [6] and [7] for explicit formulas of Clebsch-Gordan coefficients and [8] for Racah coefficients. References [1] E. P. Wigner, Group theory and its appli- cation to the quantum mechanics of atomic spectra, Academic Press, 1959. [2] M. E. Rose, Elementary theory of angular momentum, Wiley, 1957. [3] U. Fano and G. Racah, Irreducible ten- sorial sets, Academic Press, 1959. [4] A. R. Edmonds, Angular momentum in quantum mechanics, Princeton Univ. Press, 1959. [5] F. Bloch, S. G. Cohen, A. De-Shalit, S. Sambutsky, and 1. Talmi, Spectroscopic and group-theoretical methods in physics (Racah memorial volume), North-Hol1and, 1968. [6] E. U. Condon and G. H. Short1ey, Theory of atomic spectra, Cambridge Univ. Press, 1935 (reprinted with corrections, 1951). [7] M. Morita, R. Morita, T. Tsukada, and M. Yamada, Clebsch-Gordan coefficients for jz = 5/2,3, and 7/2, Prog. Theoret. Phys., Suppl., 26 (1963),64-74. [8] L. C. Biedenharn, 1. M. Blatt, and M. E. Rose, Some properties of the Racah and as- sociated coefficients, Rev. Mod. Phys., 24 (1952), 249-257. 354 (XVI.5) Random Numbers A. General Remarks A sequence of numbers that can be regarded as realizations of independent and identi- cal1y distributed trandom variables is cal1ed a sequence or table ofrandom numbers. It is a basic tool for the tMonte Carlo method, tsimulation of stochastic phenomena in nature or in society, and tsampling or trandomiza- tion techniques in statistics. Random numbers used in practice are pseudorandom numbers ( Section B); theoretical1y, the definition of random numbers leads to an algorithmic approach to the foundations of probability [1,2]. B. Pseudorandom Numbers Tables of numbers generated by random mechanisms have been statistically tested and 
354 C Random Numbers published. To generate random numbers on a large scale, electronic devices based on sto- chastic physical phenomena, such as thermo- electron noise or radioactivity, can be used. For digital computers, however, numbers generated by certain simple algorithms can be viewed practically as a sequence of random numbers; this is called a sequence of pseudo- random numbers. Distribution of random numbers that are easily generated and suitable for general use is the continuous uniform distribution on the interval (0, 1), which is approximated by the discrete distribution on {O, 1, ...,N -I} (N)> 1). Random numbers with distribution function F(') are obtained by transforming uniform distributions by F- 1 (-). For typical distribu- tions, computation tricks avoiding the direct computation of F- 1 ( .) have been devised. Among them the use of torder statistics and acceptance-rejection techniques have wide applicability. F or the generation of uniform pseudoran- dom numbers on {O, 1, ..., N -I}, N =n S (n=a computer word length), the following algo- rithms are used. Each of them is written in terms of simple computer instructions. (1) The middle-square method was proposed by von Neumann. We square an integer of s digits of radix (or base) n and take out the middle s digits as the next term. We repeat this process and obtain a sequence of pseudorandom num- bers. The sequence thus generated might be cyclic with a short period, possibly after many repetitions. The lengths from initial values to the terminal cycles are empirically checked. (2) The tFibonacci sequence {un} defined by Uk+1 == U k - 1 + Uk (mod n S ) is apparently regular, but it is uniformly distributed. (3) The con- gruence method [3]: Define a sequence by Uk+1 == aUk + C (mod n S ) or (mod n S ::!:: 1). If c = 0, the procedure is called the multiplicative con- gruence method, otherwise the mixed con- gruence method. The cycle, that is, the mini- mum k such that Uk = U o , and the constants a, c, and U o that make the cycle maximum for given nand s are determined by number theory. The points (Uk!. U kl +!' ..., U kl + l - 1 ), k = 0,1,2, "', lie on a small number of parallel hyperplanes in the [-dimensional cube. Good choice of the constant a makes the sequence quite satisfactory. (4) H. Weyl considered sequences f(k) = klY. (mod 1), where IY. is an irrational number and k = 1, 2, ... , whose values are uniformly distributed on the inter- val (0, 1). They are not independent, though they can be used for some special purposes. A modified sequence X k =k 2 1Y. (mod 1) is known to be random for any irrationallY. in the sense that the tserial correlation N- 1 Lf1 XkXk+/- 1/4 converges to ° uniformly in [ as N --> 00. 1318 C. Statistical Tests To check uniform random numbers on (0, 1) the following tests are used: (1) Di\ide (0, 1) into subintervals; then the frequency of ran- dom numbers falling into these is a multi- nomial sample. tGoodness of fit can be tested by the tchi-square test; independence can be tested by observing the frequency of tran- sitions of subintervals in which a pair of con- secutive numbers falls, as well as by observing the overall properties, such as uniformity of the frequency of patterns of subintervals in which a set of random numbers falls. (2) For a set of random numbers, the distance of the empirical distribution function from that of the theoretical one is tested by the tKolmogorov- Smirnov test. (3) Observe the rank orders of a set of random numbers, and test the random- ness of their permutations (test the number of runs up and down). D. Kolmogorov-Chaitin Complexi1y and Finite Random Sequences As Shannon's entropy is a quantity for mea- suring the randomness of random variables, the Kolmogorov-Chaitin complexity [4, 5J is that of individual objects based on logic instead of probability. For constructive ob- jects XEX, YE Yand a partial recursive func- tion A: Y x {1, 2, ...} -->X, define K ( I { min(lOg2 n I A(y, n) = x), A X y)= . 00 (If A(y,n)=x for no n). The function A is said to be asymptotically optimal if for any B there exists a constant C such that KA(xly)KB(xly)+C for any XEX and YE Y. For an asymptotically op- timal A, which is known to exist, KA(xly) is simply denoted by K(xly) and is called the Kolmogorov-Chaitin complexity of x given y. P. Martin-Lof [6J discussed a relation be- tween complexity and randomness Consider any statistical test for the randomness on the set of (say) finite decimal sequences which is effective in the sense that it has a finite algo- rithm. Then there exists a constant C indepen- dent of Land M such that K( 1,''', LI L):;:' Llog 2 1O- M implies the acceptance of the decimal sequence  1, ..., L by the test at the level 1 -. r M - C . This condition on the complexity is satisfied by at least (1 - r M ) IO L sequences among the decimal sequences of length L. E. Collective and Infinite Random Sequences For finite sequences, the notion of randomness is obscure by nature. For infinite sequences, 
1319 however, clearer definition is possible. Based on the notion of co11ectives by R. von Mises, a definition of infinite random sequences has been given by A. Church [7]. A selection func- tion is a {O, 1 }-valued function on the set of (say) finite decimal sequences such that {n, qJ(I'2, ""n-I)=I} is an infinite set for any infinite sequence I> 2'.... For a selection function qJ and an infinite seq uence  I>  2, ... , the qJ-subsequence is defined as n , n ,..., where {nl <n2<'" }={n,qJ(I' ":'nd= I}. For a class I/J of selection functions, an infinite decimal sequence is ca11ed a I/J-collective if each of the numbers 0, 1, ...,9 appears in it with a limiting relative frequency of 1/10, and the same thing hOlds for any qJ-subsequence with qJEI/J. By definition, a random sequence is a I/J- co11ective for the class I/J of recursive selection functions. Almost a11 real numbers are random in their decimal expansions. F. Normal Numbers Let x- [xJ =:I;xnr- n be the r-adic expansion of the fractional part of a real number x. For any ordered set Bk = (bl> "', b k ) of numbers 0, 1, ... ,r - 1, let Nn(x, B k ) be the number of occurrences of the block Bk in the seq uence XI'''' ,x n . If Nn(x, Bk)/n->r- k as n-> 00 for every k and every Bb then X is said to be normal to base r. Almost a11 real numbers are normal to any r. D. G. Champernowne [8J constructed a normal number given by the decimal expan- sion 0.1, 2, 3,4,5,6,7,8,9,10,11,12,13,.... No one has so far been able to prove or dis- prove the normality of such irrational num- bers as n, e, .ji, j3, .... W. Schmidt [9J proved that the normality to base r implies the normality to base p if and only if1ogrjlogp is rationa1. A real number whose decimal expan- sion is random in the above sense is normal to base 10. For the converse, a necessary and sufficient condition for a selection function qJ (for which qJ(  I' ... , J depends only on L) to have the property that the normality implies the {qJ }-co11ectiveness has been obtained in [10]. References [IJ D. E. Knuth, The art of computer pro- gramming II, second edition, Addison-Wesley, 1981, ch. 3. [2J E. R. Sowey, A second classified bibli- ography on random number generation and testing, Int. Statist. Rev., 46 (1978), 89-102. [3J D. H. Lehmer, Mathematical methods in large-scale computing units, Proc. 2nd Symp. on Large-Scale Digital Calculating Machinery, Harvard Univ. Press, 1951, 141-146. 355 A Real Numbers [4J A. N. Kolmogrov, Logical basis for infor- mation theory and probability theory, IEEE Trans. Information Theory, IT-14 (1968), 662-664. [5J G. J. Chaitin, Algorithmic information theory, IBM 1. Res. Develop., 21 (1977), 350- 359. [6J P. Martin-U;f, The definition of random sequences, Information and Control, 9 (1966), 602-619. [7J A. Church, On the concept of a random sequence, Bul1. Amer. Math. Soc., 47 (1940), 130-135. [8J L. Kuipers and H. Niederreiter, Uniform distribution of sequences, Wi1ey-Interscience, 1974. [9J W. Schmidt, On normal numbers, Pacific J. Math., 10 (1960),661-672. [1OJ T. Kamae, Subsequences of normal se- quences, Israel J. Math., 16 (1973), 121-149. 355 (11.10) Real Numbers A. Axioms for the Real Numbers The set R of a11 real numbers has the fo11owing properties: (1) Arithmetical properties: (i) For each pair of numbers x, yER, there exists one and only one number wER, ca11ed their sum and de- noted by X + y, for which x + y = y + x (com- mutative law) and (x+ y)+z=x+(y+z)' (associative law) hold. Furthermore, there exists a unique number 0 (zero) such that x+O=x for every xER (existence o[tzero element). Also, for each x, there exists one and only one number -xER for which x +( -x) =0. (ii) For each pair of numbers x, YER, there exists one and only one number wER, ca11ed their product and denoted by xy, for which xy= yx (commutative law), (xy)z = x(yz) (associative law), and (x + y)z = xz + yz (distributive law) hold. Furthermore, there exists a unique number 1 (unitY)ER such that lx=x for every xER (existence of tunity element). Also, for each x #0 (xER) there exists one and only one number X- 1 ER for which XX-I = 1. Owing to properties (i) and (ii), a11 tfour arithmetic operations obey the usuallaws (with the single exception of division by zero); in other words, R is a tfield. (2) Order properties: (i) For each x, y E R, one and only one of the fo11owing three rela- tions hOlds: x<y, x=y, or x> y. With x':;;y meaning x < y or x = y, the relation':;; obeys the transitive law: x':;;y and y':;;z imply x':;;z, which makes R ttota11y ordered. (ii) Order and 
355B Real Numbers arithmetical properties are related by: x:( y implies.x +z:(y+z for any zER, and x:(y and 0:( z imply xz:( yz; in other words, R is an tordered field. In particular, x E R with x> 0 is called a positive number, and x E R with x < 0 a negative number. We write Ixl = x if xO and Ixl = - x if x < 0, and calli x I the absolute value of x. (3) Continuity property: If nonempty subsets A and B of R, with a < b for each pair aE A and bEB, satisfy R= A UB and A nB= 0 (empty set), then the pair (A, B) of sets is called a cut of R. For each cut (A, B) of R, there exists a number XER (necessarily unique) such that for every a E A, a:( x, and for every bE B, b x (i.e, x= supA =inf B). This property of R is called Dedekind's axiom of continuity (  294 Numbers). The set R of all real numbers is determined uniquely up to an isomorphism, with respect to arithmetic operations and ordering, by properties (I )-(3). R forms an additive Abelian group; its subgroup {O, ::!:: 1, ::!:: 2, "', ::!:: n, ...} generated by 1 can be identified with the group Z of integers. The subset of all positive integers {I, 2, ..., n, .., } may be identified with the set N of all natural numbers. The subset {m/n I m, nE Z, n =1= O} of R forms the subfield of R generated by I. It can be identified with the field Q of all rational numbers. A real number that is not rational is called an irrational number. B. Properties of Real Numbers (I) For each pair of positive numbers a and h > a, there exists a natural number n with a < nh (Archimedes' axiom). (2) For each pair of positive real numbers a and b with a < b, there exists a rational num- ber x such that a < x < b (denseness of rational numbers). (3) For any subset A in R tbounded from above (below), the tleast upper bound of A: a =supA (tgreatest lower bound of A:b=infA) exists. Let {un} be a tsequence of real numbers. Assume that for each arbitrary positive num- ber F. there exists a number no such that Ian - bl < 8 for all n> no. Then we write limna: an = b (or an-->b) and call b the limit of {an}' We also say that {an} is a convergent sequence or that an converges to b. (4) If for two sequences {an}, {h n }, we have a I :( a2 :( '" :( an :( ... :( b n :(. .:( b 2 :( b l and lim(b n - an) = 0, then there exists one and only One number C E R with \im an = \im b n = C (principle of nested intervals) 1320 (5) Let {an} be a sequence of real numbers. If for any arbitrary positive 8 there exists a natural number no satisfying la m -u n l<8 for all m, n> no, then {an} is called a fundamental sequence or Cauchy sequence. Any funda- mental sequence of real numbers is convergent (completeness of real nnmbers). For a set with properties I and 2 of Section A, it can be proved that property 3 of Section A is equivalent to property 3, or properties I and 4, or properties I and 5 of this section. C. Intervals For two numbers a, bER with a<b, we write (a,b)= {xla<x<b}, (a,b]={xla<x:(b], [a, h) = {x I a :( x < b], [a, b] = {x I a:( x :( b}, and call them (finite) intervals, of which a and b are their left and right endpoints, respectively. Specifically, (a, b) is called an open interval and [a, b] a closed interval. The symbol, oc and -00 are introduced as satisfying cr., > x, x> -00,00> -00 for all XER. Writing +00 for oc, we call +00 and -00 positive infinity and negative infinity, respectively. To e),tend the concept of intervals, we define (-CXJ, b) = {x I x <b, xER}, (-00, b] = {xl x :(b, xER}, (a, (0)= {xl a <x, xER}, [a, (0)= {x I a :(x, xER}, and (-00, oo)=R, and call them infinite intervals. Let {an} be a sequence of real numbers. If for each infinite interval (a, (0) « -rXJ,a)) there exists a number no such that an E(a. (0) (an E (-00, a)) for all n> no, then we wri1 e a n --> +00 (a n --> -(0) and call 00 (-00) the limit of an, denoted as before by lima n . D. Topology of R With the collection of all its open intervals (a, b) as an topen base, R is a ttopological space (torder topology) that satisfies the tsep- aration axioms T2' T3, T4' In R every (finite or infinite) interval (including R itself) is tconnec- ted, and the set Q of rational numbers is dense. A necessary and sufficient condition for a subset F of R to be tcompact is that F be bounded and closed (Weierstrass's theorem). In particular, any finite closed interval is com- pact. R is a locally compact space satisfying the second tcountability axiom. Further, any (finite or infinite) open interval is homeomor- phic to R. The topology of R may also be 
1321 defined by the notion of convergence ( 87 Convergence). Arithmetic operations in R are an con- tinuous: If an-->a and bn-->b, then an+bn-->a+b, an-bn-->a-b, anbn-->ab, and an/bn-->a/b (where b =/00, b n =/0O). Hence R is a ttopological field (regarding the characterization of R as a topo- logical group or a topological field  422 Topological Abelian Groups). R as a topological Abelian group (with respect to addition) is isomorphic to the top- ological Abelian group R + of an positive real numbers with respect to multiplication. To be precise, there exist topological mappings f: R --> R + with f(x + y) = f(x)f(y) and g: R + --> R with g(xy)=g(x)+g(y). If f(1)=a, g(b) = 1, then f, g are uniquely determined and are written f(x) = aX, g(x)=log b x. Regarding R as a topological Abelian group (with respect to addition), any proper closed subgroup I of R is discrete and isomorphic to the additive group Z of integers. That is, for some B>O we have 1= {nBI nEZ}. In parti- cular, the quotient group R/Z as a topological group is isomorphic to the rotation group of a circle (I-dimensional ttorus group). Elements of R/Z are caned real numbers mod 1. E. The Real Line Let I be a Euclidean straight line considered to lie horizontany, say from left to right. Let Po, PI be two distinct points on I, with Po situated to the left of PI' Then there exists one and only one bijection q> from the set L of an points of I to R satisfying (i) q>(po) = 0, q>(pr) = 1; (ii) if p lies to the left of q, then q>(p) < q>(q); and (iii) for two line segments pq and p' q' (where p and p' are to the left of q and q', respectively), pq=.p'q' (pq and p'q' are tcongruent)=-q>(q)- q>(p) = q>(q') - q>(p'). Then q>(p) is caned the coordinate of the point p, and (Po,pr) the frame of the line l. A Euclidean straight line with a fixed frame is caned a real line (identified with R by the mapping q» and is usuany denoted by the same notation R or R I. References [1] N. Bourbaki, Elements de mathematique III. Topologie generale, ch. 4. Nombres reels, Actualites Sci. Ind., 1143c, Hermann, third edition, 1960; English translation, General topology pt. 1, Addison-Wesley, 1966. [2] G. Cantor, Gesammelte Abhandlungen, Springer, 1932. 356 A Recursive Functions [3] R. Dedekind, Gesammelte mathematische Werke I-III, Braunschweig, 1930-1932. [4] J. Dieudonne, Foundations of modern analysis, Academic Press, 1960, enlarged and corrected printing, 1969. [5] K. Weierstrass, Gesammelte Abhandlun- gen 1 - 7, Mayer & Miler, Akademische V er- lag., 1894-1927. Also  references to 381 Sets. 356 (1.9) Recursive Functions A. General Remarks A function whose tdomain and trange are both the set of natural numbers {O, 1,2,...} is caned a number-theoretic function. In this article, the term natural number is used to mean a non- negative integer. Hilbert (1926) and K. G6del [1] considered certain number-theoretic func- tions, ca11ed recursive functions by them and now ca11ed primitive recursive functions after S. C. Kleene [2] (the definition is given in Section B). G5del introduced an efficient method of arithmetizing metamathematics based on representing certain finitary procedures in metamathematics by primitive recursive func- tions. Then the fonowing problem natura11y arises: How sha11 we define a finitary method? In other words, how shan we characterize a number-theoretic function that is effectively computable, or provided with an algorithm of computation? G5del defined the notion of general recursive function by introducing a formal system for the elementary calculation of functions, fonowing the suggestion given by J. Herbrand. Kleene later improved G5del's definition and developed the theory of gen- eral recursive functions [2]. Furthermore, A. Church and Kleene defined A-calculable func- tions using the A-notation (Church [4]), and E. L. Post and A. M. Turing defined the notion of computable functions by introducing the concept of Turing machines. These notions, introduced independently and almost simulta- neously, were found to be equivalent. Hence such functions are now simply caned recursive functions. Here, instead of giving the definition of recursive functions in the original style (the Herbrand-G6del-Kleene definition), we give it by utilizing the idea of introducing schemata, a natural extension of the notion of primitive recursive functions. We employ the letters x, y, Z, xt. X2,'" for variables ranging over the natural numbers. 
356 B Recursive Functions B. Primitive Recursive Functions Consider the fonowing five definition schemata: (I) q)(X) = X' (=x+I), (II) q)(xj,...,xn)=q (q a given natural number), (III) q)(x), ""xn)=x i (I "':i",:n), (IV) q)(X1"" ,xn)=t/J(xd x ), ...,x n ), ..., Xm(xj, ..., x n )), (V) q)(O, x 2 , ..., x n ) = t/J(x 2 , ..., x n ), <pry', X2'"'' X n ) = X(y, <pry, X 2 ,..., X n ), X2, '" ,X n ), where V/( ) is a constant natural number if n = I. A function is caned primitive recursive if it is definable by a finite series of applications of the operations (IV) and (V) (t/J, x, XI' .,. , Xm are already-introduced functions) starting from functions each of which is given by (I), (II), or (III). Given the functions t/J l' . " , t/J I' we define the relativization (with respect to t/J I' ... , t/JI) of the definition of primitive recursive functions as fonows: A function is called primitive recur- sive in t/J) , . .. , t/JI if it is definable by a finite series of applications of (IV) and (V) starting from t/J l' ... , t/JI and from functions each of which is given by (I), (II), or (III). We say that a function <p(x j, ... , x n ) is the representing function of a tpredicate P(x 1 , ..., x n ) if <p takes only 0 and I as values and satisfies P(Xj, ..., x n )=<p(x 1 , "', x n ) =0. Then we call P a primitive recursive predicate if its representing function <p is primitive recur- sive. The following functions and predicates are examples of primitive recursive ones: a + b, a.b, a h , a!, min(a,b), max(a,b), la-bl, a=b, a<b, alb (a divides b), Pr(a) (a is a prime number), Pi (the (i + 1 )st prime number, Po = 2, PI = 3,...), (a)i (the exponent of Pi of the unique factorization of a into prime numbers if a #0; otherwise, 0). Whenever we are given a concept or a theorem, we always transform it by replacing the predicates contained in it (if any) by corre- sponding representing functions. Then an operation 0 is caned primitive recursive if the function or the predicate O(t/J) , ... , t/J/o Q 1, ... , Qm) that results from the application of 0 to functions t/J), .., , t/JI and predicates Q1, ..., Qm (I, m:;:'O, I +m>O) is primitive recursive in t/Jj, ...,Qm' Put <p(x 1 , ...,xn,z) = L yq tjJ(x 1 ,..., xn,y). Then <p is primitive recursive in t/J, and the finite sum Ly<z is a primitive recursive operation in this sense. 1322 Similarly, the following operations are primi- tive recursive: the finite product H y < zo the logical connectives I, V, /\, --> (. 411 Symbolic Logic), definitions by cases, the bounded quantifiers 3y y <z' T:lyy<z, and the bounded tl-operator tlYy<z defined as follows: tlYy<zR(x,y) is the least y such that y<z and R(x, y) holds, if there exists such a number y; otherwise, it equals z. The following operation is also primitive recursive: <p(y,x 2 , ...,x n ) =X(y,<)j(y;x 2 , ...,x n ),x 2 , ...,x n ), where ; ii (y ' x X ) - TI P ",(i,X2' 'Xn) 't' , 2, ..., n - i i<y A function <p is said to be primitive recursive uniformly in t/J 1, ... ,t/JI when <p is definable by applying a primitive recursive operation to t/J j, ''', t/JI' Almost an results mentioned in this section were given by G6del [1]. There are further investigations on primitive recursive functions by R. Peter (1934), R. Robinson (1947), and others. Note that a function definable by a tdouble recursion is not necessarily primitive recursive. Peter (1935, 1936) invesrigated in detail functions that are definable, in general, by k-fold recursions for every positive interger k [8]. C. General Recursive Functions The following tl-operator is used to define general recursive functions by extf.nding primi- tive recursive functions. For a predicate R(y) on the natural numbers, tlyR(y) is the least y such that R(y), if 3yR(y); otherwise, tlyR(y) is undefined. Generally, tly(t/J(xj, ...,xn,y)=O) is not necessarily defined for each n-tuple (XI"'" x n ) of natural numbers. Now, a func- tion is caned a general recursive function (or simply recursive function) if it is dffinable by a series of applications of schemata including a new schema (VI) <p(x 1 ,..., x n ) = tly(t/J(xj, ..., X n , y) =0) for the definition of <p from any function t/J that satisfies T:lx 1 ... T:lx n 3y(t/J(x 1 , ...,xn,y)=O), in addition to those used to define the primi- tive recursive functions. Thus, by definition, a primitive recursive function is general recur- sive. A general recursive predicate is a predicate such that its representing function is general recursive. The facts, including the ones con- cerning relativization, that are valid for pri- 
1323 mitive recursive functions are also valid for general recursive functions. Kleene's Normal Form Theorem. For each n, we can construct a primitive recursive predi- cate T,,(z, XI' "', x., y) and a primitive recursive function U(y) such that given any general recursive function cp(x I' "', x n ), a natural number e can be found such that VX I ... Vx n 3yT,,(e,xj, ...,xn,y), (1) cp(x l , ...,x n )= U(!1yT n (e,x l , ...,x.,y)). (2) Any natural number e for which (1) and (2) hold is said to define cp recursively or to be a Godel number of a recursive function cp. Let 1/11,...,1/1/ (abbreviated '1') be any given func- tions. We can relativize Kleene's normal form theorem with respect to them as follows: For each n, we can construct a predicate T,,'I'(z,xr,... ,X n , y) that is primitive recursive in 'I' such that given any function cp that is gen- eral recursive in '1', a natural number e can be found such that Vx l ... Vx n 3yT,,'I'(e,x 1 , ...,xn,Y), (3) cp(x I , ..., x n ) = U(!1yT n 'l'(e, XI' ... , X n , y)), (4) where U(y) is the primitive recursive function mentioned in Kleene's normal form theorem. A natural number e for which (3) and (4) hold is said to define cp recursively in 'I' or to be a Godel number of cp from '1'. In particular, a G5del number e of cp from 'I' can be found independently of 'I' (except for I and the re- spective numbers of variables of 1/1 j, ... ,1/1,) when cp is general recursive uniformly in '1'. Now let S be a tformal system containing ordinary number theory. A number-theoretic predicate P(x I' ..., x n ) is said to be decidable within S ifthere is a formula P(ar, ... , an) (with no tfree variables other than the distinct vari- ables aI' ..., an) of S such that for each n-tuple (XI"'" x n ) of natural numbers (the symbol f- means provable in S), (i) f-P(XI,...,xn) or f-iP(xr,...,x n ) and (ii) P(x 1 ,..., x n )= f--- P(xr,..., x n ), where XI' ..., X n designate the numerals corre- sponding to X I' ... ,X n in S. If S is a consistent system such that primitive recursive predicates are decidable within S and the predicates PIA (for any formula A, P£(xr, ...,xn,y) means that y is the G5del number of a proof of A(x l , "', x n )) are primitive recursive, then a necessary and sufficient condition for P to be decidable within S is that P is a general recur- sive predicate (A. Mostowski, 1947). Church (1936) proposed the following state- 356D Recursive Functions ment: Every effectively calculable function is a general recursive function. The converse of this is evidently true by the definition of recursive- ness. So Church's thesis and its converse pro- vide the exact definition of the notion of effec- tively computable functions. Though this notion is somewhat vague and intuitive, the definition seems to be satisfactory, as men- tioned at the beginning of this article. There- fore, any function with a computation proce- dure or algorithm can be assumed to be gen- eral recursive. Utilizing this, various decision problems have been negatively solved ( 97 Decision Problem). Furthe,rmore, traditional descriptive set theory can be reinvestigated from this point of view, and the concept of effectiveness used in tsemi-intuitionism is clarified using general recursive functions ( 22 Analytic Sets). D. Recursive Enumerability A set {cp(O), cp(I), cp(2), ... } enumerated by a general recursive function cp (allowing repe- titions) is called a recursively enumerable set. The empty set is also considered recursively enumerable. It is known that in this definition "general recursive" can be replaced by "primi- tive recursive" (J. B. Rosser, 1936). A set E of natural numbers is recursively enumerable if and only if there is a primitive recursive pre- dicate R(x,y) such that xEE=3yR(x,y) (Kleene [2]). Generally, a predicate E(x I' ... ,x m ) is called a recursively enumerable predicate if there is a general recursive predicate R(XI'"'' X m , Y1, ... ,Yn) such that E(xj,''', X m )=3YI ... 3Yn R (xl>'''' Xm'YI> '" ,Yn)' (Here "general recursive" can be replaced by "primitive recursive.") We call a set E a recursive set if the predi- cate X E E is general recursive. The set C = {xl 3yT 1 (x, x,y)} is an example ofa set that is recursively enumerable but not recursive, and it has the following remarkable property: For every recursively enumerable set E, there is a primitive recursive function such that xEE=cp(X)EC. In this sense, the set C is said to be complete for the class of recursively enumerable sets. Post's problem, which asked whether the sets that are recursively enumer- able but not recursive have the same tdegree of (recursive) unsolvability as that of C, Was negatively solved simultaneously by R. M. Friedberg (1957) and A. A. Muchnik (1956- 1958). A recursively enumerable set E is gen- eral recursive if and only if there is a gen- eral recursive predicate R(x, y) such that xE=3yR(x,y) (Kleene [5], Post [6]). 
356 E Recursive Functions E. Partial Recursive Functions A function (p(x I' ..., x n ) is called a partial func- tion if it is not necessarily defined for all n- tuples (x I, ..., x n ) of natural numbers. For two partial functions tf;(xt...., x n ) and X(x I ,..., x n ), tf;(x I' ... , x n ) "" X(x I' ... , x n ) means that if either tf;(x l , ...,x n ) or X(x 1 , ...,x n ) is defined for x I' ... , X n , so is the other, and the values are the same. For any given natural number e, (p(x I , ...,x n )"" U(l1yT,,(e,x I , ...,xn,y)) (or "" U({/yT,:"(e, XI,,,,, X n , y))) is a partial func- tion, in general. We say that such a function is partial recursive (partial recursive (uniformly) in 'P) and that a natural number e defines <p recursively ((uniformly) in 'P) or is a Godel number of a partial recursive function <p (from 'P). When a natural number e is a Gi.idel num- ber of <p(x I' '" ,x n ) (a Gi.ide1 number of <p trom 'P), <p(x I , ..., x n ) is sometimes written as {e] (XI' ...,xn)({e}'I'(X I , ...,x n )). Ifa predi- cate R(x I' ..., X,,, y) is general recursive, then l1yR(xt. ...,xn,y) is partial recursive. There- fore, {z}( XI, ... , x n ) is a partial recursive func- tion of the variables of z and of Xt...., Xn' On the partial recursive functions, the fol- lowing two theorems, given by Kleene [3], are most useful. (1) For natural numbers m, n, a primitive recursive function Snm(z, YI' ..., Ym) can be found such that, for any natural num- ber e, {e}(YI, ...,Ym,x I , ...,X n )"" {Snm(e'Yl' ..., Ym)](x I' ..., x n ). (2) For any partial recursive function tf;(z, X I' ..., x n ), a natural number e can be found such that {e} (XI' "', x n ) "" tf;(e,x l . ...,x n ). The notion of partial recursive functions appeared first in the theory of tconstructive ordinal numbers of Church and Kleene (1963). Partial recursive functions can be defined in the Herbrand-Gi.idel-Kleene style as a natural extension of general recursive functions, and they are also definable by a finite series of applications of the schemata (IV), (V), and (VI) (in each schema, = used for the definition of <p should be replaced by "") starting from func- tions given by (I), (II), and (III). F. Extension of Recursive Functions to Number-Theoretic Functionals Let O(t...., O(m be number-theoretic functions of one variable. If (p(x I' ... ,x n ) is (partial) recur- sive uniformly in 0(1' "', O(m, then a Gi.idel number e of <p is found independently of 0( I' '" , O(m, and <pi x I' ... ,x n ) is expressed as U(pyT,"l >m(e,x l , ...,xn,y)). Now suppose that 'X I ' ... , 'Xm range over the set N N of all number-theoretic functions of one variable, 1324 and put (P(e<:l, ".,C<:m'X b ...,x lI ) "" U(I1YT,,"' "m(e,x l , ...,xn,y)). We call such a functional <P(O(I' ..., O(m, X I' '" ,x n ) (partial) recursive, and with it we can develop a theory of recursive functions of variables of two types. Extending the notion of recursive func- tionals, Kleene introduced and investigated the recursive functionals of variables of arbit- rary (finite) types [10, I 1]. The natural num- bers are the objects of type 0, and the one- place functions from type-j objects to natural numbers are objects of type j + 1. Denote variables ranging over the type-j objects by O(i, fJi, ii, ..., or O(L O(, O(t ..., etc. Consider a functional (simply called a function) of a given finite number of such variables of types taking natural numbers as values. A function (P is called a primitive recursive function if it is definable by a finite series of applications of the following schemata (I)-(VIII), where a is a variable of type 0, b is any list (po;sibly empty) of variables that are mutually distinct and different from the other variables of the schema, and tf;, X are given functions of the indicated variables. (I) <p(a, b) = a'; (II) <p(b) = q (q is a natural number); (III) <pia, b)=a; (IV) <p(b)= tf;(X(b), b); (V) <p(0, b) = l/t(b), <p(a', b) = x(a, <p(a, b), b); (VI) <p(a)= tf;(a l ) I a l is a list of variables from which a is obtained by changing the order of two variabks of the same type); (VII) <p(0(t, a, b) = 0( I (a); (VIII) <p(0(l, b) = O(i(Arx i - 2 X(0(l, rx i - 2 , b)) (Arx i - 2 desig- nates that X is a function of the variables O(i- 2). We assign to each function <p(a:1 a natural number called an index (which plays the same role as a Gi.idel number) in such a way that it reflects the manner of application of the schemata used to introduce <p(a). '\1ow, we write <p(a) with an index e as {e}(ct). We call a function <p( a) partial recursive if it is defin- able by a finite series of applicatio ns of the schemata (I)-(VIII) ("" is employed instead of = in (lV)-(VI) and (VIII)), and (IX) <p(a, b, c) "" {a} (b, c) (c is a finite list of variables of arbitrary types). In particular, <pia) is called a general re- cursive function if it is defined for all values of the argument a. These notions can be relativized also with respect to any given functions. Note that for the case of types :( 1, prirlitive recurs- ive functions, partial recursive functions, and also general recursive functions in the present sense are equivalent to the corresponding notions (introduced via relativization with uniformIty) in the ordinary sense already de- scribed. The following theorem is important: Let r be the maximum type of a. Then there 
1325 are two primitive recursive predicates M, N such that {e}(a) w= V''-13'1'-2 M(e, a, w, c- I , '1'-2), r2, W=3C- 1 V''1,-2N(e, a, w, '-I, '1'-2), r>2. Every function definable using (IX') t/J(a) /lx(t/J(a, x) =0) instead of (IX) is partial recur- sive. However, not aU the partial recursive functions of variables of types  2 can be obtained by applying schemata (I)-(VIII) and (IX'). Further developments have been pursued by 1. E. Fenstad, J. Moldestad, and others in abstract computation theory [20-23]. G. Recursive Functions of Ordinal Numbers G. Takeuti introduced a notion of primitive recursiveness for functions from a segment of the ordinal numbers to ordinal numbers. Using this, he constructed a model of set theory in ordinal number theory. In connec- tion with recursive functions of ordinal num- bers, there are also investigations by A. Levy, M. Machover, Takeuti and A. Kino, T. Tugue, S. Kripke, and others. Early treatments of recursive functions of ordinal numbers dealt only with functions on infinite cardinals. For example, Takeuti con- sidered functions with a fixed infinite cardinal K as a domain and a range, and defined K- recursive functions using schemata similar to the abovementioned (I)-(VI). Subsequently Kripke observed that the assumption that K is a cardinal is not necessary, and introduced the notion of admissible ordinals. An admis- sible ordinal K has the closure properties required for the construction of the calculus, and whenever IX, {3 < K and {3 = f(IX) is com- putable, then {3 = f(IX) is computable in fewer than K stages. Given an admissible ordinal K, K-recursiveness can be defined, as in the case of general recursiveness, by various equivalent methods, e.g., schemata, the equation calculus, and definability in both quantifier forms. Most of the elementary properties of general recur- sive functions (e.g., the normal form theo- rem, parametrization theorem, enumeration theorem, etc.) are also valid for K-recursive functions. The notions of degrees of unsolva- bi1ity and recursive enumerability can also be generalized, yielding the notions of K-degrees and K-recursive enumerabi1ity, respectively. The fine structures of these properties are currently the objects of intensive research. 356H Recursive Functions Every infinite cardinal is admissible. The least admissible ordinal is w, and the next is the ordinal WI of Church and Kleene, i.e., the first nonconstructive ordinal. In fact, for every n 1, the first ordinal not expressible as the order type of a  predicate is admissible ( Section H). For each infinite cardinal K there are K+ admissible ordinals of power K. Platek investigated recursion theory in a still wider setting. He dealt with functions defined, not on a segment of ordinal numbers, but on a set, and introduced the notion of admissible sets, i.e., sets on which a wen-behaved recursion theory can be developed. An admissible set is a transitive B-model of a certain weak set theory, and an ordinal K is admissible if and only if there exists an admissible set A such that A n On = K, where On is the class of all ordinal numbers. Recent developments have shown that generalized recursion theory, set theory, and infinitary logic are closely related. In addition to the abovementioned, there are some investi- gations by Y. N. Moschovakis and others [14-27]. H. Hierarchies Utilizing the theory of trecursive functions, S. C. Kleene succeeded in establishing a theory of hierarchies that essentiaUy contains class- ical descriptive set theory as an extreme case [5,10,31,32]. Although research foUowing a similar line had also been done by M. Davis, A. Mostowskii, and others, it was Kleene who succeeded in bringing the theory to its present form. Sets or functions are described by tpredi- cates, which we classify as foUows. Let a, b, ..., aj, a2'''', x, y,..., be variables ranging over the set N of natural numbers, and IX, {3, ..., IX l' IX 2 , ... , ,'1, ... be variables ranging over the set NN of all tnumber-theoretic functions with one argument. Let t/J l' .. . , t/Jl(l  0) be number-theoretic functions. A predicate P(IX1' "', IXm, aI' ..., an) (m, n 0, m+ n> 0) with variables of two ttypes is caUed analytic in t/Jj,..., t/Jl (I 0) if it is expressible syntacti- caIly by applying a finite number of logical symbols: -+, v, /\, I, 3x, V'x, 3, V', to tgen- eral recursive predicates in t/J I' '" , t/Jl' In partic- ular, when P is expressible without function quantifiers 3, V', it is caned arithmetical in t/JI' ..., t/Jl (IO). When 1=0, they are caUed simply analytic and arithmetical, respectively. For brevity, consider the case 1 = 0, and denote by a a finite list of variables (IX I' ." , IXm, a j, '" ,an)' Every arithmetical pre- dicate P(a) is expressible in a form contained 
356H Recursive Functions in the following table (a): (a) R(a): 3xR(a,x), \lx3yR(a,x,y),..., \lxR(a, x), 3x\lyR(a,x,y), ..., where each R is tgeneral recursive. In order to obtain such an expression we first transform the given predicate into its tprenex normal form and then contract successive quantifiers of the same kind by the formula 3x 1 ...3x n A(x 1 , ...,.x n ) =3xA«x)o' ...,(x)n-rJ (1) and its "dual form." Each form in (a) (or the class of all predicates with that form) is de- noted by EE or nE, where the suffix k refers to the number of quantifiers prefixed, and E or n shows that the outermost quantifier is existential or universal, respectively. A predi- cate that is expressible in both forms EE and nE (or the class of such predicates) is denoted by M. A predicate belongs to M if and only if it is general recursive (an analog oftSuslin's theorem ). For k 1, there exists in EE (or nE) an enumerating predicate that specifies every predicate in EE (nE). For example, for n and m = n = 1, there is a tprimitive recursive predicate S«()(, Z, a, x, y) such that, given a general recursive predicate R(cx,a,x,y), we have a natural number e such that \lx3yRcx, a, x, y)= \lx3yS«()(, e, a, x, y) (enumeration theorem). In this theorem, we can take T 2 "(z,a,x,y) ( Section F) as S(()(,z,a,x,y). For each kO, there exists a EE+dnE+rJ predicate that is not expressible in its dual form nE+1 (EE+rJ (hence, of course, in neither EE not nE) (hierarchy theorem). There- fore, table (a) gives the classification of the arithmetical predicates in a hierarchy. This hierarchy is called the arithmetical hierarchy. F or each k  1, there exists a complete predi- cate with respect to EE(nE), that is, a EE(nE) predicate with only one variable such that any E(I7) predicate is expressible by sub- stituting a suitable general (or more strictly, primitive) recursive function for its variable (theorem on complete form). When m = 0, all the general recursive predicates in EE exhaust +1 (Post's theorem). Concerning the function quantifiers, we have 3cx 1 ... :lcx m A(cx 1 , ...,cx m ) =3cxA(At(CX(t))O' ...,At(CX(t))m-rJ (2) 3xA(x)=3cxA(cx(0)), (3) \lx3cxAix, ()()= 3cx\lxA(x, AtCX(Y' 3')), (4) 1326 and their dual forms. For any general recur- sive predicate R, there is a primitive recursive predicate S such that 3cxR(cx, a)=3cx3xS(iX(x), a)=3yS(y, a) (5) and its dual hold. Using these fact:" we can classify the forms of all analytic predicates by the table (b): (b) A(a); \lcx3xR(a, cx, x), 3cx\l fJ3xR(a, cx, fJ. x), '"', 3cx\lxR(a, cx, x), \lcx3fJ\lxR(a,cx,fJ,x),.... where A is arithmetical and each R is general recursive. Similarly, denote by Ef, nf each form of predicate in (b) (or the cla,s of all predicates reducible to that form), where k is the number of function quantifiers prefixed; also, denote by l the (class of) predicates expressible in both forms Ef and nf. For Ef, nf (k 1), we have the enumeration theorem, the hierarchy theorem, and the theorem on complete form. The hierarchy given by table (b) is called the analytic hierarchy. For 1>0 (namely, when predicates are arithmetical or analytic in t/JI' ...,1(11), we can tuniformly trelativize the above results with respect to t/J1' .,. , t/JI' Now let {Er' 'l' .'P" nr''P!. ..'P'h (r=O, 1) be the corresponding hierarchy relative to t/J1' ...,t/JI' Given a set C ( c NN) of functions with one argument, we can consider hierarchies of predicates which are arithmetical or analytic in a finite number of functions in C. Such a hierarchy is called a C-arithmetical or C-analytic hierarchy and denoted by {EnC], m[C]h or {EnC], nnC]h, respectively. That is, when we re- gard Er [C] as a class of predicates ( or sets) P, it is thefamily {PI PEL'r'I' ''. 1' ... IE C, 1=0,1,2,...}. These notations have been given by J. W. Addison [28,29]. The NN_ arithmetical hierarchy and the NN-analytic hierarchy for sets correspond respectively to the finite Borel hierarchy and the orojective hierarchy in the tspace of irrational numbers. Addison caned the theory of those hierarchies classical descriptive set theory, and in contrast to this, the theory of arithmetical and analytic hierarchies for sets (C = 0) effective descrip- tive set theory [28]. We now restrict our consideration to pre- dicates for natural numbers (i.e., to the case m =0). Define the predicates Lk by Lo(a)= a=a, Lk+1(a)=3xTfk(a, a, x). For each kO, Lk+1 (a) is a EE+1 predicate which is of the highest tdegree of recursive unsol "ability among the EE+1 predicates, and its degree is properly higher than that of Lk(a). Thus Lb k == 0, 1, 2, ... , determine the arithmetical hierarchy of degrees of recursive unsolvability. Kleene has extended the series of Lk by using 
1327 the system 8 3 ( 81 Constructive Ordinal Numbers) of notations for the constructive or- dinal numbers as fo11ows [6]: H 1 (a)=-a=a; for yEO, H 2 Aa)=-3xT 1 H y(a, a, x); for 3. YEO, H3 5 y (a)=- Hyt.),.(a)o), where Y. = {y} (no)' This Hy is defined for each y EO, and it is of a prop- erly higher degree than that of Hz when Z<oY. Ifly/=Izl (Iyl is the ordinal number represented by y), then Hy and Hz are of the same degree (C. Spector [34]). Thus a hierar- chy of degrees is uniq uely determined by con- structive ordinal numbers. This hierarchy is ca11ed the hyperarithmetical hierarchy of de- grees of recursive unsolvability. A function or predicate is said to be hyperarithmetical if it is recursive in Hy for some yEO. These concepts and the results mentioned below can be rela- tivized to any given functions or predicates. A necessary (Kleene [31]) and sufficient (Kleene [32]) condition for a predicate to be hyperarithmetical is that it be expressible in both one-function quantifier forms  (an effective version of Suslin's theorem). Denote by Hyp the set ( c NN) of a11 hyperarithmetical functions a. For an arithmetical predicate A(a, a), 3a. eHyp A(a, a) is always a Ilt predicate (Kleene [33]). Conversely, for any Ilt predi- cate P, there is a general recursive predicate R such that P(a)=-3a. e Hyp VxR(a,a,x) (Spector [35]). As to tuniformization, for a Ilt predi- cate P(a, b), we have Vx 3yP(x, y) = 3a. e Hyp VxP(x, a(x)) (G. Kreisel, 1962). Let E be an ob- ject of type 2 defined by: E(a)=O if 3x(a(x) =0), otherwise E(a) = 1. A function qJ(a 1 , ..., a.) is hyperarithmetical if and only if it is general re- cursive in E (Kleene [10]). A predicate that is hyperarithmetical relative to III predicates (kO) is of +1 (Kleene [32]), but the con- verse does not hold in general (Addison and Kleene, 1957). Kleene extended his theory of hierarchy to the case of predicates of variables of any type by utilizing the theory of general recur- sive functions with variables of finite types 0, 1,2, '" [10]. Let at be a list of variables of types ",;; t. We say a predicate P( a') is of order r in completely defined functions t/Jr."', t/Jl (10) (for brevity, denote them by '1') if Pis syntactica11y expressible in terms of variables of finite types, predicates that are general recursive in '1', and symbols of the tpredicate calculus with quantification consisting only of variables of types < r. The predicates of order o in 'I' are exactly the general recursive ones in '1'. When t  1, and 'I' are functions of vari- ables of type 0, a predicate P(a') is of order 1 (of order 2) in 'I' if and only if P is arithmetical (analytic). We have theorems similar to (2)-(4) and the fo11owing theorem and its dual for r  2: For any given general recursive predicate 356 Ref. Recursive Functions P(a', u', '-2), there is a primitive recursive predicate R(a',r(-I, '-2) such that 3u' V '-2 P(a', u', '-2) =- 311'-IV'-2 R(a', 11'-1, '- 2). (6) Using these equivalences, each predicate p(a') of order r + 1 (r  0) is expressible in one of the fo11owing forms: (c) B(a); Va' 3'-1 R( a, a', '-1), 3a' V f3' 3C- 1 R( a, a', f3', '-1 ), .., , 3a'V,-1 R(a, a', '-1), Va' 3f3' VC- 1 R(a, a', {3', '-1), '" , where B is of order r and each R is general recursive. When t=r+ 1, table (c) gives the classification of the predicates of order r + 1 into the hierarchy. In fact, for the predicates P(a'+l) in each form, we have the enumera- tion theorem, the hierarchy theorem, and the theorem on complete form (Kleene [10]). D. A. Clarke [30] has published a detailed review of the general theory of hierarchies. References [1] K. Godel, Uber formal unentscheidbare Satze der Principia Mathematica und ver- wandter Systeme I, Monatsh. Math. Phys., 38 (1931),173-198. [2] S. C. Kleene, General recursive functions of natural numbers, Math. Ann., 112 (1936), 727-742. [3] S. C. Kleene, On notation for ordinal numbers, J. Symbolic Logic, 3 (1938), 150-155. [4] A. Church, The calculi of lambda- conversion, Ann. Math. Studies, Princeton Univ. Press, 1941. [5] S. C. Kleene, Recursive predicates and quantifiers, Trans. Amer. Math. Soc., 53 (1943), 41-73. [6] E. L. Post, Recursively enumerable sets of positive integers and their decision problems, Bull. Amer. Math. Soc., 50 (1944),284-316. [7] S. C. Kleene, Introduction to metamath- ematics, Van Nostrand, 1952. [8] R. Peter, Rekursive Funktionen, Akade- mische Verlag., 1951. [9] A. A. Markov, Theory of algorithms (in Russian), Trudy Mat. Inst. Steklov., 42 (1954). [10] S. C. Kleene, Recursive functionals and quantifiers of finite types I, Trans. Amer. Math. Soc., 91 (1959), 1-52. [11] S. C. Kleene, Recursive functionals and quantifiers of finite types II, Trans. Amer. Math. Soc., 108 (1963), 106-142. [12] H. Hermes, Enumerability, decidability, computability, Springer, 1965. 
357 A Regular Polyhedra [13] H. Rogers, Theory of recursive functions and effective computability, McGraw-Hill, 1967. [14] J. Barwise, Infinitary logic and admissible sets, J. Symbolic Logic, 34 (1969), 226-252. [15] R. B. Jensen and C. Karp, Primitive recursive set functions, Proc. Symposia in Pure Math., XIII (1971),143-176. [16] S. Kripke, Transfinite recursions on ad- missible ordinals, J. Symbolic Logic, 29 (1964), 161-162. [17] M. Machover, The theory of transfinite recursion, Bull. Amer. Math. Soc., 67 (1961), 575-578. [18] G. Takeuti, A formalization of the theory of ordinal numbers, 1. Symbolic Logic, 30 (1965),295-317. [19J T. Tugue, On the partial recursive func- tions of ordinal numbers, J. Math. Soc. Japan, 16 (1964),1-31. [20J 1. Moldestad, Computation in higher types, Springer, 1977. [21] J. E. Fenstad, R. O. Gandy, and G. E. Sacks (eds.), Generalized recursion theory, North- Holland, 1974. [22] 1. E. Fenstad, R. O. Gandy, and G. E. Sacks (eds.), Generalized recursion theory II, North- Holland, 1978. [23] J. E. Fenstad, General recursion theory, Springer, 1980. [24J Y. N. Moschovakis, Elementary induc- tion on abstract structures, North-Holland, 1974. [25] Y. N. Moschovakis, Descriptive set theory, North-Holland, 1980. [26] J. Barwise, Admissible sets and structures, Springer, 1975. [27] P. G. Hinman, Recursion theoretic hier- archies, Springer, 1978. [28] 1. W. Addison, Separation principles in the hierarchies of classical and effective de- scriptive set theory, Fund. Math., 46 (1958- 1959),123-135. [29] J. W. Addison, Some consequences of the axiom of constructibility, Fund. Math., 46 (1958-1959),337-357. [30] D. A. Clarke, Hierarchies of predicates of finite types, Mem. Amer. Math. Soc., 1964. [31 J S. C. Kleene, Arithmetical predicates and function quantifiers, Trans. Amer. Math. Soc., 79 (1955),312-340. [32J S. C. Kleene, Hierarchies of number- theoretic predicates, Bull. Amer. Math. Soc., 61 (1955), 193-213. [33J S. C. Kleene, Quantification of number- theoretic functions, Compositio Math., 14 (1959), 23-40. [34] C. Spector, Recursive well-orderings, 1. Symbolic Logic, 20 (1955), 151-163. [35J C. Spector, Hyperarithmetical quantifiers, Fund. Math., 48 (1959-1960), 313-320. 1328 357 (VI.6) Regular Polyhedra A. Regular Polygons A tpolygon in a Euclidean plane bounding a tconvex cell whose sides and interior angles are all respectively congruent is called a regu- lar polygon. When the number of vertices (which equals the number of sides) is n, it is called a regular n-gon. There exist a circle (circumscribed circle) passing through all the vertices of a regular n-gon and a concentric circle (inscribed circle) tangent to all the sides. We call the center of these circles the center of the regular n-gon. The n vertices of a regular n-gon are obtained by dividing a circle into n equal parts. (When a polygon in a Eucli- dean plane bounds a tconvex cell, this 2-cell is sometimes called a convex polygon. Thus regular polygon sometimes means the convex cell bounded by a regular polygon as de- scribed above.) A necessary and sufficient condition for a regular n-gon to be geometri- cally constructible is that n be decompo- sable into the product of prime numbers n = 2 m P1 '" Pk (m:;:, 0), where the Pi (i = 1, 2, ...) are different tFermat numbers ( 179 Geo- metric Construction). B. Regular Polyhedra Consider a regular polygon on a plane, and take a point on the line perpendicular to the plane at the center of the polygon. The set of points on all half-lines joining this point and points on the polygon (considered as a convex cell) is called a regular polybedral angle having this point as vertex (Fig. 1). Fig. I When a tconvex polyhedron  in £3 satis- fies the following two conditions, we call it a regular polyhedron: (1) Each face of, which is a 2-dimensional cell, is a regular polygon, and all faces of  are congruent to each other. (2) Its vertices are all surrounded alike. That is, by the projection of  from each vertex of , we obtain a regular polyhedral angle; these regu- lar polyhedral angles are all congruent to each other. From (2) we see that the number of edges emanating from each vertex of  is 
1329 independent of the vertex. It has been known since Plato's time that there are only five kinds of regular polyhedra: tetrahedrons (Fig. 2), octahedrons (Fig. 3), icosahedrons (Fig. 4), cubes or hexahedrons (Fig. 5), and dodecahedrons (Fig. 6) (see also see Table 1). Fig. 2 Regular tetrahedron. Fig. 3 Regular octahedron. I I I I I I _-.1.- Fig. 4 Regular icosahedron. Fig. 5 Regular hexahedron (or cube). Fig. 6 Regular dodecahedron. From a given regular polyhedron, we can obtain another one by taking as vertices the centers of all the faces of the given polyhedron (Fig. 7). We say that the given regular poly- hedron and the one obtained in this way are dual to each other. The octahedron and hexa- 357C Regular Polyhedra hedron are dual to each other, as are the icosahedron and dodecahedron. The tetra- hedron is dual to itself. For a regular poly- hedron , there exist concentric circumscribed and inscribed spheres whose center is the center of symmetry of  and is called the center of. Drawing tangent planes to the circumscribed sphere at each vertex, we can obtain a regular pOlyhedron dual to the given one (Fig. 8).  \ \ --- :--- 'W Fig. 7 Fig. 8 In a regular polyhedron, let a be the length of an edge, e the magnitude of the dihedral angle at each edge, and Rand r the radii of circumscribed and inscribed spheres, respec- tively. Then the following relations hold (we assume that each face is a regular p-gon and q faces meet at each vertex): . e 11: / . 11: sllli=cos q Slllp' a 11: / 11: e R=i sin q sinpcosi' (1) a 11: e r=-cot-tan- 2 p 2' R 11: 11: -=tan-tan- r p q (see Table 2). Corresponding to these poly- hedra, we have finite subgroups of 0(3) called regular polyhedral groups ( 151 Finite Groups). C. Higher-Dimensional Cases It is possible to generalize these considerations to higher dimensions to define inductively Table 1. Regular Polyhedra in 3-Dimensional Euclidean Space E 3 N umber of Number of Number of N umber of Faces around Figure Face Vertices Edges Faces a Vertex Regular Equilateral tetrahedron triangle 4 6 4 3 Regular Equilateral octahedron triangl e 6 12 8 4 Regular Equilateral icosahedron triangl e 12 30 20 5 Regular hexahedron Square 8 12 6 3 Regular Regular dodecahedron pentagon 20 30 12 3 
357 Ref. 1330 Regular Polyhedra Table 2. Numerical Values for Eqs. (I) N umber of Faces sinO 0 R/a ria 4 2 V2 /3 70°31'43.6" \1'6/4 \1'6/12 6 I 90° V3/2 1/2 8 2V2 /3 109°28' 16.4" 1/V2 1/\1'6 12 2/V5 116°33'54.2" V3 (V5 + 1) V 25 + II V5 4 2m 20 2/3 138° 11'22.8" y( 5+ V5) 3+V5 - 2V2 4V3 Regular Polyhedra in 4-Dimensional Euclidean Space £4 3- Dimensional Regular Polyhedr a Kind Number Table 3. Figure N umber of Vertices Duality a Regular 5-hedron Tetrahedron 5 5 Regular 8-hedron Cube 8 I: f Regular 16-hedron Tetrahedron 16 Regular 24-hedron Octahedron 24 24 Regular 120-hedron Dodecahedron 120 600  Regular 600-hedron Tetrahedron 600 120 a: dual to itself; b: dual to each other b a b Table 4. Regular Polyhedra in n-Dimensional Euclidean Space (n;;;. 5) Regular Polyhedron in Rn - I Number of Vertices Duality Figure Kind Regular (n + 1)-hedron Regular n-hedron Regular 2n-hedron Regular (2n - 2)-hedron Regular 2 n -hedron Regular n-hedron a: dual to itself; b: dual to each other regular polyhedra in En, n;;;.4. When n=4 we have 6 kinds of regular polyhedra (Table 3). For n;;;. 5 we have only 3 kinds (Table 4) ( 70 Complexes). References [1] 1. S. Hadamard, Lec;ons de geometrie elementaire II, Armand Colin, second edition, 1906, 425-427. [2] D. Hilbert and S. Cohn-Vossen, Anschau- liche Geometrie, Springer, 1932; English translation, Geometry and the imagination, Chelsea, 1952. [3]E. Steinitz and H. Rademacher, Vorlesun- gen iiber die Theorie der Polyeder, Springer, 1934. [4] H. S. M. Coxeter, Regular polytopes, Methuen, 1948; third edition, Dower, 1973. [5] H. S. M. Coxeter, Regular complex poly- topes, Cambridge Univ. Press, 1974. Number n+1 2n 2 n n+1 a 2 n  2n \ b 358 (11.2) Relations A. General Remarks In its wider sense the term relation means tn- ary relation (n = 1,2,3, ...) ( 411 Symbolic Logic G), but in this article we restrict our- selves to its most ordinary meaning, i.e., to the case n = 2. Let X, Y be two sets and x, y be two variables taking their values in X, Y, respectively. A proposition R(x, y) containing x, y is caned a relation or a binary relation if it can be determined whether R(a, b) is true or false for each pair (a, b) in the tCartesian prod- uct X x Y. For example, if both X and Yare the set of rational integers, then the following propositions are relations: x:( y, x - y is even, x divides y. A relation R(x, y) is sometimes written as xRy. For a given relation R, we define its inverse relation R -1 by yR -1 x= xRy. Then R is the 
1331 inverse relation of R- 1 . In the example above, the inverse relation of x < y is y> x, and the inverse relation of x is a divisor of y is Y is a multiple of x. A relation R is caned reflexive if xRx holds. R is caned symmetric if xRy= yRx (namely, if Rand R -1 are identical). R is caned transitive if xRy and yRz imply xRz. R is caned antisymmetric if xRy and yRx imply x = y. A reflexive, symmetric, and transi- tive relation is caned an equivalence relation ( 135 Equivalence Relations). A reflexive and transitive relation is caned a tpreordering. A reflexive, transitive, and antisymmetric relation is caned an tordering ( 311 Ordering). Suppose that we are given a relation xRy (XEX,YE Y). Then the set G= {(x,y)1 xRy}, which consists of elements (x, y) of the Car- tesian product X x Y satisfying xRy, is caned the graph of the relation R. Conversely, for any subset G of X x Y, there exists a unique rela- tion R with the graph G given by xRy= (X,y)EG. B. Correspondences For a subset G of the Cartesian product X x Y, the triple r=(G,X, Y) is caned acorre- spondence from X to Y. The set X is called the initial set of the correspondence r, and Y the final set ofr. A relation xRy (XEX,YE Y) determines a correspondence r=(G,X, Y) by its graph G, and conversely, a correspondence r determines a relation R. Given a corre- spondence r=(G,X, Y), the sets A=prx G and B=pryG, where prx:X x Y->X and pr y: X x Y -> Yare the tcanonical projections, are caned the domain and range of the corre- spondence r, respectively. For XEX, the set {YE YI (x, Y)E G} is denoted by G(x) or r(x), and we say that any element y of G(x) corre- sponds to x by r. For a subset G of X x Y, we define a subset G-1 of Yx X by (X,y)EG=(y,X)EG- 1 . Given a correspondence r = (G, X, Y), the corre- spondence (G-r, Y,X) is denoted by r- 1 and is called the inverse correspondence of r. If G is the graph of a relation R, then G- 1 is the graph of the inverse relation R -1. The domain of the correspondence r is the range of r -1, and vice versa. We have (r- 1 )-1 =r. Suppose that we are given correspondences r 1 =(Gj,X, Y) and r 2 =(G 2 , Y,Z). We define a subset G of X x Z by: (x, Z)E G=there exists YE Y satisfying (X,y)EG 1 and (y, Z)EG 2 . Then the correspondence r=(G,X,Z) is denoted by r 2 0 r 1 and is called the composite of r 1 and r 2 . We have the associative law (r30r2)Or1 = r 3 0(r 2 0r 1 ) and the law(r 2 or 1 )-I=ri 1 0 r - 1 2 . Let r be a correspondence from X to Y, and 359B Relativity assume that to any x belonging to the domain A of r there corresponds one and only one YE Y, namely, r(x)={y} for any xEA. Then r is caned a univalent correspondence. If rand r- 1 are both univalent correspondences, r is caned a one-to-one correspondence. For given sets A and B, a univalent correspondence with domain A and range B is called a tmapping (or tfunction) with domain A and range B ( 381 Sets C). References See references to 381 Sets. 359 (XX.21) Relativity A. History The theory of relativity is a system of theoret- ical physics established by A. Einstein and is composed of special relativity and general relativity. Toward the end of the 19th century, it was believed that electromagnetic waves propagate through the ether, a hypothetical medium. A number of experimenters tried to find the motion of the earth relative to the ether, but an these attempts were unsuccessful (A. A. Michelson, E. W. Morley). Studying these results, in 1905 Einstein proposed the special theory of relativity, which extended Galileo's relativity principle of tNewtonian mechanics to telectromagnetism and radically revised the concepts of space and time. Almost an the conclusions of special relativity theory are now confirmed by experiments, and this theory has even become a guiding principle for developing new theories in physics. By extend- ing special relativity, Einstein established (1915) the general theory ofrelativity. Its prin- cipal part is a new theory of gravitation con- taining Newton's theory as a special case. Its conclusions about the solar system are com- patible with observed facts that are regarded as experimental support for the theory. Effects due to general relativity other than those just described have been studied to a considerable extent, but it is hard at present to test theoret- ical results experimentally, and there are some doubts about the limit of its applicability. B. Special Relativity In Newtonian mechanics, natural phenomena are described in a 3-dimensional Euclidean 
359C Relativity space considered independent of time. In spe- cial relativity, however, it is postulated that space and time cannot be separated but are unified into a 4-dimensional pseudo-Euclidean space with the tfundamental form ds 2 =" 9 dxadxb =c 2 dt 2 -dx 2 _d y 2 -dZ2 .ab , a.b where a, b=O, 1,2,3 and (x o , Xl, x 2 , x 3 ) =(ct, X, y, z). Here (x, y, z) are spatial Cartesian coordinates, t is time, and c is the speed of light. This space was introduced by H. Minkowski and is called Minkowski space-time. By means of it Min- kowski gave an ingenious geometric interpre- tation to special relativity. A nonzero vector V is called timelike, nul! (or light like), or spacelike according as V 2 > 0, =0, or <0, where V 2 = La.bgab va Vb. The group of motions in Minkowski space- time is called the inhomogeneous Lorentz group. Its elements can be written as Xi' == I cxa + c i , a '" a b L.. gabci C j = gij' a.b where c] and c i are constants. The transfor- mations with c i = 0 are usually called Lorentz transformations, and the group G composed of these transformations is called the homoge- neous Lorentz group or simply the Lorentz group. These are important concepts in special relativity. If Go denotes the tconnected compo- nent of the identity element of G, the factor group GIGo is an Abelian group of type (2,2) and of order 4. We call Go the proper Lorentz group. A frequently used element of Go is x-vt t-(V I C 2 )X Lv: x' - , t' = I J l _v 2 /c 2 /C2 ' y'=y, z'=z; Ivl<c. Such transformations form a I-dimensional subgroup of Go with v as a parameter, and the composition law of the subgroup is given by u+v w- I + uvlc 2 . La' Lv= Lw, Elements of G not belonging to Go are T:x O ,= _x o , Xi'=X i ; i= 1,2,3, S:xO'=x O , x i ,= _Xi; i= 1,2,3. Both T (time reversal) and S (space reflection or parity transformation) have aroused much interest among physicists. Historically, the transformation formula (1) was first obtained by H. A. Lorentz, under the assumption of contraction of a rod in the direction of its movement in order to over- come the difficulties of the ether hypothesis, but his theoretical grounds were not satisfac- tory. On the other hand, Einstein started with 1332 the following two postulates: (i) Special prin- ciple of relativity: A physical law should be expressed in the same form in all inertial sys- tems, namely, in all coordinate systems that move relative to each other with uniform velocity. (ii) Principle of invariance of the speed of light: The speed of light in a vacuum is the same in all inertial systems and in all direc- tions, irrespective of the motion of the light source. From these assumptions Einstein derived (1) as the transformation formula between inertial systems x=(ct,x,y,z) and x' = (ct', x', y', z') that move relative to each other with uniform velocity v along the com- mon x-axis. This was the first step in special relativity, and along this line of thought, Ein- stein solved successively the problems of the Lorentz-Fitzgerald contraction, the dilation of time as measured by moving clocks, the aber- ration of light, the Doppler effect, and Fresnel's dragging coefficient. C. Relativity and Electromagnetism (1) In special relativity, a physical quantity is represented by a ttensor (or a scalar or a vec- tor) in Minkowski space-time, and physical laws are written in tensor form and are invar- iant under Lorentz transformations of coordi- nates. This is the mathematical expression of the special principle of relativity. Since the transformation (1) tends to a Galileo transfor- mation in Newtonian mechanics as C-HfJ, the special principle of relativity is a generalization of the Newton-Galileo principle of relativity. To summarize mathematical!y, it may safely be said that special relativity is a theory of invariants with respect to the Lorentz group. To illustrate this conclusion we consider elec- tromagnetic theory. The electric field E is usually represented by a "polar vector" and the magnetic field H by an "axial vector" in a 3-dimensional Euclidean space. Even if the magnetic field does not exist in one inertial system, the field can arise in another system that moves uniformly rela- tive to the original system. In view of this, electric and magnetic fields are considered in relativity to form one physical quantity with components (F i ) = ( ; Ez -Ex o -Hz Hy -Ey Hz o -Hx -Ez ) -Hy H . x o This quantity transforms as an talternating tensor of degree 2 under Lorentz transfor- mations. In like manner, the electric charge density p and the electric current density J are unified into a tcontravariant vector with 
1333 respect to Lorentz transformations: s=(SO,SI,S2,S3) =(p, Jxje, Jyje, Jzje). Such a vector s in Minkowski space-time is sometimes ca11ed a four-vector as distin- guished from an ordinary vector such as J. If the electromagnetic field F ij and the current four-vector s are thus defined, the tMaxwe11 equations, the basic equations of electro- magnetism, can be written in tensor form: oF ia I=Si, a UX OFjk OF ki oF i . ox i + ox j + ox: =0, where Fij =" g ia g jb F "ia ,i L. . ab' L.g gja=Uj' a,b a In the same way, the equation of motion for a charged particle in an electromagnetic field can be expressed as d 2 Xi e .dx a ds 2 = me F, Fi_" i a F j - L. 9 ja' a where e and m are the charge and mass of the particle, respectively, and s is the arc length along the particle trajectory ( 130 Electromagnetism). Though special relativity originated in studies of electromagnetic phenomena, it has gradua1!y become clear that the theory is valid also for other phenomena. One interesting result is that the energy of a particle moving with unifo rm veloci ty v is given by E=me 2 j J 1-v 2 je 2 , and accordingly even a particle at rest has energy me 2 (rest energy). This shows the equiv- alence of mass and energy, with the conversion formula given by E=me 2 . This conclusion has been verified experimenta11y by studies of nuclear reactions and has become the basis of the development of nuclear power. The special principle of relativity also showed its validity in the electron theory of P. A. M. Dirac (1928) and the quantum electrodynamics of S. Tomonaga (1943) and others. It has, however, been shown that the in variance for space reflection (namely for the coset SG o of the Lorentz group G) is violated in the decay of elementary particles (T. D. Lee and C. N. Yang, 1956; C. S. Wu et a1., 1957). Similar results have been obtained for time reversal (J. H. Christenson, J. W. Cromin, V. L. Fitch, and R. Tur1ay, 1964). D. General Relativity Special relativity has its origin in studies of electromagnetic phenomena, while the central 359D Relativity part of general relativity is a theory of gravita- tion founded on the general principle of rela- tivity and the principle of equivalence. The first principle is an extension of the special prin- ciple of relativity to accelerated systems in genera1. It requires that a physicallaw should be independent of the choice of local coordi- nates in a 4-dimensional tdifferentiable mani- fold representing space and time (space-time manifold). Since a physical quantity is repre- sented by a tensor on the space-time manifold, physicallaws are expressed in tensor form, in agreement with the first principle. The second principle claims that gravitational and inertial mass are equal, and accordingly fictitious forces due to acceleration (such as centrifugal force) cannot be distinguished from gravita- tional force. This had been shown with high accuracy by the experiments of R. von E6tv6s (1890) and others. Starting from these two principles, Einstein was led to the fo11owing conclusion. If a gravi- tational field is produced by matter, the cor- responding space-time structure is altered' namely, flat Minkowski space-time is chaged into a curved 4-dimensional manifold with tpseudo-Riemannian metric of tsignature (1,3). The tfundamental tensor gij of this manifold represents the gravitational potential, and the gravitational equation satisfied by gij can be expressed as a geometric law of the manifold. Gravitational phenomena are thus reduced to properties of the geometric structure of the space-time manifold. This idea, which was not seen in the older physics, became the motif in the development of tunified field theories. Now the gravitationallaw proposed by Einstein is an analog of the tPoisson equation in Newtonian mechanics. Let R.. and R be the 'J tRicci tensor and the tscalar curvature formed from gij' Then outside the source of a gravita- tional field, gij must satisfy Gij=Rij-g ij Rj2=0, that is, Rij=O, (2) and inside the source, G ij = KT,j, (3) where K is the gravitational constant. Here the energy-momentum tensor T,j is a tsymmetric tensor representing the dynamical state of matter (energy, momentum, and stress). Usu- a11y (2) and (3) are ca11ed the exterior and interior field equations, respectively. Next, the equation of motion of a particle in a gravitational field is given by t5 2 x i t5s 2 =0, dx a dx b I gab- d - d = +1, a,b S S (4) if the particle mass is so sma11 that its effect on the field is negligible. Here t5jt5s stands for tcovariant differentiation with respect to the 
359E Relativity arc length s along the particle trajectory. In other words, a particle in a gravitational field moves along a timelike tgeodesic in the space- time manifold. Similarly, the path of light is represented by a null geodesic, whose equation is formally obtained from (4) by replacing the right-hand side of the second equation by zero. Experimental verification of the theory of general relativity has been obtained by detec- tion of the following effects: the shift of spec- trallines due to the gravity of the earth and of white dwarfs, the deflection of light or radio waves passing near the sun, the time delay of radar echo signals passing near the sun, and the advance of the perihelion of Mercury. All the observational data are compatible with the theoretical results. Time delay and the advance of the perihelion have been observed in a binary system of neutron stars. It is generally accepted that these results are experimental verifications of general relativity. It should be noted that (2) has wave solu- tions, which have no counterpart in Newton's gravitational theory. This fact implies that gravitational effects propagate with the veloc- ity of light. The gravitational waves transport energy and momentum, and the gravitational mass is decreased by the emission of the waves. Experiments to detect gravitational waves generated in the universe have been planned, and the decrease of the orbital period of a binary star system due to the emission of gravitational waves has been observed. The concept of gravitational waves suggests the existence of a quantum of the gravitational field (graviton); however, the detection of the graviton is far from feasible. In the interior equation (3), the matter pro- ducing a gravitational field is represented by a tensor j of class Co. But there is also a way of representing it by singularities of a solution of the exterior equation (2). From this point of view, the equation of motion of a material particle (i.e., singularity) is not assumed a priori as in (4), but is derived as a result of (2) (A. Einstein, L. Infeld, and B. Hoffman, 1938). In the static and weak field limits, the funda- mental form is approximately given by New- ton's gravitational potential ep as ds 2 ,=;c 2 (1-ep/c 2 )dt 2 -(1 +ep/c 2 )(dx 2 +dy2+dz 2 ), and (2) and (3) reduce in this limit to Laplace's and Poisson's equations, respectively. New- ton's theory of gravity is valid in the limit ep/c 2 « 1. Stimulated by the discoveries of neutron stars and black holes and by the big-bang theory of the universe in the 1960s, numerous studies of general relativity have been carried 1334 out on such problems as the gravitational field of a spinning mass, the dynamical process of gravitational collapse, the space-time structure of black holes, the generation of gravitational waves, the global structure and dynamics of the universe, and so on. A comparison of the theoretical predictions and the observations is generally favorable, but the phenomena in the universe are so complex that the effects of general relativity cannot always be isolated. E. Solutions of Einstein's Equations The isometric symmetry of space-time is de- scribed by tKilling vectors. The stationary metric is characterized by a timelike Killing vector, in which case equation (2) reduces to an telliptic partial differential equation on a 3-dimensional manifold. If the space-time is axially symmetric as well as stationary, (2) reduces to the Ernst equation: (s+s*)V 2 s=2Vs'Vs, (5) where V represents divergence in a flat space. The metric tensors are derived from the com- plex potential s. The solutions of (5) can be obtained using techniques developed for the soliton problem. One example of stationary and axially sym- metric solutions is the Kerr metric, which is written as 2mr ds 2 = c 2 dt 2 ----y(a sin 2 (Jdep -cdt)2 p ( dr2 ) 2 _p2 T+ d (J2 -(r2+a2)sin2(Jdep2, (6) with p2 =r 2 +a 2 cos 2 (} and L\.=r 2 - 2mr+ a 2 . This metric solution represents a gravita- tional field around a spinning mass with mass M = mc 2 /G and angular momentum J = Mac. When a = 0, this metric reduces to the Schwarzschild metric. Applying a Backlund transformation to the Kerr metric, an infinite series of stationary and axially symmetric solutions can be derived. All these solutions belong to the space-time metric with, in general, two Killing vectors. The dynamical evolution of space-time structure has been studied by means of the tCauchy problem of general relativity. Choos- ing appropriate dynamical variables, equation (2) or (3) is divided into constraint equations in terms of the initial data and evolution equations in terms of the dynamical variables. The latter thyperbolic equations may also be written in Hamiltonian form. A typical example of such a problem is the dynamics of a spatially homogeneous 3- 
1335 dimensional manifold; this has been studied as a cosmological model. The space-time with a constant scalar curvature R is caned de Sitter space, and reduces to the Minkowski space if R = O. If the 3-dimensional space is isotropic as we11 as homogeneous, the metric takes the form ds 2 = c 2 dt 2 - a(t)2 {dx 2 + f(x) 2 x (d0 2 +sin 2 0 dq>2)}, where f(x) = sin x, x, or sinh x. These are ca11ed Robertson-Walker metrics and are considered to describe a realistic expanding universe. F. Global Structure of Space-Time F o11owing the advances of modern differential geometry, manifestly coordinate-independent techniques to analyze space-time properties have been applied to general relativity. The mathematical model of space-time is a con- nected 4-dimensional tHausdorff Coo -manifold endowed with a metric of signature (1,3). The metric a110ws the physical description of local causality and of local conservation of energy and momentum. The metric functions obey the Einstein field equation (2) or (3). In order to clarify the global structure of the solutions of Einstein's equations, maxi- ma11y analytic extension of the solutions has been studied. The maximal extension of the Schwarzschild metric is given as 32m 3 ds 2 =_e- r / 2m (dv 2 -du 2 ) r -r 2 (d0 2 +sin 2 Odq>2), using the Kruskal coordinates, which are re- lated to the coordinates of (6) by (;m -1 )e r / 2m =u 2 _v 2 , tanh tj4m = ujv or vju. To study the global structure at infinity, a tconformal mapping of the metric is used. For example, the Minkowski metric is written in the form ds 2 = 0. 2 dS 2 , where ds 2 =dpdq-hin 2 (p-q)(d0 2 +sin 2 Odq>2) and o.=secpsecq, tanp=t+r, tanq=t-r. By means of this mapping, a11 points, includ- ing infinity, are assigned finite p, q coordinate values in -nj2<q<p<nj2. Singularities in space-time are one of the major problems concerning the global struc- ture of the manifold. For some Cauchy prob- lems relevant to cosmology and gravitational co11apse, the inevitable occurrence of a sin- gularity has been proved (singularity theorem). 359 Ref. Relativity A sufficient condition for occurrence of a singularity is that there be some point p such that all the nun geodesics starting from p converge to p again. In addition to this con- dition, for the proof of the singularity theorem it is presumed that the space-time is free of closed nonspacelike curves, that a Cauchy surface exists, and that the energy-momentum tensor satisfies the condition ( 1 ) .. ?; Tij-"2.T9ij V'VJ>O '.J for any timelike vector Vi. The singularity whose existence is implied by this theorem means that the space-time manifold is geodesi- ca11y incomplete (the space-time is complete if every geodesic can be extended to arbitrary values of its affine parameter). The causal structure of space-time is also related to the global structure of the manifold. In this regard, black holes have been intro- duced as the final state of gravitational col- lapse. In the black-hole structure of space- time, there exists a closed surface ca11ed an event horizon in an asymptotica11y flat space. The event horizon is the boundary (the set of points) in space-time from which one can escape to infinity, or the boundary of the set of points that one can see from the infinite future. Then the black hole is a region from which no signal can escape to the exterior of the event horizon. If we assume that singularities do not exist in the exterior of the event horizon, a station- ary black-hole structure is uniquely described by the Kerr metric [6]. In the case of spheri- caIly symmetric co11apse, this assumption is verified and the final metric is given by the Schwarzschild metric. However, it is not known whether this assumption is true in more general gravitational co11apse. References [1] A. Einstein, The meaning ofrelativity, Princeton Univ. Press, fifth edition, 1956. [2] H. Weyl, Space, time, matter, Dover, 1950. (Original in German, 1923.) [3] A. Lichnerowicz, Theories relativistes de la gravitation et de l'electromagnetisme, Masson, 1955. [4] C. W. Misner, K. S. Thorne, and J. A. Wheeler, Gravitation, Freeman, 1973. [5] L. Witten (ed.), Gravitation, an introduc- tion to current research, Wiley, 1962. [6] C. DeWitt and B. S. DeWitt (eds.), Black holes, Gordon & Breach, 1972. [7] S. W. Hawking and G. F. R. EIlis, The large scale structure of space-time, Cambridge Univ. Press, 1973. 
360 Ref. Renaissance Mathematics [8J M. Carmeli, Group theory and general relativity, McGraw-Hill, 1977. [9J S. W. Hawking and W. Israel (eds.), Gen- eral relativity, an Einstein centenary survey, Cambridge Univ. Press, 1979. [I OJ A. Hcld (ed.), General relativity and gravitation I, II, Plenum, 1980. 360 (XXI.8) Renaissance Mathematics Toward the middle of the 13th century, scho- lastic theology and philosophy were at their height with the Summa theologiae of Thomas Aquinas (1225 L I274); but in the latter half of the century, the English philosopher Roger Bacon (\ 214 1294) attacked Aquinian philoso- phy in his Opus majus, insisted on the impor- tance of experimental methods in science, and strongly urged the study of mathematics. The Renaissance flourished first in Italy, then in other European countries in the 15th and 16th centuries, in the domains of the arts and liter- ature. Newer ideas in mathematics and the natural sciences dominated the 17th century. However, it was the invention of printing in the 15th century, the translation of the Greek texts of Euclid and Archimedes into European languages, and the importation of Arabian science into Europe during the Renaissance that prepared for this development. In the 15th century, the German priest Nicolaus Cusanus (1401-1464) discussed infinity, the convergence of infinite series, and some problems of quadrature. During the same period, the German scholar Regiomon- tanus (14361476) wrote the first systematic treatise on trigonometry independent of as- tronomy. Leonardo da Vinci (\452 1519), the an-encompassing genius born in the same century, left manuscripts in which he wrote about mechanics, geometric optics, and per- spective. Da Vinci's contemporary, the Ger- man painter A. Diirer (14 71  1528), wrote a textbook on perspective. In 1494, L. Pacioli (1445 L 1514) published Summa de arithmetica, one of the first printed books on mathematics. Its content, influenced by Arabian mathemat- ics, includes practical arithmetic and double- entry bookkeeping. The book enjoyed wide popularity. The best known result of 16th-century mathematics is the solution of algebraic equa- tions of degrees 3 and 4 by the Italian mathe- maticians Scipione del Ferro, N. Tartaglia (1506-1557), G. Cardano (1501  1576), and L. Ferrari (\ 522 1565). Cardano published the 1336 solution of equations of the third degree in his book Ars magna (1545). The solution was due to Tartaglia, to whom acknowledgment was made, although publication of the method was against his will. This constitutes a famous episode in the history of mathematics, but what is historically more important is the fact that essential progress beyond Greek mathe- matics was made by mathematicians of this period, since the Greeks were able to solve equations only of degrees 1 and 2. Algebra was subsequently systematized by the French mathematician F. Viete (1540-1603). By the end of the 15th century, practical mathematics (influenced by the Arabians) had become popular in Europe, and more ad- vanced mathematics began to be studied in European universities, especially in Italy. In 1543, N. Copernicus (1473 1543) published his heliocentric theory (1543); G. Galilei (called Galileo) (1564-1642), the indomitable propo- nent of this theory, was also born in the 16th century. Copernicus studied at the Univer- sities of Bologna, Padua, and Ferrara; Galileo studied at the University of Pisa and taught at the Universities of Pisa, Padua, and Florence. A system of numeration was imported from Arabia to Europe in the 13th century; by the time of S. Stevin (1548? 16207) it took the definite form of a decimal system, and with the development and acceptance of printing, the forms of the numerals became fixed. References [IJ M. Cantor, V orlesungen iiber Geschichte der Mathematik II, Teubner, second edition, 1900. [2J J. Cardan (G. Cardano), The book of my life (trans. J. Stoner), Dover, 1963. [3J G. Cardano, Opera, 10 vols., Lyon, 1663. [4J N. Bourbaki, Elements d'histoire des mathematiques, Hermann, 1960. 361 (XX.32) Renormalization Group A. Introduction The concept of renormalization was intro- duced by S. Tomonaga, J. S. Schwinger, M. Gell-Mann, and F. E. Low in order to over- come the difficulty of divergence in field theory. If the upper bound of the momentum is limited to a finite cutoff value i\, then physical quan- tities, for example, the mass m of an electron, can be obtained as finite quantities by letting 
1337 A go to infinity after summing all divergent terms. This is called the renormalization method using subtraction. Since the cutoff A is arbitrary insofar as it is finite, the Green's functions are indefinite because they depend on A. This dependence on the cutoff A corresponds to the response for the scale transformation of length, and this transformation is a certain (semi) group, called a renormalization group. Several kinds of renormalization group have been used in field theory, as well as in the statistical mechanics of phase transition. B. Renormalization Group in Field Theory [1- 3] A typical method to resolve the ultraviolet divergence is to add subtraction terms in the Lagrangian so that they cancel the divergence. This cancellation is usually performed in each order of the perturbation expansion. When the addition of a finite number of subtraction terms cancels the divergence, the relevant Hamiltonian is said to be renormalizable. Otherwise it is called unrenormalizable. The Lagrangian density 1 1 2 2 1 4 .Po =;/}Il({JOO({JO -2mo({Jo - 4! go({Jo, for example, can be renormalized by the trans- formation ({Jo = Zj/2 ({J, go = ZI Z; 2 g, and m = m 2 + 15m 2 . All the divergences are taken into the renormalization constants 2 3 , ZI, and 15m 2 , so that the renormalized quantities ({J, g, and m are finite. These renormalization constants can be calculated by means of a perturbation method, but the requirement of circumventing the divergences alone is not sufficient to determine them explicitly. This indeterminacy is usually expressed as Z3(/1) and ZI (/1), i.e., in terms of a parameter /1, caned the renormalization point. The /1- dependence of these functions can be deter- mined by means of the following renormaliza- tion conditions: lim (p2 + m 2 )G(2)(p, /1)= 1; p2+#2_0 r(Pi,/1)= 1, /12 P . p .=- ( 1-4c5.. ) I J 3 IJ . Since the renormalization constants depend on the continuous parameter /1, the renormalized Green's function and coupling constant 9 are also functions of /1, and consequently the quantity d(2N)( {p;}, /1, g(/1)) defined by d(2N)( {Pi}, /1, g(/1)) := La (pt + m 2 )] G(2N)(Pi' /1, g(/1)) 361 C Renormalization Group satisfies the renormalization equation (/1 :/1 + p :g - 2Ny )d(2N)( {p;}, /1, g)=O, where p = /1dg/d/1 and y =t/1dlogZ 3 (/1)/d/1. If the coefficients p and yare calculated pertur- bationally up to a certain order, the renormal- ized Green's function is obtained up to the same order by solving the foregoing renor- malization equation. This is the first kind of renormalization group. A second kind ex- presses the response of the renormalized Green's function to the change in the mass and coupling constant, and is expressed by the Callan-Symanzik equation [4,5] (m 8 +P(g) :g + 2Nycp(9))GlfN)=.1GifN), where p(g)=Zm o og/8m o and ycp(g)=tZmo. ologZ3/8mo, and where Z is determined by Zmoom/8mo=m. The inhomogeneous term i!' defined by ,1 Glf N) = Zmo  GlfN) om o 8 + Zm o -(NlogZ 3 )GlfN). om o The irreducible Green's function rk,2N) satisfies (m o +P(g) :g -2Nycp(9)) rJ/ N ) = i.1rJ/ N ). Since the inhomogeneous term can be neg- lected in the high-energy region, the foregoing equation becomes homogeneous, and con- sequently its solution is r(2N) (  ) R Pi' A ,g =ex p [ -2N reA) y(g'}P-1(9')d 9 ] x rk,2N)(pi' m, g( A)). Here g(A) is the solution of the equation J3(A)P-1(g')dg'=logA. Since dimensional analy- sis yields rk,2N)(APi,m,g)=A 4 - 2N r R (pi,m/A,g), the foregoing solution shows that high-energy phenomena can be described by the low- energy phenomena whose coupling constant is given by goo :=g( 00). In particular, when goo =0, high-energy phenomena are described by the asymptotically free field. This circumstance is caned asymptotic freedom. C. Renormalization Group Theory in Statistical Physics The renormalization group technique has proved to be powerful in statistical physics, 
361 Ref. Renormalization Group particularly in studies of phase transitions and critical phenomena [6-10]. The correlation length  diverges like (T) (T -) -v near the critical point , where v is called the crit- ical exponent of . Similarly, the correlation function C(R) for the distance R behaves like C(R)R-(d-2+) x exp( -KR), K=C 1 , where d is the dimensionality of the system and IJ is the exponent describing the deviation of the singular behavior of C(R) from classical the- ory. Renormalization is useful in evaluating these critical exponents systematica11y. The fundamental idea is to eliminate some degrees of freedom, to find recursion formulas for interaction parameters, and then to evaluate critical exponents from their asymptotic be- havior near the fixed point. There are many different ways of carrying out this idea ex- plicitly. Roughly classifying these into two groups, we have (i) momentum-space renor- malization group theories [6-10] and (ii) real-space renormalization group theories [10,11]. The common fundamental structure of these renormalization group techniques is explained as fo11ows. First the momentum space or real space is divided into cells and rapidly fluctuat- ing parts, namely, sma11-momentum parts inside each cell are integrated or eliminated, and consequently the remaining slowly fluc- tuating parts, namely, long-wave parts, are renormalized. The original Hamiltonian Yfo is transformed into Yf 1 by means of this elimi- nation process and by some scale transforma- tion that preserves the phase space volume. This renormalization operation is written as Rb: i.e., '1 = RbYfO' Similarly we have Yf 2 = Rb'Yr\ = R ff o , "', Yf n = Rb'n-1 = RYfo, .... This transformation Rb has the (semi) group property Rw = RbRb" A generator G is defined by G =limH+o(R b - 1)/(b -1). That is, R b = exp(lG), e 1 = b. The transformation of Yf is expressed as dYf/d/=G[Yf]. The fixed point ff* = RbJf'* is the solution of G[Yf*] = O. In order to find critical exponents from the as- ymptotic behavior of G near Yf*, we consider a Hamiltonian of the form Yf = Yf* + wQ and expand G[Yf] as G[Yf*+wQ]=wKQ+ O(w 2 ). If the operator K thus defined has a negative eigenvalue Ai, the corresponding physical quantity Qi becomes irrelevant after repeating the renormalization procedure, and the physical quantity Qj corresponding to a positive eigenvalue }'j > 0 becomes relevant. Thus, by introducing a field h j conjugate to the relevant operator Qj, we study the Hamil- tonian ,= ,.if* + Lj hjQj' The free energy per unit volume J[Yf] == J(h1' h 2 ,...) is found to have the scaling property J(hl'"'' h j ,...)"'"- b-dJ(b AI hj,..., b%j'''')' 1338 By taking Q I as the energy operator, we have hi  T- ==t. By the normalization h Al h 1 = 1, we obtain the scaling law 1ft, "', hj."') "'"- t dlAI 1(1, ..., hjt"\...). The critical exponent of the specific heat de- fined by C  t -. is given by the formula ex = 2 - d/}'I' Other scaling exponents {<pJ can be obtained via the formula <pj=}'j/A1 from the eigenvalues of K. The simplest example of Rb is the case where a single interaction para- meter K is transformed into a new parameter K' by K' = Jb(K). The fixed point K* is given by the solution of K* = Jb(K*). The correla- tion exponent v defined by   (K - K*) -v is given by the Wilson formula v= log h/log A, A=(dJb/dK)KK*' In most cases, Rb is con- structed perturbationa11y, and critical expo- nents are usually calculated in power series of 8 == de - d, as <P = <Po + <PI 8 + <P2 8 2 + .. . , where de denotes the critical dimension. This is called the 8-expansion. The first few terms are cal- culated explicitly for specific models, such as the <p4- mo de1. By applying the Borel sum method to these 8-expansions, one can esti- mate critical exponents [10,11]. The renormalization group method can be applied to other many-body problems, such as the Kondo effect [12]. References [1] E. C. G. Stueckelberg and A. Petermann, La normalisation des constantes dans la theorie des quanta, Helv. Phys. Acta, 26 (1953), 499-520. [2] M. Gell-Mann and F. E. Low, Quantum electrodynamics at sma11 distances, Phys. Rev., 95 (1954),1300-1312. [3] N. N. Bogolyubov and D. V. Shirkov, Introduction to the theory of quantized fields, Interscience, 1959. [4] C. G. Ca11an, Jr., Broken scale invariance in scalar field theory, Phys. Rev., D2 (1970), 1541-1547. [5] K. Symanzik, Sma11 distance behaviour in field theory and power counting, Comm. Math. Phys., 18 (1970), 227-246; Sma11 distance-behaviour analysis and Wilson expansions, Comm. Math. Phys., 23 (1971), 49-86. [6] C. DiCastro and G. Jona-Lasinio, On the microscopic foundation of scaling laws, Phys. Lett., 29A (1969),322-323; C. DiCastro, The multiplicative renormalization group and the critical behavior in d=4-8 dimensions, Lett. Nuovo Cimento, 5 (1972), 69-74. [7] K. G. Wilson, Renormalization group and critical phenomena. I, Renormalization group and the Kadanoff scaling picture, Phys. Rev., 
1339 B4 (1971),3174-3183; II, Phase-space cell analysis of critical behavior, Phys. Rev., B4 (1971),3184-3205. [8] K. G. Wilson and M. E. Fisher, Critical exponents in 3.99 dimensions, Phys. Rev. Lett., 28 (1972), 240-243. [9] K. G. Wilson and J. Kogut, The renor- malization group and the Ii expansion, Phys. Rep., 12C (1974), 75-200. [10] C. Domb and M. S. Green (eds.), Phase transitions and critical phenomena VI, Aca- demic Press, 1976. [11] A. A. Migdal, Recursion equations in gauge field theories, Sov. Phys. JETP, 42 (1975),413-418; Phase transitions in gauge and sublattice systems, Sov. Phys. JETP, 42 (1975), 743-746. (Original in Russian, 1975.) [12] K. G. Wilson, The renormalization group: Critical phenomena and the Kondo problem, Rev. Mod. Phys., 47 (1975), 773-840. [13] L. P. Kadanoff, Notes on Migdal's recur- sion formulas, Ann. Phys., 100 (1976),359- 394. 362 (IV.16) Representations A. General Remarks For a mathematical system A, a mapping from A to a similar (but in general "more concrete") system preserving the structure of A is called a representation of A. In this article, we consider the representations of tgroups and tassociative algebras. For representations of other alge- braic systems  42 Boolean Algebras; 231 Jordan Algebras; 248 Lie Algebras. For topo- logical, analytic, and algebraic groups  13 Algebraic Groups; 69 Compact Groups; 249 Lie Groups; 422 Topological Abelian Groups; 423 Topological Groups; 437 Unitary Repre- sentations. For specific groups  60 Classical Groups; 61 Clifford Algebras. B. Permutation Representations of Gronps We denote by iSM the group of all tpermuta- tions of a set M ( 190 Groups B). A permuta- tion representation of a group Gin M is a homomorphism G->iS M . We denote by aM the permutation of M corresponding to aEG and write aM (x) = ax (XE M). Then we have a con- dition (ab)x=a(bx), 1x=x (a,bEG, 1 is the identity element, xEM). In general, if the prod- uct ax EM of aEG and XEM is defined and satisfies this condition, then G is said to oper- ate on M from the left, and M is called a left 362B Representations G-set. Giving a permutation representation of G in M is equivalent to giving the structure of a left G-set to M. A reciprocal permutation representation of G in M is an tantihomomor- phism G->iS M , which becomes a homomor- phism if we define the multiplication in iSM by the right notation x(fg) = (xf)g. If the product xa E M of a E G and x E M is defined and satis- fies the conditions x(ab)=(xa)b and xl =x, then, as before, G is said to operate on M from the right, and M is called a right G-set. Giving a reciprocal permutation representation of G in M is equivalent to giving the structure of a right G-set to M. A (reciprocal) permutation representation is said to be faithful if it is injective; the corre- sponding G-set is also said to be faithful. In particular, we can take G itself as M and de- fine the left (right) operation by the multipli- cation from the left (right). Then we have a faithful permutation representation (reciprocal permutation representation), which is called the left (right) regular representation of G. For aE G, the induced permutation aG:x->ax (xa) is called the left (right) translation by a. We call a left G-set simply a G-set. If a sub- set N of a G-set M satisfies the condition that aEG, XEN implies aXEN, then N forms a G- set, which is called a G-subset of M. If a G-set M has no proper G-subset (i.e., one different from M itself and the empty subset), then for any two elements x, YEM there exists an ele- ment a E G satisfying ax = y. In this case, the operation of G on M is said to be transitive, and the corresponding permutation represen- tation is also said to be transitive. If an equiva- lence relation R in a G-set M is compatible with the operation of G (i.e., R satisfies the condition that aEG, R(x,y) implies R(ax,ay)), then the quotient set M/R forms a G-set in the natural way, called the qnotient G-set of M by R. If a G-set M has no nontrivial quotient G- set, i.e., if the only equivalence relations com- patible with the operation are R(x, x') for any x, x' E M and R(x, x') if and only if x=x', then the operation of G on M and the corre- sponding permutation representation are said to be primitive. A mapping J: M -> M' of G-sets is called a G-rnapping (G-map) if the condition J(ax) = aJ(x) (aE G, XE M) is satisfied. G-injection, G- surjection, and G-bijection are defined natur- ally. The inverse mapping of a G-bijection is also a G-bijection. Two permutation represen- tations are said to be similar if there exists a G-bijection between the corresponding G-sets. 
362 C Representations Let M be a transitive G-set, and fix any element XE M. If we view G as a G-set, the mappingf:G->M defined by f(a)=ax is a G- surjection and induces a G-bijection 1: G/R ->M. Here an equivalence class of R is pre- cisely a left coset of the stabilizer (stability group or isotropy group) Hx= {aEGlax=x}. Hence we have a G-bijection G/Hx->M. Con- versely, for any subgroup H of G, G/H is a transitive left G-set. A transitive G-set is called a homogeneous space of G. F or a family {M)J ,<EA of G-sets, the Car- tesian product ILEAM,< and the tdirect sum LiEA M,< become G-sets in the natural way; they are called the direct product of G-sets and the direct sum (i.e., disjoint union) of G-sets, respectively. Every G-set M is the direct sum of a family {M,<} of transitive G-subsets, and each M,< is called an orbit (or system of transitivity). For a G-set M, the direct product G-set M k = M x ... x M (k times) contains a G-subset M(k) = {(XI"'" xdl i =lj implies Xi =lXj}' If M(k) is transitive, M is said to be k-ply transitive. If M is transitive and the stabilizer of each point of M consists of the identity element alone, M is said to be simply transitive. If M has n elements, a permutation repre- sentation of a group G in M is said to be of degree n. When G is a group of permutations of M, the canonical injection G->6 M is a faithful permutation representation; this case has been studied in detail ( 151 Finite Groups G). C. Linear Representations of Groups and Associative Algebras Let K be a tcommutative ring with unity ele- ment and M be a K-module. Though we shall mainly treat the case where K is a field and M is a finite-dimensional tlinear space over K, the case where K is an tintegral domain and M is a tfree module over K of finite rank is also important. Since K is commutative, we can write AX=XA (AEK,XEM). Let i!K(M) be the tassociative algebra over K consisting of all K- endomorphisms of M, and let GL(M) be the group of all tinvertible elements in i!K(M), where we assume M =I {O}. Let A be an as- sociative algebra over K. A linear represen- tation of the algebra A in M is an algebra homomorphism A->i!K(M). We always assume that A has a unity element and the homomor- phisms are unitary. For convenience, we can also consider a linear representation in the trivial space M = {O}, which is called the zero representation. A reciprocal linear represen- tation is an antihomomorphism A -+i!K(M). A linear representation of a group G in M is a group homomorphism G->GL(M). This can 1340 be extended uniquely to a linear representa- tion of the tgroup ring K[G] in M, and con- versely, the restriction of a linear representa- tion of K[G] in M to G is a linear representa- tion of G; and similarly for reciprocal linear representations. Thus the study of (reciprocal) linear representations of a group G in M can be reduced to the study of (reciprocal) linear representations of the group ring K [G] in M. We now consider the linear representation of associative algebras, which we call simply "algebras." (Note that a group ring has a canonical basisthe group itselfand allows a more detailed investigation;  Sections G, I.) Given a commutative ring K with unity and a linear representation p of a K-algebra A in a K-module M, we introduce the structure of a left A-module into M by defining ax = p(a)x (aEA,xEM); the structure of a K-module in M obtained by the canonical homomorphism K -+A coincides with the original one. This A- module is called the representation module of p. Conversely, for any left A-module M we can define a linear representation p of A in M (with M viewed as a K-module via K-+A) by putting p(a)x=ax; the representation module of p coincides with the original one. This rep- resentation p is called the linear representa- tion associated with M. A reciprocal linear representation of A corresponds to a right A- module. Thus the study of (reciprocal) linear representations of A is equivalent to the study of left (right) A-modules. For instance, if the operation of a group G on M is trivial: (IX = X ((IE G, X EM), the corresponding representation of Gin M assigns the identity mapping 1 M to every (IE G. Furthermore, if M = K, this repre- sentation is called the unit representation of G (over K). Let p, p' be linear representations of A in K-modules M, M', respectively. Then an A-homomorphism M-+M' is precisely a K- homomorphism f: M -> M' satisfying the con- ditionfop(a)=p'(a)of(aEA); this is some- times called a homomorphism from p to p'. In particular, an A-isomorphism is a K- isomorphism f: M -+ M' satisfying the con- dition fo p(a)oF 1 = p'(a) (aEA); in this case we say that p and p' are similar (isomorphic or equivalent) and write p ;;;'p'. Let M be the representation module of a linear representation p of an algebra A. If p is injective, p and the corresponding M are said to be faithful. For example, the linear represen- tation associated with the left A-module A is faithful; this is called the (left) regular represen- tation of A. If Mis tsimple as an A-module, p is said to be irreducible (or simple). A homo- morphism from an irreducible representation p to p must be an isomorphism or the zero 
1341 homomorphism (Schur's lemma). In particular, if K is an talgebraicany closed field and M is finite-dimensional, then such a homomor- phism is a scalar multiplication. A linear rep- resentation is said to be reducible if it is not irreducible. If Mis tsemisimple as an A- module, P is said to be completely reducible (or semisimple). If A is a semisimple ring, any linear representation of A is completely reduc- ible. The converse also holds (-'" 368 Rings G). The linear representations associated with a submodule and a quotient module of M as an A-module are caned a subrepresentation and a quotient representation, respectively. The linear representation associated with the direct sum M 1 + ... + M, of the representation modules M 1, ..., M, of linear representations PI' ..., P, is written PI + .,. + P, and caned the direct sum of representations. If P is never similar to the direct sum of two nonzero linear representa- tions, then P is said to be indecomposable; this means that Mis tindecomposable as an A- -module. For linear representations P, p' of a group G in M, M', we define the linear representation p@ p' in M@ M' by (p @ p')(g) = p(g)@ p'(g) (g E G); this is caned the tensor product of repre- sentations p and p'. D. Matrix Representations Let K n be the K-module consisting of an n- tuples (i) of elements in a commutative ring K. 0'K(K n ) is identified with the K-algebra Mn(K) of an n x n matrices (AU) over K:(Aij)() = (Lj1 Aij)' Thus a linear representation of A in K n , i.e., a homomorphism A--+Mn(K), is caned a matrix representation of A over K, and n is caned its degree. A matrix representation of a group Gover K of degree n is a homo- morphism G--+GL(n,K), where GL(n,K) is the group of an n x n invertible matrices. If (el' ..., en) is a tbasis of a K -module M, then by the K-isomorphism Kn--+M given by the as- signment (i)--+ L71 eii' we have a bijective correspondence between the matrix represen- tations of A of degree n and the linear repre- sentations of A in M, and the corresponding representations are similar. Explicitly, the linear representation p corresponding to a matrix representation a--+(Aij(a)) is given by n p(a)e j = I e;Aij(a), aEA. i=l Hence giving the finite-dimensional1inear representations over a field K is equivalent to giving the matrix representations over K. Let T, T' be matrix representations of degree n, n. Then a homomorphism from T to T' is an n' x n matrix P satisfying PT(a) = T'(a)P (aEA). 362E Representations Therefore T and T' are similar if and only if n = n' and PT(a)P -I = T'(a) (aE A) for some n x n invertible matrix P. For a representation of a group G, it suffices that this equation is satisfied by an aEG. We always assume that K is a field. Then a K-module is a linear space over K. A linear representation p over K of a K-algebra A is said to be of degree n if its representation module M is of dimension n over K. Suppose that a sequence {a} =M o cM1 c... cM,=M of A-submodules of M is given. We take a basis (e1' ..., en) of Mover K such that (el, ... ,em,) forms a basis of M i over K (t  i  r). Then the matrix representation corresponding to p relative to the basis (e I, . .. , en) has the form Tll (a) T12 (a) Tl,(a) T 22 (a) T 2 ,(a) aT(a)= a Trr (a) where, if we put ni=dimM;/M i _ 1 =mi-m i - I , 1;j(a) is an n i x n j matrix and 1;j(a) = a for i > j. The residue classes of emi+ 1 +1, ... , em, form a basis of the quotient space M;/MI-l over K, and the matrix representation corresponding to the linear representation Pi associated with M;/M i _ 1 relative to this basis is given by 1;;. The sequence {M;} is a tcomposition series if and only if each Pi (hence 1;i) is irreducible. In this case, PI' ... ,p, are uniq uely determined by p up to their order and similarity (Jordan- Holder theorem). An irreducible representation p' similar to some Pi is called an irreducible component of p. The number p>a of Pi similar to p' is caned the multiplicity of p' as an irre- ducible component of p. We also say that p contains p' p times or p' appears p times as an irreducible component of p. The representa- tion p is completely reducible if and only if it is similar to the direct sum of its irreducible components (admitting repetition). In this case, p is similar to the matrix representation Tll (a) a a a Trr ( a) E. Coefficients and Characters of Linear Representations We consider the linear representations of an algebra over a field K. A right (left) A-module M is regarded as a linear space over K. In its dual space M*, we introduce the structure of a left (right) A-module using the inner product (, )asfonows:(x,ax*)=(xa,x*)«x,x*a) 
362 F Representations = <ax,x*»), where aEA, xEM, x*EM*. If pis the representation associated with M, the rep- resentation associated with M* is called the transposed representation (dual representation or adjoint representation) of p, and is denoted by 'p. The linear mapping 'p(a) is the ttrans- posed mapping of p(a). If M is finite- dimensional over K, we have (M*)* = M as an A-module. For a linear representation p of a group G, the mapping g-+'p(g)-I (gEG) is called the cootragredient representation of p. The reciprocal linear representation associated with the right A-module A is called the right regular representation of A, and its transposed representation (i.e., the representation as- sociated with the left A-module A*) is called the coregnlar representation of A. For any finite-dimensional semisimple algebra and group ring of a finite group, the regular repre- sentation and the coregular representation are similar ( 29 Associative Algebras H). Let p be a linear representation of A over K and M be its representation module. For any xEM, x*EM*, we define a tlinear form Px.x,E A* on A by px.x.(a) = <ax, x*) (aE A). This is called the coefficient of p relative to x, x* and is determined by its values at generators of A as a linear space. In particular, a coefficient of a linear representation p of a group G can be regarded as a function on G taking values in K. For a fixed x* E M* the assignmen t X-+ Px.x' gives an A-homomorphism M-+A*, where A* is considered as a left A-module. Therefore any nonzero coefficient Px.x' of an irreducible representation p generates an A- submodule of A* isomorphic to M. In other words, any irreducible representation of A is similar to some subrepresentation of the co- regular representation of A. In particular, any irreducible representation of a finite- dimensional semisimple algebra or a finite group is an irreducible component of the regular representation. Let A be the subs pace of A* generated by all coefficients Px.x' (xEM, x* E M*) for a given linear representation p. Then p "" p' implies A = A,. If PI' "', p, are irreducible representations of A such that Pi and Pj are not similar unless i = j, the sum Al + ... + A, in A * is direct. In particular, for a semisimple algebra A, let the Pi (I :( i:( r) be the irreducible representations associated with the minimal left ideals of the tsimple compo- nents Ai of A. Then any irreducible represen- tation is similar to one and only one of PI' '" , p" and A* can be decomposed into the direct sum of Al' "', A,. In addition, each Ai is canonically identified with At. We shall treat finite-dimensional representa- tions exclusively. Let (e l ,..., en) be a basis of the representation module M of p over K, and let a-+ T(a) = (/'ij(a)) be the matrix representa- 1342 tion that corresponds to p with respect to this basis. Then Aij=Pe,.ei (1 :(i,j:(n), where (ef, "', e) is the dual basis. If K is algebrai- cally closed and P is irreducible (or more gen- erally, tabsolutely irreducible), then {I'iJ is linearly independent; therefore we have dim A = n 2 (G. Frobenius and I. Schur). We take a matrix representation T corresponding to P and put Xp(a) = tr T(a) (aE A). Then X p is a function on A that is uniquely determined by P and belongs to A; X p is called the character of p. For a linear representation p of a group G, the character of p can be regarded as a func- tion on G. Moreover, it can be viewed as a function on the set of all tconjugate classes of G. The character of p is equal to the sum of the characters of the irreducible components of p taken with their multiplicities. The character of an irreducible representation is called an irredncible character (or simple character). If K is of characteristic 0, then p "" p' is equivalent to X p = X p " and the different irreducible charac- ters are linearly independent. The character of an absolutely irreducible representation ( Section F) is called an absolntely irredncible character. If we consider absolutely irreducible characters only, the statement holds irrespec- tive of the characteristic of K. The sum of all absolutely irreducible charac- ters of A is called the rednced character (or reduced trace) of A. The direct sum of all ab- solutely irreducible representations of A is called the rednced representation of A, and its character is equal to the reduced character. The determinant of the reduced representation is called the rednced norm of A. F. Scalar Extension of Linear Representations Let K, L be commutative rings with unity element, and fix a homomorphism (J: K -+ L. We denote by M" the scalar extension (J*(M) = M @KL of a K-module M relative to (J:XA @ fl=X @ I," fl (XE M; AEK, flE L)(  277 Modules L). For an algebra A over K, the scalar extension A" of the K-module A has the natural structure of an algebra over L:(a @ A)(b @ fl) = ab @ I'fl (a, bE A; A,flEL). For a group G, we can regard (K[G])" = L[G]:g@ ;,=gA (gEG, AEL). If M is a left A-module, then M" has the natural structure of a left A "-module; (a @ I,)(X @ fl) = ax @ A/l (aEA,xEM;I',flEL). For the linear represen- tation p associated with M, the linear repre- sentation p" over L associated with M" is called the scalar extension of p relative to (J:p"(a@ l)=p(a)@ I L . Let (e l , ...,e n ) be a basis of Mover K. If the matrix representation a-+(Aij(a)) corresponds to M relative to this basis, then the matrix representation corre- 
1343 sponding to M G relative to the basis (e 1 @ 1, "', en @ 1) over L is given by a @ 1--+(.A. u (at). A linear representation over L is said to be realizable in K if it is similar to the scalar extension pG of some linear representation p over K. In particular, if u: K --+ L is an isomorphism, pG is caned the conjugate representation of p relative to u. The conjugate representation relative to the automorphism u:.A.--+X (complex conjugation) of the complex number field is caned the complex conjugate representation. If m is an ideal of K and u:K--+K/m (tresidue class ring) is the canonical homomorphism, then the construction of pG from p is caned the reduction modulo m ( Section I). If p is a prime ideal of K and u: K --+ Kp (tlocal ring) is the canonical homomorphism, then the con- struction of pG from p is caned the localization relative to p. If K is an integral domain and p = K - {O}, then Kp is the tfield of quotients of K. We can also consider the "completion of representation" with respect to p. Let K be a field, L a field extension, and u: K --+ L the canonical injection. Then for a linear representation p of A and its represen- tation module M, the scalar extensions pG, M G are written pL, M L , respectively. In view of M cM L , AcA L , tffK(M)ctffL(M L ) by the natural injections, we can regard pL as an extension of the mapping p. We shan consider finite- dimensional representations exclusively. For linear representations PI, P2 over K, PI  P2 is equivalent to pf  pf. An irreducible repre- sentation P over K is said to be absolutely irreducible if its scalar extension pL to any field extension L is irreducible; an equivalent con- dition is that the scalar extension pI<. to the talgebraic closure K is irreducible. Another equivalent condition is that every endomor- phism of the representation module M of p must be a scalar multiplication. If every irre- ducible representation of A over K is absolutely irreducible, K is caned a splitting field for A. For a group G, if the field K is a splitting field for the group ring K[G], then K is caned a splitting field for G. Let A be finite- dimensional over K. If K is a splitting field for A, any irreducible representation of A L is realizable in K for any field extension L of K. For an arbitrary field K, the scalar extension pL of an irreducible representation p to a tseparable algebraic extension L of K is com- pletely reducible. For simplicity, we assume that K is tperfect and L = 1(. Then the multi- plicities of an irreducible components of pL are the same; this multiplicity is caned the Schur index of p. The set S(K) of talgebra classes over K, each of which is represented by a (central) simple component of the group algebra K [G] of 362G Representations some finite group G, is a subgroup of the tBrauer group B(K) of K, known as the Schur subgroup of B(K). Recent research has clarified considerably the structure of this group [19]. G. Linear Representations of Finite Groups Let G be a finite group of order g. The linear representation of Gover K is equivalent to the linear representation of the group ring K[G], concerning which we have already stated the general facts. If K is the ring Z of rational integers, a linear representation over K is sometimes caned an integral representation. We assume that K is a field. If the character- istic of K is zero or more generany not a divi- sor of g, every linear representation of Gover K is completely reducible (H. Maschke). Such a representation is caned an ordinary represen- tation. If 9 is divisible by the characteristic of K, we have a modular representation ( Sec- tion I). The exponent of G is the smanest positive integer n satisfying an = 1 for every element aE G. A field containing an the nth roots of unity is a splitting field for G (R. Brauer, 1945). Consequently, for such a field K, any scalar extension of an irreducible representation over K is irreducible, and any irreducible represen- tation over any field extension of K is realizable in K. We fix a splitting field K for G and as- sume that K is of characteristic 0; for example, we can assume K = C. The number of nonsimilar irreducible repre- sentations of G is equal to the number of conjugate classes in G. Each irreducible repre- sentation appears as an irreducible component of the regular representation with multiplicity equal to the degree. In addition, each degree is a divisor of the order 9 of G. Let p be a linear representation of a subgroup H of G and M be its representation module. Then the linear representation ofG associated with the K[G} module K[G] @K[HIM is caned the induced representation and is denoted by pG. If the matrix representation T corresponds to p, then using the partition of G into the co sets G = a 1 H U ... U a,H we can write the matrix representation corresponding to pG as a--,> [ T(a:aal) T(a:aa,) ] , T (a,-I aaI ) T (a,-Iaa,) where we define T(b)=O for bf/:H. The induced representation from a representation of degree 1 of a subgroup is caned a monomial represen- tation. To such a representation corresponds a matrix representation T such that T(a) has exactly one nonzero entry in each row and column for every aE G. For the trivial sub- 
362H Representations group H = {e}, we obtain the regular represen- tation of G. In general, for an irreducible representation (J of G and an irreducible repre- sentation p of a subgroup H, the multiplicity of (J in pG coincides with that of p in the res- triction (JH of (J to H (the Frobenins theorem). The following orthogonality relations hold for irreducible characters X and IjJ of G: I x(a)ljJ(a-I)= { g, x=ljJ, aEG 0, x#-ljJ, I x(a)x(b- I ) = { g/ga, C(a)=C(b), x 0, C(a) #- C(b). In the second formula, X ranges over all the irreducible characters of G, C(a) denotes the conjugate class of G containing a, and ga is the number of elements in C(a). H. Linear Representations of Symmetric Gronps All irreducible representations of a tsymmetric group is. over the field Q of rational numbers are absolutely irreducible. Hence the represen- tation theory of is. over a field of character- istic zero reduces to that over Q. Since the group algebra A =Q[iSnJ is semisimple, to obtain an irreducible representation of iS n it is sufficient to find a tprimitive idempotent (i.e., an idempotent that is not the sum of two orthogonal nonzero idempotents) of A. Such an idempotent can be obtained in the follow- ing way. As in Fig. 1, we draw a diagram T consisting of n sq uares arranged in rows of decreasing lengths, the left ends of which are arranged in a single column. Such a diagram T is called a Y onng diagram; if it has k rows of lengths fl ?: f2 ?: . .. ?: fk > 0, fl + f2 + . .. + fk = n, then it is written T= T(fh ...Jk)' We put the numerals 1 to n in any order into the n squares of T= T(f1 J2' ... ,{k), as in Fig. 1, for example. We then denote by (J any permutation of is. that preserves each row and construct an element of A : s = L (J. Similarly, we denote by T any permutation of iS n that preserves each column and set t=L(sgnT)T. If we set u=t.s = L:t T . (J, then u is a primitive idempotent of A except for a numerical factor. This implies that u yields an irreducible representation of is n . The element u of A is called the Yonng symmetrizer associated with T. 11 '-- - 1 2 3 4 5 6 7181 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24  Fig. 1 Young diagram. n = 26; II = 8,12 = 6, 13 = 6, f4 =4, Is =2. 1344 If we put the numerals 1 to n into the n squares of T in a different order, we obtain another symmetrizer u' associated with T. However, these two irreducible representations associated with u and u' are similar. Hence there corresponds to T a fixed class of irre- ducible representations of is n , i.e., a fixed irre- ducible character of is.. Moreover, any two different Young diagrams yield different irreducible characters, and any irreducible character is obtained by a suitable Young diagram. Thus there exists a one-to-one corre- spondence between the Young diagrams and the irreducible characters of is.. The method of determining the character associated with a given diagram was found by Schur and H. Weyl ( 60 Classical Groups). I. Modular Representations of Finite Groups Let G be a finite group of order g, and let K be a splitting field of G of characteristic p #- O. If p is a divisor of g, we have the case of modular representation, in which the situation is quite different from the case of ordinary representa- tion. The theory of modular representations of a finite group was developed mainly by Brauer after 1935. The elements of G whose orders are prime to p are called p-regular. Let k be the number of p-regular classes of G, i.e., conjugate classes of G containing the p-regular elements. Then there exist exactly k nonsimilar absolutely irreducible modular representations F I , F2' "', Fk' The number of nonsimilar indecom- posable components of the regular representa- tion R of G is also equal to k, and we denote these representations by U h U 2 , ..., Uk' We can number them in such a way that F K ap- pears in UK as both its top and bottom compo- nent. If the degree of F K is fK and that of UK is UK' then UK appears fK times in Rand F K ap- pears UK times in R. The multiplicities Cd of FA in UK are called the Cartan invariants of G. Take an algebraic number field Q that is a splitting field of G. Let p be a prime ideal in Q dividing p, and let 0 be the domain of tp- integers of Q. Then the residue class field o/p is a finite field of characteristic p and a splitting field of G. Hence we can assume that o/p = K, where K is the field considered at the begin- ning of this section. Let Zh Z2' ... ,Z. be the nonsimilar irreducible representations of G in Q. We can assume that all the coefficients of Zi are contained in o. Replacing every coeffi- cient in Zi by its residue class modp, we obtain a modular representation Zi' The modular representations Zj, ..., Z. thus obtained may be reducible. The multiplicities d iK of F K in Zi are called the decomposition numbers of G. 
1345 They are related to the Cartan invariants by the fundamental relations n C.. = L dj.d i .. i=l The determinant 1e..1 of degree k is a power of p. We set g=pe g ', (p,g')= 1. Then we may assume that Q contains a primitive g'th root of unity c5(EO). Since (p,g')= 1, the residue class (5(E K) of c5 is a primitive g'th root of unity. Let M be a modular representation of G. The characteristic roots of M(a) for a p-regular element a are powers (5v of (5. We replace each (5v by c5 V and obtain an element (a) ofQ as the sum of these c5 v . In this manner we define a complex-valued function  on the set of p- regular elements of G. We caU  the modular character (or Brauer character) of M. Two modular representations have the same irre- ducible components if and only if their modu- lar characters coincide. Denoting by q>. the modular character of F. and by '1. that of U., we have the foUowing orthogonality relations for the modular characters: " -I { g, K=A, ,;:q>.(a)'1.(a )= 0, K*A, L q>.(a)'1.(b -1)= { g/ga, C(a)= C(b), . 0, C(a)*C(b). In the first sum, a ranges over an p-regular elements of G. We say that F. and F. belong to the same block if there exists a sequence of indices K, ex, p,..., y, A such that c.. *0, c. p *0,..., c Y ' *0. This is obviouSly an equivalence relation, and Ft. F2' ..., Fk are classified into a finite number, say s, of blocks B1' B2' ... ,Bs. If F. belongs to a block Bt' we say by a stretch of language that the corresponding U. also belongs to Bt. An the irreducible components of Zi belong to the same block since c.. *0 if di.*O and d j . *0. If the irreducible components of Zi belong to Bt, we say that Zi belongs to Bt. Let Xt be the number of Zi belonging to Bt and Yt the number of F. belonging to Bt' Then Xt Yt. If Xi is the ordinary character of Zi' then Xi can be considered as the modular character of Zi' If we denote the degree of Zi by Zj, then gaXj(a)/zi for aE G is an algebraic integer and hence belongs to o. Now Zi and Zj belong to the same block if and only if gaXi(a)/zi == gaXj(a)/zj (mod p) for aU p-regular elements a ofG. If p. is the highest power of p that divides an the degrees Zj of Zj belonging to B" then it is also the highest power of p dividing an the degrees J. of F. belonging to Bt' We caU d = e-ex the defect of Bt; obviously O<d<e. If Zi belongs to a block of defect d, then the power of p dividing Zj is pe-d+h, (hjO). A block of 362 I Representa tions defect ° contains exactly one ordinary repre- sentation Zj, hence also exactly one modular representation F. (Xt= Yt= 1). Moreover, we have Zi= F. = U.. It fonows that aU the de- grees Zi of Zi belonging to a block of defect 1 are exactly divisible by pe-1; the converse is also true. Zi belongs to a block of defect ° if and only if Xi(a)=O for any element a of G whose order is divisible by p. Let D be any p-Sylow subgroup of the tcen- tralizer CG(a) of an element a of G, and let (D: 1) = pd. Then d is caUed the defect of the class C(a), and D is caUed a defect group of C(a). The number of blocks of defect e is equal to the number of p-regular classes of defect e. Let Bt be a block of defect d. Then there exists a p-regular class of defect d containing an element a such that gaXi(a)/zi ,*0 (modp) for any Zi in Bt. The defect group D of C(a) is caUed the defect group of B" and D is uniquelY determined up to conjugacy in G. The number of blocks of G with defect group D is equal to the number of blocks of the tnormalizer NG(D) with defect group D. An arbitrary element x of G can be written uniquely as a product x=sr=rs, where s, caUed the p-factor of x, is an element whose order is a power of p, and r is a p-regular element. We say that two elements of G belong to the same section if and only if their p-factors are conjugate in G. This is an equivalence relation. ObviouSly, each section is the union of conjugate classes of G. If the p-factor of x is not conjugate to any element of the defect group D of Bt, then Xi(X)=O for aU Zi in Bt' Let q>1, q>, "', q>(S) be the absolutely irre- ducible modular characters of CG(s), and let Xl be the absolutely irreducible ordinary charac- ters of CG(s). Since x;(sr) = elx;(r) = elL d;"q>;(r), rE CG(s), we have Xi(sr) = L rilx;(sr) = L dt"q>;(r). I " The dt" are caned the generalized decompo- sition numbers of G. If the order of s is pi, then the dt" are algebraic integers of the field of the pith roots of unity. Let s be conjugate to an element of D. There corresponds to Bt a union lit of blocks of CG(s), and if (J * p, then lit and lip contain no irreducible modular represen- tations in common. We have dt" =0 for any Zi in Bt (i.e., q>; f/: lit)' Brauer's original proof of this result was considerably complicated; simpler proofs were given independently by K. Iizuka and H. Nagao. From these relations we get the foUowing refinement of the orthogon- ality relations for group characters. If Zi and Zj belong to different blocks of G, then LaeSXj(a)xj(a-I)=O, where a ranges over aU 
362 J Representations the elements belonging to a fixed section S of G. If elements a and h of G belong to different sections, then Lx,Xi(a)xi(h-I)=O, where Xi ranges over all the characters of G belonging to a fixed block B,. J. Projective Representations of Finite Gronps Let V be a finite-dimensional linear space over a field K, and let P( V) be the tprojective space associated with V ( 343 Projective Geom- etry). The set of all projective transforma- tions of P(V) forms the group PGL(V), which can be identified with the quotient group GL(V)/K*lv. Here K*=K-{O} and K*l v is the set of all scalar multiples of the identity transformation I v of V and is the center of GL(V). A homomorphism G-.PGL(V) is called a projective representation of G in Vor simply a projective representation of Gover K. Two projective representations (p, V) and (p', V') of G are said to be similar if there exists an isomorphism qJ:PGL(V)-.PGL(V') induced by a suitable isomorphism V -. V' such that qJop(a)oqJ-' =p'(a) (aEG). Let VI #{O} be a subspace of V. We can assume that P( Vd c P( V). If (p, V) is a projective representation of G such that each p(a) (aE G) leaves P(vd invariant, we get a projective representation (p" Vd by restricting the p(a) to P(vd. In this case (PI, Vd is called a snbrepresentation of p. A projective representation is said to be irre- ducible if there exists no proper su brepresen- tation of p. A mapping O":G-.GL(V) is called a section for (p, V) if n(O"(a)) = p(a) for each aEG, where n is the natural projection of GL(V) onto PGL(V). Any section 0" defines a mapping f: G x G-. K* satisfying O"(a)O"(b) =f(a, b)O"(ab) (a,bEG). The set {f(a,b)}a.bEG is called the factor set of p with respect to 0". The mapping f is a t2-cocycle of G with values in K*. The 2- cohomology class CpE H2(G, K*) of f is deter- mined by p and is independent of the choice of sections for p. A projective representation p has a section 0" which is a linear representation of G in V if and only if c p = I. If G is a finite group, for any CE H 2 (G, K*) there exists an irreducible projective representation p of G over K which belongs to c, i.e., cp=c. If p and p' are similar, then C p = C p" The tensor product p @ p' of two projective representations p and p' can be defined as in the case of linear repre- sentations, and we have Cp@I"=C p ' c p " If K is algebraically closed, then H2(G, K*) is deter- mined by the characteristic of K. When K is the complex field C, the group H2(G, C*) = ill/(G) is called the multiplier of G. If 9J1(G) = 1, then G is called a closed gronp, and any pro- jective representation of G is induced by a 1346 linear representation of G. In general, if p is a projective representation of Gover C, then the order of c p is a divisor of the degree of p (di- mension of V). Moreover, if p is irreducible, then both the degree of p and the square of the order of c p are divisors of the order of G. K. Yamazaki, among others, studied the projec- tive representations of finite groups in detail. K. Integral Representations Every complex matrix representation of G is equivalent to a matrix representation in the ring of algebraic integers. If an algebraic num- ber field K is specified, every K[G]-module V contains G-invariant R-tlattices (briefly, G- lattices), where R is the ring of integers in K. A G-lattice L is characterized as an R[G]- module, which is finitely generated and ttor- sion free (hence tprojective) as an R-module. It provides an integral representation of G as an automorphism group of the R-projective module L. R[G]-modules Land M need not be iso- morphic even when the K[G]-modules K @ L and K @ M are isomorphic. The set of G- lattices in a fixed K[G]-module V is divided into a finite number of R [G]-isomorphism classes (Jordan-Zassenhans theorem). Let p be a prime ideal of Rand Rp be the localization of R at p. The study of Rp-representations is intimately related with modular representation theory. For any R[G]-module L there is an associated family of Rp[G]-modules Lp = Rp @ L, where p ranges over all primes of R. G-lattices Land M in a K[G]-module V are said to be of the same genns if Lp  M p for every p. The number of genera of G-Iattices in V is given by I1 p1q h p (g = order G), where hp denotes the number of Rp[G]-equivalence classes of Rp[G]-lattices in V. When V is ab- solutely irreducible, the number of R[G]- equivalence classes in a genus equals the (ideal) tclass number of K (J. M. Maranda and S. Takahashi). The tKrull-Schmidt theorem, asserting the uniqueness of a direct sum decomposition into indecomposable R[G]-modules, holds if R is a complete discrete valuation ring or if R is a discrete valuation ring and K is a splitting field of G. The condition for the finiteness of the number of non isomorphic indecomposable G- lattices is known. In particular, for R = Z it reduces to the requirement that the Sylow p- subgroup of G be cyclic of order p or p2 for every pig. Regarding projective Z[G]-modules  200 Homological Algebra G. The isomorphism problem, i.e., the question of whether the isomorphism Z[GJZ[HJ of integral group algebras implies the isomor- 
1347 phism G  H of groups, has been answered affirmatively for certain special cases such as tmeta-Abelian groups. R[G] is an R-torder in K[G], and in this context, a considerable portion of the integral representation theory has been extended to more general orders in separable algebras [14- 16]. References [1] B. L. van der Waerden, Gruppen von linearen Transformationen, Springer, 1935 (Chelsea, 1948). [2] B. L. van der Waerden, Algebra II, Springer, fifth edition, 1967. [3] H. Boerner, Darstellungen von Gruppen, Springer, 1955. [4] N. Bourbaki, Elements de mathematique, Algebre, ch. 8, ActualiHs Sci. Ind., 1261a, Hermann, 1958. [5] C. W. Curtis and I. Reiner, Representation theory of finite groups and associative alge- bras Interscience, 1962. [6] R. Brauer and C. Nesbitt, On the modular characters of groups, Ann. Math., (2) 42 (1941), 556-590. [7] R. Brauer, Zur Darstellungstheorie der Gruppen endIicher Ordnung I, II, Math Z., 63 (1956),406-444; 72 (1959-1960), 25-46. [8] M. Osima, Notes of blocks of group char- acters, Math. J. Okayama Univ., 4 (1955), 175- 188. [9] I. Schur, Uber die Darstellung der endli- chen Gruppen durch gebrochene lineare Sub- stitutionen, J. Reine Angew. Math., 127 (1904), 20- 50. [10] I. Schur, Untersuchungen iiber die Dars- tellung der endlichen Gruppen durch gebro- chene lineare Substitutionen, J. Reine Angew. Math., 132 (1907), 85-137. [11] I. Schur, Uber die Darstellung der sym- metrischen und der aIternierenden Gruppe durch gebrochene Iineare Substitutionen, J. Reine Angew. Math., 139 (1911),155-250. [12] R. Brauer, Representations of finite groups, Lectures on modern mathematics, T. L. Saaty (ed.), Wiley, 1963, vol. 1, 133-175. [13] W. Feit, Characters of finite groups, Benjamin, 1967. [14] I. Reiner, A survey of integral representa- tion theory, Bull. Amer. Math. Soc., 76 (1970), 159-227. [15] R. Swan, K-theory of finite groups and orders, Lecture notes in math. 149, Springer, 1970. [16] K. W. Roggenkamp (with V. Huber- Dyson), Lattices over orders I, II, Lecture notes in math. 115, 142, Springer, 1970. 363 Ref. Riemann, Georg Friedrich Bernhard [17] L. Dornhoff, Group representation theory, Dekker, A, 1971; B, 1972. [18] J.-P. Serre, Representations Iineaires des groupes finis, second edition, Hermann, 1972. [19] T. Yamada, The Schur subgroup of the Brauer group, Lecture notes in math. 397, Springer, 1974. 363 (XXI.40) Riemann, Georg Friedrich Bernhard Georg Friedrich Bernhard Riemann (Septem- ber 17, 1826-July 20,1866) was born the son of a minister in Breselenz, Hanover, Germany. He attended the universities of Gottingen and Berlin. In 1851 he received his doctorate at the University of Gottingen and in 1854 became a lecturer there. In 1857 he rose to assistant professor, and in 1859 succeeded P. G. L. tDirichlet as full professor. In 1862 he con- tracted tuberculosis, and he died at age 40. Despite his short life, his contributions en- compassed all aspects of mathematics. His doctoral thesis (1851) stated the basic theorem on tconformal mapping and became the foundation for the geometric theory of functions. In his paper presented for the posi- tion of lecturer (1854), he defined the tRie- mann integral and gave the conditions for convergence of trigonometric series. In his inaugural lecture in the same year, he dis- cussed the foundations of geometry, intro- duced n-dimensional manifolds, formulated the concept of tRiemannian manifolds, and defined their curvature. In his paper of 1857 on tAbel- ian functions, he systematized the theory of tAbelian integrals and Abelian functions. In his paper of 1858 on the distribution of prime numbers he considered the tRiemann zeta function 'as a function of a complex variable and stated tRiemann's hypothesis concerning the distribution of its zeros. It remains for modem mathematics to investigate whether this hypothesis is correct. In his later years, influenced by W. Weber, Riemann became interested in theoretical physics. He gave lec- tures on the uses of partial differential equa- tions in physics that were edited and published by H. Weber. References [1] G. F. B. Riemann, GesammeIte mathema- tische Werke und wissenschaftIicher Nachlass, R. Dedekind and H. Weber, (eds.), Teubner, second edition, 1892 (Nachtriige, M. Noether and W. Wirtinger (eds.), 1902) (Dover, 1953). 
364 A Riemannian Manifolds [2] F. Klein, Vor1esungen tiber die Entwick- lung der Mathematik im 19. Jahrhundert I, Springer, 1926 (Chelsea, 1956). 364 (VII.3) Riemannian Manifolds A. Riemannian Metrics Let M be a tdifferentiable manifold of class C' (1 :O;;r:O;;w), and 9 be a tRiemannian metric of class C'-1 on M. Then (M,g) or simply Mis ca1led a Riemannian manifold (or Riemannian space) of class C' ( 105 Differentiable Mani- folds). The metric 9 is a tcovariant tensor field of order 2 and of class C'-I; it is called the fnndamental tensor of M. Using the value gp of 9 at each point pEM, a positive definite inner product gp(X, Y), X, YET p , is introduced on the ttangent vector space Tp to M at p, and hence Tp can be considered as a tvector space over R with inner product that can be identi- fied with the Euclidean space E" of dimension n = dim M. Utilizing the properties of the space En, we can introduce various notions on Tp and M. (For example, given a tangent vector LE Tp, we define the length IILII = IILllg.p of L to be the quantity gp(L,L)I/2. A normal vector at a point p of a submanifold N of M is we1l defined as an element of the orthogonal com- plement of the subs pace Tp(N) of Tp(M) with respect to gp; a differential form of degree 1 is identified with a tangent vector field.) A neces- sary and sufficient condition for a differenti- able manifold M of class C to have a Rie- mannian metric is that M be tparacompact. A Euclidean space E" has a Riemannian metric expressed by 'L7=odx i @dx i in terms of an orthogonal coordinate system (Xi). We assume that M is connected and of class Coo. A curve x: [a, b] -+ M is called piecewise smooth or of class D oo if x is continuous and there exists a partition of [a, b] into finite subintervals [t i - 1 , t l ] such that the restrictions x I [t H , tJ are timmersions of class COO. The length II x II of such a curve x is defined to be Si Ilx'(t)11 dt, where x'(t) is the tangent vector of x defined for almost a1l values of t. As in a Euclidean space, the length II x II is independent of the choice of parameter t, and the concepts of tcanonical parameter and orientation of x Can be defined ( 111 Differential Geometry of Curves and Surfaces). A function d:M x M -+ [0, 00) is dcfincd so that the value d(p, q), p, q EM, is the infimum of the lengths of curves of class D oo joining p and q. The function d is a tdistance function on M, and the topology of M defined by d coincides with the original 1348 topology of M. There exists an essentia1ly unique structure of a Riemannian manifold on (real or complex) te1liptic or thyperbolic space ( 285 Non-Euclidean Geometry), and d is the distance function of these spaces. If there exists an immersion cp of a differenti- able manifold N in a Riemannian manifold (M,g), then a Riemannian metric cp*g is de- fined on N by the tpullback process (II LII"'q= Ildcp(L)ll g ). (For example, a submanifold and a tcovering manifold of M have Riemannian manifold structures induced by the natural mappings ( 365 Riemannian Submanifolds).) If M = E 3 and N is a 2-dimensional submani- fold of M, then cp*g is the tfirst fundamental form of N. Assume further that cp is a diffeo- morphism and N has a Riemannian metric h. If cp*g=h, then (N,h) is said to be isometric to (M, g), and cp is called an isometry. The set I(M) of all isometries (isometric transforma- tions) of M onto M is a group. A necessary and sufficient condition for a mapping 1jJ: N-+M to be an isometry is that dN(p,q)= dM(IjJ(p), ljJ(q)), p, qE N. In particular, I(E") is the tcongruent transformation group. If a differentiable manifold M is the product manifold of Riemannian manifolds (M 1 , gd and (M 2 , g2), then (M, n! gl + n!g2) is called the Riemannian product of M 1 and M2' where n., 0(= 1,2, are projections from M to M.. Let F be the ttangent n-frame bundle over M and B = Bg(M) be the subset of F consisting of all orthonormal frames with respect to g. Then B is an O(n)-subbundle of F of class Coo, ca1led the tangent orthogonal n-frame bundle (or orthogonal frame bundle). In this way we get a one-to-one correspondence between the set of all O(n)-subbundles of F and the set of all Riemannian metrics of M. B. Riemannian Connections There exists a unique taffine connection in the orthogonal frame bundle B whose ttorsion tensor is zero. This connection is called the Riemannian connection (or Levi-Civita connec- tion;  80 Connections K). Let V denote the tcovariant differential operator defined by this connection ( 80 Connections, 417 Tensor Calculus). (For a vector field X, the covariant differential operator V x acts on any tensor field T defined on a submanifold having X as a tangent vector field.) The covariant differen- tial V 9 of the fundamental tensor 9 vanishes identically. The tconnection form of the Rie- mannian connection is expressed by n 2 differ- entiall-forms (wJh,,;u,,;" on B, and we have wJ + wi = O. Let (W I )1 ";i";n be the tcanonical I-forms on B. Then (W{)\";i";/";. together with (Wi) give rise to an absolute parallelism on B (that is, they are linearly independent at 
1349 every point). Let (Oi) and (O{) be the corre- sponding set of differential1-forms on the orthogonal frame bundle BN over another Riemannian manifold N with dim N = dim M. If there exists an isometry t/J: M -+ N, then the differential dt/J is a diffeomorphism from B = BM to B N , and we get (dt/J)*(Oi) = Wi, (dt/J)*(O{) = wI. Conversely, if there exists a diffeomorphism 'I:':BM-+B N satisfying 'I:'*(Oi) = Wi, 'I:'*(O{) = wI and Mis torientable, then there exists an isometry t/J: M -+ N such that dt/J = 'I:' hOlds on a connected component Bo of B. Moreover, t/J is uniquely determined if we choose one Bo. In this way the problem of the existence of an isometry from M may be reduced to one of the existence of a diffeomorphism from B preserv- ing absolute paranelism (as wen as the order of the basis (Wi, wi)). According to the general theory of affine connections, the Riemannian connection on M determines a tCartan connection uniquely with En=I(En)/O(n) as tfiber, which is caned the Euclidean connection. As a consequence, every tangent vector space Tp(M) is regarded as a Euclidean space E;, and for a given curve x: [a, b] -+ M of class D OO and for t E [a, b] there exists an isometry Ix.t: E(t) -+ E(a) satisfying the fonowing three conditions (we denote Ix.b by I J: (1) If x is a composite of two curves y and z, then Ix = Iy' Iz. (2) Differentiability: If x is of class Coo at to, then t-+lx.t is of class Coo at to. (3) Ix depends on the orientation of x but not on the choice of its parameter. The develop- ment x of x is the curve in E(a) <fefined by x(t) = Ix.t<x(t)), and we get IIxll = 114 (Ix is some- times caned the development along x.) Utilizing the concept of development, the theory of curves in En can be used to study curves on M ( 111 Differential Geometry of Curves and Surfaces). For example, if x is a segment, then x or x( [a, b]) is caned the geodesic arc ( 80 Connections L); the tFrenet formula is auto- maticany formulated and proved. The rotation part I: of Ix (the composite of Ix and the paral- lel displacement of E; translating Ix(x(b)) to x(a)) is regarded as an isomorphism of the inner product space TX(b) to Tx(a)' I: is ex- tended to an isomorphism of the ttensor alge- bra ff(Tx(b») to ff(Tx(a»)' which is denoted by the same symbol I: and caned the parallel displacement or parallel translation along x. Given a tensor field K on M, we have 'ilx'(a)K = [dI:'t(K(x(t)))/dt]t=a. In particular, a neces- sary and sufficient condition for 'ilK =0 is that I:(K(x(b))) = K(x(a)) for any x, in which case K is said to be parallel. C. Exponential Mapping (178 Geodesics) A curve x on M or the image of x is caned a geodesic if any subarc x I [a, b] of x is a geo- 364D Riemannian Manifolds desic arc. Let N(S) be the normal bundle of a submanifold S of M, that is, the differentiable vector bundle over S consisting of an normal vectors at an points of S. Then S is contained in N(S) as the set of zero vectors at an points of S. There exist a neighborhood U of S in N(S) and a mapping Exps: U -+M of class Coo with the fonowing property: There exists a geodesic arcx with the initial tangent vec- tor LE U, length IIxll = liLli, and final point ExPs(L). Let Us be the largest U with this property. Then Exps: Us-+M is determined uniquely by S. The mapping Exps is caned the exponential mapping on S. If the rank of the Jacobian matrix of Exps is less than n at LE Us, then L or ExPs(L) is caned the focal point of S on the geodesic s-+ExPs(sL) (Os, sL E Us), If S is compact, then S has an open neighborhood V s in N(S) satisfying the fonow- ing three conditions: (i) Vs c Us; (ii) II L II = d(ExPs(L), S) for L E Vs, where the right-hand member expresses the infimum of the distance between the point EXPs(L) and points of S; (iii) the restriction Exps I V s is an embedding. The image Exps(Vs) is the tubular neighborhood of S. In the special case where S consists of only one point p, N ( {p} ) coincides with the tangent vector space Tp(M), and the focal point of p is caned the conjugate point of p, given as the zero point of the tJacobi field ( 178 Geo- desics, 279 Morse Theory). In this case, Vs is denoted by Vp. If Tp is identified with Rn (or En) by means of an orthonormal basis of Tp, then (Exp p ) -1 defined on Expp(V p ) is a coordi- nate mapping, caned the normal coordinate mapping. Furthermore, Expp(V p ) contains a neighborhood W p of p such that there exists a unique geodesic arcxjoining any two points q and r of w;, with IIxll =d(q,r) and contained in W p ' w;, is caned a convex neighborhood of p. D. Curvature The set of differentiall-forms (Wi, wi), by means of which absolute paranelism is given in the orthogonal frame bundle B of M, satisfies the tstructure equation dw i = - r;j wj 1\ w j , dw] = - r;kW 1\ wJ + QJ, and (QJ) is caned the curvature form of the Riemannian connection of M. This form is expressed by a tensor field R ( 80 Connections; 41 7 Tensor Ca1cul us ) of type (1, 3) on M, caned the curvature tensor; if RJkl are the components of R with respect to an orthonormal frame bEB of the tangent vector space Tp of M, then QJ=(1/2)r;RJkIWk 1\ Wi at b. Let (X, Y) be an orthonormal basis of a 2-dimensional subspace P of Tp. Then the inner product Kp(P) of X and R(X, Y) Y is determined by P independently of the choice of the basis (X, Y), where the i-component of 
364E Riemannian Manifolds R(X, Y)Z with respect to the basis b of Tp is given by LR)klZjXkyl. Kp(P) is the tGauss- ian curvature of the surface Exp p ( V p n P) and is called the sectional curvatnre (or Riemannian curvatnre) of P. The curvature tensor R is uniquely determined by the function K p(P) of p and P. If dim M ? 3 and if at every point p of M, Kp(P) has a constant value Mp indepen- dent of the choice of P, then M p is a constant independent of the choice of p (F. Schur). If K p(P) is constant, then M is called a space of constant cnrvatnre. If V'R = 0, then M is called a locally symmetric space ( 412 Symmetric Riemannian Spaces and Real Forms; 413 Symmetric Spaces). In a local sense, Riemann- ian metrics of these spaces are uniquely deter- mined by the curvature tensor R up to a con- stant factor. If M is of constant curvature K, complete, and simply connected, then M is isometric to En, the sphere (which is the uni- versal covering Riemannian manifold of a real telliptic space), or a real thyperbolic space according as K is 0, positive, or negative. The compact spaces of positive constant curvature, that is, the Riemannian manifolds having the sphere as the universal covering Riemannian manifold, were completely classfied by 1. A. Wolf [1]. A complete, simply connected, and locally symmetric space is a tsymmetric Riemannian space. The Ricci tensor (R i ) is defined by Rij= - Lk Rtk' Let Q be the qua- dratic form on Tp given by (RiJ Then the value Q(L) for a unit vector LE Tp is the mean of K p(P) for all sections P (2-dimensional subspaces of T p ) containing L and is called the Ricci cnrvatnre (or mean cnrvatnre) of the direction L at p. The mean R of Q(L) for all the unit vectors L at p is called the scalar cnrvatnre at p ( 417 Tensor Calculus). Q(L) and R are expressed by Q(L)= Ligp(R(X i , L)L, X;) and R = Li Q(X;), up to positive constant factors, in terms of an orthonormal basis (X;) of Tp. If the Ricci tensor of M is a scalar mul- tiple of the fundamental tensor, then M is called an Einstein space. (When dim M ? 3, this scalar is constant.) If M is a tKahler manifold and P is restricted to a complex plane (in- variant under the almost complex structure), then K(P) is called the hoIomorphic sectional cnrvatnre. A Kahler manifold M of constant holomorphic sectional curvature is locally isometric to a complex Euclidean space, ellip- tic space, or hyperbolic space. The properties of the sectional curvature and the Ricci curvature are closely related to the behavior of geodesics of Riemannian mani- folds, and these properties reflect those of the topological structures of the manifolds ( 178 Geodesics). The compact simply connected homogeneous Riemannian manifolds of strict- ly positive sectional curvature have been 1350 classified [2-4]. Related to algebraic geome- try, as the solution of the Frankel conjecture, the following holds: If a compact Kahler mani- fold has strictly positive sectional curvature, then it is biholomorphic to the complex pro- jective space [5,6J ( 232 Kahler Manifolds). Furthermore, curvature tensors are related to tcharacteristic classes. For example, we have the Ganss-Bonnet formnla: If M is an even- dimensional compact and oriented Riemann- ian manifold, the integral of anK(n)w on M is equal to the tEuler-Poincare characteristic, where an = n!/(2 n n n / 2 (n/2)!), W is the volume element of M, and K(n) is defined as follows: For a positive even num- ber s, K(s) is a real-valued function of the s- dimensional subspaces P of the tangent vector spaces Tp of M, which is given by K(s)(P)=bsLl>i, i,l>j,. j,<Rp(Xi"Xi,)Xj"Xj,> ...<Rp(X i , "Xi)XjH'X j ) in terms of an orthonormal basis (XI' "', Xs) of P, where b s =( _1)'/2 /(2s/ 2 s!), L is summation over all pairs of s-tuples satisfying {i 1, '" , is}, {jj,''',}s}c{1,2,...,n},l>i, i,isthesignof (i 1, .. . ,is), < , > is the inner product in Tp with respect to gp, Rp is the value of the tensor R at p, and Rp(X i , X)X k is as already defined at the beginning of this section. In particular, K(2) = K. If K(s) of a compact and orientable M is constant for a certain s, then the kth tPontryagin class of M (with real coefficients) vanishes for all k? s/2. E. HoIonomy Gronps Let p be a fixed point of M, and let Qp be the set of all closed oriented curves of class D W with initial and final points p and with para- meters neglected. The set H = {U XEQp}, called the holonomy gronp of M, is a subgroup of I(T p ) (I;. is identified with En) independent of the choice of p ( 80 Connections), and x--+ Ix is a homomorphism from Qp to H. The restric- tion Ho of this homomorphism to all closed curves homotopic to zerO is called the re- stricted holonomy gronp. The rotation part h of H, called the homogeneons holonomy gronp, is a subgroup of the orthogonal group O(n) of Tp. The rotation part ho of Ho, called the re- stricted homogeneous holonomy group, is a connected component of h and a tcompact Lie group. The tLie algebra of ho is spanned by UARx(b)(X, Y))lx:[a,bJ--+M is of class D"', x(a) = p, and X, Y E TX(b)}, where Rx(b)(X, Y) is the endomorphism of the linear space 7;.;(b) defined in Section D. 
1351 If M = En, then H = {e}, where e is the iden- tityelement. If h = {e} (ho = {e}), then Mis called flat (locally flat) ( 80 Connections E). Local flatness is equivalent to M being locally isometric to En. If M is complete and H (re- garded as a transformation group of En) has a fixed point, then M is isometric to En. Any finite rotation group h is the homogeneous holonomy group of some locally flat and com- pact Riemannian manifold. With respect to the linear group h of Tp, we get a unique decomposition I;, = V(o) EB V(1) EB '" EB V(r) of mutually orthogonal sub- spaces, where V(o)(dim V(O);::' 0) consists of all h-invariant vectors and V(i)' i = 1,..., r, are irreducible h-invariant subspaces. If h or ho is irreducible (reducible) on Tp, then M is called irreducible (reducible). If M is complete and simply connected (hence h = h o ), then M is the Riemannian product of closed submanifolds M(.), 0(=0, 1, ..., r, satisfying V(.) = T(M(.»). This decomposition M = II M(.) is determined uniquely by M and called the de Rham de- composition of M [7]. In this case h is the direct product of closed subgroups h(.), where every h(.) acts on V(P)' f3 #- 0( as the identity, and can be regarded as the homogeneous holonomy group of M(.). If ho is irreducible and M is not locally symmetric, then ho acts ttransitively on the unit sphere of Tp. The classification of possible candidates for such ho has been made [8,9]. For example, if n is even and ho is the tunitary group U(n/2), then h acts transitively on the unit sphere. A necessary and sufficient condition for h to be contained in U(n/2) is that M have a tcom- plex structure and the structure of a Kahler manifold. The group h acts naturally on the ttensor algebra 5"(T p ) of I;,. If a tensor field A on M is parallel, then Ap is invariant under h. Con- versely, if AoE5"(Tp) is invariant under h, there exists a unique parallel tensor field A satisfying Ap=Ao. The orthogonal frame bundle B is treducible to the h-bundle. F. Transformation Groups The group I(M) consisting of all isometries of M with the tcompact-open topology is a tLie transformation group. The isotropy subgroup at any point is compact. In particular, if M is compact, so is I(M). The differential dcp of cpEI(M) is a transformation of the orthogonal frame bundle B. If b o is a fixed point of B, then the mapping f3 defined by cpdcp(bo) embeds I(M) as a closed sub manifold of B, and the differentiable structure of I(M) is thus deter- mined. If f3 is surjective, it follows from the structure equation that M is of constant curva- 364F Riemannian Manifolds ture and equals En, a real thyperbolic space, or a real telliptic space (or a sphere). A necessary and sufficient condition for the image of f3 to be a subbundle of B is that I(M) be transitive. If the image of f3 contains the h-bundle, then M is a symmetric space. If M is compact and I(M) is transitive, then the image of f3 is con- tained in the h-bundle ( 191 G-Structures). If I(M) is transitive on M, then M is complete and is the thomogeneous space of I(M). Con- versely, a homogeneous space M = G/K of a Lie group G by a compact subgroup K has a Riemannian metric invariant under G. In general, an element of I(M) preserves quan- tities uniquely determined by the Riemannian metric g, such as the Riemannian connection, its curvature, the set of all geodesics, etc. Fur- thermore, any element of I(M) commutes with V and the tLaplace-Beltrami operator. If Mis compact and oriented, then the connected component Io(M) of I(M) preserves any thar- monic differential form. If M is complete and simply connected, then Io(M) is clearly decom- posed into a direct product by the de Rham decomposition of M. An element of the Lie algebra of I(M) is regarded as a vector field X on M, called the infinitesimal motion, which satisfies the equation Lxg=O; that is, i+ Vij=O, where Lx denotes tLie derivation and the i are tcovariant components of X with respect to a natural frame (a/aX i ), i = 1, ...,n (417 Tensor Calculus). This equa- tion is called Killing's differential equation, and a solution X of this equation is called a Killing vector field. The set of all Killing vec- tor fields is a Lie algebra of finite dimension (  dim B). If M is complete, then this Lie alge- bra coincides with that of I(M). If M is com- pact and the Ricci tensor is negative definite, then I(M) is discrete. If, furthermore, the sec- tional curvature is nonpositive, then an iso- metry of M homotopic to the identity trans- formation is the identity transformation itself. It is known that dimI(M)n(n+ 1)/2 if dim M = n, and the maximum dimension is attained only when M is a space of constant curvature. For Riemannian manifolds with large I(M), extensive work on the structures of M and I(M) has been done by I. P. Egorov, S. Ishihara, N. H. Kuiper, L. N. Mann, Y. Muto, T. Nagano, M. Obata, H. Wakakuwa, K. Yano, and others [10, 11]. The fixed point set of a family of isometries has interesting differential geometric properties [10]. For example, let G be any subset of I(M) and F the set of points of M which are left fixed by all the elements of G. Then each con- nected component of F is a closed ttotally geodesic submanifold of M. If M is compact andfis an isometry of M, then Af=X(F), where Af denotes the tLefschetz number and 
364G Riemannian Manifolds X(F) the tEuler characteristic of the fixed point set F of f As for the existence of fixed points of an isometry, the foIlowing are known: Let f be an isometry of a compact, orientable Rie- mannian manifold M with positive sectional curvature. If dim M is even and f is orientation preserving, or if dim M is odd and f is orienta- tion reversing, then f has a fixed point. In the case of nonpositive curvature, the foIlowing is basic: Every compact group of isometries of a complete, simply connected Riemannian mani- fold with non positive sectional curvature has a fixed point (E. Cartan). If a compact, orien- table Riemannian manifold admits a fixed- point-free I-parameter group of isometries, then its tPontryagin numbers vanish. On a Riemannian manifold M, a transform- ation of M which preserves the Riemannian connection, or equivalently which commutes with covariant differentiation V is caIled an affine transformation. Let A(M) denote the group of all affine transformations of M. A transformation preserving the set of all geo- desics is called a projective transformation. Let P(M) denote the group of all projective trans- formations of M. A transformation preserv- ing the angle between tangent vectors is caIled a conformal transformation. Let C(M) de- note the group of all conformal transforma- tions. They are Lie transformation groups with respect to suitable topologies. Clearly, I(M)cA(M)cP(M), I(M)cC(M)( 191 G- Structures). Ao(M), the connected component of A(M), is decomposed into a direct product according to the de Rham decomposition of M when M is complete and simply connected (1. Hano). If M is complete and irreducible, then A(M) = I(M) except when M is a I-dimensional Euclid- ean space. If M is complete and its restricted homogeneous holonomy group ho leaves no nonzero vectors, then Ao(M)=Io(M). If Mis compact, then Ao(M)=Io(M) always. If M is complete and has a paraIlel Ricci tensor, then the connected component Po(M) = Ao(M), unless M is a space of positive con- stant sectional curvature (n > 2) (Nagano, N. Tanaka, Y. Tashiro). If M is compact, simply connected, and has constant scalar curvature, then Po(M) = 10 (M), unless M is a sphere (n > 2) (K. Yamauchi). Similarly to the case of P(M), it is known that if M is complete and has a paraIlel Ricci tensor, then the connected component Co(M) = 10 (M), unless M is a sphere (n>2) (Nagano). A conformal transformation remains con- formal if the Riemannian metric 9 is changed conformaIly, namely, to e 2f g, f being any smooth function on M. A subset of C(M,g) is caIled essential if it cannot be reduced to a subset of I(M, 9) for any metric 9 conformal to 1352 g. When M is compact, C(M) or Co(M) is essential if and only if it is not compact. If Co(M) is essential, then M is conformaIly diffeomorphic to a sphere or a Euclidean space (n>2) [12-15]. When M is compact and has constant scalar curvature and Co(M) -=I- 10 (M), sufficient conditions for M to be isometric to a sphere have been obtained by S. I. Goldberg and S. Kobayashi, C. C. Hsiung, S. Ishihara, A. Lichnerowicz, Obata, S. Tanno, Tashiro, K. Yano, and others. For example, if Co(M) is essential, then M is a sphere [14]. In general, however, there are compact Riemannian mani- folds with constant scalar curvature for which Co(M)-=l-Io(M) (N. Ejiri). G. Spheres as Riemannian Manifolds A Euclidean n-sphere sn (n;::'2) has the prop- erties of a Riemannian manifold. It is a space of positive constant sectional curvature 1 jr 2 (r = radius) with respect to the natural Riemann- ian metric as a hypersurface of the Euclidean (n + 1 )-space En+!. A sphere is characterized by the existence of solutions of certain differential equations on a Riemannian manifold. On a unit sphere sn in En+l, the eigenvalues of the tLaplace-Beltrami operator  on smooth functions are given by 0<..1. 1 < .., < Ak< ..., Ak =k(n+k-1). It is known that eigenfunctions f corresponding to Ab f= Ad, are the restric- tions to sn of harmonic homogeneous poly- nomial functions F of degree k on En+!. On a compact Riemannian manifold M, if the Ricci curvature of M is not less than that of sn, then the first eigenvalue Al of  on M satisfies Al ;::. AI = n [16]. Conversely, under the same as- sumption on the Ricci curvature, if XI = n, then M is a sphere (Obata). On the other hand, if 9 is the standard metric on sn, then f = nfis equivalent to the system of differen- tial equations '\jV,j + fgji=O. (Ed A complete Riemannian manifold M (n;::' 2) admits a nontrivial solution of (E d if and only if M is a sphere (Obata, Tashiro). In general, the restriction f to sn of a harmonic homogene- ous polynomial of degree k satisfies f = k(n + k - l)f as weIl as a certain system (Ek) of differential equations of degree k + 1 involv- ing the Riemannian metric. For example, Vk'\jV,j +2g ji Vd +gikf +gkjV,j =0. (E 2 ) If a complete Riemannian manifold M admits a nontrivial solution of (E k ) for some integer k? 2, then M is locally isometric to a sphere (Obata, Tanno, S. GaIlot [17]). The gradient of a solution of (E 1 ) is an infinitesimal conformal 
1353 transformation and that of (E 2 ) is an infini- tesimal projective transformation. As the Kahler or quaternion Kahler version of (E 2 ), there is a system of differential equa- tions characterizing the complex projective space or the quaternwn projective space as a Kahler manifold (Obata, Tanno, D. E. Blair, Y. Maeda). On a sphere, a Riemannian metric which is conformal to the standard metric and has the same scalar curvature as the standard one is always standard, namely, it has a positive constant sectional curvature [14]. H. Scalar Curvature On a 2-dimensional Riemannian manifold M, the sectional curvature, the Ricci curvature, and the scalar curvature aH coincide with the tGaussian curvature, which is a function on M. If M is compact, by the tGauss-Bonnet theorem the Gaussian curvature K of M must satisfy the foHowing sign condition in terms of the tEuler characteristic X(M): if X(M) > 0, then K is positive somewhere; if X(M) = 0, then K changes sign unless it is identica11y zero; if X(M) <0, then K is negative somewhere. This sign condition is also sufficient for a given function K to be the Gaussian curvature of some metric on M. More precisely, starting with a Riemannian metric with constant Gaus- sian curvature, one can say that a smooth function K is the Gaussian curvature of some metric conforma11y equivalent to the original metric if and only if K satisfies the foregoing sign condition [18]. H. Yamabe [19] announced that on every compact Riemannian manifold (M, g) of di- mension n;::' 3, there exists a strictly positive function u such that the Riemannian metric g=u 4 /(n-2)g has constant scalar curvature. N. S. Trudinger, however, pointed out that his original proof contains a gap in some cases. The problem reduces to the foHowing non- linear partial differential equation on a com- pact manifold M: - n-1 Rd n + 2 )1(n-2) =4-u + Ru n-2 ' where R is the scalar curvature of g and R a constant which should be the scalar curva- ture ofg=u4!(n-2)g ( 183 Global Analysis). Nevertheless, Yamabe's original proof can be pushed to cover a large class of metrics with SMRdM <0. Furthermore, it has since been solved far a wider class: namely, if M is not conforma11y flat and n;::' 6, or if it is conform- a11y flat and its fundamental group is finite, 364 I Riemannian Manifolds then the problem has been solved affirmatively [20]. On the other hand, any smooth function on a compact manifold M of dimension n;::' 3 that is negative somewhere is the scalar curvature of some metric on M. In particular, on a com- pact manifold (n;::. 3) there always exists a Riemannian metric with constant negative scalar curvature [18]. Any smooth function can be the scalar curvature if and only if M admits a metric of constant positive scalar curvature. The foregoing results show that there is no topological obstruction to the existence of metrics with negative scalar curva- ture of a compact manifold of dimension n;::' 3. For positive scalar curvature, there is a topological obstruction. A compact tspin structure (spin manifold) having non vanishing tA-genus cannot carry a Riemannian metric of positive scalar curvature. The existence of such a manifold has been shown. If a compact, simply connected manifold M of dimension n;::' 5 is not a spin manifold, then there exists a Riemannian metric of positive scalar curva- ture. Furthermore, if M is a spin manifold and spin kobordant to M' with positive scalar curvature, then M carries a Riemannian metric of positive scalar curvature [22]. A torus T" cannot carry a metric of positive scalar curva- ture. In fact, any metric of nonnegative scalar curvature on Tn must be flat [22]. Let Kg and Rg denote the sectional curva- ture and the scalar curvature, respectively, of a Riemannian metric g. Then the fo11owing are known for a compact manifold M of dimen- sion ;::. 3: If M carries a metric g with Kg';;;; 0, then it carries no metric with R > O. If M carries a metric g with Kg < 0, then it carries no metric with R;::' O. If M carries metrics g 1, g2 with Kg,';;;;O and R g2 ;::'0, then both metrics are flat [22]. If the assignment of the scalar curvature to a Riemannian metric is viewed as a mapping of a space of Riemannian metrics into a space of functions on a manifold M, then 10ca11y it is almost always surjective when M is compact (A. E. Fischer and J. E. Marsden, O. Koba- yashi,1. Lafontaine). I. Ricci Curvature and Einstein Metrics In this paragraph the manifolds under con- sideration are assumed to be of dimension n;::' 3. The Ricci tensor (R i ) is a symmetric tensor field of type (0,2) on a Riemannian manifold. The problem of finding a Riemann- ian metric g which realizes a given Ricci tensOr reduces to the one of solving a system of nonlinear second-order partial differential equations for g. The Bianchi identity ( 417 
364 Ref. Riemannian Manifolds Tensor Calculus) k' 1 9 J k Rd--VR=O Jl 2 I must be satisfied. There is a symmetric (0,2)- tensor on R" which cannot be the Ricci tensor for any Riemannian metric in a neighborhood ofOER". However, if a Coo (or CW) symmetric tensor field (R i ) of type (0,2) is invertible at a point p, then in a neighborhood of p there exists a C"" (or CW) Riemannian metric 9 such that (R i ) is the Ricci tensor of 9 [24]. The positivity of the Ricci curvature on a Riemannian manifold puts rather strong re- strictions on the topology of the manifold ( 178 Geodesics). However, nonnegative Ricci curvature and positive Ricci curvature are not too far from each other. If, on a com- plete Riemannian manifold M with nonnega- tive Ricci curvature, there is a point at which the Ricci curvature is positive, then there exists a complete metric on M with positive Ricci curvature [25- 27]. If a Riemannian manifold (M, g) is an Ein- stein space, then 9 is called an Einstein metric on the manifold M. Let v g denote the volume element determined by g. When M is compact, o$Z denotes the space of Riemannian metrics on M with total volume 1. The integral of the scalar curvature (g)= SMRgv g is a functional on J{, The critical points of  are Einstein metrics (D. Hilbert). Let AI ( C A) denote the space of metrics with constant scalar curva- ture. Then if  is restricted to AI , then the tnullity and tcoindex at the critical point are finite [28,29]. An Einstein metric is always real analytic in some coordinate system. In particular, if two simply connected Einstein spaces have neigh- borhoods on which metrics are isometric, then they are isometric [30]. Though S" with stan- dard Riemannian metric is a typical example of an Einstein space, S4k+3 (k? I) carries an Einstein metric that is not standard [31]. References [IJ 1. A. Wolf, Spaces of constant curvature, Publish or Perish, 1977. [2J M. Berger, Les varietes riemanniennes homogenes simplement connexes a courbure strictment positive, Ann. Scuola Norm. Sup. Pisa, (3) 15 (1961),179-246. [3J N. R. Wallach, Compact homogeneous Riemannian manifolds with strictly positive curvature, Ann. Math., (2) 96 (1972), 277-295. [4J L. Berard-Bergery, Les variHes rieman- niennes homogenes simplement connexes de 1354 dimension impaire a courbure strictment positive, 1. Math. Pures Appl., (9) 55 (1976), 47-67. [5J S. Mori, Projective manifolds with ample tangent bundles, Ann. Math., (2) 110 (1979), 593-606. [6J Y. T. Siu and S. T. Yau, Compact Kiihler manifolds of positive bisectional curvature, Inventiones Math., 59 (1980),189-204. [7] G. de Rham, Sur la reductibilite d'un espace de Riemann, Comment. Math. Helv., 26 (1952), 328-344. [8J M. Berger, Sur les groupes d'holonomie homo gene des varietes a connexion affine et des varietes riemanniennes, Bull. Soc. Math. France, 83 (1955), 279-330. [9] 1. Simons, On the transitivity of holonomy systems, Ann. Math., (2) 76 (1962),213-234. [IOJ S. Kobayashi, Transformation groups in differential geometry, Erg. Math., 70, 1972. [11] K. Yano, The theory of Lie derivatives and its applications, North-Holland, 1957. [12J 1. Lelong-Ferrand, Transformations conformes et quasi conformes des varietes riemanniennes compactes (demonstration de la conjecture de Lichnerowicz), Acad. Roy. Belg., CI. Sc. Mem. Colloq. 39, no. 5 (1971), 1-44. [13J A. 1. Ledger and M. Obata, Compact Riemannian manifolds with essential group of conformorphisms, Trans. Amer. Math. Soc., 150 (1970),645-651. [14J M. Obata, The conjectures on conformal transformations of Riemannian manifolds, J. Differential Geometry, 6 (1971), 247-258. [15J D. V. Alekseeviskii, Groups of conformal transformations of Riemann spaces, Math. USSR-Sb., 18 (1972), 285-301. [16] A. Lichnerowicz, Geometrie des groupes de transformations, Dunod, 1958. [17J S. Gallot, Equations differentielles caracteristiques de la sphere, Ann. Sci. Ecole Norm. Sup., 12 (1979), 235-267. [18J 1. L. Kazdan and F. W. Warner, Exis- tence and conformal deformation of metrics with prescribed Gaussian and scalar curva- ture, Ann. Math., (2) 101 (1975), 317-331. [19J H. Yamabe, On a deformation of Riemannian structures on compact manifolds, Osaka Math. 1.,12 (1960), 21-37. [20J T. Aubin, Equations differentielles non lineaires et probleme de Yamabe concernant la courbure scalaire, J. Math. Pure Appl., 55 (1976),269-296. [21J N. Hitchin, Harmonic spinors, Advances in Math., 14 (1974),1-55. [22J M. Gromov and L. B. Lawson, The classi- fIcation of simply connected manifold of posi- tive scalar curvature, Ann. Math., (2) 111 (1980),423-434. [23J R. Schoen and S. T. Yau, The structure 
1355 of manifolds with positive scalar curvature, Manuscripta Math., 28 (1979), 159 183. [24] D. DeTurck, Existence of metrics with prescribed Ricci curvature: Local theory, In- ventiones Math., 65 (1981),179-207. [25] T. Aubin, Metriques riemanniennes et courbure,1. Differential Geometry, 4 (1970), 383-424. [26] P. Ehrlich, Metric deformations of curva- ture I: local convex deformations, Geometriae Dedicata, 5 (1976),1-23. [27] 1. P. Bourguignon, Ricci curvature and Einstein metrics, Lecture notes in math. 838, Springer, 1981,42-63. [28] Y. Muto, On Einstein metrics, J. Dif- ferential Geometry, 9 (1974), 521-530. [29] N. Koiso, On the second derivative of the total scalar curvature, Osaka 1. Math., 16 (1979), 413421. [30] D. M. DeTurck and 1. L. Kazdan, Some regularity theorems in Riemannian geometry, Ann. Sci. Ecole Norm. Sup., 14 (1981), 249- 260. [31] 1. E. D'Atri and W. Ziller, Naturally reductive metrics and Einstein metrics on compact Lie groups, Mem. Amer. Math. Soc., 18 (1979). See also references to 80 Connections, 105 Differentiable Manifolds, 109 Differential Geometry, 111 Differential Geometry of Curves and Surfaces, 178 Geodesics, 191 G- Structures, 365 Riemannian Submanifolds,417 Tensor Calculus. 365 (VII.13) Riemannian Submanifolds A. Introduction If an timmersion (or an tern bedding) f of a tRiemannian manifold (M, g) into a Riemann- ian manifold (M, g) satisfies the condition f*g = g, then f is called an isometric immersion (or embedding) and M is called a Riemannian submanifold of M. In this article, f(M) will be identified with M except where there is danger of confusion. Suppose dim M = n and dim M = n + p. Then the tbundle F(M) of orthonormal tangent frames of M, the bundle Fy(M) of orthonormal normal frames of M, and their tWhitney sum F(M) $ F.(M) are tprincipal fiber bundles over M with tstructure groups O(n), O(p), and O(n) x O(p), respectively. These are subbundles of the restriction to M of the bundle F(M) of orthonormal frames of M. The vector bundles associated with F(M), Fy(M), and F(M) $ Fv(M) are, respectively, the ttan- 365C Riemannian Submanifolds gent bundle T(M), the tnormal bundle v(M), and their Whitney sum T(M) $ v(M). B. General Results for Immersibility An n-dimensional real analytic Riemannian manifold can be locally isometrically embed- ded into any real analytic Riemannian mani- fold of dimension n(n + 1)/2 (M. Janet (1926), E. Cart an (1927)). The generalization to the Coo case is an open question even when the ambient space is Euclidean. An n-dimensional compact C' Riemannian manifold (3  r  (0) can be isometrically em- bedded into an (n(3n+ 11)/2)-dimensional Euclidean space (1. F. Nash (1956)). An n- dimensional noncompact C' Riemannian manifold (3  r  (0) can be isometrically em- bedded into a 2(2n + 1) (3n + 7)-dimensional Euclidean space (Nash (1956), R. E. Greene (1970)). Let M be an n-dimensional Riemannian manifold with tsectional curvature K M and M an (n + p)-dimensional Riemannian manifold with sectional curvature Kf.i. Then M cannot be isometrically immersed into M in the fol- lowing cases: (1) pn- 2 and KM <Kf.i (T. Otsuki (1954)); (2) pn-l, KMKf.iO, M is compact, and M is complete and simply connected (C. Tompkins (1939), S. S. Chern and N. H. Kuiper (1952), B. O'Neill (1960)); (3) pn-l, KMO, Kf.i is constant (O), M is compact, and M is complete and simply con- nected [2]. C. Fundamental Equations Let f:(M,g)-4(M,g) be an isometric immer- sion. Let V and V denote the tcovariant differ- entiations with respect to the tRiemannian connections of M and M, respectively. For vector fields X and Yon M, the tangential component ofV x Y is equal to V x Y. Put O"(X, Y)=VxY-VxY. (1 ) Then 0" is a v(M)-valued symmetric (0,2) tensor field on M, which is called the second funda- mental form of M (or off). For a normal vector  at XE M, put g(AX, Y) = g(O"(X, YH). Then A defines a symmetric linear transfor- mation on TAM), which is called the second fundamental form in the direction of. The eigenvalues of A are called the principal curva- tures in the direction of . The connection on v(M) induced from the Riemannian connec- tion of M is called the normal connection of M (or off). Let V.L denote the covariant differ- entiation with respect to the normal con- 
365 D Riemannian Snbmanifolds nection. For a tangent vector field X and a normal vector field  on M, the tangential (resp. normal) component of Vx is equal to - A;X (resp. V'}), that is to say, the relation Vx= -A;X +V'}( holds. (1) is called the Ganss formula, and (2) is called the Weingarten formnla. Let R, R, and R1. be the tcurvature tensors of V', V, and V'1., respectively. Then the tinte- grability condition for (1) and (2) implies R(X. Y)Z = R(X, Y)Z + Au(x.z) Y - Aq(y.z)X +(V'O")(Y, Z)-(WO")(X, Z) for vector fields X, Y, Z tangent to M, where V" denotes covariant differentiation with re- spect to the connection in T(M) EB v(M). (3) is called the equation of Gauss and Codazzi. More precisely, the tangential component of (3) is given by the equation of Gauss and the normal component of (3) is given by the eqna- tion of Codazzi. Similarly, for vector fields  and IJ normal to M, the relation g(R(X, Y),IJ)=g(R1.(X, Y),IJ) -g([A;, AJX, Y) holds, which is called the eqnation of Ricci. Formulas (1)-(4) are the fundamental equa- tions for the isometric immersion f: M --> M. As a particular case, suppose M is a tspace form of constant curvature c. Then the equa- tions of Gauss, Codazzi, and Ricci reduce respectively to R(X, Y)Z=c(g(Y,Z)X -g(X,Z)Y)+Aa(y.z)X - Au(x.z) Y, (3 c )( (V'O")(Y,Z)-(WO")(X,Z)=O, (301n g(R1.(X, Y),IJ)=g([A;,AJX, Y). (401 Conversely, let (M,g) be an n-dimensional simply connected Riemannian manifold, and suppose there is given a p-dimensional tRie- mannian vector bundle v(M) over M with curvature tensor R 1. and a v(M)-valued sym- metric (0,2) tensor field 0" on M. For a tcross section  of v(M), define A; by g(A;X, Y) = <O"(X, Y), 0, where < , > is the fiber metric of v(M). If they satisfy (301" (3 c )n, and (401, then M can be immersed isometrically into an (n + p)-dimensional complete and simply connected space form Mn+P(c) of curvature c in such a way that v(M) is the normal bundle and 0" is the second fundamental form. Moreover, such an immersion is unique up to an tisometry of Mn+p(c). Let (e A )\ «A«n+p be a local cross section of F(M) such that its restriction to M gives a local cross section of F(M) EB Fv(M), and let (w A ) be its dual. Then f*w'=O for n+ l:(ct:( 1356 (2) n + p. Let (W)1 «A.B«n+p and (cI>)1 «A.B«n+p be the tconnection form and the tcurva- ture form of M with respect to (e A)' and put w =f*w. Then (w}) 1 «i.j«n is the con- nection form of M with respect to (e i )1 «i«n' (wn, «i«n<a«n+p gives the second fundamental form, that. is, O"(e i , e) = I wt(e)e a . (1 ') Put wt = L h'Uwj. Then (h'U) is the matrix repre- senting the symmetric linear transformation Ac, with respect to (eJ, that is, (3) Ae, e i = I hfje j . (2') Moreover, (w$)n+1 «a.p«n+p is the connec- tion form of the normal connection with re- spect to (ea)n+1 «a«n+p' Let (t1J)) 1 «i.j«n and (t1J P )n+1 «a,p«n+p be the curvature forms of (wj) and (w$), respectively. Then the equations of Gauss, Codazzi, and Ricci are given respec- tively by n+p f*cI>j= t1J)+ I w; /\ wI (I :( i,j:( n), (3')( a=n+l (4) n+p f*cI>f=dwt+ I W/\WiA (I :(i:(n<ct:(n+p), A=1 n r *,T.,a - t1Ja + " w a /\ w k . "'p - P L.. k P k=' (3')n (n+ I :(ct,/3:(n+p). (4') D. Basic Notions Let M be a Riemannian submanifold of M. A point x E M is called a geodesic point if 0" = 0 at x. If every point of M is a geodesic point, then M is called a totally geodesic snbmanifold of M. M is a totally geodesic submanifold of M if and only if every geodesic of M is a geodesic of M. A mapping l):M-->v(M) defined by x--> ; L7=1 O"(e i , eJ is independent of the choice of an orthonormal basis (e i ). l) is called the mean cnrvature vector and 11l)11 is called the mean curvature. M is called a minimal snbmanifold of M ifl)=O ( 275 Minimal Submanifolds). A point XE M is called an nmbilical point if 0" = 9 @ l) at x. x E M is an umbilical point if and only if A; is proportional to the identity transformation for all  E vAM). If every point of M is an umbilical point, then M is called a totally umbilical snbmanifold of M. A point x E M is called an isotropic point if 110"(X,X)II/IIXI1 2 does not depend on XE TAM). If every point of M is an isotropic point, then M is called an isotropic snbmani- fold of M. It is clear that an umbilical point is an isotropic point. J1(x) = dim n;ev,(M)ker A; is called the index of relative nullity at x E M. 
1357 E, Rigidity An isometric immersion J: M  M is said to be rigid if it is unique up to an isometry of M, that is, if f': M  M is another isometric im- mersion, then there exists an isometry cp of M such that f' = cp 0 f If J: M  M is rigid, then every isometry of M can be extended to an isometry of M. An isometric immersion J:MMn+l(c) of an n-dimensional Riemannian manifold into an (n+ I)-dimensional complete and simply connected space form is rigid in each of the following cases: (1) n = 2, c = 0, and M is compact and of posi- tive curvature (S. Cohn-Vossen (1929)). (2) The index of relative nullity is  n - 3 at each point (R. Beez (1876);  [8]). (3) n  5, c> 0, M is complete, and the index of relative nullity is  n - 2 at each point (D. Ferus (1970)). (4) n4, c#O, and the tscalar curvature of M is constant (#n(n -l)c) (C. Harle (1971)). A generalization of (2) for the case of higher codimension was obtained by C. Allendoerfer (1939). Various rigidity conditions have been studied by S. Dolbeault-Lemoine, R. Sack- steder, E. Kaneda and N. Tanaka, and others. F. Totally Geodesic and Totally Umbilical Submanifolds A totally geodesic submanifold of a space form is also a space form of the same curvature. Totally geodesic sub manifolds of compact tsymmetric spaces of rank 1 were completely classified by 1. A. Wolf (1963), and totally geodesic submanifolds of symmetric spaces of rank  2 were studied by Wolf and B. Y. Chen and T. Nagano [4]. Let J:M Mn+p(C) be a totally umbilical immersion of an n-dimensional Riemannian manifold M into an (n + p)-dimensional space form. Then M is a space form Mn(c) with cc, and J(M) is contained in a certain (n + 1)- dimensional totally geodesic submanifold Mn+1(C) of Mn+P(C). If c> 0, then J(M) is local- ly a hypersphere; if C=O, then J(M) is locally a hyperplane or a hypersphere; if c < 0, then f(M) is locally a geodesic sphere, a horo- sphere, or a parallel hypersurface of a totally geodesic hypersurface [2]. G. Minimal Submanifolds For general properties of minimal submani- folds  275 Minimal Submanifolds. There is no compact minimal submanifold in a simply connected Riemannian manifold with nonpositive sectional curvature (O'Nei!l 365H Riemannian Submanifolds (1960)). On the contrary, a sphere has plenty of compact minimal sub manifolds. For each positive integer s, an n-dimensional n sphere of curvature can be minimal- s(s+n-1) ly immersed into a { (2S + n _ 1) (s + n - 2)! 1 } _ s!(n-l)! dimensional unit sphere and the immersion is rigid if n = 2 or s  3 (E. Calabi (1967), M. do Carmo and N. Wallach (1971)). Among aH n-dimensional compact minimal sub manifolds of an (n + p)-dimensional unit sphere, the totaHy geodesic submanifold is isolated in the sense that it is characterized by each of the foHowing conditions: (1) sectional curvature > (T. Itoh 2(n + 1) (1978)), (2) Ricci curvature> n - 2 (N. Ejiri (1979)), n (3) scalar curvature> n(n -1) -- 2 (1. -lip Simons (1968)). H, Submanifolds of Constant Mean Curvature A manifold of constant mean curvature is a solution to a variational problem. In partic- ular, with respect to any volume-preserving variation of a domain D in a Euclidean space, the mean curvature of M = aD is constant if and only if the volume of M is critical. The interesting question "If the mean curva- ture of an isometric immersion1:MMn+!(c) of an n-dimensional compact Riemannian manifold M into an (n + 1 )-dimensional space form Mn+!(c) is constant, is M a sphere?" has not yet been completely solved, where Mn+!(c) denotes a Euclidean space, a hyperbolic space, or an open hemisphere according as c = 0, < 0, or > O. The answer is affirmative in the foHow- ing cases: (1) dimM = 2, and the tgenus of Mis zero (H. Hopf (1951), Chern (1955)). (2) J is an embedding (A. D. Alexandrov (1958);  [8]). These results remain true even if the as- sumption "the mean curvature is constant" is replaced by the weaker condition "the prin- cipal curvatures k 1 ...  k n satisfy a relation cp(k 1 , ..., k n )= 0 such that acplak j > 0." Unlike an open hemisphere, a sphere sn+! admits many compact hypersurfaces of con- stant mean curvature; among which totaHy umbilical hypersurfaces and the product of two spheres are the only ones with nonnega- tive sectional curvature (B. Smyth and K. Nomizu (1969)). A nonnegatively or nonpositively curved complete surface of nonzero constant mean curvature in a 3-dimensional space form M 3 (c) is either a sphere or a tClifford torus if c> o and is either a sphere or a right circular 
365 I Riemannian Snbmanifolds cylinder if c:::; ° (T. Klotz and R. Osserman (1966-1967), D. Hoffman (1973)). I. Isoparametric Hypersurfaces A hyper surface M of M is said to be isopara- metric if M is locally defined as the tlevel set of a function f on (an open set of) M with property df J\d11df11 2 =0 and df J\ d(lif) = 0. A hypersurface M of a complete and simply connected space form M n +' (c) is isoparametric if and only if M has locally constant principal curvatures (Cartan). If c :::;0, M has at most two distinct principal curvatures (Cartan). If c > 0, the number of distinct principal cur- vatures of Mis 1,2,3,4, or 6 (H. Mtinzner (1980)). If c = 0, then M is locally Sk X E n -\ and if c < 0, then M is locally En or Sk X Hn-k (Cartan). Isoparametric hypersurfaces of sn+1 having at most three distinct principal curva- tures were completely classified by Cartan. R. Takagi, T. Takahashi, and H. Ozeki and M. Takeuchi obtained several results for isopara- metric hypersurfaces of sn+1 with four or six distinct principal curvatures [7]. If a subgroup of the isometry group of Mn+1 (c) acts transitively on M, then Mis isoparametric. The converse is true if c:::; 0, or if c > ° and M has at most three distinct prin- cipal curvatures (Cartan), but not true in gen- eral (Ozeki and Takeuchi [7J). J. Isometric Immersions between Space Forms Let f: Mn(c)-,> Mn+P(C) be an isometric immer- sion of an n-dimensional space form into an (n + p)-dimensional space form. (I) If n= 2, p= 1, c>O, cc, M 2 (c) is complete, and M 3 (c) is complete and simply connected, then f is totally umbilical (H. Liebmann (1901);  [2J). (2) Ifn=2, p= 1, c<o, c<c, M 2 (c) is complete, and M 3 (C) is complete and simply connected, then f does not exist (D. Hilbert (1901);  [2J). (3) Ifn=2, p= 1, c=O<c, M 2 (0) is complete, and M 3 (C) is complete and simply connected, then there exist infinitely many f (L. Bianchi (1896);  [2J). (4) Ifn=2, p= I, c=O>c, M 2 (0) is complete, and M 3 (C) is complete and simply connected, then f(M 2 (0)) is either a horosphere or a set of points at a fixed distance from a geodesic (1. Volkovand S. Vladimirova, S. Sasaki;  [2J). (5) If n 3, p= 1, and c> C, then f is totally umbilical. (6) If p= 1, c=c=O, Mn(O), is complete, and M n + 1 (0) is complete and simply connected, 1358 then f is cylindrical (A. Pogorelov (1956), P. Hartman and L. Nirenberg (1959), and others). (7) If p :::;n-l, c=c> 0, and both Mn(c) and Mn+P(C) are complete, then f is totally geodesic (D. Ferus (1975)). (8) If p:::; n-l, c>c> 0, Mn(c) is complete, and Mn+P(C) is complete and simply connected, then f does not exist (1. Moore (1972)). K. Homogeneons Hypersurfaces Let M be an n-dimensional thomogeneous Riemannian manifold which is isometrically immersed into an (n+ I)-dimensional complete and simply connected space form Mn+1 (c). (1) If c = 0, then M is isometric to Sk X E n - k (S. Kobayashi (1958), Nagano (1960), Takahashi [9J). (2) If c > 0, then M is isometric to E 2 or else is given as an orbit of a subgroup of the isometry group of M n +1 (c) (W. Y. Hsiang, H. B. Lawson, Takagi;  [7J). (3) If c<o, then M is isometric to En, Sk X Hn-k, or a 3-dimensional group manifold Bm ,' },y,'ER} with the metric ds 2 = e - 2t dx 2 + e 2t d y2 + dt 2 (Takahashi (1971)). Each of the hypersurfaces above except E 2 in (2) and B in (3) is given as an orbit of a certain subgroup of the isometry group of M n +1 (c). L. Kiihler Snbmanifolds A tcomplex submanifold of a tKahler manifold is a Kahler manifold with respect to the in- duced Riemannian metric. A complex analytic and isometric immersion of a Kahler manifold (M,J, g) into a Kahler manifold (M,J,{j) is called a Kiihler immersion, and M is called a Kiihler snbmanifold of M. A Kahler submani- fold is a minimal submanifold. A compact Kahler submanifold M of a Kahler manifold M can never be homologous to 0, that is, there exists no submanifold M' of M such that M = aM'. If [MJEH*(M, Z) denotes the thomology class represented by a Kahler submanifold M of M, then vol(M):::;vol(M') holds for any submanifold M' E [MJ with equality if and only if M' is a Kahler submanifold (W. Wir- tinger (1936)). A Kahler manifold of constant tholomor- phic sectional curvature is called a complex space form. An n-dimensional complete and simply connected complex space form is either Pn(C), en, or D.. Every Kahler submanifold of a complex space form is rigid (Calabi [IOJ). Kahler immersions of complex space forms 
1359 into complex space forms were completely determined by Calabi [10] and by H. Naka- gawa and K. Ogiue (1976). c n (resp. Dn) is the only tHermitian sym- metric space that can be immersed in C m (resp. D"J as a Kahler submanifold (Nakagawa and Takagi [11]), and Kahler immersions of Her- mitian symmetric spaces into Pm(C) were precisely studied by Nakagawa, Y. Sakane, Takagi, Takeuchi, and others. More genera11y, Kahler immersions of homogeneous Kahler manifolds into Pm(C) were determined by Takeuchi (1978). Qn={[zaEPn+1(c)Izt=0} in P n + 1 (C) is the only Einstein-Kahler hypersurface of a complex space form that is not tota11y geodesic (B. Smyth (1967), S. S. Chern (1967)). The result remains true even if "Einstein" is re- placed by "para11el Ricci tensor" (Takahashi (1967)). Besides linear subspaces, Qn is the only Einstein-Kahler submanifold of Pm(C) that is a complete intersection (1. Hano (1975)). Integral theorems and pinching problems with respect to various curvatures for com- pact Kiihler submanifolds of Pm(C) have been studied by K. Ogiue, S. Tanno (1973), S. T. Yau (1975), and others [12]. For example, if the holomorphic sectional curvature of Pn+p(C) is 1, then each of the fo11owing is sufficient for an n-dimensional compact Kahler submani- fold to be tota11y geodesic: (1) holomorphic sectional curvature> 1/2 (A. Ros (1985)), (2) sectional curvature> 1/8 (A. Ros and L. Verstraelen (1984)), (3) Ricci curvature> n/2 [12], (4) embedded and scalar curvature> n 2 (1. H. Cheng (1981)). The index of relative nu11ity J1(x) of an n- dimensional complete Kahler submanifold M of Pm( C) satisfies Min xeM J1(x) = 0 or 2n (K. Abe (1973)). M. Totally Real Submanifolds An isometric immersion of a Riemannian manifold (M, g) into a Kahler manifold CfJ,J,ij) satisfying JT,,(M)cvx(M) at each point xEM is ca11ed a totally real immersion, and M is ca11ed a totally real submanifold of fJ. A tota11y geodesic submanifold Pn(R) in Pn(C), SI xS 1 in P 2 (C) and an immersion Pn(C) ->P n (n+2)(C) defined by [Zj] ->[zjzJ give typical examples of tota11y real submanifolds. N. Submanifolds with Planar Geodesics A surface in E 3 whose geodesics are a11 plane curves is (a part of) a plane or sphere. More 3650 Riemannian Submanifolds genera11y, letJ:M->Mm(c) be an isometric immersion of M into a complete and simply connected space form Mm(c). If the image of each geodesic of M is contained in a 2- dimensional tota11y geodesic submanifold of Mm(c), then J is either a tota11y geodesic im- mersion, a tota11y umbilical immersion or a minimal immersion of a compact symmetric space of rank 1 by harmonic functions of de- gree 2; the last case occurs only when c>O (S. L. Hong (1973), 1. Little (1976), K. Saka- moto [14]). A Kahler submanifold of a complete and simply connected complex space form with the same property as above is either a totally geodesic submanifold or a Veronese sub- manifold (a Kahler immersion of Pn(C) into Pn(n+ 3)/2 (C)) (K. Nomizu (1976)). Submanifolds with the above property are closely related to isotropic submanifolds with V'a=O. Submanifolds with V'a=O in sym- metric spaces have been studied by Ferus, H. Naito, Takeuchi, K. Tsukada, and others. O. Total Curvature Let j: M -> Em be an isometric immersion of an n-dimensional compact Riemannian manifold M into a Euclidean space. Let vr(M) be the unit normal bundle, sm-t the unit sphere cen- tered at OEEm, and let J.L: VI (M)->sm-t be the para11el translation. Let wand 0 be the tvol- ume elements of VI (M) and sm-t, respectively. Then for each  E VI (M), JIO = (det A)w holds. As a generalization of the ttotal curvature for a space curve, the total curvature of the im- mersion J is defined as r:(f)= 1(m-1) f IJIOI vo v,(M) = 1 m-1 f IdetAdw. vol(S ) v,(M) If f3(M) is the least number of critical points of a tMorse function on M, then inh(f) = f3(M) 2 f holds (Chern and R. Lashof(1957, 1958), Kuiper (1958)). ,(f) = 2 if and only if J is an embedding and J(M) is a convex hypersurface of some En+1 in Em (Chern and Lashof (1958)). If r:(f) < 3, then M is homeomorphic to sn (Chern and Lashof (1958)). These results gen- eralize theorems for space curves by W. Fen- chel (1929), I. Fary (1949), 1. Mi1nor (1950), and others. An isometric immersion J which attains inf(f) is ca11ed a minimum immersion or a tight immersion. tExotic spheres do not have minimum immersions (Kuiper (1958)). tR- 
365 Ref. Riemannian Snbmanifolds spaces have minimum immersions and, in particular, a minimum immersion of a sym- metric R-space is a tminimal immersion into a hypersphere (Kobayashi and Takeuchi (1968)). The total mean cnrvature of an isometric immersion f: M -> Em of an n-dimensional compact Riemannian manifold into a Euclid- ean space is, by definition, L 111)11"* 1. It satisfies L 111)11 n * 1 vol(sn), where sn is the n-dimensional unit sphere. The equality holds if and only if f is totally umbil- ical (T. J. Willmore (1968), Chen [3J). References [IJ S. Kobayashi and K. Nomizu, Founda- tions of differential geometry II, Interscience, 1969. [2J M. Spivak, A comprehensive introduction to differential geometry III, IV, V, Publish or Perish, 1975. [3J B. Y. Chen, Geometry of submanifolds, Dekker, 1973. [4J B. Y. Chen and T. Nagano, Totally geo- desic submanifolds of symmetric spaces I, II, III, Duke Math. J., 44 (1977), 745-755; 45 (1978),405-425. [5J S. S. Chern, M. doCarmo, and S. Koba- yashi, Minimal submanifolds of a sphere with second fundamental form of constant length, Functional Analysis and Related Fields, Springer, 1970. [6J A. D. Alexandrov, A characteristic prop- erty of spheres, Ann. Mat. Pura Appl., 58 (1962), 303- 315. [7J H. Ozeki and M. Takeuchi, On some types of isoparametric hypersurfaces I, II, Tohoku Math. J., 27 (1975), 515-559; 28 (1976), 7-55. [8J P. Ryan, Homogeneity and some curva- ture conditions for hypersurfaces, Tohoku Math. J., 21 (1969), 363-388. [9J T. Takahashi, Homogeneous hypersurfaces in spaces of constant curvature, J. Math. Soc. Japan, 22 (1970),395-410. [IOJ E. Calabi, Isometric imbedding of com- plex manifolds, Ann. Math., (2) 58 (1953), 1- 23. [IIJ H. Nakagawa and R. Takagi, On locally symmetric Kiihler submanifolds in a com- plex projective space, J. Math. Soc. Japan, 28 (1976),638-667. [12J K. Ogiue, Differential geometry of Kahler submanifolds, Advances in Math., 13 (1974), 73-114. 1360 [13J B. Y. Chen and K. Ogiue, On totally real submanifolds, Trans. Amer. Math. Soc., 193 (1974),257-266. [14J K. Sakamoto, Planar geodesic immer- sions, Tohoku Math. 1., 29 (1977),25-56. [15J D. Ferus, Totale Absolutkriimmung in Differentialgeometrie und Topologie, Springer, 1968. 366 (VIII.6) Riemann-Roch Theorems A. General Remarks The tRiemann-Roch theorem (abbreviation: R. R. theorem) is one of the most significant results in the classical theory of talgebraic functions of one variable. Let X be a compact tRiemann surface of tgenus g, and let D = L.miP i be a tdivisor on X. We denote by deg D the degree of D, which is defined to be L. mi' The divisor D is said to be positive if D =1= 0 and mi  0 for all i. A nonzero tmero- morphic function f on X determines a divisor (1)= L. aiQi- L. bjRj(a i , bj>O), where the Qi are the zeros of order a i and the Rj are the poles of order b j . The set of meromorphic functions f such that (I) + D is positive, together with the constant f = 0, forms a finite-dimensional linear space L(D) over C. The R. R. theorem asserts that dim L( D) = deg D - 9 + 1 + r( D), where r(D) is a nonnegative integer determined by D. If K is the tcanonical divisor of X, then r(D) = dim L(K - D) ( 9 Algebraic Curves C; 11 Algebraic Functions D). (For the R. R. theorem for algebraic surfaces  15 Algebraic Surfaces D.) Generalizations of this important theorem to the case of higher-dimensional compact tcomplex manifolds were obtained by K. Kodaira, F. Hirzebruch, A. Grothendieck, M. F. Atiyah and I. M. Singer, and others. Let X be a compact complex manifold, B be a tcom- plex line bundle over X, and 0(B) be the tsheaf of germs of holomorphic cross sections of B. When B is determined by a divisor D of X, we have HO(X, 0(B)) L(D). Hence a desirable generalization of the R. R. theorem will pro- vide a description of dime HO(X, (O( B)) in terms of quantities relating to the properties of X and B. Following this idea, various theorems of Riemann-Roch type have been obtained. B. Hirzebruch's Theorem of R. R. Type Keeping the notation given in Section A, we put X(X,(O(B))=L. q ( -l)q dimHq(X,&(B)). 
1361 Generally, if g; is an arbitrary tcoherent ana- lytic sheaf on X, we can define x(X, g;) using the same formula (replacing (!!(B) by g;). The quantity X(X,g;) has simple properties in various respects. For example, if the sequence O-->g;' -->g; -->g;" -->0 is exact, we have X(X, g;) = X(X, g;') + X(X, g;"). If an tanalytic vector bundle F depends continuously on auxiliary parameters, then X(X, (!!(F)) remains constant. Let X be a projective algebraic manifold of complex dimension n. We consider the tChern class c= 1 +c 1 +... +c n of X and express it formally as the product II7=1 (l + y;). Thus the ith Chern class C i is expressed as the ith ele- mentary symmetric function of Y1' '" , Yn' Con- sider the formal expression T(X) = II71 Y;/ (l-e- Y '). T(X) can be expanded as a formal power series in the Yi' and each homogeneous term, being symmetric in the Yi' can be ex- pressed as a polynomial in C 1 , ..., C m and thus determines a cohomology class of X. Similarly, we consider the formal expression of the Chern class of the vector bundle F as 1 +d 1 +... +d q = IIJ1 (1 + <>j), where q is the dimension of the fiber of F. We put ch(F) = '£J=1 e.;. The formal series ch(F) is also an element of the coho- mology ring of X whose (v+ l)st term consists of a 2v-dimensional cohomology class. We call ch(F) the tChern character of F ( 237 K- Theory B), and define T(X, F) to be the value of ch(F) T(X) at the tfundamental cycle X. (The multiplication ch(F)' T(X) is formal. T(X, F) is determined by the term of dimen- sion 2n alone.) T(X, F) is called the Todd char- acteristic with respect to F. Hirzebruch's theo- rem ofR. R. type asserts that X(X,(IJ(F)) = T(X, F). In particular, when n = 1, F = [D] (the line bundle determined by the divisor D), Hirzebruch's formula yields the classical R. R. theorem. If F satisfies the conditions for the vanishing theorem of cohomologies, the for- mula gives an estimate for dim HO(X, (!!(F)) [11]. In 1963, Atiyah and Singer developed a theory on indices of elliptic differential operators on a compact orientable differenti- able manifold and obtained a general result that includes the proof of Hirzebruch's theorem for an arbitrary compact complex manifold [4,5] ( 237 K -Theory H). C. R. R. Theorem for Surfaces If X is a compact complex surface, i.e., a com- pact complex manifold of dimension 2, then for complex line bundles F 1 and F2' the inter- section number (F1 F 2 ) is defined to be c 1 (Fd U c 1 (F 2 )[X]. The R. R. theorem for a line bundle F on X is stated as follows: X(X, (!!(F)) =(F 2 )/2-(KF)/2 +((K 2 )+c 2 (X))/12, where 366D Riemann-Roch Theorems K is the canonical line bundle of X and c 2 (X) denotes the value at X of the 2nd Chern class of X, that is, the Euler number of X. The Noether formula Xx=((K 2 )+c 2 (X))/12 follows from the above identity. The R. R. theorem for surfaces is a powerful tool for the study of compact complex surfaces. D. Grothendieck's Theorem of R. R. Type Grothendieck took an entirely new point of view in generalizing Hirzebruch's theorem. The following is a description of his idea as reformulated by A. Borel and J.-P. Serre [8]. We consider a nonsingular quasiprojective algebraic variety X ( 16 Algebraic Varieties) over a ground field of arbitrary characteristic. Namely, X is a closed subvariety of an open set in a projective space (over an algebraically closed ground field). Consider the group K(X), which is the quotient of the free Abelian group generated by the equivalence classes of alge- braic vector bundles over X modulo the sub- group generated by the elements of the form F - F' - F", where F, F', F" are classes of bundles such that there exists an exact se- quence 0--> F' --> F --> F" -->0. A similar construc- tion for the (equivalence classes of the) tcoher- ent algebraic sheaves instead of the vector bundles yields another Abelian group K'(X). It can be shown that K(X) is isomorphic to K'(X) by the correspondence F -->(!!(F) (= the sheaf of germs of regular cross sections of F). Addition in K(X) is induced by the tWhitney sum of the bundles, and K(X) has the struc- ture of a ring with multiplication induced by the tensor product. For a vector bundle F, its Chern class c(F)= 1 +cdF)+... +cq(F) (q= the dimension of the fiber) is defined as an element of the tChow ring A(X) with appro- priate properties. (A(X) is the ring of the rational equivalence classes of algebraic cycles on X, and ci(F) is the class of a cycles of co- dimension i.) We define ch(F) as before. It can be shown that c(F) and ch(F) are deter- mined by the image of Fin K(X), and we have c( HO=c()c(I'/), chg+I'/)=ch()+ch(I'/), ch(I'/)=ch()ch(l'/) (,I'/EK(X)). If we have a tproper morphism f: Y-+X between quasi- projective algebraic varieties Y and X, we have homomorphismsj':K(X)-->K(Y) and j,:K(Y)-->K(X). The former is defined by taking the induced vector bundle and the latter by the correspondence g; --> D -l)q(qJ)g;, where g; is a coherent algebraic sheaf on Y and (qJ)g; is the qth tdirect image of g; under f (Since f is proper, (qJ)g; is coher- ent.) Between Chow rings we have homomor- 
366E Riemann-Roch Theorems phisms f*: A(X)-->A(Y) and f*: A(Y)--> A(X), defined by taking inverse and direct images of cycles. With this notation, the theorem asserts that if X and Yare quasiprojective and.f: Y--> X is a proper morphism, then f*(ch()T(Y))= ch(f,())T(X). This is called Grothendieck's theorem of R. R. type. If X consists of a single point, the theorem gives Hirzebruch's theorem for algebraic bundles. Since algebraic and analytic theories of coherent sheaves on a complex projective space are isomorphic, this result covers Hirzebruch's theorem ( 237 K- Theory). The subgroup R(Y) of A(Y) given by R(Y)= {ch() . T( Y) I  E K (Y)} is ca1\ed the Riemann- Roch gronp of Y. Thus, using the notions developed by Grothendieck, the R. R. theorem can be expressed as follows: R(Y) is mapped into R(X) by a proper morphism f: Y --> X. Generalizations to talmost complex manifolds and tdifferentiable manifolds were made by Atiyah and Hirzebruch in this latter form [1]. One of the remarkable results is that an ele- ment of R( Y) takes an integral value at the fundamental cycle. This theorem is obtained by taking X to be a single point. E. R. R. Theorem for Singnlar Varieties Let X be a projective variety over C and let H.(X) (resp. H'(X)) denote the singular ho- mology (resp. cohomology) group with rational coefficients. Note that K(X) may not agree with K'(X) when X is singular. The R. R. theorem for X formulated by P. Baum, W. Fulton, and R. MacPherson [6J says that there exists a unique natural transformation "[: K'(X)--> H.(X) such that (1) if  E K(X) and I]EK'(X), then  @ I]EK'(X) and "[( @ 1])= ch() ("[(1])); (2) whenever X is nonsingular, "[((1\)= T(X)(X). Note that the naturality of"[ means that for any f:X --> Y and any I]EK'(X), f* "[(I]) = "[(f* 1]). References [IJ M. F. Atiyah and F. Hirzebruch, Riemann- Roch theorems for differentiable manifolds, Bull. Amer. Math. Soc., 65 (1959), 276-281. [2J M. F. Atiyah, R. Bott, and V. K. Patodi, On the heat equation and the index theorem, Inventiones Math., 19 (1973), 279-330. [3J M. F. Atiyah, V. K. Patodi, and I. M. Singer, Spectral asymmetry and Riemann geometry, Bull. Lond. Math. Soc., 5 (1973), 229-234. [4J M. F. Atiyah, G. B. Segal, and I. M. Sin- ger, The index of elliptic operators, Ann. Math., (2) 87 (1968), 484-604 (I by Atiyah and 1362 Singer, II by Atiyah and Segal, III by Atiyah and Singer). [5J M. F. Atiyah and I. M. Singer, The index of elliptic operators on compact manifolds, Bull. Amer. Math. Soc., 69 (1963), 422-433. [6J P. Baum, W. Fulton, and R. MacPherson, Riemann-Roch for singular varieties, Publ. Math. Inst. HES, no. 45 (1975),101-145. [7J P. Berthelot, A. Grothendieck, L. I1Iusie, et aI., Theorie des intersections et theoreme de Riemann-Roch, SGA 6, Lecture notes in math. 225, Springer, 1971. [8J A. Borel and J.-P. Serre, La theoreme de Riemann-Roch, Bu1\ Soc. Math. France, 86 (1958),97-136. [9J W. Fulton, Rational equivalence on sin- gular varieties, Publ. Math. Inst. HES, no. 45 (1975), 147-167. [I OJ W. Fulton and R. MacPherson, Intersect- ing cycles on an algebraic variety, Real and Complex Singularities, Sijthoff & Noordhoff, 1976,179-197. [llJ F. Hirzebruch, Topological methods in algebraic geometry, Springer, third edition, 1966. [12J K. Kodaira, On compact analytic sur- faces I, Ann. Math., (2) 71 (1961), 111-152. [13J R. MacPherson, Chern classes of singular varieties, Ann. Math., (2) 100 (1974), 423-432. [14J B. Segre, Nuovi metodi e risultati ne1\a geometria sulle varieta algebriche, Ann. Mat. (4) 35 (1953),1-128. [15J J. Todd, The geometrical invariants of algebraic loci, Proc. London Math. Soc. (2) 43 (1937),127-138. 367 (XI.12) Riemann Surfaces A. General Remarks Riemann considered certain surfaces, now named after him, obtained by modifying in a suitable manner the domains of definition of multiple-valued tanalytic functions on the complex plane in order to obtain single-valued functions defined on the surfaces. For example, consider the function z = f(w) = w 2 , where w varies in the complex plane, and its inverse function w = g(z) = fi. Then g(O) = 0 and g(oo)= 00, whereas if z0;60, 00, there exist two values of w satisfying g(z) = w. By setting z = re ie (r > 0, 0  e < 2n), the corresponding two values of ware WI =;-; e(e/2). and W 2 = j;-e(e/2H)i. Now consider how we should modify the complex z-plane so that we can obtain a single-valued function on the modi- fied surface representing the same relationship 
1363 between z and w. Let 11: 1 and 11:2 be two copies of the complex plane. Delete the nonnegative real axes from 11:1 and 11:2 and patch them crosswise along the slits (Fig. 1). The surface R thus obtained is locany homeomorphic to the complex plane except for the origin and the point at infinity, and situations at the origin and the point at infinity are as indicated in Fig. 1. For z#O, 00, there are two points ZI and Z2 in 11: 1 and 11:2' respectively, with the same coordinate z. Let WI and W 2 correspond to ZI and Z2, respectively. Then the function W= becomes single-valued on R, and WI and W 2 are holomorphic functions of ZI and Z2, respectively. The surface R is caned the Riemann surface determined by W =. R Fig. 1 Working from the idea i11ustrated by this example, H. Weyl and T. Rad6 gave rigorous definitions of Riemann surfaces. The usual de- finition nowadays is as fonows: Let 21 be a set of pairs (U, t/Ju) of open sets U in a tconnected tHausdorff space R and topological mappings t/Ju of U onto plane regions satisfying the fonowing two conditions: (i) R= U(U.lpu)e U; (ii) for each (Uj,t/Ju), (U 2 ,t/Ju,)E21 with V= U 1 n U 2 # 0, t/Ju 1 0 t/JiJ 2 1 gives an (orientation- preserving) tconformal mapping of each con- nected component of t/Ju 2 (V) onto a corre- sponding one oft/Ju1(V). Two such sets 21 1 and 21 2 are equivalent, by definition, if 21 1 U 21 2 also satisfies conditions (i) and (ii). The equivalence class (21) of such 21 is caned a conformal structure (analytic structure or com- plex structure) on R ( 72 Complex Mani- folds). A pair (R, (21)) consisting of a connected Hausdorff space R and a conformal structure (21) is caned a Riemann surface, with R its base space and (21) its conformal structure. (A Riemann surface in this sense is sometimes caned an abstract Riemann surface.) It is a complex manifold of tcomplex dimension 1. For (U, t/Ju) in 21 E (21), (U, t/Ju) (or sometimes U itself) is caned an analytic neighborhood, and t/Ju is caned a local uniformizing parameter (or simply a local parameter). In the remainder of this article we can R itself a Riemann surface. From condition (i) it fonows that a Riemann surface R is a real 2-dimensional ttopological 3678 Riemann Surfaces manifold. Moreover, by condition (ii) it can be deduced that R satisfies the tsecond counta- bi1ity axiom and consequently is a tsurface and admits a tsimp1icial decomposition (T. Rad6, 1925). It is also orient able ( 410 Sur- faces). Therefore R is a t10cany compact metric space. It is not possible to define curve lengths on R compatible with the conformal structure, but since angles can be defined, R is consid- ered to be a real 2-dimensional space with a tconformal connection. It is customary in the theory of functions to can R closed if it is compact and open otherwise. A plane region D is considered an open Riemann surface with the conformal structure 21 =(D, 1: D-->D). A Riemann sphere is also considered to be a closed Riemann surface whose analytic neigh- borhoods are given by {Upo/} and {U 2 , IN}, where U 1 (U 2 ) is the domain corresponding to {izi < 2} ({Izi > 1/2} U {oo}) under the stereo- graphic projection 0/. A function f on a Riemann surface is said to be meromorphic, holomorphic, or harmonic on R if f 0 t/JiJ1 is tmeromorphic, tholomor- phic, or tharmonic in the usual sense on t/Ju(U) for every analytic neighborhood (U, t/Ju)' More generany, suppose that for mappings between plane regions we are given a property  that is invariant under conformal mappings. A mapping T of a Riemann surface R 1 onto another Riemann surface R 2 is said to have the property  if the mapping t/Ju 20 To t/JiJ,1 of t/Ju, (Ud into t/JU 2 (U 2 ) has the property  for every pair of analytic neighborhoods (U1' t/Ju) nd (U 2 , t/Ju 2 ) in R 1 and R 2 , respec- tively. Thus such a mapping T may be con- formal, tanalytic, tquasiconformal, harmonic, etc. If there exists a one-to-one conformal mapping of a Riemann surface R 1 onto an- other Riemann surface R2' then R 1 and R 2 are said to be conformally equivalent. Two such Riemann surfaces are sometimes identified with each other. 8. Covering Surfaces One of the main themes of the theory of func- tions is the study of analytic mappings of a Riemann surface R into another Riemann surface Ro, i.e., the theory of covering surfaces of Riemann surfaces. Suppose, in general, that there are two surfaces Rand Ro and a continuous open mapping T of R into,Ro such that the inverse image of a point in Ro under T is an tisolated set in R. Then T is caned an inner transfor- mation in the sense of Stoilow and (R, Ro; T) a covering surface with Ro its basic surface and T its projection. Often R is caned simply a covering surface of Ro. A point Po with Po= 
367C Riemann Surfaces T(p) is caned the projection of p, and p is said to lie above Po. In this case, there exist sur- face coordinates (U, 1jJ) and (U o , ljJo) at p and Po, respectively, such that IjJ(U) = {z Ilzl < I}, ljJ(p) = 0, ljJo(U o ) = {w Ilwl < 1}, ljJo(Po) = 0, and w = (ljJo 0 To IjJ -1 )(z) = z", with the positive integer n independent of the choice of coordi- nate neighborhoods. If n > 1, then p is caned a branch point, n the multiplicity, and n - 1 the degree of ramification. The set of an branch points forms an at most countably infinite set of isolated points. If there is no branch point, then the covering surface (R, Ro; T) and the projection T are said to be unramified. For a given curve Co in Ro and a point p in R lying above the initial point of Co, a curve C in R with p its initial point satisfying T(C) = Co is caned the prolongation along Co (or the lift of Co) starting from p. If any proper subarc of Co sharing the initial point with Co admits a prolongation along itself starting from p but Co does not admit a prolongation along itself starting from p, then R is said to have a rela- tive boundary above the endpoint of Co. A covering surface without a relative boundary is called unbounded. A tsimply connected surface ROO that is an unramified unbounded covering surface of Ro is said to be a universal covering surface of Ro. The universal covering surface exists for every R. Suppose that R is an unbounded covering surface of Ro. Then the number of points on R that lie above each point of Ro is always constant, say n (:;;;;; +(0), where the branch points of R are counted with their multiplic- ities. n is called the number of sheets of Rover Ro, or R is said to be n-sheeted over Ro. If R and Ro are compact surfaces with tEuler char- acteristic X and Xo, respectively, and if R is an n-sheeted covering surface of Ro, then we have the Riemann-Hnrwitz relation: X = nxo - V, V being the sum of the degrees of ramification. A topological mapping S of an unramified un- bounded covering surface R of Ro onto itself such that To S = T is called a covering trans- formation. The group of an covering trans- formations of a universal covering surface of Ro is isomorphic to the tfundamental group (i.e., the I-dimensional homotopy group) of Ro. In a covering surface (R, Ro; T) whose basic surface Ro is a Riemann surface, T can be regarded as an analytic mapping of R onto Ro by giving R a conformal structure in a natural manner. In particular, if Ro is the sphere, then its covering surface is a Riemann surface. Conversely, any Riemann surface can be re- garded as a covering surface of the sphere. This fact had long been known for closed Riemann surfaces; for open Riemann surfaces, it can be deduced from the existence theorem of an- 1364 alytic functions proved by H. Behnke and K. Stein, which states that an open Riemann surface is a tStein manifold. Historicany, by a Riemann surface mathematicians meant either the abstract Riemann surface or a covering surface of the sphere, until the two notions were proved identical. Suppose that R is a covering surface of the z-sphere Ro with the projection T: R --> Ro, and denote by Ra the region that lies above 0 < Iz- al <roo If there exists a topological mapping IjJ of Ra onto {(r, 8)1 O<r<r o , -oc <lk oo} such that a +re i8 = T(1jJ -1 (r, (})), then R is said to have a logarithmic branch point above a; in contrast, a branch point of multiplicity n of the type defined previously is sometimes caned an algebraic branch point. Ahlfors's theory of covering surfaces, which treats covering surfaces (R, Ro; T) not only from the topological viewpoint but also from the metrical one, is particularly important. Let Rand Ro be either compact surfaces with simplicial decompositions or their closed subregions with boundaries consisting of 1- simplexes and vertices such that T preserves simplicial decompositions. Here we can the part of the boundary of R whose projection is in the interior of Ro the relative boundary. With respect to a suitable metric on Ro, let S be the ratio of areas of Rand Ro, L the length of the relative boundary, and - p and - Po the tEuler characteristics of Rand Ro, respec- tively. Then Ahlfors's principal theorem asserts that max(O,p)poS-hL, where h>O is a constant determined only by Ro. This has been applied widely in various branches of mathe- matics, including the theory of distribution of values of analytic mappings between Riemann surfaces. C. Uniformization Suppose that we are given a correspondence between the z-plane and w-plane determined by a tfunction element Po = (z, w) (z = Po(t), w = Qo(t)). This correspondence generally gives rise to a multiple-valued analytic func- tion w= j(z). We show how to construct a Riemann surface so that the function w= j(z) can be considered a single-valued function on it. We use j again to mean the connected component of the set of function elements p=(z*, w*) (z*=P(t), w*=Q(t)) in the wider sense containing Po, where the analytic neigh- borhood of each point p is defined to be the set of elements that are direct analytic continua- tions of p. Then j is a Riemann surface. For a point p=(z*, w*) (z*=P(t), w*=Q(t)) inf, set z=F(p)=P(O), w=G(p)=Q(O). Then two meromorphic functions z=F(p) and w= G(p) 
1365 are defined on f, and f can be considered as a covering surface of the z-sphere and the w- sphere. We ca11 f an tanalytic function in the wider sense. Thus we obtain a single-valued function w = G(p) on the Riemann surface f that can be regarded as a modification of the original function w= f(z). Suppose that there exist two meromorphic functions z = cp(') and w = t/J(,) on a region D in the' -plane, and let z=P(, -'0) and w=Q(, -'0) be tLaurent expansions of cp and t/J at each point '0 in D. If the function element P'o = (z*, w*) (z* = P(t), w* = Q(t») belongs to the Riemann surface f, then the correspondence w = f(z) determined by the function element D, is said to be locallv unifor- - .Ie o ., mized on D by z=cpm and w=t/J('). In par- ticular, if {p, I' ED} = f, then f is said to be uniformized by z=cp(O and w=t/Jm. If an analytic function f in the wider sense, consid- ered as a Riemann surface, is conforma11y equivalent to a region D in the '-plane, then f can be uniformized by z = F(p) and w = G(p). In general, f is not conforma11y equivalent to a plane region, but if an unramified unbounded covering surface (!. f; T) of f is conforma11y equivalent to a region D in the '-plane, then f is uniformized by z = F 0 T(,) and w = Go Tm (Schottky's uniformization). In particular, since the universal covering surface (f"'J; T) of fis simply connected, it is conforma11y equivalent to the sphere I"  00, the finite plane '" < 00, or the unit disk I" < 1. Consequently, f is uniformized by z = F 0 T(') and w = Go T('). Therefore analytic functions in the wider sense are always uniformizable. For example, an talgebraic function f consid- ered as a Riemann surface is always closed. If its tgenus g = 0, then f is the sphere and is thus uniformized by rational functions z = F( ,) and w = Gm. If g > 0, then (f'" J; T) is conforma11y equivalent to "I < 1 or "I < 00, and hence f is uniformized by z = F 0 Tm and w = G 0 Tm. When I" < 1, z and ware tautomorphic func- tions with respect to the group of linear trans- formations preserving I" < 1, while if I" < 00, they are te11iptic functions. D. The Type Problem A simply connected Riemann surface R is conforma11y equivalent to the sphere, the finite plane, or the unit disk. Then R is said to be elliptic, parabolic, or hyperbolic, respectively. The problem of determining the types of sim- ply connected covering surfaces of the sphere by their structures, such as the distributions of their branch points, is ca11ed the type problem for Riemann surfaces. For example, if a simply connected covering surface does not cover three points on the sphere, it must be hyper- 367E Riemann Surfaces bo\ic (Picard's theorem). The Nevanlinna theory of meromorphic functions stimulated this type problem. However, it is difficult to measure the ramifications of covering surfaces, and many detailed reuslts of the type problem obtained in the 1930s are limited mainly to the case where a11 branch points lie above a finite number of points on the sphere. A suffi- cient condition for R to be of parabolic type, given by Z. Kobayashi (using the so-ca11ed Kobayashi net, and a sufficient condition for R to be of hyperbolic type, given by S. Kaku- tani (using quasi conformal mappings), are significant results on the type problem. The type problem had by then been extensively generalized to the fo11owing classification theory. E. Classification Theory of Riemann Surfaces Riemann surfaces are, as pointed out by Weyl, "not merely devices for visualizing the many- valuedness of analytic functions, but rather an essential component of the theory... the only soil in which the functions grow and thrive." So the problem natura11y arises of how to extend various results in the theory of analytic functions of a complex variable to the theory of analytic mappings between Riemann sur- faces. In general, open Riemann surfaces can have infinite genus and are quite complicated. So to obtain fruitful results and systematic development, one usua11y sets certain restric- tions on the properties of the Riemann sur- faces. In connection with this, R. Nevanlinna, L. Sario, and others initiated the classification theory of Riemann surfaces, which classifies Riemann surfaces by the existence (or non- existence) offunctions with certain properties. Denote by X(R) the totality offunctions on a Riemann surface R with a certain property X. The set of a11 Riemann surfaces R for which X(R) does not contain any function other than constants is denoted by Ox. The family of analytic functions and the family of harmonic functions are denoted by A(R) and H(R), re- spectively. The family of positive functions, that of bounded functions, and that offunc- tions with finite Dirichlet integrals are denoted by P(R), B(R), and D(R), respectively. From these families, various new families are created, e.g., ABD(R)=A(R)nB(R)nD(R). Usua11y 0HB, 0HD' 0HBD, 0AB' 0AD, 0ABD, and also 0G, the family of Riemann surfaces on which there are no Green's functions, are considered. P. J. Myrberg found an example of a Riemann surface of infinite genus which has a large boundary but belongs to 0AB' The idea behind Myberg's example is often used to construct examples in classification theory. From works 
367F Riemann Snrfaces of y. T6ki, L. Sario, K. 1. Virtanen, H. L. Royden, M. Parreau, M. Sakai, and others, it can be seen that there are inclusion relations, as indicated in Fig. 2, among the classes just mentioned. There is no inclusion relation between OAB and 0HD' For Riemann surfaces of finite genus, 0G=OHD' Closed Riemann surfaces are all in 0G' Open Riemann surfaces in OG are also said to be parabolic (or of null bonndary), and those not in 0G, hyperbolic (or of positive boundary). Several characterizations for parabolic Riemann surfaces are known. c:<: 0HIJ = 0HRf) <:;:- Or,  ()HP  ()HH& e:- c: '* U 4 /i = ()4BI) * 04H *" Fig. 2 From a similar point of view, the classifica- tion problem for subregions was studied in detail by Parreau, A. Mori, T. Kuroda, and others. We call a noncompact region Q which is the complement of a compact subset of a Riemann surface a Heins's end. M. Heins called the minimal number (  C1J) of gen- erators of the semigroup of the additive class of HP-functions that vanish continuously at the relative boundary of a Heins's end Q the harmonic dimension of Q. Its properties were investigated by Z. Kuramochi, M. Ozawa, and others. Generally, a function! is said to be .I- minimal if! is positive and contained in .I(R) and there exists a constant C g for every 9 in .I(R) with !?g?O such that g= Cgf The family of Riemann surfaces R not belonging to OG and admitting .I-minimal functions on R is denoted by UI' C. Constantinescu and A. Cornea and others studied Riemann surfaces in U HB and U HD where HD is the class of posi- tive functions in H D or limits of monotone decreasing sequences of such functions. There are inclusion relations UHBOHB-OG' U llD OHDOG' and UHD ';f UUB' One of the inter- esting results in classification theory is the fact, discovered by Kuramochi, that UHBU OG ';f OAB and UHD UOGOAD' Classification theory has a very deep con- nection with the theory oftideal boundaries. A. Pfluger and Royden showed that the classes OG and OHD are invariant under quasicon- formal mappings. However, it is still an open problem whether OHB is invariant in this sense. F. Prolongations of Riemann Surfaces As classification theory shows, pathological phenomena occur for Riemann surfaces from the viewpoint of function theory in the plane. These are caused by the complexity of ideal 1366 boundaries of Riemann surfaces, and in par- ticular by the complexity of the set of ideal boundary points at which handles of Riemann surfaces (i.e., parts with cycles not homologous to 0) accumulate. Hence it is desirable to find larger Riemann surfaces so that ideal bound- ary points of original surfaces that are not accumulating points of handles become inte- rior points. Suppose that a Riemann surface R is conformally equivalent to a proper sub- region R' of another Riemann surface R 1 . Then R 1 is said to be a prolongation of R, and R is prolongable. A nonprolongable Riemann surface is said to be maximal. Closed Riemann surfaces are always maximal, but there also exist maximal open Riemann surfaces (Rad6). However, every open Riemann surface is homeomorphic to a prolongable Riemann surface (S. Bochner). Characterizations of prolongable Riemann surfaces and relation- ships between the various null classes men- tioned in Section E and prolongations were investigated from several viewpoints by R. de Possel, J. Tamura, and others. G. Analytic Mappings of Riemann Surfaces Apart from the development of the classifica- tion theory of Riemann surfaces, efforts have been made to extend various results in the theory of analytic functions of a complex variable to the case of analytic mappings be- tween Riemann surfaces. L. Sario studied the method of normal operators, which is utilized to construct harmonic functions on Riemann surfaces with given singularities at their ideal boundaries; and he extended the main theo- rems of Nevanlinna to analytic mappings between arbitrary Riemann surfaces ( 124 Distributions of Values of Functions of a Complex Variable). M. Heins introduced the notions of Lindelof type, Blaschke type, and others which are special classes of analytic mappings. Utilizing these notions, Con- stantinescu and Cornea, Kuramochi, and others extended various results in the theory of cluster sets ( 62 Cluster Sets) by studying the behavior of analytic mappings at ideal boundaries. The theory of tcapacities on ideal boundaries has also been developed. On every open Riemann surface R there exists a non constant holomorphic function (Behnke and Stein). Furthermore, Gunning and Narasimhan proved that there exists a holomorphic function on R whose derivative never vanishes. In other words, R is conform- ally equivalent to an unramified covering surface of the sphere. Such a locally homeo- morphic analytic mapping is caned the im- mersion of R. The proof is based on the fol- 
1367 lowing deep result (S. Mergelyan's theorem): Suppose that K is a compact set on R such that R - K has no relatively compact con- nected component. Then every continuous function which is holomorphic on the interior of K can be approximated uniformly on K by a holomorphic function on R [22]. A surface of genus 0 is said to be planar (or of planar character or schlichtartig). A simply connected surface is planar. Using the Dirich- let principle, P. Koebe proved the fo11owing general uniformization theorem: Every planar Riemann surface R can be mapped conform- a11y to the canonical slit regions on the ex- tended complex plane C. More precisely, given a point p on R, there exists the extremal hori- zontal (vertical) slit mapping F 1 (F2) such that (i) F 1 (F 2 ) maps R conforma11y to a region on C whose boundary consists of horizontal (ver- tical) slits and possibly points; (ii) F 1 and F 2 have a simple pole at p with residue 1; (iii) the total area of the slits and points is zero. Sup- pose that Fj== Ijz+a j +c j z+ '" (i== 1, 2) in terms of local parameter z at p. Then s == (c 1 - c 2 )j2 is ca11ed the span of R. It is known that s (== Ild(F 1 - F 2 )j211 2 );;;, 0, where the equality holds if and only if R belongs to 0AD' In the case of finite genus g, there also exist the con- formal mappings of R onto the para11el slit regions on the (g + I)-sheeted covering surface of C (Z. Nehari, Y. Kusunoki, and others). L. Ahlfors proved that a Riemann surface of genus g bounded by m contours can be mapped conforma11y to an at most (2g + m)- sheeted unbounded covering surface of the unit disk. The structures of closed Riemann surfaces are determined by the algebraic structures of meromorphic function fields on them. H. Iss'sa obtained a noteworthy result which estab- lished that open Riemann surfaces are also determined by their meromorphic function fields [24]. It is known too that open Rie- mann surfaces are determined by their rings of holomorphic functions. H. Differential Forms on Riemann Surfaces Since Riemann surfaces are considered real 2- dimensional differentiable manifolds of class Coo, differential1-forms w==udx+vdy and differential2-forms Q==cdx /\dy are defined on them, and operations such as the exterior derivative dw==(ovjox -oujoy)dx /\dy and exterior product can be defined ( 105 Dif- ferentiable Manifolds). Since coordinate trans- formations satisfy the Cauchy-Riemann dif- ferential equation, the conjugate differential * w== -vdx +udy of w can be defined. A dif- ferential form w satisfying dw==d * w==Q is 367 I Riemann Surfaces ca11ed a harmonic differential, and one with * w == - iw is said to be pure. A pure differential is expressed as w == f dz. Here, if f is a holomor- phic function, then w is ca11ed a holomorphic (or analytic) differentia and if f is a meromor- phic function, then w is ca11ed a meromor- phic (or Abelian) differential. The differential form w is a holomorphic differential if and only if it is closed (i.e., dw==O) and pure. The differential w is ca11ed exact (or total) if w is written as dF == Fxdx + Fydy with a globa11y single-valued function F. Next, the set of a11 measurable differentials w with IIwl1 2 == HR w/\ * W < 00 forms a tHil- bert space with respect to the norm Ilwl!. The method of orthogonal decomposition in the theory of Hilbert spaces is the main device to study this space and also its suitable subspaces and to obtain the existence theorem of har- monic and holomorphic differentials with various properties ( 194 Harmonic Inte- grals). However, in contrast to differentiable manifolds, it should be noted that finer or- thogonal decompositions into subspaces with specific properties hold for open Riemann surfaces. For instance, let fa(f h ) be the Hilbert space of analytic (harmonic) differentials with finite norm, and set fae == {w E fa I W is exact}, r hse == { WE r h I w is tsemiexact}; then we have the orthogonal decompositions fa == fae + f aS , f h == *f hse + f hm == f hse + *f hm , etc., where *r x stands for the space {wi *WE f x } and fas and f hm are known as the space of analytic Schottky differentials and the space of dif- ferentials of harmonic measures, respectively. Both spaces faS and f hm have remarkable properties [9]. I. Abelian Integrals on Open Riemann Surfaces The systematic effort to extend the theory of Abelian integrals on closed Riemann surfaces to open Riemann surfaces was initiated by Nevanlinna in 1940. At the first stage of the development, only those Riemann surfaces with sma11 boundaries (i.e., ones in 0G or 0HD) were treated; later a more general treatment was made possible by the discovery of the notion of semi exact differentials (K. Virtanen, Ahlfors). Let R be an arbitrary open Riemann sur- face. A I-dimensional cycle C is ca11ed a divid- ing cycle if for any compact set in R there exists a cycle outside the compact set homol- ogous to C. A differential is said to be semi- exact if its period along every dividing cycle vanishes. 
367 Ref. Riemann Snrfaces Ahlfors defined the distinguished (com- plex harmonic) differentials with polar sin- gularities and obtained in terms of them a generalization of Abel's theorem. Indepen- dently, Y. Kusunoki defined.the semiexact canonical (meromorphic) differentials and gave in terms of them a formulation of Abel's theorem and the Riemann-Roch theorem on R [27]. It was proved that a meromorphic dif- ferential df = du + idv is semiexact canonical if and only if du is (real) distinguished, and then u is almost constant on every ideal boundary component of R (in appropriate tcompactifi- cation of R). Hence every meromorphic func- tion f for which df is distinguished is almost constant on every ideal boundary component, and therefore f reduces to a constant by the boundary theorem of Riesz type if R has a large boundary. Whereas by the Riemann- Roch theorem above a nonconstant meromor- phic function f such that df is (exact) canon- ical exists on any open Riemann surface R with finite genus, and f gives a canonical par- allel slit mapping of R ( Section G). H. L. Royden [28J and B. Rodin also gave gener- alizations of the Riemann-Roch theorem. M. Yoshida, H. Mizumoto, M. Shiba, and others further generalized the Kusunoki type theorems. The Riemann-Roch theorem for a closed Riemann surface can be deduced from that for open Riemann surfaces by considering an open Riemann surface obtained from a closed surface by deleting a point. Riemann's period relation on R has been studied for various classes of differentials, but for the case of infinitely many non vanishing periods, no definitive result has been obtained. The same is true for the theory of Abelian differentials with infinitely many singularities. On the other hand, the analogy to the classical theory is lost completely if no restriction is posed on the differentials on R. In this con- text the following results due to Behnke and Stein [26J are outstanding' (1) There exists an Abelian differential of the first kind on R with infinitely many given periods. (2) For two discrete sequences {Pn} and {qn} of points in R, there exists a single-valued meromorphic function with zeros at Pn and poles at qm' It is further proved that there exists an Abelian differential with prescribed divisor and peri- ods (Kusunoki and Sainouchi). This gener- alizes the results above and the Gunning- Narasimhan theorem. References [IJ H. Weyl, Die Idee der Riemannschen Flache, Teubner, 1913, third edition, 1955; 1368 English translation, The concept of a Riemann surface, Addison-Wesley, 1964. [2J S. Stoi10w, Lei;ons sur les principes topo- logiques de la theorie des fonctions analy- tiques, Gauthier-Villars, 1938. [3J L. Ahlfors, Zur Theorie der Uberlagerungs- flachen, Acta Math., 65 (1935),157-194. [4J R. Nevan1inna, Uniformisierung, Springer, 1953. [5J H. Behnke and F. Sommer, Theorie der analytischen Funktionen einer komplexen Veranderlichen, Springer, second revised edi- tion, 1962. [6J A. Pfluger, Theorie der Riemannschen Flachen, Springer, 1957. [7J G. Springer, Introduction to Riemann surfaces, Addison-Wesley, 1957. [8J L. V. Ahlfors (ed.), Contribution to the theory of Riemann surfaces, Ann. Math. Studies, Princeton Univ. Press, 1953. [9J L. V. Ahlfors and L. Sario, Riemann sur- faces, Princeton Univ. Press, 1960. [IOJ M. Schiffer and D. C. Spencer, Func- tionals of finite Riemann surfaces, Princeton Univ. Press, 1954. [11J L. Bers, Riemann surfaces, Lecture notes, New York Univ., 1957. [12J M. Tsuji, Potential theory in modern function theory, Maruzen, 1959 (Chelsea, 1975). [13J R. C. Gunning, Lectures on Riemann surfaces, Princeton Univ. Press, 1966. F or the classification of Riemann surfaces  [6,9J and the following: [14J C. Constantinescu and A. Cornea, Ideale Rander Riemannscher Flachen, Springer, 1963. [15J L. Sario and M. Nakai, Classification theory of Riemann surfaces, Springer, 1970. [16J M. Sakai, Analytic functions with finite Dirichlet integrals on Riemann surfaces, Acta Math., 142 (1979), 199-220. For analytic mappings of Riemann surfaces  [4,6, 9J and the following: [17J B. Rodin and L. Sario, Principal func- tions, Van Nostrand, 1968. [18J L. Sario and K. Noshiro, Value distri- bution theory, Van Nostrand, 1966. [19J L. Sario and K. Oikawa, Capacity func- tions, Springer, 1969. [20J L. Ahlfors, Open Riemann surfaces and extremal problems on compact subregions, Comm. Math. Helv., 24 (1950),100-134. [21J R. Gunning and R. Narasimhan, Immer- sion of open Riemann surfaces, Math. Ann., 174 (1967),103-108. [22J E. Bishop, Subalgebras of functions on a Riemann surface, Pacif. J. Math., 8 (1958), 29- 50. [23J M. H. Heins, Algebraic structure and conformal mapping, Trans. Amer. Math. Soc., 89 (1958), 267-276. 
1369 [24] H. Iss'sa, On the meromorphic function field of a Stein variety, Ann. Math., (2) 83 (1966),34-46. For the generalization of algebraic functions and Abelian integrals  [4,6,9] and the fonowing: [25] R. Nevanlinna, Quadratisch integrierbare Differentiale auf einer Riemannschen Mannig- faltigkeit, Ann. Acad. Sci. Fenn., 1 (1941), 1-34. [26] H. Behnke and K. Stein, Entwicklung analytischer Functionen auf Riemannschen Fliichen, Math. Ann., 120 (1949), 430-461. [27] Y. Kusunoki, Theory of Abelian integrals and its applications to conformal mappings, Mem. Col1. Sci. Univ. Kyoto, (A, Math.) 32 (1959),235-258; 33 (1961), 429-433. [28] H. L. Royden, The Riemann-Roch theorem, Comment. Math. Helv., 34 (1960), 37-51. Also  reference to 11 Algebraic Functions, 416 Teichmiiner Spaces. 368 (111.9) Rings A. Definition A nonempty set A is caned a ring if the fonow- ing conditions are satisfied. (1) Two toperations, caned addition and multiplication (the ring operations), are defined, which send an arbitrary pair of elements a, b of A to elements a + band ab of A. (2) For arbitrary elements a, b, c of A, these operations satisfy the fonowing four laws: (i) a+b=b+a (commutative law ofaddition); (ii) (a+b)+c=a+(b+c), (ab)c=a(bc) (associative laws); (iii) a(b+c)=ab+ac, (a+b)c=ac+bc (distributive laws); and (iv) for every pair a, b of elements of A, there exists a uniq ue element c of A such that a+c=b. Thus a ring A is an tAbelian group under addition. Each element a of a ring A determines operations La and Ra defined by La(x)=ax, Ra(x)=xa (xEA). Thus a ring A has the structure of a t1eft A-module and a tright A-module. Since the operations La and Rb commute for every pair a, b of ele- ments of A, the ring A is also an A-A-bimodule ( 277 Modules). The identity element of A under addition is caned the zero element and is denoted by O. It satisfies the equation aO=Oa=O (aEA). An element eEA is caned a unity element (identity element or unit element) of A if it satisfies ae = ea=a (aEA). If A has such a unity element, it is uniq ue and is often denoted by 1. A ring with unity element is caned a unitary ring. Most of the important examples of rings are unitary. 368B Rings Hence we often can a unitary ring simply a ring. If a ring has only one member (namely, 0), then 0 is the unity element of the ring. Such a ring is caned a zero ring. However, if a ring has more than one element, the unity element is distinct from the zero element. A ring is caned a commutative ring if it satisfies (v) ab =ba (a,bEA) (commutative law for multipli- cation) ( 67 Commutative Rings). In this article we shan discuss associative rings. Certain non associative rings are impor- tant; an example is tLie algebra. (An algebra is a ring having a tground ring.) B. Further Definitions An element a oF- 0 of a ring A is caned a zero divisor if there exists an element b oF- 0 such that ab=O or ba=O. A commutative unitary ring having more than one element is caned an integral domain if it has no zero divisors ( 67 Commutative Rings). Elements a and b of a ring are said to be orthogonal if ab = ba = O. An element a satisfying an = 0 for some positive integer n is caned a nilpotent element, and a nonzero element a satisfying a 2 = a is caned an idempotent element. An idempotent element is said to be primitive if it cannot be represented as the sum of two orthogonal idempotent elements. For any subsets Sand T of a ring A, let S + T(ST) denote the set of elements s + t(st) (SES, tE T). In particular, SS is denoted by S2 (similarly for S3, S4, etc.), and further- more, {a} + S( {a} S) is denoted by a + S(aS). If ST= TS = {O}, then subsets Sand T are said to be orthogonal. A subset S of a ring is said to be nilpotent if sn = 0 for some positive integer n, and idempotent if S2 = S. For an element a of a unitary ring A, an element a' such that a' a = 1 (aa' = 1) is caned a left (right) inverse element of a. There exists a left (right) inverse element of a if and only if A is generated by a as a left (right) A-module. If there exist both a right inverse and a left in- verse of a, then they coincide and are uniq uely determined by a. This element is caned the inverse element of a and is denoted by a -1. An element that has an inverse element is caned an invertible element (regular element or unit). The set of an invertible elements of a unitary ring forms a group under multiplication. A nonzero unitary ring is caned a tskew field (or tdivision ring) if every nonzero element is inver- tible. Furthermore, a skew field that satisfies the commutative law is caned a commutative field or simply a field ( 149 Fields). In a general ring A, if we define a new operation (a, b)...... a 0 b by setting aob=a+b-ab, then A is a tsemigroup with the identity element o with respect to this operation. The inverse 
368C Rings element under this operation is called the qnasi-inverse element, and an element that has a quasi-inverse element is called a qnasi- invertible element (or qnasiregnlar element). An element a of a unitary ring is quasi-invertible if and only if the element I - a is invertible. C. Examples (1) Rings of numbers. The ring Z of rational integers, the rational number field Q, the real number field R, and the complex number field C are familiar examples ( 14 Algebraic Number Fields, 257 Local Fields). (2) Rings of functions. The set K I of func- tions defined on a set I and taking values in a ring K forms a commutative ring under point- wise addition and multiplication. In particular, let K = R, and let I be an interval of R. Then the set CO (I) of continuous functions, the set C(I) of functions that are r-times continuously differentiable, and the set CW(I) of analytic functions on I are su brings ( Section E) of R / . (3) Rings of expressions. The set K [X l' '" , Xn] of polynomials and the set K [[ XI' ... , Xn]] oftformal power series in indeterminates X I' ..., X n with coefficients in a commutative ring K are commutative rings ( 369 Rings of Polynomials, 370 Rings of Power Series). (4) Endomorphism rings of modules. The set tS'K(M) of tendomorphisms of a tmodule M over a ring K is in general a noncommutative ring. In particular, if M is a finite-dimensional tlinear space over a field K, then tS'K(M) can be identified with a tfull matrix ring ( 256 Linear Spaces, 277 Modules). (5) For other examples  29 Associative Algebras, 36 Banach Algebras, 67 Commuta- tive Rings, 284 Noetherian Rings, and 439 Valuations. D. Homomorphisms A mapping j : A --> B of a ring A into a ring B satisfying conditions (i) f(a+ b) = f(a) + f(b) and (ii) f(ab) = f(a)f(b) (a, bE A) is called a homomorphism. If a homomorphism f is tbijective, then the inverse mapping f-I: B--> A is also a homomorphism, and in this case f is called an isomorphism. More precisely, a homomorphism (isomorphism) of rings is often called a ring humomorphism (ring isomor- phism). There exists only one homomorphism of any ring onto the zero ring. For unitary rings A and B, a homomorphism f: A-->B is said to be nnitary if it maps the unity element of A to the unity element of B. By a homomor- phism, a unitary homomorphism is usually meant. In this sense there exists a unique hom- 1370 omorphism of the ring Z of rational integers into an arbitrary unitary ring. A composite of homomorphisms is also a homomorphism. The identity mapping l A of a ring A is an isomor- phism. A homomorphism of a ring A into itself is called an endomorphism, and an isomor- phism of A onto itself is called an antomor- phism of A. If a is an invertible element of a unitary ring A, then the mapping x -->axa- 1 (xEA) is an automorphism of A, called an inner antomorphism. When condition (ii) for a homomorphism is replaced by (ii') f(ab) = f(b)f(a) (a, bEA), a mapping satisfying (i) and (ii') is called an antihomomorphism. In particular, if an anti- homomorphism f is bijective, then the inverse mapping f- 1 is also an antihomomorphism, and f is called an anti-isomorphism. Antiendo- morphisms and antiantomorphisms are defined similarly. E. Snbrings, Factor Rings, and Direct Prodncts A subset S of a ring A is called a snbring of A if a ring structure is given on S and the canon- ical injection S --> A is a ring homomorphism. Thus the ring operations of S are the restric- tions of those of A. If we deal only with uni- tary rings and unitary homomorphisms, then a subring S necessarily contains the unity ele- ment of A. The smallest subring containing a subset T of a ring A is called the subring gen- erated by T. The set of elements that commute with every element of T forms a subring and is called the commntor (or centralizer) of T. In particular, the commutOr of A itself is called the center of A. A tquotient set AIR of a ring A by an equiv- alence relation R is called a factor ring (quo- tient ring) of A if a ring structure is given on AIR and the canonical surjection A-->AIR is a ring homomorphism. This is the case if and only if the equivalence relation R is com- patible with the ring operations (i.e., aRa' and bRb' imply (a + b)R(a' + b') and (ab)R(a'b')). Let ex and fJ be elements of the factor ring AIR, represented by a and b, respectively. Then the definition of factor ring implies that ex + fJ(exfJ) is the equivalence class represented by a + h(ab). Every ring A has two trivial factor rings, namely, A itself and the zero ring O. If A has no nontrivial factor rings, then A is called a q nasisimple ring ( Secti on G). If f: A --> B is a ring homomorphism, then the image f(A) is a subring of B, and the equivalence relation R in A defined by f(aRb= f(a) = f(h)) is com- patible with the ring operations of A. Thus f induces an isomorphism AIR --> f(A) ( Sec- tion F). If {A;} iEI is a family of rings, the Cartesian 
1371 product A = II jel Ai forms a ring under the componentwise operations (a j ) + (b;) = (a i + bj) and (aj)(b j ) = (ajb;). This ring is called the direct product ofthe family of rings {A;}iel' The mapping pj:AAi that assigns to each (a j ) its ith component ai is called a canonical homo- morphism. For any set of homomorphisms J;:BAj (iEI), there exists a unique homomor- phismf:BA such that J;=pjoffor each i. F. Ideals A subset of a ring A is called a left (right) ideal of A if it is a submodule of the left (right) A- module of A ( 277 Modules). In other words, a left (right) ideal J of A is an additive sub- group of A such that AJ cJ (JA cJ). Under the operations induced from A, J is a ring (however, J is not necessarily unitary). A sub- set of A is called a two-sided ideal or simply an ideal of A if it is a left and right ideal. For an ideal J of a ring A, we define a rela- tion R in A by aRba-bEJ. Then R is an equivalence relation that is compatible with the operations of A. Each equivalence class is called a residue class modulo J, and the quo- tient ring A/ R is denoted by AjJ and called the residue (class) ring (or factor ring) modulo J. If it is a field, it is called a residue (class) field. Conversely, given an equivalence relation R that is compatible with the operations of A, the equivalence class of 0 forms an ideal J of A, and the equivalence relation defined by J coincides with R. Iff:AB is a ring homomorphism, then the tkernel of f as a homomorphism of addi- tive groups forms an ideal J of A, and f in- duces an isomorphism AjJ  f(A). It S is a subring and J is an ideal of a ring A, then S + J is a subring of A and S n J is an ideal of S. Furthermore, the natural homomorphism S  (S +J)jJ induces an isomorphism s/snJ  (S + J)/ J (isomorphism theorem). A left (right) ideal of a ring A is said to be maximal if it is not equal to A and is properly contained in no left (right) ideal of A other than A. Similarly, a left (right) ideal of A is said to be minimal if it is nonzero and properly contains no nonzero left (right) ideal of A. If e is an idempotent element of a unitary ring A, then l-e and e are orthogonal idem- potent elements, and A = Ae + A (1- e) is the direct sum of left ideals. This is called Peirce's left decomposition. Peirce's right decomposition is defined similarly. A left ideal J of A can be expressed as J = Ae for some idempotent element e if and only if there exists a left ideal J' such that A =J +J' is a direct sum decom- position. More generally, if e 1 , ..., en are ortho- gonal idempotent elements whose sum is equal 368G Rings to 1, then A=Ae 1 +...+Ae n is the direct sum ofleft ideals. Conversely, if A =J 1 +... +I n is the direct sum of left ideals and 1 = e 1 + ... + en (eiEJ i ) is the corresponding decomposition of the unity element, then e l' ... ,en are ortho- gonal idempotent elements. In particular, if J l' . .. ,I n are two-sided ideals, then each J j is a ring with unity element e j , and by a natural correspondence, the ring A is isomorphic to the direct product II7=1 J j . In this case, A is called the direct sum of ideals J l' ... ,In and is denoted by A = €:J7=1 J j , or A = 7=1 J j . A ring A is called a left (right) Artinian ring if it is tArtinian as a left (right) A-module (i.e., if A satisfies the tminimal condition for left (right) ideals of A). A ring A is called a left (right) Noetherian ring if it is tNoetherian as a left (right) A-module (i.e., if A satisfies the tmaximal condition for left (right) ideals of A). If A is commutative, left and right are omitted in these definitions. The property of being Artinian or Noetherian is inherited by quo- tient rings and the direct product of a finite family of rings, but not necessarily by subrings. For general rings, the maximal and minimal conditions for left (right) ideals are indepen- dent, but for unitary rings, left (right) Artinian rings are necessarily left (right) Noetherian (Y. Akizuki, C. Hopkins). G. Semisimple Rings The statement that a unitary ring A is tsemi- simple as a left A-module is equivalent to the statement that A is semisimple as a right A- module; in this case A is called a semisimple ring ( Section H). Every module over a semi- simple ring is also semisimple. A semisimple ring is left (right) Artinian and Noetherian. A semisimple ring is called a simple ring if it is nonzero and has no proper ideals except {O}, that is, if A is a quasisimple ring. Thus A is a simple ring if and only if A is a nonzero, uni- tary, quasisimple, left (right) Artinian ring. If A is a semisimple ring, then it has only a finite number of minimal ideals AI' ...,An, and A is expressible as the direct sum A = AI + .,. + An' where each A j is a simple ring, called a simple component of A. Any ideal of A is the direct sum of a finite number of simple components of A. Quotient rings of a semisimple ring and the direct product of a finite number of semi- simple rings are also semisimple. Any left (right) ideal of a semisimple ring A is expressible as Ae (eA) for some idempotent element e, and Ae (eA) is minimal if and only if e is primitive. In particular, a minimal left (right) ideal is a simple left (right) A-module that is contained in a certain simple compo- nent. Two minimalleft (right) ideals are iso- 
368H Rings morphic as A -modules if and only if they are contained in the same simple component. If Ai' 1  i  n, are the simple components of A, then for each simple left A-module M there exists a unique i such that AiM # {O}, and M is isomorphic to a minimal left ideal contained in Ai' If M is a finite-dimensional linear space over a (skew) field D, then the endomorphism ring A =tS'D(M) of Mover D is a simple ring. Con- versely, for any ring A, the endomorphism ring D=tS'A(M) of a simple A-module M is a (skew) field (Schnr's lemma). If A is a simple ring, then any simple A-module M can be considered as a finite-dimensional linear space over D = tS'A(M), and A is isomorphic to tS'D(M) (Wed- derbnrn's theorem). Furthermore, if r is the dimension of Mover D, then tS'DUvf) is isomor- phic to the full matrix ring M,(DO) of degree r over the field DC, which is anti-isomorphic to D. The dimension r is also equal to the tlength of A as an A-module. The center of A = tS'D(M) is isomorphic to the center of D, which is a commutative field. Thus a simple ring is an associative algebra over a commuta- tive field ( 29 Associative Algebras). H. Radicals Let A be a ring. Then among ideals consisting only of quasi-invertible elements of A, there exists a largest one, which is called the radical of A and denoted by 91(A). The radical of the residue ring AI91(A) is {O}. A ring A is called a semiprimitive ring if 91(A) is {O}. On the other hand, A is called a left (right) primitive ring if it has a tfaithful simple left (right) A-module. The radical \R(A) is equal to the intersection of all ideals J such that AIJ is a left (right) primitive ring. In a unitary ring A, 91(A) coincides with the intersection of all maximal left (right) ideals of A. Furthermore, in a left (right) Ar- tinian ring A, 91(A) is the largest nilpotent ideal of A, and the condition 91( A) = {O} is equivalent to the condition that A is a semi- simple ring. Among ideals consisting only of nilpotent elements of A, there exists a largest one, which is called the nilradical (or simply the radical) and denoted by 9l(A) ( 67 Commutative Rings). The nilradical of AI91(A) is {O}. In general, 91(A) is contained in 91(A), and if A is left (right) Artinian, we have 91(A)= 91(A). A ring A is called a semiprimary ring if AI91(A) is left (right) Artinian and therefore semisimple. Furthermore, a ring A is called a primary ring (completely primary ring) if AI91(A) is a simple ring (skew field). A primary ring is isomorphic to a full matrix ring over a completely primary ring. 1372 References [1] B. L. van der Waerden, Algebra, Springer, I, 1966; II, 1967. [2] N. Jacobson, The theory of rings, Amer. Math. Soc. Math. Surveys, 1943. [3] E. Artin, C. J. Nesbitt, and R. M. Thrall, Rings with minimum condition, Univ. of Michigan Press, 1944. [4] N. Jacobson, Structure of rings, Amer. Math. Soc. Colloq. Pub!., 1956. [5] N. Bourbaki, Elements de mathematique, Algebre, ch. 1,8, Actualites Sci. Ind., 1144b, 1261a, Hermann, 1964, 1958. [6] C. Chevalley, Fundamental concepts of algebra, Academic Press, 1956. [7] N. Jacobson, Lectures in abstract algebra I, Van Nostrand, 1951 (Springer, 1976). [8] R. Godement, Cours d'algebre, Hermann, 1963. [9] J. P. Jans, Rings and homology, Holt, Rinehart and Winston, 1964. [10] S. Lang, Algebra, Addison-Wesley, 1965. [11] N. H. McCoy, The theory of rings, Mac- millan, 1964. 369 (111.13) Rings of Polynomials A. General Remarks In this article, we mean by a ring a tcommuta- tive ring with tunity element. Let R be a ring, and let X j, ... ,X n be variables (letters, indeter- minates, or symbols). Then the set of tpoly- nomials in XI"'" X n with coefficients in R is called the ring of polynomials (or polynomial ring) in n variables XI' "', X n over R and is denoted by R[X 1 , ...,X n ] (67 Commuta- tive Rings; 284 Noetherian Rings; 337 Poly- nomials; 368 Rings). On the other hand, when Rand R' are rings with common unity element and R c R', then for a subset S of R' we denote the subring of R', generated by S over R, by R[S]. When S = {XI' ..., x n }, then there is a homomorphism <p of the polynomial ring R[X j, ..., Xn] onto R[S] defined by <p(ail "inXil...Xn)=ail inxil...xn (a i ,.. i n ER ). If cp is an isomorphism, then XI' ., 'X n are said to be algebraically independent over R; and otherwise, algebraically dependent over R. Thus the ring of polynomials in n variables over R may be regarded as a ring R[XI'''' ,x n ] generated by a finite system of algebraically independent elements XI, ''', X n over R. 
1373 B. Ideals, Homogeneous Rings, and Graded Rings Consider the polynomial ring R[X]=R[X 1 , ..., Xn] in n variables over a ring R. A poly- nomial fER[X] is a tzero divisor if and only if there is a nonzero member a of R for which af =0. If a is an tideal of R, then R[X]/aR[X]  (Ria) [X l' '" ,X n ]. Therefore, if p is a tprime ideal of R, then pR[X] is a prime ideal of R[X]. If R is a tunique factorization domain (u.f.d.), then R[X] is also a u.f.d. If R is a nor- mal ring, then so is R[X]. By the tHilbert basis theorem, if R is tNoetherian, then R[X] is also Noetherian. If m is the tKrull dimen- sion of R, then Krull dim R [X] ;::. n + m; the equality holds if R is Noetherian. If R is a field, then R[X] is not only a u.f.d. but also a tMacaulay ring. A homogeneons ideal of R[X] is an ideal generated by a set of thomogeneous poly- nomials fA (the degree of fA may depend on A). When a is a homogeneous ideal, an element in R[X]/a is defined to be a homogeneons ele- ment of degree d if it is the class of a homoge- neous polynomial of degree d modulo a, and the quotient ring R[X]/a is called a homoge- neons ring. More generally, assume that a ring R is, as a tmodule, the tdirect sum o R, of its submodules R, (i=O, 1,2,...) and that R,RjcR'+j for every pair (i,j). Then we call R a graded ring, and an element in Rd a homo- geneons element of degree d. (In some literature the term graded ring is used in a wider sense; see below) In a graded ring R, if an ideal is generated by homogeneous elements, then the ideal is called a homogeneons ideal (or graded ideal). In a graded ring R = o R;, if the ideal 1 R, has a finite basis, then R is generated (as a ring) by a finite number of elements over its subring Ro. Therefore, the graded ring R =  R, is Noetherian if and only if Ro is Noetherian and R is generated by a finite number of elements over Ro. In this case, every homogeneous ideal is the intersection of a finite number of homogeneous tprimary ideals, and every prime divisor of a homogeneous ideal is a homogeneous prime ideal. The notion of a graded ring is generalized further as follows: A ring R is graded by an additive semigroup I (containing 0) if R is 'EIR, (direct sum) and if RtRjcR,+j. C. Zero Points (1) Zero Points in an Affine Space. We consider the polynomial ring K[X] =K[XJ, '" ,X n ] in n variables over a field K and a field a containing K. A point (ai, .." an) of an n- dimensional taffine space an = {(aJ,..., an) I atE 369D Rings of Polynomials a} is called a zero point of a subset S of K[X] if f(a 1 , ..., an) =0 for every f(X 1 ,..., Xn)ES. A point (aI, ... ,an) is called algebraic over K (K-rational) if every a, is algebraic over K (is an element of K). In this way we define alge- braic zero points and rational zero points. Zero points of S are zero points of the tideals generated by S. Therefore, in order to investi- gate the set of zero points of S, we may restrict ourselves to the case where S is an ideal. De- note by V(S) the set of zero points of S. If aI' a2 are ideals of K[X], then (i) V(a1 n a 2 )= V(a 1 a 2 )= V(a 1 )U V(a 2 ); (ii) V(a 1 +a 2 )= V(a 1 )n V(a2); and (iii) if a 1 and a2 have a common tradical, then V(arJ = V(a 2 ). (2) Zero Points in a Projective Space. A point (Aa 1 , .." Aa n ) of an (n -I)-dimensional tprojec- tive space over a (with aiEa, some a;,;o!' 0, AEa, A ¥= 0) is called a zero point of a polynomial f(XJ,... ,X n ) if,! being expressed as J, with homogeneous polynomials J, of degree i, J,(a 1 ,... ,an)=O for every i (this condition holds if and only if f(Aa 1 , ." , Aa n ) = 0 for any element A in a, provided that a contains in- finitely many elements). Therefore, zero points of a subset S of K [X] are zero points of the smallest homogeneous ideal containing S. Thus, in order to study the sets of zero points, it is sufficient to consider sets of zero points of homogeneous ideals, and propositions similar to (i), (ii), and (iii) of part 1 of this section hold for homogeneous ideals aJ, a 2 . D. The Normalization Theorem Let a be an ideal of theight h in the poly- nomial ring K[X] = K[X 1 ,... ,X n ] in n vari- ables over a field K. Then there exist elements YJ, ..., y" of K[X] such that (i) K[X] is tin- tegral over K[Y] = K[Y1' ..., Y,,] and (ii) Y 1 , ..., y" generate an K[Y] (normalization theorem for polynomial rings). Using this theorem, we obtain the following important theorems on finitely generated rings. (1) Normalization theorem for finitely gen- erated rings. If a ring R is finitely generated over an integral domain I, then there exist an element a (¥=O) of I and algebraically indepen- dent elements ZI, ..., z, of R over I such that the tring of quotients Rs (where S = {an I n = 1,2,...}) is integral over I[a- 1 ,zl' ...,z,l (2) If p is a prime ideal of an integral domain R that is finitely generated over a field K, then (height of p)+ (tdepth of p) = (ttranscendence degree of Rover K), and the depth of p coin- cides with the transcendence degree of Rip over K. In particular, if m is a maximal ideal of R, then Rim is algebraic over K. 
369 E Rings of Polynomials (3) Hilbert's zero point theorem (Hilbert Nullstellensatz). Let a be an ideal of the poly- nomial ring K[XJ = K [X 1, "', XnJ over the field K, and assume that the field a containing K is talgebraically closed. If fEK[XJ satisfies the condition that every algebraic zero point ( Section C) of a is a zero point of.r. then some power of f is contained in a. E. Elimination Theory Let f1' ... ,fN be elements of the polynomial ringR=l[Xj,"',X m , Y 1 ,..., y"J inm+n variables over an integral domain 1. For each maximal ideal m of 1, let CPmbe the canonical homomorphism with modulus m, and let am be an algebraically closed field containing 11m. Let w,,, be the set of points (aI, ..., an) of the n- dimensional affine space a, over an, such that the system of equations CPmU;)(Xj,..., X m , a j, ... ,an) = 0 (i = 1, 2, ... , N) has a sol ution in a;::. To eliminate X j, ..., X m from f1' ...,fN is to obtain g(Y 1 ,..., Y,,)E l[Yj,"', y"J such that every point of w,,, is a zero point of CPm(g) for every m; such a 9 (or an equation 9 = 0) is called a resultant of f1' ... ,fN' The set a of resultants forms an ideal of l[Yj, ..., Y,,], and {g l' ... , 9 M} is called a system ofresultants if the radical of the ideal generated by it coin- cides with a. If 1 is finitely generated over a field, then, denoting by b the radical of the ideal generated by f1' ... ,fN' we have a = b n 1[Y 1 ,..., YJ. In particular, let 1 be a field. It is obvious that W(O) is contained in the set V of zero points of a. However, it is not necessarily true that V = W(O)' If every J; is homogeneo us in X" ... , X m and also in Y 1 , ..., Y", then we have V= W(O)' If we wish to write a system of resultants explicitly, we can proceed as follows: Regard the J; as polynomials in XI with coefficients in 1[X 2 , ..., X m , Yj, ..., y"J, and obtain resultants R(J;,fj) by eliminating XI from the pairs ./;, fj. Then eliminate X 2 from these resultants, and so forth. To obtain R(J;,fj), we may use Sylvester's elimination method. Namely, let f and 9 be polynomials in x with coefficients in 1:f =aOXm+a1xm-1 + '" +a m , g=boxn+ b 1 x n - 1 + ... + b n . Let D(f, g) be the following determinant of degree m + n: "{ a o a 1 am 0 0 o a o a m - 1 am 0 0 0 0 a o a 1 a m - 1 am' m{ b o b 1 b n - 1 b n 0 0 0 ... 0 b o b 1 ... b n - 1 b n 1374 Then D(f, g) = 0 if and only if either f and 9 have a common root or a o = b o = O. Therefore, if 1 is a u.f.d. and ao and b o have no common factor, then D(f, g) is the required resultant R(f, g). (For other methods of elimination See B. L. van der Waerden, Algebra, vol. II. For criteria on whether a finitely generated ring over a field is tregular  370 Rings of Power Series B.) F. Syzygy Theory (1) Classical Case. The notion of syzygy was introduced by Sylvester (Phil. Trans., 143 (1853)), then generalized and clarified by Hil- bert [3J, whose definition can be formulated as follows: Let R = k[X p..., XnJ be a poly- nomial ring of n variables over a field k. R has the natural gradation (i.e., R is a graded ring in which each Xi (I  i  n) is of degree I and elements of k are of degree 0). Let M be a finitely generated graded R-module. If f1' ..., fm form a minimal basis of Mover R consisting of homogeneous elements, we introduce m indeterminates U 1 , ..., U m and put F=L1,;;j,;;mRuj, the free R-module gen- erated by U l' .. . ,U m . Set deg( u) = deg(fj) (1  j  m) and supply F with the structure of a graded R-module. Let cP be the graded R- homomorphism of F onto M defined by cp(u j ) = fj. Then N = Ker(cp) is a graded R-module uniquely determined by M up to isomorphism (of graded R-modules); N is called the first syzygy of M. For a positive integer r, the rth syzygy of M is inductively defined as the first syzygy of the (r - 1 )st syzygy of M. The Hilbert syzygy theorem states that for any finitely generated graded R-module M, the nth syzygy of M is free. In other words, M admits a free resolution of length  n, i.e., an exact sequence of the form 0-. F(V)-.... -.F<0-.F(O)-. M -.0, where v  n and each F(i) (0  i  v) is a finitely generated free graded R-module. It follows that if Md denotes the homogeneous part of degree d in M, there exists a polynomial P(X) of degree  n -1 such that dimdMd) = P(d) for sufficiently large d; P(X) is called the Hilbert polynomial (or characteristic function) of the graded R-module M. (2) Generalization by Serre. The syzygy theory was generalized by J.-P. Serre [2J as follows: Let R be a Noetherian ring and M a finitely generated R-module. Then we can find a fi- nitely generated free R-module F and an R- homomorphism cP of F onto M. The kernel of cP, called a first syzygy of M, is not uniquely 
1375 determined by M. However, if N 1 and N z are first syzygies of M, then there exist finitely generated tprojective R-modules PI and Pz such that N I EB PI  N z EB Pz ( 277 Modules K). For a positive integer r, an rth syzygy is defined inductively as in (1) of this section. An important result of Serre is that R is a tregular ring of tKrull dimension at most n if and only if an nth syzygy of every finitely generated R- module is tprojective. (3) Serre Conjecture. D. Quillen (Invertiones Math., 36 (1976)) and A. Suslin (DokI. Akad. Nauk SSSR (26 Feb. 1976)) solved the Serre conjecture by proving that every projective module over a ring of polynomials over a field is free. (4) Special Cases. In the following special cases, we can define the first syzygy of M uniquely up to isomorphism: (i) R is a Noe- therian tlocal ring and M is a finitely gen- erated R-module; and (ii) R is a graded Noe- therian ring d"O Rd' where Ro is a field and M is a finitely generated graded R-module [4]. References [1] M. Nagata, Local rings, Wiley, 1962 (Krieger, 1975). [2] J.-P. Serre, Algebre locale, multiplicites, Lecture notes in math., Springer, 1965. [3] D. Hilbert, Dber die Theorie der algebrai- schen Formen, Math. Ann., 36 (1890),473- 534. [4] J.- P. Serre, Sur la dimension homologique des anneaux et des modules noetheriens, Proc. Intern. Symp. Alg. Number Theory, Tokyo and Nikko (1955),175-189. Also  references to 284 Noetherian Rings. 370 (111.14) Rings of Power Series A. Rings of Formal Power Series ( 67 Com- mutative Rings; 284 Noetherian Rings; 368 Rings) Let R be a commutative ring with unity ele- ment 1. Let Fd be the module ofthomogeneous polynomials of degree d in XI' ... ,X n with coefficients in R. A formal infinite sum .ro ad =ao+al +... +a n +... + of elements adEFd is called a formal power series or simply power series in n variables XI' ..., X n with coefficients in R, and ad is called the homogeneous part of 370 A Rings of Power Series degree d of the power series. The homoge- neous part Go of degree zero is called the constant term. Addition and multiplication are defined by (ad) + (bd) = (ad + b d ), (ad)(bd)=Ai+j=daibj). By these oper- ations, the set of power series forms a com- mutative ring, which is called the ring of (formal) power series (or (formal) power series ring) in X I' ..., X n over R and is denoted by R[[X 1 , ..., Xn]] or R{X 1'"'' X n }. If there is a natural number N such that ad=O for every d > N, then the power series  ad is identified with the polynomial a o + al + ... + aN' Thus R[X 1 , ...,X n ]cR[[X 1 , ...,X n ]]. Set I= iXiR[[Xj,'" ,X n ]]. Then R[[Xj,... ,X n ]] is tcomplete under the I-adic topology ( 284 Noetherian Rings B). Assume that R' is a commutative ring con- taining R and having a unity element in com- mon with R, a' is an ideal of R' such that R' is complete under the a'-adic topology, and Xr. ..., X n are elements of a'. Then an infinite sum Cil...inXI... xn (each ij ranges over nonnegative rational integers and C i, "' in E R) has a well-defined meaning in R' (namely, if Sd is a finite sum of these terms such that  ij  d, then the infinite sum is defined to be limd_",Sd)' This element  Cil...inX1 ... xn is also called a power series in XI' ... 'X n with coefficients in R. The set of such power series in XI, ... , X n is a subring of R', called the power series ring in XI' .,. , X n over R and denoted by R[[XI,.'" xn]] or R {XI' ..., x n }. Defining qJ by qJ( Ci,...inX:' ... Xn) =  Ci,...inX:' ... xn, we obtain a ring homomorphism qJ: R[[X I ,..., Xn]]R[[xr.'" ,x n ]]. If qJ is an isomorphism, then we say that XI' ..., X n are analytically independent over R. If in is a tmaximal ideal of the formal power series ring R[[X 1 , "', X n ]], then m=in nR is a maximal ideal of R and in is generated by m and XI' ... , Xn. An element J of the power series ring is tinvertible if and only if its con- stant term Jo is an invertible element of R, and in this case J- I = .f=oJo-d-1 . (fo - J't. If R is one of the following, then R[[X 1'.'. ,X n ]] is also of the same kind: (i) tNoetherian ring, (ii) tlocal ring, (iii) tsemilocal ring, (iv) tintegral domain, (v) tregular local ring, (vi) Noetherian tnormal ring. But even if R is a tunique fac- torization domain (u.f.d.), R[[X l' ..., Xn]] need not be a u.f.d. (If R is a field, or more generally, if R is a regular semilocal ring, then R[[Xj,..., Xn]] is a u.f.d.). In particular, a formal power series ring k[[X]] in one vari- able X over a field k is an integral domain whose field of quotients is called the field of (formal) power series (or (formal) power series field) in one variable X over k and is denoted by k((X)); an element of k((X)) is expressed uniquely in the form ;:;'ranxn (anEk,rEZ). 
370 B Rings of Power Series B. Rings of Convergent Power Series Let K be a field with multiplicative tvaluation u (for instance, K = C, the complex number field, and u(:x) = lexl for :x E C). A formal power series [(Xj, ,,,,Xn)=CII inXI,...Xn is said to be a convergent power series if there are positive numbers r l , ..., r n , M such that U(C 1, l)rl'" .rn:'( M for every (i I' ... , in). In this case, if al E K and uta;) < rl' then  C i i a\' ." an has its slfm in the tcompletion 1 n I . . of K. The set of convergent power senes IS a subring of K [[XI' "', XnJ]. It is called the ring of convergent power series (or convergent power series ring) in n variables over K and is de- noted by K «XI'"'' X n » or K {XI'"'' X n }. It is a regular local ring of tKrull dimension n. Hence it is a u.f.d. and its completion is K [[X I, ..., XnJJ. If u is a ttrivial valuation, then K «XI'"'' X n » = K[[Xj,''', XnJ]. Weierstrass's Preparation Theorem. For an element[=cl 1 XI..,XnEK«Xj,"', Xn»,asumeth'ato o,=Ofori=O,I,..., r - 1 and Co 0, # O. Then for an arbitrary element 9 of K«Xj, ...,X n », there exists a unique qEK«X 1 , ...,X n » such that 9- qfE;:bXK«XI' ...,X n - 1 ». In partic- ular (considering the case where 9=X), there is an invertible element u of K «X l' ... , X n » such that [u =/0 + [1 X n +... + [,.-1 X-1 + X: (f,EK«X 1 , ...,X n -,»). By this theorem, we see easily that if a is an ideal oftheight h of K «X l' ..., X n », then K «X l' ..., Xn»/a is isomorphic to a ring that is a finite module over K «X I' ..., X n - h ». Ifp is a tprime ideal of K«X I , ...,X n », then pK [[ X j, ... , XnJ J is a prime ideal. The Jacobian Criterion. Let K be a field, and let R be the ring of polynomials K [X j, ... , XnJ, the ring of formal power series K [[X 1, '" , XnJJ, or the ring of convergent power series K«X 1 , ...,X n » in n variables Xl' ''',X n over K. tPartial derivatives c/DX i are well defined in R. For [1' ... ,f,ER, a tJacobian matrix JUI' "', It) is defined to be the t x n matrix whose (i,j)-entry is 1:[,/aX j . Let p be a tprime divisor of the ideal 1f,R, and let q be a prime ideal containing p. If the trank of (J (fl' ... ,f,) modulo q) is equal to the height of p, then the ring R,/ [,R, is a regular local ring. The converse is also true if K is a tperfect field (if K is not a perfect field, then, modifying J( [1' ...,1,), we can have a similar criterion [IJ). C. Hensel Rings A Hensel ring (or Henselian ring) is a com- mutative ring with unity element satisfying the 1376 following two conditions: (i) R has only one maximal ideal TIt (i.e., R is a tquasilocal ring); and (ii) iff, 90' ho are monic polynomials in one variable x (here, a polynomial in x is called monic if the coefficient of the term of the highest degree is 1) such that f -90hoEmR[XJ, goR[xJ+hoR[xJ+mR[xJ=R[x], then there are monic polynomials g, hER[xJ such that [=gh and 9=90, h=ho modulo m. Important examples of Hensel rings are complete local rings, rings of convergent power series, and tcomplete valuation rings. When R is a Hensel ring, a commutative ring R' with unity element such that R' is a finite R- module is the direct sum of a finite number of Hensel rings. For any quasilocal ring Q, there exists a Hensel ring 12, called the Henseliza- tion of Q (for details  [IJ), for which the following statements hold: (i) Q is a tfaithfully flat Q-module; (ii) if m is the maximal ideal of Q, then the maximal ideal of Q is mQ, and Q/mQ = Q/mQ; (iii) if R is a Hensel ring that contains Q and has a maximal ideal n, and n n Q = m, then there is one and only one Q- homomorphism IfJ of Q into R; (iv) if Q is a tnormal ring, then the Q-homomorphism IfJ is an injection; (v) if Q is a local ring. then Q is also a local ring, and Q is dense in Q. References [lJ M. Nagata, Local rings, Interscience, 1962 (Krieger, 1975). [2J O. Zariski and P. Samuel, Commutative algebra II, Van Nostrand, 1960; new edition, Springer, 1975. 371 (XVII I. 1 0) Robust and Nonparametric Methods A. General Remarks Robust and non parametric or distribution-free methods are statistical procedures specifically devised to deal with broad families of proba- bility distributions. In the theory of statistical inference it is usual to assume that the probability distri- bution of the population from which the ob- served values are chosen at random is specified exactly except for a small number of unknown parameters ( 401 Statistical Inference). In practical applications, however, it often hap- pens that the assumptions made for the model, 
1377 especiany those about the shape of the distri- bution, may not hold for the actual data. In such cases robust and/or nonparametric pro- cedures that do not require exact knowledge of the shape of the distribution and yet prove to be relatively efficient or valid are required. The term nonparametric or distribution-free is used for problems of testing hypotheses, and the term robust is mainly used for problems of point estimation ( 396 Statistic; 399 Statis- tical Estimation; 400 Statistical Hypothesis Testing). Although the idea of the sign test appears in the work of J. Arbuthnot (1710), the theoret- ical foundation for nonparametric tests was first given in the proposals for the permutation test by R. A. Fisher (1935), the rank test by F. Wilcoxon (1945), and the test based on U- statistics by H. B. Mann and D. R. Whitney (1947). In the years that fonowed two impor- tant ideas appeared: the concept of asymptotic relative efficiency by E. J. G. Pitman (1948) and the development of the theory of U- statistics by W. Hoeffding (1948). H. Chernoff and I. R. Savage (1958) showed, in studying the asymptotic distribution of a class of rank statistics, that the asymptotic efficiencies of nonparametric tests are incredibly high. These findings accelerated the studies of nonpar a- metric tests; recent progress is summarized in the books by J. Hiijek and Z. Sidak [1], M. L. Puri and P. K. Sen [2], R. H. Randles and D. A. Wolfe [3], and P. J. Huber [6]. On the other hand, G. E. P. Box (1953) first coined the term robustness in his sensitivity studies, in which he investigated how the standard statistical procedures obtained under certain assumptions are influenced when such assumptions are violated. Two papers by J. W. Tukey (1960, 1962) provided the initial founda- tion for robust estimation. J. L. Hodges and E. L. Lehmann (1963) noticed that estimators of location could be derived from nonpara- metric tests and that these estimators have sometimes much higher efficiency than the sample mean. A similar study for scale was made by S. Kakeshita and T. Yanagawa (1967). Huber (1964) proposed an estimator of location by generalizing the method of least squares. Along with the idea of the influence curve introduced by F. R. Hampel (1974) the estimator proposed by Huber has become a core of subsequent studies of robust estimation. K. Takeuchi (1971) proposed an adaptive estimate that is asymptoticany funy efficient for a wide class of underlying distribution functions. The developments of the theory of robust estimation are reviewed by Huber [4-6] and R. V. Hogg [7]. Various proposed estimators are compared in the book by D. F. Andrews et aI. [8]. 371 C Robust and Nonparametric Methods B. The One-Sample Problem Let F(x) be a tdistribution function of a tran- dom variable X, (XI' ... ,X n ) be an indepen- dent trandom sample of size n from F(x), and (XI' ...,x n ) be an observed sample value. The lOOp percentile of F is denoted by P' i.e., F(  p) = p. For testing the thypothesis H:  p  o against the talternative hypothesis A: p> o, the following procedure is proposed. Let i(x 1 , ..., x n ) be the number of Xi that are greater than o. A test procedure by the fol- lowing ttest function q> is tuniformly most powerful in some neighborhood of p=o for the double exponential distribution, where q>(xj, ..., x n ) is defined by the equations { I when i(X1'"'' x n ) >C, q>(xj,...,x n )= G when i(xj,"',xn)=c, o when i(X1' ...,xn)<c (0 < G < 1, 0  C  n). This procedure is caned the sign test. Suppose that F(x) is symmetric about X = 1/2' Let Rt be the rank ofIXi-ol among IX1-ol, ...,IXn-ol, and let '¥(t) = 1, 0 ac- cording as t>O, O. Set n Sn(Xj,..., X n )= I Gn(Rn'¥(Xi-O) i=l for some weights G n (I), ..., Gn(n). The following procedure q>, caned the signed rank test, is also used for testing the hypothesis H:  1/2  o against the alternative hypothesis A: 1/2 > o: t when Sn(x 1 ,,,, ,xn»c, q>(xj, ...,x n )= G when Sn(x 1 , ...,xn)=c, o when Sn(X1' ...,xn)<c. The procedure with Gn(i) = i is frequently used and is caned the Wilcoxon signed rank test, which is the uniformly most powerful rank test in a neighborhood of 1/2 = o for F(x)= 1/(1 +e- X ), the logistic distribution. C. The Two-Sample Problem Let F and G be continuous distribution func- tions of random variables X and Y, respec- tively, (Xj, ... ,X m ) and (Y 1 , ..., Y,,) be the corre- sponding random samples, and (XI'''. ,x m ) and (Yj, "',Yn) be the respective sample values. Consider the problem of testing the hypothesis H: F(x) == G(x) against the alternative hypoth- esis AI : F(x) =1= G(x) or A 2 : F(x)  G(x) for an X and F(x) =1= G(x). When the alternative hypoth- esis A 2 is true, we say that the random vari- able Y is stochastically larger than X and write F> G. A frequently used example of such an alternative hypothesis A 2 is G(x) = F(x - 0), 0 
371 D Robust and Nonparametric Methods > O. Let .x be the family of all strictly increas- ing continuous functions. Then the hypothesis H and the alternative hypothesis A 2 are in- variant under the group of transformations of the form xi = h(x,), yj= h(y) (i= 1,... ,m;j = 1, ... , n; hE .x). The tmaximal invariant statistic in this case is the rank (R 1, ." , Rm) of(Xr."', X m ) or the rank (SI'''', Sn) of(Y1' ..., Y,,) when the combined sample (Xr. ..., X m ; Y 1 , "', Y,,) is ordered in an ascending order. If a test function cp(x 1 , "', x m ; Yr. "', Yn) satis- fies P,,(F,F)rI. and P,,(F,G)?rI. for any F>G, then cp is considered a desirable test, where P,,(F,G)= r.. f cp(x 1 ,,,,,X m ;Y1,...,Yn) X TI dF(x,) TI dG(y). i j Lehmann's Theorem. If cp satisfies the con- ditions stating that yJ > Yj (j = 1, "', n) yield cp(x 1 , ...,xm;Y!,... ,Y:)CP(Xb" "Xm;Yl'''' ,Yn), then P,,(F, G)? P,,(F, F). If in addition cp is a tsimilar test, then cp is unbiased ( 400 Statis- tical Hypothesis Testing C). The Wilcoxon test (or the Mann-Whitney U- test) is described by a test function cp = 1 when U?C and cp=O when U <C, where U is a tU- statistic defined by 1 m n U=- L L if1(x;,Yj) mn j1 j=1 with if1(x,y)= 1 when xY and if1(x,y)=O when x> y. This test is similar and unbiased. Testing the hypothesis H: F = G against the alternative hypothesis A: F ¥= G = F(x/a), a> 1, is another two-sample problem, for which the following test was proposed by T. Tamura. The test function is given by cp = 1 for U? C and cp = 0 for U < c, where U = (2')-1 m- 1 j<j' j<j' if1(Xj, xi'; Y;, Yr1 with if1(u, u'; v, v') = 1 when v < u < v', v < u' < v' or v' < u < v, v' < u' < v and if1(u, u'; v, v') = 0 otherwise. The following statistic TN is used frequently in nonparametric problems. Let XI,'" ,X m ; Y 1, ... ,Yn be arranged in order of magni- tude. Set Zk = +1 or 0 when the kth value (k= 1,2, '" , n + m) in the arrangement is an Xj or Yj' respectively. For a given set of N = n + m reals {ed, TN is defined by T N =m- 1 r=1 ekz k . Set HN(X)=ANFm(X)+(1-AN)Gn(X), where Fm(x) and Gn(x) are the tempirical distribution functions based on (x 1, ... ,x m ) and (Yr. "', Yn), respectively, and 0< Ao  AN=m/N  l-A o < 1. Then TN is represented by the integral f IN(HN(x))dFm(x) with ek=JN(k/N). Chernoff and Savage [9] proved that under some regularity conditions the asymptotic distribution of TN is normal and that the asymptotic mean /1 and the vari- 1378 ance a 2 of TN are given by /1= f J(H(x))dF(x), Na 2 =2(1-A){ff G(x)(I-G(y)) x<y x J'(H(x))J'(H(y))dF(x)dF(y) l-A ff +T F(x)(l - F(y)) x<y x J'(H(X))J'(H(Y))dG(X)dG(Y)}, where J(H)=limN_aJN(H), H(X)=AF(x)+ (l-A)G(x), and A=limAN' When ek=k/N, the statistic TN is equivalent to the tU-statistic in the Wilcoxon test. When e k is the mean E(Zd of the kth order statistic Zk in an independent sample of size N from N(O, 1), then the test by TN is called the Fisher-Yates-Terry normal score test. When J is the inverse function of the distribution function of N(O, 1), then the test is ca11ed the van der Waerden test. D. The k-Sample Problem Let (X jj , j = 1,..., n j ) be a random sample of size n j from the population with a distribu- tion function Fj(x) for each i = 1, ..., k. The k- sample problem is concerned with testing the hypothesis H: F 1 (x) = .., = Fk(X) against an alternative hypothesis AI: not all the Fj(x) are equal, A 2 :F j (x)=F(x-8;j with 8j=l=8, or A 3 :F j (x) = F(x/a,) with aj =1= a. Several tests have been proposed for this problem, using quadra- tic forms of the vector-valued U -statistic U = (U 1 , ..., Uk) whose coordinates U j are defined by means of a function l/Ji(Xl h" .,X 1m1i ;... ;Xkl,'" ,X kmk ), i= 1,..., k. When N =  n j -> 00 with n j = pjN, () < pj < 1, and Pi= 1, then V = (jN (U 1 - E( U 1 )), .,. , jN (Uk - E( Uk))) has asymptotica11y a tmu1tivariate normal distribution N (0, E). Let B be the projection matrix corresponding to the eigenspace for the zero eigenvalues of the matrix E, and let A be a matrix such that AB=O, EA=I -B. The statistic V NV has asymptotically a tnoncen- tral chi-square distribution with degrees of freedom = rank E. Several kinds of test repre- sented by a critical region of the form V A V' > C are proposed, among which the Krnskal- Wallis test is a particular one having . " 1 if1'(x 1 ; ...;xd= L. -b(x.,x,), (X; nina. i,= 1,..., k, 
1379 as basic functions, where o(x, y) = I when x < y and o(x, y) = 0 otherwise. E. Asymptotic Relative Efficiency of Tests If there is more than one test proced ure for a given testing problem, then one may wish to compare these procedures. Let {<p.} and {1/1.} be two sequences oflevel IX tests, where qJ. and 1/1. are test functions based on a sample of size n. The tpower functions of qJ. and 1/1. are denoted by P(OlqJ.) and P(O' 1/1.), respectively. Let 0 be a real parameter and {Oi} be such that 0;->00 as i->oo. Consider a hypothesis 0=0 0 and a sequence {OJ of alternative hypotheses. If, for any increasing sequences {n;} and {nr} of positive integers satisfying IX < Iim P(Od qJ.,) = limp(O;j 1/1.1) < I, Iim nr /n; (= e( {qJ.}, {1/1.}), say) exists and is independent of IX and Iim P(Oi I qJ.,), then e is called Pitman's asymptotic relative efficiency of {qJ.} against N.}. Suppose further that the tests {qJ.} and {1/1.} are based on sta- tistics 1;, = t.(X) and 1;,* = t:(X), respectively, in the following manner: { I when t.(x»c, qJ.(x) = a when t.(x) = c, o when t.(x) < c, { I when t:(x»c*, I/1.(x)= b when t:(x)=c*, o when t:(x)<c*, where X=(X 1 , ..., X.) and X=(X1' ... ,x.). Put 0 0 =0 and 0.= k/fi (k = constant) for simplic- ity. If 1;, and 1;,* are asymptotically normal, then under some conditions e is given by the formula _ I ' (dEo(1;,)/dOlo=0)2(J(1;,*) e - 1m 2 2 . (dE o (1;,*)/dOl o =o) (Jo(1;,) As an example, consider a two-sample prob- lem on a tlocation parameter. If the popula- tion distribution is normal and the Wilcoxon test is used to test the hypothesis of equality of means, then its asymptotic relative efficiency against Student's test is 3/n. For the same problem, the asymptotic relative efficiencies of the Fisher-Yates-Terry normal score test and the van der Waerden test against Student's test are both unity. For the hypothesis of equality of means in the k-sample problem, the asymp- totic relative efficiency of the Kruskal- Wallis test against the F-test is 3/n, provided that the sample is distributed normally. F. Kolmogorov-Smirnov Tests Let F.(x) be the empirical distribution function based on a random sample of size n from 371 G Robust and Nonparametric Methods Fo(x). Set d. = sup I F.(x) - Fo(x)l, D. = sup(F.(x) - Fo(x)), I F.(x) - Fo(x) I s,,= sup , a';Fo(x) Fo(x) S _ F.(x)-Fo(x) . - sup . a';Fo(x) Fo(x} Then Iim Pr(d. <z/fi) =L(z) .oo = f (_I)k e -2k 2 Z 2 , k= - 00 Iim Pr(D. < z/fi) = K(z) = l_e- 2Z " .oo [.(l-D)] ( n ) Pr(D.<D)=I-D I . j=O ] x (I-D-)"-j (D+Y-1, 4 00 IimP r (s.<z/fi)=- I (_I)k n-+oo 1t k=O x (2k+ Ir1e-«2k+1)2n2/8)«I-a)/a)z2, Af, z J a/(l-a) Iim Pr(s.<z/fi)= - e- t2 / 2 dt. n-+oo 1t 0 The statistics d., D., SO'S. are frequently used to test the hypothesis F(x) = Fo(x). (This prob- lem is called testing goodness of fit.) In a two-sample problem, let Fm(x) and G.(x) be two empirical distribution functions based on samples of sizes m and n from F(x) and G(x), respectively. Set d m .. = suPlFm(x) - G.(x)l, Dm.. =sup(Fm(x) - G.(x)). If the hypothesis F = G is true, we have Iim Pr(d m .. < z/.JN) = L(z), IimPr(Dm.. < z/.JN) =K(z), provided that m and n tend to 00 so that N = mn/(m+n)->oo and m/n is constant. Taking account of these facts, d m .. and Dm,. are used to test the hypothesis F = G. The tests using the statistics d., D., d m .O' Dm,. are called Kolmogorov-Smirnov tests. G. Interval Estimation Let (XI,'''' X.) be an independent random sample from the population with a distri- bution function F(x - 0), where 0 is an un- known location parameter, and (x l' ." , x.) be its observed value. Suppose that F(x) is continuous and symmetric about the origin. 
371 H Robust and Nonparametric Methods Using the statistic Sn = S(x 1 , ..., x n ) for the one-sample nonparametric test for testing the hypothesis H: 0 = 0, a tconfidence interval of 8 is constructed as follows. For an appropriately given y (O<y < 1), select constants d l and d 2 in the range of S(x 1 , ..., x n ) that satisfy Po{d 1 <S(X 1 , ...,X n )<d 2 } = 1-y, where Po means the probability under the hypothesis H: 0 = O. If there exist statistics L,(xj, ..., x n ) and Us (X 1 , ..., X n ) such that L,(xj, ''', x n )  0 < Us (X 1 , "', X n ) if and only if d 1 <S(X 1 -8, ...,x n -8)<d 2 for aU 8, then the confidence interval of 0 with 100(1- y)% con- fidence coefficient is given by (Ls(X1' ..., x n ), U,(xj, ..., x n )). This interval is distribution- free, i.e., it holds that P{L,(X 1 , ...,Xn)<O< U,(X I , ...,X n )} = l-y for all F. When Sn is the statistic for the Wilcoxon signed rank test, Ls and Us are given by Ls = W(M+1-d 2 ) and U s = W(M-d,), where M = n(n+ 1 )/2 and W(!) ...  W(M) are ordered values for M averages (Xi + xN2 (i j= 1, 2,..., n). H. Point Estimation Let (XI' '" , X n ) be an independent random sample from the population with a distri- bution function F(x - 8), where 8 is an un- known location parameter, and let (xI> ...,x n ) be its observed value. There are four methods of constructing ro bust estimators of 8. Let X(1) < ... < X(n) be ordered values of XI' "', X.. The first method is to use 7;, = a 1 X(I) +... + anX(n) for some given constants aI, "', an such that Li=1 a i = 1. 7;, is called the L-estimator. An example is 7;,(IX) = (pX(["nJ+1) + X(["nJ+2) + '" + pX(n-["n]))ln(1- 21X), where p= 1 + [lXn] -lXn. This estimator is called the IX-trimmed mean. Let J be a real-valued function such that g J(t)dt = 1, and set a k = JkI)/nJ(t)dt; then as n-HX), 7;, converges to T(F) = gJ(t)P-1(t)dt in probability. Suppose that F is a distribution function having an tabsolutely continuous density function f Denote the derivative of J by J', and let I(F) be the tFisher information on 8. Set l/1(t) = -f'(t)IJ(t) and J(t)= 1/1'(F- 1 (t))II(F). Then Chernoff, J. L. Gastwirth, and M. V. Johns [10] proved that under some regularity con- ditions 7;, is an tasymptotically efficient es- timator of 8 for F. Let p be a real-valued (usually convex) function of a real parameter with derivative 1/1 = p'. The second method is to estimate 8 by 7;, 1380 by minimizing Li=1 P(Xi - 7;,) or by satisfying n L l/1(x i - 7;,)=0. i=l 7;, is caUed the M-estimator. When p(t)=t 2 , it agrees with the least squares estimator. Let <I>(x) be the distribution function of the stan- dard normal distribution, H (x) be an arbi- trary continuous distribution function which is symmetric about the origin, :? be a class of distribution functions of the form F(x) = (I-E)<I>(x) +EH(x) for a given F. (0< E< 1), and V(p, F) be the asymptotic variance of 7;,. Huber [11] proved that p minimizing SUPFE3' V(p,F) is given by PK(t)=t 2 /2, Kltl- K2/2 defined for I tl  K, > K, respectively, for some constant K. Under quite general condi- tions, the M-estimator converges as n-HX) to T(F) in probability, which is defined by S l/1(x- T(F))dF(x)=O. If l/1(t) = -J'(t)IJ(t) is chosen for l/1(x), then 7;, is the tmaximum likeli- hood estimator of 0 for F and is asymptoti- cally efficient under some regularity condi- tions. Generally, the M -estimator defined above is not scale invariant. A scale invariant version of the M-estimator is obtained by replacing the defining equation by f 1/1 ( x i - 7;, ) =0, ,=1 Sn (1 ) where Sn is any robust estimate of scale, e.g., the median of {IXi- MI/0.6745}i=1.2. 'On where M is the sample median, or by solving the simultaneous equations (1) and .f X ( Xi - 7;, ) =0 ,1 Sn with respect to 7;, and Sn' In the context of the maximum likelihood estimation, X is chosen to be X(t) = tl/1(t) - 1 ( 399 Statistical Estimation P). The third method employs nonparametric tests for testing the hypothesis H: 8 = 0 against the alternative hypothesis A: 8 > O. Let J be a real-valued and nondecreasing function such that SbJ(t)dt=O and Rn8) be the rank of IX k - 01 among IX I - 01,..., IX n - 81. Set 'P(t) = 1, 0 according as t>O, O and S(X 1 --0,..., X n - 8) =LZ=1 J«Rt(8)+n)/(2n+ 1))'P(X k -8). Let O*=sup{8;S(X 1 -8, ...,X n -8»I1}, 8** =inf{ 8; S(X 1 - 8,..., Xn-O)< 11}, where Jl is the expected value of S(X 1 , ..., X n ) under the hypothesis H:8=0. Then an es- timator of 8 is defined by 7;, =(8* + 8**)/2. Hodges and Lehmann [12] first proposed this technique, and this estimator is called the R- estimator. When F(x) is symmetric about the origin and J (t) = t - t. S tends to be the statis- tic for the Wilcoxon signed rank test, and the 
1381 R-estimator reduces to the median of n(n+ 1)/2 averages (X j + Aj)/2 (l  i j n). As n-+ 00, the R-estimator converges in probability to T(F), defined by f J( F(X) + 1- (2T(F)-X) )dF(X)=0. For symmetric F, the R-estimator defined by the statistic S with J(t) = - !'(F-1 (t))/J(p-1 (t)) is asymptotica11y efficient under some regular- ity conditions. Although the above three methods provide robust estimators, which are seldom affected by outlying observations or contamination by gross errors, their behavior sti11 depends on F. The last method of constructing robust es- timators consists of estimating () adaptively by utilizing information on the shape of F. Among these, a striking one is the asymptoti- ca11y fu11y efficient estimator for a wide class of F proposed by Takeuchi [13]. The es- timator is constructed by using subsamples of size K (K < n) drawn from the original sample, estimating the elements ofthe teo- variance matrix of the order statistics by U- statistics, and selecting the best weights of the L-estimator. L. A. Jaeckel [14] made an IX- trimmed mean adaptive estimator by selecting an IX that minimizes the estimated asymptotic variance. I. The Influence Curve Let T(F) be a functional of a distribution function F, and let an estimator 1;, of () cal- culated from an empirical distribution function Fn converge to T(F) in probability as n-+ 00. A real-valued function IC(x; F, T) defined by IC(x; F, T)=lim T«(1-6)F HO x )- T(F) e-O 8 for a11 x is ca11ed the influence curve, where Ox is the distribution function of a point mass 1 at x. The curve was first introduced by Hampel [15] to study the stability aspect of estima- tors against a sma11 change of F. As an exam- ple, when F is symmetric about the origin, the influence curve for the IX-trimmed mean IC(x; F, T) is given by P-1(1X)/(1 - 21X) x/(1-21X) when X<F- 1 (1X), when F-1(1X)X <F-1(1-1X), P-1(1-1X)/(1-21X) when x>F- 1 (1-1X). By substituting the empirical distribution function Fn for F in T(F), we can represent the robust estimators discussed in Section H, at 371 K Robust and Nonparametric Methods least approximately, by 1;,= T(F n ). Under some conditions, it can be proved that as n-+oo, 1 n n I12 (T(Fn)-T(F)-- I IC(Xj;F, T))-+O nj=1 in probability. Thus it fo11ows that n I/2 (1;,- T(F) is asymptotica11y norma11y distributed with asymptotic variance J (IC(x; F, T))2 dF(x). J. The Regression Problem Consider the linear regression problem ( 403 Statistical Models D) p X="().a..+6. i =I , 2 , ... , n , I  J lJ l' j=1 where the X j are observable variables, the ()j are regression coefficients to be estimated, the aij are given constants, and 61' 6 2 , ... ,6. are identica11y and independently distributed random errors whose distribution function is given by F(x). The idea of the M-estimator is a direct generalization of the method of least squares; namely, to adopt (6 1 ,,,,,6;,) as an estimator for «()1, ..., ()p) that minimizes i=1 p(X j - Aaij) for some function p such as the one described above. R-estimators of the regression coefficients are obtained by minimizing i=1 an(Rj)L'lj, where L'lj = X j -  Aaij, R j is the rank of L'lj among L'l1' .'., An' and a.(.) is some monotone function satisfying i=1 an(i) = O. It has been proved that minimizing i=l a.(Rj)A j is asymp- totica11y equivalent to minimizing jt1lj a.(Rj)aijl, the properties of which were first studied by J. Jureckova [16]. K. Dependence Let (XI' Yr), ...,(X., Y,.) be random samples from a population with a bivariate distri- bution function, R j be the rank of X j among XI' ... , X. when they are rearranged in an ascending order, and Sj be the rank of 1) among Y 1 ,..., Y,. defined similarly as R j . Vari- ous quantities are devised to measure the degree of dependence between X and Y. (1) Spearman's Rank Correlation. Set d j = R j -Sj. Then rs= 1-6jd?!(n3-n) is ca11ed Spearman's rank correlation. If there is no dependence between X and Y, i.e., if the X j and 1) are independent random variables, then E(rs)=O and V(rs)=(n-1r1. (2) Kendall's Rank Correlation. Take pairs (R j , Sj) and (R j , S). If (Rj - R;}(Sj -Sj) >0, set 
371 Ref. Robust and Nonparametric Methods qJij= 1; otherwise, qJij= -1. The statistic r k = G) -I L qJjj is cal1ed Kendal1's rank correla- tion, where L runs over al1 possible pairs chosen from (R 1 , Sd,..., (R n , S.). If there is no dependence between X and Y, then E(rd=O and V(rd=2(2n+5)/(9n(n-1)). (3) Rankit Correlation. R j and Sj, i = 1, . " , n, are replaced by the corresponding normal scores, i.e., the means of order statistics in an independent sample of size n from N (0, 1); then the usual tsample correlation coefficient is cal- culated from these scores. This correlation co- efficient r R is caned rankit correlation; and if there is no dependence between X and Y, then E(z)=O, V(z)=(n-3)-I, asymptotical1y, where 1 I +r R . z = -log- (tFisher's z-transformatlOn). 2 1-r R These statistics are used to test the hypoth- esis of independence. References [IJ J. Hajek and Z. Sidak, Theory of rank tests, Academic Press, 1967. [2J M. L. Puri and P. K. Sen, Nonparametric methods in multivariate analysis, Wiley, 1971. [3J R. H. Randles and D. A. Wolfe, Introduc- tion to the theory of nonparametric statistics, Wiley, 1979. [4J P. J. Huber, Robust statistics: A review, Ann. Math. Statist., 43 (1972),1041-1067. [5J P. J. Huber, Robust regression: Asymp- totics, conjectures and Monte Carlo, Ann. Statist., 1 (1973),799-821. [6J P. 1. Huber, Robust statistics, Wiley, 1981. [7J R. V. Hogg, Adaptive robust procedures: A partial review and some suggestions for future applications and theory, J. Amer. Statist. Assoc., 69 (1974), 909-923. [8J D. F. Andrews, P. J. Bickel, F. R. Hampel, P.1. Huber, W. H. Rogers, and J. W. Tukey, Robust estimates of location: Survey and advances, Princeton Univ. Press, 1972. [9J H. Chernoff and I. R. Savage, Asymptotic normality and efficiency of certain nonpara- metric test statistics, Ann. Math. Statist., 29 (1958),972-994. [IOJ H. Chernoff, J. L. Gastwirth, and M. V. Johns, Asymptotic distribution of linear com- binations of functions of order statistics with application to estimation, Ann. Math. Statist., 38 (1967), 52-72. [11J P. J. Huber, Robust estimation of a loca- tion parameter, Ann. Math. Statist., 35 (1964), 73-101. [12] J. L. Hodges and E. L. Lehmann, Esti- mates of location based on rank tests, Ann. Math. Statist., 34 (1963), 598-611. 1382 [13J K. Takeuchi, A uniformly asymptotical1y efficient estimator of a location parameter, J. Amer. Statist. Assoc., 66 (1971), 292-301. [14J L. A. Jaeckel, Some flexible estimates of location, Ann. Math. Statist., 42 (1971), 1540- 1552. [15J F. R. Hampel, The influence curve and its role in robust estimation, J. Amer. Statist. Assoc., 69 (1974), 383-393. [16J J. Jureckova, Nonparametric estimate of regression coefficients, Ann. Math. Statist., 42 (1971), 1328 -1338. 372 (XXI.3) Roman and Medieval Mathematics The Romans were interested in mathematics for everyday use; their arithmetic consisted of computation (by means of the abacns), weights and measures, and money. For their mone- tary system, they developed a computational method using duodecimal fractions. Julius Caesar (102?-44 B.C.), known for his calendar reform in 46 B.C., also undertook to measure his territory, which aroused a demand for land surveying techniques. Books on prac- tical geometry which provided this knowledge were cal1ed gromatics (a "groma" was a land surveying instrument). Toward the end of the Western Roman Empire (476), Greek mathe- matics was studied; during this period Boe- thius (c. 480-524) wrote his two books on arithmetic and geometry. The former was a summarized translation of Nichomachus' book, and the latter included propositions from the first three books of Euclid's Elements (without proof) and practical geometry. Music, astronomy, geometry, and arith- metic, which constituted the "mathemata" of Plato's Academy (closed in 529), were treated as the "quadrivium" (the four major subjects) in the Encyclopedia of Martianus Capel1a, Cassiodorus, Isidorus, Hispalensis, and others. After the establishment of the Roman Church in the 5th century, the quadrivium was to be studied for the glory of God. Books on mathe- matics from this period laid emphasis on the computation of an ecclesiastical calendar and mystical interpretations of integers, as seen in books by Bede Venerabilis, Alcuin, and Maurus from the 7th through 9th centuries. Arabian science was imported first through Spain-under Moorish influence beginning in 711, the year of the fal1 of the Visigoths-and then through the Crusades (1096-1270). Com- putation with figures, originating in India, replaced the abacus in the 12th century, when 
1383 Arabian books on arithmetic and algebra, along with Greek books on geometry and astronomy (such as books by Euclid and Ptolemy), were translated into Latin. Italian merchants, whose occupation necessitated computation, rapidly adopted the new system. Representative books of this period are Liber abaci (1202) and Practica geometrica (1220) by Leonardo da Pisa (also known as Fibonacci, c. 1170-1250). The former includes the four arithmetic operations, showing Indian in- fluence, commercial arithmetic, and algebra. The new methods were not limited to mer- chants. The French bishop Nicole Oresme (c. 1323-1382), who influenced Leonardo da Vinci (1452-1519), introduced fractional expo- nents and conceived the graphic representa- tion of temperature, a precursor of coordi- nates and functions. From the 11th century, universities de- veloped from theological seminaries, first in Italian cities such as Bologna and Palermo, and later in Paris, Oxford, and Cambridge. Mathematics was taught in these universities, although no remarkable creative contributions were made. However, theologians such as Albertus Magnus (c. 1193-1280) and Thomas Aquinas (1 225?-1274) discussed infinity in a way that went beyond Greek thought and thus helped to lay a basis for the modern philo- sophy of mathematics. References [1] M. Cantor, Vorlesungen iiber Geschichte der Mathematik, Teubner, 1,1880; 11,1892. [2] M. Clagett, Greek science in antiquity, Abelard-Schuman, 1955. [3] M. Clagett, Archimedes in the Middle Ages I, II, Univ. of Wisconsin Press, 1964. [4] M. Clagett, The science of mechanics in the Middle Ages, Univ. of Wisconsin Press, 1959. [5] G. Sarton, Introduction to the history of science I, From Homer to Omar Khayyam, Carnegie Institution, 1927. 372 Ref. Roman and Medieval Mathematics 
373 A Sample Survey 373 (XVIII.13) Sample Survey A. General Remarks The sample survey is a means of getting sta- tistical information about a certain aggregate from the observation of some but not aU of it. The aggregate concerned is usuaUy called the finite population, and the observed part is called the sample. Introducing a random mechanism, 1. Neyman established a method of ascertaining objectively the reliability of such information. This method is mainly ap- plied to demographic statistical surveys and opinion polls, but it is also applicable to ran- dom samples of physical materials. We briefly sketch the mathematical structure of this method without going into detail about tech- nical problems that arise in the practical sur- vey situation. Suppose that the population consists of N units, where N is caIled its size. Each unit has some characteristic IX, which is an element of some set Q. The set of all characteristics of the units in the population is designated by 8 = {IX 1, ... ,IX N }, which we regard as a parameter. The set of all possible 8 is denoted by 0 c QN. Suppose that one unit is chosen and ob- served according to some procedure, the index number of the unit in the population is J, and the observed characteristic is X. It is assumed that the observation is without error, hence X=IX]. Denote the whole sample by (J, X)=(Jr,..., I n , XI' ... ,X n ), where n is the sample size and Xi = IX],. The sample size n may be a random variable, and among J 1 , ... , I n , duplication may be allowed. The probabilistic scheme for J is called the sampling procedure, and if it satisfies the condition (c) Pr{Ji=j} is independent of IX], and of Xi+r, ... ,X n (but may depend on XI' '" ,Xi-d, it is called a random sampling procedure. More- over, if the tjoint distribution of J is indepen- dent of 8, it is caIled regular. Specifically, if n is constant and pr{J} is symmetric in J, it is caIled uniform. The two main mathematical pro blems of sample surveys are to determine a random sampling procedure and to provide methods whereby statistical inferences can be made concerning 0 ( 401 Statistical Inference). B. The Problem of Inference Condition (c) is assumed. The probability of (J, X) is given by 1386 Pr{ (J, X) =(j1' ... ,jn, Xr, ..., X n )} = Pr{J 1 =jr}Xo(X 1 )Pr{Jz =jZIJ1' XI} Xo(Xz) .., Pr{J n = jn I J 1 , XI'''' ,]n-1, Xn-d Xo(X n ), where Xo(X;) is defined as 1 if Xi =, IX], and 0 otherwise. This formula can be shortened to the form Pr{ (J, X)} = P(J, X)xo(X, J), (1 ) where Xo(X, J) = { if Xi = IX]" i = 1, ..., n, otherwise. Expression (1) is the fundamental model for the sample survey problem. Note that P(J, X) is independent of the parameter 8. Therefore if we let 1=(11'''' ,1m) (11 <... < 1m) be the set of numbers in (Jr, ..., I n ) after deleting dupli- cations, and let Y = (Y 1 , ..., Y m ) be the corre- sponding X values, then the joint distribution of I and Y can also be expressed as Pr{(I, V)} =P(I, Y)Xo(Y, I), where Xo(Y, 1)= { if Yj=1X1j,j= 1, ...,»1, otherwise. Since xiV, 1)= xo(X, J) for all 0, the tcondi- tional probability distribution of (J, X) for given (I, Y) is independent of Xo(X, J), and hence (J, X) is a tsufficient statistic. According to the general theory of sufficient statistics, we can restrict ourselves to the class of procedures depending only on (I, V). C. Estimation Suppose that g(8)=g(1X1' ...,IX N ) is a real para- meter whose unbiased estimators are under discussion. Theorem. There exists an unbiased estimator of g(8) if and only if there exists a decomposition g(8) = L: h.(lXj,(v), ... , IXj.(v»), Pr{I=(j1(V),... ,jn(v))} >0, v= L, 2,.... (2) If the sampling procedure is regular, the second condition can be replaced by Pr {I ::::J (jr(v),... ,jn(v))} > 0, v = 1,2, .... Hence, for example, if lX i is real and the sampling proce- dure is regular, ri. = "i.lXd N is testimable if and only if Pr{I03i} >0 for all i, and a = "i.(lX i - 'r;l/(N -l)="i."i.(lX i -IX//N(N -1) is esti- mable if and only if Pr{I 3 i,j} > 0 for all i and j. Also, ITf=llXi is not estimable unless Pr{I = (1, .. . , N)} > O. The decomposition (2) is not unique, and corresponding to different decom- positions, different unbiased estimators are derived. Also, for the case of regular sampling 
1387 procedures, for any B = Bo it is always possible to construct an unbiased estimator g(O) such that Pr{O(B) =g(Oo) I B= Bo} = 1 if g(B) is esti- mable. Hence the variance of the locally best unbiased estimator is always 0, and the tuni- formly minimum variance unbiased estimators exist only in the trival case. If some kind f symmetry exists among the population units as well as the sampling pro- cedure and the parameter, it would be natural to require the same kind of symmetry for the estimators. Let G be a group of permutations among N numbers. Assume that for any BEE> and YEG, we have yBEE> and g(yB) = g(B). If Pr{yJ} = Pr{J} for all y, then the sampling procedure is said to be invariant with respect to G. An estimator is also called invariant if its value does not change under any permutation y E G of the numbers of sample units. Thus if G is the set of all permutations (i.e., the tsym- metric group), then the invariant estimator is a function of Y (or X) only. Moreover, if the dimension m of Y is constant, Y is complete (under some mild conditions); hence there exists a unique minimum variance invariant unbiased estimator. When there is some additional information, it can be represented by auxiliary variables f3b ..., f3N, which are known and assumed to have some relation to IX!,..., IX N . Assume that the IXj are real numbers and that the parameter to be estimated is B=a=("f,lXj)/N. If we can assume that IX j and pj are approximately pro- portional, we can estimate the unknown popu- lation mean a by &* = (X/Z) x lJ where X is the sample mean of the IX's and Z the sample mean of the f3's. Although &* is not unbiased, we may expect that it has small error if the relation between two variables is close. * is usually called the ratio estimator. In practical research, as an estimator of the population total A = "f,lX j , we usually use A = "f,X;/P j , where P j =Pr{J3i}.lts variance is V(A)="f,"f,(Pj-Pjj)(lXdPj-lXj/f and is estimated by v(A) = "f,"f,{(Pj- Pjj)/PjJ}(XdPj- XJ/f, where Pij=Pr{J:::>(i,j)}. When N is unknown, it can be estimated by the same procedure as A (say IV), and the population mean a can be estimated by &=AIIV, which is called a ratio estimator. & is biased except when N is known. D. Asymptotic Confidence Intervals It is usually impossible to obtain any meaning- ful tconfidence interval based on exact small- sample theory. But when the IX j are real and the sampling procedure is uniform and with- out replacement, the sample mean j[ is asymp- totically normal with mean a and variance 373 F Sample Survey 1 ( n ) 1 ;; I- N a;,wherea;= N_1 L(lX j -a)2,as Nand n-HIJ, and limsupn/N < 1, provided that max (lXj-af l<iN lim sup L( -)2 - o. N-.oo Ct. i - Ct. Also, the sample variance converges to a; as n-+oo. From these results we can construct asymptotic confidence intervals for a. E. The Problem of Sampling Procedures In determining the sampling procedures, both the technical aspects of sampling and the accuracy of the estimators should be consid- ered. The most commonly used methods are multistage sampling and stratified sampling, or some combination of the two. For example, the population is partitioned into several clusters. First we select some of them accord- ing to a probability scheme and then choose units from the selected clusters. This procedure is called two-stage sampling. The probabilities for the selection of clusters may be uniform or proportional to the size of the clusters. Strati- fied sampling is the method of dividing the population into several subpopulations, called strata, and selecting the sample units within ach stratum. If the size of the ith stratum is N j , the size of the sample chosen from this stratum is n j , and the probability is uniform within each stratum, then the most common estimator for the population mean a is given by =IN;XdN, where X j is the mean of the sample values in the ith stratum. The variance of is given by  ( 1 ) 2" N/ ( n j ) 2 V(a)= - L..,- 1-- a j , N nj N j where a j 2 is the population variance within the ith stratum. If the cost of drawing one sample unit in the ith stratum is equal to C j , then for fixed cost, the variance of the estimator is minimized when ndNjocad, which is called the condition of optimum allocation. F. Replicated Sampling Plan W. E. Deming proposed an effective method in practical sample surveys, called a replicated sampling plan, following J. W. Tukey's hint. It enables us to easily evaluate variances of esti- 
373 G Sample Survey mates for any estimator and any sampling procedure. Let the sample be composed of k subordinate samples which are selected by the same random sampling procedure from the same population, and let 8,(J i , Xi) be the esti- mate from the ith subordinate sample by the same estimator and {j be the estimate from the whole sample by the same estimator. Then, provided {j = L O;/k approximately, the var- iance of (j can be estimated by v(O)= L(8,- (j)2Ik(k -1). If the sample is selected by the simple random sampling procedure and is of large scale, Oi and (j are approximately normal variates, and v(O) is evaluated by using the sample range of the Oi' In large-scale sample surveys, even when the random sampling procedure is not simple, the theory related to the normal distribution can be applied to the 8, and 0. It has been shown that v(8) evaluated by the above method includes not only the sampling error but also the random part of the nonsampling errors. G. Conceptual Problems Although it has been established that the sample survey method is useful in large-scale social or economic surveys, there are difficult conceptual problems about the foundations of the method (especially when auxiliary informa- tion exists) that are still far from being settled. References [IJ W. G. Cochran, Sampling techniques, Wiley, third edition, 1977. [2J W. E. Deming, On simplifications of sam- pling design through replication with equal probabilities and without stages, J. Amer. Statist. Assoc., 51 (1956),24-53. [3J R. J. Jessen, Statistical survey techniques, Wiley, 1978. [4J J. Neyman, Contribution to the theory of sampling human populations, J. Amer. Statist. Assoc., 33 (1938),101-116. [5J P. V. Sukhatme and B. V. Sukhatme, Sampling theory of surveys with applications, Asia Publishing House, second edition, 1970. [6J K. Takeuchi, Some remarks on general theory for unbiased estimation of a finite population, Japan. J. Math., 35 (1966), 73-84. 374 (XVII1.4) Sampling Distributions A. General Remarks To perform statistical inference, it is necessary to find the tprobability distribution of a tsta- 1388 tis tic involved in the situation ( 396 Statistic, 401 Statistical Inference). In general, the proba- bility distribution of a statistic is called the sampling distribution. A set {XI' X 2 , ..., Xn} oft random variables that are independently and identically distributed according to a distribution F is called a random sample from F. A common sampling distribution de- scribed in this article is that of the statistic Y = f(X 1 , ..., X n ), where the set {Xj, ..., Xn} is a random sample from a tnormal distribution. Examples of such a statistic Y of dimension 1 are the tsample mean, tsample variance, linear or quadratic forms of {XI' ..., X n }, their ratios, and torder statistic, while examples of Yof higher dimensions are the sample mean vec- tor and the sample covariance matrix and its eigenvalues. The tnormal distribution with tmean /1 and tvariance (12 is denoted by N (/1, (12), while the tp-dimensional (p-variate) normal distribution with mean vector # and covariance matrix L: is denoted by N(#,E) ( Appendix A, Table 22). B. Samples from Univariate Normal Distribntions If random variables XI' ..., X n are distributed independently according to N(/11' (1f), "', N (/1n' (1;), then a linear form Li a i Xi has the distribution N(Liai/1i' Lia?(1?). In particu- lar, if {XI' ... ,X n } is a random sample from N(/1, (12), then the sample mean g = LiX;/n has the distribution N(j1, (12 In). Let {X j, ... ,X n } be a random sample from the distribution N (0, 1). The sampling distri- bution of the statistic Y = LiX,2 is called the cbi-sqnarc distribntion with n degrees of free- dom and is denoted by x 2 (n). It has the tproba- bility density f,,(Y)=2-nI2(rG)) -1 y- 1 +n 1 2 e - y /2 for 0 <Y < cx), j(y)= 0 elsewhere, where r is the tgamma function. The distribution of Y = Li1 (Xi + /1;)2 depends only on nand", = Li /1? , and is called the noncentral chi-sqnare distribution with n degrees of freedom and noncentrality). and denoted by x2(n, A). It has the probability density _  -),1 2 ( "' ) k 1 fn.),(Y)- kO e 2. /Zi/n+2dY) ( n AY ) =e-),/ 2 o F 1 2< f,,(y) for O<y< 00, where fn+2k and j" are the prob- ability densities of chi-square distributions and OF1 is an extended hypergeometric func- 
1389 tion. Noncentral chi-square distributions have the fonowing treproducing property: If Y 1 , ..., Yk are distributed independently ac- cording to x2(nb Ad, ..., x 2 (n k , A k ), then i 1; has the distribution x2(ini' iAJ Also, we have Cochran's theorem (Proc. Cambridge Phi/os. Soc., 30 (1934)): If Xl' ""X n are dis- tributed independently according to N(/lb I), ...,N(/l.. I) and if for quadratic forms Qm= ija!j)XiXj for m= I, ...,k the matrices Am = (a!j») with trank r m satisfy the condition Al + ... + Ak = I (unit matrix), then a necessary and sufficient condition for Q b ..., Qk to have independent noncentral chi-square distri- butions with r 1 ,..., rk degrees offreedom, respectively, is that r 1 + ... + rk = n. In partic- ular, when /li=O for an i, they have (centra!) chi-square distributions, and the theorem implies their reproducing property. Let X and Y be independent random vari- ables having distributions N(t5, I) and x2(n), respectively. Then the sampling distribution of T = X /.JYiii is caned the noncentral t- distribution with n degrees of freedom and noncentrality 15 and is denoted by t(n, 15). Its probability density is given by J..(t)= f e-2/2t5k 2k/2 r((n+k+ 1)/2) kO k! r(n/2) (t/)k x (I +t 2 /n)(n+k+l)/2 for -00 <t< 00. In particular, when 15=0, the distribution is caned the t-distribution with n degrees offreedom and is denoted by t(n). Its probability density is simplified to -(n+l)/2 r((n+1)/2) ( t2 ) J.(t) = 1 +-  r(n/2) n for -oo<t<oo. Let {XI, ... ,X n } be a random sample from N(/l, ( 2 ). Exact sampling distributions of the tsample variance S2 = (Xi-X)2/(n-1) and of the t-statistic T=(X -/lo)/p, where /l= /lo is a given number, were essentiany obtained by Student [5]: (n - I)S2/ a 2 and T are dis- tributed according to x2(n- I) and t(n-1), respectively. His proof was made rigorous by R. A. Fisher (Metron, 5 (1925)), who proved in particular that X and S2 are independent. If /l =I- /lo, then T fonows the distribution t(n- I, (/l- /lo)/a). Let X and Y be distributed independently according to x2(m, A) and x 2 (n), respectively. The distribution of Z = (X/m)/(Y/n) is caned the noncentral F-distribution with m and n degrees of freedom and noncentrality A. In the special case when A.=O it is called the F- distribution with m and n degrees of freedom and is denoted by F(m, n). The probability 374C Sampling Distributions densities Jm.n.;' and Jm.n of these distributions are given by (m/nr/2 z(m/2)-1 Jm.n(z) B(m/2, n/2) (1 + mz/n)(m+n)/2 ' 00 ( A ) k 1 (m/n)(m/2)+k Jm.n,;.{Z) = k/-;'/2 "2 k! B((m/2)+k,n/2) Z(m/2)+k-l X (1 +mz/n)«m+n)/2)+k -e-;'/2 F ( m+n .. A mz/n )J. z - 11 2 '2'21+mz/n m.n() for 0 < z < 00, where Band 1 F 1 are the tbeta function and the confluent hypergeometric function ( 167 Functions of Confluent Type), respectively. Let X be a random variable having the distribution F(m, n). The distribution of Z = tlogX is caned the z-distribution with m and n degrees of freedom and is denoted by z(m, n). Its probability density is given by 2(m/n)m/2 e mz B(m/2, n/2) (I + me 2z /n)<m+n)/2 for -oo<z<oo. IfSf=i=I(Xi-X)2/(m-l) and si = ?1 (1; - y)2/(n -I) are sample var- iances based on independent samples of sizes m and n taken from N(/l,a 2 ) and N(v, ,2), respec- tively, then the statistic z=tlog(Sf lSD, which was introduced by Fisher (Proc. Int. Math. Congress, 1924), is distributed according to z(m - I, n - I) under the hypothesis a 2 = ,2. Fisher [6] tabulated percent points of z(m, n). C. Samples from Multivariate Normal Distributions Let X be a p-dimensional random vector, namely, a vector having real random variables as its components. X has the p-variate normal distribution N(f.l, 1:) if and only iffor any real vector a = (aI' ... ,a p )', the random variable a'X has the normal distribution N(a'f.l, a'Ea). If X b "', X n are independent and have p-variate normal distributions N(f.lb1: 1 ), ...,N(f.ln,1: n ), respectively, and if Ab"" An are m x p real matrices, then the random vector Al Xl + '" + AnXn has the m-variate normal distribu- tion N(f.l, 1:), where f.l= j=1 Ajf.lj and 1:= j=1 A j 1:A;. Suppose that {Xl' ..., Xn} is a random sample from the p-variate normal distribution N(O,1:), and let X=(X 1 ,..., X n ) be a p x n ma- trix. Then the probability distribution of W= XX' is caned the Wishart distribution with scale matrix 1: and n degrees of freedom and is denoted by W p (1:, n) or simply W(1:, n). If n > p -1, the joint probability density function 
374C Sampling Distributions of p(p + I )/2 arguments of W = (Wij) is .fz-.n( W) = (f p(n/2)12Iln I 2)-1 x etr( - I-I W/2)! W!(n- p -1) / 2 for W> 0, where W> 0 means that W is tposi- tive definite, etr(A)= exp(tr A), and f p' a multi- dimensional gamma function, is defined as f p(a)= n P (p-1)/4 fI f ( a -(i - 1» ) i1 2 for a>(p-l)/2. When np-l the distri- bution is singular and has no probability density. Suppose that XI, "', X n are independent and obey normal distributions N(f.lj,1:), ..., N(f.ln, I), and let X=(Xj,..., X n ), M = (f.ll' ... ,f.ln)' Then the distribution of W = XX' is called the p-dimensional noncentral Wis- hart distribution with scale matrix I, n de- grees of freedom, and noncentrality matrix Q=I- 1 MM' and is denoted by W(I, n, Q). If n > p -1, the probability density function is etr( - Q/2)oF1 (n/2; QI- 1 W/4)fE.n( W) for W> O. .F p is a hypergeometric function with matrix argument, which is defined by .Fp(aj, ...,a.;bj,... ,bp;S)=.F?)(aj, ...,a.; bj,..., b p ; S, I), where JfiP)(aj, ..., a; bj, ..., b p ; S, T) ( · 1 en TI (ai)K C ( S ) C ( T ) =II T- K, K , kO K Il (b)K k.CK(lp) )-1 K = (k l' ... , k p ) is an ordered set of integers such that k, + ... + kp = k and k 1  ...  kp  0, and where CK(S) is a zonal polynomial ( [8]) of a symmetric matrix S. The multivariate hyper- geometric coefficient (a)K is given by (a)K= fI ( a-(i-l» ) , i1 2 k, (a)k=a(a+ 1)...(a+k-l). The noncentral Wishart distribution is sin- gular when n  p -1. Similarly to the noncen- tral chi-square distribution, the noncentral Wishart distribution has the reproducing property with respect to both the number of degrees of freedom and the noncentrality matrix. Also, Cochran's theorem can be ex- tended to the multivariate case: Let X j, '" ,X n be p-variate random vectors independently distributed according to N(f.l1' I),..., N(f.ln, 1:), respectively, and let Am =(aj», m = 1, ..., k, be p x p real matrices of trank r m and such that A, + A 2 + ... + Ak = 1 (unit matrix). A necessary and sufficient condition for random matrices Qm= Li.jaj)XiX;, m= 1,..., k, to be indepen- dently distributed according to non central 1390 Wishart distributions with r l' "', r k degrees of freedom, respectively, is that r 1 + .., + r k = n. If, in particular, f.l1 =... =f.ln=O and Ifrm p, then Qm is distributed according to W(I, r m ). If W has the distribution W(I, J1) with n> p - 1, then the eigenvalues Aj, ..., Ap (AI ...  ),p > 0) of W ha ve the joint probability density function Cp.n!II-n p oF/r>( - I- 1 /2, A)IA!(n-p-1 )/2 IIi</Ai-A j ), where A is a diagonal matrix with diagonal elements AI' ... ,Ap and C p . n = nP2p(2pnI2fp(p/2)fp(n/2)f1. If I=I, then the joint probability density function becomes Cp.netr( - A/2)! AI(n- p -l)p TI (Ai - AJ i<j Suppose that SI and S2 have independent Wishart distributions W(I, n 1) and W(I, n 2 ), respectively. The random matrix B=(SI + S2r1/2S1(SI +S2r 1/2 is called the beta matrix, and its distribution is denoted by B(n 1 /2,n 2 /2). Its probability density function is (fp CI ;n 2 )/ f p C2 1 )fp C; ) )IBI(n O -1) / 2 X Jl_Bj(n 2 -p-1)/2 forO<B<I. Suppose that S has the distribution W(I, n) and B has B(nr/2, n2/2); then, for any non- singular symmetric matrix Q, P{S< Q} = {12IW11-n/2f p (p; 1 )/ ( n+ p + I )} ( . n+p+  . _ -1 ) f p 1 F1' ,I Q 2 2 2 2 and P{B<Q} = {f p C1 ;n 2 ) fp (P; ) / f p ( n 1 +;+l )f p (n; ) }IQl n '/2 x F (  _ n2 + p+l . I+P+l 'Q ) 2 1 2 , 2 2' 2 ' . If {XI' ..., Xn} is a random sample from N (f.l, I), then the sample mean X == L:1 X/n and the sample covariance matrix S=L:I(X. -X)(X.-X)'/(n-l) are distributed indepen- dently according to the respective distributions N(f.l, I/n) and W(I, n -1). If n > p-l, T 2 = n(X - f.loYS -'(X - f.lo) is called the noncentral Hotelling T 2 statistic with n - 1 degrees of freedom and noncentrality A = n(f.l- f.lO)'1:'-1 (p - f.lo). (n - p) T 2 /p(n - 1) has a tnoncentral F- distribution with p and n - p degrees of free- dom and noncentrality A. Let X=(X 1 ,..., X p )' and Y =(Yj,..., Yqy, p  q denote two random vectors, I: 11 and I 22 their respective covariance matrices, and I 12 the p x q matrix of covariances between 
1391 the components of X and Y. Each ofthe nonnegative roots PI' ... , P p of the equation l.E l2 .E;i .E 21 - p2 .Elil =0 is called the canonical correlation coefficient. Let (XI' Y d, '" , (X., Y.) be a random sample from the (p + q)-variate normal distribution N(O,L), and let Sll = I::=l XoXjn, S12 =Sl = I::=l Xo Yjn, S22 = I::l Yo Yjn, and S=(Sij);,j=I.2' The sample canonical correlation coefficients are the non- negative roots r l , ..., rp of ISl2SiiS21 -r 2 S 11 1 = ° and for n  p + q the probability density function of ri, ... , r is Cp,q,.II - p 2 1./ 2 IR 2 1(q-P-I)/211 _ R 2 1(.-q-p-I)/2 ( nn q ) X O( r 2 -r 2 ) F(p) - _'_'R2 p2 i<J I J 2 I 2' 2' 2' , , where Cp,q,. =n p2 / 2 r p G)! {rp (n;q )rp()rp()}, and R 2 and p2 are diagonal matrices with elements r'f and pl, respectively. If, in partic- ular, p = 1, then PI and r l are, respectively, the population and the sample multiple cor- relation coefficients, and (n - q)ri jq(l- ri) follows the distribution F(q, n - q) whenever P1=0. Let {(Xl' Y l ), ... , (X., Y,,)} be a random sample from the 2-dimensional normal distri- bution with tcorrelation coefficient p. Then the sample correlation coefficient i(Xi-X)(Y;- Y) R- (j(Xj-Xfi(Y;- Yf)I/2 has probability density J.(r; p) = (2.- 3 jn(n - 3)!)(1- p2)(.-I)/2(1_ r 2 )(.-4)/2 00 ( 1 ) (2pr)k x  r 2 -(n+k-l) -----;- k-O 2 k. for -1 < r < 1. For the special case P = 0, the probability density becomes ( 1 ) r -(n-1) f.(r) - 1  (1- r 2 )(.-4)/2, ..;; r(2(n-2)) which implies that T= Jn-2 R jJI-R2 has the tt-distribution with n - 2 degrees of freedom. Given a random sample from a p-variate normal distribution, the probability density of the tsample partial correlation coefficient R 12 . 3 ... p between the first and the second components with the remaining components fixed is given by f.- p +2(r; P12' 3..), where f is 374D Sampling Distributions the density of R mentioned in the previous paragraph and Pl2'L.p is the population par- tial correlation. D. Large-Sample Theory So far we have dealt with random samples {Xl, ... , X.} composed of finitely many ran- dom variables (or vectors). The theory dealing with such finite cases is called small-sample theory, which is not always suitable for numer- ical applications. In comparison with this, in large-sample theory, where the sample size is assumed to be sufficiently large, an approxima- tion of the sampling distribution can often be obtained easily by means of the tcentrallimit theorem. If for three sequences X., J1., and a., n = 1,2, ..., of random variables, real numbers, and positive numbers, respectively, the se- quence (X. - J1.)ja. tconverges in distribution to N(O, 1) as n.....oo, then the sequence X. is said to be asymptotically distributed according to N(J1.,a;). The definition can be extended to higher dimensions. We write X.=op(r.) for a sequence r. of positive numbers if and only if X.jr. tconverges in probability to zero as n.....oo. The following theorem is useful: If X.=a+op(r.), where a is a constant and r. =0(1), and if a real-valued function f(x) is of class CS in a neighborhood of x = a, then s 1 f(X.) = kO k! j<k)(a)(X. - a)k + op(r:). If X. is asymptotically distributed according to N(J1, 17 2 jn) and f(x) is differentiable at x = J1 with the derivativef'(J1) *0, then f(X.) is asymptotically distributed according to N(f(J1), (f'(J1)fa 2 jn). In higher-dimensional cases, if X. is asymptotically distributed according to N(p, .Ejn) and f(x) is continuously differenti- able in a neighborhood of x = p with nonzero vector c=(8fj8x[,..., 8fj8x p )x=", then f(X.) is asymptotically distributed according to N(f(p), c.Ec'jn). Let {Xl' . .. , X.} be a random sample from a univariate distribution with finite tmoments v j = E(X i ) for i= 1,..., k, and let a j = I:oXjn be its ith sample moment. Then the random vector (a l ,..., a k ) asymptotically follows the k- variate normal distribution as n.....oo with mean vector (VI, .., , Vk) and covariance matrix n-l(aij), where aij=vi+j-ViV j . Let Mi=I:o(Xo- X)ijn and J1i = E(X - vd i for i = 2, ..., k be the sample central moment of order i and popula- tion central moment of order i, respectively. Then the random vector (X, M 2 ,..., M k ) obeys the k-variate normal distribution asymptoti- cally as n.....oo with mean vector (V I ,J12'''' ,J1k) 
374 E Sampling Distributions and covariance matrix n-'(ui)' where U ll = f1.2' Uli=f1.i+1- i f1.2f1.i-j, Uij=f1.i+j- i f1.i-1f1.j+1- jf1.i+lf1.j-1 -f1.if1.j+Uf1.2f1.i-1f1.j-1 for i,j"?2. A random variable X that has a tchi-square distribution with n degrees offreedom obeys the distribution N (n, 2n) asymptotically as n-+oo. Also, J22 - J2n-1 obeys N(O, 1) asymptotically. The latter distribution approx- imates X indirectly better than N(n, 2n) ap- proximates X directly. The t-distribution with n degrees of freedom obeys N (0, I) asymptoti- cally as n-+oo. If X n obeys an F-distribution with m and n degrees of freedom, then mX n obeys asymptotically the distribution x 2 (m) as n-+oo. If X n obeys a tbinomial distribution Bin(n, p), then X n obeys asymptotically the dis- tribution N(np, np(l- p)), and Arcsin J Xnln obeys asymptotically N(ArcsinjP, 1/4n) as n -+ 00. This transformation is called the arc sin (or angular) transformation. If (XI, X 2 ,... ,X k ) obeys the multinomial distribution Mu(n; PI' P2' ..., pd, then it is asymptotically distributed according to the normal distribution N (Jln, L n ), where Jln = (np 1, .. . , npd, Ln = (u!;'»), U!?) = npi(l- p.), and u!j) = - npiPj, (i # j), and the random variable r.'-':l(Xj-np/lnpj' where Xk+' =n-(X 1 +... +X k ) and PHI = 1-(P1 + '" + pd, obeys asymptoticaJly the distribution X 2 (k) [11]. If X n has the tPoisson distribution with mean An' where )'n-+oo as n-+oo, then X n and JXn obey the respective distributions N(An,An) and N(fi, 1/4) asymptotically. If R is the sample correlation coefficient based on a ran- dom sample of size n from a 2-dimensional (bivariate) normal distribution with popula- tion correlation coefficient p, then R is asymp- toticalJy distributed according to N(p, (I - p2)2 In) as n-+ 00, and therefore z = 1log((1 + R)/(l- R)) obeys asymptotically the distri- bution N(1log«(1 + p)/(l- p)), I/n) asymptoti- cally. This transformation is called Fisher's z- transformation. The distribution ( I 1+ p p 1 ) N -log-+-- 2 I-p 2(n-l)'n-3 gives a better approximation. E. Empirical Distribution Function Let {XI, ... ,X n } be a random sample from a distribution F. The random function 1 Fn(x) = - {number of X's that are  x} n is called the empirical distribution function. For any collection of fixed x's ( - 00 = Xo < XI < '" <xk < 00), the random vector (nFn(xl)' n(F n (x2) - Fn(xrJ), ..., n(Fn(xd - Fn(xk-d)) obeys the tmultinomial distribution Mu(n; PI"'" Pk), 1392 where pj=F(xj)-F(xj_d,j= 1,.... k, provided that the p's are positive. In particular, the vector is asymptotically distributed according to the k-variate normal. The result is substan- tially strengthened as follows: the Glivenko- Cantelli theorem states that supx IFn(x) - F(x)1 converges to zero with probability 1 as n tends to infinity. If F(x) is continuous, then the ran- dom function (Fn(t) - F(t)) converges in distribution to a tGaussian process X(t) such that E(X(t)) =0 and E(X(s)X(t)) = F(s)(l- F(t)) for s  t. A Gaussian process X (t), 0  t  1, with this moment condition is called a Brownian bridge if F(t) = t, for 0  t  1. If F(x) is continuous, then the distributions of the ran- dom variables Cn=supAFn(x) -F(x)) and Dn=supIFn(x)-F(x)1 do not depend on F. Asymptotically, they have identical distribu- tions with sup, B(t) and sup, I B(t)l, respectively, where B(t) is a Brownian bridge. We have P(sup,B(t)x)= 1_e-2x 2 , CD P(sup,IB(t)lx)=1+2 L (_1)k e -2k 2 x', k1 x>O. Let {Xj, ... , Xm} and {Yj, ..., Y,,} be random samples from continuous distributions F and G, respectively, and let Fm(x) and Gn(x) be their empirical distribution functions. Under the hypothesis Ho: F == G, the distribution of the Kolmogorov-Smirnov test statistic Dm.n = sup IFm(x) - Gn(x)1 does not depend on F (or G), and asymptoti- cally, as m-+oo and mln-+). 1, the random function j;(Fm(t) - Gn(t)) converges in distri- bution to a Gaussian process X(t) such that E(X(t))=O and E(X(s)X(t)) =(1 +A)F(s)(l- F(t)),st. F. Edgeworth and Cornish-Fisher Expansions Let {XI' X 2 , ..., Xn} be a sample from a distri- bution with mean f1. and variance u 2 . The random variable (XI +X 2 +... +Xn-nf1.)/u is called the normalized sum of the sam pie. The distribution function Fn(x) of the normal- ized sum of a sample from an absolutely continuous distribution F with higher-order moments admits the Edgeworth expansion [ 15]: Fn(x) v-2 ( 1 ) k ( 1 ) V-1 = <I> (x) + k1 Rk(x)  ljJ(x)+B  where <I> and IjJ are the tcumulative distribution and the tprobability density functions, respec- 
1393 tively, of N(O, 1), and B is a quantity bounded by a constant depending on F and v. Rdx) is the polynomial given by ) Il k 1 ( Yj+2 ) m. Rk(x)= -l:Hk+2l-1(X. ' ( -----;--- 2) ' J, ):1 mj' } + . where Hk(x) is the tHermite polynomial of degree k, Yk is the tcumulant (tsemi-invariant) of order k of the distribution of (Xl - /l)/(J, the summation extends over a11 nonnegative m's such that m l +2m 2 + .., +kmk=k, and l=ml +m 2 +... +mk' In particular, we have Rl(x)= -Y3(X 2 -1)/6 and R 2 (x)= -Y4(X 3 - 3x)/24 _Y(X5 -lOx 3 + 15x)/72. For a tlattice distri- bution F concentrated on 0, ::1:1, :t2, .., but not on 0, ::I: p, :t 2p, '" for any p> 1, the fol- lowing expansion is valid for x=O, :tl, :t2, ...: Fn cn: ) =(z)+ :t: QdZ { Y (Z)+B (} -I, where z=(x-n/l+ 1/2)/(J and the Q's are suitable polynomials; Ql(z)=Rl(z) and Q2(Z) =R 2 (z)+z/24(J2. The Edgeworth expansion makes it possible to derive asymptotic formulas for the relation between those u and v such that Fn(v)=(u). If F is an absolutely continuous distribution with moments of order v (? 3), then we have the Cornish-Fisher expansions [16J: V- 2 ( 1 ) k u=v+ L: Ak(v) - +0(n-(v-I)/2) kl  and v-2 ( 1 ) k v=u+ L: Bk(u) - +0(n-(v-1)/2), k:1  where the A's and B's are polynomials derived from the R's of the Edgeworth expansion; Al (v) = -Y3(V 2 -1)/6, A 2 (v)= -Y4(V 3 - 3v)/24 +y(4V3 -7v)/36, Bdu)=Y3(U 2 -1)/6, B 2 (u) = Y4(U 3 - 3u)/24-yWu 3 - 5u)/36. The expansions imply, in particular, that the random variable v + I:;;f Ak(v)n -k/2 with v = (Xl +X 2 + '" +Xn-nJ1)/(J is asymptoti- ca11y distributed according to N(O, 1) and that the 1001>:% point v. of Fn is approximated by u. + I:;;f Bdu.)n -k!2,where u. is the 1001>:% point of N(O, 1). These approximations can be im- proved further in some cases by a suitable transformation of the sum Xl + X 2 +... + Xn. Thus, for example, if X is distributed accord- ing to x 2 (n), then the Cornish-Fisher expan- sions with v = 3 are 1 f2 2 ( 1 ) u=v-3..J(V -1)+0  374G Sampling Distributions and v=u+A(U2-1)+OG)' where v=(X -n)jj2r;. However, the distri- bution of the random variable  { (r 3 -1+ :n } is much better approximated by N(O, 1), and n ( 1-+ (2 U. ) 3 9n ..J 9n gives a more accurate approximation to the 100iX% point of the distribution x 2 (n). These are ca11ed the Wilson-Hilferty approximations (Proc. Nat. Acad. Sci. US, 17 (1931». The Edgeworth expansion was shown to be valid in more general situations by R. N. Bhat- tacharya and 1. K. Ghosh [17]. In particular, they obtained the fo11owing: Let {X l ,X 2 , ..., Xn} be a random sample from a p-variate distribution with a nonzero tabsolutely con- tinuous component w.r.t. tLebesgue measure on RP. Letfo (= l),fl' ...,fk be linearly in- dependent, real-valued, and continuously dif- ferentiable functions. For i = 1, ..., n, put Zj = UdXj),f2 (Xi), ." ,fdX i », and assume that the distribution of Zl has moments up to the order v (? 3). Let H be a real-valued function on R k such that the vth order derivatives are continuous in a tneighborhood of,ll= E(ZI)' Let V = (v ij)' i, j = 1, ... , k, be the covariance matrix of the random vector Z l' and put (J2 = I: Vijli/j, where Ii =aH(z)/iJzilF' and z= (z j, ... ,Zk)' Then splpr{(H(Z)-H(,ll»EB}- t 1/Iv.n(X)dX\ =0(n-(V-2)12), where Z = I::1 Zdn, the supremum is taken over a11 Borel measurable sets B, 1/Iv.n(x) = ( 1 + : (  y P k ( - :x) ) u(x), u(x) is the probability density function of the normal distribution N (0, (J2), and the P's are polynomials whose coefficients are indepen- dent of n. G. Order Statistics Let {XI' '.. ,X n } be a random sample from a univariate distribution with continuous prob- ability density f(x) and distribution function F(x), and let X(l) ."  X(n) be torder statistics. The joint probability density of Y l = X{.), Y 2 = 
374H Sampling Distributions X(p)' ..., Yp-l = Xl')' and Y p = X() is given by n! (a-l)!(f3-a-l)! '" (I/-F.-l)!(n-I/)! x (F(Ydr 1 (F(Y2)-F(Ydr'-1... x (F(y p ) - F(yp-d)"-'-l (1- F(Yp))n-... x f(ydf(Y2) ...f(yp) for -00 <Yl <... <Y p < 00, where a<f3 <... < F. < 1/. If for given constants 0 < ).1 < ... < ).p < 1, each of the subscripts a, ...,1/ tends to infin- ity as n-+ 00 under the conditions r j = n).j + o(), where a=r p ...,I/=r p , then the ran- dom vector (Y 1 ,..., Y p ) asymptotical1y obeys the p-dimensional normal distribution with mean vector (1' ... ,  p) and covariance matrix n- 1 (a j ), where aij=ajj=Aj(l-A)lf(;)f() for i j and i is the A j -quantile of the population, defined by }'j=F(J Suppose that there exist two sequences an and b n of real and positive numbers, respec- tively, such that as n-+ 00 the sequence (X(n)- an)lb n converges'in distribution to a nonde- generate distribution G. The underlying dis- tribution F is said to belong to the domain of attraction of the limiting distribution G (DA(G), for short). Except for the change of location and scale, only the fol1owing three distributions have nonempty domains of attraction: { o x<O G 1 (x)= e - x -; , X?O { e-(-X)Y G 2 = 1, x<O X?O' G 3 (x)=e-e- x . Writing F(x)= I-F(x), a(F)=inf{xl F(x»O} and w(F)=sup{xIF(x)< I}, we have the fol- lowing theorem (B. V. Gnedenko, Ann. Math., 44 (1943)): FEDA(G 1 ) iff w(F)= 00 and there exists an a such that lim F(a+ux)IF(a+u)=x- Y for an x; u'" FEDA(G 2 ) iff a:=w(F)<CIJ and limF(a-ux)IF(a-u)=x Y for an x>O; uo FEDA(G 3 ) iff there exists a positive function R(t) such that lim F(t+xR(t))IF(t)=e- x for an x. t-w(F) If F(x) is twice differentiable, f(x) = F'(x) is positive for sufficiently large x, and limxw(F)d{(I-F(x))/.r(x)}ldx=O, then FE DA(G 3 ). Noticing the relation X( 1) = -max{ -Xl, -X 2 ,..., -X n }, we can also derive the possible limiting distributions for 1394 the sequence (X( 1) - an)lb n and their domains of attraction. The statistics Rn = X(n) - X( 1) and M n = (X( 1) + X(n»)/2 are caned the range and the midrange of the sample, respectively. If, for some a., a, and b n , both sequences (X(n)- an)lb n and (X( 1) - a)/bn converge to nondegene- rate distributions G and H, respectively, then they are asymptotical1y independent, and we have lim Pr{ (Rn - an + a)lbn  x} = fa) (I-H(y-x))dG(y), lim Pr{(2M n -an-a)lbnx} = f: H(x- y)dG(y). H. Characterization of the Distribution by means of a Property of the Sampling Distribution A distribution or a family of distributions can be characterized by a property of the sam- pling distribution of a suitable statistic. Let {X l' X 2 , "', Xn} be a random sample from a nondegenerate distribution F, and let X(1) ...  X(n) be the torder statistics. The tsample mean X is independent of the tsample var- iance S2=(Xj-X)2/(n-l) iff F is normal N(fl., ( 2 ) (Kawata and Sakamoto, J. Math. Soc. Japan, 1 (1949)). Let aij, i,) = 1, ..., n, be real numbers such that aij=O and I,aij#O. If F has a finite second moment, then the condition E(I,aijXjX j I X)=const. implies that F is nor- mal. Two linear statistics L 1 = a 1 X j + .., + anX n and L 2 = b 1 X 1 +... + bnXn are inde- pendent only if F is normal, provided that ajbj#O for some}. In fact, the X's need not be identical1y distributed: If L 1 and L 2 are inde- pendent and ajb j # 0, then the distribution of X j is normal (Skitovich-Darmois theorem). Yu. V. Linnik [23J gave a necessary and sufficient condition for the normality of F 10 be equiv- alent to the identity of the distributions of L 1 and L 2 . The condition is stated in terms of the zeros of the entire function a(z)=la l IZ + ... + lanl z -lbjl Z -... -Ibnl z . The result contains as a special case the fonowing characterization theorem for the normal distribution: If L af = 1 and L 1 has a distribution identical to that of Xl, then F is normal N(fl., ( 2 ) with fl.( L a j - 1) =0. R. Shimizu gave a complete description of the characteristic function of the distribution for which L 1 has the same distribution as Xl' In particular, it was proved that if10gl a III logla 2 1 is an irrational number, a is the posi- tive number given by Ial= 1, and if L 1 has a distribution identical to that of Xl, then F is the tstable distribution with tcharacteristic 
1395 exponent rx. The result was extended in [24J to the cases where the a's are random variables independent ofthe X's. If E(Xi)=O and E(X IX I -X,X 2 -X, ...,Xn-X)=O, then F is normal [26]. Let J.lI' J.l2' ... , J.ln be a set of real numbers. If the sampling distribution of the statistic 1: (X; + J.l;)2 depends on the J1's only through 1: J.lf, then F is normal. If Xi is posi- tive and has finite mean, then the condition E(XIX 2 /X I , X 3 /X I , ..., Xn/XI)=const. implies that F is the gamma distribution. If the distri- bution F is not concentrated on a lattice 0, p, 2p, '.., then E(X(k+1)-X(k)IX(k»)=const. for some k implies that F is the texponential distribution: F(x) = l-e- h , x>O. X(k+1)- X(k) has the identical distribution for some k with min{X 1 , X 2 ,... ,X n - k } iff F is exponential. If ai' a 2 ,..., an are positive numbers such that a l + a 2 +... + an = 1 and such that logatlloga2 is irrational, then the sampling distribution of min{Xt!a l ,X 2 /a 2 , ...,Xn/a n } is the same as that of XI iff F is exponential. X(I) is indepen- dent of X(I)-X iff F is exponentia1. Suppose that the distribution F has a bounded density function and that the integral Soo e'x dF(x) is finite on a neighborhood of t = 0. Let {XI' X 2 , ... , Xn} be a random sample from a distribution of the family &'= {F((x- J.l)/a) I -00 < J.l < 00, a >O}. For n 9, the sam- pling distribution of the statistics { (X 3 - XI) (X 2 - XI) (X 6 - X 4 ) (X s - X 4 ) (X 9 -X 7 )' (X S -X 7 )' (X 9 -X 7 )' (X S -X 7 )' sgn(X 3 - XI)' sgn(X 2 - XI), sgn(X 6 - X 4 ), sgn(X s - X 4 )} uniquely determines the family &'. If F is symmetric, then for n6 the distribution of {1(X 4 -X 3 )/(X 2 -XI)I, I(X 6 - XS)j(X2 - XI)I} uniquely determines &' [25]. I. U-Statistics Let {XI' ... , Xn} be a random sample from a certain distribution, and let q>(x l , '" , x m ) be a real-valued function that is symmetric with respect to the arguments XI' ... , Xm. The statistic u= ( n ) -IL:q>(X. ,...,X.), m 'm where the summation extends over an combi- nations (rx l , ..., rx,J taken from (1,2,..., n), is caned a U-statistic. Let e=E(q>(X I , ...,X m )), and assume that E( q>(X I , ..., Xm)2) is finite. Then the mean and variance of U are given by ( n ) -I m ( m )( n-m ) E(U)=e, V(U)= L:. ---;- e, m ;=1 1m-I 374J Sampling Distributions where (i is the covariance between q>(X I , ...,X m ) and q>(X I , ...,X;,Xi+I' ...,X;"), with Xi+l, ..., X;" an additional independent random sample of size m - i from the same distribution. If (I *0, then U obeys the distri- bution N(e,m 2 (t!n) asymptoticany as n-+oo (W. Hoeffding, Ann. Math. Statist., 19 (1948)). These results can be generalized to the case of several populations and samples. Let XII' ...,X ln ,; ...;Xc!' .,.,X cn , be c independent random samples, each drawn from one of c populations. Assume that a real-valued func- tion q>(xlt. ..., x lm ,; ...; Xci' ... ,X Cm ) is sym- metric with respect to the arguments Xii' ... , x;mi for each i = 1, ..., c. Then c ( n. ) -1 U=TI I .L:q>(X la (ll),.",X la (lm,);...; i=1 m i X CiX (Cl), ..., XclX(cm c »)' where the summation extends over an combi- nations (rx(il),..., rx(imi)) of mi numbers taken from (1,2,..., n i ) for each i= 1,..., c, is caned a U-statistic. The mean and variance of U can be obtained as before, while U is asymptoti- cany normany distributed as the sample sizes n l , ..., nc tend to infinity in fixed proportion. Furthermore, if there are given several U- statistics, their tjoint distribution is asymptoti- cany normal. J. Distributions Having Monotone Likelihood Ratio, and P6lya-Type Distributions Let (X,) be a sample space and &'= {P8(X)I eEQ} be a family of probability densities with re- spect to a fixed ta-finite measure. The function P8(X) regarded as a function of e with a fixed observed value of X is caned the likelihood function, and its value at a particular point e is caned the likelihood of that point. The family &' with Q c R is said to have monotone likeli- hood ratio with respect to a real-valued function T(x) if and only if for any e < e' such that e and e' belong to Q the ratio P8,(X)/P8(X) is a nondecreasing function of T(x). Under the assumption that Xc Rand 8 2 Iogp8(x)/8x8e exists, a necessary and sufficient condition for &' to have monotone likelihood ratio with respect to T(x) == X is that 8 2 1ogp8(x)/8x8e O for any X and e. If {XI' ...,X n } is a random sample from a distribution that has a mono- tone likelihood ratio and if a real-valued func- tion 1/!(x l , ..., x n ) is nondecreasing in each of its arguments, then the expectation E 8 (1/!(Xt. ... , X n )) is a nondecreasing function of e. The family &' is said to be of P61ya type n if and only if for any m= 1,2, ...,n and any 
374 Ref. Sampling Distributions real numbers XI <... <x m and 0 1 <... <8m, the determinant of the matrix (Pe (x)), i, j = 1, '" ,m, is nonnegative, and .OJ is 'said to be strictly of P61ya type n if the determinant is positive. Being of P61ya type 2 is equivalent to having a monotone likelihood ratio. If 9 is (strictly) of P61ya type n for any positive in- teger n, then it is said to be (strictly) of P61ya type. An texponential family of distributions with probability density Pe(x)=exp(8x+iX(0) +s(x)) for xE.'?tcR and 8d2cR is strictly of P61ya type. Each of the noncentral chi- square distribution, noncentral t-distribution, and noncentral F-distribution is strictly of P61ya type with respect to the noncentrality parameter. References [IJ H. Cramer, Mathematical methods of statistics, Princeton Univ. Press, 1946. [2J S. S. Wilks, Mathematical statistics, Wiley, 1962. [3J M. G. Kendall and A. Stuart, The ad- vanced theory of statistics I, Griffin, fifth edi- tion, 1952. [4J N. L. Johnson and S. Kotz, Distributions in statistics, 4 vols., Wiley, 1969-1972. [5J Student (W. G. Gosset), The probable error of a mean, Biometrika, 6 (1908), 1-25. [6J R. A. Fisher, Statistical methods for re- search workers, Oliver & Boyd, 1925. [7J T. W. Anderson, An introduction to multi- variate statistical analysis, Wiley, 1958. [8J A. T. James, Distributions of matrix vari- ates and latent roots derived from normal samples, Ann. Math. Statist., 35 (1964), 475- 501. [9J A. G. Constantine, Some non-central distribution problems in multivariate analysis, Ann. Math. Statist., 34 (1963), 1270-1285. [!OJ H. Chernoff, Large-sample theory: Parametric case, Ann. Math. Statist., 27 (1956), 1-22. [11J K. Pearson, On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling, Philos. Mag., (5) 50 (1900). Reproduced in K. Pearson, Early statistical papers, Cambridge Univ. Press, 1948. [12J K. R. Parthasarathy, Probability mea- sures on metric spaces, Academic Press, 1967. [13J P. Billingsley, Convergence of probability measures, Wiley, 1968. [14J R. H. Randles and D. A. Wolfe, Introduc- tion to the theory of nonparametric statistics, Wiley, 1979. 1396 [15J F. Y. Edgeworth, The law of errors, Trans. Cambridge Philos. Soc., 20 (1904). [16J E. A. Cornish and R. A. Fisher, Moments and cumulants in the specification of distri- butions, Rev. Int. Statist. Inst., 5 (1937). Repro- duced in R. A. Fisher, Contributions to mathe- matical statistics, Wiley, 1950. [17J R. N. Bhattacharya and J. K. Ghosh, On the validity of the formal Edgeworth expan- sion, Ann. Statist., 6 (1978), 434-451. [18J R. N. Bhattacharya and R. Ranga Rao, Normal approximation and asymptotic expan- sions, Wiley, 1976. [19J E. J. Gumbel, Statistics of extremes, Columbia Univ. Press, 1958. [20J A. E. Sarhan and B. G. Greenberg (eds.), Contributions to order statistics, Wiley, 1962. [21J H. A. David, Order statistics, Wiley, second edition, 1981. [22J 1. Galambos, The asymptotic theory of extreme order statistics, Wiley, 1978. [23J Yu. V. Linnik, Linear forms and statis- tical criteria, Selected Trans!. Math. Statist. Prob., 3 (1962),1-90. (Original in Russian, 1953.) [24J R. Shimizu and L. Davies, General char- acterization theorems for the WeibulJ and the stable distributions, Sankhya, ser. A, 43 (1981), 282-310. [25J Yu. V. Prokhorov, On a characterization of a class of probability distributions by distri- butions of some statistics, Theory of Prob. App!., 10 (1965), 438-445. (Origmal in Russian, 1965.) [26J A. M. Kagan, Yu. V. Linnik, and C. R. Rao, Characterization problems in mathemat- ical statistics, Wiley, 1973. [27J 1. Galambos and S. Kotz, Characteriza- tions of probability distributions, Springer, 1978. [28J G. P. Patil, S. Kotz, and J. K. Ord (eds.), Statistical distributions in scientific work Ill, Characterizations and applications, Reidel, 1975. [29J S. Karlin, Decision theory for P61ya type distributions, Case of two actions I, Proc. 3rd Berkeley Symp. Math. Stat. Prob. I, Univ. of California Press, 1956, 115-128. 375 (XX.26) Scattering Theory A. General Remarks The path of a moving (incident) particle is distorted when it interacts with another (tar- get) particle, such as an atom or a molecule. Phenomena of this sort are generally called 
1397 scattering. Scattering is ca11ed elastic when the internal properties of the incident particle and the target remain unchanged after the co11i- sion, and inelastic when the internal properties change, other particles are emitted, or the two particles form a bound state. The extent of scattering depends on the sizes of the incident and target particles. The scat- tering cross section is defined as the probability that the incident beam wi11 be scattered per unit time (normalized to one particle per unit time crossing unit area perpendicular to the direction of incidence). In tclassical mechanics the scattering cross section is equal to the cross section of the target perpendicular to the incoming beam, hence the term "cross sec- tion." The probability of scattering into a unit solid angle in a particular direction is ca11ed the differential cross section. The probability that the incoming particle is absorbed by the target, ca11ed the absorption cross section, is intimately connected with the scattering cross section. Analyses of scattering give infor- mation on the structure and interactions of atoms, molecules, and elementary particles. One can also study the scattering of acoustic and electromagnetic waves by inhomogeneous media and obstacles, by considering notions similar to the above. Scattering theory may be dated back to Lord Rayleigh. Since the advent of quantum mechanics in the mid-1920s, scattering prob- lems, mainly for central (spherica11y sym- metric) potentials, have been investigated strenuously by physicists. It may be said, how- ever, that a scattering theory having mathe- matica11y rigorous foundations began around the 1950s, when the pioneering work of K. Friedrichs (Comm. Pure Appl. Math., 1 (1948)), A. Va. Povzner (Mat. Sh., 32 (1953)), T. Kato (J. Math. Soc. Japan, 9 (1957)), J. M. Cook (J. Math. Phys., 36 (1957)), and J. M. Jauch (Helv. Phys. Acta, 31 (1958)), among others, appeared, and scattering theory has now grown into a branch of mathematical physics. General references for mathematical scatter- ing theory are, e.g., [1-5]. B. Wave and Scattering Operators In tquantum mechanics the dynamics of an interacting system is given by a tone- parameter group of unitary operators e- itH , where t denotes the time and H, caUed the Hamiltonian of the system, is a tself-adjoint operator acting in a tHi1bert space :Yf. Ele- ments of:Yf represent (pure) states of the sys- tem. Let Ho be the "free" Hamiltonian of the corresponding "noninteracting" system. (There are at present no genera11y accepted definite 375B Scattering Theory criteria for "free" and "noninteracting.") Ho is assumed to be tabsolutely continuous, which is the case in most practical situations. Then the outgoing and incoming wave operators W:!: == W:!:(H, Ho) are defined, if they exist, by W+ = s-lim e;'He-itHo (s-lim= tstrong limit). - t-+:tce This means that given any free motion e-;'Hou there is an initial (t=O) state u:!: (= W:!: u) such that e-;tH u :!: and e-itHou are asymptotica11y equal at t = ::!:: CX). W:!: are tisometric, intertwine the two dynamics: e-itHW:t = W:te-;'Ho, and map :Yf (which is nothing but the tabso- lutely continuous subs pace :Yfac(Ho) for Ho) onto a closed subspace of JFac(H). The scatter- ing operator S is defined as S = W: W_ (A * is the Hilbert-space tadjoint of A). S commutes with Ho and maps states in the remote past into states in the distant future. One of the most important problems in scattering theory is to prove the tunitarity of S, or equivalently, Ran W+ = Ran W_ (Ran = trange = timage). W:t is ca11ed complete if Ran W:t = :Yfac(H). The completeness of W:t implies that S is unitary. As a typical example we consider the I-body problem. Note that the 2-body problem re- duces to the I-body problem by separating out the center-of-mass motion, which is free. Then Ho = - A (the negative tLaplacian in R 3 ), H = - A + V, the operator V being multiplication by a real-valued function Vex), ca11ed the po- tential, and :Yf = L 2 (R 3). If V is short range, i.e., if, roughly speaking, V(x)=O(lxl-1-e) (1::>0) at cx), the wave operators are known to exist and to be complete (S. Agmon, Ann. Scuola Norm. Sup. Pisa, (4) 2 (1975)). If the potential Vex) is long range, i.e., if, roughly speaking, Vex) = O(lxl- C ) (1::>0) at cx), then the foregoing defini- tion of the wave operators has to be modified. For the Coulomb potential V(x) = cjx, for instance, one can adopt the fo11owing defini- tion of modified wave operators: W+ =s-lime;'H exp( -itH o -i(cj2)H o - 1 / 2 1ogt). - t-+:tee It can be shown that the W:!: exist (which implies that the ordinary wave operators do not exist) and are complete: Ran W:!: =:Yfac(H) (1. D. Do11ard, in [6J). The same result obtains for more general long-range potentials (H. Kitada, J. Math. Soc. Japan, 30 (1978); T. Ikebe and H. Isozaki, Integral Equations and Operator Theory, 5 (1982)). If the wave operators exist and are com- plete, they give tunitary equivalence between the tabsolutely continuous parts of Ho and H ( T. Kato [7J; 331 Perturbation of Linear Operators). In the foregoing discussion it was tacitly assumed that in dealing with scattering phe- nomena we adhere to states in :Yfac(Ho) and 
375C Scattering Theory Yfac(H). A more physically intuitive definition in the caSe of potential scattering (Ho = - A, H = Ho + V) of scattering states I; t(H) is the following: f E I: t(H) if and only if for any r > 0, I f T . lim - IIF,e- uH fI1 2 dt=0 T-tooT 0 (II II =L 2 -norm), where F, is the (projection) operator of multi- plication by the characteristic function of {XE R311xI < r}. In general no inclusion rela- tions between I: t(H) and Yfac(H) are known. But for a wide class of potentials it is known that Yfac(H) as well as the tcontinuous sub- space of H coincides with I: t(H) (in this case there is no tsingular continuous spectrum) (W. O. Amrein and V. Georgescu, Helv. Phys. Acta, 46 (1973); C. Wilcox, J. Functional Anal., 12 (1973); Amrein, in [8J). A purely abstract result in scattering theory may be noted. Let Ho and H be self-adjoint operators in an abstract Hilbert space .Yf such that V = H - Ho is a ttrace-class (tnuclear) operator. Then the generalized wave operators Wt(H, Ho)= s-lim htoo exp(itH)exp( -itH o ) Pac(Ho) exist, where Pac(Ho) is the tprojection onto .Yfac(HO)' Since this statement is sym- metric in Ho and H, the "inverse" generalized wave operators Wt(Ho, H) also exist, from which one can conclude that Wt(H, Ho) are complete. Moreover, the invariance principle holds, which means roughly the following: If IjJ is a strictly increasing function on R, then Wt(IjJ(H), IjJ(Ho)) exists and is equal to W+(H, Ho). This result can be applied to poten- tial scattering when V(x)EL 2 (R 3 )n LdR3) (Kato [7J; D. B. Pearson, J. Functional Anal., 28 (1978)). C. Stationary (Time-Independent) Approach We again consider the I-body problem as in Section B. V(x) is assumed to verify certain appropriate decay conditions at 00 as the case may be. Consider the tresolvents Ro(z) = (H O -Z)-1 and R(z)=(H -zr 1 for ZEC -R, which are well-defined bounded integral opera- tors on Yf = L 2 (R 3 ). Here we note the follow- ing: [0, 00) is the tcontin uous spectrum of Ho and H, (-00,0) is contained in the tresol- vent set of Ho, and H has possibly tdiscrete teigenvalues in [ -a, 0) with (-00, -a) con- tained in the resolvent set, where a is a posi- tive number. When z approaches a positive real value, Ro(z) and R(z) do not have limits as tbounded operators on Yf. But if we regard them as operators from L2.y to L2. y (L 2 ..= {u I (1 + Ixl)"u(X)E L 2 (R 3 )}), y> 1/2, they can be shown to have boundary values Ro(A::!: iO) and 1398 R(A::!:iO), A>O (limiting absorption principle;  Agmon, loco cit., for short-range potentials; for long-range potentials  R. Lavine, J. Func- tional Anal., 12 (1973); T. Ikebe and Y. Saito, J. Math. Kyoto Univ., 12 (1972); and Saito, Pub I. Res. Inst. Math. Sci., 9 (1974). With these boundary values we can define "stationary" wave operators whose range is easily proved to coincide with Yfac(H), and show their equal- ity to the time-dependent wave operators discussed in Section B, thus obtaining the completeness of the latter. Ho is known to have generalized (improper) eigenfunctions CPo(x, ) = e ix '  with generalized (improper) eigenvalues 112. The associated eigenfunction expansion is nothing but the Fourier integral expansion. An eigenfunction expansion, similar to the Fourier expansion, that diagonalizes H can be obtained by using generalized eigenfunctions cP I(X,) = CPo (x, ) -(R(I 12::!: iO) VCPo(., ))(x), which are the solutions to the Lippmann- Schwinger equation cP t(x,) = CPo(x,) 1 i e:tilllx-YI -- 4 I I V(y)CPt(y, )dy. n R3 x - Y A rough statement of this is the following: Let Ut() = (2n) -3/2 S CPt (x, ) u(x)dx. Then Ilull = Ilull, (Hun)= 112UW, and u(X) = (2n) -3/2 S cP t(x, )u()d ( e.g. [4, XI.6J for a more precise statement). In view of the fact that S commutes with Ho, we can show that S admits the folIowing repre- sentation: Let a() = (2n)-3/2 S CPo (x, ) u(x)dx be the tFourier transform of u. Then (Sun)=(sa)() =u()-ni f II T(, 1lw')u(Ilw')dw', S2 where T(, ') =(2n) -3 f CPo (x, ) V(x)cp(x, ')dx is the kernel of the so-called T-operator, which is a tcompact operator on L 2 (S2) under suit- able conditions on V(x). T(, 0 is related to the experimentally measurable total cross section (for incident momentum I) (J(): (J()=2n f If(II;w,w'Wdw' (w=/IW, S2 f(A; w, w') = - 2n 2 T(AW, AW') (A > 0). The quantity f(A; w, w') is called the scattering amplitude and appears in the asymptotic 
1399 expansion of qJ :t(x,) as e:tjlllxl A qJ:t(x, )-qJo(x, )+ j(II;x, ), where a=a/lal for aER 3 . An abstract version of the limiting absorp- tion principle and eigenfunction expansion is known as the Kato-Kuroda theory, for which the reader is referred to Kato and S. T. Kuroda in [6] and in Functional analysis and related fields, Springer, 1970, and to Kuroda (J. Math. Soc. Japan, 25 (1973)). D. Time-Dependent Approach Consider the same situation as in Section C. Since scattering is a time-dependent phenom- enon, it seems natural to develop scattering theory in a time-dependent fashion. Indeed, there is an approach to the completeness of wave operators that does not resort to any eigenfunction expansion results, but instead fo11ows the temporal development of the wave packet e-itHu. The completeness of W:t wi11 be established if one can show that any uEYi'ac(H) orthogonal to Ran W:t is 0. A crucial step to prove this is to find a clever decomposition of a wave packet into an outgoing and an incom- ing one, or to find projections P:t such that P+ +p_ =! and P :te-itHo goes to ° as t-+ +00. Some compactness arguments are also needed. To construct such a decomposition or pro- jections one looks at the scalar product x. , x and  being the position and momentum (operators), respectively. The main idea is that if this is positive, the particle wi11 be outgoing (to infinity), and if negative, incoming (from in- finity). But since we work in the framework of quantum mechanics, this classical-mechanical intuition should be properly modified. Besides the completeness of wave operators it can also be shown through the above ap- proach that the singular continuous spectrum of H is absent. For details  [4,XI.17] and V. Enss, Comm. Math. Phys., 61 (1978). E. Partial Wave Expansion In this section we assume that the potential V(x) is central, i.e., V(x) is a function of I xl alone. Then the scattering operator S turns out to commute not only with Ho but also with the tangular momentum operator L = x X i- 1 V (vector product). The eigenvalues of U = L. L (scalar product) are 1(/+ 1) (1=0, 1,2,...), and those of L., the third component of L, are m= -I, -(1-1), ..., I-I, I if L 2 has eigenvalue 1(1 + I), while simultaneous eigenfunctions are given by suitably normalized tspherical har- 375F Scattering Theory monics llm(w), WES 2 . Let E, be the projection onto the subspace spanned by functions of the form =_lu'm(r)llm(w)(r=lxl>O,w=x/r). Yi' becomes an orthogonal sum of Yl/ = EjYi'. The aforementioned commutation property claims that E,S=SE,=S" and the operator  reduces to multiplication by a scalar function e2iI(A) in '*' ( Section C). b,(A) is ca11ed the phase shift. Defining the partial wave scattering amplitude J,(A)=(2iA)-1(e2jI(A) -1), one obtains the partial wave expansion of the scattering amplitude: 00 j(A;W,W')= I (2/+1)J,(A)P,(cosll), 1=0 where II is the angle between wand w', and P, is a tLegendre polynomia1. The total cross section is a(A)=4nA -2 0(21 + 1)1J,(A)1 2 [1-5,9]. F. Many-Body Problem (Multichannel Scattering) We consider only the 3-body case, which is complicated enough compared with the 2- (essentia11y 1-) body case. The complications are both kinematical and dynamical. The configuration of a 3-body system is given by a point in R 9 . Once we choose the center-of- mass coordinates, there is no kinematica11y natural way to choose the remaining six co- ordinates. In the 2-body case a freely moving particle in the remote past wi11 be freely mov- ing in the distant future. But in the 3-body case there come into play various other dy- namical processes, such as capture, breakup, rearrangement, and excitation. The 3-body Hamiltonian is a self-adjoint operator in L 2 (R 9 ) of the form _ 3 1 H= - I -.+ I V.(x.-x.) i=l 2m; I i<j l} l J ' where i is the 3-dimensional Laplacian as- sociated with particle i, m j is the mass of par- ticle i, and each V;ix) (= J.jj(x) = V;i -x)) is a real-valued function decaying at 00 in R 3 (not in R9). If we remove the center-of-mass motion, fj can be written in the form fj = Ho &)! +! &) H (&) = ttensor product), where Ho is the center-of-mass Hamiltonian in L 2 (R 3 ) representing the uniform free motion of the center of mass, and H is the Hamiltonian of relative motion in L 2 (R 6 ). One should note as mentioned above that there is no unique natural way of choosing coordinates in R 6 and representing H, but there are many equivalent representations. Suppose, for instance, that particles 1 and 2 and particles 1 and 3 form tbound states (12) and (13) and that there are 
375G Scattering Theory no bound states between 2 and 3. We partition the whole system into clusters: (1) (2) (3), (12) (3), and (13) (2) (( ) represents a cluster and figures in ( ) are the particles forming the cluster). A channel is a partition into clusters together with a specified bound-state eigen- function. Take, for instance, channel (12) (3), and suppose IjJ E Lz (R 3) is the eigenfunction in question. If we take X=Xz - x, and y=X3- (m l +mz)-I(m l x l +mzxz), then 1 1 H= --A --A + v z( x ) 2m } 2n x 1 ( mIx ) ( moX ) +V Z3 --y +V 31 ---'::-+y , mlmz m 1 +mz where m- 1 =m3 1 +(m 1 +m z )-I, n- I =m1 + mz 1 Let us neglect the interactions between (12) and (3), i.e., set V z 3 = V 31 = 0 to define the cluster decomposition Hamiltonian 1 1 H(1Z)(3)= - 2m ,1y- 2n ,1x+ V 1Z (x). Let .1f(IZ)(3)= Lz(R3) (called the channel Hil- bert space consisting of functions of y), and define a mapping T: Yt('Z)(3) --..1f = Lz(R6) (functions of x and y) by (Tf)(X,y)=IjJ(x).f(y). The channel wave operators It(IZ)(3):t are de- fined by W - s-lim e itH e -itH(l2)(3'T (IZ)(3):t - ,_:ton as isometries from Yt(IZ)(3) into .1f. Their ranges are not expected to coincide as in the 2-body case. It(I)(Z)(3):t and 1t(13)(Z):t are similarly defined. Note that Yt(1)(Z)(3)=.1f. If 7- and fJ are distinct channels, we have Ran 11';,+ .1 Ran I-Vp+ and Ran 11';,- .1 Ran I-Vp_, but no such relations exist between Ran Wa+ and Ran I-Vp_ or Ran 11';,- and Ran W p +. De- fine, for channels 7- and {J, S.p: .1fp--'.1f. by S.p =(Wx+)*W p _. Now the scattering operator S for the 3-body system is defined as the tdirect sum of S.P acting in the Hilbert space Lx ffi .1fx: S = L..P ffi Sxp. Naturally the ques- tion arises: Is S unitary? The first affirmative answer was made by L. D. Faddeev (Israel Program for Scientific Translations, 1965 (in English; original in Russian, 1963)), and later the work of 1. Ginibre and M. Moulin (Ann. Inst. H Poincare, A21 (1974)) and L. Thomas (Ann. Phys., 90 (1975)) came out. The method of these authors is stationary. There have also been some attempts using time-dependent methods. G. Inverse Problem The inverse problem in potential scattering may be formulated at least mathematically as follows: Given the scattering operator or scat- 1400 tering amplitude, determine the potential(s) giving rise to the operator or amplitude. We consider here only the 2-body case (as to the many-body case, almost nothing is known). The central-potential case can be reduced to I-dimensional problems on (0, 00). In the 1- dimensional case the celebrated Gel'fand- Levitan theory ( 112 Differential Operators 0) has long been known and has been success- fully applied even to nonlinear problems such as the tKorteweg-de Vries equation. In the 3-dimensional case, however, the problem becomes difficult; so far there has not been any satisfactory theory comparable to that for the I-dimensional case. The potential V(x) is a function R 3 --. R. The scattering amplitude f(),;w,w') is a function f:R x SZ x SZ--.C. Let M be the mapping that takes V into f The inverse problem deals with M -1. Several ques- tions may be posed (in order of increasing difficulty): (1) Is M one-to-one? (2) When it is known that M is one-to-one and f is in the image of M, how does one (re-)construct the V that yields f? (3) What conditions charac- terize the image of M? Question (1) has been rather satisfactorily answered insofar as short- range potentials are concerned. Concerning questions (2) and (3), attempts have been and are being made to generalize the Gel'fand- Levitan theory, but it may be said that we are still at the beginning stage. References are [2,9,10] and R. G. Newton (1. Math. Phys., 20 (1980); 21 (1981); 22 (1982)). H. Scattering for the Wave Equation Consider the t wave equation U,,- ,1u=O in R 3 . The solution u(t) = u(t, x) is uniquely deter- mined by the initial data {f, g} = {u(O), ut(O)}, and Uo(t){ f, g} = {u(t), u,(t)} defines the solu- tion operator Uo(t). The set of data {[, g} with finite energy: S (Wfl z + Iglz)dx < 00 forms a Hilbert space .1fo. Uo(t) is a unitary group on .1fo. A similar description is possible for solutions of the wave equation in an exte- rior domain Q outside an obstacle with zero boundary condition. Denote the resulting Hilbert space and solution operator by .1f and U(t), respectively. Let l:.1f o -->.1f be the identification operator defined by (1 {f,g} )(x) ={f,g}(x), xEQ. The wave operators are de- fined by W:t =s-limt_:too U( -t)JUo(t). The existence of W x is shown rather easily by using tHuygens's principle. As in Section B, we define the scattering operator S = W: W_ and say that W:t is complete if Ran W t =.1fac(H), where H is the self-adjoint tinfinitesimal gen- erator of U(tJ: U(tJ =e -irlf. The completeness of W:t and the unitarity of S are proved on the basis of the abstract translation representation 
1401 theorem: Let U(t) be a unitary group on a Hilbert space £. Suppose there exist sub- spaces D+ and D_, called outgoing and incom- ing subspaces, such that U(t)D I cD:f for :ttO, nteR U(t)D I ={O}, and UteR U(t)D:f is dense in £. Then we have two unitary operators fJt I :£-+L 2 (R;N), where N is an auxiliary Hilbert space, such that fJt:f U(t). fJtl is right translation by t, and D+ (D_) is mapped onto L 2 (0, 00; N) (L 2 ( -00, 0; N)) by fJt+ (fJt_). Turning to the concrete situation, one can study the detailed properties of S. The unique- ness theorem in the inverse problem is also obtained, to the effect that S determines the obstacle uniquely. The foregoing treatment of scattering is known as the Lax-Phi11ips theory (P. D. Lax and R. S. Phi11ips, in [6,8,11]. References [lJ W. O. Amrein, 1. M. Jauch, and K. B. Sinha, Scattering theory in quantum mechan- ics, Benjamin, 1977. [2J R. G. Newton, Scattering theory of waves and particles, McGraw-Hi11, 1966. [3J E. Prugovecki, Quantum mechanics in Hilbert space, Academic Press, 1971. [4J M. Reed and B. Simon, Methods of mod- ern mathematical physics III, Scattering theory, Academic Press, 1979. [5J 1. R. Taylor, Scattering theory, Wiley, 1972. [6J Rocky Mountain 1. Math., 1 (1) (1971) (special issue on scattering theory). [7J T. Kato, Perturbation theory for linear operators, Springer, 1976. [8J 1. A. Lavita and 1.-P. Marchand (eds.), Scattering theory in mathematical physics, Proc. NATO Adv. Study Inst., Denver, Colo., 1973, Reidel, 1974. [9J V. De Alfaro and T. Regge, Potential scattering, North-Holland, 1965. [IOJ K. Chadan and P. C. Sabatier, Inverse problems in quantum scattering theory, Springer, 1977. [l1J P. D. Lax and R. S. Phillips, Scattering theory, Academic Press, 1967. 376 (XIX.15) Scheduling and Production Planning Production planning emerges in many situ- ations. Models of economic planning can be classified as (1) fiscal policy-oriented, (2) final demand-oriented, (3) structure-oriented, (4) 376 Scheduling and Production Planning expenditure-oriented, and (5) industrialization- oriented. Types (2), (3), and (5) belong to pro- duction planning in a broad sense, as emphasis is placed on production in these models. A typical production planning theory of primary importance is activity analysis, which has made remarkable progress since its initi- ation by T. C. Koopmans [1]. Its principal theoretical content consists of tlinear pro- gramming. Most applications of linear pro- gramming are more or less concerned with production activities. Because of the additivity and divisibility of production as well as the limitation of production intensities, problems of production planning can be formulated as problems of linear programming. The meth- ods of tlinear algebra are used to obtain an optimal production plan and are very impor- tant in modern economic analysis, because these methods not only provide practical algorithms but also clarify the role of price, especially in tduallinear programming problems. The originators of general equilibrium theory ( 128 Econometrics) failed to give an analytical demonstration of the existence of solutions of certain systems of equations of economic relevance. The existence of a deter- minate equilibrium was established first by A. Wald for a system of equations of the Walras-Cassel type. On the other hand, 1. von Neumann [3J proved the existence of non- negative solutions rx, fJ, Xi' Yj for a system of inequalities m m rx L aijxi L bijXi' j= 1,2, ...,n, i=l i=l n n fJLaijYjLbijYj, i=1,2,...,m, j=l j=1 by reducing the problem to the proof of the existence of a tsaddle point ( 292 Nonlinear Programming A) of the function m n m n <13(X, Y)= L L bijxiYj/L L aijxiYj i=l j=1 il j=l by means of Brouwer's fixed-point theorem. In this result an equilibrium is defined in the broad sense that demand for goods does not exceed their supply, rather than requiring exact balance. A second important kind of production planning is related to both tinventory control ( 227 Inventory Control) and sales planning. An example is the minimization of IT (z(t)+ fJmax(dz/dt, O))dt subject to the condition z(t)  r(t), where z(t) and r(t) are the output and demand, respec- tively, at time t. Stabilization of employment 
376 Scheduling and Production Planning and production can also be classified as a production planning problem of this kind, in which inventory hOlding is considered as a means for lessening the change of employment level. This is related to the problem of smooth- ing production by inventory control. Dynamic programming ( 127 Dynamic Programming) is very useful in dealing with problems of smoothing production. Production planning as a production man- agement tool is often embodied in scheduling. Consider a project that consists of n indivi- sible tasks (jobs or activities) J i , i= 1,2, ...,n, each requiring Pi units of time for processing, where Pi is given either deterministica11y or probabilistically. A precedence constraint (genera11y a tpartial ordering) partia11y specify- ing the order in which these tasks are to be processed is also imposed by technical con- siderations: One attempts to find a schedule (i.e., a specification of the time to process the tasks JJ consistent with the given precedence constraint. Well-known techniques developed for this purpose are PERT (program evaluation and review technique) [4J and CPM (critical path method) [5J, in which the precedence con- straint is represented by an acyclic tdirected graph, ca11ed an arrow diagram, a project network, or a PERT network, such that each task J i corresponds to an arc of length equal to Pi' An arrow diagram is i11ustrated in Fig. 1. The longest path from the start node to the end node in the network is ca11ed the critical path, and gives the minimum time necessary to complete the project. Fo11owing computation of the critical path by means of dynamic pro- gramming, computations are also made for the earliest (latest) start time, the earliest (latest) finish time, and the floats (i.e., the a110wances for such start and finish times) of each task to be satisfied in order to complete the project within the indicated minimum time. These are used to review and control the progress of the project. In PERT the processing time of each task is probabilistica11y treated on the basis of three estimates: most likely, optimistic, and pessimistic. From these data other parameters, Fig. 1 A, B, ..., M denote tasks, while the associated in- tegers are their processing times. Bold arrows indi- cate the critical path. The start node is on the left, the end node is on the right. 1402 such as the probability of completing the project before the specified due date, are com- puted. In CPM, on the other hand, a mini- mum cost schedule to attain the given due date is obtained by utilizing tnetwork flow algorithms (1. E. KelIey [6J, D. R. Fulkerson [7J), in which the processing time of a task is determined by linear interpolation between the normal time (achieved with low cost) and the crash time (high cost). PERT and CPM are used in various areas of application, e.g., civil engineering and the construction industry, shipbuilding, produc- tion of automobiles, machines, and electric apparatus, and management of research and development programs. PERT was origina11y developed by the US Navy to monitor and control the development of the Polaris fleet ba11istic missile program, while CPM was developed by the RAND Corporation and the Du Pont Corporation, both in the late 1950s. Computers have been essential from the begin- ning, to handle the large amount of associ- ated data. A number of application program packages, each with some additional features, are currently available, e.g., PERT/TIME, PERT/COST, CPM, and RAMPS. The machine sequencing (scheduling) prob- lem arises when the resources, instruments, workers, and so forth, required to process a task are abstractly formulated as machines and if the restriction on the number of avail- able machines is taken into consideration (i.e., the conflict between tasks competing for the same machine at the same time must be resolved). Usually one machine is assigned to each task. Such a machine is either (a) uniquely determined for each task or (b) chosen from a given set of machines; in the latter case, there might be (i) paral1el machines with the same capability or (ii) machines with different capabilities. The precedence constraints are also ramified into independent (i.e., no con- straint), in-tree, out-tree, series-paralIel, and general partial ordering constraints. Each task has a ready time (release time) 'i such that J i cannot be processed before it, and a due time d i . One is asked to find a schedule satisfying the above machine constraints, precedence constraints, and ready time constraints, while considering an optimality criterion that is a function of the completion time C i of J i (i = 1,2, .., ,n). Typical criteria for minimization are: (1) maximum completion time (makespan) C max = maxi Ci; (2) flowtime (total completion time) F = L C i ; weighted flowtime L WiC" where Wi?O are weights representing the rela- tive importance of J i ; (3) maximum lateness Lmax =maxiLi, L i = Ci-d i ; (4) total tardiness T = L 7;, 7; = max(O, LJ, and weighted total 
1403 tardiness  Wi 1;; (5) number of tardy tasks V =  Vi' Vi = 1 (if C i > d i ), 0 (otherwise), and weighted number of tardy tasks  Wi Vi. Numerous problems can be defined by combining the above conditions. Typical ones might be: the job-shop scheduling problem, in which n tasks are scheduled on m machines of type (a), and where the maximum completion time is minimized; the flow-shop scheduling problem, which is the same as the job-shop scheduling problem except that n = n'm tasks are divided into n' groups of m tasks processed on machine 1, machine 2, ..., machine m, re- spectively, in this order; the multiprocessor scheduling problem, in which the maximum completion time of n independent tasks on m para11el machines is minimized; the one- machine sequencing problem, assuming only one machine (with various types of precedence constraints and optimality criteria), and others [8,9]. These machine sequencing problems are examples of the combinatorial optimization problem ( 281 Network Flow Problems E), as the processing time Pi is usua11y considered to be a given constant. Their computational complexity ( 71 Complexity of Computa- tions) has been extensively studied with an emphasis on the classification between those problems solvable in polynomial time and those that are tNP-complete, as summarized in [10]. Table 11ists representative results for one-machine sequencing problems with ri=O (i = 1,2, ..., n). The improvement of the algo- rithm efficiency is pursued for both polynomi- a11y solvable problems and NP-complete prob- lems. tBranch and bound ( 215 Integral Programming D) is a common approach used to solve NP-complete problems such as the 376 Ref. Scheduling and Production Planning job-shop and flow-shop scheduling problems [8,9]. Many approximation algorithms to obtain good suboptimal schedules in reason- able computation time are also known, and their worst-case and average accuracies have been analyzed [10], as these are important in practical applications. In more realistic scheduling situations, other factors, such as the set-up cost, balanc- ing of production lines, frequent modifications and updatings of project data, capacity of factories, manpower planning including the possibility of overtime and part-time employ- ment, should be taken into account. Both deterministic and probabilistic models have been proposed for these cases. Mathematical tools used to compute adequate schedules include tmathematical programming, tqueu- ing theory, and tsimulation techniques. References [1] T. C. Koopmans, Activity analysis of pro- duction and a11ocation, Wiley, 1951. [2]0. Morgenstern, Economic activity analy- sis, Wiley, 1954. [3] J. von Neumann, Dber ein okonomisches Gleichungssystem und eine Vera11gemeinerung des Brouwerischen Fixpunktsatzes, Ergebnisse eines Mathematischen Ko11oquiums, 8 (1937), 73-83; English translation, A model of general economic equilibrium, Co11ected Works, Per- gamon, 1963, vol. 6, 29-37. [4J D. G. Malcolm, 1. H. Reseboom, C. E. Clark, and W. Fazar, Application of a tech- nique for research and development program evaluation, Operations Res., 7 (1959), 646-669. Table 1. One-Machine Sequencing Problems with ri=O Optimality Precedence Other Criterion Constraint Constraints C max Partial order None Ci Partial order None WiCi Series-parallel None Partial order Pi= 1 (i= 1,2, ...,n) Lm>x Partial order None 1; Independent None Partial order None wi1; Independent None Vi Independent None In-tree, out-tree Pi= 1 (i= 1,2, ...,n) Complexity O(n 2 ) NP-complete O(nlogn) NP-complete O(n 2 ) Not known> NP-complete NP-complete O(nlogn) NP-complete O(nlogn) NP-complete b  Wi Vi Independent Pi < Pj => Wi  w j Independent None a. An algorithm with O(n 5 Pm>x) running time is known, where Pm>x = maxi Pi' b. An algorithm with O(nPi) running time is known. 
377 A Second Quantization [5J 1. E. Kelley, Jr., and M. R. Walker, Critical path planning and scheduling, Proc. Eastern Joint Computer Conference, 1959, 160-173. [6J 1. E. Kelley, Jr., Critical-path planning and scheduling: Mathematical basis, Operations Res., 9 (1961), 296-320. [7J D. R. Fulkerson, A network flow computa- tion for project cost curves, Management Sci., 7 (1961),167-178. [8J R. W. Conway, W. L. Maxwell, and L. W. Miller, Theory of scheduling, Addison- Wesley, 1967. [9J K. R. Baker, Introduction to sequencing and scheduling, Wiley, 1974. [1 OJ R. L. Graham, E. L. Lawler, 1. K. Len- stra, and A. H. G. Rinnooy Kan, Optimization and approximation in deterministic sequenc- ing and scheduling: A survey, Ann. Discrete Math., 5 (1979), 287-326. 377 (XX.27) Second Quantization A. Fock Space For a complex Hilbert space K with dimK? 1, K@n denotes the n-fold tensor product of K with itself (where the vectors f1 (8) ... (8) fn with fjE K are total). Let E) be the projection operators on totally symmetric and antisym- metric parts of K@n: E>U1 (8) .., (8) fn) =(n!)-1 L:G ct (P)fp(1) (8) ... (8)fp(n), where the sum is over all permutations P, G+(P) = 1 and L(P) is the signature of P (+ 1 for even permutations and -1 for odd permu- tations). The following orthogonal direct sum is called a Fock space (symmetric for E+, and antisymmetric for E_): 00 %+(K)= L:(j) E)K@n. - n=O- Here the term for n = 0 is the I-dimensional space C, and a vector Q represented by 1 E C is called the vacuum vector in %ct(K). The sub- space %:+:(K)n=E)K@n is called the n-particle subspace. The operator N = (j)o n, which takes the value n on %:+:(K)n, is called the num- ber operator. On the algebraic sum N Die =U L:(j) %:+:(K)nc%:+:(K), N n=O the creation operator a + (f) for f E K is defined as the unique linear operator with domain D:+: 1404 satisfying a+(f)E)f1 (8)... (8)fn =(n+ 1)1/2E+I)f(8)f1 (8)... (8)fn. For fEK, J denotes the element of the dual K* satisfying 1(g) = (gJ) for 9 E K (the inner product is linear in the first entry). The annihi- lation operator a(JJ is defined by a(J) E)f1 (8) ... (8) fn n - -1/2" j-1 ( (' f)E (n-1) f 10-.. . IO-.. f, -n L.,E Jj, ::!:: l\0J"'j...\0J", j1 where the tensor product of fb k i= j, appears in thejth term and G=:t1 depending on which of:t is taken in %:+:(K). For n=O, a(JJQ is defined to be O. The adjoint of a+(f) coincides with a(JJ on Die' The creation and annihilation operators map D ct into itself and satisfy the following commutation relations on Die: [a+(f1),a+(f2)h = [aCfr),a(h)h =0, [aCfr),a+(f2)h = (f2J1), where [A,Bh =AB+BA and + is used de- pending on the choice of:t in %:+:(K). These relations are often called canonical commuta- tion relations for [ , J _ (CCRs) and canonical anticommutation relations for [ , J + (CARs). On %_(K), a+(f) and a(JJ are bounded with Ila+(f)11 = Ila(JJII = Ilfll. On .'#'+(K), both a + (f) and a(JJ are not bounded, though a+(f)N- 1 / 2 and a(JJN- 1 / 2 are bounded. On %+(K), 2- 1 / 2 (a+(f)+a(J)) IS essentially self-adjoint. Let Ij;(f) be its closure. The oper- ator W(f)=e i1p (f) is unitary and satisfies the identity W(fd W(f2) = W(f1 + f2)exp( - ilm(f1 J2)/2). It also satisfies (W(f)Q, Q) = exp( -4 -lllfI1 2 ). Let K", be a real subspace of K such that the inner product (1, g) in K is real for any f and 9 in K", and K = K", + iK",. (K", is then a real Hilbert space.) The unitary operators U(f)= W(f) and V(f)= W(if) for fEK", satisfy the following Weyl form of the CCRs: U(f1)U(f2) = U(f1 + f2), V(f1)V(f2) = V(f1 + f2), U(f1) V(f2) = V(f2)U(f1)exp( - i(f1J2))' The infinitesimal generators of the continu- ous one-parameter groups of unitaries U(tf) and V(tg) (tER) are denoted by cp(f) and n(g) and satisfy the following CCRs: [CP(f1), CP(f2)J'¥ = [n(gl)' n(g2)J'¥ =0, [cp(f), n(g)J'¥ = i(f, g)'¥, where [A,BJ=AB-BA and '¥ED+. 
1405 If Q is a linear operator on K, r(Q) denotes the linear operator on jO:!:(K) defined as the closure ofEBoQ@.ID:!:. It is bounded if IIQII:( 1. r(Ql)r(Q2)= r(Ql Q2) on D-r' If His a self-adjoint operator on K, then r(e itH ) = expitdr(H) defines a self-adjoint operator dr(H) on jO-r(K), usua11y ca11ed a bilinear Hamiltonian and denoted (a+, Ha). More explicitly, dr(H)E)f1 @ ...@f. . = L E)fl @ ... @ Hh@... @f.. j=l If U is a unitary operator on K, then r(U) W(f)r(U)-l = W(UJ). B. Second Quantization A single (scalar) particle in quantum mechan- ics is described by a wave function 'P(x), XE R 3 , considered as a unit vector in a Hilbert space K =L 2 (R 3 ). The system consisting of n such identical particles is described by a tota11y symmetric function 'P(Xl>.'" x.), xjER3, con- sidered as a unit vector in the tota11y sym- metric part E) K@. of the n-fold tensor prod- uct of the one-particle Hilbert space K, where the restriction to tota11y symmetric wave func- tions is referred to as Bose statistics. In a non- relativistic system, the Hamiltonian operator on a I-particle space is T= -h 2 (2m)-1 L\, ca11ed the kinetic energy (Ax denotes the La- placian); on an n-particle space it is typica11y given by . . H.= -h 2 (2m)-1 L Ax+ L V(xi-x j ), j=l J i<j where V is a 2-body potential. The totality of multiparticle spaces E) K@. can be described in terms of the Fock space jO+(K), the vacuum vector Q (no-particle state), and the annihilation and creation oper- ators, denoted by a(J) f 'P(x) f(x)d 3 x, a*(J) = f 'P+(x)f(x)d 3 x. Since the CCRs (for operator-valued distributions) ['P(x), 'P(y)] _ = ['P+(x), 'P+(y)] _ =0, ['P(x), 'P+(y)] _ =b 3 (x - y) are a continuous generalization of CCRs for canonical variables in quantum mechanics and since 'P(x) comes from the wave function by way of quantization, the above formalism is 377C Second Quantization ca11ed second quantization. The Hamiltonians H. for a11 n can now be combined into the expression H= f 'P+(x)T'P(x)d 3 x +f 'P+(x)'P+(y)V(x-y) x 'P(x)'P(y)d 3 xd 3 y, where the first term is dr(T). For particles such as electrons the system of n identical particles is described by a tota11y antisymmetric wave function, the total anti- symmetry being referred to as Fermi statistics. Then the antisymmetric Fock space jO_(K) can be used in exactly the same manner as jO+(K) in the preceding case. The method of second quantization was introduced by P. Dirac [2] for the case of bosons and extended by P. Jordan and E. Wigner [3] to fermions. Electromagnetic waves, when quantized in this way, represent a system of photons, and the quantization of electron waves leads to the particle picture of the electron. The method of second quantiza- tion is intimately connected with the notion of fields, as shown below for free fields, and is the basis of the perturbation approach in field theory ( 150 Field Theory). C. Free Fields Let (jj (j = 1,2,3) be tPauli spin matrices and (jo = (6 ?). Let p = =o (j"P" for a 4-vector P = (po,p) with p=(p1,p2,p3) and pO=(m 2 +p2)1/2. Let uj(a, A) be the irreducible unitary repre- sentation [m+,j] of9! on a Hilbert space K j =L 2 (R 3 ,C 2 j, (2 p O)-1(mjj5)@ 2j d 3 p) (258 Lorentz Group C (3)). Consider first the symmetric Fock space jO+(K o ). For any complex-valued rapidly decreasing COO-function f (f E g'(R 4 )), let J(p)= (2n)-3/2 f e ip ' x f(x)d 4 x (p=(po, p), pO =(p2 + m2)1/2), A(f)=a+(J)+a(J) (= f A (x)f(x)d 4 x ). where p' x = pO XO - J=l pj x j and the bar denotes the complex conjugate. Then A(x) as an operator-valued distribution satisfies the tWightman axiom and is ca11ed the free scalar field of mass m. It satisfies the Klein-Gordon equation (Dx+m 2 )A(x)'P=0 (Dx=(a;ax O )2_ j t1 (a;ax j )2} 
377D Second Quantization it has the 4-dimensional scalar commutator i[A(x),A(y)]_ '1"= Am(x-y)'P, and it has the two-point function i(A(x)A(y)Q, Q) = A;:; (x - y). Here 'I" is any vector in D+; for example, Q is the vacuum vector, and the invariant distri- butions Am and A;:; are defined by A;:;(x)=i(2n:)-3 f e- ip X(2 p O)-ld 3 p, Am(x)= (2n:)-3 f(Sin p .X)(pO)-ld 3p . If we define U(a, A(A»= quo (a, A», then U(a, A)A(x)U(a, A)* =A(Ax+a). If gEY(R 3 ) and hEY(R 3 ), g(p) and h(p) obtained by substituting g(x)b(xo) and  h(x)b'(xo) into j in the defining equation of 1 above are in Ko. We then define cp(g)=a+@+a(g) (= f cp(X)9(X)d 3 X) , n(h) =a+(h)+ a(h) (= f n(X)h(X)d 3 x). The operator-valued distributions cp(x) and n(x) are the canonical field and its conjugate field at time ° for the free scalar field and satisfy the following canonical commutation relations: [cp(x), cp(y)] _ 'I" = [n(x), n(y)] _ 'I" =0, [cp(x), n(y)] _ 'I" = iP(x - y)'P. If we set T(t)= U(te o , 1) for e o =(I,O,O,O) and (ex@g)(x)=ex(XO)g(x) for exEY(R) and gEY'(R 3 ), then for 'PED+ A(ex@g)'P= r'oo T(t)cp(g)T(t)*'Pex(t)dt, - A (ex' @ g)'P = LWw T(t)n(g) T(t)*'Pex(t) dt, or, equivalently, A(x)= T(xo)cp(x)T(xo)*, iJA(x)jiJxo = T(xO)n(x)T(x°)*. If A(x) is a classical field, then cp(x) and n(x) are the value of A(x) and its time derivative at XO = 0, and they serve as initial data for the Klein-Gordon equation (D x +m 2 )A(x)=0. Consider next the antisymmetric Fock space .'F_(K 1/ 2 @ K 1/2)' For ft E Y(R 4 , C 2 ) (C 2 -valued rapidly decreasing COO-functions), write j = (f+ J-), f+ = UI' f), f - = U3J4), define It 1406 as before, and let i/lU) = m l/2 {a«f- + (p/m)f+) ffi 0) + a+(O ffi (0"2J+ -(pjm)0"2J-))) = X t 1 f i/lx(x)jx(x)d 4 x, where 0"2 = (? -) = - iE. Then i/lx(x) as an operator-valued distribution satisfies the tWightman axiom and is caned the free Dirac field of mass m. In the present formulation, {i/lt. i/l2} is a contravariant spinor of rank (1,0) and {i/l3' i/l4} is a covariant spinor of rank (0,1). This field satisfies the Dirac equation (Y(OjiJX)+im ) i/I(x) =0, as wen as the relations [i/lx(x), i/lp(y)] + 'I" =0, [i/lx(x), i/lp(y)*] + 'I" ={(Y(iJjiJx)-im)yOL Am(x-y), (i/lx<x)i/lp(y)*Q, Q) ={(Y(iJjiJX#)-im )yOL A;:;(x- y). Here 'PED+, Am and A;:; are as described above, and the y's are Dirac matrices in the following form (somewhat different from but equivalent to the usual form;  351 Quantum Mechanics): ° ( 0 1 ) y = 1 0' . ( 0 y1= O"j -} and the O"'S are tpauli spin matrices. D. Coherent Vectors and Exponential Hilbert Space In the symmetric Fock space .'F+(K), a vector of the form 00 expj=Q ffi I EB (n!)-1/2j0 n E .'F+(K) n:::::l for j E K is called a coherent vector. The set of exp f is linearly independent (in the algebraic sense) and total. The inner product is given by (expj1' expj2) =expUI J2)' Conversely, we can define .'F+(K) abstractly by introducing this inner product into the formal linear combinations of expf, jEK, and by completion. In this sense, .'F+(K) is also de- noted as expK and is called an exponential Hilbert space [5]. Then exp I EB K j = @expK j , 
1407 where exp EB fj is identified with (8) expfj. If the number of indices is infinite, the right- hand side is the incomplete infinite tensor product containing the product of the vacuum vector Q. If K = SaK.dJ.l(A) and the measure J.l is non atomic, then for any measurable set S in B, there corresponds a decomposition expK = (expK(S))(8) (expK(SC)), where K(S)= SsK. dJ.l(A) and SC is the complement of Sin B, and an associated von Neumann algebra R(S)=B(K(S)) (8) 1, where B(K(S)) is the set of all bounded linear operators on K(S). The system {R(S)} forms a complete Boolean lat- tice of type I factors on exp K. Coherent vec- tors are characterized by the property of being a product vector for {R(S)} in the sense that for any S, A E R(S) and A' E R(SC), the vector 'I' = exp f satisfies (AA''I', '1')11'1'11 2 = (A'I', 'I')(A''I', '1'). In this sense, we can interpret exp K as a con- tinuous tensor product of exp K. and also, if 'I' = S'I'. dJ.l(A), exp'l' as a continuous tensor product of exp'l'. [5]. References [IJ V. Fock, Konfigrationsraum und zweite Quantelung, Z. Phys., 75 (1932), 622-647. [2J P. A. M. Dirac, The quantum theory of the emission and absorption of radiation, Proc. Roy. Soc. London, 114 (1927), 243-265. [3J P. Jordan and E. P. Wigner, Dber das Paulische Aquivalenzverbot, Z. Phys., 47 (1928),631-651. [4J F. A. Berezin, The method of second quan- tization, Academic Press, 1966. (Original in Russian, 1965.) [5J H. Araki and E. J. Woods, Complete Boolean algebras of type I factors, Pub\. Res. Inst. Math. Sci., A2 (1966), 157-242, 451 452. [6J 1. Avery, Creation and annihilation oper- ators, McGraw-Hili, 1976. 378 (XII.14) Semigroups of Operators and Evolution Equations A. Introduction The analytical theory of semigroups was inaugurated around 1948 in order to define exponential functions in infinite-dimensional 378B Semigroups of Operators, Evolution Equations function spaces. Then it was generalized to the theory of evolution equations as ordinary differential equations in infinite-dimensional linear spaces. B. The HiIIe- Y osida Theorem Let X be a tlocally convex topological linear space, and denote by L(X) the totality of con- tinuous linear operators defined on X with values in X. A family {1; I t  O} of operators 1; E L(X) is called a (one-parameter) semigroup of class (Co) or a strongly continuous semigroup if it satisfies the following two conditions: (i) 1; 1'. = 1;+s (the semigroup property), To = I (the identity operator); and (ii) Iim h ,o 1;x = 1;ox (VXEX, Vt o O). When X is a Banach space, (ii) is implied by w-lim,!O 1;x=x (VXEX), as proved by N. Dunford in 1938. In this case there exist constants M > 0 and f1  0 such that (iii') II 1; II  M e P ' (Vt  0). Hence, considering e -P' 1; in place of 1;, we can assume the equi- continuity: (iii) For any continuous seminorm p on X, there exists a continuous seminorm q on X such that p(1;x)q(x) (VXEX, VtO). Such semigroups are called equicontinuous semigroups of class (Co) (abbreviated e.c.s.g. (CO)). Example 1. X = Lp(O, (0) with 00 > p  1. (1;x)(s) = x(t +s). Example 2. X=Lp( -00, (0) with oo>p 1. f '" ( s-uf ) (1;X)(s) = (2n:t)-1/2 exp - x(u)du, -00 2t =x(s), t=O. t>O, Example 3. X = BC( - 00, (0). 00 (Att (1;x)(s)=e-AtkO k!x(s-kJ.l), tO. Here A and J.l are positive constants. (For these examples, we have II 1; II  1; hence (iii) is satis- fied.) For Lp and BC  168 Function Spaces. We assume in the remainder of the article that X is sequentially complete, that is, if a sequence {xn} of X satisfies Iimn.moo p(x n - x m ) =0 for every continuous seminorm p on X, then there exists a unique XEX such that limnoo p(x - x n ) = O. The infinitesimal generator A of an e.c.s.g. (CO) {1;1 tO} is defined by Ax=limt- 1 (1;-l)x. (1) ,!O (This is also called the generator of 1;.) Then we have the following results. 
378 C Semi groups of Operators, Evolution Equations (I) Differentiability theorem. For every complex number A with Re).>O, the resolvent (Ai-A)-I E L(X) exists and (AI-A)-I X = t" e-MT,xdt ('tXEX), (2) where the integration is Riemannian. Hence the domain D(A) of A is dense in X and coin- cides with the range R((AI _A)-I), and A is a closed linear operator such that the family W.().I - A)-l)n I A >0, n=O, 1,2,... } (3) is equicontinuous. (II) Representation theorem. Let I n = (I - n -1 A) -1 and consider the approximations to T,: 00 T,(n)x=e-n' I (m!)-l(ntJn)m x , m=Q f: n ) x = (In,_,)n x. Then T,x = lim T,(n) x = lim fn) x n-+oc; n--+oo uniformly on every compact set of t. (III) Converse theorem. Let a linear opera- tor A with both dense domain D(A) and range R(A) in X satisfy the condition (nI -A)-IE L( X) for n = 1, 2, '" . Then a necessary and sufficient condition for A to be the infinitesi- mal generator of an e.c.s.g. (Co) is that the family of operators {(I -n-1A)-ml n = 1,2, ...;m=O, I,...} (3') be equicontinuous. Since such a semigroup {T,lt?O} is uniquely determined by A, we can write T,=exp(tA). These three theorems together are called the Hille- Y osida theorem or the Hille- Y osida- F eller- Phillips- M iyadera theorem. Examples of Infinitesimal Generators. A = d/ds for example 1 above, A = 2 -1 d 2 /ds 2 for example 2, and (Ax)(s) = ),(x(s -.u) - x(s)) for example 3. C. Groups An operator A in a Hilbert space X gener- ates a group {T, I -00 < t < oo} oftunitary operators of class (Co) satisfying T, 1'. = T,+s for -00 < t, s< 00 if and only if A is equal to iH for some tself-adjoint operator H (M. H. Stone's theorem, 1932). In a 10cally con- vex space, a necessary and sufficient condi- tion for a given e.c.s.g. (CO) {T,lt?O} to be extended to an equicontinuous group of class (Co) {T,I -00 < t< oo} is that thefamily (3') be equicontinuous also for n = ::I: 1, ::1:2, ... . 1408 D. Holomorphic Semigroups For an e.c.s.g. (Co) {T, I t ?O}, the following three conditions are equivalent (K. Y osida, 1963; the equivalence between (ii) and (iii) for Banach spaces was proved earlier by E. Hil1e, 1948): (i) When t>O, T,'x=limh- 1 (T,+h- T,)x h-O (5) exists for all XEX and {(CtT,')n I n = 1,2, ... and 0< t  I} is equicontinuous for a certain constant C>O. (ii) When t>O, T, admits a convergent expansion T A given 10cally by TAx =L.o(A-t)"T,(n)x/n!. The extension exists for largAI arctan(Ce-l), and the family of operators {e-AT;.} is equicontinuous in A for largAIarctan(2-kCe-1) with some positive constant k. (iii) For the infinitesimal generator A of T" there exists a positive constant C l such that {(C1A(Al -A)-I)"} is equicontinuous in n= 1,2,... and in.le with Re(.Ie)? 1 +6,6>0. An e.c.s.g. (CO) {T,} satisfying the above condi- tions is caned a holomorphic semigroup. For example, introduce f <1+iOO fr..p.) = (2n:i)-l . eZA-'z" dz, O'-roo A?O, t>O, a>O, O<()« 1, =0, ).<0, (6) where the branch of z' is taken so that Re z' > ° for Rez>O. Following S. Bochner (1949), we define f".x = f,x = 1 00 fr..(s) T.,x ds, t > 0, =x, t=O, (7) from a given e.c.s.g. (CO) {T,I t?O}. Then {f,..1 t?O} is a holomorphic semi group (Yo- sida, T. Kato, and A. V. Balakrishnan, 1960). Its infinitesimal generator A. can be consid- ered as the fractional power (-A)" of -A, multiplied by -1. Fractional powers ( -A)", ()(E C, of operators have also been defined for operators A satisfy- ing the weaker condition than (3') that {A(A- A) -11 A>O} is equicontinuous (Balakrish- nan, H. Komatsu). If A is such an operator, - J=A generates a holomorphic semigroup and the unique uniformly bounded solution of the "elliptic" equation x;/ = - Ax" t>O, (8) limx,=x o <-0 is the solution of x;= -J=A x" t>O, limx,=x o , '_0 and therefore XI = exp( -tJ=A)xo (Balakrish- nan). Equation (8) has also been discussed by 
1409 M. Sova and H. O. Fattorini from a different point of view. E. Convergence of Semigroups Let a sequence {exp(tA n ) I n = 1,2,... } of e.c.s.g. (CO) be equicontinuous as a family of operators EL(X). Then a necessary and sufficient con- dition for there to exist an e.c.s.g. (Co) exp(tA) such that lim n _ oo (exp(tA n ))x = (exp(tA))x uni- formly on every compact interval of t is that limn_oo(A.oI -An)-l x=J;.ox exist (for some ,1,0 with Re ,1,0> 0 and for a11 x E X) and be such that R(J;.) is dense in X (H. F. Trotter, Kato). F. Miscellaneous Semigroups (i) Distribution semigroups. The semigroup of translations (T,x)(s) =x(t + s) in X = Loo( -00,00) is not continuous and hence not measurable in t. However, T,x is an X-valued distribution. For semigroups {T,} such that T,x is an X-valued distribution for XEX, an ana- log of the Hi11e- Y osida theorem is known (J.-L. Lions, 1960). It has been generalized to ultradistribution semigroups by 1. Chazarain and to hyperfunction semigroups by S. Ouchi. (ii) Dual semigroups. The above semigroup {T,} of translations in Loo( -00,00) is obtained as T, = S,* from the e.c.s.g. (Co) {S,} defined by (S,x)(s)=x(s-t) in Ld -00,00). Let B=djds be the infinitesimal generator of {Sf}' The restric- tion of {S,*} to the space of uniformly con- tinuous functions, which is the closure of the domain D(B*) of the dual B* in Loo( -00,00), is an e.c.s.g. (CO). This fact holds for the semi- group {S,*} of an e.c.s.g. (Co) {S,} in a Banach space X in general (R. Phi11ips, 1955) and also in a 10ca11y convex space. (iii) Locally equicontinuous semigroups. The infinitesimal generator A of the semi group of translations (T,x)(s) = x(t + s) in X = C( -00,00) is djds. A has no resolvent since all complex numbers are eigenvalues of A. {T,} is not equicontinuous but 10ca11y equicontinuous, i.e., {T, I 0  t  t I} is equicontin uous for any t 1 >0. For 10ca11y equicontinuous (Co) semi- groups an analog of the Hille- Y osida theorem is obtained by using the notion of generalized resolvents (T. Komura, 1968; Ouchi, 1973). (iv) Differentiable semigroups. The notion of the holomorphy of semigroups in Section D is weakened to the differentiability. A character- ization of a semi group {T,} such that T,x is infinitely differentiable in t >0 is given by using the resolvent of the infinitesimal gen- erator (A. Pazy, 1968). (v) Nonlinear semigroups. For a (CO) semi- group {T,} of contractions (i.e., II T,x- T,yll  378H Semigroups of Operators, Evolution Equations II x - y II for x, y E X) in a Hilbert space X, an analog of the Hille- Y osida theorem is known (Y. Komura, 1969). This result has been par- tia11y extended to Banach spaces ( 286 Non- linear Functional Analysis X). G. The Evolution Equation Let T,=exp(tA) be an e.c.s.g. (Co). Then for xED(A), T,' x = A T,x( = T,Ax). (9) Considered in suitable function spaces, the tequation of heat conduction (A ==the tLaplacian), the tSchrodinger equation (A = F (- V(x))), and the twave equation given in matrix form (:)' =(::)=G )(:) are a11 of the form (9). For a linear operator A, in X depending on t, the ordinary differential equation in X x;=A,x,+ f(t), t);O, (10) is ca11ed the evolution equation. A family of operators {V(r,s)lr);s);O} in X which gives general solutions to the homogeneous evolu- tion equation x;=A,x, (11) (i.e., for any S);O, aED(A s ), x,= V(t, s)a is a solution to (11) for Xs = a) is ca11ed the evolu- tion operator associated with the generators {A,}. An evolution operator {V(r,s)} satisfies (i) V(r, r)= I, (ii) V(r, s) V(s, t)= V(r, t). The solution to (10) is forma11y expressed by x,= V(t,O)xo+ L V(t,s)f(s)ds. (12) H. Integration of the Evolution Equation For equation (11) we have the fo11owing result (Kato, 1953; Y osida, 1966). Assume the fo11ow- ing four conditions: (i) D(A,) is independent of t and dense in the Banach space X, and for a11 IX>O, (I -IXA,)-1 EL(X) with the estimate 11(1- IXA,)-111  1; (ii) B,.s=(1-A,)(1-A s )-1 is uni- formly bounded in the norm for 0  S, t  I; (iii) I:j;J IIB'J+l"o -B'i"oll N, where N is inde- pendent of the partition (0=t O <t 1 <... <t n = I); (iv) B,.o is weakly differentiable with respect to t such that the differentiated operator aB"ojat is strongly continuous in t. Under these assumptions, we can prove that for XoE D(A o ), the limit V(t, O)xo = s-1im k _ oo Vk(t, O)x o , 
378 I Semigroups of Operators, Evolution Equations with ( ( [ktJ ) ( [ktJ )) -1 I-'i(t,s)= 1- t-T A T ( 1 ( [kt J -1 )) -1 x I--A - k k I ( _ 1 ( [kS] + 1 )) -1 x ... x I I A k ( ( [kSJ + I ) ( [kSJ )) -1 X 1- --s A - k k (OstI), exists and gives the unique solu- tion of (J 1). If f(t) is continuously differenti- able, the right-hand side of (J2) exists and gives a unique solution to the inhomogeneous equation (10). I. The Evolution Equation of Parabolic Type Equation (10), for which every A, is the in- finitesimal generator of a holomorphic semi- group, is said to be of parabolic type by ana- logy to parabolic partial differential equations. Under weaker conditions, especially without the condition that D(A,) is independent of t, the existence of solutions of an equation of this type is obtained. Moreover, differentiability or analyticity of solutions follow from some natural assumptions. (i) Existence of weak solutions. Let X be a Hilbert space. For tE[O,lJ, let V, be a subspace and at the same time a Hilbert space with respect to a norm 111,111, stronger than 11.11. Since the form (A,x, y) is tsesquilinear (linear in x and antilinear in y), we get a sesquilinear functional a(t,',') on V, x V, such that a(t,x,y)= -(A,x,y), x,yED(A,), if D(A,) is dense in V, with respect to 111'111, and if I(A,x, y)1  Clllxlll,111 ylll" x, YED(A,). V" a(t, . , .) should be measurable in a certain sense. A solution x, of the equation (10) in [O,IJ satisfies f>(t,xr, v,)dt- J: (x" v;)dt = J: (f(t), v,)dt +(x o , vo) for any differentiable X -valued function v, such that V,E V" VI = 0, n Ilv,11 2 dt < 00, and n Ilv;11 2 dt < oc. A solution x, of (13) is called a weak solution of equation (10), though it does not necessarily satisfy (10). If the relation a(t, x,x)+ J,IIxI1 2 ?()(lllxlll" XE v" holds for some )., ()( > 0, a weak solution of (1 0) in the sense of (13) exists for a given Xo E X 1410 (Lions, 1961). In order to obtain the unique- ness or the differentiability of weak solutions, we need some additional conditions. (ii) Some properties of strong solutions. Let X be a Banach space. Let every semigroup {T}t)} generated by A, be holomorphic in a complex sector larg).1  0,0> 0, independent of t. Suppose one of the following conditions holds: (1) For some ()(, O<()( < I, D(A) is in- dependent of t and for 1 - ()( < f3 < 1, II(A-A:)Aoall C'lt-sIP, t,sE[O,lJ (P. E. Sobolevski"i, 1958-1961; Kato, 1961); (2) A,-1 is differentiable in t, IldA,-I/dt-dA s - 1 /dsll  C'lt-sIP for some C'>O, 0<f3 1, and II a . 1 11 N ct (A,-Ar 1If for every ).:largAI?n:/2-0 for some N, 0< ()« 1 (Kato and H. Tanabe, 1962). Then a differentiable evolution operator {V(t, s)} as- sociated with (11) exists. The most interesting property of evolution equations of parabolic type is the analyticity of solutions. If A, is holomorphic in t in a cer- tain sense, then the solutions are holomorphic (Tanabe, 1967; first noted by Komatsu, 1961). Furthermore, a characterization of evolution operators {V(t, s)} holomorphic in some com- plex neighborhood of [0, I] (called holomorphic evolution operators) is obtained by using the resolvent of A, (Kato and Tanabe, 1967; K. Masuda, 1972;  [8J). J. Application to Semilinear Evolution Equations (13) The evolution equation with a nonlinear addi- tive term f(t, x,):x;= A,x,+ f(t, x,) can be writ- ten as an inhomogeneous integral equation x, = V(t,O)x o + g V(t,s)f(s,xs)ds in the Banach space X, by means of the evolution operator {V(t, s)} introduced in Section G. The exis- tence, differentiability (Kato, H. Fujita, and Sobolevski"i, 1963-1966), and analyticity (Masuda, 1967) of solutions of the Navier- Stokes equation has been obtained by reduc- ing it to an integral equation of this type. Concerning quasilinear equations in which A, depends on x" the existence, differentia- bility, and analyticity of their solutions have been discussed. References [IJ E Hille and R. S. Phillips. Functional analysis and semi-groups, Amer. Math. Soc. Colloq. Pub!., 1957. 
1411 [2] N. Dunford and 1. T. Schwartz, Linear operators I, Interscience 1958. [3] 1. L. Lions, Equations differentielles oper- ationnelles et problemes aux limites, Springer, 1961. [4] K. Y osida, Functional analysis, Springer, 1980. [5] S. G. Krein, Linear differential equations in a Banach space, Amer. Math. Soc., 1971. (Original in Russian, 1967.) [6] P. L. Butzer and H. Berens, Semigroups of operators and applications, Springer, 1967. [7] R. W. Carroll, Abstract methods in partial differential equations, Harper & Row, 1969. [8] H. Tanabe, Equations of evolution, Pit- man, 1979. (Original in Japanese, 1975.) 379 (X.18) Series A. Convergence and Divergence of Infinite Series Let {an} (n = 1,2,3,...) be a sequence ofreal or complex numbers. Then the formal infinite sum a l + a 2 +... is called an infinite series (or series) and is denoted by :=1 an or  an' The number an is the nth term of the series an' and Sn = a 1 + a 2 + .. . + an is the nth partial sum ofan' Also, for a finite sequence a 1 ,a 2 , ...,a n , the sum a 1 +a 2 +... + an is called a series. To distinguish these two series, the latter is called a finite series. In this article, series means an infinite series. If the sequence of partial sums {sn} tconverges to s, we say that the series  an converges or is convergent to the sum sand write :=1 an=s or an=s. If the sequence {sn} is not convergent, we say that the series diverges or is divergent. In particular, if {sn} is divergent to +00 (-00) or toscillating, we say that the series is properly divergent to +00 (-00) or oscillating, respectively. The notation  an is customarily used for both the sum s of the convergent series and the formal series, which mayor may not be con- vergent. When the series is convergent, the sum is sometimes called the Cauchy sum in contrast to the "summations" of series, which are not necessarily convergent ( Sections K ff.). Applying the tCauchy criterion for the convergence of a sequence, we see that a neces- sary and sufficient condition for  an to be convergent is that for any given 6>0, we can take N sufficiently large so that ISm-snl =la n +1 +... +aml <6 for all m, n such that m > n > N. Hence if  an 379C Series converges, the an --+0 as n--+ 00, but the converse is not always true. Elementary properties of the convergence of series are: (1) If  an and  b n converge to a, b, respectively, then (an + b n ) converges to a + b. (2) If  an converges to a, then  can converges to ca for any constant c. (3) Suppose that {b m } is a subsequence of {an} obtained by deleting a finite number of terms an from {an}' Then bm is convergent if and only if  an is convergent. (4) When a series  an converges to a and {b m } is a sequence such that b 1 =a 1 +a 2 + ... +a", b 2 =a,,+1 +a,,+2+'" +a'2' b 3 =a'2+1 + ...+ a", ..., then  b m also converges to a. The converse, however, is not always true. For example, 1 - 1 + 1 - 1 + ... is oscillating, but (1-1)+(1-1)+... =0. B. Series of Positive Terms Suppose that  an is a series of positive (or nonnegative) terms. Since its partial sums Sn form a tmonotonically increasing sequence, the series is convergent if and only if {sn} is bounded. For example, the series 1 n- P (p > 0) converges if p > 1 because Sn < 2P-1 1 (2P-1 -1), whereas it diverges if p 1 because S2m+1> 1 +(m+ 1)/2. The geometric series :=1 a n - 1 (a >0) converges for a < 1 because Sn =(I-a n )/(1-a), and diverges for a  1 be- cause Sn  n. Some criteria for the convergence of a series an of nonnegative terms are: (1) If {an} is monotone decreasing, then the series  an and  2 v a r have the same convergence behavior (Cauchy's condensation test). (2) Suppose that f(x) is a positive monotone decreasing func- tion defined for x  1 such that f(n) = an (n = 1,2, ...). Then the series  an and the inte- gral Sf f(x)dx have the same convergence behavior (Cauchy's integral test), for example, n-P (p>O) and Sf x-pdx. (3) If for any posi- tive constant k we have an  kb n except for a finite number of n, then the convergence of  b n implies the convergence of  an' If kb n  an and  b n diverges, then  an also diverges (comparison test). (4) Let an>O and bn>O. If an+1lanbn+r!bn except for a finite number of values of n and  b n converges, then  an also converges; if an+r!an b n +1lb n and  b n di- verges, then  an also diverges ( Appendix A, Table 10). C. Absolute Convergence and Conditional Convergence A series  an (with real or complex terms an) is called absolutely convergent if the series  I an I is convergent. If a convergent series is not 
379D Series absolutely convergent, then it is ca1!ed con- ditionally convergent. An absolutely convergent series is convergent. A real series  an whose terms have alternating signs is ca1!ed an alter- nating series. An alternating series  an is convergent if the absolute values of terms lanl form a monotone decreasing sequence which converges to zero (Leibniz's test). An abso- lutely convergent series remains absolutely convergent under every rearrangement of terms and retains its sum under the rearrange- ment (Dirichlet's theorem). If a series with real terms is conditionaIly convergent, then it is possible to rearrange its terms so that the rearranged series converges to any given num- ber, diverges to +00 (or -00), or is osci11ating (Riemann's theorem). A convergent series whose convergence behavior is unaffected by rearrangement and whose sum remains un- changed is ca1!ed unconditionally convergent (or commutatively convergent). A real or com- plex series is unconditiona1!y convergent if and only if it is absolutely convergent. The notion of infinite series can be extended to any com- plete tnormed linear space, and absolute con- vergence can be defined by replacing the ab- solute values of the terms by the norm of the terms. However, in general, unconditional convergence is not always equivalent to ab- solute convergence. D. Abel's Partial Summation Let {aO,aj,a2''''} and {b o , b 1 , b 2 ,...} be arbitrary sequences, and put An=ao+a J + ... + an for n? O. Then the following formula of Abel's partial summation hOlds: n+k n+k L avb v = L Av(b v -bv+rJ- A n b n+1 v=n+l v=n+l + A n +k b n+k-1 for any n ? 0 and any k? 1; this formula also hOlds for n= -1 if we put A_I =0. Abel's partial summation enables us to deduce a number of tests of convergence for series of the form  anb n . In particular, the fo1!owing criteria are easy to apply: (1) anbn is convergent ifan is convergent and if the sequence {b n } is monotone and bounded (Abel's test). (2) anbn is convergent if the sequence {sn} of partial sums of  an is bounded and if {b n } is monotone and converges to zero (Dirichlet's test). (3) anbn is convergent if(bn-bn+1) is ab- solutely convergent and if  an is (at least conditionally) convergent (test of du Bois- Reymond and Dedekind). For example, criterion (2) implies that if {b n } 1412 is monotone and converges to zero, then the power series  bnz n of a complex variable z is convergent on the unit circle Izi = 1 except at most for z = 1; the case z = -1 gives Leibniz's test for alternating series ( Section C). E. Double Series A sequence with two indices, i.e., a mapping from the Cartesian product N x N of two copies of the set of natural numbers N to a subset of the real or complex numbers, is ca1!ed a double sequence and is denoted by {a mn } or {a m . n }. If there exists a number I such that for any positive /; there is a natural num- ber N(/;) satisfying la mn -II < /; for all m> N(f,) and n>N(/;), then we say that the sequence {a mn } has a limit I and write lim m _ oc .n-oo a mn = I. This limit should not be confused with re- peated limits such as lim n _ oo (Iim m _ oo a mn ). If 1im m _ oo a mn = C(n uniformly in nand lim n _ oo C(n = I, then lim m _ oo . n _ oo a mn = I. For a given double sequence {a mn }, the formal series .n' a mn is ca1!ed a double series and is some- times denoted by amn' In contrast with double series, the ordinary series discussed previously is caIled a simple series. Given a double series  a mn , when the double sequence of partial sums Smn = i:'=1 7=1 a kl is convergent to s, then  a mn is said to be convergent to the sum s. On the other hand, if Smn is not convergent,  a mn is said to be divergent. If ;;o1 a mn converges to b m for each m, then =1 b m = =1 (;;o=1 (Imn) is caIled the repeated (or iterated) series by rows. If =1 a mn converges to C n for each n, then ;;o=1 C n = ;;o=1 (1 a mn ) is caIled the re- peated (or iterated) series by columns. Even if two repeated series by rows and columns are convergent, the two sums are not always iden- tical, and  a mn is not always convergent. However, if the double series  a mn is conver- gent and n a mn is convergent for each m, then the repeated series by rows is convergent to the same sum. A similar statement is valid for the repeated series by columns. Suppose that we are given a double series  a mn of nonnegati ve terms. If anyone of m.namn' mnamn, and nmamn is conver- gent, the other two converge to the same sum. If the diagonal partial sum Smm = i:'1 r;1 a kl converges to a, then the double series  a mn also converges to a. If  I a mn I converges, the double series  a mn is ca1!ed absolutely convergent, whereas if  a mn converges but not absolutely, then amn is ca1!ed conditionally convergent. If  a mn is absolutely convergent, then any series ob- tained from  a mn by arranging the terms in an arbitrary order is convergent to the same sum. 
1413 F. Multiplication of Series The series 1 c., where e.=a 1 b. + a2b'-1 + .. . + a. b 1> is called the Cauchy product of two series :'1 a. and :'=1 b.. (1) Let  a. and  b. be two convergent series and A, B be the sums of these series. If their Cauchy product  e. is also convergent, then it has the sum C = AB (Abel's theorem). (2) If at least one of the two convergent series  a. and  b. with the sums A, B, respectively, is absolutely conver- gent, then their Cauchy product  e. is also convergent and has the sum C = AB (Mertens's theorem). (3) If  a. and  b. are absolutely convergent, then their Cauchy product  e. is absolutely convergent (Cauchy's theorem). (4) Let  a. and  b. be two convergent series with the sums A, B, respectively. If {na.} and {nb.} are bounded from below, then e. is convergent and has the sum C = AB (Hardy's theorem). G. Infinite Product Let {a.} be a given sequence with terms a.oI- o (n = 1, 2, ...). The formal infinite product a 1 'a 2 .a 3 .... is denoted by IIl a.. We call P. =a 1 . a 2 . .... a. its nth partial product. If the sequence {P.} is convergent to a nonzero limit p, then this infinite product is said to converge to p, and p is called the value of the infinite product. We write II a. = p. If {P.} is not con- vergent or is convergent to 0, then the infinite product is called divergent. Sometimes we consider the infinite product II a. with a. = 0 for a finite number of n's; and then by conver- gence or divergence of II a. we mean that of the infinite product IIa, where the sequence {a} is obtained by deleting zero terms from {a.}. Usually we do not treat an infinite prod- uct with a. = 0 for an infinite number of n's. A necessary and sufficient condition for IIl an to be convergent is that for any posi- tive /; there is a number N such that IPm/P.- 11 < /; for all n, m> N. If II a. converges, then a. --+ 1, but the converse is not always true. It is often convenient to write an infinite product as II(1 + a.). Then II(1 + a.) and  log(1 + a.) have the same convergence be- havior, where the imaginary part iO of the logarithm is assumed to satisfy 0,,;; 101 <no If a.O, convergence ofII(1 + a.) implies con- vergence of  a.. and vice versa. If II (1 + la.D converges, then II(1 + a.) is said to be absolutely convergent. An absolutely convergent infinite product is also convergent, and the value of the infinite product is un- changed by the alteration of the order of terms. 379 I Series H. Termwise Differentiation of Infinite Series with Function Terms Uniform convergence of an infinite series f.(x) is defined by uniform convergence of the sequence of the partial sums =dk(X) ( 435 Uniform Convergence). If the infinite series  f.(x) defined on an interval I of the real line is convergent at least at one point of I and f:(x) is convergent uniformly in I when the derivatives f:(x) exist, then f.(x) is con- vergent to f(x) uniformly in I, and f.(x) is termwise differentiable, that is, f' (x) =  f: (x). If the <P.(z) are holomorphic in a complex domain D and  <P.(z) converges to <p(z) uni- formly on every compact subset of D, then  <p(z) also converges to <p'(z) uniformly on every compact subset of D (Weierstrass's theorem of double series). (For term wise inte- gration  216 Integral Calculus.) I. Numerical Evaluation of Series In some special cases, we can express the nth partial sum s. of a series  a. as a well-known function of n. Specifically, if  a. is an arith- metic progression =l(a+(k-l)d) or a geo- metric progression =1 aqk-l, we have n s'="2(2a+(n-1)d), a(q'-I) Sn= , q-l respectively. If Iql < 1, then oaq' converges to a/(I-q). If B'+1(x) is the (r+ l)st tBer- noulli polynomial, then s. = l' + 2' + ... + n' = [B'+1(x)]'i+1/(r+ 1). This sum was studied by 1. Bernoulli, who gave formulas up to r= 10 in his Ars eonjeetandi. In the series  uO' if we can find another sequence {v.} such that u.=v.-v._ 1 , then s. =U 1 +u 2 + ... +u.=v.-v o . For example, ifu. =n(n+ 1)(n+2), then v.=n(n+ l)(n + 2)(n + 3)/4 and s. = v. because V o = 0 ( 104 Dif- ference Equations). Series with trigonometric function terms are calculated analogously. There are cases where the sum  a. itself can be expressed in a satisfactory form although we cannot find an appropriately simple ex- pression for each partial sum s.. For example, (r) = =11/m' can be represented by tBer- noulli numbers ifr is even. In particular, (2)= n 2 /6, (4)=n 4 /90 ( Appendix A, Table 10). If an infinite series converges rapidly, we can get a good approximation by taking a suitable partial sum. On the other hand, if the series converges less rapidly, an effective means for evaluating series is afforded by transformation of series. If the kth tdifference is exactly zero, then  ( n+ 1 ) i-1 s.= L... .  u 1 . ;=1 I 
379J Series Since the absolute value of finite differences often decreases rapidly, it is sometimes con- venient to consider the series whose terms are the differences of the original series. One finite difference method is Euler's transformation of infinite series. In particular, the formula 00 c< Ana I (-lta k = I n+1 0 kO nO 2 is useful for numerical calculation of sums of slowly converging alternating series. In numer- ical calculation of the series, we usually start calculating the numerical values of the first few terms; we then apply such transformations as Euler's to the remainder, and calculate the partial sums of the transformed series. When we calculate the sum of an infinite series approximately, we must estimate the error, i.e., the remainder that must be added to yield the sum of the series itself. We can esti- mate the maximum error by derivatives or differences of higher orders. We also have the transformations of Markov and Kummer. In the former, every term of the series is repre- sented by convergent series, and in the latter, the given series is reduced by subtracting an- other convergent series, which has a known sum and similar terms [1]. J. Infinite Series and Integrals In numerical calculation of functions, we sometimes use the Euler-Maclaurin formula [5]: 1 f x+w f(x+w)=- f(z)dz w x +, ( !' Br()A,J('-I)(X)) +Rm' f 1 Bm(-z) Rm= -w pm)(x+wz)dz, o m! where - { Bm(-Z+ 1), Bm(-z)= Bm(-z), <z, ?z. The speed of the convergence for this formula is greater than that for Taylor's expansion when w is large, since the terms of the for- mula are tBernoulli polynomials B,@ in 0:( :( 1. We also have Boole's formula, with tEuler polynomials as its terms [4]. The for- mulas discussed in this section are also used to calculated approximately the partial sums of infinite series. Another method of evaluating sums of infinite series analytically entails transforming infinite series to definite integrals using the tresidue theorem. If an analytic function f(z) is 1414 holomorphic except at poles an (n ,= 1,2,..., k) in a domain bounded by the closed curve C and containing the points z = m (m = 1,2, . .. , N), then N 1 r m/(m)= 2ni Jc n(cotnz)f(z)dz k - I Res [n(cot nz)f(z)Jza" n=l When the left-hand side of this equation is replaced by N I (-I)mf(m), m=l we replace cotnz by cosecnz. Res[F(z)Jza is the tresidue of F(z) at z = a. The line integral along C is often calculated easily by choosing a suitable deformation of C. Sometimes it can be shown immediately that the integral along C is zero, or its asymptotic value can be evaluated by the tmethod of steepest descent. K. History of the Study of Divergent Series Mathematicians in the 18th century did not concern themselves with the questIon of whether tseries were tconvergent or tdivergent. This indiscriminateness led to various con- tradictions. In 1821 an exact definition of the notion of convergence of series (Section A) was given by A. L. Cauchy; since then, mathema- ticians have mostly concerned themselves with convergent series. However, since divergent series appeared in many problems in analysis, the study of such series could not be neglected, and it became desirable to give a suitable definition of their sum. Although some results were given by L. Euler, N. Abel, and others, it was during the lattcr part of the 19th century that methods of summation of divergent series were studied systematically. This study consti- tuted a new branch of mathematics. In the foIl owing sections, some important methods of summation of divergent series are mentioned. Cesaro's method (- Section M) was the forerunner of the theory whose general foundation is now the theory of linear transformations. L. Linear Transformations For a sequence {sn} (n=O, 1,2,...) of real or complex numbers, assume that (In = 1:;':',0 anis i converges for n=O, 1,2, ..., where laik) is a given matrix (i, k = 0, 1,2, ...). The mapping T: {sn} --> {(In} is called a linear transformation, and {(In} is called the transform of {sn} under T. If the matrix satisfies a ik = 0 (k> i), then Tis defined for any sequence {sn} and T is said to 
1415 be triangular. If the transform {un} under Tis defined and convergent whenever {sn} con- verges, then T is caned a semiregular transfor- mation. If in addition {un} has the same limit as {sn}, then T is caned a regular transforma- tion. If for any bounded sequence {sn} the transform {un} is defined and convergent, then T is caned a normal transformation. If T is tri- angular and the transform {un} of {sn} under T is divergent to 00 whenever Sn --+ 00, then T is caned a totally regular transformation. Let T be a regular transformation. If for at least one divergent sequence {sn} the transform {un} of {sn} under T converges, then T is caned a method of summation. The limit S of {un} is caned the sum of {sn} under the method T of summation, and {sn} is said to be T-summable to s. For a given method of summation T1' let D( T 1 ) be the set of sequences that are T 1 - summable. If D(T 1 )=D(T 2 ), then the methods T 1 and T 2 are caned equivalent. If D(T 1 ) C D(T 2 ), then we say that T 1 is weaker than T 2 and T 2 is stronger than T 1 . If D(Td cf. D(T 2 ) and D(T 1 ) 1> D(T 2 ), then T 1 and T 2 are caned mutu- aUy noncomparable. The fonowing theorems on linear transformations of sequences are important: (1) Kojima-Schur theorem. In order that T be semiregular it is necessary and sufficient that (i) limnoo a nk exist for each k; (ii) t n = k=0 lank I exist and {t n } be bounded; and (iii) limnoo k=0 a nk exist. In that case, we have 00 L llim ankl< 00 k=O n-+oo and 00 lim Un = lim L anks k "-+00 n-+oo k=O = ( lim Sn )( lim f a nk ) n-+oo n-+oo k=O + f ( lim ank ) ( Sk -lim Sn ) . k=O n-+oo "-+co In particular, in order that {un} converge whenever sn--+O it is necessary and sufficient that conditions (i) and (ii) be satisfied. In that case, 00 lim Un = lim L anks k n-+oo n--H:() k=O = f ( lim ank ) Sk k=O n-+oo (I. Schur, J. Reine Angew, Math., 151 (1921); T. Kojima, T6hoku Math. J., 12 (1917)). (2) Toeplitz's theorem. In order that T be regular it is necessary and sufficient that (i') limnoo a nk = 0 for each k; (ii); and (iii') limnoo k=0 a nk = 1 (0. Toeplitz, Prace Mat.- Fiz., 22 (1914)). In particular, (ii) and (i") 379M Series limnoo k=Kank = 1 for each K imply that Tis regular (Perron's theorem). (3) Schur's theorem. In order that T be normal it is necessary and sufficient that (i); (ii); (iii); and (iv) for any B > 0 there exist a K > 0 such that k=K+1lankl <B for each n. (4) In order that the regular triangular trans- formation T be totany regular, it is necessary and sufficient that a nk  0 except for a finite number of k. M. Cesaro's Method of Summation We write 00 (l-x)-a-l = L Axn, n=O ( n+a ) n" A= n  r(a+1) ' 00 00 (l-X)-"-l L u n x n =(l-x)-" L sn xn n=O n=O 00 = " sa. x " L... n , n=O where Sn = 7=0 U i . Thus the series  U i is as- sociated with the sequence {s:}. If u")=s:/A converges to S as n--+oo, then we say that  Un is summable by Cesaro's method of order a (or simply (C, a)-summable) to S and write :'=o un = S (C, a). This method of summation is caned Cesaro's method of summation of order a (or simply (C, a)-summation). It is natural to consider (C, a)-summa- tion for a> -1. We say that  Un is (C, -1)- summable ifun converges and nu n =o(l). Here :'=o un =s (C,O) means limnoo Sn = s, and :'=1 Un = S (C, 1) means S = limnoo (so + S 1 + ... +Sn-1)/n. Generany, we have the fonowing resul ts: (1) A is increasing if a > 0 and decreasing if 0> a > -1. A = 1 and A > 0 if a > -1. (2) s:+P+1 = =o A_kS:, A- A-l = A-\ S;-S:_1 =S;-1 (3) (C,a) (aO) is regular, and D(C,a)=> D(C,fJ)ifa>fJ> -1. (4) If  un = S (C, a), then un = o(n"). More- over, ifu=s' (C,a), then (un+u'n)=s+ s' (C,a) and AUn=AS (C,a) for any number A. (5) If  Un = S (C, a) and  u = s' (C, 13), then their tCauchy product  V n = Ss' (C, a + 13 + 1) (Chapman's theorem). Moreover, if =o Als:-=-:-l(U_k)l/ A = 0(1), then  V n = Ss' (C, 13) (T. Kojima). If a', 13' > -1, s"')(un) = O(n"'), and s')(u) = O(n P '), then  V n = Ss' (C, a' + 13' + 2) (G. Doetsch). (6) For any integer a> 0, in order that  Un = S (C, a) it is necessary and sufficient that there exist {v n } such that un=(n+ l)(v n -v n +1) and vn=s (C,a-1) (G. H. Hardy, Proc. London Math. Soc., (2) 8 (1910)). This condi- 
379N Series tion is equivalent to (i)-(iii) together: (i) the series r.f=ns:2/(k+ 1)...(k+IX) converges to the limit b n ; (ii) b n =o(l) as n-->oo; and (iii) (s:I/A-I)+(n+IX)r(IX)bn+l-->s as n-->oo. (7) If L Un = S (C, IX) (IX> 0), one of the fo11ow- ing five conditions is sufficient for the conver- gence of L Un (this is a kind of tTauberian theorem): (i) nu n =o(I); (ii) tn=L1 vuv=o(n); (iii) LnPlunlP+l < 00 (p:;:'O); (iv) nUn> -K (K is independent of n); (v) 1im inf(sm - sn):;:' 0 as m> n--> 00, m/n--> 1 (R. Schmidt's condition). (8) IfIX'>IX> -1 and LUn=S(C,IX'), then LU n =s (C,IX+e) for any e>O. For a given series L Un, we write H for Sn' H; for the arithmetic mean of {H}, and H?; for the arithmetic mean of {Hn. Similarly, we can define {Hn for any integer p. If H%-->s as n-->oo, then LU n is said to be summable by HOlder's method of order p (or (H, p)- summable) to s, and we write Lun=s(H,p). For any integer p:;:,O, (H,p)-summabi1ity is equivalent to (C, p)-summability (Knopp- Schnee theorem). N. Abel's Method of Summation If the radius of convergence of the power series L.oUnrn is:;:, 1 and L.oUnrn-->s as r-->l, then L. Un is said to be summable by Abel's method (or A-summable) to s, and we write L.Un=s (A). The transformation matrix is denoted by A, and the transformation is ca11ed Abel's method of summation. (1) If L. Un = S (A), then lim SUPnro I u n I 1 / n :( 1. (2) IfL.un=s (A) and L.U=s' (A), then L.(u n + u)s + s' (A) and L. A.U n = A.s (A) for any constant A.. Moreover, L.::"k+l u n =s-U O -u 1 -... -Uk (A). (3) If L. Un =s (A) and L.U = s' (A), then the Cauchy product is L. Un = ss' (A). (4) If L. Un = S (A) and one of the fo11owing five conditions is satisfied, then L. Un = s: (i) nUn = 0(1); (ii) t n = L.1 vuv=o(n); (iii) nu n =O(I); (iv) nUn> -M; (v) 1iminf(sm -sn):;:'O as m > n--> 00 and m/n--> 1. These theorems are tTauberian, in the original form proved by Tauber (Monatsh. Math., 8 (1897)). (5) The matrix A is regular, and D(C, IX) c D(A) for any IX> -1. (6) If L. Un = S (A) and sn:;:' 0, then L. Un = S (C, 1). Moreover, if a") = 0(1), then L. Un = S (C, IX + e) for IX> -1 and e>O. O. Borel's Method of Summation If for a given series L.U n , 00 S x n u(x)= L  nO n! is convergent for all x, and u(x)/e x -->S as x --> 00, then L. Un is said to be summable by 1416 Borel's exponential method to the sum s, and we write r. un = S (B). The transformation thus determined is denoted by B and is ca11ed Borel's method of summation. If tro u(x)exdx=s, then L. Un is said to be summable by Borel's integral method (or -summable) to s, and we write L.Un=s (). Then we have: (1) B is regu- lar and D(C,IX)cD(B) (IX> -1), while D(C, IX) and D() are noncom parable. (2) If the radius of convergence of L.oUnXn is :;:, 1 and L.U n = s (B), then L.Un=s (A). (3) IfL.::"=k+1 Un=S (B) (resp. ()), then L.U n =s+U O +u 1 +... +udB) (resp. ()), but the converse is not always true. (4) L.Un=s (B) implies lu n I 1 / n =0(n). (5) IfL.u n = s (B) and L.U=s' (B), then L.(un+u)=s+s' (B), and L. A.U n = A.S (B) for any constant A.. The same is true for summation (). (6) If L. Un = S (B) and if one of the fo11owing two conditions is satisfied, then L.Un=s: (i)  u n =o(I); (ii) 1iminf(sm-sn):;:'O as m>n-->oo and (m- n)-->O. (7) if L. Un =s (B) and s:=f = o(n"-l J2 ), then L.Un=s (C,IX) (IX:;:'O). (8) IfL.un=s (A) and u(t» -Mt- 1 expt, then L.Un=s (B). (9) If the sequences {nd and {nk'} satisfy nk+1 >nk', n k . / nk > 1 + e (k = 1, 2, ... ; e > 0), U v = 0 (nk < v :( n k .), and L. Un = S (B), then sn. --> s as k --> 00. If L.U n = s () and tro I d::\X) le-xdx converges for all A. = 0, 1,2, ... , then L. Un is said to be absolute Borel summable (or II- summable). Concerning this we have: (1) If L.I Un I converges, then L. Un is I  I-summable, but even if L. Un converges, L. Un is not always II-summable. If L. Un is II-summable, then L. Un is -summable to s. In this case, we write L.Un=s (II). (2) L.::"oun=s (II) implies L.::"k+1 Un =s-(UO+u i +", +uk)(II). (3) If L. Un = S (B), L. U = s' (B), and if at least one of them is II-summable, then their Cauchy product is L.Vn=ss' (II). P. Euler's Method of Summation In the series L. Un, if {( k+l ) ( k+1 ) 1 so+ 2 SI+'" + ( k+1 ) S } 2-<k+1) k+l k (sn = L.o Uk) converges to s as k --> 00, then L. Un is said to be summable by Euler's method, and we write L Un = S (E). The transformation thus obtained is ca11ed Euler's method of sum- mation. A necessary and sufficient condition 
1417 for  Un = S (E) is that  V n = s, where v. = 2-('+1)I:=O(:)Uk. This summation method is regular. We have  u. = s if  u. = s (E) and if one of the fo11owing two conditions is satis- fied: (i);;; u.=O(l); (ii) liminf(sm-s.)O for m> n --> 00, (m - n)/ ;;; --> 1. Cesaro and Euler summations are noncom parable. As an ex- tension of Euler's method, the Euler method of summation of pth order is also defined (e.g.,  [6]). Q. NorIund's Method of Summation For a positive sequence {Pn}, let Pn = =oPv--> 00 as n--> 00 and Pn/ Pn-->O as n--> 00. If Ct skPn-k)! Pn = Ct UkPn-k)! Pn converges to s as n --> 00, then  Un is said to be summable by NorIund's method of type {Pn}, and we write  Un = S (N, {P.}). The transfor- mation thus obtained is also regular and is ca11ed NorIund's method of summation. If un=s (C, 1) and 0';:;;PO';:;;P1';:;;"" then un= s (N, {Pn})' Cesaro's method is actua11y a special case of this method. R. M. Riesz's Method of Summation Let {An} be a sequence with increasing terms and tending to +00 as n-->oo. If R(A n ,k,7:)= ( L (7:-A n )k U . )! 7: k A.r/<t converges to s as 7:-->00, then u. is said to be summable by Riesz's method of order k and type A., and we write  u. = s (R, A., k). The transformation thus obtained is regular and is cal1ed Riesz's method of summation of the kth order. In particular, if An=n, then D(R,A.,k)= D(C, k). S. Riemann's Metbod of Summation If 00 ( sin nh ) k L u. - h ' .l n UO=O, converges for h>O and tends to s as h-->O, then  Un is said to be (R, k)-summable to s. When k= 1, this method, often ca11ed Lebesgue's method of summation, is not regular. When k = 2, it is ordinarily ca11ed Riemann's method of summation and is regular. Corresponding to these cases, if 2" sin nh 2" ( sin nh ) Z :; L... Sn ----;:;- --> s or rch L... Sn ----;:;- --> S 380 A Set Functions as h -->0, then  Un is caned (R l)-summable or (Rz)-summable to s, respectively. The summa- tion method (R 1 ) is not regular, while (Rz) is regular. If  Un is (R 1 )-summable, then it is also (Rz)-summable, but (R, 2) and (Rz) are noncomparable. Other methods of summation were devel- oped by G. H. Hardy and 1. E. Littlewood, E. Le Roy, C. 1. de La Vanee Poussin, and others (e.g.,  [6]). For related topics  121 Dirichlet Series, 159 Fourier Series, 339 Power Series. References [1] K. Knopp, Theorie und Anwendung der unendlichen Reihen, Springer, 1921; fifth edi- tion, 1964; English translation from the second edition, Theory and application of infinite series, Blackie, 1928. [2] T. l. I'A. Bromwich, An introduction to the theory of infinite series, Macmil1an, second edition, 1926. For numerical computation of series, [3] L. B. W. loney, Summation of series, Chapman & Hal!, 1925. [4] N. E. NOrlund, Vorlesungen iiber Dif- ferenzenrechnung, Springer, 1924. For the Euler-Maclaurin formula, [5] N. Bourbaki, Elements de mathematique, Fonctions d'une variable reele, ch. 6, Actua1ites Sci. Ind., 1132a, Hermann, second edition, 1961. For the summation of divergent series and Tauberian theorems, [6] G. H. Hardy, Divergent series, Clarendon Press, 1949. [7] R. G. Cooke, Infinite matrices and se- quence spaces, Macmil1an, 1950. [8] K. Ze11er, Theorie der Limitierungsverfah- ren, Erg. Math., Springer, 1958; second edition, 1970. [9] K. Chandrasekharan and S. Minakshisun- daram, Typical means, Oxford Univ. Press, 1952. [10] H. R. Pitt, Tauberian theorems, Oxford Univ. Press, 1958. 380 (X.15) Set Functions A. General Remarks A tfunction whose domain is a tfamily of sets is cal!ed a set function. Usua11y we consider set functions that take real values or ::1:00. For example, if J(x) is a real-valued function 
380 B Set Functions defined on a set X, and if we assign to each subset A of X values such as SUPAI, infAI, or sUPAi - infAI, then we obtain a corresponding set function. In particular, a set function whose domain is the family of left open intervals in Rm is called an interval function. To distinguish between set functions and ordinary functions defined at each point of a set, we ca1! the latter point functions. For example, if f(x) is an tinte- grable (point) function with R ( = R 1) as its domain and we put F(/) = S:.f(x)dx for 1= (a, b], then we obtain an interval function F on R. B. Finitely Additive Set Functions Let <1>(E) be a real-valued set function defined on a tfinitely additive class  in a space X. If <1> satisfies the finite additivity condition: EI,E2E, EI nE 2 = 0 imply <1>(E I U E 2 ) = <1>(E I )+ <1> (E 2 ), then <1>( E) is ca1!ed a finitely additive set function on. For each EE we denote sup{<1>(A)IA c E, A E} (inf{<1>(A)IA c E, A E }) by 17(<1>; E) (1:'"(<1>; E)), the upper (lower) vari- ation of <1>. Since <1>(0) = 0, we have 1:'"(<1>; E):( 0:( 17(<1>; E). V(<1>; E) = 17(<1>; E)+ 11:'"(<1>; E)I is called the total variation of <1> on E. When we deal with a fixed <1>, instead of 17(<1>; E), 1:'"(<1>; E), V(<1>; E) we write simply V(E), 1:'"(E), V(E). If V(<1>; E) is bounded, then <1> is said to be of bounded variation. If <1>( E):;:' ° ( :( 0) for every EE, i.e., EcE' implies <1>(E):(<1>(E') (<1>(E):;:,<1>(E')), then <1> is said to be monotone increasing (decreasing). Every finitely addi- tive set function of bounded variation can be represented as the difference of two monotone increasing finitely additive set functions. Let 10 be a fixed interval in Rm and F(I) be an interval function defined for left open inter- vals Ie 1 0 , where 0 is considered as a degener- ate left open interval. If, for any two left open intervals II, 1 2 such that II U 1 2 is an interval and II n 1 2 = 0, we have F(ll U 1 2 )=F(lI)+ F(l2), then we ca1! F(l) an additive interval function in 10' Specifically, if f(x) is a real- valued bounded function on Rand D is an interval function determined by D(l)= f(b) - {(a), where I = [a, b) (i.e., D(l) is the incre- ment of f(x)), then D is an additive interval function on R, called the increment function of f For a given f the increment function is determined uniquely. Conversely, for a given D, a function f such that D is its increment function is determined uniquely up to an addi- tive constant. In this sense an additive interval function in R may be identified with the corre- sponding point function on R. Let 9\(/0) be the finitely additive class of a1! 1418 finite unions R ofleft open intervals in 10' Then any additive interval function F(/) can be extended to a finitely additive set function F(R) defined on 9\(/0)' For the rest of this article, it is understood that an additive inter- val function means this extended set function. If for any e > ° there exists a 15 > ° such that III <15 implies F(/)<e (where III is the vol- ume of the interval I), then we say that F is continuous. C. Completely Additive Set Functions Let <1>(E) be a real-valued set function defined on a tcompletely additive class  in a space X. If <1> satisfies the complete additivity condition: Ej,EkE, E j nE k =0 U#k) imply <1>C E j ) = j <1>(E), then <1>(E) is called a completely additive set function (or simply additive set function) on . In this case the corresponding upper variation V(E), lower variation 1:'"(E), and total variation V(E) are all completely additive set functions, and for every EE we have <1>(E) = V(E) + 1:'"(E) (Jordan decomposition). Furthermore, V(E)= sup LJI I <1> (E j ) I, where the supremum is taken over a1! decompositions of E such that E = UJI Ej (EjE, Ejn E k = 0,j#k). The com- pletely additive nonnegative set functions are the same as the finite measures. Hence the Jordan decomposition implies that every com- pletely additive set function is represented as the difference of two finite measures. A com- pletely additive set function is also ca1!ed a signed measure. Any continuous additive interval function of bounded variation can be extended to a completely additive set function. The notion of additive interval function of bounded variation is a generalization of that of function of bounded variation ( 166 Functions of Bounded Variation). Let <1> be a completely additive set func- tion and f.1 a finite or a-finite measure, both defined on. If f.1(E) =0 implies <1>(E) =0, then <1> is said to be absolutely continuous with respect to f.1 or f.1-absolutely continuous. Then <1> is f.1-absolutely continuous if and only if for any e > 0, there exists a 15 > ° such that f.1(E) < 15 implies 1<1>(E)1 < e. If for given <1> and f.1 there exists an EoE such that f.1(E o )=O and <1>(E) = <1>(E n Eo) for every E E, then <1> is said to be singular with respect to f.1 or f.1-singular. In a ta-finite measure space (X, ,f.1), every completely additive set function <1>(E) defined on  can be represented uniquely as the sum of a f.1-absolutely continuous set function and a 
1419 Wsingular set function (Lebesgue decompo- sition theorem). Also, <1>(E) is J1-absolute1y continuous if and only if <1>(E) can be repre- sented as the indefinite integral SEJ(x)dJ1 of a function J that is integrable on X with respect to J1 (Radon-Nikodym theorem). This function J(x) is called the Radon-Nikodym derivative, d<1>jdJ1, of <1> with respect to J1 ( 270 Measure Theory L (iii)). D. Differentiation of Set Functions Let m be the Lebesgue measure in Rm and E a Lebesgue measurable set. We denote sup(m(E)jm(Q)) for all cubes Q such that E c Q by r(E) and call it the parameter of regularity of E. If for a sequence of sets {En} there exists an rx such that r(En»rx>O, then {En} is called a regular sequence. If all the En contain a point P and the tdiameter of En tends to 0 as n--> 00, then we say that {En} converges to the point P. Let <1> be a set function in Rm. For a regular sequence {En} of closed sets converging to a point P, we put 1=limsup(<1>(E n )jm(E n )) and define the general upper derivative of <1> at P to be the least upper bound of I for all such se- quences {En}, denoted by D<1>(P). Similarly, the general lower derivative 1)<1>(P) of <1> at Pis defined to be the greatest lower bound of Iiminf(<1>(E n )jm(E n )) for all regular sequences {En} of closed sets converging to P. The or- dinary upper (lower) derivative, denoted by <Ii(E)(<<I>(E)), is defined in the same way by taking regular sequences of closed intervals instead of closed sets. D<1>, 1)<1>, <Ii, «I> are point func- tions derived from <1>. Clearly, Q<1>(P) «I>(P)  <Ii(P)D<1>(P). If i5<1>(P)=Q<1>(P), then we write it simply as D<1>(P). If D<1>(P) is finite, then we call it the general derivative of <1> at P and say that <1> is derivable in the general sense at P. If <Ii(P) = «I> (P), then we write it as <1>(P). If <1>(P) is finite, then we call it the ordinary derivative of <1> at P and say that <1> is derivable in the ordi- nary sense. We have the following theorems: (1) A completely additive set function is deriv- able in the general sense talmost everywhere (Lebesgue). The Radon-Nikodym derivative of a set function absolutely continuous with respect to the Lebesgue measure is equal almost everywhere to the generalized deriva- tive of the set function. (2) An additive interval function of bounded variation is derivable in the ordinary sense almost everywhere (Lebes- gue). (3) An additive interval function <1> is derivable in the ordinary sense at almost all points such that <Ii(P) < +00 or «I>(P) > -00. For the proof of these theorems, Vitali's covering theorem is essential: Let A be a given set and !y a family of measurable sets in Eu- clidean space. If for each x E A there is a 381 A Sets regular sequence of sets belonging to !Y that converges to x, then there exists a finite or countable set of disjoint En E!y such that m(A .....Uj1 E)=O. References [1] H. Lebesgue, Lel;ons sur I'integration et la recherche des fonctions primitives, Gauthier- Villars, 1904, second edition, 1928. [2] S. Saks, Theory of the integral, Stechert, 1937 (Dover, 1964). [3] W. Rudin, Real and complex analysis, McGraw-Hili, second edition, 1974. [4] H. L. Royden, Real analysis, Macmi1lan, second edition, 1963. 381 (11.1) Sets A. Definitions and Symbols G. Cantor defined a set as a collection of ob- jects of our intuition or thought, within a certain realm, taken as a whole. Each object in the collection is called an element (or member) of the set. The notation a E A (A=:w) means that a is an element of the set A. In this case we say that a is a member of A or a belongs to A. The negation of aEA (A==w) is written aA or aEA (Aa or A3a). The set having no element, namely the set A such that aA for every object a, is called the empty set (or null set) and is usually denoted by 0. Two sets A and B are identical, i.e., A = B, if every ele- ment of A belongs to B, and vice versa. The set containing a, b, C, ... as its elements is said to consist of a, b, c, ... and is denoted by {a, b, c, ...}. The symbol {x I C(x)} (or {x; C(x)}, sometimes Ex [C(x)]) denotes the set of all objects that have the property C(x). Thus {a} is the set whose only element is a, and {a, b} is the set with two elements a and b, provided that a#- b. A set is called a finite set or an infinite set according as the number of its elements is finite or infinite. A set A is a subset of a set B if each element of A is an element of B. In this case we also say that A is contained in B or that B contains A, and we write A c Band B::::> A. The negation of A c B (B::::>A) is A c;t B (B1>A). For every set A, 0 c A. A c Band Bee imply A c C. If A c Band Be A, then A = B. A is a proper subset of B (in symbols: A  B, B  A) if A c B and A#- B. Some authors use  () for c (::::> ), and c (::::» for  (*). 
381 B Sets B. Algebra of Sets The union (join or sum) of sets A and B, written AU B, is the set of all elements which belong either to A or to B or to both. The intersection (meet or product) of sets A and B, written An B, is the set of all elements which belong to both A and B. In other words, XE AU B if and only if xEA or XEB or both, and xEAnB if and only ifxEA and XEB. Given sets A, B, and C, A UB=BUA, A nB=BnA (commutative law); (A U B) U C = A U (B U C), (A n B)n C=A n(B n C) (associative law); A U(BnC)=(A UB)n(A UC)' A n(BU C)= (A n B) U (A n C) (distributive law); A U (A n B) = A, A n (A U B) = A (absorption law). Two sets A and B are disjoint if An B = 0. In this case the set C = A U B is said to be the disjoint union (or sum) of A and B, and is written sometimes as C = A + B. The set of elements of A which are not members of B is denoted by A - B, and is called the difference of A and B (or relative complement of B in A). If A ::::> B, A - B is called the complement (or complementary set) of B with respect to A. We often consider a theory in which we restrict our attention to elements and subsets of a certain fixed set Q, and call it the universal set of the theory. In geometic terms, Q is also called the space or the abstract space, elements of Q are called points, and subsets of Q point sets. If A is a subset of Q, Q - A is simply called the complement of A and is denoted A c. For A cQ and BcQ, A::::>B and ACcB C are equivalent. Furthermore, we have A U A C =Q, A nA C = 0, Acc=A; and (A n B)c=A c UBc, (A U B)C = A C n B C (de Morgan's law). The power set of a set X, written \.j3(X), is the set of all subsets of X. A set whose ele- ments are sets is often called a family of sets. The pair consisting of objects a and b is denoted by (a, b). Two pairs (a, b), (c, d) are defined to be equal if and only if a= c and b = d. A pair (a, b) is called an ordered pair, while the set {a, b} is sometimes called an unordered pair. Generally the n-tuple (a, b, c, ... ,d) of n given objects a, b, c, ..., d is defined to be «... (a, b), c), ...), d), so that (a,b,c, ...,d)=(a',b',c', ...,d') if and only if a= a', b= h',..., d=d'. The Cartesian product(or direct product) of sets A and B, written A x B, is the set of all pairs (a, b) such that aE A and bEB. A x B= 0 if and only if either A = 0 or B = 0; A x Be C x D if and only if A c C and BcD, provided that neither A nor B is empty. Furthermore, (A x B)U(A' x B)=(A UA') x B, (A x B)n(C x D)=(A nC) x(BnD). The subset {(a, a) I aE A} of Ax A is called the diagonal of A x A and is denoted by dA' 1420 Generally the Cartesian product (or direct product) of sets A, B, ... , D, written A x B x ... x D, is defined as the set {(a, b, . " d)laEA, bEB,...,dED}. C. Mappings If there exists a rule which assigns to each element of a set A an element of a set B, this rule is said to define a mapping (or simply map), function, or transformation from A into B. The term transformation is sometimes re- stricted to the case where A = B. Usually letters f, g, qJ, 1/1, ... stal}d for mappings. he expression f: A -->B (A-->B) means that f IS a function which maps A into B. If j : A -->B and aE A, then f(a) denotes the element of B which is assigned to a by f We call f(a) the image of a under f The notation f: a f->b (or f: a --> b) is often used to mean f(a) = b (but not in the present volumes). The domain of a mapping f: A --> B is the set A, and its range (or codo- main), written f(A), is the subset {f(a) I aE A} of B. Two functions f and 9 are equal (f = g) if their domains coincide and f(a) = g(a) for each a in the common domain. For a mappingf:A-->B and a set CE\.j3(A), f(C) is defined to be the set {.f(x) I X:E C}. This definition induces the mapping from \.j3(A) to \.j3(B) which is usually also denoted by f If A j E\.j3(A) (i= 1,2), then f(AI U A 2 )= f(AI)U .f(A 2 ) and f(A, n A 2 )cf(A I )nf(A 2 ). The inverse image of DE\.j3(B), denoted by J -I (D), is defined to be the set {XIXEA, f(X)ED}; thus the mapping r l :\.j3(B)-->\.j3(A) is defined. If B j E\.j3(B) (i= 1,2), then r 1 (B 1 UB2)=r1(BdU r 1 (B 2 ); r 1 (B I nB 2 )= r l (B 1 )nr l (B 2 ); f-l(B-B I )=A -.r'(B I ). Furthermore, Al c r l of (Ad and for1(BdcBI' A mapping 9 is an extension of a mapping f to a set A' if A' is the domain of 9 and contains the domain A of f, and if g(a)= f(a) for each a in A. In this case f is called a contraction (or restriction) of 9 to A or simply a partial map- ping of g, and is denoted by 9 I A. A mapping f is the constant mapping (or constant function) with the value b o if f(a)=b o for every a in the domain of f The identity mapping (or identity function) on A, often denoted by lA' is the mapping with the domain A such that f(a) = a for every a in A. Given two mappingsf:A-->B and g: B --> C, the mapping from A to C which assigns g(f(a)) to each aE A is called the com- posite of.f and 9 and is denoted by go f If.f: A -->B, g: B-->C, and h: C -->D, then (h og)o f = ho(gof) (associative law for composition of mappings). A mapping j : A --> B is from A onto B if f(A) = B. In this case f is also called a surjection (or a surjective mapping). A mapping f: A-->B 
1421 is one-to-one (1-1, or injective) if a #-a' implies f(a) #- f(a') for every pair of elements a and a' in A, that is, if for each b in the range of A, there exists only one element a of A such that f(a) = b. Such an f is also called an injection. In par- ticular, given a subset B of a set A, the injec- tion f: B  A defined by the condition f( b) = b for each bE B is called the inclusion map- ping (inclusion or canonical injection). A neces- sary and sufficient condition for f:AB to be a surjection is that go f = h 0 f imply 9 = h for every pair of mappings g:BC and h:BC. For f:AB to be an injection it is necessary and sufficient that fog = f 0 h imply 9 = h for every pair of mappings g: C  A and h: C  A. A mapping which is both a surjection and an injection is called a bijection (or bijective map- ping). Iff: A  B is a bijection, then the mapping from B to A which assigns to each element b of B the unique element a of A such that f(a) = b is called the inverse mapping (inverse function or simply inverse) of f, and is denoted by f-1. We have fof-t = 18 and f- 1 of= l A for every bijection f: A  B. If the domain A of a mapping f: A  B is the Cartesian product of AI and A 2 ,f(a)=b (where a=(a 1 ,a2)) is written as f(a t ,a2)=b. Given A=A1 x A 2 , B=B1 X B 2 , andkAiBi (i = 1,2), the mapping f: A  B defined by the condition f(a t , a 2 )=(ft (a t ),f2(a 2 )) is called the Cartesian product (or direct product) of the map- pings of f1 and f2' and is denoted by ft x f2' For a mappingf:AB, the subset G= {(a,f(a))laEA} of Ax B is called the graph of f The basic properties of the graph G of f are: (1) For every aE A there exists a bE B such that (a, b)E G. (2) (a, b)E G and (a, b')E G imply b=b'. Conversely, a subset G of Ax B with these two properties determines a mapping f: AB such that(a,'b)EG if and only if f(a) = b. All notions concerning a mapping f: A  B can be transferred by means of its graph to those concerning a subset of a Cartesian prod- uct A x B. Given sets A and B, we denote by B A the set of mappings from A to B. If a mapping is identified with its graph, B A is considered to be a subset of(A x B). For XE(A), the mapping cx:A{O, I} such that Cx(X) = 1 if XEX and Cx(X) =0 if xtX is called the charac- teristic function (or representing function) of X. By assigning to each X E (A) its characteristic function CxE{O, I}A, we obtain a one-to-one correspondence between (A) and {O, 1 }A; hence (A) is sometimes denoted by 2 A . D. Families of Sets A mapping from a set A to a set A is also called a family of elements of A indexed by A. A is its index set (or indexing set). In this case, 381 F Sets the image f(A) of AEA is denoted by a;., and the mapping itself is denoted by {a;.} ;'eA ({ a;.} (AE A), or simply {a;.}). In particular, if the set A is the power set of a set, the family {a;.} ;'EA is called a family of sets indexed by A, or simply a family of sets. (Moreover, if A is chosen to be a subset of the power set (X) of a set X and f to be the identity mapping on A, then the family of sets resulting from f can be identified with the set of subsets A itself.) The union U;.eAA;. of a family of sets {A;.} ;'EA is the set of all elements a such that aE A;. for at least one A in A. Their intersection n ;'EA A;. is the set of all elements a such that aEA for all A in A. A family of sets {A;'};'EA is mutually disjoint if A #- J1 implies A;. n A = 0. In this case A = U ;'EA A;. is called the disjoint union (or direct sum) of the sets of the family, and {A;.} ;'EA is called a partition (or decompo- sition) of A. For families of sets, the following hold: UA;.=U;.(UA;.), nA;.=n;.(nA;.) (AEA, J1EM;.) (associative law); (U;'EAA;.)n (U eM B) = U(;..)eA x M(A;. n B), (n;'EA A;.) U (nEMB)= n(;..)EAxM(A;. UB) (distributive law); (U;'EAA;.)C= n;'EA A 1, (n;'EA A ;.)" = U;.eAA1 (de Morgan's law). A family {A;.} ;'eA of sets is a covering of a set A, or covers A, if Ac U;.eAA;.. Given f:X  Y, {A;.} ;'EA and {B;.} ;'EA (where A;.cX and B;.c Y), then f(U;.eA A ;.) = U;.eAf(A;.), f(n;.eA A ;.) c n;'eAf(A;.); and r 1 (U;'EA B ;.)= U;'eAr 1 (B;.), rt(n;'eA B ;.)= n;'eArt(B;.). E. Direct Sum and Direct Product of Families of Sets Given a family {A;.} ;'EA of sets indexed by A, a set S, and a family of injections {i;.:A;,S};'eA' then the pair (S, {i;.};.eA) is called the direct sum of {A;'};'EA if {i;.(A;')};'EA is a partition of S. In this case, S is written ;'eAA;. (or ;.A;. or Il;.A;.). Each A;. is called a direct summand of S, and each i;. is called a canonical injection. The Cartesian product (or direct product) ILeA A;. (TI;.A;.) of {A;.};.eA (where A;. cX) is the set of all mappings from A to X such that f(A)EA;. for every AEA. The sets A;. are the direct factors of TI;.eA A;.. Each element f of TI;.eAA;. is denoted by {X;'};'EA or(..., X;., ...) (where X;. = f(A)). The element X;. is the Ath component (or coordinate) of f The mapping pr;.:TI;'EAA;.A;. which assigns X;. to each {x;.} ;'eA E TI;.eAA;. is called the projection of TI;.eAA;. onto its Ath component. If A= {I, 2}, TI;.eAA;. can be identified with AI x A 2 . F. Set Theory It was G. tCantor who introduced the concept of the set as an object of mathematical study. 
381 G Sets Cantor stated: "A set is a collection of definite, well-distinguished objects of our intuition or thought. These objects are called the elements of the set" (G. Cantor, Math. Ann., 46 (1895)). Cantor introduced the notions of tcardinal number and tordinal number and developed what is now known as set theory. He proved that the cardinal number of the set of tran- scendental numbers is greater than that of algebraic numbers, and that all Euclidean spaces have the same cardinal number regard- less of their dimension. He stated the tcon- tinuum hypothesis and also conjectured the twell-ordering theorem (G. Cantor, Math. Ann., 21 (1883)), which was proved by E. Zer- melo [2]. In this proof Zermelo stated the taxiom of choice explicitly for the first time, and used it in an essential way. Meanwhile it was pointed out that Cantor's naive set concept leads to various logical tparadoxes ( 319 Paradoxes). Since the set concept plays a fundamental role in every branch of mathematics, the discovery of the paradoxes had a serious impact upon math- ematics, and led to a systematic investigation of the Houndations of mathematics. In the course of attempts to a void paradoxes, set theory was reconstructed as taxiomatic set theory ( 33 Axiomatic Set Theory), in which Cantor's theory of cardinal numbers and ordinal numbers was restored. Also, the theory of the algebra of sets, which forms a basis for various branches of ma thematics, was re- constructed. Axiomatic set theory is consid- ered to be free from paradoxes. G. Classes A set in the naive sense is a collection {x I C(x)} of all x which satisfy a certain con- dition C(x). The only principle for generating sets in naive set theory is the axiom of compre- hension, which asserts the existence of the set {x I C(x)} for any condition C(x). However, this principle leads to paradoxes if the notion of an arbitrary set is considered to be well defined; for example, the tRusselI paradox is caused by the set {xlxx}. This situation necessitates some restrictions on the axiom of comprehen- sion. The simplest way to overcome the para- doxes is to adopt Zermelo's axiom of subsets: Given a set M and a condition C(x), there exists a set {x I XE M, C(x)}. But this axiom cannot produce any sets other than subsets of sets whose existence is preassumed. Hence further generating principles of sets had to be introduced. The following axioms are usually chosen as generating principles. Axiom of pairing: For any two objects (pos- sibly sets) a and b, there exists a set {a, b}. 1422 Axiom of power set: Given a set A, its power set (A) exists. Axiom of union: For any family of sets the union exists. Axiom of substitution (or replacement): For any set A and any mapping f from A, there exists a set of all images f(x) with XE A. In ordinary theories of mathematics the set of natural numbers, the set of real numbers, etc., are assumed to exist, in addition to sets generated by the axioms in this section. In pure set theory the axiom of infinity is needed to secure the existence of infinite sets. The concept of the "set" {x I x x} does not automatically lead to Russell's paradox. The trouble arises when this "set" is regarded as a member of a collection represented by x. This leads to a narrower concept of sets. Consider a fixed collection V consisting of sets in the naive sense and closed under the set-theoretic oper- ations mentioned in the axioms. Call a mem- ber of Va set in the narrow sense. Then set theory becomes free from the known para- doxes if the qualification for being a set is restricted in this narrow sense. When sets in the narrow sense are called simply sets, sets in the naive sense are called classes. The object {x I x  x} (where x ranges over sets in the nar- row sense) is a class which is not a set. Those classes which are not sets are called proper classes; for example, the class V of all sets and the class of all ordinal numbers are both pro- per classes. For classes, unrestricted use of the comprehension axiom again leads to para- doxes, but other set-theoretic opcrations are justifiably applicable to classes. The notion of classes was first introduced in connection with the construction of an axiomatic set theory. The term class was used originally to denote certain subclasses of the class V of all sets. In these volumes the term set is mostly used to mean a set in the naive sense, and most of the notions defined for sets are applicable to classes. References [IJ G. Cantor, Gesammelte Abhandlungen, Springer, 1932. [2J E. Zermelo, Beweis, dass jede Menge wohl- geordnet werden kann, Math. Ann., 59 (1904), 511-516. [3J A. Schoenflies, Entwicklung der Mengen- lehre und ihrer Anwendungen, Teubner, 1913. [4J F. Hausdorff, Grundzi.ige der Mengen- lehre, Veit, 1914 (Chelsea, 1949). [5J A. Fraenkel, Einleitung in die Mengen- lehre, Springer, third edition, 1928. [6J F. Hausdorff, Mengenlehre, Teubner, 1927; English translation, Set theory, Chelsea 1962. 
1423 [7] A. Fraenkel, Abstract set theory, North- Holland, 1953. [8] 1. L. Kelley, General topology, Van Nostrand, 1955, appendix. [9] N. Bourbaki, Elements de mathematique, I. Th€:orie des ensembles, ch. 2, Actualites Sci. Ind., 1212c, Hermann, second edition, 1960; English translation, Theory of sets, Addison- Wesley, 1968. [10] P. R. Halmos, Naive set theory, Van Nostrand, 1960. 382 (IX.23) Shape Theory A. General Remarks In 1968 K. Borsuk introduced the notion of shape as a modification of the notion of thomotopy type. His idea was to take into account the global properties of topological spaces and neglect the local ones. It is a classi- fication of spaces that is coarser than the homotopy type but that coincides with it on tANR-spaces. Let f1J be the category whose objects are all tpolyhedra and whose morphisms are homo- topy classes of continuous mappings between them. For spaces X and Y, denote the set of a11 thomotopy classes of continuous mappings of X to Y by [X, Y] and the homotopy class of a mapping f by [f]. For a space X, let Ilx be the functor from f1J to the tcategory of sets and functions that assigns to a polyhedron P the set Ilx(P) = [X, P]. A morphism qJ:PQ of f1J induces the function qJ#: [X, P][X, Q] defined by qJ#([f])=qJ' [f] for [f]:X P. A tnatural transformation from Il y to Ilx is a shape morphism from X to Y. A continuous mapping f: X  Y defines the shape morphism f# of X to Y as follows: For [g]: Y P in Ily(P), the composition [g' f] is an element of Ilx(P). The correspondence: [g] [g' f] de- fines a natural transformation from Il y to Ilx and hence determines the shape morphism f# : X  Y. The identity mapping 1 x on X defines the identity shape morphism If on X. Given spaces X and Y, X shape dominates Y if there are shape morphisms : Y  X and '1:X  Y such that '1= If, and we write Sh(X)  Sh( Y). If, in addition, '1 = 1:, then X and Yare of the same shape, and we write Sh(X)= Sh(Y). A shape category 51' is the category whose objects are all topological spaces and whose morphisms are shape mor- phisms between them. If we replace topo- logical spaces by pointed ones, the pointed shape category is obtained. In what follows, 382C Shape Theory for simplicity, we assume that all spaces are metrizable and the mappings are contino us. General references are Borsuk [1], J. Dydak and J. Segal [3], R. H. Fox [4], S. Mardesic [7]. B. Chapman's Complement Theorem Let X be a compactum. A closed set A of X is a Z-set in X if for any s> 0 there is a mapping f:X X -A such that d(x,f(x))<s for XEX, where d is a metric on X. The Hilbert cube Q is the countable product n1 Ii' where Ii is the closed interval [0,1]. The subset s = n1 I? (I? =(0,1)) is called the pseudointerior of Q. The following facts are known: (1) If a compact metric space X is contained in s or Q - s, then X is a Z-set in Q. (2) For any continuous mapping f of a compact metric space X into Q, there exists an embedding 9 of X into Q such that 9 is arbitrarily close to f and the image g(X) in Q is a Z-set. The complement theorem (T. A. Chapman [2]) states: Let X and Y be Z-sets in Q. Then Sh(X) = Sh(Y) iff Q - X and Q - Yare homeomorphic. C. FAR, F ANR, Movability, and Shape Group: Shape Invariants A closed set A of a compactum X is a funda- mental retract of X if there is a shape mor- phism r:X A such that r' i# = 1, where i is the inclusion of A into X. A compactum X is a fundamental absolute retract (FAR) (resp. fundamental absolute neighborhood retract (F ANR)) iffor any compactum Y containing X, X is a fundamental retract of Y (resp. of some closed neighborhood of X in Y). A com- pactum X is movable iffor any embedding Xc Q and for any neighborhood U of X in Q there is a neighborhood V of X satisfying the folIowing condition: For any mapping f of a compactum Y to V and for any neighborhood W of X, there is a homotopy H: Y x I  U such that H(y, 0)= f(y) and H(y, l)E W for YE Y. In this definition, if Y is replaced by a compactum with dimension  k, then X is said to be k-movable. Pointed FAR, F ANR, mova- bility, and k-movability are defined similarly in the pointed shape category. The follow- ing facts are known (Borsuk [1], Dydak and Segal [3], J. Keeslings [5], 1. Krasinkiewicz [8]). A compactum X is a FAR if and only if X is a pointed FAR iff Sh( X) = Sh(point), i.e., X has the same shape as a one-point space. A pointed compactum (X, x) is a pointed FANR iff Sh(X, x)  Sh(K, k) for some pointed poly- hedron (K, k). An F ANR is movable. A com- pact connected Abelian topological group is movable if and only if it is locally connected. 
382D Shape Theory A continuous image of a pointed I-movable compactum is pointed I-movable. It is un- known whether (i) an FANR is a pointed F ANR and (ii) movability means pointed movability. For a pointed compactum (X, x), let {(Kj, kJ Ii = 1,2, ...} be a countable tin- verse system consisting of pointed finite poly- hedra whose limit is (X, x). The limit group l nn(Ki' kJ is the kth shape group of (X, x), where nn(K, k) is the kth homotopy group of (K, k). It is known that the shape groups for movable compacta behave like homotopy groups for ANR.  property P of spaces is a shape invariant if whenever X has P and Sh(X)=Sh(Y), then Yhas P. FAR, FANR, movability, k-movability, and shape groups are shape invariants. D. CE Mappings A mapping f of a space X onto a space Y is a cell-like (CE) mapping if it is proper and Sh(r 1 (y)) = Sh(point) for each point y of Y. It is known (R. B. Sher [9], Y. Kodama [6]) that if there is a CE mapping of X to Y with finite dimension, then Sh(X) = Sh( Y). Here the finite- dimensionality of Y is essential. A Q-manifold is a space, each point of which has a closed neighborhood homeomorphic to Q. The fol- lowing are known (Chapman [2], J. E. West [10]): (1) Iff is a CE mapping of a Q-manifold M to an ANR X, then the mapping g:M x Q-> X x Q defined by g(m, x) = (f(m), x) for (m, X)E M x Q is approximated by homeomorphisms. As a consequence, if X is a locally compact ANR, then X x Q is a Q-manifold. (2) Every compact ANR is a CE image of a compact Q- manifold. (3) Every compact ANR has the same homotopy type as that of a compact polyhedron. The following problem raised by R. H. Bing is open: Is a CE image of a finite- dimensional com pactum finite-dimensional? References [1] K. Borsuk, Theory of shape, Monograf. mat. 59, Polish Scientific Publishers, 1975. [2] T. A. Chapman, Lectures on Hilbert cube manifolds, CBMS 28 regional conf. ser. in math., Amer. Math. Soc., 1976. [3] 1. Dydak and J. Segal, Shape theory, Lec- ture notes in math. 688, Springer, 1978. [4] R. H. Fox, On shape, Fund. Math., 74 (1972),47-71. [5] J. Keesling, Shape theory and compact connected Abelian topological groups, Trans. Amer. Math. Soc., 194 (1974),349-358. [6] Y. Kodama, Decomposition spaces and shape in the sense of Fox, Fund. Math., 97 (1977),199-208. 1424 [7] S. Mardesic, Shapes for topological spaces, General Topology and App\., 3 (1973),265- 282. [8] 1. Krasinkiewicz, Continuous images of continua and I-movability, Fund. Math., 98 (1978),141-164. [9] R. B. Sher. Realizing cell-like maps in Euclidean spaces, General Topology and Appl., 2 (1972), 75-89. [10] 1. E. West, Mapping Hilbert cube mani- folds to ANR's: A solution of a conjecture of Borsuk, Ann. Math., (2) 106 (1977), 1-8. 383 (11.26) Sheaves A. Presheaves Let X be a ttopological space. Suppose that the following conditions are satisfied: (i) There exists an (additive) Abelian group SF(V) for each open set V of X, and SF(0) = {OJ; and (ii) there exists a homomorphism "uy:SF(V)-> SF(V) for each pair V c V, such that "uu= 1 (identity) and "Uw="UyO"YW for U eVe W. We call SF, consisting of a family 1 SF( V)} of Abelian groups and a family of mappings {"uv}, a presheaf (of Abelian groups) on X. If aESF(V) and V c V, we write ,.Uy(a)=al V and call it the restriction of a to V. A homomor- phism qJ between two presheaves SF and '1J on X is a family {qJ( V)} of group homomorphisms qJ(V):SF(V)->'1J(V) satisfying "UyoqJ(V)= qJ(V)O,.uv whenever V c V. The presheaves on X and their homomorphisms form a tcategory. B. Axioms for Sheaves A presheaf SF is called a sheaf (of Abelian groups) if it satisfies the following condition: If V is open in X and (VJiEI is an topen covering of V, and if for each iEI an element Si of SF(V i ) is given such that Si I Vi n  = Sj I Vi n V j for all i andj, then there exists a unique sESF(V) such that S I Vi = Sj for all i. By definition, a homo- morphism between two sheaves is a homomor- phism of the presheaves. The sheaves on X also form a category. Let SF be a presheaf, x a point of X, and 91x the tdirected set of open neighborhoods of x, with the order opposite to that of inclusion. Then {SF(V)I VE91 x } is an inductive system of groups. The tinductive limit Iim xEu SF(V) of -> groups {g;( U)} is denoted by g;x and called the stalk of SF over x. The image of sESF(V) in ,cF x is called the germ of S at x and is written SX' 
1425 A homomorphism qJ::F -+ of pres heaves induces a homomorphism qJx::Fx-+x of stalks. C. Sheaf Spaces We introduce a topology on the tdirect sum :F' = llxex:Fx in the following way: For each open set U of X and each SE:F(U), consider the set Mu,s={sxlxE U} of the germs defined by S at the points of U, and take the set of all such M U,s as a tbase of open sets of the to- pology. If p::F' -+ X is the mapping that maps the points of:Fx to x, then p is continuous, and each p -I (x) (=:Fx) has the structure of an Abelian group. Moreover, the following con- ditions are satisfied: (i) p is a tlocal homeo- morphism, and (ii) the group operations on p-I(X) are continuous in the sense that (a, b)-+ a + b is a continuous mapping from the tfiber product :F' x x:F' (i.e., the subs pace {(a, b) I p(a) = p(b)} of the product space :F' x :F') to:F' and a-+ -a is a continuous map- ping from :F' to itself. In general, a topological space :F' with a structure satisfying these conditions is called a sheaf space over X. When :F' is a sheaf space, a continuous mapping S from a subs pace A of X to :F' such that pOS= l A is called a section of :F' over A. The set of sections over A, denoted by qA,:F'), is an Abelian group in the obvious way. If we associate q U, :F') with each open set U and define ruv by the restriction of sec- tions (ruv(s) = S I U), then we get a sheaf :F" on X. If we start from a presheaf :F and get :F" via :F', the correspondence :F-+:F" is a tcovar- iant functor from the category of presheaves to the category of sheaves, and :F" is called the sheaf associated with the presheaf :F. If :F is a sheaf, we can prove :F"';;',:F. Conversely, if we start from a sheaf space :F' and construct the sheaf :F" and then the sheaf space :F''', then :F'" is canonically isomorphic to :F'. Since we can identify a sheaf and the corresponding sheaf space, both are usually denoted by the same letter. In particular, when :F is a sheaf, :F(U) is usually written qu, :F). Given a section SEqX,:F) of a sheaf, the points XEX for which sx#O in:Fx from a closed set (the sheaf space :F is not necessarily Hausdorff even if X is so). This set is called the support of S and is denoted by supps. In the theory of this and the previous two sections, we can replace Abelian groups by groups, rings, etc. Then :F(U) is a group or ring accordingly, and :F(0) the group consist- ing of the identity element or the ring consist- ing of the zero element, respectively. We thus obtain the theories of sheaves of groups, sheaves of rings, etc. In general, a presheaf :F on X with values in a category re is a tcon- 383D Sheaves travariant functor from the category of open sets of X to re, and a homomorphism between presheaves :F and  on X is a tnatural trans- formation between the functors :F and . The pres heaves (sheaves) of Abelian groups on a space X form an tAbelian category, de- noted by r?J>x (reX). For a homomorphism f: :F-+ of presheaves, the image, coimage, kernel, and cokernel of f in r?J>x are given by (Imf)(U)=Imf(U), (Coimf)(U)=Coimf(U), (Ker f) (U) = Ker f( U), (Coker f) (U) = Coker f( U). When :F and  are sheaves, the kernel of fin rex coincides with the kernel in r?J>x, while the image and cokernel of f in rex are the as- sociated sheaves of the image and the cokernel in r?J>x, respectively. Thus,f::F -+ induces fx::Fx-+x at each XEX, (Kerf)x=Kerfx, (Imf)x=Imfx, (Cokerf)x= Cokerfx, and a sequence of sheaves O-+:F .!!..% -+0 is exact if and only if O-+:Fx:0.)%x-+O is exact at each XEX. D. Examples (1) Let G be an Abelian group (or some other talgebraic system) with tdiscrete topology. The Cartesian product X x G gives rise to a sheaf on X, called a constant sheaf (or trivial sheaf). (2) Let X be a topological space and Y be a topological Abelian group (e.g., the real or complex numbers). We obtain a sheaf :F on X by putting :F( U) = the set of an continuous mappings U -+ Yand ruv = the natural restric- tion. The stalk over x E X is the set of germs at x of continuous functions into Y. This sheaf is called the sheaf of germs of continuous func- tions with values in Y. (3) When X is an tanalytic manifold and Y is a commutative tLie group, we define the sheaf of germs of analytic mappings with values in Y in the same way. If Y is the complex number field C, this sheaf is the sheaf (!) of germs of analytic (or holomorphic) functions. A tcon- nected component of the sheaf space (!) can be identified with the tanalytic function deter- mined by the function element corresponding to a point on that component. The sheaf of germs of functions of class C' on a CS-manifold (r  s) is similarly defined. (4) Given a tvector bundle B over a topolog- ical space X, we define a sheaf on X by :F(U) = q U) (= the module of sections of B over U) and rvv = the natural restriction. Here the stalk over x E X consists of the germs at x of sections of B, and is caned the sheaf of germs of sec- 
383 E Sheaves tions of the vector bundle B. We have similar definition for the sheaf of germs of differenti- able (analytic) sections when X is a tdifferenti- able (complex) manifold. The case where B is a ttensor bundle (e.g., the tcotangent bundle :!*(X)) is important. The sheaf \ll r (X) of germs of C" -sections of the r-fold texterior power of :!*(X) is called the sheaf of germs of differen- tial forms of degree r (0  r  dim X). E. Sheaf Cohomology The category ((fx of sheaves of Abelian group, on X has sufficiently many tinjective objects. A sheaf:F with the property that rUX: qX,:F) -->qv,:F) is surjective for any open set V is said to be flabby (or scattered). An injective sheaf is flabby. Fix a nonempty family <1> of closed subsets of X satisfying the following two conditions: (i) A, BE <1> => A U B E <1>; (ii) any closed set con- tained in an element of <1> belongs to <1>. Put- ting r<l>(') = {s I SE qx, ,), sUPPSE<1>} for each. E'if?x, we obtain a tleft-exact tcovariant functor f<l> from rgX to the category (Ab) of Abelian groups. Therefore, by the general theory of homological algebra, we can define the tright derived functors Rqr<l>:'if?x-->(Ab) (q=O, I, 2, ...). We put Rqr<l>W)=H:!,(X,,) and call the H:!,( X, .) (q = 0, 1, . . . ) the coho- mology groups with coefficient sheaf:F and family of supports <1> ( 200 Homological Algebra I). When <1> is the family of all closed subsets of X, we write Hq(X,,) instead of H:!,(X, :F). Thus the cohomology group H:J,(X,:F) is the qth cohomology of the complex f<l>(.£I°)r<l>(.£I')r<l>(.£I2)... induced by an tinjective resolution O-->:F -->.£1 0 -->.£1' -->... of the sheaf:F: H:!,(X,,)= Kerd q /Imd q - 1 (q=O, I, ...;r 1 =0). Hg(X,:F)=f<l>(')' and from an exact se- quence of sheaves O-->:F -->'91-->ff -->0 we get an exact sequence O-->Hg(X, :F)-->Hg(X, '91)--> Hg(X, ff)-->HMx, .)-->H(X, '91)-->H(X, ff) -->H(X, :F)-->.... Similarly, the cohomology groups H:J,(X,:F) can also be calculated with an exact sequence O-->.-->.£I°-->.£I'-->..., where each.£li is as- sumed to be f<l>-acyclic (i.e.. H:J,(X, .£Ii) =0 for q > 0) instead of injective. The flabby sheaves, for instance, are r<l>-acyclic, so we can compute H:J,(X,.'¥) by a flabby resolution of:F (R. Godement). For example, let X be an n- dimensional tparacompact CCX" -manifold and :F=R. Then O-->R-->'lI°(X)'11'(X)... is exact, where 'lIq(X) is the tsheaf of germs of ex, -differential forms of degree q, d q is tex- terior differentiation (Poincare's theorem), and we have HP(X, Uq)=O for p>O. Therefore 1426 Hq(X, R) is the qth cohomology of the com- plex O-->DO(X)D1(X)..., where Di(X) =r(X, \ll i (X))=the group ofC"-differential forms of degree i on X. This proves the de Rham theorem, which says that the tde Rham cohomology group is isomorphic to the t(sin- gular) cohomology group of X with real coeffi- cients ( 105 Differentiable Manifolds R). For a sheaf:F of noncommutati ve groups, we can define the first cohomology HI (X, .) [2]. F. The Cech Cohomology Group Let U = {VJ be an open covering of X, and write Vi n V j = V ij , etc. Put cP(,)= IT qv io .ip,:F), po=O, 1,2,.... la, ,Ip An element of CP(:F) is called a cochain of degree p. Define d:CP(:F)-->CP+1() by ( d lf ' ) . . =/'+ O l ( -I ) r ( r. A . I v . ) fO. .lp+l r- Ji o 'r ..Ip-f-l 10" lp+l ' and denote the qth cohomology of the com- plex (cP(:F),d) thus obtained by Hq(U,.). When an open covering !B is a refinement of U, there is a canonical homomorphism Hq(l1, ,)-->Hq(!B, ,). So we can take the inductive limit of the groups Hq(U,:F) with respect to the refinement of open coverings. This limit group is denoted by fi"{x,:F) and is called the Cech cohomology group with coeffi- cient sheaf:F. It coincides with Hq(X,:F) for q  I, and if X is paracompact, for all q. G. Relation to Continuous Mappings Let X and Y be topological spaces and f: X --> Y be a continuous mapping. If  is a sheaf on Y, the fiber product X x y:F (where :F is viewed as a sheaf space over Y) is a sheaf on X. It is denoted by J*(:F) or J-1(.) and is called the inverse image of ,. The correspon- dence :F --> [* (:F) is an exact functor from ((fY to ((fx. Next, let '91 be a sheaf on X. Associating f(f-1 (V), '91) with each open set U of Y, we obtain a sheaf on Y, which we denote by J*('91) and call the direct image of '91. The corre- spondence J* is a left-exact functor 'if?x -->«jY, and we can consider its right derived functors R q[*. The sheaf R qJ*('91) is the sheaf associated with the presheaf that associates Hq( 1-1 (V), '91) with each open set V. A homomorphism I/J from :F to [*(r./f) is also called an .f-homomorphism from:F to r./f. To give such a I/J is equivalent to giving a family of homomorphisms of the stalks l/lx ::Fj1X)-->r./f x (XEX) satisfying the continuity condition: For any open set V of Yand any section S E q V, F) over V, the mapping ip from J-' (V) to '91 defined by ip(x)=I/JAs(f(x)) is continuous. The functors.f* and [* are related by 
1427 Hom(f*($'), )Hom(ffJ*()). The Leray tspectral sequence E.q=W(Y,Rqf*())=H"(X, ) exists and connects the cohomologies of X and ofY. H. Ringed Spaces Let X be a topological space and (!) be a sheaf on X of commutative rings with unity element such that (!)x#{O} for any XEX. Then the pair (X, (!)) is called a ringed space, and (!) is called its structure sheaf. A morphism (X, (!))-->(X', (!)') is by definition a pair (I, 0) consisting of a continuous mapping f: X -->X' and an f- homomorphism 0: (!)-->(!)'. When each (!) is a t10cal ring, (X, (!)) is ca1led a local ringedXspace. A morphism of local ringed spaces is defined to be a pair (f,O):(X,(!))-->(X',(!)') as before, satisfying the additional condition that 0 is local (i.e., Ox: (!)/(x)-->(!)x maps the maximal ideal into the maximal ideal for each XEX). These concepts are important in algebraic geometry and the theory offunctions of several complex variables. I. Direct Products and Tensor Products Let ff). (AEA) be sheaves of Abelian groups on a topological space X. The sheaf $' on X defined by $'(U) = TI).$').(U) and ruv= TI).l'). . d uv IS denote by ff = TI).ff). and called the direct product of sheaves {$').}. For each XEX there is a natural mapping $'x--> TI).($').)x, which is in general neither injective nor surjective. When A is a finite set, TI:F; is also written $' = $'1 + . .. + ff" and is ca1led the direct sum of the sheaves. The inductive limit $' = ind lim :F;. of an inductive system of sheaves on X also exists, and $'x=indlim$')..x' Let (X, (!)) be a ringed space. A sheaf of Abelian groups $' on X is called a sheaf of (!)- modules (or simply an (!)-module) if $'(U) is an (!)(U)-module for each U and ruv:$'(V) -->$'(U) is a module homomorphism compatible with (!)(V)-->(!)(U) for each U c V. Then $'x is an (!)x. module for each x E X. For a fixed (X, (!)), the (!)-modules form an Abelian category. When ff and  are (!)-modules, the tensor product .Y'f = $' &;  of $' and  as sheaves over (!) is defined as follows: Define a pre sheaf by U--> $'(U) &; (u)(U) and ruv=rv &; rbv, and let .Y'f be the associated sheaf of this presheaf. Then we have .Y'fx = $'x &; '" x' The notion of coherent ;heaves is important in the theory of (!)-modules ( 16 Algebraic Varieties E). 384 A Siegel Domains J. History About 1945, J. Leray established the theory of sheaf coefficient cohomology groups (in a form slightly different from that in Sections E and F) and the theory of spectral sequences to study the relation between the local properties of a continuous mapping and the global coho- mologies. In the theory of functions of several complex variables, K. Oka conceived the idea of "ideals of indefinite domain." These two ideas were unified by H. Cartan into the pre- sent form of sheaf theory. As a link between local properties and global properties, sheaf theory has been applied in many branches of mathematics ( 16 Algebraic Varieties; 21 Analytic Functions of Several Complex Variables; 23 Analytic Spaces; 72 Complex Manifolds). References [1] F. Hirzebruch, Neue topologische Metho- den in der algebraischen Geometrie, Erg. Math., Springer, second edition, 1962; English translation, Topological methods in algebraic geometry, Springer, third enlarged edition, 1966. [2] A. Grothendieck, Sur quelques points d'algebre homologique, Tohoku Math. J., (2) 9 (1957), 119-221. [3] R. Godement, Topologie algebrique et theorie des faisceaux, Actua1ites Sci. Ind., 1252, Hermann, 1958. [4] R. G. Swan, The theory of sheaves, Univ. of Chicago Press, 1964. [5] G. E. Bredon, Sheaf theory, McGraw-Hill, 1967. [6] J. Leray, L'anneau spectral et l'anneau fi1tre d'homologie d'un espace localement compact et d'une application continue, J. Math. Pure App1., 29 (1950), 1-139. [7] H. Cartan, Sem. de topologie algebrique, Paris, 1948-1949; 1950-1951. [8] A. Borel, Sem. de topologie algebrique de l'Ecole poly technique federale, 1951. (Co- homologie des espaces localement compacts d'apres 1. Leray, Lecture notes in math. 2, Springer, 1964.) 384 (VII.20) Siegel Domains A. Siegel Domains Let D be a bounded domain in C" and Gh(D) the full tholomorphic automorphism group of 
384 B Siegel Domains D, which is a tLie transformation group with respect to the tcompact-open topology. If Gh(D) acts transitively on D, then D is called a thomogeneous bounded domain. The study of homogeneous bounded domains was initiated systematically by E. Cart an in 1936, while the notion of Siegel domains, which was intro- duced by I. I. Pyatetskii-Shapiro, has made remarkable contributions to the study of homogeneous bounded domains. Let V be a convex domain in an n- dimensional real vector space R. V is called a regular cone iffor every XE V and A>O, AXE V and if V contains no entirely straight lines. Let W be a complex vector space. A mapping F: Wx W-->R c (the tcomplexification of R) is a V- Hermitian form if the following conditions are satisfied: (Fi) F(u, v) is C-linear in u, (Fii) F(u, v) = F(v, u), where the bar denotes the tconjuga- tion with respect to R, (Fiii) F(u, U)E V (the closure of V), and (Fiv) F(u,u)=O implies u=O. Given a regular cone VcR and a V- Hermi tian form F on W, one can define a Siegel domain D(V, F) (of the second kind) by putting D(V, F) = {(x+iy, u)ER c x Wly-F(u, U)E V}, which is holomorphically equivalent to a bounded domain in R C x W. When W = (0), D( V, F) is reduced to D(V)= {x+ iYER c I yE V}, which is called a Siegel domain of the first kind. A mapping L: W x W --> R C is a nondegenerate semi-Hermitian form if L can be written as L =L1 +L2' where L 1 and L 2 are RC-valued functions satisfying the conditions: (Li) L 1 satisfies (Fi) and (Fii); (Lii) L 2 is a symmetric C-bilinear form; (Liii) L(u,v)=O for all UEW implies v=o. Let B be a bounded domain in a complex vector space X and Lp (pEB) an R C _ valued nondegenerate semi-Hermitian form on W depending differentiably on pEE. Consider a domain D( V, L, B) in R C x W x X defined by putting D(V, L,B)= {(x+iy, u,p)ER c x W x XI y-ReLp(u,u)E V,pEB}. The domain D(V,L,B) is called a Siegel domain of the third kind over B if it is holomorphically equivalent to a bounded domain. D(V, L, B) is a fiber space over B. By the affine automorphism group G a of a Siegel domain D( V, F) c R C x W we mean the group consisting of all elements in the complex affine transformation group of R C x W leaving D( V, F) stable. The full holomorphic automor- phism group G h of D(V, F) contains G a as a closed subgroup. If G h acts transitively on D(V,F), then D(V,F) is said to be homogene- ous. A homogeneous Siegel domain is necessarily affinely homogeneous, i.e., G a acts transitively on D(V,F) [2]. The tBergman metric of D(V,F) which is a Gh-invariant tKah- 1er metric, is tcomplete [3], and so D(V, F) is a tdomain of holomorphy. 1428 Examples. H(n, R) denotes the vector space of all real symmetric matrices of degree n, and H +(n, R) the regular cone consisting of all positive definite matrices in H (n, R). (i) The Siegel domain of the first kind D(H+(n,R))={X +iYIXEH(n,R), YE H + (n, R)} is called the tSiegel upper half- plane, which is holomorphically equivalent to the classical tsymmetric domain of type III. (ii) Let u, VEC and F(u, v) be the 2 x 2 diag- onal matrix diag(uv, 0), which is an H+(2,R)- Hermitian form on e. The resulting Siegel domain is D(H+(2,R),F) = { (ZI,Z2,Z3,U)EC4 1( ImZ1-luI2 ImZ3 ) Imz3 Imz 2 EH+(2, R)}. (iii) Let B = {tEC Iitl < I}, and let u, VEe. Put L,(u, v) =(1-ltI 2 ) -1(UV + tuv). Then L,(u, v) is a nondegenerate semi-Hermitian form, and we have D(H+(2,R),L,B)={(z,u.t)EC31Imz -(1-ltI 2 )-1 Re(lul 2 + tu 2 ) >0, It I < l}, which is a Siegel domain of the third kind and is holomorphically equivalent to the Siegel upper half-plane of dimension 3. The domains in (i) and (ii) are both affinely homogeneous; the latter was originally found by Pyateteskii-Shapiro in 1959 [I] and pro- vides the least-dimensional example of non- symmetric homogeneous bounded domains, which answered affirmatively Cartan's con- jecture (1936): Are there non-tsymmetric homogeneous bounded domains in C" (n;'4)? B. Infinitesimal Automorphisms of Siegel Domains For a Siegel domain D(V, F) c R C x W, the Lie algebra gh of G h can be identified with the Lie algebra of infinitesimal automorphisms, i.e., all complete holomorphic vector fields on D(V, F). Let G( V) be the group consisting of all the linear automorphisms of R leaving V stable. Let us fix a base in R, and let (z l' Z2' ..., zn) be the complex linear coordinate system in R C corresponding to it. Choose a complex linear coordinate system (u 1 , u 2 , ..., urn) in W. We write F(u, v) as F(u, v) = (PI (u, v), .., Fn(u, v)). Consider the following two vector fields in the Lie algebra ga of G a : a a E = 2Ik Zko + I. u., ()Zk DU. a 1= I .iU'- a ' u. and put g; = {X Egal [E, X] = i.X}, AEZ. Then ga can be written as a tgraded Lie algebra in the following way: 9a=9a- 2 +9;1 +9. Here 
1429 we have g 2 = {Lk a k ak \ ak E R} , g1 = {2iLk Fk(U, c) ak + L c. a.\ c=(c.), C.EC}, and g2 consists of al1 vector fields X(A,B) of the form a a L.jakjz j aZ k + L,p b.pu p au; where the matrices A = (akj), B = (b. p ) satisfy the conditions: exptAEG(V), tER; AF(u,v)= F(Bu, v)+ F(u, Bv). For X(A,B)Eg2 we define tr X(A.B) to be the sum of the trace of A and that of B. Let gl be the A-eigenspace of ad E in gh (AEZ). Then gh can be written also in the form of a graded Lie algebra: gh=g-2+ g -1 + gO + gl + g2, and gl=g; is valid for A= -2, -1, O. Furthermore gh can be nicely deter- mined by ga in the fol1owing manner. Pill de- notes a polynomial on R C x W homogeneous of degree J.l in Z I' . ., ,Z., and homogeneous of degree A in u l , '" ,u m . Let 9 1 (resp. 9 2 ) be the set of al1 polynomial vector fields of the form " k a ,, ( . . )  L..k PI. 1 ".--+ L.... P1.0 + PO.2 au UZk . (res p . LkPto ak + L.P:'l aJ Then we have gl = {X Eg11 [X, g-l] egO}, and g2= {X Eg 2 1 [X, g-2] e gO, [X, g-l] e g1, ImTr[X, Y]=O for Y Eg -2}. Another descrip- tion of gl and g2 has been given in terms of Jordan triple systems [4]. The explicit descrip- tions of gl and g2 have been given for most homogeneous Siegel domains D(V, F) over irreducible self-dual cones V ( Section D; T. Tsuji, Nagoya Math. J., 55 (1974)). gh = ga is valid for the Siegel domains which are irreduc- ible quasisymmetric but not symmetric ( Section D). Main references for this section are [2-6]. C. j-Algebras and Homogeneous Bounded Domains The notion ofj-algebra was introduced by Pyatetskii-Shapiro [1], which reduces the study of homogeneous bounded domains to purely algebraic problems. Let 9 be a Lie algebra over R, and I a subalgebra of g, (j) a col1ection of linear endomorphisms of g, and w be a linear form on g. Then the quadruple 384C Siegel Domains {g, I, (j), w} (sometimes abbreviated g) is cal1ed aj-algebra if the fol1owing conditions are satisfied: (i)jI e I for jE(j) andj==j'(mod I) for j, j' E(j); (iiV == - id(mod I); (iii)j[k, x] == [k,jx] (modI) for kEI, XEg; (iv) [jx,jy] ==j[jx,y] + j[x,jy] + [x, y] (modI) for x, YEg; (v) w([k,x]) =0 for kEf; (vi) w([jx,jy]) = w([x, y]); (vii) w([jx,x]»O for xff. Let g' be a subalgebra of 9 such that j g' e g' + f. Then, putting f' = g' nt, one can natural1y induce a j-algebra structure on the pair {g', f'}. The j-algebra thus obtained is cal1ed aj-subalgebra of {g, I, (j), w}. A j-a1gebra {g, f, (j), w} is cal1ed proper (resp. effective), if, for any j-subalgebra {g', f'} with g' compact semisimple, g' is contained in f (resp. if {g, I} is an effective pair). Now let D be a homogeneous bounded domain in en, G a connected tLie subgroup of Gh(D) acting ttransitively on D, and K the tisotropy subgroup at a point in D. The Lie algebras of G and K are denoted by 9 and f, respectively. Then the pair {g, I} becomes an effective proper j-algebra. Conversely, to every effective proper j-algebra there corresponds a homogeneous bounded domain. The identity component of Gh(D) is isomorphic to the iden- tity component of a treal algebraic group via the tadjoint representation. Let {g, I, (j), w} be aj-algebra. Suppose that 9 satisfies the fol1owing conditions: (i) 9 = 9 -2 + 9 -1 + gO as a graded Lie algebra; (ii) gO=f+jg-2; (iii) there exists ajE(j) such thatjg -1 = 9 -1; and (iv) there exists an rEg- 2 such that [jx,r]=x for x E 9 - 2. Such a decomposition is cal1ed a Siegel decomposition of g. To an effectivej- algebra admitting a Siegel decomposition there corresponds a unique Siegel domain up to affine equivalence. Vinberg, Gindikin, and Pyatetskii-Shapiro (Appendix in [1] or Trans. Moscow Math. Soc., 12 (1963)) proved that the Lie algebra gh(D) of Gh(D) contains aj- subalgebra admitting a Siegel docomposition and corresponding to the same domain D, and obtained the realization theorem: Every homogeneous bounded domain D is holomor- phical1y equivalent to a Siegel domain. In con- sequence, D is diffeomorphic to a Euclidean space, and the isotropy subgroup Kh(D) is a maximal compact subgroup of Gh(D). We have the decomposition Gh(D) = Kh(D)' T (semi- direct), where T is an R-splittable solvable subgroup of Gh(D) acting simply transitively on D. T is uniquely determined up to conju- gacy in G(D) (= the identity component of Gh(D)), and is cal1ed the Iwasawa group of D. The j-algebra structure of the Lie algebra t of the Iwasawa group T is characterized by the fol1owing properties: (i) for every tEt, the eigenvalues of ad tare al1 real; (ii) there exists a tcomplex structure j such that [jx,jy] = 
384 D Siegel Domains j[jx,yJ + j[x,jyJ + [x,yJ for x, YEt; and (iii) there exists a linear form w on t such that w([jx,jyJ)=w([x,yJ) and that w([jx,xJ»O for x =1= O. A Lie algebra satisfying (i)-(iii) is called a normal j-algebra. There exists a one- to-one correspondence between the set of holomorphic equivalence classes of homog- eneous bounded domains and the set of j- isomorphism classes of normal j-algebras; by a j-isomorphism here we mean an isomorphism which commutes with j. Let {g,j, w} be a normal j-algebra and define an inner product (, )on9 b y(x,y)=w([jx,yJ).Theortho- gonal complement I) with respect to ( , ) of the tcommutator subalgebra gl of 9 is an Abelian subalgebra of 9, and the adjoint repre- sentation of I) on 91 is fully reducible. One has 9= La fa, I) = fa, and 91 = La*o fa, where fa = {XE 9 I [h, xJ = !X(h)x, hE I)}. The linear form !X on I) is called a root of 9. There exist 1 roots !X" "',!X l (I = dim I)) such that I) can be written in the form I) = jf a ! + ... + jf a " 1 being the rank of 9. Then, after a suitable change of the num- bering of the !X/s, any root !X will be seen to be of the form (!Xi + !Xd/2, (!Xi - !Xd/2 or !Xz/2, where I  i  k  I. A normal j-algebra admits a unique Siegel decomposition which can be constructed by using root spaces. D. Equivariant Holomorphic Embedding We retai" the notation of Section B. Let D( V, F) c R C x W be a Siegel domain and 9c be the tcomplexification of the Lie algebra 9h' 9- 1 has the complex structure defined by the endo- morphism adJ. Let g;' be the ::!:i-eigenspaces in the complexification gc' of g-1 under ad [. Let us consider the complex subalgebras b = 9::' +9g+9L+9 and n=9c2+g1 of 9c, where the subscripts C denote the complexi- fication of the respective space. Let G c be the connected tcomplex Lie group generated by 9c and containing G (= the identity com- ponent of G h ) as a subgroup. The Lie algebra of the normalizer B of b in G c coincides with b. Identifying R C x W with n as a complex vector space, and denoting by n the natural projection of G c onto the complex coset space Gc/B, the composite mapping T=nexp is a holomorphic embedding of n into G c / B, which induces a holomorphic G-tequivariant embedding of D(V, F) into Gc/B as an open submanifold. This embedding is called the Tanaka embedding. By the (Shilov) boundary S of D(V,F) we mean the real submanifold S= {(x+ iy, U)E R C x Wly=F(u, u)} of R C x W, which is a subset of the boundary of D(V,F). S has the natural tCR-structure induced from the complex structure of R C x W. Every element of 9h can be extended to a unique holomorphic 1430 vector field on R C x W which is tangent to S, and its restriction to S is an infinitesimal tCR- automorphism on S, i.e., a complete vector field generating a I-parameter group of tCR- equivalences of S onto itself. Conversely, every infinitesimal CR-automorphism on Scan be extended uniquely to a holomorphic vector field on R C x W, and an element of 9h is char- acterized as an infinitesimal CR-automorphism on S whose extension leaves the tBergman kernel form of D(V, F) invariant [6]. Let n = dimcg c , m= dimcb, and let k=()-1. Then Gc/B, and consequently D(V, F), is embedded holomorphically into the complex tGrass- mann manifold of m-dimensional subspaces in 9c and so into the tcomplex projective space Pdq. Any element of G is induced from a projective transformation and hence is a bi- rational transformation on D(V, F). Let D be a homogeneous bounded domain in en, gh the Lie algebra of Gh(D), and f h the isotropy subalgebra of gh; and let 9c be the complexification of 9h' 9h is aj-algebra. Let us define the complex subalgebra 9_ of 9c by putting 9_ ={x+ijxlxEgh,jE(j)}. Then we have 9c=9h+9-, 9hn9- =f h . Let G c be the connected Lie group generated by gc and containing G(D) as a subgroup. Let G_ be the connected (closed) Lie subgroup of G c gen- erated by g_. Then D can be holomorphically embedded in Gc/G- as the open G(D)-orbit of the origin of Gc/G _ [8]. This embedding is called the generalized Borel embedding. Gc/G- is compact if and only if D is symmetric, and in this case Gc/G- coincides with the compact dual [9]. Let {to,j, w} be a normal j-algebra of rank 1 corresponding to a homogeneous bounded domain Do, and define the Hermitian inner product h by h(x,y)=w([jx,yJ)+iw([x,yJ) for x, y E to. to has 1- I (normal) nontrivial j-ideals (i.e.,j-invariant ideals) up to j-isomorphisms. Take a j-ideal t 1 of to. Then we ha ve to = t 1 + t 2 , t 2 being a (normal) j-subalgebra of to de- fined as the orthogonal complement of t 1 in to with respect to h. The geometric version of this is that Do is represented as a holomorphic fiber space over the homogeneous bounded domain D 2 corresponding to t 2 , with fibers holomorphically equivalent to the homog- eneous bounded domain D 1 corresponding to t 1 . For this fibering there exists a universal fiber space D over the product 15 2 of certain classical symmetric domains, with the same fibers, which plays the same role as that of a tuniversal fiber bundle in topology. Here, D is again a homogeneous bounded domain. The fiber space Do-->D2 is induced from the fiber space D-->15 2 by the classifying mapping.le of D 2 to 15 2 , Let f3 be the generalized Borel em- bedding of D into Gc/G _' Then there exists a 
1431 complex Abelian subalgebra m of gc satisfying gc=g- +m (semidirect), and one can con- struct a biholomorphic mappingf of {3(D) onto a certain Siegel domain of the third kind over D 2 in the vector space m [8]. The fiber space Do --+ D 2 coincides with the one induced from the aforementioned Siegel domain of the third kind by the composite mapping of A. and f. {3. Every realization of a homogeneous bounded domain as a Siegel domain of the third kind is obtained by this method. E. Classification of Homogeneous Bounded Domains The main concern is to classify an homoge- neous bounded domains in C" up to holomor- phic equivalence. Since the realization as Siegel domains has been set up, the second step is to get the uniqueness theorem: The holomorphic equivalence of two homogeneous Siegel domains implies that they are linearly equivalent, that is, there exists a (complex) linear isomorphism between the ambient vec- tor spaces which carries the one domain to the other. The uniqueness theorem was first stated in 1963 (Appendix in [1]), rigorously proved in 1967 [10], and in 1970 the homogeneity assumption was removed [2]. A homogeneous Siegel domain is caned irreducible if it is not holomorphicany equivalent to a product of two homogeneous Siegel domains. Every homogeneous Siegel domain is linearly equiva- lent to a product of irreducible homogeneous Siegel domains [3,10]. A homogeneous Siegel domain D(V, F) is irreducible if and only if the regular cone V is irreducible, i.e., if V cannot be written as a direct sum of two regular cones. So the problem is to classify irreducible homogeneous Siegel domains up to linear equivalence. This reduces to classifying two kinds of nonassociative algebras with bigrada- tion, caned T-algebras and S-algebras [11]. Nonsymmetric homogeneous Siegel domains appear in dimension 4. The numbers of such domains are finite up to dimension 6, but in every dimension ;::, 7 there is at least one con- tinuous family of non symmetric irreducible homogeneous Siegel domains, which are not mutuany holomorphicany equivalent. There is a remarkable class of homogene- ous Siegel domains, caned quasisymmetric [12], which contains the class of symmetric bounded domains. A regular cone VcR is caned self-dual if there exists an inner product ( , ) on R such that V={xERI(x,y»O for yEfr -(O)}, V denoting the closure of V. V is caned homogeneous if the group G(V) is tran- sitive on v. Suppose that V is homogeneous self-dual. Then the group G(V) is tself-adjoint 384F Siegel Domains with respect to the inner product ( , ) on R. Let g(V) be the Lie algebra of G(V). Then the totality fry) of skew-symmetric operators in g(V) with respect to ( , ) is a tmaximal com- pact subalgebra of g(V) and is the isotropy subalgebra of g(V) at a point eE V. Consider the associated tCartan decomposition g(V)= f(V)+ p(V). For each xER there exists a unique element T(X)Ep(V) such that T(x)e=x. Let F be a V-Hermitian fonn on a complex vector space W. Define a Hermitian inner product < , > on Wby <u,v>=(e,F(u,v)) for u, VE W, and let H(W) be the set of Her- mitian operators on W with respect to < , >. A (homogeneous) Siegel domain D(V, F) c R C x W is caned quasisymmetric if V is homo- geneous self-dual and iffor each xER there exists R(X)E H(W) such that F(R(x)u, v) + F(u, R(x)v) = T(x)F(u, v) for u, VE W. The normal j- algebra t of an irreducible quasi symmetric Siegel domain is characterized by the fonowing conditions: dim f("+'k)/2 = a (1  i  k  I); and dim k.;/2 = b (1  i  I), where a, b are some constants and 1 is the rank of t (D'Atri and de Miateno). Quasisymmetric Siegel domains have been completely classified (M. Takeuchi, Nagoya Math. J., 59 (1975), also [12]). F. Generalized Siegel Domains and Further Results Let Q be a domain in C" x cm (n,m;::' 0) which is holomorphicany equivalent to a bounded domain and contains a point of the form (z,O), Z E C. Q is caned a generalized Siegel domain with exponent C (cER), ifQ is invariant under holomorphic transformations of C" x C m of the types (z, u) 1-+ (z + a, u) for an aER", (z, u)l-+(z, ei'u) for an tER, (z, u)1-+ (e' z, eC'u) for an tER. Let D be a bounded domain in C", and r a subgroup of Gh(D). r is said to sweep D if there exists a compact set KeD such that r K = D. r is said to divide D if r, provided with dis- crete topology, acts properly on D and sweeps D. D is caned sweepable (resp. divisible) ifthere exists a subgroup r of Gh(D) which sweeps (resp. divides) D. A divisible generalized Siegel domain is symmetric. A sweepable generalized Siegel domain with exponent C *0 (resp. C =0) is a Siegel domain (resp. a product of a Siegel domain of the first kind and of a homogene- ous bounded circular domain) ([13]; also A. Kodama, J. Math. Soc. Japan, 33 (1981)). Some results have been obtained concerning geometry of bounded domains, homogeneous bounded domains, and Siegel domains in 
384 Ref. Siegel Domains complex Banach spaces [14], and also con- cerning the unitary representations of the generalized Heisenberg group on the square- integrable cohomology spaces of ab-complexes on the Shilov boundary of a Siegel domain [ 15]. References [IJ I. I. Pyatetskii-Shapiro. Automorphic functions and the geometry of classical do- mains, Gordon & Breach, 1969. [2J W. Kaup, Y. Matsushima, and T. Ochiai, On the automorphisms and equivalences of generalized Siegel domains, Amer. 1. Math., 92 (1970),267-290. [3J K. Nakajima, Some studies on Siegel domains, J. Math. Soc. Japan, 27 (1975),54- 75. [4J J. Dorfmeister, Homogene Siegel-Gebiete, Habilitation, Univ. Munster, 1978. [5J S. Murakami, On automorphisms of Siegel domains, Lecture notes in math. 286, Springer, 1972. [6J N. Tanaka, On infinitesimal automor- phisms of Siegel domains, 1. Math. Soc. Japan, 22 (1970), 180-212. [7J S. G. Gindikin, E. B. Vinberg, and I. I. Pyatetskii-Shapiro, Homogeneous Kahler manifolds, Geometry of Homogeneous Bounded Domains, C.I.M.E. 1967; Cremonese, 1968,3-87. [8J S. Kaneyuki, Homogeneous bounded domains and Siegel domains, Lecture notes in math. 241, Springer, 1971. [9J K. Nakajima, On Tanaka's embeddings of Siegel domains, 1. Math. Kyoto Univ., 14 (I 974), 533-548. [IOJ S. Kaneyuki, On the automorphism groups of homogeneous bounded domains, 1. Fac. Sci. Univ. Tokyo, 14 (1967), 89-130. [IIJ M. Takeuchi, Homogeneous Siegel domains, Pub\. study group geometry 7, Kyoto, 1973. [12J I. Satake, Algebraic structure of sym- metric domains, Iwanami and Princeton Univ. Press, 1980. [13J 1. Vey, Sur la division des domaines de Siegel, Ann. Sci. Ecole Norm. Sup., 3 (1970), 479-506. [14J J. P. Vigue, Le groupe des automor- phismes analytiques d'un domaine borne d'un espace de Banach complexe, Ann. Sci. Ecole Norm. Sup., 9 (1976),203-282. [15J H. Rossi and M. Vergne, Group represen- tations on Hilbert spaces defined in terms of ab-cohomology on the Shilov boundary of a Siegel domains, Pacific J. Math., 65 (1976), 193-207. 1432 385 (XVI.6) Simulation A. General Remarks Simulation, in its widest sense, is a method of utilizing models to study the nature of certain phenomena. This method is employed when experimentation with the actual phenomena in question is difficult because of high cost in time or money. Also, it is sometimes almost impossible to carry out observations when the behavior of the objects can be influenced by their surroundings. We can classify simulation techniques into the following four types, although simulations in practical use are usually a mixture of them. The first type is model experimentation, which includes model basins and wind tun- nels in hydrodynamics and pilot plants in the chemical industry. In advance of construction in a real situation, we perform experiments on a small scale and verify or modify those theories upon which the construction is based. The second type is analog simulation or experimental analysis. We investigate the properties of real objects by experiments on alternative phenomena satisfying the same differential equations as those known for or assumed to be satisfied by the real objects. For example, we use an equivalent electric network to study dynamic vibration, and dynamic systems to study heat conduction problems. When theoretical analysis of the actual phe- nomenon is difficult, we look for other phenom- ena with similar properties and study them in order to construct mathematical models for them. This type of simulation has come into practical use mainly in engineering problems, but recently it has been utilized for the study of economic phenomena, nervous systems, the circulating system of an artificial heart, etc. Analog simulation was in the past often per- formed by means of tanalog computers. Now- adays, analog simulation is more frequently performed by digital computers than by ana- log ones. And with the progress of electronics it has become easier to make special-purpose simulators. The third method of simulation, simulation in the narrow sense, has become more impor- tant as tdigital computers have been developed. In general this method is applied to problems that are more complicated and of larger scale than problems treated by analog simulation. When the mathematical expressions of the phenomenon and the algorithms of its dy- namic structure are known, it is easy to simu- late it by means of a computer program. In 
1433 particular, when these techniques are used to study systems such as sets of machines, equip- ment at factories, or management organiza- tion, we cal1 them system simulations. Major fields where system simulation techniques have been used are traffic control on highways or at airports, arrangement or operation of ma- chines at factories, balancing problems in chemical processes, produCtion scheduling in connection with demands and stocks, overal1 management problems, and design of informa- tion systems. The method has also been ap- plied in designing plants and highways and in the study of social or biological phenomena. Also, when we investigate instruction systems of computers that are yet to be completed or develop programming systems for such computers, existing computers can be used to simulate the new ones. tRandom numbers play an important role where the simulation must include random fluctuations ( 354 Ran- dom Numbers). In such instances, the method is often cal1ed the Monte Carlo method ( Section C). The fourth method of simulation deals with systems containing human beings. Among them are war games for training in military- operation planning, business games for train- ing in business enterprises, and simulators for training pilots and operators of atomic power plants. The contribution of human decision to simulation processes is characteristic of these cases. For example, the participants in a busi- ness game are divided into several enterprise groups. Each group discusses and decides how to invest in plants, equipment, research, and advertising and how to schedule production for each quarter. On the basis of the deci- sions, a computer outputs the records of sales, stocks, and cash for each quarter, according to hidden rules. From the results, each group decides on the next steps. In this way, the groups compete for development. This type of simulation is important not only for train- ing but also for investigating the mechanism of human decision. Slightly different from this type of simulation, the "perceptron" and EPAM (Elementary Perceiver and Memorizer) are related to artificial inte11igence and have been used extensively in the cognitive sciences and in research into the structure and function of the human brain. The third type of simulation has attracted attention in particular and has been. used both in theoretical problems, such as the explication of various phenomena, and in practical prob- lems, such as design or optimum operation of systems or prediction of their behavior. For school education and training of technicians, this is put to use together with simulations of 385C Simulation the fourth type. But their use has also induced heated discussions and controversies on the validity of results. B. Programming Languages for Simulation We usual1y describe models by using general purpose language: such as FORTRAN, or list processing languages such as LISP to simulate situations on computers. For system simula- tion, a number of programming languages have been developed and put to practical use. They can be divided roughly into two cate- gories: those for which systems change continu- ously and those describing discrete changes. CSMP (Continuous System Modeling Pro- gram), CSSI (Continuous System Simulation Language), and DDS (Digital Dynamics Simulator) belong to the former, and hence al1 involve integration mechanisms; but each has a different way of describing a mode!. DYNAMO, which has been implemented, or 1. W. Forrester's Industrial Dynamics and World Dynamics, are used extensively. To control simulation time, one may use GPSS (General Purpose Simulation System), SIMULA (Simulation Language), or SIMSCRIPT (Simulation Scriptor), each employing a differ- ent method to describe state transitions. C. The Monte Carlo Method The Monte Carlo method was introduced by J. von Neumann and S. M. Ulam around 1945. They defined this as a method of solving deter- ministic mathematical problems using tran- dom numbers. L. de Buffon's needle experi- ment, in which the approximate value of 11: is obtained by dropping needles at random many times, is a classical example of this method. Another example is the problem of evaluat- ing a definite integral I = Jf(x)dx (B;::' f(x);::, A;::, 0). First we generate many pairs of (uni- form) random numbers (x,y), where a<:;;x<:;;b . and A <:;; Y <:;; B. The proportion (p) of pairs satisfying y <:;;f(x) gives an estimate of the inte- gral, i.e., Ip(B-A)(b-a). The techniques for inverting matrices, solving tboundary value problems of partial differential equations, and so on, are also examples of the Monte Carlo method in this sense. However, direct numer- ical calculation seems to be more useful in dealing with this sort of problem. At present, Monte Carlo methods are usual1y used when it is difficult to construct (or solve) mathematical equations describing the phenomena in ques- tion, for example, when the phenomena in- volve tstochastic processes such as trandom 
385 Ref. Simulation walks. Some methods have been devised so as to get precise results efficiently. References [IJ B. P. Zeigler, Theory of modeling and simulation, Wiley, 1976. [2J R. S. Lehman, Computer simulation and modeling, Wiley, 1977. [3J R. E. Shannon, Systems simulations: The art and science, Prentice-Hall, 1975. [4J 1. M. Hammersley and D. C. Handscomb, Monte Carlo methods, Wiley, 1964. [5J A. Newell and H. A. Simon, Human prob- lem solving, Prentice-Hall, 1972. [6J M. Minsky and S. Papert, Perccptrons, MIT Press, 1969; second printing with correc- tions by the authors, June 1972. [7J 1. W. Forrester, World dynamics, Wright- Allen Press, 1972. 386 (XX.29) S-Matrices A. Basic Notion It is often useful to focus attention on the relation between a physical system's input and output, without worrying about intermediate processes (the black box), which may be in- sufficiently understood or too complicated to analyze. For the scattering of particles, this leads to the notion of an S-matrix that directly relates the state of incoming particles (before scattering processes take place) to that of outgoing (scattered) particles. In typical cases, the incoming and outgoing particles are described as mutually noninter- acting (these are called free particles). This implies particle motions along straight lines at constant speeds (asymptotic to the actual motion at infinite past for incoming particles and at infinite future for outgoing particles) in classical mechanics, and wave functions (or vectors in a Hilbert space) obeying the Schro- dinger equation with a free Hamiltonian Ho in quantum mechanics and more or less the same in quantum field theory. A wave function for n particles is an L 2 - function of their momenta PI' ..., Pn (each Pj being a 3-dimensional vector) with respect to an appropriate measure (normally the Le- besgue measure dJ1(p) = IT d 3 Pj in quantum mechanics and the Lorentz-invariant measure dJ1(p) = IT {(m 2 + p}j-1/2d 3 Pj} in quantum field 1434 theory, where m is the mass of the particle), and the S-matrix S is described in terms of the S-matrix elements (PI, ..., PnISlp'l, "', p.) (which is a distribution) as (<I>,S'P)= f <I>(p)( pISlp')'P(p')dJ1(pldJ1(p'), where <I> and 'P are wave functions for nand n' particles, P =(PI, "', Pn), and similarly for p'. (pISlp') gives quantities measured in scattering experiments, as will be explained in Section B (2). In quantum mechanics, the free and actual (interacting) motion of particles is described in the same Hilbert space with free and interact- ing Hamiltonians Ho and H. A vector <I> in interacting motion behaves like a vector ({J in free motion at infinite past if II(exp[ -iHtJ)<I>-(exp[ -iHotJ)({JI!->O as t->-CX) and hence <1>= W_(H; HO)({J, W_(H; Ho)= lim eiH'e-iHo'. t-J. - oc Such a <I> is often written as <1>= <l>in(({J) = f <l>in(p)({J(p)dJ1(p), and is called an in-state. A definition for an out-state <l>0u, is obtained by changing t->-CX) to t-> +00 and W_ to W+. The S-matrix ele- ment is defined (as a distribution) by (pi Sip') = (<I>0ut(p), <l>in(p')). The existence and properties of Wi (called twave operators) are central subjects in scat- tering theory ( 375 Scattering Theory). In quantum field theory, the asymptotic description is given in terms of vectors in tFock space, and the in- and out-states are constructed in terms of tasymptotic fields ( 150 Field Theory). The foregoing description actually applies only to a system of one-component particles of the same kind. More generally. additional (discrete) variables, say ()(, are needed to dis- tinguish different kinds of particles and differ- ent spin components of each kind of particle, and the p's appearing in the above formulas should be replaced by (p, ()()'s, along with re- lated changes in the measure. Even in a quantum mechanics of many identical particles a bound state, if it exists, is to be treated as another particle (different from the original one) and should be distinguished by :x's in the asymptotic description cp. If the interaction is of1ong range (e.g., 
1435 Coulomb interaction), the classical path of a particle does not have an asymptote in gen- eral, and correspondingly the wave operators W:t do not exist for the usual free Hamil- tonian. Still, an asymptotic description of scattering is possible in some cases. In the presence of massless particles, such as photons in quantum field theory, another difficulty, called the infrared problem, can arise in the asymptotic description of scattering because the scattered particle may be accom- panied by an infinite number of massless par- ticles (with very small energy). In such a situ- ation, a representation of a free massless field not equivalent to the standard Fock represen- tation is believed to be a possible candidate for the asymptotic description of scattering. B. Basic Properties (1) Invariance. Let £0 be the Hilbert space for the asymptotic description of scattering, such as the space of L 2 -functions lp(P1' ..., Pn) rela- tive to the measure djJ(p). The S-matrix is an operator on £0 whose matrix element is as described above. (The corresponding operator S in the Hilbert space £ describing the inter- acting states is defined by S<I>°UI(lp)= <l>in(lp) and is sometimes called an S-operator.) S is said to be invariant under a group G of transformations of the P's (and possibly the IX's) if (U(g)lp)(p) = lp(g-1 p) defines a continuous unitary representation U(g) and if U(g)S = SU(g) for all gEG. First, S is usually invari- ant under time translation. In the quantum mechanics of a particle scattered by a rotation- ally invariant potential, S is invariant under the group of rotations of 3-dimensional vec- tors p; in the quantum mechanics of many particles (mutually interacting through central potentials), S is invariant under the 3-dimen- sional Euclidean group of transformations p--+Rp+a with rotation R; in relativistic field theory, S is assumed to be invariant under the tinhomogeneous Lorentz group oftransfor- mations p--+Ap+a with p=(pO, p) and homo- geneous Lorentz transformation A. In all these examples, G is of type I, i.e., there is a direct integral decomposition £0 = f £(k)@ .!l'(k)dv(k), U(g)= f U k (g)@I2'(k)dv(k), into irreducible representations Uk on £(k), which are mutually inequivalent, where .!l'(k) is some Hilbert space for each k. The S-matrix is 386B S-Matrices invariant under G if and only if S = f I ff (k) @ S(k)dv(k). When a (scalar) particle is scattered by a central potential, irreducible representations are labeled by the energy E(p) = p2 j(2m) (for time translation) and the angular momentum 1 (for rotations) with dim .!l'(E(p), I) = 1. There- fore each S(k)=SI(lpl) is a number. For any given energy, 1 = 0, 1,2, ... are referred to as the S-wave, P-wave, D-wave,... or generally as partial waves. For relativistic scalar particles, irreducible representations (with positive energy and nonzero real mass) are labeled by the center- of-mass energy squared S (= ((m2 + pt)I/2)2_ (pl) and the total angular momentum I. If S is below the threshold (3m)2 of 3-particle scattering, dim.!l'(k)= 1 and S(k)=SI(S) is a number. (2) Unitarity. Slp is supposed to represent the t= +00 asymptotic (free) behavior of the state that initially (i.e., at t= -(0) behaves like a free state lp. If there is no loss of probability in the description of scattering, the S-matrix is isometric. If all asymptotic configurations are realized as a result of scattering, the S- matrix must also be unitary. The mappings out out <l>in : lp E £0 --+ <l>in (lp) E £ (W" in q uan turn mechanics) are proved to be isometric under a general assumption. The unitarity is then proved in potential scattering (under some conditions on the potential) by showing that the two wave operators W" have the same range. In fact, a somewhat stronger result- that this range is the same as the absolutely continuous spectral subspace for the interact- ing Hamiltonian H-is usually proved and is called completeness (of scattering states in the absolutely continuous spectral subspace of H). If the mappings <l>0UI and <l>in (or W:t) are iso- metric and have the same range, then <l>in(lp)= <l>0UI(Slp), which shows that the state behaving like lp at t = -00 behaves like Slp at t = +00. In the simple multiplicity cases such as SI(lpl) and SI(S) above (which correspond to the physical situation of purely elastic scatter- ing without any production or change of par- ticles), these numbers must be of the form e 2id , due to unitarity, where the real number (jl is called the phase shift. In terms of the phase shift, the differential cross section d(Jjdil, which is the average number of particles scattered per unit time per unit solid angle around the direc- tion forming an angle () with the incident uniform parallel beam of unit intensity (one particle per unit time per unit area) when 
386B S-Matrices viewed in the center-of-mass system, and the total elastic cross section rT el = S (da/do.)dO. are given by the following formulas, called the partial wave expansion: da/do. = If(s, 0)1 2 , x f(s,0)=k- 1 I (21+ 1)'/;p/(cosO), .t; = sin6/e i ." /o CT c / =4nk- 2 I (21 + 1 )sin 2 6b / where dO. = sin () dO d({J (invariant measure on a 2-dimensional sphere S2) and k is the wave number of the particle in the center-of-mass system (k=h- 1 Ip/ = h-] [(s/4)_m 2 )112). The function f is called the scattering amplitude. The forward scattering amplitude /(s, 0) is related to the total cross section CT!O! by the optical theorem: rT t01 =4nk -1 Imf(s, 0), which follows from the unitarity of the S- matrix. Here the total cross section is expressed as the area of the transverse cross section of a classical (impenetrable) scatterer that would scatter the same amount of particles. In a purely elastic region, CT 10t = CT cb and the optical theorem is the same as the assertion Im.t;=1.t;1 2 . Even for values of s above the threshold of inelastic scattering, the restriction of So to the subspace of two particles is again described by numbers e 2iO " where 6/ is now complex, and the same formulas for the differential and total elastic cross sections hold, except that uni- tarity of So now implies Imt;? I t;1 2 . (3) TCP Symmetry. In the framework of either taxiomatic quantum field theory or the ttheory of local observables, the TCP theorem (or PCT theorem) shows [1J that to every particle there corresponds another particle with the same mass and spin, which is called the anti- particle and can be the original particle itself, such that any particle is the antiparticle of its antiparticle, and the following relation, called TCP invariance (or PCT invariance [2)) holds for S-matrix elements: (p,iXISlp',iX') = '1(iX)'1(iX')i F (»-F(a')(p', 0oiX'ISlp, BoiX). Here F(iX) and F(iX') are the number of particles with half udd integer spin amongst the incom- ing and outgoing particles (i.e., in iX and IX'), '1(iX) and '1(iX') are the product of ::!: 1 assigned to each particle in iX and iX', respectively, with the assignment to a particle and its antiparticle the same for bosons and opposite for fermions, (p, IX) is an abbreviation for an ordered n-tuple of energy-momentum 4-vectors Pi and other indices iX i (for spin components and particle 1436 species), i = 1,..., n, and similarly for (p', IX'), and 0 0 is the mapping from particles to their anti- particles (without changing spin indices). We may view the mapping !p, iX)->'1(iX)i F (»\p, lJoiX) as an antiunitary operator 0 0 on Yfo satisfying 0 0 S0 O' ] = S* so that (({J1, SO({J2) =(0 0 ({J2, So 0 0 ({Jd. This is related to the tTCP operator 0 in quantum field theory by 0\}1°u'( ((J) = \}Ii"(0 0 ({J) and 0\}1in(({J)= \}IoUl(0 0 ({J). The name TCP comes from the combinatio'n ofT for time reversal (incoming <=z outgoing. l' == p/m-> -V), C for charge conjugation (particle <=z anti- particle), and P for parity, which is a quan- tum number for space inversion (p-> - p). TCP symmetry was suggested to G. Liiders (Dansk. Mat. Fys. Medd., 1954) by B. Zumino in the form that P-invariance implies TC sym- metry. W. Pauli (Niels Bohr and the develop- ment of physics, Pergamon 1955, 30) realized that TCP is a symmetry. R. Jost (Helv. Phys. Acta, 1957) gave its proof in the framework of axiomatic quantum field theory. (4) Crossing Symmetry. In the framework of either axiomatic quantum field theory or the theory of local observables, it has been shown by J. Bros, H. Epstein, and V. Glaser (Comm. Math. Phys., 1 (1965); also [3)) that there exist analytic functions H(k, IX) of complex variables k = (k], ..., k4-) in a certain domain D such that each k j is a complex 4-vector, the variables are on the mass-shell manifold defined by kJ = mJ (k; is in the Minkowski metric) and L k j = 0, and the boundary value of H(k,iX) as k j ap- proaches f.jPj from Ims>O (s being the square of the sum of two k's for incoming particles) in D is the scattering amplitude for the following processes involving the particles Aj and their antiparticles  (j = 1, ...,4) with 4-momenta Pj' Gj= 1 for Aj and -1 for  (some of the A's and A's may coincide): (i) Al +A2->A3+A, (i') A3+ A 4-->A1 +A2' (ii) A] +A3->A2 + A4-, (ii') A2+A4-->A1 +A3' (iii) AI +A4-->A2+A3' (iii') A 2 +A3->A1 +A4-' Any pair of relations taken from (i)-(iii) con- stitutes a crossing symmetry. (5) High Energy Theorems. M. Froissart (Phys. Rev., 123 (1961)) obtained from the Mandel- starn representation the following bound for the forward scattering amplitude (called the 
1437 Froissart bound) at large s: IF(s,O)1 < (const)s(logs)2, F(s,t)kf(s,8), t=2k 2 (cos8-1). A. Martin has obtained such a bound in the axiomatic framework. As a consequence we have the Froissart-Martin bound: a tot <4nR- 1 (1 +E)2(1og(sj s O))2, where.E > 0 is arbitrary, R can be taken to be 4m (m n is the mass of a pion) for many cases, such as nn, nK, nN, and nA scattering, and So is an unknown constant. Many other upper and lower bounds for scattering amplitudes have been obtained under other assumptions [4, 5]. I. Ya. Pomeranchuk (Sov. Phys. JETP, 7 (1958)) suggested the asymptotic coin- cidence of total cross sections at high energy for scattering of AB and AE, where E is the antiparticle of B. This Pomeraochuk theo- rem has been shown to hold by Martin (Nuovo Cimento, 39 (1965)) by using the ana- lyticity derived in the axiomatic framework under the fo11owing sufficient condition: The existence of limsoo [a(AB) - a(AE)] and limsoo(s log s) -1 f(s, 0) = o. C. S-Matrix Approach (1) History. A11 the information needed to understand the experimental e1ementary- particle scattering data seems to be expressible by S-matrix elements. It was therefore natural to try to develop a foundation (and practical methods of calculation) for the theory of ele- mentary particles on the basis of some gen- eral properties of the S-matrix, especia11y when other approaches, such as quantum field theory, faced difficulties. W. Heisenberg (2. Phys., 120 (1943) first pointed out the possi- bility of such an approach soon after the intro- duction of the S-matrix by 1. A. Wheeler (phys. Rev., 52 (1937»). Unfortunately, in the early 1940s not much dynamical content could be given to such an S-matrix-theoretic approach because only unitarity and some invariance properties, such as tLorentz in variance, were used to characterize the S-matrix. In the late 1950s, through the study of the analyticity of the S-matrix in connection with dispersion relations in quantum field theory, it became evident that causality is another important determinant of S-matrix structure. In practice, causality in position space is used in the form of the analyticity of the S-matrix elements in the energy-momentum space (variables dual to space-time positions in the Fourier transform). Subsequently it was realized that analyticity combined with unitarity gives surprisingly 386C S-Matrices strong control over the structure of the S- matrix (G. F. Chew [6]; H. P. Stapp, Phys. Rev., 125 (1962); 1. C. Polkinghorne, Nuovo Cimento, 23 and 25 (1962); 1. Gunson, J. Math. Phys., 6 (1965); D. I. Olive, Phys. Rev., 135B (1964); Chew [7]; R. J. Eden et al. [8]). The study of the S-matrix based on its general properties, such as invariance, unitarity, and analyticity, is ca11ed S-matrix theory. In the present form, it is adapted only to massive particles with short-range interactions, and its applicability is believed to be limited to strongly interacting systems. (2) Landau-Nakanishi Variety. C. Chandler and Stapp (J. Math. Phys., 10 (1969)) and D. Iagolnitzer and Stapp (Comm. Math. Phys., 14 (1969» clarified the analytic structure ofthe S- matrix in terms of Landau equations ( 146 Feynman Integrals) based on the important physical idea of macroscopic causality, which gives much more precise infonnation in the physical region than a superficial application of tlocality (also ca11ed microcausality) in axiomatic quantum field theory, though it is possible that a detailed study starting from microcausality and incorporating tasymptotic completeness (the so-ca11ed nonlinear program in axiomatic quantum field theory) might eventua11y entail the macroscopic causality condition (e.g., 1. Bros, in [9]; Iagolnitzer [10, ch. IV]; also K. Symanzik, J. Math. Phys., 1 (1960)). (3) Microlocal Analysis. An important fact is that the normal analytic structure of the S- matrix discussed by Iagolnitzer and Stapp essentia11y coincides with the notion of analy- ticity in micro local analysis, i.e., with micro- analyticity ( 274 Microlocal Analysis; F. Pham and Iagolnitzer, Lecture notes in math. 449, Springer, 1975; M. Sato, Lecture notes in phys. 39, Springer, 1975)). In a word, the tLandau equations have acquired a new inter- pretation in the description of the micro- analytic structure of the S-matrix. In the new developments, the Landau equations define a variety in the cotangent bundle (over the mass- she11 manifold in momentum space), and the tsingularity spectrum of S-matrix elements is assumed to be confined to UGSf+(G) (except for the so-ca11ed ..#to-points), where Granges over a11 possible Feynman graphs and Sf+(G) denotes the positive-IX Landau-Nakanishi variety associated with G ( 146 Feynman Integrals). The union UGSf+(G) is known to be 10ca11y finite and hence makes sense (Stapp, J. Math. Phys., 8 (1967)). The old interpre- tation of Landau equations, as defining a variety in energy-momentum space, corre- sponds now to considering the variety L +(G) 
386 Ref. S-Matrices obtained by projecting 2'+(G) from the co- tangent bundle to the base manifold (i.e., the mass she1l manifold). The new interpretation of Landau equations led Sato (Lecture notes in phys. 39, Springer, 1975) to make a further intriguing conjecture that the S-matrix would satisfy a special toverdetermined system (a tholonomic system) oft(micro-) differential equations whose tcharacteristic variety is given by the complexification of Landau-Nakanishi varieties. This conjecture is closely related to the monodromy-theoretic approach by T. Regge (publ. Res. Inst. Math. Sci., 12 suppI. (1977) and the references cited therein) and his co-workers. (4) Discontinuity Formula. It has turned out that the above approach is closely related to the so-ca1led discontinuity formula obtained by combining the unitarity and the analyticity of the S-matrix. Actua1ly T. Kawai and Stapp (Publ. Res. Inst. Math. SCi., 12 suppI. (1977)) have shown that Sato's conjecture can be verified at several physica1ly important points on the basis of the discontinuity formula. The discontinuity formula was first found by R. E. Cutkosky (J. Math. Phys., 1 (1960)) for Feyn- man integrals. It describes the ramification property of the integral around its singularities ( 146 Feynman Integrals). An analogous formula has been shown to be valid also for the S-matrix, and it demonstrates how strict are the constraints derived from unitarity and analyticity (Eden et aI. [8, ch. 4J; M. J. D. Bloxham et aI., J. Math. Phys., 10 (1969); 1. Coster and Stapp, J. Math. Phys., 10 (1969); also Stapp in [llJ and Iagolnitzer [1OJ). Note, however, that the derivation of the hitherto- known discontinuity uses either some ad hoc assumptions or some heuristic reasoning which is not rigorous or sometimes is even erroneous from the mathematical viewpoint. Efforts to give a rigorous proof are stil1 being made, and these present several mathemati- ca1ly interesting problems (e.g., Iagolnitzer in [9J and M. Kashiwara and Kawai in [9J). (5) Regge Poles. The results stated so far con- cerning the analyticity of the S-matrix have been primarily derived in the low-energy region. It is commonly hoped that these results can be related to its high-energy behavior through the inner consistency of S-matrix theory, even though it is sti1l unclear to what extent such a relationship can be developed. Such a hope was advocated by Chew, who had been inspired by the results of Regge (Nuovo Cimento, 14 (1959); 18 (1960)) for potential scattering. After being adapted to the relativ- istic case, Regge's idea took the f01l0wing form: Consider the scattering of two incoming 1438 and two outgoing scalar particles with equal mass m> O. Let fz(s) be the partial scattering amplitude defined earlier. Regge introduced the idea of extending the function fz(s) to an analytic function f(l, s) (l E C) and of applying the Sommerfeld-Watson transformation in order to replace the partial wave expansion by the integral j=1j2 t (21+ l)f(l,s)P,(-cos{])dljsinrrl =F(s,t) for a certain contour C in the complex I-plane which encircles {O, 1,2,...}. If f(l, sl is mero- morphic in Re l> -lj2 and if it tends to zero sufficiently rapidly at infinity, then one can change the contour C so that, with the help of Cauchy's integral formula, F - constant. tats), rx(s)=maxRel(s), where the maximum is taken over a1l the poles of f(l, s). Thus the poles of the extended function f(l, s) determine the asymptotic behavior of F as t -> 00 (Regge behavior) under the assumption that f(l, s) can be chosen to satisfy suitable analyticity and growth order conditions. These poles are ca1led Regge poles. Even though meromorphy conditions are found to be satisfied for scatter- ing by a (Yukawa) potential, they do not seem to be satisfied for the fu1l S-matrix in the relati- vistic case. More general cases than those discussed here, i.e., the cases where more vari- ables are considered, are also being studied but without fu1l success at the moment. For details and references  [7, 12, 13]. (6) Veneziano Model. In connection with Regge-pole theory, we note an interesting observation by G. Veneziano (Nuovo Cimenta, 57A (1968)) to the effect that 1(1- rx(s))1(l- rx(t))jr(1- rx(s) - rx(t)), with rx(s) being linear in s, satisfies a crossing symmetry (in sand t) and shows an exact Regge-pole behavior. Although the many results that have been obtained give rise to a hope of constructing a realistic model of the S-matrix starting from the aforementioned function, no one has yet succeeded [14]. A more promising approach is the topological expansion procedure in which the first term of the expansion apparently shares with the potential-scattering functions the property of having only poles in the com- plex I-plane, along with several other physi- ca1ly important properties of Veneziano's function. References [IJ H. Epstein, CPT invariance of the S- matrix in a theory of local observables, 1. Math. Phys., 8 (1967), 750-767. 
1439 [2] R. F. Streater and A. S. Wightman, PCT, spin and statistics, and all that, Benjamin, 1964. [3] H. Epstein, in Axiomatic field theory, M. Chretien and S. Deser (eds.), Gordon & Breach, 1966. [4] A. Martin and F. Cheung, Analyticity property and bounds of the scattering ampli- tudes, Gordon & Breach, 1970. [5] A. Martin, Scattering theory in unitarity, analyticity and crossing, Lecture notes in phys. 3, Springer, 1969. [6] G. F. Chew, S-matrix theory of strong interaction physics, Benjamin, 1962. [7] G. F. Chew, The analytic S-matrix: A basis for nuclear democracy, Benjamin, 1966. [8] R. J. Eden, P. V. Landshoff, D. I. Olive, and J. C. Polkinghorne, The analytic S-matrix, Cambridge Univ. Press, 1966. [9] D. Iagolnitzer (ed.), Proc. Les Houches, Lecture notes in phys. 126, Springer, 1980. [10] D. Iagolnitzer, The S-Matrix, North- Holland, 1978. [11] R. Balian, and D. Iagolnitzer (ed.), Struc- tural analysis of collision amplitudes, North- Holland, 1976. [12] S. Frautschi, Regge poles and S-matrix theory, Benjamin, 1963. [13] P. O. B. Collins, An introduction to Regge theory and high energy physics, Cam- bridge Univ. Press, 1977. [14] M. Jacob, Dual theory, North-Holland, 1974. 387 (XX.34) Solitons A. General Remarks Solitons are nonlinear waves that preserve their shape under interaction. Mathematically, the theory of solitons continues to develop as a theory of completely integrable mechanical systems. Typical examples are the Korteweg- de Vries equation ( Section B), the Toda lattice ( 287 Nonlinear Lattice Dynamics), and the Sine-Gordon equation u,,-uxx=sin u, studied classically in connection with trans- formations of surfaces of constant negative curvature. B. The KdV Equation In the late 19th century J. Boussinesq and then D. Korteweg and G. de Vries obtained equa- 387C Solitons tions describing water waves having traveling- wave solutions. The equation u,-6uu x +u xx =0, u=u(x,t), (1) derived by de Vries is called the KdVequation for short. Putting u(x, t) = s(x - ct - b), we find that s is an telliptic function, and we obtain s= -sech2 ( f x) as its degenerate form. This solution is called a solitary wave. Around 1965, M. Kruskal and N. Zabusky solved the KdV equation numerically, taking several separated solitary waves as the initial data. They found that the waves interact in a complicated way but that eventually the initial solitary waves reappear. Noting the particle- like character of the waves, they called each of these waves a soliton. Subsequently, the KdV equation was found to have an infinite number of constants of motion. C. Gardner, J. Greene, Kruskal, and R. Miura associated the I-dimensional tSchrodin- ger operator - d 2 jdx 2 + u(x, t) to each solution u(x, t) of the KdV equation and showed that its teigenvalues are preserved in time. More- over, they applied inverse scattering theory ( Section D) and obtained explicit formulas for the solutions. C. Lax Representation Let L= -D 2 +u(x), D=djdx. For M= -4D 3 +6uD+3u x ' the commutator [M, L] =ML-LM is the operator of multiplication by the function 6uu x - U xxx ' So u = u(x, t) is a solution of the KdV equation if and only if L,=[M,L]. (2) Equation (2) is also the condition that all L= L(t) are tunitariIy equivalent to each other and the tspectrum of L is preserved through time. Most equations having soliton solutions can be represented in the form of (2) for a suitable pair of Land M. This representation is called the Lax representation. One sometimes says that an isospectral deformation of L is given by (2). On the other hand, isomonodromic defor- mations ( 253 Linear Ordinary Differential Equations (Global Theory)) have been studied extensively by M. Sato and his co-workers, and relations to soliton theory have been 
3870 Solitons discovered (Sato, T. Miwa, and M. limbo, Pub/. Res. Inst. Math. Sci., 14 (1978),15 (1979)). In the present case, the requirement that the commutator [M, LJ be a multiplication by a function determines an essentially unique (2n + 1 )th-order ordinary differential operator M = An' the differential operator part of the +fractional power L n +1/2. [An, LJ is a poly- nomial in u and its derivatives, denoted by Kn[u]. The equation u,= Kn[uJ is called the nth KdV equation. The transformation taking the initial data u(x) to the solution unIx, t) of the nth KdV equation is denoted by 7;.(t). Then 7;.(t) Trn(s) = Trn(s) 7;.(t), i.e., the flows de- fined by these higher-order KdV equations commute. This property and the existence of infinite number of invariant integrals are con- sequences of the complete integrability of the higher-order KdV equations considered as infinite-dimensional Hamiltonian systems. The KdV equations can be studied group- theoretically as Hamiltonian systems on a certain coadjoint orbit in the +dual space of the tLie algebra of a certain class of tpseudo- differential operators (M. Adler, Inventiones Math., 50 (1979); also  B. Kostant, Advances in Math., 34 (1979) for the analogous facts for the Toda lattice). The ordinary differential equation Kn [uJ = ° is called a stationary KdV equation. By the commutativity of the flows 7;.(t), each KdV flow leaves invariant the space of solutions of Kn[uJ =0. The flows restricted to this space form a completely integrable Hamiltonian system with finiteJy many degrees of free- dom (S. P. Novikov, Functional Anal. Appl., 8 (1974)). Kn[uJ =0 is also the condition that there exist an ordinary differential operator M which commutes with L. 1. Burchnal and T. Chaundy (Proc. London Math. Soc., (2) 21 (1922)) studied this problem and showed that such Land M are connected by the relation M 2 = P(L), where P is a certain polynomial of degree 2n + 1 and that the potential u is ex- pressed by the +theta function associated with the +hyperelliptic curve w 2 = P(z). D. Inverse Scattering Method Let u(x) be a potential such that u(x)-->O as X--> ::I::x. The equation L[=eI(Im(O) has solutions (t(x, () that can be represented as f -t(x, ()= ei:"x(1 + J;;" K i:(x, tier 2i(' dt). Putting ( = c: + iYf and noting that 1+ (x,) and f+(x, - c:) are independent solutions of L[ = 2 f, one can express f- as f-(x, )= a(){+(x, - )+h()f+(x, ). 1440 The coefficient a() can be continued analyti- cally to the upper half-plane, where it has only a finite num ber of zeros, aB of which are sim- ple and lie on the imaginary axis. Denote them by iYfj (j = 1, "', n). The tpoint spectrum of the operator L consists of the numbers -I]; , and the associated eigenfunctions are f+(x, irlj), which are real-valued. Put Cj =(J [+(x, iYf)2 dx) -1, and caB t() = a() -1 and r()= b()/a(c:) the transmission coefficient and the reflection coefficient, respectively. The triplet r(), I]j' c j (i = 1, " . , n) is called the scattering data. T t is connected with the kernel K = K t by the Gel'fand-Levitan-Marchenko equation K(x,t)+F(x+t)+ Ie>"' F(x+t+s)K(x,s)ds (t>O), F(x)=n- 1 f- r()e2iXdX+2j c j e- 2 "J x . The potential is given by u(x)= -(cK/ox)(x,O). When the reflection coefficient r() vanishes identically, the potential is called a reflection- less potential. The kernel K then becomes a tdegenerate kernel and the potential is ex- pressed by d 2 u(x)= -2------zlogD(x), dx where D(x) is the determinant of the n x n matrix whosej, k entry is bjk+cjexp{ -(Yfj+ Ilk) x }/(Yfj+ lId. The authors of [IJ showed that if u(x, t) is a solution of the KdV equation and if u(x, t)-->O (x--> ::J: x), then the time development of the scattering data of the potential u(x, t) is as follows: nand Yfj do not depend on t, and cj(t) = c j e 8 "]t, r(, t) = r()e8i3t. The solution associated with the reflectionless potentials are obtained by replacing Cj by cN) in the formula for D(x). These are soliton solutions of the KdV equation. R. Hirota developed a method of treating functions like D(x) directly for most of the equations in the soliton theory (Lecture notes in math. 515, Springer, 1976). A certain geometric method that enables one to obtain solutions of the Sine-Gordon equation from a known solution has been studied in the transformation theory of sur- faces of constant negative curvature (G. Dar- boux, Le(ons sur ta theorie generate des sur- faces, Chelsea, 1972, vol. 3, ch. 12), and its generalizations are called Backlund transfor- 
1441 mations. Soliton solutions of the KdV equa- tion can also be constructed by this method. Relations to the inverse scattering method, differential systems, and transformation groups have also been studied (Lecture notes in math. 515, Springer, 1976). E. Periodic Problem Let the potential u(x) be of period I, and con- sider Hill's equation ( 268 Mathieu Func- tions E) Lf=)f The real (ic-) axis is divided into intervals of unstable solutions (u-intervals) alternating with intervals of stable solutions. One of the u-intervals is of the form (-ooJo] and the others are finite, possibly degenerating to points. The special potentials that have only a finite number of nondegenerate u-intervals are the periodic analog of the reflectionless potentials, and are called the finite-gap (or -band) potentials. Consider the eigenvalue problem L{ = )l, {( T) = f( T + I) = 0 for a fixed real parameter T. Exactly one of its eigenvalues belongs to each finite u-interval. These eigenvalues move with " but those in degenerate intervals cannot move. Let u be a finite-gap potential, lj = [}'2j-l' )'2J (j = 1, ... ,g) be the nondegenerate (finite) u-intervals, and 14,) be the associated eigen- values in Ij' The potential is recovered by the formula 2g 9 u(x)= I ))-2I 14 x ). jO j=l Put P(ic)=II}J o (}'-}'), realize the tRiemann surface S of the hyperelliptic curve w 2 = P(z) as a two-sheeted cover of the tRiemann sphere, and consider the 14,) as points on S. Let w j (j = 1, ... ,g) be a basis for the space of the tdifferentials of the first kind on S. Fix a point Po in S and put 9 f "d T ) Wj(T)=  w j k-l Po (j= 1, ...,g) ( 3 Abelian Varieties L). Then the locus (wd,), ..., W q (,)) ( -00 <, < (0) on the tJacobian variety turns out to be a straight line, the direction V x being determined by the tperiods of certain tdifferentials of the second kind on S. Employing the solution of tJacobi's inverse problem, we can write the potential in terms of the tRiemann theta function as d 2 u(x)= -2-,logO(xv x +c)+C, dx- where c is a certain constant vector and C is a constant. Suppose now that u(x, t) is a solution of the KdV equation which is a finite-gap potential 387 F Solitons for each t. Then 9 and }'j are preserved in time. The determination of the direction v, on the Jacobian variety is similar to that of v x ' and the solutions of the KdV equation are ob- tained by replacing the vector c by tv, + c' in (3). The case 9 = 1 is the elliptic traveling wave solution ( Section B). Most of these results have been extended to the general periodic problem (H. P. McKean and E. Trubowitz, Comm. Pure Appl. Math., 29 (1976)). F. Two-Dimensional KdV Equation Let S be a compact tRiemann surface of tgenus g, and let Po; be a fixed point on S. Put F(K) = anK"+... +a o and G(K)=hmK m +... +ho. A function 1/1 (x, y, t, p) of pES and of x, y, t is uI1iquely determined by the following con- ditions: (a) it is meromorphic on S - {Poo}' and its pole tdivisor is a general divisor of degree 9 and does not depend on x, y, t; (b) for a tlocal parameter z at Px, (z(PCX") = 0) and K=Z-\ 1/10= l/1exp( - KX - F(K)y-G(K)t) is holomorphic near Pac, and 1/10 (p",) = 1. Moreover, there is a differential operator L = anD" + a n - 1 D"-l + 'L;g uj(x, y, t)Dj such that l/1y = LI/1. Expanding 1/10 at Poo as 1 + 'Ljl j(x, y, t)z< one can express the coeffi- cients u j by  l' 2' .... Analogously, there is an M = hmDm + h m - 1 Dm-l + 'Lj=-02 vjDJ such that 1/1, = MI/1. The operators Land M satisfy the relation L,- My=[L, M], (4) which is a generalization of the Lax represen- tation. The coefficients of Land M satisfy a certain system of nonlinear differential equa- tions (V. E. Zakharov and A. B. Shabat, Func- tional Anal. Appl., 8 (1974); I. M. Krichever, Functional Anal. Appl., 11 (1977)). Example. Let F(K) = K 2 and G(K) = K 3 + CK. Then one finds L=D2 + u, M =D 3 +(3uj2+c)D+ v, where u= -2; and V=31; -31-32' Eliminating v from (4), one has (3) 3u y }. + ( - 4u, + 4cu x + u xxx + 6uu x )x = 0, the so-called two-dimensional KdV equation (Kadomtsev-Petvyashvili equation). If u(x, y, t) does not depend on y, the equation reduces to the KdV equation, and if u does not depend on t, to the Boussinesq equation. The condition for reduction to these spe- cial cases can be described in terms of the meromorphic functions admitted by S. Sup- pose that there is a meromorphic function EF(p) holomorphic for p:f. Pr and oftprin- cipal part F(K) at p = Pac' Then 1/1 is writ- ten as cp(x, t, p)exp{ EF(p)y}, and the coeffi- 
387 G Solitons cients of Land M do not depend on y. cp satisfies Lcp = Ecp, and (4) reduces to the Lax representation. If such an E F exists, S is hypere11iptic, and PY0 is one of its tbranch points over the Rie- mann sphere. Thus the result of the previous section is recovered. G. Solvable Models in Field Theory The Sine-Gordon equation has been studied extensively as a solvable model in tfield theory. It is a special case of a field in two space-time dimensions with values in a +symmetric space; this can also be treated by a variant of the inverse scattering method (V. E. Zakharov and A. V. Mikhailov, Sov. Phys. J ETP, 47 (1978)). Much work has been done on the semiclass- ical tquantization of equations encountered in soliton theory. Recently, a method of exact quantization (ca11ed quantum inverse scatter- ing) was developed (see, for example, L. A. Takhtadzhyan and L. D. Faddeev, Russian Math. Surveys, 34 (5) (1979)). References [lJ c. S. Gardner, J. M. Greene, M. D. Krus- kal, and R. M. Miura, Method for solving the Korteweg-de Vries equation, Phys. Rev. Lett., 19(1967),1095-1097. [2J P. D. Lax, Integrals of nonlinear equations of evolution and solitary waves, Comm. Pure Appl. Math., 21 (1968),467-490. [3J B. A. Dubrovin, V. B. Matveev, and S. P. Novikov, Nonlinear equations of Korteweg-de Vries type, finite-band operators and Abelian varieties, Russian Math. Surveys 31 (1) (1976), 59-146. (Original in Russian, 1976.) [4J H. P. McKean, Integrable systems and algebraic curves, Lecture notes in math. 755, Springer, 1979, 83-200. [5J Yu. 1. Manin, Algebraic aspects of non- linear differential equations, J. Sov. Math., 11 (1) (1979),1-122. (Original in Russian, 1978.) 388 (XIII.32) Special Functional Equations A. General Remarks The term special functional equations usua11y means functional equations that do not in- volve limit operations. Such functional equa- tions appear in various fields, but there is no systematic method for solving them. Usua11y 1442 they are solved by reduction to functional equations of some standard type. In this article functions are a11 real-valued functions of real variables unless otherwise specified. B. Additive Functional Equations and Related Equations Suppose that we are given an equation f(x + y) = f(x) + f(y). (1) Clearly, f(x) = ex (e a constant) is a solution. If f(x) is continuous, (1) has no other solution (Cauchy). The same conclusion holds under anyone of the following weaker conditions: (i) f(x) is continuous at a point; (ii) f(x) is bounded in a neighborhood of a point; (iii) f(x) is tmeasurable in a neighborhood of a point. However, it was shown by G. Hamel and H. Lebesgue by means of ttransfinite in- duction that equation (1) has infinitely many nonmeasurable solutions. On the other hand, it was proved by A. Ostrowski [2J that if a solution f(x) of equation (1) does not take any value between two distinct numbers for x on a set of positive measure, then f(x) is con- tinuous. This result can be extended to the case where x is a point (x 1, ... ,x n ) of an n- dimensional Euclidean space. In this case, any continuous solution is of the form n f(x) = I CjX j j=l (C j constant). Next, we consider the equation f( x;y )= f(X); f(y) . (2) Any solution of equation (1) satisfies this equa- tion. When x is a point (x 1, '" ,x n ) of an n- dimensional Euclidean space, any continuous solution of (2) is of the form f(x) = L}=l CjX j + Co (C j constant). If a solution f(x) of (2) de- fined on a tconvex set K does not take any value between two distinct numbers for x on a set of positive measure, then f(x) is continuous (M. Hukuhara). Consider the equation g(x + y) = g(x)g(y). (3) If a solution g(x) vanishes at some point , then g(x) == O. Excluding this trivial case, we assume that g(x) never vanishes. Then, putting y = x, we see that g(x) > 0 for a11 x. The substi- tution f(x) = logg(x) then reduces equation (3) to equation (1). Thus we see that any continu- ous solution of (3) is of the form g(x) = exp(ex). Next, we consider the equation g(uv) = g(u)g(v). (4) 
1443 If a solution g(u) vanishes at some  #0, then g(u)=O. Excluding this case, we assume that g(u)#O for u#O. For u, v>O, by the substi- tution x=logu, y =logv, we have an equation of the form (3). On the other hand, putting v = -1, we have g( -u)=g( -1)g(u). Since g2( -1) =g(I)= 1, we see that any continuous solution of (4) is of the form lul c or (sgn u)lul C according as g( -1) = 1 or -1. C. The General Addition Theorem and Related Functional Equations The general addition theorem is: If the equation f(x+ y)=F(f(x),j(y)) has a continuous nonconstant solution {(x) on -00 <x< +00, then f(x) is strictly monotone, and F(u, v) is strictly monotone increasing and continuous with respect to u and u for Cl < u, u < fJ and satisfies Cl < F(u, u) < fJ for Cl < u, u < fJ. There is also a constant e satisfying F(e, e) = e, and the identity F(F(u, u), w) = F(u, F(u, w)) holds for any u, u, w in the interval (Cl, fJ). Con- versely, if F(u, v) is such a function, then (5) has a continuous nonconstant solution on -00 < x < +00. Let f(x) be such a solution. Then any other continuous solution is given by f(ex). When F(u, u) is continuously differentiable, a continuoll\' solution f(x) of (5) can be obtained as a solution of the differential equation f'(x) = Fv(f(x), a)e, e =f'(0), satisfying the initial condition f(O) = a. Consider the equation F(f(x)J(y),f(x+ y))=O. Suppose that F(u, u, w) is a polynomial in u, u, and w. If this equation has a tmeromorphic solution f(x), then f(x) must be a rational function, a rational function of exp ex, or an elliptic function [1, p. 64]. Next, consider the equation {(x+ y)+ f(x- y)=2(f(x)+ f(y)). Any solution continuous on -00 < x < 00 is of the form f(x) = ex 2 . When x is a point (x l' ... ,x n ) of an n-dimensional Euclidean space, any continuous solution of (6) is given by a quadratic form f(x) = L?j CijXiX j . Consider the equation f(x+ y)+ f(x- y)=2f(x)f(y). Any solution continuous on - 00 < x < 00 is of the form f(x) = cosh ex = (e CX + e -CX)/2 or f(x) = cos ex. If f(x) is allowed to take com- plex values, then any continuous solution can be written in the form f(x) = (e bx + e -bX)/2 in terms of a complex number b. A special case 3880 Special Functional Equations is cos x, since b may take purely imaginary values. D. SchrOder's Functional Equation Schroder's functional equation is f(O(x)) = ef(x), (8) (5) where O(x) is a given function and e is a con- stant. A general solution of (8) can be written as f(x) = f1 (x)q>(x), where f1 (x) # 0 is a particu- lar solution of (8) and cp(x) is a general solu- tion of the equation cp(O(x))= cp(x). Suppose that there is a point a such that fJ(a) = a, and fJ(x) and f(x) are both differentiable in a neigh- borhood of x = a. Then we have f'(a) = 0 or fJ'(a)=e. Consider the case where fJ'(a)=e, and suppose that O(x) is twice differentiable at x= a. When lei = IO'(a)1 < 1, define On(x) (n = 0,1,2,...) by Oo(x) = x and On(x) = OWn-r(x)). Then the sequence {(On(x-a)e- n } (n=O, 1, 2, ... ) converges uniformly in a neighborhood of x = a, and its limit function f(x) is a solu- tion of (8). When lei = IO'(a)1 > 1, put O(x)=u. Then we have the equation f(O-I(u))=e- 1 f(u), and the problem reduces to the previous case. The results obtained for equation (8) can be extended to the following system of equations: ,h(OI (x), 02(X), ..., On(x)) = Aj,h(X) + D j ,h-1(X) + gj(x), j= 1,2, ...,n, (9) (6) where the OJ(x) are given functions holomor- phic in a neighborhood of x = 0, OJ(O) = 0, the coefficients of f in the right-hand side of (9) are numbers such that the matrix A = (aiJ (aij =0 except for all =Aj' a j - 1 . j = D j ) is the tJor- dan canonical form of the matrix formed by {(aO)axdxo}, and the gj(x) are polynomials consisting of terms of the form constant x x, X2 .., x:;'" with exponents m 1 , m 2 , "', m n for which )'j= A7' .., A;;'". If 0< j).jl < 1 (j= 1,2, "', n), then we can always choose the coeffi- cients of the polynomials gj(x) so that equa- tion (9) has a solution {,h(x)} holomorphic in a neighborhood of x = O. The same conclusion can be obtained for IAjl> 1 (j= 1, ...,n). Consider Abel's functional equation f(O(x))= f(x) + 1. (10) If we put exp f(x) = cp(x), then we have Schro- der's functional equation (7) cp(O(x)) = ecp(x). Consider the equation f(x + 1) = A(x)f(x). (II) If we put cp(x) = logf(x), then we have a tlinear difference equation of the form cp(x+ l)-cp(x)=logA(x). 
388 Ref. Special Functional Equations References [IJ E. C. Picard, Lec;ons sur quelques equa- tions fonctionne11es, Gauthier-Villars, 1928. [2J A. Ostrowski, Mathematische Misze11en XIV, Uber die Funktionalgleichung der Ex- ponentialfunktion und verwandte Funktional- gleichungenm, Jber. Deutsch. Math. Verein., 38 (1929), 54-62. [3J F. Hausdorff, Mengenlehre, de Gruyter, second edition, 1927, 175; English translation, Set theory, Chelsea, 1962. [4J J. Aczel, Lectures on functional equations and their applications, Academic Press, 1966. (Original in German, 1961.) 389 (XIV.1) Special Functions A. Special Functions The term special functions usua11y refers to the classes of functions listed in (1)-(4) (other terms, such as higher transcendental functions, are sometimes used). (I) The tgamma function and related functions ( 174 Gamma Func- tion); (2) tFresnel's integral, the terror function, the tlogarithmic integral, and other functions that can be expressed as indefinite integrals of elementary functions ( 167 Functions of Confluent Type D); (3) t e J1iptic functions ( 134 E11iptic Functions); (4) solutions oftlinear ordinary differential equations of the second order derived by the method of separation of variables in certain partial differential equa- tions, e.g., tLaplace's equation, in various tcurvilinear coordinates. Recently, new types of special functions, such as tPainlevC's, have been introduced as the solutions of special differential equations. In this article we discuss class (4); for the other classes, see the articles quoted. Class (4) is further divided into the foJ1owing three types, according to the character of the tsin- gular points of the differential equations of which the functions are solutions. Equations with a sma11er number of singular points than those indicated in (1 )-(3) below can be in- tegrated in terms of elementary functions. (I) Special functions of hypergeometric type are solutions of differential equations with three t regu lar singular points on the Riemann sphere. Examples are the thypergeometric function and the tLegendre function. Any function of this type reduces to a hypergeo- metric function through a simple transforma- tion ( 206 Hypergeometric Functions; 393 Spherical Functions). 1444 (2) Special functions of confluent type are solutions of differential equations that are derived from +hypergeometric differential equations by the confluence of two regular singular points, that is, by making one of the regular singular points tend to the other one so that the resulting singularity is an tirregular singular point of class I ( 167 Functions of Confluent Type). Any function of this type can be expressed by means of tWhittaker func- tions, of which many important special func- tions, such as tBessel functions, are special cases ( 39 Bessel Functions). Also, one can reduce to this type the tparabolic cylindrical functions, that is, the solutions of differential equations with only one singular point which is at infinity and is irregular of class 2. (3) Special functions of ellipsoidal type are solutions of differential equations with four or five regular singular points, some of which may be confluent to become irregular singular points. Examples are tLame functions, tMath- ieu functions, and tspheroidal wave functions ( 133 E11ipsoidal Harmonics; 268 Mathieu Functions). In contrast to types (1) and (2), functions of type (3) are difficult to charac- terize by means of tdifference-differential equations and have not been fu11y explored. Sometimes the term special function in the strict sense is not applied to them. To specify the special functions of types (1) and (2), the term classical special functions has been proposed. B. Unified Theories of Special Functions Though many special functions were intro- duced separately to solve practical problems, several unified theories have been proposed. The classification in Section A based on dif- ferential equations may be regarded also as a kind of unified theory. Other examples are: (1) Expression by tHames's extended hyper- geometric function or its extension to the case of several variables by means of a definite inte- gral of the form f «( - ad b ,«( -az)b, '" (( -am)b m (( - z)' d( ( 206 Hypergeometric Functions). (2) A unified theory [14J that includes the gamma function and is based upon Truesde11's difference-differential equation of(z, ex)/oz = F(z, ex+ I). (3) Unification from the standpoint of ex- pansions in terms of tzonal spherical functions of a differential operator (the Laplacian) in- variant under a transitive group of motions on a tsymmetric Riemannian manifold ( 437 
1445 Unitary Representations). With this approach a great variety of formulas can be derived in a unified way [3,4]. References [IJ 1. Dieudonne, Special functions and linear representations of Lie groups, Expository lectures from the CBMS regional conference held at East Carolina Univ., March 1979, Amer. Math. Soc., 1980. [2J B. C. Carlson, Special functions of applied mathematics, Academic Press, 1977. [3J J. D. Talman, Special functions: A group- theoretic approach, Benjamin, 1968. [4J N. Ya. Vilenkin, Special functions and the theory of group representations, Amer. Math. Soc. Trans!. of Math. Monographs 22, 1968. (Original in Russian, 1965.) [5J W. Miller, Jr., Lie theory and special func- tions, Academic Press, 1968. [6J Y. L. Luke. The special functions and their approximations I, II, Academic Press, 1969. [7J S. Moriguti, K. Udagawa, and S. Hitotu- matu, Mathematical formulas, Special func- tions (in Japanese), Iwanami, 1960. [8J E. T. Whittaker and G. N. Watson, A course of modern analysis, Cambridge Univ. Press, 1902; fourth edition, 1958. [9J E. Jahnke and F. Emde, Funktionentafeln mit Formalen und Kurven, Teubner, 1933; bilingual edition, Tables of functions with formulas and curves, Dover, 1945. [1 OJ W. Magnus and F. Oberhettiner, For- mulas and theorems for the special functions of mathematical physics, Springer, third en- larged edition, 1966. [11J R. Courant and D. Hilbert, Methods of mathematical physics I, II, Interscience, 1953, 1962. [12J A. Erdelyi (ed.), Higher transcendental functions I-III, McGraw-Hill, 1953-1955. [13J J. Meixner, Spezielle Funktionen der mathematischen Physik, Handbuch der Physik II, Springer, second edition, 1933. [14J C. A. Truesdell, An essay toward a uni- fied theory of special functions based upon the functional equation of(z, rx)/oz = F(z, rx + 1), Ann. Math. Studies, Princeton Univ. Press, 1948. [15J I. N. Sneddon, Special functions of math- ematical physics and chemistry, Oliver & Boyd, 1956. [16J F. W. Schiifke, Einfiihrung in die Theorie der spezielle Funktionen der mathematischen Physik, Springer, 1963. [17J G. Szego, Orthogonal polynomials, Amer. Math. Soc. Colloq. Pub!., revised edi- tion, 1959. [18J E. D. Rainville, Special functions, Mac- miHan, 1960; Chelsea, second edition, 1971. 390 A Spectral Analysis of Operators 390 (XII.12) Spectral Analysis of Operators A. General Remarks Throughout this article, X stands for a tcom- pIe x linear space and A for a tlinear operator in X. Except when X is finite-dimensional, A need not be defined over all X. A linear operator A in X is by definition a linear map- ping whose tdomain D(A) and trange R(A) are linear subspaces of X. A complex number I. is said to be an eigenvalue (proper value or char- acteristic value) of A if there exists an xED(A) such that Ax=h, x#O. Any such x is called an eigenvector (eigenelement, proper vector, characteristic vector) associated with I.. When X is a tfunction space, the word eigenfunction is also used. For an eigenvalue I. of A, the subspace M(/,) of X given by M().) = M(ic;A)= {xl Ax=icx}, i.e., the subspace consisting of 0 and all eigen- vectors associated with )., is called the eigen- space associated with Ie, and the number m(ic) = dim M (i,) is called the geometric multiplicity of A. The eigenvalue Ie is said to be (geometri- cally) simple or degenerate according as m(ic) = I or m(Ie)? 2. The problem of seeking eigen- values and eigenvectors is referred to as the eigenvalue problem. When X is a ttopologicallinear space, the notion of eigenvalues leads to a more general object called the spectrum of A. Let I, be a complex number and put A). = 1.1 - A, where I is the tidentity operator in X. Furthermore, put R). =(A)r 1 =(iJ- A)-I, if the inverse exists. Then the resolvent set p(A) of A is de- fined to be the set of all Ie such that R). exists, has domain tdense in X, and is continuous. The spectrum a(A) of A is, by definition, the complement of p(A) in the complex plane, and it is divided into three mutually disjoint sets: the point spectrum ap(A), the continuous spec- trum ac(A), and the residual spectrum aR(A). These are defined as follows: ap(A) = PI R). does not exist} = {.Ie I ic is an eigenvalue of A}; ac(A) = {Ie I R) exists and has domain dense in X, but is not continuous}; aR(A) = {I. I R). exists, but its domain is not dense in X}. Let X be a tBanach space and B(X) the set of all tbounded linear operators with domain X. If A is a tclosed operator in X, then J.E p(A) if and only if R)EB(X). Moreover, a(A) is a closed set. In particular, if A E B(X), then a(A) is a nonempty compact set. In this case the spectral radius ra(A) is defined as rAA) = SUP).EaIA) 14 Then ra(A)":; II A"ll l !", n = 1, 2,..., and IIAnll'!n-->ra(A), n--> Xi. 
390B Spectral Analysis of Operators In many problems of analysis crucial roles have been played by methods involving the spectrum and other related concepts. This branch of analysis is called spectral analysis. For an infinite-dimensional X the theory is well developed when X is a tHilbert space and A is tself-adjoint or tnormal. B. Eigenvalue Problems for Matrices Throughout this section let X be an N- dimensional complex linear space (N < 00) and A a linear operator in X. (We assume that A is defined over all X.) With respect to a fixed basis (I/Jh ... , I/JN) of X, the operator A is repre- sented by an N x N matrix, also denoted by A. Then the eigenvalues of A coincide with the roots of the tcharacteristic equation det(icI- A)=O. There are no points of the spectrum other than eigenvalues, that is O"(A) = O"p(A). Let i'EO"(A). The multiplicity m(i,) of i. as a root of the characteristic equation is called the (algebraic) multiplicity of the eigenvalue i.. The sum of m(A) over all the eigenvalues of A is equal to N. The eigenvalue A is said to be (algebraically) simple or degenerate according as m(ic) = lor m(ic)?2. Let v= 1,2,..., and NAA) = {xl()} -A)"x=O}. Then {Nv(A)} forms a nondecreasing chain of subspaces M (i,) = Nd).)c Nz(i.) c..., which ceases to increase after a finite number of steps. When v?m().), the space N v ().) is equal to a fixed subspace M(i,), sometimes called the root subspace (or generalized eigenspace or principal subspace) of A associated with i.. A vector in the root sub- space is called a root vector (or a generalized eigenvector). Then dim MV) = m().) and hence m(i.):(m(A). When A is a tnormal matrix, M(i,)= M(i.) and m(ic)=m(A). If two matrices A and Bare tsimilar, i.e., if there exists an invertible matrix P such that B = p1 AP, then A and B have the same eigen- values with the same algebraic (and geometric) multiplicities. The same conclusion holds for A and A', where A' is the ttranspose of A. For the adjoint matrix A* we have O"(A*)= O"(A) == {II icE O"(A)}. For an arbitrary poly- nomial f the relation o-(f(A)) = {(O"(A)) == [ IF) I i. E a(A)} holds (Frobenius's theorem). These relations can be extended to operators in a Banach space. In particular, O"(f(A)) = f(a(A)) if A is a bounded operator and f is a function holomorphic in a neighborhood of O"(A) (for the tspectral mapping theorem  251 Linear Operators). In the next four paragraphs, in which the spectral properties of tnormal or tHermitian matrices is discussed, we introduce into X the Euclidean tinner product ( , ), regarding X as a space of N -tuples of scalars. Let A be an 1446 N x N normal matrix. Then the eigenspaces associated with different eigenvalues of A are mutually orthogonal. Moreover, the eigen- spaces of A as a whole span the entire space X. One can therefore choose a tbasis of X formed by a tcomplete orthonormal set of eigenvectors of A. Specifically, there exists a basis {cpj Ij = 1, ... ,N} of X such that A CPj = J1jCPj and (CPi' CPj) = (jij' where (jij is the tKro- necker delta. Moreover, J11, ..., J1N exhaust all the eigenvalues of A. In terms of the basis {cpJ, an arbitrary XE X can be expanded as N Ax= L J1j(x, cp) CPj = L icPJ.x, j1 J.EG(A) (1 ) where PJ. is the orthogonal tprojection on the eigenspace associated with the eigenvalue i.. Of particular importance among normal matrices are Hermitian matrices and tunitary matrices. The eigenvalues of a Hermitian matrix are real, and those of a unitary matrix have the absolute value 1. Solving the eigenvalue problem of a normal matrix A leads immediately to the diagonali- zation of A. For instance, let U be the N x N matrix whose jth column is equal to CPj' Here the basis {cpj} is as before, and each CPj is re- garded as a column vector. Then U is unitary, and the transform U* A U of A by U is the diagonal matrix whose diagonal entries are the J1j' The problem of transforming a tHermi- tian form to its canonical form can also be solved by means of U. In fact, a Hermitian form Q = Q(x) on X is expressed as Q(x) = (Ax, x) with a Hermitian matrix A. For this A, construct U as before. Then by the trans- formation x= Uy of the coordinates of X, the form Q is converted to its canonical form Q = J111Y11 z + ... + J1NIYNl z . When X is a real linear space and A is a real symmetric matrix, U is an orthogonal matrix. By means of the orthogonal transformation x = U y, the sur- face of the second order Q(x) = 1 in R N is con- verted to the form J11 yf + ... + J1NY = I. The orthogonal transformation x = U y is called the transformation to principal axes of the sur- face Q(x) = 1. When A is not normal, it can be transformed into tJordan's canonical form by a basis CPj taken from the root subspaces MIA). However. CPj need not be orthonormal even when A is diagonalizable. C. Spectral Analysis in Hilbert Spaces Throughout the rest of this article except for the last section, X is assumed to be a Hilbert space with inner product ( , ). Furthermore, the most complete discussions will be confined to tnormal or tself-adjoint operators. A funda- 
1447 mental theorem in spectral analysis for such operators is the spectral theorem, which as- serts that a representation such as (1) holds in a generalized form. When the operator is tcompact, we have only to replace the sum by an infinite sum ( 68 Compact and Nuclear Operators). In the general case we need a kind of integral. This is discussed in detail in Sec- tions D and E. The general theory of spectral analysis for nonnormal operators, however, is rather involved even in Hilbert spaces, but two important developments can be noted. One is the theory of Volterra operators, and the other is the theory of essentially normal operators. The former is discussed in Section H and the latter and its related results in Sections I and J. D. Spectral Measure Let  be a tcompletely additive class of sub- sets of a set Q, that is, (Q,) is a tmeasurable space. An operator-valued set function E = E( .) defined on  is said to be a (self-adjoint) spectral measure if (i) E(M), M E, is an t or thogonal projection in X; (ii) E(Q)= 1; and (iii) E is tcountably additive, that is, E(Q M n )= J, E(M n ) (tstrong convergence) for a disjoint sequence {M n } of subsets in . A spectral measure E satisfies E(MnN)=E(M)E(N)=E(N)E(M), M, N E. Spectral measures which are fre- quently used in spectral analysis are those de- fined on the family r (cJ of all tBorel sets in the field of real (complex) numbers R (C). A spectral measure on r (c) is sometimes re- ferred to as a real (complex) spectral measure. For such a spectral measure E the support (or the spectrum) of E, denoted by 1\(E), is defined to be the complement of the largest open set G for which E(G)=O. A complex spectral mea- sure such that 1\(E) c R can be identified with a real spectral measure. Let E be a spectral measure on " and put E, = E( -00, n), -00 <A< 00. Then E, satisfies the relations E,E#=E minV .#), s-limE,=E#, ).-Jl+O s-lim E, =0, ).._ - 00 s-lim E, = 1, A-+ + 00 where s-lim stands for strong convergence. A family {E;,} 'ER of orthogonal projections satis- fying the relation (3) is called a resolution of the identity. Relation (2) gives a one-to-one cor- respondence between the resolutions of the identity and the spectral measures on r' Let E be a spectral measure on " and let x, YEX. Then the set function M ->(E(M)x, x)= 390 E Spectral Analysis of Operators IIE(M)xI1 2 is a bounded regular tmeasure in the ordinary sense, and the set function M -> (E(M)x, y) is a complex-valued regular tcom- pletely additive set function. For every com- plex Borel tmeasurable function f on R, the operator S(f) in X is defined by the relations D(S(f))={xlt: If(AW(E(d},)x,x)< oo}, (S(f)x, y) = f CLoo f(A)(E(d).)x, y), (4) xED(S(f)), YEX. S(f) is a densely defined closed operator and is denoted by S(f) = J':"oof(A)E(dA). The corre- spondence ff-> S(f) satisfies formulas of the so-called operational calculus ( 251 Linear Operators). In particular, S(J) = S(f)*, and hence S(f) is self-adjoint if f is real-valued. If f is bounded on the support of E, then S(f) is everywhere defined in X and is bounded. S(f) is sometimes called the spectral integral of f with respect to E. The operator S(f) can be defined in a similar way for a spectral mea- sure on c (and for a more general spectral measure). E. Spectral Theorems For every self-adjoint operator H in a Hilbert space X, there exists a unique real spectral measure E such that H = toooo AE(d).). (5) In other words, Hand E correspond to each other by the relations D(H)={xlf: A 2 (E(dA)X,X)< +oo}, (Hx,y)= f: A(E(dA)X,y), (6) xED(H), YEX. (2) This is the spectral theorem for self-adjoint operators. The support of E is equal to the spectrum O"(H), so that we can write H= f ).E(dA)= f oo AX<1(H)(A)E(dA), <1(H) -00 (3) where XM stands for the tcharacteristic func- tion of M. Formulas (5) and (6) are called the spectral resolution (or spectral representation) of the self-adjoint operator H. We call E the spectral measure for H, and the {E ,} corre- sponding to E by formula (2) (or sometimes E itself) the resolution of the identity for H. Let A be a real number. Then AEO"p(H) if and only if E(P})¥'O. Also, AEO"dH) if and only if E({).}) =0 and E(V)¥,O for any neighborhood 
390F Spectral Analysis of Operators V of ic. The spectral measure E can be repre- sented in terms of the resolvent R(ex; H) = (exl - H) -I of H by the formula 1 f b-O E((a, b))= lim lim---; {R(fl-F.i; H) 0)0 dO 2m a+o -R(fl+1;i; H)}dfl (strong convergence). For every normal operator A in X, there exists a unique spectral measure E on the family of all complex Borel sets !JIl c such that A = t zE(dz). This is called the complex spectral resolution (or complex spectral representation) of the normal operator A. The support 1\(E) is equal to a(A). There are characterizations of point and continuous spectra similar to the case of self-adjoint operators. Normal operators have no residual spectra. For a unitary operator U, the support of the associated spectral measure is contained in the unit circle r, so that U can be represented as U = t e ill F(dO) with a spectral measure F defined on r. For- mula (7) is the spectral resolution of the unitary operator U. F or a self-adjoint operator H = S  cc icE(dJ.) the following two types of classification of a(H) are often useful. (i) The essential spectrum ae(H) is by defini- tion the set a(H) minus all the isolated eigen- values of H with finite multiplicity. When H is bounded this definition of the essential spec- trum coincides with that to be given in Section I for a general A EB(X). a(H)"ae(H) is cal1ed the discrete spectrum of H. (ii) The set Xac(H) (resp. Xs(H)) (called the space of absolute continuity (resp. singularity) with respect to H) of all UEX such that the measure (E(dic)u, u) is absolutely continuous (resp. singular) with respect to the tLebesgue measure is a closed subspace of X that reduces H. The restriction of H to Xac (resp. Xs(H)) is called the absolutely continuous (resp. sin- gular) part of H, and its spectrum, denoted by adc(H) (resp. a,(H)), is called the absolutely continuous (resp. singul&r) spectrum of H. Note that aac(H) and a,(H) may not be disjoint. F. Functions of a Self-Adjoint Operator Let H = S ".'00 icE(dJ,) be a self-adjoint operator in X. For a complex-valued Borel measurable function f on R, we define f(H) to be the oper- ator S(f) determined by (4) in reference to the 1448 resolution of the identity E associated with H: f(H)= f:oo (().)E(dic). For an arbitrary aEX, let Lz(a) be the Lz- space over the measure fla = fla(') = (E(' )a, a). In other words, fELz(a) if and only if aE D(f(H)). The correspondence f......j(H) a gives an isometric isomorphism between Lz(a) and the subspace M(a)={J(H)alfELz(a)} of X. (In particular, M(a) is closed.) H is reduced by M(a), and the part of H in M(a) corresponds to the multiplication f(ic)->icf(ic) in Lz(a). For a given self-adjoint operator H there exists a (not necessarily countable) family {all}IIEE> of elements all of X such that X = I M(a e ), eEE> (8) where L stands for the tdirect sum of mutual1y orthogonal closed subspaces. Consequently, X is represented by the direct sum LEE> Lz(a e ) of Lz-spaces. If XE D(H) is represented by {Je}eEE> in this representation, H x is represented by Pfe}eEE>' (7) G. Unitary Equivalence and Spectral Multiplicity In this section X is assumed to be a tseparable Hilbert space. Then (8) can be made more precise. Namely, for a self-adjoint operator H, we can find a countable family {an ;I of ele- ments of X such that 00 00 X = I M(a.):;;;; I Lz(a n ), n=1 n=1 (9) fla"+! is tabsolutely continuous with respect to fla"' n = 1,2, .... (10) Furthermore, if {a} is another family satisfy- ing (9) and (10), then fla" and fla; are absolutely continuous with respect to each other (Hellinger-Hahn theorem). fla, is said to be the maximum spectral measure. Two operators HI and Hz are said to be unitarily equivalent if there exists a unitary operator U such that Hz = U*H, U. A crite- rion for unitary eql'ivalence of self-adjoint operators can be given in terms of the spectral representation given previously. Namely, let {a::)}, i = 1, 2, be a sequence satisfying (9) and (10) with respect to Hi' Then HI and Hz are unitarily equivalent if and only if J.ial) and fla2) are absolutely continuous with respect to each other for al1 n= 1,2,.... A self-adjoint operator H is said to have a simple spectrum if there exists an a E X such that M(a)=X. Such an aEX is called a gen- erating element of X with respect to H. 
1449 Self-adjoint operators with simple spectra are closely related to Jacobi matrices. Let H be such an operator with a generating ele- ment aEX. Take a complete orthonormal set {Gn}' in Lz(a) such that G n = Gn(i") is a poly- nomial of degree n - I and i"Gn(i")E Lz(a). Then (Yn}';;'j, Yn = Gn(H)a, is a complete orthonor- mal set in X. The matrix representation {amnL amn=(Hgn,Yml of H with respect to the basis {Yn} has the following properties: (i) amn=O if 1m - nl)o 2; (ii) a n . n + J = a n +1. n #0; (iii) ann is real Any infinite matrix {a mn } satisfying (i), (ii), and (iii) is called a Jacobi matrix. A Jacobi matrix determines a tsymmetric operator whose tde- ficiency index is either (0,0) or (1, 1). Any self- adjoint extension has a simple spectrum. (For more details about Jacobi matrices and their applications  [8J ) H. Triangular Representation of Volterra Operators A linear operator A in a Hilbert space X is called a tv olterra operator if it is tcompact and tquasinilpotent (i.e., 0 is the only spec- trum). The name is justified because under very general assumptions such an operator is unitarily equivalent to the integral operator of Volterra type in the vector-valued Lz space on [0, I]. Let '-P be a maximal ttotally ordered family of orthogonal projections in X such that P X is an tinvariant subspace of a Volterra operator A for every P E '-P. Such a family '-P always exists and is called a maximal eigen- chain of A. Then generalizing the triangular representation of nilpotent matrices, we have the integraJ representation A=2i f PAldP, 'J3 where A[=(A - A*)/(2i) is the imaginary part of A and the integral is the limit in norm of approximating sums of the form 1:QiAI(Pi- Pi -1) for finite partitions 0 = Po < PI < .., < Pn = I of'-P in which Q; is an arbitrary projection in '-P such that P i - 1  Qi Pi (M. S. Brodskir). Conversely, let '-P be a totally ordered family of orthogonal projections that contains 0 and the identity. If the integral A = h\ P BdP con- verges in norm for a compact linear operator B, then A is a Volterra operator and '-P is an eigenchain of A. If, moreover, B is self-adjoint, we have B=AI (1. C. Gokhberg and M. G. Krein; Brodskii). Furthermore, assume for simplicity that '-P is continuous in the sense that for every PI < Pz in '-P there exists an element P in '-P such that PI < P < Pz. If B is a tHilbert-Schmidt operator, then the integral A = J'i,PBdP converges in the tHilbert-Schmidt norm and the mapping B A is an orthogonal 390 I Spectral Analysis of Operators projection to the set of all Volterra operators of Hilbert-Schmidt class possessing '-P as an eigenchain (Gokhberg and Krein). Volterra operators with the imaginary part of the ttrace class are especially important for applicatIOns. In this case we have the following fundamen- tal theorem on the density of the spectrum of the real part A R of the Volterra operator A: Let n+(r; A R ) and n.(r; A R ) be the numbers of eigenvalues of A R in the intervals [I/r, (0) and (-00, -1/r], respectively. Then lim n+(r; AR)/r= lim n_(r; A R ) = h/n. r->Cf. rCX) The number h is given by h= f IldPA[dPIII \\ = infI II(P i - PH)AI(P i - Pi-dill, where the norm is the ttrace norm and the infimum is taken over aU finite partitions Pi of a maximal eigenchain '-P for A We refer for further details and applications to the books by Gokhberg and Krein [9, 10]. I. Fredholm Operators and Essential Spectra of Operators Throughout Sections I and J we assume that X is a separable infinite-dimensional complex Hilbert space, and we consider only bounded linear operators in X. The set B<C)(X) of all tcompact linear operators in X is a tmaximal two-sided ideal of the tC* -algebra B(X) of all bounded linear operators. The simple quotient C*-algebra A(X) = B(X)/B<C)(X) is called the Calkin algebra. We denote the quotient map- ping by n:B(X)---.A(X). Then an operator A E B(X) is a tFredholm operator if and only if its image n(A) is an invertible element of A(X). Let F(X) be the set of all Fredholm operators in X, and let Fn(X), nE Z, be its subset of all operators oftindex n. Fn(X) is a connected set in B(X), and in particular, Fo(X) is the inverse image of the connected component of the identity in the multiplicative topological group n(F(X)) of all invertible elements in A(X). The index gives the group isomor- phisms F(X)/Fo(X)  n(F(X))/n(Fo(X))  Z. More generally, we have for any compact topological space Y the group isomorphisms [Y, F(X)]  [Y, n(F(X))]  K(Y) of the groups of thomotopy classes of continuous mappings and the K-group in the tK-theory (M. F. Atiyah [12J). If N is a tnormal operator, its essential spectrum (Je(N) is defined to be the set of all lE (J( N) that is not an isolated eigenvaJue of finite multiplicity. Let HI and Hz be self- adjoint operators. Then (Jc(H I )= (Je(Hz) if and 
390J Spectral Analysis of Operators only if U* H, U = H 2 + K for a unitary opera- tor U and a compact operator K (Weyl-von Neumann theorem). 1. D. Berg and W. Sikonia (1971) extended this result to normal operators N I and N 2 Moreover, for any compact subset Y ofC there exists a normal operator N such that O"e(N) = Y Hence it follows that the essential spectrum (Je(N) of a normal operator N coin- cides with the tspectrum (J(n(N)) of the image in A(X). Thus we define the essential spec- trum (Je(A) of an arbitrary operator A to be the spectrum (J(n(A)) in A(X). An operator A E B(X) is said to be essentially normal (resp. essentially self-adjoint, essentially unitary) if n(A) is normal (resp. self-adjoint, unitary) in A(X). (Note that this definition of essentially self-adjoint operators is completely different from that in 251 Linear Operators E.) Since an essentially self-adjoint operator is the sum of a self-adjoint operator and a compact operator, the Weyl-von Neumann theorem classifies essentially self-adjoint operators up to tunitary equivalence modulo B(C)(X). An operator A is essentially normal if and only if the commu- tator [A, A *J is compact, but it need not be the sum of a normal operator and a compact operator. For example, let V be the unilateral shift operator that maps the orthonormal basis ei of X into ei+l for every i = 1, 2,. .. Then V is essentially unitary, but it cannot be written as the sum of a normal operator and a compact operator. The essential spectrum (Je(V) is the unit circle, whereas the spectrum (J(V) is the unit disk and ind(V - Ie) = -1 for lie 1< 1. J. The Brown-Douglas-Fillmore (BDF) Theory The following is the main theorem for essen- tially normal operators, due to L. G. Brown, R. G. Douglas, and P. A. Fillmore [14]. Let Al and A 2 be essentially normal operators. There are a unitary operator U and a compact oper- ator K such that U* Al U = A 2 + K if and only If (Je(A I )=(Je(A 2 ) and ind(A, -Ie)= ind(A 2 -).) for every Ie in the complement of the essential spectrum. An essentially normal operator A is the sum of a normal operator and a compact operator if and only if ind(A - Ie) = 0 for every Ie in the complement of (Je(A). To prove this and many other facts, they developed the theory of extension of B(C)(X) by the C*-algebra C( Y) of continuous complex- valued functions on a compact metrizable space Y [14-16]. This revealed deep relations between the theory of operator algebras on Hilbert spaces (-" 36 Banach Algebras, 308 Operator Algebras) and algebraic topology (in particular, K-theory;  237 K-Theory). Exten- 1450 sion theory also gives a natural setting for the tindex theory of elliptic differential operators due to Atiyah and 1. M. Singer [13, 16]. An extension of WCJ(X) by C( Y) is a tshort exact sequence 0-> B(C)(X)EC( Y)->O (11 ) of a C*-subalgebra E ofB(X) and *-homo- morphisms, i.e., E is a C*-subalgebra of B(X) containing the identity I and including B(C)(X) as a C*-subalgebra, and IfJ is a *-homomor- phism onto C( Y) whose kernel is equal to B(C)(X). Or equivalently, an extension is a uni- tal (identity preserving) *-monomorphism ,: C(Y)->A(X) defined by ,=nolfJ-I. (For general extension of C*-algebras  36 Banach Alge- bras.) Two extensions (E 1 , IfJI) and (E 2 , <P2) (or TI and T 2 ) are said to be equivalent if there exists a *-isomorphism 1jJ: EI -> E 2 such that 1fJ2 ° IjJ = IfJI (or equivalently there exists a uni- tary operator U such that n(U*) '1(f)n(U)= '2(f) for every {EC(Y)). We denote by Ext(Y) the set of all equivalence classes of extensions of B(C)(X) by C( Y). Let A be an essentially normal operator in X with the essential spectrum (Je(A) = Y. Then the C*-algebra E A generated by B«)(X), A and the identity I, and the *-homomorphism IfJA of E A onto C( Y) which sends A to the function X(z)=z define an extension (E A , <PA)' It is easy to see that two essentially normal operators Al and A 2 are unitarily equivalent modulo B(C)(X) if and only if (Je(A 1 )=(Je(A 2 ) and the extensions (E AI , IfJA) and (E A2 , IfJA,) are equivalent. Con- versely, if Y is a compact subset of C and (11) is an extension, then (E, 1fJ) is equivalent to (E A , IfJA)' where A is an essentially normal operator in E such that <p(A) = X. Extensions of B(C)(X) by C(Y) appear also in different parts of analysis. Let X be the Hil- bert space L 2 (M) on a compact differentiable manifold M relative to a fixed smooth measure and let E be the C*-subalgebra of B(X) gener- ated by all zeroth-order tpseudodifferential operators together with B(cJ(X). Then E and the tsymbol mapping 1fJ: E-> C(S*(M)) define an extension ofB(C)(X) by C(S*(M)), where S*(M) is the tcosphere bundle of M. This extension is closely related to the t Atiyah- Singer index theorem. Let Q be a tstrongly pseudoconvex domain in cn. Then the C*- algebra generated by tToep1itz operators with continuous symbol gives rise to an extension of B(C)(H 2 (aQ)) by C(aQ). Let '1 and '2 be *-monomorphisms from C(Y) into A(X) and a l = ['IJ and Q2 = ['2J be corresponding elements in Ext( Y). Then the sum a l + a2 E Ext(Y) is defined to be the equivalence class of T: C( Y)->A(X) whIch sends f to 'I (f)EB '2(f)EA(X) EB A(X)c A (X EB X)  A (X). It does not depend on the 
1451 choice of '1' '2 and the unitary X E8 X  X. An extension ,: C( Y)->A(X) is said to be triv- ial if there exists a unital *-monomorphism 0": C(Y)->B(X) such that ,= 7T: 0 0". For each compact metrizable space Y there exists a unique equivalence class of trivial extensions in Ext(Y). Ext(Y) is an Abelian group in which the class of trivial extensions is the identity element. The extension (E A , CPA) for an essentially normal operator A is trivial if and only if A = N + K with N normal and K compact. Hence follows the Weyl-von Neumann-Berg-Sikonia theorem. The BDF theorem for essentially normal operators is proved by the pairing Ext(Y) x K 1 (Y)->Z defined by the index, where K 1 (Y)= K(SY+) = Iimnoo[Y,GL(n,C)] (237 K-Theory; [13J). The induced homomorphism roo: Ext( Y)-> Hom(K 1 (Y), Z) is always surjective, and it is an isomorphism for Y e R 3 or Y = sn but not for Y e R 4 . Ext is a tcovariant functor from th(, cate- gory of compact metrizable spaces to the category of Abelian groups. It is thomotopy invariant. Define for n = 0, 1, '" the group Ext 1 _ n ( Y) by Ext(sn Y), where sn Y is the n-fold tsuspension. Then we have the periodicity Ext n + 2 (Y)  Extn(Y). Moreover, for each pair of compact metrizable spaces Y::::>Z we have the long exact sequence i* p* (1' i* Ext (Z)->Ext(Y)->Ext(YjZ)->Ext (Z)-> .. , n n n n-1 where YjZ is the space obtained from Y by collapsing Z to a point and a is q*r;l: Extn(YjZ)-> Extn(SZ) defined by q: YUCZ-> (YU CZ)jY = SZ and r: YU CZ ->( YUCZ)jCZ = YjZ, CZ being the tcone over Z. Ext* satisfies the tEilenberg-Steenrod axioms for homology theory except for the dimension axiom, which is replaced by Extn(So) = Ext(SH) = Z for n even and = 0 for n odd. The von Neumann algebra B(X) is classified as a tfactor of type 100' In !he case of a factor M of type 1100 another index theory has been developed by H. Breuer [17J and others re- placing B(e)(x) by the closed ideal of M gen- erated by finite projections and using the tsemifinite trace on M for the dimensions of kernels and cokernels of operators in M. K. Spectral Analysis in Banach Spaces Spectral analysis becomes rather involved for general operators in a Banach space as well as for nonnormal operators in Hilbert space. For a tcompact operator A, the nature of the spectrum O"(A) and the structure of A in the root subspace associated with a nonzero eigen- value are well known ( 68 Compact and 390 Ref. Spectral Analysis of Operators Nuclear Operators). However, a full spectral analysis may not be possible without further assumptions. For a tclosed operator with nonempty re- solvent set p(A), an toperational calculus can be developed by means of a function-theoretic method based on the fact that the resolvent R, =().J - A)-1 is a B(X)-valued holomorphic function of.le in p(A). In particular, the tspec- tral mapping theorem holds ( 251 Linear Operators G). A general class of operators having asso- ciated spectral resolution was introduced by N. Dunford. Let X be a Banach space. An operator E E B(X) is called a projection if E 2 = E. As before we can define a (projection- valued countably additive) spectral measure on :JB e . An operator A E B(X) is said to be a spec- tral operator if there exists a spectral measure E on :JB e satisfying the following properties: (i) E(M)A=AE(M), ME:JB e ; (ii) O"(AIE(M)X) e M, ME:JB o where Aly is the restriction of A to Y and M is the closure of M; (iii) there exists a kO such that IIE(M)II «k for all M E:JB e . E is unique. A spectral operator A is expressed as A =S + N, where S= fczE(dz) and N is tquasi- nilpotent. A is said to be a scalar operator if N =0. Unbounded spectral operators are defined similarly, with (i') E(M)A e AE(M) in place of (i). However, for unbounded spectral opera- tors A we no longer have the decomposition A = S + N. (For mOre details about spectral operators  [3]. For other topics related to the material discussed in this section  68 Compact and Nuclear Operators; 251 Linear Operators; and 287 Numerical Computation of Eigenvalues.) References [IJ N. I. Akhiezer and I. M. Glazman, Theory of linear operators in Hilbert space I, II, Ungar, 1961, 1963. (Original in Russian, 1950.) [2J F. Riesz and B. Sz.-Nagy, Functional analysis, Ungar, 1955. (Original in French, 1952.) [3J N. Dunford and J. T. Schwartz, Linear operators, I, II, III, Interscience, 1958, 1963, 1971. [4J K. Y osida, Functional analysis, Springer, 1965. [5J T. Kato, Perturbation theory for linear operators, Springer, 1966. [6J M. Reed and B. Simon, Methods of mod- ern mathematical physics, Academic Press, I, 1972; 11,1975; III, 1979; IV, 1978. [7J P. R. Halmos, Introduction to Hilbert space and the theory of spectral multiplicity, Chelsea, second edition, 1957. 
39] A Spectral Geometry [8J M. H. Stone, Linear transformations in Hilbert space and their applications to analy- sis, Amer. Math. Soc. Colloq. Pub!., 1932. [9J I. C. Gokhberg (Gohberg) and M. G. KreIn, Introduction to the theory of linear non-self-adjoint operators, Amer. Math. Soc., 1969 (Original in Russian, 1965.) [10J I C. Gokhberg (Gohberg) and M. G. KreIn, Theory and applications of Volterra operators in Hilbert space, Amer. Math. Soc., 1970. (Origmal in Russian, 1967.) [IIJ R. G. Douglas, Banach algebra tech- niques in operator theory, Academic Press, 1972. [12J M F. Atiyah, K-theory, Benjamin, 1967. [13J M. F. Atiyah, Global theory of elliptic operators, Proc. Intern. Conf. on Functional Analysis and Related Topics, Univ. Tokyo Press, 1970, 21-30. [14J L. G. Brown, R. G. Douglas, and P. A Fillmore, Unitary eqUIvalence modulo the compact operators and extensions of C*- algebras, Lecture notes in math. 345, Springer, 1973, 58-128. [I5J L. G. Brown, R. G. Douglas, and P. A. Fillmore, Extensions of C*-algebras and K- homology, Ann. Math., 105 (1977),265-324. [16J R. G. Douglas, C*-algebra extensions and K-homology, Ann. math. studies 95, Princeton Univ. Press, 1980. [17J M. Breuer, Fredholm theories on von Neumann algebras, Math. Ann, I, 178 (I 968), 243-254; 11,180 (1969),313-325. 391 (VII.21) Spectral Geometry A. General Remarks Let £3 be a Euclidean 3-space with a standard coordinate system (X I ,X 2 ,X 3 ) and £2 be the XI x 2 -plane. Consider a domain D in £2 as a vibrating membrane with the fixed boundary aD. Then the height X 3 = F(Xl' x 2 ,t) obeys the differential equatIOn of hyperbolic type, c2F(X1,X2,t)ji3t2=c:2I1F(XI,X2,t), where 11= (2 F:XT + 17 2 jrx denotes the tLaplacian in £2 and c is a constant (we put c = 1 in the follow- ing). For solutions of the form F(x I, X2, t) = V(x 1, X2) V(t), V is a solution of the tDirichlet problem in Dc £2; I1V + I,V =0, where I, is a positive constant called an telgenvalue of 11 Solutions of the form F(X I ,X 2 ,t)= V(x 1, x 2 )sinfit represent the pure tones that the membrane produces as normal modes. That is, the shape of D is related to the possi- ble sounds or vibrations (I.e., to the eIgen- values of 11) through the Dirichlet problem. 1452 The set of eigenvalues of 11 is called the spec- trum of D and is denoted by Spec(D). There arises the question of how much information Spec(D) can impart about the geometric prop- erties (i.e., the shape, extent, and connected- ness) of D. Generally, spectral geometry is the study of the relations between the spectrum of domains D of tRiemannian manifolds or compact Riemannian manifolds (M,g) and the geometric properties of D or (M,gl. B. Spectra Let ::Z;P(M) denote the space of smooth tp_ forms on a compact rn-dimensional Coo_ Riemannian manifold (M,g). Then eigenvalues of the tLaplacian (Laplace- Beltrami operator) 11 acting on 9tJP(M) are discretely distributed in [0,(0), and each multiplicity is finite ( 68 Compact and Nuclear Operators, 323 Partial Differential Equations of Elliptic Type) The spectrum for p-forms SpecP(M, g) is {/cp.1  A p . 2  ... }, where each eigenvalue is repeated as many times as its multiplIcity indicates. If ° is an eigenvalue, its multiplicity is equal to the pth tBetti number of M. I n the following, mainly the case p = ° is explained. Speco(M,g) is denoted by Spec(M, g). ° is always in Spec(M,g) and its multiplicity is 1. So we put Ao = 0, and 1'1 is the first nonzero eigenvalue. A geometric meaning of 11{ at x for a function l is as fol- lows: If {Yh};:'1 are rn geodesics mutually or- thogonal at x and parametrized by arc length, then (!If)(x) = Lh(f 0 Yh)"(O). Let {({Ji} i",o be an orthonormal basis of q:o(M) consisting of eigenfunctions: l1({Ji + Ai({Ji =0, «({J,,<fJj) = SM ({Ji({Jj= Oij. Then the tfunda- mental solution £(x, y, t) of the heat equation 11-i3jt=O is given by £(x,y,t)=Lie-Ait({Ji(X)@ <fJi(y) as a function on M x M x (0,00). We put Z(t) = SM £(x,x, t)= Lie-Ai'. Z(t) and Spec(M,g) are equivalent. The Minakshisun- daram-Pleijel asymptotic expansion of Z(t), Z(t) (lj4nt)m/2(a o + a 1 t + a 2 t 2 + ...), tlO, is the bridge connecting Spec(M, g) and geo- metric properties of (M, g), because a o , aj, ... can be expressed as the integrals of functions over M defined by g=(gi)' the components Rjkt of the tRiemannian curvature tensor, and their derivatives of finite order [IJ a o is the volume of(M,g) and a 1 =(lj6))MS, where S is the tscalar curvature. a 2 was calculated by H. P. Mckean, I. M. Singer, and M. Berger, and a3 by T. Sakai. Let D be a bounded domain in £2 or, more generally, a bounded domain in a Riemannian manifold, and assume that the boundary aD is piecewise smooth. For smooth functions which 
1453 take the value ° on aD, eigenvalues of the Laplacian L1 are discretely distributed in (0, 00), and each multiplicity is finite. We denote the spectrum of D by Spec(D) = {AI < A 2  ... }. The multiplicity of )'1 is 1, and an eigenfunction I corresponding to Al takes the same sign in D. The behavior of Z(t)=Lie-Ait for D is dif- ferent from that for (M, g) since Z(t) reflects the geometric situation of aD in this case. (M, g) and (N, h), or DI and D 2 are called isospectral if they have the same spectra. Examples for which the spectrum is explic- itly calculable are as follows: spheres (sm, go = canonical), real projective spaces (Rpm, go), complex projective spaces (cpn,Jo,go),(S2n+1, g, = suitably deformed from go), flat tori, (and for domains D) unit disks, rectangles, equi- lateral triangles, etc. C. Congruence and Characterization Let D 1 and D 2 be bounded domains in E 2 . An open question is whether isospectral D1' D 2 are congruent. Concerning this, there is M. Kac's paper with the famous title "Can one hear the shape of a drum?" Let D be a bounded domain in £2 with smooth boundary aD. If D has r holes, then Z(t) A(D)/4nt- L(aD)/4j4;t+(1-r)/12, tlO, holds (A. Pleijel, Kac, P. Mckean, I. M. Singer;  [7J), where A(D) denotes the area of D and L(aD) denotes the length of aD. This theorem implies that the area, the length of aD, and the number of holes are determined by Spec(D). In particular, if D 2 is a unit disk, Spec(Dd = Spec(D 2 ) implies that DI and D 2 are congruent. There are some other results on Z(t) for do- mains D of surfaces in Em or for domains D in Riemannian manifolds (M,g). Two isometric (M, g), (N, h) are isospectral. Concerning the question of whether isospec- tral (M. g), (N, h) are isometric, there are some counterexamples. The first is the case of two flat tori T 16 , given by J Milnor. Examples with nonflat metrics were given by N. Ejiri using warped products and by M. F. Vigneras for surfaces of constant negative curvature. In those examples, M and N are homeo- morphic. A. Ikeda showed that there are tlens spaces that are isospectral but not homotopy equivalent. Examples of affirmative cases are as follows: Spec(M,g) = Spec(Sm, go), m6, implies that (M,g) is isometric to (sm, go) (M. Berger, S. Tanno). The result is the same for (Rpm, go), m6. For n6, (cpn,Jo, go) is characterized by a spectrum among tKiihlerian manifolds (M,l,g). 3910 Spectral Geometry The number of nonisometric flat tori (or more generally, compact flat Riemannian manifolds) with the same spectrum is finite (M. Kneser, T. Sunada). If one considers spectra for two types of forms, then the situation turns out to be sim- pler. For example, if SpecP(M, g) = SpecP(N, h) for p = 0, 1, then (M, g) is of constant curvature K if and only if (N, h) is also and K' = K (V K. Patodi [9J). D. The First Eigenvalue The first eigenvalue )'1 for (M, g) or for a domain D in (M, g) reflects the geometric situ- ation of (M, g) or D. A lower bound of A I given by 1. Cheeger is AI;;:' h(M)2/4, where h(M) is the isoperimetric constant, defined by h(M)= inf{ vol (S)/min[ vol(MrJ, vol(M 2 )J}, where the inf is taken over all smoothly em- bedded hypersurfaces S dividing M into wo open submanifolds M1' M2' aMI = aM 2 = S, and vol means the volume. 1/4 is the best possible estimate. Let p denote the maximum radius of a disk included in a simply connected Dc E 2 . Then ;'1 ;;:'1/(4 p 2) (W. K. Hayman, R. Osserman). If (M, g) has tRicci curvature ;;:. k > 0, then )'1 ;;:'mk/(m-l) holds, and the equality holds if and only if(M,g) is isometric to (sm,(I/k)go) (A. Lichnerowicz, M. Obata). )'1 can also be estimated from k and the diameter d(M) of M (P. Li, S. T. Yau). To obtain upper bounds of )'1 the minimum principle of Al is useful. We state it only for (M,g): AI =inf{L (\If,Vf)! LI2}, where inf is taken over all piecewise smooth functions I satisfying S M 1= 0, I of- 0, and ( , ) denotes the local inner product with respect to g. An upper bound of Al for (M, g) of nonnega- tive curvature is )'1 c(m)/d(M)2, where c(m) is some constant depending on m (Cheeger). For (M, g) or D a submanifold of another Riemannian manifold, there exist some esti- mates of Al in terms of second fundamental forms, etc. Al ;;:.l/(A(D)) holds for DcE 2 , and the equality holds if and only if aD is a circle (c. Faber, E. Krahn), where} denotes the first zero of the tBessel function lo. This estimate is generalized in many directions; for example, for Dc(M 2 ,g) in terms of the integral of the Gauss curvature, etc. It is very useful to note that the estimate of Al for D is deeply related to the isoperimetric inequality ( 228 Isoperi- metric Problems). 
391 E Spectral Geometry E. Hersch Type Theorem With respect to the first eigenvalue Adg) and the volume V ol(M, g) of (M, g), Al (g)' vol(M, g)2/m is invariant under a change of metric g->c 2 g (c is a constant). Hersch's prob- lem is stated as follows: Is there a constant k(M) depending on M so that for any Rie- mannian metric 9 on M, Al (g)' vol(M, g)2/m k(M)? J. Hersch proved this for a 2-sphere M = S2 with k(S2) = 8n, and in this case the equality holds if and only if 9 is proportional to the canonical metric go. The Hersch type theorem holds for an oriented Riemann surface M of genus q with k(M)= 8n(q + 1) (P. C. Yang, S. T. Yau). There is no such constant k(sm) for an m-sphere sm, m)o3 (H. Urakawa, H. Muto, S. Tanno). F. The Multiplicity of Ai By a theorem of K. Uhlenbeck each eigenvalue for a RIemannian manifold (M, g) is simple. However, for (sm, go) the first eigenvalue AI is m and its multiplicity is m+ 1. Furthermore, for some gs deformed from go, AI (gs) of (S2n+ l , gs) has multiplicity n 2 + 4n + 2, which is larger than m+ 1 (=2n+2). The multiplicity m(Ai) of the ith eigenvalue ).j for a Riemann surface of genus q satisfies m().i)  4q + 2i + 1 (S. Y. Cheng, G. Besson). G. kth Eigenvalue The minimum principle for Ak of Spec(M, g) is stated as fonows: Let I. be an eigenfunction corresponding to Ai' O i k-1. Define H k - 1 to be the set of piecewise smooth functions I of- 0 orthogonal to each I., i.e., S M II. = O. Then Ak=inf{L(Vf,V n ! LI2}. where inf is taken over IE H k - I . We have the minimax principle for Ak of the first and second type. We state the second type only: }'k = inf sup {f (Vf, v n ! f P } , L k + 1 0i=feL k + 1 M M where Lk denotes a k-dimensionallinear sub- space of 9&o(M). From this, for I-parameter metrics gu (a < u < b) on M, the continuity of )'k(gu) with respect to u fonows. H. Courant-Cheng Nodal Domain Theorem Let I be an eigenfunction on (M, g) or D. The set of all zero points of I is called the nodal set of .r (or the nodal curve of I if m = 2). Each connected component of the complement of 1454 the nodal set in (M, g) or D is called a nodal domain of f The nodal set of I is a smooth submanifold of (M, g) or D except for a set of1ower dimension. The number of nodal domains of an eigenfunction corresponding to the ith eigenvalue is  i + 1 for (M, g) and  i for D (Courant-Cheng nodal domain theorem). I. Estimate of N(A) N().) is defined as the number of eIgenvalues of (M,g) or D which are less than or equal to A. For Dc £2, H. A. Lorentz conjectured that the behavior of N(A) for A->OO does not depend on the shape of D but only on the area A(D) of D, i.e., lim ho N (A)j A = A( D)j4n. This was proved by H. Weyl. Generally, for D or (M,g), the behavior of N(A) for A->OO is vol(D)A m / 2 j (4n)m/2r(mj2+ 1), and this is related to the first term vol(D)j(4ntt/ 2 of the asymptotic expansion of Z(t) by tTauberian and tAbelian theorems. J. Spec( M, g) and Geodesics Let Tm = Em jr be a flat torus, r being the lattice for T m . Let r* be the lattice dual to r. Then Poisson's formula, L: e -4n'lyl't = (vol(T m )j(4nt)m/2) L: e -lx 1 '/4t, yer* xef gives a clear relation between Spec(Tm) = {4n 2 IYI2,YEr*} and the set {lxl,XEr} of lengths of closed geodesics on Tm. If (M, g) satisfies some conditions, then Spec(M, g) determines the set of lengths of periodic geodesics (Y. Colin de Verdiere), and the spectrum characterizes those Riemannian manifolds whose geodesics are all periodic (J. J. Duistermaat, V. W. Guillemin). K. SpecP(M, g) and the Euler-Poincare Characteristic Let (M, g) be oriented and even dimensional. Let £P(x, y, t) be the tfundamental solution of the theat equation for p-forms. Corresponding to Z(t) for Spec(M, g), we get ZP(t'I=.fM£P= Lie-4p.,c. Then Jo ( -1)PZP(t) = Jo (-1)P f M tr £P = X(M), where X(M) denotes the tEuler-Poincare char- acteristic of M (Mckean, Singer). On the other hand, the tGauss-Bonnet theorem is X(M) = .f Me, where C is a function on M expressed as a homogeneous polynomial of components of the Riemannian curvature tensor. Then 
1455 Patodi proved m L: (-I)Ptr£P=C+O(t), tiO. po L. 'I-Function Let (X, g) be a compact oriented 4k-dimen- sional Riemannian manifold with boundary eX = Y and assume that some neighborhood of Y in (X, g) is isometric to a Riemannian product Y x [0,8). Define an operator B acting on forms of even degree on Y by Bw=( -1)k+r+L(*d-d*)w, wE9&2'(Y), where * denotes the tHodge star operator and d denotes exterior differentiation on Y. Then B 2 =!i holds. Using the spectrum {tl} of B, we define the 'I-function by I](S)= L: (sgntl)ltll-'. w/'O 'I(S) is a spectral invariant, and sgn(X) = Ix Lk(PI,"', Pk) - '1(0) holds (Atiyah, Patodi, and Singer [2]), where sgn(X) is the tsignature of the quadratic form defined by the tcup product on the image of H 2k (X, y) in H 2k (X), Lk is the kth tHirzebruch L-polynomial, and Ph ..., Pk are the Pon- tryagin forms of (X, g). M. Analytic Torsion Let X be a representation of the fundamental group ndM) of(M,g) by the orthogonal group and Ex be the associated vector bundle. Let !iX be the Laplacian acting on Ex-valued p-forms on M and {A.i} be its spectrum. Then logT(M,X)= f (-I)PPIOg ( L:(A,;)-' )1 J'O ds , FO is independent of the choice of g. T(M, X) is called the analytic torsion of M. T(M, X) is equal to the t R-torsion T(M, X) (W. Miiller, Cheeger). N. Concluding Remarks An tisometry If; of (M, g) commutes with the Laplacian and induces a linear transformation If;f of each eigenspace VA' Using the asymp- totic expansion of Ltr(lf;f)e- A ', the Atiyah- Singer tG-signature theorem has been proved (H. Donnelly, Patodi). The Atiyah-Singer tindex theorem has been proved by using Gilkey's theory and the heat equation (Atiyah, R. Bott, Patodi). Let (N, h) be a complete Riemannian mani- fold of negative curvature. Then !i is extended 392 Spherical Astronomy to an unbounded self-adjoint operator for L 2 (N). Generally !i has a continuous spec- trum. Some conditions for (N, h) to have pure point spectrum were given in terms of curva- ture (Donnelly, P. Li). If D is a minimal submanifold of another Riemannian manifold, estimates of AI are related to the stability of D ( 275 Minimal Submanifolds). On the nonexistence of the I-parameter isospectral deformation (M, go)....(M, g,), there are results for (i) m = 2 and go of negative curvature, m;;:' 3 and negatively pinched go (V. Guillemin, D. Kazhdan); (ii) flat metrics go (R. Kuwabara); and (iii) go of constant positive curvature (Tanno). As for spectral geometry for complex La- placian on Hermitian manifolds, there are results by P. Gilkey, Tsukada, and others. References [1] M. F. Atiyah, R. Bott, and V. K. Patodi, On the heat equation and the index theorem, Inventiones Math., 19 (1973), 279-330. [2] M. F. Atiyah, V. K. Patodi, and I. M. Singer, Spectral asymmetry and Riemannian geometry I, II, III, Proc. Cambridge Phil os. Soc., 77 (1975), 43-69, 405-432; 79 (1976),71- 99. [3] M. Berger, P. Gauduchon and E. Mazet, Le spectre d'une variete riemannienne, Lecture notes in math. 194, Springer, 1971. [4] J. Cheeger, A lower bound for the smallest eigenvalue of the Laplacian, Problems in Analysis: Symposium in Honor of Salomon Bochner, R. C. Gunning (ed.), Princeton Univ. Press, 1970, 175-199. [5] P. B. Gilkey, The index theorem and the heat equation, Publish or Perish, 1974. [6] M. Kac, Can one hear the shape of a drum? Amer. Math. Monthly, 73 (April 1966), 1-23. [7] H. P. Mckean and I. M. Singer, Curvature and eigenvalues of the Laplacian, J. Differen- tial Geometry, 1 (1967),43-69. [8] S. Minakshisundarum and A. Pleijel, Some properties of the eigenfunctions of the Laplace operator on Riemannian manifolds, Canad. J. Math., 1 (1949),242-256. [9] V. K. Patodi, Curvature and the funda- mental solution of the heat operator, J. Indian Math. Soc., 34 (1970), 269-285. 392 (XX.5) Spherical Astronomy Spherical astronomy is concerned with the apparent positions of celestial bodies and their 
392 Ref. Spherical Astronomy motions on a celestial sphere with center at an observer on the Earth, while tcelestial mechan- ics is concerned with computing heliocentric true positions of planets and comets and geo- centric true positions of satellites. The purpose of spherical astronomy is to find all possible causes of displacement of the apparent posi- tions of celestial bodies on the celestial sphere from their geocentric positions and to study their effects. Atmospheric refraction, geo- centric parallax, aberration, annual parallax, precession, nutation, and proper motion are examples of these causes. When light from a celestial body passes through the Earth's atmosphere, it is refracted since air densities at different heights are differ- ent This phenomenon is called atmospheric refraction. The effect of refraction on the apparent direction of the celestial body is a minimum when the body is at its culmination, and vanishes when this coincides with the ob- server's zenith, while the maximum refraction of 34:5 occurs when the body IS at the horizon. Topocentric positions differ appreciably from geocentric positions for the Moon and planets, since their geocentric distances are not large compared with the radius of the Earth The difference is largest when the observer is on the equator and the celestial body is at the horizon, and this largest value is called the geocentric parallax. The geocentric parallax of the Moon is between 53:9 and 60:2; those of the Sun, Mercury, Venus, Mars, Jupiter, and Saturn are, respectively, 8:'64-8:'94, 6" -\6:' 5, 5" -32", 3:'5-23:'5, 1:'4-2:'1, and 0:'8- 1:'1. For fixed stars geocentric parallaxes can be re- garded as zero since the stars are far from the Earth. The Earth moves in an orbit around the Sun with period of one year (3652564 days) and rotates around the polar axis, which is inclined at 665 to the orbital plane (the ecliptic), with period of one day (23 hours, 56 minutes, 4.09\ seconds). Therefore the observer on the Earth moves with a speed depending on the latitude (0465 km/sec on the equator) due to the rota- tion and moves with an average speed of 29.785 km/sec on the ecliptic. Due to these motions of the observer, apparent directions of celestial bodies are displaced from their geo- metric directions. Displacement due to the rotation is called diurnal aberration, and that due to the orbital motion, annual aberration. The effect of diurnal aberration is between 0" and 0:'32 and varies with a period of one day, while that of the annual aberration is between 0" and 20:'496 and varies with a period of one year. Moreover, to compute the positions of celestial bodies, the travel time of light to the observer should be taken into account. \456 Annual parallax for a fixed star is half the difference of its apparent directions, which are measured at the ends of a diameter perpendic- ular to the direction of the star from the orbit of the Earth The effect of the annual paral- lax varies with a period of one year. However, except for nearby stars, it is not necessary to take this effect into account when computing apparent positions. The pole of the Earth on the celestial sphere moves on a circle around the pole of the eclip- tic due to the gravitational attraction of the Moon, Sun, and planets, and therefore the equinox moves clockwise on the ecliptic. Because the resultant of the attractive force of the Moon, Sun, and planets changes periodi- cally, the motion of the equinox is not uni- form. Therefore the motion is expressed as the sum of secular motion, called precession, and periodic motion, called nutation. Since the positons of fixed stars on the celestial sphere are measured with respect to the equator and the equinox, their right ascensions and decli- nations are continuously changing because of precession and nutation. Since the stars are not fixed in space but themselves have proper motions, their posi- tions on the celestial sphere are continuously changing. Spherical astronomy also includes predic- tions of solar and lunar eclipses, the theory of tor bit determination to compute apparent positions of celestial bodies in the solar system by use of orbital elements, and the compu- tation of ephemerides for the Sun, Moon, planets, and fixed stars. Practical astronomy, which develops theories and methods of ob- servation by use of meridian circles, transit instruments, zenith telescopes, sextants, the- odolites, telescopes with equatorial mountings, and astronomical clocks, and navigational astronomy, which deals with methods for determining the positions of ships and aircraft, are closely connected to spherical astronomy. It should be noted that recently radar has been used to measure distances to the Moon and planets accurately, a contribution to de- termining the size of the solar system with precision. References [IJ W. Chauvenet, A manual of spherical and practical astronomy I, II, Dover, 1960. [2J S. Newcomb, A compendium of spherical astronomy, Dover, 1960. [3J W. M. Smart, Textbook on spherical astronomy, Cambridge Univ Press, fifth edi- tion, \965. 
1457 [4J E. W. Woolard and G. M. Clemence, Spherical astronomy, Academic Press, 1966. 393 (XIV.6) Spherical Functions A. Spherical Functions The term spherical functions in modern termi- nology means a certain family of functions on tsymmetric Riemannian spaces obtained as simultaneous teigenfunctions of certain inte- gral operations ( 437 Unitary Representa- tions). In this article, however, we explain only the classical theory of Laplace's spherical functions with respect to the rotation group in 3-dimensional space Solutions oft Laplace's equation L1 V = 0 that are homogeneous polynomials of degree n with respect to the orthogonal coordinates x, y, z are called solid harmonics of degree n. If n is a positive integer, there are 2n+ 1 linearly independent solid harmonics of degree n. In tpolar coordinates (r, 0, cp) they are of the form r n Yn(O, cp), where y"(O, cp), the surface harmonic of degree n, satisfies the differential equation 1 c ( . cY,, ) 1 c 2 y" -- smO-- +-- sinOaO cO sin 2 0 Ccp2 + n(n+ 1) Y" =0. Here, if we apply tseparation of variables to 0 and cp and put z = cos 0, then the component in cp is represented by trigonometric functions, and the other component in 0 reduces to a solution of Legendre's associated differential equation 2 d 2 W dw (1-z )--2z- dz 2 dz +(n(n+ 1)- 1 ::2 )W=0. (1) B. Legendre Functions With m=O in (1) and n replaced by an arbi- trary complex number v, the equation is re- duced to Legendre's differential equation 2 d 2 w dw .(1-z )d;2-2z dz +v(v+l)w=O, (2) whose fundamental solutions are represented by 1 l(l+.z+) ((2-1)" . Pv(z) = 2ni 'f 2 V (z _ ()v+1 d, 1 f (i+,-1+) ((2-1)' Qv(z)= 4 ---:-:- 2V( ")'+1 de lsmvn Z-(, 393 B Spherical Functions where the contour of integration in (3) is a closed curve with positive direction on the (-plane, avoiding the half-line (-00, -1), and admitting 1 and z as inner points of the domain it bounds, whereas the contour in (4) is a closed oo-shaped curve encircling the point I once in the negative direction and the point -1 once in the positive direction. The functions Pv(z) and Q'(z) are called Le- gendre functions of the first and second kind, respectively. The integral representation (3) is Schliifli's integral representation. If Re( v + 1) > 0, we can deform the contour of integration and obtain 1 f l (1_(21' Qv(z) = 2v+1 _I (z _ ()v+l d(. (5) If v is an integer, it is convenient to use the representation (5). From (3)-(5), we can obtain the recurrence formulas for Legendre functions of distinct degrees. The recurrence formulas for Pv(z) and QAz) have exactly the same form ( Appendix A, Table 18.I1). Expanding the integrand in (3) and (4) with respect to z-1 and (/z, the fol- lowing identities are obtained: PJZ)=F(V+ 1, -v, 1, 1 z } 11-zl<2, j; qv+ I) Qv(z) (2z1'+ l qv+3/2) x F ( V + 1 , v + 2 , v + ,  ) , 2 2 2 z Izl> 1, largzl < n, where F(a, (J, y, z) is the thypergeometric func- tion. These expansions are the solutions in series of Legendre's differential equation in the neighborhood of the tregular singular points z= I and 00, respectively (Appendix A, Table 18.II). If v is a positive integer, since ( = 1 is not a tbranch point in (3), p"(z) is represented by Rodrigues's formula 1 l ((2_1)" Pn(z) = 2ni 'f 2"(z-()"+1 d( 1 d" =__(z2 _1)". (6) 2"n! dz" In this case, Pn(z) is a polynomial of degree n such that (3) ["/2] (2n-2r)! p ( z ) - " ( _1 ) ' z"-2r " - L, 2 n 1 ( _ ) 1 ( _ 2 ) 1 ' ,o r. n r. n r . (4) Po(z) = 1, which is called the Legendre polynomial (Le- gendre, 1784). The tgenerating function for the 
393 C Spherical Functions Legendre polynomials is (1-2PCOSO+p2)-'/2, whose expansion with respect to p is of the form L.;;:::o Pn(z)pn, z = cos O. Here the tgenerat- ing function (1 - 2p cos a + p2) -1/2 is the inverse of the distance between two points (p, a) and (1,0) in polar coordinates. Hence Pn(z) is also called the Legendre coefficient. If z is real, {( (2n + 1 )/2) 1/2 Pn(z) };;:::o constitutes an orthonormal system on [ -1, 1] ( 317 Orthogonal Func- tions). The n zeros of Pn(z) are all real, simple, and lie in (-1,1). For sufficiently large n, we have Pn(cos a) = J  O Sin (( n+ ) 0+ ) +O(I/n3/2), nnsm 2 4 Qn(cos 0) = J  c os (( n+ ) 0+ ) +O(I/n3/2) 2nsmO 2 4 as n-->oo. C, Associated Legendre Functions For any positive integer m, the functions P;'(z) = (1 - z2tl2 dmpy(z)/dz m , Q(z)=(1-z2t;2dmQy(z)/dzm are called the associated Legendre functions of the first and second kind, respectively. This definition, due to N. M. Ferrers, is convenient for the case -1 < z < 1. For arbitrary com- plex z in a domain G obtained by deleting the segment [ -1, 1] from the complex plane, the following definition, due to H. E. Heine and E. W. Hobson, is used: P;'(z) = (Z2 _1)m / 2 d m Py(z)/dz m , Q(z) = (Z2 - 1 t l2 dmQ.(z)/dz m . The associated Legendre functions satisfy the associated Legendre differential equation (1). In particular, for v = n (a positive integer) and z=x (real), {(2n + 1)(n-m)!/2(n +m)!} 1/ 2 Pnm(z), n=O, 1, .,. m=constant, constitute an orthonormal system on [ -1, 1]. The addition theorem for the Legendre func- tions is Pn(zlzd J ::t(1-zi) J ::t(1-Z)CosqJ) n (n-m)! = P n (Zd P n(Z2) + 2 I P,:"(zd m1 (n+m). X Pnm(Z2)cosmqJ, where the equality with the plus sign was 1458 obtained by Ferrers and that with the minus sign by Heine and Hobson. D. Surface Harmonics From the considerations so far for the sur- face harmonics y"(0, qJ), 2n + 1 independent solutions Pn(cosO), P,:"(cosO)sinmqJ, qP,:"(cos O)cosmqJ, lmn, are obtained Since Pn(cos a) vanishes on n latitudes of the unit sphere, and Pnm(cos 0). cosmqJ and Pnm(cos a) sin mqJ vanish on n-m latitudes and m longitudes of the unit sphere, respectively, the former functions are called zonal harmonics and the latter, tesseral har- monics. The general form of surface harmonics Y" of order n is given by a linear combination of zonal and tesseral harmonics: y"(0, qJ) = An.oPn(cos a) n + I (An.mcosmqJ+ Bn.msin mqJ)P,:"(cosO). (7) m=l Expressing two surface harmonics yl) and y!2) in linear combinations such as (7), the following orthogonality relations hold: fn I" yl)(O, qJ) y\2)(0, qJ)sin OdOdqJ 4n ( -,) - A(l) A(2) - n"2n+ 1 n,O n.O 1 n ( n+m ) ' ) +- I --------(A(l) A(2) + B(l) B(2») . 2 m=l (n _ m)! n,m n,m n,m n,m Since the family of all zonal and tesseral har- monics constitutes a tcomplete orthogonal system, it is possible to expand a function frO, qJ) on the sphere into an orthogonal series: 00 f(O,qJ)= I Y,,(O,qJ) n=O 00 ( n = no An.oPn(COSO)+ m2;;1 (An,mcos m qJ + Bn,msin mqJ)Pnm(cos a)) . To obtain surface harmonics, the following method is effective. Let v be a direction pro- portional to the direction cosines I, m, n. Then a function ( 1+m+n ) =IX 0 (  ) , ox oy oz r ov r ct= J I2+m2+n2 is a solution of Laplace's equation. Physically, this corresponds to a tpotential of double pole with moment IX and direction v. A more gen- 
1459 eral multi pole potential ( no )( 1 ) V=c [1a- - i=1 'OV i r also satisfies Laplace's equation. If we put V = Un(X, y, z)r- 2n - 1 , Un is a spherical function of order n (Maxwell's theorem). Various spherical functions correspond to particular directions Vi' For example, if every Vi is equal to z, we have zonal harmonics; and if n - m of the Vi are equal to z and m of the Vi are symmetric on the xy-plane, we obtain tesseral harmonics. Let y be an angle between two segments connecting the origin to the points (r, 0, cp) and (r', e', cp') in polar coordinates. Then cos y = cos 0 cos e' + sin Osin e' cos(cp - cp'), and if we choose the line connecting the origin to a point (r', e', cp') as the axis defining P n , we have r n +1 ( X' 0 y' 0 P (cosy)=( -1)"- -+- n n! r' ox r' oy + ) n (  ) . r' oz r These are called biaxial spherical harmonics, which can also be represented (by the addition theorem) by means of spherical harmonics with respect to each axis. E. Extension of the Legendre Functions We extend the associated functions with posi- tive integer m to any number m. First, if m is a negative integer, we put f z f m Pv-m(z) = (1- z2fm/2 1 d(m 1 d(m-I'" f " f 2 X 1 d(2 1 P v ((dd(l, f z f m Q;m(z)=(I_z 2 )-m/2 ao d(m ao d(m-I'" f 3 f 2 X ex) d(2 ao Qv((dd(l, a definition due to Ferrers. Then r(v-m+ 1) p-m(z) = ( -1)'" P::(z), v r(v+m+ 1) r(v-m+ 1) Q-"'(z)=( _1)m Q(z). v r(v+m+ 1) When we use the definition due to Heine and Hobson, the factor (_1)m in these formulas is excluded. Two fundamental solutions, called hypergeometric functions of the hyperspherical differential equation (l-z2)d 2 wJdz 2 -2(11+ l)zdwJdz +(v-ll)(v+ll+ l)w=O, 394 A Stability are P:::)(z) e- vtti 2v-4nsin vn X l (Z+.I+.Z-.1-) ((2_1)' d Y 'f (( - 2r++1 1" e(l+v)ni f W-1)' Q\:,__;,)(z) (-1+.1-) d(  2v-4isinvn ('-zr++1 ' where the contour of integration for the latter integral is a curve encircling the point -1 once in the positive direction and the point 1 once in the negative direction. Then the associated Legendre functions for an arbitrary number II are defined as follows: P:(z) r(v + Il+ 1) (Z2 -IV/ 2 P(')(z) 2r(v+l) v ' Q  v( z ) r(v+ II + 1) ( Z2 -IV/2 Q (__)(z). 2r(v+ 1) v  If V -II is a positive integer, P'::) is called the Gegenbauer polynomial, also denoted by Cv_(z). The Cv_(z) are obtained as coeffi- cients of the expansion of the generating func- tion (1-2hz+z2)-(2+I)/2 ( Appendix A, Table 20.1). For spherical functions of several variables there is an investigation by P. Appell and 1. Kempe de Feriet [2] (--" 206 Hypergeometric Functions D). References [IJ E. W. Hobson, The theory of spherical and ellipsoidal harmonics, Cambridge Univ. Press, 1931 (Chelsea, 1955). [2J P. Appell and 1. Kempe de Feriet, Fonc- tions hypergeometriques et hyperspheriques, polynomes d'Hermite, Gauthier-Villars, 1926. [3] C. Muller, Spherical harmonics, Lecture notes in math. 7, Springer, 1967. [4J T. MacRo bert, Spherical harmonics, Sned- den, third revised edition, 1967. Also  references to 206 Hypergeometric Functions, 389 Special Functions. 394 (XIII.13) Stability A. General Remarks Stability was originally a concept concerned with stationary physical states. When a state is affected by a small disturbance, this state is said to be stable if the disturbance subse- quently remains small, and unstable if the dis- 
394 B Stability turbance gradually increases. For instance, consider a rod placed in the Earth's gravi- tational field with one end fixed at a point around which the rod can rotate freely. When the rod is placed vertically, this state is station- ary. It is stable if the rod is hanging down from the fixed end, and unstable if it is standing on the fixed end. In physical systems only the stable state is practically realizable, so this dis- tinction is important. The concept of stability is used not only in relation to physical states but also in many other fields of science. We shall restrict our- selves to stability of solutions of differential equations. There, the term stability is used in the sense that a small change in the initial values results in a small change in the solution. As long as the solution is considered within a finite interval of the independent variable, this stability is naturally guaranteed by the con- tinuity of the solution with respect to its initial values ( 316 Ordinary Differential Equations (Initial Value Problems)). The problem arises when an independent variable moves over an unbounded interval. Let (Xl' ...,xn)=x, (xdt),... ,xn(t))=x(t), (x't(t), ...,x(t))=x'(t) (the symbol' means differentiation by t), and Ixl=Lj'llxJ Con- sider the differential equation x'=f(t,x), for which the existence and uniqueness of the solution of the initial value problem is as- sumed for Itl<oo, Ixl<oo. Let x=qJ(t) be a solution of (I). If for any 1»0 and to, a b>O can be chosen so that Ix(to)-qJ(to)l<b implies Ix(t)-qJ(t)1 <I> for to«t<oo (-00 <t«t o ), where x(t) is any solution of (I), then x=qJ(t) is said to be (Lyapunov) stable in the positive (negative) direction. If it is stable both in the positive and negative directions, it is said to be stable in both directions. In the remainder of this article we will consider stability in the positive direction only. Corresponding asser- tions for stability in the negative direction can be obtained by reversing the sign of t. B. Classification We denote by x=x(t,to,xo) a solution of (1) such that x=x o at t=t o Suppose a solution x = qJ(t) is stable. Iffor any to there exists a (>0 such that Ix(t,to,x o ) -qJ(t)I......O as t......oo for any x(t,to,xo) with Ixo - qJ(to)1 < (, qJ(t) is said to be asymptotically stable. If a constant b in the definition of stability can be chosen independently of to, f/J(t) is said to be uniformly stable. When equation (1) 1460 is tautonomous, stability implies uniform SLability. If (1) qJ(t) is uniformly stable and (2) for any to and I] > 0, there exist a (> 0 independent of to and a T> 0 independent of to such that Ix o - qJ(to)1 < ( and t> to + T imply Ix(t,to, xo) -qJ(t)1 < 1], then qJ(t) is said to be uniformly asymptotically stable. Suppose that there exists a positive number )" with the following property: For any to and c;>0 one can take a b(I»>O such that Ix o -(Wo)1 < b(l» implies Ix(t,to, xo)-t/J(t)1 < c;e-i(t-I o ) for t to. Then t/J(t) is said to be exponentially stable. Exponential stability implies uniform asymptotic stability. C. Criteria To deal with the stability of x = qJ(t), we need consider only the case qJ(t) = 0, smce the trans- formation x = y + qJ(t) reduces equation (1) to y' = f(t, y + qJ(t)) -f(t, qJ(t)) = F(t, y), F= ( and thus x = qJ(t) is transformed into y = O. If F is continuously differentiable with respect to y, (2) can be written in the form (1) y' = Fy(t, O)y +g(t,y), g(t, y)=o(lyl). The linear part of this equation, y' = Fy(t, O)y, is called the variational equation for (1). So, in this section, we can state several criteria for stabihty of the null solution y =0 of the equation y'=P(t)y+g(t,y), Ig(t,y)l=o(lyl). (3) (I) If (3) is linear (i.e., g(t, y) = 0). then y = 0 is stable if and only if every solution of (3) is bounded as t......oo. (II) If (3) is linear, uniform asymptotic sta- bility implies exponential stability. Let {(t, y) be a function defined for I yl < p, t > ex If there exists a continuous function w(y) such that w(O) = 0, w(y) > 0 (y of- 0). f(t, y)  w(y) (Iyl < p, t> a), then f(t, y) is said to be positive definite. If - f(t, y) is positive definite, then f(t, y) is said to be negative definite. (1I1) The existence of a Lyapunov function V(t, y) with the following properties implies the stability of y = 0: (i) V(t, y) is positive definite and differentiable, (ii) V(t, 0) = 0, (iii) V(t, y) = v,+ Vy'(P(t)y+g(t,y))«O. The existence of V(t, y) with the following properties implies the uniform asymptotic stability of y == 0: (i) same as (i) above, (ii) there exists a continuous function v(y) such that 
1461 v(O)=o, v(y»O (y#O), V(t, y)v(y), (iii) V(t, y) is negative definite. Hereafter we shall assume that Ig(t, y)1 = o(lyl) as y->O uniformly with respect to t. (IV) If P(t) is a constant matrix all of whose teigenvalues have negative real parts, then y == 0 is asymptotically stable [3,4]. (V) Let P(t) be continuous and periodic with penod T and Z be a tfundamental system of solutions of the variational equation z' = P(t)z. Then there exists a constant matrix C such that Z(t + T)=Z(t)c. Let AI, ..., An be the eigenvalues of C. Then the num bers J1k = (log Ad/T (k = 1, ..., n) are called the charac- teristic exponents of (4). Obviously they are determined up to integral multiples of 2ni/T. If the real parts of the characteristic exponents are all negative, then y == 0 is asymptotically stable [3,4]. (VI) Iff(t,x) in (I) is periodic in t with period T and (1) admits a periodic solution x = qJ(t) with period T, then (1) can be reduced to (3) by putting x = y + qJ(t), and P(t), g(t, y) are both periodic in t with period T. Thus criterion (V) can be applied as a stability criterion for the periodic solution of (1). There are many other criteria for various particular forms of the equation ( 290 Nonlinear Oscillation). For the tautonomous case where f(t, x) is of the form p(x) or p(x) + q(t) with q(t + T) = q(t), many results have been found. (VII) If the solution z == 0 of z' = P(t)z is uniformly asymptotically stable, then the solution y == 0 of (3) is also uniformly asymp- totically stable [4]. D. Conditional Stability Let qJ(t) be a solution and  a family of solu- tions of (1). If for any £ > 0, a b > 0 can be determined so that Ix(to) - qJ(to)1 < b implies Ix(t) - qJ(t)1 < c; for to  t < ex) for any solution x(t) in , then qJ(t) is said to be stable with respect to the family . If a family  can be found so that a solution is stable with respect to , the solution is said to be conditionally stable. For instance, in equation (3), if P(t) is a constant matrix some of whose eigenvalues have negative real parts, g(t, y) is differentiable with respect to y, and gy(t, y) = 0(1) uniformly in t as y->O, then y==O is conditionally stable. We now mention a weaker kind of stabil- ity called orbital stability. Let qJ(t) be a solu- tion and £ any positive number. If there can be found a positive number b such that for 394 Ref. Stability (4) any solution x(t) with Ix(t 11 - qJ(t o ) I < b for some to and t 1 , U,,<;t<wx(t) belongs to the f;- neighborhood of Uto<;t< w qJ(t), then qJ(t) is said to have orbital stability. When f(t, x) in equation (1) is independent of t, (I) is often called a tdynamical system. In the theory of dynamical systems, not merely the stability of a solution itself but also the sta- bility of a closed invariant set is of importance ( 126 Dynamical Systems). It is also of importance to investigate the change in solution caused by a small change in the right-hand member of the equation. Sup- pose, for instance, that the right-hand member of the equation depends continuously on a parameter £. Then the question arises as to how the solution changes if £ changes. The theory of such problems is called tperturbation theory Suppose that the equation x' =f(t, x, £) admits a periodic solution qJ(t) for £=0. Then qJ(t) is said to be stable under perturbation if for £ #0 the same equation admits a periodic solution lying near qJ(t). In tcelestial mechanics and tnonlinear oscillation theory this concept plays an important role. References [IJ R E Bellman, Stability theory of differen- tial equations, McGraw-Hill, 1953. [2J L. Cesari, Asymptotic behavior and sta- bility problems in ordinary differential equa- tions, Springer, third edition, 1971. [JJ E. A. Coddington and N. Levinson, Theory of ordinary differential equations, McGraw-Hill,1955. [4J S. Lefschetz, Differential equations, geo- metric theory, Interscience, 1957. [5J A. Lyapunov (Ljapunov), Probleme gen- eral de la stabilite du mouvement, Ann. Math. Studies, Princeton Univ. Press, 1947. [6J V. V. Nemytskii (Nemyckii) and V. V. Stepanov, Qualitative theory of differential equations, Princeton Univ. Press, 1960. (Original in Russian, 1947.) [7J O. Perron, Ober eine Matrixtransforma- tion, Math Z., 32 (1930), 465-473. [8J O. Perron, Die StabiliHitsfrage bei Dif- ferentialgleichungen, Math. Z., 32 (1930), 703- 728. [9J A. Halanay, Differential equations, sta- bility, oscillations, time lags, Academic Press, 1966. [IOJ T. Y oshizawa, Stability theory by Lja- punov's second method, Pub!. Math. Soc. Japan, 1966. 
395 A Stationary Processes 395 (XVI1.12) Stationary Processes A. Definitions Stationary process IS a general name given to all tstochastic processes ( 407 Stochastic Processes) that have the property of being stationary (to be defined in the next para- graph) under a shift of a time parameter t that extends over T, which is either the set of all real numbers R (a continuous parameter) or the set of all integers Z (a discrete parameter). Let (Q, 113, P) be a tprobability space and {X,(w)} (tE T, wEQ) a complex-valued tsto- chastic process. If for every n, every t l' t 2 , ... , tnE T, and every tBorel subset En of complex n-dimensional space en, the equality P«(X" +" "', X'n+t)E En) =P«X", ...,X,)EE n ) holds, then {X,} is called a strongly (or strictly) stationary process; while if E(IX,1 2 ) is finite for every t, and if the tmoments up to the second order are stationary, i.e., if E(X'+h)=E(X,), E(X'+hXs+h) = E(X,X,), then {X,} is called a weakly stationary process or a stationary process in the wider sense. The "stationary" in the latter sense obviously in- cludes the former if E(IX,1 2 )<e1J. Condition (2) is equivalent to E(X,) = m (a constant independent of t), E(X,-m)(Xs-m))= p(t -s) (a function of t - s). (3) We call m and p(t) the mean and the covari- ance function of {X,}. In the continuous parameter case, we as- sume tcontinuity in probability, limP(IX,th-X,I>£)=O, £>0, hO for a strongly stationary process, and continu- ity in mean square, lim E(IX'+h-X,12)=0, hO for a weakly stationary process. The latter assumption is equivalent to continuity of the covariance function p(t). A tGaussian process is strongly stationary if and only if it is we'ikly stationary; and so it is simply called a tstationary Gaussian process. Such processes constitute a typical class of stationary processes ( 176 Gaussian Pro- cesses C). 1462 B. Spectral Decomposition of Weakly Stationary Processes (1 ) The covariance function p(t) is obviously tpositive definite and continuous. Therefore, by tBochner's theorem, we have the tspectral decomposition of p(t): p(t) = L e iA ' F(d2), where T' is either R (when T= R) or [ - n, n] (when T= Z) and F is a bounded measure on T. The decomposition (4) is called the Khin- chin decomposition of p(t), and F(d2) is called the spectral measure. If the process {X,} is real- valued, then the spectral measure F(d2) is symmetric with respect to the origin. To obtain the spectral decomposition of a weakly stationary process X, itself. we intro- duce the tHilbert space L 2 (Q) (where Q = Q(!B, P) is the basic probability space on which each X, is regarded as a tsquare inte- grable function). Let 9Jl(X) be the subspace of L 2 (Q) spanned by the X, (tE T) and the con- stant function 1. Since {X,} is weakly station- ary, we can define a one-parameter group of tunitary operators V, (t E T) determined by V,X s = X'+s and V,I = 1. By tStone's theorem we have the spectral decomposition of V,: (4) (2) V,= f eiA'E(d2). T' (5) Setting M(A)=E(A)(Xo-m), we obtain the spectral decomposition of X,: f iA' d ' X,= V,Xo=m+ e M( 11.). T' (6) We also have (M(A),I)=O, (M(Ad, M(A 2 ))=F(A 1 nA 2 ). (7) The study of weakly stationary processes is based on the decomposition (6). For example, the weak law of large numbers for {X,}, 1 f B l.i.m. - X,dt=m+M({O}), B-A",B-A A (8) is an immediate consequence of (6). In the discrete parameter case a similar result is obtained by replacing the integral sign in expression (8) by the summation sign. In partic- ular, if F is continuous at the origin, we have M( {O} )=0, and only the constant m remains in the right-hand side of (8) [1,2]. C. Weakly Stationary Random Distributions Just as we introduce tdistributions as gen- eralizations of ordinary functions, we define 
1463 weakly stationary random distributions as generalizations of weakly stationary processes. Let q; be the space of all functions of class Coo on T= R with compact support. We introduce the same topology on q; as in the theory of distributions. If a random variable X", E Lz (Q) is defined for every rp E q; and the mapping rp --> X", is continuous in the Lz-sense and linear, then the family {X",} of random variables is called a random distribution in the wider sense ( 407 Stochastic Processes). Furthermore, if (X"""I)=(X""I), (X""" X"",) = (X"" X",) (9) for every hE R 1, where ( , ) stands for the inner product in Lz(Q) and Thrp(t)=rp(t-h), then {X",} is said to be a weakly stationary random distribution. With a weakly stationary process we can associate a weakly stationary random distribution by the relation X",= t X,rp(t)dt. This correspondence is one-to-one, and there- fore we can identify {X,} with {X",} as we identify an ordinary function with a distri- bution. From equations (9) it follows that there exist a constant m and a distribution p such that E(X",)=mS rp(t)dt and E(X",- E(X",) )(X",-E(X",)) =p(rp*), where * de- notes convolution, and (t) = 1/;( - t) . We call m and p the mean value and covariance distri- bution of {X"'}, respectively. By the generalized Bochner theorem p can be expressed in the form p(rp) = I cjJ(A)F(dA), (p(A) = I e i )" rp(t)dt, where F(d),) is a slowly increasing measure, i.e., I(I +),2)-kF(dA) < 00 for some positive integer k. F(d)") is called the spectral measure. This expression is the gen- eralization of the Khinchin decomposition. The spectral decomposition corresponding to (6) and the tlaw of large numbers for X", can be discussed in a manner similar to that for weakly stationary processes (K. It6 [3]). D. Prediction Theory Let {X,} be a weakly stationary process. Sup- pose that its values X, (s  t) up to time tare observed. Prediction theory deals with the problem of forecasting the future value X,+, 395D Stationary Processes (T > 0) from the known values Xs (s  t). If the domain of the admissible predictors is limited to linear functions of Xs (s  t), the theory is called linear prediction theory. We can assume without loss of generality that the mean value m of X, is zero and that the spectral measure F(dA) of {X,} is not a zero measure. Let vit,(X) be the subspace of Lz(Q) spanned by the Xs (s t), then vit(X) = V,vit,(X). A linear predictor for X,+, from Xs (s  t) is an element Y of vit,(X). If a linear predictor minimizes the prediction error (J"z(T)=E(IX,+,- Y12) in vit,(X), it is called an optimum linear predictor, which turns out to be the t(orthogonal) projec- tion of X,+, on vit,(X) and which is denoted by X,.,. Since {X,} is stationary, the error (J"z(T) does not depend on t for such a predictor. Corresponding to the spectral decomposition (6) of X" the optimum linear predictor is ex- pressed in the form x,.,= f ei),'cjJ,(A)M(dA), T' (12) (10) where cjJ,(. ) is square integrable with respect to the spectral measure F(dA). The subspace vit,(X) is non decreasing in 1. If vit,(X) is independent of t, i.e., vit,(X) = vit(X) for every t, then {X,} is said to be determinis- tic. In this case we have X,., = X,+, for every t and T>O, since X,+,Evit,(X). This means that the linear predictor enables us to determine the unknown quantities without error, and therefore such a process is of no probabilistic interest. On the other hand, if n,vit,(X) = {O}, then {X,} is said to be purely nondeterministic. A general {X,} is expressed as the sum of the deterministic part {xf} and the purely nonde- terministic part {X;} (Wold decomposition). Furthermore, we have vit(X d ) = n,vit,(X), and vit,(X) = vit(X d ) + vit,(X") (direct sum). Thus {Xt} and {X,"} can be dealt with separately. A weakly stationary process {X,} is purely nondeterministic if and only if the spectral measure F(dA) is absolutely continuous with respect to the Lebesgue measure, and the density f(A) is positive almost everywhere and satisfies (II) fn 10gf(A)dA> -00 (discrete parameter case), I oo 1 0gfP') d "  A>-OO -00 I +A Z (continuous parameter case). By using f(A) the optimum linear predictor can be obtained. First, we explain the discrete parameter case. There exists a function y(z) = L,'o a,z' in the tHardy class Hz relative to the unit disk 
395D Stationary Processes such that its boundary value satisfies f(l) = (1 j2n) I }'(e -iJ) 1 2 . Then we can find a sequence of mutually or- thogonal random variables {,} (I E Z) such that {X,} admits a backward moving average representa tion , X,= L a,_,\. There are many pairs {a,} and {,} which give the representation (13), but if y(z) is maximal (optimal), namely, if y(z) is expressed as ( I I n e-iJ.+ z ) y(z)= l2;exp - 10gj(A)d)., y' L.IL 4n -n e u. - z (14) then the representation (13) is canonical in the sense that 04f,(X) = jf,() for every I. Hence the optimum predictor X,., for X,+, is given by , I n X,.,= F2:.,. a'h-S'= -n eiA'cjJ,(A)M(d).), where (p,(ic)= ei:J ( y(e- iJ .)_ 'f a,e- iJ ., ) . y(e) FQ The prediction error (J"2('r) of this predictor is given by ,-1 (J"2(T)= L la s l2. os::::Q Example. Let the covariance function of a weakly stationary process {Xt} be e-al'l (a>O). Then we have W) = (I j2nJ( I - fi2) II - fie -"1- 2 , fi = e -a The maxImal 'y(z) is expressed as J1=fi2(1 - fiz) -', and X = I n eiA< e ih . I (i'e-ik'M(dA) 1.t -n: 1- fie-I). =T =fi'X" (J"2(T) = 1-2fi'e- a '+fi2'= l_e- 2a , We now come to the continuous parameter case. By replacing the holomorphic function y(z) on the unit disk with the one on the half- plane, we see that almost all results obtained in the discrete parameter case hold similarly in this case. The maximal y(z) is expressed as ( 1 I m I + AZ dA ) y(z) =  exp  10gj(A)-----=- ) _ I ) 2 . 2m -,x, z. +. Using the tFourier transform a, of the bound- ary functIOn of y(z) and a process {,} with orthogonal increments, we have the canonical backward moving average representation for 1464 the process {X,}: X, = I ' a,_,d" -x' (13) which enables us to obtain the optimum pre- dictor and the prediction error in a manner similar to the discrete parameter case [4]. In particular, if the optimal y(z) is of the form y(z) = cj P(iz), where c is a constant and P(z) is a polynomial of degree p, then X, is p - I times differentiable and P(djdl)X, = (djdl), up to a multiplicative constant; therefore X,., is ob- tained explicitly. To obtain the optimum linear predictor for Y E A'1(X), we first establish the expression (15) Y=sJ)(S)s or f:xj(S)d" and then take L-",. r(s)s or SC:-x;j(s)d, for the optimum linear predictor. The results stated above can be generalized to multivariate (n-dimensional) stationary processes [6,7J and to the case where the parameter space T is multidimensIOnal. N. Wiener observed the individual tsample process X (I, OJ) and discussed a method of finding the optimum predictor for X (I + T, OJ) by using a linear functional I'" X(I-s,OJ)dK(s) (K is of tbounded variation) of the values Xs (s:( I) [8]. The spectral measure played an important role in his observation. Calculations in this case are analogous to those mentioned above. For a weakly stationary random distribu- tion {X(<p)} (cpE!Z), the prediction theory is reduced to that for ordinary stationary pro- cesses. Assume that the spectral measure F(d;c) of {X(cp)} satisfies (\1). Set e(I)=exp(l) (1:(0), =0 (1)0), and let ek(l) be the k-times convo- lution of e(l) with itself. Set Y(cp) = X(e k * cp). Then {Y(<p)} is equivalent to a weakly station- ary process. It is obvious that .4"t,(X)=.4t,(Y) for every I, where .4"t,(X) is the linear subspace spanned by {X(cp) I support of <p c( -00, IJ}. This consideration allows us to reduce the prediction problem for {X(cp)} to that for the stationary process corresponding to {Y(cp)}. Nonhnear prediction theory is formulated as follows. Let , be the smallest (J"-algebra with respect to which every Xs (s:( I) is mea- surable and H,(X) be the subspace of L 2 (Q) consisting of all ,-measurable elements. The problem is to forecast X,+, (T >0) by using an element of H,(X). The optimum predictor is obviously equal to E(X,+,I 'S,). For a station- ary Gaussian process it has been proved that 
1465 the optimum predictor found in H,(X) belongs to .tt,(X). Therefore the optimum nonlinear predictor coincides with the optimum linear predictor. However, except for stationary Gaussian processes, no systematic approach for nonlinear prediction theory has been es- tablished so far. (For a typical case that arises from a stationary Gaussian process  176 Gaussian Processes H.) E. Interpolation and Filtering Interpolation and filtering of stationary pro- cesses have many similarities with prediction theory, both in the formulation of the prob- lems and in their method of solution. Let {X,} be a weakly stationary process, all of whose values {X, I tf Tr}. T 1 some interval, are known with the exception of those at t E T 1 . The problem of linear interpolation of the unknown value X, (tE TrJ is to find the best approximation of this random variable by the limit oflinear combinations of the known values. The following example illustrates the problem in the discrete parameter case. Example. Let T 1 = {to} and f(A.)dA. be the spectral measure of {X,}. The interpolation of X'o has an error if and only if f . 1 -d),<CIJ. -.f(A) Expressing X, in the form (6) with m=O, the best (linear) interpolation X'o of X'o is given by f . eiAto ( I-2n ( f(A.) f ' djl ) -I ) M(dA.)' -. -.f(jl) and the error of the interpolation is expressed by (f ' 1 ) -1 E ( IX, -X, 1 2 )=471: 2 -dA. o 0 -.f(A) The problem of interpolation for multivariate stationary processes has also been discussed [7]. The filtering problem originated in com- munication theory as a technique to extract the relevant component from a received signal with noise [8, 13]. Suppose that a complex- valued stationary process {X,} with continu- ous parameter is expressed in the form X,= fro eiAtM(dA.)=S,+N" where (S" N,) is a (2-dimensional) weakly stationary process with mean vector O. Here, S, and N, indicate the signal and noise, respec- tively. The filtering problem is to find the element of A,(X) that approximates S,+t as 395 F Stationary Processes closely (relative to the A(X)-norm) as possible. The best approximation is the projection of S'H on A,(X), but its expression in terms of the spectral measure becomes extremely com- plicated [8]. This problem is usually discussed under the assumption that S, and N, are ortho- gonal. Let us further assume that their spectral measures are absolutely continuous. The den- sity functions are denoted by fsP.) and fN(A.), respectively. If {X, 1 tE T} is observed, then the best (linear) approximation 5, of S, is given by 5,= fro eiA'qJo(A.)M(dA), where qJo(A.) = fs(A.)/Us(A.) + fN(A.)). The mean square error E(IS,-5,1 2 ) of this filtering is fro fS(A)fN(A)/US(A) + fN(A))dA.. F. Strongly Stationary Processes and Flows Let {X,(w)} (tET, wEQ(!B,P)) be a strongly stationary process. To study it we take the coordinate representation of {X,} as follows. Let Q be the complex vector space C T , !B the (J- algebra generated by the Borel tcylinder sets, X,(w) the tth coordinate of the function WE C T , and P the probability distribution of the process {X,} defined on (Q, !B). Define the shift transformation S, of Q onto itself by (S,w) (s)=w(s+t). Then {S,} forms a group of tmeasure-preserving transformations on Q(!B, P) ( 136 Ergodic Theory) since {X,} is strongly stationary. Thus we are given a (measure-preserving) tflow {S,} (t E T) on Q(!B, P). Conversely, if {S,} (tE T) is a (measure-preserving) flow on a probability space Q(!B, P), then {X,}, given by X,(w) = f(S,w), is a strongly stationary process, pro- vided that f is measurable. Many properties of a strongly stationary process are closely related to those of the corresponding flow. For example, the tstrong law oflarge num- bers for a strongly stationary process follows from tBirkhoff's individual ergodic theorem for flows. tErgodicity, several kinds of tmixing properties, and the spectral properties of a strongly stationary process are defined in accordance with the respective notions for the corresponding flow. Now we give some examples of flows corresponding to strongly stationary processes. (1) If X, (tEZ) are mutually independent and have the same probability distribution, then the process {X,} (t E Z) is strongly stationary and n,!B, is trivial (the definition of !B,-->D). Hence the corresponding flow is a tKolmo- gorov flow. 
395 G Stationary Processes (2) Similarly to (1), the flow corresponding to tGaussian white noise ( 176 Gaussian Process) is also a Kolmogorov flow. (3) The mixing properties of the flow corre- sponding to a stationary Gaussian process is determined by the smoothness of its spectral measure F(d)"). The flow is ergodic if and only if F is continuous (i.e., F has no point mass). In this case, the flow is also tweakly mixing. For the flow to be tstrongly mixing, it is necessary and sufficient that the covariance function p(t) of the process tend to zero as I tl-HXJ. In this case the flow is tmixing of all orders ( 136 Ergodic Theory) [14-16]. G. Analytic Properties of Sample Functions of Stationary Processes In the continuous parameter case, we always assume that tcontinuity in probability holds for strongly stationary processes and tmean square continuity holds for weakly stationary processes. Hence the processes discussed here are all continuous in probability, and without loss of generality we can assume that the sta- tionary processes are tseparable and tmea- surable ( 407 Stochastic Processes). Let {X,} be a weakly stationary process. Assume that the moments up to order 2n of the spectral measure F(dic) are all finite. Then almost all tsample functions of {X,} are n - 1 times continuously differentiable, and almost all sample functions of {x!n-l)} are absolutely continuous. Define the spectral distribution function FV) = F( ( - 00,2]) for the spectral measure F(d)") of {X,}. If F satisfies " I I nlr(F(n + 1) - F(n)j1/2 < 00 n=-7:) for a nonnegative integer r, then almost all sample functions of {X,} have continuous rth derivatives. In particular, if the condition (16) is satisfied for r=O, then almost all sam- ple functions are continuous. Conditions for Holder continuity of almost all sample func- tions of a weakly stationary process have also been obtained [17, 18]. (For sample functions of stationary Gaussian processes  176 Gaussian Processes F.) For a strongly stationary process {X,} with E(Xo)=O and finite E(XJ), the sample covari- ance function 1 f T - R(t)= lim - X,+s(w)Xs(w)ds Too 2T -1' is determined with probability 1. We can therefore apply the theory of tgeneralized harmonic analysis, due to Wiener [9]. (Further results on sample function properties are found III [19].) 1466 H. Strongly Stationary Random Distributions Let £iI be the space of all CX-functions with compact support and £ii' be the space of tdistri- butions. If X",(w) is defined for WEn(!B, P) and rp E '2!, and iffor almost all w, X",(w) belongs to £ii' as a linear functional of rp, then {X",} is called a random distribution. Suppose that the joint distribution of the random variables X'h'" , , X'h"'" ..., X'h"'. (Thrp(t) = rp(t - h)) is inde- pendent of h. Then we caIl {X",(w)} a strongly (or strictly) stationary random distribution. If we identify random distributions that have the same probability law, then {X",} is determined by the characteristic functional C(rp)= E(e iX .). For {X",} to be strictIy stationary it is neces- sary and sufficient that the equality C(Thrp) = C(rp) hold. The simplest example of a strictIy stationary random distribution is the Gaussian white noise ( 176 Gaussian Processes, 341 Probability Measures) [20]. I. Generalizations of Stationary Processes (16) The concept of stationary processes is gen- eralized in many directions. (1) Let T be a set different from R or Z, and suppose that there is given a group G of trans- formations that map T onto itself. If a family {X,} of random variables with parameter t E T has the property that for every choice of ran- dom variables X", X", ..., X,., the joint distri- bution of (X g " , ..., X q ,) is always independent of 9 E G, then {X,} (t E T) is said to be a strictly G-stationary system of random variables. Similarly, a weakly G-stationary system of random variables can be defined [21,22]. (2) Let T be a Riemannian space, and let G be the group of all isometric transformations on T or one of its subgroups. Suppose that a ttensor field X,(w) of constant rank is asso- ciated with any WE n(!B, P) at every point t. Then X(w) == {X,(w) I tE T} is caIled a random tensor field over the Riemannian space T. Any 9 E G induces an isometric transformation of the tangent vector space at t to that at gt. Hence 9 maps a tensor field X(w) to another tensor field gX(w) for every w. If X(w) and gX(w) have the same probability law, then X(w) is said to be strictly G-stationary. X(w) is defined to be weakly G-stationary in a similar way [21, 22]. (3) In the same way as we extended stochas- tic processes to random distributions, we can generalize random tensor fields to random currents and discuss stationary random cur- rents [21]. (4) Stochastic process with stationary incre- 
1467 ments of order n. Assume that {X,} (tER) is not necessarily a stationary process but that the nth-order increment of X, is stationary. Then by taking the nth derivative D" X, in the sense of random distributions, we obtain a station- ary random distribution. From the properties of D" X, we can investigate the original process itself. Brownian motion is an example of a stochastic process with stationary increments of order I. (5) Weakly stationary processes of degree k. A weakly stationary process is a process whose moments up to order 2 are stationary. Gen- eralizing this, we can define a weakly station- ary process of degree k by requiring the mo- ments up to order k to be stationary. We can obtain more detailed properties of such pro- cesses than those of weakly stationary pro- cesses [23]. References [IJ J. L. Doob, Stochastic processes, Wiley, 1953. [2J U. Grenander and M. Rosenblatt, Statis- tical analysis of stationary time series, Wiley, 1957. [3J K. Ito, Stationary random distributions, Mem. CoIl. Sci. Univ. Kyoto, (A) 28 (1954), 209- 223. [4J K. Karhunen, Dber die Struktur station- arer zufalliger Funktionen, Ark. Mat., 1 (1950), 141-160. [5J H. Cramer, On the linear prediction prob- lem for certain stochastic processes, Ark. Mat., 4 (1963),45-53 [6J N. Wiener and P. Masani, The prediction theory of multivariate stochastic processes I, II, Acta Math., 98 (1957),111-150; 99 (1958), 93-137 [7J Yu. A. Rozanov, Stationary random pro- cesses, Holden-Day, 1967. (Original in Rus- sian, 1963.) [8J N. Wiener, Extrapolation, interpolation, and smoothing of stationary time series, MIT Press, 1949. [9J N. Wiener, Generalized harmonic analysis, Acta Math., 55 (1930), 117-258. [IOJ N. Wiener, The homogeneous chaos, Amer. J. Math., 60 (1938), 897-936. [11J N. Wiener, Nonlinear problems in ran- dom theory, MIT Press, 1958. [12J P. Masani and N. Wiener. Non-linear prediction, in Probability and Statistics: The H. Cramer Volume, Wiley, 1959, 190-212. [13J A. Blanc-Lapierre and R. Fortet, Theorie des fonctions aleatoires, Masson, 1953 [14J G. Maruyama, The harmonic analysis of stationary stochastic processes, Mem. Fac. Sci. Kyushu Univ., (A) 4 (1949), 45-106. 396 B Statistic [15J S. Kakutani, Spectral analysis of station- ary Gaussian processes, Proc. 4th Berkeley Symp. Math. Stat. Prob. II, Univ. of California Press (1961),239-247. [16J H. Totoki, The mixing property of Gauss- ian flows, Mem. Fac. Sci. Kyushu Univ., (A) 18 (1964),136-139. [17J 1. Kubo, On a necessary condition for the sample path continuity of weakly stationary processes, Nagoya Math. J., 38 (1970), 103- 111. [18J T. Kawata and I. Kubo, Sample prop- erties of weakly stationary processes, Nagoya Math. J., 39 (1970), 7-21. [19J H. Cramer and M. R. Leadbetter, Sta- tionary and related processes, Wiley, 1967. [20J 1. M. Gel'fand and N. Ya. Vilenkin, Gen- eralized functions IV, Academic Press, 1964. (Original in Russian, 1961.) [21J K. Ito, Isotropic random current, Proc. 3rd Berkeley Symp. Math. Stat. Prob. II, Univ. of California Press (1956), 125-132. [22J A. M. Yaglom, Second-order homo- geneous random fields, Proc. 4th Berkeley Symp. Math. Stat. Prob. II, U niv. of California Press (1961), 593-622 [23J A. N. Shiryaev, Some problems in the spectral theory of higher-order moments I, Theory Prob. Appl., 5 (1960), 265-284. (Orig- inal in Russian, 1960.) 396 (XVIII.3) Statistic A. General Remarks A statistic is a function of a value (i.e., a sample value) observed in the process of sta- tistical inference ( 401 Statistical Inference). A statistic is used for two purposes: (a) to characterize the set of observed values or sample values, and (b) to summarize the in- formation contained in the sample about the unknown parameters of the population from which it is assumed to have been drawn. B. Samples and Statistics The basic concepts in statistical inference are tpopulation and tsample. Let (Q, q), P) be a tprobability space, where P is a tprobability measure on f!lJ. A trandom variable X defines a I-dimensional probability distribution <1>(A) =P{WIX(W)EA}, where A is a I-dimensional tBorel set, which gives rise to a I-dimensional probability space (R, f!lJ1, <1». Here :!Jl is the 
396C Statistic family of aliI-dimensional tBorel sets. Let Xl, X 2 , ..., X n be tindependent random vari- ables with identical I-dimensional distribu- tions. The tn-dimensional random variable X =(Xj, X 2 ,..., X n ) is called a random sample of size n from the population (Q,:Jl, P). In particular, when each of Xl,"', X n takes only two values (usually 0 and I), the sample is called a Bernoulli sample or a sequence of Bernoulli trials. Generally, if <1>n is the tn- ,dimensional probability distribution deter- mined by X (i.e., the direct product of n copies of the I-dimensional probability distribu- tion <1», then the n-dimensional probability space (R n , :Jln, <1>n), where :Jln is the family of n-dimensional Borel sets, is called an n- dimensional sample space. A point belonging to the set of actually observed values of the sample X, which is a random variable by definition, is called a sample value and is de- noted by x. Thus the sample value can be expressed as x= X(w) (WEQ) and regarded as a point in the sample space (sample point). The basic underlying structure which determines the probability distribution is the set 0., which we can view as describing the physical struc- ture of the 0 bserved phenomena, but statistical procedures are always carried out through the observations of samples, and 0. itself is often disregarded. The I-dimensional probability distribution <1> (the n-dimensional probability distribution <1>n determined by X) is called the population distribution in the I-dimensional (n- dimensional) sample space, since it is induced from the pro ba bility measure on (0., :Jl). A statistic Y is a random variable expressed as Y = f(X), where f is a tmeasurable func- tion from the sample space (R n , :Jln, <1>n) into a measurable space (R, :Jl1). The value of the statistic Y corresponding to a sample value x of the sample X is denoted by y= f(x). When we deal with a statistical problem we often have no exact knowledge of the popula- tion distribution <1>(<1>n) except that it belongs to a family [lJ>= {Pol I)Ee} of probability mea- sures on :Jl1 (:Jl n ). We call I) the parameter of the probability distribution and e the para- meter space. The typical cases described in this section can be extended as follows: (1) The distribution <1> may be an r-dimensional proba- bility distribution. In this case a sample of size n induces an nr-dimensional sample space. (2) Random variables XI' "', X n , being mutually independent, may not have identical distri- butions. (3) Random variables XI' "', X n may not be mutually independent. In both cases (2) and (3) the sample space is of the form (R n , :Jln, <1>n), but n may not be the sample size itself, nor <1>n be the direct product of n copies of identical I-dimensional components. (4) The most general sample space is expressed as a 1468 certain measurable space (.0£, d) and a family [lJ>= {Pol BEe} of probability measures on d. A statistic, in general, is a random variable expressed as Y = f(X) by a measurable func- tion f defined on a sample space (.or, d) taking values in another meaurable space ('1?1, '6'). When ('1?1, '6') is (R,:Jl1) or (Rn, :Jln), Y = f(X) is accordingly called a l-dimensional or n- dimensional statistic. C. Population and Sample Characteristics in the I-Dimensional Case In a I-dimensional probability space (R, :Jl1, Po) the following quantities, called popu- lation characteristics, are used to characterize the population distribution Po: Letting F(z) = Po( ( - 00, z]) be the tdistribution function of Po, we use the population mean J1 = S z dF(z); the population variance a 2 = S (z - J1)2 dF(z); the population standard deviation a(;:' 0); the popu- lation moment of order k J1k= S zkdF(z) (J1'1 =J1); the kurtosis J14/a4; the coefficient of excess (J14/a4) - 3; the skewness J13/a3; the a-quantile or 100a%-point m satisfying F(m-O)a F(m+O); the median, which is the 50%-point; the first and third quartiles, which are the 25%-point and 75%-point, respectively; the range, which is the third quartile minus the first quartile; and the mode, which is the value or values of z for which dF(z)/dz attains its maximum. Sometimes the kurtosis and others are called population kurtosis, etc. Here the word "population" is used when it is desirable to distinguish population characteristics from the sample characteristics defined in 341 Proba- bility Measures. Let x=(x p "', x n ) be a point of an n- dimensional sample space (a sample value). Corresponding to each I-dimensional Borel set A, the num ber of components of x that belong to A is called the frequency of A in the sample value x = (xl' ... , x n ), and (frequency)/n is called the relative frequency of A. If we take A = (-00, z] and regard its relative frequency Fx(z) as a function of z, it becomes a tdistri- bution function for every x ERn, called the empirical distribution function based on x. Various characteristics can be defined from the empirical distribution function in exactly the same way as population characteristics are derived from a population distribution func- tion. These are called sample characteristic values and can be expressed as functions of Xl' ...,x n . Assuming that x = (x I' '" ,x n ) is a sample value of a sample X = (XI, '" ,X n ), the statistic obtained by substituting X for x in the func- tion denoting a sample characteristic value is 
1469 called a sample characteristic and given the same name as the corresponding population characteristic, except that the word "popula- tion" is replaced by the word "sample." A sample characteristic is a function of random variables. Hence it is also a random variable, and the problem of deriving its probability distribution from the assumed population distribution is called that of sampling distri- bution ( 374 Sampling Distributions). Thus we define the sample mean X = l:?1 X;/n; the sample variance l:?1 (Xi - X)2/n (sometimes l:?=,(X i -Xf/(n-l) is taken as the sample variance); the sample standard deviation n 2:: (X i -X)2/ n , i=l which is the positive square root of the sample variance; the sample mode, which is the value taken by the largest number of Xi; and the sample moment of order k l:?=t (Xi-XNn. Among other statistics of frequent use are the order statistic, i.e., the set of values of XI' '" ,X n arranged in order of magnitude and usually denoted by X(I) < X(2) < ... < X(n)' Various other statistics are defined in terms of order statistics: the sample median X med = X(n+1)/2) for odd nand = (X(n/2) + X((n/2)+I))/2 for even n, the sample range R = max X i - minXi=X(n)-X(,,, and so on. The empir- ical distribution function Fx(z) or its standard- ized form SAz)={Fx(z)-F(z)} can also be considered to be a function of the order sta- tistics, and hence is a statistic taking values in the space of functions of a real variable. So is the empirical characteristic function l/JAt) = f exp(itz) dFAz) =  exp(itx(j))/n. In a sequence of Bernoulli trials, a set of successive components with an identical value is called a run. For example, (01100010) has a run of 0 of the length 3 and a run of 1 of the length 2. Among the statistics listed in the previ- ous paragraphs, the order statistic is an n- dimensional statistic and all others are 1- dimensional. D. Other Cases Let (R 2 , 36'2, Po) be a 2-dimensional probability space with a 2-dimensional population distri- bution Po, and let (XI, ...,X n ) (Xi=(V i , V;)) be a random sample of size n from Po. In this case also, the population characteristics for the tmarginal distributions of Vi and V; and sample characteristics for (V b ... , V n ) and (VI' ... , v,,) are defined as in Section C. As an index for association between Vi and 396 E Sta tistic V; the population covariance S (u - 1l(I))(V- 1l(2))dF(u, v) of Vi and V; and the population correlation coefficient, which is equal to (popu- lation covariance)/0"(1)0"(2)' are defined. Here F(u, v) is the joint distribution function of Vi and V;, 1l(1) and 1l(2) are the respective popula- tion means of Vi and V;, and 0"(1) and 0"(2) are the respective standard deviations. As corre- sponding sample characteristics, we have the sample covariance l:?=1 (Vi - U)(V; - V)/n of (VI' ..., V n ) and (Vp"', V n ) and the sample correlation coefficient n 2:: (Vi - O')(V; - V) i=l ( n ) 1/2 ( n ) 1/2' I (V i - 0')2 i (V;- V)2 where 0 = l:?1 V;/n and V = l:?=1 V;/n. Similarly, statistics of the samples from a population of k-dimensional distribution (k;::' 3) can be defined ( 280 Multivariate Analysis). More generally, in statistical in- ference we encounter samples where observed values may not be mutually independent or identically distributed, but have more com- plicated probability structures. Statistics as functions of such samples are also considered. E. General Properties of Statistics The general theory of statistics has been studied in a measure-theoretic framework. (.'3[, s1, &) is called a statistical structure, where (.'3[, s1) is a measurable space and & is a family of probability measures on (.'3[, s1). A 0"- subfield 36' of s1 (hereafter abbreviated O"-field) is called sufficient for & if for any A E s1 there exists a 36'-measurable conditional proba- bility of A independent of PoE&, that is, a 36'- measurable function <PA(X) satisfying po(AnB)= L <PA(X)dPO(x) for all PoE.OJ> and B E36'. For any two O"-fields 36'1 and 36'2, the nota- tion 36'1 c36'2[&] means that to each set AI in !J6 1 there corresponds an A 2 in 36'2 satisfy- ing Po«A, - A 2 ) U(A 2 - A 1 ))= 0 for all PoEfYJ. When the reverse relation 36'2 c36'1 [&] also holds, we write 36'1 =36'2[&]. For a statistic t which is a measurable func- tion from (.'3[,s1) to (c[Ij, 'C), 36'(t) = {BI BE,I1', t(B)E'C} is a O"-field and is called the O"-field induced by t. If 36'(t) is sufficient for &, t is said to be sufficient for &. Since sufficiency of a statistic means that of a O"-field, we consider only sufficiency of a O"-field. 36' is called necessary if for any sufficient 36'0 we have 36'c36'o[&]. A necessary and sufficient O"-field is called a minimal sufficient O"-field. A 
396F Statistic necessary and sufficient statistic is also called a minimal sufficient statistic Such a statistic does not always exist; .gfJ2 containing a suffi- cient 311 1 is not always sufficient. Jd is said to be complete if for every f!l1- measurable integrable function <p, S" <p(x)dPe(x) = 0 for all P e E/1J' implies Pe({ x I <p(x) # O} ) = o for all PeE.OJ!. f!l1 IS said to be boundedly com- plete if for every bounded f!l1-measurable <p, S,,<p(x)dPe(x)=O for all PeE:Y' implies Pe({xl <p(x)#O} )=0 for all PeE&'. When :!d(t) is (boundedly) complete, t is called (boundedly) complete. If 24 1 e:Ja 2 and f!l1 2 is (boundedly) complete, [yJ1 is also (boundedly) complete. If .;$1 is (boundedly) complete and sufficient and f!l1 2 is minimal sufficient, we have fJd 1 = f!l1 2 [:!I']. F. Dominated Statistical Structure When all PeE&' are absolutely continuous with respect to a O"-finite measure A on .91, then (:?f, .91,.OJ!) is said to be a dominated statistical structure and .0;; is said to be a dominated family of probability distributions. In this case, Pe has the density !e(x) = dPe/dA with respect to A by the Radon-Nikodym theorem. If sf is separable, .0;; is a separable metric space with respect to the metric p(P e " Pe,} = SUPBE cd I Pel (B) - Pe,(B)I. There exists a countable subset f!IJ' = {P e " P e "... } of.o;; such that Pe(N) = 0 for all PeE9' implies Pe(N)=O for all PeE&'. Ifwe put Ao=LjCjPe" cj>O, LjCj= 1, ,1,0 dominates 9, and if [yJ is sufficient for .OJ! we can choose a 311- measurable version of dPe/d},o. Conversely, if there exists a O"-finite measure A such that we can choose a f!l1-measurable version of dPe/d), for all PeEff', then .gfJ is sufficient. If f!IJ is dominated by a O"-finite }., .'% is suffi- cient if and only if there exist a f!l1-measurable ge and an .91 -measurable h independent of 0 satisfying dPe , dA = ge h a.e. (sf, Ie) for all PeE 9. This is called Neyman's factorization theorem. With a dominated statistical structure, there exists a minimal sufficient O"-field, and a O"-field containing a sufficient O"-field is also sufficient. We say that a O"-field :!d is pairwise sufficient for &' if it is sufficient for every pair {P e " Pe,} of measures in :Y. A necessary and sufficient condition for fJd to be sufficient for a domi- nated set & is that :J4 be pairwise sufficient for .0fJ. Recently, a more general statistical struc- ture has been studied. Put SJl'e(tl) = {A I A E .91, tl(A) < oo}. A measure J1 on sf is said to be a localizable measure if there exists ess-supff(tl) for any subfamily .'ffF e .Yie(tl), that is, if there exists a set E E.91 such that tl(A - E) = 0 holds 1470 for all AEff, and tl(A-S)=O for all AE.'ffF implies tl(E - S) = O. A O"-finite measure is localizable. A measure tl is said to have the finite subset property if for any A satisfying 0 < tl(A), there exists a BeA satisfying O<tl(B)< 00. A statistical structure (:?f, sf, i?l') is said to be weakly dominated if f!IJ is dominated by a localizable measure tl with the finite subset property and a density dPe/dtl exists for all PeEi?l'. In this case a minimal sufficient O"-field exists, and a pairwise sufficient O"-field is suffi- cient. For example, let .91 = 2" and;JjJ be the set of all discrete probability measures on .91. :!I' is weakly dominated by the counting mea- sure tl which is localizable on sf. The order statistic is sufficient if &' is domi- nated, XI' ... , X n are mutually independent and identically distributed random variables with :?f=R n , and each PeE.OJ! is invariant under every permutation of the components of the points x = (x j,.. , x n ) in :?f. Moreover, the order statistic is complete if 9 is large enough. For example, we have the following theorem: The order statistic is complete if every P (the component of Pe on f!l1 o ef!l1 1 , OE0) is ab- solutely continuous with respect to the Lebes- gue measure I on Rand {Pe O I OE 0} contains all Peo for which 9(Z)=dPNdl is constant on some finite disjoint intervals in :?f o e R. A similar result holds for discrete distributions. We call 0 a selection parameter when f(x) = c(O)XE)x)h(x), where h(x) is a positive f!l1- measurable function, XE,(X) is the indicator function of a set EeE f!l1, and c(O) is a constant depending on O. Here 0 determines the carrier Ee of fe(x) but does not essentially affect the functional form of fe(x). A necessary and suffi- cient statistic is given by t*(x) = n {Ee I Ee '3 x, PeE.o;;}. Here the class of sets of the form given in the right-hand side of this expression is taken as all, and we set (/&' = {CI C c!{if, t*-l (C)E f!l1}. We call t*(x) the selection statistic. Two examples follow. (I) tUniform distributions. Let (= {(IX, [J) I -00 < a < Ii < oo}, :?f o = R, P be the uniform distribution on (a, [J), and X = (XI' ..., X n ) be a random sample of size n having p,? as its popu- lation distribution. Then Ee = {x I ex  mini Xi  maxjxj[J} andj(x)=(fJ-a)-nXr.o(x). If we put t(x) = (minj Xj, maXj x;), 9' = R 2, and rg = the set of all Borel sets of R 2 , it follows that t(x) = t*(x) [.OJ!], where t*(x) is the selection statistic. Hence t(x) itself is necessary and sufficient. (II) tExponential distributions. Put 0 = ( - 00,(0), :?f o = R, and { ae -,(z- ej z>- 0 ge(Z)= 0 '? O ' , z< , where a is a known constant, and let X = (Xl, ... , X n ) be a random sample of size n 
1471 having a population distribution with density function ge(z). Then Ee={xIOm}nxi}' fe(x)=xnen.eXEo(x)exp( -Xi x} If we put t(x) = minix i and let t*(x) be the selec- tion statistic, it follows that t(x) = t*(x) [27'J, and t(x) is necessary and sufficient. If e = {(a,O)IO<a< 00, -Cfj <0< oo}, then t(x)= {mini Xi' Li Xi} is a necessary and sufficient statistic. G. Exponential Families of Distributions A dominated &' is called an exponential family of distributions if and only if fe(x)=dPe/d}. can be expressed in the form h)(X)=expCt s)x)a)O) +ao(O)+ So (X)). XE.-f, OEe, (1) where the six) (j =0,1, ", k) are real-valued Jl-measurable functions and the aj(O) (j=0, 1. ... ,k) are constants depending on O. If there exists a sufficient statistic for 21 that is not equivalent to but is in a certain sense simpler than the sample itself or the order statistics, then it can be shown under some regularity conditions that &' must be an exponential family. The following theorem provides an instance of the hypotheses that guarantee such a conclusion: Let X be a sample from a I-dimensional probability space (."l, fJlJ o , P e o) with p the population distribution, where !l:o is a finite or infinite interval in Rand fJlJ o is the class of all Borel sets. Let I denote the Lebesgue measure. Assume that {pn is domi- nated by I and ge(Z) = dP /dl is greater than a positive constant and continuously differenti- able in z on !l:o. Assume further that there exists a sufficient statistic t(x) with the prop- erty that for each open subset B of ;;{ (c R n ) and A-null set N there are two points x =1= x' in B - N such that t(x) =I- t(x'). Then [JjJ is an exponential family. and the k given in (1) is less than n. Similar results are known also for cases where X" ... X n are not identically distributed. It is evident from the construction of a necessary and sufficient statistic that the statistic t(x) = (s 1 (x), ..., sdx)), where the Sj( x) are those appearing in (1), is sufficient for an exponential family and necessary if aIlO), ..., adO) arc linearly independent. If {(aIlO), ..., ak(O)) I ° E e} contains a k-dimensional interval, t(x) is complete. The distribution of t(x) is of exponential type. When XI' ..., X n are mutu- 396 I Statistic ally independent with a common distribu- tion of exponential type, the distribution of X = (Xl' "" X n ) is of exponential type, and vice versa. The family (1) of distributions is a special form oftP6lya-type distributions, and various distributions given in (III)-(VII) below are written in this form. In the following exam- ples, for a sample of sIze n from the specified distribution, feix) is the density with respect to Lebesgue measure in (III), (IV), and (V) and to counting measure in (VI) and (VII) ( Appendix A, Table 22) (III) tNormal distributions N(I1, (J2), /(0 = (-00,00). fo(x)=ex p ( - ( f X? ) - 2 1 2 + ( f Xi ) 112 [=1 6 1=1 6 nl1 2 n ) ---nlog(J--log2rr . 2 2 (IV) tr-distributions r(p, (J), .'1'0 = (0,00). fe(x)=exP(p-l)i 10gXi-i x, -nplog(J-nlog r(P))' (V) tExponential distributions e(11, (J), !l:o = (p, (0). fe(x)=ex p ( - ( f xi ) -nIOg(J+n ) . i=l (J (J (VI) tRinomial distributions Bin(N, p), .q[o = {a, 1,... ,N}. fe(x)=ex p ( Ct x}Og l p +nNlog(l-p) - i 10g() ). (VII) tPoisson distributions PIA), ;!l"o = {O, 1,2,3,...}. fe(x)=ex p ( (x}Ogtl- i 10g(Xi!)-n}.) H. Ancillary Statistics A statistic t(x) is called an ancillary statistic when for every element A in d(t), Pe(A) is independent of 0, or in other words, when the distribution of t(x) is independent of O. A suffi- cient condition for a statistic to be ancillary is that it is independent of some sufficient statis- tic. Conversely, an ancillary statistic is inde- pendent of all boundedly complete sufficient statistics. I. Invariant Statistics Suppose that we are given groups of one-to- one measurable transformations G and G on 
396J Statistic q and e, respectively. Suppose also that we are given a thomomorphism 9-9 from G to G satisfying P,,(g-1 B)= Pq,,(B). In this case f7>= {P" II1Ee} is called G-invariant. If G is transi- tive, there exists a fixed element 00 of e that is sent to an arbitrary 11 by an element 9 of G. In this case, 11 is called a transformation para- meter. In particular, if e = R, X is a random sample from a population distribution, and P,,(B)=P"o(B-O), where B-0={xl(x 1 + 0, ...,x n +I1)EB}, then 0 is called a location parameter. When e=(O, (0) and P,,(B)= PI (B/I1), where B/I1={xl(Oxj, ...,l1x n )EB}, 11 is called a scale parameter Now assume that 11 is a com bination of these two kinds of param- eters such that 0 =(rx, {3) (-00 < rx < 00, ° < {3 < (0) and P,,(B)= P"o«B - rx)/fJ), where 00 = (0,1). Then if f7> is an exponential family, (1) of Section G can be written as dP.° 1 ( m ( z - Ct: ) j ) Ye(z)=-i=fjex p j0 0 k j T ' where the k j (j = 0,1, ..., m) are constants. We call t(x) an invariant statistic with re- spect to a general transformation group G when t(gx)=t(x) for all gEG and XE?£'. An invariant statistic is said to be maximal invar- iant with respect to G if, for t(x) = t(x'), there exists a gEG such that x=gx'. If to is maximal invariant with respect to G, a statistic t is invariant under G if and only if to(x)= to (x') implies t(x) = t(x'). When f7> is G-invariant, a set A (Ed) is called G-invariant if gA=A for all gEG. We denote by dO the set of all G-invariant sets in d. .<710 is clearly a a-field. A set A (E d) is called almost G-invariant if gA=A (d,f7» for all gEG. We denote by d* the a-field consisting of all almost G-invariant sets. If f7> is G-invariant, @ is sufficient for f7>, g@ =@ for all 9 E G, and moreover, @o =@* (d, f7», then :JIJ 0 is a suffi- cient a-subfield of dO, where @o=@ndo and @*=@nd*. J. Various Definitions of Sufficiency There are many different definitions of suffi- ciency, and the relations among them have been investigated. A a-field @ is called decision- theoretically sufficient or D-sufficient if for a given d-measurable decision function b there exists a @-measurable decision function b' such that f b(x, E) dP,,(x) = f b'(x, E) dP,,(x) J !l for all E E fiJ, P" E .9, where a decision space (D,) is quite arbitrary. @ is called test suffi- cient if for any given d -measurable test func- tion cp, there exists a @-measurable test func- 1472 tion cp' satisfying E,,(cp) = E,,(cp') for all P"Eflf'. Let (e, (1&') be a measurable space of parameter 11 and e be the set of all probability measures on CC. Moreover we assume that P,,(B) is CC- measurable as a function of 11 for any fixed BEd. For any Ee, we define A. by A.(A x C)= f P,,(A)d(I1), AEd, cel&'. We denote by X the extension of A to d x CC. :JIJ is said to be Bayes sufficient if E;-pxxcld x CC)=E;-pxxcl@ x CC) for all Ee, CECC, that is, the a posteriori distribution on e given d coincides with that given @ for any a priori . When flf' is domi- nated, these definitions coincide with the class- ical definition of sufficiency. Generally, a D- sufficient a-field contains at least one sufficient a-field. A a-field containing a sufficient a-field is Bayes sufficient. Hence Bayes sufficiency follows from D-sufficiency and from classical sufficiency. If a D-sufficient a-field is separable it is sufficient. The notion of prediction sufficiency or ade- quacy was defined by Skibinsky [10]. Let (X, y) be a pair of random variables defined over the probability space (?£' x all, d x @, ). We suppose that X is the sample to be ob- served, and Y is (are) the value(s) of future observation(s) about which we are to make prediction(s) based on X. X and Y have joint probability distribution with an unknown parameter. A statistic T= T(X) or a subfield CC of d is said to be prediction sufficient or adequate if (a) given T, X and Yare condition- ally independent (or given CC, d and @ are conditionally independent or Markov) and (b) T is sufficient for X (CC is sufficient for d). It was proved that in any form of prediction on Y, we may restrict ourselves to the class of procedures that are functions of T (or are CC- measurable). References [lJ D. A. S. Fraser, Nonparametric methods in statistics, Wiley, 1957. [2J E. L. Lehmann, Testing statistical hypoth- eses, Wiley, 1959. [3J S. S. Wilks, Mathematical statistics, Wiley, 1962. [4J R. R. Bahadur, Sufficiency and statistical decision functions, Ann. Math. Statist., 25 (1954),423-462. [5J T. S. Pitcher, A more general property than domination for sets of probability mea- sures, Pacific J. Math., 15 (1965), 597-611. [6J L. Brown, Sufficient statistics III the case of independent random variables, Ann. Math. Statist., 35 (1964), 1456-1474. 
1473 [7J E. W. Barankin and A. P. Maitra, Gen- eralization of the Fisher-Darmois-Koopman- Pitman theorem on sufficient statistics, Sank- hya, ser. A, 25 (1963), 217-244. [8J W J. Hall, R. A. Wijsman, and J. K. Ghosh, The relationship between sufficiency and invariance with applications in sequential analysis, Ann. Math. Statist, 36 (1965), 575- 614. [9J K. K. Roy and R. V. Ramamoorthi, Rela- tionship between Bayes, classical and deci- sion theoretic sufficiency, Sankhya, ser. A, 41 (1979),48-58. [IOJ M. Skibinsky, Adequate subfields and sufficiency, Ann. Math. Statist., 38 (1967), 155- 161 397 (XVIII. 1 ) Statistical Data Analysis A. Statistical Data Statistical data analysis is comprised of a col- lection of mathematical methods whereby we can deal with numerical data obtained through observations, measurements, surveys, or experiments on the "objective" world. The purpose of statistical analysis is to extract the relevant information from that numerical data pertinent to the subject under consideration. The nature and the properties of the subject and also the purpose of the analysis may vary greatly. The subject may be physical, biolog- ical, chemical, sociological, psychological, economic, etc. in nature, and the purpose of the analysis can be purely scientific, as well as technological. medical, or managerial. Because of the great diversity of statistical data, the methods of statistical data analysis and the manner of application should differ greatly from situation to situation; we cannot expect a single unified system of methods to be ap- plicable to all cases. Nevertheless, we have several formal methods of statistical analysis that are more or less mutually related and have been successfully applied to most, if not all, statistical data. Statistical data can be classified into several types according to a few criteria: according to the property of each observation or mea- surement, they can be either quantitative or qualitative; according to whether only one observation is made on each object under investigation or many observations on the same object, they can be either univariate or multivariate; and according to whether the observations are made at one time or consecu- tively in the course of time, they may be either 397 B Statistical Data Analysis cross sectional or time series. Each different type of statistical data requires a different type of procedure ( 280 Multivariate Anal- ysis, 421 Time Series Analysis). B. Frequency Distributions and Histograms Statistical data have the simplest structure when they consist of a collection of observa- tions made on an aggregate of objects sup- posedly of the same kind. Such an aggregate is usually called a population, and the number of its members (its size) is denoted by N. When the data are qualitative or categorical, each member of the population is classified into several types according to some criteria, the data consist of the numbers of the members of the population classified into each of the categories. Such numbers are usually called frequencies, and the set of frequencies is called the frequency distribution. When the data are quantitative and univari- ate, one quantitative attribute of each member of the population is observed, and the results are given as a set of N real numbers (x" x 2 , ..., XN)' When N is large, as is usually ex- pected, it is necessary to summarize these results in some manner. One common method is to tabulate the frequency distribution: We define a certain number of intervals (a i -" aJ, i= 1,..., K, a o < a, <... < a K , a o minxi' max Xi  a K ; and we count the numbers of those x's falling within each of the intervals and tabulate those numbers or frequencies 1;, i = 1,2, "', K. Frequency distribution is often represented in the form of a histogram, where the endpoints of the intervals are marked on the horizontal axis, and above each interval a rectangle of area proportional to the frequency for the interval is drawn. It is usually recom- mended that the widths of the intervals in the frequency distribution be equal, especially when it is to be represented by a histogram. It is, however, often impossible or impractical to do so, and sometimes a logarithmic or other functional scale is used in the abscissa of the histogram; then it is desirable that the inter- vals of the transformed values are approxi- mately of equal lengths. The number K of the intervals should also be of an appropriate magnitude, neither too large nor too small; K is often constrained by the size N of the popu- lation, the shape of the distribution, or other factors. Usually, K is chosen to be between 6 and 20. From the frequency distribution, we obtain the cumulative distribution by associating with each endpoint a i of the intervals the num ber F; of x's not greater than ai' namely, Fi=Lj"<ifj. The curve obtained by connecting K + 1 points 
397C Statistical Data Analysis of coordinates (ai' F), i = 0,1, ..., K, by linear segments is called the cumulative distribution curve (or polygon). C. Characteristics of the Distribution In order to summarize univariate quatitative data, various values are calculated from the values XI' ...,X N . Such values are called statis- tics (singular, tstatistic) and are used to char- acterize the distribution of the values. Various types of statistics characterize different aspects of the distribution: (a) Representative value or measure of loca- tion: a value which is supposed to give the "representative," "typical," or "most common" value in the population. By far the most com- monly used measure is the mean x= r.1 xi/No x is sometimes called the arithmetic mean, and some other "means" are also calculated: es- pecially when all the values are positive, the geometrical mean xG=(TIiX// N ) or harmonic mean X H =(r. i (l/x;)/N)-1 may be calculated; more generally, for some monotone function f(x) we can calculate the fmean by x J = r 1 (r.;[(x)/N), of which the geometric and the harmonic means are special cases. Another measure of location is the median, which is the value in the population located exactly in the middle of the ordering of the magnitudes; more precisel y, if x( 1) < x(2) < ... < X(N) are the values in the population arranged according to their magnitudes, the median x med =X((N+1);2) for odd N, and ='!(X(N/2) +x((N;2)+1j) for even N. The mode is also sometimes used; this is defined as the value (usually the center) corresponding to the highest frequency. (b) The measure ofvariability or dispersion shows how widely the values in the popula- tion vary. The most common measure is the st andard deviati on, which is defined by Sx = J r.i(X i - X)2 / N, and its square is called the variance V} . A similar measure is the mean absolute deviation Dx=r.;!xi-xl/N. Another type of a measure of dispersion is the range Rx = max Xi - min Xi, and the interquartile range Qx = X(3N/4) - X(N/4) and more generally the interquantile range X(XN)-X((I-x)N) for some ex. The ratio of the standard deviation to the mean is called the coefficient of variation (c. V. for short) and is used as a measure of relative variability when all the values in the popula- tion are positive. (c) Characteristics often used to characterize the "shape" of distributions are the moments (around the origin) mk = N -I r.iX and the central moments (moments around the mean) Mk=N-1r.i(Xi-Xt for a positive integer k; for a specific k these are called the kth moments. Central kth moments with odd k are equal to zero when the distribution is symmetric, i.e., 1474 when the histogram is symmetrically shaped. Hence the third moment or its ratio to the third power of the standard deviation s; is used as a measure of the asymmetry of a distribution; this is called the skewness. The fourth moment is large if there are some values which are far off from others and small when all values are concentrated; hence it tends to be large when the histogram has a rather sharp peak in the center and has a long tail in either direction or both, and tends to be small when the histo- gram is flat in the center and drops off sharply at both ends. Accordingly, the ratio M 4 /V/ is used as a measure of long-tailedness of the distribution; this ratio minus 3 is called the kurtosis. D. Theoretical Frequency Distribution When the observed values can be any real number (sometimes in an interval), the size of the population N is increased indefinitely, and the widths of the intervals are decreased to 0, the histogram is expected to approach a smooth curve. And in the limit when N is infinity, we can assume that the distribution is represented by a mathematically well-behaved function f(x) and that the ratio of the numbers of those values in the population within the interval (a, b) to the size of the population approaches Sf(x)dx. Such a function f(x) is called the frequency function or density func- tion. Various types offunctions have been proposed and used as "theoretical" frequency functions to approximate the actually ob- served frequency distributions. The most im- portant is the normal density function q>(X) =exp { -(X -11)2 } . (J ji;. 2(J The following density functions most com- monly appear in applications: the gamma density, f(x) = x r1 exp( -xja)/aPr(p) for x> 0 and =0 for xO; the beta density,j(x)= x p - 1 (1-X)Q-1/B(p,q) for O<x < 1 and =0 otherwise. We can conceive of a population of infinite size with some density function; the term theoretical distribution is used to mean such a population with its density function, and more specifically the tnormal distribution, etc. Such a population and associated density is often called a continuous distribution. For a theoretical distribution, the mean, variance, and moments are naturally defined by 11= f xf(x)dx, (J2= f(X-11)2f(X)dX, Ilk = f(x- I1N (X)dX, 11= f xkf(x)dx. 
1475 It should, however, be noted that the mean, the variance, or the moments may not exist for particular distributions. K. Pearson introduced a system of density functions defined as solutions of the differen- tial equation dlnf(x)/dx=(A+ Bx)/(C +Dx+ EX2), where A, B,. ., E are constants. A distribution thus obtained is called a Pearson distribntion. The normal, gamma, and beta distributions together with some other commonly used dis- tributions, such as the t- and F-distributions, are Pearson distributions. E. Measure of Concentration When all the observed values are nonnegative in nature, we may sometimes require some measure of inequality or concentration of the distribution. For such a purpose we order the observed values according to their magnitudes and obtain X(I) X(2) ...  X(N); we define Sj= 1:i<;jx(i) for i= 1,..., N, and So =0, plot N + 1 points (S;/SN, i/ N), i = 0, 1, ... , N, and con- nect them by line segments. The graph thus obtained is called the Lorentz curve or the curve of concentration, and it connects the origin and the point (1,1). It lies below the 45° line, and if all the values are nearly equal the curve comes close to the 45° line, but if values are widely unequal, the curve comes close to the horizontal axis and suddenly jumps to the point (1,1) The area between the curve and the 45° line is called the area of concentration, and it is equal to one-fourth of the mean difference b divided by the mean, where b is defined by _ 1 b=2:LL!Xj-x j !. N j j G = b/x is called the Gini coefficient of con- centration and is used as a measure of con- centration or inequality of distribution. Other measures, including the coefficient of variation, are also used to represent the concentration. F. Discrete Distributions There are cases where the observed values are taken only from the nonnegative integers, e.g., the num ber of individual animals of a specific species in an area, of accidents during a specified time, etc. In such cases, when we increase the number of observations, the distri- bution does not approach one with a contin- uous density function but rather one with a certain theoretical discrete distribution. Among theoretical discrete distributions, the most commonly used are the binomial distri- 397 G Statistical Data Analysis bution: fj=nCjpj(1-prj for O]n, and the Poisson distribntion: fj=e-; ;//]! for]= 0, 1, .. ., The hypergeometric distribution: jj = MC j ' N-MCn-j/NCn, for max(O, M + n- N) j  min(n, M), and the negative binomial distri- bution: jj = j+r-l C r - 1 pr(l - p)j,] =0,1, ..., are also often used. For discrete distributions we can define moments by 11= 1:d; and Ilk = 1:/j- l1)kfj, and /1 = 1: j /fj, k = 2, 3, .... G. Generating Fnnctions and Cumnlants For a theoretical distribution with the density function f(x), the moment generating function M(O) is defined by M(O) = J e Ox [(x)dx When M(O) is well defined in an open interval includ- ing the origin, the distribution has all kth moments, and it can be expanded as 1 1 M(O) = 1 + 11'1 0 +- I102 + ... +- /1Ok, 2! k l from which the term "moment generating function" is denved. When 0 is replaced by it with real t, we have the characteristic fnnction cp(t)=M(it), which can be expanded as 1 1 cP(t) = 1 +/1'l(it)+-/1(itf+... +-/1(it)k 2! k! +o(itl k ) if the distribution has moments up to the kth. The function K(O)=lnM(O) is called the cumnlant generating fnnction, and the coeffi- cients Kj in the expansion K2 2 K(0)=K 1 0+-O +... 2 are called the cumulants. The kth cumulant Kk is expressed as a polynomial of the moments of order not exceeding k; thus K, = /1'1, K 2 = 112 - Ill, K3 = /13 - 3p/l'1 + 2p'?, etc. For the normal distribution, { (J2 0 2 } M(O)=exp /lO+T ; hence (J20 2 K(O)=pO+-, 2 and the kth cumulant for k:;o, 3 is equal to 0. Cumulants are used as measures indicating whether the distribution is close to or different from the norma\. For discrete distributions, the moment generating function is defined as M(O)= 1: e Oj fj, but the probability generating func- 
397H Statistical Data Analysis tion P(t)= .fjtj = M(1n t), the factorial moment generating function M(8)= P(8 + 1), and the factorial cumulant generating functions K (t) = In M (t) are also of use. The coefficient of (j in the Maclaurin expansion of M(t) is ex- pressed as l1[k] and called the kth factorial moment; it is eq ual to l1[kJ = j J(j - 1) ... (j- k + 1 )fj' That in the expansion of R (t) is the factorial cumulant. The factorial cumulant K[kJ is expressed by the same polynomial in l1[k] as Kk is expressed in 11. For the Poisson distri- bution, M(O)=expA(e 6 -1); hence K(t)=At and it follows that the factorial cumulants K[k] for k  2 are all equal to zero if and only if the distribution is Poisson. H. Bivariate Distribution When two quantitative observations are ob- tained for each member of a population of size N, the results are given as N pairs of real numbers (Xi' Yi)' i = 1,2, ..., N. Such data are called bivariate data and the distribution, bivariate distribution. Those data can be illus- trated as N points in a plane with coordinates (Xi' Yi)' and such an illustration is called a scatter diagram. In order to characterize a bivariate distribution, we often use bivariate moments 1 Mk.l="NDxi-X)k(Yi- y)', where x and yare the means of X and y, re- spectively; especially, the (1,1) moment MI.1 is called the covariance and is denoted as Cov(x, y). The most often used measure of the strength of the relation between X and Y values is the correlation coefficient r x . y = Cov(x, y)/sxsy, where Sx and Sy are the stan- dard deviations of x and y. It is easily shown that -1  r x . y  1, and when there exists a nearly linear relationship between x and y values, rx.y is close to either +1 or -1 accord- ing to whether the x and Y values change in the same direction or in opposite directions. When there is no clear relationship between x and Y, the correlation coefficient is close to zero, but it may not be a good measure of the relationship when x and y values are related nonlinearly. A linear function y = a + bx is called the linear regression function of yon x, for which the sum of the square distances JYi - a- bX;)2 is minimized. For the linear regression function the coefficients a and b are deter- mined by b=Cov(x,y)ls and a=y-bx, and b is called the regression coefficient. We have that JYi - a - bX;)2 1(Yi - y)2 = 1 - r;y, i.e., that the square of the correlation coefficient is 1476 equal to 1 minus the ratio of the variance of the residual Yi - a - bX i to that of y; hence it is sometimes called the coefficient of determina- tion. Similarly, the linear regression function of x on y is defined by x = c + dy, where d = Cov(x,y)/s; and c=x-dy. We have bd=r;y, and 111dJ = I bJr;y I  Ibl. We can tabulate the bivariate frequency distribution by splitting the range of x values into K intervals (ai-I, a;], i= 1, ...,K, and the range of y values into L intervals (bj-b b j ], j = 1,..., L, and counting the number fij of cases for which a i - 1 < x  a i and bj1 < y  b j . In contrast to the bivariate frequency distribu- tion, the distributions of x and y values are called the marginal distributions. I. Bivariate Density Function As we did for the univariate distribution, we can consider the limiting shape of the bivariate frequency distribution when the size N of the population tends to infinity and define a con- tinuous bivariate distribution with density function f(x, y), with which the ratio of those members in the population with values (x, y) in a set S in a plane is given by SSs f(x, y) dx dy. The bivariate density function is also called the joint density, and then the density functions of x and yare called the marginal density functions and are given by f1 (x) = J f(x, y)dy and f2(y) = J f(x, y)dx. The joint moments of a continuous bivariate distribution are defined by 11k,l= SS (x -111)k(y-112)lf(x, y)dx dy, where 111 and 112 are the means of x and y, respec- tively. The joint moment generating function is defined by M(tj, (2) = H e"x+t,Yf(x, y)dxdy, and the cumulant generating function by K(t b t 2 ) = logM(t 1 , t 2 ), from which the joint cumulants a Ul K =-K ( t t ) 1 k.l atkacl 10 2 ',=0.1,=0 1 2 are derived. The conditional density of y given x is de- fined by f(y I x) = f(x, y)lf1 (x), and the distri- bution with this density function is the con- ditional distribution of y given x; this latter can be interpreted as the distribution of y of those members in the population with x values in the interval (x, x + dx], where dx is small. The conditional density and the conditional distri- bution of x given yare similarly defined. The mean and the moments of the conditional distribution are called the conditional mean and the conditional moments. The conditional mean of y given x, considered as a function of x, is called the regression function of y on x. By far the most important theoretical bivari- ate density is the bivariate normal density, 
1477 which is given by f(x, y)=(2nO"I 0"2j1=P2)-1 { 1 [ (X-l1d 2 (Y-112)2 x exp 2(1- p2) O"i + O"i _ 2p (x -111)(Y-112) ]} , 0"10"2 for which the mean of x is III and that of Y is 112, the variances of x and yare O"i and O"i, respectively, and the covariance of x and Y is equal to PO"I 0"2' The bivariate normal distri- bution has several remarkable properties: All the k, I joint cumulants are equal to zero for k + I  3; the marginal distributions of x and yare normal; the regression functions of Y on x and x on yare both linear; the conditional distribution of y given x (and x given y) is normal and the conditional variance is con- stant; and the contours f(x, y) = c for different values of care equicentric ellipsoids. J. Higher-Dimensional Data When the data are of more than two dimen- sions, i.e., more than two observations are made on each of the objects, we designate the data by Nk-tuples ofreal numbers (XiI' xi2, ..., X ik ), i = 1, "', N (k  3). Then we can calcu- late the moments of each of the variates and the joint moments, which are defined by M k !.k 2 . .k, I '\' ( . - ) k 1 ( - ) k2 ( X - ) k, =-L... X i1- X I X i2 -X2 ",Xik-k' N Also, we can arrange the variances and covar- iances in a symmetric matrix of order k, and we call it the (variance-) covariance matrix. A covariance matrix is easily shown to be non- negative definite. The determinant of the covar- iance matrix is called the generalized variance. The matrix with the (i,j) element equal to the correlation coefficient of the ith and the jth variates r i ; (r ii is set equal to 1) is called the correlation matrix and is denoted by R. R is also nonnegative definite. If we denote the (i,j) cofactor of R by Rij, the quantity defined by Rill.. .(i). .k= j l-IRI/Rii is called the multiple correlation coefficient of the ith variate and all other variates; and rijll. .(1). ..w. .k= -Rj jRiiRjj is called the partial correlation coefficient of the ith and the jth variates given all other variates. The meaning of these coefficients will be elucidated below. A linear function a o + a 1 x I +a 2 x 2 +... +a k - 1 X k - I is called the linear 397 J Statistical Data Analysis regression function of X k on x I' ... ,X k - I , when the coefficients ai, ..., a k - l are so determined that the sum Q=i(xik-aO-alxil -...- a k - I X ik _ I )2 is minimized. They are determined from the equation Ci,a l +Ci2a2+",+Cik-lak-1 =C ik , (*) i=I,...,k-l, with a o = x k - a l XI -... - a k -, Xk-I, where Cij are the covariances. aI' "" a k - I thus deter- mined are called the regression coefficients of X k on x I> ... 'X k - I ' and such a procedure is called the method of least squares. The equa- tion (*) is called the normal equation. If we write xik=a O +a l XiI +... +a k - I X ik - I , we have Q =(Xik-Xik)2 =(Xik-Xk)2- (Xik-xd2 = (Xik -X k )2 x (1- Rkll. .k_12), where Rkll.....k-l is the multiple correlation coefficient of X k and x P '" 'Xk-I> which is also equal to the correla- tion coefficient of X k and Xk' The square of the multiple correlation coefficient is also called the coefficient of determination. The quantities X ik - X ik are called the residuals. Let Xi k-l and Xik be the values of regression functions of X k - l and Xk, respectively, on XI' .., ,X k - 2, and let Yik-1 =Xik-l -Xik-l and Yik=Xik-Xik be the residuals; then the correlation coefficient of Yk-I and Yk is equal to the partial correlation co- efficient ofxk-I and Xk given XI' ...,Xk-2' We have the following relation between the multiple and the partial correlation coefficients: 1 - Rill. .k-1 =(1- Rlll.. ..k-2)(I- r f-1.kll. .k-2)' Multiple and partial correlation coefficients are also expressed in terms of the correlation coefficients of the variates. For example, it can be shown that R112 = (ri3 + ri3 - 2r I 2 r 13 r 23)/(I- rf2) and that r 23 11 = (r 23 - r 12 r 13 )/ j (1- rf2)(1 - rf3)' F or higher dimensions, we can also define the (joint) density function f(XI, X 2 , ..., xd and the (joint) moment generating function M(t 1 ,t 2 ,...,td = J...fexp(tlX1+t2X2+...+tkXk) x f(xI' X2,"" xddxI '" dXk The most important multivariate joint den- sity is that of the multivariate normal distri- bution, which is expressed by { I.. f(x I'"'' Xk)= (2n)-R/2I.Elexp 2 L L O"'J(x i -l1i)(x j -l1 j )}, 
397 K Statistical Data Analysis where E is the covariance matrix with ele- ments aij' E -, = (aij), and l1i is the mean of the ith variate. For the multivariate normal dis- tribution the moment generating function is given by M(tt,.. ,tk)=ex p ( L:l1iti+L>ijtitj). K. Contingency Tables When several qualitative observations are made on N objects, each object is classified according to the combination of the cate- gories, and the data are summarized by the numbers N(i"i 2 , ...,i k ) of the objects that fall in the it th category according to the first observation, i 2 th category in the second obser- vation, etc. A table that shows the results of such observations is called a (k-way) contin- gency table. If there are mt categories in the first criterion, m 2 categories in the second, etc., the contingency table is also called an m 1 by m 2 by... by m k table. The numbers N (j, i j ) of the objects which are classified into the ijth category according to the jth criterion are called marginal frequencies. If we have N(i1, i 2 ,..., ik)/N = N(1, irJN(2, i 2 ) .. N(k, ik)/N k for an it. i 2 ,..., i k , then the k observations or criteria are independent. The simplest contingency table is a 2 by 2 table, where several measures for the relation of two observations or criteria have been proposed, among which the most commonly used are the measure of association defined by N(l, I)N(2,2)-N(1,2)N(2, 1) Q - N(1, I)N(2,2)+N(1,2)N(2, 1) and the odds ratio b= N(1, I)N(2,2) N(1,2)N(2,1) and also V= N(1, I)N(2,2)-N(I,2)N(2, 1) I N(1, 1)N(I,2)N(2, I)N(2,2)' where N(i,);) are marginal frequencies. The two observations are independent if and only if Q = 0 or a = 1 and V = o. V is equal to the correlation coefficient of the variables x 1 and X 2 , for which Xi =0 if the object is classified into the first category according to the ith criterion and X i = 1 if it is classified into the second category. In a two-way m 1 by m 2 table a measure of association is defined by X 2 =",,",, ( N(it.i2)N _ ) 2. L..L.. - - 1, N N(1,idN(2,i2) 1478 it can be shown that X 2 / N = 0 if and only if the two criteria are independent, and that 0X2/N min(mt -1,m2 -1). When we take X i = 1 if the object is classified into the ith category according to the first criterion and xi=O otherwise and take Yj= 1 if it belongs to the jth category in the second criterion and Yj=O otherwise, it is shown that the sum of squares of the multiple correlation coefficients of Xi and Yt.... ,Ym,-t is equal to X 2 /N L. Decomposition of the Variance When one observation is qualitative while another is quantitative the objects are classi- fied into several categories according to the first observation, while for each object the value of the second observation is also given. Let xij be the observed value of the jth object in the group of the ith category; then for each i we can obtain frequency distributions of Xij. and compare these distributions Let N i be the number of objects in the ith category, and Xi =  xu! N i be the mean in the ith category. Then the weighted variance of the .x i defined by VB = i Ni(X i - X)2 / N, where x is the mean of an the observations, i.e., X =  Nix i IN, is caned the between-group variance, and the weighted mean of the variances of each of the groups defined by vW=ij(xij-xi/N is called the within-group variance. It can be shown that VB + V w = V= (xii-X)2/N, i.e., the variance of all the observations is decomposed as a sum of the between-group and the within-group var- iances. The ratio VB/V is called the correlation ratio, which is equal to the square of the multi- ple correlation coefficient of x and Y t, ... ,Ym' where Yi= 1 if the object is in the ith category and Yi=O otherwise. M. Ordinal Data When the observation is not quantitative but there exists a natural ordering among the categories into which the objects are classified, the observation is said to be in an ordinal scale, or simply ordinal. When two ordinal observations are made on the same set of N objects, we can define several measures of association between the two ordinal scales. For each pair of objects we define a variable cij' i, i = 1, ... , N, i of- j, as c ij = 1 if the ith object is classified as "better" (or "superior") than the jth object according to both of the measurements, c ij = -I if the order- ings are different in the two scales, or cij=O if they are in the same category according to either or both of the scales. A measure of asso- ciation is then given by S=cij/N(N -1), 
1479 which takes a value between -I and + I but usually cannot attain :1:: 1. Other ways of nor- malizing the sum   c i ; have been proposed. Another method of calculating the associ- ation is the scoring method, i.e., giving a set of ordered real numbers to the categories of each of the scales and calculating the correlation coefficient between the scores. The simplest scores are 0, I, '" , m - 1 when there are m categories, but other methods of scoring are also used. Scores that give the largest pos- sible correlation are called canonical scores, which are obtained as the characteristic vec- tors of the matrices NN' and N'N, where N is the matrix of the contingency table. N. Time Series Data Time series data can be recorded in a con- tinuous time scale, but usually measurements are made at discrete times, which are most commonly equally spaced. Hence we here denote them as x(t), t = 1,2, ..., T. First we consider the quantitative univariate case. The intertemporal change of x(t) is often decom- posed into three parts: x(t)= m(t)+c(t)+e(t), where m(t) is called the trend, and represents the secular, systematic change of x; c(t) is called the cycle, and represents the recurrent pattern of the change; and e(t) is called the error or random fluctuation, and represents the irregular changes. Such a decomposition cannot be defined rigorously without assuming some probabilistic or stochastic model for x(t), but it is intuitively clear and practically useful in many applications. There are two ways to estimate the trend One is to calculate the moving average x(t) = (x(t-k)+x(t-k+ 1)+... +x(t)+... +x(t+ k))/(2k + I) and use it as an estimate of the trend of x(t); here k should be chosen to sub- stantially eliminate the cyclic and random parts More generally we can use the weighted moving average defined as x(t)= J -k w(j)x (t +j), where w(  J)= w(j) and  w(j)= I. The second method is to assume some functional form, usually a polynomial in t, for the trend: m(t)=a o +a, t+... +akt\ and to determine the coefficient by least squares, i.e., to calculate the values of aO,a b ...,a k which minimize t(x(t) -ao-a,t ..-a k t k )2. There are two cases of cyclical changes One is the case when there is a clearly defined rele- vant external time period, such as the seasons of the year or the days of the week. In such cases the effects of such external periodIcal cycles must be eliminated, and the process which does that is called seasonal adjustment 397N Statistical Data Analysis of the time series data. Various methods of seasonal adjustment have been proposed and applied, but none is definitive. The other case is where the cyclical changes are produced from the observed process itself; here, the length of the period and the pattern of the cyclical change must be estimated Now assume that the data do not contain any trend, or that the trend has been effec- tively eliminated. First we calculate the cor- relation coefficient between x(t) and x(t +s) by I (x(t) - x:)(x(t + s) - x) r(s) - , J I(x(t)-X;)2 I(x(t +s)- X;)2 where 1-, x;= I x(t)/(T-s), t::::l T x; = I x(t)/(T - s), t=s+t or more simply by r(8)= TI(x(t) -x)(x(t +s)-x)/ ((T -s) J I(x(t)-X)2j, where x =  x(t)/T. r(s) is called the serial correlation coefficient or the autocorrelation coefficient of lags. When there exists a clear and definite cyclical change of period s in the data, r(s) is close to 1. The diagram in which the serial correlation coefficient r(s) is plotted against s is calIed the correlogram. In order to see the cyclical properties of the data more clearly, we calculate the power spectral density w(},) = 1 +2Ir(s)cosAs. s The graph of w(A) is called the power spectrum, or simply the spectrum. w(A) represents the square of the width of the sine curve of fre- quency J./2n or of period 2n;;, contained in the data. The spectral density is closely related to the intensity, defined by l(}.)=( (X(t)COSAJ + (X(t)Sin).tY), which is proportional to the square of the multiple correlation coefficient of x(t) and the functions cos At and sin At and is large if the data contains a sine curve of frequency A/2n It can be shown that 1(..1.) is approximately equal to w(}.) V(x)/n. The spectral density thus ob- tained usually oscillates irregularly and far from smoothly; hence smoothing by use of a "spectral window" is often applied ( 421 Times Series Analysis). When several observations are made in time series data, we speak of multivariate time series. Let x,(t) be the ith observation in the tth period. The correlation coefficient between xi(t) and xj(t+s) is called the serial cross- correlation coefficient and is denoted by ri;{s) (8 = 0, :1:: I, :1::2, ... ). Analogously to the univari- 
3970 Statistical Data Analysis ate case we define WiP) = Irij(s)coSAS+ iI rij(s) sin i.s $ $ = Pij(A) + iqij(A), and we call PiP) the cospectral density be- tween the ith and the jth variables, and %(A) the quadrature spectral density. P0 + q0 is called the amplitude, and . pW.)+qW.) Cij(A) = . . ' J WJIc)Wj(/.) where Wi and wi are the spectral densities of the ith and the jth variables, is called the coherence. O. Events in Time Scale Some data give us the time points at which a specific event occurs. Let 7; be the time when the event occurs for the ith time. Then usually the most important information we want to obtain is about the time intervals d; = 7;+1 - 7;. If there is a periodicity in the occurrences of the event, the d i will be approximately equal. On the other hand, if the event tends to occur repeatedly after its first appearance, some d i will be small while others will be large. When there is no periodicity, no tendency to repeti- tion, and no increasing or decreasing trend in the occurrences, we can suppose that the event occurs simply by chance, and this is good reason to suppose that the density func- tion of the distribution of the intervals is exponential, i e., it can be expressed by f(d) = (exp( -djrt.))jrt. for d>O. Also in such a case the number of occurrences in fixed time in- . tervals are distributed according to the Pois- son distribution. Such a sequence of occur- rences of an event is called a tPoisson process. More generally, let f(d) be the density function of the time intervals; then h(d) f(d) 1- Sgf(c)de is called the hazard rate or hazard function. The hazard function is constant if and only if the process is Poisson. P. Probabilistic Models In many applications of statistical data anal- ysis, the data exhibit variabilities and fluctu- ations that are due to fortuitous or hazardous causes or chance effects and that obscure the information contained in the data. In such cases we assume that the chance variabilities and fluctuations are random variables distri- buted according to some probability distribu- 1480 tion, and the information we require is repre- sented by a set of unknown constants that characterize the probability distribution of the data as unknown parameters. Suppose, to take the simplest case, that we have repeated obser- vations of results of some experiment under a fixed condition and that we obtain the values XI' X 2 ,..., Xn' Since the experimental condition is fixed, the variations among the Xi values can be considered to be due to chance causes, such as variations in materials, uncontrolled small fluctuations in experimental conditions or instruments, and various other variations usually called the errors. Whatever the true causes of the variations, we can consider them to be random, and we can regard the values XI' X 2' ... , X n as the results of random experi- ments or the realizations of random variables Xl' X 2 , ... ,X n independently and identically distributed according to some probability dis- tribution. Or we may think of a hypothetical infinite population of the results of supposedly infinite replications of the experiment under the same fixed condition, and regard the actual observations as n values chosen from this population at random. We may also consider that in this hypothetical infinite population, the frequency distribution is represented by a density function f, which in turn determines the probability distribution of each observa- tion. We may be interested in the "average" values of the result of the experiment as well as the magnitude of the variability; then those values are represented by the mean and the variance of the population distribution. If the form of the population distribution is assumed to be completely specified except for the mean /1 and the variance a 2 , the density function f is determined without these two parameters, and is expressed as f(x; /1, a). The joint density for n repeated observations is TI;/(x i ; It, a). The set of assumptions that determines the probability distributions of the observations in terms of the unknown parameters is called the probabi- listic model, and its determination is called the problem of specification. Once the probabilistic model is given, the purpose of statistical data analysis can be formulated as making judgments on the values of the parameters, which may sometimes go wrong but can be relied on with some margin of probability of error that can be mathemati- cally rigorously ascertained. The formal proce- dure of making such judgments is called sta- tistical inference, and its mathematical theory has been well established over the last hundred years ( 401 Statistical Inference). In most cases of statistical inference, the joint density function of the data plays an important role, and when it is regarded as a function of the unknown parameters for given 
1481 values of observations it is called the tlikeli- hood function. Q. Exploratory Procedures In many applications of statistical data anal- ysis, we are not quite sure of the validity of the probabilistic model assumed, or we admit that the models are, at best, approximations to reality and hence cannot be exactly correct; the approximation may not be precise enough for the conclusions drawn from the assump- tions to be practically reliable. Therefore we have to check whether the model assumed is at least approximately valid for the data, and if not, we have to look for a better model that reflects more accurately the structure of the actual data. Thus in many practical appli- cations of data analysis, we have to scruti- nize the structure of the data and try various models before settling on a model and draw- ing final conclusions (which are still suscep- tible to further revisions when more data are obtained). Methods used in such a process are called exploratory procedures, which de- pend partly on the formal procedure of testing hypotheses and partly on intuitive reasoning sometimes combined with graphical presen- tations of the data, and also on scientific and empirical understanding of the subject matter. Suppose in the simplest case that n obser- vations XI' ... , X n are assumed to be indepen- dently and identically distributed Under the condition that all those values are observed under the same well-controlled situation, this assumption is reasonable. But in reality some of the observations may be subject to some unexpected effect due to either a fortuitous outside cause or some "gross error" in the measurement procedure, the process of re- porting, etc., and may show much greater variation than others. Such observations can be detected by certain outlier tests or simply by looking at the data carefully, and if it is established that some observations are de- finitely outside of the possible random varia- bility or are subject to some hazardous ex- ternal effect, those data could be omitted from consideration. Further, the assumed proba- bility density f(x, 0) may not well approximate the distribution of the actual data even after the "outliers" are omitted. Some test for good- ness of fit should be applied, and if the hy- pothesis is rejected, we have to modify our model Also, if we are provided with several candidates for the model to be adopted, we have to apply some procedure of model selec- tion ( 400 Statistical Hypothesis Testing, 403 Statistical Models). It could also happen that the supposedly uniform conditions of 398 A Statistical Decision Functions observation, measurement, or experimentation did not actually prevail but that there has been some heterogeneity among the observations. Bimodality or multimodality of the histogram, i.e., existence of two or more peaks in the histogram, usually strongly suggests such heterogeneity. In such cases, grouping or stra- tification of the observations is required to make the conditions of observation within each group nearly uniform. References [IJ R. A. Fisher, Statistical methods for re- search workers, Oliver & Boyd, first edition, 1925. [2J M. G. Kendall and A. Stuart, The ad- vanced theory of statistics, 3 vols., Griffin, 1958-1966. [3J J. W. Tukey, Exploratory data analysis, Addison-Wesley, 1977. 398 (XVIII.6) Statistical Decision Functions A. General Remarks The theory of statistical decision functions was established by A. Wald as a mathematically unified theory of statistics ( 401 Statistical Inference). In this theory, the problems of mathematical statistics, for example, statistical hypothesis testing and tstatistical estimation, are formulated in a unified way [1]. A tmeasurable space (:?l',) with a fixed tprobability measure is called a sample space, and an element xE:?l' is called a sample point. Suppose that we are given a family f7> = {Po I OEQ} of probability measures on (2[, ), where Q is called the parameter space, and a trandom variable X takes values in .0£ accord- ing to a true probability distribution P as- sumed to belong to f7>. This article deals with the problems involved in making a decision about the parameter 0, called determining the true value of the parameter, such that P = Po. To describe the procedure for such a decision based on the observation of the behavior of X, we need a triple (., (f; D) consisting of a set ., a to--algebra (f of subsets of ., and a set D of mappings b from :?l' into the set of probability measures on (., (f), b: x->b(-I x), such that for a fixed C E(f, the function b( CI x) is - measurable. We call . an action space or decision space, b a statistical decision function (or simply a decision function) or statistical deci- 
398B Statistical Decision Functions sion procedure, and D a space of decision func- tions. In actual decision procedures, b( CI x) is the probability that an action belonging to C is taken, based on the observation of sample point x. We further consider a nonnegative function w: f.! x c91 -> R, called a loss function, such that for a fixed 8, w(O, a) is <f-measurable. By averaging the loss, we obtain the risk function: r((j,b)= r L w(O,a)b(dalx)Pe(dx). Two decision functions band b' are identified if b( C I x) = b'( C I x) for almost every x with respect to Pe, for all 8 Ef.! and all C E<f. When to each XE.U£ there corresponds a unique ac- tion ax such that 6( {ax} I x) = 1, the decision function b is said to be nonrandomized; other- wise, randomized. The system (.?f,,.'?l-', f.!, .W', <f, W, D) is called a statistical decision problem. From the point of view of this theory, tpoint estimation, tinterval estimation, and statistical hypothesis testing ( 400 Statistical Hypoth- esis Testing) are described as follows. (1) In point estimation we assume that the action space .91 is a subset of R and that we are given functions <p :!!f ->.91 and I: eP -> R. The problem is to estimate the value of I(P) by using the real value <p(x) at an observed sample point XE.'£. As the loss function, we often set w(O, a) = C(8)(a -1(Pe))2, where C(O) is a function of 0, and call it a quadratic loss function ( 399 Statistical Estimation). (2) In interval estimation we assume that each action is represented as an interval in R. Each interval [u, v] can be represented by a point (u, v) of the half-space R 2/ = {z = (ZI,Z2)lzl Z2}, which may be taken as the action space. A weighted sum IXwr(8, z) + {3w 2 (B, z) (1X,f3?3 0) of two functions, { I (8<t[ZI,Z2])' wr(O z)= , 0 (OE[ZI,Z2]), W2(0,Z)=Z2- Z " often supplies the loss function ( 399 Statis- tical Estimation). (3) In testing a hypothesis H:OEwo versus an talternative A: OEW l (w o n WI = 0, W o U WI = f.!), the action space can be expressed by the set consisting of two points aj, a 2 , where a 1 denotes the decision to reject Hand a 2 the decision to accept H. The loss function is defined as follows: w(O,arJ= { w(IJ,a 2 )= { (8EW O ), (8EWrJ, (8EW o ), (OEWrJ. This is called a simple loss function. Whatever 1482 testing procedure b is adopted, the proba- bilities of the terrors of the first or second kind coincide with the values of r(O, b) for 8EW o or OEW 1 , respectively ( 400 Statistical Hypoth- esis Testing). When f.! is the union of mutually disjoint nonempty sets Wj, W 2 , ...,W n , d={a 1 ,a 2 , ..., an}, and w(O, a j )= Cij (8EW i ) with c ij ?3 0 (i =I- j), C ii = 0, the decision problem is called an n-decision problem. B. Optimality of Statistical Decision Functions Consider the problem of choosing the best decision function b. When r(8, brJ  r(O, b 2 ) for all 8 and there exists at least one 8 0 such that r(8 o , brJ < r(80, b 2 ), the decision function b 1 is said to be uniformly better than b 2 . If there exists a decision function b o in D that is uni- formly better than any other b in D, it is the best decision function. However, such a fUnc- tion b o does not always exist. A decision func- tion bin D is said to be admissible if there exists no other decision function b in D that is uniformly better than S. In other words, b is admissible if and only if the validity of the inequality r(8,b)r(0,J") for some bED and all 8Ef.! implies r(0,b)=r(8,b) for all 8Ef.!. When there is no information about P except that it is a member of eP, we follow the minimax principle and choose a function b* for which we have infoeDsuPeenr(0,b)=suPeEnr(8,b*). This decision function b* is called a minimax decision function or minimax solution. Let ('Y be a a-algebra of subsets of f.!, and suppose that r(O, b) is ('Y-measurable for any fixed b. If, furthermore, we are given a proba- bility measure ( called an a priori distribution, on (f.!, (Y), we choose a 8 that satislies inf r r(O, b)d(8)= r r(8J)d(0). oeDJn In Such a 8 and the integral of r(8, 8) are called a Bayes solution and the Bayes risk relative to , respectively. Let F be a family of a priori dis- tributions on (f.!, ('Y). If 8 satisfies inf ( r r(0,3)d(8)-inf r r(0,b)d;(8) ) =0, eF In oeDJn 8 is called a Bayes solution in the wider sense relative to F. If eP is tdominated by Ie with a  x 6-measurable f(x, 8)=dP e /dJ.., w(O,a) is 6 x <f-measurable, and Ax= {aEdiL w(0,a)f(x,8)d(8) =inf r W(IJ,a)[(X,IJ)d(IJ) } aE.W J n is nonempty and <f-measurable for },-almost 
1483 every x, then a Bayes solution b with respect to  satisfies b(Ax I x) = 1 for A-almost every x. For a sample point x with SQf(x,O)d(O);fO, a probability measure 11('1 x, ) on (Q, m defined by I1(Blx,) In fix, O)d(O) tf(x,O)d(O) is called an a posteriori distribution. To get a Bayes solution it is enough to minimize the value of Sn w(O, a)dl1(O I x,) for every observed x. If  in the definition of Ax defined above is a a-finite measure on (Q, m, a decision function b satisfying ()(Axl x) = 1 for A-almost every x is called a generalized Bayes solntion with respect to . Let D' be a subset of the space D. If for any bED - D' there exists a b' E D' that is uniformly better than b, then D' is called a complete class. Iffor any bED there exists a b'ED' that is either uniformly better than b or has the same risk function as b, then D' is called an essen- tially complete class. If D' is complete and any proper subset of D' is not complete, then D' is called a minimal complete class. If a minimal complete class exists, it is unique and coin- cides with the set of all admissible decision functions. C. n-Decision Problems In an n-decision problem where d={a p ...,a n }, we set b;(x)=b(a;!x) for a decision function b, where b;(x) is -measurable and satisfies b;(x);:'O, bdx)+... +bn(x)= 1. We consider the set fi! of vector-valued functions (x)=(bt (x),..., bn(x)) whose components bj(x) satisfy the conditions just given. Such a vector- valued function (x) can be identified with b(x); we write b(x) instead of (x) also. If in addition the parameter space Q is a finite set {Op O 2 ,,,,, 0d, we can consider a mapping 1jJ:f!i;.....,.R k defined by ljJ(b) = (r(OI ,b), ...,r(Ok,b)), and then S=IjJ(f!i;)={IjJ(b)!bEf!i;} is convex and closed in R k . If b is nonrandomized, then for each x, one and only one of the bi(x) is 1, and aU others are 0. Hence, in this case, ?£' is the disjoint union of $-measurable subsets B i (i = 1,..., n) such that b(a;! x) = 1 if and only if XE B j . A probability measure m on (?£',) is said to be atomless iffor any set AE with m(A) > ° and any b with 0< b < m(A), there exists a subset BE of A such that m(B)=b. If Q is finite and every member of fJjJ is atomless, the image t/J(O) of the set f!i;0 of all nonrandom- ized decision functions coincides with 1jJ(f!i;). This shows that f!i;0 is an essentially complete 398D Statistical Decision Functions class in fi!. However, when some members of .iJ> are not atomless, 1jJ(fi!) is not always equal to 1jJ(f!i;0), but 1jJ(f!i;0) is a closed subset of R k . In particular, when n = 2 for given probability measures PI' ..., P k over (?£', ), the set S of all points (PI (B), ..., Pk(B)) (BE) in R k is a closed set. If in addition PI' ... , P k are all atomless, S is convex. These results are known as the Lyapnnov theorem. A two-decision problem with W t = {I}, W 2 ={2}, and cij= 1 (ioftj), =0 (i=j) is called a dichotomy. We discuss this problem in some detail in order to explain the concept of opti- mality of decision functions. For a dichotomy, S=I/J(fi!) is a set in R 2 that (i) is convex, (ii) is closed, (iii) is symmetric about (\/2, 1/2), (iv) contains the points (0, 1), (1,0), and (v) is a subset of interval [0, IJ x [0, IJ (Fig. 1). The set of decision functions b that is mapped under IjJ onto the curve ACDB in Fig. 1 constitutes the minimal complete class, and a decision func- tion b mapped onto the point D is a minimax solution. Let  be an a priori distribution such that W)=IX, (2)=fj (1X+f3=I,IX;:'O,fi;:,O). Fig. 1 Then a Bayes solution relative to  is mapped under I/J onto a supporting point c with direc- tion ratio -1X1f3 and is obtained as the char- acteristic function of the set E = {x IlXj (x) < f3g(x)}, where f and 9 are tRadon-Nikodym derivatives dPt/dA. and dP 2 ldA. with respect to ), = PI + P 2 , that is, f, 9 are measurable func- tions on ?£' such that for any measurable set E, we have Pr!E) = LfdA.' P 2 (E)= LgdA. This fact implies that the most powerful test constructed in the Neyman-Pearson funda- mental ]emma (400 Statistical Hypothesis Testing B) is precisely a Bayes solution. D. Complete Class Theorems Suppose that .OJ> is tdominated by a a-finite measure A, and let fix, 0) be the Radon- Nikodym derivative of Po with respect to A. 
398 E Statistical Decision Functions Consider a subs pace L of L d::'l', ).) containing {fix, 0) I OEa}, and the following equivalence relation between bounded -measurable func- tions on .1'; <fJ1 and <fJ2 are defined to be equiv- alent if and only if S <fJljdA = S <fJdd). holds for every fEL. We further assume that the dual space of L is the linear space 911 of the equivalence classes of bounded measurable functions Just defined. Let Co(') be the set of all continuous functions on . with compact support, IT ° rx the integral of rxE Co(') with respect to a probability measure IT over ., and gob the mtegral f b(' I x)g(x) d). for gEL, bED. As a base for neigh borhoods of the space of decision functions around b o ED, we consider V(bo.rx" ""(Xn,gl, ...,gn'F.)= {b \ 19,0 boo rxi-gio b ° rxd <£, i= 1, "', n}, where rxiECo('), giEL, £>0. Let F be the set of all a priori distributions  on (a, (Y) each of which assigns the total mass 1 to a finite subset of a, B the set of all Bayes solutions relative to some (E F), and W the set of all Bayes solutions in the wide sense relative to F. Suppose that . is a tlocally compact, separable metric space and w(O, a) is lower semicontinuous with respect to a for every fixed O. Then if D is compact and convex, the intersection of Wand the closure B of B constitutes an essentially complete class [2]. Moreover, if a is compact with respect to the metric d(OI' 02)=SUPBEIPeJB)- P o2 (B)1 and {w(', a) laEd} is a uniformly bounded and tequicontinuous family on a, then the class of Bayes solutions relative to some a priori distri- bution is a complete class [1]. These propo- sitions are called complete class theorems (for complete classes in specified problems and admissibility of individual procedures  399 Statistical Estimation; 400 Statistical Hypoth- esis Testing; and Appendix A, Table 23). Two measurable spaces (5, .'/)) and (R, .9?) are said to be isomorphic if there exists a corre- spondence p of 5 and R (EE!I' implies p(E)E;f,f and, conversely, p(E)e3£ implies E E ,';1') where p is one-to-one onto. Let f!2J be the set of all decision functions associated with a sample space Uf, ) and an action space (., (f), T be a tstatistic on (::'l',) taking values in a meaSUr- able space (&, G:), and 9C* be the set of all decision functions having sample space (qy, G:) and action space (d, (f). The set of all bE f!2J for which there exists a b* E f!2J* satisfying b(CI x) = b*(CI T(x)) (C E(f) is denoted by '?4 1 . If (i) T is a tsufficient statistic and if (ii) (.W', (f) is iso- morphic to the measurable space R k associated with the o--algebra of its Borel subsets, then f!2J T is essentially complete in 9C. Conversely, if (i) .0/ is a dominated family, (ii) f(x, 0) > 0 always 1484 holds on:1:, and (iii) for any pair 0 1 , 02(Ea), there exists a 03(Ea) such that no element a in . makes two of the W(Oi' a) (i = 1,2,3) attain their minimum simultaneously, then essential completeness of f!2J T in 9C impI:es that Tis sufficient [3] ( 399 Statistical Estimation). E. Invariance Suppose that there exist one-to-one transfor- mation groups G, G, and G of X, a, and <, respectively, onto themselves (transformations belonging to G, G, and G are tmeasurable with respect to , (Y, and (f, respectively) and that there exist homomorphisms g->g, g->g of G to G and G, respectively, such that for B E  we have P e (g-IB)=P ge (B) and w(ge,ga)=w(O,a). Then a decision problem (.?r, ,f7>, a, d, (f, w,D) is said to be invariant under (G, G, G). In a decision problem invariant under (G, G, G), a decision function satisfying b(gCl gx) = b( CI x) is called an invariant decision function. Suppose that the transformation group G is locally compact and is the union of a count- able family {Kn} of compact subsets. Let f be the o--algebra of Borel subsets of G such that the mapping (g, x)->(g, gx) is measurable in the sense that the inverse image of any set in r x  is also a set in r x . For such a r the torbit G(x) of G through x is r-measurable. We assume that following conditions: (1).01 is dominated; (2) G operates teffectively on :1:; (3) G operates ttransitively on a; (4) for any com- pact subset J of G, Iim {J1'(Kn)/J1'(Kn' F1)} = 1, nro where J1' is a right-invariant Haar measure ( 225 Invariant Measures C) on G and Kn F'={gh- l lgEK n ,hEJ};(5)there is a tconditional probability distribution PeC: z), given Z E G(x), on .?r; (6) the integral SG(X) w(e, ga)Pe(dgx :z) attains its minimum value b(e,z) for any e and zEG(X); and (7) Iim { r w(O, ga) Pe(dgx: z) - b(e, Z) }  0 tJ---+XJ J Kn x uniformly in a, where Knx={gxIYEK n }. Then the best invariant decision function exists which is also minimax in f!2J [4] ( 400 Statis- tical Hypothesis Testing). It is shown in [9] that some invariant minimax decision func- tions are not admissible. F. Sequential Decision Problems I n the general framework of statistical deci- sion theory, not only the decisions to be taken but the number of samples to be observed 
1485 may be determined based on the previous o bserva tions. A simple formulation is ilIustrated for the sequential decision problem given below. Suppose that XI, X 2 , ..., X n are independently and identically distributed random variables with probabiJity measure Po. We assume that the X's are to be observed one by one, and at the ith stage, when we have observed XI' ..., Xi' we are to decide whether to continue sampling and observe X i +, or to stop obser- vation and choose an action or decision in f», utilizing all the observations thus far obtained. Then a decision rule is defined by a sequence of pairs (b n , sn), n= 0,1, ..., where b n is a map- ping from the space x n to f!iJ (for the sake of simpJicity we here exclude randomized deci- sions), and Sn = Sn(X 1, ... ,X n ) is a measurable function from x n to the interval [0, 1]. Sn gives the probabiJity of stopping the sampJing when the first n of the X's are observed, and b n de- fines the decision taken when the observations are stopped. We include So and b o to denote the probabiJity of taking a decision without making any observation and the decision to be taken then. We call b n the terminal decision rule and Sn the stopping rule. Then for such a decision rule J'the total expected loss or the risk is given by r(O, J')= JofTI (l-s j (x 1 ,..., xJ) X {sn(x 1 , ...,xn)w(O, b n (x 1 , ...,x n )) n + cn(x n I x j, ... ,xn-d} Il dPo(xJ, j=1 where cn(x n I Xj,..., xn-d is the cost of observa- tion of Xn=x n when XI =X I , ...,X n -, =X n - 1 . The rule J'is called a sequential decision rule or a sequential decision function, and the whole setup a sequential decision problem. In most of the sequential decision problems the cost of observation is assumed to be equal to a con- stant C per observation, and then the Bayes risk r*( o) for the prior distribution o satisfies the relation r*(o)=min{if f w(O,d)do, f f r*(no(x l , o))dPo(xddo + C}, where no(x I, o) denotes the posterior distri- bution when XI =X I is observed under the prior distribution o; and the Bayes decision rule satisfies the condition Sn(X1' ...,x n )= 1 ifr*(no(xj, ...,X n ; o)) = inf f w(O, d)dno(xj, ..., x n ; o), = 0 otherwise, 398 G Statistical Decision Functions and b n (X1' ..., x n ) =d* if S w(O, d*)dn o (x 1 , "', X n ; o) = infd S w(O, d) dno(x 1, ... ,X n ; o), where no(x 1, ... ,X n ; o) is the posterior distribution given XI = XI, "', X n = X n under the prior o. For the dichotomy problem with the simple loss function discussed above, the Bayes deci- sion rule has the form Sn(Xj,...,x n )=1 ifno(x1,,,,,Xn)n1 or n2, = 0 otherwise, and bn(xj,..., x n )=d 1 if no(x l , ..., xn) nj, =d o ifn o (x1' ...,x n ) n2' where no(x 1, ... ,x n ) is the posterior prob- ability that 0=00, given XI =Xj, Xn=xn. This amounts to the following rule: Continue sam- pJing as long as II <).(x I ,..., x n )= Il Poo(xi)/Il POI (Xi) <12, i and decide on do as soon as A.(X 1, ... ,X n )  12 and on d 1 as soon as A.(Xj, ""Xn)11' which is actually equivalent to a tsequential proba- biJity ratio test ( 400 Statistical Hypothesis Testing). G. Information in Statistical Experiments The part <ff = (:?r,, f7>, Q) of a decision problem A =(:?r,, f7>, Q, d, <r, w, D) is called a statistical experiment. In this section, we consider n- decision problems, i.e., those wherein Q con- sists of a finite number {I, 2, ..., n} of states, and we denote the set S = {r(l, b), r(2, b), "', r(n, b) I bEf!iJ} by L(A). Let <ff l and <ff 2 be two experiments having a common parameter space Q, and let AI and A 2 be two decision problems composed of <ff 1 and <ff 2 , respectively, and a common (d, <r, w). We say that the experiment <ff 1 is more informative than the experiment <ff 2 if L(Ad:::>L(A 2 ) for any action space C,<r) and any loss function w [6]; that is, whatever the actions proposed and the loss incurred, the experiment <ff l can offer a decision procedure at least as efficient as the experiment <ff 2 . Thus the set L(A) with A= (<ff, c, <r, w, D) represents some feature of infor- mation that <ff can provide about the states Q. However, comparison of L(A) is not easy to carry out. S. Kullback and R. A. Leibler de- fined the concept of information for the case of a dichotomy A [5]. If the probabiJity distri- bution induced by a random variable X has a Radon-Nikodym derivative f1 (x) (> 0) or f2(X) (> 0) with respect to A., we define f fdx) I(X: 1,2)=1(/1'/2)= log- f1 (X) d)., f f2(X) 
398 H Statistical Decision Functions calling this the Kullback-Leibler information number (or K-L information number). This number is uniquely determined by the set S = L(A) in Fig. I, and the larger S becomes, the larger 1 (fl Jz) becomes. If 0(, fJ (rx + fJ = I) are a priori probabilities and 11(11 x) and 11(21 x) are a posteriori probabilities of I and 2, respectively, we have, from the Bayes theorem, fl(X) 11(1 Ix) rx log-=log--log-. f2(X) 11(21 x) fJ Here the right-hand side stands for the change in the probability of the occurrence of the state after an observation of x, and the texpectation of the left-hand side under /, is 1(11,[2)' The K- L information number has the following prop- erties: (i) I(X: 1,2)0, and I(X: 1,2)=0= f = g; (ii) tindependence of X and Y implies I(X: 1,2)+/(Y: 1,2)=/(X, Y: 1,2); (iii) for a statistic T=t(x), I(X: 1,2)/(T: 1,2), where the equality holds if and only if T is sufficient for 21-'= {PI' Pz}, in which dP,/d)'= fl and dPz/ d). = fz; and, as a result of (ii), (iv) if Xl, .., X n are distributed independently with the same distribution, I(X l , ..., X n : 1,2)= n/(X, : 1,2). Suppose next that Q is the real line and that the Radon-Nikodym derivative g(t: 0) of the distribution of a statistic T with respect to a measure A has the following properties: (i) the set of all t at which g(t:O»O is independent of 0; (ii) g(t: 0) is continuously twice differentiable, and (iii) the order of differentiation with re- spect to 0 and integration with respect to t can be interchanged. Then I(T:O,O+dO)= I(T: O)dO z for an infinitesimal displacement dO of 0, where f ( D lo g g( t . O )) Z I(T:O)= if 80' g(t:O)dJe. Here I(T: 0) coincides with the Fisher infor- mation ( 399 Statistical Estimation). Suppose that we are given a sequence (Xl' Xz,...) of independent random variables whose distributions have as their density either (fl J2"") or (g" g2'''')' that is, t; and gi are candidates for the density of distribution of Xi (i = 1,2, ...). A method to determine which of the sequences (t;), (g;) actually corresponds to IX;} is given by the Kakutani theorem. Let F be the distribution of (Xl' Xz,"') when each Xi is distributed according to t;, and let G be that of (Xl' Xz, ...) when each Xi is distributed accord- ing to gi' To see how X"X z ,'" are actually dis- tributed, we assume that the loss incurred by an incorrect decision is 1 and the loss incurred by a correct decision is O. Denote such a deci- sion problem (dichotomy) by A. Then we gen- erally have L(A)c/, where 1={(x,y)IOx, y  I}. A necessary and sufficient condition for 1486 L(A) = 1 is that n, p(fn , gn) = 0, where p(fn, gn) = Ix jf n(x)gn(x)dx. Consequently, if the X n (n = 1,2,...) have the same distribution, we have L(A)= I, and the correct decision can be made with no error based on infinitely many independent obser- vations of X" X z ,.... H. Relation to Game Theory The theory of statistical decision functions is closely related to game theory. From the game-theoretic viewpoint, a statistical decision problem is considered to be a zero-sum two- person game played by the statistician against nature. A strategy of nature is the true distri- bution P of the variable X or the true value of 0, and a strategy of the statistician is a decision 6. In this setup, the risk function r(O, 6) can be regarded as a tpayoff function paid by the statistician to nature. An a priori distribution ( is a tmixed strategy of nature. A randomized decision function is a mixed strategy of the statIstician A minimax decision function cor- responds to a minimax strategy of the statis- tician. A minimax strategy of nature is called a least favorable a priori distribution. If a deci- sion problem is tstrictly determined as a game, a minimax solution is a Bayes solution in the wide sense. If 6 0 is a Bayes solution with respect to (0 and r(O, 6 0 )  R«(o, 6 0 ) for all 0, the decision problem is strictly determined, and 6 0 is mini- max and (0 is a least favorable a priori distri- bution, where R«(o, 6 0 )= Snr(O, 60)d(0(0). If 6 0 is a Bayes solution in the wide sense and r(O, 6 0 ) is constant as a function of 0, the deci- sion problem is strictly determined, and 6 0 is minimax. If 6 0 is admissible and r(O, 6 0 )= C < 00, 6 0 is minimax. If Q is a finite set and info sUPer(O, b) < 00, the decision problem is strictly determined and there exists a least favorable a priori distribution. We have few general results about generalized Bayes solu- tions ( 173 Game Theory; [6J). References [IJ A. Wald, Statistical decision functions, Wiley, 1950 (Chelsea, 1971). [2J L. LeCam, An extension of Wald's theory of statistical decision functions, Ann. Math. Statist., 26 (1955), 69-81. [3J R. R. Bahadur, Sufficiency and statistical decision functions, Ann. Math. Statist., 25 (1954),423-462. 
1487 [4J H. Kudo, On minimax invariant estimates of the transformation parameter, Nat. Sci. Rep. Ochanomizu Univ., 6 (1955), 31-73. [5J S K ullback, Information theory and sta- tistics, Wiley, 1959. [6J D. H. Blackwell and M. A. Girshick, Theory of games and statistical decisions, Wiley, 1954. [7J L. Weiss, Statistical decision theory, McGraw-Hill,1961. [8J H Raiffa and R. Schlaifer, Applied statis- tical decision theory, Harvard Univ. Graduate School of Business Administration, 1961. [9J W. James and C. Stein, Estimation with quadratic loss, Proc 4th Berkeley Symp. Math. Stat. Prob I, Univ of California Press (1961),361-379. [IOJ J. 0 Berger, Statistical decision theory, foundations, concepts, and methods, Springer, 1980. [IIJ R. H. Farrell, Weak limits of sequences of Bayes procedures in estimation theory, Ap- pendix, Decision theory, Proc. 5th Berkeley Symp. Math. Stat. Prob. I, Univ. of California Press, 1967,83-11 I. 399 (XVIII.7) Statistical Estimation A. General Remarks Statistical estimation is one of the most im- portant methods of statistical inference ( 401 Statistical Inference). Its purpose is to estimate the values of tparameters (or their functions) involved in a distribution of a statistical 'popu- lation by using observations on the popula- tion ( 396 Statistic). Let:f' = {PolO Ee} be a family oftprobability distributions, indexed by a parameter 0 and defined over a tmeasurable space (i.e, sample space) (/l", $). Let X be a trandom variable taking values in ::r and dis- tributed according to a probability distribu- tion P that is a member of .f'. Statistical esti- mation is a method of estimating the ttrue value of the parameter 0 (i.e., the parameter 0 such that P = Po) or the (true) value g(O) of a given parametric function 9 (i e., a function defined over e) or both, at 0, based on the observed value x of the random variable X. The function 9 maps the parameter values into R, R k , or some function space. Statistical esti- mation methods are classified into two types: point estimation, which deals with individual values of g(O), and interval (or region) estima- tion, by means of which regions that may con- tain the value g(O) are considered We can also 399C Statistical Estimation include in statistical estimation the problem of predicting the tolerance region in which the value of a yet unobserved random variable may come out B. Point Estimation In the method of point estimation for a given parametric function g, we choose a measurable mapping <p from the sample space (.':l", $) into a measurable space (s:l, <f) and state that "the value of g(O) is <p(x)" for an observed value .x, where ,'21 is a set containing the range of 9 and <f is a tcomplete additive class of subsets in sl. The mapping <p, or the random variable <p(X) taking values in the space .'21, is called an estimator of g(O), while the value <p(x) deter- mined by the observed value x is called an estimate of g(O) This estimate is sometimes termed a nonrandomized estimate in contrast to the following generalized notion of esti- mator. A mapping from fll' to a set of proba- bility distributions defined over (.W', <f) is called a randomized estimator, which reduces to a nonrandomized estimator when each image distribution degenerates to a single point. We assume that .'21 = Rand <f = the class $ of all tBorel sets in R, unless stated otherwise. We denote the texpectation and tvariance with respect to Po by Eo and Vo, respectively. c. U nbiasedness An estimator <p(X) of g(O) may not be exactly equal to g(O) for any OEe except for trivial cases, but could instead be stochastically distributed around it. An estimator <p(X) IS said to have unbiasedness if it is stochasticall y balanced around g(O) in some sense, such as mean, median, or mode. A statistic <p(X) is called a (mean) unbiased estimator of g(O) if Eo(<p(X))= g(O) for any OEe. A parametric function 9 IS said to be estimable if it has an unbiased estimator. For example, the sample mean is unbiased for the population mean: Eo(X)= Ee(X) for any OEe. Unbiasedness usually imphes mean un- biasedness, and we assume this unless stated otherwise. The function b(O)= Ee(<p(X))-g(O) is called the bias of the estimator <p(X) If we restrict ourselves to unbiased estimators cp(X) only, it is best to choose, if possible, a <p(X) whose variance Vo(<p(X)) is minimum uni- formly for every OEe. Theorem (Rao-Blackwell) If T= t(X) is a tsufficient statistic, then for any unbiased 
399D Statistical Estimation estimator <p(X) of 9 the tconditional expecta- tion t/J(t) = E(<p(X) I T= t) yields another un- biased estimator <p*(X)= t/J(t(X)) of g, which satisfies Ve(<p*) Ve(<p) for all OEe, with the equality holding if and only if <p(x)= <p*(x) (a.e..o/'). The notation a.e..o/' means that the statement concerned holds with probability 1 with respect to Po for each OEe. An estimator <p of g(O) is called a uniformly minimum var- iance (or UMV) unbiased estimator if <p is unbiased for g(O) and has a minimum variance uniformly in e among the class of unbiased estimators for g(O) Theorem (Lehmann-Scheffe). If T is a suffi- cient and tcomplete statistic, then for any estimable parametric function g(O), there exists a unique UMV unbiased estimator of g(O) that is a function of T. For exam- ple, suppose that X =(Xj,X 2 , ...,X n ) is a ran- dom sample from a population with exponen- tial type distribution Po with density po(x) = 13(0)u(x)exp(=IIX;(0)t;(x)) with respect to Lebesgue measure and that the set { (IX 1 (0), '" , IXk(O)) I OEe} contains some open set of R k . In this case, T= (t 1 (X), "', tdX)) is a sufficient and complete statistic, and hence every real- valued measurable function t/J(T) is the unique UMV unbiased estimator of the parametric function Eo(t/J(T)). If for any OEe the tmedian of the distribution of an estimator <p(X) equals a real parametric function g(8) when X is dis- tributed as Po, i.e., if Po{ <p(X) < g(O)} !  Po{ <p(X)  g(O)}, then <p(X) is called a median unbiased esti- mator. For example, a sample median (suitably defined for the case of an even number of samples) is median unbiased for the popula- tion median. If <p(X) is a median unbiased estimator of g, then for any real-valued mono- tone function h, an estimator h(<p(X)) for h(g(O)) is median unbiased, that is, median unbiasedness is preserved under monotone transformations, which is not the case with mean unbiasedness. Restricting our consideration to the class of all median unbiased estimators, we can use the function { Po{<P(X)u} a(u,O,<p)= Po{<p(X)u} for u < 0 for u> 0 as an indicator of the behavior pattern of an estimator <po The estimator <p that minimizes a(u, 0, <p) for all values of u and 0 (u "'" 0) is said to be uniformly best. This property is also preserved under monotone transformations. Theorem (Birnbaum). If a family of distri- butions {Po I OE e c R} has a monotone tlike- lihood ratio with respect to a statistic t(x) and the distribution function F(t, 0) of T= t(X) is continuous both in t for any 0 and in 0 for any 1488 t, then there exists a uniformly best median unbiased estimator of O. Actually, if 0 = O(t) is a solution of F(t, 0)= 1/2, then <p(X)= O(t(X)) is such an estimator. If for any OE e the tmode of the density function or the probability mass function of an estimator <p(X) is equal to g(O), then <p(X) is ca\1ed a modal unbiased estimator of g(O). D. Lower Bounded of the Variance of an Unbiased Estimator When there does not exist a sufficient and complete statistic, we can still seek to minimize the variance of the (mean) unbiased estimator at every fixed point 0= 0 0 , In the lemainder of this section, .0/' is assumed to be dominated by a measure /1, and Po (x) denotes the density function of Po with respect to /1 and no(x) = Po(x)/Poo(x). The foIlowing theorem guaran- tees the existence of the 10ca\1y best unbiased estimator. Theorem (Barankin). Let At be the set of all unbiased estimators of a parametric function gee) with finite variance at 0 = 0 0 , A.ssume that o4t is not empty and EOo((no(X))2) < 00. Then there exists an estimator <Po in At that mini- mizes the variance at 0 0 within At. Actua\1y, {<Po} = At n .0/'0, where .0/'0 is the linear space generated by {no(x)IOEe}. The minimum variance is given as follows: Ve o ( <Po) = inf{ Voo(<p(X)); <pE At} = SUPl { C a;h(O;) r I Eoo Ct a;no,(X) Y} =supn{jt h(O;)h(OPij} , where h(O)=g(O)-g(Oo) and ).;j is the (i,j)- component of the inverse of the n x n matrix (A;) with A;j= Eoo(no,(X) no/X)) and where the supremum SUP1 is taken over a11 positive in- tegers n, 0 1 , ..., OnEe and aj, ..., QnE R, and becomes SUPn when the supremum is taken over n and the OJ' This theorem leads to the following three theorems with respect to the lower bound of the variance of an unbiased estimator. The first is immediate and the last two are obtained by replacing some order differences of no with the corresponding differentials. Theorem. For any unbiased estimator <p(X) of 9 and 0oEe, we have Veo(<p(X)) sup {(g(O)- g(00))2 /EoJ(no(X) OEe - nOo(X))2} (Chapman-Robbins-Kiefer inequality). 
1489 Theorem. Suppose E>cR. For any unbiased estimator <p(X) of 9 and under certain regular- ity conditions, we have v, ( ( X » >- (g'(00»2 eo <p ---- Eoo((iJlogPe(X)/oOI e-el) (Cramer-Rao inequality), where the equality holds only for the exponential type distri- bution Pe(x) = f)(O)u(x) exp(IX(O) <p(x». An ex- ample of such regularity conditions is (i)-(iii): (i) E eo ((n e (X»2) < co for all OEE>; (ii) Pe(x) has a partial derivative p(x) at 0=0 0 (a.e. Pe); and (iii) lim E [( pe O H1e(X)-pe o (X) P0(X» ) 2 J =0. M-O eo Pe (X),10 Pe (X) o 0 Corollary. Let X = (XI, ..., X n ) be a random sample from a distribution with density f (x, 0) and let 1(0) = Ee((iJ logf(X 1 , 0)/00)2). Since E e ((010gPe(X)/00)2)=nl(O), the Cramer- Rao inequality implies VOo(<p(X»  (g'(00»2 /nl (0 0 ), The number 1(0 0 ) is called the Fisher infor- mation of the distribution f(x, 0). When the equality holds for an unbiased estimator <p(X), <p(X) is called an efficient estimator of g(O). In general, the efficiency of an unbiased estimator <p at 0 = 0 0 is defined by Eff(<p) = (g'(00))2 /(n/(Oo) Veo(<p). Theorem. For any unbiased estimator <p(X) of g(O) and under certain regularity conditions. we have k k Veo(<p(X) I I g{i)(Oo)g(j)(Oo)Kij i-I j=l (Bhattacharyya inequality), where g(i)(Oo) = d i g(O)/dO i I e-eo and Kij is the (i,j)-component of the inverse of the matrix (K ij ) with K ..= E ( pi(X)PW(X» ) I) 0 0 ( X ) ( X ) ' i,j=I,...,k. Poo P eo An example of such regularity conditions is (i)-(iii): (i) Ee o (ne(X»2) < 00; (ii) Pe(x) is k-times differentiable with respect to 0 at 0 = 0 0 ; (iii) the ith partial derivative p(x), i  k, satisfies 1" E [( ,1i pe (X)le=e o P(X» ) 2 J =0 AO eo Peo(X) (M)i Peo(X) , where ,1Pe(X)le-e o =Pe o +M(X)-Pe o (X) and ,1i Pe(X) = ,1(,1i-1 Pe(X») for i 2. For k = 1 the Bhattacharyya lower bound is the same as the Cramer-Rao lower bound. In general, the former gives a sharper lower bound than the latter. If the parameter is multidimensional, 0 = (0 1 "", Ok)" then for any unbiased estimator 399E Statistical Estimation <p(X) of g(O) and under similar conditions to those for the I-dimensional case, we have k k Veo(<p(X»  I I g;(Oo)g;(Oo)Jij, i=l j=l where g;(Oo)=og(O)/oOile=e o and Jij is the (i,j)- component of the inverse of the matrix (Jij) with J ij = Eeo(o 10gPe(X)/00i I e=e o o 10gPe(X)/00jl e=eJ If O*(X)=(Oi(X),..., O:(X»' is an unbiased estimator of 0=(0 1 ,,,,, Ok)' (i.e., E o (8;*(X»=O, for i = 1,2, "', k), then the tcovariance matrix Veo(O*(X» of O*(X) at 0 0 satisfies Veo(O*(X»)  r 1 , that is, the difference Ve(O*(X»-r l is a nonnegative definite matrix. If X =(X 1 , ..., X n ) is a random sample from a distribution having density f(x 1 , 0), then by setting lij = Eeo(o logf(X 1 , 0)/00;1 e=e o x 0 logf(X 1 , O)/oOj I e=e o )' we have Jij = nl'j' The matrix 1= (Iij) is called the Fisher information matrix of the distri- bution f(x 1 , 0). E. Decision-Theoretic Formulation ( 398 Statistical Decision Functions) Let W(O, a) (O) be the loss incurred from an estimate (or action) a of the parameter when the true value of the parameter is O. The risk function of an estimator <p(X) of the para- metric function g(O) is then defined as r(O, <p) = Ee(W(O, <p(X)). Statistical decision theory deals with the prob- lem of minimizing, in an appropriate manner, the risk function by a suitable choice of <po The notions of complete class, Bayes estimator, admissibility, minimax estimator, and invar- iant estimator, explained here and in Sections F-I, are the most important of the theory. The unbiased estimator explained in Section C may also be considered an important concept of the theory. A class C of estimators is said to be essen- tially complete if for any estimator <p there exists an estimator <Po in C such that r(O, <Po)  rIO, <p) for any OEE>. The following two theorems hold, provided that the action space .'7i is R and the loss function W(O, a) is convex with respect to aEd for any OEE>. Theorem (Hodges-Lehmann). If W(O,a)-> 00 as lal->oo, then the class of all nonrandomized estimators is essentially complete. Theorem. If T = t(X) is a sufficient statistic, then the class of all functions of T is essentially 
399 F Statistical Estimation complete. Actually, given any estimator <p(X) of g(O), the conditional expectation Ij;(t)= E(<p(X)1 T=t) yields an estimator <Po(X)= lj;(t(X)) satisfying 1'(0, (Po)  1'(0, (p) for any 0, where the equality holds when and only when <p=<po, provided that W is convex in the strict sense. A loss function of the form W(0,a)=).(0)(a-g(0))2, ;.(0»0, is called a quadratic loss functions. If, in partic- ular, ;.(0)= 1, then 1'(0, <p) = Ee((<P(X) - g(0))2) is called the mean square error of the estimator <p(X) of g(O). This error coincides with the variance when the estimator is unbiased. F. Bayes Estimators Let  be an a priori distribution over the parameter space 8 associated with a certain t()-algebra , and assume that 1'(0, <p) is - measurable for every <po Denote by E the average operator relative to . The infimum of the average risk r(,<p)=E(r(O,<p))= E Ee( W(O, <p(X))) for <p running over its range is called the tBayes risk relative to , while an estimator <p(X) of g(O) at which the average risk r(, <p) attains the infimum is called a Bayes estimator relative to . A Bayes esti- mator is obtained as follows: Assume that /1 is dominated by a measure 11 with 'B x - measurable density Pe(x), the loss function W( 0, a) is  x (£-measurable, and L Pe(x)d(O) < 00. For each observed value of x, the Bayes esti- mator <p(x) takes the value a that minimizes r(al x) = E(W(O, a) I x) = f. W(O, a)p(O I x)d(O), (0) where p(O I x) is the probability density, with given x, of O. We call r(a I x) the posterior risk. Theorem (Girshick-Savage). Suppose that the loss function is quadratic. For any x the value of the posterior risk is either 00 (for every value of a) or finite (for all or only one value of a). If the Bayes risk relative to ( is finite, then a Bayes estimator <p*(X) relative to  is determined uniquely as foIlows: <p*(x)=a o if r(a o I x) < CXJ for only one value ao' whereas <p*(x) = E(g(O)).(()) I x)/E(;'(())! x) if r(a I x) < "JJ for every a. If E(J.(O)) < 00, then a Bayes esti- mator is either biased or has average risk zero. 1490 G. Admissibility of Estimators An estimator <Po (X) of a parametric function g(O) is said to be admissible if and only if for any estimator <p(X) of g(O) the inequality r(O,<p)r(O,<po) for all OE8 implies that 1'(0, <p) = 1'(0, <Po) for alI OE8. Ifan estimator is the unique Bayes estimator relative to some a priori distribution, then it is admissible. For exam- ple, let Xl' ..., X n be a random sample from N(O, 1), and let W(O,a)=(a-Of Then <p(X) =(C+M 2 X)/(1 +M2) is the unique Bayes estimator relative to the prior distribution N(c, ()2) of 0, where X is the sample mean. Hence any estimator of the form 7(X)=aX +h is admissible when 0< a < 1 and --00 < h < 00. I n the rest of this section and the next sec- tion, we restrict ourselves to quadratic loss functions. If a statistic of the form c<p(X) with real c is an admissible estimator of g(O), then inf ( g(O) Ee(<P; ) csup ( g(O) Ee i<& ) . e Ee( <p ) e Ee( <p2) Theorem (Karlin). Let X be a random vari- able having a I-dimensional expo tlential type distribution dPe(x) = {3(O) eexd l1(x) with a param- eter space 8 = I ({J, 8) = {O I J",oo eex dl1(x) < oo} a closed or open interval, and let g(O) = Ee(X) = -13'(0)/13(0) be a parametric function to be estimated. Then the estimator <PiJX) = X/ (). + 1) for real ic is admissible provided that J(#({)W!.dO-->oo as h-->O and J(p'(OW!.dO-->oo as a-->{J for any CE({), 0). Corollary. When 8 = (-00, (0) the estimator <Po(X)=X is admissible. Corollary. Let 8=( -00, (0), and assume that both intervals (-oc,O] and [0, (0) have positive measure with respect to Jl. Then every estimator of the form <p(X) = aX with 0< a  I is admissible. This theorem can be applied to a random sample XI, ..., X n drawn from an exponential type distribution because the sufficient statistic X =  X,fn has an exponen- tial type distribution and Ee(X) = Ee(X,). Theorem (Karlin). Let X be a random variable having a distribution dPe(x) = q(O)' r(x)dx for OXO and =0 otherwise, with the parameter space 8=(0, (0), where Jr(x)dx< +00 and J;;' r(x)dx= 00. Among the estima- tors of the type <p,(X)=c(q(X)r a for g(O)= (q(OW with IX>O, only one estimator with the value c=(21X+ 1)/(IX+ 1) is admissible. This theorem is applicable also when the size of the random sample is larger than 1. Theorem (Stein). Let Xl' ..., X n be a random sample from a univariate distribution dPeCx) = f (x - 0) dx with a location parameter O. Define Ak=Ad x " ....x n )= ref O{Qf(Xi-O))dO 
1491 for k=O, 1,2. If f h f if, ( A2 (  ) 2 ) 3!2 n ... --- fl {(xJ - -if, Ao Ao j' n X fl dX j < 00, i=l then the Pitman estimator <Po(X" ..., X n ) = A,(X" ...,Xn)/Ao(X" ...,X n ) of the parameter o is admissible. Inadmissibility of the Usual Estimator for Three or More Location Parameters. Let X = (X" ..., Xd' be a k-variate normal random variable with mean 0 = (0" ..., Ok)' and covar- iance matrix I, the identity. Then the Pitman estimator of FJ is (j = X. However, Stein showed that X is inadmissible. It is strictly dominated by the estimator iJ*(X)=(1-(k-2)/IXI 2 )X, where 1'1 denotes the Euclidean norm IXI 2 = L:, X?, That is, if k?;3, EI0*(X)012< EIX _01 2 for any O. An estimator such as O*(X) is called Stein's shrinkage estimator (James and Stein, Proc. 4th Berkeley Symp., I (1960)). H. Minimax Estimation An estimator 'p*(X) IS said to be minimax if and only if sup r(O, <p*) = infsup r(O, <p). 6 'fJ e If an estimator <p* is admissible and the risk r(O, <p*) is constant with respect to 0, then <p* is minimax. Theorem (Hodges-Lehmann). A Bayes esti- mator <p* relative to an a priori distribution  is minimax if  assigns the whole proba- bilIty to a subset ill c e, r(O, <p*) is constant (say, c) for OEW, and r(O, <p*)  c for OEe. Let X have a binomial distribution B(n, 0),0 < 0< I; 0 is unknown. If the prior distribution of 0 is a beta distribution fJ( n/2 , ;;!j), then the Bayes estimator is T*(X) = (X + j;;/2)!(n + j;;), which has constant risk Ee(T*(X)- 0)2 = (2(1 + j;;)r 2 for all 0, 0 <0 < 1. Thus, ac- cording to this theorem, T*(X) is minimax. It is interesting to compare the mean square error of T*(X) to that of minimum variance unbiased estimator {j = X !n, 0(1 - O)/n. Theorem (Wald). If there exists a sequence of prior distributions {n} such that Iiminf ( r(O,<p*)-inf r r(O,<P)dn(O) ) O n--1-,y qJ Je for any OEe, then <p* is minimax. For exam- ple, the last theorem led to the proof of the fact that the Pitman estimator of a location parameter is minimax [4]. In the discussion of robust estimation, Huber (Ann. Math. Statist., 399 J Statistical Estimation 35 (1964)) proved that Huber's minimax robust location estimator minimizes the maximum asymptotic variance over some family of sym- metric distributions in a neighborhood of the normal distribution ( 371 Robust and Non- parametric Methods). I. Invariant Estimator For simplicity, assume that the range A of a parametric function g(O) coincides with the parameter space, and consider the point esti- mation of g(O)==:O (0 is not necessarily real). Suppose that there exist two groups G = {T} and G = {"f} of one-to-one measurable trans- formations of!£ and e, respectively, onto themselves such that (i) there exists a homeo- morphic mapping T-+"f from G to G; (ii) if X has a distribution Pe, then TX has the distri- bution P'6; and (iii) W("fO, "fa) = W(O,a) for any 0, a, and T. An estimator <p is said to be invariant if it satisfies <p(u) = "f<p (x) for any T E G, a.e. &. An estimator <p is called a best invariant estimator if the risk function r(FJ, <p'), where <p' is an invariant estimator, takes its minimum value when <p' = <po If the group Gis ttransitive on e, then the risk function of any invariant estimator is independent of the value of 0, and hence any admissible invariant estimator is best invar- iant. For example, in the point estimation of a location parameter with a quadratic loss func- tion, the Pitman estimator is best invariant. Theorem. If e is a compact topological space, G is a group of homeomorphisms of e onto itself, and G is homeomorphic to e under 9 E G -+gOo E e with a fixed 0 0 E e, then any Bayes estimator relative to the tright- invariant Haar measure over G is best in- variant. This result can be generalized to a locally compact e ( 398 Statistical Decision Functions). J. Sequential Estimation Estimation methods based on sequential sam- pling are not as popular as tsequential tests, because their efficiency is not very large com- pared to that of nonsequential estimation. A generalization of the Cramer-Rao inequality to any sequential unbiased estimator <p(X) of a parametric function g(O) is the Wolfowitz inequality, Ve(<p(X)) ?;(g'(O))2 /(Ee(N)I(O)) for every 0 E e, under regularity conditions similar to those for the fixed-size sample prob- lem, where N is the sample size and 1(0) the Fisher information. 
399 K Statistical Estimation K. Asymptotic Theory In practical problems of statistical inference the sample size n is often large enough to give sharp estimates of the parameters involved; then the sample distributions of estimators can be approximated closely by their asymptotic distributions, which are of a simpler nature. Assume that X=(X 1 ,X 2 , ...) is a sequence of independent and identically distributed (i.i.d.) random variables with the common distri- bution Pe, OEE>. For each n, let <Pn=<Pn(X1' ..., X n ) be an estimator of g(O) that is a func- tion of 0 to .91 (c RP). Thus <Pn is a measurable mapping from (:'In, !En) to (.91, 'ti). Let us de- note the distribution of <Pn by .:l'( <Pn), .:l'(<Pn I 0) or 2'(<Pn I Pe), the last two emphasizing that the underlying probability distribution is Pe. For example, if the mean vector Ee(<pn)=mn(O) and the covariance matrix vn(O)= Vo(<Pn) exist for every n, and if .:l'[v n (Or l / 2 (<Pn(X) -mn(O)) 1 OJ-> Np(O,I) as n-> 00, then .:l'(<Pn I 0) is approxi- mated by a p-variate normal distribution Np(mn(O), vn(O)) ( 341 Probability Measures D). <Pn is said to be asymptotically (mean) unbiased for g(O) if mn(O)->g(O) for any OEE> as n->oo. But we often calculate the asymptotic distribution without obtaining the exact mean and covariance matrix of the estimator <Pn for each n. In the asymptotic theory it may be reasonable to regard the sequence of estima- tors {<Pn} rather than each estimator <Pn as an "estimator," but we do not bother with the difference between these definitions of an estimator. Consistency. {<Pn} is called a consistent esti- mator of g(O) if <Pn converges to g(O) in proba- bility as n->oo: IimPe{l<pn-g(O)I><.}=O for any <.>0 and every OEE>. If the convergence is almost sure, it is called a.s. consistent. For example, if <Pn is asymptoti- caIly unbiased with the covariance matrix vn(O) such that 1 v n ( 0) 1->0 as n -> 00, then <Pn is a consistent estimator of g(O). A sufficient con- dition for existence of a consistent estimator is \ given by the foIlowing result. Theorem (LeCam). Let :'l be a Euclidean space and !E the O"-algebra of all Borel sets in .?l'. If the parameter space E> is a locally com- pact subset of R k , Pe i= PO' for any 0 i= O' (identi- fiability condition), and P e . ->P e whenever On-> 0, then there exists a consistent estimator of O. <Pn=g(T,,) is a consistent estimator of g(O) if {Tn} is a consistent estimator of 0 and g(O) is a continuous function of O. Asymptotic Normality. The class of estimators is restricted to what are caIled consistent esti- 1492 mators. An estimator {<Pn} is said to be asymp- totically normally distributed if tht' asymptotic distribution of n 1/2 (<Pn - g( 0» is normal: 2'[n I/2 (<Pn-g(0))IOJ->N p (/1(0), v(O)) as n->oo. /1(0) and v(O) are caIled the asymptotic bias and asymptotic covariance matrix, respectively. They are not always equal to the limits, if any, of the mean and covariance matri  of n 1/ 2 (<Pn - g(O)). {<Pn} is usuaIly caIled a consistent and asymptotically normal (CAN) estimator if the asymptotic distribution of n I/2 (<Pn -g(O)) is normal with the asymptotic bias zero. Then the distribution 2'(<Pn 1 0) is apprm.imated by Np(O, v(O)jn). For example, the "moment method estimator is a CAN estimator. Theorem. Let {<Pn} be a CAN estimator of g(0)ER 1 with asymptotic variance v(O). Then Iiminf Ee{nl<Pn-g(OW};:.v(O) for every OEE>. nXJ Theorem. Suppose that g(O) is a continu- ously differentiable function from E> ( c Rk) to RP (pk). Let G(O)=(agi(O)jao j ), the tJaco- bian matrix. If {T,,} is a CAN estimator of 0 with asymptotic covariance matrix v(O), then {g(T,,)} is a CAN estimator of g(O): .:l'[ n 1/2(g(T,,) - g(O) 1 OJ -> Np(O, G(/;I) v(O) G(O)'). An estimator {T,,} is said to be a best asymp- totically normal (BAN) estimator of 0 if {T,,} is a CAN estimator of IJ with asymptotic variance r 1 (0), where /(0) is the Wisher in- formation matrix on 0 in a single observation. We can see that the maximum likelihood (ML) estimator ( Section M) is a BAN estimator. Functional on Distribution Functions. Let <p(F) be a functional on distribution functions to R 1. Let us consider the class of estimators that are defined by <Pn = <p(Fn), where Fn is the tem- pirical distribution function of n samples Xl' ..., Xn. An estimator {<Pn] with <Pn = <p(Fn) for each n is said to be Fisher consistent for g(O) if <p(Fe) =g(O) for every OE E> when Fe is the true distribution function. {<Pn] is also a.s. consistent for g(O) if {<Pn} is a Fisher consistent estimator of g(O) and if <P is a continuous func- tional. Furthermore, if <P is differentiable, we can see that {<Pn} is also a CAN estimator by using the fact that n I/2 (Fn(Fe- 1 (t))- t), Ot 1, converges weakly to the tBrownian bridge. Let S be a set of distribution functions. S is said to be a star-shaped set of F if HE S implies F(t) =(1 - t)F + tH E S for any tE [0,1]. Theorem (Von Mises). Assume that (Fl) there exists a star-shaped set Se at F such that limn Pe{ FnE Se} = 1; (F2) for any tE[O, IJ and HES e , there exist derivatives (d i jdt i )<p[(1 - t)F +tHJ, i = 1,2; (F3) there exists t/Je(Y) from R 1 to R 1 such that 
1493 d f oo <P [(1- t)Fe +tHJ 1 '=0 = t/Je(y)d(H(y)  -00 - Fe(Y)) for all HE Se; and (F4) Ii mPe { n'-. sup I d: <p[Fn l >[; } =o n-oo 'E[O. 1] dt for any band [;>0, where F=(I-t)Fe+tFn' Then if <Pn with <Pn = <p(F n ) is Fisher consistent for g(O), we have .:l'[n 1/2 (<Pn - g(O)) I OJ -+N(O, V(O)), where v(O)= f:oo t/JbfdFe(y)-{f:ac t/Je(Y)dFe(y)f {cn}-Consistency. For a sequence of positive numbers C n tending to infinity as n-+ 00, an estimator T" is called consistent for 0 E E> with order C n (or {cn}-consistent for short) if for every [; > 0 and every 0 E E> there exist a suffi- ciently small positive number b and a suffi- ciently large number K satisfying Iim sup sup[P,{ cnl T" - TI  K}; I, - 01 < bJ < E. n-+oo Let {c n } be a maximal order of consistency. This notion was introduced by Takeuchi and Akahira. They studied consistent estimators of location parameters with various orders. Let 2[=E>=R 1 . Suppose that for every OEE>, Pe has a density function f(x - 0) with respect to the Lebesgue measure. Theorem. Assume that (DC1) {(x»O if a <x <b and f(x)=O if xa or xb; (DC2) there exist positive numbers 0 < rx  fJ < 00 and 0 < A', B' < 00 such that lim (x-a)I-'f(x)=A', x-+a+O Iim (b-X)I-f(x)=B'; x......b-O (DC3) f(x) is twice continuously differentiable in the interval (a, b) and there exist positive numbers 0 <A", B" < 00 such that lim (x-a)HIf'(x)I=A", x-+a+O lim (b-X)2-I{'(x)I=B". x......b-O Assume further that f"(x) is bounded if rx  2. Then for each rx there exists a consistent esti- mator with the order given in Table 1. Table 1 { cn}-consistent rx order C n estimator 0<a<2 nIl' {min Xi + max X i - (a+b)}/2 x=2 (nlogn)1 / 2 ML estimator rx>2 n 1/2 ML estimator 399M Statistical Estimation L. Moment Method The moment method is also utilized to obtain estimators. Suppose that ,q{ c R 1 and E> c R k . Denote the tpopulation distribution function by Fe and the tempirical distribution function of n samples Xl' ..., X n by Fn. The following system of simultaneous equations is derived from letting the jth tpopulation moment 11/0)= Ee(xj) = f xjFe(dx) be equal to the jth tsample moment n f 1 . .  mnj=n- iXt= xJFn(dx). For example, for j= 1,..., k, 11(0)=(111 (0), ..., I1k(O))' =(m n1 , ..., m nk )' =m n . A moment method estimator is determined as a solution 0= 0n(X)E E> of k numbers of simulta- neous equations. Theorem. Assume that the function 11(0) from E> to R k is continuously differentiable and that the Jacobian matrix M(O) = 811(0)/80 = (8I1i(O)/80 j ), i,j= 1, ...,k, is nonsingular in a neighborhood of the true parameter. Then the moment method estimator exists and is a CAN estimator: .:l'[n I/2 (On -0) I OJ -+Nk(O, M(O)-1 v(O)M(Oy-l), where v(O)=(cove(X i , xj)), i, j = 1, ..., k. In general, a moment method estimator is not a BAN estimator. However, in view of its simple form, a moment method estimator is important and often utilized as a first-step estimator in order to determine the maximum likelihood estimator by the iteration method. M. Maximum Likelihood Method Suppose that a distribution Pe has the density function f(x, 0), OEE>cR\ with respect to a to'- finite measure 11, and let XI' ..., X n be observed values of random samples XI, ..., X n from the population f(x, 0). Then the function Ln of 0 defined by n Ln(O; XI' ..., x n ) = TI f(x;, 0) i=l is called the likelihood function. If 0 = Dn(x 1, ..., x n ) maximizes the value of Ln(O) for fixed Xl, .., , X n and if it is a measurable mapping from (2[n, !En) to (E>, ((f) with '"{} a to'-algebra of subsets of E>, then 0n(X)= (In(X 1 ,,,,, X n ) is called the maximum likelihood (ML) estimator of 0. This method of finding estimators is called the maximum likelihood method. If the parameter is transformed into a new para- meter n=hlo) bv means of a known one-to- 
399M Statistical Estimation one bimeasurable transformation h and if there exists a unique ML estimator en of 0, then n(X)=h(On(X») is a unique ML estimator of I). In other words, the ML estimator is in- variant for every one-to-one transformation. Many statisticians have investigated and improved known adequate regularity con- ditions under which the ML estimator exists and is a BAN estimator. Theorem (Wald). Assume that (C1) E> is a closed subset of R k with nonempty interior E>0; (C2) for any x EX, f(x, 0) is continuous with respect to 0 and limlel_J f(x, 0) = 0 if E> is not bounded; (C3) if 0 1 of-0 2 , then Pel of-Pe 2 and Jlf(xA)- f(x, °2)dJ1(x) > 0; (C4) Eeo(iIogf(X, 0 0 )1) < 00; and (C5) Eeo(log +f(X, 0, p») < 00 and Eeo(log+ <p(X, r)) < 00, where f(x, 0, p'j=sup{f(x, 0'); 10' -01 p} and <p(x, r) = sup {f(x, 0); 101> r}. (The last two functions are measurable according to assumption (C2).) Then if a sequence of measurable functions, {Un(x h .'. ,x n )}, satisfies lim i nf { Ln(On; XI' ..., Xn) } ? C > 0 (a.s. Pe), n-x L n (00;x 1 ,...,x n ) then as n-+oo, U n (X 1 , ...,X n ) converges a.s. to the true value 00 of the parameter. Hence if the ML estimator exists, it is a.s. consistent. Pfanzagl (Metrika, 14 (1969» and Fu and GIeser (Ann. /nst. Statist. Math., 27 (1975» gave rigorous proofs for the existence of the ML estimator. Theorem Under assumptions (Cl) and (C2), there exists a maximum likelihood estimator On for any positive integer n. That is, On = On(x l' "" X n ) is a measurable function from (.1' n . n) to (E>/6) and satisfies L(On' x h '" ,x n ) =SllPe L(O; XI' ..., x n ). In the remainder of this section we suppose that assumptions (Cl )-(C5) are satisfied. We use the notation f = ( ,r'! f(X,O) ) , a k-column vector, cO cO i 2 ) ; = ( , D: f(X,O» ) , a k x k matrix, d COiCOj (310gf = ( r'!3 logf(x, 0) ) 3 ' h, i, j = I, .. . , k. r'!O r'!OhaOiCOj Theorem (Cramer). Assume that (ANI) for a.s. [J1]x, f(x, 0) is three-times con- tinuously differentiable with respect to each component of 0=(0 1 ,,,,, 0d' EE>°; (AN2) for OEE>O, f af(:,O) dJ1=O .1: 00 f a 2 f(x,0) and d J1 = Q' 1 a0 2 ' 1494 (AN3) Ee(\ ;: logf(x,o)n< 00 for OEE>°; (AN4) the Fisher information matrix /(0)= Eo[ ( :O IOgf(X, 0)) ( :O IOgf(X, O):n exists and is positive definite for OEE>°; and (AN5) there exists an H(x) such that 1 :;3 10gf(x,0)! <H(x) and Ee(H(X))<C, a constant for OEE>°. Then the maximum likelihood estimator is a BAN estimator: 2'[n 1 / 2 (e n -0) I 0]-+ N(O,/(O(I) as n-+oo for OEE>°. Note that under assumption (AN1) the likelihood func- tion attains the maximum in E>0 with proba- bility tending to 1 as n -+ 00 if the true value of the parameter exists in E>0. Hence the ML estimator is determined as a root 0 = en (x 1 , ..., x n ) of the likelihood equation with the same probability as above: (:0 10gLn(0)=( a(;)j logf(x i , 0)}1. .k =0. We also call n- 1 alogL n (0)/cO the likelihood estimating function. The essential fact used in the proof of the above theorem is the asymp- totic equivalence of the ML estimator and the likelihood estimating function: lln(0)-/(0)n 1 / 2 (e n -0)-+O in Pe as n-+oo, where lln(0)=n- 1 / 2 alogL n (0)/aO. Note the fact that 2'[ lln(O) 1 0] -+ Nk(O, /(0» as n -+ (I.) holds according to the centrallirnit theorem ( 250 Limit Theorems in Probability Theory B(I)). Contiguity. We now turn to the situation where we need asymptotic distributions under the alternative distribution Pe+n-l!2h with 0 + n- 1 / 2 hEE> in estimation as we do in testing hypotheses ( 400 Statistical Hypothesis Testing). The notion of contiguity, due to LeCam (1960), is basic for the as) mptotic methods of estimation theory. We consider sequences {Pn} and {P} of probability mea- sures on (.¥n, n) with the tRadon-Nikodym derivatives Pn and P with respect to a a-finite measure, such as Pn+P' Denote by Xn= A [P; Pn] a generalized log-likelihood ratio that is defined by 10gP/Pn on the set {PnP > O} and is an arbitrary measurable function on the set {PnP = OJ. Let {Bn} be any sequence of {n}-measurable sets, and let {7;.} be any sequence of {n}-measurable functions. A sequence of distributions {2'[7;.1 Pn]} is said to be relatively compact if every subsequence {n'} c {n) contains a further subsequence [m} c {n'} along which it converges to a prob- 
1495 ability distribution. In the Euclidean space relative compactness is equivalent to tight- ness: that is, for every I' > 0 there is a b(l') such that Pn{1 7;.1 >b(F.)} <F. for every n. Theorem. The following statements are all equivalent. (1) For any {T,,}, 7;.-->0 in Pn if and only if 7;.--> o in P. (2) For any {7;.}, {2'[7;,\Pn]} is relatively compact if and only if {2'[7;.1 P]} is relatively compact. (3) For any {Bn}, Pn [Bn} -->0 if and only if P {Bn} -->0. (4) Whatever the choice of Xn' {2'[Xn I Pn]} and {2'[Xn I P]} are relatively compact. (5) Whatever the choice of Xn, {2'[Xn 1 Pn]} is relatively compact. Furthermore, if {m} c {n} is a subsequence of {n} such that 2"[Xm 1 Pm] converges to 2'[X], then E {eX} = 1. Two sequences {Pn} and {P) satisfying requirements (1)-(5) of the above theorem are said to be contiguous. Theorem. Suppose that {Pn} and {P} are contiguous. Let {m} c {n} be a subsequence such that 2'[Xm, TmiPm] converges to a limit ..P[X, T]. Then ..P[Xm, Tm I P] converges to eXY[X, T], where v=e x 2'[x, T] is given by v(A)= SA e X d2'[x, T]. Now, set Pn = P; and P = P;+n-',2h for each n. Under suitable regularity conditions, such as assumptions (Cl)-(C5) and (ANl)-(AN5), it is easy to see that {P;} and {P;+n 1 2h} are contiguous. At the same time, we can see that the asymptotic linearity of An(O + n -1/ 2 h; 0) = A[P;+n-' 2h;P;] holds (say) in the vicimty of the true parameter as follows: An(0+n-112h;(})-h'n(0)+h'/(0)h-->O in Po' 2 The asymptotic equivalence of the ML es- timator and n(O), and the asymptotic linearity of the log-likelihood function and n(O), leads to the regularity ( Section N) of the ML estimator. Theorem. Under suitable regularity con- ditions as above, the ML estimator is regular: 2'[n I/2 (8" - 0)- h I 0 + n -11 2 h] --> Nk(O, /(Or l ), for any hER k with 0+n- l/2 hEe. N. Asymptotic Efficiency In Section D we discussed lower bounds of variances of unbiased estimators for finite sample size and defined the efficiency of an unbiased estimator with variance vn(O) by (v n (0)n/(0)r 1 . In this section we first discuss the asymptotic efficiency of a CAN estimator in the same vein as in the case of finite sample size. Second, we see a specific approach to the large-sample theory of estimatIOn. Throughout 399N Statistical Estimation this section we assume (Cl)-(C5) and (AN1)- (AN5) stated in Section M. BAN Estimators. We suppose that the para- meter space e is a subset of R I in this para- graph. We restrict our attention to the class of CAN estimators {7;.} of the real-valued parameter 0 for which 2'[n I/2 (7;, - 0) I 0]--> N(O, v(O)) as n-->oo. Fisher's conjecture con- cerning the lower bound to asymptotic var- iance v(O) of any CAN estimator is v(O)? /(Or 1 , where /(0) is the Fisher information on 0 in a single observation. The asymptotic efficiency of a CAN estimator with asymptotic var- iance v(O) is defined by (v(0)/(0)f1. A CAN estimator with asymptotic variance /(Or l is called a BAN estimator or an asymptotically efficient estimator. Note that under suitable regularity conditions there always exists an asymptotic efficient estimator, for example a ML estimator, although for a sample of finite size there exists an efficient estimator if and only if the family of density functions is of the exponential type. Unfortunately, Fisher's conjecture is not true without further conditions on the compet- ing estimators. A counterexample was pro- vided by Hodges and reported by LeCam (1953). Let {7;,) be any CAN estimator with the asymptotic variance v(O). Consider the estimator T' = { XT n if 17;,1 < n -1 / 4, n 7;, if 17;,1? n -'/ 4 , where 0 < x < 1 is a constant. {T:} is also a CAN estimator with asymptotic variance v'(O) such that { X 2 v(O) if 0 = 0, v'(O) = v(O) otherwise. Let {7;,} be a BAN estimator; then T: is an estimator with asymptotic variance less than /(Or 1 and is called a superefficient estimator. Theorem (LeCam). The set of points 0 for which the inequality due to Fisher fails is of Lebesgue measure zero. A condition due to Bahadur leads to the continuity of asymptotic variance which Implies the validity of the above inequality. Theorem (Bahadur). Suppose that {7;,) is a CAN estimator with asymptotic variance v(O) satisfying the condition 1 lim inf P O )+n-!l2{ 7;, < 0 0 + n- 112 } - 2 n--->G(J ( or 1 lim inf Po +n 1/2{ Tn > 0 0 + n- 112 ) - 2 ' n-+ 'L 0 
3990 Statistical Estimation Then the following inequality, due to Fisher, holds at 0=0 0 : v(Oo)? /(Oor'. Regular Estimators. W olfowitz and Kaufman considered an operationally more justifiable restriction on competing estimators, called the uniformity property, stating that, for an estimator {T,,} of 0, 2'[n I/2 (T,,-0)IOJ(y) con- verges to any limit Le(Y) uniformly in (y, O)E R k X C, where C is any compact subset of the interior eO of e c R k . The ML estimator {en} also has this uniformity property under suit- able regularity conditions, such as (C1)-(C5) and (AN1)-(AN5) to which some uniform- ity properties are added [29J We note that asymptotic variance is not a good measure- ment of asymptotic efficiency unless an estima- tor is a CAN estimator, and that asymptotic concentration is in general a more pertinent measurement. Theorem. For an estimator {T,,} with uni- formity property above, it holds that the limit Le(Y) is a probability distribution function and continuous for either one of the variables y or 0 if the other is fixed and furthermore that the probability measure Le is absolutely contin- uous with respect to the Lebesgue measure on R k . Theorem. The asymptotic concentration of the ML estimator {On} about 0 is not less than that of any estimator {T,,} with uniformity property: For any convex and symmetric set ScR k about the origin, lim Pe{nI/2«()n-0)ES}? lim Pe{nI/2(Tn-0)ES}. n---+(f 11-'1) Schmetterer (Research papers in statistics, F. N. David (ed.), 1966) provided the notion of the continuous convergence of distributions of estimators of 0 which is weaker than uniform convergence. An estimator {Tn} is said to be regular if 2'[n '/2(T" - 0) - h I 0+ n -1/ 2 hJ -+Le as n-+ 00, where Le is a probability distribution inde- pendent of h with 0 + n -'/ 2 h E e. It was shown that the ML estimator {On} is regular under ordinary regularity conditions. Hajek obtained the following characterization of the asymp- totic distribution of any regular estimator and, independently Inagaki (Ann. /nst. Statist. Math., 22 (1970); 25 (1973)) obtained a similar result. Theorem. The asymptotic distribution Lo of any regular estimator {T,,} is represented as the tconvolution of a normal distribution Nil and some residual distribution G e : Le=Ne*G e , where N e = Nk(O, /(0)-1), the asymptotic distri- 1496 but ion of the ML estimator {en} in the ordinary regular case. It follows from the characteriza- tion Le = N e * G e , that the first two theorems in this section hold also for regular estimators. ( LeCam, Proc. 6th Berkeley Symp., 1972, and Roussas and Soms, Ann. /nst Statist. Math., 25 (1973).) O. Higher-Order Asymptotic Efficiency In Section N it was shown that the ML esti- mator is a BAN estimator. In general, how- ever, there exist many BAN estimators. For example, consider the case of a tmultinomial distribution where probabilities of events are parametrized. That is, let X = (n 1 , ..., n m )', n = n, + '" + n m , be distributed as a multinomial distribution M(n; 1T 1, ... , 1T m ), 1T 1 + .., + 1Tm = 1, and let fle be a subset of m-vectors, fle = {1T(O)=(1T,(O), ..., 1T m (O))';OEe}, EkR 1 . De- fine P=(P1'"'' Pm)' =(n1/n, ..., nm/n)'. Then we consider (i) the ML estimator, (ii) the minimum-chi-square estimator, (iii) the mini- mum modified chi-square estimator, (iv) the minimum Haldane discrepancy (D k ) estima- tor, (v) the minimum Hellinger distance (HD) estimator, and (vi) the minimum Kullback- Leibler (K-L) information estimator. These are defined as the values of the parameter 0 that minimize the following quantities, respectively: (i) ML= -logL n = -nLIIP;log1T;(O); (ii) X 2 = LI=1 (nPi - n1T;(O))2/(n1T;(0)); (iji) mod / = LI1 (np; - n1T;(0))2 /(np;); (iv) Dk = :£111T;(O)k+1 / p7; (v) HD =cos -1 L1 (p;1T;(O))I;2; and (vi) K-L = LI,1T;(O)log(1T;(O)/PJ Rao [32J showed that under suitable regularity conditions these estimators are Fisher consistent and BAN estimators. Fisher-Rao Approach to Second-Order Asymp- totic Efficiency . For 0 E e eRr, let Pen be the density for n i.i.d. observations x =(x 1 , ...,x n ), and let qen be the density of estimator Tn. The tFisher information contained in X and in T" are defined by n/(O) = E(d logPen/dO)2 and n/dO)=E(dlogqen/dO)2, respectively. Rao defined the first-order (asymptotic) efficient estimator Tn satisfying one of the following conditions: (1) nI/2IdlogPen/dO-dlogqen/dOI-+O in probability; (2) /(O)-/dO)-+O as n-+ 00; (3) the asymptotic correlation between n'/2(T,,-0) and n'/2dlogPen/dO is unity; (4) InI/2dlogPenldO-a-fJnI/2(Tn-O)l-+O in prob- ability. We note that the larger the condition number (j), the stronger the condition. A first- order efficient estimator is a BAN estimator. Fisher proposed £; = lim (n/(O)-n/Tn(O)) = lim Vo(dlogPon/dO n-CG n-a:; - d log qenldO) 
1497 as a measure of second-order (asymptotic) efficiency to distinguish different BAN es- timators. Fisher stated without any sort of proof that the maximum likelihood estimator minimizes E 2 , i.e., maximizes second-order efficiency. Rao proposed E 2 =min Ve(d10gPon/dO - an 1/2 - [311(7;, - 0) A - }cl1(T,,- 0)2) as a measure of second-order efficiency for first-order efficient estimators {7;,} satisfying condition (4). He showed that the estimators mentioned above for multinomial distribution are first-order efficient estimators satisfying condition (4) and furthermore calculated second-order efficiencies of these estima- tors measured in terms of E 2 : (i) E 2 (ML)= },2(0)1(0); (ii) E2(X2)=(0)+E2(ML); (iii) E2(mod/)=4(0)+ E 2 (ML); (iv) E 2 (D k ) = (k+ 1)2(0)+E2(ML); (v) E2(HD)=(0)/4 +E 2 (ML); (vi) E2(K-L)=(0)+E2(ML) with ,,/(0)=(/102 -2/121 + /140)/[2(0)-1 - (/1L + /10 - 2/111 /130)/ /3(0), m (O)= I (n;(0)/ni(0))2/2-J140/ 1 (0) i-=l + J10/(2P(0)), where J1" = /1,,(0) = Eo{ ((df(X, O)/dO)/f(X, 0))' x (W.f(X, O)/d0 2 )/f(X, OJ)'} m = I ni(O)(ni(O)/ni(O))'(n;'(O)/ni(O))'. i=l Rao [33J gave another definition of second- order efficiency based on the expansion of the variance of 7;, after correcting for bias: Ve( 7;,) =(111(0))-1 + IjJ(O)I1- 2 + 0(11- 2 ). The quantity I(O) is considered to be a measure of second- order efficiency. The results of Fisher and Rao were confined to multinomial distributions. Efron (AI1I1. Statist., 3 (1975)) and Ghosh and Subramanyan (Saflkhyii, sec. A, 36 (1974)) extended the results to the so-called curved exponential family of distributions. Efron gave a geometric interpretation to the effect that second-order efficiency is related to the curva- ture of a statistical problem corresponding to ((0) above, and S. Amari recently extended this differential-geometric approach to the discussion of higher-order efficiency of esti- mators. Rao suggested that E 2 is equal to E 2 . Ghosh and Subramanyan gave a sufficient condition for the equality to hold, whereas Efron provided a counterexample to show that E 2 =P E 2 in general. Pfanzagl and Takeuchi-Akahira Approaches to Higher-Order Asymptotic Efficiency. For each 399P Statistical Estimation k= I, 2, ...,a t{cn}-consistent estimator {Tn} is said to be the kth-order asymptotically median unbiased (AMU) estimator iffor any OEe (c R 1) there exists a positive number,) such that lim sup C-I I Pt{7;,<T}- I =o nx It-Ol<o 2 or lim sup (' -I I Pt{7;,T}- I =o. nT 1'-01<0 2 This notion, which is an extension of the con- dition due to Bahadur for the asymptotic efficiency, was introduced by Takeuchi and Akahira. For a kth-order AMU estimator {T,,}, Go(t, 0) + ('n- 1 Gdt, 0) +... + ('n-k+l G H (t, 0) is called the kth-order asymptotic distribution of c n ( 7;, - 0) if lim c-IIPO{ c n (7;, - 0) < t} - Go(t, 0) n-cx -C n - 1 Gdt, 0)-... -C n -k+l Gk+dt, 0)1 =0. Pfanzagl and Takeuchi and Akahira obtained the concrete form of the second- or third-order asymptotic distribution of the ML estimator. A kth-order AMU estimator IS said to be kth-order asymptotic efficient if the kth-order asymptotic distribution of it attains uniformly the bound for the kth-order asymptotic distri- butions of the kth-order AMU estimators. Takeuchi and Akahira showed that under suitable regularity conditions, 7;, is second- order asymptottc efficient if J {(111 (0))1/2( 7;, - 0) < t} = <I>(t) + 11- 1 / 2 (3/111 (0) + 2/130(0))/(6/(0)3/2) x t 2 <p(t) + 0(11- 112 ), where <I>(t) is the standard normal distribution function and <p(t) is its density function, and further that the modified ML estimator 0; = tt + 11- 1 J130(8,,)/(61 (On)2) for the ML estimator {in is second-order asymp- totic efficient. Pfanzagl (AI1I1. Statist., 1(1973)) obtained a similar result. The formulation due to Takeuchi and Akahira is more extensive since it can be applied to the so-called non- regular cases. P. Estimating Equations We often determine an estimator as a solution 0= T,,(x 1, "', x n ) of an equation 'I'n(x 1, ..., X n ; 0) =0, for example, the ML estimator as a solu- tion of the likelihood equation. In such case, such an equation is calIed an estimating equa- tion and '£In (0) = '£In (XI , ..., X n ; 0), a random function, is called an estimating function [3]. 
399Q Statistical Estimation Call T., an estimator based on an estimating function 'l'n(O). The following result is a modifi- cation of a theorem due to Hodges and Leh- mann (Ann. Math. Statist., 34 (1963)). Theorem. Let E> be an interval of R'. Sup- pose that a real-valued estimating function 'l'n(O) satisfies the following three conditions: (M 1) 'l'n(O) is a nonincreasing function of the real parameter 0; (M2) for any real number h, n I/2 '1'n(00 + n -1/ 2 h) -n I/2 '1'n(00) + yh--+O in probability, wherey is a positive constant; and (M3) ,,<l'[n 1/2 'l'n(OO)J (Y)--+Ct>(Y/O"), where ct> is a continuous distribution function with zero mean and unit variance. Define an estimator based on 'l'n by T., =(0: + 0:*)/2, where 0: =sup{ 01 'l'n(O) > O} and 0:* =inf{OI'l'n(O)<O}, Then we have ,,<l'[n I/2 (Tn- 0 0 )] (y)--+ct>(yyjO") as n--+oo. Huber consid- ered a formulation that guarantees the asymp- totic normality of an M -estimator. An M- estimator Tn is defined by a minimum problem of the form Li1 p(X;, T.,) = mine Li=t p(Xj, 0) or by an implicit equation LiIIjJ(Xi' T.,)=O, where p is an arbitrary function and ljJ(x, 0) = Dp(x, OJ/DO. Note that p(x, 0) = -Iogf(x, 0) gives the ordinary ML estimator. Let E> be a closed subset of R\ let (.q(,!E, P) be a proba- bility space, and let ljJ(x, 0) be some function on!£ x E> with value in R k . Assume that XI, X 2 , '" are independent random variables with values in !£ having the common probability distribution P that need not be a member of the parametric family. Consider an estimating function 1 n 'l'n(O) =- L IjJ(X i , 0). nio:::l Assume that (Nl) for each fixed OE E>, ljJ(x, 0) is !E- measurable and ljJ(x, 0) is tseparable; (N2) the expected value A(O)=E{IjJ(X,O)} exists for all 0 E E>, and has a unique zero at O=OoEE>°; (N3) there exists a continuous function that is bounded away from zero, b(O)? b o > 0, such that E(suPe{ IIjJ(X, O)l!b(O)}) < 00, Iim inelro {1).(O)I/b(O)}? 1, and E(lim SUPI81ac {11jJ(X i , 0) - A(O)I/b(O)}) < 1; (N4) for u(x, 0, d)=sup{lljJ(x, T)-IjJ(X, 0)11 IT-OId}, (i) as d--+O, E(u(X, 0, d))--+O, (ii) there exist positive numbers do, b l , and b 2 such that E(u(X, O,d))bl d and E(u(X, 0, d)2) b2d for 0 and d satisfying 0<10-001 +ddo; (N5) in some neighborhood of 0 0 , A(O) is con- tinuously differentiable and the Jacobian ma- trix at 0 = 0 0 , A = (aA;(Oo)/oO), is nonsingular; and (N6) the covariance matrix L"=E{IjJ(X,Oo)' IjJ(X, 0o)'} exists and is positive definite. 1498 Theorem. Suppose that an estimator {T.,} satisfies 'l'n(T.,)--+O a.s. (or in probability) as n--+oo. Then, under (Nl)-(N4) (i), T.,--+0o a.s. (or in probability) as n--+oo. Lemma. Under (Nl)-(N5), sup{ l'I'n(T) - 'l'n(OO) - ).(T)I/(n -1/ 2 + IA(,)I); IT--Ooldo}--+O in probability as n --+ 00. Suppose that {2'[n I/2 '1'n(Tn)J} is ttight. It implies that 'l'n(T.,)--+O in probability, and hence from the above theorem T.,.....00 in prob- ability. Thus letting T = Tn we ha ve 1 'l'n(T.,) - 'l'n(OO) - ).(T.,)I/(n -1/2 + 1).( T.,)I)--+O in probability. That is, for any £ > 0 there exists an no such that for n?n o , P{n I/2 j).(T.,)1 <£+!nI/2'1'n(T.,)-nI/2'1'n(00)1} > 1-£. This and the tightness of {,,<l'[ n 1/2'1'n(T.,)J} and {2'[n I/2 '1'n(00)J} lead to the tightness of {,,<l'[n I/2 (T., - 0o)J}, and the converse is also true. At the same time we have n l ,2'1'n(T.,)- nI/2'1'n(00)-AnI/2(T.,-00)--+O in probability. The following theorem is a straightforward consequence. Theorem. Suppose that an estimator 8. satis- fies n I/2 '1'n(8.)--+O as n--+oo. Then, under (Nl)- (N6),,,<l'[nI/2(8n-Oo)IPJ--+NdO,A -'LA' 1) as n --+ 00 . Q. Interval Estimation Interval estimation or region estimation is a method of statistical inference uti! ized to esti- mate the true value g(O) of the given para- metric function by stating that g(O) belongs to a subset S(x) of .91, based on the observed value x of the random variable X If P e {g(O)ES(X)}?I-1X foranyOEE> for a constant IX (0 < IX < 1), then the random region S(X) is called a confidence region of g(O) of confidence level 1 -IX, and the infim um of the left-hand side with respect to 19EE> is called the confidence coefficient. In particular, if E> c R and a confidence region is an interval, as is often the case, then the region is called a confidence interval, and two boundaries of the interval are called confidence limits. If a partic- ular subset S(X) among the set of confidence regions of g(O) of confidence level I-IX mini- mizes Pe{g(O')ES(X)} for all pairs ,9 and 0' (#0), S(X) is said to be uniformly most power- ful. If a confidence region S(X) of g(O) of confi- dence level 1 -IX satisfies Pe {g( 0') ,= S (X) }  1 - IX for all pairs 0 and 0' ( # 0), then it is said to be unbiased. The notion of invariane of a con- fidence region can be defined similarly, and the definition for S(X) being uniformly most powerful unbiased (UMPU) or uniformly most 
1499 powerful invariant (UMPI) can be formulated in an obvious manner. For each 80EE> let A(8 0 ) be an tacceptance region of a ttest of level rx of the thypothesis 8 =8 0 , For each XE!1: let S(X) = {8 I XE A(8), 8 E E>}. Then S(X) is a confidence region of 8 of confidence level 1- rJ.. If A(8 0 ) is an accep- tance region of a UMP test of the hypothesis 8=8 0 for each 8 0 , then S(X) is a UMP con- fidence region of 8 of confidence level 1 - rJ.. Furthermore, corresponding to an acceptance region A(8 0 ) of a UMP unbiased (invariant) test, a UMP unbiased (invariant) confidence region can be constructed in a similar manner. R. Tolerance Regions Let X and Y be distributed according to prob- ability distributions p{ and pI labeled by a common 8EE> over measurable spaces (!1:,23) and ('!Y, (£:), respectively, and consider the prob- lem of predicting a future value of the ran- dom variable Y using the observed value x of the random variabl X. If a mapping S(x) sending a point x to a set belonging to (£: is used to predict that the value of Y will lie in the set S(x), then the random region is calIed a tolerance region of Y. In particular, if a toler- ance region of a real random variable is an interval, it is called a tolerance interval and its boundaries tolerance limits. For simplicity, suppose that X and Yare independent. There are several kinds of toler- ance regions. First, if p{(Pl{Y ES(X)}  f3)y for any 8EE>, then S(X) is called a tolerance region of Y of content f3 and level y. Second, if E:(Pl{YES(X)})f3 for any 8EE>, then S(X) is called a tolerance region of Y of mean con- tent f3. Suppose that the random variable X = (XI' .,. , X n )' is a trandom sample and that both XI and Y obey the same distribution. If further the set {pII 8 E E>} forms the totality of I-dimensional tcontinuous distributions and the distribution of PI {Y E S(X)} does not depend on the choice of 8, then S(X) is called a distribution-free tolerance region. F or example, if X(l)  X(2)  .. .  X(n) is an torder statistic, then the interval [X(r), X(S)J (for r<s) is a distribution-free tolerance interval for a random variable Y, independent of XI' ..., X n , which has the same distribution as XI' References [IJ H. Cramer, Mathematical methods of statistics, Princeton Univ. Press, 1946. [2J C. R. Rao, Linear statistical inference and its applications, Wiley, second edition, 1973. [3J S. S. Wilks, Mathematical statistics, Wiley, 1962. 399 Ref. Statistical Estimation [4J S. Zacks, The theory of statistical infer- ence, Wiley, 1971. [5J A. Birnbaum, On the foundation of statis- tical inference, J. Amer. Statist. Assoc., 57 (1962),269-326. [6J E. L. Lehmann and H. Scheffe, Complete- ness, similar regions and unbiased estimation I, II, Sankhyii, sec. A, 10 (1950), 305- 340; 15 (1955),219-236. [7J H. Morimoto, N. Ikeda, and Y. Washio, Unbiased estimates based on sufficient statis- tics, Bull. Math. Statist., 6 (1956). [8J E. W. Barankin, Locally best unbiased estimates, Ann. Math. Statist., 20 (1949), 477- 501. [9J A. Bhattacharya, On some analogues to the amount of information and their uses in statistical estimation, Sankhyii, sec. A, 8 (1946), 1-14. [10J K. Ishii, Inequalities of the types of Chebyshev and Cramer-Rao and mathemat- ical programming, Ann. Inst. Statist. Math., 16 (1964),277-293. [11J c. R. Rao, Information and the accuracy attainable in the estimation of statistical para- meters, Bull. Calcutta Math. Soc., 37 (1945), 8 I -91. [12J M. A. Girshick and L. G. Savage, Bayes and minimax estimates for quadratic loss function, Proc. 2nd Berkeley Symp. Math. prob., 1 (1951), 53-74. [13J S. Karlin, Admissibility for estimation with quadratic loss, Ann. Math. Statist., 29 (1958), 406-436. [14J E. J. Pitman, The estimation of the loca- tion and scale parameters of a continuous population of any given form, Biometrika, 30 (1939), 391-421. [15J C. Stein, Inadmissibility of the usual estimator for the mean of a multivariate nor- mal distribution, Proc. 3rd Berkeley Symp. Math. Pro b., 1 (1956), 197-206. [16J C. Stein, The admissibility of the Pit- man's estimator for a single location para- meter, Ann. Math. Statist., 30 (1959), 970-979. [17J 1. L. Hodges and E. L. Lehmann, Some problems in minimax estimation, Ann. Math. Statist., 21 (1950), 182-197. [18J H. Kudo, On minimax invariant esti- mates of the transformation parameter, Nat. Sci. Rep. Ochanomizu Univ., 6 (1955), 31-73. [19J J. Wolfowitz, The efficiency of sequential estimates and Wald's equation for sequential processes, Ann. Math. Statist., 18 (1947),215- 230. [20J R. A. Fisher, On the mathematical foun- dations of theoretical statistics, Philos. Trans. Roy. Soc. London, 222 (1922), 309-368. [21J R. A. Fisher, Theory of statistical estima- tion, Proc. Cambridge Philos. Soc., 22 (1925), 700- 725. 
400 A Statistical Hypothesis Testing [22J R. von Mises, On the asymptotic distri- butions of differentiable statistical functions, Ann. Math. Statist., 18 (1947), 309-348. [23J L. LeCam, On some asymptotic prop- erties of maximum likelihood estimates and related Bayes estimates, Univ. California Pub!. Statist., I (1953),277-330. [24J L. LeCam, Locally asymptotically normal families of distributions, Univ. California Pub!. Statist., 3 (1960), 37-98. [25J G. G. Roussas, Contiguity of probability measures, Cambridge Univ. Press, 1972. [26J A. Wald, Note on the consistency of the maximum likelihood estimate, Ann. Math. Statist., 20 (1949),595-601. [27J R. R. Bahadur, On Fisher's bound for asymptotic variances, Ann. Math. Statist., 35 (1964), 1545- 1552. [28J J. Hajek, A characterization of limiting distributions of regular estimates, Z. Wahr- sheinlichkeitstheorie und Verw. Gebiete, 14 (1970),323-330. [29J S. Kaufrr::m, Asymptotic efficiency of the maximum likelihood estimator, Ann. Inst. Statist. Math., 18 (1966), 155-178. [30J J. Wolfowitz, Asymptotic efficiency of the maximum likelihood estimator, Theory Prob. App!., 10 (1965),247-260. [31J M. Akahira and K. Takeuchi, Asymptotic efficiency of statistical estimators: Concepts and higher order asymptotic efficiency, Lec- ture notes in statist. 7, Springer, 1981. [32J C. R. Rao, Asymptotic efficiency and limiting information, Proc. 4th Berkeley Symp. Math. Prob., 1 (1961),531-546. [33J C. R. Rao, Criteria of estimation in large samples, Sankhya, sec. A, 25 (1963), 189-206. [34J P J. Huber, The behavior of maximum likelihood estimates under nonstandard con- ditions, Proc. 5th Berkeley Symp. Math. Sta- tist. Prob., 1 (1967),22 1- 233. 400 (XVII1.8) Statistical Hypothesis Testing A. General Remarks A statistical hypothesis is a proposition about the tprobability distribution of a tsample X. If it is known that the tdistribution of X belongs to a family of distributions :J> = {Pe 10 E Q} with a parameter space Q, the hypothesis can be stated as follows: The value of the parameter () belongs to W H , where WH is a nonempty subset of the parameter space Q. This hypothesis is also written simply as H:IiEW H . When W II 1500 consists of one point, it is called a simple hy- pothesis, otherwise a composite hypothesis. Let ,uJ" be a tsample space associated with a t(J-algebra !B of subsets of!£. To test a hypoth- esis H is to decide whether H is false, based on the observation of a sample X (E .'1'). The asser- tion that H is not false does not necessarily imply the validity of H. Such an assertion is called the acceptance of H, while the opposite assertion, that H is false, is called the rejection of H. In this framework for the testing prob- lem, H is often called a null hypothesis ( 401 Statistical Inference). Consider a testing procedure in which H is rejected with probability <p(x) (0", <p(x) < 1) and accepted with probabilIty 1- <p(x), when xE!l' is observed. This testing procedure is characterized by the function <p on .'1' with range in [0, 1]. Here <p(x) is taken as !B- measurable on :'[', and is called a test function or test. If <p(x) is the indicator function XB(X) of a set B (E !B), then the test is rejecting H when x belongs to B and accepting H otherwise. The set B is called a critical region, and its comple- mentary set an acceptance region. A test is called a nonrandomized test if it is the indicator function of a set. Other tests are called random- ized tests. Suppose that the tdistribution of the sample X is a probability measure Pe on I :;{, !B). The probability of rejecting H when 0 is the true value of the parameter is calculated from Ee(<p) = f <p(x)Pe(dx). "'1 Let ex be a given constant in (0, 1). If a test <p(x) satisfies Ee(<P) < ex for allIiEw H , or, in other words, if the probability of rejecting H when H is true is not greater than ex, ex is called the level of <p and such a test is called a level ex test. We denote the set of all level ex tests by $( ex), and su PeEW H Ee( <p) is called t h.e size of <po To judge the merit of tests, we introduce a different hypothesis A: The true value of 0 belongs to W A eQ-w H . This is called an alter- native hypothesis, or, for simplicity, an alterna- tive. The errors of a test are divided into two kinds: errors owing to the rejection of the hypothesis H when it is true, and errors owing to the acceptance of H when it is false. The former are called errors of the first kind, and the latter, errors of the second kind. The proba- bilIty Ee(<p) of rejecting H when IiEW A , that is, the probability of the correct decision being made for OEW A , is called the power of a test or the power function. The probability of com- mitting an error of the second kind is 1 - Ee(<p) for OEWA' A testing problem is indicated by the notation (2l',!B,:J>, W H , WA)' A test <p in a class $(ex) of tests is said to be nniformly most powerfnl in $(ex) (or UMP in $()) if for any 
1501 IjJE$(Q(), Ee(cp)'?Ee(ljJ) for all OEOJA' When O)A consist of a single point, it is said to be most powerful. B. The Neyman-Pearson Fundamental Lemma Let J1 be a +(J-finite measure over (:/', IB) and /" ..., fn+' be J1-integrable real-valued func- tions. If C I' ... ,en are constants such that the set $(e I, ", en) of test functions ifJ satisfying f !fJ/;dJ1Ci' i= 1,2, ...,n, is not empty, then there exists at least one test !fJo in $(e" ..., cn) that maximizes S !fJfn+1 dJ1 among all (p in $(c" ... , Cn)' A test rP is one of these tests if it satisfies the following two conditions: (I) For appropriate constan ts k I, .., , k n '? 0, .(x) {: n when f+dx) > I kJi(X), i=l n when fn+' (x) < I kJ;(x) i=l almost everywhere with respect to J1, and (2) the equation f rPfidJ1 =C i , i=I,2,...,n, holds. If(c l , ...,e ll ) is an interior point of the subset M=f(f !fJf!dJ1,..', f !fJfndJl)I!fJ is a test} of the n-space R n and rP satisfies (2) and maxi- mizes S !fJfn+1 dJi among all !fJ in $(c I' ''', cn), then rP satisfies (1). These statements are called the Neyman-Pearson lemma. As an illustration, suppose that Q = (1,2, ... , n,n+ I) and that {PeIOEQ) is dominated by a (J-finite measure JI. Let t;(x) be the density of Pi with respect to J1. When W H is a finite set [ I, .., , n} and W A consists of a single point n+ I, then rP satisfying (I) and (2) with C 1 = .,. = C n = Q( is a uniformly most powerfulleve1 Q( test. If 23 is generated by a countable number of sets, then there exists a most powerful level Q( test for any hypothesis against a simple alter- native A:IJ=e. A method of obtaining such a most powerful test is given in the Lehman- Stein theorem: Denote by!e the density func- tion of Pe with respect to a (J-finite measure J1, and define h,(x)= f .fo(x)dic(IJ) W H for a probability measure }, on W w Consider testing the simple hypotheses HI: The density 400C Statistical Hypothesis Testing of distribution of the sample is h;., against the alternative A: IJ = e, and let !fJ, be a most powerful level rJ. test for this problem (H,: A). If SUPeCWH Ee(!fJ,)  iX, !fJ, is a most powerfu level Q( test for testing H: IJ E W H against A: e = 0. The measure ), satisfies Ee(!fJ;,') '? Ee(!fJ,) for any probability measure J.' on w H and is called a least favorable distribution. When the alternative hypothesis W A consists of more than one point, a uniformly most powerful test does not generally exist. How- ever, ifQ=R, w H =( -oo,IJ o ], W A =(0 0 , (0), and /(x) is a density function with tmonotone likelihood ratio with respect to a statistic T(x), then a UMP level Q( test !fJ(x) exists and is defined by !fJ(x) = 1 if T(x) > c; = a constant if T(x)=c; and =0 if T(x)<c. For a one- parameter exponential family of distributions, there exists a UMP level Q( test for testing H :W H =( -ex;, ° 1 ] U [° 2 , (0) against A:w A = «()I,()Z) (0 1 <Oz). However, a UMP test does not exist for the problem obtained by inter- changing the positions of W H and 0) A' Since hypothesis-testing problems admitting UMP tests are rather rare, alternative ways for Judging the merit of tests are needed, and two have been devised. The first is to restrict the class $ of tests and to find a UM P test in this restricted class. The second is to introduce an alternative criterion of optimality and to select a test accordingly. The first is discussed in detail in Sections C, D, and E, and the second in Section F. C. Unbiased Tests The unbiasedness criterion is based on the idea that the probability of rejecting the hy- pothesis H when it is true (the probability of an error of the first kind) should preferably be no larger than that of rejecting H when it is false (the power). If a level Q( test !fJ satisfies Ee(!fJ) '? Q( for OEW A , then !fJ is called an unbiased level Q( test. Let $u(Q() be the set of all unbiased level Q( tests. A UMP test in $u(Q() is called a uni- formly most powerful (or UMP) unbiased level Q( test. If Pe is of the +exponential family whose parameter space Q is a finite or an infinite open interval of R k , then there exists a UM P unbiased level Q( test for the following two problems: (1) H:O, a, A:O, >a, where 0 1 is the first coordinate of IJ = (0\, ..., Od; and (2) H:OI =a, A:OI =fa. For example, when the sample is normally distributed with unknown mean J1 and unknown variance (Jz, the Student test (defined in Section 0) for a hypothesis H : J1 = J10 against an alternative A: J1 =f 110 is a UMP unbiased test. 
400 D Statistical Hypothesis Testing D. Similar Tests and Neyman Structure If Ee(CP) is constant for a11 OEW (eQ), cP is called a similar test with respect to w. If Ee(CP) is a continuous function of 0, an unbiased test cP is similar with respect to the common boundary w of W H and WA, provided that Q is a topological space and the density function is continuous in w. Therefore, in this case, unbiased tests are found in the class of all tests similar with respect to W. Let a statistic y = T(x) be tsufficient for gJ",= {Pe I OE w}. A test cP is said to have Neyman structure with respect to T if the conditional expectation E(cp I T(x)= y) of cp equals a constant [gJ",]. (For the notation [g:oJ  396 Statistic.) A test cp having Neyman structure with respect to the statistic T(x) is similar with respect to w. A test cp similar with respect to W has Neyman struc- ture with respect to T(x) if and only if the family .2= {Qe= Pe T-tl OEW} oftmarginal distributions of T is tboundedly complete. E. Invariant Tests Consider groups G and G of one-to-one trans- formations on f£ and Q, respectively. Suppose that each element of G is a measurable trans- formation of f£ onto itself (i.e., gB E 23 for any BE23) and that a homomorphism g->g of G into G is defined so that Pe(g-IB)=P"e(B). The hypothesis H:OEwH and the alternative A: o E W A are said to be invariant under G if gw H = W H and gW A = W A for a11 9 E G, and in this case the testing problem (f£, 23, gJ, w H , w A ) is said to be invariant under G. A test is called an invariant test if cp(gx) = cp(x) for every gEG, and Ege(CP) = Ee(CP) holds for any invariant cp. Accordingly, if G is ttransitive on W H , and invariant test is similar with respect to W H . If the sample space f£ is a subset of R n that is invariant under the translation (x t, ." ,x")-> (x t + a, ..., x" + a) with a real a and if there exists a O'=a.OEQ such that Pe.(B)=Pe({(x t -a, ...,x"-a)l(x t , ...,x")EB}) for any OEQ, then a is a transformation on Q. In this case the real number a is ca11ed a location para- meter. Furthermore, if the sample space f£ is a subset invariant under the similarity trans- formation (x t. ..., x")->(ax t, ..., ax") (a> 0) and if there exists a 0' = ij' OE Q such that Pw(B) = Pe({(Xt/a,... ,x"/a)l(xt. ...,x")EB}) for any OE Q, then the real number a is ca1\ed a scale parameter. The in variance principle states that a test for a testing problem invariant under G should preferably be invariant under G. A test cp(x) is called an almost invariant test if cp(gx) = cp(x) [&1'] for a11 gE G. Suppose that the testing problem of a hypoth- esis under consideration is invariant under a 1502 transformation group G on the sample space f£. Denote the set of a11 invariant level rx tests by $[(rx). A test that is uniformly most power- ful in $[(rx) is ca11ed a uniformly most powerful (in short, UMP) invariant level rx test. If there exists a unique UMP unbiased lev<:\ rx test cp*, then a UMP invariant level rx test (if it exists) coincides with cp*[gJ]. When T(x) is tmaximal invariant under G, a necessary and sufficient condition for cp(x) to be invariant is that cp be a function of T(x). For example, suppose that the sample X =(X t , ...,X") is taken from N(j1,(J2) with unknown j1 and (J2. In this situation, Y = (X, S) is a sufficient sta tistic, where X = "i.:i X;/n and S = ) L(X i - X)2 . Let G be the group of trans- formations (x, S)->(6, cs) (c > 0) on the range Ij of Yand G be the group of transformations (j1, (J2)->(Cj1, C(J2) (c >0) on the parameter space. Both the hypotheses H t : j1/(J2  0 and H 2 : j1/(J2 =0 are invariant under G. Since t=. x/(s/) is maximal invariant, any invari- ant level rx test is in the class of functions of t. The Student test, defined in Section G, is UMP invariant under G. F. Minimax Tests and Most Stringent Tests Minimax tests and most stringent tests are sometimes used as alternatives to lJMP tests. Suppose that gJ={PeIOEQ} is a tdominated family and 23 is generated by a countable number of sets. A level rx test cp* is ca11ed a minimax level rx test if for any level rx test cp, inf Ee(CP*);;:O inf Ee(CP). OewA 8eWA Such a test exists for any rx E (0, 1). If a group G of measurable transformations on '!C leaves a testing problem invariant, then an intimate relation exists between the minimax property and in variance. Concerning this relation, we have the fo1\owing theorem: For each rxE(O, 1) there is an almost invariant level rx test that is minimax if there exists a (J-field  of subsets of G and a sequence {v"} of probability mea- sures on (G,) such that (i) BE23 implies {(x,g)lgxE B} E23 x; (ii) A E, gEG implies Ag Ed; and (iii) limnoo Iv"(Ag)- v"(A)1 =0 for any AE and gEG. Fundamental in the invar- iant testing problem is the Hunt-Stein lemma: Under the condition just stated, for any cp there exists an almost invariant test IjJ such that inf Ege(cp)Ee(IjJ)supEge(CP)' geG geG The following six types of transformation groups satisfy the condition of the theorem: (1) the group of translations on R", (2) the group of similarity transformations on R", (3) the 
1503 group of transformations 9 = (a, b): (x I, '" , x n ) E Rn(aX, +b. ...,aXn+b)ER n (O<a< oc, -'JJ < b < 'JJ), (4) finite groups, (5) the group of ortho- gonal transformations on R n , and (6) the direct product of a finite number of the groups men- tioned in (1H5). We call fi:(O) =SUP<PE</I(.) E 6 (<p) an envelope power function, and <p* (E$(a)) is called a most stringent level a test if sup (fi:(O) - E 6 (<p*))  sup (fi:(O) - E 6 (<p)) OE(J)A eEWA for any <pE$(Q(). There exists a most strin- gent level Q( test for each aE (0, I). If a testing problem is invariant under a transformation group G on X and G satisfies the condition in the Hunt-Stein lemma, then a uniformly most powerful invariant level a test is most stringent among the level a tests ( 398 Sta- tistical Decision Functions). Admissibility of a test and completeness of a class of tests are defined with respect to the probability of an error of the second kind ( 398 Statistical Decision Functions). The uni- formly most powerful level Q( test and the uniformly most powerful unbiased level a test are admissible. G. Useful Tests Concerning Normal Distributions (Appendix A, Table 23) In this section, we treat the rejection regions 5 that are commonly used in testing problems related to normal distributions. Let a be the level of 5, and let c(a) and d(a) be positive numbers determined by a. In (1 )-(5) below, the sample consists of n mutually independent random variables X" ..., X n each of which is assumed to be normally distributed with mean 11 and variance (J2. For any sample point x=(x 1 , ...,x n ), denote L7=lx;/n by x and L7dxi-X)2 by S2. (1) To test the hypothesis 11  110 against an a1ternative 11 > 110' we can use as a critical region 5 = {x I t(x) > c(a)}, where the tesl statistic t(x) is given by (x -110)/ J s 2/(n =-1). (2) To test the hypothesis 11 = J10 against an alternative 11 i=- 110' we can use 5 = {x II t(x)l? c(a)} with the same test statistic t(x) as in (I). These tests based on the statistic t(x) are generally calIed Student tests or t-tests. (3) To test the hypothesis (J2 = (J against the alternative (J2 > (J with (J6 > 0, we can use 5 = {x I X 2 (x)? c(Q()}, where X 2 =S2 /(J. (4) To test the hypothesis (J2 > (J6 against the alternative (J2 < (J6, we can use 5 = {x I X 2 (x) c(a)}, where X 2 is the same as in (3). (5) To test the hy- pothesis (52 = (56 against (52 of- (J6, we can use 5={x\X2(x)c(a) or ?d(a)}, where / is the same as in (3). Each 6f these tests based on the statistic l is called a chi-square test. Among 400G Statistical Hypothesis Testing these tests, (3) is UMP and (1) is UMP when !X? 1/2. All tests (1)-(5) are UMP unbiased, and (3)-(5) are UMP invariant under the translations (x 10 ..' , xn)(x 1 + a, "', X n + a) (-'JJ < a < 'JJ). Since (1) and (2) are also UMP invariant under the transformations (x 1, .., , x n ) (aX1"'" ax n ) (O<a < 00), they are most stringent tests. Suppose that XI,"" X m are independently distributed according to N (111' (Jf) and that Y" ..., y" are independently distributed ac- cording to N(112, (In, where 111,112' (J1o and (J2 are assumed unknown unless otherwise stated. Here we give the important tests for 111,112' af, (Ji. Let x =(x 1 ,..., x m ) and Y=(Y1' "', Yn) be sample points in R m and R n , respectively, and denote L1 x;/m, L71 y;/n, Ldxi-X)2, and L71 (Yi - y)2 by x, y, s, and s;, respectively. (6) Assume that (J, and (J2 are known, and con- sider a hypothesis 111 = 112. When an alterna- tive 111 > 112 (111 of- 112) is taken, we can use as a critical region 5 = {(x, y) I T(x, y)  c(Q()} (5 = {(x,y )IIT(x,Y)I? c(a)}), where T(x,y)=(x- y)/ J (Jf!m+(Ji/n. These tests are both UMP unbiased and invariant under the translations (x"...,xm,Y"...,Yn)(XI +a,...,xm+a,y, + a, ... ,Yn + a). (7) Assume that (J 1 = (J 2' and consider a hypothesis 11, =112' When 11, i=-112 (J11 > 112) is the alternative, 5 = {(X, y) II T(x, y)1 ?c(a)} (5={(x,y)1 T(x,y»c(a)}) can be used as a critical regi on, where T(x, y) = (x- y) J m +n - 2/( J 1/m+ I/n Js + s; ). Both tests are UMP unbiased and are in- variant under (x lo ..., x m , Ylo ..., Yn)(aX1 + b, ...,ax m +b,aY1 +b, ...,aYn+b) (-00 <b< 'JJ,O<a< 00). (8) Testing the hypothesis H: {l, = 112 is calIed the Behrens-Fisher problem. Note that nothing is assumed about the rela- tion of the variances (Jf and (Ji of the two samples X and Y, in contrast to (7). It is dif- ficult to construct a statistic whose distribu- tion is independent of af and (5i when H is true. Compare this with (1 )-(7), where the proposed statistics have this property and are used to construct similar critical regions 5. The critical region {(x,Y)I(x- y)j J s/m(m-I)+s;/n(n-l) ?f(s;/s)}, with an appropriately chosen 1, is similar to such a region 5. This test is called Welch's test. (9) For a hypothesis (J1 =(J2, we can use as a critical region 5 = {(x, y) I F(x, y)  c(a)}, 5 = {(x,Y)IF(x,y)?c(a)}, or 5={(x,y)IF(x,y)?d(a) or < c(a)}, when (J1 < (J2, (J2 < (J1o or (51 i=-(J2' respectively, is taken as alternative, where F(x,y)=(n-I)s/(m-I)s;. All these tests are UMP unbiased and are invariant under the transformations (x lo ...,Xm'YIo ...,Yn) (ux, +b, ...,axm+h,ay, +b, ...,aYn+b) (-'JJ < 
400H Statistical Hypothesis Testing b < oc, 0 < a < oc). A test based on F(x, y) is called an F -test. H. Linear Hypotheses Let XI' ... ,X n be independent and distributed according to N(/1i,(J2) (i= 1,2, ...,n), where /11' J12' ..., J1, (s < n), (J are assumed to be unknown and /1i = 0 (8 < i  n). The hypothesis is H: /11 = /12 = .. . = /1r = 0 (r  s), and the alternative hypothesis is that at least one J1i, 1  i  r, does not vanish. The critical region S = {(x 1, .., , X n ) I F(x 1 ,..., Xr;X+l'.", X n ) = L=l xf /L7=+1 xl;;: e(Q()} is a UMP unbiased test for this prob- lem, and S is invariant under the group 91 of translations (X h ..., X" xr+h"" Xs, X s + 1 , "', x n ) (Xl,'" ,X r , XY+l +a,..., xs+a, xs+l"'" x n ), the group 92 of similarity transformations (XI' ...,x n )-->(ex 1 , ...,ex n ), the group 93=O(r) of orthogonal transformations in R r = {( XI, ...,x,)}, the group 94=O(n-s) of orthogonal transformations in Rn-., = {(xs+ I' .., ,x n )}, and finite products of elements of the groups 91, 92,93' and 94' This test is also a kind of F- test. More generally, let us denote X=(Xj, X 2 ,..., X n )' and assume that it is expressed as X=A.;+W, W=(W I , W 2 ,..., W n )', (I) where'; = (( I, (2, ... , (s)', s  n is a vector of un- known parameters and A a matrix of known constants, and WI' W 2 , ..., w.. are distributed independently according to the normal distri- bution with mean 0 and variance (J2. Then a general linear hypothesis is a hypothesis stating that the vector'; lies within a linear subspace M of RS. The set of points A'; with'; satisfying a linear hypothesis H is the linear subspace L(B) of L(A) spanned by the column vectors of an n x k l matrix B. Assume, for example, that the dimension of L(B) (= the rank of B) is s - r. Let C be an n x k 2 matrix whose column vectors span the orthocomplement Li(B) of the space L(B) with respect to the space L(A). Then the model (1) can be written as X=B'1+C,+W, E(W)=O, with a kl-vector 'I and a k 2 -vector', and hence the hypothesis H is represented by' = 0'. We denote by Y = P A X the projection of X onto the space L(A) and by Z the projection PBX of X onto the space L(B). The quantity QI/ = X'(P A - PB)X equals the square of the length of the vector Y - Z and represents the sum of squares of residuals for the hypothesis H. The error mean square 6 2 = X'(l- PA)X/(n - s) = Qe/(n - s) and also 8 = QII /r under H are unbiased estimators of (J2. 1504 Assume that B'C =0. Let U be an orthogo- nal transformation in Rn such that the first, ...,rth, (s+ l)st, ...,nth rows of UB and the (s + l)st,..., nth rows of UC are all equal to zero vectors. Using the notation X = UX, it= U B'1,' = UC" and W = UW, we obtain the canonical form X = it +, + W of the: model (2). W is also a vector of independently and identi- cally distributed normal random variables. The hypothesis H is expressed as ,= O. In this model, we have E(X,)=O for i=s+ 1,..., n, and moreover, E(Xi)=O for i= 1,..., r,3+ 1,..., n if and only if H is true. The least squares estimator Y of A'; is the tmaximum likelihood estimator, and X - Y, Y - Z, Z are distributed independe.ntly ac- cording to the n-variate normal distributions N(O, (J2(1- PA))' N«(P A - PB)C" (J2(PA - P B )), and N(B'1+PBC', (J2P B ), respectively. Hence Q/(J2, QI//(J2, and X' P B X/(J2 are distributed independently according to the tnoncentral X 2 - distribu tions with n - s, r, and s - r degrees of freedom and tnoncentrality parameters 0, "C'(P A - P B )CQ(J2, and (B'1+ P B CmB'1+ P B C(J/(J2, respectively. The tlikelihood ratio test of the hypothesis H has a critical region 8 /8 2 > c, is a tuniformly most powerful invar- iant test with respect to the group of linear transformations leaving the hypothesis H invariant, and is the tmost stringent test. This test is also uniformly most powerful among the tests whose tpower function has a single vari- able "C'(P A - P B )C(j(J2. Furthermore, for s- r= 1, this test is a tuniformly most powerful unbiased test. In the decomposition X'X = X'(P A -PB)X +X'PBX +X'(I- PA)X=QI/+ QB + Qe the terms QI/ and Qe are c.llled the sum of squares due to the hypothesis and due to the error, respectively. Such a process of decomposition is called the analysis of var- iance and its result is summarized in the anal- ysis of variance table (Table 1). I. The Likelihood Ratio Test (2) The likelihood ratio test is comparatively easy to construct. Let L( XI, .' . ,X n ; 0) be: the tlikeli- hood function. Then SUPOEW L(x l , ...,xn;IJ) A(xl,...,x n )- H SUPOEWHUwAL(Xj,'" ,xn;O) is called the likelihood ratio, and the test corre- sponding to the critical region S = {(XI, ..., x n ) I A(x lo ..., x n )  e,} is called the likelihood ratio test, where e, is a positive constant deter- mined by the level Q(. Let el/(xj,..., x n ) and (JI/ v A (X 1, ... ,x n ) be the tmaximum likelihood estimators for e in WI/ and in wI/Uw A , respec- tively; that is, L(x; (x))=SUPOEW L(x; 0) and L(X;(JI/VA(X))=SUPOEW Uw L(x;O).R Then H A 
J505 Table 1 400 K Statistical Hypothesis Testing Factor Degrees of Freedom Ratio of Variances (j /82 H B Error Total Sum of Squares QH = X'(P A - PB)X QB=X'PBX Qe=X'(l-PA)X X'X r Mean Square (j = QH/r (j/(s - r) 8 2 = Qe/(n - s) s-r n-s n L(x; e H ) A(x)=-. L(X;OHvA) The F-test for a linear hypothesis is a likeli- hood ratio test, and other examples of the Jike- lihood ratio test are shown in Appendix A, Table 23. However, the likelihood ratio test does not necessarily have the desirable prop- erties stated in the preceding sections. J. Complete Classes The set of critical regions of the type {x J T(x) > c} for the problems H: 0  0 0 and A: 0 > 0 0 , where the distribution family ,11 of the statistic T is of tP61ya type 2 in the strict sense, and the set of regions of the type {x I c < T(x) < d} for the problem H:O, O02 and A:O<O, or O 2 < 0, where the distribution family fJjJ of the statistic T is of tP6Jya type 3 in the strict sense, are examples of minimal complete classes [6]. It has been proved under a mUd condition that the set of all tests with convex critical regions is essentially complete when the underlying distributions are of exponential type and the null hypothesis is simple. Let P ei E.9' (i=O, I) and let 1B0 be a (J- subalgebra of lB. 1B0 is sufficient for IB w.r.t. {Poo' Po,} if and only if the class of all 1B 0 - measurable test functions is essentially com- plete, i.e. iff for every critical region BE IB there exists a lBo-measurable test function CPo such that Eoo(CPo)EoO<XB) and Eo,(C(Jo)?;Eo,(XB)' Assume that fJjJ= {poIOEe} is dominated; if for every IB-measurable test function cP there exists a lBo-measurable test function Iji such that Eo(lji) = Eo(cp) for all OEe, then 1B0 is sufficient for IB W.r.t .11 [9,1 OJ ( 398 Statis- tical Decision Functions D, 399 Statistical Estimation E). K. Asymptotic Theory Let (1" IB", P"o) (v = 1,2, ..., n) be a sequence of probability spaces, where the parameter space Q is common to all v. Let (.'?t' I "\ IBI"\ PJ"I) be the direct product probability space of (Y" IB" P,o) for v = 1,2, ... , n. For each sample space (:?[In\ lB(n), PJ")), denote a test function for H : 0 E WH( eQ) and A: OEW A ( eQ-w lI ) by CPn(x, , ...,x n ). A sequence {CPn} (n= 1,2,...) is often caUed a test. For exampJe, a Jikelihood ratio test is frequently understood as a sequence of tests Sn={(x 1 , ...,x n )IA n (x 1 , ...,Xn)An}' where {A.n} is a sequence of constants and An(x 1, '.. , x n ) is the likelihood ratio defined by (:?l'ln), IBln), PJn)) and W II . If a test {CPn} satisfies Eo(CPn}-+O (OEWII) and Eo(CPn)-+1 (OEW A ) as n -+ 00, {CPn} is said to be a consistent test. If these convergences are uniform with respect to 0, {CPn} is said to be a uniformly consistent test. When a uniformly consistent test exists, W H and WA are said to be finitely distinguishable. Suppose that the observed values are identi- cally distributed (that is, (::£" IB", P,o) is a copy of a probability space (f, IB, Pe)) and WII and (VA are both compact with respect to the met- ric p(O, 0') = SUPBE'BlPe(B) - Po.(B)J. In this case, W II and W A are finitely distinguishable if Eo(cp) is a continuous function of 0 for any cP [4]. Kakutani's theorem ( 398 Statistical Decision Functions) is regarded as a proposi- tion concerning distinguishability when the null hypothesis and the alternative are both simple. The following result about the tlimit distri- bution of a likelihood ratio is due to H. Cher- noff [3]: Let yln) be an n-space and Q be an open subset of R k containing the origin O. Suppose that the observed random variables are independent and distributed according to a density f(x, 0); that is, the likelihood function L(x; 0) is I171 f (x;, 0). Moreover, assume the following regularity conditions: (1) log f(x, 0) is three-times differentiable with respect to 0 at every point of the closure of some neighborhood N of 0 =0. (2) There exist an integrable function F and a measurable function H such that (i) IN/DO.! < F(x) for every OE N; (ii) ID2f/DO;cOjl < F(x) for every OE N; (iii) ID3 logj/cO;e')()/JOml < H(x); and (iv) sUPoEe(H(x)) < 00. (3) For every i, } = 1,2, ... , k, we have J;j = Ee[(Dlogj/oO;) (Dlogf/cO)J < 00, and the matrix Jo=(J;j) is positive definite for all OEN. Let P(x 1 , ...,xn;w)=suPoEwL(x1, ...,xn;O) for a subset W of Q. Consider testing a hypothesis OEWII against an alternative OEW A , where 0 is an accumulation point of WII. If }, *(x" '" , x n ) =P(Xl' ...,X n ;WII)/P(X 1 , ...,x.;w A ), A* plays essentiaUy the same role as the 1ikelihood ratio A and hence can be used in its place. We call a subset C of Q a cone if OE C implies aOE C for 
400 K Statistical Hypothesis Testing any a> O. A subset W of Q is said to be appro- ximated by C if w satisfies in("Eclix - yll = o(llyll) for all yEw and infyEwllx - yll =o(llxll) for all x E C around the origin, where II x f = L71 xf. Suppose that WH and W A are appro- ximated by two cones C H and C A , respectively. Then, setting zn=,J;; J 0 1 A(x) with [ 1  <3 logf(x" 0) A(x)= -L, , , n F 1 c0 1 ". 'at OIOg(;kx", 0) ]', the limit distribution of - 210g A'*, when 0 =0, coincides with that ofinfoEc)zn-O)'Jo(zn -O)-infoEc)zn-O)'Jo(zn-O). In particular, when Q= R k and WH = {(O?,..., O? Os +1 , ..., Ok) I -00 < OJ < 00, i =s + 1,..., k} and some regular- ity conditions are assumed, the limit distri- bution of - 210g}.'* is the t x 2 -distribution with s degrees of freedom if the hypothesis is true. The asymptotic behavior of the chi-square test of goodness of fit is also very important. Suppose that (Xl' ..., X k ) has a multinomial distribution n!(x 1 ! ". Xk!)-l pf' ... p:k(Ll X j = n, Xi? 0), and consider testing H : P 1 = p?, .. ,Pk = p. The chi-square test of goodness of fit has a critical region of the type {xlx 2 (X 1 , .. ,xk)?c}, where X 2 (X 1 , ...,x k ) is LI«Xj- np?)2/ np ?), i.e., the weighted sum of the squares of the differences between the value p? of Pi and the maximum likelihood estimator x;/n of pj. The limit distribution of X 2 (x 1, ... , Xk) when Pi = p?, i = 1,2, ... ,k, is the chi-square distribution with k - 1 degrees of freedom. Suppose moreover that k functions p;(O) (i= 1,..., k; s <k) of 0 (ERS) are given and that the hypothesis to be tested is that the sample has been drawn from a population having a distribution determined by H:Pi=Pi(O) (i= 1, ... , k; s < k) for some value of O. In this case the chi-square test of goodness of fit could be applied after replacing the parameter 0 in Pi = p;(O) (i = 1,2, ... ,n) by the solutions en(x l' ... ,xd of the system of equations of the modified minimum chi-square method, t xj-nf!..i0Pi =O i1 Pi oOj (j= 1, ..., s). Suppose that (1) pj(O) > c 2 > 0 (i = 1, ''', k) and L=1 p;(O) = 1; (2) p;(O) is twice continuously differentiable with respect to the coordinates of 0; and (3) the rank of the matrix (op;/oOj) is k. Then the system of equations above has a unique solution 0 = On(x l' ..., Xk), and en converges in probability to 0 0 when 0 = 0 0 , The asymptotic distribution of X2(X) = L=1 «Xj- np;(On))2 /npi(o,,)) is the chi-square distribution with n - s - I de!!:rees of freedom 1506 [5]. For the test of goodness of fit, the em- pirical distribution function may also be used ( 371 Robust and Nonparametric Methods). A test of independence by contingency tables is one application of the chi-square test of goodness of fit. We suppose that n individ- uals are classified according to two categories A and B, where A has r ranks Aj, A 2 , ..., Ar and B has s ranks B1' B2' ..., Bs. Let pj 'P'j' Pij be the probabilities that the observed value of an individual belongs to Ai' Bj, Ai n Bj, re- spectively. Let Xi ,x j' x ij be the numbers of individuals belonging to Aj, Bj' and Ai n Bj' respectively. Table 2 is called a contingency table. To test the null hypothesis H that the divisions of A and B into their ranks are inde- pendent, that is, H: Pij = pj P j' the statistic '/ = L;=l Ljl (Xij- xj.x)n)2 /(x j X )n) is ap- plied. When H is true, X 2 is asymptotically distributed according to the chi-square distri- bution with (r - 1)(s - 1) degrees of freedom as n-+ 00. Likelihood ratio tests and chi-square tests of goodness of fit are consistent tests under con- ditions stated in their respective descriptions. In general, there are many consistent tests for a problem. Therefore it is necessary to con- sider another criterion that has to be satisfied by the best test among consistent tests. Pit- man's asymptotic relative efficiency is such a criterion. Other notions of efficiency have also been introduced. A completely specified form of distribution is rather exceptional in applications. More often we encounter cases where distribution of the sample belongs in a large domain. Various tests independent of the functional form of distribution have been proposed, and the asymptotic theory plays an important role in those cases ( 371 Robust and Nonparamet- ric Methods). The following concept of asymptotic effi- ciency is due to R. R. Bahadur [11]: Let {T,J be a sequence of real-valued statistics defined on :!(n). {T,,} is said to be a standard sequence (for testing H) if the following three conditions are satisfied. (I) There exists a continuous probability distribution function F such that for each OEW H , limnoo pJn){T,,< t} =F(t) for every tER l . Table 2. Contingency Table Bl B 2 B, Total AI x ll X 12 XIS XI A 2 X 21 X 22 X2, x 2 A, x n x r2 x rs X r Total XI X 2 X, n 
1507 (II) There exists a constant a, 0< a < 00, such that 10g{1-F(t)}= -(at 2 /2){1 +o(1)} as t->oo. (III) There exists a function b(O) on Q-w H with O<b(O)< 00 such that for each OEQ-WH' !im PJn J {I(T,,/n I/2 )-b(O)1 > t} =0 noo for every t > O. Suppose that {T,,} is a standard sequence. Then T" has the asymptotic distribution F if H is satisfied, but otherwise T,,-> 00 in proba- bility. Consequently, large values of T,. are significant when T" is regarded as a test statis- tic for H. Accordingly, for any given x EP[(n), 1 - F(T,,(x)) is called the critical level in terms of T", and is regarded as a random variable defined on Pl'(n) [1]. It is convenient to describe the behavior of this random variable as n-> 00 in terms of Kn' where Kn(x)= -210g[l- F(T,,(x))]. Then for each OEW H , Kn is asymp- totically distributed as a chi-square vari- able X with 2 degrees of freedom and for OE Q-w H , Kn/n->abz(O) in probability as n->oo. The asymptotic slope of the test based on {T,,} (or simply the slope of {T,,}) is defined to be c(O)=ab 2 (O). Note that the statistic K/z is equivalent to T" in the following technical sense: (i) {K/2} is a standard sequence; (ii) for each OEQ, the slope of {K/2) equals that of {Tn}; and (iii) for any given n and x, the critical level in terms of K/2 equals the critical level m terms of T". Since the critical level of K /2 is found by substituting K/2 into the function representing the upper tail of a fixed distribu- tion independent of F, {K/Z} is a normalized version of {Tn}. Suppose that {T,,(I)} and {T,,(2 J } are two standard sequences defined on p[(n), and let F(iJ(X), ai, and bj(O) be the functions and constants prescribed by conditions (I)-(I1I) for i = 1, 2. Consider an arbitrary but fixed 0 in Q - w H , and suppose that x is distributed according to Pe. The asymptotic efficiency of {T,,(I)} relative to {T,,(2 J } is defined to be 'PdO) = cd O )/c 2 (O), where cj(O)=aN(O) is the slope of {T,,(i)}, i= 1,2. The asymptotic effi- ciency is called Bahadur efficiency. Several comparisons of standard sequences are given in [11]. The relationship between Bahadur efficiency and Pitman efficiency for hypothesis-testing problems has also been studied. Under suitable conditions the two efficiencies coincide. L. Sequential Tests Let Xj, X z ,... be a given sequence of random variables. To test a hypothesis concerning the distributions of these variables (sample sizes are not predetermined), we observe first Xj, 400 Ref. Statistical Hypothesis Testing then X 2 , etc. At each stage a decision is made on the basis of the previously obtained data whether the observation should be stopped and a judgment made on the acceptability of the hypothesis. Such a test is called a sequen- tial test. Let Xl' X z ,... be independent and identically distributed by le(x). For testing a simple hypothesis H: 0 = 0 against a simple alternative A:O= 1, we have the sequential probability ratio test: Let Gn(x j, X2, ... ,x n ) = I1i1 11 (XJ/I1i1 lo(x/), and preassign two constants a o < a 1 . After the observations of XI' ... , X n are performed, the next random variable X n +1 is observed if a o < Gn(x l' ... , X n ) < a 1 . Otherwise the experiment is stopped, and we accept H when G n  a o or accept A when a 1  G n . The constants a o and a 1 are determined by the desired probabilities ()(1 and ()(2 of errors of the first and second kind, re- spectively. It is known that, among the class of sequential tests in which the probability of error of the first (second) kind is not greater than ()(o (Q(t), the sequential probability ratio test minimizes the expected number of obser- vations when either H or A is true ( 398 Sta- tistical Decision Functions; 404 Statistical Quality Control). References [1] E. L. Lehmann, Testing statistical hypoth- eses, Wiley, 1959. [2] A. Wald, Sequential analysis, Wiley, 1947. [3] H. Chernoff, On the distribution of the likelihood ratio, Ann. Math. Statist., 25 (1954), 573-578. [4] W. Hoeffding and J. Wolfowitz, Distin- guishability of sets of distributions, Ann. Math. Statist., 29 (1958), 700-718. [5] M. G. Kendall and A. Stuart, The ad- vanced theory of statistics II, Griffin, 1961. [6] S. Karlin, Decision theory for P6lya type distributions, Case of two actions I, Proc. 3rd Berkeley Symp. Math. Stat. Prob. I, Univ. of California Press (1956),115-128. [7] A. Birnbaum, Characterizations of com- plete classes of tests of some multiparametric hypotheses, with applications to likelihood ratio tests, Ann. Math. Statist., 26 (1955), 21- 36. [8] H. Cramer, Mathematical methods of statistics, Princeton Univ. Press, 1946. [9] K. Takeuchi and M. Akahira, Characteri- zations of prediction sufficiency (adequacy) in terms of risk functions, Ann. Statist., 3 (1975), 1018-1024. [10] J. Pfanzagl, A characterization of suffi- ciency by power functions, Metrika, 21 (1974), 197-199. 
401 A Statistical Inference [IIJ R. R. Bahadur, Stochastic comparison of tests, Ann. Math. Statist., 31 (1960),276-295. 401 (XVIII.2) Statistical Inference A. The Statistical Model Broadly and loosely speaking, the term "statis- tical inference" may imply any procedure for drawing conclusions from statistical data. But now it is usually understood more rigorously to mean those procedures based upon a tproba- bilistic model of the data to obtain conclusions concerning the unknown parameters of the population that represents the probabilistic model by viewing the observed data as a t ran - dom sample extracted from the population. As the simplest example, suppose that, for some system, we have a number of observa- tions from repeated measurements or experi- ments under a supposedly umform condition. If we can assume that there are no systematic trcnds or tendencies involved, we can suppose that the variations among repeated observa- tions are due to random causes and assume that the observed values XI, X 2 ,... are inde- pendently and identically distributed random variables. Our purpose in making observations is to draw some information from the data, that is, to make some Judgment on an unknown system quantity 0, which together with some other quantity (quantities) I] characterizing the measurement or the experiment, determines the distribution of the X;. We assume that the distribution has a density function j (x; 0, 1]). This amounts to assuming that the observed values Xl, Xl,'" are a sample randomly drawn from a hypothetical population of the results of the measurements or experiments supposedly continued indefinitely. Then the problem of statistical inference is one of mak- ing some judgment based on the random sample. The set of hypotheses postulating the distribution of the observed values is called the probabilistic model of the observations, and the problem of determining a model in a spe- cific situation is called that of specification. B. Bayesian and Non-Bayesian Approaches There are two different ways to make infer- ences on the population parameters: the Baye- sian approach and the non-Bayesian approach. In the Bayesian approach it is assumed that we have some probability density n(O, 1]) for 1508 the parameters () and 1]. Then, given the ob- servations XI =Xj, X 2 =x 2 , ..., the conditional probability density for 0 and I] is given by - o [(Xj;O,I]) iT(O,ll) P(O, I] I x I, Xl'''') - J J flf(x;; 0, 1]) n (0, -;]) dO dl] . n is called the prior density and p the posterior density for the parameters. Then all the in- formation obtained from the sample is consid- ered to be contained in the posterior distri- bution with the density fi(O, 1]), and conclusions on the parameters can be drawn from it. The prior density n(O, 1]) does nor necessarily represent a frequency function of a population of which the parameters are a random sample, but in most cases treated by the Bayesian approach it is considered to be a summary of the statistician's Judgment over relative possi- bilities of the different values of the parameters based on all the information obtained before the observations are made. Bayesians claim that it is always possible to determine such a prior distribution in a coherent way, specifying the subjective probability, representing a per- son's judgment under uncertainty, as opposed to the objective probability, representing the re- lative frequencies in a population. L. J. Savage [7J succeeded in developing a formal mathe- matical theory of the subjective probability from a set of postulates about the consistent behavior of a person under uncertainty. The non-Bayesian statisticians, however, do not accept the Bayesians' viewpoint and insist that statistical inference should be free from any subjective judgment and be based solely on the objective properties of the sample de- rived from the assumed model. The theory developed by R. A. Fisher, J. Neyman, and E. S. Pearson, and others is based on the non- Baycsian approach. C. Problems of Non-Bayesian Inference The most commonly used forms of statistical inference are point estimation, used when we want to get a value as the estimate for the parameter; interval estimation, when we want to get an interval that contains the true value of the parameter with a probability not smaller than the preassigned level; and hypoth- esis testing, when it is required to determine whether or not some hypothesis about the parameter values is wrong ( 399 Statis- tical Estimation, 400 Statistical Hypothesis Testing). In any type of statistical inference, the prob- lem can be abstractly formulated by deter- milling a procedure that defines a rule, based on the sample observed, for choosing an ele- ment from the set of possible conclusions. 
1509 Such a procedure is evaluated by the probabil- istic properties derived under different values of the parameter from the distribution of the sample, and it is usually required to satisfy some type of validity criteria (such as un- biasedness of an estimator, size of a test, etc.), and among those satisfying them. one which is considered to be best according to some optimality criterion (such as minimum var- iance or most-powerfulness) is looked for. But in the sense of objective probability, the prob- abilistic property of a procedure is relevant only for the frequencIes in repeated trials when the same procedure is applied to a sequence of samples obtained from the population and has no direct implications for the conclusion ob- tained by applying the procedure to a spe- cific sample we have in hand. For this reason Neyman argued that in statistical inference there is really no such thing as inductive in- ference but only inductive behavior. Fisher dIsagreed strongly with this argument and emphasized that statistical analysis is induc- tion and that its purpose is to allow us to draw the proper conclusions from a particular sample and that the probabilIstic properties of the procedure should and could have rele- vance for a particular conclusion obtained from a specific sample, provided that all the in- formation contained in the sample is used. The arguments between Fisher and Neyman led to a heated controversy between their followers that is still not completely settled Fisher's arguments lead to the principle of sufficiency and the principle of conditionality. The prin- ciple of sufficiency dictates that all inferences should be based on a sufficient statistic if there is one, and the principle of conditionality re- quires that any inference should be based on the conditional distribution given the ancillary statistic, i.e., a statistic whose distribution is independent of the parameter, if there is slIch a statistic. These two principles are accepted by many statisticians who do not follow all of Fisher's arguments, though the principle of conditionality sometimes leads to difficulties due to nonuniqueness of the ancillary. D. Specification Problem It is often difficult and sometimes impossible to have an exactly correct model for the data, and we must be satisfied with a model that gives a sufficiently close approximation and is mathematically tractable as well. It may also happen, however, that a model first specified may be far from reality and could lead to erroneous conclusions if relied on blindly. Here, the problem of model selection arises ( 401 E Statistical Inference 403 Statistical Models), i.e., choosing the best of various possible models. We may also seek procedures that are little affected by the departure of the distribu- tion of the data from the assumed or some other model that satisfies the condition of validity without any assumption about the exact shape of the distributions ( 371 Robust and Nonparametric Methods). Generally, the problem of determining the model or specifi- cation should not be dealt with by mathemat- ical methods alone, and it should be consid- ered by taking into account the properties and nature of the subject under considera- tion and also the process of measurement or experimentation. E. History The first appearance of statistical inference as a method of grasping numerical characteris- tics of a collective was seen in the study by J. Graunt (1662) of the number of people who died in London. W. Petty applied Graunt's method further to the comparison of commu- nities in his Political arithmetic (1690). 1. P. Siissmilch, a member of the Graunt school, perceived the regularity in mass observations and stressed the statistical importance of this regularity. The development of the theory of probability inevitably affected the theory of statistical inference. The method of T. Bayes was the first procedure of statistical inference in the current meaning of this expression We now have a theorem bearing his name (the Bayes theorem), which is stated in current language as follows: If we know the proba- bility PdE) that a cause C produces an effect E and if the prior (or t a priori) probability P(C) of the existence of the cause C is also known, then the posterior (or t a posteriori) probability of C, given an effect E, is equal to P(C)PdE) PE(C)=Ic P(c)Pd E ) ( 342 Probability Theory F). This theorem, easily extendable to the continuous case, sug- gests the following inference procedure: If we are informed that an effect E has taken place, then we calculate the probabilities PE(C) for every cause C, compare them, and infer that the C* with PE(C*)=maxCPE(C) is the most probable cause of E. Both P. S. Laplace and C F. Gauss dis- cussed the theory of estimation of parameters ( 399 Statistical Estimation) as an application of the Bayes theorem. In his research, Laplace considered a monotone function W(lt- 01), and W=lt-OI in particular, of the distance It-Ol between a parameter value 0 and its 
401 F Statistical Inference estimate t as a measure of significance of the error of the estimate t. Gauss, following La- place, used this weight function W(lt - 81) of error, and going beyond Laplace, realized that it would be mathematically fruitful to put W(lt - 81) =(t - 8)2. Such considerations led him to the study of the tleast squares method, in which the terminology and notation he de- vised are still in use. He also developed the theory of errors and recognized the impor- tance of the normal distribution and found that the least square estimate is equal to the most probable value if the errors are normally distributed. F. Galton, a biologist, revealed the useful- ness of statistical methods in biological re- search and explored what we call tregression analysis ( 403 Statistical Models) by intro- ducing the concepts of regression line and tcorrelation coefficient. His research on re- gression analysis originated from the study of the correlation between characteristics of parents and children, but he failed to realize the difference between tpopulation character- istics and tsample characteristics. Fol1owing Galton, K. Pearson developed the theory of regression and correlation, with which he succeeded in establishing the basis of biometrics ( 40 Biometrics). He arrived at the concept of population in statistics: A sta- tistical population is a collecti ve consisting of observable individuals, while a sample is a set of individuals drawn out of the population and containing something telling us about charac- teristics of the population. Thus statistical research is regarded as investigation that focuses not on a sample as such but on a population from which the sample has been drawn. This consideration raised the problem of the goodness-of-fit test ( 400 Statistical Hypothesis Testing), that is, the problem of knowing whether a sample is likely to have been drawn from a population whose distri- bution was determined by theoretical con- siderations. K. Pearson characterized some population distributions occurring in practice by a differential equation, and classified them into several types. Using this classification, he discussed goodness-of-fit tests and developed the X2-distribution (tchi-square distribution) in relation to the problem of testing hypotheses. Statisticians in the time of K. Pearson thought of a population as a collective having infinitely many individuals (i.e., an infinite population), which led to the idea that the larger the size of a sample (i.e., the number of individuals in the sample), the more precisely could the sample give information about the population. They carried out inferences, in- cluding the testing of hypotheses (  400 Sta- tistical Hypothesis Testing), by approximate 1510 methods, which later came to be termed large sample theory. Suppose, for instance, that {Xl' ... , X n} is an tindependent sample of size n from a normal population N (fl., ,,2). The random variable Z = vIn(X - fl.o)/t) with X = L; X;/n is distributed according to N (0, 1) when fl. = fl.o. Therefore, if the size n is suffi- ciently large (n--> 00), we estimate (r by 6' = {L;(X j -X)2/(n-I)}1/2 and deal with the random variable T= vinci - fl.oWt obtained by inserting 6' in place of (J in the expression for Z, as if T itself were distributed according to N(O, 1). F. Development in the 20th Century W. S. Gosset, writing under the pen name "Student," reported in 1908 the discovery of the exact distribution of T and th(:reby opened the new epoch of exact sampling theory ( 374 Sampling Distributions). This work of Student made it possible to perform statistical in- ference by means of small samples and conse- quently changed statistical research from the study of collectives to that of uncertain phe- nomena; in other words, the concept of popula- tion was once again related to a tprobability space with a tprobability distribution (i.e., a population distribution) containing unknown parameters. Thus it began to be emphasized that a sample has to be drawn at random (i.e., a random sample) from the population if we are to make an inference about a parameter based on the sample. Fisher presented a complete derivation, using the multiple integration method, of the tt-distribution (the sampling distribution of T). In addition, Fisher introduced the con- cepts of tnull hypothesis and significance test, which were the starting points for later pro- gress in the theory of hypothesis testing. He also added the concepts of tconsistency, teffi- ciency, and tsufficiency to the list of possible properties of testimators, and he studied the connection between the information contained in a sample and the accuracy of an estimator, which led to the idea of amoUnt of informa- tion. Fisher also proposed the tmaximum likelihood estimator, which is formally equi- valent to the most probable value, but he renamed it and gave it a foundation com- pletely independent of any prior information and showed that it leads to the at least asymp- totical1y efficient estimator. Fisher made efforts to obtain a distribution of the parameter directly from the sample observation, hence independently of the con- cept of prior probability. He sought in this way to be released from the weakness of the Bayes method. For this purpose he introduced 
1511 the concept of fiducial distribution, which was the subject of bitter controversy in the period that followed As an example of a fiducial distribution, we consider here the tBehrens- Fisher problem: Let XI' ..., X m and Yr."" Y" be samples drawn independently from the populations N(111' (JD and N(112' (JD, respec- tively, where the parameters 111, 112, (J1' and (J2 are all unknown. The problem raised is to test the hypothesis 111 = 112 or to estimate b = 111 - 112 by an interval. To solve this problem, we put X=IX;/m, Y=I /n, i j 5?= I(X i -X)2/(m-1), i 5=I(- y)2/(n-1), j and learn that T 1 = j;(X - I1rJ/5 1 and T 2 = (Y - 112)/5 2 are mutually independent and distributed according to the t-distribution with degrees offreedom m-l and n-l, re- spectively. From this fact Fisher reasoned as follows: Given observed values x, y, SI' S2 of the variables X, Y, 5 1 ,5 2 , the distributions of the parameters 111 and 112 are induced from the distributions of T 1 and T 2 by means of transformations  T 1 s 1 111 =x- j; '  T 2 S 2 112 = Y -  . Consequently the distribution of b=x- y- (T 1 sr/j; - T2S2/) is obtained. These distributions are called the fiducial distribu- tions of the parameters 11r. 112, and b. The intervallb - (x - y)1 < c of b deduced from the fiducial distribution of b is called a fiducial interval of b. Neyman and E. S. Pearson developed a mathematical theory of testing hypotheses, in which they deliberately defined a family of population distributions admissible for formal treatment and considered alternative hypoth- eses within the family. They proposed to relate a test to its tpower function, on the basis of which the test would be judged. Their ideas brought mathematical clarity to the theory of inference. Furthermore, concerning interval estimation, Neyman devised an alternative to the fiducial interval, the tconfidence interval, which has full mathematical justification. Unfortunately it was later found that the confidence interval, fiducial interval, and the Bayes posterior interval based on the posterior distribution often gave distinctly different results to the same problem, which became a source of controversy among different schools of thought. Since the publication of A. Wald's theory of statistical decision functions ( 398 Statistical 401 G Statistical Inference Decision Functions) in 1939, there has been a steady increase in its importance. In this theory the totality !'fi of available statistical procedures, which is considered implicitly in the Neyman-Pearson theory, is put forth ex- plicitly as a set and defined as the space of decision functions. Wald also defined the trisk function of a statistical decision procedure and used it as a basis for judging procedures. In addition, he employed the concept of prior probability and the Bayes procedure for the purpose of proving the tcomplete class theo- rem. Wald's idea of bringing the concept of prior probability back into statistical theory carried a great deal of weight, and much litera- ture has now been accumulated on this sub- ject. Prior probability as a technique in statis- tics was abandoned after Fisher's introduction of the maximum likelihood method indepen- dent of prior probability and Neyman's assertion that a probability distribution on the tparameter space made no sense. In addition, Wald linked statistical inference to games (  173 Game Theory) and introduced the tmini- max principle into statistics. The decision- theoretic setup also enabled him to develop a theory of sequential analysis by comparing the cost of sampling with the risk of erro- neous decisions ( 400 Statistical Hypothesis Testing). After the publication of Savage's book in 1954, there was a revival of the Bayesian ap- proach, i.e., one based On the concept of sub- jective probability, and now the group of those statisticians who accept the Bayesian approach are called Bayesians or neo-Bayesians. G. Applications Methods of statistical inference are applied in many fields where statistical data are used for scientific, engineering, medical, or managerial purposes. Methods of producing data that are appropriate for statistical inference have also been developed. R. A. Fisher developed the method of statistical tdesign of experi- ments ( 102 Design of Experiments) that when it is impossible or impractical to elimi- nate completely experimental errors or vari- abilities, provides the procedures to obtain such data. These data, though subject to ran- dom errors, are susceptible to rigorous statis- tical inference. For this purpose Fisher intro- duced the principles of trandomization, tlocal control, and treplication in the design of experi- ments. W. A. Shew hart defined the tstate of statistical control in mass-production pro- cesses, where the variabilities of the products can be considered to be due to chance causes alone and hence are statistically analyzable. 
401 Ref. Statistical Inference Applying the idea of statistical inference to this situation, Shew hart established the method of statistical quality control ( 404 Statistical Quality Control). Neyman introduced the method of trandom sampling into statistical surveys and developed the theory of estima- tion and allocation based on the theory of statistical inference ( 373 Sample Survey). In many applied fields there exist systems of statistical methods which have been devel- oped specifically for the respective fields, and although all of them are based essentially on the same general principles of statistical in- ference, each has its own special techniques and procedures. Specific names have been invented, such as biometrics ( 40 Biometrics) econometrics ( 128 Econometrics), psycho- metrics ( 346 Psychometrics), technometrics, sociometrics, etc. References [IJ R. A. Fisher, Statistical methods for re- search workers, Oliver & Boyd, 1925; eleventh edition, 1950. [2J R. A. Fisher, The design of experiments, Oliver & Boyd, 1935; fourth edition, 1947. [3J R. A. Fisher, Contributions to mathemat- ical statistics, Wiley, 1950. [4J R. A. Fisher Statistical methods and scien- tific inference, Oliver & Boyd, 1956. [5J 1. Neyman and E. S. Pearson, Contri- butions to the theory of testing statistical hypotheses I, II, Statist. Res. Mem. London Univ., 1 (1936), 137; 2 (1938), 2557. [6J A. Wald, Statistical decision functions, Wiley, 1950 (Chelsea, 1971). [7J L. J. Savage, The foundation of statistical inference, Methuen, 1962. [8J C. R. Rao, Linear statistical inference and its applications, Wiley, 1965. [9J D. A. Fraser, The structure of inference, Wiley, 1968. [IOJ 0 V. Lindley, Introduction to proba- bility and statistics from a Bayesian viewpoint I, II, Cambridge Univ. Press, 1965. 402 (XX.19) Statistical Mechanics A. General Remarks One cubic centimeter of water contains about 3 x J022 water molecules. A macroscopic sys- tem of matter thus consists of an enormous number of particles incessantly moving in accordance with the laws of dynamics ( 271 1512 Mechanics; 351 Quantum Mechanics). Dy- namical description of such microscopic mo- tion in full detail is impossible and even mean- ingless. A physical process in thermodynamics or thydrodynamics is described in terms of a relatively small number of macroscopic vari- ables, such as temperature, pressure, and a velocity field. Such a process shows a remark- able simplicity which is a statistical result of the molecular chaos. This is the reason why statistical mechanics is needed as a theoretical model to unify microscopic dynamics and +probability theory. Thus statistical mechanics aims at deriving physical laws in the macro- scopic world from the atomistic structures of the microscopic world On the basis of micro- scopic dynamical laws and probabilistic laws. Its function is twofold. First, statis1ical me- chanics should give microscopic proofs of the macroscopic laws of physics, such as those of thermodynamics or the laws of macroscopic electromagnetism. Second, it should also pro- vide us with detailed knowledge of physical properties of a given material system once its microscopic structure is known. In this sense, statistical mechanics is an essential basis of the modern science of materials. Strictly speaking, the dynamics of the microscopic world obeys +quantum mechanics. However, even before the birth of quantum mechanics, statistical mechanics had pro- gressed on the basis of classical mechanics. This stage of statistical mechanics is often called classical statistical mechanics, in contrast to quantum statistical mechanics based on quan- tum mechanics. Statistical mechanics has a fully developed formalism to apply to physical systems in thermal equilibrium. This is some- times called statistical thermodynamics or equilibrium statistical mechanics. Until the 1950s the term "statistical mechanics" had often been used in this narrow sense. In a wider sense it is concerned with systems m more general states, for instance, in nonequilibrium states. In the modern literature, a general statistical mechanical theory of nonequilib- rium systems is often referred to as the statis- tical mecha nics of irreversible processes B. History The early stage of statistical mechanics can be traced back to the kinetic theory of gases, which started in the 18th century. I n dilute gases, gas molecules fly freely through the whole volume of the vessel and collide only from time to time. In thermal equilibrium, the average energy of each molecule is determined by the temperature of the gas; namely m v;/2=m v;/2=m v;/2=kT/2, 
1513 where (v x , v y ' v z ) is the velocity, an overbar means the average, m is the mass of a mole- cule, T is the absolute temperature, and k is the Boltzmann constant ( = 1.38 x 10 -16 erg' deg -1). The velocity of each molecule is only probabilistic, and a tdistribution function nvx, v y ' v z ) is defined as the tprobability den- sity that the velocity of a given molecule is found to be In the neighborhood of (v x ' v y , vJ In a dilute gas, this is given by f(Vx'Vy,vz)=cexpl-I(V+vi+V;)/kT l (1) the Maxwell-Boltzmann distribution law. L. Boltzmann viewed the velocity distri- bution function as changing in time as a result of molecular collisions and gave an equation of the form iJf -;;- = A [f] + ref], ct where A [n is the change of the distribution function r by acceleration due to the presence of external forces and f[ fJ is the change caused by molecular collisions. f[fJ is an integral which is nonlinear in f. This type of equation is called a Boltzmann equation [1, 2J Boltzmann introduced the H-function by the definition H = f f f flogf dvxdvydv z and proved on the basis of equation (2) that dH / dt  0 This theorem is known as the H- theorem [I 4]. The equilibrium distribution (1) is therefore obtained from equation (2) as the solution that makes H a minimum. In fact the H-function is related to the entropy S by S= -kH. Boltzmann further showed (1877) that the distribution function of a system in thermal equilibrium can be obtained on more general grounds without relying on a kinetic equa- tion of the type (2) and that the statistical mechanics of systems in equilibrium can thus be constructed on a basis much more general than that given by a kinetic theory. It was W. Gibbs, however, who clearly established (1902) the complete framework of statistical thermo- dynamics, although he had to confine himself to classical statistical mechanics [5J C. The Ergodic Hypothesis For a given dynamical system with n tdegrees of freedom, the phase space is defined as a 2n- dimensional space with tgeneralized coordi- nates q" ... , qn and tgeneralized momenta 402 D Statistical Mechanics (2) PI' ..., Pn' Dynamical states of the system constitute a set of points in this space. At a given time, the state of the system is repre- sented by a point P in the phase space, and hence the motion of the system is represented by the motion of P. If the system is conserva- tive, its energy function is constant. Let ;Y{' be the tHamiltonian function. Then the motIOn of P is confined to an energy surface defined by the condition ,lie = E = constant. Measure on an energy surface is defined as the limit of the volume element lying between two neigh- boring energy surfaces corresponding to the energies E and E + dE. The motions of P form a ttopological group that makes this measure invariant (tLiouville's theorem). A dynamical quantity A(p,q) of the system changes its value as the phase point P moves on the energy surface. The time average A of A is identified with the value of A observed in the equilibrium state of the system, namely, the average of A with respect to the invariant measure. Boltzmann justified this assumption by the following reasoning. If the energy sur- face has a finite measure and the trajectory of P does not make a closed curve on the energy surface, it can be assumed that the trajectory will move around practically everywhere on the surface. Mathematically formulated, the only measurable subset of the surface that has a nonzero measure and is invariant under the motion is the whole surface. This assumption is the ergodic hypothesis [69J The long-time average of A will then equal the average of A over the entire energy surface with weight function equal to the measure previously in- troduced. The latter average IS called the phase average and is denoted by (A). Boltzmann thus asserted that (3) (4) A = (A). (5) Efforts of mathematicians to study the ergodic hypothesis created an important branch of mathematics called ergodic theory ( 136 Ergodic Theory). D. Ensembles in Classical Statistical Mechanics Once we admit the ergodic hypothesis, or more specifically the assumption (5), the cal- culation of the observed value of a physical quantity A for a system in equilibrium is re- duced to finding the phase average of A on an energy surface. The task of statistical me- chanics of systems in equilibrium is thus re- duced essentially to calculating phase averages and establishing relationships between them [I o 13]. For a set (called an ensemble in this case) of 
402 E Statistical Mechanics identical systems with the same energy, we consider the phase average for the tprobability space with the measure mentioned in Section C on the energy surface corresponding to the given energy value. Gibbs called a probability space of this kind a microcanonical ensemble. An average in this probability space is defined by A I AdS II dS ( )= .ffE Igrad ,Jrlff=E 1 grad £'1' where£' is the Hamiltonian function, grad£' is its gradient in the 2n-dimensional phase space, and the integration is carried over the energy surface with dS as surface element. When the observed system is in mechanical contact with a heat reservoir, the composite system consisting of the system and the heat reservoir is regarded as an isolated system with constant energy. Then an ensemble of the composite systems is treated as a microcanon- ical ensemble. It is more convenient and more physical, however, to consider the heat re- servoir simply as providing an environment characterized by its temperature T, and to concentrate only on the system in which we are interested. Then the system is no longer isolated and exchanges energy with its en- vironment. Since the energy of the system is no longer constant, the system will be found in any part of the phase space with a certain probability. To find the probability distri- bution for an ensemble of this system is a problem of asymptotic evaluation which is solved on the basis of the ergodic hypothesis and the fact that a heat reservoir has an ex- tremely large number of degrees of freedom. This asymptotic evaluation is traditionally done with the help of tStirling's formula or by using the Fowler-Darwin method [IOJ, but it is essentially based on the tcentrallimit theorem [11]. The probability space of this kind of en- semble of systems in contact with heat re- servoirs was called a canonical ensemble by Gibbs [5]. If df is a volume element of the phase space of the system, the probability of finding a system arbitrarily chosen from the ensemble in a volume element df is given by Pr(df) = Cexp( -£'jkT)df. Accordingly, the average of a dynamical quan- tity A is given by (A) = f Ae- Yf Ik1 df If e- YfjkT df. For example, the average energy is (£')=E= f ffe-YflkTdfl fe-YflkTdf. 1514 By a traditional convention we introduce the parameter (J= IjkT and write (9) as E= -iJlogZ(fJ)jiJ{J, (6) where Z(fJ) is called the partition function or the sum over states and is given for a system composed of N identical particles by Z(fJ)= f e-p.Yf dfjN!. (10) If an exchange of particles with t he environ- ment takes place in addition to an exchange of energy, the probability of finding a system with particle number N in the volume element d f is given by Cexp( - {J£' + {Jf1N)df jN!, where f1 is a real parameter called t he chemical potential; this characterizes the environment with regard to the exchange of particles. This ensemble is called the grand canonical en- semble. The average of a dynamical quantity A is then given by (A)=N!-1 f ANe-PJ{'(N)+PNdfjB({J,f1), (11) where the dependence of A and£' on N is now explicitly written, and where B({J, f1) =  N! -, f e-p.Yf(N)+PN df (12) is called the grand partition function. E. Ensembles in Quantum Statistical Mechanics The quantum counterpart of the classical ergodic hypothesis is that to each of these quantum states an equal probability weight should be assigned [10]. A microcanonical ensemble is then defined by this principle of equal weight, which yields in turn (A)=LAdLI I I (13) (7) instead of (6). Here the index 1 refers to the quantum states lying in the interval t:J.E, and Al is the quantum-mechanical expectation of a dynamical variable A in the quantum state I. A canonical ensemble is now defined by assigning (8) Ij= e-PEjjL e -PE j j (14) (9) to the jth quantum state as the probability that the system will be found in that state. The 
1515 expectation value of A must be given by <A) = I Aje -PEj/I e- pEJ = tr Ae- PH Itre -PH, j j where H is the Hamiltonian. The partition function is defined by Z = Ie- PEj = tre- PH , j corresponding to (10). For a system consisting of identical parti- cles, quantum mechanics requires a particular symmetry of its twave function; namely, the wave function must be even or odd with re- spect to permutation of any two particles according as the particles are bosons or fer- mions. This symmetry requirement is peculiar to quantum mechanics. Thus, even for an ideal gas consisting of noninteracting particles, quantum statistics leads to results characteris- tically different from those of classical statis- tical mechanics. This difference becor.1es more significant when the particle mass is smaller, the density is larger, and the temperature is lower. Quantum effects of this kind are seen in metallic electrons, in liquid helium, in an as- sembly of photons or phonons, and in high- density stars. The statistical laws obeyed by bosons are called Bose statistics, and those obeyed by fermions, Fermi statistics. The expectation value of A in the grand canonical ensemble is given in quantum sta- tistics by <A) = B({J, J1) -I tr(Ae -PH+PN), where H is the tsecond-quantized Hamil- tonian, N is the number operator, the trace tr is taken on the (nonrelativistic) tFock space (symmetric or antisymmetric according to Bose or Fermi statistics), and B({J, J1) is the grand partition function gi ven by B({J, J1) = tr ePH+PN F. Many-Body Problems in Statistical Mechanics Since statistical mechanics is primarily con- cerned with systems with large numbers of particles, problems in statistical mechanics are essentially many-body problems. In prac- tice, however, there are some cases where extreme idealization is possible, as in ideal gases, where the interaction between gas mole- cules is ignored. In some cases we can proceed by successive approximation, taking the par- ticle interactions as perturbations. Such per- turbational treatments are, however, entirely useless for some problems, such as phase tran- sitions, of which an example is the conden- sation of gases into liquid states, where the 402G Statistical Mechanics (15) interaction of particles plays a critical role. Such problems are clearly many-body prob- lems. There are a number of important and interesting problems in this category, for example, transitions between ferromagnetic and paramagnetic states and those between the superconducting and normal states of metals. Transition from a high-temperature phase to a low-temperature phase is generally regarded as a conseq uence of the appearance of a certain type of order in thermal motion. This kind of phase change is called an order- disorder transition [14 16]. (16) G. Thermodynamic Limit and Characterization of Equilibrium States (17) Although an actual system is finitely extended, the enormous sizes of the usual macroscopic systems in comparison to the sizes of their constituent particles justifies the idealization to infinitely extended systems. At the same time, there are several mathematical advantages in considering infinitely extended systems, such as the absence of walls (replaced by the bound- ary condition at infinity, should it be relevant), appearance of phase transitions as mathemat- ical discontinuities rather than mathematically smooth though quantitatively sudden changes, and mathematically clear-cut occurrence of broken symmetries. Equilibrium states of infinitely extended systems are usually obtained by taking the limit of the equilibrium states of systems in a finite volume Vas both V and the number of particles N tend to 00 with the density p = N IV fixed; this is called the thermodynamic limit. It is sometimes possible to formulate the dynamics of infinitely extended systems direct- ly and to characterize their equilibrium states, which more or less coincide with the thermo- dynamic limit of equilibrium states of finitely extended systems [17-21]. The simplest and most fully investigated case of lattice spin systems is explained below in detail [17]. Since classical systems can be viewed as special cases of quantum systems, we start with the latter. To be definite, we take a v-dimensional cubic lattice ZV with a lattice site n = (n l , ..., nv) specified by its integer coordinates n j . (In the lattice case, the thermodynamic limit is simply the limit as V -> 00.) The C*-algebra  of observables is gen- erated by the subalgebra n at each lattice site n, which is assumed to be the algebra of all d x d matrices (for example, linear combinations of t Pauli spin matrices O'(n) = (O"n), O"n), O"n)) and the identity for d = 2) and to commute with operators at other lattice sites. The group of lattice translations n->n + a is represented by (18) 
402 G Statistical Mechanics automorphisms Ya ofl, satisfying Yaln=n+a (Ya(J(n) = (J(n+a»). For any subset A of Z', (A) denotes the C*-subalgebra of  generated by Ia' n E A. A model is specified by giving a potential <1> which assigns to each finite non empty subset I of Z' an operator <1>(1) = <1>(1)* E IJI(!). The Hamiltonian for a finite subset A of Z' is given by U (A) = LIeA <1>(1). In order to control long- range interactions, various assumptions are introduced. Examples are finiteness of either of the following: 11<1>11 =sup L N(!) -'11<1>(1)11, n [=m 111<1>III=sup L 11<1>(1)11 n 13n Here N (I) is the number of points in I. Let IJIl be a maxImal Abelian *-subalgebra of n (such as {c[ + c 2 ut)}) satisfying }'a [ = 1J(a and IC[ be the Abelian C*-subalgebra of  generated by 11, n E Z'. If <1>( I) is in cl for all I, we call the potential <1> Abelian or classical. There exists a conditional expecta- tion n cl which is a positive mapping of norm I from IJI onto lcl satisfying ncl(ABC) = Ancl(B)C for A and C in IJIC[ and n C1 (1)= I. If a state tp on IJI satisfies tp(A)= tp(ncl(A)), we call the state (P classical. Classical states are in one-to-one correspondence with the restriction on cl, which can be viewed as a probability measure on the spectrum (also called configu- ration space) of the C*-algebra 1J1 c1 of obser- vables for classical spin lattice systems. This correspondence makes it possible to view classical spin lattice systems as quantum spin lattice systems with Abelian interactions. For a given potential <1>, the time evolution of the infinitely extended system is described by the one-parameter group cx" tE R, of *- automorphisms of IJI defined as the following limit: cx,(A)= lim eitU(A)Ae-i'U(A) (AEIJI). (21) A'Z' The limit exists if <1>(1) = ° for N(I) > N and 111<1>111 < 00, or if for some Ie> ° Lne}.n(suPxLdll<1>(I)111 I3x, N(I)=n})< 00, or if v = I (I-dimensional lattice) and sUPxLd 11<1>(1)111 In( -00, x)i" 0, In [x, oo)i" 0} < CXJ. An alternative way is first to de- fine b(j)(A) = LA<1>(I), A] for AE UA IJI(A) (A is a finite subset of Z'), which exists if 111<1>111 < Cf), and to prove that the closure ;5(j) of b(j) is a generator of a one-parameter sub- group cx, ( = exp t;5(j»)' In the above cases, ;5(j) is a generator. A general canonical ensemble for a system in a finite subset A of the lattice Z" with some boundary condition in the outside N = Z'\A, is given by tpA(A) = ("fA @ I/J)(e-/JH(A) x 1516 (\ 9) Ae- PH (A)/2), where "fA @ I/J is the product of the unique tracial state "fA on (A) and a state I/J on (N) (the boundary condition), H(A)= U(A)+ W(A) and W(A)=L[{<1>(l) InAi"0, I n A C i" 0} (the surface energy). The follow- ing conditions on a state tp of  are mutually equivalent under the condition that ;5(j) is a generator (which holds under anyone of the conditions described above) and is satisfied by any limit state of the above tpA .IS A?' Z' (i.e., a state in nA {tpA 1 A' c A, I/J}, with the bar denoting weak closure). I. KMS condition: tp(Acxjp(B)) = 4J(BA) for any A, BEI such that cx,(B) is an entire func- tion of t. (tp is called a {3-KMS state.) 2. Local thermodynamic stability: For any finite subset !\ of Z' and for any state I/J having the same restriction to (A) as the state (P under consideration, FA.P(tp)FA./j(l/J) (the minimality of the free energy multiplied by j3), where F A . P (tp)={3tp(H(A))-SA(tp), SA(tp) = Jim{SA(tp)-SA'A(tpJ} as A' ?' zv (the open system entropy), SA(tp) = - tp(logpA«(p)) (the closed system entropy) and the density matrix PA(tp)EIJI(A)+ is defined by tp(A) = "fA(PA(tp) A) for all A E (A). 3. Gibbs condition: For every finite subset A of Z', the perturbed state tpPW(A) (not neces- sarily normalized) is the product tpf x I/J of the Gibbs state tpf(A)=tr(e-PU(A)A)jtre-PU(A) on A E(A) and some (unknown) ,tate I/J on (A C), where the representatIve vector <1> for tp for the GNS representation nq> is assumed to be separating for nq>()", and then tpfiW(A)(A) = (0, nq>(A)O) for (20) x 1 [12 I S 1 I S.-I 0= L {3n ds[ ds 2 ... dSnd;nq> n=O 0 0 0 X (W(A))-I-'....  -s'nq>(W(A))<1>, (22) where (j) is the tmodular operator for <1> and the series converges. For a classical potential, this condition re- duces to the conditions that tp is classical and that the restriction of tp to c[ as a measure on the configuration space {I .., df' satisfies the following DLR equation due to R. L. 00- brush in, O. E. Lanford, and D. Ruelle: The conditional probability for (A)E{I...d}" knowing (N)E {I... d}Z"\A is proportional to exp( -{3H(A)), where H(A)= U(.'\) + W(A) is a function of (U(A) depending only on (A)). 4. Roepstorff-Araki-SeweIl inequality: For any A E UA IJI(A), itp(A*b(j)(A)) is real and - i/hp(A *b(j)(A));' S(tp(A * A), tp(AA )), (23) where S(u, v)= ulog(ujv) if u >0, v> 0, S(O, v)= ° for v;,O and S(u,O)= +00 for u>O. 5. Roepstorff-Fannes- V erbeure inequality: 
1517 For IX-entire AElJl, f3- 1 r <p(A*1X1)(A))dA"':;F(<p(A:A),<p(AA*)), (24) where F(u, v) = (u-v)jlog(ujv) for u>O, v>O, u#v, F(u,u)=.u for u>O, F(u,O)= F(O, v)=O. If the interaction is translationally invariant (i.e., '1.<1>(1)=(1>(1 +a) for all aEZ' and l) and if we restrict our attention to translationally invariant states (i.e., !p(y a(A)) = <p(A) for all A E IJl and a E ZV), then the following conditions are also equivalent to the above. 6. Variational principle: fJe(<p)-s(<p)"':;fJe(ljJ) - s(ljJ) for all translationally invariant IjJ (the minimahty of the mean free energy), where e(<p) = iJm N(I\)-1 !p( U(I\)) = hm N(i\)-i <p(H(I\)) (the mean energy), s(<p) = lim N (1\)-1 SA(<P) (the mean entropy), the infimum value fJe(<p)-s(<p) is - P(fJl1» with P(f3I1» = lim N(I\)-llogTA(e -PU(A)) (the pressure), and the limits exist if I\? Z' is taken In the following van Hove sense: For any given cube C of lattice points, the mini- mal number n (C) of translations of C that cover 1\ and the maximal number nA(C) of mutually disjoint translations of C in 1\ satisfy n; (C)jn A (C)--> I as I\? Z'. 7. Tangent to the pressure function: P(I1» is a continuous convex function on the Ba- nach space of translation ally invariant 11> with 1111>11 < CfJ. A continuous linear functional IX on this Banach space is a tangent to P at 11> if P(11) + 'I')?: P(I1» + IX('I') for all '1'. For a trans- lationally invariant state 1jJ, we define IX q ,('I') = IjJ(L1e> o N(I)-I'I'(I)). The condition is that -IX,/, is a tangent to P at fJl1>. (Conversely, any tangent IX to P at fJl1> arises in this manner.) The set Kp of all (normalized) fJ-KMS states is nonempty, compact, and convex. A fJ-KMS state <p is an extremal point of Kp if and only if it is factorial (i.e., the associated von Neumann algebra 7r,/,(IJI)" has a trivial center). It then has the clustering property 1ima.cr {<p(Ay a(B))- <p(A)<P(Ya(B))} =0 and is interpreted as a pure phase. Any fJ-KMS state has a unique integral decomposition into extremal fJ-KMS states. For any 11>, Kp is a one-point set for suffi- ciently smalllfJl. For a I-dimensional system (v = I), K p consists of only one point (uniq ue- ness of equilibrium states usually interpreted as indication of no phase transition) if the sur- face energy W([ - N, NJ) is uniformly bounded (H. Araki, Comm Math. Phys., 44 (1975); A. Kishimoto, Comm. Math. Phys., 47 (1976)). For the two-body interaction 11>( {m, n}) = - Jlm- nl-'rr;m)rr;n), this condition is satisfied if IX> 2 while 1111>11 < CfJ and IX, defined if IX > 1. There is more than one KMS state (with spontaneous magnetization) for 2?: IX > I and large #J >0, and hence a phase transition exists (F. J. 402 G Statistical Mechanics Dyson, Comm. Math. Phys., 12 (1969); 1. Froh- lich and T. Spencer, Comm. Math. Phys., 83 (1982)). If a I-dimensional interaction has a finite range (i.e., 11>(1) = 0 if the diameter of I exceeds some number ro) or if it is classical and LhoN(l)-l(diamI + 1)1111>(1)11 < CfJ for d = 2, then <p(A) for <p E Kp and A E 111: (A) for a finite A is real analytic in fJ and any other analytic parameter in the potential (Araki, Comm. Math. Phys., 14 (1969); [22]; M. Cas- sandro and E. Olivieri, Comm. Math. Phys., 80 (1981 )). For a 2-dimensional Ising model with the nearest-neighbor ferromagnetic interaction [23], K p consists of only one point for 0",:; fJ",:; fJc while Kp for fJ > fJc has exactly two extremal points corresponding to positive and negatiVe magnetizations (M. Aizenman, Comm. Math. Phys., 73 (1980); Y. Higuchi, Colloquia Math. Soc. Hmos Bolyai, 27 (1979)). In this case, all KMS states are translation ally invariant, while there exist (infinitely many) translation- ally noninvariant KMS states for sufficiently large fJ if v = 3 (Dobrushin, Theory Prob. Appl., 17 (1972); H. van Beijeren, Comm. Math. Phys., 40 (1975)). The accumulation points of fJ-KMS states as fJ--> +CfJ (or -CfJ) provide examples of ground (or ceiling) states defined by anyone of the following mutually equivalent conditions 1+,2+ (or 1_,2_) (0. Bratteli, A. Kishimoto, and D. W. Robinson, Comm. Math. Phys., 64 (1978)): 1+ (1_). Positivity (negativity) of energy: For any AEUA I1I:(A), -i<p(A*15(j)(A)) is real and positive (negative). 2+ (2_). Local minimality (maximality) of energy: For any finite subset A of Z' and for any state IjJ with the same restriction to I1I:(N) as the state <p under consideration, <p(H (A))"':; IjJ(H(A)) (<p(H(A))?: IjJ(H(A))). For translationally invariant potentials and states, the following condition is also equiva- lent to the above: 3 + (3 _ ). Global minimality (maximality) of energy: e( <p)"':; e( 1jJ) (e( <p)?: e( 1jJ)) for all trans- lationally invariant states 1jJ. The totality of KMS, ground, and ceiling states can be characterized by the follow- ing formulation of the impossibility of per- petual motion: Let p, = P,* E III be a norm- differentiable function of the time t E R with a compact support, representing (external) time- dependent perturbations. Then there exists a unique perturbed time evolution IX; as a One- parameter family of *-automorphisms of 111: satisfying (djdt)IX;(A) = 1X;(15(j)(A) + i[P" AJ) for all A E III in the domain of 15(j)' A state <p changes with time t as <p,(A) = <p(Tt(A)) under the perturbed dynamics IX;, and the total 
402H Statistical Mechanics energy given to the system (mechanical work performed by the external forces) is given by U(ip) = S ip,(dp'jdt)dt. For KMS states at any 13, as well as ground and ceiling states, U(ip) '? 0 for any P,. If ip is a factor state, the converse holds, i.e., U(ip)'?O for all p'implies that ip is either a KMS, ground, or ceiling state. The condition U(ip) '?O for all p, is equivalent to - iip( U*15(j)( U)) ,?O for all unitary U in the domain of 15(j) and in the identity component of the group of all unitaries of Ill. A state ip satisfying this condition is called passive, and a state ip whose n-fold product with itself as a state on 1ll0 n is passive relative to a,0 n for all n is called completely passive. The last property holds if and only if ip is a KMS, ground, or ceiling state (W. Pusz and S. L. Woronowicz, Comm. Math. Phys., 16 (\970) ). The totality of KMS, ground, and ceiling states can be characterized by a certain sta- bility under perturbations (P, considered above) under some additional condition on '1, (R. Haag, D. Kastler, and E. B. Trych- Pohlmeyer, Comm. Math. Phys., 38 (1974); O. Bratteli, A. Kishimoto, and D. W. Robinson, Comm. Math. Phys., 61 (1978)). When a lattice spin system is interpreted as a lattice gas, an operator NnE 1ll1 (such as (O"n) + 1 )j2) is interpreted as the particle number at the lattice site nand N (A) = LncA N n is the particle number in A. It defines a representa- tion of a unit circle T by automorphisms TO of III defined as To(A) = limeiN(A)O Ae-iN(A)O (A? ZV), called gauge transformations (of the first kind). The grand canonical ensemble can be formulated as a j3-KMS state with respect to a,T, (instead of a,), where the real constant J1 is called the chemical potential. It can be interpreted as an equilibrium state when the gauge-invariant elements {A EIllI To(A)= A} instead of III are taken to be the algebra of observables or as a state stable under those perturbations that do not change the particle number. H. The Boltzmann Equation Statistical mechanics of irreversible processes originated from the kinetic theory of gases. Long ago, Maxwell and Boltzmann tried to calculate viscosity and other physical quanti- ties characterizing gaseous flow in nonequilib- rium. The tBoltzmann equation is generally a nonlinear tintegrodifferential equation. On the basis of this equation mathematical theories were developed by D. Enskog, S. Chapman, and D. Hilbert [2]. Free electrons in a metal can be regarded 1518 as forming an electron gas, in which electron scattering by lattice vibrations or by impurities is more important than electron-electron scat- tering. Following the example of gas theories H. A. Lorentz set forth a simple theory of irreversible processes of metallic electrons. His theory was, however, not quite correct, since metallic electrons are highly quantum- mechanical and classical theories cannot be applied to them. Quantum-mechanical theories of metal electrons were developed by A. Sommerfeld and F. Bloch. I. Master Equations The Boltzmann equation gives the velocity distribution function of a single particle in the system. This line of approach can be extended in two directions. The first is the so-called master equation. For example, consider a gaseous system consisting of N particles, and ask for the probability distribution of all the momenta, namely, the distribution function fN(Pl' ""PN;t), where PI' ""PN are the mo- menta of the N particles. The equations of motion are deterministic with respect to the complete set of dynamical variables (XI' PI' ..., x N , PN)' The equation for f(PI, "', PN, t) may not be deterministic, but it may be stochastic because we are concerned only with the vari- ables PI' ..., PN' with all information about the space coordinates X I' ... ,X N disregarded. This situation is essentially the same in both classical and quantum statistical mechanics. If the duration of the observation process is limited to a finite length of time and the preci- sion of the 0 bservation to a certain degree of crudeness, the time evolution of the momen- tum distribution function fN can be regarded as a tMarkov process. In general. an equation describing a Markov process of a certain dis- tribution function is called a master equation. Typically it takes the following form for a suitable choice of variables x: (r?jot)f(x, t) = f dx'(W(x',x)f(x',t)- W(x,x'){(x,t)), (25) where W(x, x') is the transition probability from x to x'. By expanding the first integrand into a power series in x - x', with x' fixed and by retaining the first few terms, we obtain the Fokker-Planck equation: (ojot)f(x, t)= -(cj8x)(adx)f(x, t)) +WjOX 2 )(a2(X){(X,t))j2, (26) an(x) = f W(x, x + r)r n dr. (27) 
1519 J. The Hierarchy of Particle Distribution Functions Another way of extending the Boltzmann equation is to consider a set of distribution functions of one particle, of two particles, and generally of n ( < N) particles selected from the whole system of N particles. For example, a two-particle distribution function is the func- tion f2(X1' Vr, X 2 , V 2 , t) for positions and veloc- ities of two particles at time t. The complete dynamics of the entire system of particles can be projected to the time evolution of this hierarchy of distribution functions. The equa- tion of motion for fl then contains the func- tion f2 if the interaction of particles if paIrwise, the equation for f2 contains 13' and so on. Thus the equations of motion for the set of distribution functions make a chain of equa- tions. The whole chain is equivalent to the deterministic equations of motion for the dynamics of N particles. However, if the parti- cle number N is made indefinitely large, with the time scale of observation always finite, the chain of equations for the distribution func- tions asymptotically approaches a stochastic process if certain conditions are satisfied. Approximate methods of solving the hierarchy equations in classical cases have been devel- oped by J. Yvon, J. G. Kirkwood, M. Born, H. S. Green, and others. In quantum statistics, similar hierarchy equations can be considered. A typical exam- ple is the so-called Green's function method [27]. K. Irreversible Processes and Stochastic Processes The statistical mechanics of physical processes evolving in time is a hybrid of dynamics and the mathematical theory of stochastic pro- cesses. A typical example is the theory of Brownian motion. A colloidal particle floating in a liquid moves incessantly and irregularly because of thermal agitation from surrounding liquid molecules. For simplicity, an example of I-dimensional Brownian motion is considered here. Phenomenologically we assume that a colloid particle follows an equation of motion of the form mu= -myu+ f(t), called the Langevin equation, where m is the mass of the colloid particle and u is the veloc- ity. The first term on the right-hand side is the friction force due to viscous resistance, and the second term represents a random force acting on the particle from surrounding moecues. If (28) describes the Brownian motIOn In thermal equilibrium, the friction constant my 402 Ref. Statistical Mechanics and the random force cannot be independent, but are related by a theorem asserting that my= t' (f(tdf(t 1 +t»dt. (29) In an electric conductor, the thermal motion of charge carriers necessarily induces irregular- ities of charge distribution, and so an electro- motive force that varies in time in a random manner is created. This random electromotive force is similar to the random motion of a Brownian particle and is called the thermal noise. For such a thermal noise there exists a relation similar to (29) between the resistance and the random electromotive force. This relation is known as the Nyquist theorem. These theorems are contained in a more gen- eral theorem called the fluctuation-dissipation theorem. When an external force is applied to a system in thermal equilibrium, some kind of irreversible flow, an electric current, for example, is induced in the system. The rela- tionship between the flow and the external force is generally represented by an admit- tance. If the external force is periodic, the admittance is a function of frequency wand is given by (28) x(w)= {"" e-iW'IjJ(t)dt, where ljJ(t) is equal to the correlation function of the flow that appears spontaneously as the fluctuation in thermal equilibrium when no external force is applied. This general ex- pression for an admittance, often called the Kubo formula, gives a unified viewpoint from which responses of physical systems to weak external disturbances can be treated without recourse to the traditional kinetic approach. The static limit (w-->O) of the admittance is the transport coefficient. The reversibility of dynamics leads to relations among transport coefficients, called On sager's reciprocity rela- tions in the thermodynamics of irreversible processes. When external disturbances are so large that the system deviates considerably from thermal equilibrium, the responses may show character- istic nonlinearities. Such nonlinear phenomena are important from both experimental and theoretical points of view, and constitute a central subject of modern research ( 433 Turbulence and Chaos). (30) References [I] L. Boltzmann, Lectures on gas theory, Univ. of California Press, 1964. (Original in German, 1912.) 
403 A Statistical Models [2J S. Chapman and T. G. Cowling, The mathematical theory of non-uniform gases, Cambridge Univ Press, third edition, 1970. [3J R. C. Tolman, The principles of statistical mechanics, Oxford Univ. Press, 1938. [4J D. ter Haar, Elements of statistical me- chanics, Holt, Rinehart and Winston, 1961. [5J J. W. Gibbs, Elementary principles in statistical mechanics, Yale Univ. Press, 1902 (Dover, 1960). [6J P. and T. Ehrenfest, The conceptual foun- dations of the statistical approach in mechan- ics, Cornell Univ. Press, 1959. (Original in German, 1911 ) [7J I. E. Farquhar, Ergodic theory in statis- tical mechanics, I nterscience, 1964. [8J V. I Arnol'd and A. Avez, Ergodic prob- lems of classical mechanics, Benjamin, 1968. [9J R. Jancel, Foundations of classical and statistical mechanics, Pergamon. 1963. [IOJ R. H. Fowler, Statistical mechanics, Cam- bridge U niv. Press, second edition, 1936. [11J A. Ya. Khinchin, Mathematical founda- tion of statistical mechanics, Dover, 1949. (Original in Russian, 1943.) [12J L. 0 Landau and E. M. Lifshits, Statis- tical physics, Pergamon, 1969. (Original in Russian, 1964) [13J R. Kubo, Statistical mechanics, North- Holland, 1965. [14J H. S. Green and C. A. Hurst, Order- disorder phenomena, Interscience, 1964. [15J H. E. Stanley, Introduction to critical phenomena, Oxford U niv. Press, 1971. [16J C. Domb and M. S. Green, Phase tran- sitions and critical phenomena, Academic Press, 1972. [17J 0 Bratteli and D. W. Robinson, Opera- tor algebras and quantum statistical mechan- ics II, Springer, 1981. [18J D. Ruelle, Statistical mechanics: Rigorous results, Benjamin, 1969. [19J D. Ruelle, Thermodynamic formalism, Addison-Wesley, 1978. [20J D. W. Robinson, The thermodynamic pressure in quantum statistical mechanics, Springer, 1971 [21J R. B. Israel, Convexity in the theory of lattice gas, Princeton Univ. Press, 1979. [22J D H. Mayer, The Ruelle-Araki transfer operator in classical statistical mechanics, Springer, 1980. [23J B. M. McCoy and T. T. Wu, The two- dimensional Ising model, Harvard Univ. Press, 1973. [24J N. S. Krylov, Works on the foundation of statistical physics, Princeton Univ. Press, 1979 [25J K Huang, Statistical mechanics, Wiley, 1963. [26J F. Reif, Fundamentals of statistical and thermal physics, McGraw-Hill, 1965. 1520 [27J A A. Abrikosov, L. P. Gor'kov, and I. E. Dzyaloshinskii, Methods of quantum field theory in statistical physics, Prentice-Hall, 1963. (Original in Russian, 1962.) [28JE. H. Lieb and D. C. Mattis, Mathemat- ical phYSICS in one dimension, Academic Press, 1966. 403 (XVII1.5) Statistical Models A. General Remarks A statistical model is defined by specifying the structure of the probability distributions of the relevant quantities. When a statistical model is used for the analysis of a set of data, its role is to measure the characteristics of a certain con- figuration of the data points. R. A. Fisher [IJ advanced a systematic procedure for the appli- cation of statistical models. The process of statistical inference contemplated by Fisher may be characterized by the following three phases: (I) specification of the model, (2) esti- mation of the unknown parameters, and (3) testing the goodness of fit. The last phase is followed by the first when the result of the testing is negative. Thus the statistical in- ference contemplated by Fisher is realized through the process of introduction and selec- tion of statistical models. We always assume that the true distribution of an observation exists in each particular application of statistical inference, even though it may not be precisely known to us. Our par- tial knowledge of the generating mechanism of the observation suggests various possible con- straints on the form of the true dIstribution. The basic problem of statistical inference is then to generate an approximation to the true distribution by using the available obser- vational data and a model defined by a set of probability distributions satisfying the constraints. B. The Criterion of Fit The use of statistical models can best be ex- plained by adopting the predictive point of view, which defines the objective of statistical inference as the determination of the predictive distribution, the probability distribution of a future observation defined as a function of the information available at present. The perfor- mance of a statistical inference procedure is then evaluated in terms of the expected dis- crepancy of the predictive distribution from the true distribution of the future observation. 
1521 The probabilistic interpretation of thermo- dynamic entropy developed by L. Boltzmann [2J provides a natural measure of the discre- pancy between two probability distributions. The entropy of a distribution specified by the density f(y) with respect to the distribution specified by g(y) is defined by B ([ . ) - - f [(Y) IO l j(YJl ( ) d ,g - g(y) g g(y)J g y y, where, as in what follows, the integral is taken with respect to some appropriate measure dy. This definition of entropy is a faithful repro- duction of the original probabilistic interpre- tation of the thermodynamic entropy by Boltz- mann and allows the interpretation that the entrOpy B(/;g) is proportIonal to the logar- ithm of the probability of getting a statistical distribution of observations closely approxi- mating f(y) by taking a large number of inde- pendent observations from the distribution g(y). (For a detailed discussion  [3].) Obviously we have B(/;g)= f [(y)logg(y)dy- fj(y)IOgf(y)d Y . The second term on the right-hand side is a constant depending only on f(y). The first term is the expected log likelihood of the dis- tribution g(y) with respect to the true distri- bution fry). Thus a distribution with a larger value of the expected log likelihood provides a better approximation to the true distribution. Even when f(y) is unknown, logg(y) provides an unbiased estimate of the expected log like- lihood. This fact constitutes the basis of the objectivity of the tlikelihood as a criterion for Judging the goodness of a distribution as an approximation to the true distribution. C. Parametrization of Probability Distributions When we construct a statistical model it is a common practice to represent the uncertain aspect of the true distribution by a family of probability distributions with unknown para- meters. This type of family is called a para- metric family; the model is called a parametric model. The parameters in a parametric model are the keys to the realization of the infor- mation extraction from data by statistical methods. Accordingly, the introduction of mathematically manageable parametric models forms the basis for the advance of statistical methods. (1) Pearson's System of Distributions. A wide family of distributions can be generated by assuming a rational-function representation of 403 C Statistical Models the sensitivity of the density function f(y) given by d aO+a1y+...+apYP -logj(y)= dy bo+h,y+... +bqyq Pearson's system of distributions is defined by putting p = 1 and q = 2 and by assuming vari- ous constraints on the parameters a j and b j and the support of f(y) [4]. E. Wong [5J discussed the construction of continuous-time stationary Markov processes with the distributions of Pearson's system as their stationary distributions. This allows a structural interpretation of the parameters of a distribution of the system. (2) Maximum Entropy Principle and the Ex- ponential Family. To develop a formal theory of statistical mechanics E. T. Janes [6J intro- duced the concept of the maximum entropy estImate of a probability distribution. This concept leads to a natural introduction of the exponential family. Following Kullback [7J, we start with a distribution g(y) and try to find f(y) with prescribed expectations of statistics T 1 (y), ..., (y) and with maximum entropy B(/;g). Such a distribution f(y) is given by the relation [(y) ex: exp[T l Tdy) + '" + 'k (y)Jg(y), where it is assumed that the right-hand side is integrable. By varying the parameters, 1, ..' , Tk over the allowable range we get the exponen- tial family of distributions. I. J. Good [8J con- sidered the Janes procedure as a principle for the generation of statistical hypotheses and called it the maximum entropy principle. (3) Parametric Models of Normal Distribu- tions. Of particular interest within the ex- ponential family is the family of normal distri- butions. This is obtained by assuming the knowledge of the first- and second-order mo- ments of a distribution and applying the maxi- mum entropy principle [9]. Obviously, the parametrization of a normal distribution is concerned only with the mean vector and the variance-covariance matrix. Let X = (XI, ..., X n )' be an n-dimensional normal random variable with mean EX = (m 1 , ..., m n )' and variance-covariance matrix I=(O"ij), where O"ij=E(Xj-mJ(Xj-mJ (E denotes expectation and' denotes the trans- pose.) A nonrestrictive family of n-dimensional normal distributions is characterized by n + n(n + 1 )/2 parameters, m j (i = 1, ..., n), and O"ij (i = I, '" ,n;j = 1, ... ,n). The prior information on the generating mechanism of X introduces constraints on these parameters and reduces the number of free parameters. Reduction of the dimensionality of the 
403 C Statistical Models parameter vector m =(m 1 , ..., mnY is realized by assuming that m is an element of a k- dimensional subspace of R n spanned by the vectors a 1 =(al1,"', a tn )',..., a k =(a k1 ,..., akn)', i.e., by assuming the relation m=Ac, where A =(a l , ...,a k ) is an n x k matrix and c= (c I' ... ,cd' is a k-dimensional vector with k < n. This parametrization is obtained when for each Xi the observation (ail' ..., aid is made on a set of k factors and the analysis of the linear effect of these factors on the mean of Xi is required. We have the representation X=Ac + W, where W is an n-dimensional normal random vector with EW = 0 and variance- covariance matrix  ( Section D). To complete the model we have to specify the variance-covariance matrix . One of the simplest possible specifications is obtained by assuming that the Xi are mutually indepen- dent and of the same variance (J2. This reduces  to (J2 I, where I denotes an n x n identity matrix. With this assumption the number of necessary parameters to represent the variance-covariance matrix reduces from n(n + 1 )/2 to 1. The model obtained with the assumptions  = (J2 I and m = Ac is called the general linear model (or regression model) with normal error, or simply the normal linear model. The model is called a regression model also when the a i are random variables ( e.g., [10, II J ). A typical example of nontrivial parametri- zation of the covariance structure of X is ob- tained by assuming the representation X=m+AF+W, where F = (/1' .., ,fqY denotes the vector of random effects and W the vector of measure- ment errors. It is assumed that F and Ware mutually independent and normally distri- buted with EF = 0 and EW = O. Also the com- ponents of Ware assumed to be mutually independent. The variance-covariance matrix  of X is then given by =A<1>A' +L\, where <1> = EFF' and L\ = EWW', which is diagonal. When A is a design matrix, the parametri- zation provides a components-of-variance model (or random-effects model) of which the main use is the measurement of the variance- covariance matrix <1> of the random effects fl' ... ,fg rather than the measurement of F itself. If we consider F to be representing the effects of some latent factors for which A is not uniquely specified, the above representation of  gives merely a formal, or noncausal, parametrization of E. In this case the model is 1522 called the factor analysis model and the dimen- sion 9 of F is called the number of factors. By keeping the number of factors sufficiently smaller than the dimension of X, we get a parametrization of  with a smaller number of free parameters than the unconstrained model. Starting with 9 = 1 and successively increasing the number offactors, we can get a hierarchy of models with successively increasing numbers of parameters. ( [12J for a very general modeling of the variance-covariance matrix.) (4) Parametrization of Discrete Distribution. Consider the situation where the observation produces one of the events represented by r = 0, 1,2, ... ,k with probability p(r), where k may be infinite. Represent by X = (XI, ... ,X n ) the result of n independent observations. The probability p(X) of getting such a result is given by the relation k 10gp(X)= I 8 r n r (X), r='O where e, = log p(r) and nr(X) denotes the num- ber of X:s which are equal to r. (The term not depending on the e:s is omitted in the above and subsequent formulas, since it is immaterial for problems of inference.) Thus a nonrestric- tive model is obtained by assuming only the relations Or';;; ° and Loe8,= 1. Obviously the model defines an exponential family and vari- ous useful parametrizations are realized by introducing some constraints on the para- meters e r . When the events r are arranged in a 2- dimensional array (i,j) (i = 1, ..., m;j = I, "', n) we have m n 10gp(X)= I I eijnii X ). i==l j=l One simple parametrization is given by Oij= fl.+ ct i + [lj+ lij' where it is assumed that LI ct i = L}=I [lj= L=I lij= L'j1 tij= 0. Obviously this is a pa- rametrization of e ij as a linear function of the parameters ct i , [lj' and lij' and the model thus obtained is called the log linear model. The model shows a formal similarity to the analysis-of-variance model ( Section D). By introducing successively more restrictive as- sumptions on the parameters, we can get a hierarchy of models for the analysis of a two- way contingency table. Extension to cases when more than two factors are involved is obvious ( e.g., [13J). Here we consider that Xi is a dichotomous variable, i.e., k = 1, and that the probability of X i = 1 may depend on i, i.e., we have 10gp(X;) = nOOi+(e li - eOi)n l (X;), 
1523 where 8 0i =logProb(X i =O) and Oli= log Prob(Xi = 1). We assume that a vector of observations a i = (ail' ..., aim)' is available simultaneously with Xi and that we are inter- ested in analyzing the relation between a i and the probability distribution of Xi' The analysis is realized by exploring the functional relation between 'i = 81i - 8 0i and a i . We can assume a linear relation 1=Ac, where 1=('1,"" Tn)', A =(a 1 ,..., an)', and c = (cJ,"" cm)" The parameter 'i=log{p(X i = l)j(l-p(X i = I))} is the log odds ratio or 10git of the event Xi = 1, and the model is called the linear logistic model [14]. A hierarchy of models can be generated by assuming a successively more restrictive linear relations among the components of C. D. General Linear Models Another class of models often used in practical applications is composed of general linear models or linear regression models, where the observed value is considered to be the sum of the effects of some fixed causes and the error. Let X=(X I , ...,X n )' be an n-dimensional trandom variable, and denote the expectation of X by E(X) =(111, ..., I1n)'. If E(X) is of the form A'; with an unknown parameter'; = (" "', d', and a given n x k matrix A, then we can ex press X as X=A.;+W, E(W)=(O,...,O)', with the error term W = (WI' "', w,,)'. We frequently assume a set of conditions on the distribution of X; for example, (i) XJ, ..., X n are mutually +independent, (ii) XI' "', X n have a common unknown tvariance (J2, (iii) (X" .., ,X n ) is distributed according to an n- dimensional tnormal distribution. The equa- tions (1) together with conditions on the distribution are called a linear model. Among the methods of statistical analysis of linear models are regression analysis, analysis of variance, and analysis of covariance as explained below, but these are not clearly distinguished from each other. (I) In design-of- experiment analysis, i.e., analysis of variance, the matrix A and the vector'; in (1) are called a design matrix and an effect, respectively. In this case entries of A are assumed to be either 1 or O. (II) In regression analysis, we are first given a linear form x = LJ=I ajj of a vector a = (aI' ... ,a k )' with coefficient vector'; = (I, ..., k)" Let XI, ..., X n be the observed values of x at n points at =(ajj, ...,an)', ...,a n = (anI' ..., and', respectively, where n> k. If the observations are unbiased, that is, if E(X;) = 403 E Statistical Models LJ=I aijj, i = 1, ..., n, then the model (1) is obtained with A = (aij)' Usually one of the components of the vector a is taken as unity. In this framework the form x = Ljajj is called the linear regression function or regression hyperplane, and for k = 2, the graph of the linear function x = a l I + 2 and its coefficient I are called the regression line and regression coefficient, respectively. The components of the vector a are called fixed variates or ex- planatory variables. Frequently we encounter the case where the vector a, and consequently the matrix A, are random variables. When this is the case, a discussion like that above can be carried out for given A by regarding A'; in (1) as the conditional expectation of the vector X. E. The Method of Least Squares (1) Consider the subspace L(A) of the tsample space Rn spanned by the column vectors of A. Then the dimension s of L(A) equals the rank of A, and L(A) and its torthocomplement L l.(A) are called the estimation space and the error space, respectively. The torthogonal projection y of a point x to the space L(A) is expressed as y = P A x with a real tprojection matrix P A . The variable Y = P A X is called the least squares estimator of E(X), and the rou- tine of getting such an estimator Y, called the method of least squares, minimizes the squared error (X - A.;)'(X - A';) for a given X. This method consists of two operations solving the normal equation A' A'; = A'X with respect to .;, and setting Y = A, where  is a solution of the equation. For s= k, we obtain Y =A(A'A)-t A'X directly. Even when s<k, where the solution of the normal equation is not unique, Y is uniquely determined. The quantity Q = X'(l- PA)X, where I is the unit n x n matrix, is the squared distance of the point X from the space L(A) and is called the error sum of squares with n - s degrees of freedom. A linear function P'.; of the parameter'; with coefficient vector P= (fJp ..., fJk)' is called a linearly estimable parameter (or estimable parameter) if there is a linear unbiased esti- mator, that is, an unbiased estimator of the form b'X, of P'.;. In order that P'.; be estimable it is necessary and sufficient that P' be a linear combination u' A of the row vectors of the matrix A. A linear unbiased estimator that has minimum variance among all linear unbiased estimator uniformly in .; is called the best linear unbiased estimator (b.l.u.e.). If the con- ditions (i) and (ii) of Section 0 are satisfied, then for any given n-vector u the b.l.u.e. of a parameter y=u'A is given by y=u'Y with Y = PAX, and its variance equals (u' P A U)(J2, while the expectation of the quantity Q is 
403F Statistical Models gIven by (n-s)0"2. This proposition is known as the Gauss-Markov theorem. Hence the b.l. u.e ]I = u'Y is frequently cited as the least squares estimator of y. The quantity 8 2 = Q/(n - s) is an unbiased estimator of 0"2 and is called the mean square error. If in addition the condition (iii) of Section D is assumed, then y and 8 2 are the tuniformly minimum variance unbiased estimators of y and 0"2, respectively. When the error term W in (1) has covar- iance matrix L = 0"2 La with an unknown real parameter 0"2 and a known matrix La, the valaue of the parameter'; minimizing Q = (X - A';), L01 (X - A';) is called the generalized least squares estimator of'; if it exists. This estimator has properties similar to those of the least squares estimator. F. Model Selection and the Method of Maximum Likelihood When a parametric family of distributions {f(.j 0); OE E>} is given and an observation x is made, logf(x j 0) provides an unbiased esti- mate of the expected Jog likelihood of the distribution f('1 0) with respect to the true distribution of the observation. The .value of 0 which maximizes this estimate is the maximum likelihood estimate of the parameter and is denoted by O(x) ( 399 Statistical Estimation) In a practical application we often have to consider a multiple model, defined by a set of component models {.t;(. I 0;); OiEE>i} (i= I, .., ,k). The problem of model selection is concerned with the selection of a component model from a multiple model. The difference of the difficulties of handling a simple model defined by a one-component model, and a multiple model is quite significant. For a sim- ple model {f(.j 0); 0 E E>}, each member of the family is a probability distribution. In the case of a multiple model, its member is a model which is simply a collection of distributions and does not uniquely specify a probabilistic structure for the generation of an observation. Thus the likelihood of each component model with respect to the observation x cannot be defined and the direct extension of the method of maximum likelihood to the problem of model selection is impossible. This constitutes a serious difficulty for the handling of multiple models. Apparently, Fisher used the proce- dure of testing to solve this difficulty. (1) Analysis of log Likelihood Ratios. The procedure of model selection by testing, which is applicable to a wide class of models, is the method of analysis of log likelihood ratios [15]. Consider the situation where a model is to be determined by using a hierarchy of 1524 models { f('j 0;); OiE E>;} (i = 1, "', k) such that E>1 c E>2 C... C E>k' The comparison of models is then realized through the comparison of the maximum likelihoods f(x I 0i(X)), where Oi(X) denotes the maximum likelihood estimate of Oi based on the data x. For E>i c E)i the log likelihood ratio is defined by A(E>;/E>j; x)= -2Iog{f(xl Oi(x))/f(x j OJ(x))}. The analysis of log likelihood ratios is realized by the decomposition A(E>I/E>k;x) =A(E>,/E>2; x)+A(E>Z/E>3;X)+... + A(E>k_ rlE>k; x). The log likelihood ratios A(E>knrlE>k; x), A(E>k-2/E>k-1; x), "', A(E>rlE>2; x) are succes- sively tested by referring to chi-square distri- butions with the degrees of freedom d(k)- d(k-1), d(k-1)-d(k-2), ...,d(2)-d(1), respectively, where d(i) denotes the dimension of the manifold E>i' The assumption of the chi- square distributions is only asymptotically valid under the usual regularity conditions ( 400 Statistical Hypothesis Testing). The model defined with E>i for which A(E>;/8 1 i-j; x) first becomes significant is selected and f(y I ei(x)) is accepted as the predictive distribution. The problem of how to choose the levds of sig- nificance to make the test procedure a proce- dure for model selection remains .open. (2) Model selection by Ale. One way out of the difficulty of model selection is to assume a prior distribution over E>i for each model {J.('j 0;); 0iEE>J This leads to Bayesian model- ing, which is discussed in Section G. Another possibility is to replace each cOIl1.ponent model {t;CjOi);OiEE>i} by a distribution t;('jOi(X)) specified by the maximum-likelihood estimate Oi(X). The problem here is how to define the likelihood of each distribution 1.('1 Oi(X)). An information criterion AIC was introduced by H. Akaike [16J for this purpose; it is defined by AIC = (- 2)loge(maximum likelihood) + 2 (number of estimated parameters). We may consider -0.5 AIC to be the log "likelihood" of f('1 O(x)) which is corrected for its bias as an estimate of ExEy logf(y j O(x)), where Ex denotes the expectation with respect to the true distribution of x, and where it is assumed that x and y are independent and identically distributed. The maximum "likeli- hood" estimate of the model is then defined by the model with minimum Ale. This realizes a procedure of model selection that avoids the ambiguity of the testing procedure. It is appli- cable, at least formally, even to the case of a nonhierarchical set of models 
1525 G. Bayesian Models Consider the situation where an observation x is made and it is desired to produce an esti- mate pry I x) of the true distribution of a future observation y. p(yl x) is called a predictive distribution. Assume that x and yare sampled from one of the distributions within the family {g(y I O)f(x 10); 0 E E>}, i.e., x and yare stochasti- cally independent but share common struc- tural information represented by O. As a design criterion of p( y I x) we assume a probability distribution n(O) of O. The model {AI 0); OE E>} with n(O) is called a Bayesian model and n(O) is called the prior distribution. From the relation EeExleEYlelogp(y I x) = ExEy])ogp(y I x), where E y1x denotes the expectation of y con- ditional on x and Ex the expectation with respect to the marginal distribution of x, it can be seen that the optimal choice of pry I x) which maximizes the expected log likelihood is given by the conditional distribution p(ylx)= f g(YIO)p(Olx)dO, where p(e I x) is the posterior distribution of 0 defined by p(OI x)= f(x I o)n(o)(f f(x I o)n(O)do)-1 When a prior distribution n(O) is specified, the parametric family of distributions {f('1 0); OE E>} is converted into a stochastic structure which specifies a probability distribution of the observations. The likelihood of the struc- ture, or the Bayesian model, with respect to an observation x is defined by f f(x I O)rc(O) dO. When there is uncertainty about the choice of the prior distribution we can consider a set of possible prior distributions and apply the method of maximum likelihood. Such a proce- dure is called the method of type II maximum likelihood by I. J. Good [17]. For a multiple model {jkl OJ); OjE E>;} (i = 1,.. ,k), if prior distributions ITj(O;) are defined, a model selec- tion procedure is realized by selecting the Bayesian model with maximum likelihood. Bayesian modeling has often been consid- ered as not quite suitable for scientific appli- cations unless the prior distribution is objec- tively defined. However, even the construction of an ordinary statistical model is always heavily dependent on our subjective judgment. Once the objective nature of the likelihood of a Bayesian model is recognized, the selection or determination of a Bayesian model can 403 Ref. Statistical Models proceed completely analogously to Fisher's scheme of statistical inference ( e.g, [18J). The basic underlying idea of both the mini- mum AIC procedure and Bayesian modeling is the balancing of the complexity of the model against the amount of information available from the data. This unifying view of the con- struction of statistical models is obtained by the introduction of entropy as the criterion for judging the goodness of fit of a statistical model ( [19J for mOre details). References [IJ R. A. Fisher, Statistical methods for re- search workers, Oliver & Boyd, 1925; Hafner, fourteenth edition, 1970. [2J L. Boltzmann, Ober die Beziehung zwi- schen dem Zweiten Hauptsatze der mechani- schen Warmetheorie und der Wahrscheinlich- keitsrechnung respektive den Satzen iiber das Warmegleichgewicht, Wiener Berichte, 76 (1877),373-435. [3J I. N. Sanov, On the probability of large deviations of random variables, IMS and AMS Selected Trans\. Math. Statist. Pro b., 1 (1961), 213244. (Original in Russian, 1957.) [4J K. Pearson, Contributions to the mathe- matical theory of evolution II, Skew vari- ation in homogeneous material, Philos. Trans. Roy. Soc. London, ser. A, 186 (1895),343-414. Also included in Karl Pearson's early statis- tical papers, Cambridge Univ. Press, 1948, 41  112. [5J E. Wong, The construction of a class of stationary Markoff processes, Proc. Amer. Math. Soc. Symp. App\. Math., 16 (1963), 264 276. Also included in A. H. Haddad (ed.), Nonlinear systems, Dowden, Hutchinson & Ross, 1975, 33-45. [6J E. T. Janes, Information theory and statis- tical mechanics, Phys. Rev., 106 (1957),620 630. [7J S. Kullback, Information theory and statis- tics, Wiley, 1959 (Dover, 1967). [8J I J. Good, Maximum entropy for hypoth- esis formulation, especially for multidimen- sional contingency tables, Ann. Math. Statist., 34 (1963), 911934. [9J C. E. Shannon and W. Weaver, The math- ematical theory of communication, Univ. of Illinois Press, 1949. [IOJ S. R. Searle, Linear models, Wiley, 1971. [IIJ F. A. Graybill, Theory and application of the linear model, Duxbury Press, 1976. [12J K. G. J6reskog, A general method for analysis of covariance structures, Biometrika, 57 (1970), 239251. [13J Y. M. M. Bishop, S. E. Feinberg, and P. W. Holland, Discrete multivariate analysis: Theory and practice, MIT Press, 1975. 
404 A Statistical Quality Control [14J D. R. Cox, The analysis of binary data, Chapman & Hall, 1970. [15J I. J. Good, Comments on the paper by Professor Anscombe, J. Roy. Statist. Soc., ser. B, 29 (1967), 39-42. [16J H. Akaike, A new look at the statistical model identification, IEEE Trans. Automatic Control, AC-19 (1974), 716 723. [17J I. J. Good, The estimation of proba- bilities, MIT Press, 1965. [18J G. E. P. Box, Sampling and Bayes' in- ference in scientific modeling and robustness, J. Roy. Statist. Soc., ser. A, 143 (1980), 383- 430. [19J H. Akaike, A new look at the Bayes pro- cedure, Biometrika, 65 (1978), 53-59. 404 (XVIII.14) Statistical Quality Control A. General Remarks According to the Japanese Industrial Standard (lIS) Z 8101, "Quality Control (QC) is a system comprising all the methods used in manufac- turing products or providing services econom- ically that meet the quality requirements of consumers." To emphasize that modern qual- ity control makes use of statistical methods, it is sometimes referred to as Statistical Qual- ity Control (SQc). In order to implement effective QC, statistical concepts and methods must be applied and the "Plan-Oo-Check- Action" (POCA) cycle must be followed in research and development, design, procure- ment, production, sales, and so on. These QC activities are executed on a company- wide basis from the top management to the production workers. This type of QC is called Company-Wide Quality Control (CWQc) or Total Quality Control (TQc)' The quality Q is an abstract notion of the conformity of a product or service to con- sumers' requirements; it also refers to the total of the characteristics of a prod uct or service as perceived by consumers. The quality charac- teristics may include both measurable physical and/or chemical features, such as strength and purity, or features such as color or texture as appreciated by individuals. These latter char- acteristics could be called "consumer qual- ities." Furthermore, the concept "quality" has also been used to describe the social im- pact of a product or service. This might be called "social quality." Examples of social- quality issues are pollution by solid waste or drainage in the production stage; degradation, maintainability, and safety of a product in 1526 daily use; and pollution following disposa1. For a product or service to conform to this sort of quality it has become necessary to conduct QC activities not only during produc- tion but also at early stages of design and development of new products. The measured characteristics of quality vary from one product to another because of natural variability in the material and produc- tion process involved, ability of individual workers, errors in different sorts of measure- ment, etc. If the variations among the mea- sured values from a process can bf: attributed to "chance causes" and their distribution ex- pressed by a probability or a probability den- sity function, the process is said to be in a "state of statistical control" according to W. A. Shew hart or in a "stable state" by lIS. In this case the value of a characteristic is deemed to be the realization of a random variable X. Sometimes the variations are attributed to "assignable causes," which must be identified and eliminated. B. Control Charts The control chart provides a means of evaluat- ing whether a process is in a stable state. The control chart is made by plotting points illustrating a statistic of the quality charac- teristics or manufacturing conditions for an ordered series of samples or subgroups. A sheet of the control chart is provided with a middle line between a pair of lines depicting the upper control limit (VCL) and the lower control limit (LCL). The stable state is as- sumed to be exhibited by points within the control limits. Points falling outside the con- trollimits suggest some assignable causes, which should be eliminated through corrective measures. The idea underlying control charts as devel- oped by Shewhart is to apply the statistical principle of significance to the control of pro- duction processes. Other types of control charts have also been developed, for example, acceptance control charts and adaptive control charts. These have been successfully applied to many quality control problems. The foundation of Shew hart's control chart is the division of observations into what are called "rational subgroups." A rational sub- group is the one within which variability is due only to chance causes. Between different subgroups, however, variations due to assign- able causes might be detected. In most pro- duction processes the rational subgroup com- prises the data collected over a short period of time during which essentially the same con- 
1527 dition in materia], too] setting, environmental factors, etc., prevails. The limits on the control charts are placed, according to Shewhart, at a 30" distance from the middle line, where 0" is the population standard deviation (or standard error) of the statistic; 30" expresses limits of variability within the subgroup. Assuming that the popu- lation distribution of an observed character- istic is "normal," the range between the limits should include 99.7% of the points plotted so long as the process is "in control" at the mid- dle value. Accordingly, 0.3% of the plotted points from the "in control" process fall out- side the limits, and thus give an erroneous "out of control" signal. To determine that the process is in con- trol for a normally distributed characteristic N (f1, 0"2), we have to investigate the variability between the means f1 and the standard devi- ations 0" of different distributions of X for different subgroups. Thus the state of control of a process is determined with control charts for n x= I XJn and s= i=l n I (Xi-X)/(n-I), i=l which are the appropriate statistics corre- sponding to f1 and 0". Despite the theoretical dra wback of the statistical range R = maxi X i - mini Xi against s, use of the range is often pre- ferred in QC work because of its simplicity in computation. Hence the X - R charts are obtained from the previously collected k ra- tional subgroups each of size n as follows: UCL=X +A2R, UCL=D 4 R. LCL=X -A 2 R. LCL=D 3 R, where X and R are the averages of the k values of X and R, respectively, and 3 A 2 = C ' ...;nd 2 d 3 D4=1+3 d 2 ' d 3 D 3 = I- 3 d' 2 E[R]=d 2 0", V[R]=E[(R-E[RW] =d 3 2 0"2. For n < 7, LCL for R cannot be given because D3 becomes negative. The other commonly used control charts are the p chart (proportion of nonconformity: binomial distribution) and the c chart (number of defects: Poisson distribution). For those charts the above theory of norma] distribution is also used to approximate the binomial and Poisson values. It is generally sufficient to use the agreed-on decision criterion (30" limits) and to recognize a relatively small risk (C( = 0.003) for practical purposes. It should be noted, however, that a shift of the process mean f1 by 10" would not be 404C Statistical Quality Control observed at a ratio of 97.7%, which is the value of the risk fJ for each plotted point, under the norma] distribution of the plotted statistic. One reason why the control chart has been a practical too] in many applications is this lack of sensitivity to a relatively small shift of the level. If greater sensitivity is required, 20" limits, "warning limits" are used. This results in a greater risk C( of erroneously finding a process out of control. Other decision criteria based on aspects of run theory are also used. Charts using ac- cumulated data from several rational sub- groups for each plotted value are sometimes recommended: Moving average and moving range charts and the cusum ("cumulative sum") chart are examples. The statistical theory for these charts is more complicated than that for the simple charts discussed here. C. Sampling Inspection Sampling inspection determines whether a lot should be accepted or rejected by drawing a sample from it, observing a quality character- istic of the sample, and comparing the ob- served value to a prescribed acceptance crite- rion. Sampling may be conducted in several stages. Definite criteria are required to decide at each stage whether to accept or reject the lot or continue sampling on the basis of sam- ple values observed so far. There also must be some rules to determine the size of the next sample if it is to be taken. These criteria and rules together are called the sampling inspec- tion plan. The number of samples eventually drawn and observed and their sizes are gener- ally random variables. In single sampling in- spection the final decision is always reached after one stage of sampling is completed. Dou- ble sampling inspection makes the final deci- sion after at most two stages of sampling are completed. Multiple sampling inspection makes the final decision after at most N stages of sampling are completed (N < 00). Inspection without a predetermined limit on the number of sampling stages, sequential sampling inspec- tion, is usually constructed so that the proba- bility of the indefinite continuation of sam- pling is O. Once a sampling inspection plan is deter- mined, the probability for accepting a lot with given composition can be calculated. This probability as a function of lot composition is called the operating characteristic of the plan. In most cases, the quality of a lot is expressed by a rea] parameter () (e.g., fraction defective, i.e., percentage of defective products, or the average of some quality characteristic), and we use only inspection plans whose operating 
404 Ref. Statistical Quality Control characteristics are expressed as a function of e. The graph of this function is called an OC- curve. We impose certain desirable conditions on the DC-curve and design plans to satisfy them. Tables for this purpose, sampling inspec- tion tables, are prepared for practical use. The condition most frequently employed is ex- pressed in the following form in terms of four constants eo, 0 1 , rx, fJ: The probability of rejec- tion is required to be at most rx when e  eo (or e  00), and the probability of acceptance at most fJ when 0e1 (or (}Od. Here rx is called the producer's risk, and fJ the consumer's risk. If the rejection of a lot is identified with the rejection of a statistical hypothesis 0  00' the DC-curve is actually the power curve of the test upside down (i.e., the graph of 1 minus the tpower function), and the producer's and consumer's risks are precisely the terrors of the first and second kind. The choice of a plan is actually the choice of a test under certain con- ditions on its power curve. Commonly used plans are mostly based On well-established tests, some of which have certain optimum properties A few examples, (1 )(4), are given below. Here sampling inspection by attribute is an inspection plan that uses a statistic with a discrete distribution, whereas sampling inspec- tion by variables uses a statistic with a contin- uous distribution ( 400 Statistical Hypoth- esis Testing). (1) Single sampling inspection by attribute concerning the fraction of defective items in a lot: Let the defective fraction be denoted by P and identified with () in the preceding para- graph. Assign two values of p, say Po and PI (0 < Po < PI < 1), the producer's risk rx, and the consumer's risk fJ. Together they give con- ditions to be fulfilled by the DC-curve. Draw n items from a lot at random, and suppose that they contain Z defective items. The decision IS then made after observing Z, whose distri- bution is thypergeometric and approximately tbinomial when the size of the lot is large enough. There exists a plan that minimizes n among all plans satisfying the imposed COn- ditions under either of the two assumptions about the distribution of Z. It rejects the lot when Z is greater than a fixed number deter- mined by Po, PI, rx, and fJ. This plan is based on the tUM P test of the hypothesis P  Po against the alternative P  Pl' (2) Single sampling inspection by variables concerning the population mean f1 in the case where the population distribution is N (f1, 0'2) with known 0'2: Draw a sample (XI' ..., X n ) of size n from a lot. Assume that the X are inde- pendently distributed with the same distri- bution N(J1, 0'2). Suppose that smaller values of the quality characteristic stand for a more desirable quality. If two values of f1, say f10 and 1528 f11, rx, and fJ are assigned, a plan can be estab- lished to minimize n. It rejects the lot when the sample mean X = L71 XJn exceeds a fixed number determined by f10, f11' rx, and fJ. This too is based on the UM P test of J1  J10 against J1f1I' (3) Cases where the samples are drawn in mOre than one stage: As in (2), assign two values of e, rx, and fJ; there is still liberty to choose n 1 , n 2 , ..., which are the sizes of the samples drawn at each stage. Hence there are many possible plans fulfilling the imposed conditions. Among them a plan is sought to minimize the expectation of n = n 1 + n 2 + ... (called the average sample number). For example, plans based on the sequential proba- bility ratio tests are in common use ( 400 Statistical Hypothesis Testing). (4) Among other special plans, sampling inspection with screening and samilling inspec- tion with adjustment are worthy of mention. In the first plan. all the units in the rejected lots are inspected and defective units replaced by non defective ones. In this case, fixing PI (the lot tolerance percent defective) and fJ, or the average fraction defective after the inspection (average outgoing quality level), we attempt to minimize the expected amount of inspection, that is. the expected number of inspected units induding those in the rejected lots. In the second plan, acceptance criteria are tightened or loosened according to the qual it;, of the lots Just inspected. References [IJ J. M. Juran, Quality control handbook, McGraw-Hill, 1974. [2J Japanese Standard Association, Termi- nology, 1IS Z 8101,1963. [3J Japanese Standard Association, Sampling inspection, 1IS Z 9001-9006, 1957. [4J Japanese Standard Association, Control charts, 1IS Z 90219023, 1963. 405 (XVI1.16) Stochastic Control and Stochastic Filtering A. General Description of Stochastic Control Stochastic control is an optimization method for systems subject to random disturbance. Let r be a compact convex subset of R\ called a control region. Let W, be an tn-dimensional Brownian motion and O't(W)=O'(H-;;st) (say 3";) be the least to'-field for which I, s  t, are 
1529 measurable. An t3";-progressible measurable ['-valued process is called an admissible con- trol. For an admissible control V, the system evolves according to an n-dimensional con- trolled stochastic differential equation (CSDE) dX, = C((X" V,)dW. + y(X" V,) dt, where a symmetric n x n matrix C((x, u) and n- vector y(x, u) are continuous in R n x [' and Lipschitz continuous in xERn. Hence the CSDE has a unique solution, called the re- sponse for V,. The problem is to maximize (or minimize) the performance J: J("x, cp, V) = EX[I e-JC(X,.u,)dSf(X" V,)dt + e - JoC(X,. UJds CP(X T )] , where X, is the response for V, with Xo = x and , is a constant time or a thitting time asso- ciated with a target set. We put V("x,cp)= sUPUEadm controlJ(-r, x, cp, V) the value function as a function of x. If the supremum value is attained at an admissible control 0" then 0, is called an optimal control. B. Bellman Principle In order to get V(t + s, x, cp), R. Bellman ap- plied the following two-stage optimization. After using any V up to time t, a controller changes V to an optimal one. Then at time t + s the performance J(t, x, V(s,', cp), V) is ob- tained. Taking the supremum with respect to V, One gets V(t + s, x, cp). This is called the Bellman principle. Let C be the tBanach lattice of the totality of bounded and uniformly con- tinuous functions on R n . Suppose that c(, y, f and c(;;;' 0) are bounded and smooth; then for constant time t, the value function V(t, x, cp) belongs to C whenever cp E C. Moreover, the family of operators V(t) defined by V(t)cp(x)= V(t, x, cp) becomes a tmonotone contraction semi group on C. The semigroup property V(t + s, x, cp) = V(t, x, V(s, " cp)) is nothing but the Bellman principle. The tgenerator G is expressed by Gcp(x) = sup {L"cp(x)- c(x, u)cp(x) + f(x, u)} UEf for a smooth function cp, where L U is the gen- erator of tdifTusion of the response for con- stant control U(E 1), namely, 1 n a 2 L" =  L C(jp(x, u)C(jp(x, u)-;;---- a 2 j.l.p1 oX j Xi n a + j Yi(X' u) ax; . Furthermore, assume that C( is uniformly posi- 405 C Stochastic Control and Stochastic Filtering tive definite and cp is smooth. Then V(t)cp(x) belongs to tWpi;c for any p and is the unique solution of the Bellman equation ( = dynamic programming equation) oW --:;-=sup {L"W -c(x, u)W + f(x, u)} at UEf a.e. on (0, 00) x Rn, W(O, x) = cp(x) on Rn. In addition if infx.u c(x, u) > 0, then W = lim Hoo V(t)cp exists and is the unique solution of the Bellman equation sUPuef{LUW-c(x,u)W + f(x, u)} = ° a.e. on R n . When T is the hitting time, the value function is related to the Beil- man equation with a boundary condition. C. Feedhack Control In practical problems we specify the kind of information on which the decision of the con- troller can be based at each time. We fre- quently assume that the data obtained up to that time is available. The following situations are possible: (I) The controller knows the com- plete state of the system. This is called the case of complete ohservation. (2) The controller has partial knowledge of the state of system. This is called the case of partial observation. A feedback control (= policy) is a function of available information, namely, a ['-valued progressible measurable function defined on [0, 00) x Cl[O, 00), where j is the dimension of data and Cl[O, 00) is a metric space of totality ofj-vector valued continuous functions on [0, 00). A policy V is called a Markovian policy if V(t,) is a Borel function of t and the tth coordinate of. When a policy V is applied, the system is governed by the tSD E dX,=C((X" V(t, Y))dW.+y(X" V(t, Y))dt with data process Y. When the SDE has a tweak solution, V is called admissible. For example, when X = Y, any Markovian policy is admissible if C( is uniformly positive definite. Let X, be a weak solution for V. Then its performance J(" x, cp, V)  V(t, x, cp). (1) The case of complete ohservation. When C( is uniformly positive definite, an optimal Markovian policy can be constructed in the following way. Since [' is compact, there exists a Borel function 0 on [0, 00) x R n which gives the supremum, namely, sup {L"V(t)cp(x) - c(x, u) V(t)cp(x) + f(x, u)} uef = L O(r.X)V(t)cp(x) - c(x, Oft, x)) V(t)cp(x) + f(x, O(t,x)). This relation implies that V(t)cp(x) = J(t, x, cp, Oft, X,)) for any weak solution X,. Hence 
405 D Stochastic Control and Stochastic Filtering a is an optimal Markovian policy. Especially when rx, c, and f are independent of u and 1'(x, u) = R(x)' u, where R(x) is an n x k matrix, an optimal Markovian policy a is obtained by a measurable selection of maximum points of (grad. V(t)<p, R(x)' u). Since the supremum of a linear form is attained at the boundary or, one can suppose that O(t, x) belongs to or. This is called bang-bang control. (2) The case of partial observation. One useful method is the separation principle. This means that the control problem can be split into two parts. The first is the mean square estimate for the system using a tfiltering. The second is a stochastic optimal control with complete observation. But generally speaking, the problem of deciding under what conditions the separation principle is valid is difficult. In the case of the following linear regulator the separation principle holds. Suppose that the system process X, and the observation process Y, obey the following SDEs: dX,=A(t)dW,+(B(t)X,+b(t, V(t, Y))dt, dY, = dl¥. + H(t)X,dt, where A(t), B(t), and H(t) are nonrandom matrix-valued functions and I¥. is aj- dimensional Brownian motion independent of W,. The problem is to search for a feedback control which gives the maximum value. Sup- pose that a feedback control V(t,) is Lip- schitz continuous in  E ClEO, 00). Put Q(s,x) = sup Esx lf T L(t, X" V(t, Y))dt+ <1>(X,) ] , V,Llp '\ where (X" Y,) is the unique solution for V with the initial condition Xs = x, Y, = O. By the Lip- schitz condition of V, 0", = 0"( Y,; S  t) is inde- pendent of V, and the tconditional expectation X, = E(X,/O",) is governed by the following SDE, by way of the tKalman-Bucy filter: dX,= P(t)H'(t)dW,* +(B(t)X,+ b(t, O,))dt with some O",-progressible measurable control 0, and an n-dimensional Brownian motion w,* adapted to 0",. Moreover, P(t) is the terror matrix satisfying the tRiccati equation, and H' is the transpose of H. Let g(t, x) be the probability density of the normal distribution N(O, P(t)), and put L(t, X, u)= S L(t, x, u) g(t, x -x)dx and '¥(x) = S'P(x)g(T, x- x)dx; then the problem turns into Q(s, x) = sp E,x If L(t, X" O,)dt + '¥(X,)]. Recalling the SDE for X" we can use the Bell- man equation for choosing an optimal one. 1530 D. Stochastic Maximum Principle A stochastic version of tPontryagin's maxi- mum principle gives a necessary condition for optimality. This means that the instantaneous value of optimal control maximizes the sto- chastic analog of Pontryagin's Hamiltonian. Suppose that the system evolves according to an n-dimensional CSDE dX,=rx(X,)dW,+y(X" V,)dt. The problem is to seek conditions on admis- sible control V, such that Ex [Jlf(X" V,)dt] is maximized, where T is a constant time. Assume that rx, 1', and f are bounded and smooth. Define a Hamiltonian H on Rn x r x Rn by H(x, u, 'P) = 1'(x, u). 'P + f(x, u). Let 0, be op- timal and X, its response starting at x. Then under some conditions there exist .  0 and :F,- progressively measurable q'.k = (qt.k 1 ' q'.k" "" q,.k) (k= 1, ...,n) and 'P,=('Pu... 'P,.n) which satisfy the SDE ( 01' - - ,of - - ) d'P'.k = - -;;-(X" V,), 'P,+ A(X" V,) dt UX k oX k +q,.kdW" k=I,...,n, and H(X" 0" 'P,) = max UEr H(X" u, 'P,) a.e. E. Optimal Stopping and Impulse Control Suppose that X, is an n-dimensional diffusion whose generator A is an elliptic differential operator. Let, be a tstopping time. The op- timal stopping problem is to seek a stop- ping time i so that Ex[g(X,] is maximized, where 9 is nonnegative and continuous. i is called optimal. The value function V(x)= sup, Ex[g(X,)] is characterized as the least texcessive majorant of g. Moreover, under some conditions V belongs to the domain of A and is the unique solution of the tfree bound- ary problem; Vg, A VO, and (V -g)' A V=O. Therefore, in the Hilbert space framework, the value function is related to the variational inequality. An optimal stopping tIme is pro- vided by the hitting time for the set {x I V (x) = g(x)}. Impulse control is a variant of the optimal stopping problem. At some moment ( = stop- ping time) a controller shifts the current state to some other state. But not all shifts are al- lowed: State x can be shifted to a state of x + [0, 00)". Let 'b k = 1, 2, be a sequence of in- creasing stopping times and k be a [0, 00)"- valued O"'k(X)-measurable random variable. The sequence V = {", h '2, 2""} is called an impulse control. V transfers the process X, to y,U=Xo+ f ' rx(Y,U)dw,+ f ' 1'(y,u)ds+ L i' o 0 TIt 
1531 if 1 a 2 a A =- Io:o:*(x)ij-----;;--+ I y;(x)-, 2 i.j aXjOX j i aX i and the problem is to seek U so as to maximize Ex[Joe-MfO-;U)dt- ;;":1 e-AtkK(k)J, where }, (> 0) is constant and the function K C;:' 0) stands for the cost of shifting. The value func- tion is related to a quasi variational inequality. F. General Description of Stochastic Filtering The problem of estimating the original signal from data disturbed by noises is called a sto- chastic filtering problem. Let X" t E [0, TJ, be a continuous stochastic process with values in an, called a signal (or system) process. It is transformed (or coded) to h(t, X,), where h(t, x) is an m-vector-valued continuous function. Suppose that it is disturbed by a noise W, and we observe Y, = h(t, X,) + w,. Usually W, is assumed to be a twhite noise independent of X,. Since the white noise is a generalized func- tion, the integral of Y" i.e., Y, = I h(s, X,)ds+ 11';, is called an observation process, where 11'; is an +m-dimensional Brownian motion independent of X,. It is assumed for convenience that I X,I 2 and g Ih(s, X,)1 2 ds are integrable. Assume that X, is a I-dimensional signal process. The least square estimation of X, by nonlinear functions of observed data }'" s:;:;; t, is called a nonlinear filter of X, and is denoted by X,. Let  or 0'(1';;; s:;:;; t) be the least ta-field for which 1';;, s:;:;; t, are measurable. Then the filter X, is equal to an t-measurable random variable such that EI X, - X,1 2 :;:;; EI X, - 21 2 holds for any -measurable L 2 random vari- able 2. Hence it coincidcs with thc tcon- ditional expectation E[ X, I]' Now let H, be the closed linear space spanned by}'" s:;:;; t. The least square estimation of X, by elements of H" i.e., the orthogonal projection of X, onto H" is called the linear filter of X, and is de- noted by X,. Obviously, the mean square error of a nonlinear filter is less than or equal to that of a linear filter, but a linear filter is calculated more easily. If (X" Y,) is a tGaussian process, both filters coincide. When X, is an n-dimensional process (Xi, ..., X;), the n-vector process X, = (Xi, ..., X,n) (or X, = (Xi, ..., X;)) is called the nonlinear (or linear) filter of X,. G. Kalman-Bucy Filter Suppose that the signal process X, is governed by a tlinear stochastic differential equation 405H Stochastic Control and Stochastic Filtering (LSD E) X,=Xo+ IA(S)X,ds+ IB(S)dW:, where A(s) (or B(s)) is an n x n (or n x r) matrix-valued continuous function, I¥, is an r- dimensional Brownian motion independent of the noise 11';, and the initial data Xo is a Gauss- ian random variable independent of 11'; and I¥,. Suppose further that h(t, x) is linear, i.e., h(t, x) = H(t)x, where H(t) is an m x n-matrix- valued function. Then the joint process (X" Y,) is Gaussian. Hence the nonlinear filter X, coincides with the linear filter and satisfies X, '" E[XoJ + I (A(s) - P(s)H(s)' H(s))X,ds + I P(s)H(s)' d1';;, where H(s)' is the transpose of H(s), and P(t) = (Pij(t)) is the error matrix defined by Pi)t) = E(X; - X;) (XI- XI). It satisfies the matrix Riccati equation dP(t) - = A(t)P(t) + P(t)A(t)' dt - P(t)H(t)' H(t)P(t) + B(t)B(t)', P(O) = covariance of Xo. Let <I>(t, s) be the tfundamental solution of the linear differential equation dxldt = (A(t)- P(t)H(t)' H(t))x. Then the solution X, is repre- sented by X, = <I>(t, O)E[XoJ + I <I>(t, s)P(s)H(s)' d1';;. This algorithm is called the Kalman-Bucy filter [1]. Analogous results for discrete-time models have been obtained by Kalman. H. Nonlinear Filter In the study of nonlinear filters, the tcon- ditional distribution n(dx) = P(X,Edxl F,) is considered besides X,. Suppose that X, is governed by the SDE X,=Xo+ r a(s,X,)ds+ r b(s,X,)dlT:;, where a(s, x) (or b(s, x)) is an n-vector (n x r- matrix) valued Lipschitz continuous function. Then n,(f) = Sf(x)n,(dx) satisfies the SDE n,(f)=E[f(Xo)J + I n,(Lf)ds + t (n,(h,f) -n,(h,)n,( f))(d1';;- n,(hJd.l'), 
405 I Stochastic Control and Stochastic Filtering where hs(x) = h(s, x) and ". af Lf(x) = 1,. a'(s, x) ax' , 1 ( ) a2 f +-2: 2: bids, x)bjds, x) _ a ' a j' 2 i.j k X X Under additional conditions on a(s, x) and b(s, x), n,(dx) has a density function n,(x), and it satisfies n,(x)=no(x)+ r L*ns(x)ds+ r ns(x) (hs(x) - f ns(X)hs(X)dX)(dYs-(f ns(x)hs(x)dx )dS). where L * is the formal adjoint of L. The process I, == Y, - S& ns(hs) ds is a Brown- ian motion such that a(ls; s,;;;; t) c:F, holds for any t. If a(ls; s,;;;; t) = iF, holds for alI t, I, is calIed the innovation of Y,. The innovation property is not valid in general. A sufficient condition is that (X" Y,) is a Gaussian process or h(t,x) be a bounded function. However, in any case, t:F,-adapted martingales are always represented as tstochastic integrals of the form L71 S&!s'(w) dI, where the is' are 31';-adapted processes. I. Bayes Formula Let C (or D) be the space of alI continuous mappings x (or y) from [0, T] into Rn (or Rm) equipped with the uniform topology. x, (y,) is the value of x (y) at time t. Let iJ6',(C) be the least a-field of C for which x" s,;;;; t, are mea- surable. iJ6',(D) is defined similarly. We denote by <l>x, <l>w, <l>x. y the tlaws of processes X" If; , and (X" Y,), respectively. These are defined on iJ6'T(C)' iJ6'T(D), and iJ6'T(C)@iJ6'T(D), respectively. Then <l>x, y is equivalent (mutualIy tabsolutely continuous) to the product measure <l>x@<I>w on each iJl,(C) @iJ6',(D). The tRadon-Nikodym density a, of <l>x. y with respect to <l>x @ <l>w is written as a,(x,y)=exp{ r h'(s,xsJdy 1" i ' i 2 } --1,. h (s,x s ) ds , 2, 0 where hi and y' are the corresponding com- ponents of vectors and dy denotes the tIto integral. The conditional distribution n,(dx) is com- puted by the Bayes formula: (f) = p,(f) n, p,(1) , Pt(f) = f f(x,)a,(x, Y)<I>x(dx), where Y = (Y,; 0,;;;; t';;;; T). Moreover, p,(f) satis- 1532 fies the LSDE p,(f) = Po (f) + f>s(Lf)ds+ f>s(hs)Ps(f)dI s . The density p,(x) (if x exists) satisfies p,(x)=Po(x)+ r L*Ps(x)ds + r Ps(X)(f Ps(y)hs(y)dy )ells. If h(t, x) is a smooth function, then a,(x, y) is continuous in y, so that n,(f) or p,(f) is a continuous functional of the observed data (Y,;s';;;;t). Thus the filter n, is a trobust statistic. Remarks. (i) the signal and noise are not independent if the signal is controlled based on the observed data. In these cases, correc- tion terms are sometimes needed for the SDE of the nonlinear filter. (ii) If the tsa mple paths of the signal process are not continuous, a similar SDE for a nonlinear filter liS valid with L being replaced by some integrodifferential operator. If it is a tMarkov chain with finite state, L is the generator of the chain. (iii) Sev- eral results are known for the case where the noise If; is not a Brownian motion but a tPoisson process. References [1] W. F. Fleming and R. W. Rishel, Deter- ministic and stochastic optimal control, Springer, 1975. [2] R. BelIman, Dynamic programming, Princeton Univ. Press, 1957. [3] N. V. Krylov, ControlIed diffusion pro- cesses, Springer, 1980. (Original in Russian, 1977.) [4] M. Nisio, Stochastic control theory, Indian Statist. Inst. Lect. Notes, MacmilIan, 1981. [5] H. J. Kushner, On the stochastic maxi- mum principle, fixed time of control, J. Math. Anal. Appl., 11 (1965),78-92. [6] A. N. Shiryaev, Optimal stopping rules, Springer, 1980. (Original in Russian, 1976.) [7] A. Bensoussan and J. L. Lions, Sur la theorie du controle optimal. I, Temps d'arret; II, Controle impulsionnel, Hermann, 1977. [8] R. S. Bucy and R. E. Kalman, New results in linear filtering and prediction theory, J. Basic Eng. ASME, (D) 83 (1961), 95-108. [9] R. S. Bucy and D. D. Joseph, Filtering for stochastic processes with applications to guidance, Interscience, 1968. [10] M. Fujisaki, G. KalIianpur, and H. Kunita, Stochastic differential equations for the nonlinear filtering problem, Osaka J. Math., 9 (1972), 19-40. 
1533 [llJ R. S. Liptzer and A. N. Shiryaev, Statis- tics of stochastic processes. I, General theory; II, Applications, Springer, 1977, 1978. (Original in Russian, 1974.) [12J G. Kallianpur, Stochastic filtering theory, Springer, 1980. 406 (XVI1.14) Stochastic Differential Equations A. Introduction Stochastic differential equations were rigor- ously formulated by K. Ito [7J in 1942 to construct diffusion processes corresponding to Kolmogorov's differential equations. For this purpose he introduced the notion of stochastic integrals, and thus a differential-integral cal- culus for sample paths of stochastic processes was established. This theory, often called Ito's stochastic analysis or stochastic calculus, has brought an epoch-making method to the theory of stochastic processes. It provides us with a fundamental tool for describing and analyzing diffusion processes that we can apply effec- tively to limit theorems and to the probabil- istic study of problems in analysis. It also plays an important role in the statistical the- ory of stochastic processes, such as tstochastic control or tstochastic filtering. Stochastic dif- ferential equations on manifolds provide a probabilistic method for differential geome- try, sometimes called stochastic differential geometry. Recently, many interesting examples of infinite-dimensional stochastic differential equations have been introduced to describe probabilistic models in physics, biology, etc. A unified theory of stochastic calculus has been developed in the framework of Doob's martingale theory and this, combined with Stroock and Varadhan's idea of martingale problems, provides an important method in the theory of stochastic processes ( 262 Martingales). B. Stochastic Integrals As is well known, almost all sample paths of a Wiener process are continuous but nowhere differentiable ( 45 Brownian Motion), and hence integrals with respect to these functions cannot be defined as the usual Stieltjes inte- grals. But these integrals can be defined by making use of the stochastic nature of Brown- ian motion. Wiener defined them (the Wiener integrals) for nonrandom integrands, but Ito 406 B Stochastic Differential Equations defined them for a large class of random inte- grands. Ito's integrals have been extended in the martingale framework by H. Kunita and S. Watanabe, and by others [14, 19J, as shown below. Let (0, ff, P) be a probability space, and let F = {g;;},;.o be an increasing family of (J- subfields of ff. Usually we assume that {g;;} is . h . . u;: n m;- w: ng t contInuous, I.e., t+o: = ,>o'?,+,=, for every t  O. Denote by A = A(F) the total- ity of all continuous square-integrable martin- gales X = (X,) relative to {g;;}; to be precise, X is an {g;;}-martingale such that, with proba- bility 1, Xo=O, tX, is continuous and E(X) < CIJ for every tO. We introduce the metric IIX - YII = k1 Tnmin(l, IIX n - Y,,112) on A where II 11 2 stands for the L 2 (0, P)-norm. We always identify two stochastic processes X = (X,) and Y=(Y,) if sample functions tX, and t Y, coincide with probability 1. Then, by virtue of Doob's inequality Ilmaxo"s",1 X s - Y,,\ 11 2 21IX,- Y,112 ( 262 Martingales), A becomes a complete metric vector space. Next, by an integrable increasing process we mean a process A =(A,) with the following properties: (i) A is adapted to {g;;}, i.e., A, is g;;-measurable for every t  0; (ii) with proba- bility 1, Ao=O, tA, is continuous and nonde- creasing; (iii) A, (O) is integrable for every tO, i.e., E(At)< CIJ. We denote by d=d(F) the totality of integrable increasing processes. We call a process V = O) an integrahle process of bounded variation if V is expressed as V, = Ai - A with A I, A 2 E ,w'. The totality of inte- grable processes of bounded variation is de- noted by 1/ = r(F). It follows from the tDoob- Meyer decomposition theorem that, for every M, N EA, there exists a unique VE1/ such that M,N, - V, is an {g;;}-martingale. We denote this Vas <M,N). In particular, <M,M)Ed, and it is denoted simply by <M). <M, N) is called the quadratic variation process because ?I(Mti- Mti_,)(Nti-Nti_l)<M,N), in probability as 1!1IO, where !1:t O =O<t l <... < t n = t is a partition and 1!11 = maxI "i"n I t i - ti-,I. Brownian motion is the most important example of continuous square-integrable mar- tingales, and this is characterized in our frame- work as follows. Suppose that ad-dimensional continuous {g;;}-adapted process X = (Xi) satisfies Mi=Xi-xJEA and <M,Mj),= (ji j t , i,j= 1, 2, ...,d. Then X is ad-dimensional Brownian motion such that Xu-Xv, and the g;; are independent for every u  v  t. Such a Brownian motion is called an {g;;}-Brownian motion, and a system of martingales MiEA having this property is often called a system of {g;;}- Wiener martingales. Now, we fix M EJI. We denote by !i'2(M) the totality of real, {3";}-adapted, and measur- able processes <1> = (<1> (t)) such that 11<1>IIM= 
406 B Stochastic Differential Equations E[f<I>(S)2d<M)J < 00 for every t;?oO. Two <1>" <l>2E2'2(M) are identified if 11<1>1 -<I> 2 11t.M= ° for all t;?oO. Since II II,.M is an Lz-norm on [0, tJ x Q with respect to the measure J1M(ds, dw)= d<M),(w)P(dw), it is easy to see that 2'2(M) is a complete metric vector space with the metric 1I<I>-<I>'IIM=;;;"12-nmin(!, 11<1> -<I>'lln,M)' <1>, <1>' E2'2(M). 1[<1> = (<I>(t)) is given, for a partition 0=t O <t 1 <... <tn...-->oo and ,,-measurable bounded functions f., i = 0, 1,..., by <I>(t, w)= { fo(w), .f.-dw), ti-,<tti, i=I,2,..., t=o, then <l>E2'2(M) and the totality 2'0 of such processes are dense in 2'2 (M). I[ <I> E 2'0' we define 1 M (<I»= (lM(<I>)(t)),;> 0 by n-l IM(<I>)(t)= I f.(M,;" -M,,J + fn(M,-M,J, ;=0 tnttn+1' Then I M (<I»E.-4I, and it holds also that <IM(<I», IN(I.JI) = J <I>(s)l.JI(s)d<M, N), for M, N E,;/f and <1>, I.JIE2'o' In particular, IIIM(<I>)(t)II= E[ <IM(<I»),J = II<I>IIM' and hence III M (<I»II = II<I>IIM' This implies that <l>E2'0 c2'2 (M)--> 1 M (<I»E,-4I is an isometric linear mapping, and hence it can be extended to 2'2(M) uniquely, preserving the isometric property. IM(<I»E ,;If is called the stochastic integral of <I> E 2'2(M) by ME vII. 1 M (<I»(t) is often denoted by J<I>(s)dM" and the random variable IM(<I»(t) obtained by fixing t is also called a stochastic integral. The definition of stochastic integrals can be extended further by the following localization method. For an {,9;}-tprogressively measur- able process X =(X,) and {.}-stopping time a, the stopped process Xu = (X;) is defined by X = X'M (t /\ a = min(t, a)). It follows from the toptional sampling theorem of Doob that Xu E,4f if X uti. For <I> =(<I>(t))E2'2(M) and an {'?;}-stopping time a, <l>u= (<I>At)) defined by <l>a(t)=/[,u}<I>(t) also belongs to 2'2(M) and it holds that 1 M (<I>u) = [1M (<I»Ju. Keeping these facts in mind, we give the following definition. Let ';/floc = { M = (M,) I there exists a seq uence of {%,}-stopping times an such that a n < 00, an joo as n-->CIJ a.s. and MU"E.I{ for every n = 1,2, ... }. ,W'loc and ,//'IOC are defined in a similar way. For M, N E .If loc , < M, N) E 1' 10c is defined to be the unique process in 1' loc such that <M u ", N U ") = <M, N)a" for a sequence of stopping times {an} as above, which can be chosen common to M and N. <M, M) is de- noted by <M) as before. We fix M E,-4I loc and set 2'iO C (M)= {<I> = (<I>(t)) I a real, {,'F,}-adapted and measurable process such that, with proba- bility one, J <I> (S)2 d< M), < 00 for every t;?o O}. 1534 For <l>E2'dO C (M), we can choose a sequence of {:F,}-stopping times {an} such that. an < 00, anjCJJ a.s. and M U "E,-4I, <l>a"E2'2(M u "), n= 1, 2, .... For example, set an = min [n, inf{ t I <M), + J <I>(S)2 d<M),;?o n}J. Then there exists an IM(<I»E,/{tIOC such that IM(<I>)""=IM""(<I>a) for n = 1, 2,..., which is unique and indepen- dent of a particular choice of {an}' IM(<I» is called the stochastic integral of <I> E 2'iO C ( M) by M EJtf'oc. IM(<I»(t) is often denoted by J <I>(s)dM" and the random variable 1 M (<I>)(t) obtained by fixing t is also called a stochastic integral. Some of the basic properties of stochas- tic integrals are: (i) I[ M EJtf 10c , <l>E2'dO C (M), and I.JIE2'do c (lM(<I>)), then <l>I.JIE2'd DC (M) and 1 M (<I>I.JI) = IIM(<I»(I.JI). (ii) I[ ME ,4/ loc and <1>, I.JI E 2'dO C ( M), then for every CJ., 13 E R we have CJ.<I> + IJI.JI = (CJ.<I>(t) + fJl.JI(t)) E2'dO C (M) and IM(CJ.<I>+lil.Jl) = CJ.I M (<1» + 131M (I.JI). Also if M, N E -41 loc and <I> E 2'dO C (M) n 2'dO C (N), then for CJ., 13 E R we have <l>E2'iO C (CJ.M +fJN) and rM+PN(<I» == CJ.IM(<I» + fJI N (<1». (iii) I[ M, N E ,1{IO", <l>E 2'dO'(M), and I.JIE2'dO C (N), then <l>I.JIE2'!oC«M,N») and <1M (<1», IN(I.JI), = J <l>s l.JI,d< M, N),. Here, 2'OC( V) for V E 1' 1oc is defined as follows: With probability 1, S--> V, is of bounded varia- tion on every finite interval [0, tJ, the total variation of which is denoted by IVI,. Then I V IE ,W'IOC, and we define 2'OC( V) (p;?o 1) to be the totality of real, {:F,}-adapted, and measurable processes <I> = (<I>(t)) mh that, with probability 1 , J 1<I>(s)IP dl VI, < 00 for every t;?o 0. In particular, 2'd"C( M) = 2'iO C ( < M»). (iv) I[ M E,-4I lo \ <l>E2'iO C (M), and a is an {:F,}- stopping time, then IM(<I>u)=IM"(q;')=IM"(<I>u) = [lM(<I>)]u. (v) I[ <I>(t) = I[u<'}' f, where a is an {:F,}-stopping time and f is a bounded ,>measurable random variable, then <l>E 2'iO C (M) for every M E,-4I IOC and 1 M (<I»(t) = f(M, - M'M)' (vi) The definition of stochastic integrals is independent of the increasing family of a-subfields in the following sense: I[ {,} is another family such that M belongs to the class Jtfloc for both {:F,} and {,} and <I> belongs to the class 2'2(M) for both {,} and {,}, then IM(<I» is the same whether it is de- fined with respect to {3-';} or {,}. In particular, N = 1 M (<1», M E,-4I!O\ <l>E Y'dOC(M), satisfies <N, L),= J<I>(s)d<M, L), for all LE,-4l 10c . Conversely, NE.lt loc having this property is unique, and hence it coin- cides with IM(<I». 1 M (<I»E,4f if and only if J <I>(S)2 d <M),Ed. The above definition of stochastic integrals can be extended with a slight technical modifi- cation to the case when M, is not necessarily continuous [19]. Among such gen'ral stochas- tic integrals, a particularly important role is played by stochastic integrals describing point 
1535 processes, including Poisson point processes as an important special case. These stochastic integrals are important in the study of discon- tinuous processes including tLevy processes; even in the study of continuous processes, such as diffusion, they provide an important tool for the treatment of excursions [4,6,8]. Let (a, Y, P) and {} be as above. By a continuous semimartingale with respect to {Jf;}' or simply a semimartingale when there is no danger of confusion, we mean a process X =(X(t)) of the following form: X(t)= X(O)+ M(t)+ V(t), where X(O) is an 'o-measurable random variable, M =(M(t))E,t'floc and V= (V(t))E ,//"Ioe. M and V are uniquely deter- mined from X, and this decomposition is called the semimartingale decomposition of X. M is called the martingale part and V the drift part of the semimartingale X. A semimartin- gale X is often called an Ito process if M (t) = S <I>(s)dB(s) and V(t) = S6 '¥(s)ds, where B(t)E .it is an {.9';}-Brownian motion, <l>E2iO\ and \II E 2';oe. (When V, = t, 2'OC( V) is denoted simply by 2'oe.) A d-dimensional process whose components are semi martingales is called ad-dimensional semimartingale. The following formula, originally due to Ito and extended by Kunita and Watanabe, is of fun- damental importance in stochastic calculus. Ito's formula. Let X(t) = (X 1 (t), ..., Xd(t)) be ad-dimensional semimartingale and Xi(t)= Xi(O)+ Mi(t) + Vi(t) be the semimartingale decomposition of components. Let F(x) = F(x 1 , ..., x d ) be a C 2 -function defined on Rd. Then F(X(t)) is also a semimartingale, and we have d I ' F(X(t))=F(X(O))+i 0 DiF(X(s))dM'(s) d I ' + i 0 DiF(X(s))dvi(s) 1 d I ' +2i.1 0 DiDjF(X(s))d<M i , Mj)(s) (where D i = a/ax i). In others words, if Y(t)= Y(O)+ M(t)+ V(t) is the semimartingale decomposition of Y(t) = F(X(t)), then M(t)=1, SDiF(X(s))dM'(s) and V(t)= i', SDiF(X(s))dVi(S)+ 1/2L1 SDiDJ(X(s))d<M', W)(s). We now discuss other important transfor- mations on semi martingales. Time change. Let A E ,9110", and assume fur- ther that with probability I, t -> A, is strictly increasing and lim,!a: A, = 00. Let u->C u be the inverse function of t->A" i.e., C u = min {t I A,u}. Then for every uO, C u is an {,}- stopping time. Set g;, = :Fe" t  O. For an {}- tprogressively measurable process X = (X(t)), 406C Stochastic Differential Equations we define X A =(XA(t)) by XA(t)= X(C,) and call it the time change of X determined by A. Then X A is progressively measurable with respect to {g;,}. If X:X(t)=X(O)+M(t)+ V(t) is a semimartingale with respect to {}, then X A is a semimartingale with respect to {,}, and its semimartingale decomposition is given by XA(t)=X(O)+ MA(t)+ VA(t). The map- pings M -> M A and V -> V A are bijections between .Ioe and .ijIoe and between "floc and 11"loc, respectively, where ,Ioe and 11"l o c are defined relative to {,}. Furthermore, <MA,NA)=<M,N)A for every M, NEU#loc. Noting that <l>E2'iO C (M) can always be chosen {,}-progressively measurable (in fact {,}- tpredictable), the mapping <I>-><I>A defines a bijection between 2'io e (M) and 2'iO C (M A ), and we have JMA(<I>A) = [1M (<I>)]A. Transformation of drift (Girsanov transfor- mation). For mE,I0c, set Dm(t)=exp[m,- -!<m),]. Then Dm-1 Eo/ltl0c, and ifm satisfies a certain integrability condition (for example, E(exp[-!<m),J) < 00 for every t O, in partic- ular, <m), ,;;; ct for all t for some constant c> 0), then Dm is a martingale, i.e., E(Dm(t)) = I for all tO. If E(Dm(t)) = I for all t, then there exists a probability? on (a, 3-") (if (a, Y) is a nice measurable space and , = V,;;, 0 Y" which we can assume without loss of generality) such that P(A)= E(Dm(t): A) for al' AE, tO. Let X be a semi martingale with the decomposition X(t)=X(O)+M(t)+ V(t). On the probability space (a, ,,?) with the same family {,}, X is still a semimartingale but its semimartingale decomposition is given by X (t) = X (0) + M (t) + V(t), where M(t)=M(t)-<M,m)(t) and V(t)= V(t)+<M,m)(t). Furthermore, it holds that <M,N)=<M,N), M, NEU#loc. This result is known as Girsanov's theorem. The transfor- mation of probability spaces given above is called a transformation of drift or a Girsanov transformation since it produces a change as shown above of the drift part in the semi mar- tingale decomposition. In the discussion above, the increasing family {} was fixed. It is also important to study how the semimartingale character changes under a changing increasing family [12]. C. Stochastic Differentials In this section, we introduce stochastic dif- ferentials of semi martingales and rewrite the results in the previous section in more conve- nient form. Let (a, ,, P) and {} be as above and U#, ,9/, 1/, U#loc, d 'uc , yloC be defined as in Section B. By !!l. we denote the totality of continuous semimartingales relative to {Jf;}. 
4060 Stochastic Differential Equations For XE!ii, let X(t)=X(O)+Mx(t)+ Vx(t) be the semi martingale decomposition. We write formalIy X (t) - X (0) = S dX (s) and calI dX (denoted also by dX, or dX(t)) the stochastic differential of X. To be precise, dX can be considered as a random interval function dX(/) = X(t) - X(s), 1= (s, t] or the equivalence class containing X under the equivalence relation X  Y on 12 defined by X  Y if and only if X(t)-X(O)= Y(t)- Y(O), tO. For X, YE12 and IX, f3ER, IXdX +f3dY is defined by d(IXX + fiY) and dX' dY by d<M x , My). Let d12 be the totality of stochastic differentials of elements in 12 and dA and d1/" be that of elements in Aloe and 1/"IOC, respectively. d12 is a commutative algebra under the operations just introduced. Note that dX' dY Ed1/" and that dX' dY =0 if either of dX and dY is in d1/". In particular, dX . d y. dZ = 0 for every dX, dY, and dZ. Let :?J be the totality of {.}- progressively measurable processes <I> = (<I> (t)) 'such that, with probability 1, suPo<;s<;,I<I>(s)\ < CfJ for every t  O. Noting that :?J c 2'OC(M) for any ME dlt lO \ we define <1>. dX E d12 for <1> E:?J and X E 12 to be the stochastic differ- ential of the semimartingale S <I>(s) dM x(s) + S <I>(s) dVx(s). <1>. dX is uniquely determined by <1> and dX. Ito's formula is stated, in this context, as follows: For X=(Xt, ...,X d ), XiE 12, and F:Rd->R, which is of class C 2 , F(X)E 12, and d dF(X)= I: DiF(X)'dX i i'=l 1 d .. +- I: D;DJF(X)' dX'. dXJ. 2 i.J1 We now define another important operation on the space d12. Noting that 12 c f?4, we define X odY for X, YE12 by . 1 XodY=X.dY +-dX.dY. 2 This is uniquely determined from X and dY, and is called the symmetric multiplication of X and dY. It is also called a stochastic differential of the Stratonovich type or Ito's circle operation since the notation was introduced by Ito [9]. S X 0 d Y is calIed the stochastic integral of the Stratonovich type, whereas S X . dY is that of the Ito type. Under this operation, Ito's for- mula is rewritten as follows: For X = (X t, ... , X d ), X i E12, and F: Rd->R, which is of class C 3 , F(X), D i F(X)E12, and d dF(X) = I: DiF(X)odX i . i=l This chain rule for stochastic differentials takes the same form as in the ordinary calculus. For this reason symmetric multiplication plays an 1536 important role in transferring notions used in ordinary calculus into stochastic calculus and in defining intrinsic (i.e., coordinate-free) no- tions probabilistically. In particular, it is fun- damental to the study of stochastic differential equations on manifolds ( Section G). D. Stochastic Differential Equations Here, we give a general formulation of stochas- tic differential equations in which the infini- tesimal change of the system may depend on the past history of the system; however, equa- tions of Markovian type, in which the infini- tesimal change of the system depends only on the present state of the system, are consid- ered in most cases. Let W d be the space of d- dimensional continuous paths: W d = C([O, 00) -> R d ):= the totality of all continuous functions w: [0, oo)->R d , endowed with the topology of the uniform convergence on finite intervals and :?J(W d ) be the topological a-field. For each tO, define p,: W d -> W d by (p,w)(s)=w(t/\s), and let :?J,(W d )=p,-I(:?J(W d )), tO. Let dd" be the totality offunctions lX(t, w) == (1X](t, w)): [0, 00) X W d -> R d @ R' (:= the totality of d x r real matrices) such that each component IX](X, w) (i = 1,2, ... , d;j = 1,2, ..., r) is :?J([O, 00)) x :?J(Wd)-measurable and :?J,(Wd)-measurable for each fixed t  O. In general, 1X;(t, w) is called nonanticipative if it satisfies the second property above. An important case of IXEd"" is when it is given as lX(t, w)=a(t, w(t)) by a Borel func- tion a: [0, CfJ) X R d ..... R d @ R'. In this case, IX is called independent of the past history or of Markovian type. For a given IXEd d .' and fiE dd. 1, we consider the following stochastic differential equation: , (1) dXi(t) = I: IXJ(t, X)dBJ(t) + fii(t, X)dt, Jl i= 1,2, ...,d, also denoted simply as dX (t) = lX(t, X)dB(t) + fi(t, X) dt. Here X(t)=(X 1 (t), "', Xd(t)) is ad-dimensional continuous process. B(t)=(B 1 (t),..., B'(t)) is a r-dimensional Brownian motion with B(O)=O. A precise formulation of equation (1) is as follows. X =(X(t)) is called a solution of equa- tion (1) if it satisfies the following conditions: (i) X is a d-dimensional, continuous, and {g;;}- adapted process defined on a probability space (Q, ff, P) with an increasing family {g;;}, i.e., X: Q-> W d which is g;;/:?J,(Wd)-measurable for every t 0; (ii) IXJ(t, X)E 2'1°", fii(t, X)E 2':oc, i = 1, ..., d,j = 1, "', r (-->. Section B for the definition of 2'OC); (iii) there exists an r- 
1537 dimensional {g;;}-Brownian motion B(t) with B(O) = 0 such that the equality Xi(t)-Xi(O)= Jl I Cl.J(s, X)dBj(s) + f>i(S,X)dS, i=I,2,...,d, holds with probability 1. Thus a solution X is always accompanied by a Brownian motion B. To emphasize this, we often call X a solution with the Brownian motion B or call the pair (X, B) itself a solution of (1). In the above definition, a solution is given with reference to an increasing family {}. The essential point is that a-fields a(B(u) - B(v); u;;:' v;;:' t) and a(X(s), B(s);Os t) are independent for every t: If X satisfies the con- ditions of solutions stated above, then the specified independence is obvious, and con- versely, if this independence is satisfied, then by setting g;; = n,>o 0'( X (s), B(s); 0  s  t + r.), the conditions of solutions stated above are satis- fied. But it is usually convenient to introduce some increasing family {} into the definition of solutions as above. When CI. and 13 are of the Markovian type, CI.(t, w)=a(t, w(t)),fJ(t, w)= b(t, w(t)), the corresponding equation (2) dX(t) = a(t, X(t))dB(t) + b(t, X(t))dt is called a stochastic differential equation of Markovian type. Furthermore, if a(t, x) and b(t, x) are independent of t, i.e., a(t, x) = a (x) and b(t,x)=b(x), the equation (3) dX(t)= a(X(t))dB(t) + b(X(t))dt is called a stochastic differential equation of time homogeneous (or time-independent) Mar- kovian type. Next, we define the notions of the unique- ness of solutions. There are two kinds of uniqueness: uniqueness in the sense oflaw (in distribution) and path wise uniqueness. When we consider the stochastic differential equa- tions as a means to determine the laws of continuous stochastic processes, uniqueness in the sense of law is sufficient. If, on the other hand, we regard the stochastic differential equation as a means to define the sample paths of solutions as a functional of the ac- companying Brownian motion, i.e., if we re- gard the equation as a machine that pro- duces a solution as an output when we input a Brownian motion, the notion of pathwise uniqueness is more natural and more impor- tant. As we shall see, this notion is closely related to the notion of strong solutions. These notions are defined as follows. For a solution X =(X(t)) of 0), X(O) is caned the initial value, its law on R d is called the initial 406D Stochastic Differential Equations law (distrihution), and the law of X on W d is called the law (distribution) of X. We say that the uniqueness in the sense of law of solutions for (1) holds if the law of any solution X is uniquely determined by its initial law, i.e., if whenever X and X' are two solutions whose initial laws coincide, then the laws of X and X' coincide. In this definition, we restrict ourselves to the solutions whose initial values are nonrandom, i.e., the initial laws are 15- distributions at some points in Rd. Next, we say that the pathwise uniqueness of solutions for (1) holds if whenever X and X' are any two solutions defined on the same probabil- ity space (Q, .'F, P) with the same increasing family {g;;} and the same r-dimensional {g;;}- Brownian motion such that X (0) = X'(O) a.s., then X(t)=X'(t) for all t;;:'O a.s. In this defi- nition also, the solutions can be restricted to those having nonrandom initial values. We say that equation (1) has a unique strong solution if there exists a function F(x, w): R d x H-Z' -> W d (H-Z' = {WE WI w(O) = O}) such that tP- following are true: (i) For any solution (X, . I of(1), X = F(X(O), B) holds a.s.; (ii) for any Rd-valued random variable X(O) and an r- dimensional Brownian motion B=(B(t)) with B(O) = 0 which are mutually independent, X = F(X(O), B) is a solution of (1) with the Brown- ian motion B and the initial value X(O). If this is the case, F(x, w) itself is a solution of (1) with the initial value x, and with respect to the canonical Brownian motion B(t, w) = w(t) on the r-dimensional Wiener space (WO",g;, P), g; is the completion of (WO") with respect to the r-dimensional Wiener measure P. If equation (I) has a unique strong solution, then it is clear that pathwise uniqueness holds. Conversely, if pathwise uniqueness holds for (1) and if a solution exists for any given initial law, then equation (1) has a unique strong solution, [6,25]. The existence of solutions was discussed by A. V. Skorokhod [20]. If the coefficients CI. and 13 are bounded and continuous on [0, 00) X W d , a solution of (I) exists for any given initial law. This is shown as follows [6]. We first con- struct approximate solutions by Cauchy's polygonal method and then show that their probability laws are ttight. A limit process in the sense of probability law can be shown to be a solution. The assumption of bounded- ness above can be weakened, e.g., to the fol- lowing condition: For every T> 0, a constant K 1 > 0 exists such that (4) IICI.(t, w)11 + IlfJ(t, w)11  K I (1 + Ilwll t ), tE [0, T], WE W d . Here Ilwll,=maxo<,;,<,;,lw(s)l. In the case of the 
406 E Stochastic Differential Equations Markovian equation (2), it is sufficient to assume that a(t, x) and h(t, x) are continuous: (5) Ila(t, x)11 + Ilh(t, x)11  K T (1 + Ixl), tE[O, T], xER d . If these conditions are violated, a solution X(t) does not exist globally in general but exists up to a certain time e, called the explosion time, such that limqe Ix(t)1 = (X) if e < 00. To extend the notion of solutions in such cases, we have to replace the path space W d by the space W d that consists of all continuous func- tions w: [0, CX))-> ad ( = R d U {.1} = the one-point compactification) satisfying w(t) =.1 for every t;?e(w) (=inf{tlw(t)=.1}). Now, we list some results on the uniqueness of solutions. First consider the equations of the Markovian type (2), and assume that the coefficients are continuous and satisfy the condition (5). (i) If a, h are Lipschitz continu- ous, i.e., for every N > 0 there exists a constant KN such that Ila(t,x)-a(t,Y)11 + Ilb(t,x)- h(t,Y)11  KNlx - yl, tE[O, T], x, YE B N := [zERdllzl N}, then the pathwise uniqueness of solutions holds for equation (2). Thus the unique strong solution of (2) exists, and this is constructed directly by Picard's succes- sive approximation (Ita [7, 8J). (ii) If d = 1, a is Holder continuous with exponent 1/2 and h is Lipschitz continuous, i.e., for every N > 0, K N exists such that la(t,x)-a(t,yW + Ib(t,x)-b(t,Y)1  KNlx - yl, tE[O,N], x,YEB N , then the path wise uniqueness of solutions holds for equation (2) (T. Yamada and Watanabe [25]). (iii) If the matrix a(t, x) = a(t, x)a(t, x)* (i.e., a'j(t, x) = Lkl al(t, x)aI(t, x)) is strictly positive definite, then the unique- ness in the sense of law of the solution for (2) holds (D. W. Stroock and S R. S. Varadhan [21]). (iv) An example of stochastic differential equations for which the uniqueness in the sense of law holds but the pathwise uniqueness does not hold was given by H. Tanaka as follows: d= r= 1, b(t, x)=O and a(t, x) = I:x"o; - I:x<o:. Another example in the non- Markovian cases was given by B. S. Tsirel'son (see below). Next, consider non-Markovian equations of the following form: (6) dX(t)=dB(t)+[J(t,X)dt; i.e., the case d = rand o:(t, w)= I (identity ma- trix). Assume further that /iE.<7/ d . 1 is bounded. Then a solution of (6) exists for any given initial distribution, unique in the sense of law, and it can be constructed by the Girsanov transformation of Section B as follows. On a suitable probability space (Q, .¥, P) with an 1538 increasing family {,} such that :F = V,>o'' we set up an :Fo-measurable, d-dimensional random variable X(O) with a given law and a d-dimensional {,}-Brownian motion B = (B(1)) such that B(O)=O. Set X(t)=X(O) +B(t) and M(t)=exp[fb[J(s,X)dB(s)- Hbl[J(s, XWds]. Then M(t) is an {,}- martingale, and the probability P on (Q, ff) is determined by P(A)=E(M,;A), AE,. By Girsanov's theorem, B(t) = X(t) -X(O)- Jb[J(s, X)ds is ad-dimensional {,F,}-Brownian motion on (Q, ., P), and hence (X, B) is a solution of (6). Any solution is given in this way and hence the uniqueness in the sense of law holds. But the pathwise uniqueness does not hold in general; an example was given by Tsirel'son [1,6] as follows. Let {t n } be a sequence such that 0 <... < t n < t n - 1 < to = 1 and limnl'l t n = O. Set { O' t;?to and t=O, O ( W(t i + d - W(t i + 2 ) ) [J(t, w) = , t i + 1 - t i + 2 t E [t i +, , 1;), i = 0, 1, 2, ... , where 8(x)=x-[x], xER, is the decimal part of x. Time changes ( Section B) are also used to solve some stochastic differential equations [6]. E. Stochastic Differential Equations and Diffusion Processes In this section we consider equations of time- independent Markovian type (3) only. The time-dependent case can be reduced to the time-independent case by adding one more component X d + 1 (t) such that dX d +, (t) = dt. Further, we assume that coefficients a(x)E R d @ R' and h(X)ER d are continuous on R d and the uniqueness in the sense of law of solu- tions holds. Let P x , x E R d , be the law on W d , or on W d if there is an explosion, of a solution with the initial law b x (= the unit measure at x). Then {Px} possesses the tstrong, Markov property with respect to {,F,}, where :F, is a suitable completion of :?4,( W d ) or ,,,( W d ), and hence (W d , {,F,}, Px) or (W d , {,F,}, P,) is a diffu- sion process ton R d ( 115 Diffusion Pro- cesses, 261 Markov Processes). Let A be the differential operator A = f a'j(x)DiD j + f bi(x)D i (Di=rl/Dx i ) 2 i.jl i' with the domain C6 (R d ) ( = the totality of C 2 _ functions on R d with compact supports), where aij(x) = Lk1 aUx)ai(x). By Ito's formula, (7) ((w(t)) - j(w(O))- I (A{)(w(sl)ds 
1539 is an {g;;}-martingale for every f E CJ(R d ) (we set f(l1) = 0), and this property characterizes the diffusion process. The diffusion is gen- erated by the operator A in this sense. Fur- thermore, if for some »0, ().-A)(CJ(R d )) is a dense subset of CCX)(R d ) (= the totality of continuous functions f on R d such that limlxlcxJ(x)=O) then the ttransition semi- group of the diffusion is a tFeller semigroup on C",(R d ), and its infinitesimal generator A is the closure of (A, CJ(Rd)). Hence u(t, x) = ExU(w(t))], IE CJ(Rd), is the unique solu- tion of the evolution equation du/dt= Au, u I tO = f Generally, if the coefficients (J and b are sufficiently smooth, we can show, by using the stochastic differential equation (3), that u(t, x) is also smooth for a smooth f and satisfies the heat equation au/at = Au. Taking the expectations in (7), we have the re- lation ExU(w(t))J =f(x) + J Ex [Af(w(s))] ds, which implies that the transition probability P(t, x, dy) of the diffusion satisfies the equation op/ot=A*p in (t,y) in a weak sense, where A* is the adjoint operator of A. If a/at - A * is thypoelliptic, we can conclude that P(t, x, dy) possesses a smooth density p(t, x, y) by appeal- ing to the theory of partial differential equa- tions. Recently, P. Malliavin showed that a probabilistic method based on the stochastic differential equations can also be applied to this problem effectively, [6, 16, 17]. If crt, x) is continuous and u(t, x) is suffi- ciently smooth in (t, x) on [0, CIJ) X R d , then the following fact, more general than (7), holds: (8) u(t, w(t))exp[I c(s, w(s))ds J - u(O, x) - I exp[t c(u, w(u))du J x (iJu/at +(A + c)u)(s, w(s))ds is a local martingale (i.e., EAfIOC) with respect to {,:F" Px}. By applying the toptional sampling theorem to (8) for a class of {Y;}-stopping times, we can obtain the probabilistic repre- sentation in terms of the diffusion of solutions for initial or boundary value problems related to the operator A [3,4]. F. Stochastic Differential Equations with Boundary Conditions As we saw in the previous section, diffusion processes generated by differential operators can be constructed by stochastic differential equations. A diffusion process on a domain with boundary is generated by a differential operator that describes the behavior of the process inside the domain, and a boundary condition that describes the behavior of the process on the boundary of the domain. For 406F Stochastic Differential Equations example, consider a reflecting tBrownian mo- tion on the half-line [0, CIJ). This is a diffusion process X = (X,) on [0, CIJ) obtained by set- ting X, = Ix,l from a I-dimensional Brownian motion x,, The corresponding differential operator is A = id 2 /dx 2 , and the boundary condition is Lu == du/dx I FO = 0, that is, the transition expectation u(t, x) = ExLf(X , )] is determined by au/at=Au, Lu=O, and UI,O = f In constructing such diffusion processes with boundary conditions, stochastic differen- tial equations can be used effectively. In the case of reflecting Brownian motion, it was formulated by Skorokhod in the form (9) dX(t) =dB(t) +dcp(t). Here B(t) is a I-dimensional Brownian mo- tion (B(O)=O), X(t) is a continuous process such that X(t)O, and cp(t) has the following property with probability 1: cp(O)=O, t-4 cp(t) is continuous and non decreasing and increases only on such t that X(t)=O, i.e., J:> J:O)(X(s))dcp(s) = cp(t). Given a Brownian motion B(t) and a nonnegative random vari- able X(O) which are mutually independent, X(t) satisfying (9) and with the initial value X(O) is unique and given by X(t)= X(O) + B(t). t<(Jo=min{tIX(O)+B(t)=O} and X(t)=B(t) -minuo<,;s<';IB(s), t(Jo (P. Levy, Skorokhod;  [6,18J). In the case of multidimensional processes, possible boundary conditions were determineo by A. D. Venttsel' [24]. Stochastic differential equations describing these diffusions were for- mulated by N. Ikeda [5J in the 2-dimensional case and by Watanabe [23J in the general case as follows. Let D be the upper half-space R = {x =(x l ,... ,x d ) IXdO}, aD= {x I x d =O}, and 15 = {x I x d > O}. The general case can be reduced, at least locally, to this case. Suppose that the following system of functions is given: (J(x):D-4R d x R', b(x):D-4R d , T(X): aD -4R d - 1 x R", fi(x):aD-4R d - l , and p(x):oD-4 [0, ex:), which are all bounded and con- tinuous. Consider the following stochastic differential equation: r dXi(t) = I: (Jj(X(t))Ib(X(t))dBJ(t) JI + bi(X(t))Ib(X(t))dt S + I: ,1(X(t))li'D(X(t))dM k (t) k1 (10) +fii(X(t))dljJ(t), i = 1, 2, ... , d - 1, r dXd(t) = I: (J/(X(t))Jb(X(t))dBJ(t) JI + bd(X(t))Ib(X(t))dt +dcp(t), J(D(X (t))dt = p(X(t))dcp(t). 
406G Stochastic Differential Equations By a solution of this equation, we mean a system of continuous semimartingales .r = (X(t), B(t), M(t), <p(t)) over a probability space (Q,:?, P) with an increasing family {g;;} satis- fying the following conditions: (i) X(t)=(X1(t), ..., Xd(t)) is D-valued, i.e., Xd(t)  0; (ii) with probability 1, ip(O) =0, t-><p(t) is nondecreas- ing, and JIaD(X(s))d<p(s)=<p(t); (iii) B(t) and M (t) are r-dimensional and s-dimensional systems of elements in AID", respectively, such that <Bi, Bj), = (jUt, <B"Mm),=O, and <Mm,Mn),=(jmn<p(t), i,j= 1, ...,r, m, n= 1, ...,s; and finally (iv) the stochastic differentials of these semimartingales satisfy (10). The processes B(t), M(t), and <p(t) are sub- sidiary, and the process X(t) itself is often called a solution. We say that the uniqueness of solution holds if the law of X = (X(t)) is uniquely determined from the law of X(O). As before, the existence and the uniqueness of solutions imply that solutions define a diffu- sion process on D, and these are guaranteed if, for example, minxEr'Dadd(x) > 0 and a, b, " {J are Lipschitz continuous, [6,23]. Here, we set aU(x) = =1 af(x)ad(x) and IXU(X) = i=l T(X)Ti(x). It is a diffusion process gener- ated by the differential operator 1 d d A =- 2:: aU(x)Dp j + 2:: bi(x)D i , 2 i,jl il and by the Venttsel' boundary condition, 1 d-j d-j Lu(x) =2: 2:: lXij(x)DiDju(x) +2:: {Ji(x)Diu(x) 1,)-1 t=l +Ddu(x)-p(x)(Au)(x)=O on aD. G. Stochastic Differential Equations on Manifolds Let M be a connected a-compact Coo-manifold of dimension d, and let W M = C([O, oo)->M) be the space of all continuous paths in M. If M is not compact, let M = M U {.1} be the one- point compactification of M and W M be the space of all continuous paths in M with .1 as a ttrap. These path spaces are endowed with the a-fields (WM) and (WM)' respectively, which are generated by Borel cylinder sets. By a continuous process on M we mean a (WM,(WM))-valued random variable, and by a continuous process on M admitting explo- sions we mean a (W M , (WM))-valued ran- dom variable. In this section the probability space is taken to be the r-dimensional Wiener space (W o ,:?, P) with the increasing family {g;;}, where g;; is generated by ,(Wo) and P-null sets. Then w=(w(t)), WE W o , is an r- dimensional {g;;}-Brownian motion. Suppose that we are given a system of cae - vector fields Ao, AI' .,., Ar on M. We consider 1540 the following stochastic differential equation on M: (11) dX,= Ak(X,)odwk(t)+ Ao(X,idt. (Here, the usual convention for the omission of the summation sign is used.) A precise mean- ing of equation (11) is as follows: We say that X =(X,) satisfies equation (11) if X is an {g;;}- adapted continuous process on M admitting explosions such that, for any Coo-function f on M with compact support (we set f(.1) = O)J(X,) is a continuous semimartingale satisfying (12) df(X,) = (Akf)(X,) odwk(t) + (Aof)(X,)dt, where 0 is !to's circle operation defined in Section C. This is equivalent to saying that X, = (Xi, . " , X:), in each local coordinate, is ad-dimensional semimartingale such that (13) dX: = af(X,) odwk(t) + bi(X,)dt = af(X,)dwk(t) +[tt Djafad+b}x,)dt, where Adx) = af(x)Di' k = 1, 2,..., r, and Ao(x) =bi(x)D i . By solving the equation in each local coordinate and then putting these solu- tions together, we can obtain for each XE M a unique solution X, of (11) such that Xo = x. We can also embed the manifold M in a higher-dimensional Euclidean space and solve the stochastic differential equation there. We denote the solution by X(t, x, w). The law Px on W M of [t-> X(t, x, w)] defines a diffusion process on M which is generated by the dif- ferential operator A =t =1 Af + .4 0 ' Next, if we consider the mapping X-> X(t, X, w); then, except for w belonging to a set of P-measure 0, the following is valid: For all (t, w) such that X(t, xo, W)E M, the mapping x->X(t, x, w) is a diffeomorphism between a neighborhood of Xo and a neighborhood of X(t, x o , w). This is based on the following fact for stochastic differential equations on Rd. If in equation (3) the coefficients af and b i are C"'-functions with bounded derivatives of all orders IX, IIX I  1, then, denoting by X (t, x, w) the solution such that X(O)=x, we have that x->X(t, x, w) is, with probability 1, a diffeo- morphism of R d for all t [13]. Example 1: Stochastic moving frame [6, 15]. Let M be a Riemannian manifold of dimen- sion d, O(M) be the orthonormal frame bundle over M, and L1' L2' ..., Ld be the basic vector fields on O(M), that is, 1 (LJ)(x, e)=lim -[j(x" e,)-f(x, e)], '-0 t j= 1,... d, 
1541 where e = (e 1, .., , e d ) is an orthonormal basis in T;;(M), x,= Exp(teJ, i.e., the geodesic such that Xo = x and x = e j , and e, is the para11el translate of e along x,. Let b be a vector field on M and Lb be its horizontal lift on O(M), i.e., Lb is a vector field on O(M) determined by the fo11ow- ing two properties: (i) Lb is horizontal and (ii) dn(Lb)=b, where n:O(M)-->M is the projec- tion. Consider the fo11owing stochastic differ- ential equation on O(M): dr(t) = L;(r(t)) 0 dwi(t)+ Lb(r(t))dt. Solutions determine a family of (local) diffeo- morphisms r-->r(t, r, w)=(X (t, r, w), e(t, r, w)) on O(M). The law of [t-->X(t, r, w)J depends only on x = n(r), and it defines a diffusion process on M that is generated by the dif- ferential operator t!1 M + b (!1 M is the Laplace- Beltrami operator). Using this stochastic mov- ing frame r(t, r, w), we can realize a stochastic para11el translation of tensor fields along the paths of Brownian motion on M (a diffusion generated by t!1 M ) that was first introduced by Ita [10J, and by using it we can treat heat equations for tensor fields by means of a prob- abilistic method. Example 2: Brownian motion on Lie groups. Let G be a Lie group. A stochastic process {g(t)} on Gis ca11ed a right-invariant Brownian motion if it satisfies the fo11owing conditions: (i) With probability 1, g(O) = e (the identity), and t-->g(t) is continuous; (ii) for every ts, g(t)g(S)-1 and a(g(u); u:;::; s) are independent; and (iii) for every ts, g(t)g(sf 1 and g(t-s) are equa11y distributed. Let Ao, A l' "', Ar be a system of right- invariant vector fields on G, and consider the stochastic differential equation (14) dg, = Aj(g,) 0 dwi(t)+ Ao(g,)dt. Then a solution of (14) with go = e exists uniquely and globa11y; we denote this solution by g0(t, w). It is a right-invariant Brownian motion G, and conversely, every right-invariant Brownian motion can be obtained in this way. The system of diffeomorphisms g-->g(t, g, w) defined by the solutions of (14) is given by g-->g(t, g, w)= gO(t, w)g. Genera11y, if M is a compact manifold, the system of diffeomorphisms g,: x-->X(t, x, w) defined by equation (11) can be considered as a right-invariant Brownian motion on the infinite-dimensional Lie group consisting of a11 diffeomorphisms of M [2]. References [IJ B. S. Tsirel'son (Cirel'son), An example of stochastic differential equation having no 406 Ref. Stochastic Differential Equations strong solution, Theory Prob. App1., 20 (1975), 416-418. [2J K. D. Elworthy, Stochastic dynamical systems and their flows, Stochastic Analysis, A. Friedman and M. Pinsky (eds.), Academic Press, 1978,79-95. [3J A. Friedman, Stochastic differential equa- tions and applications I, II, Academic Press, 1975. [4J I. I. Gikhman and A. V. Skorokhod, Sto- chastic differential equations, Springer, 1972. [5J N. Ikeda: On the construction of two- dimensional diffusion processes satisfying Wente11's boundary conditions and its applica- tion to boundary value problems, Mem. Col1. Sci. Univ. Kyoto, (A, Math.) 33 (1961), 367- 427. [6J N. Ikeda and S. Watanabe, Stochastic differential equations and diffusion processes, Kodansha and North-Ho11and, 1981. [7J K. Ita, Differential equations determining Markov processes (in Japanese), Zenkoku Shijo Sugaku Danwakai, 1077 (1942),1352- 1400. [8J K. Ita, On stochastic differential equations, Mem. Amer. Math. Soc., 4 (1951). [9J K. Ita, Stochastic differentials, App1. Math. and Optimization, 1 (1975),347-381. [10] K. Ita, The Brownian motion and tensor fields on Riemannian manifold, Proc. Intern. Congr. Math., Stockholm 1962,536-539. [11J K. Ita and S. Watanabe, Introduction to stochastic differential equations, Proc. Intern. Symp. SDE, Kyoto, 1976, K. Ita (ed.), Kino- kuniya, 1978, i-xxx. [12J T. Jeulin, Semi-martingales et grossisse- ment d'une filtration, Lecture notes in math. 833, Springer, 1980. [13J H. Kunita, On the decomposition of solutions of stochastic differential equations, Proc. LMS Symp. Stochastic Integrals, Durham, 1980. [14J H. Kunita and S. Watanabe, On square integrable martingales, Nagoya Math. J., 30 (1967),209-245. [15J P. Ma11iavin, Geometrie differentie11e stochastique, Les Presse de l'Universite de Montreal, 1978. [16J P. Ma11iavin, Stochastic calculus ofvari- ation and hypoe11iptic operators, Proc. Intern. Symp. SDE, Kyoto, 1976, K. Ita (ed.), Kino- kuniya, 1978, 195-263. [17J P. Ma11iavin, C k -hypoe11ipticity with degeneracy, Stochastic Analysis, A. Friedman and M. Pinsky (eds.), Academic Press, 1978, 199-214,327-340. [18J H. P. McKean, Stochastic integrals, Academic Press, 1969. [19J P. A. Meyer, Un cours sur les integrales stochastiques, Lecture notes in math. 511, Springer, 1976,245-400. 
407 A Stochastic Processes [20] A. V. Skorokhod, Studies in the theory of random processes, Addison-Wesley, 1965. [21] D. W. Stroock and S. R. S. Varadhan, Diffusion processes with continuous coeffi- cients I, II, Comm. Pure Appl. Math., 22 (1969),345-400,479-530. [22] D. W. Stroock and S. R. S. Varadhan, Multidimensional diffusion processes, Springer, 1979. [23] S. Watanabe, On stochastic differential equations for multi-dimensional diffusion processes with boundary conditions, I, II, J. Math. Kyoto Univ., 11 (1971), 169-180,545- 551. [24] A. D. Venttsel' (Wentzell), On boundary conditions for multidimensional diffusion pro- cesses, Theory Prob. Appl., 4 (1959), 164-177. [25] T. Yamada and S. Watanabe, On the uniqueness of solutions of stochastic differen- tial equations, J. Math. Kyoto Univ., 13 (1973), 497-512. 407 (XVI1.4) Stochastic Processes A. Definitions The theory of stochastic processes was origi- nally involved with forming mathematic;al models of phenomena whose development in time obeys probabilistic laws. Given a basic tprobability space (Q, , P) and a set T of real numbers, a family {X'}'ET of real-valued tran- dom variables defined on (Q, ,P) is called a stochastic process (or simply process) over (Q, , P), where t is usually called the time parameter of the process. For each finite t-set {t h ..., t n }, the tjoint distribution of (X,!' ..., X,) is called a finite-dimensional distrihution of the process {X'},E1' Stochastic processes are classified into large groups such as taddi- tive processes (or processes with independent increments), +Markov processes, +Markov chains, tdiffusion processes, +Gaussian pro- cesses, +stationary processes, tmartingales, and +branching processes, according to the prop- erties of their finite-dimensional distributions. This classification is possible because of the following fact, a consequence of Kolmo- gorov's +extension theorem ( 341 Proba- bility Measures I): Given a system .0fJ of finite- dimensional distributions satisfying certain tconsistency conditions, we can construct a suitable probability measure on the space W = R 1 of real-valued functions on T so that the stochastic process {X'},ET' obtained by setting X,(w)=the value of WE W at t, has : as its system of finite-dimensional distri- 1542 butions. Now, consider two stochastic pro- cesses .0£ = {X,} 'E T and '!Y = { Y,} 'E 1 qlj is called a modification of .5f if they are defined over a common probability space (Q, , P) and P(X, = Y,)= 1 (tET). Regardless of whether .0£ and '!Y are defined over a common probability space or over different probability spaces, X and Yare said to be equivalent or each is said to be a version of the other if their finite- dimensional distributions are the same. Ac- cording to Kolmogorov's extension theorem, every stochastic process has a version over the space W=R T . The function X(w) of t obtained by fixing w in a stochastic process {X'}'ET is called the sample function (sample process or path) corre- sponding to w. In applying various operations to stochastic processes and studying detailed properties of stochastic processes, such as continuity of sample functions, the notions of measurability and separability play important roles. We assume that T is an interval in the real line, and (if needed) that the probability measure P is tcomplete. Denote by  the class of all tBorel subsets of T. A stochastic process {X'}'ET is said to be measurable if 1he function X,(w) of (t, w) is  x -measurable. Continuity in probability defined in the next paragraph gives a sufficient condition for a stochastic process to have a measurable modification. A stochastic process {X, LET is said to be sepa- rable if there exists a countable su bset S of T such that P{WIi!! Xs(w)X,(w) limsupXs(w) for any tE T } = 1. s-tr,SES It was proved by J. L. Doob that every sto- chastic process has a separable modification [6]. Various types of continuity are considered for stochastic processes. {X'}'ET is said to be continuous in probability at SE T if P(IX,-Xsi > F.)-+O (t-+s, tE T) for each F.> 0; it is said to be continuous in the mean (of order 1) at SE T if E(lX,-+XslJ-+O (t-+s, tE T). Continuity in the mean of order p (> 1) is defined similarly. Con- tinuity in the mean of any order implies con- tinuity in probability. Suppose that {X'},ET is separable. Then Ds= { w I X,(W)= lim X'(W) } C and U D, t---',tE T 'liE T are measurable events. If P(Ds) > 0, then SE T is called a fixed point of discontinuity. The con- dition P(USETD,)=O means that almost all sample functions are continuous. Regularity properties of sample functions of processes, such as continuity or right continuity, have 
1543 been studied by many people. The following theorem is due to A. N. Kolmogorov: Let T= [0, I]. If E(IX,- XsIY):;::;clt-sI I +f for constants 1'>0, 8>0, and c>O, then {X'}'E1 has a modification {X,},er for which almost all sample functions are continuous, and P [ limh-. sup IX,-X,I=O 1= I hlO I,-sl';h J l,SE1 for any 15(0 < 15 < 8/1'). Each of the following is a sufficient condition for :;l' = {X'}'E 1 to have a modification for which almost all sample paths are right continuous functions with left limits. (i) f!( is an additive process which is continuous in probability (P. Levy [3,4], K. Ito [8J;  5 Additive Processes B). (ii) FE is a supermartin- gale that is continuous in probability (Doob [6J;  262 Martingales C). B. Increasing Families of a-Algebras In the investigation of stochastic processes (especially Markov processes. martingales, and stochastic differential equations), the notion of increasing families of a-algebras often plays an important role. Let (Q,, P) be a probability space, and let T = [0, (fJ). A family {'}'ET of a-subalgebras of  is called an increasing family of a-algebras on (Q, ,P) if,c, for s<t. A process {X'}'E1 is said to be adapted to {,} if X, is ,-measurable for each t E T. {X,} is said to be progressively measurable (or a progressive process) with respect to {,} if for every tE T the mapping (s, w)f-> Xs(w) of [0, tJ x Q into R is measurable with respect to the a-field ([O, tJ) x ,. A process {X,} with right continuous paths, adapted to {,}, is progressively measurable with respect to {,}. The same concl usion holds for a process with left continuous paths. A subset A of [0, (fJ) x Q is said to be progres- sive if the indicator process a,( w) = 1 A (t, (J)) of A is a progressive process. A random time, on Q with values in [0, (fJ J is called a stopping time (or Markov time) if {,:;::; t} E, for all t  0. Constants (O) are stopping times. If a and, are stopping times, then min(a, ,) and max(a, ,) are also stopping times. The limit of an increasing sequence of stopping times is a stopping time, while the limit of a decreasing sequence of stopping times is a stopping time with respect to {,+}, where ,+ = n,>, s' Let , be the class of A E such that An {,:;::; t} E, (v tE T); then it is a a-algebra if, is a stopping time. If {X,} is a progressive pro- cess and if, is a stopping time, then X,l:«oc) is ,-measurable. An increasing family {,} 407 B Stochastic Processes of a-algebras on (Q, ,P) is said to be com- plete if the probability space (Q, , P) is com- plete and if all the P-negligible sets belong to o. Fron now on we assume that {,} is complete and right continuous (i.e., ,= ,+ for all t  0). Let B be a subset of Rand {X,} be a process. We call 'B= inf{ tOI X,(W)E B} a hitting time for B. Measurability of 'B is not always guaranteed, that is, 'B is not always a stopping time. G. A. Hunt showed that for a wide class of Markov processes hitting times for analytic sets are stopping times. This result is based on a theorem ofG. Choquet on capa- citability and was generalized by P. A. Meyer as follows: (i) For every progressively mea- surable process, hitting times for analytic sets are stopping times; and (ii) for every progres- sive set A, D A =inf{tOI(t,w)EA} is a stop- ping time. The following notions on measur- ability are also important. The predictable a- algehra on [0, (fJ) x Q, denoted by .0/, is defined to be the least a-algebra on [0, (fJ) x Q with respect to which every process X,(w) that is adapted to {,} and has left continuous paths is measurable in (t, w). The well-measurable or optional a-algebra on [0, oc) x Q, denoted by (0, is defined to be the least a-algebra on [0,(0) x Q with respect to which every process X,(w) that is adapted to {,} and has right continu- ous paths with left limits is measurable in (t, w). A process {X,} defined on Q is said to be pre- dictable (resp. well-measurable or optional) if the function (t,w)f->X,(w) on [0, (0) x Q is measurable with respect to the predictable a- algebra:lJ' (resp. the optional a-algebra (1:'). For further information regarding the notions given in this section  [10]. Up to this point it was assumed that the space in which a process {X'}'ET takes values, namely, the state space of {X,},er, is a set of real numbers; but in general, topological spaces or merely measurable spaces can be taken for the state spaces of stochastic pro- cesses. The general definitions and results already given can be extended to stochastic processes whose state spaces are tlocally com- pact Hausdorff spaces satisfying the second tcountability axiom. Moreover, the time parameter set T of a process {X'}'ET need not be a set of real num- bers. For example, P. Levy [12J and H. P. McKean [13J investigated stochastic processes with several-dimensional time; such processes are sometimes called random fields. The case in which Tis :$, if, or in general a space of functions (which is nuclear) has also been investigated ( Section C). A probabilistic formulation of equilibrium states given by R. L. Dobrushin [14J initiated recent probabilis- tic study of statistical mechanics. For further information concerning processes with general 
407 C Stochastic Processes time parameter spaces  136 Ergodic Theory, 176 Gaussian Processes, 340 Probabilistic Methods in Statistical Mechanics. C. Random Distributions and Generalized Stochastic Processes The investigation of random distributions was initiated by I. M. Gel'fand [15] and Ito [17]. Denote by f0 the space of functions of t ( - 00 < t < 00) of class CW with compact tsupport, and by f0' the space of tdistributions. At func- tion X(<p, w) of wEQ and <pEf0 is called a random distribution (or generalized stochastic process) if X(<p, w) is a distribution as a func- tion of <p for almost all wand is measurable as a function of w for each fixed <po Denote by (f0') the smallest a-algebra containing sets of the form {YEf0'IY(<p)EE} (<pEf0, E is a Borel set). A random distribution is nothing but a f0'-valued random variable. For a random distribution X(<p,w), a probability measure <l>x on (f0') is induced by <l>x(B) = P{wl X(', W)E B}, BE(f0'). The functional e(<p) = E(eiX(q>.W») = f eiy(q»<I>x(dy), <pEf0, is called the characteristic functional of X (<p, w) or <l>x. The functional e(<p) is continuous posi- tive definite, and e(O) = 1. Conversely, given a functional e(<p) with these properties, a theo- rem ofR. A. Minlos ( 341 Probability Measures J) states that there exists a unique probability measure <I> on (f0') whose charac- teristic functional equals e(<p). In other words, a random distribution with the character- istic functional e(<p) can be constructed over (f0', (f0'), <1». Typical classes of random distributions that have been investigated so far are stationary random distributions and random distri- butions with independent values at every point. (For stationary ones  395 Stationary Processes.) A random distribution X(<p, w) is called a random distribution with independent values at every point if <Pr(t)<P2(t) =0 implies the independence of X(<P1'W) and X(<P2,W), that is, e(<p1 + <P2) = c(<Pt1e(<P2)' A sufficient condition for the functional of the form e(<p) =exp(I-: f(<p(t), <p'(t),..., <p(k)(t))dt) (.f is continuous and frO, ... ,0) = 0) to be the characteristic functional of a stationary ran- dom distribution with independent values at every point is that the function exp(sf(x o , XI' ..., x k )) of (x o , XI' ..., Xk)E R k be positive defi- nite for each s> 0 [16]. Under this condition, 1544 a concrete representation of f is known [16]. When k = 0, a necessary and sufficient condi- tion for e(<p) to be the characteristic functional of a stationary random distribution with independent values at every point IS that exp f(x) be the characteristic function of an tinfinitely divisible distribution. One such random distribution is the so-called Guassian white noise, namely, the distribution deriva- tive of Brownian motion whose characteristic functional is ex p ( -fl<P(t)12dt). A family {Xq>} q>E£C of real-valued random variables indexed by f0 is called a random distribution in the wide sense if Xq> is linear in <p, namely, Xaq>+b", = aXq> + bX", with proba- bility 1 for fixed <p, if; E f0, and real constants a, b, and if Xq> I-> 0 in probability whenever <p-->O in the topology of f0. (For a typical class of random distributions in the wide s'nse  395 Stationary Processes C.) A random distri- bution in the wide sense has a modification that is a random distribution. In the definition of random distributions (in the wide sense) one can replace the space f0 by the space!:!' of rapidly decreasing C W _ functions or in general by some space <I> of functions that is nuclear. For example, one can define random distributions as !:!,'-valued random variables. Up to this point random distributions of one variable have been considered. Random distributions of several variables are called generalized random fields and have been in- vestigated by Ito [18], A. M. Yaglom [19], Gel'fand and N. Va. Vilenkin [16]. and others. Moreover, K. Urbanik [20,21] developed a theory of generalized stochastic processes based on G. Mikusinski's theory instead of L. Schwartz's theory of distributions. D. Random Measure Let (S, 3', m) be any a-finite measure space, and put 3'0 = {A E 3' I m(A) < oo}. By virtue of tKolmogorov's extension theorem, there exists a family {W(A)} AEcro of real random variables indexed by 3'0 such that (i) for any mutually disjoint A I' ..., AnE 3'0' {W(A rJ, ..., W(A n )} is independent; (ii) for any A E 3' 0' W( A) is Gauss- ian distributed with mean 0 and variance m(A); and (iii) for any A, BE 3'0, EI W(A) W(B)) = m(A n B). Similarly, there exists a family {N(A)}AEcro of real-valued random variables indexed by 3'0 such that (i) for any mutually disjoint Aj, ...,AnEIYo, {N(A 1 ), ...,N(A n )} is independent; (ii) for any A E 3'0, NIA) is tPois- son distributed with mean m(A); and (iii) for 
1545 any A, BE'iJo, E(N(A)N(B))=m(A nB). {W(A)} AE\rO and {N(A)} AE\rO are called a Gaussian random measure and a Poisson ran- dom measure associated with the measure space (S, 'iJ,m), respectively. By using these random measures, the theory of multiple inte- grals can be developed. By a point function p on S we mean a map- ping p:Dp-->S, where the domain Dp is a count- able subset of (0, 00). p defines a counting measure N p on (0, 00) x S such that Np«O, tJ x U)= # {sEDp;s:;::;t,p(S)EU} (t>O, UE'iJo)' For a point function p and t > 0, the shift point function 0,p is defined by De,p= {SE(O, 00); S + tED p } and (0,p)(s) = p(s+ t). Let fls be the totality of point functions on S and (fls) be the smallest a-field on fls with respect to which all p--> Np«(o, tJ x U, t> 0, U E 'iJo) are measurable. A point process on S is a (fls, (fls))-valued random variable. Then there exists a point process p on S such that (i) for any t > 0, p and 0,p have the same prob- ability law, and (ii) N p is a Poisson random measure associated with «0, 00) x S, (O, 00) x P4(fls),dt x m(ds)). The point process p is called the stationary Poisson point process with the characteristic measure m. References [1J A. N. Kolmogorov, Grundbegriffe der Wahrscheinlichkeitsrechnung, Erg. Math., 1933; English translation, Foundations of the theory of probability, Chelsea, 1950. [2J N. Wiener, Differential-space, J. Math. Phys.,2 (1923), 131-174. [3J P. Levy, Processus stochastiques et mouvement brownien, Gauthier-Villars, 1948. [4J P. Levy, Theorie de I'addition des vari- ables aleatoires, Gauthier-Villars, 1937. [5J M. S. Bartlet, An introduction to stochas- tic processes, Cambridge Univ. Press, 1955. [6J 1. L. Doob, Stochastic processes, Wiley, 1953. [7J J. L. Doob, Stochastic processes depending on a continuous parameter, Trans. Amer. Math. Soc., 42 (1937),107-140. [8J K. Ita, Stochastic processes, Aarhus Univ. lecture notes 16, 1969. [9J P. A. Meyer, Une presentation de la theorie des ensembles sousliniens; application aux processus stochastiques, Sem. Theorie du Potentiel, 1962-1963, no. 2, Inst. H. Poincare, Univ. Paris, 1964. [1 OJ C. Dellacherie and P. A. Meyer, Prob- abilites et potentiel, Hermann, 1975, chs. I-IV. [IIJ C. Dellacherie, Capacites et processus stochastiques, Springer, 1972. [12J P. Levy, Le mouvement brownien fonc- tion d'un ou de plusieurs parametres, Rend. 408 A Stochastic Programming Sem. Math. Univ. Padova, (5) 22 (1963), 24- 101. [13J H. P. McKean, Jr., Brownian motion with a several-dimensional time, Theory Prob. Appl., 8 (1963), 335-354. [14J R. L. Dobrushin, The description of the random field by its conditional distributions and its regularity conditions, Theory Prob. Appl., 13-14 (1968),197-224. (Original in Russian, 1968). [15J I. M. Gel'fand, Generalized random processes (in Russian), Dokl. Akad. Nauk SSSR (N.S.), 100 (1955),853-856. [16J I. M. Gel'fand and I. Va. Vilenkin, Gen- eralized functions IV, Academic Press, 1964. (Original in Russian, 1961.) [17J K. Ita, Stationary random distributions, Mem. Coil. Sci. Univ. Kyoto, (A) 28 (1954), 209-223. [18J K. Ita, Isotropic random current, Proc. 3rd Berkeley Symp. Math. Stat. Prob. II, Univ. of California Press, 1956, 125-132. [19J A. M. Yaglom (Jaglom), Some classes of random fields in n-dimensional space related to stationary random processes, Theory Prob. Appl., 2 (1957), 273-320. (Original in Russian, 1957.) [20J K. Urbanik, Stochastic processes whose sample functions are distributions, Theory Prob. Appl., 1 (1956), 132-134. (Original in Russian, 1956.) [21J K. Urbanik, Generalized stochastic pro- cesses, Studia Math., 16 (1958), 268-334. 408 (XIX.7) Stochastic Programming A. General Remarks Stochastic programming is a method of finding optimal solutions in mathematical program- ming in its narrow sense ( 264 Mathematical Programming), when some or all coefficients are stochastic variables with known probabil- ity distributions. There are essentially two dif- ferent types of models in stochastic program- ming situations: One is chance-constrained pro- gramming (CCP), and the other is a two-stage stochastic programming (TSSP). The differ- ence between them depends mainly on the informational structure of the sequence of ob- servations and decisions. For simplicity, let us here consider stochastic linear programming, which is the best-known model at present. Let Ao, A be m x n-dimensional matrices and x, C E R" and b, b o E R m . Suppose further that components of A, b, c are random variables, while those of A o ' b o are constants. Consider 
408 B Stochastic Programming the following formally defined linear program- ming problem: minx{c'xl Ax :;::;b,XEX o }, Xo= {x I Aox:;::;b o , xO}. Let (Q,, P) be a proba- bility space ( 342 Probability Theory) such that {A(w),b(w),c(w)} is a measurable trans- formation on Q into Rm x n+m+n. B. Chance-Constrained Programming (CCP) This method is based on the assumption that a decision x has to be made in advance of the realization of the random variables. Sup- pose that Ai(w) is the ith row of A(w), and bi(w) is the ith component of b(w). We call P( {w I A i( w)x :;::; b i ( w) } )  Ct. i a chance constraint, where Ct. i is a prescribed fractional value deter- mined by the decision maker according to his attitude toward the constraint Ai(w)x:;::; bi(w): if he attaches importance to it, he will take Ct. i as great as possible. Defining feasible sets Xi(Ct..) and X by Xi(Ct.i) = {xl P({wIAi(w)x:;::; b,(W)})Ct.i}, X = Xo n {n1 Xi(Ct.i)}, we can formulate CCP as follows: minx{F(x)1 XEX}, where F: X --> R, is the certainty equivalent of the stochastic objective function c'x. We have four models of CCP corresponding to differ- ent types of F(x): (i) E-model: F(x) = c'x, c = Ewc(w)'x. (ii) V-model: F(x)= Var(c(w)'x)= x'v;.x, where v;. is a variance-covariance ma- trix of c(w). (iii) PI-model: F(x) = 1, P({wl c(w)'x :;::;f})  Ct. o , 1/2:;::; Ct. o :;::; 1. (iv) P 2 -model: F(x) = P( {w I c(w)'x y}) for a given constant y. In particular, if the components of A(w), b(w), c(w) have a multidimensional normal distri- bution, the certainty equivalent of the ith chance constraint is derived in the following form: A,x + <1>-[ (Ct.;) (x'V;x + 2w;x + Vf)112:;::; 5 i , where Ai, V;, Wi, V" 5 i are expectation vectors or a variance-covariance matrix of Ai(w) and bi(w) and <I>(t) = S'-ooexp( -z2/2)dz/. The set Xi(Ct..) for this constraint can be shown to be convex for 1/2:;::; Ct. i :;::; 1, by using the con- vexity of the function j x' Vx for a positive semidefinite matrix V. Under the same assump- tion we can obtain the objective functions F(x)=c'x+<I>-'(Ct.o) jx'v;.x for the PI-model and F(x)=(c'x-y)/ jx'v;.x for the Prmodel. These four models have been shown to be computable by applying convex programming techniques, including the conjugate gradient method. Further studies on the convexity of a more general chance constraint P( {w I A(w)x:;::; b(w)})Ct., 0:;::; Ct.:;::; 1, appear in several articles. C. Two-Stage Stochastic Programming (TSSP) This method divides the decision process into two stages. First stage: Before the realization of random variables, one makes a decision x, 1546 being allowed to compensate for it after the specification of those values. Second stage: One obtains an optimal compensation YE R' for the given x and the realized values of the random variables. Assuming that q E R' is a random vector in addition to A, b, C, we can formulate TSSP as follows. First stage: minx Ew {(c(w)'x + Q(x, w) I x E X)}; second stage: Q(x,w)=miny{q(w),yl Wy=A(w)x- b(w),yO}, where X =xon K, K 0= {xl Q(x,w) < +00 with probability I} and q(UJ)'Y is a loss function for the deviation A(w)x -- b(w). The m x n matrix W is called a compensation ma- trix. Several theorems have been proved: (i) K is a closed convex set; (ii) Q(x) = EwQ(x, w) is a convex function on K if the random variables in A(w), b(w), q(w) are square integrable; (iii) if P has a density function, then Q(x) has a con- tinuous gradient on K; (iv) when P has a finite discrete probability distribution, a TSSP prob- lem is reduced to a linear programming prob- lem having a dual decomposition structure. References [IJ A. Charnes and W. W. Cooper, Chance- constrained programming, Management Sci., 6 (1959), 73-79. [2J G. B. Dantzig and A. Madansky, On the solution of two-stage linear programs under uncertainty, Proc. 4th Berkeley Symp. Math. Statist. Probab. I, Univ. of California Press, 1961. [3J S. Kataoka, A stochastic programming model, Econometrica, 31 (1963), 181-196. [4J I. M. Stancu-Minasian and M. J. Wets, A research bibliography in stochastic program- ming 1955-1975, Operations Res., 24 (1976), 1078-1119. [5J S. Vajda, Probabilistic programming, Academic Press, 1972. [6J P. Kall, Stochastic linear programming, Springer, 1976. [7J V. V. Kolbin, Stochastic programming, Reidel 1977. (Original in Russian, 1977.) 409 (11.7) Structures A. Examples of Structure Structure is a unified description of mathe- matical objects such as tordered sets, trings, tlinear spaces, ttopological spaces, tprob- ability spaces, tmanifolds, etc., usi I1g only the concepts tset and trelation. The following are examples. 
1547 (1) Order. An tordering on a set A is a binary relation in A ( 358 Relations) with a tgraph 'Y. such that (i) if a E A, then (a, a) E 'Y.; (ii) if (a,b)E'Y. and (b,a)E'Y., then a=b; and (iii) if(a,b)E'Y. and (b,C)E'Y., then (a,c)E'Y., where 'Y. is an element of the tpower set 1J(A x A). We say that 'Y. determines a structure of ordering on the set A. (2) Law of composition. A law of compo- sition on a set A is a mapping from A x A (or a subset) to A. This mapping is considered as a ternary relation with a graph 'Y.E(A x A x A), satisfying the following two conditions (or possibly only (ii)): (i) if (a, b)E A x A, then (a, b, C)E 'Y. for some CE A; (ii) if (a, b, C)E 'Y. and (a, b, C')E'Y., then C = c'. We say that 'Y. deter- mines the structure of a law of composition in A. The tassociative law in the law of com- position determined by'Y. is given by: (iii) if (a, b, X)E'Y., (b, C,Y)E'Y., (x, C,Z)E'Y., and (a,y,z')E'Y., then Z = z'. A set with conditions (i), (ii), and (iii) is called a semigroup. (3) Operation. An operation of a set A on a set B is a mapping from A x B (or a subset) to B. It is considered as a ternary relation with the graph YE1J(A x B x B), satisfying the fol- lowing two conditions (or possibly only (ii)): (i) if (a, b)E A x B, then (a, b, C)EY for some CE B; (ii) if (a, b, c)Ey and (a, b, C')EY, then C =c'. We say that I' determines the structure of an oper- ation of A on B. Each element 'Y. of A is called an operator on B; A is called a domain of operators on B, and B is called an A-set. When B is the main object of consideration, an oper- ation of A on B is sometimes called an external law of composition of A on B. The law of com- position of A as described in (2) is then called an internal law of composition of A. When a domain of operators A on B determined by YE(A x B x B) has an internal law of compo- sition determined by 'Y.E\(A x A x A), it is usually assumed that the following conditions on 'Y., yare satisfied: (iii) if (a, b, X)E'Y., (b, c, Y)E 1 ', (X,C,Z)EY, and (a,y,z')EY, then z=z'. If we denote the law of composition by (a, b)->ab and the operation by (a, b)->a' b, then con- dition (iii) may be written: (iii') (ab)'c=a'(b'c) for a, bEA and CEB. When an A-set B with an external law of composition determined by l'E1J(A x B x B) has an internal law of compo- sition determined by f)E (B x B x B), it is usually assumed that the following condition on 11,)' is satisfied: (iv) if(a,b,x)E}', (a,c,Y)EY, (b,c, Z)E f), (x, y, W)E f), and (a, z, W')E)', then W = w'. According to the notation ab and a' b, it is described as: (iv') (a'b)(a'c)=a'(bc) for aEA and b, CE B. The mappmg A x B -> B( B x A -> B) is called a left (right) operation of A on B. To emphasize leftness or rightness, "Ieft-" or "right-" is at- tached to correspondmg concepts. 409 B Structures (4) Topology. A ttopology on a set A is determined by a set 'Y.E1J(A), called the tsys- tem of open sets, satisfying the following conditions: (i) 0E'Y. and AE'Y.; (ii) if f)c'Y., then UHE'Y.; and (iii) if f)c'Y. is finite, then nf1E'Y.. We say that 'Y. defines the structure of a topol- ogy on the set A ( 425 Topological Spaces). B. Mathematical Structures We now explain the concept of mathematical structure for the case of a tlinear space ( 256 Linear Spaces). A linear space has two basic sets, one of which is a set K of elements called scalars and the other, a set V of elements called tvectors, two laws of compositions in K called addition and multiplication, a law of compo- sition in V called addition, and an operation of K on V called scalar multiplication. The laws of composition and the operation are given by elements of power sets: 'Y.h 'Y.2E(K x K x K), 'Y.3E(VX Vx V), and 'Y.4E1J(K x Vx V); and the basic properties of the linear space, such as i.(a +b)= Xa+ i,b (lE K, a, bE V), are described as propositions on K, V, 'Y. 1 , ..., 'Y.4 and denoted by P(K, v,'Y. 1 '...,'Y. 4 ),.... Up to now, we have been considering a given linear space. To give a description of a linear space in general, we use the symbols Xl' X 2 , (1, ..., (4 instead of the symbols K, V, 'Y." ". ,'Y. 4 , replace conditions such as 'Y., E (K x K x K),... by (1 E1t\(X, X XI x XtJ, ..., and consider the set r of these symbols and formulas: r:x 1 , X 2 , ("..., (4; (lE\(X, xX, XXI),'''' (4E1J(X, x X 2 x X 2 ). The set r is called the type of linear space. Similarly, we consider the set r of all P(X j , X 2 , (h"', (4),'" corresponding to the basic properties P(K, V, 0(" ..., 'Y. 4 ), ... of the linear space. The set r is called the axiom system of the linear space. I n general, let A" ... , Am be the basic sets (K and V in the preceding example). The basic concepts 'Y. 1 , "', 'Y. n ('Y.1' ..., 'Y.4 in the preceding example) are given as elements of finitely generated sets from AI' "', Am' i.e., elements of sets obtained by a finite number of ap- plications of the operations of forming the tCartesian product and the tpower set from A , , . .. , Am. Basic properties are given as pro- positions on AI' "', Am' 'Y. 1 , ..., 'Y... These basic properties and AI, ..., Am' 'Y.1, "', 'Y. n determine a mathematical system. We consider also a type r of symbols Xl,"" X m of basic sets and symbols (h ... , (n of basic concepts, and an 
409 C Structures axiom system r of basic properties. The pair (.E, r) determines a mathematical struc- ture. When we substitute sets AI, ..., Am for Xl, "', X m and!X""',!X n for 1"", n' where X" '" , X m and (" ... , (n satisfy the axiom system r, then (A" ...,Am,!X" ...,!X n ) is called a mathematical system with the mathematical structure (.E, r), or a model of the structure (.E, r). Two mathematical systems are called similar or of the same kind if they have the same mathematical structure. Groups, rings, topological spaces, etc., are mathematical structures. Mathematical systems are some- times called algebraic systems in the wider sense, and when we consider mainly their laws of composition (and operations), we ca1l them algebraic systems. We explain this in further detail in Section C. C. Algebraic Systems Algebraic systems are sets with laws of compo- sition and operations satisfying certain axiom systems; the laws of composition and oper- ations and the axiom systems they satisfy determine their type ( 2 Abelian Groups, 29 Associative Algebras, 42 Boolean Algebra, 67 Commutative Rings, 149 Fields, 151 Finite Groups, 190 Groups, 231 Jorden Algebras, 248 Lie Algebras, 368 Rings). Each algebraic sys- tem has its own theory, but general properties and related concepts are dealt with from a common standpoint. From this common ground we often get an insight into concepts from which arose a general theory of math- ematical systems. We describe here only al- gebraic systems, but it is possible to describe similar concepts for mathematical systems. The following is a description based mainly on one law of composition (a, b)-->ab; a similar description is possible for the case of two or more laws of composition. The law of composition ab is sometimes written a + h, a' b, [a, bJ, etc. A mapping f: A--> A' of similar algebraic systems A and A' is called a homomorphism provided that.f (ab) = j (a) f(b) (a, hE A). A' is said to be homomor- phic to A if there is homomorphism from A onto A'. If j is one-to-one, onto, and.its in- verse mapping I-' : A' -->A is also a homomor- phism, then f is ca1led an isomorphism, A' is said to be isomorphic to A, and the relation is written A", A'. The composition of homomor- phisms is a homomorphism, and the identity mapping is an isomorphism. A homomor- phism of A to A itself is called an endomor- phism, and if it is also an isomorphism, then it is an automorphism. The set of all endo- morphisms of A forms a tsemigroup under 1548 composition, and the set of automorphisms forms a group under composition The con- cept of homomorphism is a fundamental concept appearing in all algebraic systems. A homomorphism is sometimes called a representation. An element e of an algebraic system A with a law of composition ab is called an identity element if ae = ea = a (for all a E A) If such an element exists, then it is unique. In the case of a ring, two laws of composition, addition and multiplication, are given. In this case the iden- tity element for multiplication (if it exists) is called the identity element (or tunity element) of the ring. In the case of homomorphism between groups, the identity element is map- ped to the identity element, but this does not always hold in general algebraic systems. Since the identity element plays an important role, it is frequently added to the basic con- cepts. Homomorphism between mathematical systems is generally defined to induce a map- ping between basic concepts. A semigroup and a ring with a unity element are called a unitary semigroup (monoid) and a unitary ring, respec- tively, and homomorphisms betwe:en these systems are restricted to mappings that map the identity element to the identity element. Let A and A' be similar algebraIc systems, and let A be a subset of A'. A is called a sub- system of A' if the mapping f: A --> A' defined by f(a)=a(aEA) is a homomorphl'sm. A sub- system of a group (ring) is called a tsubgroup (tsubring), and similarly for other algebraic systems. An tequivalence relation R in an alge- braic system A is called compatible with the law of composition if R(a, a') and R(b, b') imply R(ab,a'h'). Consider the tquotient set AIR. Then the law of composition in AIR is uniquely determined so that the mapping j:A-->AIR defined by aEf(a) is a homomor- phism. The algebraic system thus obtained is called a quotient system. A quotient sys- tem of a group (ring) is a group (ring), called a tquotient group (tquotient ring). Other cases, including those where operations are given, are treated similarly ( 52 Categories and Functors). References [IJ N. Bourbaki, Elements de mathematique, I. Theorie des ensembles, ch. 4, Structures, Actualites Sci. Ind., 1258a, Hermann, second edition, 1966; English translation, Theory of sets, Addison-Wesley, 1968. [2J C. Cheva1ley, Fundamental concepts of algebra, Academic Press, 1956. 
1549 410 (VI.21) Surfaces A. The Notion of a Surface The notion of a surface may be roughly ex- pressed by saying that by moving a curve we get a surface or that the boundary of a solid body is a surface. But these propositions can- not be considered mathematical definitions of a surface. We also make a distinction between surfaces and planes in ordinary language, where we mean by surfaces only those that are not planes. In mathematical language, how- ever, planes are usually included among the surfaces. A surface can be defined as a 2-dimensional tcontinuum, in accordance with the definition of a curve as a l-dimensional continuum. However, while we have a theory of curves based on this definition, we do not have a similar theory of surfaces thus defined ( 93 Curves). What is called a surface or a curved surface is usually a 2-dimensional ttopological mani- fold, that is, a topological space that satisfies the tsecond countability axiom and of which every point has a neighborhood thomeomor- phic to the interior of a circular disk in a 2-dimensional Euclidean space. In the follow- ing sections, we mean by a surface such a 2- dimensional topological manifold. B. Examples and Classification The simplest examples of surfaces are the 2- dimensional tsimplex and the 2-dimensional tsphere. Surfaces are generally tsimplicially decomposable (or triangulable) and hence homeomorphic to 2-dimensional polyhedra (T. Rad6, Acta Sci. Math. Szeged. (1925)). A tcom- pact surface is called a closed surface, and a noncompact surface is called an open surface. A closed surface is decomposable into a finite number of 2-simplexes and so can be inter- preted as a tcombinatorial manifold. A 2- dimensional topological manifold having a boundary is called a surface with boundary. A 2-simplex is an example of a surface with boundary, and a sphere is an example of a closed surface without boundary. Surfaces are classified as torientable and tnonorientable. In the special case when a sur- face is tembedded in a 3-dimensional Euclid- ean space £3, whether the surface is orien- table or not depends on its having two sides (the "surface" and "back") or only one side. Therefore, in this special case, an orient able surface is called two-sided, and a nonorientable 410 B Surfaces surface, one-sided. A nonorientable closed surface without boundary cannot be embed- ded in the Euclidean space £3 ( 56 Charac- teristic Classes, 114 Differential Topology). The first example of a nonorientable surface (with boundary) is the so-called Mobius strip or Mobius band, constructed as an tidenti- fication space from a rectangle by twisting through 180 0 and identifying the opposite edges with one another (Fig. 1). A f I B D I ! e f/if!j!) DB Fig. As illustrated in Fig. 2, from a rectangle ABCD we can obtain a closed surface homeo- morphic to the product space Sl x Sl by identifying the opposite edges AB with DC and BC with AD. This surface is the so-called 2-dimensional torus (or anchor ring). In this case, the four vertices A, B, C, D of the rec- tangle correspond to one point p on the sur- face, and the pairs of edges AB, DC and BC, AD correspond to closed curves a' and h' on the surface. We use the notation aha-1b- 1 to represent a torus. This refers to the fact that the torus is obtained from an oriented four- sided polygon by identifying the first side and the third (with reversed orientation), the sec- ond side and the fourth (with reversed orienta- tion). Similarly, aa- 1 represents a sphere (Fig. 3), and a1b1allbllazbza;lb;1 represents the closed surface shown in Fig. 4. A a ! B . I . e u  ---------' a' I b' P I Fig. 2 A "0" CD lJ Fig. 3 
410 B Surfaces {" b I Fig. 4 All closed surfaces without boundary are constructed by identifying suitable pairs of sides of a 2n-sided polygon in a Euclidean plane £2. Furthermore, a closed orientable surface without boundary is homeomorphic to the surface represented by aa- 1 or at hI aj1 bjl ... aphpa;;l b;l. The I-dimensional +Betti number of this surface is 2p, the O-dimensional and 2-dimen- sional tBetti numbers are I, the ttorsion coeffi- cients are all 0, and p is called the genus of the surface. Also, a closed orientable surface of genus p with boundaries C l' ... ,C k is repre- sented by W 1 C 1 wjl... Wkckwk-lalb1aj1bjl ...apbpa;;lb;l (2) (Fig. 5). A closed nonorientable surface with- out boundary is represented by a]ala2a2...aqaq' Fig. 5 1550 The I-dimensional Betti number of this surface is q - 1, the O-dimensional and 2- dimensional Betti numbers are 1 and 0, re- spectively, the I-dimensional torsIOn coeffi- cient is 2, the O-dimensional and 2 -dimensional torsion coefficients are 0, and q is called the genus of the surface. A closed nonorientable surface of gen us q with boundaries C 1, . .. ,Ck is represented by -1 -1 W1C1W[ "'WkCkWk a 1 a 1 ...a q a q . (4) Each of forms (1 )-(4) is called the normal form of the respective surface, and-the curves a;, b j , w k are called the normal sections of the surface. To explain the notation in (3), we first take the simplest case, aa. In this case, the surface is obtained from a disk by identifying each pair of points on the circumference that are end- points of a diameter (Fig. 6). The surface aa is then homeomorphic to a tprojective plane of which a decomposition into a complex of triangles is illustrated in Fig. 7. On the other hand, aabb represents a surface like that shown in Fig. 8, called the Klein bottle. Fig. 9 shows a handle, and Fig. 10 shows a cross cap. Fig. 6 (1) 4 A Fig. 7 (3) "0, {1 Fig. 8  b , \ I \ I \ 6 Fig. 9 
1551 I \ 0 ' (1////' C /// ( ,,/ JJ .1 A  A2B EJ -- 1 1 _-------_ _-------_ -= __J__ -- _____ ....- -..... -- -..... Fig. 10 The last two surfaces have boundaries; a handle is orientable, while a cross cap is non- orientable and homeomorphic to the Mobius strip. If we delete p disks from a sphere and replace them with an equal number of handles, then we obtain a surface homeomorphic to the surface represented in (1), while if we replace the disks by cross caps instead of by handles, then the surface thus obtained is homeomorphic to that represented in (3). Now we decompose the surfaces (1) and (3) into triangles and denote the number of i- dimensional simplexes by cx, (i=O, 1,2). Then in view of the tEuler-Poincare formula, the sur- faces (1) and (3) satisfy the respective formulas CXO-CX t +cx 2 =2(I-p), CXO-CX t +cx 2 =2-q. The tRiemann surfaces oftalgebraic func- tions of one complex variable are always sur- faces of type (1). and their genera p coincide with those of algebraic functions. All closed surfaces are homeomorphic to surfaces of types (1), (2), (3), or (4). A necessary and sufficient condition for two surfaces to be homeomorphic to each other is coincidence of the numbers of their boundaries, their orienta- bility or nonorientability, and their genera (or tEuler characteristic CXO - cx t + cx 2 ). This propo- sition is called the fundamental theorem of the topology of surfaces. The thomeomorphism problem of closed surfaces is completely solved by this theorem. The same problem for n (n  3) manifolds, even if they are compact, remains open. (For surface area  246 Length and Area. For the differential geometry of surfaces  111 Differential Geometry of Curves and Surfaces.) References [IJ B. Kerekjart6, Vorlesungen iiber Topo- logie, Springer, 1923. [2J H. Seifert and W. Threlfall, Lehrbuch der Topologie, Teubner, 1934 (Chelsea, 1945). [3J S. Lefschetz, Introduction to topology, Princeton Univ. Press, 1949. 411 B Symbolic Logic [4J D. Hilbert and S. Cohn-Vossen, Anschau- liche Geometrie, Springer, 1932; English translation, Geometry and the imagination, Chelsea, 1952. [5J W. S. Massey, Algebraic topology: An introduction, Springer, 1967. [6J E. E. Moise, Geometric topology in dimen- sions 2 and 3, Springer, 1977. 411 (1.4) .......a-......I:... I ......_: JIIIUUII\i LU!:II\i A. General Remarks Symbolic logic (or mathematical logic) is a field of logic in which logical inferences commonly used in mathematics are investigated by use of mathematical symbols. The algebra of logic originally set forth by G. Boole [lJ and A. de Morgan [2J is actually an algebra of sets or relations; it did not reach the same level as the symbolic logic of today. G. Frege, who dealt not only with the logic of propositions but also with the first-order predicate logic using quantifiers ( Sections C and K), should be regarded as the real originator of symbolic logic. Frege's work, however, was not recognized for some time. Logical studies by C. S. Peirce, E. Schroder, and G. Peano appeared soon after Frege, but they were limited mostly to propositions and did not develop Frege's work. An essential development of Frege's method was brought about by B. Russell, who, with the collabor- ation of A. N. Whitehead, summarized his results in Principia mathematica [4], which seemed to have completed the theory of sym- bolic logic at the time of its appearance. B. Logical Symbols If A and B are propositions, the propositions (A and B), (A or B), (A implies B), and (not A) are denoted by A i\ B, A v B, A --. B, I A, respectively. We call I A the negation of A, A i\ B the conjunction (or logical product), A v B the disjunction (or logical sum), and A--.B the implication (or B by A). The propo- sition (A--.B)i\(B--.A) is denoted by A+-+B and is read "A and B are equivalent." A v B means that at least one of A and B holds. The propositions (For all x, the proposition F(x) holds) and (There exists an x such that F(x) holds) are denoted by V'xF(x) and 3xF(x), respectively. A proposition of the form V'xF(x) 
411 C Symbolic Logic is ca11ed a universal proposition, and one of the form :lxF(x), an existential proposition. The symbols A, v, --+, +-+, I, V, :I are caned log- ical symbols. There are various other ways to denote logical symbols, including: AAB: A&B, A-B, AvB: A+B, A--+B: A :::JB, A =B, A+-+B: ApB, A=B, AB, A:::J cB, A=B, IA: A, A, VxF(x): (x)F(x), I1xF(x), j\xF(x), :lxF(x): (Ex)F(x), l:xF(x), VxF(x). C. Free and Bound Variables Any function whose values are propositions is called a propositional function. 'Ix and :Ix can be regarded as operators that transform any propositional function F(x) into the propo- sitions VxF(x) and :lxF(x), respectively. 'Ix and :Ix are called quantifiers; the former is called the universal quantifier and the latter the existential quantifier. F(x) is transformed into VxF(x) or :lxF(x) just as a function /(x) is transformed into the definite integral Jd f(x)dx; the resultant propositions VxF(x) and :lxF(x) are no longer functions of x. The variable x in VxF(x) and in :lxF(x) is called a bound variable, and the variable x in F(x), when it is not bound by 'Ix or :Ix, is called a free variable. Some people employ different kinds of symbols for free variables and bound variables to avoid confusion. D. Formal Expressions of Propositions A formal expression of a proposition in terms oflogical symbols is called a formula. More precisely, formulas are constructed by the following formation rules: (1) If  is a formula, 1  is also a formula. If  and  are for- mulas,  A 'B,  v ,  --+ 'B are all formulas. (2) If 3'(a) is a formula and a is a free variable, then Vx3'(x) and :lx3'(x) are formulas, where x is an arbitrary bound variable not contained in 3'(a) and 3'(x) is the result of substituting x for a throughout 3'(a). We use formulas of various scope accord- ing to different purposes. To indicate the scope of formulas, we fix a set of formulas, each element of which is called a prime formula (or atomic formula). The scope of formulas is the set of formulas obtained from the prime for- mulas by formation rules (1) and (2). 1552 E. Propositional Logic Propositional logic is the field in symbolic logic in which we study relations between propositions exclusively in connection with the four logical symbols A, v, --+, and I, called propositional connectives. In propositional logic, we deal only with operations of logical operators denoted by propositional connectives, regarding the vari- ables for denoting propositions, caned propo- sition variables, only as prime formulas. We examine problems such as: What kinds of formulas are identically true when their propo- sition variables are replaced by any propo- sitions, and what kinds of formulas can some- times be true? Consider the two symbols Y and A, read true and false, respectively, and let A = {Y, A}. A univalent function from A, or more generally from a Cartesian product A x ... x A, into A is caned a truth function. We can regard A, v, --+, 1 as the following truth functions: (1) A A B= Y for .4 = B= Y, and A A B = A otherwise; (2) A vII = A for A = B = A, and A v B = Y otherwise; (3) A--+B= A for A = Y and B= A, and A--+B= Y otherwise; (4) IA = A for A = Y, and 1 A = Y for A = A. If we regard proposition variables as vari- ables whose domain is A, then each formula represents a truth function. Conversely, any truth function (of a finite number of indepen- dent variables) can be expressed by an appro- priate formula, although such a formula is not uniquely determined. If a formula is regarded as a truth function, the value of the function determined by a combination of values of the independent variables involved in the formula is called the truth value of the formula. A formula corresponding to a truth function that takes only Y as its value is called a tau- tology. For example, v I and «--+) --+)--+  are tautologies. Since a truth func- tion with n independent variables takes values corresponding to 2" combinations of truth values of its variables, we can determine in a finite number of steps whether a given formula is a tautology. If +-+ is a tautology (that is,  and 'B correspond to the same truth func- tion), then the formulas  and 'B are said to be equivalent. F. Propositional Calculus It is possible to choose some specific tau- tologies, designate them as axioms, and derive a11 tautologies from them by appropriately given rules of inference. Such a system is called a propositional calculus. There are many ways 
1553 to stipulate axioms and rules of inference for a propositional calculus. The abovementioned propositional calculus corresponds to the so-ca1\ed classical pro po- sitionallogic ( Section L). By choosing ap- propriate axioms and rules of inference we can also forma1\y construct intuitionistic Or other propositional logics. In intuitionistic logic the law of the texcluded middle is not accepted, and hence it is impossible to formalize intui- tionistic propositional logic by the notion of tautology. We therefore usua1\y adopt the method of propositional calculus, instead of using the notion of tautology, to formalize intuitionistic propositional logic. For example, V. I. Glivenko's theorem [5J, that if a formula { can be proved in classical logic, then I lilt can be proved in intuitionistic logic, was ob- tained by such formalistic considerations. A method of extending the classical concepts of truth value and tautology to intuitionistic and other logics has been obtained by S. A. Kripke. There are also studies of logics inter- mediate between intuitionistic and classical logic (T. Umezawa). G. Predicate Logic Predicate logic is the area of symbolic logic in which we take quantifiers in account. Mainly propositional functions are discussed in predi- cate logic. In the strict sense only single- variable propositional functions are called predicates, but the phrase predicate of n argu- ments (or n-ary predicate) denoting an n- variable propositional function is also em- ployed. Single-variable (or unary) predicates are also ca1\ed properties. We say that a has the property F if the proposition F(a) formed by the property F is true. Predicates of two arguments are ca1\ed binary relations. The proposition R(a, b) formed by the binary re- lation R is occasiona1\y expressed in the form aRb. Genera1\y, predicates of n arguments are ca1\ed n-ary relations. The domain of defini- tion of a unary predicate is ca1\ed the object domain, elements of the object domain are ca1\ed objects, and any variable running over the object domain is ca1\ed an object variable. We assume here that the object domain is not empty. When we deal with a number of predi- cates simultaneously (with different numbers of variables), it is usual to arrange things so that all the independent variables have the same object domain by suitably extending their object domains. Predicate logic in its purest sense deals exclusively with the general properties of quantifiers in connection with propositional connectives. The only objects dealt with in this 411 H Symbolic Logic field are predicate variables defined over a certain common domain and object variables running over the domain. Propositional vari- ables are regarded as predicates of no vari- ables. Each expression F(a 1 , ...,a n ) for any predicate variable F of n variables a [, .,. ,an (object variables designated as free) is regarded as a prime formula (n = 0, 1,2, ...), and we deal exclusively with formulas generated by these prime formulas, where bound variables are also restricted to object variables that have a common domain. We give no specification for the range of objects except that it be the com- mon domain of the object variables. By designating an object domain and sub- stituting a predicate defined over the domain for each predicate variable in a formula, we obtain a proposition. By substituting further an object (object constant) belonging to the object domain for each object variable in a proposition, we obtain a proposition having a definite truth value. When we designate an object domain and further associate with each predicate variable as well as with each object variable a predicate or an object to be sub- stituted for it, we ca1\ the pair consisting of the object domain and the association a model. Any formula that is true for every model is called an identically true formula or valid formula. The study of identica1\y true formu- las is one of the most important problems in predicate logic. H. Formal Representations of Mathematical Propositions To obtain a formal representation of a math- ematical theory by predicate logic, we must first specify its object domain, which is a non- empty set whose elements are ca1\ed individ- uals; accordingly the object domain is called the individual domain, and object variables are called individual variables. Secondly we must specify individual symbols, function symbols, and predicate symbols, signifying specific indi- viduals, functions, and tpredicates, respectively. Here a function of n arguments is a univa- lent mapping from the Cartesian product D x '" x D of n copies of the given set to D. Then we define the notion of term as in the next paragraph to represent each individual forma1\y. Fina1\y we express propositions for- mally by formulas. Definition of terms (formation rule for terms): (1) Each individual symbol is a term. (2) Each free variable is a term. (3) f(t [, ..., t n ) is a term if t [, ... , t n are terms and .f is a function symbol of n arguments. (4) The only terms are those given by (1)-(3). As a prime formula in this case we use any 
411 I Symbolic Logic formula of the form F(t l' ..., t n ), where F is a predicate symbol of n arguments and t 1, ..., t n are arbitrary terms. To define the notions of term and formula, we need logical symbols, free and bound individual variables, and also a list of individual symbols, function symbols, and predicate symbols. In pure predicate logic, the individual domain is not concrete, and we study only general forms of propositions. Hence, in this case, predicate or function symbols are not representations of concrete predicates or func- tions but are predicate variables and function variables. We also use free individual variables instead of individual symbols. In fact, it is now most common that function variables are dispensed with, and only free individual vari- ables are used as terms. I. Formulation of Mathematical Theories To formalize a theory we need axioms and rules of inference. Axioms constitute a certain specific set of formulas, and a rule of inference is a rule for deducing a formula from other formulas. A formula is said to be provable if it can be deduced from the axioms by repeated application of rules of inference. Axioms are divided into two types: logical axioms, which are common to aII theories, and mathematical axioms, which are peculiar to each individual theory. The set of mathematical axioms is calIed the axiom system of the theory. (I) Logical axioms: (1) A formula that is the result of substituting arbitrary formulas for the proposition variables in a tautology is an axiom. (2) Any formula of the form Itx6(X)->6(t) or 6(t)->3x6(X) is an axiom, where 6(t) is the result of sub- stituting an arbitrary term t for x in 6(X). (II) Rules of inference: (I) We can deduce a formula 'B from two formulas III and 111->'B (modus ponens). (2) We can deduce 1l!->ltx6(X) from a formula 111->6(a) and 3X6(x)->111 from 6(a)->IlI, where £1 is a free individual variable contained in neither Il! nor 6(x) and 6(a) is the result of substituting a for x in 6(X). If an axiom system is added to these logical axioms and rules of inference, we say that a formal system is given. A formal system S or its axiom system is said to be contradictory or to contain a con- tradiction if a formula Il! and its negation I \U are provable; otherwise it is said to be consis- tent. Since (\!! A Ill!) -> 'B is a tautology, we can show that any formula is provable in a formal system containing a 1554 contradiction. The validity of a proof by reductio ad absurdum lies in the fact that (\!!->('BA ,IS))-> I\!! is a tautology. An affirmative proposition (formula) may be obtained by reductio ad absurdum since the formula (of propositional logic) representing the discharge of double negation I 11l!->111 is a tautology. J. Predicate Calculus If a formula has no free individual variable, we call it a closed formula. Now we consider a formal system S whose mathematical axioms are closed. A formula \!! is provable in S if and only if there exist suitable mathematical axioms {£1' "', {£n such that the formula ({£1 A ... A {£n)->\!I is provable without the use of mathematical axioms. Since any axiom system can be re- placed by an equivalent axiom system contain- ing only closed formulas, the study of a formal system can be reduced to the study of pure logic. In the folIowing we take no individual sym- bols or function symbols into cOl1sideration and we use predicate variables as predicate symbols in accordance with the commonly accepted method of stating properties of the pure predicate logic; but only in the case of predicate logic with equality will we use predi- cate variables and the equality predicate = as a predicate symbol. However, we can safely state that we use function variables as function symbols. The formal system with no mathematical axioms is called the predicate calculus. The formal system whose mathematical axioms are the equality axioms a=£1, a=b -> (6(a)->6(b)) is calIed the predicate calculus with equality. In the following, by being provable we mean being provable in the predicate calculus. (1) Every provable formula is valid. (2) Conversely, any valid formula is prov- able (K. Godel [6J). This fact is called the completeness of the predicate calculus. In fact, by Godel's proof, a formula Il! is provable if III is always true in every interpretation whose individual domain is of tcountable cardinality. In another formulation, if I \U is not provable, the formula III is a true proposition in some interpretation (and the individual domain in this case is of countable cardinality). We can 
1555 extend this result as follows: If an axiom sys- tem generated by countably many closed formulas is consistent, then its mathematical axioms can be considered true propositions by a common interpretation. In this sense, GOdel's completeness theorem gives another proof of the tSkolem-Lowenheim theorem. (3) The predicate calculus is consistent. Although this result is obtained from (I) in this section, it is not difficult to show it directly (D. Hilbert and W. Ackermann [7]). (4) There are many different ways of giving logical axioms and rules of inference for the predicate calculus. G. Gentzen gave two types of systems in [8]; one is a natural deduction system in which it is easy to reproduce formal proofs directly from practical ones in math- ematics, and the other has a logically simpler structure. Concerning the latter, Gentzen proved Gentzen's fundamental theorem, which shows that a formal proof of a formula may be translated into a "direct" proof. The theorem itself and its idea were powerful tools for ob- taining consistency proofs. (5) If the proposition 3xill(x) is true, we choose one of the individuals x satisfying the condition ill(x), and denote it by Exill(x). When 3x\!1(x) is false, we let f:XI,U(X) represent an arbitrary individual. Then 3xlll(x)--till(c:xIlI(x) ) is true. We consider f:X to be an operator as- sociating an individual Ex\!l(x) with a propo- sition \ll(x) containing the variable x. Hilbert called it the transfinite logical choice function; today we call it Hilbert's f:-operator (or E- quantifier), and the logical symbol E used in this sense Hilbert's f;-symbol. Using the E- symbol, 3x\!l(x) and Itx\!l(x) are represented by 1l1(f:x\!l(x)), \!I(r.x I \ll(x)), respectively, for any \!I(x). The system of predi- cate calculus adding formulas of the form (I) as axioms is essentially equivalent to the usual predicate calculus. This result, called the c- theorem, reads as follows: When a formula (£ is provable under the assumption that every formula of the form (I) is an axiom, we can prove (£ using no axioms of the form (1) if (£ contains no logical symbol E (D. Hilbert and P. Bernays [9]). Moreover, a similar theorem holds when axioms of the form It x (ill (x)......'B (x) )--t f:x\!1 (x) = r.x'B(x) are added (S. Maehara [IOJ). (6) For a given formula ill, call \II' a normal form of III when the formula \!I......ill' is provable and ill' satisfies a particular con- dition. For example, for any formula \!I there i 411 K Symbolic Logic a normal form ill' satisfying the condition: Ill' has the form Q1 X 1'" Qnxn'B(x 1 , ...,x n ), (1) where Qx means a quantifier Itx or 3x, and 'B(x 1, "', x n ) contains no quantifier and has no predicate variables or free individual variables not contained in L A normal form of this kind is called a prenex normal form. (7) We have dealt with the classical first- order predicate logic until now. For other predicate logics (--t Sections K and L) also, we can consider a predicate calculus or a formal system by first defining suitable axioms or rules of inference. Gentzen's fundamental theorem applies to the intuitionistic predicate calculus formulated by V. I. Glivenko, A. Heyting, and others. Since Gentzen's funda- mental theorem holds not only in classical logic and intuitionistic logic but also in several systems of first-order predicate logic or pro- positional logic, it is useful for getting results in modal and other logics (M. Ohnishi, K. Matsumoto). Moreover, Glivenko's theorem in propositional logic [5] is also extended to predicate calculus by using a rather weak representation (S. Kuroda [12]). G. Takeuti expected that a theorem similar to Gentzen's fundamental theorem would hold in higher- order predicate logic also, and showed that the consistency of analysis would follow if that conjecture could be verified [13]. More- over, in many important cases, he showed constructively that the conjecture holds par- tially. The conjecture was finally proved by M. Takahashi [14] by a nonconstructive method. Concerning this, there are also con- tributions by S. Maehara, T. Simauti, M. Yasuhara, and W. Tail. K. Predicate Logics of Higher Order (2) In ordinary predicate logic, the bound vari- ables are restricted to individual variables. In this sense, ordinary predicate logic is called first-order predicate logic, while predicate logic dealing with quantifiers It P or 3P for a predi- cate variable P is called second-order predicate logic. Generalizing further, we can introduce the so-called third-order predicate logic. First we fix the individual domain Do. Then, by intro- ducing the whole class D of predicates of n variables, each running over the object domain Do, we can introduce predicates that have D as their object domain. This kind of predicate is called a second-order predicate with respect to the individual domain Do. Even when we restrict second-order predicates to one- variable predicates, they are divided into vari- 
411 L Symbolic Logic ous types, and the domains of independent variables do not coincide in the case of more than two variables. In contrast, predicates having Do as their object domain are called first-order predicates. The logic having quan- tifiers that admit first-order predicate variables is second-order predicate logic, and the logic having quantifiers that admit up to second- order predicate variables is third-order predi- cate logic. Similarly, we can define further higher-order predicate logics. Higher-order predicate logic is occasionally called type theory, because variables arise that are classified into various types. Type theory is divided into simple type theory and ramified type theory. We confine ourselves to variables for single- variable predicates, and denote by P such a bound predicate variable. Then for any for- mula (a) (with a a free individual variable), the formula 3Plfx(P(x)......(x)) is considered identically true. This is the point of view in simple type theory. Russell asserted first that this formula can- not be used reasonably if quantifiers with respect to predicate variables occur in (x). This assertion is based on the point of view that the formula in the previous paragraph asserts that (x) is a first-order predicate, whereas any quantifier with respect to first- order predicate variables, whose definition assumes the totality of the first-order predi- cates, should not be used to introduce the first- order predicate (x). For this purpose, Russell further classified the class of first-order predi- cates by their rank and adopted the axiom 3pklfx(pk(X)......(x) ) for the predicate variable pk of rank k, where the rank i of any free predicate variable OCCur- ring in (x) is <£; k, and the rank j of any bound predicate variable occurring in (x) is < k. This is the point of view in ramified type theory, and we still must subdivide the types if we deal with higher-order propositions or propositions of many variables. Even Russell, having started from his ramified type theory, had to introduce the axiom of reducibility afterwards and reduce his theory to simple type theory. L. Systems of Logic Logic in the ordinary sense, which is based on the law of the excluded middle asserting that every proposition is in principle either true or false, is called classical logic. Usually, propo- 1556 sitionallogic, predicate logic, and type theory are developed from the standpoint of classical logic. Occasionally the reasoning of intuition- is tic mathematics is investigated using sym- bolic logic, in which the law of the excluded middle is not admitted ( 156 Foundations of Mathematics). Such logic is called intuitionistic logic. Logic is also subdivided into proposi- tional logic, predicate logic, etc., according to the extent of the propositions (formulas) dealt with. To express modal propositions stating possi- bility, necessity, etc., in symbolic logic, J. Lu- kaszewicz proposed a propositional logic called three-valued logic, having a third truth value, neither true nor false. More generally, many- valued logics with any number of truth values have been introduced; classical logic is one of its special cases, two-valued logic with two truth values, true and false. Actually, however, many-valued logics with more than three truth values have not been studied much, while various studies in modal logic based on classi- cal logic have been successfully carried out. For example, studies of strict imlllication belong to this field. References [IJ G. Boole, An investigation of the laws of thought, Walton and Maberly, 1854. [2J A. de Morgan, Formal logic, or the cal- culus of inference, Taylor and Walton, 1847. [3J G. Frege, Begriffsschrift, eine der arith- metischen nachgebildete Formalsprache des rein en Denkens, Halle, 1879. [4J A. N. Whitehead and B. Russell, Principia mathematica I, II, III, Cambridge Univ. Press, 1910-1913; second edition, 1925-1927. [5J V. Glivenko, Sur quelques points de la logique de M. Brouwer, Acad. Roy. de Bel- gique, Bulletin de la cia sse des sciences, (5) 15 (1929),183-188. [6J K. Godel, Die Vollstandigkeit der Axiome des logischen Funktionenkalkiils, Monatsh. Math. Phys., 37 (1930), 349-360. [7J D. Hilbert and W. Ackermann, Grundziige der theoretischen Logik, Springer, 1928, sixth edition, 1972; English translation, Principles of mathematical logic, Chelsea, 1950. [8J G. Gentzen, Untersuchungen iiber das logische Schliessen, Math. Z., 39 (1935), 176- 210,405-431. [9J D. Hilbert and P. Bernays, Grundlagen der Mathematik II, Springer, 1939; second edition, 1970. [I OJ S. Maehara, Equality axiom on Hilbert's £-symbol, J. Fac. Sci. Univ. Tokyo, (I), 7 (1957), 419-435. 
1557 [IIJ A. Heyting, Die formalen Regeln der intuitionistischen Logik I, S.-B. Preuss. Akad. Wiss., 1930,42-56. [12J S. Kuroda, Intuitionistische Untersu- chungen der formalistischen Logik, Nagoya Math. J., 2 (1951),35-47. [13J G. Takeuti, On a generalized logic cal- culus, Japan. J. Math., 23 (1953). 39-96. [14J M. Takahashi, A proof of the cut- elimination theorem in simple type-theory, J. Math. Soc. Japan, 19 (1967), 399-410. [15J S. C. Kleene, Mathematical logic, Wiley, 1967. [16J 1. R. Shoenfield, Mathematical logic, Addison-Wesley, 1967. [17J R. M. Smullyan, First-order logic, Sprin- ger, 1968. 412 (IV.13) Symmetric Riemannian Spaces and Real Forms A. Symmetric Riemannian Spaces Let M be a tRiemannian space. For each point p of M we can define a mapping rIp ofa suit- able neighborhood Up of p onto Up itself so that rIp (x,) = X _" where x, (It I < G, Xo = p) is any tgeodesic passing through the point p. We call M a locally symmetric Riemannian space if for any point p of M we can choose a neighbor- hood Up so that rIp is an tisometry of Up' In order that a Riemannian space M be locally symmetric it is necessary and sufficient that the tcovariant differential (with respect to the tRiemannian connection) of the tcurvature tensor of M be 0. A locally symmetric Riemann- ian space is a treal analytic manifold. We say that a Riemannian space M is a globally sym- metric Riemannian space (or simply symmetric Riemannian space) if M is connected and if for each point p of M there exists an isometry rIp of M onto M itself that has p as an isolated fixed point (i.e., has no fixed point except p in a certain neighborhood of p) and such that rI is the identity transformation on M. In this case rIp is called the symmetry at p. A (globally) symmetric Riemannian space is 10cal1y sym- metric and is a tcomplete Riemannian space. Conversely, a tsimply connected complete locally symmetric Riemannian space is a (glob- ally) symmetric Riemannian space. B. Symmetric Riemannian Homogeneous Spaces A thomogeneous space GIK of a connected tLie group G is a symmetric homogeneous 412 C Symmetric Riemannian Spaces and Real Forms space (with respect to 0) if there exists an in- volutive automorphism (i.e., automorphism of order 2) e of G satisfying the condition K c K eKe, where Ke is the closed subgroup con- sisting of all elements of G left fixed by 0 and K is the connected component of the iden- tity element of Ke. In this case, the mapping aK-.e(a)K (aEG) is a transformation of GIK having the point K as an isolated fixed point; more generally, the mapping Oa o : aK-. aoO(ao)-1 O(a)K is a transformation of GIK that has an arbitrary given point aoK of GI K as an isolated fixed point. If there exists a G- invariant Riemannian metric on GIK, then GIK is a symmetric Riemannian space with symmetries {e ao I aoE G} and is called a sym- metric Riemannian homogeneous space. A sufficient condition for a symmetric homoge- neous space GIK to be a symmetric Riemann- ian homogeneous space is that K be a com- pact subgroup. Conversely, given a symmetric Riemannian space M, let G be the connected component of the identity element of the Lie group formed by all the isometries of M; then M is represented as the symmetric Riemannian homogeneous space M = GIK and K is a com- pact group. In particular, a symmetric Rie- mannian space can be regarded as a Riemann- ian space that is realizable as a symmetric Riemannian homogeneous space. The Riemannian connection of a symme- tric Riemannian homogeneous space GIK is uniquely determined (independent of the choice of G-invariant Riemannian metric), and a geodesic x,( I t I < CfJ, Xo = a o K) passing through a point aoK of GIK is of the form x,=(exptX)aoK. Here X is any element of the Lie algebra 9 of G such that O(X) = - X, where o also denotes the automorphism of 9 induced by the automorphism 0 of G and exptX is the tone-parameter subgroup of G defined by the element X. The covariant differential of any G- invariant tensor field on GIK is 0, and any G- invariant tdifferential form on GIK is a closed differential form. C. Classification of Symmetric Riemannian Spaces The tsimply connected tcovering Riemannian space of a symmetric Riemannian space is also a symmetric Riemannian space. Therefore the problem of classifying symmetric Riemannian spaces is reduced to classifying simply con- nected symmetric Riemannian spaces M and determining tdiscontinuous groups of iso- metries of M. When we take the tde Rham decomposition of such a space M and repre- sent M as the product of a real Euclidean space and a number of simply connected irre- 
412 D Symmetric Riemannian Spaces and Real Forms ducible Riemannian spaces, all the factors are symmetric Riemannian spaces. We say that M is an irreducible symmetric Riemannian space if it is a symmetric Riemannian space and is irreducible as a Riemannian space. A simply connected irreducible symmetric Riemannian space is isomorphic to one of the following four types of symmetric Riemannian homogeneous spaces (here Lie groups are always assumed to be connected): (1) The symmetric Riemannian homoge- neous space (G x G)/{(a,a)laEG} of the direct product G x G, where G is a simply connected compact tsimple Lie group and the involutive automorphism of G x G is given by (a, b)-+(b, a) «(a,b)EG x G). This space is isomorphic, as a Riemannian space, to the space G obtained by introducing a two-sided invariant Riemannian metric on the group G; the isomorphism is induced from the mapping G x G3(a,b)-+ ab- 1 EG. (2) A symmetric homogeneous space GIKe of a simply connected compact simple Lie group G with respect to an involutive auto- morphism () of G. In this case, the closed sub- group Ke= {aEGI lI(a) = a} of G is connected. We assume here that () is a member of the given complete system of representatives of the tconjugate classes formed by the elements of order 2 in the automorphism group of the group G. (3) The homogeneous space GC/G, where G C is a complex simple Lie group whose tcenter reduces to the identity element and G is an arbitrary but fixed maximal compact subgroup ofG c (4) The homogeneous space GoIK, where Go is a noncompact simple Lie group whose center reduces to the identity element and which has no complex Lie group structure, and K is a maximal compact subgroup of G. In Section D we shall see that (3) and (4) are actually symmetric homogeneous spaces. All four types of symmetric Riemannian spaces are actually irreducible symmetric Riemannian spaces, and G-invariant Riemannian metrics on each of them are uniquely determined up to multiplication by a positive number. On the other hand, (I) and (2) are compact, while (3) and (4) are homeomorphic to Euclidean spaces and not compact. For spaces of types (!) and (3) the problem of classifying simply connected irreducible symmetric Riemannian spaces is reduced to classifying tcompact real simple Lie algebras and tcomplex simple Lie algebras, respectively, while for types (2) and (4) it is reduced to the classification of noncom pact real simple Lie algebras ( Section D) (for the result of classification of these types  Ap- pendix A, Table 5.I1). On the other hand, any (not necessarily simply connected) irreducible 1558 symmetric Riemannian space defines one of (1)-(4) as its tuniversal covering manifold; if the covering manifold is of type (3) or (4), the original symmetric Riemannian space is neces- sarily simply connected. D. Symmetric Riemannian Homogeneous Spaces of Semisimple Lie Groups In Section C we saw that any irreducible sym- metric Riemannian space is representable as a symmetric Riemannian homogeneous space GIK on which a connected semisimple Lie group G acts talmost effectively (  249 Lie Groups). Among symmetric Riemannian spaces, such a space M = GIK is characterized as one admitting no nonzero vector field that is tparallel with respect to the Riemannian connection. Furthermore, if G acts effectively on M, G coincides with the connected compo- nent I(M)o of the identity element of the Lie group formed by all the isometries of M. We let M=GIK be a symmetric Riemann- ian homogeneous space on which a con- nected semisimple Lie group G acts almost effectively. Then G is a Lie group I:hat is tlocally isomorphic to the group I(M)o, and therefore the Lie algebra of G is determined by M. Let 9 be the Lie algebra of G, f be the subalgebra of 9 corresponding to K, and () be the involutive automorphism of G defining the symmetric homogeneous space GIK. The automorphism of 9 defined by II is also denoted by II. Then f = {X Eg I II(X)=X}. Putting m = {X Egi 8(X) = - X}, we have !1 = m + f (direct sum of linear spaces), and m can be identified in a natural way with the tangent space at the point K of GIK. The tadjoint representation of G gives rise to a representation of Kin !1, which in- duces a linear representation Ad l1 .(k) of K in m. Then {Adm(k)lkEK} coincides with the tres- tricted homogeneous holonomy group at the point K of the Riemannian space GIK. Now let !{J be the tKilling form of g. Then f and m are mutually orthogonal with respect to !{J, and denoting by !{Jf and !{Jm the restrictions of!{J to f and m, respectively, !{Jf is a negative definite quadratic form on 1. If !{Jm is also a negative definite quadratic form on m, !1 is a compact real semisimple Lie algebra and GIK is a compact symmetric Riemannian space; in this case we say that GI K is of compact type. In the opposite case, where !(Jm is a tpositive definite quadratic form, G I K is said to be of noncom pact type. In this latter case, GIK is homeomo'rphic to a Euclidean space, and if the center of G is finite, K is a maximal com- pact subgroup of G. Furthermore, the group ofisometries I(GIK) ofGIK is canonically 
1559 isomorphic to the automorphism group of the Lie algebra g. When GI K is of compact type (noncompact type), there exists one and only one G-invariant Riemannian metric on GIK, which induces in the tangent space m at the point K the positive definite inner prod uct - !Pitt (!p",). A symmetric Riemannian homogeneous space GIKe of compact type defined by a sim- ply connected compact semisimple Lie group G with respect to an involutive automorphism o is simply connected. Let g = m + fe be the de- composition of the Lie algebra g of G with respect to the automorphism 0 of g, and let gC be the tcomplex form of g. Then the real sub- space ge=j -I m+fe in gC is a real semi- simple Lie algebra and a treal form of gC. Let G e be the Lie group corresponding to the Lie algebra ge with center reduced to the identity element, and let K be the subgroup of G e cor- responding to fe. Then we get a (simply con- nected) symmetric Riemannian homogeneous space of noncompact type Gel K. When we start from a symmetric Riemann- ian space of noncompact type GI K instead of the symmetric Riemannian space of compact type GI Ke and apply the same process as in the previous paragraphs, taking a simply connected G e as the Lie group corresponding to ge, we obtain a simply connected symmetric Riemannian homogeneous space of compact type. Indeed, each of these two processes is the reverse of the other, and in this way we get a one-to-one correspondence between simply connected symmetric Riemannian homoge- neous spaces of compact type and those of noncompact type. This relationship is called duality for symmetric Riemannian spaces; when two symmetric Riemannian spaces are related by duality, each is said to be the dual of the other. If one of the two symmetric Riemannian spaces related by duality is irreducible, the other is also irreducible. The duality holds between spaces of types (1) and (3) and be- tween those of types (2) and (4) described in Section C. This fact is based on the following theorem in the theory of Lie algebras, where we identify isomorphic Lie algebras. (i) Com- plex simple Lie algebras gC and compact real simple Lie algebras g are in one-to-one corre- spondence by the relation that gC is the com- plex form of g. (ii) Form the Lie algebra ge in the above way from a compact real simple Lie algebra g and an involutive automorphism (j of g. We assume that () is a member of the given complete system of representatives of conjugate classes of involutive automorphisms in the automorphism group of g. Then we get from the pair (g, 0) a noncompact real simple Lie algebra ge, and any noncompact real 412 E Symmetric Riemannian Spaces and Real Forms simple Lie algebra is obtained by this process in one and only one way. Consider a Riemannian space given as a symmetric Riemannian homogeneous space M = G I K with a semisimple Lie group G, and let K be the tsectional curvature of M. Then if M is of compact type the value of K is ;::, 0, and if M is of noncompact type it is :;:;; O. On the other hand, the rank of M is the (unique) di- mension of a commutative subalgebra of g that is contained in and maximal in m. (For results concerning the group of isometries of M, distribution of geodesics on M, etc.  [3].) E. Symmetric Hermitian Spaces A connected tcomplex manifold M with a tHermitian metric is called a symmetric Her- mitian space if for each point p of M there exists an isometric and tbiholomorphic trans- formation of M onto M that is of order 2 and has p as an isolated fixed point. As a real ana- lytic manifold, such a space M is a symmetric Riemannian space of even dimension, and the Hermitian metric of M is a tKiihler metric. Let I(M) be the (not necessarily connected) Lie group formed by all isometries of M, and let A(M) be the subgroup consisting of all holo- morphic transformations in I(M). Then A(M) is a closed Lie subgroup of I(M). Let G be the connected component A (M)o of the identity element of A(M). Then G acts transitively on M, and M is expressed as a symmetric Rie- mannian homogeneous space GIK. Under the de Rham decomposition of a simply connected symmetric Hermitian space (regarded as a Riemannian space), all the factors are symmetric Hermitian spaces. The factor that is isomorphic to a real Euclidean spaces as a Riemannian space is a symmetric Hermitian space that is isomorphic to the complex Euclidean space cn. A symmetric Hermitian space defining an irreducible sym- metric Riemannian space is called an irreduc- ible symmetric Hermitian space. The problem of classifying symmetric Hermitian spaces is thus reduced to classifying irreducible sym- metric Hermitian spaces. In general, if the symmetric Riemannian space defined by a symmetric Hermitian space M is represented as a symmetric Riemannian homogeneous space GIK by a connected semi- simple Lie group G acting effectively on M, then M is simply connected, G coincides with the group A(M)o introduced in the previous paragraph, and the center of K is not a tdis- crete set. In particular, an irreducible sym- metric Hermitian space is simply connected. Moreover, in order for an irreducible symmetric Riemannian homogeneous space GIK to be defined by an irreducible symmetric Hermitian 
412 F Symmetric Riemannian Spaces and Real Forms space M, it is necessary and sufficient that the center of K not be a discrete set. If G acts effectively on M, then G is a simple Lie group whose center is reduced to the identity ele- ment, and the center of K is of dimension 1. For a space GIK satisfying these conditions, there are two kinds of structures of symmetric Hermitian spaces defining the Riemannian structure of G I K. As follows from the classification of irre- ducible symmetric Riemannian spaces, an irreducible Hermitian space defines one of the following symmetric Riemannian homogeneous spaces, and conversely, each of these homoge- neous spaces is defined by one of the two kinds of symmetric Hermitian spaces. (I) The symmetric homogeneous space GIK of a compact simple Lie group G with respect to an involutive automorphism 0 such that the center of G reduces to the identity element and the center of K is not a discrete set. Here (] may be assumed to be a representative of a conjugate class of involutive automorphisms in the automorphism group of G. (II) The homogeneous space GolK of a noncompact simple Lie group Go by a maxi- mal compact subgroup K such that the center of Go reduces to the identity element and the center of K is not a discrete set. An irreducible symmetric Hermitian space of type (I) is compact and is isomorphic to a trational algebraic variety. An irreducible symmetric Hermitian space of type (II) is homeomorphic to a Euclidean space and is isomorphic (as a complex manifold) to a bounded domain in C" (Section F). By the same principle as for irreducible symmetric Riemannian spaces, a duality holds for irreducible symmetric Hermitian spaces which establishes a one-to-one correspondence between the spaces of types (I) and (II). Fur- thermore, an irreducible symmetric Hermitian space M h of type (II) that is dual to a given irreducible symmetric Hermitian space Ma = GIK of type (I) can be realized as an open complex submanifold of Ma in the following way. Let G C be the connected component of the identity element in the Lie group formed by all the holomorphic transformations of Ma. Then G C is a complex simple Lie group con- taining G as a maximal compact subgroup, and the complex Lie algebra gC of G C contains the Lie algebra 9 of G as a real form. Let (] be the involutive automorphism of G defining the symmetric homogeneous space GIK, and let 9 = m + f be the decomposition of 9 determined by O. We denote by Go the real subgroup of G C corresponding to the real form 90 = P m + f of gC. Then Go (i) is a closed subgroup of G C whose center reduces to the identity ele- ment and (ii) contains K as a maximal com- 1560 pact subgroup. By definition the space M h is then given by GoIK. Now the group Go acts on Ma as a subgroup of G C , and the orbit of Go containing the point K of Ma is an open com- plex submanifold that is isomorpnic to M h (as a complex manifold). Ma regarded as a com- plex manifold can be represented as the homo- geneous space GCIU of the complex simple Lie group G C . F. Symmetric Bounded Domains We denote by D a bounded domain in the complex Euclidean space en of dimension n. We call D a symmetric bounded domain if for each point of D there exists a holomorphic transformation of order 2 of D onto D having the point as an isolated fixed point. On the other hand, the group of all holomorphic transformations of D is a Lie group, and D is called a homogeneous bounded domain if this group acts transitively on D. A symmetric bounded domain is a homogeneous bounded domain. The following theorem gives more precise results: On a bounded domain D, tBergman's kernel function defines a Kiihler metric that is invariant under all holomorphic transformations of D. If D is a symmetric bounded domain, D is a symmetric Hermitian space with respect to this metric" and its defin- ing Riemannian space is a symmetric Riemann- ian homogeneous space of noncompact type GIK with semisimple Lie group G. Conversely, any symmetric Hermitian space of noncom- pact type is isomorphic (as a complex mani- fold) to a symmetric bounded domain. When D is isomorphic to an irreducible symmetric Hermitian space, we call D an irreducible symmetric bounded domain. A symmetric bounded domain is simply connected and can be decomposed into the direct product of irre- ducible symmetric bounded domains. The connected component of the identity element of the group of all holomorphic trans- formations of a symmetric bounded domain D is a semisimple Lie group that acts transitively on D. Conversely, D is a symm(:tric bounded domain if a connected semisimple Lie group, or more generally, a connected Lie group admitting a two-sided invariant tHaar mea- sure, acts transitively on D. Homogeneous bounded domains in en are symmetric bounded domains if n:;:;; 3 but not necessarily when n ;::,4. G. Examples of Irreducible Symmetric Riemannian Spaces Here we list irreducible symm(:tric Riemannian spaces of types (2) and (4) ( Section C) that 
1561 can be represented as homogeneous spaces of classical groups, using the notation introduced by E. Cartan. We denote by M" = G j K a sim- ply connected irreducible symmetric Riemann- ian space of type (2), where G is a group that acts almost effectively on M" and K is the subgroup given by K = K for an involutive automorphism 0 of G. For such an M", the space of type (4) that is dual to M" is denoted by Me = GejK. Clearly dimM,,=dimMe. (For the dimension and rank of M" and for those M" that are represented as homogeneous spaces of simply connected texceptional com- pact simple Lie groups  Appendix A, Table 5.1II.) In this section (and also in Appendix A, Table 5.111), O(n), U(n), Sp(n), SL(n, R), and SL(n, C) are the torthogonal group of degree n, the tunitary group of degree n, the tsymplectic group of degree 2n, and the real and complex tspeciallinear groups of degree n, respectively. Let SO(n)=SL(n,R)nO(n) and SU(n)= SL(n,C)nU(n). We put ( -I 0 ) I - p P.q- 0 Iq' ( -I p 0 K _ 0 Iq p.q- 0 0 o 0 ( 0 In ) I n = -In 0 ' o 0 ) o 0 -I p 0 ' o Iq where Ip is the p x p unit matrix. Type AI. M,,=SU(n)jSO(n) (n> 1), where O(s) = s (with s the complex conjugate matrix ofs). Me=SL(n,R)jSO(n). Type All. M,,=SU(2n)/Sp(n) (n> 1), where 0(s)=J n sJ n - 1 . M e =SU*(2n)jSp(n). Here SU*(2n) is the subgroup of SL(2n, C) formed by the matrices that commute with the trans-. formation (zr, ...,2n, 2 n + 1 , ...,z2n)(zn+l' .." 2 2n , -2 1 "", -2 n ) in C"; S U*(2n) is called the quaternion unimodular group and is isomorphic to the commutator group of the group of all regular transformations in an n-dimensional vector space over the quaternion field H. Type AlII. Mu=SU(p+q)jS(U p x U q ) (p q 1), where S(U p x Uq)=SU(p+q)n(U(p) x U(q)), with U(p) x U(q) being canonically identified with a subgroup of U (p + q), and O(s) = I p.qsI p.q' This space M" is a tcomplex Grassmann manifold. Me=SU(p,q)jS(U p x U q ), where SU(p,q) is the subgroup of SL(p+q,C) consisting of matrices that leave invariant the Hermitian form Z l 2 1 +... +Zp2p-Zp+12p+1- '" -zp+qZp+q' Type AIV. This is the case q = 1 of type AliI. M" is the (n-1)-dimensional complex projec- tive space, and Me is called a Hermitian hyper- bolic space. Type BOI. M"=SO(p+q)(SO(p) x SO(q) (pq 1,p> I.p+qo;t4), where (}(s) = Ip.qsIp.q. M" is the treal Grassmann manifold formed by 412H Symmetric Riemannian Spaces and Real Forms the oriented p-dimensional subspaces in Rp+q. Me = SOo(p, q)jSO(p) x SO(q), where SO(p, q) is the subgroup of SL(n, R) consisting of matrices that leave invariant the quadratic form xi + ... + x - X+1 - .., - x+q, and SOo(p, q) is the connected component of the identity element. Type BOIl. This is the case q = 1 of type BOI. M" is the (n-l)-dimensional sphere, and Me is called a real hyperbolic space. Type 0111. M,,=SO(2n)jU(n) (n>2), where U(n) is regarded as a subgroup of SO(2n) by identifying SE U(n) with ( Res Ims ) ESO(2n), -Ims Res and O(s) = JnsJ;;I. M o =SO*(2n)jU(n). Here SO*(2n) denotes the group of all complex orthogonal matrices of determinant 1 leaving invariant the skew-Hermitian form Z 1 2n+l  Zn+ 1 2 1 + 222n+2 - 2 n + 2 2 2 + ... + ZnZZn - Z2n Z n; this group is isomorphic to the group of all linear transformations leaving invariant a nondegenerate skew-Hermitian form in an n- dimensional vector space over the quaternion field H. Type CI. Mu=Sp(n)jU(n) (n 1), where U(n) is considered as a subgroup of Sp(n) by the identification U(n)cSO(2n) explained in type OIlI and O(s)=s(=JnsJ n - I ). Me=Sp(n,R)jU(n), where Sp(n, R) is the real symplectic group of degree 2n. Type CII. Mu=Sp(p+q)jSp(p) x Sp(q) (p q  1), where Sp(p) x Sp(q) is identified with a subgroup of Sp(p+ q) by the mapping (( At Bl ) , ( A2 B2 )) C t D 1 C 2 D 2 -> ( I 2 gl 2 ) C 1 0 D 1 0 o C 2 0 D 2 and (}(s) = Kp.qsKp.q. Me= Sp(p, q)jSp(p) x Sp(q). Here Sp(p, q) is the group of complex symplectic matrices of degree 2(p+ q) leav- ing invariant the Hermitian form (z I' ... , Z p+q)K p.q '(2 1 , ..., 2p+q); this group is interpreted as the group of all linear transformations leav- ing invariant a nondegenerate Hermitian form of index p in a (p + q)-dimensional vector space over the quaternion field H. For q = 1, Mu is the quaternion projective space, and Me is called the quaternion hyperbolic space. Among the spaces introduced here, there are some with lower p, q, n that coincide (as Rie- mannian spaces) ( Appendix A, Table 5.II1). H. Space Forms A Riemannian manifold of tconstant curvature is called a space form; it is said to be spherical, 
4121 Symmetric Riemannian Spaces and Real Forms Euclidean, or hyperbolic according as the con- stant curvature K is positive, zero, or negative. A space form is a locally symmetric Riemann- ian space; a simply connected complete space form is a sphere if K >0, a real Euclidean space if K =0, and a real hyperbolic space if K <0. More generally, a complete spherical space form of even dimension is a sphere or a projective space, and one of odd dimension is an orientable manifold. A complete 2- dimensional Euclidean space form is one of the following spaces: Euclidean plane, cylinder, torus, tMobius strip, tKlein bottle. Except for these five spaces and the 2-dimensional sphere, any tclosed surface is a 2-dimensional hyper- bolic space form (for details about space forms  [6]). I. Examples of Irreducible Symmetric Bounded Domains Among the irreducible symmetric Riemannian spaces described in Section H, those defined by irreducible symmetric Hermitian spaces are of types AlII, DIll, 8DI (q = 2), and CI. We list the irreducible symmetric bounded domains that are isomorphic to the irreducible Her- mitian spaces defining these spaces. Positive definiteness of a matrix will be written »0. Type 1m m' (m';::' m;::' 1). The set of all m x m' complex matrices Z satisfying the condition 1m' - '2Z »0 is a symmetric bounded domain in cmm', which is isomorphic (as a complex manifold) to the irreducible symmetric Hermi- tian space defined by Mo of type AlII (p = m, q=m'). Type 11m (m;::' 2). The set of all m x m com- plex tskew-symmetric matrices Z satisfying the condition Im-'ZZ»O is a symmetric bounded domain in cm(m-I)/2 corresponding to the type DIll (n = m). Type IIIm (m;::, 1). The set of all m x m com- plex symmetric matrices satisfying the con- dition Im-'ZZ»O is a symmetric bounded domain in cm(m+I)/2 corresponding to the type CI (n = m). This bounded domain is holomor- phically isomorphic to the tSiegel upper half- space of degree m. Type IV m (m;::, 1, m =f. 2). This bounded domain in cm is formed by the elements (zJ,"', zm) satisfying the condition IZ112 + ... + IZml2 «1 + Id +... +z;.I)/2< 1, and corresponds to the type 8DI (p=m,q=2). Among these four types of bounded domains, the following complex analytic iso- morphisms hold: I 1,1  II 2  III 1  IV l , II 3  11.3, IV3III2' IV4I2.2, IV6II4' (For details about these symmetric bounded domains  [2].) There are two more kinds of irreducible symmetric bounded domains, 1562 which are represented as homogeneous spaces of exceptional Lie groups. J. Weakly Symmetric Riemannian Spaces A generalization of symmetric Riemannian space is the notion of weakly symmetric Rie- mannian space introduced by Selberg. Let M be a Riemannian space. M is called a weakly symmetric Riemannian space if a Lie sub- group G of the group of isometries I(M) acts transitively on M and there exists an element J-IE I(M) satisfying the relations (i) IGJ-I-l = G; (ii) J-I2 E G; and (iii) for any two points x, y of M, there exists an element m of G such that J-IX = my, J-Iy = mx. A symmetric Riemannian space M becomes a weakly symmetric Riemannian space if we put G=I(M) and J-I=the identity transformation; as the element m in condition (iii) we can take the symmetry rIp at the mid- point p on the geodesic arc joining x and y. There are, however, weakly symmetric Rie- mannian spaces that do not have the structure of a symmetric Riemannian space. An example of such a space is given by M = G = SL(2, R) with a suitable Riemannian metric, where J-I is the inner automorphism defined by ( -) (Selberg [4]). On a weakly symmetric Rie- mannian space, the ring of all G-invariant differential-integral operators is commutative; this fact is useful in the theory of spherical functions ( 437 Unitary Representations). References [1] S. Helgason, Differential geometry, Lie groups, and symmetric spaces, Academic Press, 1978. [2] C. L. Siegel, Analytic functions of several complex variables, Princeton Vni\'. Press, 1950. [3] J.-L. Koszul, Exposes sur les espaces homogenes symetriques, Silo Paulo, 1959. [4] A. Selberg, Harmonic analysis and discon- tinuous groups in weakly symmetric Riemann- ian spaces with application to Dirichlet series, J. Indian Math. Soc., 20 (1956), 47 -87. [5] E. Cartan, Sur certaines formes rieman- niennes remarquables des geometries a groupe fondamental simple, Ann. Sci. Ecole Norm. Sup., 44 (1927),345-467 (Oeuvres completes, Gauthier- Villars, 1952, pt. I, vol. 2, 867-989). [6] E. Cartan, Sur les domaines bornes homo- genes de l'espace de n-variables complexes, Abh. Math. Sem. Vniv. Hamburg, 11 (1936), 116-162 (Oeuvres completes, Gauthier-Villars, 1952, pt. I, vol. 2, 1259-1305). 
1563 [7J J. A. Wolf, Spaces of constant curvature, McGraw-Hill,1967. [8J S. Kobayashi and K. Nomizu, Founda- tions of differential geometry II, Interscience, 1969. [9J O. Loos, Symmetric spaces. I, General theory; II, Compact spaces and classification, Benjamin, 1969. .d 1  (VII 7\ -,-." \. ..../ Symmetric Spaces A tRiemannian manifold M is called a sym- metric Riemannian space if M is connected and if for each pE M there exists an involutive tisometry rIp of M that has p as an isolated fixed point. For the classification and the group-theoretic properties of symmetric Rie- mannian spaces  412 Symmetric Riemann- ian Spaces and Real Forms. We state here the geometrical properties of a symmetric Rie- mannian space M. Let M be represented by G/K, a tsymmetric Riemannian homogeneous space. The tLie algebras of G and K are de- noted by 9 and f respectively. Let us denote by '0 the tIeft translation of M defined by a E G, and by X* the vector field on M generated by X E g. We denote by 0 the differential of the involutive automorphism of G defining G/K and identify the subspace m = {X E g I II(X) = - X} of 9 with the tangent space y"(M) of M at the origin 0 = K of M. The trepresen- tation of f on m induced from the tadjoint representation of 9 is denoted by ad m . A. Riemannian Connections M is a complete real analytic thomogeneous Riemannian manifold. If M is a tsymmetric Hermitian space, it is a thomogeneous Kiih- lerian manifold. The tRiemannian connection V of M is the tcanonical connection of the homogeneous space G/ K and satisfies VyX* = [X, Y] (YEm) for each X Ef and VyX* =0 (YEm) for each X Em. For each X Em, the curve Yx of M defined by Yx(t)=(exptX)o (tER) is a tgeodesic of M such that yx(O)=o and yx(O)=x. In particular, the texponen- tial mapping Expo at 0 is given by ExpoX = (expX)o (X Em). For each X Em, the tparal- leI translation along the geodesic arc Yx(t) (0:;:;; t:;:;; to) coincides with the differential of '_xp'ox. If M is compact, for each pE M there exists a smooth simply closed geodesic passing through p. Any G-invariant tensor field on M 413C Symmetric Spaces is tparallel with respect to V. Any G-invariant tdifferential form on M is closed. The Lie algebra I) of the trestricted homogeneous holonomy group of M at 0 coincides with adm[m, m]. If the group I(M) of all isometries of Mis tsemisimple, one has 1)= {A EgI(m)1 A. go=O, A. Ro=O} = admf. Here, go and Ro denote the values at 0 of the Riemannian metric 9 and the tRiemannian curvature R of M, respectively, and A. is the natural action of A on the tensors over m. If, moreover, M is a symmetric Hermitian space, the value J o at 0 of the talmost complex structure J of M belongs to the center of g. In general, I) = {O} if and only if M is tflat, and I) has no nonzero invariant on m if and only if I(M) is semisimple. B. Riemannian Curvature Tensors The Riemannian curvature tensor R of M is parallel and satisfies Ro(X, Y)= -adm[X, YJ (X, Y Em). Assume that dim M ;::, 2 in the fol- lowing. Let P be a 2-dimensional subspace of m, and {X, Y} an orthonormal basis of P with respect to go. Then the tsectional curvature K(P) of P is given by K(P)=go([[X, Y],X], Y). K =0 everywhere if and only if M is flat. If M is of tcompact type (resp. of tnoncompact type), then K;::, 0 (resp. K:;:;; 0) everywhere. K >0 (resp. K <0) everywhere if and only if the trank of M is 1 and M is of compact type (resp. of noncom pact type). For any four points p, q, p', q' of a manifold M of any of these types satisfying d(p, q) = d(p', q'), d being the tRiemannian distance of M, there exists a IjJEI(M) such that ljJ(p)=p' and ljJ(q)=q'. Other than the aforementioned M's, the only Riemannian manifolds having this property are circles and Euclidean spaces. If K > 0 everywhere, any geodesic of M is a smooth simply closed curve and all geodesics are of the same length. For a symmetric Hermitian space M, the tholomorphic sectional curvature H satisfies H =0 (resp. H >0, H <0) everywhere if and only if M is flat (resp. of compact type, of noncom pact type). C. Ricci Tensors The tRicci tensor S of M is parallel. If rpm denotes the restriction to m x m orthe tKilling form rp of g, the value So of S at 0 satisfies So = -!rpm' If Mis tirreducible, it is an tEinstein space. s=o (resp. positive definite, negative definite, nondegenerate) everywhere if and only if M is flat (resp. M is of compact type, M is of noncom pact type, I(M) is semisimple). If M is a tsymmetric bounded domain and 9 is the tBergman metric of M, one has S= -g. 
413 D Symmetric Spaces D. Symmetric Riemannian Spaces of Noncompact Type Let M be of noncom pact type. For each pE M, p is the only fixed point of the tsymmetry rIp, and the exponential mapping at p is a diffeo- morphism from Tp(M) to M. In particular, M is diffeomorphic to a Euclidean space. For each pair p, qE M, a geodesic arc joining p and q is unique up to parametrization. For each pE M there exists neither a tconjugate point nor a tcut point of p. If M is a symmetric Hermitian space, that is, if it is a symmetric bounded domain, then it is a tStein manifold and holomorphically homeomorphic to a tSiegel domain. E. Groups of Isometries The isotropy subgroup at 0 in I(M) is denoted by Io(M). Then the smooth mapping Io(M) x m-->I(M) defined by the correspondence i/J x X I---> i/J'expx is surjective, and it is a diffeo- morphism if M is of noncompact type. If M is of noncom pact type, I(M) is isomorphic to the group A(g) of all automorphisms of 9 in a natural way, and Io(M) is isomorphic to the subgroup A(g, f) = {i/JE A(g) I i/J(f) = f} of A(g), provided that G acts almost effectively on M. Moreover, in this case the center of the iden- tity component I(M)o of I(M) reduces to the identity, and the isotropy subgroup at a point in I(M)o is a maximal compact subgroup of I(Mf If I(M) is semisimple, any element of I(M)O may be represented as a product of an even number of symmetries of M. In the fol- lowing, let M be a symmetric Hermitian space, and denote by A(M) (resp. H(M)) the group of all holomorphic isometries (resp. all holomor- phic homeomorphisms) of M, and by A(M)o and H(M)o their identity components. All these groups act transitively on M. If M is compact or if I(M) is semisimple, one has A(M)o=I(M)o. If I(M) is semisimple, Mis simply connected and the center of I(M)o reduces to the identity. If M is of compact type, M is a trational tprojective algebraic manifold, and H(M)o is a complex semisimple Lie group whose center reduces to the identity, and it is the tcomplexification of I(M)o. In this case, the isotropy subgroup at a point in H(M)o is a tparabolic subgroup of H(M)o. If Mis of noncom pact type, one has H(M)o= I(M)o. In the following we assume that Gis compact. F. Cart an Subalgebras A maximal Abelian tLie subalgebra in m is called a Cartan subalgebra for M. Cartan sub- 1564 algebras are conjugate to each otht:r under the tadjoint action of K. Fix a Cartan subalge- bra a and introduce an inner product ( , ) on a by the restriction to a x a of g(. For an element rx of the dual space a* of a, we put m.= {X Eml [H, [H, X]] = -rx(H)2X for any HE a}. The subset I = {rxE a* - {O} I m. =1= {O} } of a* is called the root system of M (relative to a). We write m.=dim m. for rxEI. The subset D = {H Ea I rx(H)EnZ for some rxE..[} of a is called the diagram of M. A connec1ed compo- nent of a - D is called a fundamentlll cell of M. The quotient group W of the normalizer of a in K modulo the centralizer of a in K is called the Weyl group of M. W is identified with a finite group of orthogonal transformations of a. G. Conjugate Points For a geodesic arc Y with the initial point 0, any tJacobi field along y that vanishes at 0 and the end point of y is obtained as the restriction to y of the vector field X* generated by an element XEf. For HEa-{O}, Exp"H is a conjugate point to 0 along the geodesic YH if and only if rx(H)E nZ- {O} for some rxEI. In this case, the multiplicity of the conjugate point Expo H is equal to -! l:.EE..(H)EnZ-{Oj M.. From this fact and Morse theory ( 279 Morse Theory), we get a tcellular decompo- sition of the Hoop space of M. The set of all points conjugate to 0 coincides with KExPoD and is stratified to a disjoint union of a finite number of connected regular subrnanifolds with dimension  dim M - 2. H. Cut Points We define a tlattice group 1 of a by 1 = {AEaIExpoA=o}, and put Co={HEal Max AEr _{oj2(H, A)/(A, A) = I}. Then, for HE a- {O}, ExPoH is a cut point of 0 along the geodesic YH if and only if HE Ca. The set Co of all cut points of 0 coincides with K Expo Co and is stratified to a disjoint union of a finite number of connected regular submanifolds with dimension  dim M - 1. The set of all points tfirst conjugate to 0 coincid,s with Co if and only if M is simply connected. I. Fundamental Groups Let 1o denote the subgroup of a generated by {(2n/(rx,rx))rxlrxEI}, identifying a* with a by means of the inner product ( , ) of a. This is a subgroup of 1. We regard 1 as a subgroup of the group I(a) of all motions of a by parallel 
1565 translations. The subgroup W = Wf of 1(0) generated by f and the Weyl group W is called the affine Weyl group of M. W leaves the diagram D invariant and acts transitively on the set of all fundamental cells of M. Take a fundamental cell rI such that its closure (j contains 0, and put W a = {WE WI W(rI) = rI}. Then the fundamental group n)(M) of M is an tAbelian group isomorphic to the groups W a and f/fo. n,(M) is a finite group if and only if M is of compact type. In this case, the order of n, (M) is equal to the cardinality of the set f n (j as well as to the index [f: fo]. Moreover, if we denote by W: the group IV a for the symmetric Riemannian space M* = G* I K* de- fined by the tadjoint group G* of G and K* = {a E G* I a() = Oa}, then W a is isomorphic to a subgroup of W:. If M is irreducible, W: is isomorphic to a subgroup of the group of all automorphisms of the textended Dynkin dia- gram of the root system E. J. Cohomology Rings Let P(g) (resp. P(f)) be the tgraded linear space of all tprimitive elements in the tcohomology algebra H(g) of 9 (resp. H(f) of f), and Pig, f) the intersection of Pig) with the image of the natural homomorphism H(g, f)-->H(g), where H(g, f) denotes the relative cohomology alge- bra for the pair (g, f). Then one has dim P(g, f) + dim P(f) = dim Pig). Denote by A P(g, f) the exterior algebra over P(g, f). The tgraded algebra of all G-invariant polynomials on 9 (resp. all K-invariant polynomials on f) is dnoted by l(G) (resp. l(K)), where the de- gree of a homogeneous polynomial with de- gree p is defined to be 2p. We denote by l+(G) the ideal of 1 (G) consisting of all f E l(G) such thatf(O)=O, and regard I(K) as an I+(G)- module through the restriction homomor- phism. Then the treal cohomology ring H(M) of M is isomorphic to the tensor product AP(g, f) @ (I(K)/P(G)I(K)). If K is connected and the tPoincare polynomials of P(g), P(f), and P(g, f) are L, t 2mj - l , L1, t 2n ,-I, and LH' t 2m ,-" respectively, then the Poincare polynomial of H(M) is given by I1S+1 (1 + t 2m ,-1 )I1f1 (1 - t2m')I1:1 (1 _ t 2n ,)-,. References [IJ E. Cartan, Sur certaines formes rieman- nienes remarquables des geometries a groupe fondamental simple, Ann. Sci. Ecole Norm. Sup., 44 (1927),345-467. [2J S. Helgason, Differential geometry, Lie groups, and symmetric spaces, Academic Press, 1978. 414 B Systems of Units [3J S. Kobayashi and K. Nomizu, Founda- tions of differential geometry II, Interscience, 1969. [4J H. C. Wang, Two point homogeneous spaces, Ann. Math., (2) 55 (1952),177-191. [5J A. KOI<inyi and J. A. Wolf, Realization of Hermitian symmetric spaces as generalized half-planes, Ann. Math., (2) 81 (1965),265-288. [6J R. Bott and H. Samelson, Applications of the theory of Morse to symmetric spaces, Amer. J. Math., 80 (1958),964-1029. [7J T. Sakai, On cut loci of compact sym- metric spaces, Hokkaido Math. J., 6 (1977), 136-161. [8J M. Takeuchi, On conjugate loci and cut loci of compact symmetric spaces I, Tsukuba J. Math., 2 (1978), 35-68. [9J R. Crittenden, Minimum and conjugate points in symmetric spaces, Canad. J. Math., 14 (1962),320-328. [1 OJ J. L. Koszul, Sur un type d'algebre dif- ferentielles avec la transgression, Colloque de Topologie (Espaces fibres), Brussels, 1950, 73- 81. 414 (XX.1 ) Systems of Units A. International System of Units Units representing various physical quantities can be derived from a certain number of fun- damental (base) units. By a system of units we mean a system of fundamental units. Various systems of units have been used in the course of the development of ph ysics. T oda y, the standard is set by the international system of units (systeme international d'unites; abbre- viated SI) [1], which has been developed in the spirit of the meter-kilogram system. This system consists of the seven fundamental units listed in Table 1, units induced from them, and unit designations with prefixes representing the powers of 10 where necessary. It also con- tains two auxiliary units for plane and solid angles, and a large number of derived units [1]. B. Systems of Units in Mechanics Units in mechanics are usually derived from length, mass, and time, and SI uses the meter, kilogram, and second as base units. Neither the CGS system, derived from centimeter. gram, and second, nor the system of gravita- tional units, derived from length, force, and time, are recommended for general use by 
414 C Systems of Units Table 1 Quantity Length SI unit Symbol meter m Mass kilogram kg Time second Intensity of electric current A am pere Temperature kelvin K Amount of mole mol substance Luminous candela cd intensity 1566 Description The meter is the length equal to 1,650,763.73 wave- lengths in vacuum of the radiation corresponding to the transmission between the levels 2 p lO and 5d 5 of the krypton-86 atom. The kilogram is equal to the mass of the interna- tional prototype of the kilogram. The second is the duration of9,192,63L770 periods of the radiation corresponding to the transmission between the two hyperfine levels of the ground state of the cesium-133 atom. The ampere is the intensity of the constant current maintained in two parallel, rectilinear conductors of infinite length and of negligible circular section, placed 1 m apart in vacuum, and producing a force between them equal to 2 x \0-7 newton (m' kg' s -2) per meter of length. The kelvin, the unit of thermodynamical tempera- ture, is 1/273.16 of the thermodynamical tempera- ture of the triple point of water. The mole is the amount of substance of a system containing as many elementary entities as there are atoms in 0.012 kg of carbon-12. The candela is the luminous intensity in a given direction of a source emitting monochromatic radiation of frequency 540 x 10 12 hertz (= s -1), the radiant intensity of which in that direction is 1/683 watt per steradian. (This revised definition of candela was adopted in 1980.) Table 2 Unit in terms of SI base or derived Quantity SI unit Symbol units Frequency hertz Hz 1 Hz=l S-1 Force newton N 1 N = 1 kg' m/s 2 Pressure and stress pascal Pa 1 Pa= 1 N/m 2 Work, energy, quantity of heat joule J 1J=1 N'm Power watt W I W = 1 J/s Quantity of electricity coulomb C 1C==lA.s Electromotive force, potential volt V 1 V=l W/A difference Electric capacitance farad F 1 F==l C/V Electric resistance ohm il 1 il==l V/A Electric conductance siemens S lS=olil- 1 Flux of magnetic induction weber Wb 1 Wb = 1 V' s magnetic flux Magnetic induction, magnetic tesla T 1 T == 1 Wb/m 2 flux density Inductance henry H 1H=IWb/A Luminous flux lumen 1m 1 1m = 1 cd. sr Illuminance lux Ix 1Ix.= 11m/m 2 Activity becquerel Bq 1 Bq = 1 S-1 Adsorbed dose gray Gy 1 Gy= 1 J/kg Radiation dose sievert Sv 1 Sv=1 J/kg 
1567 the SI Committee. Besides the base units, minute, hour, and day, degree, minute, and second (angle), liter, and ton have been ap- proved by the SI Committee. Units such as the electron volt, atomic mass unit, astronomical unit, and parsec (not SI) are empirically de- fined and have been approved. Several other units, such as nautical mile, knot, are (area), and bar, have been provisionally approved. C. System of Units in Thermodynamics The base unit for temperature is the degree Kelvin CK; formerly called the absolute tem- perature). Degree Celsius CC), defined by t = T - 273.15, where T is in oK, is also used. The unit of heat is the joule J, the same as the unit for other forms of energy. Formerly, one calorie was defined as the quantity of heat that must be supplied to one gram of water to raise its temperature from 14.5°C to 15.5°C; now one calorie is defined by 1 cal = 4.1855 J. D. Systems of Units in Electricity and Magnetism Three distinct systems of units have been developed in the field of electricity and mag- netism: the electrostatic system, which origi- nates from Coulomb's law for the force be- tween two electric charges and defines mag- netic quantities by means of the Biot-Savart law; the electromagnetic system, which origi- nates from Coulomb's law for magnetism; and the Gaussian system, in which the dielectric constant and permeability are taken to be non- dimensional. At present, however, the rational- ized MKSA system of units is adopted as the international standard. It uses the derived units listed in Table 2 (taken from [2J), where the derived units with proper names in other fields are also listed. E. Other Units In the field of photometry, the fol1owing defi- nition was adopted in 1948: One candela (cd) (0.98 old candle) is defined as 1/(6 x 105) of the luminous intensity in the direction normal to a plane surface of I m 2 area of a black body at the temperature of the solidifying point of platinum. The total luminous flux emanating uniformly in all directions from a source of luminous intensity 1 cd is defined as 4n lumen (1m). One lux (Ix) is defined as the il1uminance on a surface area of I m 2 produced by a lumi- nous flux of I cd uniformly incident on the surface. In] 980, the definition was revised as shown in Table 1. 414 Ref. Systems of Units F or theoretical purposes, a system of units caned the absolute system of units is often used, in which units of mass, length, and time are chosen so that the values of universal constants, such as the universal gravitational constant, speed of1ight, Planck's constant, and Boltzmann's constant, are equal to 1. References [IJ Bureau International des Points et Me- sures, Le systeme international d'unites, 1970, fourth revised edition, 1981. [2J R. G. Lerner and G. L. Trigg (eds.), En- cyclopedia of physics, Addison-Wesley, 1981. 
415 Ref. Takagi, Teiji 415 (XXI.41) Takagi, Teiji Teiji Takagi (April 21, 1875-February 28, 1960) was born in Gifu Prefecture, Japan. After graduation from the Imperial University of Tokyo in 1897, he continued his studies in Germany, first with Frobenius in Berlin and then with Hilbert in Gottingen. He returned to Japan in 1901 and taught at the Imperial University of Tokyo until 1936, when he re- tured. He died in Tokyo of cerebral apoplexy. Since his student years he had been inter- ested in Kronecker's conjecture on tAbelian extensions of imaginary quadratic number fields. He solved it affirmatively for the case of Q(j=l) while still in Gottingen and presented this result as his doctoral thesis. During World War I, he pursued his research in the theory of numbers in isolation from Western countries. It developed into tclass field theory, a beautiful general theory of Abelian extensions of alge- braic number fields. This was published in 1920, and was complemented by his 1922 paper on the treciprocity law of power residues and then by tArtin's general law of reciprocity published in 1927. Besides these arithmetical works, he also published papers on algebraic and analytic subjects and on the foundations of the theories of natural numbers and of real numbers. His book (in Japanese) on the his- tory of mathematics in the 19th century and his General course 01 analysis (also in Japanese) as well as his teaching and research activities at the University exercised great influence on the development of mathematics in Japan. Reference [1] S. Kuroda (ed.), The collected papers of Teiji Takagi, Iwanami, 1973. 416 (XI.16) TeichmOlier Spaces Consider the set Mg consisting of the con- formal equivalence classes of closed Riemann surfaces of genus g. In 1859 Riemann stated, without rigorous proof, that Mg is parame- trizedbym(g)(=Oifg=O, =lifg=l, =3g-3 if g;' 2) complex parameters ( 11 Algebraic Functions). Later, the introduction of a topol- ogy and m(g)-dimensional complex structure on Mg were discussed rigorously in various ways. The following explanation of these methods is due to O. Teichmiiller [1, 2J, L. V. Ahlfors [3,4J, and L. Bers [5-7]. For the 1570 algebraic-geometric approach  9 Algebraic Curves. The trivial case g = 0 is excluded, since Mo consists of a single point. Take a closed Rie- mann surface 9{0 of genus g;' 1, and consider the pairs (9{, H) consisting of closed Riemann surfaces 9{ of the same genus g and the tho- motopy classes H of orientation-preserving homeomorphisms of 9{0 into 9{. T",o pairs (9{, H) and (9{', H') are defined to be con- formally equivalent if the homotopy class H'H- 1 contains a conformal mapping. The set T 9 consisting of the conformal equivalence classes <9{, H) is called the Teichmiiller space (with center at 9{0). Let g be the group of homotopy classes of orientation-pn:serving homeomorphisms of 9{0 onto itself. g is a transformation group acting on Tg III the sense that each I} E g induces the transformation <9{, H)-><9{, HI}). It satisfies T g/,' = Mg. The set 3 g of elements of g fixing every point of T 9 consists only of the unity element if g;' 3 and is a normal subgroup of order 2 if g= 1,2. For the remainder of this article we assume that g;,2. The case g= 1 can be discussed similarly, and the result coincides with the classical one: T 1 can be identified with the upper half-plane and r!31 is the tmodular group. Denote by B(9{o) the set of measurable invariant forms I1 dz dz- 1 with 11111100 < 1. For every 11 E B(9{o) there exists a pair (9l, H) for which some hEH satisfies h z =l1h z (. 352 Quasiconformal Mappings). This correspon- dence determines a surjection 11 E B(9{o) I--> <9{,H)ET g . Next, ifQ(9{o) denotes the space of holomorphic quadratic differentials rpdz 2 on 9{o, a mapping I1EB(9{o)l-->rpEQ(9{o) is obtained as follows: Consider 11 on the uni- versal covering space U (= upper half-plane) of 9{0. Extend it to U* ( = lower half-plane) by setting 11 = 0, and let I be a quasiconformal mapping I of the plane onto itself satisfying Iz= I1Iz' Take the tSchwarzian derivative IjJ = {f, z} of the holomorphic functi on f in U*. The desired rp is given by rp(z) = ljJ(z) on U. It has been verified that two 11 induce the same rp if and only if the same <9{, H) corresponds to 11. Consequently, an injection <9{, H) E T g l-->rpEQ(9{o) is obtained. Since Q(9{o)= cm(g) by the Riemann-Roch theorem, this in- jection yields an embedding Tgccm(g), where T 9 is shown to be a domain. As a subdomain of cm(g), the Teichmiiller space is an m(g)-dimensional complex analytic manifold. It is topologically equivalent to the unit ball in real 2m (g)-dimensional space and is a bounded tdomain of holomorphy in Cm(g). Let {lXI, ..., IX 2g } be a I-dimensional ho- mology basis with integral coefficients in 91 0 such that the intersection numbers are (lXi, IX j ) =(IXg+i,IXg+)=O, (lXi, IX g + j ) = (jij, i,j= 1, ...,g. 
1571 Given an arbitrary <9{, H) E T q, consider the tperiod matrix Q of 9{ with respect to the homology basis H!X I , ... , H !X2g and the basis W b ... , w g of t Abelian differentials of the first kind with the property that SHa,w j =6ij' Then Q is a holomorphic function on Tg. Furthermore, the analytic structure of the Teichmiiller space introduced previously is the unique one (with respect to the topology defined above) for which the period matrix is holomorphic. 5 g is a properly discontinuous group of analytic transformations, and therefore Mg is an m(g)-dimensional normal tanalytic space. 5 q is known to be the whole group of the holomorphic automorphisms of Tq (Royden [8J); thus Tq is not a tsymmetric space. To every point r of the Teichmiiller space, there corresponds a Jordan domain D(r) in the complex plane in such a way that the fiber space Fg={(r,z)lzED(r),rETgccm(g)} has the following properties: Fg is a bounded domain of holomorphy of cm(g)+!. It carries a properly discontinuous group &g of holomorphic auto- morphisms, which preserves every fiber D(r) and is such that D(r)/&g is conformally equiva- lent to the Riemann surface corresponding to r. Fg carries holomorphic functions FJ(r, z), j = 1, ... , 5g - 5 such that for every r the func- tions FJ/F I ,j=2, ...,5g-5 restricted to D(r) generate the meromorphic function field of the Riemann surface D(r)/&g. By means of the textremal quasiconformal mappings, it can be verified that Tg is a com- plete metric space. The metric is called the Teichmiiller metric, and is known to be a Kobayashi metric. The Teichmiiller space also carries a natu- rally defined Kahler metric, which for 9 = 1 coincides with the t Poincare metric if T I is identified with the upper half-plane. The tRicci curvature, tholomorphic sectional cruvature, and tscalar curvature are all negative (Ahlfors [9J). By means of the quasiconformal mapping f, which we considered previously in order to construct the correspondence J.11-HfJ, it is possible to regard the Teichmiiller space as a space of quasi-Fuchsian groups ( 234 Kleinian Groups). To the boundary ofTg, it being a bounded domain in Cm(g), there corre- spond various interesting Kleinian groups, which are called tboundary groups (Bers [IOJ, Maskit [11J). The definition of Teichmiiller spaces can be extended to open Riemann surfaces 9{o and, further, to those with signatures. A number of propositions stated above are valid to these cases as well. In particular, the Teichmiiller space for the case where 9{o is the unit disk is called the universal TeichmiilIer space. It is a bounded domain of holomorphy in an infinite- 417 A Tensor Calculus dimensional Banach space and is a symmetric space. Every Teichmiiller space is a subspace of the universal Teichmiiller space. References [lJ O. Teichmiiller, Extremale quasikonforme Abbildungen und quadratische Differentiale, Abh. Preuss. Akad. Wiss., 1939. [2J O. Teichmiiller, Bestimmung der extrema- len quasikonformen Abbildung bei geschlos- senen orientierten Riemannschen Flachen, Abh. Preuss. Akad. Wiss., 1943. [3J L. V. Ahlfors, The complex analytic struc- ture of the space of closed Riemann surfaces, Analytic functions, Princeton Univ. Press, 1960,45-66. [4J L. V. Ahlfors, Lectures on quasiconformal mappings, Van Nostrand, 1966. [5J L. Bers, Spaces of Riemann surfaces. Proc. Intern. Congr. Math., Edinburgh, 1958,349- 361. [6J L. Bers, On moduli of Riemann surfaces, Lectures at Forschungsinstitut fiir Mathema- tik, Eidgenossische Technische Hochschule, Ziirich, 1964. [7J L. Bers, Uniformization, moduli, and Kleinian groups, Bull. London Math. Soc., 4 (1972),257-300. [8J H. L. Royden, Automorphisms and isome- tries of Teichmiiller spaces, Advances in the Theory of Riemann Surfaces, Princeton Univ. Press, 1971,369-383. [9J L. V. Ahlfors, Curvature properties of Teichmiiller's space, J. Analyse Math., 9 (1961). 161-176. [10J L. Bers, On boundaries of Teichmiiller spaces and on Kleinian groups I, Ann. Math., (2) 91 (1970),570-600. [IIJ B. Maskit, On boundaries of Teichmiiller spaces and on Kleinian groups II, Ann. Math., (2) 91 (1970), 608-638. 417 (VII.5) Tensor Calculus A. General Remarks In a tdifferentiable manifold with an taffine connection (in particular, in a tRiemannian manifold), we can define an important opera- tor on tensor fields, the operator of covariant differentiation. The tensor calculus is a differ- ential calculus on a differentiable manifold that deals with various geometric objects and differential operators in terms of covariant differentiation, and it provides an important tool for studying geometry and analysis on a differentiable manifold. 
417 B Tensor Calculus B. Covariant Differential Let M be an n-dimensional smooth manifold. We denote by 3'(M) the set of al1 smooth functions on M and by .I;(M) the set of al1 smooth tensor fields of type (r, s) on M. .I6(M) is the set of al1 smooth vector fields on M, and we denote it simply by .I(M). In the fol1owing we assume that an affine connection V is given on M. Then we can define the covariant differential of tensor fields on M with respect to the connection ( 80 Connections). We denote the covariant deriva- tive of a tensor field K in the direction of a vector field X by V x K and the covariant dif- ferential of K by V K. The operator V x maps .I(M) into itself and has the fol1owing properties: (I)Vx+y=V x +V y , Vfx=fV x , (2) Vx(K + K')=VxK + VxK', (3) Vx(K (8) K')=(VxK) (8) K' + K (8) (VxK'), (4)V x f=Xj, (5) V x commutes with contraction of tensor fields, where K and K' are tensor fields on M, X, Yd(M) andfE3'(M). The torsion tensor T and the curvature ten- sor R of the affine connection V are defined by T(X, Y)=V x Y -VyX -[X, Y], R(X, Y)Z = Vx(VyZ) - Vy(VxZ) - V[x. Y]Z for vector fields X, Y, and Z. The torsion ten- sor is of type (1,2), and the curvature tensor is of type (1,3). Some authors define - R as the curvature tensor. We here folIow the conven- tion used in [1-6J, while in [7,8J the sign of the curvature tensor is opposite. The torsion tensor and the curvature tensor satisfy the identities T(X, Y)= -T(Y,X), R(X, Y)= -R(Y,X), R(X, Y)Z+R(Y,Z)X +R(Z,X)Y =(V x T)(Y,Z)+(V y T)(Z, X)+(Vz T)(X, Y) + T(T(X, Y), Z) + T(T( Y, Z), X) + T(T(Z, X), Y), (VxR)(Y,Z)+(VyR)(Z,X)+(VzR)(X, Y) =R(X, T(Y,Z))+R(Y, T(Z, X)) +R(Z, T(X, Y)). The last two identities are called the Bianchi identities. The operators V x and V y for two vector fields X and Yare not commutative in general, and they satisfy the following formula, the Ricci formula, for a tensor field K: Vx(VyK)-Vy(VxK)- V[x.y]K =R(X, y). K, where in the right-hand side R(X, Y) is re- 1572 garded as a derivation of the tensor algebra L,.,.I(M). A moving frame of M on a neighborhood U is, by definition, an ordered set (1' l' ... ,en) of n vector fields on U such that 1'1 (p), ..., en(p) are linearly independent at each point pE U. For a moving frame (1' l' ... ,en) of M on a neigh- borhood U we define n differential I-forms (11, ..., (1n by Oi(e) = 15f, and we call them the dual frame of (1'1, ... , l' n)' For a tensor field K of type (r, s) on M, we define nr+s functions K i , ..i! on U by J 1 .J KJ:..: = K(ejl' ..., ej' (ji" ..., (jir) and call these functions the components of K with respect to the moving frame (e l' ... ,en)' Since the covariant differentials '\le j are tensor fields of type (1, 1), n 2 differential 1- forms wf are defined by Ve j =wf(8)e i , where in the right-hand side (and throughout the following) we adopt Einstein's summation convention: If an index appears twice in a term, once as a superscript and once as a subscript, summation has to be taken on the range of the index. (Some authors write the above equation as dej=wJe i or Dej=wJe i .) We call these I-forms wf the connection forms of the affine connection with respect to the moving frame (1'1' ., . , en)' The torsion forms e i and the curvature forms Qj are defined by e i = d(1i + wj /\ oj, QJ= dwf + w /\ wr These equations are called the strm:ture equa- tion of the affine connection V. If we denote the components of the torsion tensor and the curvature tensor with respect to (1'1, ..., en) by T; and Rikl (= (1i(R(e k , 1'1)1';)), respectively, then they satisfy the relations e i = T i oj /\ (1k 2 Jk , i 1 i k I Q .=-R. kl (1 /\ (1 ) 2 ) . Using these forms, the Bianchi identities are written as de i + wj /\ e j = Qj /\ oj, dQj +w /\ QJ-wf /\ Qt= o. Let K be a tensor field of type (r, s) on M and Kj;:::;, be the components of K with re- spect to (el, ...,e n ). We define the covariant differential DK i ,.. i, and the covariant deriva- h .Js tive Ki,...i! by Jt"'h.k , DKi,...i,= K i , .i! (1k=dK i , ..i!+ '\ K. i , .. J " i 'W Q i, J 1 h 11. ,)s. k JJ . Js  JJ  v=l s - L K;: i r sOJj' v::: 1 
1573 Then Kj; j.k are the components of V K with respect to the moving frame (e l' ... ,en)' Some authors write VkK i , ' J ' instead of K i , i, [5,6]. It , 11 is Using components, the Bianchi identities are written as Rtk + R7ki + RZ ij = 7;7.k + 1Jz'i + 7kL + 7;, 1J% + 7;, 7k + 7;, 7;; , Rtk.l + R?kl.j+ R?lj.k = R?ak 1J + Rj 7;% + R?al 7;.]' The Ricci formula is written as r Kit i, _ Kil.i; = '" Rip KJ a i, )1 j.kl jj J.lk L.. alk it j, v=l s - '\' R a lk Ki , L.. Jt it 1'=1 .....i, a. j + T." K i , .i, kl h j.a' Let (Xl, ..., x n ) be a local coordinate system defined on a neighborhood U of M. Then (a/hI, ..., a/ax n ) is a moving frame of M on U, and we call it the natural moving frame asso- ciated with the coordinate system (Xl, ..., x n ). Components of a tensor field with respect to the natural moving frame (e/ax 1 ,..., a/vx n ) are often called components with respect to the coordinate system (x 1, ... , Xn). We define an n 3 function r1 j on U by wJ = njdxk, where wJ are the connection forms for the natural moving frame. n j are called the coefficients of the affine connection V. The components of the torsion tensor and the curvature tensor with respect to (Xl, ..., x n ) are given by 1J = fj - f1;, R?jk = (fti - akf; + r:ifj - rifta' where Vi = a/ex i . With respect to the foregoing coordinate system , the components K i , .i! k of the covar- 11 J" iant differential V K of a tensor field K of type (r, s) are given by K i , It , i'=oK i , i,+ '\' ri'Ki, hd J it. J L.. JQ 11 j v=l .a i, , _ '\' fa Ki, i, L.. 111 11 a j\ . v=l C. Covariant Differential of Tensorial Forms A tensorial p-form of type (r, s) on a manifold M is an alternating O'(M)-multilinear mapping  I' '------, of .r(M) x '" x .r(M) to .r:(M). A tensorial p- form of type (0,0) is a differential p-form in the usual sense. A tensorial p-form of type (1,0) is often called a vectorial p-form. 1f an affine connection V is provided on M, we define the covariant differential of tensorial forms. Let r:J. be a tensorial p-form of type (r, s). 417 C Tensor Calculus The covariant differential Da of a is a tensorial (p + I)-form of type (r, s) and is defined by (p+ I)Da(Xj, ...,X p +1) 1'+1 '\' . I A = L... (-IY- VX,(r:J.(X I , ...,X i , ...,Xp+d) ,=1 + I (_1)i+ j a ([X i ,XD, i<j Xl' ..., X;, "', X j , ..., X 1'+,), where Xi means that Xi is deleted. If r:J. is of type (0,0), Da coincides with the usual exterior differential dr:J.. The simplest example of a tensorial form is the identity mapping of .r(M), which will be denoted by O. Some authors write this vec- torial form as dp or dx, where p or X expresses an arbitrary point of a manifold. We call 0 the canonical vectorial form of M. The torsion tensor T can be regarded as a vectorial 2-form, and we have 2DO= T. The curvature tensor R can be regarded as a tensorial 2-form of type (1,1), i.e., (X, Y)->R(X, Y)E.r:(M), and the Bianchi identities are written as DT= R /\ (J, DR = 0, where the exterior product R /\ r:J. of R and a tensorial p-form a is defined by (p+ 1)(p + 2)(R /\ r:J.)(Xj, ..., X p + 2 ) =2I (_1)i+j-1 R(Xl.X)a(X"..., Xi'"'' X j , i<j ...,X p + 2 ). In general, 2D 2 a = R /\ a holds for an arbitrary tensorial form r:J.. Let (e" ..., en) be a moving frame of M on a neighborhood U and 0 1 , ... ,on be its dual frames. A tensorial p-form a of type (r, s) is written as r:J.=a i , i'0e i 0...0e i 00 j '0...0()j" 1\,..)s 1 r on U where the ai, i, are the usual differential , 11 J p-forms on U. We call them the components of a with respect to (e 1 , ..., en)' Then the compo- nents of Da, which we denote by Daj: ." are given by Dr:J. i , i!=dr:J. i , J l' .J it , if + '" Wi!, /\ Ctl J L.. a it j v=l a i r s - '" (r) /\ lit.. i r '. L.. JI' 11 a. J v=l Then we have , s D2 l i; = '" Qit /\ [il'  ..i, _ '" Q /\ Ctl i, 11. h L.. a 11 J, L.. Jv 11 a j,' ,'=1 v=l This is an expression of 2D 2 a = R /\ a in terms of components. The components of the ca- nonical vectorial form 0 are the dual forms 0 1 ,..., on of (e l ,..., en), and we have DO i = e i , which means that the components of DO are the torsion forms e i . 
417 D Tensor Calculus D. Tensor Fields on a Riemannian Manifold Let (M,g) be an n-dimensional Riemannian manifold ( 364 Riemannian Manifolds). The fundamental tensor 9 defines a one-to-one correspondence between vector fields and differential I-forms. A differential I-form a which corresponds to a vector field X is de- fined by a(Y)=g(X, Y) for any vector field Y. This correspondence is naturally extended to a one-to-one correspondence between .r(M) and .r:(M), where r + s = r' + s'. Let (e 1 , ..., en) be a moving frame of M on a neighborhood U and gij be the components of 9 with respect to the moving frame. Let (gij) be the inverse matrix of the matrix (gi). The gij are the components of a symmetric contravariant tensor field of order 2. Let Xi be the components of a vector field X and r:J.i be the components of the differential 1- form a corresponding to X. Then Xi and a i satisfy the relations a i = gijX j and Xi = gija j . If Kt are the components of a tensor field K of type (1,2) (here taken for simplicity), then K hij = Kijgah' KJi = Kjgai, Khij = Kbgai gb j , ... , are the components of a tensor field of type (0,3), (2, 1). (3,0), ... , respectively, all of which correspond to K. We call this process of ob- taining the components of the corresponding tensor fields from the components of a given tensor field raising the subscripts and lowering the superscripts by means of the fundamental tensor g. On a Riemannian manifold, we use the tRiemannian connection, unless otherwise stated. The covariant derivative with respect to the Riemannian connection is given by 2g(v'x Y,Z)=Xg(Y,X)+ Yg(X,Z)-Zg(X, Y) +g([X, Y],Z)-g([X,Z], Y) -g(X,[Y,Z]) for vector fields X, Y, and Z. The coefficients of the Riemannian connection with respect to a local coordinate system (Xl, ..., x n ) are usu- ally written as {fj}, called the Christoffel sym- bols, which are given by {fJ = gia(Okgja + (Wka- oagkj)/2. The curvature tensor R of the Rie- mannian connection satisfies the identities R(X, Y)Z+R(Y,Z)X +R(Z,X)Y=O, (VxR)(Y, Z)+(VyR)(Z, X) + (VzR)(X, Y) =0, R(X, Y) = - R(Y, X), g(R(X, Y)Z, W)=g(R(Z, W)X, Y) = -g(Z,R(X, Y)W), g(R(X, Y)Z, W)+g(R(X,Z)W, Y) +g(R(X, W)Y,Z)=O. 1574 In terms of the components, these identities are Rtk + RJki + Rij = 0, Rtk.l + Rkl.j + Rlj. k = 0, Rjk = - Rkj, R hijk = Rjkhi = - R ihjb R hijk + R hjki + R hkij = 0, where R hijk = Rfjkgah' The tRicci tensor S of the Riemannian manifold is a tensor field of type (0. 2) defined by S(X, Y)= trace of the mapping Z->R(Z, X) Y for vector fields X and Y. The components Sji of the Ricci tensor are given by Sji=R'jai' The tscalar curvature k of the Riemannian mani- fold M is a scalar on M defined by k = gjiS ji . The Ricci tensor and the scalar curvature satisfy the identities S(X, Y)=S(Y,X) or Sji = Sij' Sij.k - Sik.j= Rfkj.a, 2gjkSij.k = Oi k . For a moving frame of a Riemannian mani- fold, it is convenient to use an orthonormal moving frame. A moving frame (e],..., en) is orthonormal if e1, "', en satisfy g(e i , e j ) = (jij' Since the components of the fundamental tensor with respect to an orthonormal moving frame are (jij, raising or lowering the indices does not change the values of the components. Some authors write all the indices as sub- scripts. Also they write the dual I-forms, the connection forms, and the curvature forms as (J. w.. and Q.. res p ectively, instead of (Ji, W J i , p )1' JO and QJ. With respect to an orthonormal mov- ing frame, the connection forms wj and the curvature forms QJ satisfy wj+wf=O and QJ+Qf=O. On a Riemannian manifold, the divergence of a vector field and the operators d, (j, and  on differential forms ( 194 Harmonic In- tegrals) can be expressed by using the covar- iant derivatives with respect to the Riemann- ian connection. If Xi are the components of a vector field X with respect to a local coordinate system (Xl, ..., x n ), the divergence div X of X is given by div X = X i . i . Let r:J. be a differential p-form on M. a is written locally in the form a = (1/P!)(X i, .irdxil/\ ... /\ dx ip , where the coefficients ail ..ip are skew- symmetric in all the indices. We call a i, ... ip the components of a with respect to the co- ordinate system. Since a is regarded as an alternating tensor field of type (0, p), we can define the covariant differential Va of a. Then the components of da, (ja, and a are 
1575 given by p+' ( da )" " =" ( _I ) V-l a . A 'p+l  '1 11 11=1 ip+t. it' (()Cl)i t i p - 1 = _gab LLait ip_\.b, ( AOC)i , . = - g ab [ a. . 'p . 'I p . b -" S. a. Ip.a  ll,a t 1 b ip v::::ol - " R C .. a. b . J  alvt w '1 .. c...l p . v<w For a smooth function f and a differential1- form fJ we have M = - )g c i (gijj9 djf), (AfJ)i= _gab[fJi.ab-SiafJb], where g = det(giJ E. Van der Waerden-Bortolotti Covariant Differential Let E be a finite dimensional smooth tvector bundle over a smooth manifold M and f(E) be an \y(M)-module of all smooth sections of E. A connection V' in E is a mapping of .I(M) x f(E) to f(E) such that (1) Vi(+'1)=Vi+Vi'1, (2) Vi(J)=Xf' + fVi, (3) Vi+Y = Vi + W, (4) V;x( = fVi, for X, YE.I(M),, '1Ef(E), and fE (M). Vi is called the covariant derivative of  in the direction X. An element K of .I(M) <8> f(E) is called a tensor field of type (r, s) with values in E (or simply an E-valued tensor field of type (r, s)). K can be regarded as an 3'(M)-linear mapping of .I(M) to f(E) or an 3'(M)-mu1tilinear map- ,.----J , '---------., ping of .I(M) x .., x .I(M) to .I(M) <8> f(E). For a given Ef(E), a mapping X ->Vi de- fines a tensor field of type (0, I) with values in E which we call the covariant differential of , denoted by V'. The curvature tensor R' of V' is a tensor field of type (0,2) with values in E* <8> E (E* is the dual vector bundle of E), and is defined by R'(X, Y)= Vi(W)- W(Vx)- Vix.Y] for any vector fields X and Y and any E f(E). If an affine connection V is given on M, we can define the van der Waerden-Bortolotti covariant derivative VxK for V and V' of a tensor field K of type (r, s) with values in E. It is defined by (\K)(S)= V(K(S))- K(VxS) for any SE.I(M). Ifwe regard Ef(E) as an E- 417 E Tensor Calculus valued tensor field of type (0,0), we have Vx = Vi. The covariant derivative VxR' of the curvature tensor R' of V' is a tensor field of type (0,2) with values in E* <8> E is defined by (VxR')(Y,Z) = Vi(R'(Y,Zg)- R'(Vx Y,Z) - R'(Y, VxZ) - R'(Y, Z)Vi. The Bianchi identity is written as (VxR')(Y, Z)+(VyR')(Z, X)+(VzR')(X, Y) =R'(X, T(Y,Z))+R'(Y, T(Z, X)) + R'(Z, T(X, Y)), where T is the torsion tensor of V. The Ricci formula is given by (Vx(VyK))(S) - (Vy(V x K) )(S) - (V[x. Y]K) (S) =R'(X, Y)'K(S)-K(R(X, Y)'S), where R is the curvature tensor of V, K E .I;(M) <8> f(E) and SE.I(M). In the following we assume that the fiber of E is of finite dimension m. A moving frame of E on a neighborhood U of M is an ordered set (j, .. . , m) of local sections  1, ''', m on U such that l (p),..., m(P) are linearly independent at each point p of U. Let (e1' ..., en) be a moving frame of M on U. Then an E-valued tensor field K of type (r, s) is locally written as K J i, J i'"ei <8>...<8>ei <8>Oj'<8>...<8>Oj'<8>.. 1 sir where (j!, ..., en are the dual I-forms of (ej, ..., en)' The nr+sm functions KJ::: J: on U are called the components of K with respect to (e 1 , ...,e n ) and (l' ""m)' We define the con- nection forms uJ'p" of the connection V' by V' p = uJ'p <8> ". Then the curvature forms D.'p" are defined by D.'p" = duJ'p" + uJ';," /\ uJ'p' = 1 RpjiOj /\ e i , where R Pji are the components of the curva- ture tensor R', i.e., R'(e j , eJp = Rpji"' For a given tensor field K of type (r, s) with values in E, the mapping X -> V x K de- fines a tensor field V K of (r, s + 1) with val ues in E which we call the van der Waerden- Bortollotti covariant differential of K. Then if K i , if" are the components of K with respect 11 h to (e 1 , ..., en) and (1' "', m)' the components KJ;:. J\ of VK are given by , Ki, .if ek=dK i , if" + " Ki" ....i,"w'" 11 Js,k h. is  h. h a '=1 S _  Kl i,a. (J)? + Kl i!,PU)'a.  it a is Jv it J P v=l Let f be a smooth mapping of M into a smooth manifold M'. The differential f*(or df) can be regarded as a tensor field of type (0, 1) with values in f*T(M'). Assume that M (resp. M') has a Riemannian metric g (resp. g'). We denote the Riemannian connection of M by V. 
417 F Tensor Calculus From the Riemannian connection of M' a connection V' in f * T( M') can be defined. Let (y 1, ... , ym) be a local coordinate system of M' on a neighborhood V and (Xl, ..., x n ) be a local coordinate system on a neighborhood U of M such that J(U)c: V. Put ,(p)=(o/cy')U(p)) for a point pE U. Then (j,"" m) is a moving frame of [*T(M'). The components of J* with respect to W/ox" ...,a/ox") and (l' ""m) are given by F(p)=(8y'/i3x i )(p). The Laplacian A[ of the mapping J is a tensor field of type (0,0) with values in J * T( M') and is defined by (Aft = gij [tj. If At = 0, the mapping J is called a harmonic mapping ( 195 Harmonic Mappings). F. Tensor Fields on a Submanifold Consider an n-dimensional smooth mani- fold M immersed in an (n+m)-dimensional Riemannian manifold ( M, g). If we denote the immersion M -> M by J, then g = J *g is a Riemannian metric on M, and we denote its Riemannian connection by V. The induced bundle J * T( M) splits into the sum of the tangent bundle T(M) of M and the normal bundle T1.(M). The Riemannian connection on M induces connections in J*T( M) and in T1.(M) which are denoted by V and V1., re- spectively. The van der Waerden-Bortolotti covariant derivative for V and V1. is denoted by V. For vector fields X and Yon M, the tan- gential part of V x Y(here we regard Y as a section of J*T( M) ) is V x Y, and we denote the normal part ofV x Y by h(X, Y). Then h is a symmetric tensor field of type (0,2) with values in T1.(M), and we call h the second funda- mental tensor of the immersion.f For  E r(T1.(M)), the tangential part ofVx (here  is also regarded as a section of J*T( M)) is denoted by - AX and the normal part of Vx is vf. Thus we have VxY=VxY+h(X, Y), Vx=-AX+Vx. h and A are related by g(h(X, Y),) = g(AX, Y). We have the following formulas, called the equations of Gauss, Codazzi, and Ricci: g(R(X, Y)Z, W)=g(R(¥, Y)Z, W) +g(h(X,Z),h(Y, W)) -g(h(X, W),h(Y,X)), g(R(X, Y)Z,)=g(Vxh)(Y,Z),) -g((Vyh)(X, Z), ), g(R(X, Y),'1)=g(R1.(X, Y),'1) +g([A,AJX, Y), 1576 for X, Y. Z, WEX(M) and, '1EqT1.(M)), where R, R, and R1. are the curvature tensors of V, V, and V1., respectively. For the manifold M immersed in M, we use a moving frame (e" ..., en, 1' ..., "') such that (e 1 , ..., en) is an orthonormal moving frame of M on a neighborhood U and (j,..., m) is a moving frame of T1.(M) on U with g(" p)= 6,p' Then we can define the connection forms wJ for V and w p for V1.. If we extend (e 1 , ..., en, 1' ..., m) to an orthonormal moving frame (ej, ..., e n + m ) of M such that ei(p) = ei(p) (i = I, ..., n) and e"+,(p) = Ap) (IX= I,..., m) for pE U, then the restriction .r*(jA and J*w of the dual I-forms and the connection forms of M with respect to (1.'1, ..., e"+m) satisfy the relations J*7Ji=()i, .r*fj"+'=O, J*wj=wJ, f*wp=OJp, f*w;+cx= Ihijf)j, j where hu are the components of the second fundamental tensor h with respect to (e 1 , ...,e n , j, ... , m)' The components hij.k of the covariant dif- ferential Vh of h are defined by hij.J)k =dhij-h;jwf -hfawj + htwp. In terms of the components, the equations of Gauss, Codazzi, and Ricci are given by R hijk = R hijk + I (hijhk - hfkhj), , RUk = hfk.j - hij.k, Rpjk = R1.!XPjk- I(hj.hk-hf"h;d. a Let (Xl, ..., x") be a local coordinate system on a neighborhood U of M and (i, ..., yn+m) be a local coordinate system on a neighbor- hood V of M such that J(U) c: V. Regarding the differential J* of the immersion J as a ten- sor field of type (0, I) with values in J*T( M) , we denote the components of J* with re- spect to (Xl, ..., x") and (yl, ..., yn+m) by B (i = I, ..., n; A = I, ... , n + m). Then we have B = i3yA/i3x i . We denote by V' the van der Waerden-Bortolotti covariant derivative for V and V. Then the components Bj or V'f* are given by Bj=i3jB - {jJB: + Bi Bf{l}, where i3 j =i3/i3X j , {J;}, and {(} are the Chris- toffel symbols of the Riemannian rnetrics g and g, respectively. Let ("..., m) be an orthonormal moving frame of T1.(M) on U and : be the compo- nents of , with respect to (i , ... , y"+m). Then we have Bj=hij:, where h ij are the components of the second 
1577 fundamental tensor with respect to (a/ax], ..., i1/cxn) and (1'"'' m)' A tensor field K with values in TJ.(M) can be regarded as a tensor field with values in I*T(M), and VK is the normal component of \7' K. For example, if we regard the second fundamental tensor h as a tensor field with values in I*T( M) , the components of h with respect to the coordinates (Xl, "', x n ) and (i, ..., yn+m) are equal to Bj' and we have hij.k = BjkgAB' References [IJ E. Cartan, Lei;on sur la geometrie des espaces de Riemann, Gauthier-Villars, second edition, 1946; English translation, Geometry of Riemannian Spaces, Math-Sci Press, 1983. [2J B.- Y. Chen, Geometry of submanifolds, Dekker, 1973. [3J S. Helgason, Differential geometry, Lie groups, and symmetric spaces, Academic Press, 1978. [4J S. Kobayashi and K. Nomizu, Founda- tions of differential geometry, Interscience, I, 1963; II, 1969. [5J J. A. Schouten, Ricci-calculus, Springer, second edition, 1954. [6J K. Yano, Integral formulas in Riemannian geometry, Dekker, 1970. [7J G. de Rham, Varietes differentiables, Ac- tualites Sci. Ind., Hermann, second edition, 1960. [8J L. P. Eisenhart, Riemannian geometry, Princeton Univ. Press, second edition, 1949. [9J R. L. Bishop and S. I. Goldberg, Tensor analysis on manifolds, Macmillan, 1968. 418 (IX.20) Theory of Singularities A. Introduction Let fl J2, .., , f be tholomorphic functions defined in an open set U of the complex space e. Let X be the analytic set Il- 1 (0) n... n {..-1(0). Let ZoEX, and let Y1, ...,Ys be a sys- tem of generators of the ideal .J'(X)zo of the germs of the holomorphic functions which vanish identically on a neighborhood of Zo in X. Zo is called a simple point of X if the matrix (i1y;/iJz j ) attains its maximal rank, say k, at Z = zoo In thi.s case, X i.s a tcomplex manifold of dimension n - k near Zoo Otherwise, Zo is called a singular point of X. 418 C Theory of Singularities B. Resolution of Singularities Let X be a complex analytic space, and let Y be its singular locus. A resolution of the sin- gularity of X is a pair of a complex manifold X and a proper surjective holomorphic mapping n: X -> X such that the restriction n I x -n I (Y) is biholomorphic and X _n- 1 (y) is dense in X. H. Hironaka proved that there exists a reso- lution for any X such that n- 1 (Y) is a divisor in X with only tnormal crossings [16,17]. Suppose that a compact connected ana- lytic subset Y of a complex manifold X has a tstrongly pseudoconvex neighborhood in X. Then the contraction X/Y naturally has a structure of a tnormal complex analytic vari- ety such that the projection X ->X/Y is a resolution of X/Y (H. Grauert [14J). C. Two-Dimensional Singularities Let X be a normal 2-dimensional analytic space. Then the singular points of X are discrete. Among the resolutions of X, there exists a unique resolution n:X ->X with the following universal property: For any resolution n':X'-> X, there exists a unique mapping p: X' -> X with n' = nap. This resolution is called the minimal resolution. Let n: X -> X be a resolution of a singular point x of X, and let Ai (i= 1, ...,m) be the irreducible components of n- 1 (x). The matrix (Ai' Aj) of the tintersection numbers is known to be negative definite (P. Du Val [12J). The resolution n: X -> X is called good if (i) each Ai is nonsingular, (ii) Ai n Aj (i =F j) is at most one point and the intersection is trans- verse and (iii) no three A/s meet at a point. For a given good resolution n:X ->X, we associate a diagram in which the vertices Vi (i= 1, ...,m) correspond to Ai (i= 1, ...,m) and Vi and v j are joined by a segment if and only if A i nA j =F0. The geometric genus Pg(X, x) of a singular point x E X is the dimension of the tstalk at x of the first direct image sheaf R 1 n*0 x , where n:X ->X is a resolution of XEX and Ox is the tstructure sheaf of X. The definition is inde- pendent of the choice of the resolution, and pq(X, x) is a finite integer. Among the positive cycles of the fOrm Z = L7=1 niA i (i.e., ni  0) such that Z. Ai < 0 for each i = 1, ... ,m, there exists a smallest one Zo, which is called the fundamental cycle [3]. (1) Rational singularities. A singular point x of X is called rational if pg(X, x) = O. (The sin- gularity (X,x) is also called rational even when dim X  3 if the direct image sheaf Rin*(!,x = 0 for i>O.) 
418 D Theory of Singularities For a rational singularity XEX, the tmulti- plicity of X at x equals - zij and the local embedding dimension of X at x is - zij + 1. Hence a rational singularity with multiplicity 2, which is called a rational double point, is a hypersurface singularity. The following weighted homogeneous polynomials ( Sec- tion D) give the complete list of the defining equations up to analytic isomorphism: An:xn+1 + y2 +Z2, weights (1/(n+l), 1/2, 1/2), n;;;,l; Dn: x "-' + xy 2 +Z2, weights (1/(n-l), (n-2)/2(n-l), 1/2), n;;;,4; E6:X4 + l +Z2, weights (1/4, 1/3, 1/2); E 7 :X 3 y+y3+ Z 2, weights (2/9, 1/3, 1/2); Es:x 5 +l+z 2 , weights (1/5, 1/3, 1/2), where the labels appearing at the left are given according to the coincidence of the diagram of the respective minimal resolutions and the tDynkin diagrams. Rational double points have many different characterizations [11]. The generic part of the singular locus of the unipotent variety of a tcomplex simple Lie group G (= the orbit of the subregular tunipo- tent elements in G) is locally expressed as the product of a rational double point and a poly- disk. The tuniversal deformation of a rational double point and its tsimultaneous resolution are constructed by restricting the following diagram on a transverse slice to the subregular unipotent orbit (Brieskorn [7J; [34J): y . G 1 1 T . T/W where T is a tCartan subgroup of G with the action of the Weyl group W, G-- T/W is the quotient mapping by the tadjoint action of G and Y = {(x, B)I XE G and B is a tBorel sub- group of G with x E B}, and other morphisms are defined naturally so that the diagram commutes. Here, Y -- T is the simultaneous resolution of the morphism G-- T/W. (2) Quotient singularities. A singular point x E X is called a quotient singularity if there exists a neighborhood of x which is analyti- cally isomorphic to an orbit space U /G, where U is a neighborhood of 0 in C 2 and G is a finite group of analytic automorphisms of U with the unique fixed point O. The quotient singularities are rational, and their resolutions 1578 have been well studied [6]. U /G has a rational double point at 0 if and only if G is conjugate to a nontrivial finite subgroup of SU(2). (3) Elliptic singularities. The singularity (X, x) is called minimally elliptic if Pg(X, x) = 1 and (X, x) is Gorenstein [23]. The following are examples of minimally elliptic singularities. A singular point x E X is called simply ellip- tic if the exceptional set A of the minimal resolution is a smooth telliptic curve [33]. WhenA 2 =-I, -2, -3, (X, x) is a hyper- surface singularity given by the following weighted homogeneous polynomials: E6 :x 3 + y3 +z3 +axyz, weights (1/3, 1/3, 1/:1), A 2 = -3; E7: x 4 + y4 + Z2 + axyz, weights (1/4, 1/4, 1/2), A 2 = -2; Es :x 6 + y3 +Z2 + axyz, weights (1/6, 1/3, 1/2), A 2 = -1, (4) Cusp singularities. A singular point x E X is called a cusp singularity if the exceptional set of the minimal resolution is either a sin- gle rational curve with a tnode or a cycle of smooth rational curves. Cusp singularities appear as the boundary of tHi1bert modular surfaces [18]. The hypersurface CllSP singular- ities are given by the polynomials T p . q ,': x P + yq + z' + axyz, where l/p+ l/q+ l/r< 1 and a#O. D. The Milnor Fibration for HYPIrsurface Singularities Let V be an analytic set in CN, and take a point ZoE V. Let S,=S(zo,£) be a (2N -1)- dimensional sphere in C N with center Zo and radius F. > 0, and let K, = vn S,. If £ is suffi- ciently small, the topological type of the pair (S" K,) is independent of £ [27]. By virtue of this fact, the study of singular points consti- tutes an important aspect of the application of topology to the theory of functions of several complex variables. A singular point Zo of V is said to be isolated if, for some open neighborhood W of Zo in CN, wn V - {zo} is a smooth submanifold of W- {zo}. In that case, K, is a closed smooth sub- manifold of S" and the diffeomorphism type of (S" K,) is independent of (sufficiently small) f. > O. So far, the topological study of such singular points has been primarily focused on isolated singularities. When V is a plane curve, that is, N = 2 and r= 1, all the singular points of V are isolated, and the submanifold K, of the 3-sphere S, can be descnbed as an iterated torus link, where type numbers are 
1579 completely determined by the tPuiseaux ex- pansion of the defining equation I of Vat the point Zo [5]. In 1961, D. Mumford, using a resolution argument, showed that if an alge- braic surface V is tnormal at Zo and if the closed 3-manifold K, is simply connected, then K, is diffeomorphic to the 3-sphere and Zo is nonsingular [29]. The following theorem in the higher-dimensional case is due to E. Bries- korn [8J (1966): Every thomotopy (2n -I)-sphere (n #2) that is a boundary of a tIT-manifold is dif- feomorphic to the K, of some complex hyper- surface defined by an equation of the form l(z) = Z1 + ... + z'tl1 = 0 at the origin in C"+I, provided that n # 2. The hypersurface of this type is called the Brieskorn variety. Inspired by Brieskorn's method, 1. W. Milnor developed topological techniques for the study of hyper- surface singularities and obtained results such as the Milnor fibering theorem, which can be briefly stated as follows: Suppose that V is defined by a single equa- tion l(z) = 0 in the neighborhood of Zo E C n + I. Then there is an associated smooth tfiber bundle cp: S, - K,->Sr, where cp(Z) = l(z)/II(z)1 for zES,-K,. The fiber F=cp-1(p) (pES 1 ) has the homotopy type of a finite CW -complex of dimension n, and K, is (n - 2)-connected. Suppose that Zo is an isolated critical point of f Then F has the homotopy type of a tbou- quet of spheres of dimension n [27]. The MiI- nor number J.1(f) of I is defined by the nth Betti number of F, and it is equal to dim c (('c,+I. z ) (01/o z 1'"'' iJl/czn+d, where (I;'c+1.zo is the ring of the germs of analytic functions of n + 1 variables at z = zoo The Milnor monodromy h* is the automorphism of Hn(F) that is induced by the action of the canonical generator of the fundamental group of the base space SI. The tLefschetz number of h* is zero if ZO is a singular point of V. Let A(t) be the charac- teristic polynomial of h*. Then K, is a homol- ogy sphere if and only if A(1) = ::!:: 1 [27]. It is known that A(t) is a product oftcyclotomic polynomials. The diffeomorphism class of (S" K,) is com- pletely determined by the congruence class of the linking matrix L(e i , e j ) (1  i,j  J.1(f), where e 1 , ..., e(f) is an integral basis of Hn(F) and L(ei,e) is the linking number [21, 10]. The Milnor fibration is also described in the following way. Let E(s, 6) be the intersection of I-I (Dt) and B(s), the open disk of radius s and center zo, where Dt is {11 EC\ 0 < 1111 < 6}. The restriction of J to E(s, 6) is a tlocally triv- ial fibration over Dt if 6 is sufficiently smaller than I: [27]. Let [(z) be an analytic function; suppose that.f (0) = 0 and let L PE N'+1 apz P be the Taylor expansion of [ at z = O. Let r + (f) be the con- 418 E Theory of Singularities vex hull of the union of {p+(R+)n+1} for pENn+! c Rn+1 with ap#O, where R + = {XE R I x  A}, and let r(f) be the union of com- pact faces of r + (f). We call r(f) the Newton boundary of.r in the coordinates z l' ... 'Zn+!' For a closed face A of r(f) of any dimension, let 1",(z) = L pE ", apz P We say that I has a non- degenerate Newton boundary if (i1f",/ iJ z l' " . , (!j",/DZn+1) is a nonzero vector for any ZE(C*)"+' and any A E r(f). Suppose that I has a non- degenerate Newton boundary and 0 is an isolated critical point of f Then the Milnor fibration of I is determined by r(j) and J.1(f), and the characteristic polynomial can be ex- plicitly computed by r( f) [22,38]. l(z) is called weighted homogeneous if there exist positive rational numbers r 1 , ...,rn+l, which are called weights, such that a p = 0 if Li:tJ Piri # 1. An analytic function l(z) with an isolated critical point at 0 is weighted homo- geneous in suitable coordinates if and only if .r belongs to the ideal ([/oz I' ..., iJI/Dzn+d (K. Saito [32J). Suppose that [(z) is a weighted homogeneous polynomial with an isolated critical point at O. Then the Milnor fibration of r is uniquely determined by the weights, and J.1( f) = TIi:J U -1). The surface r l (0) for n = 2 is a rational double point if and only if L1 r i > 1. E. Unfolding Theory An unfolding of a germ of an analytic func- tion [(z) at 0 is a germ of an analytic function F(z, t), where t Ecm (m is finite) such that F(z, 0) = l(z). We assume that I has an isolated crit- ical point at O. Among all the unfoldings of {, there exists a universal one, in a suitable sense, that is unique up to a local analytic isomor- phism. It is called the universal unfolding of f [36,37,26J ( 51 Catastrophe Theory). Ex- plicitly it can be given by F(z, t)= l(z) + t 1 CPI (z) +... +tcP(z), where CPi(Z) (i= 1, ".,J.1) are holomorphic functions which form a C-basis of the Jacobi ring (Yc'''''.o/(O[/OZI' ..., olji'zn+d (J.1 = J.1(f)). In the universal unfolding F(z, t) off, the set of points (zo, to) such that F(z, to) has an iso- lated critical point at Zo with the Milnor num- ber J.1( f) and F(zo, to) = 0 forms an analytic set at (z, t) = O. The modulus number of I is the dimension of this set at O. This set is some- times called the J.1-constant stratum. Let g be a germ of an analytic function. g is said to be ad- jacent to.r (denoted by I ->g), if there exists a sequence of points (z(m), t(m)) in C"+1 x C that converges to the origin such that the term of F(z, t(m)) at z(m) is equivalent to g. Adjacency relations are important for the 
418 F Theory of Singularities understanding of the degeneration phenomena of functions. The unfolding theory can be considered in exactly the same way as that for the germ of a real-valued smooth function that is finitely determined [36,26]. The germs of analytic functions with modulus number 0, 1, and 2 are cal1ed simple, unimodular, and bimodular, respectively. They were classified by V. I. Arnold [1] ( Ap- pendix A, Table 5.V). Simple germs corre- spond to the equations for the rational double points, and unimodular germs define simply elliptic singularities or cusp singularities. Every unimodular or bimodular germ defines a sin- gularity with Pg = 1. F. Picard-Lefschetz Theory Let f(z) be a holomorphic function such that f(O) = 0 and 0 is an isolated critical point with the Milnor number fl.. Let F(z, t) be a universal unfolding of fat O. Let f: E(F., (j)->Dt be the Milnor fibration of f by the second description in Section D. There exists a positive number r and a codimension 1 analytic subset A (cal1ed the bifurcation set) of B'(r), the open disk of radius r with the center 0 in the parameter space C, such that for any to E B'(r) - A, fro = F I B(e) x '0 has fl. different nondegenerate crit- ical points in B(F.). Let PI' ..., P be the critical points of!,o' For each Pi' one can choose local coordinates (Y1, ..., Yn+d so that fro(Y)= fro(pJ + yi +... + Y;+I' Such an fro is called a Morsi- fication of 1- Let B i be a smal1 disk with center Pi in C"+1. Then for any qi which is near enough to fro(pJ, the intersection fr: 1 (qJ n B i is diffeomorphic to the tangent disk bundle of the sphere sn. The vanishing cycle e i is the corresponding n- dimensional homology class of fr: 1 (q,Jn B i . (We fix qi') The self-intersection number of e i is given by _ { 2( -1 )"(n-1 )(2 , n even, (e i , e,) - 0 dd , no. For a sufficiently small toEB'(r)-A, one has the fol1owing: (i) Ifro(p,J1 < 15; (ii) the restriction of fr to E is a fiber bundle over D', where D' ={;ECllwl":;(j, and w#fro(pJ for i= 1, ...,fl.} and E = fr: 1 (D') n B(F.); (iii) the restriction of the above fibration to {w II w 1= (j} is equivalent to the restriction of the Milnor fibration of f to {w Ilwl =(j}. Let Wo be a fixed point of D', and let F=.!,:l(wo)nE. Then F is diffeomor- phic to the Milnor fiber of f Let Ii be a simple path from Wo to qi' and let 1'i be the loop Iw- fro(p,J1 = Iqi - fro(pJI. We suppose that the union of the Ii is contractible to woo By paral1el translation of the vanishing cycle e i along Ii' we consider e i E Hn(F). The collection {ei I i = 1580 1, ... ,fl.} is an integral basis of Hn( F), which is called a strongly distinguished basis (Fig. 1). Now let hi be the linear transformation of Hn(F) that is induced by the parallel trans- lation along li1'ili-1. The Picard-Ldschetz for- mula says that h,(e)=e-( _1)"(n-1)(2(e,e,) 'e i for eEHn(F). Here ( , > is the intersection number in Hn(F). For n even, hi is a treflection. The Milnor monodromy h* of f is equal to the composition hI .., h under a suitable ordering of the hi' The subgroup of the group of linear isomorphisms of Hn( F) generated by h 10 ... ,h is called the total monodromy group. When f is a simple germ and n =2mod4, the total monodromy group is isomorphic to the tWeyl group of the corresponding Dynkin diagram. Even-dimensional simple singular- ities are the only ones for which the mono- dromy group is finite. These are also char- acterized as the singularities with definite intersection forms. E  B(e) 1,,-1 (q,) !I., l )f' q, . . !, qo !',(P,) . D, Fig. I G. Stratification Theory The notion of Whitney stratification was first introduced by H. Whitney to study the sin- gularities of analytic varieties [39] and was developed by R. Thorn for the general case [37]. Let X and Y be submanifolds of the space R n . We say that the pair (X, Y) satisfies the Whitney condition (b) at a point yE Y if the following holds: Let Xi (i = 1,2,...) and Yi (i = 1,2,...) be sequences in X and Y, respec- tively, that converge to y. Suppose that the tangent space TXiX converges to a plane Tin the corresponding Grassmannian space and the secant XiYi converges to a line L. Then L c T. We say that (X, Y) satisfies the Whitney condition (b) if it satisfies the Whitney con- 
1581 clition (b) at any point y E Y. Let h be a local diffeomorphism of a neighborhood of y. One can see that (h(X), h(Y») satisfies the Whitney condition (b) at h(y) if (X, Y) satisfies it at y. Thus the Whitney condition can be considered for a pair of submanifolds X and Y of a mani- fold M using a local coordinate system. Let S be a subset of a manifold M, and let Y be a family of submanifolds of M. Y is called a Whitney prest ratification of 5 if Cf is a locally finite disjoint cover of 5 satisfying the follow- ing: (i) For any X E ,Cf', the frontier X - X is a union of YE.'/'; (ii) for any pair (X, Y) (X, YE .Cf'), the Whitney condition (b) is satisfied. A submanifold X in if' is called a stratum. There exists a canonical partial order in if that is defined by X < Y if and only if Xc Y - y. Let V be an analytic variety, and let .'/' be an analytic stratification of V that satisfies the frontier condition (i). Then there exists a Whitney prestratification .'f' that is finer than .Cf' (Whitney [39]). For a given Whitney prestratification if, one can construct the following controlled tubular neighborhood system: For each X E,Cf', a ttubular neighborhood I Txl of X in M and the projection n x: 1 Tx 1-> X and a tubular func- tion Px: 1 Txl-> R + (= the square of a norm under the identification of I Txl with the tnor- mal disk bundle of X) are given such that the commutation relations nx' ny(m) = nx(m), PXny(m) = px(m) for mEM, X < Y. are satisfied whenever both sides are defined. By virtue of this, the notions of vector fields and their integral curves can be defined on a Whitney prestratified set so that several im- portant results on a differentiable manifold can be generalized to the case of stratified sets. For example, the following is Thom's first isotopy lemma: Let M and P be differentiable manifolds, and let (5, .5P) be a Whitney pre- stratified subset of M. Let f: 5 -> P be a con- tinuous mapping that is the restriction of a differentiable mapping from M to P. Suppose that the restriction of f to each stratum X of .'f is a proper submersion onto P. Then f: 5-> P is a fiber bundle [37]. H. b-Functions Let {(z) be a germ of an analytic function in C"+1 with frO) = O. The h-function of fat 0 is the monic polynomial bJ(s) of lowest de- gree among all polynomials his) with the following property [4,20]: There exists a differential operator P(z, o;oz, 5), which is a polynomial in s, such that b(s)j'(z) = P(z, c /iJz, 5) f' H (z). Since bJ(s) is always 418 Ref. Theory of Singularities divisible by 5+ 1, we define bJ(s)=bJ(s)/(s+ 1). All the roots of hJ(s)=O are negative rational numbers (M. Kashiwara [20]. When j has an isolated critical point at 0, the set {exp(2niIX 1 IX is a root of bJ(s) = O} coincides with the set of eigenvalues of the Milnor monodromy [25]. The name "b-function" is due to M. Sato. He first introduced it in the study of tprehomo- geneous vector spaces. Some authors call it the Bernstein (Bernshtein) polynomial. I. Hyperplane Sections Let V be an algebraic variety of complex di- mension k in the complex projective space pn. Let L be a hyperplane that contains the singular points of V. Then the trelative homo- topy group nJ V, vn L) is zero for i < k. Thus the same assertion is true for the trelative homology groups (S. Lefschetz [24]; [28]). Let f be a holomorphic function defined in the neighborhood of 0 E C"+! and frO) = O. Let H be the hypersurface f- I (0). There exists a tZariski open subset U of the space ( = pn) of hyperplanes such that for each LE U, there exists a positive number F. such that nJB(r)- H, (B(r)-H)nL)=Ofor i<n and O<rF., where B(r) is a disk of radius r (D. T. Le and H. Hamm [15]). This implies the following theorem of Zariski: Let V be a hypersurface of pn, and let p2 be a general plane in pn. Then the fundamental group of pn - V is isomorphic to the fundamental group of p2 - C, where C = vn p2. The fundamental group of p2 - Cis an Abelian group if C is a nodal curve [9, 13]. Suppose that f has an isolated critical point at O. Let /1(n+l) be the Milnor number /1(f). Take a generic hyperplane L. The Milnor number of {I L is wen defined, and we let /1(n) = /1(f 1 L)' Similarly one can define /1(i) of f and let /1* = (Jt(n+ 1), /1(n), ... , /1(1 ». Let j,(z) be a de- formation of f Each j, has an isolated critical point at 0, and t is a point of a disk D of the complex plane. Let W={(z,t)Ij,(z)=O} and D'={O} xD. W-D' and D' satisfy the Whitney condition (b) if and only if /1*(f,) is invariant under the deformation [35]. The Whitney condition (b) implies topological triviality of the deformation. References [1] V. I. Arnol'd, Local normal forms offunc- tions, Inventiones Math., 35 (1976),87-109. [2] V. I. Arnol'd, Normal forms for functions near degenerate critical points, the Weyl group of A k , Ok' Ek and Lagrangian singularities, Functional Ana!. Appl., 6 (1972), 254-272. (Original in Russian, 1972.) 
419 A Thermodynamics [3J M. Artin, On isolated rational singularities of surfaces, Amer. J. Math., 88 (1966),129-136. [4J I. N. Bernshtein, The analytic continuation of generalized functions with respect to para- meter, Functional Anal. Appl., 6 (1972), 273- 285. (Original in Russian, 1972.) [5J K. Brauer, Zur Geometrie der Funktionen zweier komplexen Veranderlichen III, Abh. Math. Sem. Hamburg, 6 (1928),8-54. [6J E. Brieskorn, Rationale Singularitaten Komplexer Flachen, Inventiones Math., 4 (1968),336-358. [7J E. Brieskorn, Singular elements of semi- simple algebraic groups, Actes Congres Int. Math., 1970, vol. 2, 279-284. [8J E. Brieskorn, Beispiele wr Differential- topologie, Inventiones Math., 2 (1966),1-14. [9J P. Deligne, Le groupe fondamental du complement d'une courbe plane n'ayant que des points doubles ordinaires est abelian, Sem. Bourbaki, no. 543, 1979-1980. [IOJ H. Durfee, Fibered knots and algebraic singularities, Topology, 13 (1974), 47-59. [11J A. H. Durfee, Fifteen characterizations of rational double points and simple critical points, Enseignement Math., 25 (1979), 131- 163. [12J P. Du Val, On isolated singularities of surfaces which do not affect the condition of adjunction I-III, Proc. Cambridge Philos. Soc., 30 (1933/34), 453-465, 483-491. [13J W. Fulton, On the fundamental group of the complement of a node curve, Ann. Math., (2) 111, (1980),407-409. [14J H. Grauert, Ober Modifikationen und exzeptionelle anaytische Mengen, Math. Ann., 146 (1962),331-368. [15J H. Hamm and D. T. Le, Un theoreme de Zariski du type de Lefschetz, Ann. Sci. Ecole Norm. Sup., 3, 1973. [16J H. Hironaka, Resolution of sing ulari ties of an algebraic variety over a field of charac- teristic zero, Ann. Math., (2) 79 (1964), 109- 326. [17J H. Hironaka, Bimeromorphic smoothing of a complex analytic space, Acta Math. Viet- namica, 2, no. 2 (1977),106-168. [18J F. Hirzebruch, Hilbert modular surfaces, Enseignement Math., 19 (1973),183-281. [19J P. Holm, Real and complex singularities, Sijthoff and Noordhoff, 1976. [20J M. Kashiwara, B-functions and holo- nomic systems, Inventiones Math., 38 (1976), 33-54. [21J M. Kato, A classification of simple spin- nable structure on a I-connected Alexander manifold, J. Math. Soc. Japan, 26 (1974), 454- 463. [22J A. G. Kouchnirenko, Polyedre de New- ton et nombres de Milnor, Inventiones Math., 32 (1976), 1-31. 1582 [23J H. Laufer, On minimally elliptic singular- ities, Amer. J. Math., 99 (1977),1257-1295. [24J S. Lefschetz, L'analysis situs et la geo- metrie algebrique, Gauthiers-Villars, 1924. [25J B. Malgrange, Le polynome de Bernstein d'une singularite isolee, Lecture notes in math. 459, Springer, 1976,98-119. [26] 1. N. Mather, Stability ofCex'-mappings. I, Ann. Math., (2) 87 (1968),89-104, II, (2) 89 (1969),254-291; III, Publ. Math. Inst. HES, 35 (1970),301-336; IV, Lecture notes in math. 192, Springer, 1971,207-253. [27J 1. W. Milnor, Singular points of complex hypersurfaces, Ann. math. studies 61, Prince- ton Univ. Press, 1968. [28J 1. W. Milnor, Morse theory, Ann. math. studies 51, Princeton Univ. Press. 1963. [29J D. Mumford, The topology of normal singularities of an algebraic surface and a criterion for simplicity, Publ. Math. Inst. HES, 9 (1961),5-22. [30J F. Pham, Formules de Picard-Lefschetz generalisees et ramification des integrales, Bull. Soc. Math. France, 93 (1965), 333-367. [31J E. Picard and S. Simart, Traite des fonc- tions algebriques de deux variables I, Gauthier- Villars, 1897. [32J K. Saito, Quasihomogene isolierte Sin- gularitiiten von Hyperfliichen, Inventiones Math., 14 (1971),123-142. [33J K. Saito, Einfach elliptische Singulari- tiiten, Inventiones Math., 23 (1974), 289-325. [34J P. Slodowy, Simple singularities and simple algebraic groups, Lecture notes in math. 815, Springer, 1980. [35J B. Teissier, Cycles evanescents, sections planes et conditions de Whitney, Asterisque, 78 (1973), 285-362. [36J R. Thorn, La stabilite topologique des applications polynomiales, Enseignement Math.. 8 (1962), 24-33. [37J R. Thorn, Ensembles et morphismes stratifies, Bull. Amer. Math. Soc., 75 (1969), 240- 284. [38J A. N. Varchenko, Zeta function of the monodromy and Newton's diagram, Inven- tiones Math., 37 (1976), 253-262. [39J H. Whitney, Tangents to an analytic variety, Ann. Math., (2) 81 (1964). 496-549. [40J O. Zariski, Algebraic surfaces, Erg. math. 61, Springer, 1971. 419 (XX.18) Thermodynamics A. Basic Concepts and Postulates Thermodynamics traditionally focuses its attention on a particular class of states of a 
1583 given system called (thermal) equilibrium states, although a more recent extension, called the thermodynamics of irreversible processes, deals with certain nonequilibrium states. In a simple system, an equilibrium state is completely specified (up to the shape of the volume it occupies) by the volume V (a posi- tive real number), the mole numbers Nj, ..., N, (nonnegative reals) of its chemical compo- nents, and the internal energy U (real). (More variables might be needed if the system were, e.g., inhomogeneous, anisotropic, electrically charged, magnetized, chemically not inert, or acted on by electric, magnetic, or gravitational fields.) This means that any of the quantities associated with equilibrium states (called thermodynamical quantities) of a simple sys- tem under consideration is a function of V, N 1 , ...,N" and U. When n copies of the same state are put next to each other and the dividing walls are re- moved, V, N 1 , ..., N" and U for the new state will be n times the old values of these variables under the assumptions that each volume is sufficiently large and that the effects of the boundary walls can be neglected. Thermo- dynamical quantities behaving in this manner are called extensive. Those that are invariant under the foregoing procedure are called inten- sive. More precisely, the thermodynamic vari- ables are defined by homogeneity of degree 1 and 0 as functions of V, N" ... , N" and U. Bya shift of the position of the boundary (called an adiabatic wall if energy and chemical substances do not move through it) or by transport of energy through the boundary (called a diathermal wall if this is allowed) or by transport of chemical components through the boundary (called a permeable memhrane) (in short, by thermodynamical processes), these variables can change their values. If these shifts or transports are not permitted (especially for a composite system consisting of several simple systems, at its boundary with the out- side), the system is called closed. Otherwise it is called open. Those equilibrium states that do not under- go any change when brought into contact with each other across an immovable and impermeable diathermal wall (called a ther- mal contact) form an equivalence class. This is sometimes called the Oth law of thermo- dynamics. The equivalence class, called the temperature of states belonging to it, is an intensive quantity. The force needed to keep a movable wall at rest, divided by the area of the wall, is called the pressure. It is another intensive quan- tity. For a (slow) change of the volume by an amount dV under a constant pressure P, me- chanical work of amount - P d V is done on 419 A Thermodynamics the system. Together with a possible change of the internal energy, say of amount dU, the amount (jQ=dU-PdV (1) of energy is somehow gained (if it is positive) or lost (if it is negative) by the system. This amount of energy is actually transported from or to a neighboring system through diathermal walls sO that the total energy for a bigger closed composite system is conserved. This is called the first law of thermodynamics, and (jQ is called the heat gain or loss by the system. If two states of different temperatures T 1 and T 2 are brought into thermal contact, energy is transferred from one, say II, to the other (called heat transfer). This defines a binary class relation denoted by T 1 > T 2 . The Clausius formulation of the second law of thermodynamics says that it is impossible to make a positive heat transfer from a state of lower temperature to another state of higher temperature without another change else- where. By considering a certain composite system, one reaches the conclusion that there exists a labeling of temperatures by posi- tive real numbers T, called the absolute tem- perature, for which the following is an exact differential: (jQ/T=(dU -PdV)/T=dS. (2) The integral S is an extensive quantity, called the entropy. Furthermore, the sum of the en- tropies of component simple systems in an isolated composite system is nondecreasing during any thermodynamic process, and the following entropy maximum principle holds: An isolated composite system reaches an equilibrium at those values of extensive param- eters that maximize the sum of the entropies of component simple systems (for constant total energy and volume and within the set of allowed states under a given constraint). A relation expressing the entropy of a given system as a function of the extensive param- eters (specifying equilibrium states) is known as the fundamental relation of the system. If it is given as a continuous and differentiable homogeneous function of V, N 1 , ..., N" and U and is monotone increasing in U for fixed V, N 1 ,..., N" then one can develop the thermo- dynamics of the system based on the above en- tropy maximum principle. A relation express- ing an intensive parameter as a function of some other independent variables is called an equation of state. Another postulate, which is much less fre- quently used, is the Nernst postulate or the third law of thermodynamics, which says that the entropy vanishes at the vanishing abso- lute temperature. 
419 B Thermodynamics B. Various Coefficients and Relationships The partial derivative D/Dx of a function f(x, y, '" ) with respect to the variable x with the variables y, ... fixed is denoted by (o!/ox)y. . We abbreviate N 1 , "" N, as N in the following. If the fundamental relation is written as U = U (V, N 1 , "', N" S) (instead of S being repre- sented as a function of the other quantities), then (2) implies (i3u/aS)V.N= T, (iJU/aV)N.S= -P. The other first-order partial derivatives of U are flj=(DU/DNj)v.N t . i .N,.S, with {Ii called the chemical potential (or elec- trochemical potential) of the jth component. If a system is surrounded by an adiabatic wall (i.e., the system is thermally isolated) and goes through a gradual reversible change (quasistatic adiabatic process), then the entropy has to stay constant. If a system is in thermal contact through a diathermal wall with a large system (called the beat bath) whose tempera- ture is assumed to remain unchanged during the thermal contact, then the temperature of the system itself remains constant (an iso- thermal process). The decrease of the volume per unit increase of pressure under the latter circumstance is called the isothermal compress- ibility and is given by Kf = - V- l (8V/iJP)J.N' Under constant pressure, the increase of the volume per unit increase of the temperature is called the coefficient of thermal expansion and is given by rx= V- 1 (oV/8T)p.N' Under constant pressure, the amount of (quasi- static) heat transfer into the system per mole required to produce a unit increase of tem- perature is called the specific heat at constant pressure and is given by C p = N - 1 T(DS/i!T)p.N' where N = N l +... + N,. The same quantity under constant volume is called the specific heat at constant volume and is given by C..= N- l T(8S/oT)v.N' The positivity of c 1 is equivalent to the convex- ity of energy as a function of entropy for fixed values of V and N. Because of the first-order homogeneity of an extensive quantity as a function of other ex- tensive vaciables, one can derive an Euler 1584 relation, such as U= TS- PV+ fll Nt +... + fl,N" for a simple system. Its differential form im- plies the following Gibbs-Duhem relation: SdT - VdP+ N l d{l l +... + N,d{I,=O. Because of the identity ( C: )( Df ) ( o )( Of ) (Ix cy = oy ex ' there arise relationships among second deriva- tives, known as the Maxwell relations: (i7T/iJV)S.N= -(ap/aS)V.N' (ev/iJS)P. N = (DT/( 1P h. N, (cs/iJ Vh.N = (iJP/(iT)V.N' (CS/8P)T.N= -(DV/cT)p.N' By computing the Jacobian of transformations of variables, further relations can be obtained. For example, c p = C v +N- l TVrx 2 /K T . C. Legendre Transform and Variational Principles The Legendre transform of a function !(x l' ..., Yb"') relative to the variables x is given by g(P1'''' 'Yl'''')= f - 'i>iPi j as a function of the variables Pi = (1/iJXj and y. The original variables x can b, recovered as -xj=og/Dpj' In terms of Legendre transforms, the en- tropy maximum principle can be reformulated in various forms: Energy minimum principle: For given values of the total entropy and volume, the equilib- rium is reached at those values of uncon- strained parameters that minimize the total energy. This principle is applicable in rever- sible processes where the total entropy stays constant. Helmholtz free energy minimum principle: For given values of the temperature (equal to that of a heat bath in thermal contact with the system) and the total volume, the equilibrium is reached at those values of the unconstrained parameters that minimize the total Helmholtz free energy, where the Helmholtz free energy for a simple system is defined as a function of T, V, N 1 , ...,N, by F=U- TS, dF= -SdT-PdV+fll dN l +fl,dNr' Enthalpy minimum principle: For given values of the pressure and the total entropy, 
1585 the equilibrium is reached at those values of unconstrained parameters that minimize the total enthalpy, where the enthalpy for a sim- ple system is defined as a function of S, P, N1,...,N, by H=U+PV, dH = TdS + VdP+/11 dN 1 + ... +/1rdN,. Gibbs free energy minimum principle: For constant temperature and pressure, the equi- librium is reached at those values of uncon- strained parameters that minimize the total Gibbs free energy, where the Gibbs free energy for a simple system is given as a function of T, P N. N \w . , . '1' ... , . 'r .... J G= U -TS+PV, dG= -S dT + V dP+ /11 dN 1 +... + /1rdN,. References [IJ M. W. Zemansky, Heat and thermo- dynamics, McGraw-Hili, 1951. [2J H. B. Callen, Thermodynamics, Wiley, 1960. [3J J. W. Gibbs, The collected works of J. Willard Gibbs, vol. 1, Thermodynamics, Yale Univ. Press, 1948. [4J M. Planck, Treatise on thermodynamics, Dover, 1945. (Original in German, seventh edition, 1922.) [5J E. A. Guggenheim, Thermodynamics, North-Holland,1949. [6J F. C. Andrews, Thermodynamics: Prin- ciples and applications, Wiley, 1971. 420 (XX.8) Three-Body Problem A. n-Body Problem and Classical Integrals In the n-body problem, we study the motions of n particles P;(Xi' Yi' zJ (i = 1,2, ..., n) with arbi- trary masses mil > 0) following tNewton's law of motion, d 2 W i au m i dt 2 = rw ' i = 1,2, . .. , n, 1 where Wi is anyone of Xi' Yi, or Zi, U = " k 2 m.m / r.. '-' I J lJ' i¥-j with k 2 the gravitation constant, an d rij= J (xi-xi +(Yi Yi +(zi-zl. Although the one-body and two-body prob- lems have been completely solved, the prob- 420C Three-Body Problem lem has not been solved for n> 2. The three- body problem is well known and is important both in celestial mechanics and in mathemat- ics. For n>3 the problem is called the many- body problem. The equations (I) have the so-called ten classical integrals, that is, the energy integral L;(m;/2)(xi)2 + (yJ2 + (iJ2)- U = constant (w = dw/dt), six integrals of the center of mass Li m i Wi = constant, Li m i Wi = (Li mi wi)t + con- stant, and three integrals of angular momen- tum Limi(uiWiWiuJ=constant (u#w). Using these integrals and eliminating the time t and the ascending node by applying Jacobi's method, the order of the equations (1) can be reduced to 6n -12. H. Bruns proved that alge- braic integrals cannot be found except for the classical integrals, and H. Poincare showed that there is no other single-valued integral (Bruns, Acta Math., 11 (1887); Poincare [2, I, ch. 5J). These results are called Poincare-Bruns theorems. Therefore we cannot hope to obtain general solutions for the equations (1) by tquadrature. General solutions for n;;:' 3 have not been discovered except for certain specific cases. B. Particular Solutions (I) Let r i be the position vector of the particle Pi with respect to the center of mass of the n- body system. A configuration r= {rj,..., r n } of the system is said to form a central figure (or central configuration) if the resultant force acting on each particle Pi is proportional to mir i , where each proportionality constant is independent of i. The proportionality con- stant is uniquely determined as - U /L71 mir,z by the configuration of the system. A con- figuration r is a central figure if and only if r is a tcritical point of the mapping r f-> U2(r)L71 mir,z [5,6]. A rotation of the sys- tem, in planar central figure, with appropriate angular velocity is a particular solution of the planar nobody problem. Particular solutions known for the three- body problem are the equilateral triangle solu- tion of Lagrange and the straight line solu- tion of Euler. They are the only solutions known for the case of arbitrary masses, and their configuration stays in the central figure throughout the motion. C. Domain of Existence of Solutions The solutions for the three-body problem are analytic, except for the collison case, i.e., the case where minrij=O, in a strip domain en- closing the real axis of the t-plane (Poincare, P. 
420D Three-Body Problem Painleve). K. F. Sundman proved that when two bodies collide at t = to, the solution is expressed as a power series in (t - t o )I/3 in a neighborhood of to, and the solution which is real on the real axis can be uniquely and analytically continued across t = to along the real axis. When all three particles collide, the total angular momentum f with respect to the center of mass must vanish (and the motion is planar) (Sundman's theorem); so under the assumption f #0, introducing s= r(U + I)dt as a new independent variable and taking it for granted that any binary collision is analyt- ically continued, we see that the solution of the three-body problem is analytic on a strip domain 11m s I < 6 containing the real axis of the s-plane. The conformal mapping w=(exp(ns/26) -1)/(exp(ns/26) + I) maps the strip domain onto the unit disk I wi < I, where the coordinates of the three particles W K , their mutual distances 'kh and the time t are all analytic functions of wand give a complete description of the motion for all real time (Sundman, Acta Math., 36 (1913); Siegel and Moser [7J). When a triple collision occurs at t = to, G. Bisconcini, Sundman, H. Block, and C. L. Siegel showed that as t->t o , (i) the configura- tion of the three particles approaches asym p- totically the Lagrange equilateral triangle configuration or the Euler straight line con- figuration, (ii) the collision of the three par- ticles takes place in definite directions, and (iii) in general the triple-coIlision solution cannot be analytically continued beyond t = to. D. Final Behavior of Solutions Suppose that the center of mass of the three- body system is at rest. The motion of the system was classified by J. Chazy into seven types according to the asymptotic behavior when t-> +GO, provided that the angular mo- mentum f of the system is different from zero. In terms of the torder of the three mutual dis- tances 'ij (for large t) these types are defined as follows: (i) H+: Hyperholic motion. 'ij  t. (ii) H P+: H yperholic-parabolic motion. , 13, '23t and '12t2/3. (iii) HE+: Hyperbolic-elliptic motion. '13, '23t and'12 <a (a = finite). (iv) p+: Parabolic motion. 'ij  t2/3. (v) PE+: Paraholic-elliptic motion. '13' '23t2/3 and'12 <a. (vi) L +: Lagrange-stahle motion or hounded motion. 'ij < a. (vii) OS+: Oscillating motion. lim H (1)sup'ij= GO, lim ,_c,," SUP'ij< GO. 1586 Define H -, HE - , etc. analogously but with t-> -GO. There are three classes for each of the motions HP, HE, and PE, depending on which of the three bodies separates from the other two bodies and recedes to infinity, denoted by HP i , HEi' PE, (i = 1,2,3), respectively. The energy constant h is positive for H- and HP- motion, zero for P-motion, and negative for PE-, L-, and OS-motion. For HE-motion, h may be positive, zero, or negative. We say that a partial capture takes place when the motion is H - for t -> -CI) and HEt for t-> +GO (for h>O), and a comillete capture when the motion is HE i - for t-> --GO and L + for t -> + GO (for h < 0). We say also that an exchange takes place when HE i - for t->-GO and HE/ for t -> + GO (t # j). The probability of complete capture in the domain h < 0 is zero (1. Chazy, G. A. Merman). E. Perturhation Theories The radius of convergence in the s-plane for Sundman's solution is too small and the con- vergence is too slow in the w-plane to make it possible to compute orbits of cekstial bodies, and for that purpose a perturbation method is usually adopted. When the masse:s m 2 , ..., m n are negligibly small compared with m 1 for the nobody problem, the motion of the nth body is derived as the solution of the two-body prob- lem for m 1 and m n by assuming m 2 = ... = m n - 1 = 0 as a first approximation. and then the deviations of the true orbit from the ellipse are derived as tperturbations. In the general theory of perturhations the deviations are derived theoretically by developing a disturb- ing function, whereas in the special theory of perturhations they are computed by numerical integration. In general perturbation theory, problems concerning convergence of the solu- tion are important, and it becomes necessary to simplify the disturbing function in deal- ing with the actual relations among celestial bodies. Specific techniques have to be devel- oped in order to compute perturbations for lunar motion, motions of characteristic aster- oids, and motions of satellites (e.g., the system of the Sun, Jupiter, and Jovian satellites). F. The Restricted Three-Body Problem Since the three-body problem is very difficult to handle mathematically, matht:matical inter- est has been concentrated on the restricted three-body problem (in particular, the planar problem) since Hill studied lunar theory in the 19th century. For the restricted three-body problem, the third body, of zero mass, cannot have any influence on the motion of the other 
1587 two bodies, which are of finite masses and which move uniformly on a circle around the center of mass. In the planar case, let us choose units so that the total mass, the angu- lar velocity of the two bodies about their center of mass, and the gravitation constant are all equal to 1, and let (qj, q2) be the coordi- nates of the third body with respect to a rotat- ing coordinate system chosen in such a way that the origin is at the center of mass and the two bodies of finite masses /1 and 1 -/1 are always fixed on the q I,axis. Then the equa- tions of motion for the third body are given by a Hamiltonian system: dqi oH dp; - - dt OPi' dt with ciH i=I,2, oq,' 1 2 2 H ="2(P1 + P2)+q2PI -q1P2  U(ql ,q2), 1-/1 /1 U - + J (q1 +/1)2+q J (q1 +/1-1)2+q The equations (2) have the energy integral H (p, q) = constant, called Jacobi's integral. Siegel showed that there is no other algebraic integral, and it can be proved by applying Poincare's theorem that there is no other single-valued integral. Regularization of the two singular points for the equations (2) and solutions passing through the singular points were studied by T. Levi-Civita, and solu- tions tending to infinity were studied by B. O. Koopman. After reducing the number of variables by means of the Jacobi integral, the equations (2) give rise to a flow in a 3-dimensional manifold of which the topological type was clarified by G. D. BirkhofT(Rend. Circ. Mat. Palermo, 39 (1915)). Since this flow has an tin variant mea- sure, the equations have been studied topo- logically, and important results for the re- stricted three-body problem, particularly on periodic solutions, have been obtained. G. Stability of Equilateral Triangular Solutions Suppose that the origin qi = Pi = 0 is an tequi- librium point for an autonomous Hamiltonian system with two degrees of freedom: dqi ciH dpi - - dt OPi' dt ciH Oqi' i=1,2, with the Hamiltonian H being analytic at the origin. When the teigenvalues of the corre- sponding linearized system are purely imagi- nary and distinct, denoted by ::I:A1, ::1:)'2' and )'1 k 1 + A 2 k 2 #0 for 0< Ik 1 1 + Ik 2 1 4 (where k i is an integer), we can find suitable coordinates 420 Ref. Three-Body Prohlem i' '1i so that the Hamiltonian H takes the form I H =)., '1 +A2'2 +2"(CII? + 2C 12 '1 '2 +c 22 m +Hs+... with (, = i'1i and real c ij . It is necessary that '1i=P;;i for the solutions to be real. In ad- dition, if the condition D=C 11 ). - 2C12A1A2 + c 22 Jci #0 (2) is satisfied, then the origin is a tstable equi- librium point of the original system (V. I. Arnol'd, J. Moser) [7]. For Lagrange equilateral triangular solu- tions of the planar restricted three-body prob- lem, the eigenvalues Jc of the linearized system derived from (2) are given as roots of the equation Jc4 + ),2 + (27/4)/1(1-/1) =0 and are purely imaginary if /1(1-/1)< 1/27. Applying the Arnol'd-Moser result, A..M. Leontovich and A. Deprit and Bartholome showed that the Lagrange equilibrium points are stable for /1 such that 0</1</10, where /10 is the smaller root of 27/1(1 -/1) = 1, excluding three values: /11' /12 at which AI k 1 + A 2 k 2 = 0 Ik 1 1 + Ik214 and /13 at which D=O. Arnol'd proved that if the masses m 2 , '" ,m n are negligibly small in comparison with m 1 , the motion of the nobody system is tquasi- periodic for the majority of initial conditions for which the eccentricities and inclinations of the osculating ellipses are small. References [IJ Y. Hagihara, Celestial mechanics. I, Dy- namical principles and transformation theory, MIT Press, 1970; II, Perturbation theory, pts. 1, 2, MIT 1972; III, Differential eq uations in celestial mechanics, pts. 1,2, Japan Society for the Promotion of Science, 1974; IV, Periodic and quasi-periodic solutions, pts. 1,2, Japan Society, 1975; V, Topology of the three-body problem, pts. 1,2, Japan Society, 1976. [2J H. Poincare, Les methodes nouvelles de la mecanique celeste I-III, Gauthier-Villars, 1892-1899. [3J E. T. Whittaker, A treatise on the analyt- ical dynamics of particles and rigid bodies, Cambridge Univ. Press, fourth edition, 1937. [4J H. Happel, Das Dreikorperproblem, Koehler, 1941. [5J A. Wintner, The analytical foundations of celestial mechanics, Princeton Univ. Press, 1947. [6J R. Abraham and J. E. Marsden, Founda- tions of mechanics, Benjamin, second edition, 1978. [7J C. L. Siegel and J. Moser, Lectures on celestial mechanics, Springer, 1971. 
421 A Time Series Analysis [8] V. M. Alekseev (Alexeyev), Sur I'allure finale de mouvement dans Ie probleme des trois corps, Actes Congres Intern. Math., 1970, vol. 2, 893-907. [9] V. Szebehely, Theory of orbits, Academic Press, 1967. 421 (XVII1.11) Time Series Analysis A. Time Series A time series is a sequence of observations ordered in time. Here we assume that mea- surements are quantitative and the times of measurements are equally spaced. We consider this sequence to be a realization of a stochastic process X, ( 407 Stochastic Processes). Usu- ally time series analysis means a statistical analysis based on samples drawn from a sta- tionary process ( 395 Stationary Processes) or a related process. In what follows we denote the sample by X=(Xj,X 2 , ...,X-r)'. B. Statistical Inference of the Autocorrelation Let us assume X, (t an integer) to be real- valued and weakly stationary ( 395 Station- ary Processes) and for simplicity EX, = 0 and consider the estimation of the autocorrelation Ph=RhiRo of time lag h, where R,=EX,X,+,. We denote the sample autocovariance of time lagh as _ 1 '-Ihl R -- '\' X X h- T-Ihl, ' '+Ihl' and define the serial correlation coefficient of time lag h by Ph = Rhl Ro. It can be shown that the joint distribution of {ft(Ph - Ph) 1 1 hH} tends to an H-dimensional tnor- mal (Gaussian) distribution with mean vec- tor 0, if one assumes that X, is expressed as X,=L-"A'-j, where L-oc Ib) < +00, L};c - Ijl1!2bf < +00, and the, are indepen- dently and identically distributed random vari- ables with E,=O and Ei< +00. When X, is an autoregressive process of order K ( Section D) and also a tGaussian process, it can be shown that the asymptotic distribution of {ft(Ph - Ph) 11  h  K} as T--> 00 is equal to the asymptotic distribution of {ft(Ph-Ph)!1 hK}, where Ph is the tmaxi- mum likelihood estimator of Ph' In general, it is difficult to obtain the maximum likeli- hood estimator of Ph' The statistical properties of other estimators of Ph, e.g., an estimator constructed by using sgn(X,) (sgn(y) means 1588 1 (y>O), 0 (y=O), -1 (y<O)) have also been investigated. Testing hypotheses concerning autocorre- lation can be carried out by using the above results. Let us now consider the problem of testing the hypothesis that X, is a twhite noise. Assume that X, is a Gaussian process and that a white noise with EX,2 = (12 exists, and define C h = L/dX, - X)(Xt+h - X) and ;h = ChiCo for h;;:. 0, where X T+ j = X j and X = Li1 X,IT. Then the probability density func:tion of }II can be obtained and it can be shown that m 1 P(Y1 >y)= L()'j_y)(T-3)!2_, 'm+1 Y)'m' j=l Aj where )'j = cos 27rj1T and (T-')!2 Aj= n (I'j-icd, T=3,5,..., k=' (k#j) Tfi-:;1 Aj = 11 ()r Ak)' k=1 (k# j) T=4, 6,..., 1  m(T - 3)/2 if T is odd, lmTI2-1 if Tis even. This can be used to obtain a test of significance. C. Statistical Inference of the Sptctrum To find the periodicities of a real-valued tweakly stationary process X, with mean 0, the statistic, called the periodogram, 1 1 T 1 2 1 1 (ic)=  L X,e- 2ni " ft'=1 is used. If X, is expressed as L X, = L {m, cos 27rA,t + m; sin 27ric,i' } + Y" '=1 where {miL {m;}, and {Y,} are mutually inde- pendent random variables with Em, = Em; = 0 and V(m,) = V(ml) = (1,2 and {y,} is independent and identically distributed with means 0 and finite variances (12, the distribution of 1T(ic) converges to a distribution with finite mean and finite variance at ). =I ::!:: )., fOt 1  I  L when T tends to infinity. On the other hand, the magnitude of 1 T (),) is of the order of T at ic = ::!:: ic[, 1  I  L. This means th,lt we can find the periodicities of X, by using 1-,r(ic). When X, = Y" we find that the distribution of 21 T(A)I (12 (when A=lO, ::!:: 1/2) or 1 T (A)I(1; (when ).=0 or ::!:: 1/2) tends to the t X 2 distribution with degrees of freedom 2 Or 1, respectively, and 1(I1r), 1(112)' ...,!(I1M) are asymptotically in- dependent random variables for 0  11111 < 11121 < ... < 111M 1  1/2 when T --> CJe. Applying this result, we can test for periods in the data. 
1589 Let f(A) be the spectral density function of a real-valued weakly stationary process X,. In general, the variance of IL?1 X,e- 2nitA jfti does not tend to 0 as T tends to infinity; hence /I(A) cannot be used as a good estimator for the spectral density. To obtain an estimate of f().), several estimators defined by using weight functions have been proposed by several authors. Let W T ().) be a weight function de- fined on ( - 00, (0), and construct a statistic - S 12 - f(}.) = .! 112/ T(/1) W T (}, -/1) d/1. Let us use f(/1) for the estimation of f(A). WT(A) is calIed a window. An important class of WT(A) is as fol- lows. Let W(.!c) be continuous, W(A)= W( -A), W(O) = 1, I W(.!c) I < 1, and Soc W(A)2 d.!c< +00, and let H be a positive integer depend- ing on T such that H -> 00 and H jT ->0 as T -> 00. Putting Wj = W (j/ H), we define WI (Ie) by WT().)=LJ::T+l wje-hijA. Ten I(A) can be expressed as I(A) = L:T+1 Rh whe-2nihA, where Rh = L?lh X'+hX,jT for h;;, 0 and '" T R h = L'=lhl+l X'+hX,jT for h <0. Let X, be stationary to the fourth order ( 395 Station- ary Processes) and satisfy 'x; L IRhl < +00, h-oo J) L ICo.h.l.pl<+oo, h,l,p= -oc where C o . h . I . P is the fourth-order Joint tcumu- lant of X" X'+h, X'+b and X,+p' Then we have T f ey;, lim - V(](O)) = 2f(0) 2 W(A)2dA, l-ocH -CX) T - f oo lim - V(f(::!:: Ij2))=2f(1j2)2 W(},)2d)., T-oc H -00 T - f oo lim - V(f(A)) = f().)2 W()/ dA, T-oc H -oc )40, ::!:: Ij2, T _ _ lim -Cov({(Je), f(JI)) =0, )4/1. (1) 1'-", H {Wh} or WI ().) should have an optimality, e.g., to minimize the mean square error of I().). But. generalIy, it is difficult to obtain such a {Wh} or WT(A). Several authors have proposed specific types of windows. The following are some examples: (i) (Bartlett) W h = (1 -lhljH) for Ihl:::; Hand Wh=O for Ihl >H; (ii) (Tukey) W h = L -00 alcos(n:lhjH) for Ihl:::;H and Wh=O for I hi> H, where the a l are constants such that L;';-oolall < +00, L-ooal= 1 and al=a- l . The Hanning and Hamming windows are a o =0.50, a 1 =a_ 1 =0.25, and al=O for 111;;,2 and a o =0.54, a[ =a_ 1 =0.23, and al=O for 111;;'2, respectively [2]. Let X, = Lf= -ex; bjF., _ j' where L f= - 00 I b j I < + 00 and the F., are independently and identicalIy distributed random variables 421 D Time Series Analysis with EF.,=O and EF.;< +00. Let {A j ll :::;j:::;M} be arbitrary real numbers such that 0:::; )'1 < )'2 <... < AM:::; 1j2, where M is an arbitrary posi- tive integer. Then the joint distribution of { jTjH (](Av)-E{(AvJ) I 1 :::; v:::; M} tends to the normal distribution with means 0 and covar- iance matrix E, which is defined by (1). Let us assume, furthermore, that Iimx_o(1- w(x))jlxl q = C and L -oolhlPJRhl < +00, where C, q, and p are some positive constants satisfying the folIowing conditions: (i) when p;;' q, HqjT->O (p;;' 1) and Hq+1- p jT ->0 (p;;' 1) as T -> 00 and lim T _,,) TjH 2 q+1 is finite; (ii) when p < q, HPjT ->0 (p;;' 1) and HjT ->0 (p:::; 1) as T ->00 and lim T _ oc TjH 2 p+1 =0. Then jTjH Cf(A,.)- EI().v)) in the results above can be replaced by jTjH ([(A v )- f(},v))' Estimation of higher-order spectra, partic- ularly the bispectrum. has also been discussed. Let X, be a weakly stationary process with mean 0, and let its spectral decomposition be given by X,= Slfi/2e2ni'AdZ().) (395 Station- ary Processes). We assume that X, is a weakly stationary process of degree 3 and put Rh,.h, = EX,X'+h, X'+h, for any integers hI and h 2 . Then we have f 1 1 2 f 11 2 R = e 2ni (h,A, +h,A,) dF ( A ). ) . h 1 ,h 2 P 2 [/2 -11 2 Symbolically, dF()'I' )'2) = EdZ(ArJ dZ(A 2 ) dZ( - )'1 - )'2)' If F(.!c 1 , .!c2) is absolutely contin- uous with respect to the Lebesgue measure of R 2 and OF(}'l' A2)joA l oA2 = ((}'1,)'2)' we calI f(.!c 1 , A2) the bispectral density function. When X, is Gaussian, Rh,.h z = 0 and .f(A1' A2) = 0 for any hj, h 2 and any AI' A 2 . fU'I' .!c2) can be considered to give a kind of measure of the departure from a Gaussian process or a kind of nonlinear relationship among waves of different frequencies. We can construct an estimator for f(A1 ,).2) by using windows as in the estimation of a spectral density [3]. D. Statistical Analysis of Parametric Models When we assume merely that X, is a stationary process and nothing further, then X, contains infinite-dimensional unknown parameters. In this case, it may be difficult to develop a satis- factory general theory for statistical infe'rence about X,. But in most practical applications of time series analysis, we can safely assume at least some of the time dependences to be known. For this reason, we can often use a model with finite-dimensional parameters. This means, mainly, that the moments (usually, second-order moments) or the spectral density are assumed to be expressible in terms of finite- dimensional parameters. As examples of such 
421 D Time Series Analysis models, autoregressive models, moving average models, and autoregressive moving average models are widely used. A process X, is called an autoregressive process of order K if X, satisfies a difference equation 'Lt=o akX,-k = " where the ak are constants, a o = I, a K "" 0, and the , are mutu- ally uncorrelated with E, = 0 and V( ,) = az > O. We usually assume that X, is a weakly sta- tionary process with EX, = O. We sometimes use the notation AR(K) to express a weakly stationary and autoregressive process of order K. Let {,} be as above. If X, is expressed as X, = 'Lt=o b , '-I' where the b l are constants, b o = 1 and b L "" 0, X, is called a moving average pro- cess of order L (MA(L) process). Furthermore, if X, is weakly stationary with EX,=O and expressed as 'LfoakX'-k='Lt=ob/r-l with a o = 1, b o = 1, and aKbL""O, then X, is called an autoregressive moving average process of order (K, L) (ARMA(K, L) process). Let A(Z) and B(Z) be two polyhomials of Z such that A(Z) = 'Lf=o akZK-k and B(Z) = 'Lt-o bIZ L - I , and let {IX k ll :(;k:(;K} and {Ptl1 :(;1:(;L} be the solu- tions of the associated polynomial equations A(Z)=O and B(Z)=O, respectively, we as- sume that IIXd < 1 for l:(;k:(; K and IPII < 1 for 1:(; 1:(; L. This condition implies that X, is purely nondeterministic. Let the observed sample be {X, 11:(; t:(; T}. If we assume that X, is Gaussian and an ARMA(K, L) process, we can show that the tmaximum likelihood es- timators {ad and {b l } of {a k } and {b l } are tconsistent and asymptotically efficient when T -400 ("asymptotically efficient" means that the covariance matrix of the distribution of the estimators is asymptotically equal to the inverse of the information matrix) [5J ( 399 Statistical Estimation D). Furthermore, if X, is an AR(K) process, the joint distribu- tion of {j7\ak- ad 11 :(; k :(;K} tends to a K- dimensional normal distribution with means 0, and this distribution is the same as the one to which the distribution of the t1east-square estimators {ad minimizing Q = 'Ll:K+rfX, + 'Lf=1 a k X,_d 2 tends when T -400. If X, is a MA(L) or ARMA(K,L) process (L 1), the likelihood equations are complicated and cannot be solved directly. Many approxi- mation methods have been proposed to ob- tain the estimates. When X, is an AR(K) process with I IXkl < I for 1:(; k:(; K, RI satisfies 'Lf=o akRh-k = 0 for h  1. These are often called the Y ule- Walker equations. Rh can be expressed as Rh = 'Lf=l Cpjh if the IXk are distinct and a K "" 0, where {C j } are constants and determined by Rh for 0:(; h:(; K - 1. When X, is an ARMA(K,L) process, 'LtoakRh-k=O for hL+ 1, and the C j of Rh='Ll Cpjh are determmed by {Rh I O:(;h :(;max(K, L)}. 1590 The spectral density is expressed as f(.!c) = az I B(e2niA)12/1 A (e 2niJ W. If X, is Gaussian, the maximum likelihood estimator of f(.!c) is asymptotically equal to the statistic obtained by replacing az, {bl}' and {ak} in f(.!c) with IlZ, {b;}, and {ak}, respectively, where III is the maximum likelihood estimator of az, when T -400. When we analyze a time series and intend to fit an ARMA(K, L) model, we have to determine the values of K and L. For AR(K) models, many methods have been proposed to determine the value of K. Some examples are: (i) (Quenouille) Let (ZK A(1/Z))2 = 'LJ!o AjZl, and G K = 'Lj=o AJfRJi Ro), where Aj is obtained by replacing {a k } in Aj by {a k }, and we con- struct the statistic XI = 'L{=1 G K + I . Then XJ has a tl distribution asymptotically with f degrees of freedom under the assumption that K  Ko, where Ko is the true order, as T --j 00. Using this fact, we can determine the order of an AR model. (ii) (Akaike) We consider choosing an order K satisfying KL:(;K :(;K M , where K L and KM are minimum order and maximum order, respectively, specified a priori. Then we con- struct the statistic AIC(K)=(T-K)logaZ(K) +2K, where T az(K)= I (X,+d 1 X'-1 +... +t1 K X'_K)2/T '=K+1 and {d k 11 :(; k:(; K} are the least square esti- mators of the autoregressive coefficients of an AR(K) model fitting X,. Calculate AIC(K) for K = KL' K L + 1,..., K M . If AIC(K) has the minimum value at K = K, we determine the order to be K [6J (403 Statistical Models F). Parzen proposed another method by using the criterion autoregressive transfer function (CAT). Here CAT(K)= l-a 2 (00)/a[(K)+K/T, where aZ(K)=(T/(T-K))6Z(K) and a 2 (00) is an estimator of a 2 ( (0) = exp(SI/210gf(.!c)d.!c) [7]. (iii) We can construct a test statistic for the null hypothesis AR(K) against the alterna- tive hypothesis AR(K + 1) (Jenkins) or use a multiple decision procedure (1'. W. Anderson [8J). Not much is known about the statistical properties of the above methods, and few comparisons have been made among them. Another parametric model is an exponential model for the spectrum. The spectral density is expressed by f(.!c) = C 2 exp{2'Lf=1 Okcos(2n:kA)}, where the (}k and C are constants. We now discuss some general theories of estimation for finite-dimensional- parameter models. Let X, be a real-valued Gaussian process of mean 0 and of spectral density f(},) which is continous and positive in [ -1/2, 1/2J, and let the moving average representation of X, be X, = 'L;x;,o b/'_I' where , is a white noise and al=Ee. We assume thatf(,)/al=g(.!c) 
1591 depends only on M parameters B = (0 1 , IJ 2 , " ,OM)' which are independent of CJl. Then the logarithm of the tlikelihood function can be approximated by - (1 /2){ Tlog 2nCJl + X'T'(B)X/CJn by ignoring the lower-order terms in T, where CJl 1'(B) is the covariance matrix of X. Usual1y, it is difficult to find an explicit expression for each element of T1 (B). Another approximation for the logarithm of the likelihood function is given by T f 112 [ I ()) ] -- 10 gf (A)+ f 1' " dJ... 2 -11 2 (Ie) Under mild conditions on the regularity of g(A), the estimators 0 = (e 1 , ° 2 , ... , eM) and 8Z, obtained as the solutions of the likelihood equations, are tconsistent and asymptotical1y normal as T tends to infinity. This means that the distribution of ft(8Z - CJZ) is asymptoti- cal1y normal and ft(81-CJl} and ft(O-B) are asymptotical1y independent. The asympto- tic distribution of ft(O- fI) is the normal dis- tribution N(O, f-'), where the (k, I)-component r kl of r is given by _I f 112 ( aIOgg(J,). DI0 g9 (A) ) --  2 -11 2 aO k aO I 9 E. Statistical Analysis of Multiple Time Series Let X,=(X,(I),XI 2 ), ...,X!P»)' be a complex- valued weakly stationary process with EX, = 0 and EX,X = R,_,. R,-s is the p x p matrix whose (k, I)-component is R) = EX!k) XI). We discuss the case when t is an integer. Rh has the spectral representation f 112 R h = e2nihAdF(A), -11 2 where F(A) is a p x p matrix and F(J,rJ - F(A2)' J" ;:;,).2, is Hermitian nonnegative. Let fk'/(A) be the (k, I)-component of the spectral density matrix f(A), i.e., Faf}.) = S'-IIz!(/l)d/l, of the absolutely continuous part in the Lebesgue decomposition of F(A). The function fk"(J,) for k i= I is called the cross spectral density function. {k'/(A) represents a kind of correlation between the wave of frequency A included in Xl k ) and the one included in XII). Let X,=(XP), XI 2 ),... ,X!P»)' and Y,= P;(I), Y,(2), "', Y,(q»)' be two complex-valued weakly stationary processes with EX, = 0, EY,=O, EX,X= R_" EY,V;= R7-s and EX,V;=R_Y,. We assume Y,=-c" AsX,_" where As is a q x p matrix whose components are constants depending on s. Put A(A) =  _ex, Ase-2nisA. A(A) should exist in the sense of mean square convergence with respect to the spectral distribution function F for X,. 421 F Time Series Analysis The function A(J,) is called the matrix fre- quency response function. As a measure of the strength of association between X,(k) and XII) at frequency A, we intro- duce the quantity yk'/(J,) = I {k.l(AW /{k.k(A)t./().). yk.I(J,) is called the coherence. Let Xl k ) = ;'" -CXJ a;'/ X\Is +17" where 17, is a weakly sta- tionary process with mean 0 and uncorrelated with XI), -00 < s < 00. If EI17,12 = 0, l'/(A) = I. If EIex,a;'/XsI2=0, l'/(A)=O. Generally, we have O:::;yk'/(A):::; 1. For the estimation of F(A), A(),), and yk'/(A), the theories have been similar to those for the estimation of the spectral density of a scalar time series. For example, an estimator of f(A) is given [11] in the form 1'-1 f(A)= I Rhwhe-2nihA, h -(1'-1) where _ 1'-lhl R h = I X'+lhIX;/T (::::1 and the W h are the same as in Section C. We can define an autoregressive, moving average, or autoregressive moving average process in a similar way as for a scalar time series. The a k and hi in Section D should be replaced by p x p matrices and the associated polynomial equations A(Z)=O and 8(Z)=0 should be understood in the vector sense [II]. There are problems with determining the coeffi- cients uniquely or identifying an ARMA(K, L) model, and these problems have been dis- cussed to some extent. F. Statistical Inference of the Mean Function Let X, be expressed as X, = m, + Y" where m, is a real-valued deterministic function of t and Y, is a real-valued weakly stationary process with mean 0 and spectral distribution function F(A). This means that EX,=m,. We consider the case when m,= 1 C/Pj), where C=(C 1 , C 2 , ..., CM)' is a vector of unknown coefficients and q.>,=(cpl),cp2), ...,cpM»)' is a set ofknown (regression) functions. Let us construct tlinear unbiased estimators {C j = J1 yj'X, 11 :::;j:::; M} for the coefficients C j , where the yj' are known constants. Put <1> = (q.>I' q.>2' ..., q.>1')" Then the t1east squares estimator of C is given by C = (<1>'<1>(I<1>'X when <1>'<1> is nonsingular. Let E be the covar- iance matrix of X. Then the tbest linear un- biased estimator is C* = (<1>'E- 1 <1»-1 <1>'E- 1 X. We put Ilcp(j)II=idcpF»)2 and assume that lim 1' _", IIcp(j)II= 00, lim 1' _ ro IlcpUJII+h/llcp(j)II = 1 for 1 :::;j.:::; M and any fixed h and assume the existence of i/!hj.k)=lim 1' _ ro i1 CP1hCPk)/ II cpUJ II l' II cp(k) II l' for 1 :::;j, k:::; M. We also assume 
421 G Time Series Analysis that F(A) is absolutely continuous and F'(Je) = {(i,) is positive and piecewise continuous. Let i/Jh be the M x M matrix whose (j, k)- component is i/J1 j . k ). Then i/Jh can be repre- sented by I '/l i/Jh = e 2nih ). dG().), -1/2 where G(Je) - G(f1) is a nonnegative definite matrix for Je>f1. Assume that i/Jo=G(I/2) - G( -1/2) is nonsingular and put H(A) = i/Jo-l/2G(A)i/Jo- 1/2, and for any set S, H(S)= Is H(d).). Suppose further that Sj, S2' "', Sq are q sets such that H(S) >0, "2:J, H(S) = I, H(S)H(Sd=O,j#-k, and for any j there is no subset S;cS j such that H(S;) > 0, H(Sj-Sj) >0 and H(Sj)H(Sj-S;)=O. We have q M. It can be shown that the spectrum of the re- gression can be decomposed into such disjoint sets SI' ..., Sq. Then we can show that C is asymptotically efficient in the sense that the asymptotic covariance matrix of C is equiva- lent to that of C* if and only if f().) is constant on each of the elements Sj. Especially, if i/Jlj) = tje2nit!lj, C is asymptotically efficient. G, Nonstationary Models It is difficult to develop a statistical theory for a general class of nonstationary time series, but some special types of nonstationary pro- cesses have been investigated more or less in detail. Let X, (t an integer) be a real-valued sto- chastic process and V be the backward dif- ference operator defined by V X, = X, - X'-1 and V d X, = V(V d - 1 X,) for d 2. We assume that X, is defined for t  to (to a finite integer), and EX,2 < +00. For analyzing a nonstation- ary time series, Box and Jenkins introduced the following model: For a positive integer d, Y, = V d X" t  to +d, is stationary and is an autoregressive moving average process of order (K, L) for t  to +d +max(K, L). They called such an X, an autoregressive integrated moving average process of order (K, d, L) and denoted it by ARIMA(K, d, L). The word "integrated" means a kind of summation; III fact, X, can be expressed as a sum of the weakly stationary process Y" i.e., X, = Xo + (V Xo)t + (V 2 X O {tl ,J + ... +(Vd-l X o ) ( I_ ... t ) '\d_,-l s,-l , 'd +I I sd=l Sd 1=1 '2 I ¥', sl=l when t o = -d + 1. Using this model, methods of forecasting and of model identification and estimation can be discussed [13]. 1592 Another nonstationary model is based on the concept of evolutionary spectra [14]. In this approach, spectral distribution functions are taken to be time-dependent. Let X, be a complex-valued stochastic process (t an integer) with EX,=O and R,.,=EX,X,. In the following, we write simply I for Ilji/2' We now restrict our attention to the class of X, for which there exist functions {U,(A)} defined on [-1/2,1/2] such that R,., can be expressed as R,., = S u,(X)uAJe)df1(),), where f1(Je) is a measure. u,(Je) should satisfy Ilu,(JeW df1().) < +00. Then X, admits a representation of the form X, = I u,(A)dZ(Je), where Z(A) is a process with orthogonal increments and EldZ(JeW =df1(Je). If u,(Je) is expressed as u,(Je) = y,().)e 2niO (A)' and Y,(A) is of the form y,(Je) = I e 2ni 'w df A(W) with IdfA(w)1 having the absolute maximum at w= 0, we call U,(A) an oscillatory function and X, an oscillatory process. The evolutionary power spectrum dF,(Je) is defined by dF,(A) = ly,(AW df1().). Other models, such as an autoregressive model whose coefficients vary with time or whose associated polynomial has, roots outside the unit circle, have also been discussed. References [1] E. J. Hannan, Time series analysis, Methuen, 1960. [2] R. B. Blackman and 1. W. Tukey, The measurement of power spectra from the point of view of communications engineering, Dover, 1959. [3] D. R. Brillinger, Time series: Data analysis and theory, Holt, Rinehart and Winston, 1975. [4] M. S. Bartlett, An introduction to stochas- tic processes with special reference to methods and applications, Cambridge Univ. Press, second edition, 1966. [5] P. Whittle, Estimation and information in stationary time series, Ark. Mat. 2 (1953), 423-434. [6] H. Akaike, Information theory and an extension of the maximum likelihood prin- ciple, 2nd Int. Symp. Information Theory, B. N. Petrov and F. Csaki (eds.), Akademiai Kiad6, 1973,267-281. [7] E. Parzen, Some recent advances in time series modeling, IEEE Trans. Automatic Con- trol, AC-19 (1974), 723-730. [8] T. W. Anderson, The statistical analysis of time series, Wiley, 1971. [9] P Whittle, Hypothesis testing in time series analysis, Almqvist and Wiksell, 1951. [10] H. B. Mann and A. Wald, On the statis- tical treatment of linear stochastic difference equations, Econometrica, 11 (1943), 173-220. 
1593 [llJ E. J. Hannan, Multiple time series, Wiley, 1970. [12J U. Grenander and M. Rosenblatt, Statis- tical analysis of stationary time series, Wiley, 1957. [13J G. E. P. Box and G. M. Jenkins, Time series analysis: Forecasting and control, Holden-Day, revised edition, 1976. [14J M. B. Priestley, Evolutionary spectra and nonstationary processes, 1. Roy. Statist. Soc., ser. B, 27 (1965), 204-237. 422 (IV.7) Topological Abelian Groups A. Introduction A commutative topological group is called a topological Abelian group. Throughout this article, except in Section L, all topological groups under consideration are locally com- pact Hausdorff topological Abelian groups and are simply called groups ( 423 Topolog- ical Groups). B. Characters A character of a group is a continuous func- tion X(x) (XEG) that takes on as values com- plex numbers of absolute value 1 and satis- fies X(xy) = X(x)X(y). Equivalently, X is a 1- dimensional and therefore an irreducible tunitary representation of G. Conversely any irreducible unitary representation of G is l-dimensional. Indeed, for a topological Abelian group, the set of its characters coin- cides with the set of its irreducible unitary representations. If the product of two char- acters X, X' is defined by XX'(x) = X(x)X'(x), then the set of all characters forms the character group C(G) of G. With tcompact-open topo- logy, C( G) itself becomes a locally compact topological Abelian group. C. The Duality Theorem For a fixed element x of G, X(x) (XE C(G)) is a character of C(G), namely, an element of CC(G). Denote this character of C(G) by x(X), and consider the correspondence G3X-->X(X). That this correspondence is one-to-one follows from the fact that any locally compact G has tsufficiently many irreducible unitary repre- sentations ( 437 Unitary Representations) and the fact that if G is an Abelian group, then any irreducible unitary representation of G is a character of G. Furthermore, any character 422 E Topological Abelian Groups of C(G) is given as one of the x(X); indeed, by this correspondence, we have G  CC(G) (Pontryagin's duality theorem). By the duality theorem, each of G and C(G) is isomorphic to the character group of the other. In this sense, G and C(G) are said to be dual to each other. D. Correspondence between Subgroups Let G, G' = C(G) be groups that are dual to each other. Given a closed subgroup 9 of G, the set of all X' such that x'(x) = 1 for all x in 9 forms a closed subgroup of G', usually denoted by (G', 9). The definition of (G,g') is similar. Then g-(G',g)=g' gives a one-to-one corre- spondence between the closed subgroups of G and those of G'. If 91 =0 92' then 91/92 and (G', g2)/(G', YI) are dual to each other. If the group operations of G, G' are written in addi- tive form, with 0 for the identity, then x(X') = 1 is written as x(X')=O. In this sense, (G',g) is called the annihilator (or annulator) of y. E. The Structure Theorem Let 'll be the set of all groups (more precisely, of all locally compact Hausdorff topological Abelian groups). If G 1 , G 2 E'll, then the direct product G 1 x G 2 E'll, and if GE'll and H is a closed subgroup of G, then HE III and G/ HE'll. In addition, if H is a closed subgroup of a group G such that HE III and G/ HE Ill, then G E 'll. In other words, III is closed under the opera tions of forming direct products, closed subgroups, quotient groups, and textensions by members of'll. Furthermore, the operation C that assigns to each element of'll its dual element is a reflexive correspondence of III onto'll, and if G =0 H, the annihilator (C(G), H) of H is a closed subgroup of C(G). Also, C(G/H)(C(G), H), C(H)C(G)/(C(G), H). Furthermore, C(G 1 x G2) C(G 1 ) X C(G 2 ). Finally, H = (G,(C(G), H)) (reciprocity of annihilators). Typical examples of groups in 'll are the additive group R of real numbers, the additive group Z of rational integers, the l-dimensional ttorus group T = R/Z, and finite Abelian groups F. The torus group T is also isomor- phic to the multiplicative group U( 1) of com- plex numbers of absolute value 1. The direct product R n of n copies of R is the vector group of dimension n, and the direct product T" of n copies of T is the torus (or torus group) of dimension n (or n-torus). Both T" and F are compact, while R n and Z" are not. We have C(R)=R, C(T)=Z, C(Z)=T. Any finite Abelian group F is isomorphic to its character group C(F). The direct product of a finite 
422 F Topological Abelian Groups number of copies of R, T, Z, and a finite Abelian group F, namely, a group of the form R 1 X Tm x zn X F, is called an elementary topo- logical Abelian group. Any group in ( is isomorphic to the direct product of a vector group of some dimension and the extension of a compact group by a discrete group (the structure theorem). Hence, if the effect of the operation C is explicitly known, then the problem of finding the struc- ture of groups in ( is reduced to the pro- blem concerning discrete groups alone. For the structure of groups in (, the following theorem is known: If G Ell! is generated by a compact neighborhood of the identity e, then G is isomorphic to the direct product of a compact subgroup K and a group of the form R" x zm (n, In are nonnegative integers). Then any compact subgroup of G is contained in K, which is the unique maximal compact sub- group of G. A group G E ( generated by a compact neighborhood of e is the tprojec- tive limit of elementary topological Abelian groups. L. S. Pontryagin first proved a struc- ture theorem of this type and then the duality theorem. F. Compact Elements An element a of a group GE( is called a com- pact element if the cyclic group {an I nE Z} gen- erated by a is contained in a compact subset of G The set Co of all compact elements of G is a closed subgroup of G, and the quotient group G/C o does not contain any compact element other than the identity In particular, if G is generated by a compact neighborhood of the identity, then Co coincides with the maxima] compact subgroup K of G. Let Co be the set of a]] compact elements of a group GEI. The annihilator (C(G), Co) is a con- nected component of the character group C(G) of G If G is a discrete group, then a compact element of G is an element of G of finite order. G. Compact Groups and Discrete Groups Suppose that two groups G, X EIl( are dual to each other. Then one group is compact if and only if the other group is discrete. By the du- ality theorem, the properties of a compact Abelian group G can be stated, in principle, through the properties of the discrete Abelian group C(G). The following are a few such examples. Let G be a compact Abelian group Then its tdimension is equal to the trank of the discrete Abelian group C(G). A subgroup Yof a discrete AbelIan group X is called a divisible 1594 subgroup if the quotient group X/Y contains no element of finite order other than the iden- tity. A compact Abelian group G is locally connected if and only if any finite subset of the character group C(G) is contained in some divisible subgroup of C(G) generated by a finite number of elements. Hence if a compact locally connected Abelian group G has an topen basis consisting of a countable number of open sets, then G is of the form T" x F, where F is a finite Abelian group and T" is the direct product of an at most countable number of I-dimensional torus groups T. H. Dual Decomposition into Direct Products Let G be a compact or discrete Abelian group, and let 9R = {H, I IX E A} be a family of closed subgroups of G. Let ,.-\.(9]1)= n'd H" and denote by E(9J1) the smallest closed sub- group of G containing U'EA H,. Then, with Q{(C(G),H,)IIXEA}, the relations "-\'(Q) = (C(G),E(9J1)) and E(Q)=(C(G), "-\'(m)) hold. Furthermore, suppose that G is decomposed into the direct product G = n'EA H" and for each IXE A put K,=E(9J1- {H,}), X, = (C(G), K,). Then X, is the character group of H" and C(G) can be decomposed IOtO the direct product C(G) = n'd X,. This decompo- sition of C(G) into a direct product is cal1ed the dual direct product decompositiion corre- sponding to the decomposition G.= n'EA H,. I. Orthogonal Group Pairs Suppose that for two groups G, G' there exists a mapping (x, x')-->xx' of the Cartesian prod- uct G x G' into the set U(I) of all complex numbers of abs01ute value 1 such that (x, x 2 )x' (Xl x')(x 2 x'), x(x; x;)=(xx; )(xx z ). Then G, G' are said to form a group pair. Sup- pose that G, G' form a group pair, and con- sider xx' to be a function x(x') in x'. If two functions xdx') and x 2 (x') coincide only when Xl =x 2 and the same is true when the roles of G and G' are interchanged, then G, G' are said to form an orthogonal group pair. If G is a compact Abelian group, G' is a discrete Abelian group, and G, G' form an orthogonal group pair, then G, G' are dual to ,ach other J. Commutative Lie Groups An elementary topological Abelial1 group R 1 X Tm x zn X F is a commutative tLie group. Conversely, any commutative Lie group G 
1595 generated by a compact neighborhood of the identity is isomorphic to an elementary topo- logical Abelian group. In particular, any con- nected commutative Lie group G is isomor- phic to R' X Tm for some I and m. A closed subgroup H of the vector group R" of dimen- sion n is isomorphic to RP x zq (Op+qn). More precisely, there exists a basis a 1 ,..., a" of the vector group R n such that H = U;fl Xia, + jp+1njajIXiER,njEZ}. Hence the quo- tient groups of R" that are tseparated topolog- ical groups are all isomorphic to groups of the form R ' X Tm (0  I + m  n). Any closed sub- group of the torus group Tn of dimension n is isomorphic to a group of the form T P x F (Opn), where F is a finite Abelian group. Hence the quotient groups of Tn that are separated topological groups are all isomor- phic to Tm (0  m  n). A tregular linear trans- formation of the linear space Rn is a continu- ous automorphism of the vector group R", and in fact, any continuous automorphism of R" is given by a regular linear transformation. Indeed, the group of all continuous automor- phisms of R" is isomorphic to the tgeneral linear group GL(n, R) of degree n. Any continu- ous automorphism of the torus group T" = Rn/Z" of dimension n is given by a regular linear transformation <p ofR n such that <p(zn) = zn. Hence the group of continuous auto- morphisms of T" is isomorphic to the multi- plicative group of all n x n matrices, with de- terminant ::!: 1 and with entries in the set of rational integers. K. Kronecker's Approximation Theorem Let H be a subgroup of a group G Ellr (not necessarily closed). Then (G,(C(G), H)) coin- cides with the closure H of H. In particular, H is tdense in G if and only if the annihilator (C(G), H) consists of the identity alone. Now let G = R n and let H be the subgroup of R" generated by 0 = (° 1 , ..., On)E Rn and the na- tural tbasis e 1 = (1,0, ... ,0), .. . , en = (0, .., ,0, I) of R". Then H is dense in R" if and only if (Rn,H)={O); that is, ej, ...,On' 1 are linearly independent over the rational number field Q (Kronecker's approximation theorem). This theorem implies that the torus group Tn of dimension n has a cyclic subgroup and a 1- parameter subgroup that are both dense in Tn. L. Linear Topology Consider the discrete topology in a field Q Suppose that an Q-module G has a topology that satisfies tHausdorff's separation axiom and is such that a base for the neighborhood 422 Ref. Topological Abelian Groups system of the zero element 0 consists of Q- submodules, and suppose that G together with this topology constitutes a topological Abelian group. Then this topology is called a linear topology. If a linear topology is restricted to a Q-submodule, then it is also a linear topology. If G is of finite rank, then any linear topology is the discrete topology. The discrete topology on G is a linear topology. Let H be a Q- sub module. Then the subset V = H +g of G obtained by translating H by an element 9 of G is called a linear variety in G. If V is a linear variety, then V is also a linear variety. If Q- modules G, G' have linear topologies, a homo- morphism of G into G' is always assumed to be open and continuous with respect to these topologies. A linear variety V in G is said to be linearly compact if, for any system { II;,} of linear varieties closed in V with the tfinite intersection property, we have n. II;, i= 0. In this case V is closed in G. If linearly compact Q-submodules can be chosen as a base for the neighborhood system of the zero element of G, we say that G is locally linearly compact. The set CdG) of homomorphisms of an Q- module G with linear topology into Q is also an Q-module. For any linearly compact Q- sub module H ofG, let U(H)={xlx(q)= 0, gEH}. Then, with {U(H)} as a base for the neighborhood system, a linear topology can be introduced in Cn(G). According as Gis discrete, linearly compact, or locally linearly compact, Cn(G) is linearly compact, discrete, or locally linearly compact. Let G, H be Q- modules each of which has a linear topology, and let <p:G3g-><pqECdH), IjJ:H3h->ljJhE Cn(G) be homomorphisms such that <pg(h) = IjJh(g). Then if one of <p, IjJ is an isomorphism, so is the other. This is an analog of the Pon- tryagin duality theorem and is called the duality theorem for Q-modules. In particular, a linearly compact Q-module is the direct sum of I-dimensional spaces (S Lefschetz [3J). References [IJ L. S. Pontryagin, Topological groups, first edition, Princeton Univ. Press, 1939; second edition, Gordon & Breach, 1966 (translation of the 1954 Russian edition). [2J A. Wei!, L'integration dans les groupes topologiques et ses applications, Actualites Sci. Ind., Hermann, 1940. [3J S. Lefschetz, Algebraic topology, Amer. Math. Soc. Colloq. Pub!., 1942. [4J W. Rudin, Fourier analysis on groups, Interscience, 1962. [5J E. Hewitt and K. A. Ross, Abstract har- monic analysis, Springer, I, 1963; II, 1970. 
423 A Topological Groups 423 (IV.6) Topological Groups A. Definitions If a tgro up G has the structure of a ttopolog- ical space such that the mapping (x,y)-->xy (product) of the Cartesian product G x G into G and the mapping x-->x- I (inverse) of G into G are both continuous, then G is called a topo- logical group. The group G without a topo- logical structure is called the underlying group of the topological group G, and the topolog- ical space G is called the underlying topological space of the topological group G. Let G, G' be topological groups. A mapping f of G into G' is called an isomorphism of the topological group G onto the topological group G' if r is an tisomorphism of the underlying group G onto the underlying group G' and also a thomeomorphism of the underlying topolog- ical space G onto the underlying topological space G'. Two topological groups are said to be isomorphic if there exists an isomorphism of one onto the other. B. Neighborhood Systems Let 91 be the tneighborhood system of the identitye of a topological group G. Namely, 91 consists of all subsets of G each of which con- tains an open set containing the element e. Then 91 satisfies the following six conditions: (i) If U E91 and U c V, then VE91. (ii) If U, VE91, then un VE91. (iii) If U E91, then eE U. (iv) For any U E 91, there exists aWE 91 such that WW= {xyl X, YE W} C U. (v) If U E91, then U- I E91. (vi) If U E91 and aEG, then aUa -I E91. Conversely, if a nonempty family 91 of subsets of a group G satisfies conditions (i)- (vi), then there exists a ttopology D of G such that 91 is the neighborhood system of e and G is a topological group with this topology. Moreover, such a topology is uniquely deter- mined by 91. tLeft translation x-->ax and tright translation x-->xa in a topological group G are homeomorphisms of G onto G; thus if 91 is the neighborhood system of the identity e, then a91 = 91a is the neighborhood system of a, where a 91 = {aU I U E91}. If the underlying topological space of a top- ological group G is a tHausdorff space, Gis called a T 2 -topological group (Hausdorff topo- logical group or separated topological group). If the underlying topological space of a topo- logical group G is a tTo-topological space, then, as is easily seen, it is a tT I-topological space. If it is a TI-topological space, then by the fact that the topology may be defined by a 1596 tuniformity, it is a tcompletely regular space, hence, in particular, a Hausdorff space ( Sec- tion G). Thus a topological group whose un- derlying topological space is a To-topological space is a T 2 -topological group. C. Direct Product of Topological Groups Consider a family {G.} .EA of topological groups. The Cartesian product G=- ILEA G. of the underlying groups of G. is a topological group with the tproduct topology of the un- derlying topological spaces of G.. This topo- logical group G = IT. EA G. is called the direct product of topological groups G. (() E A). D. Subgroups Let H be a subgroup of the underlying group of a topological group G. Then H is a topolog- ical group with the topology of a ttopological subspace of G (trelative topology). This topo- logical group H is called a subgroup of G. A subgroup that is a closed (open) set is called a closed (open) subgroup. Any open subgroup is also a closed subgroup. For any su bgroup H of a topological group G, the closure H of H is also a subgroup. If H is a normal subgroup, so is H. If H is commutative, so is fl. In a T 2 - topological group G, the tcentralizer C(M) = {xEGlxm=mx (mEM)} of a subset M ofG is a closed subgroup of G. In particular, the tcenter C = C(G) of a T 2 -topological group is a closed normal subgroup. E. Quotient Spaces Given a subgroup H of a topological group G, let G/H = {aH I aE G} be the set of tleft co sets, and let p be the canonical surjection p(a) = aH of G onto G/ H. Consider the tquotient topo- logy on G/H, namely, the strongest topology such that p is a continuous mapping. Since a subset A of G/H is open when p-I (A) is an open set of G, p is also an topen mapping. The set G/ H with this topology is called the left quotient space (or left coset space) of G by H. The right quotient space (or right coset space) H\G = {Hal aEG} is defined similarly. The quotient space G/ H is discrete if and only if H is an open subgroup of G. The quotient space is a Ha usdorff space if and only if H is a closed subgroup. If G/H and H art' both tcon- nected, then G itself is connected. If G/ Hand H are both tcompact, then G is compact. If H is a closed subgroup of G and G/H, H are both tlocally compact, then G is locally compact. Suppose that H is a normal subgroup of a topological group G. Then the quotient group 
1597 GIH is a topological group with the topology of the quotient space GIH. This topological group is called the quotient group of the topo- logical group G by the normal subgroup H. F. Connectivity The tconnected component Go containing the identity e of a topological group G is a closed normal subgroup of G. The connected compo- nent that contains an element (IE G is the coset aGo = Goa. Go is called the identity component of G. The quotient group GIGo is ttotally dis- connected. A connected topological group G is generated by any neighborhood U of the identity. Namely, any element of G can be expressed as the product of a finite number of elements in U. Totally disconnected (in partic- ular, discrete) normal subgroups of a con- nected topological group G are contained in the center of G. G. Uniformity Let 91 0 be the neigh borhood system of the identity of a topological group G, and let U/ ={(X,Y)EG x GlyEXU} for UE91 o ' Then a tuniformity having {U11 U E 91 D } as a base is defined on G. This uniformity is called the left uniformity of G. Left translation x-->ax of Gis tuniformly continuous with respect to the left uniformity. The right uniformity is defined similarly by U r = {(x, y) lyE U X}. These two uniformities do not necessarily coincide. The mapping x-->x- 1 is a tuniform isomorphism of G considered as a uniform space with respect to the left uniformity onto the same group G considered as a uniform space with respect to the right uniformity. A topological group G is thus a tuniform space under a uniformity tcompatible with its topology, and hence it is a completely regular space if the underlying topological space is a T I-space. H. Completeness If a topological group Gis tcomplete with respect to the left uniformity, then it is also complete with respect to the right uniformity, and conversely. In this case the topological group G is said to be complete. A locally com- pact T 2-topological group is complete. If a T 2 - topological group G is isomorphic to a dense subgroup of a complete T 2 -topological group G, then G is called the completion of G, and G is said to be completable. A T 2 -topological group G is not always completable. For a T 2 - topological group G to be completable it is necessary and sufficient that any tCauchy filter 423 J Topological Groups of G considered as a uniform space with res- pect to the left uniformity is mapped to a Cauchy filter of the same uniform space G under the mapping x --> X-I. Then the com- pletion G of G is uniquely determined up to isomorphism. A commutative T 2 -topological group always has a completion G, and G is also commutative. If each point of a T 2- topological group G has a ttotally bounded neighborhood, there exists a completion G, and G is locally compact. I. Metrization If a tmetric can be introduced in a Tz- topological group G so that the metric gives the topology of G, then G is said to be metri- zable. For a T 2 -topological group G to be met- rizable it is necessary and sufficient that G satisfy the tfirst axiom of count ability. Then the metric can be chosen so that it is left in- variant, i.e., invariant under left translation. Similarly, it can be chosen so that it is right invariant. In particular, the topology of a compact T 2 -topological group that satisfies the first axiom of countability can be given by a metric that is both left and right invariant. J. Isomorphism Theorems Let G and G' be topological groups. If a homo- morphism I of the underlying group of G into the underlying group of G' is a contin- uous mapping of the underlying topological space of G into that of G', I is called a con- tinuous homomorphism. Iff is a continuous open mapping, f is called a strict morphism (or open continuous homomorphism). A continuous homomorphism of a tparacompact locally compact topological group onto a locally compact Tz-topological group is an open continuous homomorphism. A topological group G' is said to be homo- morphic to a topological group G if there exists an open continuous homomorphism I of G onto G'. Let N denote the kernel I-I (e) of j. Then the quotient group GIN is isomorphic to G', with GIN and G' both considered as topo- logical groups (homomorphism theorem). Let I be an open continuous homomorphism of a topological group G onto a topological group G', and let H' be a subgroup of G'. Then H = f-1(W) is a subgroup ofG, and the mapping cp defined by cp(gH)= I(g)H' is a homeomor- phism of the quotient space GIH onto G'IH'. In particular, if H' is a normal subgroup, then H is also a normal subgroup and cp is an iso- morphism of the quotient group GI H onto G' I H' as topological groups (first isomorphism theorem). Let Hand N be subgroups of a topo- 
423 K Topological Groups logical group G such that H N = N H. Then the canonical mapping f: h(H n N)-+hN of the quotient space HjHnN to HNjN is a con- tinuous bijection but not necessarily an open mapping. In particular, if N is a normal sub- group of the group H N, then [IS a continuous homomorphism. In addition, if [ is an open mapping, the quotient groups HjH n Nand HNjN are isomorphic as topological groups (second isomorphism theorem). For example, [ is an open mapping (I) if N is compact Or (2) if G is locally compact, H Nand N are closed subgroups of G, and H is the union of a count- able number of compact subsets. Let H be a subgroup of a topological group G and N be a normal subgroup of G such that H => N. Then the canonical mapping of the quotient space (GjN)j(lljN) onto GjH is a homeomorphism. In particular, if H is also a normal subgroup, the quotient groups (GjN)j(HjN) and GjH are isomorphic as topological groups (third iso- morphism theorem). K. The Projective Limit Let {G').EA be a family of topological groups indexed by a tdirected set A, and suppose that if ex  {>, there exists a continuous homomor- phism [,p:Gp-+G, such that fo y = f/IOfp;, if ex  fi  y. Then the collection {G" [op} of the family {GJ 'EA of topological groups together with the family {{,p} of mappings is called a projective system of topological groups. Con- sider the direct product IT'EA Go of topological groups {GJ, and denote by G the set of all elements X={XJ'EA ofITG, that satisfy X,= ['II(Xp) for ex  fi. Then G is a subgroup of IT Go. The topological group G obtained in this way is ca11ed the projective limit of the projective system (G" [,p) of topological groups and is denoted by G = lim G,. If each G, is a Trtopological (resp. complete) group, then G is also a T 2 -topological (complete) group. Now consider another projective system {GJ;II} of topological groups indexed by the same A, and consider continuous homomor- phisms U,: G,-+G such that u,o {,p = j;po up for ex  f3. Then there exists a unique continu- ous homomorphism u of G =!im G, into G' = lim G such that for any exE A, oj {; ou hOlds, where Df;) is the restriction to G(G') of the projection of IT G, (IT G) onto G,(G). The homomorphism u is called the projective limit of the family {u,] of continuous homomor- phisms and is denoted by u = lim u O ' Let G be a T r topological group, and Ict{HJoEA be a decreasing sequence (H, => Hp for ex  [3) of closed normal subgroups of G. Consider the quotient group GjH" and let {'II be the canon- ical mapping qH{I-+qH, of G p to G, for ex  {r 1598 Then {G" f/l} is a projective system of topo- logical groups. Let f be the projec1 ion of G onto G,= GjH" and let [=I f,. Now assume that any neighborhood of the identity of G contains some H, and that some H, is com- plete. Then { = lim {, is an isomorphIsm of G onto lim Gj H, a:topological groups. (For a gener;] discussion of the topological groups already discussed  [1,4].) L. Locally Compact Groups For the rest of this article, all topological groups under consideration are assumed to be T r topological groups. The identity com- ponent Go of a locally compact group G is the intersection of all open subgroups of G. In particular, any neighborhood of the identity of a totally disconnected locally compact group contains an open subgroup. A totally dis- connected compact group is a projective limit of finite groups with discrete topology. A T 1 -topological space L is called a local Lie group if it satisfies the fonowing six conditions' (i) There exist a nonempty subset M of Lx L and a continuous mapping j1: M -+ L, called multiplication (j1(a, h) is written as (lh). (ii) If (a,h), (ah,c), (h,c), (a, he) are all in M, then (ah)e = a(he). (iii) L contains an element e, called the identity, such that L x {e} c M and ae = a for all aEL. (iv) There exists a nonempty open subset N of L and a continuous mapping v: N -+L such that av(a)=e for all aEN. (v) There exist a neighborhood U of e in L and a homeo- morphism f of U into a neighborhood V of the origin in the Euclidean space R n . (vi) Let D be the open subset of V x V defined by D = [(X,Y)EVX vl(f-I(x),r'(Y))EM, rl(x), f -I(Y)E U} Then the function F:D-+ V defined by F(x,y)= [j1U- 1 (x), [-I(y)) is oftclass CWo For any neighborhood U of the identity e of a connected locally compact group G, there exist a compact normal subgroup K and a subset L that is a local Lie group under the +induced topology and the group operations of G such that the product LK is a neighborhood of e contained in U Furthermore, under (I, k) -+lk, LK is homeomorphic to the product space L x K. Any compact subgroup of a connected locally compact group G is con- tained in a maximal compact subgroup, and maximal compact subgroups of G are +conju- gate. For a maximal compact subgroup K of G, there exists a finite number of subgroups HI"", Hr of G, each of which is isomorphic to the additive group of real number;, such that G  KH 1 .. H" and the mapping (k, hi,"" h,) -+kh 1 ... h, is a homeomorphism of the direct product K x HI x... X Hr onto G. Any 10ca11y compact group has a left-invariant positive 
1599 measure and a right-invariant positive mea- sure, which are uniquely determined up to constant multiples ( 225 Invanant Mea- sures). Using these measures, the theory of harmonic analysis on the additive group R of real numbers can be extended to that on G ( 69 Compact Groups; 192 Harmonic Analysis; 422 Topological Abelian Groups; 437 Unitary Representations). M. Locally Euclidean Groups Suppose that each point of a topological group G has a neighborhood homeomorphic to an open set of a given Euclidean space. Then G is called a locally Euclidean group. If the underlying topological space of a topolog- ical group has the structure of a treal analytic manifold such that the group operation (x, y) ->xy-I is a real analytic mapping, then G is called a tLie group. A Lie group is a locally Euclidean group. N, Hilbert's Fifth Problem Hilbert's fifth problem asks if every locally Euclidean group is a Lie group ( 196 Hil- bert). This problem was solved affirmatively in 1952; it was proved that any tlocally con- nected finite-dimensional locally compact group is a Lie group (D. Montgomery and L. Zippin [3J). In connection with this, the rela- tion between Lie groups and general locally compact groups has been studied, and the following results have been obtained: A neces- sary and sufficient condition for a locally compact group to be a Lie group is that there exist a neighborhood of the identity e that does not contain any subgroup (or any normal subgroup) other than {e}. A locally compact group has an open subgroup that is the projec- tive limit of Lie groups. Hilbert's fifth problem is closely related to the following problem: Find the conditions for a ttopological trans- formation group operating teffectively on a manifold to be a Lie group ( 431 Transfor- mation Groups). O. Covering Groups Let (\j be the collection of all tarcwise con- nected and tlocally arcwise connected Tz- topological groups. Suppose that G* E (5 is a tcovering space of G E Cb and the tcovering mapping f: G* -> G is an open continuous homomorphism, with G* and G considered as topologi\.:al groups. Then G* (or, more pre- cisely, (G*,f)) is called a covering group of G. Then the kernel r'(e)=D of {is a discrete 423 P Topological Groups subgroup contained in the center of G*, and G*/D and G, considered as topological groups, are isomorphic to each other. Let n, (G) be the tfundamental group of G. The natural homo- morphism f*: 1!, (G*)->1!j (G) induced by f is an injective homomorphism, and if we identify 1!, (G*) with the subgroup f*(1T., (G*)) of 1T.! (G), we have D:;::1T.dG)/1!! (G*). Conversely, if D is any discrete subgroup contained in the center of G* E (\), then G* is a covering group of G = G* /D. For any covering space (G*,f) of G E (I), a multiplication law can be mtroduced in G* so that G* is a topological group be- longing to (I) and (G*, f) is a covering group of G. In particular, any GEff) has a t s imp1y con- nected covering group (G, cp). Then for any covering group (G*, f) of G, there eXIsts a homomorphism f*: G->G*, and (G, (*) is a covering group of G*. Furthermore, cp = {o {*. Hence, in particular, any simply connected covering group of G is isomorphic to G, with G and G considered as topological groups. This simply connected covering group (G, cp) is called the universal covering group. Let G and G' be topological groups, and let e and e' be their identities. A homeomorphism f of a neighborhood U of e onto a neighbor- hood U' of e' IS called a local isomorphism of G to G' if it satisfies the following two conditions: (i) If a, b, ab are all contained in U, then (tab) = f(a)f(b). (ii) Let r! =g, then if a', b', a'b' E U', g(a'b') = g(a')g(b') holds. If there exists a local isomorphism of G to G', we say that G and G' are locally isomorphic. If G* is a cover- ing group of G, then G* and G are locally isomorphic. For two topological groups G and G' to be locally isomorphic it is necessary and sufficient that the universal covering groups of G and G' be isomorphic. For two connected Lie groups to be locally isomorphic it is neces- sary and sufficient that their tLie algebras be isomorphic. Let f be a mapping of a neighborhood U of the identity of a topological group G into a group H such that If a, b, ab are all contained in U, then f(ab) = ((a) f(b). Then {is called a local homomorphism of G into Hand U is called its domain. A local homomorphism of a simply connected group GECb into a group H can be extended to a homomorphism of G into H if the domain is connected [2,4]. P. Topological Rings and Fields If a ring R has the structure of a topological group such that (x,y)->x+ y (sum) and (x,y) ->xy (product) are both continuous mappings of R x R into R, then R is called a topological ring. If a topological ring K is a field (not necessarily commutative) such that x->x-! 
423 Ref. Topological Groups (inverse element) is a continuous mapping of K* = K - {O} into K*, then K is called a topo- logical field. Let us assume that K is a topo- logical field that is a locally compact Haus- dorff space and is not discrete. If K is con- nected, then K is a tdivision algebra of finite rank over the field R of real numbers; hence it is isomorphic to the field R of real numbers, the field C of complex numbers, or the tqua- ternion field H. If K is not connected, then K is totally disconnected and is isomorphic to a division algebra of finite rank over the tp-adic number field Qp or a division algebra of finite rank over the tformal power series field with coefficients in a finite field [4]. For various important classes of topological groups  69 Compact Groups; 249 Lie Groups; 422 Topological Abelian Groups; 424 Topological Linear Spaces. References [I] N. Bourbaki, Elements de mathematique, Topologie generale, ch. 3, Actualites Sci. Ind., I 143c, Hermann, third edition, 1960; English translation, General topology, pt. 1, Addison- Wesley, 1966. [2] C. Chevalley, Theory of Lie groups I, Princeton Univ. Press, 1946. [3] D. Montgomery and L. Zippin, Topolog- ical transformation groups, Interscience, 1955. [4] L. S. Pontryagin, Topological groups, first edition, Princeton Univ. Press, 1939, second edition, Gordon & Breach, 1966. (Second English edition translated from the second Russian edition, 1954.) 424 (XII.5) Topological Linear Spaces A. Definition A tlinear space E over the real or complex number field K is said to be a topological linear space, topological vector space, or linear topological space if E is a ttopological space and the basic operations x+ y and:xx (x,YEE, :XE K) in the linear space are continuous as mappings of Ex E and K x E, respectively, into E. The coefficient field K may be a gen- eral ttopological field, although it is usually assumed to be the real number field R or the complex number field C, and accordingly E is called a real topological linear space or a com- plex topological linear space. Topological linear spaces are generalizations of tnormed linear spaces and play an important role in the study 1600 of tfunction spaces, such as the tspace of dis- tributions, that are not tBanach spaces. Each topological linear space E is equipped with a tuniform topology in which 1ranslations of the neighborhoods of zero form a tuniform family of neighborhoods, and the addition x + y and the multiplication :xx by a scalar :x are uniformly continuous relative to this uniform topology. In particular, if for each x i= 0 there is a neighborhood of the origin that does not contain x, then E satisfies the tseparation axiom T 1 and hence is a tcompletely regular space. The tcompletion E of E is also a topo- logical linear space. We assume in this article that K is the real or complex number field and E is a topological linear space over K satisfying the ajom of T 1- spaces. Then E is finite-dimensional if and only if E has a ttotally bounded neighborhood of zero. The topology of E is tmetrizable if and only if it satisfies the tfirst countability axiom. B. Linear Functional A K -valued function f(x) on E is said to be a linear functional if it satisfies (i) f(x + y) = f(x) + f(y) and (ii) ((:xx) = :xf(x). A linear functional that is continuous relative to the topologies of E and K is said to be a continuous linear functional. (Sometimes continuous linear functionals are simply called linear functionals, while abstract linear functionals are called algebraic linear functionals.) The following three statements are equivalent for linear functionals f(x):(i) {(x) is continuous; (ii) the half-space {x EEl Re f(x) > O} is open; (iii) the hyperplane {xEElf(x)=O} is closed. C. The Hahn-Banach Theorem A linear functional f(x) defined on a linear subspace F of E can be extended to a continu- ous linear functional on E if and only if there exists an open tconvex neighborhood V of the origin in E that is disjoint with {XE F I {(x) = I}. Furthermore, if f(x) can be extended, at least one extension f(x) never takes the value 1 on V (Hahn-Banach theorem). D. Dual Spaces The set E' of all continuous linear functionals on E is called the dual space of E. It is often denoted by E* and is also called the conjugate space or adjoint space. It forms a linear space when f + g and exf (j, gEE', rJ.EK) arC' defined by (f +g)(x) = f(x)+g(x) and (:xf)(x)=:x(f(x)) for XE E. 
1601 E. Locally Convex Spaces A topological linear space is said to be locally convex if it has a family of convex sets as a tbase of the neighborhood system ofO. It follows from the Hahn-Banach theorem that for each x of 0 in a locally convex space E there is a continuous linear functional f such that f (x) of O. A subset M of E is said to be circled if M contains iXM = {iXX I XE M} whenever I iX I:::; 1. A set that is both circled and convex is called absolutely convex. In a locally convex space, a family of absolutely convex and closed sets can be chosen as a base of the neighborhood sys- tem of the origin. Let A and B be subsets of E. A is said to absorb B if there is an iX > 0 such that iXA => B. A set V that absorbs every point x E E is called absorbing. Neighborhoods of 0 are absorbing. F. Seminorms A real-valued function pix) on E is said to be a seminorm (or pseudonorm) if it satIsfies (i) O:::;p(x) < +00 (xEE); (ii) p(x+ y):::;p(x) + p(y); and (iii) p(iXx)=liXlp(x). The relation V= {x I p(x):::; I} gives a one-to-one correspon- dence between seminorms pix) and absolutely convex absorbing sets V whose intersection with any line through the origin is closed. In terms of seminorms, the Hahn- Banach theorem states: Let E be a linear space on which a seminorm pix) is given. If a linear functional fix) defined on a linear subspace F of E satisfies I f(x)l:::; pix) on F, then fix) can be extended to the whole space E in such a way that the inequality holds on E. The topology of a locally convex space is determined by the family of continuous semi- norms on it Conversely, if there is a family of semi norms {PA(X)} (J,EI\.) on a linear space E over K that satisfies (iv) p)(x)=O for all ic implies x = 0, then there exists on E the weakest locally convex topology that renders the seminorms continuous. This topology is called the locally convex topology determined by {p),(x)}. We assume that E is a locally convex space whose topology is determined by the family of seminorms (PA(X)} (i"E 1\.). Then a tnet Xv of E converges to x if and only if PA(X V - x)--.O for all ),E 1\.. If F is a locally convex space whose topology is determined by the fami1y of semi- norms {q"(y)}, then a necessary and sufficient condition for a linear mapping u: E--.F to be continuous is that for every q"(y) there exist a finite number of ic 1" . )'n E I\. and a constant C such that q(u(x)):::; C(p! ,(x) +... + p!.,(x)) (xEE). A set is said to be bounded if it is absorbed 424 H Topological Linear Spaces by every neighborhood of zero. When the topology of E is determined by the family {p A (x)} of seminorms a set B is bounded if and only if every PA is bounded on B. Totally bounded sets are bounded. The unit ball in a normed space is bounded. Conversely, a locally convex space is normable if it has a bounded neighborhood of O. A locally convex space is called quasicomplete if every bounded closed set is complete. Since Cauchy sequences (x n } are totally bounded, all Cauchy sequences converge in a quasicomplete space (i.e., the space is sequentially complete). G. Pairing of Linear Spaces Let E and F be linear spaces over the same field K. A K-valued function B(x, y) (xEE, YE F) on Ex F is called a bilinear functional or bilinear form iffor each fixed YEF (resp. xEE), it is a linear functional of x (resp. y). When a bilinear functional <x, y) on E x F is given so that <x, y) = 0 ror all YE F (all xEE) implies x = 0 (y = 0), then E and F are said to form a (separated) pairing relative to the inner product <x, y). A locally convex space E and its dual space E' form a pairing relative to the natural inner product <x, x') = x'(x) (xEE, x' EE'). H. Weak Topologies When E and F form a pairing relative to an inner product <x, y), the locally convex top- ology on E determined by the family of semi- norms {I < x, y) II y E F} is called the weak top- ology (relative to the pairing <E, F») and is denoted by (J(E, F). A net Xv in E is said to converge weakly if it converges in the weak topology. When E and E' are a locally convex space and its dual space, (J(E, E') is called the weak topology of E, and (J(E', E) the weak* topology of E'. The weak topology on a lo- cally convex space E is weaker than the orig- mal topology on E. Consequently, a weakly closed set is closed. If the set is con vex, the converse holds, and hence a convex closed set is weakly closed. Also, boundedness is pre- served if we replace the original topology by the weak topology. Thus a weakly bounded set is bounded. Let E and F form a pairing relative to <x,y), and let A be a subset of E. Then the set A' of points YEF satisfying Re<x,y) -1 for all XEA is called the polar of A (relative to the pairing). If A is absolutely convex, AO is also absolutely convex and is the set of points y such that l<x,y)l:::;l for all XEA. If A is a convex set containing zero, its (weak) closure is equal to the bipolar AO O =(AOt (bipolar theorem). In general, let A be a subset of a 
424 I Topological Linear Spaces topological linear space E. We call the smallest closed convex set containing A the closed convex hull of A. If E is locally convex, the bipolar A 0' relative to E' coincides with the closed convex hull of A U {O}. A subset B of the dual space E' is tequi- continuous on E if and only if it is contained in the polar V' of a neighborhood V of 0 in E. Also, V e is weak*- compact in E' (Banach- Alaoglu theorem). I. Barreled Spaces and Bomological Spaces An absorbing absolutely convex closed set in a locally convex space E is called a barrel. In a sequentially complete space (hence in a quasi- complete space also), a barrel absorbs every bounded set. A locally convex space is said to be barreled if each barrel is a neighborhood of O. A locally convex space is said to be quasi- barreled (or evaluable) if each barrel that ab- sorbs every bounded set is a neighborhood of O. Furthermore, a locally convex space is said to be bomological if each absolutely convex set that absorbs every bounded set is a neighbor- hood of O. Bomological spaces are quasi- barreled. However, they are not necessarily barreled. Furthermore, barreled spaces are not necessarily bomological. A metrizable locally convex space, i.e., a space whose topology is determined by a countable number of semi- norms, is bomological. A complete metrizable locally convex space is called a locally convex Frechet space «F)-space or sim pi y Frechet space). To distinguish it from Frechet space as in 37 Banach Spaces, it is sometimes called a Frechet space in the sense of Bourbaki. (F)- spaces are bomological and barreled. A continuous linear mapping u: E-->F of one locally convex space into another maps each bounded set of E to a bounded set in F. Con- versely, if E is bomological, then each linear mapping that maps every bounded sequence to a bounded set is continuous. J. The Banach-Steinhaus Theorem In the dual space of a barreled space E, each (weak*-)bounded set is equicontinuous Thus if a sequence of continuous linear mappings Un of E into a locally convex space F con- verges at each pomt of E, then Un converges uniformly on each totally bounded set of E, and the limit linear mapping is continuous (Banach-Steinhaus theorem). K. The S- Topology Let E and F be paired linear spaces relative to the inner product <x,y). When a family S 1602 of (weakly) bounded sets of F generates a dense subs pace of F, the family of seminorms (SUPYEBI<x,Y)11 BES} determines a locally convex topology on E. This is called the S- topology or topology of uniform convergence on members of S, because Xv-->X in the S-topology is equivalent to the uniform convergence of <x"y)--><x,y) on each BES. The space E with the S-topology is denoted by Es. The weak topology is the same as the topology of pointwise convergence. The S-topology in which S is the family of all bounded sets in F is called the strong topology and is denoted by {1(E, F). The dual space E' of a locaily convex space E is usually regarded as a locally convex space with the strong topology fJ(E', E). It is called the strong dual space. The topology of a locally convex space E is that of uniform con- vergence on equicontinuous sets of E'. The topology of a barreled space E coincides with the strong topology fJ(E, E'). L. Grothendieck's Criterion of Completeness Let E and F be paired spaces, and let S be a family of absolutely convex bounded sets of F such that: (i) the sets of S generate F; (ii) if B" B 2 ES, then there is a B 3 ES such that B3B, and B3  B 2 . Then Es is complete if and only if each algebraic linear functional j (y) on F that is weakly continuous on every BES is ex- pressed as [(y) = <x, y) for some XE:E. When Es is not complete, the space of all linear func- tionals satisfying this condition gives the com- pletion Es of Es. M. Mackey's Theorem Let E, F, and S satisfy the same conditions as in Section L. Then the dual space of Es is equal to the union of the weak completions of i.B, where ).>0 and BES (Mackey's theorem). N. The Mackey Topology When E and F form a pairing, the topology on E of uniform convergence on convex weakly compact sets of F is called the Mackey topol- ogy and is denoted by T(E, F). The dual space of E endowed with a locally convex topology T coincides with F if and only if T is stronger than the weak topology (J(E, F) and weaker than the Mackey topology T(E, F) (Mackey- Arens theorem). A locally convex space is said to be a Mackey space if the topology is equal to the Mackey topology T(E, E'). Every quasi- barreled space is a Mackey space. 
1603 O. Reflexivity Let E be a locally convex space. The dual space E" of the dual space E' equipped with the strong topology contains the original space E. We call E semireflexive if En = E, and reflex- ive if in addition the topology of E coincides with the strong topology PIE, E'). E is semi- reflexive if and only if every bounded weakly closed set of E is weakly compact. E is re- flexive If and only if E is semIreflexive and ( quasi)barreled. A barreled space in which every bounded closed set is compact is called a Montel space or (M)-space. (M)-spaces are reflexive, and their strong dual spaces are also (M)-spaces. Many of the function spaces that appear in applications are (F)-spaces or their dual spaces. For these spaces detailed consequences of the countability axiom are known [7,8]. A convex set C in the dual space E' of an (F)- space E IS weak*-closed if and only if for every neighborhood V of 0 in E, en V" is weak*- closed (Krein-Shmul'yan theorem). The strong dual space E' of an (F)-space E is (quasi)barreled if and only if it is bornological. In particular, the dual space of a reflexive (F)- space is bornological. P. (DF)-Spaces A locally convex space is called a (DF)-space if it satisfies: (i) There is a countable base of bounded sets (i.e., every bounded set is in- cluded in one of them); (ii) if the intersection V of a countable number of absolutely con- vex closed neighborhoods of 0 absorbs every bounded set, then V is also a neighborhood of O. The dual space of an (F)-space is a (DF)- space, and the dual space of a (DF)-space is an (F)-space. A linear mapping of a (DF)-space E into a locally convex space F is continuous if and only if its restriction to every bounded set of E is continuous. A quasicomplete (DF)- space is complete. Q. Bilinear Mappings A bilinear mapping b(x, y) on locally convex spaces E and F (x E E, Y E F) to a locally convex space G is said to be separately continuous if for each fixed YEF (xEE) it is continuous as a function of x (y). The linear mappings obtained from b(x, y) by fixing x (y) are denoted by bAY) (by(x)). We call b(x, y) hypocontinuous if for each bounded set B of E and B' of F, {bJy)lxEB} and {by(x)IYEB'} are eqUlcon- tinuous. A continuous bilinear mapping is hypocontinuous. However, the converse is 424 S Topological Linear Spaces not always true. A separately continuous bi- linear mapping is not necessarily hypocontinu- ous. If both E and F are barreled, however, then every separately continuous mapping is hypocontinuous. If E is an (F)-space and F is metrizable, then every separately continu- ous bilinear mapping is continuous. Simi- larly, if both E and Fare (DF)-spaces, then every hypocontinuous bilinear mapping is continuous. R. Tensor Products It is possible to introduce many topologies in the ttensor product E @ F of locally convex spaces E and F. The projective topology (or topology n) is defined to be the strongest topol- ogy such that the natural bIlinear mapping Ex F --.E @ F is continuous. The dual space of E @nF is identified with the space B(E, F) of all continuous bilinear functionals on Ex F. The completion of E @nF is denoted by E @ F. The topology of biequicontinuous con- vergence (or topology £) is defined to be the topology of uniform convergence on sets Vo x V\ where Vand V are neighborhoods of 0 in E and F, respectively, considering the ele- ments of E @ F as linear functionals on E' @ F' by the natural pairing of E @ F and E' @ P. The completion of E @,F is denoted by E @ F. The dual space of E@,F coincides with the subspace liE, F) of B(E, F) composed of the union of the absolute convex hulls of the products Va @ Va of equicontinuous sets. The elements of l(E,F) are called integral bilinear functionals. Closely related to E @ F is L. Schwartz's £ tensor product E £ F [12]. (They coincide if E and F are complete and if E or F has the tap- proximation property.) E £ F can be regarded as (i) a space of vector-valued functions if E is a space of functions and F is an abstract lo- cally convex space, especially a space of func- tions of two variables if E and F are, respec- tively, spaces of functions of one variable, and (ii) a space of operators G--.F if E is the dual space G' of a locally convex space G. S. Nuclear Spaces Let E be a locally convex space, V be an ab- solutely convex closed neighborhood of the origin, and pix) be the seminorm correspond- ing to V. Then we denote by Ev the normed space with norm pix) obtained from E by identifying the two elements x and y with p(x- y)=O.1f U c V, then a natural lInear mapping <Pu.v:Eu--.Ev is defined. A locally convex space E is said to be a 
424 T Topological Linear Spaces nuclear space (resp. Schwartz space or simply (S)-space) if for each absolutely convex closed neighborhood V of 0 there is another U such that <fJu.v is a tnuclear operator (resp. tcom- pact operator) as an operator of Eu into the completion of Ev. A nuclear space or (S)-space is an (M)-space if it is quasicomplete and quasibarreled. A locally convex space E is a nuclear space if and only if the topologies n and G cOIncide on the tensor product E @ F with any locally convex space F. Accordingly, it follows that B(E, F)=J(E, F). This can be regarded as a generalization of Schwartz's kernel theorem, which says that every sepa- rately continuous bilinear functional on !l!x x '3J y is represented by an integral with kernel in !l!y' The theory of topological tensor prod- ucts and nuclear spaces is due to Grothen- dieck [9]. A locally convex space E is a nuclear (F)- space if and only if E is isomorphic to a closed subspace of C"'( -00,00) (T. Komura and Y. K6mura, 1966). An example of a nuclear (F)- space without basis is known (8. S. Mityagin and N. M. Zobin, 1974). T. Gel'fand Triplet Let Hand L be Hilbert spaces. If L is a dense subspace of H and the injection L--> H is a tHilbert-Schmidt operator, then H = H' is regarded as a dense subspace of L' and the injection H' -->L' is a Hilbert-Schmidt operator. In this case, (L, H, L') is called a Gel'fand trip- let (or a rigged Hilbert space). A subset of H is called a cylindrical set if it is expressed in the form P F - 1 (B) by the ortho- gonal projection P F onto a finite-dimensional subspace F and a Borel subset B of F. If a finitely additive positive measure f.1. with II f.1. II 1 = 1 defined on the cylindrical sets of H satisfies (i) f.1. is countably additive on cylindrical sets for a fixed F and (ii) for any G > 0 there exists aD> 0 such that IIx II < D implies f.1.{YE HI I < x, y> I ;;, I} < G, then f.1. is the restriction of a countably additive measure ji defined on the Borel subsets of L' (Minlos's theorem, 1959). Let T be a self-adjoint operator in H. Then T has a natural extension T in L' and almost every continuous spectrum Ie of T has an asso- ciated eigenvector X A in L': TX A =),x A , XAEL'. U. The Extreme Point Theorem Let A be a subset of a linear space E. A point x E A is said to be an extreme point if x is an extreme point of any real segment containing x and contained in A. If A is a compact convex subset of a locally convex space E, A is the convex closed hull of (i.e., smallest convex 1604 closed set containing) the set of its extreme points (Krein-Milman theorem). In applica- tions it is important to know whether every point of A is represented uniquely as an inte- gral of extreme points. For a metrizable con- vex compact subset A of a locally convex space E, the following two conditions are equivalent (Choquet's theorem): (i) A is a sim- plex, i.e., if we put A = {(Ax, .!c) I xEA, Ie > O} c E x R 1, the vector space A - A be:comes a Hattice with positive cone A; (ii) for any XEA there exists a unique positive measure f.1. on A with 11f.1.111 = 1 such that I(x) = SA II y)df.1.(Y) (l E E') and the support of f.1. is contained in the set of extreme points of A. V. Weakly Compact Set A subset of a quasicomplete locally convex space is relatively weakly compact If and only if every sequence in the set has a wt'ak ac- cumulation point (Eberlein's theorem). If E is a metrizable locally convex space, every weakly compact set of E is weakly sequentially com- pact (Shmul'yan's theorem). If E is a quasi- complete locally convex space, the convex closed hull of any weakly compact subset is weakly compact (Krein's theorem). If E is not quasicomplete, this is not necessarily true. W. Permanence Each subspace, quotient space, direct product, direct sum, projective limit, and inductive limit (of a family) of locally convex spaces has a unique natural locally convex topology. These spaces, except for quotient spaces and induc- tive limits, are separated, and a quotient space E/ A is separated if and only if the subspace A is closed. The limit of a sequence EJ c E 2 C '" is said to be a strictly inductive limill if En has the induced topology as a subspace of E n +1' If E is a strictly inductive limit of a sequence En such that En is closed in E n +1 or if E; is the inductive limit of a sequence E 1 C E 2 C ... such that the mapping En-->En+l maps a neigh- borhood of 0 to a relatively weakly compact set, then E is separated and each bounded set of E is the image of a bounded set in some En. If E = U En is the strictly inductive limit of the sequence {En}, then the topology of En coin- cides with the relative topology of En C E. The strictly inductive limit of a sequence of (F)- spaces is called an (LF)-space. Any complete locally convex space (resp. any locally convex space) is (resp. a dense linear subspace of) the projective limit of Banach spaces. Every (F)-space E is the projec- ti ve limit of a sequence of Banach spaces E 1 <- E 2 <-.... In particular, E is said to be a count- 
1605 ably normed space if the mappings E -> En are one-to-one and Ilxll n :( IIxll n +, for all xEE with E considered as a subspace En. We call E a countably Hilbertian space if, in particular, the En are tHilbert spaces. An (F)-space with at least one continuous norm is a nuclear space if and only if it is a countably Hilbertian space such that the mappings En+' -> En are Hilbert- Schmidt operators or nuclear operators. A locally convex space is bornological if and only if it is the inductive limit of normed spaces. A locally convex space is said to be uItrabornological if it is the inductive limit of Banach spaces, or in particular, if it is quasi- complete and bornological. Properties of spaces, such as being complete, quasicomplete, semireflexive, or having every bounded closed set compact, are inherited by closed subspaces, direct products, projec- tive limits, direct sums, and strictly inductive limits formed from the original spaces, and properties of spaces, such as being Mackey, quasibarreled, barreled, and bornological, are inherited by quotient spaces, direct sums, in- ductive limits, and direct products formed from the spaces. (For direct products of high power of bornological spaces, unsolved prob- lems still exist concerning the inheritance of properties.) Quotient spaces of (F)-spaces are (F)-spaces, but quotient spaces of general complete spaces are not necessarily complete. There are examples of a Montel (F)-space whose quotient space is not reflexive and a Montel (DF)-space whose closed subspace is neither a Mackey space nor a (DF)-space. The property of being a Schwartz space or a nu- clear space is inherited by the completions, subs paces, quotient spaces of closed subs paces, direct products, projective limits, direct sums of countable families, and inductive limits of countable families formed from such spaces. Tensor products of nuclear spaces are nuclear spaces. Y. Komura gave an example of a non- complete space that is quasicomplete, borno- logical, and nuclear (and hence a Montel space). X. The Open Mapping Theorem and the Closed Graph Theorem Let E and F be topological linear spaces. The statement that every continuous linear map- ping of E onto F is open is called the open mapping theorem (or homomorphism theorem), and the statement that every linear mapping of F into E is continuous if its graph is closed in F x E is called the closed graph theorem. These theorems hold if both E and F are complete and metrizable (S. Banach). A locally convex space is said to be B- 424 Ref. Topological Linear Spaces complete (or fully complete) if a subspace C of E' is weak*-closed whenever en V C is weak*- closed for every neighborhood V of 0 in E. (F)- spaces and the dual spaces of reflexive (F)- spaces are B-complete. B-complete spaces are complete, and closed subs paces and quotient spaces by closed subspaces of B-complete spaces are B-complete. If E is B-complete and F is barreled, then the open mapping theorem and the closed graph theorem hold (V. Ptak). Both theorems hold also if F is ultraborno- logical and E is a locally convex space ob- tained from a family of (F)-spaces after a finite number of operations of taking closed sub- spaces, quotient spaces by closed spaces, direct products of countable families, projective limits of countable families, direct sums of countable families, and inductive limits of countable families. This was conjectured by Grothendieck and proved by W. Slowikowski (1961) and D. A. Raikov. Later, L. Schwartz, A. Martineau, M. De Wilde, W. Robertson, and M. Nakamura simplified the proof and enlarged the class of spaces E [15]. (LF)-spaces, the dual spaces of Schwartz (F)- spaces, and the space flJ' of distributions are examples of spaces E described in the previous paragraph. Y. Nonlocally Convex Spaces The space Lp for 0 < p < I shows that non- locally convex spaces are meaningful in func- tional analysis. Recently, the Banach-Steinhaus theorem, closed graph theorems, etc. have been investigated for nonlocally convex topo- logical linear spaces [13]. Z. Diagram of Topological Linear Spaces The spaces in Fig. I are all locally convex spaces over the real number field or the com- plex number field and satisfy the separation axiom T,. The notation A -> B means that spaces with property A have property B. Main properties of dual spaces are listed in Table I. References [IJ N. Bourbaki, Elements de mathematique, Espaces vectoriels topologiques, Actualites Sci. Ind., Hermann, 1189a, 1966; 1229b, 1967; 1230a, 1955 [2J A. Grothendieck, Espaces vectoriels topo- logiques, Lecture notes, Sao Paulo, 1954. [3J G. Kothe, Topologische lineare R<iume I, Springer, second edition, 1966; English transla- tion, Topological vector spaces I, II, Springer, 1969,1979. 
425 A Topologkal Spaces I Nuclear I Fig. 1 Topological linear spaces. Table 1 E E' Barreled Reflexive Quasicomplete Complete Bornological (DF) (F) (M) Nuclear, reflexive U1trabornological Semireflexive Reflexive Quasibarreled Bornological Reflexive, (F) (F) (DF) (M) Nuclear, (LF) or (DF) Complete, (S) [4] I. M. Gel'fand et a!., Generalized functions. II, Functions and generalized function spaces (with G. E. Shilov), Academic Press, 1968; IV, Applications of harmonic analysis (with N. Ya. Vilenkin), Academic, 1964. (Original in Russian, 1958-1966.) [5] A. Pietsch, Nukleare 10kalkonvexe Riiume, Berlin, 1965. [6] A. P. Robertson and W. Robertson, Topo- logical vector spaces, Cambridge Univ. Press, 1964. [7] J. Dieudonne and L. Schwartz, La dualite dans les espaces (Ij) et (Em, Ann. Inst. Fourier, 1 (1949), 61-101. [8] A. Grothendieck, Sur les espaces (F) et (DF), Summa Brasil. Math., 3 (1954), 57-123. [9] A. Grothendieck, Produits tensorie1s topo- 10giques et espaces nucleaires, Mem. Amer. Math. Soc., 1955. 1606 [10] G. Choquet, Le theoreme de representa- tion integrale dans les ensembles convexes compacts, Ann. Inst. Fourier, 10 (1960),333- 344. [11] J. L. Ke11ey, I. Namioka, et a!., Linear topological spaces, Van Nostrand, 1963. [12] L. Schwartz, Theorie des distributions it valeurs vectorie11es I, Ann. Inst. Fourier, 7 (1957),1-141. [13] N. Adasch, B. Ernst, and D. Keim, Topo- logical vector spaces: The theory without convexity conditions, Lecture notes in math. 639, Springer, 1978. [14] F. Treves, Topological vector spaces, distributions and kernels, Academic Press, 1967. [15] M. De Wilde, Closed graph theorems and webbed spaces, Pitman, 1978. 425 (11.16) Topological Spaces A. Introdnction Convergence and continuity, as we11 as the algebraic operations on real numbers, are fundamental notions in analysis. In an abstract space too, it is possible to provide an ad- ditional structure so that convergence and continuity can be defined and a theory analo- gous to classical analysis can be developed. Such a structure is ca11ed a topological struc- ture (for a precise definition,  Section B). There are several ways of giving a topology to a space. One method is to axiomatize the notion of convergence (M. Frechet [1], 1906;  87 Convergence). However, defining a topology in terms of either a neighborhood system (due to F. Hausdorff [3], 1914), a clo- sure operation (due to C. Kuratowski, Fund. Math., 3 (1922)), or a family of open sets is more common. B. Definition of a Topology Let X be a set. A neighborhood system for X is a function U that assigns to each point x of X, a family U(x) of subsets of X subject to the fo11owing axioms (U): (1) XE U for each U in U(x). (2) If U 1 , U 2 EU(X), then U 1 n U 2 EU(X). (3) If U EU(X) and U c V, then VEU(X). (4) For each U in U(x), there is a member Wof U(x) sch that U EU(y) for each y in W. A system of open sets for a set X is a family D of subsets of X satisfying the fo11owing axioms (0): 
1607 (1) X, 0ED. (2) IfO,,02ED, then 0 1 n0 2 ED. (3) If0JeED (AEA), then UM,OJeED. A system of closed sets for a space X is a famiJy  of subsets of X satisfying the follow- ing axioms (F): (I) X, 0E. (2) If F 1 ,F2 E, then FI U F 2 E. (3) If Fl E ij (I, E A), then n lEA Fl E . A closure operator for a space X is a func- tion that assigns to each subset A of X, a subset A" of X satisfying the following axioms (C): (1)0"=0 (2) (A U B)"=A"U B". (3) A cA". (4) A"=A"". An interior operator for a space X is a func- tion that assigns to each subset A of X a subset A i of X satisfying the following axioms (I): (1)X i =X. (2) (AnB)i=AinB i . (3) Ai cA. (4) Aii=A i . Anyone of these five structures for a set X, i e., a structure satisfying anyone of (U), (0), (F), (C), or (I), determines the four other struc- tures in a natural way. For instance, assume that a system of open sets D satisfying (0) is given. In this case, each member of D is called an open set. A subset U of X is called a neigh- borhood of a point x in X provided that there is an open set ° such that XE ° cU. If U(x) is the family of all neighborhoods of x, the func- tion x-->U(x) satisfies (U). The complement of an open set in X is called a closed set. The family  of all closed sets satisfies (F). Given a subset A of X, the intersection A" of the family of all closed sets containing A is called the closure of A, and each point of A" is called an adherent point of A. The closure A" is the smallest closed set containing A, and the func- tion A-->A" satisfies (A). The closure A" is also denoted by A or Cl A. Dually, there is a largest open subset A i of A The set A i (also denoted by A J or Int A) is called the interior of A, and each point of A ° is called an interior point of A. The closure and interior are related by AO=X -(X -A) and A =X -(X -A)o. The correspondence A --> A ° satisfies (I). Con- versely, open sets can be characterized vari- ously as follows' A is open = A E U(x) for each x in A =X -AE =(X --Aj =X-A =Ao=A When a structure satisfying (U), (F), (C), or (I) is gi ven, one of the four characteriza tions of open sets can be used to define a system of 425 D Topological Spaces open sets satisfying (0) and hence the other structure. A topological structure or simply a topology for a space X is any of these five structures for X. If two topologies 'I and '2 for X give rise to identical systems of open sets, then 'I and '2 are considered to be identical. For this reason "topology" frequently means simply "system of open sets" in the literature. A topological space is a set X provided with a topology, and is denoted by (X, ,) or simply X when there is no ambiguity. C. Examples (1) Discrete Topology. Let X be a set, and let the system D of open sets be the famiJy of all subsets of X The resulting topology is called the discrete topology, and X with the discrete topology is a discrete topological space. In this space, A = A ° = A for each subset A, and A is a neighborhood of each of its points. (2) Trivial Topology. The trivial (or indiscrete) topology for a set X is defined by the system of open sets which consists of X and 0 only. If A X, then AO= 0, and if k;i 0, then A =X. Each point of X has on]y one neighborhood, X itself. (3) Metric Topology. Let (X, p) be a tmetric space, i.e., a set X provided with a tmetric p. For a positive number F., the F.-neighborhood of a point x is defined to be the set U,(x) = {YIYEX,p(X,y)<F.}. Let U(x) be the family of all sets V such that U,(x) c V for some £; then the assignment x--> U(x) satisfies (U) and hence defines a topology. This topology is the metric topology for the metric space (X, p). (4) Order Topology. Let X be a set tlinearly ordered by . For each point x in X, let U(x) be the family of an subsets U such that XE {y I a <y < b} c U for some a, b. The function x-->U(x) satisfies (U) and defines the order topology for the linearly ordered set X. (5) Convergence and Topology. We can define the notion of convergence in a topological space, and conversely we can define a topology using convergence as a primitive notion ( 87 Convergence). In particular, for a metric space. the metric topology can be defined in terms of convergent sequences ( 273 Metric Spaces) D. Generalized Topological Spaces When a space X is equipped with a closure operator that does not satisfy all of (C), the 
425 E Topological Spaces space is called a generalized topological space by some authors. Topological implications of each axiom in (C) have been investigated for such spaces. E. Local Bases Let X be a topological space, and let x be a point of X. A collection Uo(x) of neighbor- hoods of x is called a base for the neighborhood system (fundamental system of neighborhoods of a point x or local base at x) if each neigh- borhood of x contains a member of Uo(x). Let {Uo(x) I x E X} be a system of local bases; then the system has the following properties (V o ): (I) For each V in Uo(x), XE VeX. (2) If VI, V z E Uo(x), then there is a V 3 in Uo(x) such that V 3 e VI n V z . (3) For each V in Uo(x), there exists aWe V in Uo(x) such that for each y in W, V contains some member of Uo(y). Conversely, suppose that {Uo(X)IXEX} is a system satisfying (V o )' For each x in X, let U(x) consist of all subsets V of X such that V U for some U in Uo(x). Then the sys- tem {U(X)IXEX} satisfies (V) and therefore defines a topology for X. This topology is called the topology determined by the system {Uo(X)IXEX}. For instance, in a metric space X, the set of 8-neighborhoods of X(8 > 0) is a local base at x with respect to the metric topology. In an arbitrary topological space, the collection of all open sets containing x, i.e., the open neigh- borhoods of x, is a local base at x. Two systems satisfying (V o ) are called equivalent if they determine the same topology. For systems {Uo(X)IXEX} and {5D o (X)IXEX} to be equivalent it is necessary and sufficient that for each x in X each member of Uo(x) contain a member of 5Do(x) and each member of 5Do(x) contain a member of Uo(x). Sometimes the word "neighborhood" stands for a member of a local base or for an open neighborhood. However, this convention is not used here. F. Bases and Subbases A family Do of open sets of a topological space X is called a base for the topology (base for the space, or open base) if each open set is the union of a subfamily of Do. A base Do for the topology of a topological space X has the following properties (0 0 ): (1) UDo=X. (2) If WI' W 2 ED o and xE WI n W z , then there is a W 3 in Do such that XE W 3 e WI n W z . Conversely, if a family Do of subsets of a set 1608 X satisfies (00), then Do is a base for a unique topology. A member of Do is called a basic open set. A family .0 00 of open sets of a topological space X is a subbase for the topolo:y (or sub- base for the space) if the family of all finite intersections of members of .0 00 is a base for the topology. If :Doo a subbase for the topol- ogy of a topological space X, then U .0 00 = X. Conversely, if .0 00 is a family of subsets of a set X such that U .0 00 = X, then the family of all finite intersections of members of .0 00 is a base for a unique topology T. A subset of X is open for T if and only if it is the Ull10n of a family of finite intersections of members of .0 00 , The system of open sets relative to , is said to be generated by the family Doo. Thus any family of sets defines a topology for its umon. A set 6 of subsets of a topologic.l1 space is called a network if for each point x and its neighborhood U there is a member FE 6 such that XE Fe U (A. V. Arkhangel'skiJ', 1959). If all FE 3' are required to be open, the network 3' is exactly an open base. G. Continuous Mappings A mapping f on a topological space X into a topological space Y is called continuous at a point a of X if it satisfies one of the following equivalent conditions: (1) For each neighborhood V of f(a), there is a neighborhood U of a such that f(U) e V. (1') For each neighborhood V of f(a), the inverse image f- 1 (V) is a neighborhood of a. (2) For an arbitrary subset A of X such that aE A, .f(a) Ef(A) . Continuity can also be defined in terms of convergence ( 87 Convergence). If f is continuous at each point of X, f is said to be continuous. Continuity or f is equiv- alent to each of the following conditions: (1) For each open subset 0 of Y, the inverse image f- 1 (0) is open in X. (1') The inverse image under f of each member of a subbase for the topology of Y is open in X. (2) For each closed subset F of Y, the inverse image f-1(F) is closed. (3) For each subset A of X,f(A)ef(A). The image f(X) of X under a continuous mapping f is called a continuous image of X. Let X, Y, and Z be topological spaces, and let f: X --> Y and g: Y --> Z be mappings. If f is con- tinuous at a point a of X and 9 is continuous at f(a), then the composite mapping go.f: X -+2 is continuous at the point a. Hence if f and 9 are continuous, so is gO f When a continuous mapping f: X --> Y is 
1609 tbijective and j-1 is continuous, the mapping f is called a homeomorphism (named by H. Poincare, 1895) or topological mapping. Two topological spaces X and Yare homeomorphic, X  Y, if there is a homeomorphism f: X --> Y. The relation of being homeomorphic is an tequivalence relation. A property which, when held by a topological space, is also held by each space homeomorphic to it is a topological property or topological invariant. The problem of deciding whether or not given spaces are homeomorphic is called the homeomorphism problem. A mappingf:X --> Yis called open (resp. closed) if the image under f of each open (resp. closed) subset of X is open (closed) in Y. A continuous bijection that is either open or closed is a homeomorphism. A continuous surjection f: X --> Y is called a quotient mapping if U c Y is open whenever f -I (U) is open ( Section L). If moreover f If- 1 (5) is quotient for each 5 c Y as a map- ping from the subspace ( Section J) f-'(5) onto the subspace 5, then f is called a hered- itarily quotient mapping. Open or closed con- tinuous mappings are hereditarily quotient mappings. H. Comparison of Topologies When a set X is provided with two topologies " and '2 and the identity mapping: (X, '1) -->(X, '2) is continuous, the topology" is said to be stronger (larger or finer) than the topol- ogy '2, '2 is said to be weaker (smaller or coarser) than '1' and the notation, I? '2 or '2  '1 is used. Let Di' 6i, U i , and a i be the system of open sets, system of closed sets, neighborhood system, and closure opera- tion for X relative to the topology 'i (i = 1,2), respectively. Then each of the following is equivalent to the statement" ?'2: (I)DI=>D2' (2) 1=>62' (3) For each x in X, Udx) => U 2 (x). (4) A"' c A"' for each subset A of X. Let 5 be the family of all topologies for X. Then 5 is ordered by the relation ? The discrete topology is the strongest topology for X. If {').I },EA} is a subfamily of 5, then among the topologies stronger than each 'A' there is a weakest one '1 =sup{'AI}.EA}. Similarly, among the topologies weaker than each ,)., there is a strongest one '2 = inf{ 'A I J.E A}. In fact, let D A be the family of all open sets rela- tive to 'A; then the system of open sets for '1 is generated by U ).EA D A' and the system of open sets for '2 is precisely nAEA D).. The family 5 is therefore a tcomplete lattice. 425 K Topological Spaces I. Induced Topology Let f be a mapping from a set X into a topo- logical space Y. Then the family {f-' (0) 10 is open in Y} satisfies axioms (0) and defines a topology for X. This topology is called the topology induced by f (or simply induced topol- ogy), and it is characterized as the weakest one among the topologies for X relative to which the mapping f is continuous. J. Subspaces Let (X, ,) be a topological space and M be a subset of X. The topology for M induced by the inclusion mapping f: M --> X, i.e., the mapping f defined by f(x) = x for each x in M, is called the relativization of, to M or the relative topology. The set M provided with the relative topology is calIed a subspace of the topological space (X, I). Topological terms, when applied to a subspace, are frequently preceded by the adjective "relative" to avoid ambiguity. Thus a relative neighborhood of a point x in M is a set of the form M n U, where U is a neighborhood of x in X. A relatively open (relatively closed) set in M is a set of the form M n A, where A is open (closed) in X. For a subset A of M, the relative closure of A in M is M n A, where A is the closure of A in X. A mapping f: X --> Y is called an embedding if f is a homeomorphism from X to the subspace f(X), and in this case X is said to be embedded into Y. A property P is said to hold locally on a topological space X if each point x of X has a neighborhood U such that the property P holds on the subspace U. A subset A of X is locally closed if for each point x of X, there exists a neighborhood V of x such that vn A is relatively closed in V. A subset of X is 10calIy closed if and only if it can be represented as on F, where 0 is open and F is closed in X. K. Product Spaces Let X be a set, and for each member }, of an index set A, let h. be a mapping of X into a topo 10 gical space X A' Then there is a weakest topology for X that makes each f). continuous. In fact, this topology is sup{,).}, where 'A is the topology for X induced by f).. In particular, let {XAII.EA} be a family of topological spaces, and let X be the Cartesian product nAEAX).. Then the weakest topology for X such that each projection pr A: X --> X A is continuous is called the product topology or weak topology. The Cartesian product DAEAX A equipped with the product topology is calIed the product topological space or simply the product space 
425L Topological Spaces or direct product of the famil y {X I I J. E A} of topological spaces. If D is the family of all open subsets of XI., the union U"pr;I(D I .J is a subbase for the product topology. If x = [x J is a point of X, then sets of the type n'i1 prt(V)= D",'I, ...I. n X " X VI X .., X V n form a local base at x for the product to- pology, where AI,'" ,AnEA and  is a neigh- borhood of XI.}. Each projection pr l : X ......X I . is continuous and open, and a mapping f from a topological space Y into the product space D). X" is continuous if and only if pr" of: Y ...... X I. is continuous for each A. Given a family { I;) of continuous mappings fl.: Xl....... Y", the product mapping D".f,,: 0 1 . X"...... D" Y" is con- tinuous with respect to the product topologies. For the Cartesian product DI. X" of a family {X IV E A} of topological spaces, there is an- other topology called the box topology (or strong topology). A base for the box topology is the family of all sets D" 0 1 ., where 0" is open in X" for each A. For a point x= {x,,}, the family of all sets of the form D" VI. is a local base at x relative to the box topology, where VI. is a neighborhood of XI. for each i.. With respect to the box topology, each projection prl.:D"X).......X" is continuous and open, and the product mapping D/ I : DI. X"...... D" Y" of a family {j;.} of continuous mappings f,,: X"...... 1";. is contlllUOUS. For a finite product of topolog- ical spaces, the product topology agrees with the box topology, but for an arbitrary product the product topology is weaker than the box topology. For the Cartesian product of to po- logical spaces the usual topology considered is the product topology rather than the box topology. L. Quotient Spaces Let f be a mapping of a topological space X onto a set Y. The quotient topology for Y (relative to the mapping f) is the strongest topology for Y such that f is continuous. A subset 0 of Y is open relative to the quotient topology if and only if I -I (0) is open. Given an equivalence relation  on a topological space X, the tquotient set Y = XI  provided with the quotient topology relative to the' projection <p: X...... Y is called the quotient topo- logical space (or simply quotient space). A mapping {from the quotient space Y = X I  into a topological space is continuous if and only if fa <p is continuous. A partition of a space X is a family {A) I icE A} of pairwbe disjoint subsets of X such that U I AI. = X. A partition {A I.} of a topological space X determllles an equivalence relation  on X such that the family {AI} is precisely 1610 the family of all equivalence classes under , and therefore the partition determines the quotient space Y = X I . This spac:e is called the identification space of X by the given par- tition. Each member A" of the partition can be regarded as a point of Y, and the projec- tion (p: X...... Y satisfies <p(x) = AI. whenever XE AI' A partition {A"li.EA} of a topological space is called upper semicontinuous if for each A" and each open set V containing A", there is an open set V such that A, eVe V, and V is the union of members of [A" I i.EA}. A partition {AI.1 i.E A} is upper sernicontinu- ous if and only if the projection <p X...... Y = {A" I AEA} is a closed mapping. M. Topological Sums Let X be a set, and for each member A of an index set A, let f" be a mapping of a topo- logical space X" to X. Then the family {O c X I f,,-1 (0) is open for any i.} satisfies the axioms of the open sets. This topology T is characterized as the strongest one for X that makes each .f" continuous. A mapping g on X with T to a topological space Y is continuous if and only if g a f,,: Xl....... Y is contin 1I0US for each i. E A. The simplest is the case where X is the disjoint union of X" and f" is the mclusion mapping. Then we call the topological space X the direct sum or the topological sum of {X,,} and denote it by EJjX" or lJx l .. More gener- ally let the set X be the union of topological spaces {XI LeA such that for each ic and fJ E A the relative topologies of X" n X p from XI. and X p coincide. Then we call the topology T the weak topology with respect to {X j. If X" n X p is closed (resp. open) in X p for any fJ, then X" is closed (resp. open) in X and th( original topology of XI. coincides with the relative topology. If, moreover, for each subset f of A, F= U"erX" is closed and the weak topology of F with respect to {X}.} ler coincides with the relative topology induced by T, then X with T is said to have the hereditarily weak topology with respect to {X,,} (or to be dominated by {X},}). A topological space has the hereditarily weak topology with respect to any locally finite closed covering, and every CW -complex ( 70 Complexes) has the hereditarily weak topology with respect to the covering of all finite subcomplexes. When {X n } is an increasing sequence of topological spaces such that each X n is a sub- space of Xn+l' then the union X == U X n with the weak topology is called the inductive limit of {X n } and is denoted by limXn. Each X n may again be regarded as a subace of X. 
1611 N. Haire Spaces For a subset A of a topological space X, the set X - A is called the exterior of A, and the set An X - A is called the boundary of A, de- noted by HdA, Fr A, or cA. A point belonging to the exterior (boundary) of A is an exterior point (boundary point or frontier point) of A. If the closure of A is X, then A is said to be dense in X. When X - A is dense in X, i.e., when the interior of A is empty, A is called a boundary set (or border set), and if the closure A is a boundary set, A is said to be nowhere dense. The union of a countable family of nowhere dense sets is called a set of the first category (or meager set). A set that is not of the first category is called a set of the second category (or nonmeager set). The complement of a set of the first category is called a residual set. In the space R of real numbers, the set Q of all ra- tional numbers is of the first category, and the set R-Q of all irrational numbers is ufthe second category. Hoth Q and R - Q are dense in X and hence are boundary sets. The union of a finite family of nowhere dense sets is no- where dense, and the union of a countable family of sets of the first category is also of the first category. A subset A of X is nowhere dense in X if and only if for each open set 0, on A is not dense in O. A topological space X is called a Haire space (Haire, 1899) if each subset of X of the first category has an empty interior. Each of the following conditions is necessary and sufficient for a space X to be a Haire space: (1) Each nonempty open su bset of X is of the second category. (2) If F1' F2' '" is a sequence of closed subsets of X such that the union U1 Fn has an inte- rior point, then at least one Fn has an interior point. (3) If OJ, O 2 ,,,, is a sequence of dense open subsets of X, then the intersection n'l On is dense in X. An open subset of a Baire space is a Haire space for the relative topology. A topological space that is homeomorphic to a complete metric space ( 436 Uniform Spaces I) is a Haire space (Haire-Hausdorff theorem). A locally compact Hausdorff space ( Section V) is also a Haire space. The class of Cech- complete completely regular spaces ( Section T) includes both of these spaces, but there are also Baire spaces that are not in the class. A subset A of a topological space is said to sat- isfy Haire's condition or to have the Haire property if there exist an open set 0 and sets PI' P 2 of the first category such that A = (0 U Pd - P 2 . A tBorel set satisfies Haire's condition. 425P Topological Spaces O. Accumulation Points A point x is called an accumulation point, or a cluster point of a subset A of a topological space X if x E -A - {x} . The set of all accum ula- tion points of a set A is called the derived set of A and is denoted by A' or A d. A point x belongs to A' if and only if each neighborhood of x contains a point of A other than x itself. A point belonging to the set AS = A - A' is called an isolated point of A, and a set A consisting of isolated points only, i.e., A = A s, is said to be discrete. If each nonempty subset of A contains an isolated point, then A is said to be scat- tered; and if A does not possess an isolated point, i.e., A c A', then A is said to be dense in itself. The largest subset of A which is dense in itself is called the kernel of A. If A = A', then A is called a perfect set. If x is an accumulation point of A, then for each neighborhood U of x, un (A - {x}) =F 0. Furthermore, it is possible to classify an ac- cumulation point of A according to the tcar- dinality of un (A - {x} ). A point x is called a condensation point of a set A if for each neigh- borhood U of x, the set UnA is uncountable. A point x is a complete accumulation point of A if for each neighborhood U of x, the set unA has the same cardinality as A. P. Count ability Axioms A topological space X satisfies the first count- ability axiom if each point x of X has a coun- table local base (F. Hausdorff [3J). Metric spaces satisfy the first countability axiom. In fact, the family of (l/n)-neighborhoods (n= 1,2, ...) of a point is a local base of the point. The topology of a topological space that satisfies the first countability axiom is com- pletely determined by convergent sequences. F or instance, the closure of a subset A of such a space consists of all limits of sequences in A ( 87 Convergence). A topological space X is said to satisfy the second countability axiom or to be perfectly separable if there is a countable base for the topology. tEuclidean spaces satisfy the second countability axiom. If X contains a countable dense subset, X is said to be sepa- rable. A space that satisfies the second count- ability axiom satisfies the first and is also a separable Lindelof space ( Section S). How- ever, the converse is not true. Each of the following properties is independent of the others: separability, the first countability axiom, and the Lindelof property. If a metric space is separable, then it satisfies the second countability axiom. There are metric spaces that are not separable. 
425 Q Topological Spaces Q. Separation Axioms Topological spaces that are commonly en- countered usually satisfy some of the following separation axioms. (To) Kolmogorov's axiom. For each pair of distinct points, there is a neighborhood of one point of the pair that does not contain the other. (1',) The first separation axiom or Frechet's axiom. For each pair x, y of distinct points, there are neighborhoods U of x and V of y such that x rt V and yrt U. Axiom (1',) can be restated as follows: (T',) For each point x of the space, the sin- gleton {x} is closed. (1'2) The second separation axiom or Haus- dorff's axiom [3]. For each pair x, y of dis- tinct points of the space X, there exist disjoint neighborhoods of x and y. Axiom (1'2) is equivalent to the following: (T) In the product space X x X the diago- nal set Li is closed. (1'3) The third separation axiom or Vietoris's axiom (Monatsh. Math. Phys., 31 (1921)). Given a point x and a subset A such that x rt A, there exist disjoint open sets 0, and 02 such that XEO, and Ac0 2 . (In this case, the sets {x} and A are said to be separated by open sets.) Axiom (1'3) can be restated as (T) or (1';): (T) For each point x of the space, there is a local base at x consisting of closed neighbor- hoods of x. (1';) An arbitrary closed set and a point not belonging to it can be separated by open sets. (1'4) The fourth separation axiom or Tietze's first axiom (Math. Ann., 88 (1923)). Two dis- joint closed sets FI and F 2 can be separated by open sets, i.e., there exist disjoint open sets 0 1 and O 2 such that FI cO I and F 2 c 0 2 . (1'5) Tietze's second axiom. Whenever two subsets A, and A 2 satisfy A I n A 2 = Al n A 2 = 0, Al and A 2 can be separated by open sets. It is easily seen that (1'5)=>(1'4)' (To) and (1'3)=>(1'2), (1'4) and (1'1)=>(1'3)' Axiom (1'4) is equivalent to each of (T) and (T): (T) Whenever FI and F 2 are disjoint closed subsets, there exists a continuous function f on the space into the interval [0, IJ such that f is identically 0 on FI and 1 on F 2 . (T) Each real-valued continuous function defined on a closed subspace can be extended to a real-valued continuous function on the entire space. The implications (1'4) => (T) and (1'4) => (T) are known as Uryson's lemma (Math. Ann., 94 (1925)) and the Tietze extension theorem (1. Rei/Ie Angew. Math., 145 (1915)), respec- tively. In addition, there are two more related aXIOms: 1612 (1'3\) Tikhonov's separation axiom. For each closed subset F and each point x not in F, there is a real-valued continuous function f on the space such that f(x) = 0 and f is identically Ion F. (1'6) (N. Vedenisov). For each closed subset F, there is a real-valued continuous function f on the space such that F = {x I f(x) = O}. Axioms (1'5) and (1'6) are equivalent to the following (T) and (T), respectively: (T) Each subs pace satisfies (1'4) (1'6) X satisfies (1'4) and each closed set is a tG.-set. The following implications are valid: (1'3\) => (1'3)' (1'6)=>(1'5)' (1'4) and (T,)=>(1'3})' Table 1 gives a classification of topological spaces by the separation axioms. Each line represents a special case of the preceding line. A tmetrizable space is perfectly normal, but the converse is false (for metrization theorems  273 Metric Spaces). Among the spaces satisfying the second countability axiom, regular spaces are normal (Tikhonov's theo- rem, Math. Ann., 95 (1925)) and metrizable (Tikhonov-Uryson theorem; P. Uryson, Math. Ann., 94 (1925)). Table 2 shows whether various topological properties are preserved in subspaces, product spaces, and quotient spaces. The topological properties considered are 1'1,1'2 = Hausdorff, 1'3 = regular, CR = completely regular, 1'4 = normal, 1'5 = completely normal, M = metriz- able, C I = first axiom of countability, Cll = second axiom of countability, C = compact, S = separable, and L = LindelOf. Each position is filled with 0 or x according as the prop- erty (say, P) listed at the head of the column is preserved or not in the sort of space listed on the left obtained from space(s) all having property P. R. Coverings A family 9.1/ = {M;J AEA of subsets of a set X is called a covering of a subset A of X if A c U;.M).. If9J1 is finite (countable), it is called a finite covering (countable covering). An open (closed) covering is a covering consisting of open (closed) sets. A family 9.11 of subsets of a topological space X is said to be locally finite if for each point x of X, there is a neighborhood of x which inter- sects only a finite number of members of9J1. If moreover { M A } ;.EA is disjoint, then 9J1 is called discrete. 9J1 is called star-finite if each mem- ber of 9J/ intersects only a finite number of members of 9J1. A a-locally finite or a-discrete family of subsets of X is respectively the union of a countable number of locally finite or dis- crete families of subsets of X. A covering 9J1 
1613 425S Topological Spaces Table 1. Separation Axioms Axioms (To) (Td (T 2 ) (To) and (T 3) (Td and (T 3j ) (Td and (T 4 ) (Td and (T s ) (Td and (T b ) Spaces Satisfying the Axioms To-space (Kolmogorov space) Tj-space (Kuratowski space) T 2 -space (Hausdorff space, separated space) T 3 -space (regular space) Completely regular space (Tikhonov space) T 4-space (normal space) Ts-space (completely normal space, hereditarily normal space) T 6-space (perfectly normal space) Table 2. Topological Properties and Spaces Space T j Tz T3 CR Subspace 0 0 0 0 Closed subspace 0 0 0 0 Open subspace 0 0 0 0 Product 0 0 0 0 Countable product 0 0 0 0 Quotient space X X X X T4 Ts M C j C n C S L X 0 0 0 0 X X X 0 0 0 0 0 0 X 0 X 0 0 0 0 X 0 X X X X X X 0 X X X X 0 0 0 0 0 X X X X X X 0 0 0 is called point-finite if each infinite number of members of WI has an empty intersection. A covering 9.11 is a refinement of a covering \H (written WI-(91) if each member of 9Jl is con- tained in a member of 91. The order of the covering 9.11 is the least integer r such that any subfamily of 9Jl consisting of r + I members has an empty intersection. Let 9.11 be a covering of X, and let A be a subset of X. The star of A relative to WI, de- noted by S(A, WI), is the union of all members of WI whose intersection with A is nonempty. Let WI'" denote the family {S( {x}, 9.11)} XEX and 9)1* the family {S(M, WI)) MEJI' Then Will. and WI* are coverings of X, and 9.11-<9.11"'-<9'Jl*-< 9.11 M . A covering 9Jl is a star refinement of a covering 9'1 if 9.11* -< 91, and 9.11 is a barycentric refinement (or d-refinement) of 9'1 if 9.111'.-<91. A sequence 9.11 1 ,9.11 2 , .., of open coverings of a topological space is called a normal sequence if 9.11+j -<Win for n = 1,2, ..., and an open covering 9.11 is said to be a normal covering if there is a normal sequence \JJ1 1 , 9.11 2 , .., such that 9)11 -<WI. The support (or carrier) of a real-valued function.f on a topological space X is defined to be the closure of the set [x l.f(x) ?"cO}. Let {f,LEA be a family of con- tinuous nonnegative real-valued functions on a topological space X, and for each (J. in A, let C, be the support of f. The family { {,LEA is called a partition of unity if the family {C'}EA is locally finite and L.r.(x) = 1 for each x in X. If the covering {C'}OEA is a refinement of a covering WI, the family { {oLEA is called a par- tition of unity subordinate to the covering 9.11. A partition of unity subordinate to a covering 9.11 exists only if 1))1 is a normal covering ( Sec- tion X). If p is a continuous tpseudometric on a T 1 -space X, then define a covering M n for each natural number n by M n = {U(x; 2 -n)}XEX, where U(x;E)={ylp(x,y)<E}. Then the se- quence 9'Jl 1 , 9.11 2 , .., is a normal sequence of open coverings. Conversely, given a normal sequence 9'Jlj, 9']12, ... of open coverings of X, there exists a continuous pseudometric p such that p(x,y)rn whenever XES(y, 9.11 n ), and p(x,y)2-n-j whenever x rtS(y, Win). Ifin addition for each x the family {S (x, 9]1n) I n = 1,2, ... } is a local base at x, then the metric topology of p agrees with the topology of X. s. Compactness If each open covering of a topological space X admits a finite open covering as its refinement, the space X is called compact; if each open covering of X admits a countable open refine- ment, X is said to be a LindelOf space (P. Alek- sandrov and P. Uryson, Verh. Akad. Wetensch., Amsterdam, 19 (1929)); if each open covering of X admits a locally finite open refinement, X is called paracompact (J. Dieudonne, J. Math. Pures Appl., 23 (1944)); and if each open cover- ing of X admits a star-finite open refinement, X is said to be strongly paracompact (c. H. Dowker, Amer. J. Math., 69 (1947)) or to have the star-finite property (K. Morita, Math. J aponicae, 1 (1948)). The space X is compact (Lindelof) iffor each open covering WI of X, there is a finite (countable) subfamily of 9']1 whose union is X. The following properties for a topological space X are equivalent: (1) The space X is compact. (2) If a family {F)} AEA of closed sub- sets of X has the finite intersection property, i.e., each finite subfamily of {F)} ).EA has non- empty intersection, then n) F)?"c 0. (3) Each 
425 T Topological Spaces infinite subset of X has a complete accumu- lation point. (4) Each tnet has a convergent tsubnet. (5) Each tuniversal net and each tultrafilter converge. If a subset A of X is compact for the relative topology, A is called a compact subset. A subset A of X is said to be relatively compact if the closure of A in X is a compact subset. A closed subset of a compact topological space is compact, and a compact subset of a Hausdorff space is closed. A continuous image of a com- pact space is compact, each continuous map- ping of a compact space into a Hausdorff space is a closed mapping, and a continuous bijec- tion of a compact space onto a Hausdorff space is a homeomorphism. The product space of a family {X iJ ),eA of topological spaces is compact if and only if each factor space is compact (Tikhonov's product theorem, Math. Ann., 102 (1930)). A compact Hausdorff space is normal. A compact Hausdorff space is metrizable if and only if it satisfies the second countability axiom. A metric space or a tuni- form space is compact if and only if it is ttotally bounded and tcomplete. A subset of a Euclidean space is compact if and only if it is closed and bounded. In a discrete space only finite subsets are compact. The cardinality of a compact Hausdorff space with the first count- ability axiom cannot exceed the power of the continuum (Arkhangel'skiI). There are a number of conditions related to compactness. A topological space is sequenti- ally compact if each sequence in X has a con- vergent subsequence. A space X is countably compact (M. Frechet [IJ) if each countable open covering of X contains a finite subfamily that covers X. A space X is pseudocompact (E. Hewitt, 1948) if each continuous real-valued function on X is bounded. Some authors use compact and hicompact for what we call coun- tably compact and compact, respectively. N. Bourbaki [9J uses compact and quasicompact instead of compact Hausdorff and compact, respectively. A T 1 -space is countably compact if and only if each infinite set possesses an accumulation point. If X is countably com- pact, then X is pseudocompact, and if X is normal, the converse also holds. If a tcomplete uniform space is pseudocompact, then it is compact. A space satisfying the second counta- bility axiom is compact if and only if it is sequentially compact. If X is sequentially compact, then X is countably compact, and if X satisfies the first countability axiom, the converse is true. T. Compactification A compactification of a topological space X consists of a compact space Yand a homeo- 1614 morphism of X onto a dense subs pace XI of Y. We can always regard X as a dense subspace of a compactification Y. If X is completely regular, then there is a Hausdorff com pac- tification Y such that each bounded real- valued continuous function on X can be extended continuously to Y. Such a compacti- fication is unique up to homeomorphism; it is called the Stone-tech compactification of X (E. tech, Ann. Math., 38 (1937); M. H. Stone, Trans. Amer. Math. Sac., 41 (1937)) and is denoted by {3(X). Let {f),} iEA be the set of all continuous functions on a completely regular space X into the closed interval j = [0,1]. Then a continuous mapping cp of X into a parallelotope j A = n), j), (Ii = 1) is defined by (p(x) = {f),(x)} ),EA, and the mapping cp is a homeomorphism of X onto the subspace cp(X) of jA (Tikhonov's embedding theorem, Math. Ann., 102 (1930)). The closure cp(;q of cp(X) in jA is the Stone-tech compactification of X. The natural mapping fi(X I x X 2 )-->fi(XrJ x {3(X 2 ) is a homeomorphism if and only if X, x X 2 is pseudocompact (I. Glicksbe'fg, 1959). For a topological space X, let 'XJ be a point not in X, and define a topology on the union XU { 00 } as follows: A subset U of XU { 00 } is open if and only if either oc =I- U and U is open in X, or 00 E U and X - U is a compact closed subset of X. The topological space XU {oo} thus obtained is compact, and if X is not already compact, the space XU {XJ } is a com- pactification of X called the one-I)Oint com- pactification of X (P. S. Aleksandrov, C. R. Acad. Sci. Paris, 178 (1924)). The one-point compactification of a Hausdorff ,pace is not necessarily Hausdorff. The one-point compac- tification of the n-dimensional Euclidean space R" is homeomorphic to the n-dimensional sphere sn. A completely regular space X is a tG.-set in the Stone-tech compactification fi(X) if and only if it is a G.-set in any Hausdorff space Y which contains X as a dense subspace. Then X is said to be tech-complete. U. Absolutely Closed Spaces A Hausdorff space X is said to be absolutely closed (or H-c1osed; P. Aleksandrov and P. Uryson, 1929) if X is closed in each Hausdorff space containing it. A compact Hausdorff space is absolutely closed. A Hausdorff space is absolutely closed if and only if for each open covering {N),} ),EA of X, there is a finite sub- family of {N),} ),EA that covers X. The product space of a family of absolutely closed spaces is absolutely closed. Each Hausdorff space is a dense subset of an absolutely closed space. Similarly, a regular space X is said to be r- 
1615 closed if X is closed in each regular space containing it (N. Weinberg, 1941). V. Locally Compact Spaces A topological space X is said to be locally compact if each point of X has a compact neighborhood (P. Aleksandrov and P. Uryson, 1929). A tuniform space X is said to be uni- .formly locally compact if there is a member V of the +uniformity such that V (x) is compact for each x in X ( 436 Uniform Spaces). A noncompact space X is locally compact and Hausdorff if and only if the one-point com- pactification of X is Hausdorff, and this is the case if and only if X is homeomorphic to an open subset of a compact Hausdorff space. A locally compact Hausdorff space is completely regular, and for each point of the space, the family of all of its compact neighborhods forms a local base at the point. A locally closed, hence open or closed, subset of a lo- cally compact Hausdorff space is also locally compact for the relative topology. If a sub- space A of a Hausdorff space X is locally compact, then A is a locally closed subset of X. The Euclidean space Rn is locally compact, and hence each locally Euclidean space, i.e., a space such that each point admits a neighbor- hood homeomorphic to a Euclidean space, is locally compact. A topological space is called a-compact if it can be expressed as the union of at most countably many compact subsets. W. Proper (Perfect) Mappings A mapping f of a topological space X into a topological space Y is said to be proper (N. Bourbaki [9J) (or perfect [14J) if it is con- tinuous and for each topological space Z, the mapping f x I: X x Z ---> Y x Z is closed, where (f x 1)(x,z)=(f"(x),z). A continuous mapping J: X ---> Y is proper if and only if it is closed and f-' ({y}) is compact for each y in Y. Another necessary and sufficient condition is that if {X7}[ is a tnet in X such that its image {.f(x a )} converges to YE Y, then a subnet of {xa) con- verges to an XEf-'(Y) in X. A continuous mapping of a compact space into a Hausdorff space is always proper. For a compact Haus- dorff space X, a quotient space Y is Hausdorff if and only if the canonical projection lP: X---> Y is proper. F or a continuous mapping f of a locally compact Hausdorff space X into a locally compact Hausdorff space Y, the following three conditions are equivalent: (1) f is proper. (2) For each compact subset K of Y, the in- verse image r l (K) is compact. (3) If XU: x f } 425 X Topological Spaces and YU (y f ) are the one-point compactifi- cations of X and Y, then the extension fl of f such that f, (x x) = Y f is continuous. The composition of two proper mappings is proper and the direct product of an arbitrary number of proper mappings is proper. X. Paracompact Hausdorff Spaces A paracompact Hausdorff space (often called simply a paracompact space) is normal. For a Hausdorff space X, the following five con- ditions are equivalent: (1) X is paracompact. (2) X is fully normal (1. W. Tukey [8J), i.e., each open covering of X admits an open barycentric refinement. (3) Each open covering has a partition of unity subordinate to it. (4) Each open covering is refined by a closed co veri ng {Fa I a E A} that is closure-preserving, i.e., U{F 1i lf3EB} is closed for each Bc A. (5) Each open covering {Va I a E A) has a cushioned refinement {ValaEA}, i.e., CI(U {v(dfJEBj)C U [ V p I Ii E B) for each B cA. The implication (1)-(2) is Oieudonne's theorem. The implication (2) --->( I) is A. H. Stone's theorem (1948), from which it follows that each metric space is para- compact. The implications (5)--->(4)--->(1) is Michael's theorem (1959, 1957). For normal spaces, the following weaker versions of (2) and (3) hold: A TI-space X is normal if and only if each finite open covering of X admits a finite open star refinement (or finite open barycentric refinement). For each locally finite open covering of a normal space, there is a partition of unity subordinate to it. F or a regular space X the following three conditions are equivalent: (1) X is paracom- pact. (2) Each open covering of X is refined by a a-discrete open covering. (3) Each open covering of X is refined by a a-locally finite open covering. Tamano's product theorem: For a completely regular space X to be paracom- pact it is necessary and sufficient that X x f3(X) be normal (1960). For a i'connected locally compact space X, the following conditions are equivalent: (1) X is paracompact. (2) X is a-compact. (3) In the one-point compactification XU {C1J}, the point C1J admits a countable local base. (4) There is a locally finite open covering { V J . ] AEA of X such that OA is compact for each l. (5) X is the union of a sequence (V n } of open sets such that Un is compact and UncV n +, (n=I,2,...). (6) X is strongly paracompact. Every t Fa-set of a paracompact Hausdorff space is paracompact (Michael, 1953). When a T [-space X has the hereditarily weak topology with respect to a closed covering {FA}' then X is paracompact Hausdorff (normal, completely normal or perfectly normal) if and only if each 
425 Y Topological Spaces F). is (Morita, 1954; Michael, 1956). In par- ticular, every CW-complex is paracompact and perfectly normal (Morita, 1953). Y. Normality and Paracompactness of Direct Products A topological space X is discrete if X x Y is normal for any normal space Y (M. Atsuji and M. Rudin, 1978). There are a paracompact Lindel6f space X and a separable metric space Y such that the product X x Y is not normal (Michael, 1963). The following are conditions under which the products are normal or paracompact. Let m be an infinite tcardinal number. A topological space X is called m- paracompact if every open covering consisting of at most m open sets admits a locally finite open covering as its refinement. When 111 is countable, it is called countably paracompact. If X has an open base of at most m mem bers, m-paracompact means paracompact. The following conditions are equivalent for a topo- logical space X: (1) X is normal and count- ably paracompact; (2) The product X x Y is normal and countably paracompact for any compact metric space Y; (3) X x I is normal, where 1=[0, IJ (c. H. Dowker, 1951). Rudin (1971) constructed an example of a collection- wise normal space ( Section AA) that is not countably paracompact. When m is general the following conditions are equivalent: (1) X is normal and m-paracompact; (2) If Y is a compact Hausdorff space with an open base consisting of at most m sets, then X x Y is normal and m-paracompact; (3) X x 1 1n is normal; (4) X x {O, I }"' is normal (Morita, 1961). I n particular, the product X x Yof a paracompact Hausdorff space X and a compact Hausdorff space Y is paracompact (Dieudonne, 1944). A topological space X is called a P-space if it satisfies the following conditions: Let Q be an arbitrary set and {G(1X 1 , ..., IX;) 11X 1 , '" IXjEQ, i = 1,2, ... } be a family of open sets such that G(IX"... ,IX;)C G(1X 1 ,..., IXj, IX j +1)' Then there is a family of closed sets {F(IXI' "', IX;) IIX" ..., IXjEQ, i= 1,2,...) such that F(IX" ...,IX;)cG(1X 1 , ... , IX;) and that if U;-;, G(1X 1 , ..., IX;) = X for a se- quence {IX;}, then Ui";,l F(1X 1 , "', IX;) = X. Per- fectly normal spaces, countably compact spaces, Cech-complete paracompact spaces and a-compact regular spaces are P-spaces. Normal P-spaces are countably paracom- pact. A Hausdorff space X is a normal (resp. paracompact) P-space if and only if the prod- uct X x Y is normal (resp. paracompact) for any metric space Y (Morita, Math. Ann., 154 1964). 1616 The product X x Y of locally compact Haus- dorff spaces X and Y is a locally compact Hausdorff space. If, in this case, X and Yare paracompact, then so is the product. If the direct product space n). X;. of metric spaces is normal, then X;. are compact exc'pt for at most countably many;., and hence the prod- uct space is paracompact (A. H. Stone, 1948). A class ((j of topological spaces is called countably productive if for a sequence X j of members of((j their product nX, is again a member of ((j. The classes of (complete) (sepa- rable) metric spaces form such examples. The class of paracompact and Cech-complete spaces is countably productive (Z. Frolik, 1960). A topological space X is called a p- space if it is completely regular and there is a sequence Wl j of families of open sets in the Stone-Cech compactification {3(X) such that, for each point XEX, XE n S(x, \D!j) cX (Arkhangel'skii, 1963). X is called an M-space if there is a normal sequence !JJl j of open coverings of X such that if K 1 :::J K 2 :::J .., is a sequence of nonempty closed sets and K j C S(x, WI;), i= 1,2,..., for an XEX, then n K j * o (Morita, 1963). The class of paracompact p-spaces and that of paracompact Hausdorff M-spaces are the same and are count ably pro- ductive. For a covering  of X and an XEX we set C(x, ty)= n{FlxEFEty). X is called a I-space if X admits a sequence tyj of locally finite closed coverings such that if K , :::J K 2 :::J ... is a seq uence of nonem pty closed sets and KjcC(x, ty;), i= 1,2, "', for an XE'X, then nKj*0 (K. Nagami, 1969). I-spaces are P-spaces. The class of all paracompact I- spaces is also countably productive. Among the above classes each one is always wider than its predecessors. Yet the product X x Yof a paracompact Hausdorff P-space X and a paracompact Hausdorff I-space Y is paracom- pact. Other examples of countably productive classes are the Suslin spaces and the Luzin spaces ( Section CC) introduced by Bour- baki (1958), the stratifiable spaces by J. G. Ceder (1961) and C. J. R. Borges (1966), the o- spaces by Michael (1966) and the a-spaces by A. Okuyama (1967). Z. Strongly Paracompact Spaces Regular Linde16f spaces are strongly paracom- pact. Conversely, if a connected regular space is strongly paracompact, then it is a Lindel6f space (Morita, 1948). Hence a connected non- separable metric space is not strongly para- compact. Paracompact locally compact Haus- dorff spaces and uniformly locally compact Hausdorff spaces are strongly paracompact. 
1617 These classes of spaces coincide under suitable tuniform structures. AA. Collectionwise Normal Spaces A Hausdorff space X is called a collectionwise normal space iffor each discrete collection {Fa IIX E A} of closed sets of X there exists a disjoint collection {VxIIXE A} of open sets with Fxc V x (IXEA) (R. H. Bing, 1951). If X satisfies an analogous condition for the case where each Fa is a singleton, X is called a collection- wise Hausdorff space. Paracompact Hausdorff spaces are collectionwise normal (Bing). Every point-finite open covering of a collectionwise normal space has a locally finite open refine- ment (Michael, Nagami). A topological space X is called a developable space if it admits a sequence U i , i = 1,2, ... , of open coverings such that, for each point x EX, {S(x, U i ) I i = 1,2, .., } forms a base for the neighborhood system of x (R. L. Moore, 1916). A regular developable space is called a Moore space. The question of whether or not every normal Moore space is metrizable is known as the normal Moore space problem ( 273 Metric Spaces K). Collectionwise normal Moore spaces are metrizable (Bing). BB. Real-Compact Spaces A completely regular space X is called real- compact if X is complete under the smallest tuniformity such that each continuous real- valued function on X is uniformly continuous ( 422 Uniform Spaces). This notion was introduced by E. Hewitt (Trans. Amer. Math. Soc., 64 (1948)) under the name of Q-space, and independently by L. Nachbin (Proc. Inter- national Congress of Mathematicians, Cam- bridge, Mass., 1950). A Lindelo[ space is real-compact. If X 1 and X 2 are real-compact spaces such that the rings C(X 1 ) and C(X 2 ) of continuous real-valued functions on XI and X 2 are isomorphic, then XI and X 2 are homeomorphic (Hewitt). If X is real-compact, then X is homeomorphic to a closed subspace of the product space of copies of the space of real numbers, and conversely. Cc. Images and Inverse Images of Topological Spaces Each continuous mapping f: X -> Y is decom- posed into the product i 0 hop of continuous mappings p: X ->X/, h:X/  ->f(X) and i: f(X)-> Y, where'" is the equivalence relation such that Xl X2 if and only if f(xd= f(x 2 ). 425 CC Topological Spaces The mapping f is open (resp. closed) if and only if these mappings are all open (resp. closed). Then h is a homeomorphism. The image of a paracompact Hausdorff space under a closed continuous mapping is para- compact (Michael, 1957). Let f: X -> Y be a perfect surjection. Then Y is called a perfect image of X and X a perfect inverse image of Y. If, in this case, one of X and Y satisfies a property such as being com- pact, locally compact, a-compact, Lindelof, or count ably compact, then the other also satis- fies the property. When X and Yare com- pletely regular, the same is true with regard to Cech completeness. Properties such as regularity, normality, complete normality, perfect normality, and the second countability axiom are preserved in perfect images; but complete regularity and strong paracompact- ness are not. Perfect images of metric spaces are also metrizable (S. Hanai and Morita, A. H. Stone, 1956). Conversely, perfect inverse images of paracompact spaces are paracom- pact. If a Hausdorff space is a perfect inverse image of a regular space (resp. k-space;  below), then it is a regular space (resp. k- space). Every paracompact Cech-complete space is a perfect inverse image of a tcomplete metric space (Z. Frolik, 1961). A completely regular space is a paracompact p-space if and only if it is a perfect inverse image of a metric space (Arkhangel'skii, 1963). A mapping f: X -> Y is called quasi-perfect if it is closed and continuous and the inverse image .r- 1 (y) of each point yE Y is countably compact. A topo- logical space X is an M-space if and only if there is a quasi-perfect mapping from X onto a metric space Y (Morita, 1964). Let f: X -> Y be a quasi-perfect surjection. If one of X and Y is a .E-space, then the other is also a .E-space (Nagami,1969). A topological space X is called a Frechet- Uryson space (or a Frechet space) if the closure of an arbitrary set A c X is the set of a1l1imits of sequences in A (Arkhangel'skiI, 1963). X is called a sequential space if A c X is closed when- ever A contains all the limits of sequences in A (S. P. Franklin, 1965). X is called a k'-space if the closure of an arbitrary set A is the set of all points adherent to the intersection An K for a compact set K in X (Arkhangel'skii, 1963). X is called a k-space if A c X is closed whenever An K is closed in K for any compact set K ( Arkhangel'skii, Trudy Moskov. Mat. Obshch., 13 (1965)). Spaces satisfying the first countability axiom are Frechet-Uryson spaces. The Frechet-Uryson spaces (resp. sequential spaces) are characterized as the images under hereditarily quotient (resp. quotient) mappings of metric spaces or locally compact metric 
425 Ref. Topological Spaces spaces. Similarly the k'-spaces (resp. k-spaces) coincide with the images under hereditarily quotient (resp. quotient) mappings oflocally compact spaces. The image of a metric space under a closed continuous mapping is called a Lashnev space. Any subspace of a Frechet- Uryson space is a Frechet-Uryson space. Con- versely, a Hausdorff space is a Frechet-Uryson space if any of its subspaces is a k-space. Cech- complete spaces are k-spaces. A Hausdorff space is called a Suslin space (resp. Luzin space) if it is the image under a continuous surjection (resp. continuous bijection) of a complete separable metric space (Bourbaki [9]; also  22 Analytic Sets). In Figs. 1, 2, and 3, the relationships be- Fig. 1 Fig. 2 1618 tween the various properties are indicated by the arrows. Fig. 3 References [1] M. Frechet, Sur quelques points du ca1cul fonctionnel, Rend. Circ. Mat. Palermo, 22 (1906), 1-74. [2] F. Riesz, Stetigkeitsbegriff und abstrakte Mengenlehre, Atti del IV Congr. Intern. Mat. Roma 1908, II, 18-24, Rome, 1909. [3] F. Hausdorff, Grundzi.ige der Mengen- lehre, Teubner, 1914. [4] M. Frechet, Les espaces abstraits, Gauthier-Villars, 1928. [5] C. Kuratowski, Topologie, I, Monograf. Mat., I, 1933, revised edition, 1948; II, 1950. [6] W. Sierpinski, Introduction to general topology, University of Toronto Press, 1934. [7] P. S. Aleksandrov (Alexandroff) and H. Hopf, Topologie I, Springer, 1935. [8] J. W. Tukey, Convergence and uniformity in topology, Ann. Math. Studies, Princeton Univ. Press, 1940. [9] N. Bourbaki, Elements de mathematique, III. Topologie generale, 1940-1949; new edi- tion, 1970-1974; Hermann, English trans- lation, General topology, Addison-Wesley, 1966. [10] J. L. Kelley, General topology, Van Nos- trand, 1955. [11] J. Dugundji, Topology, Allyn and Bacon, 1966. [12] 1. Nagata, Modern general topology, North-Holland, 1968. [13] R. Engelking, General topology, Polish Scientific Pub!., 1977. 
1619 [14J P. S. Aleksandrov, Some results in the theory of topological spaces, obtained within the last twenty-five years, Russian Math. Sur- veys, 16, no. 2 (1960),23-83. [15J A. V. Arkhangel'skiI, Mappings and spaces, Russian Math. Surveys, 21, no. 4 (1966), 115-162. [16J Y. Kodama and K. Nagami, Theory of topological spaces (in Japanese), Iwanami, 1974. [17J L. Steen and J. Seebach, Jr., Counter- examples in topology, Holt, Rinehart and Winston, 1970. [18J L. GilIman and M. Jerison, Rings of continuous functions, Van Nostrand, 1960. [19J S. A. Gaal, Point set topology, Academic Press, 1964. [20J E. tech, Topological spaces, Interscience, 1966. [21J R. C. Walker, The Stone-tech compacti- fication, Springer, 1974. [22J R. A. Alo and H. L. Shapiro, Normal topological spaces, Cambridge Univ. Press, 1974. [23J M. D. Weir, Hewitt-Nachbin spaces, North-Holland,1975. 426 (IX.1) Topology The term topology means a branch of mathe- matics that deals with topological properties of geometric figures or point sets. A classical result in topology is the Euler relation on polyhedra: Let ()(o, ()(t, and ()(2 be the numbers of vertices, edges, and faces of a polyhedron homeomorphic to the 2-dimensional sphere; then ()(o - ()(t + ()(2 = 2 (tEuler-Poincare formula for the 2-dimensional case; actualIy, the for- mula was known to Descartes). It is one of the earliest results in topology. In 1833, C. F. Gauss used integrals to define the notion of tlinking numbers of two closed curves in a space ( 99 Degree of Mapping). It was in J. B. Listing's classical work Vorstudien zur Topologie (1847) that the term topology first appeared in print. In the 19th century, B. Riemann published many works on function theory in which topo- logical methods played an essential role. He solved the homeomorphism problem for com- pact surfaces ( 4 I 0 Surfaces); his result is basic in the theory of algebraic functions. In the same period, mathematicians began to study topological properties of n-dimensional polyhedra. E. Betti considered the notion of thomology. H. Poincare, however, was the first to recognize the importance of a topological 426 Topology approach to analysis in general; he defined the thomology groups of a complex [I]. He ob- tained the famous tPoincare duality theorem and defined the tfundamental group. He con- sidered tpolyhedra as the basic objects in top- ology, and deduced topological properties utilizing tcomplexes obtained from polyhedra by tsimplicial decompositions. He thus con- structed a branch of topology known as com- binatorial topology. In its beginning stages combinatorial top- ology dealt only with polyhedra. In the late 1920s, however, it became possible to apply combinatorial methods to general tcompact spaces. P. S. Alexandrov introduced the con- cept of approximation of a tcompact metric space by an inverse sequence of complexes and the definition of homology groups for these spaces. His idea had a precursor in the notion of i'simplicial approximations of continuous mappings, which was introduced by L. E. J. Brouwer in 1911. In 1932, E. tech defined homology groups for arbitrary spaces utilizing the tinductive limit of the homology groups of polyhedra; and ttech cohomology groups for arbitrary spaces were also defined. S. Eilenberg established tsingular (co)homology theory using tsingular chain complexes (1944). The axiomatic approach to (co)homology theory is due to Eilenberg and Steenrod, who gave axioms for (co)homology theory in a most comprehensive way and unified various (co)- homology theories (1945) ( 201 Homology Theory. The approach using algebraic methods has progressed extensively in connection with the development of homology theory. This branch is called algebraic topology. In the 1920s and 1930s, a number of remarkable results in alge- braic topology, such as the tAlexander duality theorem, the tLefschetz fixed-point theorem, and the tHopf invariant, were obtained. In the late 1930s, W. Hurewicz developed the theory of higher-dimensional thomotopy groups ( 153 Fixed-Point Theorems, 201 Homology Theory, 202 Homotopy Theory). J. H. C. Whitehead introduced the concept of tcw complexes and proved an algebraic charac- terization of the homotopy equivalence of CW complexes. N. Steenrod developed tobstruc- tion theory utilizing tsquaring operations in the cohomology ring (1947). Subsequently, the theory of tcohomology operations was introduced ( 64 Cohomology Operations, 305 Obstructions). The theory of tspectral sequences for tfiber spaces was originated by J. Leray (1945) and J.-P. Serre (1951) and was successfully applied to cohomology operations and homotopy theory by H. Caftan and Serre (1954) ( 148 Fiber Spaces, 200 Homological Algebra). The study of the combinatorial 
426 Ref. Topology structures of polyhedra and tpiecewise linear mappings has flourished since 1940 in the works of Whitehead, S. S. Cairns, and others. S. Smale and, independently, J. Stallings solved the tgeneralized Poincare conjecture in 1960. The tHauptvermutung in combinatorial topology was solved negatively in 1961 by B. Mazur and J. Milnor. E. C. Zeeman proved the unknottedness of codimension 3 (1962). The recent development of the theory in con- Junction with progress in tdifferential topol- ogy is notable. The Hauptvermutung for com- binatorial manifolds was solved in 1969 by Kirby, Siebenmann, and \Vall. In particular, there exist different combinatorial structures on tori of dimension? 5, and there are topo- logical manifolds that do not admit any com- binatorial structure ( 65 Combinatorial Manifolds, 114 Differential Topology, 235 Knot Theory). The global theory of differentiable manifolds started from the algebraic-topological study of tfiber bundles and tcharacteristic classes in the 1940s. R. Thorn's fundamental theorem of tcobordism (1954) was obtained through ex- tensive use of cohomology operations and homotopy groups. Milnor (1956) showed that the sphere S7 may have differentiable struc- tures that are essentially distinct from each other by using tMorse theory and the tindex theorem of Thorn and Hirzebruch. These results led to the creation of a new field, tdif- ferential topology ( 56 Characteristic Classes, 114 Differential Topology). Since 1959, A. Grothendieck, M. F. Atiyah, F. Hirzebruch, and J. F. Adams have devel- oped tK-theory, which is a generalized coho- mology theory constructed using stable classes oPvector bundles ( 237 K-Theory). tKnot theory, an interesting branch of top- ology, was one of the classical branches of topology and is now studied in connection with the theory of low-dimensional manifolds ( 235 Knot Theory). On the other hand, G. Cantor established general set theory in the 1870s and introduced such notions as taccumulation points, topen sets, and tclosed sets in Euclidean space. The first important generalization of this theory was the concept of ttopological space, which was proposed by M. Frechet and developed by F. Hausdorff at the beginning of the 20th century. The theory subsequently became a new field of study, called general topology or set-theoretic topology. It deals with the topo- logical properties of point sets in a Euclidean or topological space without reference to polyhedra. There has been a remarkable devel- opment of the theory since abount 1920, nota- bly by Polish mathematicians S. Janiszewski, W. Sierpinski, S. Mazurkiewicz, C. Kuratow- 1620 ski, and others. The contributiom of R. L. Moore, G. T. Why burn, and K. Menger are also important ( 382 Shape Theory, 425 Topological Spaces). Topology is not only a foundation of vari- ous theories, but is also itself one of the most important branches of mathematics. It consists ofthomology theory, thomotopy theory, tdif- ferential topology, tcombinatorial manifolds, tK-theory, ttransformation groups, ttheory of singularities, tfoliations, tdynamical systems, tcatastrophe theory, etc. It continues to de- velop in interaction with other branches of mathematics ( 51 Catastrophe rheory, 126 Dynamical Systems, 154 Foliations, 418 Theory of Singularities, 431 Transformation Groups). References [1] H. Poincare, Analysis situs, J. Ecole Poly- tech., (2) 1 (1895), 1-121. (Oeuvres, Gauthier- Villars, 1953, vo!. 6, 193-288.) [2] P. S. Aleksandrov and H. Hopf, Topologie I, Springer, 1935 (Chelsea, 1965). [3] S. Lefschetz, Algebraic topology, Amer. Math. Soc. Colloq. Pub!., 1942. [4J S. Eilenberg and N. E. Steenrod, Founda- tions of algebraic topology, Princeton Univ. Press, 1952. 427 (IX.12) Topology of Lie Groups and Homogeneous Spaces A. General Remarks Among various topological structures of tLie groups and thomogeneous spaces, the struc- tures of their t(co)homology groups and thomotopy groups are of special interest. Let G/H be a homogeneous space, where G is a Lie group and H is its closed subgroup. Then (G, G/H, H) is a tfiber bundle, where G/H is the base space and H is the fiber Thus homol- ogy and homotopy theory of fiber bundles (tspectral sequences and thomotopy exact sequences) can be applied. The tcellular de- composition of tStiefel manifolds, tGrassmann manifolds, and tKiihler homogeneous spaces are known. Concerning tsymmetric Riemann- ian spaces, we have various interesting meth- ods, such as the use of invariant differential forms in connection with real cohomology rings and the use of tMorse theory in order to establish relations between the diagrams of symmetric Riemannian spaces G/H and homo- logical properties of their tloop spaces and 
1621 some related homogeneous spaces [4,5]. Lie groups can be regarded as special cases of homogeneous spaces or symmetric spaces, although their group structures are of partic- ular importance. A connected Lie group is homeomorphic to the product of one of its compact subgroups and a Euclidean space (tCartan-Mal'tsev-lwasawa theorem). Hence the topological structure of a connected Lie group is essentially determined by the topolog- ical structures of its compact subgroups. B. Homology of Compact Lie Groups Let G be a connected compact Lie group. Since G is an tH-space whose multiplication is given by its group multiplication h, H*(G; k) and H*(G; k) are dual tHopf algebras for any coefficient field k. Also, H*(G; k) is isomorphic as a tgraded algebra to the tensor product of telementary Hopf algebras ( 203 Hopf Alge- bras), but no factor of the tensor product is isomorphic to a polynomial ring because G is a finite tpolyhedron. In particular, if k= R (the field of real numbers), then H*(G; R)  ;\R(X" ..., xJ (the exterior (Grassmann) alge- bra over R with generators XI' .,. ,XI of odd degrees). Here we can choose generators Xi such that h*(x,) = 1 @Xi+Xi@ 1, 1 il. The Xi that satisfy this property are said to be primitive. Since in this case the tcomultiplica- tion h* is commutative, the multiplication h* is also commutative and the Hopf algebra H*(G; R) is an exterior algebra generated by elements Yi having the same degree as Xi (i = 1, "', I). When the characteristic of the coefficient field k is nonzero, h* need not be commutative. The dimension of a tmaximal torus of a connected compact Lie group G is indepen- dent of the choice of the maximal torus and is called the rank of G. The rank of G coincides with the number I of generators of H*(G; R). E. Cartan studied H*(G; R) by utilizing invariant differential forms. The cohomology theory of Lie algebras originated from the method he used in his study. H*(G; R) is invariant under tlocal isomorphisms of groups G. For tclass- ical compact simple Lie groups G, R. Brauer calculated H*(G; R), while c.- T. Yen and C. Chevalley calculated H*(G; R) for texceptional compact simple Lie groups ( Appendix A, Table 6.IV). The degrees of the generators have group-theoretic meaning. Suppose that the degree of the ith generator is 2m i - I, 1  il, and that m l m2  '" ml' When G is simple, there is a relation m i +mH+l =const- ant (Chevalley's duality). We have a proof for this property that does not use classification. The cohomology groups H*(G; Zp) (where P 427C Topology of Lie Groups, Homogeneous Spaces is a prime and Zp = ZlpZ) have been deter- mined as graded algebras for all compact simple Lie groups by A. Borel, S. Araki, and P. Baum and W. Browder ( Appendix A, Table 6JV). C. Cohomology of Classifying Spaces Let (E G , BG' G) be a tuniversal bundle of a connected compact Lie group G and p a prime or zero. Suppose that the integral cohomology of G has no p-torsion (no torsion when p = 0). Then H*(G; Zp)= ;\Zp(X'I' ... ,x;) (H*(G; Z)= l\z(x'I' ... ,x;) when p = 0), an exterior alge- hr" with cip<Yyc=?m.-l 1 <.i<.l ""ci thf' ....,....... "£""&-OVl -.../ &,  --.... .......,&-- ..,.&- generators x; can be chosen to be ttransgres- sive in the spectral sequence of the universal bundle. Let Y 1, ... ,YI be their transgression images. Then degYi=2m i , 1 il, and the cohomology of the tclassifying space BG over Zp (resp. Z) is the polynomial algebra with generators Y I' .,. ,h Let T be a maximal torus of G. Then B 1' = EGIT is a classifying space of T, the tWeyl group W=N(T)IT of G with respect to T operates on B1" by tright transla- tions, and H*(T; Z) has no torsion and is an exterior algebra with I generators of degree 1. Thus H*(B1';Z)=Z[u" ...,u I ], degu i =2. Let Iw be the subalgebra of H*(B 1' ; Z) consisting of W-invariant polynomials, and let p be the projection of the bundle (B1"' BG' GIT). Then under the assumption that G has no p-torsion (no torsion), the cohomology mapping p* over Zp(Z) is monomorphic, and p*: H*(B G ; Zp) Iw @ Zp (H*(B G ; Z)  Iw) [1]. In the case of real coefficients, we have H*(B G ; R) Iw @ R for all G, and m" ..., m l are the degrees of generators of the ring Iw of W-invariant polynomials. Example (1) G = U (n): 1= nand G has no torsion. W operates on H*(B1"; Z) as the group of all permutations of generators U I' "', Un' Thus generators of I ware the telementary symmetric polynomials (J 1, ... ,(In of U I' .., , Un' Let e l' .., ,en be the tuniversal Chern classes; then p*(e,)=(Ji and H*(Bu(n);Z)=Z[c I , ...,c n ]. Example (2) G = SO (n): 1= [nI2] and G has no p-torsion for p # 2. W operates on H*(B1"; Z) as the group generated by the per- mutations of generators U I' .,. , U I and by the transformations (J(ui)=eiu" e i = ::1::1, where the number of U i for which e i = -1 is arbitrary for odd n and even for even n. Thus the generators of Iw are the elementary symmetric poly- nomials (J;, .., ,(J; of uf, ... , uf for odd nand (J'I' ...,(J;_I and u I ... U I for even n. Let Ph ...,PI be the tuniversal pontryagin classes and X be the tuniversal Euler-Poincare class in the case of even n. Then p*(p,) = (Jf and p*(x) = U I .., U/ for integral cohomology. Denote the modp 
4270 Topology of Lie Groups, Homogeneous Spaces reduction of Pi and X by Pi and 7., respectively. Then H*(B,wJ(2/+,);Zp)=Zp[Pl"" ,P/J and H*(B so (2/); Zp)= Zp[Pl'"'' PH, xJ (p=O or >2). Example (3) G = O(n): If we use the subgroup Q consisting of all diagonal matrices instead of T, then we can make a similar argument for Zrcohomology. Since Q  (Z2)"' H*(B Q ; Z 2) = z 2 [11 , .. . , v nJ (Z 2 [1'1 ' .. . , v nJ is a polynomial ring with deg Vi = I), and W 2 = N(Q)/Q operates on BQ by right translations and on H*(B Q ; Z2) as the group of all permuta- tions of VI, ''', I'n' Let IW 2 be the subalgebra of H*( B Q ; Z2) consisting of all W 2 -invariant polynomials. Then IW 2 is a polynomial ring generated by the elementary symmetric poly- nomials CJ, ..., CJ: of VI, ..., V n . The projec- tion P2 : BQ--+ BO(n) induces a monomorphic cohomology mapping p! over Z2' and p!: H*(Boln);Z2)lw2' Let WI, ..., W n be the tuni- versal Stiefel-Whitney classes. Then P!(Wi) = CJi' and H*(B o (");Z2)=Z2[W I ,...,w,,J [2]. D. Grassmann Manifolds The following manifolds are called Grass- mann manifolds: The manifold M,,+m.n(R) con- sisting of all n-subspaces of Rn+m; the mani- fold Mn+m.n(R) consisting of all oriented n- subspaces of Rn+m; and the manifold Mn+m.n(C) consisting of all complex n-subspaces of c n + m . These are expressed as quotient spaces as follows: Mn+m.n(R) = O(n + m)/O(n) x O(m), Mn+m.n(R)= SO(n + m)/SO(n) x SO(m), and M,,+m.,,(C)= U(n+m)/U(n) x U(m). They admit cellular decompositions by tSchubert varieties from which their cohomologies can be com- puted ( 56 Characteristic Classes). MlI+m.,,(R) and Mn+m.n(R) have no p-torsion for pi=2, and M,,+m.n(C) has no torsion. These spaces are m-, m-, and (2m + 1 )-classifying spaces of O(n), SO(n), and U(n), respectively. Hence their cohomologies are isomorphic to those of BG (G=O(n), SO(n), U(n)) in dimensions <m, < m, and  2m, respectively; and they are poly- nomial rings generated by suitable univer- sal characteristic classes in low dimensions. E. Co homologies of Homogeneous Spaces G/U (Rank G = Rank U) Let G be a compact connected Lie group and U a closed subgroup of G with the same rank as G. Denote the degrees of generators of H*(G; R) and H*(U; R) by 2m 1 -1,..., 2m/-I, and 2n, - 1, ..., 2n/- 1, respectively. Then the real-coefficient tPoincare polynomial Po of the homogeneous space G/U is given by Po(G/U, t) = Il(l - t 2m ')/(I - t 2n ,) (G. Hirsch). When G, U, and G/U have no p-torsion, the same formula 1622 is valid for the Zp-coefficient Poincare poly- nomial [I]. When U is the tcentralizer of a torus, G/U has a complex analyti,: cellular decomposition [3]. Hence G/U has no torsion in this case. This was proved by R. Bott and H. Samelson by utilizing Morse theory [5J ( 279 Morse Theory). The case U = T has also been studied. F. Homotopy Groups of Compact Lie Groups The tfundamental group 1[1 (G) of a compact Lie group G is Abelian. Furthermore, 1[2(G)= O. If we apply Morse theory to G, the varia- tional completeness of G can be utilized to show that the loop space OG has no torsion and that its odd-dimensional cohomologies vanish [4]. Consequently, when Gis non- Abelian and simple, we have 1[3(G)Z. A tperiodicity theorem on tstable homotopy groups of classical groups proved by Bott is used in K-theory ( 202 Homotopy Theory; 237 K- Theory). (For explicit forms of homo- topy groups  Appendix A, Table 6.VI). Homotopy groups of Stiefel manifolds are used to define characteristic classes by tob- struction cocycles ( 147 Fiber Bundles; Ap- pendix A, Tab]e 6.VI). References [IJ A. Borel, Sur la cohomologie des espaces fibres principaux et des espaces homogenes de groupes de Lie compacts, Ann. Math., (2) 57 (1953),115-207. [2J A. Borel, La cohomologie mod 2 de cer- tains espaces homogenes, Comffilnt. Math. Helv., 27 (1953),165-197- [3J A. Bore], Kiihlerian coset spaces of semi- simple Lie groups, Proc. Nat. Acad. Sci. US, 40(1954),1147-1151. [4J R. Bott, An app]ication of the Morse theory to the topology of Lie-groups, Bull. Soc. Math. France, 84 (1956),251-281. [5J R. Bott and H. Samelson, Applications of the theory of Morse to symmetric spaces, Amer. J. Math., 80 (1958), 964-1029. 428 (XII1.17) Total Differential Equations A. Pfaff's Problem A total differential equation is an equation of the form (1)=0, (I) 
1623 where w is a tdifferential I-form L7=1 ai(x)dx i on a manifold X. A submanifold M of X is called an integral manifold of (I) if each vector  of the ttangent vector space Tj M) of M at every point x on M satisfies w()=O. We de- note the maximal dimension of integral mani- folds of(1) by m(w). J. F. Pfaff showed that m(w)? (n -1 )/2 for any w. The problem of determining m(w) for a given form (j) is called Pfaff's problem. This problem was solved by G. Frobenius, 1. G. Darboux, and others as follows: Form an talternating matrix (aij)! i,jn from the coefficients of the texterior derivative of OJ, 1 n n dw = - '\' '\' a.. ( x ) dx 1\ dx 2.;"'" I} I l' 1=1/-1 where a ii = (la)rx i - DajDx j . Suppose that the rank of (2) is 2t. Then the rank of the matrix ( a ij -ai ) a j 0 1 i.in is 2t or 2t + 2. In the former case m(w) = n - t, and w can be expressed in the form I L U 2i -' dU 21 i:::l by choosing a suitable coordinate system (u," "', un)' In the latter case m(w) = n - t -1, and w can be expressed in the form I L U2i-l dU 2i +du 2t +, j=l by choosing a suitable coordinate system (u" ..., un)' This theorem is called Darboux's theorem. A I-form w is called a Pfaffian form, and equation (1) is called a Pfaffian equation. A system of equations Wi = 0 (1  i  s) for I-form Wi is called a system of Pfaffian equations or a system of total differential equations [6, 12,26]. 8. Systems of Differential Forms and Systems of Partial Differential Equations Let Q be a system of differential forms w;, Opn, I ivp, on X, where w; is a p-form on X. A submanifold M of X is called an integral manifold of Q = 0 if for each p (0  p  dim M), any p-dimensional subs pace Ep of T.:(M) satisfies w;(Ep)=O (1 ivp) at every point x on M. Denote the maximal dimension of integral manifolds of Q = 0 by m(Q). The problem of determining m(Q) for a given sys- tem Q is called the generalized Pfaff problem, and will be explained in later sections. By fixing a local coordinate system of X and 428 B Total Differential Equations dividing it into two systems (x" ..., x,) and (y"..., Ym) (m= n-r), we can consider the problem of finding an integral manifold of Q = 0 defined by Y.= y.(x"... ,x,), 1IXm. This problem can be reduced to solving a system of partial differential equations of the first order on the submanifold N with the local coordinate system (x" ... ,x,). Consider a system of partial differential equations t1>=0 of order I: (2) (P}(Xi,y.,pl .j,)=O, (3) I ;,s, with 1 ir, 1 IX, #m,.i, +... +.i,/, where . . iJi j +. +Jry plt.}, _. P (3 cxi1 ... 8x/. r (4) A submanifold defined by Y. = y.(x" ..., x,), 1  IX  m, is called a solution of t1> = 0 if it satis- fies (3) identically. The problem of determining whether a given system t1> = 0 has a solution was solved by C. Riquier, who showed that any system can be prolonged either to a pas- sive orthonomic system or to an incompatible system by a finite number of steps. A system of partial differential equations is called a prolongation of another system if the former contains the latter and they have the same solution. A passive orthonomic system is one whose general solution can be parametrized by an infinite number of arbitrary constants. A solution containing parameters is called a general solution if by specifying the parameters we can obtain a solution of the tCauchy prob- lem for any initial data. A system (3) is said to be incompatible if it implies a nontrivial rela- tion f(x [, '" , x,) = 0 among the Xi' The problem of solving a system t1> = 0 of partial differential equations can be reduced to that of finding integral manifolds of a system of differential forms E as follows: Let f be a manifold with the local coordinate system (Xi,y.,pl j,; lir, 11X, #m, .i, +...+.i,I), and E be a system ofO-forms lfJA (I ;.s) and I -forms , dy.- L p;dx i , i=l , dpl . j, - L p' .j, +1.. j'dx k k1 (1  IX, #m,.il +... +.i, < I). Then an integral manifold of L = 0 of the form Y.= y.(x" ....x,), 1 1Xm, P h ..j'_ p h }, (x X ) P - P . " ..., " 1 #m, .i, +... +.i,I, 
428 C Total Differential Equations gives a solution Y. = y.(xj, ..., x y ), 1  IX  m, of <1>=0, and yp and pt' J, satisfy (4). Conversely, a solution y= y.(x 1 ,... ,x,), I  IX  m, of <1> = 0 gives an integral manifold of I = 0 if we define pl' J,(x" ... ,x,) by (4) [23,24,26]. C. Systems of Partial Differential Equations of First Order with One Unknown Function Consider a system of independent tvector fields on N: r iJ L; =I bAi(X)l' ,=1 ex,. I ;,s. We solve a system of inhomogeneous equations L;y- j;(X)y-gA(X)=O, 1 },s, for a given system of f;(x) and gA(X). The sys- tem (5) is called a complete system if each of the expressions [LA' L,,]y - (LAf - LJ;)y-( fgA - fAg) -(L;g-LgA)' 1 A<J1S, (6) is a linear combination of the left-hand sides of (5), where [L;LJ means the tcommutator of L;. and Lw This condition is called the com- plete integrability condition for (5). Suppose that the homogeneous system LAy=O, I AS, is complete. Then it has a system of tfunction- ally independent solutions Yl' ..., Yy_,,, and any solution Y of (8) is a function of them: y= Ij;(y"..., yy-sJ. If the inhomogeneous system (5) is complete, then the homogeneous system (7) is complete. This notion of a complete system is due to Lagrange and was extended to a system of nonlinear equations by Jacobi as follows ( 324 Partial Differential Equa- tions of First Order C). Consider a system of nonlinear equations F;(xj, ...,x"y'Pj, ...,p,)=O, 1 },s, (8) where Pi = r?y/ilxi' The system (8) is called an involutory system if each of [F;, FJ, I  A < J1  S, is a linear combination of Fl' ..., F,. Here tLagrange's bracket [F, GJ is defined by , rJF ( OG CG ) [F,GJ= I - -+Pi---;;- i' Dpi oX i cy _ i OG ( OF +Pi rJF ) . il UPi oX i oy Suppose that the system (8) is involutory and F" ... , F, are functionally independent. Then, in general, we can solve the following tCauchy problem for an (r-8)-dimensional submani- 1624 fold N,-s of N: Given a function f on N,_" find a solution y of (8) satisfying y = f on N y - s . We can construct a solution by integrating a sys- tem of ordinary differential equations called a tcharacteristic system of differential equations. Hence the solution of these problems may be carried out in the cro-category (-" 322 Partial Differential Equations (Methods of Integra- tion) B) [7,11]. D. Frobenius's Theorem Let X be a tdifferentiable manifold of class C ro and 0 be a system of independen1 I-forms w" 1  i  s, on X. Then the system of Pfaffian equations 0 = 0 is called a complf.tely inte- grable system if at every point x of X, (5) , dw= '\ O..l\w. 1 is , I L..... IJ l' J1 (7) for I-forms Oij on a neighborhood of x. Sup- pose that 0=0 is completely integrable. Then at every point x of X, there exists a local co- ordinate system (f1' ..., r.., X'+l, ..., x n ) in a neighborhood U of x for which a tangent vec- tor  of X at ZE U satisfies Wi()=O, 1 is, if and only if J. = 0, I  i  s. In this case, each of the df. is a linear combination of WI' ..., w" and conversely, each of the Wi is 2, linear com- bination of dfl' ..., e1fs' In general, a function f for which elf is a linear combination of WI' .." W s is called a first integral of 0 = O. The theorem of the previous paragraph is called Frobenius's theorem, which can be stated in the dual form as follows Let D(X) be a tsubbundle of the ttangent bundle T(X) over X. The mapping X 3 x...... Dx(X) is called a dis- tribution on X. It is said to be an involutive distribution if at every point x of X we can find a system of independent vector fields L i (I  i  s) on a neighborhood U or x such that the Li(z) (1  i  s) form a basis of Dz(X) at every ZE U and satisfy [Li' LJ == 0 (Lj, ..., Ls), I i<js, on U. A connected submanifold M of X is called an integral manifold of D(X) if T.x(M) = Dx(X) at every point x of M. Suppose that D(X) gives an involutive distribution on X. Then every point x of X is in a maximal integral manifold M that contains any integral manifold including x as a submanifold. E. Cartan-Kahler Existence Theorems Let X be a treal analytic manifold. Denote the tsheaf of rings of differential forms on X by A(X) and its subsheaf of (C'(X)-modules of P- forms on X by Ap(X), 1 pn, where (i)(X) is the sheaf of rings of O-forms on X. A subsheaf of ideals I is called a differential iideal if it is generated by Lp, Opn, and contains dI, 
1625 where Ip=InAp(X). Consider a differential ideal I on X. Denote the tGrassmann mani- fold of p-dimensional subspaces of TAX) with origin XEX by Gp(x), and the Grassmann manifold UXEX G(x) over X by Gp(X). An element Ep of Gp(x) is called a p-dimensional contact element with origin x. An element Ep of Gp(x) is cal1ed an integral element of Ip if w(E p ) = ° at x for any p-form OJ in I, further- more, Ep is called an integral element of I if any element Eq contained in Ep, Oqp, is an integral element of Iq. In particular, 0- dimensional and I-dimensional integral ele- ments are cal1ed integral points and integral vectors, respectively. It can be proved that an element Ep is an integral element of I if and only if it is an integral element of Ip. The polar element H(E p ) of an integral element Ep with origin x is defined as the subspace of T;;(X) consisting of al1 vectors that generate with Ep an integral element of I. Let (L p)O, °  p  n, be the subsheaf of C0(X)-modules in C0'(G p (X)) consisting of all O-forms L a 1J ...lpZll".lp 1 (J <,,,<ipn on Gp(X) derived from a p-form L ai, ipdXi,I\...l\dXipEIp. 1 II <" <Ipn where zi,,,.ip is the tGrassmann coordinate of Ep. An integral element E is called a regular integral element if the fol1owing two conditions are satisfied: (i) (Ip)o is a regular local equation of IIp at E, where IIp is the set of all integral elements of I p ; (ii) dim H(Ep) = constant around E on IIp. This definition, due to E. Kiihler, is different from that given by E. Cartan [4]. Here, in general, a su bsheaf <1> of (V( X) is called a regular local equation of 1<1> at an integral point Xo if there exists a neighborhood U of Xo and tcross sections qJ l' ., . , qJs of <1> on U that satisfy the following two conditions: (i) dqJl"'" dqJs are linearly independent at every x on U; (ii) a point x of U is an integral point of <1> if and only if qJI(X)=", =qJs(x)=O. First existence theorem. Suppose that we are given a p-dimensional integral manifold M with a regular integral element TAM) at a point x on M. Suppose further that there exists a submanifold F of X containing M such that dimF=n-t p + l , dim(TAF)nH(Ep))=p+ I, where E p = TAM) and t p + l =dimH(E p )- P-1. Then around x there exists a unique integral manifold N such that dim N = P + I and F:::J N:::JM. This theorem is proved by integrating a system of partial differential equations of Cauchy-Kovalevskaya type. E. Cartan [2-4J also tried to obtain an existence theorem by integrating a system of ordinary differential equations. 428 E Total Differential Equations A chain of integral elements Eo eEl c .., c E, is called a regular chain if each of Ep (Op<r) is a regular integral element. For a regular chain Eo c El C ... c E" define t p+l by t p + I =dimH(E p )- p-I, o p<r, and define sp by sp=tp-tp+I-1 (Op<r), s,=t" where to =dimlIo. Then we have sp?O (Opr), So +... +s,= to -r, and we can take a local co- ordinate system (XI, ""X"Yl' ...,Ym), m=n-r, around Eo that satisfies the following four conditions: (i) lID is defined by Y'o-,+l =... = Ym=O; { 0 0 (ii) H(E p )= -,...,-, oX l ox, o OY'o+" +'p ,+1 ' ..., Dy-J Op<r; { 0 0 } ( i l 'i ) E p= oxl ""' ox p , Ipr; (iv) Eo=(O,...,O,O,...,O). The integers So, ... ,S, are called the characters of the regular chain Eo c .., c E,. Second existence theorem. Suppose that a chain of integral elements Eo c ... c E, is regu- lar, and take a local coordinate system satisfy- ing (i)-(iv). Consider a system of initial data II, ... ,[so' 1'°+1 (xI!, "', /;o+s, (XI)' fO+l::.l + 1 (Xl' X 2 ), ... ,fso+SJ +.'1"2 (Xl' X 2 ), !so+ "+S,_I +1 (Xl,'" ,X,),... ,fro-'(X l '''' ,xrJ, Then if their values and derivatives of the first order are sufficiently small, there exists a unique integral manifold defined by Y = Ya(x l ,,,,, x,), Yp=O, I rx to -r<{J m, such that Ya(X l , ..., xP' 0, ...,0) = fa(x l , ..., x p ), So+... +Sp-l <rxso+'" +sP' Opr. This theorem is proved by successive appli- cation of the first existence theorem. These two theorems are cal1ed the Cartan-Kiihler existence theorems. I is said to be involutive at an integral element E, if there exists a regular chain Eo c ... c E,. An integral manifold pos- sessing a tangent space at which I is involutive is cal1ed an ordinary integral manifold Or ordi- nary solution of I. An integral manifold that does not possess such a tangent space is called a singular integral manifold or singular solution of I. Cartan's definition of ordinary and regular integral elements is as fol1ows: An integral point Eg is an ordinary integral point if Io is a regular local equation of lID at Eg. An ordi- 
428 F Total Differential Equations nary integral point E8 is a regular integral point if dim H(E o ) is constant on 1 Lo around Eg. Inductively, an integral element E is called an ordinary integral element if (Lp)O is a regular local equation of 1 Lp at E and E contains a regular integral element E_,. An ordinary integral element E is a regular integral element (in the sense of Cartan) if dimH(EpJ is constant on ILp around E. It can be proved that L is involutive at an integral element Er if and only if Er is an ordinary integral element of L. An integral manifold possessing a tangent space that is a regular integral element of L is called a regular in- tegral manifold or regular solution of L. Let m p +' be the minimal dimension of H(E p ), where Ep varies over the set of p-dimensional ordinary integral elements, and 9 be an integer such that mp? p (1  p g) and m q +1 = p. Then this integer 9 is called the genus of L. ] t is the maximal dimension of ordinary integral mani- folds of L. However, in general, it is not the maximal dimension of integral manifolds of L. D. C. Spencer and others have been trying to obtain an existence theorem in the C'''- category analogous to that of Cartan and Kiihler. (For a system of linear partial dif- ferential equations  [2,4,11,13,25,27].) F. Involutive Systems of Partial Differential Equations To give a definition of an involutive system of partial differential equations, we define an involutive subspace of Hom(V, W), where V and Ware finite-dimensional vector spaces over the real number field R. Let A be a sub- space of Hom(V, W). For a system of vectors v" ...,u p in V, A(u" ...,u p ) denotes the sub- space of A that annihilates u l' "', Up' Let gp be the minimal dimension of A(u 1 , ''', up) as (u"..., up) varies, where Opr=dim V. A basis (u I, .., , ur) of V is ca!1ed a generic basis if it satisfies gp=dimA(u 1 , ..., up) for each p. There exists a generic basis for any A. Let W@ S2(V*) be the subspace of Hom(V, Hom(V, W)) consisting of all elements  satisfying (u)u = (u)u for any u and u in V. Then the prolonga- tion pA of A is defined by pA = Hom(V, A)n W@ S2(V*). For any basis (u" "', ur) of V, we have the inequality , dimpA L dimA(u 1 ,...,u p )' FO The subspace A is ca!1ed an involutive subspace of Hom(V, W) ifdimpA=Logp' This notion of an involutive subspace was obtained by V. W. Gui!1emin and S. Sternberg [13]. A triple (X, N; n) consisting of two mani- folds X, N and a projection n from Xonto N is called a fibered manifold if the tdifferential n* 1626 is surjective at every point of X. Take the set of all mappings f from a domain in N to X satisfying no f == identity for a fibered manifold (X, N; n). Then an tl-jet j;(f) is an equivalence class under the equivalence relation defined as follows j;(f) = j(g) if and only if .x = u, f(x) = g(u), and (lit + . +i, f 8;1 +,..+irg " , (x) -,' " (u), OXl...ox:r CX;1...0x;r i 1 +... + i,  I, where (x j, ... ,x r ) is a local co- ordinate system of N around x = l< ( 105 Differentiable Manifolds X). Denote the space of all I-jets of a fibered manifold (X, N; n) by JI(X, N; n) or simply JI. Then a subsheaf of ideals <1> in (O(J I ) is called a system of partial differential equations of order Ion N. A point z of JI is called an integral point of <1> if cp(z) = ° for all cp E <1>. The set of all integral points of <1> is denoted by J <1>. Let n l be the natural projection of JI onto J I - 1 Then at a point z of JI, we can identify Kef n with Hom(TAN),Kern), where x=nn'... nlz. The principal part C z (<1» of <1> is defined as the sub- space of Kern that annihilates <1>. The pro- longation p<1> of <1> is defined as the system of order 1+1 on N generated by cIJ' and i\<1>, 1 kdimN, where a k is the formal derivative with respect to a coordinate X k of .'V: a (akCP)W+1(f))= a cp(j;(f)), cpI,,(!)(JI). Xk Let w be an integral point of p<1> and z be n l +1 w. Then we have the identity pC z (<1» = C w (p<1». The fo!1owing definition of an involutive system is due to M. Kuranishi [19]: <1> is invo- lutive at an integral point z if the following two conditions are satisfied: (i) <1> is a regular local equation of 1<1> at z; (ii) there exists a neighbor- hood U of z in JI such that (n l + 1 f1 un l(p<1» forms a fibered manifold with base un 1<1> and projection n l + 1 . A system of partial differential equations is said to be involutive (or involutory) if it has an integral point at which it is involutive. Fix a system of independent variables () 1, ... , YN) in X. Then a system of differential forms is said to be involutive (or involutory) if it has an in- tegral element at which it is involutive and dY1 /\.../\ dYN *0. It can be proved that these two definitions of involutive system are equiva- lent [19,25]. G. Prolongation Theorems Cartan gave a method of prolongation by which we can obtain an involutivt:: system from a given system with two independent 
1627 variables, if it has a solution. He proposed the following problem: For any r > 2, construct a method of prolongation by which we can obtain an involutive system from a given sys- tem with r independent variables, if it has a solution. To solve this problem, Kuranishi prolonged a given system <I> successively to p'<I>, t = 1, 2, 3, ..., and proved the following theorem: Suppose that there exists a sequence of integral points z' of p'<I> with n'+l z' = Z'-l, t = 1, 2, 3, ..., that satisfies the following two conditions for each t: (i) p'<I> is a regular local equation of I(p'<I» at z'; (ii) there exists a neighborhood V' of z' in I(p'<I» such that n'+I V' contains a neighborhood of Z'-l in I(p'-l<l» and forms a fibered manifold (V', n'+l V'; n'+l). Then p'<I> is involutive at z' for a sufficiently large integer t. This prolongation theorem gives a powerful tool to the theory of tinfinite Lie groups. How- ever, if we consider a system of partial dif- ferential equations of general type, there exist examples of systems that cannot be prolonged to an involutive system by this prolongation, although they have a solution. To improve Kuranishi's prolongation theorem, M. Mat- suda [22J defined the prolongation of the same order by Po<l> = p<l> n (!J(JI) for a system <I> of order I. This is a generalization of the classi- cal method of completion given by Lagrange and Jacobi. Applying this prolongation suc- cessively to a given system <1>, we have '£I = U)=l pg<l>. Define the p*-operation by p* = U:'=l Pop. Then applying this prolongation successively to '£I, we have the following theo- rem: suppose that there exists a sequence of integral points z' of p '£I with n 1 +' z' = Z'-l, t = 1, 2, 3, ..., that satisfies the following two conditions for each t: (i) p '£I is a regular local equation of I (p '£I) at z'; (ii) dimpC(p '£I) is constant around z' on I(p '£I). Then p '£I is involutive at z' for a sufficiently large integer t. To prove this theorem Matsuda applied the following theorem obtained by V. W. Guil- lemin, S. Sternberg, and J.-P. Serre [25, ap- pendix]: suppose that we are given a sub- space Ao of Hom(V, W) and subspaces A, of Hom(V, A'_I) satisfying A,e pA,-t. t = 1,2, 3, .... Then A, is an involutive subspace of Hom(V, A,-l) for a sufficiently large integer t. Thus Cartan's problem was solved affirma- tively. To the generalized Pfaff problem these prolongation theorems give another solution, which differs from that obtained by Riquier. H. Pfaffian Systems in the Complex Domain Consider a linear system of Pfaffian equations n m du i = L L at(x)ujdx k , i= 1, ...,m, k=lj=1 428 H Total Differential Equations where x = (x p .., , x n ) is a local coordinate of a complex manifold X and at are meromor- phic functions on X. If we put U = '(u l , ..., u m ) and Ak(x)=(at(x)), k= 1, ...,n, the system is written as du = Ct l Ak(X)dX k ) u. (9) System (9) is completely integrable if and only if aAj aA I axl - axj =[A1,Aj], j,l=l,...,n. Suppose that (9) is completely integrable. If the Ak(x) are holomorphic at Xo =(x?, ..., X)EX, there exists for any UO E C m one and only one solution of (9) that is holomorphic at XO and satisfies u( XO) = uo. This implies that the solu- tion space of (9) is an m-dimensional vector space; the basis of this space is called a funda- mental system of solutions. Therefore any solution is expressible as a linear combination of a fundamental system of solutions and can be continued analytically in a domain where the Ak(x) are holomorphic. A subvariety of X that is the pole set of at least one of the Ak(x) is called a singular locus of (9), and a point on a singular locus is ca1\ed a singular point. R. Gerard has given a definition of regular singular points and an analytic expression of a fundamental system of solutions around a regular singular point, and he studied systems of Fuchsian type [8; also 9, 30]. Let Q = L=l Ak(x)dxk' Then the system (9) can be rewritten as (d-Q)u=O. If we consider a local coordinate (x, u) of a fiber bundle over X, the operator d-Q in- duces a meromorphic linear connection V over X. Starting from this point of view, P. Deligne [5J introduced several important concepts and obtained many results. The first results for irregular singular points were obtained by Gerard and Y. Sibuya [10], and H. Majima [20J studied irregular singular points of mixed type. The systems of partial differential equations that are satisfied by the hypergeometric func- tions of several variables are equivalent to linear systems of Pfaffian equations [1]. This means that such systems of partial differential equations are tholonomic systems. M. Kashi- wara and T. Kawai [15J studied holonomic systems with regular singularities from the standpoint of microlocal analysis. Special types of holonomic systems were investigated by T. Terada [28J and M. Yoshida [29]. Consider a system of Pfaffian equations Wj=O, j= I, ...,r, (10) 
428 Ref. Total Differential Equations where (I)j = LZl ajk(x)dx k and x = (x l' ..., x n ). Suppose that a jk are holomorphic in a domain D of en and that dOJ j 1\ OJ! 1\ ...1\ OJ, = 0 in D. Denote by S the zero set of OJ I 1\ ...1\ OJ, = O. A point of S is called a singular point of(10). If the codimension of S is ? 1, then system (10) is completely integrable in D - S. The following theorem was proved by B. Malgrange [21]: Let XO E S, and suppose that the codimen- sion of S is ? 3 around xO; then there exist functions fj, j= 1,..., r, and Yjb j, k = 1, ..., r, that are holomorphic at XO and satisfy (J)j= L! Yjk dl k and det(Yjdxo)h'O. References [IJ P. Appell and J. Kampe de Feriet, Fonc- tions hypergeometriques et hyperspheriques, Gauthier-Villars, 1926. [2J E. Cartan, Sur l'integration des systemes d'equations aux differentielles totales, Ann. Sci. Ecole Norm. Sup., 18 (1901), 241-311. [3J E. Cartan, Leyons sur les invariants in- tegraux, Hermann, 1922. [4J E. Cartan, Les systemes differentielles exterieures et leurs applications geometriques, Hermann, Actualites Sci. Ind., 1945. [5J P. Deligne, Equations differentielles a points singuliers reguliers, Lecture notes in math. 163, Springer, 1970. [6J A. R. Forsyth, Theory of differential equa- tions, pt. I. Exact equations and Pfaff's prob- lem, Cambridge Univ. Press, 1890. [7J A. R. Forsyth, Theory of differential equa- tions, pt. IV. Partial differential equations, Cambridge Univ. Press, 1906. [8J R. Gerard, Theorie de Fuchs sur une variete analytique complexe, 1. Math. Pures Appl., 47 (1968), 321-404. [9J R. Gerard and A. H. M. Levelt, Sur les connexions a singularites regulieres dans Ie cas de plusieurs variables, Funkcial. Ekvac., 19 (1976),149-173. [IOJ R. Gerard and Y. Sibuya, Etude de cer- tains systems de Pfaff avec singularites, Lec- ture notes in math. 712, Springer, 1979. [1 IJ E. Goursat, Leyons sur I'integration des equations aux derivees partielles du premier ordre, Hermann, second edition, 1920. [12J E. Goursat, Leyons sur Ie prob\eme de Pfaff, Hermann, 1922. [13J V. W. Guillemin and S. Sternberg, An algebraic model of transitive differential geom- etry, Bull. Amer. Math. Soc., 70(1964),16-47. [14J E. Kahler, Einfiihrung in die Theorie der Systeme von Differentialgleichungen, Teubner, 1934. [15J M. Kashiwara and T. Kawai, On holo- nomic systems of microdifferential eq uations IV, systems with regular singularities, Publ. Res. Inst. Math. Sci., 17 (1981),813-979. 1628 [16J M. Kuranishi, On E. Cartan's prolonga- tion theorem of exterior differential systems, Amer. J. Math., 79 (1957), 1-47. [17J M. Kuranishi, Lectures on e).terior dif- ferential systems, Lecture notes, Tata Inst., 1962. [18J M. Kuranishi, On the local theory of continuous infinite pseudo groups I, II, Nagoya Math. J., 15 (1959), 225-260; 19 (1961), 55-91. [19J M. Kuranishi, Lectures on involutive systems of partial differential eq uations, Publ. Soc. Math. Siio Paulo, 1967. [20J H. Majima, Asymptotic analysis for inte- grable connections with irregular singular points, Lecture notes in math. 1075, Springer, 1984. [21J B. Malgrange, Frobenius avec singular- ites II, Le cas general, Inventiones Math., 39 (1976),67-89. [22J M. Matsuda, Cartan-Kuranishi's pro- longation of differential systems combined with that of Lagrange and Jacobi, Publ. Res. Inst. Math. Sci., 3 (1967/68), 69- 84. [23J C. Riquier, Les systemes d'eq uations aux derivees partielles, Gauthier-Villars, 1910. [24J J. F. Ritt, Differential algebra, Amer. Math. Soc. Colloq. Publ., 1950. [25J I. M. Singer and S. Sternberg" The infinite groups of Lie and Cartan I. The transitive groups, J. Analyse Math., 15 (1965), 1-114. [26J J. A. Schouten and W. v. d. Kulk, Pfaff's problem and its generalizations, Clarendon Press, 1949. [27J D. C. Spencer, Overdetermined systems of linear partial differential equations, Bull. Amer. Math. Soc., 75 (1969), 179-239. [28J T. Terada, Probleme de Rieman et fonc- tions automorphes proven ant des fonctions hypergeometriques de plusieurs variables, J. Math. Kyoto Univ., 13 (1973), 557-578. [29J M. Yoshida, Local theory of Fuchsian systems with certain discrete monodromy groups I, II, Funkcial. Ekvac., 21 1,1978), 105. 137,203-221. [30J M. Yoshida and K. Takano, On a linear system of Pfaffian eq uations with regular singular point, Funkcial. Ekvac., 19 (1976), 175-189. 429 (XI.6) Transcendental Entire Functions A. General Remarks An entire function (or integral funetion) [(z) is a complex-valued function of a complex variable 
1629 z that is holomorphic in the finite z-plane, z#-oc. If j(z) has a pole at 'XJ, then I(z) is a polynomial in z. A polynomial is called a rational entire function. If an entire function is bounded, it is constant (tLiouvil1e's theorem) A transcendental entire function is an entire function that is not a polynomial, for example, exp z, sin z, cos z. An entire function can be developed in a power series L;;u anz n with infinite radius of convergence. If j (z) is a tran- scendental entire functiOtt, this is actually an infinite series B. The Order of an Entire Function If a transcendental entire function j(z) has a zero of order m (m ? 0) at z = 0 and other zeros at :Xh :x 2 , "', :X n , ... (0 < 1:x 1 1  1:x 2 1  1:x 3 1 ...-> X)), multiple zeros being repeated, then /(z) can be written in the form j(z)=eg(z)zm fI ( 1- ) egd=>, kl :X k where g(z) is an entire function, gk(Z) = (z/:x k ) + (1/2) (Z/:Xk)2 +(1/3)(z/:xd 3 +.. + (l/pd(z/:xd P " and PI, P2' '" are integers with the property that L;1Iz/:XkIPk+1 converges for all z (Weier- strass's canonical product). E. N. Laguerre introduced the concept of the genus of a transcendental entire function j(z). Assume that there exists an integer P for which Lo l:x k l-(p+1) converges, and take the smallest such p. Assume further that in the representation for i(z) in the previous para- graph, when PI = P2 =. . = p, the function g(z) reduces to a polynomial of degree q; then max(p,q) is called the genus of j(z). For tran- scendental entire functions, however, the order is more essential than the genus The order P of a transcendental entire function j(z) is de- fined by P = Jim sup log log M(r)/Iog r, r-->xJ where M(r) is the maximum value of Inz)1 on Izl = r. By using the coefficients of j(z) = L a"zn, we can write p= Jim sup n log n/log( 1/la"I). The entire functions of order 0, which were studied by Valiron and others, have prop- erties similar to polynomials, and the en- tire functions of order less than 1/2 satisfy lim,,,_'l minl=I'" Ij(z)1 = rfJ for some increasing sequence r" i 'XJ (Wiman's theorem). Hence entire functions of order less than 1/2 cannot be bounded in any domain extending to infin- ity Among the functions of order greater than 1/2 there exist functions bounded in a given angular domain D::x < arg z <:x + 1[//1. If Ij(z)1 4298 Transcendental Entire Functions <expr P (P</1) and ((z) is bounded on the boundary of D, then j(z) is bounded in the angular domain ( 272 Meromorphic Func- tions). In particular, if the order P of I(z) is an integer p, then it is equal to the genus, and g(z) reduces to a polynomial of degree p (1 Hadamard). These theorems originated in the study of the zeros of the tRiemann zeta func- tion and constitute the beginning of the theory of entire functions. There is some difference between the prop- erties of functions of integral order and those of others. Generally, the point z at which f(z) = \v is called a w-point of j(z). If {zn} consists of w-points different from the origin, the in- fimum pdw) of k for which L 1/lz"l k converges is called the exponent of convergence of j - w. If the order p of an entire function is integral, then pdw)=p for each value 11' with one pos- sible exception, and if p is not integral, then PI (11') = P for all 11' (E. Borel). Therefore any transcendental entire function has an infinite number of w-points for each value 11' except for at most one value, cal1ed an exceptional value of f(z) (Picard's theorem). In particular, f(z) has no exceptional values if p is not integral For instance, sinz and cosz have no excep- tional values, while e Z has 0 as an exceptional value. Since transcendental entire functions have no poles, 'XJ can be counted as an excep- tional value Then we must change the state- ment in Picard's theorem to "except for at most two values." Since the theorem was ob- tained by E. Picard in 1879, problems of this type have been studied intensively ( 62 Cluster Sets. 272 Meromorphic Functions). After Picard proved the theorem by using the inverse of a tmodular function, several alternative proofs were given. For instance, there is a proof using the Landau-Schottky theorem and tBloch's theorem and one using tnormal families. Picard's theorem was ex- tended to meromorphic functions and has also been studied for analytic functions defined in more general domains. There are many fully quantitative results, too. For instance, Valiron [3J gave such results by performing some calculations on neighborhoods of points where entire functions attain their maximum ab- solute values. Thereafter, the distribution of w-points in a neighborhood of an essential singularity was studied by many people, and in 1925 the Ne- vanlinna theory of meromorphic functions was established. The core of the theory consists of two fundamental theorems, tNevanlinna's first and second fundamental theorems ( 272 Meromorphic Functions). Concerning com- posite entire functions F(z) = j(g(z)), P61ya proved the following fact: The finiteness of the order of F impJies that the order of { should 
429 C Transcendental Entire Functions be zero unless 9 is a polynomia1. This gives the starting point of the factorization theory, on which several people have been working recently. Several theorems in the theory of meromorphic functions can be applied to the theory. One of the fundamental theorems is the fonowing: Let F(z) be an entire func- tion, which admits the factorizations F(z) = Pm(fm(z)) with a polynomial Pm of degree m and an entire function [m for an integers m. Then F(z)=Acos JH(z) +B unless F(z) = A exp H (z) + B. Here, H is a nonconstant entire function and A, B are constant, A *0. C. Julia Directions Applying the theory oftnormal families of holomorphic functions, G. Julia proved the existence of Julia directions as a precise form of Picard's theorem [5]. A transcendental entire function [(z) has at least one direction arg z = (j such that for any E > 0, [(z) takes on every (finite) value with one possible exception infinitely often in the angular domain (j - E < arg z < (j + E. This direction arg z = (j is caned a Julia direction of [(z). D. Asymptotic Values tAsymptotic values, tasymptotic paths, etc., are defined for entire functions as for mero- morphic functions. In relation to tIversen's theorem and tGross's theorem for inverse functions and results on tcluster sets, tordinary singularities of inverse functions hold for entire functions in the same way as for meromorphic functions. Also, as for meromorphic functions, ttranscendental singularities of inverse func- tions are divided into two classes, the tdirect and the tindirect transcendental singularities. The exceptional values in Picard's theorem are asymptotic values of the functions, and 00 is an asymptotic value of any transcenden- tal entire function. Therefore [(z)-+ 00 along some curve extending to infinity. Between the asymptotic paths corresponding to two distinct asymptotic values. there is always an asymptotic path with asymptotic value 00. By tBloch's theorem, A. Bloch showed that the tRiemann surface of the inverse function of a transcendental entire function contains a disk with arbitrarily large radius. Denjoy conjec- tured in 1907 that 11  2p, where p is the order of an entire function and 11 is the number of distinct finite asymptotic values of the func- tion, and L. V. Ahlfors gave the first proof (1929). This result contains Wiman's theorem. There are transcendental entire functions with 11=2p. It was shown by W. Gross that among entire functions of infinite order there exists 1630 an entire function having every value as its asymptotic value. References [lJ E. C. Titchmarsh, The theory of functions, Clarendon Press, second edition, 1939. [2J R. P. Boas, Entire functions, Academic Press, 1954. [3J G. Valiron, Lectures on the general theory of integral functions, Librairie de rUniversite, Deighton, Ben and Co., 1923 (Chelsea, 1949). [4J L. Bieberbach, Lehrbuch der Funktionen- theorie II, Teubner, 1931 (Johnson Reprint Co., 1969). [5J G. Julia, Sur quelques proprietes nouvenes des fonctions entieres ou meromorphes, Ann. Sci. Ecole Norm. Sup., (3) 36 (1919), 93-125. 430 (V .11 ) Transcendental Numbers A. History A complex number (1. is caned a transcendental number if (1. is not talgebraic over the field of rational numbers Q. C. Hermite showed in 1873 that e is a transcendental number. Fol- lowing a similar line of thought as that taken by Hermite, C. L. F. Lindemann showed that 1[ is also transcendental (1882). Among the 23 problems posed by D. Hilbert in 1900 ( 196 Hilbert), the seventh was the problem of estab- lishing the transcendence of certain numbers (e.g., 2 J2 ). This stimulated fruitful investiga- tions by A. O. Gel'fond, T. Schneider, C. L. Siegel, and others. The theory of transcen- dental numbers is, however, far from complete. There is no general criterion that can be uti- lrzed to characterize transcendental numbers. For example, neither the transcendence nor even the irrationality of the tEuler constant C=limnoo(1 + 1/2+... + l/n-logn) has been established. A survey of the development of the theory of transcendental numbers can be found in [18J, in which an extensive list of relevant publications up to 1966 is given. B. Construction of Transcendental Numbers Let Q be the field of talgebraic numbers. Sup- pose that (1. is an element of Q that satisfies the irreducible equation J(x) = aox" + (l1 X"-1 + .,. + an =0, where the a; are rational itltegers, ao*O, and a O ,a 1 , ...,a n have no common factors. Then we define H«(1.) to be rhe maxi- 
1631 mum of lad (i=O,..., n) and call it the height of :x. J. Liouville proved the following theorem (1844): Let  be a real number ( Q). If inf{ qnl  - p/qll p/qEQ} =0 for any positive integer n, then  is transcendental. Transcendental numbers having this prop- erty are called Liouville numbers. Examples are: (i)  = ;;:'l 9 -", where 9 is an integer not smaller than 2 (ii) Suppose that we are given a sequence {nd of positive integers such that nk-->CfJ (k-->CfJ). Let  be the real number expressed as an tinfinite simple continued frac- tion ho+ I/h! -+- l/h 2 -+- .... Let B, be the denomi- nator of the ith tconvergent of the continued fraction. If h nk +! ?o Br 2 for k?o 1, then  is a Liouville number. On the other hand, K. Mahler [8, 9J proved the existence of transcendental numbers that are not Liouville numbers. For example, he showed that if f(x) is a nonconstant inte- gral polynomial function mapping the set of positive integers into itself, then a number  expressed, e.g., in the decimal system as OY! Y2Y3 ... is such a number if we put Yn = f(n), n = 1, 2, 3, ... . (In particular, from f(x) = x we get the non-Liouville transcendental number  =0.123456789101112....) Mahler proved this result by using tRoth's theorem (1955) ( 182 Geometry of Numbers). Both Liouville and Mahler utilized the theory oftDiophantine approximation to construct transcendental numbers. On the other hand, Schneider [1O-l2J and Siegel [3J constructed transcendental num- bers using certain functions. Examples are. exp:x (:XEQ, 1:->;60);:x P (IXEQ, IX#O, 1; #EQ-Q); J (T), where J is the tmodular function and T is an algebraic number that is not contained in any imaginary quadratic number field; p(2ni/:X), where fP is the Weierstrass f,)- function, :XEQ, and IX #0; and B(p, q), where B is the tBeta function and p, qEQ-Z. Since e = exp 1 and I = exp 2ni, the tran- scendence of e and n is directly implied by the transcendence of explX (IXEQ, :x #0). C. Classification of Transcendental Numbers (1) Mahler's classification: Given a complex number  and positive integers nand H, we consider the following: Wn(H,O=min{lt a,CII a,EZ, la,IH, t a,c#o}, wn()=wn=limsup( -logwn(H, WlogH), H"" w()= w= limsupwn()/n, 430D Transcendental Numbers and let {1 = the first number n for which W n is Xc Then we have the following four cases: (I) w =0, {1 = 'XJ; (ii) ° < w < CfJ, {1 = CfJ; (iii) w= {1 = CfJ; (iv) w = CfJ, {1 < CfJ, corresponding to which we call  an A-number, S-number, T- number, or U-number. The set of A-numbers is denoted by A, and similarly we have the classes S, T, and U. It is known that A=Q. If two numbers  and 1] are talgebraically dependent over Q, then they belong to the same class If  belongs to S, the quantity O() = sup {w n ( )/n I n = 1,2, ... } is called the type of  (in the sense of Mahler). Mahler conjectured that almost aU transcendental numbers (except a set of Lebesgue measure zero) are S-numbers of the type 1 or 1/2 according as they belong to R or not. Various results were obtained concerning this conjecture (W. J. LeVeque, J F. Koksma, B. Volkmann) until it was proved by V. G. Sprindzhuk in 1965 [14,15]. The existence ofT-numbers was proved by W. M. Schmidt (1968) [16]. All Liouville numbers are U-numbers [7J On the other hand, 10gIX (:XEQ, :x> 0, :x # I) and n are transcendental numbers that do not belong to U. (2) Koksma's classification: For a given transcendental number  and positive numbers nand H, we consider the following: w:(H, O=min{I-IXIIIXEQ, H(IX)H, [Q(IX):QJn}, w:()= W:=limsup( -log(Hw:(H, m/logH), ll-x, w*(O=w*=limsupw:(Wn, nX and let {1* =the first number n for which w: is CfJ. Then we have the following three cases: (i) w* < CfJ, {1* = CfJ; (ii) w* = {1* = CfJ; (iii) w* = 00, {1* < 00. We call  an S*-number, T*-number, or U*-number according as (i), (ii), or (iii) holds and denote the set of S*-numbers by S*, etc. If  belongs to S*, we call O*()=sup{w:(Wnln = 1, 2, ... } the type of  (m the sense of Kok- sma). It can be shown that S=S*, T=T*, and U = U*, and that if ES, then O*(O O() (}*()+1. D. Algebraic Independence Concerning the algebraic relations of tran- scendental numbers, we have the following three principal theorems: (1) Let :x" '" ':X m be elements of Q that are linearly independent over Q. Then exp:x 1, ... , exp:X m are transcendental and alge- braically independent over Q (Lindemann- Weierstrass theorem). (2) Let Jo(x) be the tBessel functIOn and IX a nonzero algebraic number. Then Jo(rx) and J(:x) are transcendental and algebraically independent over Q (Siegel). 
430D Transcendental Numbers (3) Let rx" ... ,rx n be nonzero elements of Q such that log rx" ... , log rx n are linearly inde- pendent over Q Then 1, log rx J, ... , logrx n are linearly independent over Q (A. Baker). Besides these theorems, various related results have been obtained by A. B. Shidlovskii, Gel'fond, N. I. Fel'dman, and others. A quanti- tative extension of theorem (3), also by Baker, will be discussed later. First we give more detailed descriptions of theorems (1) and (2). Let rx l ,..., rxm be as in theorem (1), s=[Q(rx l , ..,rxm):QJ, P(X J,"', X m ) be an arbitrary polynomial in Q [XJ,"', XmJ of degree n, and H(P) be the maximum of the absolute values of the coeffi- cients of the polynomial P. Then there exists a positive number C determined only by the numbers rx l , ..., rx m and n( =deg P) such that -2'( 2e'mn+m+n) -I) IP(e"',...,e"m)I>CH(P) m . In particular, if rx is a nonzero algebraic number, then exprx belongs to Sand O(exprx) :( 85 2 + 65. (2') Let rx be a nonzero algebraic number, s=[Q(rx).QJ, PEQ[XJ,X 2 J, degP=n. Then there exists a positive number C determined only by rx and n such that IP(Jo(rx), J(rx))1 > CH(P) -S2.,1 n 3. Theorems (1) and (2) are actually special cases of a theorem obtained by Siegel. To state this theorem, the following terminology is used: An entire function f(z) = Lo C n ' zn/n! is called an E-function defined over an talge- braic number field K of finite degree if the fol- lowing three conditions are satisfied: (i) C n E K (n = 0,1,2, ...). (ii) For any positive number 0, C n = O(n'"). (iii) Let qn be the least positive integer such that Ckqn belongs to the ring D of algebraic integers in K (O:(n, O:(k:(n). Then for an arbitrary positive number F., qn = O(n'"). A system {II (z), ... Jm(z)} of E-functions defined over K is said to be normal if it satis- fies the following two conditions: (i) None of the functions f;(z) is identically zerO. (ii) If the functions W k = f(z) (k = 1,..., m) satisfy a sys- tem ofthomogeneous linear differential equa- tions of the first order, then w = Lr1 Qk'(Z)W" where the Qk/(Z) are rational functions of z, with coefficients in the ring D. The matrix (Qk/) can be decomposed by rearranging the order of the indices k, I if necessary into the form (:1' } where w,= [ Q.I.:,t ::: Q(.t ] , Qmt1 ,t... Qmtmt t n 1 :( t :( r, I m, = m. r=l 1632 The decomposition is unique if we choose r as large as possible, in which case we call W" .., w,: the primitive parts of (Qk/)' The requirement is that the primitive parts W; are independent in the following sense: If there are numbers C"EK and polynomial functions Pk,(Z)EK[zJ such that , [ Plt(Z) ,! I (Clt...Cm,,)W, : =0, t' Pm,(z) , then C,,=O, Pk/(Z) =0. Let N be a positive integer. A normal sys- tem {f (z), '" Jm(z)} of E-functions is said to be of degree N if the system {F n ,. .nm(z)= .fdz)n, ....fm(z)"mlnj?oO, Lr1 nj:(N} is also a normal system of E-functions. Then the theorem obtained by Siegel [4J is: Let N be an arbitrary positive integer and { fl (z), '" ,fm(z)} be a normal system of E-functions of degree N defined over an algebraic number field of finite degree K satisfying the system of differen- tial equations .f(Z)=Lr, Qk/(Z)j;(z), where Qk/(Z)ED(z), 1:( k:( m. If rx is a nonzero alge- braic number that is not a tpole of anyone of the functions Qk/(Z), then fl (rx), "', rm(rx) are transcendental numbers that are algebraically independent over the field Q. Theorem (3) at the beginning of this sec- tion implies, for example, the following: (i) If rx l ,..., rxn and fJI, "', fJn all belong to Q and }' = rx,logfJ, + .., + rx n 10gfJn =1=0, then}' is tran- scendental. (ii) If rx I, ... ,rx n , fJo, fJI' . ,fJn are nonzero algebraic numbers, then eBorxq, .., rxn is transcendental. (iii) If rx"..., rx n are alge- braic numbers other than 0 and 1. and /3" ... ,fJn also belong to Q, with 1, fJI ... ,f3n linearly independent oyer Q, then rxf' .., rxn is transcendental. Baker [17J also obtained a qual1titative extension of theorem (3): Suppose that we are given integers A ?o 4, d?o 4 and nonzero algebraic numbers rx" ..., rx n (n?o 21 whose heights and degrees do not exceed A and d, respectively. Suppose further that 0 < 15:( 1, and let log rx" .., , log rx n be the principal values of the logarithms. If there exist rational in- tegers b l , ... ,b n with absolute value at most H such that 0<lb 1 10grx l +... +bnlogrxnl <e- Mt , then H «4 n1 15-' d 2n logA)(2n+,)2. This theorem has extensive applications in various problems of number theory, including a wide class of tDiophantine problems [19]. A number of new, interesting results on the algebraic independence of values of exponen- tial functions, elliptic functions, and some other special functions have been obtained 
1633 recently by D. Masser, G. V. Chudnovskii, M Waldschmidt, and other writers. In particular, Chudnovskii (1975) obtamed the remarkable result that r(1/3) and r(1/4) are transcenden- tal numbers See [20-24]. References [IJ A. O. Gel'fond, Transcendental and alge- braic numbers, Dover, 1960. (Original in Russian, 1952.) [2J T Schneider, infiihrung in die trans- zendenten Zahlen, Spnnger, j 957. [3J c. L. Siegel, Uber einige Anwendungen diophantischer Approximationen, Abh. Preuss. Akad Wiss., 1929 (Gesammelte Abhandlun- gen, Springer, 1966, vo!. 1,209-266). [4J C. L. Siegel, Transcendental numbers, Ann. Math Studies, Princeton U niv. Press, 1949. [5J N. I Fel'dman, Approximation of certain transcendental numbers I, Amer. Math. Soc. Trans!., 59 (1966), 224-245. (Original in Rus- sian, 1951.) [6J W. J. LeVeque, Note on S-numbers, Proc. Amer. Math. Soc., 4 (1953),189-190. [7J W J. LeVeque, On Mahler's U-numbers, J London Math. Soc., 28 (1953), 220-229. [8J K. Mahler, Arithmetische Eigenschaften einer Klasse von Dezimalbriichen, Proc. Acad. Amsterdam, 40 (1937),421-428. [9J K. Mahler. Uber die Dezimalbruchent- wicklung gewisser Irrationalzahlen, Mathe- matica B, Zutphen, 6 (1937),22-36 [IOJ T. Schneider, Transzendenzuntersuchun- gen periodischer Funktionen I, J Reine Angew Math., 172 (1935), 65-69. [llJ T. Schneider, Arithmetische Untersu- ch ungen elliptischer I ntegrale, Math. Ann., 113 (1936),1-13 [12J T. Schneider, Zur Theorie der Abelschen Funktionen und Integrale, J. Reine Angew Math., 183(1941),110-128. [13J S. Lang, Introduction to transcendental numbers, Addison-Wesley, 1966. [14] V. G. Sprindzhuk, A proof of Mahler's conjecture on the measure of the set of S- numbers, Amer. Math. Soc. Trans!., 51 (1966), 215-272. (Original in Russian, 1965.) [15J V. G. Sprindzhuk, Mahler's problem in metric number theory, Amer. Math. Soc. Trans!. Math. Monographs, 1969. (Original m Russian, 1967.) [16J W. M. Schmidt, T-numbers do exist, Rend. Convegno di Teoria dei Numeri, Roma, 1968. [17J A. Baker, Linear forms in the logarithms of algebraic numbers I, II, III, IV, Mathe- matika, 13 (1966), 204-216,14 (1967),102- 107; 14 (1967),220-228; 15 (1968), 204-216. 431 A Transformation Groups [18] N. 1. Fel'dman and A. B. Shidlovskii, The development and present state of the theory of transcendental numbers, Russian Math. Sur- veys, 22 (1967), 1-79. (Original in Russian, 1967.) [19J A. Baker, Transcendental number theory, Cambridge Univ. Press, 1975. [20J D. Masser, Elliptic functions and tran- scendence, Lecture notes in math. 437, Springer, 1975 [2lJ G. V. Chudnovskii, Algebraic indepen- dence of values of exponential and elliptic functions, Proc. Int. Congr. Math., Helsinki, 1978. [22] A. Baker and D. Masser (eds.), Tran- scendence theory: Advances and applications, Academic Press, 1977. [23J M. Waldschmidt, Transcendence meth- ods, Queen's papers in pure and app!. math. 52, Queen's Univ., 1979. [24J M. Waldschmidt, Nombres transcendants et groups algebriques, Asterisque, 69-70 (1980). 431 (IX.19) Transformation Groups A. Topological Transformation Groups Let G be a group, M a set, and f a mapping from G x Minto M. Put f(g, x)=g(X)(gEG, x EM). Then the group G is said to be a trans- formation group of the set M if the following two conditions are satisfied: (i) e(x)=x (xEM), where e is the identity element of G; and (ii) (gh)(x) = g(h(x)) (XE M) for any g, hEG. In this case the mapping x->g(x) is a one-to-one mapping of M onto itself Let G be a transformation group of M. If G is a topological group, M a topological space, and the mapping (g, x)->g(x) a continuous mapping from G x Minto M, then G is called a topological transformation group of M. In this case x->g(x) is a homeomorphism of M onto itself. The mapping (g, x)->g(x) is called an action of G on M. The space M, together with a given action of G, is called a G-space. F or a point x of M, the set G(x) = {g(x) I gE G} is called the orbit of G passing through the point x. Defining as equivalent two points x and y of M belonging to the same orbit, we get an equivalence relation in M. The quotient space of M by this equivalence relation, de- noted by M /G, is called the orbit space of G-space M. If G(x) = {x}, then x is called a fixed point The set of all fixed points is denoted by MG. For a point x of M, the set Gx={YEGlg(x)= 
431 B Transformation Groups x} is a subgroup of G called the isotropy subgroup (stabilizer, stability subgroup) of G at the point x. A conjugacy class of the subgroup G x is called an isotropy type of the transforma- tion group G on M. The group G is said to act nontrivially (resp. trivially) on M if M =1= M G (resp. M = M 6 ). The group G is said to act freely on M if the iso- tropy subgroup G x consists only of the identity element for any point x of M. The group G is said to act transitively on M if for any two points x and y of M, there exists an element g E G such that g(x) = y. Let N be the set of all elements g E G such that g(x) = x for all points x of M. Then N is a normal subgroup of G. If N consists only of the identity element e, we say that G acts effec- tively on M, and if N is a discrete subgroup of G, we say that G acts almost effectively on M. When N=I={e}, the quotient topological group G/N acts effectively on M in a natural fashion. An equivariant mapping (equivariant map) (or a G-mapping, G-map) h:X--> Y between G- spaces is a continuous mapping which com- mutes with the group actions, that is, h(g(x) = g(h(x)) for all gEG and XEX. An equivariant mapping which is also a homeomorphism is called an equivalence of G-spaces. For a G-space M, an equivalence class of the G-spaces G(x), x E M, is called an orbit type of the G-space M. B. Cohomological Properties We consider only tparacompact G-spaces and tCech cohomology theory in this section. We shall say that a topological space X is tinitistic if every open covering has a finite-dimensional refinement. The following theorems are useful [1-3]. (1) If G is finite, X a finitistic paracompact G-space, and K a field of characteristic zero or prime to the order of G, then the induced homomorphism 7[*: H*(X /G; K)-->H*(X; K)G is an isomorphism. Here, 7[ is a natural projec- tion of X onto X/G. The group G acts natu- rally on H*(X; K), and H*(X; K)G denotes the fixed-point set of this G-action. (2) Let X be a finitistic G-space and G cyclic of prime ordor p. Then, with coefficients in Z/pZ, we have "£ 00 (a) for each II I rank Hi(X G )  I rank Hi(X), i=n i=n (b) X(X)+(p-l)X(X G )=PX(X/G). Here the tEuler-Poincare characteristics X( are defined in terms of mod p cohomology. (3) Smith's theorem: If G is a p-group (p prime) and if x is a finitistic G-space whose modp cohomology is isomorphic to the 11- 1634 sphere, then the mod p cohomology of the fixed-point set X G is isomorphic to that of the r-sphere for some -1  r  II, where ( -1)- sphere means the empty set. (4) Let T k denote the k-dimensional toral group. Let X be a Tk-space whose rational cohomology is isomorphic to the l1-sphere, and assume that there are only a finite number of orbit types and that the orbit spaces of all subtori are finitistic. Let H be a subtorus of T k . Then by the above theorem the rational coho- mology of X H is isomorphic to that of the r(H)-sphere for some -1 r(H)I1. Assume further that there is no fixed point of the Tk_ action. Then, with H ranging over all subtori of dimension k-l, we have 11 + 1 = I (r(H) + 1). H C. Differentiable Transformation Groups Suppose that the group G is a transformation group of a tdifferentiable manifold M, G is a tLie group, and the mapping (g, x)-->g{x) of G x Minto M is a differentiable mapping. Then G is called a differentiable tr:msformation group (or Lie transformation group) of M, and M is called a differentiable G-manifold. The following are basic facts about compact differentiable transformation groups [3,4]: (5) Differentiable slice theorem: Let G be a compact Lie group acting differen1iably on a manifold M. Then, by averaging an arbitrary tRiemannian metric on M, we may have a G- invariant Riemannian metric on M. That is, the mapping x -->g(x) is an tisometry of this Riemannian manifold M for each !lEG. For each point x E M, the orbit G{x) through x is a compact submanifold of M and the mapping g g(x) defines a G-equivariant diffeomor- phism G/Gx;::;,G(x), where G/G x is the left quo- tient space by the isotropy subgroup G x . G x acts orthogonally on the ttangent space TxM at x (resp. the tnormal vector space N x of the orbit G(x)); we call it the isotropy representa- tion (resp. slice representation) of G x at x. Let E be the tnormal vector bundle of the orbit G(x). Since G acts naturally on E as a bundle map- ping, the bundle E is equivalent to the bundle (G x NxJ/G x over G/G x as a tG-vector bundle, where G x acts on N x by means of the slice representation and G x acts on G by the right translation. We can choose a small positive real number e such that the texponential mapping gives an equivariant tdiffeomorphism of the e-disk bundle of E onto an invariant ttubular neighborhood of G(x). (6) Assume that a compact Lie group G acts differentiably on M with the orbit :,pace M* = M /G connected. Then there exists :l maximum 
1635 orbit type G/H for G on M (i e., H is an iso- tropy subgroup and H is conjugate to a sub- group of each isotropy group). The union M(H) of the orbits of type G / H is open and dense in M, and its image M(J) in M* is connected. The maximum orbit type for orbits in M guaranteed by the above theorem is called the principal orbit type, and orbits of this type are called principal orbits. The corresponding isotropy groups are called principal isotropy groups. Let P be a principal orbit and Q any orbit. If dim P > dim Q, then Q is called a sin- gular orbit. IfdimP=dimQ but P and Q are not equivalent, then Q IS called an exceptional orbit. (7) Let G be a compact Lie group and M a compact G-manifold. Then the orbit types are finite in number. By applying (5) and (6) we have that an iso- tropy group is principal if and only if its slice representation is trivial. The situation is quite different in the case of noncompact transformation groups For example, there exists an analytic action of G = SL(4, R) on an analytic manifold M such that each orbit of G on M is closed and of codimension one and such that, for x, J'E M, G x is not isomorphic to G y unless x and y lie on the same G-orbit [5]. D. Compact Differentiable Transformation Groups Many powerful techniques in tdifferential topology have been applied to the study of differentiable transformation groups For example, using the techniques of tsurgery, we can show that there are infinitely many free differentiable circle actions on thomotopy (2n + 1 )-spheres (n  3) that are differentiably ineqUivalent and distinguished by the rational tPontryagin classes of the orbit manifolds (W. C. Hsiang [6J). Also, using tBrieskorn varieties, we can construct many examples of differentiable transformation groups on homo- topy spheres [3,4,7]. Differentiable actions of compact connected Lie groups on homol- ogy spheres have been studied systematically (Hsiang and W. Y. Hsiang [4J). The Atiyah-Singer tindex theorem has many applications in the study of transfor- mation groups. The following are notable applications: (8) Let M be a compact connected toriented differentiable manifold of dimension 4k with a tspin-structure. If a compact connected Lie group G acts differentiably and nontriv- ially on M, then the A-genus (.qf(M), [M] > of M vanishes (where ,M) denotes the tA- characteristic class of M) (M. F. Atiyah and F 431 E Transformation Groups Hirzebruch [8J, K. Kawakubo [9J). For fur- ther developments, see A. Hattori [IOJ (9) Let M be a closed oriented manifold with a differentiable circle action. Then each con- nected component Fk of the fixed point set can be oriented canonically, and we have I(M)= 'L.!(F k ), k where I( ) denotes the tThom-Hirzebruch index [8,9]. Let G be a compact Lie group and G...... EG......BG the tuniversal G-bundle. Then the tsingular cohomology H*(EG x GX) is called equivariant cohomology for a G-space X and is an H*(BG)-module. Let G=U(I), M a dif- ferentiable U (1 )-manifold, F = M G , and i: F...... M the inclusion mapping. Then the tlocaliza- tion of the induced homomorphism S-I i*:S-1 H*(EG x GM)......S-I H*(BG x F) is an isomorphism, where S-1 denotes the localization with respect to the multiplicative set S = {at k } with a, k ranging over all posi- tive integers and t the generator of H 2 (BG). Theorems (8) and (9) can be proved by the above localization isomorphism. Let M be a differentiable manifold. The upper bound N(M) of the dimension of all the compact Lie groups that acts effectively and differentiably on M is called the degree of symmetry of M. It measures, in some crude sense, the symmetry of the differentiable mani- fold M. The number N(M) depends heavily on the differentiable structure. For example, N(sm) =m(m+ 1)/2 for the standard m-sphere, but N(L'm) «m+ 1)2/16+5 for a thomotopy m-sphere (m  300) that does not bound a tn-manifold [11]. Also, N(Pn(C))=n(n+2) for the complex projective n-space Pn(C), but N(hPn(C)) «n+ 1)(n+2)/2 for any homotopy complex projective n-space hPn(C) (n  13) other than Pn(C) (T. Watabe [12J). Let X be a differentiable closed manifold and h: X...... Pn(C) be an orientation-preserving thomotopy equivalence. There is a conjec- ture about the total A-classes that states: If X admits a nontrivial differentiable circle action, then d(X)=h*,qf(Pn(C)) (T. Petrie [13J). It is known that if the action is free outside the fixed-point set, then the conjecture is true (T. Yoshida [14J). E. Equivariant Bordism Fix a compact Lie group G; a compact ori- ented G-manifold (1jJ, M) consists of a compact toriented differentiable manifold M and an orientation-preserving differentiable G-action IjJ:Gx M......M on M. 
431 F Transformation Groups Given families F F' of subgroups of G, a compact oriented G-manifold (if;, M) is (F, F')- free if the following conditions are satisfied: (i) if XE M, then the isotropy group G x is conju- gate to a member of F; (ii) if xEoM, then G x is conjugate to a member of F'. If F' is the empty family, then necessarily iJ M is empty and M is closed. In this case we say that (if;, M) is F-free. Given (if;,M), define -(if;,M)=(if;, -M) with the structure precisely the same as (if;, M) except for torientation. Also define o(if;, M)= (if;, oM). Note that if (if;, M) is (F, F')-free, then (if;, oM) is F'-free. Define (if;, M) and (if;', M') to be isomorphic if there exists an equivariant orientation-preserving diffeomorphism of M onto M'. An (F, F')-free compact oriented n- dimensional G-manifold (if;, M) is said to bord if there exists an (F, F)-free compact oriented (n+ I)-dimensional G-manifold (<I>, W) to- gether with a regularly embedded compact n- dimensional manifold M I in oW with M, invariant under the G-action <I> such that (<I>, M,) is isomorphic to (if;, M) and G x is con- Jugate to a member of F' for x E a W - M I . Also, M, is required to have its orientation in- duced by that of W. We say that (if;I, M,) is bordant to (if;2' M z ) if the disjoint union (if;I, M,) + (if;z, - M z ) bords. Bordism is an equivalence relation on the class of (F, F')-free compact oriented n-dimensional G-manifolds The bordism classes constitute an Abelian group O(F, F') under the oper- ation of disjoint union. If F' is empty, denote the above group by O(F). The direct sum O(F, F')= EBO(F, F') is naturally an Q-module, where Q is the toriented cobordism ring. If F consists of all subgroups of G, then O(F) is denoted by O. Suppose now that FF' are fixed families of subgroups of G. Every F' -free G-manifold is also F-free, and so this inclusion induces a homomorphism cx::O(F')->O(F). Simi- larly every F-free G-manifold is also (F, F')- free, inducing a homomorphism f3:0(F)-> O(F, F'). Finally, there is a homomorphism ?'O(F,F')->O_dF') given by 8(if;,M)= (if;, aM) Then the following sequence is exact [15]: .., O(F')O(F).!!.O(F, F')O_I (F')':' . A weakly almost complex compact G- manifold (if;, M) consists of a tweakly almost complex compact manifold M and a differ- entiable G-action if;: G x M -> M that preserves the weakly almost complex structure on M. UZ(F, F'), U are defined similarly, and they are U*-modules, where U* is the tcomplex 1636 cobordism ring of compact weakly almost complex manifolds. To study O and U:, (co)bordisrn theory is introduced (P E. Conner and E. E Floyd [16J), which is one of the tgeneralized (co)- homology theories. Miscellaneous results are known, in particular, for G a cyclic group of prime period. By means of the equivariant tThom spectrum, equivariant cobordism theory can be developed (T tom Dieck [17J); this is a multiplicative generalized cohomology theory with Thorn classes ( 114 Differential Topology; also  201 Homology Theory, 56 Characteristic Classes). F. Equivariant Homotopy Let G be a compact Lie group. On the category of closed G-manifolds, we say that two objects M, N are x-equivalent if X(M H ) = X(N ll ) for all closed subgroups H of G, where X( ) is the tEuler-Poincare characteristic. On the set of equivalence classes A(G), a ring structure is imposed by disjoint union and the Cartesian product. We call A(G) the Burnsid{ ring of G. If G is finite, A(G) is naturally isomorphic to the classical Burnside ring of G [18]. Denote by S( V) the unIt sphere of an or- thogonal G-representation space V Let V, W be orthogonal G-representation spaces. The equivariant stable homotopy group [[S(V),S(W)]J, which is defined as the dIrect limit of the equivariant homotopy sets [S(V + U), S(W + U)JG taken over orthogonal G- representation spaces U and suspensions, IS denoted by W x for rx= V - WERO(G). The tsmash product of representatives] nduces a bilinear pairing W x x UJp->wx+p. Then W o is a ring, and W x is an wo-module. The ring W o is isomorphic to the Burnside ring of G, and W x is a tprojective wo-module of rank one. The W o - module W x is free if and only if S(V) and S(W) are stably G-homotopy equivalent [18]. Let E be an orthogonal G-vector bundle over a compact G-space X. Denote by S(E) the sphere bundle associated with E. Let E, F be orthogonal G-vector bundles over X. Then E and F have the same spherical G-fiber homo- topy type if there exist fiber-preserving G- mappings {: S(E)->S(F), {': S(F)-> S(E) and fiber-preserving G-homotopies h,:S(E)->S(E), 11;:S(F)->S(F) such that 11 0 = {'of, hI = identity, 11=fo{', 11', = identity Let KOdX) be the tequivariant K-group of real G-vector bundles over X. Let TdX) be the addItive subgroup of KOdX) generated by elements of the form [EJ - [FJ, where E and F are orthogonal G-vector bundles having the same spherical G-fiber homotopy type. The factor group JG(X)= KOr;(X)/TG(X) and the natural projection 
1637 JG:KOG(X)-->JdX) are called an equivariant J-group and an equivariant J-homomorphism, respectively ( 237 K-Theory). In particular, Jd {xo}) is a factor group of the real representation ring RO(G) tAdams operations on representation rings are the main tools for studying the group J G( {xo} ) [18J G. Infinitesimal Transformations Let (: G x M --> M be a differentiable action of a Lie group G on a differentiable manifold M. Let X be a tleft invariant vector field on G. Then we can define a differentiable vector field f +(X) on Mas r(X)qh=limlh(J(exp( -tX),q))-h(q))/t ,o for each q E M and any differentiable function h defined on a neighborhood of q. It is easy to see that f +(X)q = 0 if and only If q is a fixed point of the one-parameter subgroup {exp(tX)}. A vector field r+(X) is called an infinitesimal transformation of the differenti- able transformation group G The set g of all infinitesimal transformations of G forms a finite-dimensional tLie algebra (the laws of addition and tbracket product are defined from those for the vector fields on M). If G acts effectively on M, g is isomorphic to the Lie algebra of the Lie group G ( 249 Lie Groups). In fact, the correspondence X --> r+(X) defines a Lie algebra homomorphism r+ from the Lie algebra of all left invariant vector fields on G into the Lie algebra of all differentiable vector fields on M [19]. The following fact [20J is useful for the study of noncom pact real analytic transfor- mation groups. Let g be a real tsemislmple Lie algebra and p: g --> L( M) be a Lie algebra homomorphism of g into a Lie algebra of real analytic vector fields on a treal analytic mani- fold M. Let p be a point at which the vector fields in the image p(g) have common zero. Then there exists an analytic system of coordi- nates (U; U I" " urn) with origin at p in which all the vector fields in p(g) are linear. Namely, there exists aijE g* = HomR(g, R) such that a p(X)q=-2:>u(X)u j (q)-;;-; XEg, qEU. i,j (lU j The correspondence X -->(aij(X)) defines a Lie algebra homomorphism of g into sl(m, R). For example, we can show that a real ana- lytic SL(n, R) action on the m-sphere is charac- terized by a certain real analytic vector field on (m-n+ I)-sphere (5:<Sn:<sm:<S2n-2) [21]. In particular, there are infinitely many (at least the cardinality of the real numbers) inequivalent 431 H Transformation Groups real analytic SL(n, R) actions on the m-sphere (3:<Sn:<Sm). Conversely, let g be a finite-dimensional Lie algebra of vector fields on M. Although there is not always a differentiable transformation group G that admits g as its Lie algebra of infinitesimal transformations, the following local result holds. Let G be the tsimply con- nected Lie group corresponding to the Lie alge- bra g. Then for each point x of M, there exist a neighborhood 0 of the identity element e of G, neighborhoods V, W (V c W) of x, and a differentiable mapping r of 0 x V into W with the following properties. Putting f(g,y)=g(y) (gE O,YE V), we have: (i) For all YE V, e(y) = y (ii) If g, hE 0, YE V, then (gh)(y) = g(h(y)), pro- vided that ghE 0, h(Y)E V. (iii) Let X be an arbitrary element of g. Put g, = exp( - tX), the corresponding one-parameter subgroup of G. If I-: > 0 is taken small enough, then we have g,E 0 for It I <I-: so that g,(y)(ltl < 1-:, YE V) is well defined. Therefore g, determines a vector field X on V by the formula Xyh =!im (h(g,(y)) - h( y))/t. ,o The vector field X coincides with the restric- tion of X to V. This local proposition IS often expressed by the statement that 9 generates a local Lie group of local transformations, which is called Lie's fundamental theorem on local Lie groups of local transformations. H. Criteria It is important to know whether a given trans- formation group is a topological or a Lie transformation group. The following theorems are useful for this purpose [22,23]: (10) Let G be a transformation group of a tlocally compact Hausdorff space M. If we introduce the tcompact-open topology in G, then G is a topological transformation group of M when M is locally connected or M is a tuniform topological space and G acts tequicontinuously on M. (11) Suppose that M is a tCI-manifold and G is a topological transformation group of M acting effectively on M. If G is locally compact and the mapping x-->g(x) of M is of class C' for each element g of G, then G is a Lie trans- formation group of M. (12) Assume that G is a transformation group of a differentiable manifold M and G acts effectively on M. Let g be the set of all vector fields on M defmed by one-parameter groups of transformations of M contained in G as subgroups. If g IS a fimte-dlmenslOnal Lie algebra, then G has a Lie group structure with respect to which G is a Lie transformation group of M, and then 9 coincides with the Lie 
431 Ref. Transformation Groups algebra formed by the infinitesimal transfor- mations of G. By applying theorems (10), (11), and (12) we can show that the following groups are Lie transformation groups: the group of all tiso- metries of a tRiemannian manifold; the group of all affine transformations of a differentiable manifold with a t1inear connection (generally, the group of all transformations of a differenti- able manifold that leave invariant a given tCartan connection); the group of all analytic transformations of a compact complex mani- fold (this group is actually a complex Lie group); and the group of all analytic (holomor- phic) transformations of a bounded domain inC". For related topics  105 Differentiable Manifolds, 114 Differential Topology, 122 Discontinuous Groups, 153 Fixed-Point Theorems, 427 Topology of Lie Groups and Homogeneous Spaces, etc. References [IJ A. Borel et aI., Seminar on transformation groups, Ann. Math. Studies, Princeton Univ. Press, 1960. [2J W. Y. Hsiang, Cohomology theory of topological transformation groups, Erg. math. 85, Springer, 1975. [3J G. E. Bredon, Introduction to compact transformation groups, Academic Press, 1972. [4J W. C. Hsiang and W. Y. Hsiang, Differen- tiable actions of compact connected classical groups I, II, III, Amer. J. Math. 89 (1967); Ann. Math., (2) 92 (1970); Ann. Math., (2) 99 (1974). [5J R. W. Richardson, Deformations of Lie subgroups and the variation of isotropy sub- groups, Acta Math., 129 (1972). [6J W. C. Hsiang, A note on free differentiable actions of SI and S3 on homotopy spheres, Ann. Math., (2) 83 (1966). [7J F. Hirzebruch and K. H. Mayer, O(n)- Mannigfaltigkeiten, Exotische Spharen und Singularitaten, Lecture notes in math. 57, Springer, 1968. [8J M. F. Atiyah and F. Hirzebruch, Spin- manifolds and group actions, Essays on Topol- ogy and Related Topics, Memoires dedies a Georges de Rham, Springer, 1970. [9J K. Kaakubo, Equivariant Riemann-Roch theorems, localization and formal group law, Osaka J. Math., 17 (1980). [I OJ A. Hattori, SpinC-structures and SI- actions, Inventiones Math., 48 (1978). [IIJ W. C. Hsiang and W. Y. Hsiang, The degree of symmetry of homotopy spheres, Ann. Math., (2) 89 (1969). [12J T. Watabe, On the degree of symmetry of complex quadric and homotopy complex 1638 projective space, Sci. Rep. Niigata U niv., 11 (1974). [13J T. Petrie, Smooth SI actions on homo- topy complex projective spaces and related topics, Bull. Amer. Math. Soc., 78 (1972). [14J T. Yoshida, On smooth semifree SI ac- tions on cohomology complex projective spaces, Pub I. Res. Inst. Math. Sci., 11 (1976). [15J P. E. Conner and E. E. Floyd, Maps of odd period, Ann. Math., (2) 84 (1966). [16] P. E. Conner and E. E. Floyd, Differenti- able periodic maps, Erg. Math., 33 (1964). [17J T. tom Dieck, Bordism of G-rnanifolds and integrality theorems, Topology, 9 (1970). [18J T. tom Dieck, Transformation groups and representation theory, Lecture notes in math. 766, Springer, 1979. [19J R. S. Palais, A global formulation of the Lie theory of transformation groups, Mem. Amer. Math. Soc., 22 (1957). [20J V. Guillemin and S. Sternberg, Remarks on a paper of Hermann, Trans. Amer. Math. Soc., 130 (1968). [21J F. Uchida, Real analytic SL(n, R) actions on spheres, T6hoku Math. J., 33 (1981). [22J D. Montgomery and L. Zippin, Topolog- ical transformation groups, Interscience, 1955. [23J S. Kobayashi, Transformation groups in differential geometry, Erg. math. 70, Springer, 1972. 432 (VI.8) Trigonometry A. Plane Trigonometry Fix an orthogonal frame O-X Y in a plane, and take a point P on the plane such that the angle POX is Ct:. Denote by (x,y) the coordinates of P, and put OP=r (Fig. 1). We call the six ratios sinCt:= y/r, cosCt:=x/r, tanCt:= y/x, cot Ct: = x/y, sec Ct: = r/x, cosec Ct: = r/y the sine, cosine, tan- gent, cotangent, secant, and cosecant of Ct:, re- spectively. These functions of the angle Ct: are called trigonometric functions or circular func- tions ( 131 Elementary Functions). They are periodic functions with the fundamental period n for the tangent and cotangent, and 2n for the others. The relation sin 2 Ct: + cos 2 Ct: = 1 and the addition formulas sin(Ct::tfi)=sinCt:cosfi :t cosct: sinfi, cos(Ct::t fi) =cos Ct:cos fi += sinCt:sinfi follow from the definitions ( Appendix A, Table 2). Given a plane triangle ABC (Fig. 2), we have the following three proper1ies: (i) a = b cos C + c cos B (the first law of cosines); (ii) a 2 = b 2 + c 2 - 2bc cos A (the second law of cosines); (iii) a/sin A = b/sin B = c/sin C = 2R, where R is the radius of the circle circum- 
1639 scribed about L.ABC (laws of sines) ( Ap- pendix A, Table 2). Thus we obtain relations among the six quantities a, b, e, LA, L B, and L C associated with the triangle ABC. The study of plane figures by means of trigono- metric functions is called plane trigonometry. For example, if a suitable combination of three of these six quantities (including a side) associated with a triangle is given, then the other three quantities are uniquely determined. The determination of unknown quantities associated with a triangle by means of these laws is called solving a triangle. P(x,y) .a I I :  I () x .\ Fig. 4  Jj c Fig. 2 B. Spherical Trigonometry The part ABC of a spherical surface bounded by three arcs of great circles is called a spher- ical triangle. Points A, B, C are called the vertices; the three arCS a, b, e are called the sides; and the angles formed by lines tangent to the sides and intersecting at the vertices are called the angles of the spherical triangle (Fig. 3). If we denote the angles by A, B, C, we have the relation A +B+ C-n=E>O, and E is called the spherical excess. Spherical triangles have properties similar to those of plane trian- gles: sin a/sin A = sin b/sin B = sin e/sin C (laws of sines), and cosa =cos bcose + sinb sinecos A (law of cosines). The study of spherical figures by means of trigonometric functions, called spherical trigonometry, is widely used in astron- omy, geodesy, and navigation ( Appendix A, Table 2) Fig. 3 433 Turbulence and Chaos C. History Trigonometry originated from practical prob- lems of determining a triangle from three of its elements. The development of spherical trigonometry, which was spurred on by its applications to astronomy, preceded the devel- opment of plane trigonometry. In Egypt, Babylon, and China, people had some knowl- edge of trigonometry, and the founder of trigonometry is believed to have been Hippar- chus of Greece (fl. 150 B.C.). In the Almagest of Ptolemy (c. 150 A.D.) we find a table for 2 sin IX for IX = 0,30', 1°, 1°30',... that is exact to five decimal places, and the addition for- mulas. The Greeks calculated 2 sin IX, which is the length of the chord corresponding to the double arc. Indian mathematicians, on the other hand, calculated half of the above quan- tities, that is, sin IX and I - COS IX for the arc IX. In the book by Aryabhatta (c. 500 A.D.) we find laws of cosines. The Arabs, influenced by Indian mathematicians, expressed geometric computations algebraically, a technique also known to the Greeks. AbuI Wafa (in the latter half of the 10th century A.D.) gave the correct sines of angles for every 30' to 9 decimal places and studied with AI Battani the projection triangle of the sundial, thereby obtaining the concepts of sine, cosine, secant, and cosecant. Later, a table of sines and cosines for every minute was established by the Arabs. Regio- montanus (d. 1476), a German, elaborated on this table. The form he gave to trigonometry has been maintained nearly intact to the pre- sent day. Various theorems in trigonometry were established by G. J. Rhaeticus, 1. Napier, J. Kepler, and L. Euler (1748). Euler treated trigonometry as a branch of analysis, gen- eralized it to functions of complex variables, and introduced the abbreviated notations that are still in use ( 131 Elementary Functions). References [IJ H. Flanders and J. Price, Trigonometry, Academic Press, 1975. [2J A. Washington and C. Edmond, Plane trigonometry, Addison-Wesley, 1977. 433 (XX.12) Turbulence and Chaos Turbulent flow is the irregular motion of fluids, whereas relatively simple types of flows that are either stationary, slowly varying, or peri- odic in time are called laminar flow. When 
433 A Turbulence and Chaos a laminar flow is stable against external dis- turbances, it remains laminar, but if the flow is unstable, it usually changes into either another type of laminar flow or a turbulent flow. A. Stability and Bifurcation of Flows The velocity field u(x, t), x being the space coordinates and t the time, of a flow of an incompressible VISCOUS fluid in a bounded domain G is determined by the tNavier-Stokes equation of motion, Du 1 -+(u' grad)u-vu+-gradp=O, (I) ct p and the equation of continuity, dlVU=O, with the prescribed initial and boundary con- ditions, where  denotes the Laplacian, p the pressure, p the density, and v the kinematic vicosity of the fluid Suitable extensions must be made in the foregoing system of equations if other field van ables, such as the temperature in thermal-convection problems, are to be considered. The stability of a fluid flow is studied by examining the behavior of the solution of equations (I) and (2) against external distur- bances, and, in particular, stability against infinitesimal disturbances constitutes the linear stability problem. The stability characteristics of the solution of equations (1) and (2) depend largely upon the value of the tReynolds num- ber R = U L/v, U and L being the representa- tive velocity and length of the flow, respec- tIvely. Let a stationary solutIOn of equations (I) and (2) be uo(x, R). If the perturbed flow is given by uo(x, R) + v(x, R)exp(m), v being the perturbation velocity, and equation (I) is linearized with respect to v, we obtain a tlinear eigenvalue problem for (J. The flow is called linearly stable if max(Re 0-) is negatIve, and linearly unstable if it is positive. For small values of R, a flow is generally stable, but it becomes unstable if R exceeds a critical value Reo which is called the critical Reynolds num- ber [I]. The instability of a stationary solution gives rise to the tbifurcation to another solutIOn at a tbifurcation point Re of the parameter R. Tf 1m 0-=0 for an eigenvalue 0- at R "" R" a stationary solution bifurcates from the solu- tion U o at Reo and ifImo-#O, a time-periodic solution bifurcates at Re. The latter bifurca- tion is called the Hopf bifurcation. A typical example of stationary bifurcation is the gener- ation of an axially periodic row of Taylor vortices in Couette flow between two rotating coaxial cylinders, which was studied by G I. 1640 (2) Taylor (1923), with excellent agreement be- tween theory and experiment [2]. Hopf bifur- cation is exemplified by the generation of Tollmien-Schlichting waves in the laminar tboundary layer along a flat plate, which was predicted theoretically by W. Tollmien (1929) and H Schlichting (J 933) and later confirmed experimentally by G. B. Schubauer and H. K. Skramstad (J 947) [3]. In either type of bifurcation (1m 0- = 0 or # 0) the bifurcation is called supercritica I if the bifurcating solution exists only for R > Re, subcritical if It exists only for R < R" and transcritical if it happens to exist on both sides of Re. The amplitude of the departure of the bifurcating solution from the unperturbed solution Uo tends to zero as R-->Rc. The be- havior of the bifurcating solution around the bifurcation point Rc is dealt with systemat- ically by means of bifurcation analysis. In supercritical bifurcation, the bifurcating solu- tion is stable and represents an eqmlibrium state to which the perturbed flow approaches Just as in the cases of Taylor vortices and Tollmien-Schlichting waves. On the other hand, for subcritical bifurcation the bifurcat- ing solution is unstable and gives a critical am- plitude of the disturbance above which the linearly stable basic flow (R < Rc) becomes unstable. In this case, the instability of the basic flow gives rise to a sudden change of the flow pattern resulting in either a stationary (or time-periodic) or even turbulent flow. The transition to turbulent flow that takes place in Hagen-PoiseuiIle flow through a circular tube and is linearly stable at all values of R (Re = 00) may be attributed to this type of bifurcation. The concept of bifurcation can be extended to the case where the flow U o is nonstationary, but the bifurcation analysis then becomes much more difficult. B. Onset of Turbulence The fluctuating flow resulting from ;:In insta- bility does not itself necessarily contitute a turbulent flow. In order that a flow be turbu- lent, the fluctuations must take on some irreg- ularity. The turbulent flow is usually defined in terms of the long-time behavior of the flow velocity u(x, t) at a fixed point x in space. The flow is expected to be turbulent if the fluctuat- ing velocity . 1 i T bu(x, t) = u(x, t) - hm - u(x, t) dt Jx T 0 (3) satisfies the condition I i T lim lim - buJx,t)buJx,t+!)dt=O, ,.,.,Tcx; T 0 (4) 
1641 where the subscripts label the components. Condition (4) implies that the tdynamical system of a fluid has the mixing property. This condition also states that the velocity fluctu- ation i5u j has a continuous frequency spectrum In practical situatIOns the frequency spectrum of a turbulent flow may contain both the line and continuous spectra, in which case the flow is said to be partially turbulent L. D. Landau (1959) and E. Hopf (1948) proposed a picture of turbulent flow as one composed of a tquasiperiodic motion, u(t) = f(w 1 t, W2t,. ., wnt), with a large number of rationally independent frequencies WI' "', W n produced by successive supercritical bifurca- tions of Hopf type. This picture of turbulence is not compatible with the foregoing definition of turbulence, since it does not satisfy the mixing property (4). The fact that the gener- ation of real turbulence is not necessarily preceded by successive supercritical bifurca- tions casts another limitation on the validity of this picture. The concept of turbulence is more clearly exhibited with respect to a dynamical system of finite dimension. Although we are without a general proof, it is expected that the Navier- Stokes equation with nonzero viscosity v can be approximated within any degree of ac- curacy by a system of finite-dimensional tfirst- order ordinary differential equations dX d"t=F(X). Thus the onset and some general properties of turbulence are understood m the context of the theory of tdynamical systems. Turbulence is related to those solutions of equation (5) that tend to a tset in the tphase space that is neither a tfixed point, a tclosed orbit, nor a ttorus. A set of such complicated structure is called a non periodic tattractor or a strange attractor. Historically, the strange attractor originates from the strange Axiom A attrac- tor that was found in a certain class of dy- namical systems called the Axiom A systems. However, this term has come to be used in a broader sense, and It now represents a variety of nonperiodic motions exhibited by a system that is not necessarily of Axiom A type. The above-mentioned Landau-Hopf picture of turbulence was criticized by D. Ruelle and F. Takens (1971), who proved for the dynamical system (5) that an arbitrary small perturbation on a quasiperiodic tflow on a k-dimensional torus (k?4) generically (in the sense of residual sets) produces a flow with a strange Axiom A attractor [4]. There exist a number of examples of first- order ordinary differential equations of rela- tively low dimension whose solutions exhibit 433B Turbulence and Chaos nonperiodic behavior. An important model system related to turbulence is the Lorenz model (1963) of thermal convection in a hori- zontalfluid layer. This model is obtained by taking only three components out of an in- finite number of spatial tFourier components of the velocity and temperature fields. The model is written as dX -= -(JX +(JY dt ' dY -= -XZ+rX - Y dt ' dZ -=XY-bZ dt ' (6) (5) where (J (> b + I) and b are positive constants and r is a parameter proportional to the Ray- leigh number. Obviously, equations (6) have a fixed point X = Y = Z = 0 representing the state of thermal convection without fluid flow For r < I, this fixed point is stable, but it be- comes unstable for r> 1, and a pair of new fixed points X = Y = :f:: j b(r - I), Z = r - 1 emerges supercritically. This corresponds to the onset of stationary convection at r = t. At a still higher value ofr=(J((J+b+3)/((J-b-l), a subcritical Hopf bifurcation occurs with re- spect to this pair of fixed points, and for a certain range of r above this threshold the solutions with almost any initial conditions exhibit non periodic behavior This corre- sponds to the generation of turbulence The property ax a Y oi ax + ay + az = -((J+b+ 1)<0, (7) where the dots denote time derivatives, shows that each volume element of the phase space shrinks asymptotically to zero as the time increases indefinitely. This property is char- acteristic of dynamical systems with energy dissipation, in sharp contrast to the tmeasure- preserving character of tHamiltonian systems [5]. For a certain class of ordinary differen- tial equations, the bifurcation to nonperiodic motion corresponds neither to the bifurcation of tori, just as in the Ruelle- Takens theory, nor to subcritical bifurcation, as in the Lorenz model. Such a bifurcation takes place when nonperiodic motion emerges as the conse- quence of an infinite sequence of supercritical bifurcations at each of which a periodic orbit of period T bifurcates into one of period 2 T. If we denote the nth bifurcation point by f n , the distance r n + 1 - f n between two successive bifurcation points decreases exponentially with increasing n, and eventually the bIfurcation points accumulate at a point fe' beyond which nonperiodic motion is expected to emerge. It is 
433 C Turbulence and Chaos not yet clear if any of the above three types of bifurcation leading to nonperiodic behavior is actually responsible for the generation of real turbulence. Some important properties of a dynamical system with a nonperiodic attractor, which may be either a flow or a tdiffeomorphism, can be stated as follows: (i) The distance between two points in the phase space that are initially close to each other grows exponentially in time, so that the solutions exhibit a sensitive dependence on the initial conditions. (ii) The nonperiodic attractor has tLebesgue measure zero, and such a system is expected to have many other tergodic tinvariant measures. The irregular behavior of a deterministic dynamical system is also called chaos, but this concept is more abstract and general than that of turbulence, and covers phenomena ex- hibited by systems such as nonlinear electric circuits, chemical reactIOns, and ecological systems. C. Statistical Theory of Turbulence The statistical theory of turbulence deals with the statistical behavior of fully developed turbulence. The turbulent field is sometimes idealized for mathematical simplicity to be homogeneous or isotropic. In homogeneous turbulence the statistical laws are invariant under all parallel displacements of the coordi- nates, whereas in isotropic turbulence invar- iance under rotations and reflections of the coordinates is required in addition. The instantaneous state of the fluid motion is completely determined by specifying the fluid velocity u at all space points x and can be expressed as a phase point in the infinite- dimensional tphase space spanned by these velocities. The phase point moves with time along a path uniquely determined by the solu- tion of the Navier-Stokes equation. In the turbulent state the path is unstable to the initial disturbance and describes an irregular line in the phase space. In this situation the deterministic description is no longer useful and should be replaced by a statistical treat- ment. Abstractly speaking, turbulence is just a view of fluid motion as the random motion of the phase point u(x) ( 407 Stochastic Pro- cesses). The equation for the tcharacteristic functional of the random velocity u(x) was first given by E. Hopf (1952). An exact solution obtained by Hopf represents a tnormal distri- bution associated with a white energy spec- trum, but so far no general solution has been obtained [6]. Besides the formulation in terms of the 1642 tprobability distribution or the characteristic functional, there is another way of describing turbulence by tmoments of lower orders. This is the conventional statistical theory originated by G. I. Taylor (1935) and T. von Karm<'m (1938), which made remarkable progress after World War II. The principal moments in this theory are the correlation tensor, whose (i, j)- component is the mean of the product of two velocity components u; at a point'" and u j at another point x + r, B;)r) = <u;(x)Uj(x + r», (8) and its tFourier transform, or the f'nergy spec- trum tensor, <I> ij(k)= f Bij(r)exP( -p k. r)dr. (9) (2n) In isotropic turbulence <I>ij is expressed as 1 ( k.k. ) <I>;)k) =----z E(k) (jij- ' 4nk k k='lkl, (10) where E(k) is the energy spectrum function, representing the amount of energy included in a spherical shell of radius k in the wave num- ber space. The energy of turbulence tff per unit mass is expressed as 1 2 1 I f tff=2<lu! >=2 B ;;(0)=2 <I>;;(k)dk = LX) E(k)dk. (11) The state of turbulence is characterized by the Reynolds number R = EIP /(vktF), where Eo and ko are representative values of E(k) and k, respectively. For weak turbulence of small R, E(k) is governed by a linear equation with the general solution E(k, t)= E(k, O)exp( -2vk 2 t), (12) E(k,O) being an arbitrary function. Thus E(k) decays in time due to the viscous dissipation. For strong turbulence of large R, it is difficult to obtain the precise form of E(k), and this is usually done by way of some assumption that allows us to approximate the nonlinear effects [7]. Some of the similarity laws governing the energy spectrum and other statistical functions can be determined rigorously but not neces- sarily uniquely. For 3-dimensional incompress- ible turbulence, the energy spectrum satisfies an in viscid similarity law E(k)/Eo = Fe(k/ko) (13) in the energy-containing region k == O(ko) char- acterized by a wave number ko, and a viscous similarity law E(k)/Eo = R -5/4 Fd(k/(R3/4k o )), (14) 
1643 in the energy dissipation region k = O(R 3 / 4 k o ), where Fe and Fd denote dimensionless func- tions generally dependent on the initial con- dition and the time [6]. If an assumption is made to the effect that the statistical state in the energy-dissipation re- gion depends only upon the energy-dissipation rate e= -dg/dt besides the viscosity v (or R), then (14) becomes Kolmogorov's equilibrium similarity law (1941): E(k) = el/4 V 5/4 F(k/(e 1/4 v - 3 / 4)), where F is a dimensionless function. For extremely large R (or small v) there exists an inertial subregion between the energy- containing and energy-dissipation regions such that the viscous effect vanishes and (15) takes the form E(k) = Ke 2/3 k- 5/3 , where K is an absolute constant. Kolmogorov's spectrum (16) has been observed expe"imen- tally several times, and now its consistency with experimental results at large Reynolds numbers is well established [8]. Kolmogorov (1962) and others modified (16) by taking account of the fluctuation of e due to the intermittent structure of the energy- dissipation region as E(k)= K'e 2/3 k -5/3(Lk) -P.19, where e is now the average of the fluctuating e, /1 is the covariance of the log-normal distribu- tion of e, and L is the length scale of the spatial domain in which the average of e is taken [8]. A similar modification, with the exponent - /1/3 in place of - /1/9, is obtained using a fractal model of the energy-cascade process. These corrections to E(k), based upon the experimentally estimated /1 of 0.3-0.5, are too small to be detected experimentally, but the deviation is expected to appear more clearly in the higher-order moments [8-10]. It should be noted that Kolmogorov's spec- trum (16) itself does not contradict the notion of intermittent turbulence and gives one of the possible asymptotic forms in the limit R -HJJ. The I-dimensional Burgers model of tur- bulence satisfies the same similarity laws as (13) and (14), but it has an inviscid spectrum E(k)ock- 2 instead of(16). Two-dimensional incompressible turbulence has no energy- dissipation region, and hence Kolmogorov's theory is not valid for this turbulence. It has an inviscid spectrum E(k)ock- 3 , first derived by R. H. Kraichnan (1967), C. E. Leith (1968), and G. K. Batchelor (1969). These inviscid spectra for 1- and 2-dimensional turbulence have been confirmed by numerical simulation [11]. 433 Ref. Turbulence and Chaos References (15) [1] C. C. Lin, The theory of hydro dynamical stability, Cambridge Univ. Press, 1955. [2] S. Chandrasekhar, Hydrodynamic and hydromagnetic stability, Clarendon Press, 1961. [3] H. L. Dryden, Recent advances in the mechanics of boundary layer flow, Adv. Appl. Mech., 1 (1948), 1-40. [4] D. Ruelle and F. Takens, On the nature of turbulence, Comm. Math. Phys., 20 (1971), 167-192. [5] E. N. Lorenz, Deterministic non periodic flow, J. Atmospheric Sci., 20 (1963),130-141. [6] T. Tatsumi, Theory of homogeneous tur- bulence, Adv. Appl. Mech., 20 (1980), 39-133. [7] G. K. Batchelor, The theory of homoge- neous turbulence, Cambridge Univ. Press, 1953. [8] A. S. Monin and A. M. Yaglom, Statistical fluid mechanics II, MIT Press, 1975. [9] B. Mandelbrot, Fractals and turbulence: Attractors and dispersion, Lecture notes in math. 615, Springer, 1977,83-93. [10] B. Mandelbrot, Intermittent turbulence and fractal dimension: Kurtosis and the spec- tral exponent 5/3 + B, Lecture notes in math. 565, Springer, 1977, 121-145. [11] T. Tatsumi, Analytical theories and numerical experiments on two-dimensional turbulence, Theor. Appl. Mech., 29 (1979), 375-393. (16) (17) 
434 A Unified Field Theory 434 (XX.22) Unified Field Theory A. History Unified field theory is a branch of theoretical physics that arose from the success oftgen- eral relativity theory. Its purpose is to dis- cuss in a unified way the fields of gravitation, electromagnetism, and nuclear force from the standpoint of the geometric structure of space and time. Studies have continued since 1918, and many theories of mathematical interest have been published without attaining, how- ever, any conclusive physical theory. A characteristic feature of relativity theory is that it is based on a completely new concept of space and time. That is, in general relativity theory it is considered that when a gravita- tional field is generated by matter, the struc- ture of space and time changes, and the flat tMinkowski world becomes a 4-dimensional tRiemannian manifold (with signature (1,3)) having non vanishing curvature. The tfunda- mental tensor gij of the manifold is interpreted as the gravitational potential, and the basic gravitational equation can be described as a geometric law of the manifold. It is character- istic of general relativity theory that gravita- tional phenomena are reduced to space-time structure ( 359 Relativity). The introduction of the Minkowski world in tspecial relativity theory was a revolutionary advance over the 3-dimensional space of Newtonian mechanics. But the inner structure of the Minkowski world does not reflect gravitational phenom- ena. The latter shortcoming is overcome by introducing the concept of space-time repre- sented by a Riemannian manifold into general relativity theory. When a coexisting system of gravitational and electromagnetic fields is discussed in gen- eral relativity theory, simultaneous equations (Einstein-Maxwell equations) must be solved for the gravitational potential Yij and the electromagnetic field tensor Fij. Thus the gravitational potential gij is affected by the existence of an electromagnetic field. As the validity of general relativity began to be ac- cepted, it came to be expected that all physical actions might be attributed to the gravita- tional and electromagnetic fields. Thus various extensions of general relativity theory have been proposed in order to devise a geometry in which the electromagnetic as well as the gravitational field directly contributes to the space-time structure, and to establish a uni- fied theory of both fields on the basis of the geometry thus obtained. These attempts are illustrated in Fig. 1. 1646 4-Jlml'nllHldl fldl "[MCI'  Gr.I\ILlJ1un,tllicIJ 4-ulmt'n<'lonJ.1 Rlem,mOIJ.o mdolfulJ 1___ [leun'Ill,,"ncm r,elJ "rt'luJ feldtl\I!'; lhenf\ '_iener.tl rd,Il!\I!" henr\ _ I1lflc.:J field lhconc... Fig. B. Weyl's Theory The first unified field theory was proposed by H. Weyl in 1918. In Riemannian geometry, which is the mathematical framework of gen- eral relativity theory, the tcovariant derivative of the tfundamental tensor Yij vanishes, i.e., 'Ilig jk == Ogjk/OXi - gjafi - Yakfij =0, (1 ) where IJ is the tChristoffel symbol derived from Yij' Conversely, if IJ is considered as the coefficient of a general taffine connection and (1) is solved with respect to fA under the con- dition IJk = rl j , then the Christoffel symbol derived from gij coincides with IJik' In this sense, (I) means that the space-time manifold has Riemannian structure. On the other hand, Weyl considered a space whose structure is given by an extension of (1), 'Iligjk=2Aigjb (2) and developed a unified field theory by regard- ing Ai as the electromagnetic potential. This theory has mathematical significance in that it motivated the discovery of Cartan's geometry of connection, but it has some unsatisfactory points concerning the derivation of the field equation and the equation of motion for a charged particle. The scale transformation gIven by 9ij = p2 Yij is important in Weyl's theory. If in addition to this transformation, Ai is changed to A;= Ai -alogp/ox i , (3) then (2) is left invariant and the space-time structure in Weyl's theory remains unchanged. We calI (3) the gauge transformation, cor- responding to the fact that the electromagnetic potential Ai is determined by the electro- magnetic field tensor Fij up to a gradient vec- tor. In the tfield theories known at present, the gauge transformation is generalized to various fields, and the law of charge conservation is derived from the invariance of field equations under generalized gauge transformation. C. Further Developments A unified field theory that appeared after Weyl's is Kaluza's 5-dimensional theory (Th. 
1647 Kaluza, 1921). This theory has been criticized as being artificial, but it is logically consistent, and therefore many of the later unified field theories are improved or generalized versions of it. The underlying space of Kal uza's theory is a 5-dimensional Riemannian manifold with the fundamental form ds 2 = (dx 4 + Aadxa)2 + qab dx " dx b , where Ai and qij are functions of Xi alone (a, b, ... , i, j = 0, 1,2,3). The field equation and the equation of motion of a particle are derived from the variational principle in general re- lativity theory. The field equation is equivalent to the Einstein-Maxwell equations. The traJec- tory of a charged particle is given by a geo- desic in the manifold, and its equation is re- ducible to the Lorentz equation in general relativity. After the introduction of Kaluza's theory, various unifield field theories were proposed, and we give here the underlying manifolds or geometries of some mathematically interesting theories: a manifold with taffine connection admitting absolute parallelism (A. Einstein, 1928); a manifold with tproJective connec- tion (0. Veblen, B. Hoffman, 1930 [4J, 1. A. Schouten, D. van Dantzig, 1932); wave geome- try (a theory based on the linearization of the fundamental form; Y. Mimura. 1934 [3J); a nonholonomic geometry (G Vranceanu, 1936); a manifold with tconformal connection (HolT- man, 1948). The investigations since 1945 have been motivated by the problem of the representa- tion of matter in general relativity theory. Einstein first represented matter by an energy- momentum tensor j of class Co, whIch must be determined by information obtained from outside relativity. Afterward he felt that this point was unsatisfactory and tried to develop a theory on the basis of field variables alone, without introducing such a quantity as j' This theory is the so-called unitary field theory, and a solution without singularities is required from a physical point of view. His first attempt was to remove singularities from an exterior solution in general relativity by changing the topological structure of the space-time mani- fold. This idea was then extended to a unified field theory by J. A. Wheeler, and an interpre- tation was given to mass and charge by apply- ing the theory oftharmonic integrals (1957) [2]. Einstein's second attempt was to propose a nonsymmetric unified field theory (1945) [I, Appendix II; 6]. The fundamental quantities in this theory are a nonsymmetric tensor qij and a nonsymmetric affine connection fj The underlying space of the theory can be con- sidered a direct extension of the Riemannian 435 A Uniform Convergence manifold, since (I) is contained in the field equations (notice the order of indices in this equation). E. Schrodinger obtained field equa- tions of almost the same form by taking only fj as a fundamental quantIty (1947) [5]. References [IJ A. Einstein, The meaning of relativity, Princeton U niv. Press, fifth edition, 1956. [2J 1. A. Wheeler, Geometrodynamics, Aca- demic Press, 1962. [3J Y. Mimura and H. Takeno, Wave geome- try, Sci. Rep. Res. Inst. Theoret. Phys. Hiro- shima Univ., no. 2 (1962). [4J O. Veblen, Projektive Relativitatstheorie, Springer, 1933. [5J E. Schr6dinger, Space-time structure, Cambridge Univ. Press, 1950. [6J A. Einstein, A generalization of the relativ- istic theory of gravitation, Ann. Math., (2) 46 (1945),578-584. [7J V. Hlavaty, Geometry of Einstein's unified field theory, Noordhoff, 1957. [8J M. A. Tonnelat, Einstein's unified field theory, Gordon & Breach, 1966 (Original in French, 1955.) 435 (11.23) Uniform Convergence A. Uniform Convergence of a Sequence of Real-Valued Fnnctions A sequence of real-valued functions {fa(x)} defined on a set B is said to be uniformly con- vergent (or to converge uniformly) to a function f(x) on the set B if it converges with respect to the tnorm II qJ II = sup{lqJ(x)11 XE B}, i.e., limn_ocllfn-fll =0 ( 87 Convergence). In other words, {fa (x) } converges uniformly to f (x) on B if for every positive constant /: we can select a number N independent of the point x such that Ifn(x) - f(x)1 < £ holds for all 11 > N and x E B. By the tcompleteness of the real numbers, a sequence of functions { r.(x)] converges uniformly on B if and only if we can select for every positive constant £ a number N independent of the point x such that Ifm(x)- fa(x)1 < £ holds for all m, 11 > Nand XE B. The uniform convergence of a series Ln [n(x) or of an infinite product nafa(x) is defined by the uniform convergence of the sequence of its partial sums or products. If the series of the absolute values La I.r.(x)l converges uniformly, then the series Ln [a(x) also converges uni- formly In this case the series La [n(x) is said to 
435B Uniform Convergence be uniformly absolutely convergent. A sequence of (nonnegative) constants M n satisfymg I f(x)1  M n is called a dominant (or majorant) of the sequence of functions lj(x)}. A series of functions In fn(x) with converging majorant series In M n is uniformly absolutely conver- gent (Weierstrass's criterion for uniform convergence). Let {;'n(x)) be another sequence of functions on B. The series In ;.n(x) fn(x) is uniformly convergent if either of the following conditions holds: (i) The series I.Jn(x) converges uni- formly and the partial sums of the series Lnll.n(x) - ;'n+' (x)1 are uniformly bounded, i.e., bounded by a constant independent of x E B and of the number of terms; or (ii) the series In I ;'n(x) - ;'n+I (x)1 converges umformly, the sequence {/.n(x)} converges uniformly to 0, and the partial sums of InlJ.,(x)I are uniformly bounded. B. Uniform Convergence and Pointwise Convergence Let Un(x)} be a sequence of real-valued func- tions on B, and letf(x) be a real-valued func- tion also defined on B. If the sequence of numbers [f,,(x o )} converges to f(xo) for every point xoEB, we say that {{n(x)} is pointwise convergent (or simply convergent) to the func- tionf(x). Pointwise convergence is, of course, weaker than uniform convergence. If we repre- sent the functIOn f (x) by the point TI XEB f(x) = Lf] of the tCartesian product R B = TI XE8 R, then the pointwise convergence of { (n(x)} to f (x) is equivalent to the convergence of the sequence of points {Un]} to U] in the tprod- uct topology of R B . When B is a ttopological space and every fn(x) is continuous, the pointwise limit f(X) of the sequence { f,,(x)} is not necessarily con- tinuous. However, if the sequence of continuous functions ( (n(x)) converges uniformly tof(x). then the limit functionf(x) is continuous. On the other hand, the continuity of the limit does not imply that the convergence is uniform. If the set B is tcompact and the sequence of continuous functions {,{.(x)} is monotone (i.e" fn(x) fn+,(x) for all n or J.,(x)  f+, (x) for all n) and pointwise convergent to a continuous function{(x), then the convergence is uniform (Dini's theorem). C. Uniform Convergence on a Family of Sets Let B be a topological space. We say that a sequence of functions { fn(:<)} is nniformly convergent in the wider sense to the function 1648 j (x), depending on circumstances, in either of the following two cases: (1) Every pomt Xo E B has a neighborhood U on which the sequence {J(x)} converges uniformly tof(x); or (ii) {fn(x)] converges umformly to j (x) on every compact subset K in B. If B is tlocally com- pact, the two definitions coincide. The term uniform convergence on compact sets is also used for (ii). In general, given a family fJ' of subsets in B, we may introduce in the space ,'j' of real- valued functions on B a family o[tseminorms IlfilK = sup{ I {(x) 1 I XE K} for every set K EfY'. Let T be the topology of ,'j' defined by this family of seminorms ( 424 Topological Linear Spaces). A sequence {J.,(XI} is called nniformly convergent on fJ' if it is convergent with respect to T In particular, when .0/' coin- cides with {B}, { {x} I xEB}, or the family of all compact sets in B, then umform convergence on .-f> coincides with the usual uniform conver- gence, pointwise convergence, or uniform convergence on compact sets, respectively If.-f> is a countable set, the topology Tis tmetri- zable. Most of these definitions and results may be extended to the case of functions whose values are in the complex number field, in a tnormed space, or in any tuniform space D. Topology of the Space of Mappings Let X, Y be two topological spaces. Denote by C(X, Y) the space of all continuous mappings f: X --> Y. This space C(X, Y), or a subspace.Y of C(X, Y), is called a mapping space (or func- tion space or space of continuous mappings) from X to Y. A natural mapping <D: g; x X --> Y is defined by <D(f;X) = f(X)(fE,9-,XEX). We define a topology in ,'7' as folJows: for a com- pact set K in X and an open set U in Y, put W(K, U) = {fEg; If(K)E U}, and introduce a topology in ,'J' such that the base for the to- pology consists of intersections of finite num- bers of W(K;, UJ This topology is called the compact-open topology (R. H. Fox, Bull. Amer. Math. Soc., 51 (1945)). When X is a tlocall y compact Hausdorff space and Yis a tHaus- dorfT space, the compact-open topology is the tweakest topology on g; for which the func- tion <D is continuous. If, in this case, ,'7' IS compact with respect to the compact-open topology, then the compact-open topology coincides with the topology of pointwIse convergence. In particular, when Y is a tmetric space (or, in general, a tuniform space with the uniform- ity U), the compact-open topology in .'7' coin- cides with the topology of uniform conver- gence on compact sets. A family ,Y is called 
1649 equicontinuous at a point x E X if for every positive number E (in the case of uniform space, for every VEU) there exists a neighbor- hood U of x such that pU(xl,[(p))<r. U(x), f( p)) E V) for every point p E U and fOr every functionjE.:F (G. Ascoli. 1883-1884). If X is a tlocally compact Hausdorff space, a necessary and sufficient condition for ;Y to be relatively compact (i.e., for the closure of.:F to be com- pact) with respect to the compact-open top- ology (i.e., to the topology of uniform conver- gence on compact sets) is that .fF be equicon- tinuous at every point XEX and that the set {J(X)I[E.17} be relatively compact in Yfor every point XEX (Ascoli's theorem). In partic- ular, when X is a a-compact locally compact Hausdorff space and Y is the space of real num bers, a family of functions , that are equicontinuous (at every point XEX) and uniformly bounded is relatively compact. Hence, for any sequence of functions { In} in ,y, we can select a subsequence {fn(v») which converges uniformly on compact sets (Ascoli- Arzela theorem). E. Normal Families P. Montel (1912) gave the name normal family to the family of functions that is relatively com- pact with respect to the topology of uniform convergence on compact sets. This terminol- ogy is used mainly for the family of com plex analytic functions. In that case, it is customary to compactify the range space and consider Y to be the tRiemann sphere. Using this notion, Montel succeeded in giving a unified treatment of various results in the theory of complex functions. A family of analytic functions , on a finite- dimensional tcomplex manifold X is a normal family if it is uniformly bounded on each com- pact set (Montel's theorem). Another criterion is that there are three values on the Riemann sphere which no function [E.9' takes. More generally, three exceptional values not taken by fE,? may depend on.[, if there is a positive lower bound for the distances between these three values on the Riemann sphere. This gives an easy proof of the tPicard theorem stating that every ttranscendental meromorphic func- tion f(z) in Izi < CIJ must take all values except possibly two values. In fact the family of func- tions [..(z) = f(z/2 n ), n = 1, 2, 3, "., in {1 < Izi < 2} cannot be normal. Using a similar proce- dure, G. Julia obtained the results on tJulia's direction. F. Marty introduced the notion of spherical derivative If' (z)1 /( I + 1 /,(z)1 2 ) for the analytic or meromorphic function [(z) and proved that 435 Ref. Uniform Convergence for a family .:F = { [(z)} of analytic functions to be normal, it is necessary and sufficient that the spherical derivatives off E.? be uniformly bounded. This theorem implies Montel's theorem and its various extensions, including, for example, quantitative results concerning tBorel's direction. A family .:F of analytic functions of one variable defined on X is said to form a quasi- normal family if there exists a subset P of X consisting only of isolated points such that from any seq uence { fn} Un E.:F) we can select a subsequence {In(v)} converging uniformly on X - P. If P is finite and consists of p points, the family ,y is called a quasinormal family of order p. For example, the family of at mOst tp_ valent functions is quasinormal of order p. The theory of normal families of complex analytic functions is not only applied to tvalue distribution theory, as above, but also used to show the existence of a function that gives the extremal of functionals. The extremal function is usually obtained as a limit of a subsequence of a sequence in a normal family. A typical example of this method is seen in the proof of the tRiemann mapping theorem. This is per- haps the only general method known today in the study of the iteration oPholomorphic functions. By this method, Julia (1919) made an exhaustive study of the iteration of mero- morphic functions; there are several other in- vestigations on the iteration of elementary transcendental functions. On the other hand, A. Wintner (Comm. Math. Helv., 23 (1949)) gave the implicit function theorem for analytic functions in a precise form using the theory of normal families of analytic functions of several variables. References [IJ N. Bourbaki, Elements de mathematique, Ill., Topologie generale, ch. 10, Espaces fonc- tionnels, Actualites Sci. Ind., 1084b, Hermann, second edition, 1967; English translation, Theory of sets, Addison- Wesley, 1966. [2J J. L. Kelley, General topology, Van Nos- trand, 1955, ch. 7. [3J 1. Dieudonne, Foundations of modern analysis, Academic Press, 1960, enlarged and corrected printing, 1969. For normal families of complex functions, [4J P. Montel, Lel;ons sur les families nor- males de fonctions analytiques et leurs appli- cations, Gauthier-Villars, 1927. [5J G. Valiron, Families normales et quasi- normales de fonctions meromorphes, Gauthier- Villars, 1929. 
436 A Uniform Spaces 436 (11.22) Uniform Spaces A. Introduction There are certain properties defined on tmetric spaces but not on general ttopological spaces, for example, tcompleteness or tuniform con- tinuity of functions. Generalizing metric spaces, A. Weil introduced the notion of uniform spaces. This notion can be defined in several ways [3,4]. The definition in Section B is that of Weil [IJ without the tseparation axiom for topology. We denote by d x the diagonal [(X,X)IXEX} of the Cartesian product X x X of a set X with itself. If U and Yare subsets of X x X, then the composite Vo U is defined to be the set of all pairs (x, y) such that for some element z of X, the pair (x, z) is in U and the pair (z, y) is in V. The inverse U -, of U is defined to be the set of all pairs (x, y) such that (y, x) E U. B. Definitions Let )11 be a non empty family of subsets of X x X such that (i) if U E )11 and U e V. then VEO/I; (ii) if U, VEvll, then un VEoII; (iii) if U E 1/, then d x e U; (iv) if U E )11, then U -I E 41; and (v) if U E vll, then Vo Ve U for some Vell. Then we say that a uniform structure (or simply a uniformity) is defined on X by ull. If a uniformity is defined on X by)ll, then the pair (X, °11) or simply the set X itself is called a uniform space, and °11 is usually called a uniformity for X. A subfamily .Cj{J of the uniformity 111 is called a base for the uniformity vll if every member of )11 contains a member of . If a family . of subsets of X x X is a base for a uniformity )11, then the fOllowing propositions hold: (ii') if U, VE!J6', then there exists a WE.Ci6' such that We un V; (iii') if U E]$, then d x e U; (iv') if U E!, then there exists a VE::!6' such that Ve U- 1 ; (v') if U E:, then there exists a VEJ6' such that Vo Ve U. Conversely, if a family [$ of subsets of a Cartesian product X x X satis- fies (ii')-(v'), then the family °11 = {U I U e X X X, Ve U for some VE[$} defines a uniformity on X and Pl1 is a base for 0/1. Given a uniform space (X, 11), a member Vof )11 is said to be symmetric if V= V -I. The family of all sym- metric members of JII is a base for vll. C. Topology of Uniform Spaces Given a uniform space (X, vII), an element XEX, and UE)II, we put U(x)={ yIYEX,(X,y) 1650 E U}. Then the family vll(x) = {U(x) I U E )II} forms a neighborhood system of XEX, which gives rise to a topology of X ( 425 Topo- logical Spaces). This topology is called the uniform topology (or topology of the uni- formity). When we refer to a topology of a uni- form space (X, vII), it is understood to be the uniform topology; thus a uniform space is also called a uniform topological space. If ::id is a base for the uniformity of a uniform space (X,)lI), then (x)={U(x)1 UE5B} is a base for the neighborhood system at each point XEX. Each member of vll is a subset of the topolog- ical space X x X, which is supplied with the product topology. The family of all open (closed) symmetric members of )11 forms a base for 11. A uniform space (X, '11) is a tT 1 - topological space if and only if the intersection of all members of W is the diagonal d x . In this case, the uniformity of (X, °11) is called a T 1 - uniformity, and (X, °11) is called a T, -uniform space. A TI -uniform space is always tregular; a fortiori, it is a T 2 -topological space. Hence a T, -uniform space is also said to be a Haus- dorff uniform space (or separated uniform space). Moreover, a uniform topology satisfies tTikhonov's separation axiom; in particular, a TI-uniform space is tcompletely regular. D. Examples (I) Discrete Uniformity. Let X be a nonempty set, and let W= {U Id x e U eX X X}. Then (X, 11) is a TI -uniform space and J3' = {d x } is a base for °11. This uniformity is called the dis- crete uniformity for X. (2) Uniform Family of Neighborhood System. A family {U.(X)}aEA(XEX) of subsets of a set X is called a uniform neighborhood system in X if it satisfies the following four requirements: (i) XE Ua(x) for each iXEA and each XEX; (ii) if x and yare distinct elements of X, then y U.(x) for some iXEA; (iii) if iX and {3 are two elements of A, then there is another element YEA such that U,(x) e Ua(x) n Up(x) for all x E X; (iv) if iX is an arbitrary element in A, then there is an element {3 in A such that yE Ua(x) whenever x, y E Up(z) for some z in X. If we denote by U.(iXE A) the subset of X x X consisting of all elements (x, y) such that x E X and y E Ua(x), then the family { U, I iXE A} satisfies all the conditions for a base for a uniformity. In par- ticular. it follows from (ii) that n'EA Ua=d x , which is a stronger condition than (iii') in Section B. For instance, if {U a I iXE A} is a base for the neighborhood system at the identity element of a TI-topological group G, then we have two uniform neighborhood systems {U;(x)} and {U;(x)], where U;(.x)=xU, and 
1651 V(x) = Vax. Two uniformities derived from these uniform neighborhood systems are called a tleft uniformity and a tright uniformity, respectively. Generally, these two uniformities do not coincide ( 423 Topological Groups). (3) Uniform Covering System [4J. A family {Ua}.eA oftcoverings of a set X is called a uniform covering system if the following three conditions are satisfied: (i) if U is a covering of X such that U-<U a for all iXEA, then U coincides with the covering ={ {x} Lex; (ii) if iX, {3 E A, then there is ayE A such that U y -<U a and Uy-<U p ; (iii) if ()(E A, then there is a {3 E A such that Up is a M-refinement of U a «Upt-<Ua). For an example of a uni- form covering system of X, suppose that we are given a uniform neighborhood system {Va(X)}aeA (XEX). Let O/Ia={Va(X)}aEx (iXEA). Then {O/I.}aEA is a uniform covering system. On the other hand, for a covering U = { VA} AEA' let S(x, U) be the union of all members of U that contain x. If {U.}.EA is a uniform covering system and V.(x) == S(x, U.), then {Va(X)}'EA (XE X) is a uniform neighborhood system. Hence defining a uniform covering system of X is equivalent to defining a T j - uniformity on X. (4). In a metric space (x, d) the subsets U, = {(x,y)ld(x, Y)<r}, r>O, form a base of un i- formity. The uniform topology defined by this coincides with the topology defined by the metric. E. Some Notions on Uniform Spaces Some of the terminology concerning topolog- ical spaces can be restated in the language of uniform structures. A mappingffrom a uni- form space (X, 0/1) into another (X', °ll') is said to be uniformly continuous if for each member V' in 0Zt' there is a member V in 0/1 such that (f(x), f(Y))E V' for every (x, Y)E V. This con- dition implies thatfis continuous with respect to the uniform topologies of the uniform spaces. Equivalently, the mapping is uniformly continuous with respect to the uniform neigh- borhood system {V.(x) LEA if for any index fJ there is an index iX such that YE V.(x) implies f(Y)EVp(f(X)). Iff:X-->X' andg:X'-->X" are uniformly continuous, then the composite gof:X -->X" is also uniformly continuous. A bijectionf of a uniform space (X, 0/1) to another (X', 0Zt') is said to be a uniform isomorphism if bothfandf-1 are uniformly continuous; in this case (X, °11) and (X', 0/1') are said to be uniformly equivalent. A uniform isomorphism is a homeomorphism with respect to the uni- form topologies, and a uniform equivalence 436 F Uniform Spaces defines an equivalence relation between uni- form spaces. If °lll and °ll 2 are uniformities for a set X, we say that the uniformity 0Zt 1 is stronger than the uniformity 0Zt 2 and ull 2 is weaker than 0Zt 1 if the identity mapping of (X, 0Zt 1) to (X, 0Zt 2) is uni- formly continuous. The discrete uniformity is the strongest among the uniformities for a set X. The weakest uniformity for X is defined by the single member X x X; this uniformity is not a Tj-uniformity unless X is a singleton. Generally, there is no weakest T 1 -uniformity. A uniformity 0/1 1 for X is stronger than an- other 0/1 2 if and only if every mem ber of 0/1 2 is also a member of 0Zt 1 . If.fis a mapping from a set X into a uniform space (Y, 1') and g is the mapping of X x X into Yx Y defined by g(x,y)=(f(x),j(y)), then e6'={g-j(V)\ VE1'} satisfies conditions (ii')- (v') in Section B for a base for a uniformity. The uniformity 0/1 for X determined by e6' is called the inverse image of the uniformity 1/ for Y by f; 0/1 is the weakest uniformity for X such thatfis uniformly continuous. Hence a mappingffrom a uniform space (X, 0/1) into another (Y, 1') is uniformly continuous if and only if the inverse image of the uniformity l' under fis weaker than the uniformity 0/1. If A is a subset of a uniform space (X, 0/1), then there is a uniformity l' for A determined as the inverse image of 0/1 by the inclusion mapping of A into X. This uniformity l' for A is called the relative uniformity for A induced by oll, or the relativization of 01/. to A, and the uniform space (A, 1') is called a uniform subspace of (X, -fI). The uniform topology for (A, 1') is the relative topology for A induced by the uniform topology for (X, 0/1). If {(X" 01/.,)} 'EA is a family of uniform spaces, then the product uniformity for X = IT AEA X A is defined to be the weakest uni- formity 0/1 such that the projection of X onto each X A is uniformly continuous, and (X, 011) is called the product uniform space of {(X)., °IlA)} 'EA' The topology for (X, eft) is the product of the topologies for (X" 0/1,) (AE A). F. Metrization Each tpseudometric d for a set X generates a uniformity in the following way. For each positive number r, let ,,={(X,Y)EX x Xld(x,y)<r}. Then the family {.,lr>O} satisfies conditions (ii')-(v') in Section B for a base for a uniformity 0/1. This uniformity is called the pseudo metric uniformity or uniform- ity generated by d. The uniform topology for (X, W) is the pseudometric topology. A uni- form space (X, 0/1) is said to be pseudometrizable (metrizable) if there is a pseudometric (metric) 
436 G Uniform Spaces d such that the uniformity /11 is identical with the uniformity generated by d. A uniform space is pseudometrizable if and only if its uniformity has a countable base. Conse- quently, a uniform space is metrizable if and only if its uniformity is a T, -uniformity and has a countable base. For a family P of pseudometrics on a set X, let Va., = [(x, Y)E X X X I d(x,y)< r} for dE P and positive r. The weakest uniformity containing every Va., (d E P, r >0) is called the uniformity gen- erated by P. This uniformity may also be de- scribed as the weakest one such that each pseudometric in P is uniformly continuous on X x X with respect to the product uniformity. Each uniformityJl! on a set X coincides with the uniformity generated by the family Px of all pseudometrics that are uniformly con- tinuous on X x X with respect to the product uniformity of JII with itself. It follows that each uniform space is uniformly isomorphic to a subspace of a product of pseudo metric spaces (in which the number of components is equal to the cardinal number of Px) and that each T,-uniform space is uniformly isomorphic to a subspace of a product of metric spaces. A topology T for a set X is the uniform topology for some uniformity for X if and only if the topological space (X, T) satisfies tTikhonov's separation axiom; in particular, the uniformity is a T[-uniformity if and only if (X, T) is tcom- pletel y regular. G. Completeness If (X,Jl!) is a uniform space, a subset A of X is called a small set of order U (U E 4/) if Ax A c U. A tfilter on X is called a Cauchy filter (with respect to the uniformity Oil) if it contains a smalI set of order U for each U in )11. If a filter on X converges to some point in X, then it is a Cauchy filter. If f is a uniformly con- tinuous mapping from a uniform space X into another X', then the image of a base for a Cauchy filter on X under [is a base for a Cauchy filter on X'. A point contained in the closure of every set in a Cauchy filter  is the limit point of. Hence if a filter converges to x, a Cauchy filter contained in the filter also converges to x. A tnet x(QI) = {x.} 'E[ (where Q( is a directed set with a preordering :s;) in a uniform space (X, JlI) is called a Cauchy net if for each U in 011 there is a "y in QI such that (x., Xp)E U for every Y. and {3 such that }':s; Y., "y:s; f3. If Q( is the set N of all natural numbers, a Cauchy net {Xn}nEN is calIed a Cauchy sequence (or fundamental sequence). Given a Cauchy net {x.} 'Eb let A, ={x p lf3?y.}. Then ={A.Iy.EQI} is a basefor a filter, and the filter is a Cauchy filter. On the 1652 other hand, let  be a base for a Cauchy filter . For U, VE, we put U:s; Vifand only if U => V Then  is a directed set with respect to :S;. The net {X[l } [IE"" where X[I is an arbitrary point in U, is a Cauchy net. A proposition concerning convergence of a Cauchy filter is always equivalent to a proposition concerning convergence of the corresponding Cauchy net. A Cauchy filter (or Cauchy net) in a uni- form space X does not always converge to a point of X. A uniform space is said to be com- plete (with respect to the uniformity) if every Cauchy filter (or Cauchy net) converges to a point of that space. A complete uniform space is calIed for brevity a complete space. A closed subspace of a complete space is complete with respect to the relative uniformity. A pseudo- metrizable uniform space is complete if and only if every Cauchy sequence in the space converges to a point. Hence in the case of a metric space, our definition of completeness coincides with the usual one ( 273 Metric Spaces). A mapping {from a uniform space X to an- other X' is said to be uniformly continuous on a subset A of X if the restriction of {to A is uniformly continuous with respect to the rela- tive uniformity for A. Iffis a uniformly con- tinuous mapping from a subset A of a uniform space into a complete T 1 -uniform space, then there is a unique uniformly continuous exten- sion T of [ on the closure A. Each T 1 -uniform space is uniformly equiva- lent to a dense subspace of a complete T[- uniform space; this property is a generaliza- tion of the fact that each metric space can be mapped by an isometry onto a dense subset of a complete metric space. A completion of a uniform space (X, .Ill) is a pair (j; (X*, JII*)), where (X*, JlI*) is a complete space and [is a uniform isomorphism of X onto a dense subspace of X*. The T[-completion of a T[-uniform space is unique up to uniform equivalence. H. Compact Spaces A uniformity °ll for a topological space (X, T) is said to be compatible with the topology T if the uniform topology for (X, UII) coincides with T. A topological space (X, T) is said to be unifor- mizable if there is a uniformity compatible with T. If (X, T) is a compact Hausdorff space, then there is a unique uniformity "1t compa- tible with T; in fact, '7/ consists of all neighbor- hoods of the diagonal d x in X x X; and the compact Hausdorff space is complete with this uniformity. Hence every subspace of a com- pact Hausdorff space is uniformizable, and every tlocally compact Hausdorff space is 
1653 uniformizable. Any continuous mapping from a compact Hausdorff space to a uniform space is uniformly continuous. A uniform space (X, )1/) is said to be totally bounded (or precom- pact) if for each U E)1I there is a finite covering consisting of small sets of order U; a subset of a uniform space is called totally bounded if it is totally bounded with respect to the relative uniformity. A uniform space X is said to be locally totally bounded if for each point of X there is a base for a neighborhood system consisting of totally bounded open subsets. A uniform space is compact if and only if it is totally bounded and complete. If.fis a uni- fOTmly continuous mapping from a uniform space X to another, then the image.f(A) of a totally bounded subset A of X is totally bounded. I. Topologically Complete Spaces A topological space (X, T) is said to be topo- logically complete (or Dieudonne complete) if it admits a uniformity compatible with T with respect to which X is complete. Each tpara- compact Hausdorff space is topologically complete. Actually such a space is complete with respect to its strongest uniformity. A Hausdorff space which is homeomorphic to a tG.-set in a compact Hausdorff space is said to be Cech-complete; A metric space is homeomorphic to a complete metric space if and only if it is Cech-complete. A Hausdorff space X is paracompact and Cech-complete if and only if there is a tperfect mapping from X onto a complete metric space. References [IJ A. Weil, Sur les espaces a structure uni- forme et sur la topologie generale, Actualites Sci. Ind., Hermann, 1938. [2J J. W. Tukey, Convergence and uniformity in topology, Ann. Math. Studies, Princeton U ni v. Press, I 940. [3J J. L. Kelley, General topology, Van Nos- trand,1955. [4J N. Bourbaki, Elements de mathematique, III. Topologie generale, ch. 2, Actualites Sci. Ind., I 142d, Hermann, fourth edition 1965; English translation, General topology, Addiion- Wesley, 1966. [5J J. R. Isbell, Uniform spaces, Amer. Math. Soc. Math. Surveys, 1964. [6J H. Nakano, Uniform spaces and trans- formation groups, Wayne State Univ. Press, 1968. [7J R. Emgelking, General topology, Polish Scientific Publishers, 1977. 437 A Unitary Representations [8J Z. F rolik, Generalization of the G r property of complete metric spaces, Czech. Math. 1.,10 (1960),359-379. 437 (IV.17) Unitary Representations A. Definitions A homomorphism U of a ttopological group G into the group of tunitary operators on a tHilbert space t) (o;io{O}) is called a unitary representation of G if U is strongly continuous in the following sense: For any element XEt), the mapping 9 --> Ugx is a continuous mapping from G into t). The Hilbert space t) is called the representation space of U and is denoted by t)(U). Two unitary representations U and U' are said to be equivalent (similar or isomor- phic), denoted by U  U', if there exists an tisometry T from t)(U) onto t)(U') that satis- fies the equality To U g = U 0 T for every 9 in G. If the representation space t)(U) contains no closed subspace other than t) and {O} that is invariant under every U g , the unitary repre- sentation U is said to be irreducible. An element x in t)( U) is called a cyclic vector if the set of all finite linear combinations of the elements UgX(gEG) is dense in f,(U). A representation U having a cyclic vector is called a cyclic representation. Every nonzero element of the representation space of an irreducible repre- sentation is a cyclic vector. Examples. Let G be a ttopological transfor- mation group acting on a tlocally compact Hausdorff space X from the right. Suppose that there exists a tRadon measure 11 that is invariant under the group G. Then a unitary representation R is defined on the Hilbert space t) = L 2 (X, 11) by the formula (Rf)(x) =.f(xg) (JEt),XEX,gEG). The representation R is called the regular representation of G on (X, 11). If G acts on X from the left, then the regular representation L" is defined by (Lf)(x)= f(g-1 x). In particular, when X is the tquotient space H\G of a tIocal1y compact group G by a closed subgroup H, any two invariant measures 11, 11' (if they exist) coincide up to a constant factor. Hence the regular representation R" on (X, 11) and the regular representation R' on (X,I1') are equivalent. In this case, the representation R is calIed the regular representation on X. When H = {e}, a locally compact group G has a Radon measure WIO that is invariant under every right (left) translation h-->hg (h-->gh) and is called a right (!cft) tHaar measurc on G. So G has the regu- 
437B Unitary Representations lar representation R (L) on G. R (L) is called the right (left) regular representation of G. B. Positive Definite Functions and Existence of Representations A complex-valued continuous function q> on a topological group G is called positive definite if the matrix having q>(gi- 1 gj) as the (i,j)- component is a tpositive semidefinite Her- mitian matrix for any finite number of ele- ments gl,"', gn in G. If U is a unitary repre- sentation of G, then the function q>(g) = (Ugx, x) is positive definite for every element x in NU). Conversely, any positive definite function q>(g) on a topological group G can be expressed as q>(g) = (Ugx, x) for some unitary representation U and x in f,(U). Using this fact and the tKrein-Milman theorem, it can be proved that every locally compact group G has sufficiently many irreducible unitary representations in the following sense: For every element g in G other than the identity element e, there exists an irreducible unitary representation U, generally depending on g, that satisfies the inequality U g -# 1. The groups having sufficiently many finite-dimensional (irreducible) unitary repre- sentations are called tmaximally almost periodic. If a connected locally compact group G is maximally almost periodic, then G is the direct product of a compact group and a vec- tor group R m . On the other hand, any non- compact connected tsimple Lie group has no finite-dimensional irreducible unitary repre- sentation other than the unit representation g -..1 ( 18 Almost Periodic Functions). C. Subrepresentations Let U be a unitary representation of a topo- logical group G. A closed subspace 91 of f,(U) is called U-invariant if 91 is invariant under every U g (g E G). Let 91 -# {O} be a closed invar- iant subspace of f,(U) and J.-;, be the restric- tion of U g on 91. Then V is a unitary represen- tation of G on the representation space 91 and is called a subrepresentation of U. Two unitary representations Land M are called disjoint if no subrepresentation of L is equivalent to a subrepresentation of M; they are called quasi- equivalent if no subrepresentation of L is dis- joint from M and no subrepresentation of M is disjoint from L. D. Irreducible Representations Let U be a unitary representation of G, M be the tvon Neumann algebra generated by {UqlgEG}, and M' be the tcommutant ofM. 1654 Then a closed subspace 91 of f,(U) is invariant under U if and only if the tprojection operator P corresponding to 91 belongs to M'. There- fore U is irreducible if and only if M' consists of scalar operators {IXII IXE C} (Schur's lemma). A representation space of a cyclic or irreduc- ible representation of a tseparable topological group is tseparable. E. Factor Representations A unitary representation U of G is called a factor representation if the von Neumann algebra M = { U g I g E G} is a tfactor, that is, MnM' = {IXII IXEC}. Two factor representa- tions are quasi-equivalent if and only if they are not disjoint. U is called a factor represen- tation of type I, II, or III if the von Neumann algebra M is a factor of Hype I, II, or III, respectively ( 308 Operator Algebras). A topological group G is called a group of type I (or type I group) if every factor representation of G is of type I. Compact groups, locally compact Abelian groups, connected tnilpotent Lie groups, connected tsemisimple Lie groups, and real or complex tlinear algebraic groups are examples of groups of type I. There exists a connected solvable Lie group that is not of type I ( Section U), but a connected solvable Lie group is of type I if the exponential map- ping is surjective (0. Takenouchi). A discrete group G with countably many elements is a type I group if and only if G has an Abelian normal subgroup with finite index (E. Thoma). F. Representation of Direct Products Let G 1 and G 2 be topological groups, G the tdirect product of G 1 and G 2 (G = G 1 x G 2 ), and U i an irreducible unitary representation of G i (i = 1,2). Then the ttensor product representa- tion U 1 @ U 2 :(gl,g2)-"U 9 , @ U g2 is an irreduc- ible unitary representation of G. Conversely, if one of the groups G 1 and G 2 is of type I, then every irreducible unitary representation of G is equivalent to the tensor product U 1 @ U 2 of some irreducible representations U i of G i (i= 1,2). G. Direct Sums If the representation space f, of a unitary representation U is the tdirect sum <:82EI NIX) of mutually orthogonal closed invariant sub- spaces {f,(IX)}2EI' then U is called the direct sum of the subrepresentations U(IX) induced on f,(IX) by U, and is denoted by U = (172EI U(IX). Any unitary representation is the direct sum of cyclic representations. A unitary represent a- 
1655 tion V is called a representation without multi- plicity if V cannot be decomposed as a direct sum VI EB V 2 unless VI and V 2 are disjoint. If V is the direct sum of {V(IJ()LEI and every V(IJ() is irreducible, then V is said to be decomposed into the direct sum of irreducible representa- tions. Decomposition into direct sums of irreducible representations is essentially unique if it exists; that is, if V = EBOEl V(IJ() = EBPEJ V(fJ) are two decompositions of V into direct sums of irreducible representations, then there exists a bijection q> from I onto J such that V(IJ() is equivalent to V(q>(IJ()) for every IJ( in I. A factor representation V of type 1 can be decomposed as the direct sum V = EBoE1 V(IJ() of equivalent irreducible represen- tations V(IJ(). In general, a unitary representa- tion V cannot be decomposed as the direct sum of irreducible representations even if V is not irreducible. Thus it becomes necessary to use direct integrals to obtain an irreducible decomposition. H. Direct Integrals Let V be a unitary representation of a group G and (X, J1) be a tmeasure space. Assume that the following two conditions are satisfied by V: (i) There exists a unitary representation V(x) of G corresponding to every element x of X, and S( V) is a tdirect integral ( 308 Operator Algebras) of S( V(x)) (XE X) (written S(V)= Ix S( V(x))dJ1(x)); (ii) for every 9 in G, the operator V g is a decomposable operator and can be written as V g = Ix V q (x)dJ1(x). Then the unitary representation V is called the direct integral of the family { V(x)} XEX of uni- tary representations and is denoted by V = Ix V(x)dJ1(x). If every point of X has mea- sure 1, then a direct integral is reduced to a direct sum. 1. Decomposition into Factor Representations We assume that G is a locally compact group satisfying the tsecond countability axiom, and also that a Hilbert space is separable. Every unitary representation V of G can be decom- posed as a direct integral V = Ix V(x)dJ1(x) in such a way that the center A of the von Neumann algebra M" = { V g I 9 E G}" is the set of all t diagonalizable operators. In this case almost all the V(x) are factor representations. Such a decomposition of V is essentially unique. There exists a tnull set N in X such that for every x and x' in X - N (x # x'), V (x) and V(x') are mutually disjoint factor repre- sentations. Hence the space X can be identified with the set G* of all quasi-equivalence classes 437 K Unitary Representations of factor representations of G endowed with a suitable structure of a measure space. The space G* is called the quasidual of G. The measure J1 is determined by V up to tequiva- lence of measures. J. Duals A topology is introduced on the set G of all equivalence classes of irreducible unitary rep- resentations of a locally compact group G in the following way. Let Hn be the n-dimensional Hilbert space 12(n) and In the set of all irreduc- ible unitary representations of G realized on Hn (1 :(n:( 00). We topologize In in such a way that a tnet {VALEL in In converges to V if and only if( VgAX, y) converges uniformly to (Vgx, y) on every compact subset of G for any x and y in Hn. Equivalence between representations in In is an open relation. Let G n be the set of all equivalence classes of n-dimensional irreduc- ible unitary representations of G with the topology of a quotient space of In and G = Un G n be the direct sum of topological spaces G n . Then the topological space G is called the dual of G. G is a locally compact tBaire space with countable open base, but it does not satisfy the tHausdorff separation axiom in general. If G is a compact Hausdorff topolog- ical group, then G is discrete. If G is a locally compact Abelian group, then G coincides with the tcharacter group of G in the sense of Pontryagin. If G is a type I group, then there exists a dense open subset of G that is a locally compact Hausdorff space. The ter-additive family generated by closed sets in G is denoted by . In the following sections, a measure on G means a measure defined on . K. Irreducible Decompositions In this section G is assumed to be a locally compact group of type 1 with countable open base. For any equivalence class x in G, we choose a representative V (x) E x with the rep- resentation space H(V(x))=lz(n) if x is n- dimensional. For any measure J1 on G, the representation v= Ie V(x)dJ1(x) is a unitary representation without multiplicity. Con- versely, any unitary representation of G with- out multiplicity is equivalent to a V for some measure J1 on G. Moreover, V is equivalent to VV if and only if the two measures J1 and v are equivalent (that is, J1 is absolutely continuous with respect to v, and vice versa). A unitary representation V with multiplicity on a sepa- rable Hilbert space S can be decomposed as follows: There exists a countable set of mea- sures J11, J12' ..., J17 whose supports are mutu- 
437 L Unitary Representations ally disjoint such that V  Ie V(X)dJ11 (x) EB 2Ie V(X)dJ12(X) EB ... EB 00 Ie V(x)dJ1c", (x). The measures J1h J12,"" J100 are uniquely determined by V up to equivalence of mea- sures. Any unitary representation V on a separable Hilbert space S of an arbitrary locally compact group with countable open base (even if not of type I) can be decomposed as a direct integral of irreducible representa- tions. In order to obtain such a decompo- sition, it is sufficient to decompose S as a direct integral in such a way that a maximal Abelian von Neumann subalgebra A of M' = {V q I gE Gj' is the set of all diagonalizable operators. In this case, however, a different choice of A induces in general an essentially different decomposition, and uniqueness of the decomposition does not hold. For a group of type I, the irreducible representations are the "atoms" of representations, as in the case of compact groups. For a group not of type I, it is more natural to take the factor represen- tations for the irreducible representations, quasi-equivalence for the equivalence, and the quasidual for the dual of G. Therefore the theory of unitary representations for a group not of type I has different features from the one for a type I group. The theory of unitary representation for groups not of type I has not yet been successfully developed, but some important results have been obtained (e.g., L. Pukanszky, Ann. Sci. Ecole Norm. Sup., 4 (1971) ). Tatsuuma [IJ proved a duality theorem for general locally compact groups which is an extension of both pontryagin's and Tannaka's duality theorems considering the direct in- tegral decomposition of tensor product representations. L. The Plancherel Formula Let G be a unimodular locally compact group with countable open base, R(L) be the right (left) regular representation of G, and M, N, and P be the von Neumann algebras generated by {Rq}, {Lg}, and {Rq, Lq}, respectively. Then M'=N, N'=M, and p'=MnN. If we decom- pose S into a direct integral in such a way that P' is the algebra of all diagonalizable operators, then M(x) and N (x) are factors for almost all x. This decomposition of S pro- duces a decomposition of the two-sided regu- lar representation {Rq, Lq} into irreducible representations and a decomposition of the regular representation R(L) into factor repre- sentations. Hence the decomposition is realized as the direct integral over the quasidual G* of G. Moreover, the factors M(x) and N(x) are of type I or II for almost all x in G*, and there 1656 exists a ttrace t in the factor M(x). For any J and g in L,(G) n L 2 (G), the Plancherel formula f J(s)g(s)ds = f t(V;(x) V f (x))dJ1(x) (1) G G* holds, where Vf(x) = IGJ(s)Vs(x)ds and V* is the tadjoint of V. The inversion formula h(S)= f t(Vh(x)V;(x))dJ1(x) (2) G* is derived from (1) for a function i1 = f * g (f,gEL,(G)nL 2 (G)). In (1) and (2), because of the impossibility of normalization of the trace t in a factor of type 11 00 , the measure J1 cannot in general be determined uniquely. However, if G is a type I group, then (1) and (2) can be rewritten as similar formulas, where the repre- sentation V (x) in (1) and (2) is irreducible, the trace t is the usual trace, and the domain of integration is not the quasidual G* but the dual G of G. The revised formula (1) is also called the Plancherel formula. In this case the measure J1 on G in formulas (1) and (2) is uniquely determined by the given Haar mea- sure on G. The measure J1 is called the Plan- cherel measure of G. The support G, of the Planchere1 measure J1 is called the reduced dual of G. The Plancherel formula givt:s the direct integral decomposition of the regular repre- sentation into the irreducible representations belonging to Gr. Each V in G, is contained in this decomposition, with the multiplicity equal to dim Sf V). M. Square Integrable Representations An irreducible unitary representation V of a unimodular locally compact group G is said to be square integrable when for some element x # 0, in S( V), the function <p(g) =, (Vgx, x) belongs to L 2(G, dg), where dg is the Haar measure of G. If V is square integrable, then <Px.y(g)=(Vgx,y) belongs to L 2 (G,dg) for any x and y in S( V). Let V and V' be the two square integrable representations of G. Then the following orthogonality relations hold: t (Vgx, y )( Vu, v) dg  {:,;, (x, "H', y) if V is not equivalent to V', ifV=V'. (3) When G is compact, every irreducible unitary representation V is square integrable and finite-dimensional. Moreover, the scalar d u in (3) is the degree of V if the total measure of G is normalized to 1. In the general case, the scalar d u in (3) is called the formal degree of V and is determined uniquely by the given Haar 
1657 measure dg. Let y be an element in f,(V) with norm 1 and V be the subs pace {<Px) x E i>( V)} of L2(G). Then the linear mapping T :x-+Jd;; <Px.y is an isometry of f,(V) onto V. Hence V is equivalent to a subrepresentation of the right regular representation R of G. Conversely, every irreducible subrepresentation of R is square integrable. Thus a square integrable representation is an irreducible subrepresen- tation of R (L). Therefore, in the irreducible decomposition of R, the square integrable representations appear as discrete direct sum- mands. Hence every square integrable repre- sentation V has a positive Plancherel measure J1( V) that is equal to the formal degree duo There exist noncompact groups that have square integrable representations. An example of such a group is SL(2,R) ( Section X). N. Representations of Ll (G) Let G be a locally compact group and LdG) be the space of all complex-valued integrable functions on G. Then Ll (G) is an algebra over C, where the convolution (/"*g)(s)= Lf(st-[)9(t)dt is defined to be the product of f and g. Let d be the tmodular function of G. Then the map- ping f(s)-+ f*(s)= d(S-1 ) f(5 -= T)is an tinvol- ution of the algebra Ll (G). Let V be a unitary representation of G, and put VI = fG VJ(s)ds. Then the mapping f -+ VI gives a nondegen- erate representation of the Banach algebra Ll(G) with an involution, where nondegenerate means that {VIxlfELdG), X Ef,(V)}.l reduces to {a}. The mapping V -+ V' gives a bijection between the set of equivalence classes of uni- tary representations of G and the set of equiv- alence classes of nondegenerate representa- tions of the Banach algebra L[ (G) with an involution on Hilbert spaces. V is an irreduc- ible (factor) representation if and only if V' is an irreducible (factor) representation. There- fore the study of unitary representations of G reduces to that of representations of Ll (G). If U; is a tcompact operator for every f in L 1 (G), then V is the discrete direct sum of irreducible representations, and the multiplicity of every irreducible component is finite. (See [2J for Sections A- N.) O. Induced Representations Induced representation is the method of con- structing a representation of a group G in a canonical way from a representation of a subgroup H of G. It is a fundamental method 437Q Unitary Representations of obtaining a unitary representation of G. Let G be a locally compact group satisfying the second countability axiom, L be a unitary representation on a separable Hilbert space f,(L) of a closed subgroup H of G, and m, n, d, and b be the right Haar measures and the modular functions of the groups G and H, respectively. Then there exists a continuous positive function p on G satisfying p(hg)= b(h)d(h)-l p(g) for every h in Hand 9 in G. The tquotient measure J1 = (pm)/n is a quasi- invariant measure on the coset space H\G ( 225 Invariant Measures). Let f, be the vector space of weakly measurable functions f on G with values in f,(L) satisfying the following two conditions: (i) f(hg) = Ld(g) for every h in Hand 9 in G; and (ii) IIfII2 = fH\G Ilf(g) 11 2 dJ1@ < + 00, where g represents the coset Hg. By condition (i), the norm Ilf(g) II is constant on a coset Hg=g and is a function on H\G, so the integral in condition (ii) is well defined. Then f, is a Hilbert space with the norm defined in (ii). A unitary representation V of G on the Hilbert space f, is defined by the formula (VJ)(g)= J p(gs)/p(g) f(gs). V is called the unitary representation induced by the representation L of a subgroup H and is denoted by V = V L or Ind L. Induced repre- sentations have the following properties. (1) VL,8'JL2 V L , E8 V L 2 or more generally, VfU(x)d#(x)  f V L (X)dJ1(x). Therefore if V L is irreducible, L is also irreducible (the converse does not hold in general). (2) Let H, K be two subgroups of G such that He K, L be a unitary representation of H, and M be the representation of K induced by L. Then two unitary representations V M and V L of G are equivalent. An induced representation V L is the repre- sentation on the space of square integrable sections of the tvector bundle with fiber H(L) tassociated with the principal bundle (G, H\ G, H) ( G. W. Mackey [3], F. Bruhat [4J). P. Unitary Representations of Special Groups In the following sections we describe the fun- damental results on the unitary representa- tions of certain special groups. Q. Compact Groups Irreducible unitary representations of a com- pact group are always finite-dimensional. Every unitary representation of a compact group is decomposed into the direct sum of irreducible representations. Irreducible unitary representations of a compact connected Lie 
437 R Unitary Representations group are completely classified. The characters of irreducible representations are calculated in an explicit form ( 69 Compact Groups; 249 Lie Groups). Every irreducible unitary repre- sentation U of a connected compact Lie group G can be extended uniquely to an irreducible holomorphic representation U C of the com- plexification G C of G. U C is holomorphically induced from a I-dimensional representation of a Borel subgroup B of G C (Borel-Weil theorem;  R. Bott [5J). R. Abelian Groups Every irreducible unitary representation of an Abelian group G is I-dimensional. tStone's theorem concerning one-parameter groups of unitary operators, U t = S:':'x eiJ.t dE.!. gives irreducible decompositions of unitary repre- sentations of the additive group R of real numbers. tBochner's theorem on tpositive definite functions on R is a restatement of Stone's theorem in terms of positive definite functions. The theory of the tFourier trans- form on R, in particular tPlancherel's theorem, gives the irreducible decomposition of the regular representation of R. The theorems of Stone, Bochner, and Plancherel have been extended to an arbitrary locally compact Abelian group ( 192 Harmonic Analysis). S. Representations of Lie Groups and Lie Algebras Let U be a unitary representation of a Lie group G with the Lie algebra !1. An element x in t)( U) is called an analytic vector with respect to U if the mapping 9 -> Uqx is a real analytic function on G with values in t)( U). The set of all analytic vectors with respect to U forms a dense subspace W=W(U) of f,(U). For any elements X in!1 and x in W(U), the derivative at t = 0 of a real analytic function UexptXX is denoted by V(X)x. Then V(X) is a linear transformation on W, and the mapping V: X -> V(X) is a representation of!1 on W. We call V the differential representation of U. The rep- resentation V of!1 can be extended uniquely to a representation of the tuniversal en- veloping algebra \B of!1. Two unitary repre- sentations U(1) and U(2) of a connected Lie group G are equivalent if and only if there exists a bijective bounded linear mapping T from t)(U(1») onto t)(U(2») such that T maps 'l1(U('») onto W(U(2») and satisfies the equality (To V(1)(X))X=(V(2)(X)O T)x for all X in g and x in W(U(1»). Let XI,"', X n be a basis of!1 and U be a unitary representa- tion of G. Then the element A = xi + ". + X 1658 in the universal enveloping algebra \B of!1 is represented in the differential representation V of U by an tessentially self-adjoint operator V(1'1). Conversely, if to each element X in !1 there corresponds a (not necessarily bounded) tskew-Hermitian operator p(x) that satisfies the following three conditions, then there exists a unique unitary representation U of the simply connected Lie group G with the Lie algebra!1 such that the tclosure of V(X) coin- cides with the closure of p(X) for every X in !1: (i) There exists a dense subs pace D contained in the domain of p(X)p(Y) for every X and Y in !1; (ii) for each X and Y in !1, a and b in R, and x in D, p(aX + b Y)x = ap(X)x + bp( Y)x, p([X, YJ)x =(p(X)p(Y)- p(Y)p(X))x; (iii) the restriction of p( X d 2 + ... + p( Xn)2 to D is an essentially self-adjoint operator if X" "', X n is a basis of!1 (E. Nelson [6J). T. Nilpotent Lie Groups For every irreducible unitary representation of a connected nilpotent Lie group G, there is some I-dimensional unitary representation of some subgroup of G that induces it. Let G be a simply connected nilpotent Lie group, !1 be the Lie algebra of G, and p be the contragredient representation of the adjoint representation of G. The representation space of p is the dual space !1* of g. A subalgebra I) of 9 is called subordinate to an element f in !1* if f annihi- lates each bracket [X, Y] for every X and Y in I): C{, [X, Y])=O. When!) is subordinate to (, a I-dimensional unitary representation L of the analytic subgroup H of G with the Lie algebra I) is defined by the formula ;'f(exp X) = e2niIJ.X) (X E I)). Every I-dimensional unitary represen- tation )'f of H is defined as in this formula by an element f in g* to which I) is subordinate. The unitary representation of G induced by such a Af is denoted by U(j, I)). The represen- tation UC{,I)) is irreducible if and only if I) has maximal dimension among the subalgebras subordinate to f Two irreducible represent- ations U(j, I)) and U(j, I)') are equivalent if and only iff and f' are conjugate under the group p(G). Therefore there exists a bijection be- tween the set of equivalence classes of the irreducible unitary representations of a simply connected nilpotent Lie group G and the set of orbits of p(G) on g* (A. A. Kirillov [7J). U. Solvable Lie Groups Let G be a simply connected solvable Lie group. If the exponential mapping is bijective, G is called an exponential group. All results stated above for nilpotent Lie groups hold for exponential groups except the irreducibility 
1659 criterion. In this case the representation U ( f, I)) is irreducible if and only if I) is of maximal dimension among subordinate subalgebras and the orbit 0 = p(G) [contains the affine subspace j + 1)1. = [ + {g I g(l)) = o} (Pukanszky condition). The situation is more complicated for gen- eral solvable Lie groups. The isotropy sub- group G f = {gE G I p(g)f =f} at [Eg* is, in general, not connected. A linear form f is called integral if there exists a unitary charac- ter IJf of G f whose differential is the restriction of 2nif to 9f (the Lie algebra of G f ). Using the notion of "polarization," an irreducible unitary representation of G is constructed from a pair (f, IJf) of an integral form f Eg* and a character IJf' If G is of type I, then every irreducible unitary representation of G is obtained in this way. A simply connected solvable Lie group G is of type I if and only if (i) every [Eg* is in- tegral and (ii) every G-orbit p(G)f in g* is locally closed (Auslander and Kostant [8]). As an example, let ex be an irrational real number. Then the following Lie group G is not O"YP"G{(r ;" ) I"R",WEC} V. Semisimple Lie Groups A connected semisimple Lie group is of type I. The character X = Xu of an irreducible unitary representation U of G is defined as follows: Let CJ'(G) be the set of all complex-valued C X '_ functions with compact support on G. Then for any function f in CO" (G), the operator U f = SG Ugf(g)dg belongs to the ttrace class, and the linear form x:f -> T,. U f is a tdistribution in the sense of Schwartz. The distribution X is called the character of an irreducible unitary representation U. A character X is invariant under any inner automorphism of G and is a simultaneous eigendistribution of the algebra of all two-sided invariant linear differential operators on G. Two irreducible unitary repre- sentations of G are equivalent if and only if their characters coincide. The distribution X is a tlocall y summable function on G and coin- cides with a real analytic function on each connected component of the dense open sub- manifold G' consisting of regular elements in G. In general, X is not real analytic on all of G (Harish-Chandra [9, III; 10]. W. Complex Semisimple Lie Groups There are four series of irreducible represen- tations of a complex semisimple Lie group G. (1) A principal series consists of unitary representations of G induced from 1- 437W Unitary Representations dimensional unitary representations L of a tBorel subgroup B of G. L is uniquely deter- mined by a unitary character vEHom(A, U(1)) = A * of the tCartan subgroup A of G con- tained in B. Hence the representations in the principal series are parametrized by the ele- ments in the character group A * of the Cartan subgroup A. If we denote U L by U" two repre- sentations U V and U"'(v, v' E A *) are equivalent if and only if v and v' are conjugate under the tWeyl group W of G with respect to A. (2) A degenerate series consists of unitary representations induced by I-dimensional unitary representations of a tparabolic sub- group P of G other than B. (A parabolic sub- group P is any subgroup of G containing a Borel subgroup B.) (3) A complementary series consists of irre- ducible unitary representations U L induced by nonunitary I-dimensional representations L of a Borel subgroup B. In this case, con- dition (ii) in the definition of U L ( Section 0) must be changed. When L is a non unitary representation, then the operator Ui is not a unitary operator with respect to the usual Lz- inner product (ii). However, if L satisfies a certain condition, then U; leaves invariant some positive definite Hermitian form on the space of sufficiently nice functions. Completing this space, we get a unitary representation U L . The representations thus obtained form the complementary series. (4) A complementary degenerate series con- sists of irreducible unitary representations induced by non unitary I-dimensional repre- sentations of a parabolic subgroup Pol B. Representations belonging to different series are never equivalent. It seems certain that any irreducible unitary representition of a con- nected complex semisimple Lie group is equiv- alent to a representation belonging to one of the above four series, but this conjecture has not yet been proved. Moreover, E. M. Stein [11] constructed irreducible unitary repre- sentations different from any in the list ob- tained by I. M. Gel'fand and M. A. Naimark (Neumark) [12]. These representations belong to the complementary degenerate series. The characters of the representations in these four series are computed in explicit form. For ex- ample, the character Xv of the representation U V in the principal series can be calculated as follows: Let A be a linear form on a Cartan subalgebra a such that v(exp H) = eA(H) for every H in a, let D be the function on A de- fined by D(expH) =ILle'(IlYZ_e-O(H)(ZjZ, where ex runs over all positive roots. Then the character Xv of a representation U V in the principal series is given by the formula ;dexpH)=D(expH)-1 I esi.(ll). SEW 
437 X Unitary Representations In the irreducible decomposition of the regular representation of G, only irreducible represen- tations belonging to the principal series arise. Hence the right-hand side in the Planche reI formula is an integral over the character group A* of a Cartan subgroup A. Under a suitable normalization of the Haar measures in G and A *, the Planche reI measure J1 of G can be expressed by using the Haar measure dv of A *: dJ1(v) = w-, r11(A, IX)/(p, IXWdv, . where w is the order of the Weyl group, p is the half-sum of all tpositive roots, and IX runs over all positive roots (GeI'fand and Naimark [12]). X. Real Semisimple Lie Groups As in the case of a complex semisimple Lie group, a connected real semisimple Lie group G has four series of irreducible unitary repre- sentations. However, if G has no parabolic subgroup other than a minimal parabolic subgroup Band G itself, then G has no repre- sentation in the degenerate or complementary degenerate series. Examples of such groups are SL(2, R) and higher-dimensional tLorentz groups. In general, the classification of irreduc- ible unitary representations in the real semi- simple case is more complicated than in the complex semisimple case. Irreducible unitary representations arising from the irreducible decomposition of the regular representation are called representations in the principal series. The principal series of G are divided into a finite number of subseries corresponding bijectively to the conjugate classes of the tCar- tan subgroups of G. A connected semisimple Lie group G has a square integrable representation if and only if G has a compact Cartan subgroup H. The set of all square integrable representations of G is called the discrete series of irreducible unitary representations. The discrete series is the sub- series in the principal series corresponding to a compact Cartan subgroup H. The representa- tions in the discrete series were classified by Harish-Chandra. Let I) be the Lie algebra of H, P the set of all positive roots in I) for a fixed linear order, n the polynomial Il. Ep H., and :#' the set of all real-valued linear forms on PI). Moreover, let L be the set of all linear forms A in :#' such that a single-valued char- acter A of the group H is defined by the for- mula A(expX)=eA(X), and let L' be the set of all A. in L such that n(A) *0. Then for each A in L', there exists a representation W(A) of G in the discrete series, and conversely, every representation in the discrete series is equivalent to w(},) for some I, in L'. Two representations 1660 W(A.,) and W(A 2 ) (A." A2 E L') are equivalent if and only ifthere exists an element sin W G = N(H)/H such that A2 =sic" where N(H) is the normalizer of H in G (W G can act on :#' as a linear transformation group in the natural way). The value of the character X,l on the subgroup H of the representation W(A) (icE L') is given as follows: Let £(A.) be the signature of n(}.) = Il. Ep A(H.), and define q and d by q= (dim G/K)/2 and d(expH)= Il. EP (e.(H)/2- e -.(H)/2). Then the character XA of the repre- sentation w(A.) has the value (-1)q';(A)XA(h) = d(h) -, LSEwG(detsgsA(h) on a regular element h in H. The formal degree d(w(A.)) of the representation W(A) is given by the formula d(W(A))=C-'[WG]ln(A)I, where C is a positive constant (not depending on A) and [W G ] is the order of the finite group W G (Harish-Chandra [13]). A formula expressing the character XA on the whole set of regular elements in G has been given by T. Hirai [14]. The representations in discrete series are realized on L 2 -cohomology spaces of homogeneous holomorphic line bundles over G/H (W. Schmid [15]). They are also realized on the spaces of harmonic spinors on the tRiemannian symmetric space G/K (M. Atiyah and Schmid [16]). They are also realized on the eigenspaces of a Casimir opera- tor acting on the sections of vector bundles on G/K (R. Hotta, J. Math. Soc. Japan, 23; N. Wallach [17]). An irreducible unitary rep- resentation is called integrable if a1 least one of its matrix coefficients belongs to U (G). Integrable representations belong to the dis- crete series. They have been characterized by H. Hecht and Schmid (Math. Ann.. 220 (1976)). The theory of the discrete series is easily ex- tended to reductive Lie groups. The general principal series representations of a connected semisimple Lie group G with finite center are constructed as follows. Let K be a maximal compact subgroup of G. Then there exists a unique involutive automorphism e of G whose fixed point set coincides with K. e is called a Cartan involution of G. Let H be a e-stable Cartan subgroup of G. Then H is the direct product of a compact group T=HnK and a vector group A. The centralizer Z(A) of A in G is the direct product of a reductive Lie group M = e(M) and A. M has a compact Cartan subgroup T. Hence the set Md of the discrete series representations of M is not empty. Let IX be an element of the dual space a* of the Lie algebra a of A and put 9. = {XE91[H,X]=IX(H)X(\fHEa)} andd= {IXE a* I 9. * {O}}. Let f'o" + be the set of posi- tive elements of d in a certain ord(:r of a* and put n =L.E' 9. and N =expn. Then P= MAN is a closed subgroup of G. P is called a cuspidal parabolic subgroup of G. Let DE Md and v E a*. Then a unitary representation D @ e iv of P 
1661 is defined by (D @ e iv ) (man) = D(m)eiv(loga) (m E M, a E A, n EN). The unitary representation n D , v of G induced by D @ e iv is independent of the choice of d + up to equivalence. Thus n D . v depends only on (H, D, v). The set of represen- tations {nD.vIDEM d , vEa*} is called the prin- cipal H-series. Ifv is regular in a* (i.e., (v,()()#O for all ()(Ed), then nD,v is irreducible. Every nD,v is a finite sum of irreducible representations. The character 0D. v of n D . v is a locally sum- mable function which is supported in the closure of UqEGg(M A)g-l. If two Cartan subgroups HI and H 2 are not conjugate in G, then every HI-series representation is disjoint from every H 2 -series representation. Choose a complete system {HI,"" H,} of conjugacy classes of Cartan subgroups of G. Then every Hi can be chosen as O-stable. The union of the principal Hi-series (1  i  r) is the principal series of G. The right (or left) regular represen- tation of G is decomposed as the direct integral of the principal series representations. Every complex-valued COO-function on G with com- pact support has an expansion in terms of the matrix coefficients of the principal series repre- sentations. Harish-Chandra [18] proved these theorems and determined explicitly the Plan- cherel measure by studying the asymptotic behavior of the Eisenstein integral [19,20]. Y. Spherical Functions Let G be a locally compact tunimodular group and K a compact subgroup of G. The set of all complex-valued continuous functions on G that are invariant under every left translation Lk by elements k in K is denoted by C(K\ G). The subset of C(K\ G) that consists of all two- sided K-invariant functions is denoted by C(G, K). The subset of C(G, K) consisting of all functions with compact support is denoted by L=L(G,K). L is an algebra over C if the prod- uct of two elements f and 9 in L is defined by the convolution. Let A be an algebra homomorphism from L into C. Then an element of the eigenspace F(.A.) = {t/1E C(K, G)lf * t/1 =.A.(j)t/1 (\tfEL)} is called a spherical function on K\ G. If F(.A.) contains a nonzero element, then F(.A.) contains a unique two-sided K-invariant element W normalized by w(e)= 1, where e is the identity element in G. This function w is called the zonal spher- ical function associated with .A.. In this case, the homomorphism .A. is defined by .A.(j) = SGf(g)W(g-l)dg. Hence the eigenspace F().) is uniquely determined by the zonal spherical function w. A function w # 0 in C( G, K) is a zonal spherical function on K\ G if and only if w satisfies either of the following two con- ditions: (i) The mappingf U(g)w(g-l)dg is 437 Z Unitary Representations an algebra homomorphism of L into C; (ii) w satisfies the functional equation L w(gkh)dk=w(g)w(h). When G is a Lie group, every spherical func- tion is a real analytic function on K\G. Z. Expansion by Spherical Functions In this section, we assume that the algebra L of two-sided K -invariant functions is com- mutative. In this case there are sufficiently many spherical functions of K\G, and two- sided K-invariant functions are expanded by spherical functions. An irreducible unitary representation U of G is called a spherical representation with respect to K if the represen- tation space tI(U) contains a nonzero vector invariant under every U b where k runs over K. By the commutativity of L, the K -invariant vectors in tI(U) form a I-dimensional sub- space. Let x be a K-invariant vector in tl(U) with the norm IIxll = 1. Then w(g)=(Uqx,x) is a zonal spherical function on K\G, and for every y in tI(U), thefunction qJy(g)=(Uqx,y) is a spherical function associated with w. Moreover, in this case the zonal spherical function w is a positive definite function on G. Conversely, every positive definite zonal spher- ical function w can be expressed as w(g)= (Uqx, x) for some spherical representation U and Some K-invariant vector x in tI(U). The set of all positive definite zonal spher- ical functions becomes a locally compact space Q by the topology of compact convergence. The spherical Fourier trausform J of a function fin L 1 (K\ G) is defined by J(w) = Lf(9)w(g-l)d 9 . There exists a unique tRadon measure fl. on Q such that for every fin L, f belongs to L 2 (Q, fl.). Also, the Plancherel formula Lf(S )9(S) dS= Lf(W )g(W) dfl.(W) (4) holds for every f and 9 in L, and an inversion formula f(s) = Sf! J(w)w(s)dfl.(w) holds for a sufficiently nice two-sided K-invariant func- tion f[21]. Identifying a positive definite zonal spherical function with the corresponding spherical representation, we can regard Q as a subset of the dual G of G. The Plancherel formula for two-sided K-invariant functions is obtained from the general Plancherel formula on G by restricting the domain of the integral from G to Q. When G is a Lie group and L is commutative, a spherical function on K\ G can be characterized as a simultaneous eigenfunc- 
437 AA Unitary Representations tion of G-invariant linear differential operators on K\G. AA. Spherical Function on Symmetric Spaces The most important case where the algebra L=L(G,K) is commutative is when K\G is a tweakly symmetric Riemannian space or, in particular, a tsymmetric Riemannian space When K\G is a compact symmetric Riemann- ian space, a spherical representation with respect to K is the irreducible component of the regular representation Ton K\ G, and a spherical function on K\ G is a function that belongs to the irreducible subs paces in L 2 (K\G). In particular, ifG is a compact con- nected semisimple Lie group, the highest weights of spherical representations of G with respect to K are explicitly given by using the Satake diagram of K\G. The Satake diagram of K\ G is the tSatake diagram of the noncom- pact symmetric Riemannian space K\ Go dual to K\ G or the Satake diagram of the Lie alge- bra of Go. If a symmetric space is the under- lying manifold of a compact Lie group G, then G can be expressed as G = K\(G x G), where K is the diagonal subgroup of G x G. In this case, a zonal spherical function won G = K\(G x G) is the normalized character of an irreducible unitary representation V of G: w(g) = (deg V)-I T.V q . The explicit form of w is given by tWeyl's character formula ( 249 Lie Groups). The zonal spherical functions on a sym- metric Riemannian space K\ G of noncom pact type are obtained in the following way: Let G be a connected semisimple Lie group with finite center, K be a maximal compact sub- group of G, and G= N A+K be an tIwasawa decomposition. Then for any 9 in G there exists a unique element H(g) in the Lie algebra tl+ of A+ such that 9 belongs to N expH(g)K. Let tl be a Cartan subalgebra containing tl+, P be the set of all positive roots in tl, and p = (LEP ex)/2. Then for any complex-valued linear form v on tl +, the function w,(g) = IK eU,-p)(H(kg» dk is a zonal spherical function on the symmetric Riemannian space K\G. Conversely, every zonal spherical function w on K\G is equal to w, for some v. Two zonal spherical functions w, and W,' coincide if and only if v and v' are conjugate under the operation of the Weyl group W o =NK(A)/ZK(A) of K\G (Harish- Chandra [22], S. Helgason [23J). If v is real- valued, then w, is positive definite. Such a zonal spherical function w, is obtained from a spherical representation belonging to the 1662 principal A-series. Let Qo be the set of all zonal spherical functions w, associated with the real-valued linear form v. Then. the support of the Plancherel measure J.1. on K\ G is con- tained in Qo. We can choose v as a parameter on the space Qo. Then the right-hand side of the Plancherel formula can be expressed as an integral over the dual space L of Cl+. More- over, the Plancherel measure J.1. is absolutely continuous with respect to the Lebesgue mea- sure dv on the Euclidean space L .and can be expressed as dJ.1.(w,)=w(j'lc(v)I- 2 dv under suitable normalization of J.1. and dv. The problem of calculating the function c(v) can be reduced to the case of symmetric spaces of rank 1 and can be solved explicitly. Let p be the multiplicity of a restricted root ex and I(v) be the product I(v)= Il B(tp.Jp.+(v,ex) (ex,ex)-I), > where ex runs over all positive restricted roots and B is the tbeta function. Then c(v)=I(iv)/ I(p) [20,24]. Every spherical function f on K\ G is expressed as the Poisson integral of its "boundary values" on the Martin boundary P\G of K\G, where P=MA+ N is a minimal parabolic subgroup of G. The boundary values of f form a hyperfunction with values in a line bundle over P\G (K. Okamoto et al. [25J). HH. Spherical Functions and Special Functions Some important special functions are obtained as the zonal spherical functions on a certain symmetric Riemannian space M = K\G (G is the motion group of M). In particular when M is of rank t, then the zonal sph,rical func- tions are essentially the functions of a single variable. For example, the zonal spherical functions on an n-dimensional Euclidean space can be expressed as W,(r) = 2 m f"(m + 1)(vr)-m Jm(vr), where 2m = n - 2 and J m is the tBessel function of the mth order. The zonal spherical function on an (n -I)-dimensional sphere 8"-1 = SO(n-l)\SO(n) is given by w,(8) = f"(v + 1 )f"(n - 2)f"(v + n - 2) -I C:(cos 0) (v=0,1,2,..." where C:(z) is the tGegenbauer polynomial. The .tonal spherical functions on an (n-t)- dimensional Lobachevskii space can be ex- pressed as w,(t) = 2 m - 1/2 f"(m + 1/2)sinh- m + 1/2 , x I.plj?zFn m +v (cosh t) 
1663 using a generalized tassociated Legendre func- tion ;,. Many properties of special functions can be proved from a group-theoretic point of view. For example, the addition theorem is merely the homomorphism property Ugh = U g U h expressed in terms of the matrix com- ponents of U. The differential equation satis- fied by these special functions is derived from the fact that a zonal spherical function OJ is an eigenfunction of an invariant differential oper- ator. The integral expression of such a special function can be obtained by constructing a spherical representation U in a certain func- tion space and calculating explicitly the inner product in the expression w(g) = (Ugx, x) (N. Va. Vilenkin [26]). cc. Generalization of the Theory of Spherical Functions The theory of spherical functions described in Sections Y - BB can be generalized in several ways. First, spherical functions are related to the trivial representation of K. A generaliza- tion is obtained if the trivial representation of K is replaced by an irreducible representation of K. The theory of such zonal spherical func- tions is useful for representation theory [20]. For example, the Plancherel formula for SL(2, R) can be obtained using such spherical functions (R. Takahashi, Japan. J. Math., 31 (196 I)). Harish-Chandra's Eisenstein integral is such a spherical function on a general semi- simple Lie group G. He used it successfully to obtain the Plancherel measure of G. Another generalization can be obtained by removing the condition that K is compact. In particu- lar, when K\ G is a symmetric homogeneous space of a Lie group G, the algebra 9 of all G- invariant linear differential operators is com- mutative if the space K\ G has an invariant volume element. In this case, a spherical func- tion on K\ G can be defined as a simultaneous ei'genfunction of fiJ. The character of a semi- simple Lie group is a zonal spherical fUnc- tion (distribution) in this sense. The spherical functions and harmonic analysis on sym- metric homogeneous space have been studied by T. Oshima and others. T. Oshima and J. Sekiguchi [27] proved the Poisson integral theorem ( Section AA) for a certain kind of symmetric homogeneous spaces. The spherical functions and unitary repre- sentations of topological groups that are not locally compact are studied in connection with probability theory and physics. For example, the zonal spherical functions of the rotation group of a real Hilbert space are expressed by Hermite polynomials. 437 D D Unitary Representations DD. Discontinuous Subgroups and Representations Let G be a connected semisimple Lie group and r be a discrete subgroup of G. Then the regular representation T of G on r\ G is de- fined by (f)(x)= f(xg) (jEU(r\G)). The problem of decomposing the representation T into irreducible components is important in connection with the theory of tautomorphic forms and number theory. First assume that the quotient space r\ G is compact. Then for every function fin L 1 (G), the operator T(j) is a compact operator. Hence the regular repre- sentation Ton r\ G can be decomposed into the discrete sum T= LI T(k) of irreducible unitary representations T(k), and the multiplic- ity of every irreducible component is finite. The irreducible unitary representation U of G is related to the automorphic forms of r in the following way: Let x be a nonzero element in the representation space f, = f,(U) of U. f, is topologized into a tlocally convex topological vector space f,x by the set N x of tseminorms: Nx={Pdy)=maXgECI(Uqx,Y)I}, where C runs over all compact subsets in G. The topology .UJ x of f,x is independent of the choice of x provided that dim {x I k E K} < 00, where K is a maximal compact subgroup of G. Let f,* be the completion of f,x with respect to the topo- logy :Yx (the completion is independent of the choice of x), f,* contains the original Hil- bert space f, as a subspace. Then the repre- sentation U of G on f, can be extended to a representation U* of G on the space f,*. An element f in f,* invariant under U';' for every y in r is called an automorphic form of r of type U. Then the multiplicity of an irreducible representation U in the regular representation T on r\ G is eq ual to the dimension of the vector space consisting of all automorphic forms of type U. This theorem is called the Gel'fand- Pyatetskii-Shapiro reciprocity law [28]. Let T= LI T(k) be the irreducible de- composition of T and Xk be the character of the irreducible unitary representation T(k). Then for a suitable function f on G, the in- tegral operator Kf on f,(T)=U(T\G) with kernel kf(x, y) = LyEr f(x -I yy) belongs to the trace class. By calculating the trace of K fin two ways, the following trace formula is obtained: t f f(9)Xd9)d9=I ! f(XIYX)dx, k-1 G ['yl 1), where {y} is the conjugate class ofy in rand :Dy is the quotient space of the centralizer Gyof y in G by the centralizer ry of y in r. When the groups G and r are given ex- plicitly, the right-hand side of the trace for- mula can be expressed in a more explicit form, 
437 EE Unitary Representations and the trace formula leads to useful conse- quences. A similar trace formula holds for the unitary representation U L induced by a finite- dimensional unitary representation L of r instead of the regular representation T on r\ G. When the quotient space r\ G is not compact, the irreducible decomposition of the regular representation Ton r\ G contains not only the discrete direct sum but also the direct integral (continuous spectrum). A. Selberg showed that even in this case, there are explicit examples for which the trace formula holds for the part with discrete spectrum. Also, the part with continuous spectrum can be described by the tgeneralized Eisenstein series. Analytic properties and the functional equation of the generalized Eisenstein series have been studied by R. Langlands [30]. Recent developments are surveyed in [31]. EE. History Finite-dimensional unitary representations of a finite group were studied by Frobenius and Schur (1896-1905). In ] 925, tWeyl studied the finite-dimensional unitary representation of compact Lie groups. The theory of infinite- dimensional unitary representation was init- iated in 1939 by E. P. Wigner in his work on the inhomogeneous Lorentz group, motivated by problems of quantum mechanics. In 1943, Gel'fand and D. A. Raikov proved the existence of sufficiently many irreducible unitary representations for an arbitrary locally compact group. The first systematic studies of unitary representations appeared in 1947 in the work of V. Bargmann on SL(2, R) [31] and the work of Gel'fand and Neumark on SL(2, C). Gel'fand and Naimark established the theory of unitary representation for com- plex semisimple Lie groups [12]. Harish-Chandra proved theorems concern- ing the unitary representations of a general semisimple Lie group; for instance, he proved that a semisimple Lie group G is of type I [7] and defined the character of a unitary repre- sentation of G and proved its basic properties [9, III; 10]. Harish-Chandra also determined the discrete series of G and their characters. Harish-Chandra [18] proved the Plancherel formula for an arbitrary connected semisimple Lie group G with finite center. Hence har- monic analysis of square integrable functions on G is established. Further studies on harmonic analysis on semisimple Lie groups have been carried out. In particular, Paley- Wiener-type theorems, which determine the Fourier transform image 1664 of the space Cc(G) of COO-functions with com- pact support, have been proved for the group PSL(2, R) (L. Ehrenpreis and F. Mautner [33]), complex semisimple Lie groups (Zhelo- benko [34]), and two-sided K-invariant func- tions on general semisimple Lie groups (R. Gangolli [35]). A. W. Knapp and E. M. Stein [36] studied the intertwining operators. Concerning the construction of irreducible representations, G. W. Mackey [3] and Bruhat [4] developed the theory of induced represen- tations oflocal1y compact groups and Lie groups, respectively. B. Kostant [37] (see Biattner's articie in [38J) noticed a relation between homogeneous tsymplectic manifolds and unitary representations and proposed a method of constructing irreducible unitary representations of a Lie group. Selberg's re- search [29] revealed a connection between unitary representations (or spherical functions) and the theory of automorphic forms and number theory. A number of papers along these lines have since appeared [3]]. In con- nection with number-theoretic inv,stigations of an talgebraic group defined over an alge- braic number field, unitary representations of the tadele group of G or an algebraic group over a tp-adic number field have been studied ( [31,38], Gel'fand, M. I. Grayev, and I. I. Pyatetskii-Shapiro [39J, and H. M. Jacquet and R. P. Langlands [40J). For the algebraic approach to the infinite- dimensional representations of semisimple Lie groups and Lie algebras  [41]. For surveys of the theory of unitary repre- sentations  [2, 19,20,31,38]. References [1] N. Tatsuuma, A duality theorem for lo- cally compact groups, J. Math. Kyoto Univ., 6 (1967),187-293. [2J J. Dixmier, Les C*-algebres et leurs repre- sentations, Gauthier-Villars, 1964. [3J G. W. Mackey, Induced representations of 10cal1y compact groups I, Ann. Math., 55 (1952),101-139. [4J F. Bruhat, Sur les representations induites des groupes de Lie, Bull. Soc. Math. France, 84 (1956), 97-205. [5] R. Bott, Homogeneous vector bundles, Ann. Math., 66 (1957), 203-248. [6J E. Nelson, Analytic vectors, Ann. Math., 70 (1959),572-615. [7J A. A. KiriIlov, Unitary represetltations of nilpotent Lie groups, Russian Math. Surveys, 17, no. 4 (1962), 53-104. (Original in Russian, 1962.) [8J L. Auslander and B. Kostant, Polarization 
1665 and unitary representations of solvable Lie groups, Inventiones Math., 14 (1971), 255 354. [9J Harish-Chandra, Representations of a semisimple Lie group on a Banach space I-VI, Trans. Amer. Math. Soc., 75 (1953),185-243; 76 (1954), 26-65; 76 (1954), 234-253; Amer. J. Math., 77 (1955), 743-777; 78 (1956), 1-41; 78 (1956),564-628. [IOJ Harish-Chandra, Invariant eigendistri- butions on a semisimple Lie group, Trans. Amer. Math. Soc., 119 (1965), 457-508. [11J E. M. Stein, Analysis in matrix spaces and some new representations of SL(N, C), Ann. Math., (2) 86 (1967), 461 490. [12J I. M. Gel'fand and M. A. Naimark (Neu- mark), Unitiire Darstellungen der klassischen Gruppen, Akademische Verlag., 1957. (Original in Russian, 1950.) [13J Harish-Chandra, Discrete series for semi- simple Lie groups I, II, Acta Math., 113 (1965), 241-318; 116(1966), 1-111. [14J T. Hirai, The characters of the discrete series for semisimple Lie groups, J. Math. Kyoto Univ., 21 (1981),417-500. [15J W. Schmid, U-cohomology and the discrete series, Ann. Math., 102 (1976), 375- 394. [16J M. F. Atiyah and W. Schmid, A geo- metric c'onstruction of the discrete series for semisimple Lie groups, Inventiones Math., 42 (1977), 1-62. [17J N. Wallach, On the Enright-Varadarajan modules: A construction of the discrete series, Ann. Sci. Ecole Norm. Sup., 9 (1976), 81-102. [18J Harish-Chandra, Harmonic analysis on real reductive groups I-III, J. Functional Ana!., 19 (1975), 104-204; Inventiones Math., 36 (1976),1-55; Ann. Math., 104 (1976),117- 201. [19J J. A. Wolf, M. Cahen, and M. De Wilde (eds.), Harmonic analysis and representations of semisimple Lie groups, Reidel, 1980. [20J G. Warner, Harmonic analysis on semi- simple Lie groups I, II, Springer, 1972. [21J H. Yoshizawa, A proof of the Plancherel theorem, Proc. Japan Acad., 30 (1954),276- 281. [22J Harish-Chandra, Spherical functions on a semisimple Lie group I, II, Amer. J. Math., 80 (1958), 241-310, 553-613. [23J S. Helgason, Differential geometry and symmetric spaces, Academic Press, 1962. [24J J. Rosenberg, A quick proof of Harish- Chandra's Plancherel theorem for spherical functions on a semisimple Lie groups, Proc. Amer. Math. Soc., 63 (1977),143-149. [25J M. Kashiwara, A. Kowata, K. Minemura, K. Okamoto, T. Oshima, and M. Tanaka, Eigenfunctions of invariant differential 437 Ref. Unitary Representations operators on a symmetric space, Ann. Math., 107 (1978), 1-39. [26J N. Ya Vilenkin, Special functions and theory of group representations, Amer. Math. Soc. Trans!. of Math. Monographs, 22, 1968. (Original in Russian, 1965.) [27J T. Oshima and J. Sekiguchi, Eigenspaces of invariant differential operators on an affine symmetric space, Inventiones Math., 57 (1980), 1-81. [28J I. M. Gel'fand and I. I. Pyatetskii- Shapiro, Theory of representations and theory of automorphic functions, Amer. Math. Soc. Trans!., (2) 26 (1963),173-200. (Original in Russian, 1959.) [29J A. Selberg, Harmonic analysis and dis- continuous groups in weakly symmetric Riemannian spaces with applications to Dirichlet series, J. Indian Math. Soc., 20 (1956),47-87. [30J R. P. Langlands, On the functional equa- tions satisfied by Eisenstein series, Lecture notes in math. 544, Springer, 1976. [31J Automorphic forms, representations and L-functions, Amer. Math. Soc. Proc. Symposia in Pure Math., 33, 1979. [32J V. Bargmann, Irreducible unitary repre- sentations of the Lorentz group, Ann. Math., (2) 48 (1947), 568-640. [33J L. Ehrenpreis and F. I. Mautner, Some properties of the Fourier transform on semi- simple Lie groups I-III, Ann. Math., (2) 61 (1955),406-439; Trans. Amer. Math. Soc., 84 (1957), 1 55; 90 (1959), 431-484. [34J D. P. Zhelobenko, Harmonic analysis of functions on semisimple Lie groups I, II, Amer. Math. Soc. Trans!., (2) 54 (1966), 177- 230. (Original in Russian, 1963, 1969.) [35J R. GangolIi, On the Plancherel formula and the Paley-Wiener theorem for spherical functions on a semisimple Lie group, Ann. Math., 93 (1971),150-165. [36J A. W. Knapp and E. M. Stein, Intertwin- ing operators for semisimple groups, Ann. Math., 93 (1971), 489-578. [37J B. Kostant, Quantization and unitary representations, Lecture notes in math. 170, Springer, 1970,87-207. [38J Harmonic analysis on homogeneous spaces, Amer. Math. Soc. Proc. Symposia in Pure Math., 26 (1973). [39J I. M. Gel'fand, M. I. Grayev, and I. I. Pyatetskii-Shapiro, Representation theory and automorphic functions, Saunders, 1969. (Original in Russian, 1966.) [40J H. Jacquet and R. P. Langlands, Auto- morphic forms on G L(2), Lecture notes in math. 114, Springer, 1970. [41 J D. A. Vogan, Representations of real reductive Lie groups, Birkhiiuser, 1981. 
438 A Univalent and Multivalent Functions 438 (XI.5) Univalent and Multivalent Functions A. General Remarks A single-valued tanalytic function f(z) defined in a domain D of the complex plane is said to be univalent (or simple or schlicht) if it is injec- tive, i.e., if [(z d #f(Z2) for all distinct points z 1, Z2 in D. A multiple-valued function [(z) is also said to be univalent if its distinct function elements always attain distinct values at their centers. The derivative of a univalent function is never zero. The limit function of a tuni- formly convergent sequence of univalent func- tions is univalent unless it reduces to a con- stant. When }(z) is single-valued, the univalent function w= [(z) gives rise to a one-to-one tconformal mapping between D and its image f(D). B. Univalent Functions in the Unit Disk A systematic theory of the family of functions tholomorphic and univalent in the unit disk originates from a distortion theorem obtained by P. Koebe (1909) in connection with the uniformization of analytic functions. In gen- eral, distortion theorems are theorems for determining bounds of functionals, such as If(z)l, If'(z)l, argf'(z), within the family under consideration. In particular, distortion theo- rems concerning the bounds of the arguments of f(z) and 1'(z) are also called rotation theo- rems. Though results were at first qualitative, they were made quantitative subsequently by L. Bieberbach (1916), G. Faber (1916), and others. Any univalent function f(z) holo- morphic in the unit disk and normalized by f(O) = 0 and ['(0) = 1 satisfies the distortion inequalities Izl Izl (1 + Izl)2  If(z)l (1 -lzI)2 ' l-izi 1+lzl (1 + Izl)3  If'(z)i  (1 -lzl)3 . Here the equality holds only if f(z) is of the form z/(1-£z)2(1r.1= 1). In deriving these in- equalities, Bieberbach centered his attention on the family oftmeromorphic functions g(O =( + I.ohv(-V univalent in 1(1 > 1. He es- tablished the area theorem I.%1 vlh v l 2  1, which illustrates the fact that the area of the complementary set of the image domain is nonnegative. Bieberbach, R. Nevanlinna (1919-1920), and others constructed a sys- 1666 tematic theory of univalent functions in the unit disk based on this theorem. After the area theorem, the chief tools in the theory of univalent functions have been Lowner's method, the method of contour integration, the variational method, and the method of the extremal metric. In contrast to the theory of univalent functions based on Bieberbach's area theorem, K. Lowner (1923) introduced a new method. In view of a theo- rem on the domain kernel (c. Caratheodory, 1912), it suffices to consider an everywhere dense subfamily in order to estimate a con- tinuous functional within the family of univa- lent functions holomorphic in the unit disk. Lowner used the subfamily of functions map- ping the unit disk onto the so-called bounded slit domains. Namely, the range of a member of this subfamily consists of the unit disk slit along a Jordan arc that starts at a periphery point and does not pass through the origin. A mapping funetion of this nature is deter- mined as the integral f(z, to) of Lowner's dif- ferential equation of(z, t) 1 + K(t)f(z, t) ot -= -f(Z,t) I_K(t)f(z,t) ' Otto, with the initial condition f(z,O)=2', where K(t) is a continuous function with absolute value equal to 1. Any univalent function f(z) holo- morphic in the unit disk and satisfying [(0) = 0,1'(0) = 1 has an arbitrarily close approxi- mation by functions of the form el)f (z, to)' By means of this differential equation Lowner proved that 1 a 3 1  3 for any univalent function f(z)= z + I.2 anz n (Izl < 1) and also derived a decisive estimate concerning a coefficient problem for the inverse function [2]. G. M. Golusin (1935) and I. E. Bazilevich (1936) first noticed that Lowner's method is also a powerful tool for deriving several distor- tion theorems. They showed that classical distortion theorems can be derived in more detailed form (Golusin, Mat. Sh., 2 (1937),685); in particular, Golusin (1938) obtained a precise estimate concerning the rotation theorem, i.e., I argf'(z)1 { 4arcsin l z l ,  n+log(lzI 2 /(1-lzI 2 )), Izl < 1/j2, 1/j2lzl<1. Lowner's method was also investigated by A. C. Schaeffer and D. C. Spencer (1945) [8]. The method of contour integration was introduced by H. Grunsky. It starts with some 2-dimensional integral which can be shown to be positive. Transforming it into a boundary integral and using the tresidue theorem, we obtain an appropriate inequality by means of this integral. By this method Grumky estab- lished the following useful inequality (Math. 
1667 Z., 45 (1939)) For g(()=( + LoM-V, which is univalent in 1'1> I, let I g(z)-g(()   -my-n og "- L., L., cmnz S z-s m=l n=l (Izl> 1,1'1> I). The coefficients c mn are polynomials in the coefficients b v of q. Then Grunsky's inequality is: For each integer N and for all complex numbers )'1' ..., J'N, I NN I N 1 2..: I CmnAm}'n  I -1)'nI 2 m-l n=l n=1 n It is known that if this inequality holds for an arbitrary integer N and for all complex numbers A h ... ).N, then 9 (() is univalent in I" > 1. There are several generalizations of Grunsky's inequality [13]. The variational method was first developed by M. Schiffer for application to the theory of univalent functions. He first used boundary variations (Proc. London Math. Sac., 44 (1938)) and later interior variations (Amer. J. Math., 65 (1943)). The problem of maximizing a given real-valued functional on a family of univalent functions is called an extremal problem, and a function for which the functional attains its maximum is called an extremal function. The variational method is used to uncover charac- teristic properties of an extremal function by comparing it with nearby functions. Typical results are the qualitative information that the extremal function maps the disk Izi < I onto the complement of a system of analytic arcs satisfying a differential equation and that the extremal function satisfies a differential equa- tion. Following Schiffer, Schaeffer and Spen- cer [8J and Golusin (Math. Sb., 19 (1946)) gave variants of the method of interior variations. H. Grotzsch (1928-1934) treated the theory of univalent functions in a unified manner by the method of the textremal metric. The idea of this method is to estimate the length of curves and the area of some region swept out by them together with an application of tSchwarz's inequality ( 143 Extremal Length). After Grotzsch, the method of the ex- tremal metric has been used by many authors. In particular, O. Teichmiiller, in connection with this method, formulated the principle that the solution of a certain type of extremal problem is in general associated with a tqua- dratic differential, although he did not prove any general result realizing this principle in concrete form J. A. Jenkins gave a concrete ex- pression of the TeichmUller principle; namely, he established the general coefficient theo- rem and showed that this theorem contains as special cases a great many of the known results on univalent functions [II]. 438C Univalent and Multivalent Functions Univalence criteria have been given by various authors. In particular, Z. Nehari (Bull. Amer. Math. Soc., 55 (1949)) proved that if I {f(z), z} I  2(1-lzI 2 )-2 in Izi < 1, then j(z) is univalent in Izl < 1, and E. Hille (Bull. Amer. Math. Soc., 55 (1949)) proved that 2 is the best possible constant in the above inequality. Here, {f(z). z} denotes the tSchwarzian deriva- tive of J(z) with respect to z: ( /"(Z) ) ' 1 ( /,,(Z) ) 2 {J(z),z}= f'(z) -2" f'(z) C. Coefficient Problems In several distortion theorems Koebe's ex- tremal function z/(1_/;z)2 = L1 n£n-l zn(I£1 = 1) is extensively utilized. Concerning this, Bieberbach stated the fo1\owing conjecture. If f(z) = z + L2 anz n is holomorphic and univa- lent in Izi < I, then lanl  n (n = 2, 3, ...), with equality holding only for Koebe's extremal function z/(1 - a)2 (It I = 1). This conjecture was solved affirmatively by L. de Branges in 1985 after enormous effort by many mathema- ticians, as described below. Bieberbach (1916, [IJ) proved la 2 1  2 as a coro1\ary to the area theorem. This result can be proved easily by most of the methods. In 1923 Lowner [2J proved I a31  3, introducing his own method. Schaeffer and Spencer gave a proof of la31  3 by the variational method (Duke Math. J., 10 (1943»). Furthermore, Jen- kins used the method of the extremal metric to prove a coefficient inequality that implies la313 (Analytic Functions, Princeton Univ. Press, 1960). The problem of the fourth coeffi- cient remained open until 1955, when P. R. Garabedian and Schiffer [3J proved la41 4 by the variational method. Their proof was ex- tremely complicated. Subsequently, Z. Char- zynski and Schiffer gave an alternative brief proof of la 4 1 4 by using the Grunsky inequal- ity (Arch. Rational Mech. Anal., 5 (1960». M. Ozawa (1969, [4J) and R. N. Pederson (1968, [5J) also used the Grunsky inequality to prove la616. In 1972, Pederson and Schiffer [6J proved I asl  5. They applied the Garabedian- Schiffer inequality, a generalization of the Grunsky inequality which Garabedian and Schiffer had derived by the variational method. On the other hand, W. K. Hayman [7J showed that for each fixed j(z) = z + L2 anz n , . lanl hm-=()(I, n---+-::fJ n with the equality holding only for Koebe's extremal function z/(I- £z)2 (1/;1 = 1). Further, it was shown that Koebe's extremal function z/(1- Z)2 gives a local maximum for the nth 
4380 Univalent and Multivalent Functions coefficient in the sense that Re{ an}:;:;; n when- ever la 2 -21<b n for some bn>O (Garabedian, G. G. Ross, and Schiffer, J. Math. Mech., 14 (1964); E. Bombieri, Inventiones Math., 4 (1967); Garabedian and Schiffer, Arch. Ra- tional Math. Anal., 26 (1967)). In the most general form, the coefficient problem is to determine the region occupied by the points (a 2 , "', an) for all functions I(z) = z + L2 anz n univalent in Izi < 1. Schaeffer and Spencer [8J found explicitly the region for (a2' a 3 ). For the coefficients of functions g(O=' + LobvCv univalent in 1'1> 1, the following results are known: Ib 1 1:;:;; 1 (Bieberbach [lJ), I b 2 1 :;:;; 2/3 (Schiffer, Bull. Sac. Math. France, 66 (1938); Golusin, Mat. Sb., 3 (1938)), Ib 3 1:;:;; 1/2 + e- 6 (Garabedian and Schiffer, Ann. Math., (2) 61 (1955)). D. Other Classes of Univalent Functions We have discussed the general family of func- tions univalent in the unit disk. There are also several results on distortion theorems and coefficient problems for subfamilies deter- mined by conditions such as that the images are bounded, tstarlike with respect to the origin, or tconvex. For instance, if I(z)=z+ L2anZn is holomorphic and univalent in Izi < 1 and its image is starlike with respect to the origin, then I an I:;:;; n (n = 2,3, ...). If the im- age of I(z) is convex, then I(z) satisfies lanl:;:;; I (n = 2, 3, ... ) and the distortion inequalities Izi Izi 1 + Izi :;:;;1/(z)l:;:;; l-I z l ' 1 1 (1 +lzl)2 :;:;;1/'(z)1:;:;; (I_lzl)2 ' Here the equality sign appears at Zo (0 < Izol < 1) if and only if I(z) is of the form z/(1 + ez) with e=:t Izoi/zo. On the other hand, problems on conformal mappings of multiply connected domains involve essential difficulties in comparison with the simply connected case. Although Bieberbach's method is unsuitable for multi- ply connected domains, Lowner's method, the method of contour integration, the varia- tional method, and the method of the ex- tremal metric remain useful ( 77 Con- formal Mappings). E. Multivalent Functions Multivalent functions are a natural generaliza- tion of univalent functions. There are several results that generalize classical results on univalent functions. 1668 A function I(z) that attains every value at most p times and some values exactly p times in a domain D is said to be p-valent in D and is called a multivalent function provided that p> 1. In order for f(z) = Lo anz n , holomorphic in Izl:;:;; 1, to be p-valent there, it is sufficient that it satisfies p-l <Re(z/,(z)/I(z))<p+ 1 on Izl= 1. Hence it suffices to have p CD lapl- L nlap+r-nl> L nla p -1+nl. n=2 n=2 Iff(z)=(1 +a1z+a2z2+ ...)/zP is holomorphic and p-valent in O<lzl:;:;; 1, then d f 2" 1 - F(I/(re i )l)d8:;:;;0 dr 0 for any increasing function F(p) in p  0. In particular, if F(p) = p2, this becomes an area theorem from which follow coefficient esti- mates, etc., for p-valent functions. Various subfamilies and generalized families of multivalent functions have been considered. Let I(z) be p-valent in D, and Co + C 1 Z + ... + c p - 1 zr 1 + cpl(z) be at most p-valent in D for any constants co, c 1, ..., C p' Then I(z) is said to be absolutely p-valent in D. If a function I(z) holomorphic in a convex domain K satis- fies Re(eial(p)(z)) > ° for a real constant Ct., then }(z) is absolutely p-valent in K. If I(z) is ab- solutely p-valent in D, then Ct: bkz k + bpl(Z)) I Ct: Ck Zk +c pl(Z)) is at most p-value in D for any constants b k and c k . If I(z) is p-valent in the common part of a domain D and the disk centered at each point of D with a fixed radius p, then I(z) is said to be locally p-valent in D, and p is caIJed its modulus. A necessary and sufficient condi- tion for I(z), holomorphic in D, to be at most locally p-valent is that /,(z), '" ,j(p)(z) not vanish simultaneously. In order for I(z), holo- morphic in D, to be locally absolutely p-valent it is necessary and sufficient that I(P)(z) *0. Let the number of Rei"-points of I(z) in D be n(D, Rei"). If f(z) satisfies 1 f 2" - n(D, Rei")drp:;:;;p, 2n 0 for any R > 0, it is said to be circumferentialIy mean p-valent in D. If I(z) satisfies IR I2" n(D,Re i ")RdRdrp:;:;;pnR 2 , it is said to be arealIy mean p-valent in D. If I(z)q with q> 1 is areally mean p-valent in D, then I(z) is are ally mean p/q-valent in D. For 
1669 J(z) =(1 +a 1 z+a Z z Z + ...)/Z!. holomorphic and areally mean A-valent in O<lzl:( 1, thefollow- ing area theorem holds: 00 L: (n-l)la n l z :(A. n==l Let E be a set containing at least three points. If J(z) in D attains every value of E at most p times and a certain value of E exactly p times (it may attain values outside E more than p times), then J(z) is said to be quasi-p- valent in D. If w = J(z) is p-valent in D and g(w) is quasi-q-valent in J(D), then g(f(z)) is at most quasi-pq-valent in D. The first success in obtaining sharp inequal- ities for multivalent functions was attained by Hayman. In his work, an essential role was played by the method of tsymmetrization. For instance, he obtained the following result. If J(z) = zP + a p + 1 zp+1 +... is holomorphic and circumferentially mean p-valent in Izi < 1, then la p +11 :(2p, and for Izi =r, 0< r< 1, r P r P (1 +r)zp :(IJ(z)I:( (1_r)zP ' , <p(1 +r) <prr1(1 +r) IJ (z)l""" r(1- r) I [(z) 1 """ (1- r)ZP+1 References [IJ L. Bieberbach, Ober die Koeffizienten derjenigen Potenzreihen, welche eine schlichte Abbildv:1g des Einheitskreises vermitteln, S.-B. Preuss., Akad. Wiss., (1916), 940-955. [2J K. Lowner, Untersuchungen liber schlichte konforme Abbildungen des Einheitskreises I, Math. Ann., 89 (1923),103-121. [3J P. R. Garabedian and M. Schiffer, A proof of the Bieberbach conjecture for the fourth coefficient, J. Rational Mech. Anal., 4 (1955), 427-465. [4J M. Ozawa, On the Bieberbach conjecture for the sixth coefficient, K6dai Math. Sem. Rep., 21 (1969),97-128. [5J R. N. Pederson, A proof of the Bieberbach conjecture for the sixth coefficient, Arch. Ra- tional Mech. Anal., 31 (1968-69), 331-351. [6J R. N. Pederson and M. Schiffer, A proof of the Bieberbach conjecture for the fifth coeffi- cient, Arch. Rational Mech. Anal., 45 (1972), 161-193. [7J W. K. Hayman, The asymptotic behavior of p-valent functions, Proc. London Math. Soc., (3) 5 (1955),257-284. [8J A. C. Schaeffer and D. C. Spencer, Coeffi- cient regions for schlicht functions, Amer. Math. Soc. Colloq. Pub\., 35 (1950). [9J G. M. Golusin, Geometric theory of func- tions of a complex variable, Amer. Math. Soc. 438 Ref. Univalent and Multivalent Functions Transl. of Math. Monographs, 26 (1969). (Original in Russian, 1952.) [IOJ W. K. Hayman, Multivalent functions, Cambridge Univ. Press, 1958. [11J J. A. Jenkins, Univalent functions and conformal mapping, Springer, second edition, 1965. [12J L. V. Ahlfors, Conformal invariants: Topics in geometric function theory, McGraw- Hill, 1973. [13J C. Pommerenke, Univalent functions (with a chapter on quadratic differentials by G. Jensen), Vandenhoeck & Ruprecht, 1975. [14J G. Schober, Univalent functions- Selected topics, Lecture notes in math. 478, Springer, 1975. [5J P. L. Duren, Coefficients of univalent functions, Bull. Amer. Math. Soc., 83 (1977), 891-911. 
439 A Valuations 439 (111.19) Valuations A. Introduction There are two related kinds of valuations, additive ( Section B) and multiplicative ( Section C). The notion of valuations, originally defined on (commutative) tfields, has been extended to more general cases ( Section K); however, we first consider the case of fields. B. Additive Valuations In this article, we mean by an ordered additive group a totaIly ordered additive group, namely, a commutative group whose operation is addition, which is a ttotally ordered set satisfy- ing the condition that a"?b and c"?d imply a +c"?b+d and -a -b. Suppose that we are given a field K, an ordered additive group G, and an element ctJ defined to be greater than any element of G. Then a mapping v: K-> G U { ctJ} is called an additive valuation (or simply a valuation) of the field K if v satisfies the following three conditions: (i) v(a) = CD if and only if a=O; (ii) v(ab)=v(a)+v(b) for all a, b#O; and (iii) v(a+b)"?min{v(a),v(b)}. The set {v(a) I aE K - {O} } is a submodule of G and is called the value group of v, while the set Rv= {aEK I v(a)"?O} is a subring of K and is called the valuation ring of v. The ring Rv has only one tmaximal ideal {alv(a»O}, called the valuation ideal of v (or of Rv), and the tresidue class field of Rv modulo the maximal ideal is called the residue class field of the valuation v. We have v(a)  v(b) if and only if aRv::::J bRv. Two valuations v and v' of the field K are said to be equivalent when v(a)  v(b) if and only if v'(a)v'(b); hence v and v' are equivalent if and only if Rv = Rv" The rank of v is defined to be the tKrull dimension of the valuation ring Rv, and the rational rank of v to be the maximum (or supremum) of the numbers of linearly independent elements in the value group. An extension (or prolongation) of v in a field K' containing K is a valuation v' of K' whose restriction on K is v; such an extension exists for any given v and K'. Sometimes a valuation of rank I is called a special valuation (or ex- ponential valuation), and a valuation of a gen- eral rank is called a generalized valuation. On the other hand, if k is a subfield of K such that v(a) = 0 for every nonzero element a of k, then v is called a valuation over the subfield k. c. Multiplicathe Valuations A multiplicative valuation (or valuation) of a field K is a mapping w: K ->r U {O} that satis- 1672 fies the following three conditions, where r is the multiplicative group of positive real num- bers: (i) w(a) = 0 if and only if a = 0; (ii) w(ab) = w(a)w(b); and (iii) w(a + b)  C(w(a) + w(b)), where C is a constant (independent of the choice of a and b, but dependent on the choice ofw). The value group of w is defined to be {w(a) I aEK - {O} }. Extensions of a valuation and equivalence of valuations are defined as in the case of additive valuations. Thus w' is equivalent to w if and only if there is a positive r such that for all aEK, w(a)=w'(a)'. In each equivalence class of valuations of a field, there exists a valuation for which the constant C in condition (iii) can be taken to be 1. A valu- ation w is said to be a valuation oVIr a subfield k if w(a) = 1 for any nonzero element a of k. We call w an Archimedean valuation iffor any elements a, bE K, a # 0, there exists a natural number n such that w(na) > w(b); otherwise, w is said to be a non-Anhimedean valuation. If w is an Archimedean valuation of a field K, then there is an injection a from K into the complex number field C such that w is equivalent to the valuation w' defined by w'(a) = la(a)l. If w is a non-Archimedean valuation of a field K, then w(a+ b)max{w(a), w(b)}. Hence in this case we get an additive valuation v of K when we define v(a) = -log w(a) (aE K), and either v is of rank 1 or v( K) = {I, O} (in the latter case, v is called trivial). Conversely, every additive valuation of rank I of K is equivalent to an additive valuation obtained in this way from a non-Archimedean valuation. (This is why an additive valuation of rank I is called an exponential valuation.) Therefore a non- Archimedean valuation determines a valuation ring and valuation ideal in a natural manner. Thus we can identify a non-Archimedean valuation with an additive valuation of rank \. D. Topology Defined by a Valuation Let w be a multiplicative valuation of a field K. When the tdistance between two elements a, b of K is defined by w(a - b), K btcomes a ttopological field. (Although this distance may not make K into a tmetric space, there exists a valuation w' equivalent to the valuation w such that K becomes a metric space with respect to the distance w'(a-b) between a and b (a, bE K).) If K is tcomplete under the topol- ogy, then we say that K is complete with respect to wand w is complete on K. On the other hand, suppose that w' is an e.tension of w in a field K' containing K. If w' is complete and K is tdense in K' under the topology de- fined by w', then we say that the valuatiori w' is a completion of wand that the field K' is a 
1673 completion of K with respect to w. For any w, a completion exists and is unique up to iso- morphism. When w is a non-Archimedean valuation, the valuation ring of the completion of w is called the completion of the valuation ring ofw. When v is an additive valuation of a field K, we can introduce a topology on K by taking the set of all nonzero ideals of the valuation ring Rv of vas a tbase for the neighborhood system of zero. Important cases are given by valuations of rank 1, which are the same as those given by non-Archimedean valuations. Ifw is a complete non-Archimedean valu- ation of a field K, then the valuation ring Rw of w is a tHensel ring, which implies that if K' is a finite algebraic extension of K such that [K' :KJ =n, then w is uniquely extendable to a valuation w' of K' and w'(a)" = w(N(a)), where N is the tnorm NK'/K' E. Discrete Valuations For a non-Archimedean valuation (or an additive valuation of rank 1) w, if the valuation ideal of w is a nonzero tprincipal ideal gen- erated by an element p, then we say that p is a prime element for w. w is a discrete valuation, and the valuation ring for w is a discrete valu- ation ring. The condition on the valuation ideal of w holds if and only if the value group of w is a discrete subgroup of the (multiplica- tive) group r of positive real numbers: In the terminology of additive valuations, a valuation w is discrete if and only if it is equivalent to a valuation w' whose value group is the additive group of integers. Such a valuation w' is called a normalized valuation (or normal valuation). However, we usually mean normalization of a discrete non-Archimedean valuation as in Section H. Sometimes an additive valuation whose value group is isomorphic to the direct sum of a finite number of copies of Z (the additive group of integers) with a natural tlexicographic order is called a discrete valu- ation. Concerning a complete discrete valu- ation w, it is known that if the valuation ring of w contains a field, then it is isomorphic to the ring of tformal power series in one variable over a field (for other cases  449 Witt Vec- tors A). F. Examples (1) Trivial valuations of a field K are the addi- tive valuation v of K such that v(a) = 0 for all aE K - {O} and the multiplicative valuation w of K such that w(a)== I for all aE K - {O}. (2) If K is isomorphic to a subfield of the complex number field, then we get an Archi- 439G Valuations medean valuation using the absolute value, and as stated in Section C, every Archimedean valuation of K is equivalent to a valuation obtained in this way. (3) Let p be a tprime ideal of a tDedekind domain R, nEp be such that n$p2, and K be the field of quotients of R. Then each nonzero element ex of K can be expressed in the form n r ab- 1 (rEZ;a,bER;a,b$p), where r, the degree of ex with respect to p, is uniquely determined by ex. Hence, letting c be a constant greater than 1, we obtain a non-Archimedean valu- ation w defined by w(ex)=c- r . This valuation w is called a p-adic valuation. We also get an additive valuation v defined by v(ex) = r, called a p-adic exponential valuation. The completion Kp of K with respect to v is called the p-adic extension of K. If K is a finite talgebraic num- ber field, the Kp is called a p-adic number field. If p is generated by an element p, then "p-adic" is replaced by "p-adic." For instance, given a rational prime number p, we have a p-adic valuation of the rational number field Q, and we obtain the p-adic extension Qp ofQ, which is called the p-adic number field. Every non- zero element ex of Qp can be written as a uniquely determined expansion "L;';'=,anpn (a, #0, rEZ, anEZ, O an <p). Then we obtain a valuation v of Qp defined by v (ex) = r. This valuation v is a discrete additive valuation, and Qp is complete with respect to v. The valuation ring of v is usually denoted by Zp, which is called the ring of p-adic integers. Each element of Qp (Zp) is called a p-adic number (p- adic integer). (4) Consider the field oftpower series k((t)) in one variable t over a field k. For O#exE k((t)), we define v(ex) == r if ex = "L, ant n (anE k, a, #0). Then v is a discrete valuation of k((t)), and k((t)) is complete with respect to this valuation. (5) Let v be an additive valuation of a field K with the valuation ring Rv and the valuation ideal mv' Let v' be an additive valuation of the field Rv/mv with the valuation ring Rv" Then R" = {aE Rv I(amod mv)E R v '} is a valuation ring of K. A valuation v" whose valuation ring coincides with R" is called the composite of v and v'. G. The Approximation Theorem and the Independence Theorem The approximation theorem states: Let WI' .., . W n be mutually nonequivalent and nontrivial multiplicative valuations of a field K. Then for any given n elements aI' "', an of K and a positive number B, there exists an element a of K such that wi(a - aJ < B (i = 1,2, ...,n). 
439H Valuations From this follows the independence theorem: Let e b ..., en be real numbers, and let Wi and K be as in the approximation theorem. If I1 i w,(a)"i= I for all aEK -{O}, then e l =... =en=O. Similar theorems hold for additive valu- ations. The following independence theorem is basic: Let v I' ... , V n be additive val ua tions of a field K, RI'"'' Rn their valuation rings, and tlI l , ..., tn n their maximal ideals. Let D = niRi' Pi = m i n D, and consider the rings of quotients Dpi' Then D pi = R i . If Rd. Rj (for i i= j), then D has exactly n maximal ideals PI, ..., Pn' H. Prime Divisors Let K be an talgebraic number field (algebraic function field of one variable over a field k). An equivalence class of nontrivial multiplicative valuations (over k) is called a prime divisor (prime spot) of K. If K is an algebraic number field of degree n, there are exactly n mutually distinct injections 0"1, ..., O"n of K into the complex number field C. We may assume that O"i(K) is contained in the real number field if and only if i  r l and O"n-i+1 (a) and 0"" +i(a) are conjugate com- plex numbers (n-r l ;?oi>O, aEK) For irp let v,(a)=IO"i(a)l, and for 1 i(n-rd/2, let v,,+,(a)=IO"r,+,(a)1 2 . Then VI' ...,v,,+r, (r 2 = (n - rd/21 is a maximal set of mutually non- equivalent Archimedean valuations of K. Equivalence classes of V I' '" , v r , are called real (infinite) prime divisors, and those of v" +1, ..., v" +" are called imaginary (infinite) prime divisors; all of them are called infinite prime divisors. An equivalence class of non- Archimedean valuations of K is called a finite prime divisor. An Archimedean valuation of K is said to be normal if it is one of the valuations Vi' If V is non-Archimedean, then v is a p-adic valuation, where P IS a prime ideal of the principal order o of K ( Section F, example (3)). Hence if a is an element of K, there exists a constant c (c > 1) such that v(a)=c-', where r is the degree of a with respect to p. In particular, if c is the norm of P (i.e., c is the cardinality of the set o/p), then the valuation v is called normal. Any finite prime divisor is represented by a normal valuation. Then we have the product formula I1 w w(a)= 1 for all aEK -{O}, where w ranges over all normal valuations of K. For a function field, a normal valuation is defined similarly, using e f instead of the norm of P, where e is a fixed real number greater than I and r is the degree of the residue class field of the val uation over k. In this case we also have the product formula. 1674 I. Extending Valuations to an Al!:ebraic Extension of Finite Degree Assume that a field K' is a finite algebraic extension of a field K. Let v be an additive valuation of K and v' be an extension of v to K'. We denote the valuation rings, valuation ideals, and value groups of v and v' by Rv, R v " m v , m v " and G, G', respectively. Then the degree of the extension fv' = [Rv./mv': Rv/mv] is called the degree of v' over v. The group index e ,.' = [G': G] is called the ramification index of v' over v. If v' ranges over all extensions of v in K', then the sum '2:.fv.ev' is not greater than [K': K] and the equality holds when v is a discrete valuation and either K' is tseparable over K or v is complete. J. Places Let k, K, and L be fields, and suppose that k c K. Let f be a mapping of K onto L U { ctJ } such that f(ab) = f(a)f(b) and f(a + b) = f(a) + ((b), whenever the right member is meaning- ful, and such that the restriction of f to k is an injection. Here ctJ is an element adjoined to L and satisfying ctJ + a = a + ctJ = oc, ctJa = actJ = ctJ (for any nonzero element a of K), l/ctJ = 0, and 1/0 = ctJ. Then f is called a place of K over k. In this case R = {x E K I f(x) i= ctJ} is a valuation ring of K containing k. Let m be the maximal ideal of R. Then f can be identified with the mapping g: K -> R/m U { c"fJ} defined as follows: If aE R, then g(a) =(amod m); other- wise, g(a)= ctJ. Places of Kover h can be classified in a natural way, and there exists a one-to-one correspondence between the set of classes of places of Kover k and the set of equivalence classes of additive valuations over k. When K is an talgebraic function field, we usually consider the case where k is the tground field. Then if aI' ..., anE R, (aI' ..., an) ->(g(ad, ..., g(a n )) gives a tspecialization of points over k. Conversely, if a" hjE K are such that (aI' "', an)->(b[, ..., h n ) is a speciali- zation over k, then there is a place f of K over k such that (b l , ..., b n ) is isomorphic to ({(ad, "', ((an)) (usually there are infinitely many such f's). K. Pseudo valuations A pseudovaluation rp of a ring A (not neces- sarily commutative) is a mapping of A into the set of nonnegative real numbers satisfying the following four conditions: (i) rp(a) = 0 if and only if a = 0; (ii) rp(ab) rp(a)rp(b); (iii) rp(a +b)  cp(a) + cp(b); and (iv) cp( -a)= cp(a). These con- ditions are weaker than those for multipli- cative valuations, but with them a topology 
1675 can be introduced into A as in Section D, with respect to which A becomes a topological ring. L. History The theory of valuations was originated by K. Hensel when he introduced p-adic numbers and applied them to number theory [1]. 1. Kiirschak (J. Reine Angew. Math., 142 (1913)) first treated the theory of multiplicative valu- ations axiomatically; it was then developed remarkably by A. Ostrowski (Acta Math., 41 (1918)). However, in their theory condition (iii) ( Section C) was given only in the case C = 1, thus excluding the normal valuation of an imaginary prime divisor in an algebraic num- ber field. A valuation with general C was introduced by E. Artin [3]. The theory of ad- ditive valuations was originated by W. Krull (J. Reine Angew. Math., 167 (1932)), although the concept of exponential valuations existed before. The theory of valuations is used to simplify tclass field theory and the theory of algebraic function fields in one variable. For these purposes, the notion of multiplicative valuations is sufficient ( 9 Algebraic Curves; 59 Class Field Theory). The idea is also used in the theory of normal rings and in alge- braic geometry, for both of which the con- cept of additive valuations is also necessary. Pseudovaluations were used by M. Deuring (Erg. Math., Springer, 1935) in the arithmetic of algebras. References [IJ K. Hensel, Theorie der algebraischen Zahlen, Teubner, 1908. [2J O. F. G. Schilling, The theory of valu- ations, Amer. Math. Soc. Math. Surveys, 1950. [3J E. Artin, Algebraic numbers and algebraic functions, Gordon & Breach, 1967. [4J O. Zariski and P. Samuel, Commutative algebra II, Van Nostrand, 1960. Also  references to 67 Commutative Rings. 440 (X.35) Variational Inequalities A. Introduction Variational inequalities arise when we con- sider extremal problems of functionals under unilateral constraints. Some problems in physics and engineering are studied by for- mulating them as elliptic, parabolic, and hyperbolic variational inequalities [I -8]. 440 C Variational Inequalities B. Stationary Variational Inequality Let D be a bounded domain in m-dimensional Euclidean space and r E L 2 (D) be a given real- valued function. Consider the variational problem of minimizing the following func- tional J with the argument function v: J[vJ= L1gradvl2dx-2 LfVdX. Here, we suppose the set of admissible func- tions to be the closed convex subset K={VEHJ(D)I v:s;Oa.e. in D} of the Hilbert space H (D) ( 168 Function Spaces). It can be shown by choosing a mini- mizing sequence that there exists a minimum value of J which is realized by a unique UE K. Since the stationary function U belongs to H (D), it can be shown that the boundary condition U I ('D = 0 is satisfied in the sense that the ttrace Y o uEH 1 / 2 (aD) ( 224 Interpolation of Operators) of U on aD vanishes a.e. on aD. In view of the fact that J[uJ :S;J[vJ is valid for any VE K, it can be verified that the stationary variational inequality -f..u- f:S;O } u:s;O ( - f..u - f) . u = 0 (1) is satisfied in D in the sense of differentiation of distributions ( 125 Distributions and Hyperfunctions). The problem (J) is a Dirichlet problem with obstacle. Moreover, we can prove the regularity ofuEH 2 (D) under an assump- tion of suitable smoothness for aD by estab- lishing the boundedness of the solutions u, in H 2 (D) of the penalized problems associated with (1): I +_ -f..u,+-u, -f (00), £ U,lrD=O. Here we note that the u, are the stationary functions of the ordinary variational problems of minimization in HJ (D) of the functionals J,[VJ= f I gradvl2dX-2 f fVdx+ f IV+12dX D D £ with the penalty term (the third term of the right-hand side of the equality above). We have thus found that the stationary variational inequality (J) is the Euler equation of a con- ditional problem of variation ( 46 Calculus of Variations). C. Variational Inequality of Evolution Let IjiEH 1 (D) be a given function on D such that IjilrD?O and f..IjiEL 2 (D). The variational 
4400 Variational Inequalities inequality of evolution i:u -:1-,1U O, 6t ulfi, e -,1u )-(U-Ifi)=O (t>O,xED), u(O, x) = a(x) u(t,x)lm=O (xED), (t>O) can be formulated as an abstract Cauchy problem ( 286 Nonlinear Functional Analy- sis X) du -EAu (t>O), dt u( +O)=a in a Hilbert space with a multivalued operator A = - i1rp, where r:rp is the subdifferential of the following lower semicontinuous proper convex function on the Hilbert space L 2 (D): rp(V) {  f IgradvI2dx ifvEHJ(D) and = 2 D +co otherwise. vl/J, Thus the solution u is given by the vector- valued function u(t)=etAa. Here etA is the tnonlinear semigroup generated by A ( 88 Convex Analysis, 378 Semigroups of Operators and Evolution Equations). D. Optimal Stopping Time Problem and Variational Inequalities Let {Xt}tO be an m-dimensional Brownian motion ( 45 Brownian Motion) and consider the problem of finding a tstopping time a that minimizes Jx[a]=Ex(J: {(Xt)dt) (xERm) under the restriction that 0  a  ar'D, where ar'D is the thitting time for the boundary 3D. Let us define u(x)=mjn Jx[a]. Then the tprinciple of optimality in dynamic programming gives the stationary varia- tional inequality (1) with,1 replaced by +,1, and we can show by the tDynkin formula that an optimal stopping time a is the hitting time for the set {xEQlu(x)=O} ( 127 Dynamic Programming). We can systematically discuss problems in mathematical programming and 1676 operations research by introducing quasi varia- tional inequalities, which are slight generaliza- tions of variational inequalities (-, 227 Inven- tory Control, 408 Stochastic Programming). The above-mentioned facts are applicable to general tdiffusion processes described by tsto- chastic differential equations ( 115 Diffusion Processes, 406 Stochastic Differential Equa- tions). We have thus found the relation free boundary problem ......variational inequality "'"optimal stopping time problem/ ( 405 Stochastic Control and Stochastic Filtering). E. Numerical Solution of Variational Inequalities Since the solution u of the variational inequal- ity (1) is the stationary function for the vari- ational problem, we can apply to the evalu- ation of the function u numerical methods based on the direct method of the calculus of variations ( 300 Numerical Methods). The tfinite element method, which can be regarded as a type of Ritz-Galerkin method, is exten- sively employed to calculate numerical solu- tions. In view of the unilateral constraint u  0, iteration methods, such as the Gauss- Seidel iteration method, are used with modifi- cations. An algorithm of relaxation with pro- jection is proposed in [3] ( 304 Numerical Solution of Partial Differential Equations). References [1] H. Brezis, Problemes unilateraux, J. Math. Pures Appl., 51 (1972),1-168. [2] V. Barbu, Nonlinear semigroups and dif- ferential equations in Banach spaces, Noord- hoff, 1976. [3] J.-L. Lions, Sur quelques questions d'ana- lyse, de mecanique et de controle optimal, Les Presses de I'Universite de Montreal, 1976. [4] J.-L. Lions and G. Stampacchia, Vari- ational inequalities, Comm. Pure Appl. Math., 20 (1967),493-519. [5] C. Baiocchi, Su un problema di frontiera libera connesso a questioni di idraulica, Ann. Mat. Pura Appl. 92 (1972),107-127. [6] G. Duvaut and J.-L. Lions, Les inequa- tions en mecanique et en physique, Dunod, 1972; English translation, Springer, 1975. [7] A. Friedman and D. Kinderlehrer, A one phase Stefan problem, Indiana Univ. Math. J., 24(1975),1005-1035. [8] R. Glowinski, J.-L. Lions, and R. Tremo- lieres, Analyse numerique des inequations variationelles I, II, Dunod, 1976; English translation, North-Holland, 1981. 
1677 441 (XX.3) Variational Principles A. General Remarks Among the principles that appear in physics are those expressed not in terms of differen- tial forms but in terms of variational forms. These principles, describing the conditions under which certain quantities attain ex- tremal values, are generally called variational principles. Besides Hamilton's principle in classical mechanics ( Section B) and Fermat's principle in geometric optics ( Section C), examples are found in telectromagnetism, trelativity theory, tquantum mechanics, tfield theory, etc. Independence of the choice of coordinate system is an important character- istic of variational principles. Originally these principles had theological and metaphysical connotations, but a variational principle is now regarded simply as a postulate that pre- cedes a theory and furnishes its foundation. Thus a variational principle is considered to be the supreme form of a law of physics. B. Mechanics In 1744 P. L. Maupertuis published an almost theological thesis, dealing with the principle of least action. This was the beginning of the search for a single, universal principle of me- chanics, contributions to which were made successively by L. Euler, C. F. Gauss, W. R. Hamilton, H. R. Hertz, and others. Let {qr} be the tgeneralized coordinates of a system of particles, and consider the integral of a function L(q" q" t) taken from time to to t I' If we compare the values of the integral taken along any arbitrary path starting from a fixed point Po in the coordinate space at time to and arriving at another fixed point PI at time t l , then the actual motion qr(t) (which obeys the laws of mechanics) is given by the condition that the integral is an textremum (tstationary value), that is, b J: Ldt = 0, provided that the function L IS properly chosen. This is Hamil- ton's principle, and L is the tLagrangian func- tion. In tNewtonian mechanics, the tkinetic energy T of a system of particles is expressed as a tquadratic form in qr' Furthermore, if the forces acting on the particles can be given by - grad V, where the potential V does not de- pend explicitly on qr, we can choose L = T- V. Also, for a charged particle in tspecial rela- tivity, we can take L = -m o c 2 (1-v 2 /C 2 )1/2_ e({! + e(v' A), where mo is the rest mass of the particle, e is the charge, v is the velocity (with v its magnitude), c is the speed of light in 441 E Variational Principles vacuum, and qJ and A are the scalar and vector potentials of the electromagnetic field, respectively. In general relativity theory, the motion of a particle can be derived from the variational principle b J ds = 0 (ds is the Riemannian line element). Hence, geometrically, the particle moves along a tgeodesic curve in 4-dimen- sional space-time. C. Geometric Optics The path of a light ray between two points Po and PI (subject to reflection and refraction) is such that the time of transit along the path among all neighboring virtual paths is an extremum (stationary value). This is called Fermat's principle. If the index of refraction is n, Fermat's principle can be expressed as b J:( n ds = 0 (ds is the Euclidean line element). The laws ofreflection and refraction of light, as well as the law of rectilinear propagation of light in homogeneous media, can be derived from this principle. D. Field Theory Not only the equations of motion of a system of particles, but also various field equations (tMaxwell's equations of the electromagnetic field, tDirac's equation of the electron field, the meson field equation, the gravitational field equation, etc.) can be derived from variational principles in terms of appropriate Lagrangian functions. In tfield theory the essential virtue of the variational principle appears in the fact that the properties of various possible fields as well as conservation laws can be systematically discussed by assuming relativistic invariance and gauge invariance of the Lagrangian func- tions adopted. In particular, for an electro- magnetic field in vacuum, the Lagrangian function density is L = (H 2 - E 2 )/2, and the integration is carried out over a certain 4- dimensional domain. E. Quantum Mechanics If H is the tHamiltonian operator for any quantum-mechanical system, the eigenfunc- tion Iji can be determined by the variational principle b f IfHljidT=o, with f IfljidT= 1, where If is the complex conjugate of Iji and dT is the volume element. Based on this varia- tional principle, the tdirect method of the calculus of variations is often employed for 
441 F Variational Principles an approximate numerical calculation of the energy eigenvalues and eigenfunctions. In par- ticular, by restricting the functional form of IjJ to the product of one-body wave functions, we can obtain Hartree's equation. A further suitable symmetrization of IjJ leads to Fock's equation F. Statistical Mechanics Let qJ be a statistical-mechanical state of a system, and let S(cp) and E(cp) be the state's entropy and energy (mean entropy and mean energy for an infinitely extended system); T is the thermodynamical temperature, and f(cp) = E(cp) - TS(cp) is the free energy. Then the equilibrium state for T 0 is determined as the state cp that gives the minimum value of f(cp) (maximum for T<O). References [IJ R. Courant and D. Hilbert, Methods of mathematical physics, Interscience, I, 1953; II, 1962. [2J B. L. Moiseiwitsch, Variational principles, Interscience, 1966. [3J R. Weinstock, Calculus of variations, with applications to physics and engineering, McGraw-Hill, 1952. Also  references to 46 Calculus of Variations. 442 (VI.12) Vectors A. Definitions The vector concept originated in physics from such well-known notions as velocity, acceler- ation, and force. These physical quantities are supplied with length and direction; they can be added or multiplied by scalars. In the Euclid- ean space E" (or, in general, an taffine space), a vector a is represented by an oriented seg- ment pq. Two oriented segments M and  are considered to represent the same vector a if and only if the following two con- ditions are satisfied: (I) The four points p" qJ, Pl, ql lie in the same plane n. (2) PI ql/ /Plq2 and P,ql/ /q,q2' Hence a vector in En is an equivalence class of oriented segments pq, where the equivalence relation J517[;  P27h. is defined by the two conditions just given. Hereafter, we denote the vector by [ pq], or simply pq. The points p and q are called the initial point and terminal point of the vector pq. 1678 Given a vector a = pq and a real number )", we define the scalar multiple Aa a. the vector pI', where r is the point on the straight line containing both p and q such that the ratio [ pr : pqJ is equal to A (if p = q, then we put r = p). The operation (A, a)-->Aa is call,d scalar multiplication. Given two vectors a = pq and b = qS, the vector c = pS is called the sum of a and b and is denoted by c = a + b. The vector ptJ = 0 is caned the zero vector. If a = pq, we put - a = qp. Scalar multiplication and addition of vec- tors satisfy the following seven conditions: (1) a+b=b+a (commutative law); (2) a+(b+c)= (a + b) +c (associative law); (3) a + 0 = a; (4) for each a there is - a such that a + ( - a) = 0; (5) A(a + b) = ).a + Ab, ()" + )a = Aa + a (dis- tributive laws); (6) A(a) = ().)a (associative law for scalar multiplication); and (7) la=a. Hence the set V of all vectors in En forms a treallinear space. Sometimes, a se,t satisfying (1 )-(7), that is, by de fin ition, a linear space, is called a vector space, and its elements are called vectors. The pair consisting of a vector pq and a specific initial point p of pq is sometimes called a fixed vector. An illustration of this is given by the force vector with its initial point being where the force is applied. By contrast, a vec- tor pq is sometimes called a free ector. If we fix the origin 0 in En, then for any point p in E", the vector op is called the position vector of p. If two vectors a = op and b = oq are tlinearly dependent, they are sometimes said to be collinear. If there vectors a = op, b = oq, and c = Of are linearly dependent, they are some- times said to be coplanar. If a set of vectors e l , ... ,en forms a tbasis of a vector space V, then the vectors e j are called fundamental vectors III V. Each vector aE V is uniquely expressed as a =  IXje j (lJ(iE R). We call (IJ( 1, . .. , IXn) the components of I:he vector a with respect to the fundamental vectors e 1 , .-. ,en. B. Inner Product In the Euclidean space En, the length of the line segment pq is called the absolute value (or magnitude) of the vector a = pq and is denoted by lal. A vector oflength one is called a unit vector. For two vectors a = op and b= oq, the value (a, b) = lallbl cos f) is called the inner product (or scalar product) of a and b, where () is the angle Lpoq. Instead of (a, bl, the nota- tions a. b, or ab are also used. If neither vec- tor a nor vector b is equal to 0, then (a, b) = 0 implies L poq = n/2, that is, the orthogonality of the two vectors op and oq. If W take an 
1679 torthonormal basis (e l , ..., en) in En (i.e., a set of fundamental vectors with leil = 1, (e i , e j ) = 0 (i # J)), the inner product of vectors a = L Ct.je i , b= L fiiei is equal to L71 Ct.j]i' The inner product has the following three properties (i) (x, x))o 0 and is zero if and only if x = 0; (ii) (x, y)=(y, x); (iii) (XI +X2, y)=(x l , y) +(x 2 , y), (Ct.x,y)= Ct.(x, y) (Ct.ER). Similar linearity holds for y. Generally, an R-valued tbilinear form (x, y) on a linear space V satisfying the previous three conditions is also called an inner prod- uct If a linear space V is equipped with an inner product, the space is caned an inner product space ( 256 Linear Spaces H; 197 Hilbert Spaces). If V is an inner product space, the absolute value Ixl of x E V is defined to be J(x,x) . c. Vector Product In the 3-dimensional Euclidean space E 3 , we take an orthonormal basis e" e2' e 3 . Let a and b be vectors in E 3 whose components with respect to e l , e 2 , e 3 are (Ct. I , Ct. 2 , Ct. 3 ), (fil' fi2' fi3)' The vector ! Ct.2 Ct.3 1 ! Ct.3 Ct.1 ! 1 Ct.1 e l + e 2 + fi2 fi3 fi3 fil fit Ct. 2 ! e 3 , fi2 which is symbolically written as e I e 2 e 3 Ct. 1 Ct. 2 Ct. 3 , fi I fi2 fi3 is called the exterior product or vector product of a and b and is denoted by [a, b] or a x b. The vector [a, b] is determined uniquely up to its sign by a and b and is independent of the choice of the orthonormal basis. Suppose that we have a=op, b=oq. Then I[a, b]1 = lal'lbl sinO, where 0 = Lpoq. Also I [a, b] I is equal to the area of the parallel- ogram determined by a and b. To illustrate the orientation of [a, b], we sometimes use the idea of a turning screw. That is, the direction of a right-handed screw advancing while turning at o from p to q (within the angle less than 180") coincides with the direction of [a, b] (Fig. 1). The exterior product has the following three properties: (1) [a, b] = - [b, a] (antisymmetric law); (2) Ua, b] = }"[a, b] (associative law for (IX" / Fig. 1 442 D Vectors scalar multiplication); (3) [a, b+c] = [a, b] + [a,c] (distributive law). The vector product does not satisfy the associative law, but it does satisfy the tJacobi identity [a, [b, c]] + [b, [c,a]]+[c,[a,b]]=O. The vector [a,[b,c]] is sometimes called the vector triple product, and for this we have Lagrange's formula [a, [b,c]] =(a,c)b(a, b)c. Let a, b, c be vectors in E 3 whose compo- nents with respect to an orthonormal funda- mental basis are (Ct. I , Ct.2, Ct.3)' (fil, fi2, fi3), and (Yt. 12' Y3)' Then (a, [b, c]) =(b, [c, a]) = (c, [a, b]) = [a. b, c], and the common value is equal to the determinant of the 3 x 3 matrix ( Ct. I fil 'J II Ct.2 Ct. 3 ) fi2 fi3 . i'2 Y3. The value denoted by [a,b,c] is called the scalar triple product of a, b, c and is equal to the volume of the parallelotope whose three edges are a=op, b=oq, and c=or with com- mon initial point o. The triple a, b, c is called a right-hand system or a left-hand system ac- cording as [a, b, c] is positive or negative. We have [a, b, c] = 0 if and only if a, band care coplanar. (For the texterior product of vectors in En and the concept of tp-vectors  256 Linear Spaces 0.) D. Vector Fields In this section we consider the case of a 3- dimensional Euclidean space E 3 (for the gen- eral case  105 Differentiable Manifolds). A scalar-valued or a vector-valued function defined on a set D in E 3 is called a scalar field or a vector field, respectively. The continuity or the differentiability of a vector field is de- fined by the continuity or the differentiability of its components. For a differentiable scalar field f(x, y, z), the vector field with the components (of/ex, af/ey, ef/oz) is called the gradient of f and is denoted by grad! For a differentiable vec- tor field V (x, y, z) whose components are (u(x, y, z), v(x, y, z), w(x, y, z)), the vector field with components ( OW ov au ow av ou ) cy - oz ' oz - ox ' ox - oy is called the rotation (or curl) of V and is de- noted by rot V (or curl V). Also, for a differenti- able vector field V, the scalar field defined by au/ex + Dv/oy + ow/Dz is called the divergence of V and is denoted by divV. Utilizing the vector operator V having differential operators (a/ex, %y, a/oz) as its components, we may write simply gradf=Vf, divV=(V, V), rotV= 
442 Ref. Vectors [V, V]. The symbol V is called nabla, atled (inverse of delta), or Hamiltonian. A vector field V with rot V = 0 is said to be irrotational, (lamellar, or without vortex). A vector field V with div V = 0 is said to be sole- noidal (or without source). Thus gradfis irro- tational and rot V is solenoidal. In a small neighborhood or in a tsimply-connected do- main, an irrotational field is a gradient, a solenoidal field is a rotation, and an arbitrary vector field V is the sum of these two kinds of vector fields: V = grad rp + rot u (Helmholtz theorem); the function rp is called the scalar potential of V, and the vector field u is called the vector potential of V. Furthermore, the operator V 2 = VV = div grad = a 2 jex 2 +a 2 j8 y 2 + (1 2 jaz 2 is called the Laplace operator (or Laplacian) and is denoted by ,1. A function that satisfies ,1rp = 0 is called a tharmonic function. Locally, an irrotational and solenoi- dal vector field is the gradient of a harmonic function. If A is a vector field whose compo- nents are (rp" rp2, rp3) (i.e., A(v) = (rp, (v), rp2 (v), rp,(v))), we can let ,1 operate on A by setting ,1A=(,1rp",1rp2,,1rp3)' We then have ,1A= V 2 A = grad div A - rot rot A. Suppose that we are given a vector field V and a curve C such that the vector V(p) is tangent to the curve at each point pE C. The curve C is the tintegral curve of the vector field V and is called the vector line of the vector field V. The set of all vector lines intersecting with a given closed curve C is called a vector tube. Given a closed curve C and a vector field V, the tcurvilinear integral J (V, ds) (where ds is the line element of C) is called the circulation (of V) along the closed curve C. A vector field is irrotational if its circulation along every closed curve vanishes; the converse is true in a simply connected domain. Further, let V n be the tnormal component of a vector field V with respect to a surace S, and let dS be the volume element of the surface. We put ndS = dS, where n is the unit normal vector in the positive direction of the surface S. Then the tsurface integral J vndS = J (V, dS) is called the vector flux through the surface S. A vector field whose vector flux vanishes for every closed surface is solenoidal. (For the corre- sponding formulas  94 Curvilinear Integrals and Surface Integrals. For generalizations to higher-dimensional manifolds  105 Dif- ferentiable Manifolds; 194 Harmonic Integrals; Appendix A, Table 3.) References [1J H. Weyl, Raum, Zeit, Materie, Springer, fifth edition, 1923; English translation, Space, time, matter, Dover, 1952. 1680 [2J S. Banach, Mechanics, Warsaw, 1951. [3J H. Flanders, Differential forms, with ap- plications to the physical sciences, Academic Press, 1963. [4J H. K. Nickerson, D. C. Spencer, and N. E. Steenrod, Advanced calculus, Van Nostrand, 1959. 443 (XII.8) Vector-Valued Integrals A. General Remarks Integrals whose values are elements (or sub- sets) ofttopologicallinear spaces are gen- erall y called vector-valued integrals or vec- tor integrals. As in the scalar case, there are vector-valued integrals of Riemann type ( 37 Banach Spaces K) and of Lebesgue type. In this article we consider only the latter. There are cases where integrands are vector-valued, where measures are vector-valued, and where both are vector-valued. The methods of inte- gration are also divided into the strong type, in which the integrals are defined by means of the original topology of the topological linear space X, and the weak type, in which they are reduced to numerical integrals by apply- ing continuous linear functionals on X. Com- bining these we can define many kinds of integrals. Historically, D. Hilbert's tspectral resolution is the first example of vector-valued integrals, but the general theory of vector-valued inte- grals started only after S. Bochner [IJ defined in 1933 an integral of strong type for functions with values in a Banach space with respect to numerical measures. Then G. Birkhoff [2J defined a more general integral by replacing absolutely convergent sums with uncondi- tionally convergent sums. At approximately the same time, N. Dunford introduced inte- grals equivalent to these. Later, R. S. Phillips (Trans. Amer. Math. Soc., 47 (1940)) general- ized the definition to the case where values offunctions are in a tlocally convex topolog- icallinear space, and C. E. Rickart (Trans. Amer. Math. Soc., 52 (1942)) to the case where functions take subsets of a locally convex topological linear space as their values. The theory of integrals of weak type for functions with values in a Banach space and numerical measures was constructed hy 1. M. Gel'fand [3], Dunford [4J, B. J. Pettis [5], and others (1936-1938). N. Bourbaki [6J dealt with the case where integrands take values in a locally convex topological linear space. As for inte- grals of numerical functions by vector-valued 
1681 measures, a representative of strong type inte- grals is the integral of R. G. Bartle, Dunford, and 1. T. Schwartz [7J (1955). Weak type integrals have been discussed by Bourbaki [6J, D. R. Lewis (Pacific J. Math., 33 (1970)), and I. Kluvanek (Studia Math., 37 (1970)). The bilinear integral of Bartle (Studia Math., 15 (1956)) is typical of integrals in the case where both integrands and measures are vector- valued. For interrelations of these integrals  the papers by Pettis and Bartle cited above and T. H. Hildebrandt's report in the Bulletin of the American Mathematical Society, 59 (1953). Since the earliest investigations [1- 3J the main aim of the theory of vector-valued inte- grals has been to obtain integral representa- tions of vector-valued (set) functions and various linear operators [8]. However, there is the fundamental difficulty of the nonvalidity of the Radon-Nikodym theorem. Whatever de- finition of integrals we take, the theorem does not hold for vector-valued set functions un- conditionally. Many works sought conditions for functions, operators, or spaces such that the conclusion of the theorem would be re- stored; the works of Dunford and Pettis [9J and Phillips (Amer. J. Math., 65 (1943)) marked a summit of these attempts. Later, after A. Grothendieck's investigations (1953-1956), this problem began to be studied again, begin- ning in the late 1960s, by many mathemati- cians ( J. Diestel and 1. J. Uhl, Jr., Rocky Mountain J. Math., 6 (1976); [10J). Recently, integrals of multi valued vector- valued functions have also been employed in mathematical statistics, economics, control theory, and many other fields. Some contri- butions are, besides Rickart cited above. G. B. Price (Trans. Amer. Math. Soc., 47 (1940), H. Kudo (Sei. Rep. Ochanomizu Univ., 4 (1953)), H. Richter (Math. Ann., 150 (1963)), R. J. Au- mann [11], G. Debreu [12J, and M. Huku- hara (Funkcial. Ekvac., 10 (1967)). Further- more, C. Castaing and M. Varadier [13J have defined weak type integrals of multivalued functions and introduced many results con- cerning them. In the following we shall give explanations of typical vector-valued integrals with values in a Banach space only. B. Measurable Vector-Valued Functions Let x(s) be a function defined on a ta-finite measure space (S, S, J1) with values in a Banach space X. This is called a simple func- tion or finite-valued function if there exists a partition of S into a finite number of mutually disjoint measurable sets AI' A2' ..., An in each of which x(s) takes a contant value c j . Then 443C V ector-Valued Integrals x(s) can be written as j'1 cjx4s), where x4s) is the tcharacteristic function of Aj. A func- tion x(s) is said to be measurable or strongly measurable if it is the strong limit of a se- quence of simple functions almost everywhere, that is, limn<X) Ilxn(s)-x(s)11 =0 a.e. Then the numerical function Ilx(s)11 is measurable. If J1 is a tRadon measure on a compact Hausdorff space S, then the measurable functions can be characterized by tLuzin's property (-" 270 Measure Theory I). A function x(s) is said to be scalarly measur- able or weakly measurable if the numerical function <x(s), x'> is measurable for any tcon- tinuous linear functional x' E X'. A function x(s) is measurable if and only if it is scalarly mea- surable and there are a tnull set Eo c S and a tseparable closed subspace Y c X such that X(S)E Y whenever set Eo (Pettis measurability theorem). C. Bochner Integrals A measurable vector-valued function x(s) is said to be Bochner integrable if the norm Ilx(s)11 is tintegrable. If x(s) is a Bochner inte- grable simple function Lcjx4s), then its Boch- ner integral is defined by 1 x(s)dJ1= LJ1(A j )c j . F or a general Bochner integrable func- tion x(s) there exists a sequence of simple functions satisfying the following condi- tions: (i) Iimn<X) Ilxn(s)-x(s)11 =0 a.e. (ii) limnSs Ilxn(s)-x(s)11 dJ1=O. Then ]sx n (s)dJ1 converges strongly and its limit does not de- pend on the choice of the sequence {xn(s)}. We call the limit the Bochner integral of x(s) and denote it by Ssx(s)d or by (Bn) Ssx(s)dJ1 to distinguish it from other kinds of integrals. A Bochner integrable function on S is Boch- ner integrable on every measurable subset of S. The Bochner integral has the basic prop- erties of Lebesgue integrals, such as linear- ity, tcomplete additivity, and tabsolute con- tinuity, with absolute values replaced by norms. tLebesgue's convergence theorem and tFubini's theorem also hold. However, the Radon-Nikodym theorem does not hold in general ( Section H). Let T be a tc\osed linear operator from X to another Banach space Y. If both x(s) and Tx(s) are Bochner integrable, then the integral of x(s) belongs to the domain of T and T(l X(S)d)= 1 Tx(s)d. If, in particular, T is bounded, then the as- sumption is always satisfied. If  is the tLe- 
443D V ector-Valued Integrals besgue measure on the Euclidean space Rn, then Lebesgue's differentiability theorem holds for the Bochner integrals regarded as a set function on the regular closed sets ( 380 Set Functions D). D. Unconditionally Convergent Series Let Lr1 x j be a series of elements x j of a Banach space X. It is said to be absolutely convergent if L Ilx)1 < 00. It is called uncon- ditionally convergent if for any rearrangement (j, the resultmg senes L x'(j) converges strongly. Then the sum does not depend on rx. Clearly, an absolutely convergent series is uncondition- ally convergent. If X is the number space or is finite-dimensional, then the converse holds. However, if X is infinite-dimensional, there is always an unconditionally convergent series which is not absolutely convergent (Dvoretzky-Rogers theorem). A series L x j is unconditionally convergent if and only if each subseries converges weakly (Orlicz-Pettis theorem). If L x j is an uncon- ditionally convergent series, then L_<Xj' x') converges absolutely for any continuous linear functional x' E X'. If X is a Banach space con- taining no closed linear subspace isomorphic to the tsequence space Co, then conversely a series LX j converges unconditionally when- ever LI<xj,x')I< 00 for any X'EX' (Bessaga- PelczyiIski theorem). A Banach space that is tsequentially complete relative to the weak topology, such as a treflexive Banach space, and a separable Banach space that is the dual of another Banach space, such as I, and the tHardy space HI (Rn), satisfy the assump- tion, while Co' 1%, and L%(O) for an infinitely divisible 0 do not. The totality of absolutely convergent series (resp. unconditionally con- vergent series) in X is identified with th ttopo- logical tensor product I, @ X (resp. I, @ X) (Grothendieck). E. Birkhoff Integrals We say that a series L Bj of subsets of X con- verges unconditionally if for any XjE Bj the series L x j converges unconditionally. Then L Bj denotes the set of such sums. A vector- valued function x(s) is said to be Birkhoff integrable if there is a countable partition : S= Uj-x>] Aj (A j E6, AjnA k = 0 (j¥-k), (A)< 00) such that the set x(A) of values on Aj are bounded and L(A)x(A) converges uncon- ditionally and if the sum converges to an ele- ment of X as the partition is subdivided. The limit is called the Birkhoff integral of x(s) and is denoted by (Bk) rsx(s)d or simply by 1682 Ss x(s) d. A Birkhoff integrable function is Birkhoff integrable on any measurable set. The Birkhoff integral has, as a set function, com- plete additivity and absolute continuity in . It is linear in the integrand but Fubini's theo- rem and the Radon-Nikodym theorem do not hold. A Bochner integrable function is Birk- hoff integrable, and the integrals coincide. The converse does not hold. F. Gel'fand-Pettis Integrals A scalar1y measurable function x(s) is said to be scalarly integrable or weakly integrable if for each x' EX', <x(s), x') is integrable. Then the linear functional x* on X' defined by 1 <x(s), x') d = <x', x*) is called the scalar integral of x(s). Gel'fand [3J and Dunford [4J proved that x* belongs to the bidual X". Hence scalarIy integrable func- tions are often called Dunford integrable and the integrals x* the Dunford integrals. More generally, Gel'fand [3J showed that if x'(s) is a function with values in the dual X' of a Banach space X such that <x, x'(s) is inte- grable for any x E X, then there is an x' E X' satisfying 1 <x,x'(s)d=<x,x'). This element is sometimes called 1he Gel'fand integral of x'(s). A scalarly integrable func- tion x(s) is scalarly integrable on any measur- able subset A. If the scalar integral is always in X, i.e., for each A there is an x A E X such that I <x, x'(s) d = <XA' x'), x' EX', then x(s) is said to be Pettis integrable or Gel'fand-Pettis integrable and X A IS called the Pettis integral or Gel'fand-Pettis integral on A and is denoted by (PJJAx(s)d or simply by SA x(s) d. The Pettis integral has complete additivity and absolute continuity as a set function, similarly to the Birkhoff integral. Again, Fubini's theorem and the Radon- Nikodym theorem do not hold. The scalar integral on measurable sets of a scalarly inte- grable function x(s) is completely additive and absolutely continuous with respect to the tweak* topology of X" as the dual to X'. It is completely additive or absolutely continuous in the norm topology if and only if x(s) is Pettis integrable (Pettis [5]; [10J) If x(s) is Pettis integrable and f(s) is a numerical func- tion in L%(S), then the product f(s)x(s) is Pettis integrable. Birkhoff integrable functions are Pettis integrable, and the integrals coin- 
1683 cide. Conversely, if a measurable function is Pettis integrable, then it is Birkhoff integrable. When X satisfies the Bessaga-Pe1czynski con- dition ( Section D), a measurable scalariy integrable function is Pettis integrable. G. Vector Measures Let <I> be a set function defined on a com- pletely additive class 2. of subsets of the space S and with values in a Banach space X. It is called a finitely additive vector measure (resp. a completely additive vector measure or simply a vector measure) if <I>(At UA 2 )=<I>(Ad+<I>(A 2 ) whenever At and A 2 E 2. are disjoint (resp. <I>(UT;l A) = Lr=t <I>(A) in the norm topology for all AjE6 such that A j nA k =0 (ji=k)). We remark that the latter sum always converges unconditionally. A set function <I> is completely additive if and only if <<I>(A), x') is completely additive for all x' E X' (Pettis complete additiv- ity theorem). Let <I> be a finitely additive vector measure and £ be a measurable set. The total variation of <I> on £ and the semivariation of <I> on £ are defined by n V(<I>)(£)=sup I II<I>(A)II jt and 1I<I>II(£)=suplljtl Xj<l>(A)II, respectively, where the suprema are taken over all finite partitions of £: £ = U Aj (AjE 6, Aj n Ak = 0(j i=k)) and all numbers x j with Ix)  1. If V(<I>)(S) < CfJ, then <I> is called a measure of bounded variation. II<I>II(S) < ex if and only if sup{II<I>(A)IIIAE6}<::tJ. Then <I> is said to be bounded. The function V(<I>)(£) of £ is finitely additive but II<I>II(E) is only subaddi- tive: II<I>II(AUB)II<I>II(A)+ II<I>II(B). If <I> is a vector measure of bounded variation, then V(<I» is a positive measure. Every vector mea- sure is bounded. A completely additive vector measure on a tfinitely additive class i! can uniquely be extended to a vector measure on the completely additive class 6 generated by i! (Kluvanek). Let fl. be a positive measure and <I> be a vector measure. Then we have <I>(A)-->O as fl.(A)-->O if and only if <I> vanishes on every A with fl.(A) =0. Then <I> is said to be absolutely continuous with respect to fl.. For every vector measure <I> there is a measure fl. such that II<I>II(A)-->O as fl.(A)-->O and that Ofl.(A) Ij<l>II(A) (Bartle, Dunford, and Schwartz). As a set function, the Bochner integral is a vector measure of bounded variation and the Pettis integral is a bounded vector meaSure. Both 443H V ector-Valued Integrals (I) are absolutely continuous with respect to the integrating measure. Let X be Lp(O, 1) for I p ex, and define <1>(£) for a Lebesgue measurable set E to be the characteristic func- tion of E. If p= 1, <I> is a vector measure of bounded variation. If 1 <p< ex. <I> is a bounded vector measure, but it is not of bounded varia- tion on any set £ with fl.(£) >0. If p= 00, then <I> is no longer completely additive. These vec- tor measures are absolutely continuous with respect to the Lebesgue measure, but they cannot be represented as the Bochner integral or the Pettis integral. Let <I> be a vector measure on 6. An 6- measurable numerical function .f(s) is said to be <I>-integrable if there exists a sequence of simple functions .fn(s) such that f,,(s)--> f(5) a.e. and that for each EE 6, IEf,,(s)d<l> converges in the norm of X. Then the limit is indepen- dent of the choice of in. It is called the Bartle- Dunford-Schwartz integral and is denoted by h{(s)d<l>. Lebesgue's convergence theo- rem holds for this integral. If <I> is absolutely continuous with respect to the measure fl., then every (ELcfo(fl.) is <I>-integrable, and the operator that maps f to Is! d<l> is continuous with respect to the weak* topology in LCX"(fI) and the weak topology of X. Hence it is a tweakly compact operator. In particular, the range of a vector measure is relatively compact in the weak topology [7]. If <I> is the vector measure of the Pettis integral of a vector- valued function x(s), then the above integral is equal to the Pettis integral of f(s)x(s). A vector measure <I> is said to be nonatomic if for each set A with <I>(A) i=0 there is a subset B of A such that <I>(B) i= 0 and <I>(A "B) i= O. If X is finite-dimensional, then the range of a non- atomic vector measure is a compact convex set (Lyapunov convexity theorem). This has been generalized to the infinite-dimensional case in many ways, but the conclusion does not hold in the original form ( Kluvanek and G. Knowles [15J; [IOJ). (2) H. The Radon-Nikodym Theorem As the above examples show, the tRadon- Nikodym theorem does not hold for vectr measures in the original form. From 1967 to 1971, M. Metivier, M. A. Rieffel, and S. Moedomo and Uhl improved the classical result of Phillips (1943) and proved the follow- ing theorem. Radon-Nikodym theorem for vector mea- sures. The following conditions are equivalent for fl.-absolutely continuous vector measures <I> defined on a finite measure space (S, 6, fl.): (i) There is a Pettis integrable measurable fune- 
443 I V ector-Valued Integrals tion x(s) such that <I>(A)=(P) t x(s)d)1. (ii) For each [; > 0 there is an E E 6 such that )1(S......E)<[; and such that {<I>(A)/)1(A)IAE6, AcE} is relatively compact. (iii) For each EE 6 with J1(E) > 0 there is a subset F of E with J1(F) >0 such that {<I>(A)/)1(A)IAE6,A cF} is relatively weakly compact. Then <I> is of bounded variation if and only if x(s) is Boch- ner integrable. On the other hand, since Birkhoff and Gel'fand it has been known that for special Banach spaces X every )1-absolutely continu- ous vector measure of bounded variation with values in X can be represented as a Boch- ner integral with respect to )1. Such spaces are said to have the Radon-Nikodym property. Separable dual spaces (Gel'fand, Pettis; Dun- ford and Pettis), reflexive spaces (Gel'fand, Pettis, Phillips), and 11 (0),0 arbitrary, etc., have the Radon-Nikodym property, while L",(O, 1) (Bochner), Co (1. A. Clarkson), LdO) on a nonatomic 0 (Clarkson, Gel'fand), and C(O) on an infinite compact Hausdorff space 0, etc., do not. Gel'fand proved that LdO, 1) (and co) is not a dual by means of this fact. From 1967 to 1974, Riefell, H. B. Maynard, R. E. Huff, and W. J. Davis and R. P. Phelps succeeded in characterizing geometrically the Banach spaces with the Radon-Nikodym property. We know today that the following conditions for Banach spaces X are equivalent [10]: (i) X has the Radon-Nikodym property. (ii) Every separable closed linear subspace of X has the Radon-Nikodym property. (iii) Every function f: [0,1]--> X of bounded variation is (strongly or weakly) differentiable a.e. (iv) For any finite measure space (S, 6,)1) and bounded linear operator T:LdS)-->X, there is an essentially bounded measurable function x(s) with values in X such that T{= If(S)X(S)d)1, fEL,(S). (v) Each nonvoid bounded closed convex set K in X is the tclosed convex hull of the set of its strongly exposed points, where a point Xo E K is called a strongly exposed point of K if there is an x' E X' such that <xo, x') > <x, x') for all xEK......{xo} and that any sequence xnEK with Iim<xn, x') = <x o , x') converges to Xo strongly. A Banach space X is said to have the Krein- MiI'man property if each bounded closed convex set in X is the closed convex hull of its textreme points. A Banach space X with the Radon-Nikodym property has the Krein- MiI'man property (J. Lindenstrauss). If X is a dual space, then the converse holds (Huff and P. D. Morris). A Banach space with the Krein- 1684 Mil'man property clearly has no closed linear space isomorphic to Co, but there are Banach spaces that do not contain Co and do not have the Krein-Mil'man property. The' dual X of a Banach space Y has the Radon-Nikodym property if and only if the dual or every sepa- rable closed linear subspace of Y is separable (Uhl, C. Stegall). I. Integrals of Multivalued Vector Functions Let r(s) be a multivalued functiotl defined on a a-finite complete measure space (S, 6,)1) with values that are nonempty dosed subsets of a separable Banach space X. The inverse image of a subset E of X under r(s) is, by definition, the set of all s such that r(s) n E 'i 0. r(s) is said to be measurable or strongly measurable if the inverse image of each open set in X under r(s) belongs to 6. Let I!3(X) be the tBorel field of X, and 6 x I!3(X) be the product completely additive class, that is, the smallest completely additive class containing all direct products A x B of AE 6 and BEI!3(X). Thetl the mea- surability of r(s) is equivalent to each of the following: (i) The graph {(S,X)IXET(s),SE6} of r(s) belongs to 6 x I!3(X). (ii) The inverse image of every Borel set in X under r(s) be- longs to 6. (iii) For each x E X, the distance d(x, r(s))=inf{ Ilx- ylll YEr(S)} between x and r(s) is measurable as a function on S. A measurable function x(s) on S with values in X is called a measurable selection of r(s) if x(s) is in r(s) for all s. (X being separable, we need not discriminate between strong and weak measurability.) The measurability of r(s) is also equivalent to the following impor- tant statement on the existence or measurable selections of r(s): (iv) There are a countable number of measurable selections {xn(s)} of r(s) such that the closure of the set {x n ( s) I n = 1,2,...} coincides with r(s) for all SES. r(s) is said to be scalarly measurable or weakly measurable if the support function b'(x', r(s)) = sup{ <x, x') IXE r(s)} is measurable on S for all x' E X'. The strong measurability of r(s) clearly implies the weak one. If the values of r(s) are nonempty weakly compact convex sets, then the measurabilities are Iquivalent. Hereafter we shall assume that ns) takes the values in the weakly compact convex sets. If the support function b'(x', r(s)) is integrable on S for all x' E X', then r(s) is said to be scalarly integrable. Then the scalar integral of r(s) is defined to be the set in X" of all scalar inte- grals of its measurable selections, i.e., I r(s)d)1 = {I x(s) d)11 x(s) is a measurable selection of r(S)}. 
1685 If l(r(s)II =sup{llxlllxEr(S)} is integrable, then every measurable selection is Bochner integrable and the integral Is r(s) dJ1 becomes a nonempty weakly compact convex set in X. When the values of r(s) are nonempty compact convex sets, there is another method, by G. Debreu, of defining the integral. Let i! be the class of a11 nonempty compact con- vex sets in X and 15 be the Hausdorff met- ric, i.e., for K 1 and K 2 Ei! define b(K"K 2 )= max[sup{d(x, K 2 )1 XE K 1 }, sup{d(x, K 1 )! XE K 2}]. Further, for K 1, K 2 E i! and a  0 define the sum and the nonnegative scalar multiple by K 1 +K 2 ={X 1 +x2Ix1EK1,X2EK2} and a.K1 ={lXxlxEK 1 }, respectively. Then i! en- dowed with the Hausdorff metric and the above addition and scalar multiplication is isometrica11y embedded in a closed convex cone in a separable Banach space Y by the Radsrom embedding theorem (Proc. Amer. Math. Soc., 3 (1952)). Let rp be this isometry. Then the (strong) measurability and the (strong) integrability of r(s) are defined by the measur- ability and the Bochner integrability of the Y- valued function cp(r(s)), respectively, and its (strong) integral as the inverse image of the Bochner integral of cp(r(s)) under rp: 1 r(s) dJ1 = rp -1 (1 rp(r(S))d J1 ). This definition of integral for strongly measur- able r(s) is shown to be compatible with that mentioned before. It is clear by the definition that the integral value in this case is a non- empty compact convex set and that most prop- erties of Bochner integrals also hold for this integral. References [lJ S. Bochner, Integration von Funktionen, deren Werte die Elemente eines Vektorraumes sind, Fund. Math., 20 (1933),262-276. [2J G. Birkhoff, Integration of functions with values in a Banach space, Trans. Amer. Math. Soc., 38 (1935), 357-378. [3J I. Gel'fand, Abstrakte Funktionen und !ineare Operatoren, Mat. Sb., 4 (46) (1938), 235286. [4J N. Dunford, Uniformity in linear spaces, Trans. Amer. Math. Soc., 44 (1938),305-356. [5J B. J. Pettis, On integration in vector spaces, Trans. Amer. Math. Soc., 44 (1938), 277-304. [6J N. Bourbaki, Elements de mathematique, Integration, Hermann, ch. 6, 1959. [7J R. G. Bartle, N. Dunford, and J. Schwartz, Weak compactness and vector measures, Canad. 1. Math., 7 (1955), 289-305. 444 Ref. Viete, Franftois [8J N. Dunford and J. T. Schwartz, Linear operators I, Interscience, 1958. [9J N. Dunford and B. J. Pettis, Linear oper- ations on summable functions, Trans. Amer. Math. Soc., 47 (1940), 323-392. [10J 1. Diestel and J. 1. Uhl, Jr., Vector mea- sures, Amer. Math. Soc. Math. Surveys 13 (1977). [11J R. J. Aumann, Integrals of set-valued functions, 1. Math. Anal. Appl., 12 (1965), 1- 22. [12J G. Debreu, Integration of correspon- dences, Proc. Fifth Berkeley Symp. Math. Statist. Probab., II, pt. I (1967), 351-372- [13J C. Castaing and M. Valadier, Convex analysis and measurable multifunctions, Springer, 1977. [14J N. Dinculeanu, Vector measures, Per- gamon, 1967. [15J I. Kluvanek and G. Knowles, Vector measures and control systems, North-Ho11and, 1975. 444 (XXI.42) Viete, Franois Fran<;ois Vide (1540December 13,1603) was born in Fontenay-le-Comte, Poitou, in western France. He served under Henri IV, first as a lawyer and later as a political advisor. His mathematics was done in his leisure time. He used symbols for known variables for the first time and established the methodology and principles of symbolic algebra. He also sys- tematized the algebra of the time and used it as a method of discovery. He is often ca11ed the father of algebra. He improved the methods of solving equations of the third and fourth de- grees obtained by G. Cardano and L. Ferrari. Realizing that solving the algebraic equation of the 45th degree proposed by the Belgian mathematician A. van Roomen can be reduced to searching for sin(aj45) knowing sin a, he was able to solve it almost immediately. How- ever, he would not acknowledge negative roots and refused to add terms of different degrees because of his belief in the Greek principle of homogeneity of magnitudes. He also contri- buted to trigonometry and represented the number n as an infinite product. References [IJ Francisci Vietae, Opera mathematica, F. van Schooten (ed.), Leyden, 1646 (Georg alms, 1970). [2J Jacob Klain, Die griechische Logistik und die Entstehung der Algebra I, II, Quellen und 
445 Ref. Yon Neumann, John 1686 Studien zur Gesch. Math., (B) 3 (1934), 18- 105; (B) 3 (1936),122-235. 445 (XXI.43) Von Neumann, John John van Neumann (December 28,1903- February 8, 1957) was born in Budapest, Hungary, the son of a banker. By the time he graduated from the university there in 1921, he had a]ready pub]ished a paper with M. Fekete. He was later influenced by H. Weyl and E. Schmidt at the universities of ZUrich and Berlin, respectively, and he became a lecturer at the universities of Berlin and Hamburg. He moved to the United States in 1930 and in 1933 became professor at the Institute for Advanced Study at Princeton. In 1954 he was appointed a member of the US Atomic Energy Commission The fie]ds in which he was first interested were tset theory, theory of tfunc- tions of rea] variables, and tfoundations of mathematics. He made important contribu- tions to the axiomatization of set theory. At the same time, however, he was deeply inter- ested in theoretical physics, especially in the mathematical foundations of quantum me- chanics. From this field, he was led into re- search on the theory oftHilbert spaces, and he obtained basic results in the theory oftoper- ator rings of Hi]bert spaces. To extend the the- ory of operator flngs, he Introduced tcontinu- ous geometry. Among his many famous works are the theory of talmost periodic functions on a group and the solving of tHilbert's fifth problem for compact groups. In his ]ater years, he contributed to tgame theory and to the design of computers, thus playing a major role in all fields of applied mathematics. References [IJ 1. von Neumann, CoJlected works I-VI, Pergamon, 1961-1963. [2J J von Neumann, 1903-1957, J C. Oxtoby, B.1. Pettis, and G. B. Price (eds.), Bull. A mer. Math. Soc., 64 (1958),1-129. [3J 1. von Neumann, Mathematische Grund- lagen der Quantenmechanik, Springer, 1932. [4J 1. von Neumann, Functional operators I, II, Ann. Math. Studies, Princeton Untv. Press, 1950. [5J 1. von Neumann, Continuous geometry, Princeton Univ. Press, 1960 [6J 1. von Neumann and O. Morgenstern, Theory of games and economic behavior, Princeton U niv. Press, third edition, 1953. 
446 Wave Propagation 446 (XX.13) Wave Propagation A disturbance originating at a point in a medium and propagating at a finite speed in the medium is called a wave. For example, a sound wave propagates a change of density or stress in a gas, liquid, or solid. A wave in an elastic solid body is called an elastic wave. Surface waves appear near the surface of a medium, such as water or the earth. When electromagnetic disturbances are propagated in a gas, liquid, or solid or in a vacuum, they are called electromagnetic waves. Light is a kind of electromagnetic wave. According to tgeneral relativity theory, gravitational action can also be propagated as a wave. It many cases waves can be described by the wave equation: {)21ji =e 2 ( {)21ji + a21ji + {)21ji ) i1t 2 DX2 D y 2 az 2 . Here t is time, x, y, z are the Cartesian coordi- nates of points in the space, e is the propaga- tion velocity, and Iji represents the state of the medium. If we take a closed surface surrounding the origin of the coordinate system, the state Iji(O, t) at the origin at time t can be determined by the state at the points on the closed surface at time t - rle, with r the distance of the point from thc origin. More precisely, we have 1 f( j) ( 1 ) 1 (11ji Iji(O,t)=- Iji- - ---::;- 4n an r r un I 81ji ar ) ---- df er at an I-'Ie . Here n is the inward normal at any point of the closed surface, and the integral is taken over the surface, while the value of the inte- grand is taken at time t - rle. This relation is a mathematical representation of Huygens's principle, which is valid for the 3-dimensional case but does not hold for the 2-dimensional case ( 325 Partial Differential Equations of Hyperbolic Type). A plane wave propagating in the direction of a unit vector n can be represented by Iji = F(t-n.r/c), where F is an arbitrary function and r(x, y, z) is the position vector. The sim- plest case is given by a sine wave (sinusoidal wave): Iji = A sin (wt - k. r + b). Here A(ampli- tude) and 15 (phase constant) are arbitrary con- stants, k is in the direction of wa ve propaga- tion and ,atisfies the relation Iklc=w. w is the angular frequency, wl2n the frequency, k the wave number vector, Ikl the wave number, 2nlw 1688 the period, and 2n/lkl the wavelength. The velocity with which the crest of the wave ad- vances is equal to w/lkl =e and is called the phase velocity. A spherical wave radiating from the origin can generally be represented by Iji= L qJ n (  ) n !F ( t- ) , n rdr r e where qJn is the tsolid harmonic of order n. Waves are not restricted to those governed by the wave equation. In general. Iji is not a scalar, but has several components (e.g., tjJ may be a vector), which satisfy a set ofsimulta- neous differential equations of various kinds. Usually they have solutions in the form of sinusoidal waves, but the phase velocity c = w/lkl is generally a function of the wavelength X Such a wave, called a dispersive wave, has a propagation velocity (velocity of propagation of the disturbance through the medium) that is not equal to the phase velocity. A distur- bance of finite extent that can be approxi- mately represented by a plane wave is propa- gated with a velocity e - 2dcld;., called the group velocity. Often there exists a definite relationship between the amplitude vector A (and the corresponding phase constant b) and wave number vector k, in which case the wave is said to be polarized. In particular, when A and k are parallel (perpendicular), the wave is called a longitudinal (transverse) wave. Usually equations governing the wave are linear, and therefore superposition of two solutions gives a new solution (tprinciple of superposition). Superposition of 1 wo sinusoi- dal waves traveling in opposite directions gives rise to a wave whose crests do not move (e.g., Iji=Asinwtsink'r). Such a wave is called a stationary wave. Since the energy of a wave is proportional to the square of Iji, the energy of the resultant wave formed by superposition of two waves is not equal to the sum of the energies of the component waves. This phe- nomenon is called interference. When a wave reaches an obstacle it propagates into the shadow region of the obstacle, where there is formed a special distribution of energy de- pendent on the shape and size of the obtacle. This phenomenon is called diffraction. For aerial sound waves and water waves, if the amplitude is so large that the wave equation is no longer valid, we are faced with tnonlinear problems. For instance, shock waves appear in the air when surfaces of dis- continuity of density and pressure exist. They appear in explosions and for bodIes traveling at high speeds. Concerning wave mechanics dealing with atomic phenomena . 351 Quan- tum Mechanics. 
1689 References [IJ H. Lamb, Hydrodynamics, Cambridge Univ. Press, sixth edition, 1932. [2J Lord Rayleigh, The theory of sound, Mac- mi11an, second revised edition, I, 1937; II, 1929. [3J M. Born and E. Wolf, Principles of optics, Pergamon, fourth edition, 1970. [4J F. S. Crawford, Jr., Waves, Berkeley phys. course III, McGraw-Hi11, 1968. [5J C. A. Coulson, Waves; A mathematical theory of the common type of wave motion, Oliver & Boyd, seventh edition, 1955. [6J L. Bri11ouin, Wave propagation and group velocity, Academic Press, 1960. [7J I. Tolstoy, Wave propagation, McGraw- Hill, 1973. [8J 1. D. Achenbach, Wave propagation in elastic solids, North-Ho11and, 1973. [9J K. F. Graff, Wave motion in elastic solids, Ohio State Univ. Press, 1975. [IOJ J. Lighthi11, Waves in fluids, Cambridge Univ. Press, 1978. [IIJ R. Courant and D. Hilbert, Methods of mathematical physics II, Interscience, 1962. 447 (XXI.44) Weierstrass, Karl Karl Weierstrass (October 31, 1815-February 19, 1897) was born into a Catholic family in Ostenfelde, in Westfalen, Germany. From 1834 to 1838 he studied law at the University of Bonn. In 1839 he moved to Munster, where he came under the influence of C. Gudermann, who was then studying the theory of e11iptic functions. From this time unti11855, he taught in a parochial junior high school; during this period he published an important paper on the theory of analytic functions. Invited to the University of Berlin in 1856, he worked there with L. Kronecker and E. E. Kummer. In 1864, he was appointed to a fuII professorship, which he held until his death. His foundation of the theory of analytic functions of a complex variable at about the same time as Riemann is his most fundamental work. In contrast to Riemann, who utilized geometric and physical intuition, Weierstrass stressed the importance of rigorous analytic formulation. Aside from the theory of analytic functions, he contributed to the theory of functions of real variables by giving examples of continuous functions that were nowhere differentiable. With his theory of tminimal surfaces, he also contributed to geometry. His lectures at the University of Berlin drew many 448 Ref. Weyl, Hermann listeners, and in his later years he was. a re- spected authority in the mathematical world. References [IJ K. Weierstrass, Mathematische Werke I-VII, Mayer & Miller, 1894-1927. [2J F. Klein, Vorlesungen uber die Entwick- lung der Mathematik im 19. Jahrhundert I, Springer, 1926 (Chelsea, 1956). 448 (XXI.45) Weyl, Hermann Hermann Weyl (November 9, 1885-December 8, 1955) was born in Elmshorn in the state of Schleswig-Holstein in Germany. Entering the University of Gottingen in 1904, he also au- dited courses for a time at the University of Munich. In 1908, he obtained his doctorate from the University of G6ttingen with a paper on the theory of integral equations, and by 1910 he was a lecturer at the same university. In 1913, he became a professor at the Federal Technological Institute at Zurich; in 1928- 1929, a visiting professor at Princeton Univer- sity; in 1930, a professor at the University of Gottingen; and in 1933, a professor at the Institute for Advanced Study at Princeton. He retired from his professorship there in 1951, when he became professor emeritus. He died in ZUrich in 1955. Weyl contributed fresh and fundamental works covering a11 aspects of mathematics and theoretical physics. Among the most notable are results on problems in tintegral equations, tRiemann surfaces, the theory of tDiophantine approximation, the representation of groups, in particular compact groups and tsemisimple Lie groups (whose structure he elucidated), the space-time problem, the introduction of taffine connections in differential geometry, tquantum mechanics, and the foundations ofmathemat- ics. In his later years, with his son Joachim he studied meromorphic functions. In addition to his many mathematical works he left works in philosophy, history, and criticism. References [IJ H. Weyl, Gesamme1te Abhandlungen I - IV, Springer, 1968. [2J H. Weyl, Die Idee der Riemannschen F1ache, Teubner, 1913, revised edition, 1955; English translation, The concept of a Riemann surface, Addison-Wesley, 1964. 
449 A Witt Vectors [3J H. Weyl, Raum, Zeit, Materie, Springer, 1918, fifth edition, 1923; English translation, Space, time, matter, Dover, 1952. [4J H. Weyl, Das Kontinuum, Veit, 1918 [5J H Weyl, Gruppentheorie und Quanten- mechanik, Hirzel, 1928. [6J H. Weyl, Classical groups, Princeton Univ. Press, 1939, revised edition, 1946. [7J H. Weyl and F. 1. Weyl, Meromorphic functions and analytic curves, Princeton Univ. Press, 1943. [8J H. Weyl, Philosophie der Mathematik und Naturwissenschaften, Olden bourg, 1926; En- glish translation, Philosophy of mathematics and natural science, Princeton Univ. Press, 1949. [9J H. Weyl, Symmetry, Princeton Univ. Press, 1952 449 (111.18) Witt Vectors A. General Remarks Let r be an tintegral domain of character- istic 0, and p a fixed prime number. For each infinite-dImensional vector x = (x o , XI, ...) with components in r, we define its ghost com- ponents XIOI, x(1 J, ... by X(OI = X o , Xl") = xg" + pxf"-t +... + p" X n . We define the sum of the vectors x and Y = (Yo, y, , ... ) to be the vector with ghost components XIOI + y<0l, Xl') + y<' I, ..., and their product to be the vector with ghost components XIO) y<0>, Xl') y<' I, .... The sum and product are uniquely determined vectors with components in r Writing their first two terms explicitly, we have x+y = f x o + Yo, x, + YI - P:t'  (  ) xyg- " ...1, ,I p v J xy=(xoYo, x, yg + YI xg + pX I y"...). In general, it can be proved that the 11th components O"n(x, y) and nn(x, y) of the sum and product are polynomials in X o , Yo, XI, YI, ..., x"' y" whose coefficients are rational integers. With these operations of addition and multi- plication, the set of these vectors forms a tcom- mutative ring, of which the zero element is (0,0,. .) and the unity element is (1,0, ... ). Let k be a field of characteristic p For vectors ((0' (1'''')' and (110,/11,...) with components in k, we define their sum and product by (so, Sl' ...) + (/10, 'It, ...) = ( ..., O"n((, 'I), ...) and (so, , , ... )(110' 11, , ... ) = ( ... , n,,(, 11), ... ). Since the coefficients of 0"" and nn are rational in- 1690 tegers, these operations are well defined. With these operations, the set of such vectors be- comes an integral domain W(k) of character- istic 0. Elements of W(k) are called Witt vec- tors over k. If we put V((o, (1, ...) = (0, (0, J' ...) and (o,("...)P=«(g,(f, ...), we get the formula PS = V I;p. (Note that this (p is not the pth power of ( in W(k) in the usual sense.) There- fore, if we pu t I  1= p -" for a vector  whose first nonzero component is n' then this abso- lute value I I gives a tvaluation of W(k). In par- ticular, when k is a tperfect field, d'noting the vector (0,0,...) by {o} we get (o, I' ...)= Lpj{r-l and W(k) is a tcomplete valu- ation ring with respect to this valuation. Therefore the tfield of quotients of W(k) is a complete valuation field of which p is a prime element and k is the tresidue class field. Con- versely, let K be a field of characteristic ° that is complete under a tdiscrete valuation v, 0 be the valuation ring of v, and k be the residue class field of L Assume that k is a perfect field of characteristic p. If p is a prime element of lJ, then 0 = W(k) If v(p) = v(n e ) (e> 1) with a prime element J[ of 0, we have 0 = W(k) [nJ, and J[ is a root of an tEisenstein polynomial X" + a j X e -' +... + a e (ajEpW(k), a e $ p2 W(k)). In this way we can determine explicitly the structure of a tp-adic number field ( 257 Local Fields). B. Applications to Abelian p-Extensions and Cyclic Algebras of Characteristic p Next we consider W,,(k) = W(k)/V"W(k). The elements of Wn(k) can be viewed as the 11- dimensional vectors (o, "', (,,_,), but their laws of composition are defined as in the pre- vious section. They are called Witt vectors of length 11 We define an operator \') by\,) = P -. Using it, we can generalize the theory of t Artin-Schreier extensions ( ] 72 Galois Theory) to the case of Abelian ex1ensions of exponent p" over a field of characteristic p Indeed, let k be a field of characteristIc p and (=((0, ...,,,_,) an element of W(k). Ifl1= (110' ...,11,,-,) is a root of the vector equation .)X -=o, then the other roots are of the form 11 + IX(IX = (1X0,..., IX n -,), lXiE F p ). In partic- ular, if o $ pk = {IX P -IX IIXE k}, the field K = k(l1o, "', I1n-') is a cyclic extension of degree p" over k, and conversely, every cyclic extension of k of degree pn is obtained in this way. Let (I/p) denote the set of all roots of gJX-  =0. Then more generally, any fitlite Abelian extension of exponent pn of k can be obtained as K = k((I/ p) I  E H) with a suitable finite subgroup H/J w,.(k) of Wn(k)/p W,(k), and 
1691 the Galois group of K/k is isomorphic to Hh} w.,(k). Moreover, for a tcyclic extension K = k((1/ p)f3) of exponent pn over k and for exEk(ex#O), we can define a tcyclic algebra (ex, fJJ generated by an element u over K by the fundamental relations u pn = ex, pO=[1, uOu- 1 =(}+(I,O, ..,0) (where 0=(° 0 , ...,On-I)' uOu- 1 = (uOou -I , ..., uO n - t u-' )), and (ex, Ii] is a central simple algebra over k. Using these results, we can develop the structure theory of the tBrauer group of expo- nent pn of a tfield of power series in one vari- able with coefficients in a finite field Fq (of a tfield of algebraic functions in one variable over Fq) exactly as in the case of a p-adic field (of an algebraic number field) (E. Witt [1 J;  29 Associative Algebras G). On the other hand, Wn(k) is a commutative talgebraic group over k and is important in the theories of algebraic groups and tformal groups ( 13 Algebraic Groups). References [IJ E. Witt, Zyklische Karper und Algebren der Charaktcristik p vom Grad pn, J. Reine Angew. Math., 176(1937), 126-140. [2J H. Hasse, Zahlentheorie, Akademie- Verlag., 1949. [3J N. Jacobson, Lectures in abstract algebra III, Van Nostrand, 1964. 449 Ref. Witt Vectors 
450 A Zeta Functions 450 (V.19) Zeta Functions A. Introduction Since the 19th century, many special functions called (-functions (zeta functions) have been defined and investigated. The four main prob- lems concerning (-functions are: (I) Methods of defining (-functions. (2) Investigation of the properties of (-functions. Generally, (- functions have the following four properties in common: (1) They are meromorphlc on the whole complex plane; (ii) they have tDirichlet series expansions; (iii) they have Euler product expansions; and (iv) they satisfy certain func- tional equations. Also, it is an important prob- lem to find the poles, residues, and zeros of (- functions. (3) Application to number theory, in particular to the theory of decomposition of prime ideals in finite extensions of algebraic number fields ( 59 Class Field Theory) (4) Study of the relations between different (- functions. Most of the functions called (-functions or L-functions have the four properties of prob- lem (2). The following is a classification of the important types of (-functions that are already known, which will be discussed later in this article: (I) The (- and L-functions of algebraic num- ber fields: the Riemann (-function, Dirichlet L- functions (study of these functions gave im- petus to the theory of (-functions), Oedekind (-functions, Hecke L-functions, Hecke L- functions with tGrossencharakters, Artin L- functions. and Weil L-functions. (2) The p-adic L-functions related to the works of H. W. Leopold!, T Kubota, K. Iwasawa, etc (3) The (-functions of quadratic forms: Epstein (- functions, (-functions of indefinite quadratic forms (C L. Siegel), etc. (4) The (- and L- functions of algebras: Hey (-functions and the (-functions given by R. Godement, T. Tamagawa, etc (5) The (-functions associated with Hecke operators, related to the work of E Hecke, M Eichler, G. Shimura, H. Jacquet, R. P. Langlands. etc. (6) The congruence (- and L-functions attached to algebraic varieties defined over finite fields (E. Artin, A. Weil, A Grothendieck, P. Deligne), (- and L-functions of schemes. (7) Hasse (-functions attached to the algebraic varieties defined over algebraic number fields. (8) The (-functions attached to discontinuous groups. Selberg (-functions. the Eisenstein series defined by A. Selberg, Gode- ment, and I M. Gel'fand, etc. (9) Y. Ihara's (- function related to non-Abelian class field theory over a function field over a finite field. 1694 (10) (-functions associated with prehomoge- neous vector spaces (M. Sato, T. Shintani). B. The Riemann (-Function Consider the series I I 1 ( 5 ) = 1+-+-+... +-+..., 2' 3' n S which converges for all real numbers 5> I. It was already recognized by L. Euler that (5) can also be expressed by a convergent infinite product ITp(l  p -') -', where p runs over all prime numbers (Werke, ser. I, vol VII, ch. XV. 9 274). This expansion is called Euler's infinite product expansion or simply the Euler product. However, Riemann was the first to treat (5) successfully as a function of a complex variable 5 (1859) [R I]; for this reason, it is called the Riemann (-function. As can be seen from its Euler product expansion, (5) is holomorphic and has no zeros in the domain Re 5 > 1. Riemann proved, moreover, that it has an analytic continuation to the whole complex plane, is meromorphic everywhere, and has a unique pole 5 = I. The functions (5 - 1)(5) and (5) - 1/(5 - 1) are tintegral functions of 5. This can be seen by considering the integral ex preSSIOn 7[-S/2f (  ) (5)= f X xs/2-1 ( :t e-n2u ) dX 2 0 n1 1 1 I x , = ___.+ (X(1-,)/2-1 +x('2)-I) 5 I-, 1 x ( :t e -n 2 U ) dX. n=l From this last formula, we also obtain an equality (s)=w -S), where (s) = 1[ -s/2 f (s/2)(s). This equality is called the functional equation for the (-function. The residue of (s) at 5= I is 1, and around s= I, I (s)=--+C+O(ls-II), 8-1 where C is tEuler's constant This is called the Kronecker limit formula for (5). The function (5) has no zeros m Re 5? I, and its only zeros in Re50 are simple zeros at s= -2, -4. ..., -2n,.... But ((S) has i,n- finitely many zeros in 0 < Re 5 < 1, which are called the nontrivial zeros. B. Riemann conjec- 
1695 tured that all nontrivial zeros lie on the line Res= 1/2 (1859). This is called the Riemann hypothesis, which has been neither proved nor disproved ( Section I). If N(T) denotes the number of zeros of (s) in the rectangle 0 < Res< 1, 0 <Im s <: T, we have an asymptotic formula I 1 + log2IT N(T)=TlogT- T +O(logT) 2IT 2IT (H. von Mangoldt, 1905). Also, (s) has the following infinite product expansion: 1 1 / , ) (s-I)((s)=eh' rG'+ I) IJ I- e S / P , where h is a constant and p runs over all non- trivial zeros of (s) (J. Hadamard, 1893). Hadamard and C. de La Vallee-Poussin proved the tprime number theorem, almost simultaneously, by using some properties of (s) ( 123 Distribution of Prime Numbers B). The following approximate functional equa- tion is important in investigating the values of (s). 1 I ((8)= L -;-+<p(s)L  n<x n n<y n + O(x- U )+ O(yH ItI (112 )-a), where <p is tEuler's function, and (s) = <p(s)W -s), s=a+it, 2ITxy=ltl, and the approximation is uniform for - h:::; a:::; h, x> k, y> k with hand k positive constants (G H. Hardy and J. E. Littlewood, 1921). Euler obtained the values of «s) for positive even in te gers s: 22m-1IT2m B " ( 2 ) 2m  m = (2m)! (m = 1,2,3, .., , and the B 2m are tBernoulii numbers). The values of (s) for positive odd integers s, however, have not been expressed in such a simple form. The values of (s) for negative integers s are given by (0) = B 1 (0) 1 " Bn(O)_ = -2, (,(1-n)= --, n-2, 3, ..., where n the Bn(x) are tBernoulii polynomials. As a slight generalization of ((s), A. Hurwitz (1862) considered % 1 ((s,a)= L -, O<a:::;1 no(n+a)' This is called the Hurwitz (-function. Thus (s, I)=«s), and (s, 1/2)=(2'-I)(s). This function (s, a) can also be continued analyti- cally to the whole complex plane and satIsfies a certain functional equation. But in general it has no Euler product expansion. 450 C Zeta Functions C. Dirichlet L-Functions Let m be a posItive integer, and classify all rational integers modulo m. The set of all classes coprime to m forms a multiplicative Abelian group of order h = <p(m). Let X be a tcharacter of this group. Call (n) the residue class of n mod m, and put x(n) = x((n)) when (n,m)= 1 and X(n)=O when (n,m)-# 1. Now, the function of a complex variable s defined by % x(n) L(s) = L(s, X) = L ----;- nl n is called a Dirichlet L-function. This function converges absolutely for Res> I and has an Euler product expansion I L(s, X) = TI I ,() -, P -Xpp If there exist a divisor f of m (J -# m) and a character l modulo [such that x(n)=xo(n) for all n with (n, m) = I, we call X a nonprimitive character. Otherwise, X is called a primitive character. If X is nonprimitive, there exists such a unique primitive Xo. In this situation, the divisor f of m associated with Xo is called the conductor of X (and of XO). We have L(s,x) = L(s, Xo) TI (1- XO(p)p -'). plm Let X be primitive. If the conductor [= I, then X is a trivial character (X = I), and L(s) is equal to the Riemann (-function (s). On the other hand, if f> I, then L(s) is an entire func- tion of s. In particular, if X is a nontrivial primitive character, L(1) = L(1, X) is finite and nonzero. P. G. L. Dirichlet proved the theorem of existence of prime numbers in arithmetic progressions using this fact ( 123 Distri- bution of Prime Numbers D). L(s, X) has a functional equation similar to that of (s); namely, if X is a primitive character with conductor f and we put (s, X) = (J/IT)'/2 r«s+ a)/2)L(s, X), where a=O for X( -1)= 1 and a= 1 for X( -1)= -1, then we have (s,X)= W(X)W s,X), where W(x)=( -itF 1 / 2 T(X), T(X)= L x(n)(f nmodl «(f=exp(2ITi/J)). The latter sum is called the Gaussian sum. Note that I W(x)1 = I. The values of L(s) for negative integers s are given by L(I-m, X)= -Bx.m/m (m= 1,2,3,...), where the Bx. m are defined by f X(fJ.)te1 co I ---yr-= L Bx,mt m . 1 e - 1 mO 
450D Zeta Functions Moreover, if X( -1)= -1, we have n T(X) 1_ I.(1,X)=-;-- L (-X(x)'x) I f x=1 =niT(x)B X . 1 , and if X( -1)= 1, X# 1, we have T(X) [11 2 ] I.(1,x)=2 j J1 (-X(x)logll-(fIJ. In certain cases, the functional equation can be utilized to obtain the values of I.(m, x) from those of I.(I-m,X). Actually, ifX(-I)= 1, m= 2n= 2,4, 6, ..., we have ( -1)" ( 2n ) 2n I.(2n, x) = (2n)! f T(X)( - Bx. 2n), and if X( -1)= -1, m=2n+ 1 = 3, 5, 7,..., we have I.(2n + 1, X) . (-1)" ( 2n ) 211+1 =(-I) (2n+ I)! r T(X)(-B x . 211 +J)' Dirichlet I.-functions are important not only in the arithmetic of rational number fields but also in the arithmetic of quadratic or cyclo- tomic fields. D. (-Functions of Algebraic N umber Fields (Dedekind (-Functions) The Riemann (-function can be generalized to (-functions of algebraic number fields ( 14 Algebraic Number Fields). Let k be an alge- braic number field of degree n, and let 0 run over all integral ideals of k. Consider the se- quence (k(S) = La N(o) -so This sequence con- verges for Res> 1 and has an Euler product expansion (k(S) = ITp(l -N(p)-r 1 , where p runs over all prime ideals of k. This function, which is continued analytical1y to the whole complex plane as a meromorphic function, is called a Dedekind (-function. Its only pole is a simple pole at s= 1, with the residue hkKk' Here h k is the tclass number of k, and Kk = 2" +"n"R/(wldj1/2), where r 1 (2r 2 ) is the number of isomorphisms of k into the real (complex) number field, 11,' is the number of roots of unity in k, d is the tdiscriminant of k, and R is the tregulator of k (R. Dedekind, 1877) [01]. The function (ds) has no zeros in Re S  1, while in ResO it has zeros of order r 2 at -1, -3, -5,..., zeros oforderr 1 +r 2 at -2, -4, -6,..., and a zero of order r 1 + r 2 -1 at s=O. All other zeros lie in the open strip 0 < Re s < 1, which actually contains infinitely many zeros. It is conjectured that all these zeros lie on the line Res= 1/2 (the Riemann hypothesis for Dedekind (-functions). To obtain a generali- 1696 zation of the functional equation for the Riemann (s) to the case of (k(S), we put ( Jidf ) s ( s ) " Bk(s)= 2"n"/2 r:2 r(S)"(k(S), Then Bds)=Bdl-s) (Hecke, 1917). If K is a Galois extension of k, then (K(S)/':k(S) is an integral function (H. Aramata, 1933; R. Brauer, 1947). E. Heeke I.-Functions As a generalization of Dirichlet I.-functions to algebraic number fields, Hecke (1917) defined the fo\1owing I.-function I.k(s, X): Let k be an algebraic number field of finite degree, and let m = mITp '" be an tintegral divisor (m the finite part, IIp,,, the tin finite part). Consider the tideal class group of k modulo m and its char- acter X (here we put x(o)=O for (a, m) # 1). Then the I.-functions are defined by I.k(s, X)= L X(o)/N(o)' a [H2], where 0 runs over all integral ideals of k. I.ds, x) is called a Hecke I.-function. It con- verges for Re s > 1 and has an Euler product expansion 1 I.k(s,X)=[l . p 1 - X(p)N(p)-S Here p runs over al1 prime ideals of k. If there is a divis.?r fl m (f # m) and a character XO modulo f such that XO(o) = x(o) for all 0 with (0, m)= 1, then X is called nonprimitive: other- wise, X is called a primitive charal:ter. In gen- eral, there exist unique such f and Xc. In this situation, f is called the conductOl' of X. If X is primitive and the conductor f is (1), then X is a trivial character and I.k(s, X) coincides with (k(S), If X is primitive and X # 1, then I.k(s, X) is an integral function of s, and I. k (1' X) # O. Utilizing this fact, it can be proved that there exist infinitely many prime ideals in each class of the ideal class group modulo an integral divisor m of k. Let X be a primitive character with the con- ductor f. d be the discriminant of k, 0'1' ...,0', be all distinct isomorphisms of k into the rea\ number field R, and f be the finite part off. Then if  is an integer of k such that  == I (mod f), we have X«))=(sgn "')"'.... . (sgn ",,)"". where am (m = 1, ..., rrJ is either 0 or 1, depend- ing on x. By putting k(S, X) = ( J1dIN(f) ) ' " ( s+a m ) , 2"n"/2 'Il r ------z- r(s) 2 . I.k(s, X), 
1697 we have the following functional equation for the Hecke L-function: k(s,x) = W (X)k(l - s,x), where W(x) is a complex number with ab- solute value 1 and the exact value of W(X) is given as a Gaussian sum. Just as some prop- erties concerning the distribution of prime numbers can be proved using the Riemann (- function and Dirichlet L-functions, some prop- erties concerning the distribution of prime ideals can be proved using the Hecke L- functions ( 123 Distribution of Prime Num- bers F). T. Takagi used Hecke L-functions in found- ing his tclass field theory. In the other direc- tion, this theory implies L(I, X) # 0 (X # 1). Let K be a tclass field over k that corre- sponds to an ideal class group H of k with index h. By using class field theory, we obtain (K(S) = ITx Lk(s, X), where the product is over all characters X of ideal class groups of k, such that X( H) = 1. This formula can be regarded as an alternative formulation of the decompo- sition theorem of class field theory ( 59 Class Field Theory). By taking the residues of both sides of the formula at s= 1, we obtain hKK K =hkKkITdl Lk(l, Xl. In particular, if k = Q (the rational number field) and K is a quadratic number field Q(jd) (d is the discriminant of K), then we have (K(S) = ((s). L(s), L(s)= t (  ) n-" 11=1 n where (din) is the tKronecker symbol, and we put (din) = 0 when (n, d) # 1. From this, we obtain the class number formula for quadratic number fields ( 347 Quadratic Fields). A similar method is used for computation of class numbers of cyclotomic fields K ( 14 Algebraic Number Fields L). In general, the computation of the relative class number hKlh k when Klk is an Abelian extension is reduced to the evaluation of L(I, X). This computation has been made suc- cessfully for the following cases (besides for the examples in the previous paragraph): k is imaginary quadratic and K is the absolute class field of k or the class field corresponding to tray S(m); k is totally real and K is a totally imaginary quadratic extension of k. H. M. Stark and T. Shintani made conjectures about the values of L(I, X) [S25, S19]. Let L(s, X) be a Hecke L-function for the character X. Then it follows from the func- tional equation that the values of L(s, X) at s =0, -1, -2, -3,... are zero if k is not totally real. Furthermore, if k is a totally real finite algebraic number field, then these values of L(s, X) are algebraic numbers (c. L. Siegel, H. Klingen, T. Shin tan i). 450 F Zeta Functions F. Hecke L-Functions with Gri.issencharakters E. Hecke (1918,1920) extended the notion of characters by introducing the tGrossen- charakter X and defined L-functions with such characters: x(a) Lds,x)= N(ar He also proved the existence of their Euler product expansions and showed that they satisfy certain functional equations [H3]. Moreover, by estimating the sum LN(p)<x X(V), he obtained some results on the distribution of prime ideals. Later, Iwasawa and J. Tate independently gave clearer definitions of the Grossencharak- ter X and Lk(s, X) by using harmonic analysis on the adele and idele groups of k ( 6 Adeles and Ideles) [L3]. Let J k be the idele group of k, P k be the group of tprincipal ideles, and C k = J klPk be the idele class group. Then a Grossencharakter is a continuous character X of Ck, and X in- duces a character of J k , which is also denoted by X. Let J k = J x x J o be the decomposition of J k into the infinite part J x and the finite part J o . Let U o be the unit group of J o , and for each integral ideal m of k, put U m . O = {11 E U o I u:= 1 (mod m)}, so that {U m . O } forms a base for the neighborhood system of 1 in J o . Put J"T.o={aEJola p = 1 for all vim}, and with each a E J m . O , associate an ideal a = IT p v I'p(aJ, where a=(a p ) and the ideal in kp generated by a p is equal to pl'p(o). Then the mapping a->a gives a homomorphism of J m . O into the group G(m)={al(a, m)= I}, and its kernel is contained in U m . O ' Since X is continuous, x(U""o)= 1 for some m. The greatest common divisor f of all such ideals m is called the con- ductor of X. For each aEJ r . o , X(a) depends only on the ideal a (E G(f)); hence by putting x(a) = x(a), we obtain a character X of G(f). Now put Lds,X)=Lx(a)IN(a)', where the sum is over an integral ideals a E G(f). This is called a Hecke L-function with Grossencharakter X. For X# 1, it is an entire function. On the other hand, if we restrict X to J en = R*" x C*", then for u=(a 1 , .,. ,Q'l,Q'l+ 1 , ... ,arl+r)EJ, we have '1 h 2 " '1+'2 ( a ) "' X(u)= f1l a i'j.j-I. f1 (sgna)e j . f1  , j=l j=t l=r 1 +1 lall where ej=O or 1, e[EZ, AjER. The numbers ej, e[, )"j are determined uniquely by X. Putting k(S, X) = ( JIdINW ) S . f1 " f ( s + e j + )"jJ=1 ) 2'2n nj2 j=1 2 '1+" ( Ie 1+), J=1 ) x f1 f s + I [ . L(s. X), [='1 +1 2 
450 G Zeta Functions we have a functional equation ds, X) = W(z)k( 1 - S, n where W(X) is a complex number with ab- solute value I. We can express ds, X) by an integral form on J k as  ds.z)=c f (p(r)x(r)V(r)'d*r, J, where V(r) IS the ttotal volume of the idele r, (' is a constant that depends on the tHaar mea- sure d*r of J k , and (p(r) is defined by <p(r) = f1 <p,,(x p ), r = ( ... x" ..), " (P", ,(x) =xcle-nx2, irl' k p,1 =R, =_xele -2nl\:1 2 e o } = 2 1 n x -e;e 2ITI,'1 2 e' 0' i> r" k p ,; = C, 2n ' I , (p,,(x)=e 2ni '((x\ XE(bf)" } . - -I P finIte. =0, xrj3(bt)" Hence (bf)1 is the p-component ( 6 Adeles and Ideles B) of the ideal (bf) -, (b is the tdif- ferent of k/Q) and ;,(.x) is an additIve character of k" defined as follows. Qp is the tp-adic field, Zp is the ring of p-adic integers, ;'0 is the mapping QI'-+Qp/ZpcQ/Zc R/Z, and ;,= ;'00 Trk" Qp By putting X(r)= IT" X,,(x,,), r = (... x"...), we have ds, X) = (' 0 r (Pp(x)x,,(x) Y;,(x) -'d*x, p J k.p where p runs over all prIme divIsors of k, finite or infinite. Moreover, with a constant C", we have r (P", Jx)X", ,{x)Y;" ,(.x)-'d*x 1 p , , =c 'J[ (,>+" !/l+eiJ2 ", " xf(s+-=-l ;'i+ei)/2), k", ;=R, = C ( 2n ) 1\' 1./ -1 i, +,".11;2) p, , x f(S+ (YCl ;'1 + le.l)/2), k p " = C, r <p(x)X,,(x)Y;,(x)-'d*x J"p - C N(b ) '-1/2 - ( E -I ) . I - p " X '" 1 _ X(p);N(p)' , =C p N«(bi)p)'t,,(X,,)'j1(Vj,p), pli. Here t,,(X p ) is a constant called the local Gaussian sum, and j1( Vi ,) is the volume of { U E k" I U == 1 (mod i) }. These integrals over k" are the r -factors and Euler factors of k(5, X), according as p is infinite or finite. The func- 1698 tional equation is obtained by applying the tPoisson summation formula for <p(x) and its "F ou rier transform on the adele group Ak ( 6 Adeles and Ideles). Let Ok be the connected component of I in C k . If X(O.j= I, the corresponding X is a char- acter of an ideal class group of k with a con- ductor f. Conversely, all such characters can be obtained in this manner. As stated in Section E, the Hecke L- functions with characters (of ideal class groups) can be used to describe the decom- position law of prime divisors in class field theory. However, for L-functions with Gras- sencharakter, such arithmetic implications have not been found yet, except that in the case of Grossencharakters of Ao type, Y. Taniyama discovered, following the suggestion of A. Weil, that the L-function has a deep connection with the arithmetic of a certain infinite Abelian extension of k [T2, W7]. In particular, when L(s, X) is a factor of the tHasse (-function of an Abelian variety A with tcomplcx multiplication, it dcscribcs thc arith- metic of the field generated by the coordinates of the division points of A. G. Artin L-Functions Let K be a finite Galois extension of an alge- braic number field k (of degree n), G = G(K/k) be its Galois group, a-+A(a) be a matrix representation (characteristic 0) of G, and X be its character. Let p be a prime ideal of k, and define Lp(s, X) by x X(pm) 10gL,,(s,X)= L - , Res> 1, m=l mN(pmy with X(pm)=(1/e)L<EIX(a m t), where T is the tinertia group of p, I TI = e, and a is a tFrobe- nius automorphism of p. Then we have L,,(s, X)= det(E - A,,' N(p)-')-l, 1 A,,=- L A(at). e tEl' In particular, if T = {I} (i.e., p is tunramified in K/k), then L,,(s, x)=det(E - A(a)' N(p)-')-I. Now put L(s,X,K/k)=OL,,(s,X), Res>l, " and call L(s, X, K/k) an Artin L-function [A2]. (I) The most important property of L(s,X, K/k) is that if K/k is an Abelian extension and X is a linear character, it follows from class field theory that X(p) is the character of the ideal class group of k (modulo the tconductor of K/k) and that the Artm L-function equals 
1699 a Hecke L-function. This equality is equivalent to Artin's treciprocity law, and in fact Artin obtained his reciprocity law after he conjec- tured the equality. (2) If K':::J K:::J k and K'/k is a Galois exten- sion, then L(s, X, K/k) = L(s, X, K'/k). (3) If K:::JQ:::J k and IjJ is a character of G(K/Q), then L(s, 1/1, K/Q) = L(s, X, K/k), where X, is the character of G(K/k) tinduced from iff. (4) If X, = I, then L(s, X" K/k) = skis). (5) L(.s,x, +X2,K/k)=L(s,X"K/k) L(s,x2, K/k) Conversely, the Artin L-function L(s, X, K/k) is characterized by properties (1 )-(5) (6) If XR is the tregular representation of G, then L(s, XR, K/k) = sKis); hence SK(S)=Sk(S) f1 L(s, X, K/kV 111 , 1#1 where X runs over all irreducible characters #lofG. (7) Every character of a finite group G can be expressed as X=LmiX, (miEZ), where each 1.", is an tinduced character from a certain linear character 1/1; of an elementary subgroup of G (Brauer's theorem). (Here an elementary subgroup is a subgroup that is the direct product of a cyclic group and a p-group for some prime p.) Hence (3) and (5) imply that an Artin L-function is the product of integral powers (positive or negative) of Hecke L- functions Ln,(s, I/1J L(.\,X, K/k)= f1 Ln,(s, I/1r'. i Hence an Artin L-function is a univalent meromorphic function defined over the whole complex plane. Artin made the stil1 open con- jecture that if X is irreducible and 1.# 1, then L(s,"/., K/k) is an entire function (Artin's conjecture ). This conjecture holds obviously if all m; are nonnegative. Except for such a case, Artin's conjecture had no affirmative examples until 1974, when Deligne and Serre [D9] proved that each "new cusp form" of weight I gives rise to an entire Artin L-function L(s, X, K/k) with 1.(1) = 2 and X(p) = 0 (p is the complex conjugation); by this method, some nontrivial examples were computed by J. Tate and J Buhler (Lecture /Jotes i/J math. 654 (1978)). Then R. P. Langlands [L5] constructed non- trivial examples of Artin's conjecture for cer- tain 2-dimensional representations Gal(K/k)3()"f---->A(a)EGL(2, C) by using ideas of H. Saito and T. Shintani [SI, S20]. This method works for all represen- tations for which the image of the A(a) in PG L(2, C) is the ttetrahedral group. 1 t also works for some toctahedral cases, but a new idea is needed in the ticosahedral case. 450 H Zeta Functions (8) Let P f.i (i= I,. .,1', +1'2) be the infinite primes of k. Put ;'(s, X, P f i, K/k) = (f(s/2)[«(s+ 1)/2))'(1) for complex P x.h = [(s/2)(X(1 )+XIU»/2 [«(s + 1 )/2)IX(1)- xI U »/2 for real P x.i, where aEG is the complex conjugation deter- mined by a prime factor of P x.i in K. Next we introduce the notion of the conductor f I. with the group character X defined by Artin (J. Rei/Je A/Jgew. Math., 164 (1931 )). First, for any subset 111 c G, we put 1.(111) = L mEm X (m); then fx is given by Ix = f(f., K/k) = f1 p 11P1 , p where 1 f(p) =[ {ex(I) - X(T)) + {pe, 1.(1) - xiV,)) e + {p"'x(l)- X(V2)) +. J. and where V" V z , "', are the higher tramifi- cation groups of prime factors of p in K (in lower numbering) and pe, = I Vii ( 14 Alge- braic Number Fields I). Now put , " _ ( ldI1.l1Nk(fx) ) '/2 <,(s,X,Kjk)- 7["1.11) x f1 }I(s, x, Px .j, K/k)' L(s, X, K/k). p; 1 Then the functional equation is written W -s,X,K/k)= W(X)(s,X,K/k), IW(x)l= I The known proof of this functional equation depends on (7) and the functional equations of Hecke L-functions discussed in Section E. As for the constants W(x), there are significant results by B. Dwork, Langlands, and Deligne [D6]. (9) There are some applications to the theory of the distribution of prime ideals. H. Weil L-Functions Weil defined a new L-function that is a gen- eralization of both Artin L-functions and Hecke L-functions with Grossencharakter [W5] Let K be a finite Galois extension of an algebraic number field k, let C K be the idele class group K,UK x of K, and let 'Y.K/kE Jf2(Gal(K/k), C K ) be the tcanonical coho- mology class of tclass field theory. Then thIs :J. K k determines an extension W K k of Gal(K/k) by C K : I--->C c '" W K k --->Gal(Kjk)---> I (exact), and 
450 I Zeta Functions the transfer induces an isomorphism Wfk  Ck, where ab denotes the topological com- mutator quotient. If L is a Galois extension of k containing K, then there is a canonical homomorphism W L / k -> W K / k . Hence we define the Weil group I-fi for k/k as the tprojective limit group projK lim W K / k of the W K / k . It is obvious that we have a surjective homomor- phism cp: I-fi ->Gal(k/k) and an isomorphism rk: C k -> Wk'b, where Wk'b is the maximal Abelian Hausdorff quotient of I-fi. For WE I-fi, let II w II be the adelic norm of r k - 1 (w). If k, is a tlocal field, then we define the Weil group I-fi" for kv/kv by replacing the idele class group C K with the multiplicative group K; in the above definition, where Kw denotes a Galois extension of kv. If kv is the completion of a finite algebraic number field k at a place v, then we have natural homomorphisms k; ->C k and Gal(k,.jk,,)->Gal(k/k). Accordingly, we have a homomorphism I-fi, -> I-fi that com- mutes with these homomorphisms. Let I-fi be the Weil group of an algebraic number field k, and let p: I-fi->GL(V) be a continuous representation of W k on a complex vector space V. Let v = p be a finite prime of k, and let p" be the representation of I-fi" induced from p. Let ct> be an element of I-fi" such that (p(ct» is the inverse Frobenius element of p in Gal(kv/k,,), and let I be the subgroup of I-fi, consisting of elements w such that cp(w) be- longs to the tinertia group of p in Gal(k,,/k,,}. Let VI be the subspace of elements in V fixed by p,,(l), let Np be the norm of p, and let Lp(V, s) = det(I-(Np(S Pv(ct» I V I r 1 . We can define Lv(V,s) for each Archimedean prime v also, and let L(V,s)= f1 L,,(V,s). v Then this product converges for s in some right half-plane and defines a function L(V,s). We calI L(V,s) the Weil L-function for the representation p: I-fi->GL(V). This function L( V, s) can be extended to a meromorphic function on the complex plane and satisfies the functional equation L(V,s)=£(V,s)L(V*,I-s) (T. Tamagawa), where V* is the dual of V, and £( V, s) is an exponential function of s of the form ab s [T6]. P. Deligne generalized these results in the following manner: Let W; be a tgroup scheme over Q which is the tsemidirect product of I-fi by the additive group G a , on which W k acts by the rule wxw- 1 = Ilwllx. We can define the notion of representations of W; and the L- functions of them in the natural manner [T6]. 1700 I. The Riemann Hypothesis As mentioned in Section B, the Riemann hypothesis asserts that all zeros of the Rie- mann (-function in 0 < Res < 1 lie on the line Re s = 1/2. In his celebrated paper [R 1], Rie- mann gave six conjectures (including this), and assuming these conjectures, proved the tprime number theorem: x f " dx n(x)---Li(x)= -, x->oo. logx 2 logx Here n(x) denotes the number of prime num- bers smalIer than x. Among his six conjectures, alI except the Riemann hypothesis have been proved (a detailed discussion is given in [Ll]). The prime number theorem was proved inde- pendently by Hadamard and de La ValIee- Poussin without using the Riemann hypothesis ( Section B; 123 Distribution of Prime N um- bers B). R. S. Lehman showed that there are exactly 2,500,000 zeros of ((0" + it) for which 0 < t < 170,571.35, all of which lie on the critical line 0"= 1/2 and are simple (Math. Camp., 20 (1966)). Later R. P. Brent extended this com- putation up to 75,000,000 first zeros (1979). Hardy proved that there are infinitely many zeros of ((s) on the line Res= 1/2 (1914). Fur- thermore, A. Selberg [S6] proved that if No(T) is the number of zeros of ((s) on the line with 0 < 1m s < T, then No(T) > A Tlog T (A is a posi- tive constant) (1942). Thus if N(T) is the num- ber of zeros of ((s) in the rectangle 0 < Re s < I, 0< Ims< T, then lim infToo No(T)/N(T) > O. N. Levinson proved hm infToo N lI (T)/N(T) > 1/3 (Advances in Math., 13 (1974)). If N,(T) is the number of zeros of (s) in 1/2 - £ < Res < 1/2+£, O<lms< T, then limTooN,(T)/N(T) = 1 for any positive number £ (H Bohr and E. Landau, 1914). Bohr studied the distribution of the values of ((s) in detail and Initiated the theory oftalmost periodic functions (1925). D. Hilbert remarked in his lecture at the Paris Congress that the Riemann hypothesis is equivalent to n(x) = Li(x) + O(fi log x), x-> co (H. von Koch, 1901). It is also equivalent to N L /l(n)=O(N 1 /2+'), N->oo, n=l for any £ > 0, where /l(n) is the Mejbius func- tion. Assuming the Riemann hypothesis, we get 1 1 +log2n I T) N(T)=-TlogT T+o(og 2n 2n (Littlewood, 1924). 
1701 The computation of the zeros of the (- functions and the L-functions of general alge- braic number fields is more difficult, but con- jectures similar to the Riemann hypothesis have been proposed. Weil showed that a necessary and sufficient condition for the validity of the Riemann hypo- thesis for all Hecke L-functions L(s, X) is that a certain distribution on the idele group J k be positive definite [WI (1952b)]. It is not known whether the general (- and L-functions of algebraic number fields have any zeros in the interval (0, 1) on the real axis (see the works of A. Selberg and S. Chowla). Similar problems are considered for the vari- ous (-functions given in Sections P, Q, and T. J. p-Adic L-Functions Let X be a tprimitive Dirichlet character with conductor j, and let L(s, X) be the tDirichlet L- function for X. Then the values L(1 - n, X) of L(s, X) at non positive integers 1 - n (n = 1,2,...) are algebraic numbers ( Section E). Let p be a prime number, let Qp be the tp-adic num- ber field, and let C p be the completion of the algebraic closure Qp of Qp. It is known that C p is also algebraicaIly closed. Since Q c Qp, we fix an embedding Q c Qp and consider {L(1 - n, X) }l as a sequence in Cpo Let I I p be the extension to C p of the stan- dard p-adic valuation of Qp. Let q be p or 4 according as pi=- 2 or p = 2, and let w be the primitive Dirichlet character with conductor q satisfying w(n) == n (mod q) for any integer n prime to p. Then T. Kubota and H. W. Leo- poldt proved that there exists a unique func- tion Lp(s, X) satisfying the conditions [K5]: a c<, (1) Lp(s,x)=--=-!..-+ I a n (s-l)" (an EC p ); s-1 nO (2) a_I =0 if xi=- 1 and the series L'%oan(S -1)" converges for Is-ll p < Iq-1 pl(P-l)l p ; (3) L p (l--n,x)=(I- Xw- n p1-n)L(1-n,Xw- n ) holds for n = 1,2,3, .... The function Lp(s, X) satisfying these three condition5 is caIled the p-adic L-function for the character X. It is easy to see that Lp(s, X) is identically zero if X( -I) = -1, but Lp(s, X) is nontrivial if X( -1) = 1. Let Bn be the Bernoulli number. Then Bn satisfies the conditions: (1) Bn/n is p-integral if (p - l)tn (von Staudt) and (2) (1/n)Bn == (l/(n + p -1))B n + p -- 1 (modp) holds in this case (Kum- mer). The generalization of these results for the generalized Bermoulli number Bx.n was obtained by Leopoldt. Since L(1 - n, X) = -(1 (n)B x . n , such p-integrabilities and con- gruences can be naturally interpreted and 450J Zeta Functions generalized in terms of the p-adic L-functions L(s, X). We assume X( -1)= 1. Then L p (O,X)=(l- xw -1(p))L(O, Xw -1) and xw -1( -1)= -1. Hence we can express the first factor h;; of the class number of a cyclotomic field Q(exp(2ni/ N)) as a product of some Lp(O, X)'s. By using this fact, K. Iwasawa proved [I7] that the p-part pe" of the tfirst factor h;;p" (N E N) satisfies e n - = An + j1pn + v (A, j1, VE Z; A, j1 O) for any sufficiently large n. Here Iwasawa conjectured j1 = 0, which was proved by B. Ferrero and L. Washington [Fl]. Also, we can obtain some congruences involving the first factor h;; of Q(exp(2ni/N)) from this formula. Let X be a nontrivial primitive Dirichlet character with conductor f, let f T(X)= I x(a)e2nia/f a=l be the tGaussian sum for X, and let logp be the p-adic logarithmic function. Then Leopoldt [L6] calculated the value L(I, X) and obtained L p (1, X) = - ( 1- X(P) ) T(X) f x(a)logp(1-e-2nia/f). p f a1 As an application of this formula, Leopoldt obtained a p-adic tclass number formula for the maximal real subfield F=Q(cos(2n/N)) of Q(exp(2ni/N)): Let (p(s,F) be the product of the Lp(s, X) for all primitive Dirichlet characters X such that (1) X( -1) = 1 and (2) the conductor of X is a divisor of N. We define the p-adic regulator Rp by replacing the usual log by the p-adic logarithmic function logp' Let h be the class number of F, m = [F: Q], and let d be the discriminant of F. Then the residue of (p(s,F) at S = 1 is ( X(p) ) 2m-1 hRp f]1-- . x P jd Hence (p(s,F) has a simple pole at s= 1 if and only if Rpi=-O. In general, for any totally real finite algebraic number field F, Leopoldt con- jectured that the p-adic regulator Rp of F is not zero (Leopoldt's conjecture). This conjec- ture was proved by J. Ax and A. Brumer for the case when F is an Abelian extension of Q [A4, B7]. By making use of the Stickelberger element, Iwasawa gave another proof of the existence of the p-adic L-function [17]. In particular, he obtamed the following result: Let X be a primi- tive Dirichlet character with conductor f Then there exists a primitive Dirichlet character () such that the p-part of the conductor of 0 is 
450K Zeta Functions either 1 or q and such that the conductor and the order of yO- I are both powers of p. Let 0e be the ring generated over the ring Zp of p- adic integers by the values of O. Then there exists a unique element f(x,O) of the quotient field of oo[[x]] depending only on 0 and satisfying Lp(s, x) = f(((1 + qo)' - 1,0), where qo is the least common multiple of q and the conductor of 0, and (=X(I +qo)-'. Fur- thermore, lwasawa proved that [(x, 0) belongs to oe[[x]] if 0 is not trivial Let P=Q(exp(2rrijq)) and, for any n';: I, let P" = Q(exp(2rrij qp n)). Let P" = UI1?] Pn' Then P y is a Galois extension of Q satisfying Gal(py jQ) Z; (the multiplicative group of p- adic units), and P is the subfield of Pey jQ cor- responding to the subgroup I + qZp of Z; . Let JjJ be a Cp-valued primitive Dirichlet character such that (I) JjJ( -1) = - I and (2) the p-part of the conductor f". of JjJ is either 1 or q Let K. be the cyclic extension of Q corre- sponding to JjJ by class field theory Let K = K,j.'P, Kn=K'Pnand Key =K'P f . Let An be the p-primary part of the ideal class group of Kn' let An-->Am (n';:m) be the mapping in- duced by the trelative norm Nk,!Km' and let X K =limAn. Since each An is a finite p-group, X K is aZp-module. Let V K = X k @zp C p , and let V,= {VE VKI b(v)= JjJ(b)u for all bEGal(KjQ)}. Let qo be the least common multiple of J,p and q, and let Yo be the element of Gal(K.yjK) that corresponds to 1 +qoEl +qZp=Gal(p.yjP) by the restriction mapping Gal(K",jK) c.Gal(Px/P). Let f(x) be the characteristic polynomial of 10  1 acting on V. Hence jop(x) IS an element of O'j.[x]. We assume that wJjJ I is not trivial Let f(x,WJjJ-I) be as before. Then f(X,WJjJ-I) is an element of 0",[[ xJJ Iwasawa conjectured that fOf'(X) and [(x,wJjJ-') coincide up to a unit of OOf'[[X]] (Iwasawa's main conjecture). This con- jecture was proved recently by B. Mazur and A. Wiles in the case where JjJ is a power of w. Let F be a totally real finite algebraic num- ber field, let K be a totally real Abehan exten- sion of F, and let y be a character of Gal(KjF). Let L(s, X) be the t Artin L-function for X. Then we can construct the p-adic analog Lp(s, X) of L(s, X) (J.-P. Serre, J. Coates, W. Sinnott, P Deligne, K. Ribet, P. Cassou-Nogues). But, at present, we have no formula for Lp(l, X) Coates generalized Iwasawa's main conjecture to this case, but it has not yet been proven. p-adic L-functions have been defined in some other cases (e.g.  [K3,Ml,M3]). 1702 K. (-Functions of Quadratic Forms Dirichlet defined a Dirichlet series associated with a binary quadratic form and also consid- ered a sum of such Dirichlet series extended over all classes of binary quadratic forms with a given discriminant D, which is actually equivalent to the Dedekind (-function of a quadratic field Dirichlet obtained a formula for the class numbers of binary quadratic forms The formula is interpreted nowadays as a formula for the class num bers of quadratic fields in the narrow sense. According as the binary quadratic form is definite or indefinite, we apply ditferent methods to obtain its class number. Epstein (-functions P. Epstein get1eralized the definition of the (-function of a positive definite binary quadratic form to the case of n variables (Math. Ann., 56 (1903),63 (1907)). Let V be a real vector space of dimension m with a positive definite quadratic form Q. Let M be a tlattice in V, and put I m 'Q(s, M)= I - Q - ( '- ) -;' Res>- 2 ' XE ,\:[ X ,,-0 ThIs series is absolutely convergent in Re s > mj2, and lim ( S- ) (Q(S' M)=D(M)-'!2rrm/2f (  ) -I, Hm/2 2 2 D(M) =detIQ(xj' xj)l, where XI' ..., X m is a basis of M and Q(x, y) =(Q(x+ y)-Q(x)-Q(y))j2.Ifthe Q(x) (xEM,x#O) are all positive integer;, we can write  x a(n) I.,Q(s, M) = I ----;-, n=l n where a(n) is the number of distinct XE M with Q(x) = n In general, let XI, ..., X m be a basis of the lattice M and xi, .., , x be its dual basis (Q(x"xj)=bij)' Call M*=Ljx,!,Z the dual lattice of M. If we consider the ,9-series (ttheta series) ,9 Q (u, M)= I exp( -rruQ(x)) (Rell>O), XE then ,9 Q (u, M)=(u- m / 2 D(M)-1!2),9 Q (u-', M*) WIth (Qls, M)= rr-T(s)(Q(s, M), the displayed equality leads to the functional equation ( m '\ i' ( s M ) =D ( M ) -1/2'F --s M* I Q , Q 2 ' , ) . In general, 'Q(s, M) has no Euler product expansion. Consider the case where M = L Zx; (x; = 
1703 (0,...,0, 1,0,...,0)), Q(x)=Lr,U;, for x= (u" ., um)' If we put (m(s) = (Q(s, M), L(s)= LJ ( -4/n)I1- S , then we have (,(.1')=2(2.1'), (2(s)=4((s)' L(s)=4 x (the Dedekind (-function of Q(.J=l)), (4(8)=8(1- 2 2 - 2s )((s)((s_ I), (6(.1') = -4(((s)L(s  2)- 4((.1'- 2)L(s)), (8(.1')= 16(1 _2"+2 4 2')((5)((.1'-3), (,o() = (4/5)(((s)L(s - 4) + 4 2 (.1' 4)L(s)) /14 L --;- I'cZ[, II (/1/1) 11#0 -2 :;ds)=(,2'((8)((s5)(2626 ') H 2 <P{fi(0J. where <pi fl( the Dirichlet series corre- sponding to  (T) by the tMellin transform and (T)= z (TI:dl- znW4 with z= e 2nir (m(.) has zeros on the line Res = IT = m/4, given explicitly for m = 4, 8 as follows: m=4 .1'=01 +/ni/log2, /= 1,2, "', m=8: .1'=0 2 + (i/log2)(2/n::tarctan Jl5\, /=0, ::tl,... Regarding the Epstein (-function of binary quadratic forms (Q(8)= I' Q(m, n)-', m.n with Q(x, y)= ax 2 + hx}, +cy2, a, h, CE R, a >0, C >0,  =4ac _h 2 >0, we have the Chowla-Selberg formula (1949) ( ,2 2 'a s -' fi :; Q (s)= 2(2s)a +-- f(s)' 112 x ((2s-1)r(sD) , '2 s + 3 / 2 + ( a'/2;(s)'/2-"'4  112 nnh ) x /;;'J n'- 1T'_2,(n)COS- J f' { nn':2 } x <p'-3/2 exp ---(<p+<p-I) dip, o 2a where ITk(I1I=Ldl"dk and ((.1') is the Riemann (- function By using this formula, we can give another proof of the following result of H Heilbronn. Let h( -) be the class number of the imaginary quadratic field with discrimi- nant -. Then h( - )--> 'i (--> 'iJ)' 450 L Zeta Functions The following generalization of this result was obtained by C. L Siegel [S22J: Let k be a fixed finite algebraic number field Let K be a finite Galois extension of k, and let d = d(K), h = h(K), and R = R(K) be the discriminant of K, the class number of K, and the regulator of K, respectively. We assume that K runs over extensions of k such that [K: kJ/logd-->O, then we have 10g(hR)  logjidl. Siegel (-functions of indefinite quadratic forms: Siegel defined and investigated some (- functions attached to nondegenerate indefinite quadratic forms, which are also meromorphic on the whole complex plane and satisfy certain functional equations [S24]. The case of quadratic forms with irrational algebraic coefficients was treated by Tama- gawa and K G. Ramanathan L. (-Functions of Algebras K. Hey defined the (-function of a tSlmple algebra A over the rational number field Q (M. Deuring [DI OJ) ( 27 Arithmetic of As- sociative Algebras). Consider an arbitrary tmaximal order 0 of A, and let , 1 <'A(s)=I-, Res>l, " N(a)' with the summation taken over all left integral ideals 11 of o. Then SA is independent of the choice of a maximal order o. Let k be the tcenter of A, and put [A: kJ = n 2 . First, SA is decomposed into Euler's infinite product expansion (A(S) = TIp Zp(s) (p runs over the prime ideals of k). For p not dividing the dis- criminant b of A, Zp(s) coincides with the p- component of TIj'::6 (k(ns- j). Hence (A(") coincides with TIjb sdns - j) up to a product of p-factors for pi b which are explicit rational functions of N(p) "'. Moreover, if A is the total matrix algebra of degree r over the division algebra II, then we have (A(s)=TI'j=b(!'(rs-j), and (!'(s) satisfies a functIOnal equation similar to that of Sk(S) (Hey). Also, (A(S) is meromorphic over the whole complex plane, and at s= 1, (1\-1)/ n,..., l/n, it has poles of order 1. Using analy- tic methods, M Zorn (1931) showed that the simple algebra A with center k such that Ap is a matrix algebra over kp for every finite or in- finite prime divisor p of k is itself a matrix algebra over k ( 27 Arithmetic of Associative Algebras D). A purely algebraic proof of this was given by Brauer, H. Hasse, and E. Noether. G. FUJisaki (1958) gave another proof using the Iwasawa- Tate method. As a direct 
450M Zeta Functions application of the (-function, the computation of the residue at s = 1 of (A leads to the form ula containing the class number of maximal order :D. Godement defined the (-function of fairly general algebras [Gl], and Tamagawa inves- tigated in detail the explicit (-functions of division algebras, deriving their functional equations [Tl]. Let A = O Ap be the adele ring of A, and let G = O G p be the idele group (of A). We take a maximal order :Dp of Ap and a maximal com- pact subgroup Up of G p . Let w p be a tzonal spherical function of G p with respect to Up; that is, w p is a function in G p and satisfies wp(ugV)=wp(g) (U,VEU p ), w p (1)=I, r wp(guh)du=wp(g)wp(h). Ju p In addition, we define the weight function CPp on Ap by j the characteristic function of:D p when p is finite, cpp(X) = exp( -nTp(xx*)) when p is infinite, where  is the treduced trace of Ap/R and * is a positive tinvolution. Tamagawa gave an explicit form of the local (-function with the character w p defined by (p(s,w p )= r cpp(g)wp(g-I)INp(g)ldg, JG p where N p is the treduced norm of Ap/kp, and I I p is the valuation of kp. Then w = Op w p is the zonal spherical function of G with respect to 0 Up = U. In particular, if w is a positive definite zonal spherical function belonging to the spectrum of the discrete subgroup f = A* = {all the invertible elements of A} of G, then the Tamagawa (-function with character w is given by (s,w)= Tl (p(s,w p )= r cp(g)w(g-I)llgIISdg, p JG where cp(g) = 0 cpp(gp) and II II is the volume of the element g of G. When A is a division algebra, (s,w) is analytically continued to a meromorphic function over the whole complex plane and satisfies the functional equation. The Tamagawa (-function may also be consid- ered as one type of (-function of the Hecke operator. When A is an indefinite quaternion algebra over a totally real algebraic number field <1>, the groups of units of various orders of A operate discontinuously on the product of complex upper half-planes. Thus the spaces of holomorphic forms are naturally associated with A. The investigation of (-functions asso- 1704 dated with these holomorphic automorphic forms was initiated by M. Eichler arld ex- tended by G. Shimura, H. Shimizu, and others. Eichler investigated the case <1> = Q, and Shimura and Shimizu investigated t he case for an arbitrary totally real field <1> by defining general holomorphic automorphic forms, Hecke operators, and corresponding (- functions. The functional equations of these (- functions were proved by Shimizu. Shimizu generalized Eichler's work and found relations among (-functions of orders of various quater- nion algebras belonging to different discrimi- nants and levels [S 10]. For the related results, see, e.g., the work of K. Doi and H. Naganuma [D 1 2]. M. (-Functions Defined by Hecke Operators The (-functions of algebraic number fields, algebras, or quadratic forms, and the L- functions are all defined by Dirichlet series, are analytically continued to univalent functions on the complex plane, and satisfy functional equations. One problem is to characterize the functions having such properties. (I) H. Hamburger (1921-1922) characterized the Riemann (-function (up to constant multi- ples) by the following three properties: (i) It can be expanded as ((s) = 1:';;'1 an/n' (Res» 0); (ii) it is holomorphic on the complex plane except as s = I, and (s - 1)( (s) is an entire func- tion of finite tgenus; (iii) G(s) = G(I--s), where G(s) = n -s/2 [(s/2)(s). (2) E. Hecke's theory [H4]: Fixing ). > 0, k> 0, }' = :t 1, and putting R(s) = (2n/A) -S[(s)cp(s) for an analytic function cp(s), we make the following three assumptions: (i) (s - k)cp(s) is an entire function of finite genus; (ii) R(s) = 1'R(k - s); (iii) cp(s) can be expanded as cp(s) = 1:::'=1 an/n s (Res>O'o)' Then we call ,p(s) a function belonging to the sign (Jc, k, 1'). The functions ((2s), L(2s), and L(2s -I) satisfy assumptions (i)-(iii), where L may be either a Dirichlet L-function, an L-function with Grossencharakter of an imaginary qua- dratic field, or an L-function with class charac ter of a real quadratic form whose r-factors are of the form [(s/2)[«s + 1 )/2)  [(s). If cp(s) belongs to the sign (A, k, 1'), then n-'cp(s) be- longs to the sign (n)" k, 1'). To each Dirichlet series cp(s) = 1:::'=1 an/n s with the sign (A, k, 1'), we attach the series f(T) = ao + 1:::'=1 ane2nin,/" where a o = 1'(2n/Jc)-k[(k)Res s =dcp(s)) = l' Ress=dR(s)). This correspondence cp(s)-> f(T) may also be 
1705 realized by the tMellin transform R(S)= f oo ( f ane-2nnY!J. ) ys-1 dy o n=l = Loo (J(iy)-ao)yddy, f(iy)-ao=- r R(s)y-Sds. 2m J Res=uo In this case, (i) f(T) is holomorphic in the upper half-plane and f(T + .1)= f(T), (ii) f( -I/T)/( - iTl=yf(T), and (iii) f(x + iy)= O(yconst) (y + 0) uniformly for all x. Conversely, the Dirichlet series cp(s) = L1 ann- s formed by the transformation in the previous paragraph from f(T) satisfying (i)- (iii) belongs to the sign (A, k, y). We also say that the function f(T) belongs to the sign (A,k, y). If k is an even integer, then the functions f(T) belonging to (I, k, (-ll!2) are the tmodular forms of level I and weight k. A necessary and sufficient condition for a function cp(s) belong- ing to (I, k, ( -ll!2) to have an Euler product is that the corresponding modular form f(T) be a simultaneous eigenfunction of the ring formed by the tHecke operators 1;. (n = 1,2,...). In this case, the coefficient an of cp(s) = L ann -. coin- cides with the eigenvalue of 1;.. Namely, if fl Tn =tnf, we have cp(s)=a1(L1 tnn- S ), and this is decomposed into the Euler product cp(s) = a l Op(l- tpP -s + pk-I -2s)-I. We call cp(s)/a l a (-function defined by Hecke operators (Hecke [H5]). For example, (s). (s-k+ 1) and the Ramanujan function 00 I, T(n)n -s = TI (1- T(p)p-' + pll-2S)-1 11=1 P are (-functions defined by Hecke operators. Hecke applied the theory of Hecke operators to study the group f'(N) [H5]; the situation is more complicated than the case of f'(1) = SL(2, Z). The space of automorphic forms of weight k belonging to the tcongruence subgroup fo(N)={(: )ESL(2,Z+=O (mOdN)} is denoted by \Ink(fo(N)). The essential part of \Indfo(N)) is spanned by the functions f(T) = L ane2nin' satisfying the conditions: (1) cp(s) = L ann -s has the Euler product expansion cp(s) = TI (I-app-S)-I pIN x TI (l_a p p- s +pk-I-2S)-I. p{N (2) The functional equation R(s) = yR(k - s) holds, where R(s) = (2n/JN)-S f'(s)cp(s). (3) When X is an arbitrary primitive character of Z such that the conductor f is coprime to N, 450M Zeta Functions then R(s, X) = (2n/JN fJ-sf'(s)I,anx(n)n- S extends to an entire function satisfying the functional equation R(s,X)=wR(k-s,x)(lwl= I) (Shimura). Conversely, (2) and (3) charac- terize the Dirichlet series cp(s) corresponding to f(T)E\Ink(fo(N)) (Weil [WI (1967a)]). Considering the correspondence f(T) = Lanqn->cp(s)=Lann-s not as a Mellin trans- formation but rather as a correspondence effected through Hecke operators, we can derive the (-function defined by Hecke op- erators. When the Hecke operator 1;. is de- fined with respect to a discontinuous group f and we have a representation space \In of the Hecke operator ring %, we denote the matrix of the operation of Tn E % on \In by (Tn) = (1;.)m and call the matrix-valued function Ln(1;.)mn -s the (-function defined by Hecke operators. The equation cp(s)=Lann- s is a specific instance of the correspondence in the first sentence, where f = f'(N), \In c\Ink(fo(N)), dim \In = I. One advantage of this definition is that it may be applied whenever the concept of Hecke operators can be defined with respect to the group f (for instance, even for the Fuchs- ian group without a tcusp). Thus when f is a Fuchsian group given by the unit group of a quaternion algebra <1> over the rational num- ber field Q and \In is the space of automor- phic forms with respect to f, the (-function L(T,,)n- S is defined (Eichler). Moreover, by using its integral expression over the ide Ie group J<I> of <1>, we can obtain its functional equation following the Iwasawa-Tate method (Shimura). Furthermore, by algebrogeometric consideration of 1;., it can be shown that (s)(s-l)det(I,(1;.)6 2 n - S ) =(S)(S-I)det( l}(1-(T p )6 2 P-S + (R p ) 62p l-2S) -I) coincides (up to a trivial factor) with the Hasse (-function of some model of the Riemann surface defined by f when \In is the space 6 2 of all tcusp forms of weight 2 (Eichler [EI], Shimura [SI2]). The algebrogeometric meaning of det((1;.)6, n -S), when \In is the space 6k of all cusp forms of weight k, has been made clear for the case where f is obtained from fo(N), f'(N), and the quaternion algebra (M. Kuga, M. Sato, Shimura, Y. Ihara). From these facts, it becomes possible to express (T p )6" the decom- position of the prime number p in some type of Galois extension (Shimura [SI4], Kuga), in terms of Hecke operators. These works gave the first examples of non-Abelian class field 
450N Zeta Functions theory Note that this type of (-function may be regarded as the analog (or generalization) of L-functions of algebraic number fields, as can be seen from the comparison in Table I. Table 1 Algebraic Ideal number k Character X Lx(n)n -, group field 1 1 1 I Algebraic Hecke Representation G ring space 1]]1 L(Tnhnn -, group As for special values of (-functions defined by Hecke operators, the following fact is known: Let [(T)= L a n q"E'JJl k (SL(2, Z)) be a common eigenfunction of the Hecke operators, and let <pIs) = Lan" ., be the corresponding irichlet series Let KJ he the field generated over the rational number field Q by the coeffi- cients an of f. Then, for any two integers m and m' satisfying 0 < m, m' < k and m == m' (mod 2), the ratio (R(m) R(m')) of the special values of r(s) f J , t d R(s)=--!p(s)= (f(iy)-ao)Y - Y (21f)' ° at m and 111' belongs to the field K f' G. Shimura discovered this fact for Rama- nUjan's function (T) (J Math Soc Japan, II (1959)), and then Yu. I. Manin generalized it to the above case and, by constructing a p-adic analog of !p(s) from it, pointed out the impor- tance of such results [M I]. R. M. Oamerell also used such results to study special values of Hecke's L-function with Grosscncharakter of an imaginary quadratic field (Acta Arith., 17 (1970),19 (1971)). Furthermore, Shimura generalized these results to congruence sub- groups of SL(2, Z) (Coml11. Pure Appl. Math., 29 (1976)), and to Hilbert modular groups (Ann. Math., 102 (1975)). The connectIOn between special values of (-functions and the periods of integrals has been studied further by Shimura, Oeligne, and others. In addition, in connection with nonholo- morphic automorphic forms H Maass consid- ered L-functions of real quadratic fields (with class characters) having f(s/2)2 or r«(s + 1 )/2)2 as r-factors. Furthermore, T. Kubota studied the relation of (-functions (k(S) of an arbitrary algebraic field k or (-functions of simple rings to (nonanalytic) automorphic forms of several variables and considered the reciprocity law for the Gaussian sum from a new viewpoint. N. L-Functions of Automorphic Representations (I) R. P. Langlands reconstructed the theory of tHecke operators from the viewpoint of repre- 1706 sentation theory. and defined very general L- functions. He proposed many conjectures about them in [L4], and he and H. Jacquet proved most of them in [J I J for the case G =GL 2 . First Langlands defined the L-group LG for any connected reductive algebraic group G defined over a field k in the following manner [B6J There is a canonical bijection between iso- morphism classes of connected treductive alge- braic groups defined over a fixed algebraically closed field k and isomorphism classes of troot systems. It is defined by associating to G the root data 'P(G) = (X*(T), <1>, X *(T), <1>"), where T is a tmaximal torus of G, X*(T) (X*(T)) the group of characters (t I-parameter subgroups) of T, <1> (<1>V) the set of roots (co roots) of G with respect to T. Since the choice of a tBorel subgroup B of G containing T is equivalent to that of a basis Ll of <1>, the aforementioned bijecti,Jn yields one between isomorphism classes of triples (G, B, T) and isomorphism classes of based root data 'Po(G)=(X*(T), Ll, X*(T), '"). There is a split exact seq uence 1-. lnt G-.Aut G-+Aut 'Po (G)-+ I and this mapping induces a canonical bijection Aut 'Po(G)Aut(G, B, T, {X'}>E") if X,E G, (iXE) are fixed. Let G be a connected reductive algebraic group defined over k. Let T be a maximal torus of G, and let B be a Borel subgroup of G containing T. Let 'Po(G)=(X*(T),, X*(T), N) be as before. Then there is a connected reduc- tive algebraic group LGO over C sllch that 'Po(G)V =(X*(T), N, X*(T),) corresponds to the triple (LGO, LBo, LTo), where LBo and LT o are a Borel subgroup of LG O and the maxi- mal torus of LBo. For example, (I) if G = GL n , then LGO = GL,,(C); (2) if G = SP2n' then LGO = S02n+ dC). We assume that k is the algebraic closure of k and G is defined over k. Then }' E Gal(k/k) induces an automorphism of the h-group G x kk. Hence I' defines an elemenr of Aut(LGO, LBo, LTo) because it is a holomorphic image of Aut'Po(G x kk)= Aut'Po(G x kkr. Hence we can define the tsemidirect product LG=LGO xIGal(k/k), and call it the L-group of G. Let k be a tlocal field, and let G be a connec- ted reductive algebraic group defi [led over k We identify G with the group of its k-rational points Let W{ be the Weil-Oeligne group of k ( Section H), and let <1>(G) be the set of homomorphisms <p: W{-.LG over Gal(k/k). Let Il(G) be the set of infinitesimal equivalence classes of irreducible admissible representations of G. If k is a non-Archimedean field, then 
1707 n(G) is the set consisting of equivalence classes of irreducible representations 1[: G-->Aut V on complex vector spaces V such that the space V K of vectors invariant by K is finite dimen- sional for every compact open subgroup K of G and such that V= U vK, where K runs over the compact open subgroups of G. If k is an Archimedean field, then mG) is the set consist- ing of equivalence classes of representations 1[ of the pair (r!, K) of the Lie algebra r! of G and a maximal compact subgroup K satisfying similar conditions [B6J Then Langlands conjectured that we can parametrize Il(G) by <D(G) as mG)= U,pmG)",. Let nEn(G),p (41E<D(G)), and let r be a representation ofLG. Then we can define the L-function L(s, 1[, r) and the £-factor [;(s, 1[, r) of 1[ by L(s, n, r) = L(s, r 0 (p), f.(s, 1[, r) = f.(s, r 0 41, 1jJ), where the right-hand sides are those of the Weil-Deligne group ( Section H) and IjJ is a nontrivial character of k Let G be a connected reductive group over a global field k (i.e., an algebraic number field of finite degree or an algebraic fu nction field of one variable over a finite field), let 1[ be an Iffeducible admissible representation of G A , where G A is the group of rational points of G over the tadele ring k A of k, and let r be a finite-dimensional representation of LG. Let IjJ be a nontrivial character of k A which is trivial on k. For any place v of k, let r, be the repre- sentation of the L-group of G" = G x k k" in- duced by r, and let 1/1" be the additive character of k, associated with 1/1. It is known that 1[ is decomposed into the tensor product @1[" of 1["EJl(G(k,,)) [B6]. Hence we put L(s, 1[, r) = Tl L(s, 1[" r,), [;(s, n, r) = Il [;(s, n,., r,.). The local factor L(s, 1[", r,) is in fact defined if v is Archimedean, or G is a itorus, or 41 is unramified (i.e., G" is quasisplit and splits over an unramified extension of k" and G(o,,) is a special maximal compact subgroup of G(k,,), and 1[" is of class one with respect to G(o,,), where 0" is the integer ring of k,,) It foI1ows that the right-hand side n L(s, 1[", r,) is defined up to a finite number of non-Archimedean places r Furthermore, Langlands proved that n £(s, 1[", r,.) is in fact a finite product, and the infinite product n L(s, 1[", r,,) converges in some right half-plane if 1[ is automorphic (i.e, if 1[ is a subquotient of the right regular representa- tion of G A in G k \ G A)' It is conjectured that L(s, 1[, r) admits a meromorphic continuation to the whole complex plane and satisfies a functional equation L(s, 1[, r) = [;(s, 1[, r) L( I  s, ii, r) 4500 Zeta Functions if 1[ is automorphic, where ii is the tcontragre- dient representation of n. Furthermore, if G = GLn and r is the standard representation of GL n , then we can construct L(s, 1[, r) and [;(s, 1[, r) by generalizing the Iwasawa- Tate method. We can also show in this case that L(.\, 1[, r) is entire if 1[ is cuspidal. The conJec- tures are studied in some other cases [B6J O. L-Functions of Automorphic Representa tions (II) A Weil generalized the theory of tHecke operators and the corresponding L-functions to the case of tautomorphic forms (for holomor- phic and nonholomorphic cases together) of GL 2 over a global field [W9]. Then H. Jacquet and Langlands developed a theory from the viewpoint of +representation theory [J \,12]. They attached L-functions not to automorphic forms but to tautomorphic representations of GLz(k). Let k be a non-Archimedean local field, and let Ok be the maximal order of k. Let .if k be the space of functions on G k = GL 2 (k) that are locally constant and compactly supported Then .Yf k becomes an algebra with the convol- ution product (11 * 12)(h) = L, [1 (g)fz(g -1 h)dg, where dg is the tHaar measure of G k that assIgns I to the maximal compact subgroup K, = GL 2 (od. Let 1[ be a representation of ,Y(, on a complex vector space V Then we say that 1[ is admissible if and only if 1[ satisfies the following two conditions: (1) For every v in V, there is an [in ,Yf k so that n( 1)1' = v, (2) Let fJ, (i = I, ....r) be a family of inequivalent irreduc- ible finite-dimensional representations of K k , and let r (g)= I dim(fJi)-ItrfJi(g-I) i=1 Then  is an idempotent of 'Y(k We call such a  an elementary idempotent of Y(k Then for every elementary idempotent  of Y(k, the operator n() has a finite-dimensional range. If 1[ is an admissible representation of Gldk) ( Section N), then 1[(n= f l(g)1[(g)dg (jE.Yf k ) G, gives an admissible representation of Y(k in this sense. Furthermore, any admissible repre- sentation of 'Y(k can be obtained from an ad- mIssible representation of GL 2 (k) Let k be the real number field. Let .If 1 be the 
4500 Zeta Functions space of infinitely differentiable compactly supported functions on Gd = GLz(k)) that are K k ( = 0(2, k)) finite on both sides, let ft z be the space of functions on Kk that are finite sums of matrix elements of irreducible represen- tations of Kb and let Yl'k = Yr\ EB ,#'z. Then .it l , .ff z , and '#'k become algebras with the convolution product. Let n be a representation of Yl'k on a complex vector space V. Then n is admissible if and only if the fo11owing three conditions are satisfied: (1) Every vector v in V is of the form V=I:;=I nCt;)Vi with t;EYl'1 and ViE V; (2) for every elementary idempotent (g) = I:;I dim (O"if I trO"i(g-I), where the O"i are a family of inequivalent irreducible repre- sentations of Kb the range of n() is finite- dimensional; (3) for every elementary idempo- tent  of '#'k and for every vector v in n() V, the mapping fl--->n(J)v of Yl',  mto the finite- dimensional space n() V is continuous. We can define the Hecke algebra '#'k and the no- tion of admissible representations also in the case k = C. I n these cases, an admissible repre- sentation of ft k comes from a representation of the tuniversal enveloping algebra of GLz(k) but may not come from a representation of GLz(k). It is known that for any local field k, the tcharacter of each irreducible representa- tion is a 10ca11y integrable function. Let k be a global field, G k = GLz(k), and let G A =GLz(k A ) be the group of rational points of G k over the adele ring k A of k. For any place v of k, let k,. be the completion of k at v, let G" =GLz(k v ), and let kv be the standard maximal compact subgroup of G,,, Let Yl'v be the Hecke algebra Yl'k, of G v , and let <'v be the normalized Haar measure of Kv. Then 8v is an elementary idempotent of Yl'V' Let Yl' = @".Yl'v be the restricted tensor product of the local Hecke algebra Yl;. with respect to the family {8v}. We ca11 Yl' the global Hecke algebra of GA' Let n be a representation of Yl' on a com- plex vector space V. We define the notion of admissibility of n as before. Then we can show that, for any irreducible admissible represen- tation n of Yl' and for any .place v of k, there exists an irreducible admissible representation n" of Yl'v on a complex vector space V v such that (1) for almost a11 v, dim v,.K, = 1 and (2) n is equivalent to the restricted tensor product @ n,. of the n" with respect to a family of nonzero XvE V v K ". Furthermore, the factors {nv} are unique up to equivalence. Let k be a local field, let 1/1 be a nontriv- ial character of k, and let Yl'k be the Hecke algebra of G k = GLz(k). Let n be an infinite- dimensional admissible irreducible represen- tation of Yl'k' Then there is exactly one space W(n, 1/1) of continuous functions on G k with the following three properties: (1) If W is in 1708 W(n, 1/1), then for all 9 in G k and for all x in k, W(G 7)g)=I/1(X)W(g); (2) W(n, 1/1) is invariant under the right trans- lations of Yl'b and the representation on W(n, 1/1) is equivalent to n; (3) if k]s Archi- medean and if W is in W(n,I/1), then there is a positive number N such that w{ }=O(ltIN) as I t 1-> ex) . We ca11 W (n, 1/1) the Whittaker model of n. The Whittaker model exists in the global case if and only if each factor nv of n = @ nv is infinite-dimensional. Let k be a local field, and let n be as before. Then the L-function L(s, n) and the 8-factor <.(s, n, 1/1) are defined in the fo11owing manner: Let w be the quasicharacter of k' (i.e., the continuous homomorphism k x ->C X) defined by n(( }=W(a)idv. Then the tcontragredient representation it of n is equivalent to w -1 @ n. For any 9 in G k and Win W(n, 1/1), let 'P(g, s, W) = r W( ( a Jk x ° '1'(g,s, W)= r W( ( a Jk X ° )g)laIS-I/Z d X a, O ) g)lals-I/Zw -1 (a) d x a 1 . Then there is a real number So such that these integrals converge for Re(s) > So for any 9 E G k and WE W(n, 1/1). If k is a non-Archimedean local field with Fq as its residue field, then there is a unique factor L(s, n) such. that L(s, n)-I is a polynomial of q-S with constant term 1, <1>(g, s, W) = 'P (g, s, W)/ L( s, n) is a holomorphic function of s for all 9 and W, and there is at least one Win W(n, 1/1) so that <1>(e, s, W) = as with a positive constant a. If k is an Archimedean local field, then we can define the gamma factor L(s, n) in the same manner. Furthermore, for any local field k, if <1>(g, s, W)= '1'(g, s, W)/L(s, it), then there is a unique factor 8(S, 1/1, n) which, as a function of s, is an exponential such that _ ( 0 <1>( -1  )g, 1 - s, W) = 8(S, n, 1/1)'1>(g, s, W) for allgEG k and WE W(n, 1/1). Let nand n' be two infinite-dimensional irreducible admissible representations of G k . Then nand n' are equivalent if and only if the 
1709 quasicharacters wand w' are equal and L(I- S, [1 @ ii)f.(S, x@n, 1/1) -- L(s,x@n) L(I-s, X-I @ii')f.(s,x@n', 1/1) L(s, X @ n') holds for any quasicharacter X. In particular, the set {L(s, X @ n) and f.(s, X @ n, 1/1) for all X} characterizes the representation n. Let k be a global field, Gk=GLz(k), G A = GLz(k A ), and let K A = IT Kv be the standard maximal compact subgroup of GA' Then the tglobal Hecke algebra :If acts on the space of continuous functions on Gk\G A by the right translations. Let cp be a continuous function on G k \ G A- Then cp is an automorphic form if and only if (1) cp is K A -finite on the right, (2) for every telementary idempotent  in :If, the space (.YI')cp is finite-dimensional, and (3) cp is slowly increasing if k is an algebraic number field. An automorphic form cp is a cusp form if and only if r cp( (  '; ) 9)dX=0 J k\k A for all 9 in GA' Let co1 be the space of automor- phic forms on G k \ G A, and let cola be the space of cusp forms on G k \ G A- They are :If-modules. Let 1/1 = IT 1/1" be a nontrivial character of k\k A , and let n be an irreducible admissible repre- sentation n = 0 v nv of the glo bal Hecke al- gebra :If = 0", :lfv. If n is a tconstituent of the :If-module .01, then we can define the local factors L(s, nv) and f.(s, n,,, I/1v) for all v, al- though n,. may not be infinite-dimensional. Further, the infinite products L(s, n) = IT L(s, n,.) and L(s, ii) = IT L(s, iivJ converge absolutely in a right half-plane, and the functions L(s, n) and L(s, ii) can be analyti- cally continued to the whole complex plane as meromorphic functions of s. If n is a constitu- ent of ,01 0 , then all nv are infinite-dimensional, L(s, n) and L(s, ii) are entire functions, and n is contained in .01 0 with multiplicity one. If k is an algebraic number field, then they have only a finite number of poles and are bounded at infinity in any vertical strip of finite width. If k is an algebraic function field of one variable with field of constant F q' then they are rational functions of q-s. In either case, f.(s, n v , 1/1 v) = 1 for almost an v, and hence f.(s, n) = IT f.(s, n v , 1/1 v) is wen defined. Furthermore, the functional equation L(s, n) = f.(s, n) L(1 - s, ii) is satisfied. 450P Zeta Functions As for the condition for n being a constitu- ent of cola, we have the following: Let n= 0 nv be an irreducible admissible representation of ,ff. Then n is a constituent of cola if and only if (1) for every v, nv is infinite-dimensional; (2) the quasicharacter'1 defined by n(( }='1(a)id. is trivial on k x; (3) n satisfies a certain con- dition so that, for any quasicharacter w of p\kr, L(s,w@n)=IT L(s,wv@n v ) and L(s, w- 1 @ii v )= IT L(s, W;1 @ iiv) converge on a right half-plane; and (4) for any quasicharacter w of k x \kr, L(s, w@ n) and L(s, w -1 @ ii) are entire functions of s which are bounded in vertical strips and satisfy the functional equation L(s,w@n)=f.(s,w@n)L(I-s,w- 1 @ii). P. Congruence (-Functions of Algebraic Function Fields of One Variable or of Algebraic Curves Let K be an talgebraic function field of one variable over k = F q (finite field with q ele- ments). The (-function of the algebraic function field K/k, denoted by (K(S), is defined by the infinite sum L(N(lrS, where the summation is over all integral divisors W of K/k and where the norm N(I) equals qdeg(I). Equivalently, (K(S) is defined by the infinite product IT p ( I - N(prS)-I, where p runs over all prime divi- sors of K/k. By the change of variable U = q -', (K(S)=ZK(U) becomes a formal power series in u. (K(S) and Z K(U) are sometimes called the congruence (-functions of K/k. The fundamental theorem states that (i) (Rationality) ZK(U) is a rational function ofu of the form ZK(u)=P(u)/(I-u)(l-qu), where P(U)E Z[uJ is a polynomial of degree 2g, 9 being the genus of K; (ii) (Functional equation) ZK(U) satisfies the functional equation ZK(I/u)=qQ-1 U Z - Zg ZK(U/q); and (iii) if P(u) is decomposed into linear fac- tors in C[u]: P(u)=ITf(idl-Cl:;u), then all the reciprocal roots CI:; are complex numbers of absolute value ;q. Statement (iii) is the analog of the Riemann hypothesis because it is equiva- lent to saying that all the zeros of (K(S) = Z K(q -S) lie on the line Re s = 1/2. The congruence (-function was introduced by E. Artin [AI (1924)J as an analog of the Riemann or Dedekind (-functions. Of its fun- damental properties, the rationality (i) and the functional equation (ii) were proven by 
450Q Zeta Functions F. K. Schmidt (193 I), using the tRiemann- Roch theorem for the function field Kjk. The Riemann hypothesis (iii) was verified first in the elliptic case (g= I) by H. Hasse [HI] and then in the general case by A. Weil [W2 (1948)]. For the proof of (iii), it was essential to consider the geometry of algebraic curves that correspond to given function fields. Let C be a nonsingular complete curve over k with function field K. Then ZK(U) coincides with the (-function of Cjk, denoted by Z(u, C), which is defined by the formal power series exp(L_1 Nmumjm). Here N m is the number of rational points of C over the extension k m of k of degree m. The rationality of ZK(U) is then equivalent to the formula Zq N m = I +qm_ I rxi (mEN), i1 and the Riemann hypothesis for Zdu) is equivalent to the estimate (*) INm-l-qml 2g qm!Z (mEN). Now if F is the qth power morphism of C to itself (the Frobenius morphism of C relative to k), then an important observation is that N m is the number of fixed points of the mth iterate F m of F. In other words, N m is equal to the intersection number of the graph of F m with the diagonal on the surface C x C, and is re- lated to the "trace" of the Frobenius corre- spondence. Then (*) follows from tCastel- nuovo's lemma in the theory of correspon- dences on a curve. This is WeiJ's proof of the Riemann hypothesis in [W2]; compare the proof by A. Mattuck and J. Tate (Abh. Math. Sem. Hamburg 22 (1958)) and A. Grothendieck (J. Reine Angew. Math., 200 (1958)) using the Riemann-Roch theorem for an algebraic surface. On the other hand, let J be the tJacobian variety of Cover k. For each prime number I different from the characteristic of k, let Mt(rx) denote the t/-adic representation of an endo- morphism rx of J obtained from its action on points of J of order In (n = 1,2,...). Letting n be the endomorphism of J induced from F (which is the same as the Frobenius morphism of J), we have P(u) = det( 1- Mt(n)u), i.e., the numerator of the (-function coincides with the characteristic polynomial of Mt(n). In this setting, the Riemann hypothesis is a conse- quence of the positivity of the Rosati anti au to- morphism [EI]. This is the second proof given by Weil [W2], and applies to arbitary Abelian varieties. Recently E. Bombieri, inspired by Stepa- nov's idea, gave an elementary proof of (*) using only the Riemann-Roch theorem for a curve (Sbn. Bourbaki, no. 430 (1973)). 1710 Q. (-Functions of Algebraic Varieties over Finite Fields Let V be an algebraic variety over the finite field with q elements Fq, and let N m be the number of Fq m-rational points of V. Then the (-function of V over Fq is the formal power series in Z[[u]] defined by Z(u, v)=expCt NmUmjm} alternatively it can be defined by the infinite product I1p(l_udegp)-" where P runs over the set of prime divisors of V and deg P is the degree of the residue field of P over Fq (in other words, P runs over prime ra1 ional 0- cycles of V over Fq). Weil Conjecture. In 1949, the following prop- erties of the (-function were conjectured by Weil [W3]. Let V be an n-dimensi,;:mal com- plete nonsingular (absolutely irreducible) variety over Fq. Then (1) Z(u, V) is a rational function of u. (2) Z(u, V) satisfies the functional equation Z((qnU)"-I, V)= i:qnx!ZuXZ(u, V), where the integer X is the intersection number (the degree of v' v) of the diagorlal sub- variety v with itself in the product V x V, which is called the Euler-Poincare character- istic of V. (3) Moreover, we have PI (u). P 3 (u). ... . P Zn -' (u) Z(u, V)= , Po(u)' Pz(u). ... . Pzn(u) where Ph(u) = I11 (I - rx)h)u) is a polynomial with Z-coefficients such that rx)hi are alge braic integers of absolute value qh!Z (0h2n); the latter statement is the Riemann hypothesis for VjFq. (4) When V is the reduction modp of a complete nonsingular variety V* of character- istic 0, then the degree Bh of Ph(u) is the hth Betti number of V* considered as a complex manifold. This conjecture, called the Weil conjecture, has been completely proven. To give a brief history. first B. Dwork [D 13] proved the rationality of the (-function for any (not neces- sarily complete or nonsingular) variety over Fq. Then A. Grothendieck [A3, G2, G3] devel- oped the I-adic etale cohomology theory with M. Artin and others, and proved the above statements (1 )-(4) (except for the Riemann hypothesis) with Ph(u) replaced by some Ph.t(U)EQt[U]; and S. Lubkin [L7] obtained similar results for liftable varieties Finally Deligne [D4] proved the Riemann hypothesis and the independence of I of Ph.t(U). More details will be given below. Before the final solution for the general case was obtained, the 
1711 conjecture had been verified for some special types of varieties. For curves and Abelian varieties, its truth was previously shown by Weil ( Section P). In the paper [W3] in which he proposed the above conjecture, Weil verified it for Fermat hypersurfaces, i.e., those defined by the equation aox;;> +... + a n + 1 X:;'+l =0 (aiEFqX); in this case, the (-function is of the form P(u)(-I)"+\!njo(1-qju) with a poly- nomial P(u) that can be explicitly described in terms of Jacobi sums. Dwork [DI4] studied by p-adic analysis the case of hypersurfaces in a projective space, verifying the conjecture for them except for the Riemann hypothesis. Fur- ther nontrivial examples were provided by tK3 surfaces (Deligne [D2], Pyatetskii-Shapiro, Shafarevich [PI]) and cubic 3-folds (E. Bom- bieri, H. Swinnerton-Dyer [B5]); in these cases the proof of the Riemann hypothesis was reduced to that of certain Abelian varieties naturally attached to these varieties. It can be said that the Weil conjecture has greatly in- fluenced the development of algebraic geome- try, as regards both the foundations and the methods of proof of the conjecture itself; see the expositions by N. Katz [K2] or B. Mazur [M2]. Weil Cohomology, I-Adic Cohomology. The Weil conjecture suggested the possibility of a good cohomology theory for algebraic varieties over a field of arbitrary characteristic. We first formulate the desired properties of a good cohomology (S. Kleiman [K4]). Let k be an algebraically closed field and K a field of characteristic 0, which is called the coefficient field. A contravariant functor V --> H*(V) from the category of complete connected smooth varieties over k to the category of augmented Z + -graded finite-dimensional anticommuta- tive K -algebras (cup product as multiplication) is called a Weil cohomology with coefficients in K if it has the following three properties. (1) Poincare duality: If n = dim V, then a canonical isomorphism H 2n ( V)  K exists and the cup product Hj(V) X H2n-j(V)-->H211(V)K in- duces a perfect pairing. (2) Kiinneth formula: For any VI and V 2 the mapping H*(vd @ H*(V2)-->H*(VI x V 2 ) defined by a @ b--> Projf(a)' Proj!(b) is an isomorphism. (3) Good relation with algebraic cycles: Let Cj(V) be the group of algebraic cycles of codimension j on V. There exists a fundamental-class homomor- phism FUND: Cj(V)-->H 2 j(V) for all}, which is functorial in V, compatible with products via Kiinneth's formula, has compatibility of the intersection with the cup product, and maps O-cycleECn(V) to its degree as an ele- ment of K :::,:H 2n (v). If a Weil cohomology theory H exists for the V's over k, we can 450Q Zeta Functions prove the Lefschetz fixed-point formula: ((graph of F)' (diagonal))v x v 211 = I (-1)jtr(F* I Hj(V)) jO for a morphism F: V --> V. Irk = C (the field of complex numbers), the classical cohomology V -->H*(v an , Q), where van denotes the complex manifold associated with v: gives a Weil cohomology. Ifk is an arbitrary algebraically closed field and if I is a prime number different from the characteristic of k, then the principal results in the theory of the etale cohomology state that the I-adic cohomology V-->Htt(V, Q/) is a Weil coho- mology with coefficient field Q/ (the field of 1- adic numbers) [A3, D5, G3, M4]. In defining this, Grothendieck introduced a new concept of topology, which is now called Grothendieck topology. In the etale topology of a variety V, for example, any etale covering of a Zariski open subset is regarded as an "open set." With respect to the etale topology, the cohomology group H*(V, Zjn) of V with coefficients in Zjn is defined in the usual manner and is a finite Zjn-module. If I is a prime number as above, limvH*(V,ZjIV) is a module over Z/=limvZ/l" ;r finite rank, and <- Htt(V, Q/) =(limvH*(V, ZjIV))@ z\Q/ <- defines the I-adic cohomology group, giving rise to a Weil cohomology. For the characteristic p of k, p-adic etale cohomology does not give Weil cohomology; but the crystalline cohomology (Grothendieck and P. Berthelot [B2, B3]) takes the place of p- adic cohomology and is almost a Weil coho- mology: in this theory the fundamental class is defined only for smooth subvarieties. Now fix a Weil cohomology for k = Fq, an algebraic closure of a finite field Fq. Given an algebraic variety V over Fq, let V = V@ 1< denote the base extension of V to 1<; then Fqrn- rational points of V can be identified with the fixed points of the mth iterate of the Frobenius morphism F of V relative to Fq. Then the Lefschetz fixed-point formula implies the rationality of Z(u, V); more precisely, letting (u) = det(1- uF* I Hj(V)) be the characteristic polynomial of the automorphism F* of H j( V) induced by F, we have 2n Z(u, V)= TI (url)j+'. jO The functional equation of the (-function then follows from the Poincare duality. This proves (1), (2), and a part of (3) in the statement of the Weil conjecture. Further, in the case of I-adic cohomology, (4) means that deg(u)= 
450R Zeta Functions dim Q ,Hj(i7, Q/) is equal to the jth Betti num- ber of a lifting of V to characteristic 0; this follows from the comparison theorem of M. Artin for the /-adic cohomology and the class- ical cohomology, combined with the invariance of /-adic cohomology under specialization. Proof of the Riemann Hypothesis. In 1974, Deligne [D4, I] completed the proof of the Weil conjecture for projective nonsingular varieties by proving that, given such a V over Fq, any eigenvalue of F* on Ht(i7, Q/) is an algebraic integer, all the conjugates of which are of absolute value qj/2. (This implies that (u)=det(1-uF*IHt (i7,Q/)) is in Z[u] and is independent of /.) The proof is done by induction on n = dim V; by the general results in /-adic cohomology (the weak Lefschetz theorem on a hyperplane section, the Poin- care duality, and the Kiinneth formula), the proof is reduced to the assertion that (*) any eigenvalue ex of F* on Ht(i7, Q/) is an alge- braic integer such that lex'i  q(n+1)/2 for al1 conjugates ex' of ex. The main ingredients in proving (*) are (1) Grothendieck's theory of L- functions, based on the eta Ie cohomology with compact support and with coefficients in a Q/- sheaf [G2, G3]; (2) the theory of Lefschetz pencils (Deligne and Katz [D7]), and the Kajdan-Margulis theorem on the monodromy of a Lefschetz pencil (1. L. Verdier, Sem. Bour- baki, no. 423 (1972)); and (3) Rankin's methods to estimate the coefficients of modular forms, as adapted to the Grothendieck's L-series. By means of these geometric and arithmetic tech- niques, Deligne achieved the proof of the Riemann hypothesis for projective nonsingular varieties. For the generalization to complete varieties, see Deligne [D4, II]. Applications of the (Verified) Weil Conjecture. (1) The Ramanujan conjecture ( 32 Auto- morphic Functions D): The connection of this conjecture and the Weil conjecture for certain fiber varieties over a modular curve was ob- served by M. Sato and partially verified by Y. Ihara [II] and then established by Deligne [D3]. The Weil conjecture as proven above implies the truth of the Ramanujan conjecture and its generalization by H. Petersson. (2) Estimation of trigonometric sums: Let q be the power of a prime number p. Then I 2ni I I exp-trF 'F (F(Xb ...,X n )) (Xi> ,Xn)EF; P ql P (d_l)"qnI2, where F(X 1 , "', Xn)EFq[X b "', Xn] is a poly- nomial of degree d that is not divisible by p, and the homogeneous part of the highest degree of F defines a smooth irreducible 1712 hypersurface in pn-I. This is a generalization of the Weil estimation of the Kloosterman sum ([D4, WI (1948c)];  4 Additive Number Theory D). (3) The hard Lefschetz theorem: Let LE H2( V) be the class of a hyperplane section of an n-dimensional projective nonsingular variety V over an algebraically closed field. Then the cup product by Li:Hn-i(V)-->Hn+i(V) is an isomorphism for all i  n. Deligne [D4, II] proved this for /-adic cohomology, from which N. Katz and W. Messing [Kl] deduced its validity in any Weil cohomology or in the crystalline cohomology. Also some geometric properties of an alge- braic variety V are reflected in the properties of Z(u, V). The (-function Z(u, A) of an Abelian variety A determines the isogeny class of A [T4]. For any algebraic integer ex, every conju- gate of which has absolute value q1/2, there exists an Abelian variety A/Fq such that ex is a root of det(l-uF* I H1(A))=0 [H6]. J. Tate [T3] conjectured that the rank of the space cohomology classes of algebraic cycles of codimension r is equal to the order of the pole at u= l/qr of Z(u, V). This conjecture is still open but has been verified in certain nontrivial cases, e.g., (1) products of curves and Abelian varieties, r= 1 (Tate [T4]), (2) Fermat hyper- surfaces of dimension 2r with somt: condition on the degree and the characteristic (Tate [T3], T. Shioda, Proc. Japan Acad. 55 (1979)), and (3) elliptic K3 surfaces, r= 1 (M. Artin and Swinnerton-Dyer, lnventiones Math. 20 (1973)). R. (- and L-Functions of Schemes Let X be a tscheme of finite type over Z, and let IX I denote the set of closed points of X; for each xEIXI, the residue field k(x) is finite, and its cardinality is called the norm N(x) of x. The (-function of a scheme X is defined by the product ((s,X)=I1xElxl(1-N(x)-yl. This converges absolutely for Res>dimX, and it is conjectured to have an analytic continuation in the entire s-plane (Serre [S7]). It reduces to the Riemann (resp. Dedekind) (-function if X =Spec(Z) (resp. Spec(o), 0 being the ring of integers of an algebraic number field), and to the (-function Z(q-S,X) ( Section Q) if X is a variety over a finite field Fq. The case of varieties defined over an algebraic number field is discussed in Section S. Let G be a finite group of autornorphisms of a scheme X, and assume that the quotient Y =X/G exists (e.g., X is quasiprojective). For an element x in I X I, let y be its image in I YJ, and let D(X)={gEGlg(x)=x}, the decompo- sition group of x over y. The natural mapping D(x)-->Gal(k(x)/k(y)) is surjective, and its 
1713 kernel I(x) is called the inertia group at x. An element of D(x) is called a Frobenius element at x if its image in Gal(k(x)/k(y)) corresponds to the N(y)th-power automorphism of k(x). Now let R be a representation of G with char- acter X. The Artin L-function L(s, X, X) is de- fined by L(S,X,x)=ex p ( I I X(yn)N(y)-ns/n ) YEI YI n1 = n det(I-R(F y )N(y)-S)-I, YEIYI where X(yn) denotes the mean value of X on the nth power of Frobenius elements Fx at x (x any point of IXI over y), and similarly R(Fy) denotes the mean value of R(Fx); it converges absolutely for Res >dim X. Again this is re- duced to the usual Artin L-function ( Sec- tion G) if X is the spectrum of the ring of integers of an algebraic number field. The Artin L-functions of a scheme have many formal properties analogous to those of Artin L-functions of a number field (Serre [S7]). Let us consider the case where X is an alge- braic variety over a finite field Fq and ele- ments of G are automorphisms of X over F 2 ; in this case, L(s, X, X) is a formal power series in u=q-S, which is called a congruence Artin L-function. For the case where X is a complete nonsingular algebraic curve and X is an irre- ducible character of G different from the trivial one, Weil [W2] proved that L(s, X, X) is a polynomial in u=q-S; thus the analog of tArtin's conjecture holds here. More generally, for any algebraic variety X over Fq, Grothen- dieck [G2, G3] proved the rationality of L-functions together with the alternating product expression by polynomials in u, as in the case of (-functions, by the methods of 1- adic cohomology. Actually, Grothendieck treated a more general type of L-function associated with I-adic sheaves on X, which also play an important role in Deligne's proof of the Riemann hypothesis ( Section Q). S. Hasse (-Functions F or a nonsingular complete algebraic variety V defined over a finite algebraic number field K, let  be the reduction of V modulo a prime ideal p of K, K p be the residue field of p, and Z(u, ) be the (-function of  over Kp. The (-function ((s, V) of the complex variable s, determined by the infinite product (exclud- ing the finite number of p's where  is not defined), (s, V)= n Z(N(p)-S, V p ), p is called the Hasse (-function of V over the algebraic number field K. For this function, 450 S Zeta Functions we have Hasse's conjecture [W4]: ((s, V) is a meromorphic function over the whole complex plane of s and satisfies the functional equation of ordinary type. Sometimes it is more natural to consider (j(s, V)= n IJ(N(p)-S, )-1 (0j2dim V), p where IJ(u, ) is the jth factor of Z(u, ), and we have a similar conjecture for them. For the definition of US, V) taking into account the factors for bad primes and the precise form of the conjectural functional equation, see Serre [S8]. Note that (i(s, V) converges absolutely for Re s > jj2 + 1 as a conseq uence of the Weil conjecture. Hasse's conjecture remains unsolved for the general case, but has been verified when V is one of the following varieties: (I.) Algebraic curves defined by the equation ye =yxf + D and Fermat hypersurfaces (Weil [W6]). (I b) Elliptic curves with complex multiplica- tion (Deuring [D 11]). (Ie) Abelian varieties with complex multiplica- tion (Taniyama [T2], Shimura and Taniyama [S 11], Shimura, H. Yoshida). (ld) Singular K3 surfaces, i.e., K3 surfaces with 20 Picard numbers (Shafarevich and Pyatetskii-Shapiro [P 1], Deligne [D2], T. Shioda and H. Inose [S21]). (II.) Algebraic curves that are suitable models of the elliptic modular function fields (Eichler [El], Shimura [SI2]). (lIb) Algebraic curves that are suitable models of the automorphic function fields obtained from a quaternion algebra (Shimura [S13, SI5]). (lIe) Certain fiber varieties of which the base is a curve of type (II.) or (lIb) and the fibers are Abelian varieties (Kuga and Shimura [K6], Ihara [11], Deligne [D3]). (lId) Certain Shimura varieties of higher dimension (Langlands and others;  [B6]). In these cases, ((s, V) can be expressed by known functions, i.e., by Hecke L-functions with Grossencharakters of algebraic number fields in cases (I) or by Dirichlet series corre- sponding to modular forms in cases (II). This fact has an essential meaning for the arithme- tic properties of these functions. For example, the extended tRamanujan conjecture concern- ing the Hecke operator of the automorphic form reduces to Weil's conjecture on varieties related to those in cases II. Moreover, for (1I.)-(lIe) the essential point is the congruence relation Tp=II +II* (Kronecker, Eichler [El], Shimura). In particular, for (lib) this formula is related to the problem of constructing class fields over totally imaginary quadratic exten- sions of a totally real field F utilizing special 
450 S Zeta Functions values of automorphic functions and class fields over F. Actually, the formula is equiva- lent to the reciprocity law for class fields (Shimura). One of the facts that makes the Hasse (- function important is that it describes the decomposition law of prime ideals of algebraic number fields when V is an algebraic curve or an Abelian variety (Weil, Shimura [SI4], Taniyama [T2J, T. Honda [H6J). In that case, its Hasse (-function has the following arithme- tic meaning. Let C be a complete, nonsingular algebraic curve defined over an algebraic number field K, and let J be the Jacobian variety of C de- fined over K. For a prime n urn ber I, fix an I-adic coordinate system I:[ on J, and let K(J, lro) be the extension field of K obtained by adjoining to K all the coordinates of the I'th division points (v= 1,2,...) of J. Then K (J, lro)/ K is an infinite Galois extension of K. The corresponding Galois group ti>(J, I""') has the I-adic representation a-->M1(a) by the 1- adic coordinates 1:[. Almost all prime ideals p of K are unramified in K(J, I ""')/K. Thus when we take an arbitrary prime factor \13 of p in K(J, I"'), the Frobenius substitution of \13, _ l K(J,ICX,)/K ] a,t! - \13 ' is uniquely determined. Furthermore, the characteristic polynomial det(1 - Mt( avp)u) is determined only by p and does not depend on the choice of the prime factor \13; we denote this polynomial by Pp(u, C). In this case, for almost all p, P p(u, C) is a polynomial with rational integral coefficients independent of I; namely, the numerator of the (-function of the reduction of Cmodp. Thus (l(S,C)=n Pp(N(p)-', C)-I p  n det(1- M1( a 'l1)N(p)-r 1 . p Here the product I1'det(1-M{(a'j.JN(p)-S)-l has the same expression as the Artin L- function if we ignore the fact that M1 is the 1- adic representation and K (J, I"') is the infinite extension. Thus if we can describe ((s, C) ex- plicitly, then the decomposition process of the prime ideal for intermediate fields between K (J, lro) and K can be made fairly clear. In fact, this is the case for examples (I,)-(lc) and (1I,)-(lI c ), from which the relations between the arithmetic of the field of division points K (J, 1"')/ K and the eigenvalues of the Hecke operator have been obtained. Thus for curves and Abelian varieties, ((s, V) is related to the arithmetic of some number fields; but it is not known whether similar arithmetical relations exist for other kinds of varieties except in a few cases. 1714 Tate's Conjecture. For a projective nonsin- gular variety V over a finite algebraic number field K, let l'(j?) denote the group of algebraic cycles of codimension r on j? = V @ K C modulo homological equivalence and let [r(v) be the subgroup of 21'(V) generated by algebraic cycles rational over K. Then Tate [T3J conjec- tured that the rank of I2l r (V) is equal to the order of the pole of (2r(S, V) at s = "+ 1. This conjecture is closely connected with Hodge's conjecture that the space of rational coho- mology classes of type (r, r) on j? is spanned by {'(V); in fact, the equivalence of these conjec- tures is known for Abelian varietie's of tCM type (H. Pohlmann, Ann. Math., 88 (1968)) and for Fermat hypersurfaces of dimension 2r (Tate [T3J, Weil [W6J). Thus, when r= 1, Tate's conjecture for these varieties holds by Lefschetz's theorem, and when r> I, it holds in certain cases where the Hodge conjecture is verified (Shioda, Math. Ann., 245 (1979); Z. Ran, Compositio Math., 42 (1981)). Further examples are given by K3 surfaces with large Picard num bers (Shioda and Inose [S21 J; T. Oda, Proc. Japan Acad., 56 (1980) I. L-Functions of Elliptic Curves. Let E be an elliptic curve (with a rational point) over the rational number field Q, and let N be its con- ductor; a prime number p divides N if and only if E has bad reduction modp (Tate [T5J). The L-function of E over Q is defined as follows: L(s, E) = Il (1-l: p P-S)-' Il (I _a p p-S+pl-2S)-', pIN ptN where I: p =O or :tl and 1 -apu + pu 2 = P, (u, Emodp). There are many interesting results and conjectures concerning L(s, E) [T5]: (1) Functional equation. Let (s, E) = N S / 2 (2n) -sf(s)L(s, E). Then it is conjectured that (s, E) is holo- morphic in the entire s-plane and satisfies the functional equation (s, E) = :t (2 - s, E). This is true if E has complex multiplication (Deur- ing) or E is a certain modular curve (Eichler, Shimura). (2) Taniyama-Weil conjecture. Weil [WI (1967a)J conjectured that, if L(s, E) = L:, ann -s, then f(T) = L:l ane2nint is a cusp form of weight 2 for the congruence subgroup fo(N) which is an eigenfunction for Hecke operators; moreover E is isogenous to a factor of the Jacobian variety of the modular curve for fo(N) in such a way that f(T)d: corre- sponds to the differential of the first kind on E. If this conjecture is true, then the statements in (1) follow. 
1715 (3) Birch-Swinnerton-Dyer conjecture. As- suming analytic continuation of L(s, E), B. Birch and H. Swinnerton-Dyer [B4J conjec- tured that the order of the zero of L(s, E) at s = 1 is equal to the rank r of the group E(Q) of rational points of E which is finitely generated by the Mordell- Weil theorem. They verified this for many examples, especially for curves of the type y2 = x 3 -ax. J. Coates and A. Wiles (I nventiones Math., 39 (1977)) proved that if E has complex multiplication and if r > 0 then L(s, E) vanishes at s = 1. This conjecture has a refinement which extends also to Abelian varieties over a global field (Tate, Sem. Bour- baki, no. 306 (1966)). (4) Sato's conjecture. Let I -apu + pu 2 =(1- n p u)(1- npu), with np = ,/"P e iOp (0 < Op < n). When E has complex multiplication, the distribution of Op for half of p is uniform in the interval [0, n], and Op is n/2 for the remaining half of p. Sup- pose that E does not have complex multipli- cation. Then Sato conjectured that (the number of prime numbers p lim _ h..an x_ stich that OpE [ PJ L X7 (the number of prime numbers less than x) = f P sin20dV (O<IX<{j<n) n " (Tate [T3J). H. Yoshida [Y 1 J posed an analog of Sato's conjecture for elliptic curves defined over function fields with finite constant fields and proved it in certain cases. (5) Formal groups. Letting L(s, E)=Lann- s as before, set .f(x) = Ln'=1 anx"/n. Honda [H6J showed that r 1 (f(x) +.f(y)) is a tformal (Lie) group with coefficients in Z and that this group is isomorphic over Z to a formal group obtained by power series expansion of the group law of E with respect to suitable tlocal uniformizing coordinates at the origin. Such an interpretation of the (-function also applies 10 other cases in which (-functions oftgroup varieties may be characterized as Dirichlet series whose coefficients give a normal form of the group law; e.g., the case of algebraic tori (T. Ibukiyama, J. Fac. Sci. Univ. Tokyo, (IA) 21 (1974)). T. Selberg (-Functions and (-Functions Associated with Discontinuous Groups Let fcSL(2, R) be a tFuchsian group operat- ing on the complex upper half-plane H = {z = x+iyly>O}. When the two eigenvalues of an element y E f are distinct real num bers  I' 450T Zeta Functions 2 (I 2 = 1, I < 2)' we call }' thyperbolic. Then the number  is denoted by N(i) and is called the norm of y. When y is hyperbolic, y" (n = 1,2,3, ...) is also hyperbolic. When :t y is not a positive power of other hyperbol elements, y is called a primitive hyperbolic element. The elements conjugate to primitive hyperbolic elements are also primitive hyperbolic ele- ments and have the same norm as y. Let PI' P 2 ,... be the conjugacy classes of primitive hyperbolic elements of r, and let Yi E Pi be their representatives. Suppose that a matrix repres- entation ,''''''' M(y) of f is given. Then the analy- tic function given by YC Zr(s,M)=n n det(l-M(yJN(yJ-'-n) i n=O is called the Selberg (-function (Selberg [S5J). When f\H is compact and f is torsion-free, then Zr(s, M) has the following properties. (1) It can be analytically continued to the whole complex plane of s and gives an tin- tegral function of genus at most 2. (2) It has zeros of order (2n + 1 )(2g - 2)v at - n (n = 0,1,2,3,...). Here 9 is the genus of the Riemann surface f\H and v is the degree of the representation M. All other zeros lie on the line Res= 1/2, except for a finite number that lie on the interval (0, 1) of the real axis. (3) It satisfies the functional equatIOn Zr(l-s, M)=Zr(s, M) ex p ( -vA(f\H) (S-I/2 ) x Jo vtan(nv)dv, where A(r\H)= f ( dxy =2n(2g-2), Jr H Y Property (2) shows that the Riemann hy- pothesis is almost valid for Zr(s, M). The proof is based on the following fact concerning the eigenvalue problem for the variety f\H: The eigenvalue A of the equation X+iyEH. y2(i;2/CX 2 + iF /cy2)U + ),U = 0, UE L 2(f\H) cannot be a negative number. Using this function, T. Yamada (1965) inves- tigated the unit distribution of real quadratic fields. Selberg (-functions are defined similarly when f\G has finite volume but is noncom- pact. In this case, however, the decomposition of L 2 (f\G) into irreducible representation spaces has a continuous spectrum; hence the properties of the Selberg (-function of fare quite different from the case when f\ Gis compact. Selberg defined the generalized Eisenstein series to give the eigenfunctions of this continuous spectrum explicitly. When f = 
450 U Zeta Functions SL(2, Z), the series is given by I I' Y'd 1 2 " (c.d)=l C! + This type of generalized Eisenstein series is also defined for the general semisimple alge- braic group G and its arithmetic subgroup. It has been studied by Selberg, Godement, Gel'fand, Harish-Chandra, Langlands, D. Zagier, and others. U. Ihara (-Functions Let kp be a p-adic field, op the ring of integers in kp, and G= PSL 2 (R) x PSL 2 (k p ). Suppose that f is a subgroup of G such that (1) r is dis- crete, (2) f\ G is compact, (3) f has no torsion element except the identity, (4) f R (the projec- tion of fin PSL 2 (R)) is dense in PSL 2 (R), and (5) fp (the projection of f in PSL 2 (k p )) is dense in PSL 2 (k p )' Then ffRfp' Let X={x+ iyIY>O} be the upper half-plane, and let f act on X via r R' The action of r on X is not discontinuous, but the subgroup fo= {YEflprojection ofy to f p EPSL 2 (op)} oper- ates on X properly discontinuously. For each ZEX, define fz={YEfly(z)=z}. Then L is isomorphic to Z or {I}. Let P(f) = {;EX I fzZ}. The group f acts on P(f), since fz and r yZ are conjugate in f. Let P(f) =P(f)/f. Suppose that PEP(f) is represented by ZEX. Choose a generator y of fz and pro- ject y to fp. Then y is equivalent to a diagonal matrix ( A 0 ) with AEk . We denote the o ),-1 P valuation of k b y ord and consider lord p ()')\' p p . This value depends only on P and we denote It by deg(P). The Ihara (-function of f is defined by Z,(u)= n (l_u deg (P»)-l. PE P(r) Ihara proved that Z,(u) q n (1-n j u)(l-n;u) (=1 (1 _ U)H, (1 - u)(1 - q 2 u) where q is the number of elements in the re- sidue class field of p, and 9 is the genus of the Riemann surface f 0 \X and H = (g - 1 )q(q - 1). Similar results hold even if f has torsion ele- ments and the quotient f\G is only assumed to have finite vol ume. Aside from the factor (1-U)H, this looks like Weil's formula for the congruence (-function of an algebraic curve defined over F q 2. Ihara conjectured that the first factor of Z,(u) is always the congruence (-function of some algebraic curve over Fq2, and furthermore that f could be regarded as the fundamental group of a certain Galois covering of this curve which describes the decomposition law of 1716 prime divisors in this covering [12.13,14]. He verified the conjecture in the case f = PGL 2 (Z[I/p]) by using the tmoduli of elliptic curves. Related results have been obtained by Shimura, Ihara, Y. Morita, and others. V. (-Functions Associated with Prehomogeneous Vector Spaces M. Sato posed a notion of prehomogeneous vector spaces and defined (-functions as- sociated with them. Sato's program has been carried on by himself and T. Shintani [S2, S3, S 17, S18]. Let G be a linear algebraic group, V a finite-dimensional linear space of dimension , n, and p a rational representation G-+GL(V), where G, V. and p are defined over Q. The triple (G,p, V) is called a prehomogeneous vector space if there exists a proper algebraic subset S of V c such that Vc-S is a single G c orbit. The algebraic set S is called the set of singular points of V. We also assume that Gis reductive and S is an irreducible hypersurface of V. Let V* be the dual vector space of V, and p* the dual (contragredient) representa- tion of G. Then (G,p*, V*) is again a pre- homogeneous vector space, and WI denote its set of singular points by S*. There are homogeneous polynomials P and Q of the same degree £I on V and V*, respectively, such that S={xEXIP(x)=O} and S*={X*E V*IQ(x*)=O}. P and Q are relative invariants of G, i.e., P(p(g)x) = X(g) P(x) and Q(p*(g)x*) = X(g)-1 Q(x*) (for gEG, XE V, and X*E V*) hold with a rational character X of G. Put G 1 =kerx={gEGlx(g)= 1}. Denote by G; the connected component of 1 of the Lie group GR' Let VR-S= VI U...U vi, V-S*= V 1 * U '.. U v,* be the decompositions of V R - S and V - S* into their topologically con- nected components. Then V, and j* are G;- orbits. We further assume that V R rl S decom- poses into the union of a finite number of Gk- orbits. Set r = G; rl Gi, and take l'-invariant lattices Land L * in V Q and V Q , respectively. Consider the following functions in s: (1\(!,S)= f j(x)lP(x)I'dx, v, <tJj(!, s) = f j*(x*)IQ(x*)IS dx*, v and Z;(j, L, s) = f x(g)' I j(p(g)x) dg, GR/f xELn V. Zj(J*,L*,s) = f x(g)-' I j*(p*(g)x*)dg, GRlr x*EL"'nVj 
1717 where f and f* are trapidly decreasing func- tions on V R and V, respectively, dx and dx* are Haar measures of V R and V, respectively, and dg is a Haar measure of G. Then the ratios Z;(f,L,s) <b;(f,s-n/d) =i(s,L), Z'!' (f * L* S ) J ' , --,,* ( , L* ) <bj(f*,s-n/d) -Sj s, are independent of the choice of rand f* and are Dirichlet series in s. These Dirichlet series i(s,L) and j(s,L*) are called -functions associated with the prehomogeneous space. Considering Fourier transforms of I P(x)I' and IQ(x*)I\ we obtain functional equations for i and j under some additional (but mild) con- ditions on (G, p, V) as follows. The Dirichlet series i and r are analytically continuable to meromorphic functions on the whole s-plane, and they satisfy v(L*)j(n/d -s, L*) =y(s-n/d)(2n)-d'lb o l'exp(ndj=t s/2) I X I uij(sK(s, L), j=1 with a f-factor y(s)= rr1=1 f(s-c i + 1). Here uJs) (li,jl) are polynomials in exp( - nJ= 1 s) with degree  d, and ho and C i are constants depending only on (G, p, V). Epstein's (-functions and Siegel's Dirichlet series associated with indefinite quadratic forms are examples of the above-defined (- functions. Shintani defined such (-functions related to integral binary cubic forms and obtained asymptotic formulas concerning the class numbers of irreducible integral binary cubic forms with discriminant n, which are improvements on the results of Davenport [SI7]. Recently M. Sato studied (-functions of pre- homogeneous vector spaces without assuming the conditions that G is reductive and S is irreducible. In this case, (-functions of several complex variables are obtained. For examples and classification of prehomogeneous vector spaces  [S4]. References [AI] E. Artin, Collected papers, S. Lang and 1. T. Tate (eds.), Addison-Wesley, 1965. [A2] E. Artin, Zur Theorie der L-Reihen mit allgemeinen Gruppencharakteren, Abh. Math. Sem. Univ. Hamburg, 8 (1930), 292-306 [AI, 165-179]. 450 Ref. Zeta Functions [A3] M. Artin, A. Grothendieck, and J. L. Verdier, Theorie des topos et cohomologie etale des schemas (SGA 4), Lecture notes in math. 269,270,305, Springer, 1972-19]3. [A4] J. Ax, On the units of an algebraic num- ber fields, Illinois J. Math., 9 (1965), 584-589. [B1] P. Bayer and 1. Neukirch, On values of zeta functions and l-adic Euler characteristics, Inventiones Math., 50 (1978),35-64. [B2] P. Berthelot, Cohomogie cristalline des schema de caracteristique p > 0, Lecture notes in math. 407, Springer, 1974. [B3] P. Berthelot and A. Ogus, Notes on crystaiiine cohomology, Math. notes 21, Princeton U niv. Press, 1978. [B4] B. Birch and H. Swinnerton-Dyer, Notes on elliptic curves 11,1. Reine Angew. Math., 218 (1965), 79-108. [B5] E. Bombieri and H. P. F. Swinnerton- Dyer, On the local zeta function of a cubic threefold, Ann. Scuola Norm. Sup. Pisa, (3) 21 (1967),1-29. [B6] A. Borel and W. Casselman (eds.), Auto- morphic forms, representations, and L- functions, Amer. Math. Soc. Proc. Symp. Pure Math., 33 (pts. 1 and 2) (1979). [B7] A. Brumer, On the units of algebraic number fields, Mathematika, 14(1967), 121- 124. [Cl] K. Chandrasekharan, Lectures on the Riemann zeta-function, Tata Inst., 1953. [C2] S. Chowla and A. Selberg, On Epstein's zeta function I, Proc. Nat. Acad. Sci. US, 35 (1949),371-374. [C3] J. Coates. P-adic L-functions and Iwasawa's theory, Algebraic Number Fields (L-functions and Galois Properties), A. Froh- lich (ed.), Academic Press, 1977. [D 1] R. Dedekind, Dber die Theorie der gan- zen algebraischen Zahlen, Dirichlet's Vor- lesungen iiber Zahlentheorie, Supplement XI, fourth edition, 1894, secs. 184, 185, 186 (Ge- samme1te mathematische Werke III, Braun- schweig, 1932,297-314; Chelsea, 1968). [D2] P. Deligne, La conjecture de Weil pour les surfaces K3, Inventiones Math., 75 (1972), 206-226. [D3] P. Deligne, Formes modulaires et repre- sentations l-adiques, Sem. Bourbaki, expo 355, Lecture notes in math. 349, Springer, 1973. [D4] P. Deligne, La conjecture de Weil, I, II, Pub!. Math. Inst. HES, 43 (1974), 273-307; 52 (1980), 137-252. [D5] P. DeIigne, Cohomologie etale (SGA 41), Lecture notes in math. 569, Springer, 1977. [D6] P. Deligne, Les constantes des equations fonctionnelles des fonctions L, Lecture notes in math. 349, Springer, 1973. [D7] P. Deligne and N. Katz, Groupes de monodromie en geometrie algebrique (SGA 
450 Ref. Zeta Functions 711), Lecture notes in math. 340, Springer, 1973. [D8J P. Deligne and K. Ribet, Values of Abelian L-functions at negative integers over totally real fields, Inventiones Math., 59 (1980), 227-286. [D9J P. Deligne and J-P. Serre, Formes modulaires de poids L Ann. Sci. Ecole Norm. Sup. (4), 7 (1974), 507 530. [DIOJ M. Deuring, Algebren, Erg. Math., Springer, second edition, 1968. [01 IJ M. Deuring, Die Zetafunktionen einer algebraischen Kurve vom Geschlechte Eins 1, 11, 111, IV, Nachr. Akad. Wiss. Gottingen (1953),85-94; (1955),13 -42; (1956), 37-76; (1957),55-80. [DI2J K. Doi and H. Naganuma, On the functional equation of certain Dirichlet series, Inventiones Math., 9 (1969- I 970), 1-14. [DI3J B. Dwork, On the rationality of the zeta-function of an algebraic variety, Amer. 1. Math.. 82 (1960),631-648. [0 I 4J B. Dwork, On the zeta function of a hypersurface I, II, I1l, Publ. Math. Insl. HES, 1962, 5-68; Ann. Math., (2) 80 (1964), 227 - 299; (2) 83 (1966),457-519. [E I J M. Eichler, Einfiihrung in die Theorie der algebraischen Zahlen und Funktionen, Birkhauser, 1963; English translation, Intro- duction to the theory of algebraic numbers and functions, Academic Press, 1966. [FIJ B. Ferrero and L. C. Washington, The Iwasawa invariant J1 p vanishes for Abelian number fields, Ann. Math., (2) 109 (1979), 377- 395. [G I J R. Godement, Les fonctions , des alge- bres simples 1, 11, Sem. Bourbaki, expo 17 I, 176,1958- 1959. [G2J A. Grothendieck, Formule de Lefschetz et rationalite des fonction L, Sem. Bourbaki, expo 379, 1964-1965 (Benjamin, 1966). [G3J A. Grothendieck, Cohomologie l-adique et fonctions L (SGA 5), Lecture notes in math. 589, Springer, 1977. [H IJ H. Hasse, Zur Theorie der abstrakten elliptischen Funktionenkorper I, 11, 111, J. Reine Angew. Math., 175 (1936), 55-62, 69-88, 193-208. [H2J E. Hecke, Mathematische Werke, Van- denhocck & Ruprecht, 1959. [H3J E. Hecke, Eine neue Art von Zetafunc- tionen und ihre Beziehung zur Verteilung der Primzahlen I, 11, Math. Z., I (1918),357-376; 6 (1920), II-51 [H2, 215--234, 249-289]. [H4J E. Heeke, Ober die Bestimmung Dirich- letscher Reihen durch ihre Funktionalglei- chung, Math. Ann., 112 (1936), 664-699 (H2, 591-626]. CH5J E. Hecke, Ober Modulfunktionen und die Dirichletschen Reihen mit Eulerscher Produktentwicklung I, 11, Math. Ann., 114 1718 (1937),1-28; 316-351 [H2, 644-b71, 672- 707]. [H6J T. Honda, Formal groups and zeta functions, Osaka J. Math., 5 (1968). 199-213. [H7J T. Honda, Isogeny classes cf Abelian varieties over finite fields, J. MatI'. Soc. Japan, 20 (1968),83-95. [I I J Y. Ihara, Hecke polynomials as con- gruence , functions in elliptic modular case (To validate Sa to's identity), Ann, Math., (2) 85 (1967),267-295. [12J Y. Ihara, On congruence monodromy problems, 1, 11, Lecture notes, Univ. of Tokyo, 1968- I 969. [13J Y. Ihara, Non-Abelian class fields over function fields in special cases, Actes Congr. Intern. Math., 1970, Nice, Gauthier-Villars, p. 38 I ..389. [14J Y Ihara, Congruence relations and Shimura curves. Amer. Math. Soc. ProC. Symp. Pure Math., 33 (1979), pI. 2,291 -311. [15J K. lwasawa, On f-extensions of algebraic number fields, Bull. Amer. Math. Soc., 65 (1959),183-226. [16J K. Iwasawa, On p-adic L-functions, Ann. Math., (2) 89 (1969),198-205. [17J K. lwasawa, Lectures On p-adic L- functions, Ann. math. studies 74, Princeton Univ. Press. 1972. [JIJ H. Jacquet and R. P. Langlands, Auto- morphic forms on GL(2), Lecture notes in math. 114, Springer, 1970. [J2J H. Jacquet, Automorphic fOlms on GL(2), 11, Lecture notes in math. 278, Springer, 1972. [KIJ N. Katz and W. Messing, Some conse- quences of the Riemann hypothesis for varieties over finite fields, Inventiones Math., 23 (1974), 73-77. [K2J N. Katz, An overview of Deligne's proof of the Riemann hypothesis for varieties over finite fields, Amer. Math. Soc. Proc. Symp. Pure Math., 28 (1976), 275-305. [K3J N. Katz, P-adic L-functions for CM- fields, Inventiones Math., 49 (1978),199-297. [K4J S. Kleiman, Algebraic cycles and the Weil conjectures, Dix exposes sur la coho- mologies des schemas, North-Holland, 1968, 359-386. [K5J T. Kubota and H. W. Leopoldt, Eine p- adischer Theorie der Zetawerte, J Reine Angew. Math., 214/215 (1964), 32:-339. [K6J M. Kuga and G. Shimura, On the zeta function of a fiber variety whose fibers are Abelian varieties, Ann. Math., (2) 82 (1965), 478-539. [LlJ E. G. H. Landau, Handbuch der Lehre von der Verteilung der Primzahlen I, II, Teub- ner, 1909 (Chelsea, 1953). [L2] S. Lang, Sur les series L d'une variete algebrique, Bull. Soc. Math. France, 84 (1956), 385-407. 
1719 [L3J S. Lang, Algebraic number theory, Addison-Wesley, 1970. [L4J R. P. Langlands, Problems in the theory of automorphic forms, Lecture notes in math. 170, Springer, 1970, 18 -86. [L5J R. P. Langlands, Base change for GL(2), Ann. math. studies 96, Princeton Univ. Press, 1980. [L6J H. W. Leopoldt, Eine p-adische Theorie der Zetawerte 11, J. Reine Angew. Math., 274/275 (1975), 224-239. [L 7J S. Lubkin, A p-adic proof of the Weil conjecture, Ann. Math., (2) 87 (1968),105-194; (2) (1968), 195- 225. [MIJ Yu. 1. Manin, Periods ofparabolicforms and p-adic Hecke series, Math. USSR-Sb., 21 (1973),371-393. [M2J B. Mazur, Eigenvalues of Frobenius acting on algebraic varieties over finite fields, AmeI. Math. Soc. Proc. Symp. Pure Math., 29 (1975). [M3J B. Mazur and H. Swinnerton-Dyer, On the p-adic L-series of an elliptic curve, I nven- tiones Math., 25 (1974), 1-61. [M4J J. S. Milne, Etale cohomology, Prince- ton series in mathematics 33, Princeton Univ. Press, 1980. [P1J I. I. Pyatetskii-Shapiro and I. R. Shafa- revich, The arithmetic of K 3 surfaces, Proc. Steklov Inst. Math., 132 (1973), 45-57. [P2J K. Prachar, Primzahlverteilung, Springer, 1957 [R 1 J B. Riemann, Uber die Anzahl der Prim- zahlen unter einer gegebenen Grosse, Gesam- melte mathematische Werke, 1859, 145-153 (Dover, 1953). [SIJ H. Saito, Automorphic forms and alge- braic extensions of num ber fields, Lectures in math. 8, Kinokuniya, Tokyo, 1975. [S2J M. Sato, Theory of prehomogeneous vector spaces (notes by T. Shintani) (in Japa- nese), Sllgaku no Ayumi, 15 (1970), 85-157. [S3J M. Sato and T. Shintani, On zeta func- tions associated with prehomogeneous vector spaces, Ann. Math., (2) 100 (1974), 131-170. [S4J M. Sato and T. Kimura, A classification of irreducible prehomogeneous vector spaces and their invariants, Nagoya Math. J., 65 (1977),1-155. [55J A. Selberg, Harmonic analysis and dis- continuous groups on weakly symmetric Riemannian spaces with applications to Dirichlet series, 1. Indian Math. Soc., 20 (1956),47-87. [S6J A. Selberg, On the zeros of Riemann's zeta-function, SkI. Norske Vid. Akad. Oslo, no. 10 (1942), 1-59. [S7J J.-P. Serre, Zeta and L functions, Arith- metical Algebraic Geometry, Harper & Row, 1965, 82-92. [S8J J.-P. Serre, Facteurs locaux des fonctions 450 Ref. Zeta Functions zeta des varietes algebriques (Definitions et conjectures), Sem. Delange-Pisot-Poitou, 11 (1969/70), no. 19. [S9J J.-P. Serre, Formes modulaires et fonc- tions zeta p-adiques, Lecture notes in math. 317, Springer, 1973,319-338. [SIOJ H. Shimizu, On zeta functions of quater- nion algebras, Ann. Math., (2) 81 (1965), 166- 193. [SI1J G. Shimura and Y. Taniyama, Complex multiplication of Abelian varieties and its applications to number theory, Pub!. Math. Soc. Japan, no. 6, 1961. [S 12J G. Shimura, Introduction to the arith- metic theory of automorphic functions, Pub!. Math. Soc. Japan, no. 11, 1971. [SI3J G. Shimura, On the zeta-functions of the algebraic curves uniformized by certain automorphic functions, 1. Math. Soc. Japan, 13 (1961), 275-331. [SI4J G. Shimura, A reciprocity law in non- solvable extensions, J. Reine Angew. Math., 221 (1966),209-220. [S 15J G. Shimura, Construction of class fields and zeta functions of algebraic curves, Ann. Math., (2) 85 (1967), 58- 159. [SI6J G. Shimura, On canonical models of arithmetic quotients of bounded symmetric domains, 1, 11, Ann. Math., (2) 91 (1970) 144- 222; 92 (1970), 528-549. [S 17J T. Shintani, On Dirichlet series whose coefficients are class num bers of integral binary cubic forms, J. Math. Soc. Japan, 24 (1972), 132- I 88. [S 18J T. Shintani, On zeta functions as- sociated with the vector space of quadratic forms, 1. Fac. Sci. Univ. Tokyo, (IA) 22 (1975), 25-65. [S 19J T. Shintani, On a Kronecker limit for- mula for real quadratic fields, J. Fac. Sci. Univ. Tokyo, (1A) 24 (1977),167-199. [S20J T. Shintani, On liftings of holomorphic automorphic forms, AmeI. Math. Soc. Proc. Symp. Pure Math., 33 (1979), pt. 2, 97-110. [S21J T. Shioda and H. Inose, On singular K3 surfaces, Complex Analysis and Algebraic Geometry, Iwanami and Cambridge Univ. Press, 1977, 119-136. [S22J c. L. Siegel, Gesammelte Abhandlungen I, II, III, IV, Springer, 1966, 1979. [S23J c. L. Siegel, Uber die Classenzahl qua- dratischer Zahlkorper, Acta Arith., 1 (1935), 83-86 [S22, I. 406-409]. [S24J c. L. Siegel, Uber die Zetafunktionen indefiniter quadratischer Formen 1, 11, Math. Z., 43 (1938),682-708; 44 (1939),398-426 [S22, II, 41-67, 68-96]. [S25J H. M. Stark, L-functions at s= 1, II, III, Advances in Math., 17 (1975), 60-92; 22 (1976), 64-84. [TlJ T. Tamagawa, On the (-functions of a 
450 Ref. Zeta Functions 1720 division algebra, Ann. Math., (2) 77 (1963), 387-405. [T2] Y. Taniyama, L-functions of number fields and zeta functions of Abelian varieties, J. Math. Soc. Japan, 9 (1957),330-366. [T3] 1. Tate, Algebraic cycles and poles of zeta functions, Arithmetical Algebraic Geometry, Harper & Row, 1965,93-110. [T4] J. Tate, Endomorphisms of Abelian varieties over finite fields, Inventiones Math., 2 (1966),134-144. [T5] 1. Tate, The arithmetic of elliptic curves, Inventiones Math., 23 (1974), 179-206. [T6] 1. Tate, Number theoretic background, Amer. Math. Soc. Proc. Symp. Pure Math., 33 (1979), pI. 2, 3-26. [T7] T. Tatuzawa, On the Hecke-Landau L- series, Nagoya Math. J., 16 (1960), 11-20. [T8] E. C. Titchmarch, The theory of the Riemann zeta-function, Clarendon Press, 1951. [WI] A. Weil, Collected papers, I-III, Springer, 1980. [W2] A. Weil, Courbes algebriques et varietes abeliennes, Hermann, 1971 (orig., 1948). [W3] A. Weil, Numbers of solutions of equa- tions in finite fields, Bull. Amer. Math. Soc., 55 (1949), 497-508 [WI, I, 399-400]. [W4] A. Weil, Number theory and algebraic geometry, Proc. Intern. Congr. Math., 1950, Cambridge, vol. 2, 90-100 [WI, I, 442-449]. [W5] A. Weil, Sur la theorie du corps de classes, 1. Math. Soc. Japan, 3 (1951), 1-35 [Wl,I,483-517]. [W6] A. Weil, Jacobi sums as "Grossen- charaktere," Trans. Amer. Math. Soc., 73 (1952),487-495 [WI, II, 63-71]. [W7] A. Weil, On a certain type of characters of the idele-c1ass group of an algebraic n urn ber field, Proc. Intern. Symposium on Algebraic Number Theory, Tokyo-Nikko, 1955, 1-7 [WI, II, 255-261]. [W8] A. Weil, Basic number theory, Springer, 1967. [W9] A. Weil, Dirichlet series and automor- phic forms, Lecture notes in math. 189, Springer, 1971. [YI] H. Yoshida, On an analogue of the Sato conjecture, Inventiones Math., 19 (1973), 261- 277. 
Appendix A Tables of Formulas 1 Algebraic Equations 2 Trigonometry 3 Vector Analysis and Coordinate Systems 4 Differential Geometry 5 Lie Algebras, Symmetric Riemannian Spaces, and Singularities 6 Algebraic Topology 7 Knot Theory 8 Inequalities 9 Differential and Integral Calculus 10 Series 11 Fourier Analysis 12 Laplace Transforms and Operational Calculus 13 Conformal Mappings 14 Ordinary Differential Equations 15 Total and Partial Differential Equations 16 Elliptic Integrals and Elliptic Functions 17 Gamma Functions and Related Functions 18 Hypergeometric Functions and Spherical Functions 19 Functions of Confluent Type and Bessel Functions 20 Systems of Orthogonal Functions 21 Interpolation 22 Distribution of Typical Random Variables 23 Statistical Estimation and Statistical Hypothesis Testing 
Appendix A, Table 1 1722 Algebraic Equations 1. Algebraic Equations ( 10 Algebraic Equations) (I) Quadratic Equation ax 2 + bx + e = 0 (a 7"" 0) The roots are - b i: V b 2 - 4ae 2a The discriminant is b 2 -4ae. -b'i: a (b:=2b'). x= (II) Cubic Equation ax 3 + bx 2 + ex + d = 0 ( a 7"" 0) By the translation g=x+b/3a, the equation is transformed into g3+3pg+q=0, where p == (3ae - b 2 )/9a 2 , q == (2b 3 - 9abc + 27 a 2 d)/27 a 3 . Its discriminant is - 27(q2 +4 p 3). The roots of the latter equation are g=  + -?r/i, w1a +w 2 -?r/i, W2 +w-?r/i , where 2"i/3 -1 + V3 i a: } - qi: V q2+4 p 3 w= e = 2 '/3 = 2 Casus irreducibilis (the case when q2 + 4 p 3 < 0). Putting a::= re i9 (/3 = ii), the roots are (Cardano's formula). g=2-0: cos(0/3), 2-0: cos[(0+2'1T)/3], 2-0: cos[(0+4'1T)/3]. (III) Quartic Equation (Biquadratic Equation) ax 4 + bx 3 + ex 2 + dx + e = 0 ( a 7"" 0) By the translation g = x + b /4a, the equation is transformed into g4+ pg2+ qg+ r=O. The cubic resolvent of the latter IS t 3 - pt 2 - 4rt + (4pr - q2) = O. If to is one of the roots of the cubic resolvent, the roots g of the above equation are the solutions of two quadratic equations e i: V to - P [g - q /2( to - p) ] + to/2 = 0 (Ferrari's formula). 2. Trigonometry (I) Trigonometric Functions (432 Trigonometry) MP=sinO, OM = cosO, A T= tanO, (1) In Fig. 1, OA = OB= OP= 1, and BL=cotO, OT=secO, (2) sin2t1 + cos2t1 = 1, tanO=sinO/cosO, cotO= l/tanO, secO= l/cosO, OL = cosec O. cosecO= l/sinO, 1 +tan 2 0=sec 2 0, 1 +cot 2 0=cosec 2 0. (3) 0 -0 '1T/2i:0 '1Ti:0 n'1Ti:O sm s -s c +s i:(-lrs cos c e +s -e (- Ire tan t -t += lit i:t i:t 
1723 App. A, Table 2.11 Trigonometry (4) a 0 0 15 0 18 0 22.5 0 30 0 36 0 45 0 r 0 'IT/12 'IT/10 'IT/8 'IT/6 'IT/5 'IT/4 sina 0 Y3 -1 V5 -1 \h-V2 1 YlO-2V5 1 - --;r- 2 2 4 - cosa 2V2 V2 1 Y3 +1 Y lO+2Y5 Y 2+V2 V3 V5 +1 1 cosa 2 2 - sma 2V2 4 4 V2 'IT/2 5'IT/12 2'IT/5 3'IT/8 'IT/3 3'IT/10 'IT/4 .; 90 0 75 0 72 0 67.5 0 60 0 54 0 45 0 a (5) Addition Formulas sin (a :!: {3) = sin a cos{3 :!:cosa sin{3, cos (a:!: {3) = cosa cos{3:+: sina sin{3, tan(a:!: {3) = (tan a :!: tan{3)/(l:+: tan a tan{3). (6) sin 2a = 2 sin a cos a, cos 2a = cos 2 a - sin 2 a = 2 cos 2 a - 1 = 1 - 2 sin 2 a, tan2a=2tana/(1-ta n2 a). sin 3a = 3 sina - 4sin 3 a, cos 3a = 4cos 3 a - 3cosa, tan3a = (3 tana - tan 3 a)/(l- 3 tan 2 a). [(.-1)/2] ( n ) '. . sin nIX = I . (-I)'sin2'+1lXcos.-(2'+1)IX, i=O 2/+1 [] ( n ) . . cosnlX= L . (-lYsin 2 'lXcos.- 2 '1X. i=O 21 (7) sin 2 (a/2)=(1-cosa)/2, cos 2 (a/2)=(1 +cosa)/2, tan 2 (a/2) = (1- cosa)/(l + cosa). (8) 2sinacos{3 = sin (a + {3)+ sin(a - {3), 2cosa sin{3 = sin(a + {3) - sin(a - {3), 2cosa cos{3 = cos( a + {3) + cos (a - {3), - 2sin a sin{3 = cos(a + {3)- cos(a - {3). sina + sin{3 = 2 sin [(a + {3)/2]cos[( a - {3)/2], sina - sin{3 = 2cos[(a + {3)/2]sin [(a - {3)/2], cosa + cos{3 = 2 cos[( a + {3)/2]cos[(a - {3)/2], cosa - cos{3 = - 2 sin [(a + {3)/2] sin[(a - {3)/2]. (II) Plane Triangles As shown in Fig. 2, we denote the interior angles of a triangle ABC by a, {3, y; the corresponding side lengths by a, b, e; the area by S; the radii of inscribed, circumscribed, and escribed circles by" R, 'A, respectively; the perpendicular line from the vertex A to the side BC by AH; the midpoint of the side BC by M; bisector of the angle A by AD; and the lengths of AH, AM, AD by h A , mA,fA, respectively. Similar notations are used for Band C. Put s=(a + b + e)/2. The symbol... means similar formulas by the cyclic permutation of the letters A, B, C, and corre- sponding quantities. -/!- = -!!- =  =2R (law of sines). sma sln{3 sm y a=bcosy+ecos{3, (the first law of cosines). a 2 = b 2 + e 2 - 2be cos a, '" (the second law of cosines). sin 2 (a/2)=(s-b)(s-e)/be, ..., (b + e)sin( a/2) = a cos [( {3 - y)/2], a+b _ tan[(a+{3)/2] a-b - tan[(a-{3)/2]' cos 2 ( a/2) = s(s - a)/ be, .." (b - e)cos(a/2) = a sin[( {3 - y)/2], (Napier's rule). 
App. A, Table UII Trigonometry 1724 s= ahA /2 = (1 /2)be sin a: = (1 /2)a 2 sin{3 sin y /sina: = abe /4R = 2R 2 s in a: sin{3 sin 'I = rs = rA (s - a)= Yrr A rBr e = V s(s - a)(s - b )(s - e) (Heron's formula). r = (s - a)tan(a:/2) =4R sin(a:/2)sin( {3 /2)sin(y /2). rA = stan (a:/2) = (s - b)cot(y /2) = 4R sine a:/2)cos ({3 /2)cos( y /2). 1/ r =(1 / h A ) + (1/ hB) + (1/ he). mj = (2b 2 + 2e 2 - a 2 )/ 4= (b 2 + e 2 + 2becos a:)/4. fA = 2be cos (a:/2)/(b + e) = 2 V bes(s - a) /(b + e). fAfBfe = 8abcrs 2 /(b + e)(e + a)(a + b). A Fig. 2 (III) Spherical Triangles We denote the interior angles of a spherical triangle by a:, {3, y; the corresponding sid,es by a, b, e; the area by S; and the radius of the supporting sphere by p. We have sina: sin b : sin e = sina: : sin{3: sin y cosa = cos b cos e + sin b sin ecosa:, (law of cosines). sina cos{3 = cos b sin e - sinb cose cosa:, cotasinb = cosb cosy + cot a: sin y, ... (law of sines). cos Ct. = - cos {J cos y + sin {J sin y cos a, (law of sines and cosines). (law of cotangents). tan[(a + b)/2J!tan [(a - b)/2] = tan [(a: + {3)/2J!tan[(a: -{3)/2], (law of tangents). tan [(a: + {3)/2]tan (y /2) = cos [(a - b)/2J!cos [(a + b)/2], ..., tan [(a: -{3)/2]tan (y /2) = sin [(a - b)/2J!sin [(a + b)/2], ''', tan [(a + b)/2]cot( e /2) = cos [(a: -{3)/2J! cos[(a: + {3)/2], ..., tan [(a - b)/2]cot( e /2) = sin [( a: -{3)/2J!sin [(a: + {3)/2], (Napier's analogies). 2 2 cos 2 (a/2R) + cos 2 (b/2R) + cos 2 (c/2R) S=(a.+{J+y-n)p =2p arccos (Heron's formula). 2 cos (a/2R) cos(b /2R) cos( e /2R) For a right triangle (y = 7r /2), we have Napier's rule of circular parts: taking the subscripts modulo 5 in Fig. 3, sin (); = tan (}j+ ]tan (};_] = cos (}j+2COS (}j-2' 
1725 App. A, Table 3 Vector Analysis and Coordinate Systems For example, We have cose = cosacosb = cota cot{3, cos{3 = tan a cotc = cos b sina, sina = tan b cot{3 = sine sina. Fig. 3 3. Vector Analysis and Coordinate Systems We denote a 3-dimensional vector by Ao=(Ax,Ay,AJ=Axi+Ayj+Azk, IAI = yA ;+A;+A; . (I) Vector Algebra (442 Vectors) Scalar product A' Bo=ABo=(A,B)= AxBx+ AyBy+AzBz= \AIIBlcoso (where 0 is the angle between A and B). Vector product j k AXBo=[A,B]=(AyBz-AzBy)i+(AzBx-AxBz)j+(AxBy-AyBx)k= Ax Ay Az. Bx By Bz IAxBI = IAIIBlsinO. A.B=B'A. A.Ao=A 2 =IAI 2 . AxA=O. A'(AXB)=O. (AXB)2"'IAfIBI 2 -(A'B)2. Ax (BXC)=(A' C)B- (A' B)C. Ax (BX C)+BX (CXA)+CX (AXB)=O. (AX B) . (CXD)=A' {BX(CXD)} =(A' C)(B' D)-(B' C)(A' D). Ax Ay Az Scalar triple product [ABC]o=A'(BXC)=B'(CXA)=C'(AXB)= Bx By Bz. C x C y C z I A'E A'F A'G I [BCDJA + [ACDJB + [ABDJC = [ABC]D. [ABC][EFGJ= B'E B'F B'G. C'E C'F C'G (II) Differentiation of a Vector Field (...... 442 Vectors) 'V .a .a k a 0= I ax + J ay + az acp . acp . acp gradcpo= 'V cp= ax 1+ ay J+ azk (gradient of cp), rotA 0= 'V XA= ( aAz _ OA y ) i+ ( aAx _ aA z ) j+ ( aAy _ aA x ) k ay az az ax ax ay (Nabla), (rotation of A), . aA x aA y aAz dlV A 0= 'V . A = ax + ay- + az (divergence of A), . a 2 cp a a 2 cp Acpo='V 2 cpo=dlVgradcp= - + - + - (Laplacian of cp). ax 2 ay2 az 2 grad ( cpl/i) = cp grad I/i + I/i gradcp, grad (A . B)= (B' grad)A+ (A' grad)B+ Ax rotB+ B X rotA, rot(cpA) = cprotA - Ax gradcp, rot (A xB) = (B' grad)A -(A' grad)B+AdivB- Bdiv A, dive cpA) = cpdiv A +A' gradcp, div(AXB) = B' rotA - A' rotB. rotgradcp=O, divrotA=O. AA=graddivA-rotrotA. A(Jo cp) = (d! / dcp)Acp + (d1/ dcp2)(gradcp)2, A( cp1/;) = cpAI/i + l/iAcp + 2(gradcp' grad I/i). 
App. A, Table 3.m Vector Analysis and Coordinate Systems 1726 (III) Integration ofa Vector Field (94 Curvilinear Integrals and Surface Integrals, 442 Vectors) Let D be a 3-dimensional domain, B its boundary, dV the volume element of D, dS the surface element of B, and dS=n dS, where n is the outer normal vector of the surface B. We have Gauss's formula ffdiVAdV= fLdS.A= fL(n'A)dS, f f rotAdV= f L dSxA= f L(nXA)dS, f f gradcpdV= f L cpdS; fLcp  dS= ff(cpt.1/;+gradcp.grad1/;)dV, f L( cp  - 1/; : )dS= f f  (cpt.1/;- 1/;t.cp)dV, 4?Tcp(xo)=- ff t. r cp dV+ fL{ + : -cp aan (+)}dS, Green's formula where r is the distance from the point Xo. Let B be a bordered surface with a boundary curve r, ds the line element of r, dS the surface element of B, and ds= tds, dS=ndS, for t the unit tangent vector of r and under the proper choice of the positive direction for the surface normal n. We have Stokes's formula II dS'rotA=r A'dS=/'A)dS' II dSXgradCP=r cpds. If the domain D is simply connected, and the vector field V tends sufficiently rapidly to o near the boundary of D and at infinity, we have Helmholtz's theorem V = gradcp + rotA, cp = - f J k dl:; dV, A = f f  :; dV. (IV) Moving Coordinate System Denote differentiation with respect to the rest and the moving systems by d/ dt, d* / dt, respec- tively. Let the relative velocity of the systems be v. Then we have dcp d*cp - = - -v.grad cp dt dt ' dA d* A [ - = - - V' gradA - (A' grad)v J. dt dt With respect to rotating coordinates we have v=wXr. dA d* A [ ] - = - + w+A-«wXr).grad)A dt dt When the domain of integration is also a function of t, :r f A'ds= f { aa +grad(v'A)-vXrotA }'dS, :r f fA 'dS= f f { aa +rot(AXv)+ vdiv A} 'dS, :r fffcpdV= fff{  +(v.gradcp)+CPdivv}dV= ffJ  dV+ ffcpv'dS. (V) Curvilinear Coordinates (90 Coordinates) Let (xJ, ...,x n ) be rectangular coordinates in an n-dimensional Euclidean space. If Xj=Cp/UJ, ...,u n ) (j= 1, ...,n), J =det(acpj/auk)O, the system (uJ, ..., un) may be taken as a coordinate system of an n-dimensional space, and the 
1727 App. A, Table 3. V Vector Analysis and Coordinate Systems original space is a Riemannian manifold with the first fundamental form n  acpj acp; gjk=  au. au j= \ J k (j,k= 1, ...,n), g=det( gjk) =J2. When the metric is of the diagonal form gjk = gJ8jb the coordinate system (uJ, ..., un) is called an orthogonal curvilinear coordinate system or an isothermal curvilinear coordinate system. In such a case we have J = g\. ..gn' and the line element is given by ds 2 = "i.'J= 1 gJduJ. F or a scalar f and a vector  = (h . .. 'n)' we have n I::.f= 1.  .J... ( L a f ) J  au g 2 au ' j= \ J  J div= 1.  .J... ( Lj ) , (rot)jk= --L [ a (gkk) _ a (gA) ] J  au} gj gjgk aU j aUk }=I 1 af (gradj)j= -- gj aUj (j= 1, ...,n), (j,k= 1, ...,n). When n = 2, the rot may be considered a scalar, rot = (rotg) 12, and when n = 3, the rot may be considered a vector, with components rot= «rotb, (rotg)3h (rotg)12)' The following are examples of orthogonal coordinates. (1) Planar Curvilinear Coordinates. In the present Section (1), we put Xl=X, x2=Y' UI=U, U2=V, gl=P, g2=q. ds 2 =p 2 dx2+q 2 dy2, J=a(x,y)ja(u,v)=Ypq. Planar orthogonal curvilinear coordinates may be represented in the form x+ iy =F( U + iV), F being a complex analytic function, by suitable choice of the functions U = U (u), V = V (v). (i) Polar Coordinates (r,{}) (Fig. 4). x= rcos{}, y = rsin{}; x+ iy = exp (logr+ i{}). r= V x 2 + y2 , {}= arctan (y j x). p=l, q=r, J=r, ds 2 =dr 2 +r 2 d{}2. a :f 1 af I a 2f I::.f=--+--+-- ar 2 r ar r 2 a{} 2 . y p y o Fig. 4 x x (ii) Elliptic Coordinates (11, p) (Fig. 5). Among the family of confocal conics 2 2 +=1 (a>b), a 2 +p b 2 +p there are two values of p for which the curve passes through a given point P (x,y). Denote the two values of p by Jl. and P, where Jl. > - b 2 > p> - a 2 . The curve corresponding to p = Jl. or P = p is an ellipse or a hyperbola, respectively. Then we have the relations x 2 = (Jl. + a 2 )(p + a 2 ) j(a 2 - b 2 ), y2= (Jl.+ b 2 )(p + b 2 ) j(b 2 - a 2 ). Let the common foci be (:t c, 0) (c 2 = a 2 - b 2 ). Then we have rl = y( x- C)2 + y2 , r2= y( x+C)2+ y 2 where rl' r2 are the distances from the two foci as in Fig. 5, and 4( a 2 + 11) = (rl + r2)2, 4(a 2 + p) = (rl - r2)2. 1 p=- 2 Jl.-p ( Jl. + a 2 )( Jl. + b 2 ) , 1 q=- 2 P-Jl. (p+a 2 )(p+b 2 ) (iii) Parabolic Coordinates (a,{3) (Fig. 6). Among the family of parabol asy 2 = 4p(x + p) with the focus at the origin and having the x-axis as the principal axis, there are two values of p for which the curve passes through a given point P (x,y). Denote the two values of p by a, {3 (a> 0 > (3). We have x= -(a+ {3), y = V -4a{3 . (iv) Equilateral (or Rectangular) Hyperbolic Coordinates (u,v) (Fig. 7). This is a system that 
App. A, Table 3.V Vector Analysir; and Coordinate Systems 1728 replaces xj2, yj2 in (iii) by - y and x, respectively, with =u, H = v. The relations are x=uv, y=(U 2 -v 2 )j2; x+iy=i(u-iv)2j2, u 2 , V 2 = JX2+ y2 i:Y, p=q=F V2 . The curves x = constant or y = constant are equilateral hyperbolas. (v) Bipolar Coordinates (g,1)) (Fig. 8). These coordinates represent a point P(x,y) on a plane as the intersection of the family of circles passing through two fixed points (:!: a, 0) and the family of loci on which the ratio of distances from the same two fixed points (:!: a, 0) is constant. The latter is the set of Apollonius' circles. The relations are a sinh g x= coshg+cos1) , a p=q= coshg+cos1)' a Sin 1) Y = coshg + cOS1) (-00 <g<oo, 0.;; 1)';; 2'17). (2) Curvilinear Coordinates in 3-0imensional Space. In the present Section (2), we put Xl =x, X2=y,X3=Z. (i) Circular Cylindrical Coordinates (Cylindrical Coordinates) (p,cp,z) (Fig. 9). x=pcoscp, y=psmcp, z=z. 1 a ( a j ) 1 a 2 j a 2 j ds 2 =dp2+p 2 dcp2+dz 2 , J=p.!!.j p + + = p ap ap p2 acp2 az2 ' (ii) Polar Coordinates (Spherical Coordinates) (Fig. 9). x=rs inOcoscp, y =rsinOsincp, z=rcosO. r= yx 2 +y2+z2, cp=arctan(y/x), O=arctan( Vx2+y2 /z). y \ \ Fig. 5 y Fig. 7 z y Fig, 9 x x Fig. 6 H x .- / , I I I x I /' --- Fig. 8 z x Fig. 10 
1729 App. A, Table 3.V Vector Analysi!; and Coordinate Systems The angles qJ and fJ are called azimuth and zenith angle, respectively. We further have ds 2 = d,2 + ,2dfJ2 +,2 sin 2 fJdqJ2, J = ,2 s infJ. 1 a ( 2 a f ) 1 a ( . a f ) 1 a 2 f !J.f=-- , - +-- smfJ- +--. ,2 a, a, ,2 sinfJ afJ afJ ,2 sin 2 fJ aqJ2 (iii) Euler's Angles (Fig. 10). Let (x,y,z) and (, 1J, n be two linear orthogonal coordinate systems with common origin O. Denote the angle between the z-axis and the -axis by fJ; the angle between the zx-plane and the z-plane by qJ; and the angle between the 1J-axis and the intersec- tion OK of the xy-plane and the g1J-plane (or the angle between the g-axis and the intersection OL of the z-plane and the 1J-plane) by 1/!. The angles fJ, qJ, and 1/! are called Euler's angles. The direction cosines of one coordinate axis with respect to the other coordinate system are as follows: x y z g cosqJ cosfJ cos1/! - sinqJ sin 1/! sin qJ cos fJ cos 1/! + cos qJ sin 1/! - sin fJ cos I/- 1J - cosqJ cosfJ sin 1/! - sinqJ cos1/! - sinqJ cosfJ sin 1/! + cosqJcos1/! sinfJsinl/-  cos qJ sinfJ sin qJ sin fJ cosfJ (iv) Rotational (or Revolutional) Coordinates (u,v,p). Let (u,v) be curvilinear coordinates (Sec- tion (I» on the zp-plane. The rotational coordinates (u,v,p) are given by the combination of x=pcosqJ,y=psinqJ with the coordinates on the zp-plane. We have ds 2 = p2 du2 + q2 dv2 + p2 dqJ2, where p, q are the corresponding values for the coordinates (u, v). (v) Generalized Cylindrical Coordinates (u,v,z). These are a combination of curvilinear coordi- nates (u,v) on the xy-plane with z. We have ds 2 = p2 du2 + q2 dv 2 + dz2. For various selections of (u,v) we have coordinates as follows: Rotational Generalized (u,v) Coordinate System Cylindrical Coordinate System Linear rectangular Circular cylindrical Linear rectangular coordinates coordinates coordinates Polar coordinates Spherical coordinates Circular cylindrical « 1 )(i» coordinates Elliptic coordinates Spheroidal coordinates(l) El1iptic cylindrical «(I)(ii» coordinates Parabolic coordinates Rotational parabolic Parabolic cylindrical «1 )(iii» coordinates(2) coordinates Equilateral hyperbolic Rotational hyperbolic Hyperbolic cylindrical coordinates «I )(iv» coordinates coordinates Bipolar coordinates Toroidal coordinates(3) Bipolar cylindrical ((l)(v» Bipolar coordinates(4) coordinates Notes (1) When the p-axis is a minor or major axis, we have prolate or oblate spheroidal coordi- nates, respectively. (2) We take the z-axis as the common principal axis of the parabolas. (3) Where the line passing through two fixed points is the p-axis. (4) Where the line passing through two fixed points is the z-axis. (vi) El1ipsoidal Coordinates (A.,!-,-,p) (Fig. II). Among the family of confocal quadrics x2 y2 Z2 -+-+-=1 (a>b>c>O), a 2 +p b 2 +p c 2 +p 
App. A, Table 4 Differential Geometry 1730 there are three values of p for which the surface passes through a given point P(x,y,z). Denote the three values of p by A, /L, v, where A> - c 2 > /L> - b 2 > V > - a 2 . The surfaces corresponding to p = A, p = /L, and p = v are an ellipsoid, a hyperboloid of one sheet, and a hyperboloid of two sheets, respectively. We have X2= h(a) (a 2 -b 2 )(a 2 -c 2 ) , 2 h(c) z = . (c 2 _a 2 )(c 2 _b 2 ) , Y(A-/L)(A-V) gl = 2p(A) Y(V-A)(V-/L) g3= 2p(v) 2 h(b) Y = (b 2 -c 2 )(b 2 -a 2 ) ' p(t)= Y (t+a 2 )(t+b 2 )(t+C 2 ) . y h (a:) =(A + a: 2 )( /L + a: 2 )(v + a: 2 ). x g2= Y(/L-V)(/L-A) 2p( /L) Fig. 11 4. Differential Geometry (I) Classical Differential Geometry (111 Differential Geometry of Curves and Surfaces) (1) Plane Curves (Fig. 12). At a point P (xo,Yo) on a curve y = f(x), the equation of the' tangent line is Y - Yo= f'(xo)(x - xo), PT=IYo Yl+Yi? 1YOI, and the tangential shadow TM=Yolyo' The equation of the normal line isf'(xo)(Y-Yo)+(x- xo) = 0, PN=\yo yl+yo2 1, and the normal shadow MN = YoYo' The slope of the tangent is tan a: = f'(xo) = Yo' "(he curvatUre at Pis 2 3/2 K = 1 1 PQ = f"(xo)1 [1 + f'(xo) ] The coordinates of the center of curvature Q are (xo- f'(xo) [I + f'(xon 1 f"(xo), f(xo) + [1 + f'(XO)2J; f"(xo»). Y Q o Fig. 12 (2) Space Curves Xj=Xj(t) (i= 1,2,3), or x=x(t). The line element of a curve x=x(t) is V 2 2 2 , ds= (dxl) +( dX 2) +( dX 3) = V a1 x dt The curvature is (- = ; ). v ","2 "2 1 '2 K = £" Xa - S s For t=s (arc length), the curvature is K= YC'i.x;2), and the torsion is 'T=[det(x,x:, X")a_I.2.3J!K2, where '=dlds. When we denote Frenet's frame by (,1I,n we hav,e =x', 11 =f IK, K=€x 11 (vector product). 
1731 App. A, Table 4.1 Differential Geometry The Frenet-Serret formulas are f == K1J, r/, == - ,,+ -r!" r == -77/. (3) Surface in 3- Dimensional Space Xa == xa (ut. U2) (ex = 1,2,3). The first fundamental form of the surface is 3  aX a aX a k == £.J au. a;; a=1 "} k (j,k == 1,2). g == det( gjk) > O. Let (gjk) be the inverse matrix of (gjk)' The tangent plane at the point x() is given by det( Xa - xiO), (ax a / aU I )(0), (ax a / au 2 )(0») = O. The normal line at the point xO) is given by Xa - xO) = tvO), where t is a parameter, and Va is the unit normal vector, given by 1 a (XfJ,x y ) v = - (8;$$= + 1). a Yg a(u I ,U2) The second fundamental form is 3 2 3  a Xa  aVa aX a h jk == £.J Va aUjaUk == - £.J aU j aUk ' a=1 a=1 h == det(hjd. The principal radii of curvature Rt. R 2 are the roots of the quadratic equation  -  gjkhkl. +!!.- ==0. R 2 .L... "} R g j,k The mean curvature (or Germain's curvature) is 1 ( 1 1 ) I k H == 2. R + R = 2. £.. gJ h jk , I 2 j,k and H == 0 is the condition for the given surface to be a minimal surface. The Gaussian curvature (or total curvature) is K==!!.- - RIR2 g' and K = 0 is the condition for the surface to be developable. We use the notations of Riemannian geometry, with gjk the fundamental tensor: a 2 3 Xa  { a } aXa _ a . a == £.J k _ a + hjkV a U J Uk J U a a=1 (Gauss's formula). Rijkl = hj1h ik - hjkh il (Gauss's equation). hjk;1 = hjl;k (Codazzi-Mainardi equation). K== RI212 ==!i= [  ( yg { 2 }) _ ( yg { 2 })] g 2 Yg aU2 gll 11 aU I gll 12 1 [ a ( Yg { I }) a ( Yg { I })] = yg  g22 22 - aU2 g22 12 . aVa  kl aX a au == - £.J hjkg au; "} k,1 (Weingarten's formula). The third fundamental form is given by 3  aVa aVa  k == £.J au. aUk == £.. gSlhJshkl == 2 Hh jk - Kg jk . a= 1:J 5,t det( k) == K2g == Kh. (4) Geodesic Curvature. Let C: U; = Utes) be a curve on a surface Sand P be the curvature of C at a point P. Let e be the angle between the osculating plane of C and the plane tangent to S. The geodesic curvature Pg of C at P is given by ( du; d 2 u;  { i } dUj dUk ) Pg= pcose = vg det ds ' ds2 + £.J jk ds dS J,k 1-1.2 
App. A, Table 4.11 Differential Geometry 1732 Pg = ° is the condition for C to be a geodesic. Let D be a simply connected domain on the surface S, whose boundary r consists of n smooth curves. Let Ba be the outer angle at the intersection of two consecutive curves (ex = I, .. ., n). Then we have the Gauss-Bonnet formula: lpgds+ f f KdS=27T- i: Ba' r D a=1 (II) Riemannian Geometry, Tensor Calculus (417 Tensor Calculus) In the present section, we use Einstein's convention (omission of the summation symbol). (I) Numerical Tensor. Kronecker's S Sjk' SJk, S£ = {  (J=k) (J=I=k). Sh. J p =det ( Sjp ) _ = { : I k,...kp k" p,p-I, ..,p -I (Up} =I={k p }), (Up} = {k p } and (Jp) is an even permutation of (k.)), (Up} = {k p }) and Up) is an odd permutation of (k.). Eddington's E Eh ...j. = Sj: '::;., Eh ...jn = Sh...j.. I...n sj,. ..jpjp+ ,. ..jn = (n - p)! SJ,...jp. det (aU) = Eh ...jnaI a 2 an = E. . a J 'a 2 J2 .. . a n Jn . k""kpjp+)...j. k,...kp p P.p-I. "n 1,12'" 1. l,...Jn I (2) Fundamental Objects in Riemannian Geometry, Let gjk be the fundamental tensor, and (gJk) be the inverse matrix of (gjk)' We put g==det(gjk)' The Christoffel symbol is { i } =.l ia [ agaj + agak _ a g jk ] = { i } jk 2 g ax k ax j ax a kj' which has the transformation rule { a } = a log vg ak ax k ' { r } a xi ( axj a xk { i } a 2 x i ) jf = ax i ax J - ax" jk + axjax" under a coordinate transformation. A geometrical object r)k with a similar transformation rule is called the coefficiem of the affine connection. The torsion tensor is k ==rjk - r. The equation of a geodesic is d 2 x i { i } dxj dx k _ ds2 + jk dS dS -0. The covariant derivative of a tensor of weight W with respect to a coefficient of affi ne connec- tion r)k is given by p q p,...J p ==aTh...jpjax l +  Th...j,-,aj,+,."jpr j ,-  Tj'.........J p r a -WF,...Jpr a k,. .k.11 k,"'k. k,....... .k. al L.J k,...kp_,akp+,...k. Ikp  ...k al' v=I p=1 For the Christoffel symbol, we denote the covariant derivative by ;1. Then we have the- following formulas: glk;I=O, glk.I=O, c5i,I=O, .jgej,,,.ln.I=O, (l/.jg)ej,,,.in.I=O. For a scalar f gradf=(!,i)' for a covariant vector vi rot v=(v 1 . k - V k ) = (8vi/8xk_8vk/8xl), . . 1 8( ;g vi and for a contravariant vector vi dlV v = v J . i = .jg 7' Beltrami's differential operator of the first kind is t.d == gJ"J;j /k' 
1733 App. A, Table 4.11 Differential Geometry Beltrami's differential operator of the second kind is . 1 a (vg glk (aj / ax k») 2f=dlvgradj= - vg ax j For a domain D with sufficiently smooth boundary r, we denote the directional derivative along the inner normal by a / an, the volume element by dV, and the surface element on r by dS. Then we have Green's formulas, L(1(CPo/)+0/2CP)dV= -l 0/ : dS, f} CP20/ - 0/2CP) dV = [(0/  - cP  ) dS. We denote the curvature tensor with respect to the coefficients of a general affine connection r;k by Bj/' and by R/kl when r;k = { Jk }. We have the fol1owing formulas: ar i l ar i k B i - 1 1 r a r i r a r i . jkl =  - --'[ + jl ak - jk ai' ax ax [ 2 a  a 2 2 ] 1 a gik gjl k a g;1 Rijkl=-gaiR/kl= -Rijlk=Rklij="2 axjaxl + axiaxk - axiaxl - ax1ax k + gab ( {it} {;I } - {t } {Jk }); Bianchi's first identity RAI + Rklj + Rl k = 0, - ( BjL + Bklj + Bijk ) = 2( Sj11 + Sk/lj + Stlk ) + 4( Sfl + SkaSfJ + S!aZ ); Bianchi's second identity RA/; m + R)m; k + R/ mk ; I = 0, BjLlm + Bjmlk + i",kl/= - 2( B/maSf/+ BjLs/:" + B/laS::' k ); Ricci's tensor Rjk =- - RJ; = R kj ; scalar curvature R =-glkR jk ; Ricci's formula TJI "l - Th .jp k,"'k.lsil k, "'k.llis P q "" Tjl...l-,aj.+,...jpBj. +  Th.........jp B a +2Tjl"'1 SI + WTJ""l B a .::.. k" ......k asl'::" k,...k._,ak.+i>....k q k.st k,... II sl k,'''k . asl' v=\ I . 1'=\ . where Sand B are the torsion and curvature tensors given above, respectively, and W is the weight of the tensor T. (3) Special Riemannian Spaces ( 364 Riemannian Manifolds). In the present Section (3), n means the dimension of the space. (i) Space of Constant Curvature Rjkl = P(gjli5 - gjki5/); p =R/n(n -1), (ii) Einstein Space Rjk = pgjb P = R/n, for n  3, where R is a constant. (iii) Local1y Symmetric Riemannian Space Rjkl,m=O. (iv) Projectively Flat Space. Weyl's projective curvature tensor is defined by . . 1 . . a;kl == Rjkl +-(R jk i5 i - R jl i5 iJ n-1 The condition for the space to be projectively flat is given by a;,=0, Rjk,l=Rjl.k' If n  3, the latter condition fol1ows from the former condition, and the space reduces to a space of constant curvature. If n = 2, the former condition W = 0 always holds. (v) Concircularly Flat Space ZJkl=-R}kl+(gjki5/-gjli5D=O. This space reduces to a n(n-1) space of constant curvature. (vi) Conformal1y Flat Space. Weyl's conformal curvature tensor is defined by 1 . 1 . . . . R(gjk i5 /-g jl i5D C jkl =- Rjkl+ n_2 (R jk i5 i -R jl i5 k +gjkR;-gjlRk) (n-1)(n-2) ' RJk Rg jk k=- --+ (n-2) 2(n-1)(n-2) The condition for the space to be conformally flat is given by CJkl = 0, k.l = '.k' If n  4, the latter condition follows from the former condition, and if n = 3, the former condi- tion C=O always holds. 
App. A, Table 5 Lie Algebras, etc. 1734 5. Lie Algebras, Symmetric Riemannian Spaces, and Singularities (I) The Classification of Complex Simple Lie Algebras and Compact Real Simple Lie Algebras ( 248 Lie Algebras) (1) Lie Algebra. The unitary restriction of a noncommutative finite-dimensional com pie ( simple Lie algebra 9 is a compact real simple Lie algebra g., and 9 is given by the complexification g of g.. There exists a bijective correspondence between the classifications of these two kinds of Lie algebras. Using Dynkin diagrams, the classification is done as in Fig. 14 ( 248 Lie Algebras). The system of fundamental roots (IX 1"", IX,} of a simple Lie algebra 9 is in one-to-one corre- spondence with the vertices of a Dynkin diagram shown by simple circles in Fig. 14. The number of simple circles coincides with the rank I of g. The double circle in Fig. 14 means -1 times the highest rootB. Sometimes we mean by the term "Dynkin diagram" the diagram without the double circle and the lines issuing from it. Here we call the diagram with double circle r{presenting - B the extended Dynkin diagram. Corresponding to the value of the inner product with respect to the Killing form - 2(lX i , 1X)f( IX}, IX j ) (i # j) (which must be 0, 1, 2, or 3), we connect two vertices representing lX i and IX j as in Fig. 13. When the value is 0, we do not connect lX i and IXj' In Fig. 13, the left circle corresponds to lX i and the right circle to IXi' 0 0 0 0--0 -2(a"a J )/(a J ,a,) = 2 0= 3 0$eI0 Fig. 13 AI (l;i; 1)  1111111 12321 E'T -(J HI (1;i;2) n--L-W -(j E7  123432 -(J 2 C I (I ;i;3) 22222 1 @=o--<>--o------  -(j E8 3 234 2 F4 -(j DI  o l 2 2 2 ____ (1;i;4)  - -  _(j 1 2 3 G 2 @----os=$o -(j Fig. 14 We have relations H, = C, =A" C 2 =H2' and D3 =A3' (D 2 =A, + A" which is not simple.) In this figure, the number at each vertex means the coefficient m j in () = k m,iX i . From Fig. 14, we have the following information. (i) The quotient group of the automorphism group A (9) of 9 with respect to the inner automor- phism group I (9) is isomorphic to the automorphism group of the corresponding Dynkin dia- gram. The order of the latter group is 2 for Al (I ;;. 2) since the diagram is symmetric. It is also 2 for DI (l ;;. 5) and for E6, and it is 6 (= 3!) for D4' For all other cases, the order is I. 
1735 App. A, Table 5.11 Lie Algebras, etc. (ii) The order of the center of the simply connected Lie group associated with g is equal to the index of the subgroup consisting of elements stabilizing - 8 in the group of automorphisms of the extended Dynkin diagram of g (S. Murakami). This index is equal to the order of the fundamental group of the adjoint group of g and the number of connected Lie groups, whose Lie algebra is g. (iii) Any parabolic Lie subalgebra of g is isomorphic to a subalgebra generated by the root vector X" (and elements of the Cartan subalgebra) such that a =  nja j , where {a (, ..., ad is a system of fundamental roots, n j > 0 (i = I, ...,1) or nj ..; 0 (i = I, .. ., I), and n j = 0 for a j belonging to a fixed subset S of {a(, ...,ad. Hence, isomorphism classes of parabolic Lie subalgebras are in one-to-one correspondence with the set of subsets S of {a[, .'., ad. (iv) Maximal Lie subalgebra f of g with the same rank I as g. The Lie subalgebra f is classified by the following rule. First we remove a vertex a j from the Dynkin diagram. If the number m j attached to the vertex is 1, f is given by the product of the simple Lie algebra corresponding to the Dynkin diagram after removing the vertex a j and a one-dimensional Lie subalgebra. If m j > I, f is given by the diagram after removing a j from the extended Dynkin diagram. (2) Lie Groups. The classical complex simple Lie groups of rank n represented by A, B, C, D (in Cartan's symbolism) are the complex special linear group SL(n + 1, C), the complex special orthogonal group SO(2n + 1, C), the complex symplectic group Sp(n, C), and the complex special orthogonal group SO(2n, C), respectively. The classical compact simple Lie groups of rank n represented by A, B, C, D are the special unitary group SU(n+ 1), the special orthogonal group SO(2n + 1), the unitary-symplectic group Sp(n), and the special orthogonal group SO(2n), respec- tively ( 60 Classical Groups). Cartan's Complex Compact Dimension Rank Symbol Form Form An SL(n+I,C) SU(n+ I) (n + If-I n Bn SO(2n+ I,C) SO(2n+ I) 2n 2 + n n C n Sp(n,C) Sp(n) 2n2+n n Dn SO(2n,C) SO(2n) 2n 2 - n n G 2 Aut(£C Aut (£ 14 2 F4 Aut0 C Aut0 52 4 E6 78 6 E7 133 7 E8 248 8 Here (£ is the Cayley algebra over R, (£C is the complexification of (£, 0 is the Jordan algebra of Hermitian matrices of order 3 over (£, 0 c is the complexification of 0, and AutA is the automorphism group of A. (II) Classification of Noncompact Real Simple Lie Algebras Classical Cases AI All AlII BI BII CI CII Dl DII DIll Noncompact Real Simple Lie Algebra g €51(p+I;R) €51(n; H) 5U(p,q;C) €5o(p,q; R) €5o(l,n-l; R) €541(P; R) u(p, q; H) €5o(p,q; R) 50(I,n-I;R) 50(p; H) Maximal Compact Lie Algebra of g €5u(p + I) €541(n) 5U(p)+€5u(q) 50(p)+ 50(q) (p + q=2m + I) €5o(n-l) (n=2m+l) u(p) 541(P) H41( q) €5o(p)+€5o(q) (p+q=2m) 50(n-l) (n=2m) u(2p) Cartan's Symbol 
App. A, Table 5.11 Lie Algebras, etc. 1736 Here the field F is the real field R, the complex field C, or the quaternion field H (R c C c H). H is an algebra over R. For a quaternion x=xo+x1i+X2)+X3k(XO,xl,X2,x3ER), we put x= x o - xli- X2)- X3 k , x* = xo+ xli - X2) + X3 k . Then g! (n; F) = {set of aU square matrices over F of order n}, 51 (n; F) = { A E g! (n; F) I tr A = O}, 50 (p,q; F) = { A Eg!(p + q; F) I fA * Ip,q + Ip,qA = a}, where Ip,Q is the symmetric transformation of the Euclidean space Rp+q with respect to RP, i.e., AI -----()-----<) EI All ---- ----n (( ( , ----U Ell   AlII (Ir> Em Y B I --------o---...------ EN BII o . . ---- . ...;. EV CI ---- ------o---.----- EVI cn{ ----  --------0---...-------< Evn Dl ----) EVIll -----< EIX DII o . . -----< FI ( -----< DIII ----) FII . . 0 G  Fig. 15 
1737 App. A, Table 5.I1I Lie Algebras, etc. 1p.q is the diagonal sum of the unit matrix 1p of order p and - 1q. We have 5o(n; F) =5o(n,0; F), u(p,q;F)= {A EgI(p+q;F)! tA1p.q+1p.qA =O}, u(n; F) = u(n,O; F), 541(n; F) = {A EgI(2n; F) ItA] +]A =O}, where] is the matrix of an alternating form 7= \(x i Yi+n - x i + n yJ of order 2n. A noncompact real simple Lie algebra g is classified by the relation of the complex conjugation operator IJ with respect to the complexification gC of g. The results are given by Satake's diagram (Fig. 15). In the diagram, the fundamental root corresponding to a black circle is multiplied by - 1 under IJ for a suitable choice of Cartan subalgebra, and the arc with an arrow means that two elements corresponding to both ends of the arc are mutua11y transformed by a special transfor- mation p such that IJ = PW (w E W). (III) Classification of Irreducible Symmetric Riemannian Spaces (412 Symmetric Riemannian Spaces and Real Forms) A simply connected irreducible symmetric Riemannian space M = G / K is either a space in the following table or a simply connected compact simple Lie group mentioned in (I). The noncom- pact forms uniquely corresponding to the compact symmetric Riemannian space are in one-to- one correspondence with the noncompact real simple Lie algebras mentioned in (II). Cartan's G/K=M Dimension Rank Symbol AI SU(n)/ SO(n) (n > 2) (n-l)(n+2)/2 n-I All SU(2n)/ Sp(n) (n> I) (n -1)(2n + 1) n-l AlII U(p + q)/ U(p) X U(q) (p> q> 1) 2pq q BDI SO(p + q)/ SO(p)X SO(q) (p;;.q;;.2,p+q"",4) pq q BDII SO(n + 1)/ SO(n) (n ;;. 2) n 1 DIll SO(2/)/ U(I) (I ;;. 4) /(1-1) [//2] CI Sp(n)/ U(n) (n;;' 3) n(n + 1) n CII Sp(p + q)/ Sp(p) X Sp(q) (p;;. q> 1) 4pq q EI E6/ Sp(4) 42 6 Ell E6/ SU(2).SU(6) 40 4 ElII E6/ Spin(1O)'SO(2) 32 2 EIV E6/ F4 26 2 EV E7/ SU(8) 70 7 EVI E 7 /Spin(12). SU(2) 64 4 EVIl E7/E6 . SO(2) 54 3 EVlII E 8 /Spin(16) 128 8 EIX E8/E7' SU(2) 112 4 FI F 4 /Sp(3)' SU(2) 28 4 FII F 4 /Spin(9) 16 I G G 2 /S0(4) 8 2 Notes The group G = U(p+ q) in AlII is not effective, unless it is replaced by SU(p+ q). To be precise, K = Sp(4) in EI should be replaced by its quotient group factored by a subgroup of order 2 of its center. K in Ell is not a direct product of simple groups; the order of its fundamental group 11: 1 (K) is 2. To be precise, K in EV or EVlII should be replaced by its quotient group factored by a subgroup of order 2 of its center. The K's in Ell, EIIl, EVI, EVIl, EIX, and FI are not direct products. The fundamental group 11: 1 (K) of K is the infinite cyclic group Z for EIII, EVIl; for a11 other cases, the order of 11: 1 (K) is 2. In EIII, EVIl, the groups E6' E7 are adjoint groups of compact simple Lie algebras. In other cases, E6 and E7 (E 8 , F4 and G 2 also) are simply connected Lie groups. The compact symmetric Riemannian space M is a complex Grassmann manifold for AlII, a real Grassmann manifold for BDI, a sphere for BDII, a quaternion Grassmann manifold for CII, and a Cayley projective plane for FII. 
App. A, Table 5.IV Lie Algebras, etc. 1738 (IV) Isomorphic Relations among Classical Lie Algebras The isomorphic relations among the classical Lie algebras over R or C are all given in the following table. In the table, we denote, for example, the real form of type AI of the complex Lie algebra with rank 3 by A31 in Cartan's symbolism. When there are non isomorphic real forms of the same type and same rank (e.g., in the case of D 3 1) we distinguish them by the rank of the corresponding symmetric Riemannian space and denote them by, e.g., D31p, where p is the index of total isotropy of the sesquilinear form which is invariant under the corresponding Lie algebra. Cartan's Symbol AJ=BJ=C J B 2 = Cz A3 = D3 AJI=AJIII=BJI= CJI B 2 1 2 = CzI B 2 1( = C 2 11 A3 1 = D3 1 3 A3 11 = D3 1 J A3 111 2 = D3 1 2 A3 111 J = D3 111 D4 1 2 = D4 111 D2=AJXAJ* D212=AJIXAJI D2111=AJ XAJI* D21J =AJ* Isomorphisms among Classical Lie Algebras 51(2, C)  50(3, C)  541(1, C); 5u(2) 50 (3)  541(1) 50 (5, C)  5j:J (2, C); 50 (5)  5j:J (2) 51(4,C)50(6,C); 5u(4)50(6) 51(2, R) 5u(l, I; C)  50(2,1; R)  541(1; R) 50(3,2; R) 541(2, R) 50(4, I;R)u(I, 1;H) 51(4,R) 50(3, 3; R) 5(2,H)50(5, I; R) 5u(2,2; C)50(4,2; R) 5u(3, 1; C)50(3; H) 50(6,2; R)50(4,H) 50(4, C)  51(2, C) X 51(2, C); 50 (4)  5u(2) X 5u(2) 50(2,2; R)  51(2, R) X 51(2; R) 50(2; H)  5u(2) X 51(2; R) 50(3, I;R)51(2,C) Note (*) In these 3 cases, there are isomorphisms given by the replacement of 51 (2, C) or 5u(2) by isomorphic Lie algebras of type B( or type C J due to the isomorphism AJBJCJ' (V) Lists of Normal Forms of Sing ulari ties with Modulus Number m=O, 1, and 2 (418 Theory of Singularities) Letters A, "', Z stand here for stable equivalence classes of function germs (or families of function germs). (1) Simple Singularities (m=O). There are 2 infinite series A, D, and 3 "exceptional" singularities E6' E 7 , E8: Notation A. D. E6 E7 E8 Restrictions Normal form x.+1 + y2 +Z2 x.- 1 + xy 2 +Z2 x 4 + y3 + Z2 x3 y + y3 + Z2 x5 + y3 + Z2 nl n4 (2) Unimodular Singularities (m= 1). There are 3 families of parabolic singularities, one series of hyperbolic singularities (with 3 subscripts), and 14 families of exceptional singularities. The parabolic singularities Notation Normal form Restrictions P 8 = E6 x 3 + y3 +z3 +axyz a 3 +27#O X 9 = £7 x 4 + y4 +Z2 +axyz a4-64#O J 10 = £8 x6+y3+z2+axyz a 6 -432#O 
1739 App. A, Table 5. V Lie Algebras, etc. The hyperbolic singularities Notation Normal form Restrictions q, x P + yq +z' +axyz 1 1 1 a#O, -++-< 1 p q r The 14 exceptional families Gabrielov Dolgachi:v Gabrielov Dolgachi:v Notation Normal form numbers numbers Notation Normal form numbers numbers -- K 12 x 3 +y7+z2+axy' 2 3 7 2 3 7 W 13 x' + xl' + Z2 + ay6 2 5 6 3 4 4 K13 x 3 + xy' + Z2 + ay' 2 3 8 24 5 QI0 x 3 + y' + yz2+axy3 3 3 4 2 3 9 K 1 . x 3 + Y'+Z2+ axy 6 2 3 9 3 3 4 Q11 x 3 + y2Z+ xz 3 +az' 3 3 5 24 7 ZII x 3 y+ Y'+Z2+ axy ' 2 4 5 2 3 8 Q12 x 3 + y' + yz2 + axy' 3 3 6 3 3 6 Z12 x 3 y + xy' + Z2 + ax 2 y3 24 6 2 4 6 8 11 x 4 + y2z+xz 2 +ax 3 z 3 4 4 2 5 6 Z13 x 3 y+ y6+z2+axy' 2 4 7 3 3 5 8 12 x 2 y + y2 Z + xz 3 + az' 3 4 5 3 4 5 W 12 x 4 +y'+z2+ax 2 y3 2 5 5 2 5 5 U 12 x 3 +y3+z4+axyz2 44 4 4 4 4 (3) Bimodular Singularities (m = 2). There are 8 infinite series and 14 exceptional families. In all the formulas, a=aO+alY' The 8 infinite series of bimodular singularities Notation Normal form Restriction Milnor number J 3 . 0 x 3 +bx 2 y3 + y9 +Z2 + cxy 7 4b 3 +27#O 16 J 3 . P x 3 +X 2 y3+Z2 +ay9+P p>O, ao#O 16+p 21.0 y(x 3 +dx 2 y2 + cxy 5 + y6)+Z2 4d 3 +27#O 15 2 1 . P y(x 3 + x 2 y2 + ay6+P) + Z2 p>O, ao#O 15+p WI. 0 x 4 +ax 2 l + y6+Z2 aij#4 15 W 1 . p x 4 +x 2 y3+ay6+p +Z2 p> 0, ao # 0 15+p W12q-l (x 2 + y3)2+axy4+q+z2 q>O, ao#O 15+2q-l Wl2q (x 2 + y3)2 + ax2 y3+ q + Z2 q>O, ao#O 15 + 2q Q2.0 x 3 + yz2 +ax 2 y2 + xy* a#4 14 Q2.P x 3 + yz2 +X2y2 +az6+p p>O, ao#O 14+p SI.O x 2 z+ yz2+ y5 + azy 3 a#4 14 Sl.P x 2 z+ yz2+X 2 y2 +ay5+ P p>O, ao#O 14+p Sf.2q-1 x 2 z + yz2 + zy3 + axy2+ q q>O, ao#O 14+2q-l Sf.2q x 2 z + yz2 + zy3 + ax 2 y2+ q q>O, ao#O 14 + 2q U 1 . 0 x 3 +xz 2 + xy 3 +ay3z ao(a+I)#O 14 U 1 . 2q - 1 x 3 +xz 2 + xy 3 + ay 1 +qz2 q> 0, ao # 0 14+2q-1 UI. 2q x 3 +xz 2 + xy 3 +ay3+Qz q>O, ao#O 14+2q The 14 exceptional families Notation Normal form Notation Normal form £18 x 3 + y10 +Z2 + axy 7 W 18 X 4 +y7 +z2+ax 2 y* £19 x 3 + xy 7 +Z2 + ay 11 Q16 x 3 + yz2 + y7 +Z2 + axy 5 £20 x3+yll+z2+axy8 Q17 x 3 + yz2 + xy 5 +Z2 + ay 8 217 x 3 y+y8+z2+axy6 Q18 x 3 + yz2 + y8 +Z2 + axy 6 2 18 x 3 y+xy6 +Z2+ ay 9 S16 x 2 z+ yz2 + xy 4 +Z2 + ay 6 2 19 x 3 y+y9+z2+axy7 S17 x 2 z+ yz2 + y6 +Z2 +azy* W 17 x 4 + xy 5 +z2+ay7 U 16 x 3 +xz 2 + y5 +Z2 +ax 2 y2 
App. A, Table 6 Topology 1740 Adjacency relations between simple and simply el1iptic singularities ,.;f:!' /'/ti;'"r'7"7 A lof-- Az A3 A 4 As+-- A 6 +--A 7<1----AfI+-- A 9-- A lO - Adjacency relations among unimodular singularities KI4 t K l3 T Z39 2 13 W l3 ll K 12 T 238 2 12 T Z 47 W 12 T 256 l TZ46 T255 1 _ t QI:--------jI"5 _ T 3H x 9 = £7 1 _ P;cE, References [IJ V. I. Arnol'd, Singularity theory, Lecture note ser. 53, London Math. Soc., 1981. [2] E. Breiskorn, Die Hierarchie der I-modularen Singularitiiten, Manuscripta Math., 27 (1979), 183-219. [3] K. Saito, Einfach el1iptische Singularitiiten, Inventiones Math., 23 (1974), 289-325. 6. Topology (I) h-Cobordism Groups of Homotopy Spheres and Groups of Differentiable StructUf{'s on Combinatori::.l Spheres (1) The Structure of the h-Cobordism Group On of n-Dimensional Homotopy Spherls. In the fol1owing table, values of On have the fol1owing meanings: 0 means that the group consists only of the identity element, an integer.! means that the group is isomorphic to the cyclic group of order !, 2' means that the group is the direct sum of ! groups of order 2, + means the direct sum, and? means that the structure of the group is unknown. n 12345678 9 10 11 12 13 14 15 16 17 18 2 3 or On"'" 0 0 ? 0 0 0 28 2 4+2 6 992 0 3 2 4 or 2 8128+2 2 4+22 8+2 (2) The Group f n of Differentiable Structures on the n-Dimensional Combinatorial Sphere. f n "'" On (n"",3), f 3 =0. (II) Adem's Formula Concerning Steenrod Operators Sq and f!JJ (64 Cohomology Operations) For the cohomology operators Sq and f!JJ, we have [a I 2J ( b-C-l ) SqaSqb= L Sqa+b-csqc (a<2b). c=O a-2c 
1741 App. A, Table 6.I1I Topology [alp] ( b-C )(P -1 ) -1 ) rJ"'f?lb= I (_1)a+c rJ"'+H[JJ< (a <pb), c=O a-pc [alp] ( b-C )(P -1 )) !JJi" {jf?lb = I ( _1)a+c {jf?la+b-c [JJ< c=O a-pc [(a-1)/pJ (( b-C )(P -1 ) -1 ) + I (_1)"+C-1 rJ"'+b-c(j[JJ< co a-pc-l (apb). Several simple cases of the formula above are as follows. Sq1 s q 2n = S q 2 n +!, Sq2 Sq4n =Sq4n+2 + Sq4n+! Sq1, Sq2 Sq4n+2 = Sq4n+3 Sq1, Sq 4 S q 8n= S q 8n+4 + Sq8n+3 Sq1 + Sq8n+2Sq2, Sq4 Sq8n+ 2 = Sq8n+4 Sq2, Sq 4 S q 8n+4 = Sq8 n +7 Sql + Sq8 n +6 Sq2, Sq4 s q 8n+6 = s q 8n+10 + Sq8n+8Sq2, f?l1f?1n=(n+ 1)f?l n +1, f?l1 {jf?ln = n' (jf?ln+1 + f?l n + 1 (j. Sq1 Sq2n+1 =0, Sq2 Sq4n+! = Sq4n+2 Sq1, Sq2 Sq4n+3 = Sq4n+S + Sq4n+4sq1, Sq 4 S q 8n+1 = Sq8n+4S q 1 + Sq8n+3 Sq2, Sq4 Sq8n+ 3 = Sq8n+ 5 Sq2, Sq4 S q 8n+5 = s q 8n+9 +S q 8n+8 Sq1 + Sq8n+7 Sq2, Sq4 Sq8 n +7 = Sq8 n +11 + Sq8n+9Sq2, (III) Cohomology Ring H*(7T,n;Zp) of Eilenberg-MacLane Complex (70Complexes) Z means the set of integers, and Zp = Zj pZ, where p is a prime number. (I) The casep=2, 7T=Z or Z2f U> 1). The degree of a finite sequence l=(i"i r - I , ...,i l ) of positive integers is defined by d(l) = i 1 + i 2 + ... + ir' If such a sequence satisfies ik+ I> 2i k (k = I, .. ., r - I), it is called admissible, and we define its excess by e(J) = (ir - 2i r _ I ) + ... +(i2 - 2r l ) + i l = 2i r - d(J). Further, we put Sql = Sqi'Sqi'-l ... Sqi,. Then we have H*(Z2f, n; Z2)= Z2[Sq I U n I I is admissible, e(I)< n], H*(Z, n; Z2)= Z2[Sql Un I I is admissible, e(I) < n, i 1> 1]. Here, UnE H"(n, n; Z2) is the fundamental cohomology class. 1=0 (empty) is also admissible, and for this case we put n(I)=e(I)=O, Sql = 1. Due to the Kunneth theorem, we have H*( 'IT + 'IT', n; Zp) = H*( 7T, n; Zp)Q1) H*( 7T', n; Zp) if 7T is finitely generated. In particular, we have H*(Z2,I;Z2)=Z2[Uj], H*(Z2' 2; Z2) = Z2 [ U2' Sq I U2 , Sq 2 Sq I U2 , . . ., Sq 2 'sq2' -I.. . Sq I U2 ,... ], H*(Z2' 3; Z2) = Z2[ UJ' Sq2'Sq2'-1 .., Sq I UJ , Sq(2'+ 1)2' Sq(2' + 1)2'-1 ... Sq2'+IS q 2'-I...S q l uJ lr>0, 5>0] (2) The case p "'" 2, 7T = Z or Zpf U> I). We define the degree of a finite sequence 1= (ir' i r _ I' ..., iI' io) of nonnegative integers by d (I) = i r + . .. + i 1+ io. The sequence I is called admissible if it satisfies the following conditions: i k = 2Ak(P - I) + €k (Ak is a nonnegative integer, €k = 0 or I (0.;; k .;; r)), and io=Oorl, i l >2p-2, ik+l>pi k (1kr-1). We define its excess by e(l) = pir - (p - I)d (l). Further, we put 0'/ = S<k0'>v,... 8<'0'18<0, and assume that Un E Hn(7T,n; Z) is the fundamental cohomology class. Then we have H*(Zpf, n; Zp)= Zp[f?l l Un I I is admissible, e(I)< n(p-l), n +d(I) is even] 01\z (f?I/ unl I is admissible, e(I)< n(p-1), n + d(I) is odd). p H*(Z, n; Z) is given by the above formula when the admissible sequence is I with io = O. 
App. A, Table 6.IV Topology 1742 (IV) Cohomology Ring of Compact Connected Lie Groups (427 Topology of Lie Groups and Homogeneous Spaces) (1) General Remarks. Let G be a compact connected Lie group with rank 1 and dimension n. We have H*(G; R) !\R(X 1 , ... ,x,), where !\K(X 1 "" ,x,) means the exterior algebra over K of a linear space V = Kx 1 + ... + Kx , with the basis {x l' ... ,X,} over K. We define a new degree in !\K(X 1 , ... ,x,) by putting degxi=m;(m; is odd)(1  i I), where m 1 + .., +m,=n. The  means isomorphism as graded rings. (2) Classical Compact Simple Lie Groups. We set degxj=i. H*( U (n); R);;;,; AR(XI,x3' ... ,X2n-l), H*(SU(n);R);;;,; AR(x3,x5' ...,X2n-l), H*( Sp(n); R);;;,; AR(X 3 ,X7, .., ,X 4n -l), H* ( SO (n); Z2) ;;;,;(Ha ving x t. X2, ..., X n _ I as a simple system of generators) ;;;,; Z2[Xt.X3, '" ,X2n'-IJ; (x?,w I i = I, .., ,n') (n'=[nI2], s(i) is the least integer satisfying 2 S (;)(2i- 1) > n) H*( SO (2n); K);;;,; AK(x3' x7, ..., X4n-5,X2n -I), H*(SO(2n-l);K);;;,; AK(X3,X7, ...,X4n-5), where K is a commutative field whose characteristic is not 2. For SO (n), SqO (Xi) = ()Xi+O' For SU (n), pO (x2i-l) = e  1 )X2i-l+20(P-l)' For Sp(n), (;])0 ( ) _ ( _ 1) 0(P-l)/2 ( 2i-l ) J X4i-l - a x4i-l+20(p-I)' (3) Exceptional Compact Simple Lie Groups. nand m i (I .;; i .;; I) given in (I) are as fol1ows. G 2 : n= 14, m i =3, 11. F4: n=52, m i =3, 11, 15, 23. E6: n=78, m i =3, 9, 11, 15, 17, 23. E7: n= 133, m i =3, 11, 15, 19, 23, 27, 35. E8: n = 248, m i = 3, 15, 23, 27, 35, 39, 47, 59. (4) p-Torsion Groups of Exceptional Groups. The p-torsion groups of exceptional Lie groups are unit groups except when p = 2 for G 2 ; p = 2, 3 for F4' E6, E 7 ; and p = 2, 3, 5 for E8' The cohomology ring of Zp as a coefficient group in these exceptional cases is given as fol1ows. Here we put degx i = i. H*( G 2 ; Z2) =Z2[X 3 Ji( x1)0 A z ,( Sq2 X3 ); H*(F 4 ; Z2) =Z2lxJ/( x1)0 Az,( Sq2 x3 ,xI5' Sq8 xI5 ), H*(F 4 ; Z3) = Z3[ I> 0'1X3] I( (1)0' 1 X3)3) 0 Az,( X3' 0'I x3 ,Xl1' 0'I Xll ); H*(E 6 ; Z2) = Z2[x 3 ]/( xj)0 Az,( Sq2x3,Sq4Sq2x3,xI5,Sq8Sq4Sq2x3,Sq8xI5)' H*(E 6 ; Z3) = Z3[ IS 0'1X3] I( (IS 0'Ix3n0 Az,( X 3 , 0'I x3 ,X9' XII> 0'lxl1' X17); H*(E 7 ; Z2) = Z2[ X3' Sq2x3' Sq 4 Sq2x3] I( xj, (Sq2 X3 ) 4, (Sq 4 S q 2 x3 )4) o A z ,( X 15, Sq 8 Sq 4 Sq2x3' Sq8x 15' Sq 4 Sq8x 15)' H*(E 7 ; Z3) = Z3[ I> 0'1X3] I( (I> 0'1 x3 n 0 Az,( X 3 , 0'1x3' Xl1' 0'lxll' 0'30'I x3 ,x27,X35); H*(E 8 ; Z2) = Z2[ X 3 , Sq2x3' Sq4Sq2x3,XI5] I( x6, (Sq2X3)8, (Sq 4 S q 2 x3 )4,xf5) o Az,( Sq 8 Sq 4 Sq2x3' Sq8x 15' Sq 4 Sq8x 15, Sq 2 Sq 4 Sq8x 15)' H*(E 8 ; Z3) = Z3 [ I> 0'1X3' I> 0'30'1 X3] I( (IS 0'1 X3/' (IS 0'30'1 X 3 )3) o Az,( X3' 0' 1x 3' X 15 ' 0'30' 1x 3' 0'3 x15 , x 35 ,X39,X 4 7)' H*(E 8 ; Z5) = Z5 [ IS 0'1X3] I( (IS 0'I X3 )5)0 Az,( X 3 , 0'I x3 ,xI5' 0'1 x15 , X 27 ' X35' X39,X 4 7)' 
1743 App. A, Table 6.VI Topology (V) Cobomology Rings of Classifying Spaces (56 Characteristic Classes, 427 Topology of Lie Groups and Homogeneous Spaces C) (1) Let H*(G; K)= AK(Xj,X2'''' ,x n ). Then the degx t are odd and the Xt may be assumed to be transgressive. Yi being its image, the following formula holds: H*(BG; K)=K[Yj,Y2' '" ,Yn] (Borel's theorem). (2) H*(BU(n))=H*(BGL(n, C))= Z[C1, C 2 ,..., cn], H*(BSU(n)) = H*(BSL(n, C))= Z[C2'''' ,cn]' H*(BSp(n))=Z[q1,Q2' ...,qn], H*(BO(n); K 2 )= H*(BGL(n, R); Z2)= K 2 [W 1 , w 2 , ..., w n ], H*(BSO(n); K 2 )= H*(BSL(n, R); Z2)= K 2 [W 2 ,..., w n ], H*(BSO(2m+ 1); K)= K[P1' P2' ..., Pm], H*(BSO(2m); K)= K[P1,P2'''' ,Pm-I, xl Here, K denotes a field of characteristic #- 2, and K 2 is the field of characteristic 2. The C t denote the ith Chern classes and the qt the ith symplectic Pontryagin classes, the W t the ith Stiefel-Whitney classes. Moreover, the Pt denote the iih Pontryagin classes, and X the Euler class. Their degrees are given as follows: degct=2i, degqi=degpi=4i, degwi=i, and degx=2m. (3) Wu's Formula. Let H 2 (BSO(n); Z2)= Z2[W2"", w n ] and H*(BU(n), Z2)= Z2[C 1 , C2,"" cn]' We have . ( i - j + t - 1 ) SqJw t = I wj-,w i +' O';'';j t (W O = 1), 2' ( i- j +t-1 ) Sq J Ct = I Cj-,Ct+, O';'';j t (co = 1). Here the symbol () denotes the binomial coefficient for a b; (0)= 1, and () =0 otherwise. (VI) Homotopy Groups of Spheres (202 Homotopy Theory) Table of the (n + k)th Homotopy Group 7t n +k(sn) of the n-Dimensional Sphere sn. The table represents Abelian groups. 0 stands for the unit group; integer 1 the cyclic group of order 1; 00 the infinite cyclic group; 2/ the direct sum of 1 groups of order 2; and + means the direct sum. n k <0 0 2 4 6 7 9 10 11 12 13 4 0 1 00 00 - hi o 00 1 00 2 - hi o 00 2 I 2 - -, 2 I 00+12 2 2 2 2 24+3 - - - -I o 00 2 2 24 I 2 - - -I 0 1 00 - hi o 0 \ 2 - - --I 2 100+120 2 4 ---, 240 I.. _2 o o o o o o o o o o o 2 12 2 15 2 2 2 12+2 84+2 2 2 2 12 2 15 2 2 12+2 84+ 2 2 2 2 6 o 00 15 2 3 120+12+2 72+2 84+ 2' 504+ 2 2 2 6 24+ 6 + 2 30 2 2 3 2 3 6+2 6 o 00 2 24 60 24+2 2 3 72+2 504+4 240 6 7 o 00 2 2 24 120 2 3 2 4 24+2 504+ 2 o 6 o 00 2 24 o o 2' 24+24+2 504+2 o 6+2 00 2 24 o 240 2 4 2 2 100+23 12+2 I - hi 2 2 2 3 I 6+2 504 - - - - -I 2 3 6 I 00+504 - - --I 504 I 2 - - - -I o 1 504 12 6 6 9 10 o 00 2 2 24 o o 2 24+2 504+2 . 0 o 00 2 24 o o 2 240 II o 00 2 24 o o 2 240 2 6+2 12 o o 2 2 2 2 2 6+2 13 o 00 2 24 o o 240 2 2 2 3 6 6 14 o 00 24 o o 2 240 2 2 2 3 6 504 00+3 ;>15 0 00 2 2 24 o 0 240 2 2 2 3 6 504 o 
App. A, Table 6.VII Topology 1744 n 14 Table of the (n + k)th Homotopy Group 7T n +k(sn) of the n-Dimensional Sphere sn (Continued) k 22 30 4 J 2520+6+2 5 6+2 6 12+2 7 24+4 8 240+24+4 9 16+4 10 16+2 II 16+2 12 48+4+2 13 16+2 14 8+2 15 4+2 - - - -, 2+2 100+480+2 2 4 ---, 480+2 1 2 3 - - ---, 2 2 1 00 + 2 4 - - - - - -I 2 2 2 4 I 8+22 - - - - - -I 2 4 8+2 100+264+2 24+22 ------, 264+2 L + 16 17 2+2 18 2+2 19 2+2 20 2+2 21 2+2 22 2+2 23 2+2 ,. 24 2+2 o 6 30+2 60+2 120+ 2 3 120+ 2 5 240+ 2 3 240 + 2 2 240+ 2 240+2 480+2 480+2 480+ 2 480 + 2 480+2 480+2 480+ 2 480+2 480+2 480 + 2 15 o 30 30 6+2 16 17 18 o 12+2 2 12+2 2 120+ 12+2 5 24+ 2 2 24+6+2 24+2 504+24+2 24+2 24+2 2 8+4+2 480+4+4+2 8+8+2 8+8+2 8+8+2 24+8+8+2 8+8+2 8+4+2 8+2 8+2 8+2 8+2 o 12+2 2 132+2 132+2 5 264+2 1056+8 264+2 264 + 2 264+2 264+6 264+2 3 264+2 5 264+2 3 264+4+2 264+2 2 264+2 2 264+22 264+2 264+2 264+2 264+2 264+2 19 o 132+2 2 2 2 6 6+2 2 480+ 12 24 24+3 24 504+24 24+22 24+25 24+2 3 240+24 2 4 24+ 12 24+2 20 21 30 6+6+2 2 2 504+22 2 4 2' 2 4 o 30 o 6+2 12+2 2 2 4 o 0 2 2 2 2 210 24+22 9240+6+2 6+2 90+2 2 6 180+2 2 6+2 72+2 3 12+23 1440+24+24 6+2 2 144+2 3 6+22 144+6+2 2 4 48+2 2 6+24 2016+ 12+22 4+23 16+2 2 4+22 16+2 2 2 3 16+2 3 2 4 240+ 16+23 2 3 16+23 2 3 16+ 2 2 2 4 16+ 2 2 2 4 16+22 2 3 8 + 2 2 --, I 00 +2 2 4+2 2 - -2; -I 23 L_____ 22 2 2 240+2 2 2 2 24+2 2 3 24+ 12+4+22 4+2 2 2 4 2 4 6+2 4 2 4 2 3 2 3 2 4 2 4 2 4 2 5 2 6 2' 2 2 2 2 2 4 24 24 24 24 Remarks (1) When n> k + I (below the broken line in the table), 7T n +k(sn) is independent of II and is isomorphic with the kth stable homotopy group G k . (2) Let InE7Tn(sn) be the identity on sn; 1J2E7T3(S2), V4E7T7(S\ 08E7T15(S8) be tbe Hopf mapping S3S2, S7S4, S 15S8 (induced mapping in the homotopy class), respectively; and [12m' 12m] E 7T4m-1 (S2m) (m * 1,2,4) be the Whitehead product of 1 2m , These objects generate infinite cyclic groups which are direct factors of 7T n +k(sn) corresponding to the original map- pmgs. (3) 1Jn+2 = En1J2' V n + 4 = E"v 4 , 0n+8= E n 0 8 (n;;' I) (E is the suspension) are the generator for 7T n +k(sn), which contains the mappings. (4) The orders of the fol1owing compositions are 2:gs n + 7 2 2 2 2 2 2 2 4 2 4 2 4 1Jn"1J n +1 (n;;.2), V n OV n +3 (n;;.5), 0nOOn+7 (n;;.16), 1JnOVn+1 (n=3,4), V n o1Jn+3(n=4,5), 1Jno o n+l(n;;.7), °no1Jn+7(n;;.8), 1'10°°13' 01l0VI8, 1Jno1J n +lo1Jn+2 (n ;;.2), vnoVn+30Vn+6(n ;;'4), 0noon+7oon+14 (n ;;'9). (VII) The Homotopy Groups 1tk(G) of Compact Connected Lie Groups G Here the group G is one of the following: SO(n) (n;;.2), Spin(n) (n;;'3), Sp(n) (n;;. 1), G 2 , F4' E6' E7' U(n) (n ;;.1), SU(n) (n;;. 2), E8' (I) The Fundamental Group 7TI(G). "(G){f (G=U(n) (n;;.I), SO (2», (G=SO(n) (n;;'3», (for al1 other groups G). (2) Isomorphic Relations (k;;. 2). 7Tk(U(n» 7Tk(SU(n» (n;;. 2), 
1745 App. A, Table 6.VII Topology 7Tk( U (1))  7Tk( SO (2»  O. 7Tk(Spin(n»7Tk(SO(n» (n > 3), 7Tk( Spin (3))  7Tk( Sp (1»  7Tk(S U (2))  7Tk(S 3), 7Tk( Spin (4»  7Tk( Spin(3) + 7Tk(S 3), 7Tk (Spin (5))  7Tk (Sp (2», 7Tk (Spin(6»  7Tk( S U (4». (3) The Homotopy Group 7Tk(G) (k > 2). n2(G)0. n3(G)00 (G#SO(2), U(1), SO (4), Spin(4)), n3(SO(4))00+00. W'(G)W2 W'(G)! +2 7Tk(G), k>6. G k Sp (1 ) Sp(2) Sp(3) Sp(4) SU(2) SU(3) SU(4) SU(5) SU(6) SU(7) SU(8) (G=SO(4), Spin (4», (G=Sp(n), SU(2), SO (3), SO (5), Spin (3), Spin (5», (G=SU(n)(n>3), SO(n)(n>6), G 2 , F4' E 6 , E7' E8)' (G= SO (4), Spin (4», (G=Sp(n), SU(2), SO(3), SO(5) Spin (3), Spin (5)), (G=SU(n)(n>3), SO (6), Spin (6», (G=SO(n), Spin(n) (n>7), G 2 , F4' E6' E 7 , E8)' 6 7 8 9 10 11 12 13 14 15 12 2 2 3 15 2 2 2 12+2 84+22 2 2 - 0 -  - - 0- - 0 - 120 2 2 2 4+2 1680 2 -------------, o 00 0 0 0 00 2 2 L _ 1 QOO _ _ _ J. _ _ o 00 0 0 0 00 2 2 0 00 12 2 2 3 15 2 2 2 12+3 84+22 2 2 6 0 12 3 30 4 60 6 84 + 2 36 ----, o 00, _2 _ ,120+2 4 60 4 1680+2 72+2 o 00 0 00 L _10_ 0 360 4 1680 6 o 00 0 00 0 - -   12 _ _  _ -, 5040 + 2 6 o 00 0 00 0 00 0 00 L _ 04..9 _ _ _ _0_ _ o 00 0000 00 0 00 0 00 SO(5) SO(6) SO(7) SO(8) SO(9) SO (10) S0(11) S0(12) S0(13) SO (14) S0(15) SO(16) S0(17) G 2 F4 E6 E7 E8 o 00 0 0 120 2 2 2 o 00 24 2 120+2 4 60 o 00 2 2 2 2 8 00+2 0 - --, o Loo + 00 2 3 2 3 24+8 00 +2 0 o -00-: 2 2 2 2 8 00+2 0 o 00 -2 00+2 4 00 12 o 00 2--2-2 00 2 o 00 2 2 0 - :  +-00, 2 2 o 00 2 2 0 00 2 L - - 1 000220 00 0 00 4 l___, o 00 2 2 0 00 0 0 L 2 00 ---, o 00 2 2 0 00 0 0 0 L_+_oo_ o 00 2 2 0 00 0 0 0 00 3 0 2 6 0 00 +2 0 0 168+2 2 o 0 2 2 0 00 +2 0 0 2 00 o 0 0 00 0 00 12 0 0 00 o 0 0 0 0 00 2 2 0 00 o 0 0 0 0 0 0 0 0 00 4+2 4 2 2 2 2 2 2 2 2 2 2 1680 2 1680+2 72+2 2520+8+2 2 4 2520+ 120+ 8 + 2 2 7 8+2 00+23 8 00 + 2 2 8 00+2 24+4 00 +2 8 00+2 00 
App. A, Table 6.VII Topology 1746 (4) Stable Homotopy Groups. For sufficiently large n for fixed k, the homotopy groups for classical compact simple Lie groups G = Sp(n), SU(n), SO(n) become stable. We denote them by the following notations. Here we assume k > 2. 7Tk(Sp) = 7Tk(Sp(n») 7Tk( U) = 7Tk( U (n));;;; 7Tk(S U (n)) 7Tk( 0) = 7Tk(SO (n)) (n >(k-I)/4), (n >(k+ 1)/2), (n>k+2). Bott periodicity theorem W'(SP){ f W,(O){ f 7T k (U);;;;{: (k=:=3,7 (modS)), (k=:=4,5 (mod8)), (k=:=O, 1,2,6 (mod8)). (k=:=3,7 (mod8)), (k=:=O, I (mod8)), (k:=2,4,5,6 (mod8)). (k:=1 (mod2)), (k=:=O (mod2». (5) Metastable Homotopy Groups. (a,b) means the greatest common divisor of two integers a and b. 7T2n(SU(n));;;; n!. 7T2n+l (SU(n»);;;; {  (n even), (n odd). 7T ( Su(n) ) = { (n+I)!+2 2.+2 - (n + 1)!/2 (n even, > 4), (n odd). 7T ( Su(n) ) = { (24,n) 2n+3 - (24,7T+3)/2 (n even), (n odd). ( s ( ») { (n+2)!(24, n)/48 7T2n+4 Un;;;; (n+2)!(24, n+3)/24 7T2n+5(SU(n));;;;7T2n+5( U(n+ I)). (SU( ))  { 7T2n+6(U(n+l)) 7T2n+6 n = 7T2n+6( U(n+ 1))+2 (n even, > 4), (n odd). (n :=2,3 (mod4), n > 3), (n:=O, I (mod 4)). { (2n+l)! (neven), 7T4n+2(Sp(n));;;; 2(2n+I)! (n odd). 7T4n+3(Sp(n»;;;;2. { 2+2 (n even), 7T4n+4(Sp(n));;;; 2 (n odd). 7T ( s ( n ») ;;;; { (24, n+2)+2 4n+5 P (24, n+2) (n even), (n odd). 7T ( s ( n) ) ;;;; { (2n+3)!(24, n+2)/12 4n+6 P (2n+3)!(24, n+2)/24 7T4n+7 (Sp (n));;;; 2. 7T4n+8 (Sp (n))  2 + 2. (n even), (n odd). 
1747 App. A, Table 6.VIII Topology The homotopy groups 17 n +;(SO (n» for n > 16, 3> i> -1 are determined by the isomorphism 17 n +i(SO (n»  17 n +i( 0) + 17 n +;+ I (V i +3+n,;+3 (R») and the homotopy groups of Vm+",m(R) given below. (6) Homotopy Groups of Real Stiefel Manifolds Vm+n,m(R)= O(m + n)j 1m X O(n). 17 n +k( Vn+I,I) 17 n +k(sn), 17n-k(Vm+n,m)0 (k> 1), 17n(Vm+nm) { 2 , 00 (n=2s-1,m>2), (n=2s). 17 n +k( Vm+n,m) (k = 1,2,3,4,5) are given in the fol1owing table, X 1 2 3 4 5 6 8s-1 8s 8s+ I 8s+2 8s+3 8s+4 8s+5 8s+6 2 0 00' 2 2+00 2 2+00 2 2+00 2 2+00 2 2+00 2 2+00 '"n+1 ;>3 0 00 0 2' 2 4 0 2' 2 4 0 2' 2 4 2 00 2' 4 2 2 4 2' 4 2' 4 2' 4 2' 4 2' '"n+2 3 00' 2 2+00 2' 4+00 2 2+00 2' 4+00 2 2+00 2' 4+00 2 ;>4 00 0 2 2' 8 0 2 2' 8 0 2 2' 8 0 2 2 2' 2 00+12+2 2' 24+2 2' 24+2 2' 24+4 2' 24+2 2' 24+2 3 2' 2 2 00+ 12+4 2 3 12+2 2 2 24+4 2 3 12+2 2' 24+4 2 3 12+2 '"n+3 4 2 00 2 00'+ 12+4 2' 12+00 2' 24+4+ 00 2' 12+00 2' 24+4+ 00 2' 12+00 ;>5 0 0 2 12+4+ 00 2 12 2 24+8 2 12 2 2 4+48 2 12 2 2 12' 00+2 2'+ 24 2' 24 2 24 2 24 2 24 2 24 3 2' 0 00+4 2' 2 3 2 4 2' 2' 2 4 2' 2 2 2 '"n+4 4 2 0 4+00 2' 2' 2 8 2 3 2 2 8 2 3 2 2 5 00 0 4+ 00 2 2' 2+00 2 8+00 2' 00 2 8+ 00 2' 00 2 ;>6 0 0 4+00 2 3 2 0 8 2 0 2 16 2 0 0 2 12 2' 2 2 3 0 00 0 0 0 0 0 0 0 0 '"n+5 3 12' 00 2+24 2' 24 00+2 24 2 24 2 24 2 24 2 4 0 00 2 3 2' 2 00+4 2 2 2' 2 4 2' 2' 2 4 (VIII) Immersion and Embedding of Projective Spaces (114 Differential Topology) We denote immersion by c, and embedding by £. P"(A) is an n-dimensional real or complex projective space where A = R or C, k{P"(A)} is the integer k such that P"(A) c R k and P"(A)<:,t R H , and k{P"(A)} is the integer k such that P"(A)£R k and P"(A)$R H , In the table, for example, numbers 9-11 in the row k{P"(R)} for n=6 mean P 6 (R)<:,tR 8 , P6(R)cR Il . n 2 3 4 5 6 7 8 9 10 11 12  {P"(R)} 2 4 5 8 9 9-11 9-12 16 17 17-19 k {pn(R)} 2 3 4 7 7 7 8 15 15 16 16 17-19 {pn(C)} 3 7 9 15 17 22 22-25 31 33 38 38-41 k{pn(C)} 3 7 8-9 15 16-17 22 22-25 31 32-33 38 38-41 2' 2' + 1 2'+2 2' +3 2' + 2s (r > s > 0)  {pn(R)} 2n 2n-l 2n-3-2n-l k {pn(R)} 2n-l 2n-3 2n-4 2n-6  {pn(C)} 4n-l 4n-3 4n-2 4n-2 k {P"(C)} 4n-1 4n-4-4n-3 4n-2 4n-2 
App. A, Table 7 1748 Knot Theory 7. Knot Theory (235 Knot Theory) Let k be a projection on a plane of a knot K. We color the domains separated by k, white and black alternatively. The outermost (unbounded) domain determined by k is colored white. In Fig. 16, hatching means black. Take a point (a black point in Fig. 16) in each black domain. The self-intersections of k are represented by white points (Fig. 16). Through each white point we draw a line segment connecting the black points in the black regions meeting at the white point. In Fig. 16, we show this as a broken line. We assign the signature + if the torsion of K at the intersection of k has the orientation of a right-hand screw (as in Fig. 17, left), and the signature- if the orientation is opposite (as in Fig. 17, right). The picture of the line segments with signatures is called the graph corresponding to the projection k of the knot K. Given such a graph, we can reconstruct the original knot K.  / "" ' +; "" -j , " 1 ,I /rif / )., Fig. 16 Fig. 17 Fig. 18 shows the classification table of knots for which the numbers of intersections of k are 3 to 8 when we minimize the intersections. The projection of k is described by a solid line, and the graph by broken lines. We omit the signatures since for each graph from 3 1 to 8 18 they are all + or all -. Such knots are called alternating knots. 8. Inequalities ( 88 Convex Analysis, 211 Inequalities) (1) la+bl.;; lal+lbl, la - bl ;;. Iial-Ibil. For real a v , we have La;;;' 0, and the equality holds only if all av=O. (2) n!<nn«n!)2 (n;;.3). en;;. nn / n!. nl/n<31/3 (n#3). (3) 2/'IT< (sinx)/x < I (0<x<'IT/2) (lordan'sinequality). (4) Denote the elementary symmetric polynomials of positive numbers aj, ..., an> 0 by S, (r=l, ...,n). Then / / 1/2 / II' / II" SI (7);;'[S2 ()] ;;....;;.[s, ()] ;;....;;.[S" ()] If at least one equality holds, then al = '" = an' In particular, from the two external terms, we have the following inequalities concerning mean values: I n ( n ) 1/" / n I n VI a v ;;' VI a v ;;. n VI a v ' For weighted means, we have " n L AA;;' II a;' v=1 v=1 (Lx,,=I, x" >0). (5) When a v >0, b v >0, p > 1, q> I, (l/p)+(I/ q)= I, lip Ilq l Vl (av)p] l VI (hv)q] ;;. VI avhv (Holder's inequality). The equality holds only if (avY' = c(bv)q (c is a constant). 
1749 3, 8, 8 12 8, 8, 8" App. A, Table 8 Inequalities 76 8, 8 14 7 2 7, 8" Fig. 18 Classification table of knots. The signatures from 3\ to 8\8 are aU + or all 8, 8" 81b When p = q = 2, the inequality is called Cauchy's inequality, the Cauchy-Schwarz inequality, or BunyakovskiI's inequality. As special cases, we have ( i a v )( i  ) ? n 2 (av>O), v=l p=l v ( f av ) 2 <:::n ( f a; ) (av>O). p=l v=l When O<p < 1, we have an inequality by reversing the inequality sign in Holder's inequality. 
App. A, Table 8 Inequalities 1750 (6) When a v > 0, b v > 0, p > 0, and {a v } and {b v } are not proportional, we have [V1 (a v + bY] lip  [V1 (aY ] lip + [V1 (bv)P ] lip (p I) (Minkowski's ine'quality). The integral inequality corresponding to (5) or (6) has the same name. n n (7) If a"v>O, L a"v= L a"v=l, bv;;'O' ,,=1 v=1 V1 b v < ,,J V1 a"vbv). In particular, for the determinant  = det(a",,), n I n \ 112< V1  "lla"i). The equality in this holds only if al1 rows are mutual1y orthogonal. If aUla"vl < M, we have II < nn/2M n (Hadamard's estimation). (8) Suppose that a function j(x) is continuous, strictly monotone increasing in x; 0, and j(O)=O. Denote the inverse function of j by j-I. For a,b>O, we have ab < foa j(x)dx+ fob r l (x)dx (Young's inequality), and the equality holds only if b = j( a). In particular, for j(x)= XP-l (p> 1), we have a P b q -+- ;;.ab, p q where (1/p)+(1/ q)= 1. (9) Ifp,q>l, (1/p)+(1/q)=I, a,,;;' 0, bv;;'O, f b 1 < . 71 [ f (a,,)p ] I/P [ f (bv)q ] l/q (Hilbert's inequality), ""'=0 J.t P sm(71 /p) ,,=0 v=O and the equality holds only when the right-hand side vanishes. (10) For a continuous function j(x);;. 0 (0 < x < (0), we put F(x)= fox j(t)dt, and assume that p > 1. Then fo oo [ FX) r dx « p 1 f fooo[j(x)t dx (Hardy's inequality), and the equality holds only if j(x) is identicaUy O. Further, if j(x»O, fooo ex p [  foXIOGJ(t)dt] dx< e 10 00 j(x)dx (Car1eman's inequality). (11) Let a < x <  < b, p > 1, and 1 f  sup  j(t)dt= e(x).  s x x Then (b P ( p ) p (b )" [e (x)] dx < 2 -=T )" Ij(x)jP dx a p a (Hardy-Littlewood supremum theorem). (12) If j(x) is piecewise smooth in 0.;; x.;; 71 and j(O) = j( 71) =0, fo'T[f' (x)t dx;;. fo'T[f(x)t dx (Wirtinger's inequality), and the equality holds only if j(x) is a constant multiple of sinx. 
1751 App. A, Table 9 Differential and Integral Calculus 9. Differential and Integral Calculus (I) Derivatives and Primitive Functions (106 Differential Calculus, 216 Integral Calculus) F(x)= f f(x)dx acp + /# (a,/3 constants) CP'I/; cp/tf; (1/;",,0) loglcpl (cp""O) <P(cp) (composite) c (constant) f(x)= F'(x) acp' + /31/;' cp'l/; + cpl/;' (cp'l/; - cpl/;,)/tf;2 cp' / cp (logarithmic differentiation) (dlJ>/dcp)'cp' o x n nx n - 1 xn+l/(n+l) loglxl logolxJ x(logx-l) XX x n (n"" - I) 1/ x (logae)/ x logx expx = eX a X loga xX(l + logx) expx= eX aX (a>O) (x - l)e X xe X smx cosx cotx -smx sec 2 x - cosec 2 x cosx tanx secx secxtanx - cosecx cotx cosh x sinhx sech 2 x - cosech 2 x - sech x tanh x - cosech x co th x cosec x sinhx=(e X - e- X )/2 coshx=(e X + e- X )/2 tanh x = sinh x / cosh x coth x = cosh x / sinh x sechx= l/coshx cosech x = 1/ sinh x 1 1 I I + x 1 _ { arc tanh x -og- - 2 1- x arccothx (Ixl < 1) (Ixl> I) 1/ v'1=? -1/v'1=? 1/(1+ x 2 ) -1/(1 + x 2 ) 1/lxlvY=! -1/!xIY x 2 -1 1/ V?+T 1/ vY=! 1 l-x 2 arc sin x (IFI < 'TT /2) arc cos x (0 < F < 'TT) arc tan x (IFI < 'TT /2) arc cotx (IFI < 'TT /2) arc sec x (0 < F < 'TT) arc cosec x (IFI<'TT/2) arc sinh x = log( x + V?+l ) arc cosh x = log( x + vY=! ) arc sech x - 1/ x Y I - x 2 -1/lxlVT+? arc cosech x 
App, A, Table 9.11 Differential and Integral Calculus 1752 F(x)= f f(x)dx f(x)= F'(x) 2 log I =  ; I (a> 0) (11 a)arc tan(x 1 a) (xY 1- x 2 +arcsinx)/2 [xy x 2 :tl :tlog(x+Y x 2 :t1 )1/2 -log 1 cosxl log 1 sin xl 10gltanxl log I tan[('1T 1 4) + (x 12)]1 log I tan( x 12)1 (x 12) - (1 14)sin 2x x 2 - a 2 I/(x 2 +a 2 ) Y 1 - x 2 Y x 2 :t 1 tan x cot x 1 Isinx cosx secx sinx - x COSx cosx + x sinx n sin mx sin nx + m cos mx cos nx n 2 - m 2 bx bsinax - acosax e a 2 + b 2 bx bcosax+ a sin ax e a 2 + b 2 xarcsinx+ cosec x sin 2 x xsinx x cos x (n 2 * m 2 ) sin mx cos nx e bx sinax e bx cosax arc SIn X x arc tan x - (1/2)log(1 + x 2 ) det( fPjk)j,k-l, ...,n arctanx L det( fPjl'" fPj"-lfP'j"fPj"  1 .. .fPjn)j= I. ...,n (II) Recurrence Formulas for Indefinite Integrals (1) I = f dx (m is a positive integer). m - (1 + x2( Im= x + 2m-3 Im _ 1 (m:>2); II=arctanx. 2m-2 (1+X2(-1 2m-2 (2) Im= f x m dx (m is an integer, a*O). y ax 2 + bx + c The case m < 0 is reduced to the case m :> 0 by the change of variable 11 x = t. 1 m-I 2 (2m-l)b (m-1)c Im=-x Vax +bx+c - 2 I m - 1 - I m - 2 (m:>I); ma ma ma _ 1 (11 va )logI2ax+ b+2va Y ax 2 + bx+ c I 1 0 - -1. 2ax+b - arCSIn V Y b 2 -4ac (a>O), (a < 0); In this case, for the integrand to be a real function it is necessary that b 2 - 4ac > O. (3) Im= f xmexdx (m is an integer). Im=xmex-ml m _ l ; lo=e x , l_l=Eix, 
1753 App. A, Table 9.I1I Differential and Integral Calculus where Ei is the exponential integral function ( Table 19.11.3, this Appendix). (4) Im.n=' f x m (1ogxr dx (m,n are integers, nO). xm+1 n xm+1 Im.n= m+1 (1ogx)n- m+1 Im.n-I; Im.o= m+1 (m*-I), L I . n =(1ogxr+ I /(n+I). (5) 1m=' f xmsinxdx, J m =, f xmcosxdx (m is a nonnegative integer). 1m = - xmcosx+ mJ m - 1 = x"'-I(msinx - xcosx)- m(m-I)/ m _ 2 , lm=xmsinx- mlm-I = xm-I(xsinx+ mcosx)- m(m-I)J m _ 2 ; 1 0 = -cosx, Jo=sinx, (6) Im,n=' f sinmx.cosnxdx (m,n are integers). sinm+lxcosn-Ix n-I } Im.n m+n + m+n Im.n-2 (m+ n*O), -sinm-Ixcosn+lx m-I 1 = +-1 2 m.n m+n m+n m-.n -sinm+IXCOSn+IX m+n+2 Im,n= n+1 + n+1 I m . n + 2 (n*-I), sin m + IXCOS n + IX m+n+2 Im.n= m+1 + m+1 I m + 2 . n (m*-I); /1.1 = (sin 2 x)/2, / 1 ,0= -cosx, /1.-1 = -loglcosxl, / 0 ,1 =sinx, 1 0 ,0 =x, 10, -I = log I tan[(x/2)+ ('IT /4)]1, L 1.1 =logl sinxl, 1-1.0= logl tan(x /2)1, I -I. -I =Iogi tanxl. f tanm-Ix Im.-m=' tanmxdx= m-l -l m - 2 .-(m-2)(m"", I). (III) Derivatives of Higher Order f(x) f(n)(x) cp.tf; i ()cp(v)tf;(n-v) v=o (Leibniz's formula) Xk n-I II (k-v)x k - n p=o (x+ar n! expx sinx expx a X (1og a r (-Ir-I(n-I)!/x n sin[x+(n'IT/2)] cos[x+(n'IT/2)] aX (a>O) logx cosx e OX cosbx rneOXcos(bx + nfJ) (where a::.rcosfJ, b::.rsinfJ) arc Sill X n-\ ......L L (- I Y ( n - 1 ) (2v - I)! !(2n - 2v - 3)!!(1 + x)-(l/2)-V(l- x)(l/2)-n+v 2 n - 1 v=o v (where (2v - I)!! =' 1.3.5. ... . (2v - I), (- I)!! =' I) arc tan x (_I)n-I(n -1)!sin"{lsinnfJ (where x=cotfJ) ( ] )' = -  , (])' 2j'2- ff" l ( 1. ) '" = 6fj'f"-6j'3_f2j''' f t 
App. A, Table 9.IV Differential and Integral Calculus 1754 Higher-order derivatives of a composite function g(t)=j(xI(t), ...,xn(t» dg n aj dx; d 2 g = i  d 2 x; + i  dx; dX j dt = ;I ax; dt' dt 2 ;= 1 ax; dt 2 ;,j= I ax;ax j dt dt' d 3 g n aj d 3 x; n a 2j d 2 x; dX j n a 3j dx; dX j dXk -=L--+2L ---+ L -- dt 3 ;= 1 ax; dt 3 ;J= I ax;ax j dt 2 dt ;J,k= 1 ax;aXjaXk dt dt dt . For a function z=z (xj, ...,x n ) determined implicitly by F(z;xj, ...,xn)=O, we have az _ Fx, ax; - - Fz ' a 2Z F xjxj FXjFxjz + FXjFxjz -=--+ aX;aXj Fz F; Fx.Fx.F zz , 1 F 3 z Schwarzian derivative: {y;X}=( : )/( : ) -  [( : )/ ( : ) r {y;x}=Oy=(ax +b)/(cx+ d), {y;x} = ( ; )2[{y;z} - {x;z}]= - ( : Y{x;y}, {(ay+ b)/(cy + d);x} = {y;x}. (IV) The Taylor Expansion and Remainder If j(x) is n times continuously differentiable in the interval [a,b] (i.e., of class en), j(b)= nf (b!a)" j<'>(a) + Rn ,=0 (Taylor's formula). Rn is called the remainder, and is represented as follows: Rn=  l b(b-xr-Ij<n>(x)dx= (b-a)P(b-)n-p j<n)() (n-I)! a (n-I)!p (b a)" - (1-0)"-Pj<")(a+0(b-a)) (n):p>O, 0<0<1, a<<b, =a+O(b--a)) (n-1)!p (Roche-Schlomi1ch remainder); 1 " = -(b- a) j<")() (Lagrange's remainder); n! = (b- a)(b-)"-I j<n)m (Cauchy's remainder). (n -I)! If j(x,y) is m times continuously differentiable in a neighborhood of a point (xo,yo), m - 1 1 ( a a ) >- j(xo+h,yo+k)= '\o A! h ax +k ay j(xo,yo)+R m a "+l ( x ) '" h"k' o,Yo + R "'" II!V! ax "a ' m' O""+v"m-I;,,,v;>O r Y 1 ( a a ) m . Rm= m! h ax +k ay j(xo+fJh,yo+fJk) (O<fJ<I). If all partial derivatives up to order m - 1 are totally differentiable, m-] 1 ( n a ) ' j(x]+hl,...,xn+h n )= L;! L h"ax j(Xb...,xn)+Rm v=O ,,=1 " I a VI + ... + ''1 (X j, ..., X n ) = L , , hfi...h: n a' a +Rm' Pf....Pn. XII,.. x;n where the  means the sum for "I' ..., "n in the domain 0.;; "I + ... + "n .;; m - 1; "b .. ,,"m :> O. The remainder Rm is expressed as 1 ( n a ) m Rm=, L h"- a j(xI+fJhj,...,xn+fJhn) (O<fJ<I). m. ,,=1 x" 
1755 App. A, Table 9.V Differential and Integral Calculus (V) Definite Integrals [4] In the fol1owing formulas, we assume that m, n are positive integers. 8mn is Kronecker's delta (8 mn = 0 or 1 for m =1= n or m = n), f is the gamma function, B is the beta function, and C is the Euler constant. For simplicity, we put { 1.3.5.... '(m-2)'m=2(m+l)/2f[(m/2)+ I]/V7T =m!/2(m-I)/2[(m-I)/2]! 11_ (misodd), m..= 2.4.6.... '(m-2)'m=2 m / 2 f[(m/2)+ I]=2 m / 2 (m/2)! (m is even). In an n-dimensional real space, the volume of the domain 2 n [f(1/p )]" pn-Inf(n/p) . For p=2, this is the volume of the unit hypersphere, which is 'lT n / 2 ( (2'lTr/2/n!! (n is even), f[(n/2)+ 1] = 2(2'lT)(n-l)/2/ n !! (n is odd). IXIIP+...+lxnIP<1 (p>O) is The surface area of the (n -I)-dimensional unit hypersphere 2'lT n / 2 ( (2'lT)"/2/(n-2)!! f(n/2) = 2(2'lT)(n-l)/2/(n-2)!! Ixd 2 +... +lx n I 2 = 1 IS (n is even), (n is odd). fa 00 xp-1e- x dx= fa 1(10g )P-l dx=f(p). 1 1 ( l ) q f(q+l) x P log- dx= 1 (Rep,Req>-1). o x (p+ l)q+ ( l xP -I(I-x)q- 1 dx= 1 00 Xp-I + dx=B(p,q)= f(p)f(q) . J o 0 (1+x)p q r(p+q) (00 x a dx=..!. f[(a+1)/c]f[b-{(a-c+l)/c}] J o (1+X C )l+b c f(1+b) (Rec >0; Rea, Reb> -1; Reb> Re a- + 1 ). a (a- X)P-l(X_ b)q-l (a- bl+ q - 1 fa Ix-cip+q dx= la-clq\b-clp B(P,q) (0< c < b< a or 0< b < a < c; Rep,Req >0). f oo 1 'IT(2n)! (2n-l)!! dx= - 'IT -00 (1 +x2)"+1 2 2n (n!)2 (2n)!!' f 00 x2m 'IT -dx- (2m+l<2n). - 00 1 + x2n - n sin[(2m + 1)'lT /2n] l oox a - 1 _ 1+ dx=r(a)f(1-a)=---,-'!!.- (O<a<1). o x sma'lT looe-a2x2dx= V; 100xpe-x2dx=..!.f ( P+I ) = { (2n-O!!V;/2n+l (p=2n), o 21al' 0 2 2 n!/2 (p=2n+ 1). faOO(e-a2/X2_e-b2/X2)dx=(b_a)V7T (a,b ;;'0). faooe-[X-(l/x)l'dx= V; . ( OOe- x2 -(a'/x') d x=e- 2a V; ( ) (00 1 'IT J o 2 a;;'O. J o eax+e-ax dx= 4a ' 1 00 x dx=  (a>O). o eax_e- ax 8a 2 1 00 x _ 1 1 log(1 / x) _ 'lT 2 x _x dx - 2 dX-- 8 ' o e -e 0 I-x 
App. A, Table 9.V Differential and Integral Calculus 1756 I 00  _ I I log(1 / x) _ 'lT2 I 00 X _ I I log(1 / x) _ 'lT2 x 1 dx- 1+ dx-- 12 . -----x-- I dx- I dx-- 6 . o e + 0 x 0 e - o-x (00 10 g ( e: + I ) dx = (I log ( 1 + x ) 1. dx = 'lT2 . J o e - I J o 1 - x x 4 In l logx 1 'IT/2 'IT dx= logsinxdx= - _ 2 log2. o 0 I J1og(l +x) d x ='!!..lo g 2. I I logx dx =- i oo logx dx =  (_1)n-1 2 8 2 2 "'" 2 = -0.91596.... o l+x 0 l+x I l+x n=O (2n+1) 2 I OO (logx) I 6 'IT 3 I lxp-x q p+1 dx=-. -dx=log- o 1 + x + x 2 81 V3 0 logx q + 1 fo I log lIogxl dx = fo 00 e- t logtdt= - C= -0.57721... . (p,q> -1). ('IT ('IT 'IT J o sin mx sin n.x dx = J o cos mx cos nx dx = limn I . 'IT. { [I-(-1)m+n J ----P---2 (m*n); I smmxcosnxdx= m - n o 0 (m=n). fo'IT/2sinpxcosqXdx=B( P;l , q;l ) (Rep,Req>-); { ('IT/2)(P-1)!!(q-1)!!/(p+q)!! = (p-1)!!(q-1)!!/(p+q)!! (p, q are positive integers not both even). ('IT/2sinPxdx = ('IT/2 cosPxdx= y;;, r[(p + 1)/2] (Rep> -1); J o J o 2 r[(p/2)+ I] = { ('IT/2)(2n-1)!!/(2n)!! (p=2n), (2n)!! /(2n + 1)!! (p = 2n + 1). (p,q are even positive integers), f 00 sin(x 2 )dx= f 00 cos(x 2 )dx = (00 sinx dx = (00 cosx dx =- fi"" -00 -00 J o vx J o vx VI I oo sinax d - 'IT ( 0) I oo tanx d _ 'IT - x-- a>. - x-- o x 2 0 x 2 (take Cauchy's principal value at x == (n + i)'IT). ( 00 sin 2 x dx = '!!.. . (00 sin(x 2 ) dx = '!!.. . J o x 2 2 J o x 4 (Fresnel integral). I OO sin2n+lx 'IT (2n-1)!! dx=- . o x 2 (2n)!! I oo sinqx 'lTqP-1 -dx- (0<p<2). o x P - 2r(p )sin(p'IT /2) (OO cosPX dx='!!..e- 1p1 . (OO cos 2 ax dx=(1+e-2a) (a>O). J o 1 + x 2 2 J o I + x 2 4 I oo sinax d - 'IT (1 -a ) ( 0) I ooxsinax d _ 'IT -a x-- -e a>. - x--e o x (1 + x 2 ) 2 0 I + x 2 2 (a >0). I OO sin 2m + IX cos 2 nx I oo sin2m+lxcos2n-Ix 'IT (2m-1)!!(2n-I)!! = =- . o x 0 x 2 (2m+2n)!! f 'IT/2 f." 'inax:o'bx dxl W4 I 2" 1 d 2'IT x- o 1 + a cosx - v'T"=Q2 (" xsinx dx= 'lT2 J o I +cos 2 x 4 . (a>b>O), (a=b>O), (b>a>O) (I I 1 ) ('IT/2 1 dx=  (ab-'-O ) . a < . J o a2 cos 2 x + b 2 sin 2 x 2ab -,- (Dirichlet's discontinuous factor). 
1757 App. A, Table 10 Series ('" cosnx dx- { '7Wn/O-a2) )0 1-2acosx+a 2 - 'IT/a n (a 2 -1) (Ial < 1), (lal> 1). References [1] B. O. Peirce, A short table of integrals, Ginn, Boston, second revised edition, 1910. [2] D. Bierens de Haan, Nouvel1es tables d'integrales definies, Leiden, 1867. There are several mistakes in this table. For the errata, see [3] C. F. Lindmann, Examen des nouvel1es tables de M. Bierens de Haan, Handlingar Svenska Vetenskaps-Akad., 1891. [4] E. W. Sheldon, Critical revision of de Haan's tables of definite integrals, Amer. J. Math., 34 (1912), 39-114. 10. Series ( 379 Series) (I) Finite Series (1) Sk=l k +2 k +... +n k (k is an integer). For k-;>O, we have _ Bk+l(n+I)-Bk+I(1) _  i ( k+l ) B2i(n+l)k+l-i Sk - k + 1 - "'" ( - 1). k + 1 ' i=O 1 where Bl is a Bernoulli number and BI(X) is a Bernoulli polynomial. In particular, So=n, SI=n(n+1)/2, S2=n(n+I)(2n+I)/6, S3=n2(n+1)2/4, S4 = n (n + 1)(2n + 1 )(3n 2 + 3n -I) /30. For k<O and k= -I, S -I = Cl- [( _1)1/(1- I)!] [ d 1 logr(x) / dXI]x_n+ 1 1 1  ( ) i B 2(i+l) (I+i-I)! 1 =Cl- - + "'" -1 - (l-I)(n+1)/-1 2(n+l)' i=1 (i+1)! (I-I)! (n+l)l+i' For 1= 1, the second term in the latter formula is replaced by log[lj(n + 1)]. Here r is the gamma function, and the constants c , are C _ [C (Euler constant) (1= 1), l-t  (I) (is the Riemann zeta function) (I -;> 2). n ... n (i+m-I)! 1 (n+m)! (2)  1(1+ 1)...(I+m-l)=  ('_ , = m+ 1 ( _ )' ' i=1 i=1 1 I). n I.  (i-I)! _ _ _ 1 [ 1 _ n! ] "'" (m -;> 2), i=1 (i+m-1)! m-l (m-I)! (n+m-1)! n L i!i=(n+I)!-I, i=1 n . i(7)=n2n-l, 1=1 i () ( n +  =  - I ) = ( ::z   ) (m  n), l=m f (n(r:i)=(nm). 1=0 'I a i = { a(a n -1/(a-1) (a '1= I) (a= 1) (geometric progression) n n(n+l) n+l  (a+jd)=(n+l)a+ 2 d=-r-(a+a+nd) j=O  . ( '11 ) . ( n 11 ) . (n + I) /3 / . /3 "'" sm a+]p =sm a+ 2" p sm 2 sm 2 , j=O (arithmetic progression) 
App. A, Table 10.11 Series 1758 n ( n ) (n + 1),8 / ,8 L cos( a + j,8) = cos a + "2,8 sin 2 sin 2 ' J=O n L cosec2Ja= cot(aj2)-cot2 n a. j=O (II) Convergence Criteria for Positive Series  an In the present Section II, we assume that an  O. Cauchy's criterion: The series converges when limsup < 1 and it diverges when limsup>1. d'Alembert's criterion: The series converges when limsupa n +1la n < 1 and diverges when liminf an +1 Ian > 1. Raabe's criterion: The series converges when liminfn[(a n la n +1) -1] > 1 and diverges when limsupn[(a n lan+l)-I] < 1. Kummer's criterion: For a positive divergent series (l/bn)' the series  an converges when liminf[(b n a n la n +1)- b n +1] >0 and diverges when limsup[(b n a n la n +1)-b n +1] <0 diverges. Gauss's criterion: Suppose anlan+l = 1 + (kin) + (On In A), where A> 1 and {On} is bounded. Then the series  an converges when k> 1; and diverges when k.;;; 1. SchlOmilch's criterion: For a decreasing positive sequence avtO, let nv be an increasing sequence of positive integers and suppose that (n v + 2 - n V +1)j(n V +1 - nv) is bounded. Then the two series  an and (nv+l - nv)a n , converge or diverge simultaneously. Logarithmic criterion: For a positive integer k, we put 10g k x;: log(log k _ 1 x), logl X = logx. Then for sufficiently large n we have The first logarithmic criterion: If { .;;;o, p> 1 an-1/(nlog[ n ...10&'-1 n(log k n)p) O, p.;;;1 then  an converges, then  an diverges. The second logarithmic criterion: If . p a n + I n login log k _ 1 n ( 10gkn ) an n+ I 10g l (n+ 1) "'log k _ l (n+ 1) 10g k (n+ 1) { -< 0, p > 1 ;;.0, p-< 1 then an converges, then an diverges. (III) Infinite Series  (_1);-1 L . =log2, i=1 1  (_1);-1_'17 L 2i-l -"4 i=1 (Leibniz's formula), ;-1  (-1) = '17-3 i=\ 2i(2i+ 1)(2i+2) 4'  (2i)! 1 _ '17 i=O 2 2i (i!)2 2i + 1 -"2' ;I ( + -log( 1+ + )) = C (C is Euler's constant). Putting  I  I  (_1);-1 nn)=L-:n, a(n)=L n ' ,8(n)=L .n i=ll i=l (2i-I) i=l 1 00 (_1),-1 € (n) = L -----;;, i= \ (21-1) 
1759 App. A, Table lO.IV Series we have (2'11' )2n (2 2n -1)'lT2n n2n) = 2(2n)! B2n' a(2n) = 2(2n)! B2n' (2 2n -I _ 1)'lT 2n '1T 2n + 1 {3(2n)= (2n)! B 2m E(2n+ 1)= 2 2 E 2n . 2 n+ (2n)! where Bn is a Bernoul1i number, and En is an Euler number. n2) = '11'2/6, n4) = '11'4/90, n6) = '11'6/945. a(2)='11' 2 /8, a (4) = '11'4/96, a(6) ='11'6/960. {3 (2)= '11'2/12, {3(4) =7'11'4/720, {3 (6) =31'11'6/30240. E(1) = '11' / 4, E(2) = 0.915965594177219 015054603514932... E(3)='11' 3 /32, E(5) =5'11'5/1536, E(7) =61'11'7/92160. (Catalan's constant), (IV) Power Series (339 Power Series) (1) Binomial Series (1 + x)" = O(  )Xi. This converges always in Ixl < 1. If a > 0, it converges in - 1 .;; x.;; I, and if - 1 < a < 0, it converges in - 1 < x.;; 1. When a is 0 or a positive integer, it reduces to a polynomial and converges in Ixl < 00. 1  .. m = £.. (-I)'x ' (Ixl < 1), x ;=0  (-I)i-I(2i)! . VI+X = L x' (Ixl < 1), i=O (2i -1)2 2i (i!)2 1 00 (-I)i(2i)! . = L 2 2 x' (Ixl < 1). VI+X i=O 2 '(it) (2) Elementary Transcendental Functions (131 Elementary Functions).  Xi ( X ) n eX=expx= L"""'I= lim 1+- , aX = exp(x log a) (Ixl<oo). i=O 1. n-> n 00 (_I)i-1 .  1 ( X-l ) 2i+1 log(l+x)=il i x' (-I<x.;;l), logx=2;0 2i+l x+l (O<x<oo). 00 ( - lY. 00 (- 1); . sinx= L x2'+1 cosx= L _x 21 (Ixl < (0). i=0(2i+l)!' i=0(2i)! 00 2 2i (2 2i - I)B . tanx= L ., 21 X 2i - 1 (Ixl<'i) (Bi is a Bernou11i number), i= 1 (21). 1 00 2 2 ; B . cotx=--l: X2i-1 x ;1 (21)! (0< Ixl <'i),  E 2i 2' ( '11' ) secx = £.. ----:-r x' Ixl <"2 i=O (21). 1  (2 2i - 2)B 2i 2 '-1 cosecX=-+L., x' (O<lxl<n). x i1 (2i)! (Ei is an Euler number), . 00 (2i)! x2i+ 1 arcslnx= L - 2 ' 1 (Ixl.;; 1), i=O 2 2 '(i!) 1 + 00 (-1/ arc tan x = L ---;---- X 2i + 1 ;=0 21+ 1 (Ixl.;; 1). 
App. A, Table lO.V Series. 1760 (V) Partial Fractions for Elementary Functions 00 8x tanx= L 2 ' n=O (2n+ 1) '172-4x2 I 00 I cotx=-+2XL 2 2.rr 2 ' X n= 1 X - n 00 (-1f(2n-l)'17 secx=4 L 2 ' n = 1 (2n - 1) '17 2 - 4x 2 1 00 (-I)" cosecx= - +2x L 2 2.rr 2 ' X n=lx-n oc sec 2 x = L n=-OO I 2 ' [X+ {(2n+ 1)'17/2}] 00 1 cosec 2 x = L 2 . n=-OO (x+n'17) (VI) Infinite Products (379 Series F) IT  = '!!- (Wallis formula), n=14n 2 -1 2 (C is Euler's constant). 00 X ) r(1+a) II ( 1+ - e-x/n=e- cx n=1 a+n r(1+a+x) IT (1- 2n  I )( 1 + {n ) = v?T / r( 1+ I )r(  - I)' n=1 II 1/(1- p -S) =  (s) (p ranges over all prime numbers, s> 1), p IT ( 1- x\ ) = Six , n= 1 n2.rr 00 . II x smx cos- = - 2 n x' n=1 00 ( 4x 2 ) II 1- 2 =cosx. n = 1 (2n - 1) '17 2 00 00 00 For Iq\ < I, putting q(=:=. II (1 + In), q2= II (1 + q2n-l), q3= II (1- q2n-l), n=1 n=1 n=1 oc q4 -= II (1- q2n) we have ql q2q3 = 1. n=1 Further, putting q = e i .".,., we have the following formulas concerning i)-functions (134 Elliptic Functions): &4 (O,T) = q4q, &2 (O,T) = 2ql/4q4qr, &3 (O,'T) = q4q, &; (O,T) = 2'17ql/4ql. 11. Fourier Analysis (I) Fourier Series (159 Fourier Series) Fourier sine series ao=.1 {af(x)dx, an =  l a f(x)cos n'17X dx a )0 a 0 a 2 1 a . n'17X b = - f(x ) sm- dx. n a 0 a  n'17X { f(x) (O<x<a), a o + "'" ancos- = n=1 a f(-x) (-a<x<O). b"sin n'17x = { f(x) (O<x<a), n=1 a -f(-x) (-a<x<O). (1) Fourier coefficients Fourier cosine series The next table shows the Fourier coefficients of the functions F(x) directly in the following manner from a given functionf(x) on the interval [0, a]. For x in [- a,O] and when the cosine series {an} is in question, we setf(x)= f( - x), and when the sine series {b n } is in question we setf(x)= - f( - x). Thusf(x) is extended in two ways to functions on [- a, a]. The functions F(x) are the periodic continuations of such functions. We remark that the sum of the Fourier series given by the Fourier coefficients in the right hand side has, in general, some singularities (discontinuity of the function or its higher derivatives, for example) at the points given by the integral multiples of a, We assume that /-I is not an integer. 
1761 App. A, Table 11.11 Fourier Analysis f(x) a o x a 2" x 2 a 2 3" e kx e kn - 1 kCl M7TX cos- a sinM'IT J1.'IT . J1.'lTX sm- a l-cosM'IT J1.'IT 1_,\2 1- 2'\cos('lTx / a) +,\2 '\sin('lTx/ a) 1-2'\cos('lTX / a) +,\ 2 B 2m (x/2a) B 2m + I(X / a) logsin('lTx/2a) (I /2)cot( 'lTX /2a) o -log2 a" (n= 1,2, ...) o [1+(_1)"+1] -2a n 2 'IT2 (-I)" 4a 2 n 2 'IT2 2ka[ ( - I)" e ko - I J k 2 a 2 + n 2 'IT2 2 MsinM'IT (-1)"-- 'IT J1.2- n 2 2 J1. [ 1 - ( - I)" COS J1.'IT J 'IT J1.2- n 2 2'\" (1,\1 < I) (- I)m+ 12(2m)! /(2n'IT)2m -l/n b" (n=I,2,...) [I +( _1)"+1]2a/n'IT ( _1)"+1 2 a n'IT (-1)"-1 2a 2 _[I +( _1)"+1] 4a 2 n'IT n 3 'IT3 2n'IT[I-(-I)"Je ko k 2 a 2 + n 2 'IT2 2 [<-I)" COSM'IT-I) 'IT J1.2- n 2 2 nsinM'IT (_1)"__ 'IT J1.2- n 2 ,\" (1,\1 < 1) (- l)m+12(2m + I)! /(2n'IT)2m+1 I (I) Note (I)The Fourier series does not converge in the sense of Cauchy, but it is summable, for example, by the Cesaro summation of the first order.  _ cos nx ( X ) (2) L (-1)" 1-n=log 2cosi (-'IT<x<'IT), n=1  cos(2n-l)x I I X I L 2n-1 =2"logcoti (O<x<2'IT, n-1  S in(2n-I)X = { 'IT/4 n= 1 2n -I - 'IT /4  I 2 L cosnx = -(x- 'IT)2_  "= 1 n 2 4 12 x*'IT), (0< x< 'IT), ('IT_<x<2'IT). (0<::: x <:::2'IT),  sinnx r x ( . t ) """' =-xlog2- In log sm2" dt (0<::: X < 2 'IT). n=1 n 0  a" L fiTcosnx= eocOSXcos(asinx)-I, n-1 .  sinnx = .!.('IT-x) (0< X <2'IT). "=1 n 2  " L  sinnx = eocOSXsin(a sin x). I n. n=  (_1)"-1 cosnx = 'IT csax _.-.!... "=1 n 2 - a 2 2asma'IT 2a 2 ( - 'IT <::: x <::: 'IT),  . . L (_I)"-l nsmnx = 'lTsnax (-'IT <x < 'IT). n=1 n2-a22sma'IT In the final two formulas, we assume that a is not an integer. (II) Fourier Transforms (160 Fourier Transform) The Fourier transform [f] and the inverse Fourier transform [g] for integrable functions f and g are defined as 
App. A, Table 11.11 1762 Fourier Analysis 3"[J(x)]=3"[J]()=(2nrn/2 r J(x)e-iXdx, JR n #[g(m=#[g](x)=(2nr n / 2 r g()eiXd, X=X1I+X22+...+Xnn' JR" In some textbooks the factor (2nr n / 2 is deleted or the symbols i and - i are switched when defining 3" and .#. However, conversion of the formulas above to ones due to other ddinitions is straightforward. These transforms are also defined for some nonintegrable functions, or even more generally for ttempered distributions. The Fourier transform 3" and the inverse Fourier transform .# defined on the space of tempered distributions Y" = Y"(Rn) are linear homeomorphic mappings from Y" to itself. Useful formulas of these transforms are given in the table below, where CJ: = (CJ: 1 , CJ: 2 , ..., CJ: n ) (CJ: j = 0,1,2,...), lCJ:j = CJ: 1 + CJ: 2 +... + CJ: n , C is tEuler's constant, A.E'C, and Z+={mEZlmO}. Case 1. n = 1. First we explain the meaning of the symbols appearing in the table: x+ = max(x, 0) (the positive part of x), x_ = max( - x, 0) (the negative part of x), x and x are understood in the sense of finite parts ( 125 Distributions and Hyperfunctions), (x + il:)l= exp[A. Log(x + il:)] (1:#0; Log is the principal value of log) = (x 2 + 1: 2 )'/2 exp [iA. Arg(x + il:)] (-n < Argz:::; n), (x::!:: iO)' = lim (x::!:: il:)' (limit in the sense of distributions). ,.j.o Then the fOllowing formula holds: (X::!::iO)l=X +e:tilnx Pfxm=x';'+( _1)mx (mEZ) (Pf is the finite part). In the special case m= -1, pfx-1 coincides with Cauchy's principal value p.v.x- 1 . T (E.'I") <5 (x) P(x) (polynomial) p.v.ljx Pfx- m S"[T] (EY")   P(idjd)<5() j;j2 isgn j;j2 [( -i)mj(m-I)!]c;m-I sgnc; (mEN,Z) r(.!c+I)  [e-in(J.+1)/2:;:J.-I +ein(HI)/2Cl-1] y2n [ r(.!c+ I) . ] =-e-,"(HI)/2(+iO)-J.-I (.!c\fZ)  (imj)[n<5(m)_i( -1)mm! PfC m - l ] (mEZ+) ( i)m-I [( mf -c ) m-I_c;m-Isgn-m-IIOgl!I J (mEN,Z) (m-l)! j=IJ 2 r(Hl) .  [e,"(J.+I)/2c;:;:J.-I +e-'n(HI)/2CJ.-I] y2n [ r ( .!c+ 1 ) . ] =  e,"(J.+I)/2(_iO)-J.-I (.!c\fZ) (_i)m [n<5(m)+i(-I)mm! Pf C m - l ] (mEZ+)  i m - I [( mf _c ) m-I+m-1sgn_m-110gl';I J (mEN,Z) (m-l)! j=IJ 2 [e'nl/2jr(_.!c)E:;:H (.!c\fZ+) [ e-'nl/2jr( -.!c)]C H (.!c\fZ+)  i m <5(m) (mEZ+) j;ii isgn'(C+logIW J;;;i e-a'/4 (a>O) 1 1 -- (a>O) a+ic; xJ. + x m + -m x+ xJ. x m x:::: m (x+iO)J. (x-iO)J. (x::!:: io)m =x m x-1loglxl e-X'jrx. { e- ax (x>O) o (x';;O) 
1763 App. A, Table 11.11 Fourier Analysis T(e.'l") e- a1xl M e-b J x 2 +a: 2 x 2 + a 2 log x +b arctan(x/a) sin ax \X\I-, cosax IxI 1 -' sin ax x {/(a2 _ X 2 )'+(li 2 ) {/(X2 _ a 2 r+(li 2 ) (x 2 + a 2 r+(I{2) (Jxl<a) (lx\>a) (Ixl<a) (Ixl>a) I .9'[T] (eY') J a+ Ja 2 +(2 (a>O) J a 2 + e   KI(a J2+b2 ) (a>O,b>O) "j; --.;2+b2 ;z:; _(e-all_e-bll) (a?ii'O,b?ii'O) -j;fi isgna'(e-I.III/c;) (aER,a#O) _r(V)COS V7t ( _ ) (vZ) jhc 2 \ _al v \ +a\V  r(V)COS v;C alv - I alv) (vZ) { j;fi (JI<la!) o (II>lal) fi r«(1/2)-V) (Y J-v(aIW (Re v <1/2, a>O) -  r«1/2)-V)(Iil ) VNv(aIW (-1/2<Rev<1/2, --.;2 )a J2 ( Iil ) V Kv(alW (Rev> -1/2, a>O) r(v+(1/2) 2a a>O) Case 2. n> 1. Let r= J xt +x +... +x; and P= J t +  + .., +;, where x=(xi)ER n and  = (OE R n . If JELl (R n ) depends only on r, then 3"[J] depends only on p and is expressed as 3"[J] (p)= p-(n- 2)(2 too j(r)r n (2 J(n-2)/2(rp) dr. The constant C in the table stands for Euler's number. T (E .<I") 8(x) PIx) (polynomial) Pfr!. ,2m Pfr- n - 2m (1 + r 2 l (1 +r 2 t Pfr!.logA r 2m log r Pfr- n - 2m logr ear 3"[T] (E.<I") (2n)n/2 (2n)n(2 p(ia/a)8() 2(n/2)H r«n + .1)/2) r(-A/2) Pfp-n-!. (Art2Z+, Art-n-2Z+) (2n)"/2( - )m8() (mE Z+) (_lt p 2m [ 2 m 1 r' «n/2)+m) ] 2Iog--C+ I -+ (mEZ+) 2(n/2)+2mr«n/2) + m)m! p jd r«n/2) + m) p-[(n/2)H]K(n/2)H(P) ( A'/'Z ) 2 .< 1r(-A) 'F + (2n)n/2(1_)m8() (mEZ+) 2(n/2)Hr«n+A)/2) f -'-'< [ 1 2 1 r'«n+A)/2) 1 P(-A/2) ] r(-A/2) P p OgP-+2: r«n+A)/2) +2: r(-).j2) (At2Z+, Art-n-2Z+) (_lr- 1 2(n/2)+2m-1 m! r«n/2) + m)Pfp -n-2m [ 11 m 1 r' ((n/2)+m) ] +(2n)nl 2 log2--C+- :>.+ (-t8() (mEZ+) 2 2j=1l r«n/2)+m) (-It 2m [{ 21 Im ll P«n/2)+m) } 2 2(n/2)+2mr«n/2)+m)ml P logp--2: C +2:J1j+Z r«n/2)+m) + n 2 + f - r"«n/2)+m) + r' «n/2)+m)2 ] (mEZ+) 24 4 j=l/ 4 r«n/2) + m) r«n/2) + m)2 1; r( n 1)(a2 +p)'n+1)/2 (a>O) The Fourier transform mentioned above is a transformation in the family of complex-valued func- tions or distributions. Similar transformations in the family ofreal-valued functions are frequently used in applications: 
App. A, Table 12 Laplace Transforms and Operational Calculus 1764 Fourier cosine transform !c(u) = IX' F(t)cosutdt. Inverse transform  roo !c(U)COsutdU= { F(t) (t>O), 71:Jo F(-t) (t<O). Fourier sine transform J.(u) = Loo F(t)sinutdt. 2 1 00 . { F(t) (t>O), Inverse transform - J.(u)smutdu= ) 71: 0 -F(-t (t<O). The Fourier transform can be expressed in terms of these transforms. For example (in R 1 ), 1 (00 i r oo $'[J](u) =  Jo [J(t) + J( -t)]cosutdt- J2i. ,,0 [J(t)- J( -t)] sinutdt. { F(t) (O<t<a) (a < t) t- I t a - I (O<a<l) 1/(a 2 + t 2 ) e- at e -A1 2 (Rei\ > 0) e - AI2t sin at (a> 0) t tanh( 'lTt /2) sech( 'lTt /2) J.(t) (Rev> -1) J o( at) No(t) Ko(t) fc(u) smau u ( di verges) f( a)cos( 'lTa /2)u - a 'lTe -a 1 ul/2a a/(a 2 + u 2 ) V 'IT / 4i\ e - u 2 /4A { 'lT2 (O<u<a) (a<u) fs(u) 1- cosau u ('IT /2)sgn u f( a)sin( 'lTa/2)u- a [e -auEi(au) - eauEi( - QU))/ a(2) u/(a 2 +u 2 ) e - u 2 /4Arp( u /2 v'};. ) / \/Y; (3) Y'lT/4A (u/4A)e- u '/4A i log/ :   I cosech u sin( v arc sin u) Yl-u 2 (0< u< 1) sech u cos( v arc sin u) Vl-u 2 (u-Y)' Vu 2 -1 { I/Y a;- u 2 . V'lT smT { - V: - ] '1T/2Y 1+u 2 (u- v u 2 -1 )' V'lT cos- (I < u) Y u 2 - 1 2 o (O<u<a) I/Y u 2 -a 2 (a < II) 2 arcsinu (0< 1) u< 'IT Yl-u2 2 log(u- y u 2 -1 ) (\ < u) 'IT vu2-1 (arcsinhu)/Y 1 + u 2 Notes (2) Ei is the exponential integral function ( Table 19.11.3, this Appendix). (X 2 (3) We put rp(x) = J o e l dt. 12. Laplace Transforms and Operational Calculus (I) Laplace Transforms (240 Laplace Transform) Laplace transform V(p)= foooe-PIF(t)dt (Rep>O). Inverse transform (Bromwich integral)  1 c + ioo ePIV(p)dp = { F(t) 2'1T1 c-;oo 0 (r > 0), (t<O). 
1765 App. A, Table 12.11 Laplace Transforms and Operational Calculus F(t) V(p) F(t) V(p) I/p Jv(t) (ReI'> -I) (Y I+p 2 -P)" I YI+p2 l(t- a)= {  (O';;;t<a) e-ap /p I (ReI'> 0) (Y a 2 + p2 -P)" (a.;;; t) f.Jv(at) I'a v [x/a] (integral part) I/p(eap-I) t,,-I (Rea>O) r(a)/p" tVJv(at) (ReI' > - t) (2a)"r[1' + (I/2)] V;; (2 2)"+(1/2) w p +a e- at I/(p+a) e-att a - 1 (Rea >0, a>O) (p+a)-"r(a) t V / 2 J v (xYI ) e-x'/4p (Re v > -I) 2']1 v+ I e-a'F(t) (a>O) V(p+a) (I-e-')/t 10g(1 + P -I) Jo(t) (I + p2)-1/2 (wt) -1/2e - x' /4' p-I/2e- xvfi (x > 0) Jo(xYt) e- x '/4p /p logt -(1ogp+ C)/p(l) No(t) 2 log(Yi + p 2 -p) - 'IT YI+p2 sinat a/(p2+a 2 ) L. (t)(2) ( p I r cos at p/(p2+a 2 ) sin(xYI) Y;  -x'/4p t"L")(t) r(a+n+1) I ( P-I r 2 p3/2 e n! p"+1 p I . (4) t-I/ 2 COS(xYt) Y w/p e- x '/4p H 2n + I(Yt )(3) Vf (2n + I)!! ( - p) 2 pn+(3/2) t-1sinxt arctan(x/p) x-I(I-cosax) pog[1 +(a 2 /p2)] a/(p2-a 2 ) H 2n (Yt ) - (I-p)" sinh at Yw (2n-I)!!- Yt pn+(1/2) cosh at p/(p2_ a2) Notes (1) C is Euler's constant. (2) Ln(t) is a Laguerre polynomial. (3) Hn(t) is a Hermite polynomial. (4) (2n + I)!! = (2n + I )(2n - 1 )(2n - 3).. .5.3. I. (II) Operational Calculus (306 Operational Calculus) Dirac delta function (unit impulse function) { 0 (t<O) l(t) = 1 (t;;;. 0). 8(t)= lim- 2 1 [l(t+€)-l(t-€)]. .-->0 € Heaviside function (unit function) When an operator (p) operates on l(t) and the result is A(t) we write (p)l(t)=A(t). In the following table (i) of general formulas, we assume the relations i(p)l(t)=Ai(t) (i= 1,2). Carson's integral (p)=p !octO e-P'A(t)dt (Rep >0). Laplace transform (p ) i oc V(p)= - = e-P'A(t)dt. p 0 
App. A, Table 13 1766 Conformal Mappings (i) General Formulas (ii) Examples (p) A (t) (p)=pV(p) A (t) I(P)+ 2(P) A I (t)+A 2 (t) P oCt) al(p) aA I (t) lip n (n = 0, 1, 2, ...) W/n!)l(t) PI(P) A1(0)o(t)+Ai(t) pi (p + a) (e- at )l(t) p2 /(p2 + a2) (cosat)I(t) 1 foIAI(T)dr ap l(p2+ a 2 ) (sin at) 1( t) -,(p) P I(ap) AI(t/ a) [pl(p + a)]S1,(p + a) e-aiAI(t) (Rea:> 0) al a] / t t 2 \ _ ao++--";;-+'" t ao+ a) if + a2 t.- + .., }I(t) P r I fAI(T)Ait-T)dT n Bk n Bk -[Z,(P)2(P) -  -(e Pkt --I)I(t) p k=1 P-Pk k=l Pk = fo'A I (t-T)A 2 (T)dT n B =Q(O)I(t)+ L ePkt k=l Pk 13. Conformal Mappings (77 Conformal Mappings) Original Domain Image Domain Mapping Function Izi < I (unit disk) Iwl<1 z-zo /zo/<I, 1£1= 1 (g'neral form) w=€- l-z o z' Imz > 0 (upper half-plane) Iwl<1 Z-Zo Imzo>O, 1£1=1 (general form) w=€---=-, Z-Zo Imz > 0 (upper half-plane) O<argz<a (angular domain) Izl < 1,lmz >0 (upper semidisk) O<argz < a, Izi < 1 (fan shape) z-p a<arg- </3 z-q (circular triangle) 0< Imz < 1J (parallel strip) Rez<O, O<lmz<1J (semiparallel strip) y2>4c 2 (x+C 2 ), z=x+iy, c>O (exterior of a parabola) y2 < 4c 2 (x + c2), Z = x + iy, C > 0 (in tenOr ot a parabola) 1m w > 0 az+b w= cz+d ' a,b,c,darereal; ad-bc>O (general form) 1m w > 0 w=z,,/a 1m w > 0 W= (  ) 2 l-z 1m w > 0 w= ( I+Z"/c< ) 2 I-z"/C< O<argw< y _' z-p  w= ( e 1C<_ ) fJ-a z-q 1m w > 0 w=e"Z/TJ Imw>O, [wi < 1 w=e"Z/TJ 1m w > 0 w = -vz - ic 1m w > 0 . 'T'i-VZ w = I sec ----;-- 2lC 
1767 App. A, Table 13 Conformal Mappings Original Domain Mapping Function Image Domain x 2 [e+O/e)]2 y2 + 2 > 1, [e-O/e)] z = x + iy, e > 1 (exterior of an ellipse) x 2 y2 ---<4 cos 2 o: sin 2 o: ' z=x+iy, 0< 0: <'IT/2 (exterior of a hyperbola) x 2 y2 cos 2 a - sin 2 o: > 4, x>O, z=x+iy, 0< 0: < 'IT /2 (right-hand side interior of a hyperbola) Izl< 1 Izl< I Izi < 1 -'IT /2< Rez< 'IT/2 (paral1el strip) -7T< Imz < 'IT (paral1el strip) Arbitrary circle or half plane Arbitrary circle or half-plane Imz > 0 Imz > 0 Iwl>e 1m w > 0 1m w > 0 Slit domain with boundary IRewl <;; 2, Imw=O Slit domain with boundary !wi  1/4, argw=i\ Slit domain with boundary Iwl  1/4 1 / P argw=i\ + (2j'IT /p), j = 0, ... ,p - I Slit domain with boundary IRewl  I, Imw=O Slit domain with boundary Rew<;; -1, Imw=:!:'IT Interior of an n-gon Exterior of an n-gon Interior of an equilateral triangle Interior of an isosceles right triangle z + \17="4 w= 2 1 z=w+ - w ( , ) ,,(,,-2a) -ia Z+ v r - 4 w= e 2 =1- [( Z+v?=4 )  w 2i 2 +( ,-VF )  ] I w=z+ - z z w= O+e- iA z)2 z w= 0+ e- iPA z P )2/p w = sin z w=z+e Z w = e j Z IT (t - z) ar I dt + e' (e "'" 0, j=l e' are constants), where the inverse image of the vertex with the inner angle O:j'IT (j = 1,..., n) is z = Zj' When Zn = 00, we omit the factor (t - znt n -1 (Schwarz-Christoffel transformation) n w=e jZ(t_p)-2 II (t-zy-aJdt+e' j = 1 (e "'" 0, e' are constants), where the inverse image of the vertex with the inner angle O:j'IT (j = 1, ..., n) is z = Zj' and the inverse image of 00 is z = P w = f a z 1 dt Jr V t2(1-t)2 i z 1 w= dt a ij t 2 (1- t)3 
App. A, Table 13 Conformal Mappings 1768 Original Domain Mapping Function Image Domain Imz>O Izi < 1 0< Rez <wI' O<lmz <w3/i (rectangle) - K<Rez< K, O<lmz<K' ( ) (1) ,rectangle, v < Izi < 1 Imz < 0 (upper half- ring domain) Izi < 1 Imz >0 Izl< I Imz >0 11"1> 1, -1/2<Re1"<0 11"+ 1/21 < 1/2, -1 <Re1"<O Interior of a right triangle with one angle 'IT/6 rz 1 w=)o t3(I_t)4 dt Interior of a regular n-gon w= {(1-tn)-2/ndt 1m w > 0 w= (zI2wh2w3) (is the Weierstrass- function) 1m w > 0 w=sn(z,k), z (W I dl =)0 V (1- t 2 )(I- k 2 t 2 ) (sn is Jacobi's sn function) logq<Rew<O, O<lmw <'IT (rectangle) u: + v: <I, w= Y A2-B2 sin ( 2 (2z/kO+z2) dt ) , A B )0 V (1- t 2 )(1- k 2 t 2 ) W = u + v, A > B > 0; (1) ( . . f 11 ' ) 1 1 A + B K' mtenor 0 an e Ipse -;;; og A _ B = K w'" 3 ( w" ) 2 {w;Z}=7-I W' =R(z) (R (z) is a rational function) (I I I _!+..!. -+..!. Z )0 t - 1: - 2k (I - t) 6 2' (1 - Z 3t) 6 2k dt w= fo It- !-(I- t) -+(I- z 3 t) -+ dt The vertices are the images of z = 1, e 2 ";/3. and e 4fT ;/3; and [ dw ] r[(5/6) + (1/2k)]r(2/3) dz z-o = r[(I/6) + (1/2k)]r(4/3) w = logz in terior of a circular polygon Interior of an equilateral circular triangle with inner angle 'IT / k, 1 < k <. 00 Interior of a circular triangle with inner angles 'lTa, 'IT/3, 'IT''{, a + /3 + Y < 1(2) (t_ I+.;hY (1- t)_!.:t:_.-'i-='"\I_ ztr--'-=¥= dt w= fo1t- '+.;II+Y (I_ t)- '-.-r y (1- t+zt)_ '-.;II-Y dt ImJ > 0 J =J (1"), 1"=W3/WI' J= gV(g- 27g)(the absolute invariant of the elliptic modular function); J(e 2 ";/3)=0, J(i)= 1, J(oo)= 00 X=X( 1"), 1"=w3/Wh X= (e2 - e3)/(e\- e3); [ 2 2 4 X(1") -X(1") + I] J(1")=- - 27 X( d[X( 1") - d ' X(-1)=oo, A(O)=I, X(oo)=01 ImX<O Notes (1) K,K',k' are the usual notations in the in the theory of elliptic integrals: K= (I 1 dt, K'= (I 1 dt, k2+k'2=1. )0 V (1- t 2 )(1- k 2 t 2 ) )0 V (1- t 2 )(1- k'2t 2 ) (2) When a + /3 + Y = 1, the circular triangle is mapped into the ordinary linear triangle by a suitable linear transformation, and we can apply the Schwarz-Christoffel transformation. When IX + /3 + y> I, we have a similar mapping function replacing the integral represen ta- tions of hypergeometric functions in the formula by the corresponding integral representa- tions of the hypergeometric functions converging at a, /3, and y. 
1769 App. A, Table 14 Ordinary Differential Equations 14. Ordinary Differential Equations (I) Solution by Quadrature a, b,e, ... are integral constants. (1) Solution of the First-Order Differential Equations (313 Ordinary Differential Equations). (i) Separated type dy j dx = X (x) Y (y). The general solution is f y dy f x Y(y) = X(x)dx+e. (ii) Homogeneous ordinary differential equation dy j dx= f(y j x). Putting y = ux, we have dujdx=[f(u)-u]x, and the equation reduces to type (i). The general solution is x=eexp[f u f(:;-U ] (u=f). (iii) Linear ordinary differential equation of the first order. dy j dx + p(x)y + q(x) = O. The general solution is y=[e- fq(X)P(X)dX]/ P(x), where . p(x)=:ex p [ f p(x)dx l (iv) Bernoulli's differential equation dy j dx + p(x)y + q(x)y" =0 (a *0,1). Putting z= yl-", the equation is transformed into dzj dx + (1- a)p(x)z + (1- a)q(x) =0, which reduces to (iii). (v) Riccati's differential equation dy j dx+ ay2= bx m . If m= -2, 4kj(l-2k) (k an integer), this is solved by quadrature. In general, it is reduced to Bessel's differential equation by ay = u' j u. (vi) Generalized Riccati differential equation dy j dx + p(X)y2+ q(x)y + r(x)=O. If we know one, two, or three special solutionsy = Yi(X), the general solution is represented as follows. WhenYl(x) is one known special solution, y= Yl (x)+ P(x) / [f p(x)P(x)dx+ e]. where p(x)exp[ - f { q(x) +2P(X)Yl (x)} dx l WhenYl(x), Y2(x) are the known solutions, =::; = eex p [ f p(x){Y2(x)- Yl (x)} dx l When Yl(X), Y2(x), YJ(x) are known solutions, y - Yl (x) YJ(x) - Yl (x) =e Y-Y2(x) YJ(x)-Y2(x)' (vii) Exact differential equation P(x,y)dx+Q(x,y)dy=O. If the left-hand side is an exact differential form, the condition is ap jay = aQ j ax. The general solution is fPdx+ f( Q- ;y fpdx)dy=e, (viii) Integrating factors. A function M (x,y) is called an integrating factor of a differential equation P (x,y)dx + Q (x,y)dy =0, if M (x,y)[P(x,y)dx + Q (x,y)dy] is an exact differential form d<p(x,y). If we know an integrating factor, the general solution is given by <p(x,y) = e. If we know two independent integrating factors M and N, the general solution is given by M j N = e. (ix) Clairaut's differential equationy = xp + f(p) (p = dy j dx). The general solution is the family of straight lines y = ex + f( e), and the singular solution is the envelope of this family, which is given by eliminating p from the original equation and x + J'(p)=O. (x) Lagrange's differential equationy = xq;(p) + 1[;(p) (p  dy j dx). Differentiation with respect to x reduces the equation to a linear differential equation [<p(p) - p](dx j dp) + <p'(p)x + 1[;'(p) = 0 
App. A, Table 14. I Ordinary Differential Equations 1770 with respect to x, P (see (iii». The general solution of the original equation is given by eliminating P from the original equation and the solution of the latter linear equation. The parameter P may represent the solution. If the equation P = cp(p) has a solution P = Po, we have a solution y = Pox + \f;(po) (straight line). This solution is sometimes the singular solution. (xi) Singular solutions. The singular solution of f(x,y,p) =0 is included in the equation resulting from eliminatingp fromf=O and af / ap =0, though the eliminant may contain various curves that are not the singular solutions. (xii) System of differential equations. eq.(l) dx:dy:dz=P:Q:R. A function M(x, y, z) is called a Jacobi's last multiplier for eq. (1) if M is a solution of a partial differential equation (8MP/8x)+(8MQ/8y)+(8MR/8z)=0. If we know two independent last multipliers M and N, then M/N =c is a solution of eq. (1). If we know a last multiplier M and a solution f = a of eq. (1), we may find another solution of (1) as follows: solving f = a with re- spect to z and inserting the solution into eq. (1), we see that M(Qdx-Pdy)/fz is an exa,t differen- tial form dG(x, y, a) in three variables x, y, and a. Then G(x, y,J(x, y, z)) = b is another solution of eq. (1). (2) Solutions of Higher-Order Ordinary Differential Equations. The following (i)-(iv) are several examples of depression. (i)J(x,y(k), y(k+!>, ... ,y(n» = 0 (0< k <; n). Sety<k)= z; the equation reduces to one of the (n- k )th order in z. (ii)J(y ,y',y", ... ,y(n» = O. This is reduced to (n - l)st order if we consider y' = p as a variable dependent on y. (iii) y" = fey). The general solution is given by -1/2 x=a:t f[2 f f(y)dY+b] dy. We have a similar formula for y(n)= f(y(n-2». (iv) Homogeneous ordinary differential equation of higher order. If the left-hand side of F(x,y, y', ... ,y<n»=o satisfies the homogeneity relation F(x,py,py', .., ,py<n»= p«F(x,y,y', .,. ,y<n», the equation is reduced to one of the (n - 1 )st order in u by u = y' / y. If F satisfies F(px, pi y, pl-I y', ..., p I-n y(n» = p«F(x,y,y', ... ,y<n», then u = y / ),1, t = logx reduces the equation to one of type (ii) not containing t. (v) Euler's linear ordinary differential equation. Pn(x)x n y(n)+Pn_I(X)X n - 1 y(n-I)+... +PI(x)xy'+Po(x)y=q(x) is reduced to a linear equation by t = logx. (vi) Linear ordinary differential equations of higher order (exact equations). A necessary and sufficient condition that L[y]== J=op/x)y(j)= X (x) is an exact differential form is ;=o( -1))pP)=O, and then the first integral of the equation is given by n-\n-j-\ L L (-l)kpkkj+ly0)= f X(x)dx+ c. jO k=O (vii) Linear ordinary differential equation of higher order (depression). n L[yJ== L Pj(x)yU)=X(x). j=O If we know mutually independent special solutions y I (x), ... ,y m( x) for the homogeneous linear ordinary differential equation L[y] = 0, the equation is reduced to the (n - m)th linear ordinary differential equation with respect to z by a transformation z = A (y), where A (y);; 0 is the mth linear ordinary differential equation with solutionsYI(x), ...,Ym(x). For example, if m= I, the equation is reduced to the (n -l)st linear ordinary differential equation with respect to z by the transformation y(x)=y!(x) fz(x)dx. Also, if n = m = 2, the general solution is y= ClYI + C2Y2 - YI f TY2 dx + Y2 f TYI dx, where T(x)== X (x)/[YI (x)y;(x)- yix)y;(x)]. The denominator of the last expression is the Wronskian of y! and Y2. 
1771 App. A, Table 14.1 Ordinary Differential Equations (viii) Regular singularity. For a linear ordinary differential equation of higher order, eq. (1) xyn)+Xn-pl(X)y(n-l)+... +Pn(x)y=O, the point x = 0 is its regular singularity if plx), ... ,Pn( x) are analytic at x = O. We put Po= 1 and 00 n  f. (p)x' ==  Pn-/x)p(p- 1)... (p- j + 1). .-0 j=O If P is a root of the charactenstic equation fo(p) = 0 and P + I, P + 2, . .. are not roots, we can determine the coefficients c. uniquely from eq. (2) m  cJm-. (p+ /1)=0 .=0 (m= 1,2, ...), starting from a fixed value Co (¥'O), and the series y = xP =oc.x' converges and represents a solution of eq. (1). If the differences of all pairs of roots of the determining equation are not integers, we have n linearly independent solutions of eq. (1) applying the process for each characteristic root. H there are roots whose differences are integers (including multiple roots), we denote such a system of roots by PI' ..' ,Pl' We arrange them in increasing order, and denote the multiplicities of the roots by e l , ... ,e" respectively. Put qk = Pk - PI (k = 1,2, ...,1; 0= ql < q2 < ... < q,). Take N ;;. q, and a constant c (¥'O). Let A be a parameter, and starting from co= CO(A)== cII= doCA + k), we determine c. = C.(A) uniquely by the relation (2). Putting mk==ek+ek+I+...+e, (k=I,...,l) forhin mk+l<h<mk-l, the series [ ah 00 ] 00 [ h ( h ) . J y= ----'J;x A  C.(A)X' =x Pk  X'  . dJ)(Pk)(iOgX)h- J aA .=0 A=Pk .=0 j=O ) converges and gives ek independent solutions of eq. (1). Hence for k = I, ...,1, we may have _Iek = m l mutually independent solutions of (1). Applying this process to every characteristic root, we have finally n independent solutions of (1) (Frobenius method). In the practical computation of the solution, since it is known to have the expression (3), we often determine its coefficients successively by the method of undetermined coefficients. eq. (3) (3) Solution of Linear Ordinary Differential Equations with Constant Coefficient ( 252 Linear Ordinary Differential Equations). Let ai' ajk be constants. We consider the following linear ordinary differential equation of higher order (eq. (1» and system of linear ordinary differential equations (eq. (2». eq. (1) n  (Xiy(i)=X(x). i=O n yj=  (Xj/(Yk+Xj(x) (J= 1, ...,n). k=l (i) The general solution of the homogeneous equation (cofactor) is given by the following formulas: for eq. (I) y = Xl expAkx (J=O, I" .. .,e k - 1; k = I, ... ,m), eq. (2) for eq. (2) m y/x)=  Pjk(x)expAkx (J= I, ...,n), k=l where AI' .. ., A", are the roots of the characteristic equation of eq. (1) or eq. (2) given by n eq. (1')  (XiAi=O, i=O eq. (2') det( (Xjk - M jk ) = 0, respectively. We denote the multiplicities of the roots by e 1 , ...,e m (e 1 +... +em=n); Pjk(X) is a polynomial of degree at most e k -1 containing e k arbitrary constants. If aU the coefficients in the original equation are real, and the root A. k = Ilk + iV k is imaginary, then Ik = Ilk - iV k is also a root with the same multiplicity. Then we may replace exp A.kX and explkx by eXPllkxcOSVkX and eXPllkxsin Vkx, and in this way we can represent the solution using real functions. (ii) Inhomogenous equation. The solution of an inhomogeneous linear ordinary differential equa- tion is given by the method of variation of parameters or by the method described in Section (2)(vii). 
App. A, Table 14.11 Ordinary Differential Equations 1772 We explain the method of variation of parameters for eq. (2). First we use (i) to find a funda- mental system of n independent solutions Yj = <Pjk(X) (k = 1, ... ,n) by (i». InsertingYj =, k_lCk(X)<Pjk(X) into eq. (2), we have a system of linear equations in the ck(x). Solving for the ck(x) and integrating, we have ck(x). Special forms of X(x) or Aj(x) determine the form of the solutions, and the parame:ters may be found by the method of undetermined coefficients. The following table shows some examples of special solutions for eq. (I). In the table, a, k, a, b, c, are constants, P r , qr are polynomials of degree r, and fa is the operator defined by fa. F =  [sinax f cosax.F(x)dx -cosax f sinax.F(x)dX] (a¥'O). X(x) Pr(x) ke ax eaxPr(x) cos(ax+b) } sin(ax+b) cos(ax+b) } sin(ax + b) Condition A = 0 is an m-tuple root of(1 ') A= a is anm-tuple root of (1') A = a is anm-tuple root of (1 ') { (I')=<p(A2)+A1f(A2), and } <p( - a 2 ) + a;P( - a 2 ) ¥'O j (I,)=g(A)/!(A 2 ) and!(2» ) is divisible by (A 2 + a 2 ) (but not by (A2+a 2 )m+l) Special Solution xmqr(x) cx me ax xmqr(x)e ax clcos(ax+b)+ c 2 sin(ax+ b) c(l r { cos(ax+b) a sin(ax+b) (II) Riemann's P-Function and Special Functions (253 Linear Ordinary Differential Equations (Global Theory» (1) Some Examples Expressed by Elementary Functions. A,B are integral constants. pO b JJ. JJ.' c -JJ. -JJ.' { A( := r +B C= ( + ( : =  n A + B 10_( : =  ) ] (JJ.=JJ.'). (JJ.¥' JJ.'), p(   X)=A+Bf(x-a)A-l(x-b)-l(X-c)'-ldX (A+JJ.+P=1). These are for finite a, b, c. If c = 00, x - c should be replaced by 1. { 00 0 P 0 a' 0 1 { 00 P a 1-0 o 0 { 00 P a' 1-0' x} (::;xB::') ::::;:  x}A+BJ'X'-'dx (o,cO).  X } =xoeax[A+B fe(a'-a)xxO'-O-ldX] (a¥'a'). 0' Riemann's P- function is reduced to Gauss's hypergeometric function with paramet.ers a = A + JJ. + P, f3 = A + JJ. + p', y = 1 +A + A' by transforming a, b, c to 0, 1, 00 by a suitable linear transforma- tion and by putting z=x-A(x-l)-y. (2) Representation of Special Functions by Riemann's P-function. (i) Gauss's hypergeometric differential equation x(1- x)y" +[ y - (a + f3 + l)x]y' - ally =0. y=p ( g  X ) . 1- y f3 1 o y-a-f3 A special solution is F(a,/3, y; x) (206 Hypergeometric Functions). 
1773 App. A, Table 14.11 Ordinary Differential Equations (ii) Confluent hyper geometric differential equation xy" - (x - JJ.)y' - AY =0. y=p {  1 JJ.-A A special solution is 00 r(A+k) r(JJ.) Xk lFl(A,JJ.;x)=-kO f(A) f(JJ.+k) k!' ( "' ) Wh o k ' d ' ff . 1 . ,, [ 1 k (1/4)- n 2 ] 0 lit ltta er s t erentla equatIOn y + - 4 + x + x 2 Y = . y=p { 1;(-; l/:+n X } . -1/2 -k 1/2-n o } o . l-JJ. x Special solutions are Mk,n(x), Wk,n(x). (iv) Bessel's differential equation xy" + xy' + (x 2 - v 2 )y = O. y=p {  - i 1/2 ° xl V -v Special solutions are J.(x), N,(x) ( 39 Bessel Functions). When m=O, 1,2,... ,Jm-l/2(X) = ( _l) m 2 m + 1 / 2 n -1/2 x m - 1 / 2 dm(cos x)/d(x2)m. (v) Hermite's different equation (parabolic cylindrical equation) y" - 2xy' + 2ny =0. y = p !  : n / : 1 (n+ 1)/2 1/2 When n = 0,1,2, ..., the Hermite polynomial Hn(x)= ( _l)nrn/2e x2 d n ( e _X2)/ dx n is the solu tion. (vi) Laguerre's differential equation xy" + (1- x + 1 )y' + ny = O. x,) y=p {  : X } . 1 I+n+l -I When n =0,1,2,..., the Laguerre polynomial L(x)= (l/n!)x- 1 e X dn(x'+le-X)/dx' is the solution. (vii) Jacobi's differential equation x(l-x)y" + [q-(p + l)x]y' +n(n + p)y=O. y=p( lq qp Pnn x). When n = 0, 1,2, ..., the Jacobi polynomial f(q)x1-q(l-x)q-p d n [ x q + n - 1 (l-x)p+n- q ] Gn(p,q;x)= f(n+q) dx n is the solution. (viii) Legendre's differential equation (1- x 2 )y" -2xy' + n(n+ l)y=O. y=p( 1 n!n l x). When n=O, 1,2, "', the general solution is d dnn [ A(X2-l)n+B(X2_1)n j dx ] . X (x2_l)n+l The Legendre polynomial Pn(x) = [d n {(x 2 -I)"} / dx nJ/2nn! is a special solution. 
App. A, Table l4.I11 Ordinary Differential Equations 1774 (3) Solution by Cylindrical Functions of Ordinary Linear Differential Equations of the Second Order. We denote cylindrical functions by Cv(x) (39 Besse! Functions). Equation 1 2 [ 2 a 2 _,,2 y 2 ] y" +  y'+ (f3yx Y - 1 ) + y=O x x 2 y"+[ 12a -2 f3Y ix Y - 1 ]y'+[ a2:2y2 -f3y(y-2a)ix Y - 2 ]y=0 y" + [  - 2u(x) ]y' + [ 1- :: + U(X)2 - u'(x)- ux) ]y =0 y" + a 2 ,,2x 2v - 2 y =0 y" + (e 2x _ ,,2)y =0 x 2 y" + xy' +(f3 2 x 2 - ,,2)y =0 x 2 y" + xy'-(x 2 + ,,2)y=0 Solution y=x"'C v (f3x Y ) y = x "'exp(if3.x Y)C v ( f3x Y ) y=ex p [ f u(.x)dx ]Cv(X) y= vx C 1 / 2v (ax V ) y=Cv(e X ) y = C v ( f3x) y = Cv(ix) (modified Bessel function) (III) Transformation Groups and Invariants o 0 Let U == -+IJ- be the infinitesimal transformation of a given continuous transformation group ox oy . 0 0 0 . of two vanables, and U'==-+'1-+'- be that of Its extended group. ox oy op We have  = a't) + ( a't) _  ) _ 2 dg ax p ay ax P ax' We put d 2 y r=. dx Let a, 13 and y be invariants of the Oth, first, and second order, respectively. The general form of the diffferential equation of the first or of the second order invariant under U is given by cf>(a, 13) =0 (or f3=£(a», and 'I'(a,f3,y)=O (or y=G(a,f3», respectively, where F, ''1>, 'I', G denote arbitrary functions of the corresponding variables. dy P= dx ' Group With Infinitesimal Transformation U Invariants Note  11  Oth 1st 2nd 0 1 0 x P r (I) 1 0 0 y P r (I) -y x l+p2 x2+ y2 (y - xp)j(x + yp) r j(1 + p2)3/:' (2) 0 y P x pjy rjy (3) x 0 -p y xp x 2 r (3) x y 0 yjx p xr x -y -2p xy X7Y x 3 r (4) f.tX "y (,,- p,)p yP-jx V xI-v/'jJ or pxjy rp,jxv-p--I p, " 0 /x-p,y p r 0 h(x) h'(x) x h(x)p - h'(x)y h(x)r- h"(x)y key) 0 _ k'(y)p2 1 k'(y) r k"(y) (5) y ---x -+-- P k(y) p3 k'(y)p 0 k(y) k'(y)p P r k'(y)p2 (6) x ---- key) k(y) [k(y)J2 h(x) 0 - h'(x)p y h(x)p (h(x»2 r + h(x)h'(x)p 
1775 App. A, Table 15 Total and Partial Differential Equations Group With Infinitesimal Transformation U Invariants Note  11 r Oth 1st 2nd 0 h(x)k(y) h'(x)k(y) P h'(x) fY dy (7) x --- - - + h(x)k'(y)p key) hex) key) xh(x) yh(x) h'(x)(y- xp) y (p-f)h(x) (x 2 , ) - --1 h(x) x xp-y + h'(x) 1_p 2 x2- y2 1-px+y , Y x --or (1 _ p2)3/2 l+px-y (x - yp)j(l + p)(x - y) Notes (1) Parallel translation. (2) Rotation. (3) Affine transformation. (4) Similar transformation; the equation is a homogeneous differential equation. (5) Linear differential equation. (6) Separated variable type. (7) When k(y)= yn, the equation is Bernoulli's differential equation. Reference [1] A. R. Forsyth, A treatise On differential equations, Macmi11an, fourth edition, 1914. 15. Total and Partial Differential Equations (I) Total Differential Equations (428 Total Differential Equations) Suppose we are given a system of total differential equations n dz j = L P jk (x; z) dXk (j= 1,2, ... ,m). k=l A condition for complete integrability is given by aPjk(x;z) aljk(X;Z) apj/(x;z) apj/(x;z) a +L a Pi/(x;z)= a +L a Pik(X;Z).  i   i  Under this condition, the solution with the initial condition (x, "', x; z,..., z) is obtained as follows: First, solve the system of differential equations dz j /dx 1 = Ij1 (XI, xg, ..., x; z) in XI with the initial condition zix)= z?, and denote the solution by Zj= qJiX1)' Next, considering XI as a parameter, solve the system of differential equations dz)dx2 = Ij2(X 1, X2' x, '" ,x; z) in X2 with the initial condition Zj(xg) = qJj(x 1), and denote the solution by Zj = qJj(x l' X2)' Repeat the pro- cess, until we finally have Zj= qJj(x 1, '" ,x.), which is the solution of the original equation. Or, if we have m independent first integrals fj(x; z) = c j of the equation dz j /dx 1 = Ij1 (x; z), we may transform the equation into dU j = =1 Qjk(X; u)dx k by the transformation Uj = fj(x; z). Since the Qjk(X; u) do not involve XI and the equation is a completely integrable total differential equation, we have reduced the number of variables. We obtain the general solution by repeating this process n times. For P (x,y,z) dx + Q (x,y,z) dy + R (x,y,z)dz = 0 (n = 3, m = 1), the complete integrability condition is P ( aQ _ aR ) + Q ( aR _ ap ) + R ( ap _ a Q ) = o. az ay ax az ay ax 
App. A, Table 15.11 Total and Partial Differential Equations 1776 (II) Solution of Partial Differential Equations of First Order (322 Partial Differential Equations (Methods of Integration), 324 Partial Differential Equations of First Order) Let z be a function of x and y, and p= 'dzjax, q= azj'dy, r= 'd 2 zjax 2 , $= 'd 2 zj'dx'dy, t= 'd 2 zj'dy2. We consider a partial differential equation of the first order F(x,y,z,p,q)=O. (I) The Lagrange-Charpit Method. We consider the auxiliary equation dx_dy_ dz -dp -dq F;, - Fq - p+qFq Fx+pFz F'y+qFz' which is a system of ordinary differential equations. Let G (x,y,z,p,q)= a be the solution of the auxiliary equation. Using this together with the original equation F=O, we obtainp = P(x,y,z,a), q= Q (x,y,z,a), and the complete solution by integrating dz = P dx+ Qdy. If we know another solution of the auxiliary equation H(x,y,z,p,q) = b independent of G= a, we have thl: complete solution z = lI>(x,y, a, b) by eliminating p and q from F=O, G = a, and H = b. (2) Solution of Various Standard Forms of Partial Differential Equations of the First Order. The integration constants are a, b. (i) f(p, q) = O. The complete solution is z = ax + cp(a)y + b, where the function t = cp(a) is defined by f(t,a)=O. (ii) f(px, q) = 0, f(x, qy) = 0, f(p/z, q/z) = O. These equations reduce to (i) if x = eX, y = e Y , z = e Z , respectively. (iii) f(x,p,q)=O. If we can solvefor p = F(x,q), the complete solution is z = j F(x,a)dx + ay + b. A similar procedure applies tof(y,p,q)=O. (iv) f(z,p, q) = O. Solve f(z, t,at) = 0 for t = F(z,a). The complete solution is then given by x + ay+ b= j dz/ F(z,a).1f we eliminate a and b from the complete solution lI>(x,y,z,a,b)=O and 'd II> / aa = a II> / ab = 0, we have the singular solution of the original equation. (v) Separated variable typef(x,p)=g(y,q). Solve the two ordinary differential equationsf(x,p) = a and g(y,q)= a for the solutions p = P(x,a) and q= Q (y,a), respectively. Then the complete solution is z= jP(x,a)dx+ jQ(y,a)dy+b. (vi) Lagrange's partial differential equation Pp+ Qq= R. Here P, Q, R are functions of x,y, and z. Denote the solutions of the system of differential equations dx: dy: dz = P: Q: R by u(x,y,z) = a, v(x,y,z)=b. Then the general solution is lI>(u,v)=O, where II> is an arbitrary fun,;tion. A similar method is applicable to n az  lj(xJ,,,.,xn)a-=R(xJ,...,x n ). j=l x j If we have n independent solutions Uj(x) = a j of a system of n differential equations dx) lj = dz / R (j = I, ..., n), the general solution is given by 11>( U J, .. " un) = O. (vii) Clairaut's partial differential equation z = px + q Y + f(p, q). The complete solution is given by the family of planes z = ax + by + f(a, b). The singular solution as the envelope of the family of planes is given by eliminating p and q from the original equation and x = - af/ ap and y=-af/aq. (III) Solutions of Partial Differential Equations of Second Order (322 Partial Differential Equations (Methods of Integration)) (1) Quadrature. Here cp and t/J are arbitrary functions. (i) r = f(x). The general solution is z = J J f(x) dx dx + cp(y)x + t/J(y). A similar rule applies to t = f(y). (ii) s= f(x, y). The general solution is z = JJf(x, y)dxdy + cp(x) + t/J(y). (iii) Wave equation. r-t =0. The general solution is z= cp(x + y)+ t/J(x- y). (iv) Laplace's differential equation. r + t = o. Let x + iy =, and cp, t/J be complex analytic func- tions of ,. The general solution is z = cp(') + t/J(O, and a real solution is z = cpm + ij([). (v) r+Mp=N, where M and N are functions of x and y. The general solution is given by z= J[J L(x, y)N (x, y)dx + cp(y)]/L(x, y)dx + t/J(y), L(x,y) =exp J [M(x, y)dx]. In the integration, y is considered a constant. A similar method is applicable to s+Mp=N, s+ Mq=N, and t+Mq=N. (vi) Monge-Ampere partial differential equation. Rr + Ss + Tt + U(rt - s2) = V, where R, S, T, U, V are functions of x, y, z, p, q. 
1777 App. A, Table IS.III Total and Partial Differential Equations First, in the case U = 0, we take auxiliary equations eq. (1) eq. (2) Rd y 2+ Tdx 2 - Sdxdy=O, Rdpdy+ Tdqdx = Vdxdy. Equation (1) is decomposed into two linear differential forms Xjdx+ Yjdy=O (i=1,2). The combination with (2) gives a solution uj(x,y,z,p,q)= aj' vi(x,y,z,p,q)= b j (i= 1,2), and we have intermediate integrals F;(u j , Vi) = ° (i = 1,2) for an arbitrary function F;. We have the solution of the original equation by solving the intermediate integrals. If S2*4RT, two intermediate integrals are distinct, and hence we can solve them in the formp = P(x,y,z), q= Q (x,y,z), and then we may integrate dz = P dx + Q dy. Next, in the case U*O, let Al and A2 be the solutions of U 2 A 2 + USA+ TR + UV=O. We have two auxiliary equations { AIUdy+ Tdx+ Udp=O, or A2Udx+Rdy+ Udq=O, { A2Udy+ Tdx+ Udp=O, A1Udx+Rdy+ Udq=O, and from the solutions U i = ai' Vi = b i (i = 1,2), we have intermediate integrals Fj(u i , Vi) = ° (i = 1,2). If 4(TR + UV) #S2, A.1 #A.2' we have two different intermediate integrals F; =0. Solving the simul- taneous equations F j = ° in p = P (x, y, z), q = Q(x, y, z), we may also find the solution by integrat- ing dz=Pdx+Qdy. (vii) Poisson's differential equation. P = (rt - S2)"Q, where P = P(p, q, r, s, t) is homogeneous with respect to r, s, t and we assume that Q = Q(x, y, z) satisfies iJQ/iJz # ctJ for x, y, z when rt = S2. The equation P(p,qJ(p),r,rqJ'(p),r{qJ'(p)}2)=0 is then an ordinary differential equation in qJ as a func- tion of p. We first solve this for qJ, and then solve a partial differential equation of the first order q=qJ(p) by the method (11)(2)(i). (2) Intermediate Integrals. Letf(x,y,z,p,q,r,s,t) be polynomials with respect to r, s, t. Suppose thatf(x,y,z,p,q,r,s,t)=O has the first integral u(x,y,z,p,q)=O. We insert r = _ (  + p au + s au ) / au t = _ ( au + q au + s au ) / au ax az aq ap , ay az ap aq into the original equation, and replace an the coefficients that are polynomials of s by 0. We thus obtain a system of differential equations in u. If u and V are two independent solutions of this system, an intermediate integral of the original equation is given in the form <1>( u, v) = 0. (3) Initial Value Problem for a Hyperbolic Partial Differential Equation L[u]= u xy + au x + bu" + cu = h. u(t 1/) = [(UR)A +(UR)8 ]/2+ fiR (x,y; t 1/)h(x,y)dx dy {B [ 1 ( aR au ) ] + J A 2" uai"i'-Rai"i' -{acos(n,x)+bcos(n,y)}uR ds, where Ll is the hatched region in Fig. 19, and the conormal n' is the mirror image of the normal n with respect to x = y. u(,1/) = (uR )c+ feAR (u" + au)dy + fe B R (u x + bu)dx + f fo R (x,y; ,1/)h(x,y)dx dy (characteristic initial value problem). y y A(a,) p(g,) B B(g,!3) x x. Fig. 20 Fig. 19 Here 0 is the hatched rectangular region in Fig. 20. R (x,y; t 1/) is the Riemann function; it satisfies M [R (x,y;, 1/)) = 0, Rx - bR = ° (on x = ), Ry - aR = ° (on y = 1]), R(,1/;t1/)= 1. 
App. A, Table IS.IV Total and Partial Differential Equations 1778 Example (i). u xy = h(x,y). R (x,y; t1/) = 1. u(, 1/) = i [UA + UB] + i LB[ u y cos(n,x) + U x cos(n,y) ] ds + f i h(x,y )dx dy. Example (ii). Telegraph equation uxy+cu=O (c>O). R(x,y;t1/)=Jo( 2\/ c(x-)(Y-1/»). Example (iii). u xy +  (u x + u,,) = 0 (n = a constant > 0). x+y ( x+ Y ) . ( (x-mY-1/) ) R(x,y;t1/)=  F 1-n,n;I;- . ..+1/ (x+Y)(+1/) (IV) Contact Transformations (82 Contact Transformations) We consider a transformation (Xl> ...,x.;z)(XI> ...,X.;Z). We putPj= azjax j , Ij= aZjaAj (j = 1, ..., n). The transformation is caned a contact transformation if there exists a function p(x,z,p)7=O satisfying dZ - '5:.P j dX j = p(x,z,p)(dz - '5:.P j dx). A transformation given by (2n + 1) equations Q=O, aQj ax j + PjaQj az=o, aQjax j + p}Qjaz=o generated by a generating function Q(x,z,X,Z) is a contact transformation. Generating Function p Transformation Name '5:.x j X j +z+Z -1 Aj= -Pj' Ij=-xi' Legendre's Z='5:.p j x j - z transfo rmation '5:.X 2 +Z 2 _'5:.xX-zZ Zj(2Z-z) X j = -pjZ, Pedal transformation J J J Pj= -(2X j -x j )j(2Z-z) Aj = x j - apj(1 +'5:.p])-1/2, (Aj-xY+(Z -zi- a 2 1 Pj = Pi' SimJlarity Z = Zj+ a(1 +'5:.p})-1/2 X j = x j - Pjz, '5:.(X j - xi- Z2- Z2 1 Ij= - Pj('5:.pf-l)-1/2, - y'5:. p} - 1 Z=z('5:.p}-I)I/2 (V) Fundamental Solutions (320 Partial Differential Equations H) A function (or a generalized function such as a distribution) T satisfying LT= 8 (8 is the Dirac delta function) for a linear differential operator L is caned the fundamental (or elementary) solution of L. In the fonowing table, we put · a 2 a 2 .-1 a2 · { I Ll=  2' 0=2 -  2' ,2=  xl, l(x)= 0 ;=1 ax; ax. ;=1 ax; ;=1 (x> 0) (x.;; 0) (Heaviside function). J. is the Bessel function of the first kind; K. and I. are the modified Bessel functions. (Table 19.IV, this Appendix.) s= ( t ;-X?-... -X;_I (if x. > 0 and the quantity under the radical sign is positive), (otherwise). (For Pf (finite part)  125 Distributions and Hyperfunctions.) Operator djdx Fundamental Solution d m dx m 1 (x) { x m - 1 /(m-1)! (x>O) o (xO) 1 (x 1) 1 (x 2 )...1 (x.) 1 1 - -----;-- (i == J=t) 2n X+1Y a. jaxlaX2...aX. a . a I ax + I ay 
1779 App. A, Table IS.VI Total and Partial Differential Equations Operator Fundamental Solution  m ( 1_ ) m 4'17 2 Om (O-A)m (A is real and =1= 0)  n-I a 2 ) m -L - aX n i= 1 axl f - [r(  )/2(n _2)'17 n / 2 ] r L2 1 (1/2'17)logr (n3) (n=2) [ ( n ) /2 rr m ] 1 ( n - 2m is a positive integer ) r I /2 m (m-l)!'17 n k=I(2k-n) rn-2m ora negative odd integer [ r ( !!. ) /2m(m-l)!'17n/2 IT (2k-n) ] O!;m ( n:2m= -2h, ) 2 k = 1 rhO, 1,2, .. . kofom-h 2n m  ) rm-(n/2) K(n/2)-m(2nr) (m-l! (Pfs2m-n)/'17(n/2)-122m-l(m-l)!f[m+ 1-(n/2)] I A I (n/4)-(m/2) { 1m _ (n/2)( v As ) pfs m -(n/2) _ '17(n/2)-I(m_l)!2 m - I +(n/2) J m _(n/2)(v'!A!S) (A>O) (A<O) { m-I ( ) n-I ( n-l ) X n  exp - L x;/4x n m-1)! 2Y;X;;- i=1 (Xn >0) (Xn';; 0) (VI) Solution of Boundary Value Problems (188 Green's Functions, 323 Partial Differential Equations of Elliptic Type, 327 Partial Differential Equations of Parabolic Type) L[u] = Au xx + 2Bu xy + CU yy + Dux + Eu y + Fu, M [v] = (Av)xx + 2(Bv)xy + (Cv)yy - (Dv)x - (Ev)y + Fv. Green's formula f L {vUu] - uM [v]} dxdy= £ { p( u :, - v :, ) + QUv} ds. { A cos(n, x) + B cos(n,y) = Pcos(n',x), eq. (1) Bcos(n,X) + Ccos(n,y) = Pcos(n',y). Q = (Ax + By - D )cos(n,x) + (Bx + C y - E )cos(n,y). The integration contour C is the boundary of the domain D (Fig. 21), n is the inner normal of C, and n', caned the conormal, is given by (I). y . s Fig. 21 x (1) Elliptic Partial Differential Equation L[ u] == U xx + Uyy + au x + bUy + cu:::: h. u(,1)= - £u(s)  ds+ f L G(x,y;t1)h(x,y)dxdy. Here G (x,y; t 1) is Green's function, which satisfies M (G (x,y; t 1)) = 0 in the interior of D except at (X,y)=1= (t 1), and G (x,y; ,1) = - (1 /2'17)log y( x -d + (y _1)2 + a regular function, G (x,y; t1) = 0 «x,y) E C). 
App. A, Table IS.VI Total and Partial Differential Equations 1780 a2u a2u (2) Laplace's Differential Equation in the 2-Dimensional Case - + - =0, ax 2 ay2 u(x,y) == u (r,<p) = Rej(z) (z == x + (y = reiO). (i) Interior of a disk (r';;; I). I i 27T ei<p +z j (z ) = _ 2 u (1, <p) -----,----- d<p, 'TT 0 e'<P-z I i 2X l-r2 u(r,O)=- u(l,<p) dIp 2'TT 0 1-2rcos(O-<p)+r 2 (Poisson's integration formula). (ii) Annulus (0 < q ,;;; r';;; I). j(z)= ::i {fo 27Tu (l,<pnl (w)dqJ- fo 27Tu (q,<pn3(w)d<p-alo g z} (Villat's integration formula). w=(ilogz+<p), a= ( - 2 1 _ 1/1. ) {27T{u(1,<p)_u(q,<p)}d<p, w 3 =-..!..Iogq. 'TT W3 'TT/)o WI 'TT Here l and 3 are the Weierstrass -functions (134 Elliptic Functions) with the futldamental periods 2w I and 2w3' i f oo u(t,O) I f oo u(t,O)y (iii) Half-plane (y;;'O).j(z)=- -dt, u(x,y)=- 2 dt. 'TT -00 z-t 'TT -oo(x-t) +y2 (3) Laplace's Differential Equation in the 3-Dimensional Case. (i) Interior of a sphere (r,;;; 1). _ I {7T (27T _ I - r 2 . u(r,<p,O)=4)r )0 u(l,<I>,8) 3/2 stn8d8d<I>, 'TT 0 0 (1-2rcosy+r 2 ) where cosy = cos 8 cosO + sin 8 sin 0 cos(<I> - <p). (ii) Half-space (z ;;. 0). z ff oo u(t1/,O) u(x,y,z) = 2 3/2 dgdr,. 'TT -00 {(x-d +(y_1/)2 +z2} (4) Equation of Oscillation (Helmholtz Differential Equation) .lu + k 2 u = 0, Let Un be the nor- malized eigenfunction with the same boundary condition for the eigenvalue k n , Green's function IS G(P Q)= L un(P)u:(Q) , k 2 -k;. Domain Boundary Condition Eigenvalue Eigenfunction rectangle 0< x < a, 0 < y < b u=O circle 0< r< a u=O annulus b < r < a u=O fan shape 0< r < a, 0 < q> < a u=O rectangular parallelepiped 0< x < a, 0 < y < b, O<z<c au =0 an sphere 0< r< a au =0 an n 2 m 2 knm='TT -+- 2 a 2 b (n, m= 1,2, ...) . x. Y sm n'TT (ismm<T7i k nm is the root of Jm(kx) =0 k nm is the root of Jm(ka)Nm(kb) -Jm(kb)Nm(ka)=O J m(knmr)e:!:.'m<p { Jm(knmr) _ Nm(k"m r ) } e:!:.im<p J m (knma) N m (k..ma) k nm is the root of J (ka) =0 (p.=m'TT/a) J  (k nm r )sin Iq> knm/='TT n 2 m 2 1 2 -+-+- a 2 b 2 c 2 x Y z cos n'TT (i cos m'TT 7i cos I'TT C k n / is the root of ,p (ka ) = 0, where ,pn(P)=v' 'TT/2 I n +(1/2)(P) ,pn(kn/r)Pnm(cos/ I ):!:. im<p H E . au A ( A a 2 a 2 . . . ) B d d . (5) eat qUatlOn."" = K uU u =  + '" +  ; K IS a posItIve constant. oun ary con 1- ut aXI aX m 
1781 App. A, Table IS.VI Total and Partial Differential Equations tion: hu - k au j an = <p, where hand k are nonnegative constants with h + k = 1, and <p is a given function. /. (I ( { aG(p,Q,t-7") } u(p,t)= /(P,Q,t)u(Q,O)dVQ+K Jo d7" J s an Q +G(P,Q,t-7") <p(Q,7")dS Q . Here V is the domain, and S is its boundary. G (P, Q, t) is the elementary solution that satisfies aG jat = Kt:r.G in V and kaG jan= hG on S, and further in the neighborhood of P= Q, t =0, it 2 - has the form G(P,Q,t)=(47TKt)-m/2 e -R /4.1 + terms of lower degree (R=PQ). (i) - 00 < x < 00, G = U (x -, t), where U(x, t) = e -x 2 /4.ljY 47TKt (similar in the following case (ii». (ii) 0 < x < 00. au ax = hu: u(O,t)=O: G= U(x-,t)- U(x+,t). G= U(x-,t)+ U(x+tt)-2hehLelrrju(x-1/,t)dr,. G=( x; '7")_( x H. G=( x;/ 17")+( x /7")- G =  ( x / /7") _  ( x  17") u(O, t) = u(l, t) = 0: (iii) 0< x <J au au ax (O,t)= ax (l,t)=O: au u(O,t)= ax (l,t)=O: ( x+-2/ 1 ) _ ( X--2/ 1 ) +  4/ 7"  4/ 7". Here  is the elliptic theta function:  (xl7") = 3(X, 7") =:0 1 + 2ei".,.n2 cos2n7T x. (iv) O<x<oo, O<y<oo. u(x,O,t)=u(O,y,t)=O: G = (e -(x-i /4.1 _ e _(x+2 /4<1)( e - (y -)2/4<1 _ e - (y+)2/4.t) j47TKt. (v) 0< x < a, 0< y < b. u=O on the boundary: 4   { 2 ( m2 n 2 ) } . m7TX . m7T . mry . n7TlI G= ab £.. £.. exp -K7T t 2 + 2" sm-asm-smTsmT' m=ln.1 a b a (vi) 0 < x < a, 0 < y < b, 0 < z ..; c. u = 0 on the boundary: 8 00 00 00 { ( /2 2 2 ) } G = - L L L exp - K7T 2 t - + !:!!.- +  abc 1=1 m=1 n=1 a 2 b 2 c 2 . /7TX . /7T . m'7T)l . m7T7/ . n7TZ . n7Tr X sm-sm-sm-sm-sm-sm- a a b bee Ixl = r, II = r': (vii) 0 < r < 00. Spherically symmetric. G = (e-(r-r'i /4.1 _ e -(r+d/ 4 . t ) j87Trr'( 7TKt) 1/2. (viii) 0 < r < a. Spherically symmetric. u = 0 on the boundary: 1 00 , G -  -.n2.rr 2 t/a 2 . n7Tr . n7Tr -- £.. e sm-sm-. 27Tarr' n= I a a (ix) a < r < 00. Spherically symmetric. kau jar - hu = 0 on the boundary: G= 1 12 [ e-(r-r,)2/4K1+e-(r+r'-2a)2/4.t_ ah+k (4'ITKt)I/2 8'ITrr'('lTKt) / ak xex p { Kt( a:Z k f+(r+r'-2a) a:Zk } x erfc { r2a + a::k Y Kt }] (erfcx=:o iooe- t2 dt). (x) 0 < r < 00. Axially symmetric: G = e -(,.2+r'2)/4KI[0(rr' j2Kt) j47TKt. (xi) 0 < r < a. Axially symmetric. k  - hu = 0 on the boundary: G=   Jo(ran)Jo(r'Cl. n ) e-<1X"I/a 2 ..:.. 2 2' 'ITa n=1 {Jo(aCl. n )} +{J1(aCl. n )} where Cl. n is given by kCl.nJ I (aCl. n ) - h J o( aCl. n ) = O. 
App. A, Table 16 Elliptic Integrals and Elliptic Functions 1782 16. Elliptic Integrals and Elliptic Functions (I) Elliptic Integrals (134 E11iptic Functions) (1) Legendre-Jacobi Standard Form. E11iptic integral of the first kind 1 '1' d\f; 1 sin 'P dt F(k,rp)= = o Y 1- k 2 sin 2 \f; 0 Y (1- t 2 )(1- k 2 t 2 ) (k is the modulus). Elliptic integral of the second kind j Sin 'P M 2 E(k,rp)= ('PY1-k 2 sin 2 \f; d\f;= l-k  dt. )0 1- t o E11iptic integral of the third kind 1 '1' d\f; II(rp,n,k)= o (I + nsin 2 \f;)Yl- k 2 sin 2 \f; j Sin'l' dt o (1 + n( 2 ) Y (1- t 2 )(1- k 2 t 2 ) When rp = 'TT /2, e11iptic integrals of the first and the second kinds are ca11ed complete e11iptic integrals: K(k)=F(k)== 1 "/2 d\f; = 1 ' dt == IF ( -}.-};I;k2 ) , - '2 0 Yl-k 2 sin 2 \f; 0 Y(1-t 2 )(1-k 2 t 2 ) 1 ,,/2 I ' Mk 22 1 1 _ 'TT _ 2. 2 _ - t _'TT . . 2 E(k)=E(k'I)- 0 Yl-k sm \f; d\f;- 0 1=? dt-I F (-I'I,I,k), where F is the hyper geometric function. K(k')= K(-vJ=kI)= K'(k), E(k')= E(\l l- k 2 )=E'(k) (k'2== 1- e; k' is the complementary modulus). EK' + E' K - KK' = I (Legendre's relation). 2 K (  ) = r(1/4) V2 4\1'; a a F k =  ( E- t2F _ sinrpcosrp ) , k' V 1- k 2 sin 2 rp (2) Change of Variables. aE _ E- F ak - ( l-k' ) tan( \f; - rp) = k' tanrp: F 1 + k' , \f; = (l + k')F(k,rp), ( 1 - k' ) 2 , 1 - k' . E 1+k' ,\f; = 1+k, [E(k,rp)+k F(k,rp)]- l+k' sm\f;. (1+ k )sinrp smX= : 1 + ksin 2 rp ( 2Yk ) F 1- k 'X = (1 + k)F(k,rp), E(  ,x)= lk [2E(k,rp)-k'2 F (k,rp) 2k sinrpcosrp '/- 1 k 2' 2 J + V .- sm rp . 1 + k sin 2 rp 
1783 App. A, Table 16.1 Elliptic Integrals and Elliptic Functions k l sm<PI cos<pj F(k[.<PI) E(k[. <PI) .k k' sin<p cos<p k' F(k,<p) [ (k -k2 sin<pcos<p I k' k' E ,<p) V-I - k 2 sin 2 <p V-I - k 2 sin 2 <p V 1- k 2 sin 2 <p k' - itan<p - iF(k,<p) i[E(k,<p)- F(k,<p)- V 1- k 2 sin 2 <p tan<p] cos<p 1 k sin <p V 1- k 2 sin 2 <p kF(k,<p) t[E(k,<p)- k,2F(k,<p)] k (3) Transformation into Standard Form. (i) The following are reducible to elliptic integrals of the first kind (we assume a> b > 0 for parameters). AF(k,<p) A k <p fX dt 1 Y3 -1 Y3+1-x - arc cos I V t 3 - 1 V'3 2\12 Y3-1+x fX dt I Y3 +1 Y3 -I +x 1 V l-t 3 V'3 arc cos 2\12 Y3 + I-x LX dt 1 1 l-x 2 2 - arc cos- o V I + t 4 \12 l+x 2 IX dt b . x - arc SIn Ii o V (a 2 - t 2 )(b 2 - (2) a a IX dt 2 y l_(bja)2 I-(bjx) arCSIn 2 b V (a 2 - t 2 )(t 2 - b 2 ) a I-(bja) IX dt 2 1 b 1-(ajx) - arCSIn a V(t 2 -a 2 )(t2-b 2 ) a a 2 I-(bjx) IX dt y l-(bja)2 x o V(a 2 +t 2 )(b 2 +t 2 )- a arc tan Ii IX dt 2 1 a arc sin 1+(bja) o V-(a 2 - r2)(b 2 + r2) V a 2 + b 2 2 V a 2 + b 2 1+(bjx) IX dt 1 a b arc cos - b V(a 2 +r2)(r2-b 2 ) V a 2 + b 2 V a 2 + b 2 x (ii) The following are reducible to el1iptic integrals of the second kind (we assume a > b >0 for parameters). 
App. A, Table 16.11 1784 Elliptic Integrals and Elliptic Functions AE(k,cp) A k cp I\[f2f b x  d( a - arc sin b b - ( a 0 fa b2 + (2 d( Va 2 +b 2 a x arc cos - a2 - (2 V a 2 + b 2 a IX (12 (2 + a2 d( V a 2 + b 2 a b b 2 arc cos - (2 _ b2 V a 2 + b 2 x i X (2 (2 + a2 d( Va 2 + b 2 a . ..;'2 (b/a)' (2 _ b2 arcsm - Va 2 + b 2 1+(xja) IX a2 + (2 d( a V a 2 - b 2 x 3 b 2 arc tan b (b2 + (2) a IX d V a 2 - b 2 12' 1 . 1- (bjx) b (2Y«(2_b:)(a 2 _(2) ab 2 a arcsm - 1-(bja) (II) Elliptic Theta Functions (1) For Im"t">O, we put q=e in , and define 00 1't O (U,T)=1't 4 (U,T)= 1 +2 L (-I)"qn 2 cos2mTu, n=l 00 , 1't 1 (u,T)=2 L (_1)n q [n+(1/2)1 sin(2n+ 1)'1Tu, n=O 00 , 1't 2 (U,'T)=2 L q[n+(1/2)1 cos(2n+1)'1Tu, n=O 00 1't 3 (u,T)=1+2 L qn'cos2n'1Tu. n=l Each of the four functions 1't j () = 0, 1,2,3) as a function of two variables U and 'T satisfies the following partial differential equation a 2 1't(u,'T) . a1't(u,'T) 2 =4'1Tl a au 'T (2) Mutual Relations. 1't(u)+ 1'tf(u) = 1'tt(U) + 1'tf(u), 1'tJ(U) = k1'tf(U) + k'1't}(u), 1'ti( u) = - k' 1't J 2 ( u) + k1'tf( u), 1'tf( u) = k1'tJ( u) - k' 1'ti( u), where k is the modulus such that iK'(k)j K(k) = T, and k' is the corresponding complementary modulus. k = 1'ti(O) j 1't}(0), 1't; (0) = '1T1't 2 (0)1't 3 (0)1't O (0), k' = 1'tJ(O) j 1't}(0). 1'ti"(0) 1't2' (0) 1't:i' (0) 1't6' (0) -=-+-+-. -I}; (0) -I}o(O) -I}3 (0) -I}o(O) (3) Pseudo periodicity. In the following table, the only variables in 1't are U and 'T. m and n are integers. 
1785 App. A, Table 16.II1 Elliptic Integrals and Elliptic Functions Increment of u {}o {}I {}2 {}3 Exponential Factor m+m' ( - l)n{}o ( -l)m+n{}1 ( - 1 )"1't 2 {}3 \ exp[ - mrj I (_l)m+ 1{}2 X (2u + n'T)] m-"2 + nT {}3 ( _1)m+n{}1 ( - l)n{}o m+(n+i)T ( - l)nj{} 1 ( - l)m + nj{}o ( - l)m{}3 {}2 ! ex p [ -(n+i)wi m-i+(n+i)'T {}2 {}3 ( - l)m + nj{}o ( -lti{}l x{2u+(n+i)'T} ] Zeros u = I I m m+n'T m+"2 m+"2 +( n+ i)'T +m +(n+i)'T 00 (4) Expansion into Infinite Products. We put Qo== II (1- q2n). Then we have n=l 00 {}o (u) = Qo II (1- 2q2n-1 cos2wu + q4n-2), n=\ 00 {}I (u) = 2Qoql/4 s inwu II (1- 2 q 2n cos2wu + q4n), n=\ 00 {}2 (u) = 2 Qoql/4 coswU II (1 + 2 q 2n cos2wu + q4n), n=\ 00 {}3(U)=QO II (1+2q2n-Icos2wu+q4n-2). n=1 (III) Jacobi's Elliptic Functions (I) We express the modulus k and the complementary modulus as fol1ows. {}f (0) k=- {}f (0) , Then we have {}J (0) k'=- {}f (0) , k 2 + k,2= 1. K(k)=K= I{}f(O), K'(k)=K'= -jTK. The relation between q and k is q= e i7TT = e- 7T (K'/K), ql/4= (*) 1/2[ I +2( * f + 15( * f + 150( * f + 1707( *)8 +...]. ,4/- = l L+  L 5 + ll L 9 + 11Q L I3 + 1707 L 17 + where L = 1- v 1- k 2 q 2 2 5 2 9 2 13 2 17 ..., I +  1 - k 2 (2) Functions sn, cn, dn; Addition Theorem. I {}I (uj2K) s n ( u k ) =- , - VI {}o (uj2K) , sn + cn = 1, dn 2 u + k2sn = 1. , {i7 {}2 (uj2K) cn(u,k) == V T {}o (uj2K) , {} (u j 2K) dn(u,k)==vk' 3 . {}o(uj2K) ( ) snucnvdnv+snvcnudnu mu+v= , 1 - k 2 sn sn 2 v d ( ) dnudnv- k 2 snucnusnvcnv n u+ v = . 1 - k 2 sn 2 u sn 2 v dcnu - = -snudnu du ' ( + ) cn u cn v - sn u dn u sn v dn v cn u v = , I - k 2 sn sn 2 v dsnu dU =cnudnu, d dn u 2  = - k snucnu. 
App. A, Table l6.I11 Elliptic Integrals and Elliptic Functions 1786 (3) Periodicity. In the next table, m and n are integers. Increment of u snu cnu 2mK+2niK' ( - I)m sn u (-1)m+n cnu (2m - I)K + 2niK' ( _ l)m + I cn u (_ l)m+nk' snu dnu dnu 2mK+(2n+ l)iK' ( - 1 )mk - I  (_I)m+n+lik -I dnu snu snu (2m-l)K+(2n+ l)iK' (_I)m+lk- 1 dnu (-I)m+nik'k- 1  cnu cnu dnl ( - 1)n dn u (- lrk,--L dnu i(_lr+ 1 CnU snu (_ lr ik snu cnu Zeros u = 2nK+2miK' (2n+ I)K+2miK' (2n + I)K + (2m + 1 )iK' Poles u = 2nK+(2m+ l)iK' 2nK+(2m+ l)iK' 2nK + (2m + 1 )iK' Fundamental periods 4K, 2iK' 4K, 2K +2iK' 2K, 4iK' (4) Change of Variables. In the next table, the second column, for example, means the relation sn(ku, 1/ k) = k sn(u,k). u k sn cn dn ku Ilk ksn dn cn iu k' . sn dn 1- cn cn cn k'u .k k' cn I 'p dn dn dn iku .k' ik I cn 1- dn k dn dn ik'u I ik' sn dn p cn cn cn 2Vk (1 + k)sn cndn I - k sn 2 (1 + k)u l+k - I + k sn 2 I + k sn 2 I + k sn 2 (Gauss's transformation) (1 + k')u l-k' (1 + k') sncn 1- (I + k')sn 2 I - (I - k')sn! l+k' dn dn (Landen's transformation) (l+k,)2U C- VP f I+VP k 2 sncn dn - Yk' X Y2(1 + k') dn+ VP 2 I+Vk' I-Yk' (1+dn)(k'+dn) I-Yk' 1+Yk' Y(1 +dn )(k'+dn) 2(1 + k') (1 +dn)(k' +dn) sn(u,k') Jacobi's transformation. sn( iu, k) = i ( ) , cn u,k' cn(iu,k) = (1 ) ' cn u,k' dn( u, k') dn(iu, k) = (k') . cn u, (5) Amplitude. i 'P cbJ; The inverse function <p = am(u, k) of u(k, <p)= F(k, <p)= o V 1- k 2 sin 2 1/J is called the amplitude. 
1787 App. A, Table 16.IV Elliptic Integrals and Elliptic Functions sn(u, k) = sin <p= sinam(u ,k), cn(u, k) = cos <p= cosam(u,k), dn(u,k)= V 1- k 2 sin 2 <p = V 1- esn 2 (u,k) . u(k,x)= lX dt ,x=sn(u,k). o V (1- t 2 )(1- k 2 t 2 ) 7TU 00 2 q n . ( U ) am(u,k)= 2K + L 2n SID n7T 2K n=\ n(1 +q ) am(fJ, 1) = gd9 (Gudermann function). (q= ei'lM"= e-"(K'/K». (IV) Weierstrass's Elliptic Functions (1) Weierstrass's -function. For the fundamental periods 2w\, 2w 3 , we have &;>(u)=+L' [ 1 2 - 1 2 ] U n,m (u-2nWI-2mW3) (2nwl +2mw3) 1 g2 2 g3 4 g1 6 3g 2 g3 8 = u 2 + 20 u + 28 u + 1200 u + 6160 u +.... g2=60L' 1 4 ' g3= 140L' 1 6 ' n,m (2nwl +2mW3) n,m (2nwl + 12mw3) where ' means the sum over all integers except m = n = O. S:>( - u)= s:>(u). Putting W 2 = - (WI +w3)' = s:>(w j ) (J= 1,2,3) we have e l + e2 + e3 =0, e l e2 + e 2 e 3 + e3el = - g2/4, e l e 2 e3= g3/4. 1 &;>'(u)=df.J/ du =-2l. ( 2 2 )3 ' m." u- nW 1 - mW 3 s:>'2(U) =4[ S:>(u) - el] [ s:>(u) - e2] [ s:>(u) - e3] =4s:>3(U) - g2 s:>(u) - g3' Addition theorem 2 1 [ S:>'(u)- S:> '(v) ] &;>(u+ v) = - &;>(u) - S:>(v)+ 4" &;>(u) _ &;>(v) , (e j - ek)(e j - e/) s:> (u+w j .)=e j .+ (j,k,l)=(1,2,3). s:>(u)-e j Using theta functions corresponding to T = W3/ WI' 1)1 d 2 1og-lt l (u/2w l ) ( 1 *1"(0) ) s:>(u)=-- du 2 1)\=nwI)=- 12wI *1(0) , 1 -It? (0)-It 2 (u /2WI)-It 3 (u/2w I )-It O (u /2wI) S:>'(u) = -- 4wi -It 2 (O)-It o (O)-Itf (u/2wI) . The relations to Jacobi's elliptic functions are q=exp(i 7Tw 3/ W \), ( u ) ( ) cn2u dn 2 u 1 &;> -ve::::e:. =el+ el-e3 =e2+(e\-e3)=e3+(el-e3)-' el - e3 sn-u sn-u sn2u where the modulus is k= ye 2 -e 3 , K(k)=w\ Yel-e3 ' el- e3 (2)  -function. (U)=l.+L' [ 1 + u + 1 ] u n,m U - 2nwi - 2mw3 (2nwl + 2mw3)2 2nwi + 2mw3 2 _ I g2 3 g3 5 g2 7 g2 g3 9 - -;; - 60 u - 140 u - 8400 u - 18480 u -... =GJ!w\)u+dlog*1 (u/2w l )/du. t(u) = - &;>(u). 
App. A, Table 17 1788 Gamma Functions and Related Functions Pseudoperiodicity. Putting 1'/j=nw) u= 1,2,3) we have r(u+2nwI +2mw3)= r(u)+2n't)1 +2m11J (n,m=O,:t 1, :t2, ,..), 1 -It;"(0) 1'11=- 12wI -It; (0) , 1'11+1'12+1'13=0, 1'II W 3 -1'I3 W I = 1'I2 W I -1'I1 W 2 = 1'I3 W 2 -1'I2W3 = wi 12 (Legendre's relation). Addition theorem 1 r"(u)-r"(v) r(u+v)= r(u)+ r(v)+ I r'(u)- t(v) . (3) a-function. / ( U ) [ u 1 ( U ) 2 ] a u = u II 1 - ex + - () n.m 2nw l + 2mW 3 p 2nwI+ 2mw 3 2 2nwI+ 2mw 3 [ n, m = 0, :t 1, ] (n,',(O,O) 2 g2 5 g3 7 g2 9 =u-u- 3 u- 9 2 U-... 2.3.5 2.3.5.7 2.3.5.7 _ ( 1'IIU2 ) -ItI(UI2WI) -2w I exp- 2 ( ' WI -It; 0) r(u)=a'(u)la(u). a(-u)=-a(u). Pseudoperiodicity. a(u + 2nw i + 2mw3)= (-It+ m + mn [exp(2nrII + 2m1'l3)(U + nWI + mW3)]a(u). (4) Cosigma functions alo a2, a3' a(u+w) ( 1 IIU2 ) -Itj+1 (uI2wI) a/u)= -e'ljU - exp- a (w j ) 2w I -It j + 1(0) (j= 1,2,3; -It 4 =-It O ). [ aj(u) ] 2 S:>(u) - e j = a(u) , 2al(u)a2(u)a3(u) a(2u) S:>'(2u)=- - --. a 3 (u) a 4 (u) a(ula) snu=a ( ) ' a3 ul a al(ula) cnu= a3(ula) , a2(ula) dnu= , a3(ula) , K where a = vel - e3 = - . WI References [1] W. F. Magnus, F. Oberhettinger, and R. P. Soni, Formulas and theorems for the sp(:cial functions of mathematical physics, Springer, third enlarged edition, 1966. [2] Y. L. Luke, The special functions and their approximations I, II, Academic Press, 1969. [3] A. Erdelyi, Higher transcendental functions I, II, III (Bateman manuscript project) McGraw-Hill, 1953, 1955. In particular, for hypergeometric functions of two variables see [4] P. Appell, Sur les fonctions hypergeometriques de plusieurs variables, Memor. Sci. Math., Gauthier- Vi11ars, 1925. Also  references to 39 Bessel Functions, 134 E11iptic Functions, 174 Gamma Function, 389 Special Functions. 17. Gamma Functions and Related Functions (I) Gamma Functions and Beta Functions (174 Gamma Function) In this Section (I), C means Euler's constant, En means a Bernoulli number, r means the Rie- mann zeta function. (1) Gamma function. r(z)= too e-'t Z - 1 dt 1 f (o+) =-----,--- e-'t Z - 1 dt. e 2mz _l 00 (Rez>O) 
1789 App. A, Table 17.11 Gamma Functions and Related Functions In the last integral, the integration contour goes once around the positive real axis in the positive direction. f(n+l)=n! (n = 0, 1,2, . . .), f(l/2) = V7i . f(z+ I)=zf(z), f(z)f(l-z)= , SJn'TTZ n-I . II f(z + ) =(2'TTin-l)/2n(l/2)-nzf(nz), JO 1 sin'TTz 00 r (2n + 1) 2 + I logf(l +z)= - -Iog- - Cz- L z n 2 'TTZ n = I 2n + 1 (lzl< 1). 00 =zecz II (1+)e-z/n fez) n=1 n ' I I r(x+ry) 1 2 f(x) 00 /( 2 ) II 1 1+ y 2 n=O (x+n) (x,y are real and x> 0). Asymptotic expansion (Stirling formula).  [ 00 (- 1)' - I B z I - 2n ] f(z)R;e- Z z z - 1 V2'TTz exp L 2n n = I 2n (2n - I) =e-zzZ-(l/2)V27i [ I+-L+- 139 _ 571 +0(Z-5) ] . 12z 288z2 51840z 3 2488320z 4 (Iargzl < 'TT) (2) Beta Function. B(x,y)= ftX-l(l-tY-ldt = f(x)f(y)/f(x+ y). (Rex, Rey>O) (3) Incomplete Gamma Function. Y(I',x) = rx tv-Ie -I dt = f(l') - x( v-l)/2 e -X/2W(V_I)/2 v/2(X) (ReI' > 0). )0 . (4) Incomplete Beta Function. B,,(x,y)= fo"tX-1(I-ty-ldt (O<a<; 1). d (5) Polygamma Functions. ..p(z)= dz logf(z) = g; = fo oo [ e1 - 1-ez1 ]dt=-C+ nJ n1 - zn )' 00 I ..p'(z)= L , n=O (z+n) 00 ( I ) k+lk' ..p(k)(Z)= L k+I' n=O (z+n) (k = 1,2, .. . ). (II) Combinatorial Problems (330 Permutations and Combinations) Factorial n!=n(n-I)(n-2)...3.2.1. 0!=1. Binomial coefficient ( a ) = a (a -I)... (a - r + I) r r! . (I) Number of Permutations of n Elements Taken r at a Time. nPr= n(n -I)...(n- r+ 1)= n!/(n- r)!. Number of combinations of n elements taken r at a time C= nPr = n! = ( n ) n r r! r!(n-r)! r' nCr=nCn-n nCr= n-lCr+ n-l C r - 1 . Number of multiple permutations nIIr=n r . Number of multiple combinations (n+r-I)! nHr=.+r-1Cr= ( ) . r! n-I ! Number of circular permutations nPrl r . 
App. A, Table 18 Hypergeometric and Spherical Functions 1790 n (2) Binomial Theorem. (a + hr = L () an - rb', r=O Multinomial theorem (a 1+ '" + amr =  , n! , ai'... a!:.m. PI' "'Pm' The latter summation runs over all nonnegative integers satisfying PI + ... + Pm = n. References See references to Table 16, this Appendix. 18. Hypergeometric Functions and Spherical Functions (I) Hypergeometric Function (206 Hypergeometric Functions) (I) Hypergeometric Function. . . _  r(a+ n) r(b+ n) r(c) zn F(a,b,c,z)= L.. ,. n=O r(a) r(b) f(c+n) n. The fundamental system of solutions of the hypergeometric differential equation z(l_Z) d2 +[c-(a+b+ l )zj d d U -abu=O at z=O is given by dz z U\ = F(a,b;c;z), U2=zl-CF(a-c+l, b-c+l; 2-c; z) (c*o, -I, -2, ...). F(a,b; c;z)= F(b,a;c;z). dF j dz =(abj c)F(a + I, b+ I; c+ I; z). f(c)r(c-a-b) F(a,b;c; 1)= ) ) (Re(a+b-c)<O). f(c-a f(c-b f(c) 1 1 F ( a b'c'z ) = tb-I ( I-t ) c-b-I ( I-tz ) -adt , " f(b)f(c-b) 0 (Rec >Reb>O, Izi < I), I f(c) f ;oo f(a+s)r(b+s)f(-s) F ( a b'c'z ) = - ( -z ) Sds , " 27Ti r(a)r(b) -;00 f(c+s) . (2) Transformations of the Hypergeometric Function. F(a,b;c;z)=(l-Z)-aF(a,c-b;c; zl ) =(I-zy-a-bF(c-a,c-b; c; z) _ f(c)f(b- a) ( I ) =(I-z) a ( ) ) F a,c-b;a-b+l; _ I f b f(c - a - z +(I_Z)_b f(c)f(a-b) F ( b,c-a; b-a+l;  ) f(a)f(c-b) I-z f(c)f(c-a-b) = ) ) F(a,b;a+b-c+l; I-z) f(c-a r(c-b f(c)f(a+ b- c) + ( I_z ) c-a-b F ( c-a c-b' c-a-b+1' I-z » f(a)f(b) " , = f(c)r(b-a) ( -z ) -aF ( a I-c+a- I-b+a- 1. ) f(b)f(c-a) , , , Z f(c)r(a-b) ( I ) + ( (-z)-bF b,l-c+b; I-a+b; - . fa)f(c-b) z 
1791 App. A, Table 18.1 Hypergeometric and Spherical Functions (3) Riemann's Differential Equation ( Table 14.11, this Appendix). d2u [ 1- a - a' 1- {3 - {3' 1- y - y' ] du -+ + + - dz 2 Z - a z - b z - c dz [ aa'(a-b)(a-c) {3{3'(b-c)(b-a) YY '(c-a)(c-b) ] u + + + = O. z-a z-b z-c (z-a)(z-b)(z-c) Here we have a + a' + {3 + {3' + Y + y' =:' 1 (Fuchsian relation). The solution of this equation is given by Riemann's P-function u=p {  a' ( z-a ) a ( Z-c ) Y ( = z-b z-b F a+{3+y,a+{3'+y; l+a-a'; b {3 {3' c ,} y y' (C-b)(Z-a» ) (c-a)(z-b) (a - a',{3 - {3', y - y' * integer). We have 24 representations of the above function by interchanging the parameters a, b, c; a, a'; {3, {3'; y, y' in the right-hand side. (4) Barnes's Extended Hypergeometric Function. 00 (a[)n...(ap)n zn pFq(at. ...,<xp;{3t. ...,{3q;z)= L I' n=O ({3[)n...({3q)n n. where (a)n=a(a+l)...(a+n-l) =f(a+n)jf(a). F(a,b; c; z)=2F[(a,b; c; z). oFo(x) = eX, [Fo(a; x)=(I- x)-a. (5) Appe1!'s Hypergeometric Functions of Two Variables. F ( . {3 {3 '.. ) -   (a)m+n({3)m({3')n m n [a, , ,y,x,y - "'" "'" , , ( ) x y , m=O n=O m.n. Y m+n F ( . {3 {3 " '. ) -   (a)m+n({3)m({3')n m n 2 a, , , y, y ,x ,y - "'" "'" " ( ) ( ' ) x y, m=On=O m.n. y mY n F ( '. {3 {3 '" ) -   (a)m(a')n({3)m({3')n m n 3 a, a, , , y, x,y - "'" "'" x y , m=O n=O m!n!(y)m+n F ( . {3 . '. ) -   (a)m+n({3)m+n m n 4 a, ,y,y ,x,y - "'" "'" " ( ) ( ' ) x y . m=On=O m.n. y mY n (6) Representation of Various Special Functions by Hypergeometric Functions. (l-xY=F(-I',b;b;x), e-nx=( SehX )n(tanhX)F(l+I' l;n ; l+n; seCh 2 x). 1 1 + x _ ( 1 . 3. 2 ) log(l+x)=xF(I,1;2; -x), 2"log 1 _ x -xF 2",l'2"'x , . _ ( . F ( I+n I-n.3"2 ) smnx-n smx) ''2,smx, c osnx=F ( !! _!!. .!.'sin2v ) = ( COSX ) F ( 1+n I-n . .!.'sin2x ) 2' 2' 2'"" 2 ' 2 ' 2' , . _ F( 1 1. 3. 2 ) arc sm x - x 2" ' 2"' 2"' x , arctanx=xF(i,l; t; _X2). (2n-1)!! ( 1 1 ) P 2n (x)=( -lr (2n)!! F - n,n + 2"; 2"; x 2 , _ n(2n+1)!! ( 3.3. 2 ) P 2n +[(x)-(-I) (2n)!! xF -n,n+I' 2'x (spherical function), 
App. A, Table 18.11 Hypergeometric and Spherical Functions 1792 where ,,_ { m(m-2)...4'2 (m even), n=0,1,2,...; m..- 0!!=(-1)!!=1. m(m-2)...3.1 (m odd), K(X)=IFG,; 1; X2). E(X)=IF( -,; 1; X2) (complete elliptic integral). Jv(x) - x o F 1 ( V+l; _ X2 ) _ xVe-ix 1F1 ( V+- 2 1;2V+l;2iX ) . 2r(v+ 1) 4 2 v r(v+ 1) e X = \im F(a,b;a;x/b)=IFI(a;a;x)=oFo(x). 6->00 (II) Legendre Function (393 Spherical Functions) (I) Legendre Functions. The generalized spherical function corresponding to the rotatJon group of 3-dimensional space is the solution of the following differential equation. 2 ) d2u du [ ( f1-2 ] (I-z --2z-+ 1'1'+1)-- u=O. dz 2 dz l-z2 When f1- = 0, the equation is Legendre's differential equation, and the fundamental system of solutions is given by the following two kind of functions. Legendre function of the first kind \3.(z)= P.(Z)=2FI( - 1', I' + I; 1; I; z ). Legendre function of the second kind r(1'+1)v7T _ _ I ( 1'+2 Gz= z. F- .( )- 2v+ l r[I'+(3/2)] 2 I 2 ' Q.(x)= 1[G.(X + iO) +G.(x - iO)] (COSI'7T)P. (x) - p. (- x) =7T 2 sin 1'7T 1'+1 3 --y-; 1'+2; z12 ). (l'oFinteger; -1<x<I). Recurrence formulas: '13V(z) = '13-v-l (z). .Q.(z)-.Q-V-1 (z) = n(cot vn)'13v(z) (v # integer). '13v( -z)=e Ivni '13v(z)-(2/n)(sinvn).Qv(z), .Q.( -z)= _eIvni.Qv(z) (z2 - I )d.(z)/ dz = (I' + 1)[.+ I(Z) - z.(z)], (21' + I)z.(z) = (I' + I).+ I(z) + 1'._I(z), (z2-I)dG.(z)/dz=(I'+ 1)[G.+I(z)-zG.(z)], (21' + l)zG.(z) = (I' + 1 )G.+ I(z) + I'G._ I(Z). _ _ r(I'+I) __ ( I' 1'+1 3 ffi ( Z ) =7T 1/22-. I tan l'7T z. IF -+1 , -'1'+ '1-'. r[I'+(3/2)] 2 I 2 2' 2; _1/2J[I'+(1/2)]. ( I-I' -1'.1_. I ) +7T 2 ) Z 2FI _ 2 '- 2 '- 2 1', r(1'+1 z2 . sin 1'7T  ( I I ) p.(cos(J) = - "'"' (-If - - I Pn(cos(J) 7T n=O I'-n I'+n+ (:t =sgn(Imz)). zl2 ) (l'oFinteger; 0.;; (J<7T). p.(l)= I, 2 IP.(cos(J)I.;; , V I'1T sin (J P ( 0 ) = - sin 1'7T r ( I' + I ) r (  ) · 27T 3 / 2 2 2' I ( 1'+1 ) ( -I' ) Q.(O) = -(l-COSI'7T)r --y- r """2 . 4v7T v7T IQ.(cos(J)I.;; ----;-- V I' smiJ (0<(J<7T; 1'> I). Estimation: (2) The Case I' = n (= 0, 1,2, . . .). In the following, the symbol !! means m!!= { m(m-2)...4'2 m(m-2)...5.3.1 (m even), (m odd). 
1793 App. A, Table 18.11 Hypergeometric and Spherical Functions The function P" is a polynomial of degree n (Legendre polynomial) and is represented as follows: P ( ) - 1 d" 2 1) " " z = _ 2 " , - d ,,(z - n. z = (2n)! Z"2 F l ( -!!.' I-n . - 2 1 - n ; Z 1 2 ) 2"(n!)2 2 2' (2n-1)!! [ n(n-1) n(n-1)(n-2)(n-3) ] = z"- z"-2+ Z"-4::;: n! (2n-1) 2'4.(2n-1)(2n-3) ... .  m . (2m+2j-1)!! 2 P ( z ) = ( -I ) m-) Z'l 2m. (2 . ) ' (2 - 2 ' ) " ' }=o J. m J..  m . (2m+2j+1)!! 2 1 P ( z ) = ( -l)m-) z'l+ 2m+l j=O (2j+ 1)!(2m-2j)!! . (2n)! - 8 ( 1 1 + 2 ' 8) P (cosO) = e+ ln F - -n' - -n' e- I " 2 2 "(n !)2 2 1 2' '2 ' 2(2n-I)!! [ 1 n 1.3 n(n-I) = cosnO+ _ 1 ( ) cos(n-2)0+ _ 1 2 )( ) cos(n-4)0 (2n)!! 2n-l . (2n-l 2n-3 1.3.5 n(n-l)(n-2) ( ] + T"2-3 (2n -1)(2n _ 3)(2n _ 5) cos n - 6)0 + ... j[ (n-,)!! ] 2 (n even), + n.. o (n odd). 4 (2n)!! [ 1.(n+1) = - (2 ) sin(n+ 1)0+ ( sin(n+3)B 17 n+l!! 1.2n+3) 1.3.(n+l)(n+2) ] + ( )( sin(n +5)0+ ...(ad infinitum) 1.2.2n+3 2n+5) Laplace-Mehler integral representation P,,(cosO) = 1. (7T(COSO+ isinBcosq;)"dq; 17 J o (0<0< 17). = V2 (9 Cos[n+(1/2)]q; dq;= V2 ( 7T sin[n+(l/2)]q; dq;. 17 J o Y cosq;-cosB 17 J(} Y cosO-cosq; P (x)== (-1)" r"+l ( l ) ( x=  r ' r= v'z2+p2 ) . " n! az" r P,,(1) = 1, P,,(-l)=(-I)", P 2 ,,+ 1(0) =0, (2n)! (-I)"(2n-l)!! P 2 ,,(0)=(-I)" 2 2 "(n!)2 = (2n)!! Recurrence formulas: nP,,(z) -(2n -l)zP,,_ JCz) + (n - I)P"_2(z) == 0, 2 dP" n(n+ I) (z -1)I.iZ=n(zP,,-P,,-I)= 2n+l (P"+I-P,,-I)=(n+l)(P"+l- ZP n). _ 1 d n [ ( 2 ) " Z + 1 ] 1 z + 1 G,,(z)- 2"n! dz" z -1 log z_1 -I P ,,(z)log z _ 1 00 00 (dz)"+ 1 ==2"n! f. ... f. ,,+1 Z Z (z2 - 1) f oo (t-z)" == 2" dt Z (t 2 _1)"+1 ==(-1)" 1 d" [ (Z2-1)" f oo dt ] (Rez> 1). (2n-1)!! dz" Z (t2_1)"+1 
App. A, Table 18.11 Hypergeometric and Spherical Functions 1794 2'(2n)!! [ 1'(n+l) Qn(cosO)= (2n+1)!! cos(n+l)O+ 1.(2n+3) cos(n+3)0 1.3'(n + 1)(n+2) + ( ) ) cos(n +5)0+ ...J (0< 0< 'IT). 1.2. 2n+3 (2n+5 1 d n [ 2 n I + x ] I I + x Qn(X)= 2nn! dxn (x -1) log 1- x -"2 P n (x)log 1- x 1 l+x  1 = "2 Pn(x)log 1- x -  -; lj-I(X)Pn-J<x). j=1 ) I I+x x I+x Q 1 2 I+x 3 Qo(x) = I log I-x ' QI(x)="2 10g l_x - l , i X )=4"(3x -1)log l_x -"2 x . (3) Generating Functions. 1 Y 1 - 2hz + h 2 00 L hnP n (z) (Ihl < minlz:t V?=T I), n=O 00 L hnl+I Pn(Z) (lhl>maxlz:tv?=TI). n=O (If -1..;; z";; 1, the right-hand side is equal to 1.) 1 00 z-t = L (2n+1)\I3n(t)Gn<z) (It+Y t 2 - 1 1<lz+V?=TI). n=O 1 log Z-t+YI-2tz+z 2 =  tnG (z) Y I-2tz+z 2 v'7=T n=O n (Rez> I, It I < 1). r= y x 2 +y2+ Z 2, cosO=z/r, x,yreal, 1 1 00 [ 1 ] -+-+ L + r 2 m=1 y( 2mi'IT+z)2+x2+y2 y( 2mi'IT-z) 2 + x 2 + y 2 (Here the square root of a complex number is taken so that its real part is positive.) / 1+  e-nzJo(ny x2+y2 ) n=1 = 1 1  (-I)"B2n 2n_1 ( -;:- +"2 + "" ( 2n ) ! r P 2n - 1 cosO) n=1 (Rez > 0), (0 < 0 < 2?T; z real). (4) Integrals of Legendre Polynomials. . f +1 2 Orthogonal relatIOns: Pn(z)Pm(z)dz = 8 nm - 2 1 . -I n+ f +1 zkPn(z)dz=O -I { A(A -2)... (A - n + 2) 1 1 h ( ) (A + n + 1 )(A + n - 1) . . . (A + I) z Pn z dz = o (A-l)(A-3)...(A-n+2) (A+n + 1) (A + n-1)...(A+2) (k=O, 1, ...,n-l). (n even), (n odd) (ReA> - 1). f 2(m-n+l)(m-n+3)...(m+n-l) {TPn(Coso)sinmodo=lo (m - n)(m- n +2).. .(m+ n) (m>n; m+n is odd), (otherwise ). (5) Conical Function (Kegelfunktion). This is the Legendre function corresponding to the case I' = - (1 /2) + iA (A is a real parameter), _ 4;\ 2 + 1 2 . 2 (J (4A 2 + 12)( 4A 2 + 3 2 ) . 4 (J P _(1/2)+i>.(COSO) -I + 2 Stn _ 2 + 2 2 Stn _ 2 + .... 2 2 .4 P _(1/2)+I>.(X);: P -(1/2)-i>.(X). 
1795 App. A, Table 18.I11 Hypergeometric and Spherical Functions (III) Associated Legendre Functions (393 Spherical Functions) (1) Associated Legendre Functions. The fundamental system of solutions of the differential equation in (II) (I) is given by the following two kind of functions when 1-''f'0. Associated Legendre function of the first kind: I ( z + I ) ",/2 ( I - z ) t(z)==r(1_I-') z-I 2EI -1',1'+1; I-I-'; Z- ' where we take the branch satisfying arg[(z + 1)/(z-1)]"'/2 =0 for z> 1 in the expression raised to the (J.I/2)th power. Associated Legendre function of the second kind: '" _e i "''' r(v+J.I+l)fi 2 ",/2 -v-",-1 ( V+J.I+2 v+J.I+l. 3.1 ) tlv(z)-2V+1 r[v + (3/2)] (z -1) z 2 F 1 2 ' 2 ,v+2:' Z2 ' where we take the branch satisfying arg(z2 _1)"'/ 2 =0 for z> 1 in (Z2 _1)"1 2 , and argz- v -".-1 =0 for z>O in z-v-".-r, respectively. p!' (x):= ei","/2t(x + iO) = ei","/2t(x- iO) I ( I + x ) ",/2 ( I - x ) == r(1-I-') I-x 2EI -1',1'+1; I-I-';--r- Q!, (x):= e-il'''[ e- i ",,,/20t(x+ iO) + e i ",,,/20t(x - iO) ]/2 17 [ r(I'+I-'+1) _ ] == _ 2 . P!' (X)COSl-'l7- r( ) p. I' (x) sml-'l7 1'-1-'+ I (- I " x " 1). ( - I " x " 1). Integral representations: (z2_1)"'/2 i + 1 (1_t2)"'-1/2 -I'(z)== dt · 21-'Y1Tr[/-t+(1/2)] -I (z+tv?="T)"'-. (ReI-'> - i, larg(z:t 1)1 < 17). _ (z2_1)"'-2 £ 00 (sinht)2.+1 I' z = dt . () 2.[( 1-'-I')r(l'+ 1) 0 (z+cosht)"'H+1 (Rez> -I, larg(z:t 1)1 < 17, ReI'> -I, Re(I-'-I'»O). - ( ) Vf r[ 1-'+(I/2)](z2-I)",-2 f oo cosh[I'+(1/2)]t I'z= - dt · 17 [(1-'+1'+1)r(I-'-I') (z+cosht)",+(l / 2) o (Rez> -I, larg(z:t 1)1 < 17, Re(l-'+ 1'» -I, Re(I-'-I'»O). - Vf (sinha)'" £ "'COSh[{I'+(I/2)}t]dt ( I ) t(cosha)- - 1'+(1/2) a>0,Re/-t<- 2 ' 17 r[ -1-'+(1/2)] (cosha-cosht) o - Vf (sinO)'" £ (J cos[ {1'+(1/2)}cp ]dcp P!' (cosO)- - ",+0/2) 17 r[ -1-'+(1/2)] (coscp-cosO) o -"' ( ) _ r(21-'+1)2-"'(sinO)'" 1 00 tl'+.dt p. coso - ) +(1/2) r(I-'+I r(I-'+I'+1)r(I-'-I') 0 (1+2tcosO+t 2 )'" (Re(I-'+I'» -I, Re(I-'-I'»O). p.-"'(cosO)= ( I ) (ooe-1cos(JJ",(tsinO)t.dt ( 0<O<- 2 17 ,Re(I-'+I'»-I ) . r I' + I-' + I }o Ol'(z) = ell''' r(1' + I-' + 1) (z2- 1) "'/2 j + 1(1_ t)"(z -1) -.-",-1 dt · 2.+1 r(I'+1) -I (Re(1' + 1-') > -I, ReI' > -I, larg(z:t 1)1 < 17). (O<O<l7, Re l-'<i). Vf ei",,, (sinha)'" f oo e-[.+(1/2)ltdt Ot(cosha) = "2 +0/2) r[ -I-' + (1/2)] '" (cosht - cosha)'" ( a > 0, Re I-' < 1/2, Re( I' + 1-') > - I). 
App. A, Table 18.III Hypergeometric and Spherical Functions 1796 Recurrence formulas: (Z2 - I )d\T3 (z)/ dz = (v - /L + I )\T3+ I(Z) - (v + I )z\T3(z), (2v+ l)z\T3(z)=(v- /L+ 1)\T3+I(z)+(v+ /L)\T3-I(Z), \T3._I(Z) = \T3(z), . r(v-/L+I) o -I' ( z ) = e-21"x O "(z ) · r(v+/L+I) · , (1- x 2 )dP.f' (x)/ dx= (v + l)xP.f'(x)-(v - /L + I)P,!'+I(x). The case when /L is an integer m( m = 0, 1,2, ...) and v is also an integer n: P n m + 2 (x) + 2(m + l)x(l- x2)-1/2pnm+ I(X)+ (n - m)(n + m + I)Pnm(x)=O. (2n + l)xPnm(x) -(n - m + I)Pn': I(X) - (n + m)Pn I(X)=O (0";; m..;; n - 2), (x 2 -I)dP n m (x)/ dx -(n - m + I)Pn': (x) + (n + l)xPnm(x) = 0, Pnl(x)- P n ':l(x)=(2n+ I) Pnm-I(x). f(v-/L+I) [ 2. ] \T3;I'(z)= f(v+ /L+ I) \T3(z)-7ie-ll"rr (sin/L'1T)O(z) , Ot(z)sin[(v + /L)'1T] - O ._I(z)sin[(v - /L)'1T] = '1T e i l''' (cos V'1T)\T3t (z), W( - z) = e +i""\T3(Z) - (2/ '1T)[sin(v + /L)'1T]e- i l'''O.(z) O( - z)= - eZI""O(z) (:!:: =sgn(lmz). . '1Tf(I+/L+v) e-'I'''O(cosha)= \T3==lWcotha) V 2'1Tsinha (+: = -sgn(lmz)), (Re cosh a > 0). e-il'''Ot(x:!:: iO)= e zi l'''/2[ Qt(x) +: (i'1T /2)P.f'(x)]. sin(v + /L)'1T '1TCOS/l'lTcos/L'1T Q._I(X)= . ( ) Q.f'(x)- . ( ) P.f'(x), stn v - /L '1T SIn V - /L '1T p.-I'(x)= fV- /L+ I) [ COS/L'1TP.f' (x)-lsin/L'1TQ: (x) ] , f v+/L+l) '1T P.f'( - x)= [cos(v + /L)'1T]P.f'(x) - (2/ '1T)[sin(v + /L)'1T] Q.f' (x), Q.f'( - x)= - [cos(v + /L)'1T] Q.f' (x) + ('1T /2)[sin(v + /L)'1T]P,!'(x). f(I+v+m)(z2_I)m/2 ( I-Z ) dn1\T3'(z) \T3:,"(z) = 2EI m- v, m+ v+ I; m+ I; - =(z2_I)m/2_---",-, f(I+v-m)m!2 m 2 dz \T3;m(z) = (z2-1) -m/2 f... f P.(z)(dz)m. dmO.(z) f. f. 00 O:,"(z)=(z2-I)m/2 dxm ' o;m(z)=(_I)m(z2-I)-m/2 zoo... z O.(z)(dz)m. f(I+v+m)(I-x 2 t/ 2 ( I-X ) pm(x)=( -I)m 2EI m- v, m+ v+ I; m+ I; _ 2 . f(I + v- m)m!2 m dmp (x) =( - I)m(l- x2)m/2 d;m , f l f l f(v-m+I) p.- m (x)=(I_x 2 )-m/2 ... P.(x)(dx)m=(-I)m ( ) p.m(x). x x fv+m+1 dmQ (x) f(v-m+l) Q.m(x)=(-I)m(l_x 2 )m/2 d;m ' Q.-m(x)=(_I)m f(v+m+I) Q.m(x). The values at the origin are pl'(O) = V7i21' , · f[(v-/L)/2+I]f[(-v-/L+I)/2] 
1797 App. A, Table 18.II1 Hypergeometric and Spherical Functions dP: (0) 21'+l s in['1T(v+ fL)/2]r[(v+ fL+2)/2] dX - r[(v- fL+ I)/Z]Y1T -1,C . ( v+p, ) r[(v+fL+I)/2] QI'(O)= -21' V'1T sm -'1T . 2 r[(rp,+2)/2]' dQ:'(O) =2I'y; c os ( v+P, .rr ) r[(v+fL+2)/2] . dx 2 r[(v- p,+ 1)/2] V7i21'+1 r[(v- p,+ 1)/2]r[( - v- fL)/2] , (2) Generating Functions. n , (cosO + isinOsincpt= Pn(cosO)+2  (- i)m ( n. )' (cosmcp)P::' (cosO). m=1 n+m . _ sin V'1T  ( I I ) - ( n p. l'(cosO)= -  (lt - - 1 Pn I' cosu) '1T n=O v-n v+n+ (0<0<'1T, p,;;.0). (3) Orthogonal Relations. f +1 2 (n+m)! Pn m (x)P;::,(x)dx=- 2 1 8nn" -I n+ (n-m)! (21T (1T '(' , 4'1T (n+m)! J o dcp J o sinOe:!:1 m-m}<PP n m (cosO)Pn":' (cosO)dO= 2n+ I (n m)! 8 nn ,8 mm ,. (4) Addition Theorems. dO .(z-Y z2-1 Vf2=l coscp)=.(z)\I3.(n+2  (_I)m:(z);m(ncosmcp m=1 (Rez>O, Re>O, larg(z-I)I<'1T, larg(-I)I<'1T). 00 0.(tt'-Y t 2 -1 Y t'2-1 coscp)=0.(t).(t')+2  (-I)mO:'(t);m(t')cosmcp m=l (t,t' real, 1 < I' < t, v * negative integer, cp real). 00 p.(cosOcosO' + sinO sinO' coscp) = p.(cosO)P.(cosO') + 2  (_I)mp.- m (cosO')P.m(cosO')cosmcp m=1 00 r(v-m+l) =p.(cosO)p.(cosO')+2  ) p. m (cosO)p.m(cosO')cosmcp m=l r(v+m+ 1 (0<;0<'1T, 0<;0'<'1T, O+O'<'TT, cpreal). 00 Q.(cosOcosO' + sinO sinO' coscp) = p.(cosO')Q.(cosO)+ 2  (_I)mp.- m (cos 0 ')Q.m (cosO)cosmcp m=l (0<0'<'1T/2, O<O<'TT, 0+0'<'1T, rpreal). One 1'1" + v?+T Y 1',2 + 1 cosh a) =  -  O( iT )O( iT')e - ma m=n+1 (m-n-I !(m+n)! (1', 1", a >0). (5) Asymptotic Expansions. ( ) [ 2'r[v+(1/2)] 2-.- l r[-v-(1/2)] - _ 1][ ( - 2)J  z = _ z' + _ z' I + 0 z Y'1Tr(v-p,+I) Y'TTr(-p,-v) (v+(1/2)*integer, largzl<'1T, IzlI). Ol'(z)= Viieil''' r(v+p,+I) z-'-I[I+O(z-2)J · 2.+1 r[v+(3/2)] (v + (1 /2) * negative integer, largzl < '1T, Izi  I). 2 r(v+p,+I) cos[{v+(1/2)}0-('1T/4)+(p,'1T/2)J P:'(cosO)=-----=- [ [1+O(p-1)J Y'1T r v+(3/2)] Y2sinO (e<;O<;'1T-e, e>O, IvlI/E). 
App. A, Table 18.III Hypergeometric and Spherical Functions 1798 ( ) 2r(1' + It + 1) P" cosO = · V1Tr[I'+(3/2)] 1 ) 1 3 ) [( 2/+1 ) (2/+1)'77 It'77 ] 00 r(2+1t+1 r(2-It+/)r(I'+2 cos I'+----z- 0- 4 +T X L e ) e ) ( 3 ) . /+(1/2) , /-0 r 2+ 1t r 2- 1t r 1'+1+2 /! (2smO) r(l'+ It+ 1) Qf (cosO) = V1T [ r 1'+(3/2)] 1 ) 1 3 [( 2/+1 ) (2/+1)'77 It'77 ] 00 /r(2+1t+1 r(2-It+/)r(I'+2) cos I'+----z- O+-+T X L ( - 1) e ) e ) ( 3) . /+(1/2) . 1-0 r 2+ 1t r 2- 1t r 1'+1+2 /! (2smO) (In the final two formulas the series converges when I' + It =F negative integer, I' + (1 /2) =F negative integer, '77/6 < 0 < 5'77 /6.) [( 1 ) 0 ] " O [ J"+I(l'j) 1'+ 2 cos 2 p.-"(coSO)=J,,(1'j)+sin 2 2 21'1 J,,+2 (1'1) + J"+3 (1'1)] + o( sin4) (1'1=(21'+ I)sin(0/2». (6) Estimation. When 1';;' 1, I' - It + 1 > 0, It;;' 0, + r(I':tIt+ I ) ( 8 ) 1/2 1 IP.-" (cosO)1 < ( ) --;- 0 -----,., r 1'+ 1 I"ITsm (sinO) + r(I':t It +1) ( 2'77 ) 1/2 1 IQ.-" (cosO)1 < r(l'+ 1) I'sinO (sinO)'" + r(l':t m + 1) ( 4 ) 1/2 1 I P_m ( cosO )I < - · r(l'+ 1) 1''77 sinO (sinO)m' + r(l':tm+I) ( 4 ) 1/2 1 IQ _m ( cosO )I ..;; - · r(I'+I) I'sinO (sinO)m' (7) Torus Functions. These are solutions of the differential equation d du ( 21m 2 ) -+cothl'j-- n --+- u=O. dq2 dq 4 sinh21'j The fundamental system of solutions is given by \!3:;'_(1/2)(coshl'j), O::'_(1/2)(coshl'j). The asymptotic expansion when m = 0 is \!3n-(1/2)( cosh 1'1) (n _I)!en-(1/2) [ 2r2[n + (1/2)] -2"'1 ( 1 1 -2  ] = _ (log 4 + l'j)e 2FI - 2 ,n+ 2 ;n+I;e I+A+B. r[n+(1/2)]v''77 '77n!(n-l)! ) Here (1/2)[n-(1/2)] -2 (1/2)(3/2)[n-(1/2)][n-(3/2)]_4 A=I+ e + e +... I.(n-1) I.2'(n-I)(n-2) (2n-3)!!(2n-I)!! 2 ( I ) + 2 e- n- , [(2n-2)!!] r[n+(1/2)]  r [/+(1/2)]r[n+I+(1/2)] ( -2(/+n) B= 3/2( -1)' £.. (+/)'1' un+/+u/-V/_(1/2)-Vn+/_(1/2»e, '77 n . 1=1 n. . where 1 1 2 2 2 2 u r =I+ 2 +...+" V r -(1/2)=T+3"+s+"'+ 2r-I = 2U 2r- u r' References See references to Table 16, this Appendix. 
1799 App. A, Table 19 Confluent Functions, Bessel Functions 19. Functions of Confluent Type and Bessel Functions (I) Hypergeometric Function of Confluent Type (167 Functions of Confluent Type) (1) Kummer Functions. 00 f(a+n) f(c) zn v(z)=IF1(a; c; z)=no f(a) f(c+n) liT f(c) I -c l z t a-I ( ) c-a-I d = z el z-I I f(a)f(c-a) 0 = f(c)2 1 - C eZ/2 j +lezt/20_0c-a-lo+oa-ldl f(a)f(c-a) -\ (O<Rea<Rec) (0< Rea < Rec). The fundamental system of solutions of the confluent hypergeometric differential equation (Kummer's differential equation) d 2 v dv z- +(c-z)--av=O dz 2 dz ' when c'f'O, -1, -2,..., is given by t\(z)=IF1(a;c;z), V2(Z)=zl- C 1 F 1 (a-c+1; 2-c; z). dIFl(a; c; z)/dz=(a/c)IFI(a + l;c+ l;z), IFI(a;c;z)=ezIFI(c-a;c; -z), a1F1(a + 1; c+ 1; z)=(a - c)IFl(a; c + 1; z)+ cIF1(a; c;z), a1FI(a + 1; c;z)=(z + 2a - c)IFI(a; c;z)+(c - a)IFI(a -1; c; z). Putting (a)n=a(a+ 1)...(a+n-1)=f(a+n)/f(a) we have 1 zn+l(a) lim - ( ) IFI (a;c;z)= ( )n+I IFI (a+n+ l;n+2;z) c--+ - n f c n + 1 ! (n=O, 1,2, ...). Asymptotic expansion: o -a  (a).(a-c+1). ( ) -. z a-c  (c-a).(1-a). . vIAIZ..:., , -z +Blez..:., , z, .=0 n. .=0 n. o _ L oo (a).(a-c+l). ( ) -. _ L oo (c-a).(1-a)n V A z a - z + B ezz a c Z' 2 2 n' 2 n' .=0. .=0. (lzl»lal, IZI» lei, - 317 /2 < argz < 17 /2, c 'f'integer) where Al = e-i"af(c)/f(c- a), A 2 = e- i ,,(a-c+l)f(2_ c)/f(1- a), BI=f(c)/f(a), B 2 =f(2 - c) /f(a - c + 1). (2) The fundamental system of solutions at z = 0 of the hypergeometric differential equation of confluent type d2u + du + [ !+ (1/4)_,...2 ] u=o dz 2 dz Z z2 is given by z(l/2):tl'e- z IFI[(1 /2):t,... - K; :t 2,... + 1; z]. (II) Whittaker Functions (167 Functions of Confluent Type) (1) A pair of linearly independent solutions of Whittaker's differential equation d 2 W [ I K (1/4)_,...2 ] -+ --+-+ W=O dz 2 4 Z z2 
App. A, Table 19.11 Confluent Functions, Bessel Functions 1800 is given by M..I#(z)=zI#+(1/2)e-z/21F1[J:::P-K+(lj2); i::2p+ l;z]. Whittaker functions: f( - 2p.) f(2p.) W'I'(z)= [ ] M.#(z)+ [ ] M._I'(Z)=W._I'(z). , f O/2)-P.-K' f (1/2)+P.-K' , When 2p. is an integer, the above definition of W.,/z) loses meaning, but by taking the limit with respect to p. we can define it in terms of the fol1owing integrals. #+(1/1) -z/1 1 00 ( 1 / 2 ) W. # (z)= z e e- Z7 'T1'-'-(If2)(1 +1')#+'- d'T , f[p.+(1/2)-K] 0 . -z/2 ( ) 1'+.-(1/2) = ze (oot#-.-(1/2)e- t I+i dt f[/-C-+(1/2)-K]}O z (Re[p+(lj2)-KJ >0, largzl <n). ( ) _ e-z/2 1 + ioo f(s- K)f[ -s- p.+(1/2)]f[ -s+ p.+(1/2)] s W. # Z - -y-:- [ ] [ ] z ds. , 'lT1. f - K+ p.+O/2) f - K- p.+(1/2) -'00 M'+#+(1/2),#(Z)=( -1)'zl'+(I/2)e- Z / 2 (2p.+ 1)I1F. (-1;2p.+ I;z) (/=0,1,2, ...). M.,I' (z) = e -i"r#+(1/2)]M _"1' (ei"z). M..I'(z) = f(2p.+1) ei"'W_'I'(ei"z)+ [f(2P.+I) ] ei"I,-#-(I/2)lw.l'(z) f[p.+(1/2)-K] , f p.+(1/2)+K ' (-3'IT/2<argz< 'IT /2, 2p.* -I, -2, ...). f ( 2 11 +I ) . . f ( 211+I ) . M (z)= r- e-''''W_ (e-"'z)+ r- e-,,,r'-I'-(1/2)]W. (z) '.1' f[p.+(1/2)-K] ',# f[p.+(1/2)+K] ,I' (-'IT/2<argz<3'IT/2, 2p.*-1,-2,...). W.,# (z) = ZI/2W._(I/2).#_(If2)(z) + [(1/2) - K + p.]W._ 1 .1' (z) = ZI/2W._(I/2),#+(1/2)(Z) + [(1/2) - K - p.] W._ 1 .1' (z). zdW.,1' (z) / dz = [K - (z /2)]W.,# (z) - [ 11.1- {K - (1/2)} 2] W'_ I 'I' (z). When K is sufficiently large we have M.,# (z )_'IT- 1 / 2 f(2p.+ 1)K-I'-(1/4)ZI/4COS [2(ZK)I/2 - p.'IT - ('IT /4)], W.,# (z)- - (4z / K)I/4 exp( - K + K logK)sin [2(ZK)I/2 - 'lTK - ('IT / 4) J. W _',1' (z )-(z / 4K)I/4 exp(K - K logK - 2(ZK)I/2). Asymptotic expansion: W"I'(z):::::;e- z / 2 z' (  [p2 _ {K-(lj2WJ [p2 - {K-(3j2WJ... [p2 - {K-n+(l j 2 WJ ) x1+L.. 1 n . n=1 n.z (2) Representation of Various Special Functions by Whittaker Functions. (i) Probability integral (error function) erf x=lI>(x)=  (X e- t2 dt y; }o -1_ 'lT -1/2 x -1/2 e -x2/2W ( X2 ) - -1/4.1/4 2x ( I 3 ) 2 ( x3 x 5 X 7 ) = y; IFI 2;2; -x 2 = y; x-Tf3+ 2!5 - 3!7 :::':::.... Asymptotic expansion: y; [ ] e-X2 ( I 1.3 1.3,5 ) _ 2 1-II>(x) :::::;- 2 I-+-:::':::.... x 2x (2x 2 ) (2X2) Ii ll>(x I;i y; )=C(X)-iS(x), 
1801 App. A, Table 19.I1I Confluent Functions, Bessel Functions where C (x), S (x) are the following Fresnel integrals. (X 'iT 2 1 1. 'iT 2 ( 1 ) C(x)== J o cos"2 t dt="2+ 'iTX sm "2 x +0 x 2 ' S(x)== fo\in'It2dt=-}- 'iT COS'Ix2+0( :2 )' (ii) Logarithmic integral . (Z dt Llz= J o logt (When z> 1, take Cauchy's principal value at t = 1.) = - (log 1/ z)-1/2 z 1/2W -1/2,O( -Iogz). Liz is sometimes written as liz. (3) Exponential Integral I Eix== f x !!- dt (When x> 0, take the Cauchy's principal value at t = 0 while integrating.) _ 00 t 00 n =C+loglxl+ L  (x real, *0) n=1 n.n. N (n-I)! 00 t n - N ( I I ) =e X L -+N! L - 1+-+..' +- +C+loglxl. n=1 t n n=on!(n-N) 2 N n"i=N Cosine integral Cix== - f oo cost dt= C+logx- rx I-cost dt. X t J o t S . l x sint d IX= - t - 0 t ' Sine integral Asymptotic expansion . f 00 sin t d S . 'iT SIX == - x t t = IX - "2' . ( 1 1! 2! 3! ) Eiix=Cix+isixe'x -++++... . ix (ix) (ix) (ix) (III) Bessel Functions (39 Bessel Functions) (1) Cylindrical Functions. A cylindrical function 2, is a solution of Bessel's differential equation d 2 2, I d2, ( 1'2 ) -+--+ 1-- 2,=0. dz 2 Z dz Z2 Recurrence formulas: 2 p _ 1 (z)+2 p + 1 (z)=(21'/z)2 p (z), 2 p _ 1 (z) - 2 p + 1 (z) =2d2 p (z)/ dz. !zP+12 p (z)dz=zp+12 p + 1 (z), !z- P 2 p + 1 (z)dz= -z- P 2 p (z). As special solutions, we have the fol1owing three kinds of functions. (i) Bessel function (Bessel function of the first kind). J (z)= ( !.. ) P (_1)1 ( !.. ) 2/= Mo,p(2iz) (iargzl<'iT). p - 2 1=0 l!f(1'+ 1+ 1) 2 (2iz)I/222PiPf(1'+ I) J p (e lm7T Z) = elm""J p (z). J _n(z)=( -1)"Jn(z). J ( ) ' /2 n+(1/2) ( 1 d ) n ( Sinz ) n+(1/2)Z=y;z -z dz Z- (n=0,1,2,...). (ii) Neumann function (Bessel function of the second kind). I N p (z) == ----;-- [ (COS1''iT)Jp (z) - J _p (z) ] (1'* integer; largzl < 'iT), sm1''iT 2 ( Z ) I ( Z ) n 00 (-1)1 ( Z ) 21 Nn(z)==-Jn(z) C+log- 2 - - _ 2 L ) _ 2 [1p(l)+Ip(l+n)] 'iT 'iT 1=0 1!(n+1 ! _ 1- ( z ) - n n - 1 (n - 1- I)! ( z ) 21 ( _ 1 1 ) 'iT"2 L l! "2 Ip(l)= L m ' 1=0 m=1 
App. A, Table 19.m Confluent Functions, Bessel Functions 1802 N_n(z)==(-I)nNn(z) (n=0,1,2,oo.; largzl<'7T). N. (eim"z) = e -im""N. (z) + 2i(sinml"ITcotV'1T )J. (z). N n +(1/2)(Z) = (- I f+1J -[n+(1/2)](Z). (iii) Hankel function (Bessel function of the third kind). HS1)(z) == J. (z) + iN. (z), flS2)(Z)=J. (z) - iN. (z). flSl)(iz/2) = -2ie- i ",,/2 ('I7Z)-1/2 WO ..(Z). HIHz)=ei""HJl)(z), H2Hz)=e-i""flS2)(z). flS2)(X) =HP)(x) (x,vreal). (2) Integral Representation. Hansen-Bessel formula J (z)=  f +" eizeoS'ein['-("/2)]dt n 2'17_" i- n 1 " . = - e,zeos'cosntdt '7T 0 Mehler's formula 1 1 " . = - cos(z smt - nt)dt '7T 0 ) 2 1 00. Jo(x = - sm(xcosht)dt, '7T 0 2 1 00 No(x) = - - cos(xcosht)dt '7T 0 2(z/2)' 1 ,,/2 J.(z)= _ cos(zcost)sin 2 'tdt V'7T r[p+(1/2)] 0 L(z/2)' [1 "/2 1 00, ] N.(z) = sin(z sint)cos 2 . t dt - e -zsmh' cosh 2 . tdt y;;; r[p+(1/2)] 0 0 (n = 0, 1,2, .. .). (x> 0). Poisson's formula ( ReI' '> -1 ) , 2' Schliif1i's formula (Rez > 0, Rf:p> - 1/2). (Rez > 0), 1 1 '" sinp'7T 1 00 'nh J.(z)= - cos(zsmt-pt)dt- - e- ZS1 'e-"'dt '7T 0 '7T 0 1 1 " .. I 1 00 - . h - N.(z) = - sm(zsmt-pt)dt- - e zsm '[e"'+(cosp'7T)e "]dt (Rez>O). '1T 0 '7T 0 Z' f C+iOO [ I ( Z2 )] J.(z)= 2'7Ti c-ioo exp 2" t- t t-'-1dt (c>O, largzl < '7T, Rep>-l). 2(x/2)-' f oo sinxt J x = dt .() v:;T r[(l /2) - vJ I (t 2 _ 1)'+(1/2) , N x = _ 2(x/2)-' (00 cosxt dt .() V:;Tr[O/2)-v]J 1 (t 2 _1).+(1/2) (x>o, I I ) - 2" <Rep<2 . J.(z)=  f (O+) eZ['-(I/')J/2t-V-1dt (Rez > 0). 2'7Tl -00 (The contour goes once around the negative real axis in the positive direction.) Sommerfeld's formula J ( ) =  f 2"-'I)+iOO iz eos , i'['-("/2)) d . Z 2 e e t, '7T -'I)+ioo HY)(z) = 1 f 'l)-iOO eizeoS'ei'['-("/2)]dt, '7T -'I)+ioo HP)(z) = 1 1 2 "-'1)+ ioo eizeos'ei.[,-(" /2)]dt '7T '1)_ ioo HY)(z)= -  e- i ""/2fooo eizeosh'coshptdt (-1/<argz<'7T-1/, 0<1/ < '7T). (0 < argz < '7T; when I' = 0, it holds also at z = 0). 2ie- i ""(z/2)' 1 00 . H(I)(z)= - e,zcoshtsin 2 Ptdt . V1ir[I'+(I/2)] 0 (O<argz < '7T, ReI'> -1/2; when z=O, -1/2< Rel'< 1/2). 
1803 App. A, Table 19.m Confluent Functions, Bessel Functions -i.,,/2 (00 H.(l)(z) = - j  J o e i %['-(1/t)]/2 t -v-1 dt (O<argz<'IT;whenargz=O, -1<Rel'<I). (3) Generating Function. [ z(t- t- I ) ] 00 exp 2 =Jo(z)+ n1 [t n +( - t)-nVn(z), 00 00 exp(iz cosO) = L jnJn(z)einfJ = Jo(z) + 2 L j nJ n (z)cos nO. n- -00 n=l 00 f J.(z)dz=2 L J.+ 2n +\(z). n=O 1 00 Kapteyn's series 1- z = 1 +2 L In(nz), n=1 1 z2 00 L J 2n (2nz) 2" 1 - z2 - n=1 (I zexp 1 <1 ) . l+ Sch16milch's series. Supposing thatf(x) is twice continuously differentiable with respect to the real variable x in 0..;; x";; 'IT, we have 1 00 f(x)=2" a o+ L anJo(nx) (0< x< 'IT), n=1 2 1 " 1 ,,/2 . ao=2f(0) + - du j'(uSlnq;)dq;, 'IT 0 0 where 2 1 " 1 ,,/2 . a n =- du uf'(USlnq;)cosnq;dq;. 'IT 0 0 00 00 I=J o (z)+2 L J 2n (z)=[Jo(z)]2+2 L [In(z)f n=\ n=1 (4) Addition Theorem. For the cylindrical function Z., we have 00 ei",yZ.(kR)= L In(kp)Z.+n(kr)ein'P n=-oo (R = V r 2 +p2-2rpcosq;, O<!/;< 1-, r-pe-''P e 2 .." = O<p<r, r-pe''P k is an arbitrary complex number), Z.(kR) 00 J.+m(kp) ZHm(kr) () R. - 2.k-.f(l') L (I'+m) . . C":(cosq;) m=O p r (1''1' negative integer). eXp[<-I),+ljkR] (_l)'+lj 00 R = 1- y;:;; L (2m + 1)'Jm+(1/2)(kp)H'(1/2)(kr)P m(cosq;) rp m=O (/= 1,2). eikPCOS'P= ( 2 r/ 2  jm(2m+ I)Jm+(1/2)(kp)Pm(cosq;) p m=O 00 = 2.f(l') L (I' + m)jmJ.+m(kp)(kp)-.CJ;)(cosq;) m=O (1''1'0, -1, - 2, ...), where Pm is a Legendre polynomial, and C!:) is a Gegenbauer polynomial. (5) Infinite Products and Partial Fractions. Letj.,n be the zeros of z -.J.(z) in ascending order with respect to the real part. We have ( ) (z/2). rr oo ( z2 ) J z = }-- · f(I'+l)n=1 jn Note that if I' is real and greater than -1, all zeros are real. (1''1' -1, -2, -3, ...). 
App. A, Table 19.m Confluent Functions, Bessel Functions 1804 Kneser-Sommerfeld formula 'lTJ (xz) 00 J ( ) . X ) J () . X ) v [Jv(z)Nv(Xz) - N v (z)J v (Xz)] = L v 2 v,n' 2 v ,2 v, 4J v (z) n=l (z -}v,n)Jv,n(}v.n) (O<x<X<J, Rez>O). (6) Definite Integrals. l 7T12 I l 2z l z I l 7TSin(zSinO) O Jv(zcosO)cosOdO= 2z 0 J2v(t)dt, J,..(t)dt= - . 0 cos,.,.OdO. o 71 0 Sill (00 - at J( b ) ,..-l d = (b/2a)"r(p.+v) F ( P.+v p.+v+l +1 .:::..!:. ) }o e v t t t a"'r(v + 1) 2 I 2' 2 ; v ; a 2 (Re(a+ib»O, Re(a-ib»O, Re(p.+v»O). l OO -at v _ (2x)"r[v+(1/2)] e Jv(bt)t dt - v+(1/2) _ o (a 2 +b 2 ) V'lT ( Y a 2 +b 2 -a ) V (OOe-atJv(bt) dt = b V }O t v (ReV> -, Rea>IImb l ). (Rev>O, Rea>IImbl). Sommerfeld's formula d ik VT2+k2 l oo J ( ) -lxI VT'-k2 ".". _ e o".r e - o y".2_k 2 Yr 2 +x 2 (r,xreal; -71/2<.argy ".2-k 2 <71/2, O<.argk<'lT). Weyrich's formula i + 00 - eik Vr2+x2 - f eiTXHJl)(rYk2-".2 )d".= 2 -00 Yr2+x2 (r,xreal; 0<.arg Yk 2 -".2 <71, 0<. argk < 71). Weber-Sonine formula l OO J ( ) _ p 2t2 ,.. -l d = (a/2p)"r[(v+p.)/2] F ( v+p. . v at e t t () I I 2 ' v + 1 ; o 2p ,.. r v + 1 -a 2 ) 4p2 (00 Jv(at)e-p2t2tv+ I dt = a V e-a2/4p2 }o (2 p 2)V+l Sonine-Schafheitlin formula (Re( p. + v) > 0, largpl < 7 / 4, a> 0), (Rev> -1, largpl < 71 /4). l OO A a"'r[(p.+v-,\+I)/2] J (at)Jv(bt)t- dt= 0'" 2 A b,..- H1 r[(-p.+v+,\+I)/2]r(p.+I) ( p.+ v-,\+ 1 X 2 F 1 2 ' p.-v-'\+1 2 a 2 ) p. + I; b 2 (Re(p.+v-'\+I»O, Re'\>-I, O<a<b). (7) Asymptotic Expansion. (i) Hankel's asymptotic representation. We put [4v 2 -12][4v 2 -3 2 ]... [4v 2 -(2m-l)2] (v,m)= 2 2 mm! (m= 1,2,3, ...); (v,O)= 1. For Izl»lvl, Izl» I, , 12 ( V'lT 'IT ) f Ml m(v,2m) 2M ] Jv(z)=V m cos z-T- 4 l m -: o (-I) (2z)2m +O(lzl- ) _ , (2 s in ( z- V'lT _ )[ Mil (_I)m (v,2m+l) +O(IZI-2M-1) ] V-;Z 2 4 m=O (2z)2m+l (- 71 < argz < 71), 
1805 App. A, Table 19.IV Confluent Functions, Bessel Functions {[ ( 1''iT 'iT )[ M-I m(I',2m) 2M ] N.(z)= - sin z-T-"4 L (-I) -------z;;;-+O(lzl- ) 'iTZ m=O (2z) +  (2 c os ( z _ V'iT _ '!!.. )[ Mi 1 (_I)m (v,2m+ I) + 0 (IZI-2M-I) ] Y;Z 2 4 m=O (2z)2m+1 (- 'iT < argz < 'iT), _(2 [ ( V'iT 'iT )][ M-I (I',m) M ] H.(l)(z)=y ;z exp i z-T-"4 mo (_2iZ)m + O (lzl- ) (-'iT <argz < 2 'iT), _(2 [ ( 1''iT 'iT )][ M-I(V,m) ] HP)(z)=Y m exp -i z-T-"4 mo (2iZ)m + O (l z l- M ) (-2'iT <argz <'iT). (ii) Debye's asymptotic representation. vx, 1-(l'lx»E, vlx=sina, when 1-(l'lx»(3Ix)I'I/2, H.(l)(x)_ vb- ex p [ ix { cosa + (a -}- )sina } ] [ e;,,/4 ( 1 5 2 ) 3e3,,;/4 X ---y-+ 8+ 24 tan a 2X3 3 77 2 385 4 3. 5e 5 ,,;/4 +(128+ 576 tana+ 3456 tana) 22X5 +...] (X = [- xcos(aI2)]1/2). vx, (vlx)-I>E, vlx=cosha, when Iv2-x211/2»1, Iv2_x213/-2»1 H.(l)(x)_ v exp[x(a cosh a - sinha)] [ 1 ( 1 5 2 ) 3 ( 3 77 2 385 4 ) 3. 5 ] X X + 8 - 24 coth a 2X 3 + 128 - 576 coth a+ 3456 coth a 2 2 X 5 +... (X = [- x sinh( a 12)]1/2). Whenvx, Ix-vl«x l / 3 , x»l, x-v=8, (2) 6 1 / 3 e;,,/3 [ [(1/3) 1/3 ;,,/3 r(2/ 3 ) ( 2 3 ) r(4/3) H. (x)- 3 1 / 2 -----vJ -6 e 8 + _ 5 8-8  / 3 'iT x X x + ( --.L _  + 84 ) 6 1 / 3 ;,,/3 r(5/ 3 ) + ] 140 4 4 e x5/3 ... (iii) Watson-Nicholson formula. When x,v>O, w=[(xlv)2-1]1/2, H.(t)(x)=r 1/eXp[( - 1)'+ Ii {('iT 16) + v(w -(w 3 13)- arctanw)}]H I %(VW 3 1 3 ) + 0 Iv -II (/= 1,2). (IV) Functions Related to Bessel Functions (1) Modified Bessel Functions. I.(z) = e-;I"1T/2J.(e;,,/2 z ) =  (z 12)"+2" n=O n!r(v+n+I)' K.(z)= i; e;I"1T/2H.(l)(e;"/2 z ) = _ i; e-;I"1T/2Hp)(e-;"/2 z ) = '!!.. L.(z)- I.(z) = ( .!!- ) 1/2w: 2z 2 sin V'iT 2z o..(). 
App. A, Table 19.IV Confluent Functions, Bessel Functions 1806 Recurrence formulas: Iv_1(z) - I v + [(z) = (2v I z)Iv(z), Iv_[(z) + I v + [(z)= 2I;(z), Kv_[(z) - K v + [(z) = - (2v I z)Kv(z), Kv_[(z) + K v + [(z) = - 2K;(z), Airy's integral: K _ v(z)= Kv(z). ( 00 3 'iT. r;- [ ( 2xVX ) ( 2xVX ) ] J o cos(t -tx)dt=3V3 J[/3 3V3 +J- 1 / 3 3V3 ' (00 3 IJ ( 2XVX ) J o cos(t +tx)dt=3 vxK [/3 3\1'3 (x>O). H. Weber's formula: 22 e-<Q2+ b 2)/4p2 Iv ( ;:2 ) = 10 00 e-p212Jv(at)Jv(bt)tdt (Rev> -1, largpl < 'iT 14; a,b > 0). 4Sin(lt-v)'iT l oo Watson's formula: JIL(z)Nv(z) - Jv(z)N,/z) = 2 K v _ IL (2z sinh t)e<IL+V)1 dt 'iT 0 (Re z > 0, Re( It - v) < 1), Nicholson's formula: aNv(z) aJv(z) 4 l oo Jv(z)- a - Nv(z)- a = - - K o (2zsinht)e- 2v1 dt v v 'iT 0 8 l oo J}(z) + N v 2 (z) = 2" Ko(2zsinht)cosh2vtdt 'iT 0 2 2 8cosv'iT l oo . J v (z) + N v (z) = ---z K 2v (2z smh t)dt 'iT 0 (Rez > 0). (Rez>O). Dixon-Ferrar formula: (Rez>O; 1 1 ) - 2" < Re v < 2" . (2) Kelvin Functions. ber v(z):t ibeiv(z) = J v ( e :t3"i/4 Z ), herv(z):t iheiv(z)= H v (l)(e:t3"i/4 z ), kerv(z)= -('iT12)hei v (z), keivC z ) = ('iT 12)herv(z). When v is an integer n, bern(x) - ibein(x) = (- ItJn(VI x), hern(x)- ihein(x)= (-It+ [Hn(l)(VI x) (x real). (3) Struve Function. 2(zI2)" l "/2 Hv(x)= sin(zcosO)sin 2 vOdO r[v+ (/2)]\1'; 0 00 (_1)m(zI2)v+2m+l L m=O r[m+(3/ 2 )]r[v+m+(3/ 2 )] I i " . Jv(z)= - cos(vO-zsmO)dO. 'iT 0 EvCz) = 1 ("sin(vO - zsinO)dO. 'iT J o . \"1 _ 2 d 2 d 2 2 Puttmg v v = z 2 + z - d + z - v , dz z Anger function: H. F. Weber function: 4(zI2)"+[ (z-v)sinP'lT V' H ( z ) = V' J ( z ) = v v r[v+(1/ 2 )]v"?T' v v 'iT' z+v (z-v)COSV'iT V' E ( z ) = - - - . v v W W 
1807 App. A, Table 20 Systems of Orthogonal Functions When I' is an integer n, In(z) = In(z). rz 'lTZ J o Jo(t)dt=zJo(z)+ T[J1(z)Ho(z)-Jo(z)H1(z»), foz No(t)dt = zNo(z) + '"; [N1(z)Ho(z) - N o (z)H 1 (z»). [nI2] n(n-j-l)! ( 4 ) Neumann Pol y nomials. 0 ( t ) -  n - £.. . ( ) n-2J+l )=0 J! t/2 (n is a positive integer), °o(t)=I/t. I 00 - = I +2 L On(t)Jn(z) t-z n=l (I tl > Izl). Schliifli polynomials: Sn (t) =  [ tOn (t) - cos 2 n 2 'lT ] (n is a positive integer), So(t)=O. V'nSn(x)=2n+2(x-n)sin2(n'lT/2) (V'n is the same operator defined in (3». f(dm) ( I-m -m Lommel polynomials: Rm p(z)= m 2 F 3 ----r-' 2; 1', - m, . f(I')(z/2) I - I' - m; - Z2 ) = ('lTZ /2 sin 1''lT)[J p + m (z)J _p+ l(Z) + (- IrJ _p_m(z)Jp_l(z») (m is a nonnegative integer). References See references to Table 16, this Appendix. 20. Systems of Orthogonal Functions ( 317 Orthogonal Functions) i b Pn(x)Pm(x)q;(x)dx= 8nmAn Name Notationpn(x) Interval (a, b) Weight q;(x) NormAn Legendre Pn(x) (-1,+1) I 2/(2n + I) Gegenbauer C;(x) (-1,+1) (1- x 2 y-(1/2) 2'lTf(21' + n)/2 2p (n + l')n![f(I'»)2 Chebyshev Tn (x) (-1,+1) (I_X 2 )-1/2 'IT(n=O); 'IT/2 (n ;;.1) Hermite Hn(x) (-00,+00) e- x2 Y;'n! 2 Jacobi Gn(a, y; x) (0, I) xy-I(I- x)"-y n![f(y)] f(a+n-y+I) (a +2n)f(a + n)f( y + n) Laguerre LnQ(x) (0, 00) xQe- X f(a+n+ I)/n! For Legendre polynomials Pn(x)  Table 18.11, this Appendix. (I) Gegenbauer Polynomials (Gegenbauer Functions) f( n + 21') ( 1 1 - t ) C; (t)= 2FI n+2v, -n; v+-; - n!f(2v) 2 2 f(2v +n)r[v + (1/2)] [ 1 J (1/4)-<V/2) - ( t 2 1 ) ffi(I/2)-v ( ) f(2v)n! 4 - +'n+v-(1/2) t . 
App. A, Table 20.11 Systems of Orthogonal Functions 1808 Generating function ( I-2at+a2 ) -v=  CV ( t ) an CI+(1/2) ( X ) = 1 n-:O n . n-I (2/-1)!! dlPn (t) dt l Orthogonal relation (7' 17f(21' + n) In (sin2vfJ)C(cosfJ)C:(cosfJ)dfJ= 2 8 n m . o 2 2v - I (I'+n)n![f(I')] (II) Chebyshev (Tschebyscheff) Polynomials (1) Chebyshev Polynomial (Chebyshev Function of the First Kind) Tn( x)= cos( narc cosx) =(1/2)[ (X+iV l-x2 )' +(x-iv'T=":? )'] =F(n,-n; 1/2; (1-x)/2) [nI2] =  (_I)j ( ;. ) xn-2j(I_x2)j j=O J ( _1)' (1- X 2 )I/2 d n (1- X2),-(1/2) (2n -1)!! dx n Chebyshev function of the second kind Un(x)= sin(n arccosx) =(1/2i)[ (x + i)' - (x- iV I- x 2 )'] (_If-In d n - I (1_X 2 )"-(I/2) (2n-1)!! dX.-I T,,(x), U.(x) are mutually linearly independent solutions of Chebyshev's differential equation (1- x 2 )y" - xy' + n 2 y = 0. Recurrence relations are T n + 1 (x)-2xT n (x)+ Tn-I (x)=O, Generating function: 1 - t 2 00 2 = To(x)+2  Tn (x)t n , 1-2tx+ t n=O Orthogonal relation: U n + 1 (x)-2xU. (x) + U.- I (x) =0. 1 1 - 2tx + t 2 1 00  U n + 1 (x)t n . n=O f +1 Tm (x) Tn (x) { o _ dx= 7T/2 VI-x 2 -I 7T (m 0;1= n), (m= no;l=O), (m=n=O); f +1 { T.' Um(x)Un(x) _ 2 - dx= . -I V l-x 2 ° (m=no;l=O), (otherwise). Orthogonality in finite sums. Let uo, U I' . . ., Uk be the zeros of Tk + I (x). All zeros are real and situated in the interval (-1,1). Then we have k { o  Tm(uJTn(uJ= (k+ 1)/2 j=O k+l (mo;l=n, or m=n=k+ 1), (1 <;;m=n<;;k), (m= n=O). Let Pn (x) be the best approximation of x' in - 1 <;; x <;; I by polynomials of degrel at most n - 1. Then we have xn-Pn(x)=2-n+ITn(x). (2) Expansions by Tn (x). 00 e ax =Io(a)+2  In(a)Tn(x), n=1 00 sinax=2  (-I)"J 2n + I (a)T 2n + 1 (x), n=O 00 Cosax = J o (0) + 2 L (-l)'J 2n (a) T 2n (x), n=1 
1809 App. A, Table 20.IV Systems of Orthogonal Functions 00 logO + xsin2a) = 21ogcosa - 2  1..( - tana)n Tn (x), n n=1 00 , j- ) 2n + 1  n(v2-1 arctanx=2L.(-I) 2n+l T 2n + I (X). n=1 (III) Parabolic Cylinder Functions (Weber Functions) (167 Functions of Confluent Type) Parabolic cylinder functions: Dv(z) == 2(1/4)+(v/2) Z -1/2 "'( 1/4)+(v/2). -1/4 (Z2 /2) =;; 2(1/4)+(v/2)Z 1/2 [ M(1/4)+(V/2). _1/4(Z2/2) + M(I/4)-(v/2). -1/4(Z2/ 2 ) J r[(1-v)/2] r(-v/2) 2 C [ 1 ( -v 1 Z2 ) =2 v / 2 e- z /4 V n r[(1-v)/2] 1 F 1 T;"2; 2 -fiz ( l-V 3 Z2 )J r(_v/2)I F I 2 ;"2;2 . The solutions of Weber's differential equation ; + (v+  _ 2 )u=o are given by D.(z), D.(-z), D_._1(iz), D_._1(-iz), and the fol1owing relations hold among them. D. (z) = [f(v + 1)/V 27T J [ e i P'lT/2D -.-1 (iz) + e -i.w/2D -.-1 ( - iz) ] = e-iP'lTD. (- z) + [ V 27T If( - v) Je -i(p+ 1)w/2D _._[ (iz) = eiP'lTD. ( - z)+ [V 27T If( - v) Jei(P+ 1)w/2D -.-1 (- iz). Integral representation: -z2/ 4 1 oo D.(z)=  e- zt -(t 2 /2)t-.- 1 dt (Rev<O). r( - v) 0 00 n e-(z2/ 4 )-Zl-(t 2 /2)=  (-;) Dn(Z)= l c+ioot'f(-v)D.(z)dv (c<0,largtl<7T/4). n. 2m c-;oo n=O Recurrence formula: D.+ 1 (z)-zD.(z)+ vD._I (z)=O, dD. (z)ldz+ O/2)zD. (z)- vD._I (z)=O. 2./ 2 V-';; , 2('+ 1)/2V-';; D.(O)= f[(1-v)/2] , D.(O)=- f(-vI2) Asymptotic expansion: -z2/4 . ( v(v-l) v(v-l)(v-2)(v-3) _ ) D(z)e z 1- + +... · 2Z2 2.4z 4 (Iargz l < 7T). D_I(Z)=eZ2/[I-erf(  )} erf(X)= ..); fo X e- t2 dt (error function). (IV) Hermite Polynomials For the parabolic cylinder functions, when v is an integer n, we have Dn (z) = (- 1 )ne z2 / 4 d n (e -z2/2) I dz n = e -z2/ 4 Hn (z I VI ), 
App. A, Table 20.V Systems of Orthogonal Functions 1810 where HAx) is the Hermite polynomial Hn (x) == 2- n / 2 ( -1)"e x2 d n (e- X2 ) / dx n = e x2 / 2 Dn cvI x). A Hermite polynomial is more often defined by the following function Hen(x) (e.g, in W.F. Magnus, F. Oberhettinger, and R. P. Soni [ID. Hen(x) ==( - l)"e x2 / 2 d n (e- x2 / 2 ) / dx n = e x2 / 4 Dn (x) = Hn (x /VI ). The function y = Hn(x) is a solution of Hermite's differential equation y" -2xy' +2ny=O. Hn(x) is a polynomial in x of degree n, and is an even or odd function according to whether n is even or odd. H 2n (x)=( -1)"(2n-1)!!IF 1 (- n; 1/2;x 2 ), H2n+1 (x)=( -1)"(2n+ 1)!!VI xlFI (-n;3/2;x 2 ). Recurrence formula: H n + 1 (x)= VI xHn (x)- nHn_1 (x)= VI xHn (x)- H (x)/VI , H (x)= VI nHn_1 (x). (-1)"(2n)! " H 2n (O)= 2 n , =(-1) (2n-1)!!, H 2n + 1 (O)=O. n. Generating function: 00 e V2tx -(t'/2)=  Hn(x)tn/nL n-O Orthogonal relation: f +oc 2 - -00 Hn(x)Hm(x)e- X dx=8 nm n!V'lT . (V) Jacobi Polynomials G n (a, y;x)== F( - n,a + n; y;x) =XI-Y(I_X)y-a f(y+)n) d d n n [xy+n-I (l_x)a+n- Y ]. f( y x These satisfy Jacobi's differential equation x(1- x)y" + [y - (a + l)x]y' + n(a + n)y = O. Orthogonal relation: In l -I a-y n!r(a+n-y+ 1)r(y)2 x y (I-x) Gm(a,y;x)Gn(a,y;x)dx= ) ) 8mn o (a+2n f(a+n)f(y+n (Rey>O, Re(a-y»-I) Representation of other functions: ( I-X ) ( II-X ) Pn(x)=G 1,1;--y-, Tn(x)=G O'I;--Y- , "f(2v+n) ( II+X ) C;(x)=(-I) f(2v)'n! G n 2v,v+"2;--Y- . 
1811 App. A, Table 20.VII Systems of Orthogonal Functions (VI) Laguerre Functions (1) Laguerre Functions. L (O) ( ) qoc + v + 1) -(0+1)/2 =/2 M ( ) v Z -qoc+1)qv+1)Z e [(0+1)/2]+v,a/2 Z _ qoc+v+ 1) F -V'OC+ l' z - qoc+1)qv+1) 11( , ,), These satisfy Laguerre's differential equation zd 2 [La)(z)J; dz 2 + (a + 1- z)d [La)(z)J; dz + jlLa) (z) = 0. (2) Laguerre Polynomials. When v is an integer n (n = 0, 1,2, ...), the function LO)(x) reduces to a polynomial of degree 11 as follows. Laguerre polynomials: L (a) ( ) = exx-a  ( -x n+a ) = L n ( n+a ) (-x)j n X ' d n ex. .,. n. x n-J J. j=O ( -ltn! L n (O) ( x ) = 1 , L o (m) ( x ) = 1 , V-m) ( x ) = xmL(m) ( x ) ( m-Ol 2 ) n+m (n+m)! n - , , ,... . Recurrence formulas: nLa)(x)= ( - x + 2n + a -1)Ll(X)- (n + a -1)L2(x), xd[La)(x)J; dx = nLa)(x)- (n + a)LI(x) (n =2,3,...). Generating function: - xI/(l- I) 00 e a+1 =  La)(x)tn (Itl < 1). (I-t) n=O Orthogonal relations: 10 00 e-xxaLf.a)(x)La)dx= 8 mn f(a + n + 1)/ n! = 8 mn f(I + a)( n  a). H 2n (x) = ( - 2)"n! L-1/2)(x2), H2n+1 (x) = (- 2)"n! Y2 XLI/2)(x2). (3) So nine Polynomials. s(a) ( x) == ( - 1)n L(a) ( x ) . n f(a+n+l) n (VII) Orthogonal Polynomials P ( X =  ( -1 k ( n )( n+k ) x(x-l)...(x-k+1) n,m) L.J ) k k m(m-1)...(m-k+l) k=O (where n, m are positive integers and nm). We have the same polynomials if we replace Xk in Pn(l- 2x) by x (x - 1) .. . (x - k + 1) / m (m - 1) . .. (m - k + 1) (k = 0, 1, . .., n). Orthogonality in finite sums:  P (k)P ( k ) =8 (m+n+ 1)!(m-n)! . "'" n,m I,m nl ( ( 2 k-O 2n+1) m!) Chebyshev's q functions: (-I)n(m-l)! qn(m,x) = 2 n (m _ n _ I)! Pn,m-l (x), n,m (x) = [2 n (n!)2 /(2n)! ]qn(m, x -1). For given data Yk at m points x k = XI + (k - l)h (k = 1, ... , m) that are eq ual1y spaced with step h, the least square approximation among the polynomials Q(x) of degree n( <m), i.e., the m polynomial that minimizes the square sum of the residues s= L [Yk-Q(Xk)]2 is given by the k=1 
App. A, Table 21 Interpolation 1812 following formula ( 19 Analog Computation): n m 2  Bk ( X - XI ) m 2  Bk Q(x)= L.J Sk.m -----,z-+l, S=  (Yk) - L.J S' k=O k k= I k=O k m m Bk =  Yik.m (i), Sk =  [k.m (i)f i=1 i=1 References See references to Table 16, this Appendix. 21. Interpolation (223 Interpolation) (1) Lagrange's Interpolation Polynomial. n  (x- xo)(x - XI)'" (x- xs_I)(x - Xs+ d...(x- x n ) f(x) = L.J f(xs) . (x s - xo)(xs - XI)'" (x s - xs-I)(x s - x s + I )... (x s - X n ) s=O Aitken's interpolation scheme. The interpolation polynomialf(x) corresponding to the value Ys = f(xs) (s = 0, I, ... ,n) is given inductively by the following procedure. The order of xo,x b ''''X s is quite arbitrary. Ps.o(x)= Ys (s=O, 1, ...,n), Ps.k+l(X)= [(xs-X)Pk.k(X)-(xk-X)PS.k(X)]/(Xs-Xk) (s=k+ l,k+2, ...,n), f(x)='Pn.n(x). (2) Interpolation for Equally Spaced Points. When the points Xk lie in the order of th,ir sub- scripts at a uniform distance h (x s = xo+ sh), we make the following difference table (,h = h). Forward difference: tli=.tl! ='};+I- }; = J(x i + I) - f(x), tl =.tl 1- tl-I. Variable Value of Function (1st) Difference (2nd) (3rd) (4th) xo-2tlx f-2 xo-tlx f-I Xo fo xo+tlx fl Xo + 2tlx f2 Xo + 3tlx f3 tl_ 2 tl 2 -2 tl_ 1 tl1 tl:' 2 tl 4 -2 !lo tl5 tl:' 1 tl 4 -I tl l tl 2  I tl 2 Backward difference: is. =.is.:- I - is.::: 1 = tl_i' Central difference: 8/ = 8t:;II/2) - 8t::: 11/2)' 8/+ (sj2) = tl. Newton interpolation formula (forward type): u u(u-l) 2 u(u-l)(u-2) 3 f(xo+utlx)=f(xo)+TItlo+ 2! tl o + 3! tl o u(u-1)(u-2)(u-3) 4 + 4! tl o +.... Gauss's interpolation formula (forward type): u u(u-l) 2 u(u-I)(u+l) 3 f(xo+utlx)=f(xo)+TI!lo+ 2! tl_ l + 3! tl_ 1 u(u-l)(u+ 1)(u-2) 4 + 4! tl- 2 +.... 
1813 App. A, Table 22 Distribution of Typical Random Variables Stirling's interpolation formula: _ u -I+O u 2 2 u(u 2 -1) 2+1 u 2 (u 2 -1) 4 f(xo+ux)-f(xo)+TI 2 + 2! -1+ 3! 2 + 4! -2+"" Bessel's interpolation formula: f( XotXl +  ) = f(xo)+f(Xl)  1- ( 2_1 ) 1+5  ( 2_1 ) 3 2 v x 2 + l! 0+ 2! v 4 2 + 3! v 4 -1 1 ( 2 1 )( 2 9 ) 2+-:1 + 4! v - 4" v - 4" 2 + .... Everett's interpolation formula: He-I) (e-l)(2-4) f(xo+ux)=f(Xl-x)=if(xo)+ 3! l+ 5! -:2+". u(u 2 -I) 2 u(u 2 _I)(u 2 -4) 4 +uf(x 1 )+ 3! o+ 5! -l+"'. (=I-u) (3) Interpolation for Functions of Two Variables. Let xm=xo+mx,Yn==Yo+ny (m and n are integers). We define the finite differences as follows: x(xo,Yo) ==f(XbYO) - f(xo,Yo), ixo'Yo) ==f(XO'Yl) - f(xo,Yo), ;(xo,Yo) ==x(XbYO) - x(xo,Yo) == 8} (Xl'YO)' xy(xo,Yo) ==iXbYO) - /xo,Yo) =X<XO'Yl) -X<xo,Yo), }(xo,Yo) ==y(XO'Yl) - y(xo,Yo) == 8/ (XO'Yl), Newton's formula: f(xo+ ux,yo+ vy) = f(xo,Yo) + (ux + Vy)(xo,Yo) + (1 /2!) [ u(u -I); + 2uvxy + v( v -I)}] (xo,Yo) + ... . Everett's formula. Putting s == 1 - u, t == 1- v we have f(xo+ ux,Yo+ vy) = stf(xo,Yo) + svf(XO,Yl)+ utf(x1,yo) + uvf(X1'Yl) - (1 /6) [ us (1 + s){ t8} (xo,Yo) + v8} (XO'Yl)} + us (1 + u) { t8} (XbYO) + v8} (XbYl)} + vt(1 + t) { s8} (xo,Yo) + u8} (XbYO)} + vt(1 + v) { s8/ (XO'Yl) + u8} (XbYl)} ] + '" References [1] F. 1. Thompson, Table of the coefficients of Everett's central-difference interpolation formula, Tracts for computers, no. V, Cambridge Univ. Press, 1921. [2] M. Lindow, Numerische Infinitesimalrechnung, Dummler, Berlin, 1928. [3] H. T. Davis, Tables of the higher mathematical functions I, Principia Press, Bloomington, 1933. [4] K. Hayashi and S. Moriguti, Table of higher transcendental functions (in Japanese), Iwanami, second revised edition, 1967. 22. Distribution of Typical Random Variables ( 341 Probability Measures, 374 Sampling Distributions) In the following table, Nos. 1-13 are I-dimensional continuous distributions, and Nos. 20-21 are k-dimensional continuous distributions, for which the distribution density is the one with respect to Lebesgue measure. Nos. 14-19 are I-dimensional discrete distributions, and Nos. 22-24 are k-dimensional discrete distributions, where the density function P (x) means the probability at the point x. The characteristic function, average, and variance are given only for those represented in a simple form. 
App. A, Table 22 1814 Distribution of Typical Random Variables No. Name Symbol Density Function Domains N(",a Z ) 1 '[ (x - ,,)' ] Normal (haZj'!, ex p -  -O:)<X<r:A:; 2 Logarithmic normal .!..exp[ _ (108Y-")' ] O<y < co (2""Z) 'I' Y 2a z Gamma r(p,a) [f(p)l-I a -pxp-Ie-x/o O<x<oo 4 Exponential e(".a) (I/a)exp( -(x-,,)/ a) IJ.<x< 00 Two-sided exponential (1 /2a)e -Ixl/o -CX)<X<r::£J 6 Chi square xZ(n) 2 -nlz[f(n /2)1-lx(nlz)-le -xlz O<x < 00 Beta B(p.q) [B(p,qW IxP-I(l-x)"-1 O<x<l F F(m.n) 2Kpx(/"-I[1 +(mx/n)r(+')/', O<x<oo Kp",[B (m/2,n/2)r ' (m/ n)/' 9 z(m,n) Kpe"'z II + (me 2z /n)r(")/'. -CXI<Z<r::£J Kp",[B (m /2. n/2)] -I(m / n)/' 10 I(n) [v,; B(n/2.1/2W I [l+(tz/n)I-(n+I)/z -CXJ<t<r:A:; [ (X-")']"' II Cauchy C(".a) ("") - I 1+---;;2 -OO<X<r::£J 12 One-side stable c(2,,)-I/ Z x -3/ z exp( - C Z /2x) O<x<oo for exponent 1/2 13 Uniform rectangular U(a,{3) 1/({3-a) a<x<!3 14 Binomial Bin(n.p) ()pxqn-x x-0.1.2, ...,n 15 Poisson Pi}>,) e-'}>,x/x! x-O,I.2, '" (  )U:qx )/( ) x integer 16 H ypergeometnc H(N.n,p) 0<. <<.Np. 0<. n-x<. Nq 17 Negative binomial NB(m,p) f(m +x)[f(m)x!I-lp"'qX x-O.I,2. ... 18 Geometric G(p) pqx x-0.1.2. ... 19 Loganthmic KLqx / x. K L ", - I /Iogp x-I 2.3,... (2,,)-'I'I.l'I-I/Z -oo<xl'....x/c 20 Multidimensional normal N("..l') xexp[ -(x-,,).l'-I (x-,,)' /2], <co x-(xI, ...,Xk)'''-(''I' ....""). .l'-(aij) f(PI+'''+Pk+I) x't+I- 1 xr l - I xI> ....xk >0, 21 Dirichlet r(PI)...f(pk+l) k+ I, Xk+I-I-(xl +... +xk) Xt+.. +x/c<l -I M(n.(p,)) n!(xIL"Xk+l!) pI...P:"++II. x" ",xl< 22 Multinomial xk+l=n-(xl+ ...+xk) -O.I....,n. xl+.' +x/c<;n Multidimensional ( I ). . . ( :+,I )/( ). xI' ....)(k integers 23 hypergeometnc H(N.n.(p,)) O<;Xj<;NPi xk+l-n-(x,+ ...+Xk) (i-I, ....k + I) 24 Negative polynomial f(m+xI+...+Xk) xl'....xk f(m)xI! ...xk! p(J'p{I... pt,. -0.1.2. ... 
1815 App. A, Table 22 Distribution of Typical Random Variables Conditions for Charactenstic Mean Vanance No. Parameters Function -00<1-'<00, ( 0 2 12 ) 0 2 0>0 exp il-'I- 2 I-' -00<1-'<00. e"+(.,2/2) e 2 p.(e2cr 2 - eO") 2 0>0 p,o>O (I-iol)-P op olp -00<1-'<00,0>0 ei"'(I-iol)-1 1-'+0 0 2 4 0>0 (1 + 0 2 1 2 )-' 0 20 2 n positive integer (l-2it)-n/2 n 2n 6 P.q>O p pq 7 p+q (p+q)'(p+q+l) m, n positive integers n 2n 2 (m+n-2) n-2 (n>2) , (n>4) m(n-2) (n-4) m, n positive integers 9 n positive integer o (n> I) n/(n-2)(n>2) 10 -00<1-'<00. exp(iJLI- 0111) none II 0>0 none O<c<oo exp[ - clll'/2(1- il/III)1 none none 12 -oo<a<{3<oo (eipl-ei.')/il({3- a) (a+{3)/2 ({3-a)2/12 13 p+q-l.p.q>O, (peil + q)n npq 14 n positive integer np A>O exp[ -A(l- eil)l A A 15 p+q-l.p,q>O. (Nq)"'(N"')-' npq(N - n) N, Np, n positive integers X F( -no - Np; Nq- n+ I; eil). np N-I 16 N>n m"'=m!/(m- n)! p+q-l.p.q>O. pm mq mq - 17 m>O (I-qe il ( p p2 p+q-l.p.q>O p q q 18 - l-qe i' p p2 p+q-l.p.q>O - K L 10g(l- qe il ) KLq/p KLq(l- K L q)/p2 19 -00<1-'" ....I-'k ( IU) < 00. 2 symmetric exp il-'l'-2 . E(x,)=l-'i V(X;)=Oi;. 20 positive definite Cov(x;,x j ) - 0ij quadratic form 1=(1,. ....Ik) V(Xi) E(x) - Cvj(J'1 + '" +"k+ I-v,>. VI. ...."k+1 >0 vi COV(Xi,X j ) = - ell i "). 21 p(+'''+''k+1 C=(v, +... +Vk+')-' xCv, +... +vk+' + If' PI+...+pk+,=I. V(x,)=np,(I-p,), PI. ...,Pk+' >0 (p,e ,11 +.. +Pkeitk+Pk+l)n E(xi)=npi 22 n positive integer Cov(x i . x j )= - npjpj PI + ." +Pk+ I-I, V(x)= Cnpi(l-p). P,. ....Pk+,>O. E(xi)= npi Cov(xi,x)- - CnpJ'j' 23 N.Np" ...,N;..n N-n positive integers C=- N-I Po+p,+...+Pk- mpi V(x i )= mp,(Po+P,}/P5. I,Po.p"....;. p{{'(l-p,eill-... -;.eil,Y" E(x,) = Pc; 24 >0, m>O Cov(xi,Xj)- mpJ'j/p5 
App. A, Table 23 Statistical Estimation, Hypothesis Testing 1816 Remarks 1. Reproducing property with respect to /l, a 2 . 2. X = log Y: N (/l,a 2 ). 3. Reproducing property with respect to p. 4. e(O,a)=f(l,a). 6. n is the number of degrees of freedom; reproducing property with respect to n. 8. m and n are the numbers of degrees of freedom. 9. e 2z = F(m,n). 10. n is the number of degrees of freedom. 11. C(O,I)=t(l); reproducing property with respect to /l and a. 14. Reproducing property with respect to n. IS. Reproducing property with respect to A. 17. Reproducing property with respect to m. 18. G(p)=NB(I,p). 20. Generalization of normal distribution; reproducing property with respect to /l and 1:. 22. Generalization of binomial distribution; reproducing property with respect to n. 23. Generalization of hypergeometric distribution. 24. Generalization of negative binomial distribution; reproducing property with resplct to m. 23. Statistical Estimation and Statistical Hypothesis Testing Listed below are some frequently used and wen-investigated statistical procedures. (Concerning main probability distributions  398 Statistical Decision Functions, 399 Statistical Estimation, 400 Statistical Hypothesis Testing). The fonowing notations and conventions are adopted, unless otherwise stated. Immediately after the heading number, the distribution is indicated by the symbol as defined in Table 22, this Appendix. It is to be understood that a random sample (XI, X2,"" x n ) is ob- served from this distribution. Where two distributions are involved, samples (XI' ..., X n ,) and (Yr."" Yn2) are understood to be observed from the respective distributions. Next, a necessary and sufficient statistic based on the sample is marked with * when it is com- plete, and # otherwise. Then appears the sampling distribution of this statistic. For those statistics consisting of several independent components, the distribution of these are shown. Gwek lower- case letters except IX and X denote unknown parameters. Italic lowercase letters denote constants, each taking arbitrary real values. Italic capitalletters denote constants whose values are specified in each procedure; repeated occurrences of the same letter under the same heading number specify a certain common real value. Problems of point estimation, interval estimation, and hypothesis testing are presented, with corresponding estimators, confidence intervals, and tests (critical regions) as their solu1ions. An the confidence intervals here are those constructed from UMP unbiased tests, having 1 -IX as confidence levels. Alternative hypotheses are understood to be the negations of corresponding nun hypotheses. Significance levels of an the tests are IX. The fonowing symbols are attached to each procedure to describe its properties. For estimators: UMV: uniformly minimum variance unbiased. ML: maximum likelihood. AD: admissibility with respect to quadratic loss function. lAD: inadmissibility with respect to quadratic loss function. For tests: UMP: uniformly most powerful. UMPU: uniformly most powerful unbiased. UMPI( ): uniformly most powerful invariant with respect to the product of transformation groups shown in ( ). LR: likelihood ratio. 0: group of orthogonal transformations. L: group of shift transformations. S: group of change of scales. AD: admissibility with respect to simple loss function. 
1817 App. A, Table 23 Statistical Estimation, Hypothesis Testing lAD: inadmissibility with respect to simple loss function. (Note that UMPU implies AD.) The following symbols denote 100(1- a)% points of respective distributions, a being sufficiently smal!. u(a): standard normal distribution. t i ( a): t-distribution with f degrees of freedom. x;(a): X 2 distribution withf degrees of freedom. F/,'( a): F-distribution with U1J2) degrees of freedom. (1) N(/l,b 2 ). Xi*' N(n/l,nb 2 ). Point estimation of /l. x= tXi: UMV,ML,AD. Interval estimation of /l. (x:t: u(aI2) :n ). Hypothesis [/l';; k]. x>k+u(a): UMP,LR. ,I.: VfI Hypothesis [h.;; /l';; I]. x < h- C or x> 1+ C: UMPU,LR. (2) N (a, a 2 ). (Xi - a)2*. a2x. (a2x is the a 2 -multiplication of a random variable obeying the x 2 (n) distribution. We use similar notations in the following.) (Xi- a)2 Point estimation of a 2 . : UMV, ML, lAD. n Interval estimation of a 2 . (A(Xi - a)2, B(Xi - a)2). Hypothesis [a 2 .;;k]. (xi-a)2>x(a)k: UMP,LR. Hypothesis[a 2 =k]. (xi-a)2<Akor(xi-a)2>Bk: UMPU. (3) N(/l,a 2 ). ( Xi _ 2 ) *. ( N(n/l,na2) ) . (Xi- X) a 2 X:_l Point estimation of /l. x: UMV, ML, AD. [ _  (Xi- X)2 j Interval estimation of /l. x:t: t n - 1 (aI2) . Y n(n-I) x-k tn-1(a) Hypothesis [/l.;;k]. > UMPU,LR.  (Xi-X)2 Y n(n-l) Ix - kl t n - 1 (aI2) Hypothesis [ /l = k]. > UMPU, LR, UMPI(S, 0) for k = O.  (Xi-X)2 Y n(n-I) (Xi-X)2 (Xi-X)2 Point estimation of a 2 . UMV lAD ML l AD n-l ,. n ,. f[(n-I)/2]  Xi-X)2 Point estimation of a. V2 1 : UMV, lAD. 2 f(nI2) n- Interval estimation of a 2 . (A(Xi - X)2, B(Xi - X)2). Hypothesis [a 2 .;; k]. (xi-X)2>X;_I(a)k: UMP, LR. Hypothesis [a 2 =k]. (Xi-X)2<Ak or (Xi-X)2>Bk: UMPU. Hypothesis [a 2 ;;. k]. (xi-X)2<X;_I(1-a)k: UMPU,UMPI(L). Hypothesis [  .;; k l x > E: UMPI(S), AD.  (Xi-X)2 (4) Bin(N,O). Xi*' Bin(Nn,O). Point estimation of O.  : UMV, ML, AD. Hypothesis [O.;;k]. x>A: UMP. Hypothesis [h .;; 0.;; I]. x < B or x> C: UMPU. (5) H(N,m,O) (n= 1). x*. P . . . f 0 Nx Olnt estimatiOn 0 , - m Hypothesis [0.;; k]. x>A: UMV,AD. UMP. 
App. A, Table 23 Statistical Estimation, Hypothesis Testing 1818 (6) NB(N,O). x;*. NB(Nn,O). POI ' nt estl ' matl ' on of O. N Nn  - 1 1 (I h h d . . 0) UMV AD w en t e enommator IS: ,. n+ x j - Nn Nn+x : ML. I Hypothesis [0< k]. xj<A: UMP. Hypothesis [h < 0 < I]. Xj < B or Xj > C: UMPU. (7) PCA). Xj*' P(nA). Point estimation of A. x: UMV, ML, AD. Hypothesis [A < k]. x> A: UMP. Hypothesis [h < A < I]. x < B or x> C: UMPU. (8) G(O). Xj*' NB(n,O). For the point estimation of 0 and hypothesis testing  (6). (9) U [0, 0]. maxx;* . Point estimation of O. maxx j : ML, lAD. n + 1 maXXj: UMV, lAD. n Hypothesis [O<k]. maxx j >(1-a)l/nk: UMP. Hypothesis [0= k]. maxx j < ka l / n or maxx j > k: UMP. (10) U[,.,,]. (minx j , maxx j )*. n minxj - maxx j Point estimation of f 1 UMV, lAD. minx j : n- t+ '" minx , +maxx , Point estimation of Y 2 : UMV, AD. Hypothesis [.,,-< k]. maxxj-minxj>ka 1 / n : UMP. (11)U[9-i,0+i} (minxj,maxx;)#. minxj+maxx j Point estimation of O. ML, AD. ML, lAD. Hypothesis [0 < k]. 2 . k 1 I/n k 1. mm x j > + 2" - a or max x j > + 2"' ( f(n,a) + nil ) . e( Il, a / n) UMP. ( x ) * (12) e( Il, a). . ' . mmx j x-nminx. Point estimation of a. ' 1 ' UMV, lAD. x - minx j : ML, lAD. n- Point estimation of Il.  I minxj-  1 x: UMV, lAD. minx j : ML, lAD. n- n- Hypothesis [a<k,ll=h]. xj<horxj>kloga-l/n+h: UMP. Hypothesis [h<a</]. xj-nminxj<A orxj-nminxj>B: UMPU. nminxj-k nminxj-k Hypothesis [Il= k].  . <0 or  . >C: UMPU. xj-nmmx j xj-nmmx j (13) f(p,a), xi*' f(np, a). Point estimation of a. : UMV, ML, lAD. P Interval estimation of a. (CXi' DxJ Hypothesis [a< k]. xj>A: UMP. Hypothesis [a= k]. Xj < Ck or Xj > Dk: UMPU. N ( Ill> a 2 ) (14) N (1l2Y)' ( Xj ) *. ( N(nllll>nla2) ) . Yi N (n21l2,n 2 b 2 ) Point estimation of J.l1 - J.l2' X - y: UMV, ML, AD. ( a2 b 2 ) Interval estimation of J.ll -J.l2' x - Y :tu(oc/2) -+- . n 1 n 2 Hypothesis [J.lI-J.l2k]. x-y>k+u(oc) UMP, LR. 
App. A, Table 23 Statistical Estimation, Hypothesis Testing Hypothesis [1l]-1l2=k]. Ix-j-kl>u(aI2) a 2 b 2 - + - : UMPU, UMPI(L), LR. n] n2 [ X, ] * N (n] Ilbn](2) Y, N (n2 1l2,n2( 2 ) S2 = (X, - .>:)2 + (YI- j)2 a2x, +nz-2 Point estimation of Il]- 1l2' x- y: UMV, ML, AD. Intervalestimationofll]-1l2' ( X-j:ttn+n_2(aI2) yl + I .,./ S2 2 ) . I Z n] n2 V'n] +n2- . (x-j-k) Yn]n2 Vn]+n2-2 HypothesIs [IlI-1l2<k]. t= _ >t n ,+nz-2(a): UMPU, Yn] +n2 Ys2 N(Il],a 2 ) N(llb a2 ). UMPI(L), LR. Hypothesis [Il] - 112 = k]. It I > t n +n _2(a): UMPU, UMPI(L), LR. 2 I Z 2 Point estimation of 0 2 . s 2 : UMV, lAD. : ML, lAD. n] +n2- n] +n2 Interval estimation of a 2 . (As 2 , BS2). Hypothesis [a 2 <k]. s2>XI+nz_2(a)k: UMP,LR. Hypothesis [a 2 = k]. S2 < Ak or S2 > Bk: UMPU. Hypothesis [a 2 ;;'k]. s2>XI+nz-2(1-a)k: UMPU, UMPI(L), LR. N(Il],af) [ Xi' (XI_X)2 ] * N(1l2,an. Yi' (YI_j)2 a 2 [ (Xi-X)2 (Xi-X)2 ] Interval estimation of . A 2 ' B 2 . a (Yi- j) (Yi- j) . [ a i ] (n2-1) (Xi-X)2 HypothesIs 2" < k. - _ 2 > F::z'i(a)k: UMPU, UMPI(L, S), LR. a2 (nl-l)(Yi-Y) [  X"  ( X, - x) , ] N(llbIl2,af,a,p). : (x,-x)(y,-j) * Y" (YI-Y)' (Xi-X)(Yi-j) Point estimation of p. r = ML. V ( x i - X)2(Yi- j)2 t n -] (aI2) Hypothesis [p=O]. Irl> : UMPU, LR. Vt n-] (aI2)2 + n - 2 
Appendix B Numerical Tables 1 Prime Numbers and Primitive Roots 2 Indices Modulo p 3 Bernoulli Numbers and Euler Numbers 4 Class Numbers of Algebraic Number Fields S Characters of Finite Groups; Crystallographic Groups 6 Miscellaneous Constants 7 Coefficients of Polynomial Approximations 1. Prime Numbers and Primitive Roots (297 Number Theory, Elementary In the following table, p is a prime number and r is a corresponding primitive root. p r p r p r p r p r p r p r p r 2 79 3 191 19 311 17 439 17 577 5 709 2 857 3 3 2 83 2 193 5 313 17 443 2 587 2 719 11 859 2 5 2 89 3 197 2 317 2 449 3 593 3 727 5 863 5 7 3 97 5 199 3 331 3 457 13 599 7 733 7 877 2 11 2 101 2 211 2 337 19 461 2 601 7 739 3 881 3 13 2 103 5 223 3 347 2 463 3 607 3 743 5 883 2 17 3 107 2 227 2 349 2 467 2 613 2 751 3 887 5 19 2 109 11 229 7 353 3 479 13 617 3 757 2 907 2 23 5 113 3 233 3 359 7 487 3 619 2 761 7 911 17 29 2 127 3 239 7 367 11 491 2 631 3 769 11 919 7 31 3 131 2 241 7 373 2 499 7 641 3 773 2 929 3 37 2 137 3 251 11 379 2 503 5 643 11 787 2 937 5 41 7 139 2 257 3 383 5 509 2 647 5 797 2 941 2 43 3 149 2 263 5 389 2 521 3 653 2 809 3 947 2 47 5 151 7 269 2 397 5 523 2 659 2 811 3 953 3 53 2 157 5 271 43 401 3 541 2 661 2 821 2 967 5 59 2 163 2 277 5 409 29 547 2 673 5 823 3 971 11 61 2 167 5 281 3 419 2 557 2 677 2 827 2 977 3 67 2 173 2 283 3 421 2 563 2 683 5 829 2 983 5 71 7 179 2 293 2 431 7 569 3 691 3 839 11 991 7 73 5 181 2 307 5 433 5 571 3 701 2 853 2 997 7 tMersenne numbers. A prime number of the form 2 P -1 is called a Mersenne number. There exist 27 suchp's less than 44500:p=2, 3, 5, 7,13,17,19,31,61,89,107,127,521,607,1279, 2203, 2281, 3217, 4253, 4423, 9689, 9941, 11213, 19937, 21701, 23209, 44497. The: even perfect numbers are the numbers ofthe form 2 P - 1 (2 P -1), where 2 P -1 is a Mersenne number. 2. Indices Modulo p (297 Number Theory, Elementary) Let r be a primitive root corresponding to a prime number p. The index I=Ind,a of a with respect to the basis r is the integer I in Ol<p-l satisfying r';a(modp), a;b(modp) is equivalent to Ind,a:;: Ind, b(mod(p-l». The index satisfies the following congruence relations with respect to mod(p-1): Ind,ab:;: Ind,a+ Ind,b, Ind,a":;: nInd,a, Ind.a =Ind.r Ind,a. 
1821 App. B, Table 2 Indices Modulo p We can solve congruence equations using these relations. The following is a table of indices. p p-l r 2 3 5 7 II 13 17 19 23 29 31 37 41 43 47 2 1 - 3 2 2 I - 5 4 2 1 3 - 7 2'3 3 2 I 5 - 11 2.5 2 1 8 4 7 - 13 22,3 2 I 4 9 II 7 - 17 2 4 3 14 1 5 II 7 4 - 19 2.3 2 2 1 13 16 6 12 5 10 - 23 2.11 5 2 16 1 19 9 14 7 15 - 29 22,7 2 1 5 22 12 25 18 21 9 20 - 31 2.3.5 3 24 I 20 28 23 1\ 7 4 27 9 - 37 22,3 2 2 1 26 23 32 30 11 7 35 15 21 9 - 41 2 3 .5 7 14 25 18 I 37 9 7 31 4 33 12 8 - 43 2.3.7 3 27 I 25 35 30 32 38 19 16 41 34 7 6 - 47 2.23 5 18 20 1 32 7 11 16 45 5 35 3 42 15 13 - 53 2 2 .13 2 I 17 47 14 6 24 10 37 39 46 33 30 45 22 44 59 2.29 2 I 50 6 18 25 45 40 38 15 28 49 55 14 33 23 61 2 2 '3.5 2 I 6 22 49 15 40 47 26 57 35 59 39 54 43 20 67 2.3.11 2 I 39 15 23 59 19 64 10 28 44 47 22 53 9 50 71 2'5.7 7 6 26 28 I 31 39 49 16 15 68 11 20 25 48 9 73 2 3 ,3 2 5 8 6 I 33 55 59 21 62 46 35 11 64 4 51 31 79 2.3,13 3 4 1 62 53 68 34 21 32 26 1\ 56 19 75 49 59 83 2.41 2 1 72 27 8 24 77 56 47 60 12 38 20 40 71 23 89 2 3 '11 3 16 I 70 81 84 23 6 35 57 59 31 1\ 21 29 54 97 2 5 ,3 5 34 70 I 31 86 25 89 81 77 13 46 91 85 4 84 101 22,5 2 2 I 69 24 9 13 66 30 96 86 91 84 56 45 42 58 103 2. 3'17 5 44 39 1 4 61 72 70 80 24 86 57 93 50 77 85 107 2.53 2 I 70 47 43 22 14 29 78 62 32 27 38 40 59 66 109 2 2 .3 3 II 15 80 92 20 1 101 87 105 3 98 34 43 63 42 103 1\3 2 4 ,7 3 12 1 83 8 74 22 5 99 41 89 50 67 94 47 31 127 2'3 2 ,7 3 72 I 87 115 68 94 38 84 121 1\3 46 98 80 71 60 131 2. 5'13 2 I 72 46 96 56 18 43 35 23 51 29 41 126 124 105 137 2 3 '17 3 10 1 75 42 122 25 38 46 125 91 73 102 119 97 19 139 2.3.23 2 I 41 86 50 76 64 107 61 27 94 56 80 32 1\5 98 149 22,37 2 I 87 104 142 109 53 124 84 95 120 132 72 41 93 138 . 151 2,3,5 2 7 10 93 136 1 82 23 124 120 145 42 34 148 3 74 128 157 2 2 . 3 . 13 5 141 82 1 147 28 26 40 124 135 129 62 116 21 113 92 163 2.3 4 2 1 101 15 73 47 51 57 125 9 107 69 33 160 38 28 167 2.83 5 40 94 I 118 28 103 53 58 99 150 90 61 97 87 132 173 22,43 2 I 27 39 95 23 130 73 33 20 144 102 162 138 84 64 179 2.89 2 I 108 138 171 15 114 166 54 135 1\8 62 149 155 80 36 181 2 2 '3 2 '5 2 1 56 156 15 62 164 175 135 53 48 99 26 83 20 13 191 2.5.19 19 44 116 50 171 85 112 98 1 134 33 175 15 165 8 123 193 2 6 .3 5 34 84 I 104 183 141 31 145 162 123 82 5 151 24 29 197 22,7 2 2 I 181 89 146 29 25 159 154 120 36 141 192 1\0 78 66 199 2. 3 2 . II 3 106 I 138 142 189 172 123 55 118 70 164 11 167 88 76 211 2'3'5.7 2 1 43 132 139 162 144 199 154 21 179 115 118 17 80 124 223 2.3.37 3 180 I 89 210 107 147 144 172 163 128 82' 152 204 1\8 50 227 2 .113 2 I 46 11 154 28 61 99 178 34 8 197 77 131 150 218 229 2 2 .3'19 7 111 68 214 I 42 195 24 52 131 191 175 164 73 12 193 (table continued on fol1owing page) 
App. B, Table 3 1822 Bernoulli Numbers and Euler Numbers p p-l r 2 3 5 7 11 13 17 19 23 29 31 37 41 43 47 233 2 3 .29 3 72 1 165 222 197 158 103 136 112 132 182 8 85 25 139 239 2.7'17 7 66 74 138 1 4 43 52 155 63 160 188 31 99 15 113 241 2 4 .3.5 7 190 182 138 1 25 47 111 85 57 154 151 73 6 219 114 251 2.5 3 11 135 6 80 218 1 162 184 233 134 203 226 187 64 77 85 257 2" 3 48 1 55 85 196 106 120 125 28 94 242 219 ]9 207 61 263 2.131 5 190 50 1 79 166 62 126 43 156 221 136 170 17 154 65 269 22,67 2 1 109 208 19 230 142 105 223 176 187 259 56 200 254 32 271 2'3 3 .5 43 266 153 220 98 92 15 16 261 75 45 222 182 156 1 213 277 2 2 '3'23 5 147 188 1 22 7 222 103 252 208 74 47 87 126 55 218 281 2 3 .5. 7 3 204 1 186 182 253 9 166 221 197 172 62 135 23 132 75 3. Bernoulli Numbers and Euler Numbers (177 Generating Functions) Bn are Bernoulli numbers; En are Euler numbers. n Numerator of Bn Denominator of Bn Bn En 2 I 6 0.16667 1 4 I 30 0.03333 5 6 I 42 0.02381 61 8 I 30 0.03333 1385 10 5 66 0.07576 50521 12 691 2730 0.25311 2702765 14 7 6 1.16667 199360981 16 3617 510 7.09216 19391512145 18 43867 798 54.97118 2404879675441 20 174611 330 529.12424 370371188237525 22 854513 138 6192.12319 6.934887 X 10 16 24 236364091 2730 86580.25311 1.551453 X 10 19 26 8553103 6 1425517.16667 4.087073 X 10 21 28 23749461029 870 27298231.06782 1.252260 X 1Q24 30 8615841276005 14322 601580873.90064 4.415439 X 10 26 4. Class Numbers of Algebraic Number Fields (I) Class Numbers of Real Quadratic Field (347 Quadratic Fields) Let k = Q(vm ), where m is a positive integer without square factor (I < m .;; 501). h is the class number (in the wider sense) of k. The -sign in the row of N(f) means that the norm N(f) of the fundamental unit is - I. When N (f) = + I, the class number in the narrow sense is 2h, and when N(f)= -I, the class number in the narrow sense is also h. m h N(f) m h N(f) m h N(f) m h N(f) m h N(f) m h N(f) 2 1 - 85 2 - 170 4 - 253 1 335 2 421 I - 3 I 86 I 173 1 - 254 3 337 1 - 422 1 5 I - 87 2 174 2 255 4 339 2 426 2 6 I 89 I - 177 1 257 3 - 341 1 427 6 7 1 91 2 178 2 258 2 345 2 429 2 10 2 - 93 I 179 1 259 2 346 6 - 430 2 11 1 94 1 181 1 - 262 1 347 I 431 I 13 I - 95 2 182 2 263 I 349 I - 433 I - 14 1 97 I - 183 2 265 2 - 353 I - 434 4 
1823 App. B, Table 4.11 Class Numbers of Algebraic Number Fields m h N(f) m h N(f) m h N(f) m h N(f) m h N(f) m h N(f) 15 2 101 1 - 185 2 - 266 2 354 2 435 4 17 1 - 102 2 186 2 267 2 355 2 437 1 19 1 103 1 187 2 269 1 - 357 2 438 4 21 1 105 2 Ig0 2 271 1 358 1 439 5 22 1 106 2 - 191 1 273 2 359 3 442 8 - 23 1 107 1 193 1 - 274 4 - 362 2 - 443 3 26 2 - 109 1 - 194 2 277 1 - 365 2 - 445 4 - 29 1 - 110 2 195 4 278 1 366 2 446 1 30 2 111 2 197 1 - 281 1 - 367 1 447 2 31 1 113 1 - 199 1 282 2 370 4 - 449 1 - 33 1 114 2 201 1 283 1 371 2 451 2 34 2 115 2 202 2 - 285 2 373 1 - 453 1 35 2 118 1 203 2 286 2 374 2 454 1 37 1 - 119 2 205 2 287 2 377 2 455 4 38 1 122 2 - 206 1 290 4 - 379 1 457 1 - 39 2 123 2 209 1 291 4 381 1 458 2 - 41 1 - 127 1 210 4 293 1 - 382 1 461 1 - 42 2 129 1 211 1 295 2 383 1 462 4 43 1 130 4 - 213 1 298 2 - 385 2 463 1 46 1 131 1 214 1 299 2 386 2 465 2 47 1 133 1 215 2 301 1 389 1 - 466 2 51 2 134 1 217 1 302 1 390 4 467 1 53 1 - 137 1 - 218 2 - 303 2 391 2 469 3 55 2 138 2 219 4 305 2 393 1 470 2 57 1 139 1 221 2 307 1 394 2 - 471 2 58 2 - 141 1 222 2 309 1 395 2 473 3 59 1 142 3 223 3 310 2 397 1 - 474 2 61 1 - 143 2 226 8 - 311 1 398 1 478 1 62 1 145 4 - 227 1 313 1 - 399 8 479 1 65 2 - 146 2 229 3 - 314 2 - 401 5 - 481 2 - 66 2 149 1 - 230 2 317 1 - 402 2 482 2 67 1 151 1 231 4 318 2 403 2 483 4 69 1 154 2 233 1 - 319 2 406 2 485 2 - 70 2 155 2 235 6 321 3 407 2 487 1 71 1 157 1 - 237 1 322 4 409 1 - 489 1 73 1 - 158 1 238 2 323 4 410 4 491 1 74 2 - 159 2 239 1 326 3 411 2 493 2 - 77 1 161 1 241 1 - 327 2 413 1 494 2 78 2 163 1 246 2 329 1 415 2 497 1 79 3 165 2 247 2 330 4 417 1 498 2 82 4 - 166 1 249 1 331 1 418 2 499 5 83 1 167 ] 25] ] 334 1 419 1 501 1 One can find a table of fundamental units and representatives of ideal classes for 0 < m < 2025 in E. L. Ince, Cycles of reduced ideals in quadratic fields, Royal Society, London, 1968. (II) Class Numbers of Imaginary Quadratic Fields (347 Quadratic Fields) Let k = Q(v=m ), where m is a positive integer without square factor (1 .;; m .;; 509). h is the class number of k. In the present case, there is no distinction between the class numbers in the wider and narrow senses. m h m h m h m h m h m h m h m h 1 1 65 8 129 12 193 4 255 12 319 10 389 22 447 14 2 1 66 8 130 4 194 20 257 16 321 20 390 16 449 20 3 1 67 1 131 5 195 4 258 8 322 8 391 14 451 6 5 2 69 8 133 4 197 10 259 4 323 4 393 12 453 12 6 2 70 4 134 14 199 9 262 6 326 22 394 10 454 14 7 1 71 7 137 8 201 12 263 13 327 12 395 8 455 20 10 2 73 4 138 8 202 6 265 8 329 24 397 6 457 8 11 1 74 10 139 3 203 4 266 20 330 8 398 20 458 26 
App. B, Table 4.III Class Numbers of Algebraic Number Fields 1824 m h m h m h m h m h m h m h m 13 2 77 8 141 8 205 8 267 2 331 3 399 16 461 14 4 78 4 142 4 206 20 269 22 334 12 401 20 462 15 2 79 5 143 10 209 20 271 11 335 18 402 16 463 17 4 82 4 145 8 210 8 273 8 337 8 403 2 465 19 1 83 3 146 16 211 3 274 12 339 6 406 16 466 21 4 85 4 149 14 213 8 277 6 341 28 407 16 467 22 2 86 10 151 7 214 6 278 14 345 8 409 16 469 23 3 87 6 154 8 215 14 281 20 346 10 410 16 470 26 6 89 12 155 4 217 8 282 8 347 5 411 6 471 29 6 91 2 157 6 218 10 283 3 349 14 413 20 473 30 4 93 4 158 8 219 4 285 16 353 16 415 10 474 31 3 94 8 159 10 221 16 286 12 354 16 417 12 478 33 4 95 8 161 16 222 12 287 14 355 4 418 8 479 34 4 97 4 163 1 223 7 290 20 357 8 419 9 481 35 2 101 14 165 8 226 8 291 4 358 6 421 10 482 37 2 102 4 166 10 227 5 293 18 359 19 422 10 483 38 6 103 5 167 11 229 10 295 8 362 18 426 24 485 39 4 105 8 170 12 230 20 298 6 365 20 427 2 487 41 8 106 6 173 14 231 12 299 8 366 12 429 16 489 42 4 107 3 174 12 233 12 301 8 367 9 430 12 491 43 1 109 6 177 4 235 2 302 12 370 12 431 21 493 46 4 110 12 178 8 237 12 303 10 371 8 433 12 494 47 5 111 8 179 5 238 8 305 16 373 10 434 24 497 51 2 113 8 181 10 239 15 307 3 374 28 435 4 498 53 6 114 8 182 12 241 12 309 12 377 16 437 20 499 55 4 115 2 183 8 246 12 310 8 379 3 438 8 501 57 4 118 6 185 16 247 6 311 19 381 20 439 15 502 58 2 119 10 186 12 249 12 313 8 382 8 442 8 503 59 3 122 10 187 2 251 7 314 26 383 17 443 5 505 61 6 123 2 190 4 253 4 317 10 385 8 445 8 506 62 8 127 5 191 13 254 16 318 12 386 20 446 32 509 h 30 8 7 16 8 7 16 20 16 12 20 8 25 16 20 4 20 7 20 9 12 28 24 8 3 16 14 21 8 28 30 There are only 9 instances of m for which h = 1, and only 18 instances of m for which h = 2 (Baker, Stark). An these cases are in this table. One can find a table of structures of the ideal class groups and representatives of ideal classes for m < 24000 in H. Wada, A table of ideal class groups of imaginary quadratic fields, Proc. Japan Acad., 46 (1970), 401-403. (III) Class Numbers of Cyclotomic Fields Cyclotomic field k = Q(e 2ni / l ) (1 < k 100; I prime). hI is the first factor of the class number of k ( 14 Algebraic Number Fields). I hI I hI I hI I hi I hI I hI 3 1 13 1 29 2 3 43 211 61 41.1861 79 5.53.37791 I 5 1 17 1 31 3 2 47 5.139 67 67.12739 83 3.279405653 7 1 19 1 37 37 53 4889 71 7 2 . 79241 89 113.118401 11 1 23 3 41 11 2 59 3.59.233 73 89.134353 97 577.3457.20 449 6209 hi> 1 for I> 19 (Uchida). 5. Characters of Finite Groups; Crystallographic Groups (I) Symmetric Groups Sn, Alternating Groups An (3.;; n';; 7), and Mathieu Groups M n (n = 11,12,22,23,24) (1) In each table, the first column gives the representation of the conjugate class as we represent a permutation by the product of cyclic permutations. For example, (3)(2)2 means the conjugate class containing (123)(45)(67). 
1825 App. B, Table S.I Finite Groups; Crystallographic Groups (2) The second column gives the order of the centralizer of the elements of the conjugate class. (3) In the table of Sn' the first row gives the type of Young diagram corresponding to each irreducible character. For example, [3,2 2 , 1] means T (3,2,2, I). (4) In the table of An' when we restrict the self-conjugate character of Sn (the character with *) to An' it is decomposed into two mutually algebraically conjugate irreducible characters, and therefore we show only one of them. The other irreducible character of An is given by the restriction to An of the character of Sn that is not self-conjugate. (5) In the table of Mn' each character with a bar over the degree is one of the two mutually algebraically conjugate characters. fi = ( - 1::t V-=3 ) /2, f{ =(1::t VS )/2, f3 = (-I::t --..r=7 )/2 fl = ( - l::t V -II )/2, fs=(-I::tV -15 )/2, fl =( -I::t V -23 )/2 S3 [3] [2,1]* [13] A3 [2,1]* (I) 6 2 1 (I) 3 (2) 2 0 -1 (3) 3 ft (3) 3 -I I (3) 3 fl S4 [4] [3,1] [2 2 ]* [2,1 2 ] [14] A4 [2 2 ]* (I) 24 3 2 3 I (1) 12 (2) 4 1 0 -1 -1 (3) 3 ft (3) 3 0 -1 0 1 (3) 3 f1 (4) 4 -1 0 1 -1 (2f 4 (2)2 8 -1 2 -1 1 Ss [5] [4,1] [3,2] [3,12]* [2 2 ,1] [2,1 3 ] [ IS] (I) 120 1 4 5 6 5 4 1 (2) 12 1 2 1 0 -1 -2 -1 (3) 6 1 1 -1 0 -1 I 1 (4) 4 1 0 -1 0 1 0 -1 (2)2 8 1 0 1 -2 1 0 1 (3)(2) 6 1 -1 1 0 -1 I -1 (5) 5 1 -1 0 1 0 -I I As [3,12]* (I) 60 3 (3) 3 0 (2f 4 -1 (5) 5 fi (5) 5 f2 S6 [6] [5, I] [4,2] [4,12] [32] [3,2,1]* [2 3 ] [3,1 3 ] [2 2 ,1 2 ] [2,1 4 ] [16] (I) 720 1 5 9 10 5 16 5 10 9 5 1 (2) 48 1 3 3 2 1 0 -1 -2 -3 -3 -1 (3) 18 1 2 0 1 -1 -2 -1 1 0 2 1 (4) 8 1 1 -1 0 -1 0 1 0 1 -1 -1 (2f 16 1 1 1 -2 1 0 1 -2 1 1 1 (3)(2) 6 1 0 0 -1 1 0 -1 1 0 0 -1 (5) 5 1 0 -1 0 0 1 0 0 -1 0 1 (6) 6 I -1 0 1 0 0 0 -1 0 1 -1 (4)(2) 8 1 -1 1 0 -1 0 -1 0 1 -1 1 (2)3 48 1 -1 3 -2 -3 0 3 2 -3 1 -1 (3)2 18 1 -1 0 1 2 -2 2 1 0 -1 1 
App. B, Table S.I 1826 Finite Groups; Crystallographic Groups A6 [3,2,1]* (1) 360 8 (3) 9 -1 (2)2 8 0 (5) 5 f+ 2 (5) 5 f2 (4)(2) 4 0 (W 9 -1 8 7 [7] [6, I] [5,2] [5,1 2 ] [4,3] [4,2, I] [32, I] [4.13]. [3,2 2 ] [3,2,12] [2 3 , I] [3,1 4 ] [22,1 3 ] [2, I'] [17] (I) 5040 6 14 15 14 35 21 20 21 35 14 15 14 6 I (2) 240 4 6 5 4 5 I 0 -I -5 -4 -5 -6 -4 -I (3) 72 3 2 3 -I -I -3 2 -3 -I -I 3 2 3 I (4) 24 2 0 I -2 -I -I 0 I I 2 -I 0 -2 -I (2)2 48 2 2 -I 2 -I I -4 I -I 2 -I 2 2 I (3)(2) 12 I 0 -I I -I I 0 -I I -I I 0 -I -I (5) 10 I -I 0 -I 0 I 0 I 0 -I 0 -I I I (6) 6 0 -I 0 0 I 0 0 0 -I 0 0 I 0 -I (4)(2) 8 0 0 -I 0 I -I 0 -I I 0 -I 0 0 I (2)3 48 0 2 -3 0 I -3 0 3 -I 0 3 -2 0 -I (3)2 18 0 -I 0 2 -I 0 2 0 -I 2 0 -I 0 I (5)(2) 10 -I I 0 -I 0 I 0 -I 0 I 0 -I I -I (3)(2)2 24 -I 2 -I -I -I I 2 I -I -I -I 2 -I I (4)(3) 12 -I 0 I I -I -I 0 I I -I -I 0 I -I (7) 7 -I 0 I 0 0 0 -I 0 0 0 I 0 -I I A7 [4,1 3 ]* (1) 2520 \0 (3) 36 1 (2)2 24 -2 (5) 5 0 (4)(2) 4 0 (W 9 1 (3)(2)2 12 I (7) 7 f3+ (7) 7 f3 MIl (1) g \0 11 55 45 44 16 \0 (2)4 48 2 3 -1 -3 4 0 -2 (4)2 8 2 -1 -1 I 0 0 0 (3i 18 1 2 1 0 -1 -2 I (5)2 5 0 1 0 0 -1 1 0 (8)(2) 8 0 -I 1 -1 0 0 :k.iVi (8)(2) 8 0 -1 1 -1 0 0 +iVi (6)(3)(2) 6 -I 0 -1 0 I 0 (11 ) 11 -1 0 0 I 0 f+ -1 4 (11 ) 11 I -1 0 0 I 0 f4 -1 g= 11.\0.9.8=7920. M 12 (1) g I 11 11 55 55 55 45 54 66 99 120 144 176 16 (2)4 192 I 3 3 -1 -1 7 -3 6 2 3 -8 0 0 0 (4)2 32 I 3 -1 3 -1 -1 I 2 -2 -1 0 0 0 0 (3)3 54 1 2 2 1 I 1 0 0 3 0 3 0 -4 -2 (5)2 \0 1 I 1 0 0 0 0 -1 I -1 0 -[ 1 I (8)(2) 8 1 1 -1 -I 1 -1 -1 0. 0 1 0 0 0 0 (6)(3)(2) 6 I 0 0 -1 -1 1 0 0 -1 0 1 0 0 0 (11) 11 1 0 0 0 0 0 1 -1 0 0 -I 1 0 f4+ (11) 11 1 0 0 0 0 0 1 -I 0 0 -1 1 0 f4 (2)6 240 1 -1 -1 -5 -5 -5 5 6 6 -1 0 4 -4 4 ( \0)(2) \0 I -I -I 0 0 0 0 1 I -1 0 -] 1 -I ( 4)\2i 32 I -I 3 -1 3 -1 1 2 -2 -1 0 0 0 0 (3)4 36 1 -1 -1 1 1 I 3 0 0 3 0 -:' -1 1 (6)2 12 1 -1 -I 1 1 1 -I 0 0 -1 0 1 -1 1 (8)(4) 8 1 -I I 1 -1 -1 -1 0 0 1 0 0 0 0 g= 12.11.\0.9.8 = 95040. 
1827 App. B, Table 5.1 Finite Groups; Crystallographic Groups M 22 (1) g 1 21 55 154 210 280 231 385 99 45 (2)8 384 1 5 7 10 2 -8 7 1 3 -3 (3)6 36 1 3 1 I 3 1 -3 -2 0 0 (5)4 5 1 I 0 -I 0 0 I 0 -1 0 (4)4(2i 16 1 1 -1 2 -2 0 -I 1 -I 1 (4)4(2)2 32 1 I 3 -2 -2 0 -1 1 3 I (7») 7 1 0 -1 0 0 0 0 0 1 E)+ (7i 7 1 0 -1 0 0 0 0 0 1 E)- (8)2(4)(2) 8 1 -1 I 0 0 0 -1 I -1 -1 ( 6i(W(2)2 12 I -1 1 1 -1 1 1 -2 0 0 (11f 11 I -I 0 0 1 E4+ 0 0 0 I (11)2 11 1 -1 0 0 1 E" 0 0 0 1 g=22.21.20.48=443520. M23 (1) g 1 22 230 231 770 1035 2024 45 990 231 253 896 (2)8 2688 1 6 22 7 -14 27 8 -3 -18 7 13 0 (3)6 180 1 4 5 6 5 0 -I 0 0 -3 1 -4 (5)4 15 1 2 0 1 0 0 -1 0 0 I -2 1 ( 4)4(2)2 32 1 2 2 -I -2 -I 0 I 2 -1 I 0 (7») 14 1 1 -1 0 0 -1 1 E{ E{ 0 I 0 (7») 14 1 1 -1 0 0 -1 1 E)- E3 0 1 0 (8)2(4)(2) 8 1 0 0 -1 0 1 0 -1 0 -1 -1 0 (6f(3f(2)2 12 1 0 1 -2 1 0 -1 0 0 1 1 0 (11)2 11 I 0 -I 0 0 I 0 I 0 0 0 E4+ ( 11 )2 1 I 1 0 -1 0 0 I 0 1 0 0 0 E4 ( 15)(5)(3) 15 1 -I 0 I 0 0 -I 0 0 ES+ 1 I (15)( 5)(3) 15 1 -1 0 1 0 0 -1 0 0 ES- 1 1 (\4)(7)(2) 14 1 -I I 0 0 -I 1 -E{ E)+ 0 -I 0 ( 14)(7)(2) 14 1 -1 1 0 0 -1 1 -E3 E3 0 -1 0 (23) 23 1 -1 0 I ('1 0 0 -I 1 1 0 -1 (23) 23 1 -1 0 1 E6- 0 0 -I 1 1 0 -1 g=23' 22.21' 20'48 = 10200960. M 24 ( 1)24 g I 23 7.36 23,11 23.77 55.64 45 22.45 23.45 23.45 11.21 770 (2)8 21.2 10 I 7 28 13 -21 64 -3 -18 -21 27 7 -14 (3)6 27.40 I 5 9 10 16 10 0 0 0 0 -3 5 (5)4 60 I 3 2 3 I 0 0 0 0 0 I 0 (4)4(2)2 128 I 3 4 I -5 0 I 2 3 -I -I -2 (7)3 42 I 2 0 I 0 -I '3+ '3+ 2'3+ -I 0 0 (7)3 42 I 2 0 I 0 -I '3- '3 2'3 -I 0 0 (8)2(4)(2) 16 I I 0 -I -I 0 -I 0 -I I -I 0 (6)2(3)2(2)2 24 I I I -2 0 -2 0 0 0 0 I I (II )2 II I I -1 0 0 0 I 0 I I 0 0 ( 15)(5)(3) 15 I 0 -I 0 I 0 0 0 0 0 '5+ 0 ( 15)(5)(3) 15 I 0 -I 0 I 0 0 0 0 0 '5- 0 ( 14)(7)(2) 14 I 0 0 -I 0 I -£3 '1' 0 -I 0 0 ( 14)(7)(2) 14 I 0 0 -I 0 I -£; '3- 0 -I 0 0 (23) 23 I 0 -1 0 0 I -I I 0 0 I '6+ (23) 23 I 0 -I 0 0 I -I I 0 0 I '6- (12)2 12 I -I 0 I -I 0 I I -I 0 0 I (6)4 24 I -I 0 I -1 0 -1 -1 1 2 0 I (4)6 96 I -I 0 I -I 0 I -2 -I 3 3 -2 (W 7.72 I -I 0 I 7 -8 3 3 -3 6 0 -7 (2)12 15.2 9 1 -1 12 -11 11 0 5 -10 -5 35 -9 10 (10)2(2)2 20 I -I 2 -I I 0 0 0 0 0 I 0 (21)(3) 21 I -I 0 I 0 -I '3- '3- 0+ -I 0 0 - 3 (21)(3) 21 I -I 0 I 0 -I E3+ E3+ -03 -I 0 0 (4)4(2)4 3.2 7 I -I 4 -3 3 0 -3 6 3 3 -I 2 (12)(6)(4)(2) 12 I -I I 0 0 0 0 0 0 0 -I -I 
App. B, Table S.II 1828 Finite Groups; Crystallographic Groups (1)24 g 23.21 23.55 23.88 23.99 23.144 23.11.21 23.7.36 77.72 11.35.27 (2)8 21.2 10 35 49 8 21 48 49 -28 -56 -21 (3)6 27.40 6 5 -I 0 0 -15 -9 9 0 (5)4 60 -2 0 -1 -3 -3 3 1 -I 0 (4)4(2? 128 3 1 0 1 0 -3 4 0 -1 (7)3 42 0 -2 1 2 I 0 0 0 0 (7)3 42 0 -2 1 2 I 0 0 0 0 (8)2(4)(2) 16 -I 1 0 -1 0 -I 0 0 I (6)2(3?(2)2 24 2 1 -1 0 0 1 -1 I 0 (II? 11 -1 0 0 0 1 0 -1 0 0 (15)(5)(3) 15 1 0 -1 0 0 0 I -1 0 (15)(5)(3) 15 1 0 -1 0 0 0 I -1 0 (14)(7)(2) 14 0 0 1 0 -1 0 0 0 0 (14)(7)(2) 14 0 0 1 0 -1 0 0 0 0 (23) 23 0 0 0 0 0 0 0 1 -1 (23) 23 0 0 0 0 0 0 0 1 -1 (12? 12 0 0 0 0 0 0 0 0 0 (6)4 24 0 0 0 2 -2 0 0 0 0 (4)6 96 3 -3 0 -3 0 -3 0 0 3 (3)8 7.72 0 8 8 6 -6 0 0 0 0 (2)12 15.2 9 3 -15 24 -19 16 9 36 24 -45 ( 1O?(2? 20 -2 0 -1 1 1 -I I -I 0 (21 )(3) 21 0 I I -1 I 0 0 0 0 (21)(3) 21 0 1 1 -1 1 0 0 0 0 (4)4(2)4 3.2 7 3 -7 8 -3 0 I -4 -8 3 (12)(6)(4)(2) 12 0 -1 -1 0 0 I -I 1 0 g=24' 23'22.21.20. 48 = 244823040. (II) General Linear Groups G L(2, q), Unitary Groups U (2, q), and Special Linear Groups SL(2, q) (q is a power of a prime) ( 151 Finite Groups I) (1) The notations are as follows. e=exp[2'1TV-=T /(q-1)], 17 = exp[2'1Tv=T /(q2-1)], a=exp[2'1Tv=T /(q + 1)], P is the generator of the multiplicative group of GF(q) - {O}, '" is the generator of the multiplicative group of GF(q2)- {O}, ",q-l = a, B is an element of GL(2,q) with order q2 -1, and B 1 = Bq-l. (2) The first column gives a representative of the conjugate class. General Linear Group GL(2,q). X n (1) Xn(q) Ym.n Zn ( p a pa) e 2na qe 2na (q+ l)e(m+n)a (q _ I)l7 na(q+ I) ( PIa pa) e 2na 0 e(m+n)a _17 na(q+ 1) ( pa pb) en(a+b) en(a+b) ema+nb + e mb + na 0 Be e ne _Ene 0 _(l7ne + l7ne q ) (1) I <;;;a<;;;q-I, 1 <;;;b<;;;q-1, a¥,b(modq-1), l<;;;c<q2-1, cO(modq+l). (2) We assume that 1 <;;; n <;;; q-l, for XA1),X n (q), 1 <;;; m < n <;;; q-l, for Y m n' 1 <;;; n < q2-1 for Zn, n¥,O(modq+ 1). Here, Zn = Zn' when n= n' q(modq2- 1). . Unitary Group U(2,q). X(I) X(q) Y,n 7' 'n ( as as) a 2 '" qa 2 '" (q_1)a(m+n)s (q+ 1) a'" ( s as) a 2", 0 _ a(m+n)s a'" ( as at) an(s+t) _ an(s+t) _ (a ms + nt + a mt +",) 0 ( ",u ",-u q ) a- nu a- nu 0 l7nu+ 17 -nuq 
1829 App. B, Table 5.III Finite Gronps; Crystallographic Gronps (I) ( as ) , ( w u _ ) are the canonical forms of an element of U(2, q) in GL(2, q2). I as w uq (2)I.;;s.;;q+l, l.;;t.;;q+l, st(modq+l), l.;;u<q2_1, uO(modq-I).Whenu= - u'q (modq2-1) u, u ' gives the same conjugate class. (3) The ranges are 1.;; n';; q + I for X(I), X(q), I.;; m < n';; q + I for Y';',m I.;; n < q2_1 for Z, nO (modq-I). When n'= -nq (modq2-1), we have Z=Z,. Special Linear Gronp SL(2,2 n ) (the case when q = 2"). Y n Zm e I) q q+1 q-I (: I) 0 -I ( pa p -a ) fna+f- na 0 Bf -I 0 _(amc+a- mc ) (I) l.;;a.;;(q-2)/2, l,;;c';;q/2. (2) I.;; n';; (q-2)/2, I.;; m.;; q/2. Special Linear Gronp SL(2,q) (q = power of an odd prime number, e= (q -1)/2, e ' =(q + 1)/2). Y n fna+ f- na Zm q-I q+1 q-I --r- --r- q+1 q-I (-I)m(q-I) (-IY- (-I)" --r- 2 -I p.:!:: i\:!:: -I p.+ i\+ - ( - I)m ( - l)ep. :!:: (-I)"i\:!:: - ( - I)m ( - IYp. + (-Iti\ + 0 ( -I)a 0 _(amc+a-mc) 0 -(-It e I) z=(-I -I) P 1 =(: I) P 2 = (1 d P1Z P 2 Z (pa p-a) q q+ I q (-I)"(q+l) o o o (-I)" o (-I)" BC 1 -I 0 (1) 1 a(q-3)/2, 1 c(q-1)/2, 1 n(q- 3)/2, 1m(q-1)/2, A:t = { - 1 :t [( _l)e q] 1/2} /2, Jl.:t ={1 :t[(-1)"q]1/2}/2. (2) The last two columns mean two characters (with the same signs), respectively. (III) Ree gronp Re(q), Suzuki Gronp Sz(q), and Janko Gronp J. Ree group Re( q) (q = 3 2n + 1 = 3m 2 ). The order of Re(q) is q3(q3+ I)(q -I), qo= q2_ q + I, m+ = q+3m+ I, m_ = q -3m + 1. A B C Xp. I I I qo q3 qqo (q-l)mm+/2 (q-l)mm_/2 m(q2-1) q3+ I J 2 I -I q -q -(q -1)/2 (q-I)/2 0 q+1 X 3 I -(q-I) 0 q -(q+m)/2 (q-m)/2 -m I Y 9 1 I 0 0 m m -m I T 3 I I 0 0 a a 2a I T- 1 3 1 I 0 0 & & 2& I YT 9 I I 0 0 {3 {3 -{3 I YT- 1 9 I I 0 0 if {3 -{3 I JT 6 I -I 0 0 y -y 0 I JT- 1 6 I -I 0 0 y -'1 0 I R a I I I I 0 0 0 pIW+ p -IW Sb I 3 -I -3 I -I 0 0 JR a I -I I -I 0 0 0 pIW+ p -IW JS b 1 -I -I I I -I 0 0 V S I 0 -I 0 -I 0 -I 0 WI I 0 -I 0 0 I I 0 
App. B, Table 5.III 1830 Finite Groups; Crystallographic Groups X' Y v Y Z. Z' I' T 1 1 q3+ 1 (q-l)qo (q-l)qo (q2-1)m+ (q2-'I)m_ J 2 -(q+ 1) 3( q - 1) -(q-1) 0 0 X 3 1 2q-l 2q-l -m+ -m_ y 9 1 -1 -1 -1 --1 T 3 1 -1 -1 -3m-l 3m-l T- 1 3 1 -1 -1 -3m-l 3m-l YT 9 1 -1 -1 -1 --1 YT- 1 9 1 -1 -1 -1 -- 1 JT 6 -1 -3 1 0 0 JT- 1 6 -1 -3 1 0 0 R a pl'"'+p-I'"' 0 0 0 0 Sb 0 a(rb) a'(Ab) 0 0 JR a _(pl'"'+p -I'"') 0 0 0 0 JS b 0 a(rb) a' (Ab) 0 () 2 V S 0 0 0 _  (VKSqi+V-KSq) I) i=O 2 WI 0 0 0 0 _  (WTlq'+W-Tlq) i=O (1) The first column gives a representative of conjugate class, and the second column gives its order. The orders of R, S, V, Ware (q-l)/2, (q+ 1)/4, m_, m+, respectively. R, S, T are commuta- tive with J. (2) Ra_R-a, V'- v sq _ v sq > _ V- s _ V- sq _ V- Sq >, W'- wtq_ w'q> _ W-'- w-'q- I'V-'q>. Here we fix an integer b satisfying b 3 =: 1 [mod(q+ 1)14], (b-l, (q+ 1)/4)= 1. Sb_Sb8_Sb8 2 _S -b_S -b8_S -b6 2 , JRa_JR -a, JSb-JS -b, where A - B means that A and Bare mutual1y conjugate. (3) p=exp[4'ITv=I /(q-l)], v=exp(hV-=T /m_), w=exp(2'ITV-=T /m+), a=exp[8'ITv=T /(q+ 1)]. (4) 1 <;; J.t<;;(q-3)/4, 1 <;;A<;;(q-3)/8. Here r is considered mod(q+ 1)/4 and Y v = Y v8 = Y v8 2= Y -v = Y -v8= Y -v8>' K is considered modm_ and Z.= Z.q= Z.q2= Z_.= Z_.q= Z_.q2, T is considered modm+ and Z; = Z;q = Z;q' = Z  T = Z  Tq = Z  Tq2. 2 I (5) a(rb)= -  (a vb8 '+a- vb6 '), a'(Ab) =  (a'\b8i+a-'\b8')_(a'\b6'+a-M82). i=O i=O -m+mV-=q - m- V-=q I-V-=q (6) a = 2 ' f3 = 2 ' y = 2 We show one of the two rnutu;:tl1y complex conjugate characters, for the characters A, B, C. Suzuki group Sz (q). The order of Sz( q) is q2( q2 + l)(q - 1) (q = 2 2n +), 2q = r 2 ). x. Y p Zy q2 q2+ I (q-r+ l)(q-I) (q+ r+ I)(q-I) r(q-I)/2 r(q-I)/2 " 0 I r-} -r-} -r/2 -r/2 p 0 I -I -I ,v-=I /2 -rvCI /2 p-' 0 I -I -I -,v-=-1/2 ,v - I /2 '1Td I foa; + fO- cr.i 0 0 0 0 'lT J -I 0 - (ffJ + ffJq + f\-PJ + f\-PJq) 0 I I \ 'lT2 k -I 0 0 -(fI k + fI kq + f2- yk + f2- ykq ) -I -I (1) The first column gives a representative of the conjugate class. (2) 'lT0, 'IT), 'lT2 are the elements of order q - 1 q + r + 1, q - r + 1, respectively. (3) fO' f l , f2 are the primitive q - 1, q + r+ 1, q - r+ 1 roots of 1, respectively. (4) 'lTo and 'lTOi are mutual1y conjugate elements, and hence Xa and X -a give the same (;haracter. i, a run over the representatives of mod q-1, and i,a¥=O (modq-1). 
1831 App. B, Table 5.II1 Finite Groups; Crystallographic Groups (5) 'II'{, '11'1-)' 'II'{q, 'll'1-)q are mutually conjugate, and hence YjJ' Y _jJ' YjJq, Y _jJq give the same character. j, {3 run over the representatives of modq+ r+ I, and j, {3 =/= 0 (modq + r + 1). (6) 'II'{, 'll'2- k , 'll'fq, 'll'2- kq are mutually conjugate, and hence Zy' Z_y, Zyq, Z_yq give the same character. k, y run over the representatives of mod q- r+ 1, and k, y=/=O (mod q- r+ 1). Janko Group J. I I 77 133 209 133 77 77 133 76 76 56 56 120 120 120 2 1 5 5 1 -3 -3 -3 -3 4 -4 0 0 0 0 0 3 1 -1 1 -1 -2 2 2 -2 1 1 2 2 0 0 0 5 1 2 -2 -1 f+ - f+ -f f 1 1 2f- 2f+ 0 0 0 5 1 2 -2 -1 f -f -f+ f+ 1 1 2f+ 2f- 0 0 0 6 1 -1 -1 1 0 0 0 0 1 -1 0 0 0 0 0 7 1 0 0 -1 0 0 0 0 -1 -1 0 0 1 1 1 10 1 0 0 1 -f+ -€+ -€ -€ -1 1 0 0 0 0 0 10 1 0 0 1 -f -€ -f+ -e+ -1 1 0 0 0 0 0 11 1 0 1 0 0 0 -1 -1 1 1 -1 -1 -1 15 I -1 1 -1 €+ -€+ 1 1 -f - -€+ 0 0 0 -f f 15 1 -1 1 -1 f -f -f+ f+ 1 1 -f+ -f 0 0 0 19 1 1 0 0 0 0 0 0 -1 -1 i\.1 i\.2 i\.3 19 1 1 0 0 0 0 0 0 -1 -1 i\.2 i\.3 i\.1 19 1 1 0 0 0 0 0 0 -1 -1 i\.3 i\.1 i\.2 (1) The order of J is 8.3.5.7.11,19= 175560. (2) The first column gives the order of the elements of each conjugate class. (3) p=exp(2'11'v=T /19), i\.1 =p+p7 +pS+pll+ p I2+ p IS, i\.2 = p2+pI4+ p16+ p3+ pS+ p 17, i\.3 = p4+ p 9+ p13 + p6+ plO+pIS, f:!:: =(1:t V5 )/2. References For Sn(2';; n .;; 10), A n (3 .;; n .;; 9): [1] D. E. Littlewood. The theory of group characters, second edition, Oxford Univ. Press, 1950. For Sn(n = 11, 12, 13): [2] M. Zia-ud-Din, Proc. London Math. Soc., 39(1935), 200-204, 42 (1937), 340-355. For S14: [3] K. Kondo, Table of characters of the symmetric group of degree 14, Proc. Phys. Math. Soc. Japan, 22 (1940),585-593. For M n (n= 12,24): [4] G. Frobenius, Uber die Charaktere der mehrfach transitive Gruppen, S. B. Preuss. Akad. Wiss., 1904, 558-571. For M n (n= 11,22,23): [5] N. Burgoyne and P. Fong, The Schur multipliers of the Mathieu groups, Nagoya Math. 1.. 27 (1966), 733-745. For GL(2,q), SL(2,q): [6] 1. Schur, Untersuchungen iiber die Darstellung der endlichen Gruppen durch gebrochene lineare Substitutionen, J. Reine Angew. Math., 132 (1907),85-137. [7] H. E. Jordan, Group characters of various types of linear groups, Amer. J. Math., 29 (1907), 387-405. For GL(3,q), GL(4,q), GL(n,q): [8] R. Steinberg, The representations of GL(3,q), GL(4,q), PGL(3,q) and PGL(4,q). Canad. J. Math. 3 (1951), 225-235. [9] 1. A. Green, The characters of the finite general linear groups, Trans. Amer. Math. Soc., 80 (1955), 402-447. For U(2,q), U(3,q): [10] V. Ennola, On the characters of the finite unitary groups, Ann. Acad. Sci. Fenn., 323 (1963). 1-34. For Sz(q), Re(q), J: [II] M. Suzuki, On a class of doubly transitive groups, Ann. of Math., (2) 75 (1962), 105-145. [12) H. N. Ward, On Ree's series of simple groups, Trans. Amer. Math. Soc., 121 (1966),62-89. [13] Z. Janko, A new finite simple group with Abelian Sylow 2-subgroups and its characterization, J. Algebra, 3 (1966), 147-186. 
App. B, Table 5.IV Finite Groups; Crystallographic Groups 1832 For other Lie-type groups: [14] B. Srinivasan, The characters of the finite symplectic group Sp(4,q), Trans. Amer. Math. Soc., 131 (1968), 488-525. [15] H. Enomoto, The characters of the finite symplectic group Sp(4,q), q=2 f , Osaka J. Math., 9 (1972), 75-94. [16] G. 1. Lehrer, The characters of the finite special linear groups, J. Algebra, 26 (1973),564-583. [17] P. Deligne and G. Lusztig, Representations of reductive groups over finite fields, Ann. of Math., (2) 103 (1976), 103-161. For sporadic groups: [18] M. Hall, Jr. and D. Wales, The simple groups of order 604,800, J. Algebra, 9 (1968), 417-450. . [19] J. S. Frame, Computation of characters of the Hi8man-Sims group and its amomorphism group, J. Algebra, 20 (1972),320-349. [20] D. Fendel, A characterization of Conway's group 3, J. Algebra, 24 (1973), 159-196. [21] D. Wright, The irreducible characters of the simple group of M. Suzuki of order 448,345, 497,600, J. Algebra, 29 (1974), 303-323. (IV) Three-Dimensioual Crystal Classes ( 92 Crystallographic Groups) Crystal System Geometric Crystal Classes Arithmetic Crystal Classes Number of Bravais Types Schoenflies International Space GroupS(2) Notation Notation(') Short Full Tric1inic C, 1 1 (P,I) 1 P S2(C;) I I (P,I) 2 Monoclinic C 2 2 2 (P,2) (C,2) 3-5 P,C C'h m m (P, m)(C, m) 6-9 2/m 2 (P,2/m)(C,2/m) C 2h - 10-15 m Orthorhombic D 2 (V) 222 222 (P, 222) (C, 222) (F, 222) (I, 222) 16-24 P,C,F,I C 2v mm2 mm2 (P,mm2) (C,mm2) (A,mm2) (F,mm2) (/,mm2) 25-46 D 2h (VJ 222 mmm --- (P, mmm) (C, mmm) (F, mmm) (1, mmm) 47-74 mmm Tetragonal C 4 4 4 (P, 4)(3) (1,4) 75-80 P,I S4 4 4 (P,4) (/,4) 81-82 4 C 4h 4/m - (P,4/m) (/,4/m) 83-88 m D4 422 422 (P,422)(4) (/,422) 89-98 C 4v 4mm 4mm (P,4mm) (1,4mm) 99-110 D 2d (V d ) 42m 42m (P,42m) (P,4m2) (I, 4m2) (I, 42m) H 1-122 D 4h 4/mmm 422 --- (P,4/mmm) (I, 4/mmm) 123-142 mmm Trigonal C 3 3 3 (P, 3)(5) (R, 3) 143-146 P,R S6(C 3 ;) 3 3 (P,3)(R,3) 147-148 D3 32 32 (P, 312)(6) (P, 321)(1) (R, 32) 149-155 C 3v 3m 3m (P, 3ml) (P, 31m) (R, 3m) IS6-161 -2 (P, 31m) (P, 3ml) (R,3m) D3d 3m 3- 162-167 m Hexagonal C 6 6 6 (P,6)(8) 168-173 P C 3h 6 6 (P,6) -174 6 C 6h 6/m - (P,6/m) 175-176 m D6 622 622 (P,622)(9) 177-182 
1833 App. B, Table 6 Miscellaneous Constants Crystal System Geometric Crystal Classes Arithmetic Crystal Classes Number of Bravais Types Schoenflies International Space Groups(2) Notation Notation(l) Short Fun Hexagonal C 6v 6mm 6mm (P,6mm) 183-186 P D 3h 6m2 6m2 (P,6m2) (P,62m) 187-190 (cont.) 622 (P,6/mmm) 191-194 D 6h 6/mmm --- mmm Cubic T 23 23 (P, 23) (F, 23) (1, 23) 195-199 2- (P, m3) (F, m3) (1, m3) 200-206 P,F,1 Th m3 -3 m 0 432 432 (P, 432)(10) (F, 432) (1,432) 207-214 Td 43m 43m (P,43m) (F,43m) (1,43m) 215-220 4-2 (P, m3m) (F, m3m) (1, m3m) 221-230 Oh m3m -3- m m Notes (1) The notation is based upon International tablesfor X-ray crystallography I, Kynoch, 1969. In each crystal system, the lowest class is a holohedry. (2) These correspond to the consecutive numbers of space groups in the book cited in (1). (3)-(10) Enantiomorphic pairs arise from these classes: two pairs for (4), (8), (9), and one pair for the others. For the shapes of Bravais lattices  92 Crystal10graphic Groups E, Fig. 3. 6. Miscellaneous Constants Y2 = 1.414213562373095, YlO =3.162277660168379. 12 = 1.2599210498 94873, 1 100 =4.641588833612779. log I0 2=0.30102 99956 63981 = 1/3.321928094887364. (I) Base of Natural Logarithm e (1000 decimals) e=2.71828 18284590452353602874713526624977572 4709369995957496696762772 40766 30353 54759 45713 82178 525166427427466391932003059921 81741 3596629043572900334295260 59563 07381 323286279434907 63233 82988 07531 95251 01901 15738 34187 93070 21540 89149 93488 41675 09244 76146 06680 82264 80016 84774 1\853 74234 54424 37107 5390777449 92069 55170 27618 3860626133 13845 83000 75204 49338 26560 29760 67371 13200 70932 87091 27443 74704 72306 96977 20931 01416 9283681902 55151 0865746377 21\ 12 52389 78442 50569 53696 77078 54499 69967 94686 44549 05987 9316368892300987931277361 782154249992295763514822082698951936680331825 28869 39849 64651 0582093923 98294 88793 3203625094431\7 30123 81970 68416 14039 70198 37679 32068 32823 76464 80429 531180232878250981945581530175671736133206981 1250996181 88159 3041690351 598888519345807 27386 67385 89422 87922 84998 92086 80582 57492 79610 48419 84443 63463 24496 848756023362482704197862320900 2160990235304369941849164 31409 34317 38143 64054 62531 52096 18369088870701676839 64243 78140 59271 45635 49061 30310 72085 10383 75051 0115747704 17189 86106 87396 96552 12671 5468895703 50354. e (in octal) = 2.55760 52130505355. l/e=0.36787 9441171442, e 2 =7.38905 60989 30650= 1/0.13533 52832 36613, Ye = 1.64872 1270700128= 1/0.60653 0659712633. log, 10= 2.302585092994046= 1/0.434294481903252, log,2 = 0.6931471805 59945 = 1/1.4426950408 88964. (II) The Number'/r (1000 decima'ls) ( 328 Pi('/r» 
App. B, Table 7 Coefficients of Polynomial Approximations 1834 n=3.14159 265358979323846264338327950288419716939937510 58209 74944 59230 78164 06286 20899 8628034825 34211 70679 8214808651 32823 066470938446095 5058223172 53594 08128 48111 74502 841027019385211 055596446229489549303819644288 10975665933446128475648233786783165 27120190914564856692346034861045432664821339360726 024914127372458700660631558817 48815 20920 96282 92540 91715 3643678925 90360 01133053054882046652 13841 46951 94151 16094 33057270365759591953092186117381932611793105118548 07446 23799 62749567351885752724 8912279381 83011 9491298336 7336244065 66430 86021 3949463952 24737 190702179860943 70277 053921717629317675238467481846766940513200056 81271452635608277857713427577896091 73637 17872 14684 40901 22495 3430146549 58537 1050792279 68925 8923542019 95611 21290 21960 864034418159813 62977 47713 09960 51870 72113 49999 99837 297804995105973173281609631859 50244 5945534690830264252230825334468503526193 11881 71010 00313 78387 52886 58753 32083 81420617177669147303 59825 349042875546873 1159562863882353787593751 95778 18577 80532 1712268066130019278766111959092164201989. '/1 (in octal) = 3.1103755242 10264 3. 1/r.=0.3l830 9886183791. '/12=9.869604401089359= 1/0.101321183642338, Y iT = 1.77245 38509 05516 = 1/0.56418 95835 47756, Y 2r. = 2.5066282746 31001 = 1/0.3989422804 01433, V-:")"i = 1.25331 41373 15500 = 1/0.79788 45608 02865, r; = 1.46459 1887561523 = 1/0.68278 40632 55296. loglO'/1 = 0.497149872694134, loge'/1 = 1.14472 98858 49400. (III) Radian rad I rad = 57° .29577 95130 82321 = 3437'.74677 07849 393 = 206264".8062470964. 1° = 0.0 1745 32925 19943 rad, I' = 0.000290888208666 rad, I" = 0.00000 48481 368 I I rad. (IV) Euler's Constant C (100 decimals) (174 Gamma Function) c= 0.57721 566490153286060 651209008240243 10421 5933593992 35988057672348848677 26777 66467 09369470632917467495. e C = 1.7810724179 9019798522. n I Sn= L -c. i=l I n Sn n Sn n Sn n Sn 3 1.83333 333 6 2.45000 000 15 3.31822899 100 5.18737 752 4 2.08333 333 8 2.71785714 20 3.59773 966 500 6.79282343 5 2.28333 333 10 2.92896 825 50 4.79920534 1000 7.48547086 7. Coefficients of Polynomial Approximations In this table, we give some typical examples of approximation formulas for computation of func- tions on a digital computer ( 19 Analog Computation, 336 Polynomial Approximation). (I) Exponential Function (I) Putting IO2 + I = q + Y + i ( q is an integer, -  .;;; y <  ), we have eX = 2 Q v(y), v(y)  La,yi, which gives an approximation by a polynomial of the 7th degree, where the maximal error is 3 X 10- 11 . a o = 0.70710 67811 6, a I = 0.490 12 90717 2, a2 = 0.16986 57957 2, a3 = 0.03924 73321 5, a4 = 0.0068009712, as = 0.0009428173, a 6 = 0.000 I 093869, a7 = 0.0000 I 0826. (2) An approximation by a polynomial of the II th degree: eX  Laix i (- I .;;; x <: 0) Maximal error I X 10- 12 . a o = 0.9999999999990, a] = 0.9999999999995, a2 = 0.50000 00000 747, a3 = 0.16666 66666 812, a 4 = 0.04166 66657 960, as = 0.00833 33332 174, a6=0.00138 88925 998, 0 7 =0.0001984130955, as =0.00002 47944428, a9=0.00000 27550 711, ajQ=O.OOOOO 02819 019, an =0.00000 00255791. 
1835 App. B, Table 7.11 Coefficients of Polynomial Approximations (3) eX=; 1 + x (-logY2 <: x.;;; logY2 ). x ko+ kjx Z + k 2 x 4 --+ 2 1 + k)x z Maximal error lAx 10- 14 . ko= 1.000000000000327 L k l =0.107135066456464 2, k z =0.00059 45898 69018 8, k3 = 0.02380 17331 574186. (II) Logarithmic Function (I) An approximation by a polynomial of the 11th degree: log(l + x)=;Laix i (0 <: x <: [). Maximal error 1.1 X 10- 10 . ao = 0.00000 00001 10, a l =0.999999965498, a) = 0.33329 85059 64, a4 = - 0.24963 72428 65, a6= -0.157448895413, a7=0.11712 9115618, a9 = 0.03469 74937 56, alO= -0.010468229569. aZ = - 0.499998253798, as= 0.19773 31015 60, ag = - 0.07364 03719 14, all=0.00148 1991722. . x - \/2  . , /- (2) For I.;;; x.;;; 2, and puttmg y=  (3+2Y2 ) (-1';;; y.;;; 1), then logx=; logv 2 + x+y2  a i yZi+ 1 gives an approximation by a polynomial of the II th degree (0';;; i <: 5), where the maximal error is 9.2 X IO- IS . ao=0.34314 57505 07610 6, a l =0.0033670892 56222 5, a Z = 0.00005 94707 04347 4, a 3 =0.00000 12504997762, a 4 =0.00000 00285682928, as = 0.00000 00007437139. (Ill) Trigonometric Functions x q r Z .. 0 I 0 I I I) d . 'lTZ (I) We put 2'IT =P+2+4+"8 (plsanmteger;q= , ;r= , ; - <:z< ,an o5= Slll T' 'lTZ c=cosT' If r = 0, sin x = ( - 1 )qo5, cos x = ( - I) q c, If r = 1, sinx = ( - I )qc, cosx = - (- I )qo5. Here 05 and c are computed by the following approximation formulas. Putting - ZZ /2= y, o5(y)=sm('lTzj4)=;zLaiyi, c(y)=cos('lTz/4)=biY' gives an approximation by a polynomial of the 5th degree, where the maximal errors are 05: 2 X IO- Is , c: 2 X 10- 1 ). ao=0.78539 81633 97426, a l =0.161491024375338, a z =0.00996 1578261200, a3 = 0.000292609499152, a 4 = 0.0000050133389, as = 0.00000 00555 [357. b o = 0.99999 99999 999, b l =0.61685 02750 601, b z =0.06341 73767885, b) = 0.0026079335007, b 4= 0.000057447609, b s = 0.000000776593. sin(nx/2) .. (2) :"' L( -1)'a j x 2 ' (-1 x 1). This gives an approximation by a polynomial of 10th x degree (0  i  5), where the maximal error is 2.67 x 10- 11 . a o = 1.57079 63267 682, a 1 = 0.64596 40955 820, a z = 0.07969 26037 435, a 3 =0.00468 16578 837, a 4 =0.00016 02547 767, as==O.OOOOO 34318 696. (3) tan   x( ko + I:; + ... + I: ) (continued fraction) (- I <: x <: I). Maximal error 9.8 X 10- 12 . ko = 0.78539816349907, k 1 = 6.1922946807 1350, k z = -0.654498309523 [6, k3 = 520.24599063989939, k4 = - 0.0779795098 7751. (IV) Inverse Trigonometric Functions (1) An approximation by a polynomial of the 21st degree (0.;;; i <: 10): arcsinx Laix Zi + j (Ixl.;;; I/Y2). Maximal error 10- 10 . ao= 1.00000000053, al =0.16666 65754 5, az=0.07500 46066 5, a3=0.04453 58425 7, a 4 =0.03175 26509 6, as=0.0l176 582819, a6 = 0.0692 I 26185 7, a7= -0.14821 096288 a g =0.32889 76635 2, a9= -0.35020412015, alO=0.19740 50325 O. 
App. B, Table 7.VI Coefficients of Polynomial Approximations 1836 (2) Puttingx=w+u(w=H,,i; -i.;;u.;;i), v= ;;x:' (Ivl.;;U arc tan x = arc tan w + t(v), t( v) = arc tan c. The values of arc tan w: arctan(1/8)=0.12435 49945 46711, arc tan(3/8) =0.35877 06702 70611, arc tan(5 /8) = 0.558599315343560, arc tan(7 /8) = 0.718829999621623. t( v) is computed by an approximation by a polynomial of the 9th degree (0.;; i.;; 4), where t(v)=arctanv L(-lya,v 2i + l . Maximal error 1.6 X 10- 13 . ao = 0.99999999999992, a 1 = 0.33333 33328220, a2 = 0.1999997377 6, a) = 0.142809976, a4 = 0.1076360. . ( x21 x 2 1 ) . . (3) arctanxx ka+Tf(;"+"'+T1(,;" (contInued fractIOn) (-I<x<I). \ . ,  / Maximal error 3.6 X 10- la. k o = 0.99999999362, k 1= - 3.00000308694, k 2 = - 0.55556977284, k)= -15.77401811273, k4 = - 0.16190809780, ks = - 44.57191 79508 8, k6 = - 0.10810 67493 1. (V) Gamma Function An approximation by a polynomial of the 8th degree: r(2+x) >iXi (-1/2< x.;; 1/2). Maximal error 7.6 X IO- g . ao=0.999999926, al =0.42278 4604, a2=0.411849671, a3=0.0815652323, a4 = 0.07406 48982. as= -0.00012 513767, a6=0.0122995771, a7= -0.0034961289, ag=0.0021385778. (VI) Normal Distribution 2 f oo 1 (1) -;= e- t2 dt'=; ;+1 16 (Ox < co). This gives an approximation by a polynomial In x (l+1:a j x) "\I of the 6th degree. Maximal error 2.8 x \0-7. aD = 0.0705230784, a 1 = 0.0422820123, a 2 = 0.0092705272, a 3 = 0.00015 20143, a 4 =0.00027 65672, as = 0.00004 30638. 1 f oo (2) P(x)=- e- t2 / 2 dt, Jhr x 4P(x)(1- P(x)),=; [ex p ( - 2: 2 )] [1 +X 4 ( aa+ x 2 :aJ ] (Ox < co). Maximal error 2 x \O-s. a o =0.0055, a 1 =0.0551, a 2 =14.4. (3) The inverse function of (2) x=:[y(aa+ y:1aJ r 2 , y= -log[4P(x)(1-P(x))] (Oy< co). Maximal error 4.9 x 10- 4 . a o =2.0611786, a 1 =-5.7262204, a2=11.640595. 
Statistical Tables for Reference Statistical Tables [1] 1. A. Greenwood and H. O. Hartley, Guide to tables in mathematical statistics, Princeton Univ. Press, 1962. [2] Research Group for Statistical Sciences (T. Kitagawa and M. Masuyama, eds.) New statis- tical tables (Japanese), explanation p. 264, table p. 214, Kawade, 1952. [3] R. A. Fisher and F. Yates, Statistical tables for biological, agricultural and medical re- search, explanation p. 30, table p. 137, Oliver & Boyd, third edition, 1948. [4] E. S. Pearson and H. O. Hartley, Bio- metrika tables for statisticians, explanation p. 104, table p. 154, Cambridge Univ. Press, third edition, 1970. [5] K. Pearson, Tables for statisticians and biometricians I, 1930, explanation p. 83, table p. 143; II, 1931, explanation p. 250, table p. 262, Cambridge Univ. Press. [6] Statistiral tables JSA-1972 (Japanese), table p. 454, explanation p. 260, Japanese Standards Association, 1972. Tables of Special Statistical Values [8] Harvard Univ., Tables of the cumulative binomial probability distribution, Harvard, 1955, n 2: (npiqn-i; 5 dec., {=,. p = 0.0 I (0.0 I )0.50, n = I (I )50(2) 100( 1O)200(20)500( 50) 1000. [9] National Bureau of Standards, NBS ap- plied mathematical series, no. 6, Tables of the binomial probability distribution, 1950, (  )p iq n - i and the partial sum: 7 dec., p = 0.01(0.01)0.50, n=2(1)49. [10] T. Kitagawa, Table of Poisson distribu- tion (Japanese), Baihukan, 1951, e -mm i 1 i!: 7-8 dec., m=0.001(0.001)1.000(0.01)1O.00. [II] G. 1. Lieberman and D. B. Owen, Tables of the hypergeometric probability distribu- tion, Stanford, 1961, ()(  :) / ( ): 6 dec.. N = 2( I )50( 10) 100( I 00)2000, n = I(I)(N 12), k= 1(I)n. [12] National Bureau of Standards, NBS no. 23. Tables of normal probability functions, 1942, <p(x)=(I/ )exp( -lX\ <P(x)= fx <p(x)dx: 15 dec.. x =0(0.00001) 1.0000(0.001)8.285. [13] K. Pearson, Tables of the incomplete beta-functions, Cambridge, second edition, 1968, Ix(p,q): 8 dec.,p, q=0.5(0.5) 11(1)50. [14] K. Pearson, Tables of the incomplete gamma-function, Cambridge, 1922, revised edition, 1951, I(u,p)= !oUVP+l (lleV)(vPlr(p+ I»)dv: 7 dec., p = 0.0(0.1 )5.0(0.2)50.0, u = 0.1(0.1)20.0; p = - 1.0(0.05)0.0, u =0.1(0.1 )51.3. [15] N. V. Smirnov, Tables for the distribution and density function of the t-distribution, Pergamon, 1961,6 dec.,j= 1(1)35, t= 0(0.0 I )3.00(0.02)4.50(0.05)6.50. [16] G. J. Resnikoff and G. J. Lieberman, Tables of the non-central t-distribution, Stan- ford, 1957. [17) F. N. David, Tables of the ordinates and probability integral of the distribution of the correlation coefficient in small samples, Cam- bridge, 1938. [18] D. B. Owen, The bivariate normal proba- bility distribution, Sandia Corp., 1957, T(h,a): 6 dec., a =0.000(0.025)1.000, 00, h= 0.00(0.0 I )3.50(0.05)4.75, T (h, a) h ax I ( X2+ y2 ) = fa fa 2?T exp -  dydx. [19] National Bureau of Standards, NBS no. 50, Tables of the bivariate normal distribution function and related functions, 1959, L(h, k,r) (00 (00 I =)h )k 2?TV [ X2 + y2 - 2rxy ] Xexp - dydx: 6 dec., 2(1- r 2 ) r = ::':: 0.00(0.05)0.95(0.0 I )0.99, h, k =0.0(0.1)4.0. Tables of Allocation [20] T. Kitagawa and M. Midome, Table of al10ca tion of elements for experimental design (Japanese), Baihukan, 1953. [21] R. C. Bose, W. H. Clatworthy, and S. S. Shrikhande, Tables of partially balanced de- signs with two associate classes, North Caro- lina Agric. Expt. Station Tech. Bull., 1954 (table of PBIBD). 
Numerical Tables lor Reference General Tables [1 J M. Bo11, Tables numeriques universe11es, Dunod, 1947. [2J P. Barlow, Barlow's tables, Robinson, 1814, third edition, 1930. [3J W. Shibagaki, 0.01 % table of elementary functions (Japanese), Kyoritu, 1952. [4J K. Hayashi, Table of higher functions (Japanese), Iwanami, second edition, 1967, [5J E. Jahnke and F. Emde, Funktionentafeln mit Formelin und Kurven, Teubner, second edition 1933 (English translation: Tables of functions with formulae and curves, Dover, fourth edition, 1945). [6J Y. Yoshida and M. Yoshida, Mathematical tables (Japanese), Baihukan, 1958. [7] M, Abramowitz and 1. A. Stegun (eds.), Handbook of mathematical functions with formulas, graphs and mathematical tables, National Bureau of Standards, 1964 (Dover, 1965). [8] A. Fletcher et al. (eds.), Index of mathe- matical tables I, II, Scientific Computing Service, Addison-Wesley, second edition, 1962, Multiplication Table [9] A. L. Cre11e, Rechentafeln welche a11es Mutiplicieren und Dividiren mit Zahlen unter 1000 ersparen, bei grosseren Zahlen aber die Rechung erleichtern und sicherer machen, W. de Gruyter, new edition 1944. Table of Prime Numbers [IOJ D. N. Lehmer, List of prime numbers from 1 to 10,006,721, Carnegie Institution of Washington, 1914. Series of Tables of Functions [11 J British Association for the Advancement of Science, Mathematical tables, vol. 2, Emden functions, 1932; vol. 6, Bessel func- tions, p1. 1, 1937; vol. 8, Number-divisor tables, 1940; vol. 9, Tables of powers giving integral powers of integrals, 1940; vol. 10, Bessel functions, p1. 2, 1952. [12] Harvard University, Computation Laboratory, Annals, vol. 2, Tables of the modified Hankel functions of order one-third and their derivatives, 1945; vol. 3, Tables of the Bessel functions of the first kind of orders zero and one, 1947; vo!. 14, Orders seventy- nine through one hundred thirty-five, 1951; vol. 18, Tables of generalized sine.. and cosine-integral functions, p1. 1, 1949; vol. 19, p1. 2, 1949; vol. 20, Tables of inverse hyper- bolic functions, 1949; vol. 21, Tables of tic:: generalized exponential-integral functions, 1949. [13J National Bureau of Standards, Applied Mathematics Series (AMS), AMS 1, Tables of the Bessel functions Yo(x), Y1(x), Ko(x), Kl(X), 0< x < 1, 1948; AMS 5, Tables of sines and cosines to fifteen decimal places at hundredths of a degree, 1949; AMS II, Table of arctangents of rational numb(:rs, 1951; AMS 14, Tables of the exponentlal function eX (including e- X ), 1951; AMS 16, Tables of n! and f(n +!) for the first thousand val- ues of n, 1951; AMS 23, Tables of normal probability functions, 1953; AMS 25, Tables of the Bessel functions Yo(x), YJ(x), Ko(x), K1(x), 0< x < 1,1952; AMS 26, Tables of Arctanx, 1953; Tables of lOX, 1953; AMS 32, Table of sine and cosine integrals for arguments from 10 to 100, 1954; AMS 34, Table of the gamma function for complex arguments, 1954; AMS 36, Tab\es of circular and hyperbolic sines and cosines for radian arguments, 1953; AMS 40, Table of secants and cosecants to nine significant figures at hundredths of a degree, 1954; AMS 41, Tables of the error function and its derivative, 1954; AMS 43, Tables of sines and cosines for radian arguments, 1955; AMS 45, Table of hyperbolic sines and cosines, 1955; AMS 46. Table of the descending exponential, 1955. Tables of Special Functions [14] Akademiya Nauk SSSR, Tables of the exponential integral functions, 1954. [15] J. Brownlee, Table oflogr(x), Tracts for computers, no. 9, Cambridge lfniv. Press, 1923. [16] L. Dolansky and M. P. Dolansky, Table of log 2 (I/p), p 'log 2 (I/p) and p 'log 2 (1/p) + (1- p)log 2 (1/(1- p)), M.I.T. Research Lab. of Electronics tech. report 227,1952. [17] L. M. Milne-Thomson, Die el1iptischen Funktionen von Jacobi, Springer, 1931; En- glish translation: Jacobian elliptic function tables, Dover, 1950. [18] H. T. Davis, Tables of th'e higher math- ematical functions, Principia Press, vol. 1, Gamma function, 1933; vol. 2, Polygamma functions, 1935; vo!. 3, Arithmetical tables, 1962. [19] R. G. Selfridge and 1. E. Maxfield, A table of the incomplete elliptic integral of the third kind, Dover, 1958. 
1839 [20] W. Shibagaki, 0.01% table of modified Bessel functions and the method of numerical computation for them (Japanese), Baihukan, 1955. [21] W. Shibagaki, Theory and application of gamma function, Appendix. Table of gamma function of complex variable effective up to 6 decimals (Japanese), lwanami, 1952. [22] L. J. Slater, A short table of the Laguerre polynomials, Proc. lEE, monograph no. 103c, 1955,46-50. [23] G. N. Watson, A treatise on the theory of Bessel functions, Appendix, Cambridge Univ. Press, 1922, second edition, 1958. See also Statistical Tables for Reference. Tables of Approximation Formulas of Functions [24J S. Hitotumatu, Approximation formula (Japanese) Takeuti, 1963. [25J T. Uno (ed.), Approximation formulas of functions for computers 1-3, Joint research work for mathematical sciences, pt. IV, sec- tion 5,1961-1963 (Japanese, mimeographed). [26J Z. Yamauti, S. Moriguti, and S. Hito- tumatu (eds.), Numerical methods for elec- tronic computers (Japanese), Baihukan, I, 1965; II, 1967. [27J C. Hastings (ed.), Approximations for digital computers, Princeton, 1955. [28J A. J. W. Duijvestijn and A. J. Dekkers, Chebyshev approximations of some transcen- dental functions for use in digital computing, Philips Research Reports, 16 (1961),145-174. [29J L. A. Lyusternik, O. A. Chervonenkis, and A. P. Yanpol'skii, Mathematical analysis: functions, limits, series, continued fractions, Pergamon, 1965. (Original in Russian, 1963.) [30J 1. F. Hart ct a1. (eds.), Computer approxi- mations, Wiley, 1968. Numerical Tables for Referel1ce 
Journals Abh. Akad. Wiss. Gottingen Abhandlungen der Akademie der Wissen- schaften in Gottingen (Gottingen) Abh. Bayer. Akad. Wiss. Math.-Nat. K!. Abhandlungen der Bayerschen Akademie der Wissenschaften. Mathematisch- Naturwis- senschaftliche Klasse (M ullich) Abh. Math. Sem. Univ. Hamburg Abhandlungen aus dem Mathematischen Seminar der Universitiit Hamburg (Hamburg) Acta Arith. Acta Arithmetica, Polska Akademia Nauk, Instytut Matematyczny (Warsaw) Acta Informat. Acta Informatica (Berlin) Acta Math. Acta Mathematica (Uppsala) Acta Math. Acad. Sci. Hungar. Acta Mathematica Academiae Scientiarum Hungaricae (Budapest) Acta Math. Sinica Acta Mathematica Sinica (Peking). Chinese Math. Acta is its English translation Acta Sci. Math. Szeged. Acta Universitatis Szegediensis, Acta Scien- tiarum Mathematicarum (Szeged) Actualites Sci. Ind. Actualites Scientifiques et Industrielles (Paris) Advances in Math. Advances in Mathematics (New York) Aequationes Math. Aequationes Mathematicae (Basel-Waterloo) Algebra and Logic Algebra and Logic (New York). Translation of Algebra i Logika Algebra i Logika Akademiya Nauk SSSR Sibirskoe Otdelenie. Institut Matematiki. Algebra i Logika (Novosibirsk). Translated as Algebra and Logic Algebra Universalis Algebra Universalis (Basel) Amer. 1. Math. American Journal of Mathematics (Baltimore) Amer. Math. Monthly The American Mathematical Monthly (Menasha) Amer. Math. Soc. Colloq. Pub!. American Mathematical Society Colloquium Publications (Providence) Amer. Math. Soc. Math. Surveys American Mathematical Society Mathematical Surveys (Providence) Amer. Math. Soc. Proc. Symp. Pure Math. American Mathematical Society Proceedings of Symposia in Pure Mathematics (Provi- dence) Amer. Math. Soc. Trans!. American Mathematical Society Translations (Providence) Amer. Math. Soc. Trans!. Math. Monographs American Mathematical Society Translations of Mathematical Monographs (Providence) Ann. Acad. Sci. Fenn. Suomalaisen Tiedeakatemian Toimituksia, Annales Academiae Scientiarum Fennicae. Series A. I. Mathematica (Helsinki 1 Ann. der Phys. Annalen der Physik (Leipzig) Ann. Fac. Sci. Univ. Toulouse Annales de la FacuIte des Sciences de l'Uni- versite de Toulouse pour les Sciences Mathematiques et les Sciences Physiques (Toulouse) Ann. Inst. Fourier Annales de I'Institut Fourier, Universite de Grenoble (Grenoble) Ann. Inst. H. Poincare Annales de I'Institut Henri Poinca,e (Paris) Ann. Inst. Statist. Math. Annals of the Institute of Statistical Mathe- matics (Tokyo) Ann. Mat. Pura App!. Annali di Matematica Pura ed Applicata (Bologna) Ann. Math. Statist. The Annals of Mathematical Statistics (Baltimore) Ann. Math. Annals of Mathematics (Princeton) Ann. Physique Annales de Physique (Paris) Ann. Probability The Annals of Probability (San Francisco) Ann. Pol on. Math. Annales Polonici Mathematici. Polska Akademia Nauk (Warsaw) Ann. Roumaines Math. Annales Roumaines de Mathemattques. Journal de I'Institut Mathematique Roumain (Bucharest) Ann. Sci. Ecole Norm. Sup. Annales Scientifiques de I'Ecole Normale Superieure (Paris) 
1841 Ann. Scuola Norm. Sup. Pisa Annali dell a Scuola Normale Superiore de Pisa, Scienze Fisiche e Matematiche (Pisa) Ann. Statist. The Annals of Statistics (San Francisco) Arch. History Exact Sci. Archive for History of xact Sciences (Berlin-New York) Arch. Math. Archiv der Mathematik (Basel-Stuttgart) Arch. Rational Mech. Anal. Archive for Rational Mechanics and Analysis (Berlin) Ark. Mat. Arkiv fOr Matematik (Stockholm) Ark. Mat. Astr. Fys. Arkiv for Matematik, Astronomi och Fysik (Uppsala) Asterique Asterique. La Societe Mathematique de France (Paris) Atti Accad. Naz. Lincei, Mem. Atti della Accademia Nazionale dei Lincei, Memorie (Rome) Atti Accad. Naz. Lincei, Rend. Atti della Accademia Nazionale dei Lincei, Rendiconti (Rome) Automat. Control Comput. Sci. Automatic Control and Computer Sciences (New York). Translation of A vtomatika i Vychislitel'naya Tekhnik. Akademiya Nauk Latviiskol SSR (Riga) Bell System Tech. 1. The Bell System Technical Journal (New York) Biometrika Biometrika, A Journal for the Statistical Study of Biological Problems (London) Bol. Soc. Mat. Sao Paulo Boletim da Sociedade de Matematica de Sao Paulo (Sao Paulo) Bull. Acad, Pol. Sci. Bulletin de l'Academie Polonaise des Sciences (Warsaw) Bull. Amer. Math. Soc. Bulletin of the American Mathematical Society (Providence) Bull. Calcutta Math. Soc. Bulletin of the Calcutta Mathematical Society (Calcutta) Bull. Math. Statist. Bulletin of Mathematical Statistics (Fukuoka, Japan) Journals Bull. Nat. Res. Council Bulletin of the National Research Council (Washington) Bull. Sci. Math. Bulletin des Sciences Mathematiques (Paris) Bull. Soc. Math. Belg. Bulletin de la Societe Mathematique de Belgique (Brussels) Bull. Soc. Math. France Bulletin de la Societe Mathematique de France (Paris) Bull. Soc. Roy. Sci. Liege Bulletin de la Societe Royale des Sciences de Liege (Liege) c. R. Acad. Sci. Paris Comptes Rendus Hebdomadaires des Seances de l'Academie des Sciences (Paris) Can ad. 1. Math. Canadian Journal of Mathematics (Toronto) Casopis Pest. Mat. Casopis pro Pestovani Matematiky, Ces- koslovenska Akademie Ved (Prague) Colloq. Math. Colloquium Mathematicum (Warsaw) Comm. ACM Communications of the Association for Computing Machinery (New York) Comm. Math. Phys. Communications in Mathematical Physics (Berlin) Comm. Pure App1. Math. Communications on Pure and Applied Mathematics (New York) Comment. Math. Helv. Commentarii Mathematici Helvetici (Zurich) Compositio Math. Compositio Mathematica (Groningen) Com put. 1. The Computer Journal (London) Crelles 1. = 1. Reine Angew. Math. CWI Newslett. Centrum voor Wiskunde en Informatica. Newsletter (Amsterdam) Cybernetics Cybernetics (New York). Translation of Kibernetika (Kiev) Czech. Math. 1. Czechoslovak Mathematical Journal (Prague) Deutsche Math. Deutsche Mathematik (Berlin) 
Journals Differentsial'nye Uravneniya Differentsial'nye Uravneniya (Minsk). Trans- lated as Differential Equations Differential Equations Differential Equations (New York). Transla- tion of Differentsial'nye Uravneniya Dok\. Akad. Nauk SSSR Doklady Akademii Nauk SSSR (Moscow). Soviet Math. Dok\. is the English translation of its mathematics section Duke Math. J. Duke Mathematical Journal (Durham) Econometrica Econometrica, Journal of the Econometric Society (Chicago) Edinburgh Math. Notes The Edinburgh Mathematical Notes (Edin- burgh) Enseignement Math. L'Enseignement Mathematique (Geneva) Enzykl. Math. Enzyklopadie der Mathematischen Wissen- schaften mit Einschluss ihrer Anwendungen (Berlin) Erg. Angew. Math. Ergebnisse der Angewandte Mathematik (Berlin-New York) Erg. Math. Ergebnisse der Mathematik und ihrer Grenz- gebiete (Berlin-New York) Fund. Math. Fundamenta Mathematicae (Warsaw) Funkcial. Ekvac. Fako de I'Funkcialaj Ekvacioj Japana Matematika Societo. Funkcialaj Ekvacioj (Serio Internacia) (Kobe, Japan) Functional Anal. Appl. Functional Analysis and its Applications (New York). Translation of Funktsional. Anal. Prilozhen. Funktsional. Anal. Prilozhen. Funktsional'nyi Analiz i ego Prilozheniya. Akademiya Nauk SSSR (Moscow). Translated as Functional Anal. Appl. General Topology and Appl. General Topology and its Applications (Amsterdam) Hiroshima Math. 1. Hiroshima Mathematical Journal. Hiroshima Univ. (Hiroshima, Japan) Hokkaido Math. J. Hokkaido Mathematical Journal. Hokkaido Univ. (Sapporo, Japan) 1842 IBM J. Res. Develop. IBM Journal of Research and Development (Armonk, N.Y.) 1llinois 1. Math. Illinois Journal of Mathematics (Urbana) Indag. Math. Indagationes Mathematicae = Nederl. Akad. Wetensch. Proc. Indian 1. Math. Indian Journal of Mathematics (Allahabad) Indiana Univ. Math. 1. Indiana University Mathematics Journal (Bloomington) Information and Control Information and Control (New York) Inventiones Math. Inventiones Mathematicae (Berlin) lzv. Akad. Nauk SSSR Izvestiya Akademii Nauk SSSR (Moscow). Math. USSR-lzv. is the English translation of its mathematics section J. Algebra Journal of Algebra (New York) J. Analyse Math. Journal d'Analyse Mathematiques (Jerusalem) 1. Appl. Math. Mech. Journal of Applied Mathematics and Mechanics (New York). Translation of Prikl. Mat. Mekh. J. Approximation Theory Journal of Approximation Theory (New York) 1. Assoc. Comput. Mach. (1. ACM) Journal of the Association for Computing Machinery (New York) J. Austral. Math. Soc. The Journal of the Australian Mathematical Society (Sydney) J. Combinatorial Theory Journal of Combinatorial Theory. Series A and Series B (New York) J. Comput. System Sci. Journal of Computer and System Sciences (New York) J. Differential Equations Journal of Differential Equations (New York) J. Differential Geometry Journal of Differential Geometry (Bethlehem, Pa.) J. Ecole Poly tech. Journal de I'Ecole Poly technique (Paris) 1. Fac. Sci. Hokkaido Univ. Journal of the Faculty of Science, Hokkaido University. Series I. Mathematics (Sapporo, Japan) 
1843 1. Fac. Sci. Univ. Tokyo Journal of the Faculty of Science, University of Tokyo. Section I. (Tokyo) J. Franklin Inst. Journal of the Franklin Institute (Phila- delphia) J. Functional Anal. Journal of Functional Analysis (New York) 1. Indian Math. Soc. The Journal of the Indian Mathematical Society (Madras) J. Inst. Elec. Engrs. Journal of the Institution of Electrical En- gineers (London) 1. Inst. Poly tech. Osaka City Vniv. Journal of the Institute of Polytechnics, Osaka City University. Series A. Mathematics (Osaka) 1. London Math. Soc. The Journal of the London Mathematical Society (London) J. Math. Anal. Appl. Journal of Mathematical Analysis and Ap- plications (New York) J. Math. and Phys. Journal of Mathematics and Physics (Cam- bridge, Massachusetts, for issues prior to 1975; for 1975 and later, New York) 1. Math. Econom. Journal of Mathematical Economics (Amsterdam) 1. Math. Kyoto Univ. Journal of Mathematics of Kyoto University (Kyoto) J. Math. Mech. Journal of Mathematics and Mechanics (Bloomington) 1. Math. Pures Appl. Journal de Mathematiques Pures et Ap- pliquees (Paris) J. Math. Soc. Japan Journal of the Mathematical Society of Japan (Tokyo) J. Mathematical Phys. Journal of Mathematical Physics (New York) 1. Multivariate Anal. Journal of M uItivariate Analysis (New York) 1. Number Theory Journal of Number Theory (New York) 1. Operations Res. Soc. Japan Journal of the Operations Research Society of Japan (Tokyo) Journals J. Optimization Theory Appl. Journal of Optimization Theory and Applica- tions (New York) J. Phys. Soc. Japan Journal of the Physical Society of Japan (Tokyo) 1. Pure Appl. Algebra Journal of Pure and Applied Algebra (Amsterdam) 1. Rational Mech. Anal. Journal of Rational Mechanics and Analysis (Bloomington) J. Reine Angew. Math. Journal fUr die Reine und Angewandte Mathematik (Berlin). = Crelles 1. 1. Res. Nat. Bur. Standards Journal of Research of the National Bureau of Standards. Section B. Mathematics and Mathematical Physics (Washington) 1. Sci. Hiroshima Univ. Journal of Science of Hiroshima University. Series A (Mathematics, Physics, Chemistry); Series A-I. (Mathematics) (Hiroshima) J. Soviet Math. Journal of Soviet Mathematics (New York). Translation of (I) Itogi Nauki-Seriya Matematika (Progress in Science-Mathe- matical Series); (2) Problemy Matematichesk- ogo Analiza (Problems in Mathematical Analysis); (3) Zap. Nauchn. Sem. Leningrad. Otdel. Mat. Inst. Steklov. J. Symbolic Logic The Journal of Symbolic Logic (New Bruns- wick) Japan. J. Math. Japanese Journal of Mathematics (Tokyo) Jber. Deutsch. Math. Verein. (Jber. D.M.V.) Jahresbericht der Deutschen Mathematiker Vereinigung (Stuttgart) Kibernetika (Kiev) Otdelenie Matematiki, Mekhaniki i Kiber- netiki Akademii Nauk Ukrainskoi SSR. Kibernetika (Kiev). Translated as Cybernetics K6dai Math. Sem. Rep. K6dai Mathematical Seminar Reports (Tokyo) Linear Algebra and Appl. Linear Algebra and Its Applications (New York) Linear and Multilinear Algebra. Linear and Multilinear Algebra (New York) Mat. Sb. Matematicheskii Sbornik (Recueil Mathema- tique). Akademiya Nauk SSSR (Moscow). Translated as Math. USSR-Sb. 
Journals Mat. Tidsskr. A Matematisk Tidsskrift. A (Copenhagen) Mat. Zametki MatematicheskiI Zametki. Akademiya Nauk SSSR (Moscow). Translated as Math. Notes Math. Ann. Mathematische Annalen (Berlin-Gottingen- Heidelberg) Math. Compo Mathematics of Computation (Providence). Formerly Math. Tables Aids Comput. Math. 1. Okayama Univ. Mathematical Journal of Okayama University (Okayama, Japan) Math. Japonicae Mathematica Japonicae (Osaka) Math. Nachr. Mathematische Nachrichten (Berlin) Math. Notes Mathematical Notes of the Academy of Sciences of the USSR (New York). Trans- lation of Mat. Zametki Math. Rev. Mathematical Reviews (Ann Arbor) Math. Scand. Mathematica Scandinavica (Copenhagen) Math. Student The Mathematical Student (Madras) Math. Tables Aids Comput. (MTAC) Mathematical Tables and Other Aids to Computation (Washington). Name changed to Mathematics of Computation in 1960 (vol. 141T.) Math. USSR-Izv. Mathematics of the USSR-Izvestiya (Provi- dence). Translation of Izv. Akad. Nauk SSSR Math. USSR-Sb. Mathematics of the USSR-Sbornik (Provi- dence). Translation of Mat. Sb. Math. Z. Mathematische Zeitschrift (Berlin-Gottingen- Heidelberg) Mathematika Mathematika, A Journal of Pure and Applied Mathematics (London) Meed. Lunds Univ. Mat. Sem. Meddelanden fran Lunds Universitets Matematiska Seminarium = Communications du Seminaire Mathematique de l'Universite de Lund (Lund) Mem. Amer. Math. Soc. Memoirs of the American Mathematical Society (Providence) 1844 Mem. ColI. Sci. Univ. Kyoto Memoirs of the College of Science. University of Kyoto. Series A (Kyoto) Mem. Fac. Sci. Kyushu Univ. Memoirs of the Faculty of Science, Kyushu University. Series A. Mathematics (Fukuoka, Japan) Memor. Sci. Math. Memorial des Sciences Mathematiques (Paris) Michigan Math. J. The Michigan Mathematical Journal (Ann Arbor) Mitt. Math. Ges. Hamburg Mitteilungen der Mathematischen Gesellschaft in Hamburg (Hamburg) Monatsh. Math. Phys. Monatschefte fUr Mathematik und Physik (Vienna) Monatsh. Math. Monatshefte fur Mathematik (Vienna) Monograf. Mat. Monografje Matematyczne (Warsaw) Moscow Univ. Math. Bull. Moscow University Mathematics Bulletin (New York). Translation of the mathematics section of Vestnik Moskov. Univ., Ser. I, Mat. Mekh. Nachr. Akad. Wiss. Gottingen Nachrichten der Akademie der Wissenschaften in Gottingen. Math.-Phys. Kl. (Gi:Htingen) Nagoya Math. 1. Nagoya Mathematical Journal (Nagoya) Nederl. Akad. Wetensch. Proc. Koninklijke Nederlandse Akademie van Wetenschappen, Proceedings. Series A. Mathematical Sciences (Amsterdam) = Indag. Math., Proc. Acad. Amsterdam Nieuw Arch. Wisk. Nieuw Archiefvoor Wiskunde (Groningl:n) Numerische Math. Numerische Mathematik (Berlin-Gottingen- Heidelberg) Nuovo Cimento II Nuovo Cimento (Bologna) Osaka 1. Math. Osaka Journal of Mathematics (Osaka) Osaka Math. J. Osaka Mathematical Journal (Osaka) Pacific 1. Math. Pacific Journal of Mathematics (Berkeley) Philos. Trans. Roy. Soc. London Philosophical Transactions of the Royal Society of London. Series A (London) 
1845 Phys. Rev. The Physical Review (New York) Portugal. Math. Portugaliae Mathematica (Lisbon) Prikl. Mat. Mekh. Adademiya Nauk SSSR. Otdelenie Tekhnich- eskikh Nauk. Institut Mekhaniki Prikladnaya Matematika i Mekhanika (Moscow). Trans- lated as J. App\. Mat. Mech. Proc. Acad. Amsterdam = Neder!. Akad. Wetensch. Proc. Proc. Amer. Math. Soc. Proceedings of the American Mathematical Society (Providence) Proc. Cambridge Phil os. Soc. Proceedings of the Cambridge Philosophical Society (Cambridge) Proc. Imp. Acad. Tokyo Proceedings of the Imperial Academy (Tokyo) Proc. Japan Acad. Proceedings of the Japan Academy (Tokyo) Proc. London Math. Soc. Proceedings of the London Mathematical Society (London) Proc. Nat. Acad. Sci. US Proceedings of the National Academy of Sciences of the United States of America (Washington) Proc. Phys.-Math. Soc. Japan Proceedings of the Physico- Mathematical Society of Japan (Tokyo) Proc. Roy. Soc. London Proceedings of the Royal Society of London. Series A (London) Proc. Steklov Inst. Math. Proceedings of the Steklov Institute of Mathematics (Providence). Translation of Trudy Mat. Inst. Steklov. Prog. Theoret. Phys. Progress of Theoretical Physics (Kyoto) Pub\. Inst. Math. Publications de l'Institut Mathematique (Belgrade) Pub!. Inst. Math. Univ. Strasbourg Publications de l'Institut de Mathematiques de I'Universite de Strasbourg (Strasbourg) Pub!. Math. Inst. HES Publications Mathematiques de I'Institut des Hautes Etudes Scientifiques (Paris) Pub\. Res. Inst. Math. Sci. Publications of the Research Institute for Mathematical Sciences (Kyoto) Journals Quart. Appl. Math. Quarterly of Applied Mathematics (Provi- dence) Quart. J. Math. The Quarterly Journal of Mathematics, Ox- ford. Second Series (Oxford) Quart. J. Mech. App!. Math. The Quarterly Journal of Mechanics and Applied Mathematics (Oxford) Rend. Circ. Mat. Palermo Rendiconti del Circolo Matematico de Palermo (Palermo) Rend. Sem. Mat. Univ. Padova Rendiconti del Seminario Matematico dell'Universita di Padova (Padua) Rev. Mat. Hisp. Amer. Revista Matemfltica Hispafio-Americana (Madrid) Rev. Mod. Phys. Reviews of Modern Physics (New York) Rev. Un. Mat. Argentina Revista de la Union Matematica Argentina (Buenos Aires) Rev. Univ. Tucuman Revista Universidad Nacional de Tucuman, Facultad de Ciencias Exactas y Tecnologia. Serie A. Matematicas y Fisica Teorica (Tucuman) Roczniki Polsk. Towar. Mat. Roczniki Polskiego Towarzystwa Matema- tycznego. Serja I. Prace Matematyczne (Krakow) Rozprawy Mat. Rozprawy Matematyczne, Polska Akademia Nauk, Instytut Matematyczny (Warsaw) Russian Math. Surveys. Russian Mathematical Surveys (London). Translation of Uspekhi Mat. Nauk Sammu\. Goschen Sammulung Goschen (Leipzig) Sankhya Sankhya, The Indian Journal of Statistics. Series A and Series B (Calcutta) S.-B. Berlin. Math. Ges. Sitzungsberichte der Berliner Mathematischen Gese\lschaft (Berlin) S.-B. Deutsch. Akad. Wiss. Berlin Sitzungsberichte der Deutschen Akademie der Wissenschaften zu Berlin, Mathematisch- Naturwissenschaftliche Klasse (Berlin) S.-B. Heidelberger Akad. Wiss. Sitzungsberichte der Heidelberger Akademie der Wissenschaften, Mathematisch-Natur- wissenschaftliche Klasse (Heidelberg) 
Journals S.-B. Math.-Nat. Kl. Bayer. Akad. Wiss. Sitzungsberichte der Mathematisch-Natur- wissenschallichen Klasse der Bayerischen Akademie der Wissenschaften zu M iinchen (Munich) S.-B. Oster. Akad. Wiss. Sitzungsberichte der Osterreichische Akade- mie der Wissenschaften (Vienna) S.-B. Phys.-Med. Soz. Erlangen Sitzungsberichte der Physikalisch- Medizinischen Sozietat zu Erlangen (Erlangen) S.-B. Preuss. Akad. Wiss. Sitzungsberichte der Preussischen Akademie der Wissenschaften. Physikalisch-Mathema- tische Klasse (Berlin) Schr. Math. Inst. u. Inst. Angew. Math. Univ. Berlin Schriften des Mathematischen Instituts und des Instituts fiir Angewandte Mathematik der Universitiit Berlin (Berlin) Schr. Math. Inst. Univ. Miinster Schriftenreihe des Mathematischen Instituts der Universitat Munster (Munster) Sci. Papers ColI. Gen. Ed. Univ. Tokyo Scientific Papers of the College of General Education, University of Tokyo (Tokyo) Sci. Rep. Tokyo Kyoiku Daigaku Science Reports of the Tokyo Kyoiku Daigaku. Section A (Tokyo) Scripta Math. Scripta Mathematica. A Quarterly Journal devoted to the Philosophy, History, and Ex- pository Treatment of Mathematics (New York) Sem. Bourbaki Seminaire Bourbaki (Paris) SIAM J. Appl. Math. SIAM Journal of Applied Mathematics. A Publication of the Society for Industrial and Applied Mathematics (Philadelphia) SIAM 1. Comput. SIAM Journal on Computing (Philadelphia) SIAM J. Control SIAM Journal on Control (Philadelphia) SIAM 1. Math. Anal. SIAM Journal on Mathematical Analysis (Philadelphia) SIAM J. Numer. Anal. SIAM Journal on Numerical Analysis (Philadelphia) SIAM Rev. SIAM Review (Philadelphia) Siberian Math. J. Siberian Mathematical Journal (New York). Translation of Sibirsk. Mat. Zh. 1846 Sibirsk. Mat. Zh. Akademiya Nauk SSSR. Sibirskoe Otdelenie. Sibirskii Matematicheskii Zhurnal (Moscow). Translated as Siberian Math. 1. Skr. Norske Vid. Akad. Oslo Skrifter Utgitt av det Norske Videnskaps- Akademii Oslo. Matematisk-Naturvidens- kapelig Klasse (Oslo) Soviet Math. Dok!. Soviet Mathematics, Doklady (Providence). Translation of mathematical sectiol1 of Dok!. Akad. Nauk SSSR SRIJ. Stanford Research Institute Journal (Menlo Park) Studia Math. Studia Mathematica. (Wroclaw) Subutu-kaisi Nihon Sugaku-buturi-gakkai Kaisi (Tokyo) Sugaku SCigaku, Mathematical Society of Japan (Tokyo) Summa Brasil. Math. Summa Brasiliensis Mathematicae (Rio de Janeiro) Tensor Tensor (Chigasaki, Japan) Teor. Veroyatnost. i Primenen. Teoriya Veroyatnoste1 i ee Primenenie. Akademiya Nauk SSSR (Moscow). Translated as Theor. Prob. Appl. Theor. Prob. Appl. Theory of Probability and Its Applications. Society for Industrial and Applied Mathe- matics. English translation of Teor. Veroyat- nost. i Primenen. (Philadelphia) Tohoku Math. J. The Tohoku Mathematical Journal (Sendai, Japan) Tohoku-riho Tohoku Teikokudaigaku RikahOkoku (Sendai, Japan) Topology Topology. An International Journal of Mathematics (Oxford) Trans. Amer. Math. Soc. Transactions of the American Mathematical Society (Providence) Trans. Moscow Math. Soc. Transactions of the Moscow Mathematical Society (Providence). Translation of Trudy Moskov. Mat. Obshch. Trudy Mat. Inst. Steklov. Trudy Matematicheskogo Instituta im. V. A. 
1847 Steklova. Akademiya Nauk SSSR (Moscow- Leningrad). Translated as Proc. Steklov Inst. Math. Trudy Moskov. Obshch. Trudy Moskovskogo Matematicheskogo Obshchestva (Moscow). Translated as Trans. Moscow Math. Soc. Tsukuba 1. Math. Tsukuba Journal of Mathematics. Univ. Tsukuba (Ibaraki, Japan) Ukrain. Mat. Zh. Akademiya Nauk Ukrainskoi SSR. Institut Matematiki. Ukrainskil Matematicheskii Zhurnal (Kiev). Translated as Ukrainian Math. 1. Ukrainian Math. J. Ukrainian Mathematical Journal (New York). Translation of Ukrain. Mat. Zh. Uspekhi Mat. Nauk Uspekhi Matematicheskikh Nauk (Moscow- Leningrad). Translated as Russian Math. Surveys Vestnik Moskov. Univ. Vestnik Moskovskogo Universiteta. I, Mate- matika i Mekhanika (Moscow). Mathematical section translated as Moscow Univ. Math. Bull. Vierteljschr. Naturf. Ges. Zurich Vierteljahrsschrifte der Naturforschenden Gesellschaft in Zurich (Zurich) Z. Angew. Math. Mech. (Z.A.M.M.) Zeitschrift fur Angewandte Mathematik und Mechanik, Ingenieurwissenschaftliche F or- schungsarbeiten (Berlin) Z. Angew. Math. Phys. (Z.A.M.P.) Zeitschrift fur Angewandte Mathematik und Physik (Basel) Z. Wahrscheinlichkeitstheorie Zeitschrift fur Wahrscheinlichkeitstheorie und Verwandte Gebiete (Berlin) Zbl. Angew. Math. Zentralblatt fur Angewandte Mathematik (Berlin) Zb1. Math. Zentralblatt fUr Mathematik und ihre Grenz- gebiete (Berlin-Gottingen-Heidelberg) Zh. Eksper. Teoret. Fiz. Zhurnal Eksperimental'noi i Teoreticheskoi Fiziki (Moscow) Journals 
Publishers Academic Press Academic Press Inc., New York-London Addison-Wesley Addison-Wesley Publishing Company, Inc., Reading (Massachusetts)-Menlo Park (Cali- fornia)-London-Don Mills (Ontario) Akademiai Kiad6 A kiadasert fclOs: az Adademiai Kiad6 igaza- t6ja (Publishing House of the Hungarian Academy of Sciences), Budapest Akademie- Verlag Berlin Akademische Verlag. Akademische Verlagsgesellschaft, Leipzig Allen W. H. Allen & Co. Ltd., London Allen & Unwin Allen & Unwin, Inc., Winchester (Massachu- setts) Allyn & Bacon Allyn & Bacon, Inc., Newton (Massachusetts) Almqvist and Wiksell Almqvist och Wiksell Forlag, Stockholm Asakura Asakura-syoten, Tokyo Aschelhoug H. Aschelhoug and Company, Oslo Baihlikan Tokyo Benjamin W. A. Benjamin, Inc., New York-London Birkhiiuser Birkhauser Verlag, Basel-Stuttgart Blackie Blackie & Son Ltd., London-Glasgow Cambridge Univ. Press Cambridge University Press, London-New York Chapman & Hall Chapman & Hall Ltd., London Chelsea Chelsea Publishing Company, New York Clarendon Press Oxford University Press, Oxford Cremona Edizioni Cremonese, Rome de Gruyter Walter de Gruyter and Company, Berlin Dekker Marcel Dekker, Inc., New York Deutscher Verlag der Wiss. Deutscher Verlag der Wissenschaften, Berlin Dover Dover Publications, Inc., New York Dunod Dunod, Editeur, Paris Elsevier Elsevier Publishing Company, Amsterdam- London-New York Fizmatgiz Gosudarstvennoe Izdatel'stvo Fiziko- Matematicheskoi Literatury, Mcscow Gauthier-Villars Gauthier- Villars & ce, Editeur, Paris Ginn Ginn and Company, Waltham (Massachu- setts)- Toronto- London Gordon & Breach Gordon & Breach, Science Publishers Ltd., London Goztekhizdat Gosudarstvennoe Izdate\'stvo Tekhniko- Teoreticheskoi Literatury, Moscow Griffin Charles Griffin and Company Ltd., London Hafner Hafner Publishing Company, New York Harper & Row Harper & Row Publishers, New York- Evanston-London Hermann Hermann & ce, Paris Hirokawa Hirokawa-syoten, Tokyo Hirzel Verlag von S. Hirzel, Leipzig Holden-Day Holden-Day, Inc., San Francisco-London- Amsterdam Holt, Rinehart and Winston Holt, Rinehart and Winston, Inc., New York- Chicago-San Francisco- T oron to- London Interscience Interscience Publishers, Inc., New Y ork- London Iwanami Iwanami Shoten, Tokyo Kawade Kawade-syob6, Tokyo Kinokoniya Kinokoniya Company, Tokyo 
1849 Kyoritu Kyoritu-syuppan, Tokyo Lippincott 1. B. Lippincott Company, Philadelphia Longman Longman Group, Ltd., Harlow (Essex) Longmans, Green Longmans, Green and Company, Ltd., London-New York-Toronto-Bombay- Calcutta-Madras Macmillan The Macmillan Company, New York-London Maki Maki-syoten, Tokyo Maruzen Maruzen Company Ltd., Tokyo Masson Masson et C i ., Paris Math-Sci Press Math-Sci Press, Brookline (Massachusetts) McGraw-Hili McGraw-Hill Book Company, Inc., New York-London-Toronto Methuen Methuen and Company Ltd., London MIT Press The MIT Press, Cambridge (Massachusetts)- London Nauka Izdatel'stvo Nauka, Moscow NoordhotT P Noordhoff Ltd., Groningen North-Holland North-Holland Publishing Company, Amsterdam Oldenbourg Verlag von R Oldenbourg, Munich-Vienna Oliver & Boyd Oliver & Boyd Ltd., Edinburgh-London Oxford Univ. Press Oxford University Press, London-New York Pergamon Pergamon Press, Oxford-London-Edinburgh- New Y ork-Paris-Frankfurt Polish Scientific Publishers Pailstwowe Wydawnictwo Naukowe, Warsaw Pren tice- H all Prentice-Hall, Inc., Englewood Cliffs (New Jersey) Princeton Univ Press Princeton University Press, Princeton Publishers Random House Random House, Inc., New York Sibundo Tokyo Springer Springer- Verlag, Berlin (-Gottingen)- Heidelberg-New York Teubner B. G. Teubner Verlagsgesellschaft, Leipzig- Stuttgart Tokai Tokai-syobo, Tokyo Tokyo-tosyo Tokyo Tokyo Univ. Press Tokyo University Press, Tokyo Ungar Frederick Ungar Publishing Company, New York Univ. of Tokyo Press University of Tokyo Press, Tokyo U tida-rokakuho Tokyo Van Nostrand D. Van Nostrand Company, Inc., Toronto- New York-London Vandenhoeck & Ruprecht Gottingen Veit Verlag von Veit & Company, Leipzig Vieweg Friedr Vieweg und Sohn Verlagsgesellschaft mbH, Wiesbaden Wiley Wiley & Sons, Inc., New Y ork- London Wiley- Interscience Wiley & Sons, Inc., New York-London Zanichelli Nicola Zanichelli Editore, Bologna 
Special Notation This list contains the notation commonly and frequently used throughout this work. The symbol * means that the same notation is used with more than one meaning. For more detailed definitions or further properties of the notation, see the articles cited. Article and Notation Example Definition Section I. Logic V VxF(x) Universal quantifier (for all x, F(x) 4] 18, C holds) :3 :3xF(x) Existential quantifier (there exists 4] 18, C an x such that F(x) holds) /\, & A /\ B, A & B Conjunction, logical product 4] 18* (A and B) v AvB Disjunction, logical sum (A or B) 4] 18* I IA Negation (not A) 4] 18 ,=>,=> A-.B, A=>B Implication (A implies B) 4]18* +--+,<=>,# A=B Equivalence (A and B are logically 4]18 equivalent) II. Sets E XEX Membership (element x is a 381A member of the set X) rt Xrt X Nonmembership (element x is not 381A a member of the set X) c AcB Inclusion (A is a subset of B) 381A cF AcF B Noninclusion (A is not a subset 381A of B) c A;;B Proper inclusion (A is a proper 381A subset of B) 0 Empty set 381A U,U AUB, UA; Union, join 3818,D* n,n AnB, nA; Intersection, meet 3818, D* c, C Ac, C(A) Complement (of a set A) 3818 - , "- A-B,A"-B Difference (A - B = A n W) 3818 x AxB Cartesian product (of A and B) 3818* R, xRy, x y Equivalence relation 135A* (for two elements x, y) I AIR Quotient set (set of eq uivalence 1358* classes of A with respect to an equivalence relation R) IT IT;A; Cartesian prod uct (of the A;) 381E L,U LA;, UA; Direct sum (of the A;) 381E IS IS(A) Power set (set of all subsets of A) 381E B A Set of all mappings from A to B 381C {I} {xIP(x)} Set of all elements x with the 381A property P(x) 
1851 Special Notation Article and Notation Example Definition Section { } {a,LEA Family with index set A. 165D {an} Sequence (of numbers, points, 165D functions, or sets) ,II, # X,IXI, #X Cardinal number (of the set X) 49A*  p Aleph (transfinite cardinal) 49E -> J:X -> Y Mapping (f from X to Y) 381C* f--> f: Xf--> Y Mapping (where J(X) = Y, but 381C not in the present volumes) 1, id lA' id A Identity mapping 381C (identity function) c,x cx(x), Xx(x) Characteristic function 381C (representing function) flA Restriction (of a mapping J to A) 381C* 0 gof Composite (of mappings J and g) 381C lim sup, hm limsupAn Superior !imit (of the sequence 270C* of sets An) lim inf, Iim Iim inf An Inferior limit (of the sequence 270C* of sets An) Iim !im An Limit (of the sequence of sets An) 270C* Iim IA, Inductive limit (of A,) 210B -> !im Iim A, Projective limit (of A,) 210B <- <- III. Order ( , ) (a, h) Open interval {xla<x<b} 355C* [ , ] [a,b] Closed interval {xlaxb} 355C* ( , ] (a,b] Half-open-interval {xla<xb} 355C [ , ) [a, b) Half-open interval {xlax<b} 355C max maxA Maximum (of A) 311B mm minA Minimum (of A) 311B sup supA Supremum, least upper bound 311B (of A) inf infA Infimum, greatest lower bound 311B (of A) « a«b Very large (b is very large compared to a) U, v aUb, a v b Join of a, b in an ordered set 243A* n, /\ anh, a /\ b Meet of a, b in an ordered set 243A* IV. Algebra mod a=b (modn) Modulo (a and b are congruent 297G modulo n) I alb Divisibility (a divides b) 297 A * 1 alb Nondivisibility (a does not 297A divide b) det, I I detA,IAI Determinant (of a square matrix 103A * A) 
Special Notation 1852 Article and Notation Example Definition Section tr, Sp tr A, SpA Trace (of a square matrix A) 269F , T , 'A;A"A1,A' Transpose (of a matrix A) 269B I In Unit matrix (of degree n) 269A EH Matrix unit (matrix whose 269B 'J (i,j)-component is 1 and all others are 0) \8) A@B Kronecker product (of two 269C* matrices A and B) - M;;;;N Isomorphism (of two algebraic 256B systems M and N) / M/N Quotient space (of an algebraic 256F* system M by N) dim dimM Dimension (of a linear space, etc.) 256C 1m Imf Image (of a mapping f) 277E* Ker Ker f Kernel (of a mapping f) 277E Coim Coimf Coimage (of a mapping f) 277E Coker Coker f Cokernel (of a mapping f) 277E b ij , bi Kronecker delta (b ii = 1 and 269A bij = 0 for i #- j) ( , ), (a, b), a.b Inner product (of two vectors 442B* a and b) [ , ], x [a,b],axb Vector product (of two 3- 442C* dimensional vectors a and b) @ M@N Tensor product (of two modules 2771,2561* M and N) Horn Hom(M, N) Set of all homomorphisms 277B (from M to N) Hom A HomA(M, N) Set of all A-homomorphisms 277E (of an A-module M to an A-module N) Tor Torn(M, N) Torsion product (of M, N) 200D Ext Extn(M, N) Extension (of M, N) 200G A.,A. P /\ M, /\PM Exterior algebra (of a linear space 2560 M), pth exterior product (of M) V. Algebraic Systems N Set of all natural numbers 294A Z Set of all rational integers 294A Zm Z/mZ (set of all residue classes 297G* modulo m) Q Set of all rational numbers 294A R Set of all real numbers 294A C Set of all complex numbers 294A H Set of all quaternions .29B GF(q), J< Finite field (with q elements) 149M 
1853 Special Notation Article and Notation Example Definition Section Qp p-adic number field 439F (p is a prime) Zp Ring of p-adic integers 439F [ J k[x l ,... ,xnJ Polynomial ring (of variables 369A Xl' "', X n with coefficients in k) ( ) k(x l ,... ,x n ) Field extension 149D (ofkbyx 1 , ..,x n ) [[ J],{ k[[x 1 ,..., xn]] Formal power series ring (with 370A coefficients in k). Note: The symbols N, Z, Q, R, C, and H stand for sets, each with its own natural mathematical structure VI. Groups GL GL(V), GL(n,K) General linear group (over V, 60B or over K of degree n) SL SL(n, K) Special linear group (over K of 60B degree n) PSL PSL(n, K) Projective speciallinear group 60B (over K of degree n) U U(n) Unitary group (of degree n) 60F SU S U(n) Special unitary group (of degree n) 60F 0 O(n) Orthogonal group (of degree n) 601 SO SO(n) Special orthogonal group, 60I rotation group (of degree n) Spin Spin(n) Spin or group (of degree n) 6lD Sp Sp(n) Symplectic group (of degree n) 60L [For PGL(n, K), LF(n, K), PU(n), Sp(n), PSp(n, K)60 Classical Groups] VII. Topology (Convergence) --> ana Convergence (sequence an 87B,E'" converges to a) L" anta, an "a Convergence monotonically 87B decreasing t,/' an t a, an /' a Convergence monotonically 87B increasing lim lima n Limit (of a sequence an) 87B,E'" lim sup, lim limsupa n , lim a n Superior limit (of a sequence an) 87C* lim inf, \im lim inf an, lim an Inferior limit (of a sequence an) 87C* a  ,Cl E", E, CIE Closure (of a set E) 425B , i, G, Int E i , EO, IntE Interior (of a set E) 425B p,d p(x, y), d(x, y) Distance (between two points 273B'" X and y) II II Ilxll Norm (of x) 37B l.i.m. l.i.m.fn Limit in the mean (of a 168B sequence fn) 
Special Notation 1854 t\rticle ,lnd Notation Example Definition Section s-Iim s-Iim X n Strong limit (of a sequence x n ) 37B w-Iim w- Iimx n Weak limit (of a sequence x n ) 37E  fg Homotopy (of two mappings 202B f and g)  XY Homeomorphism (of two 425G topological spaces X and Y) VIII. Geometry and Algebraic Topology En Euclidean space (of dimension n) 140 pn Projective space 343B (of dimension n) sn Spherical surface 140 (of dimension n) Tn Torus (of dimension n) 422E H n H"(X, A) n-dimensional cohomology group 20lH (of X with coefficients in A) Hn Hn(X,A) n-dimensional homology group 20IG (of X with coefficients in A) Hn(C) (of chain complex C) 20lB nn nn(X) n-dimensional homotopy group 202J, 170 (of X) a ac Boundary (of C) 20m 15 15f Coboundary (of f) 201H* Sq Sqi x Streenrod square (of x) 64B f1J' f1J';(x) Steenrod pth power (of x) 64B Z1'-,Z2 Cup product (of Z I and Z 2) 2011 ZI'-"Z2 Cap product (of ZI and z2) 20lK /\ w /\ 1'/ Exterior product (of two 105Q* differential forms wand 1'/) d dw Exterior derivative (of a 105Q differential form w) grad grad cp Gradient (of a function cp) 442D rot rotu Rotation (of a vector u) 442D div divu Divergence (of a vector u) 442D  cp Laplacian (of a function cp) 323A D Dcp d' Alembertian (of a function cp) 130A D Dcp Differential operator 112A* D(ub...,u n ) lau i I (au) Jacobian determinant (of '208B ---det- (u I'"'' un) with respect to D(xl,...,x n )' aX j , aX j (x1,...,x n » a(u l ,.. .,u n ) , CUi ) Jacobian matrix (of (u b . .. , un) 208B a(x l ,..., X n ) aX j with respect to (XI'"'' X n » IX. Function Spaces C C(O) Space of continuous functions 168B(I) (on 0) 
Example Definition Lp(Q), Lp(a, b) Space of functions such that Ifl P is integrable on Q C I (L)(1  I  00) Space of functions of class C I (Q) Space of C" functions with compact support 1855 Notation Lp C I f!) Special Notation Article and Section 168B(2) 168B(9) 168B(13) $ $(Q) Space of C" functions 168B(13) [For d(Q), A(Q), Ap(Q), gB(Q)( = DLx(Q)), BMO(Rn), BV(Q), c, C, Co(Q), Coo(Q), C(Q), u(Q), {MP}(Q), (M,)(Q), ${MP}(Q), $(MP)(Q), Hp(Rn), HI(Q), H(Q), i\S(Rn), n2(IX(k»), L2 X (IX(k»), lp, L(p.q)(Q), m, M(Q), ((.I(Q), ((.Ip(Q), !/', s, S(Q), W;(Q) 168 Function Spaces. For gB(Q) (Space of Sato hyperfunctions), '(Q), $'(Q), ((.I" ((.1M' !/"(Rn) 125 Distributions and Hyperfunctions] X. Functions II Izl Re Rez 1m Imz z arg argz [ ] [IX] o f(x)=O(g(x)) o fix) = o(g(x)) fix)  g(x) D R D(T) R(T) supp f p.v. Sf(x)dx supp p.v. Pf b PfSf(x)dx b(X), b x exp expx log, Log logx, Logx sin x, cos x, tan x, sec x, cosec x, cotan x arc sin x, arc cos x, arc tan x Arcsinx, Arccosx, Arctanx Absolute value (of a complex number z) Real part (of a complex number z) Imaginary part (of a complex number z) Complex conjugate (of a complex number z) Argument (of a complex number z) Gauss symbol (greatest integer not exceeding a real number IX) Landau's notation (f(x)/g(x) is bounded for X->IX) Landau's notation (f(x)/g(x) tends to 0 for X->IX) Infinite or infinitesimal of the same order (for X->IX) Domain (of an operator T) Range (of an operator T) Support (of a function fj Cauchy's principal value (of an integral) Finite part (of an integral) Dirac's delta function (measure or distribution) Exponential function (expx = eX) Natural logarithmic function and its principal value, respectively Trigonometric functions Inverse trigonometric functions Principal value of inverse trigonometric functions 74B* 74A 74A* 74A 74C 83A 87G 87G 87G* 251A 251A 168B(I) 216D 125C 125C* 113D,269H 13ID, G 131E 131E 131E 
Special Notation 1856 Article and Notation Example Definition Section sinh x, cosh x, tanh x Hyperbolic functions J31F ( ).C G)' n C , Binomial coefficient, combination i30 P nPr Permutation 330 n! Factorial (of n) 330 cp cp(n) Euler function 295C* J1 J1(n) Mobius function 295C ( (z) Riemann zeta function 450B* J v Jv(z) Bessel function of the first kind 39B r r(x) Gamma function 174A B B(x,y) Beta function 174C F F(rx, {J, y; z) Gauss's hypergeometric function 206A f b '} P P}'J1VX Riemann's P function 253B A'Ii Vi Li Li(x) Logarithmic integral 167D XI. Probability P,Pr P(E), Pr(l:) Probability (of an event) 342B* E E(X) Mean or expectation (of a 342C random variable X) V, (J2 V(X), (J2(X) Variance (of a random variable X) :;42C p p(X, Y) Correlation coefficient (of two 342C* random variables X and Y) P(I) P(EIF) Conditional probability (of an 342E event E under the condition F) E(I) E(XIY) Conditional mean (of a random 342E variable X under the condition Y) N N(m, (J2) One-dimensional normal Appendix A, distribution (with mean m and II able 22 variance (J2) N(I1, E) Multidimensional normal dis- Appendix A, tribution (with mean vector 11 llable 22 and variance matrix E) P P(},) Poisson distribution (with Appendix A, parameter A) Table 22* 
Systematic List of Articles I Logic and Foundations 1. Foundations of Mathematics Art. 156 2. Axiom Systems 35 3. Paradoxes 319 4. Symbolic Logic 411 5. Axiomatic Set Theory 33 6. Model Theory 276 7. Nonstandard Analysis 293 8. G5del Numbers 185 9. Recursive Functions 356 10. Decision Problem 97 11. Constructive Ordinal Numbers 81 12. Analytic Sets 22 II Sets, General Topology, and Categories 1. Sets 381 2. Relations 358 3. Equivalence Relations 4. Functions 165 5. Axiom of Choice and Equivalents 6. Cardinal Numbers 49 7. Structures 409 8. Permutations and Combinations 9. Numbers 294 10. Real Numbers 355 11. Complex Numbers 12. Ordering 31 I 13. Ordinal Numbers 14. Lattices 243 15. Boolean Algebras 16. Topological Spaces 17. Metric Spaces 273 18. Plane Domains 333 19. Convergence 87 20. Connectedness 79 21. Dimension Theory 117 22. Uniform Spaces 436 23. Uniform Convergence 435 24. Categories and Functors 52 25. Inductive Limits and Projective Limits 210 26. Sheaves 135 34 330 74 312 42 425 383 III Algebra 1. Algebra 8 2. Matrices 269 3. Determinants 103 4. Polynomials 337 5. Algebraic Equations 10 6. Fields 149 7. Galois Theory 172 8. Linear Spaces 256 9. Rings 368 10. Associative Algebras 29 11. Commutative Rings 67 12. Noetherian Rings 284 13. Rings of Polynomials 369 14 Rings of Power Series 370 15. Quadratic Forms 16. Clifford Algebras 17. Differential Rings 18. Witt Vectors 449 19. Valuations 439 20. Adeles and Ideles 21. Cayley Algebras 22. Jordan Algebras 23. Modules 277 24. Homological Algebra 25. Hopf Algebras 203 Appendix A, Table 1. Algebraic Equations Art. 348 61 113 6 54 231 200 IV Group Theory 1. Groups 190 2. Abelian Groups 2 3 Free Groups 161 4. Finite Groups 151 5. Classical Groups 60 6. Topological Groups 423 7. Topological Abelian Groups 8. Compact Groups 69 9. Lie Groups 249 10. Lie Algebras 248 11. Algebraic Groups 13 12. Homogeneous Spaces 199 13. Symmetric Riemannian Spaces and Real Forms 412 14. Discontinuous Groups 15. Crystallographic Groups 16. Representations 362 17. Unitary Representations 437 18. Invariants and Covariants 226 Appendix A, Table 5. Lie Algebra,.Symmetric Riemannian Spaces, and Singulantles Appendix B, Table 5. Characters of Finite Groups and Crystallographic Groups 422 122 92 V Number Theory 1 Number Theory 296 2. Number Theory, Elementary 297 3. Continued Fractions 83 4. Number-Theoretic Functions 295 5. Additive Number Theory 4 6. Partitions of Numbers 328 7. Distribution of Prime Numbers 123 8. Lattice-Point Problems 242 9. Diophantine Equations 118 10. Geometry of Numbers 182 11. Transcendental Numbers 430 12. Quadratic Fields 347 13 Algebraic Number Fields 14 14. Class Field Theory 59 15. Complex Multiplication 73 16. Fermat's Problem 145 17. Local Fields 257 18. Arithmetic of Associative Algebras 27 19. Zeta Functions 450 Appendix B, Table 1. Prime Numbers and Primitive Roots Appendix B, Table 2. Indices Modulo p 
Systematic List of Articles Appendix B, Table 4. Class Numbers of Algebraic Number Fields VI Euclidean and Projective Geometry 1. Geometry Art. 181 2. Foundations of Geometry 3. Euclidean Geometry 139 4. Euclidean Spaces 140 5. Geometric Construction 6. Regular Polyhedra 357 7. Pi (n) 332 8. Trigonometry 9. Conic Sections 10. Quadric Surfaces 11. Convex Sets 89 12. Vectors 442 13. Coordinates 90 14. Projective Geometry 15. Affine Geometry 7 16. Non-Euclidean Geometry 17. Conformal Geometry 76 18. Erlangen Program 137 19. Continuous Geometry 85 20. Curves 93 21. Surfaces 410 22. Four-Color Problem 157 Appendix A, Table 2. Trigonometry Appendix A, Table 3. Vector Analysis and Coordinate Systems 155 179 432 78 350 343 285 VII Differential Geometry 1. Differential Geometry 2. Differentiable Manifolds 3. Riemannian Manifolds 4. Connections 80 5. Tensor Calculus 6. Geodesics 178 7. Symmetric Spaces 8. G-Structures 191 9. Complex Manifolds 72 10. Kahler Manifolds 232 11. Harmonic Integrals 194 12. Differential Geometry of Curves and Surfaces 111 13. Riemannian Submanifolds 14. Minimal Submanifolds 15. Harmonic Mappings 16. Morse Theory 279 17. Differential Geometry in Specific Spaces 110 18. Finsler Spaces 152 19. Integral Geometry 218 20. Siegel Domains 384 21. Spectral Geometry 391 22. Pseudoconformal Geometry 23. Global Analysis 183 Appendix A, Table 4. Differential Geometry 109 105 364 417 413 365 275 195 344 VIII Algebraic Geometry 1. Algebraic Geometry 12 2. Algebraic Curves 9 1858 3. Algebraic Surfaces Art. 15 4. Algebraic Varieties 16 5. Abelian Varieties 3 6. Riemann- Roch Theorems 366 IX Topology 1. Topology 426 2. Fundamental Groups 3. Covering Spaces 91 4. Degree of Mapping 99 5. Complexes 70 6. Homology Theory 201 7. Fixed-Point Theorems 8. Cohomology Operations 9. Homotopy Theory 202 10. Fiber Spaces 148 11: Obstructions 305 12. Topology of Lie Groups and Homo- geneous Spaces 427 13. Fiber Bundles 147 14. Characteristic Classes 15. K-Theory 237 16. Knot Theory 235 17. Combinatorial Manifol<is 65 18. Differential Topology 114 19. Transformation Groups 431 20. Theory of Singularities 418 21. Foliations 154 22. Dynamical Systems 23. Shape Theory 382 24. Catastrophe Theory 51 Appendix A, Table 5. Lie Algebra, Symmetric Riemannian Spaces, and Singularities Appendix A, Table 6. Topology Appendix A, Table 7. Knot Theory 170 153 64 56 126 X Analysis 1. Analysis 20 2. Continuous Functions 3. Inequalities 211 4. Convex Analysis 88 5. Functions of Bounded Variation 6. Differential Calculus 106 7. Implicit FunctIOns 208 8. Elementary Functions 131 9. Coo-Functions and Quasi-Analytic Functions 58 10. Integral Calculus 216 11. Curvilinear Integrals and Surface Integrals 94 12. Measure Theory 13. Integration Theory 14. Invariant Measures 15. Set Functions 380 16. Length and Area 246 17. Denjoy Integrals 100 18. Series 379 19. Asymptotic Series 30 20. Polynomial Approximation 21. Orthogonal Functions 317 22. Fourier Series 159 84 166 270 221 225 336 
1859 23. Fourier Transform Art. 160 24. Harmonic Analysis 192 25. Almost Periodic Functions 18 26. Laplace Transform 240 27. Integral Transforms 220 28. Potential Theory 338 29. Harmonic Functions and Subharmonic Functions 193 30. Dirichlet Problem 120 31. Capacity 48 32. Calculus of Variations 46 33. Plateau's Problem 334 34. Isoperimetric Problems 228 35. Variational Inequalities 440 Appendix A, Table 8. Inequalities Appendix A, Table 9. Differential and Integral Calcul us Appendix A, Table 10. Series Appendix A, Table 11. Fourier Analysis Appendix A, Table 12. Laplace Transforms and Operational Calculus Appendix A, Table 20. Systems of Orthogonal Functions XI Complex Analysis 1. Holomorphic Functions 198 2. Power Series 339 3. Dirichlet Series 121 4. Bounded Functions 43 5. Univalent and Multivalent Functions 438 6. Transcendental Entire Functions 7. Meromorphic Functions 272 8. Distribution of Values of Functions of a Complex Variable 124 9. Cluster Sets 62 10. Algebraic Functions 11. Algebroidal Functions 12. Riemann Surfaces 367 13. Ideal Boundaries 207 14. Conformal Mappings 77 15. Quasiconformal Mappings 16. Teichmiiller Spaces 416 17. Kleinian Groups 234 18. Extremal Length 143 19. Function-Theoretic Null Sets 169 20. Analytic Functions of Several Complex Variables 21 21. Analytic Spaces 23 22. Automorphic Functions 32 Appendix A, Table 13. Conformal Mappings 429 11 17 352 XII Functional Analysis 1. Functional Analysis 162 2. Hilbert Spaces 197 3. Banach Spaces 37 4. Ordered Linear Spaces 310 5. Topological Linear Spaces 424 6. Function Spaces 168 7. Distributions and Hyperfunctions 125 8. Vector-Valued Integrals 443 9. Linear Operators 251 Systematic List of Articles 10. Compact and Nuclear Operators Art. 68 11. Interpolation of Operators 224 12. Spectral Analysis of Operators 390 13. Perturbation of Linear Operators 331 14. Semigroups of Operators and Evolution Equations 378 15. Differential Operators 112 16. Microlocal Analysis 274 17. Banach Algebras 36 18. Function Algebras 164 19. Operator Algebras 308 20. Operational Calculus 306 21. Nonlinear Functional Analysis 286 XIII Differential, Integral, and Functional Equations 1. Differential Equations 107 2. Ordinary Differential Equations 313 3. Ordinary Differential Equations (Initial Value Problems) 316 4. Ordinary Differential Equations (Bound- ary Value Problems) 315 5. Ordinary Differential Equations (Asymp- totic Behavior of Solutions) 314 6. Linear Ordinary Differential Equations 252 7. Linear Ordinary Differential Equations (Local Theory) 254 8. Linear Ordinary Differential Equations (Global Theory) 253 9. Nonlinear Ordinary Differential Equations (Local Theory) 289 10. Nonlinear Ordinary Differential Equations (Global Theory) 288 11. Nonlinear Oscillation 290 12. Nonlinear Problems 291 13. Stability 394 14. Integral Invariants 219 15. Difference Equations 104 16. Functional-Differential Equations 163 17. Total Differential Equations 428 18. Contact Transformations 82 19. Partial Differential Equations 320 20. Partial Differential Equations (Methods of Integration) 322 21. Partial Differential Equations (Initial Value Problems) 321 22. Partial Differential Equations of First Order 324 23 Monge-Ampere Equations 278 24. Partial Differential Equations of Elliptic Type 323 25. Partial Differential Equations of Hyper- bolic Type 325 26. Partial Differential Equations of Parabolic Type 327 27. Partial Differential Equations of Mixed Type 326 28. Green's Functions 188 29. Green's Operators 189 30. Integral Equations 217 31. Integrodifferential Equations 222 32. Special Functional Equations 388 33. PseudodifferentialOperators 345 
Systematic List of Articles Appendix A, Table 14. Ordinary Differential Equations Appendix A, Table 15. Total and Partial Differential Equations XIV Special Functions 1. Special Functions Art. 389 2 Generating Functions 177 3. Elliptic Functions 134 4. Gamma Function 174 5. Hypergeometric Functions 206 6. Spherical Functions 393 7. Functions of Confluent Type 167 8 Bessel Functions 39 9. Ellipsoidal Harmonics 133 10. Mathieu Functions 268 Appendix A, Table 16. Elliptic Integrals and Elliptic Functions Appendix A, Table 17. Gamma Function and Related Functions Appendix A, Table 18. Hypergeometric Func- tions and Spherical Functions Appendix A, Table 19. Functions of Confluent Type and Bessel Functions Appendix B, Table 3. Bernoulli Numbers and Euler Numbers XV Numerical Analysis 1. Numerical Methods 300 2. Interpolation 223 3. Error Analysis 138 4. Numerical Solution of Linear Equations 302 5 Numerical Solution of Algebraic Equa- tions 301 6. Numerical Computation of Eigenvalues 298 7. Numerical Integration 299 8. Numerical Solution of Ordinary Differ- ential Equations 303 9. Numerical Solution of Partial Differ- ential Equations 304 10. Analog Computation 19 11. Evaluation of Functions 142 Appendix A, Table 21. Interpolation Appendix B, Table 6. Miscellaneous Constants XVI Computer Science and Combinatorics 1. Automata 31 2. Computers 75 3. Coding Theory 63 4. Cybernetics 95 5. Random Numbers 354 6. Simulation 385 7. Data Processmg 96 8. Mathematical Models in Biology 263 9. Complexity of Computations 71 10. Combinatorics 66 II Latin Squares 241 12. Graph Theory 186 1860 XVII Probability Theory I. Probability Theory Art. 342 2. Probability Measures 341 3. Limit Theorems in Probability Theory 250 4. Stochastic Processes 5. Markov Processes 6. Markov Chains 7. Brownian Motion 8. Diffusion Processes 9. Additive Processes 10. Branching Processes 11. Martingales 262 12. Stationary Processes 395 13. Gaussian Processes 176 14. Stochastic Differential Equations 15. Ergodic Theory 136 16. Stochastic Control and Stochastic Fil- tering 405 17. Probabilistic Methods in Statis1ical Me- chanics 340 407 261 260 45 115 5 44 406 XVIII Statistics 1. Statistical Data Analysis 2. Statistical Inference 401 3. Statistic 396 4. Sampling Distributions 5. Statistical Models 403 6. Statistical Decision Functions 7. Statistical Estimation 399 8. Statistical Hypothesis Testing 9. Multivariate Analysis 280 10. Robust and Nonparametric Methods 371 11. Time Series Analysis 12. Design of Experiments 13. Sample Survey 373 14. Statistical Quality Control 15. Econometrics 128 16. Biometrics 40 17. Psychometrics 346 18. Insurance Mathematics 214 Appendix A, Table 22. Distribution of Typical Random Variables Appendix A, Table 23. Statistical Estimation and Statistical Hypothesis Testing 397 374 398 400 421 102 404 XIX Mathematical Programming and Operations Research 1. Mathematical Programming 2. Linear Programming 255 3. Quadratic Programming 349 4. Nonlinear Programming 292 5. Network Flow Problems 2El 6. Integer Programming 215 7. Stochastic Programming 408 8. Dynamic Programming 12/' 9. Game Theory 173 10. Differential Games 11. Control Theory 264 108 86 
1861 12. Information Theory Art. 213 13. Operations Research 307 14. Inventory Control 227 15. Scheduling and Production Planning 376 XX Mechanics and Theoretical Physics 1. Systems of Units 414 2. Dimensional Analysis 3. Variational Principles 4. Mechanics 271 5. Spherical Astronomy 6 Celestial Mechanics 7. Orbit Determination 8. Three-Body Problem 9 Hydrodynamics 205 10 Hydrodynamical Equations 204 11. Magnetohydrodynamics 259 12. Turbulence and Chaos 433 13. Wave Propagation 446 14. Oscillations 318 15 Geometric Optics 16 Electromagnetism 17. Networks 282 18. Thermodynamics 419 19. Statistical Mechanics 20. Boltzmann Equation 21. Relativity 359 22 Unified Field Theory 23. Quantum Mechanics 24. Lorentz Group 258 25. Racah Algebra 353 26. Scattering Theory 375 27. Second Quantization 377 28. Field Theory 150 29. S-Matrices 386 30. Feynman Integrals 146 31. Elementary Particles 132 32. Renormalization Group 361 33. Nonlinear Lattice Dynamics 34. Solitons 387 35. Approximation Methods in Physics 25 36. Inequalities in Physics 116 441 392 55 309 420 180 130 402 41 434 351 287 212 XXI History of Mathematics 1. Ancient Mathematics 24 2. Greek Mathematics 187 3. Roman and Medieval Mathematics 372 4. Arab Mathematics 26 5. Indian Mathematics 209 6. Chinese Mathematics 57 7. Japanese Mathematics (Wasan) 230 8. Renaissance Mathematics 360 9. Mathematics in the 17th Century 265 10 Mathematics in the 18th Century 266 11. Mathematics in the 19th Century 267 12. Abel, Niels Henrik 1 13. Artin, Emil 28 Systematic List of Articles 14. Bernoulli Family 15. Cantor, Georg 16. Cartan, Elie 50 17. Cauchy, Augustin Louis 53 18. Dedekind, Julius Wilhelm Richard 19. Descartes, Rene 101 20. Dirichlet, Peter Gustav Lejeune 21. Einstein, Albert 129 22 Euler, Leonhard 141 23. Fermat, Pierre de 144 24. Fourier, Jean Baptiste Joseph 25. Galois, Evariste 171 26. Gauss, Carl Friedrich 27. G6del, Kurt 184 28. Hilbert, David 196 29 Jacobi, Carl Gustav Jacob 30. Klein, Felix 233 31. Kronecker, Leopold 236 32. Lagrange, Joseph Louis 238 33. Laplace, Pierre Simon 239 34. Lebesgue, Henri Leon 244 35. Leibniz, Gottfried Wilhelm 36. Lie, Marius Sophus 247 37. Newton, Isaac 283 38. Pascal, Blaise 329 39. Poincare, Henri 335 40. Riemann, Georg Friedrich Bernhard 363 41 Takagi, Teiji 415 42. Viete, Fran90is 444 43. Von Neumann, John 445 44. Weierstrass, Karl 447 45. Weyl, Hermann 448 Art. 38 47 98 119 158 175 229 245 
Alphabetical List of Articles 1. Abel, Niels Henrik 2. Abelian Groups 3. Abelian Varieties 4. Additive Number Theory 5. Additive Processes 6. Adeles and Ideles 7. Affine Geometry 8. Algebra 9. Algebraic Curves 10. Algebraic Equations 11. Algebraic Functions 12. Algebraic Geometry 13. Algebraic Groups 14. Algebraic Number Fields 15. Algebraic Surfaces 16. Algebraic Varieties 17. Algebroidal Functions 18. Almost Periodic Functions 19. Analog Computation 20. Analysis 21. Analytic Functions of Several Complex Variables 22. Analytic Sets 23. Analytic Spaces 24. Ancient Mathematics 25. Approximation Methods in Physics 26. Arab Mathematics 27. Arithmetic of Associative Algebras 28. Artin, Emil 29. Associative Algebras 30. Asymptotic Series 31. Automata 32. Automorphic Functions 33. Axiomatic Set Theory 34. Axiom of Choice and Equivalents 35. Axiom Systems 36. Banach Algebras 37. Banach Spaces 38. Bernoulli Family 39. Bessel Functions 40. Biometrics 41. Boltzmann Equation 42. Boolean Algebras 43 Bounded Functions 44. Branching Processes 45. Brownian Motion 46. Calculus of Variations 47. Cantor, Georg 48. Capacity 49. Cardinal Numbers 50. Cartan, Elie 51. Catastrophe Theory 52. Categories and Functors 53. Cauchy, Augustin Louis 54. Cayley Algebras 55. Celestial Mechanics 56. Characteristic Classes 57. Chinese Mathematics 58. COO-Functions and Quasi-Analytic Func- tions 59. Class Field Theory 60. Classical Groups 61. Clifford Algebras 62. Cluster Sets 63. Coding Theory 64. Cohomology Operations 65. Combinatorial Manifolds 66. Combinatorics 67. Commutative Rings 68. Compact and Nuclear Operators 69. Compact Groups 70. Complexes 71. Complexity of Computations 72. Complex Manifolds 73. Complex Multiplication 74. Complex Numbers 75. Computers 76. Conformal Geometry 77. Conformal Mappings 78. Conic Sections 79. Connectedness 80. Connections 81. Constructive Ordinal Numbers 82. Contact Transformations 83. Continued Fractions 84. Continuous Functions 85. Continuous Geometry 86. Control Theory 87. Convergence 88. Convex Analysis 89. Convex Sets 90. Coordinates 91. Covering Spaces 92. Crystallographic Groups 93. Curves 94. Curvilinear Integrals and Surface Inte- grals 95. Cy bernetics 96. Data Processing 97. Decision Problem 98. Dedekind, Julius Wilhelm Richard 99. Degree of Mapping 100. Denjoy Integrals 101. Descartes, Rene 102. Design of Experiments 103. Determinants 104. Difference Equations 105. Differentiable Manifolds 106. Differential Calculus 107. Differential Equations 108. Differential Games 109. Differential Geometry 110. Differential Geometry in Specific Spaces 111. Differential Geometry of Curves and Surfaces 112. Differential Operators 113. Differential Rings 114. Differential Topology 115. Diffusion Processes 116. Dimensional Analysis 
Alphabetical List of Articles 117. Dimensional Theory 118. Diophantine Equations 119. Dirichlet, Peter Gustav Lejeune 120. Dirichlet Problem 121. Dirichlet Series 122. Discontinuous Groups 123. Distribution of Prime Numbers 124 Distribution of Values of Functions of a Complex Variable 125. Distributions and Hyperfunctions 126. Dynamical Systems 127. Dynamic Programming 128. Econometrics 129. Einstein, Albert 130. Electromagnetism 131. Elementary Functions 132. Elementary Particles 133. Ellipsoidal Harmonics 134 Elliptic Functions 135. Equivalence Relations 136. Ergodic Theory 137 Erlangen Program 138. Error Analysis 139. Euclidean Geometry 140. Euclidean Spaces 141. Euler, Leonhard 142. Evaluation of Functions 143. Extremal Length 144. Fermat, Pierre de 145. Fermat's Problem 146. Feynman Integrals 147. Fiber Bundles 148. Fiber Spaces 149 Fields 150 Field Theory 151. Finite Groups 152. Finsler Spaces 153. Fixed-Point Theorems 154. Foliations 155. Foundations of Geometry 156. Foundations of Mathematics 157. Four-Color Problem 158. Fourier, Jean Baptiste Joseph 159. Fourier Series 160. Fourier Transform 161. Free Groups 162. Functional Analysis 163. Functional-Differential Equations 164. Function Algebras 165. Functions 166. Functions of Bounded Variation 167. Functions of Confluent Type 168. Function Spaces 169. Function-Theoretic Null Sets 170. Fundamental Groups 171. Galois, Evariste 172. Galois Theory 173. Game Theory 174. Gamma Function 175. Gauss, Carl Friedrich 176. GaussIan Processes 1864 177. Generating Functions 178. Geodesics 179. Geometric Construction 180. Geometric Optics 181. Geometry 182 Geometry of Numbers 183. Global Analysis 184. G6del, Kurt 185. G6del Numbers 186. Graph Theory 187. Greek Mathematics 188. Green's Functions 189. Green's Operator 190. Groups 191. G-Structures 192. Harmonic Analysis 193 Harmonic Functions and Subharmonic Functions 194. Harmonic Integrals 195. Harmonic Mappings 196. Hilbert, David 197. Hilbert Spaces 198. Holomorphic Functions 199 Homogeneous Spaces 200. Homological Algebra 201. Homology Theory 202. Homotopy Theory 203. Hopf Algebras 204. Hydrodynamical Equations 205. Hydrodynamics 206. Hypergeometric Functions 207. Ideal Boundaries 208. ImplicIt Functions 209. Indian Mathematics 210. Inductive Limits and Projective Limits 211. Inequalities 212. Inequalities in Physics 213. Information Theory 214. Insurance Mathematics 215. Integer Programming 216. Integral Calculus 217. Integral Equations 218. Integral Geometry 219. Integral Invariants 220. Integral Transforms 221. Integration Theory 222. Integrodifferential Equations 223. Interpolation 224. Interpolation of Operators 225. Invariant Measures 226. Invariants and Covariants 227. Inventory Control 228. Isoperimetric Problems 229. Jacobi, Carl Gustav Jacob 230. Japanese Mathematics (Wasan) 231. Jordan Algebras 232. Kahler Manifolds 233. Klein, Felix 234. Kleinian Groups 235. Knot Theory 236. Kronecker, Leopold 
1865 237. K-Theory 238. Lagrange, Joseph Louis 239 Laplace, Pierre Simon 240. Laplace Transform 241. Latin Squares 242 Lattice- Point Problems 243. Lattices 244. Lebesgue, Henri Leon 245. Leibniz, Gottfried Wilhelm 246 Length and Area 247 Lie, Marius Sophus 248. Lie Algebras 249. Lie Groups 250. Limit Theorems in Probability Theory 251. Linear Operators 252. Linear Ordinary Differential Equations 253. Linear Ordinary Differential Equations (Global Theory) 254. Linear Ordinary Differential Equations (Local Theory) 255. Linear Programming 256. Linear Spaces 257. Local Fields 258. Lorentz Group 259. Magnetohydrodynamics 260. Markov Chains 261. Markov Processes 262 Martingales 263. Mathematical Models in Biology 264. Mathematical Programming 265 Mathematics in the 17th Century 266. Mathematics in the 18th Century 267. Mathematics in the 19th Century 268. Mathieu Functions 269. Matrices 270. Measure Theory 271. Mechanics 272. Meromorphic Functions 273. Metric Spaces 274. Microlocal Analysis 275. Minimal Submanifolds 276. ModeJ Theory 277. Modules 278. Monge-Ampere Equations 279. Morse Theory 280. Multivariate Analysis 281. Network Flow Problems 282. Networks 283. Newton, Isaac 284. Noetherian Rings 285. Non-Euclidean Geometry 286. Nonlinear Functional Analysis 287. Nonlinear Lattice Dynamics 288 Nonlinear Ordinary Differential Equa- tions (Global Theory) 289. Nonlinear Ordinary Differential Equa- tions (Local Theory) 290. Nonlinear Oscillation 291. Nonlinear Problems 292. Nonlinear Programming 293 Nonstandard Analysis 294. Numbers Alphabetical List of Articles 295. Number-Theoretic Functions 296. Number Theory 297. Number Theory, Elementary 298. Numerical Computation of Eigenvalues 299. Numerical Integration 300. Numerical Methods 301. Numerical Solution of Algebraic Equa- tions 302. Numerical Solution of Linear Equations 303. Numerical Solution of Ordinary Differ- ential Equations 304. Numerical Solution of Partial Differential Equations 305 Obstructions 306. Operational Calculus 307. Operations Research 308. Operator Algebras 309. Orbit Determination 3 I O. Ordered Linear Spaces 311. Ordering 312. Ordinal Numbers 313. Ordinary Differential Equations 314. Ordinary Differential Equations (Asymp- totic Behavior of Solutions) 315. Ordinary Differential Equations (Bound- ary Value Problems) 316. Ordinary Differential Equations (Initial Value Problems) 3 I 7. Orthogonal Functions 318. OscilIations 319. Paradoxes 320. Partial Differential Equations 321. Partial Differential Equations (Initial Value Problems) 322. Partial Differential Equations (Methods of Integration) 323. Partial Differential Equations of Elliptic Type 324. Partial Differential Equations of First Order 325. Partial Differential Equations of Hyper- bolic Type 326. Partial Differential Equations of Mixed Type 327. Partial Differential Equations of Para- bolic Type 328. Partitions of Numbers 329. Pascal, Blaise 330. Permutations and Combinations 331. Perturbation of Linear Operators 332. Pi (n) 333. Plane Domains 334. Plateau's Problem 335. Poincare, Henri 336. Polynomial Approximation 337. Polynomials 338. Potential Theory 339. Power Series 340. Probabilistic Methods in Statistical Mechanics 341. Probability Measures 342. Probability Theory 343. Projective Geometry 344. Pseudoconformal Geometry 
Alphabetical List of Articles 345. Pseudodifferential Operators 346. Psychometrics 347. Quadratic Fields 348. Quadratic Forms 349. Quadratic Programming 350. Quadric Surfaces 351. Quantum Mechanics 352. Quasiconformal Mappings 353. Racah Algebra 354. Random Numbers 355. Real Numbers 356. Recursive Functions 357. Regular Polyhedra 358. Relations 359. Relativity 360. Renaissance Mathematics 361. Renormalization Group 362. Representations 363. Riemann, Georg Friedrich Bernhard 364. Riemannian Manifolds 365. Riemannian Submanifolds 366. Riemann- Roch Theorems 367. Riemann Surfaces 368. Rings 369. Rings of Polynomials 370. Rings of Power Series 371. Robust and Nonparametric Methods 372. Roman and Medieval Mathematics 373. Sample Survey 374. Sampling Distributions 375. Scattering Theory 376. Scheduling and Production Planning 377. Second Quantization 378. Semigroups of Operators and Evolution Equations 379. Series 380. Set Functions 38 I. Sets 382. Shape Theory 383. Sheaves 384. Siegel Domains 385. Simulation 386. S-Matrices 387 Solitons 388. Special Functional Equations 389. Special Functions 390. Spectral Analysis of Operators 391. Spectral Geometry 392. Spherical Astronomy 393. Spherical Functions 394. Stability 395. Stationary Processes 396. Statistic 397. Statistical Data Analysis 398. Statistical Decision Functions 399. Statistical Estimation 400. Statistical Hypothesis Testing 401. Statistical Inference 402. Statistical Mechanics 403. Statistical Models 404. Statistical Quality Control 1866 405. Stochastic Control and Stochastic Fil- tering 406. Stochastic Differential Equations 407. Stochastic Processes 408. Stochastic Programming 409. Structures 410. Surfaces 411. Symbolic Logic 412. Symmetric Riemannian SpaCls and Real Forms 413. Symmetric Spaces 414. Systems of Units 415. Takagi, Teiji 416. Teichmiiller Spaces 417. Tensor Calculus 418. Theory of Singularities 419. Thermodynamics 420. Three-Body Problem 421. Time Series Analysis 422. Topological Abelian Groups 423. Topological Groups 424. Topological Linear Spaces 425. Topological Spaces 426. Topology 427. Topology of Lie Groups and Homo- geneous Spaces 428. Total Differential Equations 429. Transcendental Entire Fun,;tions 430. Transcendental Numbers 431. Transformation Groups 432. Trigonometry 433. Turbulence and Chaos 434. Unified Field Theory 435. Uniform Convergence 436. S-Matrices. 437. Unitary Representations 438. Univalent and MuItivalen1 Functions 439. Valuations 440. Variational Inequalities 441. Variational Principles 442. Vectors 443. Vector-Valued Integrals 444. Viete, Francois 445. Von Neumann, John 446. Wave Propagation 447. Weierstrass, Karl 448. Weyl, Hermann 449. Witt Vectors 450. Zeta Functions 
Contributors to the Second Edition Abe Eiichi Akahira Masafumi Akaike Hirotugu Akaza Touru Amari Shun-ichi Ando Tsuyoshi Araki H uzihiro Araki ShOr6 Arimoto Suguru Asai Akira Fuji'i'e Tatuo Fujiki Akira Fujisaki Genjiro Fujita Hiroshi FUjiwara Daisuke Fujiwara Masahiko Fukushima Masatoshi Furuya Shigeru Hamada Yusaku Hasumi Morisuke Hattori Akio Hayashi Kiyoshi Hida Takeyuki Hijikata Hiroaki Hirose Ken Hitotumatu Sin Huzii Mituaki Ibaraki Toshihide Ihara Shin-ichiro Iitaka Shigeru Ikawa Mitsuru Ikebe Teruo Ikebe Yasuhiko Ikeda Nobuyuki Inagaki Nobuo Inoue Atsushi Iri Masao Itaya Nobutoshi It6 Kiyosi It6 Seiz6 Ito Yuji Itoh Mitsuhiro Iwahori Nobuyoshi Iyanaga Sh6kichi Kadoya Norihiko Kageyama Sampei Kamae Teturo Kaneko Akira Kaneyuki Soji Kasahara K6ji Kataoka Shinji Kato JunJi Kato Mitsuyoshi Kawai Takahiro Kawasaki Tetsuro Kimura Tosihusa Kishi Masanori Kobayashi Shoshichi Kodama Yukihiro Komatsu Hikosaburo K6mura Yukio Kondo Takeshi Konishi Yoshio Konno Hiroshi Kotani Shinichi Kubota Yoshihisa Kumano-go Hajime Kunita Hiroshi Kuramoto Yuki Kuroda Shige-Toshi Kuroda Tadashi Kusama Tokitake Kusano Takasi Kusunoki Yukio Mabuchi Toshiki Maeda Shliichiro Maeda Yoshiaki Maehara Sh6ji Makabe Hajime Maruyama Masaki Matsumoto Kikuji Matsumoto Yukio Matsumura Mutsuhide Mitsui Takayoshi Mizohata Sigeru Mizutani Tadayoshi Mori Masatake Morimoto Akihiko Morimura Hidenori Morita Shigeyuki Morita Yasuo Motohashi Nobuyoshi Murasugi Kunio Nagami Keio Nagata Masayoshi Nakagawa Hisao Nakamura Tokushi Nakaoka Minoru Namba Kanji Namba Makoto Namikawa Yukihiko Naruki Isao Niiro Fumio Nishikawa Seiki Nishimura Junichi Nishitani Kensaku Nisio Makiko Noguchi Hiroshi Nozaki Akihiro Oaku Toshinori Obata Morio Ochiai Takushiro Ogiue Koichi Ohta Masami Ohya Yujiro Oikawa Kotaro Oka Mutsuo Okabe Yasunori 
Contributors to the Second Edition 1868 Okuno Tadakazu Omori Hideki Oshima Toshio Ozawa Mitsuru Saito Kyoji Saito Masahiko Saito Tosiya Sakai Fumio Sakawa Y oshiyuki Sato F umitaka Sato Ken-iti Sawashima Ikuko Shibata Katsuyuki Shibata Keiichi Shiga KOJi Shiga Tokuzo Shikata Y oshihiro Shimada Nobuo Shimakura Norio Shimizu Ryoichi Shioda Tetsuji Shiohama Katsuhiro Shiraiwa Kenichi Shiratani Katsumi Sibuya Masaaki Sugie Toru Sugiura Mitsuo Sugiura Shigeaki Suita Nobuyuki Sumihiro Hideyasu Suzuki Masuo Suzuki Mitsuo Takahashi Tsuneo Takakuwa Shoichiro Takami Hideo Takasu Toru Takeuchi Kei Takeuchi Masaru Takeuti Gaishi Tamura Itiro Tanabe Kunio Tanaka Hiroshi Tanaka Masatsugu Tanaka Shunichi Tanno Shukichi Tatsumi Tomomasa Terada Toshiaki Toda Hideo Toda Hirosi Toda Morihzu Toda Nobushige Tomiyama Jun Totoki Haruo Tuge Tosiyuki Uchida Fuichi Uchiyama Saburo Ueno Kenji Ukai SeiJi Ura Shoji Ushijima Teruo VVakabayashi Isao VVashizu Kyuichiro VVatanabe Kimio VV atanabe Shinzo VVatanabe Takesi Yamaguchi Masaya Yamamoto Koichi Yamamoto Sumiyasu Yamamoto Tetsuro Yamazaki Masao Yamazaki Tadashi Yanagawa Takashi Yanagiwara Hiroshi Yanai Haruo Yano Tamaki Yokonuma Takeo Yoshikawa Atsushi Yoshida J unzo Yo sid a Kosaku 
Contributors to the First Edition Inaba Eiji Inagaki Takeshi Inoue Masao Inui Teturo Iri Masao Irie Shoji Ise Mikio lseki Kanesiroo Ishida Yasushi Ishii Goro Isizu Takehiko I to J uniti Ito Kiyosi Ito Noboru Ita Seiza Ito Teiiti Ito Tsutomu Itoh Makoto Iwahashi Ry6suke Iwahori Nagayoshi Iwamura Tsurane Iwano Masahiro Iwasawa Kenkichi Iwata Giiti Iyanaga Shokichi Izumi Shin-ichi Kaburaki Masaki Kamae Keiko Kametani Shunji Kanbe Tsunekazu Kanitani J6y6 Kasahara Koji Katase Kiyoshi Kato Tosio Katuura Sutezo Kawada Yukiyosi Kawaguchi Akitsugu Kawai Saburo Kawakami Riiti Kawata Tatuo Kihara Taro Kimura Motoo Kimura Tosihusa Kinoshita Shin'ichi Kitagawa Toshio Kiyasu Zen'ichi Koba Ziro Kobayashi Mikio Kobori Akira Koda Akira Kodaira Kunihiko Koga Yukiyoshi Komatsu Hikosaburo Komatu Atuo Komatu Yusaku Kondo Jiro Kondo Kazuo Kondo Motokiti Kondo Ryoji Kondo Takeshi Kondo Y6itsu Aizawa Sadakazu Akao Kazuo Akizuki Yasuo Amemiya Ayao Anzai Hirotada Aoki Kiyoshi Araki Huzihiro Araki Shoro Arima Akito Asaka Tetsuichi Asano Keizo Asatani Teruo Azumaya Goro FUJinaka Megumu Fujita Hiroshi FUjiwara Daisuke Fukuda Takeo Fukutomi Setsuo Furuya Shigeru Goto Morikuni Goto Motinori Got6 Yiizo Hagihara Yusuke Harada Manabu Hasimoto Hidenori Hattori Akio Hattori Akira Hayashi Chikio Hayashi Keiichi Hayashi Tsuyoshi Hida Takeyuki Hidaka Koji Hirakawa Junko Hirano Tomoharu Hirasawa Y oshikazu Hirayama Akira Hirose Ken Hisatake Masao Hitotumatu Sin Hokari ShisanJi Homma Kiyomi Homma Tatsuo Honbu Hitoshi Hong Imsik Hosokawa Fujitsugu Hukuhara Masuo Husimi Kozi Huzii Mituaki Ichida Asajiro Ihara Shin-ichiro Ihara Yasutaka Iitaka Shigeru lkawa Mitsuru Ikeda Mineo Ikeda Nobuyuki Ikehara Shikao Imai Isao 
Contributors to the First Edition 1870 Kota Osamu Kotake Takesi Kotani Masao Kozai Y oshihide Kubo Izumi Kubo Ryogo Kubota Tadahiko Kudo Akio Kudo Hirokichi Kudo Tatsuji Kuga Michio Kunisawa Kiyonori Kunita Hiroshi Kunugi Kinjiro Kuranishi Masatake Kuroda Shige-Toshi Kuroda Sigekatu Kuroda Sige-Nobu Kusama Tokitake Kusano Takasi Kusunoki Yukio Maehara Shoji Maruyama Gisiro Mashio Shinji Masuda Kyuya Masuyama Motosaburo Matsukuma Ryozai Matsumoto Kikuji Matsumoto Makoto Matsumoto Toshizo Matsumura Hideyuki Matsumura Mutsuhide Matsushima Y ozo Matsuyama Noboru Matuda Tizuko Matusita Kameo Matuzaka Kazuo Mikami Masao Mimura Yukio Minagawa Takizo Mita Hiroo Mitome Michio Mitsui Takayoshi Miyake Kazuo Miyasawa Koichi Miyashita Totaro Miyazawa Hironari Miyazima Tatuoki Mizohata Sigeru Mizuno Katsuhiko Mogi Isamu Mori Shigeo Moriguti Sigeiti Morimoto Haruki Morimoto Seigo Morimura Hidenori Morishima Taro Morita Kiiti Morita Yuzo Moriya Mikao Moriya Tomijiro Motohashi Nobuyoshi Motoo Minoru Murakami Shingo Muramatu Tosinobu Murata Tamotsu Nabeya Seiji Nagano Tadashi Nagao Hirosi Nagashima Takashi Nagata Jun-iti Nagata Masayoshi Nagumo Mitio Nakai Mitsuru Nakai Yoshikazu Nakamori Kanji Nakamura Koshiro Nakamura Masahiro Nakamura Tokushi Nakanishi Shizu Nakano Shigeo Nakaoka Minoru Nakayama Tadasi Nakayama Takashi Namikawa Yukihiko Nanba Kanji Narita Masao Narumi Seimatu Niiro Fumio Nikaido Hukukane Nishi Mieo Nishimura Toshio Noguchi Hiroshi Nomizu Katsumi Nomoto Hisao Nomura Yukichi N onaka T oshio Noshiro Kiyoshi Nozaki Akihiro Nozaki Yasuo Ochiai Kiichiro Ogasawara Tojiro Ogawa JunJiro Ogawara Masami Ogino Shusaku Ohasi Saburo Ohtsuka Makoto Oikawa Kotaro Oishi Kiyoshi Oka Syoten Okamoto Kiyosato Okamoto Masashi Okubo Kenjiro Okubo Tanjiro Okugawa Kotaro Onari Setsuo Ono Akimasa Ono Katuzi Ono Shigeru Ono Takashi Onoyama Takuji Oshida Isao 
1871 Contributors to the First Edition Oshima Nobunori Osima Masaru Otsuki Tominosuke Ozaki Shigeo Ozawa Mitsuru Ozeki Hideki Saito Kinichiro Saito Tosiya Saito Y oshihiro Sakai Eiichi Sakai Takuzo Sakamoto Heihachi Sakata Shoichi Tanaka Hiroshi Tanaka Hisao Tanaka Minoru Tanaka Sen-ichiro Tannaka Tadao Tatsumi Tomomasa Tatuzawa Tikao Terasaka Hidetaka Toda Hideo Toda Hirosi Toda Morikazu Toki Yukinari Tomotika Susumu __1,: t..:___ dd1U lllgt:u Satake Ichiro Sato Ken-iti Sato Ryoichiro Sato Tokui Sato Yumiko Sawashima Ikuko Seimiya Toshio Seki Setsuya Shibagaki Wasao Shiga Koji Shimada Nobuo Shimizu Hideo Shimizu Tatsujiro Shimura Goro Shintani Takuro Shizuma Ryoji Shoda Kenjiro Shono Shigekata Sibuya Masaaki Simauti Takakazu Sirao Tunekiti Sueoka Seiichi Suetuna Zyoiti Sugawara Masahiro Sugawara M asao Sugiura Mitsuo Sugiura Nariaki Sugiyama Shohei Suita Nobuyuki Sumitomo Takeshi Sunouchi Genichiro Suzuki Michio Suzuki Y oshindo Takada Masaru Takagi Teiji Takagi Yutaka Takahashi Hidetoshi Takahashi Tsunero Takenouchi Osamu Takenouchi Tadao Takeuchi Kei Takeuti Gaisi Takizawa Seizi Tamagawa Tsuneo Tamura Hiro Tamura Jiro Tanaka Chuji T otaki Haiuo Toyama Hiraku Tsuboi Chuji Tsuchikura Tamotsu Tsuji Masatsugu Tsujioka Kunio Tsukamoto Yotaro Tsuzuku Toshiro Tugue Tosiyuki Tumura Y osiro Uchiyama Saburo Uchiyama Tatsuo Udagawa Masatomo Ueno Kenji Ueno Tadashi Ugaeri Tadashi Umegaki Hisaharu Umezawa Hiroomi U mezawa Toshio Uno Toshio U ra Shoji Ura Taro Urabe Minoru Wada Yasaku Washimi Shinichi Washio Yasutoshi Washizu Kyuichiro Watanabe Hiroshi Watanabe Hisao Watanabe Masaru Watanabe Shinzo Yamada Isamu Yamaguti Masaya Yamamoto Koichi Yamanaka Takesi Yamanoshita Tsuneyo Yamanouti Takahiko Yamashita Hideo Yamauti Ziro Yamazaki Keijiro Yanagihara Kichiji Yano Kentaro Yano Shigeki Y oneda Nobuo Y oshizawa Hisaaki Y osida Kosaku Y osida Setuzo Y osida Y oiti 
Translators (for the First Edition) Komatu Yusaku Kondo Ryoji Kondo Takeshi Kozai Y oshihide Kubo Ryogo Kubota Tomio Kudo Akio Kudo Hirokichi Kumano-go Hitoshi Kunisawa Kiyonori Kunita Hiroshi Kurita Minoru Kuroda Shige-Toshi Kusama Tokitake Kusano Takasi Maruyama Gisiro Matsuda Michihiko Matsumoto Kikuji Matsumoto Makoto Matsumura Hideyuki Matsushima Yozo Mimura Yukio Miyazawa Hironari Mizohata Sigeru Mori Toshio Morimoto Haruki Morimura Hidenori Morita Kiiti Morita Yasuo Motoo Minoru Murakami Shin go Muramatu Tosinobu Murase Ichiro Nagao Hirosi Nagasawa Masao Nagata Masayoshi Nakai Mitsuru Nakai Y oshikazu Nakamura Tokushi Nakanishi Shizu Nakano Shigeo Nakaoka Minoru Namikawa Yukihiko Namioka Isaac Nanba Kanji Niiro Fumio Nikaido Hukukane Nishi Mieo Nishijima Kazuhiko Nishimiya Han Noguchi Hiroshi Nonaka Toshio Noshiro Kiyoshi Nozaki Akihiro Obayashi Tadao Ogino Shusaku Ohtsuka Makoto Oikawa Kotaro Okamoto Masashi Okubo Kenjiro Okugawa Kotaro Adachi Masahisa Aizawa Sadakazu Akizuki Yasuo Aka Kiyoshi Araki Shara Arima Akito Asano Keizo Azumaya Goro Chiba Katsuhiro FUJisawa Takehisa Fujita Hiroshi Fukada Ichiro Fukuyama Masaru Furuya Shigeru Hashimoto Yasuko Hasimoto Hidenori Hasumi Morisuke Hattori Akio Hattori Akira Hayashi Kazumichi Hida Takeyuki Hinata Shigeru Hirasawa Yoshikazu Hirose Ken Hitotumatu Sin Honda Taira Hong Imsik Hosoi Tsutomu Hukuhara Masuo Ihara Shin-ichiro Ihara Yasutaka Ikeda Mineo Ikeda Nobuyuki Imai Isao Inoue Masao Iri Masao Ishii Goro Ita Seiza Ito Yuji Iwahashi Ryosuke Iwahori Nagayoshi Iwarmura Tsurane Iwano Masahiro Iwata Giichi Iyanaga Kenichi Iyanaga Shakichi Kametani Shunji Kasahara Koji Kato Junji Kawada Yukiyosi Kawano Sanehiko Kimura Tosihusa Kitagawa Toshio Kobori Akira Kodaira Kunihiko Komatsu Hikosaburo Komatu Atuo 
Translators (for the First Edition) 1874 Ono Katuzi Ono Shigeru Osima Masaru Ozawa Mitsuru Ozeki Hideki Saito Tosiya Sakai Shoichiro Sakamoto Minoru Sasaki Shigeo Satake Ichiro Sato Kenkichi Sawashima lkuko Shibuya Masaaki Shiga K oji Shimada Nobuo Shintani Takuro Shioda Tetsuji Shiraiwa Kenichi Shiratani Katsumi Shizuma Ryoji Sima uti Takakazu Sugawara Masahiro Sugiura Mitsuo Sugiyama Shohei Suita Nobuyuki Sumitomo Takeshi Sunouchi Genichiro Suzuki Michio Takahashi Moto-o Takahashi Reiji Takahashi Tsunero Takami Hideo Takaoka Seiki Takenouchi Osamu Takeuchi Kei Takeuti Gaisi Tamura Hiro Tanaka Hiroshi Tanaka Hisao Tanaka Minoru Tatsumi Tomomasa Tatuzawa Tikao Terasaka Hidetaka Toda Hideo Toda Hirosi Tsuka-da Nobutaka Tsukamoto Yotaro Tugue Tosiyuki Uchiyama Saburo Uesu Tadahiro Umegaki Hisaharu Umezawa Toshio Uno Toshio Ura Shoji Ura Taro Urabe Minoru Uzawa Hirobumi Wada Junzo Watanabe Hiroshi Watanabe Hisao Watanabe Shinzo Watanabe Takesi Yamaguti Masaya Yamamoto Koichi Yamanaka Takesi Yamazaki Keijiro Yano Kentaro Yano Shigeki Yoshizawa Taro Y osida Kosaku Y osida Setuzo 
Name Index Note. Citation is to article and section, not to page. r = references; STR = Statistical Tables for Reference (to be found after App. B); NTR = Numerical Tables for Reference. . means that the article is not divided into sections. A Abadie, Jean M. (1919-) 292.r Abe, Eiichi (1927-) l3.R Abe, Kinetsu(1941-) 365L Abel, Niels Henrik (1802-29) 12 A, B, C 3.A, B, G, J, L, M 8 10.D I I.B, C, E 20 21.B 52.B, N 60.L 73.A 121.D 136.B 172.A, B, G, H 190.A, H, Q 201.R 202.N 217.A, L 240.G 267308 E 339.B 367.H 379.0, F, K, N 383.B 388.D 422.A, E Abers, Ernest S. 132.r Aberth, Oliver George (1929-) 30LF Abhyankar, Shreeram Shankar (1930-) 15 B 16 L 23.r Abikoff, WiIliam (1944-) 234.E Abraham, C. T. 96.F Abraham, Ralph H. (1936-) 126.1, r 183 271.r 316.r 420.r Abramov, Leonid Mikhailovich (1931-) 136.E Abramowitz, Milton 1. NTR Abrikosov, Aieksel Alekseevich (1928-) 402.r Abuj Wafit (940-98) 432.C Achenbach, J. D. 446.r Ackermann, Wilhelm (1896- I 962) 9h, r 156 E, r 356.B 41 U, r Ackoff, Russell Lincoln (1919-) 307.A Aczel, Janos D. (1924-) 388.r Adams, Douglas Payne 19.r Adams, John Couch (1819-92) 303.E Adams, John Frank (1930-) 64 C 200.r 202.S, T, V 237.A, E, 1 249.r 426 Adamson, lain Thomas 200.M Addison, John West, Jr. (1930-) 22.D, F, G, H 81.r 356.H, r Adem, Jose (1921-) 64.B, C 305.A App. A, Table 6.11 Adler, Mark 387.C Adler, Roy L. (1931-) 126.K 136.E, H Adler, Stephen L. (1939-) 132.C, r Ado, Igor Dmitrievich 248.F, r Adriaan, Anthonisz (c 1543- 1620) 332 Agmon, Shmuel (1922-) lI2.F, H, Q 323.H, r 375.B, C Agnesi, Maria Gaetana (1718-99) 93.H Aguilar, Joseph 33l.F Ahern, Patrick Robert (1936-) 164.K Ahlberg, John Harold (1927-) 223.r Ahlfors, Lars Valerian (1907-) J7.D 2LN 43.G, K 74.r 77.E, F r 122.1 124.B, r 143.A 169.E 198.r 234.D, E, r 272.1, J, L 352.A, B, C, F 367.B, G, 1, r 416429.0 438.r Aho, Alfred V 31.r 71 r 75.r 186 r Aida, Yasuaki (1747-1817) 230 Airy, Sir George Biddell (1801-92) 325.L App. A, Table 19.1V Aitken, Alexander Craig (1895- I 967) 223.B App. A, Table 21 Aiza wa, Sadakazu (1934-) 286.X Aizenman, Michael (1945-) 136.G 340.r 402.G Akahira, Masafumi (1945-) 128.r 399.K, 0 400 r Akahori, Takao (1949-) 72.r Akaike, Hirotugu (1927-) 421.D Akaza, Tohru(1927-83) 234.r Akbar-Zadeh, Hassan (1927-) 152.C Akcoglu, Mustafa A. (1934-) 136 B Akemann, Charles A. (1941-) 36.K Akhiezer, Naum Illch (1901 -80) 197.r 25Lr 336.r 390.r Akizuki, Yasuo (1902-84) 8 59.H 284.F, G 368.F Alaoglu, Leonidas (1914-) 37.E 424.H Albanese, Giacomo (1890-1947) 16.P 232.C al-BaWinI, Mohamed ibn Gabis ibn Sinan, Abu Abdallah (858?-929) 26432.C Albert, Abraham Adrian (1905-72) 29.F, r l49.r 231.r Albertus Magnus (1 193?-1280) 372 Alcuin (735-804) 372 Aleksandrov, Aleksandr Danilovich (1912-) I I I.r I 78.A 255.D 365.H 425.r Aleksandrov (Alexandroff), Pavel Sergeevich (1896- 1982) 22.1 65.r 93.r 99.r I 17.A, E, F, r 201 .A, r 207.C 273.K 425.S- V, r 426. *, r Alekseev, Vladimir Mikhailovich (1932-80) 420.r AlekseevskiI, Dmitril Vladimirovich (1940-) 364.r Alexander, Herbert James (1940-) 344.F Alexander, James Waddell (1888-1971) 65.G 20LA, J, M, 0, P 235.A, C, D, E 426 Alexits, Gyorgy (1899-1978) 317.r Alfsen, Erik Magnus (1930-) 351.L Alfven, Hannes (1908-) 259.*, r Alinhac, Serge (1948-) 345.A al-Khwarizmi (Alkwarizmi), Mohammed ibn Musa (c. 780-c. 850) 26 Allard, William Kenneth (1941-) 275.G Allendoerfer, Carl Bennett (191 I -74) 109365.E Almgren, Frederick Justin, Jr. (1933-) 275.A, F, G, r 334 F Altman, Allen B. 16.r Amari, Shun-ichi (1936-) 399.0 Ambrose, Warren (1914-) 136.D Amemiya, Ichiro (1923-) 72.r Amitsur, Shimshon A (1921-) 200 P Ampere, Andre-Marie (1775-1836) 82.A 107.B 278.A Amrein, Werner O. 375.B, r Ananda-Rau, K. 121.D Andersen, Erik Sparre (1919-) 260J Anderson, Brian D. O. 86.r Anderson, Joel H. (1935-) 36J Anderson, Richard Louis (1915-) 19.r Anderson, Robert Murdoch (1951-) 293.D, r Anderson, Theodore Wilbur (1918-) 280.r 374.r 421.D Andersson, Karl Gustav (1943-) 274.1 Ando, Tsuyoshi (1932-) 310.H Ando, Y 304.r Andreotti, Aldo (1924-80) 32.F 72.r Andrews, David F. 371.H Andrews, Frank C. 419.r Andrianov, Anatolii N. (1936-) 32.F Andronov, Aleksandr Aleksandrovich (I 90J -52) 126.A, 1, r 290 r 3 I 8.r Anger, Carl Theodor (1803-58) 39.G App. A, Table 19 IV Anikin, S. A. 146 A Anosov, Dmitril Viktorovich (1936-) l26.A, J 136.G Antiphon (fl. 430? H.c.) 187 Antoine, Louis August (1888-1971) 65.G Anzai, Hirotada (1919-55) 136.E Apery, Roger (1916-) l82.G 
Name Index Apollonius (of Perga) Apollonius (of Perga) (262-c. 200 B.C) 179.A 181 187 App. A, Table 3.V Apostol, T. M. 106.r 216.r Appel, Kenneth I. (1932-) 157.A l86.r Appell, Paul-Emile (1855-1930) I I.r 206.D, r 393.E, r 428.r App. A, Table 18.1 App. A, Table 20 r Arakelov, S. Yu. 9.r 118 E Arakelyan, Norair Unanovich 164.J Araki, Huzihiro (1932-) 150D, E 2l2.B, r 308.1, r 351.L 377 r 402 G Araki, Shara (1930-) 427.B Aramata, Hideo (1905-47) 450.D Aramovich, I. 198.r Arbib, Michael Anthony (1940-) 75.r 95.r Arbuthnot, John (\667-1735) 371.A Archibald, Raymond Clare (1875-1955) 187.r Archimedes (c 287-212 B.C) 20 78.F 93.H 149.N 155 B, D 187 243.G 31O.C 332 355.B 439.C Archytas (of Taras) (c. 430-c 365 B c.) 187 Arens, Richard Friedrich (1919-) 36.M 424.N Arf, Cahit(191O-) 114.J Argand, Jean Robert (1768- I 822) 74.C Arima, Reiko (Sakamoto, Reiko) Arima, Yoriyuki (1714-83) 230 Arimoto, Suguru (1936-) 213.E, F, r Aristotle (384-322 B.C.) 187 Ariyama, Masataka (1929-) 446.r Arkhangel'skii, Aleksandr Vladimirovich (1938-) 273 K. r 425 F, S, Y, CC, r Arnauld. Antoine (1612-94) 265 Arnoff, E. Leonard (1922-) 307 A Arnold, Leslie K. 136.C Arnol'd, Vladimir Igorevich (1937-) 82.r 126 A, L, M, r 136r 196 219.r 271 r 402.r 418 E 420.G App A, Table 5, r AronszaJn, Nachman (1907-80) I 12.H 188.r 273.K 323.J 338.E Arraut, Jose Luis (1938-) 126.H Arrow, Kenneth Joseph (1921-) 227.r 292 E, r Arsenin, Vasilii Yakovlevich 22.C, F, r Artin, Emil (1898- I 962) 4.A 6 E, F, r 7.r 8 14.F, K, 0, R, S, U, r 28 29.r 59.A, C, F, H, r 60.r 65.G I 18.F 123 F 149.N 15J.I 155 G 172.F, r J74.r 196 198.B, r 200.N 235.F 257.H, r 277.1 284.A, G 295.E 343.r 368.F, r 439 L, r 450 A, G, P, R Artin, Michael (1934-) 15.r 16.U, W, r 126.K 210.r 41 8.C 450.Q, r Arveson, William B. (1934-) 36 r 308.r Aryabhata (Arya-Bhatta) (c. 476-c 550) 209332 432.C Arzeh't, Cesare (1847-1912) 168.B 216.B 435.D Asada, Kenji (1946-) 274.r 345.B Asano, Keizo (1910-) 829 I Asano, Kiyoshi (1938-) 41.D Aschbacher, Michael George (1944-) l5J.J Ascher, Edgar 92.r Ascoli, Giulio (1843-96) 168.B 435.D Ash, Avner Dolnick (\949-) 16.r Assmus, Edward F., Jr. (1931-) 2oo.K Athreya. Krishna B. 44 C Atiyah, Michael Francis (1929-) 16.r 20 68.F 80.G, r 109 114.E 1470 153.C 183 237.A, H, r 323.K 325.1 345.A 366 A-D, r 390.1, J 391.L, N, r 426 431 D437,X Atkin, Arthur O. L. (1935-) 328 Atsuji, Masahiko (1922-) 425.Y Aubin, Jean-Pierre 286,X Aubin, Thierry Emilien (1942-) 183.r 232.C 364 H, r 1876 Audley, Robert John (1928-) 346.r Auerbach, Herman (1902-42) 270.J Aumann, Robert John (1930-) 173.D, E, r 443.A Auslander, Joseph (1930-) 126 D Auslander, Louis (1928-) 105.r 136.C I 52.C 279.C 437.U Auslander, Maurice (1926-) 29.K 200.K, L 284.G A very, J 377.r Avez, Andre 126.r J36.r 402.r Ax, James B 14.D I 18.B, F 276.E, r 50.J Ayoub, Raymond George (1923-) 4.r 123 r 295.r 328.r Azencott, Robert Guy (1943-) J36.G Azima, Naonobu (1739-98) 230 Aziz, Abdul Kadir 303.r Azra, Jean-Pierre (\935-) 171.r Azumaya Goro (1920-) 8 *, r 29.1, K r 67. D 172.r 200.L 362.r 368.r B Babbage, Charles (1792-1871) 75.A Bachelier, Louis (1870-1946) 45.A Bachet de Meziriac, Claude Gaspar (1581-1638) 296.A Bachmann, Paul Gustav Heinrich (1837-1920) 297.1 Bacon, Francis (1561-1626) 401.E Baer, Reinhold (\902-79) 2.F 122.B 200.1, K Bagemihl, Frederick (1920-) 62.C-E Bahadur, Raghu Raj (1924-) 396.r 398.r 399.N, r 400.K, r Bahmann, H. 97 B Bailey, Norman T J 40.r Baillon, Jean- Bernard (195 1-) 286. Y Baily, Walter Lewis, Jr. (1930-) 16,Z 32.F, H 122.r 194.r Baiocchi, Claudio 440.r Baire, Rene Louis (1874- I 932) 2021 C, L 84.D, r 126.H 273.B, J 425.N Bairstow, L. 30l.E Baker, Alan (1939-) I 18.D 182.G, r 196 347.E 430.D, r Baker, George Allen, Jr. (\932-) 142.r Baker, Henry Frederick (\ 866-1956) 9.r I 5.r n.r 350 r Baker, Kenneth R. 376.r Bakhshali (c. 3rd century) 209 Balaban, Tadeusz 325.K Balakrishnan, A. V. (\922-) 378.D Balas, Egon 215.C, r Baldwin, John T. (\944-) 276.F Balian, Roger 386.r Ball, W. W. Rouse 157.r Banach, Stefan (1892-1945) 2023 G 36 A, F 37 A, B, E, F, H, I, 0, r 105 Z 162 168.r 246.G 286.K, Z 310.F. I 424.C, H, J, X 442.r Banerjee, Kali S. (1914-) 102.r Bang, Thoger Sophus Vilhelm (1917-) 58.F Banica (Banica), Constantin (1942-) 23.r Baouendi, M. Salah (1937-) 323.N 345.A Barankin, Edward William (1920-) 396.r 399.D, r Barban, Mark Borisovich (1935-) 123.E Barbey, Klaus 164.r Barbosa, Joao Lucas Marques 275.B Barbu, Viorel(1941-) 88.r440.r Barden, Dennis 65.C Bardos, Claude Williams (1940-) 204.E Bargmann, Valentine (1908-) 258.r 37 EE Bar-Hillel, Yehoshua(1915-75) 96.r 
1877 Bari, Nina Karlovna (1901-61) 159 J Barlow, Peter( 1776-1862) NTR Barlow, William (1845-1934) 92 F Barnes, Ernest William (1874-1953) 206.C App. A, Table 18 I Barr, Michael (1937-) 200.r Barrow, Isaac (1630-77) 265 283 Barth, Wolf Paul (1942-) 16r Bartle, Robert Gardner (1927-) 68.M 443.A, G Bartlett, Maurice Stevenson (1910-) 40 r 44.r 280.J 407.r 42J.C, r Barwise, Jon 356.r Bashforth, F 303.E Bass, Hyman (1932-) 122.F 200 r 237, J, r Bass, Robert Wauchope (1930-) 289.D Bastin, J 351 r Batchelder, Paul M. 104.r Batchelor, George Keith (1920-) 205.r 433.C, r Bateman, Paul Trevier (1919-) 4.D348.K Bauer, Friedrich Ludwig (1924-) 302.r Bauer, Heinz (1928-) 193 U Baum, Paul Frank (1936-) 366 E 427.B Bayer, Pilar 450.r Bayes, Thomas (1702-61) 342 A F 396.J 398.B 399 F 401 B, E 403 G 405.1 Bazilevich, Ivan Evgen'evich 438.B Beale, E. M. L 292 r 349 r Beals, Richard William (1938-) 320.r 345.A, B Beardon, Alan Frank (1940-) 234 r Beatley, Ralph 139.r Beauville, Arnaud (1947-) 15 r Bebutov, M. 126 E Becchi, C. M. 150.G Beck, James V. 2oo.Q, r Beckenbach, Edwin Ford (1906-82) 21 J.r Becker, Oskar Joachim (1889-1964) 156.r 187 r Beckmann, Petr (1924-) 332 Bede Venerabilis (673-735) 372 Beer, Stafford (1926-) 95.r Beeson, H. 275.C Beez, R. 365.E Behnke, Heinrich (1898-1979) 21 H, Q 23 E 198.r 367 B, G, I, r Behrends, Ralph Eugene (1926-) 132 r Behrens, W. V 400.G Belardinelli, Giuseppe 206 r Belavin, A A. 80 r Belinfante, Frederik J. 150.B Bell, Eric Temple (1883-1960) 177D Bell, James Frederick (1914-) 33.r Bell, John Stewart (1928-) 351.L Bell, Steve 344.D Bellissard, Jean Vincent (1946-) 351.L Bellman, Richard (Ernest) (1920-84) 86 A, F 127.A, D, E, G, r 163.B 211.r 291 r 314 r 394 r 405.B, r Belov, Nikolai Vasil'evich (1891-1 92 r Beltrami, Eugenio (1835-1900) 109 194 B 285.A 352 B App. A, Table 4.11 Belyaev, Yurii Konstantinovich (1932-) 176.G Bendat, Julius S (1923-) 212.r Bender, Helmut (1942-) 151J Benders, 1. F. 215.r Bendikson (Bendixson), Ivar Otto (1861-1936) 107 A 126.1 Bengel, Gunter (1939-) 112.D Benilan, Philippe (1940-) 162 Bensoussan, Alain (1940-) 405.r Berard-Bergery, Lionel (1945-) 364.r Berens, Hubert (1936-) 224 E, r 378.r Berezin, Feliks Aleksandrovich (1931-80) 377 r Name Index Bieberbach, Ludwig Berg, Christian (1944-) 338.r Berg, Ira David (1931-) 33J.E 390.1 Berge, Claude (1926-) 186.r 281.r 282 r Berger, Charles A. (1937-) 251.K, L Berger, James Orvis (1950-) 398.r Berger, M. (1926-) 109 *, r, 178.A, C 391.B, C, r Berger, Melvyn (1939-) 286.r Berger, Toby (1940-) 213.E Bergh, Joran (1941-) 224.r Bergman, Stefan (1895-1977) 21 Q 77.r 188.G, r 326.C Berkovitz, Leonard D. (1924-) 86 F 108.A, B Berlekamp, Elwyn R (1940-) 63.r Bernays, Paul Isaak (1888-1977) 33.A, C, r 97. * 156.r 411 J, r Bernoulli family 2038 107.A 266 Bernoulli, Daniel (1700-82) 2038 205.B 301.J 342.A 396.B Bernoulli, Jakob (1654-1705) 38 46.A 93.H 136.D- F 177 B 250.A 342 A 379 I App A, Table 14 I App. B, Table 3.1 Bernoulli, Jakob (1759-89) 38 Bernoulli, Johann (1667-1748) 38 46.A 93.H 163.B 165.A Bernoulli, Johann (1710-90) 38 Bernoulli, Johann (1744-1807) 38 Bernoulli, Nikolaus (1687-1759) 38 Bernoulli, Nikolaus (1695-1726) 38 Bernshtein, I. N. 125.EE 154.G 418.H, r Bernshtein, Sergei Natanovich (1880-1968) 49.B 58.E 196 240.E 255.D 261.A 275.A, F 323.1 334.C 336.A, C, F Bernshtein, Vladimir 121.r Bernstein, Allen R 276.E, r 293 D Bernstein, Felix (1878-1956) 228.A Berry, G G 319 B Berry, L. Gerard (1948-) 40D Bers, Lipman (1914-) 21.r 23.r I11.r 122.1, r 204.G 234.D, r 275 A 320 r 326.r 327 r 352.B- E, r 367.r 416.*, r Berstel, Jean (1941-) 31 r Berthelot, Pierre (1943-) 16.r 366.r 450.Q Bertini, Eugenio (1846-1933) 15 C Bertrand, Joseph Louis Fran90is (1822-1900) 111 F 123A Berwald, L. (1883-?) 152.C Berztiss, Alfs T 96.r Besikovich (Besicovitch), Abram Samoilovich (1891- 1970) 18.A, C 246 K Besov, Oleg Vladimirovich (1933-) 168.B, r Bessaga, Czeslaw (1932-) 286 D 443.D Besse, Arthur L. 109.r 178.r Besse,J 198N Bessel, Friedrich Wilhelm (1784-1846) 39 A, B, D, G 197.C 223.C App. A, Tables 14.11, 19 III, IV, 21.III, IV Besson, Gerard (1955-) 39I.F Betti, Enrico (1823-92) 105.A 2oo.K 201 A, B 426 Beurling, Arne (Karl-August) (1905-) 62.B, E 125.A, U 143.A 164.G, I 169.E 192 Q 251.L 338.Q, r 352.C Bezout, Etienne (1739-83) 9.B 12.B Bhaskara (1114-85?) 209296.A Bhatia, Nam Parshad (1932-) 86.r 126.r Bhattacharya, Rabindra Nath (1937-) 374.F, r Bhattacharyya, A 399 D, r Bianchi, Luigi (1856-1928) 80.J 365.J 417B BIckel, Peter John (1940-) 37I.r Bidal, Pierre 194 F Bieberbach, Ludwig (1886-1982) 43 r 77 E 89.C 
Name Index Biedenharn, Lawrence C. 92.F, r 107.r 179 B, r 198.r 254.r 288 r 339. 429.r 438 B, C Biedenharn, Lawrence C. (1922-) 353.r Biezeno, Cornelis Benjamin (1888-1975) 19.r Biggeri, Carlos 121.C Biggs, Norman Linstead (1941-) 157.r Billera, Louis J (1943-) 173 E Billingsley, Patrick P. (1925-) 45.r 136.r 250.r 341.r 374.r Binet, Jacques Philippe Marie (1786-1856) 174.A 295.A Bing, RudolfH. (1914-86) 65.F, G 79.D 273.K 382 D 425 AA Birch, Bryan John (1931-) 4.E 118.C-E 450 S Birkeland, R. 206.D Birkhoff, Garrett (1911-) 8.r 87.r 103 r 183.r 243.r 248.J 310 A 311.r 343.r 443 A, E, H Birkhoff, George David (1884-1944) 30.r 107.A 109111 I 126.A, E 136.A, B 139.r 153.B, D 157.A 162 253.C 254.D 279.A 286.D 420.F Birman, Joan S. (1922-) 235.r Birman, Mikhail Shlemovich (1928-) 331.E Birnbaum, Allan (1923-76) 399.C, r 4oo.r Birtel, Frank Thomas (1932-) 164.r Bisconcini, Giulio 420.C Bishop, Errett A. (1928-83) 164 D-F, J, K 367.r Bishop, Richard L (1931-) 105.r 178 r 417 r Bishop, Yvonne M M 280.r 403.r Bitsadze, Andrei Vasil'evich (1916-) 326 r Bjerknes, Carl Anton (1825-1903) I.r Bjorck, Ake (1934-) 302.r Bjorck, Goran (1930-) 125.r Bjork, Jan-Erik 112.r 125 EE 274.r Bjorken, James Daniel (1934-) 132.r 146.A, C 150.r Blackman, R. B 421.r Blackwell, David (Harold) (1919-) 22.H 398.r 399.C Blahut, Richard E (1937-) 213 r Blair, David E (1940-) 11O.E 364 G Blakers, Albert Laurence (1917-) 202.M Blanchard, Andre (1928-) 72.r Blanc-Lapierre, AndreJoseph (1915-) 395 r Bland, Robert G. (1948-) 255 C Blaschke, Wilhelm (1885-1962) 43 F 76 r 89 C, r 109.*, r IIO.C, rIll r 178 G 218.A, C, H 228.r Blatt, John Markus 353.r Blattner, Robert James (1931-) 437 W, EE Bleaney, B. I. 130.r Bleuler, Konrad (1912-) 150G Bloch, Andre (1893-1948) 21.N,0 77.F 272 L 429.D Bloch, Felix (1905-) 353.r 402.H Bloch, Spencer 16.R Block, Henry David (1920-78) 420.C Bloxham, M. J D 386 C Blum, Julius Rubin (1922-82) 136.E Blum, Manuel 71 D, r Blumenthal, Ludwig Otto (1876-1944) 32.G 122.E Blumenthal, Robert McCallum (1931-) 5.r 261.B, r Boas, Ralph Philip, Jr. (1912-) 58.r 220.D 240.K 429.r Bacher, Maxime (1867-1918) 107 A 167.E 193 D Bochner, Salomon (1899-1982) 5.r 18.A, r 21.Q, r 36.L!;U r 109.*, r 125.A 160.C, r 164.G 192.8, 0 194 G 261 F 327.r 341.C, J, r 367.F 378.D 443.A, C,H Bodewig, Ewald 298.r Boerner, Hermann (1906-82) 362.r Boetius, Anicius Manlius Torquatus Severinus (c. 480-524) 372 1878 Bogolyubov, Nikolai Nikolaevich (1909-) 125.W 136.H 146.A 150.r 212.B 290.A, D 361.r 402.J Bohme, Reinhold (1944-) 275.C Bohnenblust, (Henri) Frederic (1906-) 28 31O.A, G Bohr, Harald (1887-1951) 18.A, B, 1-1, r 69.B 121.B, C l23.r 450.1 Bohr, Niels Henrik David (1885-1962) 351.A Bokshtein (Bock stein), Meer Feliksovich (19\3-) 64.B 117.F Bol, Gerrit (1906-) IIO.r Boll, Marcel (1886-) NTR Bolley, Pierre (1943-) 323.N Boltyanskii, Vladimir Grigor'evich (1925-) 86.r 89 r 117.F 127.G Boltzmann, Ludwig (1844-1906) 41 A, B, r 136.A 402.B, H, r 403.B, r Bolyai, Janos (Johann) (1802-60) 35 A 181 267 285.A Bolza,Oskar(1857-1942) 46.r Bolzano, Bernard (1781-1848) 140 273.F Bombieri, Enrico (1940-) 15.r 72.K ] 18.B 123.D, E, r 151 J 275.F 438.C 450.P, Q Bompiani, Enrico (1889-1975) 110.B Bonnesen, Tommy (1873-) 89.r 228.A Bonnet, Ossian Pierre (1819-92) 109 111.H 275.A, C 364.D App. A, Table 4.1 Bonsall, Frank Featherstone (1920-) 31O.H Bony, Jean-Michel (1942-) 274.r Book, D L 304 r Boole, George (1815-64) 33.E 42.A-D, r 104.r 156.B 243.E 267 379.J 411.A, r Boone, William Werner (1920-83) 9'7.*, r 161.B Boothby, William M (1918-) IIO.E Borchardt, Carl Wilhelm (1817-80) 229.r Borchers, Hans-Jurgen (1926-) 150.E Borel, Armand (1923-) 12.B 13.A, G, r 16.Z 32.H, r 56.r 73.r 122 F, G, r 147.K 148.E, r 199.r 203.A 248.0 249 J, V, r 366.D 383.r 384.D 427.B, r 431.r 437 Q 450.r App. A, Table 6.V Borel, Emile (1871-1956) 2021.0 22.A, G 58.D 83 B 124 B 156.C 198.Q, r 261.D 270.B, C, G, J 272 E, F 273.F 339.D 342.A, B 379.0 429.B Borevich, Zenon Ivanovich (1922-) l4.r 297.r 347 r Borges, Carlos J. Rego (1939-) 273.K 425.Y Borisovich, Y urii Grigor'evich (1930-:, 286.r Born, Max (1882-1970) 402.J 446 r Borovkov, Aleksandr Alekseevich (19:1 1-) 260.H Borsuk, Karol (1905-82) 79.C, r 153 B 202.B, I 382. A, C Bortolotti, Ettore (1866-1947) 417.E Bose, Raj Chandra (1901-) 63.D 241.B STR Bose, Satyendra Nath (1894-1974) 132.A, C 351.H 377.B 402.E Bott, Raoul (1923-) 105.r 109 153.C I 54.F-H 178.G 202.V, r 237.D, H, r 248.r 272.L 279 D 325 J 345.A 366.r 391.N, r 413.r 427 E, r 437 Q App. A, Table 6.VII Bouligand, Georges (1889-?) 120.D Bouquet, Jean-Claude (1819-85) 107.A III.F 288.B 289 B Bourbaki, Nicolas 8 13.r 20.r 22.r 34 r 35.r 60.r 61.r 67.r 74.r 84.r 87.r 88.r 103.r 105.r 106.r 122.r 13l.r 135.r 149.r 162 168.C l72.r 187.r 216.r 221.r 225.r 248.r 249.r 256.r 265.r 266.r 267.r 270.r 277.r 284.r 310.1 311.r 312.r 337.r 348.r 355.r 360.r 362.r 368.r 379.r 381.r 409.r 423.r 424.1, r 425.S. W, Y, CC, r 435.r 436.r 443.A Bourgin, David G. 20l.r Bourgne, Robert I7l.r Bourguignon, Jean-Pierre (1947-) 80.r 364.r 
1879 Bourion, Georges 339.E Boussinesq, Joseph (1842-1929) 387.B, F Boutroux, Pierre Leon (1880-1922) 265.r 288.B, C, r Bowen, Rufus (1947-78) 126.A, J, K, r 136.C, G, r 234 r Bowman, Frank (1891-1983) 39.r Box, George E. P. (1919-) 102.r 128.r 301 L 371 A 421.G, r Boyle, James M 298 r Bradley, G. J. 92.r Bradley, Ralph Allan (1923-) 346.C Brahmagupta (598-660) 118.A 209 Bram, Joseph (1926-) 291.r Brams, Steven John (1940-) 173.r Brandt, Heinrich (1886-1954) 190.P 241.C Branges, Louis de (1932-) 176.K 438.C Bratteli, Ola (1946-) 36.H, K, r 308.r 402.G, r Brauer, Richard Dagobert (1901-77) 14.E 27.D, E 29.E, F, K 118.C 151.1, r 362.G, 1, r 427.B 450.D, G, L Braun, Hel (1914-) 32.H 122.E 231.r Brauner, Karl 418 r Bravais, Auguste (1811-63) 92.B, F App. B, Table 5.1V Bredikhin, Boris Maksimovich (1920-) 123.E Bredon, Glen E (1932-) 383.r 431.r Breiman, Leo (1928-) 260 r 342.r Brelot, Marcel (1903-) 120.C, E, r 193.J, L, N, U 207.C 338 G, H Bremermann, Hans-Joachim (1926-) 21 D,1 Bremmer, Hendricus (1904-) 240.r Brent, Richard Peirce (1946-) 123.B 142.A 450.1 Breuer, Manfred 390.1 Brewster, Sir David (1781-1868) 283.r Brezis, David 88.E Brezis, Haim (1944-) 88.E 162 *, r 286 C, X 440.r Brianchon, Charles Julien (1785-1864) 78.K 343.E Brieskorn, Egbert (1936-) 16.r 418.C, D, r App. A, Table 5 r Brigham, E. Oran (1940-) 142.r Brill, Alexander Wilhelm von (1842-1935) 9.E, r II.B, r 12.B Brillinger, David Ross (1937-) 421.r Brillouin, Leon Nicolas (1889-1969) 25.B 446.r Brin, Matthew G. 136.G Briot, Charles Auguste Albert (1817-82) 107.A 288 B 289.B Broadbent, S. R. 340 r Brocker, Theodor 51 r Broderick, Norma 92.r Brodskii, Mikhail Samoi1ovich (1913-) 251.r 390 H Brody, Robert 21.0 Bromwich, Thomas John I'Anson (1875-1929) 240.D 322.D 379.r App. A, Table 12.1 Bronshtein, M. D 3251 Bros, Jacques (1934-) 150.D 274.D, 1 386.B, C Brosilow, C. B. 303 r Brouncker, Lord William (1620-84) 332 Brouwer, Dirk (1902-66) 55.r Brouwer, Luitzen Egbertus Jan (1881-1966) 65.G 79.D 99 A 117 A, D, r 153.B I 56.A, C 202.B 305.A 426 Browder, Andrew (1931-) 164.r Browder, Felix Earl (1927-) 112 F, Q 286.C, X, r 323.H Browder, William (1934-) 114.1, L, r 427.B Brown, Edgar H, Jr. (1926-) 202.T Brown, Harold 92.F Brown, Lawrence David (1940-) 396.r Name Index Callen, Herbert Bernard Brown, Lawrence G. (1943-) 36.J 390.J, r Brown, Leon 43.G Brown, Morton (1931-) 65.G Brown, Robert (1773-1858) 5.D 45.A-C, F, I 176 C, 1 250.F 406.B, G Brown, Robert Freeman (1935-) 153.r Brown, Scott W. (1937-) 251 L Brownlee, John (1868-1927) NTR Bruck, Richard Hubert (1914-) 190.P, r 241.D Bruhat, Fran<;ois (1929-) 13.K, Q, R 437.0, EE Brumer, Armand 182.r 450.J Brun, Viggo (1885-1978) 4.A, C 123.D, E Brune, O. 282.r Brunei, Antoine 136.C Brunn 89 E Brunovsky, Pavol 126.M Bruns, Heinrich (1848-1919) 126.A 420.A Brunschvicg, Leon (1869-1944) 329.r Bruter, Claude Paul (1937-) 281 r Buchholz, Herbert (1895-1971) 167.r Buchner, Michael Anthony (1947-) 126 L Buchsbaum, David Alvin (1929-) 284.G Buck, R. Creighton (1920-) 43.F, r 106.r Buckner, Hans 217.r Bucur, Ion (1930-76) 52.r Bucy, Richard S. (1935-) 86.E 95.r 405.G, r Buerger, Martin 1. 92.r Buffon, Georges Louis Leclerc, Comte de (1707-88) 218.A 342.A 385.C Buhler, Joe P. 450.G Buhlmann, Hans (1930-) 214.r Bulirsch, Roland (1932-) 303.F Bulow, Rolf 92.F Bunimovich, Leonid Abramovich 136.G Bunyakovskii, Viktor Yakovlevich (1804-89) 211.C App. A, Table 8 Burali-Forti, Cesare (1861-1931) 319.B Burchnall, Joseph Langley (1892-1975) 387.C Burckhardt, Johann Jakob 92.F Burd, Vladimir Shepselevich (1938-) 290.r Burgess, David Albert (1935-) 295.E Burghelea, Dan (1943-) 105.r 183 Burgi, Joost(1552-1632) 265 Burgoyne, N. 150.D App. B, Table 5.r Burkholder, Donald L. (1927-) 168 B 262.B Burkill, John Charles (1900-) loo.A Burns, Daniel Matthew, Jr. (1946-) 344 C-E Burnside, William Snow (1852-1927) 151.D, H, J, r 161.C 190.Q, r 267 431 F Busemann, Herbert (1905-) 178.F, H, r Bush, Robert R 96.r 346.G Bush, Vannevar (1890-1974) 19.E Bustab, A. 4.A, C 123.E Butkovskii, A G. 86.r Butzer, Paul L. (1928-) 224.E, r 378.r Byrne, George D. 303.r c Cabannes, Henri (1923-) 259.r Caflisch, Russel E. 41.D, E Caianiello, Eduardo R. (1921-) 291.F, r Cairns, Stewart Scott (1904-82) 114.A, C 426 Calabi, Eugenio (1923-) 122.F 232.C 275.H 365.G, L Calderon, Alberto-Pedro (1920-) 36.M 217.J, r 224.A, F 251.0 274.B, 1 321.F 323.1 345.A Calkin, John Williams (1909-) 36.J 390.1 Callan, Curtis G. (1942-) 132.C 361 B, r Callen, Herbert Bernard (1919-) 419.r 
Name Index Campanato, Sergio Campanato, Sergio (1930-) 168.B Campbell, George Ashley (1870-'1) 220.r 249.R Camus, Jacques (1942-) 323 N Cannon, James W. (1943-) 65 A, C, F Cannonito, Frank Benjamin (1926-) 97.r Cantelli, Francesco Paolo (1875-) 342.B 374.E Cantor, Georg (1845-1918) 20 33.A 34.r 47.r 49.D, r 79 D 93.D 98 r 117 A 156.A 159 J 267 273 F 294.A, E, r 312.A, C 355.r 381 F, r 426 Cantor, Moritz Benedikt (1829-1920) 20.r 26 r 38 r 187 r 209.r 265.r 266.r 296.r 360.r 372 r Cantwell, John C. 154 H Cappell, Sylvain E. (1946-) 65.D 114.K, r Caratheodory, Constantin(e) (1873-1950) 20 r 21 0, Q 43.J, K, r 46 r 74.r 77.C, r 82 r 136 A, C 180.r I 98.r 246 G 255 D, E 270 E 316.F 320.r 321.r 324 r 333.B, C, r 334.E 438.B Cardano, Girolamo (1501-76) 8 10.D 294 A, r 360.*, r 444 App A, Table I Cardoso da Silva, Fernando Antonio Figueiredo (1939-) 320r Carleman, Torsten (1892-1949) 20 30.A 41 D 43.H 58.F 68 L 112 N 125.A 160r 164.J 168.B 217 J, r 240 K 321.F 323.J, M 3361 App A, Table 8 Carleson, Fritz (1888-1952) 121.C Carleson, Lennart A. E (1928-) 43 F, G, r 48.r 77E 124C 159.H 1641168B 169.r352.F Carlson, Bille Chandler (1924-) 389.r Carlson, James A. 21 N Carmeli, Moshe (1933-) 359 r Carnahan, Brice (1933-) 304 r Carnot, Lazare Nicholas Marguerite (1753-1823) 181266 Carrell, James Baldwin (1940-) 226.r Carrol,.r B 346 E, r Carroll, Robert Wayne (1930-) 378.r Carson, John R. App A, Table 12 II Cartan, EIie (1869- I 951) 13.H 21 P 50 80.A, M, N 90.r 105 r 109 *, r 11O.B, rIll r 137.*, r 147 A 152.C 178 A, B 183 191.E, H, 1218 D, E 219 B, r 248.F, I, K, N, W, r 249.E, I, R, S, V, r 285.r 344 A-C, F 362.1 364 F 365 B, I 384.A 412.*, r 413.F, r 417 r 427 B 428.E, G 434 B 437,X App A, Table 51 Cartan, Henri (1904-) 3.r 17.C, r 20.*, r 21.E, H, I, L, 0, Q, r 23.B, E, r 28 r 32 r 36.L 50 52.r 58.D M.B 70 F, r 72 E 80 r 87 r 94.r 105.r 124.B, r 192 r 198 r 200 I, r 201.1 208 r 21O.r 225.r 272 J 277.r 338 E. L, M, P 383 J, r 426 Carter, Roger William (1934-) 151 D, r Cartier, Pierre Emil (1932-) 9.E 12.B 16 E 203.H Cartwright, Mary L 62.E Case, James H 108 C Casimir, 248.1 Casorati, Felice (1835-90) 104.D 198.D Cassandro, M. 402.G Casselman, William Allen (1941-) 450.r Cassels, John William Scott (1922-) 14 r 59 r 118.r 182.r 257 r Cassini, Jean Dominique (1625-1712) 93.H Casson, Andrew J 114 K Cassou-Nogues, Pierrette (1945-) 450.J Castaing, Charles (1932-) 443.A Castelnllovo, Guido (1865-1952) 3.E 9.H, r 12.B, r 15.B, E, G, H Castillon, Giovanni Francesco Mauro Melchior Salveminide(1708-91) 179A Catalan, Eugene Charles (\814-94) App. A, Table 10.111 Cauchy, Augustin Louis (1789-1857) 4.D 5.F 20 1880 21.C 53 87C 100.E 107 A, B 164.J 165.A, r 190.Q 198 A, B, E, F, Q 211.C 216.D, E 267 273 J 274.G 284 B 286,X, Z 294.E 296. 301 G 316 A, C, G 320 B, D, I 321 A, B 339.A 341.D 344.A 379.A, B, F, K 388.B 436.G App. A, Tables 8, 9,10.11 Cauer, D (1889-1918) 179.B Cauer, Wilhelm (1900-45) 282 r Cavalieri, (Francesco) Bonaventura (1598-1647) 20 265 Cayley, Arthur (1821-95) 12 B 54 105 A 137 151.H 157.A 190.Q 226.G 251.1 267 269.F, J 285.A Cazenave, Thierry 286 Y Cebysev  Chebyshev tech, Edouard (1893-1960) 110.B, r 117.E 201.A, M, P 207.C 383.F 425 T, r 426436.1 Ceder, Jack G. (1933-) 425.Y Cerf, Jean (1928-) 114.1 Cesari, Lamberto (1910-) 246.r 290.1' 314.D, r 394 r Cesaro, Ernesto (1859-1906) 297 D 379 K, M Ceva, Giovanni (16471- I 734?) 7.A Chaber, Jozef 273.K Chacon, Rafael Van Severen (1931-) 136.B, H 162 Chadan, Khosrow (1930-) 375.r Chaikin, S E  Khaikin, S. E. Chaitin, Gregory J. 71.r 354.D Chakravarti, Indra-Mohan (1928-) 102.1 Chandler, Colston 274 D, I 386 C Chandrasekhar, S 433.r Chandrasekharan, Komaravolu (1920-) 121 r 123.r 160.r 379 r 450.r Chang Chen-Chung (1927-) 276.r 293 r Chang, J. J. 346.E, r Chang Sun-Yang (1948-) 164.1 Chaplygin, Sergei Alekseevich (1869-]942) 326.B Chapman, D. G. (1920-) 399.D Chapman, Sydney (1888-1970) 41.E 260.A 261 A 379.M 402.H, r Chapman, Thomas A. (1940-) 65 C 382.B, D Charnes, Abraham (1917-) 255.D, E 408.r Charpit, Paul (?-1774) 82.C 320.D 322.B App A, Table 15 II Charzynski, Zygmunt (1914-) 438.C Chase, A. B. 24.r Chase, Stephen Urban (1932-) 29.r 172.A, K Chasles, Michel (1793-1880) 12 B 78.J 267 350.C Chatelet, Fram;ois (1912-) 118.D Chaudhuri, Jyoti 63.D Chaundy, T 387.C Chauvenet, William (1820-70) 392.r Chazarain, Jacques (1942-) 274 r 325.H, M 345.B 378 F Chazy, Jean (1882-1955) 288 D 420 D Chebotarev, Nikolai Grigor'evich (1894-1947) 14 S 172.r Chebyshev, Pafnutii L'vovich (1821-94) 19.G 123.A 223.A 299.A 317.D 336.B, H,.r 342.C App. A, Tables 20.11, VII Cheeger, Jeff(1943-) 178 B, r 391.D, M, r Chen Bang-Yen (1943-) 365 F, H, 0, r 417.r Chen Jing-Run (1933-) 4.C, r 123.E 242.A Chen, T.-C. 142.C, r Cheney, Elliott Ward (1929-) 142 r Cheng, J. H. 365.L Cheng Shiu-Yuen (1948-) 275.H 391 F, H Ch'eng Ta-Wei (11.1592) 57.C Chern Shiing-Shen (1911-) 21. N, P 50.r 56.C, F, r 80.r 90.r 109.*, r 110.E II1.r I 47.A, N 152 C 218.D, E, r 237.B 272.L 275.A, E 279.C 344.B 365.B, H, L, 0, r 
1881 Chernoff, Herman (1923-) 371 A, C, H, r 374.r 4oo.K Chervonenkis, O. A NTR Cherwell, Lord (Lindemann, Frederick Alexander) 29 \.F Cheung, Fan-Bill 386.r Chevalier, Alfred 171 Chevalier, Jacques 329 r Chevalley, Claude (1909-84) 6.A I \.r 12.B 13.B, F, I, N, r 27.r 50.r 59 A, r 60.L, r 6\.r 69.D 105.r 118.B, F 125.M 151 I 200.0 248.Q, r 249.U, V, r 256.r 258 r 277.r 284.G 368.r 409.r 423.r 427.B Chew, Geoffrey Foucar(1924-) 132.r 386 C, r Chien, Robert Tienwen (1931-) 282 r Ch'in Chiu-Shao (fl. 1250) 57.B Ching Wai-Mee 308.F Chisini, Oscar 9.r Chittenden, Edward Wilson (1895-1977) 273.K Choi Man- Duen (1945-) 36 J 308.r Cholesky 298.G, 302.B, D Chomsky, Noam (1928-) 3\.D 75.E Choong, K. Y 332 Choquet, Gustave (1915-) 20 r 48.F, H, r 89 r 120.E 139.r 164.C 193 J, N, r 207 C 255.E 338.C, D, H, I, L-O 407.B 424 U, r Chow Wei-Liang (1911-) 3.B 12 B 13.F 16.H, R, S 72.F, H Chowla, Sarvandaman D. (1907-) 123 D 450 I, K, r Chretien, Max (1924-) 150.r Christian, Ulrich Hans Richard Otto (1932-) 32.F, H Christoffel, Elwin Bruno (1829-1900) 77 D 80L 109 II \.H 317.D 417.D App. A, Tables 4.n, 13 n Chu Hsin 126.N Chu Lan Jen (1913-) 133.r Chu Shih-Chieh (fl. 1300) 57 B Chudakov, Nikolai Grigor'evich (1904-) 4.C Chudnovskii, Grigorii V 430 D, r Chung Kai Lai (1917-) 45 r 260J 342.B Church, Alonzo (1903-) 22 G 3\.B 75.D 81.A, r 97 *, r 354.r 356.A, C, E, G, r Churchman, C. West (1913-) 307 A Ciarlet, PhilIipe G (1938-) 300 r 304 r Ciesielski, Zbigniew (1934-) 176.G Clagett, Marshall (1916-) 187 r 372.r Clairaut, Alexis Claude (1713-65) 20 107.A 165 A App. A, Tables 14.1, 15.n Clancey, Kevin F. 251 r Clark, Charles Edgar (1935-) 376.r Clarke, Douglas Albert 356.H, r Clarkson, James Andrew (1906-) 443.H Clatworthy, Willard H. STR Clausius, Rudolf Julius Emmanuel (1822-88) 419.A Clebsch, Rudolf Friedrich Alfred (1833-72) II.B 226.G 353.B Clemence, Gerald Maurice (1908-) 55.r 392.r Clemens, Charles Herbert (1939-) 16J Clenshaw, Charles William (1926-) 299.A Clifford, Alfred H (1908-) 190.r 243.G Clifford, William Kingdon (1845--79) 9 C 61.A, D 275.F Clough, Ray William, Jr. (1920-) 304.r Coates, John H. (1945-) 118 D 182.r 450J, r Cochran, William Gemmel (1909-80) 102.r 373 r 374.B Codazzi, Delfino (1824- 73) II \.H 365.C 417.F App. A, Table 4.1 Coddington, Earl Alexander (1925-) 107.r 252.r 253 r 254.r 314.r 315.r 316.r 394.r Name Index Coxeter, Harold Scott Macdonald Coffman, Charles V 246 J Cohen, Eckford (1920-) 12\.A Cohen, Irvin Sol (1917-) 284 A, D, G Cohen, Jacob Willem (1923-) 227.r Cohen, Marshall M. (1937-) 9\.r Cohen, Paul Joseph (1934-) 22 F 33.D, r 49.D 192P, Q Cohen, S. G. 353.r Cohn, Paul Moritz (1924-) 249.r Cohn, Richard M (1919-) 104.r Cohn-Vossen, Stefan (1902-36) 109 111.1178.F, H 357.r 365.E 41O.r Coifman, Ronald R 168.B 251.r Cole, B 164.D Cole, 1. D 25.r Coleman, Sidney Richard (1937-) 146.C Colin de Verdiere, Yves (1945-) 391.1 Collatz, Lothar Otto (1910-) 217 r 298 r Collingwood, Edward Foyle (1900-70) 62.C, D, r ColIins, P. D. B. 386.r Combes, Jean-Michel Christian (1941-) 33\.F Combescure, Edouard (c. 1819 (24?)-?) II \.F Commichau, Michael 72.r Condon, Edward U. 353.r Conforto, Fabio (1909-54) 3.r Conley, Charles Cameron (1933-84) 126.E Conlon, Laurence William (1933-) 154.H Conner, Pierre Euclide, Jr. (1932-) 237.r 431 E, r Connes, Alain (1947-) 136.F 308.H, I, r 35\.L Constantine, Alan Graham 374 r Constantinescu, Corneliu (1929-) 193 U 207.C, D, r 367 E, G, r Conti, Roberto (1923-) 290.r Conway, John Horton 151.1 235.A Conway, Richard W 376.r Cook, Joseph M. (1924-) 375.A Cook, Roger John (1947-) 118 D Cook, Stephen Arthur (1939-) 71.E, r Cooke, George Erskine (1932-) 201 r Cooke, Kenneth Lloyd (1925-) 163.B Cooke, Richard G. 379.r Cooley, James William (1926-) 142.D, r 304.r Cooper, William (1935-) 255.D, E 408.r Copernicus, Nicolaus (1473-1543) 360 Cop pel, William Andrew 314.r Corbato, Fernando J. 133.r Cordes, Heinz O. (1925-) 345.A Coriolis, Gaspard Gustav de (1792-1843) 271.D Cornea, Aurel (1933-) 193.U 207.C, D, r 367.E, G, r Cornish, Edmund Alfred (1909-73) 374 F Cornu, Marie Alfred (1841-1902) 93.H 167.D Corwin, Lawrence Jay (1943-) 132.r Coster, Joseph 386.C Cotes, Roger (1682-1716) 299.A Cottle, Richard Warren (1934-) 292 D Coulson, Charles Alfred (1910-74) 446.r Courant, Richard (1888-1972) 20.*, r 46.r 77.E, r 82.r 106.r 107.r 112.r 120.r 134.r 188.r 189.r 197.r 198.r 204.G 205 r 216.r 217.r 222.r 275 A, C, r 3oo.r 304.C, F, r 317.r 320.r 32 \.G, r 322.r 323.E, r 324.r 325.M, r 327.r 334.C, D 389.r 391.H 44\.r 446.r Cousin, Pierre (1867-1933) 20 2\.K, Q Cowen, Michael J (1945-) 124.r Cowling, Thomas George (1906-) 259.r 402.r Cox, David Roxbee (1924-) 40.r 403.r Cox, Gertrude Mary (1900-78) 102.r Coxeter, Harold Scott Macdonald (1907-) 13.R 92 r 122 H 151 r 16\.r 248.S 285.r 357.r 
Name Index Cracknell, Arthur P. Cracknell, Arthur P. 92 r Cramer, Gabriel (1704-52) 179 A 269 M 302 A Cramcr, Harald (1893-1985) 123 r 214C 242 A 250r 341 E, r 374r 395 r 399 D, M, r400 r Crandall, Michael G (1940-) 162286 T, X, r Crandall, Stephen Harry (1920-) 298 r Crapper, Gordon David (1935-) 205.F Crawford, Frank Stevens (1923-) 446 r Crelle, August Leopold (1780-1855) 1 NTR Cremona, Antonio Luigi Gaudenzio Giuseppe (1830-1903) 161 Crittenden. Richard J (1930-) 413 r Crofton, Morgan William (1826-1915) 218.B Cronin, Jane (Smiley) (1922-) 153 r Crout, Prescott D 302 B CiOW, James Franklin (1916-) 110:: T'\ '£:'1... IIJ L.I L.UJ 1 Crowell, Richard Henry (1928-) 235 r Cryer, Colin W 303 G Csiszar, I mre 213.r Cuda, Karel (1947-) 293 E, r Curtis, Charles Whittlesey (1926-) 29 r 92 r 151 r 277 r 362.r Curtis, E 70 r Curti" John H. 299.A Cutkosky, Richard Edwin (1928-) 146.A, C 386.C CutIand, Nigel John (1944-) 293 r Czuber, Emanuel (1851-1925) 19.B o Dade, Everett C. (1937-) 92.F Dahlqui:,l. Germund (1925-) 303.G Dakin, R J 215 r d'Alembert, Jean Ie Rond (1717-83) 20107.B 130 A 205 C 239 252 F 325.D ApI'. A, Table 10.11 Damerell, Robert Mark (1942-) 450 M Daniell, Percy John (1889-1946) 310.1 Danilevskii, A M 298 D Danilov, V 1 16 Z Dankner, Alan (1945-) 126 J Dantzig, Gcorge Bernard (1914-) 255 C, E, r 264.r 292 D 408.r Darboux, Jean Gaston (1842-1917) 50 109 *, r 110B 126L 158r216A275A317D320.C428A Darmois, Georges 374 H Darwin, Charles Robert (1809-82) 40 B Dashen, Roger Frederick (1938-) 132.r Date, Eturo (1950-) 287 C D'Atri, Joscph Eugene (1938-) 364.r 384.E Datta, Bibhutibhw,an 209 r Davenport. Harold (1907 -69) 4.E 118.D 192 P David, Florence Nightingale (1909-) STR David, Herbert Aron (1925-) 346.r 374.r Davidenko, Dmitrii Fedorovich (1922-) 301 M Davie, Alexander M. 164 J Davies, Laurie 374 r Davis, Burgess J (1944-) 262 B Davis, Chandler (1926-) 212.r Davis, Harold T. ApI'. A, Table 21 r NTR Davis, Martin (David)( 1928-) 22 H 31 r 97 *, r 1730293 r 356 H Da vis, Philip J. (1923-) 223.r 299 r Davis, William Jay (1939-) 68 M 443 H Davisson, Lee D. (1936-) 213.E Day, Mahlon Marsh (1913-) 37 r 310.r Daykin, David Edward 332 r De Alfaro, Vittorio (1933-) 132 r 375.r de Baggls, F. S. 126.A Debiard. Amedee 115.r Debreu, Gerard (1921-) 173 E 443 A, 1 1882 Debye, Peter Joseph William (1884-1966) 25.C, r 30 C 39 E ApI'. A, Table 19.m Dedekind, Julius Wilhelm Richard (18:,1-1916) 11.B, r 12 B 14 C-E, J, U 47 49 F, r E7 K 98 156 A. r 172 A, r 243.F 267 284.G 294.A, E, r 328 347 H 355.A, r 363.r 379.D 450 A, D, K De Giorgi, Ennio 275.F 323 L de Haan. David Bierens (1822-95) API' A, Table 9.r Dehn, Max (1878-1952) 65.E 155.F 196 Dejon, Bruno F. (1930-) 301.G, r Dekkers, A. J NTR Delaunay, Charles Eugene (1816-72) 93.H de la V allee- Poussin, Charles Jean (1866-1962) 20.r 48.A, B 123 B 379.S 437.r 450 B, I Dcleanu, Anstide (1932-) 52.r Delens, Paul Clement (1889-) 110 r de I'H6p,tal, Guillaume Fran<;ois Anto ne (1661- 1704) 20 Deligne, Pierre (1944-) 9.r 12.B 16.V, r 32.D 118.B 418.r 428 H 450 A, G, H, J, M, Q, S, . ApI' B, Table 5 Dellacherie, Claude (1943-) 22.r 261.r 262.r 407.B, r Deltheil, Robert 218 r Demazure, Michel (1937-) 13.r 16.1, Z, r de Miatello, 1. D. 384.E Deming, William Edwards (1900-) 2f;0 J 373 F, r Democritus(c 460-c 370BC'.) 187 de Moivre, Abraham (1667-1754) 74 C 250 A 342.A De Morgan, Augustus (1806-71) 42.A 156.B 157.A 381.B411.A, r Denes, Jozsef(1932-) 241.r Denjoy, Arnaud (1884-1974) 58 F 79 D 100.A, D 126 I 154.D, H, r 159 I 168 B 429.D Denker, Manfred (1944-) 136.H Deny, Jacques (1916-) 338.M-P, r de Oliveira, Mario Moreria Carvalho 126 J De Paris, Jean-Claude 321 G de Possel, Rene (1905-) 77.E 367.F Deprit, Andre Albert (1926-) 420.G Deprit-Bartholome, Andree 420.G de Rham, Georges-William (1903-) 12.B,105.R, V, r, 109.*, r 114.L 125.A, R 194.F, r 201.A, H, 1 237 H 249 V 274.G, 364.E, r, 417 r Desargues, Gerard (1593-1662) 155 E 181 265329 343.C Descartes, Renc (1596-1650) 7.C 10.E 20 93 H 101 180.A 181265426 Deser, Stanley (1931-) 150.r De-Shalit, Amos (1926-) 353.r Desoer, Charles A (1926-) 86.D d'Espagnat, Bernard (1921-) 351.r DeTurck, Dennis M. 364 r Deuring, Max Friedrich (1907-84) n r 29 r 73 A, r 123 D 257.r 439 L 450 S, r Deutsch, Robert William (1924-) 55 r de Vries, G 387 B De Wilde, Marc (1940-) 424X, r DeWitt, Bryce Seligman (1923-) 359.r DeWitt (DeWitt-Morette), Cecile (192:'-) 150.r 359 r DiCastro, C 361 r Dickinson, Bradley W. (1948-) 86.D Dickson, Leonard Eugene (1874-1954,> 4.E 10 r 54.r 60.K 118.A 151 I, r 296.r 297 r Didenko, Viktor Pavlovich 326.r Dido (Didon, Belus Elissa) 228.A Dienes, Paul (1882-1952) 339.r Diestel, Joseph (1943-) 37 r 443 A 
1883 Dieudonne, Jean (1906-) 10 r 12 r 13 C 20 r 60 K, r 139 r 151 r 183 r 200 r 203 r 226 r 321.r 355 r 389.r 424.r 425.S, X, Y 435.r 436.1 Dijkstra, Edsger Wybe (1930-) 281.C Dikil, Leonid Aleksandrovich 387 C Diller, Justus (1936-) 155 r Dilworth, Robert Palmer (1914-) 281 E Dinaburg, E. I. 126.K Dinculeanu, Nicolae (1925-) 443 r Dinghas, Alexander (1908 -74) 124 r 198 r DlOi, Ulisse (1845-1918) 39 D 1111159 B 314 D 435 B Dinits, E A 281.r Dinkelbach, Werner 264 r Dinostratus (fl 350 1 B.C ) 187 Diodes (200 R C ) 93 H Diophantus (c 246-c. 330 or c. 1st century) 118 A 182.F 187 296.A Dippolito, Paul Randall (1948-) 154 D Dirac, Paul Adrien Maurice (1902-84) 125 C 132 A 150.A 270D 297.r 351.G, r 359.C 377 B, C, r App. A, T ahle 12.11 Dirichlet, Peter Gustav Lejeune (1805-59) 14 D, U 84.D 98119 r 120.A, F 121 A 123 D 159 B 160 B 164 B, 165 A, r 182 A 193 F 234.C 242 A 261.C 267 295 D 296 A, B 323 C, E 334 C 338 Q 341 D 347.E, H. r 348 M 379.(', D 440 B 450 A, C, K App A, Table 9 V Dixmier, Jacques (1924-) 18.1 36.r 681308 F. r 351.C 437 r Dixon, A C. (1865 -1936) App. A. Table 191V Dmitriev, N A 44 r Dobrushin, Roland L'vovich (1929-) 250.r 340 B. r 402 G 407 B d'Ocagne, Maurice (1862-1938) 19 D, r do Carmo, Manfredo Perdigao 111.r 275 A, B 365 G, r Doetsch, Gustav (1892-1977) 43 E 208.r 240 r 379.M Doi, Koji (1934-) 450 L Doig, Alison G. 215.D Dolansky, Ladislav NTR Dolbeault, Pierre (1924-) 72 D Dolbeault-Lemoine, Simone 365.E Dold, Albrecht E (1928-) 70 F. r 201.r Dolgachev, Igor V App. A, Tahle 5 V Dollard, John Day (1937 -) 375 B Domb, Cyril 361.r 402 r Donaldson, Simon K I 14 K. r Dongarra, Jack J (1950-) 298 r Donin, losif Failovich (1945-) 72 G Donnelly, Harold Gerard (1951-) 391 N Donoghue, William F , Jr 212.r Donsker, Monroe David (1924-) 250 E 340 r Doob, Joseph Leo (1910--) 5.r 45 r 62 E 86 E 115 r 136 B 162 193 T 207 C 250 r 260.1 261.A, F, r 262 A, B, D 341 r 342 r 395 r 406.A 407.A, r Doolittle, M. H 302 B Doplicher, Sergio (1940-) 150 E Dorfmeister, Josef F (1946-) 384 r Dorge, Karl (1898-1975) 337 F Dorn, William Schroeder (1928-) 349 r Dornhoff, Larry 362.r Douady, Adrien (1935-) 23 G 72 G Douglas, Jes&e (1897-.1965) 77.E 109 152.C 275 A, C 334 C, D, F, r Douglas, R G (1938-) 36J, r 164.1251 r 390J. r Doughs, Avron (1918-) 112 H 323 H Dowker,CliffordHugh(l91282) 117E201 M 425 S, Y Name Index Eells, James Drach,Jules(1871-1941) 107A Drake, Frank Rohert (1936-) 33 r Draper, Norman Richard (1931-) 102.r Drasin. David (1940-) 272 K. r Dreitlein, Joseph F. (1934-) 132 r Drell, Sidney David (1926--) 132 r 150 r Dreyfus, Stuart Ernest (1931-) 127.r Driver, Rodney D. (1932.) 163 B Dryden, H L 433 r Dubinsky, Ed (1935-) 168 B de Bois-Reymond, Paul David Gustave (1831-89) 159.H 379.D Dubreil, Paul 243.r Dubreil-Jacotin, Marie-Louise 243 r Dubrovin, B. A. 387.r Dubyago, Alexander Dmitrievich (1903-59) 309.r Dudley, Richard Mansfield (1938-) 176.G Duffin. Richard J. 264.r Dulling, G 290.C Dufresnoy, Jacques 17 C 124 B Dugundji, James (\919-85) 425.r Duhem, Pierre Maurice Marie (1861-1916) 419.B Duijvestijn, A J W. (1927-) NTR Duistermaat, Johannes Jisse (1942-) 274.B, I 345 B, r 391 J Dulac, M H 289 C, D, r Dunford, Nelson (1906-) 37 r 68 M 112 I, 0136 B, r 162.*, r 168.r 240.r 251.G, r 31O.r 315.r 331.r 378 B, r 390 K, r 443.A, F -H, r Dupin, Pierre Charles Fran<;ois (1784-1873) 111.H Durand, E 30l.F Durand, William Frederick (1859-1958) 222 r Duren, Peter Larkin (1935-) 43 r 438.r Durer, Albrecht (1471-1528) 360 Durfee, Alan H. (1943-) 154.B 418.r Duschek, Adalbert (1895-1957) III r Du Val, Patrick (1903-) 418 C Duvaut, Georges (1934-) 440 r Dvoretsky, Aryeh (1916-) 45.r 443.D Dwork, Bernard M (1923-) 450G, Q Dwyer, Paul Summer (1901-) 298.r Dydak, ]erzy 382 A, C Dye, Henry Abel (1926-) 136 F, r Dyer, SEldon, Jr (\929-) 237.r Dym, Harry (1938-) 176 K Dynkin, Evgenii Bori,ovich (1924-) 115 A, r 248.S, r 250 r 261.A, C 270 B App A, Table 5.1 Dyson, Freeman John (1923-) 132.C 146.A I 50. A 182G212B402G Dzyaloshin&kii, Igor Ekhiel'vich (1931-) 402.r E Easton, William B 33.F. r Eberlein, Ernst 136.H Eberlein, Patrick Barry (1944-) 178.r Eberlein, William Frederick (1917-) 37.G 162 424.V Ebin, David G (1942-) 178 B 183 r Eckmann, Beno(1917-) 2001 Eddington, Sir Arthur Stanley (1882-1944) 109 App A, Table 4 II Eden, Richard John (1922-) 146 A, C, r 386.C, r Edgeworth. Franci& Ysidro (1845-1926) 374 F Edmond, C 432 r Edmonds, Alan Robert (1922-) 353.r Edrei, Albert (1914-) 17 D 272.K, r Edwards. Harold M (1936-) 123 r Edwards, Robert Duncan (1942-) 65 A, C, F 154.H Eells, James (1926-) 105 r 114 B 183 *, r 194 r 195 E, r 
Name Index Effros, Edward George Effros, Edward George (1935-) 36. H, J 308.r Efron, Bradley (1938-) 3990 Egorov, Dmitril Fedorovich (1869-1931) 270.J Egorov, I van Petrovich (1915-) 364 F Egorov, Yurii Vladimirovich (1938-) 112 D 274.C, I 345.A, B Ehrenfest, Paul (1880-1933) 260.A 402 r Ehrenfest, Tatiana Alekseevna Afanaseva 260 A 402.r Ehrenpreis, Leon (1930-) I 12.B, C, R 125.S 320 H 437.EE Ehresmann, Charles (1905-79) 80 A, r 90.r 109 154.A, r Ehrlich, Louis W. (1927-) 301 F Ehrlich, Paul Ewing (1948-) 301 r 364.r Eichler, Martin Maximilian Emil (1912-) II.B, r 13.P 27 D, r 32.D, H, r 60.r 61.r 348 r 450.A, L, M,S Eicken, W. 75.r Eidel'man, Samuil Davidovich (1921-) 112.B 327.H Eilenberg, Samuel (1913-) 31.r 52.r 70.E-G, r 75 r 91.r 2oo.K, M, 0, r 201.A, C, E, G, J, Q 202.B, T 21 O.r 277.r 305.A 426 *, r Einstein, Albert (1879-1955) 45.A 109 129 132.A 137 150.A 256.J 285 A 351 A 359.A, B, D, r 364.D, 1434 C, r App A, Table 4.11 Eisenberg, Edmund (?- I 965) 292 D Eisenhart, Luther Pfahler (1876-1965) 109.*, r II1.r417r Eisenstein, Ferdinand Gotthold Max (1823-52) 140 32.C, F 296.A 337.F 339.E 450.T Ejiri, Norio (1953-) 275.A 364.F 365 G 391 C Elias, Peter(1923-) 213 F Eliasson, Halldor Ingjmar (1939-) 364.H Elliott, George Arthur (1945-) 36.H, K, r Elliott, Peter D T A 295 r Ellis, George F R (1939-) 359.r El'sgol'ts, Lev Ernestovich (1909-67) 163.r Elworth, Kenneth David (1940-) 183 286.D 406.r Emch, Gerald Gustav (1936-) 351.r Emde, Fritz 389.r NTR Emden, Robert (1862-1940) 291.F Endo Shizuo (1933-) 2oo.K Enestrom, Gustav (1852-1923) 10.E Enflo, Per (1944-) 37.G, L Engel, Friedrich (1861-1941) 247.r 248.F 249.r Engelking, Ryszard I 17.r 273.r 425.r 436.r Enneper, A. 275.A, B Ennola, Veikko Olavi (1932-) App. B. Table 5.r Enoki Ichiro (1956-) 72.K Enomoto Hikoe (1945-) App B, Table 5 Enright, Wayne H 303.r Enriques, Federigo (1871-1946) 9 r 12.B, r 15.B, E, G, H, r 72 K Enskog, David (1884-1947) 41.E 217.N 402.H Enss, Volker (1942-) 375.D Epstein, David Bernard Alper (1937-) 154 H Epstein, Henri (1932-) 125.W 150.D 212.B 386.B, r Epstein, Paul Sophus (1883-) 450.A, K Eratosthenes (275-194 B.C.) 187297.B Erbacher, Joseph A. 365.H Erdelyi, Artur (1908-77) 25 r 30 r 220.r 254 r 389.r App. A, Table 20.r Erdos Paul (1913-) 4.A 45.r 123.C, r 241.E 250.r 295 E 328 336 E 342.B Erlang, A. K 260.H 307.C Ernst, B. 359.E Ernst, Bruno (1947-) 424.r Escobal, Pedro Ramon 309 r 1884 Eskin, Grigorii I1'ich (1936-) 274.C, I 345.B Estermann, Theodor 4 C, D 123 D Estes, William Kaye 346 G Ethier, Stewart N 263.r Euclid (Eukleides) (c 303-c. 275 B.C.) 13.R 24.C 35.A 67.L 70.B 93.A 139.A, B, E 140 150.F 155.A 179.A 180.A 181 187285 A, C 296.A 297.A, B 332 337.D 364.B 412.H 423 M 425.V Eudoxus (c 408-c. 355 B c.) 20 187 Euler, Leonhard (1707-83) 4.C 16.E 20 38 46.A, B 56.B, F 65.A 83.A 90.C 93 C 107.A, B 126.A 131.0, G 141 145 165.A, r I 74.A, C I77.C, D 181 186.A, F 201 B, F, N 204.E 205.A, B 240.A 241.B 266 271.E, F 275.A 294.A 295.C, E 296.A 297,D, H 303.D, E 320.0 332 379.1-K 419.B 420 B 432.C 441.B 450.B App. A, Tables 3 V, 14.1 App. B, Tables 3.1, 6.1V Evans, Griffith Conrad (1887-1973) 48.E 120.0 338.H Evens, Leonard (1933-) 2oo.M Everett, J. D. 223.C App. A, Table 21 Ewens, Warren J. 263.r F Faber, Georg (1877-) 228 B 336 I 391.D 438.B Fabry, Eugene (1856-1944) 339.D Faddeev, Dmitrii Konstantinovich (1907-) 112.P 302.r Faddeev. Lyudvig Dmitrievich (1934-) 132.C 150.G 375.F 387.G Faddeeva, Vera Nikolaevna (1906-) J02.r Fagnano, Giulio Carlo (1682-1766) 20 Falb, Peter L (1936-) 86.D F<ilthammar, Carl-Gunne (1931-) 259.r Failings, Gerd (1954-) 118.E 145 Fan Ky (J 914-) 153.D Fannes, Marcus Marie-Paul (1950-) 402.G Fano, Gino (1871-1952) 12B 137r Fano, Robert Mario (1917-) 130.r 213.F Fano, Ugo (1912-) 353.r Fantappie, Luigi 125.A Faraday, Michael (1791-1867) 150.A Farey, J. 4.B Farkas, Julius (1847-1930) 255.B, E Farquhar, Ian E. 402.r Farrell, O. J. 164.J Farrell, Roger Hamlin (1929-) 398.r Fary, Istvan (I922-?) Ill.r 365.0 Fathi, Albert 126 N Fatou, Pierre (1878-1929) 21.Q 43.0 221.C 272.D 339.D Fattorini, Hector 0 378.D Favard, Jean (1902-) 336.C Fazar, W. 376.r Federer, Herbert (1920-) 246.r 275.A. G, r 334.F Fedorov. Evgraf Stepanovich (1853-1919) 92.F 122.H Fedorov, Vyatseslav Vasil'evich 102.1 Feferman, Solomon (1928-) 81.A Fefferman, Charles L. (1949-) 21.P, Q 168.B, r 224.E 262.B 320 r 344 D, F 345 A, B Feinberg, Stephen E. 403.r Feinstein, Amiel 213.F Feit, Walter (1930-) I 51.D, J, r 362.r Fejer, Lipot (Leopold) (1880-1959) 43.J 77.B 159.C 255.D Fekete, Mihaly (Michael) (1886-1957) 48.0445 Feldblum 179.B Feldman, Jacob (1928-) 136.F 
1885 Fel'dman, Naum Il'ich 118.D 430.D, r Fell, James Michael Gardner (1923-) 308.M Feller, William (1906-70) 112.J 115.A, r 250 r 260.1 261 A, B 263 r 341 r 342.r 378 B Fenchel, Werner (1905-) 89.r 109 1I1.F 365.0 Fendel, D. App. B, Table 5 Fenstad, Jens Erik 356 F, r Fermat, Pierre de (1601-65) 4 D 20109144145 180A 265 296 A 297.F, G 329 342.A 441.C Fermi, Enrico (1901-54) 132, 287.A, r 351.H 377.B 402.E Fernique, Xavier 176.G Ferrar, William Leonard (1893-) App. A, Table 19. IV Ferrari, Ludovico (1522-65) 8 IO.D 360 444 App. A, Table I Ferrero, Bruce 14.L 450 J Ferrers, N M 393 C Ferus, Dirk 365 E, J, N, 0, r Feshbach, Herman (1917-) 25 r Fet, Abram Il'ich (1924-) 279.G Feynman, Richard Phillips (1918-) 132 C 146.A, B 150.A, F 351.F 361.A Fibonacci (Leonardo da Pisa, Leonardo Pisano) (c 1174-c 1250) 295.A 372 Fiedler, Wilhelm 350.r Fienberg, Stephen Elliott (1942-) 280.r Fierz, Markus Edoward (1912-) 150 A Fife, Paul C. 95.r 263.D Figiel, Tadeusz 68.K, M Figueira, Mario Sequeira Rodrigues 286.Y Filippov, Aleksei Fedorovich (1923-) 22.r Fillmore, Peter Arthur (1936-) 36 J 390.1, r Finn, Robert (1922-) 204.D, r 275.A, D Finney, David John (1917-) 40.r Finney, Ross L. 20l.r Finsler, Paul (\894-1970) 109152 A 286.L Fischer, Arthur Elliot 364 H Fischer, Bernd (1936-) 151.1, J Fischer, Ernst (1875-1956) 168.B 317.A Fischer, Gerd 23.r Fischer-Colbrie, Doris Helga (1949-) 275.F Fisher, Michael Ellis (1931-) 361.r Fisher, Ronald Aylmer (1890-1962) 19.r 40.B 102.A, E, r 263.E 371 A, C 374.B, F 397.r 399.D, K, N, 0, r 400 G 401 B, C, F, G, r 403.A, F, r STR Fisher, Stephen D. 43.G 77.E Fix, George Joseph (1939-) 3oo.r 304.r Flammer, Carson (1919-) 133 r Flanders, Harley (1925-) 94.r 432.r 442.r Flaschka, Hermann (1945-) 287.B, r Fleissner, William G. 273 K Fleming, Wendell Helms (1928-) 108 A 275.A, G 334 F 405.r Flon, L. 75 r Floquet, G 107 A 252.J 268.B Floyd, Edwin E. (1924-) 237 r 431.E, r Focken, C M 116.r Fader, Geza (1927-77) 33.r Fodor, Jerry A. 96.r Fogarty, John 226.r Fogels, E. 123 D, F Foguel, Shaul R. 136.C Foia, Ciprian (\933-) 251.N Fok (Fock), Vladimir Aleksandrovich (1898-1974) 105 C 377.A, r Fokker, Adriaan Daniel 115.A 402.1 Fomin, Sergei Vasil'evich (\917-'75) 2.F 46.r 136.G Fong, Paul 151.1 App. B, Table 5.r Name Index Frobenius, Ferdinand Georg Ford, Lester R. (1896-1971) 234.C, r 281.r 282.r Ford, Walter Burton (1874-1971) 30.r Forelli, Frank (1932-) 164.G, H, K Forrester, Jay Wright (1918-) 385.B, r Forst, Gunner 338 r Forster, Otto F. (1937-) 72.r Forsyth, Andrew Russell (1858-1942) 289.B 428 r App. A, Table 14 r Forsythe, George Elmer (1917-72) 302.r 304.r Fort, Marion Kirkland, Jr. (1921-) 65.r Fort, T. 104.r Fortet, Robert Marie (1912-) 395.r Fortuin, C. M. 212 A Foster, Ronald Martin (1896-) 220.r Fotiadi, Dimitri 146.A, C Fourier, Jean-Baptiste-Joseph (1768-1830) 10.E 18.B 20 36.L 39.D 125.0, P, BB 142.D 158 159.A 160A-D 176.1 192 B, D, F, K, 0197 C 220.B 255 E 266 267 274.C 317.A 327.B 345 B 437 Z App A, Tables 11. I, II Fowler, Kenneth Arthur (1916-) 151.J Fowler, Ralph Howard (1889-1944) 402.r Fox, Ralph Hunter (1913-73) 65.G 235.A, C, G, r 382.A Fraenkel, Abraham Adolf(1891-1965) 33.A, B, D, r 47.r 381.r Frame, J. S. App. B, Table 5 Francis, J G. F 298.F Frank, Philipp (1884-1966) 129.r Frankel, Theodore T. (1929-) 364.D Franklin, Philip (1898-1965) 157.A, E Franklin, Stanley P. (1931-) 425 CC Franks, John M. (1943-) 126.1, K Franz, Wolfgang (1905-) 91.r 337.F Fraser, Donald Alexander Stuart (1925-) 396.r 401 r Frautschi, Steven Clark (1931-) 386.r Frechet, Rene Maurice (1878-1973) 37.0 87.K, r 117.H 246A, I 273.A, r 286.E, K 424.1 425.Q, S, CC, r 426 Fredholm, Erik Ivar (1866-1927) 20 68.A, E, F, K, L 120 A 162217 A, E, F, r 222.A 251.D 286.E 339.D Freedman, David A. (1938-) 250.r Freedman, Michael Hartley (1951-) 114.K, r Frege, Friedrich Ludwig Gottlo b (1848-1925) 156.B 411.A, r Freitag, Eberhard (1942-) 32 F Frenet, Jean-Frederic (1816-1900) 1I0.A 1I1.D App. A, Table 4.1 Fresnel, Augustin Jean (1788-1827) 167 D App. A, Tables 9.V, 19.11 Freudenthal, Hans (1905-) 162 178.F 202.A, U, r 248.r 249.r 265 310 A, D Freyd, Peter John (1936-) 52.r 200 r Fricke, Robert (1861-1930) 32.r 73 r 122.r 233.r 234.r Friedberg, Richard Michael (1935-) 356.D Friedman, Avner(1932-) 108.A,B 115.D286.r 320 r 322.r 327.r 406.r 440.r Friedman, James W. 173.E Friedman, Lawrence 307 r Friedman, Nathaniel A. (1938-) 136 E, r Friedrichs, Kurt Otto (1901-83) 112 D, I, S 125.A 162 204.G 205.r 252.r 300 r 304.F 323.H, r 325.G, r 326.D 331.A 345.A 351.K 375.A Fristedt, Bert (1937-) 5.r Frobenius, Ferdinand Georg (1849-19 17) I.r 2 B 3A, D, N 14 K 29.H 107.A 145 151 H 154 B 190.Q, r 191.B 257.D 267 269 I, N 280.F 286 H 
Name Index Frohlich, Albrecht 297 I 310.H 362 E, G 390 B 428 A, D 437 EE 450 P App A, Table 14.1 Frohlich, Albrecht (1916-) l4.r 59.r Frohlich,JurgM (1946-) 402G Froissart, Marcel (1934-) 146 A 386 B Frolik, Zdenek (1933-) 425.Y, CC 436 r Fronsdal, Christian (1931-) 132 r Frostman, Otto Albin (1907-77) 48.A, G 120 D 338 C, r Froude, William (1810-79) ] 16 B Fu. James Chuan (1937-) 399 M Fubini, Guido (1879-1943) J09 110 B, r 221 E 270 L Fubini, Sergio Piero (1928-) 132.r Fuch" Immanuel Lazarus (1833-1902) 32.B J07 A I 19 r 122 C 178 F 234 B 288 B App A, Table 18 I Fuchs, Ladislas 2 r Fuchs, Maximilian Ernst Richard (1873-) 253 A, E 288 D Fuchs, Wolfgang Heinrich (1915-) ] 7 D 272.K, r Fucter, Karl Rudolf (] 880- 1950) 73.r Fuglede, Bent (1925-) 48.H 143.B 338.E Fujii, Hiroshi (1940-) 304 D Fuji'i'e, Tatuo (]930-) 143.r Fujiki, Akira (1947-) 23 G 72.H 232.C Fujikoshi. Yasunori (1942--) 280.r Fujimagari, Tctsuo (1943-) 44 E Fujimoto, Hirotaka(1937-) 21 M, N Fujisaki, Gcnjiro (1930-) 6 F 450.L Fujisaki, Masatoshi (1943-) 86 E 405.r Fujisawa, Rikitaro (]861-1933) 267 Fujishige Satoru (1947-) 66 r 281 r Fujita, Hiroshi (] 928-) 204 B-D 304 r 378 J Fujita, Takao (1949 -) 9.r 15.H 72 I Fujiwara, Daisuke (1939-) 323 H 345 B Fujiwara, Masahiko (1943-) 118.C, D Fujiwara, Matsusaburo (] 88 I -1946) 89 E 230.*, r 240 B App A, Table 9.r Fukamiya, Masanori (]912-) 162 Fuks, Boris Abramovich (1907-75) 21 r 198 r Fuks, Dmitri] Borisovich J05.AA, r 154.G Fukushima Masatoshi (1935-) 115 C 261 r Fulkerson, Delbert Ray (1924-76) 376 r Fuller, Francis Brock 126.G, N Fulton, William (1939-) 9.r 16.1 366.E, r 418 r Funk, Paul 46 r 109 Furlan. Giuscppi (1935-) 132 r Furstenberg, Hillel 136 C, H Furtwiingler, Philipp (1869- 1940) 14 L, 0, R 59.A, D, F 145 G Gaal, S A 425 r Gabriel, Pierre (1933-) 13 r 52.r Gabrielov, A M App A, Table 5 V Gackstatter, Fritz (1941-) 275 E Gagliardo, Emilio 224.A Gaier, D 77.r Gaifman, Haim (]934-) 33 r Galambo" Janos (1940-) 374 r Galanter, Eugene H 96 r Gale, David (1921-) 22.H Galerkin, Boris Grigor'evich (1871- I 945) 303.1 304 B Galilei, Galileo (1564 -1642) 78 F 107 A 126.A 265 27J.A 359.C 360 Gallager, Robert G 63 r 2\3 E, F Gallagher, Patrick Ximenes (1935-) 123 E Gallavotti, Giovanni 136 G 1886 Gallot, Sylvestre (1948-) 364.G Galois, Evariste (181 1-32) 883 C 137 149.M 15J.D l71.*,rl72.A,B,G,H 190Q2ooNi.67 Galton, Francis (1822-1911) 40.B 44 B, C, r 401 E Galvin, Fred (]936-) 33 F Gambier, Bertrand Olivier (1879- I 954) 288 C Gamelin, Theodore Williams (1939-) 36.r 43 r 164 G, H, J, K, r 169 r Gamkrelidze, Revaz Valerianovich 86 r Gandy, Robin Oliver 81 A 356 r Gangolli, Ramesh A (] 935-) 5 r 437 EE Gantmakher, Feliks Ruvimovich (1908--64) J03.r 248 r Garabedian, Paul Roesel (1927-) 77.E 107.r438C Garbow, Burton S 298 r Garcia, August 136.B Girding, Lars (1919-) I 12.G, N 189.B, r 32J.G 323 H 325 F, J, r 345.A Gardner, Clifford S (1924-) 387.B, D Garfinkel, Robert S (1939-) 215.r Garland, Howard (] 937-) 122 G Garnett, John B (1940-) 164.F, J 169 E, r Garnier, Rene (1887-) 253 E 288 C, D 334 C Garnir, Henri (1921-85) 189.r Garsia, Adriano M. (] 928-) 262 r Garside, F A. 235.F Garside, G. R 301 N Gaschutz. Wolfgang (]920-) 151.D Gass, S. I 255 r Gastwirth, Joseph L (1938-) 371 H Gateaux, R 286.E Gauduchon, Paul (1945-) J09.r 391 r Gauss, Carl Friedrich (1777- 1855) 3.A JO E lJ.B 14 U, r 16 V 73.A 74.A, C, r 77 B 83 A 94.F 99.C 107.A J09111 F-H ll8A 120A ]23.A l36.C 149.1 174 A 175 176.A-C, F, H 179 A 180.B 193.D 206 A 223.C 242 A 253 B 267 275.C 2 7 9.C 285.A 294 A 295 D 296.A, B 297.1 299 A 302 B, C 309 C, r 327 D 337 D 338 J 341 D 347 H 357.A 364.D, H 365.C 401 E 403407 D 417 F 426 441 B 450 C App A, Tables 3.m, 4.1,10,14.11, 16.m, 21 Gaveau, Bernard (1950-) 115.r Gear, Charles William (]935-) 303 E, G, r Gegenbauer, Leopold (] 849- I 903) 317.D 393.E App. A, Table 20 I Gel'fand, Izrail' Moiseevich (1913-) 32 r 36 E, G, L 46 r 82.r J05 AA, r 1120, r 125 A, Q, S 136 G 154G 160r 162.*,r 168.B 183.r 192.G218F,G 256.r 258 r 287.C 308 D 341 r 375 G 387 C, D 395 r 407 C 424.T, r 437 W, EE, r 443 A. F. H 450 A, T Gel'fond, Aleksandr Osipovich (1906-6:J) 182.G 196 295.r 430.A, D, r Gell-Mann, Murray (1929-) 132.A, D, r 150 C 361 r Gentzen, Gerhard (] 909-45) 156.E 411.1, r Geocze, Zoard de 246 D, E Geoffrion, Arthur M 215 D, r Georgescu, V 375.B Gerard, Raymond (1932-) 428.H, r Gerbert (Silvester II) (940?-1003) 372 Gerhardt, Carl Immanuel (1816-99) 245 r Germain, Sophie (1776- I 831) 11 I.H 145 App. A, Table 4 I Gerstner, Franz Joseph von (1789-1823) 205 F Getoor, Ronald Kay (1929-) 5 r 261 r 262 r Gevrey, Maurice-Joseph (J 884- 1957) 58.G, r l25.U 168 B 325.1 327 C Ghosh, Javanta Kumar (]937-) 374 F 396 r 399.0 Ghosh, Sakti P. 96.r 
1887 Ghouila-HoUl i, Alain 281 r 282 r Giacaglia, Georgio Eugenio Oscare 290.r Gibbs, Josiah Willard (1839-1903) 136.A, C 159.D 340B402B, D,G419B,C,r Gierer, Alfred (1929-) 263 D Gieseker, David (1943-) 16 Y 72 K Giga Yoshikazu (1955-) 204.C Gikhman, Iosif IJ'ich (1918-?) 406 r Gilbert, Edgar N. (1923-) 63 B Gilford, Dorothy Morrow (1919-) 227 r Gilkey, Peter Belden (1946-) 391 N, r Gill, Stanley Jemen (1929-) 303 D Gillies, Donald Bruce (1928-75) 114.D 173.D Gillman, Leonard (1917-) 425.r Gindikin, Semen Grigor'evich (1937-) 384.C, r Gini, Corrado 397 E Ginibre, Jean (1938-) 212.A, r 375 F Giraud, Georges 323 C. F Giraud, Jean (1936-) 200.M Giri, Narayan C (1928-) 280 r Girsanov, Igor Vladimirovich (1934-) 115 D 136 D 406.B Girshick, M A 398 r 399 F, r Giusti, Enrico (1940-) 275 F Givens, James Wallace, Jr (1910-) 298 D Glaeser, Georges (1918-) 58 C, r Glaser, V 150 D 386.B Glashow, Sheldon Lee (1932-) 132.D Glauber, Roy J. 340.C Glauberman, George Isaac (1941-) 151 J, r Glauert, Hernlann (1892-1934) 205 B, D Glazman, Izrail' Markovich (1916-68) 197.r 251.r 390 r Gleason, Andrew Mattei (1921-) 164 F 196351 L GIeser, Leon Jay (1939-) 399.M Glezerman, M 201 r Glicksberg, Irving Leonard (1925-83) 86.r 425 T Glimm, James Gilbert (1934-) 36 H 150.F, r 204 G, r 308 L Glivenko, Valerii Ivanovich (1896-1940) 374 E 411.F, J, r Glowinski, Roland (1937-) 440.r Gluck, Herman R. (1937-) 183.r Gnanadesikan, Ramanathan 280.r Gnedenko, Boris Vladimirovich (1912-) 250 r 341 G 374.G Godbillon, Claude 154 G, r 201.r Godel, Kurt (1906-78) 22 H 31 B 33 A, C, D, F, r 49 D 9h 156 E, r 184 185.A, C, r 276.D 356 A-C, r 411 J, r Godement, Roger (1921-) 36 L 103 r 192.r 2oo.r 201 r 256 r 277 r 337.r 368 r 383 E, r 450 A, L, T Godwin, A N 51 r Gokhberg (Gohberg), Izrail' Tsudikovich (1928-) 68.A, J, r 251 r 390.H, r Goldbach, Christian (1690-1764) 4.C Go!'dberg, Anatolii Asirovich (1930-) 17.C 272 K, r Goldberg, Richard R. 160 r Goldberg, Samuel!. (1923-) 105 r IIO.E 194.r 364 F 417.r Golden, Sidney (1917-) 212.B Goldman, Oscar (1925-) 29.K 200.L Goldschmidt. David M 151 J Gol'dsheld (Goldseid), I J 3' 1 0 r Goldstein, Sheldon (1947-) 136G Goldstein, Sydney (1903-) 205 r 268 C Goldstine, Herman Heine (1913-) 75 r 138 r Goldstone, Jeffrey (\ 933-) 132.C Golod, Evgenii Solomonovich (1934-) 59 F 161 C Golub, Gene Howard (1932-) 302.r Name Index Griess, Robert Louis Goluzin, Gennadii Mikhailovich (1906-52) 48.r 77 r 438 B, C, r Gomory, Ralph E. (1929-) 215.B, C, r Gonzalez-Acuna, Francisco 235 E Good, Irving John (1916-) 403 G, r Goodier, James Norman (1905-) 271.r Goodman, Timothy N T (1947-) 126 K 136 H Goodner, Dwight Benjamin (1913-) 37.M Goodwyn, L Wayne 126.K 136 H Gopel, Gustav Adolph (1812-47) 3.A Goppa, V D 63.E Gordan, Paul Albert (1837-1912) 11.B 226.G 255 E 353.B Gordon, Marilyn K 303.r Gordon, Walter (1893-1940) 351 G 377.C 387.A Gorensrein, Daniei (1923-) 151 J, r 200 K Gor'kov, Lev Petrovich (1929-) 402 r Gorny, A 58.D Gorry, G Anthony 215 r Gottschalk, W. D 126 r Goulaouic, Charles (?-1983) 323 N Goursat, Edouard Jean-Baptiste (1858-1936) l!.r 20r46r92F 94.F 193.0 198 B 217.F, r 278.r 320 r 321 r 322.r 324 r 428.r Govorov, Nikolai Vasil'evich (1928-) 272 K Grad, Harold (1923-) 41 B Graeffe (Graffe), Carl Heinrich (1799-1873) 301 N Graeub, Werner 256.r Graev. Mark 10sifovich (1922-) 125 r 162.r 183.r 218.r437r Graff, Karl F 446 r Gragg, William Bryant (1936-) 303.F Graham, Ronald Lewis (1935-) 376 r Gram, Jorgan Pedersen (1850-1916) 103.G 208 E 226 E 302.E 317.A Grammel, Richard (1889-) 19.r Granoff, Barry (1938-) 325.L Grant, J. A. 30!.L Grashof, Franz (1826-93) 116 B Grassmann, Hermann Gunther (1809-77) 90 B 105,A 147.1 199.B 256.0 267 Grau, Albert A (1918-) 301 E Grauert, Hans (1930-) 2021 I, L, Q, r 23 E-G, r 32.F 72.E, G, H, J, r 118.E 147 r 178.F 194 F 232.r 418.B Graunt, John (1620-74) 40.A 401.E Gray, Andrew 39.r Gray, John Walker (1931-) 11O.E Gray, Robert M (1943-) 213 E, F Graybill, Franklin Arno (1921-) 403 r Green, Albert E 271.r Green, George (1793-1841) 45.D 94 F 105.W 120 A 188.A 189 A, B 193 D, J, N 252 K 315.B 327.D App A, Tables 3.m, 4.11, 15.VI Green, H E 291.F Green, Herbert Sydney (1920-) 402.1, r Green, James Alexander (1926-) A pp B, Table 5.r Green, Leon W (1925-) 136 G 178.G Green, Mark L (1947-) 21 N Green, Melville S. 36!.r 402.r Greenberg, Bernand G (1919-85) 374.r Greenberg, Leon (1931-) 234.D, r Greenberg, Marvin Jay (1935-) 93.r 118.r 201 r Greene, John M. (1928-) 387.B Greene, Robert Everist (1943-) 178 r 365 B Greenwood, J Arthur STR Gregory, James (1638-75) 332 Gregory, R T 301 r Grenander, Ulf (1923-) 395.r 421 r Griess, Robert Louis, Jr. (1945-) 151 I 
Name Index Griffiths, Phillip A. Griffiths, Phillip A (1938-) 9 E, r 16 J, r 21.N 72.G l24.r 2l8.E 272 L Griffiths, Robert Budington (1937-) 212 A Grigelionis, Bronyus Igno (1935-) 262 r Grillenberger, Christian (1941-) 136 r Grisvard, Pierre (1940-) 224.E Grobman, David Matveevich (1922-) 126 G Gromoll, Detlef (1938-) 109 r 178 r 279 G Gromov, Mikhael L. 154 F l78.r 364.H Gross, Oliver Alfred (1919-) 86 r Gross, W. (?-19l8) 62.E 246.G 272 1 429 D Grothendieck, Alexander (1928-) 3.N, r 12 B, r 13.B, r l6.E, U, Y, AA, r 29.K 52.r 68 A, K, M, N, r 125.r l68.B, r 200.1, r 203.H 21O.r 237 A, B, J 325.J 366 A, D, r 383 E, r 424.L, S, X, r 426 443 A, J) 450 Grotschel, Martin 215.C Grotzsch, Herbert (1902-) 352.A, C 438 B Grove, Karsten 178.r Groves, G. W. 92 r Grunbaum, Branko (1929-) l6.r 89.r Grunbaum, F Alberto 41 C Grunsky, Helmut (1904-86) 77.E, F, r 226.r 438.B Grunwald, Geza (1910-42) 336.E Grushin, Viktor Vasil'evich (1938-) 323.K, N 345A Guckenheimer, John M. (1945-) l26.K, N Guderley, Karl Gottfried (1910-) 205 r Gudermann, Christoph (1798-1852) l31.F 447 App A, Tables 16, l6.m Guerra, Francesco (1942-) 150.F Guest, Philip George (1920-) 19.r Gugenheim, Victor K. A M. (1923-) 65 D Guggenheim, Edward Armand (1901-) 419 r Guignard, Monique M. 292.B Guilford, Joy Paul (1897-) 346.r Guillemin, Victor W. (1937-) 105.r 191 r 274.1, r 325 L 391.J, N 428.F, G 431.r Guiraud, Jean-Pierre 41.D Gulliver, Robert D., II (1945-) 275.C 334.F Gumbel, Emil J (1891-) 374.r Gundlach, Karl-Bernhard (1926-) 32.G Gundy, Richard Floyd (1933-) 168 B 262.B Gunning, Robert Clifford (1931-) 21.r 23.r 367 G, r Gunson, Jack 386.C Gupta, Suraj Narayan (1924-) l50.G Guthrie, Francis 157 A Guthrie, Frederick l57.A Guttman, Louis 346.r Gyory, Kidman (1940-) l18.D Gysin, Werner l14.G l48.E 2010 H Haag, Rudolf (1922-) 150.C-E 351.K 402 G Haar, Alfred (1885-1933) 142.B, r 225 C, r 255.E 3l7.C 321.C 323.E 334.C 336.B Haberman, Shelby J. 280.r Haboush, William J (1942-) l6.W Hadamard, Jacques Salomon (1865-1963) 20.r 43 E46A 58.F, r 109 III 1121 C 123 B 124.B l25.A l26.J l59.J l78.A, B 208 D 272 E 320.r 321.A, G 323 B 325 B, r 339.A, D 357.r 429 B 450.B, 1 App A, Table 8 Hadley, George F. 227.r Haefliger, Andre (1929-) 105.r l14.D I 54.A, C- H Haff, L R 280.0, r Hagihara Yusuke (1897-1979) l33.r 420.r Hagis, Peter H., Jr (1926-) 297.D 1888 Hahn, Frank John (1929-) l36.G Hahn, Hans (1879-1934) 37.F 93.D l60.F 390.G 424.C Hajek, Jaroslav (1926-74) 371.r 399.N, r Hajian, Arshag B. (1930-) 136.C, F Haj6s, Gyorgy (Georg) (1912-72) 2.B Haken, Hermann 95 Haken, Wolfgang R. G. (1928-) 157.A, D l86.r 235.A Halanay, Aristide (1924-) l63.r 394 r Halasz, Gabor (1941-) l23.E Halberstam, Heini (1926-) l23.r Hale, Jack Kenneth (1928-) l63.B, H, r 286.r 290.r Hall, G 303 r Hall, Marshall,Jr. (1910-) 29.H 66.r 151 I, r 161 C, r 190.r 241.C App. B, Table 5 Hall, Philip (1904-82) 151 B, D-F 190.G Hall, William Jackson (1929-) 396.r Hallen, E. G. l30.r Hallstrom, Gunnar af 124 C Halmos, Paul Richard (1916-) 42.r 136.E, H 197.r 225.r 251.r 256 r 270.r 276 E 381 r 390.r Halphen, Georges Henri (1844-89) llO.B l34.r Hamachi, Toshihif() (1942-) 136.F, r Hamada, Noboru (1940-) 96.r Hamada, Yusaku (1931-) 321.G Hamburger, Hans (1889-1956) 240.K 450.M Hamel, George (1877-1954) 270.J 388.B Hamilton, Richard S. (1943-) 195.E 3:,2.C Hamilton, William Rowan (1805-65) 20 29.B 108.B 126.A, L l51.B l86.F 2l9.C 267 269 F 271.F 294.F, r 324.E 351.D 441.B 442.D Hamm, Helmut A 4181 Hammersley, John Michael (1920-) 30.r 385 r Hammerstein, H. 217 M Hamming, Richard W. (1915-) 63.B, C l36.E 223.r Hampel, Frank R. 371.A, I, r431.r Hanai, Sitiro (1908-) 273.K 425 CC Hanania, Mary I. 346.r Hancock, H. l34.r Handel, Michael (1949-) l26.J Handelman, David E (1950-) 36.H Handscomb, David Christopher (1933-1 385.r Haneke, Wolfgang l23.C Hankel, Hermann (1839- 73) 39 B l74.A 220.B App. A, Table 19.1J1 Hanks, R 224.E Hannan, Edward James (1921-) 421 r Hanner, Olof(Olaf) (1922-) 79.r Hano, J un-ichi (1926-) 364.F 365.L Hansen, Frank (1950-) 212.C Hansen, Johan Peder (1951-) 16.1 Hansen, Peter Andreas (1795-1874) App A, Table 19.m Hanson, David Lee (1935-) 136.E Hanson, Richard J. (1938-) 302.r Happel, H. 420.r Hara, K6kiti (1918-) 329.r Harada, Koichiro (1941-) 151.J, J Harada, Manabu (1931-) 2oo.K Harari, Haim (1940-) l32.r Harary, Frank (1921-) 186.r Hardorp, Detlef l54.H Hardt, Robert Miller (1945-) 275.C Hardy, Godfrey Harold (1877-1947) .C, D, r 20.r 43.E, F 83.r 88.r 106.r 121.r l23.C, r 159.G, H, r l64.G l68.B 2l1.r 216.r 220.B 224.E 242.A, B, r 295.r 3l7.B 328.*, r 339.B, C 379.F, M. S, r 450.B, 1 App A, Table 8 
1889 Harish-Chandra (1923-83) 13 r 32 r 122.F, G 249.V 308 M 437.V, X, AA, CC, EE 450.T Harle, Carlos Edgard 365 E Harman, Harry Horace (1913-76) 346.F, r Harnack, Carl Gustav Axel (1851-88) loo.E 193.1 Harris, Joseph 9 E, r 16.r Harris, Theodore Edward (1919--) 44.A 260J 340.C Harris, William Ashton (1930-) 289 E Hart, 1. F. 142 r NTR Hartley, Herman Otto (1912-80) STR Hartley, Richard Ian 235.E Hartman, Philip (1915-) 107.r 126.G 195.E 252.r 254.r 313.r 314.r 315.r 316.r 365J Hartmanis, Juris (1928-) 71.r 75.r Hartogs, Friedrich (1874-1943) 20 21.C, F, H, Q Hartshorne, Robin (1938-) 9.r 16.R, r 343 r Harvey, F. Reese (1941-) 112 C, D 125.Y, Z Harvey, William James (1941 -) 234.r Hasegawa, Hirosi (1782-1838) 230 Hashimoto, Isao (1941-) 280 r Hashimoto, Takeshi (1952-) 213.E Hashimoto, Tsuyoshi (1948-) 173 E Hasse, Helmut (1898-1979) 9 E 12.B 14.E, L,O-R, U, r 27.D, E 29.G 59. A, G, H, r 73.A 113 118.C, F 149.r 242.B 2S7.F, H, r 295.r 297.1 347.r 348.D, G 449.r 450.A, L, P, S Hastings, Cecil (1920-) 142 r NTR Hastings, D. W. 40.D Hasumi, Morisuke (1932-) 12S.BB 164.K Hatakeyama, Yoji(1932-) 110E,r Hattori, Akio (1929-) 237.H 431.D Hattori, Akira (1927-86) 200 K, M Haupt, Otto (1887-) 268.E Hausdorff, Felix (1868-1942) 79 A 117 G 169 D 224 E 234 E 240 K 246.K 249.R 273 J 317 B 381 r 388.r 423.B 425.A, N, P, Q, AA 426 436.C 443.1 Hawking, Stephen W (1942-) 359.r Hayashi, Chihiro (1911-) 290.r Hayashi, Chikio (1918-) 346.E Hayashi, Kazumichi (1925-) 207 C, r Hayashi, Keiichi (1879-1957) NTR Hayashi, Mikihiro (1948-) 164 K Hayashi, Tsuruichi (1873-1935) 230267 Hayman, Walter Kurt(1926-) 17.D 124r 193r 272.K, r 391.D 438.C, E Haynal, A 33 F, r Heath, David Clay (1942-) 173.E Heath, Sir Thomas Little (1861-1940) 181.r 187.r Heath-Brown, David Rodney 118 D 123 C, E Heaviside, Oliver (1850-1925) 12S.E 306.A, B App. A, Table 12.11 Heawood, P. J (1861-1955) 157 A, E Hecht, Henryk (1946-) 437 X Hecke, Erich (1887-1947) 6.D I1.B 14.r 29.C 32.C, D, H. r 73.B 123.F, r 348.L, r 450.A, D-F, M,O Hector, GilbertJoseph (1941-) 154.H Hedlund, Gustav Arnold (1904-) 126 A, r 136 G Heegaard, Poul (1871-1948) 65.C 247.r Heesch, Heinrich (1906-) 92.F 157.A, D Heiberg, Johan Ludvig(1854-1928) 181 r 187.r Heilbronn, Hans Arnold (1908-75) 123.D 347.E 450.K Heine, Heinrich Eduard (1821-81) 206 C 273.F 393.C Heins, Maurice Haskell (1915-) 77.F 164 K 198.r 207 C 367.E, 0, r Heintze, Ernst 178 r Heinz, Erhard (1924-) 323 J Name Index Hikita, Teruo Heisenberg, Werner Karl (1901-76) 150.A 351.C, D 386.C Heitsch, James Lawrence (1946-) 154.G Held, A. 359.r Held, Dieter( 1936-) 151.1 Helgason, Sigurdur (1927-) 109.r 199.r 218.G 225.r 248 r 249.r 412 r 413.r 417.r 437.Y, AA, EE Hellerstein, Simon (1931-) 272.K, r Hellinger, Ernst D. (1883-1950) 197.r 217.r 390.G Helly, Eduard (1884-1943) 89.B 94.B Helmholtz, Hermann von (1821-94) 139.A 188.D 205.B 419.C 442.D App. A, Tables 3, 15.VI Helms, Lester L. 120.r 193.r Helson, Henry (1927-) 164.G, H, r 192.P-R, r 251 r Hemmingsen, Erik (1917-) 117.E Hempel, John Paul (1935-) 65.E Henderson, David William (1939-) 117.1 Henkin  Khenkin Henkin, Leon (Albert) (1921-) 276 D Henon, Michel 287.B, r Henrici, Peter K. (1923-) 138.r 3oo.r 301.r 303.r Henry, Charles 144.r Henry, N F M 92 r Hensel, Kurt(1861-1941) l1.r 12.B 14.U 118.C 236.r 370 C 439.L Henstock, Ralph (1923-) loo.r Hepp, Klaus (1936-) 146.A 150.r Herbrand, Jacques (1908-31) 14.K 59.A, E, H, r 156.E, r 2oo.N 356.A, E Herglotz, Gustav (1881-1953) 43.1 192.B 325J Hering, Christoph H. 151.J Herman, Michael-Robert (1942-) 126.1, N 154.G Hermann, Carl Heinrich (1898-1961) 92.F Hermes, Hans (1912-) 31.r 97.r 356.r Hermite, Charles (1822-1901) 14 B 60.0 107.A 13l.D 167 C 176.1 182.A 199.A 217.H 223.E 232.A 251.E, 0 2S6.Q 269.1 299 A 317 D 344.F 348 F 412 E, G 430 A App A, Tables 14 II, 20 IV Herodotus (c. 484-c. 425 B.C.) 181 Heron (between 150 B.C. and A.D. 200) 187 App. A, Tables 2.11. III Hersch, Joseph (1925-) 143 A 391.E Hertz, Heinrich Rudolf (1857-94) 441.B Hertzig, David (1932-) 151.1 Herve, Michel Andre (1921-) 23.r 62.B Herz, Carl Samuel (1930-) 206 E, r Herzberger, Maximilian Jacob (1899-1981) 180.r Hesse, Ludwig Otto (1811-74) 9.B 139 H 226.D 279 B, E, F Hessel, Johann Friedrich Christian (1796-1872) 92.F Hessenberg, Gerhard 155.r Hessenberg, K. (1874-1925) 298 D Hestenes, Magnus R. 302.D Heun, Karl 303 D Hewitt, Edwin (1920-) 1 92.P, r 342.G 422.r 425 S, BB Hey, Kate 27 F 450.A, L Heyting, Arend (1898-1980) 156.r 411 J, r Hicks, Noel J. (1929-79) 111.r Hida, Takeyuki (1927-) 176.r Higgs, Peter Ware (1929-) 132.D Higman, Donald Gordon 151.1 Higman, Graham (1917-) 97 *, r 151 A, F, I, r 190.M Higuchi, Teiichi (1933-) 23 r Higuchi, Yasunari (1949-) 340.0, r Hijikata, Hiroaki (1936-) 13.0, P, R Hikita, Teruo (1947-) 75 r 
Name Index Hilb, Emil Hilb, Emil (1882-1929) 253.r Hilbert. David (1862-1943) 4 A 9 F 11.B 14 J-L, R, U 15H 16E,S20*,r32.G35A,r41 E46A, C, E, r 59 A 68 A, C, I, L 73 B 77 B, E 82.r 92.F 93J 105.2 107 A, rIll I 112 D, r 120.A 1261 150.G 155.A-C, E, G, H I 56.A, C, D, r 160.D 162 172.J 179.B 181188.r 189r 196A, B 197.A, B, r 217 H-J, r 220 E 222 r 226 G, r 253.D 267 284 A 285.A, 286 K 304 r 317.r 320 r 321 r 322 r 323.E, I, r 324 r 325 M 327.r 337.F 347 G, H, r 356.A 357 r 3641 365 J 369 D, F 375.F 377 D 382.B 389.r 402 H 410 r 411.1, r 423 N 424.W 430 A 441 r 443 A. 446.r App A, Table 8 Hildebrandt, Stefan O. W (1936-) 195.E 275 B 334 F Hildebrandt, Theophil Henry (1888-1980) 310 r 443 A Hildreth, C. 349 r Hilfcrty, M. M. 374 F Hilfinger, P. N. 75 r Hill, Edward Lee (1904-) 150.B Hill, George William (1838-1914) 107 A 268.B, E Hill, Rodney (1921-) 271.r Hille, Einar (1894-1980) 106 r 107.r 115 A 136 B l60.E 162 198.r 2 I 6.r 251.r 252.r 253.r 254.r 286 X, r 288 r 289.r 313 r 315 r 316 r 336.r 378 B, D, r 438 B Hillier, Fredcrick S. 215.r Hilton, Peter John (1923-) 70 r 201 r 202.P, U HinCin - Khinchin Hinman, Peter G (1937-) 22.F, r 356 r Hinohara, Yukitoshi (1930-) 200 K Hipparchus (190?-125? B c.) 187432.C Hippias (late 5th century B C.) 187 Hippocrates (of Chi os) (470'1430'1 Be) 187 Hirai, Takeshi (1936-) 437 W Hirano, Sugayasu (1930-) 301 G Hirashita, Yukio (1946-) 164 C Hirayama, Akira (1904-) 230 r Hironaka, Heisuke (1931-) 12 B 16.L, 2 21 L 23 D 72 H 232.C 418 B Hirose, Hideo (1908-81) 230 r Hirota, Ryogo (1932-) 387.D Hirsch, Guy Charles (1915-) 201.1 427.E Hirsch, Morris William (1933-) 114 C, D, J, r 126 J, r 279.C Hirschfeld, Joram 276 r Hirschman, lsidore I (1922-) 220.r Hirzebruch, Friedrich Ernst Peter (1927-) 12.B 15 D, G, H 20 32.F 56G, r 72 K, r 109 114A 147 r 237 A 366 A, B, D, r 383 r418.r426 431.D, r Hitchcock, Frank Lauren (1875-1957) 301.E Hitchin, Nigel James (1946-) 80 r 364 r Hitchim., G D 30!.L Hitotumatu, Sin (1926-) 186 r 301 D 389.r NTR Hitsuda, Masuyuki (1938-) 176 H Hlavaty, Vaclav (1894-'J) 434.r Hlawka, Edmund (1916-) 182 D Ho, B L 86.D Hobson, Ernest William (1856-1933) 133.r 393 C, r Hochschild, Gerhard Paul (1915-) 6.E 13.r 59.H 200.K-M, 0, Q, 249.r Hochster, Melvin (1943-) 16 Z Hocking, John Gilbert (1920-) 79.r 201.r Hocquenghem, Alexis (1908-) 63.D Hodge. Sir William Vallance Douglas (1903-75) 12 B 15 D 16.V, r 20109.*, r 194 B, r 232 A, B, D 343r Hodges, Joseph Lawson (1922-) 371 A, H 399 E, H, N, P,r 1890 Hodgkin, Alan Lloyd (1914-) 291 F Hodozi, Yosi (1820-68) 230 Hoeffding, Wassily (1914-) 371.A 374 I 400.r Hoffman, Banesh H. (1906-) 359.D 434 C Hoffman, David Allen (1944-) 275.r 365.H Hoffman, Kenneth Myron (1930-) 43.r 164 F, G, I, r Hoffmann-Jorgensen, Jorgen (1942-) 22,r Hogg, Robert Vincent, Jr (1924-) 371 r Holder, Otto (1859-1937) 84.A 104 F 168 B 190 G 211 C 277.1 288 D 379 M App. A, Table 8 Holland, Paul W (1940-) 280 r 403.r Holley, Richard Andrews (1943-) 44.E 340 r Holm, Per (1934-) 418.r Holmgren, Erik Albert (1872-) 125.DD 321 F 327.C Holmstedt, Tord 224 C Homma, Tatsuo (1926-) 65.E 235 A Honda, TaIra (1932-75) 3 C 450.Q, S, r Hong Imsik (1916-) 228 B, r Hong Sing Leng 365 N Hood, William Clarence (1921-) 128.r Hooke, Robert (1635-1703) 271.G Hooker, Percy Francis 214.r Hooley, Christopher (1928-) 123.E, r 295 E Hopcroft, John E (1939-) 31 r 71 r 75 r 186.r Hopf, Eberhard (1902-83) 1111 126.A, M 136 B 162 B, C, G, r 204.B, C, r 222.C 234.r 270.E 286.U, X 433.B, C Hopf, Heinz (1894-1971) 65.r 72.K 93 r 99.r 109 1111, r 126.G 147.E 153.B 178.A 201 r 202.A, B, I, Q, S, U, V, r 203.A, C, D, H 249 V 305.A 365 H 425.r 426 *, r Hopkins, Charles 368.F Horikawa, Eiji (1947-) 72 K, r Hormander, Lars Valter (1931-) 2021.1, r 107.r 112 B-D, H, K, L, R, r 115.D 125.A 164 K 189.C 274.D, I 286.J 320.1 321.r 323.M 325 H 345 A, B Horn, Jacob (1867-1946) 107 A 206 D 314.A Horner, William George (1786-1837) 301.C Horowitz, Ellis (1944-) 71 r Horrocks. Geoffrey (1932-) 16.r Hosokawa Fujitsugu (1930-) 235 D Hotelling, Harold (1895- I 973) 280 B :;74 C Hotta, Ryoshi (1941-) 437'x Householder, Alston Scott (1904-) 298 D 301 r 302.E Houseman, E E 19.r Howard, Ronald Arthur (1934-) 127.[ Howarth, Leslie (1911-) 205 r Hrbacek, Karel 33.r 293.E, r Hsiang Wu-Chung (1935-) 114J, K 431.D, r Hsiang Wu-Yi (1937-) 275 F 365.K 431.D, r Hsiung Chuan-Chih (1916-) 364 F 365 H HsilKwang-Ch'i(1562-1633) 57.C Hu Sze-Tsen (1914-) 79 r 91 r 148 r 201 r 277 r Hu Te Chiang 281.r Hua Loo-Keng (Hua Luo K'ang) (1910-85) 4.A, E, r 122.E 242.A, r 295.E Huang, Kerson (1928-) 402.r Huber, Peter J 371.A, H, J, r 399 H, P r Huber-Dyson, Verena 362.r Hudson, John F P 65 C, D Huff, Robert E (1942-) 443 H Hugenholtz, Nicholaas Marinus (1924-'1 308 H Hugoniot. Pierre Henri (1851-87) 51 E 204 G 205 B Hukuhara, Masuo (1905-) 30.C, r 88.A 254 D 288 B, r 289.B-D 314 A, C, D 315.C 316 E 388 B 443.A 
1891 Hull, Thomas Edward (1922-) 206.r 303.r Humbert, Georges (1859-1921) 83 0 Humphreys, James E. (1939-) 13 r 248 r Hunt, Gilbert Agnew (1916-) 5.H 162 176 G 260.1, r 261 A, B 338 N, 0 400.F 407 B Hunt, Richard A. 159.H 168.B, r 224 E Huntley, H. E. 116.r Huppert, Bertram 151.0, r Hurewicz, Witold (1904-S6) 117 A, C', r 136 B 1480202 A, B, N, r 426 Hurley, Andrew Crowther (1926-) 92 F Hurst, Charles A. (1923-) 212.A 402.r Hurwicz, Leonid 2550, E, r 292 E, F, r Hurwitz, Adolf(1859-1919) 3 K 9.IIO.E 11.0 83.B 134.r 198 r 339 0 367 B 450 B, r Husemoller,Oa!cH (1933-) 15r56r147r Huxley, Andrew Fielding (1917-) 291 F Huxley, Martin Neil (1944-) 123.E, r Huygens, Christiaan (1629-95) 93.H 245 265 325 B, 0446 Huzita, Sadasuke (1734-1807) 230 Hwa, Rudolph C (1931-) 146.r Hypatia (370 L 4l8) 187 lagolnitzer, Daniel (1940--) 146 C 274.0, I 386 C, r Ibragimov, il'dar Abdulovich (1932-) 176 r 2S0.r Ibuki, Kimio (1918-) 75.0 Ibukiyama, Tomoyoshi (1948--) 4S0.S Ichihara, Kanji (1948-) 115 0 Igari, Satoru (1936-) 224 E Igusa, Jun-,chi (1924-) 3 r 9.J 12 B IS.E, F 16.Z, r 32.F !l8 C, 0 Ihara, Yasutaka (1938-) 59 A, r 450.A, M, Q, S, U Iitaka, Shigeru (1942-) 72.1, r lizuka, Kenzo (1923-) 3621 Ikawa, Mitsuru (1942-) 325 K 345.A Ikebe, Teruo (1930-) 112.E, P 375 B, C lkebe, Yasuhiko (1934-) 298.r Ikeda, Akira (1947-) 391 C Ikeda, Hideto (1948-) 96 r Ikeda, Masatoshi (1926-) 200 K, L Ikeda, Nobuyuki (1929-) 44.r 4S.r 115.r 250.r 262.r 399.r 406.F, r Ikeda, Tsutomu (1950-) 3040 Ikehara, Shikao (1904-84) 123 B 160G il'in, Arlen Mikhailovich (1932-) 327 r Illusie, Luc (1940-) 366.r 1m Hof, Hans-Christoph 178 r Imai, Kazuo(19S2-) 41 D Imanishi, Hidcki (1942-) 154.H Inaba, Eizi (1911-) 337.F Inaba, Takashi (1951-) 1540 Inagaki. Nobuo (1942-) 399 N Incc, Edward Lindsay (1891-1941) 107 r 252 r 254 r 268 C, 0 288.r 289 r Infeld. Leopold (1898-1968) 206 r 359 0 Ingham, Albert Edward (1900-) l23.C, r 242 A Inose, Hiroshi (1951--79) 16 r 450.S Inoue, Masahisa (1946 -) 72 K Inoue, Masao (191S-) 338 r Iochum, Bruno 35!.L lonescu-Tulcea, Alexandra 136.B, C Iooss, Gerard (1944-) 126.M Ipsen, 0 C 116.r In, Masao (1933-) 66.r 186 r 281.r 299.B 301 F 303 E. r Irie Seiiti (1911-) 62 E Name Index Jaglom Irwin, Michael C. 65.D 126.G, r Isaacs, Rufus Philip (1914--81) 108.A Isbell, John Rolfe (1930-) 436 r Iseki, Kanesiroo (1920-) 328 Iseki, Sho (1926-) 328 Iseki, Tomotoki (fl. 1690) 230 Ishida, Masanori (1952-) 16.Z Ishihara, Shigeru (1922-) 11O.r 364.F 365 H Ishihara, Toru (1942) 19S.r Isit, Keiiti (1932-) 255.0 399.r Ising, Ernest (1900-) 340.B, C Iskovskikh, Vitalii Alckscevich 16.J Ismagilov, R S. 183.r Isozaki, Hiroshi (1950-) 375 B Israel, Robert B (1951-) 402.r Israel, Werner (1931-) 3S9 r Iss'sa, Hej 367 G Itaya, Nobutoshi (1933-) 204 F Ita, Kiyosi (1915-) 5 E, r 45 G, r 115 A, C 1761, r 261.A, r 395 C, r 406 A-O, G, r 407 A, C, r Ita, Masayuki (1940-) 338.0 !to, Noboru (1925-) 151.H, J Ita, Seizo (1927-) 204 B 270.r 327 r !to, ShunJi (1943-) 126.K 136.C, r !to, Takashi (1926-) 192.r Ita, Teiiti (1898-) 92.F Ita, Y oshifumi (1940-) 125.BB lto, Yuji (1935-) 136C,F !toh, Mitsuhiro (1947-) 80.r Hoh, Takehiro (1943-) 275.A 365 G Iversen, F. 62.E 272.\ Ivory, Sir James (1765-1842) 350.E I vrii, V Ya. 325.H Iwahori, Nagayoshi (1926-) 13.R, r 249 r 442.r lwamura, Tsurane (1919-) 85.A,r Iwaniee, Henryk 123.C, E Iwano, Masahiro (1931-) 2S4.0 289.0, E Iwasawa, Kenkichi (1917-) 60, F l4.L 32 r 243.G 248 F, V 249 S, T, V, r 257.H 384.C 450 A, F, J, L, N, r lyanaga, Shokichi (1906-) 6.r 7 r 14 Q, r 59.0, E, r 60 r 149.r 161 r 200 r 277 r 294 r 343 r 362.r 368 r 409 r Izumi, Shin-ichi (1904-) 121 r 160 B, F 310 r J Jackiw, Roman Wildmir (1939-) 80r Jackson, Dunham (1888-1946) 336.C, E, r Jackson, John David (1925-) 130 r Jackson, Kenneth R 303.r Jacob, Maurice R 132 r 386.r Jacobi, Carl Gustav Jacob (1804-51) 3.A, G, L 4.0 9 E, F 11 B, C 2046 C 105. 107 B 108.B 126 A 134.A, C', I, J, r 178.A 182.H 202 P 208.B 229 248.A 267 271.F 296 A 2971 298.B 302 C 31703240, E 348 K 390.G 420 A, F 428 C App A, Tables 141, II, 16.1, III, 20 V Jacobowitz, Howard (1944) 286 J 344.B Jacobs. Konrad (1928-) 136.H, r Jacobson, Florence 0 231.r Jacobson, Nathan (1910-) 27.r 29 r 54 r 67.D 149 r i72.A, K, r 231 r 248.r 256 r 368.H, r 499 r Jacod, Jean M. (1944-) 262.r Jacquet, Herve Michel (1933-) 32 r 437.r 4S0.A, N,O Jaeckel, Louis A. 371 H, J, r Jaffe, Arthur Michael (1937-) 150.C, F, r Jaglom  Yaglom 
Name Index Jahnke, Paul Rudolf Eugene Jahnke, Paul Rudolf Eugene (1863-1921) 389.r NTR James, Alan Treleven (1924-) 102 r 374.r James, loan M. (1928-) 202 Q, U James, Ralph Duncan (1909- 79) 100 A, r James, Robert Clarke (1918-) 37.G James, W. 2800, r 398.r 399 G Jancel, Raymond (1926-) 402.r Janes, E. T. 403.C Janet, Maurice (1888-) 365.B Janiszewski, Zygmund (1888-1920) 426 Janko, Zvonimir (1932-) 151.1, J App B, Table 5.UI Janner, Aloysio 92.r Jans, James Patrick (1927-) 368 r Janzen, O. 246 G Jarratt, Peter(1935-) 301 N Jauch, Joscf-Maria (1914-74) 375.A, r Jayne, J E 22.r Jech, Thomas (1944-) 22 r 33 F, r Jeffreys, B. S. 25 r Jeffreys, Harold (1891-) 25.B, r Jelinek, Fredrick (1932-) 213 E, r Jenkins, Gwilym M. 128.r 4210, G, r Jcnkins, Howard B. 275.A,0 Jenkins, James Allistcr (1923-) 77.F 143 r 438.B, C Jensen, Johann Ludwig Wilhelm Waldemar (1859- 1925) 88.A 121.A 164 K 198.F Jensen, K. L. 145 Jensen, Ronald B. 33 F, r 356.r Jentsch, Robert 339.E Jerison, Meyer (1922-) 425.r Jessen, Raymond J (1910-) 373.r Jeulin, Thierry 406.r Jewett, Robert Israel (1937-) 136 H Jimbo, Michio (1951-) 253.E 387.C Jimbo, Toshiya (1941-) 164.r Jifina, Miloslav 44.E Joffe, Anatole 44 C John, Fritz (1910-) 112.B 168 B 218.F 262.B 274.F 292.B 3oo.r 304 r 320.r 321.r 323.r 324 r 325.r 327.r John, Peter W. M. (1923-) 102.r Johns, M. Vernon, Jr. (1925-) 371.H, r Johnson, Norman Lloyd (1917-) 374 r Johnson, Wells 14.L Johnson, William B. (1944-) 68.K, M Johnston, John (1923-) 128.r Jolley, Leonard Benjamin William (1886-) 379 r Joly, Jean-Rene Benoit (1938-) 118.r Jona-Lasinio, Giovanni 361.r J onckheere, A. R. 346 r Jones, B. W. 347.r 348.r Jones, Floyd Burton (1910-) 273.K Jones, John 0 S 80.r Jones, William (1675-1749) 332 Jordan, C. W 214.r Jordan, Camille (1838-1922) 20.r 79.A n.A, F 93.A, B, F, K 104.r 151.H 1 59.B 166.B 190 G, Q, r 267269 G 270.0, G 277.1 302.B 310.B 333.A 362.K 380.C App. A, Table 8 Jordan, Ernst Pascual (1902-) 150.A 231 B 351.L 377 B Jordan, Herbert E App. B, Table 5 r Joreskog, Karl G. (1935-) 403.r Joseph, Peter 0 405.r Jost, Res Wilhelm (1918-) 150 r 386.B Joule, James Prescott (1818-89) 130.B Julia, Gaston Maurice (1893-1978) 21.Q, r 43.K 124.B 198.r 272.F 429.C 435.E Jung, Heinrich Wilhelm Ewald (1876-1953) 15.r 1892 Jureckova, Jana (1940-) 371.J, r Jurkat, Wolfgang (Bernhard) (1929-) 123.0 Jutila, Matti Ilmari (1943-) 123.E Juzvinskii  Yuzvinskii K Kac, I. S  Kats Kac, Mark (1914-84) 41.C 115.C 150.F 250.r 261.r 287 C 295.E 340 r 341.r 351 F 391.C, r Kaczmarz, Stefan (1895-1939) 317r Kadanoff, Leo Philip (1937-) 361.r Kadison, Richard Vincent (1925-) 36.G, K 308 r Kadomtsev, Boris Borisovich (1928-) 387 F Kagan, Abram Meerovich (1936-) 374 H Kahan, William M. 302.r Kahane, Jean-Pierre (1926-) 159.H, r 192.Q, r Kiihler, Erich (1906-) 109191.I 199.A 232A 365.L 428.E, r Kailath, Thomas (1935-) 86.0, r Kainen, Paul C 157.r Kaiser, Henry F. 346 F, r Kakeshita Shin-ichi (1934-) 371.A Kakeya, Soichi (1886-1947) 10.E 89 E Kakutani, Shizuo (1911-) 37.N 136.B-0, F 153.0 162 286031 O.A, G 352.A 367.0 398.G Kalashnikov, Anatolil Sergeevich (1934-) 327.r Kall, Peter 408 r Kallen, Anders Olof Gunnar (1926-68) 150.0 Kallianpur, Gopinath (1925-) 86.r 250.r 405.r Kalman, Rudolf Emil (1930-) 86.A, C-F 95 405 G, r Kalmar, Laszlo (1905-76) 97.* Kaluza, Theodor, Jr (1910-) 434 C Kamae, Teturo (1941-) 136.H 354.r Kambayashi, Tatsuji (1933-) 15.r Kamber, Franz W. (1936-) 154.G, H, r Kamenskii, Georgii Aleksandrovich (1925-) 163.r Kametani, Shunji (1910-) 62 E 124 C Kamke, Erich (1890-1961) 316 r Kampe de Feriet, Joseph (1893-1982) :'06.0 393.E, r 428 r Kan, Daniel M 70.E Kanamori, Akihiro (1948-) 33.r Kaneda, Eiji (1948-) 365.E Kaneko, Akira (1945-) 162 Kanel', Yakob Isaakovich (1932-) 204.F, r Kaneyuki, Soji (1936-) 384.r Kanitani, Joyo (1893-) 110.B, r Kannan, Rangachary (1946-) 290.r Kano, Tadayoshi (1941-) 286,Z Kantor, William M. 151.J Kantorovich, Leonid Vital'evich (1912-86) 46.r 162 217.r 255.E 304 r 310.A Kaplan, Wilfred (1915-) 106.r216r313.r Kaplansky, Irving (1917-) 2 E, r 107.r l13.r 200.K 241.E 248.r 308 C, r Kapteyn, Willem(1849-?) 39.0 App. A. Table 19.111 Karacuba, Anatolii Alekseevich (1937-) 4.E Karhunen, Kari (1915-) 395 r Kariya, Takeaki (1944-) 280.r Karlin, Samuel (1923-) 222.r 227.r 260.1 263.E 310.H 336.r 374.r 399.G, r 4oo.r Karp, Carol (1926-72) 356.r Karrass, Abe 161.r Karzanov, A. V. 281.r Kas, Arnold S. 15.H 16.R Kasahara, Kenkiti (1935-) 21.M Kasahara, Koji (1932-) 325.H Kasai, Takumi (1946-) 71 r 
1893 Kasch, Friedrich (1921-) 29 H Kashiwara, Masaki (1947-) 68.F 125 DO, EE, 146.A, C 162274.1 386.C 418.H 428.H 437.r Kasteleyn, P. W. (1924-) 212.A Kastler, Daniel (1926-) 150.E 351 K 402.G Kataoka, Kiyomi (1951-) 125.00, r Kataoka, Shinji (1925-) 408.r Katetov, Miroslav II7.A, 0, E Kato, Junji (1935-) 163.r Kato, Kazuhisa (1946-) 126.J Kato, Masahide (1947-) 72.K Kato, Mitsuyoshi (1942-) 650147 Q 418 r Kato, Tosio (1917-) 68.r 162 204.B, C, E 289.E 304.r 331.A, B, E, r 345.A 351.0 375.A-C 3780, E, H-J 390 r Katok, Anatolil Borisovich (1944-) 126 N 136 E, F, H,r Kats, Izrail' Samoilovich (1926-) 115 r Katsurada, Yoshie (1911-80) 365.H Katz, Jerrold J. 96.r Katz, Nicholas M. (1943-) 16.r 450.1, Q, r Katznelson, Yitzhak (1934-) 136 E I 59.H, I 192.r Kaufman, Sol (1928-) 399.N, r Kaul, Helmut (1936-) 195 E Kaup, Wilhelm 384.r Kawada, Yukiyosi (1916-) 59.H Kawaguchi, Akitsugu (1902-84) 152.C Kawai, S6ichi (1937-) 72.r Kawai, Takahiro (1945-) 125.BB, DO, r I 46.A, C 162274 I 386 C 428.H Kawai, Toru (1945-) 293.E, r Kawakami, Hiroshi (1941-) 126.N Kawakubo, Katsuo (1942-) 431.0 Kawamata, Yujiro (1952-) 15.r 21.N 72.1, r 232.0 Kawashima, Shuichi (1953-) 41.r Kawata, Tatsuo (1911-) 374.H 395.r Kazama, Hideaki (1945-) 21.L Kazarinoff, Nicholas O. (1929-) 211 r Kazdan, Jerry L (1937-) 364.H, r Kazhdan, David A. 122.G 391.N Kazhikov, A. V 204 F Kechris, Alexander S. (1946-) 22.C, H, r KeedwelI, Anthony Donald (1928-) 241.r Keesling, James Edgar (1942-) 382.C Keim, Dieter 424.r Keisler, H. Jerome (1936-) 33.r 276.E, r 293.0, r Keldysh, Mstislav Vsevolodovich (1911-78) 336.F KelIer, Herbert Bishop (1925-) 303.r KelIer, Joseph Bishop (1923-) 274.C, r KelIey, O. G. 212.A KelIey, John Ernst, Jr (1937-) 376 r KelIey, John Leroy (1916-) 37.M 87.r 381.r 424.r 425 r 435.r 436.r KelIogg, Oliver Dimon (1878-1932) 120.B, 0, r 1530 193.r 286 0 Kelly, Anthony 92.r Kelvin, Lord (Thomson, WilIiam) (1824-1907) 19.B 39.G 120.A 193.B App A, Table 19.1V Kemeny, John George (1926-) 260J Kempe, Alfred Bray I 57.A, D Kempf, George R. (1944-) 9.r 16.r Kempisty, Stefan (1892- I 940) 100 A, r Kempthorne, Oscar (1919-) 102.r KendalI,OavidG (1918-) 44A218.r260.H,J KendalI, Maurice George (1907-83) 102.r 280.r 346.r 37J.K 397 r 4oo.r Kenmotsu Katsuei (1942-) 275.F Kennedy, P. B. 193.r Kepler, Johannes (1571-1630) 20 78 0 126.A 265 271 B 309 B 432.C Name Index Knapowski, Stanislaw Kerekjart6, Szerkeszti Bela (1898-1946) 207.C 410.r Kerner, Immo O. 30l.F Kerr, Roy Patrick (1934-) 359 E Kervaire, Michel Andre (1927-) 65.C 1I4.A, B, I-K 235.G Kerzman, Norberto Luis Maria (1943-) 164.K Kesten, Harry (1931-) 5.G 44.r 340.r Khachiyan, L. G. 7J.0 255.C Khalkin, Semen Emanuilovich 290.r 318.r Khas'minskii, Rafail Zalmanovich (1931-) 115.0 Khatri, Chinubhai Ghelabhai (1931-) 280.r Khavinson, Semeon Yakovlevich (1927-) 77.E Khayyam, Omar (c. 1040-c. I I 23(24?)) 26 Khenkin (Henkin), Gennadil Markovich (1942-) 164.K 344 F Khinchin (Hincin), Aleksandr Yakovlevich (1894- 1959) 4.A 45.r 83.r 100 A 115.0 213. F 250.C 307.C 332.r 341.G 342.0 395.B 402.r KhovanskiI, Aieksel Nikolaevich (1916-) 83.r Kiefer, Jack Carl (1924-81) 3990 Kikuchi, Fumio (1945-) 304.r Kikuti, Oairoku (1855-1917) 230267 KilIing, Wilhelm Karl Joseph (1847-1923) 50 248.B 279.C 364.F Kim Wan Hee (1926-) 282.r Kimura, Motoo (1924-) 115.0263.E Kimura, Tatsuo (1947-) 450.V Kimura, Tosihusa (1929-) 30.r 288.B-0 289.r Kinderlehrer, David S. (1941 -) 105.r Kingman, John Frank Charles (1939-) 136.B, r Kinnersley, William 205.F Kinney, John R. 115.A Kino, Akiko (1934-83) 81.0 356.G Kinoshita, Shin'ichi (1925-) 235.A, C, H Kinoshita, Toichiro (1925-) 146.B Kirby, Robin CromwelI (1938-) 65.A, C 70 C 114J-L Kirchhoff, Gustav Robert (1824-87) 255.0 282.B KirilIov, Aleksandr Aleksandrovich (1936-) 437.T Kirkwood, John Gamble (1907-59) 402J Kiselev, Andrei Alekseevich 204.C Kishi, Masanori (1932-) 48.H 338.1, J, M Kishimoto, Akitaka (1947-) 36.K 402.G Kister, James Milton (1930-) 147.r Kitada, Hitoshi (1948-) 375.B Kitagawa, Tosio (1909-) STR Kizner, WilIiam 3010 Klainerman, Sergiu 286.1 Klee, Victor La Rue, Jr. (1925-) 89.r 286.0 Kleene, Stephen Cole (1909-) 22.G 31.B, C, r 81 A, r 97.r 156 r 185.r 276.r 3 I 9.r 356.A, C-H, r 411.r Kleiman, Steven L. (1942-) 9.E, r 16.E, r 450.r Klein, Felix (1849-1925) I.r 7.E II.B 32.r 53.r 83 0 90.B, r 109 119.r 122.C, r 13h, r 139.A, r 151.G 167.E 171.r 175.r 181 190Q 196.r 206.r 229r 233 234.A, 0, r 26h, r 285.A, C, r 343.F 363.r 410 B 447.r Klein, Jacob 444.r Klein, Oskar Benjamin (1895-1977) 212.B 351.G 377.C Klema, Virginia C. 298.r Klingen, Helmut P. (1927-) 32.H 450.E Klingenberg, Wilhelm P. (1924-) 109.*, rIll r 178.C, r 279.G Kloosterman, Hendrik 0 (1900-) 4.0 32.C, G Klotz, TiUa 365.H Kluvanek, Igor (1931-) 443.A, G Knapowski, Stanislaw (1931-67) 123.0 
Name Index Knapp, Anthony William Knapp, Anthony William (1941 -) 260 r 437.EE Knaster, Bronislaw (1893--1980) 79.0 Knescr, Hellmuth (1898-1973) 198 r Kneser, Julius Carl Christian Adolf (1862-1930) 107 A 314A 316 E App A, Tahle 19 III Kneser, Martin L (1928-) 13 O-Q 60 K 391 C Knopp, Konrad (1882-1957) 208 C 339.C 379 I, M,r Knopp, Marvin Isadore (1933-) 328 r Knorr, Knul R. K. (1940-) 72 r Knowles, Greg 443.G Knudsen, Finn Fayc(1942-) 16r Knus, Max-Albcrt (1942-) 29.r Knuth, Donald Ervin (1938-) 71 r 96.r 354.r Knutson, Donald 16.r Kobayashi, Osamu (1955-) 364 H Kobayashi, Shoshichi (1932-) 21.N, 0, Q, r 80.r 105.r 109 r 275 A 364 r 365 K, 0, r 412.r 413 r 417r431r Kobayashi, Zen-ichi (1906-) 3670 Koch, John A. 157 A 186 r Kochen, Simon Bcrnard (1934-) 118.F 276 E, r Kocher, Max (1924-) 32.F 231.r Kodaira, Kunihiko (1915-) 12 B, r 15.B, E, F 16.V, r 2021 N 72 F, G, I-K 109112 1,0 1470 194 r 232 0, r 366 A, r Kodama, Akio (1949-) 384.F Kodama, Laura Ketchum 164.K Kodama, Yukihiro (1929-) 117 F 273 r 382.0 425 r Koebe, Paul (1882-1945) 77 B, E 193.0, E 196 438 B, C Kogut, John Benjamin (1945-) 361 r Kohn, Joseph John (1932-) 112 H 2741 345 A Koiso, Norihito(1951-) 364.r Koizumi, Shoji (1923-) 3 N Kojima, Tetsuzo (1886-1921) 121 B 240.B 379 L Koksma, Jurjen Ferdinand (1904-) 182 r 430 C Kolbin, Vyacheslav Viktorovich (1941-) 408 r Kolchin, Ellis Robert (1916-) 13.F 113 172 A Kolesov, Yuril Serafimovich (1939-) 290 r Kolmogorov, AndreI Nikolaevich (1903-) 20 44.r 45 F 58 0 71.r 100 A 115 A, D 126.A, L 136 E 159 H 196 201.A, M, P 205 E 213 E 214.C 250 F, r 260 A, F 26t.A 341 G, I, r 342 A, 0, G 3540 371 F 374 E 407.A, r 425.Q 426 433 C Komatsu, Hikosaburo (1935-) 68.F 107.r 1120, R 162168 B 224.C, E 254 D 378 D, I Komalu. Aluo (1909-) 305 A 425.U Komatu, Yusaku (1914-) 77 E App. A, Table 14.r K6mura, Takako (1930-.) 168.B 378 F 424 S K6mura, Yukio (1931-) 162168 B 286 X 378.F 424 S, W Kondo, Kazuo (1911-) 282r Kondo, Motokiti (1906-80) 22 C, F Konheim, Alan G. 126.K 136.H Konig, Denes (1884-1944) 33 F Konig, Hcinz J (1929-.) 164 G, r Konigs, G 44 B Konishi, Yoshio (1947-) 286 X, Y Konno, Hiroshi (1940-) 264 r Kono, Norio (1940-) 176 G 437 CC Koopman, Bernard Osgood (/900-81) 420 F Koopmans, Tjalling Charles (1910 85) 128.r 255 E 376 Koosis, Paul J 164 r Koninyi, Adam (1932-) 413 r Korn, Granillo Arthur (1922--) 19 r Korn, T(h)eresa Mikhailovich 19 r Korncr, Otto Hcrman (1934-) 4 F Korobov, NikolaI MikhaIlovich (1917-) 4 E 1894 Korolyuk, Vladimir Semenovich (1925-) 250.r Kortanek, Kenneth O. (1936-) 2550, E Korteweg, Oiederik Johannes (1848-1941) 387.B Kortum, Ludwig Hermann (1836-19041 179.B Koshiba, Zen'ichiro (1926-) 304.F Kostant, Bertram (1928-) 248 Z 287 r 387.C 437 U, EE Kostrikin, Aieksel Ivanovich (1929-) 161 C Koszul, Jean Louis (1921-) 200.J 412 r 413.r Kotake, Takeshi (1932-) 1120 321 G Kotani, Shinichi (1946-) 176 F Kothe, Gottfried (1905-) 29.1 125 Y 168.B, r 424.r Kotz, Samuel (1930-) 374.r Kovalevskaya (Kowalewskaja), Sof'ya Vasil'evna (1850-91) 107.B 267 286 Z 320.1 32] A, B Kowalewski, Gerhard (1876-1950) 103 r Kowata, Atsutaka (1947-) 437 r Kra, Irwin (1937-) 122.r 234 r Krahn, E 228 B 391.0 Kraichnan, Robert H. 433.C Krakus, Bronislav 178 r Kramers, Hendrik Anlhony (1894-1952) 25.B Krasinkiewicz, J 6sef (1944-) 382.C Krasner, Marc (1912-85) 145 Krasnosel'skiI, Mark Aleksandrovich (1920-) 251.r 286.r 290.r KrasovskiI, Nikolai Nikolaevich (1924-) 163.B Kraus, Fritz (1903-1980) 212.r Krazer, Karl Adolf Joseph (1858-1926) 3.r Kree, Paul (1933-) 224 C Kreider, Donald Lester (1931-) 81 *, r KreIn, Mark Grigor'evich (1907-) 37 E 68 A, J, r 89 r 115.r 162 176 K 2511, r 310.H 390.H, r 424 0, U, V 443 H KreIn, Selim Grigor'evich (1917-) 162 224.A 378.r Kreisel, Georg (1923-) 356 H Kreiss, Heinz-Otto (1930-) 304 F 325. K Krellc, Wilhelm 292 r Krengel, Ulrich (1937-) I 36.B, F, r Krichever, I M. 387.F Krieger, Wolfgang 136 E, F, H, r 308.1 Kripke, A. 411 F Kripke, Saul 356 G, r Krogh, Fred T. (1937-) 303.r Krohn, Kcnneth 31 r Kronecker, Leopold (1823-91) 2.B 10 B 14 L, U 15.C 47 73.A, r 136 G 156.C 190 Q 192 R 201 H 236.A, r 267 269 C 3470422 K 450.B, S App. A, Table 4 Krull, Wolfgang (1899-1970) 12 B 67.D, E, J, r 172 A, I 190 L 277 I 284.A, F, G 439.L Kruskal, Joseph Bernard (1928-) 346 E, r Kruskall, Martin David (1925-) 359.F 387 B Kruskal, William Henry (1919-) 371 D Krylov, NikolaI Mitrofanovich (1879-1955) 290A Krylov, N S 402 r Krylov, NikolaI Vladimirovich (1941-) 115 r 136.H 405 r Krylov, Vladimir Ivanovich (1902-) 46.r 217 r 299.r 304.r Krzyzailski, Miroslaw 325.r Kshirsagar, Anant M (1931--) 280 r Kubilius, Jonas P. (1921-) 295 r Kublanovskaya, Vera Nikolaevna (1920-) 298.F Kubo, Izumi (1939-) 136.F, G 395 r Kubo, Ryogo (1920-) 308.H 402 K Kubota, Tadahiko (1885-1952) 89.C Kubota, TomlO (1930-) l4.U 59 H 257 H 450A, J,M Kudo, Hirokichi (1916-) 399 r 443.A 
1895 Kudryavtsev, Valerii Borisovich (1936-) 750 Kuga, Ken'ichi (1956-) 114.K Kuga, Michio (1928-) 450.M, S Kugo, Taichiro (1949-) 150G Kuhn, Harry Waldo (1874-) 108.r 173 B, r 255.r 292.A, B Kuiper, Nicolaas H. (1920-) 105 r I 14 B 126 G 18h, r 286.0 364.F 365 B, 0 Kuipers, Lauwerens 182 r 354 r Kuhkov, Leonid Yakovlevich 2.0 Kulk, W V 0 428 r Kulkarni, Ravi S. (1942-) 72.r Kullback, Solomon (1907-) 213 0, 398.G 403.C, r Kumano-go, Hitoshi (1935-82) 112.L 274 r 323 r 345 A, B Kummer, Ernst Eduard (1810-93) 14 L, N, 0, U 15.H 145 167.A I 72.F 206.A 236 267 379.1 450 J App A, Tahles 10 II, 19 I Kunen, Kenneth (1943-) 33 r Kunieda, Motoji (1879- 1954) 121.B 240 B Kunita, Hiroshi (1937-) 86r 115.0260.1 261 r 406 B, r Kunneth, Hermann (1892- 1975) 2oo.E, H 201.1 450.Q Kuntzmann, Jean (1912-) 71.r 75.r Kunugui, Kinjiro (1903-75) 22 C, F 62.B, E, r loo.A, r Kiinzi, Hans Paul (1924-) 292.F, r Kuo Yung-Huai 25.B Kupka, Ivan Adolf-Karl (1937-) 126.r Kupradze, Viktor Omitrievich (1903-85) 188 r Kuramochi, Zenjiro (1920-) 207.C, 0367 E, G Kuranishi, Masatake (1926-) 72 G 249.0, V 286 J 428 F, G, r Kurata, Masahiro (1943-) 126J K uratowski, Kazimierz (Casimir) (1896- 1980) 220, G, r 79 D, r 186 H 425 A, Q, r 426 Kuroda, Shige Toshi (1932-) 331 E 375.C Kuroda, Sigekatu (1905-72) 411.1, r Kuroda, Tadashi (1926.) 367.E Kurosh, Aleksandr Gennadievich (1908-71) 2.E, r 29J 103.r 161 A, r 190r 337 r Kurosu, Konosuke (1893- I 970) 240 B Kiirschilk, J6zsef(1864-1933) 439.L Kurth, Rudolf(1917-) 116.r Kurusima, Yosihiro ("-1757) 230 Kusaka, Makoto (1764-1839) 230 Kushner, Harold Joseph (1933--) 86 E 405 r Kushnirenko, Anatolii Georgievich 418 r Kusunoki, Yukio (1925-) 143 r 207 C, 0, r 367 G, I Kutta, Wilhelm Martin (1867-1944) 3010303 0 Kuwabara, Ruishi (1951-) 391 N Kuyk, Willem(1934-) 32r L Lacey, Howald Elton (1937-) 37.r Lachlan, Alistair H 276 F Lacroix, Sylvestre Fran90is (1765-1843) 181 Ladyzhem.kaya, Ol'ga Aleksandrovna (1922-) 204 B-O, r 286 r 323 P Lafontaine, Jacques (1944-) 364.H Lagrange, Joseph Louis (1736- I 8 I 3) 4.0 20 46.A, B 82 A 83 C, D 105 A 106.E, L 107.A, B 109 126.A, E, L 150 B I 51. B 172 A, F 190 Q 205 A 223 A 238 2520, K 266271 F 274.C 275 A 296.A 301 C 322.B 324 D 336 G 342 A 420.B, 0 428.C 442 C App A, Tables 91V, 14.1, 15.21.11 Laguerre, Edmond Nicolas (1834-86) 76 B 137 299 A 3 I 7 D 429 B App A, Tables 14.II, 26 VI Name Index Leadbetter, Malcolm Ross Lainiotis. imitri G 86 r Lakshmikantham, Vangipuram (1924-) 163 r Laksov, Dan (1940-) 9 E Lamb, Sir Horace (1849-1934) 205.r 446 r Lambert, Jack 0 303.r Lambert, Johann Heinrich (1728-77) 83.A, E 332 339.C Lambert, Robert Joe (1921-) 303.r Lame, Gabriel (1795-1870) 133B,C 145167E Lamperti, John Williams (1932-) 44.E 342.r Lance, E. Christopher (194 1-) 308 F Lanczos, Cornelius (1893-1974) 298.0, E 301.1, N 302.r Land, A H 215.0 Landau, Edmund Georg Herman (1877-1938) 4.A, r 43 J, K, r 77.F 87.G, r 106.r 107.A 121.C-E 123 B, F, r 131 r 160G 216 r 240 B 242 A, r 294 r 2950 297.r 339.r 347.r 450 I, r Landau, Lev Davidovich (1908-68) 130 r 146 A, C', r 150 r 205.r 259 r 402.r 433. B Landau, Y oan 0 86.r Landen, John (1719-90) 134.B App. A, Table 16.m Landkof, Naum Samoi1ovich (1915-) 338.r Landsberg, Georg (1865-1912) Il.r Landshoff, Peter Vincent (1937-) 146.r 386.r Lane, Jonathan Homer (1819-80) 291.F Landford, Oscar Erasmus, m (1940-) 126 K 150 C 340 B, F 402.G Lang, Serge (1927-) 3 M, r 6 r 12 B 14.r 28 r 105 r I 18 0, F, r 134 r 172.r 182 r 198.r 200 r 256.r 277.r 337 r 368 r 430.r 450 r Langevin, Paul (1872- I 946) 45.1 402 K Langhaar, Henry L. 116 r Langlands, Robert Phelan (1936-) 32.H, r 437 DO, r 450.A, G, N, 0, S, T Lapidus, Leon (19 I 0- 75) 303 r Laplace, Pierre Simon (1749-1827) 30 B 103 0 107.B 126.A 192 F 194 B 239 240 A 250 A 266 306.A 323.A 342 A, r 401.E 442 0 App A, Tables 12.1, 1811 Lappo-Oanilevskii, Ivan Aleksandrovich 253 r LaSalle, Joseph Pierre (1916-83) 86 F Lascoux, Jean 146.A, C Lashnev, Nikolai Serafimovich 425.CC Lashof, Richard Kenneth (1922-) 279.C 365 0 Lasker, Emanuel (1868-1941) 12.B Latter, Robert H. (1946-) 168.B Laufer, Henry B (1945-) 418 r Laugwitz,Oetlef(1932-) 111.r Laurent, Pierre Alphonse (1813-54) 1980 339.A '"aurent-Ouhamel, Marie Jeanne (1797- I 872) 3220 Lauricella, G 206 0 Lavine, Richard B (1938-) 375 C Lavita, James A 375.r Lavrent'ev, Mikhail Alekseevich (1900-80) 336.F 352 A, 0, E 436.1 Lavrik, Aleksandr Fedorovich (1927-) 123.E Lawler, Eugene L (1933-) 66r 281.r 376.r Lawley, Derrick Norman 280 B, G, r 346 F, r Lawson, Charles L. 302.r Lawson, Herbert Blaine, Jr. (1942-) 80.r 154.r 178 r 275.F, r 364 H 365 K Lax, Peter David (1926-) 112.1, P, S 204.r 274 r 304.F 321 G 325.H 345 A, r 375.H 387.C, r Lazard, Daniel (194 I - ) 200 K Lazard, Michel Paul (1924-) 122.F Lazarov, Connor (1938-) 154.H Leadbetter, Malcolm Ross (1931-) 395 r 
Name Index Lebesgue, Henri Leon Lebesgue, Henri Leon (1875-1941) 20.r 22.A 84.0 93 F 94 C, r 117.B, 0, r 120.A, B, 0 136.A, E 156 C 159.A-C, J 160.A 166.C 168.B 179.r 221.A-C 244 246.C 270.0, E, G, J, L, r 273.F 379.S 380 C, 0, r 388.B Lebowitz, A. 134.r Lebowitz, Joel Louis (1930-) 136.G LeCam, Lucien (Marie) (1924-) 341.r 398.r 399.K, M, N,r Ledger, Arthur Johnson (1926-) 364.r Le Dung Trang (1947-) 418.1 Lee, Benjamin W (1935-77) 132.r Lee, E. Bruce (1932-) 86.r Lee Tsung Oao (1926-) 359.C Lee, Y. W. 95.r Leech, John 151.1 Leela, S. 163.r Lefebvre, Henri (1905-) 101.r Lefschetz, Solomon (1884-1972) 3 A 12 B l5.B 16.P, U, V 79.r 93.r 126.A, r 146 r 153.B, C, r 170.r 201 A, E, 0, r 210.r 290 r 291.r 394.r 410 r 418.F, I 422 r 426 *, r 450.Q Legendre, Adrien Marie (1752-1833) 4.0 46.C 82.A 83 B 107.A 109 123.A 134 A, F 145 174.A 266296 A, B 297.H, I 342 A 393 A-C 419 C App. A, Tables 14.11, 15 IV, 16 I, IV, 18.11, III Lehman, R. Sherman (1930-) 385.r 450.1 Lehmann, Erich Leo (1917-) 371 A, C, H, r 396.r 399.C, E, H, P, r 400 B, r Lehmann, Harry Pa ul (1924- ) 150 0 Lehmer, Derrick Henry (1905-) 145 301.K 354.B Lehmer, Derrick Norman (1867-1938) 123.r NTR Lehmer, Emma (1906-) 145 Lehner, Joseph (1912-) 32.r Lehrer, G I App. B, Table 5 Lehto, Olli (1925-) 62.C 352 C Leibenzon (Leibenson), Zinovii Lazarevich (1931-) 192.Q Leibler, Richard Arthur (1914-) 398.G Leibniz, Gottfried Wilhelm, Freiherr von (1646- 1716) 2038 75.A 106.0 107.A 156.B 165 A 245265 283 293.A 332 379.C App. A, Tables 9,10.111 Leith, Cecil Eldon, Jr. (1923-) 433.C Leja, Franciszek (Fran<;ois) (1885-1979) 48.0 Lelong, Pierre (1912-) 21.r Lelong-Ferrand, Jacqueline (1918-) 364 r Lemaire, Luc R (1950-) 195.E, r Lenstra, J K. 376.r Leon, Jeffrey S 151.1 Leonardo da Vinci (1452-1519) 360 Leonardo Pisano  Fibonacci Leontief, Wassily W. (1906-) 255.E Leontovich, A M 420.G Leopoldt, Heinrich Wolfgang (1927-) 140, U 450.A, J Leray, Jean (1906-) 20112 B 125.A 146.A 148.A, E 200 J 201 J 204.B, 0, r 240.r 286 C, 0 321.G 323.0 325 I, J, r 383 J, r 426 Lerner, R. G. 414.r LeRoy, Edouard (1870- I 954) 379.S Lesley, Frank David 275.C Lettenmeyer, Fritz (1891-1953) 254.02890 314A Levelt, Antonius H. M. 428.r LeVeque, William Judson (1923-) 118.r 295 r 296 r 297.r 430 r Levi, Eugenio Elia (1883-1917) 13.Q 21 F, I, Q 112.D 188 r 248.F 274 G 282 321 G 323 B 325.H 344 A Levi, Friedrich Wilhelm (1888-1966) 2.E 122.B 1896 Levi-Civita, Tullio (1873-1941) 80.A, K 109.*, r 364 B 420.F Levin, Viktor losifovich (1909-) 198.r 211.r Levine, Jerome Paul (1937-) 114.0 n5.G Levinson, Norman (1912-75) 107.r 123.B 160.G 252.r 253.r 254.r 314.C, r 315.r 316.r :',94.r 450 Levitan, Boris Moiseevich (1914-) 112.0 287.C 315.r 375.G 387.0 Levy, Azriel (1934-) 22.F 33.F, r 356.G Levy, Paul (1886- 1971) 5.B, E, r 45.A, E, G, I, r 115 r 159.1 176.A, E, F 192.N 260J 261.A 262 A 341.E-G 342.0 406.F 407.A, B Lewin, L. 167.r Lewis, Daniel Ralph (1944-) 443.A Lewis, Donald 1. (1926-) 4.E 118.0, F Lewis, Richard M 127.G Lewy, Hans (1904-) 112.C 274.G, I 2'i'5.B 300.r 304.F 320.1 323 I 334.F LiChih(1192-1279) 57.B Li, Peter Wai-Kwong (1952-) 391.0, N Li Tien-Yien (1945-) 126 N 303.G Li Yen (1892-1963) 57.r Liao San Oao (1920-) 126J Lichnerowicz, Andre (1915-) 80.r 152 C 359 r 364.F, H, r 3910 Lichtenstein, Leon (1878-) 217.r 222.1 Lickorish, William Bernard Raymond 114.L 154.B Lie, Marius Sophus (1842-99) 13.C, F 76.B, C 105.0, Q 107.B 109.0, Q 137 139 B 183 190.Q 247 248.A, B, F, H, P, S, T, V, r 249.A-0, G, H, L, M, V, r 267 286.K 313.0 406.G 431.C, G 437.U Lieb, Elliott Hershel (1932-) 212.B, r 402.r Lieb, Ingo (1939-) 164.K Lieberman, David Ira (1941-) 16.R 2:i.G Lieberman, Gerald J. (1925-) STR Liebmann, Karl Otto Heinrich (1874-1939) 111.1 365 J Lienard, Alfred 290.C Liepmann, Hans Wolfgang (1914-) 205.r Lifshits, Evgenii Mikhai\ovich (1915-) 130.r 150.r 205.r 259.r 402.r Liggett, Thomas Milton (1944-) 162 286 X 340.r Lighthill, Michael James (1924-) 25.B, r 160.r 205.r 446.r Ligocka, Ewa (1947-) 344.0 Lill 19.B Lin, C. C. 433.r Lin Jiguan 108.B Lin Shu 63.r Lind, Douglas A. (1946-) 136.E Lindeberg, 1. W. 250.B Lindelof, Ernst Leonhard (1870-1946) 43 C, H 123 C 425.S Lindemann, Carl Louis Ferdinand von (1852- 1939) 179.A 332430 A, 0 Lindenstrauss, Joram (1936-) 37.M, N, r 168.r 443 H Lindley, Dennis Viktor (1923-) 401.r Lindow, M App. A, Table 21.r Linfoot, Edward Hubert (1905-82) 37.E Linnik, Yuril Vladimirovich (1915-72) 4.A, C, E 123.0, E 136.H 250.r 341.E 374.H Lionnet, Eugene (1805-84) 297.0 Lions, Jaques-Louis (1928-) 86.r 112.E, F 204.B 224.A, E, F, r 286.C 320.r 322.r 323.r 327.r 378.F, I, r 405.r 440.r Liouville. Joseph (1809-82) 107.A 112.1 126.L 13l.A 134 E 171 182 G 219 A 252 C 272.A 315.B 402 C 430.B Lippmann, Bernard Abram (1914-) 375.C 
1897 Lipschitz, Rudolf Otto Sigismund (1832-1903) 84.A 107.A 159 B 168.B 279.C 286.B 3 I 60406.0 Liptser, Roberg Shevilevich (1936-) 86.E 405.r Listing, Johann Benedikt (1808-82) 426 Little, C. N 235.A Little, John A 365 N Little, John Dutton Conant (1928-) 133 r Littlewood, Dudley Ernest (1903-79) App. B, Table 5.r Littlewood, John Edensor (1885- I 977) 4 C, 0 43 E, r 88.r 123.B, C 159.G 168.B 192.0, P 21 Lr 224242 B, r 3 I 7 B 339.B, C 379 S 450.B, I App. A, Table 8 Liu Chung Laung (1934-) 66.r Liu Hui (11. 260) 57.A 332 Liulevicius, Arunas L. (1934-) 202 S Livesay, George Roger (1924-) 442.L Livingood, John 328 Livshits, Mikhail Samulovich (1917-) 68.J Lobachevskii, NikolaI Ivanovich (1793- 1856) 35.A 181267285 A Loday, Jean-Louis (1946-) 237.r Loeb, Peter Albert (1937-) 2930, r Loeve, Michel (1907 - 79) 341.r 342 r Loewy, Alfred (1873-1935) 190.P Lofstrom, Jorgen (1937-) 224.r Logunov, Anatolil Alekseevich (1926-) l50.r Lohwater, Arthur John (1922-82) 62 r Lojasiewicz, Stanislaw (1926-) 16 r 58.E Lommel, Eugen Cornelius Joseph von (1837-99) 39.C App A, Table 19 IV Lomonosov, V. I. 25LL Longley-Cook, Laurence H. 214.r Longo, Giuseppe (1941-) 2J3r Lonsdale, Dame Kathleen Yardley (1903- 71) 92.r Looman, H. 198.A Looman, M. 100 A Loomis, Herschel H., Jr 75.0, r Loomis, Lynn H. (1915-) 36.r 126 r 192.r 225.r Loos, Ottman 412.r LopatinskiI, Yaroslav Borisovich (1906-81) 323.H 325 K Lopez de Medrano, Santiago 114 r Lorentz, George G. (1910-) 168.B Lorentz, Hendrik Antoon (1853-1928) 60 J 150.B 258 A 359 B 39].1 402 H Lorenz, Edward N. 126 N 433.B, r Lorenz, Max O. 397.E Lorenzen, Paul Peter Wilhelm (1915-) 243.G Loria, Gino (1862- 1954) 93.r Los, Jerzy Maria (1920-) 276.F 293 C Loschmidt, Joseph (1821 -95) 4LA Losik, Mark V ol'fovich (1935-) 105 r Lotz, Heinrich P. 310.H Louveau, Alain 22.G Lovasz, Laszlo (1948-) 186.r Love, Augustus Edward Hough (1863-1940) 27J.G Love, Clyde Elton (1882-) 107 A Low, Francis Eugene (1921-) 150.C 361 r Lowdenslager, David B (1930-) lM.G, H Lowenheim, Leopold (1878- I 940) 97.B 156.E, r 2760 Lowner, Karl (Loewner, Charles I (1893-1968) 212 r 438.B, C Lozin'kii, Sergei Mikhailovich (1914-) 3140 Lli Yinian J7 0 Lubanski, J K. 258.0 Lubin, Jonathan (1936-) 257.r Lubkin, Saul (1939-) 450.Q Lucas, William F. (1933-) 1730, E, r Name Index Malcolm, Donald G. Luce, Robert Duncan (1925-) 96.r 173 C 346.G Liiders, Gerhart Claus Friedrich (1920-) 150.0 386B Ludwig, Donald A. (1933-) 321 G 325.L, r 345.r Luenberger, David G. (1937-) 86.E 264.r Lukacs, Eugene (1906-) 341 r Lukaszewicz, Jan (1878-1956) 41 LL Luke, YudeH L 389 r App. A, Table 16 NTR Lumer, Glinter (1929-) I M.F, G Lundberg, Filip 214.C Luneburg, Rudolf Karl (1903-49) 180 A, r 325.L Lunts, G 198.r Luroth, Jakob (1844-1910) 16.J Lustzig, G. App. B, Table 5 Luther, Herbert A 304.r Lutz, Elizabeth (1914-) 118.0, E Luxemburg, Wilhelmus Anthonius Josephus (1929-) 293 r Luzin (Lusin), NikolaI Nikolaevich (1883- I 950) 22.A, C, F, G, 1, r loo.A 156.C 159.1 270.1 425.CC Lyapin, Evgenij Sergeevich (1914-) 190.r Lyapunov, Aleksandr Mikhailovich (1857- I 918) 107.A 120.A I 26.A, F 163 G 250.B 286.V 314.A 394 A, C, r 398.C 443.G Lyapunov, Aleksei Andreevich (191 1-73) 22.r Lyndon, Roger Conant (1917-) 97 r 200.M Lyons, Richard Neil (1945-) 15].1 Lyusternik, Lazar' Aronovich (1899- I 98]) 279.G 286.Q, r NTR M Maak, Wilhelm (1912-) 18.r Maass, Hans (1911-) 32 F, G, r 450.M Macaulay, Francis Sowerby (1862- I 937) 12.B 2840 Mach, Ernst (1838- 191 6) I 16 B 205.B 271.A Machin, John (1680-175]) 332 Machover, Maurice (1931 -) 356.G Mack, C. 30l.N Mackay, Alan L. 92.F MacKenzie, Robert E (1920-) 279 C Mackey, George Whitelaw (1916-) 36.G 424.M, N 437.EE Mac Lane, Saunders (1909-) 8.r 52 r 70 F, r 9Lr 103.r 200.M, r 20LG 202.T 277.r 305.A Maclaurin, Colin (J 698- I 746) 20266 379.J MacMahon, Major Percy Alexander (J 854-1929) 328 330 r MacPherson, Robert Duncan (1944-) 366.E, r MacRobert, Thomas Murray 393.r Madansky, Albert 408 r Maeda, Fumitomo(1897-1965) 162 Maeda, Fumi-Yuki (1935-) 207.0, r Maeda, Yoshiaki (1948-) 3M.G Maehara, Shoji (1927-) 41 U, r Magenes, Enrico (1923-) I 12.E 323.r Magidor, Menachem 33.r Magnus, Wilhelm (1907-) 161 B, r 389.r App. A, Table 20.lV Mahalanobis, Prasanta Chandra (J893-1972) 280.E Mahler, Kurt (1903-) I 82.r 430 B, C Mahlo, P 33.r Mainardi, Gaspare (1800- 79) 1 I I H App A, Table 41 Maitra, Ashok P. 22 E 396.r Majima, Hideyuki (1952-) 428.H Makarov, Vitalij Sergeevich (1936-) 122 G Malcolm, Donald G 376.r 
Name Index Malfatti, Gian Francesco Malfatti. Gian Francesco (1731-1807) 179 A Malgrange, Bernard (1928-) 58 C, E 68 F 112.B, C,R 125W320.H418r428.H Malliavin, Paul (1925-) I 15 0, r 192.M 406.E, r Malmquist, Johannes (1882.- I 952) 2540288 B, C, r 289 B-D 314.A Mal'tsev, Anatolii Ivanovich (1909-67) 29 F 249 S 2760 Malus, Etiennc Louis (1775-1812) 180A Mandelbaum, Richard (1946-) 114.r Mandelbrojt, Szolem (1899- I 983) 58 F 134 C, r 339.A, r Mandelbrot, Benoit B (1924-) 246 K 433.r Mandelstam, Stanley (1928-) 132 C Mane, Ricardo 126.1 Mangasarian, Olvi L. (1934-) 292 0, r Mangoldt, Hans Carl Friedrich von (1854-1925) 123.B 450 B Manin, V G 80.r Manin, Yurii Ivanovich (1937-) 16 J, V 80.r 118 E 387 r 450 J, M Mann, Henry Berthold (1905-) 4.A 371 A, C 421 r Mann, Larry N. (1934-) 364 F Manna, Zohar (1939-) 75 r Mannheim, Amedec (183 I - I 906) 111.F Manning, Anthony Kevin (1946-) 51 r 126 J, K Mansfield, Richard B (194 1-) 22 F Maranda, Jean-Marie A e- 1971) 362 K Marchand, Jean-Paul 375 r Marchenko, Vladimir Aleksandrovich (1922-) 287 C 387 0 Marchuk, Gurii Ivanovich (1925-) 304.r Marcinkiewicz, J6zef(1910-) 159 H 224 A, E 336 E Marden, Morris (1905-) 10.r Mardesic, Sibe (1927-) 382.A Margulis, Grcgorii A. (1946-) 122 G Marion, Jerry Baskerville (1929-) 271.r Markov, Andrei Andreevich (1856-1922) 5 H 440, E 126 F, J 127 E 136 B, 0, G 150.F 176.F 182.G 260 A, H, J 261 A, B 336 C 340 C 379.1 403 E 405 C 406 0 407 B Markov, Andrei Andreevich (1903-) 3 I B 161.B 356.r Markus, Lawrence J (1922) 86.r 126 A, H, L, r 291.r Markwald, Werner 81 A, r Marotto, Frederick Robert (1950-) 126.J Marsden, Jcrrold E (1942-) 126 r 183 r 271 r 286 r 316.r 364.H 420.r Marshall, Donald E 164 I Marsten, Roy Earl (1942-) 215 r Martin, Andre (193 1-) 1500 386 B, r Martin, Donald A 22 0, F, H, r 33 F, r Martin, Harold C 304 r Martin, Paul C 308 H Martin, Robert S 207 C, 0260 I Martin, William Ted (191 1-) 21 r Martineau, Andre (1930- 72) 125 W, Y, r 162 168 B 424X Martinet, Jcan IIO.E Martin-Lor, Pcr (1942-) 354 r Martio, Olli Tapani (1941-) 352 F Marty, F e- I 939) 272.H 435.E Maruyama Gisiro (1916-86) 11501360, E 250r 260 J 395.r Maruyama, Masaki (1944-) 16 Y Maruyama, Toru (1949-) 443 A Masani,PesiR (1919-) 395.r Mascheroni, Lorenzo (1750- 1800) 179 B 1898 Maschke, Heinrich (1853-1908) 362.G Maschler, Michael (1927-) 173.0 Maskit, Bernard (1935-) 122.1 234.0, r 416 r Masley, John M (1947-) 14.L Maslov, Viktor Pavlovich (1930-) 30.r 274 C, I 345.r Massau, Junius (1852- I 909) 19.B Masser, David W 134 r 430 0, r Massey, William S (1920-) 91.r 1701 201 r 202M, P410.r Masuda. Kazuo (1946-) 72.r Masuda Kyuya (1937-) 378 I, J Masuyama, Motosaburo (1912-) STR Matano, Hiroshi (1952-) 263.C 303 G, r Mather, John Norman (1942-) 51.C-E 126.J 154.E, r 286.1 418 r Mathews, George Ballard (1861 - 1922) 39.r Mathieu, Emile Leonard (1835-90) 15,1 H 268.A-D Matiyasevich, Yurii Vladimirovich 9 7 .*, r 118 A 196 Matsuda, Michihiko (1938-) 428 G Matsumoto, Hideya (1939-) 13 R IEF Matsumoto, Kazuo (1922-) 411.J Matsumoto, Kikuji (1931-) 62.B 124C, r Matsumoto, Shigenori (1947-) 126.M Matsumoto, Yukio (1944-) 65.0114 K Matsumura, Akitaka (1951-) 4\.r 204.F Matsumura, Hideyuki (1930-) 284.r Matsusaka, Teruhisa (1926-) 12 B 16.P, W, r Matsushima, Yozo(1921-83) 32.r 122F 199.r 249 r 384.r Matsushita, Shin-ichi (1922-) 338.L Matsuyama, Noboru (1916-) 310.r Mattis, Daniel Charles 402 r Mattuck, Arthur Paul (1930-) 118 E 450.P Matuda, Tizuko (1923-) 30.r 288 B, r 289 r Matumoto, Takao (1946-) 65.C 114.K Matunaga Yosisuke (1692 L 1747) 230332 Matuzaka, Kazuo (1927-) 7.r 343.r Matveev, Vladimir Borisovich 387 r Maunder, Charles Richard Francis 2'01 r Maupertuis, Pierre Louis Moreau de ( '698-(759) 180 A 441.B Maurer, Ludwig (1859-?) 249.R Maurus (c. 780-c. 856) 372 Mautner, Friedrich Ignaz (1921-) 136G 308 G 437 EE Mawhin, Jean 290.r Maxfield,JohnE.(1927-) NTR Maxwell, Albert Ernest 280 r 346 F, [ Maxwell, George (1946-) 92 r Maxwell, James Clerk (183 I -79) 51.1' 130.A 150.A 180.A 393.D 402 B, H 419 B Maxwell, William L (1934-) 376r May, J Peter (1939-) 70.r May, Kcnneth Ownsworth (1915-77) 157.r May, Robert McCredie (1936-) 126.N 263.D, r Mayer, Dieter H. 402 G, r Mayer, Karl Heinz (1936-) 43 I r Mayer, Walter 111 r 201.C, E, L Maynard, Hugh B 443.H Mazet, Edmond 109 r 115 r 391 r Mazur, Barry C. (1937-) 16 r 37.C 65.C, G 114.C 126 K 426 450 J, r Mazur, Stanislaw (1905-81) 36 E Mazurkiewicz, Stefan (1888- 1945) 22.C 93.0 426 McAndrew, Michael H. 126.K 136 H McAuley, Van A 301 E McBride, Elna Browning 177.r McCarthy, John (1927-) 31 C 
1899 McCoy, Barry Malcolm (1940-) 402 r McCoy, Nea] Henry (1905-) 368.r McCracken, Marsden 126 r McDuff, Dusa Waddington (1945 -) 308.F McGregor, James 263 E McKay, John 151 [ McKean, Henry P, Jr (1930-) 41.C 45 I, r lI5.A, r 176 F, K 260 J 261.A, r 323 M 340.F 387 E, r 391 B, C, K, r 406.r 407 B McLachlan, Norman William (1888-) 268.r McLaughlin, Jack (1923-) 151 I McMillan, Brockway (1915-) 136 E 213 D McShane, Edward James (1904-) 310.1, r Meeks, William Hamilton, III 235 E 275 C, D Mehler, Ferdinand Gustav (1835-95) App A, Tables 18 II, 19.111 Mehra, Raman K 86 r Meinardus, Gunter (1926-) 328 Meinhardt, Hans 263.D Meixner, Josef (1908-) 268.r 389 r Melin, Anders (1943-) 345 A Mel1in, Robert Hjalmar (1854-1933) 2060 220.C Melrose, Richard B (1949-) 325 M Menaechmus (375-325 B C) 187 Mendelson, Elliot (1931-) 330, r 319 r Menelaus (of Alexandria) (fl. 98'1) 7.A 187 Menger, Karl (1902-) 93.0 117 A, B, 0, r 426 Menikoff, Arthur S (1947--) 325.H Men'shov, Dmitrii Evgen'evich (1892-) 77 A 159.1 198 A, r 317 B Meray, Hugues Charles Robert (1835-1911) 267 Mercer, J 217.H Mergelyan, Sergei Nikitovich (1928-) 164.J 336 F 367G Merluzzi, P 303 r Merman, G A 420 0 Mersenne, Marin (1588-1647) 297 E App B, Table 1 Mertens, Franz Carl Jmef(1840-1927) 123 A 379.F Meschkowski, Herbert (1909-) 188.r Meshalkin, Lev Dmitrievich (1934-) 136 E Messiah, Albert M L (1921-) 351 r Messing, William (1945-) 450 Q Metivier, Michel (1931-) 443 H Meusnier, Jean Baptiste Marie Charles (1754-93) 10911 I H 275 A 334 B Meyer, Franz (1856-1934) 267 Meyer, Kenneth R. (1937-) 126 K, I, M Meyer, Paul-Andre (1934-) 261 r 262.0, r 406.r 407 B, r Meyer, Wolfgang T (1936-) 109 r 178 r 279.G, r Meyer, Yves 251 r Michael, Ernest Arthur (1925-) 425 X, Y, AA, CC Michel, Rene 178.r 304 r Michelson, Albert Abraham (1852-1931) 359 A Migdal, A A 361 C Mikami, Yoshio (1875--1950) 230 r MikhaiJov, A. V. 387 G Mikhlin, Solomon Grigor'evich (1908-) 46 r 217.r Mikusinski, Jan G. (1913-) 306.A, B Miles, G 136.E Milgram, ArthurN (1912-) 112J Milgram, R James (1939-) 65 r MiIler, Charles F, III (1941-) 97.r Miller, David Charles (1918-) 291.F Miller, K S 104 r Miller, Louis W. 376.r 389.r MilIett, Kenneth Cary (1941-) 154.H Mills, Robert L. (1927-) 80.Q, r 150G Name Index Moldestad, Johan Mil'man, David Pinkhusovich (1913-) 37.G 424 U 443 H Milne, James Stuart (1942-) 450 r Milne, William Edmund (1890-) 303 E Milne-Thomson, Louis Melville (1891- I 974) 104 r NTR Milnor, John wiIlard (1931-) 56 F, r65C, E 70C 99 r 109 r 11] r I 14.A-C, F -K, r 126.N 147 H, P 154 H 178 E 237 J, r 3650 391.C 418 0, r 426 App A, Table 5 V Mimura, Masayasu (1941-) 2630 Mimura, Yositaka (1898-1965) 434.C, r Mimura, Yukio (1904- 84) 162 Minakshisundaram, Subbaramiah (1913-68) 121.r 323.M 379 r 391.B, r Minemura, Katsuhiro (1945-) 437 r Minkowski, Hermann (1864-1909) 14 B, 0, U 89.0, E 118.C 122 E, F 182 A, C-E, r 196211 C 255.B 258 A 296.A 348 D, G, K 359 B App. A, Table 8 Minlos, Robert Adol'fovich (1931-) 258.r 341 J 424.T Minorsky. Nicholas (1885-) 163.B Minsky, Marvin C 385 r Minsky, M. L. 75 r Minty, George James (1929-) 281 r 286.C Miranda, Carlo (1912-82) 323 r Mirimanov, D. 145 Mishchenko, Evgenii Frolovich (1922-) 86 r Mishkis, AnatoliI Dmitrievich 163.B Misiurewicz, Michal (1948-) 126 K Misner, Charles William (1932-) 359.r Mitchell, Andrew Ronald 223 r Mitchell, Benjamin Evans (1920-) 52 N, r 200.1 Mitome, Michiwo (1909-76) STR Mitropol'skii, YuriI Alekseevich (1917-) 2900, F Mitsui, Takayoshi (1929-) 4.F, r 123 F 328 Mittag-Leffler, Gustav Magnus (1846-1927) 47267 272.A Mityagin, Boris Samuilovich (1937-) 424 S Miura, Robert Mitsuru (1938-) 387.B Miwa, Megumu (1934-) 118 E Miwa, Tetsuji (1949--) 112 R 253 E 387 C Miyadera, Isao (1925-) 162378 B Miyajima, Kimio (1948-) 72.G Miyajima,Shizuo(1948-) 310H Miyakawa, Tetsuro (1948-) 204.C Miyake, Katsuya(1941-) 16Z Miyakoda, Tsuyako (1947-) 301.C Miyanishi, Masayoshi (1940-) 15 H, r Miyaoka, Yoichi (1949-) 72 K, r Miyata, Takehiko (1939-83) 226 r Miyoshi, Tetsuhiko (1938-) 304 r Mizohata, Sigeru (1924-) 112 B, 0, P 274. B, G, I 320.1, r 321.F, G, r 323 M 325.G, H 345 A Mizukami, Masumi(1951-) 16r Mizumoto, Hisao (1929-) 367 I Mizutani, Akira (1946-) 304 r Mizutani, Tadayoshi(1945-) 154.G Mobius, August Ferdinand (1790- I 868) 66 C 74.E 76A 267 295C 410.B Moedomo, S. 443.H Mohr, Georg (1640-97) 179 B Moise, Edwin Evariste (19 I 8-) 65 C 70 C 79.D 93.r 139.r 41 O.r Moiseiwitsch, Benjamin Lawrence (1927-) 441 r Moishezon, Boris Gershevich (1937-) 16.E, U, W 72.r Mo1chanov, Stanislav Alekseevich 1 15.0340 r Moldestad, Johan (1946-) 356 F, r 
Name Index Moler, Cleve B. Moler, Cleve B (1939-) 298.r 302.r Monge, Gaspard (1746-1818) 107.B 109158181 255.E 266 267 278 A 324.F Monin, Andrei Sergeevich 433 r Montel, Paul Antoine Aristide (1876-1975) 272 F 4240 435.E, r Montgomery, Deane (1909-) 196 249.V, r 423 N 431.r Montgomery, Hugh L. (1944-) 14.L 123.E, r Montucla, Jean Etienne (1725-99) 187 r Mook, Dent T. 290.r Moon, Philip Burton (1907-) 130.r Moore, Calvin C. (1936-) 122 F Moore, Eliakim Hastings (1862-1932) 87.H, K, r Moore, John Colemar (1923-) 147 r 200 r 203.r Moore, John Douglas 365 J Moore, Robert Lee (1882-1974) 65.F 273 K 425.AA 426 Moran, Patrick Alfred Pierce (1917-) 218 r Morawetz, Cathleen Synge (1923-) 112 S 345 A Mordell, Louis Joel (1888- I 972) 118 A, E Morera, Giacinto (1856- 1909) 198 A Morf, Martin (1944-) 86.r Morgan, Frank 27S C Morgenstern, Oskar (1902-77) 173.A, 0 376 r Mori, Akira (1924-55) 352 B, C 367.E Mori, Hiroshi (1944-) 275.F Mori, Mitsuya (1927-) S9 H Mori, Shigefumi (1951-) 16.R, r 364 r Mori, Shin'ichi (1913-) 207.C, r Mori, Shinziro (1893-1979) 284G Moriguti, Sigeiti (1916-) 299 B 389 r NTR Morimoto, Akihiko (1927-) 11O.E 126.1 344.C Morimoto, Haruki (1930-) 399.r Morimoto, Hiroko (1941-) 224 F Morimoto, Mituo (1942-) 125.BB, DO 162 Morimune, Kimio (1946-) 128.C Morishima, Taro (1903-) 145.* Morita, Kiiti (191S-) 8 I 17 A, C, E, r 273.K 425 S, X-Z, ('C Morita, Masato (1927-) 353 r Morita, Reiko (1934-) 353.r Morita, Shigeyuki (1946-) 154 G Morita, Yasuo (1945-) 450 U Moriya, Mikao (1906-82) S9.G, H Morlet, Claude 147.Q Morley, Edward Williams (1838-1923) 359 A Morley, Michael 276.F, r Morrey, Charles Bradfield, Jr (1907-84) 46 r 78 r 1120 125.A 194 F, r 195 246.C 27S A, C, r 323 r 334 D 350.r 3S2 B Morris, Peter D. 443 H Morrow, James 72 K Morse, Harold Marston (1892-1977) 109 114 A, F 126.1 178.A 275 B 279.A-F 286 N, Q, r 418.F Morse, Philip McCord (1903-) 25.r 133.r 227 r Morton, Keith W (1930-) 304 r Moschovak,s, Yiannis Nicholas (1938-) 220, F, H, r 33.r 3S6 G, r Moser, Jurgen (Kurt) (1928-) 21 P S5.r 126.A, L, r 136.r 286.J, r 323.L 344 B 420.C, G Moser, William 0 J. (1927-) 92 r 122.r 151.r 161.r Mosher, Robert E 64.r 70.r Mosteller, (Charles) Frederick (1916-) 346.C, G Mostow, George Daniel (1923-) 13.r 32 r 122.F, G 249.r Mostowski, Andrzej (1913-75) 330, r 356 C, H Motohashi, Y oichi (1944-) 123.E Motoo, Minoru (1927-) 44.E I 15.C, 0, r 261 r Moulin, M. 37S.F 1900 Moulton, Forest Ray (1872-1952) 55.r 303.E Moussu, Robert (1941-) 154.H Moyal, Jose E. 44 r Muchnik, Al'bert Aramovich (1934-) 356.0 Mugibayashi, Nobumichi (1926-) 125.BB Muhly, Paul Scott (1944-) 164 H Muirhead, Robb John (1946-) 280.r Mukherjee, Bishwa Nath 346.r Muller, Claus Ernst Friedrich (1920-) 323.1 393.r Muller, David Eugene (1924-) 301.C Muller, Werner (1949-) 391.M Muller-Breslau, Heinrich Franz Bernhard (1851- 1925) 19.r Mullikin, Thomas Wilson (1928-) 44 r Mullis, Clifford T (1943-) 86.0 Mumford, David Bryant (1937-) 3 A, N, r 9 J, r 12.B IS E, F, r 16 R, W, Y, Z, r 32.r 72.G 226.r 4180 Munkres, James Raymond (1930-) 70.r 105.r 114.C, r Muntz, C. H 336 A Munzner, Hans-Friedrich 365 I Murakami, Shingo (1927-) 32.r 122 F 384.r App A, Table 5.1 Muramatu, Toshinobu (1933-) 168.B 224.E 2510 Murasugi, Kunio (1929-) 235.A, E, r Murata, Hiroshi (194S-) 75.r Murray, Francis Joseph (1911-) 136 F 308.F Murre, Jacob P (1929-) 16 W Murthy, M. Pavaman 237.r Muskhelishvili, Nikolai I vanovich (189 I -1976) 217.r 222.r 2S3.r Muto, Yosio (1912-) 364 F, r Mutou, Hideo (1953-) 391 E Mycielski, Jan (1932-) 22 H 33.F, r Myers, Sumner Byron (1910-55) 152.12 178.B Myrberg, Pekka Juhana (1892-1976) 367.E N Nachbin, Leopoldo (1922-) 21.r 37.M 425.BB Nagaev, Sergei Viktrovich (1932-) 250.r Nagamati, Sigeaki (1945-) 125.BB Nagami, Keio (192S-) 117.A, C, r 273 K, r 425.Y, AA, CC, r Nagano, Tadashi (1930-) 191.r 275 F 279.C 344.C 364.F 365.F, K Naganuma, Hidehisa (1941-) 450.L Nagao, Hirosi (1925-) 151.H 2oo.L 362.1 Nagasawa, Masao (1933-) 44.r Nagase, Michihiro (1944-) 251.0 Nagata, Jun-iti (1925-) 117.(' 273.K, 1 42S.r Nagata, Masayoshi (1927-) 8 12.B 13 I 15.r 16.0, T, V, AA, r 67.1, r 196226 G, r 277.r ::84.E, G 369.r 370.r Nagell, Trygve (1895-) 118.0 Nagumo, Mitio (1905-) 162 286.Z, r 316.E, r 323.0 Na'im, Linda 120.E 207.C Naimark (Neumark), Mark Aronovich 11909-78) 36.G, r 107.r 112 r 192.r 252 r 258.r 308.0 315.r 437.W, EE Naito, Hiroo (1950-) 365.F, N Nakada, Hitoshi(1951-) 136.C Nakagami, Yoshiomi (1940-) 308.r Nakagawa, Hisao (1933-) 365.L Nakai, Mitsuru (1933-) 169 r 207.C, D Nakai, Yoshikazu (1920-) 15 C, F 16.E, r Nakajima, Kazufumi (1948-) 384.r 
1901 Nakamura, Iku (1947-) 72.K Nakamura, Kenjiro (1947-79) 310,r Nakamura, Michiko (1937-) 424 X Nakamura, Tokushi (1930-) 70 F, r Nakane, Genkei (1662-1733) 230 Nakanishi, Noboru (1932-) 146.A-C Nakanishi, Shizu (1924-) 100 A, r Nakano, Hidegoro (1909-74) 162 31O.A 436 r Nakano, Shigeo (1923-) 21.L 72.H 147.0 232.r Nakano, Tadao (1926-) 132.A Nakao, Shintaro (1946-) 340.r Nakaoka, Minoru (1925-) 70.F, r 153 B 202 P 305 A Nakayama, Mikio (1947-) 173.E Nakayama, Tadasi (1912-64) 6.E 8 29.H, 159 H 67.D 172 A 200 K - N 243.G Nakazi, Takahiko (1944-) 164 G Namba, Kanji (1939-) 33 r Namba, Makoto (1943-) 9 E 72 r Nambu, Yoichiro (1921-) 132.C Namikawa, Yukihiko (1945-) 16 Z Namioka, Isaac (1928-) 31Or424r Napier, John (1550-1617) 131 D 265 432.C App A, Tables 2.11, III Narasimhan, Mudumbai S. (1932-) 112 D Narasimhan, Raghavan (1937-) 23 r 367.G Naruki, Isao (1944-) 21.P, Q 344 D Nash, John Forbes, Jr. (1928-) I73.A, C, r 204.F 286.J 323.L 327 G, r 365 B Navier, Louis Marie Henri (1785-1836) 204 B, C, F 205.C Nayfeh, Ali Hasan (1933-) 25 r 290.r Neciis, Jindfich (1929-) 304.r Nedoma, Jifi 2 I 3.F Ne'eman, Yuval (1925-) 132.D, r Nehari, Zeev (1915-78) 77.r 367G 438.B Nelson, Joseph Edward (1932-) I I S.D 150.F 176 F 293 E, r 341 r 437.S Nemytskii, Viktor Vladimirovich (1900-) 126 E, r 394.r Nernst, Hermann Walter (1864-1941) 419.A N6ron, Andre (1922-) 3. M, N, r IS.D 16.P Nersesyan, A. A. 164 J Nesbitt, Cecil James (1912-) 29.r 362 r 368.r Netto, Eugen (1846-1919) 177.r 330 r Neubiiser, Joachim E. F G. (1932-) 92.F Neugebauer, Otto (Eduard) (1899--) 24.r Neuhoff, David L. 213.E, F Neukirch, Jiirgen (1937-) 450.r Neumann, Bernhard Hermann (1909-) 16 I C 190M Neumann, Carl (Karl) Gottfried (1832-1925) 39 B 120.A 188.H 193.F 217 D 323.F App A, Tables 19.111. IV Neustadt, L. W 292.r Neuwirth, Lee P. (1933-) 235 r Nevanlinna, Frithiof(1894-1977) 272.K Nevanlinna, Rolf Herman (1895-1980) 21.N 43 r 109 124 B 164.G 198.r 272 B, D, E, K, r 367.E, I, r 429 B 438.B Neveu, Jacques (1932-) 136 C Neville, Charles William (1941-) 164 K Newcomb, Robert Wayne (1933-) 282.r Newcomb, Simon (1835-1909) 392 r Newell, Allen 385.r Newhauser, George L. 215.r Newhouse, Sheldon E (1942-) 126.J, L, M Newlander, August, Jr n.r Newman, Charles Michael (1946-) 212.r Newman, Donald J. 328 Name Index Nourein, Abdel Wahab M. Newman, Maxwell Herman Alexander (1897-1984) 65 C, F 93.r 333.r Newton, Sir Isaac (1642-1727) 20 48.B, F, H 107 A 126 A 205.C 223.C 254.D 265 271.A-C 283 299.A 301.D 336.G 337.1 338.A 418.D App A, Table 21 Newton, Roger Gerhard (1924-) 375.G, r Ney, Peter E (1930-) 44.C Neyman, Jerzy (1894-1981) 373.A, r 396.F 4oo.B, D 401 B, C, F, G, r Nicholson, John William (188J -1955) App. A, Tables 19.111, IV Nickel, Karl L. E. (1924-) 222 r 30l.G Nickerson, Helen Kelsall (1918-) 94.r 442 r Nicolaenko, Basil 41.D Nicolaus Cusanus (1401-64) 360 Nicolescu, Miron (1903-75) 193r Nicomachus (50-I 50?) 187 Nicomedes (fl. 250? B c.) 93 H Niederreiter, Harald G (1944-) 182.r 354 r Nielsen, Niels (1865-1931) 167.r 174.r Niino, Kiyoshi(1941-) 17C Niiro, Fumio (1923-) 31O.H Nijenhuis, Albert (1926-) 72.B Nikaido, Hukukane (1923-) 89 r Nikodym, Otto Martin (1878-) 270.L 323.E 380 C 443.H Nikolai, Paul John (1931-) 151.H Nikol'skii, Nikolai Kapitonovich (1940-) 251.r Nikol'skii, Sergei Mikhailovich (1905-) 168 B Nilson, Edwin Norman (1917-) 223.r Ninomiya, Nobuyuki (1924-) 338 C, D, J-M Nirenberg, Louis (1925-) 72.r I 12.D, F, H 164.K 168 B 262.B 274.1 286.Z, r 304.F 320.1 323.H, r 345.A, B 365 J Nishi, Mieo (1924-) 12 B Nishida, Goro (1943-) 202 U Nishida, Takaaki (1942-) 41 D, E, r 204 F 263.D 286.Z Nishijima, Kazuhiko (1926-) 132 A 150.r Nishikawa, Seiki (1948-) 195.r Nishimori, Toshiyuki (1947-) I 54.G, H Nishimura, Toshio (1926-) 156.E Nishina, Yoshio (1890-1951) 351.G Nishino, Toshio (1932-) 21.L, Q Nishiura, Yasumasa (1950-) 263 r Nisio, Makiko (1931-) 45.r 260.1 405.r Nitecki, Zbigniew 126.r Nitsche, Johannes C. C. (1925-) 275.C, r 334.F, r Niven, Ivan (1915-) 118.r Nobeling, Georg J 17.D 246.r Noether, Amalie Emmy (1882-1935) 812.B 16.D, X 27 D, E 29 F 150.B 277.1 284.A, D, G 368 F 450.L Noether, Max (1844-1921) 9 E, F, r II.B, r 12.B 15.B, D 16 I 366.C Nogi, Tatsuo (1941-) 304.F Nohl, Craig R. 80.r Nomizu, Katsumi (1924-) 105.r 199.r 365.H, N, r 412 r 4J3.r 417r Nordin, Clas 323.M Norguet, Fram;ois (1932-) 21.1 Norkin, Sim Borisovich (1918-) 163.r Norlund (Norlund), Niels Erik (1885-1981) 104.B, r 379.J, Q Northcott, Douglas Geoffrey 67.1 200.r 277.r 284.r Norton, Richard E. (1928-) 146.C Norton, Simon Phillips (1952-) 151.1 Noshiro, Kiyoshi (1906-76) 62.B, C, E Nourein, Abdel Wahab M 30l.F 
Name Index Novikov, Petr Sergeevich Novikov, Petr Sergeevich (1901-75) 22 D, F, H 97 *, r 161 B Novikov, Sergei Petrovich (1937-) 56.F 114.J 126.N 154 B, D 387 C, r Nozaki, Akihiro (1936-) 31.r 75.D, r Nusselt, Ernst Kraft Wilhelm (1882- 1957) 116.B Nyikos, Peter J 273 K Nyquist, Harry (1889-1976) 86.A 402 K o Obata, Morio (1926-) 364.F, G, r 391.D Oberhettinger, Fritz (191 1-) 220r 389 r App.A, Table 20.lV Ochan, Y urii Semenovich (1913-) 100 r Ochiai, Takushiro (1943-) 21.N,0 191 r 384.r Oda, Tadao (1940-) 16.Z, r 72.K Oda, Takayuki (1950-) 450 S Odqvist, Folke K G 188.r Oenopides (c 5th century Be) 187 Oga,awara, Tojir6(191O-78) 162 Ogg, Andrew P (1934-) 32 r Ogiue, Koichi (1941-) llO.E 365 L, r Ogus, Arthur E (1946-) 450 r Oguztoreli, Mehmet Namik (J 923-) 163 r 222.r Oharu, Shinnosuke (1941-) 162286.X Ohm, Georg Simon (1787-1854) 130.B 259 Ohnishi, Masao (1923-) 411 J Ohtsuka, Makoto (1922-) 62 C, r 77 120 A 143.B 193 r 207 C, r 246 A 338 C, D, M, r Ohya, Yujiro (1935-) 325 H, I 345.A Oikawa, Kotaro (1928-) 48.r 77 E, r 367 r Ojanguren, Manuel (1940-) 29.r Ojima, Izumi (1949-) 150.G Oka, Kiyoshi (1901-78) 20 21.E, H, I, K, Q 23 D 72 E 147.0383 J Oka, Yukimasa (1942-) 136 F Okabe, Yasunori (1943-) 176.F Okada, Norio (1947-) 173.E Okada, Y oshitomo (1892-1957) 379 P Okamoto, Kazuo (1948.-) 253 E Okamoto, Kiyosato (1935-) 437.AA Okamoto, Masashi (1923-) 280.r Okamoto, ShOichi (1951-) 306.A Okamura, Hiroshi (1905-48) 94 r 216.B 246 F 316D, r Okano, Hatsuo (1932-) loo.r Okonek, C. 16 r Okubo, Kenjiro (1934-) 253 C Okugawa, K6taro (1913-) 113 Okumura, Masafumi (1936-) 110 E Okuyama, Akihiro (1933-) 273 K 425 Y Oleinik, Ol'ga Arsen'evna (1925-) 112 D 323.r 325H327r Olive, David Ian (1937-) 146 r 386.C, r Olivieri, E. 402 G Olkin, Ingram (1924-) 280 r Olmsted,JohnM H.(1911-) 106r216.r Olum, Paul (1918-) 91 r 305 A, r Olver, Frank W. J 30.r O'Meara, Onorato Timothy (1928-) 348 r Omnes, Roland Lucian (1931-) 150 r Omori, Hideki (1938-) 178 E 183 286 r Omura, Jim K. (1940-) 213.E O'Nan, Michael E. 151.H, I O'Neil, Richard 224 E O'Neill, Barrett (1924-) III r 178 r 365 B, G O'Neill, Bernard V ,Jr 164.F Ono, Harumi (1932-) 301.F Ono, Katuzi (1909-) 156 E, r 1902 Ono, Takashi (1928-) 13 P Onsager, Lars (1903--76) 340 B 402.K Oono, Yosiro (1920-) 282.r Oort, Frans (1935-) 9 J Oppenheim, Alexander 220.B 242.A Ord, J. Keith 374 r Ordeshook, Peter C 173.r Ore, Oystein (1899-1968) 157 r 190 L Oresme, Nicole (c. 1320(30)-82) 372 Orey, Steven (1928-) 260.1 Orihara, Masae (1915-) 310 r Orhcz, Wladyslaw (1903-) 168 B 44:, D Ornstein, Donald S (1934-) 5 G 136 B, C, E-G, r 162213 E, F Ornstein, Leonard Salomon (1880-1941) 45.1 Ortega, James McDonogh (1932-) 301 r Orzech, Morris (1942-) 29 r Oseen, William (1879-) 205.C Oseledets, Valeril Iustinovich (1940-) 136 B Osgood, William Fogg (1864-1943) 3 r II r 21.H, r 107.A Oshima, Toshio (1948-) 274.r 437 CC, r Osikawa, Motosige (1939-) 136.F Osima, Masaru (1912-) 109 r 275.A- E, r 334.F, r 365.H 391 D Oster walder, Konrad (1942-) 150.F Ostrogradskii, Mikhail Vasil'evich (1801-62) 94.F Ostrowski, Alexander (1893-) 14 F 58.F 88.A r 106 r 121 C 205.r 216 r 272 F 301 r 339 E 388 B 439.L Oswatitsch, Klaus (1910-) 207.C Otsuki, Nobukazu (1942-) 136.r Otsuki, Tominosuke (1917-) 275.A, F 365 B Ouchi, Sunao (1945-) 378 F Ovsyannikov, Lev Vasil'evich (1919-) 286.Z Owen, Donald B STR Oxtoby, John Corning (1910-) 136.H Ozawa, Mitsuru (1923-) 17 C 367 E 438.C Ozeki, Hideki (1931-) 365.1, r p Paatero, Veikko (1903-) 198 r Pacioli, Luca (c 1445-c.1514) 360 Pade, Henri Eugene (1863-1953) 142 E Page, Annie 123.D Paige, Christopher Conway (1939-) 241.C Painleve, Paul (1863-1933) 198.G 288.A-D, r 420.C Pal, J 89.C Palais, Richard Sheldon (1931-) 80 r 105 Z, r 183.*, r 191.G 279.A, E 286.Q, r 431 r Palamodov, Viktor Pavlovich (1938-) 112.R Paley, Raymond Edward Alan Christe'pher (1907- 33) 45.r 58.r 1250, BB 159 G 160 E, G, r 168 B 192 F. r 272.K 295.E 317.B Palis, Jacob, Jr. 126.C, J, M, r Pan, Viktor Yakovlevich (1939-) 71.D Panofsky, Wolfgang Kurt German (1919-) 130 r Papakyriakopoulos, Christos Dimitrie.u (1914- 76) 65.E 235.A Papanicolaou, George C. (1943-) l]j D Papert, Seymour 385.r Pappus (of Alexandria) (n. 320) 78.K 187343 D Parasyuk (Parasiuk), Ostap Stepanovih (1921-) 146.A Paris, Jeffrey B (1944-) 33.r Parker, Ernest Tilden (\926-) 151 H 241 B Parreau, Michel (1923-) 164 K 193 G 207.C 367.E Parry, William (1934-) 136C,r 
1903 Parseval, Marc Antoinc (1755- I 836) 18 B 159.A 160C, H 192 M 197.C 220 B, C, E Parshin, Alebei Nikolaevich 118 E Parthasarathy, Kalyanapuram Rangachari 213 F 341 r 374 r Parzen, Emanuel (1929-) 421.D Pascal, Blaise (1623-62) 2075 A 78 K 155 E 181 265329 330 342 A 343 E Pascal, Ernesto (I 865-'?) 15.r Pascal, Etienne (1588- I 651) 329 Pasch, Moritz (1843 1930) 155.B Passman, Donald S. (1940-) 151.r Pasta, J 287 r Pasternack, Joel 154 H Pastur, Lconid Andreevich 340.r Patil, Ganapati P. (1934-) 374 r Patodi, Vijay Kumar (1945-76) 237 r 366.r 391 C, K, L, N, r Pauli, Wolfgang Ernst (1900-58) 150 A 258 A, D 351 G, H, r 386.B Pawley, G Stuart (1937-) 92 F Pazy, Amnon (1936-) 162 286 X 378 F Peano, Giuseppe (1858-1932) 93 D, J 106 H 107 A I 17.A 156.B 246 F, G 267 294 A. B, r 316 E 41 1 A Pcarcy, Carl Mark (1935-) 251.r Pearl, Raymond (1879-1940) 263 A Pears. Alan (1938-) I 17 r Pearson, D B 331.E 375 B Pearson, Egon Sharpe (1895- 1980) 400 B 401.B, F, r STR Pcarson, Karl (1857- I 936) 40.B 174 r 374.r 397 D 401 E 403 C STR Peelet, Jean Claude Eugene 1 16.B Pcdcrsen, Gert Kjrergard (1940-) 36.K, r 2 I 2 C 308.r Pederson, Roger N. (1930-) 438 C Pedoe, Daniel (19 I 0-) 343 r Pcctrc. Jaak (1935 -) 112 E, K I 25.F 224 A, C, E. F, r Peierls, Rudolf Ernst (1907-) 212.B Peirce, Bcnjamin (1809-80) 23 I B 368 F Peirce, Benjamin Osgood (1854-1914) App A, Table 9 r Peirce, Charles Sanders (1839-1914) 156.B 41 LA Peixoto, Mauricio Matos (192 1-) 126 A, H, I, M Pelczynski, Aleksander 37 L, r 68 M 443.D pell, John (1611-85) 118 A Pepis, Jozef (c. 1922-c 1942) 97 * peressini, Anthony L. (1934-) 3 lOr Perko, Kenneth A, Jr (1943-) 235 E Perlman, Michael David (1942-) 280 r Perron, Oskar (1880- I 975) 83 r 100 A, F 107 A 120 C 121 C 123 B 254 D 269.N 280.F 289 D 294 r 310 H 3 14.A 316 E, r 379.L 394 r Pesin, Ya B 136 G Peter, F 69 B 249 U, r 437.EE Peter, Rozsa (1905- 77) 356.B, r Petermann, Abdreas (1922-) 361 r Peterson, Elmor Lce (1938-) 264 r Pcterson, William Wesley (1924-) 63.r Petersson, Hans (1902-84) 32 B- D 328.r 450 Q Petko v, Vessclin Mihailev (1942-) 325.H petrenko, Viktor Pavlovich (1936 -) 272 K, r Petrie, Ted E (1939-) 431 D Petrov, Valentin Vladimirovich (193 I -) 250 r Petrovskii, I van Georgievich (1901-73) 107.r 112 D 196 320.r 321 E, r 323.1 324 r 325 F, G, J, r 327 H Pettis, Billy James (1913-79) 68 M 443 A, B, D, F-H Name Index Polkinghorne, John Charlton Petty, Sir William (1623-87) 40 A 401 E Petvyashvili, V. I 387.F Peyret, Roger 304 r Pfaff, Johann Friedrich (1765-1825) 103 G 105 Q 107.B 428 A, B Pfanzagl, Johann (1928-) 399.M, 0400.r Pfluger, Albert (1907-) 143.A 272.K 367.E, r Ph am, Frederic 146 A, C 386.C 418.r Phelps, Robert Ralph (1926-) 443.H Phillips, Aris (1915-1985) 154 F 279 C Phillips, E 136 r Phillips, Melba N. (1907-) 130.r Phillips, Ralph Saul (1913-) 68 M 112.P, S 162 251 r 286 r 375.H 378.B, F, r 443 A, H Ph ragmen, Lars Edvard (1863-1937) 43.C Picard, Charles Emile (1856-1941) 3 A 1 I r 12.B, r 15 B, D, r 16 P, U 20 *, r 107 A 113 124.B 206 D 232 C 237 J 253 r 272.E 288 C 289 r 316 D 323.D 367.D 388.r 418.F, r 429.B Pick, Georg 21.0 Pielou, Evelyn C 263 r Pietsch, Albrecht (1934-) 68.N, r 424 r Pill ai, K. C. Sreedharan 280 B Pincherle, Salvatore (1853-1936) 217 F 240.B Pinchuk, S. I. 344.D, F Pinsker, Mark Shlemovich (1925-) 136 E 213 E Piper, Christopher J 215. E Pitcher, Tom Stephen (1926-) 396 r Pitman, Edwin James George (1897-) 37LA, E 399 G, r 400 K Pitt, Harry Raymond (1914-) 160.G 192.r 339.r 379.r Pitt, Loren D. (1939-) 176.F Pitts, Jon T 275.G Plancherel, Michel (1885- 1967) 192.M 218 G 437.L Planck, Max Karl Ernst Ludwig (1858- I 947) 115 A 351.A 402 I 419.r Plante, Joseph F (1946-) 154.H Plateau, Joseph Antoine Ferdinand (1801-83) 109 275 C 334 A, B Platek, Richard A. 356 G Plato (427-347 BC) 187357 B Platonov, Vladimir Petrovich (1939-) l3.Q Pleijel, Ake Vilhe1m Carl (1913-) 391.B, C, r Plemelj, Josip 253 r Plis, Andrzej (1929-) 32 I F 323 J Pliss, Viktor Aleksandrovich (1932-) 126.J Plotkin, Morris 63 B Plucker, Julius (1801 -68) 9 B 12.B 90 B 137267 Pochhammer, Leo (1841 -1920) 206.C Pogorelov, Aleksei Vasil'evich (1919-) 365 J Pohlmann, Henry 450 S Poincare, Henri (1854-19 I 2) 3.A, C, D 11 B 12. B 16 E 2021 Q 25.B 30 C, r 32.B, F, r 55.r 56.B, F 65.A, C 70 A 74 G 105 A 107.A 109 1 14 J, K 117.A 120.A, D 122 C, r 126.A, C, E, G, L L, r 136 A, C 153.B 156 C 170 198 J 201 A, B, F, 0 218.C, H 2 I 9 A, r 248 J 253 D 254 D 258 A 267 279 A 285 A, D 286.W 288.B 289 C 3 I 4 A 335 344.A 383 E 420 A, C 425 G 426 450.Q Poinsot, Louis (1777 - I 859) 271 E Poisson, Simeon Denis (178 I - I 840) 5.D, F 82.B 105.M 126 E 159 C 168.B 192 C, L 193 G 198 B 260 H 266 271.F, G 323.A 324 C, D 325.D 338.A 341 D 391 J 397.F 407.D App A, Tables 15.VI, 19 III Polit, Stephen H 136 E Polkinghorne, John Charlton (1930-) 146.r 386 C, r 
Name Index Pollaczek, Felix Pollaczek, Felix (1892-1981) 145307.C Pollaczek-Geiringer, H. 298.r Polonsky, Ivan P. 223.r 299.r P6lya, George (1887-1985) 20.r 48.0, r 66.E 88.r 121.C 211.r 228.B, r 272.K 339.0 374.J 429.B Polyakov, A. M. 80.r Pomeranchuk, Isaak Yakovlevich (1913-66) 386.B Pommerenke, Christian (1933-) 48.r 77.F 169.F 438.r Poncelet, Jean-Victor (1788-1867) 179.B 181 266267 Pong, O. H. 345.A Ponstein, J. 292.0 Pontryagin, Lev Semenovich (1908-) 2.G 56.0, F, H 64A, B 86.A, F 107.r 108 r 114.H 126.A, I, r 192.K 201.A, r 202.B, U 203.0 249.r 305.A 318.r 422.C, E, r 423.r Ponzano, Giorgio Enrico (1939-) 146.A Poor, Walter Andrew (1943-) 178.r Popov, M. V. 291.E Popov, Viktor Nikolaevich (1937-) 132 C 150.G Popp, Herbert (1936-) 16.W Port, Sidney Charles (1935-) 5.G Porter, Alfred William (1863-1939) 116.r Post, Emil Leon (1897-1954) 31.B 75.0 97.r 161.B 240 0 356.A, 0, H, r Postnikov, Aleksei Georgievich (1921-) 295.E 328 *, r Postnikov, Mikhail Mikhailovich (1927-) 70 G 1480 172.r 305.A Poston, Tim 51.r Povzner, Aleksandr Yakovlevich (1915-) 375.A Powell, H B 151.r Powell, M. J. 0 (1936-) 142.r Powers, Robert T. (1941-) 36.K 212 B 308.1, r Poynting, John Henry (1852-1914) 130.A Prabhu, Narahari Umanath (1924-) 260 J Prachar, Karl (1925-) 123.0, r 450.r Prandtl, Ludwig (1875-1953) 116.B 205.B-0 222.C Prasad, Gopal (1945-) 122 G Preissmann, Alexandre 178.B Presburger, M. 156.E, r Preston, Gordon Bamford (1925-) 190 r Price, Griffith Baley (1905-) 443.A Price, J. 423.r Priestley, Maurice Bertram (1933-) 421.r Prigogine, I1ya (1917-) 95 Prikry, Karel L. (1944-) 33.F, r Pringsheim, Alfred (1850- I 941) 58 E 83 E Proclus (410(411)-485) 187 Prokhorov, Yurii Vasil'evich (1929-) 115.0 250 E, r 341.F, r 374.r Protter, Murray H. (1918-) 78 r 106.r 216 r 323.r 327.r 350.r Prufer, Heinz (1896-1934) 2.02oo.K Prugovecki, Eduard (1937-) 375 r Przymusiiiski, Teodor C. 117 E Przytycki, Feliks 126.K Ptak, Vlastimil (1925-) 424 X Ptolemy (Claudius Ptolemaeus) (c. 85-c. 165) 187 432.C Pugh, Charles C. 126.J-L, r Puiseux, Victor Alexandre (1820-83) 339.A Pukanszky, Lajos 437 K, U Puppe, Dieter (1930-) 2oo.r 202.G Puri, Madan Lai (1929-) 280.r 371.r Pustyl'nik, Evgenii Izievich (1938-) 251.r Pusz, Wieslaw 402.G Putnam, Calvin Richard (1924-) 251.K Putnam, Hilary Whitehall (1926-) 81.0, r 9h, r 1904 Pyatetskii-Shapiro, I1'ya losifovich (1929-) 32.H 122.G 125.r 159.J 384.A, C, r 437.r 450.Q, S Pythagoras (572-492 B.C.) 60.0 118 A 139.B, 0 145 155.C 181 187 Q Quenouille, Maurice Henri (1924-73) 421.0 Quillen, Daniel G. (1940-) 12.r 16.Y 191.r 2oo.K 237.A, I, J 369.F Quinn, Barbara Keyfitz 286.X Quinn, Frank S (1946-) 114.K R Raabe, Joseph Ludwig App A, Table lO.n Raanan,Joseph 173.E Rabie, M 173 r Rabinowitz, Paul H. (1939-) 286.T, W, r Rabinowitz, Phillip (1926-) 223.r 299.r 301.r Racah, Giulio (1909-65) 353.A, B, r Rademacher, Hans Adolph (1892-1969) 4.A, C, 0 297.r 317.B, C 328.*, r 357.r Radjavi, Heydar (1935-) 251.r Radkevich, E. V. 112.0 323.r Rad6, Tibor (1895-1965) 65.C 77.B 109 164.1 193.r 246.r 275 A, C, 0, r 323.E, I 334.C, r 367.A, F 41O.B Radon, Johann (1887-1956) 94.C 125.CC 218.F 270.1, L 380.C 443.H Radstrom, Hans Vilhem (1919- 70) 443 I Raghavarao, Oamaraju (1938-) 102.r Raghunathan, Madabusi Santanam (1941-) 122.G, r Ra,ffa, Howard 173.C 398 r Raikov, Omitrii Abramovich (1905-) 192.G 256.r 341.E 424 X 437.EE Rainville, Earl O. 389.r Rajchman, A. 159.J RaIl, Louis B. (1930-) 138.r 30l.r Ralston, Anthony (1930-) 142.r 223 r 303.0, r Ralston, James V. 345.A Ramachandra, Kanakanahalli (1933-) 123.E Ramamoorthi, R V 396.r Ramanathan, Kollagunta Gopalaiyer (1921-) 118.0 450.K Ramanujam, Chidambaram Padmanabham (1938- 74) 232.0, r Ramanujan, Srinivasa (1887-1920) 4.0 32.C, 0 295.D, E 328.*, r Ramis, Jean-Pierre (1943-) 68.F Ramsey, Frank Plumpton (1903-30) 156.B Ran, Ziv (1957-) 450.S Randles, Ronald Herman (1942-) 371.r 374 r Range, R. Michael (1944-) 164.K Rankin, Robert Alexander (1915-) 123 C Rankine, William John Macquorn (1820-72) 204.G 205.B Rao, Calyampudi Radhakrishna (Radhakrishna Rao, Calyampudi) (1920-) 280.r :l74.H 399.C, 0,0, r 401.r Rao, Ranga R. (1935-) 374.r Raphson, Joseph (c. 1648-c. 1715) 01 0 Rapoport, Michael 16.r Rasmussen, O. L. 30l.r Rathbone, C. R. 332.r Ratner, Marina E. 126.J 136.F Rauch, A. 17.0 Rauch, Harry Ernest( 1925- 79) 134 r 178.A, C Ray, Daniel Burrill (1928-79) 5.r 115.A 
1905 Ray-Chaudhuri, Dwijendra K. 96.r Rayleigh, Lord (Strutt, John William) (1842- 19 I 9) 46.F 68.H 228.B 298.C 304.B 318.r 331.A, D 446.r Raynaud, Michel (1938-) 3.N, r Razumikhin, B S. 163 G, I Rebbi, Claudio 80.r Reckhow, Robert A 7 I.r Ree, Rim Hak 151.1, J App. B, Table 5.II1 Reeb, Georges (1920-) 90 r I 54. A, B, D 279.D Reed, George Michael (1945-) 273.K Reed, L. J. 263.A Reed, Michael (1942-) 331.r 375.r 390.r Reed, Myril Baird (1902-) 282.r Reeh, Helmut Rudolf (1932-) 150.E Rees, David (1918-) 67.1 284.A Regge, Tullio (1931-) 132.C 146.A, C 375 r 386.C Regiomontanus (Johann Miiller) (1436-76) 360 432.C Reich, Edgar (1927-) 352.C Reid, Constance 196 r Reid, John Ker (1938-) 302 r Reid, Miles A (1948-) 16r Reidemeister, Kurt Werner Friedrich (1893-1971) 91 r 155 r 235.A, r Reif, Frederick (1927-) 402.r Reifenberg, E R 275 A, G 334.F Reilly, Robert C. 365.H Reiner, Irving (1924-) 29.r 92 r 151 r 277.r 362 r Reinhardt, Hans 59.F Reinhardt, Karl 21.B, Q Reinsch, C. (1934-) 298 r 3oo.r Rellich, Franz (1906-55) 68.C 188 D 323.G 331.A, B 351.C Remak, Robert (1888-?) 190 L 277 I Remes, E. 142.B Remmert, Reinhold (1930-) 20 21.M, Q, r 23.B-E, r 199 r Remoundos, Ge6rgios (1878- I 928) 17.A, C, r Rengel, Ewald 77.E Renyi, Alfred (1921 -70) 4 C 123 E Reseboom, J H 376.r Resnikoff, George Joseph (1915-) STR Reuleaux, Franz (1829- I 905) 89.E 111 E Revuz, Daniel Robert (1936-) 260.1 261.E Reynolds, Osborne (1842- I 9 I 2) 1 16.B 205.C 259 Rhaeticus, Georg Joachim (1514-74) 432.C Rheinboldt, Werner Carl (1927-) 30l.r Rhodes, John L. (1937-) 31.r Ribenboim, Paulo (1928-) 145 r Ribet, Kenneth A 14 L 450J, r Riccati, Jacopo Francesco (1676- I 754) 86.E 107.A 405.G App. A, Table 14 I Ricci, Curbastro Gregorio (1853-1925) 109 364.D 365 C 417.B, F App. A, Table 4.11 Ricci, Matteo (l5S2- I 610) 57 C Rice, John Richard (1934-) 299.r 336.r Richard, Jules Antoine (1862-1956) 319 B Richardson, C H. 104.r Richardson, Lewis Fry (188 I -1953) 302 C 304.E Richardson, Roger Wolcott, Jr. (1930-) 431 r Richert, Hans-Egon (1924-) 4.C 123 D, E, r Richter, Hans (1912-78) 443 A Richter, Wayne H. (1936-) 81.D. r Richtmyer, Robert Davis (1910-) 304.r Rickart, Charles Earl (19 I 3-) 36 r 23 I.r 443.A Rickman, Seppo U. (1935-) 352.F Rieffel, Marc A. (1937-) 308.H 443.H Riemann, Georg Friedrich Bernhard (1826-66) 3.1, L, r 9.C, F, I I I.B-D, r 12.B 15 D 16.V 2021 A, C, F 30 C 37 K 46.E 51.E 74.0 77 B 80.K 94 B Name Index Rolle, Michel 105 A, P, W 107.A 109 IIO.E 123.A, B 137 152.A 159.A 160.A 181 198.A, D, Q 199.A 216.A 217J 237.G 253.B, D 267 274.G 275.A 285.A 286.L 323.E 325.D 334.C 344.A 363 364.A, B, D 365.A 366.A-D 367 A, B, E 379 C, S 412.A-D, J 413.* 416426447 450.A, B, I, Q App. A, Tables 4.11, 14.11,18.1 Riemenschneider, Oswald W. (1941-) 232.r Riesz, Frigyes (F rederic)( I 880-1956) 43. D 68.A, E, r 77.B 136.B 162 164.G, I 168.B 193 S 197.A, F, r 251.0, r 260. D 31O.A, B 317.A, B 390.r 425 r Riesz, Marcel (1886-1969) 43.D 88 C 121.r 125.A I 64.G, I 224.A 338.B 379.R Riley, Robert Freed (1935-) 235.E Rim, Dock S. (1928-) 200 M Ringel, Gerhard (1919-) 157.E, r 186.r Ringrose, John Robert (1932-) 308.r Rinnooy Kan, Alexander H. G. (1949-) 376.r Rinow, Willi (1907-79) 178.A Riordan, John (1903-) 66 r 330.r Riquier, C. 428.B, r Rishel, Raymond W. 405.r Rissanen, Jorma (1932-) 86.D Ritt, Joseph Fels (1893-1951) 113.*, r 428.r Ritter, Klaus (1936-) 292.r Ritz, Walter (1878-1909) 46.F 303 I 304.B Riviere, Nestor Marcelo (1940-78) 224.E Roache, Patrick John (1938-) 3oo.r Robbin, Joel W. (1941-) 126.G, r 183 Robbins, Herbert (Ellis) (1915-) 250.r 399.D Roberts, Joel L. (1940-) 16.1 Roberts, John Elias (1939-) 150.E Roberts, John Henderson (1906-) 117.C Roberts, Richard A. (1935-) 86.D Robertson, Alex P. 424.r Robertson, Howard Percy (1903-61) 359 E Robertson, Wendy J. 424.X, r Robin, Gustave (1855-97) 48 B 323.F Robinson, Abraham (1918-74) 118.D 276.D, E, r 293.A, D, r Robinson, Derek William (1935-) 36.K, r 308.r 402.G, r Robinson, G. 301.r Robinson, Julia Bowman (1919-85) 97.*, r Robinson, R. Clark 77.F 126.H, J, L, r Robinson, Raphael Mitchel (191 1-) 356.B Roch, Gustave (1839-66) 9.C, F 1 I.D I 5.D 237.G 366.A-D Roche, Edouard Albert (1820-83) App. A, Table 9.IV Rockafellar, R. Tyrrell (1935-) 89.r 292.D Rodin, Burton (1933-) 367.1, r RodoskiI, Kirill Andreevich (1913-) 123.E Rodrigues, Olinde (1794- I 851) 393.B Roepstorff, Gert (1937-) 402.G Rogers, Claude Ambrose (1920-) 22.r 182.D 443.D Rogers, Hartley, Jr. (1926-) 22.r 81.D, r 97.r 356.r Rogers, William H. 371.r Roggenkamp, Klaus W. (1940-) 362.r Rogosinski, Werner Wolfgang (1894-1964) 159.H, r 242 A Rohrl, Helmut (1927-) 196253.D Roitman, A. A 16.R, r Rokhlin, Vladimir Abramovich (1919-84) 56.H 114 H, K 136.E, H, r 213.r Rolfsen, Dale Preston Odin (1942-) 235.r Rolle, Michel (i652-1719) 106.E 
Name Index Romanov, Vladimir Gabrilovich Romanov, Vladimir Gabrilovich 218.H Romberg, W 299 C Roquette, Peter Jaques (1927-) 118.D Rose, Milton Edward (1925-) 353 r Rosen, Judah Ben (1922-) 292.E Rosenberg, Alex (1926-) 29 r Rosenberg, Ivo G (I939-) 75 D Rosenberg, Jonathan M (1951-) 437 r Rosenblatt, Murray (I 926-) 395.r 421.r Rosenbloom, Paul Charles (1920-) 255.0, E Rosenblueth, Arturo (1900-70) 95 r Rosenblum, Marvin (1926-) 331 E 421 r Rosenhain, Johann Georg (1816-87) 3 A Rosenhead, Louis (1906-84) 205 r Rosenlicht, Maxwell (1924-) 9.F 13.B, r Rosenthal, Peter( I 941-) 251 r Roshko, Anatol (1923-) 205 r Ross, George G 438 C Ross, Kenneth A (1936-) 192r Ross, Ronald (1857-1932) 263 A Rosser, John Barkley (1907.) 33 r 145 156 E, r 185r356D Rossetti, C. 132.r Rossi, Hugo 21 r 23.r 164 C, G 344 C 384 r Rota, Gian-Carlo (1932-) 66 r 203 r Roth, Klaus Friedrich (1925-) 118 0 182 G Roth, Leonard (1904-68) 12.r 16.r Rothstein, Wolfgang (1910-75) 21 M Rotman, Joseph J. (1934- ) 2.E Rouche, Eugene (1832-1910) 10 E 99 0 198.F Rouche, Nicolas 290 r Rouet, A 150.G Roumieu, Charles 125.A, U Rourke, Colin Patrik (1943-) 65 r 147 Q, r Roussarie, Robert 154.G, H Roussas, George Gregory (1933-) 399 N, r Roy, K K 396 r ROy,Prab,r(1937-) 117C Roy, Samarendra Nath 280 B Royden, Halsey Lawrence (1928-) 210 36.M 164 K 166.r 207 C, 0, r 221 r 270.r 367 E, I 380r416 Rozanov, Yuril Anatol'evich (1934-) 176 r 395.r Rozenfel'd (Rosenfel'd) B I 154 G Rozhdestvenskii, Boris Leonidvich (1928-) 204 r Rubel, Lee A. (1928-) 164 J Ruckert, Walter 23 B Rudakov, t\leksei Nikolaevich 15.r Rudin, Mary Ellen (1924-) 425 Y Rudin, Walter(I92I-) 20.r 36.r 84.r 87.r 106.r 164 I, K. r 166r 192.Q, r 198.r 216.r 221 r 270.r 367 380.r 422 r Rudolph, Daniel Jay (1949-) 136 E, F Rudvalis, Arunas (1945-) 15 \.I Ruelle, David Pierre (1935-) 126.A, J, K, M, N, r I 36.C, G, H, r 150.D 154.H 340.B 402.G, r 433.B, r Ruffini, Paolo (1765-1822) 172 A 190 Q Ruh, Ernst A (1936-) 178 r Rund, Hanno (1925-) 152 C, r Runge, Carl David Tolme (1856-1927) 19 r 118.0 223 A 301 0303 0416 F Running, Theodore Rudolph (1866-) 19.r Ruskai, Mary Beth (1944-) 212 B Russell, Bertrand Arthur William (1872-1970) 156A, B, r 319.B, r 411 A, K, r Rutman, Moisel Aronovich (1917-) 89 r 310 H Ryan, Patric J 365.r Ryll-Nardzewski, Czeslaw (1926-) 22 E Ryser, Herbert John (1923-85) 66.r 1906 S Saaty, Thomas L. (1926-) 157.r 22.r 260 H 291 r Sabatier, Pierre Celestin (1935-) 375 r Saburi, Yutaka (1948-) 125 BB Saccheri, Girolamo (1667-1733) 285 A Sacker, R S 126 M Sacks, Gerald Enoch (1933-) 22 F 3:i r 63 B 97 r 276.r 356 r Sacks, Jerome (1931-) 195E,r275.D Sacksteder, Richard (1928-) 136.G 154.H 365.E Sadanaga, Ryoichi (1920-) 92.F Sadler, 0 H. 291.F Sagher, Y oram 224 E Sah, Chih-Han (1934-) 151 r Sahni, SartaJ K. 71.r Sainouchi, Yoshikazu (1926-) 367.1 Saint-Beuve, Marie-France 22.r Saint-Donat, Bernard 9 r 16.r Saint-Raymond, Jean 22.F Saito, Hiroshi (1947-) 450.G, r Saito, Kyoji (1944-) 4180, r App A, Table 5 r Saito, Masahiko (1931-) 122.F Saito, Tosiya (1920-) 126.E 289.E Saito, Yoshimi (1939-) 375 C Sakai, Akira (1932-) 164.K Sakai, Fumio (1948-) 2\.N Sakai, Makoto (1943-) 367 E, r Sakai, Shoichiro (1928-) 36.K, r 308 C, E, F, L Sakai, Takashi (1941-) 178.C 39 \.B 413.r Sakamoto, Heihachi (1914-) 374 H Sakamoto, Kunio (1948-) 365 N, r Sakamoto, Reiko (1939-) 323.M 325 K, r 327.H Sakane, Yusuke (1946-) 365 L Sakata, Sh6ichi (1911- 70) 132 0, r Saks, Stanislaw (1897-1942) 84.r 94 - 1oo.r 198.r 22 \.r 246 r 270.r 380.r Salam, Abdus (1926-) 132.0 Salem, Raphael (1898-1963) 159.r 192.r Salmon, George (1819-1904) 78 r 350.r Salomaa, Arto Kustaa (1934-) 75.r Salzmann, Helmut Reinhard (1930-) 58.r Sambutsky, S. 353.r Samelson, Hans (1916-) 413 r 427 E Sampson, Joseph Harold (1925-) 183195.E Samuel, Pierre (1921-) 12 B 67.r 284 G, r 370.r 439.r Sanderson, Brian Joseph (1939-) 65.r 147.Q, r Sannami, Atsuro (1955-) 126.1 Sanov, Ivan Nikolaevich (1919-) 16 C 403 r Sansone, Giovanni (1888- 1979) 290.r 317.r Santal6, Luis Antonio (1911-) 218.C, E, H, r 228 A Sapiro  Shapiro Saprono Yu I 286r Sarason, Donald Erik (1933-) 164 K, r Sard, Arthur (1909-80) 105 J 208 B 286 P Sargsyan (Sargsjan), L S. 315 r Sarhan, Ahmed E 374.r Saribekovich, Sargsyan Iskhan (1931-) 315.r Sa rio, Leo Reino (1916-) 48.r 77 E, r 124.C, r 169.r 207.r 367 E, G, r Sarton, George Alfred Leon (1884-1956) 26.r 209.r 372 r Sasaki, Shigeo (1912-) IIO.E 275 C ::65.J Sasieni, Maurice W. 307 r Sataev, E A 136 F, r Satake, Ichiro (1927-) 13.r 16Z 21 Q 32.F 59 H 122 r 248.U, r 384 r 437.AA Sato, Atsushi (1954-) 154H Sato, Fumlhiro (1949-) 450 V 
1907 Sato, Ken-iti (1934-) 115 C, o 263.r Sato, Mikio (1928-) 20 I 12.D 125.A, V, W, BB, EE 146 A, C 162.*, r 274 I 345.B 386.C 387.C 41 8.H 4S0.A, M, Q, S, V Sato, Tokui (1906-83) 217 r 288 B Sattinger, David H 126 M 286.r Savage, I Richard (1925-) 371 A, C, r Savage, John E 71.r Savage, Leonard Jimmie (1917-71) 342 G 399 F, r 401 B, F Sawada, Ken (l9S3-) 126.J Sawashima, Ikuko (1929-) 31O.H Saxer, Walter (1896-1974) 214r Sazonov, Vyacheslav Vasil'evich (1935-) 341.1 Scarf, Herbcrt Ely (1930 - ) 173 E 227 r Scegoi'kov  Shchegoi'kov Schaaf, Manfred 258.r Schade, J P 95.r Schaefer, Helmut (1925-) 217.r 31OA, H Schaeffer, Albert Charles (1907-) 438.B, C Schafheitlin, Paul 11861-) App A, Table 19.m Schilfke, Friedrich Wilhelm (1922-) 268.r 389.r Schaible, Siegfried 264.r Schapira, Pierre M (1943-) 1120125 Y 162 Schark, I J 164 I Schatten, Robert (191 1-1977) 68 I Schauder, Juliusz Pawel (1899-1943) 37.L 68.E IS3 0 286.0 323 C, 0, r 325.C, r Schechter, Martin (1930-) I 12, F, H 189 B 320.r 323.H Scheffe, Henry (1907-77) 102.r 346.C 399 C, r Scheffers, Georg (1866-1945) 247 r Scheifele, Gerhard 55.r Scheinberg, Stephen 164 K Scheja, Gunter (1932-) 21.M, r Scherk, Heinrich Ferdinand 275 A Scherk, John (1947-) 132 r Scherk, Peter (1910-85) 4.A Schetzen, Martin 95 r Schickard, Wilhelm (1592 1635) 75.A Schiffer, Menahem Max (191 1-) 77 E, r 188 r 367.r 438 B, C Schiffman, M 275 B Schilling, Otto Franz Georg (19 11- 73) 257 r 439.r Schliifli, Ludwig (1814-95) 105.A 248 S 393 B App A, Tables 19 III, IV Schlaifer, Robert 398 r Schlesinger, Ludwig (1864- I 933) 253 E, r Schlessinger, Michael 16 r Schlichting, Hermann (1907-) 205.r 433.A Schlieder, Siegfried (1918-) 150 E Schlomilch, Otto 39 0 App A, Tables 9 IV, 10 II, 19 III Schmeidler, David 1730 Schmetterer, Leopold (1919-) 399 N Schmid, Hermann Ludwig (1908-56) 59 H Schmid, Wilfried (1943-) 16 r 437 W Schmidt, Erhald (1876-1959) 68 C, I 139 G 217.H, I 286.V 302 E 31 7.A 445 Schmidt, Friedrich Karl (1901 -77) 12.B 59.G 450P Schmidt, 0 Y  Shmidt Schmidt, Robert (1898-1964) 208.C 379.M Schmidt, Wolfgang M (1933-) 83.r I I 8.B, 0, r 182 G, r 354 r 430.C 437 W Schnee, Walter 379 M Schneider, Michael (1942-) 16.r Schneider, Theodor (\ 9 11-) 182.r 196 430.A, B, r Schober, Glenn E (1938-) 438.r Schoen, Richard M (l9S0-) 275 D, F 364 r Name Index Seidenberg, Abraham Schoenberg, IsaacJacob (1903-) 178 A Schoenfeld, Lov.ell (1920-) 328 Schoenl1ies, Arthur Moritz (1853-1928) 47.r 65.G 92 E, F 930, K 122.H 381 r App. B, Table S.lV Scholtz, Arnold 59 F Schonfinkel, M 97 * Schonhage, Arnold 298.r Schopf, Andreas 200 I Schottky, Friedrich Hermann (l8S1-1935) 9.J 43 J 234.B 367.C Schouten, Jan Arnoldus (1883-1971) 109.*, r 137 417 r 428.r 434 C Schrader, Robert (1939-) 150.F Schreier, Otto (1901-29) 7.r 28 151.A, I 161 A 172.F 190 G, N 2oo.M 256.r 343 r 3S0.r Schroder, A. 156.B Schroder, Friedrich Wilhelm Karl Ernst (1841-1902) 44.B 388.041 I.A Schrodinger, Erwin (1887-1961) 331 A, 0 340.E 351.C, o 434.C Schubauer, G. B 433.A Schubert, Hermann (1848-1911) 56.E 201 r Schubert, Horst (19 I 9-) 235.A Schur, Friedrich Heinrich (1856- I 932) 364 0 Schur, Issai (1875-1941) 29.E 43.J 122 C, E, F, H, r 151.E 226 r 277.H 295.E 368 G 379.L 4370, EE App B, Table 5 r Schutte, Kurt (1909-) 9h 156.E, r Schuur, Jerry Dee (1936-) 290 r Schwank, Friedrich (1900-) 217.r Schwartz, Arthur J. (1932-) 126.1 Schwartz, Jacob Theodore (1930-) 37 r 68 M 112.1,0 136 B, r 162.r 168 r 240.r 251 r 279.r 286 r 308.F, r 310 r 315 r 331.r 378.r 390 r 443.A, G, r Schwartz, Laurent (19 I 5-) 20.*, r 68 r 94.r 112.0, r 125 A, B, L, r 160 r 162.*, r 168.r 189 r 192.M 240 r 262.r 270.1 306 A 322.r 424 R, S, X, r Schwartz, Richard 280 r Schwarz, Hermann Amandus (1843-1921) 11.0 43 B 77 0 106 H 109 198.G 211 C 246 B 275.B, F 334.C App A, Tables 8, 9.11I, 13 III Schwarzenberger, Rolph Ludwig Edward (1936-) 92.r Schwarzschild, Karl (1873-1916) 359.E Schweber, Silvan Samuel (1928-) 150 r Schweitzer, Paul Alexander (1937-) 126.N 1540 Schwerdt, Hans 19 r Schwinger, Julian Seymour (1918-) 132 C 146.A 150.A, F 308 H 361.A 375.C Scidmore, Allan K. (1927 -) 96.r Scipione del Ferro (1465- I S26) 360 Scott, Dana S. 33 E, r Scott, William Raymond (1919-) 151.r Searle, Shayle R (1928-) 403 r Sebastiilo e Silva, Jose (1914-) 125 BB Secrest, Don H (1932-) 299 r Sedov, Leonid Ivanovich (1907-) 116 r Seebach, L 425 r Seeley, Robert Thomas (1932-) 274.1 323 K Seelig, Carl (1894-1962) 129 r Segal, Graeme Bryce (1941-) 105 r 237 J 366.r Segal, Irving Ezra (1918-) 3080 351.K Segal, Jack (1934-) 382.A, C Segre, Beniamino (1903- 77) 366.r Segre, Corrado (1863-1924) 11.B Seibert, Peter 126.0 Seidel, Philipp Ludwig von (\821-96) 302.C Seidel, Wladimir P (1906-81) 62 C, 0 Seidenberg, Abraham (1916-) 9.r 343 r 
Name Index Seifert, Herbert Seifert, Herbert (1907-) 65 r 91 r 99.r 126 N 1540 170.r 201 r 235 A, C, r 41O.r Seinfeld, John Hersh (1942-) 303 r Seitz, Gary M. 15 I.J Seki, Takakazu (Kowa) (c 1642 (1639?)-1708) 230332 Sekiguchi, Jiro (1951-) 437 CC Selberg, Aile (1917-) 4A 32.H, r 122.F, G 123.B, 0, E 412 K 437.X, CC, DO 450.A, I, K, T, r Selberg, Henrik Ludvig (1906-) 17.A, C, 0, r 48.E 124.B 338 H Selfridge, R G NTR Sell, George R (1937-) 126.M Selmer, Ernst Sejersted (1920-) 118 C Selten, Reinhard 173.B Semple, John Greenlees (1904-86) 12.r Sen, Pranab Kumar (1937-) 280.r 37l.r Senior, James Kuhn 151 r Seregin, L. V. 1 15.r Sergner, J A. 19.B Serre, Jean- Pierre (1926-) 3. N, r 9.r 12.B 13 r 15 E 16.C, E, T, r 20 21.L, Q 29.r 32.0 52.N 59.H, r 64.B, r 70.r 72 E, K, r 122.F 147 K, 0 148.A 172 r 2oo.K, M, r 202 N, U, r 237 J 248.r 249.r 257 r 284 G 362 r 366.0 369.F, r 426 428.G 450.G, J, R, r Serret, Joseph Alfred (1819-85) 111.0238.r App. A, Table 4 I Serrin, James Burton (1926-) 275.A, 0 323.0, E Servais, C 297 0 Seshadri, Conjeeveram Srirangachari (1932-) 16.Y, r Seshu, Sundaram (1926-) 282 r Sevast'yanov, Boris Aleksandrovich (1923-) 44.r Severi, Francesco (1879-1961) 9 F, r 11 B 12.B 15 B, 0, F 16.P 232.C Sewell, Geoffrey Leon (1927-) 402.G Sewell, Walter Ed win (1904-) 336.H Sgarro, Andrea (1947-) 213 r Shabat, Aleksei Borisovich 387.F Shafarevich, Igor' Rostislavovich (1923-) 14.r 15.r 16.r 59.F, H 118.0, E 257.H 297.r 347 r 450.Q, S Shampine, Lawrence Fred (1939-) 303.r Shaneson, Julius L. 650 114.1, K, r Shanks, Daniel (1917-) 332.r Shanks, E. B. 109. r Shanks, William (1812-82) 332 Shannon, Claude Elwood (1916-) 31 C 136.E 213 A, 0- F 403 r Shannon, Robert E. 385.r Shapiro, Harold N. (1922-) 123.0 Shapiro, Harold S. 43 r Shapiro, Harvey L. 425 r Shapiro, Jeremy F. 215.r 264.r Shapiro, Zoya Yakovlevna 258.r 323.H Shapley, Lloyd Stowell (1923-) 173 0, E Sharkovskii, Aleksandr Nikolaevich (1936-) 126.N Sharpe, Michael J. (1941-) 262.r Shaw, B. 251.K Shaw H. 75.r Shchegol'kov (Stschegolkow), Evgenii Alekseevich (1917-) 22.r Shelah, Saharon 33.r 276 E, F, r Shelly, Maynard Wolfe 227.r Shelukhin, V V. 204.F ShenChao-Liang(1951-) 36.H Shenk, Norman A., II 112 P Shepard, Roger Newland (1929-) 346.E, r Sher, Richard B. (1939-) 3820 Sherman, Seymour (1917 -77) 2 12 A, r 1908 Shewhart, Walter Andrew (1891-1967) 401 G 404.A, B Shiba, Masakazu (1944-) 367 I Shibagaki, Wasao (1906-) 174.r App. A, Table 20.r NTR Shidlovskii, Andrei Borisovich (1915-) 430.0, r Shields, Allen Lowell (1927-) 43.G, I' 164.J Shields, Paul C. (1933-) 136.E, r 213.F Shiga, Kiyoshi (1944-) 195.r Shiga, K6ji (1930-) 72 r 147.0 Shige-eda, Shinsei (1945-) 96. r Shikata, Yoshihiro (1936-) 178.r Shilov, Georgii Evgen'evich (1917-75) 21.036 M 125.A, Q, S 160.r 162 r 164.C 384 0 424 r Shimada, Nobuo (1925-) 114.B 202.S Shimakura, Norio (1940-) 323.H, N Shimizu, Hideo (1935-) 32.H 450.L, r Shimizu, Ryoichi(1931-) 374.H Shimizu, Tatsujiro (1897-) 124.B 272.1 Shimodaira, Kazuo (1928-) 230.r Shimura, Goro (1930-) 3 M, r II.B 13.P 16. r 32 0, F, H, r 59.A 73.B, r 122.F, r 450.A, L, M, S, U, r Shintani, Hisayoshi (1933-) 303 r Shintani, Takuro (1943-80) 450.A, E, G, V, r Shioda, Tetsuji (1940-) 450.Q, S Shiohama, Katsuhiro (1940-) 178.r Shiraiwa, Kenichi (1928-) 126.J Shirkov, Omitrii Vasil'evich (1928-) 150.r 361 r Shiryaev, AI'bert Nikolaevich (1934-) 86.E 395 r 405.r Shisha, Oved (1932-) 211 r Shizuta, Yasushi (1936-) 41.0 112.P Shimidt (Schmidt), Otto Yul'evich (1891-1956) 190.L 277.1 Shmul'yan, Yu V. 37.E, G 162424.0, V Shnider, Steven David (1945-) 344.C-E Shnirel'man, Lev Genrikhovich (1905-38) 4.A 279.G 286.Q, r Shoda, Kenjiro (1902-77) 829.F Shoenfield, Joseph Robert (1927-) 22. F, H, r 97.r 156 r 185.r 411.r Shohat, James Alexander (1886-1944) 240.r 341.r Shortley, George H. 353.r Shreider, Yulii Anatol'evich (1927-) I n.r Shrikhande, S. S. 102.K 241.B STR Shub, Michael (1943-) I 26.J, K, r Shubik, Martin 173.r Shubnikov, Aleksei Vasil'evich (1887-) 92.F, r Shult, Ernest E. (1933-) 151.1 Shultz, Frederic W. (1945-) 351.L Shvarts (Schwarz, harc), Al'bert Solomonovich (1934-) 56.H 80.r 286.0 Sibuya, Yasutaka (1930-) 289.0, E 428 H, r Sidilk, Zbynek (1933-) 371.r Sidon, S. 159.1 192.T Siebenmann, Laurence Carl (1939-) 65.A, C, r 70.C 114.J, K, r Siegel, Carl Ludwig (1896-1981) 3 A, r 4.F 1 LB, r 14 E 2LQ 22.A, C, H, r 27 r 32.F, r 49.0, r 55.r 72 r 118.A, C, 0122 B, E, F, r 123.D 126.1 154.0 182.0, E, G, r 242.A 289 0 296.A 297.r 328 347.E 348 K r 384.A, E, F 412.r 420.C, F 430.A, B, 0, r 450.A, E, K, r Sierpinski, Wac1aw (1882-1969) 22.A, C, H 49.0 242.A 297.r 425.r 426 Sigmund, Karl 136.r Sikonia, W. 331.E 390.1 Sikorski, Roman (1920-83) 42.r Silov  Shilov Silver, Jack H. 33.F, r 
1909 Silverman, Leonard M. (1939-) 86.0 Silverstein, Martin Louis (1939-) 44.E 168.B Silvester II  Gerbert Silvet, S. O. 102.r Simart, Georges l1.r l2.r l5.r 418.r Simauti, Takakazli (1930-) 41 I.J Simon, Barry (1946-) l50.r 212.B, r 331.r 351.r 375.r 390.r Simon, Herbert Alexander (1916-) 385.r Simon, Leon M. (1945-) 275.C Simonis, J uriaan (1943-) 16. Y Simons, James H. (1938-) 275.A, F 364.r 365.G Simphcius (c. 6th century) 187 Simpson, Thomas (1710-61) 299.A 303.E Sims, Charles C. (1937-) 14.L 151.A, I Sinai, Yakov Grigor'evich (1935-) 126.A, J, N 136.C, E, G, r Singer, Isadore Manual (1924-) 20 68.F 80.r 91.r 109 153.C 183 191.r 303.H, r 323.K, M 366.A-C, r 390J 391.B, C, K, L, r 428.r Singh, Avadhesh Narayan 209.r Sinha, Kalya B. 375.r Sinnott, W. 450.1 Sirao, T unekiti (1924-) 45.1, r Sitnikov, Kirill Aleksandrovich (1926-) 117.0 Siu Yum-Tong (1943-) 195.r 232.C 364.r Sjolin, Per B. (1943-) 159.r Skibinsky, Morris (1925-) 396J Skitovich, Viktor Pavlovich 374.H Skolem, Albert Thoralf(1887-1963) 97.B 118.C, 0 156.E, r 276.0 293.A Skornyakov, Lev Anatol'evich (1924-) 85.r Skorokhod, Anatolii Vladimirovich (1930-) 44.r 115.0, r 250.E, r 406.0, F, r Skramstad, H. K. 433.A Slater, Lucy Joan 167.r 206.r 292.B NTR Slodowy, Peter (1948-) 418.r Slowikowski, Wojciech (1932-) 424.X Smale, Stephen (1930-) 65.C 105.Z, r 114.A, B, 0, F, r 126.A, J, K, r 136.G 183279.0, E 286.P, Q 426 Small, Charles (1943-) 29.r Smart, O. R. 153.r Smart, William Marshall (1889-) 55.r 392.r Smirnov, Modest Mikhailovich (1921-) 326.r Smirnov, Nikolai Vasil'evich (1900-66) 250.F, r 374.E STR Smirnov, Vladimir Ivanovich (1887-1974) 20.r 106.r 216.r 371.F Smirnov, Yurii Mikhailovich (1921-) 273.K Smith, Brian T. (1942-) 298.r 301.0 Smith, David Eugene (1860-1944) 187.r Smith, Gordon Dennis 304.r Smith, Guy Watson (1885-) 19.r Smith, H. L. 87.H, K, r Smith, Henry John Stephen (1826-83) 179.B Smith, 1. M. 263.r Smith, Kennan Tayler (1926-) 276.E 338.E Smith, Paul Althaus (1900-80) 235.E 431.B Smith, Paul John (1943-) 151.I Smithies, Frank (1912-) 217.r Smorodinsky, Meir (1936-) 136.E Smullyan, Raymond M. 411.r Smyth, Brian 275.F 365.H, L Smythe, Robert T. (1941-) 340.r Snapper, Ernst (1913-) 16.E 2oo.M Sneddon, Ian Naismith (1919-) 389.r Sneddon, W. J. 336.r Snell, James Laurie (1925-) 260J Snell (Snel van Roijen, Snellius), Willebrord Name Index Steffensen, John F. (1580-1626) 180.A Sobolev, Sergei L'vovich (1908-) 20 46.r 125.A 162 168.B, r 224.E 320.r 323.G 325.r Sobolevskii, Pavel Evseevich (1930-) 251.r 286.r 378.1, J Sohncke, Leonhard (1842-97) 92.F Solitar, Donald Moiseevitch (1932-) 161.r Solovay, Robert M. (1938-) 22.F, H 33.E, F, r Sommer, Friedrich (1912-) 198.r 367.r Sommerfeld, Arnold Johannes Wilhelm (1868-1951) l30.r 188.D 271.r 274.r 402.H App. A, Table 19.m Sommerville, Duncan Mclaren YOU'1g (1879-1934) 285.r Soms, A. 399.N Sonine (Sonin), Nikolai Yakovlevich (1849-1915) 317.0 App. A, Tables 19.m, 20.VI Sono Masazo (1886-1969) 8 284.G Soreau, R. (1865-?) 19.r Sotomayor, Jorge (1942-) 126.M Sova, Miroslav 378.0 Sowey, E. R. 354.r Spanier, Edwin Henry (1921-) 64.r 70.r 148.r 170.r 201.M, r 202.1, r 305.r Spiith, R. A. 274.F 314.A Spearman, Charles (1863-1945) 346.F, r 37\.K Specht, Wilhelm (1907-85) 10.r 151.r 190.r Specker, W. H. 142.C Spector, Clifford (1930-) 81.r 156.E, r 356.H, r Speer, Eugene Richard (1943-) 146.A Speiser, Andreas (1885-1970) 151.r 172J 190.r Spencer, Domina Eberle (1920-) 130.r Spencer, Donald Clayton (1912-) 12.B 15.F 72.G, r 232.r 367.r 428.E, r 438.B, C 442.r Spencer, Thomas 402.G Sperner, Emanuel (1905-80) 7.r 256.r 343.r 350.r Spindler, Heinz 16.r Spitzer, Frank Ludwig (1926-) 44.C 250.r 260.E, J 340.r Spivak, Michael O. (1940-) 114.J 191.r 365.r Sprindzhuk, Vladimir Gennadievich (1936-) 118.0 430.C Springer, GelJfge (1924-) 367.r Springer, Tonny Albert (1926-) 13.A, I, 0, P, r Srinivasan, B. App. B, Table 5 Srinivasan, T. P. 164.G Srivastava, Muni Shanker (1936-) 280.r Stallings, John Robert, Jr. (1935-) 65.A, C, E, F 235.G 426 Stampacchia, Guido (1922-78) 440.r Stanasila (Stiinaila), Octavian (1939-) 23.r Stancu-Minasian, I. M. 408.r Stanley, Harry Eugene (1941-) 402.r Stanley, Richard Peter (1944-) 16,Z Stapp, Henry Pierce (1928-) 146.C 274.0, I 386.C Stark, Harold Mead (1939-) 83.r 118.0 182.G 347.E 450.E Stasheff, James Dillon (1936-) 56.r 201.r Staudt, Karl Georg Christian von (1798-1867) 267 343.C 450.1 Stavroudis, Orestes Nicholas (1923-) 180.r Stearns, Richard Edwin (1936-) 75.r Stechkin, SergeI Borisovich (1920-) 211.r 336.C Steel, J. 22.F Steele, John Hyslop (1926-) 263.0 Steen, Lynn Arthur (1941-) 425.r Steenbrink, Joseph H. M. (1947-) 9J Steenrod, Norman Earl (1910-71) 52.r 56.r 64.A, B, r 70.F, r 91.r 147.r 148.0 201.A, C, Q, R 210.r 305.A, r 426.*, r 442.r Steffensen, John F. 223.r 
Name Index Stegall, Charles Stegall, Charles 443.H Stcgun, Ircne A (1919) NTR Stcin, Charlcs M. 280.0, r 398.r 399.G, r 400.B, F Stein, Elias M. (1931-) 159.G 168.B, r 224.B. E, r 251.r 437.V, DD Stein, Karl (1913-) 20 21.H, L, M, Q 23.B, E, F 72.E 367.B, G, I Steinberg, Robert (1922-) 13.0151.1 237.1 248.z App. B, Table 5.r Steinbuch, Karl 95.r Steiner. Jakob (1796-1863) 78.K 89.C 179.A, B 181 228.B 267 Steinhal15, Hugo (1887-1972) 37.H 317.r 424J Steinitz, Ernst (1871--1928) 8149.1, r I72.A 357.r Steinmann, Othmar Viktor (1932-) 150.0 Stepanov, SergeI Aleksandrovich 450.P Stepanov, Vyacheslav Vasil'evich (1889-1950) 18.A. r 126 E, r 394.r Stephan, Frcderick F. 280J Stepin, Anatolil Mikhallovich (1940-) 136.E, G, H Stcrn. A. 297 D Sternbelg, S. H. 346.r Sternberg, Shlomo (1936-) 105.r II1.r 126.G, r 132.r 191.r 274.r 325.L 428 F, G, r 431.r Stetter, Hans J. (1930-) 303.r Stevin, Simon (1548-1620) 360 Stewart, F. M. 22.H Stewart, Gilbert W., III 298.r Stewart, Ian Nicholas (1945-) 51.r Stickelberger. Ludwig (1850. 1936) 2 B Stiefel, Eduard Ludwig (1909-78) 55 r 56.A, B, F 65.B 147.A, I, M 199.B 302.0 Stieltjes, Thomas Joannes (1856-94) 94.A-C 133.C 166.C 192 D, Q 220.D 240.A, K 270.L Stiemke, Erich 255.B, E Stigum, Bernt Petter (1931-) 44.r Stirling, James (1692-1770) 66.0 174.A 223.C App. A, Tables 17.I, 21 I Stocr, Josef (1934-) 303 F Stoll ow, Simion 207.B, C 367.r Stoka, Mariu& Ion (1934-) 218.r Stoker, James Johnston (1905-) lll.r 205.r Stokes, George Gabriel (1819-1903) 94.F 105.U 167.E 188.E 204.B, C, F 205.C, F 254.0 App. A, Table 3.m Stoll, Wilhelm Friedrich (1923-) 21.N 272.L Stolz, Otto (1842-1905) 106.G 333.B Stolzenberg, Gabriel 164.F Stone, Arthur Harold (1916-) 22.r 273.K 425 X, CC Stone, (,harle J. 5.F Stone, Harold S. (1938 -) 96.r Stonc, Marshall Harvey (1903-) 42.0 112.0 162 168.B 197.r 207.c 251.r 310.1 378.(' 390.r 425T Stong, Robcrt Evcrt (1936-) 114.r 237.H Stora, Raymond Felix (1930-) 150.G Storcr, James Edward (1927-) 282.r Stormer, Erling (1937 -) 212.B Stout, Edgar Lec (1938-) 164.r Stracke, Gu&tav (1887-) 309.r Strang, Wiliam Gilbert (1934-) 300.r 304.r Strassen, Volker (1936-) 250 E Stratila (Striitilii), Serban 308.r Stratonovich, Ruslan Leont'evich (1930-) 115.D 406.C Stratton, Julius Adam& (1901-) 130.r 133.r Strauss. Waltcr A. (1937-) 286.C 345.A Stray, Arne (1944-) 164J Streater, Raymond F. (1936-) 150.r 386.r Strcbcl, Kurt O. (1921-) 352.C 1910 Street, Anne Penfold (1932-) 241.r Stroock, Daniel Wyler (1940-) 44.E 115.C, 0, r 250.r 261.C 262.E 406.A, 0, r Stroud, Arthur H. 299.r Stroyan, Keith Duncan (1944-) 293. r Struik, irk Jan (1894-) 187.r 266.r Strutt, Maximilian Julius Otto (1903-1 133.r 268.r Struve, Friedrich George Wilhelm von (1793-1864) 39.G App. A, Table J 9.1V Stuart, Alan (1922-) 102.r 374.r 397.r 400.r Student (Gosset, William Sealy) (1876-1936) 374.B 400.G 40l.F Stueckelberg, Ernst Carl Gerlach (1905-) 361.r Sturm, Jacques Charles Fran<;ois (1803-55) 10.E 107.A 112.1 301.C 315.B Subramanyam, K. 399.0 Suetuna, Zyoiti (1898-1970) 242.B 295.0 450.E Sugawara, Masao (1902-70) 73.A Sugie, Tom (1952-) 15.H Sugimoto (Goto). Midori (1944-) 173.r Suita, Nobuyuki (1933-) 77.E Sukhatme, Balkrishna Vasudeo (1924-79) 373.r Sukhatme, Pandurang Vasudeo 373.r Sullivan, Dennis Parnell (1941-) 65.C 114J, L 154.H, r 234.E Sumihiro, Hideyasu (1941-) 16.Z Sunada, Toshikazu (1948-) 195.r 39i.C Sundman. Karl Frithiof(1873-1949) 420.C Sunouchi, Gen-ichiro (1911-) 159.G, H 310.r 336.D Sunzi (c. 3rd century) 57.A Suranyi, J<'mos (1918-) 97.B Suslin, Andrei Aleksandrovich 16.Y 200.K 369.F Suslin (Souslin), Mikhail Yakovlevich (1894-1919) 22.A-C, H, I 33.F 425.CC Sussmilch, Johann Peter (1707-67) 401.E Suzuki, Michio (1926-) 151.1, J, r 190.r App. B, Table 5.III Suzuki, Mitsuo (1928-) 173.E Svarc  Shvarts Swan, Richard G. (1933-) 200.M 23"'.r 362.r 383.r Sweedler, Moss E. (1942-) 172.A, K 203.A Swierczkowski, S. 22.H Swinnerton-Oyer, Henry Peter Francis (\927-) 118.0, E 450J, Q, S Switzer, Robert M. (1940-) 202.r Sylow, Peter Ludvig Mejdell (1832-1918) 151.B Sylvester, James Joseph (1814-97) 103.F 186.A 226.G 267 297 0 348.C 369.E, F Symanzik, Kurt (1923-83) 132.C 150.0, F 361.B, r 386.C Synge, John Lighton(l897-) 152.C 178.B,C Szab6, Arpad (1913-) 187.r Szankowski. Andrzej (1945-) 37.L Szasz, Otto (1884-1952) 121 B Szczarba, Robert H. (1932-) 114.K Szebehely, VictorG.(1921-) 420.r Szego, Gabor (1895-1985) 20.r 48.0, r 164.G 188.H 222.r 228 B, r 317.r 322.r 336.1 389.r Szego, Giorgio P. 86.r 108.r 126.r Szemeredi, End re 136.C Szmielew, Wanda (1918-76) 97.B Sz.-Nagy (Szekefalvi-Nagy), Bela (191 J-) 68.r 197.r 251 N, r 390.r Szpilrajn, Edward 117.G Szpiro, Lucien (1941-) 16. Y 118.E T Tabata, Masahisa (1947-) 304.D Tachibana, Shun-ichi (1926-) 11O.E 
1911 Tai Yung-Sheng (1944-) 16.r Ta,t, Peter Guthrie (1831-1901) 157.C 235.A Takagi, Ryoichi (1943 -) 365.1, K, L Takagi, Teiji (1875- I 960) 14.L, 0, R, U 59.A 73.A, r 196267297.1 336.A 348.M 415 450.E App. B, Tables 4.1, II Takahara, Y ositane (c. 17th century) 230 Takahashi, Hidetoshi (1915-85) 142.D 299.r Takahashi, Moto-o (1941-) 156.E 411.J, r Takahashi, Reiji (1927-) 437.BB Takahashi, Shuichi (1928-) 362.K Takahashi, Tsunero (1933-) 275.F 365.1, K, L Takahasi, Yositoki (1764-1804) 230 Takano, Kinsaku (1915-58) 213.F Takano, Kyoichi (1943-) 428.r Takasawa, Yoshimitsu (1942-) 299.8 Takasu, Satoru (1931-) 200.K Takebe, Katahiro (1664-1739) 230 332 Takeno, Hyoitiro (1910-) 434.r Takenouchi, Osamu (1925-) 437.E Takenouchi, Tanzo (1887-1945) 73.A 134.r Takens, Floris (1940-) 126.A, L, M, r 433.B, r Takesaki, Masamichi (1933-) 36.r 308.H, I, J Takeuchi, Kei (1933-) 128.C, r 346 r 371.A, H 373.r 399.K, 0400.r Takeuchi, Masaru (1932-) 365.1, L, N, 0 384.E, r Takeuchi, Mitsuhiro (1947-) 203.r Takeuti, Gaisi (1926-) 33.r 81.D I 56.E, r 356.G, r 411.J, r Takhtadzhan (Takhtajan), Leon Armenovich 387.G Tall, Franklin D. (1944-) 273.K Talman, James Davis (1931-) 389.r Talmi, Igal 353.r Tamagawa, Tsuneo (1925-) 6.F 13.P, r 59.H 118.C 122.F 348.K 450.A, H, K, L Tamano, Hisahiro(1928-69) 425.X Tamarkin, Jacob David (1888-1945) 160.E 240.r 341.r Tamura, Itiro (1926-) 114.B, F 154.B. H Tamura, Jiro (1920-) 124.C 367.F Tamura, Ryoji (1920-81) 371.C Tanabe, Hiroki (1932-) 378.1 Tanabe, Kunio (1943-) 302.r Tanaka, Chuji (1916-) 121.B, C Tanaka, Hiroshi (1932-) 41.C 261.r 340.r 406.D Tanaka, Hisao (1928-) 22.C, F Tanaka, Jun-ichi (1949-) 164.H Tanaka, Makoto (1942-) 437.r Tanaka, Minoru (1909-) 295.E Tanaka, Minoru (1949-) 279.r Tanaka, Noboru (1930-) 21.P 80.r 191.r 344.B 364.F 365.F 384.D, r Tanaka, Shigeru (1942-) 136.C Tanaka, Shunichi (1938-) 287.C Tanaka, Yosizanc(1651-1719) 230 Tandori, Karoly (1925-) 317.B Tangora, Martin Charles (1936-) 64.r Tani, Atsusi (1946-) 204.F Taniyama, Yutaka (1927-58) 3.M 73.B 450.F, S Tannaka, Tadao (1908-86) 59.D 69.D 249.U Tannery, Paul (1843-1904) 144.r 187.r Tanno, Shukichi (1937-) 110.E 364.F, G 365.L 391.C, E, N Tarski, Alfrcd (1902-83) 22.G 33.r 97.B, r 156.r 185.D, r 276.D Tartaglia, Niccolo (15007-57) 360 Tartakovskii, Vladimir Abramovich 4.E 161.B Tashiro, Yoshihiro (1926-) 364.F, G Tate, John Torrcnce (1925-) 3.C, M, N, r 6.E, F, r Name Index Thurston, William P. 28.r 59.H 118.D, E, r 200.K, N 257.r 450.F, G, L, N, P, Q, S, r Tatsuuma, Nobuhiko (1930-) 437.K Tatuzawa, Tikao (1915-) 4.F, r 123.E 450.r Tauber, Alfred (1866-1942?) 36.L 121.D 160.G 192.F 339.B 379.N Taubes, Clifford Henry (1955-) 150.r Taylor, Angus Ellis (1911-) 106.r 216.r Taylor, Brook (1685-1731) 20 21.B 58.C 106.E, J 286.F 339.A App. A, Table 9.1V Taylor, Sir Geoffrey Ingram (1886-1975) 205.E 433.A, C Taylor, Howard Milton (1937-) 260.r Taylor, James Henry (1893-?) 152.C Taylor, John Clayton (1930-) 132.r Taylor John R. 375.r Taylor, Joseph L. 36.M, r Taylor, Michael E. 345.A Taylor, Samuel James (1929-) 45.r Taylor, Thomas D. 304.r Teichmiiller, Oswald (1913-43) 9.J 43.E 77.E 352.A, C 416 438.B Teissier, Bernard (1945-) 16.Z 418.r Tcixeira, Francisco Gomcs 93.r Temam, Roger (1940-) 204.B, D 304.r Tennenbaum, Stanley (1927-) 33.F, r Teplitz, Vigdor L. 146.r Tcrada, T oshiaki (194 1-) 206. D 428.H Terano, Takao (1952-) 301.F Terasaka, Hidetaka (1904-) 235.A, C ter Haar, Dick 402.r te Riele, H. J. J. 297.D Terjanian, Guy 118.F Tcrry, Milton Everett (1916-) 346.C 371.C Thales (c. 639-c. 546 B.C.) 35.A 181 187 Theaitetus (415-369 B.c.) 187 Theil, Henri (1924-) 128.r Theodorsen, Theodore (1897-) 39.F Theodorus (ofCyrcne) (5th century B.C.) 187 Theon (of Alexandria) (11.370) 187 Theon (ofSmyrna)(l1. 130) 187 Thimm, Walter (1913-) 23.D Thirring, Waltcr Eduard (1927-) 212.B Thom, Rene F. (1923-) 12.B 51.A, B, E 56.E, F, r 70.r I 14.A, F-H 126.A, H, M 148.E 183 202.T 263.D 418.G, r 426 Thoma, Elmar Herbert (1926-) 437.E Thomas Aquinas (1225(27)- 74) 372 Thomas, J. 206.C Thomas, Lawrence E. 375.F Thomas, Paul Emery (1927-) 64.r Thomas, Richard Kenneth (1942-) 136.E Thomas, Tracy Yerkcs (1899-1984) 152.C Thomason, Steven Karl (1940-) 22.F Thompson, Colin John (1941-) 212.B Thompson. J. F. 304.E Thompson, John Griggs (1932-) 151.D, H-J Thomsen, Gerhard (1899-) 155.H 't Hooft, Gerald 132.C, D Thorin, G. O. 88.r 224.A Thorne, Kip S. 359.r Thorpe, J. A. 91.r Thrall, Robert McDowell (1914-) 29.r 173.D 368.r Threlfall, William (1888-) 65.r 91.r 99.r 170.r 201.r 235.r 410.r Thue, Axel (1863-1922) 31.B 118.D 182.G Thullen, Peter (1907-) 20 21.H, M, Q Thurston, William P. (1946-) 65.E 126.J, N I 54.A, D-H, r 234.A 235.B, E 
Name Index Thurstone, Louis Leon Thurstone, Louis Leon (l887-195S) 346.C, F Tierney, Myles 200.r Tietiiviiinen, Aimo A. (1937-) 63.r Tietze, Heinrich (1880-1964) 42S.Q Tikhonov, Andrei Nikolaevich (1906-) IS3.D 273.K 42S.Q, S, T Timmesfeld, Franz-Georg (1943-) IS1.J Timoshenko, Stephen P. (1878-1972) 271.r Tisserand, Francois Felix (1845-96) SS.r Tissot, Nicolas Auguste (l824-?) 206.C Titchmarsh, Edward Charles (1898-1963) 112.0 123.B, D, r 160.C, r 192.r 198.r 220.C 242.A, r 306.B 429.r 4S0.r Tits, Jacques Leon (1930-) 13.0, Q, R, r ISl.I, J 343.1 Toda, Hirosi (1928-) 202.P, R, U Toda,Morikazu(1917-) 287.A,r Toda, Nobushige (1938-) 17.C Todd, John Arthur (1908-) 237.F 366.B, r T odhun ter, Isaac (1820-84) 342.r Todorov, Andrei Nikolov (1948-) 232.C Todorov, Ivan T. (1933-) 146.r 150.r Toeplitz, Otto (1881-1940) 197.r 217.r 2S1.0 379.L Toki, Yukinari (1913-) 62.D 3S2.A 367.E Tollmien, Walter 433.A Tolman, Richard Chace (1881-1948) 402.r Tolstoy, I. 446.r tom Dieck, Tammo 431.E, r Tomi, Friedrich (1943-) 27S.C Tomita, Minoru (1924-) 308.H Tomiyama, Jun (1931-) 36.K 164.E Tomonaga, Sin-itiro (1906-79) 132.C 146.A IS0.A 3S9.C 36 LA Tomotika, Susumu (1903-64) 134.r Tompkins, Charles Brown (1912-) 27S.B 36S.B Tondeur, Philippe Maurice (1932-) IS4.G, H, r Tonelli, Leonida (188S-1946) 107.A 246.C Tonnelat-Baudot, Marie-Antoinette (1912-80) 434.r Toponogov, Viktor Andreevich 178.A, F Topp, L. J. 304.r Topsoe, Flemming (1938-) 22.r Torelli, R.(l884-19IS) 9.E,J II.C Torgerson, Warren S. 346.E, r Torii, Tatsuo (1934-) 301.C Totoki, Haruo (1934-) 136.D 39S.r Traub, Joe Fred (1932-) 71.r 301.r Trefftz, Erich Immanuel (1888-1937) 46.F Tremolieres, Raymond (1941-) 440.r Treves, J. Francois (1930-) 1 12.D, L 12S.r 274.1 286.Z 320.1, r 321.r 34S.A, B 424.r Tricomi, Francesco Giacomo (1897-1978) 217.N, r 288.C 317.r 326.C, r Triebel, Hans (1936-) 168.r 224.r Trigg, G. L. 414.r Tristram, Andrew G. 114.K Trjitzinsky, WaldemarJoseph (1901-73) 2S4.D 289.C, D 314.A Tromba, Anthony J. (1943-) 275.C, r 286.D Trotter, Hale Freeman (1931-) 23S.C 3Sl.F 378.E Trubowitz, Eugene B. (l9S1-) 387.E Trudinger, Neil Sidney (1942-) 364.H Truesdell, Clifford Ambrose T. (1919-) 389.B Trych-Pohlmeyer, E. B. 402.G Tschebyscheff  Chebyshev Tsen Chung-Tze 27.E 118.F Tsirel'son B. S. 406.D Tsu Ch'ung-Chih (429-S00) S7.A 332 1912 Tsuboi, Takashi (19S3-) IS4.G Tsuchiya, Nobuo (19S0-) IS4.G, H Tsuda, Takao (1932-) 3S4.r Tsuji, Masatsugu (1894-1960) 48.r 62.B, D 124.C 234.r 242.A 367.r 388.B Tsuji, Tadashi (1946-) 384.B Tsukada, Kazumi (19S3-) 36S.N 391.N Tsukamoto, Tatsuo (1940-) 3S3.r Tsukamoto, Yo taro (1932-74) 178.B Tsushima, Ryuji (19S2-) 32.r Tsuzuku, Tosiro (1929-) 13.R Tucker, Albert William (1905-) 173.r 2S5.B, E 292.A, B Tugue, Tosiyuki (1926-) 22.C 81 3S6.G, r Tukey, John Wilder (191S-) 34.r 87.r 142.D, r 304.r 371.A, r 397.r 421.C, r 425.X, r 436.r Tumarkin, Lev Abramovich (1904-741 117.1 Tumura, Yosiro (1912-) 17.C, D 62.1D Tung Chuan S7.A Turan, Pal (Paul) (1910-76) 123.D, r Turing, Alan Mathison (1912-S4) 2:2.G 31.B, C 97.B 161.B 3S6.A Turner, M. 1. 304.r Tushkanov, S. B. 17.C Tutte, William T. 186.r Tyupkin, Yu. S. 80.r Tzafriri, Lior (1936-) 37.N, r 168.r u Uchida, Fuichi (1938-) 431.G Uchida, Koji (1939-) 14.L Uchiyama, Akihito (1948-) 168.B Udagawa, Kanehisa (1920-6S) 389.r Ueda, Tetsuo (19SI-) 72.K Ueda, Yoshisuke (1936-) 126.N Uehara, Hiroshi (1923-) 202.P Ueno, Kenji (1945-) 16.r 72.1, r Ueno, Tadashi(1931-) 11S.C Ugaheri, Tadashi (191S-) 240.B 338.C Uhl, 1. Jerry, Jr. (1940-) 443.A, H Uhlenbeck, George Eugene (1900-) 41.C 4S.I Uhlenbeck, Karen (1942-) 195.E, r :27S.D Uhlhorn, Ulf 2S8.r Uhlmann, Armin 212.B Ukai, Seiji (1939-) 41.D Ulam, Stanislaw Marcin (1909-84) :i3.F, r IS3.B 287.r 38S.C Ullman, Jeffrey D. (1942-) 31.r 71.r 7S.r 186.r Ullrich, Egon (1902-S7) 17.A, C Ulm, Helmut (1908-7S) 2.D Umemura, Hiroshi (1944-) 16.1 Umemura (Yamasaki), Yasuo (1934-) 22S.r 437.BB Umezawa, Hiroomi (1924-) IS0.r Umezawa, Toshio (1928-) 411.F Uno, Toshio (1902-) NTR Ura, Taro (\ 920-) 126.D, F Urabe, Minoru (1912-7S) 301.D Urakawa, Hajime (1946-) 391.E Ural'tseva, Nina Nikolaevna 286.r 323.D Urbanik, Kazimierz (1930-) 407.C Ursell, Harold Douglas 246.r Uryson, Pavel Samuilovich (1898-1924) 22.1 93.D 117.A, r 273.K 42S.Q, S, U, V, CC Ushiki, Shigehiro (19S0-) 126.N Ushio, Kazuhiko (1946-) 96.r Utida, Itumi (180S-82) 230 Utida, Shunro (1913-) 263.A Uzawa, Hirofumi (1928-) 292.A, E, r 
1913 v Vahlen, Karl Theodor (1869-1945) 83.B Vaillancourt, Remi (1934-) 304.F 345.A VaIn berg, Boris Rufimovich (1938-) 323.K VaInshteIn, Isaak Aronovich 273.K VaisaUi, Jussi 143.r 352.F Vajda, Steven 408.r Valentine, F. A. 88.r Valiron, Georges (1884-1954) I7.A, C, D 43.K 121.B, C, r 124.B 272.F, K 429.B 435.r van Beijeren, Henk 402.G van Ceulen, Ludolf (1540-1610) 332 Van Daele, Alfons 308.H van Dantzig, D. 109434.C van den Berg, F ranciscus Johannes (1833-92) 19.B van der Corput, Johannes Gualtherus (1890-1975) 4.C 182.H 242.A Vandermonde, Alexandre Theophile (1735-96) 103.G 190.Q van der Pol, Balthasar (1889-1959) 240.r 290.C van der Waerden, Bartel Leendert (1903-) 8.*, r 12.B 24.r 29.r 60.r 66.r 67.r 90.r 92.F, r 122.r 149.r l72.r 187.r 190.r 196284.r 337.r 351.r 362.r 368.r 369.E 371.C 417.E Vandiver, Harry Shultz (1882-1973) 14.L 145.*, r van Hove, Leon Charles Prudent (1924-) 351.K 402.G van Kampen, Egbertos R. (1908-42) 170 Van Moerbeke, Pierre (1944-) 287.C van Roomen, Adriaan (1561-1615) 444 van Schoo ten, Frans (1615-60) 444.r Varadarajan, Veeravalli Seshadri (1937-) 249.r Varadhan, Sathamangalam Ranga Ayyangar Srinivasa (1940-) 115.C, D, r 136.r 250.r 261.C 262.E 340.r 406.A, D, r Varadier, M. 443.A Varaiya, Pravin P. 86.D 108.B 292.F Varchenko, A. N. 418.r Varga, Otto (1909-69) 152.C Varga, Richard Steven (1928-) 302.r Varopoulos, Nicholas Theodoros 192.U Varopoulos, T. 17.C 267.r Varouchas, J. 232.C Varshamov, Rom Rubenovich (1927-) 63.B Vasilesco, Florin (1897-1958) 120.D Vaughan, Robert Charles 123.E Vaught, Robert L. 276.D, F Veblen, Oswald (1880-1960) 90.r 109.*, r 137 152.C 201.r 343.r 434.C, r Vedesinov, N. 425.Q Veech, William Austin (1938-) 136.H Vekua, lI'ya Nestorovich (1907-77) 217.J 323.r Veldkamp, Ferdinando D. 13.R Velo, Giorgio 150.r Veneziano, Gabriele (1942-) 132.C 386.C Venkov, Boris Borisovich (1934-) 200.M venttsel', Aleksandr Dmitrievich (1937-) 115.C 261.r 406.F Verbeure, Andre (1940-) 402.G Verbiest, Ferdinand (1623-88) 57.C Verdier, Jean- Louis (1935-) 16.r 450.Q, r Ver Eecke, Paul (1867-1959) 187.r Vergne, Michele 384.r Verhulst, Pierre Francois (1804-49) 263.A Verner, James Hamilton (1940-) 303.r Veronese, Giuseppe (1854-1917) 275.F Vershik, Anatolil Moiseevich (1933-) 136.D, r 183.r Name Index Wagschal, Claude Vesentini, EdoarG (1928-) 122.F Vessiot, Ernest (1,65-1952) 107.A 113 249.V Vey, Jacques 15,G, r 384.r Vick, James Whi)< (1942-) 201.r '1:'$r Viehweg, Eckart A 8-) 72.1, r 232.D, r Viete, Francois (I AJ-1603) 820332360444 Vietoris, Leopold (1891-) 201.A, C, E, L 425.Q Vigneras, Marie- France (1946-) 391.C Vigue, Jean-Pierre (1948-) 384.r Vilenkin, Naum Yakovlevich (1920-) 112.r 125.r 162.r 218.r 341.r 389.r 395.r 407.C 437.AA ViII at, Henri Rene Pierre (1879-1972) App. A, Table 15.VI Ville, Jean A. 262.A Vinberg, Ernest Borisovich (1937-) 122.G 351.1 384.C, r!;"" Vinogr "''Jv, Ivan M. (1891-1983) 4.C, E 123.B, E 242.FL95.E Vinter, Richard B. 127 G Virtanen, Kaarlo I. 62.C 352.A, C 367.E, I Vishik, Mark losifovich (1921-) 112.E 323.N Vitali, Giuseppe (1875-1932) 270.G 380.D Viterbi, Andrew 1. (1935-) 2l3.E Vitt, Aleksandr Adol'fovich 290.r Vitushkin, AnatoliI Georgievich (1931-) IM.J 169.E Vivanti, Giulio (1859-") 217.r 339.A Vladimirova, S. M. 365. J Vogan, David Alexander, Jr. (1954-) 437.r Vogel, Kurt (1888-1985) 24.r Vogel, William R. 200.r Vogt, Dietmar (1941-) 168.B Voichick, Michael (1934-) 164.K V oiculescu, Dan Virgil (1949-) 36.J 331.E Voigt. Jiirgen (1943-) 331.E VoIder, J. E. 142.C Volk, Isai Mikhailovich 289.E Volkov, Yurii Aleksandrovich (1930-) 365.J V olkovyskiI, L. 198.r Volterra, Vito (1860-1940) 20 68.J 162 163.B 198.J 217.A 222.A 263. B VoltyanskiI, V. G. 155.r von Eotvos, Roland (1848-1919) 359.D von Karman, Theodore (1881-1963) 205.E 433.C von Koch, Helge (1870-1924) 246.K 450.1 von Mises, Richard (1883-1953) 298.r 342.A 354.E 399.K, r von Neumann, John (Johann) (1903-57) 18.A, E, r 20 22.F 33.A-C, r 36.G 68.1 69.B, C 75.B 85.A 95 136.A, B, E, F 138.r 156.E, r 162 173.A, C, D, r 197.A, r 225.r 251.M 255.E 304.F 308.C, F, G, I, r 312.A 331.E 351.C, L, r 354.B 376 385.C 390.1 445 Vopenka, Petr(1935-) 33.r V oronoI, GeorgiI F edoseevich (1868-1908) 242.A Voss, Heinz-Jirrgen 186.r Vranceanu, Gheorghe (1900-79) 434.C Vulikh, Boris Zakharovich (1913-78) 310.A W Wad a, Junzo (1927-) 164.C Wada, Yasusi (Nei) (1787-1840) 230 Waelbroeck, Lucien (1929-) 36.M Wage, M. L. 117.E Wagner, Harvey Maurice (1931-) 307.r 408.r Wagner, Herbert 39.F Wagner, S. W. 95.r Wagschal, Claude 321.G 
Name Index Wagstaff, Samuel S. Wagstaff, Samuel S. 14.L 145 Wahl, Jonathan Michael (1945-) 9.r Wahlin, G. E. 145.r Wait, R. 223.r 301.r 304.r Wakakuwa, Hidekiyo (1925-) 364.F Wald, Abraham (1902-50) 376 398.A 399.H, M, r 4oo.r 401.F, r 421.r Waldhausen, Friedhelm (1938-) 65.E 235.B Waldschmidt, Michel (1946-) 430.D, r Wales, David B. (1939-) 151.1 App. B, Table 6 Walfisz (Val'fis), Arnold Z. (1892-1962) 4.D 123.D 220.B 242.r 295.D Walker, Arthur G. 359.E Walker, G. 421.D Walker, M. R. 376.r Walker, R. C. 425.r Walker, Robert John (1909-) 9.r 15.B Wall, Charles Terens Clegg (1936-) 114.B, F, H,J,K,r Wallace, Andrew Hugh (1926-) I 14.F, L Wallach, Nolan R. 178.r 199.r 249.r 275.A, F 364.r 365.G 437.W Wallis, Jennifer Seberry 241.r Wallis, John (1616-1703) 20265332 App. A, Table 10.VI Wallis, Walter Denis 241.r Wallis, Wilson Allen (1912-) 371.D Wallman, Henry (1915-) 117.r Walsh, John Joseph (1948-) 117.1 Walsh, Joseph Leonard (1895-1973) 223.r 317.C 336.F, I Walter, John H. (1927-) 15l.J Walter, 1. S. 142.C Walter, Wolfgang (1927-) 211.r Walters, Peter (1943-) 136.H Walther, Hansjoachim 186.r Wang Hsien-Chung (1918-78) 81110.E 148.E 152.r 199.r 413.r Wang, Ju-Kwei (1934-) 164.G Wang Xiaotong (c. early 7th century) 57.A Wantzel, Pierre-Laurent (1814-48) 179.A Ward, Harold Nathaniel (1936-) App. B, Table 5.r Ward, R. 16.r Waring, Edward (1736-98) 4.E Warner, Frank Wilson, m (1938-) 364.H Warner, Garth William, Jr. (1940-) 249.r 437.r Warning, E. 118.B Warschawski, Stefan Emanuel (1904-) 77.C Washington, A. 432.r Washington, Lawrence Clinton (1951-) 14.L 450.1 Washio, Ya8utoshi (1929-) 399.r Washizu, Kyuichiro (1921-81) 271.r Wasow, Wolfgang Richard (1909-) 25.B, r 30.r 107.r 254.r 289.E 304.r Wassermann, Gordon 51.r Watabe, Tsuyoshi (1934-) 431.D Watanabe, Kinji (1946-) 323.1 Watanabe, Shinzo (1935-) 44.E, r 45.r 115.C, r 261.r 262.r 406.B, D, F Watanabe, Takeshi (1931-) 260.J Watari, Chinami (1932-) 336.D Waternaux, Christine M. 280.r Watson, George Leo (1909-) 4.E 348.r Watson, George Neville (1886-) 39.E, r 160.C 174.r 220.B 268.r 389.r App. A, Tables 19.m, IV NTR Watson, H. W. 44.B, C, r Watt, 1. M. 303.r Wayland, Harold (1909-) 298.r Weaver, W. 403.r 1914 Weber, Claude Alain (1937-) 65.r Weber, H. F. 39.G App. A, Table 19.1V Weber, Heinrich (1842-1913) 8.r I1.B, r 12.B 73.A 98 167.C 236.r 36h, r App. A, Tables 19.m, IV, 20 Weber, Wilhelm Eduard (1804-91) 363 Webster, Arthur Gordon (1863-1923) 322.r Webster, Sidney M. (1945-) 344.F Wedderburn, Joseph Henry Maclagan (1882-1948) 29.E, F 149.M 190.L 368.G Wehrl, Alfred (1941-) 212.r Weierstrass, Karl Theodor Wilhelm (1815-97) 9.D I1.B, D 20 21.A, E 46.C 58.C 84.C 106.B 109 120.A 134.F 140 168.B 174.A 198.D, I, N, Q 229.r 236 267 272.A 273.F 274.F 275.A, B 294.A 334.B, C 336.A, F 339.A, D 355.D, r 370.B 379.H 429.B 430.D 435.A 447 App. A, Table 16.1V Weit, Andre (1906-) 3.C, E, M, r 4.D 6.E, r 9.E, H, r 12.B, r 13.M, r 14.r 16.A, C 20 21.G 27.r 28 32.C, D 59.H, r 60.0 73.B 109.*, r 118.B, D, E 122.F, G, r 182.E 192.r 196 225.G, r 232.B 422.r 436.A, r 437.P 450.A, H, M, O-S, r Weinberg, B. L. 96.r Weinberg, Louis (1919-) 282.r Weinberg, N. 425.U Weinberg, Steven (1933-) 132.C, D, r Weinberger, Hans Felix (1928-) 323.r 327.r Weingarten, Leonhard Gottfried Johannes Julius (1836-1910) 111.H, I 365.C App. A, Table 4.1 Weinstein, Alan David (1943-) 126.N 178.r Weinstock, Robert 441.r Weir, M. D. 425.r Weisberger, William I. (1937-) 132.C Weiss, Benjamin (1941-) 136.E-G Weiss, Edwin (1927-) 14.r 200.r Weiss, Guido Leopold (1928-) 168.B 224.r Weiss, Lionel (1923-) 398.r Weiss, Max L. (1933-) 43.r Weitsman, Allen W. 272.K, r Weitzenbock, Roland W. (1885-) 226.C Welch, Bernard Lewis 4oo.G Weldon, Edward 1., Jr. 63.r Wells, Raymond O'Neil. Jr. (1940-) 164.K 232.r 344.D, E Welsh, James Anthony Dominic 66.r Wendroff, Burton 304.F Wentzel, Gregor (1898-) 25.B Wentzell  Venttsel' Wermer, John (1927-) 164.F, G, I, K, r West, James Edward (1944-) 382.D Westlake, Joan R. 302.r Westwater, Michael John (1942-) 146.A Wets, M. J. 408.r Weyl, Claus Hugo Hermann (1885-1955) 7.r 9.r 11.B 13.H, J, Q, R 14.r 18.A 20 21.N 6O.r 69.B 109 I 12.D, I, N, 0 124.B, r 126.L 137 139.B, r 156.B 182.F, H, r 190.r 196.r 197.A 198.r 225.1 226.r 248.F, P, R, W, Z, r 249.U, V, r 255.E 272.L 323.E, G, M 331.E 351.C, r 359.r 362.H 367.A, E, r 377.A 390.1 391.1 413.G, J 434.B 437.DD 442.r 445 448 App. A, Table 4.11 Weyl, Fritz Joachim (1915-77) 21.N, r 124.B, r 272.L 448 Weyrich, Rudolf(1894-) 39.r App. A, Table 19.m Whaples, George William (1914-81) 14.F Wheeler, John Archibald (1911-) 359.r 386.C 434.C White, Paul A. (1915-) 298.r Whitehead, Alfred North (1861-1947) 156.B 319.r 411.A, r 
1915 Whitehead, George William (1918-) 64.r 70.r 148.r 202.P, Q, T - V, r Whitehead, John Henry Constantine (1904-60) 65.A, C, F 90.r 91.r 109.r 114.A, C 178.A 2oo.L 202.F, N, P 237.J 426 Whiteside, Derek Thomas 265.r 283.r Whitham, Gerald Beresford (1927-) 205.r Whitin, Thomson M. 227.r Whitney, D. Ransom 371.A, C Whitney, Hassler (1907-) 56.B, F 58.B-E 66.r 105.A, D, K, r 111, J 114.A, B, D, r 126.E 147.A, F, M 168.B 186.H 201.A, J 418.G Whittaker, Edmund Taylor (1873-1956) 167.B 174.r 268.r 271.r 301.C 389.r 420.r 450.0 App. A, Tables 14.11, 19, 20.r Whittle, Peter (1927-) 421.r Whyburn, Gordon Thomas (1904-69) 79.r 93.r 426 Wick, Gian Carlo (1909-) 351.K Widder, David Vernon (1898-1973) 94.r 220.D 240.B, D, E 341.r Widlund, Olof B. (1938-) 303.G Widman, Kjell-Ove 195.E Widom, Harold (1932-) 164.K Wieferich, Arthur 145 Wielandt, Helmut (1910-) 151.B, E, H, r Wiener, Norbert (1894-1964) 5.C 18.A, r 20 36.L 37.A 45.A, B, D, r 48.A, B 58.r 86.E 9h, r 120.B-D 123.B 125.0, BB 136.B 159.1 160.B, E, G, r 162 176.C, I 192.F, H, 0, P, r 207.C, D 222.C 250.E 260.E 338.G 339.r 342.A 395.D, r 406.B 407.r Wierman, John Charles (1949-) 340.r Wigert, S. 295.E Wightman, Arthur Strong (1922-) 150.C, D, r 212.B 258.r 351.K 386.r Wigner, Eugene Paul (1902-) 150.A 212.B 258.C, r 351.H, J-L, r 353.B, r 377.B 437.DD Wijsman, Robert Arthur (1920-) 396.r Wilcox, Calvin Hayden (1924-) 375.B Wilcoxon, Frank 371.A-C Wilczynski, Ernst Julius (1876-1932) 10911O.B Wilder, Raymond Louis (1896-1982) 79.r 93.r Wiles, Andrew (1953-) 14.L 257.H 450.1 Wilkes, James Oscroft (1932-) 304.r Wilkinson, James Hardy (1919-) 138.C 298.r 3oo.r 301.r 302.r Wilks, Samuel Stanley (1906-64) 280.B 374.r 396.r 399.P Willers, Friedrich-Adolf(1883-) 19.r Williams, David (1938-) 260.P Williams, H. C. 123.C Williams, Robert F. (1928-) 126.J, K, N Williamson, Jack (1940-) 272.K, r Williamson, Robert E. (1937-) 114.H Willmore, Thomas James (1919-) l11.r 365.0 Wilson, B. M. 295.E Wilson, Edwin Bidwell (1879-1964) 374.F Wilson, John (1741-93) 297.G Wilson, K. B. 102.r 301.L Wilson, Kenneth G. (1936-) 361.r Wiman, Anders (1865-1959) 429.B Winnink, Marinus 308.H Winter, David John (1939-) l72.r Winters, Gayn B. 9.r Wintner, Aurel Frederick (1903-58) 55.r 420.r 435.E Wirtinger, Wilhelm (1865-?) 3.r 235.B, D 365.L App. A, Table 8 Wishart, John 374.C Witt, Ernst (1914-) 9.E 59.H 60.0 149.M 151.H Name Index Yamato, Kenji 161.B 248.J 348.E 449.A, B Witten, Louis (1921-) 359.r Wold, Herman O. A. 395.D Wolf, Emil (1922-) 446.r Wolf, Joseph Albert (1936-) 178.r 364.D 365.F 412.r 413.r Wolfe, Douglas A. 371.r 374.r Wolfe, Philip (1927-) 292.D 349.C Wolff, Julius 43.K 77.C Wolff, Thomas H. 164.1 Wolfowitz, Jacob (1910-81) 63.r 213.F 399.J, N, r 4OO.r Wolovich, William A. (1937-) 86.D Wondratschek, Hans 92.F Wong, Eugene (1934-) 403.C Wong, James Sai-Wing (1940-) 314.r Wong Pui-Kei (1935-) 21.N Wonham, W. Muray (1934-) 86.E Wood, John William (1941-) 154.B, H Wood, Rex Chester (1920-) 19.r Woodcock, Alexander E. R. 51.r Woods, E. J. (1936-) 308.1 377.r Woods, H. 1. 92.F Woolard, Edgar William (1899-) 392.r Woronowicz, Stanislaw Lech (1941-) 402.G Wrench, John William, Jr. (1911-) 332.r Wright, D. App. B, Table 5 Wright, Elisabeth Maitland 4.D, r 83.r 291.F 295.r 328.*, r Wright, Sewall (1889-) 263.E Wronski, Hoene Joseph Maria (1776-1853) 208.E Wu Chien-Shiung (1913-) 359.C Wu Hung-Hsi (1940-) 178.r Wu Tai Tsun (1933-) 402.r Wu Wen-Tsiin (1919-) 56.F, r 90.r App. A, Table 6.V Wulf, W. A. 75.r Wylie, Alexander (1815-87) 57.C Wylie, Shaun (1913-) 70.r 201.r x Xavier, Frederico Jose de Vasconcelos (1951-) 275.E Xiahou Yang (c. 5th century) 57.A Xu Y ue (fl. 200) 57.A Y Yabuta, K6z6 (1940-) 164.G Yaglom, Akiva Moiseevich (1921-) 44.B, C 395.r 407.C 433.r Yaglom, Isaak Moiseevich (1921-) 89.r Yamabe, Hidehiko (1923-60) 183196 249.D, V 364.H Yamada, Masami (1926-) 353.r Yamada, Toshihiko (1939-) 362.r 450.T Yamada, Toshio (1937-) 406.D Yamagami, Shigeru (1955-) 212.B Yamaguchi, Keizo (1951-) 344.C Yamaguti, Masaya (1925-) 126.N 303.G, r 304.F 325.H Yamamoto, Koichi (1921-) 241.E Yamamoto, Sumiyasu (1917-) 96.F Yamamoto, Yoshihiko (1941-) 450.S Yamanaka, Takesi (1931-) 286.Z Yamanoshita, Tsuneyo (1929-) 202.S Yamashita, Hiroshi (1946-) 301.F Yamasuge, Hiroshi (1926-60) 114.F Yamato, Kenji (1948-) 154.G 
Name Index Yamauchi, Kazunari Yamauchi, Kazunari (1945-) 364.F Vamauti, Ziro (1898-1984) 142.B NTR Yamazaki, Keijiro (1932-) 362.J Yamazato, Makoto (1949-) 341.H Yamazi, N usizumi (1704- 72) 230 Yanagawa, Takashi (1940-) 371.A Yanagihara, Hiroshi (1934-) 203.r Yanai, Haruo (1940-) 346,r Yanase, Mutsuo (1922-) . 212,B Yanenko, NikolaI Nikolaevich (1921-84) 204.r Yang Chen Ning (1922-) 80.Q, r 132.D, r 150.G 359.C Yang Chung-Tao (1923-) 178.r Yang Hui (fl. 1261) 57.B Yang, Paul C. P. (1947-) 391.E Yano, Kentaro (1912-) 72.r 80.r l09.r 110.E 194.r 364.F 365.H 417.r Yano, Shigeki (1922-) 159.G Yanpol'skii, Avraam Ruvimovich (1905-) NTR Yaspan, Arthur J. 307.r Yasugi, Mariko (1937-) 156.E Yasuhara, Mitsuru (1932-) 411.1 Yasuura, Kamenosuke (1923-) 282.r Yates, Frank (1902-) 19.r 371.C STR Yau Shing-Tung (1949-) 72.G 115.D 183.r 195.r 232.C, r 235.E 275.C, D, H 364.r 365.H, L 391.D, E Yen Chih-Ta 427.B Yennie, Donald Robert (1924-) 146.B Yin Wen-Lin 242.A Yoneda, Nobuo (1930-) 52.r 2oo.K Yoneyama, Kunizo (1877-1968) 79.D Yor, Marc (1949-) 176.C Yorke, James A. 126.N 303.G Yoshida, Hiroyuki (1947-) 450.s Yoshida, Masaaki (1948-) 428.H Yoshida, Masao (1913-) NTR Yoshida, Norio (1916-70) 367.1 Yoshida, Tomoyoshi (1946-) 431.D Yoshikawa, Atsushi (1942-) 224.E Yoshikawa, Zituo (1878-1915) 437.r Yoshizawa, Taro (1919-) 163.r 290.r 394.r Yosida, K6saku (1909-) 36.r 37.r 68.r 107.r 112.r 115.A 136.B, r 162 168.r 192.r 197.r 240 r 251.r 286.X 288.B 306.A 378.B, D, H, r 390.r Yosida, Mituyosi (1598-1672) 230 Yosida, Y6iti (1898-) NTR Youden, William Jdhn (1900-71) 102.K Y oula, Dante C. 86.D Young, Alfred (1873-1940) 362.H Young, David Monaghan (1923-) 301.r 302.r Young, Gail Sellers (1915-) 79.r Young, H. Peyton 173.E 201.r Young, John Wesley (1879-1932) 343.r Young, Richard Donald (1929-) 215.B Young, Thomas (1773-1829) 271.G Young, William Henry (1863-1942) 106.H 159.J 224.E 317.B App. A, Table 8 Youngs, 1. W. T. 157.E Yukawa, Hideki (1907-81) 132.A 150.A 338.M Yule, George Udny (1871-1951) 421.D YuzvinskiI, SergeI Aronovich (1936-) 136.r Yvon, Jacques (1903-) 402.J z Zabrelko, Petr Petrovich (1939-) 251.r Zabusky, Norman J. (1929-) 387.B Zacks, Shelemyahu (1932-) 399.H Zagier, Don 15.H 72.r 450.T 1916 Zak, F. L. 16.1 Zakharov, V. E. 80.r 387.F, G Zaleman, Lawrence 169.r Zamansky, Marc (1916-) 336.D Zaremba, Stanislaw (1863-1942) 120.B Zaring, Wilson B. 33.r Zariski, Oscar (1899-1986) 12.B, r 15.B, D, E, H, r 16,A, I, J, L, X, r 67.r 284,C, D, G, r 370,r 418,r 439.r Zarnke, Charles Robert 123.C Zassenhaus, Hans J. (1912-) 92.A, F 151.E, H, J, r 190.r 362.K Zeeman, Erik Christopher (1925-) 51.A, r 65.A, C, D, r 235.G 426 Zegalov  Zhegalov Zeigler, Bernard P. 385.r Zeller, Karl (1924-) 43.G 58.r 379.r Zeller-Meier, Georges 308.F Zemansky, Mark Waldo (1900-) 419.r Zener, Clarence (1905-) 264.r Zeno (c. 490-c. 430 B.C.) 187319.C Zenor, Phillip L. 273.K Zermelo, Ernst Friedrich Ferdinand (1871-1953) 33.A, B, r 34.B, r 41.A 381.F, G, r Zerna, Wolfgang (1916-) 271.r Zhang Qiujian (c. 5th century) 57.A Zhegalov, Valentin Ivanovich 326.r Zhelobenko, Dmitril Petrovich (1934-) 437.EE, r Zia-ud-Din, M. App. B, Table 5.r Ziemer, William P. (1934-) 246.J Zilber, Joseph Abraham (1923-) 70.E 201.1 Ziller, Wolfgang (1952-) 279.G 364.r Zimmermann, Wolfhart (1928-) 146.A 150.D Zippin, Leo (1905-) 2.D 196 249.V, r 423.N 431.r Zobin, N. M. 424.S Zoll, Otto 178.G Zoretti, M. Ludovic 79.D Zorn, Max A. (1906-) 34.C, r 54.r 450.L Zoutendijk, Guus (1929-) 292.E Zsido, Laszlo (1946-) 308.r Zuckerman, Herbert Samuel (1912-70) 118.r 192.P Zumino, Bruno (1923-) 150.D 386.B Zurmiihl, Rudolf (1904-) 298.r Zuse, Konrad (1910-) 75.A Zverkin, Aleksandr Mikhailovich (1927-) 163.r Zvyagin, V. G. 286.r Zwinggi, Ernst (1905-) 214.r Zygmund, Antoni (1902-) 136.B I 59.E, G, r 168.B, r 192.r 198.r 217.1, r 224.r 251.0 274.B, I 336.C, r Zykov, Aleksandr Aleksandrovich (1922-) 186.r 
Subject Index Note Citation is to article and section or to a table in an appendix, not to page. A a (cardinal number of N) 49.A A(a) (the totality of functions bounded and con- tinuous on the closure ofa and holomorphic in a) 168.B Ap(a)(the tota]ity offunctionsj that are holomor- phic in a and that satisfy r I {(z)l" dx dy < ."L) Jr, 168.B :x-capacity 169.C :x-excessive function 261. D :x-limit point 126.D :x-limit set (of an orbit) 126.D O(-perfect,O(-perfectness 186.J O(-point (of a meromorphic function) 272.B :x-pseudo-orbit 126.J :x-quartile 396.C :x-string 248.L :x-trimmed mean 371.H a-adic completion (of an R-module) 284.B a-adic topology (of an R-module) 284.B .<i-characteristic class (of a real oriented vector bundle) 237.F A-balanced mapping 277.J A-B-bimodule 277.D A-homomorphism between A-modules 277.E of degree p (between two graded A-modules) 200.B A-linear mapping (between A-modules) 277.E A-module 277.C A-optimality 102.E A set 22.A 409.A A-stability 303.G A (O()-stability 303.G AD-stability 303.G A(O)-stable 303.G A-submodule 277.C A-summable 379.N A-number 430.C abacus 75.A Abel, N. H. I Abel continuity theorem 121.D 339.B Abelian category 52.N Abelian differential II C 367.H Abelian equation 172.G Abelian ergodic theorem 136.B Abelian extension 172.B Abelian function 3.J elementary 3.M Abelian function field 3.J Abelian group(s) 2 190.A category of 52.B class of 202.N dual topological 422.C elementary 2.B elementary topological 422.E free 2. C meta- 190.H mixed 2.A primary 2.A reduced 2.D topological 422.A torsion 2.A torsion-free 2.A of type p x 2.D Abelian ideal (of a Lie algebra) 248.C Abelian integral I1.C Abelian Lie algebra 248.C Abelian Lie group 249.D Abelian linear group over K 60.L Abelian p-group 2.A complete 2.D divisible 2.D Abelian potential 402.G Abelian projection operator 308.E Abelian subvariety 3.B Abelian surface 15.H Abelian theorems 240.G Abelian variety (varieties) 3 isogenous 3.C polarized 3.G simple 3.B Abel integral equation 217.L Abel method, summable by 379.N Abel method of summation 379.N Abel partial summation 379.D Abel problem 217.L Abel test 379.D Abel theorem (on the Cauchy product of two series) 379.F (in the theory of algebraic functions) 3.L II.E aberration 180.C annual 392 diurnal 392 Aberth (DKA) method, Durand-Kerner- 301.F Abramov's formula 136.E abscissa of absolute convergence (of a Dirichlet series) 121.B of absolute convergence (of a Laplace trans- form) 240.B of boundedness (of a Dirichlet series) 121.B of convergence (of a Dirichlet series) 121.B of convergence (of a Laplace transform) 240.B.H of regularity (of a Dirichlet series) 121.B of regularity (of a Laplace transform) 240.C of simple convergence (of a Dirichlet series) 121.B of uniform convergence (of a Dirichlet series) 121.B of uniform convergence (of a Laplace transform) 240.B absolute (for a quadric hypersurface) 285.C absolute Borel summable 379.0 absolute class field 59.A absolute continuity generalized 100.C generalized, in the restricted sense 100.C space of 390.E absolute continuity (*), generalized 100.C absolute convergence, abscissa of (of a Dirichlet series) 121.B (ofa Laplace transform) 240.B absolute covariant 226.D absolute curvature (of a curve) 111.C absolute figure (in the Erlangen program) 137 absolute homology group 201.L absolute inequality 21l.A absolute integral invariant 219.A absolute invariant 12.A 226.A absolutely closed space 425. U 
Subject Index Absolutely continuous absolutely continuous (function) 100.C (mapping in the plane) 246.H (measure) 270.L (set function) 380.C (vector measure) 443.G generalized 100.C }1- 380.C in the restricted sense loo.C in the sense of ToneJli 246.C absolutely continuous (*) 100.C absolutely continuous spectrum 390.E absolutely convergent (double series) 379.E (infinite product) 379.G (Laplace-Stieltjes integral) 240.B (power series) 2\.B (series) 379.C (series in a Banach space) 443.D uniformly 435.A absolutely convex set (in a topological linear space) 424.E absolutely integrable function 216.E absolutely irreducible (representation) 362.F absolutely irreducible character 362.E absolutely measurable 270.L absolutely p-valent 438.E locally 438.E absolutely simple algebraic group 13.L absolutely stable (linear k-step method) 303.G (system of differential equations) 29\.E absolutely summing (operator) 68.N absolutely uniserial algebra 29.1 absolute minimality 16.1 absolute moment (kth) 341.B absolute multiple covariant 226.E absolute neighborhood retract 202.D fundamental (FANR) 382.C absolute norm (of an integral ideal) 14.C absolute parallelism 191.B absolute retract 202.D fundamental (FAR) 382.C absolute stability 303.G interval of 303.G region of 303.G region of, of the Runge-Kutta (P, p) method 303.G absolute temperature 419.A absolute value (of a complex number) 74.B (of an element of an ordered field) 149.N (of an element of a vector lattice) 310.B (of a real number) 355.A (of a vector) 442.B absorb (a subset, topological linear space) 424.E absorbing barrier 115.B absorbing set (in a topological linear space) 424.E absorption cross section 37S.A absorption law in the algebra of sets 38 \.B in a lattice 243.A absorption principle, limiting 375.C abstract algebraic variety 16.C abstract L space 3l0.G ab_tract Lp space 310.G abstract M space 31O.G abstract Riemann surface 367.A abstract simplicial complex 70.C abstract space 381.B 1918 abstract variety 16.C abundant number 297.D acceleration parameter 302.C acceptance 400.A acceptance region 4oo.A accepted 31.D accessible (from a region) 93.K accessible boundary point 333.B accretive operator (in a Hilbert space) 251.J 286.C accumulated error 138.C accumulation point 87.C 425.0 complete 425.0 acoustic problem 325.L act on a commutative ring 226.A freely (on a topological space) 122.A action 431.A R- l26.B rational 226.B reductive 226.B Z- 126.B action and reaction, law of 27l.A action integral 80.Q action space 398.A activity 28\.D activity analysis 376 actuarial mathematics 214.A acute angle 139.D acute type 304.C strongly 304.C acyclic complex 200.C 200.H Adams-Bashforth method 303.E Adams conjecture 237.1 Adams-Moulton method 303.E Adams operation 237.E adapted (stochastic process) 407.B adaptive scheme 299.C addition (in a commutative group) 190.A (of natural numbers) 294.B (in a ring) 368.A (of unfoldings) 51.D addition formula algebraic 3.M for e Z 13\.G for sine and cosine 432.A of trigonometric function's App. A, Table 2.1 addition theorem of Bessel functions 39.B of cylindrical functions App. A, Table 19.m of Legendre functions 393.C of the g;J-function App. A, Table 16.1V of sn, cn, dn App. A, Table 16.m of the (-function App. A, Table 16.1V additive completely  completely additive countably  countably additive finitely  finitely additive (J-  (J-additive totally  totally additive additive category 52.N additive class completely 270.B countably 270.B finitely 270.B additive functional (ofa Markov process) 261.E martingale 26\.E natural 26\.E perfect 261.E 
1919 additive functor 52.N additive group 2.E 190.A complete 2.E divisible 2.E free 2.E ordered 439.B totally ordered 439.B additive interval function 380.B continuous 380.B additive measure completely 270D finitely 270.D (J- 270.D additive number theory 4 additive operator 251.A additive processes 5 342.A temporally homogeneous 5.B additive set function 380.C completely 380.C finitely 380.B /l-absolutely continuous 380.C additive valuation 439.B additivity (in the theory of local observables) 150.E complete (of the integral) 221.C complete (of a measure) 270.D for the contours (in the curvilinear integral) 94D countable 270.D of probability 342.B (J- 270.D address 75.B single-, instructions 75.C adele 6.C and idele 6 principal 6.C adele group of an algebraic group 13.P of a linear algebraic group 6.C adele ring (of an algebraic number field) 6.C adeles and ideles 6 Adem formula App. A, Table 6.11 Adem relation 64.B adequacy 396.1 adherent point 425.B ADI (alternating direction implicit) method 304.F adiabatic law 205.B adiabatic process, quasi static 419.B adiabatic wall 419.A adjacent (chamber) 13.R (edges) 186.B (germ) 418.E (vertices) 186.B adjacement matrix 186.G adjoin a set to a field 149.D a variable to a commutative ring 337.A adjoint left (linear mapping) 52.K 256.Q right (linear mapping) 52.K 256.Q self-  self-adjoint adjoint boundary condition 315.B adjoint boundary value problem 315.B adjoint differential equations 252.K adjoint differential expression 252.K adjoint functor 52.K adjoint group isogenous to an algebraic group 13.N Subject Index Affine mapping of a Lie algebra 248.H of a Lie group 249.P adjoint Hilbert space 251.E adjoint kernel (of a kernel of a potential) 338.B adjoint Lie algebra 248.B adjoint matrix 269.1 adjoint operator (on Banach spaces) 37.D 251.D (on Hilbert spaces) 251.E (of a linear partial differential operator) 322.E (of a microdifferential operator) 274.F (of a microlocal operator) 274.F adjoint representation of a Lie algebra 248.B of a Lie group 249.P of a linear representation 362.E adjoint space (of a linear topological space) 424.D adjoint system (of a complete linear system on an algebraic surface) 15.D of differential equations 252.K adjunction formula 15.D adjustment sampling inspection with 404.C seasonal 397.N Adler-Weisberger sum rule 132.C admissible (decision function) 398.B (estimator) 399.G (extremal length) 143.A admissible automorphic representations 450.N admissible control 405.A admissible function 46.A 304.B admissible homomorphism (between O-groups) 190.E admissible isomorphism (between O-groups) 190.E admissible lattice (in R n ), S- 182.B admissible monomial (in Steenrod algebra) 64.B, admissible normal subgroup 190.E admissible ordinal 356.G admissible sequence (in Steenrod algebra) 64.B, App. A, Table 6.II1 admissible subgroup (of an O-group) 190.E Ado theorem 248.F advanced type (offunctional differential equation) 163.A a.e. (almost everywhere) 270.D AF-algebra 36.H affect (of an algebraic equation) 172.G affectless algebraic equation 172.G affine (morphism) 16.D affine algebraic group l3.A affine algebraic variety 16.A quasi- 16.C affine arc element IIO.C affine arc length IIO.C affine binormal 110.C affine connection 80.H 286.L canonical (on Rn) 80.J coefficien ts of 80. L affine coordinates 7.C affine curvature IIO.C affine differential geometry 110.C affine frame (of an affine space) 7.C affine geometry 7 in the narrower sense 7.E affine length 110.C affine locally symmetric space 80.1 affinely congruent 7.E affine mapping 7.E 
Subject Index Affine minimal surface affine minimal surface 110 C affine normal IIO.C affine princIpal normal IIO.C affine ring 16.A affine scheme 16.D affine space 7.A affine symmetric space 80J affine torsion IIO.C affine transformation(s) 7.E 364.F group of 7.E of a manifold with an affine connection 80J proper 7.E regular 7.E affine variety 16.A affine Weyl group (of a symmetric Riemann space) A1,",)f"' "+1 J.\J affinity 7.E equivalent 7.E age-dependent branching process 44.E Agnesi, witch of 93.H Ahlfors finiteness theorem 234.D Ahlfors five-disk theorem 272J Ahlfors function 43.G 77.E Ahlfors principal theorem 367.B Ahlfors theory of covering surfaces 367.B Airy integral App. A, Table 19.1V Aitken interpolation scheme 223.B, App. A, Table 21 Akizuki theorem, Krull- 284.F Alaoglu theorem, Banach- (in a Banach space) 37.E (in a topological linear space) 424.H Albanese variety 16.P of a compact Kahler manifold 232.C strict 16.P Aleksandrov compactification 207.B aleph 49.E alpha (Xa) 312.D zero (Xo) 49.E Alexander cohomology group 201.M relative 20l.M Alexander duality theorem 210.0 Alexander horned sphere 65.G Alexander ideal (of a knot) 235.C Alexander-Kolmogorov-Spanier cohomology theory 20l.M Alexander matrix (of a knot) 235.C Alexander polynomial (of a knot) 235.C Alexander polynomial (of a link) 235.D Alexander trick 65.D Alexander-Whitney mapping (map) 201.1 Alfven wave 259 algebra 8 (over a field) 203.F (of sets) 270.B algebraic 29J alternativc 231.A AF- 36.H approximately finite 36.H association 102J associative 29 231.A augmented 200.L A W*- 36.H Azumaya 29.K Banach 36.A Banach *- 36.F Boolean 42.A 243.A Boolean, generalized 42.B C*- 36.G C*-, of typc I 430J 1920 C*-group (of a locally compact He usdorff group) 36.L Calkin 36.1 390.1 Cayley 54 Cayley, general 54 central separable 29.K central simple 29.E Clifford 61.A over a commutative ring 29.A composition 231.B current 132.C cyclic 29.G derived (of a Lie algebra) 248.C Dirichlet 164.B di,k 164.B distributive 23 LA division 29.A Douglas 164.1 dual 203.F enveloping 200.L 231.A enveloping, universal (of a Lie algebra) 248J enveloping von Neumann 36.G exterior (of a linear space) 256.0 Frobenius 29.H full matrix 269.B function 164.A graded 203.B Grassmann (of a linear space) 256.0 group 29.C 38.L 192.H Hecke 29.C 32.D homological 200.A Hopf 203.H Hopf, dual 203.C Hopf, elementary 203.D Hopf, graded 203.C increasing family of a- 407.B j- 384.C Jordan 231 Lt- (of a locally compact HausdorlT group) 36.L Lie 248.A liminal C*- 36.H linear 8 of logic 411.A logmodular 164.B multiplier 36.K nonassociative 231.A normal j- 384.C normal simple 29.E operator 308.A optional a- 407.B PI- 29J po,tliminal C*- 36.H power associative 231.A predictable a- 407.B quasi-Frobenius 29.H quaternion 29.D quaternion, generalized 29.D quaternion, Hamilton 29.B quaternion, totally definite 27.D quotient 29.A Racah 353.A reduced 231.B relationship 102J residue class 29.A a- 270.B semigroup 29.C semigroup, large 29.C (semi)simple 29.A separable 29.F,K 200.L 
1921 a- 270 B a, tail 342.G a-, topological 270.C a-, well-measurable 407.B simple 29.A solvable 231.A Staudt 343.C Steenrod 64.B supplemented 200 L symmetric 29.H tensor (on a linear space) 256.K Thom 114.H total matrix 269.B uniform l64.A uniscl ial 29.1 uniserial, ab&olutely 29.1 uniserial, generalized 29.1 unitary 29.A universal enveloping (of a Lie algebra) 248.J vector App. A, Table 3.1 von Neumann 308.C von Ncumann, induced 308.C von Neumann, reduced 308.C W*- 308.C weak* Dirichlet 164.G zero 29A algebra class (of central simple algebras) 29.E algebra class group 29.E algebra extension 29.D 200.L algebra homomorphism 29.A algebraic addition formula 3.M algebraic algebra 29.J algebraic analysis l25.A algebraically closed (in a field) 149.1 algebraically closed field 149.1 quasi- l18.F algebraically dependent (on a family of elements of a field) l49.K algebraically dependent elements (of a ring) 369.A algebraically equivalent cycles 16.R algebraically equivalent to 0 (a divisor on an alge- braic variety) 16. P algebraically independent (over a field) 149.K algebraically independent elements (of a ring) 369.A algebraically simple eigenvalue 390.E algebraic branch point (of a Riemann surface) 367.B algebraic closure of a field 149.1 separable 257.E algebraic correspondence 9 H 16.1 group of classes of 9 H algebraic curves I1.A irreducible 11.B plane 9.B algebraic cycles 450.Q algebraic differential equation 113 288.A algebraic dimension (of a compact complex mani- fold) 74.F algebraic element (of a field) 149.E algebraic equations 10, App. A, Table 1 in m unknowns lOA algebraic cxtension 149.E algebraic family (of cycles on an algebraic varicty) 16.R algebraic fiber space 72.1 algebraic functIOn l1.A algebraic function field over k of dimension I 9.D Subject Index Algorithm over k of transcendence degree 1 9. D in n variable 149.K (-function of 450.P algebraic fundamental group 16.U algebraic geometry 12.A algebraic groups 13 absolutely simple 13.U affine 13.A almost simple 13. U connected 13.A isogenous l3.A k-almost simple 15.0 k-anisotropic 13.G k-compact 13.G k-isotropic 13.G k-quasisplit 13.0 k-simple 13.0 k-solvable 13.F k-split 13.N linear 13.A nilpotent 13.F reductive 13.1 semisimple 13.1 simple 13.L solvable 13.F unipotent 13.E algebraic group variety 13.B algebraic homotopy group 16.U algebraic integer 14.A algebraic K-theory 237.J higher 237.1 algebraic Lie algebra 13.C algebraic linear functional 424.B algebraic multiplicity (of an eigenvalue) 309.B algebraic number l4.A algebraic number fields 14 relative 14.1 algebraic pencil 15.C algebraic point (over a field) 369.C algebraic scheme 16. D algebraic sheaf, coherent 16.E 72.F algebraic singularity (of an analytic function) 198.M algebraic solution (of an algebraic equation) 10.D algebraic space 16.W algebraic subgroup 13.A algebraic surfaces 15 algebraic system of r equations 10 in the wider sense 409.B algebraic topology 426 algebraic torus 13.D algebraic varieties 1616.C abstract 16.C affine 16.A complex 16.T normal 16.F product 16.A projective 16.A quasi-affine 16.C quasiprojective 16.C algebra isomorphism 29.A algebroidal function(s) 17 entire 17.B k-valued 17.A algorithm 97 356.C composite simplex 2S5.F divisIOn 297.A division (of polynomials) 337 C dual simplex 255.F 
Subject Index Alignment chart Euclidean 297.A Euclidean (of polynomials) 337.D fractional cutting plane 2IS:B greedy 66.G heuristic 2IS.E partitioning 21S.E primal-dual 2SS.F variable-step variable-order (VSVO) 303.E VSVO 303.E aliases 102.1 alignment chart 19.D alIied series (of a trigonometric series) IS9.A alI-integer 2IS.B alI-integer algorithm 21S.B alI-integer programming problem 21S.A alIocation, optimum 373.E alIocation process, multistage 127.A alIowed homomorphism (between A-modules) 277.E alIowed submodule 277.C almost aU 342.B almost aU points of a variety 16.A almost certainly converge 342.D almost certainly occur 342.B almost complex manifold 72.B stably 114.H weakly 114.H almost complex structure 72.B tensor field of (induced by a complex structure) 72.B almost conformal 275.C almost contact manifold 110.E almost contact metric structure IIO.E almost contact structure 110.E almost elTective action (on a set) 4IS.B almost elTectively (act on a G-space) 431.A almost everywhere converge 342.D almost everywhere hold (in a measure space) 270.D almost finite memory channels 213.F almost G-invariant statistic 396.1 almost invariant test 4oo.E almost paralIelizable manifold 114.1 almost periodic (motion) 126.D almost periodic differential equation 290.A almost periodic functions 18 analytic 18.D on a group 18.F with respect to p 18.C in the sense of Bohr 18.B uniformly 18.B almost periodic group, maximalIy 18.1 almost periodic group, minimalIy 18.1 almost simple algebraic group 13.L k- 13.0 almost subharmonic 193.T almost surely converge 342.D almost surely occur 342.B almost symplectic structure 191.B alphabet 31.B 63.A 213.B alternate angles 139.D alternating contravariant tensor 256.N alternating covariant tensor 256.N alternating direction implicit (ADI) method 304.F alternating function 337.1 alternating group of degree n lSI.G alternating knot 23S.A alternating law (in a Lie algebra) 248.A alternating matrix 269.B alternating multilinear form 2S6.H alternating multilinear mapping 256.H 1922 alternating polynomial 337.1 simplest 337.1 alternating series 379.C alternating tensor field 10S.0 alternative (in game theory) 173.B alternative algebra 231.A alternative field 231A alternative hypothesis 400.A alternative theorem, Fredholm 68.E 217.F alternizer 2S6.N altitude (of a commutative ring) 67.E altitude theorem, KrulI 284.A amalgamated product (of a family of groups) 190.M amalgamated sum S2.G ambient isotropic 6S.D ambient isotropy 6S.D ambig class (of a quadratic field) 347.F ambig ideal (of a quadratic field) 347.F ambiguous point 62.D amicable number 297.D Amitsur cohomology group 200.P Amitsur complex 2OO.P amount insured 214.A amount of inspection, expected 404.(' amount of insurance 214.A Ampere equations, Monge- 278, App. A, Table IS.III Ampere transformation 82.A amphicheiral knot 23S.A ample divisor 16.N very 16.N ample linear system 16.N very 16.N ample over S (of a sheaf on a scheme) 16.E relatively 16.E very 16.E ample vector bundle 16.Y amplification matrix 304.F amplification operator of a scheme 304.F amplitude (of a complex number) 74.C (function) App. A, Table 16.III (of an oscilIation) 318.A (of time series data) 397.N (of a wave) 446 Feynman 146.B partial wave scattering 37S.E probability 3SI.D scattering 37S.C,E 386.B amplitude function (of a Fourier integral operator) 274.C 34S.B AMU estimator, kth-order 399.0 analog, difference 304.E analog computation 19 analog computers 19.E electronic 19.E analog of de Rham's theorem 21.L analog quantity 138.B analog simulation 38S.A analysis 20 activity 376 algebraic 12S.A backward 138.C backward error 302.B combinatorial 66.A consistency of IS6.E convex 88 design-of-experiment 403.D dimensional 116 
1923 Diophantine 296.A experimental 385.A factor 280.G forward 138.C function 20 functional 162 functional, nonlinear 286 harmonic (on locally compact Abelian groups) 192.G intrablock 102.D microlocal 274.A 345.A multivariate 280 of a network 282.C numerical 300 principal component 280.F regression 403.D spectral 390.A analysis of variance 400.H 403.D multivariate 280.B table 400.H analytic (function) 21.B,C 198.A,H (predicate) 356.H complex, structure 72.A micro- (hyperfunction) 125.CC pseudo- (function) 352.B quasi- (function) 352.B quasi- (in the generalized sense) 58.F real 106.K 198.H analytical dynamics 271.F analytically continuable 198.1 analytically hypoelliptic 112.D 323.1 analytically independent elements 370.A analytically normal local ring 284.D analytically thin set 23.D analytically uniform spaces 125.S analytically unramified semilocal ring 284.D analytic almost periodic function 18.D analytic automorphism 21.1 analytic capacity 169.F analytic continuation 198.G along a curve 198.1 direct 198.G uniqueness theorem of 198.1 in the wider sense 198.0 analytic covering space 23.E analytic curve in an analytic manifold 93.B in a Euclidean plane 93.B analytic differential (on a Riemann surface) 367.H analytic fiber bundle complex 147.0 real 147.0 analytic functions 198.A,H complex 198.H inverse 198. L many-valued 198.J multiple-valued 198.J n-valued 198.J real 106.K 198.H in the sense of Weierstrass 198.1 of several complex variables 21.B,C in the wider sense 198.0 analytic geometry 181 analytic hierarchy 356.H analytic homomorphism (between Lie groups) 249.N analytic index of a elliptic complex 237.H of an elliptic differential operator 237.H Subject Index Angle analytic isomorphism 21.1 between Lie groups 249.N analyticity, set of 192.N analytic manifold complex 72.A real 105.C analytic mapping 21.1 analytic measurable space 270,C analytic neighborhood of a function element in the wider sense 198.0 of a Riemann surface 367.A analytic number theory 296.B analytic operation 22. B analytic operation function 37.K analytic perturbation 331.D analytic polyhedron 21.G analytic prolongation 198.G analytic relations, invariance theorem of 198.K analytic representation (of GL( V)) 60.D analytic set (in set theory) 22 (in the theory of analytic spaces) 23.B co- 22.A complementary (in set theory) 22.A germ of 23.B irreducible (at a point) 23.B principal 23.B purely d-dimensional 23.B purely d-dimensional (at a point) 23.B analytic sheaf 72.E coherent 72.E analytic spaces 23 Banach 23.G C- 23.E general 23.G K-complete 23.F normal 23.D in the sense of Behnke-Stein 23.E analytic structure complex 72.A real 105.D on a Riemann surface 367.A analytic submanifold, complex 72.A analytic subset (of a complex manifold) 72.E analytic subspace 23.C,G analytic torsion 391.M analytic vector (with respect to a unitary representa tion of a Lie group) 437.S analytic wave front set 274.D analyzer differential 19.E harmonic 19.E anchor ring 410.B ancient mathematics 24 ancillary statistic 396.H 401.C Anger function 39.G, App. A, Table 19.1V angle 139.D 155.B (of a geodesic triangle) 178.A,H (of hyper spheres) 76.A (of a spherical triangle) 432.B acute 139.D alternate 139.D corresponding 139.D eccentric (of a point on a hyperbola) 78.E eccentric (of a point on an ellipse) 78.D Euler 90.C general 139.D non-Euclidean (in a Klein model) 285.C obtuse 139.D 
Subject Index Angular derivative (of a holomorphic function) regular polyhedral 357.B right 139 D straight 139.D straightening of l14.F supplementary l39.D trisection of 179.A vertical l39.D angular derivative (of a holomorphic function) 43.K angular domain 333.A angular frequency (of a sine wave) 446 angular momentum 258.D 271.E integrals of 420.A intrinsic 4l5.G orbital 3l5.E theorem of 271.E angular momentum density l50.B angular transformation 374.D anharmonIc ratio 343.D anisotropic (quadratic form) l3.G k- (algebraic grouP) l3.G annihilation operator 377.A annihilator 422.D left 29.H reciprocity of (in topological Abelian grouPS) 422.E right 29.H annual aberration 392 annual parallax 392 annuity contI act 214.B annular domain 333.A annulator 422.D annulus conjecture 65.C anomaly eccentric 309.B mean 309.B true 309.B Ano;,ov diffeomorphism l26.J 136.G Anosov flow 126.J l36.G Anosov foliations l26.J Anosov vector field l26.J ANR (absolute neighborhood retract) 202.D,E antiautomorphism (of a group) 190.D (of a ring) 368.D principal (of a Clifford algebra) 61.B antiendornorphism (of a group) 190.D (of a ring) 368.D antiequiva1ence (between categories) 52.H anti-Hermitian form 256.Q anti-Hermitian matrix 269.1 antiholomorphic 195.B 275.B antihomomorphism of groups 190.D of lattices 243.C of rings 368. D anti-isomorphic lattices 243.C anti-isomorphism of groups 190.D of lattices 243.C of ordered sets 3l1.E of rings 368.D antinomy 319.A antiparticle l32.A 386. B antipodal points (on a sphere) 140 antipode 203.H anti-self-dual (G-connection) 80.Q 1924 antisymmetric (Fock space) 377.A (multilinear form) 256.H (multilinear mapping) 256.H (relation) 358.A (tensor) 256.N law 3l1.A matrix 269.B antisymmetry, set of l64.E Antoine's necklace 65.G apartment 13.R aperiodic l36.E 260.B Apollonius problem (in geometric construction) l79.A a posteriori distribution 388.B a posteriori probability 342.F a pparen t fo rce 271.D apparent singular point 254.L Appell hypergeometric functions of two variables 206.D, App. A, Table 18.1 application 31.B approach Bayesian 401.B group-theoretic 2l5.C non-Bayesian 40l.B S-matrix 132.C state-space 86.A approximate derivative (of a measurabk function) 100.B approximate functional equation (for zeta function) 450.B approximately derivable (measurable function) 100.B approximately finite (von Neumann algebra) 308.1 approximately finite algebra 36.H approximately finite-dimensional 308.1 approximation(s) best (of a continuous function) 3.',6.B best (in evaluation of functions) 142.B best (of an irrational number) 83.B best polynomial, in the sense of Chebyshev 336.H Diophantine l82.F full discrete 304.B least square 336.D of linear type l42.B method of successive (for an elliptic partial dif- ferential eq uation) 323.D method of successive (for Fredholm integral equations of the second kind) 2] 7.D method of successive (for ordinary differential equations) 316.D nth (of a differentiable function) 106.E Oseen 205.C overall, formula 303.C Pade l42.E Pauli 4l5.G Prandtl-Glauert 205.B polynomial. 336 semidiscrete 304. B simplicial (to a continuous mapping) 70.C Stokes 205.C Wilson-Hilferty 374.F Y osida 286,X approximation method in physics 25 projective 304.B approximation property (of a Banach space) 37.L 
1925 bounded 37.L approximation theorem (on functions on a compact group) 69.B (on valuations) 439.G cellular 70.D Eichler's 27.D Kronecker's 422.K polynomial (for C"-functions) 58.E simplicial 70.C Weierstrass 336.A a priori distribution 388.B least favorable 388.H a priori estimate 323.C in L 2 sense 323.H a priori probability 342.F AR (absolute retract) 202.D Arabic numerals 26 Arab mathematics 26 Araki axioms, Haag- 150.E Araki-Sewell inequality, Roepstorff- 402.G arbitrary constant 313.A arbitrary set 381.G arc(s) 93.B 186.B continuous 93.B Farey 4.B geodesic 178.H 364.B joined by an 79.B Jordan 81.D 93 B major 4.B minor 4.B open 93.B pseudo- 79.B simple 93.B Arccos I3I.E arc cos (arc cosine) 131.E arc element affine IIO.C conformal IIO.D Archimedean lattice-ordered group 243.G Archimedean ordered field l49.N Archimedean unit (of a vector lattice) 310.B Archimedean valuation 14.F 439.C Archimedean vector lattice 310.C Archimedes axiom in geometry 155.B for real numbers 35S.B Archimedes spiral 93.H arc length lll.D affine 110.C representation in terms of (for a continuous arc) 246.A Arcsin I3I.E arcsin (arcsine) 131.E arc sine law for Brownian motion 45.E for distribution function 250.D for random walk 260.E arcsine transformation 374.D Arc tan 131. E arc tan (arc tangent) I31.E arcwise connected component 79.B arcwise connected space, locally 79.B area 246 (Euclidean) 139.F (of a polygon) IS5.F (of a set in R2) 216.F Banach (of a surface) 246.G of concentration 397 E definite, set of 216.F Geocze (of a surface) 246.E Subject Index Ascending chain Gross (of a Borel set) 246.G inner 216.F 270.G Janzen (of a Borel set) 246.G Lebesgue (of a surface) 246.C mixed (of two ovals) 89.D outer 216.F 270.G Peano (of a surface) 246.F surface, of unit hypersphere App. A, Table 9.V areal element (in a Cartan space) 152.C areal functional 334.B areally mean p-valent 438.E area theorem 438.B Bers 234.D Arens-Royden theorem 36.M Arens theorem, Mackey- 424.N Arf-Kervaire invariant 114.J Argand plane, Gauss- 74.C argument (of a complex number) 74.C behind-the-moon 3SI.K argument function 46.A argument principle 198.F arithmetical (predicate) 356.H arithmetical hierarchy 356.H of degree, of recursive unsolvability 356.H arithmetically equivalent (lattices) 92.B (pairs) 92.B (structures) 276.D arithmetic crystal classes 92.B arithmetic function 295.A arithmetic genus (of an algebraic curve) 9.F (of an algebraic surface) 15.C (of a complete variety) 16.E (of a divisor) 15.C virtual (of a divisor) 16.E arithmetic mean 2ll.C 397.C arithmetic of associative algebras 27 arithmetico-geometric mean 134.B arithmetic operations 294.A arithmetic progression 379.1, App. A, Table 10.1 prime number theorem for 123.D arithmetic subgroup l3.P 122.F,G arithmetic unit 7S.B arithmetization (of metamathematics) 185.C array 96.C balanced 102.L k- 330 orthogonal 102.L arrow diagram 281.D Arrow-Hurwicz-Uzawa gradient method 292.E artificial variables 255.C Artin, E. 28 Artin conjecture 450.G Artin general law of reciprocity 59.C Artin-Hasse function 257.H Artinian module 277.1 Artinian ring 284.A left 368.F right 368.F Artin L-function 450.G,R Artin-Rees lemma 284.A Artin-Schreier extension (of a field) 172.F Artin symbol 14.K Arzela theorem, Ascoli- 168.B 435.D ascending central series (of a Lie algebra) 248.C ascending chain in an ordered set 311.C of subgroups of a group 190.F 
Subject Index Ascending chain condition ascending chain condition in an ordered set 311.C for subgroups of a group 190 F Ascoli-Arzeh't theorem 168.B 435.D Ascoli theorem 435.0 a.s. consistent 399.K assembler 75.C associate (of an element of a ring) 67.H associated convergence radii 21.B associated diagrams (in irreducible representations of orthogonal groups) 6O.J associated differential equation, Legendre's 393.A associated factor sets (of crossed products) 29.F associated factor sets (for extension of groups) 190.N associated fiber bundle 147.D associated flow 136.F a"uciated form (of a projective variety) 16.S associated graded ring 284.D associated integral equation (of a homogeneous) integral equation) 217.F associated Laguerre polynomials 317.D associated Legendre functions 393.C, App. A, Table 18.m associated prime ideal 67.F associated principal bundle 147.D association, measure of 397.K association algebra 102.J association matrix 102.J associative, homotopy 203.D associative algebra(s) 102.J 231.A power 231.A associative law for the addition and multiplication of natural numbers 294.B in the algebra of sets 381.B for cardinal numbers 49.C for the composite of correspondences 358.B general (for group composition) 190.C for group composition 190.A in a lattice 243.A in a ring 368.A associative multiplication of a graded algebra 203.B assumed rate of interest 214.A assumption inverse 304.D Stokes 205.C asteroid 93.H astronomy, spherical 392 asymmetric (factorial experiment) 102.H asymmetric Cauchy process 5.F asymptote (of an infinite branch) 93.G asymptotically developable (function) 30.A asymptotically distributed 374.D asymptotically efficient estimator 399.N first-order 399.0 kth-order 399.0 asymptotically mean unbiased 399.K asymptotically median unbiased estimator (AMU) kth-order 399.0 asymptotically normal estimator best (BAN) 399.K consistent and (CAN) 399.K asymptotically normally distributed 399.K asymptotically optimal 354.D asymptotically stable 126.F 286.S 394.B globally 126.F uniformly 163.G asymptotically unbiased 399.K 1926 asymptotic bias 399.K asymptotic completeness asymptotic concentration asymptotic condition, LSZ asymptotic cone 350.B asymptotic convergence 168.B asymptotic covariance matrix 399.K asymptotic curve 11O.B III.H asymptotic direction 111.H asymptotic distribution, kth-order 399.0 asymptotic efficiency 399.N second-order 399.0 higher-order 399.0 asymptotic expansion 30.A, App. A, Table 17.1 (of a pseudodifferential operator) 345.A method of matched 112.B Minkshisundaram-Pleijel 391.B asymptotic fields 150.D asymptotic freedom 361.B asymptotic method 290.D asymptotic normality 399.K asymptotic path (for a meromorphic function) 272.H asymptotic perturbation theory 33\.D asymptotic power series 30A asymptotic property (of solutions of a system of linear ordinary differential equations) 314.A asymptotic ratio set 308.1 asymptotic ray 178.F asymptotic representation Debye 39.D, App. A, Table 19.m Hankel App. A, Table 19.m asymptotic sequence 30.A asymptotic series 30.A asymptotic set 62.A asymptotic solution 325.M asymptotic tangent IIO.B asymptotic value of a meromorphic function 62.A 272.H asymptotic valaue theorem, LindelOf 43.C asynchronous system (of circuits) 75.B Atiyah-Bott fixed point theorem 153.C Atiyah-Singer fixed point theorem 153.C Atiyah-Singer index theorem 237.H equivariant 237.H atlas 105.C of class C' 105.D of class C W 105.D oriented 105.F atled (nabla) 442.D atmospheric refraction 392 at most (for cardinal numbers) 49.B atomic (measurable set) 270.D at 0 163.H atomic element (in a complemented modular lat- tice) 243.F atomic formula 276.A 41 \.D atom less 398.C at random 401.F attaching a handle 114.F attaching space 202.E attraction, domain of 374.G attractor 126.F strange 126.N attribute, sampling inspection by 404 C augmentation (of an algebra) 2oo.M (of a chain complex) 2oo.C (of a coalgebra) 203.F 150.D 399.N 150.D 
1927 (of a cochain complex) 200.F (of a complex in an Abelian category) 200.H augmented algebra 200.M augmented chain complex 2OO.C autocorrelation 421.B au tocorrelation coefficient 397. N autocovariance, sample 421.B automata 31 automatic integration scheme 299.C automaton 31.A deterministic linear bounded 31.D finite 31.D nondeterministic linear bounded 31.D push-down 31.D automorphic form 450.0 of dimension -k 32.B oftypeU 437DD of weight m 32.A of weight k 32.B automorphic function(s) 32 multiplicative 32.A with respect to f 32.A automorphism (of an algebraic system) 409.C (of a field) 149.B (of a group) 190.D (of an object in a category) 52.D (of a polarized Abelian variety) 3.G (of a probability space) 136.E (of a ring) 368.D analytic 21.1 anti- (of a group) 190.D anti- (of a ring) 368.D differential 113 Frobenius (of a prime ideal) 14.K group of (of a group) 190.D holomorphic 21.1 inner (of a group) 190.D inner (of a ring) 368.D inner, group of (of a group) 190.D inner, group of (of a Lie algebra) 248.H involutive (of a Lie group) 412.B k-fold mixing 136.E Kolmogorov 136.E metrically isomorphic (on a measure space) 136.E modular 308.H outer, group of (of a group) 190.D outer, group of (of a Lie algebra) 248.H principal (of a Clifford algebra} 61.B shift 126.1 spatially isomorphic (on a measure space) 136.E spectrally isomorphic (on a measure space) 136.E strongly mixing 136.E weakly isomorphic 136.E weakly mixing 136.E automorphism group (of a Lie algebra) 248.A automorphy, factor of 32.A autonomous 163.D 290.A autoregressive Gaussian source 213.E autoregressive integrated moving average process 421.G autoregressive moving average process 421.D autoregressive process 421.D auxiliary circle 78.D auxiliary equation, Charpit 320.D auxiliary units 414.A Subject Index Axiom of the power set auxiliary variable 373.C average 211.C moving 397.N moving. process 421.D phase 402.C weighted moving 397.N average complexity 71.A average outgoing quality level 404.C average sample number 404.C averaging, method of 290.D A W*-algebra 36.H axial-vector currents, partially conserved 132.C axial visibility manifold 178.F axiom(s) 35.A 411.I Archimedes (in geometry) 15S.B Archimedes (for real numbers) 355.B congruence (in geometry) 155.B Eilenberg-Steenrod 201.Q Euclid 139.A first countability 42S.P the first separation 425.Q the fourth separation 425.Q Frechet 425.Q Haag-Araki 150.E Haag-Keslev 150.E Hausdorff 42S.Q Kolmogorov 42S.Q logical 337.C 41 \.I Martin 33.F mathematical 337.C 41l.I Osterwalder-Schrader ISO.F Pasch 155.B second countability 425.P the second separation 425.Q system of 35.B the. third separation 425.Q Tietze's first 425.Q Tietze's second 425.Q Tikhonov's separation 42S.Q Vietoris 425.Q Wightman 150.D axiom A diffeomorphism 126.1 axiom A flow 126.J axiomatic quantum field theory 150.D axiomatic set theory 36 lS6.E axiom A vector field 126.J axiomatization 35.A axiomatize (by specifying a system of axioms) 35.B axiom of choice 33.B 34.A and continuum hypothesis, consistency of 33.D and continuum hypothesis, independence of 33.D axiom of comprehension 33.B 381.G axiom of constructibility 33.D axiom of determinacy 22.H axiom of determinateness 33.F axiom of e-induction 33.B axiom of extensionality 33.B aXIOm of foundation 33.B axiom of free mobility (in Euclidean geometry) 139.B axiom of infinity 33.B 381.G axiom of linear completeness (in geometry) IS5.B axiom of mathematical induction 294.B axiom of pairing 381.G axiom of parallels (in Euclidean geometry) 139.A lSS.B axiom of the power set 33.B 381.G 
Subject Index Axiom of reducibility (in symbolic logic) axiom of reducibility (in symbolic logic) 156 B 411 K axiom of regularity 33.B axiom of replacement 33 B 381.G axiom of separation 33.B axiom of strong infinity 33.E axiom of subsets 33.B 381.G axiom of substitution 381.G axiom of the empty set 33.B axiom of the sum i>et 33 B axiom of the unordered pair 33.B axiom of union 381 G axioms of continuity Oedekind's 355 A axiom system(s) 35 of a structure 409.B of a theory 411.1 axis (axes) of a circular cone 78.A conjugate (of a hyperbola) 78.C of convergence 240.B coordinate (of an affine frame) 7.C coordinate (of a Euclidean space) 140 imaginary 74 C major (of an ellipse) 78.C minor (of an ellipse) 78.C optical 180.B of a parabola 78.C principal (of a central conic) 78.C principal (of a parabola) 78.C principal (of a quadric surface) 350.B principal, of inertia 271.E principal, transformation to 390.B real 74.C of rotation (of a surface of revolution) 111.1 transverse (of a hyperbola) 78.C X i - (of a Euclidean space) 140 Ax-Kochen isomorphism theorem 276.E azimuth App. A, Table 3.V azimuthal quantum number 315.E Azumaya algebra 29.K Azumaya lemma, Krull- 67.D B fi  beta !d(Q) (=.S1[,,(Q)) 168.B (the space of hyper functions) 125.V B.q (Besov spaces) 168.B fi-KMS state 402.G fi-shadowed 126J {i-traced 126J 'B-measurable function 270J 'B-measurable set 270.C 'B-regular measurc 270.F 'B-summable series 379.0 I'BI-summable series 379.0 b-function 125.EE 418.H BN-pair 13.R 343.1 (B, N)-pair 151.J Bn sct 22 D B-complete (locally convex space) 424.x BA 102.L back substitution 302.B backward analysis 138.C backward difference 223.C, App. A. Table 21 backward emission 320.A backward cquation, Kolmogorov 115.A 260.F backward error analysis 302.B 1928 backward interpolation formula Gauss 223.C Newton 223.C backward moving average representation 395.D canonical 395.D backward type 304.D,F badly approximable 83.B Baer sum (of extensions) 200.K Bahadur efficiency 400.K Baire condition 425.L Baire function 84.D Baire-Hausdorff theorem 273.1 425.N Baire measurable 270.L Baire property 425.L Lebesgue measurability and 33.F Haire set 126.H 210.C Baire space 425.L Baire zero-dimensional space 273.B Bairstow method 301.E balanced array 102.L balanced fractional factorial design 102.1 balanced incomplete block design 102.E partially 102J balanced mapping, A- 277J balayage 338.L balayage principle 338.L ball 140 n- 140 open 140 open n- 140 spin 351.L unit 140 unit (of a Banach space) 37.B ball knot, (p, q)- 235.G ball pair 235.G BAN (best asymptotically normal) 399.K Banach-Alaoglu theorem (in a Banach space) 37.E (in a topological linear space) 424.H Banach algebra(s) 36.A Banach analytic space 23.G Banach area (of a surface) 246.G Banach (extension) theorem, Hahn- (in a normed space) 37.F (in a topological linear space) 424.C Banach integral 310.1 Banach lattice 310.F Banach Lie group 286.K Banach limit 37.F Banach manifold 105.Z Banach space(s) 37.A,B reflexive 37.G regular 37.G Banach star algebra 36.F Banach-Steinhaus theorem (in a Banach space) 37.H (in a topological linear space) 424.J Banach theorem 37.1 band, Mobius 410.B bang-bang control 405.C Barankin theorem 399.D bar construction (of an Eilenberg-MacLane com- plex) 70.F bargaining set 173.D bargaining solution, Nash 173.C Barnes extended hypergeometric function 206.C, App. A, Table 18.1 barrel (Ill a locally convex space) 424.1 barreled (locally convex space) 424.1 quasi- 424.1 
1929 barrier 120.D absorbing 1IS.B reflecting J J S.B,C BartJe- Dunford-Schwartz integraJ 443.G barycenter (of points of an affine space) 7.C (of a rigid body) 271.E barycentric coordinates (in an affine space) 7.C 90.B (in a EucJidean complex) 70.B (in the polyhedron of a simplicial complex) 70.C barycentric derived neighborhood, second 6S.C barycentric refinement 425.R barycentric subdivision (of a Euclidean compJex) 70.B (of a simpJicial compJex) 70.C baryons 132.B base (in a Banach space) 37.L (curve of a roulette) 93.H (of a Jogarithmic function) 131.B (of a point range) 343.B (of a poJymatroid) 66.F,G data 96.B filter 87.1 10caJ 425.E for the neighborhood system 42S.E normaJ 172.E open 42S.F for the space 42S.E for the topoJogy 42S.F for the uniformity 436.B base functions 304.B base point of a Jinear system 16.N ofaloop 170 of a topoJogical space 202 B base space of a fiber bundJe 147.B of a fiber space 148.B of a Riemann surface 367.A base term (of a spectral sequence) 200.1 base units 414.A Bashforth method, Adams- 303.E basic components (of an m-dimensional surface) 110.A basic concept (of a structure) 409.B basic equation 320 E basic feasibJe solution 2SS.A basic fieJd (of linear space) 2S6.A basic form 2SS.A basic interval 4.B basIc invariant 226.B basic limit theorem 260.C basic open set 425.F basic optimal wJution 255.A basic property (of a structure) 409.B basic ring (of a moduJe) 277.D basic set (for an Axiom A flow) 126J basic set (of a structure) 409.B basic soJution 25S.A feasibJe 2SS.A optimal 25S A basic space (of a probabi1ity space) 342.B basic surface (of a covering surface) 367.B basic variable 2SS.A basic vector field 80.H basic Z/-extension l4.L Subject Index Bernoulli basin 126.F basis (of an AbeJian group) 2.B (in a Banach space) 37.L (of a homogeneous lattice) 182.B (of an ideaJ) 67.B (of a linear space) 256.E (of a moduJe) 277.G canonical 201.B canonical homoJogy II.C ChevaHey canonicaJ 248.Q duaJ 256.G minimal 14.B normaJ 172.E of order,. in N 4.A orthonormaJ 197.C Schauder 37.L strongJy distinguished 418.F transcendence 149.K Weyl canonicaJ 248.P basis theorem HiJbert (on Noetherian rings) 284.A Ritt (on differentiaJ poJynomials) 113 bath, heat 419.B Bayes estimator 399.G Bayes formuJa 342.F 405.1 Bayesian approach 401.B Bayesian modeJ 403.G Bayes risk 398.B Bayes soJution 398.B generalized 398. B in the wider sense 398. B Bayes sufficient a-fieJd 396.1 BCH (Base-Chaudhuri-Hooquenghem) code 63.D BDI, type (symmetric Riemannian spaces) 412.G BDII, type (symmetric Riemannian spaces) 412.G BDH (Brown-DougJas-FiJlmore) theory 36J 390J behavior, Regge 386.C behavior strategy 173.B behind-the-moon argument 351.K Behnke-Stein, anaJytic space in the sense of 23.E Behnke-Stein theorem 21.H Behrens-Fisher problem 400.G BeUman equation 40S.B BeUman function 127.G BeUman principJe 40S.B BeU inequality 351.L BeU number 177.D beJong (to a set) 381.A to the lower class with respect to Jocal con- tinuity 4S.F to the Jower cJass with respect to uniform con- inuity 4S.F to the upper class with respect to 10caJ con- tinuity 45.F to the upper cJass with respect to uniform con- tinuity 45.F BeJtrami differential equation 3S2.B Belt rami differentiaJ operator of the first kind App. A, TabJe 4.II of the second kind App. A, TabJe 4.II BeJtrami operator, LapJace- 194.B Bergman kerneJ function 188.G Bergman metric 188.G Bernays-Godel set theory 33.A BernouUi JooseJy 136.F monotoneJy very weak 136.F 
Subject Index Bernoulli differential equation Bernoulli differential equation App. A, Table 14.1 Bernoulli family 38 Bernoulli lemniscate 93.H Bernoulli method 301.J Bernoulli number 177.B Bernoulli polynomial 177. B Bernoulli process 136.E very weak 136.E weak 136.E Bernoulli sample 396.B Bernoulli shift 136.D generalized 136.D Bernoulli spiral 93.H Bernoulli theorem 20S.B Bernoulli trials, sequence of 396. B Bernshtein inequality (for trigonometric polyno- mials) 336.C Bernshtein polynomial 366.A 418.H Sato- 125.EE Bernshtein problem 27S.F generalized 275.F Bernshtein theorem (on cardinal numbers) 49.B (on the Laplace transform) 240.E (on minimal surfaces) 275.F Bers area theorem 234.D Bertini theorems IS.C Bertrand conjecture 123.A Bertrand curve 111.F Besov embedding theorem, Sobolev- 168.B Besov space 168.B Bessaga-Pelczynski theorem 443.D Bessel differential equation 39.B, App. A, Table 14.11 Bessel formula, Hansen- App. A, Table 19.111 Bessel function(s) 39, App. A, Table 19.m half 39.B modified 39.G spherical 39.B Bessel inequality 197.C Bessel integral 39.B Bessel interpolation formula App. A, Table 21 Bessel series, Fourier- 39.D Bessel transform, Fourier- 39.D best (statistical decision function) 398.B best approximation (of a continuous function) 336.B (in evaluation of functions) 142.B (of an irrational number) 83.B in the sense of Chebyshev 336.H best asymptoticaIly normal estimator 399.K best invariant estimator 399.1 best linear unbiased estimator (b.l.u.e.) 403.E best polynomial approximation (in the sense of Chebyshev) 336.H beta density 397.D beta distribution 341.D, App. A, Table 22 beta function 174.C, App. A, Table 17.1 incomplete 174.C, App. A, Table 17.1 beta-model, Luce 346.G better, uniformly (statistical decision function) 398.B Betti group (of a complex) 201.B Betti number of a commutative Noetherian ring 200.K ofa complex 201.B between (two points in an ordered set) 311.B between-group variance 397.L Beurling generalized distribution 125. U Beurling-Kunugui, theorem, Iversen- 62.B 1930 Bezout theorem 12.B BG (= Bernays-Godel set theory) 33.A Bhattacharyya inequality 399.D biadditive mapping 277.1 bialgebra 203.G quotient 203.G semigroup 203.G universal enveloping 203.G bialgebra homomorphism 203.G Bianchi identities 80.J 417.B, App. A, Table 4.11 bias 399.C biaxial spherical harmonics 393.D BIBD (balanced incomplete block design) 102.E bicharacteristic curve 325.A bicharacteristic strip 320.B bicompact 425.S bicomplex 200.H Bieberbach conjecture 438.C Biehler equality, Jacobi- 328 biequicontinuous convergence, topology of 424.R bifurcation, Hopf 126.M bifurcation equation 286.V bifurcation method 290.D bifurcation point (in bifurcation theory) 126.M 286.R (in nonlinear integral equations) 217.M bifurcation set S1.F 418.F bifurcation theorem, Hopf 286.U bifurcation theory 286.R biharmonic (function) 193.0 biholomorphic mapping 21.J biideal 203.G bijection (in a category) 52.D (of sets) 381.C bijective mapping 381.C bilateral network 382.C bilinear form (on linear spaces) 256.H (on modules) 277.J (on topological linear spaces) 424.G associated with a quadratic form 256.H matrix of 2S6.H nondegenerate 256.H symmetric (associated with a quadratic form) 348.A bilinear functional 424.G integral 424.R bilinear Hamiltonian 377.A bilinear mapping (of a linear space) 256.H (of a module) 277.1 canonical (on tensor products of linear spaces) 256.1 bilinear programming 264.D bilinear relations, Hodge-Riemann 16.V bimatrix game 173.C bimeasurable transformation 136.B bimodular germ (of an analytic functio'l) 418.E bimodule 277.D A-B- 2nD binary quadratic form(s) 348.M primitive 348.M properly equivalent 348.M binary relation 358.A 411.G binding energy 351.D Binet formula (on Fibonacci sequence) 295.A (on gamma function) 174.A binomial coefficient 330, App. A, Table 17.11 
1931 binomial coefficient series 121.E binomial distribution 341.D 397.F, App. A, Table 22 negative 341.D 397.F, App. A, Table 22 binomial equation IO.C binomial probability paper 19.B binomial series App. A, Table 10.lV binomial theorem 330, App. A, Table 17.11 binormal II1.F affine 110.C bioassay 4O.C biology, mathematical models in 263 biometrics 40 bipartite graph 186.C complete 186.C bipolar (relative to a pairing) 424.H bipolar coordinates 90.C bipolar cylindrical coordinates App. A, Table 3.V bipolar theorem 37.F 424.H biprojective space 343.H biquadratic equation App. A, Table 1 birational correspondence 16.1 birational invariant 12.A birational isomorphism between Abelian varieties 3.C between algcbraic groups 13.A birational mapping 16.1 birational transformation 16.1 Birch-Swinnerton-Dyer conjecture I 18.D 450.S biregular mapping (between prealgebraic varieties) 16.C Birkhoff integrable (function) 443.E Birkhoff integral 443.E Birkhofffixed-point theorem, Poincare- 153.B Birkhoff-Witt theorem, Poincare- (on Lie algebras) 248.1 Birnbaum theorem 399.C birth and death process 260.G birth process 260.G birth rate, infinitesimal 260.G bispectral density function 421.C bispinor of rank (k, n) 258.B bit 75.B 213.B check 63.C information 63.C bivariate data 397.H bivariate distribution 397.H bivariate moments 397.H bivariate normal density 397.1 Blackwell-Rao theorem 399.C Blakers-Massey theorem 202.M Blaschke manifold 178.G at a point p 178.G Blaschke product 43.F Blaschke sequence 43.F Bleuler formalism, Gupta- 105.G Bloch constant 77.F schlicht nF Bloch theorem 77.F block (bundle) 147.Q (of irreducible modular representations) 362.1 (of a permutation group) 151.H (of plots) 102. B complete 102.B incomplete 102.B initial 102.E block bundle 147.Q normal 147.Q q- 147.Q Subject Index Borel set(s) block code 63.A 213.F sliding 213.E block design 102. B balanced incomplete 102.E efficiency-balanced 102.E optimal 102.E randomized 102.B variance-balanced 102.E block effect 102. B b]ock size 102.B block structure, q- 147.Q blowing up (of an analytic space) 23.D (of a complex manifold) n.H (by an ideal sheaf) 16.K b.l.u.e (best linear unbiased estimator) 403.E Blumenthal zero-one law 261.B BMO (bounded mean oscillation) 168.B Bochner integrable 443.C Bochner integral 443.C Bochner theorem 36.L 192.B body bounded star 182.C rigid 27l.E body forces 271.G Bogolyubov inequality, Peierls- 212.B Bohr, almost periodic function in the sense of 18.B Bohr compactification 18.H Bokshtein homomorphism 64.B Bokshtein operation 64.B Boltzmann constant 402.B Boltzmann distribution law, Maxwell- 402.B Boltzmann equation 41.A 402.B Bolzano-Weierstrass theorem 140 273.F bond percolation process 340.D Bonnet formula, Gauss- 111.H 364.D, App. A, Table 4.1 Bonnet fundamental theorem 111.H Bonnet-Sasaki- Nitsche formula, Gauss- 275.C Boolean algebra 243.E generalized 42.B Boolean lattice 42.A 243.E of sets 243.E Boolean operations 42.A Boolean ring 42.C generalized 42.C Boolean space 42.D Boolean-valued set theory 33.E Borcher theorem 150.E bord (for a G-manifold) 431.E bordant 431.E border set 425.N Borel-CantelIi lemma 342.B Borel direction (of a meromorphic function) 272.F Borel embedding, generalized 384.D Borel exceptional value 272.E Borel exponential method, summable by 379.0 Borel field 270.B,C Borel integral method, summable by 379.0 Borel isomorphic 270.C Borel-Lebesgue theorem 273.H mapping 270.C Borel measurable function 270.1 Borel measure 270.G Borel method of summation 379.0 Borel set(s) (in a Euclidean space) 270.C (in the strict sense) 270.C (in a topological space) 270.C 
Subject Index Borel space nearly 261.D Borel space 270.C standard 270.C Borel subalgebra (of a semisimple Lie algebra) 248.0 Borel subgroup of an algebraic group 13.G k- (of an algebraic group) I3.G of a Lie group 249.J Borel su bset 270.C Borel summable, absolute 379.0 Borel theorem (on classifying spaces) App. A, Table 6.V (on meromorphic functions) 272.E Heine 273 F Borel-Weil theorem 437.Q bomological locally convex space 424.1 ultra- (locally convex space) 424.W Borsuk-Ulam theorem 153.B Bortolotti covariant derivative, van der Waerden- 417.E Bose particle 132.A Bose statistics 377.B 402.E boson 132.A 351.H Nambu-Goldstone 132.C Bott fixed point theorem, Atiyah- 153.C Bott generator 237.D Bott isomorphism 237.D Bott periodicity theorem on homotopy groups 202.V, App. A, Table 6.VIJ in K-theory 237.D bound Froissart 386.B greatest lower (of a su bset in an ordered set) 311.B greatest lower (of a subset of a vector lattice) 310.C Hamming 63.B least upper (of a subset in an ordered set) 311.B least upper (of a subset of a vector lattice) 310.C lower (of a subset in an ordered set) 311.B Plotkin 63.B upper (of a subset in an ordered set) 311.B Varshamov-Gilbert-Sacks 63.B boundary (boundaries) (of a convex cell) 7.D (cycle) 200.H (of a function alge bra) 164.C (of a manifold) 65.B 105.B (of a topological space) 425.N Choquet (for a function algebra) 164.C closed (for a function algebra) 164.C C'-manifold with 105.E C-manifold without 105.E differential manifold with, of class C 105.E domain with regular 105.U domain with smooth 105.U dual Martin 260.1 entrace (of a diffusion process) 115.B exit (ofa diffusion process) 115.B harmonic 207.B ideal 207.A Martin 207.C 260.1 module of 200.C natural (of an analytic function) 198.N natural (of a diffusion process) 115.B 1932 Newton, of f in the coordinate 418.D nondegenerate Newton 418.D of null (open Riemann surface) 367.E pasting together 114.F of positive (open Riemann surface I 367.E regular (of a diffusion process) I [5.B relative 367.B Shilov (for a function algebra) 2].D 164.C Shilov (of a Siegel domain) 384.D surface with 41O.B topological manifold with 105.B topological manifold without 105.B boundary cluster set 62.A boundary condition 315.A 323.F adjoint 315.B operator with 112.F boundary element (in a simply connected domain) 333.B boundary function 160.E boundary group 234.B boundary homomorphism of homology exact sequence 201 L in homotopy exact sequences 202.L boundary layer 205.C boundary layer equation, Prandtl 205.C boundary operator 200.C between chain groups 201.B linear 315.B partial 200.E total 200.E boundary point dual passive 260.H entrace 260.H exit 260.H irregular 120.D passive 260.H regular 120.D of a subset 425.N boundary set 425.N boundary space 112.E boundary value (of a conformal mapping) 77.B (of a hyperfunction) 125.V (relative to a differential operator) 112.E boundary value problem 315 (for harmonic functions) 193.F (in numerical solution of ordinary differential equations) 303.H adjoint 315.B first (for elliptic differential equations) 323.C first (for harmonic functions) 193.F general 323.H homogeneous (of ordinary differential equa- tions) 315.B inhomogeneous (of ordinary differential equa- tions) 315.B of ordinary differential equations 315 second (elliptic differential equations) 323.F second (for harmonic functions) [93.F self-adjoint 315.B solution of App. A, Table 15.VI third (for elliptic differential equations) 323.F third (for harmonic functions) 193.F two-point (of ordinary differential equations) 315.A weak form of the (of partial differential equa- tions) 304.B bounded (in an affine space) 7.D (half-plane) 155.B 
1933 (metric spacc) 273 B (ordered set) 311.B (set in a topological linear space) 424.F (set of real numbers) 87.B (torsion group) 2.F (vector lattice) 3 ]O.B (vector measure) 443.G order 310. B relatively 331.B T- 311 B totally 273.B bounded, essentialJy (measurable fUnction) 168.B bounded approximation property (of a Banach space) 37.J bounded automaton, linear deterministic 31.D nondeterministic 31.D bounded domain divisible 284.F homogeneous 384.A 412 F irreducible symmetric 412.F sweepable 284.F symmetric 412.F bounded from above (in an ordered set) 311.B (for rcal numbers) 87.B bounded from below (for a filtration) 200.1 (in an ordered set) 311. B (for real numbers) 87 B (for a spectral sequence) 200.J bounded functions 43.A boundcd linear operator 37 C boundedly complete u-ficld 396.E bounded matrix 269.K boundcd mean oscillation (BMO) 168.B bounded metric space, totally 273 B bounded motion 420.D bounded /l-operator 356 B boundedness, abscissa of (of a Dirichlet series) 121.B boundedness principle, upper (in potential theory) 388.C boundedness theorem, uniform 37.H bounded quantifier 356.B bounded set in an affine space 7.D in a locally convex space 424.F in a metric space 273.B totally (in a metric space) 273.B bounded star body 182.C bounded uniform space locally totally 436.H totally 436.H bounded variation function of 166.B intcgrable process of 406.B mapping of 246.H in the scnse of Tonelli 246.C set function of 380.B vector measure of 443.G bound state 351.D bound variable 411.C bouquet 202.F Bouquet differcntial equation, Briot- 288.B 289.B Bouquet formulas (on spacc curves) II1.F Bourbaki, Frechet space in the sensc of 424.1 Boussinesq equation 387.F boxes 140 Subject Index Brouwer mapping theorem box topology 425 K brachistochrone 93.H brackct 105.M Lagrange 84.A 324.D Poisson (of two functions) 105.M Poisson (of two vector fields) 271.F 324.C.D Toda 202.R bracket product (in a Lie algebra) 248.A Bradley- Terry model 346.C braid(s) 235.F closed 235.F braid group 235.F branch (of an analytic function) 198.J (of a curve of class C k ) 93.G (of a graph) 282.A finite (of a curve of class C k ) 93.G infinite (of a CUrve of class Ckl 93.G branch and bound methods 215.D branch divisor (in a covering of a nonsingular curve) 9.1 branched minimal immersion 275.B branched minimal surface 275.B branching Markov process 44.E branching processes 44342.A age-dependent 44.E continuous state 44.E Galton- Watson 44. B Markov 44.D multi type Markov 44 E branch point (of a covering surface) 367.B (of an ordinary curve) 93.C algebraic (of a Riemann surface) 367.B fixed (of an algebraic differential equation) 288.A logarithmic (of a Riemann surface) 367.B movable (of an algebraic differential equation) 288.A branch source 282.C Brandt law 241.C Brauer character (of a modular representation) 362.1 Brauer group (of algebra classes) 29.E (of a commutative ring) 29.K Brauer theorem 450.G Bra vais class 92 B Bravais group 92.B Bra vais lattice 92. B Bravais type 92.B simple 92.E breadth (of an oval) 89.C Brelot solution, Perron- (of Dirichlet problem) 120.C Brelot solution, Perron- Wiener- (of Dirichlet prob- lem) 120.C Brianchon theorem on conic sections 78.K in projective geometry 343.E bridge, Brownian 250.F 374.E Brieskorn variety 4 I 8.D Brill-Noether number 9.E Briot-Bouquet differential equation 288.B 289.B broken line 155.F broken symmetry 132.C Bromwich integral 240.0 322.0, App. A, Table 12.1 Brouwer fixed-point theorem 153.B Brouwer mapping theorem 99.A 
Subject Index Brouwer theorem on the invariance of domain Brouwer theorem on the in variance of domain 117.D Browder-Livesay invariant 114.L Brownian bridge 250.F 374.E Brownian functional 176.1 Brownian motion 5.D 45 342.A d-dimensional 45.C {F,)- 45.B 406.B on Lie groups 406.G with an N-dimensional time parameter 45.1 Ornstein-Uhlenbeck 45.1 right invariant 406.G space-time 45.F Brown-Shield-Zeller theorem 43.C BRS transformation 150.G Bruck-Ryser-Chowla theorem 102.E Bruhat decomposition (of an algebraic group) I3.K relative I3.Q Brun theorems, Poincare- 420.A Brun-Titchmarsh theorem 123.D Bucy filter, Kalman- 86.E 405.G building J30.R 343.1 bulk viscosity, coefficients of 205.C bundle(s) canonical 147.F complex conjugate 147.F complex line 72.F complex line, detcrmined by a divisor 72.F conormal 274.E coordinate 147.B coordinate, equivalent 147.B cotangent 147.F cotangential sphere 274.E dual 147.F fiber 147.B fiber, associated 147.D fiber, of class C' 147.0 fiber, complex analytic 147.0 fiber,orientable 147.L fiber, real analytic 147.0 flat F- 154.B foliated 154 B,H frame, orthogonal 364.A frame, tangent orthogonal n- 364.A G- 147.B Hopf 147.E induced 147.G line 147.F Maslov 274.C normal 105.C l14.B 154.B,E 364.C 274.E normal block 147.Q normal k-vector I 14.J normal sphere 274.E n-sphere 147.K n-universal 147.G principal 147.C principal, associated 147.D principal fiber 147.C product 147.E q-block 147.Q quotient 16.Y 147.B reduced 147.J spin 237.F Spin' 237.F sub- 16.Y 147.F tangent 105.H 147.F 154.B 286.K tangential sphere 274.E tangent r-frame 108.H 147.F tautological line 16.E tensor 147.F 1934 trivial 147.E unit tangent sphere 126.L universal 147.G,H vector 16.Y 147.F vector, ample 16.Y vector, complex 147.F vector, cotangent 147.F vector, dual 147.F vector, indecomposable 16.Y vector, normal 105.L vector, quaternion 147.F vector, quotient 16.Y vector, semistable 16.Y vector, stable (on algebraic varietks) 16.Y vector, stable (on topological spaces) 237.B vector, stably equivalent 237.B vector, tangent 108.H 147.F bundle group (of a fiber bundle) 147.B bundle mapping (map) 147.B bundle of homomorphisms 147.F bundle of p-vectors 147.F bundle space (of a fiber bundle) 147.B Bunyakovskil inequality 21 LC, App. A, Table 8 Burali-Forti paradox 319.B Burnside conjecture 151.D Burnside pro blem 16 LC restricted 16 LC Burnside ring 431.F Burnside theorem 151.D burst error 63.E Bush-Mosteller model 346.G C ( (cardinal number of R) 49.A C1(Q) (the totality of I times continuously differenti- able functions in Q) 168.B Cb(Q) (the totality of functions in C/(Q) whose supports are compact subsets of Q) 168.B c (a sequence space) 168.B C (complex numbers) 74.A 294.A x-equivalent (closed on G-manifolds) 431.F -group 52.M -theory, Serre 202.N C-analytic hierarchy 356.H C-arithmetical hierarchy 356.H C-equivalent almost complex manifolds 114.H C,-field 118.F Ci(d)-field 118.F C n set 22.D {c n i-consistency, {c n i-consistent 399.K C*-algebra 36.G liminal 36.E postliminal 36.E of type I 308.L C*-cross norm 36.H C*-dynamical system 36.K C*-group algebra (of a locally compact Hausdorff space) 36.L C*-tensor product, projective 36.H (C,tX)-summation 379.M c,-bundle 237.F c,-mapping 237.G C-analytic space 23.E C-covering space 23.E C'-conjugacy, C'-conjugate 126.B C'-equivalence, C'-equivalent 126.B C'-flow 126.B C'-foliation 154.G C'-function in a Coo-manifold 105.G 
1935 C'-manifold 105.D with boundary 105.E without boundary 105.E compact 105.D paracompact - 105.D C'-mapping 105.1 C'-norm 126.H C'-structure 108.D subordinate to (for a C'-structure) 108.D on a topological manifold 114.B C'- Q-stable 126.H C'-structurally stable 126.H C'-triangulation 114.C CW-homomorphism (between Lie groups) 249.N CW-isomorphism (between Lie groups) 249.N Cac-function (of many variables) 58.B germ of (at the origin) 58.C preparation theorem for 58.C rapidly decreasing 168.B slowly increasing 125.0 COO-functions and quasi-analytic functions 58 Coo topology, weak 401.C CA set (in set theory) 22.A Caianiello differential equation 291.F calculable function, effective 356.C calculable number 22.G calculation, graphical 19.B calculator 75.A calcul us differen tial 106 fundamental theorem of the infinitesimal 216.C Heaviside 306.A holomorphic functional 36.M infinitesimal (in nonstandard analysis) 273.D Kirby 114.L operational 251.G 306, App. A, Table 12.11 predicate 411J predicate, with equality 411.1 propositional 411.F of residue 198.F stochastic 406.A tensor 4l7.A, App. A, Table 4.11 calculus of variations 46 classical theory of 46.C conditional problems in 46.A fundamental lemma in 46.B Calderon-Zygmund singular integral operator 217.J 251.0 Calderon-Zygmund type, kernel of 217.J Calkin algebra 36.J Callan-Symanzik equation 361.B Campbell-Hausdorff formula 249.R CAN estimator 399.K canceling 138.B cancellation law on the addition of natural numbers 294.B in a commutative semigroup 190.P on the multiplication of natural numbers 294.B canonical affine connection (on R n ) 80.J canonical anticommutation relation 277.A canonical backward moving average representation 395.D canonical basis (of a chain group of a finite simpli- cial complex) 20l.B Chevalley (of a complex semisimple Lie algebra) 248.Q Weyl (of a semisimple Lie algebra) 248.P canonical bilinear mapping (on tensor products of linear spaces) 256.1 Subject Index Canonical representation canonical bundle (of a differentiable manifold) 147.F canonical class (of an algebraic curve) 9.C canonical cohomology class (in Galois cohomology in class field theory) 59.H canonical commutation relations 351.C 377.A,C canonical coordinates (of a Lie group) of the first kind 249.Q of the second kind 249.Q canonical coordinate system (for a conic section) 78.C canonical correlation coefficient 280.E 374.C canonical decomposition (of a closed operator) 251.E canonical decomposition theorem 86.C canonical divisor (of an algebraic curve) 9.C (of an algebraic variety) 16.0 (of a Jacobian variety) 9.E (of a Riemann surface) II.D canonical element (in the representation of a functor) 52.L canonical ensemble 402.D grand 402.D canonical equation 324.E Hamilton 271.F canonical field 377.C canonical form (of F(M)) 191.A (of a linear hypothesis) 400.H (of a regular submanifold of F(M)) 191.A of the equation (of a quadric surface) 350.B Khinchin (of an infinitely divisible probability distribution) 341.G Kolmogorov (of an infinitely divisible pro- bability distribution) 341.G Levy (of an infinitely divisible probability distribution) 341.G Weierstrass (for an elliptic curve) 9.D Weierstrass (of the gamma function) 174.A canonical function (on a nonsingular curve) 9.E canonical homology basis 11.C canonical homomorphism (on direct products of rings) 368.E (on tensor products of algebras) 29.A canonical injection (from a direct summand) 381.E (in direct sums of modules) 277.F (in free products of groups) 190.M (from a subgroup) 190.D (from a subset) 381.C canonically bounded 200.J canonically polarized Jacobian variety 3.G 9.E canonical measure (in a birth and death chain) 260.G (in a diffusion process) 115.B (in a Markov chain) 260.1 canonical model 251.N canonical I-form (of the bundle of tangent n-frames) 80.H canonical parameter of an arc III.D local (for power series) 339.A canonical product, Weierstrass 429.B canonical projection (on direct products of modules) 277.F (onto a quotient set) l35.B canonical representation (of a Gaussian process) 176.E generalized 176.E 
Subject Index Canonical scale canonical scale 115 B canonical scores 397.M canonical surjection (on direct products of groups) 190.L (to a factor group) 190.D (onto a quotient set) 135.B canonical transformation 271 F (concerning contact transformations) 82.B group of 27l.F homogeneous 82.B canonical variables (in analytical dynamics) 271.F canonical variates 280 E canonical vectorial form 417.C Cantelli lemma, Borel- 342.B Cantelli theorem, Glivenko- 374.E Cantor, G 47 Cantor discontinuum 79.D Cantor intw,ection theorem 273.F Cantor-Lebesgue theorem 159.J Cantor normal form (for an ordinal number) 312.C Cantor set 79.D general 79.D Cantor's theory of real numbers 244.E capability, error-corrccting 63.B capacitable set 48.H capacitary dimension 48.G capacitary mass distribution 338.K capacitated network 28l.C ca paci ty (of discrete memeoryless channel) 213.F (of a prime Ideal) 27.A (of a set) 48 260.D (transportation and scheduling) 28l.D 7,- 169.C analytic 169.F continuous analytic 164 J ergodic 213.F logarithmic 48 B Newtonian 48.B Newtonian exterior 48 H Newtonian inncr 48.F Newtonian interior 48.F Ncwtonian outer 48.H of order 7, 169.C stationary 213 F capacity constraint 28l.D capillary wave 205.F cap product (of a cochain and a chain) 200.K (of a cohomology class and a homology class) 20l.K capture complete 420.D partial 420.D CAR 377.A Caratheodory measure 270 E Caratheodory outer measure 270.E Caratheodory pseudodistancc 2!.D Cardano formula (on a cubic equation) 10.D, App A, Table I cardinality 33.F (of an ordinal number) 49.E (of a set) 49 A 312.D cardinal number(s) 49.A 3 I 2 D comparability theorem for 49.B of continuum 49.A corresponding to an ordinal number 49.E finite 49.A infinite 49.A measurable 33.E 1936 of N 49.A of R 49.A of all real-valued functions on [0, 49.A of a set 49.A 312.D strongly compact 33.E strongly inaccessible 33.E transfinite 49.A weakly compact 33.E weakly inaccessible 33.E cardinal product (of a family of ordered sets) 3ll.F cardinal sum (of a family of ordered sets) 311.F cardioid 93.H Carleman condition, Denjoy- 168.B Carleman inequality App. A, Table 8 Carleman theorem (on asymptotic expansions) 30.A (on bounded functions) 43.F Carleman type, kernel of 217.J carrier (of a differential form) 108 Q (of a distribution) 125.D (of a function) 125.B 168.B (of a real-valued function) 425.R Carson integral App. A, Table 12.II Cartan, E. 50 Cartan, differential form of Maurer- 249.R Cartan, system of differential equations of Maurer- 249.R Cart an connection 80.M Cart an criterion of semisimplicity (on Lie algebras) 248.F Cartan criterion of solvability (on Lie algebras) 248.F Cartan formula for Steenrod pth power operations 64.B for Steenrod square operations M.B Cartan integer (of a semisimple Lie algebra) 248.N Cart an invariant (of a finite group) 362.1 Cartan involution 437.X Cartan-Kiihler existence theorem 428.E Cartan-Mal'tsev-Iwasawa theorem (on maximal compact subgroups) 249.S Cartan maximum principle 338.L Cart an pseudoconvex domain 21 I locally 21.1 Cartan relative integral invariant 219.B Cartan subalgebra (of a Lie algebra) 248.1 (of a symmetric Riemannian space) 413.F Cartan subgroup (of an algebraic group) 13.H (of a group) 249.1 Cartan space 152.C Cartan theorem on analytic sheaves (H. Cartan) '2.E on representations of Lie algebras (E. Cartan) 248.W Cartan-Thullen theorem 2l.H Cartan-Weyl theorem 248.W Carter subgroup 151.D Cartesian coordinates (in an affine space) 7.C Cartesian product (of complexes) 70.C,E (of mappings) 381.C (of sets) 381 B,E Cartesian space 140 Cartier divisor 16.M Cartier operator 9.E CA set 22.A case complexity, worst 7l.A 
1937 Casimir element (of a Lie algebra) 248.1 Casorati determinant 104.D Casorati-Weierstrass theorem (on essential singular- ities) 198.D Cassini oval 93.H Casson handle 114.M Castelnuovo criterion 15.E Castelnuovo lemma 3.E 9.H casus irreducibilis 10.D, App. A, Table I Catalan constant App. A, Table 10.111 catastrophe, elementary 51.E catastrophe point 51.F catastrophe set 51.F catastrophe theory 51 categorical (data) 397.B (set of closed formulas) 276.F categorical system ( of axioms) 35.B categoricity in powers 276.F categories and functors 52 category 52. A Abelian 52.N of Abelian groups 52.B additive 52.N of analytic manifolds 52.B cohomology theory on the 261.Q of commutative rings 52.B diagram in the 52.C of differentiable manifolds 52.B dual 52.F exact 237.1 Grothendieck 200.1 of groups 52.B homotopy. of topological spaces 52.B of left (right) R-modules 52.B of linear spaces over R 52.B of pointed topological spaces 202. B PL 65 A product 52.A quotient 52 N of rings 52.B set of the first 425.N set of the second 425.N of sets 52.B shape 382.A of S-objects 52.G of topological spaces 52.B catenary 93.H catenoid I I 1.1 Cauchy, A. L. 53 Cauchy condensation test 379.B Cauchy condition (on D-integral and D(*)-integral) JOO.E Cauchy criterion (on the convergence of a sequence of real numbers) 87.C, App. A, Table 10.11 Cauchy data 321.A Cauchy distribution 341.D, App. A, Table 22 Cauchy existence theorem (for partial differential equations) 320.B Cauchy filter (on a uniform space) 436.G Cauchy-Hadamard formula 339.A Cauchy inequality 211.C, App. A, Table 8 Cauchy integral formula 198.B Cauchy integral representation 21.C Cauchy integral test (for convergence) 379.B Cauchy integral theorem 198.A,B stronger form of 198.B Cauchy-Kovalevskaya existence theorem 321.A Cauchy-Kovalevskaya theorem, abstract 286.Z Cauchy net (in a uniform space) 436.G Subject Index Cell complex Cauchy polygon 316.C Cauchy principal value of an improper integral 216.D of the integral on infinite intervals 216.E Cauchy problem (of ordinary differential equations) 316.A (for partial differential equations) 3 I 5.A 320.B 321.A 325.B abstract 286,X Cauchy process 5.F asymmetric 5.F symmetric 5.F Cauchy product (of two series) 379.F Cauchy remainder App.A, Table 9.IV Cauchy-Riemann (differential) equation 198.A 274.G (for a holomorphic function of several complex variables) 21.C (for a holomorphic function of two complex variables) 320.F Cauchy-Riemann structure 344.A Cauchy-Schwarz inequality 21 I.C, App. A, Table 8 Cauchy sequence (in an a-adic topology) 284.B (in a metric space) 273.J (of rational numbers) 294.E (of real numbers) 355.B (in a uniform space) 436.G Cauchy sum (of a series) 379.A Cauchy theorem 379.F Cauchy transform 164.1 causality, macroscopic 386.C cause, most probable 40l.E caustic 325.L Cayley algebras 54 general 54 Cayley number 54 Cayley projective plane 54 Cayley theorem (in group theory) 151.H Cayley theorem, Hamilton- 269.F Cayley transform (of a closed symmetric operator in a Hilbert space) 25J.I Cayley transformation 269.J CCP (chance-constrained programming) 408.B CCR 377.A CCR algebra 36.H Cech cohomology group (for topological spaces) 201.L,P relative 20l.M Cech cohomology group with coefficient sheaf :? 383.F Cech compactification, Stone- 207.C Cech complete space 425.T 436.1 Cech homology group (for topological spaces) 201.M relative 20l.M ceiling function 136.D ceiling states 402.G celestial mechanics 55 cell 70.D convex (in an affine space) 7.D fundamental (of a symmetric Riemann space) 4J3.F n- (in a Hausdorff space) 70.D (n - q)-dual 65.B topological n- 140 unit 140 cell complex 70.D closure finite 70.D countable 70.D 
Subject Index Cell-like (CE) Euclidean 70.B finite 70.D locally countable 70.D locally finite 70.D regular 70.D cell-like (CE) 382.D cellular approximation theorem 70.D cellular cohomology group 20l.H cellular decomposition (of a Hausdorff space) 70.D cellular homology group 201.F,G cellular mapping (between cell complexes) 70.D CE mapping 382.D center (of a central symmetry) 139.B (of a continuous geometry) 85.A (of gravity) 271.E (of a group) 190.C (of a hyperbola or ellipse) 78.C (of a lattice) 243.E (of a Lie algebra) 248.C (of mass) 271.E (of a nonassociative algebra) 231.A (of a pencil of hyperplanes) 343.B (of a quadric hypersurface) 7.G (of a quadric surface) 350.A (of a regular polygon) 357.A (of a regular polyhedron) 357.B (of a ring) 368.F (of a solid sphere) 140 (of a sphere) 139.1 (of a von Neumann algebra) 308.C centered process 5.B centering 5.B center manifold theorem 286.V center of curvature 111.E center of projection 343.B center surface 111.1 central composite design 102.M central configuration 420.B central conic(s) 78.C central difference 223.C 304.E, App. A, Table 21 central element (in a lattice) 243.E central extension (of a group) 190.N central figure 420.B centralizer (of a subset of a group) 190.C (of a subset of a ring) 368.F central limit theorem 250.B central moment 397.C central motion 126.E central potential 315.E central processor 75.B central quadric hypersurface 7.F 350.G central quadric surface 350.B cen tral series ascending (of a Lie algebra) 248.C descending (of a Lie algebra) 248.C lower (of a group) 190.1 upper (of a group) 190.1 central simple algebra 29.E similar 29.E central symmetry (of an affine space) 139.B centrifugal force 271.D certainly, almost 342.B,D certainty equivalent 408.B Cesaro method of summation of order r:J. 379.M summable by 379.M Ceva theorem 7.A CFL condition 304.F CG method 302.D 1938 chain 200.H ascending (of normal subgroups of a group) 190.F ascending (in an ordered set) 31l.C conservative 260.A descending (in a lattice) 243.F descending (of (normal) subgroup:; of a group) 190.F descending (in an ordered set) 3] l.C equivalence 200.H general Markov 260.J homotopy 200.H irreducible (a Markov chain) 260.B Markov 260.A minimal 260.F normal (in a group) 190.G normal (in Markov chains) 260.)) q- (of a chain complex) 201.B recurrent 260.B regular (of integral elements) 428.E u- 260.1 chain complex(es) 200.C,H 201.B in an Abelian category 201.B of A-modules 200.C augmented 200.C double 200.E oriented simplicial 201.C positive 200.H product double 200.E quotient 2OO.C relative 2OO.C singular (of a topological space) 201.E chain condition (in an ordered set) 311.C ascending (in an ordered set) 311.C descending (in an ordered set) 311.C chained metric space, well- 79.D chain equivalence 200.C chain homotopy 200.C chain mapping 200.C 201.B over an A-homomorphism 200.C chain recurrent 126.E chain recurrent set 126.E chain rule 106.C chain subcomplex 200.C chain theorem 14.J chain transformation (between complexes) 200.H Chaitin complexity, Kolmogorov- 354.D chamber positive Weyl 248.R Weyl 13.J 248.R chamber complex 13.R chance-constrained programming 408.A chance constraint 408.A chance move 173.B change scalar (of a B-module) 277.L time 261.F 406.B of variables (in integral calculus) 216.C channel 375.F almost finite memory 213.F d-continuous 213.F discrete memoryless 213.F finite memory 213.F noisy 213.A test 213.E channel coding theory 213.A channel Hilbert space 375.F channel wave operators 375.F chaos 126.N 303.G 433.B propagation of 340.F 
1939 Chaplygin's differential equation 326.B Chapman complement theorem 382.B Chapman-Kolmogorovequality 261.A Chapman-Kolmogorovequation 260.A Chapman-Robbins-Kiefer inequality 399.D Chapman theorem (on (C, iX)-summation) 379.M character (of an Abelian group) 2.G (of an algebraic group) 13.D (irreducible unitary representation) 437.V (of a linear representation) 362.E (of a regular chain) 428.E (of a representation of a Lie group) 249.0 (ofa semi-invariant) 226.A (of a topological Abelian group) 422.B absolutely irreducible 362.E Brauer (of a modular representation) 362.1 Chern (of a complex vector bundle) 237.B Dirichlet 295.D Hecke 6.D identity (of an Abelian group) 2.G integral (on the I-dimensional homology group of a Riemann surface) II.E irreducible (of an irreducible representation) 362.E Minkowski-Hasse (of a nondegenerate quadratic form) 348.D modular (ofa modular representation) 362.1 nonprimitive 450.C,E planar 367.G primitive 295.D 450.C,E principal (of an Abelian group) 2.G principal Dirichlet 295.D reduced (of an algebra) 362.E residue 295.D simple (of an irreducible representation) 362.E character formula, Weyl 248.Z character group (of an Abelian group) 2.G (of a topological Abelian group) 422.B characteristic(s) (of a common logarithm) 13l.C (ofa field) 149.B Euler (of a finite Euclidean cellular complex) 201.B Euler-Poincare (of a finite Euclidean complex) 16.E 201.B operating 404.C population 396.C quality 404.A sample 396.C Todd 366.B two-terminal 281.C characteristic class(es) 56 (of an extension of a module) 200.K (of a fiber bundle) 147.K (of foliations) 154.G (of a vector bundle) 56.D ,""(of a real oriented vector bundle) 237.F of codimension q 154.G of a manifold 56.F smooth, of foliations 154.G characteristic curve (ofa network) 281.B (ofa one-parameter family of surfaces) 11l.I (of a partial differential equation) 320.B 324.A,B characteristic equation (of a differential-difference equation) 163.F (for a homogeneous system of linear ordinary Subject Index Characteristic vector differential equations) 252.J (of a linear difference equation) 104.E (of a linear ordinary differential equation) 252.E (of a matrix) 269.F (of a partial differential equation) 320.D (of a partial differential equation of hyperbolic type) 325.A,F characteristic exponent (of an autonomous linear system) 163.F (of the HilI differential equation) 268.B (ofa variational equation) 394.C characteristic function(s) (of a density function) 397.G (ofa graded R-module) 369.F (of a meromorphic function) 272.B (of an n-person cooperative game) 173.D (for an optical system) 180.C (of probability distributions) 341.C (of a subset) 381.C empirical 396.C Hilbert (ofa coherent sheaf) 16.E Hilbert (ofa graded module) 369.F characteristic functional (of a probability distribu- tion) 407.C characteristic hyperplane (of a partial differential equation of hyperbolic type) 325.A characteristic hypersurface (of a partial differential equation of hyperbolic type) 325.A characteristic linear system (of an algebraic family) 15.F characteristic line element 82.C characteristic manifold (of a partial differential equation) 320.B characteristic mapping (map) (in the classification theorem of fiber bundles) 147.G characteristic measure 407.D characteristic multiplier (of a closed orbit) l26.G (of a periodic linear system) 163.F characteristic number (of a compact operator) 68.1 (ofa manifold) 56.F Lyapunov 314.A characteristic operator function 251.N characteristic polynomial (of a differential operator) 112.A (of a linear mapping) 269.L (of a matrix) 269.F (of a partial differential operator) 321.A characteristic root (of a differential-difference equation) 163.A (of a linear mapping) 269.L (of a linear partial differential equation) 325.F (of a matrix) 269.F characteristic series (in a group) 190.G characteristic set (of an algebraic family on a generic component) 15.F (of a partial differential operator) 320.B characteristic strip 320.D 324.B characteristic surface 320.B characteristic value (of a linear operator) 390.A sample 396.C characteristic variety (of a system of micro differential equations) 274.G characteristic vector (of a linear mapping) 269.L (of a linear operator) 390.A 
Subject Index Character module (of an algebraic group) character module (of an algebraic group) 13.0 character system (of a genus of a quadratic field) 347.F charge 150.B charge symmetry 415J Charpit method, Lagrange- 322.B, App. A, Table ] 5.11 Charpit subsidiary equation 82.C 320.D chart alignment 19.0 control 404 B intersection 19.D Chatelet group, Weil- I] 8.D Chebotarev density theorem 14.S Chebyshev formula 299.A Gauss- (in numerical integration) 299.A Chebyshev function App. A, Table 20.11 Chebyshev interpolation 233.A 366.1 Chebyshev orthogonal polynomial 19.G Cheby:,hev polynomial 336.H Chebyshev q-function 19 G, App. A, Table 20.VII simplest 19.G Chebyshev system 336.B Chebyshev theorem 336.B check bits 63 C check matrix, parity 63.C Cheng domain theorem, Courant- 39l.H Chern character (of a complex vector bundle) 237.B Chern class of a C"-bundle 56.C of a manifold 56 F of a real 2n-dimensional almost complex manifold 147 N total 56C of a U(n)-bundle 147.N universal 56.C Chern formula (in integral geometry) 218.D Chern number 56.F Cherwell-Wright differential equation 291.F Chevalley canonical basis (of a complex semisimple Lie algebra) 248 Q Chevalley complex,fication (of a compact Lie group) 249.U Chevalley decomposition (on algebraic groups) 13.1 Chevalley group 151 I Chevalley theorem (forms over finite fields) 118.B Chevalley theorem (on algebraic groups) 13.B Chevalley type (algebraic group) 13.N Chinese mathematics 57 Chinese remainder theorem 297.G chi-square distribution 374.B, App. A, Table 22 noncentral 374 B chi-square method, modified minimum 400.K ch,-square test 400.G of goodness of fit 400.K choice, axiom of 33.B 34.A choice function 33.B 34 A choice process, multistage 127.A choice oet 33.B 34.A Cholesky method 302.B Chomsky grammar 3l.D Chomsky hierarchy 31.D Choqueot boundary 164.C Chow coordinates (of a positive cycle) 16.S Chow-Kodaira theorem 72.F Chawla theorem, Bruck-Ry:,er- 102.E Chow lemma n.H 1940 Chow ring (of a projective variety) 16.R Chow theorem on an analytic submanifold of pN 72.F on the field of rational functions of an analytic space 23.D Chow variety 16.S Christoffel-Darboux formula 317.D Christoffel symbol 80.L III.H 4] 7.D Christoffel transformation, Schwarz- 77.D, App. A, Table 13 Christoffel transformation formula, Schwarz- 77.D chromatic number 157 186 I chromodynamic:" quantum l32.C Chung-Erdos theorem 342.B Church's thesis 356.C circlets) 140 Apollonius App. A, Table 3.V auxiliary (of an ellipse) 78 D of convergence (of a power series) 339.A of curvature III.E director (of an ellipse) 78 D great (of a sphere) 140 inscribed (of a regular polygon) 357.A isometric 234.C of meromorphy (of a power series) 339.D open 140 oscillating III.E quadrature of 179.A unit 74.C 140 circled subset (of a topological linear space) 424.E circle geometry 76.C circle method 4.B circle problem, Gauss 242.A circle type, limit 112.1 circuit 66.G circuit matrix 254.B circular cone 78.A 111.I oblique 350.B right 350.B circular cylinder flU 350.B circular cylindrical coordinates App. A, Table 3.V circular cylindrical surface 350.B circular disk 140 circular domain 333.A circular frequency (of a simple harmonic motion) 318.B circular function I3I.F 432 A circular section 350.F circular unit 14.L circulation (of a vector field) 205.B 442.D circumference 140 circumferentially mean p-valent 438. E circumscribed circle (of a regular polygon) 357.A circumscribing sphere (of a simplex) 139.1 cissoidal curve 93.H cissoid of Diocles 93.H Clairaut differential equation App. A, Table 14.I Clairaut partial differential equation App. A, Table 15.11 class (in axiomahc set theory) 33.C 3SI.G (of a lattice group) 13.P (of a nilpotent group) 190J (of a plane algebraic curve) 9.B (of a quadratic form) 348.H,1 .""-characteristic (of a real oriented vector bundle) 237.F algebra (of central simple algebra!;) 29.E ambig (of a quadratic field) 347.F 
1941 Bravais 92.B canonical (of an algebraic curve) 9.C canonical cohomology 59.H canonical divisor II.D characteristic (of an extension of module) 200.K characteristic (of a fiber bundle) 147.K characteristic (of foliations) 154.G characteristic (of a vector bundle) 56 characteristic, of codimension q 154.G characteristic, of a manifold 56.F Chern (of a CO-bundle) 56.C Chern (of a manifold) 56.F Chern (of a real 2n-dimensional almost complex manifold) 147.N Chern (of a U(n)-bundle) 147.N cohomology 200.H combinational Pontryagin 56.H complete 398.B completely additive 270.B conjugacy (of an element of a group) 190.C countablyadditive 270.B crystal 92 B curve of the second 78.K differential divisor (of a Riemann surface) II.D divisor (on a Riemann surface) 11.D Dynkin 270.B the Dynkin, theorem 270.B equivalence 135.B ergodic 260.B essentially complete 398.B Euler- Poincare (of a manifold) 56.F Euler-Poincare (of an oriented Rn-bundle) 56.B finitely additive 270.B fundamental (of an Eilenberg-MacLane space) 70.F fundamental (of a Poincare pair) 114.1 fundamental (of the Thom complex MG) 114.G fundamental, with coefficient Z2 65.B generalized Hardy 164.G Gevrey 58.G 125.U group of congruence 14.H Hardy 43 F 159.G Hilbert-Schmidt 68.1 holosymmetric 92.B homology 200.H 201.B homotopy 202.B ideal (of an algebraic number field) 14.E ideal (of a Dedekind domain) 67.K ideal, in the narrow sense 14.G 343.F idele 6.0 idele, group 6.D linear equivalence (of divisors) 16.M main 241.A mapping 202.B minimal complete 398.B monotone 270.B the monotone, theorem 270.B multiplicative 270.B nuclear 68.1 oriented cobordism 114.H Pontryagin (of a manifold) 56.F Pontryagin (of an Rn-bundle) 56.D proper 381 G q-dimensional homology 20l.B of a quadratic form over an algebraic number field 348.H residue (modulo an ideal in a ring) 368.F Subject Index Classical (state) Steifel-Whitney (of a differentiable manifold) 147.M Stiefel- Whitney (of a manifold) 56.F Stiefel-Whitney (of an O(n)-bundle) 147.M Stiefel-Whitney (of an Rn-bundle) 56.B Stiefel-Whitney (of a topological manifold) 56.F surface of the second 350.D Todd 237.F total Chern 56.C total Pontryagin 56.D total Stiefel-Whitney 56.B trace 68.1 universal Chern 56.C universal Euler-Poincare 56.B universal Stiefel-Whitney 56.B unoriented cobordism 114.H Wu (of a topological manifold) 56.F Zygmund 159.E class COC function of 106.K function of (of many variables) 58.B mapping of 286.E oriented singular r-simplex of 108.T partition of unity of 108.s singular r-chain of 108.T singular r-cochain of 108.T class C k curve of (in a differentiable manifold) 93.B curve of lin a Euclidean plane) 93.B class CO, function of 106.K class C W , function of 106.K class C' function of 106.K mapping of 286.E class C' atlas of 105.D coordinate neighborhood of 105.D diffeomorphism of 105.1 differentiable manifold of 105.D differentiable manifold with boundary of 105.E differentiable mapping of 105.J differentiable structure of 105.D fiber bundle of 147.0 function of (in a Coo-manifold) 105.G function of (at a point) 105.G functionally dependent of (components of a mapping) 208.C functional relation of 208.C mapping of 208.B 286.E nonsingular mapping of 208.B regular mapping of 208.B vector field of 105.M Z-action of 126.B class C', tensor field of 108.0 class Co, mapping into Banach space of 286.E class (Co), semigroup of 378.B class D, curve of 364.A class field 59.B absolute 59.A class field theory 59 local 59.G class field tower problem 59.F class formation 59.H class function (on a compact group) 69.B class group divisor I1.D classical (potential) 402.G classical (state) 402.G 
Subject Index Classical compact real simple Lie algebra classical compact real simple Lie algebra 248.T classical compact simple Lie group 249.L classical complex simple Lie algebra 248.S classical complex simple Lie group 249.M classical descriptive set theory 356.H classical dynamical system 126.L 136.G classical group(s) 60.A infinite 147.1 202.V classical logic 411.L classical mechanics 271.A classical risk theory 214.C classical solution (to Plateau's problem) 275.C classical statistical mechanics 402.A classical theory of the calculus of variations 46.C classification (with respect to an equivalence relation) 135.B classification theorem on a fiber bundle 147.G first (in the theory of obstructions) 305.B Hopf 202.1 second (in the theory of obstructions) 305.C third (in the theory of obstructions) 305.C classification theory of Riemann surfaces 367.E classificatory procedure 280.1 classifying mapping (map) (in the classification theorem of fiber bundles) 147.G classifying space (of a topological group) 174.G,H cohomology rings of App. A, Table 6.V for f;-structures 154.E n- (of a topological group) 147.G class n function of 84.D projective set of 22.D class N, null set of 169.E class number (of an algebraic number field) 14.E (of a Dedekind domain) 67.K (of a simple algebra) 27.D class of Abelian groups 202.N class w, function of 84.D class I function of 84.D function of at most 84.D class theorems, complete 398.D class C;, function of 84.D class 0, function of 84.D Clebsch-Gordan coefficient 258.B 353.B Clenshaw-Curtis formulas 299.A Clifford algebras 61 Clifford group 61.D reduced 61.D special 61. D Clifford number 61.A Clifford torus 275.F Clifford torus, generalized 275.F clinical trials 40.F closable operator 251.D closed absolutely (space) 425.U algebraically (in a field) 149.1 algebraically (field) 149.1 boundary 164.C H- (space) 425.U hyperbolic, orbit 126.G integrally (ring) 67.1 k- (algebraic set) 13.A multiplicatively, subset (of a ring) 67.1 quasi-algebraically (field) 118.F 1942 f- (space) 425.U real, field 149.N Zariski 16.A closed arc 93.B closed braid 235.F closed convex curve 111.E closed convex hull 424.H closed convex surface 111.1 closed covering 425.R closed curve, simple 93.B closed differential 367.H closed differential form 105.Q closed formula 276.A 299.A in predicate logic 411.J closed geodesic 178.G closed graph theorem 37.1 251.D 42<:..X closed group 362.J closed half-line (in affine geometry) 7.D closed half-space (of an affine space) 7.D closed ideals in L,(G) 192.M closed image (ofa variety) 16.1 closed interval 140 in R 355.C closed linear subspace (of a Hilbert space) 197.E closed manifold 105.B closed mapping 425.G closed operator (on a Banach space) 251.D closed orbit 126.D,G hyperbolic 126.G closed plane domain 333.A closed path (in a graph) 186.F (in a topological space) 170 direct 186.F space of 202. C closed range theorem 37.J closed Riemann surface 367.A closed set 425.B locally 425.J relative 425.J system of 425.B Zariski 16.A closed subalgebra 36.B closed subgroup (of a topological group) 423.D closed submanifold (of a Coo-manifold) 105.L closed subsystem (of a root system) 13.L closed surface 410.B in a 3-dimensional Euclidean spa(:e 111.1 closed system 419.A closed system entropy 402.G closed term (ofa language) 276.A closure 425.B (in a matroid) 66.G (of an operator) 251.D algebraic (of a field) 149.1 convex (in an affine space) 7.D integral (of a ring) 67.1 Pythagorean (ofa field) 155.C closure finite (cell complex) 70.D closure operator 425.B closure-preserving covering 425.X clothoid 93.H cloverleaf knot 235.C cluster 375.F cluster decomposition Hamiltonian :,75.F clustering property 402.G cluster point 425.0 cluster set(s) 62.A boundary 62.A 
1943 curvilinear 62.C interior 62.A cluster value 62.A cluster value theorem 43.G cn App. A, Table 16.m coalgebra 203.F cocommutative 203.F dual 203.F graded 203.B quotient 203.F coalgebra homomorphism 203.F coanalytic set 22.A coarse moduli scheme 16.W coarse moduli space of curves of genus g 9.J coarser relation 135.C coarser topology 425.H cobordant 114.H foliated 154.H h- 114.1 mod 2 114.H normally 114.J cobordism, knot 235.G cobordism class 114.H oriented 114.H un oriented 114.H cobordism group complex 114.H oriented 114.H un oriented 114.H cobordism group of homotopy n-spheres, h- 114.1 cobordism ring 114.H complex 114.H cobordism theorem, h- 114.F coboundary (coboundaries) 200.H (in a cochain complex) 20l.H (in the theory of generalized analytic functions) 164.H module of 2OG.F coboundary homomorphism (on cohomology groups) 20l.L coboundary operator 2OG.F cobounded 201.P cochain(s) 200.H 201.H (products of) 20l.K deformation 305.B finite (of a locally finite simplicial complex) 20l.P n- (for an associative algebra) 2OG.L separation 305.B cochain complex 200.F 201.H singular 201.H cochain equivalence 200.F cochain homotopy 200.F cochain mapping 200.F 20l.H cochain subcomplex 2OG.F Cochran theorem 374.B cocommutative co algebra 203.F cocycle(s) 2OG.H 20l.H (in the theory of generalized analytic functions) 164.H continuous 200.N difference 305.B module of 2OG.F obstruction 147.L 305.B separation 305.B vanishing (on an algebraic variety) 16.U Codazzi, equation of 365.C Codazzi-Mainardi equations 111.H, App. A, Table 4.1 code(s) 63.A 213.D Subject Index Coefficient(s) BHC (Bose-Chaundhuri-Hocquenghem) 63.D block 63.A 213.F convolutional 63.E cyclic 63.D Goppa 63.E group 63.C Hamming 63.C linear 63.C perfect 63.B sliding block 2I3.E tree 2 I3.E trellis 213.E code word 63.A codimension (of an algebraic subvariety) 16.A (of a C'-foliation) 154.B (of an element in a complex) I3.R (of the germ of a singularity 51.C (of a linear space) 256.F (of a PL embedding) 65.D codimension q, characteristic class of 154.G coding rate 213.D coding theorem block 2l3.D noiseless source 2 I3.D source 213.D,E coding theory 63 channel 213.A source 213.A codomain (of a mapping) 381.C co-echelon space 168.B coefficient(s) (of a linear representation) 362.E (of a system of algebraic equations) IO.A (of a term of a polynomial) 337.B of an affine connection SO.L autocorrelation 397.N binomial 330, App. A, Table 17.11 of bulk viscosity 205.C canonical correlation 280.E 374.C Clebsch-Gordan 258.A 353.B confidence 399.Q correlation (of two random variables) 342.C 397.H of determination 397.H,1 differential 106.A of excess 341.H 396.C expansion 317.A Fourier 159.A 197.C 317.A, App. A, Table 11.1 Fourier (of an almost periodic function) 18.B Gini, of concentration 397.D indeterminate, Lagrange method of 106.L Lagrange interpolation 223.A Legendre 393.B multipie correlation 397.1 of order p IIO.A partial correlation 397.J partial differential 106.F population correlation 396.D Racah 353.B reflection 387.D regression 397.H,1 403.D of a Riemannian connection 80.L sample correlation 396.D sample multiple correlation 397.J sample partial correlation 2S0.E of shear viscosity 205.C of skewness 341.H of thermal expansion 419.B torsion (of a complex) 201.B 
Subject Index Coefficient field transmission 387.D transport 402.K universal, theorem 2oo.D,G,H 201.G,H of variation 397.C of viscosity 205.C Wigner 353.B coefficien t field (of an affine space) 7.A (of a projective space) 343.C (of a semilocal ring) 284.D coefficient group (of the cohomology theory) 20l.Q coefficient module 200.L coefficient problem 438.C coefficient ring of a semilocal ring 284.D coefficient sheaf i\' tech cohomology with 383.F cohomology group with 383.E coercive (boundary condition) 112.H 323.H cofactor (of a minor) 103.D cofibering 202.G cofinal (ordinal numbers) 312.C (subset of a directed set) 311.D cofinality 312.C cofinality, cardinality and 33.F cofinalobject 52.D cofinal subnet 87.H cogenerator (of an Abelian category) 200.H cographic 66.H Cohen theorem (on Noetherian rings) 284.A coherence 397.N 421.E coherent algebraic sheaf 16 E 72.F coherent analytic sheaf 72.E coherently oriented (pseudomanifold) 65.B coherent (I,-module 16.E quasi- 16.E coherent sheaf of ring' 16.E coherent vector 277.D Cohn-Vossen theorem 111.1 cohomological dimension (of an associative algebra) 200.L (of a scheme) 16.E (of a topological space) 117.F cohomological functor 200.1 cohomology 6.E 200.H equivariant 431 D exact sequence of 200.F Galois 200.N Gel'fand-Fuks 105.AA /-adic 450.Q non-Abelian 200.M Tatc 200.N Weil 440.Q cohomology class 200.H canonical (in Galois cohomology in the class field theory) 59.H orientation 201.N cohomology exact sequence (for simplicial com- plexes) 20l.L cohomology group(s) 201.H (1. 72.D Alexander 201.M Amitsur 200.P tech (for topological spaces) 201.M,P tech, with coefficient sheaf.'F 383.F cellular 20l.H with coefficient sheaf.'F 383.E de Rham 201.H Dolbeault 72.D 1944 generalized (for CW complexes) 201.H Hochschild 200.L integral (of a topological space) 201.H local 125.W rational 200.0 reduced (of a topological space) 201.H relative (sheaf cohomology) 125 W relative Alexander 201.M relative tech 201.M singular, with compact supports 20\.P singular, of X with coefficients in G 201.H,R cohomology module 2oo.F cohomology operation(s) 64 functional 202.S primary 64.B stable 64.B stable primary 64.B stable secondary 64.C cohomology ring of a compact connected Lie Group App. A, Table 6.1V de Rham (of a differential manifold) 105.R de Rham (of a topological space) 201.1 of an Eilenberg-Maclane complex App. A, Table 6.III singular (of a topological space) 201.I cohomology sequence, exact 200.H cohomology set 172.K cohomology spectral sequence 200.J cohomology theory (theories) Alexander-Kolmogorov-Spanier 201.M on the category of topological pai rs 201.Q complete 200.N with E-coefficient, generalized 202.T generalized 201.Q cohomology vanishing theorems 194.G cohomotopy group 202.1 coideal 203.F coimage (of an A-homomorphism) 277.E (of a homomorphism of presheaves of sheaves) 383.D (of a morphism) 52.N coincidence number (of mappings) 153.B coincidence point (of mappings) 153.B coindex (of C"-function) 279.E cokernel (of an A-homomorphism) 277.£ (of a homomorphism of presheaves or sheaves) 383.D (of a morphism) 52.N collapsing 65.C elementary 65.C collectionwise Hausdorff space 425.AA collectionwise normal space 425.AA collective 342.A Ift- 354.E collective risk theory 214.C collinear points 343. B collinear vectors 442.A collineation(s) 343.D group of 343.D projective 343.D projective, in the wider sense 343.D in the wider sense 343.D collocation method 303.1 colocal (co algebra) 203.1 color conjecture, four 186.1 colored symmetric group 92.D coloring (of a graph) 186.1 
1945 coloring, Tait 157.C color lattice 92.D color point group n.D colors, number of 93.D color symmetric group 92.D column(s) (of a matrix) 269.A iterated series by (of a double series) 379.E repeated series by (of a double series) 379.E column finite matrix 269.K column nullity (of a matrix) 269.D column vector 269.A comass (on k-forms) 275.G Combescure, correspondence of III.F combination k- 330 linear (of elements in a linear space) 256.C linear (of ovals) 89.D multiple App. A, Table 17.11 number of treatment 102.L combination theorem, Klein 234.D combinatorial analysis 66.A combinatorial equivalence 65.A combinatoriallyequivalent 65.A combinatorial manifold 65.C combinatorial mathematics 66.A combinatorial Pontryagin class 56.H combinatorial problems App. A, Table 17.11 combinatorial property 65.A combinatorial sphere, group of oriented differentiable structures on 114.1 combinatorial theory 66.A combinatorial topology 426 combinatorial triangulation 65.C combinatorial triangulation problem 65.C combinatorics 66 comb space 89.A commensurable 122.F common divisor (of elements of a ring) 67.H greatest 67.H 297.A common logarithm 131.C common multiple (of elements of a ring) 67.H least 67.H 297.A common notion 35.A communality 346.F commutant 308.C commutation relations canonical 351.C 377.A normal 150.D commutative algebra 203.F commutative diagram 52.C commutative field 368.B commutative group 2.A 190.A commutative law of addition (in a ring) 368.A on the addition of natural numbers 294.B in the algebra of sets 381.B on cardinal numbers 49.C for group composition 190.A in a lattice 243.A for multiplication (in a commutative ring) 368.A on the multiplication of natural numbers 294.B commutative Lie group 249.D commutatively convergent series 379.C commutative multiplication (of a graded algebra) 203.B homotopy- 203.D commutative ring 368.A category of 52.B Subject Index Compact set commutativity, event 346.G commutator (of differential operators) 324.C (of two elements of a group) 190.H self- 251.K commutator group 190.H commutator subgroup 190.H commutor 368.E comonad 200.Q compact (continuous mapping) 286.D (kernel distribution) 125.L (linear operator) 68.B (topological space) 425.S linearly 422.L loea!!y 425.V locally linearly 422.L relatively (linear operator) 331.B relatively (maximum likelihood method) 399.M relatively (subset) 273.F 425.S real- 425.BB sequentially 425.S a- 425.V T- 68.F 331.B uniformly locally 425.V weakly (linear operator) 68.M compact algebraic group, k- 13.G compact cardinal number strongly 33.E weakly 33.E compact complex manifolds, family of 72.G compact C'-manifold 105.D compact element (of a topological Abelian group) 422.F compact foliation 154.H compact form (of a complex semisimple Lie algebra) 248.P compact group 69.A compact homotopy class 286.D compactification (of a complex manifold) 72.K (of a Hausdorff space) 207.A (of a topological space) 425.T compactifying (kernel) 125.L Aleksandrov 207.C Bohr 18.H F- 207.C Kerekjarto-Stoi10w 207.C Kuramochi 207.C Martin 207.C one-point (of a topological space) 425.T resolutive 207.B Royden 207.C Stone-Cech 207.C 425.T Wiener 207.C compact leaf 154.D compact metric space 273.F compactness theorem (in model theory) 276.E compact open Coo topology 279.C compact-open topology 279.C 435.D compact operators 68 compact real Lie algebra 248.P compact real simple Lie algebra classical 248.T exceptional 248.T compact set 425.S in a metric space 273.F relatively 399.M 425.S uniform convergence on 435.C 
Subject Index Compact simple Lie group compact ,imple Lie group clas,ical 249 L exceptional 249.L compact space 425 S countably 425.S locally 425.V real- 425.BB sequentially 425.S a- 425 V uniformly locally 425 V compact support (singular q-cochain) 201.P compact typc (symmetric Riemannian homogeneous space) 412.D compactum, dyadic 79.D companion matrix 301.1 comparability theorem for cardinal numbers 49.B comparison, paired 346.C comparison test (for convergence) 379.B comparison theorem (in the theory of differential equations) 316.E metric 178.A triangle 178.A compass 179.A compatible with composition 409.C with C'-,tructurc 1l4.B with the multiplication of a group 190.C with operation (of an operator domain) 277.C with operations in a linear spacc 256.F with topology 436.H with a triangulation (a C'-;,trllcture) 1l4.C compiler 75.C complement (of a decision problem) 71.B (in lattice theory) 42.A 243.E (in set theory) 381.B orthogonal (of a subset of a Hilbert space) 197.E relative (of two scts) 381.B complementary analytic set 22.A complementary degenerate series 437 W complementary degree 200.1 complcmentaryevent 342 B complcmcntary law of reciprocity 14.0 complementary law of the Jacobi symbol 297.1 complementary law of the Legendre symbol first 297.1 second 297.1 complementary modulus (in Jacobi elliptic func- tions) 134.J, App. A, Table 16.1 complementary series 437.W complementary set 381.B complementary slackness, Tucker theorem on 255.B complementary submodule 277.H complementary subspace (of a linear subspace) 256.F complcmentation, law of (in a Boolean algebra) 42A complement conjecture, knot 235.B complementcd (Banach space) 37.N complemented lattice 243.E complement theorem 382.B Chapman's 382.B completable topological group 423.H complete (Abelian p-group) 2.D (algebraic variety) l6.D (increasing family of a-algebras) 407.B (logical system) 276 D 1946 (metric space) 273.J (ordinary differential equation) 126.C (predicate) 356.H (recursively enumerable set) 356.D (set of closed formulas) 276.F (statistics) 396.E (system of axioms) 35.B (system of orthogonal functions) 317.A (topological group) 423.H (uniform space) 436.G (valuation) 439.D (vector lattice) 31O.C (wave operator) 375.B,H (Zariski ring) 284.C B- (locally convex space) 424.X fully (locally convex space) 424.X holomorphically (domain) 2l.F NP- 71.E at 0 (in the theory of deformation) 72.G quasi- (locally convex space) 424.F a- (vector lattice) 310.C weakly 1-, manifold 21.L complete accumulation point 425.0 complete additive group 2.E complete additivity (of the Lebesgue integral) 221.C complete additivity (of a measure) 270.D complete additivity theorem, Pettis 443.G complete analytic space, K- 23.F complete bipartite graph 186.C complete blocks 102.B complete capture 420.D complete class 398.B essentially 398.B minimal 398.B complete class theorems 398.D complete cohomology theory 200.N complete distributive law (in a lattice-ordered group) 243.G complete elliptic integral App. A, Table 16.1 of the first kind 134.B of the second kind 134.C complete form, theorem on 356.H complcte frec resolution (of Z) 2OO.N complete graph l86.C complete hyperbolic manifolds 21.0 complete induction 294.B complete integrability condition 428.C complete intersection 16.A complete lattice 243.D conditionally 243.D a- 243.D complete linear system on an algebraic curve 9.C on an algebraic variety 16.N defined by a divisor 16.N complete local ring 284.D structure theorem of 284.D completely additive (arithmetic function) 295.B (measure) 270.D,E (vector measure) 443.G completely additive class 270.B completely additive set function 380.C completely continuous operator 68.B completely integrable l54.B completely integrable (system of indepndent 1- forms) 428.D completely integrally closed (ring) 67.1 completely monotonic function 240.E,K 
1947 completely multiplicative number-theoretic function 295.B completely non unitary 251.M completely normal space 425.Q completcly passive 402.G completely positive 36.H completely positive entropy 136.E completcly primary ring 368.H completely randomized design 102.A completely reducible A-module 277 H completely reducible group 190.L completely reducible linear representation 362.C completely regular space 425.Q completely unstable flow 126.E complete manifold, weakly 1- 2J.L completc mapping 241.B complete maximum principle 338 M complete measure 270.D complete measure space 270.D complete metric space 273 J completene'is (for a Cartan connection) 80.N (of a logical system) 276 D (of thc prcdicate calculus) 411.1 asymptotic 150.D NP- 7!.E theorem of (in geometry) 155.B completeness of real numbers 294.E 355.B completeness of scattering states 150.D 386.B completeness theorem. Godel 411.1 complete observation 405.C,D complete orthogonal system 217 G complete orthonormal set (Hilbert space) 197.C complete orthonormal system 217.G of fundamcntal functions 2J7.G complete pivoting 302.B complete predicate 356.H complete product measure space 270.H complete quadrangle 343.C complete reducibility theorem, Poincare 3.C complete Reinhardt domain 2J.B complete residue system modulo In 297.G complete ring (with respect to an ideal I) 16.X complete scheme, k- 16.E complete set 241 B complete a-field 396.E boundedly 396.E complete solution (of partial differential equations) 320.C complete space 436.G tech 425.T 436.1 Dieudonnc 436.1 holomorphically 23.F topologically 436.1 complete 5ystem of axioms 35.B of independent linear partial differential equa- tion" 324.C of inhomogeneous partial differential equations 428. C of nonlinear partial differential equations 428.C complete valuation 439.D complete vector lattice 310.C a- 3JOC complete Zariski ring 284.C completion a-adic (of an R-module) 284.B of a field (with rcspect to a valuation) 439.D Subject Index Complex(es) of a measure space 270.D of a metric space 273.J /l- 270.D of an ordered set 243.D of a ring along an ideal 16X of Spec(A) along V(f) 16,X of aT 2-topological group 423.H of a uniform space 436.G of a valuation 439.D of a valuation ring (of a valuation) 439.D complex(es) 70 (in an Abelian category) 201.B (in buildings and EN pairs) 13.R (over an object of an Abelian category) 200.H f:- 72.D abstract simplicial 70.C acyclic 201. B Amitsur 2oo.P cell 70.D chain 200.C chain (in an Abelian category) 201.B chain, over C 20l.C chain, over A 201.G chamber 13.R closure finite cell 70.D cochain, over A 201.H cochain (in an Abelian category) 200.F,H cochain (of a simplicial complex) 201.H countable cell 70.D countable simplicial 70.C Coxeter 13.R CW 70.D de Rham 20J.H de Rham (as an elJiptic complex) 237.H Dolbeault 72.D double 2oo.H double chain 200.E dual 65.B Eilenberg-MacLane 70.F elliptic (on a compact C-manifold) 237.H Euclidean 70.B Euclidean cell 70.B Euclidean simplicial 70.B finite cell 70.D finite simplicial 70.C geometric 70.B isomorphic simplicial 70.C isomorphic s.s. 70.E Kan 70.E linear (in projective geometry) 343.E linear line 343.E locally countable cell 70.D locally countable simplicial 70.C locally finite cell 70.D locally finite simplicial 70.C minimal 70.E multiple 200.H negative 200.H ordered (of a simplicial complex) 70.E ordered simplicial 70.C oriented simplicial chain 201.C Poincare 114 J positive 2oo.H positive chain 200.C Postnikov 70.G product 200.H product (of cell complexes) 70.D product double chain 200.E quotient 201.L 
Subject Index Complex algebraic variety quotient chain 200.C rectilinear 70.B regular cell 70.D relative chain 200.C relative cochain 200.F semisimplicial (s.s.) 70.E simplicial 65.A 70.C singular (of a topological space) 70.E singular chain (of a topological space) 20l.E singular cochain 201.H space form 365.L S.s. 70.E s.s., realization of 70.E standard (of a Lie algebra) 200.0 Thom 114.G Thom, associated with (G, n) 114.G Thom, fundamental cohomology class of the 114.G complex algebraic variety 16.T complex analytic fiber bundle 147.0 complex analytic function 198.H complex analytic manifold 72.A complex analytic structure (in a complex manifold) 72. A complex analytic submanifold 72.A complex cobordism group 114.H complex cobordism ring 114.H complex conjugate bundle 147.F complex conjugate representation 362.F complex dimension (of a complex manifold) 72.A complex form (on a Fourier series) 159.A complex form (of a real Lie algebra) 248.P complex function 165.B complex Gaussian 176.B complex Gaussian process 176.C complex Gaussian random variable 176.B complex Gaussian system 176.B complex Grassmann manifold 199.B complex group (over a field) 60.L complex Hermitian homogeneous space 199.A complex Hilbert space 197.B complexification Chevalley 249.U of a real Lie algebra 248.P complex interpolation space 224.C complexity average 71.A of computation 71 Kolmogorov-Chaitin 354.D space 71.A time 71.A worst-case 71.A complex Lie algebra 248.A of a complex Lie group 249.M complex Lie group 249.A complex linear space 256.A complex line bundle 72.F determined by a divisor 72.F complex manifold(s) 72 almost 72.B compact, family of 72.G isomorphic 72.A stably almost l14.H weakly almost 114.H complex multiplication 73 complex number plane 74.C complex numbers 74.A 294.F conjugate 74.A complex of lines 110.B complex orthogonal group 60.1 1948 complex orthogonal matrix 269.J complex plane 74.C complex projective space 343.E infinite-dimensional 56 complex quadratic form 348.A,B complex representation (of a Lie grouP) 249.0 complex simple Lie algebra classical 248.S exceptional 248.S complex simple Lie group classical 249.M exceptional 249.M complex space form 365.L complex special orthogonal group 60.1 complex spectral measure 390.D complex spectral representation 390.E complex spectral resolution 390.E complex sphere 74 D complex spinor grou p 61.E complex Stiefel manifold 199.B complex structure (in a complex manifold) n.A (pseudogroup structure) 105.Y (on R 2 n) 3.H (on a Riemann surface) 367.A almost 72.B deformation of n.G tensor field of almost (induced by a complex structure) 72.B complex symplectic group 60.L complex topological linear space 424.A complex torus 3.H complex-valued function 165.B complex variable 165.C theory of functions of 198.Q complex vector bundle 147.F component(s) (of a direct product set) 381.E (in graph theory) 186.F (of a matrix) 269.A (of a point in a projective space) 343.C (of a tensor of type (p, q)) 256.J (of a vector) 256.A 442.A (of a vector field) 105.M arcwise connected 79.B basic (of an m-dimensional surface) IIO.A connected 79.A 186.F of degree n (of a graded A-module) 2oo.B embedded primary (of an ideal) 67.F fixed (of a linear system) 16.N ghost (of an infinite-dimensional vector) 449.A horizontal (of a homogeneous SpClce) 110.A horizontal (of a vector field) 80.C identity (of a topological group) 423.F irreducible (of an algebraic variety) 16.A irreducible (of an analytic space) 23.C irreducible (of a linear representalion) 362.D isolated primary (of an ideal) 67.F ith (of an element relative to a ba:;is) 256.C ith (of an n-tuple) 256.A nilpotent (of a linear mapping) 269.L orthogonal (of an element of a linear space) 139.G path- 79.B primary (of an ideal) 67.F principal 280.F principal, analysis 280.F principal, of order p IIO.A proper (of an intersection of subvarieties) 16.G Reeb 154.B 
1949 relative (of a Lie transformation group) 11O.A secondary (of a homogeneous space) 11O.A semisimple 269.L simple (of a semisimple ring) 368.G strongly connected 186.F unipotent (of a linear mapping) 269.L variable (of a linear system) 15.C 16.N vertical (of a vector field) 80.C component model 403.F components-of-variance model 403.C composite of cohomology operations 64.B of correspondences 358.B of homotopy classes 202.B of mappings 381.C of morphisms 52.A of subsets 436.A of valuations 439.F composite designs, central 102.M composite field 149.D compositefunction 106.1 composite hypothesis 400.A composite number 297.B composite particles 132.A composition (of knots) 235.A (of probability distributions) 341.E external law of (of a set of another set) 409.A internal law of (of a set) 409.A law of (on a set) 409.A secondary 202.R composition algebra 231.B composition factor 190.G composition factor series 190.G composition product (of functions) 192.H composition series (in a group) 190.G (in a lattice) 243.F composition theorem (in class field theory) 59.C compound Poisson process 5.F comprehension, axiom of 33.B 381.G compressibility. isothermal 419.B compressible fluid 205.B compression, information 96.B computable (partial function) 31.B computation analog 19.A complexity of 71 high-precision 138.B compute 31.B 71.B computers 75 analog 19.E digital 75.B electronic 75.A electronic analog 19.E hybrid 19.E comultiplication 203.B,F Hopf 203.D concatenation (of paths) 170 concave function 88.A strictly 88.A concave programming problem 292.A concentration area of 397.E asymptotic 399.L Gini coefficient of 397.E measure of 397.E spectral 331.F concentration function maximal 341.E Subject Index Condition(s) mean 341.E concept basic (of structure) 409.B global (in differential geometry) 109 in the large (in differential geometry) 109 local (in differential geometry) 109 in the small (in differential geometry) 109 conchoidal curve 93.H conchoid of Nicomedes 93.H concircularly flat space App. A, Table 4.11 concordant 154.F concurrent (in nonstandard analysis) 293.B (in projective geometry) 343.B condensation of singularities, principle of 37.H condensation point 425.0 condensation test, Cauchy 379.B condition(s) adjoint boundary 315. B ascending chain (for (normal) subgroups of a group) 190.F ascending chain (in an ordered set) 311.C Baire 425.N boundary (for an ordinary differential equation) 315.A boundary (for partial differential equations of elliptic type) 323.F boundary, operator with 112.F Cauchy (on the D-integral and the D(*)- integral) l00.E CFL (Courant-Friedrichs-Lewy) 304.F chain (in an ordered set) 311.C complete integrability 428.C consistency 341.1 Denjoy-Carleman 168.B descending chain (for (normal) subgroups of a group) 190.F descending chain (in an ordered set) 311.C entropy 204.G of finite character (for functions) 34.C of finite character (for sets) 34.C finiteness, for integral extension 284.F Frobenius integrability 154.B Haar (on best approximation) 336.B Harnack (on the D-integral and the D(*)- integral) l00.E Holder, of order a 84.A initial (for ordinary differential equations) 316.A initial (for partial differential equations) 321.A Jacobi 46.C KMS 308.H Levi 325.H Lindeberg 250.B Lipschitz 84.A 163D 286.B 316.D Lipschitz, of order a 84.A Lorentz 130.A LSZ asymptotic 150D Lyapunov 250.B maximal (in an ordered set) 311.C minimal (in an ordered set) 311.C no cycle 126.J Palais-Smale 279.E 286.Q Poincare (in the Dirichlet problem) 120.A restricted minimal (in a commutative ring) 284.A of R. Schmidt (in (C, a)-summation) 379.M Sommerfeld radiation 188.D spectrum 150.D strip 320.D 
Subject Index Conditional density strong transversality 126.1 transversality 108.B of transversality (in the calculus of variations) 46.B uniqueness (for solution of an ordinary differen- tial equation) 316.D von Neumann 304.F Whitney, (b) 418.G Whitney, (b) at a point 418 G conditional density 397.1 conditional distribution 3971 conditional cntropy 213 B conditional expectation (of a random variable) 342.E conditional ineq uality 2l1.A conditionality, principlc of 401 C conditionally complctc latticc 243.D conditionally convergent 379.C,E conditionally a-complete latticc 243.D conditionally stablc 394.D conditional mean (of a random variable) 342.E 397.1 conditional moments 397.J conditional probability 342.E regular 342.E conditional probability distribution 342.E conditional problems in the calculus of variations 46.A conditional relative extremum (of a function) J06.L conditiOnal sclf-intcrsection 213 B conditional stability 394.0 condition number 302 A conductivity 130.B conductor (of an Abelian extension) 14.Q (of a class field) 59.B (of Dirichlet L-functions) 450.C (of an ideal group) 14.H (of a Grossencharakter) 450.F (of Becke L-functions) 450.E (of a non primitive character or a primitive chalacter) 450.C (of a quadratic field) 347.G (of a residue charactcr) 295.D (of a subring of a principal order) 14 B p-(ofnorm-re<,idue) 14P conductor-ramification thcorcm (in class field the- ory) 59.C conductor with a group character 450.G conc (of a PL embedding) 65 D (in a projcctivc spacc) 343.E (of a simplicial complex) 70 C (over a space) 202 E asymptotic 350.B circular ?S.A 111.1 conjugate convcx 89.F convex 89 F convex polyhedral 89.F dual convcx 89.F extension (of a PL embedding) 65.D futurc 258.A light 258.A Mach 205 B mapping 202.E natnral positive 308.K obliquc circular 350.B past 258.A quadric 350 B reduced (of a topological space) 202.F 1950 reduced mapping 202.F regular 384.A right circular 350.B self-dual regular 384.E side 258.A confidence coefficient 399.Q confidence interval 399.Q confidence level 399.Q confidence limits 399.Q confidence region 399.Q invariance of 399.Q unbiased 399.Q uniformly most powerful 399.Q uniformly most powerful unbiased 399.Q configuration central 420.B Pascal 78.K configuration space 126.L 402.G confluent differential equation 167.A conflucnt hypergeometric differential equation 167.A, App. A, Table 14.1119.1 confluent type function of J 67.A hypergeometric function of 167.A, App. A, Table 19.1 confocal ccntral conics, family of 78.B confocal parabolas, family of ?S.H confocal quadrics, family of 350.E conformal 77.A almost 275.C conformal arc element 1 JO.D conformal connection 80.P conformal correspondence (between su rfaces) 11 1.1 conformal curvature Il0.D conformal curvature tensor, Weyl 80.P, App. A, Table 411 conformal differential geometry 110.D conformal function, fl- 352. B conformal geometry 76.A conformal invariant 77.E conformallyequivalent 77.A 367.A 191.B conformally flat 191.B conformally flat space App. A, Table 4.11 conformal mapping 198.A, App. A, Tlble 13 general ized 246.1 extremal quasi- 352.C quasi- 352 conformal space 76 A conformal structure 191.B conformal structure (on a Riemann surface) 367.A conformal torsion 1 10.D conformal transformation 80.P 364.F confounded with blocks 102.1 partially ('With blocks) 102.J congruence (in geometry) 155.B (in number theory) 297.G linear (in projective geometry) 343.E of lincs 110.B multiplicative 14.H congruence axiom (of geometry) 155.B congruence classes modulo m*, group of 14.H congruence subgroup (of a modular gfJUp) 122.D principal, of lcvel N 122.D congruence zeta function 450.P congruent (figures) l39.C (segment) 155.B affinely 7.E 
195\ in the Erlangen program 137 congruent modulo m 297 G congruent transformation(s) 139.B group of 285.C conic(s) 78.A central 78.C focal (of a quadric) 350.E pencil of 343. E conical function App. A, Table 18.11 conical hypersurface, quadric 350.G conical surface 111.1 quadric 350.B conic Lagrange manifold 274.C 345 B conic section(s) 78.A equation of 78.C canonical form of the equation of 78.C conjecture Adams (on i-homomorphisms) 237.1 annulus (on combinatorial manifolds) 65.C Artin (on Artin L-functions) 450.G Bieberbach (on univalent functions) 438.C BirchSwinnerton-Dyer (on L-functions of elliptic curves) 118.D 450.S Burnside (on finite groups) 151.D C n (on Kodaira dimension) 72.H entropy 126.K four color 186.1 fundamental (in topology) 70.C generalized Poincare 65.C general knot 235.B Hasse (on Hasse zeta function) 450.S Hodge (on cycles on algebraic varieties) 450.S 1wasawa main (on p-adic L-functions) 450J knot complement 235.B Leopoldt (on p-adic L-functions) 450.1 Mordell (on Diophantine equations) 118.E Poincare (on a characterization of spheres) 65.C property P- (on knot groups) 235.B Ramanujan (on automorphic functions) 32.D Ramanujan-Petersson (on Hecke operators) 32.D Sato (on Hasse zeta functions) 450.S Schreier (on simple groups) 151.1 Seifert (on vector fields) 126.K 154.D Smith (on knot theory) 235.E stability 126.1 Taniyama- Weil (on L-functions of elliptic curves) 450.S Tate (on Hasse (-functions) 450.S unknotting 235.E Vandiver (on the class number of cyclotomic fields) 14.L Weil (on congruence zeta functions) 450.Q conjugacy C'- 126.B topological 126.B conjugacy class (of an element of a group) 190.C conjugate (CG method) 302.D (diameter) 78.G (element) 149.1 (point in a geodesic) 178.A (point in a projective space) 343.E (with respect to a quadric surface) 350.C (quaternion) 29.D (subset) 190 C c- 126.B harmonic (in projective geometry) 343.D Q- 126.H Subject Index Connected sequences of functors topological 126.B conjugate axis (of a hyperbola) 78.C conjugate complex number 74.A conjugate convex cone 89.F conjugate differential (on Riemann surface) 367.H conjugate exponent 168.C conjugate field 149.J 377.C conjugate Fourier integral 160.D conjugate function 159.E 160.D conjugate gradient (CG) method 302.D conjugate harmonic function 193.C conjugate hyperbola 78.E conjugate ideal (of a fractional ideal) 14.1 conjugate operator (in Banach spaces) 37.D (of a differential operator) 125.F (of a linear operator) 251.D conjugate planes (with respect to a quadric surface) 350.C conjugate point(s) (in the calculus of variations) 46.C (in a Riemannian manifold) 364.C conjugate pole 350.C conjugate Radon transform 218.F conjugate representation 362.F complex 362.F conjugate series (of a trigonometric series) 159.A conjugate space (of a linear topological space) 424.D (of a normed linear space) 37.D conjugation mapping (of a Hopf algebra) 203.E conjugation operator 164.K conjunction fofpropositions) 411.B connected (affine algebraic group) 13.A (design) 102.K (graded module) 203.B (graph) 186.F (topological space) 79.A (treatment) 102.B arcwise (space) 79.B K- 186.F locally (at a point) 79.B locally (space) 79.A locally arcwise (at a point) 79.B locally arcwise (space) 79.B locally n- (at a point) 79.C 10cally n- (space) 79.C locally w- (space) 79.C multiply (plane domain or space) 333.A n- (pair of topological spaces) 202.L n- (space) 79.C 202.L n-ply (plane domain) 333.A w- (space) 79.C path- (space) 79.B simply (covering Lie group) 249.C simply (space) 79.C 170 simply, group 13.N strongly (components) 186.F connected component 79.A 186.F arcwise- 79.B strongly 186.F connected Lie subgroup 249.D connectedness 79 186. F of real numbers 294.E connectedness theorem general, due to W. Fulton and 1. Hansen 16.1 Zariski 16 X connected part 150.D connected sequences of functors 200.1 
Subject Index Connected set connected set 79.A connected space 79.A connected sum (of oriented compact Coo-manifolds) 114.F (of 3-manifolds) 65.E connecting homomorphism in cohomology 200.F in homology 200.C on homology groups 201.C,L connecting morphism 200.H,I connection(s) 80 affine 80.H 286.L affine, coefficients of 80.L canonical affine (on Rn) 80.1 Cartan 80.M conformal 80.F Euclidean 364.B Euclidean, manifold with 109 flat 80.E Gauss-Manin (of a variety) 16.V Levi-Civita 364.B linear 80.H 10calIy flat 80.E metric 80.K normal 365.C projective 80.0 Riemannian 80.K 364.B Riemannian, coefficients of 80.L connection form 80.E 417.B connection formula for the solutions of a differential equation 253.A connection of spin and statistics 132.A 150.D connection problem 253.A connective fiber space, n- 148.D connectives, propositional 411.E connectivity (of a space) 201.A conoid, right 111.1 conoidal neighborhood 274.E conormal 323.F conormal bundle 274.E conormal sphere bundle 274.E co-NP 71.E conservation laws, even-oddness 150.D conservative (measurable transformation) 136.C conservative chain 260.A conservative process 261.B conserved axial-vector currents, partialIy 132.C consistency (condition in the multistep method) 303.E (of an estimator) 399.K (of a logical system) 276.D of analysis 156.E of the axiom of choice and the continuum hypothesis 33.D {c n }- 399.K rela ti ve 156.D consistency condition 341.1 consistency proof 156.D of pure number theory 156.E consistent (finite difference scheme) 304.F (formal system) 411.1 (system of axioms) 35.B a.s. 399.K {c n }- 399.K Fisher 399.K w- 156.E consistent and asymptoticalIy normal (CAN) estimator 399.K 1952 consistent estimator 399.K consistent kernel (in potential theory) 338.E consistent-mass scheme 304.D consistent test 4oo.K uniformly 400.K constant(s) 165.C arbitrary (in a general solution of a differential equation) 313.A Bloch 77.F Boltzmann 402.B dielectric 130.B empirical 19.F error 303.E Euler 174.A integral 216.C integration (in a general solution of a differential equation) 313.A isoperimetric 391.D Lagrange method of variation of 252.D Landau 77.F method of variation of 55.B 2521 phase (of a sine wave) 446 Planck 115 renormalization 150.D Robin 48.B schlicht Bloch 77.F structural (of a Lie algebra) 248.C universal (in the theory of conformal mapping) 77.F constant breadth, curve of 89.C constant curvature space of 364.D, App. A, Table 4.11 surface of 111.1 constant function 381.C constant inclination, curve of 111.F constant mapping 381.C constant pressure, specific heat at 419.B constant sheaf 383.D 10calIy constructible 16.AA constant stratum, /l- 418.E constant-sum game 173.A constant term of a formal power series 376.A of a polynomial 337.B unfolding 51.D constant variational formula 163.E constant volume, specific heat at 419.B constant width, curve of 111.E constituent (of an analytic or coanalytil; set) 22.C constraint 102.L 264.B capacity 281.D chance 408.B unilateral 440.A constraint qualification Guignard 292.B Slater 292.B constraint set (of a minimization problt:m) 292.A constructibility, axiom of (in axiomatic set theory) 33.D constructible (set in axiomatic set them y) 33.D constructible sheaf 16.AA 10calIy, constant 16.AA construction bar (of an Eilenberg-MacLane cOfClplex) 70.F geometric, problem 179.A GNS 308.D group measure space 136.F impossible, problem 179.A possible, problem 179.A w- (of an Eilenberg-MacLane complex) 70.F 
1953 construction problem (of Class field tower) 59.F constructive field theory 150. F constructive method 156.D constructive ordinal numbers 81.B consumer's risk 404.C contact, thermal 419.A contact element 428.E in a space with a Lie transformation group IIOA contact form 110.E contact manifold IIO.E contact metric structure IIO.E contact network 282.B contact pair (in circle geometry) 76.C contact process 340.C contact structure 105.Y contact transformations 82, App. A, Table 15.1V quantized 274.F contain 381.A physically 35LK content (of a tolerance region) 399.R Jordan 270.G mean 399.R context-free grammar 31.D context-sensitive grammar 31.D contiguous 399.M contingency table 397.K 400.K continuable, analytically 198.1 continuation analytic 198.G analytic, along a curve 198.1 analytic, in the wider sense 198.0 direct analytic 198.G harmonic 193.M 198.G continuation method 30LM continuation theorem Hartogs 2LF Remmert-Stein 23.B Riemann 2LF unique 323) continued fractions 83.A finite 83.A infinite 83.A mixed periodic 83.C normal 83.E pure periodic 83.C recurring 83.B simple 83.A continuity absolute, space of 390.E axioms of (in geometry) 155.B Dedekind axiom of (for real numbers) 355.A equation of (for a fluid) 205.A equation of (for electro magnetics) 130.A 204.B Hartogs theorem of 21.H interval of (for a probability distribution) 341.C local 45.F modulus of (of a function) 84.A modulus of, of kth order (of a continuous function) 336.C properties of 85.A of real numbers 294.E uniform 45.F continuity (*), generalized absolute 100.C in the restricted sense lOO.C continuity principle for analytic functions of several complex vari- ables 21.H in potential theory 338.C Subject Index Continuous geometry quasi- (in potential theory) 338.1 continuity property for tech theory 201.M continuity requirement, variational principles with relaxed 271.G continuity theorem Abel (for Dirichlet series) 339.B Abel (for power series) 121.D Levy 341.F continuous (additive interval function) 380.B (flow) 136.D (function of ordinal numbers) 312.C (mapping) 84.A 425.G absolutely (function) 100.C absolutely (mapping in the plane) 246.H absolutely (measure) 270.L absolutely (set function) 380.C absolutely (vector measure) 443.G absolutely, in the restricted sense 100.C absolutely, in the sense of Tonelli 246.C absolutely, (*) 100.C completely (operator) 68.D equi- 435.D equi-, semigroup of class (CO) 378.B generalized absolutely 100.C hypo- 424.Q left 84.B from the left 84.B in the mean 217.M in the mean (stochastic process) 407.A /l-absolutely 380.C with respect to the parameter (a distribution) 125.H piecewise, function 84.B in probability 407.A right 84.B from the right 84.B separately (bilinear mapping) 424.Q strongly (function with values in a Banach space) 37.K uniformly 84.A 273.1 436.E uniformly, on a subset 436.G weakly (function with values in a Banach space) 37.K continuous action (in topological dynamics) 126.B continuous additive interval function 380.B continuous analytic capacity 164.J continuous arc(s) 93.B continuous co cycle 200.N continuous distribution (probability theory) 341.D continuous dynamical system 126.B continuous flow (in ergodic theory) 136.D (on a topological space) 126.B continuous functions 84 absolutely 100.C generalized absolutely 100.C lower semi- 84.C lower semi- (at a point) 84.C on a metric space 84.C piecewise 84.B quasi- 338.1 right 84.B semi- (at a point) 84.C uniformly (in a metric space) 84.A upper semi- 84.C upper semi- (at a point) 84.C continuous geometry 85 irreducible 85.A reducible 85.A 
Subject Index Continuous homomorphism continuous homomorphism (between topological groups) 423.1 open 423.J continuous image 425.G continuously differentiable function, n-times 106.K continuous mapping 425.G space of 435.D strongly 437.A uniformly (of metric spaces) 273.1 uniformly (of uniform spaces) 436.E continuous plane curvc 93.B continuous representation strongly (of a topological group) 69.B weakly (of a topological group) 69.B continuous scmiflow 126.B continuous semimartingale 406.B continuous spectrum 390.A absolutely 390 E of an integral eq uation 2l7.J continuous spin 258.C continuous state branching process 44.E continuous tensor product 377.D continuum 79.D cardinal number of 49.A indecomposable 79.D irreducible 79.D Peano 93.D continuum hypothesis 49.D consistency of the axiom of choice and 33.D generalized 49.D independence of the axiom of choice and 33.D contour(s) additivity of (in the curvilinear integral) 94.D of an integration 94.D contract, annuity 214.B contracted tensor 256.L contractible space 79.C 202.D locally 79.C 202 D locally, at a point 79 C contraction (of a graph) l86.E (linear operator) 37.C (of a mapping) 381.C (of a matroid) 66 H (of a tensor) 256.L sub- l86.E contraction principle 286.B contractive 251.N purely 251 N purely, part 251.N contradiction 41 U contradictory formal system 41U contragredicnt (of a linear mapping) 256.G contragredient representation 362.E contrast elementary 102.C normalized 102.C treatment 102.C contravariant functor 52.H contravariant index (of a component of a tensor) 256.1 contravariant of order r and covariant of order s 108 D contJavariant spinor 258.B contI avariant tensor alternating 256.N of degree p 256.J symmetric 256.N contravariant tcnsor algebra 256.K 1954 contravariant tensor field of order r 105.0 contravariant vector 256.1 contravariant vector field 10S.0 control admissible 405.A bang-bang 405.C feedback 405.C impulse 405.E inventory 227 local 102.A optimal 46.D 86.B,C 405.A quality 404.A stochastic 342.A 405 time-optimal 86.F control chart 404.B controllability 86.C controlled stochastic differential equation 405.A controlled tubular neighborhood system 418.G control limit lower 404.F upper 404.B control problem, time optimal 86.F control space (in static model in catasl rophe theory) 51.B control theory 86 control unit 75.B convention Einstein 256.1 Einstein summation 417.B Maxwell 51.F perfect delay SI.F con verge (filter) 87.1 (infinite product) 379.G (in a metric space) 273.D (net) 87.H (sequence of lattices) 182.B (sequence of numbers) 87.B 355.B (series) 379.A (in a topological space) 87.E almost certainly 342.D almost everywhere 342.D almo,t surely 342.D in distribution 168.B 342.D in the mean of order p 342.D in the mean of power p 168.B in probability 342.D with probability 342.D strongly 37.B uniformly (in a uniform space) 435.A weakly (in a normed linear space) 37.E weakly (in a topological linear ';pace) 424.H convergence 87 (of a filter) 87.1 (of a net) 87.H (of probability measures) 341 F (of truncation errors) 303.B abscissa of (of a Dirichlet serie:;) 121.B absci,sa of (of a Laplace transform) 240.B,H absolute, abscissa of (Dirichlrt series) 121.B absolute, abscissa of (of a Laplace transform) 240.B associated, radii 21.B asymptotic 168.B axis of 240.B cirele of (of a power series) 339.A exponent of 429.B generalized 331.C norm resolvent 331.C 
1955 radius of (of a power series) 339.A relative uniform star 31O.F simple, abscissa of (of a Dirichlet series) 121.B star 87 K strong (of operators) 251.C strong resolvent 331.C uniform 435 uniform (of a serics) 43S.A uniform (of operators) 251.C uniform, abscissa of (of a Dirichlet series) 121.B uniform, abscissa of (of a Laplace transform) 240.B uniform. on compact sch 435.C weak (of operators) 25!.C weak (of probability mcasures) 341.F weak (of a sequence of submodules) 200 ] Weierstrass criterion for uniform 435.A convergence criterion for positive series App. A, Table lO.n convergence domain (of a powcr scries) 21.B convergcnce in measure 168.B convergence method 354.B convergence theorem on distnbutions 125.G Lebesgue 22\.C of martingales 262.B convergent (continued fraction) 83 A (double series) 379.E (filtration) 200.J (infinite integral) 216.E (sequence) 87.B 355.B (series) 379.A absolutely (doublc series) 379.E absolutely (infinite product) 379.G absolutely (Laplace-Stieltjes integral) 240.B absolutcly (power series) 2!.B absolutely (series) 379.C absolutely (serics in a Banach space) 443.D commutativcly 379.C conditionally 379.C.E intermediate 83.B (0)- 87.L (a)-star 87 L ordcr (in a vector lattice) 31O.C pointwisc 435 B princi pal 83.B simply 435.B unconditionally 379.C uniformly (on a family of scts) 435 C uniformly (sequence, series, or infinite product) 435 A uniformly, in the wider sense 435.C uniformly absolutely 43S.A convergent power series 370 B convergent power series ring 370.B convergent scqucncc 355.B convex (function on a G-space) 178.H (function on a Riemannian manifold) 178.B (subset of a sphcrc) 274.E (subset of a sphere bundle) 274.E abwlutely 424.E holomorphically, domain 21.H locally (linear topological space) 424.E logarithmically (domain) 2!.B matrix (of ordcr m) 212.C opcrator 212.C properly 274.E Subject Index Coordinate(s) uniformly (normed Jinear space) 37.G convex analysis 88 convex body 89.A convex cell (in an affine space) 7.D convex closure (in an affine space) 7.D convex cone conjugate 89.F dual 89.F convex curve, closed III.E convex functions 88.A proper 88.D strictly 88.A convex hull 89.A (in an affine spacc) 7.D (of a boundary curve) 275.B (in lincar programming) 255.D closed 424.H convexity theorem Lyapunov 443.0 M. Riesz 88.C convex neighborhood 364.C convex polyhedral cone 89.F convex polyhedron 89.A convex programming 264.C convex programming problem 292.A convex rational polyhedral 16.Z convcx set(s) 89 absolutely (in a linear topological space) 424.E in an affine space 7.D p- (for a differential operator 112.C regularly 89.G strongly P- 112.C strongly separated 89.A convex surface, closed 111.1 convolution (of arithmetic functions) 295.C (of distributions) 125.M (of functions) 159.A 192.H (of hyperfunctions) 125.X (of probability distributions) 34!.E (in the theory of Hopf algebra) 203.H generalized (of distributions) 125.M convolutional code 63.E cooperative game 173.A,D coordinate(s) 90 (of an element of a direct product of sets) 381.E (in the real line) 355.E affine 7.C barycentric (in an affine space) 7.C 90.B barycentric (in a Euclidean simplicial complex) 70.B barycentric (in thc polyhedron of a simplicial complex) 70.C bipolar 90.C, App A, Table 3.Y bipolar cylindrical App. A, Table 3.Y canonical (of a Lie grouP) 249.Q Cartesian (in an affinc space) 7.C Chow (ofa positive cycle) 16.S circular cylindrical App. A, Table 3.V curvilinear 90.C, App. A, Table 3.Y cylindrical 90.C App. A, Table 3.Y ellipsoidal 90.C B3.A, App. A, Table 3.V elliptic 90.C 350.E, App. A, Table 3.Y elliptic cylindrical App. A, Table 3.Y equilateral hyperbolic 90.C, App. A, Table 3.Y gcncralized (in analytical dynamics) 271.F gcncralizcd cylindrical App. A, Table 3.y geodesic 80.1 geodesic polar 90.C 
Subject Index Coordinate axis Grassmann (in a Grassmann manifold) 90.B homogeneous (of a point in a projective space) 343.C hyperbolic cylindrical App. A, Table 3.V hyperplane (of a hyperplane in a projective space) 343.C inhomogeneous (of a point with respect to a frame) 343.C isothermal 90.C ith (of an element relative to a basis) 256.C Klein line 90.B Kruskal 359.D line (ofa line) 343.C local (on an algebraic variety) 16.0 local (on a topological manifold) 105.C local, transformation of 90.D moving App. A, Table 3.1V multiplanar 90.C multipolar 90.C (n + 2)-hyperspherical 76.A 90.B normal 90.C oblique (in a Euclidean space) 90.B orthogonal curvilinear 90.C parabolic 90.C parabolic cylindrical App. A, Table 3.V parallel (in an affine space) 7.C pentaspherical 90.B plane (of a plane) 343.C Pliicker (in a Grassmann manifold) 90.B polar 90.C, App. A, Table 3.V projecti ve 343. C rectangular (in a Euclidean space) 90.B rectangular hyperbolic 90.C rotational App. A, Table 3.V rotational hyperbolic App. A, Table 3.V rotational parabolic App. A, Table 3.V spherical 90.C 133.D tangential polar 90.C tetracyclic 90.B trilinear 90.C tripolar 90.C coordina te axis of an affine frame 7.C ith (ofa Euclidean space) 140 coordinate bundle(s) 147.B equivalent 147.B coordinate curve (in a Euclidean space) 90.C coordinate function (of a fiber bundle) 147.B (in the Ritz method) 304.B coordinate hyperplane (of an affine frame) 7.C coordinate hypersurface (in a Euclidean space) 90.C coordinate neighborhood of class C' 105.D of a fiber bundle 147.B of a manifold 105.C coordinate ring (of an affine variety) 16.A homogeneous 16.A coordinate system 90.A (of a line in a projective space) 343.C geodesic, in the weak sense 232.A holomorphic local 72.A isothermal curvilinear App. A, Table 3.V I-adic 3.E local (of a topological space) 90.D 105.C moving 90.B orthogonal, adapted to a flag 139.E orthogonal curvilinear App. A, Table 3.V projective 343.C 1956 coordinate transformation (of a fiber bundle) 147.B (of a locally free £1\-Module) 16.E coplanar vectors 442.A coproduct of commutative algebras 29.A of an element in a graded coalgebra 203.B Hopf 203.D of two objects 52.E co radical 293.F CORDIC 142.A core 173.D coregular representation (of an algebra) 362.E corestriction (homomorphism of cohomology groups) 200.M Corioli force 271.D corner polyhedron 215.C Cornish-Fisher expansions 374.F Cornu spiral 93.H 167.D Corona problem 43.G Corona theorem 164.1 coroot 13.J correcting, error- 63.A correcting capability, error- 63.B correctly posed (initial value problem) 321.E (problems for partial differential equations) 322.A corrector (in a multistep method) 303.E Milne 303.E correlation 343.D involutive 343.D Kendall rank 371.K serial 397.N serial cross 397.N Spearman rank 371.K correlation coefficient (of two random variables) 342.C 397.H canonical 280.E 374.C multiple 397.J partial 397.J population 396.D sample 396.D sample multiple 280.E sample partial 280.E serial 421.B correlation inequalities 212.A correlation matrix 397.J correlation ratios 397.L correlation tensor 433.C correlogram 397.N correspond 358.B correspondence 358.B algebraic (of an algebraic variety) 16.1 algebraic (of a nonsingular curve) 9.H algebraic, group of classes of 9.H birational 16.1 of Combescure I11.F conformal (between surfaces) 111.1 geodesic (between surfaces) 111.1 homothetic (between surfaces) 111.1 inverse 358.B one-to-one 358.B similar (between surfaces) 111.1 univalent 358.B correspondence principle 351.D correspondence ring (of a nonsingular curve) 9.H corresponding angles 139.D corresponding points (with respect to confocal quadrics) 350.E cos (cosine) 131.E 432.A 
1957 cos- 1 131.E cosec (cosecant) 131.E 432.A cosech (hyperbolic cosecant) 131.F cosemisimple 203.F coset double (of two subgroups of a group) 190.C left (of a subgroup of a group) 190.C right (of a subgroup of a group) 190.C coset space (of a topological group) left 423.E right 423.E cosh (hyperbolic cosine) I31.F cosigma functions 134.H, App. A, Table 16.1V cosine(s) 432.A first law of 432.A, App. A, Table 2.11 hyperbolic 131.F integral 167.D law of (on spherical triangles) 432.B, App. A, Table 2.m optical direction 180.A second law of 432.A, App. A, Table 2.11 cosine integral 167.D, App. A, Table 19.11 cosine series, Fourier, App. A, Table IU cosine transform, Fourier I 60.C, App. A, Table 11.11 cospecialization (in etale topology) 16.AA co spectral density 397.N cost 281.D imputed 2SS.B shadow 292.C unit 281.D cost of insurance 214.B cost of observation 398.F cot (cotangent) 131.E cotangent(s) 432.A hyperbolic 131.F law of App. A, Table 2.11I cotangent bundle 147.F cotangential sphere bundle 274.E cotangent vector bundle 147.F Cotes formula, Newton- (in numerical integration) 299.A coth (hyperbolic cotangent) 13 I.F cotrce 186.G cotriple 200.Q counit 203.F counity (of a coalgebra) 203.B countabilityaxioms 42S.P countable additivity 270.B countable cell complex 70.D countable Lebesgue spectrum 136.E countable model (of axiomatic set theory) IS6.E countable ordinal number 49.F countable set 49.A countable simplicial complex 70.C locally 70.C countably additive class 270.B countably compact space 42S.S countably equivalent sets 136.C countably Hilbertian space 424.W countably infinite set 49.A countably normed space 424.W countably paracompact space 42S.Y countably productive property 42S.Y counting constants, principle of 16.S counting function (of a meromorphic function) 272.B Courant-Cheng domain theorem 391.H Cousin problem 21.K first 21.K Subject Index Covering Lie group, simply connected second 21.K covariance (of two random variables) 342.C 397.H population 396.D sample 396.D covariance distribution 395.C covariance function 395.A,B sample 39S.G covariance matrix 341.B 397.J asymptotic 399.K variance 341.B 397.1 covariant 226.D absolute 226.D absolute multiple 226.E with ground forms 226.D multiple 226.E of an n-ary form of degree d 226.D relativistically IS0.D covariant derivative (of a tensor field) 80.1 417.B, App. A, Table 4.1I (of a vector field) 80.1 (of a vector field along a curve) 80.1 van der Waerden-Bortolotti 417.E covariant differential (of a differential form) 80.G (of a tensor field) 80.l,L 417.B (of a vector field) 80.1 covariant functor S2.H covariant index (of a component of a tensor) 2S6.J covariant spinor 2S8.B covariant tensor alternating 2S6.N of degree q 2S6.J symmetric 2S6.N covariant tensor field of order s 10S.0 covariant vector 2S6.J covariant vector field 10S.0 covector, p- 256.0 cover (a set) 381.D covering(s) (covering space) 91.A (curve) 9.1 (of a set) 381.D 42S.R closed (of a set) 42S.R closure-preserving 42S.X countable (of a set) 42S.R degree of (of a nonsingular curve) 9.1 discrete (of a set) 42S.R e- (of a metric space) 273.B finite (of a set) 42S.R locally finite (of a set) 42S.R mesh of (in a metric space) 273.B n-fold (space) 91.A normal (of a set) 42S.R open (of a set) 425.R point-finite (of a set) 42S.R regular (space) 91.A a-discrete (of a set) 42S.R a-locally finite (of a set) 42S.R star-finite (of a set) 42S.R unramified (of a nonsingular curve) 9.1 covering curve 9.1 covering differentiable manifold 91.A covering dimension (of a normal space) 117.B covering family (in Grothendieck topology) 16.AA covering group (of a topological group) 91.A 423.0 universal (of a topological group) 91.B 423.0 covering homotopy property 148.B covering law 381.D 42S.L covering Lie group, simply connected (of a Lie algebra) 249.C 
Subject Index Covering linkage invariant(s) covcring linkagc invariant(s) 235.E covcring manifold 91 A covcring mapping (map) 91.A covcring space(s) 91 analytic 23.E C- 23.E in the sense of Cartan 23. E ramified 23.B uni versal 91. B covering surfacc(s) 367.B Ahlfors theory of 367.B with relative boundary 367.B unbounded 267.B uni versal 367. B unramified 367.B covcring system, umform 436.D covcring thcorem, Vitali 380.D covering transformation 91.A of an unramified un bounded covering surface 367.B covering transformation group 91.A Coxctcr complex 13.R Coxctcr diagram (of a complex semisimple Lie algebra) 248.S Coxeter group l3.R CPM 307.C 376 Cramer-Castillon problem (in geometric construc- tion) l79.A Cramcr-Rao inequality 399.D Cramer rule 269.M Cramer theorem 399.M crash dut ation 281.D creation operator 377.A Cremona tmnsformation 16.1 CR-equivalcnce 344.A criterion Cartan, of semisimplicity (of Lic algebras) 248.F Cartan, of ';olvability (of Lie algebras) 248.F Castelnuovo 15.E Cauchy 87 C, App A, Table 10.n convergence, for positive series App. A. Table 10.11 d'Alembert App. A, Table 10.11 Eulcr 297 H Gau" App A, Table 10.11 Jacobian (on regularity of local rings) 370.B Kummer 145, App. A, Table 10.11 logarithmic App. A, Table 1O.n Nakai-Moishezon (of ampleness) l6.E Nyquist's 86.A Raabe App. A, Table 10 II of ruled surfaces 15.E Schlomilch App. A, Table 10.n simplex 255.D Wcicrstrass, for uniform convergence 435.A Weyl l82.H criterion function 127 A critical (Galton-Watson process) 44.B critical dcterminant 182.B critical exponcnt 111 C critical inclination problem 55.C critical lattice in M with respect to S 182.B critical manifold, non degenerate 279.D,E critical path 376 critical percolation probability 340 D critical point (of a C' -function on a manifold) 279.B (of a C' -mapping '1': M -> A1') 105.1 (ofa flow) 126.0 1958 (of a function on Rl) 106.L (of a harmonic function) 193.1 (of a mapping u: R n -> Rm) 208.B degenerate 106.L 279.B nondegenerate 106.L 279.B 286.N critical region 400.A critical value (in bifurcation theory) 286.R (of a CX -function on a manifold) 279.B (of a C' -mapping '1': M -> M') 105.1 (of a con tact process) 340. C (of an external magnetic field) 340.B (of a mapping u. Rn -> Rm) 208.B Crofton formula (in integral geometry) 2l8.B cross cap (a surface) 4l0.B crosscut(s) (of a planc domain) 333 A fundamental sequence of (in a simply connected domain) 333.B crossed homomorphism (of an associative algebra) 200.L crossed product (in C*-algebra theory) 36.1 (of a commutative ring and a group) 29.D (in von Neumann algebra theory) 308.1 crossings normal 16.L only normal l6.L crossing symmetry 132.C 386.B cross norm, C*- 36.H cross product (of cohomology groups) 201.1 (of homology groups) 201.1 (of vector bundlcs) 237.C cross ratio 343.D cross section (of a fiber bundle) 147.L 286.H (of a fiber space) 148.D (ofaflow) l26.C absorption 375.A differential 375.A 386.B local (in a topological group) 147.E scattering 375.A total 386.B total elastic 386.B cross-sectional data 128.A 397.A cross spectral density function 421.E Crout method 302.B CR structure 344.A crystal class 92.B arithmetic 92.B gcometric 92.B 3-dimensional App. B, Table 5.1V crystal family 92.B crystallographic group 92 crystallographic restriction 92.A crystallographic space group 92.A crystal system 92.B cube 357.B duplication of 179.A Hilbert 382.B unit l39.F 140 unit n- 140 cubic equation 10.D, App. A, Table I cubic map 157.B cubic P 92.E cubic resolvent App. A, Table I cumulant 397.G factorial 397.G joint 397 I cumulative distribution 397.B 
1959 cumulative distribution curve 397 B cumulative distribution function 341.B 342.C cumulative distribution polygon 397.B cup product (of cohomology classes) 20l.! (of derived functors) 200.K (in K-theory) 237.C cup product reduction thcorem (on cohomology or homology groups) 200.M curl (of a differentiable vector field) 442.D curren t ] 25. R 4-, density 150.B integral 275.G partially conserved axial-vector 132.C random 395.1 current algebra 132.C Curtis formulas, Clenshaw- 299.A curvature (of a curve of class cn) ]]] D (of a plane curve) ]]].E absolute (of a curve) II1.C affine ] lO.C circle of 1] l.E conformal ] 1O.D constant, space of 364.D, App. A, Table 4.II constant, surface of 11l.I Gaussian (of a surface) ]] ] .H, App A, Table 4.1 geodesic ]] 1.H, App. A, Table 4.1 S. Germain (of a surface) ] I1.H, App A, Table 4.1 holomorphic sectional 364.D integral (of a surface) 111 H line of (on a surface) ]] 1.H Lipschitz-Killing 279.C mean 364.D mean (of a surface) ]] l.H 365.D, App. A, Table 4.1 mean, vector 365.D minimum, property 223.F negative 178.H nonpositive, G-space with ] 78.H normal (of a surface) 1] l.H principal (of a surface) ] I ].H 365.C radius of (of a plane curve) ]]].E radius of (of a space curve) ] 1].F radius of principal (of a surface) ]] l.H Ricci 364.D Riemannian 364.D scalar 364.D, App. A, Table 4 II sectional 364.D total (of an immersion) 365.0 total (of a surface) ]]] .F,H, App. A, Table 4.1 total Gaut>sian (of a surface) J] 1.H total mean 365.0 curvature form 80.G 364.D curvature tensor (of an affine connection) 80.1,L 4]7.B (of a Frechet manifold) 286.L (of a Riemannian manifold) 364.D projective App. A, Table 4.\1 Weyl conformal 80 P, App A, Table 4.11 curvets) 93] 1].A algebraic 9.A analytic (in an analytic manifold) 93.B analytic (in a Euclidean plane) 93.B asymptotic ]] 0 B ] ] ].H Bertrand I] I.F bicharacteristic 325.A characteristic (network flow problem) 281.B Subject Index Curve(s) characteristic (of a ] -parameter family of surfaces) Ill.! characteristic (of a partial differential equation) 320.B 324.A,B cissoidal 93.H of class C k (in a differentiable manifold) 93.B of class C k (in a Euclidean plane) 93.B closed convex ] 11.E of constant breadth 89.C of constant inclination 11l.F of constant width IIl.E continuous plane 93.B coordinate (in a Euclidean space) 90.C covering 9.1 Darboux ] 10.B Delaunay 93.H dual (of a plane algebraic curve) 9.B elliptic 9.C exceptional ]5.G exponential 93.H of the first kind 15.G Frechet 246.A fundamental (with respect to a birational mapping) 16.1 fundamental theorem of the theory of II1.D general 93.D generating ] 1].1 hyperelliptic 9.D influence 371.1 integral (of a Monge equation) 324.F integral (of ordinary differential equations) 3]6.A Jordan 93.B logarithmic 93.H Lorentz 397.F Mannheim I11.F meromorphic 272.L nodal 391.H OC - 404.C ordinary 93.C Peano 93.1 pedal 93.H plane App. A, Table 4.1 plane algebraic 9.B of pursuit 93.H rational 9.C rational (in a Euclidean plane) 93.H rectifiable 93.F rolliag (of a roulette) 93.H of the second class 78.K of the second order 78.1 simple closed 93.B sine 93.H solution (of ordinary differential equations) 316.A stable 9.K stationary 46.B stationary (of the Euler-Lagrange differential equations) 324.E of steepest descent 46.A timelike 324.A tooth ] 81.E in a topological space 93.B transcendental 93.H u- ] I1.H unicursal 9.C 93.H unicursal ordinary 93.C universal 93.E v- 1] l.H variation 17P A 
Subject Index Curve fitting curve fitting 19.F curve tracing 93.G curvilinear cluster set 62.C curvilinear coordinates 90.C, App. A, Table 3.V orthogonal 90.C planar App. A, Table 3.V in 3-dimensional space App. A, Table 3.V curvilinear coordinate system isothermal App. A, Table 3.V orthogonal App. A, Table 3.V curvilinear integrals 94.A with respect to a line element 94.D with respect to a variable 94.D cushioned refinement 425.x cusp of a curve 93.G of a Fuchsian group 122.C parabolic (of a Fuchsian group) 122.C of a plane algebraic curve 9.B cusp form 450.0 in the case of one variable 32.B in Siegel half-space 32.F cuspidal parabolic subgroup 437.X cusp singularity 418.C cut (in a projective space) 343.B (ofQ) 294.E (of R) 355.A disjunctive 215.C Gomory 215.B subadditive 215.C cut locus 178.A cutoff ISO.C cut point (on a geodesic) 178.A cutset (in a graph) 186.G cutset matrix (of a graph), fundamental 186.G cutting (P' by p') 343.B cutting plane 215.B fractional, algorithm 2IS.B CW complex 70.D CW decomposition 70.D CW pair 201.L cy bernetics 95 cycle (on an algebraic variety) 16.M (of basic sets) 126.J (of a chain complex) 200.H (=cyclic permutation) 151.G (of time series data) 397.N algebraic 4S0.Q algebraically equivalent 16.R dividing (on an open Riemann surface) 367.1 foliation IS4.H fundamental (of an oriented pseudomanifold) 6S.A fundamental (in a resolution of a singular point) 418.C limit 126.1 module of 200.C no, condition 126.J numerically equivalent 16.Q one ] 6.R positive (on an algebraic variety) 16.M rationally equivalent 16.R Schubert S6.E vanishing 418.F zerO 16.R cycle index 66.E cyclic algebra 29.G cyclic code 63.D 1960 cyclic determinant 103.G cyclic element 251.K cyclic equation 172.G cyclic extension 172.B cyclic group 190.C cyclic Jacobi method 298.B cyclic part (of an ergodic class) 260.B cyclic representation (Banach algebra) 36.E cyclic representation (topological groups) 437.A cyclic subgroup (of a group) 190.C cyclic vector (of a representation space of a unitary representation) 437.A cyclide 90.B cyclide of Dupin III.H cycloid 93.H cyclomatic number 186.G cyclotomic field 14.L cyclotomic polynomial 14.L cyclotomic Z,-extension 14.L cyclotomy 296.A cylinder circular 111.1 350.B elliptic 350.B hyperbolic 350.B mapping 202.E parabolic 350.B cylinder function elliptic 268.B parabolic 167.C cylinder set 270.H n- 270.G cylindrical coordinates 90.C, App. A, Table 3.V bipolar App. A, Table 3.V circular App. A, Table 3.V elliptic App. A, Table 3.V generalized App. A, Table 3.V hyperbolic App. A, Table 3.V parabolic 167.C, App. A, Table 3 V cylindrical equation, parabolic App. A, Table 14.11 cylindrical functions 39.B, App. A, Table 19.111 cylindrical hypersurface, quadric 350.13 cylindrical surface 111.1 circular 350.B elliptic 3S0.B hyperbolic 3S0.B parabolic 350.B D b delta E0(Q) 12S.B 168.B E0'(Q) 125.B E0{M p l' E0 IMp ) 168.B E0{M p !, E0(M p ) 125.U E0 L (Q) (the totality of functions f(x) in Coo(Q) such p that for all IX, D"f(x) belongs to L,,(Q) with respect to Lebesgue measure) 168.B b-measure 270.D L\-refinement (of a covering) 42S.R L\-set 22.D 8-functor 200.1 uni versal 200.1 8*-functor 200.1 a-complex 72.D a-cohomology groups 72.D d-continuous channels 213.F d-dimensional analytic set, purely 23.13 d-trial path dependent 346.G d"-cohomology group n.D D-sufficient a-field 396.1 
1961 D-wave 315.E D-integrable function loo.D D-integral definite loo.D indefinite loo.D D(*)-integral 100D d'Alembert criterion App. A, Table 10.11 d'Alembertian 130.A d'Alembert method of reduction of order 252.F d'Alembert paradox 205.C d'Alembert solution 325.D damped oscillation 318.B damping ratio (of a damped oscillation) 318.B Daniell-Stone integrable function 310.1 Daniell-Stone integral 310.1 Danilevskil method 298.D Darboux curve 110.B Oarboux formula, Christoffel- 317.D Darboux frame 11O.B Darboux quadric IIO.B Darboux sum 216.A Darboux tangent 110.B Darboux theorem 216.A 428.A Darmois theorem, Skitovich- 374.H data 96.B cross-sectional 128.A macroeconomic 128.A microeconomic 128.A scattering 287.C 387.C data analysis, statistical 397.A data base 96.B data processing 96 data structures 96.B Davidenko's method of differentiation with respect to a parameter 301.M death insurance 214.B death process 260.G birth and 260.G death rate, infinitesimal 260.G Debye asymptotic representation 39.D, App. A, Table 19.m decidable number-theoretic predicate 356.C decision 127.A decision function(s) 398.A invariant 398.E minimax 398.B sequential 398.F space of 398.A statistical 398.A decision problem 71.B 97 186.J n- 398.A sequential 398.F statistical 398.A decision procedure, statistical 398.A decision process, Markov J 27.E decision rule sequential 398.F terminal 398.F decision space 398.A decision theoretically sufficient (J-field 396.1 decoder 213.D decoding 63.A decomposable operator (on a Hilbert space) 308.G decompose (a polygon) J 55.F decomposed into the direct sum of irreducibk representations 437.G decomposition (of a set) 381.D Bruhat (of an algebraic group) 13.K canonical (of a closed operator) 251.E Subject Index Decreasing real analytic function cellular (of a Hausdorff space) 70.D Chevalley (on algebraic groups) 13.1 cluster, Hamiltonian 375.F CW 70.D de Rham (of a Riemannian manifold) 364.E direct(of a group) 190.L Doob-Meyer 262.C D-optimality 102.E dual direct product (of a decomposition of a compact or discrete Abelian group) 422.H ergodic (of a Lebesgue measure space) 136.H Fefferman-Stein 168.B formula of Radon 125.CC Heegurard 65.C Iwasawa (of a connected semisimple Lie group) 249.T Iwasawa (of a real semisimple Lie algebra) 248.V Jordan (of an additive set function) 380.C Jordan (of a function of bounded variation) 166.B Jordan (of a linear mapping) 269.L Jordan (in an ordered linear space) 310. B Khinchin 395.B Lebesgue, theorem 270.L Levi (on algebraic groups) 13.Q Levi (on Lie algebras) 248.F multiplicative Jordan (of a linear transforma- tion) 269.L Peirce (of a Jordan algebra) 231.B Peirce left (in a unitary ring) 368.F Peirce right (in a unitary ring) 368.F plane wave 125.CC polar 251.E relative Bruhat 13.Q Riesz (in Markov process) 260.D Riesz (in martingale) 262.C Riesz (of a superharmonic or subharmonic function) 193.S semimartigale 406.B simplicial (of a topological space) 79.C singular value (SVD) 302.E spectral 126.J Wiener-Ita 176.1 Witt (of a quadratic form) 348.F Wold 395.D Zariski 15.D decomposition-equal polygons 155.F decomposition field (of a prime ideal) 14.K decomposition group (of a prime ideal) 14.K decomposition number (of a finite group) 362.1 generalized (of a finite group) 362.1 decomposition theorem canonical 86.C in class field theory 59.C for dimension 117.C Lebesgue (on a completely additive set function) 380.C unique (for a 3-manifold) 65.E decreasing, monotone 380.B decreasing COO-function, rapidly 168.B decreasing distribution, rapidly 125.0 decreasing Fourier hyperfunction, exponentially J25.BB decreasing function monotone 166.A strictly 166.A strictly monotone l66.A decreasing real analytic function, exponentially 125.BB 
Subject Index Decreasing sequence, monotonically decreasing sequence, monotonically (of real num- bers) 87.B rapidly 168.B decremcnt, logarithmic (of a damped oscillation) 318.B Dedekind,1. W. R. 98 Dedekind, test of du Bois-Reymond and 379.D Dedekind axiom of continuity (for real numbers) 355.A Dedckind dl'icriminant theorem 14.1 Dcdekind eta function 328.A Dedekind principle (in a modular lattice) 243.F Dedekind sum 328.A reciprocity law for 328.A Dedekind theory of real numbers 294.E Dedekind zeta function 14.C 450.D decp water wave 205.F defect (of a block of representations) 362.1 (of a conjugatc class in a group) 362.1 (of a meromorphic function) 272.E defect group of a block of representations 362.1 (of a conjugate class in a group) 362.1 deficiency (of an algebroidal function) 17.C (of a closed operator) 251.D (of a linear system on a surface) 15.C maximal (of an algebraic surface) 15 E deficiency index (of a closed symmetric operator) 251.1 (of a differential opcrator) 112.1 dcficient number (in elementary theory of numbers) 297.D dcfined along V' (for a rational mapping) 16.1 defincd over k' (for an algebraic variety) 16.A define recursively 356.C defining functions (of a hyperfunction) 125.V standard 125.Z defining idcal (of a formal spectrum) 16.X defl1ling module (of a lincar system) 16.N defining relations (among the generators of a group) 161.A definite negative (function) 394.C negative (Hermitian form) 348 F negative (quadratic form) 348.B positive (function) 36.L 192.B,1 394.C 437.B positive (Hermitian form) 348.F positive (kernel) 217.H positive (matrix) 269.1 positive (potential) 338.D positive (quadratic form) 348.B positive (sequence) 192.B semi- (Hermitian form) 348.F semi- (kernel) 217.H totally (quaternion algebra) 27.D definite D-integral 100.D definitc integral 216.C, App A, Table 9.V (of a hypcrfunction) 125.X definite quadratic form 348.C definition field of 16.A first (of algebraic K-group) 237.J second (of algebraic K-group) 237.J truth 185.D defmition by mathematical induction 294.B definition by transfinite induction 311 C det1ation in homological algebra 200.M 1962 method for an eigenvalue problem 298.C deformation (of complex structures) n.G (of a graph) 186.E infinitesimal, to the direction c/8s n.G isomonodromic 253.E isospectral 387.C projective (between surfaces) IIO.B of a scheme over a connected scheme 16. W of a surface 110. A deformation cochain 305.B deformation retract 202.D neighborhood 202.D strong 202.D degeneracy (of energy eigenvalues) 35].H set of (of a holomorphic mapping between analytic spaces) 23.C degeneracy index 17.C degeneracy operator (in a semisimplicial complex) 70.E degeneratc (critical point) 106.L 279.B (eigenvalue) 390.A,B (mapping) 208.B (quadratic surface) 350.B (simplex) 70.E totally 234.B degencrate kernel 217.F degenerate module 118.D degenerate series (of unitary representations of a complex semi- simple Lie group) 437.W complementary (of unitary representations of a complex semisimple Lie group) 437.W degree (of an algebraic element) 149.F (of an algebraic equation) 10.A (of an algebraic variety) 16.G (of an angle) 139.D (of a central simple algebra) 29.E (of a divisor class) II.D (of a divisor of an algebraic curve) 9.C (of an element with respect to a prime ideal of a Dedekind domain) 439.F (of an extension) 149.F (of a graph) 186.B (of a Jordan algebra) 231.B (of a linear representation) 362.D (of a matrix representation) 362.D (of an ordinary differential equation) 313.A (of a permutation representation) 362.B (of a polynomial) 337.A (of a prime divisor) 9.D (of a rational homomorphism) 3.C (of a representation of a Lie algebra) 248.B (of a representation of a Lie group) 249.0 (of a square matrix) 269.A (of a term of a polynomial) 337.B (of a valuation) 439.1 (of a O-cycle on an algebraic variety) 16.M complementary (of a spectral sequence) 200J of covering (of a nonsingular curve; 9.1 filtratiOn 200.1 formal (of a unitary representation) 437.M of freedom (of the dynamical system) 271.F of freedom (of error sum of squares) 403.E of freedom (of sampling distributions) 374.B in- 186.B Leray-Schauder 286.D local. of mapping 99.B 
1963 mapping 99 A of mapping 99 A out- 186.B of the point 99.D of a prime divisor of an algebraic function field of dimension I 9.D of ramification (of a branch point) 367.B of recursive unsolvability 97 relative (of a finite extension) 257.D relative (of a prime ideal over a field) 14.1 of symmetry 431.D total (of a spectral sequence) 200.1 tram,cendence (of a field extension) 149.K of transcendency (of a field extension) 149. K of unsolvability 97 degree k holomorphic differential forms of 72.A tensor space of 256.J degree n alternating group of 151.G component of 200.B general linear group of 60.B projective general linear group of 60.B Siegel modular function of 32.F Siegel modular group of 32.F Siegel space of 32.F Siegel upper half-space of 32.F special linear group of 60 B symmetric group of 151.G degree p, contravariant tensor of 256.J degree q, covariant tensor of 256.1 degree r differential form of J05.Q differential form of (on an algebraic variety) 16.0 mean of (of a function with respect to a weight function) 211.C Dehn lemma (on 3-manifoJds) 65.E Dejon-Nickel method 301.G Delaunay curve 93.H delay convention, perfect 51.F delay-differential equation 163.A delayed recurrent event 260.C Delos problem (in geometric construction) 179.A delta, Kronecker 269 A, App. A, Table 4.11 delta function, Dirac App. A, Table 12.11 demography 40.D de Moivre formula 74.C de Moivre-Laplace theorem 250.B de Morgan law 381.B in a Boolean algebra 42.A Denjoy-Carleman condition 168.B Denjoy integrable in the wider sense 100.D Denjoy integrals 100 in the restricted sense 100 D Denjoy-Luzin theorem 159.1 denominator, partial (of an infinite continued frac- tion) 83.A dense (set) 425.N (totally ordered set) 311.B locally 154.D nowhere 425.N relatively 126.E Zariski 16.A dense in itself 425.0 denseness of rational numbers 355.B density (on a maximal torus) 248.Y Subject Index De Rham theorem (on a C"'-manifold) (of a set of prime ideals) 14.S (of a subset of integers) 4.A angular momentum 150.B beta 397.0 bivariate normal 397.1 conditional 397.1 co spectral 397.N electric fl ux 130.A energy 195.B 4-current 150.B free Lagrangian 150 B gamma 397.D joint 397.1 kinetic 218.A Lagrangian 150.B magnetic flux 130.A point of (of a measurable set of the real line) loo.B posterior 401.B prior 401.B probability 341.D sojourn time 45 G density function 397.D bispectral 421.C marginal 397.1 normal 397.D rational spectral 176.F density matrix 351.B density theorem (on discrete subgroups of a Lie group) 122.F Chebotarev 14.S Kaplansky 308.C Lebesgue 100.B von Neumann 308.C dependence, domain of 325.B dependent algebraically (elements of a ring) 369.A algebraica11y (on a family of elements of a field) 149.K functionally (components of a mapping) 208.C functionally, of class C' (components of a mapping) 208.C linearly (elements in a linear space) 256.C linearly (elements in an additive group) 2.E linearly (with respect to a difference equation) 104.D path, d-trial 346.G dependent points (in an affine space) 7.A (in a projective space) 343.B dependent set 66.G dependent variable 165.C depending choice, principle of 33.F depth (of an ideal) 67 E de Rham cohomology group 201.H de Rham cohomology group (of a differentiable manifold) 105.R de Rham cohomology ring (of a differentiable manifold) 105.R de Rham cohomology ring (of a topological space) 20 1.1 de Rham complex (as an elliptic complex) 237.H de Rham decomposition (of a Riemannian manifold) 364.E de Rham equation, 274.G de Rham homology theory 114.L de Rham system, partial 274.G de Rham theorem (on a C"-manifold) 105.V 20l.H analog of 21.L 
Subject Index Derivable derivable approximately (measurable function) 100.B in the general sense (a set function) 380.0 in the ordinary sense (a set function) 380.0 deri va tion (of an algebra) 200.L (of an algebraic function field) 16.0 (of a commutative ring) 113 (of a field) 149.L (of a Lie algebra) 248.H (of a linear operator in a C*-algebra) 36.K *- 36.K inner (in an associative algebra) 2oo.L inner (in a Lie algebra) 248.H invariant (on an Abelian variety) 3.F over k 149.L Lie algebra of 248.H derivation tree 31.E derivative (of a distribution) 125.E (of a function) 106.A (of a holomorphic function) 198.A (of a hyperfunction) 125.X (of a polynomial) 337.G (of an element in a differential ring) 113 angular (of a holomorphic function) 43.K approximate (of a measurable function) 100.B covariant (of a tensor field) 80.1, App. A, Table 4.11 covariant (of a tensor field in the direction of a tangent vector) 417.B covariant (of a vector field) 80.1 covariant (of a vector field along a curve) 80.1 directional 106.G distribution 125.E exterior (of a differential form) 105.Q first-order 106.A Frechet 286.E free 235.C Gateaux 286.E general (of a set function) 380.0 generalized 125.E general lower (of a set function) 380.0 general upper (of a set function) 380.0 higher-order (of a differentiable function) 106.D. App. A, Table 9.II1 higher-order partial 106.H Lagrangian 205.A left (on the left) 106.A Lie (of a differential form) 105.Q Lie (of a tensor field) 105.0 normal 106.G nth (of a differentiable function) 106.0 ordinary (of a set function) 380.0 ordinary lower (of a set function) 380.D ordinary upper (of a set function) 380.0 partial 106.F,K partial, nth order 106.H at a point 106.A Radon-Nikodym 380.C right (on the right) 106.A Schwarzian App. A, Table 9.II1 spherical (for an analytic or meromorphic function) 435.E variational 46.B weak 125.E derivatives and primitive functions App. A, Table 9.1 derived (language) 31.D derived algebra (of a Lie algebra) 248.C derived function 106.A 1964 nth 106.0 derived group (of a group) 190.H derived neighborhood 65.C second barycentric 65.C derived normal model (of a variety) 16.F derived series (of a Lie algebra) 248.C derived set (of a set) 425.0 derived sheaf 125.W derived unit 414.P Oesarguesian geometry, non- 155.E 343.C Oesargues theorem 155.E 343.C Oescartes, R. 101 folium of 93.H Oescartes theorem 10.E descending central series (of a Lie algebra) 248.C descending chain (in a lattice) 243.F (of (normal) subgroups of a group) 190.F (in an ordered set) 311.C descending chain condition (for a (normal) subgroup of a group) 190.F (in an ordered set) 311.C descent curve of steepest 46.A line of swiftest 93.H method of steepest 212.C descriptive set theory classical 356.H effective 356.H design block (block design) central composite 102.M completely randomized 102.A factorial 102.H first-order 102.M fractional factorial 102.1 second-order 102.M Youden square 102.K design matrix 102.A 403.0 design-of-experiment analysis 403.0 design of experiments 102 designs for estimating parameters 102 M designs for exploring a response surface 102.M designs for two-way elimination of heterogeneity 102K desingularization (of an analytic space) 23.0 de Sitter space 355.D desuspend 114.L detecting, error- 63.A determinacy axiom of 22.H projective 22.H determinant(s) 103 (of an element of the general linear group over a noncommutative field) 60.0 (of a nuclear operator) 68.L Casorati 104.0 critical 182.B cyclic 103.G Fredholm 217.E Gramian 103.G 208.E Hankel 142.E Hill 268.B infinite (in Hill's method of solution) 268.B Jacobian 208.B of a lattice 182.B Lopatinski 325.K Pfaffian 103.G Vandermonde 103.G Wronskian 208.E 
1965 determinantal equation, Hill 268.B determinant factor (of a matrix) 269.E determinateness, axiom of 33.F determination, coefficient of 397.H,J determination, orbit 309.A determined system of differential operators 112.R of partial differential equations 320.F determining set (of a domain in CO) 21.C deterministic (in prediction theory) 395.D (Turing machine) 31.B deterministic linear bounded automaton 31.D deterministic process 127.B Deuring-Heilbronn phenomenon 123.D developable function, asymptotically 30.A developable space 425.AA developable surface l11.I development along a curve 364.B of a curve 80.N 111.H 364.B de via tion large 250.B mean absolute 397.C standard 341.B 342.C 397.C deviation point 335.B devices, peripheral 75.B de Vries equation, Korteweg- 387.A (DF)-space 424.P DFT (Discrete Fourier Transform) 142.D diagonal (of a Cartesian product of sets) 381.B 436.A diagonalizable (linear transformation) 269.L diagonalizable operator (in an Abelian von Neu- mann algebra) 308.G diagonal mapping (of a graded co algebra) 203.B,F diagonal matrix 269.A diagonal morphism (in a category) 52.E diagonal partial sum (of a double series) 379.E diagonal sum (of a matrix) 269.F diagram 52.C (of a symmetric Riemann space) 413.F arrow 281.D associated (in irreducible representations of orthogonal groups) 60.1 in a category 52.C commutative 52.C Coxeter (of a complex semisimple Lie algebra) 248.S Dynkin (of a complex semisimple Lie algebra) 248.S, App. A, Table 5.1 extended Dynkin App. A, Table 5.1 Feynman 146.B mutually associated 60J Newton 254.D Satake (of a compact symmetric Riemannian space) 437.AA Satake (of a real semisimple Lie algebra) 248.U, App. A, Table 5.n scatter 397.H Schlitfli (of a complex semisimple Lie algebra) 248.S Young 362.H diameter (of a central conic) 78.G (of a solid sphere) 140 (of a subset in a metric space) 273.B conjugate (of a diameter of a central conic) 78.G transfinite 48.D Subject Index Differentiable diathermal wall 419.A dichotomy 398.C Dido's problem 228.A dielectric constant 130.B Dieudonne complete 436.1 Dieudonne theorem 425.X diffeomorphic Coo-manifolds 105.1 diffeomorphism(s) Anosov 126.1 136.G Axiom A 126.1 C'- (in nonlinear functional analysis) 286.E of class C' 105.1 group of orientation-preserving 114.1 horse-shoe 126.1 minimal 126.N Morse-Smale 126.1 y- 136.G diffeotopy theorem 178.E difference 102.E 104.A backward 223.C, App. A, Table 21 central 223.C 304.E, App. A, Table 21 divided 223.D finite 223.C forward 304.E of the nth order 104.A primary 305.C second 104.A symmetric 304.E of two sets 381.B difference analog 304.E difference co cycle 305.B difference-differential equation 163.A difference equation 104 geometric 104.G homogeneous 104.C inhomogeneous 104.C linear 104.C nonhomogeneous 104.C difference group (of an additive group) 190.C difference method 303.A difference product 337.1 difference quotient 104.A difference scheme 304.F of backward type 304.F of forward type 304.F difference set 102.E difference table 223.C diff eren t (of an algebraic number field) 14.1 (of a maximal order) 27.B relative 14.1 differentiable complex function 21.C Frechet 286.E Gateaux 286.E infinitely 106.K left (on the left) 106.A n-times 106.D n-times continuously 106.K with respect to the parameter (a distribution) 125.H partially 106.F at a point (a complex function) 198.A at a point (a real function) 106.A right (on the right) 106.A in the sense of Stolz 106.G on a set 106.A termwise (infinite series with function terms) 379.H totally 21.C 106.G 
Subject Index Differentiable dynamical system of class C' differentiable dynamical systcm of class C' l26.B differentiable manifold(s) 105 with boundary of class C' 105.E of class C' 105.D Riemann-Roch theorem for 237.G different,able mapping of class C' 105.1 differential of (at a point) 105.J differentiable pinching problem l78.E differentiable semigroup 378.F differentiable slice theorem 431.C differentiable structure(,) l14.B of class C' 105.D group of oriented (on a combinatorial sphere) 114.1 differentiable transformation group 431.C differential (= boundary operator) 200.H (coboundary operator) 200.F (of a differentiable function) 106.B (of a differentiable mapping at a point) 105.J (of a function on a differentiable manifold) 105.1 (Frechet derivative) 286.E Abelian (of the first. second. third kind) l1.C 367.H analytic (on a Riemann surface) 367.H conjugate (on a Riemann surface) 367.H covariant (of a differential form) 80.G covariant (of a tensor field) 80.I.L 417.B covariant (of a vector field) 80.1 exterior (of a differential form) 105.Q harmonic (on a Riemann surface) 367.H holomorphic (on a Riemann surface) 367.H kernel 188.G meromorphic (on a Riemann surface) 367.H nth lof a differentiable function) 106.D of n1h order (of a differentiable function) 106.D D- (on an algebraic curve) 9.F partial 200.H quadratic (on a Riemann ,urface) 11.D rth (Frechet derivative) 286.E stochastic 406.C total 106.G 200.H 367.H differential analyzer 19.E differential and integral calculu, App. A, Table 9 differential automorphism J 13 differential calculus 106 differential coefficIent 106.A partial 106.A dIfferential cross section 375 A 386.B differential divisor (of an algebraic curve) 9.C differential divisor class (of a Riemann surface) l1.D differential equation(s) 3l3.A adjoint 252.K algebraic 113 288.A almost periodic 290.A Beltrami 352.B Bernoulli App. A, Table 14.1 Bessel 39.B. App. A, Table 14.11 Briot-Bouquet 288.B 289.B Caianicl10 291.F Cauchy- Riemann 198.A Cauchy-Riemann (for a holomorphic function of sevcral complex variables) 21.C Cauchy-Riemann (for a holomorphic function of 1 wo complex variables) 320.F Chaplygin 326.B Chebyshev App. A. Table 14.1 1966 Cherwell-Wright 291.F Clairaut App. A, Table 14.1 Clairaut partial App. A, Table 15.11 confluent 167.A confluent hypergeometric l67.A, App. A, Table 14.11 delay l63.A with deviating argument 163.A difference- 163.A Dulling 290.C elliptic partial App. A, Table 15. VI Emden 291.F Euler (in dynamics of rigid bodies) 27l.E Euler-Lagrange 46.B Euler linear ordinary App. A, Table 14.1 exact App. A, Table 14.1 Fokker-Planck partial Il5.A functional- J 63.A Gauss hypergeometric App. A, Table 14.11 Gaus,ian 206.A generalized Lame 167.E generalized Riccati App. A, Table l4.I Hamilton 324.E Hamilton-Jacobi 23.B 324.E Helmholtz l88.D, App. A, Table 15.VI Hermite App. A, Tables 14.11 20.111 Hill 268.B Hodgkin-Huxley 291.F hyperbolic 325 hyperbolic partial 325 hypergeometric 206.A. App. A, Table 18.1 hyperspherical 393.E integro- l63.A 222 integro-, of Fredholm type 222.A integro-, of Volterra type 222.A Jacobi App. A, Tables 14.11 20.V Killing 364.F Kummer App. A, Table 19.1 with lag 163.A Lagrange 320.A, App. A, Table 14.1 Lagrange partial App. A, Table 5.11 Laguerre App. A, Tables 14.11 20.VI Lame l13.B Laplace 323.A, App. A, Table 15 III Laplace, in the 2-dimensional case App. A, Table 15.VI Laplace, in the 3-dimensional case App. A, Table l5.VI Legendre 393.B, App. A, Table 14.11 Legendre associated 393.A Lienard 290.C linear ordinary 252.A 313.A linear partial 320.A Lowner 438.B Mathieu 268.A matrix Riccati 86.E modified Mathieu 268.A Monge 324.F Monge-Ampere 278.A, App. A, Table 15.\11 nonlinear ordinary 313.A nonlinear partial 320.A ordinary 313 partial 3l3.A 320 partial, of elIiptic type 323 partial, of hyperbolic type 325 partial, of mixed type 326 partial, of parabolic type 327 Poisson 323.A. App. A. Table 15.I1I polytropic 291 F rational 288.A 
1967 related 254.F with retardation 163.A retarded 163.A Riccati App. A, Table 14.1 Riemann App. A, Table 18.1 self-adjoint 252.K self-adjoint system of 252.K stochastic 342.A 406 Stokes 167.E 188.E strongly nonlinear 290.C system of. of Maurer-Cartan 249.R system of hyperbolic (in the sense of Petrovskii) 325.G system of linear, of the first order 252.G system of ordinary 313.B system of partial, of order I (on a differentiable manifold) 428.F system of total 428.A Tissot-Pochhammer 206.C total 428.A, App. A, Table 15.1 Tricomi 326.C van der Pol 290.C weakly nonlinear 290.C Weber 167.C, App. A, Table 20.111 Weber-Hermite l67.C Whittaker 167.B, App. A, Tables 14.11 19.11 differential extension ring 113 differential field 113 Galois theory of 1 13 differential form 105.Q closed 105.Q of degree I 105.0 of degree r 105.Q of degree r (on an algebraic variety) 16.0 divisor of (on an algebraic variety) 16.0 exact 105.Q exterior, of degree r 105.Q of the first kind (on an algebraic variety) 16.0 of the first kind (on a nonsingular curve) 9.E harmonic 194.B holomorphic, of degree k 72.A invariant (on an Abelian variety) 3.F of Maurer-Cartan 249.R primitive 232.B of the second kind (on a nonsingular curve) 9.E of the third kind (on a nonsingular curve) 9.E of type (r, 5) 72.C differential geometry 109, App. A, Table 4 affine 11 O.C conformal 110.D projective 11 O.B differential geometry in specific spaces 110 differential geometry of curves and surfaces ]]1 differential ideal 1 13 of a differential ring 113 involutive 428.E prime (of a differential ring) 113 semiprime (of a differential ring) 113 sheaf on a real analytic manifold 428. E differential index (in a covering of a nonsingular curve) 9.1 differential invariant fundamental (of a surface) 110.B on an tn-dimensional surface llO.A Poincare 74.G differential law 107.A differential operator(s) 112 223.C 306.B Beltrami, of the first kind App. A, Table 4.11 Subject Index Dimension Beltrami, of the second kind App. A, Table 4.11 elIiptic 1 12.A of the kth order 237.H ordinary 1 12.A partial ll2.A pseudo- 345 strongly elliptic 112.G 323.H system of 1 12.R differential polynomial(s) 113 ring of 113 differential quotient (at a point) 106.A differential representation (of a unitary representa- tion of a Lie group) 439.S differential rings 113 differen tial subring 113 differential system 191.1 restricted 191.1 differential topology 114 differential variable 113 differentiation (in a commutative ring) 113 (of a differential function) 106.A graphical 19.B higher (in a commutative ring) 113 logarithmic App. A, Table 9.1 numerical 299.E partial 106.F theorem of term wise (on distribution) 125.G of a vector field App. A, Table 3.11 diffraction (of waves) 446 diffusion, Ehrenfest model of 260.A diffusion-convection equation 304.B diffusion kernel 338.N diffusion process 115 on manifolds 11 5.D multidimensional 115.C digamma function 174.B digital computer 75.B digital quantity 138.B dihedral group 151.G dilatation in Laguerre geometry 76.B maximal 352.B dilated maximum principle (in potential theory) 338.C dilation (of a linear operator) 251.M power 251.M strong 251. M unitary 251.M dilation theorem 251.M dimension (of an affine space) 7.A (of an algebraic variety) 16.A (of an analytic set) 23.B (of an automorphic form) 32.B (of a celI complex) 70.D (of a convex cell in an affine space) 7.D (of a divisor class on a Riemann surface) 11.D (of a Euclidean simplicial complex) 70.B (of a free module) 277.G (of a Hilbert space) 197.C (of a linear space) 256.C (ofa linear system of divisors) 9.C 16.N (of a physical quantity) 116 (of a projective space) 343.B (of a simplicial complex) 70.C (of a topological space) 117 algebraic (of an algebraic surface) 72.F capacity 48.G 
Subject Index Dimension - k cohomological (of an associative algebra) 200.L cohomological (of a scheme) 16.E cohomological (of a topological space) 117.F complex (of a complex manifold) 72.A covering (of a normal space) 1 17.B decomposition theorem for 117.C geometric (of a vector bundle) 114.D global (of an analytic set) 23.B global (of a ring) 2oo.K harmonic (of a Heins end) 367.E Hausdorff 117.G 234.E 246.K homological (of a module) 200.K homological (of a topological space) 117.F injective (of a module) 2oo.K Kodaira (of a compact complex manifold) 72.1 Krull 67.E large inductive 117.B Lebesgue (of a normal space) 117.B left global (of a ring) 200.K local (of an analytic set at a point) 23.B product theorem for 117.C projective (of a module) 2oo.K right global (of a ring) 2oo.K small inductive 117.B sum theorem for 117.C theorem on in variance of, of Euclidean spaces \l7.D weak (of a module) 2oo.K weak global (of a ring) 200.K dimension - k automorphic form of 32.B Fuchsian form of 32.B Hilbert modular form of 32.G Siegel modular form of 32.F dimensional analysis 116 dimensional formula 116 dimension function (on a continuous geometry) 85.A dimension theorem (of affine geometry) 7.A (on modular lattice) 243.F (of projective geometry) 343.B dimension theory 117 dimension type (of a topological space) 117.H Dini-Hukuhara theorem 314.D Dini-Lipschitz test (on the convergence of Fourier series) 159.B Dini series 39.C Dini surface 111.I Dini test (on the convergence of Fourier series) 159.B Dini theorem (on uniform convergence) 435.B Diocles, cissoid of 93.H Diophantine (relation) 97 Diophantine analysis 296.A Diophantine approximation 182.F Diophantine equations 118 Dirac delta function App. A, Table 12.11 Dirac distribution 125.C Dirac equation 377.C 415.G Dirac field, free 377.C Dirac y-matrix 415.G Dirac hole theory 415.G Dirac matrix 377.C direct analytic continuation 198.G direct circle 78.D direct closed path 186.F direct decomposition (on a group) 190.L directed family 165.D 1968 directed gra ph 186. B directed set 311.D direct factor (of a direct product of sets) 381.E (of a group) 190.L direct image (of a sheaf) 383.G direct integral 308.G of unitary representations 437.H direction asymptotic I11.H Borel (of a meromorphic function) 272.F Julia (of a transcendental entire function) 429.C positive (in a curvilinear integral) 198.B principal (of a surface) II1.H directional derivative 106.G direction ratio (of a line in an affine sp,lce) 7.F direct limit (of a direct system of sets) 210.B direct method 46.E 302.B direct path 186.F direct product (of algebras) 29.A (of distributions) 125.K (of a family of lattices) 243.C (of a family of ordered sets) 31 J. F (of a family of sets) 381.E (of a family of topological groups; 423.C (of a family of topological spaces) 425.K (of groups) 190.L (of G-sets) 362.B (of Lie groups) 249.H (of mappings) 381.C (of measurable transformations) 136.D (of modules) 277.F (of objects of a category) 52.E (of rings) 368.E (of sets) 381.B (of sheaves) 383.1 restricted (of an infinite number of groups) 190.L restricted (of locally compact groups) 6.B semi- (of groups) 190.N direct product decomposition 190.L dual 422.H directrix (of an ellipse) 78.B of Wilczynski 110.B direct set, increasing 308.A direct sum (of a family of ordered sets) 311.F (ofa family of sets) 381.E (of G-sets) 362.B (of Hilbert spaces) 197.E (of ideals of a ring) 368. F (of an infinite number of groups) 190.L (of Lie algebras) 248.A (of linear representations) 362.(' (of\inear spaces) 256.F (of modules) 277.B,F (of a mutually disjoint family ofs,ts) 381.D (of quadratic forms) 348.E (of sheaves) 383.1 (of topological groups) 423.C (of two objects) 52.E (of unitary representations) 437 G integral 308.G topological (of topological space!.) 425.M direct summand (of a direct sum of sel s) 381.E direct sys tem (of sets) 21 O. B direct transcendental singularity 19f:.P Dirichlet, P. G. L. 119 
1969 Dirichlet algebra 164.B weak* 164.G Dirichlet character 295.D Dirichlet discontinuous factor App. A, Table 9.Y Dirichlet distribution 341.D, App. A, Table 22 Dirichlet divisor problem 242.A Dirichlet domain 120.A Dirichlet drawer principle 182.F Dirichlet form 261.C regular 261.C Dirichlet function 84.D 221.A Dirichlet functional 334.C Dirichlet integral (in the Dirichlet problem) 120.F (in Fourier's single integral theorem) 160.B Dirichlet kernel 159.B Dirichlet L-function 450.C Dirichlet principle 120.A 323.C Dirichlet problem 120 293.F 323.C Dirichlet problem with obstacle 440.B Dirichlet region 234.C Dirichlet series 121.A ordinary 121.A of the type {An} 121.A Dirichlet space 338.Q Dirichlet test (on Abel partial summation) 379.D Dirichlet test (on the convergence of Fourier series) 159.B Dirichlet theorem (of absolute convergence) 379.C (on the distribution of primes in arithmetical progression) 123.D Dirichlet unit theorem 14.D discharge of double negation 41 1.1 discharging 157.D disconnected, extremely 37.M disconnected metric space, totally 79.D discontinuity of the first kind 84.B fixed point of (of a stochastic process) 407.A at most of the first kind 84.B region of 234.A discontinuity formula 146.C 386.C discontinuity point 84.B of the first kind 84.B of the second kind 84.B discontinuous distribution, purely 341.D discontinuous factor, Dirichlet App. A, Table 9.Y discontinuous groups 122 of the first kind 122.B discontinuous transformation group 122.A properly 122.A discontinuum, Cantor 79.D discrete covering 425.R (J- 425.R discrete C-flow 126.B discrete dynamical system of class C 126. B discrete flow 126.B of class C 126.B discrete Fourier transform 142.D discrete mathematics 66.A discrete memory less channels 2I3.F discrete metric space 273.B discrete semiflow 126.B of class C 126. B discrete series (of unitary representations of a semi- simple Lie group) 437.X discrete series, principal 258.C discrete set 425.0 Subject Index Distance function discrete spectrum 136.E 390.E quasi- 136.E discrete topological space 425.C discrete topology 425.C discrete uniformity 436.D discrete valuation 439.E discrete valuation ring 439.E discrete variable method 303.A discrete von Neumann algebra 308.E discretization error 303.B discriminant (of an algebraic equation) 337.J (of an algebraic number field) 14.B (of a binary quadratic form) 348.M (of a curve of the second order) 78.1 (of a family of curves) 93.1 (of a quadratic form) 348.A (of a simple ring) 27.B fundamental 295.D relative 14.J discriminant function, linear 280.1 discriminant theorem, Dedekind 14.J disintegration 270.L disjoint family, mutually (of sets) 381.D disjoint sets 381.B disjoint sum 381.B disjoint union 381. B of a mutually disjoint family of sets 381.D disjoint unitary representations 437.C disjunction (of propositions) 411.B disjunctive cuts 215.C disjunctive programming 264.C disk 140 circular 140 n- 140 open 140 open n- 140 unit 140 disk algebra 164.B disk theorem (on meromorphic functions) 272.1 dispersion 397.C dispersion relations 132.C dispersive (flow) 126.E (linear operator) 286.Y dispersive wave 446 displacement electric 130.A parallel (of a tangent vector space) 80.H 364.B parallel, along a curve 80.C dissection, Farey 4.B dissipative (operator) 251.J 286.C maximal 251.J dissipative part (of a state space) 260.B distance (in Euclidean geometry) 139.E (in a metric space) 273.B Euclidean 139.E extremal 143.B Frechet (between surfaces) 246.1 Hamming 63.B 136.E Levy 34I.F Mahalanobis generalized 280.E non-Euclidean (in a Klein model) 285.C optical 180.A perihelion 309.B reduced extremal 143.B distance function 273.B pseudo- 273.B 
Subject Index Distinct differentiable structures distinct differentiable structures I 14.B distinct system of parameters 284.D distinguishable, finitely (hypothesis) 400.K distinguished basis, strongly 418.F distinguished pseudopolynomial 2J.E distortion function, rate 2 J3.E distortion ineq ualities 438.B distortion measure 2 J3.E distortion theorem 438.B d istributcd asymptotically 374.D uniformly 182.H distribution(s) 125 (on a differentiable manifold) 125.R (of random variables) 342.C (of a vector bundle) 428.D a posteriori 398.B a priori 398.B asymptotically normal 399.K beta 34J.D, App. A, Table 22 Beurling generalized 125.U binomial 341.D 397.F, App. A, Table 22 bivariate 397.H capacity mas. 338.K Cauchy 34J.D, App. A, Table 22 chi-square 374.A, App. A, Table 22 conditional probability 342.E continuous 34J.D converge in (a sequence of random variables) 342.D covariance 395.C cumulative 397.B Dirac 12S.C Dirichlet 34J.D, App. A, Table 22 double, potential of 338.A entropy of a 403.B equilibrium, Gibbs 136.C equilibrium mass 338.K exponential 34J.D, App. A, Table 22 exponential family of 396.G F- 34J.D 374.B, App. A. Table 22 fiducial 401. F finite-dimensional 407.A of finite order 125.1 function, empirical 374.D gamma 34J.D, App. A, Table 22 Gaussian 34J.D geometric 34J.D, App. A, Table 22 hypergeometric 34J.D 397.F, App. A, Table 22 infinitely divisible 34J.G ini tial 26J.A initial law 406.D integrable 12S.N invariant (of a Markov chain) 260.A invariant (second quantization) 377.C involutive (on a differentiable manifold) 428.D joint 342.C k-dimensional normal 34J.D k-Erlang 260.H kth-order asymptotic 399.0 L- 34J.G lattice 34J.D law, Maxwell-Boltzmann 402.B least favorable 400.B least fa vorable a priori 398.H limit 250.A logarithmic App. A, Table 22 logarithmic normal App. A, Table 22 marginal 342.C 397.H 1970 multidimensional hypergeometric App. A, Table 22 multidimensional normal App. A, Table 22 multinomial 341.D multiple hypergeometric 341.D multivariate normal 397.J n-dimensional 342.C n-dimensional probability 342.C negative binomial 34J.D 397.F, App. A, Table 22 negative multinomial 34J.D negative polynomial App. A, Table 22 noncentral chi-square 374.B noncentral F- 374.B noncntral t- 374.B noncentral Wishart 374.C normal 34J.D 397.C, App. A, Table 22 one-dimensional probability, of a -andom variable 342.C one-side stable for exponent 1/2 App. A, Table 22 operator-valued 150D p-dimensional noncentral Wishart 374.C Pearson 397.D pluriharmonic 2J.C Poisson 341.D 397.F, App. A, Table 22 polynomial App. A, Table 22 population 396.B 40J.F positive 12S.C posterior 40J.B 403.G predictive 403.G of prime numbers 123 prior 40J.B 403.G probability 342.B, App. A, Table 22 probability, of a random variable 342.C purely discontinuous 34J.D quasistable 34J.G random 395.H 407.C random, with independent values 11 every point 407.C random, in the wider sense 395.C 407.C rapidly decreasing 12S.0 rectangular App. A, Table 22 sampling 374.A semis table 341.G simple, potential of 338.A simultaneous 342.C slowly increasing 125.N stable 341.G standard Gaussian 176.A standard normal 34J.D strictly stationary random 395.H strongly stationary random 395.H substituted 12S.Q t- 34J.D 374.B, App. A, Table 22 tempered 12S.N two-sided exponential App. A, Table 22 of typical random variables App. A, Table 22 ultra-, of class {Mp} or (M p ) 125 U,BB uniform 34J.D, App. A, Table 22 unit 34J.D value 124.A of values of functions of a com pIe). variable 124 waiting time 307.C weakly stationary random 39S.C' Wishart 374.C z- 341.D 374.B. App. A, Table 2:' distribution curve, cumulative 397.B 
1971 distribution derivative 125.E distnbution-free (test) 371.A distribution-free method 371.A distnbution function 168.B 341.B 342.C cumulative 341.B 342.C empirical 250.F 396.C n-dimcnsional 342.C symmetric 341.H unimodal 341.H distribution kernel 338.P distribution law, Maxwell-Boltzmann 402.B distribution polygon, cumulative 397.B distributIOn scmigroup 378.F distributive algebra 231.A distributive lattice 243.E distributive law (in algebra of sets) 3gl.B (on cardinal numbers) 49.C (in a lattice) 243.E (on natural numbers) 294 B (in a ring) 368.A complete (in a lattice-ordered group) 243.G disturbance 128.C diurnal aberration 392 div (divergence) 442.D diverge 87.B,E 379.A to C/j 87.D divergence (of a differentiable vector field) 442.D (of a vector field with respect to a Riemannian metric) 105.W (of a vector field with respect to a volume element) 105.W infrared 146.B ultraviolet 146.B divergence fOlm 323.D divergence theorem 94.D divcrgent (double series) 379.r: (infinite product) 379.G (integral) 216.E (sequence of real numbcrs) 87.B (series) 379.A properly 379.A divide (a bounded domain) 384 F divided difference 223 D dividing cycle (on an open Riemann surface) 367.I divisibility rclation (in a ring) 67.H divisible (Abelian p-group) 2 D (additIve group) 2.E (element of ring) 67.H 277 D (fractional ideal) 14.E (general Siegcl domain) 384.F (number) 297.A divisible A-module 277.D divisible subgroup (of a discrete Abelian group) 422.G division (of a pseudomanifold) 65.A simplicial 65.A division algcbra 29.A division algorithm of natural numbers 297.A of polynomials 337.C division ring 36g.B division theorem Spath type (for microuifferential operators) 274.E Weicrstrdss type (for microdifferential opera- tors) 274.E Subject Index Divisor divisor (in an algebraic curve) 9.C (of an algebraic function field of dimension I) 9.D (of an algebraic number field) 14.F (in an algebraic variety) 16.M (in a closed Riemann surface) II.D (in a complex manifold) 72.F (of an element ofa ring) 67.H (ofa fractional ideal) 14.E (of a number) 297.A ample 16.N branch (in a covering) 9.1 canonical (of an algebraic curve) 9.C canonical (of an algebraic variety) 16.0 canonical (of a Jacobian variety) 9.E canonical (of a Riemann surface) 11.D Cartier 16 M common (of elements of a ring) 67.H complete linear system defined by 16.N complex line bundle determined by 72.F differential (of an algebraic curve) 9.C of a differcntial form (on an algebraic variety) 16.0 effective (on an algebraic curve) 9.C effective (on a variety) 16.M elementary (of a matrix) 269.E embedded prime (of an ideal) 67.F finite prime 439.H of a function (on an algebraic curve) 9.C of a function (on an algebraic variety) 16. M greatest common 297.A greatest common (of an element of a ring) 67.H imaginary infinite prime 439.H infinite prime 439.H integral (of an algebraic curve) 9.C integral (of an algebraic number field) 14.F integral (on a Riemann surface) II.D isolated prime (of an ideal) 67.F k-rational (on an algebraic curve) 9.C linearly equivalent (of a complex manifold) 72.F maximal prime (of an ideal) 67.F minimal prime (of an ideal) 67.F nondegenerate 16.N nondegenerate (on an Abelian variety) 3.D numerically connected 232.D D-linearly equivalent (on an algebraic curve) 9.F pole (of a function on an algebraic variety) 16.M positive (of an algebraic curve) 9.C positive (on a Riemann surface) 11.D prime (of an algebraic function field of dimen- sion I) 9.D prime (of an algebraic number field or an algebraic function field of one variable) 439.H prime (of an ideal) 67.F pnme (on a Riemann surface) 11.D prime rational, over a field (on an algebraic curve) 9.C principal (on an algebraic curve) 9.C principal (on a Ricmann surfacc) 11.D real infinite prime 439.H real prime 439.H sheaf of ideals of (of a complex manifold) 72.F special 9.C very ample l6.N 
Subject Index Divisor class (on a Riemann surface) zero (of a function on an algebraic variety) 16M zero (of a ring) 368.B zero, with respect to M/P 284.A divisor class (on a Riemann surface) 11.D canonical 11.D differen tial 11. D divisor class group (of a Riemann surface) 11.D divisor function 295.C generalized 295.C divisor group (of a compact complex manifold) 72.F divisor problem, Dirichlet 242.A Dixmier theorem, Rellich- 351.C Dixon-Ferrar formula App. A, Table 19.1V DK method 301.F DKA method 301.F DLR equation 402.G dn App. A, Table 16.m dodecahedron 357.B Doetsch three-line theorem 43.E Dolbeault cohomology group 72.D Dolbeault complex 72.D Dolbeault lemma 72.D Dolbeault theorem n.D domain(s) (of a correspondence) 358.B (of a mapping) 37.C 381.C (in a topological space) 79.A (of a variable) 165.C angular 333.A annular 333.A of attraction 374.G Brouwer theorem on the invariance of 117.D Cartan pseudocon vex 21.1 circular 33 3.A of class CI,). 323.F closed plane 333.A complete Reinhardt 21.B convergence (of a power series) 21.B Courant-Cheng, theorem 391.H of dependence 325.B Dirichlet 120.A divisible bounded 284.F d-pseudoconvex 21.G effective 88.D fundamental 234.C generated Siegel 384.F holomorphically complete 21.F holomorphically convex 21.H of holomorphy 2l.F homogeneous bounded 384.A 412.F individual 411.H of influence 325.B integral 368.B of integration 216.F irreducible symmetric bounded 412.F Jordan 333.A Levi pseudocon vex 21.1 of a local homomorphism 423.0 locally Cartan pseudo convex 21.1 locally Levi pseudoconvex 21.1 nodal 391.H Noetherian 284.A Noetherian integral 284.A object 411.G of operator 409.A operator (of a group) 190.E plane 333 principal ideal 67.K 1972 pseudoconvex 21.G with regular boundary (in a Coo-manifold) 105.U Reinhardt 21.B Siegel 384.A Siegel, generalized 384.F Siegel, irreducible 384.E Siegel, of the first kind 384.A Siegel, of the second kind 384.A Siegel, of the third kind 384.A slit 333.A with smooth boundary (in a C"-manifold) 105.U spectrum of 391.A strongly pseudoconvex 21.G sweepable bounded 284.F symmetric bounded 412.F unique factorization 40.H universal 16.A Weil 21.G domain kernel (of a sequence of domains) 333.C dominant (of a sequence of functions) 435.A dominant integral form (on a Cartan subalgebra) 248.W dominate (an imputation of a game) 173.D dominated (by a family of topological spaces) 425.M (statistical structures) 396.F weakly (statistical structure) 396.F dominated ergodic theorem 136.B dominating set 186.1 domination, number of 186.1 domination principle 338.L inverse 338.L Donsker invariance principle 250.E Doob-Meyer decomposition theorem 262.D Doolittle method 302.B dotted indices 258.B dotted spinor of rank k 258.B Douady space 23.G double chain complex 2oo.E double complex 2oo.H double coset (of two subgroups of a group) 190.C double distribution, potential of 338.A double exponential formula 299.B double integral 216.F double integral theorem, Fourier 160.B double layer, potential of 338.A double mathematical induction 294.13 double negation, discharge of 411.1 double point, rational 418.C double ratio 343.E double sampling inspection 404.C double sequence 379.E double series 379.E absolutely convergent 379.E conditionally convergent 379.E convergent 379.E divergent 379.E Weierstrass theorem of 379.H double suspension theorem 65.C double-valued representation 258.B doubly invariant 164.H doubly periodic function 134.E Douglas algebra 164.1 Douglas functional 334.C Douglas-Rad6 solution (to Plateau's problem) 275.C downhill method 301.L down-ladder 206.B 
1973 drawer principle, Dirichlet 182.F drift transformation by 261.F transformation of 406.B drift part 406.B dual (cell) 65.B (graded module) 203.B (graph) 186.H (matroid) 66.H (proposition in a projective space) 343.B (regular polyhedron) 357.B (symmetric Riemannian space) 412.D (topological group) 422.C 437.J dual algebra 203.F dua] basis (of a linear space) 256.G dual bundle 147.F dual category 52.F dual cell, (n - q)- 65.B dual co algebra 203.F dual complex 65.B dual cone 125.BB dual convex cone 89.F dual curve (of a plane algebraic curve) 9.B dual direct product decomposition 422.H dual frame 417.B dual homomorphism (of a homomorphism of algebraic tori) 13.D (of1attices) 243.C dual Hopf algebra 203.C dual isomorphism (of1attices) 243.C (between ordered sets) 311.E duality (in field theory) 150.E (for symmetric Riemannian space) 412.D Martineau-Harvey 125.Y Poincare (in manifolds) 201.0 Poincare (in Weil cohomology) 450.Q principle of (in projective geometry) 343.B duality mapping 251.1 duality principle (for closed convex cone) 89.F (for ordering) 311.A duality property (of1inear space) 256.G duality theorem (on Abelian varieties) 3.D (of linear programming) 255.B (in mathematical programming) 292.D Alexander 201.0 for Q-module 422.L Poincare-Lefschetz 201.0 Pontryagin (on topological Abelian groups) I92.K 422.C Serre (on complex manifolds) 72.E Serre (on projective varieties) 16.E of Takesaki 308.1 Tannaka (on compact groups) 69.D Tannaka (on compact Lie group) 249.U dual lattice 243.C 31 O.E 450.K dual linear space 256.G self- 256.H dually isomorphic (lattices) 243.C dual mapping (of a linear mapping) 256.G dual Martin boundary 260.1 dual module 277.K dual operator in Banach space 37.D of a differential operator 125.F of a linear operator 251.D Subject Index €-flat dual ordering 311.A dual passive boundary point 260.1 dual problem 255.B 349.B dual process 261.F dual representation 362.E dual resonance model 132.C dual semigroup 378.F dual space (of a C*-algebra) 36.G (of a linear space) 256.G (of a linear topological space) 424.D (of a locally compact group) 437.J (of a normed linear space) 37.D (of a projective space) 343.B quasi- (of a locally compact group) 437.1 ,..,+.............. A,)II V ;:)UUlle;. '"t'-t..I'Ir.. dual subdivision 65.B dual vector bundle 147.F du Bois-Reymond and Dedekind, test of 379.D du Bois Reymond problem 159.H Duffing differential equation 290.C Duhamel method 322.D Duhem relation, Gibbs- 419.B dummy index (of a tensor) 256.J Dunford integrable 443.F Dunford integral 251.G 443.F Dunford-Pettis theorem 68.M Donford-Schwartz integral, 8artle- 443.G duo-trio test 346.D Dupin, cyclide of 111.H Dupin indicatrix 111.H duplication of a cube 179.A Durand-Kerner-Aberth (DKA) method 30l.F Durand-Kerner (DK) method 301.F Dvoretzky-Rogers theorem 443.D dyadic compactum 79.D dynamical system(s) 126 classical 126.L 136.G continuous 126.B differential 126.B discrete 126. B linear 86.B dynamic programming 127264.C dynamic programming model 307.C dynamics analytical 271. F fluid 205.A magnetofluid 259 quantum flavor 132.D dynamo theory, hydro magnetic 259 Dynkin class 270.B Dynkin class theorem 270.B Dynkin diagram (of a complex semisimple Lie algebra) 248.S extended App. A, Table 5.1 Dynkin formula 261.C Dynkin representation of generator 261.C E g (topology) 424.R g, Eddington's App. A, Table 4.11 C(Q) (=Coo(Q)) 125.1 168.B C'(Q) 125.1 C{M,},C(M,) 168.B g-covering 273.B g-entropy 213.E g-expansion 111.C g-factor 450.N g-flat 178.D 
Subject Index i:-Hermitian form I:-Hermitian form 60.0 c.-mdependent partitions l36.E I:-induction, axiom of 33.B ,:-neighborhood (of a point) 273.C I:-number 3l2.e I.-operator, Hilbert 4l1.J E-quantifier, Hilbert 411.1 I:-sphere (of a point) 273.C I.-symbol, Hilbert 411.J I:-tensor product 424.R ,,-theorcm (in predicate logic) 411.J L-trace form 60.0 6' -function 46.e o -space 193.N H-function 430.D H-optimality 102.E E waves 130.B Eberlein-Shmul'yan theorem 37.G Eberlein theorem 424.V eccentric angle of a point on a hyperbola 78.E of a point on an ellipse 78 E eccentric anomaly 309.B eccentricity (of a conic section) 78.B cchelon space 168.B ecliptic 392 econometrics 128 Eddington's ,: App. A, Table 4.11 edge (of a convex cell in an affine space) 7.D (m a graph) l86.B (of a linear graph) 282.A reference 281.C cdge homomorphism 200J cdge of the wedge theorem 125. W Edgeworth cxpansion 374 F cffcct 403.D block 102.B factorial 102.H fixed 102.A fixed, model 102.A main 102.H random 102.A random, model 102.A treatment 102.B effective descriptive set theory 3S6.H effective di visor (on an algebraic curve) 9.e (on a variety) 16.M effective domain 88.D effective genus (of an algebraic curve) 9.C effectively (act on a G-space) 43 I.A almost 431.A effectively calculable function 356.C effectively given (object) 22.A effectively parametrized (at 0) n.G effect vector 102.A efficiency 399.D asymptotic 399.N Bahadur 400.K second-order 399.0 second-order asymptotic 399.0 efficiency-balanced block design 102.E efficient kth order asymptotic 399.0 efficient estimator 399.D asymptotically 399.N first-ordcr 399.0 first-order asymptotic 399.0 Egervary theorem, Konig- 281.E 1974 Egorov theorem 270.J Ehrenfest model of diffusion 260.A Ehrenpreis-Malgrange theorem 112.13 Eichler approximation theorem 27.D eigenchain 390.H eigenelement (of a linear operator) 390.A eigenfunction (ofa boundary value problem) 315.B (for an integral equation) 2l7.F (of a linear operator) 390.A generalized 375.C eigenspace (of a linear mapping) 269.L (of a linear operator) 390.A generalized 390.B in a weaker sense 269.L eigenvalue(s) (of a boundary value problem) 31S.B (of an integral equation) 2l7.F (of a linear mapping) 269.L (of a linear operator) 390.A (of the Mathieu equation) 268.B (of a matrix) 269.F degenerate 390.A generalized 37S.e geomctrically simple 390.A index of 217.F multiplicity of 217.F numerical computation of 298 eigenvalue problem 390.A generalized 298.G eigenvector (of a linear mapping) 269.L (of a linear operator) 390.A (of a matrix) 269.F generalized 390.B eightfold way 132.D eikonal 82.D 180.C eikonal equation 324.E 32S.L Eilenberg-MacLane complexes 70.F Eilenberg-MacLane space 70.F Eilenberg-MacLane spectrum 202.T Eilenberg-Postnikov invariants (of a CW complex) 70.G Eilenberg-Steenrod axioms 201.Q Eilenberg-Zilber theorem 201.J Einstein, A. ! 29 Einstein convention (on tensors) 2S6J Einstein-Kiihler metric 232.C Einstein metric 364.1 Einstein relation (in diffusion) 1, 8.A Einstein space 364.D, App. A, Table 4.11 Einstein summation convention 4l7.B Eisenstein- Poincare series 32.F Eisenstein series 32.e generalized 450.T Eisenstein theorem 337.F elastic, total, cross section 386.B elasticity modulus of, in shear 271.G modulus of, in tension 271.G small-displacement theory cf 271.G theory of 271.G elastic limit 271.G elastic scattering 375.A elation 110.D electric displacement 130.A electric field 130.A electric flux density 130.A electric network 282.B 
1975 electric polarization 130.A electric susccptibility 130.B clectric wavcs 130.B transvcrse 130.B electrodynamics, quantum 132.C clcctromagnetic wave 446 theory of 130.B transverse 130. B electromagnetism 130 electron 377 B electronic analog computcr 19.E electronic computer 75.A electrostatics 130.B clement(s) 381.A affine arc 11O.C algebraic (of a field) 149.E areal (in a Cartan space) I 52.C atomic (in a complemented modular lattice) 243.F boundary (in a simply connccted domain) 333.B canonical (in thc representation of a functor) 52.L Casimir (of a Lie algebra) 248.1 central (in a lattice) 243.E compact (of a topological Abelian group) 422.F conformal arc 11O.D conjugatc (in a field) 149.J conjugate (in a group) J 90.C contact 428.E contact (in a space with a Lie transformation group) 11O.A cyclic 25 I.J even (of a Clifford algebra) 61.B finite, method 304.C function 198.1 339.A function, in the wider sense 198.0 gcncralized nilpotent (in a commutative Banach algebra) 36.E generating 390.G greatest (in an ordered set) 311.B homogeneous (of a gradcd ring) 369.B homogeneous (of a homogeneous ring) 369.B hypcrsurface 324.B idempotent (of a ring) 368.B 450.0 identity (of an algebraic systcm) 409.C identity (of a field) 149.A identity (of a group) 190.A idcntity (of a ring) 368.A inscparable (of a field) J 49 H integral (of a system of total diffcrcntial equa- tiom,) 428.E invcrsc (in a group) 190.A inverse (in a ring) 368.B inverse function I98.L invertible (of a ring) 368.B irreducible (of a ring) 67.H isotropic (with respect to a quadratic form) 348.E k-dimcnsional integral 191.1 Kepler orbital 309.B least (in an ordcred set) 3 I l.B left inversc (of an element of a ring) 368.B line II1.C lincarly dependent 2.E linearly independent 2. E matrix 351.B maximal (in an ordered set) 311.B maximum (in an ordered set) 311.B Subject Index Elementary Hopf algebra minimal (in an ordered set) 311.B minimum (in an ordered set) 311.B negative (of an ordered field) 149.N neutral (in a lattice) 243.F nilpotent (of a ring) 368.B odd (of a Clifford algebra) 61.B ordinary 191.1 ordinary integral 428.E oriented (in a covering manifold of a homoge- neous space) I J O.A orthogonal (of a ring) 368. B osculating 309.D polar (of an analytic function in the wider sense) 198.0 polar (of an integral element) 428.E positive (of an ordered field) 149.N prime (of a ring) 67.H prime (for a valuation) 439.E primitive (of an extension of a field) 149.D projective line 110.B purely inseparable (of a field) 149.H quasi-inverse (in a ring) 368.B quasi-invertible (of a ring) 368. B quasiregular (of a ring) 368.B ramified J 98.0 rational 198.0 regular (of a connected Lie group) 249.P regular (of a ring) 368. B regular integral 428. E right inverse (in a ring) 368.B separable (of a field) 149.H singular (of a connected Lie group) 249.P singular (with respect to a quadratic form) 348.E surface 324. B surface, union of 324. B torsion (of an A-modulc) 277.D transcendental (of a field) 149.E transgressive (in the spectral sequence of a fiber space) 148.E triangular 304.C unit (of a field) 149.A unit (of a group) 190.A unit (of a ring) 368.A unity (of a ficld) 149.A unity (of a ring) 368.A volume (of an oriented Cif -manifold) 105.W volume, associated with a Riemannian metric 105.W zero (of an additive group) 2.E I90.A zero (of a field) 149.A zero (of a linear space) 256.A zero (of a ring) 368.A elementarily equivalent structures 276.D elementary (Kleinian group) 234.A elementary (path) 186.F elementary Abelian functions 3.M elementary Abelian group 2.B elementary catastrophe 51.E elementary collapsing 65.C clcmentary contract 102.C elementary divisor (of a matrix) 269.E simple (of a matrix) 269.E elementaryevent(s) 342.B space of 342. B elementary extension 276.D elementary function(s) 131 of class n 13 LA elementary Hopf algebra 203.D 
Subject Index Elementary ideal elementary ideal 235.C elementary kernel (of a linear partial differential operator) 320.H elementary number theory 297 fundamental theorem of, 297.C elementary partic1e(s) 132 elementary solution App. A, Table 15.V (of a differential operator) 112.B (of a linear partial differential operator) 320.H (of partial differential equations of elliptic type 323.B elementary symmetric function 337.1 elementary symmetric polynomial 337.1 elementary topological Abelian group 422.E eliminate (variables from a family of polynomials) 369.E elimination design for two-way, of heterogeneity 102.K forward 302.B Gaussian 302.B Gauss-Jordan 302.B elimination method, Sylvester 369.E ellipse 78.A ellipsoid 350.B of inertia 271.E of revolution 350.B ellipsoidal coordinates 90.C I 33.A, App. A, Table 3.V ellipsoidal harmonics I33.B four species of 133.C ellipsoidal type, special function of 389.A elliptic (differential operator) 112.A 323.A 237.H (pseudodifferential operator) 323.K (Riemann surface) 77.B 367.D (solution) 323.D analytically hypo- 323.1 analytic hypo- 112.D hypo- I 12.D 189.C 323.1 micro locally 345.A strongly 112.G 323.H elliptic omplex (on a compact Coo-manifold) 237.H elliptic coordinates 90.C 350.E, App. A, Table 3.V elliptic curve 9.C L-functions of 450.S elliptic cylinder 350.B elliptic cylinder function 268.B elliptic cylindrical coordinates App. A, Table 3.V elliptic cylindrical surface 350.B elliptic domain 77.B elliptic function(s) 134 of the first kind 134.G Jacobi App. A, Table 16.m of the second kind 134.G of the third kind 134.H Weierstrass App. A, Table 16.lV elliptic function field 9.D elliptic geometry 285.A elliptic integral I1.C I 34.A, App. A, Table 16.1 complete App. A, Table 16.1 of the first kind 134.A, App. A, Table 16.1 of the first kind, complete 134.B of the first kind, incomplete 134.B of the second kind I 34.A, App. A, Table 16.1 of tile second kind, complete 134.C of the third kind 134.A, App. A, Table 16.1 elliptic integrals and elliptic functions App. A, Table 16 elliptic irrational function 134.A 1976 elliptic modular group 122.D elliptic motions 55.A elliptic operator 323.H microlocally 345.A strongly 323.H elliptic paraboloid 350.B of revolution 350.B elliptic point (of a Fuchsian group) 122.C (on a surface) II1.H elliptic quadric hypersurface 350.G elliptic singularity 418.C minimally 418.C elliptic space 285.C elliptic surface 72.K elliptic theta function 134.1, App. A, Table 16.11 elliptic transformation 74.F elliptic type 323.A,D (Lie algebra) 191.D partial differential equation of 323, App. A, Table 15.VI elongation strain 271.G embedded (into a topological space) 425.J hyperbolically 21.0 embedded Markov chain 260.H embedded primary component (of an ideal) 67.F embedded prime divisor (of an ideal) 67.F embedding 105.K (of categories) 52.H (of a Coo-manifold) 105.K (of a topological space) 425.J formula of, form 303.D generalized Borel 384.D PL 65.D regular 105.K Tanaka 384.D toroidal 16.Z torus 16.Z embedding principle (in dynamic programming) 127.B embedding theorem full (of an Abelian category) 52.N Irwin 65.D Menger-Nobeling 117.D Sobolev-Besov 168.B Tikhonov 425T Emden differential equation 291.F Emden function, Lane- 291.F emission 325.A backward 325.A forward 325.A empirical characteristic function 396.C empirical constant 19.F empirical distribution function 250.F 374.E 396.C empirical formula 19.F empiricism, French 156.C empty set 33.B 381.A axiom of 33.B enantiomorphic pair 92.A enantiomorphous 92.A encoder 213.D encoding 63.A end (of an arc) 93.B (of a noncompact manifold) 178 F (of a segment) 155.B (of a segmenl in an affine space) 7.D Heins 367.E 
1977 lower (of a curvilinear integral) 94.D upper (of a curvilinear integral) 94.D endogenous variable 128.C endomorphism (of an algebraic system) 409.C (ofa group) 190.D (of a polarized Abelian variety) 3.G (of a probability space) 136.E (of a ring) 368.D anti- (ofa group) 190.D anti- (of a ring) 368.D aperiodic 136.E entropy of 136.E exact (of a measure space) 136.E periodic (at a point) 136.E ring of (of an Abelian variety) 3.C endomorphism ring (of an Abelian variety) 3.C (of a module) 277.B 368.C endpoint (of an ordinary curve) 93.C left (of an interval) 355.C right (of an interval) 355.C end vertex 186.B energy 195.B 338.B binding 351.D free 340.B 402.G Gibbs free 419.C Helmholtz free 419.C internal 419.A kinetic 271.C 351.D mean 402.G mean free 340.B 402.G mutual 338.B potential 271.C rest 359.C total 271.C energy density 195.B energy equation (for a fluid) 205.A energy function 126.L 279.F energy inequality 325.C energy integral 420.A energy minimum principle 419.C Gibbs 419.C energy-momentum 4-vector 258.C energy-momentum operator 258.D energy-momentum tensor 150.B 359.D energy principle 338.D energy spectrum function 433.C energy spectrum tensor 433.C energy surface 126.L 402.C,G Enestrom theorem, Kakeya- (on an algebraic equa- tion) 10.E Engel theorem 248.F engulfing lemma 65.C enlargement 293.B Enneper formula, Weierstrass- 275.A Enneper surface 275.B Enriques surface 72.K ensemble 402.D canonical 402.D grand canonical 402.D, G microcanonical 402.D Enskog method 217.N enthalpy 419.C enthalpy minimum principle 419.C entire algebroidal function 17.B entire function 429.A rational 429.A transcendental 429.A Subject Index Equation(s) entire linear transformation 74.E entrance 281.C entrance boundary (of a diffusion process) 115.B entrance boundary point 260.1 entropy 212.B (of an endomorphism) 136.E (in information theory) 213.B (in statistical mechanics) 402.B (in thermodynamics) 419.A closed system 402.G completely positive 136.E conditional 213.B of a distribution 403.B of the endomorphism q> 136.E e- (of source coding theorem) 213.E maximal 136.C,H mean 402.G open system 402.G of a partition 136.E relative 212.B topological 136.H topological, of f with respect to ()( 126.K entropy condition 204.G entropy conjecture 126.K entropy maximum principle 419.A entropy production, equation of 205.A entry (of a matrix) 269.A enumerable predicate, recursively 356.D enumerable set, recursively 356.D enumerating predicate 356.H enumeration method, implicit 215.D enumeration theorem 356.H P6lya's 66.E envelope (of a family of curves) 93.1 of holomorphy 21.F injective 200.1 lower 338.M upper (of a family of subharmonic functions) 193.R envelope power function 4oo.F enveloping algebra 231.A special universal (ofa Jordan algebra) 231.C universal (of a Lie algebra) 248.J enveloping surface 111.1 enveloping von Neumann algebra 36.G epicycloid 93.H epidemiology 40.E epimorphism (in a category) 52.D 2oo.Q epitrochoid 93.H Epstein zeta function 450.K equality Chapman-Kolmogorov 261.A J aco be- Beihler 328 Parse val 18.B 159.A 160.C 192.K 197.C 200.B,C,E equal weight, principle of 402.D equation(s) Abelian I72.G adjoint differential 252.K algebraic 10, App. A, Table 1 algebraic differential 113 288.A approximate functional (for zeta function) 450.B basic 320.E Bellman 405.B bifurcation 286.V binomial 10.C biquadratic App. A, Table 1 Boltzmann 41.A 402.B 
Subject Index Eqnation(s) Boussinesq 387.F Briot-Bouquet differential 288.B Callan-Symanzik 361.B canonical 324.E canonical forms of the (of surfaces) 350.B Cauchy-Riemann (differential) 21.C 198.A 274.G 320.F characteristic (for a homogeneous system of linear ordinary differential equations) 252.J characteristic (of a linear difference equation) 104.E characteristic (of a linear ordinary differential equation) 252.E characteristic (of a matrix) 269.F characteristic (of an autonomous linear system) 163.F characteristic (of a partial differential equation) 320.D characteristic (of a partial differential equation of hyperbolic type) 325.A,F Chaplygin differential 326.B Chapman-Kolmogorov 260.A 261.A Charpit subsidiary 82.C 320.D ofCodazzi 365.C Codazzi-Mainardi II1.H, App. A, Table 4.1 of a conic section 78.C of a conic section, canonical form of 78.C of continuity 130.A 204.B 205.A cubic 10.D, App. A, Table 1 cyclic 172.G delay-differential 163.A de Rham 274.G difference  difference equation difference-differential 163.A differential  differential equation(s) diffusion-convection 304.B Diophantine 118.A Dirac 377.C 415.G DLR 402.G eikonal 324.E 325.L energy (for a fluid) 205.A of entropy production 205.A Ernst 359.D estimating 399.P Euler (calculus of variations) 46.B Euler (of a perfect fluid) 204.E Euler, of motion (of a perfect fluid) 205.A Euler differential (dynamics of rigid bodies) 271.E Euler-Lagrange 46.B evolution 378.G exterior field 359.D field 150.B Fokker-Planck 402.1 functional  functional equation functional-differential 163.A Galois 172.G of Gauss (on an isometric immersion) 365.C Gauss (on surfaces) 111.H Gel'fand-Levitan-Marchenko 287.C 387.D general 172.G general Navier-Stokes 204.F Hamilton differential 324.E Hamilton-Jacobi 108.B Hamilton-Jacobi differential 271.F 324.E heat 327.A, App. A, Table 15.VI of heat conduction 327.A Hill determinantal 268.B indicial 254.C induction 259 1978 integral  integral equation(s) integrodifferential 163.A 222 integrodifferential, of Fredholm type 222.A integrodifferential, of Volterra type 222.A interior field 359.D Kadomtsev-Petvyashvili 387.F KdV 387.B Kepler 309.B Klein-Gordon 377.C 415.G Kolmogorov backward 115.A 260.F Kolmogorov forward 115.A 260.F Konigs-Schroder 44.B Korteweg-de Vries 387.A Lagrange,ofmotion 271.F Landau 146.C Landau-Nakanishi 146.C Langevin 45.1 402.K Lewy-Mizohata 274.G likelihood 399.M linear 10.D linear, system of 269.M linear integral 217.A linear ordinary differential 252.A linear structure, system 128.C Lippman-Schwinger 375.C local (of a divisor on an open set) 16.M logistic 263.A master 402.1 matrix Riccati differential 86.E Maxwell 130.A microdifferential 274.G minimal surface 275.A of motion (of a fluid) 205.A of motion (of a model) 264 of motion (in Newtonian mechanics) 271.A of motion (of a particle in a gravitational field) 359.D of motion, Euler (of a perfect fluid I 205.B of motion, Heisenberg 351.D of motion, Lagrange 271.F natural (of a curve) l11.D natural (of a surface) 110.A Navier-Stokes 204.B 205.C normal (in the method of least squares) 403.E 302.E 397.J ordinary differential  ordinary differential equation(s) of oscillation App. A, Table 15.vl Painleve 288.C parabolic cylindrical App. A, Table 14.11 partial differential  partial diffirential equation(s) Pell 118.A Pfaffian 428.A Pfaffian, system of 428.A Poisson 338.A Prandtl boundary layer 205.C Prandtl integrodifferential 222.(' pressure 205.B primitive 172.G quadratic 10.D, App. A, Table 1 quartic 10.D, App. A, Table I radial 315.E random Schrodinger 340.E of a rarefied gas 41.A rational differential 288.A reciprocal 10.C regular local (at an integral point) 428.E renewal 260.C renormalization I11.B 
1979 resolvent 25 \.F retarded differential 163.A Ricci 365.C Schlesinger 253.E Schrodinger 351.D secular 55.B 269.F self-adjoint differential 252.K self-adjoint system of differential 252.K Sine-Gordon 387.A singular integral 217.1 of sound propagation 325.A special functional 388 of state 419.A structure (of an affine connection) 417.B structure (for a curvature form) 80.G structure (for a torsion form) 80.H of structure (of a Euclidean space) II1.B of structure (for relative components) IIO.A system of 10.A system of linear ordinary differential 252.G telegraph 325.A, App. A, Table 15.111 time-dependant Schrodinger 35\.D time-independant Schrodinger 351.D transport 325. L two-dimension KdV 387.F variational 316.F 394.C of a vibrating membrane 325.A of a vibrating string 325.A wave 325.A 446.A, App. A, Table 15.111 Wiener-Hopf integrodifferential 222.C Yang-Mills 80.Q Yule-Walker 42\.D eq uator (of a sphere) 140 equiangular spiral 93.H equianharmonic range of points 343.D equicontinuous (family of mappings) 435.D equicontinuous group of class (CO) 378.C equicontinuous semigroup of class (Co) 378.B equidistant hypersurface (in hyperbolic geometry) 285.C equilateral hyperbola 78.E equilateral hyperbolic coordinates 90.C, App. A, Table 3.V equilateral triangle solution 420.B equilibrium, Nash 173.C equilibrium figures 55.D equilibrium Gibbs distribution 136.C equilibrium mass distribution 338.K equilibrium point (of a flow) 126.D (in an N-person differential game) 108.C (in a two-person zero-sum game) 173.C equilibrium potential 260.D equilibrium principle 338.K equilibrium state 136.H 340.B 419.A equilibrium statistical mechanics 402.A equilibrium system, transformation to 82.D equilong transformation 76.B equipollent sets 49.A equipotential surface 193.1 equipotent sets 49.A equivalence (of categories) 52.H (in a category) 52.D (of complexes) 2oo.H (of coverings) 91.A anti- (of categories) 52.H chain 2oo.H cochain 2oo.F combinatorial 65.A Subject Index Equivalent c- 126.B CR- 344.A homotopy 202.F Kakutani 136.F Lax, theorem 304.F natural 52.J principle of (in insurance mathematics) 214.A principle of (in physics) 359.D simple homotopy 65.C topological 126.B equivalence class 135.B linear (of divisors) 16.M equivalence properties 135.A equivalence relations 135 358.A proper (in an analytic space) 23.E equivalent (additive functionals) 261.E (arc) 246.A (coordinate bundle) 147.B (covering) 91.A (extension by a C*-algebra) 36.1 (fiber bundle) 147.B (formula) 41 \.E (functions with respect to a subset of CO) 21.E (G-structures) 191.A (knot) 235.A (linear representation) 362.C (measure) 225.J (methods of summation) 379.L (proposition) 411.B (quadratic form) 348.A (quadratic irrational numbers) 182.G (relation) 135.B (space group) 92.A (stochastic process) 407.A (surfaces in the sense of Frechet) 246.1 (system of neighborhoods) 425.E (unfolding) 5 \.E (unitary representation) 437.A (valuation) 439.B (word) 3 \.B algebraically (cycles) 16.R algebraically, to 0 16.P arithmetically 92.B 276.D C- 114.H certainty 408.B chain 2OO.C '1.- 431.F combinatorially 65.A conformally nA 19\.B 367.A countably (under a nonsingular bimeasurable transformation) 136.C c- 126.B elementarily 276.D fiber homotopy (vector bundles) 237.1 finitely (under a nonsingular bimeasurable transformation) 136.C flow 126.B f- (points) 122.A geometrically 92.B homotopy (systems of topological spaces) 202.F k- (COO-manifolds) 114.F linearly (divisors) 16.M 72.F locally (G-structure) 191.H numerically (cycles) 16.Q D-linearly, divisors 9.F Q- 126.H (PL) 65.D properly (binary quadratic forms) 348.M 
Subject Index Equivalent affinity pseudoconformally 344.A quasi- (unitary representations) 437.C rationally (cycles) 16.R right 51.C simple homotopy 65.C stably (vector bundles) 237.B stably fiber homotopy (vector bundles) 237.1 topologically 126. B,H uniformly (uniform spaces) 436.E unitarily (self-adjoint operators) 390.G equivalent affinity 7.E equivariant Atiyah-Singer index theorem 237.H equivariant cohomology 431.D equivariant J-group 431.F equivariant J-homomorphism 431.F equivariant K-group 237.H equivariant mapping (map) 431.A equivariant point (of a mapping) 153.B equivariant point index (of a mapping) 153.B Eratosthenes'sieve 297.B Erdos theorem, Chung- 342.B ergodic (Markov chain) 260J (transformation) 136.B ergodic capacity 213.F ergodic class 260.B positive recurrent 260.B ergodic decomposition (of a Lebesgue measure space) 136.H ergodic homeomorphism strictly 136.H uniquely 136.H ergodic hypothesis 136.A 402.C ergodic information source 2 I 3.C ergodic lemma, maximal 136. B ergodic Szemeredi theorem 136.C ergodic theorem 136.A,B Abelian 136.B dominated 136.B individual 136.B local 136.B mean 136. B multiplicative 136.B pointwise 136.B ratio 136.B subadditive 136.B ergodic theory 136 342.A ErJang distribution, k- 260.1 Erlangen program 137 Ernst equation 359.D error(s) 138.A accumulated 138.C burst 63.E discretization 303. B of the first kind 4oo.A of input data I38.B local truncation 303. E mean square 399.E 403.E propagation of J 38.C roundoff 138.B 303.B of the second kind 400.A theory of 138.A truncation 138.B 303.B error analysis 138 backward 302.B error constant 303.E error-correcting 63.A error-correcting capability 63.B error-detecting 63.A error estimate, one-step-two-half-steps 303.D 1980 error function 167.D, App. A, Table 19.11 error matrix 405.G error probability 213.D error space 403.E error sum of squares (with n -s degree of freedom) 403.E error term 403.D error vector 102.A essential (conformal transformation group) 364.F essentially bounded (measurable function) 168.B essentially complete class 398.B essentially normal 390.1 essentially self-adjoint 251.E 390.1 essentially singular point (with respect to an analytic set) 21.M essentially unitary 390.1 essential part 260.1 essential singularity (of an analytic function in the wider sense) 198.P (of a complex function) 198.D essential spectrum 390.E,H essential support (of a distribution) 274.D essential supremum (of a measurable function) 168.B Estes stimulus sampling model 346.G estimable (parametr.ic function) 399.C estimable parameter 403.E linearJy 403.E estimate 399.B a priori 323.C nonrandomized 399.B one-step-two-half-steps error 301.D Schauder 323.C estimating equation 399.P estimating function 399.P likelihood 399.M estimating parameters, design for 102.M estimation Hadamard App. A, Table 8 interval 399.Q 401.C point 371.H 399.B 40l.C region 399.0 statistical 399.A, App. A, Table 23 estimation space 403.E estimator 399.B asymptotically efficient 399.N BAN 399.K,N based on an estimating function 399.P Bayes 399.F best asymptotically normal 399.K best invariant 399.1 best linear unbiased 403.E CAN 399.K consistent 399.K consistent and asymptotically normal 399.K efficien t 399.D first-order asymptotic efficient 399.0 first-order efficient 399.0 generalized least squares 403.E invariant 399.1 kth-order AMU 399.0 kth-order asymptotically median unbiased 399.0 L- 371.H least squares 403.E M- 371.H maximum likelihood 399.M mean unbiased 399.C median unbiased 399.C 
1981 ML 399.M modal unbiased 399.C moment method 399.L Pitman 399.G R- 371.H randomized 399.B ratio 373.C state 86.E Stein shrinkage 399.G superefficient 399.N UMV unbiased 399.C unbiased 399.C uniformly minimum variance unbiased 399.C eta function (of a Riemann manifold) 39].L Dedekind 328.A etale morphism 16.F etale neighborhood 16.AA etale site 16.AA i:tale topology 16.AA Euclid axiom 139.A Euclidean algorithm 297.A of polynomials 337.D Euclidean cell complex 70.B Euclidean complex 70.B Euclidean connection 364.B manifold with 109 Euclidean distance 139.E Euclidean field 150.F Euclidean field theory 150.F Euclidean geometry 139 n-dimensional 139.B 181 non-  non-Euclidean geometry 285 in the wider sense 139. B Euclidean group, locally 423.M Euclidean Markov field theory 150.F Euclidean method 150.F Euclidean polyhedron 70.B Euclidean simplicial complex 70.B Euclidean space(s) 140 locally 259.B 425.V non- 285.A theorem on invariance of dimension of 117.D Euclidean space form 412.H Euclidean type (building) 13.R Euclid ring 67.L Euler, L. 141 Euler angles 90.C, App. A, Table 3.V Euler characteristic (of a finite Euclidean cellular complex) 201.B Euler class (of M) 201.N Euler constant 174.A Euler criterion 297.H Euler differential equation (dynamics of rigid bodies) 271.E Euler equation (calculus of variations) 46.B (of a perfect fluid) 204.E Euler equation of motion (of a perfect fluid) 205.A Euler formula I3I.G Euler function 295.C Euler graph 186.F Euler infinite product expansion 450.B Euler integral of the first kind 174.C of the second kind 174.A Euler-Lagrange differential equation 46.B Euler linear ordinary differential equation App. A, Table 14.1 Euler-Maclaurin formula 379.J Subject Index Exact Euler method of describing the motion of a fluid 205.A of numerical solution of ordinary differential equations 303.E summable by 379.P of summation 379.P Euler number 177.C 201.B, App. B, Table 4 Euler path 186.F Euler-Poincare characteristic 16.E 201.B Euler-Poincare class 56.B (of a manifold) 56.F universal 56.B Euler-Poincare formula 201.B,F Euler polynomial 177.C Euler product 450.B Euler relation 4l9.B Euler square 241.B Euler summation formula 295.E Euler theorem on polyhedra 201.F Euler transformation (of infinite series) 379.1 evaluable (locally convex space) 424.1 evaluation and review technique program 376 Evans-Selberg theorem 48.E 338.H Evans theorem 48.E even element (of a Clifford algebra) 61.B even function 165.B even half-spinor 61. E even half-spin representation 61.E even-oddness conservation laws 150.D even permutation 151.G even state 415.H event(s) 281D 342.B complementary 342.B delayed recurrent 260.C elementary 342.B exclusive 342.B impossible 342.B independent 342.B inferior limit 342.B intersection of 342. B measurable 342.B probability of an 342.B with probability I 342.B product 342.B random 342.B recurrent 250.D 260.C space of elementary 342. B sum 342.B superior limit 342.B sure 342.B symmetnc 342.G tail 342.B event commutativity 346.G event horizon 359D Everett formula (for functions of two variables) App. A, Table 21 Everett interpolation formula App. A, Table 21 everyw here almost 270.D 342.D nearly 338.F quasi- 338.F evolute (of a curve) 111.E evolution equation 378 evolution operator 378.G holomorphic 378.1 exact (additive covariant functor) 200.1 (differential on a Riemann surface) 367.H (endomorphism) 136.E 
Subject Index Exact differential equation (in Galois cohomology) 172.1 (in sheaf theory) 383.C half- 200.1 left - 200.1 right- 200.1 exact differential equation App. A, Table 14.1 exact differential form 105.Q exact functor 52.N exact sampling theory 401.F exact sequence (of A-homomorphisms of A-modules) 277.£ of cohomology 2oo.F cohomology 201.L of Ext 2oo.G fundamental (on cohomology groups) 2oo.M Gysin (of a fiber space) 148.E of homology 2OO.C homology (of a fiber space) 148.E homology (for simplicial complexes) 201.L homotopy 202.L homotopy (of a fiber space) 148.D homotopy (of a triad) 202.M homotopy (of a triple) 202.L Mayer-Vietoris (for a proper triple) 201.C Puppe 202.G reduced homology 201.F relative Mayer- Vietoris 201.L (R, S)- (of modules) 200.K short 200.1 of Tor 200.D Wang (of a fiber space) 148.E exceptional (Jordan algebra) 231.A (leaf) 154.D exceptional compact real simple Lie algebra 248.T exceptional compact simple Lie group 249.L exceptional complex simple Lie algebra 248.S exceptional complex simple Lie group 249.M exceptional curve IS.G of the first kind 15.G of the second kind IS.G exceptional function, Julia 272.F exceptional orbit 431.C exceptional sets 192.R exceptional value (of a transcendental entire function) 429.B Borel 272.E Nevanlinna 272.E Picard 272.E excess 178.H, App. A, Table 6.111 coefficient of 341.H 396.C spherical 432.B total 178.H excessive (function) 260.D 261.D ()(- 261.D excessive measure 261.F exchange 420.D exchange of stability 286.T excision isomorphism 201.F,L excluded middle, law of IS6.C 411.L exclusive events 342.B exhaustion 178.F existence theorem (in class field theory) S9.C (for ordinary differential equations) 316.C Cartan-Kiihler 191.1 428.E Cauchy 320.B Cauchy-Kovalevskaya 321.A existential proposition 411.B existential quantifier 411.C 1982 exit 281.C exit boundary (of a diffusion process) 115.B exit boundary point 260.1 exit time 261.B exogenous variable 128.C exotic sphere 114.B expA (exponential function of matrix A) 269.H expansion 65.C asymptotic 30.A, App. A, Table 17.1 asymptotic (of a pseudodifferential operator) 34S.A Cornish-Fisher 374.F Edgeworth 374.F E- 361.C Laurent 198.D method of matched asymptotic 11.,n J.Jk.U Minakshisundaram-Pleijel asymptotic 391.B orthogonal 317.A partial wave 375.E 386.B Taylor (of an analytic function of several vari- ables) 21.B Taylor (of a holomorphic function,! 339.A Taylor, and remainder App. A, Table 9.1V Taylor, formal S8.C expansion coefficient 317.A expansion formula, q- 134.1 expansion method 20S.B expansion theorem 306.B Hilbert-Schmidt 217.H Laplace (on determinants) 103.D expansi ve 126.J expectation IIS.B 342.C conditional 342.E mathematical 341.B expectation value (of an operator) 351.B expected amount of inspection 404.C expected value (of a random variable) 342.C experiment(s) design of 102 factorial 102.H Sk factorial 102.H statistical 398.G experimental analysis 385.A experimentation model 38S.A explanatory variable 403.D explicit (difference equation in a multi3tep method) 303.E (Runge-Kutta method) 303.D explicit function 165.C explicit method 303.E explicit reciprocity laws 14.R explicit scheme 304.F exploratory procedures 397.Q exploring a response surface, designs for 102.M explosion, n- 126.1 explosion time 406.D exponent (of an Abelian extension) 172.F (of an algebra class) 29.E (of a finite group) 362.G (of a Kummer extension) 172.F (of a power) 131.B (of a regular singular point) 2S4.C (of a stable distribution) 341.G characteristic (of an autonomous linear system) 163.F characteristic (of the Hill differential equation) 268.B characteristic (of a variational equation) 394.C 
1983 conjugate 168.C of convergence 429.B critical 111.C integral 167.D one-sided stable process of 5.F of the stable process 5.F subordinator of 5.F exponential curve 93.H exponential dichotomy 290.B exponential distribution 341.D, App. A, Table 22 two-sided App. A, Table 22 exponential family of distributions 396.G exponential formula 286.x double 299.B exponential function I31.D with the base a 131.B of an operator 306.C exponential generating function 177.A exponential group 437.U exponential Hilbert space 377.D exponential integral 167.D, App. A, Table 19.n exponential lattice 287.A exponential law (on cardinal numbers) 49.C exponentially decreasing Fourier hyperfunction 125.BB exponentially decreasing real analytic function 125.BB exponentially stable 163.G 394.B exponential mapping (of a Lie algebra into a Lie group) 249.Q (of a Riemannian manifold) 178.A 364.C exponential method, Borel 379.0 exponential series 131.D exponential valuation 439.B p-adic 439.F exposed, strongly 443.H expression field of rational 337.H rational 337.H expx 131.D Ext 2oo.G exact sequence of 2oo.G Ext groups 2oo.G Ext(A, B) 2oo.K Ext(M, N) 2oo.G extended Dynkin diagram App. A, Table 5.1 extended hypergeometric function, Barnes 206.C extended real number 87.E extension (of a connection) 80.F (of a field) 149.B (of a fractional ideal) 14.1 (of a group) 190.N (of an ideal of compact operators) 36.J 390J (of an isomorphism of subfields) 149.D (of a mapping) 381.C (of modules) 2oo.K (of an operator) 251.B (of a solution of an ordinary differential equation) 316.C (of a valuation) 439.B Abelian (of a field) 172.B algebra 2oo.L algebraic (of a field) 149.E Artin-Schreier (of a field) 172.F basic Z,- 14.L central (of a group) 190.N of the coefficient ring 29.A cone 65.D cyclic (of a field) 172.B Subject Index External (in nonstandard analysis) cycJotomic Zt- 14.L elementary 276.D finite 149.F Friedrichs 112.1 2Sl.E Galois I72.B t- 14.L group 2oo.M inseparable (of a field) 149.H Kummer (of a field) 172.F Lebesgue 270.D linear (of a rational mapping to an Abelian variety) 9.E maximal Abelian 257.F maximal separable (of a field) 149.H natural (of an endomorphism) 136.E normal 149.G 251.K p- (of a field) 59.F p-adic 439.F purely inseparable (of a field) 149.H Pythagorean (of a field) 155 C regular (of a field) 149.K scalar (of an algebra) 29.A scalar (of an A-module) 277.L scalar (of a linear representation) 362.F separable (of a field) 149.H,K separably generated (of a field) 149.K simple (of a field) 149.D split (of a group) 190.N strong (of a differential operator) 112.E,F transcendental 149.E transitive (of a permutation group) 151.H unramified 14.1 257.D weak (of a differential operator) 112.E,F Z,- 14.L extensionality, axiom of (in axiomatic set theory) 33.B extension field 149.B Picard- Vessiot 113 strongly normal 113 extension theorem first (in the theory of obstructions) 305.B Hahn-Banach 37.F Hopf(in measure) 270.E Kolmogorov 341.1 second (in the theory of obstructions) 305.C third (in the theory of obstructions) 305.C Tietze 42S.Q Whitney 168.B extensive thermodynamical quantity 419.A exterior (of an angle) 155.B (of a polygon) 155.F (of a segment) 155.B (of a subset) 425.N exterior algebra (of a linear space) 256.0 exterior capacity, Newtonian 48.H exterior derivative (of a differential form) 105.Q exterior differential form of degree r 105.Q exterior field equation 359 exterior point (of a subset) 425.N exterior power, p-fold (of a linear space) 256.0 (of a vector bundle) 147.F exterior problem (for the Dirichlet problem) 120.A exterior product (of differential forms) 105.Q (of elements of a linear space) 256.0 (of a p-vector and a q-vector) 256.0 (of two vectors) 442.C external (in nonstandard analysis) 293.B 
Subject Index External irregular point external irregular point 338.L external language 75.C external law of composition 409.A externally stable set 186.1 external product 2oo.K external space (in the static model in catastrophe theory) 51.B external variable 264 extinction probability 44.B extrapolation l76.K 223.A extrapolation method polynomial 303.F rational 303.F extremal (Jordan curve) 275.C extremal distance l43.B reduced 143.B extremal function, Koebe 438.C extremal horizontal slit mapping 367.G extremal length 143 defined by Hersch and Pfluger 143.A with weight 143.B extremal quasiconformal mapping 352.C extremal vertical slit mapping 367.G extremely disconnected 37.M extreme point of a convex set 89.A of a subset of a linear space 424.V extremum conditional relative (of a function) 106.L relative (of a function) 106.L F f (cardinal number of all real-valued functions on [0, I]) 49.A Fq (finite field with q elements) 450.Q f-metric 136.F iN-metric 136.F F-compactification 207.C F-distribution 341.D 374.B, App. A, Table 22 noncentral 374.B F-free (compact oriented G-manifold) 431.E F-test 4oo.G (F, F)-free (compact oriented G-manifold) 431.G F. set 270.C (F)-space 424.1 face (of a complex) 13.R (of a convex cell) 7.D ith (of a singular q-simplex) 201.E q- (of an n-simplex) 70.B face operator (in a semisimplicial complex) 70.E factor (of an element of a ring) 67.H (of a factorial experiment) 102.H (of a von Neumann algebra) 308.F of automorphy 32.A composition (in a composition factor series in a group) 190.G determinant (of a matrix) 269.E direct (of a direct product of sets) 381.E direct (of a group) 190.L Dirichlet discontinuous App. A, Table 9.V first (of a class number) 14.L integrating App. A, Table 14.1 Krieger 308.1 p- (of an element of a group) 362.1 Powers 308.1 proper lof an element of a ring) 67.H second (of a class number) 14.L 1984 Ulm (of an Abelian p-group) 2.D factor A-module 277.C factor analysis 280. G factor analysis model 403.C factor group 190.C factorial 330, App. A, Table 17.11 Jordan 330 factorial cumulant 397.G factorial cumulant generating function 397.G factorial design 102.H balanced fractional 102.1 fractional 102.1 orthogonal fractional 102.1 factorial effect 102.H factorial experiment 102.H Sk i02.H factorial function 174.A factorial moment 397.G factorial moment generating functions 397.G factorial series 104.F 121.E factorization incomplete 302.C regular 251.N triangular 302.B factorization method 206.B factorization theorem (of an HP-function) 43.F Neyman 396.F unique (in an integral domain) 67 H factor loading 280.G 346.F factor representation of a topological group 437.E of type I, II, or III 437.E factor ring 368.E,F factor score 280.G 346.F factor set(s) (in cohomology of groups) 200.M (of a crossed product) 29.D (in extensions of groups) 190.N (of a projective representation) 362.J associated (in extensions of groups) 190.N factor transformation (of a measure-preserving transformation) 136.D Faddeev-Popov ghost 132.C 150.G fading memory 163.1 faithful (functor) 52.H (linear representation) 362.C (permutation representation) 362.B (weight on a von Neumann algebra) 308.D fully (functor) 52.H faithfully flat A-module 277.K faithfuIly flat morphism 16.D false 411.E regular 301.C false position, method of 301.C family 165.D algebraic (of cycles on an algebraic variety) 16.R of compact complex manifolds 72.G of confocal central conics 78.H of confocal parabolas 78.H of confocal quadrics 350.E covering 16.AA crystal 92.B directed 165.D equicontinuous (of mappings) 43S.D exponential, of distributions 396.G of frames (on a homogeneous space) IIO.A of frames of order I 11 O.B 
1985 of functions 165.B,D of functions indexed by a set 165.B indexed by a set 16S.D of mappings 165 D normal (of functions) 435.E parameter space 72.G of points 165.D of quasi-analytic functions 58.A quasinormal (of analytic functions) 435.E separating 207.C of sets 16S.D 381.B,D of sets indexed by a set 381.D tight (of probability measures) 341.F uniform, of neighborhood system 436.D fan 16.Z Fannes-Verbeure inequality, Roebstorff- 402.G F ANR 382.C FAR 382 C Farcyarc 4.B Farey dissection 4.B Farey sequence 4 B Farkas theorem, Minkowski- 255.B fast Fourier transform 142.D fast wave 259 Fatou theorem on bounded functions in a disk 43.D on the Lebesgue integral 221 C favorable a priori distribution, least 398.H F D. generator 136.E F.D. process 136.E feasible (flow) 281.B feasible directions, method of (in nonlinear programming) 292.E feasible region 264.B 292.A feasible solution 255 A 264 B basic 255.A feedback control 405.C Fefferman-Stein decomposition 168.B Feit-Thompson theorem (on finite groups) 151.D Fejer kernel 159.C Fejer mean 159.C Fejer theorem 159 C Feller process 261 B Feller transition function 261.B Fermat, P. de 144 last theorem of 145 Fermat number 297.F Fermat principle 180.A 441.C Fermat problem 145 Fermat theorem 297.G fermions 132 A 351.H Fermi particle 132.A Fermi statistics 377.B 402.E Ferrar formula, Dixon- App. A, Table 19.1V Ferrari formula App. A, Table 1 Feynman amplitude 146 B Feynman diagram 146.B Feynman graphs 146.A,B 386.C Feynman integrals 146 Feynman-Kac formula 351.F Feynman-Kac-Nelson formula 150.F Feynman rule 146.A,B F F. 136.F FFT 142.D fiber (of a fiber bundle) 147.B (of a fiber space) 148.B (of a morphism) 16.D geometric (of a morphism) 16.D integration along (of a hyperfunction) 274.E Subject Index Field(s) fiber bundle(s) 147 associated 147.D of class C' 147.0 complex analytic 147.0 equivalent 147.B isomorphic 147.B orientable 147.L principal 147.C real analytic 147.0 fibered manifold 428.F fiber homotopy equivalent 237.1 stably 237.1 fiber homotopy type, spherical G- 431.F fibering, Hopf 147.E fiber mapping (map), linear 114.D fiber product 52.G fiber space(s) 72.1148 algebraic 72.1 locally trivial 148.B n-connective 148.D spectral sequence of (of singular cohomology) 148.E Spivak normal 114J fiber sum 52.G fiber term (of a spectral sequence) 200J Fibonacci sequence 295.A fibration (of a topological space) 148.B fibration, Milnor 418.D fictitious state 260.F fiducial distribution 401.F fiducial interval 4OI.F field(s) 149 (of sets) 270.B (of stationary curves) 46 C absolute class 59.A algebraically closed 149.1 algebraic function, in n variables 149.1 algebraic number 14.B alternative 231.A Anosov vector 126J Archimedean ordered 149.1 asymptotic 150.D Axiom A vector 126.J basic (of a linear space) 256.A Borel 270 B,C C i - 118.F C,(d)- 118 F canonical 377.C class 59.B coefficient (of an affine space) 7.A coefficient (of an algebra) 29.A coefficient (of a projective space) 343.C coefficient (of a semi local ring) 284.D commutative 368.B composite 149.D conjugate 149J 377.C cyclotomic 14.L decomposition (of a prime ideal) 14.K of definition (for an algebraic variety) 16.A differential 113 electric 130.B Euclidean 150.F extension 149 B finite 149.C formal power series, in one variable 370.A of formal power series in one variable 370.A formally real 149.N free 150.A free Dirac 377.C free scalar 377.C 
Subject Index Field equation function 16.A Galois 149.M Galois theory of differential 113 ground (of an algebra) 29.A ground (of a linear space) 256.A Hamiltonian vector 126.L 219.C holomorphic vector 72.A imaginary quadratic 347.A imperfect 149.H inertia (of a prime ideal) 14.K intermecliate 149.D invariant 172. B Jacobi 178.A Lagrangian vector 126.L local 257.A local class 257.A local class, theory 59. G linearly disjoint 149.K magnetic 130.B of moduli 73.B Morse-Smale vector 126J noncommutative 149.A number 149.C ordered 149.N p-adic number 257.A 439.F perfect 149.H Picard-Vessiot extension 113 power series, in one variable 370.A prime 149.B Pythagorean 139.B 155.C Pythagorean ordered 60.0 quadratic 347.A quasi-algebraically closed 118.F of quotients 67.G ramification (of a prime ideal) 14.K random 407.B of rational expressions 337.H rational function, in n variables 149.K ofrational functions 337.H real 149.N real closed 149.N real quadratic 347.A relative algebraic number 14.1 residue class 149.C 368.F residue class (of a valuation) 439.B scalar 108.0 scalar (in a 3-dimensional Euclidean space) 442.D of scalars (of a linear space) 256.A skew 149.A 368.B splitting (for an algebra) 362.F splitting (for an algebraic torus) 13.D splitting (of a polynomial) 149.G strongly normal extension 113 tension 195.B tensor  tensor field topological 423.P totally imaginary 14.F totally real 14.F transversal 136.G vector (in a differentiable manifold) 108.M vector (in a 3-dimensional Euclidean space) 442.D Wightman 150.D Yang-Mills 150.G field equation 150.B exterior 339.D interior 339.D field theory 150 1986 constructive 150.F Euclidean 150.F Markov 150.F nonsymmetric unified 434.C quantum 150.C unified 434 unitary 434.C fifth postulate (in Euclidean geometry) 139.A fifth problem of Hilbert 423.N figure(s) 137 absolute (in the Erlangen program) 137 central 420.B equilibrium 55.D fundamental (in a projective space) 343.B linear fundamental 343.B P'- 343.B file 96. B filing, inverted, scheme 96.F fill-in 302.E filter 87.1 Cauchy (on a uniform space) 436.G Kalman 86.E Kalman-Bucy 86.E 405.G linear 405.F maximal 87.1 nonlinear 405.F,H Wiener 86.E filter base 87.1 filtering 395.E stochastic 342.A 405.F filtration 2oo.J discrete 200J exhaustive 200.J filtration bounded from below 200J filtration degree 2oo.J final object 52.D final set (of a correspondence) 358.B (of a linear operator) 251.E final state 31.B finely continuous 261.C finely Open (set) 261.D fine moduli scheme 16. W finer relation 135.C finer topology 425.H fine topology (on a class of measures) 261.D 338.E finitary standpoints 156.D finite (cell complex) 70.D (measure) 270.D (morphism) 16.D (potency) 49.A (simplicial complex) 70.C (triangulation) 70.C (v on Neumann algebra) 308.E approximately 36.H 308.1 geometrically 234.C hyper- 308.1 locally  locally finite point- (covering) 425.R pro-, group 210.C semi- 308.1 (J- (J-finite star- (covering) 425.R finite automaton 31.D finite-band potentials 387.E finite basis (for an ideal) 67.B finite branch (of a curve of class C k ) 93.G finite character, conclition of 34.C 
1987 finite cochain (of a locally finite simplicial complex) 201.P finite continued fraction 83.A finite covering (of a set) 425.R finite differences 223.C finite-dimensional distribution 407.A finite-dimensional linear space 256.C finite-dimensional projective geometry 343.B finite-displacement theory 271.G finite element method 233.G 290.E 304.C finite extension 149.F finite field 149.C finite field Fq 149.M finite-gap potentials 387.E finite groups 151.A 190.C finite intersection property 425.S finite interval (in R) 355.C finite length 277.1 finitely additive (vector measure) 443.G finitely additive class 270.B finitely additive measure 270.D finitely additive set function 380.B finitely determined process 136.E finitely distinguishable (hypothesis) 4OO.K finitely equivalent sets (under a nonsingular bi- measurable transformation) 136.C finitely fixed 136.F finitely generated (A-module) 277.D (group) 190.C finitely presented (group) 161.A finite memory channel 213.F finiteness condition for integral extensions 284.F finiteness theorem 16.AA Ahlfors 234.D finite order (distribution) 12S.1 finite ordinal number 312.B finite part (of an integral) 125.C finite population 373.A finite presentation 16.E finite prime divisor 439.H finite projective plane 241.B finite rank (bounded linear operator) 68.C finite sequence 165.D finite series 379.A, App. A, Table 10.1 finite set 49.A 381.A hereditary 33.B finite subset property 396.F finite sum, orthogonality for a 19.G 317.D, App. A, Table 20. VII finite type (graded module) 203.B (module) 277.D (morphism of schemes) 16.D ((!i-module) 16.E algebraic space of 16.W locally of 16.D su bshift of 126.J finite-type power series space 168.B finite-valued function 443.B finitistic (topological space) 431.B Finsler manifold 286.L Finsler metric 152.A Finsler space 152 firmware 75.C first axiom, Tietze 425.Q first boundary value problem 193.F 323 C first category, set of 42S.N first classfication theorem (in theory of obstructions) 30S.B Subject Index Fixed point first complementary law of the Legendre symbol 297.1 first countability axiom 425.P first definition (of algebraic K-group) 237.J first extension theorem (in the theory of obstruc- tions) 305.B first factor (of a class number) 14.L first fundamental form (of a hypersurface) 111.G first fundamental quantities (of a surface) I11.H first fundamental theorem (Morse theory) 279.D first homotopy theorem (in the theory of obstruc- tions) 305.B first incompleteness theorem 185.C first-in first-out memory 96.E first-in last-out memory 96.E first integral (of a completely integrable system) 428.D first isomorphism theorem (on topological groups) 423.J first kind (integral equations of Fredholm type of the) 217.A Abelian differential of 11.C Abelian integral of II.C differential form of 16.0 first law of cosines 432.A, App.A, Table 2.11 first law of thermodynamics 419.A first maximum principle (in potential theory) 338.C first mean value theorem (for the Riemann integral) 216.B first negative prolongationallimit set 126.D first-order asymptotic efficient estimator 399.0 first-order derivatives 106.A first-order designs 102.M first-order efficient estimator 399.0 first-order predicate 411.K first-order predicate logic 411.K first positive prolongationallimit set 126.D first problem, Cousin 21.K first prolongation (of P) 191.E first quadrant (of a spectral sequence) 200.1 first quartile 396.C first regular integral 126.H first-return mapping (map) 126.B first separation axiom 425.Q first variation 46.B first variation formula 178.A Fisher consistent 399.K Fisher expansions, Cornish- 374.F Fisher inequality 102.E Fisher information 399.D Fisher information matrix 399.D Fisher problem, Behrens- 4oo.G Fisher theorem 43.G Fisher theorem, Riesz- 168.B 317.A Fisher three principles 102.A Fisher-Yates-Terry normal score test 371.C Fisher z-transformation 374.D fisheye, Maxwell 180A fit, chi-square test of goodness of 4oo.K fit, goodness of 397.Q fitting, curve 19.F five-disk theorem, Ahlfors 272.1 fixed branch points (of an algebraic differential equation) 288A fixed component (of a linear system) 16.N fixed effect 102.A fixed-effect model 102.A fixed point (of a discontinuous group) 122.A 
Subject Index Fixed-point index (of a continuous mapping) (of a flow) 126.D (of a mapping) l53.A,D (of a transformation group) 431 A of discontinuity (of an additive process) 5 B of discontinuity (of a stochastic process) 407 A hyperbolic 126.G isolated 126.G fixed-point index (of a continuous mapping) 153.B fixed point method 138.B fixed-point theorem(s) 153 Atiyah-Bott 153.C Atiyah-Singer 153.C Brouwer 153.B Kakutani 153.D Lefschetz 153.B Leray-Schauder 286.D Poincare-Birkoff 153.B Schauder 153 D 286.D Tikhonov 153.D fixed singularity (of an algebraic differential equation) 288.A fixed variates 403.D fixed vector 442.A FKG 212.A Fl (fleche) 52.A flabby resolution 125.W flabby sheaf 383.£ flag (in an affine space) 139.B flag manifold 199 B proper 199.B flat (connection) 80.£ (morphism of schemes) 16.D (Riemannian manifold) 364.£ (sphere pair) 235.G conformally 191.B G- 178.D faithfully (A-module) 277.K faithfully (morphism of schemes) 16.D locally (connection) 80.£ locally (PL embedding) 65 D locally (Riemannian manifold) 364.£ normally (along a subscheme) 16 L flat A-module 277.K flat deformation 16.W flat F-bundle 154.B flat function 58.C flat point (of a surface) 1I1.H flat site 16.AA flat space concircularly App A, Table 4.11 conformally App. A, Table 4.n projectively App. A, Table 4.11 flavor dynamics, quantum 132.D flex 9.B flip model, spin 340.C floating point method 138.B Floquet theorem 252.J 268.B flow (in ergodic theory) 136 D (on a network) 281.B (on a topological space) 126.B Anosov 126.B 136.G associated 136 F Axiom A 126 J built under a function 136 D c- 126 B of class C 126.B continuous 126.B discrete 126.B 1988 equivalent 126.B geodesic 126.L 136.G harmonic 193.K homentropic 205.B horocycle 136.G hypersonic 205.C K- 136.£ Kolmogorov 136 £ Kronecker 136.G laminar 205.£ 433.A maximum, minimum cut theorem 281 maximum, problem 281 measurable 136.D minimal 126.N minimum-cost, problem 281.C Morse-Smale 126.J muiticommodity, problem 281.F network, model 307.C network, problem 281 282.B S- 136.D single-commodity, problem 281. F special 136.D translational 126.L 136.G transonic 205.B transversal 136.G turbulent 205.£ 433.A Yo, 136.G flow-shop scheduling problem 376 fluctuation-dissipation theorem 402.K fluid 205.A compressible 205.B incompressible 205.B Newtonian 205.C non-Newtonian 205.C perfect 205.B fluid dynamics 205.A flux (of a regular tube) 193.K vector (through a surface) 442.D flux density electric 130.A magnetic 130 A focal conic (of a quadric) 350.£ focal length 180.B focal point (of a submanifold of a Riemannian manifold) 364.C Fock representation 150.C F ock space 377.A antisymmetric 377.A symmetric 377.A focus (of a conic section ellipse) 78.B (of an optimal system) 180.B (of a quadric) 350 £ Fokker-Planck partial differential equation' 115.A 402.1 Foia model, Sz. Nagy- 251.N folding (of a chamber complex) 13.R foliated bundle 154.B,H foliated cobordant (Cn-foliations) 154.H foliated cobordism l54.H foliated structure 105.Y foliation 154 Anosov 126.J compact 154.H C- 154.B,G ["- 154.H holomorphic 154.H real analytic 154.H Reeb 154.B 
1989 Riemannian 154.H transverse to 154.H foliation cycles 154.H folium cartesii 93.H folium of Descartes 93.H foot of the perpendicular 139.E force apparent 271.D body 271.G centrifugal 271.D Corioli 271.D line of 193.J Lorentz 130.A restitutive 318.B forced oscillation 318.B force polygon 19 C Ford fundamental region 234.C forgetful functor 52.1 form(s) 337.B anti-Hermitian 256.Q associated (of a projective variety) 16.S automorphic 437.DD 450.0 automorphic, of weight k (or of dimension -k) 32.B automorphic, of weight m 32.A basic (in linear programming) 255.A bilinear 256.H 277.J 424.G bilinear, associated with a quadratic form 256.H canonical (of F(M)) 191.A canonical (of a linear hypothesis) 4oo.H canonical, of the equation (of a quadric) 350.B canonical 1- (of the bundle of tangent n-frames) 80.H Cantor normal 312.C compact (of a complex semisimple Lie algebra) 248.P complex (of a Fourier series) 159.A complex (of a real Lie algebra) 248.P complex space 365.L connection 80.E 417.B contact 1l0.E covariant of n-ary, of degree d 226.D covariant with ground 226.D curvature 80.G 364.D cusp (in the case of one variable) 32.B cusp (in Siegel upper half-space) 32.F differential differential form Dirichlet 261.C divergence 323 D dominant integral (on a Cartan subalgebra) 248.W F.-Hermitian 60.0 F.-trace 600 Euclidean space 412.H first fundamental II1.G, App. A, Table 4 I formula of embedding 303.D Fuchsian, of weight k (or of dimension - k) 32.B fundamental (associated with a Hermitian metric) 232.A fundamental (of a Finsler space) 152.A games in partition-function 173.D generalized Levi 274 G ground 226.D Hermitian  Hermitian form 256.Q Hesse normal (of a hyperplane) 139.H Hilbert modular, of dimension -k 32.G Hilbert modular, of weight k 32.G holomorphic k- 72.A Subject Index Formally self-adjoint (differential operator) hyperbolic space 412.H integral (on a Cartan subalgebra) 248.W invariants of n-ary, of degree d 226.D Jordan normal 269.G k- (of an algebraic group) 13.M kernel 348.F Khinchin canonical 341.G Killing 248.B Kolmogorov canonical 341.G Legendre-Jacobi standard 134.A, App. A, Table 16.1 Levi 344.A Levy canonical 341 G limit of an indeterminate 106.E linear (on an A-module) 277.E linear (on a linear space) 256.B modular, of level N 32.C multilinear 256 H n-person 173.B- D norm 118.D normal (of an ordinal number) 312.C normal (of an ordinary differential equation) 313.B normal (of partial differential equations) 321.B normal (of a partial differential equation of the first order) 324.E normal (of a surface) 41O.B normal real (of a complex semisimple Lie algebra) 248.Q normic (in a field) 118.F Pfaffian 105.Q 428.A polar (of a complex number) 74.C primitive 232.C pseudotensorial 80.G quadratic  quadratic form real (of a complex algebraic group) 60.0 real (of a complex Lie algebra) 248.P reduced (of a linear structural equation system) 128.C regular Dirichlet 261.C second fundamental 111.G 365.C, App. A, Table 4.1 sesquilinear sesquilinear form Siegel modular, of dimension - k 32.F Siegel modular, of weight k 32 F skew-Hermitian 256.Q skew-symmetric multilinear 256.H space 285.E 412.H spherical space 412 H standard (of a difference equation) 104.C standard (of a latin square) 241.A symmetric multilinear 256.H symplectic 126.L tensorial 80.G third fundamental App. A, Table 4.1 torsion 80.H Weierstrass canonical (for an elliptic curve) 9.D Weierstrass canonical (of the gamma function) 174.A Weyl 351.C formal adjoint operator 322.E formal degree (of a unitary representation) 437.M formal dimension n, Poincare pair of 114.J formal geometry 16.x formal group 13.C formalism 156.A,D Gupta-Bleuler 150.G formally real field 149.N formally self-adjoint (differential operator) 112.1 
Subject Index Formally undecidable proposition formally undecidable proposition 185.C formal power series 370.A field of, in one variable 370A rings of 370.A formal power series field in one variable 370A formal power series ring 370.A formal scheme 16'x separated 16'x formal solution (for a system of ordinary differential equations) 289.C formal spectrum (of a Noetherian ring) 16.X formal system 156 D 411.I formal Taylor expansion 58.C formal vector fields 105.AA formation class 59.H pattern 263.D formation, class 59.H formation rule 411.D form ring 284.D formula(s) 411.D Abramov 136.E addition (for e Z ) 13I.G addition (for sine and cosine) 432.A Adem App.A, Table 6.11 algebraic addition 3.M atomic 411 D atomic (of a language) 276.A Bayes 342.F 405.1 Bessel interpolation App. A, Table 21 Binet 174.A 295.A Bouquet (on space curves) II1.F Campbell-Hausdorff 249.R Cardano App. A, Table 1 Cartan (for Steenrod pth power operations) 64.B Cartan (for Steenrod square operations) 64.B Cauchy-Hadamard (on the radius of conver- gence) 339A Cauchy integral 198.B Chebyshev (in numerical integration) 299.A Chern (in integral geometry) 218.D Christoffel- Darboux 317.D Clenshaw-Curtis 299.A closed 411.J closed (of a language) 276.A closed type (in numerical integration) 299A connection 253.A constant variational 163.E Crofton (in integral geometry) 218.B decomposition, of Radon 125.CC De Moivre 74.C dimensional 116 discontinuity 146.C 386.C Dixon-Ferrar App. A, Table 19.1V double exponential 299.B Dynkin 261.C of embedding form 303.D empirical 19.F Euler (for cos z, sin z, cosh z) 131.G Euler (for e'Y) 131.G Euler-Maclaurin 379.J Euler-Poincare (for a finite Eulidean cellular complex) 201.B,F Euler summation 295.E Everett App. A, Table 21 Everett interpolation 224.B, App. A, Table 21 exponential 286.X Ferrari App. A, Table 1 Feynman-Kac 315.F,G 1990 Feynman-Kac-Nelson 150.G first variation 178.A Fourier inversion 160.C Fredholm 68.L Frenet (on curves) 1\ I.D Frenet-Serret (on curves) 111.D Gauss (on Gauss sum) 295.D Gauss (on harmonic functions) 193.D Gauss (for integration of a vector field) App. A, Table 3.m Gauss (for isometric immersion) 365.C Gauss (in numerical integration) 299.A Gauss (for the surface integral) 94.F Gauss (in theory of surfaces) III.H, App. A, Table 4.1 Gauss backward interpolation 223.C Gauss-Bonnet 111.H 364.D, App A, Table 4.1 Gauss-Bonnet-Sasaki-Nitsche 275.C Gauss-Chebyshev (in numerical in1egration) 299 A Gauss forward interpolation 223.C Gauss-Hermite (in numerical integration) 299A Gauss integration (in the narrow st:nse) 299.A Gauss interpolation App. A, Table 21 Gauss-Laguerre (in numerical integration) 299.A Green (for differential operators) App. A, Table 15.VI Green (for harmonic functions) 193.D Green (for Laplace operator) ApI'. A, Tables 3.m 4.11 Green (for ordinary differential equations) 252.K Green (for partial differential equa1 ions of parabolic type) 327.D Green (on the plane) 94.F Green-Stokes 94.F Hansen-Bessel App. A, Table 19JII Heron (for plane triangles) App. A, Table 2.11 Heron (for spherical triangles) App. A, Table 2.m identically true 411.G IMT 299.B interpolation 223.A interpolatory 299.A inversion (for a characteristic function) 341.C inversion (of cosine transform) 160.C inversion (of Fourier transform) ] 60.C inversion ( of Fourier transform of distributions) 160.H inversion (of Fourier transform on a locally compact Abelian group) 192.K inversion (of generalized Fourier transform) 220.B inversion (of Hilbert transform) 220.E inversion (of integral transform) 220.A inversion (of Laplace-Stieltjes transform) 240.D inversion (on a locally compact group) 437.L inversion (of Mellin transform) 220.C inversion (for a semigroup of operators) 240.1 inversion (of Stieltjes transform) 220.D Ita 45.G 406.B Jensen 198.F Klein-Nishina 415.G Kneser-Sommerfeld App. A, Table 19.m Kostant (on representations of compact Lie groups) 248.Z Kronecker limit 450.B 
1991 Kubo 402.K Kunneth (in an Abelian category) 2oo.H Kunneth (in Weil cohomology) 450.Q Lagrange (for the vector triple product) 442.C of a language 276.A lattice-point 222.B Lefschetz fixed-point 450.Q Leibniz (in differentiation) 106.D, App. A, Table 9 m Leibniz (in infinite series) App. A, Table 1O.m Liouville 252.C Machin 332 Mehler App. A, Table 19.m Milne-Simpson 303.E Mobius inversion (in combinatorics) 66.C Mobius inversion (in number theory) 295.C Nakano-Nishijima-Gell-Mann 132.A Newton (on interpolation) App. A, Table 21 Newton (on symmetric functions) 337.1 Newton backward interpolation 223.C Newton-Cotes (in numerical integration) 299.A Newton forward interpolation 223.C Newton interpolation App. A, Table 21 Nicholson App. A, Table 19.1V open (in numerical integration) 299.A Ostrogradskii 94.F overall approximation 303.C Picard-Lefschetz 418.F Plancherel (on a unimodular locally compact group) 437.L Plucker (on plane algebraic curves) 9.B Poincare (in integral geometry) 218.C Poisson (on Bessel functions) App. A, Table 19.m Poisson (for a flat torus) 391.1 Poisson integral 198.B Poisson summation 192.C,L prime 411.D prime (of a language) 276.A principal, of integral geometry 218.C product (for the Hilbert norm-residue symbol) 14.R product (on invariant Haar measures) 225.F product (for the norm-residue symbol) 14 Q product (on valuations) 439.H q-expansion (on theta functions) 134.1 recurrence, for indefinite integrals App. A, Table 9.11 reduction (of a surface) llO.A Ricci 417.B, App. A, Table 4.11 Riemann-Hurwitz (on coverings of a non- singular curve) 9.1 Rodrigues 393 B Schliifli App A, Table. 19.m Schwarz-Christoffel transformation 77.D second variation 178.A set-theoretic 33.B Sommerfeld App. A, Table 19.m Sonine-Schafheitlin App. A, Table 19.111 Steinberg (on representations of compact Lie groups) 248.Z Stirling 174.A 212.C, App. A, Table 17.1 Stirling interpolation App. A, Table 21 Stokes 94.F Stokes (on a Coo-manifold) 108.U Stokes (for integration of a vector field) App. A, Table 3.m Taylor (for a function of many variables) 106.1 Subject Index Fourier series Taylor (for a function of one variable) 106 E, App. A, Table 9.1V theoretical 19.E 0- (on ideles) 6.F trace (on unitary representations) 437.DD transformation (for the generating function of the number of partitions) 328.A transformation (of a theta function) 3.1 transformation (for theta series) 348.L Trotter product 351.F valid 411.G Villat integration App. A, Table 15.VI Wallis App. A, Table 10.VI Watson 39.D, App. A, Table 19.1V Watson-Nicholson App. A, Table 19.111 Weber App. A, Table 19.1V Weber-So nine App A, Table 19.m Weierstrass-Enneper 275.A Weingarten (for isometric immersion) 365.C Weingarten (in theory of surfaces) II1.H, App. A, Table 4.1 Weyl 323 M Weyl character (on representations of compact Lie groups) 248.Z Weyl integral 225.1 Weyrich App. A, Table 19.m Wiener 160.B Wu's App. A, Table 6.V Forti paradox, Burali- 319.B forward analysis 138.C forward difference 304.E, App. A, Table 21 forward emission 325.A forward equation, Kolmogorov 115.A 260.F forward interpolation formula Gauss 223.C Newton 223.C forward type 304.D,F foundation, axiom of 33.B foundations of geometry 155 foundations of mathematics 156 four arithmetic operations 294.A four color conjecture 186.1 four-color problem 157 4-current density 150.B four-group 151.G Fourier, J. B J. 158 Fourier analysis App. A, Table 11 Fourier analysis on the adele group 6.F Fourier-Bessel series 39.D Fourier-Bessel transform 39.D Fourier coefficient 159.A, App. A, Table 11.1 (of an almost periodic function) 18.B (in a Hilbert space) 197.C (in an orthogonal system) 317.A Fourier cosine series App. A, Table 11.1 Fourier cosine transform 160.C, App. A, Table 11.11 Fourier double integral theorem 160.B Fourier-Hermite polynomial 176.1 Fourier hyperfunction 125.BB exponentially decreasing 125.BB modified 125.BB Fourier integral 160.A conjugate 160.D Fourier integral operator 274.C 345.B Fourier inversion formula 160.C Fourier kernel 220.B Fourier-Laplace transform 192.F Fourier reciprocity 160.C Fourier series 159, App. A, Table 11.l 
Subject Index Fourier sine series (of an almost periodic function) 18.B (of a distribution) 125.P (in a Hilbert space) 197.C Fourier sine series App. A, Table 11.1 Fourier sine transform 160 C, App. A, Table 11.11 Fourier single integraJ theorem 160 C Fourier-StieJtjes transform 192.B,0 Fourier theorem (on real roots of an algebraic equation) 10.E Fourier transform 160, App A, Table 11.11 (of a distribution) 1250 (in topological AbeJian groups) 36.L 192.1 discrete 142 D fast 142.D generaJized 220.B inverse 125.0 Fourier ultrahyperfunction 125.BB 4-momentum operators 258.A,D fourth separation axiom 425.Q four-vector 258.C 359.C four-vector, energy-momentum 258.C four-vertex theorem I11.E fractal 246.K fraction(s) continued 83 partial App. A, Table 10.V fractional cutting aJgorithm 215.B fractional factorial design 102.1 balanced 102.1 orthogonal 102.1 fractional group, linear 60.B fractional ideal 67.1 of an algebraic number field 14.E principal 67.K fractional power 378.D fractional programming 264.D fractional step 304 F Fraenkel set theory, Zermelo- 33.A.B fractional transformation, Jinear 74 E frame(s) 90.B (of an affine space) 7.C (of a C''' -manifold) 191.A (in En) II1.B (of a group manifold) 110 A (in projective geometry) 343 C (of a reaJ Jine) 355.E affine 7.C bundle of tangent T- 105.H Darboux 110.B dual 417B family of (on a homogeneous space) 110.A family of, of order I 11O.B Frenet 110.A 1 I I.D Gaussian (of a surface) 111.H k- (in R n ) 199.B method of moving 110.A moving 90 B 111.C 417.B natural moving 417.B normal 110B of order 0 110.C of order 1 110.B,C of order 2 110.B,C of order 3 110.B,C of order 4 110.B orthogonal (in a Euclidean space) 111.B 139.E orthogonaJ k- (in R n ) 199.B orthogonal moving 417.D projective 343.C Stiefel manifold of k- 199.B stochastic moving 406.G 1992 tangent T- 105.H frame bundle orthogonaJ 364.A tangent orthogonal n- 364.A tangent T- 105.H 147.F framed Jink 114.L framing 114.L Frechet axiom 425.Q Frechet curve 246.A Frechet denvative 286.E Frechet differentiable function 286.E Frechet differential 286.E Frechet distance (between surfaces) 246.1 Frechet L-space 87.K Frechet manifoJd 286.K Frechet space (quasinormed space) 37.0 (topological linear space) 424.1 (topoJogicaJ space) 425.CC locally convex 424.1 in the sense of Banach 37.0 in the sense of Bourbaki 424.1 Frechet surface 246.1 Frechet-Uryson space 425.CC Fredholm alternative theorem 68.E 21 7.F Fredholm determinant 217.E Fredholm first minor 217.E Fredholm formuJa 68.L Fredholm integral equation 217.A of the first kind 217.A of the second kind 217.A of the third kind 217.A Fredholm mapping 286.E Fredholm operator 68.F 251.D (in the sense of Grothendieck) 68. K Fredholm Tth minor 217.E Fredholm type integral equation of 217.A integrodifferential equation of 222.A free (discontinuous group) 122.A distribution- 371.A F- (oriented G-manifold) 431.E (F, F')- (oriented G-manifold) 431.E free Abelian group 2.C free additive group 2.E free derivative 235.C free Dirac field 377.C freedom asymptotic 361.B degrees of (of a dynamical system) 271.F n degrees of (sampJing distribution) 374.B,C free energy 340.B 402.G Gibbs 419.C Helmholtz 419.C mean 340.B 402.G free fields 150.A free grammar, context- 31 D free groups 161 free Hamiltonian 351.D free homotopy 202.B free Lagrangian density 150.B freely act 122.A 431.A free module 277 G free product (of groups) 190.M free scalar fieJd 377.C free semigroup 161.A free special Jordan algebra 231.A free vacuum vector 150.C free variable 411 C 
1993 free vector 442.A French empiricism 156.C Frenet formula II1.D Frenet frame 11 O.A II1.D Frenet-Serret formulas (on curves) 111 D, App. A, Table 4.1 frequency (of an oscillation) 318.A (of samples) 396.C 397 B (of a translational flow) 126.L 136.G (of a wave) 446 angular (of a wave) 446 circular (of a simple harmonic motion) 318.B relative (of samples) 396.C frequency distribution 397.B frequency function 397.D frequency response function 421 E Fresnel integral 167.D, App. A, Tables 9.V 19.11 Freudenthal theorem 202.U Friedrichs extension 1121251.1 Friedrichs scheme 304.F Friedrichs theorem 323.H 326.D frieze group 92.F Frobenius algebra 29.H quasi- 29.H Frobenius automorphism (of a prime ideal) 14.K Frobenius group 151 H Frobenius integrability condition 154.B Frobenius method App. A, Table 14.1 Frobenius morphism 450 P Frobenius substitution (of a prime ideal) 14.K Frobenius theorem (on Abelian varieties) 3.D (on matrices with nonnegative entries) 269.N (on polynomials of a matrix) 390.B (on representations of finite groups) 362.G (on total differential equations and on foliations) 154.B 286 H 428.D Frobenius theorem, Perron- 310.H Froissart bound 386.B Froissart-Martin bound 386.B frontier point (of a subset) 425.N front set, analytic wave 274.D front set, wave 274.B 345.A Frostman maximum principle 338.C Froude number 116.B Fubini theorem 221.E Fuchsian form of weight k (or of dimension - k) 32.B Fuchsian function 32.B Fuchsian group 122 C of the first kind 122. C of the second kind 122.C Fuchsian relation 253 A, App A, Table 18.1 Fuchsian type (visibility manifold) 178.F Fuchsian type, equation of 253.A Fuchsoid group 122 C Fuks cohomology, Gel'fand- 105.AA full discrete approximation 304.B full embedding theorem (of an Abelian category) 52.N full group 136.F 258.A full homogeneous Lorentz group 258.A full inhomogeneous Lorentz group 258 A full internationaJ notation 92 E full linear group 60.B full matrix algebra 269 B full Poincare group 258 A full subcategory 52.A fully complete (locally convex space) 424.Y Subject Index Function(s) fully faithful functor 52.H fully normal space 425,X fully transitive 92.C Fulton and Hansen, general connectedness theorem of 16.1 function(s) 165381.C Abelian 3.J absolutely integrable 214.E additive interval 380.B additive set 380.C admissible 46.A 304.B Ahlfors 43.G 77 E algebraic I1.A almost periodic 18 almost periodic, on a group 18.C almost pen OdIC, wIth respect to p l.L almost periodic, in the sense of Bohr 18 B (X-excessive 261.D alternating 337.1 amplitude (of a Fourier integral operator) 274.C 345.B analytic  analytic function(s) analytic almost periodic 18.D analytic operator 37.K Anger 39.G, App. A, Table 19.1V Appell hypergeometric, of two variables 206.D, App. A, Table 18.1 argument 46.A arithmetic 295.A Artin-Hasse 257 H associated Legendre 383.C, App. A, Table 18.m asymptotically developable 30.A automorphic 32 automorphic, with respect to f 32.A b- 125.EE 418.H 'B-measurable 270.1 Baire 84 D Barnes extended hypergeometric 206.C, App. A, Table 18.1 base 304 B Bellman 127.G Bergman kernel 188.G Bessel 39, App. A, Table 19.m beta 174.C, App A, Table 17.1 bispectral density 421.C Borel measurable 270.J boundary 160.E bounded 43.A of bounded variation 166 Busemann 178.F C oo _ (of many variables) 58.B Coo_, slowly increasing 125.0 C'-, in a Coo.manifold 105.G canonical (on a nonsingular curve) 9.E characteristic (of a density function) 397 G characteristic (of a graded R-module) 369.F characteristic (of a meromorphic function) 272.B characteristic (of an n-person cooperative game) 173.D characteristic (for an optical system) 180.C characteristic (of a probability measure) 341.C characteristic (of a subset) 381.C characteristic operator 251.N Chebyshev App. A, Table 20.11 Chebyshev q- 19.G, App. A, Table 20.VII choice 33.B 34.A circular 131.F 432.A class (on a compact group) 69.B 
Subject Index Function(s) ofclassC.,Co,C',COO,orCw 106.K of class C' at a point 105.G of class C' in a Coo -manifold 105.G of class Coo (of many variables) 58.B of class n, 0, 1, C;, or w 84.D cn App. A, Table 16.m completely additive set 380.C completely monotonic 240.E,K completely multiplicative number-theoretic 29S.B complex 16S B complex-valued 16S.B composite 106.1 concave 88.A of confluent type 167 of confluent type and Bessel functions App. A, Table 19 conical App. A, Table 18.11 conjugate IS9.E 160.D conjugate harmonic 193.C constant 381.C continuous (on a metric space) 84 continuous additive interval 380.B convex 88.A coordinate (of a fiber bundle) 147.B coordinate (in the Ritz method) 304.B cosigma 134.H, App. A, Table 16.1V counting (of a meromorphic function) 272.B covariance 386.A 395.A criterion 127.A cross spectral density 421.E cumulative distribution 341.B 342.C cylindrical 39.B, App A, Table 19.111 Daniell-Stone integrable 31O.E decision 398.A decision, space of 398.A Dedekind eta 328.A Dedekind zeta 14.C 4S0.D defining (of a hyperfunction) 12S.V density 397.D derived 106.A digamma 174 B dimension (on a continuous geometry) 8S.A D-integrable loo.D Dirac delta App. A, Table 12.11 Dirichlet 84.D 221.A distance 273.B distribution 168.B 341.B 342.C divisor 29S.C divisor of (on an algebraic curve) 9.C divisor of (on an algebraic variety) 16.M dn App. A, Table 16.m doubly periodic 134.E E- 430.D C- 46.C effectively calculable 3S6.C eigen- (of a boundary value problem) 31S.B eigen- (for an integral equation) 271.F eigen- (of a linear operator) 390.A elementary 131 elementary, of class n 131.A elementary Abelian 3.M elliptic 134323.A,D elliptic, of the first kind 134.G elliptic, of the second kind 134.G elliptic, of the third kind 134.H elliptic cylinder 268.B elliptic irrational 134.A elliptic theta 134.1, App. A, Table 16.11 empirical characteristic 396.C 1994 empirical distribution 250.F 374.E 396.C energy 126.L 279.F energy spectrum 433.C entire 429.A entire algebroidal 17.B envelope power 4oo.F error 167.D, App. A, Table 19.11 estimating 399.D fI- 391.L Euler 29S.C even 165.B explicit 165.C exponential 131.D exponential, with the base a 131. B exponential, of an operator 306.C exponential generating 177.A factorial 174.A family of quasi-analytic S8.A Feller transition 261.B finitely additive set 380.B finite-valued 443.B flat 58.C Frechet differentiable 286.E frequency 397.D frequency response 421.E Fuchsian 32.B gamma IS0.D 174, App. A, Table 17.1 Gel'fand-Shilov generalized 125.S generalized 12S.S generalized divisor 295.C generalized rational 142.B general Mathieu 268.B general recursive 3S6.C,F generating (of an arithmetic function) 295.E generating (of a contact transform<l tion) 82.A 271.F generating (of a sequence of functions) 177.A of Gevrey class 168.B global implicit, theorem 208.D grand partition 402.D Green 188.A 189.B Green (a-order) 45.D Green (of a boundary value problem) 315.B Green, method 402.J Gudermann I 31.F, App. A, Table 16.m half-Bessel 39.B Hamiltonian 219.C 27\.F Hankel 39.B, App. A, Table 19.m harmonic 193 harmonic kernel 188.H hazard 397.0 Heaviside 125.E 306.B, App. A, Table 12.11 Hey zeta 27.F higher transcendental 289.A Hilbert characteristic (of a coheren,: sheaf on a projective variety) 16.E Hilbert modular 32.G Hill 268.E holomorphic 198 holomorphic (of many variables) 21.C holomorphic (on an open set in a complex manifold) 72.A holomorphic, germ of 21.E hyper- 125.V hyperarithmetical 3S6.H hyperbolic 131.F hypergeometric 206.A, App. A, Table 18.1 hypergeometric (of the hyperspherkal dif- ferential equation) 393.E 
1995 hypergeometric, of confluent type 167 A, App. A, Table 19.1 hypergeometric, with matrix argument 206.E identity 381.C implicit 165.C 208 implicit, theorem 208.A 286.G implicit, theorem (in Banach algebra) 36.M implicit, theorem (in locaJly convex spaces) 286.1 impulse 306.B, App. A, Table 12.11 incomplete beta 174.C, App. A, Table 17.1 incomplete gamma 174.A, App. A, Table 17.1 increment 380.B indicator (of a subset) 342.E 376.C inner 43.F integral 429.A integral of, with respect to a volume element (on a C'-manifold) 105.W interpolation 223.A interval 380.A invariant decision 398.E inverse 198.L 381.C inverse analytic 198.L inverse trigonometric 131.E Jacobi elliptic App. A, Table 16.m joint density 397.J Julia exceptional 272.F jump 306.C K-pseudoanalytic 352.B K-quasiregular 352.B k-valued algebroidal 17.A Kelvin 39.G, App A, Table 19.1V kernel 188.G Koebe extremal 438.C Kummer 167.A, App. A, Table 19.1 L-  L-function Lagrangian 271.F 292.A Laguerre App. A, Table 20.VI .!c- 32.C Lame, of the first kind 133.B Lame, of the first species 133.C Lame, of the fourth species 133.C Lame, of the second kind 133.C Lame, of the second species 133 C Lame, of the third species 133.C Lane-Emden 291.F Laplace spherical 393.A Lebesgue measurable 270.J Legendre 393.B, App. A, Table 18.11 likelihood 374.J 399.M likelihood estimating 399.M linear 74.E linear discriminant 280.1 linear fractional 74.£ linear regression 397.H,J 403.D locaIly integrable 168.B logarithmic, to the base a 131.B loss 398.A lower limit 84.C lower semicontinuous (in a set) 84.C Lyapunov 126.F 163.G major 100.F Mangoldt 123.B many-valued 165.B of many variables 165.C Mathieu 268 Mathieu, of the first kind 268.B Mathieu, of the second kind 268.D maximal concentration 341.E mean concentration 341.E Subject Index Function(s) measurable 270.1 measurable vector 308. G meromorphic 21 J 272 meromorphic (on an analytic space) 23.D meromorphic (on a complex manifold) 72.A minimax decision 398.B minor loo.F Mobius 66.C 295.C modified Bessel 39.G, App. A, Table 19.1V modified indicator 341.C modified Mathieu 268.A modified Mathieu, of the first kind 268.D modified Mathieu, of the second kind 268.D modified Mathieu, of the third kind 268.D modular (of a locaIly compact group) 225.D modular, of level N 32.C moment-generating 177.A 341.C monotone 166.A monotone decreasing 166.A monotone increasing 166.A monotonic 166.A Morse 279.B of at most class 1 84.D }1-conformal 352.B multidimensional gamma 374.C multiplicative 32.A multiplicative automorphic 32.A multiplicity (of a mapping) 246.G multivalent 438 multivalued 165.B Nash-Moser implicit, theorem 286.1 n-dimensional distribution 342.C nth derived 106.D n-times continuously differentiable 106.K n-times differentiable 106.D of n variables 165.C nice (on a C'''-manifold) 114.F nondecreasing 166.A nondegenerate theta 3.1 nonincreasing 166.A nontangential maximal 168.B normal (of ordinal numbers) 312.C normal density 397.D nuJl 310.1 number-theoretic 295.A 356.A objective 264.B 307.C odd 165.B operating 192.N order (of a me rom orphic function) 272.B orthogonal 317, App. A, Table 20 orthogonal, Haar system of 317.C orthogonal, Rademacher system of 317.C orthogonal, Walsh system of 317.C outer 43.F po, of Riemann 253.B !D-, of Weierstrass 134.F, App. A, Table 16.1V Painleve transcendental 288.C parabolic cylinder 167.C parametric 102.A 399.A partial 356.E partition 402.D payoff 173.B pentagamma 174.B periodic 134.E phase (ofa Fourier integral operator) 274.C 345.B piecewise continuous 84.B plurisubharmonic 21.G point 380.A polygamma 174.B, App. A, Table 17.1 
Subject Index Function(s) positive real 282.C of positive type I 92.B,J power 400 A primitive 216.C primitive, derivatives and App A, Table 9.1 primitive recursive 356.A,B,F probability generating 341.F proper (of a boundary value problem) 315.B proper convex 88.D propositional 411.C proximity (ofa meromorphic function) 272.B pseudo- 125.C psi 174.B quadratic loss 398.A 399.E quasi-analytic 58.F quasi-analytic, family of 58.A quasi-analytic, set of 58 F quasicontinuous 3381 radial maximal 168.B rank 66 F rapidly decreasing C oo _ 168.B rate distortion 213.E rational, field of 337 H rational, on a variety 16.A rational entire 429 A real 165.B real analytic 106.K 198.H real-valued l6S.B recursive  recursive function(s) regression 397.1 regular 198 regular, on an open set (of a variety) 16.B regular, at a subvariety 16.B representative (of a compact Lie group) 249.U representing (of a predicate) 356.B representing (of a subset) 381 C reproduction 263.A Riemann (of a Cauchy problem) 325.D Riemann integrable 216.A Riemann P App. A, Tables 14.1 18.1 Riemann theta 3.L Riemann (- 450.B right continuous 84.B right majorizing 316.E right superior 316 E risk 398.A sample 407.A sample covariance 395.G with scattered zeros 208.C schlicht 438.A Schwinger 150.F selection 354 E self-reciprocal 220.B semicontinuous (at a point) 84.C sequential decision 398.F set 380 of several variables 106 1,1 shape 223.G Siegel modular, of degree n 32.F (J-, of Weierstrass 134 F, App. A, Table 16.1V simple 221.B 443.B simple loss 398.A simplest Chebyshev q- 19.G simply periodic 134 E single-valued 165.B singular inner function 43 F slope 46.C sn App A, Table 16.m special App. A, Table 14.11 spec'ial, of confluent type 389.A 1996 special, of eIlipsoidal type 389.A special, of hyper geometric type ::89.A spherical (on a homogeneous space) 437.Y spherical Bessel 39.B spherical harmonic 193.C spheroidal wave 133.E standard defining 125.Z stationary 46.B statistical decision 398.A stream 205.B strictly concave 88.A strictly convex 88.A strictly decreasing 166.A strictly increasing 166.A strictly monotone 166.A strictly monotone (of ordinal numbers) 312.C strictly monotone decreasing 166.A strictly monotone increasing 166 A strictly monotonic 166.A structure 191.C Struve 39.G, App. A, Table 19.1'1 subharmonic 193.A super harmonic 193.P supporting 125.0 symmetric 337.1 Szego kernel 188.H T- 150.D test 130.DD 4oo.A tetragamma 174.B Theodorsen 39.E theory of 198.Q theory of, of a complex variable 198.Q theta 134.1 theta (on a complex torus) 3.1 time ordered 150.D torus App. A, Table 18.m transcendental, of Painleve 288.C transcendental entire 429.A transcendental meromorphic 272.A transfer 86. D transfinite logical choice 411.J transition (of a fiber bundle) 147 B transition (of a Markov chain) 260.A 261.B trigamma 174.B trigonometric 432.A, App. A, Table 2 truncated Wightman 150.D truth 341.A 411.E ultradifferentiable 168.B uniformly almost periodic 18.B unit 306.B, App. A, Table 12.11 universalJy measurable 270.L upper limit 84.C upper semicontinuous 84.C value 108.B on a variety 16.A von Neumann 39.B 188.H, App. A, Table 19.m Wagner 39.E wave 351.D Weber 39.G 167.C, App. A, Tables 19.1V, 20.lV Weierstrass elliptic App. A, Table 16.1V Weierstrass!D- 134.F, App. A, Tlble 16.1V Weierstrass sigma 134.F, App. A, Table 16.1V weight (interpolatory) 299.A weight (for the mean of a functionl 211.C weight (in orthogonality) 317.A Whittaker 167.B, App. A, Table 19.11 Wightman 150.D I-valued holomorphic 251.G zeta zeta functions 
1997 zonal spherical (on a homogeneous space) 437.Y functional 46.A 162 165.B additive (of a Markov process) 261.E algebraic linear 424.B analytic 168.C areal 334.B bilinear 424 G Brownian 176 1 charactelistic (of a probability distribution) 407.C Dirichlet 334.C Douglas 334.C linear 37.CI97.F424B martingale additive 261.E multiplicative (of a Markov process) 261.E multiplicative, transformation by (in Markov process) 261.F perfect additive 261.E subadditive 88.B supporting (of a convex set) 89.G Yang-MilJs 80.Q functional analysis 162 functional analysis, nonlinear 286 functional cohomology operation 202.S functional-differential equation 163 system of 163 E functional equation Abel 388.D approximate (of zeta function) 450.B Schroder 388.D special 388.A of zeta function 450.B function algebra 164.A functionaJly dependent (components of mapping) 208.C of class C' 208.C functional model 251 N functional paper 19.D functional <I>-operation 202.S functional relation 208.C of class C' 208.C of gamma function 174.A function element 198.1 339.A inverse 198.L in the wider sense 198.0 function field (of an algebraic curve over a field) 9.C (of an algebraic variety) 16.A Abelian 3.1 algebraic, over k of dimension 1 9.D algebraic, over k of transcendence degree 1 9.D algebraic, in n variables 149 K eJliptic 9.D rational, in n variables 149.K function group 234.A function matrix rational 86.D transfer 86.B functions on a variety 16 A function space(s) 168435.D test 125.S function symbol 411.H function-theoretic nuIl sets 169 function variable 411.H functor 52.H 8- 200.1 8*- 2001 additive 52.N Subject Index Fundamental period parallelogram adjoint 52.K cohomological 200.1 connected seq uences of 200.1 contravariant 52.H covariant 52.H deri ved 200.1 exact 52.N 200.1 faithful 52.H forgetful 52.1 fully faithful 52.H half-exact 200.1 homological 200.1 left adjoint 52.K left balanced 200.1 left derived 2oo.l,Q left exact 200.1 partial derived 200.1 relative derived 2oo.K representable 53.L right adjoint 52.K right balanced 200.1 right derived 2oo.l,Q right exact 200.1 spectral 200J universal 8- 200.1 functorial isomorphism 53J functorial morphism 53.J fundamental absolute neighborhood retract (FANR) 382.C fundamental absolute retract (FAR) 382.C fundamental ceIl (of a symmetric Riemann space) 413.F fundamental class (of an Eilenberg-MacLane space) 70.F (of a Poincare pair) 114.J (of a Thom complex) 114.G with coefficient Z2 65.B fundamental conjecture (in topology) 70.C fundamental curve (with respect to a birational mapping) 16.1 fundamental cutset matrix 186.G fundamental cycle (of an oriented pseudomanifold) 65.B (in a resolution of a singularity) 418.C fundamental differential invariants (of a surface) 110.B fundamental discriminant 295.D fundamental domain 234.C fundamental exact sequence 2oo.M fundamental figure(s) 343.B linear 343 B fundamental form (associated with a Hermitian metric) 232.A (of a Finsler space) IS2A first 111 G, App. A, Table 4.1 second 111.G 360.G 365.C fundamental group 170 algebraic 16 U fundamental homology class 201.N around K 201.N fundamental invariants (of a space with a Lie trans- formation group) 110A fundamental kernel 320.H fundamental lemma in the calculus of variations 46.B Neyman-Pearson 4oo.B fundamental open set 122.B fundamental operator 163.H fundamental period (of a periodic function) 134.E fundamental period paralJelogram 134.E 
Subject Index Fundamental point fundamental point (with respect to a birational mapping) 16.1 (of a projective space) 343.C fundamental quantities first l11.H second l11.H fundamental region (of a discrete transformation group) 122.B Ford 234.C fundamental relations (of gamma functions) 174.A (among the generators of a group) 161.A (in thermodynamics) 419.A fundamental retract 382 C fundamental root system (of a semisimple Lie algebra) 248 N fundamental sequence of cross cuts (in a simply connected domain) 333.B ofrational numbers 294.E of real numbers 355.B in a uniform space 436.G fundamental set (of a transformation group) 122.B fundamental solution(s) (of a differential operator) 112.B 189.C (of an eIliptic equation) 323.B (of an evolution equation) 189.C (of a hyperbolic equation) 325.D (ofa parabolic equation) 327.D (of a partial differential equation) 320.H system of (of a system ofIinear equations) 269.M fundamental space 125.S fundamental subvariety (with respect to a birational mapping) 16.1 fundamental system (of eigenfunctions to an eigenvalue for an integral equation) 217.F (for a linear difference equation) 104.D (of a root system) 13.1 of irreducible representations (of a complex semisimple Lie algebra) 248.W of neighborhoods 425.E of solutions (of a homogeneous linear ordinary dilTerential equation) 252.B of solutions (of a homogeneous system of linear differential equations of the first order) 252.H fundamental tensor(s) (of a Finsler space) 152.A (ofa Riemannian manifold) 364.A Lie 413.G second 417.F fundamental theorem(s) of algebra 10 E Bonnet (on surfaces) l11.H of calculus 216 C of elementary number theory 297.C the first (of Morse theory) 279.D Gentzen 41 I.J Nevanlinna first 272.B Nevanlinna second 272.E of the principal order 0 14.C of projective algebraic varieties 72.F of projective geometry 343.D of proper mapping 16,X the second (of Morse theory) 279.D of Stein manifolds 21 L 72.E on symmetric polynomials 337.1 of the theory of curves 111.D of the theory of surfaces l11.G 1998 Thom 114.H of the topology of surfaces 41O.B of ultraproducts 276.E fundamental tie set matrix 186.G fundamental unit 414.A 116 fundamental units (of an algebraic num ber field) 14.D fundamental vector field 191.A fundamental vectors (in a vector space) 442.A future cone 258.A G y  gamma GL(n, k) (general linear group) 60.B y-matrices, Dirac 415.G y-perfect 186.J y-perfectness 186.1 y-point of the kth order (of a holomorphic function) 198.C f-equivalent (points) 122.A f-extension 14.L f .'I'-foliation 154.H f-structure 90.D 105.Y f.'l'-structure 154.H f; -structure 154.E g-Iattice (of a separable algebra) 27.A integral 27.A normal 27.A G-bundle 147.B G-connections, Yang-MiHs 80.Q G-fiber homotopy type, spherical 431.F G-group 172.1 G-invariant (element) 226.A (statistics) 396.1 almost 396.1 G-invariant measure 225.B G-isomorphism 191.A G-manifold 431.C oriented 431.E G-mapping (G-map) 362.B 431.A G-set k-ply transitive 362.B left 362.B quotient 362.B right 362.B simply transitive 362.B sub. 362.B G-space 178.H 431.A with nonpositive curvature 178.H G-stationary strictly 395.1 weakly 395.1 G-structure 191 G-subset 362.B G-surface 178.H G-vector bundle 237.H G.-set 270.C gain, heat 419.A Galerkin method 290.E 303.1 304.B Galilei transformation 359.C Galois, E. 171 Galois cohomology 172.1 200.N Galois equation 172.G Galois extension (of a field) 172.B Galois field 149.M Galois group of an algebraic equation 172.G of a Galois extension 172.B 
1999 of a polynomial 172.G Galois theory 172 of differential fields 113 Galton-Watson process 44.B multi (k)-type 44.C game bimatrix 173.C constant-sum 173.A cooperative 173.A differential 108 general-sum 173.A with infinitely many players 173.D matrix 173.C multistage 173.C noncooperative 173.A n-person, in extensive form 173.B n-person, in normal form 173.C n-person cooperative, in characteristic- function form 173.D in partition-function form 173.D without side payments 173.D zero-sum 173.A zero-sum two-person 108.B game-theoretic model 307 C game theory 173 gamma density 397.D gamma distribution 341.D, App. A, Table 22 gamma function 150.D 174, App. A, Table 17.1 incomplete 174.A, App. A, Table 17.1 multidimensional 374.C gamma function and related functions App. A, Table 17 gap (at a point) 84.B gap theorem 339.D Hadamard 339.D gap value (of a point on a Riemann surface) 11 D Garding, hyperbolic in the sense of 325.F Garding inequality 112.G 323.H Garnier system 253.£ Garside-Jarratt-Mack method 301.N gases, kinetic theory of 402.B Gateaux derivative 286.£ Gateaux differentiable 286.£ gauge theory 105.G lattice 150.G gauge transformation (in electromagnetism) 130.A (in a lattice spin system) 402.G (of a principal fiber bundle) 80.Q (in unified field theory) 434.B of the first kind 150.B Gauss, C. F. 175 Gauss-Argand plane 74.C Gauss backward interpolation formula 223.C Gauss-Bonnet formula II1.H 364.D, App. A, Table 4.1 Gauss-Bonnet-Sasaki-Nitsche formula 275.C Gauss-Chebyshev formula (in numerical integration) 299.A Gauss circle problem 242.A Gauss criterion App. A, Table 10.11 Gauss equation (on an isometric immersion) 365.C (on surfaces) l11.H Gauss formula (on Gauss sum) 295.D (on harmonic functions) 193.D (for integration of a vector field) App A, Table 3.m (for isometric immersion) 365.C Subject Index Gel'fand- Pettis integrable (in numerical integration) 299.A (for the surface integral) 94.F (in theory of surfaces) l11.H, App. A, Table 4.1 Gauss forward interpolation formula 223.C Gauss-Hermite formula (in numerical integration) 299.A Gauss hypergeometric differential equation App. A, Table 14.11 Gaussian (system ofrandom variables) 176.A complex 176.B Gaussian curvature (of a surface) II1.H, App. A, Table 4.1 total (of a surface) l11.H Gaussian differential equation 206.A Gaussian distribution 341.D standard 176.A Gaussian elimination 302.B Gaussian frame (of a surface) II1.H Gaussian integer 14.U Gaussian plane 74.C Gaussian process 176 342.A complex 176.C N -pIe Markov 176.F N -pie Markov, in the restricted sense 176.F stationary 176.C Gaussian random field Markov, in the McKean sense 176.F Markov, in the Nelson sense 176.F Gaussian random measure 407.D Gaussian random variable, complex 176.B Gaussian source, autoregressive 213.E Gaussian sum 295.D 450.C local 450.F Gaussian system (of random variables) 176.A complex 176.B Gaussian white noise 407.C Gauss integral 338.1 Gauss integration formula (in the narrow sense) 299.A Gauss interpolation formula App. A, Table 21 Gauss-Jordan elimination 302.B Gauss kernel 327.D Gauss-Laguerre formula (in numerical integration) 299.A Gauss-Manin connection (of a variety) 16.V Gauss mapping (in geometric optics) 180.B Gauss-Markov theorem 403.£ Gauss-Seidel method 302.C Gauss series 206.A Gauss symbol 83.A Gauss theorem (on algebraic closedness ofC) 1O.E (on primitive polynomials) 337.D Gauss theorem a egregium (on surfaces) l11.H Gauss transformation App. A, Table 16.111 Gauss variational problem 338.J G.c.D. (greatest common divisor) 67.H 297.A GCR algebra 36.H Gegenbauer polynomials 317.D 393.£, App. A, Table 20.1 Gel'fand-Fuks cohomology 105.AA Gel'fand integrable 443.F Gel'fand-Levitan-Marchenko equation (for KdV equations) 387.D (for nonlinear lattice) 287.C Gel'fand-Mazur theorem 36.E Gel'fand-Nalmark theorem 36.G Gel'fand-Pettis integrable 443.F 
Subject Index Gel'fand-Pettis integral GeI'fand- Pettis integral 443.F Gel'fand-Pyatetskil-Shapiro reciprocity law 437.DD Gel'fand representation (of a commutative Banach algebra) 36.E Gel'fand-Shilov generalized function 125.S Gel'fand theorem, Stone- 168.B Gel'fand topology 36.E GeI'fand transform 36.E Gel'fand triplet 424.T GeIl-Mann formula, Nakano-Nishijima- 132.A general addition theorem 388 C general analytic space 23.G general angle 139.D general associative law (for group composition) 190.C general boundary value problem 323.H general Cantor set 79.D general Cayley algebra 54 general connectedness theorem due to Fulton and Hansen 16.1 general curve 93.D general derivative (of a set function) 380.D general geometry of paths 152.C generalization (in etale topology) 16.AA generalized absolute continuity (*) 100 C generalized absolute continuity in the restricted sense 100 C generalized absolutely continuous function l00.C generalized Bayes solution 398.B generalized BernouIli shift 136.D generalized Bernshtein problem 275.F generalized Boolean algebra 42.B generalized Boolean ring 42.C generalized Borel embedding 384.D generalized Clifford torus 275.F generalized cohomology theories 201.A generalized cohomology theory with E-coefficient 202.T generalized conformal mapping 246.1 generalized continuum hypothesis 49.D generalized convergence 331.C generalized convolution (of distributions) 125.M generalized coordinates (in analytical dynamics) 271.F generalized cylindrical coordinates App. A, Table 3 V generalized decomposition number (of a finite group) 3621 generalized derivative 125.E generalized distance, Mahalanobis 280.E generalized distribution, Beuding 125.U generalized divisor function 295.C generalized eigenfunction 375 C generalized eigenspace (of a linear operator) 390.B generalized eigenvalue 375.C generalized eigenvalue problem 298.G generalized eigenvector 390 B generalized Eisenstein series 450.T generalized Fourier transform 220.B generalized function 125.S generalized Hardy class 164.G generalized helix II1.F generalized homology theory 201.A generalized homology theory with E-coefficient 202.T generalized Hopf homomorphism 202.U generalized Hopf invariant 202.Q generalized Hurewicz theorem 202.N 2000 generalized isoperimetric problem 46 A 228.A generalized Jacobian (of a set function) 246.H generalized Jacobian variety 9.F l1.C generalized Lame differential equation 167.E generalized least squares estimator 403.E generalized Lebesgue measure 270.E generalized Levi form 274.G generalized limit 37.F generalized minimal immersion 275.B generalized module 143.B generalized momentum 271.F generalized nilpotent (operator) 251.F generalized nilpotent element 36.E generalized nilpotent group 190.K generalized peak point 164.D generalized peak set 164.D generalized Pfaff problem 428.B generalized Poincare conjecture 65.C generalized quaternion group 151.B generalized rational function 142.B generalized Riccati differential equation App. A, Table 141 generalized Riemann-Roch theorem (on algebraic curves) 9.F generalized Schlomilch series 39.C generalized solvable group 190.K generalized stochastic process 407.C generalized suspension theorem 202 T generalized Tauberian theorem 36.L 160.G of Wiener 192.D generalized topological space 425.D generalized trigonometric polynomial 18.B generalized trigonometric series 18.B generalized uniserial algebra 29.1 generalized valuation 439.B generalized variance 280.E 397.J generalized variance, sample 280 E generalized wave operator 375.B generalized Whitehead theorem 202.N general knot conjecture 235.B general law of reciprocity 14.0 Artin 59.C general linear group 60.B 226.B 256.D of degree n over K 60 B 226 B 256.D over a noncommutative field 60.0 projective 60.B general linear hypothesis 400.C general linear Lie algebra 248.A general lower derivative (of a set function) 380.D general Markov chain 260 J general Mathieu function 268.B general Navier-Stokes equations 204.F general position (complexes) 70.B (PL mappings) 65.D (in a projective space) 343.B theorem 65.D general principle ofrelativity 358 general projective geometry 343.B general random walk 260.A general recursive function 356.C,F general recursive predicate 356.C general recursive set 97 general Runge-Kutta method 303.D general sense, derivable in the 380.D general set theory 33.B general solution (of a difference equation) 104.D (of an ordinary differential equation) 313.A (of a partial differential equation) 320 C 
2001 (of a system of partial differential equations) 428.B general sum 173.A general theory of perturbations 420.E ofrelativity 358 general topology 426 general type 72.H surface of 72.K general uniformization theorem 367.G general upper derivative (of a set function) 380.D generate (an A-module) 277.D (a completely additive class) 270 B (a field over k) 149.D (a filter) 87.1 (an ideal) 67.B (a linear subspace) 256.F (a subgroup) 190.C (a subring) 368.E (a topology) 425.F generated, finitely 277.D generating curve I I I.I generating element (with .respect to a self-adjoint operator) 390.G generating function(s) (of an arithmetic function) 295.E (of a canonical transformation) 82.B (of a contact transformation) 82.A (of an infinitesimal transformation) 27l.F (of a sequence of functions) 177.A exponential 177.A factorial cumulant- 397.G factorial moment 397 G joint moment 397.1,1 moment- 177.A 341.C 397 G,J probability- 34I.F 397.G generating line (of a circular cone) 78.A (of a quadric hypersurface) 343.E (of a quadric surfacc) 350.B (of a ruled surface in differential geometry) 11l.I generating representation (of a compact Lie group) 249.U generating space (of a quadric hypersurface) 343.E generator (of an Abelian category) 200.1 (of a cyclic code) 63.D (of an endomorphism) 136.E (of a group) 190.C (of a Markov process) 26l.C (ofasemigroup) 378.D Bott 237.D Oynkin representation of 261 B F D. 136.E infinitesimal 378.B system of (of an A -module) 277.0 topological (of a compact Abelian group) 136D two-sided 136.E generic (property) 126.H generic point 16.A Gcntzcn fundamental theorem 411.J genuine solution 323 G genus (of an algebraic curve) 9.C (of a differential ideal) 428 E (of an ideal group) 59.E (in integral representation theory) 362.K Subject Index Geometric realization (of the s.s. complex) (of a knot) 235.A (of a lattice group) 13.P (of a quadratic field) 347.F (of a quadratic form) 348.H (of a surface) 41O.B (of a transcendental integral function) 429.B arithmetic (of an algebraic curve) 9.F arithmetic (of an algebraic surface) 15.C arithmetic (of a complete variety) 16.E arithmetic, of a divisor (on an algebraic surface) 15.C boundary 41O.B effective (of an algebraic curve) 9.C of the function field Klk 9.D geometric (of an algebraic surface) 15 E geometric (of a complete variety) 16.0 geometric (of a singular point) 418.C i- 15.E linear 15.G measure of (of a positive definite symmetric matrix) 348.K SJ (of an algebraic curve) 9.F principal (of an ideal group) 59.E principal (of a quadratic field) 347.F virtual arithmetic (of a divisor) 16.E geocentric parallax 392 Geocze area (of a surface) 246.E Geocze problem 246.0 geodesic 80.L,I 11l.H 178 364.C, App. A, Table 4.1 null 359.D totally, submanifold 365.D geodesic arc 178.H 364.B geodesic coordinates 80.J geodesic coordinate system in the weak sense 232.A geodesic correspondence (between surfaces) 11l.I geodesic curvature 111.1, App. A, Table 4.1 geodesic flow 126.L 136.G geodesic line 178.H geodesic point 11l.H 365.D geodesic polar coordinates 90.C geodesic triangle 178.A geodesic variation 178.A geometrically finite 234.C geometrically reductive 226.B geometrically simple eigenvalue 390.A geometrical mean 397.C geometric complex 70.B geometric construction problem 179.A geometric crystal class 92.B geometric difference equation 104.G geometric dimension (of a vector bundle) 114.D geometric distribution 341.D, App. A, Table 22 geometric fiber (of a morphism) 16 D geometric genus of an algebraic surface 15.E of a complete variety 16.0 of a singular point 418.C geometric mean (of a function) 21l.C (of numbers) 211.C geometric multiplicity (of an eigenvalue) 390.A geometric number theory 296.B geometric optics 180 geometric point (of a scheme) 16.D geometric probability 218.A geometric programming 264.0 geometric progression 379.1, App. A, Table IO.VI geometric quotient 16.W geometric realization (of the s.s. complex) 70.E 
Subject Index Geometric series geometric series 379.B, App. A, Table 10.1 geometry 181 affine 7 affine, in the narrower sense 7.E affine differential 11O.C algebraic 12.A analytic 181 circle 76.C conformal 76.A conformal differential 110 D continuous 85.A differential 109 differential, of curves and surfaces 111 differential, in specific spaces 110 elliptic 285.A Euclidean, in the wider sense 139.B finite-dimensional projective 343.B formal 16,X general, of paths 152.C general projective 343.B hyperbolic 285 A hypersphere 76.C integral 218.A Laguerre 76.B Lobachevskii non-Euclidean 285.A Mobius 76.A natural 110.A n-dimensional Euclidean 139.B 181 non-Archimedean 155.0 non-Desarguesian 155.D 343.C non-Euclidean 285.A parabolic 285 A plane 181 projective 343.B projective, of paths 109 projective differential 11O.B pseudoconformal 344.A pure 181 Riemannian 137, App. A, Table 4.II Riemann non-Euclidean 285.A solid 181 space 181 of a space subordinate to a group 137 spectral 391 sphere 76 C spherical 285.0 on a surface 111.0 synthetic 181 wave 434.C geometry of numbers 182 germ(s) 383.B of an analytic set 23.B of a COO-function at the origin 58.C of a holomorphic function 21.E irreducible 23.B reducible 23.B sheaf of, of holomorphic functions 23.C sheaf of, of regular functions 16 B Oermain curvature (of a surface) II1.H, App. A, Table 4.1 Oevrey class 58.0 125.U 325.1 function of 168 B ghost component (of an infinite-dimensional vector) 449.A ghost, Faddeev-Popov 132.C 150.0 OHS inequality 212.A Oibbs distribution, equilibrium 136.C Oibbs-Duhem relation 419.B Oibbs free energy 419 C minimum principle 419.C 2002 Oibbs measure 136.C Oibbs phenomenon 159.D Oibbs state 340.B Oilbert-Sacks bound, Varsharmov 63.B Oill method, Runge-Kutta- 303.D Oini coefficient of concentration 397.E Oiraud theorem 323.C Oirsanov theorem 406.B Oirsanov transformation 406.B Oirshick-Savage theorem 399.F Oivens method 298.D Oivens transformation 302.E OKS first inequality 212.A OKS second inequality 212.A Olashow-Weinberg-Salam model 132.D Olauert approximation, Prandtl- 205.B Olauert law of similarity, Prandtl- 205.D g.l.b. (greatest lower bound) 311.B Oleason part (for a function algebra) 164.F Oleason theorem 351.L glide operation 92.E glide reflection 92.E Olivenko-Cantelli theorem 374.E global analysis 183 global dimension (of an analytic set) 23.B (of a ring) 200.K left(of a ring) 200.K right (of a ring) 200.K weak (of a ring) 200.K global discretization error 303.B global Hecke algebra 450.0 global implicit function theorem 208.D globally asymptotically stable 126.F globally symmetric Riemannian space 412.A global property (in differential geometry) 109 global roundoff error 303.B global truncation error 303.B gluing theorem 2l.I gluons 132.D ONS construction 308.D Oodbillion- Vey ciasses 154.0 Oodel, Kurt 184 Oodel completeness theorem 411.J Oodel incompleteness theorem 156.E Oodel number(s) 185 356.C,E Oodel numbering 185.A Oodel set theory 33.C Bernays- 33.A Ooldbach problem 4.C Oolden-Thompson inequality 212.B Ooldstein method, Ince- 268.C Ooldstone boson, Nambu- 132.C Ooldstone theorem 132.C Oomory cut 215.B Ooodner-Kelley theorem, Nachbin- 37.M goodness of fit 397.Q test 401.E good reduction (of an Abelian variety) 3.N potential (of an Abelian variety) 3.N good resolution 418.C Ooppa code 63.E Oordan coefficients, Clebsch- 258.B 353.B Oordan equation, Klein- 351.0 377.C Oordon equation, Sine- 387.A Oorenstein ring 200.K Ooursat kernel, Pincherle- 217.F Ooursat theorem 198.B grad (gradient) 442.D graded algebra 203.B 
2003 graded A-module 200.B graded coalgebra 203.B graded Hopf algebra 203.C graded ideal 369.B graded module connected 203.B dual 203.B graded object 200.B graded ring 369 B associated 284.D gradient 442 D, App. A, Table 3.11 gradient method 292.E Arrow-Hurwicz-Uzawa 292.E conjugate (CO) 302.D gradient projection method, Rosen 292.E Oraeffe method 301.N Oramian (determinant) 103.0 208.E grammar Chomsky 31.D context-free 31.0 context-sensitive 31.D regular 31. D Oram-Schmidt orthogonalization 317.A Oram theorem 226.E grand canonical ensemble 402.0 grand partition function 402.D graph 186.B (of a knot) App. A, Table 7 (= linear graph) 282.A (of a mapping) 381.C (of a meromorphic mapping) 23.0 (of an operator) 251.B (of a relation) 358.A bipartite 186.C complete 186.C complete bipartite 186.C direct 186.B Euler 186.F Feynman 146.A,B 386.C labeled 186.B linear 282.A oriented 186.B partial 186.C planar 186.H regular 186.C reoriented 186.B section 186.C sub- 186 D undirected 186.B unicursal, theorem (Euler's) 186.F unlabeled 186.B unoriented 186.B graphic 66.H graphical calculation 19.B graphical differentiation 19.B graphical integration 19.B graphical mechanics 19.0 graphical method of statistical inference 19.B graph norm 251.D graph theorem, closed 37.1 251.D 424,X graph theory 186 Orashoff number 116.B Orassmann algebra (of a linear space) 256.0 Orassmann coordinates (in a Orassmann manifold) 90.B Orassmann manifold 119.B 427.D complex 199.B formed by oriented subspaces 199.B infinite 147.1 Subject Index Group(s) real 199.B Orauert theorem (on proper holomorphic mappings) 23.E 72.E gravitation, law of universal 271.B gravitational units, system of 414.B gravity, center of 271.E gravity wave 205.F long 205.F grazmg ray 325.L great circle (of a sphere) 140 greater than (another compactification) 207.B greatest common divisor 67.H 297.A greatest element (in an ordered set) 311.B greatest lower bound (of an ordered set) 31O.C 311.B greedy algorithm 66.0 Oreek mathematics 187 Oreek quadratrix 187 Oreek quadrivium 187 Oreek three big problems 187 Oreen formula (for differential operators) App. A, Table 15.VI (for harmonic functions) 193.0 (for Laplace operator) App. A, Tables 3.m, 4.II (for ordinary differential equations) 252.K (for partial differential equations of parabolic type) 327.D (on the plane) 94.F Oreen function method 402.J Oreen functions 188 189.B (a-order) 45.0 (of a boundary value problem) 315.B Oreen line 193.1 Oreen measure 193.1 Oreen operator 189.A,B 194.C Oreen space 193.N Oreen-Stokes formula 94.F Oreen tensor 188.E Oreen theorem 105.W Oriffith first inequality 212.A Oriffith second inequality 212.A Oross area (of a Borel set) 246.0 Or6ssencharakter 6.D Hecke L-function with 450.F Oross theorem 272.1 Orothendieck category 200.1 Orothendieck construction 237.B Orothendieck criterion of completeness 424.L Orothendieck group (of a compact Hausdorff space) 237.B (of a ring) 237.J Orothendieck theorem of Riemann-Roch type 366.D Orothendieck topology 16.AA ground field (of a linear space) 256.A ground form 226.D covariant with 226.E ground ring (of a module) 277.D group(s) 190.A Abelian 2190.A Abelian linear 60.L absolute homology 201.L additive 2.E 190.A adele (of an algebraic group) 13.P adele (of a linear algebraic group) 6.C adjoint (isogenous to an algebraic group) B.N adjoint (of a Lie algebra) 248.H adjoint (of a Lie group) 249.P of affine transformations 7.E 
Subject Index Group(s) affine Weyl (of a symmetric Riemann space) 413.0 algebra 192.H algebra class 29.E algebraic 13 algebraic fundamental 16 U algebraic homotopy 16.U alternating, of degree n 151.0 automorphism (of a Lie algebra) 248.A of automorphisms (of a group) 190.D *-automorphism 36.K Betti (of a complex) 201.B black and white 92.D black and white point 92 0 boundary 234.B braid 235.F Brauer (of algebra classes) 29 E Brauer (of a commutative ring) 29.K Bravais 92.B bundle (of a fiber bundle) 147.B rc 52.M of canonical transformations 271.F category of 52.B cellular homology 201.F,0 character (of an Abelian group) 2.0 character (of a topological Abelian group) 422.B Chevalley 151.1 of classes of algebraic correspondences 9.H classical 60.A Clifford 61.D closed 362.J coefficient (of a homology group) 201.Q cohomology (of a complex) 200.0 cohomology (of a group) 200.M cohomology (of a Lie algebra) 200.0 cohomology, with coefficient sheaf:F 283.E co homotopy 202.1 of collineations 343.D color point 92.D color symmetry (colored symmetry) 92.D commutative 2.A 190.A commutator (of two elements) 190.H commutator (of two subsets of a group) 190.H compact 69.A completely reducible 190 L complex 60.L complex cobordism 114.H complex orthogonal 60.1 complex special orthogonal 60.1 complex symplectic 60.L of congruence classes modulo m* 14.H of congruent transformations 285.C covering 91 A 423.0 covering transformation 91.A Coxeter 13.R crystallographic 92.A crystallographic space 92.A cyclic 190.C decomposition (of a prime ideal) 14.K defect (of a block of representations) 362.1 defect (of a conjugate class in a group) 362.1 derived (of a group) 190.H difference (of an additive group) 190.C of differentiable structures on combinatorial spheres App. A, Table 6.1 differentiable transformation 431.C dihedral 151.0 direct product 190.L discontinuous, of the first kind 122.B 2004 discontinuous transformation 122.A divisor (of a compact complex manifold) 72.F divisor class (of a Riemann surface) I1.D elementary topological Abelian 422.E elliptic modular 122.D equicontinuous, of class (Co) 378.C equivariant J- 431.C exponential 437.U extension (cohomology of groups) 200.M factor 190.C finite 151.A 190.C finitely generated 190.C finitely presented 161.A of the first kind 122.C formal 13.C four- 151.0 free 161.A free product (of the system of groups) 190.M frieze 92.F Frobenius 151.H Fuchsian 122.C Fuchsoid 122.C full 136.F full linear 60.B full Poincare 258.A function 234.A fundamental (of a topological space) 170 Oalois (of an algebraic equation) 172.0 Oalois (of a Oalois extension) 172.B Oalois (of a polynomial) 172.0 generalized nilpotent 190.K generalized quaternion 151.B generalized solvable 190.K general linear 60.B 226.B general linear (over a noncommutative field) 60.0 Orothendieck (of a compact Hausdorff space) 237.B Orothendieck (ofa ring) 237.J h-cobordism (of homotopy n-spheres) 114.1, App. A, Table 6.1 Hamilton 151.B Hausdorff topological 423.B Hilbert modular 32.0 holonomy 80.D 364.E homogeneous holonomy 364.E homogeneous Lorentz 359 homology (of a chain complex) 201.B homology (of a group) 200.M homology (of a Lie algebra) 200.0 homology (of a polyhedron) 201.D homotopy 202.J hyper- 190.P icosahedral 151.0 ideal, modulo m* 14 H ideal class 14.E 67.K idele 6.C ideIe class 6.D indecomposable 190.L inductive limit 21O.C inductive system of 21O.C inertia (of a finite Oalois extension I 257 D inertia (ofa prime ideal) 14.K infinite 190.C infinite classical 147.1 202.V infinite orthogonal 202.V infinite symplectic 202.V infinite unitary 202.V inhomogeneous Lorentz 359 of inner automorphisms (of a group) 190.D 
2005 of inner automorphisms (of a Lie algebra) 248.H integral homology (of a polyhedron) 201.0 integral homology (of a simplicial complex) 201.C integral singular homology 201.E isotropy 362.8 J- 237.1 of Janko-Ree type 151.J k- 13.A K- (of a compact Hausdorff space) 237.8 Klein four- 151.G Kleinian 122.C 243 A knot 235.B L- 450.N lattice 182.8 lattice (of a crystallographic group) 92.A lattice-ordered Archimedean 243 G Lie 249.A 423.M Lie transformation 431.C linear fractional 60.8 linear isotropy (at a point) 199 A linear simple 151.I link 235.0 little 258.C local Lie 423.L local Lie, of local transformations 431.G locally Euclidean 423.M local one-parameter, of local transformations 105.N Lorentz 60 J 258 359.8 magnetic 92 0 Mathieu 151.H matric 226.8 matrix 226.8 maximally almost periodic 18.1 minimally almost periodic 18.1 mixed 190.P Mobius transformation 76 A modular 122 0 monodromy (of an n-fold covering) 91.A monodromy (of a system of linear ordinary differential equations) 253.B monothetic 136.0 of motions 139.8 of motions in the wider sense 139.8 multiplicative 190.A multiplicative (of a field) 149.A 190.8 Neron-Severi (of a variety) 15.0 16.P nilpotent 151.C 190.1 octahedral 151.G n 190.E one-parameter, of transformations (of a Coo_ manifold) 105.N one-parameter, of transformations of class C' 126.B one-parameter semi-, of class Co 378.8 one-parameter sub- 249 Q ordered 243.G ordered additive 439.8 of orientation-preserving diffeomorphisms 1141 oriented cobordism 114.H of oriented differentiable structures on a combinatorial sphere 114.1 orthogonal 60.1 139.B 151 1 orthogonal (over a field with respect to a quadratic form) 60.K orthogonal (over a noncommlltative field) 60.0 Subject Index Group(s) orthogonal transformation 60.1 of outer automorphisms (of a group) 190.0 of outer automorphisms (of a Lie algebra) 248.H p- 151.8 periodic 2.A permutation 190.8 permutation, of degree n 151 G re- 151.F re-solvable 151.F Picard (of a commutative ring) 237.J Poincare 170 258.A point (of a crystallographic group) 92.A polychromatic 92.0 principal isotropy 431.C profinite 210.C projective class 200.K projective general linear 60.8 projective limit 21O.C projective special linear 60.8,0 projective special unitary 60.H projective symplectic 60.L projective system of 210.C of projective transformations 343.0 projective unitary 60.F proper Lorentz 60.J 258.A 359.B proper orthogonal 60.1 258.A pseudo- (of topological transformations) 105 Y p-torsion, of exceptional groups App. A, Table 6.IV qth homology 201.8 quasi- 190 P quasi-Fuchsian 234.8 quaternion 151.8 quaternion unimodular 412.0 quotient 190.C quotient (of a topological group) 423.E of quotients (of a commutative semigroup) 190.P ramification (of a finite Galois extension) 257.0 ramification (of a prime ideal) 14.K rational cohomology 200.0 reductive 13.Q Ree 15t.I of Ree type 15l.J regular polyhedral 151.0 relative homotopy 202.K relative singular homology 201.L renormalization II1.A restricted holonomy 364.E restricted homogeneous holonomy 364.E Riemann-Roch 366.0 Riesz 36.H rotation 60.1 258.A Schottky 234.8 semi- 190.P 396.A separated topological 423.8 sequence of factor (of a normal chain) 190.0 shape 382.C Siegel modular (of degree n) 32.F simple 190.C simply connected (isogenous to an algebraic group) 13.N singular homology 201.G,L solvable 151.01901 space 92.A special Clifford 61.0 special linear 60.8 
Subject Index Group algebra special linear (over a noncommutative field) 60.0 special orthogonal 60.l,K special unitary 60.F,H,0 spinor 60.1 61.0 stability 362.B stable homotopy 202.T stable homotopy (of classical group) 202.V stable homotopy (of the Thom spectrum) 114.G Steinberg (of a ring) 237.1 structure (of a fiber bundle) 147.B supersolvable 151.0 Suzuki 15l.I symmetric 190.B symmetric, of degree n 151.G symplectic 60.L 15t.I symplectic (over a noncom mutative field) 600 symplectic transformation 60.L Tate-Shafarevich 118.0 tetrahedral 151. G theoretic approach 215.C Tits simple 151.I T 2 -topological 423.B topological 423 topological Abelian 422.A topological transformation 431.A torsion 2.A torsion (of a finite simplicial complex) 201.B torus 422.E totally ordered 243.G totally ordered additive 439.B total monodromy 418.F transformation 431, App. A, Table 14JII transitive permutation 151.H of translations 7.E 258.A of twisted type 15l.I type I 308.L 437.E underlying (of topological group) 423.A unimodular 60.B unimodular locally compact 225.C unit (of an algebraic number field) 14.0 unitary 60.F 15l.I unitary (over a field) 6O.H unitary (relative to an 8-Hermitian form) 60.0 unitary symplectic 60.L unitary transformation 60.F universal covering 91.B 423.0 unoriented cobordism 114.H value (of a valuation) 439.B,C vector 422.E Wall 114.J WC (Weil-Chiitelet) 118.0 weakly wandering under 136.F web 234.B weight 92.C Weil 6.E 450.H Weil-Chiitelet 118.0 Weyl (of an algebraic group) 13.H Weyl (of a BN pair) 13.R Weyl (of a Coxeter complex) 13.R Weyl (of a root system) 13.1 Weyl (of a semisimple Lie algebra) 248.R Weyl (of a symmetric Riemannian space) 413.F Weyl, affine 413.F Weyl, k- 13.Q White 92.0 Whitehead (of a ring) 237.1 2006 Witt (of nondegenerate quadratic forms) 348.E Zassenhaus 151.H group algebra 29.C 36.L C*- 36.L group code 63.C group extension 200.M grouplike 203.F group manifold (of a Lie transformation group) ltO.A group measure space construction 136.F group minimization problem 215.C group object (in a category) 52.M groupoid 190.P hyper- 190 group pair (of topological Abelian groups) 422.1 orthogonal 422.1 group ring (of a compact group) 69.A group scheme 16.H group system 235.B group theorem (on fractional ideals) 67.J group-theoretic approach 215.C group variety 13.B 16.H algebraic 13.B group velocity 446 growth, infra-exponential 125.AA Grunsky inequality 438.B Gudermann function (Gudermannian) 131.F, App. A, Table 16.m guide, wave 130.B Guignard constraint qualification 292.B Gupta-Bleuler formalism 150.G Gysin exact sequence (of a fiber space) 148.E Gysin homomorphism 201.0 Gysin isomorphism, Thom- 114.G of a fiber space 148.E H Hp (Hardy spaces) 168.B H'(n) (Sobolev spaces) 168.B Hb(n) (Sobolev spaces) 168.B h-cobordant oriented manifolds 114.1 h-cobordism group of n-dimensional homotopy spheres 114.1, App. A, Table 6.1 h-cobordism theorem 114.F H-function 402.B H-series, principal 437,X H-space 203.0 H-theorem 402.B (H, p)-summable 379.M H-closed space 425.U Haag-Araki axioms 150.E Haag-Kastler axioms 150.E Haag-Ruelle scattering theory 150.0 Haag theorem 150.C Haar condition (on best approximation) 336.B Haar measure left-invariant 225.C right-invariant 225.C Haar space 142.B Haar system of orthogonal functions 317.C Haar theorem 225.C Hadamard estimation App. A, Table 8 Hadamard formula, Cauchy- 339.A Hadamard gap theorem 339.0 Hadamard multiplication theorem 339.0 Hadamard theorem (on meromorphic functions) 272.E (on singularities of power series) 339.0 
2007 Hadamard three-circle theorem 43.E hadrons 132.B Haefliger structure 154.E c- 154.E Hahn-Banach (extension) theorem (in a normed space) 37.F (in a topological linear space) 424.C half Bessel function 39.B half-exact (functor) 200.1 half-life 13 2.A half-line 155.B closed (in affine geometry) 7.0 half-periodic solution (of Hill equation) 268.E half-plane 155.B 333 A half-space of an affine space 7.0 closed (of an affine space) 7.0 principal (of a flag) 139.B Siegel upper (of degree n) 32.F supporting (of a convex set) 89.A half-spinor (even, odd) 61.E half-spin representation (even, odd) 61.E half-trajectory negative 126.0 positive 126.0 half-width 132.A Hall subgroup 151.E Hallstrom-Kametani theorem 124.C Halmos theorem, von Neumann- 136.E Hamburger moment problem 240.K Hamilton canonical equation 271.F Hamilton-Cayley theorem 269.F Hamilton differential equation 324.E Hamilton group 151.B Hamiltonian 271.F 351.0442.0 bilinear 377.A cluster decomposition 375.F free 351.0 Hamiltonian function 219.C 271.F Hamiltonian operator 351.0 Hamiltonian system 126.L Hamiltonian vector field 126.L 219.C Hamilton-Jacobi differential equation 271.F 324.E Hamilton-Jacobi equation 108.B Hamilton path 186.F Hamilton principle 441.B Hamilton quaternion algebra 29.B Hammerstein integral equation 217.M Hamming bound (of a code) 63.B Hamming code 63.C Hamming distance 63.B 136.E handle 41O.B attaching 114.F Casson 114.K manifold with 114.F handlebody 114.F Hankel asymptotic representation App. A, Table 19.m Hankel determinant 142.E Hankel functions 39.B, App. A, Table 19.m Hankel transform 220.B Hansen, general connectedness theorem due to Fulton and 16.1 Hansen- Bessel formula App. A, Table 19.1II hard, NP- 71.E hard Lefschetz theorem 450.Q hardware 75.C Hardy class 43.F 159.0 generalized 164.E Hardy inequality App. A, Table 8 Subject Index Hasse norm-residue symbol, Hilbert- Hardy-Littlewood-Sobolev inequality 224.E Hardy-Littlewood supremum theorem App. A, Table 8 Hardy-Littlewood theorem on bounded functions 43.E on trigonometric systems 317.B Hardy space 168.B Hardy theorem on bounded functions 43.E on the Cauchy product of two series 379.F harmonic (form) 194.B (function) 193.A (function on a state space) 260.0 (mapping) 195.B harmonically separated points (in a projective space) 343.0 harmonic analyzer 19.E harmonic analysis 192 harmonic boundary 207.B harmonic conjugates 343.0 harmonic continuation 193.M 198.G harmonic differential (on a Riemann surface) 367.H harmonic dimension (of a Heins end) 367.E harmonic flow 193.K harmonic functions 193 conjugate 193.C spherical 193.C harmonic integrals 194 harmonic kernel function 188.H harmonic majorant (of subharmonic function) 193.S harmonic mapping 195 harmonic mean (of a distribution) 397.C (of a function) 211.C (of numbers) 211.C harmonic measure inner 169.B outer 169.B harmonic motion, simple 318.B harmonic oscillation 318.B harmonic range of points 343.0 harmonics ellipsoidal 133.B ellipsoidal, of four species 133.C solid 393.A spherical 193.C 393.A surface 393.A tesseral 393.D zonal 393.0 Harnack condition (on the O-integral) 100.E Harnack first theorem 193.1 Harnack lemma 193.1 Harnack second theorem 193.1 Hartogs continuation theorem 21.F Hartogs-Osgood theorem 21.H Hartogs theorem of continuity 21.H of holomorphy 21.C Hartshorne conjecture 16.R Harvey duality, Martineau- 125.Y hashing 96.B Hasse character, Minkowski- (of a nondegenerate quadratic form) 348.0 Hasse conjecture 450.s Hasse function, Artin- 257.H Hasse invariant (of a central simple algebra over a p-adic field) 29.G Hasse norm-residue symbol, Hilbert- 14.R 
Subject Index Hasse principle Hasse principle 348.G Hasse theorem, Minkowski- (on quadratic forms over algebraic number fields) 348.G Hasse- Witt matrix 9 E Hasse zeta function 450.S Haupt theorem 268.E Hauptvermutung (in topology) 65.C 70.C Hausdorff axiom 425 Q Hausdorff dimension 117.G 234.E 246.K Hausdorff formula, Campbell- 249.R Hausdorff measure 169.0 Hausdorff moment problem 240.K Hausdorff space 425.Q collectionwise 425.AA Hausdorff theorem, Baire- 273.J 425.N Hausdorff topological group 423.B Hausdorff uniform space 436.C Hausdorff-Young inequality 224.E Hausdorff- Young theorem 317.B Hawaiian earring 79.C hazard function 397.0 hazard rate 397.0 heat Joule 130 B specific, at constant pressure 419.B specific, at constant volume 419.B heat bath 419.B heat conduction, equation of 327 A heat equation 327.A, App. A, Table 15.VI heat gain 419 A heat loss 419.A Heaviside calculus 306 A Heaviside function 125.E 306.B, App. A, Table 12.II Hecke algebra 29 C 32 0 global 450.0 Hecke character 6.0 Hecke L-function 450.E with Grossencharakter 450 F Hecke operator 32.0 zeta function defined by 450.M Hecke ring 32.0 Heegaard decomposition 65.C heigh t (of an algebraic number) 430.B (of an element in a lattice) 243 F (of an ideal) 67.E (of an isogeny) 3 F (of a lattice) 243.F (of a prime ideal) 67.E infinite (element of an Abelian p-group) 2.0 Heilbronn phenomenon, Oeuring- 123.0 Heine-Borel theorem 273.F Heine series 206.C Heins end 367 E Heisenberg equation of motion 351 0 Heisenberg picture 351.0 Heisenberg uncertainty relation 351 C helicity 258.C helicoid ordinary 111 I right 111 I helicoidal surface 111.1 Helinger-Hahn theorem 390 G helix generalized II1.F ordinary II1.F Helly theorem 94.B Helmholtz differential equation 188.0, App. A, Table 15.VI Helmholtz free energy 419 C 2008 minimum principle 419.C Helmholtz theorem (on vector fields) 442.0, App. A, Table 3.m Helmholtz vorticity theorem 205.B Helson set 192 R hemisphere northern 140 southern 140 Henselian ring 370 C Henselization 370 C Hensellemma 118.C Hensel ring 370.C Herbrand lemma 200.N Herbrand quotient 200.N hereditarily normal space 425.Q 1-.""..""...1;+.......1., .....n......+.""....+ .............__.......". /1'1, r"':.. u\."H.uHaluy y'UVUvUl lUapp.l.l.ll5 ""t":::'J.U hereditarily weak topology 425.M hereditary finite set 33.B hereditary ring 200.K left 200.K right 200.K Herglotz integral representation 43.1 Herglotz theorem 192.B Hermite differential equation App. A, Table 14 II Hermite differential equation, Weber- 167.C Hermite formula, Oauss- (in numerical integration) 299.A Hermite interpolation polynomial 223.E Hermite polynomials 317.0, App. A, Table 20JV Hermite polynomials, F ourier- 176.1 Hermitian (integral operator) 251.0 Hermitian form 256.Q 348 F anti- 256.Q a- 60.0 indefinite 348.F negative definite 348.F positive definite 348.F semidefinite 398.F skew- 256.Q v- 384.A Hermitian homogeneous space, complex 199 A Hermitian hyperbolic space 412.0 Hermitian inner product 256.Q Hermitian kernel 217 H Hermitian linear space 256.Q Hermitian matrix 269.1 anti- 269.1 skew 269.I Hermitian metric 232 A Hermitian operator 251.E Hermitian space irreducible symmetric 412.E symmetric 412 E Heron formula (for plane triangles) App. A, Table 2.II (for spherical triangles) App. A, Table 2.m Hersch and Pfluger, extremal length defined by 143.A Hersch problem 391.E Hessenberg method 298.0 Hesse normal form (of a hyperplane) 139.H Hessian (on a differential manifold) 279.B,F (form) 226.0 (of a plane algebraic curve) 9.B heterogeneity, design for two-way elimination of 102.K heuristic algoritl'm 215 E Hewitt-Savage zero-one law 342.0 hexagon 155.F 
2009 hexagonal (system) 92.E hexahedron 357.B Hey zeta function 27.F hidden variable theories 351.L hierarchy 356.H analytic 356.H arithmetical 356.H arithmetical, of degrees of recursive unsolvability 356.H C-analytic 356.H C-arithmetical 356.H Chomsky 31.0 hyperarithmetical, of degrees of recursive unsolvability 356.H hierarchy theorem 356.H Higgs mechanism 132.0 higher algebraic K -theory 237.1 higher differentiation (in a commutative ring) 113 higher order, of (for infinitesimals) 87.G higher-order derivative (of a differentiable function) 106.0 higher-order ordinary differential equation App. A, Table 14.1 higher-order partial derivative 106.G higher-order predicate logic 411.K higher-transcendental function 389.A highest weight (of a representation of a Lie algebra) 248.W high-precision computation 138.B Hilbert, O. 196 Hilbert basis theorem 284.A Hilbert characteristic function (of a coherent sheaf) 16.E (of a graded module) 369.F Hilbert cube 382.B Hilbert 8-operator 411.J Hilbert 8-quantifier 41l.J Hilbert 8-symbol 41l.J Hilbert fifth problem 423.N Hilbert-Hasse norm-residue symbol 14.R Hilbertian space, countably 424.W Hilbert inequality App. A, Table 8 Hilbert invariant integral 46.C Hilbert irreducibility theorem (on polynomials) 337.F Hilbert manifold 105.Z 286.K Hilbert modular form of dimension - k 32.G of weight k 32.G Hilbert modular function 32.G Hilbert modular group 32.0 Hilbert modular surface 15.H Hilbert norm-residue symbol 14 R Hilbert NulIstelIensatz 369.0 Hilbert polynomial (of an algebraic curve) 9.F (of a coherent sheaf) 16.E (of a graded module) 369.F Hilbert problem (in calculus of van at ions) 46.A Hilbert problem, Riemann- (for integral equations) 217.1 (for ordinary differential equations) 253.0 Hilbert scheme 16.S Hilbert-Schmidt class 68.1 Hilbert-Schmidt expansion theorem 217.H Hilbert-Schmidt norm 68.1 Hilbert-Schmidt type integral operator of 68.C kernel of 217.1 Hilbert spaces 197 Subject Index Holomorphic adjoint 251.E channel 375.F complex 197.B exponential 377.0 physical 150.G pre- 197 B real 197.B rigged 424.T Hilbert-Speiser theorem 172.J Hilbert system of axioms (foundations of geometry) 155.B Hilbert syzygy theorem 369.F Hilbert theorem 90 172.J Hilbert transform 160.0 220.E Hilbert zero point theorem 369.0 Hilferty approximations, Wilson- 374.F Hill determinant 268.B Hill determinantal equation 268.B Hill differential equation 268.B Hille-Yosida theorem 378.B Hill function 268.E Hill method of solution 268.B Hirsch theorem, Leray- 201.J Hirzebruch index theorem (for differentiable manifolds) 56.G Hirzebruch signature theorem (on algebraic surface) 72.K Hirzebruch surface 15.0 Hirzebruch theorem of Riemann- Roch type 366.B histogram 397.B Hitchcock method 301.E hitting probability 5.0 hitting time 260.B 261.B 407.B Hlawka theorem, Minkowski- 182.0 Hochschild cohomology group 200.L Hodge conjecture 450.S Hodge index theorem 15.0 Hodge manifold 232.0 Hodge metric 232.0 Hodges- Lehmann theorem 399.E,H Hodge spectral sequence 16.U Hodge structure (of a vector space) 16.V mixed 16.V polarized 16.V Hodgkin-Huxley differential equation 291.F hodograph method 205.B hodograph plane 205.B hold almost everywhere (in a measure space) 270.0 hold at almost all points in a measure space 270.0 Holder condition of order IX 84.A Holder inequality 211.C, App. A, Table 8 Holder integral inequality 211.C Holder method of order p, summable by 379.M Holder sequence, Jordan- (in a group) 190.0 Holder space 168.B Holder theorem 104.F Holder theorem, Jordan- (in group theory) 190.G Holder theorem, Jordan- (on representations of algebras) 362.0 hole theory, Oirac 415.0 Holmgren type theorem (of Kashiwara-Kawai) 125.00 Holmgren uniqueness theorem 321.F holohedral 92.B holohedry 92.B holomorphic (family oflinear operators) 331.C (function) 198.A (in the sense of Riemann) 21.C 
Subject Index Holomorphically complete domain (vector-valued function) 37.K holomorphically complete domain 21.F holomorphically complete space 23.F holomorphically convex domain 21.H holomorphic automorphism 21.J holomorphic differential (on a Riemann surface) 367.H holomorphic differential form of degree k 72 A holomorphic distribution (with respect to a param- eter) 125.H holomorphic evolution operator 378.1 holomorphic foliation 154.H holomorphic function(s) 198 (on a complex manifold) n.A (of many variables) 21.A,C germ of a 21.E sheaf of germs of 23.C 383.0 holomorphic functional calculus 36 M holomorphic hull 21.H holomorphic k-form n.A holomorphic local coordinate system 72.A holomorphic mapping 21.J (of a complex manifold) n.A nondegenerate (between analytic spaces) 23.C holomorphic microfunction 274.F holomorphic modification (of an analytic space) 23.0 holomorphic part (in a Laurent expansion) 198.0 holomorphic sectional curvature 364.0 holomorphic semigroup 378.0 holomorphic tangent vector 72.A holomorphic vector field 72.A holomorphy domain of 21.F envelop of 21 F Hartogs theorem of 21.C holonomic (coherent tff-module) 274.H holonomic systems with regular singularities 274.H simple 274.H holonomy 154.C holonomy group 80.0 154.C 364.0 homogeneous 364 E restricted 80.0 364.E restricted homogeneous 364.E holonomy homomorphism 154.C linear 154.C holosymmetric class 92.B homentropic flow 205.B homeomorphic 425.G homeomorphism 425.G minimal 136.H PL 65 A strictly ergodic 136.H uniquely ergodic 136.H homeomorphism problem 425.G homoclinic point 126.1 transversal 126 J homogeneous (A-submodule) 200.B (boundary value problem) 315 B (difference equation) 104.C (lattice) 182.B (linear ordinary differential equation) 252.A (system oflinear differential equations of the first order) 252 G spatially (process) 261.A temporally (additive process) 5.B temporally (process) 261.A weighted (analytic function) 418.0 2010 homogeneous bounded domain 384.A 412.F homogeneous coordinate ring 16.A homogeneous coordinates 343.C homogeneous difference equation 104.C homogeneous element of a graded ring 369.B of a homogeneous ring 369.B homogeneous equations, system of linear 269.M homogeneous holonomy group 364.E restricted 364.E homogeneous hypersurface 344.A homogeneous ideal of a graded ring 369.B of a polynomial ring 369.B homogeneous integral equation 217.F homogeneous Lorentz group 258.A 359 homogeneously regular 275.C homogeneous Markov process 5.H 261.A homogeneous n-chain (for a group) 200.M homogeneous ordinary differential equation App. A, Table 14.1 of higher order App. A, Table 14.1 homogeneous part (of a formal power series) 370.A homogeneous polynomial 337.B homogeneous ring 369.B homogeneous Siegel domain, irreducible 384.E topology of Lie groups and 427 homogeneous space(s) 199 249.F 362.B complex Hermitian 199.A Kahler 199.A linearly connected 199.A reductive 199.A Riemannian 199.A symmetric 412.B symmetric Riemannian 412.B homogeneous turbulence 433.C homological algebra 200 relative 2oo.K homological dimension of a module 2oo.K of a topological space 117.F homological functor 200.1 homological mapping 2oo.C homologous 201.B homologous to zero 198.B homology 200.H intrinsic 114.H homology basis, canonical 11.C homology class 200.H fundamental 201.N fundamental, around K 201.N q-dimensional 201.B homology exact sequence 201.L (of fiber space) 148.E reduced 201.F homology group(s) (of a chain complex) 201.B (ofa group) 200.M (of a Lie algebra) 200.0 (of a polyhedron) 201.0 (of a simplicial complex) 201.G absolute 201.L tech 201.M cellular 201.F,G with coefficients in G 201.G integral 201.C,0 integral singular 201.E local 201.N red uced 201. E relative tech 201.M 
2011 relative singular 201 L simplicial 201.D singular 201 G,L,R homology manifold 65 B homology module 200.C homology theory 201 generalized 201.Q generalized, with E-coefficient 202.T uniqueness theorem of 201.Q homomorphic (algebraic system) 409 C (groups) 190.D (topological groups) 423.1 order (ordered sets) 311.E homomorphic image (of a measure-preserving transformation) 136.D homomorphism (of Abelian varieties) 3.C (of algebraic systems) 409.C (of fields) 149.B (of groups) 190.D (of lattices) 243.C (of Lie algebras) 248.A (of linear representations) 362.C (of pres heaves) 383.A (of rings) 368.D (of sheaves) 363 B A- (of A-modules) 277 E A-, of degree p (of graded A-modules) 200.B admissible (of n-groups) 190 E algebra 29.A allowed (of A-modules) 277.E analytic (of Lie groups) 249.N anti- (of groups) 190.D anti- (of rings) 368 D bialgebra 203 G Bokshtein 64.B boundary (on homology groups) 201.L boundary (in homotopy exact sequences) 202.L CW-(between Lie groups) 249.N canonical (on direct products of rings) 368.E coalgebra 203 F coboundary (on cohomology groups) 201.L connecting (in homology) 200.C 201 L connecting (on homology groups) 201 C continuous (of topological groups) 423.J crossed (of an associative algebra) 200 L dual (of a homomorphism of algebraic tori) 13D dual (of lattices) 243 C edge 200.1 equivariant J- 431.F generalized Hopf 202.V Gysin 2010 holonomy 154 C Hopf algebra 203 H Hurewicz 202 N induced by a continuous mapping (between homotopy groups) 202.K J- (in homotopy theory) 202 V J- (in K-theory) 237.1 Jordan (of Jordan algebras) 231.A lattice- 243.C local (of a topological group) 423.0 module of (of modules) 277 B module of A- (of A-modules) 277 E n- (of n-groups) 190.E open continuous (of topological groups) 423.1 operator (of A-modules) 277.E Subject Index Homotopy set operator (of n-groups) 190 E order 311.E rational (of Abelian varieties) 3 C rational (of algebraic groups) 13.A ring 368.D *- 36.F Umkehr 201.0 unitary (of rings) 368.D zero (of two A-modules) 277.H homomorphism theorem on groups 190.D on Lie algebras 248.A on topological groups 423.J on topological linear spaces 424,X homothetic correspondence (between surfaces) 111.1 homothety in conformal differential geometry 110 D in Euclidean geometry 139.B homotopic 154.E,F 202 B chain (chain mappings) 200.C integrably 154.F null- (continuous mapping) 202 B regularly (immersions) 114.0 relative to a subspace 202.B to zero 202.B homotopy 202.B composite 202.B free 202.B linear 114.0 restricted 202 B homotopy-associative (multiplication) 203.D homotopy category of topological spaces 52.B homotopy chain 200 C homotopy class 202.B compact 286.D homotopy cochain 200.F homotopy commutative (muJtiplication) 203 D homotopy equivalence 202 F simple 65.C weak 202.F homotopy equivalent systems (of topological spaces) 202. F homotopy exact sequence 202.L of a fiber space 148 D of a triad 202.M of a triple 202.L homotopy extension property 202.E homotopy group(s) 202 J algebraic 16.U of a compact connected Lie group App A, Table 6.VII realization theorem of 202.N of a real Stiefel manifold App A, Table 6 VII relative 202.K of a sphere App. A, Table 6 VI stable 202.T, App. A, Table 6.VII stable (of classical groups) 202 V stable (of the k-stem) 202.U stable (of Thom spectrum) 114.G of a triad 202.M homotopy identity (of an H-space) 203.D homotopy invariance (of a homology group) 201.D homotopy invariant 202.B homotopy inverse (for an H-space) 203.D homotopy n-spheres group of 114 I h-cobordism group of 114.1 homotopy operations 202.0 homotopy set 202.B 
Subject Index Homotopy sphere homotopy sphere 65.C homotopy theorem first (in obstruction theory) 305.B second (in obstruction theory) 305.C simple 65.C third (in obstruction theory) 305.C homotopy theory 202 de Rham 114.L homotopy type 202.F (of a link) 235.D spherical G-fiber 431.F homotopy type invariant 202.F Hooke law 271 G Hopf algebra(s) 203 dual 203.C elementary 203.D graded 203.C Hopf algebra homomorphism 203.H Hopf bifurcation 126.M Hopf bundle 147.E Hopf classification theorem 202.1 Hopf co multiplication 203 D Hopf coproduct 203. D Hopf extension theorem 270.E Hopf fibering 147.E Hopf homomorphism, generalized (of homotopy groups of spheres) 202.U Hopf integrodifferential equation, Wiener- 222.C Hopfinvanant 202.U generalized 202.Q modulo p 202.S Hopf mapping (Hopf map) 147.E Hopf surface 72.K Hopf theorem (continuous vector field) 153.B Hopf weak solution 204.C horizon, event 359.F horizontal components of a homogeneous space 110.A of a vector field 80.C horizontal slit mapping, extremal 367.G horizontal subs pace 191.C horizontal vector (in a differentiable principal fiber bundle) 80.C Hormander theorem 112.C,D horned sphere, Alexander 65.0 Horner method 301.C horocycle flow 136.0 horosphere 218.0 horseshoe diffeomorphism 126.J Hosokawa polynomial 235.D Hotelling T 2 statistic 280.B noncentral 374.C Householder method 298.D Householder transformation 302.E Hugoniot relation, Rankine- 204.G 205.B Hukuhara theorem, Dini- 314.0 Hukuhara problem 315.C hull closed convex 424.H convex 89.A convex (in an affine space) 7.D convex (of a boundary curve) 275.A convex (in linear programming) 255.D holomorphic 21.H hull-kernel topology 36.D human death and survival, model of 214.A Hunt process 261.B Hunt-Stein lemma 4OO.F 2012 Hurewicz homorphism 202.N Hurewicz isomorphism theorem 202.N Hurewicz-Steenrod isomorphism theorem 148.D Hurewicz theorem, generalized 202.N Hurewicz-Uzawa gradient method, Arrow- 292.E Hurwitz formula, Riemann- (on coverings of a nonsingular curve) 9.1 Hurwitz relation (on homomorphisms of Abelian varieties) 3.K Riemann- 367.B Hurwitz theorem 10.E Hurwitz zeta function 450.B Huxley differential equation, Hodgkin- 291.F Huygens principle 325.B 446 in the wider sense 325.D hybrid computer 19.E hydrodynamics 205 hydromagnetic dynamo theory 259 hydro magnetics 259 hydrostatics 205.A hyperalgebra 203.1 hyperarithmetical function 356.H hyperarithmetical hierarchy of degrees of recursive unsolvability 356.H hyperarithmetical predicate 356.H hyperbola 78.A conjugate 78.E equilateral 78.E rectangular 78.E hyperbolic (closed invariant set of a dynamical system) 126.1 (differential operator) 112.A 325 H (linear mapping) 126.G (partial differential equation) 325.A,E (Riemann surface) 367.D,E (simply connected domain) 77 B (space form) 412.H complete 21.0 regularly 325.A,F in the sense of Oilrding 325.F in the sense of Petrovskii 325.F in the strict sense 325.F strongly 325.H symmetric (in the sense of Friedrichs) 325.G weakly 325.H hyperbolically embedded 21.0 hyperbolic closed orbit 126.0 hyperbolic coordinates equilateral 90.C, App. A, Table 3.V rectangular 90.C hyperbolic cosecant 131.F hyperbolic cosine 131.F hyperbolic cotangent 13l.F hyperbolic cylinder 350.B hyperbolic cylindrical coordinates App. A, Table 3.V hyperbolic cylindrical surface 350.B hyperbolic differential equations, system of (in the sense of Petrovskii) 325.0 hyperbolic-elliptic motion 420.D hyperbolic fixed point 126.0 hyperbolic function 13l.F hyperbolic geometry 285.A hyperbolic knot 235.E hyperbolic manifold 21.0 235.E hyperbolic motion 420.D hyperbolic-parabolic motion 420.D 
2013 hyperbolic paraboloid 350.B hyperbolic plane 122 C hyperbolic point (on a surface) 111.H hyperbolic quadric hypersurface 350.1 hyperbolic secant 13l.F hyperbolic sine 131.F hyperbolic singular point 126.0 hyperbolic space 285.C Hermitian 412.G quaternion 412.0 real 412.G hyperbolic spiral 93.H hyperbolic tangent 13l.F hyperbolic transformation 76.F hyperbolic type, partial differential equation of 321.E 325 hyperbolic type, primitive 92.C hyperboloidic position 350.B hyperboloid of one sheet 350.B hyperboloid of revolution of one or two sheets 350.B hyperboloid of two sheets 350.B hypercohomology 200.1 hyperconstructive ordinal 81.E hypercubic type, primitive n.c hyperelliptic curve 9.D hyperelliptic integral I1.C hyperelliptic Riemann surface 11.C hyperelliptic surface 72.K hyperfinite 293.B 308.1 hyperfunction 125 in the Oirichlet problem 120.C exponentially decreasing Fourier 125.BB Fourier 125.BB Fourier ultra- 125.BB modified Fourier 125.BB Sato 125.V hypergeometric differential equation 260.A, App. A, Table 18.1 confluent 167.A, App. A, Tables 14.II,19J Oauss App. A, Table 14.II hypergeometric distribution 341 D 397.F, App. A, Table 22 multidimensional App. A, Table 22 multiple 341.D hypergeometric function(s) 209, App. A, Table 18.1 Appell, of two variables 206.D, App. A, Table 18.1 Barnes extended 206.0, App. A, Table 18.1 of confluent type 167.A, App. A, Table 19.1 of the hyperspherical differential equation 393.E with matrix argument 206 E and spherical functions App. A, Table 18 hypergeometric integral 253.B hypergeometric series 206.A hypergeometric type, special function of 389.A hypergroup 190.P hypergroupoid 190.P hyperinvariant (under an operator) 251.L hyperplanar symmetry (of an affine space) 139.B hyperplane(s) in an affine space 7.A characteristic (of a partial differential equation of hyperbolic type) 325.A at infinity (in affine geometry) 7.B pencil of (in a projective space) 343.B in a projective space 343.B Subject Index Hypotrochoid regression 403.D tangent (of a quadric hypersurface) 343.E hyperplane coordinates of an affine frame 7.C in projective geometry 343.C hyperplane section 418.1 hyper quadric in an affine space 350.0 in a projective space 343.D 350.1 hypersonic flow 205.C hypersphere 76.A imaginary 76.A limiting (in hyperbolic geometry) 285.C non-Euclidean 285.C oriented real 76.A point 76.A proper (in hyperbolic geometry) 285.C real 76.A hypersphere geometry 76.A hyperspherical coordinates, (n + 2)- 76.A 90.B hyperspherical differential equation 393.E hypersurface(s) (of an algebraic variety) 16.A (in a Euclidean space) 111.A central quadric 350.G characteristic (of a partial differential equation of hyperbolic type) 325.A coordinate (in a Euclidean space) 90.C elliptic quadric 350.0 homogeneous 344.A hyperbolic quadric 350.0 integral (partial differential equations) 320.A noncentral quadnc 350.0 nondegenerate 344.A parabolic quadric 350.G pencil of quadric 343.E properly (n - I)-dimensional quadric 350.G quadric 343.0350.0,1 quadric conical 350.Q quadric cylindrical 350.Q regular quadric 343.E singu]ar quadric (of the hth species) 343.E spherical real 344.C hypersurface element(s) 82.A 324.B union of 82.A hypocontinuous (bilinear mapping) 424.Q hypocylcoid 93.H hypo-Dirichlet 164.B hypoelliptic 112.D 189.C 323.1 analytically 112.D 323.1 hypofunction (in the Dirichlet problem) 120.C hyponormal 251.K hypothesis alternative 400.A composite 4OO.A continuum continuum hypothesis ergodic 136.A 402.C general linear 400.H Lindelof 123.C null 400.A Riemann 450.B,P simple 400.A statistical 400.A Suslin 33.F hypothesis testing 401.C, App. A, Table 23 statistical 400, App. A, Table 23 hypothetical infinite population 397.P hypotrochoid 93.H 
Subject Index i-genus i-genus 15.E l-adic topology (of a ring) 16.X ith component (of an n-tuple) 256.A,C ith coordinate 256.C ith coordinate axis (of a Euclidean space) 140 icosahedral group 151.G icosahedron 357 B ideal(s) (of an algebra) 29.A (of an algebraic number field) 14.B (of a lattice) 42.C (of a Lie algebra) 248.A (of a ring) 368 F Abelian (of a Lie algebra) 248.C Alexander (of a knot) 235 C ambig (of a quadratic field) 347.F conjugate (of a fractional idea]) 14 I defining (of a formal spectrum) 16X differential (of a differential ring) 113 differential (on a real analytic manifold) 428.E elementary 235.C fractional (of an algebraic number field) 14.E graded 369.B homogeneous (ofa graded ring) 369.B homogeneous (of a polynomial ring) 369.B integral (of an algebraic number field) 14.C integral left 27.A integral right 27.A integral two-sided D- 27 A involutive differential 428.E largest nilpotent (of a Lie algebra) 248.D left (of a ring) 368.F left 0- 27.A maximal 67.C maximal (left or right) 368.F maximal (with respect to S) 67.C maximal, space (of a Banach algebra) 36.E minimal (left or right) 368 F mixed 284 D nilpotent (of a Lie algebra) 248.C order (of a vector lattice) 310 B p-primary 67.F primary 67.F prime 67.C prime (of a maximal order) 27.A prime differential (of a differential ring) 113 primitive (of a Banach algebra) 36.E principal 67 K principal (of an algebraic number field) 14.E principal, theorem (in class field theory) 59.D principal fractional 67.K pure 284.D right (of a ring) 368.F right 0, 27 A semiprime (of a commutative ring) 113 semiprime differential (of a differential ring) 113 sheaf of (of a divisor of a complex manifold) 72.F solvable (of a Lie algebra) 248.C two-sided (of a ring) 368.P two-sided 0- 27.A unmixed 284.D valuation (of a valuation) 439.B ideal boundary 207.A in the narrow sense 14.0 ideal class 14 E ideal class group 14.E 67.K 2014 ideal group modulo m* 14.H ideal point (in hyperbolic geometry) 285 C idele (of an algebraic number field) 6.C principal 6.C ide1e class 6.0 idele class group 6.D idele group 6.C idempotent element (of a ring) 368.B elementary 450.0 primitive 368.B idempotent law (in a lattice) 243.A idempotent measure 192.P idempotent set (of a ring) 368.B idempotent theorem 36.M identical1y true formula 411.G identification (in factor analysis) 280.0 identification space (by a partition) 425.L identified equation 128.C just 128.C over- 128.C identity (identities) 231.A Bianchi 80.1 417.B Bianchi first App. A, Table 4.n Bianchi second App. A. Table 4.n homotopy (of an H-space) 203.D Jacobi (on the bracket of two vector fields) 105.M Jacobi (in a Lie algebra) 248.A Jacobi (with respect to Whitehead product) 202.P in Jordan algebras 231.A Lagrange 252.K Parseval 18.B 159.A 160.C 192.K 197.C 220.B,C,E resolution of 390.D theorem of (of one variable) 198.C theorem of (of several variables) 21.C identity character (of an Abelian group) 2.0 identity component (of a topological group) 423 F identity element of an algebraic system 409.C of a field 149.A of a group 190.A of a local Lie group 423.L of a ring 368.A identity function 381 C identity mapping 381.C identity matrix 269.A identity operator (in a linear space) 37.C identity relation 102 I Ihara zeta function 450.U Ikehara-Landau theorem, Wiener- 123 B iII-conditioned (coefficient matrix in numerical solution of linear equations) 3020 image (of a group homomorphism) 190.D (of a linear mapping) 256.F (of a line in p3) 343.E (of a mapping) 381.C (of a morphism) 52.N (of an operator homomorphism) 277.E (of a sheaf homomorphism) 383.D closed (of a variety) 18.1 continuous 425.0 direct (of a sheaf) 383.G homomorphic (of a measure preserving transformation) 1360 inverse (of a set) 381.C inverse (of a sheaf) 383.0 inverse (of a uniformity) 436 E 
2015 perfect 425.CC perfect inverse 425.CC image measure 270.K imaginary axis 74.C imaginary field, totally 14 F imagmary hypersphere 76.A imaginary infinite prime divisor 439 H imaginary number 74.A purely 74.A imaginary part 74.A imaginary prime divisor 439.H imaginary quadratic field 347 A imaginary root (of an algebraic equation) 10.E imaginary transformation, Jacobi's 134.1, App. A, Table 16.111 imaginary unit 74.A 294 F imbedded Markov chain 260.H imbedding 105 K imbedding principle 127.B immersed submanifold (of a Euclidean space) 11IA immersion (of a Coo -manifold) 105.K (of a Riemann surface) 367.0 branched minimal 275.B generalized minimal 275.B isometric 365.A Kiihler 365.L minimal 275.A minimum 365 0 tight 365.0 totally rea] 365.T imperfect field 149 H implication 411 B strict 4ll.L implicit (difference equation in a multistep method) 303.E (Runge-Kutta method) 303 D implicit enumeration method 215.0 implicit functions 165.C 208 implicit function theorem 208.A 286.0 (in Banach algebras) 36.M (in locally convex spaces) 286.J global 208.0 Nash-Moser 286.J implicit method 303 E implicit scheme 304.F impossible construction problem 179.A impossible event 342.B impredicative (object) 156.B imprimitive (permutation group) 151 H improper integral 216.D,E convergent 216.E divergent 216.E improper Riemann integral 216.E improvement, iterative 302.C impulse control 405.E impulse function 306.B, App. A, Table 12 II imputation 173.0 imputed costs 292.B imputed prices 292.C IMT formula 299.B inaccessible, strongly 33.F inaccessible cardinal number strongly 33.E weakly 33 E inaccessible ordinal number strongly 312.E weakly 312.E Subject Index Independent Ince definition 268.D Ince-Goldstein method 268.C incidence matrix of a block design 102.B of a graph 186.G incidence number 146 B 201.B incidence relation 282 A inclination, curve of constant II1.F inclination problem, critical 55.C inclusion 381 C inclusion mapping 381.C incoming wave operator 375.B incompatible system (of partial differential equations) 428.B incomplete beta function App. A, Table 17.1 incomplete blocks 102.B incomplete eIJiptic integral of the first kind 134.B incomplete factorization 302.C incomplete gamma function 174 A, App. A, Table 17.1 incompleteness theorem, Oodel 156.E first 18S.C second 185 C incompressible (measurable transformation) 136.C incompressible fluid 20S.B inconsistent problem (of geometric construction) 179.A inconsistent system (of algebraic equations) 10.A increasing (sequence function or distribution) monotone 166.A 380.B monotonically 87.B non- 166.A slowly (COO-function) 125.0 slowly (distribution) 125.N slowly (in the sense of Deny) 338.0 slowly, sequence 168.B strictly 166.A strictly monotone 166.A increasing directed set 308.A increasing process 406.B integrable 406.B increment of a function 106.B process with independent 5.B increment function 380.B Ind (large inductive dimension) 117.B ind (small inductive dimension) 117.B indecomposable A-module 277.1 indecomposable continuum 79.0 indecomposable group 190 L indecomposable linear representation 362.C indecomposable vector bundle 16.Y indefinite O-integral 100.0 indefinite Hermitian form 348.F indefinite integra] 198.B in Lebesgue integral 221.0 in Riemann integral 216.C indefinite quadratic form 348.C indefinite sum (of a function) 104 B in degree 186 B independence, number of 186.1 independence theorem (on valuations) 439.0 independent (axioms) 35.B (complexes) 70.B (differential operators) 324.C (events) 342.B (frequency) 126.L (partitions) 136.E (points) 7 A 
Subject Index Independent increments, process with (random variables) 342.C algebraically 149.K 369.A analytically (in a complete ring) 370.A a- (partitions) 136 E linearly 2.E 256.C,E 277.0 path 346.0 independent increments, process with 5.B independent of the past history 406.D independent process 136.E independent set 66.0 186.1 independent system, maximal (of an additive group) 2.E independent variable 165.C independent vector 66.F indeterminacy, set of points of (of a proper meromorphic mapping) 23.0 indeterminate 369.A in the algebraic sense 337.C indeterminate coefficients, Lagrange's method of 106.L indeterminate form, limit of 106.E indeterminate system (of algebraic equations) 10.A Indga 297.0 index (of a central simple algebra) 29.0 (of a critical point) 279.B,E 286.N (ofa divisor) 3.0 (of an a-Hermitian form) 60.0 (of an eigenvalue) 217.F (of a Fredholm mapping) 286.E (of a Fredholm operator) 68.F 251 D (of a manifold) 56.0 (of a number) 297.G (of an orthogonal array) 102.L (of a quadratic form) 348.E (of a recursive function) 356.F (of the Riemann-Hilbert problem) 217.1 (of a stable distribution) 341.0 (of a stable process) 5.F (of a subgroup) 190.C analytic (of an elliptic complex) 237.H analytic (of an elliptic differential operator) 237.H contravariant (of a component of a tensor) 256.1 covariant (of a component of a tensor) 256.J cycle 66.E deficiency (of a closed symmetric operator) 251.1 deficiency (of a differential operator) 112.1 degeneracy 17.C differential (in a covering of a nonsingular curve) 9.1 dummy (of a tensor) 256.1 fixed-point (of a continuous mapping) 153.B of inertia (of a quadratic form) 348.E Keller-Maslov 274.C Kronecker 201.H multi- 112.A o-speciality (of a divisor) 9.F p- (on a central simple algebra) 29.0 ramification (of an algebroidal function) 17.C ramification (of a finite extension) 2570 ramification (ofa prime ideal) 14.1 ramification (of a valuation) 439.1 ramification, relative (of a prime ideal) 14.1 of relative nullity 365.0 Schur (of a central simple algebra) 29.E Schur (of an irreducible representation) 362.F 2016 of a singular point (of a continuous vector field) 153.B speciality (of a divisor) 9.C 15.D topological (of an elliptic complex) 237.H of total isotropy (of a quadratic form) 348.E indexing set (of a family of elements) 381.D index set (of a balanced array) 102.L (of a family) 165.0 381.D index theorem Atiyah-Singer 237.H for differentiable manifolds 56.G Hirzebruch (for differentiable manifolds) 56.0 of Hodge 15.D Morse 279.F Indian mathematics 209 indicator function modified 341.C of a subset 342.E indicatrix Dupin 111.H spherical (of a space curve) I11.F indicial equation 254.C indirect least squares method 128.C indirect transcendental singularity (of an analytic function) 198.P indiscrete pseudometric space 273.B indiscrete topology 425.C individual 411.H individual domain 411.H individual ergodic theorem 136.B individual risk theory 214.C individual symbol 411.H individual variables 411.H indivisibilis 265 induced (Cartan connection) 80.0 (unfolding) 51.D induced bundle 147.G induced homomorphism 202.K induced module 277.L induced representation (of a finite group) 362.G (of a unitary representation) 437.0 induced topology 425.1 induced von Neumann algebra 308.C induction (of a von Neumann algebra) 308.C axiom of mathematical 294.B complete 294.B double mathematical 294.B magnetic 130.A mathematical 294.B multiple mathematical 294.B transfinite (in a well-ordered set) 311 C induction equation 259 inductive dimension small 117.B large 117.B inductive limit (in a category) 21O.D (of an inductive system of sets) 210.B (of a sequence of topological spaces) 425.M (of sheaves) 383.1 strictly (of a sequence of locally convex spaces) 424.W inductive limit and projective limit 210 inductive limit group 210.C inductive limit space 210.C inductively ordered set 34.C 
2017 inductive system (in a category) 210.D (of groups) 210.C (of sets) 21O.B of topological spaces 210.C inelastic (scattering) 375.A inequality (inequalities) 211, App. A, Table 8 absolute 21l.A Bell's 351.L Bernshteln (for trigonometric polynomials) 336.C Bessel 197.C Bhattacharyya 399.D Bunyakovskil 211.C, App. A, Table 8 Carleman App. A, Table 8 Cauchy 211.C, App. A, Table 8 Cauchy-Schwarz 211 C, App. A, Table 8 Chapman-Robbins-Kiefer 399.D Chebyshev 342.C conditional 211.A correlation 212.A Cramer-Rao 399.D distortion 438.B energy 325.C Fisher 102.E FKO 212.A Oarding 112 G GHS 212.A GKS first 212.A GKS second 212.A Golden-Thompson 212.B Oriffiths's first 212.A Oriffiths's second 212.A Grunsky 438.B Hardy App. A, Table 8 Hardy- Littlewood-Sobalen 224.E Haussdorff- Young 224.E Hilbert App. A, Table 8 Holder 21l.C, App. A, Table 8 Holder integral 211.C isoperimetric 228.B Jordan App. A, Table 8 Klein 212 B Markov (for polynomials) 336.C maximal (maximal ergodic lemma) 136.B Minkowski 211.C, App. A, Table 8 Morse 279.o,E Peierls-Bogolyubov 212 B Powers-St0rmer 212.B quasivariational 440.D Riemann's period 3.L Riemann-Roch (on algebraic surfaces) 15.D Roepstorff-Araki-Sewell 402.0 Roepstorff-Fannes- Verbeure 402.G Schwarz 211.C stationary variational 440.B triangle 273.A variational, of evolution 440 C von Neumann 251.M Wirtinger App A, Table 8 Wolfowitz 399.1 Young 224 E, App. A, Table 8 inertia ellipsoid of 271.E index of (of an a quadratic form) 348.B law of 27l.A law of, Sylvester (on a quadratic form) 348.B moment of 271.E principal axis of 271.E principal moment of 271.E Subject Index Infinitesimal transformation product of 271.E inertia field (of a prime ideal) 14.K inertia group 14.K 257.D inertial system 271.D 359 inf(infimum) 311.B inference rule of 411.1 statistical 401 statistical, graphical method of 19.B inferior limit (of a sequence of real numbers) 87.C (of a sequence of subsets of a set) 270.C inferior limit event 342.B infimum (of an ordered set) 311.B (of a subset of a vector lattice) 310.C infinite, purely (von Neumann algebra) 308.E infinite branch (of a curve of class Ck) 93.G infinite cardinal number 49.A infinite classical group 147.1 202.V infinite continued fraction 83.A infinite determinant (in Hill's method of solution) 268.B infinite-dimensional complex projective space 56.C infinite-dimensional linear space 256.C infinite-dimensional normal space 117.B infinite-dimensional real projective space 56.B infinite Grassmann manifold 147.1 infinite group 190.C infinite height (element of an Abelian p-group) 2.0 infinite interval 355.C infinite lens space 91.C infinitely differentiable (function) 106.K infinitely divisible distribution 341.0 infinitely recurrent (measurable transformation) 136.C infinite matrix 269.K infinite order (of an element in a group) 190.C infinite orthogonal group 202.V infinite population 401.E infinite prime divisor 439.H imaginary 439.H real 439.H infinite product 379.0, App. A, Table 10.VI divergent 379.G infinite product expansion, Euler's 436.B 450.B infinite sequence 165.D infinite series 379.A, App. A, Table 1O.m infinite set 49.F 381.A countably 49.A infinitesimal (for a function) 87.G (for a hyperreal number) 293.D (for a sequence of random variables) 250.B order of (of a function) 87.G infinitesimal birth rate 260.0 mfinitesimal calculus (in nonstandard analysis) 293.D infinitesimal death rate 260.G infinitesimal deformation to the direction alas 72.G infinitesimal element (in nonstandard Hilbert space) 276.E infinitesimal generator (of a semi group) 378.B infinitesimal motion (of a Riemannian manifold) 364.E infinitesimal real number 276.E infinitesimal transformation (of a differentiable transformation group) 431.0 
Subject Index Infinitesimal wedge (of a one-parameter transformation group) 105 N infinitesimal wedge 125.V infinite Stiefel manifold 147 I infinite symplectic group 202.V infinite type (Lie algebra) 191 D infinite type power series space 168 B infinite unitary group 202 V infinity 87 D,G axiom of 33 B 381.G axiom of strong 33.E hyperplane at (in affine geometry) 7 B minus 87 D negative 87.D 355.C order of (of a function) 87 G plus 87.0 point at 7 B 74 D 178 F 285 C positive 87 D 355 C regular at the point at 193.B space at (in affine geometry) 7.B inflation 200.M inflection, point of 9 B 93 G influence, domain of 325.B influence curve 371 I information Fisher 399.D limited (in maximum likelihood method) 128.C loss of 138 B mutual 213.E self- 213.B information bit 63 C information compression 96.B information matrix 102 I 399.D information number, Kullback-Leibler (K-L) 398G information retrieval 96.F information retrieval system 96 F information sciences 75 F information set 173 B information source 213 A ergodic 213 C information theory 213 informatiques 75.F informative, more (statistical experiment) 398.G infra-exponential growth 125 AA,BB infrared di\ergence 132 C 146.B ingoing subspaces 375.H inhomogeneous (boundary value problem) 315.B (difference equation) 104.C (linear ordinary diffcrential equation) 252 A (system of linear differential equations) 252 G inhomogeneous coordinates 343 C inhomogeneous lattice (in R") 182 B inhomogeneous Lorentz group 359.B inhomogeneous polarization 3.G initial blocks 102 E initial-boundary value problem 325 K initial condition (for ordinary differential equations) 316.A (for partial differential equations) 321.A initial data 321 A initial distribution (of a Markov process) 261 A (for a stochastic differential equation) 406.0 initial function (of a functional-differential equation) 163.C initial law (for a stochastic differential equation) 406D 2018 initial number 312.0 initial object 52.D initial ordinal number 49.E initial phase (of a simple harmonic motion) 318.B initial point (of a curvilinear integral) 94.D (of a path) 170 (of a position vector) 7.A (of a vector) 442.A initial set (of a correspondence) 358.B (of a linear operator) 251.E initial state 31 B initial surface 321.A initial term (of an infinite continued fraction) 83.A initial value (for an ordinary differential equation) 316.A (for a partial differential equation) 321.A (for a stochastic differential equation) 406 D initial value problem (for functional-differential equations) 163.D (for hyperbolic partial differential equations) App. A, Table 15m (for integrodifferential equations) 222.B (for ordinary differential equations) 313.C 316A (for partial differential equations) 321 A Navier-Stokes 204.B singular (for partial differential equations of mixed type) 326 C initial vertex 186.B injection 381.C (in a category) 52.D (homomorphism of cohomology groups) 200.M canonical 381 C,E canonical (on direct sums of modules) 277.F canonical (on free products of group) 190.M canonical (from a subgroup) 190 D natural (from a subgroup) 190.D injective (Banach space) 37.M (C*-algebra) 36.H (mapping) 381.C (object in an Abelian category) 200.1 injective A-module 277.K injective class 200.Q injective dimension 200.K injective envelope 200.1 injective module, (R, S)- 200 K injective resolution (in an Abelian category) 200.1 j- 200.Q right (of an A-module) 200.F injectivity, rational 200.0 injectivity radius 178.C inner area 216 F 270 G inner automorphism(s) (of a group) 190.D (of a ring) 368.D group of (of a group) 190.D group of (of a Lie algebra) 248.H inner capacity, Newtonian 48.F inner derivation (of an associative algebra) 200.L (of a Lie algebra) 248.H inner function 43.F inner harmonic measure 169.B inner measure 270.E Lebesgue 270.F 
2019 inner product (in a Hermitian linear space) 256 Q (in a Hilbert space) 197 B (of hyperspheres) 76.A (between a linear space and its dual space) 256.G (in a metric vector space) 256.H (of a pair of linear spaces) 424.G (of vectors) 256 A 442 B Hermitian 256 Q inner product space 442.B inner solution 112.B inner topology (of a Lie subgroup) 249 E inner transformation (in the sense of Stoilow) 367.B inner variable 112.B inner volume 270.G innovation 405 H input, Poisson 260.H input data, error of 138.B inrevolvable oval 89.E inscribe (in a sphere) 139.1 inscribed circle (of a regular polygon) 357.A inseparable, purely (rational mapping) 16 I inseparable element (of a field) 149 H purely 149.H inseparable extension (of a field) 149.H purely 149.H inseparable polynomial 337 G inspection expected amount of 404 C sampling (sampling inspection) 404.C instantaneous state 260.F 261 B in-state 150 D 386.A instruction 31 B single-address 75.C insurance amount of 214 A cost of 214.B death 214.B mixed 214 B survival 214.B insured, amount 214.A integer(s) 294 C algebraic 14.A Cartan (of a semisimple Lie algebra) 248.N Gaussian 14 U p-adic 439.F p-adic, ring of 439.F rational 294 B integer polyhedron 215.C integer programming 215264.C integer programming problem all- 215.A mixed 215.A pure 215 A 0-1 215.A integrability (of multi valued vector functions) 443 I strong 443.1 integrability condition, complete 428 C integrable (function) 221.B (G-structure) 191.A (increasing process) 262.D 406 B (representation) 437 X (in the sense of Riemann) 216.A absolutely 216.E,F Birkhoff(function) 443 E Bochner (function) 443 C Subject Index Integral(s) completely (system of independent I-forms) 154 B 428 D D- (function) 100.0 Daniell-Stone (function) 310.1 Denjoy (in the wider sense) 100 D Dunford 443.F Gel'fand-Pettis (function) 443.F Lebesgue (function) 221.B locally, function 168 B Ji- (function) 221.B Perron (function) 100.F Pettis 443.F Riemann (function) 216 A scalarly 443 F,I square (function) 168.B square (unitary representation) 437.M term wise (series) 216.B uniformly (family of random variables) 262.A weakly (function) 443.E integrable distribution 125.N integrable increasing process 262 D 406 B integrable process of bounded variation 406.B integrable system 287.A integrably homotopic 154.F integral( s) (of differential forms) 105 T (of a distribution with respect to ).) 125 H (of a function) 221.B (= integrally dependent) 67.1 (of a Monge-Ampere equation) 278.B (of multi valued vector functions) 443.1 (scheme) 16 D Abelian 11 C action 80.Q Airy App. A, Table 19.1V almost (element of a ring) 67.1 of angular momentum 420.A Banach 310.1 Bartle-Dunford-Schwartz 443 G Bessel 39.B Birkhoff 443 E Bochner 443.C Bromwich 240.D 322.D, App. A, Table 12 I Carson App. A, Table 12.11 of Cauchy type 198.B of the center of mass 420.A complete additivity of the (in Lebesgue integral) 221.C complete elliptic App A, Table 16.1 complete elliptic, of the first kind 134.B complete elliptic, of the second kind 134.C conjugate Fourier 160.D constant 216 C cosine 167.0, App. A, Table 19.II curvilinear 94.A curvilinear (with respect to a line element) 94.D curvilinear (with respect to a variable) 94.D D(*)- 100 D Daniell-Stone 310.1 definite App A, Table 9.V definite (of a hyperfunction) 125,X definite (in a Riemann integral) 216.C defmite D- 100.D Denjoy 100 Denjoy (in the restricted sense) 100.0 Denjoy (in the wide sense) lOO.D direct 308 G direct (of unitary representations) 437.H Dirichlet (in Dirichlet problem) 120.F 
Subject Index Integral bilinear functional Dirichlet (in Fourier's single integral theorem) 160B double (in Riemann integral) 216.F Dunford 251.0 443.F elliptic I1.C 134.A, App A, Table 16.1 elliptic, of the first kind 134.A elliptic, of the second kind 134.A elliptic, of the third kind 134.A energy 420 A Euler, of the first kind 174.C Euler, of the second kind 174.C exponential 167 D, App. A, Table 19.II Feynman 146 first (of a completely integrable system) 428.D Fourier 160.A Fresnel 1670, App. A, Tables 9.V 19.II of a function with respect to a volume element 105.W Gauss 338.1 Gel'fand 443 F Oel'fand-Pettis 443.F harmonic 194.A Hilbert's invariant 46.C hyperelliptic 11.C hypergeometric 253.B improper (in Riemann integral) 216.D,E incomplete elliptic (of the first kind) 134.B indefinite (in Lebesgue integral) 221.0 indefinite (in Riemann integral) 198.B 216.C indefinite D- 100.D intermediate App. A, Table 15.II1 intermediate (of a Monge-Ampere equation) 278.B iterated (in Lebesgue integral) 221.E iterated (in Riemann integral) 216.G Jacobi 420.F L- 221.B with respect to A (of a distribution) 125.H Lebesgue 221.B Lebesgue-Radon 94.C Lebesgue-Stieltjes 94.C 166.C logarithmic 167.D, App. A, Table 19.II Lommel 39.C multiple (in Lebesgue integral) 221.E multiple (in Riemann integral) 216.F n-tuple(in Riemann integral) 216.F over an oriented manifold 105.T Pettis 44 3.F Poisson 168.B 193.G probability App. A, Table 19.II regular first 126.H repeated (in Lebesgue integral) 221.E repeated (in Riemann integral) 216.0 Riemann 37K 216A Riemann lower 216.A Riemann-Stieltjes 94.B 166.C Riemann upper 216.A scalar 443. F,I sine 167.D, App. A, Table 19.II singular 217.1 over a singular chain 105.T spectral 390.0 Stieltjes 94.B stochastic 261.E 406.B stochastic, of Stratonovich type 406.C surface 94.A,E surface (with respect to a surface element) 94.E trigonometric 160.A 2020 vector 443.A of a vector field App. A, Table 3.m vector-valued 443.A integral bilinear functional 424.R integral calculus 216 integral character (of the homology group of a Riemann surface) I1.E integral closure (of a ring) 67.1 integral cohomology group 201.H integral constant 216.C integral cosine 167.D integral current 275 G integral curvature (of a surface) 111.H integral curve (of a Monge equation) 324.F (of ordinary differential equations) 316.A integral direct sum 308.G integral divisor (of an algebraic curve) 9.C (of an algebraic number field) 14.F (on a Riemann surface) 11.D integral domain 368.B Noetherian 284.A integral element 428.E k-dimensional 191.1 ordinary 428.E regular 428.E integral equation(s) 217 Abel 217.L associated 217.F Fredholm 217.A of Fredholm type 217.A Hammerstein 217.M homogeneous 217.F linear 217.A nonlinear 217.M numerical solution of 217.N singular 217.1 transposed 217.F Volterra 217.A of Volterra type 2l7.A integral exponent 167.D integral form 248.W integral formula Cauchy 198.B Poisson 198.B Villat App A, Table 15.VI Weyl 225.1 integral function 429.A integral f)-lattice 27.A integral geometry 218 principal formula of 218.C integral homology group of a polyhedron 201.D of a simplicial complex 201 C integral hyper surface (of a partial differential equation) 320.A integral ideal (of an algebraic number field) 14.C integral inequality, Holder 211.C integral invariant(s) 219 absolute 219.A Cartan's relative 219.B relative 219.A integral kernel 217.A 251.0 integral left ideal 27.A integral logarithm 167.D integrally closed (in a ring) 67.1 completely (ring) 67.1 
2021 integrally closed ring 67.1 integrally dependent element (of a ring) 67 I integral manifold 428.A,B,0 k-dimensional 191.I ordinary (of a differential ideal) 428 E regular (of a differential ideal) 428.E singular (of a differential ideal) 428.E integral method, summable by Borel's 3790 integral operator 68.N 100.E 250.0 Calder6n-Zygmund singular 217 J 251 0 Fourier 274.C 345.B of Hilbert-Schmidt type 68 C integral point 428.E,F integral quotient (in the division algorithm of polynomials) 337.C integral representation 362.0,K Cauchy 21.C Herglotz 43 I Laplace-Mehler App A, Table 18 II Schlafli 393 B integral right ideal 27.A integra] sine ]67.0 integral singular homology group 201.E integral test, Cauchy (for convergence) 379 B integral theorem Cauchy 198 A,B Fourier double 160 B Fourier single 160.B stronger form of Cauchy 198.B integral transforms 220.A integral two-sided o-ideal 27.A integral vector 428 E integrand 216.A integrate 216.A (an ordinary differential equation) 3l3.A integrating factor App. A, Table 14 I in tegration along a fiber (of a hyperfunction) 274.E automatic, scheme 299.C contour of (of curvilinear integral) 940 domain of 2]6F graphical 19.B Jacobi's second method of 324.0 numerical 299 path of (of curvilinear integral) 94.0 Romberg 299.C integration by parts 216.C (on D-integral) l00.G (in the Stieltjes integral) 94.C integration constant (in a general solution of a differential equation) 3 J3.A integration formula based on variable transformation 299.B Oauss (in the narrow sense) 299.A Poisson App. A, Table 15.Vl Villat App. A, Table 15.VI integrodifferential equation(s) 163.A 222 of Fredholm type 222.A PrandtJ's 222.C of Volterra type 222.A Wiener-Hopf 222.C intensity, traffic 260.H intensive (thermodynamical quantity) 419.A interaction 102.H interest, assumed rate of 214 A interference (of waves) 446 interior (of an angle) 1390 155.B (of a manifold) 105.B (of a polygon) 155.F Subject Index Intersect (of a segment) 155 B (of a set) 425 B (of a simplex) 70.C interior capacity, Newtonian 48.F interior cluster set 62.A interior field equation 359.0 interior operator 425.B interior point 425.B interior problem (in Oirichlet problems) 120.A interior product (of a differential form with a vector field) 105.Q intermediate convergent (of an irrational number) 83.B intermediate field 149.0 intermediate integrals App. A, Table 15 m of Monge-Ampere equation 278.B intermediate-value theorem 84.C intermittent structure 433.C internal (in nonstandard analysis) 293.B internal energy 419.A internal irregular point 338.L internal law of composition (of a set) 409.A internally stable set 186.1 internally thin set 338.G internal product 200 K internal space in catastrophe theory (in static model) 51.B internal state 31.B internal symmetry 150.B international notation (for crystal classes) 92.B international system of units 414.A interpolating (for a function algebra) 164.0 interpolating sequence 43.F interpolation (of a function) 223, App. A, Table 21 (of a stationary process) 176.K 395.E Chebyshev 223.A 336 J inverse 223.A Lagrange 223.A of operators 224 spline 223.F interpolation coefficient, Lagrange's 223.A interpolation formula 223.A Bessel App A, Table 21 Everett App. A, Table 21 Oauss App. A, Table 21 Gauss's backward 223.C Gauss's forward 223.C Newton App. A, Table 21 Newton's backward 223 C Newton's forward 223.C Stirling App. A, Table 21 interpolation function 223.A interpolation method 224.A interpolation polynomial 223.A Hermite 223.E Lagrange 336.G, App. A, Table 21 Newton 336.G trigonometric 336.E interpolation problem 43.F interpolation scheme, Aitken 223.B interpolation space 224.A complex 224.B real 224 C interpolation theorem 224.B,C interpolatory formula 299.A interquartile range 397.C intersect 155.B properly (on a variety) 16 G transversally 105.L 
Subject Index Intersection intersection (of events) 342 B (of projective subs paces) 343 B (of sets) 381B (of subspaces of an affine space) 7.A complete 16.A intersection chart 19.0 intersection multiplicity (of two subvarieties) 16.Q intersection number (of divisors) 15 C (of homology classes) 65 B 201.0 (of sheaves) 16.E self- 15 C intersection product (in algebraic varieties) 16.Q (in homology theory) 201 0 intersection property, finite 425 S intersection theorem (of affine geometry) 7.A (of projective geometry) 343.B Cantor's 273 F Krull 284.A interval (in a Boolean algebra) 42 B (in a lattice) 243.C (in a vector lattice) 310 B (in an ordered set) 311 B (in real number space) 355 C of absolute stability 303 G basic 4.B closed 140355 C confidence 399 Q of continuity (for a probability distribution) 341 C fiducial 401 F finite 355.C infinite 355.C open 140355C principle of nested 87.C of relative stability 303 G supplementary 4.B to lerance 399.R interval estimation 399 Q 401 C interval function 380.A additive 380 B continuous additive 380.B in the large (in differential geometry) 109 in the small (in differential geometry) 109 intrablock analysis 102.0 intransitive (permutation group) 151 H intrinsic angular momentum 415.G intrinsic homology 114 H intuitionism 156 A semi- 156.C intuitionistic logic 411.L invariance of a confidence region 399 Q of dimension, theorem on (of Euclidean spaces) 117.0 of domain, Brouwer theorem on 117 0 homotopy 201 0 isospin 351 J Lorentz 150 B of speed oflight, principle of 359.B topological (homology groups) 201.A invariance principle (of hypothesis testing) 400.E (of wave operators) 375 B Oonsker's 250.E Strassen's 250.E 2022 in variance theorem of analytic relations 198.K invariant(s) App. A, Table 14.II1 (of an Abelian group) 2 B (of a cohomology class of a Galois group) 59 H 257.E (decision problem) 398.E (element under a group action) 226.A (of an elliptic curve) 73.A (in the Erlangen program) 137 (under flow) 126.0 (function algebra) 164.H (hypothesis) 400.E (measure) 136.B 225 270.L (S-matrices) 386 B (of a normal simple algebra) 257.G (subspace of a Banach space) 25l.L absolute 12 A 226.A absolute integral 219 A almost G- 396.1 Arf- Kervaire 114.1 basic 226.B birational 12.A Browder-Livesay 114 L Cartan (of a finite group) 362.1 Cartan relative integral 219.B conformal 77.E covering linkage 235 E differential (on an m-dimensional surface) 110.A Eilenberg- Postnikov (of a CW -complex) 70.G fundamental (of a space with a Lie transforma- tion group) 110.A fundamental differential (of a surface) 110.B G- (element) 226.A G- (measure) 225.A G- (statistics) 396.1 generalized Hopf 202 Q Hasse (of a central simple algebra) 29 G homotopy 202.B homotopy type 202.F Hopf 202.S,U Hopf, modulo p 202 S integral 219 isomorphism (on a measure space) 136.E Iwasawa 14.L k- (of a CW-complex) 70.G left (metric in a topological group) 423.1 left, Haar measure 225.C left, tensor field 249.A metric (on a measure space) 136.E Milnor 2350 of n-ary form of degree d 226.0 negatively 126 0 normal 114.1 of order p 110.A p- (of a central simple algebra) 29.G PCT 386.8 Poincare's differential 74.G positively 126.0 rearrangement 168.B relative 12.A 226 A relative integral 219.A right, Haar measure 225.C right, tensor field 249 A sampling procedure 373.C semi- 226.A semi- (ofa probability distribution) 341.C shape 382 C spectral 136 E TCP- 386 B 
2023 topological 425.G V - (subspace of a representation space of a unitary representation) 437.C uniformly most powerful 399 Q vector 226.C of weight w 2260 invariant decision function 398 E invariant derivation (on an Abelian variety) 3 F invariant differential form (on an Abelian variety) 3.F invariant distribution(s) (of a Markov chain) 260.A (of second quantization) 377.C invariant estimator 399 I best 399 I invariant field 172 B invariant integral, Hilbert's 46.C invariant level IX test, uniformly most powerful (UMP) 400.E invariant Markov process 5 H invariant measure(s) 225 (of a Markov chain) 260.A (of a Markov process) 261 F (under a transformation) 136 B G- 225 B quasi- 225.J relatively 225 H smooth 126 J sub- 261.F transverse 154 H invariant measure problem 136.C invariants and co variants 226 invariant statistic 396.1 maximal 396.1 invariant subgroup (of a group) 190.C invariant subs pace (of a linear operator) 164 H doubly 164 H invariant tensor field left 249.A right 249.A invariant test 400 E almost 400 E invariant torus 126.L inventory control 227 inventory model 307 C inverse (in a groupl 190 A (of a mapping) 381.C homotopy (for an H-space) 2030 quasi- (on a Banach algebra) 36 C right (in nonlinear functional analysis) 286.G inverse analytic function 198 L inverse assumption 304.0 inverse correspondence 358.B inverse domination principle 338.L inverse element (in a group) 190 A (in a ring) 368 B left (in a ring) 368 B quasi- (in a ring) 368.B right (in a ring) 368.B inverse Fourier transform (of a distribution) 125.0 inverse function 198 L 381.C inverse function element 198.L inverse image (of a set) 381 C (of a sheaf) 383.0 (of a uniformity) 436.E perfect 425.CC Subject Index Involutory (involutive) system inverse interpolation 223.A inverse iteration 298 C inverse limit (of an inverse system of sets) 21O.B inverse mapping 381.C inverse mapping theorem 208.B inverse matrix 269.B inverse morphism 52.0 inverse operator 37.C 251 B in verse path 170 inverse problem (in potential scattering) 375 G Jacobi 3.L inverse relation 358.A inverse system (of sets) 210.B inverse transform (of an integral transform) 220.A inverse trigonometric function 131.E inversion (with respect to a circle) 74.E (of a domain in R") 193 B (with respect to a hypersphere) 76.A Laguerre 76 B space 258 A space-time 258.A inversion formula (for a characteristic function) 341.C (of a cosine transform) 160.C (of a Fourier transform) 160 C (of a Fourier transform of distributions) 160.H (of a Fourier transform on a locally compact Abelian group) 192.K (of a generalized Fourier transform) 220.B (of a Hilbert transform) 220.E (of an integral transform) 220.A (of a Laplace-Stieltjes transform) 240.0 (on a locally compact group) 437.L (of a Mellin transform) 220.C (for a semigroup of operators) 240 I (of a Stieltjes transform) 2200 Fourier 160.C Mobius (in combinatorics) 66.C Mobius (in number theory) 295 C in verted filing scheme 96.F invertible element quasi- 368 B of a ring 368.B invertible jet 105.X invertible knot 235.A invertible matrix 269 B invertible sheaf 16.E involute (of a curve) II1.E involution (of an algebraic correspondence) 9.H (in a Banach algebra) 36.F (of a division ring) 348.F (of a homotopy sphere) 114.L Cartan 427.X involutive (cross section) 286.H (differential ideal) 428.E (differential system) 191.I (distribution) 154 B 428 0 (Lie group) 191.H involutive automorphism (of a Lie group) 412 B involutive correlation 343.0 involutive distribution (on a differentiable manifold) 428.0 involutive subspace 428.F involutory (involutive) system (of differential forms) 428.F (of nonlinear equations) 428 C 
Subject Index Irrational function, elliptic (of partial differential equations) 428.F (of partial differential equations of first order) 324.0 irrational function, eJJiptic 134.E irrational number(s) 294.E 355.A space of 22.A irrational real number 294.E irreducibility theorem, Hilbert's (on polynomials) 337.F irreducible (algebraic curve) 9 B (algebraic equation) 10 B (algebraic variety) 16.A (coalgebra) 203.F (complemented modular lattice) 243.F (continuous geometry) 85 A (continuum) 790 (Coxeter complex) 13.R (discrete subgroup of a semisimple Lie group) 122.F (germ of an analytic set) 23.B (linear representation) 362.C (linear system) 16 N (linear system in control theory) 86.C (3-manifold) 65.E (Markov chain) 260.B (polynomial) 337 F (positive matrix) 269.N 31O.H (projective representation) 362.J (representation of a compact group) 69.B (Riemannian manifold) 364 E (root system) 13.L (scheme) 16.0 (Siegel domain) 384.E (transition matrix) 126 J (unitary representation) 437.A absolutely (representation) 362.F at 0 (for an algebraic set) 23.B irreducible character (of an irreducible representation) 362.E absolutely 362.E irreducible component (of an algebraic variety) 16.A (of an analytic space) 23.C (of a linear representation) 362.0 irreducible element (of a ring) 67 H irreducible representation(s) (of a Banach algebra) 360 fundamental system of (of a complex semisimple Lie algebra) 248.W irreducible symmetric bounded domain 412.F irreducible symmetric Hermitian space 412.E irreducible symmetric Riemannian space 412 C, App. A, Table 5 III irreducible tensor of rank k 353.C irredundant (intersection of primary ideals) 67.F irregular (boundary point) 120.0 (prime number) 14.L irregularity (of an algebraic surface) 15.E number of (of an algebraic variety) 16.0 irregular point (of Brownian motion) 450 (of a Markov process) 261.0 (in potential theory) 338.L external 338 L internal 338.L irregular singular point (of a solution) 254 B 2024 (of a system of linear ordinary differential equations) 254.B irreversible processes, statistical mechanics of 402.A irrotational (fluid) 205 B (vector field) 442.0 Irwin's embedding theorem 65.0 Ising model 340.B 402.0 stochastic 340.C island (in a Riemann surface) 272.1 isobaric polynomial 32.C isogenous (Abelian varieties) 3.C (algebraic groups) 13.A isogeny 13.A isolated fixed point 126.0 isolated ordinal number 312.B isolated point (of a curve) 93G (in a topological space) 425.0 isolated primary component (of an ideal) 67.F isolated prime divisor (of an ideal) 67.F isolated singularity (of an analytic function) 198.0,M isolated singular point 198.0418.0 isolated vertex 186.B isometrically isomorphic (normed spaces) 37.C isometric immersion 365.A isometric mapping 111 H 273.B isometric operator 251.E partially 251.E isometric Riemannian manifolds 364.A isometric spaces 273.B isometry ( = isometric operator) 251.E (between Riemannian manifold) 364.A isomonodromic deformation 253.E isomorphic (algebraic systems) 409.C (block bundles) 147.Q (cohomology theories) 201.Q (complex manifolds) 72.A (fiber bundles) 147.B (groups) 190.0 (Lie algebras) 248.A (Lie groups) 249.N (measure spaces) 398.0 (normed spaces) 37.C (objects) 52.0 (PL-embeddings) 65.0 (representations) 362.C (simplicial complexes) 70.C (s.s. complexes) 70.E (structures) 276.E (topological groups) 423.A (unitary representations) 437.A anti- (lattices) 243.C Borel 270.C dually (lattices) 243.C isometrically (normed spaces) 37.C locally 423 0 metrically (automorphisms on a measure space) 136.E order 311.E similarly (ordered fields) 149.N spatially (automorphisms on a measure space) 136.E spectrally 136.E weakly 136.E 
2025 isomorphic mapping, Borel 270.C isomorphic relations among classical Lie algebras App. A, Table 5.1V isomorphism (of Abelian varieties) 3.C (of algebraic systems) 409.C (of block bundles) 147.Q (of fields) 149.B (of functors) 52.1 (of groups) 190.0 (of lattices) 243.C (of Lie algebras) 248.A (of linear spaces) 256.B (of objects) 52.0 (of prealgebraic varieties) 16.C (of rings) 368.0 (of topological groups) 423.A (of unfoldings) 51.0 admissible (ofn-groups) 190.E algebra 29.A analytic 2l.J analytic (of Lie groups) 249.N anti- (of groups) 190.0 anti- (of lattices) 243.C anti- (of ordered sets) 311.E anti- (of rings) 368.0 birational (of Abelian varieties) 3.C birational (of algebraic groups) 13.A Bott 237.0 C"'- (of Lie groups) 249.N dual (oflattices) 243.C dual (of ordered sets) 311.E excision (on homology groups) 201.F,L functorial 52.1 G- 191.A k- (of algebraic groups) 13.A k- (of extension fields of k) 149.0 lattice- 243.C local (of topological groups) 423.0 modp (in a class of Abelian groups) 202.N n- (of n-groups) 190.E operator (ofn-groups) 190.E order 311.E ring 368.0 suspension (for homology) 201.E Thom-Gysin 114.G Thom-Gysin (of a fiber space) 148.E uniform 436.E isomorphism invariant (on a measure space) 136.E isomorphism problem (in ergodic theory) 136.E (for graphs) 186.1 (for integral group algebras) 362.K isomorphism theorem (in class field theory) 59.C (on groups) 190.0 (on rings) 368.F (on topological groups) 423.1 Ax-Kochen (on ultraproduct) 276.E first (on topological groups) 423.1 Hurewicz 202.N Hurewicz-Steenorod (on homotopy groups of fiber spaces) 148.0 Keisler-Shelah (in model theory) 276.E second (on topological groups) 423.1 third (on topological groups) 423.1 isoparametric (hypersurface) 365.1 ISO parametric method 304.C isoperimetric (curves) 228.A isoperimetric constant 391.0 Subject Index Iwasawa decomposition isoperimetric inequality 228.B isoperimetric problem(s) II1.E 228.A generalized 46.A 228.A special 228.A isospectral 391.B isospectral deformation 387.C isospin 351.J isospin invariance 351.J isothermal compressibility 419.B isothermal coordinates 90.C isothermal curvilinear coordinate system App. A, Table 3.V isothermal parameter 334.B (for an analytic surface) 111.1 isothermal process 419.B isotopic 65.0202.B (braids) 235.F (embeddings) 114.0 (latin square) 241.A ambient 65.0 isotopy 65.0202.B ambient 65.0 isotopy lemma, Thom's first 418.G isotopy type (of knots) 235.A isotropic (with respect to a quadratic form) 348.E k- (algebraic group) 13.G lotally (subspace) 60.0 348.E isolropic point 365.0 isotropic submanifold 365.0 isotropic turbulence 433.C isotropy, index of total (of a quadratic form) 348.E isotropy group 362.B linear 199.A principal 431.C isotropy representalion 431.C isotropy subgroup (of a topological group) 431.A isotropy Iype (of a transformation group) 431.A iterated integral (in Lebesgue integral) 221.E (in Riemann integral) 216.G iterated kernel (for a Fredholm integral equation) 217.0 iterated logarithm Khinchin's law of 250.C law of 45.F iterated series by columns (of a double series) 379.E by rows (of a double series) 379.E iteration (in a Banach space) 286.B inverse 298.C method of successive (for Fredholm integral equations) 217.0 two-body 271.C iteration matrix 302.C iterative improvement 302.C iterative method 302.C iterative process, linear stationary 302.C Ito circle operation 406.C Ita decomposition, Wiener- 176.1 Ita formula 45.G 406.B Ita process 406.B Ita theorem, Levy- (on Levy processes) 5.E Ita type, stochastic integral of 406.C ITPFI 308.1 Iversen-Beurling-Kunugi theorem 62.B Iversen theorem 272.1 Iwahori subgroup 13.R Iwasawa decomposition 
Subject Index Iwasawa group (of a Lie group) 249.T (of a real semisimple Lie algebra) 248.V Iwasawa group 384.C Iwasawa invariant 14.L Iwasawa main conjecture 450.1 Iwasawa theorem, Cartan-Mal'tsev- (on maximal compact subgroups) 249.S J J-group 237.1 equivariant 431.F J-homomorphism 202.V 237.1 equivariant 431.F J-method 224.C Jackson's theorem (on the degree of approximation) 336.C Jacobi, C. G. J. 229 Jacobian 208.B Jacobian, generalized (of a set function) 246.H Jacobian criterion (on regularity of local rings) 370.B Jacobian determinant 208.B Jacobian matrix 208.B Jacobian variety 9.E I I.C 16.P canonically polarized 3.G 9.E generalized 9.F II.C Jacobi- Biehler equality 328 Jacobi condition 46.C Jacobi differential equation App. A, Tables 14.II 20.V Hamilton- 271.F 324.E Jacobi elliptic functions App. A, Table 16.m Jacobi equation, Hamilton- 108.B Jacobi field 178.A Jacobi identity (on the bracket of two vector fields) 105.M (in a Lie algebra) 248.A (with respect to Whitehead product) 202.P Jacobi imaginary transformation 134.1 Jacobi integral 420.F Jacobi inverse problem 3.L Jacobi last multiplier App. A, Table 14.1 Jacobi matrix 390.0 Jacobi method (in numerical computation of eigenvalues) 298.B (in numerical solution of linear equations) 302.C cyclic 298.B threshold 298. B Jacobi polynomial 317.0, App. A, Table 20.V Jacobi second method of integration 324.0 Jacobi symbol 297.1 complementary law of 297.1 law of quadratic reciprocity of 297.1 Jacobi standard form, Legendre- 134.A, App. A, Table 16.1 Jacobi transformation App. A, Table 16.111 Jacobson radical (of a ring) 67.0 Jacobson topology 36.0 James theorem 37.G Janko-Ree type, group of 15l.J Janzen area (of a Borel set) 246.G Japanese mathematics (wasan) 230 Japanese ring, universally 284.F Jarrat-Mack method, Garside- 301.N Jeffreys method 112.B Jensen formula 198.F Jensen measure 164.K 2026 jet invertible 105.X of order r 105.X job 281.0 job-shop scheduling 307.C job-shop scheduling problem 376 John-Nirenberg space (= BMO) 168.B JOIn (in a Boolean algebra) 42.A (in a lattice) 243.A (of points) 155.B (of projective spaces) 343.B (of sets) 381.B (of simplicial complexes) 70.C (of subgroups of a group) 190.G reduced (of homotopy classes) 202.Q reduced (of mappings) 202.F reduced (of topological spaces) 202.F joined by an arc 79.B joint cumulant 397.1 joint density 397.1 joint distribution 342.C joint moment generating function 397.1,1 joint random variable 342.C joint sensity function 397.1 joint spectrum 36.M Jordan algebras 23 I exceptional 231.A free special 23 I.A semisimple 231.B special 231.A Jordan arc 93.B Jordan canonical form (ofa matrix) 269.G Jordan content 270.G Jordan curve 93.B Jordan curve theorem 93.K Jordan decomposition (of an additive set function) 380.C (of a function of bounded variation) 166.B (of a linear mapping) 269.L (in an ordered linear space) 310.B multiplicative 269.L Jordan domain 333.A Jordan elimination, Gauss- 302.B Jordan factorial 330 Jordan-Holder sequence (in a group) 190.G Jordan-Holder theorem (in group theory) 190.G Jordan-Holder theorem (on representations of algebras) 362.0 Jordan homomorphism (between Jordan alge- bras) 231.A Jordan inequality App. A, Table 8 Jordan measurable set (ofR") 270.G Jordan measure 270.o,G Jordan module 231.C Jordan normal form 269.G Jordan test (on the convergence of Fourier series) 159.B Jordan-Zassenhaus theorem (on integral represen- tation ofa group) 362.K Joule heat 130.B Joule's law 130.B Julia's direction (of a transcendental entire function) 272.F 429.C Julia's exceptional function 272.F jump (at a point) 84.B jump function 306.C jumping of the structures 72.G just identified (equation) 128.C 
2027 K K-recursiveness 356.0 k-almost simple algebraic group 13.0 k-anisotropic algebraic group 13.0 k-array 330 k-Borel subgroup (of an algebraic group) 13.G k-closed algebraic set 13.A k-combination 330 k-compact algebraic group 13.G k-connect (graph) 186.F k-dimensional integral element 191.1 k-dimensional integral manifold 191.1 k-dimensional normal distribution 341.0 k-equivalent CO-manifolds 114.F k-Erlang distribution 260.H k-foJd mixing automorphism 136.F k-fold screw glide with pitch 92.E k-form (of an algebraic group) 13.M holomorphic 72.A k-frame 199.B orthogonal 199.B k-group 13.A k-invariants (of a CW-complex) 70.0 k-isomorphism (between algebraic groups) 13.A k-isotropic algebraic group 13.G k-morphism (between algebraic groups) l3.A k-movable 382.C k-permutation 330 k-ply transitive G-set 362.B k-ply transitive permutation group 151.H k-quasisplit algebraic group 13.0 k-rank (of a connected reductive algebraic group) 13.Q k-rational divisor (on an algebraic curve) 9.C k-rational point (of an algebraic variety) 16.A k-root 13.Q k-sample problem 371.0 k-simple algebraic group 13.0 k-solvable algebraic group 13.F k-space 425.CC k-split algebraic group 13.N k-split torus 13.0 maximal 13.Q k-step method, linear 303.E k-subgroup minimal parabolic 13.Q standard parabolic 13.Q k-subset 330 k-transitive permutation group 151.H k-trivial torus 13.0 k-valued algebroidal function 17.A k-vector bundle, normal 114.1 k-way contingency table 397.K k-Weyl group 13.Q k' -space 425.CC kth prolongation (of G structure) 191.E (of a linear Lie algebra) 191.0 (ofa linear Lie group) 191.0 kth saturated model 293.B kth transform 160.F K-complete analytic space 23.F K-complete scheme 16.0 K-flow 136.E K-group 237.B equivariant 237.H K-method 224.C K-pseudoanalytic function 352.B Subject Index Kernel K-quasiregular function 352.B K-rational (=algebraic over K) 369.C K-regular measure 270.F K-theory 237 algebraic 237.1 higher algebraic 237.1 K3 surface 15.H 72.K marked 72.K Kac formula, Feynman- 351.F Kac- Nelson formula, Feynman- 150.F Kadomtsev-Petvyashvili equation 387.F Kahler existence theorem, Caftan- 19t.I 428.E Kahler homogeneous space 199.A Kahler immersion 365.L Kahler manifold 232 Kahler metric 232.A standard (of a complex projective space) 232.0 Kahler metric, Einstein- 232.C Kahler submanifold 365.L Kakeya-Enestrom theorem (on an algebraic equation) 10.E Katutani equivalence 136.F Kakutani fixed point theorem 153.0 Kakutani theorem (on complemented subs pace problems of Banach spaces) 37.N (on statistical decision problems) 398.G Kakutani unit 310.G Kallen-Lehmann representation 150.0 Kallen-Lehmann weight 150.0 Kalman-Bucy filter 86.E 405.G Kalman filter 86.E Kaluza's 5-dimensional theory 434.C Kametani theorem, Hallstrom- 124.C Kan complex 70.F Kaplansky's density theorem 308.C Kapteyn series 39.0, App. A, Table 19.III Karlin's theorem 399.G Kastler axioms, Haag- 150.E Kato perturbation 351.0 Kato theorem, Rellich- 331.B Kawaguchi space 152.C KdVequation 387.B two-dimensional 387.F Keisler-Shelah isomorphism theorem 276.E Keller-Maslov index 274.C Kelly theorem, Nachbin-Goodner- 37.M Kelvin function 39.G, App. A, Table 19.IV Kelvin transformation 193.B Kendall notation 260.H Kendall's rank correlation 371.K Kepler's equation 309.B Kepler's first law 271.B Kepler's orbital elements 309.B Kepler's second law 271. B Kepler's third law 271.B Kerekjarto-Stoi10w compactification 207.C kernel (of a bargaining set) 173.0 (distribution) 125.L (of a group homomorphism) 190.0 (of an integral equation) 217.A (of an integral operator) 251.0 (of an integral transform) 220.A (of a linear mapping) 256.F (of a morphism) 52.N (of an operator homomorphism) 277.E (of a potential) 338.B 
Subject Index Kernel differential (of a set) 425.0 (of a sheaf homomorphism) 383.C (of a system of Suslin) 22.B adjoint 338.B of Calder6n-Zygmund type 217.1 of Carleman type 217.1 consistent 338.E degenerate 217.F diffusion 338.N Oirichlet 159.B distribution 338.P domain (of a sequence of domains) 333.C elementary 320.H Fejer 159.C Fourier 220.B fundamental 320.H Hermitian 217.H of Hilbert-Schmidt type 217.1 Hunt 338.0 integral 217.A 251.0 iterated 217.0 Kuramochi 207.C Martin 207.C perfect (in potential theory) 338.E Pincherle-Ooursat 217.F Poisson 159.C positive 217.H positive definite 217.H 338.0 positive semidefinite 217.H reproducing 188.G semidefinite 217.H separated 217.F singular 217.1 symmetric 217.G symmetric, of positive type 338.0 weak potential 260.0 Wiener 95 kernel differential 188.G kernel form 348.E kernel function 188.G Bergman l88.G harmonic 188.H Szego 188.H kernel representation (of a Green's operator) 189.B kernel theorem (of Schwartz) 125.L 424.S Kerner-Aberth (OKA) method, Ourand- 301.F Kerner (OK) method, Ourand- 301.F Kerr metric 359.E Kervaire invariant, Arf- 114.1 Khachiyan's method 255.C Khinchin canonical form (of an infinitely divisible probability distribution) 341.G Khinchin decomposition (of a covariance function) 395.B Khinchin's law of the iterated logarithm 250.C Kiefer inequality, Chapman-Robbins- 399.0 killing 261.F Killing curvature, Lipschitz- 279.C Killing differential equation 364.F Killing form 248.B killing measure 115.B killing method (of obtaining a homotopy group) 202.N killing time 260.A Killing vector field 364.F kind Abelian differential of the first, second, third I1.C Abelian integral of the first, second, third 11. C 2028 associated Legendre function of the first, second 393.C Beltrami differential operator of the first, second App. A, Table 4.11 Bessel function of the first, second, third 39.B complete elliptic integral of the first, second 134.B,C differential form of the first (on an algebraic variety) 16.0 canonical coordinates of the first, second 249.Q dfferential form of the first, second, third (on a nonsingular curve) 9.E discontinuity of the first, second 84.B discontinuous group of the first 122.B elliptic function of the first, second, third 134.G,H elliptic integral of the first, second. third 134.A error of the first, second 400.A Euler integral of the first, second 174.A,C exceptional curve of the first, second 15.G Fredholm integral equation of the first, second, third 217.A Fuchsian group of the first, second 122.C Hankel function of the first, second 39.B incomplete elliptic integral of the first 134.B Lame function of the first, second 133.C Legendre function of the first, second 393.B Mathieu function of the first, second 268.B modified Mathieu function of the first, second, third 268.0 perfect members of the second 297.0 Stirling number of the second 66.0 kinetic density 218.A kinetic energy 271.C 351.0 kinetic measure 218.A kinetic theory of gases 402.B Kirby calculus 114.L Kirchhoff laws 282.B Kirchhoff solution 325.0 Kleene's normal form theorem 356.C Klein, F. 233 Klein bottle 410.B Klein combination theorem 234.0 Klein four-group 151.G Klein-Gordon equation 351.G 377.C Kleinian group 122.C 234.A Klein inequality 212.B Klein line coordinates 90.B Klein model (of non-Euclidean geometry) 285.C Klein-Nishina formula 351.G Klein transform 150.0 K -L (= Kullback-Leibler) information number 398.G Kloosterman sum 32.C KMS condition 308.H 402.G Kneser-Nagumo theorem 316.E Kneser-Sommerfeld formula App. A, Table 19.111 Knopp-Schmidt theorem 208.C Knopp-Schnee theorem (on method of summation) 379.M knot 235.A alternating 235.A, App. A, Table 7 amphicheiral 235.A clover leaf 235.C equivalent 235.A hyperbolic 235.E invertible 235.A (p, q)- 235.G (p, q)-ball 235.G 
2029 slice 235.0 trefoil 235.C knot cobordism 235.0 knot complement conjecture 235.B knot conjecture, general 235.B knot group (of a knot) 235.B knot projection, regular 235.A knotted 235.A knot theory 235, App. A, Table 7 knot types 235.A Ko bayashi pseudodistance 21.0 Kochen isomorphism theorem, Ax- 276.E Kodaira dimension (of a compact complex manifold) 72.1 Kodaira-Spencer mapping (map) 72.0 Kodaira theorem (on Hodge manifolds) 232.0 Chow- 72.F Kodaira theory, Weyl-Stone-Titchmarsh- 112.0 Kodaira vanishing theorem 232.0 Koebe extremal function 438.C Koebe theorem 193.E Kojima-Schur theorem (on linear transformations of sequences) 379.L Kolchin theorem, Lie- (on solvable algebraic groups) 13.F Kollektiv 342.A Kolmogorov automorphism 136.E Kolmogorov axiom To 425.Q Kolmogorov backward equation 115.A 260.F Kolmogorov canonical form 341.0 Kolmogorov-Chaitin complexity 354.0 Kolmogorov equality, Chapman- 261.A Kolmogorov equation, Chapman- 260.A Kolmogorov extension theorem 341.I 407.0 Kolmogorov flow 136.E Kolmogorov forward equation Il5.A 26O.F Kolmogorov-Smirnov test 371.F Kolmogorov-Smirnov test statistic 374.E Kolmogorov space 425.Q Kolmogorov-Spanier cohomology theory, Alex- ander- 201.M Kolmogorov spectrum 433.C Kolmogorov test 45.F Kolmogorov theorem 250.F Kolmogorov zero-one law 342.0 Kondo uniformization theorem 22.F Konig-Egervary theorem 281.E Konigs-Schroder equation 44.B Korteweg-de Vries (KdV) equation 387.A Kostant's formula (on representations of compact Lie groups) 248.Z Kovalevskaya existence theorem, Cauchy- 321.A Kovalevskaya theorem, abstract Cauchy- 286.z Krein-Mil'man property 443.H Krein-Mil'man theorem 424.U Krein-Schmul'yan theorem 37.E 424.0 Krein theorem 424.V Krieger's factor 308.1 Kronecker, L. 236 Kronecker approximation theorem 422.K Kronecker delta 269.A, App. A, Table 4.II Kronecker flow 136.0 Kronecker index (in homology theory) 20tH (of divisors on a surface) 15.C Kronecker limit formula 450.B Kronecker product (of matrices) 269.C Kronecker set 192.R Kronecker symbol (for a quadratic field) 347.0 Subject Index L-space Kronecker theorem (on an Abelian extension of Q) 14.L (on an algebraic equation) IO.B Krull-Akizuki theorem 284.F Krull altitude theorem 28.A Krull-Azumaya lemma 67.0 Krull dimension (of a commutative ring) 67.E (of an ideal) 67.E Krull intersection theorem 284.A Krull-Remak-Schmidt theorem (in group theory) 190.L Krull ring 67.J Krull topology (for an infinite Oalois group) 172.1 Kruskal coordinates 359.F Kruskal-Wallis test 371.0 Kubo formula 402.K Kuhn-Tucker theorem 292.B Kullback discrimination information 213.0 Kullback-Leibler (K-L) information number 398.0 Kummer criterion 145, App. A, Table 10.II Kummer differential equation App. A, Table 19.1 Kummer extension 172.F Kummer function 167.A, App. A, Table 19.1 Kummer surface 15.H Kiinneth formula (in Abelian category) 200.H (in Weil cohomology) 450.Q Kiinneth theorem 200.E 20l.J Kunugi theorem, Iversen-Beurling- 62.B Kuo (PLK) method, Poincan!:-Lighthill- 25.B Kuramochi compactification 207.C Kuramochi kernel 207.C Kuranishi prolongation theorem 428.0 Kuranishi space 72.0 Kuratowski space 425.Q kurtosis 396.C 397.C population 396.C Kutta-Oill method, Runge- 303.0 Kutta methods, Runge- 303.0 L A' (Lipschitz spaces) 168.B A-function 32.C (lp) or lp (a sequence space) 168.B Lp (n) (the space of measurable functionsJ(x) on n such that If(x)IP, 1  p  00, is integrable) 168.B Loo (n) 168.B L(P.q) (n) (the Lorentz spaces) 168.B l-adic cohomology 450.Q l-adic coordinate system 3.E l-adic representation 3.E L-distribution 341.0 L-estimator 371.H L-function Artin 450.0,R of automorphic representation 450.N Oirichlet 450.C of elliptic curves 450.S Hecke 450.E Hecke (with Orossencharakter) 450.F p-adic 450.1 Weil 450.H L-group 450.N L-integral 221.B L-space 87.K abstract 310.0 Frechet 87.K 
Subject Index L I -algebra LI-algebra (of a locally compact Hausdorff group) 36.L Lp-space, abstract 310.G L*-space 87.K (LF)-space 424.W labeled graph 186.B lacunas for hyperbolic operators 325.J lacunary structure (of a power series) 339.E ladder down- 206.B up- 206.B ladder method 206.B lag 163.A lagged variables 128.C 274.C Lagrange, J. L. 238 Lagrange bracket 82.A 324.0 Lagrange-Charpit method 322.B, App. A, Table 15.11 Lagrange differential equation 320.A, App. A, Table 14.1 Lagrange differential equation, Euler- 46.B Lagrange equations of motion 271.F Lagrange formula (for vector triple product) 442.C Lagrange identity 252.K Lagrange interpolation 223.A Lagrange interpolation coefficients 223.A Lagrange interpolation polynomial 223.A 336.G, App. A, Table 21 Lagrange manifold, conic 345.B Lagrange method (of describing the motion of a fluid) 205.A of indeterminate coefficients 106.L of variation of constants 252.0 Lagrange multiplier 46.B method of 106.L Lagrange partial differential equation App. A, Table 15.II Lagrange problem (in calculus of variations) 46.A Lagrange remainder App. A, Table 9.IV Lagrange resolvent 172.F Lagrange stable 126.E negatively 126.E positively 126.E Lagrange-stable motion 420.0 Lagrangian density 150.B free 150. B Lagrangian derivative 205.A Lagrangian function 271.F 292.A Lagrangian manifold, conic 274.C Lagrangian vector field 126.L Laguerre differential equation App. A, Tables 14.11 20. VI Laguerre formula, Gauss- (in numerical integration) 299.A Laguerre function App. A, Table 20.VI Laguerre geometry 76.B Laguerre inversion 76.B Laguerre polynomials 317.0, App. A, Table 20.VI associated 317.0 Laguerre transformation 76.B Lambert series 339.C Lame differential equation l33.B generalized 167.E Lame function of the first kind 133.B of the first species 133.C of the fourth species l33.C of the second kind 133.C of the second species 133.C 2030 of the third species 133.C lamellar vector field 442.0 laminar flow 205.E 433.A Lanczos method 298.0,E 301.N Landau constant 77.F Landau equation 146.C Landau-Nakanishi equation 146.C Landau-Nakanishi variety 146.C 386.C Landau symbols (0,0) 87.G Landau theorem 43.J Wiener-Ikehara 123.B Landau variety 146.C Landen transformation 134.B, App. A, Table 16.1II Lane-Emden function 291.F Langevin equation 45.1402.K language 31.0 276.A accepted by 31. 0 external 75.C machine 75.C Laplace, P. S. 239 Laplace-Beltrami operator 194.B Laplace differential equation 323.A, App. A, Table 15.I1I 2-dimensional case App. A, Table 15.VI 3-dimensional case App. A, Table 15.VI Laplace expansion theorem (on determinants) 103.0 Laplace-Mehler integral representation App. A, Table 18.II Laplace method 30.B Laplace operator 323.A 442.0 Laplace spherical functions 393.A Laplace-Stieltjes transform 240.A Laplace theorem, de Moivre- 250.B Laplace transform 240, App. A, Table 12.1 Fourier- 192.F Laplace transform and operational calculus App. A, Table 12 Laplacian 323.A 442.0, App. A, Table 3.II in the large 109 large deviation 250. B large inductive dimension (Ind) 117.B large numbers law of 250.B strong law of 250.C weak law of 395.B larger, stochastically 371.C larger topology 425.H large sample theory 401.E large semigroup algebra 29.C large sieve method 123.E largest nilpotent ideal (of a Lie algebra) 248.0 Lashnev space 425.CC last multiplier, Jacobi App. A, Table 14.1 last-out memory, first-in 96.E last theorem of Fermat 145 last theorem of Poincare 153.B latin rectangle 241.E Latin square(s) 102.K 241 lattice(s) (of a crystallographic group) 92.A (= lattice ordered set) 243 (ofa Lie group) 122.G (in R") 182.B A- 92.E anti-isomorphic 243.C Archimedean vector 310.C B- 92.E Banach 310.F 
2031 black and white 92.0 Boolean 42.A 243.E Boolean, of sets 243.E Bravais 92.B c- 92.E color 92.0 complemented 243.E complete 243.0 complete vector 310.C conditionally complete 243.0 conditionally O'-complete 243.0 critical, in M with respect to S 182.B distributive 243.E dual 243.C 310.E 450.K dually isomorphic 243.C F- 92.E !'I- (of a separable algebra) 27.A homogeneous (in R") ] 82. B integral !'I- 27.A 1- 92.E inhomogeneous (in R") 182.B modular 243.F normal !'I- 27.A normal vector 3 JO.F one-dimensiona] 287.A p- 92.E primitive 92.E quotient 243.C of sets 243.E O'-complete 243.0 O'-complete vector 3] O.C Toda 287.A 387.A vector 310. B weight 92.C lattice constants (of a lattice group) 92.C lattice distribution 341.0 lattice equivalent 92.0 lattice gauge theory 150.0 lattice group(s) 182.B (of a crystallographic group) 92.A lattice-homomorphism 243.C lattice-isomorphism 243.C lattice-ordered group 243.0 Archimedean 243.0 lattice-ordered linear space 310.B lattice-ordered set 243.A lattice-point formula 220.B lattice-point problems 242 lattice-spin systems 402.0 latus rectum 78.0,E Laurent expansion 198.0 Laurent series 339.A law(s) (of a solution of a stochastic differential equation) 406.0 absorption (in algebra of sets) 381.B absorption (in a lattice) 243.A of action and reaction 271.A adiabatic 205.B alternating (in a Lie algebra) 248.A antisymmetric (for ordering) 311.A arcsine (for Brownian motion) 45.E arcsine (for distribution function) 250.D arcsine (for random walk) 260.E associative (for addition) 294.B associative (in algebra of sets) 381.B associative (of correspondences) 358.B associative (of group composition) 190.A associative (in a lattice) 243.A associative (for multiplication) 294.B Subject Index Law(s) associative (in a ring) 368.A Blumenthal zero-one 261.B Brandt's 241.C cancellation (for addition) 294.B cancellation (in a commutative semigroup) 190.P cancellation (for multiplication) 294.B commutative (for addition) 294.B 368.A commutative (in algebra of sets) 381.B commutative (of group composition) 190.A commutative (in a lattice) 243.A commutative (for multiplication) 294.B 368.A complementary, of quadratic reciprocity of Jacobi symbol 297.1 complementary,ofreciprocity 14.0 of complementation (in a Boolean algebra) 42.A complete distributive (in a lattice-ordered group) 243.0 of composition 409.A of composition, external 409.A of composition, internal 409.A of cosines (on spherical triangles) 432.B, App. A, Table 2.m of cosines, first 432.A, App. A, Table 2.11 of cosines, second 432.A, App. A, Table 2.11 of cotangents App. A, Table 2.111 de Morgan's (in algebra of sets) 381.B de Morgan's (in a Boolean algebra) 42.A differential 107.A distributive (in algebra of sets) 381.B distributive (in a lattice) 243.E distributive (of natural numbers) 294.B distributive (in a ring) 368.A even-oddness conservation 150.0 of excluded middle 156.C 411.L explicit reciprocity (of norm-residue symbol) 14.R 257.H first complementary, of quadratic reciprocity of Legendre symbol 297.1 Oel'fand-Pyatetskii-Shapiro reciprocity (on unitary representations) 437.00 general associative (for group composition) 190.C Hewitt-Savage zero-one 342.0 Hooke's 271.0 idempotent (in a lattice) 243.A of inertia 27I.A of inertia, Sylvester (for a quadratic form) 348.C initial (for stochastic differential equation) 406.0 of iterated logarithm 45.F of iterated logarithm, Khinchin 250.C Joule's 130.B Kepler's first 271.B Kepler's second 271.B Kepler's third 271.B Kirchhoff 282.B Kolmogorov zero-one 342.0 of large numbers 250.B 395.B of large numbers, strong 250.C Maxwell-Boltzmann distribution 402.B modular (in a lattice) 243.F of motion 27l.A of motion, Newton's three 271.A Newton (on frictional stresses) 205.C Newton's first 27l.A Newton's second 271.A Newton's third 271.A 
Subject Index Lax equivalence theorem Ohm 130.B 259 of quadratic reciprocity of Jacobi symbol 297.1 of quadratic reciprocity of Legendre symbol 297.1 of reaction 271.A of reciprocity 14.0297.1 of reciprocity, Artin general 59.C of reciprocity, general 14.0 reciprocity, of Shafarevich 257.H reflexive (for an equivalence relation) 135.A reflexive (for ordering) 311.A second complementary, of quadratic reciprocity of Legendre symbol 297.1 of similarity, Prandtl-Olauert 205.0 of similarity, Reynolds 205.C of similitude 116 of sines 432.A, App. A, Table 2.11 of sines (for spherical triangles) 432.B, App. A, Table 2.m of sines and cosines App. A, Table 2.1II of small numbers 250.B symmetric (for an equivalence relation) 135.A of symmetry (for the Hilbert norm-residue symbol) 14.R of tangents App. A, Table 2.m transitive (for an equivalence relation) 135.A transitive (for ordering) 311.A of universal gravitation 271.B zero-one 342.0 Lax equivalence theorem 304.F Lax representation 287.B 387.C Lax-Wendroff scheme 304.F layout, two-way 155.H layer boundary 205.C potential of double 338.A potential of single 338.A LBA problem 31.F L.B. (loosely Bernoulli) 136.F LCL (lower control limit) 404.B L.C.M. (least common multiple) 67.H 297.A leaf (leaves) (of a foliation) 154.B compact 154.0 growth of 154.H leaf topology 154.0 learning model 346.0 least action, principle of 441.B least common multiple 67.H 297.A least element (in an ordered set) 311.B least favorable distribution 400.B least favorable a priori distribution 398.H least square approximation 336.0 least squares, method of (for estimation) 403.E (for higher-dimensional data) 397.1 (for numerical solution) 303.1 least squares estimator 403.E generalized 403.E least squares method indirect (in econometrics) 128.C three-stage 128. C two-stage 128.C least squares problem, linear 302.E least upper bound (ordered sets) 31O.C 311.B Lebesgue, H. L. 244 Lebesgue area (of a surface) 246.C Lebesgue convergence theorem 221.C Lebesgue decomposition theorem 270.L 380.C 2032 Lebesgue density theorem 100.B Lebesgue dimension 117.B Lebesgue extension 270.0 Lebesgue integrable 221.B Lebesgue integral 221.B Lebesgue measurability and the Baire property 33.F Lebesgue measurable (set) 270.0 Lebesgue measurable function 270.1 Lebesgue measure 270.0 generalized 270.E Lebesgue measure space (with a finite O'-finite measure) 136.A Lebesgue method of summation 379.S Lebesgue number 273.F Lebesgue outer measure 270.0 Lebesgue-Radon integral 94.C Lebesgue spectrum, countable 136.E Lebesgue-Stieltjes integral 94.C 166.C Lebesgue-Stieltjes measure 166.C 270.L Lebesgue test (on the convergence of Fourier series) 159.B Lebesgue theorem (on the dimension of R") 117.0 Borel- 273.H Cantor- 159.1 Riemann- 159.A 160.A Le Cam theorem 399.K Lefschetz duality theorem, Poincare- 201.0 Lefschetz fixed-point formula 450.Q Lefschetz fixed-point theorem 153.B Lefschetz formula, Picard- 418.F Lefschetz number (of a continuous mapping) 153.B (of a variety) 16.P Lefschetz pencil 16. U Lefschetz theorem strong 16.U weak 16.U Lefschetz transformation, Picard- 16.U left, limit on the 87.F left A-module 277.0 left adjoint functor 52.K left adjoint linear mapping 256.Q left annihilator 29.H left Artinian ring 368. F left balanced functor 200.1 left continuous 84.B left coset 190.C left coset space 423.E left decomposition, Peirce (in a unitary ring) 368.F left derivative 106.A left derived functor 200.l,Q left differentiable 106.A left distributive law 312.C left endpoint (of an interval) 355.C left exact functor 200.1 left G-set 362.B left global dimension (of a ring) 200.K left hereditary ring 200.K left ideal 368.F integral 27.A left invariant Haar measure 225.C left invariant metric (of a topological group) 423.1 left invariant tensor field (on a Lie group) 249.A left inverse element (in a ring) 368.B left linear space 256.A left Noetherian ring 368.F left D,-ideal 27.A left operation 409.A 
2033 left order (of a g-lattice) 27.A left parametrix 345.A left projective resolution (of an A-module) 200.C left projective space 343.F left quotient space (of a topological group) 423.E left regular representation (of an algebra) 362.C (of a group) 362.B left resolution (of an A-module) 200.C left satellite 200.1 left semihereditary ring 200.K left semi-integral 68.N left shunt 115.B left singular point (of a diffusion process) 115.B left translation 249.A 362.B left uniformity (of a topological group) 423.0 Legendre associated differential equation 393.A Legendre coefficient 393.B Legendre differential equation 393.B, App. A, Table 14.II Legendre function 393.B, App. A, Table 18.11 associated App. A, Table 18.m associated (of the first kind) 393.C, App. A, Table 18.m associated (of the second kind) 393.C, App. A, Table 18.III of the first kind 393.B, App. A, Table 18.II of the second kind 393.B, App. A, Table 18.II Legendre-Jacobi standard form 134.A, App. A, Table 16.1 Legendre polynomial 393.B, App. A, Table 18.11 Legendre relation 134.F, App. A, Table 16.1 Legendre symbol 297.H first complementary law of reciprocity of 297.1 law of quadratic reciprocity of 297.1 second complementary law of reciprocity of 297.1 Legendre transform 419.C Legendre transformation (contact transform) 82.A, App. A, Table 15.1V Lehmann representation, Kallen- 150.0 Lehmann-Scheffe theorem 399.C Lehmann-Stein theorem 400.B Lehmann theorem 371.C Hodges- 399.E,H Lehmann weight, Kiillen- 150.0 Lehmer method 301.K Leibler (K-L) information number, Kullback- 398.0 Leibniz, O. W. 245 Leibniz formula (in differentiation) 106.0, App. A, Table 9.II1 Leibniz test (for convergence) 379.C lemniscate 93.H Bernoulli 93.H length 246 (of a broken line) 139.F (of a curve) 93.F246.A (of a descending chain in a lattice) 243.F (of a module) 277.1 (of a multi-index) 112.A (of a normal chain in a group) 190.0 (of a path) 186.F (of a segment) 139.C (of a Witt vector) 449.B affine llO.C affine arc 11 O.C extremal (of a family of curves) 143.A extremal, defined by Hersch-Pfluger 143.A extremal, with weight 143.B Subject Index Lie algebra(s) of finite (module) 277.1 focal 180.B n- (of a group) 151.F queue 260.H wave 446 lens, Luneburg's 180.A lens space 91.C infinite 91.C Leopoldt's conjecture 450.1 leptons 132.B Leray-Hirsch theorem 20t.J Leray-Schauder degree 286.0 Leray-Schauder fixed-point theorem 286.0 323.0 letter (in information theory) 63.A 213.B ( = varia ble) 369.A level (of a factor) 102.H (of a modular form) 32.C (of a modular function) 32.C (of an orthogonal array) 102.L (of a principal congruence subgroup) 122.0 (of a test) 400.A (of a tolerance region) 399.R average outgoing quality 404.C confidence 399.Q level ex test 400.A minimax 400.F most stringent 400.F unbiased 400.C uniformly most powerful (UMP) invariant 400.E uniformly most powerful (UMP) unbiased 400.C level n structure (on an Abelian variety) 3.N level set (of a COO-function) 279.0 level surface 193.1 Levi-Civita, parallel in the sense of I11.H Levi-Civita connection 364.B Levi condition 321.0 325.H Levi decomposition (on algebraic groups) 13.Q (on Lie algebras) 248.F Levi form 344.A generalized 274.0 Levi problem 2l.I,F Levi pseudoconvex domain 21.1 locally 2l.I Levi subgroup 13.Q Levitan-Marchenko equation, Oel'fand- (for KdV equations) 387.0 (for a nonlinear lattice) 287.C Levy canonical form 341.0 Levy continuity theorem 341.F Levy distance 341.F Levy-It6 theorem (on Levy processes) 5.E Levy measure 5.E Levy process 5.B Levy theorem, Wiener- 159.1 Lewy-Mizohata equation 274.G lexicographic linear ordering 248.M lexicographic ordering 311.0 liability reserve 214.B Lie, M. S. 247 Lie algebra(s) 248, App. A, Table 5.1 (of an algebraic group) 13.C (ofa Lie group) 249.B Abelian 248.C adjoint 248.B algebraic 13.C 
Subject Index Lie derivative classical compact real simple 248.T classical complex simple 248.S compact real 248.P compact real simple App. A, Table 5.1 complex 248.A complex (of a complex Lie group) 249.M complex simple App. A, Table 5.1 of derivations 248.H exceptional compact real simple 248.T exceptional complex simple 248.S general linear 248.A isomorphic 248.A nilpotent 248.C noncompact real simple App. A, Table 5.1I quotient 248.A real 248.A reductive 248.0 restricted 248.V semisimple 248.E simple 248.E solvable 248.C Lie derivative 105.0,Q Lie fundamental theorem (on a local Lie group of local transformations) 43\.0 Lie group(s) 249423.M Abelian 249.0 Banach 286.K classical compact simple 249.L classical complex simple 249.M commutative 249.0 complex 249.A direct product of 249.H exceptional compact simple 249.L exceptional complex simple 249.M isomorphic 249.N local 423.L local (oflocal transformations) 431.0 nilpotent 249.0 quotient 249.0 semisimple 249.0 simple 249.0 simply connected covering 249.C solvable 249.0 topology of, and homogeneous spaces 427 Lie-Kolchin theorem (on solvable algebraic groups) 13.F Lie line-sphere transformation 76.C Lie minimal projection 76.B Lienard differential equation 290.C lies over (of a compactification) 207.B Lie subalgebra 248.A associated with a Lie subgroup 249.0 Lie subgroup (of a Lie group) 249.0 connected 249.0 Lie theorem (on Lie algebras) 248.F Lie transformation (in circle geometry) 76.C Lie transformation group (of a differentiable mani- fold) 431.C lifetime 260.A 261.B (of a particle by a scattering) 132.A lift (along a curve in a covering surface) 367.B (of a differentiable curve) 80.C (of a vector field) 80.C inflation 200.M lifting (in nonstandard analysis) 293.0 lifting theorem 251.M light cone 258.A Lighthill-Kuo (P.L.K.) method, Poincare- 25.B 2034 Lighthill method 25.B lightlike 258.A 359.B likelihood 374.1 likelihood equation 399.M likelihood estimating function 399.M likelihood estimator, maximum 399.M likelihood function 374.J 399.M likelihood method, maximum 399.M likelihood ratio 400.1 monotone 374.1 likelihood ratio test 400.1 limac;on of Pascal 93.H liminal C*-algebra 36.H limit (of a function) 87.F (of an indeterminate form) 106.E (of a mapping) 87.F (of a net) 87.H (of a sequence of lattices) 182.B (of a sequence of points) 87.E 273.0 (of a sequence of real numbers) 87.B 355.B (of a sequence of sets) 270.C (of a spectral sequence) 200.1 Banach 37.F confidence 399.Q direct (of a direct system) 21O.B elastic 271.0 generalized 37.F inductive (in a category) 210.0 inductive (group) 210.C inductive (of an inductive system) 210.B inductive (of sheaves) 383.1 inductive (space) 21O.C inductive (of topological spaces) 425.M inferior (event) 342.B inferior (of a sequence ofreal numbers) 87.C inferior (of a sequence of sets) 270.C inverse (of an inverse system) 21O.B on the left (of a real-valued function) 87.F lower (function) 84.C lower (of a Riemann integral) 216.A lower (of a sequence of real numbers) 87.C lower control 404.B in the mean 168.B order (of an order convergent sequence) 310.C projective (in a category) 210.0 projective (of a family of continuous homomor- phisms) 423.K projective (group) 210.C projective (of a projective system) 210.B projective (space) 210.C on the right (of a real-valued function) 87.F strictly inductive (of a sequence oflocally convex spaces) 424.W superior (event) 342.B superior (of a seque!1ce of real numbers) 87.C superior (of a sequence of sets) 270.C thermodynamic 402.0 tolerance 399.R upper (function) 84.C upper (of a Riemann integral 216.A upper (of a sequence of real numbers) 87.C upper control 404.B limit circle type (boundary point) 112.1 limit cycle 126.1 limit distribution 250.A limit formula, Kronecker's 450.B limited information maximum likelihood method 128.C limit inferior 
2035 (of a sequence of real numbers) 87.C (of a sequence of sets) 270.C limiting absorption principle 375.C limiting hypersphere (in hyperbolic geometry) 285.C limit ordinal number 312.B limit point (of a discontinuous group) 122.C (of a sequence) 87.B,E (1- 126.0 negative 126.0 ill- 126.0 positive 126.0 limit point type (boundary point) 112.1 limit &et 234.A 126.D first negative prolongational 126.0 first positive prolongational 126.0 ill- 126.0 residual 234.E limit superior (of a sequence of real numbers) 87.C (of a sequence of sets) 270.C limit theorem(s) 250.A basic 260.C central 250.B local 250.B in probability theory 250 limit value (of a mapping) 87.F Lindeberg condition 250.B Lindel6f asymptotic value theorem 43.C Lindel6f hypothesis 123.C Lindel6f space 425.S Lindel6f theorem 43.F Ph ragmen- 43.C Lindemann- Weierstrass theorem 430.0 Lindstedt-Poincare method 290.E line(s) 7.A 93.A 155.B (of a graph) 186.B broken 155.F complexes, linear 343.E complex of 11O.B concurrent (in projective geometry) 343.B congruence of 110.B congruences. linear 343.E of curvature (on a surface) 11 I.H offorce 193.1 generating (of a circular cone) 78.A generating (of a quadric hypersurface) 343.E generating (of a quadric surface) 350.B generating (of a ruled surface) ll1.I geodesic 178.H Green 193.1 Green, regular 193.1 half- 155.B long 105.B normal (to a curve) 93.G Pascal 78.K pencil of (in a projective plane) 343.13 projective 343.B real 355.E regression 403.0 of regression 111 .F,! straight 93.A 155.B stream 205.B supporting (function) 89.C supporting (of an oval) 89.C of swiftest descent 93.H tangent 93 G 11 I.C,F Subject Index Linearly dependent tangent, oriented 76.B vector (of a vector field) 442.0 vortex 205.B linear algebra 8 linear algebraic group B.A linear boundary operators, 315.B linear bounded automation deterministic 31.0 nondeterministic 31.0 linear code 63.C linear combination 256.C of ovals 89.0 linear connection 80.H linear difference equation 104.C linear differential equations, system of (of the first order) 252.G linear discriminant function 280.1 linear dynamical system 86.B linear equation(s) 10.0 16.M 269.M linear equivalence class (of divisors) 16.M linear extension (of a rational mapping to an Abelian variety) 9.E linear fiber mapping (map) 114.0 linear filter 405.F linear form 256.B 277.E linear fractional function 74.E linear fractional group 60.B linear fractional programming 264.0 linear fractional transformation 74.E linear function 74.E linear functional 37.C 197.F 424.B algebraic 424.B linear fundamental figure (of a projective space) 343.B linear genus 15.G linear graph 282.A linear group Abelian (over K) 60.L full 60.B general 60.B 256.0 general (of degree n over K) 60.B 226.B general (over a noncommutative field) 60.0 p rojecti ve general 60. B projective general (of degree n over K) 60.B projective special 60.B projective special (over a noncommutative field) 60.0 special 60.B special (of degree n over K) 60.B special (over a noncom mutative field) 60.0 linear holonomy 154.C linear homogeneous equations, system of 269.M linear homotopy 114.0 linear hypothesis, general 400.H linear integral equation 217.A linear isotropy group (at a point) 199.A linearized operator 183 286.E linearized stability, principle of 286.S linear k-step method 303.E linear least squares problem 302.E linear Lie algebra, general 248.A linear line complex 343.E linear line congruence 343.E linear logistic model 403.C linearly compact 422.L locally 422.L linearly connected homogeneous space 199.A linearly dependent (with respect to a difference equation) 104.0 
Subject Index Linearly dependent elements linearly dependent elements (in an additive group) 2.E (in a linear space) 256.C linearly disjoint (fields) 149.K linearly equivalent (divisors) 16.M 172.F 0- (on an algebraic curve) 9.F linearly estimable parameter 403.E linearly independent elements (in an A-module) 277.0 (in an additive group) 2.E (in a linear space) 256.C linearly independent family (of elements in a linear space) 256.E . linearly ordered set 311.A linearly reductive 226.B linearly representable 66.H linear mapping 70.C 256.B A- (of an A-module) 277.E piecewise 70.C linear model 403.0 log 403.C multivariate 280.B normal 403.C linear multistep method 303.E linear network 282.C linear operator(s) 37.C 251 (between linear spaces) 256.B bounded 37.C linear ordering 311.A lexicographic 248.M theorem of 155.B linear ordinary differential equation(s) 252253 254 313.A with constant coefficients App. A, Table 14.1 of the first order App. A, Table 14.1 of higher order App. A, Table 14.1 linear parameter 102.A linear partial differential equation 320.A linear pencil 16.N linear prediction theory 395.0 linear predictor 395.0 optimal 395.0 linear programming 255264.C linear programming problem 255.A linear recurrent sequence 295.A linear regression 397.1 linear regression function 397.1 403.0 linear representation (of an algebra) 362.C (of a group) 362.C (of a Lie algebra) 248.B associated with representation module 362.C completely reducible 362.C direct sum of 362.C equivalent 362.C faithful 362.C homomorphism of 362.C indecomposable 362.C irreducible 362.C isomorphic 362.C reciprocal (of an algebra) 362.C reducible 362.C semisimple 362.C similar 362.C simple 362.C tensor product of 362.C linear simple group 15l.I linear space(s) 256 category of (over a ring) 52.B complex 256.A 2036 dual 256.0 over a field 256.A finite-dimensional 256.C Hermitian 256.Q infinite-dimensional 256.C lattice-ordered 310.B left 256.A normed 37.B ordered 310.B quasinormed 37.0 quotient 256.F real 256.A right 256.A self-dual 256.H linear stationary iterative process 302.C linear structural equation system 128.C linear structure 96.C linear su bspace closed 197.E of a linear space 256.F linear system (of divisors) 15.C 16.N (of functional-differential equations) 163.E ample 16.N characteristic (of an algebraic family) 15.F complete 9.C 16.N complete (defined by a divisor) 16.N irreducible 16.N reducible 16.N very ample 16.N linear time-invariant (dynamical system) 86.B linear time-varying system 86.B linear topological space 424.A linear topology 422.L linear transformation (on a Banach space) 251.A (on a linear space) 256.B (on a Riemann sphere) 74.E (of series) 379.L entire 74.E semisimple 256.P triangular 379.L linear unbiased estimator, best 403.E linear variety 422.L linearly compact 422.L line bundle 147.F complex 72.F complex (determined by a divisor) 72.F tautological 16.P line coordinates (of a line) 343.C line element I11.C characteristic 82.C of higher order, space of 152.C projective 110.B line-sphere transformation, Lie 76.C linguistics, mathematical 75.E link 235.0 framed 114.L linkage invariant, covering 235.E link group 235.0 linking number 99.C link polynomial 235.0 reduced 235.0 link type 235.0 Linnik's constant 123.0 Liouville formula 252.C Liouville number 430.B Liouville operator, Sturm- 112.1 Liouville problem, Sturm- 315.B Liouville theorem 
2037 on bounded entire functions 272.A first 134. E fourth 134.E on integral invariants 219.A second 134.E third 134.E Lip (1 (Lipschitz) 84.A of order (1 84.A Lippman-Schwinger equation 375.C Lipschitz condition 84.A 163.0 286.B 316.0 of order (1 84.A Lipschitz-Killing curvature 279.C Lipschitz space 168.B Lipschitz test, Oini- (on the convergence of Fourier series) 159.B list (representation) 96.D 186,D little group 258.C Littlewood-Paley theory 168.B Littlewood-Sobolev inequality, Hardy- 224.E Littlewood supremum theorem, Hardy- App. A, Table 8 Littlewood theorem, Hardy- (on bounded functions) 43.E (on trigonometric systems) 317.B lituus 93.H Livesay invariant, Browder- 114.L loading 214.A factor 280.0 346.F Lobachevskiii non-Euclidean geometry 285.A local base (in a topological space) 425.E local canonical parameter (for power series) 339.A local class field theory 59.0 local cohomology group 125.W local concept (in differential geometry) 109 local continuity 45.F local control 102.A local coordinates (on an algebraic variety) 16.0 (on a differentiable manifold) 105.C transformation of 90.0 local coordinate system 90.0 105.C holomorphic 72.A local cross section (in a topological group) 147.E local degree of a mapping 99. B local dimension (of an analytic set at a point) 23.B local equation (of a divisor) 16.M regular (at an integral point) 428.E local ergodic theorem 136.B local field 257.A local Oaussian sum 450.F local homology group 201.N local homomorphism (of a topological group) 423.0 local isomorphism (of topological groups) 423.0 localization of a linear representation relative to a prime ideal 362.F principle of (on convergence tests of Fourier series) 159.B strict 16.AA local Lie group 423.L (of local transformations) 431.0 local limit theorem 250.B locally (on a topological space) 425.1 locally absolutely p-valent (function) 438.E locally arcwise connected (space) 79.B locally Caftan pseudoconvex (domain) 21.1 locally closed (set) 425.1 Subject Index Local property locally compact space 425.V uniformly 425.V locally connected (space) 79.A locally constructible (constant sheaf) 16.AA locally contractible (at a point) 79.C (space) 79.C 202.0 locally convex (topological linear space) 424.E locally convex Frechet space 424.1 locally countable cell complex 70.0 locally countable simplicial complex 70.C locally dense 154.0 locally equicontinuous semigroup 378.F locally equivalent (G-structure) 191.H locally Euclidean group 423.M locally Euc!idean space 425.V locally finite (algebra) 29.1 (cell complex) 70.0 (covering) 425.R (simplicial complex) 70.C 0'- (covering) 425.R locally flat (connection) 80.E (injection between topological manifolds) 65.0 (PL embedding) 65.0 (Riemannian manifold) 364.E locally integrable function 168.B locally isomorphic (topological groups) 423.0 locally Levi pseudo convex (domain) 21.1 locally linearly compact (n-module) 422.L locally Macaulay ring 284.0 locally n-connected (at a point) 79.C (space) 79.C locally Noetherian (scheme) 16.0 locally Noetherian formal scheme 16,X locally of finite type (for a morphism) 16.0 locally w-connected (space) 79.C locally p-valent 438.E locally quadratic transformation (of an algebraic surface) 15.0 (of an algebraic variety) 16.K (of a complex manifold) n.H locally rectifiable (curve) 143.A 246.A locally symmetric Riemannian space 412.A, App. A, Table 4.II locally symmetric space 364.0 affine 80.1 locally symmetrizable (diffusion processes) 115.0 locally totally bounded (uniform space) 436.H locally trivial fiber space 148. B locally uniformized 367.C local martingale 262.E local maximum modulus principle 164.C local moduli space (of a compact complex manifold) 72.0 local one-parameter group of local transformations 105.N local operator 125.00 local orientation (in an oriented manifold) 201.N local parameter (around a cusp of a Fuchsian group) 32.B (of a nonsingular algebraic curve) 9.C (of a Riemann surface) 367.A local problem (on the solutions of differential equations) 289.A local property (in differential geometry) 109 
Subject Index Local regime (of a pseudodifferential operator) 345.A micro-pseudo- 345.A pseudo- (of a pseudodifferential operator) 345.A local regime (in static model in catastrophe theory) 51.B local ring 284.0 (of a prime ideal) 67.0 (of a subvariety) 16.B analytically normal 284.0 analytically un ramified 284.0 complete 284.0 Macaulay 284.0 Noetherian 284.0 Noetherian scmi- 284.0 quasi- 284.0 quasisemi- 284.0 regular 284.0 semi- 284.0 structure theorem of complete 284.0 local-ringed space 383.H local section 126.E local strategy 173.B local systcm of groups (over a topological space) 201.R local time 45.0 local transformations, local Lie group of 431.0 local truncation error 303.E local uniformizing parameter 367.A location parameter 396.1400.E locus cut 178.A singular (of a variety) 16.F Locb measure 293.0 Loeb space 293.0 logx 131.0 logax 131.B Logz (logarithm) 131.0 logarithm 131.B common 131.C integral 167.0 Khinchin law of the iterated 250.C law of itcrated 45.F Napierian 131.0 natural 131.0 logarithmically convex 21.B logarithmic branch point (of a Riemann surface) 367.B logarithmic capacity 48.B logarithmic criterion App. A, Table 10.II logarithmic curve 93.H logarithmic decrement (of a damped oscillation) 318.B logarithmic differentiation App. A, Table 9.1 logarithmic distribution App. A, Table 22 logarithmic function to the base a 131.B logarithmic integral 167.0, App. A, Table 19.II logarithmic normal distribution App. A, Table 22 logarithmic paper 19.F semi- 19.F logarithmic potential 338.A logarithmic series 131.0 logarithmic singularity (of an analytic function) 198.M (of an analytic function in the wider sense) 198.P logarithmic spiral 93.H logic algebra of 411.A 2038 classical 411.L intUitlOfilstic 411.L many-valued 411.L mathematical 411.A modal 411.L predicate 411.0 predicate, with equality 411.J propositional 411.E quantum 351.L symbolic 411.A three-valued 411.L two-valued 411.L logical axiom 411.1 logical choice function, transfinite 41 \.J logical operator 411.E logical product (of propositions) 411.B logical sum (of propositions) 411.B logical symbol 411.B logicism 156.A,B logistic equation 263.A logistic model, linear 403.C log linear model 403.C logmodular algebra 164.B Lommel integral 39.C Lommel polynomials App. A, Table 19.IV long gravity wave 205.F longitude (of a knot) 235.B longitudinal wave 446 long line 105.B long water wave 205.F look-up, table 96.C Looman-Men'shov theorem 198.A loop 170190.P self- 186.B loop space 202.C loop theorem (on 3-manifolds) 65.E loosely Bernoulli 136.F Lopatinskii condition, Shapiro- 323.H Lopatinskii dterminant 325.K Lorentz condition 130.A Lorentz force 130.A Lorentz group 60.1 258.A 359.B full homogeneous 258.A full inhomogeneous 258.A homogeneous 359.B inhomogeneous 359.B proper 60.1 359.B proper complex 258.A Lorentz in variance 150.B Lorentz space 168.B Lorentz transformation 359.B Lorenz curve 397.E loss heat 419.A of information 138.B loss function 398.A quadratic 398.A 399.E simple 398.A lot tolerance percent defective 404.C Lowenheim theorem, Skolem- 156.E lower bound 311.B greatest 31O.C 311.B lower central series (of a group) 190.1 lower class with respect to local continuity 45.F with respect to uniform continuity 45.F lower control limit 404.B lower derivative general (of a set function) 380.0 ordinary (of a set function) 380.0 't 
2039 lower end (of a curvilinear integral) 94.D lower envelopc principle 338.M lowering the superscripts 417.0 lower integral, Riemann 2]6.A lower limit (of a Riemann integral) 216.A (of a sequence of real numbers) 87.C lower limit function 84.C lower order (for infinity) 87.0 (of a meromorphic function) 272.C lower semicontinuity (of length) 264.A lower semicontinuous (at a point) 84.C lower semicontinuous function 84.C lower semilattice 243.A lower triangular matrix 269.B lower variation (of a set function) 380.B Lowner differential equation 438.B loxodromic transformation 74.F LP (linear programming) 255 LSZ asymptotic condition 150.0 l.u.b. (least upper bound) 311.B Lubanski vector, Pauli- 258.0 Luenberger observer 86.E lumping, mass 304.0 Luneburg lens 180.A Liiroth theorem 16.J Lutz-Mattuck theorem 118.E Luzin first principle (in analytic set theory) 22.C Luzin second principlc (in analytic set theory) 22.C Luzin space 22.1 425.CC Luzin theorem 270.1 Denjoy- 159.1 Luzin unicity theorem (in analytic set theory) 22.C Lyapunov characteristic number 314.A Lyapunov condition 250.B Lyapunov convexity theorem 443.0 Lyapunov function 126.F 163.0 394.C Lyapunov-Schmidt procedure 286. V Lyapunov stable 126.F in both directions 394.A in the negative direction 394.A in the positive direction 394.A uniformly 126.F Lyapunov theorem 398.C Lyusternik- Shnirel'man theory 286.Q M M(n) (the set of all essentially bounded measurable fuctions on n) 168.B [Mp}, ultradistribution of class 125.U (M p), ultradistribution of class 125. U l1-absolutely continuous (additive set function) 380.C l1-completion 270.0 l1-conformal function 352.B l1-constant stratum 418.E l1-integrab]e 22].B l1-measurable 270.0 l1-null set 370.0 l1-operator, bounded 356.B l1-singular (additive set function) 380.C m-dissipative 251.J m x n matrix 269.A mth root IO.C M-estimator 371.H M-port network 282.C M-set 159.J M-space 425.Y Subject Index Manifold(s) (Mj-space (=Montel space) 424.0 M space, abstract 310.0 M waves 130.B Macaulay local ring 284.0 Macaulay ring 284.0 locally 284.0 Mach cone 205.B machine Turing 31.B universal Turing 31.C machine-language program 75.C machine scheduling problem 376 machine sequencing problem 376 Machin formula 322 Mach number 116.B 205.B Mach wave 205.B Mackey-Arens theorem 424.N Mackey space 424.N Mackey theorem 424.M Mackey topology 424.N Mack method, Oarside-Jarratt- 30l.N MacLane complexes, Eilenberg- 70.F MacLane space, Eilenberg- 70.F MacLane spectrum, Eilenberg- 202.1 Maclaurin formula, Euler- 379.J macroeconomic data 128.A macroscopic causality (of S-matrix) 386.C magnetic field 130.A magnetic flux density 130.A magnetic group 92.0 magnetic induction 130.A magnetic quantum number, orbital 351. E magnetic permeability 130.B magnetic polarization 130.A magnetic Reyno]ds number 259 magnetic susceptibility 130.B magentic viscosity 259 magnetic wave 130.B transverse 130. B magnetofluid dynamics 259 magneto hydrodynamics 259 magnetostatics 130.B magnitude (of a vector) 442.B Mahalanobis generalized distance 280.E Mainardi equations, Codazzi- II1.H, App. A, Table 4.1 main classes 241.A main effect 102.H main theorem (in class field theory) 59.C Zariski's 16.1 majorant (of a sequence of functions) 435.A harmonic (of a subharmonic function) 193.S method of 316.0 majorant series 316.0 435.A major arc 4.B major axis (of an ellipse) 78.C major function WO.F majorizing function, right 316.E Malfatti problem (in geometric construction) 179.A Malgrange theorem, Ehrenpreis- 112.B Mal'tsev-Iwasawa theorem, Cart an- (on maximal compact subgroups) 249.S Mal'tsev theorem, Wedderburn- (on algebras) 29.F Malus theorem 180.A Mandelstam representation 132.C Mangoldt function 123.B manifold(s) almost complex 72.B 
Subject Index Manin connection, Gauss- (of a variety) almost contact IIO.E almost parallelizable 114.1 analytic  analytic manifold Banach 105.Z 286.K Blaschke manifold 178.0 with boundary 105.B without boundary 105.B C- 105.0 C-, with boundary 105.E C-, without boundary 105.E center, theorem 286. V characteristic (ofa partial differential equation) 320.B characteristic classes of 56.F closed 105.B coherently oriented pseudo- 65.B combinatorial 65.C compact C'- 105.0 complex  complcx manifold(s) complex analytic 72.A conic Lagrange 345.B conic Lagrangian 274.C contact 110.E covering 91.A covering differentiable 91.A differentiable, with boundary of class C 105. E differentiable, of class C 105.0 with Euclidean connection 109 fibered 428.F Finsler 286.L flag 199.B Frechet 286.K G- 431.C Orassmann  Orassmann manifold group (of a Lie transformation) 110.A with a handle attached by f 114.F h-cobordant oriented 114.1 Hilbert 105.Z 286.K Hodge 232.0 homology 65.B Hopf 232.E hyperbolic 21.0 235.E integral 428.A,B,0 irreducible 3- 65.E Kahler 232 k-dimensional integral 191.1 nontrivial 3- 65.E ordinary integral (of a differential ideal) 428.E orientable (C-manifold) 105.F orientation 201.N oriented 105.F 201.N oriented G- 431.E paracompact C'- 105.0 parallelizable 114.1 n- 114.1 PL- 65.C Poincare 105.A at a point 178.0 prime 3- 65.E proper flag 199.B pseudo- 65.B pseudo-Hermitian 344.F Q- 382.0 real analytic 105.0 rcgular integral (of a differential ideal) 428.E Riemannian  Riemannian manifold singular integral (of a differential ideal) 428.E SCP- 178.0 smooth 105.0114.B space-time 359.0 2040 s-parallelizable 114.1 stable 126.0,1 stably almost complex 114.H stably parallelizable 114.1 Stein 21.L Stiefel  Stiefel manifold symplectic 219.C topological 105.B triangulated 65.B unstable 126.0,1 visibility 178.F weakly almost complex 114.H weakly I-complete 21.L Manin connection, Oauss- (of a variety) 16.V Mannheim curve II1.F .1ann",\Vhitney V-test 371.C MANOV A (multivariate analysis of variance) 280.B mantissa (of the common logarithm) 131.C many body problem 402.F 420.A many-valued (analytic function) 198.J many-valued function 165.B many-valued logic 411.L map 381.C (also  mapping) bundle 147.B covering 91.A cubic 157.B equivariant 431.A first-return 126.C G- 431.A Oauss 111.0 Kodaira-Spencer 72.0 linear fiber 114.0 normal 114.1 PL 65.A Poincare 126.C time-one 126.C trivalent 157.B mapping 381.C A-balanced 277.1 affine 7. E alternating multilinear 256.H analytic 21.J antiholomorphic 195.B antisymmetric multilinear 256.H biadditive 277.J biholomorphic 21.J bijective 381.C bilinear 256.H 277.J birational 16.1 biregular (between prealgebraic varieties) 16.C Borel isomorphic 270.C of bounded variation 246.H bundle 147.B C - 105.J C l - 237.0 CE 382.0 cel\ular (between cell complexes) 70.0 chain 200.C chain (between chain complexes) 201.B characteristic (in the classification theorem of fiber bundles) 147.0 class 202.B of class C 208.B classifying 147.0 closed 425.0 cochain 200.f 201.H complete 241.C conformal 198.A 
2041 conjugation (of a Hopf algebra) 203.E constant 381. C continuous 425 G covering 91.A degenerate 208.B degree 99.A degree of 99.A diagonal (of a graded coalgebra) 203.B,F differentiable, of class C' 105.1 dual (of a linear mapping) 256.G duality 251.J equivariant 431.A essential 202.B exponential 178.A 249.Q 364.C extremal horizontal slit 367.G extremal quasiconformal 352.C extremal vertical slit 367.G first-return 126.C Fredholm 286.E G- 362.B 431.A Gauss (in geometric optics) 180.B generalized conformal 246.1 of group algebra 192.Q harmonic 195.B hereditarily quotient 425.G holomorphic 21.J 72.A homological 200.C homotopy-associative 203.0 Hopf 147.E identity 381.C inclusion 381.C inverse 381.C inverse, theorem 208.B isometric II1.H 273.B Kodaira-Spencer 72.G linear (between linear spaces) 256.B Jinear (between polyhedrons) 70.C linear fiber 114.0 local degree of 99.B meromorphic 23.0 monotone 311.E muJtilinear 256.H nondegenerate holomorphic (between analytic spaces) 23.C nonexpansive 286.B nonsingular, of class C l 208.B normal 114.1 normal coordinate 364.C one-to-one 381.C onto 381.C open 425.G order-preserving 311.E orientation-pres\(rving 99.A orientation-reversing 99.A partial (of a mapping) 381.C perfect 425.W perspective (in projective geometry) 343.B piecewise affine 192. Q piecewise linear (between polyhedra) 70.C PL 65.A Poincare 126.C,G product 425.K projective (in projective geometry) 343.B proper 425.W purely inseparable rational 16.1 quasiconformal 352.B quasi perfect 425.CC quotient 425.G rational 16.1 regular (between prealgebraic varieties) 16.C Subject Index Markov subshift regular, of class C l 208.B semicontinuous (in a topological linear space) 153.0 semi linear 256.P 277.L separable (rational) 16.1 simplicial 70.C simplicial (between polyhedra) 70.C simplicial (relative to triangulations) 70.C skew-symmetric multilinear 256.H space 202.C space of continuous 435.0 spin 237.G s.s. (semisimplicial ) (between s.s. complexes) 70.E s.s., realization of 70.E surjective 381.C symmetric multilinear 256.H Teichmiiller 352.C time-one 126.C topological 425.G topology induced by a 425.1 transposed (of a diffusion kernel) 338.N transposed (of a linear mapping) 256.G uniformly continuous 273.1 436.E unit 203.F mapping chain 201.B mapping class 202.B mapping cone 202.E reduced 202.F mapping cylinder 202.E mapping space 435.0 mapping theorem Brouwer 99.A open 37.1 424.X Riemann 77.B spectral 251.G mapping truck 202.G Marchenko equation, Gel'fand-Levitan- (for KdV equations) 387.0 (for a nonlinear lattice) 287.C Marcinkiewicz theorem 224.E marginal density functions 397.1 marginal distribution 342.C 397.H marked K3 surface 72.K Markov branching process 44.0 multitype 44.E Markov chains 260.A 342.A embedded 260.H general 260.1 imbedded 260.H (non) recurrent 260.B Markov field theory, Euclidean 150.F Markovian decision process 127.E Markovian policy 405.C Markovian type (stochastic differential equation) 406.0 Markov inequality (for poJynomials) 336.C Markov measure 136.0 Markov operators 136.B Markov partition (for an automorphism) 136.G Markov process(es) 261 342.A branching 44.E homogeneous 5.H invariant 5.H strong 261.B Markov property 261.B strong 261.B Markov shift 136.0 Markov statistical mechanics 340.C Markov subshift 126.1 
Subject Index Markov theorem, Gauss- Markov theorem, Oauss- 403.E Markov time 261.B 407.B Martin axiom (in set theory) 33.F Martin bound, Froissart- 386.B Martin boundary 207.C 260.1 dual 260.1 Martin compactification 207.C Martineau-Harvey duality 125.Y martingale 262 342.A {F,}-Wiener 406.B local 262.E martingale additive functional 261.E martingale part 406.B martingale problem 115.C 261.C 406.A Martin kernel 207.C 1\,1aslov bundle 274.C Maslov index, Keller- 274.C mass 132.A 258.C 271.E (of a current) 275.0 center of 271.E integrals of the center of 420.A mass distribution capacitary 338.K equilibrium 338.K Massey theorem, Blakers- 202.M mass lumping 304.0 mass matrix 304.0 master equation 402.1 matched asymptotic expansions, method of 25.B mathematical axiom 411.1 mathematical expectation (of a probability distribu- tion) 341.B mathematical induction 294.B axiom of 294.B definition by 294.B doulbe 294.B multiple 294.B mathematical linguistics 75.E mathematical logic 411.A mathematical modeling 40.0 300 mathematical models in biology 263 mathematical object 52.A mathematical programming 264.A mathematical programming problem 264.B mathematical structure 409.B mathematical system (for a structure) 409.B mathematics actuarial 214.A combinatorial 66.A discrete 66.A mathematics in the 18th century 266 mathematics in the 19th century 267 mathematics in the 17th century 265 Mathieu differential equation 268.A modified 268.A Mathieu functions 268 modified 268.A modified, of the first kind 268.0 modified, of the second kind 268.0 modified, of the third kind 268.0 of the second kind 268.0 Mathieu group 151.H Mathieu method 268.C matric group 226. B matrix (matrices) 269 adjacement 186.0 adjoin! 269.1 Alexander (of a knot) 235.C alternating 269.B 2042 amplification 304.F anti-Hermitian 269.1 antisymmetric 269.B association 102.1 asymptotic covariance 399.K of a bilinear form 256.H bounded 269.K circuit 254.B column finite 269.K companion 301.I complex orthogonal 269.1 correlation 397.1 covariance 341.B 397.1 density 351.B design 102.A 403.0 ....1:_____1 ""I£C\ A uldgullaJ LO'i'.f\. Oirac 377.C Oirac's y 351.0 error 405.0 Fisher information 399.0 fundamental cutset 186.0 fundamental tieset 186.0 group 226.B Hasse- Witt 9.E Hermitian 269.1 identity 269.A incidence (of a block design) 102.B incidence (ofa graph) 186.0 infinite 269.K information 102.1 inverse 269.B invertible 269.B iteration 302.C Jacobi 390.0 Jacobian 208.B lower triangular 269.B mass 304.0 m by n 269.A of (m, n)-type 269.A m x n 269.A moment 341.B monodromy 254.B nilpotent 269.F noncentrality 374.C nonsingular 269.B normal 269.1 orthogonal 269.1 parity check 63.C Pauli spin 258.A 351.0 period (of a closed Riemann surface) I1.C period (of a complex torus) 3.H port-admittance 282.C port-impedance 282.C positive definite 269.1 positive semidefinite 269.1 principal 3.1 projection 269.1 proper orthogonal 269.1 Q- 260.F of quadratic form 348.A rational function 86.0 rectangular 269.A regular 269.B Riemann 3.1 row finite 269.K S- 150.0386 sample correlation 280.E scalar 269.A scale 374.C 
2043 Seifert 235.C semisimpJe 269.0 of sesquilinear form 256.Q similar square 269.0 skew h- 269.1 skew-Hermitian 269.1 skew-symmetric 269.B square 269.A stiffness 304.C stochastic 260.A of the sum of squares between classes 280.B of the sum of squares within cJasses 280.B symmetric 269.B symplectic 60.L transfer function 86.B transition 126.1 260.A transposed 269.B triangular 269.B tridiagonal 298.0 unipotent 269.F unit 269.A unitary 269.1 upper triangular 269.B variance 341.B variance-covariance 341.B 397.1 weighting 86.B zero 269.B matrix algebra full 269.B total 269.B matrix convex of order m 212.C matrix element 351.B matrix game 173.C matrix group 226.B matrix monotone decreasing of order m 212.C matrix montone increasing of order m 212.C matrix representation 362.0 matrix Riccati differential equation 86.E matrix Riccati equation 405.0 matrix unit 269.B matroid 66.0 p-ary 66.H poly- 66.F operations for 66.H Mattuck theorem, Lutz- 118.E Maupertuis principle 180.A Ma urer-Cartan differential form of 249.R system of differential equations of 249.R maximal (hypersurface in Minkowski space) 275.H (ideal) 368.F (in prediction theory) 395.0 (Riemann surface) 367.F maximal concentration function 341.E maximaJ condition 311.C maximal deficiency (of an algebraic surface) 15.E maximal dilatation 352.B maximal dissipative operator 25l.J maximal element (in an ordered set) 311.B maximal entropy 136.C,H maximal ergodic lemma 136.B maximal filter 87.1 maximal function nontangential 168.B radial 168.B maximal ideal 67.C 368.F with respect to S 67.C maximal ideal space (of a Banach algebra) 36.E Subject Index Mean maximal independent system (of an additive group) 2.E maximal inequality (= maximal ergodic lemma) 136.B maximal invariant statistic 396.1 maximal k-split torus 13.Q maximally almost periodic group 18.1 maximal1y overdetermined (= holonomic) 274.H maximal operator 112.E maximal order 27.A maximal prime divisor (of an ideal) 67.F maximaJ separable extension (of a fieJd) 149.H maximal toroidal subgroup (of a compact Lie group) 248.X maximal torsion subgroup (of an Abelian group) 2.A maximal torus (of a compact Lie group) 248.X maximum, relative (of a function) 106.L maximum element (in an ordered set) 311.B maximum-flow minimum-cut theorem 28t.C maximum-flow problem 281.C maximum likelihood estimator 399.M maximum likelihood method 399.M limited information 128.C maximum modulus principle (for a holomorphic function) 43.B local 164.C Cartan 338.L complete 338.M maximum principle (for analytic functions) 43.B (in control theory) 86.F (for harmonic functions) 193.E (for minimal surface) 275.B (for parabolic operators) 327.0 dilated (in potential theory) 338.C entropy 419.A first (in potential theory) 338.C Frostman's 338.C Hopf (for equations of elliptic type) 323.C strong (for equations of el1iptic type) 323.C Ugaheri's 338.C maximum return 127.B maximum-separation minimum-distance theorem 281.C maximum solution (of a scalar equation) 316.E maximum spectral measure 390.0 Maxwel1-Boltzmann distribution law 402.B Maxwel1 convention 51.F Maxwel1 equations 130.A Maxwel1 fisheye 180.A Maxwel1 relations 419.B Maxwel1 stress tensor 130.A Maxwel1 theorem (on spherical functions) 393.0 Mayer- Vietoris exact sequence (for a proper triple) 201.C relative 201.L Mazur theorem 37.F Gel'fand- 36.E meager set 425.N non- 425.N mean (of an almost periodic function) 18.B,E (of numbers or a function) 211.C (of a probability distribution) 341.B (of a random variable) 342.C (of a statistical data) 397.C (of a weakly stationary process) 395.A IX-trimmed 371.H 
Subject ludex MeaD absolute deviatiou arithmetic 211.C 397.C arithmetico-geometric 134.B bounded, oscillation 168.B conditional (of a random variable) 342.E 397.1 continuous in the 217.M 407.A convergence in the, of order p 168.B 342.D 407.A convergence in the, of power p 168.B of degree r (of a function with respect to a weight function) 211.C Fejer 159.C geometric 211.C geometrical 397.C harmonic 211.C 397.C limit in the 168.B moment about the (kth) 341.B population 396.C sample 396.C mean absolute deviation 397.C mean anomaly 309.B mean concentration function 341.E mean content (of a tolerance region) 399.R mean curvature 111.H 364.0 365.0, App. A, Table 4.1 total 365.0 mean curvature vector 365.0 mean energy 402.0 mean entropy 402.0 mean ergodic theorem 136.B mean free energy 340.B 402.0 mean motion 309.B mean number of sheets (of a covering surface of a Riemann sphere) 272.1 mean oval (of two ovals) 89.0 mean p-valent, areally 438.E mean p-valued, circumferentially 438.E mean recurrence time 260.C mean square error 399.E 403.E mean unbiased, asymptotically 399.K mean unbiased estimator 399.C mean value (of a continuous function on a compact group) 69.A (of a weakly stationary process) 395.C mean value theorem (in differential calculus) 106.E (on harmonic functions) 193.E first (in the Riemann integral) 216.B second (in the D-integral) 100.0 second (in the Riemann integral) 216.B second (for the Stieltjes integral) 94.C Siegel 182.E Vinogradov 4.E mean vector 341. B measurability 443.1 strong 443.1 measurability theorem, Pettis 443.B measurable (flow) 136.0 (multi valued vector function) 443.1 (operator function) 308.0 (set) 270.0,0 (in set theory) 33.F (stochastic process) 407.A (transformation) 136.B (vector valued function) 443.B absolutely 270.L . 270.C Baire 270.L Jordan 270.0,0 2044 Lebesgue 270.0 Ji- 270.0 nearly Borel 261.0 progressively (stochastic process) 407.A real-valued (in set theory) 33.F with respect to a family of random varia bles 342.C with respect to Ji* 270.E scalarly 443.1 strongly 443.B,I universally 270.L weakly 443.B,1 measurable cardinal number 33.E measurable event 342.B measurable function 270.1 - 270.1 Baire 270.L Borel 270.1 Lebesgue 270.1 universally 270.L measurable space(s) 270.C analytic 270.C complete 270.0 isomorphic 398.0 standard 270.C measurable vector function 308.0 measure 270.0,0 of an angle 139.0 of association 397.K atomless probability 398.C -regular 270.F Borel 270.0 bounded 270.0 canonical 115.B 260.0 characteristic 407.0 Caratheodory 270.E Caratheodory outer 270.E complete 270.0 completely additive 270.0 complex spectral 390.0 convergence in 168.B ,)- 270.0 distortion 213.E excessive 261.F finitely additive 270.0 G-invariant 225.B Oaussian random 407.0 generalized Lebesgue 270.E of genus (of a positive definite symmetric matrix) 348.K Oibbs 136.C Oreen 193.1 Haar 225.C harmonic 120.C 169.B 207.B 260.1 Hausdorff 169.0 idempotent 192.P image 270.K inner harmonic 169.B invariant 136.B 255.B 260.A,I 261.F Jensen 164.K Jordan 270.0,0 killing 115.B kinetic 228.A K -regular 270.F Lebesgue 270.0 Lebesgue outer 270.0 Lebesgue-Stieltjes 166.C 270.L left invariant Haar 225.C of length 139.C Levy 5.E 
2045 oflocation 397.C Loeb 293.0 Markov 136.0 orthogonal 164.C outer 270.E,0 outer harmonic 169.B Plancherel (of a locally compact group) 437.L Poisson random 407.0 positive Radon 270.1 probability 341 342.B product 270.H quasi-invariant 225.1 quotient 225.H Radon 270.0 real spectral 390.0 regular 270.F relatively invariant 225.H representing 164.C right invariant Haar 225.C O'-additive 270.0 O'-finite 270.0 signed 380.C smooth (for a Riemannian metric) 136.0 smooth invariant 126.1 spectral 390.0,K 395.B,C speed 115.B subinvariant 261.F superharmonic 260.1 of variability 397.C vector 443.0 Weil 225.0 Wiener 45.B 250.E measure algebra 192.0 measure-preserving (transformation) 136.B measure problem, invariant 136.C measure space 270.0 bounded 270.0 complete 270.0 complete product 270.H Lebesgue, with a finite (or O'-finite) measure 136.A product 270.H O'-finite 270.0 measure theory 270 mechanics celestial 55. A classical 271.A classical statistical 402.A equilibrium statistical 402.A graphical 19.C Markov statistical 340.C Newtonian 271.A nonlinear 290.A quantum 351 quantum statistical 402.A statistical 342.A 402 mechanism, Higgs 132.0 median 341.H 396.C 397.C sample 396.C mediant (of two fractions in Farey sequence) 4.B median unbiased estimator 399.C medieval mathematics 372 meet (in a Boolean algebra) 42.A (in an ordered set) 243.A (of sets) 381.B Mehler formula App. A, Table 19.m Mehler integral representation, Laplace- App. A, Table 18.II Mellin transform 220.C Subject Index Method(s) member (of a set) 381.A membrane equation of a vibrating 325.A permeable 419.A memory fading 163.1 first-in first-out 96.E first-in last-out 96.E memory channel almost finite 213.F finite 213.F memoryless channel 213.C discrete 213.F memory unit 75.B Menelaus theorem (in affine geometry) 7.A Menger-Nobeling embedding theorem 117.0 Men'shov theorem, Looman- 198.A Rademacher- 317.B Mercer theorem 217.H merging 96.C meridian (of a knot) 235.B (of a surface of revolution) 111.H meromorphic (in a domain) 272.A meromorphic curve 272.L meromorphic differential (on a Riemann surface) 367.H meromorphic function(s) 21.J 272.A (on an analytic set) 23.0 (on a complex manifold) 72.A transcendental 272.A meromorphic mapping, proper (between analytic spaces) 23.0 meromorphy circle of (of a power series) 339.0 radius of (of a power series) 339.0 Mersenne number 297.E, App. B, Table 1 Mersenne prime 297.E Mertens theorem (on the Cauchy product of two series) 379.F mesh of a covering (in a metric space) 273.B mesons 132.B meta-Abelian group 190.H metabolic model (in catastrophe theory) 51.F metamathematics 156.0 metastable range (of embeddings) 114.0 method(s) Abel, of summation 379.N Adams-Bashforth 303.E Adams-Moulton 303.E ADI 304.F alternating direction implicit (AOI) 304.F Arrow-Hurwicz-Uzawa gradient 292.E of averaging 290.0 Bairstow 301.E Bernoulli 301.J Borel, of summation 379.0 branch and bound 215.0 Cesaro, of summation of order a 379.M Cholesky 302.B circle 4.B collocation 303.1 congruence 354.B conjugate gradient (CO.) 302.0 constructive 156.0 continuation 301.M Crout 302.B cyclic Jacobi 298.B d' Alembert, of reduction of order 252.F Oanilevskii 298.0 
Subject Index Method(s) Oavidenko, of differentiation with respect to a parameter 301.M Oejon-Nickel 301.0 difference 303.A direct (in the calculus of variations) 46.E discrete variable 303.A distribution-free 371.A Ooolittle 302.B downhill 301.L Ouhamel 322.0 Ourand-Kerner (OK) 301.F Ourand-Kerner-Aberth (OKA) 301.F Enskog 217.N Euclidean 150.F Euler (of describing the motion of a fluid) 205.A Euler (of numerical solution of ordinary differ- ential equations) 303.£ Euler (of summation) 379.P expansion 205.B extrapolation 303.F factorization 206. B of false position 301.C of feasible directions (in nonlinear program- ming) 292.E finite element 223.0 304.C fixed point 138.B floating point 138.B Frobenius App. A, Table 14.1 Oalerkin 303.1 304.B Oarside-Jarratt-Mack 301.N Oauss-Seidel 302.C Oivens 298.0 gradient 292.£ Oraeffe 301.N graphical, of statistical inference 19. B Oreen function 402.J of harmonic balance 290.0,E Hessenberg 298.0 Hill, of solution 268.B Hitchcock 301.E hodograph 205.B Horner 301.C Householder 298.0 implicit 303.E implicit enumeration 215.0 Ince-Ooldstein 268.C indirect least squares 128.C interpolation 224.A isoparametric 304.C Jacobi (of numerical computation of eigen- values) 298.B Jacobi (in numerical solution of linear equations) 302.C Jacobi second, of integration 324.0 Jeffreys 25.B killing (of obtaining a homotopy group) 202.N ladder 206.B Lagrange (of describing the motion of a fluid) 205.A Lagrange (of indeterminate coefficients) 106.L Lagrange, of variation of constants 252.0 Lagrange, of variation of parameters 252.0 Lagrange-Charpit 322.B, App. A, Table 15.1I of Lagrange multipliers 106.L Lanczos 298.0301.N Laplace 30.B ofleast squares (for estimation) 403.E of least squares (for numerical solution of linear equations) 397.1 2046 of least squares (for numerical solution of ordinary differential equations) 303.1 Lebesgue, of summation 379.S Lehmer 301.K LighthilI 25.B limited information maximum likelihood 128.C of linearization 290.0 linear k-step 303.E linear multistep 303.E of major ants 316.0 of matched asymptotic expansions 25. B Mathieu 268.C maximum likelihood 399.M middle-square 354.B Milne 303.E modified minimum chi-square 4OO.K moment 399.L Monte Carlo 385.C of moving frames llO.A of multiple scales 290.E multistep 303.E multivalue 303.E Newton-Raphson 301.0 nonparametric 371.A Norlund, of summation 379.Q of orthogonal projection 323.0 (p + I)-stage 303.0 penalty 292.E Perron (in Oirichlet problem) 120.C perturbation 25.B Poincare 25.B Poincare-Lighthill-Kuo (P.L.K.) 25.B polynomial extrapolation 303.F power 298.C predictor-corrector (PC) 303.E projective approximation 304.B QR 298.F of quadrature 313.0 QZ 298.0 rational extrapolation 303.F Rayleigh-Ritz 46.F 271.0 renormalization 361.A Richardson 302.C Riemann, of summation 379.S Riesz, of summation of the kth order 379.R Ritz 46.F 303.1 304.B robust 371.A Rosen gradient projection 292.E Runge-Kutta 303.0 Runge-Kutta-Oill 303.0 saddle point 25.C scaling 346.E scoring 397.M simplex 255.C spectral 304.B stationary phase 30.B of steepest descent 25.C step by step 163.0 Strum 301.C of successive approximation (for an elliptic par- tial differential equation) 323.0 of successive approximation (for Fredholm integral equations of the second kind) 217.0 of successive approximation (for ordinary differential equations) 316.0 of successive iteration (for Fredholm integral equations of the second kind) 217.0 summable by Abel's 379.N summable by Borel's exponential 379.0 
2047 summable by Borel's integral 379.0 summable by Cesaro's, of order ex 379.M summable by Euler's 379.P summable by Holder's, of order p 379.M summable by Norlund's 379.Q summable by M. Riesz's, of order k 379.R of summation 379.L Sylvester elimination 369.E three-stage least squares 128.C threshold Jacobi 298.B two-phase simplex 255.C two-stage least squares 128.C variational 438.B of variation of constants 55.B 252.1 of variation of parameters App. A, Table 14.1 WKB 25.B WKBJ 25.B metric 273.B Bergman 188.0 Einstein 364.1 Einstein-Kahler 232.C J- 136.F IN- 136.F Finsler 152.A Hermitian 232.A Hodge 232.0 Kiihler 232.A Kerr 359.E left invariant (in a topological group) 423.1 Petersson 32.B Poincare 74.0 pseudo- 273.B pseudo-Riemannian 105.P Riemannian 105.P Robertson-Walker 359.E standard Kahler (of a complex projective space) 232.P Teichmiiller 416 metrically isomorphic automorphisms (on a measure space) 136.E metric comparison theorem l78.A metric connection 80.K metric invariant (on a measure space) 136.E metric multidimensional scaling 346.E metric space(s) 273 compact 273.F complete 273.1 discrete 273.B indiscrete pseudo- 273.B induced by a mapping 273.B precompact 273.B product 273.B pseudo- 273.B separable 273.E totally bounded 273.B metric structure almost contact 11O.E contact 110.E metric subspace 273.B metric topology 425.C metric vector space 256.H metrizable topological group 423.1 metrizable topological space 273.K metrizable uniform space 436.F pseudo- 436.F Meusnier theorem (on surfaces) 111.H Meyer decomposition theorem, Ooob- 262.0 Michael theorem 425X micro-analytic 125.CC 274.E microboundle 147.P Subject Index Minimal operator normal PL 147.P PL 147.P tangent PL 147.P microcanonical ensemble 402.0 micro differential equation 274.0 micro differential operator 274.F of finite order 274.F of infinite order 274.F microfunction 274.E holomorphic 274.F micro local analysis 274 345.A micro locally elliptic (operator) 345.A microlocaloperator 274.F micro-pseudo local property 345.A middle point 7.C middle-square method 354.B midpoint 7.C midpoint rule 303.E midrange 374.0 Mikusinski operator 306.B Mills equation, Yang- 80.0 Mills field, Yang- 150.0 Mills functional, Yang- 80.Q Mills G-connection, Yang- 80.Q Mil'man property, Krein- 443.H Mil'man theorem 37.0 Krein- 424.U Milne corrector 303.F Milne method 303.E Milne predictor 303.E Milne-Simpson formula 303.E Milnor fibering theorem 418.0 Milnor fibration 418.0 Milnor invariant 235.0 Milnor monodromy 418.0 Milnor number (in Milnor fibering theorem) 418.0 Minakshisundaram-Pleijel asymptotic expansion 391.B minimal (algebraic surface) 15.0 (algebraic variety) 16.1 (ideal) 368.F (immersion) 275.A (superharmonic function) 260.1 (transformation) 136.H relatively 15.0 16.1 minimal basis (of a principal order or an algebraic number field) 14.B minimal chain (for a transition probability) 26O.F minimal complete class 398.B minimal complex 70.E minimal condition (in an ordered set) 311. C restricted (in a commutative ring) 284.A minimal diffeomorphism 126.N minimal element (in an ordered set) 311.B minimal flow 126.N minimal function, X- 367.E minimal immersion 275.A branched 275.B generalized 275.B minimality 16.1 absolute 16.1 minimally almost periodic group 18.1 minimally elliptic singularity 418.C minimal model 15.0 (for the algebra of differential forms) 114.L N6ron (of an Abelian variety) 3.N relatively 15.0 minimal operator 112.E 
Subject Index Minimal parabolic k-subgroup minimal parabolic k-subgroup 13.Q minimal polynomial (of an algebraic element) 149.E (of a linear transformation) 269.L (of a matrix) 269.F minimal prime divisor (of an ideal) 67.F minimal projection, Lie 76.B minimal realization 86.0 minimal resolution 418.C minimal set 126.E minimal splitting field (of a polynomial) 149.0 minimal submanifold 275.A 365.0 minimal sufficient O'-field 396.E minimal surface 111.1 334. B affine 110. C branched 275.B minimal surface equation 275.A minimal variety 275.0 minimax (estimator) 399.H minimax decision function 398.B minimax level a test 400.F minimax principle (for eigenvalues of a compact operator) 68.H (for statistical decision problem) 398.B for Ak 391.0 minimax solution 398.B minimax theorem 173.C minimization problem, group 215.C minimizing sequence 46.E minimum (minima) ofa function 106.L relative (at a point) 106.L successive (in a lattice) 182.C weak 46.C minimum chi-square method, modified 400.K minimum-cost flow problem 281.C minimum curvature property 223.F minimum element (in an ordered set) 311.B minimum immersion 365.0 minimum norm property 223.F minimum principle energy 419.C enthalpy 419.C Oibbs free energy 419.C Helmholtz free energy 419.C for Ak 391.0 of A 391.0 minimum solution (of a scalar equation) 316.E minimum variance unbiased estimator, uniformly 399.C Minkowski-Farkas theorem 255.B Minkowski-Hasse character (of a nondegenerate quadratic form) 348.0 Minkowski-Hasse theorem (on quadratic forms over algebraic number fields) 348.0 Minkowski-Hlawka theorem 182.0 Minkowski inequality 211.C, App. A, Table 8 Minkowski reduction theory (on fundamental regions) 122.E Minkowski space 258.A Minkowski space-time 359.B Minkowski theorem 182.C on discriminants 14.B on units 14.0 Minlos theorem 424.T minor (of a matrix) 103.0 (of a matroid) 66.H Fredholm's first 203.E 2048 Fredholm's rth 203.E principal (of a matrix) 103.0 minor arc 4.B minor axis (of an ellipse) 78.C minor function l00.F minus infinity 87.0 minute (an angle) 139.0 Mittag-Lerner theorem 272.A mixed Abelian group 2.A mixed area (of two ovals) 89.0 mixed group 190.P mixed Hodge structure 16.V mixed ideal 284.0 mixed initial-boundary value problem (for hyper- bolic operator) 325.K mixed insurance 214.B mixed integer programming problem 215.A mixed model 102.A mixed periodic continued fraction 83.C mixed problem 322.0 mixed spin or rank (k, n) 258.B mixed strategy 173.C mixed tensor 256.1 mixed type, partial differential equation of 304.C 326.A mixing (automorphism) k-fold 136.E strongly 136.E weakly 136.E mixture 351.B Mizohata equation, Lewy- 274.0 ML estimator 399.M mobility, axiom of free (in Euclidean geometry) 139.B Mobius band 41O.B Mobius function 66.C 295.C Mobius geometry 76.A Mobius strip 41O.B Mobius transformation 74.E 76.A Mobius transformation group 76.A mod 1, real number 355.0 mod p (modulo p) Hopf invariant 202.S isomorphism (in a class of Abelian groups) 202.N modal logic 411.L modal proposition 411.L modal unbiased estimator 399.C mode 396.C 397.C sample 396.C model (of an algebraic function field) 9.0 (of a mathematical structure) 409.B (of a symbolic logic) 276.0411.0 Bayesian 403.0 Bradley-Terry 346.C Bush-Mosteller 346.0 canonical 251.N component 403.F components-of-variance 403.C countable (of axiomatic set theory) 156.E derived normal (of a variety) 16.F dual resonance 132.C Estes stimulus-sampling 346.0 factor analysis 403.C fixed effect 102.A functional 251.N game theoretic 307.C Olashow- Weinberg-Salam 132.0 
2049 of human death and survival 214.A Ising 340.B 402.0 Klein (of non-Euclidean geometry) 285.C learning 346.G linear 403.0 linear logistic 403.C log linear 403.C Luce p- 346.G mathematical, in biology 263 metabolic (in catastrophe theory) 51.F minimal 15.0 minimal (for the algebra of differential forms) 114.L mixed 102.A multiple 403.£ multivariate linear 280.B natural (in axiomatic set theory) 33.C normal linear 403.C Poincare (of geometry) 285.0 queuing 260.H random-effects 102.A 403.C relatively minimal 15.0 Sakata 132.0 Scheffe 346. C simple 403.F spin-flip 340.C static (in catastrophe theory) 51.B stochastic Ising 340. C stochastic programming 307.C string 132.C Sz.-Nagy-Foia 251.N Thurstone-Mosteller 346.C Veneziano 132.C 386.C Whittaker 450.0 model experimentation 385.A modeling, mathematical 300 model scheduling 307.C model selection 401.0 model theory 276 modification (of a stochastic process) 407.A holomorphic (of an analytic space) 23.0 proper (of an analytic space) 23.0 spherical 114.F modified Bessel functions App. A, Table 19.1V modified Fourier hyperfunction 125.BB modified indicator function 341.C modified Mathieu differential equation 268.A modified Mathieu function 268.A of the first kind 268.0 of the second kind 268.0 of the third kind 268.0 modified minimum chi-square method 400.K modified wave operator 375.B modular, weakly (in quantum mechanics) 351.L modular automorphism 308.H modular character (of a modular representation) 362.1 modular form Hilbert, of dimension -k 32.0 Hilbert, of weight k 32.G of level N 32.C Siegel, of dimension -k 32.F Siegel, of weight k 32.F modular function (of a locally compact group) 225.0 Hilbert 32.0 oflevel N 32.C Siegel, of degree n 32.F Subject Index Modulus (moduli) modular group 122.0 elliptic 122.0 Hilbert 32.0 Siegel, of degree n 32.F modular lattice 243.F modular law (in a lattice) 243.F modular operator 308.H modular represenation (of a finite group) 362.0 modular surface, Hilbert 15.H module(s) 277 (of a family of curves) 143.A A- 277.C over A 277.C of A-homomorphisms (between A-modules) 277.E Artinian 277.1 of boundaries 200.C category of left (right) R- 52.B character (of an algebraic group) 13.0 of co boundaries 200.F of cocycles 200. F coefficient 200.L cohomology 200.F connected graded 203.B of cycles 200.C defining (of a linear system) 16.N degenerate 118.0 divisible A- 277.0 dual 277.K dual graded 203.B duality theorem for n- 422.L factor A- 277.C faithfully flat A- 277.K of finite length 277.1 flat A- 277.K free 277.G generalized 143.B graded A- 200.B homology 200.C of homomorphisms (between two modules) 277.B induced 277.L injective A- 277.K Jordan 231.C left A- 277.0 Noetherian 277.1 (1.1-, 383.1 with operator domain A 277.C projective A- 277.K of quotients of an R-module with respect to S 67.0 (R, S)-injective 200.K (R, S)-projective 200.K representation (of a linear representation) 362.C ofrepresentations (of a compact group) 69.0 right A- 277.0 torsion A- 277.0 moduli functor 16.W moduli scheme 16.W coarse 16.W fine 16.W moduli space 16.W 72.0 of curves of genus g 9.1 local 72.0 modulus (moduli) (of a complex number) 74.B (of a complex torus of dimension I) 32.C (= a conformal invariant) 11.B 77.E 
Subject Index Modulus number (of a congruence) 297.0 (of an elliptic integral) 134.A, App. A, Table 16.1 (in Jacobi elliptic functions) 134.1, App. A, Table 16.m (of a locally multivalent function) 438.E (of a ring) 77.E complementary (of an elliptic integral) App. A, Table 16.1 complementary (in Jacobi elliptic functions) 134.1, App. A, Table 16.m of continuity (of a function) 84.A of continuity of kth order (of a continuous function) 336.C of elasticity in shear 271.0 of elasticity in tension 271.0 field of 73.B local maximum, principle 164.C maximum, principle (for a holomorphic func- tion) 43.B periodicity (of an elliptic integral) 134.A of rigidity 271.0 Young's 271.0 modulus number 418.E modus ponens 41l.I Moishezon criterion, Nakai- (of ampleness) 16.E Moishezon space 16.W mole numbers 419.A moment 397.C absolute (kth) 341.B bivariate 397.H central 397.C conditonal 397.1 factorial 397.0 of inertia 271.E (kth) 341.B about the mean (kth) 341.B population (of order k) 396.C principal, of inertia 271.E sample (of order k) 396.C moment generating function 177.A 341.C 397.0,1 moment matrix 341.B moment method 399.L moment method estimator 399.L moment problem Hamburger 240.K Hausdorff 240.K Stieltjes 240.K momentum 271.A,E angular 271. E generalized 271.F integrals of angular 420.A intrinsic angular 351.0 orbital angular 351.E theorem of 271.E theorem of angular 271.E momentum density, angular 150.B momentum 4-vector, energy- 258.C momentum operator angular 258.0 energy- 258.0 momentum phase space 126.L momentum representation 351.C momentum tensor angular 258.0 energy- 150.0 359.0 monad (in homology theory) 200.Q (in nonstandard analysis) 293.0 2050 Monge-Ampere equations 278, App. A, Table 15.m Monge differential equation 324.F monic polynomial 337.A monoclinic system 92.E monodromy group (of an n-fold covering) 91.A (of a system oflinear ordinary differential equa- tions) 253.B Milnor 418.0 total 418.F monodromy matrix 254.B monodromy theorem (on analytic continuation) 198.1 monogenic function in the sense of E. Borel 198.Q in the sense of Cauchy 198.Q monoid, unitary 409.C monoidal transformation (of an analytic space) 23.0 (of a complex manifold) 172.H (by an ideal sheaf) 16.K with center W 16.K real 274.E monomial 337.B (module) 277.0 admissible (in Steenrod algebra) 64.B monomial representation (of a finite group) 362.0 monomorphism (in a category) 52.0 monothetic group 136.0 monotone (curve) 281.B operator 212.C monotone class 270.B monotone class theorem 270.B monotone decreasing (set function) 380.B matrix, of order m 212.C monotone decreasing function 166.A strictly 166.A monotone function 166.A strictly 166.A strictly (of ordinal numbers) 312.C monotone increasing (set function) 380.B matrix, of order m 212.C monotone increasing function 166.A strictly 166.A monotone likelihood ratio 374.1 monotonely very weak Bernoulli 136.F monotone mapping 311.E monotone operator (in a Hilbert space) 286.C monotone sequence (of real numbers) 87.B monotonically decreasing (sequence of real numbers) 87.B monotonically increasing (sequence of real numbers) 87.B monotonic function, completely 240.E,K Monte Carlo method 385.C Montel space 424.0 Montel theorem 435.E moon argument, behind-the- 351.K Moore-Smith convergence 87.H Moore space 273.K 425.AA Moore space problem, normal 425.AA Mordell conjecture 118.E Mordell- Weil theorem 118.E weak 118.E more informative (experiment) 398.0 
2051 Morera theorem 198.A Morgenstern solution, von Neumann- 173.0 morphism (in a category) 52.A (of chain complexes) 2oo.H (of complexes) 13.R (of filtered mod ules) 200.1 (of inductive systems) 210.0 (of unfoldings) 51.0 affine 16.0 connecting 2oo.H diagonal (in a category) 52.E etale 16.F finite 16.0 flat 16.0 faithfully flat 16.0 Frobenius 450.P functorial 52.1 inverse 52.0 k- (between algebraic groups) 13.A projective 16.E proper (between schemes) 16.0 quasiprojective 16.E S- 52.0 of schemes 16.0 separated 16.0 shape 382.A smooth 16.F strict (between topological groups) 423.1 structure 52.G Morse function 279.B Morse index theorem 279.F Morse inequaJities 279.0 Morse lemma 279.B Morse-Smale diffeomorphism 126.1 Morse-Smale flow 126.1 Morse-Smale vector field 126.1 Morse theory 279 fundamental theorems of 279.0 Morsification 418.F Moser implicit function theorem, Nash- 286.1 Mosteller model Bush- 346.G Thurstone- 346.C most powerful (test) 400.A most probable cause 401.E most probable value 401.E most stringent level rx test 400.F motion(s) (in dynamical system) 126.B (Euclidean) 139.B Brownian 5.0 45 342.A Brownian (d-dimensional) 45.C Brownian, on Lie groups 406.G Brownian, with an N-dimensional time para- meter 45.1 central 126.E elliptic 55.A equation of (of a fluid) 205.A equation of (of a particJe in a gravitation field) 359.0 equations of (in Newtonian mechanics) 271.A Euler equation of (of a perfect fluid) 205.B {.}-Brownian 45.B 406.B group of (in EucJidean geometry) 139.B group of, in the wider sense 139.B Heisenberg equation of 351.0 hyperbolic 420.0 hyperbolic-elJiptic 420.0 Subject Index Multiple hyperbolic-parabolic 420.0 infinitesimal (of a Riemannian manifold) 364.F Lagrange equation of 271.F Lagrange-stable 420.0 law of 271.A mean 309.B Newton three laws of 271.A Ornstein-Uhlenbeck Brownian 45.1 oscillating 420.0 parabolic 420.0 parabolic-el1iptic 420.0 perpetual 402.G proper (in Euclidean geometry) 139.B proper (of a star) 392 quasiperiodic 136.G right-invariant Brownian 406.G simple harmonic 318.B space-time Brownian 45.F Moulton method, Adams- 303.E movability 382.C movable 382.C k- 382.C movable branch point (of an algebraic differential equation) 288.A movable singularity (of an algebraic differential equation) 288.A move 173.B chance 173.B moving average 397.N weighted 397.N moving average process 421.0 autoregressive 421.0 autoregressive integrated 421.G moving average representation backward 395.0 canonical backward 395.0 moving coordinates App. A, Table 3.1V moving coordinate system 90.B moving frame(s) 90 II1.C 417.B method of llO.A natural 417.B orthonormal 417.0 stochastic 406.G muJticolJinearity 128.B muJticommodity flow problem 281.F multidiagonal type 304.C multidimensional diffusion process 115.A,C multidimensional gamma function 374.C multidimensional hypergeometric distribution App. A, Table 22 multidimensional normal distribution App. A, Table 22 multidimensional scaling 346.E non metric 346.E multi-index 112.A 168.B multilinear form 256.H multilinear mapping 256.H alternating 256.H antisymmetric 256.H skew-symmetric 256.H symmetric 256.H multinomial distribution 341.0 negative 341.0 multinomial theorem 330 multi-objective model 307.C multiobjective programming 264.C multiplanar coordinates 90.C multiple 297.A (of an element of a ring) 67.H 
Subject Index Multiple complex (of a fractional ideal) 14.E common (of elements of a ring) 67.H least common 297.A least common (of elements of a ring) 67.H scalar (of an element of a module) 277.0 scalar (of a linear operator) 37.C scalar (in a linear space) 256.A scalar (of a vector) 442.A multiple complex 200.H multiple correlation coefficient 397.1 sample 280.E multiple covariant 226.E absolute 226.E multiple hypergeometric distribution 341.0 multiple integral (in Lebesgue integral) 227.E (in Riemann integral) 216.F multiple mathematical inductions 294.B multiple model 403.F multiple point (on an arc) 93.B (of a plane algebraic curve) 9.B (on a variety) 16.F multiple root (of an algebraic equation) 10.B multiple sampling inspection 404.C multiple-valued (analytic function) 198.J multiple Wiener integral 176.1 multiplication (of an algebra) 203.F (of a graded algebra) 203.B (in a group) 190.A (of an H-space) 203.0 (of local Lie groups) 423.L (by a natural number) 294.B (in a ring) 368.A associative (of a graded algebra) 203.B commutative (of a graded algebra) 203.B commutative law for (in a ring) 368.A complex 73.A homotopy associative 203.0 homotopy commutative 203.0 Pontryagin 203.0 scalar (m a module) 277.0 scalar (on vectors) 442.A symmetric 406.C multiplication theorem, Hadamard 339.0 multiplicative (arithmetic function) 295.B multiplicative automorphic function 32.A multiplicative class 270.B multiplicative congruence 14.H multiplicative ergodic theorem 136.B multiplicative function 32.A 295.B multiplicative functional (of a Markov process) 261.E transformation by 261.F multiplicative group 149.A 190.A of a field 190.B multiplicative Jordan decomposition (of a linear transformation) 269.L multiplicatively closed subset (of a ring) 67.C multiplicative valuation 439.C multiplicator (of a relative invariant measure) 225.H multiplicity (of a covering surface) 367.B (of an eigenvalue for an integral equation) 217.F (of an eigenvalue of a matrix) 390.B (of a Gaussian process) 176.E (of a local ring) 284.0 2052 (of a representation) 362.0 (of a root of an algebraic equation I 1O.B (of a weight) 248.W algebraic (of an eigenvalue) 390.B geometric (of an eigenvalue) 390.A intersection (of two subvarieties) 16.Q representation without 437.G set of 159.1 multiplicity function (of a mapping) 246.G multiplier (of a group) 362.1 (of a semi-invariant) 226.A characteristic (of a closed orbit) 126.G characteristic (of a periodic linear system) 163.F Jacobi's last App. A, Table 14.1 Lagrange 46.B method of Lagrange 106.L Stokes 254.0 multiplier algebra 36.K multiply connected domain 333.A multiply transitive (permutation group) 151.H multipolar coordinates 90.C multiprocessor scheduling problem 376 multistage allocation process 127.A multistage choice process 127.A multistage game 173.C multistage programming 264.C multistage sampling 373.E multistep method 303.E linear 303.E multi type Galton-Watson process 44.C multitype Markov branching process 44.E multivalent function 438.E multivalued function 165.B multivalue method 303.E multivariate (data) 397.A multivariate analysis 280 of variance 280.B multivariate linear model 280.B multivariate normal distribution 397.1 Muntz theorem (on polynomial approximation) 336.A mutual energy 338.B mutual information 213.E mutually associated diagrams (for O(n) diagrams) 60.1 mutually disjoint family (of sets) 381.0 mutually noncomparable (summations) 379.L mutually orthogonal (latin squares) 241.B M.V.W.B. (=monotonely very weak Bernoulli) 136F N N (natural numbers) 294.A,B NP 71.E n-ary predicate 411.G n-ary relation 411.G n-ball 140 open 140 nobody problem 420.A n-cell 70.0 140 open 70.0 topological 140 n-classifying space (of a topological group) 147G n-cochain (for an associative algebra) 200.L n-connected (pair of topological spaces) 202.L 
2053 (space) 79.C 202.L locally 79.C n-connective fiber space 148.0 n-cube, unit 140 n-cylinder set 270.H n-decision problem 398.A n degrees of freedom (sampling distribution) 374.B,C n-dimensional (normal space) 117.B n-dimensional distribution 342.C n-dimensional distibution function 342.C n-dimensional Euclidean geometry 139.B 181 n-dimensional probability distribution 342.C n-dimensional random variable 342.C n-dimensional sample space 396.B n-dimensional statistic 396.B n-disk 140 open ] 40 n-element 140 n-fold covering 91.A n-fold reduced suspension (of a topological space) 202.F n-gon, regular 357.A n-gonal number 296.A n-particle subs pace 377.A n-person game 173.B-0 n-ply connected (plane domain) 333.A n-section (in a cell complex) 70.0 n-sheeted (covering surface) 367.B n-simple (pair of topological spaces) 202.L (space) 202.L n-simplex (in a Euclidean simplicial complex) 70.B (in a semisimplicial complex) 70.E (in a simplicial complex) 70.C n-sphere 140 open 140 solid 140 n-sphere bundle 147.K n-times continuously differentiable (function) 106.K n-times differentiable (function) 106.0 n-torus 422.E n-tuple 256.A 381.B n-universal bundle 147.G n-valued (analytic function) 198.1 (n + 2)-hyperspherical coordinates 79.A 90.B N-ple Markov Gaussian process 176.E in the restricted sense 176.F nth approximation (of an n-times differentiable function) 106.E nth convergent (of an infinite continued fraction) 83.A nth derivative (of a differentiable function) 106.0 nth derived function 106.D nth differential (of a differentiable function) 106.0 nth partial quotient 83.A nth order, differential of 106.0 nth order partial derivatives 106.H nth term 165.0 nabla 442.0, App. A, Table 3.II Nachbin-Goodner-Kelley theorem 37 M Nagumo theorem, Kneser- 316.E NaJmark theorem, Gel'fand- 36.G Nakai-MoJshezon criterion (of ampJeness) 16.E Nakanishi equation, Landau- 146.C Nakanishi variety, Landau- 146.C 386.C Subject Index Negative resistance Nakano-Nishijima-Gell-Mann formula 132.A Nakayama lemma 67.0 Nambu-Goldstone boson 132.C Napier analogies App. A, Table 2.m Napierian logarithm 131.0 Napier number 131.0 Napier rule App. A, Table 2.ll Nash bargaining solution 173.C Nash equilibrium 173.C Nash-Moser implicit function theorem 286.1 nat 213.B natural additive functional 261.E natural boundary (of an analytic function) 198.N (of a diffusion process) 115.B natural equation (of a curve) 111.0 (of a surface) 110.A natural equivalence 52.1 natural extension (of an endomorphism) 136.E natural geometry 110.A natural injection (from a subgroup) 190.D naturality (of a homotopy operation) 202.0 natural logarithm 131.0 natural model (in axiomatic set theory) 33.C natural moving frame 417.B natural number 294.A,B nonstandard 276.E Skolem theorem on impossibility of 156.E natural positive cone 308.K natural scale 260.G natural spline 223.F natural surjection (to a factor group) 190.0 natural transfonnation 52.1 Navier-Stokes equation(s) 204.B 205.C general 204.F Navier-Stokes initial value problem 204.B nearly Borel measurable set 261.0 nearly everywhere (in potential theory) 338.F necessary (statistic) 396.E necessity 411.L necklace, Antoine's 65.G negation (of a proposition) 411.B negative (complex) 200.H (element of a lattice-ordered group) 243.G (element of an ordered field) 149.N (rational number) 294.0 negative binomial distribution 341.0 397.F, App. A, Table 22 negative curvature 178.H negative definite (function) 394.C negative definite Hermitian form 348.F negative definite quadratic form 348.B negative half-trajectory 126.0 negative infinity 87.0 355.C negative Jimit point 126.D negatively invariant 126.0 negatively Lagrange stable 126.E negatively Poisson stable 126.E negative multinomial distribution 341.0 negative number 355.A negative orientation (of an oriented C'-manifold) 105.F negative part (of an element of a vector lattice) 31O.B negative poly nominal distribution App. A, Table 22 negative prolongationallimit set, first 126.D negative resistance 318.B 
Subject Index Negative root negative root (of a semisimple Lie algebra) 248.M negative semidefinite quadratic form 348.C negative semiorbit 126.0 negative variation (of a mapping) 246.H (of a real bounded function) 166.B neighborhood 42S.B analytic (of a function element in the wider sense) 198.0 analytic (in a Riemann surface) 367.A conoidal 274.0 convex 364.C coordinate (in a fiber bundle) 147.B coordinate (in a topological manifold) 105.C coordinate, of class C' 10S.0 derived 65.C a- (of a point) 273.C etale 16.AA fundamental system of 425.£ open 425.£ open tubular 114.B regular 65.C regular, theorem 65.C relative 425.1 tubular 105.L 114.B 364.C neighborhood deformation retract 202.0 neighborhood retract 202.0 absolute 202.0 fundamental absolute (FANR) 382.C neighborhood system 425.B base for 425.E uniform 436.0 uniform family of 436.0 Nelson formula, Feynman-Kac- 150.F Nelson symmetry 150.F Nernst postulate 419.A Neron minimal model (of an Abelian variety) 3.N Neron-Severi group (of a surface) 15.0 (of a variety) 16.P nerve (of a covering) 70.C nested intervals, principle of (for real numbers) 87.C 355.B net (in a set) 87.H Cauchy (in a uniform space) 436.G square 304.£ universal (in a set) 87.H net premium 214.A network(s) 282425.F bilateral 282.C capacitated 281.C contact 282.B electric 282.B linear 282.C M-port 282.C passive 282.C reciprocal 282.C time-invariant 282.C two-terminal 281.C network flow model 307.C network flow problem 281 282.B network programming 264.C network scheduling 307.C Neumann function 39.B 188.H, App. A, Table 19.1II Neumann polynomial App. A, Table 19.IV Neumann problem 193.F 323.F Neumann series 217.0 neutral element (in a lattice) 243.E 2054 neutral type (of functional differential equation) 163.B Nevanlinna exceptional value 272.£ Nevanlinna first fundamental theorem 272.B Navanlinna second fundamental theorem 272.£ Nevanlinna theory (of me rom orphic functions) 124.B 272.B (for several complex variables) 21.N Newton, I. 329 Newton backward interpolation formula 223.C Newton boundary 418.0 nondegenerate 418.0 Newton-Cotes formula (in numerical integration) 299.A Newton diagram 254.0 Newton first law 271.A Newton formula (on interpolation) App. A, Table 21 Newton formula (on symmetric functions) 337.1 Newton forward interpolation formula 223.C Newtonian capacity 48.B Newtonian exterior capacity 48.H Newtonian fluid 205.C Newtonian inner capacity 48.F Newtonian interior capacity 48.F Newtonian mechanics 271.A Newtonian outer capacity 48.H Newtonian potential 271.C 338.A Newton interpolation formula App. A, Table 21 Newton interpolation polynomial 336.G Newton iterative process 301.0 Newton law (on frictional stresses) 205.C Newton law of universal gravitation 271.B Newton-Raphson method 301.0 Newton second law 271.A Newton third law 271.A Newton three laws of motion 271.A Neyman factorization theorem 396.F Neyman-Pearson lemma 400.B Neyman structure 400.D nice function (on a Coo-manifold) 114.F Nicholson formula App. A, Table 19.1V Nicholson formula, Watson- App. A, Table 19.III Nickel method, Oejon- 301.G Nicomedes conchoid 93.H Nijenhuis tensor 72.B Nikodym derivative, Radon- 270.L 380.C Nikodym property, Radon- 443.H Nikodym theorem, Radon- 270.L 380.C Nikodym theorem for vector measures, Radon- 443.H nil algebra 231.A nilmanifold 178.0 nilpotent (Lie algebra) 248.C (Lie group) 249.0 (subset of a ring) 368.B (zero-divisor) 284.A generalized (linear operator) 251.F nilpotent algebraic group 13.F nilpotent component (of a linear transformation) 269.L nilpotent element (of a ring) 368.B generalized (in a Banach algebra) 36.£ nilpotent group 190.1 finite ISl.C generalized 190.K nilpotent ideal (of a Lie algebra) 248.C 
2055 largest (of a Lie algebra) 248.D nilpotent matrix 269.F nilpotent radical (of a Lie algebra) 248.D nilradical (of a commutative ring) 67.B (of a ring) 368.H 9; symbol 353.C Nirenberg space, John- (= BMO) 168.B Nishijima-Gell-Mann formula, Nakano- 132.A Nishina formula, Klein- 351.G Nitsche formula, Gauss-Bonnet-Sasaki- 275.C niveau surface 193.1 Nobeling embedding theorem, Menger- 117.D no cycle condition 126.J nodal curve 391.H nodal domain 391.H nodal point 304.C nodal set 391.H node (of a curve) 93.G (of a graph) 186. B 282.A (of a plane algebraic curve) 9.B completion 281.0 start 281.D Noetherian domain 284.A Noetherian integral domain 284.A Noetherian local ring 284.D Noetherian module 277.1 Noetherian ring(s) 284.A left 368.F right 368.F Noetherian scheme 16.D locally 16.0 Noetherian semilocal ring 284.D Noether number, Brill- 9.E Noether theorem 150.B noise thermal 402.K white 176.D noisy channel 213.A nomograms 19.A,D non-Abelian cohomology 200.M nonadaptive scheme 299.C nonanticipative 406.0 non-Archimedean geometry 155.D non-Archimedean valuation 14.F 439.C nonassociative algebra 231.A nonatomic 168.C 443.G non-Bayesian approach 401.B noncentral (quadric hypersurface) 7.F 350.G noncentral chi-square distribution 374.B noncentral F-distribution 374.B noncentral Hotelling r 2 statistic 374.C noncentrality (sampling distribution) 374.B,C noncentrality matrix 374.C noncentral I-distribution 374.B noncentral Wishart distribution 374.C p-dimensional 374.C noncommutative field 149.A noncompact real simple Lie algebra App. A, Table 5.II noncom pact type (symmetric Riemannian homoge- neous space) 412.D noncom parable, mutually 379.L nonconforming type 304.C nonconvex quadratic programming 264.D noncooperative (game) 173.A nondecreasing function 166.A n ondegenerate (analytic mapping) 23.C Subject Index Nonstandard (bilinear form) 256.H (critical point) 106.L 279.B 286.N (function on a Hilbert manifold) 279.E (quadratic form) 348.A (representation) 437.N (sesquilinear form) 256.Q (theta-function) 3.1 nondegenerate critical manifold 279.0,E nondegenerate divisor 3.D 16.N nondegenerate hypersurface 344.A nondegenerate Newton boundary 418.0 non-Oesarguesian geometry 155.E 343.C nondeterministic (Turing machine) 31.B purely (weakly stationary process) 395.D nondeterministic linear bounded automaton 31.D nonelementary (Kleinian group) 234.A non-Euclidean angle (in a Klein model) 285.C non-Euclidean distance 285.C non-Euclidean geometry 285 non-Euclidean hypersphere 285.C non-Euclidean space 285.A nonexpansive mapping 286.B nonexpansive operator 37.C nonhomogeneous difference equation 104.C nonhomogeneous n-chain (for a group) 200.M nonincreasing function 166.A nonlinear differential equation 291.D nonlinear filter 405.F,H nonlinear functional analysis 286 nonlinear integral equation 217.M nonlinear lattice dynamics 287 nonlinear mechanics 290.A nonlinear ordinary differential equations 313.A (global theory) 288 (local theory) 289 nonlinear oscillation 290 nonlinear partial differential equations 320.A nonlinear problems 291 nonlinear programming 264.C nonlinear semigroup 88.E 378.F of operators 286.X nonmeager set 425.N nonmetric MDS 346.E nonnegative (matrix) 269.N nonnegative terms, series of 379.B non-Newtonian fluid 205.C non parametric method 371 nonparametric test 371.A nonpositive curvature 178.H G-space with 178.H nonprimitive character 450.C,E nonrandomized (decision function) 398.A nonrandomized estimate 399.B nonrandomized test 400.A nonrecurrent (chain) 260.B nonrecurrent (transient) 26O.B nonresidue, quadratic 297.H nonsaddle set 126.E nonsingular (flow) 126.G (point for a flow) 126.D (point of a variety) 16.F nonsingular mapping of class C l 208.B nonsingular matrix 269.B nonsingular transformation (of a linear space) 256.B (on a measure space) 136.B nonsingular variety 16.F nonstandard 33.B (element) 293.B 
Subject Index Nonstandard analysis nonstandard analysis 293 nonstandard natural number 276.E nonstandard real number 276.E nonstandard set theory 293.E nonstationary oscillations 290.F nonsymmetric unified field theory 343.C nontangential maximal function 168.B non tangential path 333.B nontrivial (3-manifold) 65.E nontrivially (to act on a G-space) 431.A non wandering 126.E set 126.E N6rlund method of summation 379.Q norm (of an algebraic element) 149.1 (of an element of a general Cayley algebra) 54 (of an element of a quaternion algebra) 29.0 (of an operator) 37.C (of a separable algebraic element) 149.1 (of a vector) 37.B absolute (of an integral ideal) 14.C C*-cross 36.H C'- 126.H graph 251.0 Hilbert-Schmidt 68.1 minimum, property 223.F nuclear 68.K pseudo- (on a topological linear space) 424.F reduced (of an algebra) 362.E relative (of a fractional ideal) 14.1 semi- (on a topological linear space) 424.F spinorial 61.0 supremum 168.B trace 68.1 uniform 168.B normal 62.C lLO.E 354.F (almost contact structure) 110.E (analytic space) 23.0 (current) 275.G (fundamental region) 122.B (*-isomorphism) 308.C (state) 351.B (for a valuation) 439.H (weight on a von Neumann algebra) 308.0 affine 110.C affine principal 1l0.C analytically 284.0 principal 111.F normal algebraic variety 16.F normal analytic structure 386.C normal basis 172.E normal block bundle 147.Q normal bundle (of a foliation) 154.B,E (of an immersion) 114.B (of a submanifold) 105.L 274.E 364.C normal Cartan connection 80.N normal chain (in a group) 190.G (in a Markov chain) 260.0 normal commutation relation 150.0 normal connection 365.C normal contact Riemannian manifold 1l0.E normal continued fraction 83.E normal coordinate(s) 90.C mapping 364.C normal covering 425.R normal crossings 16.L only 16.L normal curvature (of a surface) II1.H 2056 normal density function 397.0 normal derivative 106.G normal distribution 341.D 397.0, App. A, Table 22 k-dimensional 341.0, App. A, Table 22 logarithmic App. A, Table 22 multidimensional App. A, Table 22 standard 341.0 normal duration 28.1 normal equation (in the method of least squares) 302.E 403.E (in statistical data analysis) 397.1 normal estimator, best asymptotically 399.K normal estimator, consistent and asymptotically 399.K normal extension 149.G 251.K normal extension field, strongly 113 normal family 435.E normal fiber space, Spivak 144.1 normal form (of differential equations) 313.B 324.E (of a surface) 410.B Cantor (for an ordinal number) 3l2.C Hesse (of a hyperplane) 139.H Jordan (for a matrix) 269.G n-adic (for an ordinal number) 312.C prenex (in predicate logic) 41l.J normal form theorem, Kleene 356.C normal frame 110.B normal function (of ordinal numbers) 312.C normal g-lattice 27.A normal invariant 114.1 normality, asymptotic 399.K normalization (of an analytic space) 23.0 (of a variety) 16.F normalization theorem for finitely generated rings 369.0 for polynomial rings 369.0 normalized (function) 317.A (into an orthonormal set) 197.C (vector) 139.G normalized contrast 102.C normalized valuation 439.E normalizer 136.F 190.C normalj-algebra 384.C normal k-vector bundle 114.J normal line 93.G, App. A, Table 4.1 normal linear model 403.C normally cobordant 114.1 normally distributed, asymptotically 399.K normally flat along a subscheme (a scheme) 16.L normal mapping (map) 114.1 normal matrix 269.1 normal model, derived (of a variety) 16.F normal Moore space problem 425.AA normal number 354.F normal operator 390.E (of Sario) 367.G normal PL microbundle 147.P normal plane 11l.F normal point 16.F 23.0 normal polygon 234.C normal process 176.C normal real form (of a complex semisimple Lie algebra) 248.Q normal representation 308.C normal ring 67.1 normal score test, Fisher-Yates-Terry 371.C normal section 410.B 
2057 normal sequence (of coverings) 425.R normal simple algebra 29.E normal space 425.Q collectionwise 425.AA completely 425.Q fully 425.X hereditarily 425.Q perfectly 425.Q normal sphere bundle 274.E normal stress 271.G normal structure 276.0 normal subgroup 190.C admissible 190.E normal system (of £-functions) 430.0 normal transformation (of a sequence) 379.L normal valuation 439.E,H normal variety 16.F normal vector 105.L I11.H 364.A normal vector bundle 105.L normal vibration 318.B normed linear space 37.B normed ring 36.A normed space, countably 424.W normed vector lattice 31O.F norm form 118.0 normic form 118.F norm-residue 14.P norm-residue symbol 14.Q (in local class field theory) 257.F Hilbert 14.R Hilbert-Hasse 14.R norm resolvent convergence 331.C northern hemisphere 140 north pole 74.0 140 notation full international 92.E Kendall 260.H Schoen flies (for crystal classes) 92.E, App. B, Table 6.1V short international 92.E system of (for ordinal numbers) 81 notion, common 35.A Novikov closed leaf theorem 154.D nowhere dense set 425.N NP 71.E co- 71.E NP-complete 71.E NP-completeness 71.E NP-hard 71.E NP-space 71.E NP-time 71.E NR (neighborhood retract) 202.0 nuclear (C*-algebra) 36.H nuclear class 68.1 nuclear norm 68.K nuclear operator 68.1,K nuclear space 424.S nucleolus 173.0 null (vector in the Minkowski space-time) 359.B null-bicharacteristic 320.B null boundary, open Riemann surface of 367.E null cobordant 235.G null function 310.1 null geodesic 399.D null homotopic (continuous mapping) 202.B null hypothesis 400.A nullity (of a critical point) 279.B (of a graph) 186.0 (of a linear mapping) 256.F Subject Index Number(s) (of a linear operator) 251.0 (of a matrix) 269.0 column (of a matrix) 269.D of relative 365.0 row (of a matrix) 269.0 null recurrent (point) 260.0 null sequence (in a-adic topology) 284.B null set 270.0310.1 381.A of class N 169.E function-theoretic 169 null space 251.0 null system 343.E number(s) 294 A- 430.C abundant 297.D algebraic 14.A amicable 297.D average sample 404.C azimuthal quantum 315.E Bell 177.0 Bernoulli 177.B Betti 200.K 201.B Brill-Noether 9.E calculable 22.0 Cantor's theory of real 294.E cardinal 49.A 312.0 Cayley 54 characteristic (of a compact operator) 68.1 characteristic (of a manifold) 56.F Chern 56.F chromatic 157.E 186.1 class (of an algebraic number field) 14.E class (of a Oedekind domain) 67.K class (of a simple algebra) 27.0 Clifford 61.A coincidence (of a mapping) 153.B of colors 92.0 completeness of real 294.E complex 74.A 294.F composite 297.B condition 302.A connectedness of real 294.E 355.B continuity of real 294.E cyclomatic 186.0 decomposition (of a finite group) 362.1 Dedekind's theory of real 294.E deficient 297.0 of denominator 186.1 Euler 1 nc 201.B, App. B, Table 4 Fermat 297.F Froude ]16.B generalized decomposition (of a finite group) 362.1 geometry of 182 06del 185356.C,E Orashoff 116.B imaginary 74.A incidence 146.B 201.B of independence 186.1 initial 312.0 intersection (of divisors) 15.C intersection (of homology classes) 65.B 201.0 intersection (of sheaves) 16.E irrational 294.E 355.A irrational real 294.E of irregularity (of an algebraic variety) 16.P Kullback-Leibler information 398.0 Lebesgue 273.F Lefschetz 153.B Lefschetz (of a variety) 16.P 
Subject Index Number field linking 99.C Liouville 430.B Lyapunov characteristic 314.A Mach 116.B 205.B magnetic Reynolds 259 mean (of sheets) 272.1 Mersenne 297.E, App. B, Table 1 Milnor 418.0 modulus 418.E mole 419.A n-gonal 296.A Napier 131.0 natural 294.A,B negative 355.A negative rational 294.0 normal 354.F Nusselt 116.B orbital magnetic quantum 315.E ordinal 312.B p-adic 439.F of partitions 177.0 328 Peclet 116.B pentagonal 4.0 perfect 297.0 perfect, of the second kind 297.0 Picard (of a variety) 15.0 16.P Poisson 271.G polygonal, of order K 4 5 Pontryagin 56.F positive 355.A positie rational 294.0 Prandtl 116.B prime 297.B principal quantum 315.E pseudorandom 354.B Pythagorean 145 ramification 14.K random 354 rational 294.0,E rational real 294.E real 294.E 355.A,0 real, mod 1 355.0 relatively prime 297.A of replications 102.B Reynolds l16.B 205.C rotation 99.0 111.E 126.1 S- 430.C S*- 430.C self-intersection 15.C of sheets (of an anal ytic covering space) 23.E of sheets (of covering surface) 367.B Stiefel- Whitney 56.F Stirling, of the second kind 66.D T- 430.C p- 430.C Tamagawa B.P transcendental 430.A translation 18.B,0 of treatment combinations 102.L type 314.A v- 430.C V*- 430.C wave (of a sine wave) 446 wave, vector (of a sine wave) 205.F weakly compact cardinal 33.E weakly inaccessible cardinal 33.E Weij 3.C number field 149.C algebraic 14.B 2058 p-adic 439.F p-adic 257.A 439.F relative algebraic 14.1 numbering, G6del 185.A number operator 377.A number system, point range of 343.C number-theoretic function(s) 295.A 356.A additive 295.B completely additive 295.B completely multiplicative 295.B multiplicative 295.B number theory 296 analytic 296.B consistency proof for pure 156.E elementary 297 fundamental theorem of elementary 297.C geometric 296.B pure 156.E numerals, Arabic 26 numerator, partial (of an infinite continued fraction) 83.A numerical analysis 300 numerical differentiation 299.E numerical integration 299 numerically connected (divisor) 232.0 numerically equivalent (cycles) 16.Q numerically semi positive 15.0 numerical method 300 numerical range (of a linear operator) 251.E numerical solution of algebraic equations 301 of integral equations 217.N of linear equations 302 of ordinary differential equations 303 of partial differential equations 304 numerical tensor App. A, Table 4.11 Nusseltnumber 116.B nutation 392 Nyquist criterion 86.A Nyquist theorem 402.K o (1)(n) (space ofholomorphic functions in n) 168.B (1)p(n) 168.B O(n) (orthogonal group) 60.1 w-connected space 79.C 10cally 79.C w-consisten t (system) 156. E w-limit point 126.0 w-limit set 126.0 n-conjugate 126.H n-equivalent 126.H n-explosion 126.1 n-group 190.E n-homomorphism (between n-groups) 190.E n-isomorphism (between n-groups) 190.E n-modules, duality theorem for 422.L n-stability theorem 126.1 n-stable, C- 126.H n-subgroup (of an n-group) 190.E 0- ideal integrated two-sided 27.A two-sided 27.A oi-ideal, left 27.A or-ideal, right 27.A D-differential (on an algebraic curve) 9.F D-genus (of an algebraic curve) 9.F D-linearly equivalent divisors (on an algebraic curve) 9.F 
2059 D-specialty index (of a divisor of an algebraic curve) 9.F (I)-module 383.1 (o)-convergent 87.L (o)-star convergent 87.L OA (orthogonal array) 102.L Ob (object) 52.A object 52.A 411.G co final 52.0 final 52.0 graded 200.B group (in a category) 52.M initial 52.0 injective 200.1 isomorphic 52.D mathematical 52.A in predicate logic 411.G projective 200.1 quotient 52.0 So, category of 52.G of type j + 1 356.F of type a 356.F zero 52.N object domain 411.G objective function 264.B 307.C objective probability 401.B object variable 411. G oblate App. A, Table 3.V oblique circular cone 350.B oblique coordinates (in a Euclidean space) 90.B observability 86.C observables 351.B observation complete 405.C cost of 398.F partial 405.C observation process 405.F observation vector 102.A observer, Luenburger 86.E obstacle, Oirichlet problem with 440.B obstruction(s) 305 to an n-dimensional homotopy 305.B to an (n + I)-dimensional extension 305.B primary 147.L 305.C secondary 305.0 surgery 114.1 tertiary 305.0 obstruction class 56.E obstruction cocycle 147.L 305.B obtuse angle (in Euclidean geometry) 139.0 OC-curve (operative characteristic curve) 404.C octahedral group 151.G octahedron 357.B odd element (of a Clifford algebra) 61.B odd function 165.B odd half-spinor 61.E odd half-spin representation 61.E odd permutation (in a symmetric group) 151.G odd ratio 397.K odd state 315.H of bounded variation 443.G Ohm's law (for a moving medium) 130.B 259 Oka's principle 147.0 Oka's theorem 72.E I-complete manifold, weakly 21.L one cycle 16.R one-dimensional diffusion processes 115.A one-dimensional lattice 287.A one-dimensional probability distribution (of random variables) 342.C Subject Index Operation(s) one-dimensional statistic 396.B l00IX%-point 396.C I-I (mapping) 381.C one-parameter group local (of local transformations) 105.N of transformations 105.N 126.B one-parameter semigroup of class (CO) 378.B one-parameter subgroup (of a Lie group) 249.Q one-parameter variation 178.A one-point compactification 425.T one-point union 202.F one-sided (surface) 410.B one-sided stable for exponent 1/2 App. A, Table 22 one-sided stable process (of the exponent IX) 5.F one-step-two-half-steps errors estimate 303.0 one-to-one correspondence 358. B one-to-one mapping 381.C only normal crossings 16.L Onsager reciprocity relation 402.K onto mapping 381.C open (Riemann surface) 367.A (system) 419.A (topological manifold) 105.B finely 261.0 Zariski 16.A open arc 93 B open ball 140 open base 425.F open circle 140 open continuous homomorphism 423.1 open covering (of a set) 425.R open disk 140 open formulas 199.A opening 186.E open interval 140 355.C open mapping 425.G open mapping theorem (in Banach space) 37.1 (in topological linear spaces) 424.X open n-ball 140 open n-cell 140 open n-disk 140 open n-sphere 140 open neighborhood 425.E open parallelotope (in an affine space) 7.0 open set 425.B basic 425.F relative 425.1 system of 425.B open simplex 7.070.C open sphere 140 open star (in a complex) 70.B,C open subgroup (of a topological group) 423.0 open surface 410.B open system entropy 402.G open tubular neighborhood 105.L 114.B operate (in a function algebra) 192.N from the left (on a set) 362.B from the right (on a set) 362.B operating characteristic 404.C operating function 192.N operating systems 75.C operation(s) (of an operator domain on a module) 277.C (on a set) 409.A Adams 237.E Bokshteln 64.B Boolean 42.A 
Subject Index Operation A (in set theory) cohomology 64 compatible with 277.C four arithmetic 294.A functional cohomology 202.S functional $- 202.S glide 92.E homotopy 202.0 left 409.A primary cohomology 64.B Pontryagin (pth) power 64.B primitive (of a group) 362.B rational 294.A reduced square 64.B right 409.A ring 368.A stable cohomology 64.B stable primary cohomology 64.B stable secondary cohomology 64.C Steenrod (pth) power 64.B Steenrod square 64.B transitive (of a group) 362.B operation A (in set theory) 22.B operational calculus 251.0306, App. A, Table 12.II operator (in functional analysis) 162251.A (on a set) 409.A Abelian 308.E accretive (in a Hilbert space) 286.C additive 251.A adjoint (in Banach spaces) 37.D 251.0 adjoint (in Hilbert spaces) 251.E adjoint (of a linear partial differential operator) 322.E adjoint (of a microdifferential operator) 274.F adjoint (of a micro local operator) 274.F amplification (of the scheme) 304.F angular momentum 258.0 annihilation 377.A Beltrami differential, of the first kind App. A, Table 4.II BeJtrami differential, of the second kind App. A, Table 4.II boundary 200.C 201.B with a boundary condition 112.F bounded linear 37.C Calderon-Zygmund singular integral 217.1 251.0 Cartier 9.E channel wave 375.F closable 251. D closed 39.1251.0 closure 425.B coboundary 200.F compact 68 completely continuous 68.B conjugate (in Banach spaces) 37.0 conjugate (of a differential operator) 125.F conjugate (of a linear operator) 251.0 conjugation (in function algebras) 164.K creation 377.A decomposable (on a Hilbert space) 308.0 degeneracy (in a semisimplicial complex) 70.E diagonalizable (in an Abelian von Neumann algebra) 308.0 differential 112 223.C 306.B differential, of the kth order 237.H differentiation 223.C dissipative 286.C domain (of an n-group) 190.E domain of 409.A 2060 down-ladder 206.B dual (in Banach spaces) 37.0 dual (of a differential operator) 125.F dual (of a linear operator) 251.0 elliptic 112.A energy-momentum 258.0 evolution 378.0 exponential function of 306.C face (in a semisimplicial complex) 70.E formal adjoint 322.E 4-momentum 258.0 Fourier integral 274.C Fredholm 68.F 251.0 fundamental 163.E generalized wave 375.B Oreen's 189.A,B 194.C Hamiltonian 351.0 Hecke 32.0 Hermitian 251.E Hilbert's a- 411.J holomorphic evolution 378.1 identity (on a Banach space) 37.C incoming wave 375.B with index 68.F integral 68.N l00.E 251.0 306.B integral, of Hilbert-Schmidt type 68.C interior 425.B inverse 37.C 251.B isometric 251.E Laplace 323.A 442.0 Laplace-Beltrami 194.B linear 251 linear (in Banach spaces) 37.C linear (in linear spaces) 256.B linear boundary 315.B linearized 286.E local 125.00 logical 411.E Markov 136.B maximal (of a differential operator) 112.E maximal dissipative 251.J microdifferential 274.F microlocal 274.F microlocallyelliptic 345.A Mikusinski's 306.B minimal (of a differential operator) 112.E modified wave 375.B modular 308.H monotone (in a Hilbert space) 286.C nonlinear semigroups of 286.X nonnegative 251.E normal 390.E normal (of Sario) 367.G normal linear 251.E nuclear 68.I,K number 377.A ordinary differential 112.A outgoing wave 375.B partial differential 112.A positive (in vector lattices) 310.E positive semidefinite 251.E projection (in a Hilbert space) 197.E pseudodifferential 251.0 345 pseudodifferential (in microlocal analysis) 274.F resolvent (of a Markov process) 261.D ring of 308.C S- 150.D scalar 390.K scattering 375.F,H 
2061 Subject Index Order Schrodinger 351.0 self-adjoint 251.E 390.£ shift 223.C 251.0 306.C spectral 390.K Steenrod App. A, Table 6.II step-down 206.B step-up 206.B strongly elliptic 112.G 323.H Sturm-Liouville 112.1 of summable pth power 68.K symmetric 251.£ system of differential 112.R T- 375.C TCP 150.0 Toeplitz operator 251.0 total boundary 200.E trace 168.B translation 306.C transposed 112.£ 189.C 322.E unilateral shift 390.1 unitary 390.E up-ladder 206.B Volterra 68.1 wave 375.B,H operator algebra 308.A operator convex 212.C operator domain 277.C module with 277.C operator homomorphism (of A-modules) 277.E (of n-groups) 190.E operator isomorphism 190.E operator monotone 212.C operator topology strong 251.C uniform 251.C weak 251.C operator-valued distribution 150.0 opposite (simplex) 201.C orientation 105.F root 13.R optical axis 180.B optical direction cosines 180.A optical distance 180.A optical theorem 386.B optics, geometric 180 optimal (design) 102.£ asymptotically 354.0 optimal control 46.0 86.B,C 405.A optimal control problem, time 86.£ optimality A- 102.E D- 102.E E- 102.E principle of 127.A optimal policy 127.A optimal regular problem 86.F optimal solution 255.A 264.B 292.A basic 255.A optimal stopping 405.E optimization model 307.C optimum allocation 373.E optimum predictor, linear 395.0 optional (stochastic process) 407.B optionalO'-algebra 407.B optional sampling 262.C optional sampling theorem 262.A orbit (of a dynamical system) 126.B (of a permutation group) 151.H (= system of transitivity) 362.B (ofa topological transformation group) 110.A 431.A closed 126.0 exceptional 431.C principal 431.C pseudo-, iX- 126.J pseudo-, tracing property 126.1 singular 431. C orbital angular momentum 351.£ orbital elements, Kepler's 309.B orbitally stable 126.F orbital stability (of a solution of a differential equation) 394.0 orbit determination 309.A orbit space (of a topological group) 431.A orbit type 431.A principal 431.C order (of an algebraic number field) 14.B (of a covering) 425.R (of a differential equation) 313.A 320.A (of a differential operator) 112.A (of an element of a group) 190.C (of an elliptic function) 134.E (of a function defined by a Dirichlet series) 121.C (of a function on an algebraic curve) 9.C (of a generating point of a simple maximally overdetermined system) 274.H (of a group) 190.C (of a homomorphism of Abelian varieties) 3.C (of an infinitesimal) 87.G (of an infinity) 87.G (of a Lie algebra) 191.0 (of a meromorphic function) 272.C (of a microdifferential operator) 274.F (ofa multistep method) 303.E ( = order relation) 311.A (ofa plane algebraic curve) 9.B (ofa point in an ordinary curve) 93.C (of a point with respect to a cycle) 99.0 (of a pole of a complex function) 198.0 (of the precision of numerical solution) 303.B (=a subring) 27.A (of a system of differential equations) 313.B (of a transcendental entire function) 429.B (of a zero point of a complex function) 198.C d' Alembert's method of reduction of 252.F derivatives of higher App. A, Table 9.m difference of the nth 104.A finite (distribution) 125.1 y-point of the kth (of a holomorphic function) 198.C of higher 87.G infinite (element in a group) 190.C left (of a g-Iattice) 27.A of lower 87.G maximal (of a g-Iattice) 27.A at most (a function) 87.G of the nth 87.G principal (of an algebraic number field) 14.B principal (fundamental theorem of) 14.C right (ofa g-Iattice) 27.A of the same 87.G small set of 436.G space of line elements of higher 152.C surface of the second 350.A 
Subject Index Order C( zero point of the kth (of a holomorphic func- tion) 198.C zero point of the -kth (of a complex function) 198.0 order a: capacity of 169.C Cesaro method of summation of 379.M Holder condition of 84.A Lipschitz condition of 84.A potentia] of 338.B summable by Cesaro's method of 379.M order k coefficient of 110.A converge in the mean of 173.B 342.0 invariants of 110.A population moment of 396.C principal components of 110.A quantile of 341.H Riesz method of summation 379.R summable by Holder's method of 379.M summable by M. Riesz's method of 379.R order p contravariant tensor field of 105.0 jet of 105.X order s, covariant tensor field of 105.0 order 0, frame of 110.C order 1 family of frames of 110.B frame of 11O.C order 2, frame of 110.B,C order 3, frame of 110.B,C order 4, frame of 110.B order bounded 310.B order convergent sequence (in a vector lattice) 310.C order-disorder transition 402.F ordered additive group 439.B totally 439.B ordered complex (of a semisimplicial complex) 70.E ordered field 149.N Archimedean 149.N Pythagorean 60.0 ordered group 243.G lattice- 243.G totally 243.G ordered linear spaces 310.B lattice- 310.B ordered pair 33.B 381.B ordered set 311.A inductively 34.C lattice- 243.A linearly 311.A partially 311.A semi- 311.A totally 311.A ordered simplex (in a simplicial complex) 70.E ordered simplicial complex 70.C order function (meromorphic function) 272.B order homomorphic (ordered sets) 311.E order homomorphism 311.E order ideal (of a vector lattice) 310.B ordering 96.C 311.A dual 311.A duality principle for 311.A lexicographic 311.G lexicographic linear 248.M linear 311.A partial 311.A 2062 pre- 311.H total 311.A well- 311.C order isomorphic (ordered sets) 311.E order isomorphism 311.E order limit (in a vector lattice) 310.C order-preserving mapping 311.E order-preserving semigroup 286.Y order relation 311.A order statistic 396.C order topology 425.C order type 312.A ordinal numbers 312.B admissible 356.G cardinality of 49.E constructive 81.B countable 49.E finite 312.B of the first, second, or third number class 312.0 of a higher number class 312.0 hyperconstructive 81.E initial 49.E isolated 312.B limit 312.B strongly inaccessible 312.E transfinite 312.B transfinite initial 49.E weakly inaccessible 312 E ordinal product (of a family of ordered sets) 311.G ordinal scale 397.M ordinal sum (of a family of ordered sets) 311.G ordinary curve 93.C ordinary derivative (of a set function) 380.0 ordinary differential equation(s) 313, App. A, Table 14 (asymptotic behavior of solutions) 314 (boundary value problems) 315 (initial value problems) 316 Euler linear App. A, Table 14.1 higher-order App. A, Table 14.1 homogeneous App. A, Table 14.1 homogeneous (of higher order) App. A, Table 14.1 linear 252 313.A linear (with constant coefficients) App. A, Table 14.1 linear (of the first order) App. A. Table 14.1 linear (global theory) 253 linear (of higher order) App. A, Table 14.1 linear (local theory) 254 nonlinear 313.A nonlinear (global theory) 288 nonlinear (local theory) 289 system of 313.B ordinary differential operator 112.A ordinary Oirichlet series 121.A ordinary double point (of a plane algebraic curve) 9.B ordinary element 191.1 ordinary helicoid ll1.I ordinary helix 111.F 114.F ordinary integral element 428.E ordinary integral manifold (ofa differential ideal) 428.E ordinary lower derivative (of a set function) 380.0 ordinary point (of an analytic set) 23.B (of a curve) 93.G 
2063 (in hyperbolic geometry) 285.C (of an ordinary curve) 93.C (on a Riemann surface) 11.0 ordinary representation (of a finite group) 362.G ordinary sense, derivable in the 380.0 ordinary singularity (of an analytic function) 198.P in the wider sense 198.P ordinary solution (of a differential ideal) 428.E ordinary upper derivative (ofa set function) 380.0 ordinate set 221.E orientable (manifold) 105.F 201.N (pseudomanifold) 65.B transversely 154.B orientable fiber bundle 147.L orientation (of an affine space) 139.B (of a contact element) 110.A (of a manifold) 105.F 201.N local (in an oriented manifold) 201.N negative (of an oriented manifold) 105.F opposite (of oriented atlases) 105.F positive (of an oriented manifold) 105.F same (of oriented atlases) IOS.F orientation cohomology class 201.N orientation manifold 201.N orientation sheaf 201.R orientation-preserving mapping 99.A orientation-reversing mapping 99.A oriented atlas (of an orientable differentiable mani- fold) 105.F oriented cobordism class 114.H group 114.H oriented differentiable structures, group of (on the combinatorial sphere) 114.1 oriented element (in a covering manifold) 110.A oriented G-manifold 431.E oriented graph 186.B oriented manifold 105.F 201.N integrals over 105.T oriented pseudo manifold 65.B coherently 65.B oriented q-simplex 201.C oriented real hypershpere 76.A oriented segment 442.A oriented simplicial chain complex 201.C oriented singular r-simplex of class Coo 105.T oriented tangent line 76.B origin (of an affine space) 7.C (of a Euclidean space) 140 (of a projective frame) 343.C Orlicz-Pettis theorem 443.0 Orlicz space 168.B Ornstein-Uhlenbeck Brownian motion 45.1 orthant, positive 89.G orthochronous 258.A orthocomplement (of a subs pace of a linear space) 139.G orthogonal (block design) 102.J (elements of a ring) 368.B (in Euclidean geometry) 139.E,G (functions) 317.A (in a Hilbert space) 197.C (linear subspaces) 256.G mutually (latin squares) 241.B orthogonal array 102.L Subject Index Orthooomic system, passive orthogonal complement (of a subset of a Hilbert space) 197.E orthogonal component (of an element of a linear space) 139.G orthogonal coordinate system adapted to (a flag) 139.E orthogonal curvilinear coordinates 90.C orthogonal curvilinear coordinate system App. A, Table 3.V orthogonal expansion 317.A orthogonal for a finite sum 19.G orthogonal fractional factorial design 102.1 orthogonal frame 111.B 139.E orthogonal frame bundle 364.A tangent 364.A orthogonal function(s) 317, App. A, Table 20 Haar system o.f 317.C Rademacher system of 317.C Walsh's system of 317.C orthogonal group 60.1 139.B 151.1 (over a noncommutative group) 60.0 complex 60.1 complex special 60.1 infinite 202.V over K with respect to Q 60.K pair 422.1 proper 60.1 258.A reduced 61.0 special 60.1 orthogonality for a finite sum 317.0, App. A, Table 20. VII orthogonality relation (on irreducible characters) 362.G (for square integrable unitary representations) 437.M orthogonalization Gram-Schmidt 317.A Schmidt 317.A orthogonal k-frame (in R") 199.B orthogonal matrix 269.1 complex 269.1 proper 269.1 orthogonal measure 164.C orthogonal polynomial(s) 19.G, App. A, Table 20. VII Chebyshev 19.G simplest 19.G system of 317.0 orthogonal projection (in Euclidean geometry) 139.E,G (in a Hilbert space) 197.E method of 323.G orthogonal series (of functions) 317.A orthogonal set (of functions) 317.A (of a Hilbert space) 197.C (of a ring) 368.B orthogonal system (of functions) 317.A (of a Hilbert space) 197.C complete 217.G orthogonal trajectory 193.1 orthogonal transformation 139.B 348.B (ovcr a noncommutative field) 60.0 (with respect to a quadratic form) 60.K orthogonal transformation group 60.1 over K with respect to Q 6O.K orthomodular 351.L orthonomic system, passive (of partial differential equations) 428.B 
Subject Index Orthonormal basis orthonormal basis 197.C orthonormalization 139.G orthonormal moving frame 417.0 orthonormal set (offunctions) 317.A (ofa Hilbert space) 197.C complete (of a Hilbert space) 197.C orthonormal system complete 217.G complete (of fundamental functions) 217.G orthorhombic system 92.E oscillate (for a sequence) 87.0 oscillating (series) 379.A oscillating motion 420.0 oscillation(s) 318 (of a function) 216.A bounded mean 168.B damped 318.B equation of App. A, Table 15.VI forced 318.B harmonic 318.B nonlinear 290.A relaxation 318.C nonstationary 290.F oscillator process 351.F oscillatory 314.F osculating circle I11.F osculating elements 309.0 osculating plane II1.F osculating process 77.B Oseen approximation 205.C Osgood theorem, Hartogs- 21.H O-S positivity 150.F OsterwaJder-Schrader axioms 150.F Ostrogradskii formula 94.F outdegree 186.B outer area 216.F 270.G outer automorphisms group of (of a group) 190.0 group of (of a Lie algebra) 248.H outer capacity, Newtonian 48.H outer function 43.F outer harmonic measure 169. B outer measure 270.E,G Caratheodory 270.E Lebesgue 270.G outer solution 25.B outer variable 25.B outer volume 270.G outgoing subspace 375.H outgoing wave operator 375.B outlier test 397.Q out-state 150.0 386.A oval 89.C II1.E Cassini 93.H mean (of two ovals) 89.0 width of the 111.E ovaloid 89.C 111.1 overall approximation formula 303.C overconvergence 339.E overcrossing point 235.A overdetermined system (of differential operators) 112.1 (of partial differential equations) 320.F maximally (= holonomic) 274.H overfield 149.B overidentified 128.C overre]axation successive (SOR) 302.C 2064 p PSL(n, k) (projective special linear group) 6O.B PSp(n, k) (projective symplectic group over K) 60.L PV(n) (projective unitary group) 60.F P"(K) (projective space) 343.H <p-subsequence 354.E n-group 151.F n-length (of a group) 151.F n-manifold 114.1 n-series (of a group) 151.F n-solvable group 151.F n theorem 116 n topology 424.R III set 22.A Jl set 22.0 p-adic exponential valuation 439.F p-adic extension (of the field of quotients of a Oedekind domain) 439.F p-index (of a central simple algebra over a finite aJgebraic number field) 29.G p-invariant (of a central simple algebra over a flUite algebraic number field) 29.G p-primary ideal 67.F \j3-function, Weierstrass 134.F, App. A, Table 16.1V &I'-acyclic 200.Q p-adic integer(s) 439.F ring of 439.F p-adic L-function 450.1 p-adic number 439.F p-adic number field 257.A 439.F p-adic regulator 450.1 p-adic valuation 439.F p-ary matroid 66.H p-atom 168.B p-covector 256.0 p-dimensional noncentral Wishart distribution 374.C p-extension (of a field) 59.F p-factor (of an element of a group) 362.1 p-fold exterior power (of a linear space) 256.0 (of a vector bundle) 147.F p-form tensorial 417.C vectorial 417.C p-group 151.B Abelian 2.A complete (Abelian) 2.0 divisible (Abelian) 2.0 p-parabolic type 327.H p-rank (of a torsion-free additive group) 2.E p-regular (element of a finite group) 362.1 p-Sylow subgroup 151.B pth power, operator of summable 68.K pth power operation Pontryagin 64.B Steenrod 64.B p-torsion group of an exceptional group App. A, Table 6.1V p-valent (function) 438.E absolutely 438.E circumferentially mean 438.E locally 438.E locally absolute 438.E mean 438.E quasi- 438.E p-vector 256.0 
2065 bundle of 147.F (p, q)-ball knot 235.G (p, q)-knot 235.G (p + 1 )-stage method 303 0 P-convex (for a differential operator) 112.C strongly 112.C P-function, Riemann 253.B, App. A, Tables 14.II 18.1 P-projective resolution 200.Q P-wave 351.E p" set 22.0 P'-figure 343.B p-space 425.Y P-space 425.Y Pade approximation 142.E Pade table 142.E Painleve equation 288.C Painleve theorem 198.G Painleve transcendental function 288.C pair 381.B (in axiomatic set theory) 33.B ball 235.0 BN- 13.R contact (in circle geometry) 76.C group (of topological Abelian groups) 422.1 order 381.B ordered (in axiomatic set theory) 33.B orthogonal group 422.1 Poincare, of formal dimension n 114.1 simplicial 201 L sphere 65.0 235.G topological 201.L unordered 381.B unordered (in axiomatic set theory) 33.B paired comparison 346.C pairing (of linear spaces) 424 G axiom of 381.G pair test 346.0 pairwise sufficient (statistic) 396.F Palais-Smale condition (C) 279 £ 286.Q Paley theorem 317.B Paley theory, Littlewood- 168.B Paley-Wiener theorem 125.0,BB pantograph 19.E paper binomial probability 19.B functonal 19.0 logarithmic 19.F probability 19.F semilogarithmic 19.F stochastic 19.B Pappus theorem (on conic sections) 78.K (in projective geometry) 343.C parabola(s) 78.A family of confocal 78.H parabolic (differential operator) 112.A (Riemann surface) 367.o,E (simply connected domain) 77 B (visibility manifold) 178.F parabolic coordinates 90.C, App. A, Table 3.V parabolic cusp (of a Fuchsian group) 122.C parabolic cylinder 350.B parabolic cylinder function 167.C parabolic cylindrical coordinates 167.C, App. A, Table 3 V parabolic cylindrical equation App. A, Table 14.II Subject Index Parameter(s) parabolic cylindrical surface 350.B parabolic-elliptic motion 420.0 parabolic geometry 285.A parabolic motion 420.0 parabolic point (on a surface) 110.B I11.H parabolic quadric hypersurface 350.1 parabolic subalgebra (of a semisimple Lie algebra) 248.0 parabolic subgroup (of an algebraic group) t3.G (of the BN-pair) 13.R (of a Lie group) 249.1 cuspidal 437,X minimal k- 13.Q standard k- 13.Q parabolic transformation 74.F parabolic type (equation of evolution) 378.1 partial differential eq uation of 327 paraboloid elliptic 350.B elliptic, of revolution 350.B hyperbolic 350.B paracompact (space) 425.S countably 425.Y strongly 425.S paracompact C'-manifold 105.0 paradox(es) 319 Burali-Forti 319.B d'Alembert 205.C Richard 319.B Russel 319.B Skolem 156.£ Zeno 319.C parallax annual 392 geocentric 392 parallel(s) (affine subs paces) 7.B (lines) 139.A 155.B (lines in hyperbolic geometry) 285.B (tensor field) 364.B axioms of 139.A in the narrow sense (in an affine space) 7.B in the sense of Levi-Civita 111.H in the wider sense (in an affine geometry) 7.B parallel coordinates (in an affine space) 7.C parallel displacement (in an affine connection) 80.H (in a connection) 80.C (in the Riemannian connection) 364.B parallelepiped, rectangular 14.0 parallelism, absolute 191. B parallelizable (flow) 126.E (manifold) 114.1 almost 114.1 s- 114.1 stably 114.1 parallelogram, period 134.E parallelotope 425.T (in an affine space) 7.0 open (in an affine space) 7.0 parallel projection (in an affine space) 7.C parallel translation 80.C 364.B parameter(s) 165.C (of an elliptic integral) 134.A (in a population distribution) 401.F (of a probability distribution) 396.B 
Subject Index Parameter space acceleration 302.C canonical (of an arc) 111.0 design for estimating 102.M distinct system of 284.0 estimable 403.E isothermaJ 334.B isothermal (for an analytic surface) 111.1 334.B Lagrange's method of variation of 252.0 linear 102.A linearly estimable 403.E local (Fuchsian groups) 32.B locaJ (of a nonsingular algebraic curve) 9.C local (of a Riemann surface) 367.A locaJ canonical (for power series) 339.A local uniformizing (of a Riemann surface) 367.A location 396.1 400.E one- (group of transformations) 10S.N one- (subgroup of a Lie group) 249.Q one-, semigroup of class (Co) 378.B of regularity (of a Lebesgue measurable set) 380.0 regular system of 284.0 scaJe 396.I400.E secondary 11 O.A selection 396.F system of 284.0 time (of a stochastic process) 407.A transformation 396.1 transformation of 111.0 true vaJue of 398.A parameter space (of a family of compact complex manifolds) 72.G (of a family of probability measures) 398.A (of a probability distribution) 396.B parametrically sustained vibration 318.B parametric function 102.A 399.A parametric programming 264.C parametric representation 165.C (of Feynman integraJs) 146 B (of a subs pace of an affine space) 7.C parametrix 189.C left 34S.A right 34S.A parametrized, effectively (at 0) 72.G paraxial ray 180.B parity check matrix 63 C parity check polynomial 63.0 parity transformation 359 B Parreau-Widom type 164.K Parse val equality 18.B 197.C Parseval identity 18.B 159.A 160.C 192.K 220.B,C,E parsing 31.E part(s) (for a function algebra) 164.F connected 150.0 cyclic (of an ergodic class) 26O.B dissipative (of a state space) 260.B essentiaJ 260.1 finite (of an integral) 125.C Gleason (for a function algebra) 164.F hoJomorphic (in a Laurent expansion) 1980 homogeneous (of a formal power series) 370.A imaginary 74.A integration by (for the O-integral) 100.0 integration by (for the Stieltjes integral) 94.C integration by (for the Riemann integral) 216.C 2066 negative (of an element of a vector lattice) 310.B positive (of an element of a vector lattice) 310.B principal (of a differential operator) 112.A principal (of a Laurent expansion! 198.0 principal (of a partial differential operator) 320.B purely contractive 251.N real 74.A semisimple (of an algebraic group) 13.E semisimple (of a nonsingular matrix) 13.E singular (of a Laurent expansion) 198.0 unipotent (of an aJgebraic group) 13.E unipotent (of a nons in gular matrix) 13.E partial boundary operator 200.E partial capture 420.0 partial correlation coefficient 397.1 sample 280.E partial denominator (of an infinite continued frac- tion) 83.A partial de Rham system 274.0 partial derivative 106.F,K nth-order 106.H partial derived functor 200.1 partial differential 200.H partial differential coefficient 106.E partial differential equation(s) 313.A 320 (initial value problems) 321 (method of integration) 322 of elliptic type 323, App. A, TabJe I5.VI of the first order 324 Fokker-PJanck 115.A hyperbolic 32S of hyperboJic type 325 of mixed type 326 of parabolic type 327 solution, of the first order App. A, Table IS.II solution, of the second order App. A, Table IS.m system of, of order 1 (on a differentiabJe mani- fold) 428.F partial differential operator 112.A partial differentiation 106.F partial fraction App. A, Table 10.V partiaJ function 356.E partial graph 186.C partial1y balanced incomplete block design 102.1 406.1 partially confounded (with blocks) 102.1 partiaily conserved axial-vector currents 132.0 partiaJJy differentiable (function) 106.F partiaily isometric (operator) 2Sl.E partially ordered set 311.A partial mapping (of a mapping) 381.C partial numerator (of an infinite continued fraction) 83.A partial observation 40S.C partial ordering 311.A partiaJ pivoting 302.B partiaJ product 379.G partial quotient, nth 83.A partial recursive (in a partial recursive function) 356.E,F partial sum (of a series) 379.A diagonal (of a double series) 379.E partial summation, Abel 379.0 partial wave 386.B partial wave expansion 37S.E 386.B partial wave scattering amplitude 37S.E 
2067 particle(s) Bose 132.A composite 132.A elementary 132 Fermi 132.A particular solution (of a differential equation) 313.A (of partial differential equations) 320.C (for a system of differential equations) 313.C particular transformation (of :) 248.R partition(s) (in ergodic theory) 136.E (of an interval) 216.A (of a set) 381.0 (of a space) 425.L entropy of 136.E 8-independent 136.E independent sequence of 136.£ Markov (for an automorphism) 136.C,G number of 177.0328 of numbers 328 Pinsker 136.£ principal 66.H of unity 425.R of unity of class Coo 105.S of unity subordinate to a covering 425.R upper semicontinuous 425.L partition function 402.0 grand 402.0 partitioning algorithm 215 £ Pascal, B. 329 lima90n of 93.H Pascal configuration 78.K Pascal line 78.K Pascal theorem (on conic sections) 78.K (in geometry) 155.E (in projective geometry) 343.E Pascal triangle 330 Pasch's axiom (in geometry) 155.B passive (state) 402.G completely 402.G passive boundary point 260.1 dual 260.1 passive network 282.C passive orthonomic system (of partial differential equations) 428.B past cone 258.A past history, independent of the 406.0 pasting together the boundaries 114.F path (in a Finsler space) 152.C (in a graph) 186.F (of a Markov process) 261.B (of a stochastic process) 407.A (in a topological space) 148.C 170 asymptotic (for a me rom orphic function) 272.H closed (in a graph) 186.F closed (in a topological space) 170 closed, space of 202.C critical 376 direct 186.F direct closed 186.F Euler (in a graph) 186.F general geometry of 152.C Hamilton 186.F of an integration (curvilinear integral) 94.0 inverse 170 Subject Index Percolation process non tangential 333.B projective geometry of 109 quasi-independent of (response probability) 346.G sample 407.A simple 186.F Stolz (in a plane domain) 333.B path-component 79.B path-connected 79.B path-dependent, d-trial 346.G path-independent (response probability) 346.G path integral 351.F pathological (space) 65.F path space 148.C 261.B pathwise uniqueness of solution 406.0 pattern formation 263.0 Pauli approximation 351.G Pauli-Lubanski vector 258.0 Pauli principle 351.G Pauli spin matrix 258.A 351.G payoff 108.B,C 173.B payoff function 173.C PBIBO (partially balanced incomplete block design) 102.1 PC (predictor-corrector) method 303.E PCT in variance 386.B PCT theorem 386.B peak point 164.0 generalized 1640 peak set 164.0 generalized 164.0 Peano area (of a surface) 246.F Peano continuum 93.0 Peano curves 93.1 Peano postulates 294.B Pearson distribution 397.0 Pearson lemma, Neyman- 400.B Peclet number 116.B pedal curve 93.H Peierls-Bogolyubov inequality 212.B Peirce decomposition (of a Jordon algebra) 231.B Peirce left decomposition (in a unitary ring) 368.F Peirce right decomposition (in a unitary ring) 368.F Peirce space 231. B Pelczynski theorem, Bessaga- 443.0 Pell equation 118.A penalized problems 440.B penalty method 292.E penalty term 440. B pencil algebraic 15.C of conics 343.£ of hyperplanes (in a projective space) 343.B Lefschetz 16. U linear 16.N of lines (in a projective plane) 343.B of planes (in a 3-dimensional projective space) 343.B of quadric hypersurfaces 343.E of quadrics 343.E peninsula (in a Riemann surface) 272.1 pentagamma function 174.B pentagon 155.F pentagonal number 4.0 pentagonal number theorem 328 pentaspherical coordinates 90.B percolation process 340.D bond 340.0 site 340.0 
Subject Index Perfect perfect (image) 180.A (1.- (graph) 186.K y- (graph) 186.K perfect additive functional 261.E perfect code 63.B perfect delay convention 51.F perfect field 149.H perfect fluid 205.B perfect image 425.CC perfect inverse image 425.CC perfect kernel (in potential theory) 338.E perfectly normal space 425.Q perfectly separable space 425. P perfect mapping 425.W quasi- 425.CC perfectness theorem 186.K perfect number 297.D of the second kind 297.D perfect set 425.0 perigon, straight 139 D perihelion distance 309.B period (of an Abelian differential form) 11.C (of an ergodic class) 260.B (of a marked K3 surface) 72.K (of an orbit) 126.D (of an oscillation) 318.A (of a periodic continued fraction) 83.C (of a periodic function) 134.B (of a wave) 446 fundamental 134.E periodgram 421.C periodic (trajectory) 126.D almost 126.F periodic continued fraction 83.C periodic endomorphism (at a point) 136.E periodic function 134.E almost (on a graph) 18.E almost (with respect to p) 18.C almost (in the sense of Bohr) 18.B analytic almost 18.D doubly 134.E simply 134.E uniformly almost 18.B periodic group 2.A maximally almost 18.1 minimally almost 18.1 periodic inequality, Riemann 3.L periodicity 390.1 periodicity modulus (of an elliptic integral) 134 A periodicity theorem, Bott 202 V 237.D, App. A, Table 6.VII periodic solution (of Hill's equation) 268.E period matrix (of a closed Riemann surface) I1.C (of a complex torus) 3.H period parallelogram 134.E fundamental 134.E period relation, Riemann's 3.L 11.C peripheral devices 75.B peripheral system 235.B permeability, magnetic 130.B permeable membrane 419.A permutation 190.B (in a symmetric group) 151.G even 151.G k- 330 odd 151.G permutation group 190.B 2068 of degree n 151.G imprimitive 151.G intransitive 151.H K -transitive 151.H K-ply transitive 151.H multiply transitive 151.H primitive 151.H regular 151 H transitive 151.H permutation representation (of a group) 362.B degree of 362.B faithful 362.B primitive 362.B reciprocal 362.B similar 362.B perpendicular (to a hyperplane) 139.E foot of the 139.E perpetual motion 402.G Perron-Brelot solution (of Dirichlet problem) 120.C Perron-Frobenius theorem 310.H Perron integrable (function) l00.F Perron method (in Dirichlet problem) 120.C Perron theorem (on linear transformations of sequences) 379.L (on ordinary differential equations) 316.E (on positive matrices) 269.N Perron-Wiener-Brelot solution (of Dirichlet problem) 120.C persistent 260.1 perspective 343.B perspective mapping (in projective geometry) 343.B PERT 307 C 376 perturbation(s) analytic 331.D asymptotic 331.D general theory of 420 E Kato 351.D oflinear operators 331 method 25.A regular 331.0 secular 55. B singular 289.E special theory of 420.E Petersson conjecture, Ramanujan- 32 D Petersson metric 32. B Peter-Weyl theory (on compact groups) 69.B (on compact Lie groups) 249.U Petrovskii, hyperbolic in the sense of 325.F Petrovskii theorem 112.D Pettis completely additivity theorem 443 G Pettis integrable 443.F Gel'fand- 443.F Pettis integral 443.F Gel'fand- 443.F Pettis measurability theorem 443.B Pettis theorem Dunford- 68.M Orlicz- 443.D Petvyashvili equation, Kadomtsev- 387.F Pfaffian 103.G Pfaffian equation(s) 428.A system of 428.A Pfaffian form 428.A pfaff problem 428.A generalized 428.B Pfluger extremal length, Hersch- 143.A phase 
2069 initial (of a simple harmonic motion) 318.B pure 402.G phase average 402.C phase constant (of a sine wave) 446 phase function (of a Fourier integral operator) 274.C 345 B phase portrait 126 B phase shift 375.E 386.B phase space (of a dynamical system) 126.C 290 C (for functional-differential equation) 163.C (in statistical mechanics) 402.C momentum 126.L velocity 126.L phase transition 340.B phase velocity (of a sine wave) 446 phenomenon Gibbs 159.0 Runge 223.A Stokes 2540 photon 132.B 377.B Phragmen- Lindelof theorem 43.C physical Hilbert space 150.G physically contains 351.K PI-algebra (algebra with polynomial identities) 29.1 Picard exceptional value 272 E Picard group (of a commutative ring) 237.1 Picard-Lefschetz formula 418 F Picard-Lefschetz transformation 16.U Picard number (of a variety) 16.P Picard scheme 16 P Picard theorem (on transcendental entire functions) 429.B (on transcendental meromorphic functions) 272.E Picard variety 16.P (of a compact Kahler manifold) 232.C Picard- Vessiot extension field 113 Picard-Vessiot theory 113 picture Heisenberg 351.0 Schrodinger 3510 piecewise affine mapping 192.Q piecewise continuous function 84 B piecewise linear mapping 65.A 70.C piecewise smooth curve 364.A Pincherle-Goursat kernel 217.F Pinching problem (differentiable) 178.E Pinsker partition 136.E Pitman estimator 399.G pivot 302.B pivoting complete 302 B partial 302 B PL category 65 A PL embedding 65.0 PL homeomorphism 65.A PL isomorphism 65.A PL k-ball 65.C PL (k - I)-sphere 65.C PL mapping (map) 65 A PL microbundle 147.P PL (n, m)-ball knot 65.0 PL (n, m)-knot 65.0 PL normal 147.P PL structure 65.C PL tangent 147.P PL topology 65.A place (of a field) 439.J Subject Index Plucker relations (on Plucker coordinates) placement problem 235.A planar 367.G planar character 367.0 planar curvilinear coordinates App. A, Table 3.V planar graph 186.H planarity (of a graph) 186.H Plancherel formula (on a unimodular locally com- pact group) 437.L Plancherel measure (of a locally compact group) 437.L Plancherel theorem 160.H 192 A,K (with respect to the Radon transform) 218 G Planck constant 351.A Planck (partial differential) equation, Fokker- 115 A 402.1 plane(s) 155.B (as an affine space) 7.A (in a projective space) 343.B Cayley projective 54 complex 74.C conjugate (with respect to a quadric surface) 350.C coordinates (of a plane) 343.C finite projective 241.B Gauss-Argand 74.C Gaussian 74.C half- 155.B 333.A hodograph 205.B hyperbolic 122.C normal 111.F osculating II1.F pencil of (in a 3-dimensional projective space) 343.B polar (with respect to a quadric surface) 350.C principal (of a quadric surface) 350.B projective 343.B rectifying 111.F tangent II1.H, App. A, Table 4.1 w- 740 z- 74.0 plane algebraic curve 9.B plane coordinates (of a plane) 343.C plane curve App. A, Table 4.1 continuous 93.B plane domains 333 closed 333.A multiply connected 333.A n-ply connected 333.A plane geometry 181 plane polygon 155.F plane triangle App. A, Table 2.II plane trigonometry 432.A plane wave 446 plane wave decomposition 125 CC planimeter 19.A planning production 376 statistical 102.A plasticity, theory of 271.G Plateau problem 334 playable 108.B Pleijel asymptotic expansion, Minakshisundaram- 391.B PLK (Poincare-Lighthill-Kuo) method 25.B Plotkin bound 63.B plots 102 B Plucker coordinates (in a Grassman manifold) 90.B Plucker formulas (on plane algebraic curves) 9.B Plucker relations (on Plucker coordinates) 90.B 
Subject Index Plurigenera plurigenera 15 E pluriharmonic distribution 21.C plurisubharmonic function 21 G plus infinity 87.0 Pochhammer differential equation, Tissot- 206.C Poincare, H. 335 last theorem of 153.B theta-Fuchsian series of 32.B Poincare-Birkhofffixed-point theorem 153.B Poincare-Birkhoff-Witt theorem (on Lie algebras) 248.1 Poincare-Bruns theorem 420.A Poincare characteristic, Euler- 16.E 201.B Poincare class Euler- 56.B,F unIversal Euler- 56.B Poincare complete reducibility theorem 3.C Poincare complex 114.1 Poincare condition (in Oirichlet problem) 120.A Poincare conjecture 65.C generalized 65.C Poincare differential invariant 74.G Poincare duality 201.0 450.Q Poincare formula (in integral geometry) 218.C Poincare formula, Euler- 201.B,F Poincare group 170 258.A Poincare-Lefschetz duality theorem 20\.0 Poincare-Lighthill-Kuo (PLK) method 25.B Poincare manifold 105.A Poincare mapping (map) 126.C,G Poincare method 25.B Poincare method, Lindstedt- 290.E Poincare metric 74.G Poincare model (of geometry) 285.0 Poincare pair (of formal dimension n) 114.1 Poincare polynomial (of a finite simplicial complex) 201.B Poincare series 32.B Poincare series, Eisenstein- 32.F Poincare theorem 383.E (on Abelian varieties) 3.0 Poincare- Volterra theorem 198.1 Poinsot representation 271.E point(s) (of an affine space) 7.A (in the foundations of geometry) 155.B (of a graph) 186.B (in projective geometry) 343.B accessible boundary (of a plane domain) 333.B accumulation 425.0 accumulation (of a sequence of real numbers) 87.C adherent 425.B algebraic (over a field) 369.C algebraic branch (of a Riemann surface) 367.B almost all, of a variety 16.A (1.- (of a me rom orphic function) 272.B (1.-limit 126.0 ambiguous 62.0 antipodal (on a sphere) 140 apparent singular 254.C base (of a linear system) 16.N base (of a loop) 170 base (of a topological space) 202.B bifurcation 126.M 217.M 286.R boundary (of a subset) 425.N branch (of a covering surface) 367.B branch (of a harmonic mapping) 275.B branch (of an ordinary curve) 93.C catastrophe 51.F 2070 cluster 425.0 coincidence (of maps) 153.B collinear (in projective geometry) 343.B complete accumulation 425.0 condensation 425.0 conjugate 46.C 364.C conjugate (in a projective space) 343.E corresponding (with respect to confocal qua- drics) 350.E critical (of a COO-function) 279.B critical (of a function) 106.L critical (of a mapping) 105.1 193.1 208.B critical (of a trajectory) 126.0 cut (on a geodesic) 178.A degenerate critical 106.L 279.B degree of 99.D of density (of a measurable set of the real line) l00.B dependent (in an affine space) 7.A dependent (in a projective space) 343.B deviation 336.B discontinuity 84.B discontinuity, of the first kind 84.B discontinuity, of the second kind 84.B dual passive boundary 260.1 elliptic (of a Fuchsian group) 122.C elliptic (on a surface) II1.H end (of an ordinary curve) 93.C entrance boundary 260.1 equianharmonic range of 343.0 equilibrium 108.C equilibrium (in the theory of games) 173.C equilibrium (of a trajectory) 126.0 equivariant (of a mapping) 153.B equivariant, index (of a mapping) 153.B essentially singular (with respect to an analytic set) 21.M estimation 399.B 401.C exit boundary 260.1 exterior (of a subset) 425.N externally irregular 338.L extreme (of a convex set) 89.A extreme (of a subset of a linear space) 424.T fixed (of a discontinuous transformation group) 122.A fixed (of a flow) 126.0 fixed (of a mapping) 153.A fixed (of a mapping in a topological linear space) 153.0 fixed (method of roundoff) 138.B fixed (of a topological transformation group) 43\.A fixed, of discontinuity 5.B 407.A fixed, index (of a mapping) 153.B fixed branch (of an algebraic differential equa- tion) 288.A fixed, theorem 153 flat (of a surface) l11.H focal (of a submanifold of a Riemannian mani- fold) 364.C frontier (of a subset) 425.N functions 380.A fundamental (of a projective space) 343.C fundamental (with respect to a birational mapping) 16.1 Yo, of the kth order (of a holomorphic function) 198.C generalized peak 164.D generic 16.A geodesic II1.H 365.0 
2071 geometric (of a scheme) 16.D harmonic range of 343.D homoclinic 126.J hyperbolic (on a surface) I11.H hyperbolic fixed 126.G hyperbolic singular 126.G ideal (in hyperbolic geometry) 285.C independent (in an affine space) 7.A independent (in a projective space) 343.B at infinity (in affine geometry) 7.B at infinity (of a Oaussian plane) 74.D at infinity (in hyperbolic geometry) 285.C at infinity (of a Riemann manifold) 178.F of inflection (of a curve of class C 2 ) 93.G of inflection (of a plane algebraic curve) 9.B initial (of a curvilinear integral) 94.D initial (of a path) 170 initial (of a position vector) 7.A initial (of a vector) 442.A integral 428.E,F interior 425.B internally irregular 338.L irregular (of an analytic set) 45 D irregular (of a Markov process) 261.D irregular boundary 120.D irregular singular 254.B isolated 425.0 isolated (of a curve) 93.0 isolated fixed 126.G isotropic 365.D k-rational (of an algebraic variety) 16.A 369.C left singular (of a diffusion process) 115.B limit (of a discontinuous group) 122.C limit (of a sequence of points) 87.E limit (of a sequence of real numbers) 87.B limit, type 112.1 logarithmic branch (of a Riemann surface) 367.B middle (of two points of an affine space) 7.C movable branch (of an algebraic differential equation) 288.A multiple (on an arc) 93.B multiple (of a plane algebraic curve) 9.B multiple (on a variety) 16.F negative limit 126..D nodal 304.C nondegenerate critical 106.L 279 B nondegenerate critical (of a function on a Hilbert manifold) 286.N nonrecurrent 260 B nonsingular (of an algebraic variety) 16.F normal (of an analytic space) 23 D normal (of a variety) 16.F null recurrent 260.B ill-limit 126.D 1000(%- 396.C order of (with respect to a cycle) 99.D order of (in an ordinary curve) 93.C ordinary (of an analytic set) 23.B ordinary (of a curve of class C k ) 93.G ordinary (in hyperbolic geometry) 285.C ordinary (of an ordinary curve) 93.C ordinary (of a plane algebraic curve) 9 B ordinary (on a Riemann surface) 11.D overcrossing 235.A parabolic (on a surface) 110.B 11l.H passive boundary 260.1 peak 164.D positive limit 126 D positive recurrent 260.B Subject Index Points(s) principal 180.B r-ple (of a plane algebraic curve) 9.B ramification (of an analytic covering space) 23.E rational 118.E rational double 418.C recurrent (of a Markov process) 261.B reflection (with respect to a circle) 74.E regular (of an analytic set) 23.B 45.D regular (with respect to an analytic set) 21.M regular (in catastrophe theory) 51.F regular (of a differentiable mapping) 105.1 regular (of a diffusion process) 115.B regular (with respect to the Dirichlet problem) 207.B regular (of a flow) 126.D regular (of a Hunt process) 261.D regular (of a polyhedron or cell complex) 65.B regular (of a surface in £3) 11l.J regular boundary 120.D regular singular 254.B rest (of a trajectory) 126.D right singular (of a diffusion process) 115.B saddle (of a function) 255.B 292.A saddle (on a surface) 11l.H saddle (of a system of ordinary differential equations) 126.G saddle (of two-person games) 108.B saddle, method 25.C sample 342.B 396.B 398.A Schwinger 150.F semiregular (of a surface in £3) 11l.J simple (of an analytic set) 23.B 418.A simple (on a variety) 16.F singular (of an analytic set) 23.B 418.A singular (of a continuous vector field) 153.B singular (of a curve of class C k ) 93.G singular (of a linear difference equation) 104.D singular (of a plane algebraic curve) 9.B singular (of a polyhedron or cell complex) 65.B singular (of a quadric hypersurface) 343.E singular (of a surface in £3) 111.1 singular (of a system of linear ordinary dif- ferential equations) 254.A singular (of a system of ordinary differential equations) 289.A singular (of a trajectory) 126.D,G singular (on a variety) 16.F smooth (of variety) 16.F stable 16.W stationary (of an arc of class CO) II1.D successive minimum 182.C supporting (of a convex set) 89.G supporting (of a projective frame) 343.C symmetric (with respect to a circle) 74 E terminal (of a curvilinear integral) 94.D terminal (of a Markov process) 261.B terminal (of a path) 176 terminal (of a vector) 442.A transient 260.B transition 254.F transversal homoclinic 126.1 turning 25.B 254.F ultra infinite (in hyperbolic geometry) 285.C umbilical (of a surface) 11l.H 365.D undercrossing 235.A unit (of an affine frame) 7.C unit (of a projective frame) 343.C unit (of a projective space) 343.C 
Subject Index Point(s) at infinity w- (of an entire function) 429.B wandering (of a trajectory) 126.E Weierstrass 11.D zero (of a holomorphic function) 198.C zero (of a polynomial) 337.B 369.C zero (of a subset of a polynomial ring) 369 C zero, of the - kth order (of a complex function) 198.0 zero, of the kth order (of a holomorphic func- tion) 198 C point(s) at infinity 74.0 285.C regular at the 193.B pointed coalgebra 203.F pointed set 172.1 morphism of 172.1 pOInted shape category )82.A pointed topological spaces, category of 202.B pointer 96 B point estimation 399 B 401.C, App. A, Table 23 point-finite covering (of a set) 425.R point function 380.A 407 D point group (of a crystallographic group) 92.A point hypersphere 76.A point process 407.D point range (in projective geometry) 343 B of the number system (in projective geometry) 343.C point set 381 B points of indeterminacy, set of 23.0 point spectrum 390.A pure 136.E pointwise convergent sequence 435.B pointwise ergodic theorem 136.B Poisson bracket 82.B 271.F 324.C,D (of two vector fields) 105.M Poisson differential equation 323.A, App. A, Table 15.III Poisson distribution 341.D 397.F, App A, Table 22 Poisson equation 338 A Poisson formula App. A, Table 19.m of T m = Emir 391 J Poisson input 260.H Poisson integral 168 B 193 G Poisson integral formula 198.B Poisson integration formula App. A, Table 15.VI Poisson kernel 159.C Poisson number 271.G Poisson point process, stationary 407 D Poisson process 5.D compound 5.F Poisson random measure 407 D Poisson ratio 271.G Poisson solution 325 D Poisson stable 126.E negatively 126.E positively 126.E Poisson summation formula 192.C (of Fourier transforms) 192.C (on a locally compact Abelian group) 192.L polar (with respect to a conic) 78.1 (in projective geometry) 343 E (relative to pairing) 424.H polar coordinates 90.C geodesic 90.C tangential 90.C polar decomposition 251 E polar element (a function element in the wider sense) 198.0 2072 (of an integral element) 428.E polar form (of a complex number) 74.C polarity (with respect to a quadric hypersurface) 343.E polarization (on an Abelian variety) 3.G electric 130.A inhomogeneous 3.G magnetic 130.A principal 3.G polarized (Hodge structure) 16.V (wave) 446 polarized Abelian variety 3.G polarized Jacobian variety, canonically 3.G 9.E polar plane (with respect to a quadric surface) "'A ,-.. .JJU.L- polar set (in potential theory) 261.D 338.H polar space 191.1 polar system (in projective geometry) 343.D polar tetrahedron 350.C self- 350.C polar triangle 78.1 self- 78.1 pole (of a complex function) 198.D (of a function on an algebraic curve) 9.C (of a function on an algebraic variety) 16.M (of a polar with respect to a conic) 78.1 (of a polar plane) 350.C (of a polar of a quadric hypersurface) 158.E (of a roulette) 93.H - north (of a complex sphere) 74.D north (of a sphere) 140 order of 198.D Regge 132.C 386.C resonance 331.F south (of a complex sphere) 74.D south (of a sphere) 140 pole divisor (of a function on an algebraic variety) 16.M policy 127.A 405.C Markovian 405 C optimal 127.A Polish space 22.I 273.1 polya's enumeration theorem 66.E Polya type 374.J strictly of 374 J polychromatic group 92.D poly disk 21.B polygamma functions 174 B, App. A. Table 17.I polygon(s) 155.F Cauchy 316.C decomposition-equal 155.F force 19 C normal 234.C plane 155 F regular 357.A simple 155.F supplementation-equal 155.F polygonal number of order r 4.D polyharmonic 193.0 polyhedral, convex rational 16.Z polyhedral angle, regular 357.B polyhedral cone, convex 89.F polyhedral group, regular 151.G . polyhedron (polyhedra) (in an affine space) 7.D (ofa simplicial complex) 65.A 70.C analytic 21.G convex 89.A 
2073 corner 215.C Euclidean 70.B Euler theorem on 201.F integer 215.C regular 357.B topological 65.A polymatroid 66.F polynomial(s) 337 Alexander (of a knot) 235 C,O alternating 337.1 associated Laguerre 317.0 Bernoulli 177.B Bernshtein 336.A Bernshtein (generalized) 418.H characteristic (of a differential operator) 112.A 321.A characteristic (of a linear mapping) 269.L characteristic (of a ma trix) 269.F Chebyshev 317.0 336.H, App A, Table 20.n Chebyshevorthogonal 19.G cyclotomic 14.L differential 113 Euler 177.C Fourier-Hermite 176.1 Galois group of the 172.G Gegenbauer 317 D 393.E, App. A, Table 20.1 generalized trigonometric 18.B Hermite 317.0 Hermite interpolation 223.E Hilbert (of an algebraic curve) 9.F Hilbert (of a graded R-module) 369.F Hilbert (of a sheaf) 16.E homogeneous of degree n 337.B Hosokawa 235.0 inseparable 337 G irreducible 337.F isobaric 32.C Jacobi 317.0, App. A, Table 20.V Lagrange interpolation 223.A 336.G, App. A, Table 21 Laguerre 317.0, App. A, Table 20.YI Legendre 393.B, App. A, Table 18.II link 235 0 Lommel App. A, Table 19.1V in m variables 337.B minimal (of an algebraic element) 149.E minimal (of a linear mapping) 269.L minimal (of a matrix) 269.F monic 337.A Neumann App A, Table 19.1V Newton interpolation 336 G orthogonal 19 G, App. A, Table 20.VII parity check 63.E Poincare 201.B primitive 337.0 reduced link 235.0 reducible 337.F ring of 337.A 369 ring of differential 113 Sato-Bernshtein 125.EE Schlitfli App. A, Table 19.IV separable 337.G simplest orthogonal 19.G Snapper 16.E Sonine 3170, App. A, Table 20.VI symmetric 337 I system of orthogonal 317 0 trigonometric interpolation 336.E ultraspherical 317 0 zonal 374C Subject Index Positive cycle (on an algebraic variety) polynomial approximation 336 best (in the sense of Chebyshev) 336.H polynomial approximation theorem (for Coo_ functions) 58.E polynomial distribution App. A, Table 22 negative App. A, Table 22 polynomial extrapolation method 303.F polynomial identity (on an algebra) 29.1 polynomially transformable 71.E polynomial representation (of GL(V)) 60.0 polynomial ring 337.A 369 of m variables 337.B polynomial time 71.B polytropic differential equation 291.F Pomeranchuk theorem 386.B Pontryagin class(es) (of an R"-bundle) 56.0 combinatorial 56.H of a manifold 56.F rational 56.F total 56.0 universal 560 Pontryagin duality theorem (on topological Abelian groups) 192.K 422.C Pontryagin multiplication 203.0 Pontryagin number 56.F Pontryagin pth power operation 64.B Pontryagin product 203.0 Popov ghost, Faddeev- 132.C 150.G population (in statistics) 397.B 401.E finite 373.A infinite 401 E population characteristic 396.C population correlation coefficient 396.0 population covariance 396.0 population distribution 396.B 401.F hypothetical infinite 397.P population kurtosis 396.C population mean 396.C population moment of order k 396 C population standard deviation 396.C population variance 396.C port-admittance matrix 282.C porter 168.C port-impedance matrix 282.C port network, M- 282.C portrait, phase 126.B position general (complexes) 70.B general (of a PL mapping) 65.D general (in a projective space) 343.B general, theorem 65.0 hyperboloid 350.0 method of false 301.C position representation 351.C position vector 442.A (of a point of an affine space) 7.A positive (chain complex) 200.C (class of vector bundles) 114.0 (complex) 200.H (functional on a C*-algebra) 36.G (function on a C*-algebra) 308.0 (Hermitian operation) 308.A (square matrix) 31O.H completely (linear mapping between C*- algebras) 36.H positive boundary, open Riemann surface of 367.E positive cone, natural 308.K positive cycle (on an algebraic variety) 16.M 
Subject Index Positive definite positive definite (function) 192.B,1 394.C (Hermitian form) 348.F (matrix) 269 I (potential) 338.D (sequence) 192.B (on a topological group) 36.L 437.B positive definite kernel 217.H positive definite quadratic form 348.C positive direction (in a curvilinear integral) 198.B positive distribution 125 C positive divisor (of an algebraic curve) 9.C (on a Riemann surface) 11.D positive element (in a lattice-ordered group) 243 G (of an ordered field) 149 N strictly 310 H totally 14.G positive entropy, completely 136 E positive half-trajectory 126.D positive infinity 87.D 355 C positive kernel 217.H positive limit point 126.D positively invariant 126.D positively Lagrange stable 126.E positively Poisson stable 126.E positively regular process 44.C positive matrix 269.N positive number 355.A positive operator (in vector lattices) 310.E positive orientation (of an oriented C'-manifold) 105.F positive orthant 89.G positive part (of an element of a vector lattice) 310.B positive prolongationallimit set, first 126 D positive Radon measure 270 I positive real function 282.C positive recurrent ergodic class 260 B positive recurrent point 260.B positive root (of a semisimple Lie algebra) 248.M positive semidefinite (operator) 251.E positive semidefinite kernel 217.H positive semidefinite matrix 269.1 positive semidefinite quadratic form 348.C positive semiorbit 126.D positive system, symmetric 112.S 326.D positive terms, series of 379.B positive type (function of) 192.B,J (sequence of) 192.B (symmetric kernel of) 338.D positive variation (of a mapping) 246.H (of a real bounded function) 166 B positive Weyl chamber 248.R positivity O-S 150.F reflection 150.F T- 150.F possibility 411.L possible construction problem 179.A posterior density 401.B posterior distribution 398.B 401.B 403.G posterior risk 399 F post liminal C*-algebra 36.H Postnikov complex 70.G Postnikov system (of a CW complex) 148.D Post problem 356 D 2074 Post theorem 356.H postulate(s) 35.A fifth (in Euclidean geometry) 139.A Nernst 419.A Peano 294.B potency of a set 49.A potential 338.A (of a force) 271.C (of a Hamiltonian) 375.B (for a lattice spin system) 402 G (in a Markov chain) 260.D (on a network) 281.B central 351.E chemical 402.D 419.B of a double distribution 338.A of a double layer 338.A equilibrium 260.0 finite-band 387.E finite-gap 387 E logarithmic 338.A Newtonian 271.C 338.A of order rx 338.B reflectionless 387.D Riesz 338.B scalar 130.A 442.D of a simple distribution 338.A of a single layer 338.A vector 130.A 442.D velocity 205.B Yukawa 338.M potential energy 271.G potential good reduction (of an A beliall variety) 3.N potential kernel, weak 260.D potential stable reduction (of an Abelian variety) 3.N potential theory 338 power (of a cardinal number) 49.C (of an ordinal number) 312.C (of a test) 400.A of a with exponent x 13l.B,G fractional 378.D p-fold exterior (of a linear space) 256.0 p-fold exterior (of a vector bundle) 147.F Pontryagin (pth) operation 64.B residue of the nth 14.M of a set 49.A Steenrod (pth) operation 64.B power associative algebra 231.A power dilation 251.M powerful invariant, uniformly most 399.Q powerful unbiased, uniformly most 399.Q power function 400.A envelope 400.F power method 298.C power-residue symbol 14.N power series 21.B 339 370 with center at the point of infinity 339.A convergent 370.B field of, in one variable 370.A formal 370.A formal, field in one variable 370.A ring of 370.A ring of convergent 370.B ring offormal 370.A power series space finite type 168.B infinite type 168.B power set 
2075 (of a set) 381 B axiom of 33.B 381.G power sum theorem 123 0 Powers factor 308.1 Powers-Stormer inequality 212.B Poynting vector 130.A Prandtl boundary layer equation 205.C Prandtl-Glauert approximation 250.B Prandtl-Glauert law of similarity 205.0 Prandtl integrodifferential equation 222 C Prandtl number 116.B prealgebraic variety 16.C precession 392 precompact (metric space) 273.B (set in a metric space) 273.B precompact uniform space 436.H preconditioned (in numerical solution of linear equations) 302.0 predator relation, prey- 263 B predecessor (of an element in an ordered set) 311.B predetermined variables 128.C predicate 411.G analytic 356.H arithmetical 356.H complete 356.H decidable (number-theoretic) 356.C enumerating 356.H first-order 411 K general recursive 356.C hyperarithmctical 356 H of n-argument 411 G n-ary 411.G primitive recursive 356.B second-order 411 K predicate (object) 156 B predicate calculus 411 J with equality 4l1.J predica te logic 41l.J with equality 41l.J first-order 411.K higher-order 411.K second-order 411.K third-order 411 K predicate symbol 411 H predicate variable 411 G,H predictable (O'-algebra) 407.B prediction sufficiency 396.1 prediction theory 395.0 linear 395.0 predictive distribution 403.G predictor 303.E (in a multistep method) 303.E linear 3950 Milne's 303.E optimal linear 3950 predictor-corrector (PC) method 303 E predual 308.0 prefix condition code 2130 pre-Hilbert space 197.B prehomogeneous vector space 450.V zeta function associated with 450.V premium 214 A net 214.A risk 214 B savings 214.B prenex normal form (in predicate logic) 41l.J preordenng 311.H preparation theorem Subject Index Prime formula for COO-functions 58 C Weierstrass 21.E 370 B Weierstrass type (for microdifferential opera- tors) 274.F presentation 235.B of finite (@-Modules) 16.E Wirtinger (of a knot group) 235.B,0 presheaf 383.A sheaf associated 383.C presheaf on a site 16.AA pressure 402.G 419 A pressure, topological 136 H pressure equation 205.B prestratification, Whitney 418.G preventive maintenance model 307.C prey-predator relation 263.B price imputed 292.C shadow 255.B primal problem 255.B primary Abelian group 2 A primary cohomology operation 64.B stable 64.B primary component (of an ideal) 67.F embedded (of an ideal) 67.F isolated (of an ideal) 67.F primary difference 305.C primary ideal 67.F p 67.F primary obstruction 147.L 305.C primary problem 255.B primary ring 368.H completely 368.H semi- 368.H primary solution (of a homogeneous partial dif- ferential equation) 320.E primary submodule 284.A prime(s) (3-manifold) 65.E over an element (in a lattice) 243.F under an element (in a lattice) 243.F Mersenne 297.E relatively (fractional ideals) 14.E relatively (numbers) 297 A twin 123.C prime differential ideal (of a differential ring) 113 prime divisor (of an algebraic function field of dimension 1) 90 (of an algebraic number field or an algebraic function field of one variable) 439.H (of an ideal) 67.F (on a Riemann surface) 11.0 embedded (of an ideal) 67.F finite 439.H imaginary infinite 439.H infinite 439.H isolated (of an ideal) 67.F maximal (of an ideal) 67.F minimal (of an ideal) 67.F real 439.H real (infinite) 439.H prime element (of a ring) 67 H (for a valuation) 439.E prime field 149.B prime formula 411.D (of a language) 276.A 
Subject Index Prime ideal prime ideal 67.C (of a maximal order) 27.A associated (of an ideal) 67.F ramified 14.1 unramified 14 I prime ideal theorem 123.F prime knot 235.A prime number(s) 297 B regular 14.L relatively 297.A prime number theorem 123 B for arithmetic progression 123 0 prime quotient (in a lattice) 243.F prime rational divisor over a field (on an algebraic curve) 9 C prime spot (of an algebraic number field or an algebraic function field) 439.H primitive 427.B (differential form) 232.B (element of coalgebra) 203.1 (element of an extension of a field) 149.0 (generator of the cohomology algebra of a compact Lie group) 427.B primitive binary quadratic form 348.M primitive character 2950 450.C,E non- 450.C,E primitive equation 172.G primitive form 232.B primitive function(s) 216.C derivatives and App. A, Table 9.1 primitive hyperbolic type (reduced basis of) 92.C primitive hypercubic type (reduced basis of) 92.C primitive ideal (of a Banach algebra) 36.0 primitive idempotent element (of a ring) 368.B primitive lattice 92 E primitive operation (of a group) 362.B primitive permutation group 151 H primitive permutation representation (of a group) 362 B primitive polynomial 337.0 primitive recursive 356.B primitive recursive function 356.A,B,F uniformly 356.B primitive recursive in t/fl' ., t/fi 356.B primitive recursive predicate 356 B primitive root of unity 14.L modulo m 297.G primitive solution (of a partial differential equation) 320E principal adele (of an algebraic number field) 6.C principal analytic set 23.B principal antiautomorphism (of a Clifford algebra) 61.B principal automorphism (Clifford algebras) 61.B principal axis (axes) (of a central conic) 78 C (of inertia) 271.E (of a parabola) 78.C (of a quadric surface) 350.B transformation to 390 B principal bundle 147.C associated 147.0 reduced 147.1 reducible 147 J principal character 2950 (of an Abelian group) 2.G principal component(s) (in principal component analysis) 280.F of order p Il0.A principal component analysis 280.F 2076 principal congruence subgroup of level N 122.0 principal convergent (of an irrational number) 83.B principal curvature (of a surface) 111.H 365.C radius of (of a surface) 1l1.H principal directions (of a surface) II1.H principal discrete series 258.C principal divisor (on an algebraic curve) 9.C (on a Riemann surface) 11.0 principal fiber bundle 147.C principal formula of integral geometry 218.C principal fractional ideal 67.K principal genus (for an ideal group) 59.E of a quadratic field 347.F principal half-space (of a flag) 139.B principal H-series 437.X principal ideal 67.K of an algebraic number field 14.E principal ideal domain 67.K principal ideal ring 67.K principal ideal theorem (in class field theory) 59.0 principal ide Ie (of an algebraic number field) 6.C principal isotropy grouP(s) 431 C principal matrix (belonging to a Riemann matrix) 3.1 principal minor (of a matrix) 103.0 principal moment of inertia 271.E principal normal I11.F principalorbit(s) 431.C type 431. C principal order (of an algebraic number field) 14.B fundamental theorem of 14 C principal part (of a differential operator) 112.A (of a Laurent expansion) 198.0 (of a partial differential operator) 320.B principal partition 66.H principal plane (of a quadric surface) 350.B principal point(s) (for a Gauss mapping) 180.B principal polarization (of an Abelian variety) 3.G principal quantum number 315.E principal series 258.C (in an n-group) 190 G (of unitary representations of a complex semi- simple Lie group) 437.W (of unitary representations of a real semisimple Lie group) 437'x principal solution 104.B principal space (of a flag) 139.B principal subspace (of a linear operator) 390.B principal symbol 237 H (of a microdifferential operator) 274.F (of a simple holonomic system) 274.H principal theorem, Ahlfors 367.B principal value (of inverse trigonometric functions) 131.E Cauchy (of an improper integral) 216.0 Cauchy (of the integral on infinite intervals) 216.E oflogZ 131 G principle(s) argument 198.F balayage 338.L Bellman 405.B Cartan maximum 338.L complete maximum 338.M of condensation of singularities 37.H 
2077 of conditionality 401.C continuity 21 H continuity (in potential theory) 338.C contraction 286.B correspondence 351.D of counting constants 16.S Dedekind (in a modular lattice) 243.F of depending choice (DC) 33.F dilated maximum (in potential theory) 338.C Oirichlet 120.A 323.E Dirichlet drawer 182.F domination 338.L Donsker invariance 250.E duality (for closed convex cones) 89.F duality, for ordering 311 A of duality (in projective geometry) 343 B embedding (in dynamic programming) 127.B energy 338 D energy minimum 419.A enthalpy minimum 419.C entropy maximum 419.A of equal weight 402 E equilibrium 338.K of equivalence (in insurance mathematics) 214.A 359 D Fermat 180 A 441.C first maximum (in potential theory) 338.C Fisher three 102.A Frostman maximum 338.C general, of relativity 359 D Gibbs free energy minimum 419 C Hamilton 441.B Hasse 348.G Helmholtz free energy minimum 419.C Huygens 325.B 446 Huygens, in the wider sense 325 D invariance 375.B 400 E of invariance of speed ofIight 359.B inverse domination 338.L of least action 441.B limiting absorption 375.C of linearized stability 286.S of localization (on convergence tests of Fourier series) 159.B local maximum modulus 164.C lower envelope 338 M Maupertuis 180.A maximal 193.E maximum (for control theory) 86.F maximum (for a holomorphic function) 43.B maximum (for minimal surfaces) 275.B maximum modulus (for a holomorphic func- tion) 43.B minimax (for eigenvalues ofa compact opera- tor) 68.H minimax (for A k ) 391.G minimax (for statistical decision problem) 398 B minimum (for A) 391.D minimum (for Ad 391.G of nested intervals (for real numbers) 87 C 355.B Oka 21.K 1470 of optimality 127.A Pauli 351.H quasicontinuity (in potential theory) 338.1 Rayleigh 68 H ret1ection 45.E ofreflection 74 E Schwarz, of reflection 198.G Subject Index Problem(s) separation 405.C special, of relativity 359 stochastic maximum 405.D stored program 75.B Strassen invariance 250.E of sufficiency 401.C of superposition 252.B 322.C sweeping-out 338.L Ugaheri maximum 338.C uniqueness (in potential theory) 338.M upper boundedness (in potential theory) 338.C variational 441 variational (in statistical mechanics) 340.B 402.G variational (in the theory of elasticity) 271.G variational, with relaxed continuity requirement 27l.G variational, for topological pressure 136.H Pringsheim theorem 58 E prior density 401.B prior distribution 401.B 403.G probabilistic model 397.P probability 342 additivity of 342 B a posteriori 342.F a priori 342.F binomial, paper 19.B conditional 342.E continuous in 407.A converge in 342.D converge with, 1 342.D critical percolation 340.D error 213.D of an event 342.B that event 8 occurs 342.B event with, 1 342.B extinction 44.B geometric 218.A hitting, for single points 5 G objective 401.B regular conditional 342.E ruin 214.C standarq transition 260.F subjective 401.B theory of 342.A transition 260 A 261.A 351.B probability amplitude 351.D probability density 341.D probability distribution(s) 342.B, App. A, Table 22 (one-dimensional, of random variable) 342.C (of random variables) 342.C conditional 342.E n-dimensional 342.C probability generating function 341.F 397.G probability integral App. A, Table 19.II probability measure 341 342.B probability of loss 307.C probability paper 19.F binomial 19.B probability ratio test, sequential 400.L probability space 342.B probable cause, most 401.E probable value, most 401.E problem(s) Abel 217.L abstract Cauchy 286,X acousgc 325.L adjoint boundary value 315.B all-integer programming 215.A Appolonius (in geometric construction) 179.A 
Subject Index Problem(s) Behrens-Fisher 400.G Bernshtein, generalized 275.F boundary value (of ordinary differential equa- tions) 303.H 315.A Burnside (in group theory) 161.C Cauchy (for ordinary differential equations) 316.A Cauchy (for partial differential equations) 320.B 321.A 325.B class field tower 59 F combinatorial App. A, Table 17.II combinatorial triangulation 65.C concave programming 292.A conditional, in the calculus of variations 46.A connection 253.A construction 59.F convex programming 292 A corona 43.G correctly posed (for partial differential equa- tions) 322.A Cousin, first 21.K Cousin, second 21.K Cramer-Castillon (in geometric construction) 179A critical inclination 55.C decision 71.B 97 186.1 Oelos (in geometric construction) 179.A Oido 228.A differentiable pinching 178.E Oirichlet 120 193.F 323.C Oirichlet, with obstracle 440.B Oirichlet divisor 242.A dual 255 B 349.B du Bois Reymond 159.H eigenvalue 390.A exterior (Oirichlet problem) 120.A first boundary value 193 F 323.C flow-shop scheduling 376 four-color 157 Oauss circle 242.A Gauss variational 338.1 general boundary value 323.H generalized eigenvalue 298.G generalized isoperimetric 46 A 228.A generalized Pfaff 428.B Geocze 246.0 geometric construction 179.A Goldbach 4.C group-minimization 215.C Hamburger moment 240.K Hausdorff moment 240 K Hersch 391.E Hilbert (in calculus of variations) 46.A Hilbert fifth 423.N homeomorphism 425.G homogeneous boundary value (of ordinary differential equations) 315.B Hukuhara 315.C of identification (in econometrics) 128.C impossible construction 179.A inconsistent (of geometric construction) 179.A inhomogeneous boundary value (of ordinary differential equations) 315.B initial value (for functional differential equa- tions) 1630 initial value (of ordinary differential equations) 313.C 316.A initial value (for partial differential equations) 321 A 2078 initial value, for a hyperbolic partial differential equation App. A, Table 15.m interior (Oirichlet problem) 120.A interpolation 43.F invariant measure 136.C inverse (in potential scattenng) 375.0 isomorphism (for graphs) 186.1 isomorphism (for integral group algebra) 362.K isoperimetric l11.E 228.A Jacobi inverse 3 L job-shop scheduling problem 376 k-sample 371.0 Lagrange (in calculus of variations) 46.A LBA 31.0 Levi 2U linear least squares 302.E linear programming 255.A local (on the solutions of differential equations) 289.A machine scheduling 376 machine sequencing 376 Malfatti (in geometric construction) 179.A many-body 402.F 420.A martingale 115.C 261.C 406.A maximum flow 281.C minimum-cost flow 281.C mixed integer programming 215 A multicommodity flow 281.C multiprocessor scheduling 376 nobody 420.A n-decision 398.A network-flow 281 282.B Neumann (for harmonic functions) 193.F Neumann (for partial differential equations of elliptic type) 323.F nonlinear 291 normal Moore space 425.AA optimal regulator 86.F penalized 440.B Pfaff 428.A placement 235.A Plateau 334.A possible construction 179.A primal 255.B primary 255.B properly posed 322.A pure integer programming 215.A quadratic programming 292.A 349.A random walk 260.A representation (on surface) 246.1 restricted Burnside (in group theory) 161.C restricted three-body 420.F Riemann 253.0 Riemann-Hilbert (for integral equations) 217.1 Riemann-Hilbert (for ordinary differential equations) 253.0 Robin 323.F of satisfiability (of a proposition) 97 Schoenflies 65.G second boundary value (for harmonic functions) 193.F second boundary value (for partial differential equations of elliptic type) 323.F second Cousin 21.K self-adjoint boundary value 315 B sequential decision 398.F shortest path 281. C single- commodity flow 281 
2079 singular initial value (for partial differential equations of mixed type) 326.C smoothing 114C special isoperimetric 228.A of specification 397.P statistical decision 398.A Steiner (in geometric construction) 179.A Stieltjes moment 240 K Sturm-Liouville 315.B third boundary value (for harmonic functions) 193 F third boundary value (for partial differential equations of elliptic type) 323.F three big 187 three-body 420.A Thues (general) 31.B time optimal control 86.F transformation (in a finitely presented group) 161.B transient 322.D transportation 255.C transportation, on a network 255.C Tricomi 326.C two-body 55.A two-point boundary value (of ordinary dif- ferential equations) 315.A two-terminal 281 type (for Riemann surfaces) 367.D of universal validity of a proposition 97 Waring 4.E weak form of the boundary value 304.B well-posed (in general case) 322.A word (in a finitely presented group) 161 B 0-1 integer programming 215.A procedure classification 280 I exploratory 397.Q Lyapunov-Schmidt 286.V random sampling 373 A sampling 373 A shortest-path 281.C statistical decision 398 A process (in catastrophe theory) 51.F .(on a measure space) 136.E (=stochastic process) 407.A additive 5 342.A age-dependent branching 44.E asymmetric Cauchy 5.F autoregressive 421.D autoregressive integrated moving average 421 G autoregressive moving average 421.D Bernoulh 136.E Bernoulli, very weak 136.E Bernoulli, weak 136 E birth 260.G birth and death 260.G bond percolation 340.D branching 44 342.A branching Markov 44 E , Cauchy 5.F centered 5.B compound Poisson 5.F contact 340.C continuous-state branching 44.E death 260.G diffusion 115 dual 261.F Subject Index Process exponent of the stable 5.F Feller 261.B finitely determined (F. D.) 136.E Galton-Watson branching 44.B Gaussian 176 342.A Gaussian, complex 176.C generalized stochastic 407.C homogeneous Markov 5.H Hunt 261 B increasing 262.D independent 136.E with independent increments 5.B integrable, of bounded variation 406.B integrable increasing 406.B invariant Markov 5.H irreversible 402.A isothermal 419.B Ita 406.B Levy 5.B linear stationary iterative 302.C Markov 261 342.A Markov branching 44.D Markovian decision 127.E moving average 421.D multistage allocation 127.A multistage choice 127.A multitype Galton-Watson 44.C multitype Markov branching 44.E Newton iterative 301.D normal 176.C observation 405.F one-sided stable, of the exponent. rx 5.F oscillator 315.F osculating 77.B percolation 340.D point 407.0 Poisson 5 D positive regular 44.C progressive 407.B quadratic variation 406.B quasi static adiabatic 419.B recurrent 261.B reversed 261.F sample 407.A shift associated with the stationary 136.D 0'- (of a complex manifold) 72.H signal 405.F site percolation 340.D spatially homogeneous 261.A stable 5.F stationary 342.A 395.A stationary Gaussian 176 C strictly stable 5.F strictly stationary 395.A stochastic 342.A 407 stochastic, with stationary increments of order n 395.1 strongly stationary 395.A,F strong Markov 261 B subadditive 136.B sweeping-out 338.L symmetric Cauchy 5.F symmetric stable 5.F system 405 F temporally homogeneous 261.A temporally homogeneous additive 5.B transient 261.B very weak Bernoulli (V.W.B.) 136 E weak Bernoulli (W.B.) 136 E 
Subject Index Processing, data weakly stationary 395.A weakly stationary, of degree k 395.1 Wiener 5.D 45.B processing, data 96 processor, central 75 B producer's risk 404 C product(s) (of algebraic varieties) 16 A (of cardinal numbers) 49.C (of completely additive classes) 270 H (of elements of a graded algebra) 203.B (of elements of a group) 190 A (of hyperfunctions) 125 X 274.E (of ideals) 67 B (of knots) 235.A (of linear operators) 37.C 251 B (of matrices) 269 B (of objects) 52.E (of ordinal numbers) 312.C (of paths) 170 (in quadrangular set of six points) 343.C (of real numbers) 355.B (of sets) 381.B (of tensors) 256 K amalgamated (of a family of groups) 190.M Blaschke 43 F bracket (in a Lie algebra) 248 A cap (in (co)homology groups of a space) 201.K cap (in homological algebra) 200 K 201 K cardinal (of a family of ordered sets) 311.F Cartesian (of a family of sets) 381.E Cartesian (of mappings) 381 C Cartesian (of ordered simplicial complexes) 70C Cartesian (of s.s. complexes) 70 E Cartesian (of sets) 381. B Cauchy (of series) 379.F complex (of cell complexes) 70 D cross- (in cohomology groups of a space) 20lJ cross- (in homology groups of a space) 201.1 cross- (of vector bundles) 237 C crossed (of a C*-algebra) 36.1 crossed (of a commutative ring and a group) 29D crossed (in von Neumann algebra theory) 308.1 cup (of cohomology classes) 201 I cup (in K-theory) 237 C cup, reduction theorem 200 M cup of derived functors 200.K difference 337 I direct direct product divergent infinite 374.F exterior (of differential forms) 105 Q exterior (of elements of a linear space) 2560 exterior (of a p-vector and q-vector) 256.0 exterior (of two vectors) 442.C external (of derived functors) 200 K Euler 450.V fiber, over S 52.0 free (of groups) 190 M Hermitian inner 256.Q of ideals of a commutative ring 67.B of inertia 271.E infinite 379 G infinite App A, Table 10 VI inner (in a Hermitian linear space) 256 Q inner (in a Hilbert space) 197.B inner (in a metric vector space) 256.H 2080 inner (for a pairing) 424.0 inner (with respect to a Jinear spac and its dual space) 256.0 inner (of two hyperspheres) 76 A inner (of two n-tuples) 256.A inner (of two vectors) 442 B inner, space 442.B internal (of derived functors) 200.K interior (of a differential form with a vector field) 105.Q intersection (of homology classes) 201.0 intersection (of two subvarieties) 16.Q Kronecker (of matrices) 269.C logical (of propositions) 411 B ordinal (of a family of ordered sets) 311.G partial 379.G Pontryagin 203 D projective C*-tensor 36.H proper (of two normal g-Iattices) 27.A restricted direct 6.B Riemannian (of Riemannian manifolds) 364.A scalar (of linear operators) 37.C scalar (of two vectors) App. A, Table 3.1 scalar triple (of three vectors) 442.C skew (of measurable transformations) 136.0 slant 20l.K smash 202.F spatial tensor 36.H sum of 216.A symmetric (of a topological space) 70.F tensor (of A-homomorphisms) 277.1 tensor (of algebras) 29.A tensor (of A-modules) 277 J tensor(of chain complexes) 20l.J tensor (of cochain complexes) 20lJ tensor (of distributions) 125 K tensor (of Hilbert spaces) 308.C tensor (of linear mappings) 256.1 tensor (of linear representations) 362 C tensor (of linear spaces) 256.1 tensor (of locally convex spaces) 424.R tensor (of shea ves) 383.1 tensor (of vector bundles) 147.F tensor (of von Neumann algebras) 308.C torsion (in a category) 200 K torsion (of two A-modules) 200.0,K vector 442.C, App. A, Table :u vector triple 442.C, App. A, Table 3.1 wedge (of derived functors) 200.K Weierstrass canonical 429.B Whitehead 202.P product algebraic variety 16 A product bundle 147 E product category 52.B product complex 200.H 350 D product decomposition, dual direct 422.H product double chain complex 200 E product event 342.B product formula (for the Hilbert norm-residue symbol) 14.R (on invariant Haar measures) 225 E (for the norm-residue symbol I 14.Q (on valuations) 439 H Trotter 315.F production planning 376 production rule 31 B product mapping 425.K product measure 270.H product measure space 270.H 
2081 complete 270.H product metric space 273 B product of inertia 271.E product rule 299.0 product space 425 L reduced 202.Q product theorem for dimension 117.C product topological space 425.L product topology 425.L product uniformity 436 E product uniform space 436.E profinite groups 21O.C program 75 C machine-language 75.C program evaluation and review technique 376 programming 75.C 385.B bilinear 3640 chance-constrained 408.A convex 264 C disjunctive 264.C dynamic 127.A 264.C fractional 264.C geometric 2640 integer 264.C linear 264 C linear mathematical 2640 mathematical 264 A multiobjective 264.C multistage 264 C network 264.C nonconvex 264.0 parametric 264.C stochastic 264.C 408.A stochastic, model 307 C two-stage linear, under uncertainty 255.F two-stage stochastic 408 A programming problem all-integer 215.A concave 292.A convex 292.A linear 255.A mathematical 264.B mixed integer 215.A nonlinear 264.C pure-integer 215 A quadratic 292.A 349.A 0-1 integer 215.A progression arithmetic 379.1, App A, Table 10.1 geometric 379.1, App A, Table 10.1 progressive (set) 407.B progressively measurable (stochastic process) 407.B progressive process 407.B projecting (in a projective space) 343 B projection (of a covering space) 367.B (from a direct product set) 381.E (of a fiber bundle) 147.B (of a fiber space) 148. B (of a Hilbert space) 197.E (onto a homogeneous space) 199.A (in a projective space) 343.B (to a quotient set defined by an equivalence relation) 135.B (on a tangent bundle of a Banach manifold) 286.K canonical (on modules) 277.F canonical (onto a quotient set) 135 B center of (in projective geometry) 343 B Lie minimal 76.B Subject Index Projective set of class n method of orthogonal (of H Weyl) 323.0 orthogonal 139.E,0 orthogonal (on a Hilbert space) 197.E parallel (in an affine space) 7 C regular knot 235.A relaxation with 440.E stereographic 74.0 unramified (of a covering surface) 367.B projection matrix 269.1 projection method, Rosen's gradient 292 E projection operator (in a Hilbert space) 197.E projective (Banach space) 37.M (object in an Abelian category) 200.1 projective (object) 200.1 projective algebraic variety 16.A fundamental theorems of 72.F quasi- 16.C projective A-module 277.K projective approximation method 304.B projective C*-tensor product 36.H projective class 200 Q projective class group 200.K projective coIlineation 3430 in the wider sense 343.0 projective connection 80 0 projective coordinates 343.C projective coordinate system 343.C projective curvature tensor App. A, Table 4.II projective deformation (between surfaces) 110 B projective determinacy 22.H projective differentia] geometry 11O.B projective dimension (of a module) 200 K projective frame (in projective geometry) 343.C projective general linear group 60.B of degree n over K 60.B projective geometry 343 finite-dimensional 343.B fundamental theorem of 343.0 general 343 B of pa ths 109 projective limit (in a category) 2100 (of a projective system of sets) 21O.B (of a projective system of topological groups) 423K projective limit group 21O.C projective limit space 21O.C projective line 343.B projective line element IIO.B projectively flat space App. A, Table 4.n projectively related (fundamental figures) 343.B projective mapping (in projective geometry) 343.B projective module, (R, S)- 200.K projective morphism 16.E quasi- 16.E projective plane 343.B Cayley 54 finite 241.B projective representation (of a group) 362.1 irreducible 362 J similar 362.1 projective resolution (in an Abelian category) 200.1 left (of an A-module) 200.C \JJ- 200.Q projective scheme 16 E quasi- 16.E projective set of class n 22.0 
Subject Index Projective space projective space 343.B complex 343.0 infinite-dimensional complex 56.C infinite-dimensional real 56.B over A 147.E left 343.F real 343.0 right 343 F projective special linear group 60.B (over a noncommutative field) 600 projective special unitary group over K 60.H projective symplectic group over K 60.L projective system (in a category) 210.0 (of groups) 210.C (of sets) 2iO.B (of topological groups) 423 K (of toplogical spaces) 210. C projective topology 424.R projective transformation 343.0 364.F group of 343.0 regular 343.0 singular 343.0 singular, of the kth species 343.0 projective transformation group 343.0 projective unitary group 60 F projective variety 16.A prolate App. A, Table 3 V proliferation (of errors) 1380 prolongable (Riemann surface) 367.F prolongation (along a curve in a covering surface) 367.B (of a Riemann surface) 367 F (of a sol ution of an ordinary differential equa- tion) 316.C (of a system of partial differential equations) 428.B,F (of a valuation) 439.B analytic 198 G first(of P) 191.E kth (of G) 191.0 kth (ofa Lie subalgebra) 191.0 kth (of P) 191.E prolongationallimit set first negative 126.0 first positive 126.0 proof, consistency 156.0 for pure number theory 156.E proof theory 156.0 propagation of chaos 340.F eq uation of sound 325.A of errors 138. C of singularities 325 M wave 446 proper (continuous mapping) 425 W (equivalence relation in an analytic space) 23.E (leaf) 154.0 (Lorentz group) 258 A (morphism of scheme) 16.0 (PL embedding) 65.0 proper affine transformation 7.E proper class (in set theory) 381.G proper complex Lorentz group 258.A proper component (of an intersection of subvarieties) 16G proper convex function 88.0 proper factor (of an element of a ring) 67.H 2082 proper flag manifold 199.B proper function (of a boundary value problem) 31S.B proper hypersphere (in hyperbolic geometry) 285.C proper Lorentz group 60.1 properly convex (subset of a sphere) 274.E properly discontinuous transformation group 122A properly divergent 379.A properly equivalent (binary quadratic forms) 348.M properly infinite 308.E properly intersect (on a variety) 16.G properly (n - I)-dimensional quadric hypersurface 350.G properly posed (initial value problem) 321.E (problems for partial differential equations) 322.A proper mapping(s) 425.W fundamental theorem of 16,X proper meromorphic mapping (between analytic spaces) 23 D proper modification (of an analytic space) 23.0 proper motion in Euclidean geometry 139.B of a star 392 proper orthogonal group 60.1 258.A proper orthogonal matrix 269.1 proper product (of two normal g-latticf:s) 27.A proper quadric surface 350.B proper rotation group 258.A proper subset 381.A proper time 258.A proper transform (of a subvariety) 16.1 property (properties) 411.G approximation 37.L asymptotic (of sol utions of a system of linear ordinary differential equations) 314.A Baire 425.N basic (of a structure) 409.B bounded approximation 37 L clustering 402.G combinatorial 65.A continuity, for tech theory 201 M of continuity (in a continuous geometry) 85.A countably productive 425.Y covering homotopy 148.B duality (of linear spaces) 256.G equivalence 135 A finite intersection 425.S finite subset 396.F global (in differential geometry) lO9 homotopy extension 202.E in the large (in differential geometry) 109 local (in differential geometry) 109 local (of a pseudodifferential operator) 345.A Markov 261 B micro-pseudolocal (of a pseudodifferential operator) 345.A minimum curvature 223.F minimum norm 223.F P conjecture 235.B pseudolocal (of a pseudodifferential operator) 34S.A pseudo-orbit tracing 126.J reproducing (of a probability distribution) 341.E, App. A, Table 22 in the small (in differential geometry) 109 
2083 spectral 136.E star-finite 425 S strong Markov 261.B topological 425 G uniformity 399 N universal mapping 52.L proper value (of a boundary value problem) 315.B (of a linear mapping) 269.L (of a linear operator) 390.A (of a matrix) 269 F proper variation 279 F proper vector (belonging to an eigenvalue) 269.F (of a linear operator) 390.A (of a linear transformation) 269.L proposition(s) existential 411 B modal 411.L universal 411 B variables 411.E propositional calculus 411 F propositional connectives 411.E propositional function 411 C propositional logic 411.E provable (formula) 41l.I proximity function (of a me rom orphic function) 272.B Prufer ring 200.K pseudoanalytic function, K- 352.B pseudo-arc 79.0 pseudocompact (space) 425 S pseudoconformal geometry 344.A pseudoconformallyequivalent 344.A pseudoconformal transformation 344.A pseudoconvex (domain) 21.G Caftan 21.I d- 21.G Levi 21.1 local1y Cartan 21 I locally Levi 21.I strictly 344.A strongly 21 G pseudodifferentialoperator 251.0 274.F 345 pseudodistance Caratheodory 21.0 Kobayashi 21.0 pseudodistance function 273.B pseudofunction 125.C pseudogeometric ring 284.F pseudogroup (of topological transformations) 105.Y of transformations (on a topological space) 900 pseudogroup structure 105.Y pseudo-Hermitian manifold 344.F pseudointerior 382.B pseudo-isotopic 65.0 pseudo-isotopy 65.0 pseudolocal property (of a pseudodifferential opera- tor) 345.A micro- 345.A pseudomanifold 65.B pseudometric 273.B pseudometric space 273 B indiscrete 273 B pseudometric uniformity 436 F pseudometrizable 436.F pseudonorm (on a topological linear space) 37.0 424F Subject Index q-expansion formula pseudo-orbit 126.1 CI- 126.J tracing property 126.1 pseudo-ordering 31l.H pseudopolynomial, distinguished 21.E pseudorandom numbers 354.8 pseudo-Riemannian metric 105.P pseudo-Runge-Kutta method 303.0 pseudosphere 111.1 285.E pseudotensorjal form 80.G pseudovaluation 439.K ",-collective 354.E psi function 174.B psychometrics 346 Puiseux series 339.A pul1back (of a differential form) 105.Q (of a distribution) 125.Q (of a divisor) 16 M Puppe exact sequence 202.G pure (continued fraction) 83.C (differential form) 367.H (state) 351.B pure geometry 181 pure ideal 284.0 pure integer programming problem 215.A purely contractive 251. N purely contractive part 251.N purely d-dimensional analytic set 23.B (at a point) 23.B purely discontinuous distribution 341.0 purely imaginary number 74.A purely infinite (von Neumann algebra) 308.E purely inseparable (extension of a field) 149.H (rational mapping) 16.1 purely inseparable element (of a field) 149.H purely n-codimensional 125.W purely nondeterministic 395 0 purely transcendental extension 149.K pure number theory 156. E pure periodic continued fraction 83.C pure phase 402 G pure point spectrum 136 E pure strategy 173.B pursuit, curve of 93.H push-down automaton 31.0 push-down storage 96.E Putnam's theorem 251.K Pyatetskii-Shapiro reciprocity law, Gel'fand- 437.00 Pythagorean closure (of a field) 155 C Pythagorean extension (of a field) 155.C Pythagorean field 139.B l55.C Pythagorean number 145 Pythagorean ordered field 60.0 Pythagorean theorem 139.0 Q Q (rational numbers) 294.A,0 q-block bundle 147.Q q-block structure 147.Q q-boundary 201 B q-chains 201.B q-cochains, singular 201.H q-cycle 201.B q-dimensional homology classes 20l.B q-expansion formula 134.1 
Subject Index q-face q-face 70.B q-function Chebyshev ]9 G, App A, Tab]e 20.VII simplest Chebyshev 19 G q-numbers ]30.A q-representation 35J.C q-simplex oriented 20J.C singular 201.E standard 20J.E qth homology group 20J.B Q-manifold 382.0 Q-matrix 260 F Q-spaces 425 BB QR method 298.E QZ method 298.C QCO (=quantum chromodynamics) 132.D q e (=quasi-everywhere) 338 F QFD (=quantum flavor dynamics) ]32 D quadrangles ] 55.F complete 343.C quadrangular set of six points 343.C quadrant, first (of a spectra] sequence) 200 J quadratic differentia] (on a Riemann surface) 1] D quadratic equation 10 D, App A, Table] quadratic fie]d(s) 347 complex 347 A imaginary 347 A real 347.A quadratic form(s) 348 (on a linear space) 256.H bilinear form associated with 256 H binary 348 M complex 348.A,B definite 348 C equivalent 348 A indefinite 348.C matrix of 348 A negative definite 348 C negative semidefinite 348 C nondegenerate 348.A positive definite 348.C positive semidefinite 348 C primitive binary 348 M properly equivalent binary 348.M rea] 348.A,C reduced 348.1 Siegel zeta function of indefinite 450 K quadratic irrational number, irreducible 83.C quadratic loss function 398 A 399 E quadratic nonresidue 297 H quadratic programming 349 nonconvex 264.D quadratic programming problem 292.A 349.A quadratic reciprocity of Jacobi symbol, Jaw of 297 I quadratic reciprocity of Legendre symbol, Jaw of 297 I quadratic residue 297 H quadratic transformation ]6 [ ]oca]]y 16.K ]oca]]y (of an algebraic surface) 15 G ]oca]]y (of a complex manifold) 72 H quadratrix ] 87 quadrature J07.A, App A, Table ]5.m of a circle J79.A method of 3] 3 D solution by App. A, Table 14 I spectra] density 397 N quadric(s) 350 A 2084 confocal, family of 350.E Darboux ] 10 B pencil of 343.E quadric cone 350.B,G quadric conical hypersurface 350 G quadric conical surface 350.B quadric cylindrical hypersurface 350.G quadric hypersurface 350.G,I (in a projective space) 343 D 350 I centra] 350.G emptic 350.G of the hth species, singular (in a projective space) 343.E hyperbolic 350.G noncentra] 350 G parabolic 350 G pencil of 343 E properly (n  1)-dimensiona] 350.G regular (in a projective space) 343 E quadric surface(s) 350 canonical form of the equation of 350 B central 350 B degenerate 350.B proper 350.B quadrivium ]87 qualification Gauignard's constraint 292.B Slater's constraint 292.B qualitative (data) 397.A quality 404.A quality characteristic 404.A quality control 404.A statistical 404.A quantifier 411.C bounded 356 B existential 4] J.C Hilbert's a- 41 J.J universal 4]] C quantile Ct- 396.C of order p 341 H restricted 33.B quantitative (data) 397.A quantity (quantities) analog 138.B digital ]38.B first fundamental (of a surface) ] I1.H second fundamental (of a surface) l1J.H thermodynamical 419.A quantization 35J.D second 377.B quantized contact transformation 274.F quantum chromodynamics (QCD) ]32.C,D quantum electrodynamics 132.C quantum field theory J32.C 150.C quantum flavor dynamics (QFD) 132 D quantum logic 35J.L quantum mechanics 35] quantum number azimuthal 351.E orbital magnetic 35J.E principal 351 E quantum statistical mechanics 402.A quartic equation ]0 D, App A, Tab]e ] quartile(s) 396.C first 396.C third 396.C quasi-affine (algebraic variety) 16.C quasi-algebraicaHy dosed field 1]8 F quasi-analytic function 58.F 
2085 family of 58.A in the generalized sense 58.F set of 58.F quasi-Banach space 37.0 quasibarreled (locally convex space) 424.1 quasi bounded harmonic function 193.G quasicomplete (locally convex space) 424.F quasiconformal mappings 352 extremal 352.C quasiconformal reflection, theorem of 352.C quasicontinuity principle (in potential theory) 338.1 quasicontinuous function 338.1 quasidiscrete spectrum 136.E quasidual space (of a locally compact group) 437.1 quasi-equivalent unitary representation 437.C quasi-everywhere (in potential theory) 338.F quasi-Frobenius algebra 29.H quasi-Fuchsian group 234.B quasigroup 190.P 241.C quasi-independent of path (a response probability) 346.0 quasi-invariant measure 225.1 quasi-inverse (in a Banach algebra) 36.C quasi-inverse element (of an element of a ring) 368.B quasi-invertible element (of a ring) 368.B quasilinear (operator) 224.E (partial differential equation) 320.A 323.0 326.A quasilocal ring 284.0 quasinilpotent (operator) 251.F quasinorm (of a vector) 37.0 quasinormal family (of analytic functions) 435.E quasinormed linear space 37.0 quasi-perfect mapping 425.CC quasiperiodic (translational flow) 126.L quasiperiodic motion 136.0404.F quasiperiodic solution (of Hill's differential equation) 268.B quasiprojective algebraic variety 16.C quasiprojective morphism 16.E quasiprojective scheme 16.E quasi-p-valent 438.E quasiregular element (of a ring) 368.B quasiregular function, K- 352.B quasisemilocal ring 284.0 quasisimple ring 368.E quasisplit algebraic group, k- 13.0 quasistable distribution 341.0 q uasistatic adiabatic process 419.B quasistationary electric circuit 130.B quasisymmetric 384.E quasivariational inequalities 440.0 quaternion 29.B quaternion algebra 29.0 generalized 29.0 Hamilton 29.B total definite 27.0 quaternion field 29.B quaternion group 151.B generalized 151.B quaternion hyperbolic space 412.0 quaternion unimodular group 412.0 quaternion vector bundle 147.F query 96.F questions 351.L queue 96.E length 260.H queuing model 260.H 307.C Subject Index r-section queuing theory 260.H 307.C quotient(s) (of an ideal and a subset of a commutative ring) 67.B (in a lattice) 243.F (of numbers) 297.A (of an ordered set) 311.B difference 104.A differential (at a point) 106.A field of 67.G geometric 16.W group of (of a commutative semigroup) 190.P Herbrand 200.N integral (in the division algorithm of poly- nomials) 337.C module of, of an R-module with respect to S 67.0 prime (in a lattice) 243.F Rayleigh 68.H 304.B ring of, of a ring with respect to a prime ideal 67.0 ring of, of a ring with respect to S 67.0 ring of total 67.0 quotient bialgebra 203.0 quotient bundle 147.F (of a vector bundle on an algebraic variety) 16.Y quotient category 52.N quotient chain complex 200.C quotient coalgebra 203.F quotient complex 20l.L quotient group (of a group) 190.C (of a topological group) 423.E quotient G-set 362.B quotient lattice 243.C quotient Lie algebra 248.A quotient Lie group 249.0 quotient (linear) space (by a linear subspace) 256.F (with respect to an equivalence relation) 256.F quotient mapping 425.0 hereditarily 425.0 quotient measure 225.H quotient object 52.0 quotient representation (of a linear representation) 362.C quotient set (with respect to an equivalence relation) 135.B quotient singularity 418.C quotient space (by a discontinuous transformation group) 122.A (of a linear space) 256.F (of a topological space) 425.L left (of a topological group) 423.E right (of a topological group) 423.E quotient system (of an algebraic system) 409.C quotient topological space 425.L quotient topology 425.L R R (real numbers) 294.A 355.A p-set 308.1 r-closed space 425. U r-frame, tangent 105.H r-ple point (of a plane algebraic curve) 9.B r-section (of a Euclidean (simplicial) complex) 70.B 
Subject Index r-skeleton (of a Euclidean complex) (of a simplicial complex) 70.C r-skeleton (of a Euclidean complex) 70.B rth differential 286.E rth syzygy 369.F R-estimator 371.H R-progenerator 29.K (R, k)-summable 379.S (R, S)-exact sequence (of modules) 200.K (R, S)-injective module 200.K (R, S)-projective module 200.K R-action (continuous) 126.B R"-valued random variable 342.C Raabe criterion App. A, Table 10.II Racah algebra 353.A Racah coefficient 353.B Rademacher-Men'shov theorem 317.B Rademacher system of orthogonal functions 317.C radial equation 351.E radial maximal function 168.B radian 139.0 radiation condition, Sommerfeld 188.0 radical(s) (of an algebraic groUp) 13.1 (of a Banach algebra) 36.0 (of a commutative Banach algebra) 36.E (of a commutative ring) 67.B (of an ideal) 67.B (of a Jordan algebra) 231.B (of a Lie algebra) 248.0 (of a ring) 368.H Jacobson (of a ring) 67.0 nilpotent (of a Lie algebra) 248.0 solution by (of an algebraic equation) 10.0 solvable by 172.H unipotent 13.1 radius (radii) (of a solid sphere) 140 (of a sphere) 139.1 associated convergence 21.B of convergence (of a power series) 339.A of curvature (of a plane curve) 111.E of curvature (of a space curve) II1.F injectivity 178.C of meromorphy (of a power series) 339.0 of principal curvature (of a surface) 111.H spectral 126.K 251.F 390.A of torsion (of a space curve) I11.F Rado solution, Oouglas- (to Plateau problem) 275.C Radon, decomposition formula of 125.CC Radon integral, Lebesgue- 94.C Radon measure 270.1 positive 270.1 Radon-Nikodym derivative 270.L 380.C Radon-Nikodym property 443.H Radon-Nikodym theorem 270.L 380.C for vector measures 443.H Radon transform 218.F conjugate 218.F raising the subscripts (for tensor fields) 417.0 Ramanujan conjecture 32.0 Ramanujan-Petersson conjecture 32.0 Ramanujan sum 295.0 ramification, degree of (of a bram;h point) 367.B ramification field (ofa prime ideal) 14.K mth 14.K ramification group (of a finite Oalois extension) 257.0 (of a prime ideal) 14.K 2086 mth 14.K ramification index (of an algebroidal function) 17.C (of a finite extension) 257.0 (of a prime ideal over a field) 14.1 (of a valuation) 439.1 relative (of a prime ideal over a field) 14.1 ramification numbers (of a prime idea!) 14.K ramification point (of an analytic covering space) 23.E ramification theorem (in the theory of algebroidal functions) 17.C conductor- (in class field theory) 59.C ramified (prime ideal) 14.1 ramified covering space 23.B ramified element 198.0 ramified type theory 411.K random, at 401.F random current 395.1 random distribution 395.H 407.C with independent values at every point 407.C strictly stationary 395.H strongly stationary 395.H weakly stationary 395.C in the wider sense 395.C in the wide sense 407.C random effect 102.A random-effects model 102.A 403.C random event 342.B random field 407.B randomization 102.A randomized (decision function) 398.A randomized block design 102.B randomized design, completely 102.A randomized estimator 399.B randomized test 400.A random measure Oaussian 407.0 Poisson 407.0 random numbers 354 pseudo- 354.B random sample 374.A 396.B 401.F random sampling procedure 373.A random Schrodinger equations 340.E random sequence 354.E random tensor field 395.1 random variable(s) 342.C distribution of 342.C independent 342.C joint 342.C measurable with respect to a family of 342.C n-dimensional 342.C one-dimensional probability distribution of 342.C probability distribution of 342.C R"-valued 342.C (S, (f)-valued 342.C random walk 260.A general 260.A standard 260.A range (of a correspondence) 358.B (of a linear operator) 37.C (of a mapping) 381.C (of a population characteristic) 396.C (of statistical data) 397.C closed, theorem 37.1 equianharmonic (of points) 343.0 harmonic (of points) 343.0 
2087 interquartile 397.C long 375.8 metastable (of embeddings) 114.0 numerical (of a linear operator) 251.E point (in projective geometry) 343.8 point, of the number system (in projective geometry) 343.C sample 396.C short 375.8 stable (of embeddings) 114.0 of values (of a meromorphic function) 62.A rank (of an analytic mapping) 23.C (of a bilinear mapping) 256.H (ofa complex) 13.R (of a connected compact Lie group) 248.X (of an element of a complex) 13.R (of an elliptic curve over Q) 118.0 (of first-order predicates) 411.K (ofa free Abelian group) 2.C (of a free group) 161.A (of a free module) 277.0 (of a graph) 186.0 (of a Lie algebra) 248.K (of a linear mapping) 256.F (of a matrix) 269.0 (of a module) 2.E (of a normalj-algebra) 384.C (of a prime ideal) 67.E (of a quadratic form) 348.A (of a sesquilinear form) 256.Q (of a symmetric Riemannian homogeneous space) 412.0 (of a Tits system) 151.J (of a valuation) 439.8 bispinor 258.8 of finite (operator) 68.C k- 13.Q p- (of a torsion-free additive group) 2.E at a point (of an analytic mapping) 23.C rational (of a valuation) 439.8 rank correlation Kendall 371.K Spearman 371.K rank function 66.F Rankine-Hugoniot relation 204.0 205.8 rank k, irreducible tensor of 353.C rank test signed 371.8 Wilcoxon signed 371.8 Rao inequality, Cramer- 399.0 Raphson method, Newton- 301.0 rapidly decreasing COO-function 168.8 rapidly decreasing distribution 125.0 rapidly decreasing sequence 168.8 rarefied gas, equation of 41.A rate coding 213.0 hazard 397.0 infinitesimal birth 260.0 infinitesimal death 260.G of interest, assumed 214.A transmission 213.A ratio anharmonic 343.0 of the circumference of a circle to its diameter 332 cross 343.D damping (of a damped oscillation) 318.8 direction (of a line in an affine space) 7.F Subject Index Reaction, law of double 343.0 likelihood 400.1 likelihood, test 400.1 monotone likelihood 374.1 odds 397.K Poisson 271.0 sequential probability, test 400.L stiffness 303.0 ratio ergodic theorem 136.8 ratio estimator 373.C rational action 226.8 rational cohomology group 200.0 rational curve 9.C 93.H rational differential equation 288.A rational divisor k- (on an algebraic curve) 9.C prime 9.C rational double point 418.C rational element 198.0 rational entire function 429.A rational expression 337.H field of 337.H rational extrapolation method 303.F rational function(s) field of 337.H generali,'ed 142.8 on a variety 16.A rational function field in n variables 149.K rational function matrix 86.0 rational homomorphism 3.C 13.A rational injectivity 200.0 rational integer 294.C rationally equivalent cycles l6.R rational mapping 16.1 defined along a subvariety 16.1 purely inseparable 16.1 separable 16.1 rational number(s) 294.D denseness of 355.8 rational operation 294.A rational point l18.E over a field 369.C k' - (of an algebraic variety) 16.A rational polyhedral, convex l6.z rational Pontryagin class 56.F rational rank (of a valuation) 439.8 rational real number 294.E rational representation (of GL(V)) 60.0 (of a matrix group) 226.8 rational singularity 418.C rational surface 15.E rational variety 16.1 uni- 16.1 ratio set 136.F asymptotic 308.1 ray (in affine geometry) 7.0 (in foundation of geometry) 155.8 (modulo m*) 14.H (in a Riemannian manifold) 178.F asymptotic 178.F grazing 325.L paraxial 180.8 unit 351.8 Rayleigh principle 68.H Rayleigh quotient 68.H 298.C 304.8 Rayleigh-Ritz method 46.F 271.0 Rayleigh-Schrodinger series 331.0 reaction, law of 271.A 
Subject Index Real analytic (at a point) real analytic (at a point) 106.K real analytic fiber bundle 147.0 real analytic foliation 154.H real analytic function 106.K 198.H exponentially decreasing 125.BB real analytic manifold 105.0 real analytic structure 105.0 real axis 74.C real closed field 149.N real-compact space 425.BB real field 149.N formalJy 149.N totally 14.F real form 412 (of a complex algebraic group) 60.0 (of a complex Lie algebra) 248.P normal (of a complex semisimple Lie algebra) 248.Q real function 165.B real Grassmann manifold 199.B real Hilbert space 197.B real hyperbolic space 412.G real hypersphere 76.A oriented 76.A real hypersurface, spherical 344.C real immcrsion, totally 365.M real infinite prime divisor 439.H real interpolation space 224.C realizable (for a linear representation) 362.F (by a submanifold) 114.G realization (of a linear time-varying system) 86.0 (of an s.s. complex) 70.E (of an s.s. mapping) 70.E minimal 86.D realization theorem (of a homotopy group) 202.N realization theory 86.0 real Lie algebra 248.A compact 248.P real line 355.E real linear space 256.A real monoidal transform 274.E real number(s) 294.E 355 Cantor's theory of 294.E completeness of 294.E 355.B connectedness of 294.E continuity of 294.E Oedekind's theory of 294.E extended 87.E infinitesimal 276.E irrational 294.E mod 1 355.0 nonstandard 276.E rational 294.E real part 74.A real prime divisor 439.H real projective space 343.0 infinite-dimensional 56.B real quadratic field 347.A real quadratic form 348.A,C real representation (of a Lie group) 249.0 real root (of an algebraic equation) lO.E real simple Lie algebra classical compact 248.T exceptional compact 248.T real spectral measure 390.0 real Stiefel manifold of k-frames 199.B of orthogonal k-frame 199.B 2088 real submanifold, totally 365.M real-time (computation) 19.E real topological vector space 424.A real-val ued functions 165.B real-valued measurable (cardinal) 33.F real variable 165.C rearrangement 168.B rearrangement invariant 168.B reciprocal equation 10.C reciprocal linear representation (of an algebra) 362.C reciprocal network 282.C reciprocal permutation representation (of a group) 362.B reciprocal spiral 93.H reciprocity of annihilators (in topological Abelian groups) 422.E Artin's general law of 59.C complementary law of 14.0 Fourier 160.C general law of 14.0 law of quadratic, of Jacobi symbol 297.1 law of quadratic, of Legendre symbol 297.1 relations,Onsager's 402.K reciprocity law 297.1 for Oedekind sums 328 explicit (for Hilbert norm-residue symbol) 14.R Oel'fand- Pyatetskii-Shapiro (on unitary repre- sentation) 437.00 Shafarevich 257.H record 96. B rectangle 140 latin 241.E rectangular coordinates (in a Euclidean space) 90.B rectangular distribution App. A, Table 22 rectangular hyperbola 78.E rectangular hyperbolic coordinates 90.C rectangular matrix 269.A rectangular parallelepiped 140 rectifiable (current) 275.G (curve) 93.F 246.A locally 143.A 246.A rectifying plane 111.F rectifying surface II1.F rectilinear complex 70.B recurrence formulas for indefinite integrals App. A, Table 9.n recurrence theorem 136.A,C recurrence time 260.C mean 260.C recurren t (Levy process) 5.G (Markov chain) 260.B (Markov process) 261.B (nonsingular measurable transformation) 136.C (point of a dynamical system) 126.E chain 126.E infinitely (measurable transformation) 136.C linear (sequence) 295.A non- (Markov chain) 260.B null (point) 260.B positive (ergodic class) 260.B positive (point) 260.B regionally (flow) 126.E strongly (measurable transformation) 136.C 
2089 recurrent chain 260.B recurrent event 250.0 260.C delayed 260.C recurrent point (of a Markov chain) 260.B (of a Markov process) 261.B recurrent seq uence of order r 295.A recurrent set 260.E chain 126.E recursive function(s) 356 general 356.C,F partial 356.E,F primitive 356.A,B,F uniformly primitive 356.B recursi vel y (define a partial recursive function) 356.E uniformly in 'I' 356.E recursively enumerable predicate 356.0 recursively enumerable set 356.0 recursive predicate general 356.C primitive 356.B recursive set 97 356.0 general 97 red uced (a closed linear subspace) 251.L (latin sq uare) 241.A (scheme) 16.0 reduced Abelian group 2.0 reduced algebra 231.B reduced basis (of a lattice) 92.C reduced bundle (of a principal G-bundle) 147.1 reduced character (of an algebra) 362.E reduced Clifford group 61.0 reduced cone (of a topological space) 202.F reduced dual 437.L reduced extremal distance 143.B reduced form (of a linear structural equation system) 128.C reduced homology exact sequence 20l.F reduced homology group 201.E reduced join (of homotopy classes) 202.Q (of mappings) 202.F (of topological spaces) 202.F reduced link polynomial 235.0 reduced mapping cone 202.F reduced norm (of an algebra) 362.E reduced orthogonal group 61.0 reduced product space 202.Q reduced quadratic form 348.1 reduced representation (of an algebra) 362.E reduced residue system modulo m 297.0 reduced square, Steenrod 64.B reduced square operation, Steenrod 64.B reduced suspension (of a topological space) 202.F n-fold 202.F reduced trace (of an algebra) 362.E reduced von Neumann algebra 308.C reducibility, axiom of 156.B 411.K reducible (algebraic equation) 10.B (algebraic variety) 16.A (continuous geometry) 85.A (fiber bundle) 147.J (in four color problem) 157.0 (germ of an analytic set) 23.B (linear system) 16.N (linear system in control theory) 86.C Subject Index Reflexive law (polynomial) 337.F (positive matrix) 269.N (representation) 362.C (Riemannian manifold) 364.E completely (A-module) 277.H completely (group) 190.L completely (representation) 362.C reductio ad absurdum 156.C 411.I reduction d'Alembert method of, of order 252.F good (of an Abelian variety) 3.N modulo 2r (of a representation) 277.L modulo m (of a linear representation) 362.F potential good (of an Abelian variety) 3.N potential stable (of an Abelian variety) 3.N stable (of an Abelian variety) 3.N stable (of a curve) 9.K reduction formula (of a surface) 110.A reduction theorem, cup product (on cohomology or homology of groups) 200.M reduction theory, Minkowski (on fundamental regions) 122.E reductive (algebraic group) 13.1 (homogeneous space) 199.A (Lie algebra) 248.0 reductive action 226.B defined by a rational representation 226.B geometrically 226.B linearly 226.B semi- 226.B reductive stabilizer 199.A Reeb component 154.B Reeb foliation 154.B Reeb stability theorems 154.0 Ree group 151.I Reeh-Schlieder theorem 150.E Rees lemma, Artin- '284.A Ree type group of 151.J group of Janko- 151.J reference edge 281.C refinement (of a covering) 425.R (of a descending chain in a lattice) 243.F (of a normal chain in a grouP) 190.0 barycentric 425.R cushioned 425.X d- (of a covering) 425.R star (of a covering) 425.R reflected wave 325.L reflecting barrier 115.B,C reflection (associated with <1» 13.R (of a principal space) 139.B glide 92.E Schwartz's principle of 74.E 198.0 space 359 theorem of quasiconformal 352.C reflection coefficient 387.0 reflectionless potential 387.0 reflection points (with respect to a circle) 74.E reflection positivity 150.F reflection principle 45.E refl exi ve (locally convex space) 424.0 (relation) 358.A Banach space 37.0 reflexive law (for an equivalence relation) 135.A 
Subject Index Refraction, atmospheric (on ordering) 311.A refraction, atmospheric 392 Regge behavior 386.C Regge poles 132.C 386.C regime, local 51.B region 79.A acceptance 400.A confidence 399.Q confidence, uniformly most powerful 399.Q confidence, uniformly most powerful unbiased 399.Q critical 400.A Oirichlet 234.C of discontinuity 234.A estimation 399.Q feasible 264.B 292.A Ford fundamental 234.C fundamental (of a discrete transformation group) 122.B invariance of a confidence 399.Q of relative stability 303.G star 339.0 tolerance 399.R unbiased confidence 399.Q regionally recurrent (flow) 126.E regionally recurrent on an invariant set 126.E region of absolute stability (of the Runge-Kutta (P,p) method) 303.0 regression, line of II1.F,I regression analysis 403.0 regression coefficient 397.H,J 403.0 regression function 397.1 linear 397.H 403.0 regression hyperplane 403.0 regression line 403.0 regula falsi 301.C regular (almost contact manifold) 110.E (almost periodic system) 290.B (houndary point) 120.0 (cell complex) 70.D (closed set) 125.J (coherent 6'-module) 274.G (differential form on an algehraic variety) 16.0 (Dirichlet form) 261.C (element of a connected Lie group) 249.P (element of a real Lie algebra) 248.B (estimator) 399.N (Green line) 193.1 (kernel) 125.L (left ideal of a Banach algehra) 36.0 (ordinal number) 312.E (permutation group) 151.H (point for an additive process) 5.0 (point of an analytic set) 23.B 45.0 (point with respect to an analytic set) 21.M (point with respect to the Oirichlet problem) 207.B (point of a flow) 126.0 (prime number) 14.L (sampling procedure) 373.A (spectral sequence) 200.1 (submartingale) 262.0 (at a subvariety) 16.B homogeneously 275.C of the hth species 343.E at the point at infinity (for a harmonic function) 193.B along a subvariety (for a rational mapping) 16.1 2090 regular affine transformation 7.E regular Banach space 37.G regular boundary (of a diffusion process) 115.B domain with (in a Coo-manifold) 105.U regular chain (of integral elements) 428.E regular conditional probahility 342.E regular cone 384.A self-dual 384.E regular covering (space) 91.A regular element (of a ring) 368.B p- (of a finite group) 362.1 regular emhedding 105.K regular extension (of a field) 149.K regular factorization 251.N regular first integral 126.H regular form 16.0 regular function(s) 198.A on an open set (of a variety) 16.B sheaf of germs of 16.B at a subvariety 16.B regular grammar 31.0 regular graph 186.C regular integral element 19l.I 428.E regular integral manifold (of a differential ideal) 428.E regularity ahscissa of (of a Oirichlet series) 121.B axiom of (in axiomatic set theory) 33.B up to a boundary 112.F parameter of (of a Lebesgue measurable set) 380.0 regularization (of a distrihution) 125.M regularizing (kernel) 125.L regular knot projection 235.A regular local equation (at an integral point) 428.E regular local ring 284.0 regularly convex set 89.G regularly homotopic (immersion) 114.0 regularly hyperbolic (partial differential equation) 325.A,F regular mapping (between pre algebraic varieties) 16.C of class C 1 208.8 regular matrix 269.B regular measure 270.F :24- 270.F K- 270.F regular n-gon 357.A regular neighborhood 65.C regular neighborhood system 65.C regular outer measure 270.E regular perturbation 331.0 regular point (in catastrophe theory) 51.F (of a differentiable mapping) 105.1 (of a diffusion process) 115.B (for a Hunt process) 261.0 (of a polyhedron or cell complex) 65.B (of a surface in E) 11l.J semi- (of a surface in e) 1ll.J regular polygon 357.A regular polyhedra 357.B regular polyhedral angle 357.B regular polyhedral group 151.0 regular positive Radon measure 270.H regular process, positively 44.C regular projective transformation 343.0 regular representation 
2091 (of a group) 362.B (of a locally compact group) 69.B (of a topological transformation group) 437.A left (of an algebra) 362.C left(ofagroup) 362.C right(of an algebra) 362.E right (of a group) 362.E regular ring 284.0 regular ring (continuous geometry) 85.B regular sequence (of Lebesgue measurable sets) 380.0 regular singularity (of a coherent ,g'-module) 274.H regular singular point 254.B regular solution (of a differential ideal) 428.E regular space 425.Q completely 425.Q regular submanifold (of a Coo-manifold) 105.L regular system of algebraic equations 10.A of parameters (of a local ring) 284.0 regular transformation (of a linear space) 256. B (of a sequence) 379.L totally (of a sequence) 379.L regular tube 193.K regular value 105.1 regulator (of an algebraic number field) 14.0 p-adic 450.1 regulator problem, optimal 80.F Reinhardt domain 21.B complete 21.B reiteration theorem 224.0 rejection 400.A related differential equation 254.F relation(s) 358 (among elements of a group) 190.C (among the generators of a group) 161.A Adem (for Steenrod pth power operations) 64.B Adem (for Steenrod square operations) 64.B analytic, invariance theorem of 198.K antisymmetric 358.A binary 358.A 411.G canonical anticommutation 377.A canonical commutation 351.C 377.A,C coarser 135.C defining (among the generators of a group) 161.A dispersion 132.C equivalence 135.A 358.A Euler 419.B finer 135.C Fuchsian 253.A functional (among components of a mapping) 208.C functional, of class C' 208.C fundamental (among the generators of a group) 161.A 419.A Oibbs-Ouhem 419.B Heisenberg uncertainty 351.C Hurwitz (on homomorphisms of Abelian varie- ties) 3.K identity 102.1 incidence 282.A inverse 358.A Legendre 134.F, App. A, Table 16.1 Maxwell 419.B n-ary 411.0 normal commutation 150.0 order 31 LA Subject Index Relative nullity, index of orthogonality (on irreducible characters) 362.0 orthogonality (for square integrable unitary representations) 437.M period I1.C Plucker (on Plucker coordinates) 90.B prey-predator 263.B proper equivalence (in an analytic space) 23.E Rankine-Hugoniot 204.0 205.B reciprocity, On sager's 402.K reflexive 358.A Riemann-Hurwitz 367.B Riemann period 3.L 11.C stronger 135.C symmetric 358.A tiansitive 358.A weaker 135.C relationship algebra 102.1 relative Alexander cohomology group 201.M relative algebraic number field 14.1 relative boundary 367.B relative Bruhat decomposition 13.Q relative tech cohomology group 201.M relative tech homology group 201.M relative chain complex 200.C relative cochain complex 200.F relative cohomology group 215.W relative complement (at two sets) 381.B relative components (of a Lie transformation group) 110.A relative consistency 156.0 relative degree (of a finite extension) 257.0 (of a prime ideal over a field) 14.1 relative derived functor 200.K relative different 14.1 relative discriminant 14.1 relative entropy 212.B relative extremum, conditional 106.L relative frequency (of samples) 396.C relative homological algebra 200.K relative homotopy group 202.K relative integral invariant 219.A Cartan's 219.B relative invariant 12.A 226.A relative invariant measure 225.H relatively ample sheaf 16.E relatively bounded (with respect to a linear operator) 331.B relatively closed set 425.1 relatively compact (with respect to a linear operator) 331.B (maximum likelihood method) 399.M (set) 425.S (subset) 273.F relatively dense 126.E relatively invariant measure 225.H relatively minimal 15.G 16.1 relatively minimal model 15.0 relatively open set 425.1 relatively prime (fractional ideals) 14.E (numbers) 297.A relatively stable 303.G relative maximum (of a function) 106.L relative Mayer- Vietoris exact sequence 201.L relative minimum (of a function) 106.L relative neighborhood 425.1 relative norm (of a fractional ideal) 14.1 relative nullity, index of 365.0 
Subject Index Relative open set relative open set 425.1 relative ramification inde'\( (of a prime ideal over a field) 14.1 relative singular homology group 201.L relative stability 303.0 interval of 303.0 region of 303.0 relative topology 425.1 relative uniformity 436.E relative uniform star convergence 310.F relativistically covariant 150.0 relativity general principle of 359 general theory of 359.A special principle of 359 special theory of 359.A relativization (of a definition of primitive recursive functions) 356.B (of a topology) 425.1 (of a uniformity) 436.E relativized 356.F relaxation 215.A with projection 440.E relaxation oscillation 318.C relaxed continuity requirements, variational prin- ciples with 271.0 Rellich-Oixmier theorem 351.C Rellich-Kato theorem 331.B Rellich lemma 68.C Rellich theorem 323.0 Rellich uniqueness theorem 188.0 remainder 297.A 337.C (in Taylor's formula) 106.E Cauchy App. A, Table 9.IV Lagrange App. A, Table 9.1V Roche-Schlomilch App. A, Table 9.1V remainder theorem 337.E Chinese 297.0 Remak-Schmidt theorem, Krull- (in group theory) 190.L Remmert-Stein continuation theorem 23.B Remmert theorem 23.C removable (set for a family of functions) 169.C removable singularity (of a complex function) 198.0 (of a harmonicfunction) 193.L Renaissance mathematics 360 renewal equation 260.C theorem 260.C renormalizable 111.B 132.C 150.C super 150.C renormalization constant 150.C equation 111.B group II1.A method 111.A Renyi theorem 123.E reoriented graph 186.B repeated integral (for the Lebesgue integral) 221.E (for the Riemann integral) 216.0 repeated series by columns 379.E by rows 379.E replacement, axiom of 33.B 381.0 replacement, model 307.C replica 13.C replication 102.A 2092 number of 102.B represent (a functor) 52.L (an ordinal number) 81.B representable (functor) 52.L linearly (matroid) 66.H representation(s) 362.A (of an algebraic system) 409.C (of a Banach algebra) 36.0 (of a Jordan algebra) 231.C (of a knot group) 235.E (of a lattice) 243.E (of a Lie algebra) 248.B (of a mathematical system) 362.A (of a vector lattice) 310.0 absolutely irreducible 362.F adjoint (of a Lie algebra) 248.B adjoint (of a Lie group) 249.P adjoint (of a representation) 362.E analytic (of GL(V)) 60.B in terms of arc length (of a continuous arc) 246.A canonical (of Gaussian processes) 176.E completely reducible 362.C complex (of a Lie group) 249.0 complex conjugate 362.F conjugate 362.F contragredient 362.E coregular (of an algebra) 362.E cyclic (of a C*-algebra) 36.0 cyclic (of a topological group) 437.A differential (of a unitary representation of a Lie group) 437.S direct sum of 362.C double-valued 258.B dual 362.E equivalent 362.C factor (of a topological group) 43 7 .E factor, of type I, II, or m 308.M 437.E faithful 362.B Fock 150.C Oel'fand (of a commutative Banach algebra) 36.E generalized canonical (of Gaussian processes) 176.E generating (of a compact Lie group) 249.U half-spin (even, odd) 61.E Herglotz's integral 43.1 induced 362.G induced (of a finite group) 362.0 induced (of a unitary representation of a sub- group) 437.0 integral (of a group) 362.0,K integral, Cauchy's 21.C irreducible (of an algebra or a group) 362.C irreducible (of a Banach algebra) 36.0 irreducible projective 362.1 isomorphic 362.C isotropy 431.C Kiillen-Lehmann 150.0 kernel (of a Oreen's operator) 189.B l-adic 3.E Lax 287.B,C 387.C left regular (of a group) 362.B linear  linear representation list 186.0 Mandelstam B2.C matrix 362.0 modular (of a finite group) 362.0 
2093 module of 69.0 momentum 351.C without multiplicity 437.G normal (of a von Neumann algebra) 308.C ordinary (of a finite group) 362.0 parametric 165.C parametric (of Feynman integrals) 146.B parametric (of a subspace of an affine space) 7.C permutation (of a group) 362.B permutation, reciprocal (of a group) 362.B polynomial (of GL(V)) 60.0 position 351.C projective (of a group) 362.1 projective, irreducible 362.1 quotient (of a linear representation) 362.C rational (ofGL(V)) 60.0 rational (of a matrix group) 226.B real (of a Lie group) 349.0 reciprocal linear (of an algebra) 362.C reciprocal permutation (of a group) 362.B reduced (of an algebra) 362. E reducible 362.C regular (of a locally compact group) 69.B regular (of a topological transformation group) 437.A regular, left (of an algebra) 362.C regular, left (of a group) 362.B regular, right (of an algebra) 362.E regular, right (of a group) 362.B Schrooinger 351.C semisimple 362.C similar 362.C similar matrix (semilinear mapping) 256.0 similar projective 362.J simple 362.C slice 431.C special (of a Jordan algebra) 231.C spectral 390.E spherical (of a differentiable manifold) II1.G spherical (of a space curve) l11.F spherical (of a unimodular locally compact group) 437.Z spin 61.E spin (of SO(n)) 60.1 spinor, of rank 258.B strongly continuous (of a topological group) 69.B sub- 362.C sub- (of a projective representation) 362.1 tensor (of a general linear group) 256.M tensor product of 362.C translation, theorem 375.H transposed 362.E tree 96.0 unit (of a group) 362.C unitary  unitary representation vector (of a Clifford group) 61.0 weakly continuous (of a topological group) 69.B zero (of an algebra) 362.C representation module (of a linear representation) 362.C faithful 362.C representation problem (on surfaces) 246.1 representation ring 237.H representation space (of a Banach algebra) 36.0 (of a Lie algebra) 248.B (of a Lie group) 249.0 Subject Index Resolvent set (of a linear operator) (of a unitary representation) 437.A representative (of an equivalence class) 135.B representative function (of a compact Lie group) 249.U representative ring (of a compact Lie group) 249.U representing function (of a predicate) 356.B representing function (of a subset) 381.C representing measure 164.C reproducing kernel 188.G reproducing property (of a probability distribution) 341.E, App. A, Table 22 reproduction function 263.A requirements, variational principles with relaxed continuity 271.G reserve, liability 214.B residual (subset of a directed set) 311.0 residual limit set 234.E residual set 126.H 425.N residual spectrum 390.A residue(s) (of a complex function) 198.E calculus of 198.F of the nth power (modulo p) 14.M norm- (modulo p) 14.P norm- (symbol) 14.Q 257.F power- (symbol) 14.N quadratic 297.H residue character 295.0 residue class (modulo an ideal in a ring) 368.F residue (class) algebra 29.A residue (class) field 149.C 368.F 439.B residue (class) ring (modulo an ideal) 368.F residue system modulo m complete 297.0 reduced 297.G residue theorem 198.E (on a nonsingular curve) 9.E resistance, negative 318.B resistance, specific 130.B resolution 200.H complete free (of Z) 200.N complex spectral 390.E flabby 125.W of the identity 390.0 injective (in an Abelian category) 200.1 21 102.1 21 + 1 102.1 minimal 418.C projective (in an Abelian category) 200.1 right (of an A-module) 200.F right injective (of an A-module) 200.F of singularities 16.L of singularities (of an analytic space) 23.0 418.B spectral 390.E standard (of Z) 200.M resolutive 207.B resolutive compactification 207.B reso I ven t (of a kernel) 217.0 (of a linear operator) 251.F (operator of a Markov process) 261.0 cubic App. A, Table 1 resolvent (operator of a Markov process) 261.0 resolvent convergence norm 331.C strong 331. C resolvent equation 251.F resolvent set (of a linear operator) 251.F 390.A 
Subject Index Resonance model, dual resonance model, dual 132.C resonance pole 331.F resonance theorem 37.H response 405.A response surface 102.M designs for exploring 102.M rest energy 359.C restitutive force 318.B rest point (of a trajectory) 126.0 restricted (Lorentz group) 258.A restricted Burnside problem (in group theory) 161.C restricted differential system 191.1 restricted direct product 6.B (of an infinite number of groups) 190.L (of locally compact groups) 6.B restricted holonomy group 80.0 364.E restricted homogeneous holonomy group 364.E restricted homotopy 202.B restricted Lie algebra 248. V restricted minimal condition (in a commutative ring) 284.A restricted quantifier 33.B restricted three-body problem 420.F restriction (of a connection) 80.F (of a continuous flow) 126.0 (of a distribution) 125.0 (of a mapping) 381.C (in a presheaf) 383.A crystallographic 92.A scalar (of a B-module) 277.L unitary (of a semisimple Lie algebra) 248.P resultant(s) 369.E system of 369.E retardation 163.A retarded differential equation 163.A retarded type (functional differential equation) 163.A retract 202.0 absolute 202.0 absolute neighborhood 202.0 deformation 202.0 fundamental 382.C fundamental absolute (FAR) 382.C fundamental absolute neighborhood (F ANR) 382.C neighborhood 202.0 neighborhood deformation 202.0 strong deformation 202.0 retraction 202.0 retrieval, information (system) 96.F retrospective study 40.E return 127.C first- (mapping, map) 126.C maximum 127.B Reuleaux triangle 89.E II1.E reversal, time 258.A reversed process 261.F review technique, program evaluation and 376 revolution ellipsoid of 350.B elliptic paraboloid of 350.B hyperboloid of, of one sheet 350.B hyperboloid of, of two sheets 350.B surface of 111.1 Reynolds law of similarity 205.C Reynolds number 116.B 205.C Reynolds number, magnetic 259 2094 Riccati differential equation App. A, Table 14.1 generalized App. A, Table 14.1 matrix 86.E Riccati equation, matrix 405.0 Ricci curvature 364.0 Ricci equation 365.C Ricci formula 417.B, App. A, Table 4.II Ricci tensor 364.0 417.B, App. A, Table 4.n Richard paradox 319.B Richardson method 302.C Riemann, O. F. B. 363 P-function of 253.B Riemann bilinear relations, Hodge- 16.V Riemann continuation theorem 21.F Riemann differential equation App. A. Table 18.1 Riemann (differential) equation, Cauchy- 198.A 274.0 (for a holomorphic function of several complex variables) 21.C (for a holomorphic function of two complex variables) 320.F Riemann function (of a Cauchy problem) 325.0 Riemann-Hilbert problem (for integral equations) 217.1 (for linear ordinary differential equations) 253.0 Riemann-Hurwitz formula (on coverings of a non- singular curve) 9.1 Riemann-Hurwitz relation 367.B Riemann hypothesis 450.A,I,P,Q Riemannian connection 80.K 364.B coefficients of 80.L Riemannian curvature 364.0 Riemannian foliation 154.H Riemannian geometry 137, App. A, Table 4.II Riemannian homogeneous space 199..'\ symmetric 412.B Riemannian manifold(s) 105.P 286.K 364 flat 364.E irreducible 364.E isometric 364.A locally flat 364.E normal contact 110.E reducible 364.E Riemannian metric 105.P pseudo- 105.P volume element associated with J05.W Riemannian product (of Riemannian manifolds) 364.A Riemannian space 364.A irreducible symmetric App. A, Table 5.1II locally symmetric App. A, Table 4.n symmetric  symmetric Riemannian space Riemannian submanifold 365 Riemann integrable (function) 216.A Riemann integral 37.K 216.A Riemann-Lebesgue theorem 159.A 160.A Riemann lower integral 216.A Riemann mapping theorem 77.B Riemann matrix 3.1 Riemann method of summation 379.5 Riemann non-Euclidean geometry 285.A Riemann period inequality 3.L Riemann period relation 3.L Riemann P-function App. A, Tables 14.II 18.1 Riemann problem 253.0 Riemann-Roch group 366.0 Riemann-Roch inequality (on algebraic surfaces) 15.0 
2095 Riemann-Roch theorem(s) 366 (on algebraic surfaces) 15.0 (for compact complex surface) 366.C (on nonsingular algebraic curves) 9.C (on Riemann surfaces) 1\.0 for an adjoint system 15.0 for differentiable manifolds 237.0 generalized (on algebraic curves) 9.F for a line bundle 366.C Riemann-Roch type Orothendieck theorem of 366.0 Hirzebruch theorem of 366.B Riemann sphere 74.0 Riemann-Stieltjes integral 94.B 166.C Riemann structure, Cauchy- 344.A Riemann sum 216.A Riemann surface(s) 367 abstract 367.A classification theory of 367.E closed 367.A elliptic 367.0 hyperbolic 367.0 maximal 367.F open 367.A open, of null boundary 367.E open, of positive boundary 367.E parabolic 367.0 prolongable 367.F Riemann theorem (on removable singularities) 198.0 (on series with real terms) 379.C Riemann e function 3.L Riemann upper integral 216.A Riemann (function 450.V Riesz convexity theorem 88.C Riesz decomposition (in a Markov chain) 260.0 of a superharmonic or subharmonic function 193.S Riesz- Fischer theorem 168.B 317.A Riesz group 36.H Riesz method of order k, summable by 379.R Riesz method of summation of the kth order 379.R Riesz potential 338.B Riesz-Schauder theorem 68.E Riesz space 310.B Riesz (F.) theorem (on Lp functions) 317.B (representation) 197.F Riesz (F. and M.) theorem 168.C (on bounded holomorphic functions on a disk) 43.0 Riesz-Thorin theorem 224.A Riesz transform 25\.0 right, limit on the 87.F right-adjoint (linear mapping) 256.Q right adjoint functor 52.K right A-module 277.0 right angle 15\.0 right annihilator (of a subset of an algebra) 29.H right Artinian ring 368.F right balanced (functor) 200.1 right circular cone 350.B right conoid 111.1 right continuous (function) 84.B right coset (of a subgroup of a group) 190.C right coset space (of a topological group) 423.E right decomposition, Peirce (in a unitary ring) 368.F Subject Index Ring(s) right derivative 106.A right derived functor 200.1 right differentiable 106.A right endpoint (of an interval) 355.C right equivalent 5\.C right exact (functor) 200.1 right global dimension (of a ring) 200.K right G-set 362.B right helicoid 111.1 right ideal (of a ring) 368. F integral 27.A right injective resolution (of an A-module) 200.F right invariant Haar measure 225.C right invariant tensor field (on a Lie group) 249.A right inverse (of dfl (0)) 286.0 righ t in verse elemen t (of an elemen t of a ring) 368.B right linear space 256.A right majorizing function 316.E right Noetherian ring 368.F right 0, ideal 27.A right operation (of a set to another set) 409.A right order (of a g-lattice) 27.A right parametrix 345.A right projective space 343.H right quotient space (of a topological group) 423.E right regular representation (of an algebra) 362.E (of a group) 362.B right resolution (of an A-module) 200.F right satellite 200.1 right semihereditary ring 200.K right semi-integral 68.N right shunt 115.B right singular point (of a diffusion process) 115.B right superior function 316.E right translation 249.A 362.B right uniformity (of a topological group) 423.0 rigid (characteristic class of a foliation) 154.G (isometric immersion) 365.E rigid body 271.E rigidity (of a sphere) 111.1 modulus of 271.0 rigidity theorem 178.C strong 122.0 ring(s) 368 adele (of an algebraic number field) 6.C affine 16.A anchor 410.B Artinian 284.A associated graded 284.0 basic (of a module) 277.0 Boolean 42.C Burnside 431.F category of 52.B category of commutative 52.B Chow (of a projective variety) 16.R cobordism 114.H coefficient (of an algebra) 29.A coefficient (of a semilocal ring) 284.0 coherent sheaf of 16.E cohomology 201.1 cohomology, of compact connected Lie groups App. A, Table 6.1V 
Subject Index Ringed space cohomology, of an Eilenberg-MacLane com- plex App. A, Table 6.m commutative 67 368.A complete local 284.0 completely integrally closed 67.1 completely primary 368.H complete Zariski 284.C completion, with respect to an ideal 16,X complex cobordism 114.H of convergent power series 370.B coordinate (of an affine variety) 16.A correspondence (of a nonsingular curve) 9.H de Rham cohomology (of a differentiable mani- fold) 105.R 201.1 differential 113 differential extension 113 of differential polynomials 113 discrete valuation 439.E division 368.B endomorphism (of an Abelian variety) 3.C endomorphism (ofa module) 277.B 368.C of endomorphisms (of an Abelian variety) 3.C factor, modulo an ideal 368.F form 284.0 of formal power series 370.A of fractions 67.0 generalized Boolean 42.C Oorenstein 200.K graded 369.B ground (of an algebra) 29.A ground (of a module) 277.0 group (of a compact group) 69.A Hecke 32.0 Hensel 370.C Henselian 370.C hereditary 200.K homogeneous 369.B homogeneous coordinate 16.A integrally closed 67.1 Krull 67.J left Artinian 368.F left hereditary 200.K left Noetherian 368.F left semihereditary 200.K lociil 284.0 local (of a subvariety) 16.B locally Macaulay 284.0 Macaulay 284.0 Macaulay local 284.0 Noetherian 284.A Noetherian local 284.0 Noetherian semilocal 284.0 normal 67.1 normed 36.A of operators 308.C of p-adic integers 439.F polynomial 337.A 369.A polynomial, in m variables 337.B of polynomials 337.A 369 power series 370.A of power series 370 primary 368.H primitive 368.H principal ideal 67.K Priifer 200.K pseudogeometric 284.F quasilocal 284.0 quasisemilocal 284.0 quasisimple 368.E quotient 368.E 2096 of quotients of a ring with respect to a prime ideal 67.0 of quotients of a ring with respect to a subset of the ring 67.0 regular 85.B 284.0 regular local 284.0 representation 237.H representative (of a compact Lie group) 249.U residue class, modulo an ideal 368.F right Artinian 368.F right hereditary 200.K right Noetherian 368.F right semihereditary 200.K of scalars (of a module) 277.0 semi hereditary 200.K ,,,,,,.......l......""nl .,0/1 T\ ",,,Ull1v\,,;al ":;'O"1'..I..J semiprimary 368.H semi primitive 368.H semisimple 368.0 simple 368.0 splitting 29.K topological 423.P of total quotients 67.0 unitary 368.A 409.C universally Japanese 284.F of a valuation 439.B of valuation vectors 6.C Zariski 284.C zero 368.A ringed space 383.H local 383.H ring homomorphism 368.0 ring isomorphism 368.0 ring operations 368.A ripple 205.F risk Bayes 398.B consumer's 404.C posterior 399.F producer's 404.C risk function 398.A risk premium 214.B risk theory 214.C classical 214.C collective 214.C individual 214.C Ritt basis theorem (on differential polynomials) 113 Ritz method 46.F 303.1 304.B Rayleigh- 46.F 271.G Robbins-Kiefer inequality, Chapman- 399.0 Robertson- Walker metrics 359.E Robin constant 48.B Robin problem 323.F robust and nonparametric method 37l robust estimation 371.A robust method 371.A Roch  Rieman-Roch Roche-Schlomilch remainder App. A, Table 9.1V Rodrigues formula 393.B Roepstorff-Araki-Sewell inequality 402.0 Roepstorff-Fannes-Verbeure inequality 402.0 Rogers theorem, Ovoretzky- 443.0 Rokhlin theorem 114.K Rolle theorem 106.E rolling curve (of a roulette) 93.H Roman and medieval mathematics 372 Romberg integration 299.C Room square 241.0 root . 
2097 (of a chamber complex) 13.R (of a polynomial) 337.B characteristic (of an autonomous linear system) 163.F characteristic (of a linear mapping) 269.L characteristic (for a linear partial differential equation with variable coefficients) 325.F characteristic (of a matrix) . 269.F co- 13.1 imaginary (of an algebraic equation) 10.E k- t3.Q mth 10.C multiple (of an algebraic equation) 10.B negative (of a semisimple Lie algebra) 248.M of a polynomial 337.B positive (of a semisimple Lie algebra) 248.M primitive, modulo m 297.G primitive, of unity 14.L real (of an algebraic equation) 10.E of a semisimple algebraic group 13.1 of a semisimple Lie algebra 248.K simple (of an algebraic equation) 10.B simple (in a root system) 13.1 simple (of a semisimple Lie algebra) 248.M root extraction 10.C root subspace (of a linear operator) 390.B root subs pace (of a semisimple Lie algebra) 248.K root system (of a semisimple algebraic group) 13.1 (of a semisimple Lie algebra) 248.K (of a symmetric Riemannian space) 413.F (in a vector space over Q) 13.1 fundamental (of a semisimple Lie algebra) 248.N irreducible 13.L root vector 390.B Rosen gradient projection method 292.E rot (rotation) 136.0442.0, App. A, Table 3.II rotatable 102.M rotation App. A, Table 3.II (of a differentiable vector field) 442.0 (on a locally compact Abelian group) 136.0 axis of (of a surface of revolution) 111.1 rotational 20S.B rotational coordinates App. A, Table 3.V rotation group 60.1 rotation number 99.0 111.E 126.1 rotation theorem 438.B Roth theorem 118.0182.0 Rouche theorem 10.E 99.0 198.F roulette 93.H roundoff error 138.B global 303.B row (of a matrix) 269.A iterated series by (of a double series) 379.E repeated series by (of a double series) 379.E row finite matrix 269.K row nullity (of a matrix) 269.0 row vector 269.A Royden compactification 207.C Royden theorem, Arens- 36.M Ruckert zero-point theorem 23.B Ruelle scattering theorey, Haag- 150.0 ruin probability 214.C rule Adler-Weisberger sum 132.C chain (on the differentiation of composite functions) 106.C Cramer 269.M Feynman 146.A,B Subject Index a-discrete (covering of a set) formation 411.0 of inference 411.1 midpoint 303.E Napier App. A, Table 2.II product 299.0 projection 31.B selection 351.H sequential decision 398.F Simpson's 1/3 299.A Simpson's 3/8 299.A slide 19.A stopping 398.F terminal decision 398.F trapezOIdal (of numerical integration) 299.A trapezoidal (of numerical solution of ordinary differential equations) 303.E univalence superselection 351.K ruled surface (algebraic surface) 15.E (in differential geometry) 111.1 criterion of 15.E ruler 155.G 179.A run (in a sequence of Bernoulli trials) 396.C Runge-Kutta-Gill method 303.0 Runge-Kutta method 303.0 explicit 303.0 general 303.0 implicit 303.0 pseudo- 303.0 semi-explicit 303.0 semi-implicit 303.0 Runge-Kutta (P, p) method, region of absolute stability of 303.G Runge phenomenon 223.A Runge theorem (on polynomial approximation) 336.F Russell paradox 319.B Ryser-Chowla theorem, Bruck- 102.E s !/' (the totality of rapidly decreasing COO-functions) 168.B !/" (the totality of tempered distributions) 12S.N S, space of type 125.T SL(n, K) (special linear group) 60.B Sp(A) (Spur of a matrix A) 269.F SP(n, K) (symplectic group) 60.L SV(n) (special unitary group) 60.F S(n) (totality of measurable functions on n that take finite value almost everywhere) 168.B *  also star *-automorphism group 36.K *-derivation 36.K *-homomorphism 36.F *-representation (of a Banach *-algebra) 36.F *-subalgebra 443.C O'-additive measure 270.0 O'-additivity 270.0 O'-algebra 270.B optional 407.B predictable 407.B tail 342.G topological 270.C well-measurable 407.B O'-compact space 425.V O'-complete (vector lattice) 310.C O'-complete lattice 243.0 conditionally 243.0 O'-discrete (covering ofa set) 425.R 
Subject Index IT-field O'-field Bayer sufficient 396.1 boundedly complete 396.E complete 396.E O-sufficient 396.J decision theoretically sufficient 396.1 minimal sufficient 396.F pairwise sufficient 396.F test sufficient 396.1 O'-finite (measure space) 270.0 O'-function,ofWeierstrass 134.F, App. A, Table 16.1V O'-locally finite covering (of a set) 425.R O'-process (of a complex manifold) 72.H O'-space 425.Y O'-subfield necessary 396.E sufficient 396.E O'-weak topology 308.B 1: t set 22.A 1:'; set 22.0 1:-space 425.Y s-cobordism 65.C s-cobordism theorem 65.C s-field 149.A s-handle 114.F sh-factorial experiment 102.H S-admissible (lattice in R") 182.B S-flow 136.0 S-levels 102.K S-matrix 150.0 S-matrix theory 386.C S-morphism 52.G S-number 430.C S-object 52.0 S-operator 150.0 386.B S-scheme 16.0 S-set 308.1 S-topology (on a linear space) 424.K S-wave 351.E (S)-space 424.S S*-number 430.C (S, (f)-valued random variable 342.C s-parallelizable (manifold) 114.1 Sacks bound, Varshamov-Oilbert- 63.B saddle point (differential game) 108.B (of a dynamical system) 126.G (of a function) 255.B (in nonlinear programming) 292.A (on a surface) 111.H saddle-point method 25.C saddle set 126.E Sakata mOdel 132.0 Salam model, Olashow-Weinberg- 132.0 same kind (of mathematical systems) 409.B same orientation (for oriented atlases) 105.F same shape 382.A sample(s) 373.A 401.E Bernoulli 396.B random 374.A 396.B 401.F small 401.F sample autocovariancc 421.B sample characteristic 396.C sample characteristic value 396.C sample correlation coefficient 396.0 sample covariance 396.0 sample covariance function 395.G sample function 407.A sample generalized variance 280.E 2098 sample mean 396.C sample median 396.C sample mode 396.C sample moment of order k 396.C sample multiple correlation 280.E sample number, average 404.C sample partial correlation coefficient 280.E sample path 407.A sample point 342.B 396.B 398.A sample problem, k- 371.0 sample process 407.A sample range 396.C sample size 373.A sample space 342.B 396.B 398.A sample standard deviation 396.C sample survey 373 sample theory, large 401.E sample value 396.B sample variance 396.C sampling exact, theory 401.F multistage 373.E optional 262.C optional, theorem 262.A stratified 373.E two-stage 373.E sampling distribution 374.A sampling inspection 404.C with adjustment 404.C by attributes 404.C double 404.C multiple 404.C with screening 404.C sequential 404.C single 404.C by variables 404.C sampling inspection plan 404.C sampling inspection tables 404.C sampling procedure 373.A invariant 373.C random 373.A regular 373.A uniform 373.A Sard theorem 105.1 208.B Sard-Smale theorem 286.P Sasaki an manifold 11O.E Sasaki-Nitsche formula, Gauss-Bonnet- 275.C Satake diagram App. A, Table 5.II (of a compact symmetric Riemannian space) 437.AA (of a real semisimple Lie algebra) 248.U satellite I eft 200.1 right 200.1 satisfiability, prOblem of (of a proposition) 97 satisfiable (formula) 276.C Sato-Bernshte1n polynomial 125.EE Sato conjecture 450.S saturated «B, N)-pair) 151.J (fractional factorial design) 102.1 saturated model, K- 293.B Savage theorem, Girshick- 399.F Savage zero-one law, Hewitt- 342.0 savings premium 214.B Sazonov topology 341.J SCP-manifold 178.0 scal art s) (in a linear space) 256.A,J (of a module Over a ring) 277.0 
2099 field of (of a linear space) 256.A ring of (of a module) 277.0 scalar change (of a B-module) 277.L scalar curvature 364.0, App. A, Table 4.II scalar extension (of an algebra) 29.A (of an A-module) 277.L (of a linear representation) 362.F scalar field 105.0 (in a 3-dimensional Euclidean space) 442.0 free 377.C scalar integral 443.F,1 scalarly integrable 443.F,1 scalarly measurable 443.B,1 scalar matrix 269.A scalar multiple (of an element of a module) 277.0 (of a linear operator) 37.C (in a linear space) 256.A (of a vector) 442.A scalar multiplication (iri a module) 277.0 (on vectors) 442.A scalar operator 390.K scalar potential 130.A 442.0 scalar product 442.B, App. A, Table 3.1 scalar restriction (of a B-module) 277.L scalar sum (oflinear operators) 37.C scalar triple product 442.C scale of Banach space 286.Z canonical 115.B natural 260.0 ordinal 397.M two-sided 19.0 scaled, U;- 19.0 scale matrix 374.C scale parameter 396.1 4oo.E scaling, metric multidimensional 346.E scaling, multidimensional 346.E scaling method 346.E scatter diagram 397.H scattered (sheaf) 383.E scattered set 425.0 scattered zeros, function with 208.C scattering 375.A data 387.0 elastic 375.A inelastic 37 5.A scattering amplitude 375.C 386.B partial wave 375.E scattering cross section 375.A scattering data 387.0 scattering operator 375.B,F,H scattering state 375.B completeness of the 150.0 scattering theory, Haag-Ruelle 150.0 Schafheitlin formula, Sonine- App. A, Table 19.111 Schauder basis 37.L Schauder degree, Leray- 286.0 Schauder estimate 323.C Schauder fixed-point theorem 153.0 286.0 Leray- 286.0 323.0 Schauder theorem, Riesz- 68.E scheduling 376 job-shop 307.C model 307.C network 307.C scheduling and production planning 376 scheduling problem Subject Index Schmidt theorem, Knopp- flow-shop 376 job-shop 376 machine 376 multiprocessor 376 Scheffe model 346.C Scheffe theorem, Lehmann- 399.C Scheja theorem 21.M schema of Souslin 22.B scheme 16.0 adaptive 299.C affine 16.0 Aitken's interpolation 223.B algebraic 16.0 automatic integration 299.C coarse moduli 16.W complete 16.0 consistent-mass 304.0 deformation of X over a connected 16.W difference 304.E difference, of backward type 304.F difference, of forward type 304.F explicit 304.F fine moduli 16.W formal 16.X Friedrichs 304.F group 16.H Hilbert 16.S implicit 304.F integral 16.0 inverted filing 96.F irreducible 16.0 K-complete 16.0 Lax- Wendroff 304.F locally Noetherian formal 16,X moduli 16.W morphism of 16.0 Noetherian 16.0 nonadaptive 299.C Picard 16. P projective 16.E quasiprojective 16.E S- 16.0 over S 16.0 separated 16.0 separated formal 16.X Scherk's surface 275.A Schlafli diagram (of a complex semisimple Lie algebra) 248.S Schlafli formula App. A, Table 19.111 Schlafli integral representation 393.B Schlafli polynomial App. A, Table 19.1V Schlesinger equations 253.E schlicht 438.A schlicht Bloch constant 77.F schlichtartig 367.G Schlieder theorem, Reeh- 150.E Schlomilch criterion App. A, Table 10.11 Schlomilch remainder, Roche- App. A, Table 9.N Schlomilch series 39.0, App. A, Table 19.m generalized 39.0 Schmidt class, Hilbert- 68.1 Schmidt condition 379.M Schmidt expansion theorem, Hilbert- 217.H Schmidt norm, Hilbert- 68.1 Schmidt orthogonalization 317.A Gram- 317.A Schmidt procedure, Lyapunov- 286.V Schmidt theorem 118.0 Schmidt theorem, Knopp- 208.C 
Subject Index Schmidt theorem, KrulI-Remak- Schmidt theorem, Krull-Remak- (in group theory) 190.L Schmidt type, integral operator of Hilbert- 68.C Schmidt type, kernel of Hilbert- 217.1 Schnee theorem, Knopp- (on method of summation) 379.M Schoenflies notation (for crystal classes) 92.E, App. B, Table 6.1V Schoenflies problem 65.0 Schoenflies theorem 65.0 Schottky group 234.B Schottky theorem 43.J Schottky uniformization 367.C Schrader axioms, Osterwalder- 150.F Schreier conjecture (on simple groups) 15l.I Schreier extension, Artin- (of a field) 172.F Schroder equation, Konigs- 44.B Schroder functional equation 388.0 Schrodinger equation 351.0 I-body 351.E random 340.E time-dependent 351.0 time-independent 351.0 Schrodinger operator 351.0 Schrodinger picture 351.0 Schrodinger representation 351.C Schrodinger series, Rayleigh- 331.0 Schubert cycle 56.E Schubert variety 56.E Schur index (of a central simple algebra) 29.E (of an irreducible representation) 362.F Schur lemma (on linear representations) 362.C (on simple modules) 277.H 368.G (on unitary representations) 437.0 Schur subgroup 362.F Schur theorem (on linear transformations of sequences) 379.L Schur theorem, Kojima- (on linear transformations of sequences) 379.L Schur-Zassenhaus theorem (on Hall subgroups) 151.E Schwartz-Christoffel transformation 77.0, App. A, Table 13 Schwartz-Christoffel transformation formula 77.0 Schwartz integral, Bartle-Ounford- 443.G Schwartz space 424.S Schwarzian derivative App. A, Table 9.III Schwarz inequality 211.C Cauchy- 211.C, App. A, Table 8 Schwarz lemma 43.B Schwarz principle of reflection 198.0 Schwinger equation, Lippmann- 375.C Schwinger function 150.F Schwinger points 150.F sciences, information 75.F SCOres canonical 397.M factor 280.0 346.F score test, Fisher- Y ates- Terry normal 371.C scoring method 397.M screening, sampling inspection with 404.C seasonal adjustment 397.N sec (secant) 131.E secant 432.A hyperbolic 131.F sech (hyperbolic secant) 131.F second (unit of an angle) 139.0 secondary cohomology operation, stable 64.C 2100 secondary components (of a homogeneous space) 110.A secondary composition 202.R secondary obstruction 305.0 secondary parameters 110.A second axiom, Tietze's 425.Q second barycentric derived neighborhood 65.C second boundary value problem (for harmonic functions) 193.F (of partial differential equations of elliptic type) 323.F second category, set of 425.N second classification theorem (in the theory of obstructions) 305.C second complementary law (of Legendre symbols) 297.1 second countability axiom 425.P second Cousin problem 21.K second definition (of an algebraic K-group) 237.1 second difference 104.A second extension theorem (in the theory of obstructions) 305.C second factor (of a class number) 14.L second fundamental form (of an immersIOn of a manifold) 111.G 365.C, App. A, Table 4.1 second fundamental quantities (of a surface) 111.H second fundamental tensor 417.F second fundamental theorem (in Morse theory) 279.0 second homotopy theorem 305.C second incompleteness theorem 185.C second isomorphism theorem (on topological groups) 423.1 second kind (Abelian differential of) I1.C (Abelian integral of) 11.C (Fuchsian grOup of) 122.C (integral equations of Fredholm type of) 217.A perfect number of 297.0 Stirling number of 66.0 second law of cosines 432.A, App. A, Table 2.n second law of thermodynamics 419.A second mean value theorem (for the D-integral) 100.0 (for the Riemann integral) 216.B (for the Stieltjes integral) 94.C second-order asymptotic efficiency 399.0 second-order design 102.M second-order efficiency 399.0 second-order predicate 411.K second-order predicate logic 411.K second quantization 377 second separation axiom 425.Q second variation formula 178.A section (of a finite group) 362.1 (of a sheaf space) 383.C circular 350.F conic 78 crOss- 126.C 286.H cross (of a fiber bundle) 147.L cross (of a fiber space) 148.0 crOSS-, for a closed orbit 126.0 differential CrOSs 375.A 386.B local 126.E local CrOss (of a fiber bundle) 147.E n- (in a cell complex) 70.0 normal (of a surface) 41O.B r- (of a Euclidean complex) 70.B 
2101 r- (of a simplicial complex) 70.C scattering cross 375.A set of (of a sheaf) 383.C total (elastic) cross 386.B zero- (of a block bundle) 147.Q sectional curvature 364.0 holomorphic 364.0 section graph 186.C sectors, superselection 150.E 351.K secular equation 55.B 269.F secular perturbation 55.B sedenion 29.0 segment 155.B 178.H (in affine geometry) 7.0 (in an ordered set) 311.B oriented 442.A Seidel method, Oauss- 302.C Seifert conjecture 126.N 154.0 Seifert matrix 235.C Seifert surface 235.A Selberg sieve 123.E Selberg theorem, Evans- 48.E 338.H Selberg zeta function 450. T selection, measurable 443.1 selection, model 401.0 selection function 354.E selection parameter 396.F "election rule 35l.H selection statistic 396.F self-adjoint (linear homogeneous ordinary differential equation) 315.B essentially 25l.E 390.1 self-adjoint differential equation 252.K self-adjoint differential operator, formally 112.1 self-adjoint operator 251.E 390.F self-adjoint system of differential equations 252.K self-commutator 251.K self-dual (linear space) 256.H self-dual (regular cone) 384.E self-dual, anti- (G-connection) 80.Q self-excited vibration 318.B self-information 213.B self-intersection number 15.C self-loop 186.B self-polar tetrahedron 350.C self-polar triangle 78.1 self-reciprocal function 220.B semicontinuity, lower (of length) 246.A semi continuous 84.C (mapping in a topological linear space) 153.0 lower 84.C upper 84.C semicontinuous function 84.C semi continuous partition, upper 425.L semi definite Hermitian form 348.F semi definite kernel, positive 217.H semidefinite matrix, positive 269.1 semi definite operator, positive 251.E semidefinite quadratic form, positive or negative 348.C semi direct product (of two groups) 190.N semidiscrete approximation 304.B semi exact (differential on an open Riemann surface) 367.1 semi-explicit 303.0 semi finite (von Neumann algebra) 308.E semifinite (weight on a von Neumann algebra) 308.0 semiflow 126.B Subject Index Semisimple of class C' 126.B continuous 126.B discrete 126.B discrete, of class C' 126.B semigroup 88.E 190.P 409.A (of a Markov process) 261.B of class (Co) 378.B differentiable 378.F distribution 378.F dual 378.F equicontinuous, of class (Co) 378.B free 161.A holomorphic 378.0 locallyequicontinuous 378.F nonlinear 378.F nonlinear, of operators 286,X of operators 378 order-preserving 286.Y unitary 409.C semigroup algebra 29.C large 29.C semi group bialgebra 203.0 semihereditary ring 200.K left 2oo.K right 2oo.K semi-implicit 303.0 semi-integral, left 68.N semi-integral, right 68.N semi-intuitionism 156.C semi-invariant 226.A G- 226.A of a probability distribution 341.C semilattice 243.A lower 243.A upper 243.A semilinear (partial differential equations of elliptic type) 323.0 semilinear mapping 256.P 277.L semilinear transformation 256.P semilocal ring 284.0 analytically unramified 284.0 Noetherian 284.0 quasi- 284.0 semilogarithmic paper 19.F semi martingale 262.E 406.B continuous 406.B semi martingale decomposition 406.B seminorm (on a topological linear space) 424.F semiorbit 126.0 negative 126.0 positive 126.0 semiordered set 311.A semiordering 311.A semipolar set 261.0 semiprimary ring 368.H semiprime differential ideal (of a differential ring) 113 semiprime ideal (of a differential ring) 113 semiprimitive ring 368.H semireductive (action defined by a rational representation) 226.B semi reflexive (locally convex space) 424.0 semiregular point (of a surface in E3) 11l.J semiregular transformation (of a sequence) 379.L semisimple (algebraic group) 13.1 (Banach algebra) 36.0 (Jordan algebra) 231.B (Lie algebra) 248.E (Lie group) 249.0 
Subject Index Semisimple algebra (matrix) 269.0 semisimple algebra 29.A semisimple A-module 277.H semisimple component (of a linear transformation) 269.L semisimple linear representation 362.C semisimple linear transformation 269.L semisimple part of an algebraic group 13.E of a nonsingular matrix 13.E semisimple ring 368.0 semisimplicial complex 70.E semisimplicity, Cartan's criterion of 248.F semis table (coherent sheaf) 16.Y semistable distribution 341.G semistable reduction theorem 16.Z semistable vector bundle (algebraic) 16.Y semivariation 443.0 sensitive grammar, context- 31.0 sensory test 346.B separable (function in nomograms) 19.0 (polynomial) 337.0 (rational mapping) 16.1 (stochastic process) 407.A (topological space) 425.P perfectly 425. P separable algebra 29.F,K 2oo.L central 29.K separable element (of a field) 149.H separable extension (of a field) 149.H,K maximal (of a field) 149.H separable metric space 273. E separably generated extension (of a field) 149.K separated (formal scheme) 16,X (morphism) 16.0 separated convex sets, strongly 89.A separated kernel 217.F separated scheme 16.0 separated space 425.Q separated S-scheme 16.0 separated topological group 423.B separated type App. A, Table 14.1 separated uniform space 436.C separated variable type App. A, Table 15.II separately continuous (bilinear mapping) 424.Q separating family 207.C separating transcendence basis (of a field extension) 149.K separation axioms of (in set theory) 33.B of variables 322.C separation axioms 425.Q the first 425.Q the fourth 425.Q the second 425.Q the third 425.Q Tikhonov 425.Q separation cochain 305.B separation co cycle 305.B separation principle 405.C separation theorem (on convex sets) 89.A separator 186.F sequence(s) 165.0 admissible (in Steenrod algebra) 64.B, App. A, Table 6.III asymptotic 30.A of Bernoulli trials 396.B 2102 Blaschke 43.F Cauchy (in a-adic topology) 284.B Cauchy (in a metric space) 273.1 Cauchy (of rational numbers) 294.E Cauchy (of real numbers) 355.B Cauchy (in a uniform space) 436.0 cohomology exact 201.L cohomology spectral 200.1 connected, of functors 200.1 convergent (of real numbers) 87.B 355.B divergent (of real numbers) 87.B double 379.E exact (of A-homomorphisms of A-modules) 277.E exact, of cohomology 200.F exact, of Ext 200.0 exact, of homology 2oo.C exact, of Tor 200.0 of factor groups (of a normal chain) 190.0 Farey 4.B Fibonacci 295.A finite 165.0 of functions 165.B,0 fundamental (in a metric space) 273.1 fundamental (of rational numbers) 294.E fundamental (of real numbers) 355.B fundamental (in a uniform space) 436.0 fundamental, of cross cuts (in a simply connected domain) 333.B fundamental exact (on cOhomology of groups) 2oo.M Oysin exact (of a fiber space) 148.E Hodge spectral 16.U homology exact (of a fiber space) 148.E homology exact (for simplicial complexes) 201.L homotopy exact 202.L homotopy exact (of a fiber space) 148.0 homotopy exact (of a triad) 202.M homotopy exact (of a triple) 202.L independent, of partitions 136.E infinite 165.0 interpolating 43.F Jordan-Holder (in a group) 190.0 linear recurrent 295.A Mayer-Vietoris exact 201.C minimizing 46.E monotone (of real numbers) 87.B monotonically decreasing (of real numbers) 87.B monotonically increasing (of real numbers) 87.B normal (of open coverings) 425.R null (in a-adic topology) 284.B of numbers 165.0 (o)-convergent 87.L (o)-star convergent 87.L order-convergent (in a vector lattice) 310.C oscillating (of real numbers) 87.0 of pOints 165.0 pointwise convergent 435.B posi tive defini te 192.B of positive type 192.B Puppe exact 202.0 random 354.E rapidly decreasing 168.B recurrent, of order r 295.A reduced homology exact 201.F regular (of Lebesgue measurable sets) 380.0 regular spectral 200.1 
2103 relative Mayer- Vietoris exact 201.L (R,S)-exact (of modules) 2oo.K of sets 165.0 short exact 200.1 simply convergent 435.B slowly increasing 168.B spectral 200.1 spectral (of singular cohomology of a fiber space) 148.E standard 4oo.K symbol (in the theory of micro differential operators) 274.F of Ulm factors (of an Abelian p-group) 2.0 uniformly convergent 435.A Wang exact (of a fiber space) 148.E sequencing problem, machine 376 sequential decision function 398.F sequential decision problem 398.F sequential decision rule 398.F sequentially compact (space) 425.S sequential probability ratio test 4oo.L sequential sampling inspection 404.C sequential space 425.CC sequential test 400.L serial correlation coefficient 397.N 421.B serial cross correlation coefficient 397.N series 379, App. A, Table 10 absolutely convergent 379.C absolutely convergent double 379.E allied (of a trigonometric series) 159.A alternating 379.C ascending central (of a Lie algebra) 248.C asymptotic 30 asymptotic power 30.A binomial App. A, Table 10.IV binomial coefficient 121.E characteristic (in a group) 190.0 commutatively convergent 379.C complementary (of unitary representations of a complex semisimple Lie group) 437.W complementary degenerate (of unitary represen- tations of a complex semisimple Lie group) 437.W composition (in a group) 190.0 composition (in a lattice) 243.F composition factor (of a composition series in a group) 190.0 conditionally convergent 379.C conditionally convergent double 379.E conjugate (of a trigonometric series) 159.A convergent 379.A convergent double 379.E convergent power 370.B convergent power, ring 370.B degenerate (of unitary representations of a complex semisimple Lie group) 437.W derived (of Lie algebra) 248.C descending central (of a Lie algebra) 248.C Oini 39.0 irichlet 121 Oirichlet, of the type {A"} 121.A discrete (of unitary representations of a semi- simple Lie group) 437.X divergent 379.A divergent double 379.E double 379.E Eisenstein 32.C Eisenstein-Poincare 32.F exponential 131.0 factorial 104.F 121.E Subject Index Serre "(j -theory field of formal power, in one variable 370.A finite 379.A, App. A, Table 10.1 formal power 370.A formal power, field in one variable 370.A formal power, ring 370.A Fourier 159 197.C, App. A, Table II.I Fourier (of an almost periodic function) 18.B Fourier (of a distribution) 125.P Fourier-Bessel 39.0 Fourier cosine App. A, Table ll.I Fourier sine App. A, Table 11.1 Oauss 206.A generalized Eisenstein 450.T generalized SchlOmilch 39.0 generalized trigonometric 18.B geometric 379.B, App. A, Table 10.1 Heine 206.C hypergeometric 206.A infinite 379.A, App. A, Table 10.m iterated, by columns (of a double series) 379.E iterated, by rows (of a double series) 379.E Kapteyn 39.0, App. A, Table 19.m Lambert 339.C Laurent 339.A logarithmic 131.0 lower central (of a group) 190.1 majorant 316.0 majorant (of a sequence of functions) 435.A Neumann 217.0 of nonnegative terms 379.B ordinary Oirichlet 121.A orthogonal (of functions) 317.A oscillating 379.A n- (of a group) 151.F Poincare 32.B of positive terms 379.B power 21.B 339 370.A, App. A, Table 10.lV power (in a complete ring) 370.A power, with center at the point at infinity 339.A power, ring 370.A principal (in an Q-group) 190.0 principal (of unitary representations of a com- plex semisimple Lie group) 258.C 437.W principal (of unitary representations of a real semisimple Lie group) 258.C 437'x principal H- 437.X properly divergent 379.A Puiseux 339.A repeated, by columns (of a double series) 379.E repeated, by rows (of a double series) 379.E ring of convergent power 370.B ring of formal power 370.A ring of power 370.A Schl6milch 39.0, App. A, Table 19.m simple 379.E singular 4.0 supplementary 258.C Taylor 339.A termwise integrable 216.B theta 348.L theta-Fuchsian, of Poincare 32.B time 397.A 421.A trigonometric 159.A unconditionally convergent 379.C uniformly absolutely convergent 435.A upper central (of a group) 190.J Serre conjecture 369.F Serre -theory 202.N 
Subject Index Serre duality theorem Serre duality theorem (on complex manifolds) n.E (on projective varieties) 16.E Serre formulas, Frenet- (on curves) 111.0, App. A, Table 4.1 Serre theorem (for ample line bundles) 16.E sesquilinear form (on a linear space) 256.Q (on a product of two linear spaces) 256.Q matrix of 256.Q nondegenerate 256.Q set(s) 381 A- 22.A 409.A absolutely convex (in a linear topological space) 424.£ ct-limit 126.0 analytic 22.A,I analytic (in the theory of analytic spaces) 23.B analytically thin (in an analytic space) 23.0 of analyticity 192.N analytic wave front 274.0 of anti symmetry 164.E arbitrary 381.0 asymptotic 62.A asymptotic ratio 308.1 axiom of power 33.B 381.0 B" 22.0 'B-measurable 270.C Baire 126.H 270.C bargaining 173.0 basic (for an Axiom A flow) 126.1 basic (of a structure) 409.B basic open 425.F bifurcation 51.F 418.F border 425.N Borel (in a Euclidean space) 270.C Borel (in a topological space) 270.C Borel in the strict sense 270.C boundary 425.N boundary cluster 62.A bounded (in an affine space) 7.0 bounded (in a locally convex space) 424.F bounded (in a metric space) 273.B C" 22.0 CA 22.A Cantor 79.0 capacity of 260.0 catastrophe 51.F category of 52.B chain recurrent 126.E characteristic (of an algebraic family on a generic component) 15.F characteristic (of a partial differential operator) 320.B choice 34.A closed 425.B cluster 62.A coanalytic 22.A compact (in a metric space) 273.F compact (in a topological space) 425.S complemen tary 381.B complementary analytic 22.A complete 241.B complete orthonormal (of a Hilbert space) 197.C connected 79.A constraint (of a minimization problem) 292.A convex 7.0 89 countably equivalent (under a nonsingular bimeasurable transformation) 136.C 2104 curvilinear cluster 62.C cylinder 270.H of degeneracy (of a holomorphic mapping between analytic spaces) 23.C d 22.0 dense 425.N dependent 66.0 derived 425.0 determining (of a domain in CO) 2l.C difference (of blocks) 102.E directed 311.0 discrete 425.0 disjoint 381.B dominating 186.1 empty (0) 381.A externally stable 186.1 equipollent 49.A equipotent 49.A Fa 270.C factor (of a crossed product) 29.0 factor (of an extension of groups) 190.N factor (of a projective representation) 362.1 family of 165.0 381.B,0 family of (indexed by A) 381.0 final (of a correspondence) 358.B final (of a linear operator) 251.E finite 49.F 3R1.A finitely equivalent (under a nonsingular bimeasurable transformation) 136.C of the first category 425.N of the first kind 319.B first negative prolongationallimit 126.0 first positive prolongationallimit 126.0 function 380.A function-theoretic null 169.A fundamental (of a transformation group) 122.B fundamental open (of a transformation group) 122.B Go 270.C general Cantor 79.0 generalized peak 164.0 (general) recursive 97 germ of an analytic 23.B homotopy 202.B idempotent (of a ring) 368.B increasing directed 308.A independent 66.0 186.1 index 102.L index (of a family) 165.0 index (of a family of elements) 381.0 indexing (of a family of elements) 381.0 infinite 49.F 381.A information 173.B initial (of a correspondence) 358.B initial (of a linear operator) 251.E interior cluster 62.A internally stable 186.1 interpolating (for a function algebra) 164.0 Kronecker 192.R lattice of 243 E lattice-ordered 243.A Lebesgue measurable 270.0 Lebesgue measurable (of R") 270.G level 279.0 limit 234.A locally closed 425.1 M- 159.1 meager 425.N minimal 126.E 
2105 Ji-measurable 270.0 J1'null 270.0 of multiplicity 159.J n-cylinder 270.H nilpotent (of a ring) 368.B nodal 391.H nonmeager 425.N nonsaddle 120.E nonwandering 126.E nowhere dense 425.N null (in a measure space) 270.0310.1 null, of class Nil 169.E null (0) 381.A w-limit 126.0 open 425.B ordered  ordered set ordinate 221.E orthogonal (of functions) 317.A orthogonal (of a Hilbert space) 197.C orthogonal (of a ring) 368.B orthonormal (offunctions) 317.A orthonormal (of a Hilbert space) 197.C p" 22.0 P-convex (for a differential operator) 112.C peak 164.0 perfect 425.0 nl 22.A no' 22.0 point 381.B of points of indeterminacy (of a proper meromorphic mapping) 23.0 polar (in potential theory) 261.0 338.H power 381.B precompact (in a metric space) 273.B principal analytic 23.B projective, of class n 22.0 purely d-dimensional analytic 23.B of quasi-analytic functions 58.F quotient (with respect to an equivalence relation) 135.B ratio 136.F recurrent 260.E recursive 356.0 recursively enumerable 356.0 regularly convex 89.0 relative closed 425.1 relatively compact 425.S relatively compact (in a metric space) 273.F relatively open 425.1 removable (for a family of functions) 169.C residual 126.H 425.N resolvent (of a closed operator) 251.F resolvent (of a linear operator) 390.A p- 308.1 S- 308.1 saddle 126.E scattered 425.0 of the second category 425.N of the second kind 319.B semipolar 261.0 Sidon 192.R 194.R sieved 22.B 1:} 22.A E"' 22.0 singularity (of a proper meromorphic mapping) 23.0 stable 173.0 stable, externally 186.1 stable, internally 186.1 Subject Index Sheaf (sheaves) standard 22.1 strongly P-convex 112.C strongly separated convex 89.A system of closed 425.B system of open 425.B ternary 79.0 thin (in potential theory) 261.0 totally bounded (in a metric space) 273.B totally bounded (in a uniform space) 436.H V- 159.J of uniqueness 159.1 universal (for the projective sets of class n) 22.E universal (of set theory) 381.B wandering (under a measurable transformation) 136.C wave front 274.B 345.A wave front, analytic 274.0 weakly wandering 136.C weakly wandering (under a group) 136.F well-ordered 311.C Z- 382.B Zariski closed 16.A Zariski dense 16,A Zariski open 16.A set function(s) 380 additive 380.C of bounded variation 380.B completely additive 380.C finitely additive 380.B monotone decreasing 380.B monotone increasing 380.B Ji-absolutely continuous additive 380.C Ji-singular additive 380.C seHheoretic formula 33.B seHheoretic topology 426 set theory 381.F axiomatic 33156.E Bernays-Oi.idel 33.A,C Boolean-valued 33.E classical descriptive 356.H effective descriptive 356.H general 33.B Oi.idel 33.C Zermelo 33.B Zermelo-Fraenkel 33.A,B Severi group, Neron- (of a surface) 15.0 (of a variety) 16.P Sewell inequality, Roepstorff-Araki- 402.0 sgnP(sign) 103.A shadow costs 292.C shadow price 255.B Shafarevich group, Tate- 118.0 Shafarevich reciprocity law 257.H shallow water wave 205.F shape pointed 382.A same 382.A shape category 382.A shape dominate 382.A shape function 223.0 shape group 382.C shape invariant(s) 382.C shape morphism 382.A shape theory 382 Shapiro-Lopatinskii condition 323.H Shapley value 173.0 sheaf (sheaves) 383 
Subject Index Sheaf space (in etale (Orothendieck) topology) 16.AA of Abelian groups 383.B analytic 72.E associated with a presheaf 383.C tech cohomology group with coefficient 383.F coherent, of rings 16.E coherent algebraic 16.E 72.F coherent analytic 72.E constructible 16.AA cohomology group with coefficient 383.E constant 383.0 derived 125.W flabby 383.E of germs of analytic functions 383.0 of germs of analytic mapping 383.0 of germs of continuous functions 383.0 of germs of differentiable sections of a vector bundle 383.0 of germs of differential forms of degree of r 383.0 of germs of functions of class C' 383.0 of germs of holomorphic functions (on an analytic manifold) 383.0 of germs of holomorphic functions (on an analytic set) 23.C of germs of holomorphic functions (on an analytic space) 23.C of germs of regular functions 16.B of germs of sections of a vector bundle 383.0 of groups 383.C of ideals of a divisor (of a complex manifold) 72.F invertible 16.E locally constructible (constant) 16.AA of @-modules 383.1 orientation 201.R pre- 383.A pre-, on a site 16.AA of rings 383.C scattered 383.E structure (of a prealgebraic variety) 16.C structure (of a ringed space) 383.H structure (of a variety) 16.B trivial 383.0 sheaf space 383.C shear, modules of elasticity in 271.0 shearing strain 271.0 shearing stress 271.0 shear viscosity, coefficient of 205.C sheet(s) hyperboloid of one 350.B hyperboloid of revolution of one 350.B hyperboloid of revolution of two 350.B hyperboloid of two 350.B mean number of (of a covering surface of a Riemann sphere) 272.1 number of (of an analytic covering space) 23.E number of (of a covering surface) 367.B sheeted, n- 367.B Shelah isomorphism theorem, Keisler- 276.E Shields-Zeller theorem, Brown- 43.C shift 251.0 associated with the stationary process 136.0 automorphism 126.1 Bernoulli 136.0 generalized Bernoulli 136.0 Markov 136.0 phase 375.E 386.B 2106 shift operator 223.C 251.0 306.C unilateral 390.1 shift transformation 136.0 Shilov boundary (of a domain) 21.0 (for a function algebra) 164.C (of a Siegel domain) 384.0 Shilov generalized function, Oel'fand- 125.S Shmul'yan theorem 424.V Eberlein- 37.0 Krein- 37.E 424.0 Shnirel'man theory, Lyusternik- 286.Q shock wave 205.B 446 shortening 186.E shortest-path problem 281.C shortest representation (of an ideal) 67.F short exact sequence 200.1 short international notation 92.E short range 375.B Shrikhande square 102.K shrinking (a space to a point) 202.E shunt left 115.B right 115.B SI (international system of units) 414.A side 155.B,F (of an angle) 139.0 155.B (of a complete quadrangle) 343.C (on a line) 155.B (on a plane) 155.B (of a point with respect to a hyperplane) 7.0 (of a polygon) 155.F (of a spherical triangle) 432.B side cone 258.A Sidon set 192.R Siegel domain(s) 384 of the first kind 384.A generated 384.F irreducible 384.E of the second kind 384.A of the third kind 384.A Siegel mean value theorem 182.E Siegel modular form of weight k (or of dimension - k) 32.F Siegel modular function of degree n 32.F Siegel modular group of degree n 32.F Siegel space of degree n 32.F Siegel theorem (on Oiophantine equations) 118.0 (on positive definite forms) 348.K Siegel upper half-space of degree n 32.F Siegel zero 123.0 Siegel zeta function of indefinite quadratic forms 450.K sieve l6.AA 22.B Eratosthenes 297.B large 123.E large, method 123.0 Selberg 123.E sieved set 22.B sieve method 4.A large 123.0 sign (of a permutation) 100.A signal process 405.F signature (of a Hermitian form) 348.F (of an irreducible representation of GL(V)) 60.0 (of a knot) 235.C (of a manifold) 56.0 
2107 (of a quadratic form) 348.C Hirzebruch, theorem 72.K signed Lebesgue-Stieltjes measure l66.C signed measure 380.C signed rank test 371.B signed rank test, Wilcoxon 371.B sign test 371.B similar (central simple algebra) 29.E (linear representation) 362.C (matrix representation of a semilinear mapping) 256.P (permutation representation) 362.B (projective representation) 362.1 (square matrices) 269.0 simiiar central simple algebras 29.E similar correspondence (between surfaces) 111.1 similarity (of an affine space) 7.E Prandtl-Olauert law of 205.0 Reynolds law of 205.C von Karman transonic 205.0 similarly isomorphic (ordered fields) 149.N similar mathematical systems 409.B similar test 400.0 similar unitary representations 437.A simple (A-module) 277.H (Abelian variety) 3.B (algebraic group) 13.L (eigenvalue) 390.A,B (function) 438.A (Lie algebra) 248.E (Lie group) 249.0 (linear representation) 362.C (polygon) 155.F (subcoalgebra) 203.F absolutely (algebraic group) 13.L algebraically (eigenvalue) 390.B almost (algebraic group) 13.L geometrically (eigenvalue) 390.A k- (algebraic group) 13.0 k-almost (algebraic group) 13.0 simple algebra 29.A central 29.E normal 29.E zeta function of 27.F simple arc 93.8 simple Bravais lattice 92.E simple character (of an irreducible representation) 362.E simple closed curve 93.B simple component (of a semisimple ring) 368.0 simple continued fraction 85.A simple convergence, abscissa of (of a Oirichlet series) 121.B simple distribution, potential of 338.A simple extension (of a field) 149.0 simple function 221.B 443.B simple group 190.C linear 151.1 Tits 151.1 simple harmonic motion 318.B simple holonomic system 274.H simple homotopy equivalence 65.C simple homotopy equivalent 65.C simple homotopy theorem 65.C simple hypothesis 4oo.A simple Lie algebra 248.E classical compact real 248.T Subject Index Simplicial homology group classical complex 248.S exceptional compact real 248.T exceptional complex 248.S simple Lie group 249.0 classical compact 249.L classical complex 249.M exceptional compact 249.L exceptional complex 249.M simple loss function 398.A simple model 403.F simple pair (of an H-space and an H-subspace) 202.L n- (of topological spaces) 202.L simple path 186.F simple point (on an algebraic variety) 16.F (of an analytic set) 23.B 418.A simple ring 368.0 quasi- 368.E simple root (of an algebraic equation) 10.B (in a root system) 13.J (of a semisimple Lie algebra) 248.M simple series 379.E simple spectrum 390.0 simplest alternating polynomial simplest Chebyshev q-function simplest orthogonal polynomial simple type theory 411.K simplex (in an affine space) 7.0 (of a complex) 13.R (in a locally convex space) 424.U (in a polyhedron of a simplicial complex) 70.C (in a simplicial complex) 70.C (of a triangulation) 70.C degenerate (in a semisimplicial complex) 70.0 n- (in a Euclidean simplicial complex) 70.B n- (in a semisimplicial complex) 70.E n- (in a simplicial complex) 70.C open (in an affine space) 7.0 open (in the polyhedron of a simplicial complex) 70.C ordered (in a semisimplicial complex) 70.E ordered (in a simplicial complex) 70.C oriented q- 201.C oriented singular r-, of class Coo 105.T singular n- (in a topological space) 70.E simplex method 255.C two-phase 255.C simplex tableau 255.C simplicial approximation (to a continuous mapping) 70.C simplicial approximation theorem 70.C simplicial chain complex, oriented 201.C simplicial complex(es) 65.A 70.C abstract 70.C countable 70.C Euclidean 70.B finite 70.C isomorphic 70.C locally countable 70.C locally finite 70.C ordered 70.C simplicial decomposition (of a topological space) 70.C simplicial division 65.A simplicial homology group 201.0 337.1 19.0 19.0 
Subject Index Simplicial mapping simplicial mapping (map) 70.C (between polyhedra) 70.C (relative to triangulations) 70.C simplicial pair 20l.L simply connected (space) 79.C 170 simply connected covering Lie group (of a Lie algebra) 249.C simply connected group (isogenous to an algebraic group) l3.N simply convergent sequence 435.B simply elliptic (singularity) 4l8.C simply invariant (subspace) l64.H simply periodic function l34.E simply transitive (G-set) 362.B Simpson formula, Milne- 303.E Simpson t rule 299.A Simpson,} rule 299.A simulation 307.C 385 analog 385.A in the narrow sense 385.A system 385.A simultaneous distribution 342.C simultaneous equations 10.A sin (sine) l3l.E sin -1 l3l.E sine(s) 432.A hyperbolic l3l.F integral 167.0 laws of 432.A, App. A, Tables 2.II 2.m laws of (on spherical trigonometry) 432.B sine curve 93.H Sine-Oordon equation 387.A sine integral 167.0, App. A, Table 19.II sines and cosines, law of App. A, Table 2.II1 sine transform l60.C, App. A, Table I 1. II sine wave 446 Singer fixed point theorem, Atiyah- 153.C Singer index theorem Atiyah- 237.H equivariant Atiyah- 237.H single-address instruction 75.C single-commodity flow problem 281.F single integral theorem, Fourier l60.B single layer, potential of a 338.A single-objective model 307.C single sampling inspection 404.C single-valued function 165.8 singular (distribution) 374.C (element of a connected Lie group) 249.P (element of a real Lie algebra) 248.B (element with respect to a quadratic form) 348.E (Oalton-Watson process) 44.C (harmonic function) 193.0 (mapping) 208.B (ordinal number) 270.1 (set function) 380.C essentially (with respect to an analytic set) 2l.M of the h th species 343.0,E Ji- 380.C relative, homology group 20l.L singular cardinal problem 33.F singular chain complex (of a topological space) 201.E singular cochain complex 201.H singular cohomology group 201.H singular cohomology ring 201.1 singular complex (of a topological space) 70.E 2108 singular fiber 72.K singular homology group 201.E,0,L,R integral 201.E singular initial value problem (of a partial differen- tial equation of mixed type) 326.C singular inner function 43.F singular integral 217.1 singular integral equation 217.1 singular integral manifold (of a differential ideal) 428.E singular integral operator, Calderon-Zygmund 2l7.J 251.0 singularity (singularities) 51.C 198.M algebraic 198.M of an analytic function 198.M cusp 4i8.C direct transcendental (of an analytic function in the wider sense) 198.P elliptic 4l8.C essential (ofa complex function) 198.0 fixed (of an algebraic differential equation) 288.A indirect transcendental (of an analytic function in the wider sense) 198.P isolated (of an analytic function) 198.0 isolated (of a complex function) 198.M logarithmic (of an analytic function) 198.M logarithmic (of an analytic function in the wider sense) 198.P movable (of an algebraic differential equation) 288.A ordinary (of an analytic function in the wider sense) 198.P principle of condensation of 37.H propagation of 325.M quotient 418.C rational 418.C regular (of a coherent @-module) 274.H removable (of a complex function) 198.0 removable (of a harmonic function) 193.L resolution of 16.L 23.0 4l8.B space of 390.E theory of 418 transcendental (of an analytic function in the wider sense) 198.P two-dimensional 4l8.C singularity set (of a proper meromorphic mapping) 23.0 singularity spectrum (ofa hyperfunction) 125.CC 274.E singularity theorem (in physics) 359.F singular kernel 217.J singular locus (of a variety) 16.F singular n-simplex (in a topological space) 70.E singular orbit 431.C singular part (of a Laurent expansion) 198.0 singular perturbation 289.E singular point (of an algebraic variety) 16.F (of an analytic set) 23.B 418.A (of a continuous vector field) 153.B (of a curve of class Ck) 93.0 (of a flow) 126.0 (of a linear difference equation) 104.0 (of a plane algebraic curve) 9.B (of a polyhedron) 65.B (of a quadratic hypersurface) 343.E (of a surface in £3) 111.1 (of a system of linear ordinary differential equations) 254.A 
2109 (of a system of ordinary differential equations) 126.0 289.A apparent (of a system of linear ordinary dif- ferential equations) 254.C hyperbolic 126.0 irregular (of a solution) 254.B irregular (of a system of linear ordinary dif- ferential equations) 254.B isolated 198.0 left (of a diffusion process) 115.B regular (ofa solution) 254.B regular (of a system of linear ordinary dif- ferential equations) 254.B right (of a diffusion process) 115.B singular projective transformation 343.0 of the hth species 343.0 q-cochain 201.H q-simplex 201.E singular quadric hypersurface of the hth species (in a projective space) 343.E singular r-chain of class Coo 105.T singular r-cochain of class Coo 105.T singular r-simplex of class Coo, oriented 105.T singular series 4.0 singular solution (of a differential ideal) 428.E (of a general partial differential equation) 320.C (of an ordinary differential equation) 313.A, App. A, Table 14.1 (of a partial differential equation) 320.C totally (with respect to a quadratic form) 348.E singular spectrum 345.A 390.E (of a hyperfunction) 125.CC 274.E singular subspace 343.0 singular subspace, totally 348.E singular support (of a distribution) 112.C (of a hyperfunction) 125.W singular value 302.A singular value decomposition (SVO) 302.E sinh (hyperbolic sine) 13l.F sink 126.0 281.C sinusoid 93.0 sinusoidal wave 446 site 16.AA etale 16.AA flat 16.AA presheaf on 16.AA Zariski 16.AA site percolation process 340.0 6j-symbol 353.B size (of a balanced array) 102.L (complexity of computation) 71.A (of a population) 397.B (of a random sample) 396.B (of a sample) 401.E (of a test) 4OO.A block 102.B sample 373.A step 303.B skeleton (of a domain in CO) 21.C r- (of a Euclidean complex) 70.B skew field 149.A 368.B skew-Hermitian form 256.Q skew-Hermitian matrix 269.V skew h-matrix 269.1 Subject Index Smooth boundary, domain with skewness 396.C 397.C coefficient of 341.H skew product (of measure-preserving transforma- tions) 136.0 skew surface lll.I skew-symmetric (multilinear mapping) 256.H skew-symmetric matrix 269.B skew-symmetric tensor 256.N Skitovich-Oarmois theorem 374.H Skolem-L6wenheim theorem 156.E Skolem paradox 156.E Skolem theorem on the impossibility of characteriz- ing the system of natural numbers by axioms 156.E slackness, Tucker theorem on complementary 255.B slack variable 255.A slant product (of a cochain and a chain) 201.K (of a cohomology class and a homology class) 201.K Slater constraint qualification 292.B slender body theorem 205.B slice knot 235.0 slice representation 431.C slice theorem, differentiable 431.C slicing theorem, watermelon- 125.00 slide rule 19.A sliding block code 213.E slit (of a plane domain) 333.A slit domain 333.A slit mapping extremal horizontal 367.0 extremal vertical 367.0 slope function 46.C slowly increasing Coo-function 125.0 slowly increasing distribution 125.N slowly increasing function in the sense of Oeny 338.P slowly increasing sequences 168.8 slow wave 259 Smale condition C, Palais- 279.E 286.Q Smale diffeomorphism, Morse- 126.1 Smale flow, Morse- 126.1 Smale theorem, Sard- 286.P Smale vector field, Morse- 126.1 small-displacement theory of elasticity 271.G smaller topology 425.H small inductive dimension (in d) 117.B small numbers, law of 250.B small sample 401.F small set of order V 436.0 smashing (a space to a point) 202.E smash product 202.F Smirnov test, Kolmogorov- 317.F Smirnov test statistic, Kolmogorov- 374.E Smirnov theorem 250.F Smith conjecture 235.E Smith convergence, Moore- 87.H Smith theorem 431.B smooth (function) 106.K (measure for a Riemann metric) 136.G (morphism of schemes) 16.F (point ofa variety) 16.F piecewise (curve) 364.A in the sense of A. Zygmund 168.B uniformly (normed linear space) 37.G smooth boundary, domain with (in a Coo-manifold) 105.U 
Subject Index Smooth characteristic class of foliations smooth characteristic class offoliations 154.G smoothing (of a combinatorial manifold) 114.C smoothing problem 114.C smooth invariant measure 126.1 smooth manifold 105.0 114.B smooth structure 114.B smooth variety 16.F sn 134.1, App. A, Table 16.111 Snapper polynomial 16.E Sobolev-Besov embedding theorem 168.B Sobolev inequality, Hardy-Littlewood- 224.E Sobolev space 168.B software 75.C sojourn time density 45.0 solenoidal (vector field) 442.0 solid geometry 181 solid harmonics 393.A solid n-sphere 140 solid sphere 140 topological 140 solitary wave 387.B soliton 387.B solution (of equations of neutral type) 163.H (of a functional-differential equation) 163.C (of an inequality) 211.A (of an ordinary differential equation) 313.A (of a partial differential equation) 320.A (of partial differential equations of first order) App. A, Table 15.11 (of partial differential equations of second order) App. A, Table 15.111 (of a system of differential equations) 313.B (of a system of linear equations) 269.M (of a system of partial differential equations) 428.B algebraic (of an algebraic equation) 10.0 asymptotic 325.L basic 255.A basic feasible 255.A basic optimal 255.A Bayes 398.B Bayes, in the wider sense 398.B of boundary value problems App. A, Table 15.VI of the Cauchy problem 325.0 classical (to Plateau's problem) 275.C complete (of partial differential equations) 320.C d'Alembert 325.0 Oouglas-Rado (to Plateau's problem) 275.C elementary (of a differential operator) 112.B elementary (of a linear partial differential operator) 320.H elementary (of partial differential equations of elliptic type) 323.B elementary (of a partial differential operator) App. A, Table 15.V equilateral triangle 420.B feasible (of a linear equation in linear program- ming) 264.A formal (for a system of ordinary differential equations) 289.C fundamental (of a Cauchy problem) 325.0 fundamental (of a differential operator) 112.B fundamental (of an evolution equation) 189.C fundamental (of a linear parabolic equation with boundary conditions) 327.F fundamental (of a linear partial differential operator) 320.H 2110 fundamental (of a partial differential equation of parabolic type) 327.0 fundamental (of partial differential equations of elliptic type) 323.B fundamental (of a partial differentia! operator with Coo-coefficients) 189.C fundamental system of (of a homogeneous linear ordinary differential equation) 252.B fundamental system of (of a homogeneous system of linear differential equations of first order) 252.H general (of a differential equation) 313.A general (of a general partial differential equa- tion) 320.C general (of a nonhomogeneous linear difference equation) 104.0 general (of partial differential equations) 320.C general (of a system of differential equations) 313.C general (of a system of partial differential equations) 428.B generalized Bayes 398.B genuine 323.G half-periodic (of the Hill equation) 268.E HilI's method of 268.B Hopf's weak 204.C inner 25.B Kirchhoff 325.0 to the martingale problem 115.C maximum 316.E minimax 398.B mmlmum 316.E Nash bargaining 173.C numerical (of algebraic equations) 301 numerical (of integral equations) 217.N numerical (oflinear equations) 302 numerical (of ordinary differential equations) 303 numerical (of partial differential equations) 304 optimal (of a linear programming problem) 255.A optimal (of a nonlinear programming problem) 292.A ordinary (of a differential ideal) 428.E outer 25.B particular (of a differential equation) 313.A particular (of partial differential equations) 320.C particular (for a system of differential equations) 313.C path wise uniqueness of 406.0 periodic (of the HilI eq uation) 268.E Perron-Brelot (of the Oirichlet problem) 120.C Perron- Wiener-Brelot (of the irichlet problem) 120.C Poisson 325.0 primary (of a homogeneous partial differential equation) 320.E primitive (of a partial differential equation) 320.E principal 104.B by quadrature App. A, Table 14.1 quasiperiodic (of the Hill equation) 268.B by radicals (of an algebraic equation) 10.0 regular (of a differential ideal) 428.E singular App. A, Table 14.1 singular (of a differential ideal) 428.E 
2111 singular (of a general partial differentia] equa- tion) 320.C singular (of an ordinary differential equation) 313.A, App. A, Table 14.1 singular (of partial differential equations) 320.C stable (of the Hill equation) 268.E straight line 420.B strong (of Navier-Stokes equation) 204.C strong (of stochastic differential equations) 406.0 system of fundamental (of a system of linear homogeneous equations) 269.M trivial (of a system of linear homogeneous equations) 269.M unique strong 406.0 uniqueness theorem of (of systems oflinear differential equations of the first order) 316.0,0 unstable (of the Hill equation) 268.E von Neumann-Morgenstern 173.0 weak 204.C 323.0 378.1 solution curve (of ordinary differential equations) 316.A solution operator 163.E solvability, Cartan's criterion of 248.F solvable (ideal of a Lie algebra) 248.C (Lie algebra) 248.C (Lie group) 249.0 (by a Turing machine) 71.B by radicals 172.H solvable algebra 231.A solvable algebraic group 13.F k- 13.F solvable group 190.1 finite IS 1.0 generalized 190.K n- ISl.F solve (a conditional inequality) 211.A (by means of a Turing machine) 71.E (an ordinary differential equation) 313.A (a partial differential equation) 320.A (a system of algebraic equations) 10.A (a triangle) 432.A Sommerfeld formula App. A, Table 19.m Kneser- App. A, Table 19.'" Sommerfeld radiation condition 188.0 Sonine formula, Weber- App. A, Table 19.m Sonine polynomials 317.0, App. A, Table 20.VI Sonine-Schafheitlin formula App. A, Table 19.m SOR (successive overrelaxation) 302.C sorting 96.C soudure 80.N sound propagation, equation of 325.A source 126.0 281.C (of a jet) 105,X autoregressive Oaussian 213.E ergodic information 213.C information 213.A stationary 213.C without (vector field) 442.0 source branch 282.C SOurce coding theorem 213.0 with a fidelity criterion 213.E noiseless 213.0 sOurce coding theory 213.A southern hemisphere 140 Subject Index Space(s) south pole 74.0 140 space(s) 381.B of absolute continuity 390.£ absolutely closed 425.U abstract 381.B abstract L 310.0 abstract Lp 310.0 abstract M 310.0 action 398.A adjoint (of a topological linear space) 424.0 affine 7.A affine locally symmetric 80.1 affine symmetric 80.J o- 42S.Y algebraic 16.W algebraic fiber 72.1 analytic 23.C analytic, in the sense of Behnke and Stein 23.E analytically uniform 125.S analytic covering 23.E analytic measurable 270.C arcwise connected 79.B attaching 202.E Baire 42S.N Baire zero-dimensional 273.B Banach 37.A,B Banach analytic 23.0 base (of a fiber bundle) 147.B base (of a fiber space) 148.B base (of a Riemann surface) 367.A base for 42S.F basic (of a probability space) 342.B Besov 168.B bicompact 408.S biprojective 343.H Boolean 42.0 Borel 270.C boundary 112.E bundle (of a fiber bundle) 147.B C-analytic 23.E C-covering 23.E Cartan 1 S2.C Cartesian 140 tech-complete 436.1 classifying (of a topological group) 147.0,H closed half- 7.0 of closed paths 202.C co-eche]on 168.B collectionwise Hausdorff 425.AA collectionwise normal 425.AA comb 79.A compact 425.S compact metric 273.F complete 436.G completely normal 425.Q completely regular 425.Q complete measure 270.0 complete product measure 270.H complete uniform 436.0 complex, form 365.L complex Hilbert 197.B complex interpolation 224.B complexity 71.A complex projective 343.0 concircularly flat App. A, Table 4.11 configuration 126.L 402.G conformal 76.A conformally flat App. A, Table 4.II conjugate (of a normed linear space) 37.0 
Subject Index Space(s) conjugate (of a topological linear space) 424.0 connected 79.A of constant curvature 364.0, App. A, Table 4.II of continuous mapping 435.0 contractible 79.C control (in catastrophe theory) 51.B countable paracompact 425.Y countably compact 425.S countably Hilbertian 424.W countably normed 424.W covering 91.A crystallographic, group 92.A decision 398.A of decision functions 398.A de Sitter 359.0 developable 425.AA (OF)- 424.P Oieudonne complete topological 435.1 Oirichlet 338.Q discrete metric 273.B discrete topological 425.C Oouady 23.G dual (of a C*-algebra) 36.G dual (of a linear space) 256.G dual (of a locally compact group) 437.1 dual (of a normed linear space) 37.0 dual (of a projective space) 343.B dual (of a topological linear space) 424.0 8- 193.N echelon 168.B eigen- 269.L 390.A Eilenberg-Maclane 70.F Einstein 364.0, App. A, Table 4.II of elementary events 342.B elliptic 285.C error 403.E estimation 403.E Euclidean 140 external (in static model in catastrophe theory) 51.B (F)- 424.1 fiber 72.1 148.B finite type power series 168.B Finsler 152.A Fock (antisymmetric) 377.A Fock (symmetric) 377.A Frechet 37.0424.1 425.CC Frechet, in the sense of Bourbaki 37.0424.1 Frechet L- 87.K Frechet-Uryson 425.CC fully normal 425,X function 168.A 435.0 fundamental 125.S G- 178.H 431.A general analytic 23.0 generalized topological 425.0 generating (of a quadric hypersurface) 343.E globally symmetric Riemannian 412.A Green 193.N group 92.A H- 203.0 Haar l42.B half- (of an affine space) 7.0 Hardy 168.B Hausdorff 425.Q Hausdorff uniform 436.C H-closed 425.U hereditarily normal 425.Q Hermitian hyperbolic 412.0 2112 Hilbert 173.B 197.B Hilbert, adjoint 251.E Hilbert, exponential 377.0 Holder 168.B holomorphically complete 23.F homogeneous  homogeneous space hyperbolic 285.C 412.H identification (by a partition) 425.L indiscrete pseudometric 273.B inductive limit 21O.C infinite-dimensional 117.B infinite lens 91.C infinite type power series 168.B at infinity (in affine geometry) 7.B inner product 442.B internal (in static model in catastrophe theory) 51.B interpolation 224.A of irrational numbers 22.A irreducible symmetric Hermitian 412.A isometric 273.B John-Nirenberg (=BMO) 168.B k- 425.CC k' - 425.CC K-complete analytic 23.F Kawaguchi 152.C Kolmogorov 425.Q Kothe 168.B Kuranishi 72.0 Kuratowski 425.Q L- 87.K L*- 87.K Lashnev 425.CC lattice ordered linear 31O.B Lebesgue (= Lp(n)) 168.B Lebesgue measure, with (0'-) finite measure 136.A left coset (of a topological group) 423.E left projective 343.H left quotient (of a topological group) 423.E lens 91.C (LF)- 424.W of line elements of higher order 152.C linear  linear space linear topological 424.A Lindelof 425.S Lipschitz 168.B locally arc wise connected 79.B locally compact 425.V locally connected 79.A locally contractible 79.C 204.C locally convex Frechet 424.1 locally Euclidean 425.V locally n-connected 79.C locally w-connected 79.C locally symmetric 364.0 locally symmetric Riemannian 412.A locally totally bounded uniform 436.H locally trivial fiber 148.B local moduli, of a compact complex manifold 72.0 local ringed 383.H Loeb 293.0 loop 202.C Lorentz 168.B Luzin 22.1 422.CC M- 425.Y (M)- 424.0 Mackey 424.N mapping 202.C 435.0 
2113 maximal ideal (of a commutative Banach space) 36.E measurable 270.C measure 270.0 metric  metric space metric vector 256.H metrizable topological 273.K metrizable uniform 436.F Minkowski 258.A moduli 16.W 72.0 MOlshezon 16.W momentum phase 126.L Montel 424.0 Moor 273.K 425.AA n-classifying (of a topological group) 147.0 n-connected 79.C 202.L n-connective fiber 148.0 n-dimensional 117.B n-simple 202.L non-Euclidean 285.A normal 425.Q normal analytic 23.0 normed linear 37.B NP- 71.E nuclear 424.S null 251.0 w-connected 79.C orbit(ofa G-space) 431.A ordered linear 31O.B Orlicz ] 68.B P- 425.Y p- 425.Y paracompact 425.S parameter (of a family of compact complex manifolds) 72.0 parameter (for a family of probability measures) 398.A parameter (of a probability distribution) 396.B partition of a 425.L path 148.C path (of a Markov process) 261.B path-connected 79.B pathological 65.F Peirce 231.B perfectly normal 425.Q perfectly separable 425.P phase 126.B 163.C 402.C physical Hilbert 150.0 pinching a set to a point 202.E polar 191.1 Polish 22.1 273.1 precompact metric 273.B precompact uniform 436.H pre-Hilbert 197.B principal (of a flag) 139.B principal half- 139.B probability 342.B product 425.K product measure 270.H product metric 273.B product topological 425.K product uniform 436.E projective, over A 147.E projective limit 21O.C projectively flat App. A, Table 4.II pseudocompact 425.S pseudometric 273.B pseudometrizable uniform 436.F Q- 425.BB quasi-Banach 37.0 Subject Index Space(s) quasicompact 408.S quasidual (of a locally compact group) 437.1 quasinormed linear 37.0 quaternion hyperbolic 412.0 quotient 425.L quotient (by a discrete transformation group) 122.A quotient (of a linear space with respect to an equivalence relation) 256.F quotient (by a transformation group) 122.A quotient topological 425.L ,-closed 425.U ramified covering 23.B real-compact 425.BB real Hilbert 197.B real hyperbolic 412.0 real interpolation 224.C real linear 256.A real projective 343.0 reduced product 202.Q reflexive Banach 37.0 regular 425.Q regular Banach 37.0 representation (for a Banach algebra) 36.0 representation (of a representation of a Lie algebra) 248.B representation (of a representation of a Lie group) 249.0 representation (of a unitary representation) 437.A Riemannian 364.A Riesz 31O.B right coset (of a topological group) 423.E right projective 343.H right quotient (of a topological group) 423.E ringed 383.H (S)- 424.S sample 342.B 396.B 398.A scale of Banach 286.Z Schwartz 424.S separable 425.P separable metric 273.E separated 425.Q separated uniform 436.C sequential 425.CC sequentially compact 425.S sheaf 383.C shrinking, to a point 202.E Siegel, of degree n 32.F Siegel upper half-, of degree n 32.F E- 425.Y 0'- 425.Y O'-compact 425.V O'-finite measure 270.0 simply connected 79.C 170 of singularity 390.E smashing, to a point 202.E Sobolev 168.B Spanier cohomology theory, Alexander- Kolmogorov- 201.M spherical 285.0 Spivak normal fiber 114.1 standard Borel 270.C standard measurable 270.C standard vector (of an affine space) 7.A state (of a dynamical system) 126.B state (of a Markov process) 261.B state (in static model in catastrophe theory) 51.B state (of a stochastic prOccess) 407.B 
Subject Index Space complexity Stein 23.F stratifiable 425.Y strongly paracompact 425.S structure (of a Banach algebra) 36.0 subbase for 425.F Suslin 22.1 425.CC symmetric Hermitian 412.E symmetric homogeneous 412.B symmetric Riemannian 412 symmetric Riemannian homogeneous 412.8 To- 425.Q T l - 425.Q Tl-uniform 436.C Tz- 425.Q T 3 - 425.Q T 4 - 425.Q Ts- 425.Q T 6 - 425.Q tangent 105.H tangent vectOr 105.H Teichmiiller 416 tensor, of degree k 256.1 tensor, of type (p, q) 256.1 test function 125.S Thom 114.0 Tikhonov 425.Q time parameter 260.A topological  topological space topological complete 436.1 topological linear 424.A topological vector 424.A total (of a fiber bundle) 147.B total (of a fiber space) 148.B totally bounded metric 273.B totally bounded uniform 436.H totally disconnected 79.0 transformation (of an algebraic group) 13.0 of type S 125.T underlying topological (ofa complex manifold) 72.A underlying topological (of a topological group) 423.A uniform  uniform space uniformizable topological 436.H uniformly locally compact 425.V uniform topological 436.C unisolvent 142.B universal covering 91.B universal Teichmiiller 416 vector, over K 256.A velocity phase 126.L weakly symmetric Riemannian 412.J well-chained metric 79.0 wild 65.F space complexity 71.A space form 285.E 412.H Euclidean 412.H hyperbolic 412.H spherical 412.H space geometry 181 space group 92.A crystallographic 92.A equivalent 92.A spacelike 258.A 359.B space reflection 359.B space-time, Minkowski 359.B space-time Brownian motion 45.F space-time inversion 258.A space-time manifold 359.0 span 2114 (of a domain) 77.E (a linear subspace by a set) 256.F (ofa Riemann surface) 367.G spanning tree 186.0 sparse 302.C Spath type division theorem (for micro differential operators) 274.F spatial (*-isomorphism on von Neumann algebras) 308.C spatially homogeneous (process) 261.A spatially isomorphic (automorphisms on a measure space) 136.E spatial tensor product 36.H Spearman rank correlation 371.K Spec (spectrum) 16.0 special Clifford group 61.0 special divisor 9.C special flow 136.0 special function(s) 389, App. A, Table 14.II of confluent type 389.A of ellipsoidal type 389.A of hyper geometric type 389.A special functional equations 388 special isoperimetric problem 228.A speciality index (of a divisor of an algebraic curve) 9.C (of a divisor on an algebraic surface) 15.0 0- (of a divisor of an algebraic curve) 9.F specialization 16.A (in etale topology) 16.AA specialJ ordan algebra 231.A special linear group 60.B (over a noncommutative field) 60.0 of degree n over K 60.B projective 60.B projective (over a noncommutative field) 60.0 special orthogonal group 60.1 complex 60.1 over K with respect to Q 60.K special principle of relativity 359.B special relativity 359.B special representation (of a Jordan algebra) 231.C special surface 110.A special theory of perturbations 420.£ special theory of relativity 359.A special unitary group 60.F (relative to an a-Hermitian form) 60.0 over K 60.H projective, over K 60.H special universal enveloping algebra (of a Jordan algebra) 231.C special valuation 439.B species ellipsoidal harmonics of the first, second, third or fourth 133.C Lame functions of the first, second, third or fourth 133.C singular projective transformation of the h th 343.0 singular quadric hypersurface of the h th (in a projective space) 343.E specification 401.A problem of 397.P specific heat at constant pressure 419.B at constant volume 419.B specificity 346.F specific resistance BO.B spectral analysis 390.A spectral concentration 331.F 
2115 spectral decomposition 126.1 395.B spectral density, quadrature 397.N spectral functor 200.1 spectral geometry 391.A spectral integral 390.0 spectral invariant 136.E spectrally isomorphic (automorphisms on a measure space) 136.E spectral mapping theorem 251.0 spectral measure 390.B,K 395.B,C complex 390.0 maximum 390.0 real 390.0 spectral method 304.B spectral operator 390.K spectral property ] 36.E spectral radius 126.K 251.F 390.A spectral representation 390.E complex 390.E spectral resolution 390.E complex 390.E spectral sequence 200.1 (of a fiber space) 148.E cohomology 200.1 Hodge 16.U spectral synthesis 36.L spectral theorem 390.E spectrum 390.A (of a commutative ring) 16.0 (of a domain in a Riemannian manifold) 391.A (of an element of a Banach algebra) 36.C (in homotopy theory) 202.T (of a hyperfunction) 274.E (of an integral equation) 217.J (of a linear operator) 251.F 390.A (of a spectral measure) 390.C absolutely continuous 390.E continuous (of a linear operator) 390.A continuous (of an integral equation) 217.1 countable Lebesgue 136.E discrete 136.E 390.E Eilenberg-MacLane 202.T essential 390.E,1 formal (of a Noetherian ring) 16,X intermittent 433.C joint 36.M Kolmogorov 433.C for p-forms 391.B point 390.A pure point 136.E quasidiscrete 136.E residual 390.A simple 390.G singular 125.CC 345.A 390.A singular (of a hyperfunction) 274.E singularity (of a hyperfunction) 125.CC 274.E sphere 202.T stable homotopy group of the Thom 114.G Thom 114.G 202.T spectrum condition 150.0 speed measure 115.B Speiser theorem, Hilbert- 172.J Spencer mapping (map), Kodaira- 72.G sphere(s) 139.1 150 circumscribing (of a simplex) 139.1 combinatorial, group of oriented differentiable structures on the 114.1 complex 74.0 a- (of a point) 273.C exotic 114.B Subject Index Spinor(s) homotopy n- 65.C homotopy no, h-cobordism group of 114.1 horned, Alexander's 65.G open n- 140 open 140 PL (k - 1)- 65.C pseudo- 111.1 Riemann 74.0 solid 140 solid n- 140 topological 140 topological solid 140 unit 140 w- 74.0 z- 74.0 sphere bundle n- 147.K cotangential 274.E normal 274.E tangential 274.E unit tangent 126.L sphere geometry 76.C sphere pair 235.G 65.0 sphere spectrum 202.T sphere theorem (characterization of a sphere) 178.C (embedding in a 3-manifold) 65.E spherical (real h ypersurface) 344. C (space form) 412.H spherical astronomy 392 spherical Bessel function 39.B spherical coordinates 90.C, App. A, Table 3.V spherical derivative (for an analytic or meromorphic function) 435.E spherical excess 432.B spherical Fourier transform 437.Z spherical function(s) 393 (on a homogeneous space) 437.X Laplace 393.A zonal (on a homogeneous space) 437.Y spherical G-fiber homotopy type 431.F spherical geometry 285.0 spherical harmonic function 193.C spherical harmonics, biaxial 393.0 spherical indicatrix (of a space curve) 111.F spherical modification 114.F spherical representation of a differentiable manifold Il1.G of a space curve 111.F of a unimodular locally compact group 437.Z spherical space 285.0 spherical triangle 432.B, App. A, Table 2.m spherical trigonometry 432.B spherical type 13.R spherical wave 446 spheroidal coordinates 133.0, App. A, Table 3.V spheroidal wave function 133.E spin 132.A 258.A 415.G continuous 258.A spin and statistics, connection of 132.A 150.0 spin ball 351.L spin bundle 237.F Spinc bundle 237.F spin-flip model 340.C spin mapping (map) 237.G spin matrix, Pauli 258.A 415.G spinor(s) 61.E contravariant 258.A covariant 258.A dotted 258.B 
Subject Index Spinor group even half- 61.E mixed, ofrank (k, n) 258.A odd half- 61.E undotted 258.B spinor group 60.1 61.0 complex 61.E spinorial norm 61.0 spinor representation (of rank k) 258.A spin representation (of SO(n)) 60.1 (of Spin(n, C)) 61.E even half- 61.E half- 61.E odd half- 61.E spin-structure 237.F 431.0 spin systems, lattice 402.G spiral 93.H Archimedes 93.H Bernoulli 93.H Cornu 93.H equiangular 93.H hyperbolic 93.H logarithmic 93.H reciprocal 93.H Spivak normal fiber space 114.1 spline 223.F natural 223.F spline interpolation 223.F split «B,N)-pair) 151.J (co cycle in an extension) 257.E (exact sequence) 277.K k- (algebraic group) 13.N K- (algebraic torus) 13.0 k-quasi- (algebraic group) 13.0 maximal k-, torus 13.Q split extension (of a group) 190.N splitting, Heegaard 65.C splitting field for an algebra 362.F for an algebraic torus 13.0 minimal (of a polynomial) 149.G of a polynomial 149.0 splitting ring 29.K split torus, maximal k- 13.Q spot prime 439.H Spur 269.F squarc(s) Euler 241.B latin 241 Latin 102.K least, approximation 336.0 matrix of the sum of, between classes 280.B matrix of the sum of, within classes 280.B method of least 303.1 middle-, method 354.B Room 241.0 Shrikhande 102.K Y ouden 102.K Youden, design 102.K square-free integer 347.H square integrable 168.B square integrable unitary representation 437.M square matrix 269.A square net 304.E square numbers 4.0 S.s. complex(es) (semisimplicial complex) 70.E geometric realization of 70.E isomorphic 70.E 2116 s.s. mapping (semisimplicial mapping) 70.E realization of 70.E stability 286.S 303.E 394 A- 303.0 Ao- 303.0 A(a)- 303.G absolute 303.G conjecture 126.1 exchange of 286.T interval of absolute 303 G interval of relative 303.G orbital (of a solution of a differential equation) 394.0 principle of linearized 286.S region of absolute (of the Runge-Kutta (P, p) method) 303.0 region of relative 303.G relative 303.0 stiff- 303.0 structural 290.A structural, theorem 126.1 stability group 362.B stability subgroup (of a topological group) 431.A stability theorem 0- 126.1 structural 126.J stabilizer (in an operation of a group) 362.B (in a permutation group) 151.H (in a topological transformation group) 431.A reductive 199.A stable 394.A (coherent sheaf on a projective variety) 241.Y (compact leaf) 154.0 (discretization, initial value problems) 304.0 (equilibrium solution) 286.S (initial value problem) 304.F (invariant set) 126.F (linear function) 163.H (manifold) 126.0 (minimal submanifold) 275.B (static model in catastrophe theory) 51.E absolutely 303.0 asymptotically 126.F 286.S 394.B in both directions (Lyapunov stable) 394.A Cr_O_ 126.H C-structurally 126.H conditionally 394.0 exponentially 163.0 394.B externally, set 186.1 globaIlyasymptoticaIly 126.F internally, set 186.1 Lagrange 126.E Lyapunov 126.F Lyapunov, in the positive or negative direction 394.A negatively Lagrange 126.E negatively Poisson 126.E one-side, for exponent! App. A, Table 22 orbitally 126.F Poisson 126.E positively Lagrange 126.E positively Poisson 126.E relatively 303.0 uniformly 394.B uniformly asymptotically 163.G 394.B uniformly Lyapunov 126.F stable cohomology operation 64.B stable curve 9.K 
2117 stable distribution 341.G quasi- 341.G semi- 341 G stable homotopy group 202.T, App. A, Table 6.VII (ofThom spectrum) 114.G of classical groups 202. V of k-stem 202.U stable manifold l26.G,1 stable point 16.W stable primary cohomology operation 64.C stable prOcess 5.F exponent of 5.F one-sided, of the exponent (X 5.F strictly 5.F symmetric 5.F stable range (of embed dings) 114.0 stable reduction (of an Abelian variety) 3.N (of a curve) 9.K potential (of an Abelian variety) 3.N stable reduction theorem 3.N 9.K stable secondary cohomology operation 64.C stable set 173.0 externally 186.1 internally 186.1 stable solution (of the Hill equation) 268.E stable state 260.F 394.A 404.A stable vector bundle (algebraic) 16.Y (topological) 237.B stably almost complex manifold Il4.H stably equivalent (vector bundles) 237.B stably fiber homotopy equivalent 237.1 stably parallelizable (manifold) 114.1 stack 96.E stage method, (P + 1)- 303.0 stalk (of a sheaf over a point) 16.AA 383.B standard (in nonstandard analysis) 293.B (transition probability) 260.F standard Borel space 270.C standard complex (of a Lie algebra) 200.0 standard defining function 125.Z standard deviation (characteristics of the distribution) 397.C (of a probability distribution) 341.B (of a random variable) 342.C population 396.C sample 396.C standard form 241.A (of a difference equation) 104.C of the equation (of a conic section) 78.C Legendre-Jacobi (of an elliptic integral) 134.A, App. A, Table 16.1 standard Gaussian distribution 176.A standard Kahler metric (of a complex projective space) 232.0 standard measurable space 270.0 standard normal distribution 341.0 standard parabolic k-subgroup l3.Q standard part (in nonstandard analysis) 293.0 standard q-simplex 201.E standard random walk 260.A standard resolution (of Z) 200.M standard sequence 400.K standard set 22.1 standard vector space (of an affine space) 7.A star  also * (in a complex) 13.R (in a Euclidean complex) 70.B Subject Index Stationary random distribution (in a projective space) 343.B (in a simplicial complex) 70.C (of a subset defined by a covering) 425.R open 70.B,C star body, bounded 182.C star convergence 87.K (0)- 87.L relative uniform 31O.F star-finite (covering of a set) 425.R star-finite property 425.S star refinement (of a covering) 425.R star region 339.0 starting values (in a multistep method) 303.E start node 281.0 star topology, weak (of a normed linear space) 37.E 424.H state(s) (of a C*-algebra) 308.0 (in Ising model) 340.B (in quantum mechanics) 351.B bound 351.0 ceiling 402.G completeness of the scattering 150.0 equation of 419.A equilibrium 136.H 340.B 4l9.A even 415.H fictitious 260.F final 31.B Gibbs 340.B ground 402.G in- 150.0 386.A initial 31.B instantaneous 260.F 261.B internal 31.B odd 415.H out- 150.0 386.A scattering 395.B stable 260.F 394.A 404.A stationary 340.C 351.0 of statistical control 404.A sum over 402.0 unstable 394.A state estimator 86.E state space 126.B (in catastrophe theory) 51.B (of a dynamical system) 126.B (of a Markov process) 261.B (of a stochastic process) 407.B state-space approach 86.A state variable 127.A static model (in catastrophe theory) 51.B stable 51.B stationary capacity 213.F stationary curve (of the Euler-Lagrange differential equation) 324.E (of a variation problem) 46.B stationary function 46.B stationary iterative process, linear 302.C stationary phase method 30.B stationary point (of an arc of class CO) 111.0 stationary Poisson point process 407.0 stationary process(es) 342.A 395 shift associated with 136.0 strictly 395.A strongly 395.A weakly 395.A weakly, of degree k 395.1 in the wider sense 395.A stationary random distribution 
Subject Index Stationary source strictly 395.H strongly 395.F,H weakly 395.C stationary source 213.C stationary state 340.C 351.0 stationary value (of a function) 106.L stationary variational inequality 440.B stationary wave 446 statistic 396 ancillary 396.H 401.C HotelIing's T 2 280.B invariant 396.H Kolmogorov-Smirnov test 374.E maximal invariant 396.1 minimal sufficient 396.1 n-dinlensional 396.B necessary and sufficient 396.E I-dimensional 396.B order 396.C selection 396.F t- 374.B V- 374.1 statistical control, state of 404.A statistical data analysis 397.A statistical decision function 398 statistical decision problem 398.A statistical decision procedure 398.A statistical estimation 399, App. A, Table 23 statistical experiment 398.0 sta tis tical genetics 40.B statistical hypothesis 4oo.A statistical hypothesis testing 400, App. A, Table 23 statistical inference 40 1 statistical model 403 statistical mechanics 342.A 402 classical 402.A equilibrium 402.A of irreversible processes 402.A Markov 340.C quantum 402.A statistical planning 102.A statistical quality control 404 statistical structure 396.E dominated 396.F statistical thermodynamics 402.A statistics 397.C Bose 377.B 402.E Fermi 377.B 402.E statistics and spin, connection of 150.0 Staudt algebra 343.C Steenrod algebra 64.B Steenrod axioms, Eilenberg- 201.Q Steenrod isomorphism theorem, Hurewicz- 148.0 Steenrod operator App. A, Table 6.II Steenrod pth power operation 64.B Steenrod square operation 64.B steepest descent, curve of 46.A steepness, wave 205.F Stein, analytic space in the sense of Behnke and 23.E Steinberg formula (on representation of compact Lie groups) 248.Z Steinberg group 237.1 Steinberg symbol 237.1 Steinberg type 13.0 Stein continuation theorem, Remmert- 23.B Stein decomposition, Fefferman- 168.B Steiner problem 179.A Steiner symmetrization 228.B Steinhaus theorem, Banach- 2118 (in a Banach space) 37.H (in a topological linear space) 424.1 Stein lemma, Hunt- 4oo.F Stein manifold 21.L fundamental theorems of 21.L n.E Stein space 23.F Stein theorem, Behnke- 21.H Stein theorem, Lehmann- 4oo.B step, fractional 304.F step-by-step method 163.0 step-down operator 206.B step size (in numerical solution) 303.11 step-up operator 206.B stereographic projection 74.0 Stiefel manifold 199.B complex 199.B infinite 147.1 of k-frames 199.B of orthogonal k-frames 199.B real, of k-frames 199.B real, of orthogonal k-frames 199.B Stiefel- Whitney class (of a differentiable manifold) 56.F 147.M (of an O(n)-bundle) 147.M (of an R"-bundle) 56.B (of a topological manifold) 56.F total 56.B universal 56.B Stiefel-Whitney number 56.F Stieltjes integral 94.E Lebesgue- 94.C 166.C Riemann- 94.B 166.C StieItjes measure, Lebesgue- 166.C 270.L Stieltjes moment problem 240.K Stieltjes theorem 133.C Stieltjes transform 220.0 Fourier- 192.B,0 Laplace- 240.A Stiemke theorem 255.B stiff 303.G in an interval 303.G stiffness matrix 304.C stiffness ratio 303.G stiff-stability 303.G stimulus-sampling model, Estes 342.H 346.G Stirling formula 174.A 212.C, App. A. Table 17.1 Stirling interpolation formula App. A, Table 21 Stirling number of the second kind 66.0 stochastically larger (random variable) 371.C stochastic calculus 406.A stochastic control 342.A 405 stochastic differential 406.C stochastic differential equation 342.A linear (LSOE) 405.G of Markovian type 406.0 stochastic differential of Stratonovich type 406.C stochastic filtering 324.A 405.F stochastic inference, graphical method of 19.B stochastic integral 261.E 406.B ofIto type 406.C of Stratonovich type 406.C stochastic Ising model 340.C stochastic matrix 260.A stochastic maximum principle 405.0 stochastic model 264 stochastic moving frame 406.G stochastic paper 19.B stochastic process(es) 342.A 407 generalized 407.C with stationary increments of ord"r n 395.1 
2119 stochastic programming 264.C 307.C 408 two-stage 408.A Stoilow compactification, Kerekjarto- 207.C Stoi1ow type (compactification) 207.B Stokes approximation 205.C Stokes assumption 205.C Stokes differential equation 167.E 188.E Stokes equation, general Navier- 204.F Stokes equation, Navier- 204.B 205.C Stokes formula 94.F 105.0, App. A, Table 3.II Oreen- 94.F Stokes initial value problem, Navier- 204.B Stokes multiplier 254.C Stokes phenomenon 254.C Stokes theorem App. A, Table 3.m Stokes wave 205.F Stolz, differentiable in the sense of 106.0 Stolz path (in a plane domain) 333.B Stone-Cech compactification 207.C 425.T Stone-Oel'fand theorem 168.B Stone integrable function, Oaniell- 310.1 Stone integral, Oaniell- 310.1 Stone theorem 378.C 425,X 437.R Stone theorem, Weierstrass- 168.B Stone-Titchmarsh-Kodaira theory, Weyl- 112.0 stopping, optimal 405.E stopping rule 398.F stopping time 261.B 407.B storage, push-down 96.E stored program principle 75.B Stormer inequality, Powers- 212.B straight angle 139.0 straightening of the angle 114.F straight, G-space is 178.H straight line(s) 93.A 155.B straight line solution 420.B strain 271.G strain, shearing 271.0 strain tensor 271.0 strange attractor 126.N Strassen in variance principle 250.E strategic variable 264 strategy (strategies) 33.F 108.B,C 173.C behavior 173.B local 173.B mixed 173.C pair 108.B pure 173 winning 33.F stratifiable space 425. Y stratification, Whitney 418.0 stratified sampling 373.E Stratonovich type stochastic differential of 406.C stochastic integral of 406.C stratum (strata) 373.E 418.0 Ji-constant 418.E stream function 205.B streamlined (body) 205.C stream lines 205.B strength 102.L stress 271.G normal 271.0 shearing 271.G tangential 271.0 stress tensor 150.B 271.0 Maxwell 130.A strict Albanese variety 16.P strict implication 411.L strict localization 16.AA Subject Index Strongly inaccessible strictly concave function 88.A strictly convex function 88.A strictly decreasing function 166.A strictly ergodic (homeomorphism on a compact metric space) 136.H strictly G-stationary (system of random variables) 395.1 strictly increasing function 166.A strictly inductive limit (of a sequence of locally convex spaces) 424.W strictly monotone function 166.A (of ordinal numbers) 312.C strictly of Polya type (a family of probability densities) 374.1 strictly positive (element in E") 31O.H strictly pseudoconvex 344.A strictly stable process 5.F strictly stationary process 395.A strictly stationary random distribution 395.H strict morphism (between topological groups) 423.1 string (X- 248.L equation of a vibrating 325.A string model 132.C strip bicharacteristic 320.B characteristic 320.0 324.B Mobius 410.B strip condition 320.0 strong (boundary component) 77.E strong convergence (of operators) 251.C strong convergence theorem (on distributions) 125.G strong deformation retract 202.0 strong dilation 251.M strong dual (space) 424.K stronger (equivalence relation) 135.C (method of summation) 379.L (topology) 425.H (uniformity) 436.E stronger form of Cauchy's integral theorem 198.B strong extension (of a differential operator) 112.E (of a differential operator with boundary condi- tion) 112.F strong infinity, axiom of 33.E strong integrability 443.1 strong lacuna 325.1 strong law of large numbers 250.C strong Lefschetz theorem 16.U strongly, converge (in a Banach space) 37.B strongly acute type 304.C strongly closed subgroup 15l.J strongly compact cardinal number 33.E strongly connected (graph) 186.F strongly connected components 186.F strongly continuous (Banach space-valued function) 37.K (in unitary representations) 437.A strongly continuous representation (of a topological space) 69.B strongly continuous semigroup 378.B strongly distinguished basis 418.F strongly e11iptic (differential operator) 112.0 strongly elliptic operator 323.H strongly embedded subgroup 15l.J strongly exposed (of a convex set) 443.H strongly hyperbolic differential operator 325.H strongly inaccessible 33.F 312.E 
Subject Index Strongly inaccessible cardinal number strongly inaccessible cardinal number 33.E strongly measurable 443.B,1 strongly mixing automorphism 136.E strongly nonlinear differential equation 290.0 strongly normal extension field 113 strongly P-convex set 112.C strongly paracompact space 425.S strongly pseudoconvex domain 21.G strongly recurrent (measurable transformation) 136C strongly separated (convex sets) 89.A strongly stationary process 395.A strongly stationary random distribution 395.H strong Markov process 261.B strong Markov property 261.B strong maximum principle 323.C strong measurability 443.1 strong operator topology 251.C strong rigidity theorem 122.0 strong solution (of Navier-Stokes equation) 204.C (of a stochastic differential equation) 406.0 unique 406.0 strong topology (on a direct product space) 425.K (on a family of measures) 338.E (on a normed space) 37.E (on a topological linear space) 424.K strong transversality condition 126.1 structural constants (of a Lie algebra) 248.C structural equation system, linear 128.C structurally stable, C'- 126.H structural stability 290.A ,tructural stability theorem 126.J structure(s) 409 (of a language) 276.B almost complex 72.B almost contact ltO.E almost contact metric ltO.E almost symplectic 191.B analytic (in function algebras) 164.F analytic (on a Riemann surface) 367.A arithmetically equivalent 276.0 Cauchy Riemann 344.A CR 344.A C'- (of a differentiable manifold) 105.0 114.A C'-, Haefliger 154.F classifying space for i; 154.E coalition 173.0 compatible with C- 114.B complex tOS.Y complex (in a complex manifold) 72.A complex (on R2") 3.H complex (on a Riemann surface) 367.A complex analytic (in a complex manifold) 72.A conformal 191.B conformal (on a Riemann surface) 367.A contact t05.Y contact metric ltO.E data 96.B deformation of complex 72.0 differentiable 114.B differentiable, of class C 105.0 elementarily equivalent 276.0 equations (of a Euclidean space) 111.B equations of (for relative components) 110.A foliated 105.Y G- 191 i - (on a differentiable manifold) 105.Y 2120 i- (on a topological space) 90.0 i;- 154.E i'P- 154.H group of oriented differentiable (on a com- binatorial sphere) 114.1 Hodge (of a vectOr space) 16.V isomorphic 276.E jumping of 72.0 lacunary (of a power series) 339.E level n (on an Abelian variety) 3.N linear 96.C mathematical 409.B mixed 16.V Neyman 400.0 normal 276.0 normal analytic 386.C PL 65.C pseudogroup t05.Y real analytic lOS.0 smooth 114.B spin- 237.F 431.0 statistical 396.E symplectic 219.C tensor field of almost complex (induced by a complex structure) 72.B topological 425.A,B tree 96.0 twinning 92.0 uniform 436.B structure equation (of an affine connection) 417.B (for a curvature form) 80.G (for a torsion form) 80.H linear, system 128.C structure function 191.C structure group (of a fiber bundle) 147.B structure morphism 52.0 structure sheaf (of a prealgebraic variety) 16.C (of a ringed space) 383.H (of a variety) 16.B structure space (of a Banach algebra) 36.0 structure theorem (on topological Abelian groups) 422.E of complete local rings 284.0 for von Neumann algebras of type III 308.1 Sturm-Liouville operator 112.1 Sturm-Liouville problem 31S.B Sturm method 301.C Sturm theorem (on real roots of an algebraic equa- tion) to.E Struve function App. A, Table 19.1V Student test 400.0 subadditive cuts 215.C subadditive ergodic theorem 136.B subadditive functional 88.B subadditive process 136.B subalgebra 29.A Borel (of a semisimple Lie algebra) 248.0 Cartan (of a Lie algebra) 248.1 Cartan (symmetric Riemann spaCo) 413.F closed (of a Banach algebra) 36. B Lie 248.A of a Lie algebra associated with a Lie subgroup 249.0 parabolic (of a semisimp1e Lie algebra) 248.0 *- 308.C subbase for a space 425.F 
2121 for a topology 425.F subbialgebra 203.0 subbundle (of an algebraic vector bundle) 16.Y (of a vector bundle) 147.F subcategory 52.A full 52.A subcoalgebra 203.F subcomplex , (of a cell complex) 70.0 (of a chain complex) 200.H 201.B (ofa cochain complex) 201.H (of a complex) 13.R (of a Euclidean complex) 70.B (of a simplicial complex) 70.C (of an s.s. complex) 70.E chain 200.C co chain 200.F subcontraction 186.E subcritical (Galton-Watson process) 44.B subdifferential 88.0 subdivision (of a Euclidean complex) 70.B (of a simplicial complex) 70.C (of a triangulation) 70.C barycentric (of a Euclidean complex) 70.B barycentric (of a simplicial complex) 70.C dual (of a triangulation of a homology mani- fold) 65.B subelliptic 112.0 subfamily 165.0 subfield 149.8 valuation over a 439.B,C subgraph 186.E subgroup (of a group) 190.C (ofa topological group) 423.0 admissible 190.E admissible normal 190.E algebraic 13.A arithmetic 13.P 122.F,G Borel 13.G 249.J Cart an 13.H 249.1 Carter 151.0 closed 423.0 commutator 190.H congruence 122.0 connected Lie 249.0 cyclic 190.C divisible 422.0 Hall 151.E invariant 190.C irreducible discrete 122.F isotropy 431.A Iwahori 13.R k-Borel 13.G Levi- 13.Q Lie 249.0 maximal torsion 2.A minimal parabolic k- 13.Q normal 190.0 n- 190.E one-parameter 249.Q open 423.0 parabolic 13.G 249.J parabolic 13.R principal congruence, oflevel N 122.0 p-Sylow 151.B rational 404.B Schur 362.F Subject Index Subset(s) sequences of 190.F stability 431.A standard parabolic k- 13.Q strongly closed 151.J strongly embedded 151.J subnormal 190.G Sylow 151.B toroidal 248,X torsion 2.A,C subharmonic functions 193 almost 193.T subinvariant measure 261.F subjective probability 401.B sublattice 243.C submanifold (of a Banach manifold) 286.N (of a combinatorial manifold) 65.0 (of a Coo-manifold) 105.L closed 105.L complex analytic 72.A immersed (of a Euclidean space) II1.A isotropic 365.0 Kahler 365.L minimal 275 365.0 regular 105.L Riemannian 365.A totally geodesic 365.0 totally real 365.M totally umbilical 365.0 submartingale 262.A submedian 193.T submersion 105.L submodular 66.F submodule A- 277.C allowed 277.C complementary 277.H homogeneous A- (of a graded A-module) 200.B primary 284.A subnet 87.H cofinal 87.H subnormal (operator in a Hilbert space) 251.K subnormal subgroup 190.G subobject 52.0 subordinate 105.0 437.T subordination 5 261.F of the (Xth order 261.F subordinator of the exponent (X 5.F subproblems 215.0 subrepresentation (of a linear representation) 362.C (of a projective representation) 362.J (of a unitary representation) 437.C subring 368.E differential 113 subroutine 75.C subscripts, raising 417.0 subsequence 165.0 <P- 354.0 subset(s) 381.A (in axiomatic set theory) 33.B analytic (of a complex manifold) 72.E axiom of 33.B 381.G Borel 270.C circled (of a linear topological space) 424.E cofinal 311.0 G- 362.B k- 330 proper 381.A 
Subject Index Subsbift residual 311.0 subshift 126.1 of finite type 126.1 Markov 126.1 subsidiary equation, Charpit 82.C 320.0 subsonic (Mach number) 205.B subsonic flow 326.A subspace (of an affine space) 7.A (of a linear space) 256.F (of a projective space) 343.B (of a topological space) 425.J analytic 23.C,0 closed linear (of a Hilbert space) 197.E complementary (of a linear subspace) 256.F horizontal 191.C ingoing 375.H invariant (of a linear operator) 251.L involutive 428.F linear (of a linear space) 256.F metric 273.B n-particle 377.A orthogonal (determined by a linear subspace) 256.0 orthogonal (ofa linear space) 139.0 outgoing 375.H precompact (metric) 273.B parallel (in an affine space) 7.B parallel, in the narrower sense (in an affine space) 7.B parallel, in the wider sense (in an affine space) 7.B principal (of a linear operator) 390.8 root (of a linear operator) 390.B root (of a semisimple Lie algebra) 248.K singular (of a singular projective transforma- tion) 343.0 totally bounded (metric) 273.B totally isotropic (relative to an a-Hermitian form) 60.0 totally isotropic (with respect to a quadratic form) 348.E tota11y singular (with respect to a quadratic form) 348.E V -invariant (of a representation space of a unitary representation 437.C uniform 436.E substituted distribution 125.Q substitution (of a hyperfunction) 125,X 274.E axiom of 381.0 back 302.B Frobenius (of a prime ideal) 14.K subsystem (of an algebraic system) 409.C closed (of a root system) 13.L subtraction 361.B subtraction terms 361.B subvariety, Abelian 3.B successive approximation method of (for an elliptic partial differential equation) 323.0 method of (for Fredholm integral equations of the second kind) 217.0 method of (for ordinary differential equations) 316.0 successive minima (in a lattice) 182.C successive minimum points 182.C successive overrelaxation (SOR) 302.C successor 2122 (of an element in an ordered set) 311.B (of a natural number) 294.B sufficiency prediction 396.J principle of 401. C sufficient (O'-field, statistic) Bayes 396.J D- 396.1 decision theoretically 396.1 minimal 396.E pairwise 396.F test 396.J sufficiently many irreducible representations 437.B sum (of convergent double series) 379.E (of a divergent series by a summation) 379.L (of elements ofa group) 190.A (of elements ofa linear space) 256.A (a function) 104.B (of ideals) 67.B (of linear operators) 251.B (of linear subspaces) 256.F (of matrices) 269.B (of ordinal numbers) 312.C (of potencies) 49.C (of real numbers) 355.A (of a quadrangular set of six points) 343.C (of a series) 379.A (of submodules) 277.B (= union of sets) 33.B 381.B (of vectors) 442.A amalgamated 52.0 Baer (of extensions) 200.K cardinal (of a family of ordered sets) 311.F Cauchy (of a series) 379.A connected (of oriented compact CO-manifolds) 114.F connected (of 3-manifolds) 65.E constant- (game) 173.A Oarboux 216.A Oedekind 328.A diagonal (of a matrix) 269.F diagonal partial (of a double series) 379.E direct  direct sum disjoint 381.B fiber 52.0 Oaussian 295.0 450.C general- (game) 173.A indefinite (of a function) 104.B Kloosterman 32.C local Oaussian 450.F logical (of propositions) 411.B ordinal (of a family of ordered sets) 311.G orthogonality for a finite 19.0 317.0 partial (of a series) 379.A of products 216.A Ramanujan 295.0 Riemann 216.A scalar (of linear operators) 37.C over states 402.0 topological 425.M trigonometric 4.C Whitney (of vector bundles) 147.F zero (game) 173.A zero-, two-person game 108.B sum event 342.B summable A- 379.N by Abel's method 379.N 
2123 absolute Borel 379.0 'B- 379.0 I'BI- 379.0 by Borel's exponential method 379.0 by Borel's integral method 379.0 by Cesaro's method of order ()( 379.M by Euler's method 379.P by Holder's method of order p 379.M (H, p)- 379.M by Norlund's method 379.Q (R, k)- 379.S by Riesz's method of order k 379.R T- 379.L summable pth power, operator of 68.K summand, direct (of a direct sum of sets) 381.E summation Abel's method of 379.N Abel's partial 379.0 Borel's method of 379.N (C, ()()- 379.M Cesaro's method of, of order ()( 379.M Euler's method of 379.P of a function 104.B Lebesgue's method of 379.S methods of 379.L Norlund's method of 379.Q Riemann's method of 379.S Riesz's method of, of the kth order 379.R summation convention, Einstein's 417.B summation formula Euler 295.E Poisson (on Fourier transforms) 192.C Poisson (on a locally compact Abelian group) 192.L summing, absolutely (operator) 68.N sum theorem for dimension 117.C Sundman theorem 420.C sup (supremum) 311.B superabundance (of a divisor on an algebraic surface) 15.0 superadditive 173.0 superconductivity 130.B supercritical (Oalton-Watson process) 44.B superefficient estimator 399.N superharmonic (function) 193.P 260.0 superharmonic measure 260.1 superharmonic transformation 261.F superior function, right 316.E superior limit (of a sequence of real numbers) 87.C (of a sequence of subsets of a set) 270.C superior limit event 342.B supermartingale 262.A superm ultiplet theory 351.J superposition, principle of 2S2.B 322.C superregular function 260.0 superrenormalizable IS0.C superscripts, lowering 417.0 superselection rule, univalence 3Sl.K superselection sector IS0.E 351.K supersolvable group IS1.0 supersonic 205.B 326.A supplementary angles 139.0 supplementary interval 4.B supplementary series 258.C supplementation-equal polygons 15S.F supplemented algebra 200.M support (of a coherent sheaf) 16.E (of a differential form) 105.Q Subject Index Surface(s) (of a distribution) 125.0 (of a function) 12S.B 168.B 425.R (of a section of a sheaf) 383.C (of a spectral measure) 390.0 compact (of a singular q-cochain) 201.P essential (of a distribution) 274.0 singular (of a distribution) 112.C singular (of a hyperfunction) 125.W supporting function 125.0 supporting functional (of a convex set) 89.0 supporting half-space (of a convex set) 89.A supporting hyperplane (of a convex set) 89.A supporting line (of an oval) 89.C supporting line function (of an oval) 89.C supporting point (of a convex set) 89.0 (of a projective frame) 343.C supremum (of an ordered set) 168.B (of a set of Hermitian operators) 308.A (of a subset of a vector lattice) 310.C essential (of a measurable function) 168.B supremum norm 168.B supremum theorem, Hardy-Littlewood App. A, Table 8 sure event 342.B surely, almost 342.B,0 surface(s) 11l.A 410, App. A, Table 4.1 Abelian IS.H abstract Riemann 367.A affine minimal 110.C alge braic 15 basic (of a covering surface) 367.B with boundary 41O.B branched minimal 275.B center 11l.I characteristic 320.B circular cylindrical 350.B closed 410.B closed (in a 3-dimensional Euclidean space) 111.1 closed convex 11l.I conical 111.1 of constant curvature 11l.I covering 367.B covering, Ahlfors theory of 367.B covering, with relative boundary 367.B cylindrical 11l.I deformation of IIO.A degenerate quadric 350.B developable 111.1, App. A, Table 4.1 Oini 11l.I elliptic 72.K elliptic cylindrical 350.B energy 126.L 402.C,0 Enneper 275.B Enriques 72.K enveloping 111.1 equipotential 193.1 Frechet 246.1 fundamental theorem of the theory of 111.0 fundamental theorem of the topology of 41O.B G- 178.H of general type 72.K geometry on a 111.0 helicoidal 111.1 Hilbert modular 15.H Hirzebruch 15.0 Hopf 72.K hyperbolic cylindrical 3S0.B 
Subject Index Surface area of unit hypersphere hyperelliptic 72.K initial 321.A K3 15.H 72.K Kummer 15.H level 193.1 marked K3 n.K minimal 111.1 334.B niveau 193.1 one-sided 41O.B open 41O.B parabolic cylindrical 350.B proper quadric 350.B quadric 350.A quadric conical 350.B rational 15.E rectifying 111.1 response 102.L response, design for exploring 102.M of revolution 111.1 Riemann  Riemann surface ruled 15.E ruled (in differential geometry) 11l.I of the second class 350.0 of the second order 350.A Scherk's 275.A Seifert 235.A skew 111.1 special 110.A tangent 11l.F two-sided 410.8 unbounded covering 367.B unirational 15.H universal covering 367.B unramified covering 367.B Veronese 275.F W- 111.1 Weingarten (W) 111.1 surface area of unit hypersphere App. A, Table 9. V surface element 324.B surface harmonics 393.A surface integral 94.A,E (with respect to a surface element) 94.E surface wave 446 surgery 114.F,J surgery obstruction 114.1 surjection 381.C (in a category) 52.0 canonical (on direct products of groups) 190.L canonical (to a factor group) 190.0 canonical (onto a quotient set) 135.B natural (to a factor group) 190.0 surjective mapping 381.C survival insurance 214.B susceptibility electric 130.B magnetic 130.B suspension (of a discrete dynamical system) 126.C (of a homotopy class) 202.Q (of a map) 202.F (of a space) 202.E,F n-fold reduced 202.F reduced (of a topological space) 202.F Suslin K- 22.H schema of 22.B system of 22.B Suslin hypothesis 33.F Suslin space 22.1 425.CC Suslin theorem 22.C 2124 suspension isomorphism (on singular (co)homology groups) 201.E suspension theorem, generalized 202.T Suzuki group IS1.I SVO (singular value decomposition) 302.E sweep (a bounded domain) 384.F sweepable (bounded domain) 384.F sweeping-out principle 338.L sweeping-out process 338.L sweep out (a measure to a compact set) 338.L Swinnerton-Oyer conjecture, Birch- 118.0450.S Sylow subgroup 151.B p- 151.B Sylow theorem 151.B Sylvester elimination method 369.E Sylvester law of inertia (on a quadratic form) 348.B Sylvester theorem (on determinants) J 03.F Symanzik equation, Callan- 361.B symbol 369.A (of a Fourier integral operator) 274.C (of a pseudodilIerential operator) 251.0 345.B (= Steinberg symbol) 237.1 (of a vector field) 105.M Artin 14.K ChristolIei 80.L 11l.H 417.0, App. A, Table 4.11 function 411.H Oauss 83.A Hilbert a- 41l.J Hilbert-Hasse norm-residue 14.R Hilbert norm-residue 14.R individual 411.H Jacobi 297.1 Jacobi, complementary law of 297.1 Jacobi, law of quadratic reciprocity of 297.1 Kronecker 347.0 Landau (0,0) 87.0 Legendre 297.H Legendre, first complementary law of 297.1 Legendre, law of quadratic reciprocity of 297.1 Legendre, second complementary law of 297.1 logical 411.B 9j- 353.C norm-residue 14.Q 257.F power-residue 14.N predicate 411.H principal (of a dilIerential operator) 237.H principal (of a microdifferential operator) 274.F principal (of a simple holonomic system) 274.H 6j- 353.B Steinberg (in algebraic K -theory) 237.1 3j- 353.B symbolic logic 411 symbol sequence (in microlocal analysis) 274.F symmetric (block design) 102.E (factorial experiment) 102.H (Fock space) 377.A (member of a uniformity) 436.B (multilinear mapping) 256.H (relation) 358.A (tensor) 256.N symmetric algebra 29.H symmetric bilinear form (associated with a quadratic form) 348.A symmetric bounded domain 412.F irreducible 412.F symmetric Cauchy process 5.F 
2125 symmetric difference 304.E symmetric distribution function 341.H symmetric event 342.0 symmetric function 337.1 elementary 337.1 symmetric group 190.B of degree 11 151.0 symmetric Hermitian space 412.E irreducible 412.E symmetric homogeneous space 412.B symmetric hyperbolic system (of partial differential equations) 325.0 symmetric kernel 217.0335.0 symmetric law (in an equivalence relation) 135.A symmetric Markov process 261.C symmetric matrix 269.B anti- 269.B skew- 269.B symmetric multilinear form 256.H anti- 256 H skew- 256.H symmetric multilinear mapping 256.H anti- 256.H skew- 256.H symmetric multiplication 406.C symmetric operator 251.E symmetric points (with respect to a circle) 74.E symmetric polynomial 337.1 elementary 337.1 fundamental theorem on 337.1 symmetric positive system (of differential operators) 112.S (of first-order linear partial differential equa- tions) 326.0 symmetric product (of a topological space) 70.F symmetric Riemannian homogeneous space 412.B symmetric Riemannian space(s) 412 globally 412.A irreducible 412.C, App. A, Table 5.1II locally 412.A, App. A, Table 4.II weakly 412.J symmetric space 412.A affine 80.1 affine locally 80.J locally 80.J 364.0 symmetric stable process 5.F symmetric tensor 256.N anti- 256.N contravariant 256.N covariant 256.N skew- 256.N symmetric tensor field 105.0 symmetrization (in isoperimetric problem) 228.B Steiner (in isoperimetric problem) 228.B symmetrizer 256.N Young 362.H symmetry (at a point of a Riemannian space) 412.A (of a principal space) 139.B (in quantum mechanics) 415.H broken 132.C central (of an affine space) 139.B charge 415.1 crossing 132.C 386.B degree of 431.0 hyperplanar (of an affine space) 139.B internal 150.B law of (for the Hilbert norm-residue symbol) 14.R Subject Index System Nelson 150.F TCP 386.B symmetry group, color 92.0 symmorphic space group 92.B symmorphous space group 92.B symplectic form 126.L symplectic group 60.L 151.I complex 60.L infinite 202.V over a field 60.L over a noncommutative field 60.0 projective (over a field) 60.L unitary 60.L symplectic manifold 219.C symplectic matrix 60.L symplectic structure 219.C symplectic transformation 60.L (over a noncommutative field) 60.0 symplectic transformation group (over a field) 60.L synchronous (system of circuits) 75.B syndrome 63.C synthesis (in the theory of networks) 282.C spectral 36.L synthetic geometry 181 system adjoint (of a complete linear system on an algebraic surface) 15.0 adjoint, of differential equations 252.K algebraic 409.B algebraic, in the wider sense 409.B ample linear 16.N asynchronous (of circuits) 75.B axiom 35 axiom (of a structure) 409.B axiom (of a theory) 411.I of axioms 35.B base for the neighborhood 425.E categorical (of axioms) 35.B C*-dynamical 36.K character (of a genus of a quadratic field) 347.F characteristic linear (of an algebraic family) 15.F Chebyshev (of functions) 336.B classical dynamical 126.L 136.0 of closed sets 425.B complete (of axioms) 35.B complete (of independent linear partial differen- tial equations) 324.C complete (of inhomogeneous partial differential equations) 428.C complete (of nonlinear partial differential equations) 428.C complete linear (on an algebraic curve) 9.C complete linear (on an algebraic variety) 16.N complete linear, defined by a divisor 16.N completely integrable (of independent I-forms) 428.0 complete orthogonal 217.0 complete orthonormal 217.0 complete orthonormal, of fundamental func- tions 217.0 complete residue, modulo m 297.0 continuous dynamical 126.B coordinate 90.A coordinate (of a line in a projective space) 343.C crystal 92.B determined (of differential operators) 112.R 
Subject Index System determined (of partial differen tial equations) 320.F differentiable dynamical 126.B differential 191.1 of differential equations of Maurer-Cartan 249.R of differential operators 112.R direct (of sets) 210.B discrete dynamical 126.B distinct, of parameters 284.0 dynamical 126 of equations 10.A equilibrium, transformation to 82.0 formal 156.0411.1 of functional differential equations 163.E fundamental (of eigenfunctions to an eigenvalue for an integral equation) 217.F fundamental (for a linear difference equation) 104.0 fundamental (of a root system) 13.1 fundamental, of irreducible representations (of a complex semisimple Lie algebra) 248.W fundamental, of neighborhoods 425.E fundamental, of solutions (of a homogeneous linear ordinary differential equation) 252.B fundamental, of solutions (of a homogeneous system of first-order linear differential equa- tions) 252.H fundamental root (of a semisimple Lie algebra) 248.N of fundamental solutions (of a system oflinear homogeneous equations) 269.M Oarnier 253.E of generators (of a A-module) 277.0 of gravitational units 414.B group 235.B Haar, of orthogonal functions 317.C Hamiltonian 126.L holonomic 274.H holonomic, with regular singularities 274.H homotopy equivalent (of topological spaces) 202.F of hyperbolic differential equations (in the sense of Petrovskii) 325.0 incompatible (of partial differential equations) 428.B inconsistent (of algebraic equations) 10.A indeterminate (of algebraic equations) IO.A inductive (in a category) 210.0 inductive (of sets) 21O.B inductive, of groups 21O.C inductive, of topological spaces 21O.C inertial 271.0 359 information retrieval 96.F integrable 287.A international, of units 414.A inverse (of sets) 21O.B involutive  involutory involutory (of differential forms) 428.F involutory (of nonlinear equations) 428.C involutory (of partial differential equations) 428.F involutory (of partial differential equations of first order) 324.0 irreducible linear 16.N lattice spin 402.0 linear (on an algebraic variety) 16.N of linear differential equations of first order 252.0 linear dynamical 86.B 2126 oflinear equations 269.M oflinear homogeneous equations 269.M linear structural equation 128.C linear time-varying 86.B local, of groups (over a topological space) 201.R local coordinate (of a manifold) 105.C local coordinate (of a topological space) 90.0 local coordinate, holomorphic 72.A mathematical (for a structure) 409.B maximal independent (of an additive group) 2.E neighborhood 425.B of notations (for ordinal numbers) 81.B null (in projective geometry) 343.0 number, point range of (in projective geometry) 343.C of open sets 425.B operating 75.C of ordinary differential equations 313.B orthogonal (of functions) 317.A orthogonal (of a Hilbert space) 197.C of orthogonal functions App. A, Table 20 of orthogonal polynomials 317.0 overdetermined (of differential operators) 112.R overdetermined (of partial differential equa- tions) 320.F of parameters 284.0 partial de Rham 274.0 of partial differential equations of order 1 (on a differentiable manifold) 428.F passive orthonomic (of partial differential equations) 428.B peripheral 235.B of Pfaffian equations 428.A polar (in projective geometry) 3.J.o Postnikov (of a CW complex) 1.8.0 projective (in a category) 210.0 projecti ve (of sets) 210. B projective, of groups 210.C projective, of topological groups 423.K projective, of topological spaces 210.C quotient (of an algebraic system) 409.C Rademacher, of orthogonal functions 317.C reduced residue 297.0 reducible linear 16.N regular, of parameters 284.0 of resultants 369.E root (of a symmetric Riemann space) 413.F root (in a vector space over Q) 13.1 self-adjoint, of differential equatio ns 252.K simple holonomic 274.H of simultaneous differential equations App. A, Table 14.1 ofSuslin 22.B symmetric positive (of differential operators) 112.S symmetric positive (of first-order linear partial differential equations) 326.0 Tits 13.R 151.J 343.1 of total differential equations 42,s.A of transitivity (of a G-set) 362.B trigonometric 159.A two-bin 227 underdetermined (of differential operators) 112.R underdetermined (of partial differential equa- tions) 320.F uniform covering 436.0 
2127 uniform family of neighborhoods 436.0 uniform neighborhood 436.0 unisolvent (of functions) 336.B of units 414.A very ample linear 16.N Walsh, of orthogonal functions 317.C system process 405.F system simulation 385.A syzygy 369.F first 369.F rth 369.F syzygy theorem, Hilbert 369.F syzygy theory 200.K 369.F Szego kernel function 188.H Szemeredi theorem, ergodic 136.C Sz.-Nagy-Foia model 251.N T !-function 150.0 t-distribution 341.0 374.B, App. A, Table 22 noncentral 374.B t-statistic 374.B t-test 4oo.G T-bound 331.B T-bounded 331.B T-compact 68.F 331.B T-operator 375.C T2-statistic Hotelling's 280.B noncentral Hotel1ing 374.C T-number (transcendental number) 430.C T*-number (transcendental number) 430.C To-space 425.Q T, -space 425.Q T z-space 425.Q T 3 -space 425.Q T 4 -space 425.Q Ts-space 425.Q T 6 -space 425.Q T-positivity l50.F T -set 308.1 T-summable (series) 379.L Tl -uniformity 436.C Tl -uniform space 436.C T 2 -topological group 423.B table analysis-of-variance 400.H contingency 397.K 400.K difference 223.C k-way contingency 397.K Pade 142.E tableau, simplex 255.C table look-up 96.C tail event 342.G tail a-algebra 342.G Tait algorithm 157.C Tait coloring 157.C Takagi, Teiji 415 Takesaki, duality theorem of 308.1 Takesaki theory, Tomita- 308.H Tamagawa number (of an algebraic group) 13.P Tamagawa zeta function 450.L Tamano product theorem 425X tame (knot) 235.A tan (= tangent) 13l.E tan-' 13l.E Tanaka embedding 384.0 tangent 432.A (of pressure) 402.G Subject Index Teichmiiller space asymptotic IIO.B Oarboux 11O.B hyperbolic 131.F law of App. A, Table 2.m tangent bundle (of a Banach manifold) 286.K (of a differentiable manifold) 105.H 147.F (of a foliation) 154.B tangent hyperplane (of a quadric hypersurface) 343.E tangential polar coordinates 90.C tangential sphere bundle 274.E tangential stress 271.0 tangent line 93.G 1l1.C,F, App. A, Table 4.1 oriented 76.B tangent orthogonal n-frame bundle 364.A tangent plane II1.H, App. A, Table 4.1 tangent PL micro bundle 147.P tangent r-frame(s) 105.H bundle of 105.H tangent r-frame bundle 147.F tangent space 105.H tangent sphere bundle, unit 126.L tangent surface 1l1.F tangent vector 105.H holomorphic 72.A of type (0, 1) n.C of type (1,0) 72.C tangent vector bundle 105.H 147.F tangent vector space 105.H tanh (hyperbolic tangent) 13l.F Taniyama- Weil conjecture 450.S Tannaka duality theorem 69.0 249.U target(of a jet) 105.X target variable 264 Tate cohomology 2oo.N Tate conjecture 450.S Tate-Shafarevich group 118.0 Tate theorem 59.H Tauberian theorem 121.0 339.B generalized 36.L 160.G generalized, of Wiener 192.0 Tauberian type, theorem of 339.B Tauber theorem 339.B tautochrone 93.H tautological line bundle 16.E tautology 411.E Taylor expansion (of an analytic function of many variables) 21.B (of a holomorphic function of one variable) 339.A formal 58.C Taylor expansion and remainder App. A, Table 9.IV Taylor formula App. A, Table 9.1V (for a function of many variables) 106.1 (for a function of one variable) 106.E Taylor series 339.A Taylor theorem (in a Banach space) 286.F TCP invariance 386.B TCP operator 150.0 TCP symmetry 386.B TCP theorem 386.B technique, program evaluation and review 376 TE wave 130.B Teichmuller mapping 352.C Teichmuller metric 416 Teichmuller space 416 universal 416 
Subject Index Telegraph equation telegraph equation 325.A, App. A, Table 15.III temperature (of states) 419.A absolute 419.A tempered distribution 125.N temporally homogeneous (additive process) 5.B (Markov process) 261.A TEM wave 130.B tension 281.B modulus of elasticity in 271.G tension field 195.B tensor alternating 256.N angular momentum 258.0 anti symmetric 256.N conformal curvature App. A, Table 4.II contracted 256.L contravariant, of degree p 256.1 correlation 433.C covarian t, of degree q 256.1 curvature 80.1,L 364.0 417.B energy-momentum 150.B 359.0 energy spectrum 433.C fundamental (of a Finsler space) 152.C fundamental (of a Riemannian manifold) 364.A Green 188.E irreducible, of rank k 353.C Maxwell stress 130.A mixed 256.1 Nijenhuis 72.B numerical App. A, Table 4.11 projective curvature App. A, Table 4.II Ricci 364.0, App. A, Table 4.II second fundamental 417.F skew-symmetric 256.N strain 271. G stress 150.B 271.G symmetric 256.N torsion App. A, Table 4.II torsion (of an affine connection) 80.1 417.B torsion (of an almost contact structure) IIO.E torsion (of a Frechet manifold) 286.L torsion (of a Riemannian connection) 80.L of type (p, q) 256.1 Weyl's conformal curvature 80.P tensor algebra (on a linear space) 256.K contravariant 256.K tensor bundle (of a differentiable manifold) 147.F tensor calculus 417, App. A, Table 4.II tensor field 105.0 of almost complex structure (induced by a complex structure) 72.B alternating 105.0 of class C' 105.0 contravariant, of order r 105.0 covariant, of order s 105.0 covariant derivative of (in the direction of a tangent vector) 80.1 left invariant (on a Lie group) 249.A parallel 364.B random 395.1 right invariant (on a Lie group) 249.A symmetric 105.0 of type (r, s) 105.0 of type (r, s) with value in E 417.E tensorial form 80.G tensorial p-form 417.C tensor product 2128 (of A-homomorphisms) 277.1 (of algebras) 29.A (of A-modules) 277.J (of chain complexes) 201.1 (of cochain complexes) 201.J (of distributions) 125.K (of Hilbert spaces) 308.C (of linear mappings) 256.1 (oflinear representations) 362.C (oflinear spaces) 256.1 (of locally convex spaces) 424.R (of sheaves) 383.1 (of vector bundles) 147.F (of von Neumann algebras) 308.C continuous 377.0 a 424.R projective C*- 36.H spatial 36.H tensor representation (of a general linear group) 256.M tensor space of degree k 256.J of type (p, q) 256.J term (of a language) 276.A (of a polynomial) 337.B (in predicate logic) 411.H (of a sequence) 165.0 (of a series) 379.A base (of a spectral sequence) 200.1 closed (of a language) 276.A constant (of a formal power series) 370.A constant (of a polynomial) 337.B error 403.0 fiber (of a spectral sequence) 200 J initial (of an infinite continued fraction) 83.A nth (of seq uence) 165.0 penalty 440.B subtraction I11.B undefined 35.B terminal decision rule 398.F terminal point (of a curvilinear integral) 94.0 (in a Markov process) 261.B (of a path) 170 (of a vector) 442.A terminal time 261.B terminal vertex 186.B term wise differentiable (infinite series with function terms) 379.H term wise differentiation, theorem of (on distribu- tions) 125.G term wise integrable (series) 216.B ternary set 79.0 Terry model, Bradley- 346.C Terry normal score test, Fisher-Yates- 371.C tertiary obstruction 305.0 tertium non datur 156.C tesseral harmonics 393.0 test 400.A Abel 379.0 almost invariant 4oo.E Cauchy condensation 379.B Cauchy integral 379.B chi-square 400.G chi-square, of goodness of fit 4oo.K comparison 379.B consistent 400.K Oini (on the convergence of Fourier series) 159.B 
2129 Oini-Lipschitz (on the convergence of Fourier series) 159.B Oirichlet (on Abel's partial summation) 379.0 Oirichlet (on the convergence of Fourier series) 159.B of du Bois-Reymond and Oedekind 379.0 duo-trio 346.0 F- 400.0 Fisher-Yates-Terry normal score 371.C goodness-of-fit 397.Q 401.E invariant 4oo.E Jordan (on the convergence of Fourier series) 159.B Kolmogorov 45.F Kolmogorov-Smirnov 371.F Kruskal-Wallis 371.0 Lebesgue (on the convergence of Fourier series) 159.B Leibniz (for convergence) 379.C level ()( 400.A likelihood ratio 400.1 Mann-Whitney v- 371.C minimax level ()( 400.F most powerful 400.A most stringent level ()( 400.F nonparametric 371.A nonrandomized 400.A outlier 397.Q pair 346.0 randomized 400.A sensory 346.B sequential 400.L sequential probability ratio 400.L sign 371.B signed rank 371.B similar 400.0 Student 400.0 t- 400.0 triangle 346.0 UMP unbiased level ()( 4OO.C UMP in variant level ()( 4oo.E unbiased level ()( 4OO.C uniformly consistent 4oo.K uniformly most powerful (UMP) 400.A uniformly most powerful invariant level ()( 4oo.E uniformly most powerful unbiased level ()( 4oo.C van der Waerden, 371.C Welch 400.0 Wiener (for Brownian motion) 45.0 Wiener (for Oirichlet problem) 338.0 Wiener (for random walk) 260.E Wilcoxon 371.C Wilcoxon signed rank 371.B test channel 213.E test function 4oo.A test function space 125.S testing hypothesis 401.C statistical hypothesis 400 test statistics, Kolmogorov-Smirnov 374.E test sufficient (O'-field) 396.1 tetracyclic coordinates 90.B tetragamma function 174.B tetragonal (system) 92.E tetrahedral group 151.0 tetrahedron 7.0 357.B polar 350.C self-polar 350.C Subject Index Theory TE waves 130.B Theodorsen function 39.E theorem(s)  also specific theorems of angular momentum 271.E Brouwer's, on the in variance of domain 117.0 of coding 273.0- F on complete form 356.H of completeness (in geometry) 155.B cup product reduction 2oo.M fundamental  fundamental theorem(s) of identity 21.C invariance, of analytic relations 198.K on in variance of dimension of Euclidean spaces 117.0 kernel 125.L 424.S of linear ordering (in geometry) 155.B local limit 250.B of momentum 271.E product, for dimension 117.C of quasiconformal reflection 352.C structure, for von Neumann algebras of type III 308.1 of Tauberian type 339.B of term wise differentiation (of distributions) 125.0 translation (in class field theory) 59.C translation representation 375.H transversality 105.L triangle comparison 178.A Tucker's, on complementary slackness 255.B unicursal graph (Euler's) 186.F Weierstrass's, of double series 379.H Theorem A 21.L 72.E,F Theorem B 21.L 72.E,F theoretical formula 19.F theory Ahlfors's, of covering surfaces 272.1 367.B of buildings 343.1 of calculus of variations, classical 46.C Cantor's, of real numbers 294.E class field 59 classification, of Riemann surfaces 367.E combinatorial 66.A complete cohomology 2oo.N constructive field 150.F Oedekind's, of real numbers 294.E de Rham homotopy 114.L dimension 117 of elasticity 271.0 of electromagnetic waves 130.B of errors 138.A Euclidean field 150.F Euclidean Markov field 150.F exact sampling 401.F finite-displacement (of elasticity) 271.0 of functions 198.Q of functions of a complex variable 198.Q Oalois 172 Oalois, of differential fields 113 game 173 of gases, kinetic 402.B graph 186 Haag-Ruelle scattering 150.0 hidden variables 351.L hydromagnetic dynamo 259 information 213 Kaluza's 5-dimensional 434.C large sample 401.E lattice gauge 150.0 Littlewood-Paley 168.B 
Subject Index Thermal contact local class field 59.0 Lyusternik-Shnirel'man 286.Q Minkowski reduction (on fundamental regions) 122.E Morse 279 Morse, fundamental theorems of 279.0 Nevanlinna (of meromorphic functions) 124.B 272.B nonsymmetric unified field 434.C number, analytic 296.B number, elementary 297 number, geometric 296.B of perturbations, general 420.E of perturbations, special 420.E Peter-Weyl (on compact groups) 69.B Peter-Weyl (on compact Lie groups) 249.U Picard-Vessiot 113 of plasticity 271.0 prediction 395.0 prediction, linear 395.0 of probability 342.A proof 156.0 quantum field 150.C ramified type 411.K realization 86.0 of relativity, general 359.A of relativity, special 359.A risk 214.C risk, classical 214.C risk, collective 214.C risk, individual 214.C Serre «j- 202.N set 381.F (also set theory) of singularities 418 slender body 205.B small-displacement, of elasticity 271.0 S-matrix 386.C supermultiplet, Wigner's 351.J syzygy 200.K thin wing 205.B T omit a- Takesaki 308.H type 411.K unified field 434.A unitary field 434.C therrnalcontact 419.A thermal expansion, coefficient of 419.A thermal noise 402.K thermodynamical quantity 419.A extensive 419.A intensive 419.A thermodynamic limit 402.0 thermodynamics 419 first law of 419.A second law of 419.A statistical 402.A third law of 419.A Oth law of 419.A theta formula (on ideles) 6.F theta- Fuchsian series of Poincare 32.B theta function 134.1 (on a complex torus) 3.1 elliptic 134.1, App. A, Table 16.II graded ring of 3.N Jacobian 134.C nondegenerate 3.1 Riemann 3.L theta series 348.L thick (chamber complex) 13.R thickness (of an oval) 89.C thin (chamber complex) 13.R 2130 thin set (in Markov processes) 261.0 (in potential theory) 338.0 analytically (in an analytic space) 23.0 internally 338.0 thin wing theory 205.B third boundary value problem 193.F 323.F third classification theorem (in the theory of obstruc- tions) 305.C third extension theorem (in the theory of obstruc- tions) 305.C third fundamental form App. A, Table 4.1 third homotopy theorem (in the theory of obstruc- tions) 305.C third isomorphism theorem (on topological groups) 423.1 third kind Abelian differential of 11.C Abelian integral of 11.C third law of thermodynamics 419.A third-order predicate logic 411.K third quartiles 396.C third separation axiom 425.Q Thom algebra 114.H Thom complex 114.0 associated with (G,n) 114.0 Thom first isotropy theorem 418.0 Thom fundamental theorem 114.H Thom-Oysin isomorphism 114.0 (on a fiber space) 148.E Thompson inequality, Oolden- 212.B Thompson theorem, Feit- (on finite groups) 151.0 Thom space 114.0 Thom spectrum, stable homotopy group of 114.0 202.T Thorin theorem, Riesz- 224.A thorn (of a convergence domain) 21.B three big problems 187 three-body problem 420.A restricted 420.F three-circle theorem, Hadamard 43.E 3j-symbol 353.B three laws of motion, Newton's 271.A three-line theorem, Ooetsch 43.E three-stage least squares method 128.12 three-valued logic 411.L three principles, Fisher's 102.A three-series theorem 342.0 threshold Jacobi method 298.B Thue problem 31.B Thue theorem 118.0 Thullen theorem, Cartan- 21.H Thurstone-Mosteller model 346.C tieset 186.0 tieset matrix, fundamental 186.0 Tietze extension theorem 425.Q Tietze first axiom 425.Q Tietze second axiom 425.Q tight family (of probability measures) 341.F tight immersion 365.0 tightness 399.M Tikhonov embedding theorem 425.T Tikhonov fixed-point theorem 153.0 Tikhonov product theorem 425.S Tikhonov separation axiom 425.Q Tikhonov space 425.Q Tikhonov theorem 425.Q Uryson- (on metrizability) 273.K Tikhonov-Uryson theorem 425.Q time 
2131 exit 261.B explosion 406.0 first splitting 44.E hitting 260.B 261.B 407.B killing 260.A life 260.A 261.B local 45.0 Markov 261.B 407.B NP- 71.E polynomial 71.E proper 258.A real- 19.E recurrence 260.C sojourn, density 45.0 stopping 261.B 407.B terminal 261.B waiting 260.H waiting, distribution 307.C time change (of a Markov process) 261.B (of a semimartingale) 406.B (of a submartingale) 262.C time complexity 71.A time-dependent Schr6dinger equation 351.0 time-homogeneous Markovian type 406.0 time-independent Markovian type 406.0 time-independent Schrodinger equation 351.0 time-invariant, linear (dynamical systems) 86.B time-invariant network 282.C timelike (curve) 325.A (vector of a Minkowski space) 258.A 359.B time-one mapping (map) 126.C time optimal control problem 86.F time ordered function 150.0 time parameter (of a stochastic process) 407.A time parameter space 260.A time reversal 258.A 359.B time series 397.A 421.A time series analysis 421 time series data 397.N time-varying system, linear 86.B Tissot-Pochhammer differential equation 206.C Titchmarsh-Kodaira theory, Weyl- Stone- 112.0 Titchmarsh theorem 306.B Brun- 123.0 Tits simple group 151.I Tits system 13.R 151.J 343.1 TM waves 130.B Toda bracket 202.R Toda lattice 287.A 387.A Todd characteristic 366.B Todd class (of a complex vector bundle) 237.F Toeplitz operator 251.0 Toeplitz theorem 379.L tolerance interval 399.R tolerance limits 399.R tolerance percent defective, lot 404.C tolerance region 399.R Tomita- Takesaki theory 308.H Tonelli absolutely continuous in the sense of 246.C bounded variation in the sense of 246.C topological Abelian group(s) 422 dual 422.C elementary 422.E topological conjugacy 126.B topological entropy 126.K 136.H topological equivalence 126.B topological field 423.P Subject Index Topology topological generator (of a compact Abelian group) 136.0 topological group(s) 423 completable 423.H complete 423.H Hausdorff 423.B homomorphic 423.1 isomorphic 423.A locally isomorphic 423.0 metrizable 423.1 separated 423.B Tz- 423.B topological groupoid 154.C topological index (of an elliptic complex) 237.H topological invariance (of homology groups) 201.A topological invariant 425.0 topological linear spaces 424 topologically complete space 436.1 topologically conjugate 126.B topologically equivalent 126.B,H topological manifold 105.B with boundary 105.B without boundary 105.B topological mapping 425.0 topological n-cell 140 topological pair 201.L topological polyhedron 65.A topological pressure 136.H topological property 425.0 topological ring 423.P topologicalO'-algebra 270.C topological solid sphere 140 topological space(s) 425 category of 52.B category of pointed 202.B complex linear 424.A discrete 425.C generalized 425.0 homotopy category of 52.B inductive system of 21O.C linear 424.A metrizable 273.K product 425.K projective system of 210.C quotient 425.L real linear 424.A underlying (of a complex manifold) 72.A underlying (of a differentiable manifold) 105.0 underlying (of a topological group) 423.A uniform 436.C uniformizable 436.H topological sphere 140 topological structure 425.A,B topOlogical sum 425.M topological transformation group 431.A topological vector space 424.A topology 425.B 426 a-adic (of an R-module) 284.B algebraic 426 base for a 425.F of biequicontinuous convergence 424.R box 425.K coarser 425.H combinatorial 426 compact-open 279.C 435.0 compact-open Coo 279.C differential 114 discrete 425.C etale 16.AA fine (on a class of measures) 261.0 338.E 
Subject Index Topology a finer 425.H Oel'fand 36.E general 426 Orothendieck 16.AA hereditarily weak 425.M hull-kernel 36.0 J-adiclofaring) 16.X indiscrete 425.C induced 425.1 induced by a mapping 425.1 inner (of a Lie subgroup) 249.E Jacobson 36.0 Krull (for an infinite Oalois group) 172.1 larger 425.H leaf 154.0 of Lie groups and homogeneous spaces 427 linear 422.L Mackey 424.N metric 425.C order 425.C PL 65.A product 425.K projective 424.R quotient 425.L relative 425.1 S- (on a linear space) 424.K Sazonov 341.J set-theoretic 426 O'-weak 308.B smaller 425.H strong (on a class of measures) 338.E strong (on a direct product space) 425.K strong (on a normed space) 37.E strong (on a topological linear space) 424.K stronger 425.H strong operator 251.C subbase for a 425.F of surfaces, fundamental theorem of 410.B trivial 425.C uniform 436.C of uniform convergence 424.K of the uniformity 436.C uniformizable 436.H uniform operator 251.C vague (on a class of measures) 338.E weak (in a cell complex) 70.0 weak (on a class of measures) 338.E weak (on a direct product space) 425.K weak (on a direct sum) 425.M weak (on a locally convex space) 424.H weak (on a normed linear space) 37.E weak, relative to the pairing <E, F) 424.H weak Coo 279.C weaker 425.H weak operator 25J.C weak* (on a locally convex space) 424.H weak* (on a normed space) 37.E Zariski (of a spectrum) 16.0 Zariski (of a variety) 16.A topology a (on the tensor product of locally convex spaces) 424.R topology n (on the tensor product of locally convex spaces) 424.R Tor 200.0 exact sequence of 200.0 Tor:(A, B) 2oo.K Tor1(M, N) 200.0 Torelli theorem 9.E,J l1.C Tor groups 200.0 toric variety 16.Z 2132 toroidal coordinates App. A, Table 3.V toroidal embedding 16.Z toroidal subgroup, maximal (of a compact Lie group) 248,X torsion (ofacurveofclassC") 111.0 affine 110.C analytic 391.M conformal 110.0 radius of (of a space curve) 111.F Whitehead 65.C torsion A-module 277.0 torsion Abelian group 2.A bounded 2.F torsion coefficients (of a complex) 201.B torsion element (of an A-module) 277.0 torsion form 80.H torsion-free A-module 277.0 torsion-free Abelian group 2.A torsion group 2.A (of a complex) 201.B torsion product (in a category) 2oo.K (of A-modules) 200.0 torsion subgroup (of an Abelian group) 2.C maximal 2.A torsion tensor App. A, Table 4.11 (of an affine connection) 80.1,L 417.B (of an almost contact structure) ] 10.E (of a Frechet manifold) 286.L torus (algebraic group) 13.0 (compact group) 422.E (surface) 111.1 41O.B algebraic 13.0 Clifford 275.F complex 3.H generalized Clifford 275.F invariant 126.L K-split 13.0 K-trivial 13.0 maximal (of a compact Lie group) 248.X maximal K-split 13.Q n- 422.E torus embedding 16.Z torus function App. A, Table 18.m torus group 422.E total (set offunctions) 317.A total boundary operator 2oo.E total Chern class 56.C total cross section 386.B total curvature (of an immersion) 365.0 (of a space curve) 111.F (of a surface) II1.H, App. A, Table 4.1 Oaussian 111.H total degree 200.1 total differential (of a complex) 2oo.H (of a function) 106.0 (on a Riemann surface) 367.H total differential equation(s) 428, App. A, Table 15.1 system of 428.A total elastic cross section 386.B total energy 27l.C total excess 178.H total Oaussian curvature (of a surface) 11 J.H total isotropy, index of (with respect to a quadratic form) 348.E 
2133 totally bounded (metric space) 273.B (subset of a metric space) 273.B (subset of a uniform space) 436.H (uniform space) 436.H locally 436.H totally definite quaternion algebra 27.0 totally degenerate (group) 234.B totally differentiable (complex function) 21.C (real function) 106.0 totally disconnected (topological space) 79.0 totally geodesic submanifold 365.0 totally imaginary field 14.F totally isotropic (subspace) 60.0 348.E totally ordered additive group 439.B totally ordered group 243.0 totally ordered set 311.A totally positive (element) 14.0 totally real field 14.F totally real immersion 365.M totally real sub manifold 365.M totally regular transformation (of a sequence) 379.L totally singular (subspace) (with respect to a quadratic form) 348.E totally umbilical submanifold 365.0 totally unimodular 186.0 total matrix algebra 269.B total mean curvature 365.0 total monodromy group 418.F total ordering 311.A total Pontryagin class 56.0 total quotients, ring of 67.0 total space (of a fiber bundle) 147.B (of a fiber space) 148.B total Stiefel-Whitney class 56.B total transform (of a subvariety) 16.1 total variation (of a function) 166.B (of a mapping from a plane into a plane) 246.H (of a set function) 380.B (of a vector measure) 443.0 tower, class field (problem) 59.F trace (of an algebraic element) 149.J (of an element of a general Cayley algebra) 54 (of a linear system of an algebraic surface) 15.C (of a matrix) 269.F (of a nuclear operator) 68.L (in a von Neumann algebra) 308.0 reduced (of an algebra) 362.E trace class 68.1 traced, {3- 126.1 trace form, f;- 60.0 trace formula (on unitary representations) 437.00 trace norm 68.1 trace operator 168.B tracing property, pseudo-orbit 126.J tractrix 93.H traffic intensity 260.H trajectory 126.B,C negative half- 126.0 orthogonal 193.1 positive half- 126.0 transcendence basis (of a field extension) 149.K Subject Index Transformation( s) transcendence basis, separating (of field extension) 149.K transcendence degree (of a field extension) 149.K transcendency, degree of (of a field extension) 149.K transcendental curve 93.H transcendental element (of a field) 149.E transcendental entire functions 429 transcendental extension 149.E purely 149.K transcendental function higher 389.A of Painleve 288.C transcendental meromorphic function 272.A transcendental numbers 430 transcendental singularity (of an analytic function in the wider sense), direct, indirect 198 P transfer (in group theory) 190.0 transfer function 86.D transfer function matrix 86.B transferrer of constant lengths 155.0 transfinite cardinal number 49.A transfinite diameter 48.0 transfinite induction (in a well-ordered set) 311.C definition by 311.C transfinite initial ordinal number 49.E transfinite logical choice function 411.J transfinite ordinal number 312.B transform (of a sequence by a linear transformation) 379.L Cauchy (of a measure) 164.J Cayley (of a closed symmetric operator) 251.1 discrete Fourier 142.0 fast Fourier 142.0 Fourier 160, App. A, Table I1.II Fourier (of a distribution) 125.0 Fourier (in topological Abelian groups) 36.L 192.1 Fourier, generalized 220.B Fourier, invcrse (of a distribution) 125.0 Fourier, spherical 437.Z Fourier-Bessel 39.0 Fourier cosine 160.C, App. A, Table 1 1. II Fourier-Laplace 192.F Fourier sine 160.C, App. A, Table I1.II Fourier-Stieltjes 192.B,0 Oel'fand 36.E Hankel 220.B Hilbert 160.0 220.E integral 220 inverse (of an integral transform) 220.A Klein 150.0 kth 160.F Laplace 240, App. A, Table 12.1 Laplace-StieItjes 240.A Legendre 419.C Mellin 220.C proper (of a subvariety) 16.1 Radon 218.F Radon, conjugate 218.F real monoidal 274.E Riesz 251.0 StieItjes 220.0 total (of a subvariety) 16.1 Watson 160.C 220.B transformable, polynomially 71.E transformation(s) 381.C (on a measure space) 136.0 
Subject Index Transformation formula affine 7.E affine (of a manifold with an affine connection) 80.1 affine (of a Riemannian manifold) 364.F affine, group of 7.E affine, proper 7.E affine, regular 7.E Ampere 82.A angular 374.0 arc sine 374.0 bimeasurable (on a measure space) 136.B birational 16.1 BRS 150.0 canonical 82.B 271.F canonical, group of 27l.F Cayley (of a matrix) 269.1 chain (between complexes) 2oo.H conformal 80.P 364.F congruent (in Euclidean geometry) 139.B congruent, group of 285.C contact 82, App. A, Table 15.1V coordinate (of a fiber bundle) 147.B coordinate (of a locally free &x-Module) 16.E covering 91.A 367.B Cremona 16.1 by drift 261.F of drift 406.B elliptic 74.F entire linear 74.E to an equilibrium system 82.0 equilong 76.B ergodic (on a measure space) 136.B Euler (of infinite series) 379.1 factor (of a measure preserving transformation) 136.0 Fisher z- 374.0 Oalilei 359.C gauge (in electromagnetism) 130.A gauge (in a lattice spin system) 402.0 gauge (of a principal fiber bundle) 80.Q gauge (in unified field theory) 343.B gauge, of the first kind 150.B Oauss App. A, Table 16.m Oivens 302.E Householder 302.E hyperbolic 74.F infinitesimal (of a Lie transformation group) 431.0 infinitesimal (of a one-parameter group of transformations) 105.N inner (in the sense of Stoi:low) 367.B Jacobi imaginary 134.1, App. A, Table 16.111 Kelvin 193.B Laguerre 76.B Landen 134.B, App. A, Table 16.m Legendre 82.A, App. A, Table 15.1V Lie (in circle geomtry) 76.C Lie line-sphere 76.C linear (= linear fractional) 74.E linear (of a linear space) 251.A 256.B linear (of a sequence) 379.L linear fractional 74.E local, local Lie group of 431.0 local, local one-parameter group of 105.N of local coordinates 90.0 locally quadratic (of an algebraic surface) 15.0 locally quadratic (of an algebraic variety) 16.K 10caIly quadratic (of a complex manifold) 72.H Lorentz 359.B 2134 loxodromic 74.F measurable (on a measure space) 136.B measure-preserving 136.B Mobius 74.E 76.A monoidal (of an analytic space) 23.0 monoidal (of a complex manifold) 72.H monoidal (by an ideal sheaf) 16.K monoidal, with center W 16.K by a multiplicative functional (in a Markov process) 261.F natural 52.1 nonsingular (of a linear space) 256.B nonsingular (on a measure space) 136.B normal (of a sequence) 379.L one-parameter group of 105.N one-parameter group of class C' [26.B orthogonal 139.B 348.B orthogonal (over a noncommutative field) 60.0 orthogonal (with respect to a quadratic form) 60.K orthogonal, around the subs pace A k 139.B parabolic 74.F of the parameter 111.0 parity 359.B particular (of I):) 248.R Picard-Lefschetz 16.U to principal axes 390.B projective 343.0 projective (of a Riemannian manifold) 367.F projective, group of 343.0 pseudoconformal 344.A pseudogroup of (on a topological space) 90.0 quadratic 16.1,K quantized contact 274.F regular (of a linear space) 256.B regular (of a sequence) 379.L regular projective 343.0 Schwarz-Christoffel 77.0 semilinear 256.P semi regular (of a sequence) 379.L shift 136.0 singular projective 343.0 singular projective, of the hth species 343.0 superharmonic 261.F symplectic 60.L symplectic (over a noncummutativ<: field) 60.0 totally regular (of a sequence) 379.L triangular (linear) 379.L unitary 348.F unitary (relative to an a-Hermitian form) 60.0 weaklyequilvalent 136.F transformation lormulil (for the generating function of the number of partitions) 328 (of a theta function) 3.1 (for theta series) 348. L Schwarz-Christoffel 77.0 transformation group(s) 431, App. A, Table 14.m (of a set) 431.A covering 91.A differentiable 431.C discontinuous 122.A free discontinuous 122.A Lie 431.C Mobius 76.A orthogonal 60.1 orthogonal (over a field with respect to a quadratic form) 60.K 
2135 properly discontinuous 122.A symplectic 60.L topological 431.A unitary 60.F transformation parameter 396.1 transformation problem (in a finitely presented group) 161.B transformation space (of an algebraic group) 13.0 transgression (homomorphism of cohomology groups) 2oo.M (in the spectral sequence of a fiber space) 148.E transgressive (element in the spectral sequence of a fiber space) 148.E transient (Levy process) 5.0 (Markov chain) 260.B (Markov process) 261.B transient problem 322.0 transition order-disorder 402.F phase 340.B transition function (of a fiber bundle) 147.B (of a Markov chain) 260.A (ofa Markov process) 261.B Feller 261.B transitIOn matrix 126.1 260.A transition point 254.F transition probability (of a diffusion process) 115.B (of a Markov chain) 260.A (ofa Markov process) 261.A (in quantum mechanics) 351.B standard 260.F transitive (dynamical system) 126.1,1 (operation of a group) 362.B (permutation representation) 362.B (relation) 358.A fully (subgroup of an orthogonal group) 92.C k- (permutation group) 151.H k-ply (G-set) 362.B k-ply (permutation group) 151.H multiply (permutation group) 151.H simply (G-set) 362.B transitive extension (of a permutation group) 151.H transitive law (in an equivalence relation) 135.A (on ordering) 311.A transitively (act on G-space) 431.A transitive permutation group 151.H transitivity, system of (of a G-set) 362.B translation(s) (in an affine space) 7.E group of (of an affine space) 7.E left 249.A 362.B parallel 80.C 364.B right 249.A 362.B translational flow 126.L 136.0 frequencies of 126.L 136.0 translation group (of a Lorentz group) 258.A translation number 18.B,0 translation operator 306.C translation representation theorem 375.H translation theorem (in class field theory) 59.C transmission coefficient 387.0 transmission rate 213.A Subject Index Triangle transonic flow 205.B transonic similarity, von Karman 205.0 transportation problem 255.C transport coefficient 402.K transport equations 325.L transpose (of a linear mapping) 256.0 (of a rational homomorphism) 3.E transposed integral equation 217.F transposed mapping (of a diffusion kernel) 338.N (ofa linear mapping) 256.0 transposed matrix 269.B transposed operator 112.E 189.C 322.E transposed representation 362.E transposition (in a symmetric group) 151.0 transvection 60.0 transversal (matroid) 66.H transversal field 136.0 transversal flow 136.0 transversal homoclinic point 126.1 transversality, condition of (in calculus of variations) 46.B transversality condition 108.B strong 126.1 transversality theorem 105.L transverse (foliations) 154.H (to a submanifold of a differentiable manifold) 105.L to a foliation 154.B transverse axis (of a hyperbola) 78.C transverse electric waves 130.B transverse electromagnetic waves 130.B transverse invariant measure 154.H transversely (intersect) 105.L transversely orientable 154.B transverse magnetic waves 130.B transverse structure 154.H transverse wave 446 trap (of a diffusion process) 115.B (of a Markov process) 261.B trapezoidal rule (of numerical integration) 299.A (of numerical solution of ordinary differential equations) 303.E treatment 102.B connected 102.B treatment combinations, number of 102.L treatment contrast 102.C treatment effect 102.B tree 93.C 186.0 co- 186.0 derivation 31.E spanning 186.0 tree code 213.E tree representation 96.0 tree structure 96.0 trefoil knot 235.C trellis code 213.E trend 397.N triad 202.M homotopy exact sequence of 202.M homotopy group of 202.M trial path dependent, d- (response probability) 346.0 triangle 7.D 155.F 178.H geodesic 178.A Pascal 330 
Subject Index Triangle comparison theorem plane App. A, Table 2.II polar 78.1 Reuleaux 89.E 111.E self-polar 78.J solving a 432.A spherical 432.B, App. A, Table 2.1II triangle comparison theorem 178.A triangle inequality 273.A triangle test 346.E triangulable 65.A triangular (linear transformation) 379.L triangular element 304.C triangular factorization 302.B triangular matrix 269.B lower 269.B upper 269.B triangular number 4.0 triangulated manifold 65.B triangulation 65.A 70.C C- 114.C combinatorial 65.C combinatorial, problem 65.C compatible with 114.C finite 70.C trick, Alexander's 65.0 triclinic (system) 92.E Tricomi differential equation 326.C Tricomi problem 326.C tridiagonal matrix 298.0 trigamma function 174.B trigonal (system) 92.E trigonometric function 131.E 432.A, App. A, Table 2 inverse 131.E trigonometric integral 160.A trigonometric interpolation polynomial 336.E trigonometric polynomial, generalized 18.B trigonometric series 159.A generalized 18.B trigonometric sum 4.C trigonometric system 159.A trigonometry 432, App. A, Table 2 plane 432.A spherical 432.B trilinear coordinates 90.C trimmed mean, C(- 371.H triple 2oo.Q,L homotopy exact sequence of 202.L triple product, scalar, vector 442.C, App. A, Table 3.1 triplet, Oel'fand 424.T tripolar coordinates 90.C trisection of an angle 179.A trivalent map 157.B trivial (extension) 390.J (knot) 65.0 235.A K- (torus) 13.0 trivial bundle 147.E trivial fiber space, 10cal1y 148.B trivialization (of a block bundle) 147.Q trivially (act on a G-space) 431.A trivial sheaf 383.0 trivial solution (of a system of linear homogeneous equations) 269.M trivial topology 425.C trivial valuation 439.C,F trochoid 93.H Trotter product formula 351.F true 411.E 2136 true anomaly 309.B true value of a parameter 398.A truncated Wightman function 150.0 truncation error 138.B 303.B local 303.E truth definition 185.0 truth function 411.E truth value (of a formula) 411.E Tsen theorem 27.E 118.F tube 126.E regular 193.K vector 442.0 tuboid 125.V tubular neighborhood 105.L 114.B 364.C open 105.L 114.B tubular neighborhood system, controlled 418.0 Tucker theorem on complementary slackness 255.B Tucker theorem, Kuhn- 292.B tuple, n- 256.A 381.B turbulence 433 homogeneous 433.C isotropic 433.C turbulent flow 205.E 433 Turing machine 31.B universal 31.C turning point 25.B 254.F twinable 92.0 twinning structure 92.0 twin primes 123.C twisted type, group of 15l.I two-bin system 227 two-body interaction 27l.C two-body problem 55.A two-dimensional KdV equation 387.F 2-isomorphic 186.H two-person game, zero-sum 108.B two-phase simplex method 255.C two-point boundary value problem (of ordinary differential equations) 315.A two sheets hyperboloid of 350.B hyperboloid ofrevolution of 350.B two-sided exponential distribution App. A, Table 22 two-sided generator (for an automorphism of a measure space) 136.E two-sided ideal 368.F two-sided o-ideal 27.A integral 27.A two-sided scale 19.0 two-sided surface 410.B two-stage least squares method 128.C two-stage sampling 373.E two-stage stochastic programming 408.A two-terminal characteristic 281.C two-terminal network 281.C two-terminal problem 281.C two-valued logic 411.L two-way elimination of heterogeneity, design for 102.K two-way layout 102.H type (of an Abelian group) 2.B,0 (of an object) 356.F (of a quadratic form) 348.E (of a structure) 409.B (of a transcendental number) 430.C acute 304.C backward 304.0 
2137 Bravais (of lattices) 92.B Bravais, of the class of (T, K) 92.B compact 412.0 dimension 117.H of finite (graded module) 203.B of finite (module) 277.0 of finite (morphism of schemes) 16.0 of finite (iP-module) 16.E of finite, subshift 126.1 forward 304.0 Fredholm, integral equation of 217.A Fredholm, integrodilTerential equation of 222.A Fuchsian (linear ordinary differential equations) 253.A Fuchsian (visibility manifold) 178.F general 72.1 general, surface of 72.K homotopy 202.F homotopy (of a link) 235.0 homotopy, invariant 202.F isotopy (of knots) 235.A isotopy (of a transformation group) 431.A k-, Markov branching process 44.E knot 235.A link 235.0 locally of finite (morphism of schemes) 16.0 mixed 304.C 326.A multi-, Markov branching process 44.E multidiagonal 304.C noncom pact 412.0 nonconforming 304.C orbit (ofa G-space) 431.A Parreau- Widom 164.K positive (symmetric kernel) 338.0 principal orbit 431.C spherical G-fiber homotopy 431.F Stollow 207.B twisted, group of 151.1 Volterra, integral equation of 217.A Volterra, integrodifferential equation of 222.A Weierstrass-, preparation theorem (for micro- differential operators) 274.F type poo, Abelian group of 2.0 type (p, q) (of an operator) 224.E tensor of 256.1 tensor space of 256.J weak (of an operator) 224.E type (r, s), differential form of 72.C type S (harmonic boundary) 207.B type S, space of 125.T type number (of a solution of a system of linear ordinary differential equations) 314.A type problem (for Riemann surfaces) 367.0 type theory 411.K ramified 411.K simple 411.K type (0, 1), tangent vector of 72.C type (1,0), tangent vector of 72.C type I (v on Neumann algebra) 308.E C*-algebra of 308.L group of 437.E type I group 437.E type Im.m' (irreducible symmetric bounded domain) 412.1 type In (v on Neumann algebra) 308.E type II (von Neumann algebra) 308.E type II I Subject Index Unbiased estimator (ergodic countable group) 136.F (von Neumann algebra) 308.E,F type IIm (irreducible symmetric bounded domain) 412.1 type II", (ergodic countable group) 136.F (v on Neumann algebra) 308.E,F type III (ergodic countable group) 136.F (von Neumann algebra) 308.E structure theorem for von Neumann algebras of 308.1 type IlIa (factor) 308.1 type m I (factor) 308.1 type III, (factor) 308.1 type 111m (irreducible symmetric bounded domain) 412.1 type IV m (irreducible symmetric bounded domain) 412.1 type AI, All, AlII, AIV (irreducible symmetric Riemannian space) 412.0 type BOI, BOIl (irreducible symmetric Riemannian space) 412.0 type CI, CII (irreducible symmetric Riemannian space) 412.0 type 0111 (irreducible symmetric Riemannian space) 412.0 U V(n) (unitary group) 60.F u-chain 260.1 u-curve 111.H u,-scale 19.0 V-invariant (subspace) 437.C V-set 159.1 V-statistic 274.1 U test, Mann-Whitney 371.C U-number 430.C U*-number 430.C UCL (upper control limit) 404.B Ugaheri maximum priciple 338.C Uhlenbeck Brownian motion, Ornstein- 45.1 Ulam theorem, Borsuk- 153.B Ulm factor(s) 2.0 sequence of 2.0 ultrabornological (locally convex space) 424.W ultradifferentiable function 168.B ultradistribution 125.U,BB of class {Mp} 125.U of class {M p } 125.U ultrafilter 87.1 ultrainfinite point 285.C ultrapower 276.E ultraproduct(s) 276.E fundamental theorem of 276.E ultraspherical polynomials 317.0 ultraviolet divergence 132.C 146.B umbilical point (of a surface) III.H 365.0 umbilical submanifold, totally 365.0 Umkehr homomorphism 201.0 UMP (uniformly most powerful) (test) 400.A UMP invariant level C( test 4oo.E UMP unbiased level C( test 400.C UMV unbiased estimator 399.C unavoidable set 157.0 unbiased confidence region 399.Q uniformly mOst powerful 399.Q unbiased estimator 399.C asymptotically (mean) 399.K 
Subject Index Unbiased level rx test best linear 403.E kth order asymptotically median 399.0 mean 399.C median 399.C modal 399.C UMV 399.C uniformly minimum variance 399.C unbiased level C( test 400.C uniformly most powerful 400.C unbiasedness 399.C unbounded (covering surface) 367.B uncertainty (in observations) 351.C uncertainty relation, Heisenberg 351.C unconditonally convergent (series) 379.C (series in a Banach space) 443.0 undecidabJe proposition, formally 185.C undefined concept 35.B undefined term 35.B undercrossing point 235.A underdetermined system of differential operators 112.R of partial differential operators 320.F underflow 138.B underlying group (of a topological group) 423.A underlying topological space (of a complex manifold) 72.A (ofa differentiable manifold) 105.0 (of a topological group) 423.A undirected graph 186.B undotted index 258.B undotted spinor (of rank k) 258.B unfolding (of a germ of an analytic function) 418.E constant 51.0 f- 51.0 universal 418.E unicity theorem, Luzin's 22.C unicursal (ordinary Curve) 93.C unicursal curve 9.C 93.H unicursal graph theorem (Euler's) 186.F unified fieJd theory 434 nonsymmetric 434.C uniform (lattice of a Lie group) 122.K (sampJing procedure) 373.A uniform algebra 164.A uniform boundedness theorem 37.H uniform continuity lower class with respect to 45.F upper cJass with respect to 45.F uniform convergence 435 (of an infinite product) 435.A (of operators) 25l.C (of a series) 435.A abscissa of 121.B 240.B on compact sets 435.C Weierstrass criterion for 435.A uniform covering system 4360 uniform distribution 182.H 341.0, App. A, Table 22 uniform family of neighborhoods system 436.0 uniform isomorphism 436.E uniformity 436.B base for the 436.B discrete 436.0 generated by a family of pseudometrics 436.F generated by a pseudometric 436.F left (of a topological group) 423.0 product 436.E 2138 pseudometric 436.F relative 436.E right (of a topological group) 423.0 stronger 436.E T l - 436.C top010gy of the 436.C weaker 436.E uniformizable (topological space) 436.H uniformization 367.C (of a set in a product space) 22.F Schottky 367.C uniformization theorem general 367.0 Kondo 22.F uniformized 367.C locally 367.C uniformizing parameter, local (of a Riemann surface) 367.A uniformly (partial recursive function) 356.E (primitive recursive function) 356.B uniformly absolutely convergent (series I 435.A uniformly almost periodic function 18.B uniformly asymptotically stable (solution of a differential equation) 394.B (solution of a functional differential equation) 163.0 uniformly best (estimator) 399.C uniformly better (decision function) 398.B uniformly consistent test 4oo.K uniformly continuous (function) 84.A (mapping) 273.1 436.E on a subset 436.0 uniformly convergent (sequence) 435.A on a family of sets 435.C in the wider sense 435.C uniformly convex (normed linear space) 37.0 uniformly equivalent (uniform spaces) 436.E uniformly integrable (family of random variables) 262.A uniformly locally compact (space) 425.V uniformly Lyapunov stable 126.F uniformly minimum variance unbiased estimator 399.C uniformly most powerful (confidence region) 399.Q (test) 4oo.A invariant 399.Q invariant level C( 4oo.E unbiased 399.Q unbiased level C( 400.E uniformly recursive in 'P (define a partial recursive function) 356.E uniformly smooth (normed linear space) 37.0 uniformly stable 394.B uniform neighborhood system 436.0 uniform norm 168.B uniform operator topology 251.C uniform space(s) 436 analytically 125.S complete 436.0 Hausdorff 436.C locally totally bounded 436.H metrizable 436.F precompact 436.H product 436.E pseudometrizable 436.F separated 436.C 
2139 T , - 436.C totally bounded 436.H uniform star convergence, relative 310.F uniform structure 436.B uniform subs pace 436.E uniform topological space 436.B uniform topology 436.B unilateral constraints 440.A unilateral shift operator 390.1 unimodal (distribution function) 341.H unimodular (germ of an analytic function) 418.E (locally compact group) 225.0 totally 186.0 unimodular group 60.B quaternion 412.0 union (in axiomatic set theory) 33.B (ofmatroids) 66.H ( of sets) 381.B axiom of 381.0 disjoint 381.B,0 of hypersurface elements 82.A of surface elements 324.B unipotent (algebraic group) 13.E (linear transformation) 269.L unipotent component 269.L unipotent matrix 269.F unipotent part (of an algebraic group) 13.E (of a nonsingular matrix) 13.E unipotent radical 13.1 unique continuation theorem 323.1 unique decomposition theorem (for a 3-manifold) 65.E unique factorization domain 67.H unique factorization theorem (in an integral domain) 67.H uniquely ergodic (homeomorphism (on a compact metric space) 136.H uniqueness in the sense of law of solutions 406.0 set of 159.1 of solution, pathwise 406.0 uniqueness condition (for solutions of ordinary differential equations) 316.0 uniqueness principle (in potential theory) 338.M uniqueness theorem (for analytic functions) 198.C (for class field theory) 59.B (for differential equations in a complex domain) 316.0 (for Fourier transform) 192.1 (for harmonic functions) 193.E (for an initial value problem of ordinary differ- ential equations) 316.0 of the analytic continuation 198.C,I Holmgren 321.F of homology theory 201.R Rellich 188.0 von Neumann 351.C unique strong solution 406.0 unirational surface 15.H unirational variety 16.1 uniserial algebra 29.1 absolutely 29.1 generalized 29.1 unisolvent space 142.B unisolvent system (of functions) 336.B Subject Index Unit circle unit(s) (of an algebraic number field) 14.0 (for measure oflength) 139.C (in a ring) 368.B (of a symmetric matrix with rational coordi- nates) 348.1 (of a vector lattice) 31O.B Archimedean (of a vector lattice) 310.B arithmetic 75.B auxiliary 414.A base 414.A circular 14.L control 75.B derived 414.0 fundamental 414.A fundamental (of an algebraic number field) 14.0 gravitational, system of 414.B imaginary 74.A 294.F international system of 414.A Kakutani 310.0 matrix 269.B memory 75.B system of 414 unital 36.A unitarily equivalent (self-adjoint operators) 390.0 unitary (homomorphism between rings) 368.0 (module) 277.0 essentially 390.1 unitary algebra 29.A unitary dilation 251.M unitary field theory 434.C unitary group 60.F 151.I (relative to an a-Hermitian form) 60.0 infinite 202.V over K 60.H over K, projective special 60.H over K, special 60.H projective 60.F special 60.F special (relative to an a-Hermitian form) 60.0 unitary matrix 269.1 unitary monoid 409.C unitary operator 251.E 390.E unitary representation(s) 437 disjoint 437.C equivalent 437.A induced by a representation of a subgroup 437.0 integrable 437.X irreducible 437.A isomorphic 437.A quasi-equivalent 437.C similar 437.A square integrable 437.M sufficiently many irreducible 437.B unitary restriction (of a semisimple Lie algebra) 248.P unitary ring 368.A 409.C unitary semigroup 409.C unitary symplectic group 60.L unitary transformation 348.F (relative to an a-Hermitian form) 60.0 unitary transformation group 60.F unit ball (of a Banach space) 37.B (of a Euclidean space) 140 unit cell 140 unit circle 74.C 140 
Subject Index Unit cost unit cost 281.0 unit cube 139.F 140 unit disk 140 unit distribution 341.0 unit element (of a field) 149.A (of a group) 190.A (of a ring) 368.A unit function 306.B, App. A, Table 12.II unit group (of an algebraic number field) 14.0 unit impulsive function App. A, Table 12.II unit mapping 203.F unit matrix 269.A unit n-cube 140 unit point (of an affine frame) 7.C (of a projective frame) 343.C unit ray 351.B unit rcprcsentation (of a group) 362.C unit sphere 140 unit tangent sphere bundle 126.L unit theorem, Oirichlet 14.0 unit vector 7.C 442.B unity (in the axioms for the real numbers) 355.A partition of 425.R partition of, of class Coo 105.S partition of, subordinate to a covering 425.R primitive root of 14.L unity element (of a field) 149.A (of a ring) 368.A univalence superselection rule 351.K univalent (analytic function) 438.A univalent correspondence 358.B univalent function 438 univariate (statistical data) 397.A uni versal (ii-functor) 200.1 (*-representation of a Banach *-algebra) 36.0 (unfolding) 51.0 universal bundle 147.0,H n- 147.0 universal Chern class 56.C universal coefficient theorem (in Abelian categories) 200.H (for cohomology) 200.0201.H for homology 200.0201.0 universal constants (in the theory of conformal mapping) nF universal covering group 91.B 423.0 universal covering space 91.B universal covering surface 367.B universal curve 93.H universal domain 16.A universal enveloping algebra (of a Lie algebra) 248.1 special (of a Jordan algebra) 231.C universal enveloping bialgebra 203.0 universal Euler-Poincare class 56.B universal gravitation, law of 271.B universally Japanese ring 16.Y 284.F universally measurable 270.L universal mapping property 52.L universal net (in a set) 87.H universal Pontryagin class 56.0 universal proposition 411.B umversal quantifier 411.C universal set (for the projective sets of class n) 22.E 2140 (in set theory) 381.B universal Stiefel-Whitney class 56.B universal Teichmiiller space 416 universal Turing machine 31.C universal unfolding 418.£ universal validity of a proposition, problem of 97 universe (in nonstandard analysis) 293.B (of a structure) 276.B unknotted (ball pair) 65.0 235.0 (knot) 235.A (sphere pair) 65.0235.0 unknotting conjecture 235.0 unknotting theorem, Zeeman 65.0 unlabeled graph 186.B unmixed ideal 284.0 unmixedness theorem 284.0 unordered pair 381.B (in axiomatic set theory) 33.B axiom of 33.B unoriented cobordism class 114.H un oriented cobordism group 114.H unoriented graph 186.H unramified (covering surface) 367.B (prime ideal) 14.1 (projection of a covering surface) 367.B analytically (semilocal ring) 284.D unramified covering (of a nonsingular curve) 9.1 unramified extension 14.1 257.0 unrenormalizable 132.C 361.B unsolvability degree of 97 recursive, arithmetical hierarchy of degrees of 356.H recursive, degree of 97 recursive, hyperarithmetical hierarchy of de- grees of 356.H unstable (boundary component) 77.E (state) 394.A completely (flow) 126.£ unstable manifold 126.0,J unstable solution (of Hill's equation) 264.E up-ladder 206.B upper bound (of a subset in an ordered set) 31 I.B least (of an ordered set) 311.B least (of a subset of a vector lattice) 310.C upper boundedness principle (in potential theory) 338.C upper central series (of a group) 190.1 upper class with respect to local continuity 45.F with respect to uniform continuity 45.F upper control limit 404.B upper derivative general (of a set function) 380.0 ordinary (of a set function) 380.D upper end (of a curvilinear integral) 94.0 upper envelope (of a family of subharmonic func- tions) 193.R upper half-space of degree n, Siegel 32.F upper integral, Riemann 216.A upper limit (of a Riemann integral) 216.A upper limit function 84.C upper semi continuous (at a point) 84.C (partition) 425.L 
2141 in a set 84.C upper semilattice 243.A upper triangular matrix 269.B upper variation (of a set function) 380.B Uryson lemma 425.Q Uryson space, Frechet- 425.CC Uryson theorem, Tikhonov- 425.Q Uryson-Tikhonov theorem (on metrizability) 273.K Uzawa gradient method, Arrow-Hurwicz- 292.E v v-curve II1.H vacuum vector 377.A free 150.C vague topology (on a class of measures) 338.E valid formula 411.0 valuation(s) 439 additive 439.B Archimedean 14.F 439.C complete 439.0 completion of 439.0 discrete 439.E equivalent 439.B exponential 439.B generalized 439.B multiplicative 439.C non-Archimedean 14.F 439.C normal 439.E,H normalized 439.E p-adic 439.F p-adic exponential 439.F prolongation of 439.B pseudo- 439.K special 439.B over a subfield 439.B,C trivial 439.C,F valuation ideal (of a valuation) 439.B valuation ring 439.B completion of 439.0 discrete 439.E valuation vector(s) 6.C ring of 6.C value(s) (of an infinite continued fraction) 83.A (of an infinite product) 379.0 (of a variable) 165.C absolute (of a complex number) 74.B absolute (of an element of an ordered field) 149.N absolute (of an element of a vector lattice) 310.B absolute (of a real number) 355.A absolute (of a vector) 442.B asymptotic (of a meromorphic function) 62.A 272.H boundary (of a conformal mapping) 77.B boundary (hyperfunction) 125.V boundary (relative to a differential operator) 112.E boundary, problem  boundary value problem characteristic (of a linear operator) 390.A cluster 62.A cluster, theorem 43.0 critical (in bifurcation theory) 286.R critical (of a COO-function on a manifold) 279.B critical (of a Contact process) 340.C critical (of an external magnetic field) 340.B critical (of a mapping u: R" -. Rm) 208.B Subject Index Variable(s) critical (of a Coo-mapping <p: M -. M') 105.1 exceptional (of a transcendental entire function) 429.B exceptional, Borel 272.E exceptional, Nevanlinna 272.E exceptional, Picard 272.E expectation (of an observable) 351.B expected (of a random variable) 342.C gap (of a point on a Riemann surface) 11.0 initial (for ordinary differential equations) 316.A initial (for partial differential equatons) 321.A initial (for stochastic differential equations) 406.0 initial, problem  initial value problem limit (of a mapping) 87.F mean (of a function on a compact group) 69.A mean (of a weakly stationary process) 395.C mean, theorem  mean value theorem most probable 401.E principal (of inverse trigonometric functions) 131.E principal (oflog z) 131.0 principal, Cauchy (of an improper integral) 216.0 principal, Cauchy (of the integral of a function in ( - 00,00)) 216.E proper (of a boundary value problem) 315.B proper (of a linear mapping) 269.L proper (of a linear operator) 390.A proper (of a matrix) 269.F range of (of a meromorphic function) 62.A regular 105.1 sample 396.B sample characteristic 396.C Shapley 173.0 singular 302.A singular, decomposition (SVO) 302.E starting 303.E stationary (of a function) 106.L true, of parameter 398.A truth (ofaformula) 411.E value distribution 124.A value function 108.B 405.A value group (of an additive valuation) 439.B (of a multiplicative valuation) 439.C Vandermonde determinant 103.0 van der Pol differential equation 290.C van der Waerden-Bortolotti covariant derivative 417.E van der Waerden test 371.C Vandiver conjecture 14.L van Hove sense, limit in 402.0 vanishing cocycle 16.U vanishing cycle 418.F vanishing theorem (on compact complex manifolds) 194.0 Kodaira 232.0 van Kampen theorem (on fundamental groups) 170 variability, measure of 397.C variable(s) 165.C (of a polynomial) 369.A artificial 255.C auxiliary 373.C basic 255.A bound 411.C canonical (in analytical dynamics) 271.F change of (in integral calculus) 216.C 
Subject Index Variable component (of a linear system) complex 165.C complex, theory of functions of 198.Q dependent 165.C differential (of a differential polynomial) 113 endogenous 128.C exogenous 128.C explanatory 403.0 hidden, theories 351. L independent l65.C individual 411.H inner 25.B lagged l28.C object 411.0 outer 25.B predetermined 128.C predicate 411.0,H proposition 411.E random 342.C random, independent 342.C random, joint 342.C random, n-dimensional 342.C random, R"-valued 342.C random, (S, (f)-valued 342.C real 165.C sampling inspection by 404.C separation of 322.C slack 255.A state l27.A variable component (of a linear system) 15.C 16.N variable method, discrete 303.A variable-step variable-order (VSVO) algorithms 303.E vanance (of a probability distribution) 341.B (of a random variable) 342.C (of univariate quantitative data) 397.C analysis of 400.H 403.0 between-group 397.L generalized 280.E 397.1 multivariate analysis of 280.B population 396.C sample 396.C sample generalized 280.E uniformly minimum unbiased estimator 399.C within-group 397.L variance-covariance matrix 341.B 397.1 variance matrix 341.B variate canonical 280.E fixed 403.0 variation(s) (of an integral) loo.E bounded  of bounded variation calculus of 46 calculus of, classical theory of 46.C calculus of, conditional problems in 46.A calculus of, fundamental lemma in 46.B coefficient of 397.C of constants, Lagrange's method of 252.0 of constants, method of 55.B 252.1 first 46.B first, formula 178.A geodesic 178.A lower (of a set function) 380.B negative (of a mapping) 246.H negative (of a real bounded function) 166.B one-parameter 178.A of parameters, Lagrange method of 252.D of parameters, method of App. A, Table 14.1 positive (of a mapping) 246.H 2142 positive (of a real bounded function) 166.B proper 279.F quadratic, process 406.B second formula 178.A total (of a finitely additive vector measure) 443.0 total (of a mapping) 246.H total (of a real bounded function) 166.B total (of a set function) 380.B upper 380.B variational derivative 46.B variational equation 316.F 394.C variational formula, constant 163.E variational inequality 440 of evolution 440.C stationary 440.B variational method 438.B variational principles(s) 441 (in ergodic theory) 136.0 (in statistical mechanics) 340.B 402.0 (in the theory of elasticity) 271.0 with relaxed continuity requirements 271.0 for the topological pressure 136.H variational problem, Oauss 338.1 variation curve 178.A variation vector field 178.A variety (algebraic variety) 16.A (of block design) 102.B Abelian 3 Abelian, isogeneous 3.C Abelian, polarized 3.0 Abelian, simple 3.B abstract 16.C abstract algebraic 16.C affine 16.A affine algebraic 16.A Albanese 16.P Albanese (of a compact Kiihler manifold) 232.C algebraic 16 algebraic group 13.B almost all points of a 16.A Brieskorn 418.0 characteristic (of a microdifferential equation) 274.0 Chow 16.W complex algebraic 16.T function on a 16.A generalized Jacobian 9.F I1.C group 13.B 16.H irreducible 16.A Jacobian 9.E I1.C 16.P Landau 146.C Landau-Nakanishi 146.C 386.C linear (in an O-module) 422.L linear, linearly compact 422.L minimal 275.0 nonsingular 16.F normal 16.F normal algebraic 16.F Picard 16.P Picard (of a compact Kiihler manifold) 232.C prealgebraic 16.C product algebraic 16.A projective 16.A projective algebraic 16.A quasi-affine algebraic 16.C quasiprojective algebraic 16.C rational 16.1 
2143 rational function on a 16.A reducible 16.A Schubert 56.E smooth 16.F strict Albanese 16.P toric 16.z unirational 16.1 Zariski topology of a 16.A varifold 275.0 Varshamov-Oilbert-Sacks bound 63.B vector( s) (in a Euclidean space) 442 (in a linear space) 256.A analytic 437.S characteristic (of a linear mapping) 269.L characteristic (of a linear operator) 390.A characteristic (of a matrix) 269.F coherent 377.0 collinear 442.A column 269.A con tra varian t 256.1 coplanar 442.A covariant 256.1 cyclic (of a representation space of a unitary representation) 437.A effect 102.A eigen- (ofa linear mapping) 269.L eigen- (ofa linear operator) 390.A eigen- (ofa matrix) 269.F eigen-, generalized 390.B error 102.A fixed 442.A four- 359.C four-, energy-momentum 258.C free 442.A free vacuum 150.C fundamental (in a vector space) 442.A horizontal 80.C independent 66.F integral 428.E mean 341.B mean curvature 365.0 normal 105.L 111.H 364.A normalized 409.0 observation 102.A orthogonal 139.0 p- 256.0 po, bundle of 147.F positive 7.A 442.A Poynting 130.A proper (of a linear mapping) 269.L proper (of a linear operator) 390.A proper (of a matrix) 269.F root 390.B row 269.A tangent 105.H tangent, holomorphic n.A tangent, of type (0, 1) 72.C tangent, of type (1,0) n.c unit 442.B unit (of an affine frame) 7.C vacuum 377.A valuation 6.C valuation, ring of 6.C vertical 80.B wave number (of a sine wave) 446 Witt 449 Witt, oflength n 449.B zero 442.A vector algebra App. A, Table 3.1 Subject Index Vector triple product vector analysis and coordinate systems App. A, Table 3 vector bundle 147.F (algebraic) 16.Y ample 16.Y complex 147.F cotangent 147.F dual 147.F indecomposable 16.Y normal 105.L normal k- 114.1 quaternion 147.F semistable 16.Y stable 16.Y 237.B stably equivalent 237.B tangent 105.H 147.F vector field (on a differentiable manifold) 105.M (in a 3-dimensional Euclidean space) 442.0 Anosov 126.1 Axiom A 126.1 basic 80.H of class C' 105.M contravariant 105.0 covariant 105.0 differentiation of App. A, Table 3.II formal 105.AA fundamental 191.A G- 237.H Hamiltonian 126.L 219.C holomorphic n.A integral of App. A, Table 3.m irrotational 442.0 Killing 364.F Lagrangian 126.L lamellar 442.0 Morse-Smale 126.1 solenoidal 442.0 vanatlOn 178.A without source 442.0 without vortex 442.0 vector flux (through a surface) 442.0 vector function, measurable 308.0 vector group 422.E vectorial form, canonical 417.C vectorial p-form 417.C vector integral 443.A vector invariant 226.C vector lattice 310.B Archimedean 310.C complete 310.C normed 310.F O'-complete 310.C vector line (of a vector field) 442.0 vector measure 443.0 absolutely continuous 443.0 bounded 443.0 completely additive 443.0 finitely additive 443.0 vector potential 130.A 442.0 vector product 442.C, App. A, Table 3.1 vector representation (of a Clifford group) 61.0 vector space 442.A over a field 256.H metric 256.H prehomogeneous 450.V standard (of an affine space) 7.A tangent 105.H topological 424.A vector triple product 442.C, App. A, Table 3.1 
Subject Index Vector tube vector tube 442.0 vector-valued integral 443 velocity group 446 phase (of a sine wave) 446 velocity phase space 126.L velocity potential 205.B Veneziano model 132.C 386.C Verbeure inequality, Roepstorff- Fannes- 402.0 Veronese surface 275.F versal (unfolding) 51.0 version (of a stochastic process) 407.A vertex (vertices) (of an angle) 139.0 155.B (in a cell complex) 70.0 (of a circular cone) 78.A (of a complete quadrangle) 343.C (of a convex cell in an affine space) 7.0 (of a convex polyhedron) 89.A (in a Euclidean (simplicial) complex) 70.B (of a geodesic triangle) 178.A (of a graph) 186.B (of a linear graph) 282.A (of a parabola) 78.C (of a polygon) 155.F (in the polyhedron of a simplicial complex) 70.C (of a simplex in an affine space) 7.0 (in a simplicial complex) 70.C (of a spherical triangle) 432.B (of a star region) 339.0 adjacent 186.B end 186.B initial 186.B isolated 186.B terminal 186.B vertical angles 139.0 vertical component (of a vector field) 80.C vertical slit mapping, extremal 367.0 vertical vector 80.B very ample (divisor) 16.N (linear system) 16.N (sheaf) 16.E very weak Bernoulli process 136.E Vessiot extension field, Picard- 113 Vessiot theory, Picard- 113 Vey classes, Oodbillon- 154.0 vibrating membrane, equation of a 325.A vibrating string, equation of a 325.A vibration 318 normal 318.B parametrically sustained 318.B self-excited 318.B Viete, F. 444 Vietoris axiom 425.Q Vietoris exact sequence, Mayer- (for a proper triple) 201.C Villat integration formula App. A, Table 15.VI Vinogradov mean value theorem 4.E virtual arithmetic genus (of a divisor) 16.E viscosity 205.B coefficient of 205.C coefficient of bulk 205.C coefficient of shear 205.C magnetic 259 visibility manifold 178.F Vitali covering theorem 380.0 Vivanti theorem 339.A Volterra integral equation 217.A 2144 Volterra operator 68.J Volterra theorem, Poincare- 198.J Volterra type integral equation of 217.A integrodifferential equation of 222.A volume (of an idele) 6.0 (of a lattice in a Euclidean space) 92.0 (of a polyhedron) 139.F (of a simplex in an affine space) 7.E inner 270.0 outer 270.0 volume element (of an oriented Coo-manifold) 105 W associated with a Riemannian metric 105.W integral of a function with respect to a (on a Coo-manifold) 105.W von Karman transonic similarity 205. D von Mises theorem 399.K von Neumann, J. 445 von Neumann algebra 308.C discrete 308. E finite 308.E induced 308.C purely infinite 308.E reduced 308.C semifinite 308.E structure theorem for, of type III 308.1 of type I 308.E of type II 308.E of type III 308.E of type II", 308.E of type III 308. E von Neumann condition 304.F von Neumann density theorem 308.C von Neumann-Halmos theorem 136.E von Neumann inequality 251.M von Neumann-Morgenstern solution 173.0 von Neumann reduction theory 308.G von Neumann selection theorem 22.F von Neumann theorem, Weyl- 390.1 von Neumann uniqueness theorem 351.C vortex, vector field without 442.0 vortex line 205.B vorticity 205.B vorticity theorem, Helmholtz 205.B Vossen theorem, Cohn- 111.1 VSVO algorithm 303.E V.W.B. process 136.E w w(n) (Sobolev space) 168.B w-plane 74.0 w-point (of an entire function) 429.B w-sphere 74.0 W-construction (of an Eilenberg-MacLane complex) 70.F W-surface 111.1 W*-algebra 308.C Wagner function 39.F waiting time 260.H waiting time distribution 307.C Wald theorem 399.H,M Walker equation, Yule- 421.0 Walker metrics, Robertson- 359.E wall adiabatic 419.A diathermal 419.A Wall group 114.1 
2145 Wallis formula App. A, Table 10.V1 Wallis test, Kruskal- 371.0 Walsh system of orthogonal functions 317.C wandering 126.E weakly, under a group 136.E wandering point 126.E wandering set 136.C weakly l36.C Wang exact sequence (of a fiber space) 148.E Waring problem 4.E Warning second theorem 118.B Warning theorem 118.B wasan 230 water wave(s) 205.F deep 205.F long 205.F shallow 205.F Watson formula 39.E, App. A, Table 19.1V Watson-Nicholson formula App. A, Table 19.II1 Watson process Oalton- 44.B multi (k)-type Oalton- 44.C Watson transform 160.C 220.B wave(s) 446 Alfven 259 capillary 205.F dispersive 446 electromagnetic 446 electromagnetic, theory of 130.B fast 259 gravity 205.F gravity, long 205.F longitudinal 446 Mach 205.B partial 386.B partial, expansion 375.E 386.B plane 446 polarized 446 shock 205.B 446 sine 446 sinusoidal 446 slow 259 spherical 446 stationary 446 Stokes 205.F surface 446 transverse 446 water 205.F water, deep 205.F water, long 205.F water, shallow 205.F wave equation 325.A 446, App. A, Table 15.m wave expansion, partial 375.E 386.B wave front set 274.B 345.A analytic 274.0 wave function 351.0 spheroidal 133.E wave guide 130.B wavelength (of a sine wave) 446 wave number (of a sine wave) 446 wave operator 375.B,H generalized 375 B incoming 375.B modified 375.B outgoing 375.B wave propagation 446 wave scattering amplitude, partial 375.E wave steepness 205.F W.B. process l36.E WC group 118.0 Subject Index Weak topology weak (boundary component) 77.E weak Bernoulli process 136.E very 136.E weak convergence (of operators) 251.C (of probability measures) 341.F (of a sequence of submodules) 200.1 weak Coo topology 279.C weak derivative 125.E weak dimension (of a module) 2oo.K weaker (equivalence relation) 135.C (method of summation) 379.L (topology) 425.H (uniformity) 436.E weak extension (of a differential operator) 112.E,F weak form of the boundary value problem (of partial differential equations) 304.B weak global dimension (of a ring) 2oo.K weak homotopy equivalence 202.F weak lacuna 325.1 weak law of large numbers 395.B weak Lefschetz theorem 16. U weakly, converge (in a normal linear space) 37.E (in a topological linear space) 424.H weakly almost complex manifold 114.H weakly compact (linear operator) 68.M weakly compact cardinal number 33.E weakly continuous (function with values in a Banach space) 37.K weakly continuous representation (of a topological group) 69.B weakly dominated (statistical structure) 396.F weakly equivalent (transformations) 136.F weakly G-stationary (system of random variables) 395.1 weakly hyperbolic linear (differential operator) 325.H weakly inaccessible (ordinal number) 312.E weakly inaccessible (cardinal number) 33.E weakly integrable 443.F weakly isomorphic (automorphisms) 136.E weakly measurable 443.B,1 weakly mixing (automorphism) 136.E weakly modular 351.L weakly nonlinear differential equations 290.0 weakly I-complete manifold 114.H weakly stationary process 395.A weakly stationary process of degree k 395.1 weakly stationary random distribution 395.C weakly symmetric Riemannian space 412.1 weakly wandering set 136.C weakly wandering under a group 136.F weak minimum 46.C weak Mordell-Weil theorem 118.E weak operator topology 251.C weak potential kernel 260.0 weak solution 204.C 323.0 378.1 weak solution, Hopf's 204.C weak* Oirichlet algebra 164.0 weak star topology 37.E 424.H weak topology (in a cell complex) 70.0 (on a class of measures) 338.E (on a direct product space) 425.K (on a direct sum) 425.M (on a locally convex space) 424.H (on a normed linear space) 37.E (relative to the pairing <E, F» 424.H 
Subject Index Weak type (p, q), quasi-linear operator of hereditarily 425.M weak type (p, q), quasi-linear operator of 224.E Weber differential equation 167.C, App. A, Table 20.m Weber formula App. A, Table 19.1V Weber function 167.C, App. A, Table 19.1V 20.m Weber-Hermite differential equation 167.C Weber-Sonine formula App. A, Table 19.m web group 234.B Wedderburn-Mal'tsev theorem (on algebras) 29 F Wedderburn theorem (on commutativity of finite fields) 149.M (on simple algebras on a field) 29.E (on simple rings) 368.0 wedge 125.V infinitesimal 125 V wedge product (of derived functors) 200.K wedge theorem, edge of the 125. W Weierstrass, K. 447 analytic function in the sense of 198.1 Weierstrass approximation theorem 336.A Weierstrass canonical form (of elliptic curves) 9.0 (of the gamma function) 174.A Weierstrass canonical product 429.B Weierstrass criterion for uniform convergence 435.A Weierstrass elliptic functions 134.F, App. A, Table 16.1V Weierstrass-Enneper formula 275.A Weierstrass -function 134.F, App A, Table 16.1V Weierstrass point 11.0 Weierstrass preparation theorem 21.E 370.B Weierstrass sigma function 134.F Weierstrass-Stone theorem 168.B Weierstrass theorem (on compactness of subsets of R) 355.0 (on continuous functions on a compact set) 84.C (on essential singularities) 198.0 (on expansion of meromorphic functions) 272.A (on transcendental entire functions) 124.B Bolzano- 140 273.F Casorati- (on essential singularities) 198.0 of double series 379.H Lindemann- 430.0 Weierstrass-type preparation theorem (for micro- differential operators) 274.F Weierstrass zeta function 134.F weight(s) (of an automorphic form) 32.C (in a barycenter) 7.C (of a covariant) 226.0 (of a multiple covariant) 226.E (of a representation of a complex semisimple Lie algebra) 248.W (on a von Neumann algebra) 308.0 (of a weighted homogeneous analytic function) 418.0 equal, principle of 402.E extremal length with 143.B highest (of a representation of a complex semi- simple Lie algebra) 248.W Kallen-Lehmann 150.0 weighted homogeneous (analytic function) 418.0 weighted moving average 397.N weight function (for the mean of a function) 211.C 2146 (in numerical integration) 299.A (in orthogonality) 317.A weight group (of a pair (1', K)) 92.C weighting matrix 86.B weight k, automorphic form of 32.B weight k, Fuchsian form of 32.B weight k, Hilbert modular form of 32.B weight k, Siegel modular form of 32.F weight lattice (of a pair (1', K)) 92.C weight m, automorphic form of 32.A weight w, invariant of 226.0 Weil-Chatelet group 118.0 Weil cohomology 450.Q Weil conjecture 450.Q Taniyama- 450.S Wen domain 21.0 Weil group 6.E 450.H Wen L-function 450.H Weil measure 225.0 Wen number 3.C Wen theorem Borel- 437.Q Mordell- 118.E weak Mordell- 118.E Weinberg-Salam model, Olashow- 132.0 Weingarten formula (for an isometric immersion) 365.C (in the theory of surface) 111.H, App. A, Table 4.1 Weingarten surface 111.1 Weirich formula App A, Table 19.m Weisberger sum rule, Adler- 132.C Welch test 400.0 well-behaved 36.K well-chained (metric space) 79.0 well-measurable 407.B well-measurable CT-algebra 407.B well-ordered set 311.C well-ordering 311 C well-ordering theorem 34.B well-posed (initial value problem) 321.E (martingale problem) 115.C (problems for partial differential equations) 322.A Wendroff scheme, Lax- 304.F Weyl, H 448 Weyl canonical basis 248.P Weyl chamber 13 J 248.R positive 248.R Weyl character formula (on representation of com- pact Lie groups) 248.Z Weyl conformal curvature tensor 80.P Weyl criterion 182.H Weyl form 351.C Weyl form of the CCRs (canonical commutation relations) 337.A Weyl formula 323.M Weyl group (of a BN pair) 13.R (of a complex semisimple Lie algebra) 248.R (of a connected algebraic group) 13.H (of a Coxeter complex) 13.R (of a root system) 13.1 (of a symmetric Riemannian space:) 413.F affine (of a symmetric Riemannian space) 413.1 k- 13.Q Weyl integral formula 225 I 
2147 Weyllemma 1120 Weyl-Stone-Titchmarsh-Kodaira theory 112.0 Weyl theorem (on Lie algebras) 248.F Cartan- 248.W Weyl theory Peter- (on compact groups) 69 B Peter- (on compact Lie groups) 249 U Weyl-von Neumann theorem 390.1 white group 92.0 Whitehead group (of a ring) 237.1 Whitehead product 202.P Whitehead theorem 202.N generalized 202.N Whitehead torsion 65.C white noise 176.0 Gaussian 407 C Whitney class, Stiefel- 56.B,F 147.M Whitney class, total Stiefel- 56 B Whitney class, universal Stiefel- 56.B Whitney condition (b) 418 G at a point 418.G Whitney extension theorem 168.B Whitney mapping (map) 201.1 Whitney number, Stiefel- 56.F Whitney prestratification 418 G Whitney stratification 418.G Whitney sum (of vector bundles) 147.F Whitney theorem 105.K Whitney V-test, Mann- 371.C Whittaker differential equation 167.B, App. A, Tables 14.11 19.II Whittaker function 167.B, App. A, Table 19.II Whittaker model 450.0 wider sense, Bayes solution in the 398.B Widom type, Parreau- 164 K width, curve of constant III.E width of an oval 111.E Wiener, generalized Tauberian theorems of 192.0 Wiener-Brelot solution, Perron- (of Oirichlet prob- lem) 120.C Wiener compactification 207 C Wiener filter 86 E Wiener formula 160.B Wiener-Hopf integrodifferential equation 222 C Wiener-1kehara-Landau theorem 123.B Wiener integral, multiple 176.1 Wiener-Ita decomposition 176.1 Wiener kernel 95 Wiener-Levy theorem 159.1 Wiener martingales, {.'J';}- 406 B Wiener measure 250.E Wiener measure with the initial distribution J1 45.B Wiener process 5.0 45.B 98 B Wiener test (for Brownian motion) 450 (for Oirichlet problem) 338.G (for random walk) 260.E Wiener theorem, Paley- 125.0,BB Wightman axiom 150.0 Wightman field 150.0 Wightman function 150.0 truncatcd 150.0 Wigner coefficients 353.B Wigner rotation 258.C Wigner supermultiplet 351.J Wigner theorem 258.C 351.H Wilcoxon signed rank test 371.B Wilcoxon test 371.C Subject Index Y ourden square design Wilczynski, directrix of 110.B wild (knot) 235.A (space) 65.F Wilson-Hilfertyapproximations 374.F Wilson theorem 297.G Wiman theorem 429.B winding number 198 B window 421.C winning strategy 33.F Wirtinger inequality App. A, Table 8 Wirtinger presentation (of a knot group) 235.B,0 Wishart distribution 374.C p-dimensional noncentral 374.C witch of Agnesi 93.H within-group variance 397 L without source (vector field) 442.0 without vortex (vector field) 442.0 Witt decomposition (of a quadratic form) 348.E Witt group (of non degenerate quadratic forms) 348.E Witt matrix, Hasse- 9.E Witt theorem (on a-trace forms) 60.0 (on quadratic forms) 348.E Poincare-Birkhoff- (on Lie algebras) 248.1 Witt vector 449 of length n 449.B WKB method 25.B WKBJ method 25.B Wold decomposition 395.0 Wolfowitz inequality 399 J word 31.B 190.M cord 63.A equivalent 31.B word problem (in a finitely presented group) 161.B worst-case complexity 71.A Wright differential equation, Cherwell- 291.F Wronskian (determinant) 208.E Wu class (of a topological manifold) 56.F Wu formula App. A, Table 6.V x x (distribution) 125.EE X-minimal function 367.E xi-axis (of a Euclidean space) 140 Y Y-diffeomorphism 136.G Y-flow 136.G Yamabe problem 183364.H Yang-Mills equation 80.Q Yang-Mills field 150.G Yang-Mills functional 80.Q Yang-Mills G-connection 80.Q Yates-Terry normal score test, Fisher- 371 C Y osida approximation 286,X Y osida theorem, Hille- 378.B Young diagram 362 H Young incquality 224.E, App. A, Table 8 Hausdorff- 224 E Young modulus 271.G Young symmetrizer 362.H Young theorem, Hausdorff- 317 B Yourden square 102.K Yourden square design 102.K 
Subject Index Yukawa potential Yukawa potential 338.M Yule-Walker equation 421.0 Z Z (integers) 294.A,C Z (Zermelo set theory) 33 B ZF (Zermelo-Fraenkel set theory) 33.B z-distribution 374.B, App. A, Table 22 z-plane 74.0 z-sphere 74.0 z-transformation, Fisher 374.0 Z-distribution 341.0 Z-set 382.B Z-action (continuous) 126.B Z-action of class C' 126.B Zt -extension 14 L basic 14.L cyclotomic 14 L Zariski closed (set) 16.A Zariski connectedness theorem 16 X Zariski decomposition 15.0 Zariski dense (set) 16.A Zariski main theorem 16.I Zariski open (set) 16.A Zariski ring 284.C complete 284 C Zariski site 16.AA Zariski topology (of a spectrum) 16.0 (of a variety) 16.A Zassenhaus group 151.H Zassenhaus theorem Jordan- (on integral representation of a group) 362.K Schur- (on Hall subgroups) 151.E Zeeman unknotting theorem 65.0 Zeller theorem, Brown-Shields- 43.C zenith angle App A, Table 3.V Zeno, paradoxes of 319 C Zermelo-Fraenkel set theory 33.A,B Zermelo set theory 33.B zero(s) 355.A (of a function on an algebraic curve) 9.C (of a function on an algebraic variety) 16.M homologous to 198.B homotopic to 202.B scattered, function with 208.C zero algebra 29.A zero cycles 16.R zero-dimensional space, Baire 273.B zero divisor (of a function on an algebraic variety) 16.M (with respect to M/P) 284.A (of a ring) 368.B zero element (of an additive group) 2.E 190.A (of a field) 149.A (of a linear space) 256.A (of a ring) 368.A zero homomorphism (between two A-modules) 277.H zero matrix 269.B zero object 52.N zero-one law 342.0 Blumenthal 261.B Hewitt-Savage 342.0 Kolmogorov 342.0 zero point (of a holomorphic function) 198.C 2148 (of a polynomial) 337.B 369.C (of a subset of a polynomial ring) 369.C of the kth order (of a holomorphlc function) 198.C of the - kth order (of a complex function) 198.0 order of (of a homomorphic function) 198.C zero-point theorem Hilbert 369.0 Ruckert 23.B zero representation (of an algebra) 362.C zero ring 368.A zero-section (of a block bundle) 147 Q zero-sum (game) 173.A zero-sum two-person game 108.B zeroth law of thermodynamics 419.A zero vector 442.A zeta function(s) 450 (associated with a prehomogeneous space) 450.V (of a homeomorphism of a compact metric space) 126.K of an algebraic function field 450.P of an algebraic variety over a finite field 450.Q congruence 450.P Oedekind 14.C 450.0 defined by Hecke operators 450.M Epstein 450.K Hasse 450 S Hey 27.F Hurwitz 450 B Ihara 450.U Riemann 450.B of a scheme 450.R Selberg 450 T Siegel, of indefinite quadratic forms 450.K of a simple algebra 27.F Tamagawa 450.L Weierstrass 134.F, App. A, Table 16.IV ZFC 33.B Zilber theorem, Eilenberg- 201.J zonal harmonics 393.0 zonal polynomial 374.C zonal spherical function (on a homogeneous space) 437.Y Zorn lemma 34.C Zygmund, smooth in the sense of 168.B Zygmund class 159.E Zygmund singular integral operator, Calderon- 251.0 Zygmund type, kernal of Calderon 217.1